RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy5721
         (290 letters)



>gnl|CDD|238665 cd01369, KISc_KHC_KIF5, Kinesin motor domain, kinesin heavy chain
           (KHC) or KIF5-like subgroup. Members of this group have
           been associated with organelle transport. This catalytic
           (head) domain has ATPase activity and belongs to the
           larger group of P-loop NTPases. Kinesins are
           microtubule-dependent molecular motors that play
           important roles in intracellular transport and in cell
           division. In most kinesins, the motor domain is found at
           the N-terminus (N-type). N-type kinesins are (+)
           end-directed motors, i.e. they transport cargo towards
           the (+) end of the microtubule. Kinesin motor domains
           hydrolyze ATP at a rate of about 80 per second, and move
           along the microtubule at a speed of about 6400 Angstroms
           per second. To achieve that, kinesin head groups work in
           pairs. Upon replacing ADP with ATP, a kinesin motor
           domain increases its affinity for microtubule binding
           and locks in place. Also, the neck linker binds to the
           motor domain, which repositions the other head domain
           through the coiled-coil domain close to a second tubulin
           dimer, about 80 Angstroms along the microtubule.
           Meanwhile, ATP hydrolysis takes place, and when the
           second head domain binds to the microtubule, the first
           domain again replaces ADP with ATP, triggering a
           conformational change that pulls the first domain
           forward.
          Length = 325

 Score =  150 bits (380), Expect = 3e-43
 Identities = 54/67 (80%), Positives = 60/67 (89%)

Query: 8   ALRNVISALADGNKTHIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESETKSTLDF 67
           AL NVI+AL DG  THIPYRDSKLTRILQ+SLGGN+RTT+IICCSP+S+NESET STL F
Sbjct: 259 ALGNVINALTDGKSTHIPYRDSKLTRILQDSLGGNSRTTLIICCSPSSYNESETLSTLRF 318

Query: 68  GRRAKTI 74
           G RAKTI
Sbjct: 319 GARAKTI 325


>gnl|CDD|214526 smart00129, KISc, Kinesin motor, catalytic domain. ATPase.
           Microtubule-dependent molecular motors that play
           important roles in intracellular transport of organelles
           and in cell division.
          Length = 335

 Score =  123 bits (311), Expect = 4e-33
 Identities = 47/75 (62%), Positives = 55/75 (73%), Gaps = 1/75 (1%)

Query: 8   ALRNVISALADGNK-THIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESETKSTLD 66
           AL NVI+ALA  +K  HIPYRDSKLTR+LQ+SLGGN++T +I   SP+S N  ET STL 
Sbjct: 261 ALGNVINALAQHSKSRHIPYRDSKLTRLLQDSLGGNSKTLMIANVSPSSSNLEETLSTLR 320

Query: 67  FGRRAKTIKNVVTVN 81
           F  RAK IKN   VN
Sbjct: 321 FASRAKEIKNKPIVN 335


>gnl|CDD|215803 pfam00225, Kinesin, Kinesin motor domain. 
          Length = 326

 Score =  122 bits (309), Expect = 7e-33
 Identities = 41/67 (61%), Positives = 50/67 (74%)

Query: 8   ALRNVISALADGNKTHIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESETKSTLDF 67
           AL NVI+ALA+   +HIPYRDSKLTR+LQ+SLGGN++T +I   SP+  N  ET STL F
Sbjct: 260 ALGNVINALAENQSSHIPYRDSKLTRLLQDSLGGNSKTLMIANISPSDSNYEETLSTLRF 319

Query: 68  GRRAKTI 74
             RAK I
Sbjct: 320 ASRAKNI 326


>gnl|CDD|238054 cd00106, KISc, Kinesin motor domain. This catalytic (head) domain
           has ATPase activity and belongs to the larger group of
           P-loop NTPases. Kinesins are microtubule-dependent
           molecular motors that play important roles in
           intracellular transport and in cell division. In most
           kinesins, the motor domain is found at the N-terminus
           (N-type), in some its is found in the middle (M-type),
           or C-terminal (C-type). N-type and M-type kinesins are
           (+) end-directed motors, while C-type kinesins are (-)
           end-directed motors, i.e. they transport cargo towards
           the (-) end of the microtubule. Kinesin motor domains
           hydrolyze ATP at a rate of about 80 per second, and move
           along the microtubule at a speed of about 6400 Angstroms
           per second. To achieve that, kinesin head groups work in
           pairs. Upon replacing ADP with ATP, a kinesin motor
           domain increases its affinity for microtubule binding
           and locks in place. Also, the neck linker binds to the
           motor domain, which repositions the other head domain
           through the coiled-coil domain close to a second tubulin
           dimer, about 80 Angstroms along the microtubule.
           Meanwhile, ATP hydrolysis takes place, and when the
           second head domain binds to the microtubule, the first
           domain again replaces ADP with ATP, triggering a
           conformational change that pulls the first domain
           forward.
          Length = 328

 Score =  113 bits (285), Expect = 2e-29
 Identities = 43/66 (65%), Positives = 50/66 (75%), Gaps = 1/66 (1%)

Query: 8   ALRNVISALADGNK-THIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESETKSTLD 66
           AL NVISAL+ G K  HIPYRDSKLTR+LQ+SLGGN++T +I   SP+S N  ET STL 
Sbjct: 263 ALGNVISALSSGQKKKHIPYRDSKLTRLLQDSLGGNSKTLMIANISPSSENYDETLSTLR 322

Query: 67  FGRRAK 72
           F  RAK
Sbjct: 323 FASRAK 328


>gnl|CDD|238668 cd01372, KISc_KIF4, Kinesin motor domain, KIF4-like subfamily.
           Members of this group seem to perform a variety of
           functions, and have been implicated in neuronal
           organelle transport and chromosome segregation during
           mitosis. This catalytic (head) domain has ATPase
           activity and belongs to the larger group of P-loop
           NTPases. Kinesins are microtubule-dependent molecular
           motors that play important roles in intracellular
           transport and in cell division. In most kinesins, the
           motor domain is found at the N-terminus (N-type). N-type
           kinesins are (+) end-directed motors, i.e. they
           transport cargo towards the (+) end of the microtubule.
           Kinesin motor domains hydrolyze ATP at a rate of about
           80 per second, and move along the microtubule at a speed
           of about 6400 Angstroms per second. To achieve that,
           kinesin head groups work in pairs. Upon replacing ADP
           with ATP, a kinesin motor domain increases its affinity
           for microtubule binding and locks in place. Also, the
           neck linker binds to the motor domain, which repositions
           the other head domain through the coiled-coil domain
           close to a second tubulin dimer, about 80 Angstroms
           along the microtubule. Meanwhile, ATP hydrolysis takes
           place, and when the second head domain binds to the
           microtubule, the first domain again replaces ADP with
           ATP, triggering a conformational change that pulls the
           first domain forward.
          Length = 341

 Score =  111 bits (279), Expect = 2e-28
 Identities = 42/70 (60%), Positives = 52/70 (74%), Gaps = 2/70 (2%)

Query: 8   ALRNVISALADGNK--THIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESETKSTL 65
           AL NVISAL D +K  +H+PYRDSKLTR+LQ+SLGGN+ T +I C SPA  N  ET +TL
Sbjct: 272 ALGNVISALGDESKKGSHVPYRDSKLTRLLQDSLGGNSHTLMIACVSPADSNFEETLNTL 331

Query: 66  DFGRRAKTIK 75
            +  RA+ IK
Sbjct: 332 KYANRARNIK 341


>gnl|CDD|238667 cd01371, KISc_KIF3, Kinesin motor domain, kinesins II or KIF3_like
           proteins. Subgroup of kinesins, which form heterotrimers
           composed of 2 kinesins and one non-motor accessory
           subunit. Kinesins II play important roles in ciliary
           transport, and have been implicated in neuronal
           transport, melanosome transport, the secretory pathway,
           and mitosis. This catalytic (head) domain has ATPase
           activity and belongs to the larger group of P-loop
           NTPases. Kinesins are microtubule-dependent molecular
           motors that play important roles in intracellular
           transport and in cell division. In this group the motor
           domain is found at the N-terminus (N-type). N-type
           kinesins are (+) end-directed motors, i.e. they
           transport cargo towards the (+) end of the microtubule.
           Kinesin motor domains hydrolyze ATP at a rate of about
           80 per second, and move along the microtubule at a speed
           of about 6400 Angstroms per second. To achieve that,
           kinesin head groups work in pairs. Upon replacing ADP
           with ATP, a kinesin motor domain increases its affinity
           for microtubule binding and locks in place. Also, the
           neck linker binds to the motor domain, which repositions
           the other head domain through the coiled-coil domain
           close to a second tubulin dimer, about 80 Angstroms
           along the microtubule. Meanwhile, ATP hydrolysis takes
           place, and when the second head domain binds to the
           microtubule, the first domain again replaces ADP with
           ATP, triggering a conformational change that pulls the
           first domain forward.
          Length = 333

 Score =  107 bits (268), Expect = 5e-27
 Identities = 42/67 (62%), Positives = 49/67 (73%)

Query: 8   ALRNVISALADGNKTHIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESETKSTLDF 67
           AL NVISAL DG  THIPYRDSKLTR+LQ+SLGGN++T +     PA +N  ET STL +
Sbjct: 267 ALGNVISALVDGKSTHIPYRDSKLTRLLQDSLGGNSKTVMCANIGPADYNYDETLSTLRY 326

Query: 68  GRRAKTI 74
             RAK I
Sbjct: 327 ANRAKNI 333


>gnl|CDD|238660 cd01364, KISc_BimC_Eg5, Kinesin motor domain, BimC/Eg5 spindle pole
           proteins, participate in spindle assembly and chromosome
           segregation during cell division. This catalytic (head)
           domain has ATPase activity and belongs to the larger
           group of P-loop NTPases. Kinesins are
           microtubule-dependent molecular motors that play
           important roles in intracellular transport and in cell
           division. In most kinesins, the motor domain is found at
           the N-terminus (N-type), N-type kinesins are (+)
           end-directed motors, i.e. they transport cargo towards
           the (+) end of the microtubule. Kinesin motor domains
           hydrolyze ATP at a rate of about 80 per second, and move
           along the microtubule at a speed of about 6400 Angstroms
           per second. To achieve that, kinesin head groups work in
           pairs. Upon replacing ADP with ATP, a kinesin motor
           domain increases its affinity for microtubule binding
           and locks in place. Also, the neck linker binds to the
           motor domain, which repositions the other head domain
           through the coiled-coil domain close to a second tubulin
           dimer, about 80 Angstroms along the microtubule.
           Meanwhile, ATP hydrolysis takes place, and when the
           second head domain binds to the microtubule, the first
           domain again replaces ADP with ATP, triggering a
           conformational change that pulls the first domain
           forward.
          Length = 352

 Score = 98.1 bits (245), Expect = 1e-23
 Identities = 41/70 (58%), Positives = 50/70 (71%), Gaps = 1/70 (1%)

Query: 12  VISALADGNKTHIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESETKSTLDFGRRA 71
           VI+AL +    HIPYR+SKLTR+LQ+SLGG  +T+II   SPAS N  ET STL++  RA
Sbjct: 282 VINALVE-KSPHIPYRESKLTRLLQDSLGGRTKTSIIATISPASINLEETLSTLEYAHRA 340

Query: 72  KTIKNVVTVN 81
           K IKN   VN
Sbjct: 341 KNIKNKPEVN 350


>gnl|CDD|238661 cd01365, KISc_KIF1A_KIF1B, Kinesin motor domain, KIF1_like
           proteins. KIF1A (Unc104) transports synaptic vesicles to
           the nerve  terminal, KIF1B has been implicated in
           transport of mitochondria. Both proteins are expressed
           in neurons. This catalytic (head) domain has ATPase
           activity and belongs to the larger group of P-loop
           NTPases. Kinesins are microtubule-dependent molecular
           motors that play important roles in intracellular
           transport and in cell division. In most kinesins, the
           motor domain is found at the N-terminus (N-type). N-type
           kinesins are (+) end-directed motors, i.e. they
           transport cargo towards the (+) end of the microtubule.
           In contrast to the majority of dimeric kinesins, most
           KIF1A/Unc104 kinesins are monomeric motors. A
           lysine-rich loop in KIF1A binds to the negatively
           charged C-terminus of tubulin and compensates for the
           lack of a second motor domain, allowing KIF1A to move
           processively.
          Length = 356

 Score = 97.8 bits (244), Expect = 2e-23
 Identities = 41/80 (51%), Positives = 49/80 (61%), Gaps = 7/80 (8%)

Query: 9   LRNVISALAD-------GNKTHIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESET 61
           L  VISALAD          + IPYRDS LT +L+E+LGGN++T +I   SPA  N  ET
Sbjct: 277 LGKVISALADNSSAKSKKKSSFIPYRDSVLTWLLKENLGGNSKTAMIATISPADINYEET 336

Query: 62  KSTLDFGRRAKTIKNVVTVN 81
            STL +  RAK I NV  VN
Sbjct: 337 LSTLRYADRAKKIVNVAVVN 356


>gnl|CDD|238670 cd01374, KISc_CENP_E, Kinesin motor domain, CENP-E/KIP2-like
           subgroup, involved in chromosome movement and/or spindle
           elongation during mitosis. This catalytic (head) domain
           has ATPase activity and belongs to the larger group of
           P-loop NTPases. Kinesins are microtubule-dependent
           molecular motors that play important roles in
           intracellular transport and in cell division. In most
           kinesins, the motor domain is found at the N-terminus
           (N-type). N-type kinesins are (+) end-directed motors,
           i.e. they transport cargo towards the (+) end of the
           microtubule. Kinesin motor domains hydrolyze ATP at a
           rate of about 80 per second, and move along the
           microtubule at a speed of about 6400 Angstroms per
           second. To achieve that, kinesin head groups work in
           pairs. Upon replacing ADP with ATP, a kinesin motor
           domain increases its affinity for microtubule binding
           and locks in place. Also, the neck linker binds to the
           motor domain, which repositions the other head domain
           through the coiled-coil domain close to a second tubulin
           dimer, about 80 Angstroms along the microtubule.
           Meanwhile, ATP hydrolysis takes place, and when the
           second head domain binds to the microtubule, the first
           domain again replaces ADP with ATP, triggering a
           conformational change that pulls the first domain
           forward.
          Length = 321

 Score = 94.3 bits (235), Expect = 2e-22
 Identities = 42/68 (61%), Positives = 47/68 (69%), Gaps = 1/68 (1%)

Query: 8   ALRNVISALADG-NKTHIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESETKSTLD 66
            L  VIS L++G N  HIPYRDSKLTRILQ SL GNART II   SPAS +  ET +TL 
Sbjct: 254 TLGTVISKLSEGKNSGHIPYRDSKLTRILQPSLSGNARTAIICTISPASSHVEETLNTLK 313

Query: 67  FGRRAKTI 74
           F  RAK +
Sbjct: 314 FASRAKKV 321


>gnl|CDD|238666 cd01370, KISc_KIP3_like, Kinesin motor domain, KIP3-like subgroup.
           The yeast kinesin KIP3 plays a role in positioning the
           mitotic spindle. This catalytic (head) domain has ATPase
           activity and belongs to the larger group of P-loop
           NTPases. Kinesins are microtubule-dependent molecular
           motors that play important roles in intracellular
           transport and in cell division. In most kinesins, the
           motor domain is found at the N-terminus (N-type). N-type
           kinesins are (+) end-directed motors, i.e. they
           transport cargo towards the (+) end of the microtubule.
           Kinesin motor domains hydrolyze ATP at a rate of about
           80 per second, and move along the microtubule at a speed
           of about 6400 Angstroms per second. To achieve that,
           kinesin head groups work in pairs. Upon replacing ADP
           with ATP, a kinesin motor domain increases its affinity
           for microtubule binding and locks in place. Also, the
           neck linker binds to the motor domain, which repositions
           the other head domain through the coiled-coil domain
           close to a second tubulin dimer, about 80 Angstroms
           along the microtubule. Meanwhile, ATP hydrolysis takes
           place, and when the second head domain binds to the
           microtubule, the first domain again replaces ADP with
           ATP, triggering a conformational change that pulls the
           first domain forward.
          Length = 338

 Score = 93.9 bits (234), Expect = 4e-22
 Identities = 39/69 (56%), Positives = 49/69 (71%), Gaps = 2/69 (2%)

Query: 8   ALRNVISALADGNK--THIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESETKSTL 65
           AL N I+AL DG K   HIPYRDSKLTR+L++SLGGN +T +I   SP+S +  ET +TL
Sbjct: 270 ALGNCINALVDGKKKNKHIPYRDSKLTRLLKDSLGGNCKTVMIANISPSSSHYEETHNTL 329

Query: 66  DFGRRAKTI 74
            +  RAK I
Sbjct: 330 KYANRAKNI 338


>gnl|CDD|227392 COG5059, KIP1, Kinesin-like protein [Cytoskeleton].
          Length = 568

 Score = 94.4 bits (235), Expect = 1e-21
 Identities = 65/254 (25%), Positives = 104/254 (40%), Gaps = 33/254 (12%)

Query: 8   ALRNVISAL-ADGNKTHIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESETKSTLD 66
            L NVI+AL       HIPYR+SKLTR+LQ+SLGGN  T +I   SP+S +  ET +TL 
Sbjct: 269 TLGNVINALGDKKKSGHIPYRESKLTRLLQDSLGGNCNTRVICTISPSSNSFEETINTLK 328

Query: 67  FGRRAKTIKNVVTVNE----ELTAEEWKRRYEKEKEKSSRYKGKLEKLELELSR---WRN 119
           F  RAK+IKN + VN         EE K    +++ +      + +    + S    +  
Sbjct: 329 FASRAKSIKNKIQVNSSSDSSREIEEIKFDLSEDRSEIEILVFREQSQLSQSSLSGIFAY 388

Query: 120 GETVKEDEQVNLADPVDMAASIAPTPESAPASILPAIPGSGLMLGSLSNEERQKLEEERE 179
            +++K+ E   L   +D+                         + S + E ++ L+EE  
Sbjct: 389 MQSLKK-ETETLKSRIDLIMK---------------------SIISGTFERKKLLKEEGW 426

Query: 180 RLYQQLD-EKDEEINQQSQYAEKLKEQIMEQEEL--ITTARRDYEQLQGEMTRLTQENES 236
           +    L   + E         E+L ++  +  +L  +            E T    E+E 
Sbjct: 427 KYKSTLQFLRIEIDRLLLLREEELSKKKTKIHKLNKLRHDLSSLLSSIPEETSDRVESEK 486

Query: 237 AKEEVKEVLQALEE 250
           A +        L  
Sbjct: 487 ASKLRSSASTKLNL 500


>gnl|CDD|238662 cd01366, KISc_C_terminal, Kinesin motor domain,
           KIFC2/KIFC3/ncd-like carboxy-terminal kinesins. Ncd is a
           spindle motor protein necessary for chromosome
           segregation in meiosis. KIFC2/KIFC3-like kinesins have
           been implicated in motility of the Golgi apparatus as
           well as dentritic and axonal transport in neurons. This
           catalytic (head) domain has ATPase activity and belongs
           to the larger group of P-loop NTPases. Kinesins are
           microtubule-dependent molecular motors that play
           important roles in intracellular transport and in cell
           division. In this subgroup the motor domain is found at
           the C-terminus (C-type). C-type kinesins are (-)
           end-directed motors, i.e. they transport cargo towards
           the (-) end of the microtubule. Kinesin motor domains
           hydrolyze ATP at a rate of about 80 per second, and move
           along the microtubule at a speed of about 6400 Angstroms
           per second. To achieve that, kinesin head groups work in
           pairs. Upon replacing ADP with ATP, a kinesin motor
           domain increases its affinity for microtubule binding
           and locks in place. Also, the neck linker binds to the
           motor domain, which repositions the other head domain
           through the coiled-coil domain close to a second tubulin
           dimer, about 80 Angstroms along the microtubule.
           Meanwhile, ATP hydrolysis takes place, and when the
           second head domain binds to the microtubule, the first
           domain again replaces ADP with ATP, triggering a
           conformational change that pulls the first domain
           forward.
          Length = 329

 Score = 84.5 bits (210), Expect = 7e-19
 Identities = 32/69 (46%), Positives = 47/69 (68%), Gaps = 1/69 (1%)

Query: 8   ALRNVISALADGNKTHIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESETKSTLDF 67
           AL +VISAL   +  H+PYR+SKLT +LQ+SLGGN++T + +  SP   N SET  +L F
Sbjct: 261 ALGDVISALRSKDS-HVPYRNSKLTYLLQDSLGGNSKTLMFVNISPLESNLSETLCSLRF 319

Query: 68  GRRAKTIKN 76
             R ++++ 
Sbjct: 320 ASRVRSVEL 328


>gnl|CDD|238669 cd01373, KISc_KLP2_like, Kinesin motor domain, KLP2-like subgroup.
           Members of this subgroup seem to play a role in mitosis
           and meiosis. This catalytic (head) domain has ATPase
           activity and belongs to the larger group of P-loop
           NTPases. Kinesins are microtubule-dependent molecular
           motors that play important roles in intracellular
           transport and in cell division. In most kinesins, the
           motor domain is found at the N-terminus (N-type). N-type
           kinesins are (+) end-directed motors, i.e. they
           transport cargo towards the (+) end of the microtubule.
           Kinesin motor domains hydrolyze ATP at a rate of about
           80 per second, and move along the microtubule at a speed
           of about 6400 Angstroms per second. To achieve that,
           kinesin head groups work in pairs. Upon replacing ADP
           with ATP, a kinesin motor domain increases its affinity
           for microtubule binding and locks in place. Also, the
           neck linker binds to the motor domain, which repositions
           the other head domain through the coiled-coil domain
           close to a second tubulin dimer, about 80 Angstroms
           along the microtubule. Meanwhile, ATP hydrolysis takes
           place, and when the second head domain binds to the
           microtubule, the first domain again replaces ADP with
           ATP, triggering a conformational change that pulls the
           first domain forward.
          Length = 337

 Score = 81.4 bits (201), Expect = 9e-18
 Identities = 40/69 (57%), Positives = 49/69 (71%), Gaps = 3/69 (4%)

Query: 9   LRNVISALAD---GNKTHIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESETKSTL 65
           L +VI AL D   G + H+PYRDSKLT +L++SLGGNA+TTII   SP+S    ET STL
Sbjct: 269 LGHVIMALVDVAHGKQRHVPYRDSKLTFLLRDSLGGNAKTTIIANVSPSSKCFGETLSTL 328

Query: 66  DFGRRAKTI 74
            F +RAK I
Sbjct: 329 KFAQRAKLI 337


>gnl|CDD|215621 PLN03188, PLN03188, kinesin-12 family protein; Provisional.
          Length = 1320

 Score = 80.0 bits (197), Expect = 1e-16
 Identities = 42/80 (52%), Positives = 55/80 (68%), Gaps = 4/80 (5%)

Query: 9   LRNVISALAD----GNKTHIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESETKST 64
           L N+I+ LA+    G + HIPYRDS+LT +LQESLGGNA+  ++   SP+   +SET ST
Sbjct: 365 LGNLINILAEISQTGKQRHIPYRDSRLTFLLQESLGGNAKLAMVCAISPSQSCKSETFST 424

Query: 65  LDFGRRAKTIKNVVTVNEEL 84
           L F +RAK IKN   VNE +
Sbjct: 425 LRFAQRAKAIKNKAVVNEVM 444


>gnl|CDD|238663 cd01367, KISc_KIF2_like, Kinesin motor domain, KIF2-like group.
           KIF2 is a protein expressed in neurons, which has been
           associated with axonal transport and neuron development;
           alternative splice forms have been implicated in
           lysosomal translocation. This catalytic (head) domain
           has ATPase activity and belongs to the larger group of
           P-loop NTPases. Kinesins are microtubule-dependent
           molecular motors that play important roles in
           intracellular transport and in cell division. In this
           subgroup the motor domain is found in the middle
           (M-type) of the protein chain. M-type kinesins are (+)
           end-directed motors, i.e. they transport cargo towards
           the (+) end of the microtubule. Kinesin motor domains
           hydrolyze ATP at a rate of about 80 per second, and move
           along the microtubule at a speed of about 6400 Angstroms
           per second (KIF2 may be slower). To achieve that,
           kinesin head groups work in pairs. Upon replacing ADP
           with ATP, a kinesin motor domain increases its affinity
           for microtubule binding and locks in place. Also, the
           neck linker binds to the motor domain, which repositions
           the other head domain through the coiled-coil domain
           close to a second tubulin dimer, about 80 Angstroms
           along the microtubule. Meanwhile, ATP hydrolysis takes
           place, and when the second head domain binds to the
           microtubule, the first domain again replaces ADP with
           ATP, triggering a conformational change that pulls the
           first domain forward.
          Length = 322

 Score = 71.5 bits (176), Expect = 2e-14
 Identities = 28/65 (43%), Positives = 43/65 (66%), Gaps = 1/65 (1%)

Query: 8   ALRNVISALADGNKTHIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESETKSTLDF 67
           AL+  I ALA  NK H+P+R SKLT++L++S  GN++T +I   SP++ +   T +TL +
Sbjct: 259 ALKECIRALAS-NKAHVPFRGSKLTQVLRDSFIGNSKTVMIATISPSASSCEHTLNTLRY 317

Query: 68  GRRAK 72
             R K
Sbjct: 318 ADRVK 322


>gnl|CDD|238659 cd01363, Motor_domain, Myosin and Kinesin motor domain. These
           ATPases belong to the P-loop NTPase family and provide
           the driving force in myosin and kinesin mediated
           processes.
          Length = 186

 Score = 68.3 bits (167), Expect = 7e-14
 Identities = 29/46 (63%), Positives = 39/46 (84%), Gaps = 1/46 (2%)

Query: 8   ALRNVISALADGNKTHIPYRDSKLTRILQESLGGNARTTIIICCSP 53
            L NVISALA+ + +H+PYR+SKLTR+LQ+SLGGN+RT ++ C SP
Sbjct: 142 TLGNVISALAERD-SHVPYRESKLTRLLQDSLGGNSRTLMVACISP 186


>gnl|CDD|238671 cd01375, KISc_KIF9_like, Kinesin motor domain, KIF9-like subgroup;
           might play a role in cell shape remodeling. This
           catalytic (head) domain has ATPase activity and belongs
           to the larger group of P-loop NTPases. Kinesins are
           microtubule-dependent molecular motors that play
           important roles in intracellular transport and in cell
           division. In most kinesins, the motor domain is found at
           the N-terminus (N-type). N-type kinesins are (+)
           end-directed motors, i.e. they transport cargo towards
           the (+) end of the microtubule. Kinesin motor domains
           hydrolyze ATP at a rate of about 80 per second, and move
           along the microtubule at a speed of about 6400 Angstroms
           per second. To achieve that, kinesin head groups work in
           pairs. Upon replacing ADP with ATP, a kinesin motor
           domain increases its affinity for microtubule binding
           and locks in place. Also, the neck linker binds to the
           motor domain, which repositions the other head domain
           through the coiled-coil domain close to a second tubulin
           dimer, about 80 Angstroms along the microtubule.
           Meanwhile, ATP hydrolysis takes place, and when the
           second head domain binds to the microtubule, the first
           domain again replaces ADP with ATP, triggering a
           conformational change that pulls the first domain
           forward.
          Length = 334

 Score = 69.8 bits (171), Expect = 1e-13
 Identities = 29/64 (45%), Positives = 42/64 (65%)

Query: 9   LRNVISALADGNKTHIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESETKSTLDFG 68
           L  VI+AL++  +TH+PYR+SKLT +L++SLGGN +T ++        N  ET STL F 
Sbjct: 271 LEQVINALSEKARTHVPYRNSKLTHVLRDSLGGNCKTVMLATIWVEPSNLDETLSTLRFA 330

Query: 69  RRAK 72
           +R  
Sbjct: 331 QRVA 334


>gnl|CDD|238672 cd01376, KISc_KID_like, Kinesin motor domain, KIF22/Kid-like
           subgroup. Members of this group might play a role in
           regulating chromosomal movement along microtubules in
           mitosis. This catalytic (head) domain has ATPase
           activity and belongs to the larger group of P-loop
           NTPases. Kinesins are microtubule-dependent molecular
           motors that play important roles in intracellular
           transport and in cell division. In most kinesins, the
           motor domain is found at the N-terminus (N-type). N-type
           kinesins are (+) end-directed motors, i.e. they
           transport cargo towards the (+) end of the microtubule.
           Kinesin motor domains hydrolyze ATP at a rate of about
           80 per second, and move along the microtubule at a speed
           of about 6400 Angstroms per second. To achieve that,
           kinesin head groups work in pairs. Upon replacing ADP
           with ATP, a kinesin motor domain increases its affinity
           for microtubule binding and locks in place. Also, the
           neck linker binds to the motor domain, which repositions
           the other head domain through the coiled-coil domain
           close to a second tubulin dimer, about 80 Angstroms
           along the microtubule. Meanwhile, ATP hydrolysis takes
           place, and when the second head domain binds to the
           microtubule, the first domain again replaces ADP with
           ATP, triggering a conformational change that pulls the
           first domain forward.
          Length = 319

 Score = 66.7 bits (163), Expect = 1e-12
 Identities = 29/65 (44%), Positives = 40/65 (61%), Gaps = 1/65 (1%)

Query: 8   ALRNVISALADGNKTHIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESETKSTLDF 67
            L  V+ AL  G    IPYR+SKLTR+LQ+SLGG +R  ++   +P      +T STL+F
Sbjct: 256 VLSKVVDALNKGLP-RIPYRESKLTRLLQDSLGGGSRCIMVANIAPERSFYQDTLSTLNF 314

Query: 68  GRRAK 72
             R+K
Sbjct: 315 ASRSK 319


>gnl|CDD|238664 cd01368, KISc_KIF23_like, Kinesin motor domain, KIF23-like
           subgroup. Members of this group may play a role in
           mitosis. This catalytic (head) domain has ATPase
           activity and belongs to the larger group of P-loop
           NTPases. Kinesins are microtubule-dependent molecular
           motors that play important roles in intracellular
           transport and in cell division. In most kinesins, the
           motor domain is found at the N-terminus (N-type). N-type
           kinesins are (+) end-directed motors, i.e. they
           transport cargo towards the (+) end of the microtubule.
           Kinesin motor domains hydrolyze ATP at a rate of about
           80 per second, and move along the microtubule at a speed
           of about 6400 Angstroms per second. To achieve that,
           kinesin head groups work in pairs. Upon replacing ADP
           with ATP, a kinesin motor domain increases its affinity
           for microtubule binding and locks in place. Also, the
           neck linker binds to the motor domain, which repositions
           the other head domain through the coiled-coil domain
           close to a second tubulin dimer, about 80 Angstroms
           along the microtubule. Meanwhile, ATP hydrolysis takes
           place, and when the second head domain binds to the
           microtubule, the first domain again replaces ADP with
           ATP, triggering a conformational change that pulls the
           first domain forward.
          Length = 345

 Score = 48.5 bits (116), Expect = 2e-06
 Identities = 19/65 (29%), Positives = 31/65 (47%), Gaps = 2/65 (3%)

Query: 8   ALRNVISALADGNKTHIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESETKSTLDF 67
            LR   + L+      +PYRDSKLT + Q    G  +  +I+  +P + +  ET   + F
Sbjct: 283 VLRE--NQLSGSTNKMVPYRDSKLTHLFQNYFDGEGKARMIVNVNPCASDYDETLHVMKF 340

Query: 68  GRRAK 72
              A+
Sbjct: 341 SAIAQ 345


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 45.1 bits (107), Expect = 3e-05
 Identities = 24/82 (29%), Positives = 46/82 (56%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
           EE ++LE E   L ++L+E + E+ +  +  E+LKE+I   +E +       E+L+  + 
Sbjct: 295 EEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLA 354

Query: 229 RLTQENESAKEEVKEVLQALEE 250
            L +  E  +E++  +L+ LEE
Sbjct: 355 ELEEAKEELEEKLSALLEELEE 376



 Score = 42.8 bits (101), Expect = 2e-04
 Identities = 41/180 (22%), Positives = 78/180 (43%), Gaps = 11/180 (6%)

Query: 81  NEELTAEEWKRRYEKEK-----EKSSRYKGKLEKLELELSRWRNGETVKEDEQVNLADPV 135
            EE+   E KR+  +E+     E+    + +L+ LE EL          E E   L + +
Sbjct: 778 KEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEI 837

Query: 136 DMAASIAPTPESAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEINQQ 195
           +         E     +   +      L  L   E+++LE+E + L ++ +E +EE+ + 
Sbjct: 838 EELEEKLDELEEELEELEKELEELKEELEEL-EAEKEELEDELKELEEEKEELEEELREL 896

Query: 196 SQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENES-----AKEEVKEVLQALEE 250
                +LKE+I +  E +       E+L+ E+  L +E E       + E++  ++ LEE
Sbjct: 897 ESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLEE 956



 Score = 41.6 bits (98), Expect = 3e-04
 Identities = 24/88 (27%), Positives = 43/88 (48%)

Query: 170 ERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTR 229
           E ++LEEE   L  QL++ +EE+         L++ + E    +    R  E+L+ E+  
Sbjct: 668 ELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAA 727

Query: 230 LTQENESAKEEVKEVLQALEEPTRTLLN 257
           L +E E  +  ++E+ + LEE    L  
Sbjct: 728 LEEELEQLQSRLEELEEELEELEEELEE 755



 Score = 41.2 bits (97), Expect = 5e-04
 Identities = 28/82 (34%), Positives = 53/82 (64%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
           EE  +LEEE E L ++L+E ++EI +     E+L+E++ E +E +   + + E+L+GE++
Sbjct: 246 EELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEIS 305

Query: 229 RLTQENESAKEEVKEVLQALEE 250
            L +  E  + E++E+ + LEE
Sbjct: 306 LLRERLEELENELEELEERLEE 327



 Score = 41.2 bits (97), Expect = 6e-04
 Identities = 26/86 (30%), Positives = 51/86 (59%)

Query: 165 SLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQ 224
                E   LEEE E+L  +L+E +EE+ +  +  E+L+E++ E EE + +      +L+
Sbjct: 719 EELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLK 778

Query: 225 GEMTRLTQENESAKEEVKEVLQALEE 250
            E+  L ++ ++ +EE++E+ + LEE
Sbjct: 779 EEIEELEEKRQALQEELEELEEELEE 804



 Score = 40.1 bits (94), Expect = 0.001
 Identities = 37/164 (22%), Positives = 73/164 (44%), Gaps = 15/164 (9%)

Query: 87  EEWKRRYEKEKEKSSRYKGKLEKLELELSRWRNGETVKEDEQVNLADPVDMAASIAPTPE 146
           EE +R+ E+ + +    K +L  LE EL + ++     E+E   L + ++         E
Sbjct: 705 EELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELE 764

Query: 147 SAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQI 206
               S+                E   KL+EE E L ++     EE+ +  +  E+ + ++
Sbjct: 765 EELESL---------------EEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRL 809

Query: 207 MEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
              E  + +  +  E+L+ E+  L +E E  +E++ E+ + LEE
Sbjct: 810 DALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEE 853



 Score = 37.4 bits (87), Expect = 0.008
 Identities = 21/83 (25%), Positives = 42/83 (50%)

Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
             E + LEE  ERL ++L++  EE+ +     E+L+ ++ E  E +       E+L+  +
Sbjct: 403 KREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRL 462

Query: 228 TRLTQENESAKEEVKEVLQALEE 250
             L +E    +EE++ + + L  
Sbjct: 463 KELERELAELQEELQRLEKELSS 485



 Score = 35.8 bits (83), Expect = 0.026
 Identities = 24/83 (28%), Positives = 46/83 (55%)

Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
             + +KLEEE + L  +L   ++ + +  +  E+L+ Q+ E +  +     + EQLQ  +
Sbjct: 680 EAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRL 739

Query: 228 TRLTQENESAKEEVKEVLQALEE 250
             L +E E  +EE++E+ + LEE
Sbjct: 740 EELEEELEELEEELEELQERLEE 762



 Score = 35.5 bits (82), Expect = 0.031
 Identities = 28/82 (34%), Positives = 48/82 (58%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
           EE ++ E+E E L  +L+E  EE+ +  +   +LKE+I E E  I+  R   E+L+ E+ 
Sbjct: 260 EELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELE 319

Query: 229 RLTQENESAKEEVKEVLQALEE 250
            L +  E  KE+++ + + LEE
Sbjct: 320 ELEERLEELKEKIEALKEELEE 341



 Score = 35.5 bits (82), Expect = 0.038
 Identities = 23/88 (26%), Positives = 46/88 (52%)

Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
            EE ++L+EE   L ++++E + EI+   +  E+L+ ++ E EE +   +   E L+ E+
Sbjct: 280 REELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEEL 339

Query: 228 TRLTQENESAKEEVKEVLQALEEPTRTL 255
                  E  ++ + E+ +A EE    L
Sbjct: 340 EERETLLEELEQLLAELEEAKEELEEKL 367



 Score = 34.3 bits (79), Expect = 0.074
 Identities = 24/82 (29%), Positives = 44/82 (53%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
           E  ++LE E E L ++L+E  E+I    +  E+ +  + E E+L+       E+L+ +++
Sbjct: 309 ERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLS 368

Query: 229 RLTQENESAKEEVKEVLQALEE 250
            L +E E   E ++E L  LE 
Sbjct: 369 ALLEELEELFEALREELAELEA 390



 Score = 34.3 bits (79), Expect = 0.086
 Identities = 22/93 (23%), Positives = 46/93 (49%), Gaps = 11/93 (11%)

Query: 169 EERQKLEEERERLYQQLDEKD---EEINQQSQYAEKLKEQIME----QEEL----ITTAR 217
           E +++ E + ER  + L+  +   EE+ +Q +  E+  E+       + EL    +    
Sbjct: 172 ERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALLL 231

Query: 218 RDYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
              ++L+ E+  L +E    +EE++E+ + LEE
Sbjct: 232 AKLKELRKELEELEEELSRLEEELEELQEELEE 264



 Score = 33.1 bits (76), Expect = 0.17
 Identities = 24/93 (25%), Positives = 49/93 (52%), Gaps = 12/93 (12%)

Query: 169 EERQKLEEERERLY------------QQLDEKDEEINQQSQYAEKLKEQIMEQEELITTA 216
           E  Q+L+ E   L             ++L+E +EE+++  +  E+L+E++ E E+ I   
Sbjct: 213 ERYQELKAELRELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEEL 272

Query: 217 RRDYEQLQGEMTRLTQENESAKEEVKEVLQALE 249
           + + E+L+ E+  L +E    KEE++E+   + 
Sbjct: 273 KSELEELREELEELQEELLELKEEIEELEGEIS 305



 Score = 32.4 bits (74), Expect = 0.33
 Identities = 44/192 (22%), Positives = 78/192 (40%), Gaps = 12/192 (6%)

Query: 69  RRAKTIKNVVTVNEELTAEEWKRRYEKEKEKSSRYKGKLEKLELELSRWR-------NGE 121
              +  K +  +  EL  EE +   E+ +E+    K ++E+LE E+S  R       N  
Sbjct: 261 ELEEAEKEIEELKSEL--EELREELEELQEELLELKEEIEELEGEISLLRERLEELENEL 318

Query: 122 TVKEDEQVNLADPVDMAASIAPTPESAPASILPAIPGSGLMLGSLSNEERQKLEEERER- 180
              E+    L + ++         E+    +   +         L  +    LEE  E  
Sbjct: 319 EELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELF 378

Query: 181 --LYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAK 238
             L ++L E + E+ +     E+LK +I   EE +       E L+ E+  L  E E  +
Sbjct: 379 EALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQ 438

Query: 239 EEVKEVLQALEE 250
            E++E+ + LEE
Sbjct: 439 TELEELNEELEE 450



 Score = 30.5 bits (69), Expect = 1.2
 Identities = 25/94 (26%), Positives = 48/94 (51%), Gaps = 2/94 (2%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQ-SQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
             +++L EERE L ++L++   E+ +   +  EKL   + E EEL    R +  +L+ E+
Sbjct: 334 ALKEEL-EERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAEL 392

Query: 228 TRLTQENESAKEEVKEVLQALEEPTRTLLNFFRQ 261
             +  E E  K E++ + + LE  +  L +   +
Sbjct: 393 AEIRNELEELKREIESLEERLERLSERLEDLKEE 426



 Score = 30.5 bits (69), Expect = 1.2
 Identities = 37/165 (22%), Positives = 66/165 (40%), Gaps = 18/165 (10%)

Query: 86  AEEWKRRYEKEKEKSSRYKGKLEKLELELSRWRNGETVKEDEQVNLADPVDMAASIAPTP 145
            EE +   E+ +E+    K K+E L+ EL          E     L +  +         
Sbjct: 311 LEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSAL 370

Query: 146 ESAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQ 205
                 +  A+            EE  +LE E   +  +L+E   EI    +  E+L E+
Sbjct: 371 LEELEELFEAL-----------REELAELEAELAEIRNELEELKREIESLEERLERLSER 419

Query: 206 IMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
           + + +E +       ++L+ E+  L  E E   EE++E+ + LEE
Sbjct: 420 LEDLKEEL-------KELEAELEELQTELEELNEELEELEEQLEE 457



 Score = 29.7 bits (67), Expect = 2.2
 Identities = 25/87 (28%), Positives = 49/87 (56%), Gaps = 4/87 (4%)

Query: 168 NEERQKLEEERERL--YQQLDEKDEEINQQSQYAE--KLKEQIMEQEELITTARRDYEQL 223
            ++ +KLE + E+   YQ+L  +  E+      A+  +L++++ E EE ++    + E+L
Sbjct: 199 EKQLEKLERQAEKAERYQELKAELRELELALLLAKLKELRKELEELEEELSRLEEELEEL 258

Query: 224 QGEMTRLTQENESAKEEVKEVLQALEE 250
           Q E+    +E E  K E++E+ + LEE
Sbjct: 259 QEELEEAEKEIEELKSELEELREELEE 285


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 40.4 bits (95), Expect = 0.001
 Identities = 23/101 (22%), Positives = 46/101 (45%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
            E ++LEE   +L ++L E + EI +  +  E+ +E++ E E  I       EQL+ E+ 
Sbjct: 740 AEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELK 799

Query: 229 RLTQENESAKEEVKEVLQALEEPTRTLLNFFRQSSPKGTLL 269
            L +  +  + E+  + +        L +  R+ +     L
Sbjct: 800 ALREALDELRAELTLLNEEAANLRERLESLERRIAATERRL 840



 Score = 40.0 bits (94), Expect = 0.001
 Identities = 24/95 (25%), Positives = 50/95 (52%)

Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
            EE ++L+EE +   ++L+E   E+ +  +  E+L+ ++ E EE I   +++   L  E+
Sbjct: 238 REELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEI 297

Query: 228 TRLTQENESAKEEVKEVLQALEEPTRTLLNFFRQS 262
           +RL Q+ +  +E +  + + LEE    L     + 
Sbjct: 298 SRLEQQKQILRERLANLERQLEELEAQLEELESKL 332



 Score = 38.9 bits (91), Expect = 0.002
 Identities = 21/86 (24%), Positives = 49/86 (56%)

Query: 165 SLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQ 224
           S   ++  +LE E E+L +++ +  +E+ +     E+L+E++ E EE +  A  + E+L+
Sbjct: 729 SALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELE 788

Query: 225 GEMTRLTQENESAKEEVKEVLQALEE 250
            ++ +L +E ++ +E + E+   L  
Sbjct: 789 AQIEQLKEELKALREALDELRAELTL 814



 Score = 38.9 bits (91), Expect = 0.003
 Identities = 22/88 (25%), Positives = 39/88 (44%)

Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
             E ++LEE  E   ++L E + EI +     E+LKE++    E +   R +   L  E 
Sbjct: 760 EAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEA 819

Query: 228 TRLTQENESAKEEVKEVLQALEEPTRTL 255
             L +  ES +  +    + LE+    +
Sbjct: 820 ANLRERLESLERRIAATERRLEDLEEQI 847



 Score = 37.7 bits (88), Expect = 0.006
 Identities = 19/83 (22%), Positives = 43/83 (51%)

Query: 167 SNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGE 226
              + ++LE + E L  +LDE  EE+ +  +  E+LKE++   E  +     + E+L+  
Sbjct: 314 LERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESR 373

Query: 227 MTRLTQENESAKEEVKEVLQALE 249
           +  L ++ E+ + +V ++   + 
Sbjct: 374 LEELEEQLETLRSKVAQLELQIA 396



 Score = 37.7 bits (88), Expect = 0.007
 Identities = 35/175 (20%), Positives = 64/175 (36%), Gaps = 8/175 (4%)

Query: 81  NEELTAEEWKRRYEKEKEKSSRYKGKLEKLELELSRWRNGETVKEDEQVNLADPVDMAAS 140
             E   E+ + R  +  ++ +  + ++E+LE  L          E E   L   ++    
Sbjct: 737 RLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKE 796

Query: 141 IAPTPESAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAE 200
                  A   +   +        +L NEE   L E  E L +++   +  +    +  E
Sbjct: 797 ELKALREALDELRAEL--------TLLNEEAANLRERLESLERRIAATERRLEDLEEQIE 848

Query: 201 KLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEEPTRTL 255
           +L E I      I       E+L+ E+  L  E  S +E +  +   LEE +  L
Sbjct: 849 ELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEEL 903



 Score = 37.0 bits (86), Expect = 0.011
 Identities = 21/83 (25%), Positives = 44/83 (53%)

Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
            +++Q L E    L +QL+E + ++ +     ++L E++ E EE +   + + E L+ E+
Sbjct: 301 EQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAEL 360

Query: 228 TRLTQENESAKEEVKEVLQALEE 250
             L  E E  +  ++E+ + LE 
Sbjct: 361 EELEAELEELESRLEELEEQLET 383



 Score = 36.2 bits (84), Expect = 0.018
 Identities = 22/101 (21%), Positives = 45/101 (44%), Gaps = 7/101 (6%)

Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQ--- 224
            E+ ++L E+ E L  +++E +E I +     E L  +    EE +   R + E+L    
Sbjct: 844 EEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEEL 903

Query: 225 ----GEMTRLTQENESAKEEVKEVLQALEEPTRTLLNFFRQ 261
                + + L +E E  +E++ ++   LE     + N   +
Sbjct: 904 RELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQER 944



 Score = 36.2 bits (84), Expect = 0.020
 Identities = 44/204 (21%), Positives = 79/204 (38%), Gaps = 20/204 (9%)

Query: 69  RRAKTIKNVVTVNEELT-----AEEWKR---RYEKEKEKSSRYKGK---LEKLELELS-- 115
           RR +T + +    E L        E +R     E++ EK+ RYK     L +LEL L   
Sbjct: 173 RRKETERKLERTRENLDRLEDILNELERQLKSLERQAEKAERYKELKAELRELELALLVL 232

Query: 116 RWRNGETVKEDEQVNLADPVDMAASIAPTPESAPASILPAIPGSGLMLGSLSNEER-QKL 174
           R        E+ Q  L +  +    +    +     +        L L     EE  ++L
Sbjct: 233 RLEELREELEELQEELKEAEEELEELTAELQELEEKL------EELRLEVSELEEEIEEL 286

Query: 175 EEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQEN 234
           ++E   L  ++   +++     +    L+ Q+ E E  +       ++L  E+  L ++ 
Sbjct: 287 QKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKL 346

Query: 235 ESAKEEVKEVLQALEEPTRTLLNF 258
           E  KEE++ +   LEE    L   
Sbjct: 347 EELKEELESLEAELEELEAELEEL 370



 Score = 35.8 bits (83), Expect = 0.027
 Identities = 27/90 (30%), Positives = 47/90 (52%), Gaps = 3/90 (3%)

Query: 165 SLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQ 224
            L   E ++L+ E E L ++L+E  EE+ +  +  E+L+E++ E E+ +  A R+  QLQ
Sbjct: 429 KLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQ 488

Query: 225 GEMT---RLTQENESAKEEVKEVLQALEEP 251
             +    RL +  E   E VK +L+     
Sbjct: 489 ARLDSLERLQENLEGFSEGVKALLKNQSGL 518



 Score = 35.8 bits (83), Expect = 0.029
 Identities = 21/76 (27%), Positives = 36/76 (47%)

Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
             E ++LE E E L  +L+E +E++        +L+ QI      I       E+L+   
Sbjct: 357 EAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRR 416

Query: 228 TRLTQENESAKEEVKE 243
            RL QE E   ++++E
Sbjct: 417 ERLQQEIEELLKKLEE 432



 Score = 35.0 bits (81), Expect = 0.045
 Identities = 21/74 (28%), Positives = 40/74 (54%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
           EE  +LEE+ E L ++L+  + E+ +     E+L+ ++ E EE + T R    QL+ ++ 
Sbjct: 337 EELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIA 396

Query: 229 RLTQENESAKEEVK 242
            L  E E  +  ++
Sbjct: 397 SLNNEIERLEARLE 410



 Score = 32.7 bits (75), Expect = 0.27
 Identities = 21/84 (25%), Positives = 42/84 (50%), Gaps = 1/84 (1%)

Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIME-QEELITTARRDYEQLQGE 226
           +EE ++LE +R  L ++L+E  E++ Q     E L+ +I   QE L        E+ +  
Sbjct: 900 SEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEAL 959

Query: 227 MTRLTQENESAKEEVKEVLQALEE 250
             ++  + E A+  +K +   ++E
Sbjct: 960 ENKIEDDEEEARRRLKRLENKIKE 983



 Score = 28.9 bits (65), Expect = 4.6
 Identities = 22/94 (23%), Positives = 42/94 (44%), Gaps = 9/94 (9%)

Query: 166 LSNEERQKLEEER--ERLYQQLDEKDE-------EINQQSQYAEKLKEQIMEQEELITTA 216
           L + ER+    ER  E L +Q++E  E       EI +  +  E+L+ ++        + 
Sbjct: 826 LESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASL 885

Query: 217 RRDYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
                 L+ E+  L++E    + +  E+ + LEE
Sbjct: 886 EEALALLRSELEELSEELRELESKRSELRRELEE 919


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 39.8 bits (94), Expect = 0.001
 Identities = 24/97 (24%), Positives = 51/97 (52%), Gaps = 13/97 (13%)

Query: 167 SNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGE 226
             E R +L++  +RL Q    K+E ++++ +  EK +E++ ++E+ +   +++ E+ + E
Sbjct: 77  LRERRNELQKLEKRLLQ----KEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEE 132

Query: 227 MTRLTQENESA--------KEEVKE-VLQALEEPTRT 254
           +  L +E             EE KE +L+ +EE  R 
Sbjct: 133 LEELIEEQLQELERISGLTAEEAKEILLEKVEEEARH 169



 Score = 32.1 bits (74), Expect = 0.31
 Identities = 17/83 (20%), Positives = 50/83 (60%)

Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
             +++ L E +E +++  +E ++E+ ++    +KL+++++++EE +       E+ + E+
Sbjct: 53  AIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEEL 112

Query: 228 TRLTQENESAKEEVKEVLQALEE 250
            +  +E E  ++E+++  + LEE
Sbjct: 113 EKKEKELEQKQQELEKKEEELEE 135


>gnl|CDD|217933 pfam04156, IncA, IncA protein.  Chlamydia trachomatis is an
           obligate intracellular bacterium that develops within a
           parasitophorous vacuole termed an inclusion. The
           inclusion is non-fusogenic with lysosomes but intercepts
           lipids from a host cell exocytic pathway. Initiation of
           chlamydial development is concurrent with modification
           of the inclusion membrane by a set of C.
           trachomatis-encoded proteins collectively designated
           Incs. One of these Incs, IncA, is functionally
           associated with the homotypic fusion of inclusions. This
           family probably includes members of the wider Inc family
           rather than just IncA.
          Length = 186

 Score = 38.2 bits (89), Expect = 0.002
 Identities = 31/153 (20%), Positives = 56/153 (36%), Gaps = 32/153 (20%)

Query: 137 MAASIAPTPESAPASILPAIPGSGLMLGSL---------------SNEERQKLEEERERL 181
             A +        A I   +  + L LG +                +   QKLEE +  L
Sbjct: 22  GIALLVLFGAGLGALISTLLGIALLALGLVLLALGLLCLLLKAPVQSVRPQKLEELQGEL 81

Query: 182 ---YQQLDEKDEEINQQSQYAEKLKEQIMEQEEL--------------ITTARRDYEQLQ 224
               QQL E  EE+    +   +L+ ++ + +E               + +     ++L 
Sbjct: 82  SELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRELLKSLEERLESLEESIKELA 141

Query: 225 GEMTRLTQENESAKEEVKEVLQALEEPTRTLLN 257
            E+  L Q+     EE++E L+ L+E  + L  
Sbjct: 142 KELRELRQDLREEVEELREELERLQENLQRLQE 174


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 39.3 bits (92), Expect = 0.002
 Identities = 42/176 (23%), Positives = 75/176 (42%), Gaps = 8/176 (4%)

Query: 82  EELTAEEWKRRYEKEKEKSSRYKGKLEKLELELSRWRNGETVKEDEQVNLADPVDMAASI 141
           E+L  E  K+  +  +E+  R K K+ +LE E++        KE E  +  + +    + 
Sbjct: 271 EQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAE 330

Query: 142 APTPESAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEI----NQQSQ 197
                +    +   I         L+ EE  +L+EE E L  +L+E D+E     ++   
Sbjct: 331 IDKLLAEIEELEREIEEERKRRDKLT-EEYAELKEELEDLRAELEEVDKEFAETRDELKD 389

Query: 198 YAEKLKEQIMEQEELITTARRDYEQLQ---GEMTRLTQENESAKEEVKEVLQALEE 250
           Y EKL++   E  EL     R  E+LQ    E+  L       + ++ E+ +  E+
Sbjct: 390 YREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKED 445



 Score = 37.4 bits (87), Expect = 0.008
 Identities = 22/87 (25%), Positives = 47/87 (54%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
           E   KLE E ++L  +++E + EI ++ +  +KL E+  E +E +   R + E++  E  
Sbjct: 322 ERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFA 381

Query: 229 RLTQENESAKEEVKEVLQALEEPTRTL 255
               E +  +E+++++ + + E  R L
Sbjct: 382 ETRDELKDYREKLEKLKREINELKREL 408



 Score = 37.0 bits (86), Expect = 0.013
 Identities = 37/178 (20%), Positives = 66/178 (37%), Gaps = 23/178 (12%)

Query: 92  RYEKEKEKSSRYKGKLEKLE-----LELSRWRNGET--------------VKEDEQVNLA 132
           R  +E+EK+ RY+  L++         L      E                 E     ++
Sbjct: 202 RLRREREKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEIS 261

Query: 133 DPVDMAASIAPTPESAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEI 192
           +       I    E     I        L +     E+  +LE E   L + + EK+ E+
Sbjct: 262 ELEKRLEEIEQLLEELNKKIKDLGEEEQLRV----KEKIGELEAEIASLERSIAEKEREL 317

Query: 193 NQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
               +   KL+ +I +    I    R+ E+ +    +LT+E    KEE++++   LEE
Sbjct: 318 EDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEE 375



 Score = 34.3 bits (79), Expect = 0.074
 Identities = 21/86 (24%), Positives = 36/86 (41%), Gaps = 3/86 (3%)

Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
            E  + L+ E   L  +L   +  +++ SQ       +I E E+ I    ++ E+L+  +
Sbjct: 680 RERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERL 739

Query: 228 TRLT---QENESAKEEVKEVLQALEE 250
             L       E   E VK  L+ LE 
Sbjct: 740 EELEEDLSSLEQEIENVKSELKELEA 765



 Score = 34.3 bits (79), Expect = 0.086
 Identities = 21/89 (23%), Positives = 42/89 (47%), Gaps = 2/89 (2%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
                L++ER+ L  QL E + +I +     EK ++++ E +  +     +  +++    
Sbjct: 882 SRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKG 941

Query: 229 RLTQ--ENESAKEEVKEVLQALEEPTRTL 255
              +  E E + E+V+  LQ +EE  R L
Sbjct: 942 EDEEIPEEELSLEDVQAELQRVEEEIRAL 970



 Score = 33.5 bits (77), Expect = 0.14
 Identities = 20/82 (24%), Positives = 40/82 (48%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
           EE Q+L EE   L   +   + +IN+  +  E    +I +QE  +     D  + + E+ 
Sbjct: 413 EELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELY 472

Query: 229 RLTQENESAKEEVKEVLQALEE 250
            L +E +  ++E+ ++ + L E
Sbjct: 473 DLKEEYDRVEKELSKLQRELAE 494



 Score = 33.1 bits (76), Expect = 0.17
 Identities = 24/91 (26%), Positives = 50/91 (54%), Gaps = 4/91 (4%)

Query: 165 SLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQ 224
           +  +E  Q+L +      +++ E ++EI Q  Q  EKLKE++ E EE +++  ++ E ++
Sbjct: 702 NRLDELSQELSDAS----RKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVK 757

Query: 225 GEMTRLTQENESAKEEVKEVLQALEEPTRTL 255
            E+  L    E  +E++ ++ +AL +    L
Sbjct: 758 SELKELEARIEELEEDLHKLEEALNDLEARL 788



 Score = 32.7 bits (75), Expect = 0.28
 Identities = 19/81 (23%), Positives = 40/81 (49%)

Query: 170 ERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTR 229
           E++ LE+E + L +Q  +  E+I    +  E L  +  E EE +         L+  +  
Sbjct: 827 EKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGD 886

Query: 230 LTQENESAKEEVKEVLQALEE 250
           L +E +  + +++E+ + +EE
Sbjct: 887 LKKERDELEAQLRELERKIEE 907



 Score = 32.3 bits (74), Expect = 0.34
 Identities = 18/89 (20%), Positives = 43/89 (48%)

Query: 165 SLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQ 224
           +    E +  EE   +L  ++D+   EI +  +  E+ +++  +  E     + + E L+
Sbjct: 311 AEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLR 370

Query: 225 GEMTRLTQENESAKEEVKEVLQALEEPTR 253
            E+  + +E    ++E+K+  + LE+  R
Sbjct: 371 AELEEVDKEFAETRDELKDYREKLEKLKR 399



 Score = 32.0 bits (73), Expect = 0.43
 Identities = 38/164 (23%), Positives = 75/164 (45%), Gaps = 10/164 (6%)

Query: 87  EEWKRRYEKEKEKSSRYKGKLEKLELELSRWRNGETVKEDEQVNLADPVDMAASIAPTPE 146
            E ++  E+ +++  + K +LE+LE +LS         + E   L   ++         E
Sbjct: 719 GEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLE 778

Query: 147 SAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEIN---QQSQYAEK-- 201
            A   +   +  S +        E  KLEEE  R+  +L E ++++N    + +Y EK  
Sbjct: 779 EALNDLEARLSHSRI---PEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEI 835

Query: 202 --LKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKE 243
             L+EQ ++ +E I +  ++ E L G+   L +E E  +  +++
Sbjct: 836 QELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRD 879



 Score = 31.6 bits (72), Expect = 0.52
 Identities = 17/86 (19%), Positives = 40/86 (46%), Gaps = 3/86 (3%)

Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
             E  +L+ E +RL ++L    EE+   +     ++ +I E EE       + ++ + ++
Sbjct: 398 KREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKL 457

Query: 228 TRLTQENESAKEE---VKEVLQALEE 250
            +L  +    ++E   +KE    +E+
Sbjct: 458 EQLAADLSKYEQELYDLKEEYDRVEK 483



 Score = 30.4 bits (69), Expect = 1.3
 Identities = 20/86 (23%), Positives = 41/86 (47%)

Query: 166 LSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQG 225
             N ++++LEEE E L   L + +  +    +  ++L+ Q+ E E  I       E+ + 
Sbjct: 858 NLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRK 917

Query: 226 EMTRLTQENESAKEEVKEVLQALEEP 251
            ++ L  + E+ +EE+ E+     E 
Sbjct: 918 RLSELKAKLEALEEELSEIEDPKGED 943



 Score = 29.7 bits (67), Expect = 2.1
 Identities = 24/90 (26%), Positives = 41/90 (45%), Gaps = 14/90 (15%)

Query: 168 NEERQKLEEERERL--YQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQG 225
            ++ ++L  ERE+   YQ L ++  E     +  E LKE             R  E ++ 
Sbjct: 197 RQQLERLRREREKAERYQALLKEKREY----EGYELLKE--------KEALERQKEAIER 244

Query: 226 EMTRLTQENESAKEEVKEVLQALEEPTRTL 255
           ++  L +E E   EE+ E+ + LEE  + L
Sbjct: 245 QLASLEEELEKLTEEISELEKRLEEIEQLL 274


>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342).  This
           family of bacterial proteins has no known function. The
           proteins are in the region of 500-600 amino acid
           residues in length.
          Length = 450

 Score = 38.8 bits (91), Expect = 0.003
 Identities = 19/78 (24%), Positives = 40/78 (51%), Gaps = 2/78 (2%)

Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
            EE ++LEEE + L ++L++  + + +  + A    +   E+ E +       E+L  E+
Sbjct: 333 KEELKELEEELKELEEELEKIKKLLKKLPKKAR--GQLPPEKREQLEKLLETKEKLSEEL 390

Query: 228 TRLTQENESAKEEVKEVL 245
             L +E +  KEE++ + 
Sbjct: 391 EELEEELKELKEELESLY 408


>gnl|CDD|219176 pfam06785, UPF0242, Uncharacterized protein family (UPF0242). 
          Length = 401

 Score = 38.3 bits (89), Expect = 0.003
 Identities = 20/92 (21%), Positives = 42/92 (45%), Gaps = 3/92 (3%)

Query: 164 GSLSNEERQKLEEERERLYQ---QLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDY 220
           G L   + +KL E+ E L      ++E+ +E +Q     +KL  Q+     +    +  Y
Sbjct: 70  GQLLQTKIRKLTEQDEGLRNIRESIEERQKESDQLKLQNQKLVNQLGHARGVFIKTKGRY 129

Query: 221 EQLQGEMTRLTQENESAKEEVKEVLQALEEPT 252
           ++L+     L +EN+  + ++  ++Q   E  
Sbjct: 130 QKLEELSRHLKEENQCLQIQLDALVQECNEKI 161


>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
          Length = 562

 Score = 38.5 bits (90), Expect = 0.003
 Identities = 18/84 (21%), Positives = 40/84 (47%), Gaps = 1/84 (1%)

Query: 172 QKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLT 231
             LE+       +L+E  +E N+QS+   +LK +I   ++ + T     ++++  +  L 
Sbjct: 313 HSLEKLDTA-IDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQ 371

Query: 232 QENESAKEEVKEVLQALEEPTRTL 255
            E     EE+ ++   L++  +T 
Sbjct: 372 AEFVDNAEELAKLQDELDKIVKTK 395



 Score = 30.0 bits (68), Expect = 1.8
 Identities = 19/79 (24%), Positives = 36/79 (45%), Gaps = 4/79 (5%)

Query: 181 LYQQLDEKDEEINQQSQYAEKLKEQIMEQEEL----ITTARRDYEQLQGEMTRLTQENES 236
           L QQ+   D +I+   Q  +   + I EQ +     I   +  Y++L  E   +  E E 
Sbjct: 179 LNQQIQTLDMKIDHIQQQIKTYNKNIEEQRKKNGENIARKQNKYDELVEEAKTIKAEIEE 238

Query: 237 AKEEVKEVLQALEEPTRTL 255
             +E+  ++  +E+P+  L
Sbjct: 239 LTDELLNLVMDIEDPSAAL 257


>gnl|CDD|179877 PRK04778, PRK04778, septation ring formation regulator EzrA;
           Provisional.
          Length = 569

 Score = 38.3 bits (90), Expect = 0.004
 Identities = 23/88 (26%), Positives = 45/88 (51%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
           E  ++LE++ E L +Q DE  E I +Q     +L+E++ E  + +    ++ E+L   + 
Sbjct: 348 ESVRQLEKQLESLEKQYDEITERIAEQEIAYSELQEELEEILKQLEEIEKEQEKLSEMLQ 407

Query: 229 RLTQENESAKEEVKEVLQALEEPTRTLL 256
            L ++   A+E+++     L E  R L 
Sbjct: 408 GLRKDELEAREKLERYRNKLHEIKRYLE 435



 Score = 32.5 bits (75), Expect = 0.25
 Identities = 19/73 (26%), Positives = 36/73 (49%), Gaps = 5/73 (6%)

Query: 189 DEEINQQSQYAEKL-----KEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKE 243
           +E++ +  +  +K      K +I E E L+     D EQ+  E+  L +  E  +EEV++
Sbjct: 85  EEQLFEAEELNDKFRFRKAKHEINEIESLLDLIEEDIEQILEELQELLESEEKNREEVEQ 144

Query: 244 VLQALEEPTRTLL 256
           +     E  ++LL
Sbjct: 145 LKDLYRELRKSLL 157


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 38.3 bits (90), Expect = 0.004
 Identities = 21/86 (24%), Positives = 47/86 (54%), Gaps = 4/86 (4%)

Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
            EE + L +E E+L ++L+EK E+   Q +  + L+E   E ++ I  A+++ +++  E+
Sbjct: 536 AEEAEALLKEAEKLKEELEEKKEK--LQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKEL 593

Query: 228 TRLTQENESA--KEEVKEVLQALEEP 251
            +L +   ++    E+ E  + L + 
Sbjct: 594 RQLQKGGYASVKAHELIEARKRLNKA 619



 Score = 34.8 bits (81), Expect = 0.054
 Identities = 22/89 (24%), Positives = 45/89 (50%), Gaps = 14/89 (15%)

Query: 162 MLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYE 221
           ++ SL   ER +LE++ E     L E ++   +  +  EKL+E   E+++L+  A ++ +
Sbjct: 521 LIASLEELER-ELEQKAEEAEALLKEAEKLKEELEEKKEKLQE---EEDKLLEEAEKEAQ 576

Query: 222 QLQGEMTRLTQENESAKEEVKEVLQALEE 250
           Q            + AK+E  E+++ L +
Sbjct: 577 QAI----------KEAKKEADEIIKELRQ 595



 Score = 28.3 bits (64), Expect = 5.9
 Identities = 20/73 (27%), Positives = 38/73 (52%), Gaps = 7/73 (9%)

Query: 184 QLDEKDEEINQ-----QSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAK 238
            + E  E++N+     +    E L+++  E E L+  A +  E+L+ +  +L +E +   
Sbjct: 510 LIGEDKEKLNELIASLEELERE-LEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLL 568

Query: 239 EEV-KEVLQALEE 250
           EE  KE  QA++E
Sbjct: 569 EEAEKEAQQAIKE 581


>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
           Provisional.
          Length = 1123

 Score = 38.0 bits (89), Expect = 0.005
 Identities = 27/119 (22%), Positives = 49/119 (41%), Gaps = 21/119 (17%)

Query: 148 APASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIM 207
               + P  P + L            L++E   L QQL+ +  E  Q    AE  +++++
Sbjct: 131 PGPFVPPEDPENLL----------HALQQEVLTLKQQLELQAREKAQSQALAEAQQQELV 180

Query: 208 EQEELITTARRDYEQLQGEMTRLTQENESA----KEEVKEV-LQAL------EEPTRTL 255
             E L        ++L+ ++ +L ++        K++ KE+  QA       EE TR L
Sbjct: 181 ALEGLAAELEEKQQELEAQLEQLQEKAAETSQERKQKRKEITDQAAKRLELSEEETRIL 239


>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
          Length = 508

 Score = 37.4 bits (87), Expect = 0.008
 Identities = 20/84 (23%), Positives = 42/84 (50%), Gaps = 5/84 (5%)

Query: 167 SNEERQKLEEERERLYQ---QLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQL 223
           +  ER++L+ E ERL Q   QLD + E+++      E+ ++ +  +E  +    +  +  
Sbjct: 72  ARREREELQREEERLVQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNE 131

Query: 224 QGEMTRLTQENESAKEEVKEVLQA 247
              +  LT   E A++ + ++L A
Sbjct: 132 LYRVAGLTP--EQARKLLLKLLDA 153



 Score = 30.4 bits (69), Expect = 1.2
 Identities = 25/91 (27%), Positives = 42/91 (46%), Gaps = 2/91 (2%)

Query: 171 RQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRL 230
           RQ+L +E ER+ Q+  ++ EE  + +   E  +  + E+ +    ARR+ E+LQ E  RL
Sbjct: 28  RQRLAKEAERILQEAQKEAEEKLEAAL-LEAKELLLRERNQQRQEARREREELQREEERL 86

Query: 231 TQENESAKEEVKEVLQALEEPTRTLLNFFRQ 261
            Q+ E      ++ L  LE            
Sbjct: 87  VQKEEQLDARAEK-LDNLENQLEEREKALSA 116


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 37.0 bits (86), Expect = 0.010
 Identities = 39/180 (21%), Positives = 75/180 (41%), Gaps = 22/180 (12%)

Query: 74  IKNVVTVNEELTAEEWKRRYEKEKEKSSRYKGKLEKLELELSRWRNGETVKEDEQVNLAD 133
           I++++   EE   E  K    +E+++    + ++E LE  L+     +   E+ +  L +
Sbjct: 201 IEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARLLE 260

Query: 134 PVDMAASIAPTPESAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEIN 193
              +        E                       E ++L EE E   ++L+E + EI 
Sbjct: 261 IESLELEALKIREEE-------------------LRELERLLEELEEKIERLEELEREIE 301

Query: 194 QQSQYAEKLKEQIMEQEELITTARR---DYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
           +  +  E L+  + E EEL+   +      E+L+ ++ +L  E E   EE  E+ + LEE
Sbjct: 302 ELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEE 361



 Score = 33.6 bits (77), Expect = 0.12
 Identities = 19/89 (21%), Positives = 43/89 (48%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
           E  +KL+   ERL +  ++ ++  ++  + AE+  E     EE +       E+L+ E+ 
Sbjct: 319 ELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELE 378

Query: 229 RLTQENESAKEEVKEVLQALEEPTRTLLN 257
           +  +  +  +E ++E+ + L E +  L  
Sbjct: 379 KALERLKQLEEAIQELKEELAELSAALEE 407



 Score = 32.0 bits (73), Expect = 0.37
 Identities = 33/164 (20%), Positives = 61/164 (37%), Gaps = 10/164 (6%)

Query: 87  EEWKRRYEKEKEKSSRYKGKLEKLELELSRWRNGETVKEDEQVNLADPVDMAASIAPTPE 146
           E++++  E  KE     K K+E+LE +LS         ED    L + +     +    E
Sbjct: 167 EKYEKLSELLKEVIKEAKAKIEELEGQLSE---LLEDIEDLLEALEEELKELKKLEEIQE 223

Query: 147 SAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQI 206
                 L               E   +LEEE+ERL +      E  + + +  +  +E++
Sbjct: 224 EQEEEEL-------EQEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEEL 276

Query: 207 MEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
            E E L+       E+L+     + +  E  +     + +  E 
Sbjct: 277 RELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEEL 320



 Score = 29.7 bits (67), Expect = 2.0
 Identities = 18/56 (32%), Positives = 33/56 (58%)

Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQL 223
            EE ++LEEE E+L ++L+E  +++ +  Q  E+L+ +  E EEL     +  + L
Sbjct: 687 LEELEQLEEELEQLREELEELLKKLGEIEQLIEELESRKAELEELKKELEKLEKAL 742



 Score = 29.7 bits (67), Expect = 2.2
 Identities = 41/171 (23%), Positives = 77/171 (45%), Gaps = 14/171 (8%)

Query: 86  AEEWKRRYEKEKEKSSRY---KGKLEKLELELSRWRNGETVKEDEQVNLADPVDMAASIA 142
             E +R  E+ +EK  R    + ++E+LE EL   R      E+    L    +    + 
Sbjct: 276 LRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLE 335

Query: 143 PTPESAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKL 202
              E   + +          L    NE  + LEE  + L ++L+E ++E+ +  +  ++L
Sbjct: 336 EKLEKLESELE--------ELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQL 387

Query: 203 KEQIMEQEELITTARRDYEQLQ---GEMTRLTQENESAKEEVKEVLQALEE 250
           +E I E +E +       E++Q    E+ +  +E E   EE++E ++ LEE
Sbjct: 388 EEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEEEIKKLEE 438



 Score = 29.0 bits (65), Expect = 3.7
 Identities = 24/85 (28%), Positives = 48/85 (56%), Gaps = 2/85 (2%)

Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
            +E  +L EE E L ++L E +EE+ +  +  E LKE++ E+ E +     + E+L+ ++
Sbjct: 493 EKEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKL 552

Query: 228 TR--LTQENESAKEEVKEVLQALEE 250
               L +E    ++ ++E+ + LEE
Sbjct: 553 QLQQLKEELRQLEDRLQELKELLEE 577



 Score = 29.0 bits (65), Expect = 3.8
 Identities = 19/88 (21%), Positives = 36/88 (40%)

Query: 175 EEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQEN 234
           EE  + L +  + + EE+ ++    ++  E   E EEL    R   E+L   +       
Sbjct: 469 EEHEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALK 528

Query: 235 ESAKEEVKEVLQALEEPTRTLLNFFRQS 262
           E  +E+++++   LEE          Q 
Sbjct: 529 EELEEKLEKLENLLEELEELKEKLQLQQ 556



 Score = 28.2 bits (63), Expect = 6.9
 Identities = 16/81 (19%), Positives = 34/81 (41%)

Query: 170 ERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTR 229
           E +  + E      +L E + E+ +  +  E   E++  Q EL    +   E+L+ ++  
Sbjct: 606 EERLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKVEE 665

Query: 230 LTQENESAKEEVKEVLQALEE 250
           L  E     + ++   Q  E+
Sbjct: 666 LEAEIRRELQRIENEEQLEEK 686


>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family.  This family of proteins includes
           MND1 from S. cerevisiae. The mnd1 protein forms a
           complex with hop2 to promote homologous chromosome
           pairing and meiotic double-strand break repair.
          Length = 188

 Score = 36.1 bits (84), Expect = 0.011
 Identities = 26/92 (28%), Positives = 47/92 (51%), Gaps = 1/92 (1%)

Query: 165 SLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQ 224
           S  ++   KL+   E+L ++L+E  + I +     EKLK+   E EE  T    + +QL+
Sbjct: 58  SFPSQALNKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEE-RTELLEELKQLE 116

Query: 225 GEMTRLTQENESAKEEVKEVLQALEEPTRTLL 256
            E+ +L  E E  ++   E ++ L+E T+   
Sbjct: 117 KELKKLKAELEKYEKNDPERIEKLKEETKVAK 148


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 37.0 bits (86), Expect = 0.012
 Identities = 24/89 (26%), Positives = 55/89 (61%), Gaps = 2/89 (2%)

Query: 167 SNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGE 226
           + +E ++ E+E ++L ++LD+  EE+ +  +  E+L++++ E E+  +    +YE+L+ E
Sbjct: 610 AEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYS--EEEYEELREE 667

Query: 227 MTRLTQENESAKEEVKEVLQALEEPTRTL 255
              L++E    + E++E+ +  EE  +TL
Sbjct: 668 YLELSRELAGLRAELEELEKRREEIKKTL 696



 Score = 28.9 bits (65), Expect = 4.1
 Identities = 26/91 (28%), Positives = 42/91 (46%)

Query: 163 LGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQ 222
           L   + EE +K  EE E+L ++L +   EI    +  EKL+E   +  EL        E+
Sbjct: 512 LKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEE 571

Query: 223 LQGEMTRLTQENESAKEEVKEVLQALEEPTR 253
           L   +  L +    + EE++E L+ LE    
Sbjct: 572 LAELLKELEELGFESVEELEERLKELEPFYN 602



 Score = 28.5 bits (64), Expect = 5.9
 Identities = 17/71 (23%), Positives = 33/71 (46%)

Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
            E + K EE      +      E++ ++ +  EK KE+I E+   IT    + ++   E+
Sbjct: 365 EEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKEL 424

Query: 228 TRLTQENESAK 238
            +  +E + AK
Sbjct: 425 KKAIEELKKAK 435



 Score = 28.1 bits (63), Expect = 6.4
 Identities = 26/87 (29%), Positives = 49/87 (56%), Gaps = 6/87 (6%)

Query: 175 EEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEEL---ITTARRDYEQLQGEMTRLT 231
           E+E E + ++++E   E+ +  +  EKL++++ E EEL   I    ++ E L+G   +L 
Sbjct: 199 EKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLE 258

Query: 232 ---QENESAKEEVKEVLQALEEPTRTL 255
              +E E   EE+K+ ++ LEE  + L
Sbjct: 259 EKIRELEERIEELKKEIEELEEKVKEL 285


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
           protein family have a signal peptide, a strongly
           conserved SH3 domain, a variable region, and then a
           C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 35.8 bits (83), Expect = 0.014
 Identities = 19/80 (23%), Positives = 38/80 (47%), Gaps = 3/80 (3%)

Query: 167 SNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQ---IMEQEELITTARRDYEQL 223
             EE  +L+E+   L Q+  E  +E++      E+L+++   I +          +  +L
Sbjct: 78  LQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIELDEENREL 137

Query: 224 QGEMTRLTQENESAKEEVKE 243
           + E+  L QENE+ + E + 
Sbjct: 138 REELAELKQENEALEAENER 157



 Score = 33.4 bits (77), Expect = 0.070
 Identities = 25/87 (28%), Positives = 48/87 (55%), Gaps = 6/87 (6%)

Query: 166 LSNEE--RQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQL 223
           LS+    R++L E    L Q+L E  EE+ +  +   +L+++  E ++ ++T   + E+L
Sbjct: 58  LSDTPSARERLPE----LQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERL 113

Query: 224 QGEMTRLTQENESAKEEVKEVLQALEE 250
           Q E+ R+ Q + +A E  +E  +  EE
Sbjct: 114 QKELARIKQLSANAIELDEENRELREE 140


>gnl|CDD|215124 PLN02195, PLN02195, cellulose synthase A.
          Length = 977

 Score = 36.5 bits (84), Expect = 0.018
 Identities = 25/93 (26%), Positives = 43/93 (46%), Gaps = 7/93 (7%)

Query: 59  SETKSTLDFGRRAKTIKNVVTVNEELTAEE----WKRRYEKEKEKSSRYKGKLEKLELEL 114
           S    T D G  A+ I +V TV+ EL  E     WK R E  K+K ++ K   +K E   
Sbjct: 80  SHLNDTQDVGIHARHISSVSTVDSELNDEYGNPIWKNRVESWKDKKNKKKKSAKKKEAHK 139

Query: 115 SRWRNGETVKEDEQVNLADPVDMAASIAPTPES 147
           ++    + ++E    +  +P+   + + P P +
Sbjct: 140 AQIPPEQQMEEKPSADAYEPL---SRVIPIPRN 169


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 35.8 bits (83), Expect = 0.027
 Identities = 26/118 (22%), Positives = 46/118 (38%), Gaps = 29/118 (24%)

Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
           NEE  +LE    RL Q+LD         +    +  EQ+   +E +  A+ + EQ +  +
Sbjct: 441 NEEEYELELRLGRLKQRLD--------SATATPEELEQLEINDEALEKAQEEQEQAEANV 492

Query: 228 TRLTQENESAK-----------------EEVKEVLQALEE----PTRTLLNFFRQSSP 264
            +L  E    +                  ++++ L  LE        +LL+F R  +P
Sbjct: 493 EQLQSELRQLRKRRDEALEALQRAERRLLQLRQALDELELQLSPQAGSLLHFLRNEAP 550



 Score = 33.9 bits (78), Expect = 0.098
 Identities = 14/76 (18%), Positives = 34/76 (44%)

Query: 172 QKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLT 231
            +L E  ++  + L     +  Q  +   +   ++ EQ+     AR   +Q + ++ RL 
Sbjct: 603 TELRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEARTALKQARLDLQRLQ 662

Query: 232 QENESAKEEVKEVLQA 247
            E +S K++++  +  
Sbjct: 663 NEQQSLKDKLELAIAE 678



 Score = 30.0 bits (68), Expect = 1.8
 Identities = 24/87 (27%), Positives = 39/87 (44%), Gaps = 6/87 (6%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDY-----EQL 223
           E + +L ++   L  QL E  +E+NQ+   A         + EL+   +  +     EQL
Sbjct: 288 EAKNRLRQQLRTLEDQLKEARDELNQELSAANAKLAADRSELELLEDQKGAFEDADIEQL 347

Query: 224 QGEMTRLTQENESAKEEVKEVLQALEE 250
           Q ++ +L     S  EEV+  L AL  
Sbjct: 348 QADLDQLPSI-RSELEEVEARLDALTG 373


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 35.7 bits (83), Expect = 0.028
 Identities = 21/86 (24%), Positives = 39/86 (45%), Gaps = 20/86 (23%)

Query: 168 NEERQKLEEERERLYQQ---LDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQ 224
            E R +L+    RL Q+   LD K E +++        KE+ +E++E         ++L 
Sbjct: 72  KERRNELQRLERRLLQREETLDRKMESLDK--------KEENLEKKE---------KELS 114

Query: 225 GEMTRLTQENESAKEEVKEVLQALEE 250
            +   L ++ E  +E + E  + LE 
Sbjct: 115 NKEKNLDEKEEELEELIAEQREELER 140


>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
           region.  This family includes the N-terminal regions of
           the junctin, junctate and aspartyl beta-hydroxylase
           proteins. Junctate is an integral ER/SR membrane calcium
           binding protein, which comes from an alternatively
           spliced form of the same gene that generates aspartyl
           beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
           catalyzes the post-translational hydroxylation of
           aspartic acid or asparagine residues contained within
           epidermal growth factor (EGF) domains of proteins.
          Length = 240

 Score = 34.9 bits (80), Expect = 0.031
 Identities = 22/86 (25%), Positives = 38/86 (44%), Gaps = 1/86 (1%)

Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
            EE    +E +    + L  +D +  Q++  A K+ E+  +   +  TA   Y+Q   E 
Sbjct: 121 QEEDGPGKEPQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASEQYKQDMKEK 180

Query: 228 TRLTQENESAKEEVKEVLQALEEPTR 253
               QENE +KE V++  +   E   
Sbjct: 181 AS-EQENEDSKEPVEKAERTKAETDD 205


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 35.1 bits (81), Expect = 0.036
 Identities = 32/160 (20%), Positives = 57/160 (35%), Gaps = 20/160 (12%)

Query: 102 RYKGKLEKLELELSRWRNGETVKEDEQVNLADPVDMAASIAPTPESAPASILP-----AI 156
            YK KLEK+E +L      E     +   +   V     +A           P       
Sbjct: 356 AYKPKLEKVERKLP-----ELGIWKDVERIKALVIRGYPLAEALSKVKEEERPREKEGTE 410

Query: 157 PGSGLMLGSLSNEER------QKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQE 210
                 +       +      ++LEEE   L ++L+E   EI +     E+ + + +  +
Sbjct: 411 EEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEKLESELERFRRE-VRDK 469

Query: 211 ELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
                  R+       + RL +E E  K+ V+E+ + L E
Sbjct: 470 VRK---DREIRARDRRIERLEKELEEKKKRVEELERKLAE 506


>gnl|CDD|236942 PRK11637, PRK11637, AmiB activator; Provisional.
          Length = 428

 Score = 35.1 bits (81), Expect = 0.041
 Identities = 23/101 (22%), Positives = 46/101 (45%), Gaps = 11/101 (10%)

Query: 140 SIAPTPESAPASILPAIPGSGLMLGSL-------SNEERQKLEEERERLYQQLDEKDEEI 192
           ++    +    +I P +  S L  G L       +++ R +L+     + Q +  K++ +
Sbjct: 8   TMTRAVKPRRFAIRPILYASVLSAGVLLCAFSAHASDNRDQLKS----IQQDIAAKEKSV 63

Query: 193 NQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQE 233
            QQ Q    L  Q+ +QEE I+ A R   + Q  + +L ++
Sbjct: 64  RQQQQQRASLLAQLKKQEEAISQASRKLRETQNTLNQLNKQ 104


>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein.  This domain is found in
           cell division proteins which are required for
           kinetochore-spindle association.
          Length = 321

 Score = 34.6 bits (80), Expect = 0.048
 Identities = 21/88 (23%), Positives = 41/88 (46%)

Query: 163 LGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQ 222
           + S+  + R+KL+  +E +       DE         EK ++++      I+  R+  E+
Sbjct: 173 INSIKPKLRKKLQALKEEIASLRQLADELNLCDPLELEKARQELRSLSVKISEKRKQLEE 232

Query: 223 LQGEMTRLTQENESAKEEVKEVLQALEE 250
           LQ E+  LT   E+   +  E+L+ + E
Sbjct: 233 LQQELQELTIAIEALTNKKSELLEEIAE 260



 Score = 31.9 bits (73), Expect = 0.38
 Identities = 25/106 (23%), Positives = 44/106 (41%), Gaps = 13/106 (12%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
           +E + L  +     +QL+E  +E+ + +   E L  +  E  E I  A +  E+ +G   
Sbjct: 214 QELRSLSVKISEKRKQLEELQQELQELTIAIEALTNKKSELLEEIAEAEKIREECRG--- 270

Query: 229 RLTQENESAKE--EVKEVLQALEEPTRTLLNFFRQSSPKGTLLYNH 272
                  SAKE  ++K  +  L+    T   F   S    ++ YN 
Sbjct: 271 ------WSAKEISKLKAKVSLLQ--KLTGWKFLSLSGSTLSMSYNR 308


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 34.9 bits (81), Expect = 0.052
 Identities = 35/227 (15%), Positives = 71/227 (31%), Gaps = 52/227 (22%)

Query: 29  SKLTRILQESLGGNARTTIIICCSPASFNESET---KSTLDFGRRAKTIKNVVTVNEELT 85
            KL   L +         +       S    E        +  +  K IK +     EL 
Sbjct: 60  DKLRSYLPK------LNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELE 113

Query: 86  AEEWKRRYEKEKEKSSRYKGKLEKLELELSRWRNGETVK---------EDEQVNLADPVD 136
            E   +  E+E E+   +       +L+LS     + V          + E++ L   V+
Sbjct: 114 NE--IKELEQEIERLEPWGN----FDLDLSLLLGFKYVSVFVGTVPEDKLEELKLESDVE 167

Query: 137 MAASIAPTPESAPASILPA--------------------IPGSGLMLGSLSNEERQKLEE 176
               I+         ++                      +   G       +E  ++++E
Sbjct: 168 NVEYISTDKGYVYVVVVVLKELSDEVEEELKKLGFERLELEEEGT-----PSELIREIKE 222

Query: 177 ERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQL 223
           E E + ++ +   EE+    + A+K  E+++   E +       E L
Sbjct: 223 ELEEIEKERESLLEELK---ELAKKYLEELLALYEYLEIELERAEAL 266



 Score = 30.7 bits (70), Expect = 0.93
 Identities = 24/91 (26%), Positives = 42/91 (46%), Gaps = 9/91 (9%)

Query: 165 SLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQ-----EELITTARRD 219
            LSNE  +KL     +L + LD+    +         L+E+  +      EELI     +
Sbjct: 39  ELSNERLRKLRSLLTKLSEALDK----LRSYLPKLNPLREEKKKVSVKSLEELIKDVEEE 94

Query: 220 YEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
            E+++ E+  L +E    + E+KE+ Q +E 
Sbjct: 95  LEKIEKEIKELEEEISELENEIKELEQEIER 125


>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
           with the myosin-like domain [Function unknown].
          Length = 499

 Score = 34.6 bits (79), Expect = 0.052
 Identities = 16/70 (22%), Positives = 26/70 (37%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
              Q   EE  R      +  + I Q+     +  +QI  + E I    R  ++L+    
Sbjct: 210 NAAQARTEELARRAAAAQQTAQAIQQRDAQISQKAQQIAARAEQIRERERQLQRLETAQA 269

Query: 229 RLTQENESAK 238
           RL QE    +
Sbjct: 270 RLEQEVAQLE 279



 Score = 30.8 bits (69), Expect = 1.00
 Identities = 15/80 (18%), Positives = 35/80 (43%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
           EE  +     ++  Q + ++D +I+Q++Q      EQI E+E  +        +L+ E+ 
Sbjct: 217 EELARRAAAAQQTAQAIQQRDAQISQKAQQIAARAEQIRERERQLQRLETAQARLEQEVA 276

Query: 229 RLTQENESAKEEVKEVLQAL 248
           +L    ++     ++     
Sbjct: 277 QLEAYYQAYVRLRQQAAATQ 296



 Score = 29.2 bits (65), Expect = 3.0
 Identities = 15/84 (17%), Positives = 37/84 (44%), Gaps = 4/84 (4%)

Query: 163 LGSLSNEERQKLEEERERLY---QQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRD 219
           L +L+ E+R++LE + + L    +QL     ++  Q    +    QI ++ + + T    
Sbjct: 153 LKTLA-EQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQIEQEAQNLATRANA 211

Query: 220 YEQLQGEMTRLTQENESAKEEVKE 243
            +    E+ R     +   + +++
Sbjct: 212 AQARTEELARRAAAAQQTAQAIQQ 235



 Score = 28.1 bits (62), Expect = 5.9
 Identities = 15/89 (16%), Positives = 37/89 (41%), Gaps = 3/89 (3%)

Query: 172 QKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLT 231
           + L  E      +    + E        +K +++     + +  AR++  + Q E+ RLT
Sbjct: 84  RALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAKAQQELARLT 143

Query: 232 QENESAKEEVKEVL---QALEEPTRTLLN 257
           ++ +  +  +K +    + LE   ++L  
Sbjct: 144 KQAQDLQTRLKTLAEQRRQLEAQAQSLQA 172


>gnl|CDD|132364 TIGR03321, alt_F1F0_F0_B, alternate F1F0 ATPase, F0 subunit B.  A
           small number of taxonomically diverse prokaryotic
           species, including Methanosarcina barkeri, have what
           appears to be a second ATP synthase, in addition to the
           normal F1F0 ATPase in bacteria and A1A0 ATPase in
           archaea. These enzymes use ion gradients to synthesize
           ATP, CC and in principle may run in either direction.
           This model represents the F0 subunit B of this apparent
           second ATP synthase.
          Length = 246

 Score = 34.3 bits (79), Expect = 0.056
 Identities = 24/92 (26%), Positives = 48/92 (52%), Gaps = 10/92 (10%)

Query: 164 GSLSNEERQKLEEERERLYQQLDEKDEEINQQ-SQYAEKLKEQI-MEQEELITTARRDYE 221
           G L++ + +K E E+ER  ++ +EK+EE++QQ      K KE+   E++ L+  AR + +
Sbjct: 43  GELADADTKKREAEQER--REYEEKNEELDQQREVLLTKAKEEAQAERQRLLDEAREEAD 100

Query: 222 QLQGEMTRLTQENESAKEEVKEVLQALEEPTR 253
           +++       +  E+ + E   +   L   T 
Sbjct: 101 EIRE------KWQEALRREQAALSDELRRRTG 126


>gnl|CDD|224241 COG1322, COG1322, Predicted nuclease of restriction
           endonuclease-like fold, RmuC family [General function
           prediction only].
          Length = 448

 Score = 33.9 bits (78), Expect = 0.087
 Identities = 22/93 (23%), Positives = 40/93 (43%), Gaps = 16/93 (17%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
           E  Q+L E + RL QQL +  E++    +   +L  +               ++L  E+ 
Sbjct: 70  ELIQELNELKARLQQQLLQSREQLQLLIESLAQLSSEF--------------QELANEIF 115

Query: 229 RLTQENESAKEEVKEVLQALEEPTRTLLNFFRQ 261
              + N    E  ++ L+ L +P R +L  FR+
Sbjct: 116 --EELNRRLAELNQQNLKQLLKPLREVLEKFRE 146


>gnl|CDD|114855 pfam06160, EzrA, Septation ring formation regulator, EzrA.  During
           the bacterial cell cycle, the tubulin-like cell-division
           protein FtsZ polymerises into a ring structure that
           establishes the location of the nascent division site.
           EzrA modulates the frequency and position of FtsZ ring
           formation.
          Length = 559

 Score = 33.3 bits (77), Expect = 0.13
 Identities = 26/90 (28%), Positives = 49/90 (54%), Gaps = 2/90 (2%)

Query: 169 EERQKLEEERERLYQQLDEKDEEI-NQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
           E  ++LE++ E L  Q D+  E I  ++  Y+E L+E++ E E+ +    ++ E+L   +
Sbjct: 344 ETVRELEKQLEELEAQYDQLVERIAEKKVAYSE-LQEELEEIEKQLEEIEKEQEELSESL 402

Query: 228 TRLTQENESAKEEVKEVLQALEEPTRTLLN 257
             L ++   A+E+++E  Q L E  R L  
Sbjct: 403 QGLRKDELEAREKLQEYRQKLHEIKRYLEK 432



 Score = 29.5 bits (67), Expect = 2.2
 Identities = 21/56 (37%), Positives = 31/56 (55%)

Query: 201 KLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEEPTRTLL 256
           K K+ I E EEL+ T   D EQ+  E+  L +  E  ++EV+E+     E  +TLL
Sbjct: 98  KAKKAIKEIEELLDTIEEDIEQILEELNELKESEEKNRKEVEELKDKYRELRKTLL 153


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 33.3 bits (76), Expect = 0.16
 Identities = 20/100 (20%), Positives = 40/100 (40%), Gaps = 7/100 (7%)

Query: 170 ERQKLEEERERLYQQLDEKDEEI--NQQSQYAEKLKEQIMEQEELITTARR-----DYEQ 222
           E++KL++E E+  ++L E++       + +  E LK    E E +I   +        E 
Sbjct: 540 EQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKEVESIIRELKEKKIHKAKEI 599

Query: 223 LQGEMTRLTQENESAKEEVKEVLQALEEPTRTLLNFFRQS 262
              E     +E +    +     QA +   +  + +F Q 
Sbjct: 600 KSIEDLVKLKETKQKIPQKPTNFQADKIGDKVRIRYFGQK 639


>gnl|CDD|236795 PRK10920, PRK10920, putative uroporphyrinogen III
           C-methyltransferase; Provisional.
          Length = 390

 Score = 32.8 bits (75), Expect = 0.17
 Identities = 20/121 (16%), Positives = 41/121 (33%), Gaps = 18/121 (14%)

Query: 127 EQVNLADPVDMAASIAPTPESAPASILPAIPGSGLMLGSLS---------------NEER 171
           EQ   +  V+       T     A+   +   +GL+L +++                ++ 
Sbjct: 3   EQEKSSAVVEETREAVETTSQPVATEKKSKNRTGLVLSAVAIAIALAAGAGLYYHGKQQA 62

Query: 172 QKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLT 231
           Q      + L  QL         Q    ++L+  + +Q + +  A R    L  ++  L 
Sbjct: 63  QNQTATNDALANQLTALQ---KAQESQKQELEGILKQQAKALDQANRQQAALAKQLDELQ 119

Query: 232 Q 232
           Q
Sbjct: 120 Q 120


>gnl|CDD|114015 pfam05266, DUF724, Protein of unknown function (DUF724).  This
           family contains several uncharacterized proteins found
           in Arabidopsis thaliana and other plants. This region is
           often found associated with Agenet domains and may
           contain coiled-coil.
          Length = 190

 Score = 32.0 bits (73), Expect = 0.22
 Identities = 20/79 (25%), Positives = 36/79 (45%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
           EER+ LE+E         E D EI +  +   +L+ Q    +E      ++  +L+ E +
Sbjct: 110 EERKGLEKEIAEKEISRQELDSEIAELERKILELQRQAALLKEKKEAEDKEIARLKSEAS 169

Query: 229 RLTQENESAKEEVKEVLQA 247
           ++ QE   A+ E +    A
Sbjct: 170 KIKQELVDAELEFQTTAAA 188



 Score = 29.7 bits (67), Expect = 1.4
 Identities = 17/76 (22%), Positives = 36/76 (47%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
           +++ K  EER+ L +++ EK+    +      +L+ +I+E +      +   E    E+ 
Sbjct: 103 DDQTKKLEERKGLEKEIAEKEISRQELDSEIAELERKILELQRQAALLKEKKEAEDKEIA 162

Query: 229 RLTQENESAKEEVKEV 244
           RL  E    K+E+ + 
Sbjct: 163 RLKSEASKIKQELVDA 178


>gnl|CDD|223571 COG0497, RecN, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 557

 Score = 32.6 bits (75), Expect = 0.24
 Identities = 11/44 (25%), Positives = 26/44 (59%), Gaps = 1/44 (2%)

Query: 167 SNEERQKLEEERERLYQQLDEKDEEINQQSQ-YAEKLKEQIMEQ 209
           S E  + LE+E ++L  +L E  E ++   +  A++L++++  +
Sbjct: 340 SEESLEALEKEVKKLKAELLEAAEALSAIRKKAAKELEKEVTAE 383



 Score = 30.3 bits (69), Expect = 1.3
 Identities = 19/80 (23%), Positives = 36/80 (45%), Gaps = 9/80 (11%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDY-----EQL 223
           E   K++EE      QLD  +E +    +  +KLK +++E  E ++  R+       +++
Sbjct: 325 EYLDKIKEE----LAQLDNSEESLEALEKEVKKLKAELLEAAEALSAIRKKAAKELEKEV 380

Query: 224 QGEMTRLTQENESAKEEVKE 243
             E+  L  E      E+K 
Sbjct: 381 TAELKALAMEKARFTVELKP 400



 Score = 28.7 bits (65), Expect = 4.3
 Identities = 23/90 (25%), Positives = 37/90 (41%), Gaps = 4/90 (4%)

Query: 162 MLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYE 221
            L   S E R  L+E  E    +L+E +E +      A K    I +  E +   + +  
Sbjct: 277 ELEEASEELRAYLDE-LEFDPNRLEEVEERLFALKSLARKYGVTIEDLLEYLDKIKEELA 335

Query: 222 QLQG---EMTRLTQENESAKEEVKEVLQAL 248
           QL      +  L +E +  K E+ E  +AL
Sbjct: 336 QLDNSEESLEALEKEVKKLKAELLEAAEAL 365



 Score = 27.9 bits (63), Expect = 7.8
 Identities = 18/92 (19%), Positives = 40/92 (43%), Gaps = 11/92 (11%)

Query: 164 GSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKE----QIMEQEELITTARRD 219
             L    ++  +E  +  ++Q   + E++ ++ +   +  +    Q+ E EEL     + 
Sbjct: 149 AGLEELAQEAYQEAYQA-WKQARRELEDLQEKERERAQRADLLQFQLEELEEL---NLQP 204

Query: 220 --YEQLQGEMTRLTQENESAKEEVKEVLQALE 249
              E+L+ E  RL   +E   E ++  L+ L 
Sbjct: 205 GEDEELEEERKRL-SNSEKLAEAIQNALELLS 235


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 32.7 bits (75), Expect = 0.25
 Identities = 41/181 (22%), Positives = 67/181 (37%), Gaps = 25/181 (13%)

Query: 92  RYEKEKEKSSRYKGK--------------LEKLELELSRWRNGETVKEDEQVNLADPVDM 137
           RYE+++E++   + +              LE LE E+   R      E E+  LA+ V  
Sbjct: 224 RYEEQREQARETRDEADEVLEEHEERREELETLEAEIEDLRETIAETEREREELAEEVRD 283

Query: 138 AASIAPTPESAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQ 197
                   E     +L     +   L     E    +E  RE L  + +E  + + +   
Sbjct: 284 LRERLEELEEERDDLL-----AEAGLDDADAE---AVEARREELEDRDEELRDRLEECRV 335

Query: 198 YAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEV---KEVLQALEEPTRT 254
            A+   E+     E         E+L+ E   L  E E A+E V   +E ++ LEE    
Sbjct: 336 AAQAHNEEAESLREDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEE 395

Query: 255 L 255
           L
Sbjct: 396 L 396



 Score = 31.9 bits (73), Expect = 0.38
 Identities = 28/109 (25%), Positives = 51/109 (46%), Gaps = 10/109 (9%)

Query: 163 LGSLSNEERQKLEEERERLYQQLDEKD----EEINQQSQYAEKLKEQIMEQEELITTARR 218
           L  L++E R++L E+RER  +   E D    EE  +  + AE+  EQ+ E+ + +   R 
Sbjct: 618 LAELNDERRERLAEKRERKRELEAEFDEARIEEAREDKERAEEYLEQVEEKLDELREERD 677

Query: 219 DYEQLQGEMTRLTQENESAKEEVKEV------LQALEEPTRTLLNFFRQ 261
           D +   G +    +E E  +E  + +      L+AL +    L + +  
Sbjct: 678 DLQAEIGAVENELEELEELRERREALENRVEALEALYDEAEELESMYGD 726



 Score = 31.9 bits (73), Expect = 0.38
 Identities = 25/81 (30%), Positives = 42/81 (51%), Gaps = 7/81 (8%)

Query: 170 ERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTR 229
             ++LEE RE L + + E+ E I ++ + AE+L+E+  E E        + E+ +     
Sbjct: 510 RIERLEERREDLEELIAERRETIEEKRERAEELRERAAELEA-------EAEEKREAAAE 562

Query: 230 LTQENESAKEEVKEVLQALEE 250
             +E E A+EEV E+   L E
Sbjct: 563 AEEEAEEAREEVAELNSKLAE 583



 Score = 31.9 bits (73), Expect = 0.45
 Identities = 21/82 (25%), Positives = 37/82 (45%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
           E  ++L EE   L  +L+E  E +  + +  E+L+E+I E  E    A  D    +  + 
Sbjct: 356 ERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAEDFLE 415

Query: 229 RLTQENESAKEEVKEVLQALEE 250
            L +E +  +E   E+   L  
Sbjct: 416 ELREERDELREREAELEATLRT 437



 Score = 31.2 bits (71), Expect = 0.74
 Identities = 25/92 (27%), Positives = 42/92 (45%), Gaps = 11/92 (11%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAE------KLKEQIMEQEELITTAR----R 218
           E  ++LE E E L ++++E +E + +     E      +L+E+  + EELI   R     
Sbjct: 475 ERVEELEAELEDLEEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAERRETIEE 534

Query: 219 DYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
             E+ +    R   E E+  EE +E     EE
Sbjct: 535 KRERAEELRER-AAELEAEAEEKREAAAEAEE 565



 Score = 30.4 bits (69), Expect = 1.3
 Identities = 25/84 (29%), Positives = 43/84 (51%), Gaps = 3/84 (3%)

Query: 169 EERQKLEEERERLYQQLDEKD---EEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQG 225
           EE ++ EE+RE+  +  DE D   EE  ++ +  E L+ +I +  E I    R+ E+L  
Sbjct: 220 EEIERYEEQREQARETRDEADEVLEEHEERREELETLEAEIEDLRETIAETEREREELAE 279

Query: 226 EMTRLTQENESAKEEVKEVLQALE 249
           E+  L +  E  +EE  ++L    
Sbjct: 280 EVRDLRERLEELEEERDDLLAEAG 303



 Score = 29.6 bits (67), Expect = 2.2
 Identities = 23/87 (26%), Positives = 38/87 (43%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
           E+   LEE  E L ++  E + E+ +  +  E  +E+I E EE I   R  +     ++ 
Sbjct: 349 EDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLG 408

Query: 229 RLTQENESAKEEVKEVLQALEEPTRTL 255
                 E  +EE  E+ +   E   TL
Sbjct: 409 NAEDFLEELREERDELREREAELEATL 435


>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 429

 Score = 32.6 bits (75), Expect = 0.26
 Identities = 15/76 (19%), Positives = 38/76 (50%), Gaps = 3/76 (3%)

Query: 173 KLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQ 232
           +L+EER +L ++L+E   E N+ S+   ++   +   E+       + ++L+ ++  L  
Sbjct: 33  ELDEERRKLLRELEELQAERNELSK---EIGRALKRGEDDAEELIAEVKELKEKLKELEA 89

Query: 233 ENESAKEEVKEVLQAL 248
             +  + E+  +L  +
Sbjct: 90  ALDELEAELDTLLLTI 105


>gnl|CDD|144876 pfam01442, Apolipoprotein, Apolipoprotein A1/A4/E domain.  These
           proteins contain several 22 residue repeats which form a
           pair of alpha helices. This family includes:
           Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein
           E.
          Length = 191

 Score = 31.4 bits (72), Expect = 0.33
 Identities = 23/101 (22%), Positives = 47/101 (46%), Gaps = 14/101 (13%)

Query: 163 LGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQ-YAEKLKEQIMEQ------------ 209
           LG  + E   +L +E E L ++L +  E++  + Q Y ++LK ++ +             
Sbjct: 21  LGPYAQEFWARLSKETEALREELQKDLEDVRARLQPYLDELKAKVGQNLEELRQRLAPYA 80

Query: 210 EELITTARRDYEQLQGEMTRLTQE-NESAKEEVKEVLQALE 249
           EEL     RD E+L+ ++    +E  +  ++ V+ +   L 
Sbjct: 81  EELRKRLNRDAEELRRKLAPYAEELRDRLRQNVEALRARLG 121



 Score = 31.4 bits (72), Expect = 0.35
 Identities = 27/125 (21%), Positives = 48/125 (38%), Gaps = 28/125 (22%)

Query: 166 LSNEERQKLEEERERLYQQLDEKDEEINQQSQ--------YAEKLKEQIMEQ-------- 209
           L  E ++ LE+ R RL   LDE   ++ Q  +        YAE+L++++           
Sbjct: 39  LREELQKDLEDVRARLQPYLDELKAKVGQNLEELRQRLAPYAEELRKRLNRDAEELRRKL 98

Query: 210 ----EELITTARRDYEQLQGEM--------TRLTQENESAKEEVKEVLQALEEPTRTLLN 257
               EEL    R++ E L+  +         +L +  E  KE V    +  +      + 
Sbjct: 99  APYAEELRDRLRQNVEALRARLGPYVEELRQKLAERLEELKESVGPYAEEYKAQLSEQVE 158

Query: 258 FFRQS 262
             R+ 
Sbjct: 159 ELREK 163


>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
          Length = 343

 Score = 31.9 bits (73), Expect = 0.33
 Identities = 17/88 (19%), Positives = 35/88 (39%), Gaps = 5/88 (5%)

Query: 163 LGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQ 222
           + +L        E ER RL   L E           A +L +++  ++++   A    E 
Sbjct: 83  VANLRASLSAA-EAERSRLQALLAELAGAGAAAEGRAGELAQELDSEKQVSARALAQVEL 141

Query: 223 LQGEMTRLTQENESAKEEVKEVLQALEE 250
           L  ++  L ++  +    ++  L A E+
Sbjct: 142 LNQQIAALRRQLAA----LEAALDASEK 165


>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 157

 Score = 31.1 bits (71), Expect = 0.33
 Identities = 13/84 (15%), Positives = 44/84 (52%), Gaps = 3/84 (3%)

Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQ--SQYAEKLKEQIMEQEELITTARRDYEQLQG 225
            +E +KL+ E ++  ++L ++++++ +Q  +   E  K +  E ++     ++  +  Q 
Sbjct: 41  EKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQKQQELQQKQQAAQQ 100

Query: 226 EMTRLTQE-NESAKEEVKEVLQAL 248
           E+ +  QE  +   +++ + ++ +
Sbjct: 101 ELQQKQQELLQPIYDKIDKAIKEV 124


>gnl|CDD|222290 pfam13654, AAA_32, AAA domain.  This family includes a wide variety
           of AAA domains including some that have lost essential
           nucleotide binding residues in the P-loop.
          Length = 509

 Score = 32.1 bits (74), Expect = 0.35
 Identities = 22/104 (21%), Positives = 46/104 (44%), Gaps = 28/104 (26%)

Query: 168 NEERQKLEEERERLYQQLDE---------------------KDEEINQQSQYAEKLKEQI 206
            E  ++ +E+RE  +++L+E                     KD E   + ++ E L E+ 
Sbjct: 118 EEIEEEFQEKREEAFEELEEEAKEKGFALVRTPGGFVFAPLKDGEPLTEEEF-EALPEE- 175

Query: 207 MEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
            E+EEL        ++L+ E+  + ++    + E +E L+ L+ 
Sbjct: 176 -EREEL----EEKIDELEEELQEILRQLRELEREAREKLRELDR 214



 Score = 29.7 bits (68), Expect = 2.0
 Identities = 21/92 (22%), Positives = 41/92 (44%), Gaps = 21/92 (22%)

Query: 165 SLSNEERQKLEEERERLYQQLDEKDEEINQ-QSQYAEKLKE---QIMEQ--EELITTARR 218
           +L  EER++LEE+ + L ++L E   ++ + + +  EKL+E   ++       L+   + 
Sbjct: 171 ALPEEEREELEEKIDELEEELQEILRQLRELEREAREKLRELDREVALFAVGPLLDELKE 230

Query: 219 DYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
            Y  L                +V   L A++E
Sbjct: 231 KYADL---------------PKVLAYLDAVQE 247


>gnl|CDD|234780 PRK00488, pheS, phenylalanyl-tRNA synthetase subunit alpha;
           Validated.
          Length = 339

 Score = 31.6 bits (73), Expect = 0.37
 Identities = 14/48 (29%), Positives = 22/48 (45%), Gaps = 4/48 (8%)

Query: 163 LGSLSNEER----QKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQI 206
           LG L  EER      + E ++ +   L+E+ EE+   +  A    E I
Sbjct: 46  LGKLPPEERKEAGALINELKQAIEAALEERKEELEAAALNARLAAETI 93


>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
           HlyD family.  Type I secretion is an ABC transport
           process that exports proteins, without cleavage of any
           signal sequence, from the cytosol to extracellular
           medium across both inner and outer membranes. The
           secretion signal is found in the C-terminus of the
           transported protein. This model represents the adaptor
           protein between the ATP-binding cassette (ABC) protein
           of the inner membrane and the outer membrane protein,
           and is called the membrane fusion protein. This model
           selects a subfamily closely related to HlyD; it is
           defined narrowly and excludes, for example, colicin V
           secretion protein CvaA and multidrug efflux proteins
           [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 423

 Score = 31.9 bits (73), Expect = 0.39
 Identities = 30/161 (18%), Positives = 58/161 (36%), Gaps = 25/161 (15%)

Query: 107 LEKLELELSRWRNGETVKEDEQVNLADPVDMAASIAPTPESAPASILPAIPGSGLMLGSL 166
             +LE ++ R          E  + A            P+   ++  PA+P       SL
Sbjct: 83  AAELESQVLRLEAEVARLRAEADSQAAIEF--------PDDLLSAEDPAVPELIKGQQSL 134

Query: 167 SNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQI-MEQEELI------------ 213
               +  L  + E +  Q+ + + E+       + L++Q+ +  EEL             
Sbjct: 135 FESRKSTLRAQLELILAQIKQLEAELAGLQAQLQALRQQLEVISEELEARRKLKEKGLVS 194

Query: 214 ----TTARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
                   R+  + QGE+ RL  E E  K ++ E+    ++
Sbjct: 195 RLELLELERERAEAQGELGRLEAELEVLKRQIDELQLERQQ 235


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 32.0 bits (73), Expect = 0.39
 Identities = 16/82 (19%), Positives = 37/82 (45%)

Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
              R ++  E+  L   L E+  +  + +Q  E+ K+ + +    ++  ++  E+L+   
Sbjct: 174 AAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEELRANE 233

Query: 228 TRLTQENESAKEEVKEVLQALE 249
           +RL  E  SA+    +  +A  
Sbjct: 234 SRLKNEIASAEAAAAKAREAAA 255



 Score = 29.7 bits (67), Expect = 1.9
 Identities = 11/84 (13%), Positives = 36/84 (42%)

Query: 172 QKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLT 231
              +++ +++ +++   +++I +Q     KL++Q+   E  I +      +   ++ +L 
Sbjct: 34  AADDKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLR 93

Query: 232 QENESAKEEVKEVLQALEEPTRTL 255
           ++       +  +     E  R L
Sbjct: 94  KQIADLNARLNALEVQEREQRRRL 117


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 31.8 bits (72), Expect = 0.39
 Identities = 22/89 (24%), Positives = 39/89 (43%), Gaps = 2/89 (2%)

Query: 167 SNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDY--EQLQ 224
           + E + K   E+ERL Q   E+ +   QQ Q  E  K+  +EQ++    AR+    ++ +
Sbjct: 89  AEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKK 148

Query: 225 GEMTRLTQENESAKEEVKEVLQALEEPTR 253
            E  +     E+AK +     +   E   
Sbjct: 149 AEAAKAKAAAEAAKLKAAAEAKKKAEEAA 177


>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein.  This domain is found
           in cell division proteins which are required for
           kinetochore-spindle association.
          Length = 312

 Score = 31.5 bits (72), Expect = 0.43
 Identities = 25/104 (24%), Positives = 45/104 (43%), Gaps = 9/104 (8%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
           E+ +KL +E     ++L+E +EE+ +     E L  +  E    I  A +  EQ +G  T
Sbjct: 211 EKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIAEAEKKLEQCRG-FT 269

Query: 229 RLTQENESAKEEVKEVLQALEEPTRTLLNFFRQSSPKGTLLYNH 272
                     E++KE L+ L+    T     + S    ++ Y+ 
Sbjct: 270 F------KEIEKLKEQLKLLQS--LTGWKITKLSGNTLSMTYDR 305



 Score = 30.4 bits (69), Expect = 0.99
 Identities = 19/100 (19%), Positives = 41/100 (41%), Gaps = 11/100 (11%)

Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAE-----------KLKEQIMEQEELITTA 216
            +E + L   + +L  + D  +EE+ Q  Q  +           + KE++ +  + I   
Sbjct: 164 MKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDCDPTELDRAKEKLKKLLQEIMIK 223

Query: 217 RRDYEQLQGEMTRLTQENESAKEEVKEVLQALEEPTRTLL 256
            +  E+L+ E+  L  + E    +  E+   + E  + L 
Sbjct: 224 VKKLEELEEELQELESKIEDLTNKKSELNTEIAEAEKKLE 263


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 31.9 bits (73), Expect = 0.44
 Identities = 28/90 (31%), Positives = 42/90 (46%), Gaps = 14/90 (15%)

Query: 170 ERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT- 228
           ERQK +EE E L ++LD+  +++    +  +   +  M  EE       +Y+Q   E+T 
Sbjct: 192 ERQKAKEEDEDLREELDDDFKDLMSLLRTVKPPPKPPMTPEEK----DDEYDQRVRELTF 247

Query: 229 --------RLTQENESAKEEVKEVLQALEE 250
                   R   E E AKEE  E L+ LE 
Sbjct: 248 DRRAQPTDRTKTEEELAKEE-AERLKKLEA 276


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 31.5 bits (72), Expect = 0.48
 Identities = 8/40 (20%), Positives = 23/40 (57%)

Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIM 207
            E+ ++LEE  + L ++  E + ++ +  +  E L++++ 
Sbjct: 327 REKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLEKRLE 366


>gnl|CDD|220484 pfam09943, DUF2175, Uncharacterized protein conserved in archaea
           (DUF2175).  This domain, found in various hypothetical
           archaeal proteins, has no known function.
          Length = 101

 Score = 29.7 bits (67), Expect = 0.48
 Identities = 17/71 (23%), Positives = 38/71 (53%), Gaps = 4/71 (5%)

Query: 166 LSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEEL---ITTARRDYEQ 222
           L  +  +K++ ++      L   ++ +++    A++L EQ+ E EE+   I   R+D E+
Sbjct: 31  LVEDASKKIKGDKPEDLVALLRLEDYLHEGIVLAKEL-EQVAESEEIKKVIRQVRKDLEK 89

Query: 223 LQGEMTRLTQE 233
           L  ++TR  ++
Sbjct: 90  LAAKLTRKIEK 100


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 31.6 bits (72), Expect = 0.49
 Identities = 20/96 (20%), Positives = 51/96 (53%), Gaps = 7/96 (7%)

Query: 166 LSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQG 225
           L++E  +KLE+E ++L  +L+++++E+ +     EK  E++ ++ EL+       + ++ 
Sbjct: 283 LASELEKKLEKELKKLENKLEKQEDELEE----LEKAAEELRQKGELLYAN---LQLIEE 335

Query: 226 EMTRLTQENESAKEEVKEVLQALEEPTRTLLNFFRQ 261
            +  +   +    EE+K  L   + P+     +F++
Sbjct: 336 GLKSVRLADFYGNEEIKIELDKSKTPSENAQRYFKK 371



 Score = 27.7 bits (62), Expect = 8.2
 Identities = 28/172 (16%), Positives = 57/172 (33%), Gaps = 27/172 (15%)

Query: 74  IKNVVTVNEELTAEEWKRRYEKEKEKSSRYKGKLEKLELELSRWRNGETVKEDEQVNLAD 133
           ++  +   E    E+ +   E+ ++ +   + K E L   L     G        V LAD
Sbjct: 291 LEKELKKLEN-KLEKQEDELEELEKAAEELRQKGELLYANLQLIEEGLK-----SVRLAD 344

Query: 134 PVDMAASIAPTPESAPASILPAIPGSGLMLGSLSNEER-----QKLEEERERLYQQLDEK 188
                       E     +  +   S        N +R     +KL+  +  L +QL E 
Sbjct: 345 FYGN--------EEIKIELDKSKTPS-------ENAQRYFKKYKKLKGAKVNLDRQLSEL 389

Query: 189 DEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEE 240
            E I      A+   E+   ++ +        E+   +  +  ++ +   E+
Sbjct: 390 KEAIAYYES-AKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEK 440


>gnl|CDD|221533 pfam12329, TMF_DNA_bd, TATA element modulatory factor 1 DNA
           binding.  This is the middle region of a family of TATA
           element modulatory factor 1 proteins conserved in
           eukaryotes that contains at its N-terminal section a
           number of leucine zippers that could potentially form
           coiled coil structures. The whole proteins bind to the
           TATA element of some RNA polymerase II promoters and
           repress their activity. by competing with the binding of
           TATA binding protein. TMFs are evolutionarily conserved
           golgins that bind Rab6, a ubiquitous ras-like
           GTP-binding Golgi protein, and contribute to Golgi
           organisation in animal and plant cells.
          Length = 74

 Score = 29.2 bits (66), Expect = 0.50
 Identities = 18/68 (26%), Positives = 39/68 (57%)

Query: 183 QQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVK 242
           ++L EKDE+I Q  +  EKL ++ ++    I   R   ++L+ E+  L ++ E  ++E++
Sbjct: 5   KKLAEKDEQIAQLMEEGEKLSKKELKHNNTIKKLRAKNKELEKEIAELKKKLEKLEKELE 64

Query: 243 EVLQALEE 250
            + + L+ 
Sbjct: 65  NLEERLKR 72


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 31.5 bits (72), Expect = 0.51
 Identities = 20/80 (25%), Positives = 36/80 (45%), Gaps = 3/80 (3%)

Query: 162 MLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQ--YAEKLKEQIMEQEELITTARRD 219
           +L  L  E   +LE +   L +QL  + E   ++ +  Y EKL++++  Q E      ++
Sbjct: 201 LLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEEKLRQELERQAEAHEQKLKN 260

Query: 220 YEQLQG-EMTRLTQENESAK 238
              LQ  E+ R   +    K
Sbjct: 261 ELALQAIELQREFNKEIKEK 280


>gnl|CDD|226883 COG4477, EzrA, Negative regulator of septation ring formation [Cell
           division and chromosome partitioning].
          Length = 570

 Score = 31.6 bits (72), Expect = 0.51
 Identities = 23/85 (27%), Positives = 43/85 (50%)

Query: 172 QKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLT 231
           +K E+E + L   LDE  E I  Q     +L++ + E E+ +T    + E++Q  +T L 
Sbjct: 350 RKFEKELKELESVLDEILENIEAQEVAYSELQDNLEEIEKALTDIEDEQEKVQEHLTSLR 409

Query: 232 QENESAKEEVKEVLQALEEPTRTLL 256
           ++   A+E ++ +   L E  R + 
Sbjct: 410 KDELEARENLERLKSKLHEIKRYME 434


>gnl|CDD|237256 PRK12901, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 1112

 Score = 31.5 bits (72), Expect = 0.55
 Identities = 18/55 (32%), Positives = 32/55 (58%), Gaps = 5/55 (9%)

Query: 176 EERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELIT--TARRDYEQLQGEMT 228
           +ERE +Y Q+D+ ++E  +     EK+ ++I+ +   I   TARR  E  + E+T
Sbjct: 78  DEREDIYAQIDKLEKEAYEIL---EKVLDEILPEAFAIVKETARRFAENEEIEVT 129


>gnl|CDD|109486 pfam00430, ATP-synt_B, ATP synthase B/B' CF(0).  Part of the CF(0)
           (base unit) of the ATP synthase. The base unit is
           thought to translocate protons through membrane (inner
           membrane in mitochondria, thylakoid membrane in plants,
           cytoplasmic membrane in bacteria). The B subunits are
           thought to interact with the stalk of the CF(1)
           subunits. This domain should not be confused with the ab
           CF(1) proteins (in the head of the ATP synthase) which
           are found in pfam00006.
          Length = 132

 Score = 30.3 bits (69), Expect = 0.58
 Identities = 25/84 (29%), Positives = 40/84 (47%), Gaps = 7/84 (8%)

Query: 166 LSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQG 225
              +    L E  ++L Q   E  E IN   + A+KLKE+I      +  A++D E+L  
Sbjct: 45  RLKQAAALLAEAEQQLAQARAEASEIINNAKKEAQKLKEEI------LAEAQKDAERLLE 98

Query: 226 E-MTRLTQENESAKEEVKEVLQAL 248
                + QE E A  E+++ + AL
Sbjct: 99  SARAEIEQEKEQALAELRQQVAAL 122


>gnl|CDD|226396 COG3879, COG3879, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 247

 Score = 30.8 bits (70), Expect = 0.66
 Identities = 11/62 (17%), Positives = 26/62 (41%)

Query: 194 QQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEEPTR 253
           Q S+     + + ++  + + + ++    L  E+  L  + +S +  V     ALE+   
Sbjct: 40  QTSKGESVRRARDLDLVKELRSLQKKVNTLAAEVEDLENKLDSVRRSVLTDDAALEDRLE 99

Query: 254 TL 255
            L
Sbjct: 100 KL 101


>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
           interacts with SMC proteins [Cell division and
           chromosome partitioning].
          Length = 622

 Score = 31.1 bits (70), Expect = 0.73
 Identities = 21/87 (24%), Positives = 45/87 (51%), Gaps = 4/87 (4%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
            E +  EEE + L   +DE  +++ +Q    E+ +    E+E+L     R+ +++  +  
Sbjct: 337 SEIELKEEEIKALQSNIDELHKQLRKQGISTEQFELMNQEREKLT----RELDKINIQSD 392

Query: 229 RLTQENESAKEEVKEVLQALEEPTRTL 255
           +LT+  +S K E + + ++LE+  R  
Sbjct: 393 KLTKSVKSRKLEAQGIFKSLEKTLRQY 419


>gnl|CDD|153280 cd07596, BAR_SNX, The Bin/Amphiphysin/Rvs (BAR) domain of Sorting
           Nexins.  BAR domains are dimerization, lipid binding and
           curvature sensing modules found in many different
           proteins with diverse functions. Sorting nexins (SNXs)
           are Phox homology (PX) domain containing proteins that
           are involved in regulating membrane traffic and protein
           sorting in the endosomal system. SNXs differ from each
           other in their lipid-binding specificity, subcellular
           localization and specific function in the endocytic
           pathway. A subset of SNXs also contain BAR domains. The
           PX-BAR structural unit determines the specific membrane
           targeting of SNXs. BAR domains form dimers that bind to
           membranes, induce membrane bending and curvature, and
           may also be involved in protein-protein interactions.
          Length = 218

 Score = 30.4 bits (69), Expect = 0.73
 Identities = 13/75 (17%), Positives = 28/75 (37%)

Query: 176 EERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENE 235
           ++R      L    +++  +    EKLK     +   +     + E+ +  +    +  E
Sbjct: 110 DDRADALLTLQSLKKDLASKKAQLEKLKAAPGIKPAKVEELEEELEEAESALEEARKRYE 169

Query: 236 SAKEEVKEVLQALEE 250
              E +KE L+   E
Sbjct: 170 EISERLKEELKRFHE 184


>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812).  This
           family consists of several eukaryotic proteins of
           unknown function.
          Length = 536

 Score = 30.9 bits (70), Expect = 0.75
 Identities = 17/63 (26%), Positives = 30/63 (47%)

Query: 171 RQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRL 230
           R  L +E   L ++   K++E  +Q    +KL+ +I E E  + T  + Y+QL  E    
Sbjct: 363 RTPLIDEYRSLKEKNRNKEDETQRQLDEIKKLRNKIEELESELQTKEQLYKQLLDEYENA 422

Query: 231 TQE 233
            + 
Sbjct: 423 PKS 425



 Score = 29.0 bits (65), Expect = 3.8
 Identities = 13/61 (21%), Positives = 32/61 (52%)

Query: 183 QQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVK 242
            ++D +++   ++    E L+EQI E E  I     + + L  ++ ++ +E  + ++E +
Sbjct: 256 AEMDTEEDRTKEREAELEALQEQIDELESSIEEVLSEIKALASKIKQVNEELTTVRQENE 315

Query: 243 E 243
           E
Sbjct: 316 E 316



 Score = 27.4 bits (61), Expect = 9.3
 Identities = 24/104 (23%), Positives = 41/104 (39%), Gaps = 13/104 (12%)

Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
             E    E+  +    +L+   E+I++     E++  +I      I       +Q+  E+
Sbjct: 255 PAEMDTEEDRTKEREAELEALQEQIDELESSIEEVLSEIKALASKI-------KQVNEEL 307

Query: 228 TRLTQENESAKEEVK------EVLQALEEPTRTLLNFFRQSSPK 265
           T + QENE  +EE K      E+L   E     L      SS +
Sbjct: 308 TTVRQENEELEEEYKIKKRTVELLPDAENNVAKLQALVVASSER 351


>gnl|CDD|219186 pfam06810, Phage_GP20, Phage minor structural protein GP20.  This
           family consists of several phage minor structural
           protein GP20 sequences of around 180 residues in length.
           The function of this family is unknown.
          Length = 156

 Score = 30.0 bits (68), Expect = 0.79
 Identities = 23/68 (33%), Positives = 39/68 (57%), Gaps = 18/68 (26%)

Query: 173 KLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQ 232
           K  EER+ L +QL  +D++I       E LK+++           +D E+LQ ++ +L Q
Sbjct: 24  KANEERDSLKKQLKNRDKQI-------EDLKKKV-----------KDNEELQKKIEKLKQ 65

Query: 233 ENESAKEE 240
           +N++AKEE
Sbjct: 66  QNKTAKEE 73


>gnl|CDD|240376 PTZ00350, PTZ00350, adenylosuccinate synthetase; Provisional.
          Length = 436

 Score = 30.8 bits (70), Expect = 0.83
 Identities = 19/67 (28%), Positives = 37/67 (55%), Gaps = 6/67 (8%)

Query: 159 SGLMLGSLSNEE------RQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEEL 212
           +GL +G L N E      R+ +E+ +E+   +  + +EE+ +   YAEKLK+ I++    
Sbjct: 157 TGLRVGDLLNFETFEKKYRKLVEKLQEQYNIEEYDAEEELERYKGYAEKLKDMIVDTVYF 216

Query: 213 ITTARRD 219
           +  A ++
Sbjct: 217 MNKAIKE 223


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 31.3 bits (70), Expect = 0.84
 Identities = 38/184 (20%), Positives = 79/184 (42%), Gaps = 22/184 (11%)

Query: 69   RRAKTIKNVVTVNEELTAEEWKRRYEKEKEKSSRYKGKLEKLELELSRWRNGETVKEDEQ 128
             R + +  +    +++ AEE K+  E + +     K + EK ++E  + +  E  K+ E+
Sbjct: 1592 ARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEE 1651

Query: 129  VNLADPVDM--AASIAPTPESAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLD 186
            +  A+  +   AA  A   E                      EE +K EE+ ++  + L 
Sbjct: 1652 LKKAEEENKIKAAEEAKKAEEDKKKA----------------EEAKKAEEDEKKAAEALK 1695

Query: 187  EKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQ 246
            ++ EE  +  +  +K  E+  + EEL    ++  E+ + +     +E E  K++ +E  +
Sbjct: 1696 KEAEEAKKAEELKKKEAEEKKKAEEL----KKAEEENKIKAEEAKKEAEEDKKKAEEAKK 1751

Query: 247  ALEE 250
              EE
Sbjct: 1752 DEEE 1755


>gnl|CDD|221101 pfam11382, DUF3186, Protein of unknown function (DUF3186).  This
           bacterial family of proteins has no known function.
          Length = 307

 Score = 30.8 bits (70), Expect = 0.85
 Identities = 39/156 (25%), Positives = 56/156 (35%), Gaps = 36/156 (23%)

Query: 158 GSGL----MLGSLSNE------ERQKLEEERERLYQQLDEKDEEINQ--QSQYAEKLKE- 204
           GSGL    +L  L +E      E  +L  ERE L +Q    D+ I        A +L   
Sbjct: 25  GSGLLQENLLSGLEDEFSDLRTENDRLRAEREALNEQASAADQFIAALAPRLVAGRLTGK 84

Query: 205 QIMEQEELITTARRDY-------EQLQ-------GEMTRLTQE--NESAKEEVKEVLQAL 248
            +     L+ T   D        E L        G +T LT    +  + +++K V+   
Sbjct: 85  SVA----LVRTPDADDDDVDAVSEALTQAGATVVGRIT-LTDNFVDPESADKLKSVVNND 139

Query: 249 EEPTRTLLNF--FRQSSPKGTLLYNHYLLGWPNQSE 282
            +P    L+       S  G LL    LL  P  S 
Sbjct: 140 LKPAGDQLSTAAADPGSQAGDLLGAALLLNPPTAST 175


>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein.  Rotavirus particles
           consist of three concentric proteinaceous capsid layers.
           The innermost capsid (core) is made of VP2. The genomic
           RNA and the two minor proteins VP1 and VP3 are
           encapsidated within this layer. The N-terminus of
           rotavirus VP2 is necessary for the encapsidation of VP1
           and VP3.
          Length = 887

 Score = 31.1 bits (70), Expect = 0.86
 Identities = 16/80 (20%), Positives = 34/80 (42%), Gaps = 2/80 (2%)

Query: 177 ERERLYQQLDEKDEEINQQSQYAEK--LKEQIMEQEELITTARRDYEQLQGEMTRLTQEN 234
            RE      D   E+ +++     +  LKE++++++E + T   D    +       +  
Sbjct: 6   RREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKIA 65

Query: 235 ESAKEEVKEVLQALEEPTRT 254
           +  K+  KE  + L E  +T
Sbjct: 66  DEVKKSTKEESKQLLEVLKT 85



 Score = 28.4 bits (63), Expect = 6.1
 Identities = 17/103 (16%), Positives = 43/103 (41%), Gaps = 1/103 (0%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
           ++++++  +      +     E +    +  +  KE+  +  E++ T     +++Q E+ 
Sbjct: 39  DKKEEVVTDNVDSPVKEQSSQENLKIADEVKKSTKEESKQLLEVLKTKEEHQKEIQYEIL 98

Query: 229 RLTQENESAKEEV-KEVLQALEEPTRTLLNFFRQSSPKGTLLY 270
           + T  +   KE + K++     E  +     FR   PK   +Y
Sbjct: 99  QKTIPSFEPKESILKKLEDIKPEQAKKQTKLFRLFEPKQLPIY 141


>gnl|CDD|221489 pfam12252, SidE, Dot/Icm substrate protein.  This family of proteins
            is found in bacteria. Proteins in this family are
            typically between 397 and 1543 amino acids in length.
            This family is the SidE protein in the Dot/Icm pathway of
            Legionella pneumophila bacteria. There is little
            literature describing the family.
          Length = 1443

 Score = 31.0 bits (70), Expect = 0.87
 Identities = 19/85 (22%), Positives = 44/85 (51%), Gaps = 6/85 (7%)

Query: 165  SLSNEERQKLEEERERLYQQLDEKDEEIN--QQSQYAEKLKEQIMEQEELITTARRDYEQ 222
            SL+  + +  E + + + Q  +   +      +S  AEKL+E I + ++L+   ++ Y +
Sbjct: 1268 SLNQMKPKTFEMQEKEIQQNFELLAKLEKTLDKSDTAEKLREDIPKLKDLLIAKQKAYPE 1327

Query: 223  LQGEMTRLTQENESAKEEVKEVLQA 247
                M +L  ++E+   +++E+ QA
Sbjct: 1328 ----MVQLQYKSEALITQLRELCQA 1348


>gnl|CDD|153311 cd07627, BAR_Vps5p, The Bin/Amphiphysin/Rvs (BAR) domain of yeast
           Sorting Nexin Vps5p.  BAR domains are dimerization,
           lipid binding and curvature sensing modules found in
           many different proteins with diverse functions. Sorting
           nexins (SNXs) are Phox homology (PX) domain containing
           proteins that are involved in regulating membrane
           traffic and protein sorting in the endosomal system.
           SNXs differ from each other in their lipid-binding
           specificity, subcellular localization and specific
           function in the endocytic pathway. A subset of SNXs also
           contain BAR domains. The PX-BAR structural unit
           determines the specific membrane targeting of SNXs.
           Vsp5p is the yeast counterpart of human SNX1 and is part
           of the retromer complex, which functions in the
           endosome-to-Golgi retrieval of vacuolar protein sorting
           receptor Vps10p, the Golgi-resident membrane protein
           A-ALP, and endopeptidase Kex2. BAR domains form dimers
           that bind to membranes, induce membrane bending and
           curvature, and may also be involved in protein-protein
           interactions.
          Length = 216

 Score = 30.4 bits (69), Expect = 0.91
 Identities = 19/73 (26%), Positives = 37/73 (50%)

Query: 178 RERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESA 237
           R++L+Q     + E++++    EKLK Q   Q+E + +   + E+ +   + L +E E  
Sbjct: 110 RQKLWQYWQSAESELSKKKAQLEKLKRQGKTQQEKLNSLLSELEEAERRASELKKEFEEV 169

Query: 238 KEEVKEVLQALEE 250
            E +K  L+  E 
Sbjct: 170 SELIKSELERFER 182


>gnl|CDD|203371 pfam06005, DUF904, Protein of unknown function (DUF904).  This
           family consists of several bacterial and archaeal
           hypothetical proteins of unknown function.
          Length = 72

 Score = 28.4 bits (64), Expect = 0.95
 Identities = 16/69 (23%), Positives = 35/69 (50%)

Query: 182 YQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEV 241
            + L+E + +I Q  +    L+ +I E +E       + E+L+ E  +L +E  + +  +
Sbjct: 3   LELLEELESKIQQAVETIALLQMEIEELKEENEQLSEENEELEEENKKLKEERNAWQSRL 62

Query: 242 KEVLQALEE 250
           + +L  L+E
Sbjct: 63  RALLGKLDE 71


>gnl|CDD|165311 PHA03011, PHA03011, hypothetical protein; Provisional.
          Length = 120

 Score = 29.1 bits (65), Expect = 1.0
 Identities = 12/39 (30%), Positives = 20/39 (51%)

Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQI 206
            +E   +E E + L   + + D+EI+      +KLKE I
Sbjct: 77  LDEYNLIENEIKDLEIIIQDNDDEIHFLRAEIDKLKENI 115


>gnl|CDD|237616 PRK14127, PRK14127, cell division protein GpsB; Provisional.
          Length = 109

 Score = 28.8 bits (65), Expect = 1.1
 Identities = 13/50 (26%), Positives = 18/50 (36%), Gaps = 1/50 (2%)

Query: 218 RDYEQLQGEMTRLTQENESAKEEVKEVLQALEEPTRTLLNFFRQSSPKGT 267
           +DYE  Q E+  L QEN   K +V E L        +  +          
Sbjct: 37  KDYEAFQKEIEELQQENARLKAQVDE-LTKQVSVGASSSSVATTQPSSSA 85


>gnl|CDD|227570 COG5245, DYN1, Dynein, heavy chain [Cytoskeleton].
          Length = 3164

 Score = 30.7 bits (69), Expect = 1.1
 Identities = 24/103 (23%), Positives = 43/103 (41%), Gaps = 9/103 (8%)

Query: 172  QKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQE-ELITTARRDYEQLQGE-MTR 229
            ++L+     L + +     + N+  +   +LKEQ+      L    +   + L G    R
Sbjct: 2088 EELDCTNVNLVEGV----RKYNEYGRGMGELKEQLSNTVVILGVKEKNADDALSGTPGER 2143

Query: 230  LTQENES-AKEEVKEVLQALEEPTRTLLNFFRQ--SSPKGTLL 269
            L +E +S   E  +++L  LEE  R       +  SS K  +L
Sbjct: 2144 LEREVKSVFVEAPRDMLFLLEEEVRKRKGSVMKFKSSKKPAVL 2186


>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
           Provisional.
          Length = 389

 Score = 30.2 bits (69), Expect = 1.2
 Identities = 13/46 (28%), Positives = 23/46 (50%)

Query: 190 EEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENE 235
           E +    +   +L+EQI + E  +     + E+L+ E+ RL  E E
Sbjct: 1   ERLEALEERNSELEEQIRQLELKLRDLEAENEKLERELERLKSELE 46


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 30.2 bits (68), Expect = 1.3
 Identities = 21/83 (25%), Positives = 35/83 (42%), Gaps = 4/83 (4%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQ---- 224
           E +QK   E+ERL Q   E+     Q+ Q  E  K+  ++Q++    A +     +    
Sbjct: 91  ELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAE 150

Query: 225 GEMTRLTQENESAKEEVKEVLQA 247
            E  R     + A  E K+  +A
Sbjct: 151 AEAKRAAAAAKKAAAEAKKKAEA 173


>gnl|CDD|218006 pfam04286, DUF445, Protein of unknown function (DUF445).  Predicted
           to be a membrane protein.
          Length = 366

 Score = 29.9 bits (68), Expect = 1.4
 Identities = 17/81 (20%), Positives = 34/81 (41%), Gaps = 5/81 (6%)

Query: 171 RQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRL 230
           RQ+ ++   RL   L    E   +     EKLK +++    +       +E L+  +   
Sbjct: 206 RQEFDKALRRLIDDLLNDPELAAE----VEKLKRRLLADPAVQDYLEGLWESLKDALLDD 261

Query: 231 TQENESA-KEEVKEVLQALEE 250
             + +S  ++ + E+L  L E
Sbjct: 262 LADPDSLLRQRISELLAELGE 282


>gnl|CDD|163064 TIGR02894, DNA_bind_RsfA, transcription factor, RsfA family.  In a
           subset of endospore-forming members of the Firmcutes,
           members of this protein family are found, several to a
           genome. Two very strongly conserved sequences regions
           are separated by a highly variable linker region. Much
           of the linker region was excised from the seed alignment
           for this model. A characterized member is the
           prespore-specific transcription RsfA from Bacillus
           subtilis, previously called YwfN, which is controlled by
           sigma factor F and seems to fine-tune expression of some
           genes in the sigma-F regulon. A paralog in Bacillus
           subtilis is designated YlbO [Regulatory functions, DNA
           interactions, Cellular processes, Sporulation and
           germination].
          Length = 161

 Score = 29.3 bits (66), Expect = 1.4
 Identities = 18/63 (28%), Positives = 30/63 (47%), Gaps = 7/63 (11%)

Query: 167 SNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGE 226
           +N   Q L++E ERL  Q +   +   +  +  EKL++++   EE       DY+ L   
Sbjct: 95  TNPSDQALQKENERLKNQNESLQKRNEELEKELEKLRQRLSTIEE-------DYQTLIDI 147

Query: 227 MTR 229
           M R
Sbjct: 148 MDR 150


>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
          Length = 425

 Score = 30.0 bits (69), Expect = 1.4
 Identities = 22/78 (28%), Positives = 37/78 (47%), Gaps = 11/78 (14%)

Query: 173 KLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQ 232
           +L+EER  L  +L+E   E N  S+   + K +  + E LI            E+  L +
Sbjct: 32  ELDEERRELQTELEELQAERNALSKEIGQAKRKGEDAEALIA-----------EVKELKE 80

Query: 233 ENESAKEEVKEVLQALEE 250
           E ++ + E+ E+   LEE
Sbjct: 81  EIKALEAELDELEAELEE 98


>gnl|CDD|183108 PRK11377, PRK11377, dihydroxyacetone kinase subunit M; Provisional.
          Length = 473

 Score = 30.1 bits (68), Expect = 1.4
 Identities = 26/129 (20%), Positives = 49/129 (37%), Gaps = 17/129 (13%)

Query: 134 PVDMAASIAPTPE--SAPASI-LPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDE 190
                 ++ P P   S  A    P +        ++  +    +EEE+ERL Q +D    
Sbjct: 243 EEVAPPTLRPVPSPVSGKAFYYQPVLC-------TVQAKSTLTVEEEQERLRQAIDFTLL 295

Query: 191 EINQQSQYAEKLKEQIMEQEELITTARR----DYEQLQGEMTRLTQENESAKEEVKEVLQ 246
           ++        K +   ++    I +       D E L     RL  E+ +A+   ++VL+
Sbjct: 296 DLM---TLTAKAEASGLDDIAAIFSGHHTLLDDPELLAAASERLQHEHCTAEYAWQQVLK 352

Query: 247 ALEEPTRTL 255
            L +  + L
Sbjct: 353 ELSQQYQQL 361


>gnl|CDD|220719 pfam10368, YkyA, Putative cell-wall binding lipoprotein.  YkyA is a
           family of proteins containing a lipoprotein signal and a
           hydrolase domain. It is similar to cell wall binding
           proteins and might also be recognisable by a host immune
           defence system. It is thus likely to belong to pathways
           important for pathogenicity.
          Length = 205

 Score = 29.6 bits (67), Expect = 1.5
 Identities = 13/88 (14%), Positives = 46/88 (52%), Gaps = 6/88 (6%)

Query: 169 EERQKLEEERERLYQQ-LDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
           ++ ++LE++ + LY Q +    ++     + +++    + ++E+L+   +   E+ + E 
Sbjct: 40  KKLEELEKKEQELYDQIIQLGKDDNEAIKKLSDQALANVDKREKLLKKEKESIEKSEEEF 99

Query: 228 TRLTQ-----ENESAKEEVKEVLQALEE 250
               +     E++  K++ K++++ ++E
Sbjct: 100 KSAKKYIEKIEDKKLKKKAKQLVKVMKE 127


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 29.8 bits (67), Expect = 1.5
 Identities = 25/104 (24%), Positives = 41/104 (39%), Gaps = 12/104 (11%)

Query: 179 ERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAK 238
           ERL  +L + ++E +   QY  KL+ Q             +YE L  E+  L +E E   
Sbjct: 32  ERLDSELRDAEKERDTYKQYLSKLESQ--------NVEISNYEALDSELDELKKEEERLL 83

Query: 239 EEVKEVLQALEEPTRTLLNFFRQSSPKGTLLYNHYLLGWPNQSE 282
           +E++E    LE+    L     +   +   L N  L      + 
Sbjct: 84  DELEE----LEKEDDDLDGELVELQEEKEQLENEELQYLREYNL 123


>gnl|CDD|192773 pfam11559, ADIP, Afadin- and alpha -actinin-Binding.  This family
           is found in mammals where it is localised at cell-cell
           adherens junctions, and in Sch. pombe and other fungi
           where it anchors spindle-pole bodies to spindle
           microtubules. It is a coiled-coil structure, and in
           pombe, it is required for anchoring the minus end of
           spindle microtubules to the centrosome equivalent, the
           spindle-pole body. The name ADIP derives from the family
           being composed of Afadin- and alpha -Actinin-Binding
           Proteins Localised at Cell-Cell Adherens Junctions.
          Length = 149

 Score = 29.2 bits (66), Expect = 1.5
 Identities = 21/71 (29%), Positives = 37/71 (52%)

Query: 171 RQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRL 230
           R +  E RE L + L + + EI +     E+LK Q+ + E  +   +    QL+ ++  L
Sbjct: 45  RDRDLEFRESLEETLRKLEAEIERLQNTIERLKTQLEDLERELALLQAKERQLEKKLKTL 104

Query: 231 TQENESAKEEV 241
            Q+ ++ KEEV
Sbjct: 105 EQKLKNEKEEV 115


>gnl|CDD|202101 pfam02050, FliJ, Flagellar FliJ protein. 
          Length = 122

 Score = 28.8 bits (65), Expect = 1.6
 Identities = 17/83 (20%), Positives = 33/83 (39%), Gaps = 15/83 (18%)

Query: 171 RQKLEEERERLYQQLDEKDEEINQQS---------------QYAEKLKEQIMEQEELITT 215
           +Q+LE+  E+L +    + E   Q S                +   L E I +Q++ +  
Sbjct: 11  QQELEQAEEKLEELQQYRLEYRQQLSGSGQGISAAELRNYQAFISALDEAIAQQQQELEQ 70

Query: 216 ARRDYEQLQGEMTRLTQENESAK 238
           A +  EQ + +     QE +  +
Sbjct: 71  AEKQVEQAREQWQEANQERKKLE 93


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 29.6 bits (67), Expect = 1.8
 Identities = 17/57 (29%), Positives = 29/57 (50%), Gaps = 2/57 (3%)

Query: 70  RAKTIKNVVTVNEELTAEEWKRRYEKEKEKSSRYKGKLEKLELELSRWRNGETVKED 126
           + K        NE+L AE   +  +KEK+K  + K ++E+LE  + +     T KE+
Sbjct: 310 KRKLKSKFERDNEKLDAE--VKEKKKEKKKEEKKKKQIERLEERIEKLEVQATDKEE 364


>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
          Length = 281

 Score = 29.6 bits (67), Expect = 1.8
 Identities = 19/117 (16%), Positives = 55/117 (47%), Gaps = 7/117 (5%)

Query: 141 IAPTPESAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAE 200
                + A   I  A   +  ++ +   E +++L ++ +     +++   +I ++ +  E
Sbjct: 50  REEANDEAKEIIEEAEEDAFEIVEAAEEEAKEELLKKTDEASSIIEKLQMQIEREQEEWE 109

Query: 201 KLKEQIMEQ------EELITTARRD-YEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
           +  E+++E+      EE     R +  E+++  + +L +  E   ++ +E+L++ EE
Sbjct: 110 EELERLIEEAKAEGYEEGYEKGREEGLEEVRELIEQLNKIIEKLIKKREEILESSEE 166


>gnl|CDD|215969 pfam00521, DNA_topoisoIV, DNA gyrase/topoisomerase IV, subunit A. 
          Length = 427

 Score = 29.8 bits (68), Expect = 1.8
 Identities = 11/25 (44%), Positives = 17/25 (68%)

Query: 163 LGSLSNEERQKLEEERERLYQQLDE 187
           L  L+ EE +KLE+E E L +++ E
Sbjct: 386 LRRLTKEEIEKLEKEIEELEKEIAE 410


>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
           (CALCOCO1) like.  Proteins found in this family are
           similar to the coiled-coil transcriptional coactivator
           protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
           This protein binds to a highly conserved N-terminal
           domain of p160 coactivators, such as GRIP1, and thus
           enhances transcriptional activation by a number of
           nuclear receptors. CALCOCO1 has a central coiled-coil
           region with three leucine zipper motifs, which is
           required for its interaction with GRIP1 and may regulate
           the autonomous transcriptional activation activity of
           the C-terminal region.
          Length = 546

 Score = 29.6 bits (66), Expect = 1.9
 Identities = 23/104 (22%), Positives = 47/104 (45%), Gaps = 3/104 (2%)

Query: 169 EERQKLEEERERLYQ---QLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQG 225
            ER  L  +R    Q   +L++  + + Q+ Q  +++ E   + E  +   + + +Q   
Sbjct: 206 AERNALLAQRAENQQRILELEQDIQTLTQKKQENDRVLEGTQDIEAELERMKGELKQRLK 265

Query: 226 EMTRLTQENESAKEEVKEVLQALEEPTRTLLNFFRQSSPKGTLL 269
           +MT   ++ E+ + +++   + L E  RTL      S  K  LL
Sbjct: 266 KMTIQRRDEETERIDLQLENEQLHEDLRTLQERLESSQQKAGLL 309


>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 265

 Score = 29.3 bits (66), Expect = 2.0
 Identities = 14/46 (30%), Positives = 27/46 (58%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELIT 214
            E + L+ + E +  ++DE  +EI+Q     +KL+++I E +E I 
Sbjct: 52  NEIESLDNQIEEIQSKIDELQKEIDQSKAEIKKLQKEIAELKENIV 97


>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
           PFL_4705 family.  Members of this protein family are
           found occasionally on plasmids such as the Pseudomonas
           putida toluene catabolic TOL plasmid pWWO_p085. Usually,
           however, they are found on the bacterial main chromosome
           in regions flanked by markers of conjugative transfer
           and/or transposition [Mobile and extrachromosomal
           element functions, Plasmid functions].
          Length = 472

 Score = 29.6 bits (67), Expect = 2.0
 Identities = 15/72 (20%), Positives = 31/72 (43%), Gaps = 1/72 (1%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQ-YAEKLKEQIMEQEELITTARRDYEQLQGEM 227
            E + L+ E ERL ++    D++I Q  Q   ++L ++I + +      +   +QLQ  +
Sbjct: 80  SENEALKAENERLQKREQSIDQQIQQAVQSETQELTKEIEQLKSERQQLQGLIDQLQRRL 139

Query: 228 TRLTQENESAKE 239
             +         
Sbjct: 140 AGVLTGPSGGGS 151


>gnl|CDD|215505 PLN02935, PLN02935, Bifunctional NADH kinase/NAD(+) kinase.
          Length = 508

 Score = 29.8 bits (67), Expect = 2.0
 Identities = 11/49 (22%), Positives = 19/49 (38%)

Query: 86  AEEWKRRYEKEKEKSSRYKGKLEKLELELSRWRNGETVKEDEQVNLADP 134
           A EWKR+YE E+ ++ + + K            +        Q  L + 
Sbjct: 64  AAEWKRKYELERARNQQLEHKELSSGECNEESNDQRLENLANQPMLYNE 112


>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 140

 Score = 28.7 bits (65), Expect = 2.1
 Identities = 12/85 (14%), Positives = 42/85 (49%), Gaps = 7/85 (8%)

Query: 171 RQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRL 230
           +++LE+E ++   +L++ ++E+ +  +  +K     + +        ++ ++   E  R 
Sbjct: 20  QKQLEKEFKKRQAELEKLEKELQKLKEKLQK-DAATLSEAA-REKKEKELQKKVQEFQRK 77

Query: 231 TQE-----NESAKEEVKEVLQALEE 250
            Q+      +  +EE++++L  + +
Sbjct: 78  QQKLQQDLQKRQQEELQKILDKINK 102


>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD.  This
           model describes the DndB protein encoded by an operon
           associated with a sulfur-containing modification to DNA.
           The operon is sporadically distributed in bacteria, much
           like some restriction enzyme operons. DndD is described
           as a putative ATPase. The small number of examples known
           so far include species from among the Firmicutes,
           Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
           metabolism, Restriction/modification].
          Length = 650

 Score = 29.7 bits (67), Expect = 2.2
 Identities = 28/151 (18%), Positives = 59/151 (39%), Gaps = 15/151 (9%)

Query: 107 LEKLELELSRWRNGETVKEDEQVNLADPVDMAASIAPTPESAPASILPAIPGSGLMLGSL 166
           LE+ + EL        +  +    +A   ++A    P   +  + I   + GS L   + 
Sbjct: 325 LEERDKELLESLPKLALPAEHVKEIAA--ELAEIDKPA--TTDSEIPHRLSGSEL---TQ 377

Query: 167 SNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGE 226
                Q+++ E +    QL ++  E+ ++    +K    I  +E+          QL  E
Sbjct: 378 LEVLIQQVKRELQDAKSQLLKELRELEEELAEVDKKISTIPSEEQ--------IAQLLEE 429

Query: 227 MTRLTQENESAKEEVKEVLQALEEPTRTLLN 257
           +     E   ++ E++E+L+ LE     +  
Sbjct: 430 LGEAQNELFRSEAEIEELLRQLETLKEAIEA 460


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 29.3 bits (66), Expect = 2.3
 Identities = 19/92 (20%), Positives = 40/92 (43%), Gaps = 9/92 (9%)

Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTAR---------R 218
            +  +K + E E L + L+  + E+        +L+ +I E  E I  A          R
Sbjct: 30  RKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAVKDER 89

Query: 219 DYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
           +   L  E+    +   S ++E+ E+++ +E+
Sbjct: 90  ELRALNIEIQIAKERINSLEDELAELMEEIEK 121



 Score = 28.9 bits (65), Expect = 2.6
 Identities = 21/83 (25%), Positives = 40/83 (48%), Gaps = 3/83 (3%)

Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
           N E Q  +E    L  +L E  EEI +  +  E LKE++   E+ +  A     +L+ E+
Sbjct: 95  NIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAEAE---ARLEEEV 151

Query: 228 TRLTQENESAKEEVKEVLQALEE 250
             + +E +    + +E+ + L+ 
Sbjct: 152 AEIREEGQELSSKREELKEKLDP 174



 Score = 27.7 bits (62), Expect = 5.7
 Identities = 19/85 (22%), Positives = 39/85 (45%), Gaps = 2/85 (2%)

Query: 168 NEERQKLEEERERLYQQLDE--KDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQG 225
             E Q++ E  +R  ++L     + E+   +   +  KE+I   E+ +     + E+L+ 
Sbjct: 65  ESEIQEIRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAELMEEIEKLEK 124

Query: 226 EMTRLTQENESAKEEVKEVLQALEE 250
           E+  L +  E  ++ + E    LEE
Sbjct: 125 EIEDLKERLERLEKNLAEAEARLEE 149


>gnl|CDD|225803 COG3264, COG3264, Small-conductance mechanosensitive channel [Cell
           envelope biogenesis, outer membrane].
          Length = 835

 Score = 29.7 bits (67), Expect = 2.3
 Identities = 12/51 (23%), Positives = 25/51 (49%), Gaps = 4/51 (7%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRD 219
           +E+ ++ +ER+ L  +       IN     AE+L  ++ +  + I   RR+
Sbjct: 136 QEQFEVTQERDALQAE----KAYINALEGQAEQLTAEVRDILDQILDTRRE 182



 Score = 28.5 bits (64), Expect = 4.9
 Identities = 23/94 (24%), Positives = 43/94 (45%), Gaps = 1/94 (1%)

Query: 163 LGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQ 222
           L  L      K +EE E   ++ + + E I Q+    ++L + + +Q E +     D  Q
Sbjct: 31  LQLLQEAVNSKRQEEAEPAAEEAELQAELIQQELAINDQLSQALNQQTERLNALASDDRQ 90

Query: 223 LQGEMTRLTQENESAKEEVKEVLQALEEPTRTLL 256
           L   + +L Q + + +E+   VL+     +R LL
Sbjct: 91  LANLLLQLLQSSRTIREQ-IAVLRGSLLLSRILL 123


>gnl|CDD|237551 PRK13902, alaS, alanyl-tRNA synthetase; Provisional.
          Length = 900

 Score = 29.4 bits (67), Expect = 2.4
 Identities = 15/75 (20%), Positives = 37/75 (49%), Gaps = 8/75 (10%)

Query: 171 RQKLEEERERLYQQLDEKDEEINQ-QSQYAEKLKEQIMEQEE-------LITTARRDYEQ 222
            ++L +  ER +++  E+ +EI + + + AE L  +++ + E       ++     D ++
Sbjct: 756 PEQLPKTVERFFEEWKEQKKEIEKLRKELAELLASELLSKAEEVGGVKVVVEEVDADEDE 815

Query: 223 LQGEMTRLTQENESA 237
           L+     L +E + A
Sbjct: 816 LRKIANELAKEGKVA 830


>gnl|CDD|234038 TIGR02865, spore_II_E, stage II sporulation protein E.  Stage II
           sporulation protein E (SpoIIE) is a multiple membrane
           spanning protein with two separable functions. It plays
           a role in the switch to polar cell division during
           sporulation. By means of it protein phosphatase
           activity, located in the C-terminal region, it activates
           sigma-F. All proteins that score above the trusted
           cutoff to this model are found in endospore-forming
           Gram-positive bacteria. Surprisingly, a sequence from
           the Cyanobacterium-like (and presumably
           non-spore-forming) photosynthesizer Heliobacillus
           mobilis is homologous, and scores between the trusted
           and noise cutoffs [Cellular processes, Sporulation and
           germination].
          Length = 764

 Score = 29.3 bits (66), Expect = 2.6
 Identities = 23/98 (23%), Positives = 43/98 (43%), Gaps = 12/98 (12%)

Query: 171 RQKLEEERERLYQQL-------DEKDEEINQQSQYAEKLKEQIME--QEELITTARRD-Y 220
           R++LEE R  + +QL       ++  +EIN +  + + L+E+I+    +  I       Y
Sbjct: 428 RKRLEEGRRLVAEQLKGVAESVEDIAKEINLEIVFHQLLEEKIIRALNKNGIPYEDVLAY 487

Query: 221 EQLQGEMT-RLTQENESAKEE-VKEVLQALEEPTRTLL 256
               G +   LT      + E  K++   + E T  L+
Sbjct: 488 NTEGGNIDVELTIAACGGRGECEKKIAPIISEVTGELM 525


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 28.7 bits (65), Expect = 2.7
 Identities = 17/65 (26%), Positives = 38/65 (58%), Gaps = 5/65 (7%)

Query: 169 EERQKLEEERERLYQQ---LDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQG 225
           +E +KLEEE+E L ++   L+ K E I ++ +   +++E+    E  I   ++  +QL+ 
Sbjct: 127 QEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADE--IAFLKKQNQQLKS 184

Query: 226 EMTRL 230
           ++ ++
Sbjct: 185 QLEQI 189


>gnl|CDD|234368 TIGR03835, termin_org_DnaJ, terminal organelle assembly protein
           TopJ.  This model describes TopJ (MG_200, CbpA), a DnaJ
           homolog and probable assembly protein of the Mycoplasma
           terminal organelle. The terminal organelle is involved
           in both cytadherence and gliding motility [Cellular
           processes, Chemotaxis and motility].
          Length = 871

 Score = 29.4 bits (65), Expect = 2.7
 Identities = 45/201 (22%), Positives = 67/201 (33%), Gaps = 24/201 (11%)

Query: 86  AEEWKRRYEKEKEKSSRYKGKLEKLELELSRWRNGETVKEDEQVNLADPVDM--AASIAP 143
              W    E E +  S     +   E+  S        KE EQ    +PVD    A    
Sbjct: 231 DGNWVWTRELEPQDDSEDDYVIPDAEIISSPTLEVTAPKEVEQPLQPEPVDEETVAETKA 290

Query: 144 TPESAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLK 203
             E  P   +   P S       + EE      +  +  Q   E       +     K K
Sbjct: 291 EEEPQPTQTVETKPTSAP---ESTVEENLPEINQPTQAVQPTSETISTTPVEPTDQLKPK 347

Query: 204 EQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKEVL------------------ 245
           E    QEEL  T   + E+L  +   L + N    EE+K  L                  
Sbjct: 348 EVDQIQEELKKTKEIEVEELPTKKNDLVEINFDDLEELKFELVQTNQEKEPEKAVENWAT 407

Query: 246 -QALEEPTRTLLNFFRQSSPK 265
              L+EPT++ +++++Q  PK
Sbjct: 408 DYQLDEPTQSNIDWYKQEDPK 428


>gnl|CDD|233327 TIGR01241, FtsH_fam, ATP-dependent metalloprotease FtsH.
           HflB(FtsH) is a pleiotropic protein required for correct
           cell division in bacteria. It has ATP-dependent zinc
           metalloprotease activity. It was formerly designated
           cell division protein FtsH [Cellular processes, Cell
           division, Protein fate, Degradation of proteins,
           peptides, and glycopeptides].
          Length = 495

 Score = 29.2 bits (66), Expect = 2.8
 Identities = 13/40 (32%), Positives = 21/40 (52%), Gaps = 8/40 (20%)

Query: 234 NESAKEEVKEVLQALEEPTRTLLNFFRQ---SSPKGTLLY 270
            + AKEE+ E++  L+ P++     F +     PKG LL 
Sbjct: 60  IDEAKEELMEIVDFLKNPSK-----FTKLGAKIPKGVLLV 94


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
           potential nuclease [General function prediction only].
          Length = 290

 Score = 29.1 bits (65), Expect = 2.9
 Identities = 16/72 (22%), Positives = 37/72 (51%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
           E+ ++L++E+E L ++L+E + E  +  +  ++L+ +    EE++     +   L+    
Sbjct: 142 EKLEELQKEKEELLKELEELEAEYEEVQERLKRLEVENSRLEEMLKKLPGEVYDLKKRWD 201

Query: 229 RLTQENESAKEE 240
            L    E  +EE
Sbjct: 202 ELEPGVELPEEE 213


>gnl|CDD|149160 pfam07926, TPR_MLP1_2, TPR/MLP1/MLP2-like protein.  The sequences
           featured in this family are similar to a region of human
           TPR protein and to yeast myosin-like proteins 1 (MLP1)
           and 2 (MLP2). These proteins share a number of features;
           for example, they all have coiled-coil regions and all
           three are associated with nuclear pores. TPR is thought
           to be a component of nuclear pore complex- attached
           intra-nuclear filaments, and is implicated in nuclear
           protein import. Moreover, its N-terminal region is
           involved in the activation of oncogenic kinases,
           possibly by mediating the dimerisation of kinase domains
           or by targeting these kinases to the nuclear pore
           complex. MLP1 and MLP2 are involved in the process of
           telomere length regulation, where they are thought to
           interact with proteins such as Tel1p and modulate their
           activity.
          Length = 132

 Score = 28.0 bits (63), Expect = 3.0
 Identities = 21/80 (26%), Positives = 36/80 (45%), Gaps = 3/80 (3%)

Query: 174 LEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQ---LQGEMTRL 230
           LEEE+  L  +L    E+     +  +KL+E + EQ E+   A++ YE+      E    
Sbjct: 1   LEEEKSSLQAELKRLLEKKEDAEEKIQKLQEDLEEQAEIANEAQQKYERELVKHAEDIEE 60

Query: 231 TQENESAKEEVKEVLQALEE 250
            Q       E+K+ +  L+ 
Sbjct: 61  LQALRKQLNELKKEIAQLKA 80



 Score = 26.9 bits (60), Expect = 7.8
 Identities = 21/89 (23%), Positives = 50/89 (56%), Gaps = 7/89 (7%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYA----EKLKEQIMEQEEL---ITTARRDYE 221
            E ++L E++E   +++ +  E++ +Q++ A    +K + ++++  E    +   R+   
Sbjct: 10  AELKRLLEKKEDAEEKIQKLQEDLEEQAEIANEAQQKYERELVKHAEDIEELQALRKQLN 69

Query: 222 QLQGEMTRLTQENESAKEEVKEVLQALEE 250
           +L+ E+ +L  E ESA+ E+ E  ++ EE
Sbjct: 70  ELKKEIAQLKAEAESAQAELSEAEESWEE 98


>gnl|CDD|129807 TIGR00724, urea_amlyse_rel, biotin-dependent carboxylase
           uncharacterized domain.  Urea amidolyase of
           Saccharomyces cerevisiae is a 1835 amino acid protein
           with an amidase domain, a biotin/lipoyl cofactor
           attachment domain, a carbamoyl-phosphate synthase L
           chain-like domain, and uncharacterized regions. It has
           both urea carboxylase and allophanate hydrolase
           activities. This model models a domain that represents
           uncharacterized prokaryotic proteins of about 300 amino
           acids, regions of prokaryotic urea carboxylase and of
           the urea carboxylase region of yeast urea amidolyase,
           and regions of other biotin-containing proteins [Unknown
           function, General].
          Length = 314

 Score = 29.0 bits (65), Expect = 3.0
 Identities = 32/121 (26%), Positives = 44/121 (36%), Gaps = 23/121 (19%)

Query: 82  EELTAEEWKRRYEKEKEKSSRYKGKLEKLELELSRWRNGETVKED-----EQVNL-ADPV 135
           E     EWK         S R   +L+  +L+     N E +         QV     P+
Sbjct: 194 EAFWRSEWKVSSN-----SDRMGYRLQGPKLKH-ARPNRELLTHGIVYGSIQVPPNGQPI 247

Query: 136 DMAASIAPTPESAP--ASI----LPAI----PGSGLMLGSLSNEERQKLEEERERLYQQL 185
            + A  A T    P  A +    L  +    PG  +    LS EE  KL E +ER  +QL
Sbjct: 248 ILMAD-AQTTGGYPKIAVVIEADLWKVAQVRPGQSIKFVPLSLEEALKLRESQERYIKQL 306

Query: 186 D 186
            
Sbjct: 307 R 307


>gnl|CDD|233588 TIGR01823, PabB-fungal, aminodeoxychorismate synthase, fungal
           clade.  This model represents the fungal clade of a
           para-aminobenzoate synthesis enzyme,
           aminodeoxychorismate synthase, which acts on chorismate
           in a pathway that yields PABA, a precursor of folate.
          Length = 742

 Score = 29.1 bits (65), Expect = 3.1
 Identities = 12/52 (23%), Positives = 20/52 (38%), Gaps = 4/52 (7%)

Query: 47  IIICCSPASFNESETKSTLDFGRRAKTIKNVVTVNEELTAEEWKRRYEKEKE 98
           I +  SP  F E    +        + IK  V    ++  E+ +R  +  KE
Sbjct: 534 IFLSTSPEKFLEVGMDTHAKL----RPIKGTVKKGPQMNLEKARRILKTPKE 581


>gnl|CDD|240169 cd05146, RIO3_euk, RIO kinase family; eukaryotic RIO3, catalytic
           domain. The RIO kinase catalytic domain family is part
           of a larger superfamily, that includes the catalytic
           domains of other kinases such as the typical
           serine/threonine/tyrosine protein kinases (PKs),
           aminoglycoside phosphotransferase, choline kinase, and
           phosphoinositide 3-kinase (PI3K). RIO kinases are
           atypical protein serine kinases containing a kinase
           catalytic signature, but otherwise show very little
           sequence similarity to typical PKs. Serine kinases
           catalyze the transfer of the gamma-phosphoryl group from
           ATP to serine residues in protein substrates. The RIO
           catalytic domain is truncated compared to the catalytic
           domains of typical PKs, with deletions of the loops
           responsible for substrate binding. RIO3 is present only
           in multicellular eukaryotes. Its function is still
           unknown.
          Length = 197

 Score = 28.6 bits (64), Expect = 3.1
 Identities = 12/42 (28%), Positives = 19/42 (45%), Gaps = 4/42 (9%)

Query: 192 INQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQE 233
           I      A KLK+  +  EE+    +  Y Q+   M +L +E
Sbjct: 106 IGDDQVPAPKLKDAKLNDEEM----KNAYYQVLSMMKQLYKE 143


>gnl|CDD|226873 COG4467, COG4467, Regulator of replication initiation timing
           [Replication,    recombination, and repair].
          Length = 114

 Score = 27.8 bits (62), Expect = 3.2
 Identities = 16/73 (21%), Positives = 34/73 (46%)

Query: 177 ERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENES 236
           +++ ++ Q+D  +E++         LK+ +    E  T  R + E+L+  +   T E  +
Sbjct: 2   DKKEIFDQVDNLEEQLGVLLAELGGLKQHLGSLVEENTALRLENEKLRERLGEPTLEKTA 61

Query: 237 AKEEVKEVLQALE 249
            K+E   V +  E
Sbjct: 62  VKKEKPAVKKKGE 74


>gnl|CDD|225747 COG3206, GumC, Uncharacterized protein involved in
           exopolysaccharide biosynthesis [Cell envelope
           biogenesis, outer membrane].
          Length = 458

 Score = 28.9 bits (65), Expect = 3.3
 Identities = 14/80 (17%), Positives = 37/80 (46%)

Query: 171 RQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRL 230
             +L E R+++  +L +    +  +    E+ +  + ++   +        +LQ ++  L
Sbjct: 319 EAQLAELRQQIAAELRQILASLPNELALLEQQEAALEKELAQLKGRLSKLPKLQVQLREL 378

Query: 231 TQENESAKEEVKEVLQALEE 250
            +E E+A+   + +LQ  +E
Sbjct: 379 EREAEAARSLYETLLQRYQE 398


>gnl|CDD|218434 pfam05103, DivIVA, DivIVA protein.  The Bacillus subtilis divIVA1
           mutation causes misplacement of the septum during cell
           division, resulting in the formation of small, circular,
           anucleate mini-cells. Inactivation of divIVA produces a
           mini-cell phenotype, whereas overproduction of DivIVA
           results in a filamentation phenotype. These proteins
           appear to contain coiled-coils.
          Length = 131

 Score = 28.0 bits (63), Expect = 3.4
 Identities = 13/39 (33%), Positives = 22/39 (56%)

Query: 218 RDYEQLQGEMTRLTQENESAKEEVKEVLQALEEPTRTLL 256
           +DYE L  E   L +E E  +E+++E  +  E   +TL+
Sbjct: 32  KDYEALYKENEELKEEIERLEEKLEEYKELEETLQKTLV 70


>gnl|CDD|222707 pfam14357, DUF4404, Domain of unknown function (DUF4404).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are approximately 90 amino acids in length.
           There are two completely conserved residues (P and G)
           that may be functionally important.
          Length = 85

 Score = 27.2 bits (61), Expect = 3.4
 Identities = 13/47 (27%), Positives = 20/47 (42%)

Query: 165 SLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEE 211
           SL  E R  L E  + +  QL E+D    +    A++L   +   E 
Sbjct: 18  SLDEETRALLSELIDDIEAQLAEEDATEAEDESLADRLNLAVERFEA 64


>gnl|CDD|224755 COG1842, PspA, Phage shock protein A (IM30), suppresses
           sigma54-dependent transcription [Transcription / Signal
           transduction mechanisms].
          Length = 225

 Score = 28.4 bits (64), Expect = 3.4
 Identities = 18/70 (25%), Positives = 31/70 (44%), Gaps = 5/70 (7%)

Query: 176 EERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENE 235
           EE++ L       + E+ Q  +  EKLK+Q+   E+ I   R      + E  +  +   
Sbjct: 92  EEKQSLEDLAKALEAELQQAEEQVEKLKKQLAALEQKIAELRA-----KKEALKARKAAA 146

Query: 236 SAKEEVKEVL 245
            A+E+V   L
Sbjct: 147 KAQEKVNRSL 156


>gnl|CDD|241262 cd01228, PH_BCR-related, Breakpoint Cluster Region-related
           pleckstrin homology (PH) domain.  The BCR gene is one of
           the two genes in the BCR-ABL complex, which is
           associated with the Philadelphia chromosome, a product
           of a reciprocal translocation between chromosomes 22 and
           9. BCR is a GTPase-activating protein (GAP) for RAC1
           (primarily) and CDC42. The Dbl region of BCR has the
           most RhoGEF activity for Cdc42, and less activity
           towards Rac and Rho. Since BCR possesses both GAP and
           GEF activities, it may function to temporally regulate
           the activity of these GTPases. It also displays
           serine/threonine kinase activity. The BCR protein
           contains multiple domains including an N-terminal kinase
           domain, a RhoGEF domain, a PH domain, a C1 domain, a C2
           domain, and a C-terminal RhoGAP domain. ABR, a related
           smaller protein, is structurally similar to BCR, but
           lacks the N-terminal kinase domain and has GAP activity
           for both Rac and Cdc42. PH domains have diverse
           functions, but in general are involved in targeting
           proteins to the appropriate cellular location or in the
           interaction with a binding partner. They share little
           sequence conservation, but all have a common fold, which
           is electrostatically polarized. Less than 10% of PH
           domains bind phosphoinositide phosphates (PIPs) with
           high affinity and specificity. PH domains are
           distinguished from other PIP-binding domains by their
           specific high-affinity binding to PIPs with two vicinal
           phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
           PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 179

 Score = 28.1 bits (63), Expect = 3.5
 Identities = 15/77 (19%), Positives = 31/77 (40%), Gaps = 6/77 (7%)

Query: 140 SIAPTPESAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYA 199
           S+ P  ES  A  +       L+   +        ++      +   E+ +  +  S+  
Sbjct: 70  SLHPKDESEAAPDVVPKSSEELVQ--MKKAISALKQKI----KRDSAERKKAASSGSRAI 123

Query: 200 EKLKEQIMEQEELITTA 216
           EKL++++ EQE  +  A
Sbjct: 124 EKLRKKLAEQEAALLLA 140


>gnl|CDD|217135 pfam02601, Exonuc_VII_L, Exonuclease VII, large subunit.  This
           family consist of exonuclease VII, large subunit
           EC:3.1.11.6 This enzyme catalyzes exonucleolytic
           cleavage in either 5'->3' or 3'->5' direction to yield
           5'-phosphomononucleotides. This exonuclease VII enzyme
           is composed of one large subunit and 4 small ones.
          Length = 295

 Score = 28.8 bits (65), Expect = 3.6
 Identities = 30/134 (22%), Positives = 49/134 (36%), Gaps = 25/134 (18%)

Query: 136 DMAASI-APTPESAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEINQ 194
           D+ A + A TP +A   ++P              E  QKLE   +RL + L  + E+   
Sbjct: 124 DLVADVRAATPTAAAELLVPD-----------RTELLQKLEGLEQRLSRALKNRLEKEQD 172

Query: 195 Q-SQYAEKLKEQIMEQ------------EELITTARRDYEQLQGEMTRLTQENESAKEEV 241
           + +   E+LK                  + LI++ +    + Q  + RL    E  K  +
Sbjct: 173 RLNLLRERLKSLSRRLLEQHEERLAELRDRLISSIQNLLSRKQSRLERLILNRELEKNSL 232

Query: 242 KEVLQALEEPTRTL 255
            E   A    T  L
Sbjct: 233 LENKLATANLTAQL 246


>gnl|CDD|149105 pfam07851, TMPIT, TMPIT-like protein.  A number of members of this
           family are annotated as being transmembrane proteins
           induced by tumour necrosis factor alpha, but no
           literature was found to support this.
          Length = 330

 Score = 28.6 bits (64), Expect = 3.6
 Identities = 34/111 (30%), Positives = 51/111 (45%), Gaps = 13/111 (11%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
           EE ++LE+E    +QQL E      Q+ +  EKL+EQ       I   R+  +QL   + 
Sbjct: 4   EEWKELEKE----FQQLQETHRLYKQKLEEVEKLQEQCTSA---IARQRKRLKQLIVSLK 56

Query: 229 RLTQENESAKEEVKEVLQALEEPTRTLLNFFR---QSSPKGTLLYNHYLLG 276
           +L    +S   E  E+++ LEE  +   N F       PK   LY   +LG
Sbjct: 57  KL---KKSLTPEDSELVEQLEEQIKERKNQFFDMESFLPKKNGLYLRLVLG 104


>gnl|CDD|215364 PLN02678, PLN02678, seryl-tRNA synthetase.
          Length = 448

 Score = 28.9 bits (65), Expect = 3.6
 Identities = 25/90 (27%), Positives = 41/90 (45%), Gaps = 12/90 (13%)

Query: 175 EEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQEN 234
           +E R+R ++ LD   +E N+ ++   KLK            A+ D  +L  E   L +E 
Sbjct: 40  KEWRQRQFE-LDSLRKEFNKLNKEVAKLK-----------IAKEDATELIAETKELKKEI 87

Query: 235 ESAKEEVKEVLQALEEPTRTLLNFFRQSSP 264
              + EV+E   AL+   +T+ N    S P
Sbjct: 88  TEKEAEVQEAKAALDAKLKTIGNLVHDSVP 117


>gnl|CDD|236842 PRK11091, PRK11091, aerobic respiration control sensor protein
           ArcB; Provisional.
          Length = 779

 Score = 28.8 bits (65), Expect = 3.7
 Identities = 12/37 (32%), Positives = 22/37 (59%), Gaps = 3/37 (8%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQ 205
            ER+K EE R+  ++QL     EI ++ +   +L++Q
Sbjct: 120 AEREKAEEARQEAFEQL---KNEIKEREETQIELEQQ 153



 Score = 28.4 bits (64), Expect = 5.3
 Identities = 25/88 (28%), Positives = 37/88 (42%), Gaps = 25/88 (28%)

Query: 166 LSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQG 225
           LS    Q LEE R+RL              S+   KL+E  M + +L         QL+ 
Sbjct: 73  LSVVVEQ-LEESRQRL--------------SRLVAKLEE--MRERDLELN-----VQLKD 110

Query: 226 EMTRLTQE---NESAKEEVKEVLQALEE 250
            + +L QE    E A+E  +E  + L+ 
Sbjct: 111 NIAQLNQEIAEREKAEEARQEAFEQLKN 138


>gnl|CDD|198290 cd03181, GST_C_EF1Bgamma_like, Glutathione S-transferase
           C-terminal-like, alpha helical domain of the Gamma
           subunit of Elongation Factor 1B and similar proteins.
           Glutathione S-transferase (GST) C-terminal domain
           family, Gamma subunit of Elongation Factor 1B
           (EF1Bgamma) subfamily; EF1Bgamma is part of the
           eukaryotic translation elongation factor-1 (EF1) complex
           which plays a central role in the elongation cycle
           during protein biosynthesis. EF1 consists of two
           functionally distinct units, EF1A and EF1B. EF1A
           catalyzes the GTP-dependent binding of aminoacyl-tRNA to
           the ribosomal A site concomitant with the hydrolysis of
           GTP. The resulting inactive EF1A:GDP complex is recycled
           to the active GTP form by the guanine-nucleotide
           exchange factor EF1B, a complex composed of at least two
           subunits, alpha and gamma. Metazoan EFB1 contain a third
           subunit, beta. The EF1B gamma subunit contains a GST
           fold consisting of an N-terminal thioredoxin-fold domain
           and a C-terminal alpha helical domain. The GST-like
           domain of EF1Bgamma is believed to mediate the
           dimerization of the EF1 complex, which in yeast is a
           dimer of the heterotrimer EF1A:EF1Balpha:EF1Bgamma. In
           addition to its role in protein biosynthesis, EF1Bgamma
           may also display other functions. The recombinant rice
           protein has been shown to possess GSH conjugating
           activity. The yeast EF1Bgamma binds to membranes in a
           calcium dependent manner and is also part of a complex
           that binds to the msrA (methionine sulfoxide reductase)
           promoter suggesting a function in the regulation of its
           gene expression. Also included in this subfamily is the
           GST_C-like domain at the N-terminus of human valyl-tRNA
           synthetase (ValRS) and its homologs. Metazoan ValRS
           forms a stable complex with Elongation Factor-1H
           (EF-1H), and together, they catalyze consecutive steps
           in protein biosynthesis, tRNA aminoacylation and its
           transfer to EF.
          Length = 123

 Score = 27.5 bits (62), Expect = 3.8
 Identities = 12/27 (44%), Positives = 16/27 (59%), Gaps = 2/27 (7%)

Query: 232 QENESAKEEVKEVLQALEEP--TRTLL 256
           +  + AKE++K  L  LEE   TRT L
Sbjct: 36  KAVDKAKEDLKRALGVLEEHLLTRTYL 62


>gnl|CDD|224644 COG1730, GIM5, Predicted prefoldin, molecular chaperone implicated
           in de novo protein folding [Posttranslational
           modification, protein turnover, chaperones].
          Length = 145

 Score = 27.7 bits (62), Expect = 3.8
 Identities = 12/44 (27%), Positives = 25/44 (56%)

Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEE 211
            +  ++LE+  E+L Q L E  + I Q  Q A++L+++    ++
Sbjct: 100 KKRIEELEKAIEKLQQALAELAQRIEQLEQEAQQLQQKQAAAQK 143


>gnl|CDD|205692 pfam13514, AAA_27, AAA domain.  This domain is found in a number of
           double-strand DNA break proteins. This domain contains a
           P-loop motif.
          Length = 1118

 Score = 29.0 bits (65), Expect = 3.9
 Identities = 35/187 (18%), Positives = 61/187 (32%), Gaps = 30/187 (16%)

Query: 85  TAEEWKRRYEKEKEKSSRYKGK-------LEKLELELSRWRNGETVKEDEQVNLA----- 132
             + + +  E+  +   + + +       LE LEL+L        V  +E+V  A     
Sbjct: 453 QVQAFLKEAEEIMQAKRQKRDRLLELEEDLETLELQLRHLDAAGAVPTEEEVAAARARRD 512

Query: 133 ----DPVDMAASIAPTPESAPASILPAIPGSGLMLGSLS-----NEERQKLEEERERLYQ 183
               D     A + PT  +   ++  A   +  +L            RQ+ E  R RL Q
Sbjct: 513 ALWQDIRRGYALLEPTASAYEEAVESADQLADRLLREAQLVGELQSLRQQEEAARRRLEQ 572

Query: 184 QLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKE 243
              E +      +   E  + Q                    EM     E  +A+E+V+ 
Sbjct: 573 LEKELEVLELALAALREAWQAQWAA---------AGLPLTPAEMEDWLAERATAREQVRA 623

Query: 244 VLQALEE 250
             +A  E
Sbjct: 624 YFKARAE 630


>gnl|CDD|113902 pfam05149, Flagellar_rod, Paraflagellar rod protein.  This family
           consists of several eukaryotic paraflagellar rod
           component proteins. The eukaryotic flagellum represents
           one of the most complex macromolecular structures found
           in any organism and contains more than 250 proteins. In
           addition to its locomotive role, the flagellum is
           probably involved in nutrient uptake since receptors for
           host low-density lipoproteins are localised on the
           flagellar membrane as well as on the flagellar pocket
           membrane.
          Length = 289

 Score = 28.5 bits (64), Expect = 3.9
 Identities = 15/77 (19%), Positives = 33/77 (42%)

Query: 179 ERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAK 238
           + +   ++ K  + +   +  E+ KE+  E++      R+ Y   + E  +  Q+N   +
Sbjct: 2   KDISAVIEAKHRQKSACEEDLERCKEKREEEDAADAKQRKRYSAQRRESEKFLQQNVEQQ 61

Query: 239 EEVKEVLQALEEPTRTL 255
           +     +Q LE   R L
Sbjct: 62  QACWRAIQELERELRDL 78


>gnl|CDD|212684 cd11734, Ssq1_like_NBD, Nucleotide-binding domain of Saccharomyces
           cerevisiae Ssq1 and similar proteins.  Ssq1p (also
           called Stress-seventy subfamily Q protein 1, Ssc2p,
           Ssh1p, mtHSP70 homolog) belongs to the heat shock
           protein 70 (HSP70) family of chaperones that assist in
           protein folding and assembly, and can direct incompetent
           "client" proteins towards degradation. Typically, HSP70s
           have a nucleotide-binding domain (NBD) and a
           substrate-binding domain (SBD). The nucleotide sits in a
           deep cleft formed between the two lobes of the NBD. The
           two subdomains of each lobe change conformation between
           ATP-bound, ADP-bound, and nucleotide-free states. ATP
           binding opens up the substrate-binding site;
           substrate-binding increases the rate of ATP hydrolysis.
           Hsp70 chaperone activity is regulated by various
           co-chaperones: J-domain proteins and nucleotide exchange
           factors (NEFs). S. cerevisiae Ssq1p is a mitochondrial
           chaperone that is involved in iron-sulfur (Fe/S) center
           biogenesis. Ssq1p plays a role in the maturation of
           Yfh1p, a nucleus-encoded mitochondrial protein involved
           in iron homeostasis (and a homolog of human frataxin,
           implicated in the neurodegenerative disease,
           Friedreich's ataxia).
          Length = 373

 Score = 28.6 bits (64), Expect = 3.9
 Identities = 18/58 (31%), Positives = 27/58 (46%), Gaps = 4/58 (6%)

Query: 29  SKLTRILQESLGGNARTTIIICCSPASFNESETKSTLDFGRRAKTIKNVVTVNEELTA 86
            KL +  +  LG      +I    PA FN+S+ ++T D G  A     V+ +  E TA
Sbjct: 120 KKLKKTAEAYLGKRVDEAVITV--PAYFNDSQRQATKDAGTLAGL--KVLRIINEPTA 173


>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase.  This model represents the
           seryl-tRNA synthetase found in most organisms. This
           protein is a class II tRNA synthetase, and is recognized
           by the pfam model tRNA-synt_2b. The seryl-tRNA
           synthetases of two archaeal species, Methanococcus
           jannaschii and Methanobacterium thermoautotrophicum,
           differ considerably and are included in a different
           model [Protein synthesis, tRNA aminoacylation].
          Length = 418

 Score = 28.5 bits (64), Expect = 3.9
 Identities = 10/68 (14%), Positives = 41/68 (60%), Gaps = 1/68 (1%)

Query: 179 ERLYQQLDEKDEEINQ-QSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESA 237
           +   ++L  + EE+   +++ ++++ +   ++++ I   +++ ++L+ E+T L+   ++ 
Sbjct: 36  DDERKKLLSEIEELQAKRNELSKQIGKAKGQKKDKIEEIKKELKELKEELTELSAALKAL 95

Query: 238 KEEVKEVL 245
           + E+++ L
Sbjct: 96  EAELQDKL 103



 Score = 28.1 bits (63), Expect = 6.4
 Identities = 19/115 (16%), Positives = 44/115 (38%), Gaps = 18/115 (15%)

Query: 171 RQKLEEERERLYQQLDEK----DEEINQQSQYAEKLKEQIMEQEEL-------------- 212
           R+ L    + + + L  +    D ++ +     ++ K+ + E EEL              
Sbjct: 4   RKLLRNNPDLVKESLKARGLSVDIDLEKLIALDDERKKLLSEIEELQAKRNELSKQIGKA 63

Query: 213 ITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEEPTRTLLNFFRQSSPKGT 267
               +   E+++ E+  L +E       +K +   L++   ++ N   +S P G 
Sbjct: 64  KGQKKDKIEEIKKELKELKEELTELSAALKALEAELQDKLLSIPNIPHESVPVGK 118


>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family.  This family of
           proteins contain a band 4.1 domain (pfam00373), at their
           amino terminus. This family represents the rest of these
           proteins.
          Length = 244

 Score = 28.6 bits (64), Expect = 4.0
 Identities = 20/96 (20%), Positives = 42/96 (43%), Gaps = 7/96 (7%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
           EE Q LE++ + L ++    +EE     +  E+L+ ++ E    +        +L+ E  
Sbjct: 47  EEAQLLEKKADELEEENRRLEEEAAASEEERERLEAEVDEATAEV-------AKLEEERE 99

Query: 229 RLTQENESAKEEVKEVLQALEEPTRTLLNFFRQSSP 264
           +   E    ++E++E  +A E   + LL      + 
Sbjct: 100 KKEAETRQLQQELREAQEAHERARQELLEAAAAPTA 135


>gnl|CDD|223541 COG0465, HflB, ATP-dependent Zn proteases [Posttranslational
           modification, protein turnover, chaperones].
          Length = 596

 Score = 28.8 bits (65), Expect = 4.1
 Identities = 12/40 (30%), Positives = 20/40 (50%), Gaps = 6/40 (15%)

Query: 234 NESAKEEVKEVLQALEEPTRTLLNFFRQSS--PKGTLLYN 271
            + AKEE+ E++  L+ P +    +    +  PKG LL  
Sbjct: 155 VDEAKEELSELVDFLKNPKK----YQALGAKIPKGVLLVG 190


>gnl|CDD|223910 COG0840, Tar, Methyl-accepting chemotaxis protein [Cell motility
           and secretion / Signal transduction mechanisms].
          Length = 408

 Score = 28.8 bits (64), Expect = 4.1
 Identities = 21/95 (22%), Positives = 36/95 (37%), Gaps = 1/95 (1%)

Query: 163 LGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQ 222
                 E   ++ E  + L ++      EI    +   +L  +I    E  T    +   
Sbjct: 308 AVEHMEESASEVSEGVK-LVEETGSSLGEIAAAIEEVSQLISEIAAATEEQTAVLEEINA 366

Query: 223 LQGEMTRLTQENESAKEEVKEVLQALEEPTRTLLN 257
              E+  +TQEN +A EE+    + L+E    LL 
Sbjct: 367 SIEELDDVTQENAAAVEELAAASEELKELAEKLLE 401


>gnl|CDD|223520 COG0443, DnaK, Molecular chaperone [Posttranslational modification,
           protein turnover, chaperones].
          Length = 579

 Score = 28.8 bits (65), Expect = 4.1
 Identities = 17/62 (27%), Positives = 27/62 (43%), Gaps = 1/62 (1%)

Query: 189 DEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQAL 248
           DEEI +  + AE       +  EL+  AR + E L   + +  +E     EE KE ++  
Sbjct: 485 DEEIERMVEDAEANAALDKKFRELV-EARNEAESLIYSLEKALKEIVKVSEEEKEKIEEA 543

Query: 249 EE 250
             
Sbjct: 544 IT 545


>gnl|CDD|221919 pfam13094, CENP-Q, CENP-A-nucleosome distal (CAD) centromere
           subunit.  CENP-Q is one of the components that assembles
           onto the CENP-A-nucleosome distal (CAD) centromere. The
           centromere, which is the basic element of chromosome
           inheritance, is epigenetically determined in mammals.
           CENP-A, the centromere-specific histone H3 variant,
           assembles an array of nucleosomes and it is this that
           seems to be the prime candidate for specifying
           centromere identity. CENP-A nucleosomes directly recruit
           a proximal CENP-A nucleosome associated complex (NAC)
           comprised of CENP-M, CENP-N and CENP-T, CENP-U(50),
           CENP-C and CENP-H. Assembly of the CENP-A NAC at
           centromeres is dependent on CENP-M, CENP-N and CENP-T.
           Additionally, there are seven other subunits which make
           up the CENP-A-nucleosome distal (CAD) centromere,
           CENP-K, CENP-L, CENP-O, CENP-P, CENP-Q, CENP-R and
           CENP-S, also assembling on the CENP-A NAC. Fta7 is the
           equivalent component of the fission yeast Sim4 complex.
          Length = 159

 Score = 28.1 bits (63), Expect = 4.2
 Identities = 24/107 (22%), Positives = 44/107 (41%), Gaps = 9/107 (8%)

Query: 154 PAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELI 213
           P      L    L   +R  LE +     +QL+  +EEI ++    EK  E + E E+  
Sbjct: 13  PPTKEDSLDYDQLLRRKRA-LERQLAAELKQLELLEEEIRREEAELEKDLEYLQELEKNA 71

Query: 214 TTARRDYEQ--------LQGEMTRLTQENESAKEEVKEVLQALEEPT 252
               R+ E+        L+   + + +ENE  ++ + E+ +    P 
Sbjct: 72  KALEREREEESKNLHPVLRLLESEVLEENELLQDSLLELSERNFSPN 118


>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC.  All proteins in this family for
           which functions are known are part of an exonuclease
           complex with sbcD homologs. This complex is involved in
           the initiation of recombination to regulate the levels
           of palindromic sequences in DNA. This family is based on
           the phylogenomic analysis of JA Eisen (1999, Ph.D.
           Thesis, Stanford University) [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 1042

 Score = 28.8 bits (64), Expect = 4.2
 Identities = 17/92 (18%), Positives = 31/92 (33%), Gaps = 10/92 (10%)

Query: 166 LSNEERQKLEEERERLYQQLDEKDEEINQ-------QSQYAEKLKEQIMEQEELITTARR 218
                RQ   ++ +   +QL    E + Q          + E+   +  E E   ++   
Sbjct: 673 ELLASRQLALQKMQSEKEQLTYWKEMLAQCQTLLRELETHIEEYDREFNEIENASSSLGS 732

Query: 219 DYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
           D   L      L Q  +    + + VL+A  E
Sbjct: 733 D---LAAREDALNQSLKELMHQARTVLKARTE 761


>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
           [Function unknown].
          Length = 294

 Score = 28.5 bits (64), Expect = 4.2
 Identities = 17/75 (22%), Positives = 38/75 (50%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
            E Q+  EE  +L+++ DE  +E ++  +   +L ++I E  E     + +  +L+ ++ 
Sbjct: 186 NEAQEYHEEMIKLFEEADELRKEADELHEEFVELSKKIDELHEEFRNLQNELRELEKKIK 245

Query: 229 RLTQENESAKEEVKE 243
            L  + ++AK   K 
Sbjct: 246 ALRAKEKAAKRREKR 260


>gnl|CDD|145958 pfam03084, Sigma_1_2, Reoviral Sigma1/Sigma2 family.  Reoviruses
           are double-stranded RNA viruses. They lack a membrane
           envelope and their capsid is organised in two concentric
           icosahedral layers: an inner core and an outer capsid
           layer. The sigma1 protein is found in the outer capsid,
           and the sigma2 protein is found in the core. There are
           four other kinds of protein (besides sigma2) in the
           core, termed lambda 1-3, mu2. Interactions between
           sigma2 and lambda 1 and lambda 3 are thought to initiate
           core formation, followed by mu2 and lambda2. Sigma1 is a
           trimeric protein, and is positioned at the 12 vertices
           of the icosahedral outer capsid layer. Its N-terminal
           fibrous tail, arranged as a triple coiled coil, anchors
           it in the virion, and a C-terminal globular head
           interacts with the cellular receptor. These two parts
           form by separate trimerisation events. The N-terminal
           fibrous tail forms on the polysome, without the
           involvement of ATP or chaperones. The post-
           translational assembly of the C-terminal globular head
           involves the chaperone activity of Hsp90, which is
           associated with phosphorylation of Hsp90 during the
           process. Sigma1 protein acts as a cell attachment
           protein, and determines viral virulence, pathways of
           spread, and tropism. Junctional adhesion molecule has
           been identified as a receptor for sigma1. In type 3
           reoviruses, a small region, predicted to form a beta
           sheet, in the N-terminal tail was found to bind target
           cell surface sialic acid (i.e. sialic acid acts as a
           co-receptor) and promote apoptosis. The sigma1 protein
           also binds to the lambda2 core protein.
          Length = 415

 Score = 28.7 bits (64), Expect = 4.3
 Identities = 13/50 (26%), Positives = 20/50 (40%), Gaps = 8/50 (16%)

Query: 162 MLGSLSNEERQKLEEE--------RERLYQQLDEKDEEINQQSQYAEKLK 203
             G +S  + Q+L +         R +L     E D   NQ   YA ++K
Sbjct: 343 RAGLISQAKGQQLTQNADDTQQFKRAKLEAIKAEDDALYNQNPNYARRIK 392


>gnl|CDD|240403 PTZ00400, PTZ00400, DnaK-type molecular chaperone; Provisional.
          Length = 663

 Score = 28.6 bits (64), Expect = 4.3
 Identities = 18/67 (26%), Positives = 39/67 (58%), Gaps = 6/67 (8%)

Query: 189 DEEINQQSQYAEKLKEQIMEQEELITTARRDYEQL----QGEMTRLT-QENESAKEEVKE 243
           DEEI +  + AE+ KEQ  E+++ +  A+ + E L    + +++ L  + +++ K+E+K+
Sbjct: 543 DEEIEKMVKEAEEYKEQ-DEKKKELVDAKNEAETLIYSVEKQLSDLKDKISDADKDELKQ 601

Query: 244 VLQALEE 250
            +  L  
Sbjct: 602 KITKLRS 608


>gnl|CDD|217436 pfam03226, Yippee, Yippee putative zinc-binding protein. 
          Length = 109

 Score = 27.2 bits (61), Expect = 4.4
 Identities = 12/24 (50%), Positives = 15/24 (62%), Gaps = 5/24 (20%)

Query: 89  WKRRYEKEKEKSSRYK-GK--LEK 109
           WK  YE   E+S +YK GK  LE+
Sbjct: 82  WK--YEFAYEESQKYKEGKFILER 103


>gnl|CDD|217331 pfam03036, Perilipin, Perilipin family.  The perilipin family
           includes lipid droplet-associated protein (perilipin)
           and adipose differentiation-related protein
           (adipophilin).
          Length = 390

 Score = 28.2 bits (63), Expect = 4.8
 Identities = 32/112 (28%), Positives = 40/112 (35%), Gaps = 16/112 (14%)

Query: 139 ASIAPTPESAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQY 198
           A  A   E      +   P   + LGSLS     KL   R R YQ    K  +  Q+ Q 
Sbjct: 194 APDATKAEGFEVGSVEQAPSYFVRLGSLSA----KL---RRRAYQHSLAKLRQARQRGQE 246

Query: 199 AEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
           A     Q+    ELI +AR+   Q      +L    E  KE      Q   E
Sbjct: 247 AIH---QLPSTAELIESARKGVHQ------KLGGAQEKLKELWLSWSQGQPE 289


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 28.2 bits (63), Expect = 4.9
 Identities = 15/52 (28%), Positives = 28/52 (53%), Gaps = 4/52 (7%)

Query: 164 GSLSNEERQKLEEERERLYQQLDEKDEEINQQS----QYAEKLKEQIMEQEE 211
             LS ++RQ+L +E  +  Q   +K  E+  ++    +   K KE+ ++QEE
Sbjct: 360 TDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEE 411


>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein.  This family consists
           of several eukaryotic mitotic checkpoint (Mitotic arrest
           deficient or MAD) proteins. The mitotic spindle
           checkpoint monitors proper attachment of the bipolar
           spindle to the kinetochores of aligned sister chromatids
           and causes a cell cycle arrest in prometaphase when
           failures occur. Multiple components of the mitotic
           spindle checkpoint have been identified in yeast and
           higher eukaryotes. In S.cerevisiae, the existence of a
           Mad1-dependent complex containing Mad2, Mad3, Bub3 and
           Cdc20 has been demonstrated.
          Length = 722

 Score = 28.4 bits (63), Expect = 5.0
 Identities = 37/171 (21%), Positives = 74/171 (43%), Gaps = 27/171 (15%)

Query: 74  IKNVVTVNEELTAEEWKR-RYEKEKEKSSRYKGKLEKLELELSRWRNGETVKEDEQVNLA 132
            ++   + EEL   + +  R+EK +EK +  + + EKLE EL  W++   + +D  +NL 
Sbjct: 280 KEDNELLKEELEDLQSRLERFEKMREKLADLELEKEKLENELKSWKS---LLQDIGLNLR 336

Query: 133 DPVDMAASIAPTPESAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEI 192
            P D++  I                        L NEE Q L+E+   +     + +  +
Sbjct: 337 TPDDLSRRIV----------------------VLQNEELQ-LKEKNGSISSSAKQLETTL 373

Query: 193 NQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKE 243
            Q     +K   +I+E ++ +   +    +LQ  +T +T+E +  +  +  
Sbjct: 374 QQLQLERQKAVSEILELKKKLEALKALVRRLQRRLTLVTKERDGLRAILNS 424


>gnl|CDD|178588 PLN03014, PLN03014, carbonic anhydrase.
          Length = 347

 Score = 28.2 bits (62), Expect = 5.1
 Identities = 15/49 (30%), Positives = 26/49 (53%)

Query: 200 EKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQAL 248
           E LK+ ++E+EEL T A    EQ+   +   T  ++ A + V+ + Q  
Sbjct: 88  EALKKLLIEKEELKTVAAAKVEQITAALQTGTSSDKKAFDPVETIKQGF 136


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 28.5 bits (64), Expect = 5.1
 Identities = 11/38 (28%), Positives = 21/38 (55%)

Query: 82  EELTAEEWKRRYEKEKEKSSRYKGKLEKLELELSRWRN 119
           EEL  E+ ++   KE+++  + K + EK + EL +   
Sbjct: 551 EELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588


>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
            This is a family of proteins that form part of the CAZ
           (cytomatrix at the active zone) complex which is
           involved in determining the site of synaptic vesicle
           fusion. The C-terminus is a PDZ-binding motif that binds
           directly to RIM (a small G protein Rab-3A effector). The
           family also contains four coiled-coil domains.
          Length = 774

 Score = 28.4 bits (63), Expect = 5.2
 Identities = 16/82 (19%), Positives = 40/82 (48%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
           +E+  L+ E + L  +L+ K   + +++   +  +E+       I   R  YE+ + ++ 
Sbjct: 336 QEKAILQTEVDALRYELERKHNTLTKKTASLQAAQEEKATYAGEIEDMRDRYEKTERKLR 395

Query: 229 RLTQENESAKEEVKEVLQALEE 250
            L ++ E+ +E  +   + L+E
Sbjct: 396 VLQKKIENLQETFRRKERRLKE 417


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 28.1 bits (63), Expect = 5.3
 Identities = 11/42 (26%), Positives = 25/42 (59%), Gaps = 5/42 (11%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQE 210
           +E+ K E+E + L +  ++  E+     + AEKL+ + +++E
Sbjct: 69  KEKAKREKEEKGLRELEEDTPED-----ELAEKLRLRKLQEE 105


>gnl|CDD|204405 pfam10146, zf-C4H2, Zinc finger-containing protein.  This is a
           family of proteins which appears to have a highly
           conserved zinc finger domain at the C terminal end,
           described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is
           predicted to contain a coiled coil. Members are
           annotated as being tumour-associated antigen HCA127 in
           humans but this could not confirmed.
          Length = 215

 Score = 27.9 bits (62), Expect = 5.3
 Identities = 16/63 (25%), Positives = 30/63 (47%), Gaps = 4/63 (6%)

Query: 200 EKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEEPTRTLLNFF 259
           EKLK++I  +EE + +  +  ++   EM  L +E    K +  E L+ +      +    
Sbjct: 14  EKLKDEIKAEEEALESEEKHLKEYDKEMEELLEE----KMQHVEELRQIHADINDMETEI 69

Query: 260 RQS 262
           +QS
Sbjct: 70  KQS 72


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 28.3 bits (63), Expect = 5.4
 Identities = 22/74 (29%), Positives = 36/74 (48%), Gaps = 3/74 (4%)

Query: 179 ERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLT---QENE 235
           ERL Q + +   E+ Q  Q   +L+ QI     L  + + D  QL+ E   L        
Sbjct: 409 ERLEQDIKKLQAELQQARQNESELRNQISLLTSLERSLKSDLGQLKKENDMLQTKLNSMV 468

Query: 236 SAKEEVKEVLQALE 249
           SAK++ K+ +Q++E
Sbjct: 469 SAKQKDKQSMQSME 482


>gnl|CDD|216073 pfam00709, Adenylsucc_synt, Adenylosuccinate synthetase. 
          Length = 420

 Score = 28.3 bits (64), Expect = 5.4
 Identities = 26/83 (31%), Positives = 43/83 (51%), Gaps = 15/83 (18%)

Query: 160 GLMLGSLSNEE--RQKLEE-------ERERLYQQLDEKDEEINQQ-SQYAEKLKEQIMEQ 209
           G+ +G L +EE  R+KLE+         E+LY       EEI ++  +YAE+L+  I + 
Sbjct: 143 GIRVGDLLDEEVLREKLEQLLEEKNAILEKLYGAEPLDVEEILEEYLEYAERLRPYITDT 202

Query: 210 EELITTARRD-----YEQLQGEM 227
            EL+  A ++     +E  QG +
Sbjct: 203 SELLNRALKEGKKVLFEGAQGTL 225


>gnl|CDD|149682 pfam08702, Fib_alpha, Fibrinogen alpha/beta chain family.
           Fibrinogen is a protein involved in platelet aggregation
           and is essential for the coagulation of blood. This
           domain forms part of the central coiled coiled region of
           the protein which is formed from two sets of three
           non-identical chains (alpha, beta and gamma).
          Length = 146

 Score = 27.3 bits (61), Expect = 5.5
 Identities = 10/63 (15%), Positives = 20/63 (31%), Gaps = 5/63 (7%)

Query: 170 ERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTR 229
             QK  +  + +Y    +              L+ +I  QE  I   +      + ++ R
Sbjct: 68  GDQKQAQPNDNIYNAYSKSLR-----KMIEYILETKINTQESQIRVLQEVLRSNRSKIQR 122

Query: 230 LTQ 232
           L  
Sbjct: 123 LEV 125


>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family.  This
           family consists of the 116kDa V-type ATPase (vacuolar
           (H+)-ATPases) subunits, as well as V-type ATP synthase
           subunit i. The V-type ATPases family are proton pumps
           that acidify intracellular compartments in eukaryotic
           cells for example yeast central vacuoles,
           clathrin-coated and synaptic vesicles. They have
           important roles in membrane trafficking processes. The
           116kDa subunit (subunit a) in the V-type ATPase is part
           of the V0 functional domain responsible for proton
           transport. The a subunit is a transmembrane glycoprotein
           with multiple putative transmembrane helices it has a
           hydrophilic amino terminal and a hydrophobic carboxy
           terminal. It has roles in proton transport and assembly
           of the V-type ATPase complex. This subunit is encoded by
           two homologous gene in yeast VPH1 and STV1.
          Length = 707

 Score = 28.5 bits (64), Expect = 5.5
 Identities = 22/106 (20%), Positives = 39/106 (36%), Gaps = 13/106 (12%)

Query: 167 SNEERQKLEEERERLY----------QQLDEKDEEINQQSQYAEKLKEQIMEQEELITTA 216
            NE R+  E ER+                D   +     S+    L+E+I++ E  I   
Sbjct: 27  VNEVRRCDEVERKLRKLESKIKKLGIPLKDTGGKPDVPPSKEFLDLEEEILDLEAEIKEV 86

Query: 217 RRDYEQLQGE---MTRLTQENESAKEEVKEVLQALEEPTRTLLNFF 259
             + E L+ E   +       +  K  + E L+ L E +   ++F 
Sbjct: 87  EENLESLEKEINELEEWLNVLDEEKSFLDENLEELSELSNLDIDFK 132


>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200).  This
           family is found in eukaryotes. It is a coiled-coil
           domain of unknwon function.
          Length = 126

 Score = 27.2 bits (61), Expect = 5.7
 Identities = 19/87 (21%), Positives = 45/87 (51%), Gaps = 4/87 (4%)

Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKL----KEQIMEQEELITTARRDYEQL 223
            EE ++ EE  ++  ++L++K+EE+ +     +K     + +    E+     ++  ++ 
Sbjct: 20  REEFERREELLKQREEELEKKEEELQESLIKFDKFLKENEAKRRRAEKKAEEEKKLRKEK 79

Query: 224 QGEMTRLTQENESAKEEVKEVLQALEE 250
           + E+  L  E E  K E++++ + LEE
Sbjct: 80  EEEIKELKAELEELKAEIEKLEEKLEE 106


>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
           autophagy-related subunit 14.  The Atg14 or Apg14
           proteins are hydrophilic proteins with a predicted
           molecular mass of 40.5 kDa, and have a coiled-coil motif
           at the N terminus region. Yeast cells with mutant Atg14
           are defective not only in autophagy but also in sorting
           of carboxypeptidase Y (CPY), a vacuolar-soluble
           hydrolase, to the vacuole. Subcellular fractionation
           indicate that Apg14p and Apg6p are peripherally
           associated with a membrane structure(s). Apg14p was
           co-immunoprecipitated with Apg6p, suggesting that they
           form a stable protein complex. These results imply that
           Apg6/Vps30p has two distinct functions: in the
           autophagic process and in the vacuolar protein sorting
           pathway. Apg14p may be a component specifically required
           for the function of Apg6/Vps30p through the autophagic
           pathway. There are 17 auto-phagosomal component proteins
           which are categorized into six functional units, one of
           which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
           AS-PI3K complex and the Atg2-Atg18 complex are essential
           for nucleation, and the specific function of the AS-PI3K
           apparently is to produce phosphatidylinositol
           3-phosphate (PtdIns(3)P) at the pre-autophagosomal
           structure (PAS). The localisation of this complex at the
           PAS is controlled by Atg14. Autophagy mediates the
           cellular response to nutrient deprivation, protein
           aggregation, and pathogen invasion in humans, and
           malfunction of autophagy has been implicated in multiple
           human diseases including cancer. This effect seems to be
           mediated through direct interaction of the human Atg14
           with Beclin 1 in the human phosphatidylinositol 3-kinase
           class III complex.
          Length = 307

 Score = 28.1 bits (63), Expect = 5.7
 Identities = 24/93 (25%), Positives = 42/93 (45%), Gaps = 2/93 (2%)

Query: 165 SLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQ 224
             +  E  KL  +  +L      K E +NQ      +LKE+I ++ E I   +R   Q +
Sbjct: 48  EGATNEDGKLAADLLKLEVA--RKKERLNQIRARISQLKEEIEQKRERIEELKRALAQRR 105

Query: 225 GEMTRLTQENESAKEEVKEVLQALEEPTRTLLN 257
            +++  + + E  +    E LQ   + TR+ LN
Sbjct: 106 SDLSSASYQLEKRRASQLEKLQDEIKRTRSKLN 138


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 27.7 bits (62), Expect = 5.8
 Identities = 17/47 (36%), Positives = 29/47 (61%), Gaps = 3/47 (6%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKL---KEQIMEQEEL 212
           EER+KLEE+RE   ++ +E +EE  ++ +  E+    ++   EQEE 
Sbjct: 29  EERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEY 75


>gnl|CDD|238111 cd00187, TOP4c, DNA Topoisomerase, subtype IIA; domain A';
           bacterial DNA topoisomerase IV (C subunit, ParC),
           bacterial DNA gyrases (A subunit, GyrA),mammalian DNA
           toposiomerases II. DNA topoisomerases are essential
           enzymes that regulate the conformational changes in DNA
           topology by catalysing the concerted breakage and
           rejoining of DNA strands during normal cellular growth.
          Length = 445

 Score = 27.9 bits (63), Expect = 6.2
 Identities = 8/28 (28%), Positives = 17/28 (60%)

Query: 160 GLMLGSLSNEERQKLEEERERLYQQLDE 187
            + L  L+  ER+KL +E + L  ++++
Sbjct: 396 DMRLRRLTKLEREKLLKELKELEAEIED 423


>gnl|CDD|212667 cd10170, HSP70_NBD, Nucleotide-binding domain of the HSP70 family. 
           HSP70 (70-kDa heat shock protein) family chaperones
           assist in protein folding and assembly and can direct
           incompetent "client" proteins towards degradation.
           Typically, HSP70s have a nucleotide-binding domain (NBD)
           and a substrate-binding domain (SBD). The nucleotide
           sits in a deep cleft formed between the two lobes of the
           NBD. The two subdomains of each lobe change conformation
           between ATP-bound, ADP-bound, and nucleotide-free
           states. ATP binding opens up the substrate-binding site;
           substrate-binding increases the rate of ATP hydrolysis.
           HSP70 chaperone activity is regulated by various
           co-chaperones: J-domain proteins and nucleotide exchange
           factors (NEFs). Some HSP70 family members are not
           chaperones but instead, function as NEFs to remove ADP
           from their HSP70 chaperone partners during the ATP
           hydrolysis cycle, some may function as both chaperones
           and NEFs.
          Length = 369

 Score = 28.0 bits (63), Expect = 6.2
 Identities = 14/53 (26%), Positives = 24/53 (45%), Gaps = 3/53 (5%)

Query: 30  KLTRILQESLGGNARTTIIICCSPASFNESETKSTLDFGRRAKTIKNVVTVNE 82
           KL    +  LG      +I    PA FN+++ ++T +    A  +  V  +NE
Sbjct: 118 KLKEDAEAYLGEPVTEAVITV--PAYFNDAQREATKEAAEIAG-LNVVRLINE 167


>gnl|CDD|224006 COG1080, PtsA, Phosphoenolpyruvate-protein kinase (PTS system EI
           component in bacteria) [Carbohydrate transport and
           metabolism].
          Length = 574

 Score = 27.9 bits (63), Expect = 6.2
 Identities = 21/81 (25%), Positives = 31/81 (38%), Gaps = 7/81 (8%)

Query: 174 LEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARR----DYEQLQGEMTR 229
           +E E ERL   L     E+       EK  E   E++  I  A      D E  +     
Sbjct: 37  VEAEIERLDAALAAARAELEAL---KEKAAEDGGEEKAAIFEAHLMLLEDPELTEEAEEL 93

Query: 230 LTQENESAKEEVKEVLQALEE 250
           + Q   SA+  +KEV++    
Sbjct: 94  IRQGKVSAEYALKEVIEEFAA 114


>gnl|CDD|237560 PRK13922, PRK13922, rod shape-determining protein MreC;
           Provisional.
          Length = 276

 Score = 28.0 bits (63), Expect = 6.3
 Identities = 10/40 (25%), Positives = 17/40 (42%), Gaps = 3/40 (7%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAE---KLKEQ 205
                L EE E L ++L E +  + +  Q      +L+E 
Sbjct: 69  ASLFDLREENEELKKELLELESRLQELEQLEAENARLREL 108


>gnl|CDD|220947 pfam11026, DUF2721, Protein of unknown function (DUF2721).  This
           family is conserved in bacteria. The function is not
           known.
          Length = 130

 Score = 27.1 bits (61), Expect = 6.3
 Identities = 15/53 (28%), Positives = 22/53 (41%), Gaps = 17/53 (32%)

Query: 148 APASILPAIPGSGLMLGSLSN--------------EERQKLEEERERLYQQLD 186
           APA +L AI   GL+L   +N                 +  + ER R+ +QL 
Sbjct: 4   APAFLLSAI---GLLLLVYTNRLARIADRIRQLEARLEETDDPERARIRRQLR 53


>gnl|CDD|220180 pfam09325, Vps5, Vps5 C terminal like.  Vps5 is a sorting nexin
           that functions in membrane trafficking. This is the C
           terminal dimerisation domain.
          Length = 236

 Score = 27.6 bits (62), Expect = 6.6
 Identities = 13/75 (17%), Positives = 40/75 (53%)

Query: 176 EERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENE 235
            +R + +QQ    +++++++     KLK     +++ +  A+++ ++ + ++ +  +E E
Sbjct: 128 NQRVKAWQQWQNAEQDLSKKRAQLAKLKGAGKSRQDKLQQAKKEVDEAERKVQQAKKEFE 187

Query: 236 SAKEEVKEVLQALEE 250
              E +K+ L+  E 
Sbjct: 188 DISETIKKELERFET 202


>gnl|CDD|217163 pfam02646, RmuC, RmuC family.  This family contains several
           bacterial RmuC DNA recombination proteins. The function
           of the RMUC protein is unknown but it is suspected that
           it is either a structural protein that protects DNA
           against nuclease action, or is itself involved in DNA
           cleavage at the regions of DNA secondary structures.
          Length = 301

 Score = 27.6 bits (62), Expect = 6.7
 Identities = 16/81 (19%), Positives = 36/81 (44%), Gaps = 17/81 (20%)

Query: 171 RQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRL 230
           RQ+L++  + + + L              +  ++++ E  E +     + E+LQ  +  L
Sbjct: 1   RQRLDQMLKPVDETL--------------QTFEQRVEESFEKVA---EELERLQEGLGEL 43

Query: 231 TQENESAKEEVKEVLQALEEP 251
            + N+   +EV  +  AL+ P
Sbjct: 44  QELNQQLAKEVGNLTAALKNP 64


>gnl|CDD|151173 pfam10669, Phage_Gp23, Protein gp23 (Bacteriophage A118).  This is
           the highly conserved family of the major tail subunit
           protein.
          Length = 121

 Score = 27.0 bits (59), Expect = 6.8
 Identities = 15/59 (25%), Positives = 34/59 (57%), Gaps = 4/59 (6%)

Query: 169 EERQKLEEERERLYQQLDEKDEE----INQQSQYAEKLKEQIMEQEELITTARRDYEQL 223
           EER K+E ERE+  ++  E+ ++    +N+Q +  +K  + + +Q++ I +  +   +L
Sbjct: 50  EERDKMETEREKRDKESKEERDKFISTMNEQQRLMDKQNDMMGQQQQSIDSLSKSVGKL 108


>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
          Length = 1486

 Score = 28.0 bits (63), Expect = 7.1
 Identities = 17/71 (23%), Positives = 28/71 (39%), Gaps = 7/71 (9%)

Query: 169 EERQKLEEERERLYQQL-------DEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYE 221
            ++Q+ E       ++L       DE ++   +     E L E + E  E     R+  E
Sbjct: 530 RQQQRAERLLAEFCKRLGKNLDDEDELEQLQEELEARLESLSESVSEARERRMALRQQLE 589

Query: 222 QLQGEMTRLTQ 232
           QLQ  + RL  
Sbjct: 590 QLQARIQRLAA 600


>gnl|CDD|180307 PRK05892, PRK05892, nucleoside diphosphate kinase regulator;
           Provisional.
          Length = 158

 Score = 27.0 bits (60), Expect = 7.5
 Identities = 10/31 (32%), Positives = 14/31 (45%)

Query: 165 SLSNEERQKLEEERERLYQQLDEKDEEINQQ 195
            L+   R  LE E  RL  + D    E+N +
Sbjct: 7   GLAPAARDHLEAELARLRARRDRLAVEVNDR 37


>gnl|CDD|234263 TIGR03568, NeuC_NnaA, UDP-N-acetyl-D-glucosamine 2-epimerase,
           UDP-hydrolysing.  This family of enzymes catalyzes the
           combined epimerization and UDP-hydrolysis of
           UDP-N-acetylglucosamine to N-acetylmannosamine. This is
           in contrast to the related enzyme WecB (TIGR00236) which
           retains the UDP moiety. NeuC acts in concert with NeuA
           and NeuB to synthesize CMP-N5-acetyl-neuraminate.
          Length = 365

 Score = 27.9 bits (63), Expect = 7.6
 Identities = 12/20 (60%), Positives = 18/20 (90%)

Query: 231 TQENESAKEEVKEVLQALEE 250
           T E ESA+E++KE+L+AL+E
Sbjct: 212 TLEKESAEEQIKELLKALDE 231


>gnl|CDD|218915 pfam06156, DUF972, Protein of unknown function (DUF972).  This
           family consists of several hypothetical bacterial
           sequences. The function of this family is unknown.
          Length = 106

 Score = 26.5 bits (59), Expect = 7.8
 Identities = 19/70 (27%), Positives = 32/70 (45%), Gaps = 14/70 (20%)

Query: 181 LYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEE 240
           L+  L E ++++NQ      +LK+Q+ E  E              E   L  ENE  +E 
Sbjct: 6   LFDALSELEQQLNQLLAELGELKQQLAELLE--------------ENAELRIENEHLRER 51

Query: 241 VKEVLQALEE 250
           ++E+ Q  +E
Sbjct: 52  LEELEQEQKE 61


>gnl|CDD|218010 pfam04293, SpoVR, SpoVR like protein.  Bacillus subtilis stage V
           sporulation protein R is involved in spore cortex
           formation. Little is known about cortex biosynthesis,
           except that it depends on several sigma E controlled
           genes, including spoVR.
          Length = 427

 Score = 27.5 bits (62), Expect = 7.9
 Identities = 18/67 (26%), Positives = 29/67 (43%), Gaps = 7/67 (10%)

Query: 201 KLKEQIMEQEELITTARRDYEQLQGE--MTRLTQENESAKEEVKEVLQALEEPTRTLLNF 258
            L+E+   QEE     R +Y Q Q       +  + E  +EE     +  EEP   +L F
Sbjct: 176 SLEEEKARQEE-----REEYLQSQVNDLWRTIPPQREKLEEERARRKRFPEEPQEDILYF 230

Query: 259 FRQSSPK 265
             +++P 
Sbjct: 231 LEKNAPL 237


>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57.  Eukaryotic
           cilia and flagella are specialised organelles found at
           the periphery of cells of diverse organisms.
           Intra-flagellar transport (IFT) is required for the
           assembly and maintenance of eukaryotic cilia and
           flagella, and consists of the bidirectional movement of
           large protein particles between the base and the distal
           tip of the organelle. IFT particles contain multiple
           copies of two distinct protein complexes, A and B, which
           contain at least 6 and 11 protein subunits. IFT57 is
           part of complex B but is not, however, required for the
           core subunits to stay associated. This protein is known
           as Huntington-interacting protein-1 in humans.
          Length = 355

 Score = 27.7 bits (62), Expect = 8.1
 Identities = 25/102 (24%), Positives = 48/102 (47%), Gaps = 15/102 (14%)

Query: 151 SILPAIPGSGLMLGSLSNEERQKLE--EERERLYQQLDEKDEEINQQSQYAEKLKEQIME 208
           +I  ++P +   L  LS +  + LE    RE   + L+ + E++ Q+  Y E  +     
Sbjct: 227 AISKSLPETKQYLKKLSQDLSKALEKISSRE---KHLNNQLEQLVQE--YREARRT---- 277

Query: 209 QEELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
               ++  +  Y Q    ++ LT+E     EE+++V Q +EE
Sbjct: 278 ----LSQVQEKYNQASQGVSELTRELNEISEELEQVKQEMEE 315


>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
          Length = 1113

 Score = 28.0 bits (63), Expect = 8.3
 Identities = 16/82 (19%), Positives = 31/82 (37%), Gaps = 8/82 (9%)

Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
           ++  + +EE E+L QQL +   ++ Q     E LK+            R     L   + 
Sbjct: 73  DKIDRQKEETEQLKQQLAQAPAKLRQAQAELEALKDD------NDEETRETLSTL--SLR 124

Query: 229 RLTQENESAKEEVKEVLQALEE 250
           +L        ++++     L E
Sbjct: 125 QLESRLAQTLDQLQNAQNDLAE 146


>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 257 to 277 amino acids in length. This domain is
           found associated with pfam00004. This domain has a
           conserved LER sequence motif.
          Length = 276

 Score = 27.4 bits (61), Expect = 8.3
 Identities = 25/93 (26%), Positives = 44/93 (47%), Gaps = 11/93 (11%)

Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITT----------AR 217
            EER+K  +E+ +  QQ  +  +E+ ++ +Y ++L++Q  + EEL+             R
Sbjct: 91  AEERRKTLQEQTQQEQQRAQYQDELARK-RYQKELEQQRRQNEELLKMQEESVLRQEAMR 149

Query: 218 RDYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
           R  E+   EM R T E E+  E      +   E
Sbjct: 150 RATEEEILEMRRETIEEEAELERENIRAKIEAE 182


>gnl|CDD|204784 pfam11932, DUF3450, Protein of unknown function (DUF3450).  This
           family of proteins are functionally uncharacterized.
           This protein is found in bacteria and eukaryotes.
           Proteins in this family are about 260 amino acids in
           length.
          Length = 250

 Score = 27.6 bits (62), Expect = 8.5
 Identities = 12/78 (15%), Positives = 31/78 (39%), Gaps = 10/78 (12%)

Query: 184 QLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQEN-------ES 236
           Q     ++    +  ++K  ++  +Q +       +  QL  E+  L   N        +
Sbjct: 26  QALAAAQQTVAAAAQSQKKIDKWADQTQ---ELLAEIRQLLKEIENLRVYNDQLQRLVAN 82

Query: 237 AKEEVKEVLQALEEPTRT 254
            ++E+  + Q +E+  +T
Sbjct: 83  QQQEIASLQQQIEQIEKT 100


>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
          Length = 535

 Score = 27.5 bits (61), Expect = 8.5
 Identities = 21/82 (25%), Positives = 44/82 (53%), Gaps = 12/82 (14%)

Query: 170 ERQKLEEERERLYQQ---LDEKDEEINQQSQYAEKLKEQIMEQ-EELITTARRDYEQLQG 225
           ERQ+L++   RL ++   LD KDE ++         KE+ +E  E+ +T   +  ++ + 
Sbjct: 95  ERQELKQIESRLTERATSLDRKDENLSS--------KEKTLESKEQSLTDKSKHIDEREE 146

Query: 226 EMTRLTQENESAKEEVKEVLQA 247
           ++ +L ++ ++  E V  + QA
Sbjct: 147 QVEKLEEQKKAELERVAALSQA 168


>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein.  Members of this
           protein family belong to a conserved gene four-gene
           neighborhood found sporadically in a phylogenetically
           broad range of bacteria: Nocardia farcinica,
           Symbiobacterium thermophilum, and Streptomyces
           avermitilis (Actinobacteria), Geobacillus kaustophilus
           (Firmicutes), Azoarcus sp. EbN1 and Ralstonia
           solanacearum (Betaproteobacteria). Proteins in this
           family average over 1400 amino acids in length
           [Hypothetical proteins, Conserved].
          Length = 1353

 Score = 27.9 bits (62), Expect = 8.6
 Identities = 19/79 (24%), Positives = 37/79 (46%), Gaps = 4/79 (5%)

Query: 174 LEEERERL--YQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLT 231
           L+E R+ L   + L+       Q+  Y    +  +  +   + +A+  Y+QL  ++ R  
Sbjct: 232 LDEYRDELERLEALERALRNFLQR--YRRYARTMLRRRATRLRSAQTQYDQLSRDLGRAR 289

Query: 232 QENESAKEEVKEVLQALEE 250
            E E+A+EE +E+    E 
Sbjct: 290 DELETAREEERELDARTEA 308


>gnl|CDD|236797 PRK10927, PRK10927, essential cell division protein FtsN;
           Provisional.
          Length = 319

 Score = 27.3 bits (60), Expect = 8.7
 Identities = 24/111 (21%), Positives = 49/111 (44%), Gaps = 9/111 (8%)

Query: 154 PAIPGSGLMLGSLSNEERQKLEEERERLYQQ--------LDEKDEEINQQSQYAEKLKEQ 205
           P+  G       L+ E+RQ LE+ +  + QQ         +E+  E  QQ+   ++  +Q
Sbjct: 102 PSAGGEVKTPEQLTPEQRQLLEQMQADMRQQPTQLVEVPWNEQTPEQRQQTLQRQRQAQQ 161

Query: 206 IMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEEPTRTLL 256
           +  +++ +    R  EQ   + TR +Q      +  +    + ++P + LL
Sbjct: 162 L-AEQQRLAQQSRTTEQSWQQQTRTSQAAPVQAQPRQSKPASTQQPYQDLL 211


>gnl|CDD|220897 pfam10883, DUF2681, Protein of unknown function (DUF2681).  This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 81 and 117 amino acids
           in length.
          Length = 87

 Score = 25.8 bits (57), Expect = 8.7
 Identities = 18/42 (42%), Positives = 24/42 (57%), Gaps = 2/42 (4%)

Query: 216 ARRDYEQLQGEMTRLTQENESAKEEVK--EVLQALEEPTRTL 255
           A+R+  +LQ E  +L  E   A+ EVK  +V Q  EE TR L
Sbjct: 28  AQRENRKLQAENEQLATEKAVAETEVKNAKVRQKNEENTRRL 69


>gnl|CDD|233056 TIGR00619, sbcd, exonuclease SbcD.  All proteins in this family for
           which functions are known are double-stranded DNA
           exonuclease (as part of a complex with SbcC homologs).
           This complex functions in the initiation of
           recombination and recombinational repair and is
           particularly important in regulating the stability of
           DNA sections that can form secondary structures. This
           family is likely homologous to the MRE11 family. This
           family is based on the phylogenomic analysis of JA Eisen
           (1999, Ph.D. Thesis, Stanford University) [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 253

 Score = 27.4 bits (61), Expect = 8.8
 Identities = 16/66 (24%), Positives = 20/66 (30%), Gaps = 5/66 (7%)

Query: 125 EDEQVNLADPVDMAASIAPTPESAPASILPA-----IPGSGLMLGSLSNEERQKLEEERE 179
            D Q+ L         +       P   L         G  L+L  L  + RQ LE    
Sbjct: 111 GDPQILLLKDTANGELLIVGLPLLPREALLTRAGLDGFGLELLLLHLDVKLRQALEGLIR 170

Query: 180 RLYQQL 185
           RL   L
Sbjct: 171 RLDPDL 176


>gnl|CDD|153370 cd07686, F-BAR_Fer, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Fer (Fes related)
           tyrosine kinase.  F-BAR domains are dimerization modules
           that bind and bend membranes and are found in proteins
           involved in membrane dynamics and actin reorganization.
           Fer (Fes related) is a cytoplasmic (or nonreceptor)
           tyrosine kinase expressed in a wide variety of tissues,
           and is found to reside in both the cytoplasm and the
           nucleus. It plays important roles in neuronal
           polarization and neurite development, cytoskeletal
           reorganization, cell migration, growth factor signaling,
           and the regulation of cell-cell interactions mediated by
           adherens junctions and focal adhesions. Fer kinase also
           regulates cell cycle progression in malignant cells. It
           contains an N-terminal F-BAR domain, an SH2 domain, and
           a C-terminal catalytic kinase domain. F-BAR domains form
           banana-shaped dimers with a positively-charged concave
           surface that binds to negatively-charged lipid
           membranes. They can induce membrane deformation in the
           form of long tubules.
          Length = 234

 Score = 27.3 bits (60), Expect = 8.8
 Identities = 18/67 (26%), Positives = 37/67 (55%), Gaps = 1/67 (1%)

Query: 209 QEELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEEPTRTLLNFFRQSSPKGTL 268
           + E+    + + E+L+    +LT+E  SAKE+ K+ +   +E T      + +++ K  +
Sbjct: 118 EAEMYKVTKTELEKLKCSYRQLTKEVNSAKEKYKDAVAKGKE-TEKARERYDKATMKLHM 176

Query: 269 LYNHYLL 275
           L+N Y+L
Sbjct: 177 LHNQYVL 183


>gnl|CDD|227365 COG5032, TEL1, Phosphatidylinositol kinase and protein kinases of the
            PI-3 kinase family [Signal transduction mechanisms / Cell
            division and chromosome partitioning / Chromatin
            structure and dynamics / DNA replication, recombination,
            and repair / Intracellular trafficking and secretion].
          Length = 2105

 Score = 27.8 bits (62), Expect = 8.9
 Identities = 34/220 (15%), Positives = 64/220 (29%), Gaps = 52/220 (23%)

Query: 103  YKGKLEKLELELSRWRNGETV-------------------KEDEQVNLADPVDMAASIAP 143
                L++    L+R+    +                    K +   +LA+ +DM  S+  
Sbjct: 1089 LVKYLQRWYEALNRYFELLSKGDRLFAISFTKLRNVDALGKLELYSSLAE-IDMFLSLHR 1147

Query: 144  TPESAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLK 203
              +     +  A    G         + Q+L E       Q   + +E     QY     
Sbjct: 1148 RRKLLETLVATAYEQVGEW------YKAQQLYEV-----AQRKARSKEFPFSLQYLYWHI 1196

Query: 204  EQIMEQEELITTARRD------YEQLQGEMTRLTQEN------ESAKEEVKEVL--QALE 249
              I   ++L +            E L  E  R    +      ES  EE+ + +     +
Sbjct: 1197 NDIDCADKLQSVLAELSLVTGISELLLEESWRRALFSNIKDSLESELEEIIDGMYKSNED 1256

Query: 250  EPTRTLLNFFRQSSPKGTLLYNHYLLGWPNQSERVSLWYR 289
                 LL+   +         +  L G  + S+ + L   
Sbjct: 1257 FGALMLLSLSAELW-------DKILEGRSSCSKSIKLSLN 1289


>gnl|CDD|221190 pfam11727, ISG65-75, Invariant surface glycoprotein.  This family
           is found in Trypanosome species, and appears to be one
           of two invariant surface glycoproteins, ISG65 and ISG75.
           that are found in the mammalian stage of the parasitic
           protozoan. the sequence suggests the two families are
           polypeptides with N-terminal signal sequences,
           hydrophilic extracellular domains, single trans-membrane
           alpha-helices and short cytoplasmic domains. they are
           both expressed in the bloodstream form but not in the
           midgut stage. Both polypeptides are distributed over the
           entire surface of the parasite.
          Length = 286

 Score = 27.5 bits (61), Expect = 8.9
 Identities = 5/42 (11%), Positives = 22/42 (52%)

Query: 175 EEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTA 216
           + +++++ +  ++  ++I++    A+K   +  +  E +  A
Sbjct: 95  DSDKKKIKEICEKAKDKIDEALPEAKKAGREAEKLAEEVKEA 136


>gnl|CDD|221049 pfam11262, Tho2, Transcription factor/nuclear export subunit
           protein 2.  THO and TREX form a eukaryotic complex which
           functions in messenger ribonucleoprotein metabolism and
           plays a role in preventing the transcription-associated
           genetic instability. Tho2, along with four other
           subunits forms THO.
          Length = 296

 Score = 27.3 bits (61), Expect = 8.9
 Identities = 13/62 (20%), Positives = 31/62 (50%), Gaps = 6/62 (9%)

Query: 195 QSQYA---EKLKEQIMEQE---ELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQAL 248
           +  Y    E+L++QI E +     I   +++ ++L+  + +L +E +   E  ++  + L
Sbjct: 26  KELYDKEIERLEKQIKELDSSSSGIDKKKKEKKRLKSLIKKLEEELKKHIEHNEKTKKRL 85

Query: 249 EE 250
            E
Sbjct: 86  SE 87


>gnl|CDD|214660 smart00434, TOP4c, DNA Topoisomerase IV.  Bacterial DNA
           topoisomerase IV, GyrA, ParC.
          Length = 444

 Score = 27.5 bits (62), Expect = 9.0
 Identities = 23/100 (23%), Positives = 37/100 (37%), Gaps = 17/100 (17%)

Query: 98  EKSSRY-KGKLEKLELELSRWRNGETVKEDEQVNL----ADPVDMA-----ASIAPTPES 147
            +   Y  GKLE   L +           DE + L     D    A          +   
Sbjct: 340 TRRKEYLLGKLEAERLHILEGLFIALSIIDEIIVLIRSSKDLAKEAKEKLMERFELSEIQ 399

Query: 148 APASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDE 187
           A A IL       + L  L+  E +KLE+E + L +++++
Sbjct: 400 ADA-IL------DMRLRRLTKLEVEKLEKELKELEKEIED 432


>gnl|CDD|205988 pfam13815, Dzip-like_N, Iguana/Dzip1-like DAZ-interacting protein
           N-terminal.  The DAZ gene-product - Deleted in
           Azoospermia - and a closely related sequence are
           required early in germ-cell development in order to
           maintain germ-cell populations. This family is the
           N-terminal region that is the only part of the protein
           in some fungi and lower metazoa.
          Length = 118

 Score = 26.5 bits (59), Expect = 9.0
 Identities = 12/45 (26%), Positives = 25/45 (55%)

Query: 174 LEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARR 218
           L   +E L Q   E +E + +  Q AE+ ++++ +Q E + + +R
Sbjct: 71  LLHSQECLTQSNSELEERLQEALQEAEEREQELSKQSEELKSLKR 115


>gnl|CDD|217020 pfam02403, Seryl_tRNA_N, Seryl-tRNA synthetase N-terminal domain.
           This domain is found associated with the Pfam tRNA
           synthetase class II domain (pfam00587) and represents
           the N-terminal domain of seryl-tRNA synthetase.
          Length = 108

 Score = 26.4 bits (59), Expect = 9.2
 Identities = 22/85 (25%), Positives = 42/85 (49%), Gaps = 11/85 (12%)

Query: 173 KLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQ 232
           +L+EER  L  +L+E   E N+ S+   K K++  + E LI            E+  L  
Sbjct: 33  ELDEERRELQVELEELQAERNELSKEIGKAKKKKEDAEALI-----------AEVKELKD 81

Query: 233 ENESAKEEVKEVLQALEEPTRTLLN 257
           E ++ + E++E+   L++   ++ N
Sbjct: 82  ELKALEAELRELEAELDKLLLSIPN 106


>gnl|CDD|218760 pfam05812, Herpes_BLRF2, Herpesvirus BLRF2 protein.  This family
           consists of several Herpesvirus BLRF2 proteins.
          Length = 118

 Score = 26.5 bits (59), Expect = 9.3
 Identities = 10/39 (25%), Positives = 22/39 (56%)

Query: 219 DYEQLQGEMTRLTQENESAKEEVKEVLQALEEPTRTLLN 257
             E+L  E+ +L  EN++ K+++++ +   E P   +L 
Sbjct: 4   TVEELAAELQKLQMENKALKKKLRQSVGPGESPDDEILT 42


>gnl|CDD|223266 COG0188, GyrA, Type IIA topoisomerase (DNA gyrase/topo II,
           topoisomerase IV), A subunit [DNA replication,
           recombination, and repair].
          Length = 804

 Score = 27.6 bits (62), Expect = 9.3
 Identities = 12/58 (20%), Positives = 34/58 (58%)

Query: 161 LMLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARR 218
           L L  L+  E +K+E+E + L +++ + ++ +  + +  + +K++++E ++     RR
Sbjct: 426 LRLRRLTGLEEEKIEKELKELEKEIADLEKILASEERLLDIIKKELLEIKKKFGDERR 483


>gnl|CDD|218623 pfam05524, PEP-utilisers_N, PEP-utilising enzyme, N-terminal. 
          Length = 123

 Score = 26.4 bits (59), Expect = 9.4
 Identities = 25/79 (31%), Positives = 37/79 (46%), Gaps = 6/79 (7%)

Query: 175 EEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARR---DYEQLQGEMTRLT 231
           E E  RL   L++  EE+      AE+  E + E+E  I  A     D  +L  E+  L 
Sbjct: 34  EAEIARLEAALEKAREELEAL---AERAAESLGEEEAAIFEAHLMLLDDPELLEEVEELI 90

Query: 232 QENESAKEEVKEVLQALEE 250
           +E  SA+  VKEV++    
Sbjct: 91  REGLSAEAAVKEVIEEYAA 109


>gnl|CDD|222709 pfam14362, DUF4407, Domain of unknown function (DUF4407).  This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 366 and 597 amino
           acids in length. There is a single completely conserved
           residue R that may be functionally important.
          Length = 297

 Score = 27.2 bits (61), Expect = 9.4
 Identities = 17/97 (17%), Positives = 28/97 (28%), Gaps = 23/97 (23%)

Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITT------------ 215
             +   L  E   L  ++DE   E+N               Q E   T            
Sbjct: 134 RPKIAALTAEIAALQAEIDEAQAEVNAA---------YQEAQCEAEGTGGTGVAGKGPVY 184

Query: 216 --ARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
              R   +  Q  +  L    ++A  +++    ALE 
Sbjct: 185 KEKREKLDAAQARLETLKARLDAAIAQLEAQKAALER 221


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 27.7 bits (62), Expect = 9.4
 Identities = 19/84 (22%), Positives = 44/84 (52%), Gaps = 4/84 (4%)

Query: 164 GSLSNEERQKLE----EERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRD 219
            +++ E + +LE    EE + L +QL+E  ++     + A +  E I++ E+  +  + D
Sbjct: 262 QTITEERKAELEAHFDEEYQELIEQLEELIDKYESHIEKALEELESILDTEKENSEFKLD 321

Query: 220 YEQLQGEMTRLTQENESAKEEVKE 243
            E+L+  +  L +  E   ++++E
Sbjct: 322 VEELKALLEALEEILEKNLQKLEE 345


>gnl|CDD|203138 pfam04977, DivIC, Septum formation initiator.  DivIC from B.
           subtilis is necessary for both vegetative and
           sporulation septum formation. These proteins are mainly
           composed of an amino terminal coiled-coil.
          Length = 80

 Score = 25.6 bits (57), Expect = 9.6
 Identities = 14/45 (31%), Positives = 23/45 (51%), Gaps = 3/45 (6%)

Query: 217 RRDYEQLQGEMTRLTQENESAKEEVKEVLQ---ALEEPTRTLLNF 258
            ++   LQ E+ +L  ENE  + EVK++      +EE  R+ L  
Sbjct: 23  NQEIAALQAELAKLKAENEELEAEVKDLKSDPDYIEERARSELGL 67


>gnl|CDD|220373 pfam09738, DUF2051, Double stranded RNA binding protein (DUF2051). 
           This is a novel protein identified as interacting with
           the leucine-rich repeat domain of human flightless-I,
           FliI protein.
          Length = 294

 Score = 27.2 bits (60), Expect = 9.8
 Identities = 19/97 (19%), Positives = 45/97 (46%), Gaps = 12/97 (12%)

Query: 165 SLSNEERQKLEEERERLYQQLDEK-----------DEEINQQSQYAEKLKEQIMEQEELI 213
           S S +    + E ++ L  +++EK           D E        + LK+++ E EE +
Sbjct: 69  SSSVDTEASIRELKDSL-IEVEEKVKKAMVSNAQLDNEKTNLMYEVDLLKDKLEELEESL 127

Query: 214 TTARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
              +R++E+   ++  L +  +  + ++KE+ + L +
Sbjct: 128 AELQREFEEKTRDLEALKRAIDILQHQLKELKEQLRQ 164


>gnl|CDD|214016 cd12923, iSH2_PI3K_IA_R, Inter-Src homology 2 (iSH2) helical domain
           of Class IA Phosphoinositide 3-kinase Regulatory
           subunits.  PI3Ks catalyze the transfer of the
           gamma-phosphoryl group from ATP to the 3-hydroxyl of the
           inositol ring of D-myo-phosphatidylinositol (PtdIns) or
           its derivatives. They play an important role in a
           variety of fundamental cellular processes, including
           cell motility, the Ras pathway, vesicle trafficking and
           secretion, immune cell activation, and apoptosis. They
           are classified according to their substrate specificity,
           regulation, and domain structure. Class IA PI3Ks are
           heterodimers of a p110 catalytic (C) subunit and a
           p85-related regulatory (R) subunit. The R subunit
           down-regulates PI3K basal activity, stabilizes the C
           subunit, and plays a role in the activation downstream
           of tyrosine kinases. All R subunits contain two SH2
           domains that flank an intervening helical domain (iSH2),
           which binds to the N-terminal adaptor-binding domain
           (ABD) of the catalytic subunit. In vertebrates, there
           are three genes (PIK3R1, PIK3R2, and PIK3R3) that encode
           for different Class IA PI3K R subunits.
          Length = 152

 Score = 26.8 bits (60), Expect = 9.8
 Identities = 19/69 (27%), Positives = 36/69 (52%), Gaps = 3/69 (4%)

Query: 200 EKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENES---AKEEVKEVLQALEEPTRTLL 256
           EKL +++ E  +      R+Y++L  +  +L+QE +    A E  +E ++  EE  RT  
Sbjct: 4   EKLAKKLKEINKEYLDKSREYDELYEKYNKLSQEIQLKRQALEAFEEAVKMFEEQLRTQE 63

Query: 257 NFFRQSSPK 265
            F +++ P 
Sbjct: 64  KFQKEAQPH 72


>gnl|CDD|218311 pfam04888, SseC, Secretion system effector C (SseC) like family.
           SseC is a secreted protein that forms a complex together
           with SecB and SecD on the surface of Salmonella. All
           these proteins are secreted by the type III secretion
           system. Many mucosal pathogens use type III secretion
           systems for the injection of effector proteins into
           target cells. SecB, SseC and SecD are inserted into the
           target cell membrane. where they form a small pore or
           translocon. In addition to SseC, this family includes
           the bacterial secreted proteins PopB, PepB, YopB and
           EspD which are thought to be directly involved in pore
           formation, and type III secretion system translocon.
          Length = 303

 Score = 27.4 bits (61), Expect = 10.0
 Identities = 12/53 (22%), Positives = 27/53 (50%), Gaps = 6/53 (11%)

Query: 160 GLMLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEEL 212
             ++  L+ ++ +   ++ ER   + ++K EE      Y E++K+ I + EE 
Sbjct: 6   TELISKLAEKQAKSKLQQLERARDKQEKKAEE------YQEQIKKAIEKAEEA 52


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.310    0.127    0.351 

Gapped
Lambda     K      H
   0.267   0.0788    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 14,611,958
Number of extensions: 1412506
Number of successful extensions: 5085
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4184
Number of HSP's successfully gapped: 1010
Length of query: 290
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 194
Effective length of database: 6,679,618
Effective search space: 1295845892
Effective search space used: 1295845892
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 58 (26.0 bits)