RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy5329
         (163 letters)



>gnl|CDD|238681 cd01385, MYSc_type_IX, Myosin motor domain, type IX myosins. Myosin
           IX is a processive single-headed motor, which might play
           a role in signalling. This catalytic (head) domain has
           ATPase activity and belongs to the larger group of
           P-loop NTPases. Myosins are actin-dependent molecular
           motors that play important roles in muscle contraction,
           cell motility, and organelle transport. The head domain
           is a molecular motor, which utilizes ATP hydrolysis to
           generate directed movement toward the plus end along
           actin filaments. A cyclical interaction between myosin
           and actin provides the driving force. Rates of ATP
           hydrolysis and consequently the speed of movement along
           actin filaments vary widely, from about 0.04 micrometer
           per second for myosin I to 4.5 micrometer per second for
           myosin II in skeletal muscle. Myosin II moves in
           discrete steps about 5-10 nm long and generates 1-5
           piconewtons of force. Upon ATP binding, the myosin head
           dissociates from an actin filament. ATP hydrolysis
           causes the head to pivot and associate with a new actin
           subunit. The release of Pi causes the head to pivot and
           move the filament (power stroke). Release of ADP
           completes the cycle.
          Length = 692

 Score =  115 bits (290), Expect = 1e-30
 Identities = 44/74 (59%), Positives = 55/74 (74%)

Query: 15  EEEEEEEDFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVE 74
           E+      FEQLCINYANE LQYYFNQH+F+ EQEEY  EGI W +IE++DN  C+QL  
Sbjct: 387 EDFGRCNSFEQLCINYANEQLQYYFNQHIFKLEQEEYQGEGITWTNIEYTDNVGCIQLFS 446

Query: 75  GKPNGLLCVLDDQA 88
            KP GLL +LD+++
Sbjct: 447 KKPTGLLYLLDEES 460


>gnl|CDD|214580 smart00242, MYSc, Myosin. Large ATPases.  ATPase; molecular motor.
           Muscle contraction consists of a cyclical interaction
           between myosin and actin. The core of the myosin
           structure is similar in fold to that of kinesin.
          Length = 677

 Score =  114 bits (288), Expect = 3e-30
 Identities = 41/67 (61%), Positives = 52/67 (77%)

Query: 23  FEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLLC 82
           FEQLCINYANE LQ +FNQHVF+ EQEEY +EGI W  I+F DN  C+ L+E KP G+L 
Sbjct: 387 FEQLCINYANEKLQQFFNQHVFKLEQEEYEREGIDWTFIDFFDNQDCIDLIEKKPPGILS 446

Query: 83  VLDDQAK 89
           +LD++ +
Sbjct: 447 LLDEECR 453


>gnl|CDD|238071 cd00124, MYSc, Myosin motor domain. This catalytic (head) domain
           has ATPase activity and belongs to the larger group of
           P-loop NTPases. Myosins are actin-dependent molecular
           motors that play important roles in muscle contraction,
           cell motility, and organelle transport. The head domain
           is a molecular motor, which utilizes ATP hydrolysis to
           generate directed movement toward the plus end along
           actin filaments. A cyclical interaction between myosin
           and actin provides the driving force. Rates of ATP
           hydrolysis and consequently the speed of movement along
           actin filaments vary widely, from about 0.04 micrometer
           per second for myosin I to 4.5 micrometer per second for
           myosin II in skeletal muscle. Myosin II moves in
           discrete steps about 5-10 nm long and generates 1-5
           piconewtons of force. Upon ATP binding, the myosin head
           dissociates from an actin filament. ATP hydrolysis
           causes the head to pivot and associate with a new actin
           subunit. The release of Pi causes the head to pivot and
           move the filament (power stroke). Release of ADP
           completes the cycle.
          Length = 679

 Score =  107 bits (269), Expect = 1e-27
 Identities = 41/65 (63%), Positives = 51/65 (78%)

Query: 23  FEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLLC 82
           FEQLCINYANE LQ +FNQHVF+ EQEEY +EGI W  I+F+DN   + L+E KP GLL 
Sbjct: 380 FEQLCINYANEKLQQFFNQHVFKLEQEEYQEEGIDWESIDFTDNQEVIDLIEKKPGGLLS 439

Query: 83  VLDDQ 87
           +LD++
Sbjct: 440 LLDEE 444


>gnl|CDD|238677 cd01381, MYSc_type_VII, Myosin motor domain, type VII myosins.
           Myosins in this group have been associated with
           functions in sensory systems such as vision and hearing.
           This catalytic (head) domain has ATPase activity and
           belongs to the larger group of P-loop NTPases. Myosins
           are actin-dependent molecular motors that play important
           roles in muscle contraction, cell motility, and
           organelle transport. The head domain is a molecular
           motor, which utilizes ATP hydrolysis to generate
           directed movement toward the plus end along actin
           filaments. A cyclical interaction between myosin and
           actin provides the driving force. Rates of ATP
           hydrolysis and consequently the speed of movement along
           actin filaments vary widely, from about 0.04 micrometer
           per second for myosin I to 4.5 micrometer per second for
           myosin II in skeletal muscle. Myosin II moves in
           discrete steps about 5-10 nm long and generates 1-5
           piconewtons of force. Upon ATP binding, the myosin head
           dissociates from an actin filament. ATP hydrolysis
           causes the head to pivot and associate with a new actin
           subunit. The release of Pi causes the head to pivot and
           move the filament (power stroke). Release of ADP
           completes the cycle.
          Length = 671

 Score =  100 bits (251), Expect = 2e-25
 Identities = 38/74 (51%), Positives = 53/74 (71%)

Query: 16  EEEEEEDFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEG 75
           E  +   FEQLCIN+ANE+LQ +F QH+F+ EQEEYN E I W+HIEF DN   L L+  
Sbjct: 373 ENFDVNSFEQLCINFANENLQQFFVQHIFKLEQEEYNLEHINWQHIEFVDNQDALDLIAI 432

Query: 76  KPNGLLCVLDDQAK 89
           KP  ++ ++D+++K
Sbjct: 433 KPLNIMSLIDEESK 446


>gnl|CDD|238680 cd01384, MYSc_type_XI, Myosin motor domain, plant-specific type XI
           myosin, involved in organelle transport. This catalytic
           (head) domain has ATPase activity and belongs to the
           larger group of P-loop NTPases. Myosins are
           actin-dependent molecular motors that play important
           roles in muscle contraction, cell motility, and
           organelle transport. The head domain is a molecular
           motor, which utilizes ATP hydrolysis to generate
           directed movement toward the plus end along actin
           filaments. A cyclical interaction between myosin and
           actin provides the driving force. Rates of ATP
           hydrolysis and consequently the speed of movement along
           actin filaments vary widely, from about 0.04 micrometer
           per second for myosin I to 4.5 micrometer per second for
           myosin II in skeletal muscle. Myosin II moves in
           discrete steps about 5-10 nm long and generates 1-5
           piconewtons of force. Upon ATP binding, the myosin head
           dissociates from an actin filament. ATP hydrolysis
           causes the head to pivot and associate with a new actin
           subunit. The release of Pi causes the head to pivot and
           move the filament (power stroke). Release of ADP
           completes the cycle.
          Length = 674

 Score =  100 bits (251), Expect = 3e-25
 Identities = 39/71 (54%), Positives = 48/71 (67%)

Query: 16  EEEEEEDFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEG 75
           E  +   FEQ CIN  NE LQ +FNQHVF+ EQEEY KE I W +IEF DN   L L+E 
Sbjct: 378 ESFKTNSFEQFCINLTNEKLQQHFNQHVFKMEQEEYTKEEIDWSYIEFVDNQDVLDLIEK 437

Query: 76  KPNGLLCVLDD 86
           KP G++ +LD+
Sbjct: 438 KPGGIIALLDE 448


>gnl|CDD|215687 pfam00063, Myosin_head, Myosin head (motor domain). 
          Length = 679

 Score =  100 bits (251), Expect = 3e-25
 Identities = 38/68 (55%), Positives = 51/68 (75%), Gaps = 1/68 (1%)

Query: 23  FEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFS-DNTLCLQLVEGKPNGLL 81
           FEQLCINYANE LQ +FN H+F+ EQEEY +EGI W  I++  DN  C+ L+E KP G+L
Sbjct: 381 FEQLCINYANEKLQQFFNHHMFKLEQEEYVREGIAWTFIDYGLDNQACIDLIEKKPPGIL 440

Query: 82  CVLDDQAK 89
            +LD++ +
Sbjct: 441 SLLDEECR 448


>gnl|CDD|238676 cd01380, MYSc_type_V, Myosin motor domain, type V myosins. Myosins
           V transport a variety of intracellular cargo
           processively along actin filaments, such as membraneous
           organelles and mRNA. This catalytic (head) domain has
           ATPase activity and belongs to the larger group of
           P-loop NTPases. Myosins are actin-dependent molecular
           motors that play important roles in muscle contraction,
           cell motility, and organelle transport. The head domain
           is a molecular motor, which utilizes ATP hydrolysis to
           generate directed movement toward the plus end along
           actin filaments. A cyclical interaction between myosin
           and actin provides the driving force. Rates of ATP
           hydrolysis and consequently the speed of movement along
           actin filaments vary widely, from about 0.04 micrometer
           per second for myosin I to 4.5 micrometer per second for
           myosin II in skeletal muscle. Myosin II moves in
           discrete steps about 5-10 nm long and generates 1-5
           piconewtons of force. Upon ATP binding, the myosin head
           dissociates from an actin filament. ATP hydrolysis
           causes the head to pivot and associate with a new actin
           subunit. The release of Pi causes the head to pivot and
           move the filament (power stroke). Release of ADP
           completes the cycle.
          Length = 691

 Score = 98.2 bits (245), Expect = 2e-24
 Identities = 40/67 (59%), Positives = 49/67 (73%), Gaps = 1/67 (1%)

Query: 23  FEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLLC 82
           FEQ CINYANE LQ  FNQHVF+ EQEEY KEGI W  I+F DN  C+ L+E K  G+L 
Sbjct: 386 FEQFCINYANEKLQQQFNQHVFKLEQEEYLKEGIEWTFIDFYDNQPCIDLIESKL-GILS 444

Query: 83  VLDDQAK 89
           +LD++ +
Sbjct: 445 LLDEECR 451


>gnl|CDD|238683 cd01387, MYSc_type_XV, Myosin motor domain, type XV myosins. In
           vertebrates, myosin XV appears to be expressed in
           sensory tissue and play a role in hearing. This
           catalytic (head) domain has ATPase activity and belongs
           to the larger group of P-loop NTPases. Myosins are
           actin-dependent molecular motors that play important
           roles in muscle contraction, cell motility, and
           organelle transport. The head domain is a molecular
           motor, which utilizes ATP hydrolysis to generate
           directed movement toward the plus end along actin
           filaments. A cyclical interaction between myosin and
           actin provides the driving force. Rates of ATP
           hydrolysis and consequently the speed of movement along
           actin filaments vary widely, from about 0.04 micrometer
           per second for myosin I to 4.5 micrometer per second for
           myosin II in skeletal muscle. Myosin II moves in
           discrete steps about 5-10 nm long and generates 1-5
           piconewtons of force. Upon ATP binding, the myosin head
           dissociates from an actin filament. ATP hydrolysis
           causes the head to pivot and associate with a new actin
           subunit. The release of Pi causes the head to pivot and
           move the filament (power stroke). Release of ADP
           completes the cycle.
          Length = 677

 Score = 96.9 bits (241), Expect = 6e-24
 Identities = 40/72 (55%), Positives = 50/72 (69%)

Query: 16  EEEEEEDFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEG 75
           E+     FEQLCINYANE+LQY FN+ VFQ EQEEY +E + W  I F+DN   + L+  
Sbjct: 372 EDLSFNSFEQLCINYANENLQYLFNKIVFQEEQEEYIREQLDWTEIAFADNQPVINLISL 431

Query: 76  KPNGLLCVLDDQ 87
           KP G+L +LDDQ
Sbjct: 432 KPYGILRILDDQ 443


>gnl|CDD|238679 cd01383, MYSc_type_VIII, Myosin motor domain, plant-specific type
           VIII myosins, a subgroup which has been associated with
           endocytosis, cytokinesis, cell-to-cell coupling and
           gating at plasmodesmata. This catalytic (head) domain
           has ATPase activity and belongs to the larger group of
           P-loop NTPases. Myosins are actin-dependent molecular
           motors that play important roles in muscle contraction,
           cell motility, and organelle transport. The head domain
           is a molecular motor, which utilizes ATP hydrolysis to
           generate directed movement toward the plus end along
           actin filaments. A cyclical interaction between myosin
           and actin provides the driving force. Rates of ATP
           hydrolysis and consequently the speed of movement along
           actin filaments vary widely, from about 0.04 micrometer
           per second for myosin I to 4.5 micrometer per second for
           myosin II in skeletal muscle. Myosin II moves in
           discrete steps about 5-10 nm long and generates 1-5
           piconewtons of force. Upon ATP binding, the myosin head
           dissociates from an actin filament. ATP hydrolysis
           causes the head to pivot and associate with a new actin
           subunit. The release of Pi causes the head to pivot and
           move the filament (power stroke). Release of ADP
           completes the cycle.
          Length = 677

 Score = 91.8 bits (228), Expect = 3e-22
 Identities = 39/66 (59%), Positives = 50/66 (75%)

Query: 23  FEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLLC 82
           FEQ CINYANE LQ +FN+H+F+ EQEEY ++GI W  +EF DN  CL L E KP GLL 
Sbjct: 383 FEQFCINYANERLQQHFNRHLFKLEQEEYEEDGIDWTKVEFEDNQECLDLFEKKPLGLLS 442

Query: 83  VLDDQA 88
           +LD+++
Sbjct: 443 LLDEES 448


>gnl|CDD|238675 cd01379, MYSc_type_III, Myosin motor domain, type III myosins.
           Myosin III has been shown to play a role in  the vision
           process in insects and in hearing in mammals. Myosin
           III, an unconventional myosin, does not form dimers.
           This catalytic (head) domain has ATPase activity and
           belongs to the larger group of P-loop NTPases. Myosins
           are actin-dependent molecular motors that play important
           roles in muscle contraction, cell motility, and
           organelle transport. The head domain is a molecular
           motor, which utilizes ATP hydrolysis to generate
           directed movement toward the plus end along actin
           filaments. A cyclical interaction between myosin and
           actin provides the driving force. Rates of ATP
           hydrolysis and consequently the speed of movement along
           actin filaments vary widely, from about 0.04 micrometer
           per second for myosin I to 4.5 micrometer per second for
           myosin II in skeletal muscle. Myosin II moves in
           discrete steps about 5-10 nm long and generates 1-5
           piconewtons of force. Upon ATP binding, the myosin head
           dissociates from an actin filament. ATP hydrolysis
           causes the head to pivot and associate with a new actin
           subunit. The release of Pi causes the head to pivot and
           move the filament (power stroke). Release of ADP
           completes the cycle.
          Length = 653

 Score = 86.8 bits (215), Expect = 2e-20
 Identities = 43/92 (46%), Positives = 62/92 (67%), Gaps = 4/92 (4%)

Query: 16  EEEEEEDFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEG 75
           E  ++  FEQLCIN ANE +QYYFNQH+F +EQ+EY  EG+  R +E+ DN   L +   
Sbjct: 384 ENFKKNSFEQLCINIANEQIQYYFNQHIFAWEQQEYLNEGVDARLVEYEDNRPLLDMFLQ 443

Query: 76  KPNGLLCVLDDQA---KETEKAGV-LYEDNIK 103
           KP GLL +LD+++   + T++  V  +EDN+K
Sbjct: 444 KPLGLLALLDEESRFPQATDQTLVEKFEDNLK 475


>gnl|CDD|238673 cd01377, MYSc_type_II, Myosin motor domain, type II myosins. Myosin
           II mediates cortical contraction in cell motility, and
           is the motor in smooth and skeletal muscle. This
           catalytic (head) domain has ATPase activity and belongs
           to the larger group of P-loop NTPases. Myosins are
           actin-dependent molecular motors that play important
           roles in muscle contraction, cell motility, and
           organelle transport. The head domain is a molecular
           motor, which utilizes ATP hydrolysis to generate
           directed movement toward the plus end along actin
           filaments. A cyclical interaction between myosin and
           actin provides the driving force. Rates of ATP
           hydrolysis and consequently the speed of movement along
           actin filaments vary widely, from about 0.04 micrometer
           per second for myosin I to 4.5 micrometer per second for
           myosin II in skeletal muscle. Myosin II moves in
           discrete steps about 5-10 nm long and generates 1-5
           piconewtons of force. Upon ATP binding, the myosin head
           dissociates from an actin filament. ATP hydrolysis
           causes the head to pivot and associate with a new actin
           subunit. The release of Pi causes the head to pivot and
           move the filament (power stroke). Release of ADP
           completes the cycle.
          Length = 693

 Score = 85.3 bits (212), Expect = 6e-20
 Identities = 38/75 (50%), Positives = 49/75 (65%), Gaps = 4/75 (5%)

Query: 23  FEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFS-DNTLCLQLVEGKPNGLL 81
           FEQLCINY NE LQ +FN H+F  EQEEY +EGI W  I+F  D    + L+E  P G+L
Sbjct: 392 FEQLCINYTNEKLQQFFNHHMFVLEQEEYQREGIEWTFIDFGLDLQPTIDLIEKNPMGIL 451

Query: 82  CVLDDQA---KETEK 93
            +LD++    K T+K
Sbjct: 452 SLLDEECVFPKATDK 466


>gnl|CDD|238674 cd01378, MYSc_type_I, Myosin motor domain, type I myosins. Myosin I
           generates movement at the leading edge in cell motility,
           and class I myosins have been implicated in phagocytosis
           and vesicle transport. Myosin I, an unconventional
           myosin, does not form dimers. This catalytic (head)
           domain has ATPase activity and belongs to the larger
           group of P-loop NTPases. Myosins are actin-dependent
           molecular motors that play important roles in muscle
           contraction, cell motility, and organelle transport. The
           head domain is a molecular motor, which utilizes ATP
           hydrolysis to generate directed movement toward the plus
           end along actin filaments. A cyclical interaction
           between myosin and actin provides the driving force.
           Rates of ATP hydrolysis and consequently the speed of
           movement along actin filaments vary widely, from about
           0.04 micrometer per second for myosin I to 4.5
           micrometer per second for myosin II in skeletal muscle.
           Myosin II moves in discrete steps about 5-10 nm long and
           generates 1-5 piconewtons of force. Upon ATP binding,
           the myosin head dissociates from an actin filament. ATP
           hydrolysis causes the head to pivot and associate with a
           new actin subunit. The release of Pi causes the head to
           pivot and move the filament (power stroke). Release of
           ADP completes the cycle.
          Length = 674

 Score = 83.0 bits (206), Expect = 3e-19
 Identities = 33/65 (50%), Positives = 43/65 (66%), Gaps = 1/65 (1%)

Query: 23  FEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGK-PNGLL 81
           FEQ CINY NE LQ  F +   + EQEEY +EGI+W  IE+ +N +   L+EGK P G+ 
Sbjct: 384 FEQFCINYVNEKLQQIFIELTLKAEQEEYVREGIKWTPIEYFNNKIVCDLIEGKRPPGIF 443

Query: 82  CVLDD 86
            +LDD
Sbjct: 444 SILDD 448


>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
          Length = 1463

 Score = 82.0 bits (203), Expect = 7e-19
 Identities = 40/71 (56%), Positives = 52/71 (73%), Gaps = 1/71 (1%)

Query: 19  EEEDFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGK-P 77
           E+  FEQLCINY NE LQ +FNQH+F+ EQEEY KEGI W  I++ DN  C+ L+E K P
Sbjct: 442 EKNSFEQLCINYTNEKLQQFFNQHMFKLEQEEYVKEGIEWSFIDYFDNQPCIDLIEKKNP 501

Query: 78  NGLLCVLDDQA 88
            G+L +LD++ 
Sbjct: 502 LGILSLLDEEC 512


>gnl|CDD|238678 cd01382, MYSc_type_VI, Myosin motor domain, type VI myosins. Myosin
           VI is a monomeric myosin, which moves towards the
           minus-end of actin filaments, in contrast to most other
           myosins. It has been implicated in endocytosis,
           secretion, and cell migration. This catalytic (head)
           domain has ATPase activity and belongs to the larger
           group of P-loop NTPases. Myosins are actin-dependent
           molecular motors that play important roles in muscle
           contraction, cell motility, and organelle transport. The
           head domain is a molecular motor, which utilizes ATP
           hydrolysis to generate directed movement toward the
           minus end along actin filaments. A cyclical interaction
           between myosin and actin provides the driving force.
           Rates of ATP hydrolysis and consequently the speed of
           movement along actin filaments vary widely, from about
           0.04 micrometer per second for myosin I to 4.5
           micrometer per second for myosin II in skeletal muscle.
           Myosin II moves in discrete steps about 5-10 nm long and
           generates 1-5 piconewtons of force. Upon ATP binding,
           the myosin head dissociates from an actin filament. ATP
           hydrolysis causes the head to pivot and associate with a
           new actin subunit. The release of Pi causes the head to
           pivot and move the filament (power stroke). Release of
           ADP completes the cycle.
          Length = 717

 Score = 81.0 bits (200), Expect = 2e-18
 Identities = 31/71 (43%), Positives = 46/71 (64%)

Query: 19  EEEDFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPN 78
           E   FEQ CINY NE LQ +FN+ + + EQE Y +EG+    + + DN  C+ L+E K N
Sbjct: 411 EHNSFEQFCINYCNEKLQQFFNERILKEEQELYQREGLGVNEVHYVDNQDCIDLIEAKLN 470

Query: 79  GLLCVLDDQAK 89
           G+L +LD++ +
Sbjct: 471 GILDILDEENR 481


>gnl|CDD|240229 PTZ00014, PTZ00014, myosin-A; Provisional.
          Length = 821

 Score = 54.3 bits (131), Expect = 4e-09
 Identities = 26/64 (40%), Positives = 36/64 (56%)

Query: 24  EQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLLCV 83
           EQL IN  NE LQ  F   VF+ E + Y  EGI    +E++ N   + L+ GK   +L +
Sbjct: 481 EQLFINITNEMLQKNFVDIVFERESKLYKDEGISTEELEYTSNESVIDLLCGKGKSVLSI 540

Query: 84  LDDQ 87
           L+DQ
Sbjct: 541 LEDQ 544


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 51.5 bits (124), Expect = 2e-08
 Identities = 28/62 (45%), Positives = 46/62 (74%)

Query: 89  KETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           K T+K   + E   KK EEEK+E++++    ++KEEEEE+EK+++EE+++ EEEE EEE+
Sbjct: 407 KATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEK 466

Query: 149 TK 150
            +
Sbjct: 467 EE 468



 Score = 45.7 bits (109), Expect = 3e-06
 Identities = 24/63 (38%), Positives = 46/63 (73%)

Query: 88  AKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
            K  EKA    E+  K+++++    +++E +EEE++E++E+EK+EEEE+ + E+EE+EE+
Sbjct: 413 KKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEK 472

Query: 148 ETK 150
           + K
Sbjct: 473 KKK 475



 Score = 45.3 bits (108), Expect = 4e-06
 Identities = 26/55 (47%), Positives = 38/55 (69%), Gaps = 1/55 (1%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE-KMEEEEKEEEETKYLVRD 155
           IKK  E+ E++ EEE KE++K+    K+K+EEEE+E + +EEEKEEEE +     
Sbjct: 412 IKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEK 466



 Score = 41.8 bits (99), Expect = 5e-05
 Identities = 20/58 (34%), Positives = 40/58 (68%)

Query: 84  LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEE 141
           ++   K+ E+     +      ++++EEEEEE+ K+EE++EEEE+E +EE+E+E+ ++
Sbjct: 416 VEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKK 473



 Score = 41.8 bits (99), Expect = 6e-05
 Identities = 26/67 (38%), Positives = 46/67 (68%), Gaps = 4/67 (5%)

Query: 83  VLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
           V   + K  E+     ++  KK    K++EEEEE ++E+KEEE+E+E++E EE+++ EEE
Sbjct: 416 VEKAEKKREEEK----KEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEE 471

Query: 143 EKEEEET 149
           +K+++ T
Sbjct: 472 KKKKQAT 478



 Score = 39.5 bits (93), Expect = 3e-04
 Identities = 17/46 (36%), Positives = 33/46 (71%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
            K+  +K ++  E+ +++ +EE++EK+KK    K+K EEEE+E+E+
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEK 450



 Score = 39.5 bits (93), Expect = 4e-04
 Identities = 16/51 (31%), Positives = 35/51 (68%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
                ++ +K  E+ E+ +EEEK+E+++K    ++++E+ EEE++++EE K
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEK 455



 Score = 35.7 bits (83), Expect = 0.007
 Identities = 17/49 (34%), Positives = 33/49 (67%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
              ++  K+ ++  E  E+++EEE++++KK+    +K EEEE+EE+E K
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKK 451



 Score = 34.5 bits (80), Expect = 0.015
 Identities = 16/54 (29%), Positives = 34/54 (62%)

Query: 97  LYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           L E+ I+     K+  ++ +   E+ E++ E+EKKE+++K    ++++EEEE +
Sbjct: 394 LTEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEE 447



 Score = 31.4 bits (72), Expect = 0.20
 Identities = 12/47 (25%), Positives = 27/47 (57%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
           EEE +     ++  ++ K+  E+ EKK EEEK++ +++    ++ + 
Sbjct: 396 EEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEE 442



 Score = 27.2 bits (61), Expect = 4.4
 Identities = 13/50 (26%), Positives = 30/50 (60%)

Query: 101 NIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            + +EE E     ++  K+ +K  E+ ++K+EEE+KEK ++    +++ +
Sbjct: 393 ELTEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEE 442


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 50.8 bits (121), Expect = 6e-08
 Identities = 45/152 (29%), Positives = 73/152 (48%), Gaps = 6/152 (3%)

Query: 4   NECFTIRGEEEEEEEEEEDFEQLCINYANEHLQYYFNQHVFQY-EQEEYNKEGIRWRHIE 62
            +     GE E E +E+ED  ++    A E  +   ++      E E   + G +  H  
Sbjct: 761 RKETEHEGETEAEGKEDEDEGEI---QAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEG 817

Query: 63  FSDNTLCLQLVEGKPNGLLCVLDDQ--AKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEE 120
            S+       V+ +        ++Q  AK+ EK       +   + EE+EEEEEEE +EE
Sbjct: 818 QSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEE 877

Query: 121 EKEEEEEKEKKEEEEKEKMEEEEKEEEETKYL 152
           E+EEEEE+E++E EE   +E  E  +++  YL
Sbjct: 878 EEEEEEEEEEEENEEPLSLEWPETRQKQAIYL 909



 Score = 33.8 bits (77), Expect = 0.031
 Identities = 22/61 (36%), Positives = 32/61 (52%)

Query: 90  ETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
           E E     ++   + EE E E E E E  E+E E E  +E +E E++ + E E K E ET
Sbjct: 698 EIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVET 757

Query: 150 K 150
           +
Sbjct: 758 E 758



 Score = 33.4 bits (76), Expect = 0.038
 Identities = 23/59 (38%), Positives = 32/59 (54%)

Query: 90  ETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           ETE  G   E  I+  EE +E E+E E + E K E E +  ++E E E   E E +E+E
Sbjct: 720 ETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDE 778



 Score = 33.0 bits (75), Expect = 0.067
 Identities = 21/64 (32%), Positives = 32/64 (50%)

Query: 85  DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
           + +AKE +  G    + ++ E E + E  E+E + E  EE EE E + E E E   E E 
Sbjct: 698 EIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVET 757

Query: 145 EEEE 148
           E + 
Sbjct: 758 EGDR 761



 Score = 31.5 bits (71), Expect = 0.20
 Identities = 27/72 (37%), Positives = 36/72 (50%), Gaps = 2/72 (2%)

Query: 86  DQAKETEKAGVLYEDNIKKE--EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
           D   ETE   V +E   + E  E+E E E  EE +E E E E E E K E E E   +E 
Sbjct: 705 DHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKET 764

Query: 144 KEEEETKYLVRD 155
           + E ET+   ++
Sbjct: 765 EHEGETEAEGKE 776



 Score = 30.7 bits (69), Expect = 0.33
 Identities = 14/46 (30%), Positives = 23/46 (50%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           +   E ++E E E   E E E E   E+K E+E E   E ++ + +
Sbjct: 662 ESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHK 707



 Score = 30.3 bits (68), Expect = 0.50
 Identities = 21/50 (42%), Positives = 26/50 (52%), Gaps = 4/50 (8%)

Query: 103 KKEEEEKEEEEEEEVKEE---EKEEEEEKEKKEEEEKE-KMEEEEKEEEE 148
             EE E+  E E E  EE   E E+E E E K E E E ++  E K E+E
Sbjct: 645 TGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQE 694



 Score = 30.0 bits (67), Expect = 0.61
 Identities = 16/44 (36%), Positives = 18/44 (40%)

Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
             E   EE E     E E  EE   E ++E E E   E E E E
Sbjct: 641 TGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGE 684



 Score = 29.2 bits (65), Expect = 1.1
 Identities = 18/49 (36%), Positives = 25/49 (51%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
           E  I+ +E + + E E E  E E E E E  + E E +   E EE E+E
Sbjct: 696 EGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDE 744



 Score = 28.8 bits (64), Expect = 1.3
 Identities = 17/50 (34%), Positives = 27/50 (54%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           E   + E +E + + E E +E E E E E E  E+E + +  EE +E E+
Sbjct: 694 EGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVED 743



 Score = 28.4 bits (63), Expect = 1.7
 Identities = 17/66 (25%), Positives = 32/66 (48%)

Query: 85  DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
           + + +   +  +  E   ++E E + E +E + K E + EE E E + E E  + E E +
Sbjct: 674 ETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIE 733

Query: 145 EEEETK 150
             EE +
Sbjct: 734 TGEEGE 739



 Score = 28.4 bits (63), Expect = 2.0
 Identities = 22/63 (34%), Positives = 32/63 (50%)

Query: 85  DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
           + +A+E E  G    +  + E E +  EE EEV++E + E E K + E E   K  E E 
Sbjct: 709 ETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEG 768

Query: 145 EEE 147
           E E
Sbjct: 769 ETE 771



 Score = 28.4 bits (63), Expect = 2.2
 Identities = 16/46 (34%), Positives = 22/46 (47%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           +  EE   E E+E   E + E E E E   E + E+  E E E +E
Sbjct: 658 ENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKE 703



 Score = 28.4 bits (63), Expect = 2.3
 Identities = 14/43 (32%), Positives = 22/43 (51%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
           E E K E E E     E++ E+E E + E ++   + E + EE
Sbjct: 672 ETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEE 714



 Score = 28.0 bits (62), Expect = 2.5
 Identities = 20/61 (32%), Positives = 31/61 (50%)

Query: 90  ETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
           E E  G +       + E + EE E E + E +  E+E E +  EE E++E+E + E E 
Sbjct: 692 EQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEG 751

Query: 150 K 150
           K
Sbjct: 752 K 752



 Score = 28.0 bits (62), Expect = 2.5
 Identities = 18/61 (29%), Positives = 30/61 (49%)

Query: 87  QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
           + +  E++G   E   + E + + E E E   E + E+E E E + +E   K E E +E 
Sbjct: 656 EGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEV 715

Query: 147 E 147
           E
Sbjct: 716 E 716



 Score = 27.7 bits (61), Expect = 4.1
 Identities = 19/62 (30%), Positives = 32/62 (51%)

Query: 86  DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
           +Q  ETE  G    +     E + E+E E E++ +E + + E E +E E + + E E  E
Sbjct: 668 EQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTE 727

Query: 146 EE 147
           +E
Sbjct: 728 DE 729



 Score = 26.9 bits (59), Expect = 6.1
 Identities = 14/46 (30%), Positives = 22/46 (47%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           E E   E   EE +   + E E  E+   E +++ E E K E E++
Sbjct: 637 EAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESE 682



 Score = 26.5 bits (58), Expect = 7.7
 Identities = 16/63 (25%), Positives = 31/63 (49%)

Query: 85  DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
           + +  E E      E+  + E+E + E E +   E E + +E + + E E + K +E+E 
Sbjct: 722 EAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEG 781

Query: 145 EEE 147
           E +
Sbjct: 782 EIQ 784



 Score = 26.5 bits (58), Expect = 7.9
 Identities = 14/64 (21%), Positives = 27/64 (42%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHL 158
               ++E E + + E E   E   E + E+E + E E ++ + + + E E      +   
Sbjct: 664 GGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEA 723

Query: 159 ESCE 162
           E  E
Sbjct: 724 EGTE 727



 Score = 26.5 bits (58), Expect = 9.7
 Identities = 16/48 (33%), Positives = 20/48 (41%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
            D  K +  E E   E   +E E+  E E E  EE   E  +E E E 
Sbjct: 628 GDLSKGDVAEAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETET 675


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 45.1 bits (107), Expect = 1e-06
 Identities = 17/46 (36%), Positives = 27/46 (58%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           EE E+E  +E E ++  + + E KEKK+ E  +   E+EK + E K
Sbjct: 99  EETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPK 144



 Score = 37.0 bits (86), Expect = 0.001
 Identities = 14/42 (33%), Positives = 24/42 (57%)

Query: 109 KEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
              ++EEE  EEE  +E E+E   E + E  E++++E  + K
Sbjct: 92  SSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPK 133



 Score = 35.9 bits (83), Expect = 0.003
 Identities = 13/37 (35%), Positives = 23/37 (62%)

Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKME 140
            +E E+E+  E + + +EK++ E  + K E+EK K E
Sbjct: 106 TDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTE 142



 Score = 35.1 bits (81), Expect = 0.006
 Identities = 13/43 (30%), Positives = 24/43 (55%)

Query: 108 EKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
             +EEEE E +  ++ E+E+  + + E KEK + E  + +  K
Sbjct: 94  SDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEK 136



 Score = 34.3 bits (79), Expect = 0.010
 Identities = 8/46 (17%), Positives = 27/46 (58%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
           E+  ++E  ++ E+E+    + E +E++++E  + + +++  + E 
Sbjct: 98  EEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEP 143



 Score = 32.4 bits (74), Expect = 0.055
 Identities = 13/51 (25%), Positives = 25/51 (49%)

Query: 90  ETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKME 140
           E E      ++  +++  E + E +E+ K E  + + EKEK + E K+   
Sbjct: 98  EEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKP 148



 Score = 27.8 bits (62), Expect = 2.2
 Identities = 11/54 (20%), Positives = 29/54 (53%), Gaps = 6/54 (11%)

Query: 85  DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
           ++   ETE+       +  + + E +E+++ EV + + E+E+ K + ++ +  K
Sbjct: 103 EESTDETEQE------DPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSK 150



 Score = 27.4 bits (61), Expect = 2.2
 Identities = 12/42 (28%), Positives = 23/42 (54%)

Query: 119 EEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLES 160
           ++E+EE EE+   E E+++  E + + +E+ K  V     E 
Sbjct: 95  DDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEK 136


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 42.0 bits (98), Expect = 6e-05
 Identities = 20/41 (48%), Positives = 27/41 (65%), Gaps = 2/41 (4%)

Query: 103 KKEEE--EKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEE 141
           KK EE  EK + E E+   EE+E E+EKEK+ E E+E+  E
Sbjct: 579 KKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAE 619



 Score = 41.2 bits (96), Expect = 9e-05
 Identities = 17/44 (38%), Positives = 31/44 (70%)

Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
           K  +++EE  E+  +E E++  EE+E+++E+EKE+  E E+E E
Sbjct: 576 KLAKKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAE 619



 Score = 38.5 bits (89), Expect = 9e-04
 Identities = 16/40 (40%), Positives = 29/40 (72%), Gaps = 1/40 (2%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
           E+ ++K + E E++  EE +E EKE+E+E+E++ E E E+
Sbjct: 582 EEAVEKAKREAEQKAREE-REREKEKEKEREREREREAER 620



 Score = 36.2 bits (83), Expect = 0.005
 Identities = 17/42 (40%), Positives = 27/42 (64%), Gaps = 1/42 (2%)

Query: 109 KEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           K  ++ EE  E+ K E E+K  +EE E+EK +E+E+E E  +
Sbjct: 576 KLAKKREEAVEKAKREAEQK-AREEREREKEKEKERERERER 616



 Score = 30.4 bits (68), Expect = 0.38
 Identities = 13/28 (46%), Positives = 21/28 (75%), Gaps = 1/28 (3%)

Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKK 131
           +EE E+E+E+E+E +E E+E E E+  K
Sbjct: 597 REEREREKEKEKE-REREREREAERAAK 623



 Score = 29.3 bits (65), Expect = 1.0
 Identities = 15/50 (30%), Positives = 26/50 (52%)

Query: 92  EKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEE 141
           EKA    E   ++E E ++E+E+E  +E E+E E   +      + +M E
Sbjct: 586 EKAKREAEQKAREEREREKEKEKEREREREREAERAAKASSSSHESRMSE 635



 Score = 28.9 bits (64), Expect = 1.3
 Identities = 14/36 (38%), Positives = 23/36 (63%), Gaps = 3/36 (8%)

Query: 120 EEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
           +++EE  EK K+E E+K +   EE+E E+ K   R+
Sbjct: 579 KKREEAVEKAKREAEQKAR---EEREREKEKEKERE 611


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 40.7 bits (96), Expect = 1e-04
 Identities = 22/54 (40%), Positives = 37/54 (68%), Gaps = 2/54 (3%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKE--EEEKEKMEEEEKEEEETK 150
           E+ I K  EE+ +EE +E KEE+K+EE E +  +   EE+ K+EE+E++++  K
Sbjct: 269 EEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322



 Score = 35.3 bits (82), Expect = 0.009
 Identities = 23/60 (38%), Positives = 40/60 (66%), Gaps = 4/60 (6%)

Query: 97  LYEDNIKKEEEEKEEEEEEEVKEEEKEEEEE-KEKKEEEEKEKMEEEEKE---EEETKYL 152
           L  + ++K ++ +EEEEE+ +K  E+E +EE +EKKEE++KE+ E +  +   EE+ K  
Sbjct: 253 LSPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLE 312



 Score = 30.3 bits (69), Expect = 0.42
 Identities = 15/55 (27%), Positives = 34/55 (61%), Gaps = 3/55 (5%)

Query: 89  KETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEE---EEKEKKEEEEKEKME 140
           +E E+  +   +  ++EE ++++EE+++ + E K  +   EE+ K EE+E++K  
Sbjct: 266 EEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQA 320


>gnl|CDD|238682 cd01386, MYSc_type_XVIII, Myosin motor domain, type XVIII myosins.
           This catalytic (head) domain has ATPase activity and
           belongs to the larger group of P-loop NTPases. Myosins
           are actin-dependent molecular motors that play important
           roles in muscle contraction, cell motility, and
           organelle transport. The head domain is a molecular
           motor, which utilizes ATP hydrolysis to generate
           directed movement toward the plus end along actin
           filaments. A cyclical interaction between myosin and
           actin provides the driving force. Rates of ATP
           hydrolysis and consequently the speed of movement along
           actin filaments vary widely, from about 0.04 micrometer
           per second for myosin I to 4.5 micrometer per second for
           myosin II in skeletal muscle. Myosin II moves in
           discrete steps about 5-10 nm long and generates 1-5
           piconewtons of force. Upon ATP binding, the myosin head
           dissociates from an actin filament. ATP hydrolysis
           causes the head to pivot and associate with a new actin
           subunit. The release of Pi causes the head to pivot and
           move the filament (power stroke). Release of ADP
           completes the cycle.
          Length = 767

 Score = 39.8 bits (93), Expect = 3e-04
 Identities = 21/63 (33%), Positives = 29/63 (46%), Gaps = 10/63 (15%)

Query: 23  FEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLLC 82
           FE+LC NY  E LQ  F+   F    E Y +EG+    +EF        L E  P   + 
Sbjct: 397 FEELCHNYLQERLQLLFHHRTFVQPLERYAEEGVE---VEFD-------LAEPSPGTTVA 446

Query: 83  VLD 85
           ++D
Sbjct: 447 LVD 449


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 39.0 bits (91), Expect = 6e-04
 Identities = 25/68 (36%), Positives = 40/68 (58%), Gaps = 2/68 (2%)

Query: 88  AKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
           AKETE+  +L  D     E E    E ++ +EE++EEEEE+++ E   KE  ++EE EE+
Sbjct: 364 AKETEQDYIL--DAFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEED 421

Query: 148 ETKYLVRD 155
           + +    D
Sbjct: 422 DVESKYED 429



 Score = 30.5 bits (69), Expect = 0.44
 Identities = 19/52 (36%), Positives = 32/52 (61%)

Query: 87  QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
            A E E A    +D  +++EEE+EEE+E+E   +E  ++EE E+ + E K +
Sbjct: 377 SALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYE 428



 Score = 28.2 bits (63), Expect = 2.5
 Identities = 20/58 (34%), Positives = 29/58 (50%), Gaps = 4/58 (6%)

Query: 95  GVLYED-NIKKEEEEKEEEE---EEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
             L+E  N +K  + KE E+    +     E E+   +   EEEE E+ EEEE E+E 
Sbjct: 350 QALWETLNKQKWTKAKETEQDYILDAFSALEIEDANTERDDEEEEDEEEEEEEDEDEG 407


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 38.7 bits (90), Expect = 6e-04
 Identities = 21/65 (32%), Positives = 34/65 (52%)

Query: 84  LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
           L+ QA+E EK     +   K+ E+    E+  +  E+  ++ EEK+K+ EE K K   E 
Sbjct: 73  LEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEA 132

Query: 144 KEEEE 148
           K + E
Sbjct: 133 KAKAE 137



 Score = 34.8 bits (80), Expect = 0.012
 Identities = 15/59 (25%), Positives = 31/59 (52%)

Query: 86  DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
            +A E  +   L +    ++  ++ E+  ++ +E++K+ EE K K+  E K K E E +
Sbjct: 83  QRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAE 141



 Score = 32.9 bits (75), Expect = 0.059
 Identities = 15/66 (22%), Positives = 36/66 (54%)

Query: 86  DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
             AK+ ++     E   ++ E+++  E+  + + E++   E+  K+ E+  ++ EE++K+
Sbjct: 61  PAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQ 120

Query: 146 EEETKY 151
            EE K 
Sbjct: 121 AEEAKA 126



 Score = 31.0 bits (70), Expect = 0.25
 Identities = 20/62 (32%), Positives = 28/62 (45%), Gaps = 3/62 (4%)

Query: 87  QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
              E E      E+  K+ EEE + +   E K   K+  E K+K E E K K E + K +
Sbjct: 134 AKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAK---KKAAEAKKKAEAEAKAKAEAKAKAK 190

Query: 147 EE 148
            E
Sbjct: 191 AE 192



 Score = 27.9 bits (62), Expect = 2.4
 Identities = 21/80 (26%), Positives = 35/80 (43%), Gaps = 14/80 (17%)

Query: 85  DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEK--------------EEEEEKEK 130
           +  AK+ E+A    E+  K+ EE K ++  E   + E               EEE + + 
Sbjct: 101 EKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKA 160

Query: 131 KEEEEKEKMEEEEKEEEETK 150
             E +K+  E ++K E E K
Sbjct: 161 AAEAKKKAAEAKKKAEAEAK 180



 Score = 27.9 bits (62), Expect = 2.7
 Identities = 16/66 (24%), Positives = 34/66 (51%), Gaps = 1/66 (1%)

Query: 86  DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKK-EEEEKEKMEEEEK 144
           ++ K+ E+     E     E+  ++E E+    E+  ++ E+  K+ EE++K+  E + K
Sbjct: 68  ERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAK 127

Query: 145 EEEETK 150
           +  E K
Sbjct: 128 QAAEAK 133



 Score = 26.7 bits (59), Expect = 6.8
 Identities = 17/52 (32%), Positives = 24/52 (46%), Gaps = 3/52 (5%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           E+   K   E +++  E  K   K E E K K E + K K EE + + E  K
Sbjct: 154 EEAKAKAAAEAKKKAAEAKK---KAEAEAKAKAEAKAKAKAEEAKAKAEAAK 202



 Score = 26.0 bits (57), Expect = 9.9
 Identities = 11/51 (21%), Positives = 33/51 (64%), Gaps = 2/51 (3%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKK--EEEEKEKMEEEEKEEEETK 150
           I+++++   ++E+E  K+ E++ EE ++++  E+  ++++E+    E+  K
Sbjct: 55  IQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAK 105


>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
           Reviewed.
          Length = 460

 Score = 38.8 bits (90), Expect = 6e-04
 Identities = 7/50 (14%), Positives = 17/50 (34%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
           D  +++  +   +     + + +      E+      E  +E E E  E 
Sbjct: 370 DIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEK 419



 Score = 35.0 bits (80), Expect = 0.012
 Identities = 9/52 (17%), Positives = 16/52 (30%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            D   +     + + E      E+      E  +E E E  E+     +  K
Sbjct: 377 GDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPIPDPAK 428



 Score = 35.0 bits (80), Expect = 0.013
 Identities = 10/59 (16%), Positives = 18/59 (30%)

Query: 90  ETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           ET +A +  E       +     + +        EE      E  ++ + E  EK    
Sbjct: 365 ETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPI 423



 Score = 34.2 bits (78), Expect = 0.024
 Identities = 8/50 (16%), Positives = 18/50 (36%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           E+  + + E ++  +         + + E      EE   +  E  +E E
Sbjct: 364 EETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETE 413



 Score = 32.3 bits (73), Expect = 0.10
 Identities = 10/45 (22%), Positives = 16/45 (35%), Gaps = 1/45 (2%)

Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           + + E      EE      E  +E E  E  EK     +  + +E
Sbjct: 388 QVDAEAASAAPEEPAALASEAHDETEP-EVPEKAAPIPDPAKPDE 431



 Score = 28.4 bits (63), Expect = 2.1
 Identities = 9/52 (17%), Positives = 16/52 (30%), Gaps = 1/52 (1%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEE-EKEKKEEEEKEKMEEEEKEEEET 149
              +  E      EE   +  E  +E E E  +K     +  + +E      
Sbjct: 386 AHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPIPDPAKPDELAVAGP 437


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 38.2 bits (89), Expect = 8e-04
 Identities = 24/63 (38%), Positives = 39/63 (61%)

Query: 86  DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
           DQA   E+     +D    E EE +EEE+E    E+KE++ + EK++EE +E+ EEE++E
Sbjct: 42  DQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEE 101

Query: 146 EEE 148
             +
Sbjct: 102 SSD 104



 Score = 33.2 bits (76), Expect = 0.037
 Identities = 21/65 (32%), Positives = 35/65 (53%)

Query: 86  DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
           D  +  E   V  E+      E+KE++ + E ++EE EEE E+E +E  ++ + E EEK 
Sbjct: 55  DDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKT 114

Query: 146 EEETK 150
           E   +
Sbjct: 115 ESNVE 119



 Score = 32.0 bits (73), Expect = 0.089
 Identities = 25/59 (42%), Positives = 34/59 (57%)

Query: 85  DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
            ++ KE EK     ED   K + EKE+EE EE  EEE EE  ++ +KE EEK +   E+
Sbjct: 62  IEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVEK 120



 Score = 30.5 bits (69), Expect = 0.28
 Identities = 17/52 (32%), Positives = 30/52 (57%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           +    ++E +K +++E    EE KEEE+E    E++E +   E+E EE E +
Sbjct: 43  QAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEE 94



 Score = 30.1 bits (68), Expect = 0.34
 Identities = 18/67 (26%), Positives = 37/67 (55%)

Query: 85  DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
           D +  E E+     ++    E++E + + E+E +E E+E EEE E+  +E +++ EE+ +
Sbjct: 56  DQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTE 115

Query: 145 EEEETKY 151
              E + 
Sbjct: 116 SNVEKEI 122



 Score = 30.1 bits (68), Expect = 0.37
 Identities = 19/52 (36%), Positives = 33/52 (63%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            D    +E+E ++ +++E  E E+ +EEEKE    E+KE   + EKE+EE++
Sbjct: 41  SDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESE 92



 Score = 29.3 bits (66), Expect = 0.78
 Identities = 19/48 (39%), Positives = 30/48 (62%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            KE++   E+E+EE +EE +EE+EE   + E+E E+  E   E+E T 
Sbjct: 77  DKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVEKEITN 124



 Score = 27.0 bits (60), Expect = 4.1
 Identities = 19/52 (36%), Positives = 34/52 (65%), Gaps = 5/52 (9%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEE-----EKEKKEEEEKEKMEEEEKEEEET 149
           ++E ++ +++E  E++E ++EE+E     +KE K + EKE  E EE+ EEE 
Sbjct: 48  EQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEED 99



 Score = 25.9 bits (57), Expect = 9.3
 Identities = 13/53 (24%), Positives = 28/53 (52%)

Query: 85  DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
           + + KE +      ++  ++E EE++EE  +E ++E +E+ E   +KE     
Sbjct: 74  NSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVEKEITNPS 126


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 37.3 bits (87), Expect = 0.002
 Identities = 16/48 (33%), Positives = 28/48 (58%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYL 152
           EE E+E + E E   E K  ++E+E   EE+     EE+++ ++T+ L
Sbjct: 233 EEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDL 280



 Score = 36.9 bits (86), Expect = 0.003
 Identities = 19/67 (28%), Positives = 32/67 (47%), Gaps = 4/67 (5%)

Query: 86  DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEE----KEEEEEKEKKEEEEKEKMEE 141
            +  ETE+     +  I+   E K  ++E+E   EE      EE+E   K E+  +    
Sbjct: 227 KETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEIL 286

Query: 142 EEKEEEE 148
           +EK++EE
Sbjct: 287 KEKKDEE 293



 Score = 35.0 bits (81), Expect = 0.011
 Identities = 17/61 (27%), Positives = 29/61 (47%)

Query: 91  TEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           T+K     E   ++EE + E E   E K  ++E+E   E+      E+ E+ +K E+  K
Sbjct: 223 TKKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDK 282

Query: 151 Y 151
            
Sbjct: 283 L 283



 Score = 30.0 bits (68), Expect = 0.54
 Identities = 13/56 (23%), Positives = 25/56 (44%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHL 158
              E E   E +   +E+E   EE+     EE+++  + E+ ++ E     +D  L
Sbjct: 239 TDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEILKEKKDEEL 294



 Score = 30.0 bits (68), Expect = 0.61
 Identities = 16/38 (42%), Positives = 27/38 (71%)

Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
           E+K+E +  +  EE+ ++EE+KEK + EEKE++E  E 
Sbjct: 648 EKKKEFKILDYTEEKTKKEEKKEKNKREEKERIEIAET 685



 Score = 29.2 bits (66), Expect = 0.98
 Identities = 14/32 (43%), Positives = 25/32 (78%), Gaps = 1/32 (3%)

Query: 102 IKKEEEEKE-EEEEEEVKEEEKEEEEEKEKKE 132
           I+K++E K  +  EE+ K+EEK+E+ ++E+KE
Sbjct: 647 IEKKKEFKILDYTEEKTKKEEKKEKNKREEKE 678



 Score = 27.3 bits (61), Expect = 5.0
 Identities = 14/31 (45%), Positives = 22/31 (70%)

Query: 115 EEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
           E+ KE +  +  E++ K+EE+KEK + EEKE
Sbjct: 648 EKKKEFKILDYTEEKTKKEEKKEKNKREEKE 678


>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein.  This
           protein is covalently attached to the terminii of
           replicating DNA in vivo.
          Length = 548

 Score = 36.9 bits (86), Expect = 0.002
 Identities = 22/31 (70%), Positives = 26/31 (83%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEE 135
            E E+EEEEEEEV EEE+EEEEE+E+  EEE
Sbjct: 303 PEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333



 Score = 32.7 bits (75), Expect = 0.066
 Identities = 17/33 (51%), Positives = 23/33 (69%)

Query: 110 EEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
              E EE +EEE+E  EE+E++EEEE+   EEE
Sbjct: 301 SPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333



 Score = 32.7 bits (75), Expect = 0.078
 Identities = 21/38 (55%), Positives = 24/38 (63%), Gaps = 4/38 (10%)

Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
               E EEEEE +EE  EEEEE    EEEE+E+  EEE
Sbjct: 300 PSPPEPEEEEEEEEEVPEEEEE----EEEEEERTFEEE 333



 Score = 31.2 bits (71), Expect = 0.24
 Identities = 17/34 (50%), Positives = 23/34 (67%)

Query: 114 EEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
               + EE+EEEEE+  +EEEE+E+ EE   EEE
Sbjct: 300 PSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333



 Score = 29.6 bits (67), Expect = 0.72
 Identities = 19/34 (55%), Positives = 23/34 (67%)

Query: 115 EEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
               E E+EEEEE+E  EEEE+E+ EEE   EEE
Sbjct: 300 PSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333



 Score = 29.6 bits (67), Expect = 0.85
 Identities = 16/34 (47%), Positives = 22/34 (64%)

Query: 113 EEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
                 EEE+EEEEE  ++EEEE+E+ E   +EE
Sbjct: 300 PSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333



 Score = 28.8 bits (65), Expect = 1.3
 Identities = 15/32 (46%), Positives = 21/32 (65%)

Query: 121 EKEEEEEKEKKEEEEKEKMEEEEKEEEETKYL 152
               E E+E++EEEE  + EEEE+EEEE  + 
Sbjct: 300 PSPPEPEEEEEEEEEVPEEEEEEEEEEERTFE 331



 Score = 28.8 bits (65), Expect = 1.4
 Identities = 15/34 (44%), Positives = 22/34 (64%)

Query: 123 EEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDV 156
              E +E++EEEE+   EEEE+EEEE +    +V
Sbjct: 301 SPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEEV 334



 Score = 28.5 bits (64), Expect = 1.7
 Identities = 16/30 (53%), Positives = 21/30 (70%)

Query: 120 EEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
               E EE+E++EEE  E+ EEEE+EEE T
Sbjct: 300 PSPPEPEEEEEEEEEVPEEEEEEEEEEERT 329



 Score = 28.1 bits (63), Expect = 2.4
 Identities = 13/32 (40%), Positives = 20/32 (62%)

Query: 116 EVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
                 + EEEE+E++E  E+E+ EEEE+E  
Sbjct: 298 PPPSPPEPEEEEEEEEEVPEEEEEEEEEEERT 329


>gnl|CDD|219956 pfam08658, Rad54_N, Rad54 N terminal.  This is the N terminal of
           the DNA repair protein Rad54.
          Length = 191

 Score = 36.6 bits (85), Expect = 0.003
 Identities = 23/60 (38%), Positives = 31/60 (51%), Gaps = 7/60 (11%)

Query: 92  EKAGVLYE---DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEE----KEKMEEEEK 144
           E A VLY+   D+  K EEEK E+++E  + E K     K  K   E    K+K+EE  K
Sbjct: 131 EFAIVLYDPTVDDKPKIEEEKAEKDQEPEESETKLSNGPKVHKSLAEILGIKKKVEERPK 190



 Score = 28.1 bits (63), Expect = 1.7
 Identities = 8/28 (28%), Positives = 16/28 (57%)

Query: 123 EEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           ++ + +E+K E+++E  E E K     K
Sbjct: 143 DKPKIEEEKAEKDQEPEESETKLSNGPK 170


>gnl|CDD|212055 cd11486, SLC5sbd_SGLT1, Na(+)/glucose cotransporter SGLT1;solute
           binding domain.  Human SGLT1 (hSGLT1) is a
           high-affinity/low-capacity glucose transporter, which
           can also transport galactose. In the transport
           mechanism, two Na+ ions first bind to the extracellular
           side of the transporter and induce a conformational
           change in the glucose binding site. This results in an
           increased affinity for glucose. A second conformational
           change in the transporter follows, bringing the Na+ and
           glucose binding sites to the inner surface of the
           membrane. Glucose is then released, followed by the Na+
           ions. In the process, hSGLT1 is also able to transport
           water and urea and may be a major pathway for transport
           of these across the intestinal brush-border membrane.
           hSGLT1 is encoded by the SLC5A1 gene and expressed
           mostly in the intestine, but also in the trachea,
           kidney, heart, brain, testis, and prostate. The
           WHO/UNICEF oral rehydration solution (ORS) for the
           treatment of secretory diarrhea contains salt and
           glucose. The glucose, along with sodium ions, is
           transported by hSGLT1 and water is either co-transported
           along with these or follows by osmosis. Mutations in
           SGLT1 are associated with intestinal glucose galactose
           malabsorption (GGM). Up-regulation of intestinal SGLT1
           may protect against enteric infections. SGLT1 is
           expressed in colorectal, head and neck, and prostate
           tumors. Epidermal growth factor receptor (EGFR)
           functions in cell survival by stabilizing SGLT1, and
           thereby maintaining intracellular glucose levels. SGLT1
           is predicted to have 14 membrane-spanning regions. This
           subgroup belongs to the solute carrier 5
           (SLC5)transporter family.
          Length = 635

 Score = 37.2 bits (86), Expect = 0.003
 Identities = 14/64 (21%), Positives = 29/64 (45%), Gaps = 14/64 (21%)

Query: 101 NIKKEEEEKE-EEEEEEVKEEEKEEEEEKEKK-------------EEEEKEKMEEEEKEE 146
           N  +E  + + ++  E+  E E E +EE++K              ++ +  K+ EEE+  
Sbjct: 539 NSTEERIDLDADDWTEDEDENEMETDEERKKPGCCRKAYNWFCGFDQGKAPKLTEEEEAA 598

Query: 147 EETK 150
            + K
Sbjct: 599 LKMK 602


>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
           [Transcription / DNA replication, recombination, and
           repair / Chromatin structure and dynamics].
          Length = 1001

 Score = 36.9 bits (85), Expect = 0.003
 Identities = 23/74 (31%), Positives = 42/74 (56%), Gaps = 1/74 (1%)

Query: 74  EGKPNGLLCVLDDQAKETEKAGVLYEDNIKKEEEEKEEEEE-EEVKEEEKEEEEEKEKKE 132
           +G  + L+   DD++ E+E+    YE +   E +E +E+EE +E  E+  E+E E +  +
Sbjct: 917 DGGWSFLMVGSDDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSD 976

Query: 133 EEEKEKMEEEEKEE 146
           EE+ E  +E E + 
Sbjct: 977 EEDGEDWDELESKA 990



 Score = 27.7 bits (61), Expect = 3.7
 Identities = 14/50 (28%), Positives = 29/50 (58%)

Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
            +++ +E EEEV E E   ++E ++ +E+E+     E+  E+E++    D
Sbjct: 927 SDDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSD 976



 Score = 27.3 bits (60), Expect = 4.8
 Identities = 14/47 (29%), Positives = 25/47 (53%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
            ++E +E EEE  + E   ++E  E  E+EE ++  E+  E+E    
Sbjct: 927 SDDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESEND 973


>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
          Length = 449

 Score = 36.3 bits (84), Expect = 0.004
 Identities = 19/63 (30%), Positives = 31/63 (49%)

Query: 86  DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
           DQ  E  K   L     ++E  ++E EE  E ++ E+ EE+ + ++     E   E+E E
Sbjct: 377 DQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDE 436

Query: 146 EEE 148
            EE
Sbjct: 437 LEE 439



 Score = 32.8 bits (75), Expect = 0.054
 Identities = 16/53 (30%), Positives = 25/53 (47%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
           ED    E   + EEEE   +E E+  E E+ ++ EE+    E     E+E + 
Sbjct: 381 EDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFED 433



 Score = 31.3 bits (71), Expect = 0.18
 Identities = 16/52 (30%), Positives = 28/52 (53%)

Query: 97  LYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           L  +  ++ E E+ E  EE+ +  E     E E ++E+E E+ + EE+EE  
Sbjct: 398 LQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEEEEEAR 449



 Score = 30.9 bits (70), Expect = 0.31
 Identities = 17/65 (26%), Positives = 31/65 (47%)

Query: 83  VLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
           V +  ++  E+  +  E   + E E+ E  EE+    E     E++ + E+E +E   EE
Sbjct: 385 VAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEE 444

Query: 143 EKEEE 147
           E+E  
Sbjct: 445 EEEAR 449



 Score = 30.5 bits (69), Expect = 0.32
 Identities = 19/63 (30%), Positives = 29/63 (46%)

Query: 101 NIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLES 160
           +IK   E  +E E+ +V E   + EEE+  + E E+    E+ +  EE   L     L  
Sbjct: 369 DIKDSAEYDQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPE 428

Query: 161 CEF 163
            EF
Sbjct: 429 DEF 431


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 36.3 bits (83), Expect = 0.005
 Identities = 24/62 (38%), Positives = 38/62 (61%)

Query: 89   KETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
            K  E+  +   +  KK EE+K++ EE +  EE++++  E  KKE EE +K EE +K+E E
Sbjct: 1654 KAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAE 1713

Query: 149  TK 150
             K
Sbjct: 1714 EK 1715



 Score = 35.5 bits (81), Expect = 0.010
 Identities = 30/78 (38%), Positives = 46/78 (58%)

Query: 85   DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
             ++AK+ E+     E+  K EE+EK+  E  + + EE ++ EE +KKE EEK+K EE +K
Sbjct: 1664 AEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKK 1723

Query: 145  EEEETKYLVRDVHLESCE 162
             EEE K    +   E+ E
Sbjct: 1724 AEEENKIKAEEAKKEAEE 1741



 Score = 34.7 bits (79), Expect = 0.019
 Identities = 23/64 (35%), Positives = 41/64 (64%)

Query: 85   DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
            +++ K+ E+      +  KK EE K+ EEE ++K  E+ ++ E++KK+ EE +K EE+EK
Sbjct: 1629 EEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEK 1688

Query: 145  EEEE 148
            +  E
Sbjct: 1689 KAAE 1692



 Score = 33.6 bits (76), Expect = 0.042
 Identities = 27/78 (34%), Positives = 41/78 (52%)

Query: 73   VEGKPNGLLCVLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKE 132
             + K   L     ++ K+ E+     E+N  K  EE ++ EE++ K EE ++ EE EKK 
Sbjct: 1631 EKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKA 1690

Query: 133  EEEKEKMEEEEKEEEETK 150
             E  +K  EE K+ EE K
Sbjct: 1691 AEALKKEAEEAKKAEELK 1708



 Score = 33.6 bits (76), Expect = 0.047
 Identities = 28/67 (41%), Positives = 42/67 (62%)

Query: 84   LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
            +++  K  E+   +  +  KK EE K + EE +  EEEK++ E+ +KKE EEK+K EE +
Sbjct: 1594 IEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELK 1653

Query: 144  KEEEETK 150
            K EEE K
Sbjct: 1654 KAEEENK 1660



 Score = 32.8 bits (74), Expect = 0.082
 Identities = 24/64 (37%), Positives = 37/64 (57%)

Query: 87   QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
             A E E A    E   KK+EE K++ +  + K EEK++ +E +KK EE+K+K +E +K  
Sbjct: 1355 AADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAA 1414

Query: 147  EETK 150
               K
Sbjct: 1415 AAKK 1418



 Score = 32.4 bits (73), Expect = 0.093
 Identities = 24/62 (38%), Positives = 39/62 (62%)

Query: 89   KETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
            K  E+   + +   K+ EE+K+ EE ++ +EE K +  E+ KK EE+K+K EE +K EE+
Sbjct: 1627 KAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEED 1686

Query: 149  TK 150
             K
Sbjct: 1687 EK 1688



 Score = 32.0 bits (72), Expect = 0.12
 Identities = 27/62 (43%), Positives = 36/62 (58%)

Query: 89   KETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
            K  E      E   K+ EE K+ EE ++ + EEK++ EE +K EEE K K EE +KE EE
Sbjct: 1682 KAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEE 1741

Query: 149  TK 150
             K
Sbjct: 1742 DK 1743



 Score = 32.0 bits (72), Expect = 0.14
 Identities = 28/62 (45%), Positives = 42/62 (67%)

Query: 89   KETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
            KE E+A    E   K+ EE+K+ EE ++ +EE K + EE +K+ EE+K+K EE +K+EEE
Sbjct: 1696 KEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEE 1755

Query: 149  TK 150
             K
Sbjct: 1756 KK 1757



 Score = 32.0 bits (72), Expect = 0.15
 Identities = 26/66 (39%), Positives = 42/66 (63%)

Query: 85   DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
             ++AK+ E+      +  KK EE K+ EEE ++K EE ++E E++KK+ EE +K EEE+K
Sbjct: 1698 AEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKK 1757

Query: 145  EEEETK 150
            +    K
Sbjct: 1758 KIAHLK 1763



 Score = 31.6 bits (71), Expect = 0.19
 Identities = 19/51 (37%), Positives = 32/51 (62%)

Query: 100  DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            +  KK+ EE ++ +E + K EE ++ +E +KK EE K+K +E +K  E  K
Sbjct: 1460 EEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKK 1510



 Score = 30.5 bits (68), Expect = 0.48
 Identities = 26/61 (42%), Positives = 35/61 (57%)

Query: 90   ETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
            E +K   L +    K+ EE   EE  ++ EEEK+ + E+ KK EE K K EE +K EEE 
Sbjct: 1573 EEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEK 1632

Query: 150  K 150
            K
Sbjct: 1633 K 1633



 Score = 29.7 bits (66), Expect = 0.68
 Identities = 26/66 (39%), Positives = 43/66 (65%)

Query: 85   DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
            +++  + E+A    E  IK EE +K EEE+++V++ +K+E EEK+K EE +K + E + K
Sbjct: 1603 EEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIK 1662

Query: 145  EEEETK 150
              EE K
Sbjct: 1663 AAEEAK 1668



 Score = 29.7 bits (66), Expect = 0.79
 Identities = 23/65 (35%), Positives = 38/65 (58%)

Query: 86   DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
            D+AK+  +      D +KK    K++ +E + K EEK++ +E +KK EE K+  E ++K 
Sbjct: 1394 DEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKA 1453

Query: 146  EEETK 150
            EE  K
Sbjct: 1454 EEAKK 1458



 Score = 29.3 bits (65), Expect = 0.95
 Identities = 23/64 (35%), Positives = 40/64 (62%)

Query: 87   QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
            +A E +KA    + +  K+ EEK++ +E + K EE ++ +E +KK EE K+K +  +K+ 
Sbjct: 1279 KADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKA 1338

Query: 147  EETK 150
            EE K
Sbjct: 1339 EEAK 1342



 Score = 29.3 bits (65), Expect = 1.1
 Identities = 19/51 (37%), Positives = 32/51 (62%)

Query: 100  DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            D  KK+ EE ++ +E + K EE +++ ++ KK  E K+K +E +K EE  K
Sbjct: 1473 DEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKK 1523



 Score = 29.0 bits (64), Expect = 1.5
 Identities = 19/51 (37%), Positives = 33/51 (64%)

Query: 100  DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            D  KK+ EEK++ +E + K EE +++ ++ KK    K+K +E +K+ EE K
Sbjct: 1381 DAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKK 1431



 Score = 28.6 bits (63), Expect = 2.1
 Identities = 21/65 (32%), Positives = 39/65 (60%)

Query: 86   DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
            ++AK+   A     +  KK +E K++ EE++ K +E ++    +KK +E K+K EE++K 
Sbjct: 1374 EEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKA 1433

Query: 146  EEETK 150
            +E  K
Sbjct: 1434 DEAKK 1438



 Score = 28.2 bits (62), Expect = 2.2
 Identities = 20/62 (32%), Positives = 38/62 (61%)

Query: 89   KETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
             E E A    E   +K E  ++++EE + K +  +++ E++KK +E K+K EE++K+ +E
Sbjct: 1350 AEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADE 1409

Query: 149  TK 150
             K
Sbjct: 1410 LK 1411



 Score = 27.8 bits (61), Expect = 3.0
 Identities = 21/51 (41%), Positives = 32/51 (62%)

Query: 100  DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            D  KK+ EE ++ +E + K EE ++ EE +KK EE K+  E ++K EE  K
Sbjct: 1434 DEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKK 1484



 Score = 27.8 bits (61), Expect = 3.0
 Identities = 21/51 (41%), Positives = 32/51 (62%)

Query: 100  DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            D  KK+ EE ++ EE + K EE ++ +E +KK EE K+  E ++K EE  K
Sbjct: 1447 DEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKK 1497



 Score = 27.8 bits (61), Expect = 3.6
 Identities = 21/51 (41%), Positives = 33/51 (64%)

Query: 100  DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            D  KK+ EEK++ +E + K EE ++ +E +KK EE K+  E ++K EE  K
Sbjct: 1421 DEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKK 1471



 Score = 27.8 bits (61), Expect = 3.7
 Identities = 18/51 (35%), Positives = 29/51 (56%)

Query: 100  DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            D  KK+ EE ++ +E + K EE +++ +  KK+ EE +K  E  K E E  
Sbjct: 1305 DEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAA 1355



 Score = 27.4 bits (60), Expect = 4.4
 Identities = 20/62 (32%), Positives = 38/62 (61%)

Query: 89   KETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
            K+ E+A    +   KK EE+K+ +E ++  EE+K++ +E +K    +K+  E ++K EE+
Sbjct: 1371 KKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEK 1430

Query: 149  TK 150
             K
Sbjct: 1431 KK 1432



 Score = 27.0 bits (59), Expect = 5.7
 Identities = 21/57 (36%), Positives = 39/57 (68%)

Query: 99   EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
            E+ IK EE +KE EE+++  EE K++EEEK+K    +KE+ ++ E+  +E + ++ +
Sbjct: 1727 ENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEE 1783



 Score = 27.0 bits (59), Expect = 5.8
 Identities = 23/69 (33%), Positives = 41/69 (59%), Gaps = 4/69 (5%)

Query: 86   DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEE----EEKEKMEE 141
            ++AK+ ++A    E+  K +E +K+ EE ++  +E K+  E K+K +E    EE +K +E
Sbjct: 1467 EEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADE 1526

Query: 142  EEKEEEETK 150
             +K EE  K
Sbjct: 1527 AKKAEEAKK 1535



 Score = 27.0 bits (59), Expect = 5.9
 Identities = 17/48 (35%), Positives = 27/48 (56%)

Query: 103  KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            KK EE K+ +E ++  EE K++ +  +KK EE K+  E  + E E   
Sbjct: 1309 KKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAA 1356



 Score = 27.0 bits (59), Expect = 6.4
 Identities = 22/66 (33%), Positives = 37/66 (56%)

Query: 85   DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
            ++  K  E A    E    + E  +E+ E  E K+EE +++ +  KK+ EEK+K +E +K
Sbjct: 1339 EEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKK 1398

Query: 145  EEEETK 150
            + EE K
Sbjct: 1399 KAEEDK 1404



 Score = 26.6 bits (58), Expect = 7.3
 Identities = 22/70 (31%), Positives = 42/70 (60%), Gaps = 5/70 (7%)

Query: 86   DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEE-----EEKEKME 140
            ++ K+ ++A    E++ KK +E K+    ++  +E K++ EEK+K +E     EE +K +
Sbjct: 1388 EEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKAD 1447

Query: 141  EEEKEEEETK 150
            E +K+ EE K
Sbjct: 1448 EAKKKAEEAK 1457



 Score = 26.6 bits (58), Expect = 7.5
 Identities = 22/57 (38%), Positives = 39/57 (68%)

Query: 99   EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
            ED  K EE +K+EEE++++   +KEEE++ E+  +E++  +EEE  EE+E + +  D
Sbjct: 1741 EDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVD 1797



 Score = 26.3 bits (57), Expect = 9.6
 Identities = 19/64 (29%), Positives = 34/64 (53%)

Query: 85   DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
            D+  K+ E+A    +   KK EE K+  E  + + E   +E E  +++ E  EK +EE K
Sbjct: 1318 DEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAK 1377

Query: 145  EEEE 148
            ++ +
Sbjct: 1378 KKAD 1381


>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740).  This
           family consists of several uncharacterized plant
           proteins of unknown function.
          Length = 565

 Score = 36.2 bits (83), Expect = 0.005
 Identities = 22/53 (41%), Positives = 35/53 (66%), Gaps = 2/53 (3%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEE--EEEKEKKEEEEKEKMEEEEKEEEETK 150
           D + +EEEE E EE+EE  E+E  +  +E+ E++EEEE + M++    E +TK
Sbjct: 137 DLVLEEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEELKTMKDFIDLESQTK 189



 Score = 34.2 bits (78), Expect = 0.023
 Identities = 23/50 (46%), Positives = 31/50 (62%), Gaps = 3/50 (6%)

Query: 111 EEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLES 160
            EEEEEV+ EE EE  EKE  +  +++    EE+EEEE K +   + LES
Sbjct: 140 LEEEEEVEMEEDEEYYEKEPGKVVDEKS---EEEEEEELKTMKDFIDLES 186



 Score = 30.0 bits (67), Expect = 0.53
 Identities = 17/46 (36%), Positives = 29/46 (63%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
           E+   +E+EE  E+E  +V +E+ EEEEE+E K  ++   +E + K
Sbjct: 144 EEVEMEEDEEYYEKEPGKVVDEKSEEEEEEELKTMKDFIDLESQTK 189



 Score = 27.3 bits (60), Expect = 4.6
 Identities = 16/43 (37%), Positives = 23/43 (53%), Gaps = 3/43 (6%)

Query: 90  ETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKE 132
           E E+   + ED    E+E  +  +E   K EE+EEEE K  K+
Sbjct: 141 EEEEEVEMEEDEEYYEKEPGKVVDE---KSEEEEEEELKTMKD 180


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 36.0 bits (83), Expect = 0.005
 Identities = 24/45 (53%), Positives = 33/45 (73%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
           +  EEE+EEEEEEE +E+E EEEE ++++EEEE E     E+E E
Sbjct: 443 ESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEME 487



 Score = 36.0 bits (83), Expect = 0.005
 Identities = 24/50 (48%), Positives = 36/50 (72%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           +++ ++E  E+EEEEEEE +EEE+E EEE+ + EEEE+E   +   EEE 
Sbjct: 437 QESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEM 486



 Score = 34.9 bits (80), Expect = 0.013
 Identities = 24/48 (50%), Positives = 38/48 (79%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           ++EE  +EEEEEEE +EEE++E EE+E ++EEE+E++E +   EEE +
Sbjct: 440 EEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEME 487



 Score = 34.9 bits (80), Expect = 0.014
 Identities = 24/48 (50%), Positives = 33/48 (68%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
              +E +EEE  EE +EEE+EEEEE+++ EEEE E  EEEE+ E +  
Sbjct: 434 MASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNG 481



 Score = 34.1 bits (78), Expect = 0.023
 Identities = 26/64 (40%), Positives = 40/64 (62%)

Query: 87  QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
           Q  E E++    E+  ++EEEE++E EEEE ++EE+EEE E +   EEE E   E + + 
Sbjct: 437 QESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDG 496

Query: 147 EETK 150
           EE +
Sbjct: 497 EEPE 500



 Score = 31.8 bits (72), Expect = 0.13
 Identities = 25/67 (37%), Positives = 34/67 (50%)

Query: 89  KETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           +E E      E+  ++EEEE+E EEEE   EEE+EE E     EEE +   E +   EE 
Sbjct: 440 EEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEP 499

Query: 149 TKYLVRD 155
            +   R 
Sbjct: 500 EEDAERR 506



 Score = 30.3 bits (68), Expect = 0.51
 Identities = 19/69 (27%), Positives = 33/69 (47%)

Query: 82  CVLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEE 141
              ++  +E E+     E+  ++ EEE+ E+EEEE + E     EE+ +   E     EE
Sbjct: 439 SEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEE 498

Query: 142 EEKEEEETK 150
            E++ E   
Sbjct: 499 PEEDAERRN 507



 Score = 28.7 bits (64), Expect = 1.7
 Identities = 21/44 (47%), Positives = 31/44 (70%), Gaps = 2/44 (4%)

Query: 112 EEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
           +E EE +  E+EEEEE+E++EEE++   EEEE E+EE +  V  
Sbjct: 437 QESEEEESVEEEEEEEEEEEEEEQES--EEEEGEDEEEEEEVEA 478


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 36.3 bits (84), Expect = 0.005
 Identities = 26/48 (54%), Positives = 36/48 (75%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           K+ E+E  +EEEEE KEE+KEEEE+   KEEE  E+ E+EEK+++  K
Sbjct: 29  KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKK 76



 Score = 35.5 bits (82), Expect = 0.008
 Identities = 24/60 (40%), Positives = 39/60 (65%), Gaps = 2/60 (3%)

Query: 89  KETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           KE EK     E+  +KEE+++EEE+  + +EE  EEEE++EKK  ++ +K++E   E E 
Sbjct: 29  KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKK--KKTKKVKETTTEWEL 86



 Score = 34.0 bits (78), Expect = 0.026
 Identities = 20/43 (46%), Positives = 33/43 (76%)

Query: 108 EKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           EKE E+E   +EEE+E+EE+KE++E+   ++ E +E+EE+E K
Sbjct: 28  EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEK 70



 Score = 32.0 bits (73), Expect = 0.12
 Identities = 21/46 (45%), Positives = 37/46 (80%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
           E  ++KE  ++EEEEE+E K+EE+E+  +KE++ +EE+EK E+++K
Sbjct: 28  EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKK 73



 Score = 26.7 bits (59), Expect = 8.0
 Identities = 19/54 (35%), Positives = 30/54 (55%)

Query: 85  DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
           +++ +E E+     E    KEEE  EEEE+EE K++ K+ +E   + E   K K
Sbjct: 38  EEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLNKTK 91


>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
          Length = 746

 Score = 36.2 bits (84), Expect = 0.005
 Identities = 28/96 (29%), Positives = 41/96 (42%), Gaps = 13/96 (13%)

Query: 68  LC--LQLVEG---KPNGL-----LCVLDDQAKETEKAGVLYEDNIKKEEE---EKEEEEE 114
           LC  L LVEG   + N L     L +LD   K  E      E N          K  +EE
Sbjct: 181 LCLDLDLVEGLLERKNELPHLKKLIILDTLIKSKEININKEEKNNGSNVNNNGNKNNKEE 240

Query: 115 EEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           ++  +   E E+      E +KEK+E+ +  +E+ K
Sbjct: 241 QKGNDLSNELEDISLGPLEYDKEKLEKIKDLKEKAK 276


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 35.6 bits (82), Expect = 0.006
 Identities = 15/62 (24%), Positives = 33/62 (53%)

Query: 87  QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
           + KE ++A  L +    ++E  K+ E+E    +E+K++ EE  K+   ++++ EE   + 
Sbjct: 82  KKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKA 141

Query: 147 EE 148
             
Sbjct: 142 AA 143



 Score = 35.6 bits (82), Expect = 0.007
 Identities = 13/46 (28%), Positives = 33/46 (71%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
            +E+ +K+E+++ E  ++++  E+E+ K+ E+E+   +E++K+ EE
Sbjct: 77  AEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEE 122



 Score = 34.4 bits (79), Expect = 0.016
 Identities = 15/49 (30%), Positives = 30/49 (61%), Gaps = 3/49 (6%)

Query: 105 EEEEKEEEEEEEVKEEEKE---EEEEKEKKEEEEKEKMEEEEKEEEETK 150
           E ++K+  E+E +K+ EKE    +E+K++ EE  K+   ++++ EE   
Sbjct: 91  ELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAA 139



 Score = 33.6 bits (77), Expect = 0.035
 Identities = 18/63 (28%), Positives = 31/63 (49%), Gaps = 1/63 (1%)

Query: 87  QAKETEKAGVLYEDNIKKEEEEKE-EEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
           QA E E+   L ++ +  +E++K+ EE  ++   ++K+ EE   K     K K E E K 
Sbjct: 96  QAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKR 155

Query: 146 EEE 148
              
Sbjct: 156 AAA 158



 Score = 30.2 bits (68), Expect = 0.51
 Identities = 14/52 (26%), Positives = 25/52 (48%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            +  KK E E  ++   E K++ + E   K   E ++K + E ++K   E K
Sbjct: 165 AEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAK 216



 Score = 30.2 bits (68), Expect = 0.53
 Identities = 14/61 (22%), Positives = 26/61 (42%)

Query: 90  ETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
             E       +  KK   E +++ E E   +   E ++K + E ++K   E ++K   E 
Sbjct: 164 AAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEA 223

Query: 150 K 150
           K
Sbjct: 224 K 224



 Score = 29.4 bits (66), Expect = 0.87
 Identities = 8/48 (16%), Positives = 30/48 (62%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           ++ +++++++ EE  +++  E+E  K+ ++E    + ++++ EE   +
Sbjct: 79  EQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQ 126



 Score = 28.6 bits (64), Expect = 1.7
 Identities = 17/67 (25%), Positives = 26/67 (38%), Gaps = 4/67 (5%)

Query: 86  DQAKETEKAGVLYEDNIKKEEEE---KEEEEEEEVKEEEKEEEEEKEKKEEEE-KEKMEE 141
           + AK+        E+   K       K E E +      K+   E +KK E E  +K   
Sbjct: 122 EAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAA 181

Query: 142 EEKEEEE 148
           E K++ E
Sbjct: 182 EAKKKAE 188



 Score = 28.6 bits (64), Expect = 1.7
 Identities = 12/40 (30%), Positives = 26/40 (65%)

Query: 109 KEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           K  EE+ + KE+++ EE ++++  E+E+ K  E+E+   +
Sbjct: 75  KRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQ 114



 Score = 27.1 bits (60), Expect = 5.0
 Identities = 11/48 (22%), Positives = 32/48 (66%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           +++  ++ EE+ ++ ++++ EE ++K+  E+E  +++E+E    +E K
Sbjct: 70  QQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQK 117



 Score = 26.3 bits (58), Expect = 8.4
 Identities = 15/65 (23%), Positives = 25/65 (38%), Gaps = 3/65 (4%)

Query: 87  QAKETEKAGVLYEDNIKKEEEEKEEEEEE---EVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
           Q K+ E+A        K+ EE   +       + + E K      +K   E K+K E E 
Sbjct: 116 QKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEA 175

Query: 144 KEEEE 148
            ++  
Sbjct: 176 AKKAA 180



 Score = 26.3 bits (58), Expect = 8.4
 Identities = 14/50 (28%), Positives = 20/50 (40%), Gaps = 2/50 (4%)

Query: 103 KKEEEEKEEEEEEEVKEEE--KEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           + EE  K+   +++  EE   K     K K E E K      +K   E K
Sbjct: 119 QAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAK 168


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
           Provisional.
          Length = 2849

 Score = 36.2 bits (83), Expect = 0.006
 Identities = 15/49 (30%), Positives = 37/49 (75%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
           ED   ++    ++++E+E ++++ EE++E+E++EEEE +  ++E++E+E
Sbjct: 143 EDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDE 191



 Score = 33.1 bits (75), Expect = 0.061
 Identities = 16/52 (30%), Positives = 37/52 (71%)

Query: 97  LYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           L E+++   +    ++++E+  E++ +EE+++E++EEEE+ K  ++E EE+E
Sbjct: 140 LAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDE 191



 Score = 32.7 bits (74), Expect = 0.085
 Identities = 18/54 (33%), Positives = 37/54 (68%), Gaps = 5/54 (9%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEK-----KEEEEKEKMEEEEKEEEE 148
           DN   ++++++E+E+++ +E+++EEEEE+E+      E+EE E  E+   E+ E
Sbjct: 149 DNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYEKSE 202



 Score = 31.9 bits (72), Expect = 0.15
 Identities = 18/51 (35%), Positives = 34/51 (66%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
           +E+E E++++EE  EEE+EEEEE +  ++E++E    E+   E+++    D
Sbjct: 157 DEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYEKSEVDKTD 207



 Score = 29.6 bits (66), Expect = 0.89
 Identities = 15/58 (25%), Positives = 32/58 (55%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLE 159
           IK+       EE+   ++    +++++++ E+++ E+ +EEE+EEEE      D   E
Sbjct: 132 IKRRRARHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEE 189



 Score = 29.6 bits (66), Expect = 0.94
 Identities = 20/68 (29%), Positives = 39/68 (57%), Gaps = 6/68 (8%)

Query: 84  LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKE------EEEEKEKKEEEEKE 137
           L ++        V+ +D+  ++E++ +EE++EE +EEE+E      E+EE E  E+   E
Sbjct: 140 LAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYE 199

Query: 138 KMEEEEKE 145
           K E ++ +
Sbjct: 200 KSEVDKTD 207



 Score = 26.9 bits (59), Expect = 7.0
 Identities = 12/47 (25%), Positives = 27/47 (57%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           I K    +   EE+    +    +++ E ++E++ ++ ++EE+EEEE
Sbjct: 131 IIKRRRARHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEE 177


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 35.7 bits (83), Expect = 0.007
 Identities = 12/65 (18%), Positives = 39/65 (60%)

Query: 84  LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
           + +QA + +      + +    +++ ++E+++E  +++  ++E++EKKE +E EK+ +++
Sbjct: 126 VLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDD 185

Query: 144 KEEEE 148
               +
Sbjct: 186 DFVWD 190



 Score = 34.6 bits (80), Expect = 0.015
 Identities = 14/64 (21%), Positives = 39/64 (60%)

Query: 85  DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
            D   + +    L +D+I  ++++++++E+++  + + E+EE+KE KE E+    ++   
Sbjct: 130 ADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVW 189

Query: 145 EEEE 148
           +E++
Sbjct: 190 DEDD 193



 Score = 31.5 bits (72), Expect = 0.16
 Identities = 8/64 (12%), Positives = 36/64 (56%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHL 158
           +D+   ++++ ++++ ++  ++E ++E++ +   ++E E+ +E ++ E+ +         
Sbjct: 132 DDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDE 191

Query: 159 ESCE 162
           +  E
Sbjct: 192 DDSE 195



 Score = 31.1 bits (71), Expect = 0.23
 Identities = 15/67 (22%), Positives = 36/67 (53%)

Query: 84  LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
           LDD   + +      +++   ++ + E+EE++E KE EK  +++    +E++ E + +  
Sbjct: 142 LDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQAR 201

Query: 144 KEEEETK 150
           K+ + T 
Sbjct: 202 KDAKLTA 208



 Score = 30.7 bits (70), Expect = 0.27
 Identities = 11/63 (17%), Positives = 29/63 (46%)

Query: 83  VLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
             DD   +        ED+ + ++++  ++E+EE KE ++ E+   +     +++  E  
Sbjct: 138 DDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEAL 197

Query: 143 EKE 145
            + 
Sbjct: 198 RQA 200



 Score = 30.7 bits (70), Expect = 0.28
 Identities = 12/56 (21%), Positives = 34/56 (60%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHL 158
             +E++ E++++++V +E++E++E KE ++  + +    +E + E  +   +D  L
Sbjct: 151 DDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDAKL 206



 Score = 30.3 bits (69), Expect = 0.38
 Identities = 11/55 (20%), Positives = 33/55 (60%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLV 153
           +D+  ++++E +++++ + ++EEK+E +E EK  +++    +E++ E        
Sbjct: 149 DDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKD 203



 Score = 30.0 bits (68), Expect = 0.51
 Identities = 9/66 (13%), Positives = 36/66 (54%)

Query: 85  DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
            D      K   +       ++++ +++ +++  +++ ++E++ E  ++++ +  +EE+K
Sbjct: 114 KDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKK 173

Query: 145 EEEETK 150
           E +E +
Sbjct: 174 EAKELE 179



 Score = 29.6 bits (67), Expect = 0.67
 Identities = 6/64 (9%), Positives = 37/64 (57%)

Query: 85  DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
            D+  +      +   N   ++++ +++++ +  + + ++++E + +++++ +  +E+E+
Sbjct: 112 LDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEE 171

Query: 145 EEEE 148
           ++E 
Sbjct: 172 KKEA 175



 Score = 28.8 bits (65), Expect = 1.4
 Identities = 9/63 (14%), Positives = 36/63 (57%)

Query: 86  DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
           D     +      +D+    +++  ++++++  ++E +++++ + ++EE+KE  E E+  
Sbjct: 123 DIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLS 182

Query: 146 EEE 148
           +++
Sbjct: 183 DDD 185



 Score = 27.3 bits (61), Expect = 5.0
 Identities = 11/71 (15%), Positives = 33/71 (46%)

Query: 85  DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
           DD   + +      +D+    ++E EE++E +  E+  ++++    +++ E  +   ++ 
Sbjct: 145 DDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDA 204

Query: 145 EEEETKYLVRD 155
           +   T   V+ 
Sbjct: 205 KLTATADPVKA 215



 Score = 26.9 bits (60), Expect = 5.5
 Identities = 10/59 (16%), Positives = 35/59 (59%)

Query: 84  LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
            DD   + +   +  +D+ + ++E+ ++++ ++  EE+KE +E ++  ++++    E++
Sbjct: 135 DDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDD 193



 Score = 26.5 bits (59), Expect = 8.6
 Identities = 8/64 (12%), Positives = 40/64 (62%)

Query: 85  DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
           +   K+ +   V   D + + +++ ++++++++ +++ +++++ E  +E++ +   ++E 
Sbjct: 110 NALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDED 169

Query: 145 EEEE 148
           EE++
Sbjct: 170 EEKK 173


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 35.8 bits (83), Expect = 0.007
 Identities = 15/56 (26%), Positives = 34/56 (60%)

Query: 93  KAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
             G+  ED        K +++EE  +E+E++E  +++K+  + K++ E+++KE E+
Sbjct: 530 NLGIRIEDKPDGPSVWKLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEK 585



 Score = 28.8 bits (65), Expect = 1.2
 Identities = 12/43 (27%), Positives = 25/43 (58%)

Query: 89  KETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKK 131
           K  +K  +  E   K+  +E++   + + +EE+K++E EK +K
Sbjct: 546 KLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 35.2 bits (82), Expect = 0.009
 Identities = 15/41 (36%), Positives = 26/41 (63%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
            K+  ++E +E    + +    EEE+E++EEEE+E+  EEE
Sbjct: 280 DKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320



 Score = 34.5 bits (80), Expect = 0.013
 Identities = 15/40 (37%), Positives = 25/40 (62%)

Query: 109 KEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
            ++  +EE+KE    + +    +EEEE+E+ EEEE+  EE
Sbjct: 280 DKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEE 319



 Score = 34.5 bits (80), Expect = 0.016
 Identities = 14/35 (40%), Positives = 22/35 (62%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEE 134
           + +K+    + +    E +EEE+EEEEE+E  EEE
Sbjct: 286 EELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320



 Score = 33.3 bits (77), Expect = 0.034
 Identities = 14/39 (35%), Positives = 25/39 (64%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
           +D + +E +E    + +    EE+EEEEE+E++EE  +E
Sbjct: 281 KDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEE 319



 Score = 32.9 bits (76), Expect = 0.046
 Identities = 17/43 (39%), Positives = 28/43 (65%), Gaps = 2/43 (4%)

Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           +++  +EE +EV   + +    +E++EEEE+E  EEEE  EEE
Sbjct: 280 DKDALDEELKEVLSAQAQAAAAEEEEEEEEEE--EEEEPSEEE 320



 Score = 31.4 bits (72), Expect = 0.15
 Identities = 15/44 (34%), Positives = 25/44 (56%)

Query: 94  AGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
           A  L + +   EE ++    + +    E+EEEEE+E++EEE  E
Sbjct: 275 AAALADKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSE 318



 Score = 27.1 bits (61), Expect = 4.8
 Identities = 17/37 (45%), Positives = 23/37 (62%)

Query: 84  LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEE 120
           LD++ KE   A        ++EEEE+EEEEEE  +EE
Sbjct: 284 LDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320



 Score = 26.8 bits (60), Expect = 5.6
 Identities = 14/38 (36%), Positives = 23/38 (60%), Gaps = 1/38 (2%)

Query: 113 EEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           +++ + EE KE    + +    E+E+ EEEE+EEEE  
Sbjct: 280 DKDALDEELKEVLSAQAQAAAAEEEE-EEEEEEEEEEP 316


>gnl|CDD|233870 TIGR02442, Cob-chelat-sub, cobaltochelatase subunit.
           Cobaltochelatase is responsible for the insertion of
           cobalt into the corrin ring of coenzyme B12 during its
           biosynthesis. Two versions have been well described.
           CbiK/CbiX is a monomeric, anaerobic version which acts
           early in the biosynthesis (pfam06180). CobNST is a
           trimeric, ATP-dependent, aerobic version which acts late
           in the biosynthesis (TIGR02257/TIGR01650/TIGR01651). A
           number of genomes (actinobacteria, cyanobacteria,
           betaproteobacteria and pseudomonads) which apparently
           biosynthesize B12, encode a cobN gene but are
           demonstrably lacking cobS and cobT. These genomes do,
           however contain a homolog (modelled here) of the
           magnesium chelatase subunits BchI/BchD family. Aside
           from the cyanobacteria (which have a separate magnesium
           chelatase trimer), these species do not make chlorins,
           so do not have any use for a magnesium chelatase.
           Furthermore, in nearly all cases the members of this
           family are proximal to either CobN itself or other genes
           involved in cobalt transport or B12 biosynthesis.
          Length = 633

 Score = 35.4 bits (82), Expect = 0.010
 Identities = 13/52 (25%), Positives = 25/52 (48%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           E    K+E++ EE+ EE   + EK +E E + ++   +        +E+E  
Sbjct: 315 EQPQGKDEKDLEEKPEEPGPDPEKPDEGEDDAEQSGPRGHPTPGNDDEKEPD 366


>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
          Length = 746

 Score = 35.2 bits (82), Expect = 0.010
 Identities = 16/52 (30%), Positives = 25/52 (48%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           E+  +KEE ++E EEE + + EE    E +E+ + E       EE  E    
Sbjct: 40  EEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAA 91



 Score = 32.5 bits (75), Expect = 0.074
 Identities = 19/39 (48%), Positives = 22/39 (56%)

Query: 112 EEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           EEEE  KEE K E EE+ K E EE    E EE+ + E  
Sbjct: 39  EEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAA 77



 Score = 32.1 bits (74), Expect = 0.12
 Identities = 19/45 (42%), Positives = 22/45 (48%), Gaps = 1/45 (2%)

Query: 105 EEEEKE-EEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           EEEE   EE + E +EE K E EE    E EE+ K E       E
Sbjct: 39  EEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAE 83



 Score = 30.6 bits (70), Expect = 0.31
 Identities = 15/40 (37%), Positives = 21/40 (52%)

Query: 111 EEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           EEEE   +E ++E EEE + + EE      EEE + E   
Sbjct: 39  EEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAA 78



 Score = 29.4 bits (67), Expect = 0.77
 Identities = 15/41 (36%), Positives = 21/41 (51%)

Query: 110 EEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           EEEE  + + + + EEE K + EE    + EEE K E    
Sbjct: 39  EEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAA 79



 Score = 29.0 bits (66), Expect = 1.1
 Identities = 15/34 (44%), Positives = 20/34 (58%)

Query: 117 VKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           V+EEE  +EE K + EEE K + EE    E E +
Sbjct: 38  VEEEEARKEEAKREAEEEAKAEAEEAAAAEAEEE 71


>gnl|CDD|218688 pfam05680, ATP-synt_E, ATP synthase E chain.  This family consists
           of several ATP synthase E chain sequences which are
           components of the CF(0) subunit.
          Length = 83

 Score = 33.2 bits (76), Expect = 0.012
 Identities = 16/58 (27%), Positives = 32/58 (55%), Gaps = 1/58 (1%)

Query: 94  AGVLYEDNIKKEEEEKEE-EEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            G  ++ ++KK+EE+  E E +E++ E+ K    +K++ ++E K   E      +ET 
Sbjct: 24  YGAKHQRSLKKKEEKIREYEAQEKLIEKAKAAYAKKKQAKQEAKSLAEGSSAPFDETP 81


>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
          Length = 1036

 Score = 35.2 bits (81), Expect = 0.012
 Identities = 22/59 (37%), Positives = 32/59 (54%), Gaps = 5/59 (8%)

Query: 97  LYEDNIKKE-----EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           +Y++N  K+     E   +E   E+   EEK  E EK  KEE E+E+  EE++  EE K
Sbjct: 224 VYDNNDHKDFCVEIEGGMDEHSFEDFLLEEKRRELEKLAKEEAERERQAEEQRRREEEK 282



 Score = 31.4 bits (71), Expect = 0.20
 Identities = 16/56 (28%), Positives = 29/56 (51%), Gaps = 3/56 (5%)

Query: 106 EEEKEEEEEEEVK---EEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHL 158
           EE    EEE ++    +EE+   EE  + + E+  +M+ E KE+    +L+   H+
Sbjct: 424 EELYWVEEEHQIYRKLQEERRLREEAIRAKAEKTARMKAEMKEKTLKMFLLSQKHI 479



 Score = 29.8 bits (67), Expect = 0.68
 Identities = 19/55 (34%), Positives = 34/55 (61%), Gaps = 5/55 (9%)

Query: 107 EEKEEEEEEEVKEE-EKEEEEEKEKKEEEEKEKME----EEEKEEEETKYLVRDV 156
           EEK  E E+  KEE E+E + E++++ EEEK  ME    + + E E+ +  ++++
Sbjct: 252 EEKRRELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRREKLQNL 306



 Score = 28.3 bits (63), Expect = 1.9
 Identities = 19/48 (39%), Positives = 24/48 (50%), Gaps = 3/48 (6%)

Query: 103 KKEEEEK---EEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
           K+ E EK   EE E E   EE++  EEEK   E +  +   E EK  E
Sbjct: 254 KRRELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRRE 301


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 34.3 bits (79), Expect = 0.014
 Identities = 14/59 (23%), Positives = 36/59 (61%)

Query: 93  KAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
                 + + K    EK+ +E E    +E++E++E+E++EEEE+++  +++ ++++  Y
Sbjct: 142 FTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDDDDY 200



 Score = 33.2 bits (76), Expect = 0.038
 Identities = 19/49 (38%), Positives = 32/49 (65%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
             +  EEEE  +E+   ++++ KE E E   +E+E+ E+ EEEE+EE+E
Sbjct: 139 VGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDE 187



 Score = 32.8 bits (75), Expect = 0.050
 Identities = 16/62 (25%), Positives = 36/62 (58%)

Query: 87  QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
              + ++   L+ +  +  +E+    E++  + E ++ +EE EK EEEE+E+ EE+E  +
Sbjct: 131 ALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFD 190

Query: 147 EE 148
           ++
Sbjct: 191 DD 192


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 33.9 bits (78), Expect = 0.018
 Identities = 17/48 (35%), Positives = 36/48 (75%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            ++E E EEEE++E K++++ ++E+KEKK+++EK    +  K++++ K
Sbjct: 143 VEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKK 190



 Score = 33.9 bits (78), Expect = 0.019
 Identities = 18/48 (37%), Positives = 33/48 (68%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            + E  ++E   +  KE E EEEE+KEKK+++E +K ++E+K+++E  
Sbjct: 130 SESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKM 177



 Score = 32.4 bits (74), Expect = 0.058
 Identities = 20/53 (37%), Positives = 35/53 (66%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
           E     + E++ E EEEE KE++K++E +KEKKE+++K++   E K  ++ K 
Sbjct: 136 EKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKK 188



 Score = 32.4 bits (74), Expect = 0.071
 Identities = 17/46 (36%), Positives = 29/46 (63%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            E   E E  E+    + E+E E E++E++EK+K +E +KE++E K
Sbjct: 126 SELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKK 171



 Score = 32.0 bits (73), Expect = 0.083
 Identities = 19/64 (29%), Positives = 44/64 (68%)

Query: 83  VLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
             + +  E E    + ++   +EEE+KE+++++EVK+E+KE++++KEK  E +  K +++
Sbjct: 129 GSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKK 188

Query: 143 EKEE 146
           +K++
Sbjct: 189 KKKK 192



 Score = 32.0 bits (73), Expect = 0.085
 Identities = 17/48 (35%), Positives = 36/48 (75%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           K+ E E+EE++E++ K+E K+E++EK+ K+E+  E    ++K++++ K
Sbjct: 145 KEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKK 192



 Score = 31.6 bits (72), Expect = 0.099
 Identities = 17/49 (34%), Positives = 36/49 (73%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           +  E E  E+E   +V++E + EEEEK++K+++++ K E++EK++++ K
Sbjct: 128 LGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEK 176



 Score = 29.7 bits (67), Expect = 0.57
 Identities = 15/53 (28%), Positives = 29/53 (54%)

Query: 98  YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           Y        E   E E  E +   K E+E + ++EE++++K ++E K+E++ K
Sbjct: 118 YGAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEK 170



 Score = 28.5 bits (64), Expect = 1.2
 Identities = 18/48 (37%), Positives = 32/48 (66%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            +   E E  E+E   + EKE E E+E+K+E++K+K  ++EK+E++ K
Sbjct: 126 SELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDK 173


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 34.0 bits (78), Expect = 0.024
 Identities = 24/69 (34%), Positives = 40/69 (57%), Gaps = 5/69 (7%)

Query: 85  DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEK-----EEEEEKEKKEEEEKEKM 139
           ++  +E+ K  V+ ED   +     E+E+E+E K   +     EE EEKEK++ +  +KM
Sbjct: 242 EESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKKM 301

Query: 140 EEEEKEEEE 148
            E+E E+EE
Sbjct: 302 MEDEDEDEE 310



 Score = 30.6 bits (69), Expect = 0.34
 Identities = 26/76 (34%), Positives = 41/76 (53%), Gaps = 3/76 (3%)

Query: 74  EGKPNGLLCVLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEE 133
           E KP+G     +++ +E EK        +KK  E+++E+EE E+  E   EEEE E+ E 
Sbjct: 273 EPKPSGERSDSEEETEEKEKEK---RKRLKKMMEDEDEDEEMEIVPESPVEEEESEEPEP 329

Query: 134 EEKEKMEEEEKEEEET 149
               K EEE++E   +
Sbjct: 330 PPLPKKEEEKEEVTVS 345



 Score = 29.0 bits (65), Expect = 0.99
 Identities = 20/64 (31%), Positives = 36/64 (56%)

Query: 84  LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
           LD+   E E        + ++E EEKE+E+ + +K+  ++E+E++E +   E    EEE 
Sbjct: 265 LDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKKMMEDEDEDEEMEIVPESPVEEEES 324

Query: 144 KEEE 147
           +E E
Sbjct: 325 EEPE 328



 Score = 27.9 bits (62), Expect = 2.6
 Identities = 15/48 (31%), Positives = 25/48 (52%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           KKE++E  E   +E  EEE  + +   + E  E   ++E+E E+E   
Sbjct: 229 KKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKP 276


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This is
           a family of uncharacterized proteins.
          Length = 449

 Score = 33.8 bits (77), Expect = 0.025
 Identities = 14/57 (24%), Positives = 39/57 (68%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
           E+ I+K  +E EEE  EE  ++E++++++ ++ E+++ +  +E++++E++    + D
Sbjct: 35  ENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHD 91



 Score = 33.0 bits (75), Expect = 0.049
 Identities = 15/53 (28%), Positives = 34/53 (64%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
           K  +E +EE  EEE  +EE +++++ E +++++ +  E++E E+++   L  D
Sbjct: 40  KLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDD 92



 Score = 30.0 bits (67), Expect = 0.51
 Identities = 16/61 (26%), Positives = 38/61 (62%), Gaps = 3/61 (4%)

Query: 100 DNIKKE---EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDV 156
           D +KKE    +  +E EEE ++EE+ +EE++ +  +E+E +  +++++++E+       +
Sbjct: 30  DTMKKENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTL 89

Query: 157 H 157
           H
Sbjct: 90  H 90


>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3.  This family of
           proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). SH3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of SH3, AAPs are retained in the ER.
          Length = 196

 Score = 33.5 bits (77), Expect = 0.025
 Identities = 13/39 (33%), Positives = 21/39 (53%)

Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
           E K+ +  EE   EE E     +++E  E EK E ++K+
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 32.7 bits (75), Expect = 0.049
 Identities = 14/35 (40%), Positives = 19/35 (54%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
            K  EE   EE E     ++EE  E EKKE ++K+
Sbjct: 162 AKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 32.3 bits (74), Expect = 0.072
 Identities = 14/37 (37%), Positives = 21/37 (56%), Gaps = 2/37 (5%)

Query: 104 KEEEEKEE--EEEEEVKEEEKEEEEEKEKKEEEEKEK 138
           K+ +  EE   EE E     KEEE  + +K+E +K+K
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 27.7 bits (62), Expect = 2.6
 Identities = 12/39 (30%), Positives = 20/39 (51%)

Query: 112 EEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           E ++    EE   EE +     +E+E  E E+KE ++ K
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196


>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
           subunit.  The alpha-subunit of the TRAP complex (TRAP
           alpha) is a single-spanning membrane protein of the
           endoplasmic reticulum (ER) which is found in proximity
           of nascent polypeptide chains translocating across the
           membrane.
          Length = 281

 Score = 33.6 bits (77), Expect = 0.026
 Identities = 18/44 (40%), Positives = 25/44 (56%)

Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            +   E+EE   +   E+EE++   EE+E E  EEEE EE E K
Sbjct: 31  AQDLTEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGEVK 74



 Score = 32.8 bits (75), Expect = 0.049
 Identities = 16/60 (26%), Positives = 31/60 (51%)

Query: 77  PNGLLCVLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEK 136
           P  LL   + +        +  ++  + +  +++EE+E  V+E+E E  EE+E +E E K
Sbjct: 15  PAVLLTFGNGKGPFASAQDLTEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGEVK 74


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score = 33.9 bits (78), Expect = 0.027
 Identities = 13/48 (27%), Positives = 25/48 (52%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
             E+E++EEE+  +  EEE+ E+ E+E  +  E    E       +++
Sbjct: 370 VDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSE 417



 Score = 32.8 bits (75), Expect = 0.057
 Identities = 12/51 (23%), Positives = 27/51 (52%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
            + + ++E+E+EE+  +E +EEE E+ EE+  +  E+       +   +  
Sbjct: 367 FEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSE 417



 Score = 32.8 bits (75), Expect = 0.070
 Identities = 16/49 (32%), Positives = 30/49 (61%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           D I  EE +++E+EEEE + +E EEEE ++ +EE  + + +   +   +
Sbjct: 363 DPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSD 411



 Score = 32.0 bits (73), Expect = 0.10
 Identities = 15/50 (30%), Positives = 24/50 (48%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           ED  +++  ++ EEEE E  EEE  +  E    E       + E K ++E
Sbjct: 374 EDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKE 423



 Score = 32.0 bits (73), Expect = 0.12
 Identities = 14/54 (25%), Positives = 30/54 (55%)

Query: 97  LYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           L   + ++ +E+++EEEE+   E E+EE E+ E++  + +E    E   +  + 
Sbjct: 362 LDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSD 415



 Score = 30.1 bits (68), Expect = 0.53
 Identities = 12/50 (24%), Positives = 23/50 (46%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           + + + EEEE E+ EEE  +  E    E       + + K ++E   + +
Sbjct: 380 QRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSD 429



 Score = 29.3 bits (66), Expect = 0.98
 Identities = 16/46 (34%), Positives = 25/46 (54%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
            K     +  + EEV E+E EEEE++  + EEE+ +  EEE  +  
Sbjct: 356 DKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSR 401



 Score = 28.9 bits (65), Expect = 1.1
 Identities = 15/61 (24%), Positives = 32/61 (52%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLE 159
           D  +   +  + EE +E ++EE+E+  ++ ++EE E  + E  +  E+ +     DV  +
Sbjct: 356 DKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSD 415

Query: 160 S 160
           S
Sbjct: 416 S 416



 Score = 27.4 bits (61), Expect = 3.6
 Identities = 12/44 (27%), Positives = 26/44 (59%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           +  +K     + +  EE +E+E++E+++  ++ + EE E  EEE
Sbjct: 353 KMRDKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEE 396



 Score = 27.4 bits (61), Expect = 4.0
 Identities = 12/46 (26%), Positives = 19/46 (41%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           ++EE E  EEE  + +E+   E       + E K   E     + E
Sbjct: 386 EEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSDSE 431



 Score = 27.0 bits (60), Expect = 4.8
 Identities = 14/49 (28%), Positives = 26/49 (53%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
               K  +++    +    EE  E+E+E+E++  +E E+ E E+ EEE 
Sbjct: 349 TKESKMRDKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEG 397


>gnl|CDD|218311 pfam04888, SseC, Secretion system effector C (SseC) like family.
           SseC is a secreted protein that forms a complex together
           with SecB and SecD on the surface of Salmonella. All
           these proteins are secreted by the type III secretion
           system. Many mucosal pathogens use type III secretion
           systems for the injection of effector proteins into
           target cells. SecB, SseC and SecD are inserted into the
           target cell membrane. where they form a small pore or
           translocon. In addition to SseC, this family includes
           the bacterial secreted proteins PopB, PepB, YopB and
           EspD which are thought to be directly involved in pore
           formation, and type III secretion system translocon.
          Length = 303

 Score = 33.6 bits (77), Expect = 0.028
 Identities = 13/43 (30%), Positives = 28/43 (65%), Gaps = 1/43 (2%)

Query: 109 KEEEEEEEVKEEEKEE-EEEKEKKEEEEKEKMEEEEKEEEETK 150
           K  E++ + K ++ E   +++EKK EE +E++++  ++ EE K
Sbjct: 11  KLAEKQAKSKLQQLERARDKQEKKAEEYQEQIKKAIEKAEEAK 53


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 33.0 bits (76), Expect = 0.030
 Identities = 24/55 (43%), Positives = 36/55 (65%), Gaps = 1/55 (1%)

Query: 97  LYEDNIKKEEEEKEEEEEEEVKEEEKEEEE-EKEKKEEEEKEKMEEEEKEEEETK 150
           L  D  +KEEEE+EE E EE+ EEE+ +E  EKE  + + +++ E E K++E  K
Sbjct: 94  LGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILK 148



 Score = 28.4 bits (64), Expect = 1.3
 Identities = 18/50 (36%), Positives = 32/50 (64%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYL 152
           ++EE E EE +EEE  +E  E+E  K K+E+  + + +++E  +E+ K L
Sbjct: 105 EEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILKEQMKML 154



 Score = 26.5 bits (59), Expect = 5.1
 Identities = 15/48 (31%), Positives = 29/48 (60%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           KK  +    +++E+ +EEE+E E E+  +EE+  E +E+E  + +  K
Sbjct: 88  KKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREK 135


>gnl|CDD|235396 PRK05299, rpsB, 30S ribosomal protein S2; Provisional.
          Length = 258

 Score = 33.2 bits (77), Expect = 0.032
 Identities = 17/36 (47%), Positives = 25/36 (69%)

Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
           E ++    E  +EEE+E EEE+E++EEEE E+ E E
Sbjct: 223 EGRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258



 Score = 30.9 bits (71), Expect = 0.22
 Identities = 17/34 (50%), Positives = 23/34 (67%)

Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
           ++    E  EEEE + EE+EEEEE+E+ EE E E
Sbjct: 225 RQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258



 Score = 30.5 bits (70), Expect = 0.29
 Identities = 16/34 (47%), Positives = 23/34 (67%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEK 136
           ++    +  EEEEE  EEE+EEEEE+E +E E +
Sbjct: 225 RQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258



 Score = 30.1 bits (69), Expect = 0.44
 Identities = 14/33 (42%), Positives = 22/33 (66%)

Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
           +    E  EEE +E E+EEEEE+E++ EE + +
Sbjct: 226 QGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258



 Score = 29.7 bits (68), Expect = 0.62
 Identities = 17/39 (43%), Positives = 25/39 (64%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
           D I +  + +  E  EE +EE +EEEEE+E++E EE E 
Sbjct: 219 DAILEGRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEA 257



 Score = 29.0 bits (66), Expect = 1.0
 Identities = 16/31 (51%), Positives = 22/31 (70%)

Query: 118 KEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           +  E  EEEE+E +EEEE+E+ EE E+ E E
Sbjct: 228 RLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258



 Score = 28.2 bits (64), Expect = 1.8
 Identities = 17/31 (54%), Positives = 20/31 (64%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKE 129
                 EEEE+E EEEEE +EEE+ EE E E
Sbjct: 228 RLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258



 Score = 28.2 bits (64), Expect = 2.1
 Identities = 15/33 (45%), Positives = 23/33 (69%)

Query: 114 EEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
           +  + E  +EEEEE E++EEEE+E+  EE + E
Sbjct: 226 QGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258



 Score = 27.8 bits (63), Expect = 2.1
 Identities = 14/33 (42%), Positives = 21/33 (63%)

Query: 112 EEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
           +       EE+EEE E+E++EEEE+E  E E +
Sbjct: 226 QGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258



 Score = 27.8 bits (63), Expect = 2.2
 Identities = 14/35 (40%), Positives = 22/35 (62%)

Query: 116 EVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           E ++    E  E+E++E EE+E+ EEEE+ EE   
Sbjct: 223 EGRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEA 257



 Score = 27.8 bits (63), Expect = 2.3
 Identities = 16/33 (48%), Positives = 22/33 (66%)

Query: 115 EEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
           +    E  EEEEE+ ++EEEE+E+ E EE E E
Sbjct: 226 QGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258



 Score = 27.4 bits (62), Expect = 2.8
 Identities = 15/33 (45%), Positives = 22/33 (66%)

Query: 111 EEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
           +    E  EEE+EE EE+E++EEEE+ +  E E
Sbjct: 226 QGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258



 Score = 27.4 bits (62), Expect = 3.2
 Identities = 15/33 (45%), Positives = 22/33 (66%)

Query: 118 KEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           +    E  EE+E++ EEE+E+ EEEE EE E +
Sbjct: 226 QGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258



 Score = 26.7 bits (60), Expect = 6.2
 Identities = 17/34 (50%), Positives = 23/34 (67%)

Query: 93  KAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEE 126
           + G L E   ++EEE +EEEEEEE +E E+ E E
Sbjct: 225 RQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 33.7 bits (77), Expect = 0.034
 Identities = 21/60 (35%), Positives = 34/60 (56%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLE 159
           D +KKEEE   +E EE  KE++  + E  E +EE+E+ + EE +   E   +   ++ LE
Sbjct: 73  DELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLENEELQYLREYNLFDRNNLQLE 132


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 33.6 bits (77), Expect = 0.038
 Identities = 16/44 (36%), Positives = 30/44 (68%)

Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
           K+E +  EE E + KEE+ ++EE ++K++E+  E  E+ +K+E 
Sbjct: 389 KDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDER 432



 Score = 33.2 bits (76), Expect = 0.044
 Identities = 18/54 (33%), Positives = 36/54 (66%), Gaps = 1/54 (1%)

Query: 98  YEDNIKKEEEEKEEEEEEEVK-EEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           Y   + + ++E +  EE E K +EEK ++EE EKK++E+ ++ +E+ +++E  K
Sbjct: 381 YTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434



 Score = 29.8 bits (67), Expect = 0.70
 Identities = 14/46 (30%), Positives = 27/46 (58%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            ++ ++  ++    ++E +  EE E K +EEK K EE EK+++E  
Sbjct: 375 NKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQA 420



 Score = 29.0 bits (65), Expect = 1.3
 Identities = 12/51 (23%), Positives = 28/51 (54%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
              K++E  KE  ++ +   ++  E +++    EE + K +EE+ ++EE +
Sbjct: 363 SGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENE 413


>gnl|CDD|236277 PRK08493, PRK08493, NADH dehydrogenase subunit G; Validated.
          Length = 819

 Score = 33.5 bits (77), Expect = 0.040
 Identities = 22/103 (21%), Positives = 47/103 (45%), Gaps = 2/103 (1%)

Query: 63  FSDNTLCLQLVEGKPNGLLCVLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEK 122
            S N  C+    G    +L  L  +  E E      E+  +   +E      EE++E+  
Sbjct: 417 LSKNFFCITHEVGAEEIILYFLLKKFLEEEAILKSLEEFKQSIVKEAALSILEEIREKVL 476

Query: 123 EEEEEKEKKEEEEKEKMEEEEKE--EEETKYLVRDVHLESCEF 163
           E+ E+  + +EE K+++ ++ K+  E +T  L+ ++ +    +
Sbjct: 477 EQAEQGCENQEEVKKEVPKKVKKIPEVDTYLLLEELGINEETY 519


>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein.  Hsp70 chaperones help to fold
           many proteins. Hsp70 assisted folding involves repeated
           cycles of substrate binding and release. Hsp70 activity
           is ATP dependent. Hsp70 proteins are made up of two
           regions: the amino terminus is the ATPase domain and the
           carboxyl terminus is the substrate binding region.
          Length = 598

 Score = 33.4 bits (77), Expect = 0.045
 Identities = 27/89 (30%), Positives = 44/89 (49%), Gaps = 15/89 (16%)

Query: 77  PNGLLCV------LDDQAKETEKA-GVLYEDNIKK--EEEEKEEEEEEEVKE--EEKEEE 125
            NG+L V         + K T  A   L +D I++  ++ E+   E+++ KE  E K E 
Sbjct: 474 ANGILTVSAKDKGTGKEQKITITASSGLSDDEIERMVKDAEEYAAEDKKRKERIEAKNEA 533

Query: 126 EE----KEKKEEEEKEKMEEEEKEEEETK 150
           EE     EK  +EE +K+ E +K++ E  
Sbjct: 534 EEYVYSLEKSLKEEGDKLPEADKKKVEEA 562


>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
           TAF4 family.  This region of similarity is found in
           Transcription initiation factor TFIID component TAF4.
          Length = 255

 Score = 33.1 bits (76), Expect = 0.046
 Identities = 16/40 (40%), Positives = 25/40 (62%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEE 141
            +KEEEE+  E   E+  E+ E+   K+K +EE+K + EE
Sbjct: 121 KQKEEEERRVERRRELGLEDPEQLRLKQKAKEEQKAESEE 160



 Score = 30.1 bits (68), Expect = 0.46
 Identities = 16/43 (37%), Positives = 25/43 (58%), Gaps = 3/43 (6%)

Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEE-KEKMEEEEKEEEE 148
            +K++EEEE  +  E+  E   E  E+   K+K +EE+K E E
Sbjct: 119 AQKQKEEEE--RRVERRRELGLEDPEQLRLKQKAKEEQKAESE 159



 Score = 28.5 bits (64), Expect = 1.7
 Identities = 14/43 (32%), Positives = 22/43 (51%), Gaps = 1/43 (2%)

Query: 101 NIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
             K++EEE+   E       E  E+   ++K +EE +K E EE
Sbjct: 119 AQKQKEEEERRVERRRELGLEDPEQLRLKQKAKEE-QKAESEE 160


>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3.  This family
           of proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). Shr3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of Shr3, AAPs are retained in the ER.
          Length = 196

 Score = 32.7 bits (75), Expect = 0.047
 Identities = 11/39 (28%), Positives = 23/39 (58%)

Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
           E K+ +++EE   EE++E      K+    +K+E ++K+
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196



 Score = 32.0 bits (73), Expect = 0.095
 Identities = 10/39 (25%), Positives = 19/39 (48%)

Query: 112 EEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           E ++  ++EE   EE KE      K+    ++ E ++ K
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196



 Score = 31.2 bits (71), Expect = 0.18
 Identities = 13/54 (24%), Positives = 22/54 (40%), Gaps = 12/54 (22%)

Query: 94  AGVL----------YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
            GVL           +D  +KEE   EE +E        ++    +K E ++K+
Sbjct: 145 VGVLVLQAGLWYAERKDAKQKEEFAAEERKEA--LAAAAKKSATPQKVETKKKK 196



 Score = 28.1 bits (63), Expect = 1.9
 Identities = 10/36 (27%), Positives = 19/36 (52%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
           +K+ ++KEE   EE KE      ++    ++ E +K
Sbjct: 159 RKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKK 194


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 33.2 bits (76), Expect = 0.049
 Identities = 15/55 (27%), Positives = 33/55 (60%)

Query: 96  VLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           VL ++ +    EE+ +      K+++KE++  + KK+ E+K+K ++E+KE +   
Sbjct: 33  VLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEG 87



 Score = 32.8 bits (75), Expect = 0.066
 Identities = 19/58 (32%), Positives = 33/58 (56%), Gaps = 8/58 (13%)

Query: 101 NIKKEEEEKEEEEEEEV--------KEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            + KE      EEE +V        K+E+K  E +K+ +++++K+K ++E K E ETK
Sbjct: 33  VLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETK 90



 Score = 28.9 bits (65), Expect = 1.1
 Identities = 12/55 (21%), Positives = 32/55 (58%)

Query: 90  ETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
           E + A    + + K+++  + +++ E+ K+++KE++E K + E +   K  ++ K
Sbjct: 46  ENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSK 100



 Score = 28.9 bits (65), Expect = 1.3
 Identities = 14/51 (27%), Positives = 33/51 (64%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
              K ++E+K  E +++ ++++K+++E+KE K E E +   +  K+ ++TK
Sbjct: 53  STKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTK 103



 Score = 27.4 bits (61), Expect = 3.9
 Identities = 12/50 (24%), Positives = 30/50 (60%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYL 152
            K+E++  E +++  K+++K++E+++ K E E K   +  +K ++  K  
Sbjct: 57  DKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKP 106


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 32.8 bits (76), Expect = 0.058
 Identities = 12/62 (19%), Positives = 26/62 (41%)

Query: 86  DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
           +     E       + I    +   EE+   V  E +E +++++++EEE++        E
Sbjct: 154 EWYDRLENGERRLRELIDGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADE 213

Query: 146 EE 147
            E
Sbjct: 214 SE 215



 Score = 30.5 bits (70), Expect = 0.39
 Identities = 11/42 (26%), Positives = 21/42 (50%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
            EE+      E  + ++ E+EEE+E + ++     E E  E+
Sbjct: 178 AEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEK 219



 Score = 28.2 bits (64), Expect = 1.9
 Identities = 15/52 (28%), Positives = 26/52 (50%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           ED      E +E +++E+ +EEE E ++     E E  EK+ E+ K   +  
Sbjct: 180 EDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEKVLEKFKALAKQY 231



 Score = 26.7 bits (60), Expect = 6.9
 Identities = 17/72 (23%), Positives = 34/72 (47%), Gaps = 6/72 (8%)

Query: 85  DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEK------KEEEEKEK 138
            D   E + A V  E     ++E++EEEE+E       +E E  EK         ++ +K
Sbjct: 174 VDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEKVLEKFKALAKQYKK 233

Query: 139 MEEEEKEEEETK 150
           + + ++++ E +
Sbjct: 234 LRKAQEKKVEGR 245


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 33.0 bits (75), Expect = 0.060
 Identities = 15/47 (31%), Positives = 24/47 (51%)

Query: 99   EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
            + +   E++EK  E+  E   +E EE  E   K +EE E+ E  E +
Sbjct: 4051 DFSDLAEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQ 4097



 Score = 31.9 bits (72), Expect = 0.15
 Identities = 19/77 (24%), Positives = 37/77 (48%), Gaps = 2/77 (2%)

Query: 86   DQAKETEKAGVLYEDNIKKEEEEKEEE--EEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
            D+  + +    L ED+ K  E+  EE   E EE  E+  + +EE E+ E  E + ++   
Sbjct: 4044 DEDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQAIDNHP 4103

Query: 144  KEEEETKYLVRDVHLES 160
            K + ++ +   +   E+
Sbjct: 4104 KMDAKSTFASAEADEEN 4120



 Score = 31.5 bits (71), Expect = 0.23
 Identities = 13/50 (26%), Positives = 30/50 (60%)

Query: 99   EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
            E+N   +E+ ++++  +  +++EK  E+  E+  +E +E  E+  K +EE
Sbjct: 4038 EENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTEDGVKSDEE 4087



 Score = 28.8 bits (64), Expect = 1.6
 Identities = 18/67 (26%), Positives = 33/67 (49%), Gaps = 7/67 (10%)

Query: 90   ETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEE-------EKEKKEEEEKEKMEEE 142
            E++  GV  +  + + E    EEE +++ EE  +  E       EK   E  E++ +E E
Sbjct: 3861 ESDMNGVTKDSVVSENENSDSEEENQDLDEEVNDIPEDLSNSLNEKLWDEPNEEDLLETE 3920

Query: 143  EKEEEET 149
            +K  E++
Sbjct: 3921 QKSNEQS 3927



 Score = 28.8 bits (64), Expect = 1.7
 Identities = 11/45 (24%), Positives = 23/45 (51%)

Query: 104  KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
             E E+K  E+     E +   +E+  K  E++  + +E+E+E  +
Sbjct: 3917 LETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSD 3961



 Score = 28.4 bits (63), Expect = 1.8
 Identities = 16/68 (23%), Positives = 38/68 (55%), Gaps = 1/68 (1%)

Query: 84   LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEV-KEEEKEEEEEKEKKEEEEKEKMEEE 142
            L+++  +      L E   K  E+     E + V KE++ +  E+K+++E+E++E+M ++
Sbjct: 3903 LNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDD 3962

Query: 143  EKEEEETK 150
               ++E +
Sbjct: 3963 VGIDDEIQ 3970



 Score = 28.0 bits (62), Expect = 2.5
 Identities = 11/64 (17%), Positives = 34/64 (53%), Gaps = 2/64 (3%)

Query: 85   DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
            ++   ETE+     E +    E +   +E++    E+K+ +E+++++E  +   +++E +
Sbjct: 3913 EEDLLETEQKS--NEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQ 3970

Query: 145  EEEE 148
             + +
Sbjct: 3971 PDIQ 3974



 Score = 28.0 bits (62), Expect = 2.7
 Identities = 19/83 (22%), Positives = 41/83 (49%), Gaps = 4/83 (4%)

Query: 84   LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
            L+ + K  E++    E ++  +E++ +  E+++ +E+E EEE   +   ++E +   +E 
Sbjct: 3917 LETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQEN 3976

Query: 144  KE----EEETKYLVRDVHLESCE 162
                  E E   L  D+ L+  E
Sbjct: 3977 NSQPPPENEDLDLPEDLKLDEKE 3999



 Score = 27.7 bits (61), Expect = 4.4
 Identities = 17/71 (23%), Positives = 42/71 (59%), Gaps = 6/71 (8%)

Query: 84   LDDQAKETEKAGVLYEDNIKKEEEEKEE---EEEEEVKEEEKEEEE---EKEKKEEEEKE 137
            LD++  +  K   L + +++  +E KEE   E++E +++E+  EE    +++ ++++  +
Sbjct: 3995 LDEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSD 4054

Query: 138  KMEEEEKEEEE 148
              E++EK  E+
Sbjct: 4055 LAEDDEKMNED 4065



 Score = 27.3 bits (60), Expect = 5.4
 Identities = 38/166 (22%), Positives = 68/166 (40%), Gaps = 29/166 (17%)

Query: 12   EEEEEEEEEEDFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQ 71
            E  + EEE +D ++  +N   E L    N+ ++    EE   E  +  + + + N     
Sbjct: 3877 ENSDSEEENQDLDEE-VNDIPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSAANNESDL 3935

Query: 72   LVEGKPNGLLCVLDDQAKETEKA---GVLYEDNIKKEEEEKEEEE---------EEEVKE 119
            + +   N  L   D Q KE E+     V  +D I+ + +E   +           E++K 
Sbjct: 3936 VSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPEDLKL 3995

Query: 120  EEKEEE----------------EEKEKKEEEEKEKMEEEEKEEEET 149
            +EKE +                E KE+ + E+ E M++E+  EE  
Sbjct: 3996 DEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENN 4041



 Score = 26.9 bits (59), Expect = 6.7
 Identities = 12/46 (26%), Positives = 23/46 (50%)

Query: 105  EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            EE    +E+ ++    +  E++EK  ++  E+   E EE  E+  K
Sbjct: 4038 EENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTEDGVK 4083



 Score = 26.5 bits (58), Expect = 9.2
 Identities = 14/59 (23%), Positives = 34/59 (57%), Gaps = 7/59 (11%)

Query: 99   EDNIKKEEEEKEEEE-----EEEVKEEEKEEEEEK--EKKEEEEKEKMEEEEKEEEETK 150
            ED  ++E+E++EE       ++E++ + +E   +   E ++ +  E ++ +EKE + +K
Sbjct: 3946 EDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPEDLKLDEKEGDVSK 4004



 Score = 26.5 bits (58), Expect = 9.4
 Identities = 11/63 (17%), Positives = 26/63 (41%)

Query: 85   DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
            +D      +   L E   +++   + + +E E +E E +     +     E E  + EE+
Sbjct: 3825 EDITNTLNEDDDLEELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEE 3884

Query: 145  EEE 147
             ++
Sbjct: 3885 NQD 3887


>gnl|CDD|111553 pfam02667, SCFA_trans, Short chain fatty acid transporter.  This
           family consists of two sequences annotated as short
           chain fatty acid transporters, however, there are no
           references giving details of experimental
           characterisation of this function.
          Length = 453

 Score = 32.9 bits (75), Expect = 0.061
 Identities = 11/35 (31%), Positives = 22/35 (62%)

Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEE 141
            E +  + + +K+E++EE++ K KK+    EK+E 
Sbjct: 216 GELKSIDPKLLKKEKEEEKKLKSKKDATIAEKLEN 250


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 32.4 bits (74), Expect = 0.063
 Identities = 13/40 (32%), Positives = 23/40 (57%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
           KK++E+  EEE++   +++K EE E +    EE    +E 
Sbjct: 70  KKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQES 109


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 32.6 bits (75), Expect = 0.064
 Identities = 14/78 (17%), Positives = 37/78 (47%), Gaps = 3/78 (3%)

Query: 78  NGLLCVLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
           +  + + DD   E E   +  E        + EE+++++ + +E++EE  K + +E++ +
Sbjct: 95  SDQVVIFDDGDIEEE---LQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDD 151

Query: 138 KMEEEEKEEEETKYLVRD 155
             ++++    E     R 
Sbjct: 152 DDDDDDIATRERSLERRR 169


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 32.7 bits (75), Expect = 0.066
 Identities = 19/66 (28%), Positives = 39/66 (59%)

Query: 83  VLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
             DD        G   E+  +   ++++EE++++  EEE+E+ +  +++E+EE E  ++E
Sbjct: 307 EPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDE 366

Query: 143 EKEEEE 148
           + EEEE
Sbjct: 367 DDEEEE 372



 Score = 32.7 bits (75), Expect = 0.084
 Identities = 17/53 (32%), Positives = 36/53 (67%)

Query: 98  YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
              + + EE++ ++ EEEE   +  +EEE++E ++ ++++  EEEE+E+E+ K
Sbjct: 328 DGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKK 380



 Score = 31.5 bits (72), Expect = 0.17
 Identities = 16/47 (34%), Positives = 31/47 (65%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
           E+ +  +EEE+E  E+  +E++E+E++EE+EK+K +  E    E  +
Sbjct: 347 EDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSELPF 393



 Score = 31.5 bits (72), Expect = 0.21
 Identities = 17/65 (26%), Positives = 39/65 (60%)

Query: 84  LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
           LDD+ +  +          + EEEE++  ++E+ ++++ + EEE+E  +  ++E+ EE+E
Sbjct: 302 LDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDE 361

Query: 144 KEEEE 148
             ++E
Sbjct: 362 DSDDE 366



 Score = 28.8 bits (65), Expect = 1.4
 Identities = 20/44 (45%), Positives = 31/44 (70%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
           ED    +EEE EE+E+ + +++E+EEEEEKEKK+++  E    E
Sbjct: 347 EDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSE 390



 Score = 27.3 bits (61), Expect = 5.3
 Identities = 15/64 (23%), Positives = 36/64 (56%)

Query: 85  DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
           DD+ +E  K      D+  + +++      +  ++EE+EE+   ++ EE++ + +EEEE+
Sbjct: 288 DDEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEE 347

Query: 145 EEEE 148
           + + 
Sbjct: 348 DVDL 351



 Score = 26.5 bits (59), Expect = 7.4
 Identities = 18/54 (33%), Positives = 32/54 (59%)

Query: 85  DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
           DD   E E+  V   D  + EE+E  ++E++E +EEE++E+++K+  E    E 
Sbjct: 338 DDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSEL 391


>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
           [Intracellular trafficking and secretion].
          Length = 776

 Score = 32.7 bits (74), Expect = 0.066
 Identities = 26/100 (26%), Positives = 48/100 (48%), Gaps = 2/100 (2%)

Query: 61  IEFSDNTLCLQLVEGKPNGLLCVLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEE 120
           + F DN   LQ +      L   L+++    E+A    +D I        E++ +++   
Sbjct: 313 LRFLDNLDFLQFLSKYIGCLWRNLNNEKWGNEEAE--RKDYILDSSSVPLEKQFDDILYF 370

Query: 121 EKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLES 160
           EK E E +  +E E +E++E+ E E + +K +  D  LE+
Sbjct: 371 EKMEIENRNPEESEHEEEVEDYEDENDHSKRICDDDELEN 410



 Score = 27.3 bits (60), Expect = 4.9
 Identities = 17/66 (25%), Positives = 28/66 (42%)

Query: 83  VLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
           +LD  +   EK         K E E +  EE E  +E E  E+E    K   + +++E  
Sbjct: 352 ILDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEEEVEDYEDENDHSKRICDDDELENH 411

Query: 143 EKEEEE 148
            +  +E
Sbjct: 412 FRAADE 417


>gnl|CDD|224340 COG1422, COG1422, Predicted membrane protein [Function unknown].
          Length = 201

 Score = 32.3 bits (74), Expect = 0.067
 Identities = 12/38 (31%), Positives = 26/38 (68%)

Query: 108 EKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
           EK +E ++ +KE +KE  E +E  + ++ +K++E++ E
Sbjct: 72  EKMKELQKMMKEFQKEFREAQESGDMKKLKKLQEKQME 109



 Score = 29.2 bits (66), Expect = 0.68
 Identities = 9/38 (23%), Positives = 27/38 (71%), Gaps = 1/38 (2%)

Query: 112 EEEEEVKEEEKE-EEEEKEKKEEEEKEKMEEEEKEEEE 148
           E+ +E+++  KE ++E +E +E  + +K+++ ++++ E
Sbjct: 72  EKMKELQKMMKEFQKEFREAQESGDMKKLKKLQEKQME 109


>gnl|CDD|162338 TIGR01398, FlhA, flagellar biosynthesis protein FlhA.  This model
           describes flagellar biosynthesis protein FlhA, one of a
           large number of genes associated with the biosynthesis
           of functional bacterial flagella. Homologs of many such
           proteins, including FlhA, function in type III protein
           secretion systems. A separate model describes InvA
           (Salmonella enterica), LcrD (Yersinia enterocolitica),
           HrcV (Xanthomonas), etc., all of which score below the
           noise cutoff for this model [Cellular processes,
           Chemotaxis and motility].
          Length = 678

 Score = 32.6 bits (75), Expect = 0.070
 Identities = 12/43 (27%), Positives = 20/43 (46%), Gaps = 2/43 (4%)

Query: 117 VKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLE 159
            +  ++EEE   E  + +E+   EEEE   +       D+ LE
Sbjct: 310 RRRSKQEEEAAAEAAKAQEEAAEEEEESINDILAL--DDLELE 350



 Score = 30.7 bits (70), Expect = 0.32
 Identities = 9/27 (33%), Positives = 16/27 (59%)

Query: 109 KEEEEEEEVKEEEKEEEEEKEKKEEEE 135
           +  ++EEE   E  + +EE  ++EEE 
Sbjct: 311 RRSKQEEEAAAEAAKAQEEAAEEEEES 337



 Score = 30.7 bits (70), Expect = 0.35
 Identities = 12/39 (30%), Positives = 22/39 (56%)

Query: 109 KEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
           + ++EEE   E  K +EE  E++EE   + +  ++ E E
Sbjct: 312 RSKQEEEAAAEAAKAQEEAAEEEEESINDILALDDLELE 350



 Score = 30.7 bits (70), Expect = 0.37
 Identities = 9/27 (33%), Positives = 15/27 (55%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKE 129
           K+EEE   E  + + +  E+EEE   +
Sbjct: 314 KQEEEAAAEAAKAQEEAAEEEEESIND 340



 Score = 28.8 bits (65), Expect = 1.5
 Identities = 9/32 (28%), Positives = 19/32 (59%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEE 133
           +++  +++EE   E  K +E+  EEE+E   +
Sbjct: 309 LRRRSKQEEEAAAEAAKAQEEAAEEEEESIND 340



 Score = 27.2 bits (61), Expect = 4.8
 Identities = 8/29 (27%), Positives = 15/29 (51%)

Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEE 135
             K+EEE      + +EE  E+E++   +
Sbjct: 312 RSKQEEEAAAEAAKAQEEAAEEEEESIND 340


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 32.8 bits (75), Expect = 0.077
 Identities = 14/44 (31%), Positives = 29/44 (65%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
            N K  E +  +  EE+ K+++K+E++++E+ + EEK ++E  E
Sbjct: 728 WNGKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIEIAE 771



 Score = 30.5 bits (69), Expect = 0.39
 Identities = 13/39 (33%), Positives = 26/39 (66%)

Query: 108 EKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
            K+ E +     EEK ++++K++K++EE+ K EE+ + E
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIE 768



 Score = 29.7 bits (67), Expect = 0.84
 Identities = 13/43 (30%), Positives = 26/43 (60%)

Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
            +  E +  +  EE+ +++++KEKK+EEE ++ E+   E  E 
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIEIAEA 772



 Score = 28.9 bits (65), Expect = 1.5
 Identities = 14/45 (31%), Positives = 24/45 (53%)

Query: 85  DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKE 129
           +   K+ E       +   K++++KE+++EEE K EEK   E  E
Sbjct: 727 NWNGKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIEIAE 771



 Score = 28.1 bits (63), Expect = 2.2
 Identities = 17/60 (28%), Positives = 27/60 (45%), Gaps = 1/60 (1%)

Query: 84   LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEE-EKEKKEEEEKEKMEEE 142
             + + KE    G L  D  KK   E     +E+  EE+  E + +K K +++ K   E E
Sbjct: 1523 RNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNKKQYKSNTEAE 1582



 Score = 27.0 bits (60), Expect = 6.6
 Identities = 12/46 (26%), Positives = 27/46 (58%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
            N   ++ E +  +  E K ++K+++E+K+++E + +EK   E  E
Sbjct: 726 RNWNGKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIEIAE 771


>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component.  This
           family is a region of the Myb-Related Cdc5p/Cef1
           proteins, in fungi, and is part of the pre-mRNA splicing
           factor complex.
          Length = 363

 Score = 32.3 bits (74), Expect = 0.078
 Identities = 16/36 (44%), Positives = 24/36 (66%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKME 140
           EEEE+E EE EE  EE+  + + +++  EE KE+ E
Sbjct: 151 EEEEEEPEEMEEELEEDAADRDARKRAAEEAKEQEE 186



 Score = 30.8 bits (70), Expect = 0.25
 Identities = 15/40 (37%), Positives = 23/40 (57%)

Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
           EEE+EE EE E + EE   + +  K+  EE ++ EE  + 
Sbjct: 151 EEEEEEPEEMEEELEEDAADRDARKRAAEEAKEQEELRRR 190



 Score = 29.6 bits (67), Expect = 0.61
 Identities = 17/45 (37%), Positives = 27/45 (60%)

Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           E E  EEEEEE +E E+E EE+   ++  ++   E +E+EE   +
Sbjct: 146 ELELPEEEEEEPEEMEEELEEDAADRDARKRAAEEAKEQEELRRR 190


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 32.1 bits (73), Expect = 0.079
 Identities = 12/48 (25%), Positives = 23/48 (47%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           + E E  EE+ +   + E   E    + KE+ + EK  ++ K + + K
Sbjct: 57  QPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPK 104



 Score = 31.3 bits (71), Expect = 0.15
 Identities = 10/48 (20%), Positives = 21/48 (43%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
              EE + E E  E + +   E E   +    + ++  + EK+ ++ K
Sbjct: 51  APTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPK 98



 Score = 31.3 bits (71), Expect = 0.18
 Identities = 14/46 (30%), Positives = 22/46 (47%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           E   +E + E E  EE+ +   E E   E    K +E+ K E++ K
Sbjct: 50  EAPTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPK 95



 Score = 29.4 bits (66), Expect = 0.64
 Identities = 9/44 (20%), Positives = 19/44 (43%)

Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
             +   EE + + E  EE+ +   + E   E    + KE+ + +
Sbjct: 48  VLEAPTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPE 91



 Score = 27.8 bits (62), Expect = 2.2
 Identities = 13/69 (18%), Positives = 27/69 (39%), Gaps = 7/69 (10%)

Query: 87  QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEE-----KEKKEEEEKEKMEE 141
           +  + E      E+  K   E +   E    K +EK + E+     K K + + K K + 
Sbjct: 54  EEPQPEPEPP--EEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKV 111

Query: 142 EEKEEEETK 150
           + + + +  
Sbjct: 112 KPQPKPKKP 120


>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 926

 Score = 32.6 bits (74), Expect = 0.082
 Identities = 12/57 (21%), Positives = 25/57 (43%), Gaps = 3/57 (5%)

Query: 87  QAKETEKAGVLYEDNIKKEEEEKEEEEEE---EVKEEEKEEEEEKEKKEEEEKEKME 140
           +AK+  K     +  IK E    E E E    + +EE+  +       ++++  K++
Sbjct: 796 RAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYKVD 852



 Score = 30.7 bits (69), Expect = 0.32
 Identities = 14/66 (21%), Positives = 24/66 (36%), Gaps = 1/66 (1%)

Query: 88  AKETEKAGVLY-EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
            K   K         IK     K++ +E+   ++E + E    + E E     + EEK  
Sbjct: 776 GKSVRKVNTAVLSTTIKAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGI 835

Query: 147 EETKYL 152
           +    L
Sbjct: 836 DAPAIL 841


>gnl|CDD|226096 COG3566, COG3566, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 379

 Score = 32.2 bits (73), Expect = 0.086
 Identities = 6/66 (9%), Positives = 26/66 (39%)

Query: 83  VLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
           +  ++ K +E   +        E+ +  E++    + +    +   +++ + +K ++ + 
Sbjct: 197 LSLEETKASEVEHLSASLKTATEKVDALEKDLHAAQAKLDSGQALTKEELDAKKAELSKA 256

Query: 143 EKEEEE 148
               E 
Sbjct: 257 LAALEA 262


>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
          Length = 259

 Score = 32.0 bits (73), Expect = 0.087
 Identities = 14/53 (26%), Positives = 32/53 (60%)

Query: 108 EKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLES 160
           + E+E+E      E E+ ++KE+ E ++K++MEE +  +++    +++  L S
Sbjct: 73  KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQLSS 125



 Score = 27.0 bits (60), Expect = 4.3
 Identities = 11/34 (32%), Positives = 19/34 (55%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKE 132
           +   +KE      E E+  K+EE E +++KE +E
Sbjct: 73  KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEE 106



 Score = 27.0 bits (60), Expect = 5.2
 Identities = 19/61 (31%), Positives = 34/61 (55%), Gaps = 2/61 (3%)

Query: 78  NGLLCVLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
            G+   L DQ  E EK       +++ E+ +K+EE E + K+E +E +  ++K ++  KE
Sbjct: 63  TGVKDFLSDQKPEDEKELS--ASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKE 120

Query: 138 K 138
           K
Sbjct: 121 K 121


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 32.3 bits (74), Expect = 0.090
 Identities = 20/59 (33%), Positives = 36/59 (61%), Gaps = 2/59 (3%)

Query: 90  ETEKAGVLY--EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
           E E  GV    E+NI+ E + +E ++E +  EE+ E+ E + +K+EEE EK + +  ++
Sbjct: 77  EGEIKGVFTLGEENIEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDK 135


>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
          Length = 584

 Score = 32.3 bits (74), Expect = 0.091
 Identities = 10/40 (25%), Positives = 19/40 (47%)

Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
              +  EEE        E+++   ++EEE++  E+   EE
Sbjct: 260 APPQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEE 299



 Score = 30.8 bits (70), Expect = 0.30
 Identities = 9/39 (23%), Positives = 18/39 (46%)

Query: 110 EEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
             +  EE         E+ +   E+E+E+ + E++  EE
Sbjct: 261 PPQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEE 299



 Score = 30.0 bits (68), Expect = 0.64
 Identities = 8/39 (20%), Positives = 19/39 (48%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
             +  EEE        E +++  ++++E+++ E    EE
Sbjct: 261 PPQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEE 299



 Score = 29.2 bits (66), Expect = 0.92
 Identities = 10/40 (25%), Positives = 18/40 (45%)

Query: 108 EKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
              +  EEE        E++ +  E+EE++   E+   EE
Sbjct: 260 APPQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEE 299



 Score = 29.2 bits (66), Expect = 1.1
 Identities = 10/35 (28%), Positives = 18/35 (51%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKM 139
           EEE        E  ++  E+EEE++  E+   E++
Sbjct: 266 EEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEEI 300


>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
            This model family represents the major capsid protein
           component of the heads (capsids) of bacteriophage HK97,
           phi-105, P27, and related phage. This model represents
           one of several analogous families lacking detectable
           sequence similarity. The gene encoding this component is
           typically located in an operon encoding the small and
           large terminase subunits, the portal protein and the
           prohead or maturation protease [Mobile and
           extrachromosomal element functions, Prophage functions].
          Length = 384

 Score = 32.3 bits (74), Expect = 0.094
 Identities = 16/50 (32%), Positives = 27/50 (54%), Gaps = 4/50 (8%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKE----EEEKEKMEEEEKEEEE 148
            +E   +  +E  ++ E+EK  E E+EK E    +EE +K++ E    EE
Sbjct: 4   LREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEE 53



 Score = 31.6 bits (72), Expect = 0.17
 Identities = 9/49 (18%), Positives = 28/49 (57%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           +++   E  +E  +  ++E+  E EE++ + +  KE++++ + E +  +
Sbjct: 4   LREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLE 52



 Score = 31.6 bits (72), Expect = 0.17
 Identities = 13/55 (23%), Positives = 27/55 (49%), Gaps = 3/55 (5%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEK---KEEEEKEKMEEEEKEEEETK 150
              ++K  E+++  E EE K E    +EE +K   + +  +E ++E E +   + 
Sbjct: 12  AKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASG 66



 Score = 29.6 bits (67), Expect = 0.62
 Identities = 14/43 (32%), Positives = 23/43 (53%)

Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
            +E  E   E  KE  K  E+EK  + EEEK + +  ++E ++
Sbjct: 1   LKELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDK 43



 Score = 29.2 bits (66), Expect = 0.88
 Identities = 14/47 (29%), Positives = 23/47 (48%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           +K+  E   E  +E  K  E E+  E E+++ E     EE +K + E
Sbjct: 1   LKELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAE 47



 Score = 28.9 bits (65), Expect = 1.4
 Identities = 13/37 (35%), Positives = 19/37 (51%)

Query: 114 EEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            +E++E   E  +E  K  E+EK    EEEK E +  
Sbjct: 1   LKELREALAELAKELRKLTEDEKLAEAEEEKAEYDAL 37


>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
          Length = 1437

 Score = 32.5 bits (75), Expect = 0.096
 Identities = 19/56 (33%), Positives = 31/56 (55%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
             +K + E  + +EE E  E +KEEE+EK  KE  E  K  E EK+++   +  ++
Sbjct: 157 GILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKE 212



 Score = 32.1 bits (74), Expect = 0.10
 Identities = 16/52 (30%), Positives = 31/52 (59%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
           D+ K+E E+ E ++EEE ++  KE  E  +K E E+K++ +  + +E   + 
Sbjct: 166 DDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQI 217



 Score = 29.8 bits (68), Expect = 0.70
 Identities = 12/46 (26%), Positives = 28/46 (60%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
           ++ ++K E +KEEE+E+  KE  +  ++ + +K+++ K    +E  
Sbjct: 169 KEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGP 214


>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
           inhibitor of apoptosis (IAP)-associated factor (VIAF)
           subfamily; VIAF is a Phd-like protein that functions in
           caspase activation during apoptosis. It was identified
           as an IAP binding protein through a screen of a human
           B-cell library using a prototype IAP. VIAF lacks a
           consensus IAP binding motif and while it does not
           function as an IAP antagonist, it still plays a
           regulatory role in the complete activation of caspases.
           VIAF itself is a substrate for IAP-mediated
           ubiquitination, suggesting that it may be a target of
           IAPs in the prevention of cell death. The similarity of
           VIAF to Phd points to a potential role distinct from
           apoptosis regulation. Phd functions as a cytosolic
           regulator of G protein by specifically binding to G
           protein betagamma (Gbg)-subunits. The C-terminal domain
           of Phd adopts a thioredoxin fold, but it does not
           contain a CXXC motif. Phd interacts with G protein beta
           mostly through the N-terminal helical domain.
          Length = 192

 Score = 31.9 bits (73), Expect = 0.099
 Identities = 14/45 (31%), Positives = 23/45 (51%)

Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           K    KEEEEE      ++  E   EKK  +E ++  +EE+++  
Sbjct: 16  KPPSPKEEEEEALELAIQEAHENALEKKLLDELDEELDEEEDDRF 60



 Score = 26.8 bits (60), Expect = 5.2
 Identities = 15/48 (31%), Positives = 26/48 (54%), Gaps = 2/48 (4%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            KEEEE+  E   +   E   E++  ++ +EE  E  EE+++  EE +
Sbjct: 20  PKEEEEEALELAIQEAHENALEKKLLDELDEELDE--EEDDRFLEEYR 65


>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
          Length = 1627

 Score = 32.1 bits (73), Expect = 0.11
 Identities = 21/45 (46%), Positives = 29/45 (64%)

Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           KE +EK+EE++EE  EE K EE ++E +EEE+    E  EK   E
Sbjct: 277 KELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYELYEKVNIE 321



 Score = 32.1 bits (73), Expect = 0.11
 Identities = 20/51 (39%), Positives = 34/51 (66%), Gaps = 2/51 (3%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYL 152
           +K+++EEK+EE+ EEVK EE +EE E+E  E+    ++ E+   E   K++
Sbjct: 279 LKEKKEEKDEEKSEEVKTEEVDEEFEEE--EKGFYYELYEKVNIEANKKFI 327



 Score = 32.1 bits (73), Expect = 0.14
 Identities = 15/46 (32%), Positives = 31/46 (67%)

Query: 115 EEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLES 160
           +E+KE+++E++EEK ++ + E+   E EE+E+     L   V++E+
Sbjct: 277 KELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYELYEKVNIEA 322



 Score = 28.7 bits (64), Expect = 1.5
 Identities = 15/35 (42%), Positives = 25/35 (71%)

Query: 117 VKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
           +KE ++++EE+ E+K EE K +  +EE EEEE  +
Sbjct: 276 LKELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGF 310


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 31.6 bits (72), Expect = 0.11
 Identities = 14/51 (27%), Positives = 38/51 (74%), Gaps = 3/51 (5%)

Query: 102 IKKEEEEKE---EEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
           +KKE EEK+    ++++  K+++K+++++ +KK+++ ++K E+E +++ E 
Sbjct: 74  VKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLED 124



 Score = 29.7 bits (67), Expect = 0.43
 Identities = 13/67 (19%), Positives = 37/67 (55%)

Query: 92  EKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
           EK    ++    K++++K+++++++ K+++ E+++EKE +++ E       E     ++ 
Sbjct: 80  EKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSEL 139

Query: 152 LVRDVHL 158
             R   L
Sbjct: 140 KPRKYAL 146



 Score = 29.7 bits (67), Expect = 0.52
 Identities = 11/49 (22%), Positives = 37/49 (75%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
           + E+ +KE EE+++ K ++K+ +++K+K ++++ +K +++ ++++E + 
Sbjct: 70  EIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEA 118



 Score = 28.9 bits (65), Expect = 0.95
 Identities = 12/47 (25%), Positives = 34/47 (72%)

Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            E ++K++E  EE+++ +KE EE+++ K +++K K ++++ ++++  
Sbjct: 58  TEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDD 104



 Score = 28.5 bits (64), Expect = 1.2
 Identities = 14/57 (24%), Positives = 36/57 (63%)

Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLES 160
            EE EK ++E EE ++ + ++++ K+KK++++ +K ++++ + E+      +  LE 
Sbjct: 68  AEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLED 124



 Score = 27.7 bits (62), Expect = 2.1
 Identities = 17/41 (41%), Positives = 27/41 (65%)

Query: 110 EEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            E  E + K++E  EE EK KKE EEK+K + ++K+ ++ K
Sbjct: 55  AEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKK 95



 Score = 27.0 bits (60), Expect = 4.4
 Identities = 13/48 (27%), Positives = 39/48 (81%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           KKE  E+ E+ ++E +E++K + ++K+ K++++K+K ++++K++++++
Sbjct: 64  KKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSE 111



 Score = 27.0 bits (60), Expect = 4.7
 Identities = 11/48 (22%), Positives = 36/48 (75%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           ++ E+ K+E EE++  + +K++ ++K+ K++++K+  ++++ E+++ K
Sbjct: 69  EEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEK 116


>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2.  PPP4R2 (protein phosphatase 4 core
           regulatory subunit R2) is the regulatory subunit of the
           histone H2A phosphatase complex. It has been shown to
           confer resistance to the anticancer drug cisplatin in
           yeast, and may confer resistance in higher eukaryotes.
          Length = 285

 Score = 31.7 bits (72), Expect = 0.11
 Identities = 23/61 (37%), Positives = 40/61 (65%)

Query: 89  KETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           KE+  A    +D  K  + + ++ ++EE  +++ +  EEKE KE+EE+E+ EEEE+EE+E
Sbjct: 224 KESSDAAASQDDGPKGSDVKNKKSDDEEDDDQDGDYVEEKELKEDEEEEETEEEEEEEDE 283

Query: 149 T 149
            
Sbjct: 284 D 284



 Score = 26.7 bits (59), Expect = 6.1
 Identities = 15/56 (26%), Positives = 31/56 (55%)

Query: 85  DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKME 140
                +  K   +       EE++ ++ +  E KE +++EEEE+ ++EEEE+++ E
Sbjct: 230 AASQDDGPKGSDVKNKKSDDEEDDDQDGDYVEEKELKEDEEEEETEEEEEEEDEDE 285


>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
           subunit 1; Provisional.
          Length = 319

 Score = 31.9 bits (73), Expect = 0.11
 Identities = 19/46 (41%), Positives = 30/46 (65%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            +EE  EE  E+ +EEE+E++  + + E+EE E  EEEE ++E  K
Sbjct: 274 GDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319


>gnl|CDD|221429 pfam12118, SprA-related, SprA-related family.  This protein is
           found in bacteria. Proteins in this family are typically
           between 234 to 465 amino acids in length. There is a
           conserved GEV sequence motif.Most members are annotated
           as being SprA-related.
          Length = 261

 Score = 31.7 bits (72), Expect = 0.12
 Identities = 15/67 (22%), Positives = 39/67 (58%)

Query: 86  DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
            +AK   +A        ++ +++  E  +EEV  EE+ E+++ E  ++  +++ E++++E
Sbjct: 50  QKAKLKGQASTAAGSASQELQKQASESNDEEVVGEEEPEKDQAEAGKQLSQKQQEQQQQE 109

Query: 146 EEETKYL 152
           E++ + L
Sbjct: 110 EKQVREL 116


>gnl|CDD|237875 PRK14974, PRK14974, cell division protein FtsY; Provisional.
          Length = 336

 Score = 31.9 bits (73), Expect = 0.12
 Identities = 31/69 (44%), Positives = 42/69 (60%), Gaps = 5/69 (7%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEK-----KEEEEKEKMEEEEKEEEETKYLV 153
           E+ I++EEEE+  E EEE +EE++EE++EK       K  E KEK  E+  EE E + L 
Sbjct: 17  EEKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITEIKEKDIEDLLEELELELLE 76

Query: 154 RDVHLESCE 162
            DV LE  E
Sbjct: 77  SDVALEVAE 85



 Score = 27.2 bits (61), Expect = 4.7
 Identities = 17/31 (54%), Positives = 25/31 (80%)

Query: 118 KEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           K EEK EEEE+E+  E E+E+ EE+E+E++E
Sbjct: 15  KVEEKIEEEEEEEAPEAEEEEEEEDEEEKKE 45



 Score = 26.1 bits (58), Expect = 9.8
 Identities = 17/32 (53%), Positives = 27/32 (84%)

Query: 115 EEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
           E+V+E+ +EEEEE+  + EEE+E+ +EEEK+E
Sbjct: 14  EKVEEKIEEEEEEEAPEAEEEEEEEDEEEKKE 45


>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 186

 Score = 31.3 bits (71), Expect = 0.13
 Identities = 16/63 (25%), Positives = 25/63 (39%)

Query: 86  DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
            +AK+  K   L  +    E   +   E++  +      EE    +EEE +E   EE   
Sbjct: 122 SKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAAEEEEAEEAPAEEAPA 181

Query: 146 EEE 148
           EE 
Sbjct: 182 EES 184



 Score = 30.5 bits (69), Expect = 0.24
 Identities = 15/47 (31%), Positives = 18/47 (38%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
           K ++  K+   E E K  E   E   EKK  E      EE    EE 
Sbjct: 123 KAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAAEEE 169



 Score = 28.2 bits (63), Expect = 1.5
 Identities = 15/47 (31%), Positives = 20/47 (42%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
           KK  ++   E E++V E   E   EK+  E       E    EEEE 
Sbjct: 125 KKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAAEEEEA 171



 Score = 26.7 bits (59), Expect = 5.8
 Identities = 8/52 (15%), Positives = 21/52 (40%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           E  +  ++++  + ++   K   + E++  E + E   EK   E       +
Sbjct: 111 EGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEE 162


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 31.7 bits (72), Expect = 0.13
 Identities = 10/57 (17%), Positives = 27/57 (47%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLE 159
              +   +++  E+ K   K +  E+ KK   +++K E+++ ++++    V    L 
Sbjct: 32  IYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKKVPLQVNPAQLF 88



 Score = 31.3 bits (71), Expect = 0.15
 Identities = 14/74 (18%), Positives = 32/74 (43%), Gaps = 5/74 (6%)

Query: 74  EGKPNGLLCVLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEE 133
           + K N      D    E        +  I+K +   + +  E VK+   +E++E++KK +
Sbjct: 20  KYKANK-----DKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPK 74

Query: 134 EEKEKMEEEEKEEE 147
           ++K  ++    +  
Sbjct: 75  KKKVPLQVNPAQLF 88



 Score = 31.3 bits (71), Expect = 0.15
 Identities = 15/66 (22%), Positives = 31/66 (46%), Gaps = 3/66 (4%)

Query: 97  LYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKK---EEEEKEKMEEEEKEEEETKYLV 153
           L  +  K  +++   E   ++  ++K  E+ K        E  +K+  +EK+EE+ K   
Sbjct: 16  LESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKK 75

Query: 154 RDVHLE 159
           + V L+
Sbjct: 76  KKVPLQ 81


>gnl|CDD|234352 TIGR03779, Bac_Flav_CT_M, Bacteroides conjugative transposon TraM
           protein.  Members of this protein family are designated
           TraM and are found in a proposed transfer region of a
           class of conjugative transposon found in the Bacteroides
           lineage [Cellular processes, DNA transformation].
          Length = 410

 Score = 31.6 bits (72), Expect = 0.13
 Identities = 14/49 (28%), Positives = 23/49 (46%)

Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVR 154
           + ++E+E   EV+E E     E     E E++    E+  E   KY+ R
Sbjct: 141 KTDEEKELLREVEELESRLATEPSPAPELEEQLALMEKSYELAAKYMPR 189



 Score = 29.3 bits (66), Expect = 1.0
 Identities = 14/55 (25%), Positives = 22/55 (40%)

Query: 96  VLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
             Y D  ++     E  + +E KE  +E EE + +   E     E EE+     K
Sbjct: 124 AAYRDINRELGSFYEYPKTDEEKELLREVEELESRLATEPSPAPELEEQLALMEK 178


>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1).  This family
           consists of several rhoptry-associated protein 1 (RAP-1)
           sequences which appear to be specific to Plasmodium
           falciparum.
          Length = 790

 Score = 32.0 bits (72), Expect = 0.14
 Identities = 19/45 (42%), Positives = 30/45 (66%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
           D + +EEE + +EE EE +E++KE  E+   + +EEK K +EE K
Sbjct: 238 DKVAEEEEFELDEEHEEAEEDKKEALEKIGAEGDEEKFKFDEEIK 282



 Score = 29.7 bits (66), Expect = 0.71
 Identities = 25/60 (41%), Positives = 36/60 (60%), Gaps = 2/60 (3%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEE--EEEKEKKEEEEKEKMEEEEKEEEETKYLVRDV 156
           E +I K + +K  EEEE   +EE EE  E++KE  E+   E  EE+ K +EE K++  DV
Sbjct: 229 EIDILKNDGDKVAEEEEFELDEEHEEAEEDKKEALEKIGAEGDEEKFKFDEEIKFIEHDV 288


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 31.7 bits (73), Expect = 0.16
 Identities = 24/62 (38%), Positives = 43/62 (69%), Gaps = 2/62 (3%)

Query: 89  KETEKAGVLYEDNIKKEEE--EKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
           ++ EK  +  E+N+ ++ E  EK EEE E+ ++E +++++E EKKEEE +E +EE+ +E 
Sbjct: 85  QKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQEL 144

Query: 147 EE 148
           E 
Sbjct: 145 ER 146



 Score = 30.1 bits (69), Expect = 0.51
 Identities = 21/66 (31%), Positives = 34/66 (51%), Gaps = 3/66 (4%)

Query: 87  QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKE--EEEEKEKKEEEEK-EKMEEEE 143
           + KE E+      +  KKE E  ++E   E KEE  +   E EKE +E   + +K+E+  
Sbjct: 32  KIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRL 91

Query: 144 KEEEET 149
            ++EE 
Sbjct: 92  LQKEEN 97



 Score = 28.6 bits (65), Expect = 1.5
 Identities = 22/69 (31%), Positives = 33/69 (47%), Gaps = 3/69 (4%)

Query: 83  VLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKE---EEEEKEKKEEEEKEKM 139
           +L++  KE E          K+E  +   E E+E++E   E    E+   +KEE    K+
Sbjct: 43  ILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKL 102

Query: 140 EEEEKEEEE 148
           E  EK EEE
Sbjct: 103 ELLEKREEE 111



 Score = 27.8 bits (63), Expect = 2.7
 Identities = 17/61 (27%), Positives = 32/61 (52%), Gaps = 1/61 (1%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLESC 161
            + + +E EEE +  ++E +KE E  K++   E KE++  + + E E +   R   L+  
Sbjct: 29  AEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEI-HKLRNEFEKELRERRNELQKL 87

Query: 162 E 162
           E
Sbjct: 88  E 88


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
           Genome duplication is precisely regulated by
           cyclin-dependent kinases CDKs, which bring about the
           onset of S phase by activating replication origins and
           then prevent relicensing of origins until mitosis is
           completed. The optimum sequence motif for CDK
           phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
           to have at least 11 potential phosphorylation sites.
           Drc1 is required for DNA synthesis and S-M replication
           checkpoint control. Drc1 associates with Cdc2 and is
           phosphorylated at the onset of S phase when Cdc2 is
           activated. Thus Cdc2 promotes DNA replication by
           phosphorylating Drc1 and regulating its association with
           Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
           substrates required for DNA replication.
          Length = 397

 Score = 31.7 bits (72), Expect = 0.16
 Identities = 11/46 (23%), Positives = 24/46 (52%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
             EE  + +E+ + E         E  E+E+ E+ +EE +++++ K
Sbjct: 316 IHEEIPKLDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVK 361



 Score = 27.5 bits (61), Expect = 4.0
 Identities = 10/30 (33%), Positives = 16/30 (53%)

Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEE 135
           EEE  ++E + ++E E EE      +EE  
Sbjct: 242 EEESIDDELDVLREIEAEEAGIGPIEEEVV 271


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 31.5 bits (72), Expect = 0.17
 Identities = 17/51 (33%), Positives = 32/51 (62%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           ++ K+ E    E+EEE+ +  +K +E+  E+  EE++E+  EEE+ E E +
Sbjct: 122 ESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKE 172



 Score = 30.0 bits (68), Expect = 0.45
 Identities = 19/52 (36%), Positives = 34/52 (65%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           E + + E  + E+EEE++   ++ +E+ ++E  EE+E+E  EEEE E E+ K
Sbjct: 122 ESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEK 173



 Score = 29.6 bits (67), Expect = 0.71
 Identities = 21/65 (32%), Positives = 38/65 (58%)

Query: 85  DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
           +D   E E   V  +  I+  + E EEE++E  K+ +++ +EE  +++EEE  + EE E 
Sbjct: 110 EDSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEA 169

Query: 145 EEEET 149
           E+E+ 
Sbjct: 170 EKEKA 174



 Score = 29.2 bits (66), Expect = 0.95
 Identities = 15/46 (32%), Positives = 26/46 (56%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           E  + E ++E E  + E EEE+++  K+ +E    E  E++EEE  
Sbjct: 117 EWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAA 162



 Score = 28.5 bits (64), Expect = 1.6
 Identities = 17/42 (40%), Positives = 25/42 (59%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEE 141
           +  K E  +K +E+ +E   EE EEE  +E++ E EKEK  E
Sbjct: 135 EEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASE 176



 Score = 28.1 bits (63), Expect = 1.9
 Identities = 18/52 (34%), Positives = 30/52 (57%)

Query: 98  YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
            E+  KK+E E+  E +++  EEE+ E EE E  ++E +    E +KE E +
Sbjct: 79  KEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEGEWIDVESDKEIESS 130



 Score = 27.7 bits (62), Expect = 2.5
 Identities = 14/46 (30%), Positives = 29/46 (63%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            + E ++E E    E+E+E++E  +K +E+  E++ EE++EE   +
Sbjct: 119 IDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEE 164



 Score = 27.7 bits (62), Expect = 2.9
 Identities = 15/52 (28%), Positives = 30/52 (57%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            D+  +  + + ++E E    E++EE++E  KK +E+ ++   EE EEE  +
Sbjct: 112 SDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAE 163


>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
          Length = 455

 Score = 31.4 bits (71), Expect = 0.17
 Identities = 25/78 (32%), Positives = 37/78 (47%), Gaps = 13/78 (16%)

Query: 92  EKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEE-------------EEKEKKEEEEKEK 138
           EK     E  I KE  + EEE E+EV++  KEEE             E ++ +E   KE 
Sbjct: 367 EKDVEEGEKTIVKEARQIEEELEKEVEKLGKEEESLFKRVALEEGLKELEQDEENFLKEL 426

Query: 139 MEEEEKEEEETKYLVRDV 156
            +EE++  E+ K    +V
Sbjct: 427 SKEEKELLEKLKMEASEV 444



 Score = 28.3 bits (63), Expect = 2.1
 Identities = 15/48 (31%), Positives = 28/48 (58%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
           + ++K+ EE E+   +E ++ E+E E+E EK  +EE+   +    EE 
Sbjct: 364 ERLEKDVEEGEKTIVKEARQIEEELEKEVEKLGKEEESLFKRVALEEG 411



 Score = 27.1 bits (60), Expect = 4.9
 Identities = 18/48 (37%), Positives = 28/48 (58%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
           DN    E    E  E++V+E EK   +E  + EEE ++++E+  KEEE
Sbjct: 353 DNTCGPEPPLLERLEKDVEEGEKTIVKEARQIEEELEKEVEKLGKEEE 400


>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
           (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
           phosphoribosyltransferase) [Coenzyme metabolism].
          Length = 620

 Score = 31.4 bits (71), Expect = 0.18
 Identities = 19/64 (29%), Positives = 27/64 (42%)

Query: 85  DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
           DD  +E        +D     E+ +   EE E  +E +E+E E    E EE E    E  
Sbjct: 224 DDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEAS 283

Query: 145 EEEE 148
           E+ E
Sbjct: 284 EDSE 287



 Score = 29.8 bits (67), Expect = 0.59
 Identities = 18/70 (25%), Positives = 35/70 (50%), Gaps = 4/70 (5%)

Query: 83  VLDDQAKETEKAGVLYEDNIKKEEEEKEEEE----EEEVKEEEKEEEEEKEKKEEEEKEK 138
           V+ D     + A    +D I+++ +E++ ++      E  E  +EE E  ++ EE+E E 
Sbjct: 208 VVRDMLGSMDMAEETGDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEA 267

Query: 139 MEEEEKEEEE 148
            + E +E E 
Sbjct: 268 TDGEGEEGEM 277



 Score = 29.0 bits (65), Expect = 1.3
 Identities = 18/65 (27%), Positives = 30/65 (46%), Gaps = 5/65 (7%)

Query: 85  DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
           DDQ    E       +  ++E E  +E EE+E +  + E EE +    E  ++   +E  
Sbjct: 238 DDQPDNNED-----SEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSESDESD 292

Query: 145 EEEET 149
           E+ ET
Sbjct: 293 EDTET 297



 Score = 28.7 bits (64), Expect = 1.4
 Identities = 15/51 (29%), Positives = 23/51 (45%), Gaps = 5/51 (9%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKK-----EEEEKEKMEEEEKEEEETK 150
           EE E  +E EE+  E    E EE E       E+ E ++ +E+ +   E  
Sbjct: 252 EESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSESDESDEDTETPGEDA 302


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 30.8 bits (70), Expect = 0.20
 Identities = 17/57 (29%), Positives = 27/57 (47%), Gaps = 5/57 (8%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEE-----EKEKKEEEEKEKMEEEEKEEEETK 150
            D+     +    +++ +  + E E        EK KKE  E+++ EEEEK  EE K
Sbjct: 119 ADDSDSSSDSDSSDDDSDDDDSEDETAALLRELEKIKKERAEEKEREEEEKAAEEEK 175



 Score = 29.7 bits (67), Expect = 0.62
 Identities = 17/51 (33%), Positives = 32/51 (62%), Gaps = 2/51 (3%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEE--KEEEEEKEKKEEEEKEKMEEEEKEEE 147
             +   ++++ E+E    ++E E  K+E  E++++EEEEK   EE+ +EEE
Sbjct: 130 SSDDDSDDDDSEDETAALLRELEKIKKERAEEKEREEEEKAAEEEKAREEE 180



 Score = 28.9 bits (65), Expect = 0.96
 Identities = 21/45 (46%), Positives = 28/45 (62%), Gaps = 1/45 (2%)

Query: 85  DDQAKETEKAGVLYE-DNIKKEEEEKEEEEEEEVKEEEKEEEEEK 128
           DD   E E A +L E + IKKE  E++E EEEE   EE++  EE+
Sbjct: 136 DDDDSEDETAALLRELEKIKKERAEEKEREEEEKAAEEEKAREEE 180


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 30.4 bits (69), Expect = 0.22
 Identities = 20/51 (39%), Positives = 35/51 (68%)

Query: 101 NIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
           +I+++ EEKE E+EE+ ++EEKE+E  K   ++EE+EK  EE ++ +    
Sbjct: 13  DIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPL 63



 Score = 26.1 bits (58), Expect = 7.3
 Identities = 19/46 (41%), Positives = 30/46 (65%)

Query: 115 EEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLES 160
           EE +EE++ E+EEKE+KEE+EKE  +   ++EE  K L      ++
Sbjct: 15  EEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKN 60


>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
           protein TraG; Provisional.
          Length = 942

 Score = 31.3 bits (71), Expect = 0.22
 Identities = 14/66 (21%), Positives = 23/66 (34%), Gaps = 8/66 (12%)

Query: 98  YEDNIKKEEEEKEEEEEE------EVKEEEKEEEEEKEKK--EEEEKEKMEEEEKEEEET 149
           YE NI   +     EE        E++   K E   +  K  EE+  ++        EE 
Sbjct: 870 YEGNIGDTQNSIRGEENTVKGQYSELQNHHKTEALSQNNKYNEEKSAQERMPGADSPEEL 929

Query: 150 KYLVRD 155
               ++
Sbjct: 930 MKRAKE 935



 Score = 30.9 bits (70), Expect = 0.35
 Identities = 15/65 (23%), Positives = 28/65 (43%), Gaps = 2/65 (3%)

Query: 85  DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEE--KEKMEEE 142
           D+   E E      +++I+ EE   + +  E     + E   +  K  EE+  +E+M   
Sbjct: 864 DNMVTEYEGNIGDTQNSIRGEENTVKGQYSELQNHHKTEALSQNNKYNEEKSAQERMPGA 923

Query: 143 EKEEE 147
           +  EE
Sbjct: 924 DSPEE 928



 Score = 28.6 bits (64), Expect = 1.9
 Identities = 10/40 (25%), Positives = 19/40 (47%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
           E   +  +  +E+  +E +   +  EE  K  KE ++K K
Sbjct: 902 EALSQNNKYNEEKSAQERMPGADSPEELMKRAKEYQDKHK 941


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 30.8 bits (69), Expect = 0.23
 Identities = 19/53 (35%), Positives = 36/53 (67%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
           ++E EE EE  E E +EE+ +E E++E++E E +E+  EE+ ++ E + + +D
Sbjct: 132 RQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQD 184



 Score = 28.5 bits (63), Expect = 1.4
 Identities = 16/45 (35%), Positives = 32/45 (71%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
           ++ EE+ +E E++E +E E+EE+  +E+ ++ E E +E++E E E
Sbjct: 145 ERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQDESETE 189



 Score = 27.7 bits (61), Expect = 2.5
 Identities = 19/64 (29%), Positives = 41/64 (64%)

Query: 87  QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
           + ++ EKA    E+  ++ E E+ EE+ +E +++E++E E +E+  EE+ +  E E  E+
Sbjct: 124 RQEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQ 183

Query: 147 EETK 150
           +E++
Sbjct: 184 DESE 187



 Score = 27.7 bits (61), Expect = 2.8
 Identities = 18/64 (28%), Positives = 39/64 (60%)

Query: 86  DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
           DQ +  +K  +  + ++   +E+ E+  +E  + EE+ E E +E+K +E +++ E+E + 
Sbjct: 105 DQLRSVKKDDIKKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQERER 164

Query: 146 EEET 149
           EE+T
Sbjct: 165 EEQT 168



 Score = 26.6 bits (58), Expect = 6.8
 Identities = 15/45 (33%), Positives = 30/45 (66%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
           ++E++E++E E EE   EE+ ++ E E  E++E E   +++K E+
Sbjct: 153 EREDQEEQEREREEQTIEEQSDDSEHEIIEQDESETESDDDKTEK 197


>gnl|CDD|220605 pfam10156, Med17, Subunit 17 of Mediator complex.  This Mediator
           complex subunit was formerly known as Srb4 in yeasts or
           Trap80 in Drosophila and human. The Med17 subunit is
           located within the head domain and is essential for cell
           viability to the extent that a mutant strain of
           cerevisiae lacking it shows all RNA polymerase
           II-dependent transcription ceasing at non-permissive
           temperatures.
          Length = 454

 Score = 31.2 bits (71), Expect = 0.24
 Identities = 12/42 (28%), Positives = 25/42 (59%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKME 140
           ++  K +  E+ +EEE++ +++  + EE K+  EE  K + E
Sbjct: 60  KEAAKIDFSEESDEEEDDEEDDNDDSEENKDTVEEFPKARDE 101



 Score = 30.0 bits (68), Expect = 0.46
 Identities = 16/44 (36%), Positives = 25/44 (56%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           EE  +EE  +E  K +  EE +E+E  EE++ +  EE +   EE
Sbjct: 51  EESLREEIAKEAAKIDFSEESDEEEDDEEDDNDDSEENKDTVEE 94



 Score = 28.1 bits (63), Expect = 2.6
 Identities = 12/50 (24%), Positives = 24/50 (48%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
                +  EE   EE+ +E  + +  +E  EEE+ E+ + ++ EE +   
Sbjct: 43  RGSFRDLTEESLREEIAKEAAKIDFSEESDEEEDDEEDDNDDSEENKDTV 92


>gnl|CDD|216831 pfam01988, VIT1, VIT family.  This family includes the vacuolar
           Fe2+/Mn2+ uptake transporter, Ccc1 and the vacuolar iron
           transporter VIT1.
          Length = 209

 Score = 30.7 bits (70), Expect = 0.25
 Identities = 13/39 (33%), Positives = 21/39 (53%)

Query: 121 EKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLE 159
           E++ EE + ++E+ E E   E E+EE    Y  R +  E
Sbjct: 59  ERDLEEAELEREKRELENDPEGEREELAEIYRARGLSEE 97


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 30.8 bits (70), Expect = 0.26
 Identities = 23/78 (29%), Positives = 41/78 (52%), Gaps = 11/78 (14%)

Query: 84  LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEK------- 136
           L  Q K  +K  +L+E     + + K + E +  K + + +E++KEKK+EE+K       
Sbjct: 289 LKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERL 348

Query: 137 ----EKMEEEEKEEEETK 150
               EK+E +  ++EE K
Sbjct: 349 EERIEKLEVQATDKEENK 366


>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
           binding fragment.  Topoisomerase I promotes the
           relaxation of DNA superhelical tension by introducing a
           transient single-stranded break in duplex DNA and are
           vital for the processes of replication, transcription,
           and recombination. This family may be more than one
           structural domain.
          Length = 215

 Score = 30.6 bits (70), Expect = 0.26
 Identities = 15/39 (38%), Positives = 23/39 (58%)

Query: 121 EKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLE 159
           E E+E++K   +EE+K   EE++K EE   Y + D   E
Sbjct: 96  EAEKEKKKAMSKEEKKAIKEEKDKLEEPYGYCLVDGRKE 134



 Score = 27.2 bits (61), Expect = 3.8
 Identities = 13/33 (39%), Positives = 22/33 (66%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEE 134
           I +  E ++E+++   KEE+K  +EEK+K EE 
Sbjct: 91  IYEYFEAEKEKKKAMSKEEKKAIKEEKDKLEEP 123


>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
           Provisional.
          Length = 695

 Score = 31.1 bits (71), Expect = 0.28
 Identities = 13/46 (28%), Positives = 25/46 (54%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
           I+  E+EK++ EE + + E ++   E+EK   E + K   E +  +
Sbjct: 438 IRAIEQEKKKAEEAKARFEARQARLEREKAAREARHKKAAEARAAK 483



 Score = 26.4 bits (59), Expect = 8.5
 Identities = 13/38 (34%), Positives = 19/38 (50%), Gaps = 5/38 (13%)

Query: 118 KEEEKEEEEEKEKKEE-----EEKEKMEEEEKEEEETK 150
           K E +  E+EK+K EE     E ++   E EK   E +
Sbjct: 435 KAEIRAIEQEKKKAEEAKARFEARQARLEREKAAREAR 472


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 30.7 bits (70), Expect = 0.28
 Identities = 11/41 (26%), Positives = 23/41 (56%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
           +K E++ E+EE E+   ++K E   K+ K  + + K  + +
Sbjct: 419 RKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPD 459



 Score = 30.7 bits (70), Expect = 0.32
 Identities = 11/37 (29%), Positives = 19/37 (51%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKM 139
           K+ + EK+ E+EE  K   K++ E   KK +    + 
Sbjct: 417 KQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGET 453



 Score = 29.1 bits (66), Expect = 1.0
 Identities = 17/51 (33%), Positives = 28/51 (54%), Gaps = 1/51 (1%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           D     E E+EE E   +   E+++  +K++K E++ EK EE EK   + K
Sbjct: 389 DKPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEK-EEAEKAAAKKK 438



 Score = 28.0 bits (63), Expect = 2.5
 Identities = 15/60 (25%), Positives = 28/60 (46%), Gaps = 6/60 (10%)

Query: 97  LYEDNIKKEEEEKEEEEEEEVKEEEKE------EEEEKEKKEEEEKEKMEEEEKEEEETK 150
           L++  +  E EE+E E       E K+      + E+K +KEE EK   +++ +   +  
Sbjct: 387 LHDKPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKA 446



 Score = 27.2 bits (61), Expect = 4.1
 Identities = 10/51 (19%), Positives = 29/51 (56%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            N+   E +K  +++ + +++ ++EE EK   +++ +   ++ +  + ETK
Sbjct: 404 GNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETK 454


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
          Length = 191

 Score = 30.5 bits (69), Expect = 0.28
 Identities = 13/45 (28%), Positives = 22/45 (48%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
           KK+++EKEE  E   +E    EE E+  K+ + +    +    E 
Sbjct: 68  KKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQNTAAEV 112



 Score = 29.4 bits (66), Expect = 0.60
 Identities = 14/44 (31%), Positives = 24/44 (54%)

Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            +K+++E+EEV E   EE  + E+ EE +K+   E  K +    
Sbjct: 67  TKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQNTAA 110



 Score = 27.1 bits (60), Expect = 4.2
 Identities = 13/48 (27%), Positives = 26/48 (54%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           K    +K+  ++ + K++EKEE  E   +E  + E+ EE +K+ +   
Sbjct: 55  KTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYEL 102



 Score = 25.9 bits (57), Expect = 8.4
 Identities = 17/57 (29%), Positives = 25/57 (43%)

Query: 93  KAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
           K         KKE+EE  E   EE+ + E+ EE +K+   E  K +    E   E+ 
Sbjct: 61  KKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQNTAAEVNHEDV 117


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 31.0 bits (70), Expect = 0.29
 Identities = 16/37 (43%), Positives = 23/37 (62%)

Query: 112 EEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           E E E K   +EE E K+KKEE+ KEK  ++ K  ++
Sbjct: 5   ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQK 41



 Score = 29.9 bits (67), Expect = 0.64
 Identities = 14/44 (31%), Positives = 28/44 (63%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
            E EK+   EEE++ ++K+EE+ KEK+ ++ K   +E + + + 
Sbjct: 6   SEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQA 49



 Score = 29.5 bits (66), Expect = 0.97
 Identities = 14/42 (33%), Positives = 27/42 (64%)

Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
           E E E++   E + E K+++EEK K++E +K K  ++E + +
Sbjct: 5   ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAK 46



 Score = 27.2 bits (60), Expect = 5.4
 Identities = 14/45 (31%), Positives = 29/45 (64%), Gaps = 1/45 (2%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEK-KEEEEKEKMEEEEKEE 146
           KK   E+E E +++ +E+ KE+E +K K  ++E K K++ ++  +
Sbjct: 10  KKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASD 54


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 30.4 bits (69), Expect = 0.29
 Identities = 20/57 (35%), Positives = 33/57 (57%), Gaps = 2/57 (3%)

Query: 105 EEEEKE--EEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLE 159
           +++ K+  +EEE+E KEEEK +   K K ++  K K+EE+EK + E +        E
Sbjct: 30  DDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEE 86



 Score = 28.5 bits (64), Expect = 1.6
 Identities = 19/64 (29%), Positives = 36/64 (56%)

Query: 85  DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
           +D+ KE EKA V  +   KK  + K EE+E+  +E+E++   E E+   E++   +   +
Sbjct: 41  EDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRLR 100

Query: 145 EEEE 148
           + +E
Sbjct: 101 KLQE 104


>gnl|CDD|184724 PRK14520, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 155

 Score = 30.0 bits (68), Expect = 0.29
 Identities = 8/61 (13%), Positives = 19/61 (31%), Gaps = 2/61 (3%)

Query: 87  QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
           + K ++    L+   + + +     E     K++   E    E      +       +EE
Sbjct: 97  EPKPSKLE--LFNAALAEADGGPTAEATTPKKKKAAAEAAAAEAAAPAAEAAAAAAAEEE 154

Query: 147 E 147
            
Sbjct: 155 A 155



 Score = 26.6 bits (59), Expect = 5.5
 Identities = 8/39 (20%), Positives = 9/39 (23%)

Query: 110 EEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
                E       E    K+KK   E    E      E 
Sbjct: 107 NAALAEADGGPTAEATTPKKKKAAAEAAAAEAAAPAAEA 145


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 30.8 bits (70), Expect = 0.30
 Identities = 16/34 (47%), Positives = 21/34 (61%)

Query: 96  VLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKE 129
                  K EEEE+E+EEEEE +EE +E E E+ 
Sbjct: 340 TWVNPEQKDEEEEQEDEEEEEEEEEPEEPEPEEG 373



 Score = 29.6 bits (67), Expect = 0.67
 Identities = 15/31 (48%), Positives = 23/31 (74%)

Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEE 134
             E++ EEEE+E+ +EEE+EEE E+ + EE 
Sbjct: 343 NPEQKDEEEEQEDEEEEEEEEEPEEPEPEEG 373



 Score = 29.6 bits (67), Expect = 0.77
 Identities = 13/31 (41%), Positives = 24/31 (77%)

Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
             ++++EEEE ++EE+EEEEE+ ++ E E+ 
Sbjct: 343 NPEQKDEEEEQEDEEEEEEEEEPEEPEPEEG 373



 Score = 29.3 bits (66), Expect = 0.88
 Identities = 17/33 (51%), Positives = 24/33 (72%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
             E+K+EEEE+E +EEE+EEEE +E + EE   
Sbjct: 343 NPEQKDEEEEQEDEEEEEEEEEPEEPEPEEGPP 375



 Score = 28.9 bits (65), Expect = 1.4
 Identities = 15/30 (50%), Positives = 25/30 (83%)

Query: 101 NIKKEEEEKEEEEEEEVKEEEKEEEEEKEK 130
           N ++++EE+E+E+EEE +EEE+ EE E E+
Sbjct: 343 NPEQKDEEEEQEDEEEEEEEEEPEEPEPEE 372



 Score = 28.5 bits (64), Expect = 1.7
 Identities = 17/32 (53%), Positives = 24/32 (75%)

Query: 117 VKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           V  E+K+EEEE+E +EEEE+E+  EE + EE 
Sbjct: 342 VNPEQKDEEEEQEDEEEEEEEEEPEEPEPEEG 373



 Score = 27.7 bits (62), Expect = 2.7
 Identities = 15/42 (35%), Positives = 27/42 (64%)

Query: 101 NIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
           +I  +        E++ +EEE+E+EEE+E++EE E+ + EE 
Sbjct: 332 HILPQGRCTWVNPEQKDEEEEQEDEEEEEEEEEPEEPEPEEG 373



 Score = 27.3 bits (61), Expect = 4.1
 Identities = 19/35 (54%), Positives = 26/35 (74%)

Query: 114 EEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           E E++E E+EEEEE+  + +EE E  EEEE+E EE
Sbjct: 164 EVELREGEEEEEEEEVGEADEEDEGEEEEEEEPEE 198



 Score = 26.9 bits (60), Expect = 5.8
 Identities = 21/43 (48%), Positives = 28/43 (65%), Gaps = 1/43 (2%)

Query: 108 EKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE-EKEEEET 149
           E E  E EE +EEE+  E ++E + EEE+E+  EE  KEEE T
Sbjct: 164 EVELREGEEEEEEEEVGEADEEDEGEEEEEEEPEEVPKEEEGT 206



 Score = 26.6 bits (59), Expect = 7.1
 Identities = 12/33 (36%), Positives = 23/33 (69%)

Query: 117 VKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
                ++++EE+E+++EEE+E+ EE E+ E E 
Sbjct: 340 TWVNPEQKDEEEEQEDEEEEEEEEEPEEPEPEE 372



 Score = 26.2 bits (58), Expect = 9.7
 Identities = 20/42 (47%), Positives = 26/42 (61%)

Query: 95  GVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEK 136
             L E   ++EEEE  E +EE+  EEE+EEE E+  KEEE  
Sbjct: 165 VELREGEEEEEEEEVGEADEEDEGEEEEEEEPEEVPKEEEGT 206


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 30.7 bits (69), Expect = 0.30
 Identities = 15/46 (32%), Positives = 24/46 (52%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
             + +E E++   +   K E E+ E  EE E+EK EEE    ++ K
Sbjct: 294 GNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGK 339



 Score = 29.5 bits (66), Expect = 0.83
 Identities = 16/57 (28%), Positives = 29/57 (50%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
           E+   K   E   + E E  E+ +E EEEK ++E    +K ++ +K + +   L +D
Sbjct: 298 EEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKD 354



 Score = 28.4 bits (63), Expect = 2.1
 Identities = 11/48 (22%), Positives = 30/48 (62%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
           I+++E+ +E EEE+  +E    ++ +K KK + +K  +++++ +  + 
Sbjct: 314 IEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDD 361



 Score = 28.0 bits (62), Expect = 2.6
 Identities = 17/64 (26%), Positives = 32/64 (50%), Gaps = 2/64 (3%)

Query: 85  DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
                  E+   L  +   K E E++E+ EE   EEEK EEE    K+ ++ +K++ ++ 
Sbjct: 292 ASGNDPEEREDKLSPEIPAKPEIEQDEDSEE--SEEEKNEEEGGLSKKGKKLKKLKGKKN 349

Query: 145 EEEE 148
             ++
Sbjct: 350 GLDK 353



 Score = 26.5 bits (58), Expect = 8.8
 Identities = 11/45 (24%), Positives = 25/45 (55%)

Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
             EE +EE+ EEE    +K ++ +K K ++   +K + +  ++ +
Sbjct: 319 DSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSD 363


>gnl|CDD|234655 PRK00133, metG, methionyl-tRNA synthetase; Reviewed.
          Length = 673

 Score = 30.9 bits (71), Expect = 0.30
 Identities = 10/61 (16%), Positives = 21/61 (34%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLESCE 162
            + E+++ E   E  KE    +           +E + E    ++  K  +R   +   E
Sbjct: 528 TRIEDKQIEALIEASKEAAAAKAAAAAAAAPLAEEPIAETISFDDFAKVDLRVAKIVEAE 587

Query: 163 F 163
            
Sbjct: 588 K 588


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 30.8 bits (70), Expect = 0.31
 Identities = 17/53 (32%), Positives = 30/53 (56%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVR 154
           + K +EE+   E+E  +EEE+ E    EK+ ++ +E +E  E+E  E K  + 
Sbjct: 394 LSKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELE 446



 Score = 28.1 bits (63), Expect = 2.5
 Identities = 13/37 (35%), Positives = 19/37 (51%)

Query: 115 EEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
            E   + KEEE  +EK+  EE+E+ E    E+   K 
Sbjct: 391 AEALSKVKEEERPREKEGTEEEERREITVYEKRIKKL 427



 Score = 27.7 bits (62), Expect = 3.4
 Identities = 23/76 (30%), Positives = 38/76 (50%)

Query: 87  QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
           + +  EK G   E+  +    EK  ++ EE  E  +EE  E +++ EE K ++E+ E E 
Sbjct: 400 EERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEKLESEL 459

Query: 147 EETKYLVRDVHLESCE 162
           E  +  VRD   +  E
Sbjct: 460 ERFRREVRDKVRKDRE 475



 Score = 27.0 bits (60), Expect = 6.5
 Identities = 18/51 (35%), Positives = 26/51 (50%)

Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDV 156
            + KEEE   E +  E+EE  E    E+  K+  E  E+ EEE   L R++
Sbjct: 395 SKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKREL 445


>gnl|CDD|227441 COG5110, RPN1, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 881

 Score = 30.7 bits (69), Expect = 0.31
 Identities = 15/55 (27%), Positives = 33/55 (60%), Gaps = 2/55 (3%)

Query: 104 KEEEEKEEE--EEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDV 156
            +E +K+++  +E+     EK+   +K+KK+EEE++  EE+   + + + LV  +
Sbjct: 2   VDESDKKQQTIDEQSQISPEKQTPNKKDKKKEEEEQLSEEDAMLKGDLELLVERI 56


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 30.8 bits (70), Expect = 0.31
 Identities = 18/53 (33%), Positives = 28/53 (52%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
           E+ +++  +EK EE   +++EE     E KE   E++     E EKEE   KY
Sbjct: 187 EEELERALKEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKY 239



 Score = 29.6 bits (67), Expect = 0.72
 Identities = 16/46 (34%), Positives = 24/46 (52%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
           +K EEEE+ E   +E +EE   + EE+     E KE   E++   E
Sbjct: 182 LKAEEEEELERALKEKREELLSKLEEELLARLESKEAALEKQLRLE 227



 Score = 28.8 bits (65), Expect = 1.1
 Identities = 15/58 (25%), Positives = 27/58 (46%)

Query: 92  EKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
           EK   L     ++     E +E    K+   E E EKE+  ++ +EK+ +E + + E 
Sbjct: 196 EKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEEKLRQELERQAEA 253



 Score = 26.1 bits (58), Expect = 9.1
 Identities = 13/49 (26%), Positives = 27/49 (55%), Gaps = 1/49 (2%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKE-EEEKEKMEEEEKEEEETK 150
           ++E   K EEE     E ++   E++ + E E EKE++ ++ +E+   +
Sbjct: 198 REELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEEKLRQE 246



 Score = 26.1 bits (58), Expect = 9.3
 Identities = 14/45 (31%), Positives = 22/45 (48%)

Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           K EEE+E E   + K EE   + E+E     E ++   E++   E
Sbjct: 183 KAEEEEELERALKEKREELLSKLEEELLARLESKEAALEKQLRLE 227



 Score = 26.1 bits (58), Expect = 9.4
 Identities = 18/46 (39%), Positives = 26/46 (56%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           KK  E K EEEEE  +  +++ EE   K EEE   ++E +E   E+
Sbjct: 177 KKLAELKAEEEEELERALKEKREELLSKLEEELLARLESKEAALEK 222


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 30.6 bits (70), Expect = 0.33
 Identities = 21/44 (47%), Positives = 30/44 (68%), Gaps = 2/44 (4%)

Query: 107 EEKEEEEEEEVKE-EEKEEEEEKEKKE-EEEKEKMEEEEKEEEE 148
           EE E E E++ +E E   +E EK K+E EE+KEK++EEE +  E
Sbjct: 526 EELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLE 569



 Score = 29.0 bits (66), Expect = 1.1
 Identities = 20/65 (30%), Positives = 41/65 (63%), Gaps = 8/65 (12%)

Query: 84  LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
           L+ +A+E E         + KE E+ +EE EE+ ++ ++EE++  E+ E+E ++ ++E +
Sbjct: 532 LEQKAEEAEA--------LLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAK 583

Query: 144 KEEEE 148
           KE +E
Sbjct: 584 KEADE 588


>gnl|CDD|232975 TIGR00437, feoB, ferrous iron transporter FeoB.  FeoB (773 amino
           acids in E. coli), a cytoplasmic membrane protein
           required for iron(II) update, is encoded in an operon
           with FeoA (75 amino acids), which is also required, and
           is regulated by Fur. There appear to be two copies in
           Archaeoglobus fulgidus and Clostridium acetobutylicum
           [Transport and binding proteins, Cations and iron
           carrying compounds].
          Length = 591

 Score = 30.5 bits (69), Expect = 0.35
 Identities = 14/80 (17%), Positives = 29/80 (36%), Gaps = 2/80 (2%)

Query: 49  EEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLLCVLDDQAKETEKAGVLYEDNIKKEEEE 108
            E   E  +   +      +   + +     LL +L+D + E EK      + + KE   
Sbjct: 164 IEIVPEAYQVVEVVEGLIEIIYSISKRGLEILLGLLEDLSLEIEKIERNLAEVVIKESPS 223

Query: 109 KEEEEEEEVKEEEKEEEEEK 128
                  E+ +E++   E+ 
Sbjct: 224 N--LSPTEIADEDRVLVEKS 241


>gnl|CDD|237028 PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed.
          Length = 648

 Score = 30.5 bits (70), Expect = 0.37
 Identities = 9/32 (28%), Positives = 19/32 (59%)

Query: 116 EVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
           +V+EE    +E+ E    +E E+ E++ ++ E
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542



 Score = 30.2 bits (69), Expect = 0.55
 Identities = 10/32 (31%), Positives = 20/32 (62%)

Query: 112 EEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
           + EEE+   +++ E    K+ EE+++K E+ E
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542



 Score = 29.8 bits (68), Expect = 0.59
 Identities = 9/32 (28%), Positives = 18/32 (56%)

Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
           + E+E    +E  E    +E E+++K+ E+ E
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542



 Score = 29.4 bits (67), Expect = 0.78
 Identities = 11/30 (36%), Positives = 18/30 (60%)

Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEE 133
           +EE    +E+ E    +E EE+E+K +K E
Sbjct: 513 EEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542



 Score = 29.4 bits (67), Expect = 0.96
 Identities = 11/32 (34%), Positives = 18/32 (56%)

Query: 110 EEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEE 141
           + EEE    +E+ E    KE +E+E+K +  E
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542



 Score = 29.0 bits (66), Expect = 1.2
 Identities = 11/33 (33%), Positives = 20/33 (60%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEE 135
           + + EE+    +E+++    +E EEKEKK E+ 
Sbjct: 509 RIDVEEEIAYIKEQMEGSAPKEPEEKEKKPEKP 541



 Score = 27.8 bits (63), Expect = 2.9
 Identities = 11/33 (33%), Positives = 19/33 (57%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEE 134
           I  EEE    +E+ E    ++ EE+EK+ ++ E
Sbjct: 510 IDVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542



 Score = 27.8 bits (63), Expect = 3.0
 Identities = 7/31 (22%), Positives = 16/31 (51%)

Query: 120 EEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           + +EE    +++ E    K  EE++++ E  
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKP 541



 Score = 27.5 bits (62), Expect = 4.2
 Identities = 7/32 (21%), Positives = 17/32 (53%)

Query: 114 EEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
           + E +    +E+ E    +E E+++ + E+ E
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542



 Score = 27.1 bits (61), Expect = 5.2
 Identities = 11/31 (35%), Positives = 17/31 (54%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEE 135
            EEE    +E+      KE EE+++K E+ E
Sbjct: 512 VEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542



 Score = 27.1 bits (61), Expect = 5.6
 Identities = 10/30 (33%), Positives = 18/30 (60%)

Query: 119 EEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           + E+E    KE+ E    ++ EE+EK+ E+
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEK 540



 Score = 26.7 bits (60), Expect = 7.7
 Identities = 8/31 (25%), Positives = 17/31 (54%)

Query: 108 EKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
           + EEE     ++ E    +E E+KE++ ++ 
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKP 541


>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32.  This family
           consists of several mammalian specific proacrosin
           binding protein sp32 sequences. sp32 is a sperm specific
           protein which is known to bind with with 55- and 53-kDa
           proacrosins and the 49-kDa acrosin intermediate. The
           exact function of sp32 is unclear, it is thought however
           that the binding of sp32 to proacrosin may be involved
           in packaging the acrosin zymogen into the acrosomal
           matrix.
          Length = 243

 Score = 30.4 bits (68), Expect = 0.38
 Identities = 16/45 (35%), Positives = 30/45 (66%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
           +++   K +E  +E K EEK+ +EE+E++E EE+ K EE +  ++
Sbjct: 199 QEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQGTDD 243



 Score = 26.5 bits (58), Expect = 6.7
 Identities = 14/47 (29%), Positives = 32/47 (68%)

Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           K  + K+E+   +++E  +E + E+++ +EE++E+  EEE ++EE +
Sbjct: 193 KAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQ 239



 Score = 26.5 bits (58), Expect = 7.1
 Identities = 13/46 (28%), Positives = 27/46 (58%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
             + K  + ++E    + +E  ++ K EE++ ++ +EEE+ EEE K
Sbjct: 189 SVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAK 234


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 30.4 bits (68), Expect = 0.39
 Identities = 22/72 (30%), Positives = 42/72 (58%), Gaps = 1/72 (1%)

Query: 83  VLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
           V ++Q +E EKA +  + N  +E  + ++ +  ++K+E K  E+E E KE E ++K E  
Sbjct: 275 VAENQKREIEKAQIEIKKN-DEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPV 333

Query: 143 EKEEEETKYLVR 154
            ++ ++TK  V 
Sbjct: 334 AEDLQKTKPQVE 345



 Score = 29.2 bits (65), Expect = 1.1
 Identities = 12/67 (17%), Positives = 31/67 (46%), Gaps = 4/67 (5%)

Query: 84  LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
            D   ++ + A    +DN  K+ +E  ++++E     +  +    ++ ++  + +  E E
Sbjct: 229 ADKAQQKADFA----QDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIE 284

Query: 144 KEEEETK 150
           K + E K
Sbjct: 285 KAQIEIK 291



 Score = 28.0 bits (62), Expect = 2.2
 Identities = 13/59 (22%), Positives = 28/59 (47%)

Query: 92  EKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           E   V      + E+ + E ++ +E   + K+ +    K+E +  EK  E+++ E + K
Sbjct: 271 EDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKK 329


>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit.  This model
           describes Pseudomonas denitrificans CobT gene product,
           which is a cobalt chelatase subunit that functions in
           cobalamin biosynthesis. Cobalamin (vitamin B12) can be
           synthesized via several pathways, including an aerobic
           pathway (found in Pseudomonas denitrificans) and an
           anaerobic pathway (found in P. shermanii and Salmonella
           typhimurium). These pathways differ in the point of
           cobalt insertion during corrin ring formation. There are
           apparently a number of variations on these two pathways,
           where the major differences seem to be concerned with
           the process of ring contraction. Confusion regarding the
           functions of enzymes found in the aerobic vs. anaerobic
           pathways has arisen because nonhomologous genes in these
           different pathways were given the same gene symbols.
           Thus, cobT in the aerobic pathway (P. denitrificans) is
           not a homolog of cobT in the anaerobic pathway (S.
           typhimurium). It should be noted that E. coli
           synthesizes cobalamin only when it is supplied with the
           precursor cobinamide, which is a complex intermediate.
           Additionally, all E. coli cobalamin synthesis genes
           (cobU, cobS and cobT) were named after their Salmonella
           typhimurium homologs which function in the anaerobic
           cobalamin synthesis pathway. This model describes the
           aerobic cobalamin pathway Pseudomonas denitrificans CobT
           gene product, which is a cobalt chelatase subunit, with
           a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
           heterotrimeric, ATP-dependent enzyme that catalyzes
           cobalt insertion during cobalamin biosynthesis. The
           other two subunits are the P. denitrificans CobS
           (TIGR01650) and CobN (pfam02514 CobN/Magnesium
           Chelatase) proteins. To avoid potential confusion with
           the nonhomologous Salmonella typhimurium/E.coli cobT
           gene product, the P. denitrificans gene symbol is not
           used in the name of this model [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Heme,
           porphyrin, and cobalamin].
          Length = 600

 Score = 30.7 bits (69), Expect = 0.39
 Identities = 16/64 (25%), Positives = 31/64 (48%), Gaps = 3/64 (4%)

Query: 85  DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
           DD   E E+ G   +D+   E E++E+ E E   +E    +E +    E E  + E  + 
Sbjct: 205 DDTESEDEEDG---DDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQS 261

Query: 145 EEEE 148
           ++++
Sbjct: 262 DQDD 265



 Score = 28.4 bits (63), Expect = 1.9
 Identities = 13/51 (25%), Positives = 28/51 (54%)

Query: 98  YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
             D+ + E+EE  ++++    E+E++ E E E +E    ++ E  ++E E 
Sbjct: 203 MGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESES 253



 Score = 28.4 bits (63), Expect = 2.2
 Identities = 19/79 (24%), Positives = 39/79 (49%), Gaps = 6/79 (7%)

Query: 78  NGLLCVLDDQA------KETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKK 131
           + L   +DDQ       +E  ++  L E+     E E EE+ +++   E ++EE+ + + 
Sbjct: 174 DRLSAAIDDQQAFARVVREMLRSMELAEEMGDDTESEDEEDGDDDQPTENEQEEQGEGEG 233

Query: 132 EEEEKEKMEEEEKEEEETK 150
           E +E    +E E  + E++
Sbjct: 234 EGQEGSAPQESEATDRESE 252


>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
          Length = 1021

 Score = 30.5 bits (68), Expect = 0.41
 Identities = 22/69 (31%), Positives = 35/69 (50%), Gaps = 2/69 (2%)

Query: 89  KETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEE--EKEKMEEEEKEE 146
           KE      L    ++K+  E+ E EE E  E E+ E  E+E+ E E  E+E++E +  E 
Sbjct: 442 KENAHRKALEMKILEKKRIERLEREERERLERERMERIERERLERERLERERLERDRLER 501

Query: 147 EETKYLVRD 155
           +    L R+
Sbjct: 502 DRLDRLERE 510



 Score = 28.9 bits (64), Expect = 1.5
 Identities = 16/52 (30%), Positives = 32/52 (61%)

Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
           ++E    +  E ++ E+++ E  E+E++E  E+E+ME  E+E  E + L R+
Sbjct: 441 EKENAHRKALEMKILEKKRIERLEREERERLERERMERIERERLERERLERE 492


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 29.7 bits (67), Expect = 0.41
 Identities = 19/55 (34%), Positives = 35/55 (63%), Gaps = 1/55 (1%)

Query: 97  LYEDNIKKEEEEKEEE-EEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           L+ +  +KE++E  +E EE+E  + +KE++E+KEKK  E+  K +  +  +  TK
Sbjct: 90  LFPEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTK 144



 Score = 27.8 bits (62), Expect = 1.8
 Identities = 18/48 (37%), Positives = 32/48 (66%)

Query: 101 NIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           N+  EE EKE++E  +  EE++  + +KEKKE++EK+  E+  K++  
Sbjct: 89  NLFPEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKST 136


>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK.  Members of this
           family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
           chaperone system. All members of the seed alignment were
           taken from completely sequenced bacterial or archaeal
           genomes and (except for Mycoplasma sequence) found
           clustered with other genes of this systems. This model
           excludes DnaK homologs that are not DnaK itself, such as
           the heat shock cognate protein HscA (TIGR01991).
           However, it is not designed to distinguish among DnaK
           paralogs in eukaryotes. Note that a number of dnaK genes
           have shadow ORFs in the same reverse (relative to dnaK)
           reading frame, a few of which have been assigned
           glutamate dehydrogenase activity. The significance of
           this observation is unclear; lengths of such shadow ORFs
           are highly variable as if the presumptive protein
           product is not conserved [Protein fate, Protein folding
           and stabilization].
          Length = 595

 Score = 30.4 bits (69), Expect = 0.41
 Identities = 18/51 (35%), Positives = 26/51 (50%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
           ED  +KEE E     +    + EK  +E  +K   EEKEK+E+   E +E 
Sbjct: 516 EDKKRKEEIEARNNADSLAYQAEKTLKEAGDKLPAEEKEKIEKAVAELKEA 566



 Score = 27.7 bits (62), Expect = 3.3
 Identities = 21/58 (36%), Positives = 27/58 (46%), Gaps = 19/58 (32%)

Query: 110 EEEEEEEVKEEEKEEEEEKEKKEE-------------------EEKEKMEEEEKEEEE 148
           EEE E  VKE E   EE+K++KEE                   E  +K+  EEKE+ E
Sbjct: 500 EEEIERMVKEAEANAEEDKKRKEEIEARNNADSLAYQAEKTLKEAGDKLPAEEKEKIE 557


>gnl|CDD|236842 PRK11091, PRK11091, aerobic respiration control sensor protein
           ArcB; Provisional.
          Length = 779

 Score = 30.3 bits (69), Expect = 0.43
 Identities = 15/40 (37%), Positives = 22/40 (55%), Gaps = 5/40 (12%)

Query: 100 DNIKK-----EEEEKEEEEEEEVKEEEKEEEEEKEKKEEE 134
           DNI +      E EK EE  +E  E+ K E +E+E+ + E
Sbjct: 110 DNIAQLNQEIAEREKAEEARQEAFEQLKNEIKEREETQIE 149


>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT.  This family
           consists of several bacterial cobalamin biosynthesis
           (CobT) proteins. CobT is involved in the transformation
           of precorrin-3 into cobyrinic acid.
          Length = 282

 Score = 30.2 bits (68), Expect = 0.43
 Identities = 13/49 (26%), Positives = 24/49 (48%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
            D+   E+E+  +E+E++ + EE+E        E+ +    E E  E E
Sbjct: 219 ADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEME 267



 Score = 27.5 bits (61), Expect = 2.9
 Identities = 9/47 (19%), Positives = 26/47 (55%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
           + E  + E+ E+E+  +E++++++ +E++        E+ +   EE 
Sbjct: 215 EPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEM 261



 Score = 27.5 bits (61), Expect = 3.6
 Identities = 11/37 (29%), Positives = 23/37 (62%)

Query: 112 EEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           +  EE+ +E +  + E  + E++ KE  ++++ EEEE
Sbjct: 207 DMAEELGDEPESADSEDNEDEDDPKEDEDDDQGEEEE 243



 Score = 27.1 bits (60), Expect = 4.6
 Identities = 15/51 (29%), Positives = 27/51 (52%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
           +++  KE+E+ ++ EEEE    +   E+     EE E  +ME  E   ++T
Sbjct: 226 DEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAAEASADDT 276



 Score = 26.3 bits (58), Expect = 7.9
 Identities = 11/54 (20%), Positives = 26/54 (48%)

Query: 98  YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
             D  +  + E  E+E++  ++E+ ++ EE+E    +   +  +   EE E+  
Sbjct: 212 LGDEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGE 265



 Score = 26.3 bits (58), Expect = 8.1
 Identities = 15/55 (27%), Positives = 28/55 (50%), Gaps = 5/55 (9%)

Query: 99  EDNIKKEEEEKEEEEEEE-----VKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           ED+ K++E++ + EEEE      + E+     EE E  E E  E   ++  + ++
Sbjct: 227 EDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAAEASADDTPDSDD 281


>gnl|CDD|222648 pfam14283, DUF4366, Domain of unknown function (DUF4366).  This
           family of proteins is found in bacteria and eukaryotes.
           Proteins in this family are typically between 227 and
           387 amino acids in length.
          Length = 213

 Score = 30.0 bits (68), Expect = 0.44
 Identities = 9/40 (22%), Positives = 13/40 (32%)

Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
           EE++                 +    E E E   EEE E+
Sbjct: 115 EEEDAAAPTCSCCPVCSVNMTECTGPEPEPEPEPEEEPEK 154



 Score = 29.6 bits (67), Expect = 0.61
 Identities = 9/40 (22%), Positives = 13/40 (32%)

Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
           EEE                  E    E E + + EEE ++
Sbjct: 115 EEEDAAAPTCSCCPVCSVNMTECTGPEPEPEPEPEEEPEK 154



 Score = 29.2 bits (66), Expect = 0.72
 Identities = 11/40 (27%), Positives = 12/40 (30%)

Query: 111 EEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           EEE+                 E    E   E E EEE  K
Sbjct: 115 EEEDAAAPTCSCCPVCSVNMTECTGPEPEPEPEPEEEPEK 154



 Score = 28.8 bits (65), Expect = 1.1
 Identities = 11/40 (27%), Positives = 13/40 (32%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
           EEE+                 E    + E E E  EE EK
Sbjct: 115 EEEDAAAPTCSCCPVCSVNMTECTGPEPEPEPEPEEEPEK 154



 Score = 28.5 bits (64), Expect = 1.3
 Identities = 8/41 (19%), Positives = 13/41 (31%)

Query: 109 KEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
           +EE+                 +    E E   E E+E E+ 
Sbjct: 115 EEEDAAAPTCSCCPVCSVNMTECTGPEPEPEPEPEEEPEKK 155



 Score = 26.9 bits (60), Expect = 4.6
 Identities = 8/41 (19%), Positives = 15/41 (36%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
           +++E+          V      E    E + E E E+  E+
Sbjct: 114 MEEEDAAAPTCSCCPVCSVNMTECTGPEPEPEPEPEEEPEK 154


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 30.3 bits (68), Expect = 0.44
 Identities = 19/58 (32%), Positives = 37/58 (63%)

Query: 87  QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
           QA E E+   L ++ +K +E++K+ EE E+  + E++++EE+ +K   E++K  E  K
Sbjct: 96  QAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAK 153



 Score = 29.5 bits (66), Expect = 0.66
 Identities = 15/52 (28%), Positives = 34/52 (65%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           +   KKEE+  EE + ++  E+E+ ++ EKE+ + +E++K  EE +++ + +
Sbjct: 79  QQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLE 130



 Score = 27.6 bits (61), Expect = 3.5
 Identities = 16/65 (24%), Positives = 38/65 (58%), Gaps = 3/65 (4%)

Query: 87  QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKE---EEEEKEKKEEEEKEKMEEEE 143
            A+E +      ++ +K+ E+E+ + +E++ + EE E   + E+K+++E+  K   E+++
Sbjct: 88  VAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKK 147

Query: 144 KEEEE 148
           K E  
Sbjct: 148 KAEAA 152



 Score = 26.8 bits (59), Expect = 5.2
 Identities = 20/66 (30%), Positives = 26/66 (39%), Gaps = 2/66 (3%)

Query: 87  QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEE--EEEKEKKEEEEKEKMEEEEK 144
           +A           +  KK EE  +  EE + K E      + E E K   EK K E E K
Sbjct: 155 KAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAK 214

Query: 145 EEEETK 150
            + E K
Sbjct: 215 AKAEKK 220



 Score = 26.5 bits (58), Expect = 8.5
 Identities = 14/64 (21%), Positives = 24/64 (37%)

Query: 87  QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
             K+ E       +  K E E K + E++     E++   EK+K   + K        + 
Sbjct: 192 AKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAAAAKA 251

Query: 147 EETK 150
            E K
Sbjct: 252 AERK 255



 Score = 26.5 bits (58), Expect = 9.1
 Identities = 16/57 (28%), Positives = 39/57 (68%), Gaps = 2/57 (3%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEE--EEEKEKKEEEEKEKMEEEEKEEEETKYLV 153
           + + KK E++++++EE+  +E + ++  E+E+ K+ E+E+ K +E++K+ EE +   
Sbjct: 71  QSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQA 127


>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682).  This
           domain family is found in eukaryotes, and is typically
           between 125 and 136 amino acids in length.
          Length = 133

 Score = 29.4 bits (66), Expect = 0.45
 Identities = 13/38 (34%), Positives = 23/38 (60%)

Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEE 141
             +EE+EEEEE E +++  E + ++ ++ E   E  EE
Sbjct: 94  TRQEEEEEEEENEKQQQSDEAQVQQHQQHEHPAENGEE 131



 Score = 27.5 bits (61), Expect = 1.9
 Identities = 12/38 (31%), Positives = 24/38 (63%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
             +E+EEEEEE  K+++ +E + ++ ++ E   +  EE
Sbjct: 94  TRQEEEEEEEENEKQQQSDEAQVQQHQQHEHPAENGEE 131



 Score = 26.3 bits (58), Expect = 5.4
 Identities = 10/29 (34%), Positives = 24/29 (82%)

Query: 118 KEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
           ++EE+EEEEE EK+++ ++ ++++ ++ E
Sbjct: 95  RQEEEEEEEENEKQQQSDEAQVQQHQQHE 123


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 30.4 bits (69), Expect = 0.46
 Identities = 19/60 (31%), Positives = 33/60 (55%)

Query: 83  VLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
            LD Q  E ++A   YE      E+ + ++  EE++EE  EE   K KK++ +K++  E+
Sbjct: 381 NLDRQLSELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEK 440



 Score = 27.7 bits (62), Expect = 3.6
 Identities = 12/51 (23%), Positives = 32/51 (62%)

Query: 98  YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
            ++  ++++ ++   E E+  E+E ++ E K +K+E+E E++E+  +E  +
Sbjct: 271 LDEKFERDKIKQLASELEKKLEKELKKLENKLEKQEDELEELEKAAEELRQ 321



 Score = 26.6 bits (59), Expect = 8.5
 Identities = 18/53 (33%), Positives = 31/53 (58%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYL 152
           D IK+   E E++ E+E+K+ E + E+++++ EE EK   E  +K E     L
Sbjct: 278 DKIKQLASELEKKLEKELKKLENKLEKQEDELEELEKAAEELRQKGELLYANL 330


>gnl|CDD|236278 PRK08506, PRK08506, replicative DNA helicase; Provisional.
          Length = 472

 Score = 30.0 bits (68), Expect = 0.49
 Identities = 16/41 (39%), Positives = 26/41 (63%), Gaps = 1/41 (2%)

Query: 97  LYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
           +Y D++ KE EEKE+E++ + KE ++E     + K  EE E
Sbjct: 384 VYRDDVYKEREEKEKEKKAK-KEGKEERRIHFQNKSIEEAE 423


>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6.  RPL6
           contains KOW motif that has an extra ribosomal role as
           an oncogenic. KOW domain is known as an RNA-binding
           motif that is shared so far among some families of
           ribosomal proteins, the essential bacterial
           transcriptional elongation factor NusG, the eukaryotic
           chromatin elongation factor Spt5, the higher eukaryotic
           KIN17 proteins and Mtr4. .
          Length = 152

 Score = 29.4 bits (67), Expect = 0.50
 Identities = 13/56 (23%), Positives = 29/56 (51%), Gaps = 1/56 (1%)

Query: 98  YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEE-EKEKMEEEEKEEEETKYL 152
           Y    KK++++K+E E  E K+++    EE+++ ++  +   +   +K      YL
Sbjct: 80  YFKRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKAVDAALLAAIKKVPLLKGYL 135


>gnl|CDD|184899 PRK14906, PRK14906, DNA-directed RNA polymerase subunit beta'/alpha
           domain fusion protein; Provisional.
          Length = 1460

 Score = 30.2 bits (68), Expect = 0.52
 Identities = 17/61 (27%), Positives = 28/61 (45%)

Query: 98  YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVH 157
           + D+I  +E    EE   EV +  +E  E K  + E     M+ E K+    + L R++ 
Sbjct: 189 FVDDIGDDERLTAEEVRAEVADIYEEYNERKALRREAFDAFMQIEPKQLISDEALYREMR 248

Query: 158 L 158
           L
Sbjct: 249 L 249



 Score = 27.5 bits (61), Expect = 4.0
 Identities = 20/81 (24%), Positives = 39/81 (48%)

Query: 57  RWRHIEFSDNTLCLQLVEGKPNGLLCVLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEE 116
           R  HIE +     +   +G P+ L  +LD + K+ EK        I   ++E  EE+ ++
Sbjct: 93  RMGHIELAAPVSHIWYFKGSPSRLGYLLDIKPKDLEKVLYFASYIITSVDKEAREEDADD 152

Query: 117 VKEEEKEEEEEKEKKEEEEKE 137
           +++E   + EE + + +   E
Sbjct: 153 LRDELAADLEELDAERDRLIE 173


>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the monomeric yeast and human topo I
           and heterodimeric topo I from Leishmania donvanni. Topo
           I enzymes are divided into:  topo type IA (bacterial)
           and type IB (eukaryotic). Topo I relaxes superhelical
           tension in duplex DNA by creating a single-strand nick,
           the broken strand can then rotate around the unbroken
           strand to remove DNA supercoils and, the nick is
           religated, liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit re-ligation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I play putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 29.5 bits (67), Expect = 0.52
 Identities = 17/39 (43%), Positives = 25/39 (64%)

Query: 121 EKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLE 159
           E+E+E++K   +EE+K   EE+EK EE   Y + D H E
Sbjct: 95  EEEKEKKKAMSKEEKKAIKEEKEKLEEPYGYCLVDGHKE 133



 Score = 27.2 bits (61), Expect = 3.4
 Identities = 15/33 (45%), Positives = 23/33 (69%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEE 134
           I +  EE++E+++   KEE+K  +EEKEK EE 
Sbjct: 90  IYQYFEEEKEKKKAMSKEEKKAIKEEKEKLEEP 122


>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi.  This family includes
           the fifth essential DNA polymerase in yeast EC:2.7.7.7.
           Pol5p is localised exclusively to the nucleolus and
           binds near or at the enhancer region of rRNA-encoding
           DNA repeating units.
          Length = 784

 Score = 30.2 bits (68), Expect = 0.54
 Identities = 16/43 (37%), Positives = 25/43 (58%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEE 141
           ED    EE + +E+E E +++ E E E + E  EE+E+E   E
Sbjct: 650 EDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDDAE 692



 Score = 29.8 bits (67), Expect = 0.58
 Identities = 14/43 (32%), Positives = 29/43 (67%), Gaps = 1/43 (2%)

Query: 108 EKEEEEEEEVKEEEKEEEEEKEKKE-EEEKEKMEEEEKEEEET 149
           E EEE+E++++E + +E+E +  ++ E E E   E+ +E+E+ 
Sbjct: 646 EGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQE 688


>gnl|CDD|224337 COG1419, FlhF, Flagellar GTP-binding protein [Cell motility and
           secretion].
          Length = 407

 Score = 30.0 bits (68), Expect = 0.56
 Identities = 16/57 (28%), Positives = 27/57 (47%), Gaps = 1/57 (1%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE-EKEEEETKYLVR 154
           E  + K+  EK+ EE +  K+ E+      E+K +E K+  EE  E   +E   +  
Sbjct: 76  EAQLLKDPAEKKREERKAAKKIERSTPSLIERKTQEVKDSGEEIAEMMRDEKVPIRE 132


>gnl|CDD|233459 TIGR01541, tape_meas_lam_C, phage tail tape measure protein, lambda
           family.  This model represents a relatively
           well-conserved region near the C-terminus of the tape
           measure protein of a lambda and related phage. This
           protein, which controls phage tail length, is typically
           about 1000 residues in length. Both low-complexity
           sequence and insertion/deletion events appear common in
           this family. Mutational studies suggest a ruler or
           template role in the determination of phage tail length.
           Similar behavior is attributed to proteins from
           distantly related or unrelated families in other phage
           [Mobile and extrachromosomal element functions, Prophage
           functions].
          Length = 332

 Score = 29.8 bits (67), Expect = 0.57
 Identities = 12/66 (18%), Positives = 24/66 (36%), Gaps = 2/66 (3%)

Query: 85  DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEE--EEEKEKKEEEEKEKMEEE 142
           +     T++        +   +E+  +   +E+    K E   EE E+K  E  +K+ E 
Sbjct: 1   NQLLLLTQQIADRKLKKLNTADEKSLQSRSDEIIALIKLEKLLEEAERKALEALKKLAEA 60

Query: 143 EKEEEE 148
                 
Sbjct: 61  TASIRA 66


>gnl|CDD|180536 PRK06347, PRK06347, autolysin; Reviewed.
          Length = 592

 Score = 30.0 bits (67), Expect = 0.57
 Identities = 15/59 (25%), Positives = 25/59 (42%), Gaps = 4/59 (6%)

Query: 86  DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKK----EEEEKEKME 140
           D+A ++ +A    E       E   E   E  + E KE+ +  E+K    ++ EK   E
Sbjct: 59  DEASKSAEANTTKEAPATATPENTTEPTVEPKQTETKEQTKTPEEKQPAAKQVEKAPAE 117


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 29.5 bits (67), Expect = 0.59
 Identities = 23/58 (39%), Positives = 32/58 (55%), Gaps = 4/58 (6%)

Query: 98  YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
            E  IKK EEEKEE E+   + E K E  EK     EE+E+  EE++  +E  +L + 
Sbjct: 125 LEQEIKKLEEEKEELEKRVAELEAKLEAIEKR----EEEERQIEEKRHADEIAFLKKQ 178



 Score = 26.4 bits (59), Expect = 6.6
 Identities = 20/61 (32%), Positives = 34/61 (55%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLESCE 162
           K  + E+ + E E+  ++ +EE+EE EK+  E + K+E  EK EEE + +    H +   
Sbjct: 114 KALQAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADEIA 173

Query: 163 F 163
           F
Sbjct: 174 F 174


>gnl|CDD|235672 PRK06012, flhA, flagellar biosynthesis protein FlhA; Validated.
          Length = 697

 Score = 30.1 bits (69), Expect = 0.60
 Identities = 11/25 (44%), Positives = 15/25 (60%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEE 127
           K+E++  E   EE  +EE  E EEE
Sbjct: 328 KREKKAAELAAEEAEEEEAAEPEEE 352



 Score = 29.3 bits (67), Expect = 1.0
 Identities = 11/28 (39%), Positives = 17/28 (60%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKE 129
           ++K E++  E   EE +EEE  E EE+ 
Sbjct: 326 LRKREKKAAELAAEEAEEEEAAEPEEES 353


>gnl|CDD|235229 PRK04156, gltX, glutamyl-tRNA synthetase; Provisional.
          Length = 567

 Score = 29.8 bits (68), Expect = 0.62
 Identities = 12/28 (42%), Positives = 19/28 (67%)

Query: 110 EEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
            EE+ E ++E   E  EE+E+K+EE+K 
Sbjct: 63  LEEQRERLEELAPELLEEEEEKKEEKKG 90



 Score = 27.1 bits (61), Expect = 5.5
 Identities = 16/40 (40%), Positives = 25/40 (62%), Gaps = 1/40 (2%)

Query: 114 EEEVKEEEKEEEEEKEKK-EEEEKEKMEEEEKEEEETKYL 152
           +E V+E      EE+ ++ EE   E +EEEE+++EE K L
Sbjct: 52  KEVVEEVNSLSLEEQRERLEELAPELLEEEEEKKEEKKGL 91



 Score = 26.7 bits (60), Expect = 6.2
 Identities = 13/39 (33%), Positives = 22/39 (56%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
            EE++E  EE   +  E+EEE+++EKK        E+ +
Sbjct: 63  LEEQRERLEELAPELLEEEEEKKEEKKGLPPLPNAEKGK 101


>gnl|CDD|234715 PRK00290, dnaK, molecular chaperone DnaK; Provisional.
          Length = 627

 Score = 29.7 bits (68), Expect = 0.64
 Identities = 17/60 (28%), Positives = 26/60 (43%), Gaps = 19/60 (31%)

Query: 110 EEEEEEEVKEEEKEEEEEKEKKEE-------------------EEKEKMEEEEKEEEETK 150
           +EE E  VK+ E   EE+K++KE                    E  +K+  +EKE+ E  
Sbjct: 502 DEEIERMVKDAEANAEEDKKRKELVEARNQADSLIYQTEKTLKELGDKVPADEKEKIEAA 561



 Score = 27.8 bits (63), Expect = 3.2
 Identities = 19/68 (27%), Positives = 33/68 (48%), Gaps = 16/68 (23%)

Query: 97  LYEDNIKK--------EEEEKEEEEEEEVKEE--------EKEEEEEKEKKEEEEKEKME 140
           L ++ I++         EE+K+ +E  E + +        EK  +E  +K   +EKEK+E
Sbjct: 500 LSDEEIERMVKDAEANAEEDKKRKELVEARNQADSLIYQTEKTLKELGDKVPADEKEKIE 559

Query: 141 EEEKEEEE 148
              KE +E
Sbjct: 560 AAIKELKE 567


>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890).  This
           family is conserved in dsDNA adenoviruses of
           vertebrates. The function is not known.
          Length = 172

 Score = 29.5 bits (66), Expect = 0.64
 Identities = 17/42 (40%), Positives = 30/42 (71%)

Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
           K+EEE  + + EEV+E+E+E E+ ++  +EE++E  E EE+ 
Sbjct: 18  KDEEEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVEEET 59



 Score = 26.0 bits (57), Expect = 7.8
 Identities = 15/39 (38%), Positives = 25/39 (64%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
           E++   + EE EE+EEE    E+  +EE++E +E EE+ 
Sbjct: 21  EEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVEEET 59


>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
          Length = 135

 Score = 28.9 bits (65), Expect = 0.68
 Identities = 12/39 (30%), Positives = 29/39 (74%)

Query: 112 EEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           EE++ VK+  KE+ ++K++K++++K+K ++  K+  + K
Sbjct: 96  EEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134


>gnl|CDD|223626 COG0552, FtsY, Signal recognition particle GTPase [Intracellular
           trafficking and secretion].
          Length = 340

 Score = 29.5 bits (67), Expect = 0.68
 Identities = 15/36 (41%), Positives = 23/36 (63%)

Query: 111 EEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
               E+++EE +E E EKEK EEE++E+ +E   E 
Sbjct: 1   MGFFEKLEEELEELEAEKEKIEEEDEEEEKEGWFER 36



 Score = 29.1 bits (66), Expect = 1.1
 Identities = 17/35 (48%), Positives = 24/35 (68%)

Query: 110 EEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
            E+ EEE++E E E+E+ +E+ EEEEKE   E  K
Sbjct: 4   FEKLEEELEELEAEKEKIEEEDEEEEKEGWFERLK 38



 Score = 28.8 bits (65), Expect = 1.2
 Identities = 15/36 (41%), Positives = 24/36 (66%)

Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKM 139
              E+ EEE EE   E+EK EEE++E+++E   E++
Sbjct: 2   GFFEKLEEELEELEAEKEKIEEEDEEEEKEGWFERL 37



 Score = 28.4 bits (64), Expect = 1.8
 Identities = 26/79 (32%), Positives = 36/79 (45%), Gaps = 22/79 (27%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEE----------------------EEKEKME 140
           K EEE +E E E+E  EEE EEEE++   E                       + KEK++
Sbjct: 6   KLEEELEELEAEKEKIEEEDEEEEKEGWFERLKQGLSKTKKNFGKGIKGLFLKKIKEKLD 65

Query: 141 EEEKEEEETKYLVRDVHLE 159
           E+  EE E   +  DV +E
Sbjct: 66  EDLLEELEELLIEADVGVE 84



 Score = 27.6 bits (62), Expect = 3.1
 Identities = 12/36 (33%), Positives = 24/36 (66%)

Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKM 139
           ++ EE+ EE E E ++ E+E+EEE+++   E  ++ 
Sbjct: 5   EKLEEELEELEAEKEKIEEEDEEEEKEGWFERLKQG 40



 Score = 27.6 bits (62), Expect = 3.2
 Identities = 14/30 (46%), Positives = 22/30 (73%)

Query: 119 EEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
               E+ EE+ ++ E EKEK+EEE++EEE+
Sbjct: 1   MGFFEKLEEELEELEAEKEKIEEEDEEEEK 30



 Score = 27.2 bits (61), Expect = 3.7
 Identities = 14/35 (40%), Positives = 22/35 (62%)

Query: 110 EEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
              E+ E + EE E E+EK ++E+EE+EK    E+
Sbjct: 2   GFFEKLEEELEELEAEKEKIEEEDEEEEKEGWFER 36


>gnl|CDD|237001 PRK11856, PRK11856, branched-chain alpha-keto acid dehydrogenase
           subunit E2; Reviewed.
          Length = 411

 Score = 29.8 bits (68), Expect = 0.71
 Identities = 7/44 (15%), Positives = 7/44 (15%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           EE E E     E   E    E                       
Sbjct: 78  EEGEAEAAAAAEAAPEAPAPEPAPAAAAAAAAAPAAAAAPAAPA 121


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 29.5 bits (67), Expect = 0.71
 Identities = 21/73 (28%), Positives = 34/73 (46%), Gaps = 7/73 (9%)

Query: 84  LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
            D+  +E +K G        +  E +EE    E+  E KEE EE EK+ E   E+++E  
Sbjct: 190 SDEVEEELKKLGF-------ERLELEEEGTPSELIREIKEELEEIEKERESLLEELKELA 242

Query: 144 KEEEETKYLVRDV 156
           K+  E    + + 
Sbjct: 243 KKYLEELLALYEY 255



 Score = 29.1 bits (66), Expect = 1.2
 Identities = 15/56 (26%), Positives = 26/56 (46%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHL 158
            KEE E+ E+E E + EE KE  ++  ++     E +E E +  E     ++    
Sbjct: 220 IKEELEEIEKERESLLEELKELAKKYLEELLALYEYLEIELERAEALSKFLKTDKT 275



 Score = 28.4 bits (64), Expect = 1.7
 Identities = 18/63 (28%), Positives = 32/63 (50%), Gaps = 2/63 (3%)

Query: 92  EKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEE--EKEKKEEEEKEKMEEEEKEEEET 149
            K   L E+  K   +  EE  ++  +E EK E+E  E E++  E + +++E E+E E  
Sbjct: 67  PKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIERL 126

Query: 150 KYL 152
           +  
Sbjct: 127 EPW 129



 Score = 28.0 bits (63), Expect = 3.0
 Identities = 27/135 (20%), Positives = 56/135 (41%), Gaps = 11/135 (8%)

Query: 7   FTIRGEEEEEEEEEEDFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDN 66
            T+  ++ EE + E D E +    + +    Y    V    +E  ++     + + F   
Sbjct: 150 GTVPEDKLEELKLESDVENV-EYISTDKGYVYV---VVVVLKELSDEVEEELKKLGFERL 205

Query: 67  TLCLQLVEGKPNGLLCVLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEE 126
            L     EG P+ L+  + ++ +E EK      +++ +E +E  ++  EE+    +  E 
Sbjct: 206 EL---EEEGTPSELIREIKEELEEIEK----ERESLLEELKELAKKYLEELLALYEYLEI 258

Query: 127 EKEKKEEEEKEKMEE 141
           E E+ E   K    +
Sbjct: 259 ELERAEALSKFLKTD 273


>gnl|CDD|219038 pfam06459, RR_TM4-6, Ryanodine Receptor TM 4-6.  This region covers
           TM regions 4-6 of the ryanodine receptor 1 family.
          Length = 217

 Score = 29.3 bits (66), Expect = 0.72
 Identities = 19/51 (37%), Positives = 26/51 (50%), Gaps = 1/51 (1%)

Query: 105 EEEEKEEEEEEEVKEEEKEE-EEEKEKKEEEEKEKMEEEEKEEEETKYLVR 154
           E E +E    E    E++E  EE +E K E EK  +E+ EK  +   YL R
Sbjct: 72  EVEPEEPPTPEGAPAEKRELKEEGEEPKSEPEKADLEDGEKRTKILNYLAR 122


>gnl|CDD|227447 COG5117, NOC3, Protein involved in the nuclear export of
           pre-ribosomes [Translation, ribosomal structure and
           biogenesis / Intracellular trafficking and secretion].
          Length = 657

 Score = 29.6 bits (66), Expect = 0.72
 Identities = 12/47 (25%), Positives = 27/47 (57%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
           E+ + + E E   E + ++K+E+++  EE++   E   ++  + EKE
Sbjct: 92  EEFLSESESEASLEIDSDIKDEKQKSLEEQKIAPEIPVKQQIDSEKE 138



 Score = 28.1 bits (62), Expect = 2.2
 Identities = 14/48 (29%), Positives = 24/48 (50%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
             ++   EE   E E E   E + + ++EK+K  EE+K   E   K++
Sbjct: 85  VADVNNGEEFLSESESEASLEIDSDIKDEKQKSLEEQKIAPEIPVKQQ 132



 Score = 26.9 bits (59), Expect = 6.1
 Identities = 14/47 (29%), Positives = 20/47 (42%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
            K          EE + E E E   E +   ++EK+K  EE+K   E
Sbjct: 80  TKDNVVADVNNGEEFLSESESEASLEIDSDIKDEKQKSLEEQKIAPE 126



 Score = 26.5 bits (58), Expect = 7.2
 Identities = 10/41 (24%), Positives = 21/41 (51%)

Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
           EE   E E E   E   + +++++K  EE++   E   +++
Sbjct: 92  EEFLSESESEASLEIDSDIKDEKQKSLEEQKIAPEIPVKQQ 132


>gnl|CDD|240327 PTZ00243, PTZ00243, ABC transporter; Provisional.
          Length = 1560

 Score = 29.7 bits (67), Expect = 0.75
 Identities = 5/41 (12%), Positives = 15/41 (36%)

Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
           K+ +E + + E    +       + E    +++   E  + 
Sbjct: 885 KDSKEGDADAEVAEVDAAPGGAVDHEPPVAKQEGNAEGGDG 925



 Score = 29.7 bits (67), Expect = 0.86
 Identities = 8/46 (17%), Positives = 13/46 (28%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
              +        + E    K+E   E  +    +      M  EEK
Sbjct: 896 VAEVDAAPGGAVDHEPPVAKQEGNAEGGDGAALDAAAGRLMTREEK 941


>gnl|CDD|236267 PRK08451, PRK08451, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 535

 Score = 29.6 bits (67), Expect = 0.79
 Identities = 15/50 (30%), Positives = 25/50 (50%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
              IK ++  K  E   +  +E K   E+K+   E   E +EEE K+++E
Sbjct: 447 GAKIKIQKALKSAENPLQSLKEFKPSNEKKKIDTESTAEMLEEEAKKDDE 496


>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413).  This is
           a family of proteins conserved in fungi. The function is
           not known.
          Length = 436

 Score = 29.4 bits (66), Expect = 0.79
 Identities = 11/47 (23%), Positives = 17/47 (36%)

Query: 101 NIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
           +   E      EE+     +  E       K +    + EEEE+ EE
Sbjct: 69  SKPTESSAASSEEKPAKPRKSAESTRSSHPKSKAPSTESEEEEEPEE 115


>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
           chain; Provisional.
          Length = 1033

 Score = 29.8 bits (67), Expect = 0.82
 Identities = 15/47 (31%), Positives = 25/47 (53%)

Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           +  EE++EEE E V      + ++ E   E+E E  E++E+ E   K
Sbjct: 8   QANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAK 54



 Score = 28.6 bits (64), Expect = 1.5
 Identities = 15/44 (34%), Positives = 22/44 (50%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           EEE+   +  EE  EEE E        + ++ E   E+E E+EE
Sbjct: 1   EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEE 44



 Score = 27.1 bits (60), Expect = 5.4
 Identities = 12/44 (27%), Positives = 24/44 (54%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
             +  EEE+EEE++   +    + +  E   +++ E+EE +EE 
Sbjct: 6   NTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEA 49



 Score = 27.1 bits (60), Expect = 6.3
 Identities = 14/60 (23%), Positives = 30/60 (50%), Gaps = 2/60 (3%)

Query: 85  DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEK--EKKEEEEKEKMEEE 142
           D Q    ++   LYE  ++   +  ++ + ++  +  + EE       +E+EEKE++ EE
Sbjct: 763 DFQFFNVQRLTELYEKEVRYLMQAHQKGQLKDTIDVAEPEEPGDPLTAEEQEEKEQLLEE 822


>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
           [Energy production and conversion].
          Length = 470

 Score = 29.4 bits (66), Expect = 0.85
 Identities = 16/33 (48%), Positives = 19/33 (57%)

Query: 108 EKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKME 140
           E+  EEEEE +EEE+EE  E E   EE     E
Sbjct: 399 ERWAEEEEEEEEEEEEEAAEAEAPMEEPVPGFE 431



 Score = 29.0 bits (65), Expect = 1.2
 Identities = 17/42 (40%), Positives = 26/42 (61%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEE 141
           D +++  +EK     E   EEE+EEEEE+E++  E +  MEE
Sbjct: 384 DELREFLKEKGHPVVERWAEEEEEEEEEEEEEAAEAEAPMEE 425



 Score = 27.9 bits (62), Expect = 3.2
 Identities = 18/49 (36%), Positives = 29/49 (59%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
           E++    +E +E  +E+     E+  EEE+E++EEEE+E  E E   EE
Sbjct: 377 EEDATTIDELREFLKEKGHPVVERWAEEEEEEEEEEEEEAAEAEAPMEE 425



 Score = 27.9 bits (62), Expect = 3.2
 Identities = 15/25 (60%), Positives = 16/25 (64%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKE 129
           EEEE+EEEEEEE  E E   EE   
Sbjct: 404 EEEEEEEEEEEEAAEAEAPMEEPVP 428



 Score = 27.5 bits (61), Expect = 3.3
 Identities = 12/31 (38%), Positives = 19/31 (61%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKE 132
           +++  EE+EEEEEEE +E  + E   +E   
Sbjct: 398 VERWAEEEEEEEEEEEEEAAEAEAPMEEPVP 428


>gnl|CDD|215637 PLN03223, PLN03223, Polycystin cation channel protein; Provisional.
          Length = 1634

 Score = 29.5 bits (66), Expect = 0.85
 Identities = 30/141 (21%), Positives = 62/141 (43%), Gaps = 23/141 (16%)

Query: 10   RGEEEEEEEEEEDFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIR--WRH--IEFSD 65
            +GE  +EEEEE   E+               + VF Y  +E ++ G++   R   IE   
Sbjct: 1473 KGENPDEEEEEAFREE--------------KEKVFTYLNKELDEAGLKRVLRRCVIETYQ 1518

Query: 66   NT-----LCLQLVEGKPNGLLCVLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEE 120
             T     L    V     G        A E ++A  +  D + +  ++++++E+ +V E+
Sbjct: 1519 KTDKSKVLLRLRVGNVGKGKKKETLATADEIDQAAHMLMDQVGQVPDDEDDDEDGDVLEK 1578

Query: 121  EKEEEEEKEKKEEEEKEKMEE 141
            E ++ ++  ++  E + ++ E
Sbjct: 1579 EVDQLQQSLERLAEVQRELAE 1599


>gnl|CDD|178493 PLN02905, PLN02905, beta-amylase.
          Length = 702

 Score = 29.6 bits (66), Expect = 0.87
 Identities = 12/29 (41%), Positives = 18/29 (62%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEK 130
           I   EE+ EEE + +VKEE+  +   +EK
Sbjct: 9   IGTSEEDDEEEMDMDVKEEDDGDRRNREK 37


>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
           (TAF4) is one of several TAFs that bind TBP and is
           involved in forming Transcription Factor IID (TFIID)
           complex.  The TATA Binding Protein (TBP) Associated
           Factor 4 (TAF4) is one of several TAFs that bind TBP and
           are involved in forming the Transcription Factor IID
           (TFIID) complex. TFIID is one of seven General
           Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
           TFIIF, and TFIID) that are involved in accurate
           initiation of transcription by RNA polymerase II in
           eukaryote. TFIID plays an important role in the
           recognition of promoter DNA and assembly of the
           pre-initiation complex. TFIID complex is composed of the
           TBP and at least 13 TAFs. TAFs from various species were
           originally named by their predicted molecular weight or
           their electrophoretic mobility in polyacrylamide gels. A
           new, unified nomenclature for the pol II TAFs has been
           suggested to show the relationship between TAF orthologs
           and paralogs. Several hypotheses are proposed for TAFs
           functions such as serving as activator-binding sites,
           core-promoter recognition or a role in essential
           catalytic activity. Each TAF, with the help of a
           specific activator, is required only for the expression
           of subset of genes and is not universally involved for
           transcription as are GTFs. In yeast and human cells,
           TAFs have been found as components of other complexes
           besides TFIID.   Several TAFs interact via histone-fold
           (HFD) motifs; HFD is the interaction motif involved in
           heterodimerization of the core histones and their
           assembly into nucleosome octamers. The minimal HFD
           contains three alpha-helices linked by two loops and is
           found in core histones, TAFS and many other
           transcription factors. TFIID has a histone octamer-like
           substructure. TAF4 domain interacts with TAF12 and makes
           a novel histone-like heterodimer that binds DNA and has
           a core promoter function of a subset of genes.
          Length = 212

 Score = 28.8 bits (65), Expect = 0.91
 Identities = 17/49 (34%), Positives = 30/49 (61%), Gaps = 1/49 (2%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           +++EEEEK +EEE E      +   E+ + +++ KE M++EE EE   +
Sbjct: 122 LEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKE-MQKEEDEEMRHR 169



 Score = 26.9 bits (60), Expect = 4.1
 Identities = 13/38 (34%), Positives = 24/38 (63%)

Query: 108 EKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
           E+ E EEEE ++EE+ E   +  K   E+ +++++ KE
Sbjct: 120 EQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKE 157



 Score = 26.9 bits (60), Expect = 4.9
 Identities = 13/32 (40%), Positives = 20/32 (62%)

Query: 121 EKEEEEEKEKKEEEEKEKMEEEEKEEEETKYL 152
           E+ E EE+EK++EEE+E++    K   E   L
Sbjct: 120 EQLEREEEEKRDEEERERLLRAAKSRSEQSRL 151



 Score = 26.1 bits (58), Expect = 8.2
 Identities = 17/45 (37%), Positives = 29/45 (64%), Gaps = 1/45 (2%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEE-KEKMEEEEKEEEE 148
           E+ E+EEEE+ + +E E+     K + E+   K+K +E +KEE+E
Sbjct: 120 EQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDE 164


>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32.  This family
           consists of several Mycoplasma species specific
           Cytadhesin P32 and P30 proteins. P30 has been found to
           be membrane associated and localised on the tip
           organelle. It is thought that it is important in
           cytadherence and virulence.
          Length = 279

 Score = 29.3 bits (65), Expect = 0.91
 Identities = 12/45 (26%), Positives = 24/45 (53%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
           +KE+   EE+E +E   E+ +   E+ +++  E +  EE   +E 
Sbjct: 97  RKEKRLLEEKERQEQMAEQLQRISEQNEQQAIEIDPTEEVNTQEP 141


>gnl|CDD|240331 PTZ00254, PTZ00254, 40S ribosomal protein SA; Provisional.
          Length = 249

 Score = 29.3 bits (66), Expect = 0.91
 Identities = 12/45 (26%), Positives = 17/45 (37%), Gaps = 3/45 (6%)

Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           ++ EE EE+EE      E    E+      E +E  E       E
Sbjct: 205 RDPEEAEEKEEA---AAETAGVEDAAAAAAEAEEGEEWVAAANNE 246



 Score = 27.3 bits (61), Expect = 3.4
 Identities = 8/31 (25%), Positives = 12/31 (38%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKE 129
            +  +++EE   E    E       E EE E
Sbjct: 207 PEEAEEKEEAAAETAGVEDAAAAAAEAEEGE 237


>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
           production and conversion].
          Length = 660

 Score = 29.3 bits (66), Expect = 0.92
 Identities = 13/54 (24%), Positives = 21/54 (38%)

Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVH 157
                +   E EEV  E ++E E    + E   EK+ EE     E   + + + 
Sbjct: 224 DGGPSELISELEEVIAEIQDELESLRSELEALAEKIAEELLAVREILEIEKALG 277



 Score = 28.9 bits (65), Expect = 1.4
 Identities = 19/70 (27%), Positives = 29/70 (41%), Gaps = 1/70 (1%)

Query: 87  QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
                E   +  E     EE  K  E+     EE   + EE+  + +EE E +E+  +E 
Sbjct: 81  FILPEEVEKLEAELK-SLEEVIKPAEKFSSEVEELTRKLEERLSELDEELEDLEDLLEEL 139

Query: 147 EETKYLVRDV 156
           E   YL  D+
Sbjct: 140 EPLAYLDFDL 149


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 29.3 bits (66), Expect = 0.94
 Identities = 19/55 (34%), Positives = 29/55 (52%), Gaps = 2/55 (3%)

Query: 98  YEDNIKKEEEEKEEEEEEEVKE--EEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
                KKEEEE+EEE+E  +K      E EE++ + ++E+ E  EE+       K
Sbjct: 176 LFRREKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDDEEDNDNTPSPK 230



 Score = 26.6 bits (59), Expect = 6.7
 Identities = 18/49 (36%), Positives = 27/49 (55%), Gaps = 6/49 (12%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEE------KEKKEEEEKEKMEEEEKEEE 147
            E +K +EE+EE  E+E+EEE         E +  + K +ME  E+ EE
Sbjct: 108 YEADKLDEEQEERVEKEREEELAGDAMKKLENRTADSKREMEVLERLEE 156


>gnl|CDD|165442 PHA03171, PHA03171, UL37 tegument protein; Provisional.
          Length = 499

 Score = 29.3 bits (65), Expect = 0.98
 Identities = 18/56 (32%), Positives = 31/56 (55%), Gaps = 4/56 (7%)

Query: 83  VLDDQAKETEKA----GVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEE 134
           V+D +  ETE+       L+ +  ++ EEE  + E  + + EE++EEEE E  + E
Sbjct: 60  VIDGENSETERPRDLTAALFAEAGEEAEEEDNDRECPDTEAEEEDEEEEIEAPDPE 115



 Score = 28.1 bits (62), Expect = 2.5
 Identities = 15/35 (42%), Positives = 20/35 (57%)

Query: 108 EKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
           E  EE EEE  + E  + E +E+ EEEE E  + E
Sbjct: 81  EAGEEAEEEDNDRECPDTEAEEEDEEEEIEAPDPE 115



 Score = 27.3 bits (60), Expect = 4.7
 Identities = 13/30 (43%), Positives = 20/30 (66%)

Query: 116 EVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
           E  EE +EE+ ++E  + E +E+ EEEE E
Sbjct: 81  EAGEEAEEEDNDRECPDTEAEEEDEEEEIE 110



 Score = 26.6 bits (58), Expect = 8.3
 Identities = 14/35 (40%), Positives = 19/35 (54%)

Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEE 141
           E  EE EEE+   E  + E E+E +EEE +    E
Sbjct: 81  EAGEEAEEEDNDRECPDTEAEEEDEEEEIEAPDPE 115


>gnl|CDD|237047 PRK12298, obgE, GTPase CgtA; Reviewed.
          Length = 390

 Score = 29.1 bits (66), Expect = 0.99
 Identities = 13/50 (26%), Positives = 32/50 (64%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           E+N ++E EE E  E+ E   ++   E+ +E +EE++ +  ++ +++++E
Sbjct: 333 EENPREEAEEAEAPEKVEFMWDDYHREQLEEVEEEDDDDWDDDWDEDDDE 382



 Score = 26.4 bits (59), Expect = 9.1
 Identities = 11/51 (21%), Positives = 27/51 (52%)

Query: 98  YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           + +   +EE E+ E  E+     +    E+ E+ EEE+ +  +++  E+++
Sbjct: 331 FIEENPREEAEEAEAPEKVEFMWDDYHREQLEEVEEEDDDDWDDDWDEDDD 381


>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D.  This
           model represents one of two ATPase subunits of the
           trimeric magnesium chelatase responsible for insertion
           of magnesium ion into protoporphyrin IX. This is an
           essential step in the biosynthesis of both chlorophyll
           and bacteriochlorophyll. This subunit is found in green
           plants, photosynthetic algae, cyanobacteria and other
           photosynthetic bacteria. Unlike subunit I (TIGR02030),
           this subunit is not found in archaea [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Chlorophyll
           and bacteriochlorphyll].
          Length = 589

 Score = 29.4 bits (66), Expect = 1.00
 Identities = 10/43 (23%), Positives = 20/43 (46%)

Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
            E + +        E  E EEE ++ ++ + +  EE ++  EE
Sbjct: 267 PEPEPQPPPPPPPPEPPEPEEEPDEPDQTDPDDGEETDQIPEE 309



 Score = 27.8 bits (62), Expect = 3.2
 Identities = 11/48 (22%), Positives = 17/48 (35%), Gaps = 3/48 (6%)

Query: 105 EEEEKEE---EEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
           E +        E  E +EE  E ++      EE  +  EE   +  E 
Sbjct: 270 EPQPPPPPPPPEPPEPEEEPDEPDQTDPDDGEETDQIPEELMFDAVEA 317


>gnl|CDD|172358 PRK13831, PRK13831, conjugal transfer protein TrbI; Provisional.
          Length = 432

 Score = 29.3 bits (66), Expect = 1.0
 Identities = 10/43 (23%), Positives = 23/43 (53%), Gaps = 3/43 (6%)

Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKK---EEEEKEKMEEEEKEE 146
           E ++EE+       +  + EE+      EEE + +++ E++E+
Sbjct: 97  ETQQEEKAVNPFTPQPGQREERRPTLESEEEWRARLKREQEEQ 139


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 29.1 bits (65), Expect = 1.0
 Identities = 21/45 (46%), Positives = 33/45 (73%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
           KK+E+ KEE ++ + KEE KE+   KEK++E+EK+  E  ++EEE
Sbjct: 119 KKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEE 163



 Score = 27.5 bits (61), Expect = 3.4
 Identities = 17/65 (26%), Positives = 38/65 (58%)

Query: 86  DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
            + ++    G   +    KE + +  +EEE+ KE+ KEE+++K++K +EE +  + +E+ 
Sbjct: 78  KRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEA 137

Query: 146 EEETK 150
           +E+  
Sbjct: 138 KEKRP 142



 Score = 26.4 bits (58), Expect = 9.8
 Identities = 20/63 (31%), Positives = 38/63 (60%)

Query: 85  DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
           D+  K  EK G        K  +E + E  +E ++E+++ +EEK+KK+E+ KE+ ++ + 
Sbjct: 74  DEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKP 133

Query: 145 EEE 147
           +EE
Sbjct: 134 KEE 136


>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 187 and 201 amino
           acids in length. There is a single completely conserved
           residue Q that may be functionally important.
          Length = 158

 Score = 28.7 bits (65), Expect = 1.0
 Identities = 15/64 (23%), Positives = 29/64 (45%), Gaps = 5/64 (7%)

Query: 87  QAKETEKAGVLYE---DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
           QAK + K   L E   D ++ E      E + ++   +KE+   + + +E E +  E E 
Sbjct: 45  QAK-SIK-QHLAELAADLLELEAAAPRAELQAKIARYKKEKARYRSEAKELEAKAKEAEA 102

Query: 144 KEEE 147
           + + 
Sbjct: 103 ESDH 106



 Score = 27.2 bits (61), Expect = 3.4
 Identities = 8/41 (19%), Positives = 18/41 (43%), Gaps = 2/41 (4%)

Query: 111 EEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
           + +    K+E+     E ++ E + KE   E E +    ++
Sbjct: 73  QAKIARYKKEKARYRSEAKELEAKAKEA--EAESDHALHQH 111


>gnl|CDD|237431 PRK13570, PRK13570, anthranilate synthase component I; Provisional.
          Length = 455

 Score = 29.2 bits (66), Expect = 1.1
 Identities = 16/62 (25%), Positives = 27/62 (43%), Gaps = 2/62 (3%)

Query: 96  VLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE--EETKYLV 153
             Y D  ++E E+      EE+K+  + E E  E  +   K  + +EE     E+ K  +
Sbjct: 149 NRYSDRSEEELEKALNVVLEELKQPAEAEHELIELSKLSFKSNITKEEFCGMVEKAKEYI 208

Query: 154 RD 155
           R 
Sbjct: 209 RA 210


>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein.  Rotavirus particles
           consist of three concentric proteinaceous capsid layers.
           The innermost capsid (core) is made of VP2. The genomic
           RNA and the two minor proteins VP1 and VP3 are
           encapsidated within this layer. The N-terminus of
           rotavirus VP2 is necessary for the encapsidation of VP1
           and VP3.
          Length = 887

 Score = 29.1 bits (65), Expect = 1.1
 Identities = 10/54 (18%), Positives = 27/54 (50%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
             +   E      + ++E++ E++++K + E +EK   ++EE   +     V++
Sbjct: 2   AYRNRREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKE 55



 Score = 29.1 bits (65), Expect = 1.1
 Identities = 15/67 (22%), Positives = 33/67 (49%)

Query: 87  QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
             +  EK     +   + E +EK  +++EEV  +  +   +++  +E  K   E ++  +
Sbjct: 14  NDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKIADEVKKSTK 73

Query: 147 EETKYLV 153
           EE+K L+
Sbjct: 74  EESKQLL 80



 Score = 28.4 bits (63), Expect = 2.2
 Identities = 14/58 (24%), Positives = 31/58 (53%), Gaps = 1/58 (1%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEE-KEKKEEEEKEKMEEEEKEEEETKYLVRD 155
           E NI   +  +E+++E++ ++   E +E+  +KKEE   + ++   KE+   + L   
Sbjct: 8   EANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKIA 65


>gnl|CDD|184416 PRK13955, mscL, large-conductance mechanosensitive channel;
           Provisional.
          Length = 130

 Score = 28.2 bits (63), Expect = 1.1
 Identities = 14/28 (50%), Positives = 17/28 (60%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKE 129
             K   +KEEE+EEE+ E  KEEE   E
Sbjct: 90  FNKLTSKKEEEKEEEIPEPTKEEELLGE 117



 Score = 27.5 bits (61), Expect = 2.3
 Identities = 11/20 (55%), Positives = 15/20 (75%)

Query: 128 KEKKEEEEKEKMEEEEKEEE 147
             KKEEE++E++ E  KEEE
Sbjct: 94  TSKKEEEKEEEIPEPTKEEE 113



 Score = 25.9 bits (57), Expect = 8.2
 Identities = 12/17 (70%), Positives = 13/17 (76%)

Query: 118 KEEEKEEEEEKEKKEEE 134
           KEEEKEEE  +  KEEE
Sbjct: 97  KEEEKEEEIPEPTKEEE 113



 Score = 25.9 bits (57), Expect = 8.2
 Identities = 13/35 (37%), Positives = 20/35 (57%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
           KKEEE++EE  E   +EE   E  +  K++   K+
Sbjct: 96  KKEEEKEEEIPEPTKEEELLGEIRDLLKQQNSSKD 130


>gnl|CDD|224217 COG1298, FlhA, Flagellar biosynthesis pathway, component FlhA [Cell
           motility and secretion / Intracellular trafficking and
           secretion].
          Length = 696

 Score = 29.1 bits (66), Expect = 1.2
 Identities = 12/31 (38%), Positives = 23/31 (74%)

Query: 118 KEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           K +E++ EE+++  EE +KE+ EEEE+  ++
Sbjct: 325 KRKEQQAEEKEKPAEEAKKEEEEEEEESVDD 355



 Score = 28.4 bits (64), Expect = 1.9
 Identities = 15/38 (39%), Positives = 24/38 (63%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKME 140
           +++ EEKE+  EE  KEEE+EEEE  +     +  ++E
Sbjct: 328 EQQAEEKEKPAEEAKKEEEEEEEESVDDVLLIDPIELE 365



 Score = 28.0 bits (63), Expect = 2.5
 Identities = 16/51 (31%), Positives = 29/51 (56%), Gaps = 4/51 (7%)

Query: 101 NIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
            + K +E++ EE+E+  +E +KEE    E++EEE  + +   +  E E  Y
Sbjct: 322 LLSKRKEQQAEEKEKPAEEAKKEE----EEEEEESVDDVLLIDPIELELGY 368


>gnl|CDD|218744 pfam05781, MRVI1, MRVI1 protein.  This family consists of mammalian
           MRVI1 proteins which are related to the
           lymphoid-restricted membrane protein (JAW1) and the IP3
           receptor associated cGMP kinase substrates A and B
           (IRAGA and IRAGB). The function of MRVI1 is unknown
           although mutations in the Mrvi1 gene induces myeloid
           leukaemia by altering the expression of a gene important
           for myeloid cell growth and/or differentiation so it has
           been speculated that Mrvi1 is a tumour suppressor gene.
           IRAG is very similar in sequence to MRVI1 and is an
           essential NO/cGKI-dependent regulator of IP3-induced
           calcium release. Activation of cGKI decreases
           IP3-stimulated elevations in intracellular calcium,
           induces smooth muscle relaxation and contributes to the
           antiproliferative and pro-apoptotic effects of NO/cGMP.
           Jaw1 is a member of a class of proteins with
           COOH-terminal hydrophobic membrane anchors and is
           structurally similar to proteins involved in vesicle
           targeting and fusion. This suggests that the function
           and/or the structure of the ER in lymphocytes may be
           modified by lymphoid-restricted resident ER proteins.
          Length = 538

 Score = 28.8 bits (64), Expect = 1.2
 Identities = 15/43 (34%), Positives = 28/43 (65%)

Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           EEEK+ ++ ++++E E EE  E+ +K    +E  E EE ++E+
Sbjct: 415 EEEKKTKKLQDLREPEGEEAVERTRKPSLSEEVAETEEWDKEQ 457


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.  This
           domain is found to the N-terminus of bacterial signal
           peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 28.3 bits (64), Expect = 1.2
 Identities = 15/45 (33%), Positives = 28/45 (62%)

Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           ++ KE  E   + ++E +  E+ EKK E+ K K E+++ ++EE K
Sbjct: 52  KDLKESLEAALLDKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPK 96


>gnl|CDD|236154 PRK08119, PRK08119, flagellar motor switch protein; Validated.
          Length = 382

 Score = 28.7 bits (65), Expect = 1.3
 Identities = 14/48 (29%), Positives = 24/48 (50%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            +EEEE+EE EEEE +     E    +     ++E+ +   + +E  K
Sbjct: 225 GEEEEEEEEVEEEEAQASPAAEPATAQAAPAPKQEQQQAPPQRQEPEK 272



 Score = 28.3 bits (64), Expect = 1.7
 Identities = 15/60 (25%), Positives = 30/60 (50%)

Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLESCEF 163
            EEEE+EEE EEE  +     E    +     K++ ++   + +E +   + V+++  +F
Sbjct: 225 GEEEEEEEEVEEEEAQASPAAEPATAQAAPAPKQEQQQAPPQRQEPEKEAQPVNVQPAQF 284


>gnl|CDD|223945 COG1013, PorB, Pyruvate:ferredoxin oxidoreductase and related
           2-oxoacid:ferredoxin oxidoreductases, beta subunit
           [Energy production and conversion].
          Length = 294

 Score = 28.8 bits (65), Expect = 1.3
 Identities = 14/68 (20%), Positives = 26/68 (38%), Gaps = 15/68 (22%)

Query: 96  VLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEE-----------EEKEKMEEEEK 144
            LY      E  + EEE+ +++        EE  + EE             ++  ++ ++
Sbjct: 230 PLYRY----EPGKAEEEKGKDIPIGIFYPVEEYLEYEERFKHLTKSNPELIQKLQKDVDE 285

Query: 145 EEEETKYL 152
             EE K L
Sbjct: 286 RWEELKRL 293


>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen.  This
           family consists of several Theileria P67 surface
           antigens. A stage specific surface antigen of Theileria
           parva, p67, is the basis for the development of an
           anti-sporozoite vaccine for the control of East Coast
           fever (ECF) in cattle. The antigen has been shown to
           contain five distinct linear peptide sequences
           recognised by sporozoite-neutralising murine monoclonal
           antibodies.
          Length = 727

 Score = 28.9 bits (64), Expect = 1.3
 Identities = 9/40 (22%), Positives = 22/40 (55%)

Query: 98  YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
            +DN ++ ++ K  + + E  +++ EEE+ K    ++ K 
Sbjct: 99  VQDNTEQNQDTKGSKTDSEEDDDDSEEEDNKSTSSKDGKG 138


>gnl|CDD|227084 COG4741, COG4741, Predicted secreted endonuclease distantly related
           to archaeal Holliday junction resolvase [Nucleotide
           transport and metabolism].
          Length = 175

 Score = 28.2 bits (63), Expect = 1.3
 Identities = 19/54 (35%), Positives = 30/54 (55%), Gaps = 2/54 (3%)

Query: 96  VLYEDNIKKEEEEKEEEEEEEVKEEEKEEE--EEKEKKEEEEKEKMEEEEKEEE 147
             Y  +++ + E K  E EE +++ E+E    E + +KEEE K K   E+K EE
Sbjct: 21  RAYIRSLQGKVESKARELEETLQKAERERLVNEAQARKEEEWKLKEWIEKKIEE 74



 Score = 28.2 bits (63), Expect = 1.5
 Identities = 16/39 (41%), Positives = 24/39 (61%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
           E+ ++K E E+   E +  KEEE + +E  EKK EE +E
Sbjct: 39  EETLQKAERERLVNEAQARKEEEWKLKEWIEKKIEEARE 77


>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
           mRNA processing and transport [Signal transduction
           mechanisms / RNA processing and modification].
          Length = 388

 Score = 28.7 bits (64), Expect = 1.4
 Identities = 19/72 (26%), Positives = 34/72 (47%)

Query: 81  LCVLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKME 140
           L    ++ KE    G  YED  +  E  +++E +   +E   EE E++ + +E   E++ 
Sbjct: 308 LERNGNRIKELADFGDYYEDIFEVVEVVEKQEGDVVTEESTDEESEDEVEIDESVIEEVA 367

Query: 141 EEEKEEEETKYL 152
           E E  E +   L
Sbjct: 368 EMELLEVQVDDL 379



 Score = 27.6 bits (61), Expect = 3.3
 Identities = 14/47 (29%), Positives = 23/47 (48%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
           D + +E  ++E E+E E+ E   EE  E E  E +  +  E   + E
Sbjct: 341 DVVTEESTDEESEDEVEIDESVIEEVAEMELLEVQVDDLAERLAETE 387



 Score = 27.2 bits (60), Expect = 4.5
 Identities = 15/52 (28%), Positives = 24/52 (46%)

Query: 92  EKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
           EK           +EE ++E E +E   EE  E E  E + ++  E++ E E
Sbjct: 336 EKQEGDVVTEESTDEESEDEVEIDESVIEEVAEMELLEVQVDDLAERLAETE 387


>gnl|CDD|233990 TIGR02740, TraF-like, TraF-like protein.  This protein is related
           to the F-type conjugation system pilus assembly proteins
           TraF (TIGR02739)and TrbB (TIGR02738) both of which
           exhibit a thioredoxin fold. The protein represented by
           this model has the same length and architecture as TraF,
           but lacks the CXXC-motif found in TrbB and believed to
           be responsible for the disulfide isomerase activity of
           that protein.
          Length = 271

 Score = 28.5 bits (64), Expect = 1.5
 Identities = 14/53 (26%), Positives = 21/53 (39%)

Query: 89  KETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEE 141
           K+ ++    YED   KEEEE  E        ++   EE         +E + E
Sbjct: 36  KDEKRGWFWYEDPPPKEEEEPLEPAPPAPPAQQPAREELAPFSVAWLRENLPE 88


>gnl|CDD|236410 PRK09202, nusA, transcription elongation factor NusA; Validated.
          Length = 470

 Score = 28.7 bits (65), Expect = 1.5
 Identities = 15/64 (23%), Positives = 26/64 (40%)

Query: 86  DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
              +E     V     I+  +EE  EE  E  KE  + E   +E+K  ++   +E  ++E
Sbjct: 378 SSLEELAYVPVEELLEIEGFDEETVEELRERAKEALETEALAQEEKLADDLLSLEGLDRE 437

Query: 146 EEET 149
               
Sbjct: 438 LAFK 441



 Score = 27.1 bits (61), Expect = 4.7
 Identities = 21/63 (33%), Positives = 29/63 (46%), Gaps = 4/63 (6%)

Query: 94  AGVLYEDNIKKEEE----EKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
           A +L E+     EE      EE  E E  +EE  EE  +  KE  E E + +EEK  ++ 
Sbjct: 369 AQLLVEEGFSSLEELAYVPVEELLEIEGFDEETVEELRERAKEALETEALAQEEKLADDL 428

Query: 150 KYL 152
             L
Sbjct: 429 LSL 431



 Score = 26.4 bits (59), Expect = 7.9
 Identities = 8/47 (17%), Positives = 24/47 (51%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
           ++ ++ E   +EE+  +++   E  + E   K  E+  + +E+  ++
Sbjct: 410 KEALETEALAQEEKLADDLLSLEGLDRELAFKLAEKGIKTLEDLAEQ 456


>gnl|CDD|215094 PLN00188, PLN00188, enhanced disease resistance protein (EDR2);
           Provisional.
          Length = 719

 Score = 28.6 bits (64), Expect = 1.5
 Identities = 13/41 (31%), Positives = 25/41 (60%), Gaps = 3/41 (7%)

Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
           +E+ +++EE ++ E E+E E     K E +   MEEE +++
Sbjct: 442 DEDSDDDEEFQIPESEQEPE---TTKNETKDTAMEEEPQDK 479


>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB proteins similar to the
           monomeric yeast and human topo I.  Topo I enzymes are
           divided into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 28.5 bits (64), Expect = 1.5
 Identities = 15/39 (38%), Positives = 24/39 (61%)

Query: 121 EKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLE 159
           + ++EE+K   +EE+K    E+EK EEE  + + D H E
Sbjct: 95  KAQKEEKKAMSKEEKKAIKAEKEKLEEEYGFCILDGHKE 133


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 28.4 bits (64), Expect = 1.5
 Identities = 19/48 (39%), Positives = 31/48 (64%)

Query: 98  YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
            E ++K+  E +  EEE E+K++  E+  EKEK+ EE ++ + EE KE
Sbjct: 299 GEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKE 346



 Score = 28.1 bits (63), Expect = 2.2
 Identities = 19/71 (26%), Positives = 39/71 (54%), Gaps = 3/71 (4%)

Query: 86  DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE---KMEEE 142
            +         ++E  + +EE E +++  E+++E+EK  EE ++   EE KE   K+EE 
Sbjct: 295 LKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELNSKLEEI 354

Query: 143 EKEEEETKYLV 153
           +K+ E+ +  +
Sbjct: 355 QKKLEDLEKRL 365



 Score = 27.3 bits (61), Expect = 3.6
 Identities = 15/43 (34%), Positives = 27/43 (62%), Gaps = 2/43 (4%)

Query: 104 KEEEEKEEEEEEEVKEE--EKEEEEEKEKKEEEEKEKMEEEEK 144
           +E+E++ EE E+ + EE  E   + E+ +K+ E+ EK  E+ K
Sbjct: 327 REKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLEKRLEKLK 369


>gnl|CDD|191675 pfam07047, OPA3, Optic atrophy 3 protein (OPA3).  This family
           consists of several optic atrophy 3 (OPA3) proteins.
           OPA3 deficiency causes type III 3-methylglutaconic
           aciduria (MGA) in humans. This disease manifests with
           early bilateral optic atrophy, spasticity,
           extrapyramidal dysfunction, ataxia, and cognitive
           deficits, but normal longevity.
          Length = 134

 Score = 28.0 bits (63), Expect = 1.6
 Identities = 14/30 (46%), Positives = 19/30 (63%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKE 132
           KKEEE ++E EE E +  E E E E++K  
Sbjct: 105 KKEEELQQELEELEARVGELELEIERQKAR 134



 Score = 26.5 bits (59), Expect = 5.5
 Identities = 13/43 (30%), Positives = 24/43 (55%)

Query: 96  VLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
           ++YE   +  +E K+EEE ++  EE +    E E + E +K +
Sbjct: 92  LVYEYQRQSRKEAKKEEELQQELEELEARVGELELEIERQKAR 134


>gnl|CDD|237869 PRK14962, PRK14962, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 472

 Score = 28.6 bits (64), Expect = 1.6
 Identities = 14/51 (27%), Positives = 31/51 (60%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYL 152
               +E   EE+ +    ++KE+++E+ K +EE++E +E E++ +E  + L
Sbjct: 336 SPNVQENDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRFKELMEEL 386


>gnl|CDD|237192 PRK12758, PRK12758, DNA topoisomerase IV subunit A; Provisional.
          Length = 869

 Score = 28.4 bits (64), Expect = 1.7
 Identities = 13/43 (30%), Positives = 21/43 (48%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
             I + E    EE EEE  EE +E + E  K E++  ++   +
Sbjct: 823 KTINELEPLPYEEPEEETAEEPEEVDPEDVKSEDDIDDEDTGQ 865


>gnl|CDD|240253 PTZ00068, PTZ00068, 60S ribosomal protein L13a; Provisional.
          Length = 202

 Score = 28.1 bits (63), Expect = 1.7
 Identities = 17/71 (23%), Positives = 30/71 (42%), Gaps = 4/71 (5%)

Query: 76  KPNGLLCVLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEE 135
           KP     VL D +      G  Y D + K EE+++E      K++ K  +  KE + ++ 
Sbjct: 129 KPERPYTVLGDLSAHV---GWKYADVVAKLEEKRKERAAAYYKKKVKLRKAWKEAR-KKA 184

Query: 136 KEKMEEEEKEE 146
             K+ +     
Sbjct: 185 LAKLPKAIVAV 195


>gnl|CDD|224739 COG1826, TatA, Sec-independent protein secretion pathway components
           [Intracellular trafficking and secretion].
          Length = 94

 Score = 27.3 bits (61), Expect = 1.7
 Identities = 13/49 (26%), Positives = 26/49 (53%), Gaps = 2/49 (4%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            KK   + + E +EE+K EE ++++++     E +   EE ++   E K
Sbjct: 39  FKKAASDVKNELDEELKLEELDDKKKELT--AELQATKEELDQLASELK 85


>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342).  This
           family of bacterial proteins has no known function. The
           proteins are in the region of 500-600 amino acid
           residues in length.
          Length = 450

 Score = 28.4 bits (64), Expect = 1.7
 Identities = 20/76 (26%), Positives = 40/76 (52%)

Query: 84  LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
           L+++ KE E+     +  +KK  ++   +   E +E+ ++  E KEK  EE +E  EE +
Sbjct: 339 LEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKEKLSEELEELEEELK 398

Query: 144 KEEEETKYLVRDVHLE 159
           + +EE + L  +  + 
Sbjct: 399 ELKEELESLYSEGKIS 414


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 27.2 bits (61), Expect = 1.9
 Identities = 14/29 (48%), Positives = 18/29 (62%)

Query: 112 EEEEEVKEEEKEEEEEKEKKEEEEKEKME 140
                  EE+KEEEEE+E+KEE E+E   
Sbjct: 71  AAAAAAAEEKKEEEEEEEEKEESEEEAAA 99



 Score = 26.5 bits (59), Expect = 3.3
 Identities = 14/25 (56%), Positives = 17/25 (68%)

Query: 110 EEEEEEEVKEEEKEEEEEKEKKEEE 134
                 E K+EE+EEEEEKE+ EEE
Sbjct: 72  AAAAAAEEKKEEEEEEEEKEESEEE 96



 Score = 26.5 bits (59), Expect = 3.5
 Identities = 11/25 (44%), Positives = 17/25 (68%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKE 129
                 EE++EE +EEE++EE E+E
Sbjct: 72  AAAAAAEEKKEEEEEEEEKEESEEE 96



 Score = 26.1 bits (58), Expect = 4.4
 Identities = 12/24 (50%), Positives = 18/24 (75%)

Query: 119 EEEKEEEEEKEKKEEEEKEKMEEE 142
                EE+++E++EEEEKE+ EEE
Sbjct: 73  AAAAAEEKKEEEEEEEEKEESEEE 96



 Score = 26.1 bits (58), Expect = 4.8
 Identities = 11/25 (44%), Positives = 17/25 (68%)

Query: 119 EEEKEEEEEKEKKEEEEKEKMEEEE 143
                 EE+KE++EEEE+++  EEE
Sbjct: 72  AAAAAAEEKKEEEEEEEEKEESEEE 96



 Score = 26.1 bits (58), Expect = 5.3
 Identities = 14/22 (63%), Positives = 17/22 (77%)

Query: 104 KEEEEKEEEEEEEVKEEEKEEE 125
              EEK+EEEEEE ++EE EEE
Sbjct: 75  AAAEEKKEEEEEEEEKEESEEE 96



 Score = 25.3 bits (56), Expect = 9.0
 Identities = 11/25 (44%), Positives = 17/25 (68%)

Query: 111 EEEEEEVKEEEKEEEEEKEKKEEEE 135
                  +E+++EEEEE+EK+E EE
Sbjct: 71  AAAAAAAEEKKEEEEEEEEKEESEE 95



 Score = 25.3 bits (56), Expect = 9.4
 Identities = 9/26 (34%), Positives = 17/26 (65%)

Query: 110 EEEEEEEVKEEEKEEEEEKEKKEEEE 135
                   +++E+EEEEE++++ EEE
Sbjct: 71  AAAAAAAEEKKEEEEEEEEKEESEEE 96


>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812).  This is
           a family of fungal proteins whose function is not known.
          Length = 126

 Score = 27.6 bits (62), Expect = 1.9
 Identities = 12/36 (33%), Positives = 25/36 (69%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
           ++ E ++  +EE+++ EEE + + E+ K+ E EK+K
Sbjct: 91  ERAEAQRARDEEKKLDEEEAKRQHEEAKEREREKKK 126



 Score = 27.2 bits (61), Expect = 2.8
 Identities = 13/33 (39%), Positives = 20/33 (60%)

Query: 119 EEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
           E ++  +EEK+  EEE K + EE ++ E E K 
Sbjct: 94  EAQRARDEEKKLDEEEAKRQHEEAKEREREKKK 126


>gnl|CDD|224403 COG1486, CelF, Alpha-galactosidases/6-phospho-beta-glucosidases,
           family 4 of glycosyl hydrolases [Carbohydrate transport
           and metabolism].
          Length = 442

 Score = 28.4 bits (64), Expect = 2.1
 Identities = 14/50 (28%), Positives = 27/50 (54%), Gaps = 3/50 (6%)

Query: 97  LYEDNIKKEEEEKEEEEE---EEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
            Y  ++K+++ ++ E+      EV + EKE  E  +K E +EK +  E+ 
Sbjct: 262 YYPYDMKEKKPDEIEKFGTRANEVMKREKELFELYKKPELKEKPEELEKR 311


>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 925

 Score = 28.5 bits (64), Expect = 2.1
 Identities = 14/68 (20%), Positives = 36/68 (52%)

Query: 83  VLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
           +++ + +  +   +  E+ + + EE +   EE++  ++E + E  + ++ EEE + + E 
Sbjct: 811 IIESEEERLKALRIQREEMLMRPEELELINEEQKNLKQEIKLELSEIQEAEEEIQNINEN 870

Query: 143 EKEEEETK 150
           + E  E K
Sbjct: 871 KNEFVEFK 878



 Score = 27.7 bits (62), Expect = 3.6
 Identities = 15/74 (20%), Positives = 35/74 (47%), Gaps = 7/74 (9%)

Query: 84  LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK----- 138
           ++++ K  ++   L    I++ EEE +   E + +  E + + +K  K    K+      
Sbjct: 839 INEEQKNLKQEIKLELSEIQEAEEEIQNINENKNEFVEFKNDPKKLNKLIIAKDVLIKLV 898

Query: 139 --MEEEEKEEEETK 150
              +E +++E+ TK
Sbjct: 899 ISSDEIKQDEKTTK 912


>gnl|CDD|224143 COG1222, RPT1, ATP-dependent 26S proteasome regulatory subunit
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 406

 Score = 28.0 bits (63), Expect = 2.1
 Identities = 11/59 (18%), Positives = 28/59 (47%)

Query: 95  GVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLV 153
                 N  ++ + K  E+E+ +   E++  E +  + + E +++ EE +  +E   +V
Sbjct: 17  EPQEYLNKLEDTKLKLLEKEKRLLLLEEQRLEAEGLRLKREVDRLREEIERLKEPPLIV 75



 Score = 26.8 bits (60), Expect = 6.2
 Identities = 14/54 (25%), Positives = 23/54 (42%)

Query: 98  YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
             D I  + E  E +E     E+ K +  EKEK+    +E+  E E    + + 
Sbjct: 5   ALDEILGDLESYEPQEYLNKLEDTKLKLLEKEKRLLLLEEQRLEAEGLRLKREV 58


>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
           (DUF874).  This family consists of several hypothetical
           proteins specific to Helicobacter pylori. The function
           of this family is unknown.
          Length = 417

 Score = 28.3 bits (62), Expect = 2.2
 Identities = 18/49 (36%), Positives = 34/49 (69%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           I+ E+EE++ E+E++  E+E  E    + K E+EK+K E+E+++ E+ K
Sbjct: 149 IELEQEEQKTEQEKQKTEKEGIELANSQIKAEQEKQKTEQEKQKTEQEK 197


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 28.2 bits (63), Expect = 2.2
 Identities = 22/66 (33%), Positives = 41/66 (62%), Gaps = 3/66 (4%)

Query: 90  ETEKAGVLYEDNIKKEEEEKEEEE--EEEVKEEEKEEEE-EKEKKEEEEKEKMEEEEKEE 146
             E+  VL E     E+E +++ E  E+ +KE+EK ++E E+E +E +E+E+ ++ E E+
Sbjct: 509 FKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEK 568

Query: 147 EETKYL 152
           E  + L
Sbjct: 569 EAQEAL 574



 Score = 26.3 bits (58), Expect = 9.8
 Identities = 21/63 (33%), Positives = 35/63 (55%), Gaps = 11/63 (17%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEE-----------EKEKKEEEEKEKMEEEEKEEEETKY 151
           +KE E+K E  E+ +KE+EK ++E           E+ KK E EKE  E  +  ++E + 
Sbjct: 524 EKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKEVES 583

Query: 152 LVR 154
           ++R
Sbjct: 584 IIR 586



 Score = 26.3 bits (58), Expect = 10.0
 Identities = 23/74 (31%), Positives = 37/74 (50%), Gaps = 3/74 (4%)

Query: 84  LDDQAKETEKAGVLYEDNIKKEEE---EKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKME 140
           L+   KE EK     E  +++ +E    K+ E E+E +E  K  ++E E    E KEK  
Sbjct: 534 LEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKEVESIIRELKEKKI 593

Query: 141 EEEKEEEETKYLVR 154
            + KE +  + LV+
Sbjct: 594 HKAKEIKSIEDLVK 607


>gnl|CDD|223616 COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ
           chaperones [Posttranslational modification, protein
           turnover, chaperones].
          Length = 786

 Score = 28.4 bits (64), Expect = 2.2
 Identities = 12/49 (24%), Positives = 31/49 (63%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           D +++E  + E E+E   +E++++E++  ++  + ++ ++ E EKE E 
Sbjct: 404 DELERELAQLEIEKEALEREQDEKEKKLIDEIIKLKEGRIPELEKELEA 452


>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
           component YidC; Validated.
          Length = 429

 Score = 28.3 bits (63), Expect = 2.2
 Identities = 7/53 (13%), Positives = 19/53 (35%), Gaps = 1/53 (1%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEK-KEEEEKEKMEEEEKEEEETK 150
               +  E+ + +  ++E+ ++ +  E E  +   +E    M          K
Sbjct: 332 IKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVK 384



 Score = 27.9 bits (62), Expect = 2.9
 Identities = 12/63 (19%), Positives = 23/63 (36%), Gaps = 7/63 (11%)

Query: 87  QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
             K   KA        KKE  +K    E E+  E ++E      +    +  ++ ++K  
Sbjct: 338 AEKNEAKAR-------KKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKKKGL 390

Query: 147 EET 149
            + 
Sbjct: 391 IDA 393


>gnl|CDD|236335 PRK08724, fliD, flagellar capping protein; Validated.
          Length = 673

 Score = 28.3 bits (63), Expect = 2.2
 Identities = 12/47 (25%), Positives = 26/47 (55%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
            + E E+K  +E+ ++    ++ E   E+ ++  + K+E EE+E  E
Sbjct: 364 AQAEIEQKLAQEKAQLDAAVEKGELTPEQAKQIARAKLEPEERERLE 410


>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987).  A
           family of uncharacterized proteins found by clustering
           human gut metagenomic sequences.
          Length = 379

 Score = 28.0 bits (63), Expect = 2.3
 Identities = 20/44 (45%), Positives = 27/44 (61%)

Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           EE EEE  EE +EE KE E EKE  E E+K   ++ +K  ++ K
Sbjct: 64  EEIEEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGK 107



 Score = 27.2 bits (61), Expect = 4.2
 Identities = 21/53 (39%), Positives = 32/53 (60%), Gaps = 6/53 (11%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKE------EEEEKEKKEEEEKEKMEEEEKEEEE 148
           +++ EEE  EE EEE+KE E E      E++  EKK ++  +K ++EE   EE
Sbjct: 63  LEEIEEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEE 115



 Score = 27.2 bits (61), Expect = 4.3
 Identities = 18/53 (33%), Positives = 29/53 (54%)

Query: 98  YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           YE+ +K+ E EKE  E E+   E+K ++  K+ K+EE   +   E + EE   
Sbjct: 74  YEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEEPEP 126


>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
           represents the eukaryotic large ribosomal protein P1.
           Eukaryotic P1 and P2 are functionally equivalent to the
           bacterial protein L7/L12, but are not homologous to
           L7/L12. P1 is located in the L12 stalk, with proteins
           P2, P0, L11, and 28S rRNA. P1 and P2 are the only
           proteins in the ribosome to occur as multimers, always
           appearing as sets of heterodimers. Recent data indicate
           that eukaryotes have four copies (two heterodimers),
           while most archaeal species contain six copies of L12p
           (three homodimers) and bacteria may have four or six
           copies (two or three homodimers), depending on the
           species. Experiments using S. cerevisiae P1 and P2
           indicate that P1 proteins are positioned more internally
           with limited reactivity in the C-terminal domains, while
           P2 proteins seem to be more externally located and are
           more likely to interact with other cellular components.
           In lower eukaryotes, P1 and P2 are further subdivided
           into P1A, P1B, P2A, and P2B, which form P1A/P2B and
           P1B/P2A heterodimers. Some plant species have a third
           P-protein, called P3, which is not homologous to P1 and
           P2. In humans, P1 and P2 are strongly autoimmunogenic.
           They play a significant role in the etiology and
           pathogenesis of systemic lupus erythema (SLE). In
           addition, the ribosome-inactivating protein
           trichosanthin (TCS) interacts with human P0, P1, and P2,
           with its primary binding site located in the C-terminal
           region of P2. TCS inactivates the ribosome by
           depurinating a specific adenine in the sarcin-ricin loop
           of 28S rRNA.
          Length = 103

 Score = 26.9 bits (60), Expect = 2.3
 Identities = 9/18 (50%), Positives = 14/18 (77%)

Query: 110 EEEEEEEVKEEEKEEEEE 127
            E ++EE KEEE+EE ++
Sbjct: 79  AEAKKEEKKEEEEEESDD 96



 Score = 25.4 bits (56), Expect = 9.3
 Identities = 8/23 (34%), Positives = 16/23 (69%)

Query: 120 EEKEEEEEKEKKEEEEKEKMEEE 142
                E +KE+K+EEE+E+ +++
Sbjct: 75  AAAAAEAKKEEKKEEEEEESDDD 97



 Score = 25.4 bits (56), Expect = 9.3
 Identities = 9/24 (37%), Positives = 15/24 (62%)

Query: 116 EVKEEEKEEEEEKEKKEEEEKEKM 139
                E ++EE+KE++EEE  + M
Sbjct: 75  AAAAAEAKKEEKKEEEEEESDDDM 98



 Score = 25.4 bits (56), Expect = 9.7
 Identities = 8/21 (38%), Positives = 13/21 (61%)

Query: 107 EEKEEEEEEEVKEEEKEEEEE 127
                E ++E K+EE+EEE +
Sbjct: 75  AAAAAEAKKEEKKEEEEEESD 95


>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
          Length = 393

 Score = 28.1 bits (63), Expect = 2.3
 Identities = 17/42 (40%), Positives = 23/42 (54%)

Query: 116 EVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVH 157
           E+K  E E+  E+ ++ EEE EK  EE  +EE  K   R V 
Sbjct: 93  ELKSGELEKRRERREEAEEELEKAIEEGDDEEIKKQSKRTVR 134


>gnl|CDD|215059 PLN00111, PLN00111, accumulation of photosystem one; Provisional.
          Length = 399

 Score = 28.1 bits (63), Expect = 2.5
 Identities = 11/46 (23%), Positives = 17/46 (36%)

Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYL 152
             +  + E+E +E +   E        E  +   E    EEE K L
Sbjct: 221 GGRIVDFEDESEEPDPPPEGPSSPLLTELDDSEIEAPSSEEELKEL 266


>gnl|CDD|216337 pfam01159, Ribosomal_L6e, Ribosomal protein L6e. 
          Length = 108

 Score = 26.8 bits (60), Expect = 2.5
 Identities = 11/35 (31%), Positives = 26/35 (74%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
           K+E+++K+++ E E   E+KE++E  E+++ ++K 
Sbjct: 38  KREKKKKKKKSEGEFFAEKKEKKEVSEQRKADQKA 72


>gnl|CDD|237147 PRK12586, PRK12586, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 145

 Score = 27.4 bits (61), Expect = 2.5
 Identities = 8/41 (19%), Positives = 17/41 (41%)

Query: 88  AKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEK 128
           A    K       +I     E+   E  +++ +++EE  +K
Sbjct: 100 AYMYRKNDAHTHASILLSSNEQNSTEALQLRAKKREEHRKK 140



 Score = 26.3 bits (58), Expect = 5.4
 Identities = 8/44 (18%), Positives = 17/44 (38%), Gaps = 5/44 (11%)

Query: 97  LYEDNIKKEE-----EEKEEEEEEEVKEEEKEEEEEKEKKEEEE 135
           +Y  N             E+   E ++   K+ EE ++K  + +
Sbjct: 102 MYRKNDAHTHASILLSSNEQNSTEALQLRAKKREEHRKKWYQND 145


>gnl|CDD|221203 pfam11748, DUF3306, Protein of unknown function (DUF3306).  This
           family of proteobacterial species proteins has no known
           function.
          Length = 115

 Score = 26.9 bits (60), Expect = 2.5
 Identities = 13/46 (28%), Positives = 17/46 (36%), Gaps = 4/46 (8%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           +K     EE  E     EE+              E+ EE E E+EE
Sbjct: 9   RKLAVRAEEPAEPAETAEEEAAAAA----PAPAPEEEEEAELEDEE 50



 Score = 26.5 bits (59), Expect = 4.2
 Identities = 13/55 (23%), Positives = 23/55 (41%), Gaps = 3/55 (5%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKM---EEEEKEEEETKYLVRDV 156
           E  E+E          E+EEE E E +E  E+  +   +      + + +L + V
Sbjct: 23  ETAEEEAAAAAPAPAPEEEEEAELEDEELLEELDLPDPDTLTPGSDFSAFLAKGV 77


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 27.8 bits (62), Expect = 2.6
 Identities = 18/50 (36%), Positives = 31/50 (62%)

Query: 98  YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
              + K E  E++  +E E KEE+K +EE+ E +E E++E +E+ E+  E
Sbjct: 4   ECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNE 53



 Score = 26.3 bits (58), Expect = 7.1
 Identities = 17/47 (36%), Positives = 31/47 (65%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
            K+ ++ K E  EE+  +E + +EE+K K+E+ E E++E+EE  E+ 
Sbjct: 2   EKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDS 48



 Score = 26.3 bits (58), Expect = 7.4
 Identities = 18/65 (27%), Positives = 36/65 (55%)

Query: 86  DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
            + +  +      E++  KE E KEE++ +E   E +E E+E+  ++ EE  +++ EE +
Sbjct: 1   MEKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELK 60

Query: 146 EEETK 150
           +E  K
Sbjct: 61  DENNK 65


>gnl|CDD|218556 pfam05327, RRN3, RNA polymerase I specific transcription initiation
           factor RRN3.  This family consists of several eukaryotic
           proteins which are homologous to the yeast RRN3 protein.
           RRN3 is one of the RRN genes specifically required for
           the transcription of rDNA by RNA polymerase I (Pol I) in
           Saccharomyces cerevisiae.
          Length = 554

 Score = 28.0 bits (63), Expect = 2.6
 Identities = 10/52 (19%), Positives = 30/52 (57%), Gaps = 1/52 (1%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           E   + ++ + +EEEE  + +E+ ++E++    +++++E+ + E +     K
Sbjct: 217 EIQNELDDID-DEEEERVLADEDDDDEDDMFDMDDDDEEESDPEVERTSTIK 267


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 28.1 bits (63), Expect = 2.6
 Identities = 18/61 (29%), Positives = 34/61 (55%)

Query: 84  LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
           L +   + E +G+L    +++ E  K+EE + E++E  +E E E+E  EEE +E  ++  
Sbjct: 361 LKEGEDDEENSGLLSMKFMQRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNV 420

Query: 144 K 144
            
Sbjct: 421 G 421



 Score = 26.9 bits (60), Expect = 5.6
 Identities = 22/66 (33%), Positives = 38/66 (57%)

Query: 88  AKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
             E++K     ++  K+++E  EEEE E+ +E + E+   K  K  E+ +K EEEE+ +E
Sbjct: 433 EAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDE 492

Query: 148 ETKYLV 153
           E  +L 
Sbjct: 493 ENPWLK 498



 Score = 26.2 bits (58), Expect = 9.9
 Identities = 18/70 (25%), Positives = 32/70 (45%)

Query: 87  QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
           + K  +      ++        K  +  E  K+EE + E E+ ++E E +E+ +EEE EE
Sbjct: 355 RKKLGKLKEGEDDEENSGLLSMKFMQRAEARKKEENDAEIEELRRELEGEEESDEEENEE 414

Query: 147 EETKYLVRDV 156
              K + R  
Sbjct: 415 PSKKNVGRRK 424


>gnl|CDD|218055 pfam04386, SspB, Stringent starvation protein B.  Escherichia coli
           stringent starvation protein B (SspB), is thought to
           enhance the specificity of degradation of tmRNA-tagged
           proteins by the ClpXP protease. The tmRNA tag, also
           known as ssrA, is an 11-aa peptide added to the C
           terminus of proteins stalled during translation, targets
           proteins for degradation by ClpXP and ClpAP. SspB a
           cytoplasmic protein that specifically binds to residues
           1-4 and 7 of the tag. Binding of SspB enhances
           degradation of tagged proteins by ClpX, and masks
           sequence elements important for ClpA interactions,
           inhibiting degradation by ClpA. However, more recent
           work has cast doubt on the importance of SspB in
           wild-type cells. SspB is encoded in an operon whose
           synthesis is stimulated by carbon, amino acid, and
           phosphate starvation. SspB may play a special role
           during nutrient stress, for example by ensuring rapid
           degradation of the products of stalled translation,
           without causing a global increase in degradation of all
           ClpXP substrates.
          Length = 153

 Score = 27.2 bits (61), Expect = 2.6
 Identities = 13/34 (38%), Positives = 20/34 (58%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
           E EE +EEEEEE  + +   E+E  + ++  K K
Sbjct: 111 EPEEADEEEEEEDDDADDSPEDEDPEPKDPPKTK 144


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
            biogenesis [Translation, ribosomal structure and
            biogenesis].
          Length = 1077

 Score = 28.2 bits (62), Expect = 2.6
 Identities = 12/40 (30%), Positives = 27/40 (67%)

Query: 106  EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
            E  K++EE+E ++  ++ +EEE  KKE+E ++++ +   +
Sbjct: 1024 ERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHD 1063



 Score = 27.8 bits (61), Expect = 3.7
 Identities = 13/60 (21%), Positives = 34/60 (56%)

Query: 91   TEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            + +  +      +++ E K+   +E +K++E++E  E  ++ +EE+   +E+E+E+   K
Sbjct: 1000 SRRIELPVPPECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRK 1059



 Score = 27.4 bits (60), Expect = 4.8
 Identities = 20/51 (39%), Positives = 31/51 (60%), Gaps = 1/51 (1%)

Query: 105  EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
            E  EK E ++  VKE  K++EE KE+ E  ++ K EE  K+E+E +  +R 
Sbjct: 1010 ECREKHEIKDRIVKERIKDQEE-KERMESLQRAKEEEIGKKEKEREQRIRK 1059



 Score = 27.0 bits (59), Expect = 5.2
 Identities = 11/40 (27%), Positives = 24/40 (60%)

Query: 107  EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
            E  +++EE+E  E  +  +EE+  K+E+E+E+   +   +
Sbjct: 1024 ERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHD 1063



 Score = 26.6 bits (58), Expect = 9.2
 Identities = 19/66 (28%), Positives = 29/66 (43%), Gaps = 3/66 (4%)

Query: 85  DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
           D + +E      + E        E EE  +E   E E+EE     +K+EE +   E EE+
Sbjct: 606 DLEDEENSSDNEMEESRGSSVTAENEESADEVDYETEREE---NARKKEELRGNFELEER 662

Query: 145 EEEETK 150
            + E K
Sbjct: 663 GDPEKK 668


>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
           Members of this family contain a region found
           exclusively in eukaryotic sodium channels or their
           subunits, many of which are voltage-gated. Members very
           often also contain between one and four copies of
           pfam00520 and, less often, one copy of pfam00612.
          Length = 230

 Score = 27.7 bits (62), Expect = 2.6
 Identities = 17/50 (34%), Positives = 27/50 (54%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           E++ K   E  + EE++    E    + E  ++EEEE  + EEE KE E+
Sbjct: 132 EESKKGSAETLKLEEKDSSSSEGSTVDLEPPEEEEEEIAEEEEEVKEPED 181


>gnl|CDD|235604 PRK05778, PRK05778, 2-oxoglutarate ferredoxin oxidoreductase
           subunit beta; Validated.
          Length = 301

 Score = 27.9 bits (63), Expect = 2.7
 Identities = 12/55 (21%), Positives = 20/55 (36%), Gaps = 5/55 (9%)

Query: 84  LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
           L+++       GV Y     K E    EE  E++ E   E      +  +E  + 
Sbjct: 250 LEEELGGKIPIGVFY-----KNERPTFEERLEKLIEPLLELPPAALRPGKEALDT 299



 Score = 27.1 bits (61), Expect = 3.8
 Identities = 12/49 (24%), Positives = 20/49 (40%), Gaps = 1/49 (2%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           +   K      E  ++ V +  K EE+      ++  EKM EEE   + 
Sbjct: 211 NTSTKSPAYMREYYKKRVYKL-KLEEDYDPTDRDKAAEKMLEEELGGKI 258


>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
           ribosomal biogenesis [Translation, ribosomal structure
           and biogenesis].
          Length = 821

 Score = 28.1 bits (62), Expect = 2.8
 Identities = 21/71 (29%), Positives = 33/71 (46%), Gaps = 4/71 (5%)

Query: 84  LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEK----KEEEEKEKM 139
             D   + E      +D   K E  +E ++EE + E     + E++      E  E+++ 
Sbjct: 717 FSDSTSDDEPKLDAIDDEDAKSEGSQESDQEEGLDEIFYSFDGEQDNSDSFAESSEEDES 776

Query: 140 EEEEKEEEETK 150
            EEEKEEEE K
Sbjct: 777 SEEEKEEEENK 787



 Score = 26.5 bits (58), Expect = 9.5
 Identities = 16/29 (55%), Positives = 19/29 (65%)

Query: 110 EEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
           E  EE+E  EEEKEEEE KE   +  K+K
Sbjct: 769 ESSEEDESSEEEKEEEENKEVSAKRAKKK 797


>gnl|CDD|221733 pfam12720, DUF3807, Protein of unknown function (DUF3807).  This is
           a family of conserved fungal proteins of unknown
           function.
          Length = 169

 Score = 27.4 bits (61), Expect = 2.8
 Identities = 13/61 (21%), Positives = 28/61 (45%)

Query: 90  ETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
            +E   +L E  +K+E E +EE E +   +      E    ++E E++   ++ K   + 
Sbjct: 59  HSEIHALLRERELKEEAEAEEEGEVDASPDAGAVAGESSADRKEAEQQGAAQKRKSCRDK 118

Query: 150 K 150
           +
Sbjct: 119 E 119


>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1.  All
           proteins in this family for which functions are known
           are cyclin dependent protein kinases that are components
           of TFIIH, a complex that is involved in nucleotide
           excision repair and transcription initiation. Also known
           as MAT1 (menage a trois 1). This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 309

 Score = 27.8 bits (62), Expect = 2.8
 Identities = 20/46 (43%), Positives = 31/46 (67%), Gaps = 3/46 (6%)

Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEK---EKKEEEEKEKMEEEEKEE 146
           KE+  +E+EE EE  E EKEEEE++    +KEEEE++  + + K+ 
Sbjct: 137 KEKSTREQEELEEALEFEKEEEEQRRLLLQKEEEEQQMNKRKNKQA 182


>gnl|CDD|116948 pfam08367, M16C_assoc, Peptidase M16C associated.  This domain
           appears in eukaryotes as well as bacteria and tends to
           be found near the C-terminus of the metalloprotease M16C
           (pfam05193).
          Length = 248

 Score = 27.6 bits (62), Expect = 2.9
 Identities = 16/47 (34%), Positives = 26/47 (55%), Gaps = 1/47 (2%)

Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKE-EEEKEKMEEEEKEEEETKYL 152
           EEK  EEE E  E++K    E++K++  E   ++EE +   E+   L
Sbjct: 8   EEKLNEEEREKLEKKKSSLSEEDKEKIIERGLELEELQSTPEDLSCL 54



 Score = 26.0 bits (58), Expect = 9.3
 Identities = 16/40 (40%), Positives = 24/40 (60%)

Query: 109 KEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
            E  EE+  +EE ++ E++K    EE+KEK+ E   E EE
Sbjct: 4   DETLEEKLNEEEREKLEKKKSSLSEEDKEKIIERGLELEE 43


>gnl|CDD|215283 PLN02518, PLN02518, pheophorbide a oxygenase.
          Length = 539

 Score = 27.9 bits (62), Expect = 2.9
 Identities = 8/26 (30%), Positives = 14/26 (53%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEK 130
           E   +++E EE+  E+E  +E    K
Sbjct: 59  EAALQQDEGEEQRVEQELGQESSDSK 84


>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
          Length = 1068

 Score = 28.1 bits (63), Expect = 2.9
 Identities = 13/58 (22%), Positives = 27/58 (46%), Gaps = 9/58 (15%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEE----EEEK-----EKKEEEEKEKMEEEEKEEEE 148
           ++ + E  EK   ++E+ +   +E      +EK     E K    +E+  +E ++EE 
Sbjct: 655 ESQQAEVTEKARTQDEQQQAPRRERQRRRNDEKRQAQQEAKALNVEEQSVQETEQEER 712



 Score = 27.3 bits (61), Expect = 4.9
 Identities = 9/49 (18%), Positives = 29/49 (59%)

Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
           E +E ++ E  ++   ++E+++  + E ++ + +E+ + ++E K L  +
Sbjct: 652 ETRESQQAEVTEKARTQDEQQQAPRRERQRRRNDEKRQAQQEAKALNVE 700


>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain.  DBINO is a DNA-binding
           domain found on global transcription activator SNF2L1
           proteins and chromatin re-modelling proteins.
          Length = 140

 Score = 27.2 bits (61), Expect = 2.9
 Identities = 15/28 (53%), Positives = 18/28 (64%), Gaps = 1/28 (3%)

Query: 110 EEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
           E+EE E  K  EKE  E+  KKEEE +E
Sbjct: 82  EKEERELRKRAEKEALEQA-KKEEELRE 108



 Score = 26.1 bits (58), Expect = 6.6
 Identities = 13/27 (48%), Positives = 20/27 (74%)

Query: 122 KEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           K E+EE+E ++  EKE +E+ +KEEE 
Sbjct: 80  KNEKEERELRKRAEKEALEQAKKEEEL 106


>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region.  The myc family
           belongs to the basic helix-loop-helix leucine zipper
           class of transcription factors, see pfam00010. Myc forms
           a heterodimer with Max, and this complex regulates cell
           growth through direct activation of genes involved in
           cell replication. Mutations in the C-terminal 20
           residues of this domain cause unique changes in the
           induction of apoptosis, transformation, and G2 arrest.
          Length = 329

 Score = 27.6 bits (61), Expect = 2.9
 Identities = 16/42 (38%), Positives = 21/42 (50%), Gaps = 6/42 (14%)

Query: 106 EEEKEEEEEEEVKEEEKEEEE------EKEKKEEEEKEKMEE 141
             + E EE+EE +EEE+EEEE      EK +     K    E
Sbjct: 223 GSDSESEEDEEEEEEEEEEEEIDVVTVEKRRSSSNRKASTSE 264



 Score = 26.4 bits (58), Expect = 7.2
 Identities = 14/34 (41%), Positives = 18/34 (52%), Gaps = 4/34 (11%)

Query: 105 EEEEKEEEEEEEVKEEEKE----EEEEKEKKEEE 134
           E EE EEEEEEE +EEE +    E+       + 
Sbjct: 227 ESEEDEEEEEEEEEEEEIDVVTVEKRRSSSNRKA 260


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
           represents the L12P protein of the large (50S) subunit
           of the archaeal ribosome.
          Length = 105

 Score = 26.9 bits (60), Expect = 2.9
 Identities = 14/25 (56%), Positives = 18/25 (72%)

Query: 110 EEEEEEEVKEEEKEEEEEKEKKEEE 134
                EE +EEE+EEEEE+E+ EEE
Sbjct: 71  AAAAAEEEEEEEEEEEEEEEESEEE 95



 Score = 26.2 bits (58), Expect = 4.1
 Identities = 13/25 (52%), Positives = 18/25 (72%)

Query: 111 EEEEEEVKEEEKEEEEEKEKKEEEE 135
                E +EEE+EEEEE+E++ EEE
Sbjct: 71  AAAAAEEEEEEEEEEEEEEEESEEE 95



 Score = 26.2 bits (58), Expect = 4.3
 Identities = 14/28 (50%), Positives = 20/28 (71%)

Query: 113 EEEEVKEEEKEEEEEKEKKEEEEKEKME 140
                +EEE+EEEEE+E++EE E+E M 
Sbjct: 71  AAAAAEEEEEEEEEEEEEEEESEEEAMA 98



 Score = 25.8 bits (57), Expect = 6.4
 Identities = 13/25 (52%), Positives = 18/25 (72%)

Query: 119 EEEKEEEEEKEKKEEEEKEKMEEEE 143
                EEEE+E++EEEE+E+  EEE
Sbjct: 71  AAAAAEEEEEEEEEEEEEEEESEEE 95


>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
          Length = 663

 Score = 28.0 bits (62), Expect = 2.9
 Identities = 14/50 (28%), Positives = 27/50 (54%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYL 152
           +K   E +EE  E +K  +K +EE++   +E E    + E K+  E+++ 
Sbjct: 468 RKRGGELKEELIETLKLVKKLKEEQESLAKELEATAHKSEIKKIAESEFK 517


>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region.  The domain
           is found in the primary vegetative sigma factor. The
           function of this domain is unclear and can be removed
           without loss of function.
          Length = 211

 Score = 27.5 bits (62), Expect = 3.0
 Identities = 8/47 (17%), Positives = 24/47 (51%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
           +           E E + ++ E ++++++++ E++E+E     + EE
Sbjct: 33  NAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGPDPEE 79



 Score = 27.2 bits (61), Expect = 4.1
 Identities = 7/39 (17%), Positives = 20/39 (51%)

Query: 110 EEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
                       + E +E++ +++++ ++ E+E+ EEE 
Sbjct: 33  NAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEA 71



 Score = 26.0 bits (58), Expect = 9.1
 Identities = 9/49 (18%), Positives = 21/49 (42%)

Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVR 154
                      ++ E  EE+ E +  ++E++++ +EEE +        R
Sbjct: 33  NAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGPDPEEAR 81


>gnl|CDD|227504 COG5177, COG5177, Uncharacterized conserved protein [Function
           unknown].
          Length = 769

 Score = 27.7 bits (61), Expect = 3.1
 Identities = 16/49 (32%), Positives = 27/49 (55%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           DN +   +E+   ++ E  EE   EEEE++ +E  + EK + E  +E E
Sbjct: 422 DNEEVAGDEESAIDDNEGFEELSPEEEERQLREFRDMEKEDREFPDEAE 470


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
           Transcription of the anti-viral guanylate-binding
           protein (GBP) is induced by interferon-gamma during
           macrophage induction. This family contains GBP1 and
           GPB2, both GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 27.6 bits (62), Expect = 3.1
 Identities = 16/59 (27%), Positives = 31/59 (52%)

Query: 91  TEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
           T K   +  +  K E  E E+E   E ++EE++  E +E+  +E  +++ E+ + E E 
Sbjct: 200 TAKEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREK 258


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 27.7 bits (62), Expect = 3.2
 Identities = 21/78 (26%), Positives = 40/78 (51%)

Query: 83  VLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
           +L ++    E+     E  +++ E + +E  EE  + EEK EE ++E +  E + +  E 
Sbjct: 306 ILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEA 365

Query: 143 EKEEEETKYLVRDVHLES 160
           E EE E++    +  LE+
Sbjct: 366 ELEELESRLEELEEQLET 383



 Score = 26.6 bits (59), Expect = 7.9
 Identities = 20/69 (28%), Positives = 31/69 (44%), Gaps = 1/69 (1%)

Query: 83  VLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEE-EVKEEEKEEEEEKEKKEEEEKEKMEE 141
            L++Q +E  +        I++ EE  EE E E E    E+   EE       E E++ E
Sbjct: 842 DLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSE 901

Query: 142 EEKEEEETK 150
           E +E E  +
Sbjct: 902 ELRELESKR 910


>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
          Length = 246

 Score = 27.3 bits (61), Expect = 3.3
 Identities = 15/53 (28%), Positives = 24/53 (45%)

Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDV 156
            E E + E   E  KE      + + K + + K K +  +K EE+ K  V+ V
Sbjct: 72  VEPEPEPEPIPEPPKEAPVVIPKPEPKPKPKPKPKPKPVKKVEEQPKREVKPV 124


>gnl|CDD|234306 TIGR03674, fen_arch, flap structure-specific endonuclease.
           Endonuclease that cleaves the 5'-overhanging flap
           structure that is generated by displacement synthesis
           when DNA polymerase encounters the 5'-end of a
           downstream Okazaki fragment. Has 5'-endo-/exonuclease
           and 5'-pseudo-Y-endonuclease activities. Cleaves the
           junction between single and double-stranded regions of
           flap DNA.
          Length = 338

 Score = 27.6 bits (62), Expect = 3.4
 Identities = 17/45 (37%), Positives = 23/45 (51%)

Query: 116 EVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLES 160
           E+K E  EE  E  ++ EE+ E+  E+   EE  KY  R   L S
Sbjct: 85  ELKAETLEERREIREEAEEKWEEALEKGDLEEARKYAQRSSRLTS 129


>gnl|CDD|214360 CHL00094, dnaK, heat shock protein 70.
          Length = 621

 Score = 27.8 bits (62), Expect = 3.4
 Identities = 19/60 (31%), Positives = 30/60 (50%), Gaps = 13/60 (21%)

Query: 110 EEEEEEEVKEEEKEEEEEKEKKE-------------EEEKEKMEEEEKEEEETKYLVRDV 156
           ++E E  VKE EK   E+KEK+E             + EK+  E ++K  EE K  + ++
Sbjct: 504 KDEVERMVKEAEKNAAEDKEKREKIDLKNQAESLCYQAEKQLKELKDKISEEKKEKIENL 563


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 26.9 bits (60), Expect = 3.6
 Identities = 15/34 (44%), Positives = 24/34 (70%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKE 132
           ED    +EE++EEE++EE  +E++ EEEE   K+
Sbjct: 111 EDESDDDEEDEEEEDDEEDDDEDESEEEESPVKK 144



 Score = 26.9 bits (60), Expect = 4.0
 Identities = 12/32 (37%), Positives = 26/32 (81%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEK 136
            EE++ +++EE+ +EE+ EE++++++ EEEE 
Sbjct: 109 SEEDESDDDEEDEEEEDDEEDDDEDESEEEES 140



 Score = 26.5 bits (59), Expect = 4.5
 Identities = 14/34 (41%), Positives = 27/34 (79%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
           EE+E +++EE+E +E+++E+++E E +EEE   K
Sbjct: 110 EEDESDDDEEDEEEEDDEEDDDEDESEEEESPVK 143


>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO.  Bacillus
           spores are protected by a protein shell consisting of
           over 50 different polypeptides, known as the coat. This
           family of proteins has an important morphogenetic role
           in coat assembly, it is involved in the assembly of at
           least 5 different coat proteins including CotB, CotG,
           CotS, CotSA and CotW. It is likely to act at a late
           stage of coat assembly.
          Length = 185

 Score = 27.1 bits (60), Expect = 3.6
 Identities = 21/56 (37%), Positives = 36/56 (64%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVR 154
           E  I+ EE EKE+E  E  +E+  E++E++E  +EEEKE+  E+ K++E   +  +
Sbjct: 56  EQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEKEEEAEDVKQQEVFSFKRK 111



 Score = 26.7 bits (59), Expect = 5.6
 Identities = 24/63 (38%), Positives = 43/63 (68%)

Query: 88  AKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
           A E E+     ++++K +EEE++ E EE  KE+E  E E ++  E++EKE++ +EE++EE
Sbjct: 36  ADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEKEE 95

Query: 148 ETK 150
           E +
Sbjct: 96  EAE 98


>gnl|CDD|226115 COG3587, COG3587, Restriction endonuclease [Defense mechanisms].
          Length = 985

 Score = 27.4 bits (61), Expect = 3.6
 Identities = 16/54 (29%), Positives = 29/54 (53%)

Query: 109 KEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLESCE 162
           KE  +E  +K+   ++EEE  K   +E+ K+  E  E+     L++D +L+  E
Sbjct: 563 KEINDESFIKQGFAKDEEELSKLILKERGKVLPETLEKLLFGDLIKDDYLDLNE 616


>gnl|CDD|223649 COG0576, GrpE, Molecular chaperone GrpE (heat shock protein)
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 193

 Score = 27.3 bits (61), Expect = 3.7
 Identities = 22/49 (44%), Positives = 30/49 (61%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
           +   K EE + EE EE E  EEE+ EEEE E++ E E+E+ E  E E +
Sbjct: 3   DKEQKTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQQEIAELEAQ 51


>gnl|CDD|224849 COG1938, COG1938, Archaeal enzymes of ATP-grasp superfamily
           [General function prediction only].
          Length = 244

 Score = 27.3 bits (61), Expect = 3.7
 Identities = 13/38 (34%), Positives = 26/38 (68%)

Query: 115 EEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYL 152
           +++++E +E EE+ EK  E+ +++ E  E+EEE   Y+
Sbjct: 207 DKLEKEAEEIEEQLEKLAEQLEKEEERVEREEEPPMYM 244



 Score = 26.1 bits (58), Expect = 8.8
 Identities = 18/46 (39%), Positives = 27/46 (58%), Gaps = 3/46 (6%)

Query: 97  LYEDNIKKEEEEKEEEEEEEVKE---EEKEEEEEKEKKEEEEKEKM 139
           +   N+  ++ EKE EE EE  E   E+ E+EEE+ ++EEE    M
Sbjct: 199 MLGLNVDTDKLEKEAEEIEEQLEKLAEQLEKEEERVEREEEPPMYM 244


>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 257 to 277 amino acids in length. This domain is
           found associated with pfam00004. This domain has a
           conserved LER sequence motif.
          Length = 276

 Score = 27.4 bits (61), Expect = 3.7
 Identities = 18/76 (23%), Positives = 40/76 (52%)

Query: 84  LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
           L+ Q ++ E+   + E+++ ++E  +   EEE ++   +  EEE E + E  + K+E E 
Sbjct: 124 LEQQRRQNEELLKMQEESVLRQEAMRRATEEEILEMRRETIEEEAELERENIRAKIEAEA 183

Query: 144 KEEEETKYLVRDVHLE 159
           +   + +    D++ E
Sbjct: 184 RGRAKEERENEDINRE 199


>gnl|CDD|215618 PLN03184, PLN03184, chloroplast Hsp70; Provisional.
          Length = 673

 Score = 27.5 bits (61), Expect = 3.7
 Identities = 17/62 (27%), Positives = 33/62 (53%), Gaps = 2/62 (3%)

Query: 84  LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
           ++   +E EK     ED  K++  + + + +  V + EK+ +E  +K   + KEK+E + 
Sbjct: 544 VERMVQEAEKFA--KEDKEKRDAVDTKNQADSVVYQTEKQLKELGDKVPADVKEKVEAKL 601

Query: 144 KE 145
           KE
Sbjct: 602 KE 603


>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor.  The
           spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
           U5) and proteins, catalyzes the excision of introns from
           pre-mRNAs in two successive trans-esterification
           reactions. Step 2 depends upon integral spliceosome
           constituents such as U5 snRNA and Prp8 and
           non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
           ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
           conformational change in the spliceosome that leads to
           protection of the 3'ss from targeted RNase H cleavage.
           This change, which probably reflects binding of the 3'ss
           PyAG in the catalytic centre of the spliceosome,
           requires the ordered recruitment of Slu7, Prp18, and
           Prp22 to the spliceosome. There is a close functional
           relationship between Prp8, Prp18, and Slu7, and Prp18
           interacts with Slu7, so that together they recruit Prp22
           to the spliceosome. Most members of the family carry a
           zinc-finger of the CCHC-type upstream of this domain.
          Length = 236

 Score = 27.4 bits (61), Expect = 3.8
 Identities = 15/42 (35%), Positives = 26/42 (61%)

Query: 111 EEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYL 152
            E +E V+E EK EE  K+ K ++E +  + + ++E+E  YL
Sbjct: 17  SEYKEVVEEYEKLEELRKKLKAKKEDDSDDTDYEDEDEELYL 58


>gnl|CDD|223092 COG0013, AlaS, Alanyl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 879

 Score = 27.6 bits (62), Expect = 3.8
 Identities = 20/57 (35%), Positives = 25/57 (43%), Gaps = 2/57 (3%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE--ETKYLVRDVH 157
             E   K E   EE+KE EKE E  K+K    E   +    K EE    K L ++V 
Sbjct: 727 PPELLPKVERLLEELKELEKELERLKKKLAAAELADLLSNAKAEEIGGVKVLAKEVD 783


>gnl|CDD|236356 PRK08944, motB, flagellar motor protein MotB; Reviewed.
          Length = 302

 Score = 27.3 bits (61), Expect = 3.8
 Identities = 15/60 (25%), Positives = 32/60 (53%), Gaps = 4/60 (6%)

Query: 83  VLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
            L+ QA E++ AG       ++  +++  E   EVK E+   +E+ E   ++  +++E+E
Sbjct: 103 TLEFQAGESDSAG----GTQQQRGKQRGGEASAEVKAEKAAAQEQIEDLVKKLAQQLEKE 158


>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
          Length = 1560

 Score = 27.7 bits (61), Expect = 3.9
 Identities = 11/41 (26%), Positives = 17/41 (41%)

Query: 108 EKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
            K E+  E     +K E+ ++E  E    E   E E E  +
Sbjct: 547 AKREDHPEGGTNRQKYEQSDEESVESSSSENSSENENEVTD 587


>gnl|CDD|236168 PRK08156, PRK08156, type III secretion system protein SpaS;
           Validated.
          Length = 361

 Score = 27.3 bits (61), Expect = 4.0
 Identities = 14/35 (40%), Positives = 22/35 (62%), Gaps = 3/35 (8%)

Query: 113 EEEEVKEEEKEEE---EEKEKKEEEEKEKMEEEEK 144
           +++EVK E KE+E   E K K+ E  +E + E+ K
Sbjct: 211 DKQEVKREYKEQEGNPEIKSKRREAHQEILSEQVK 245


>gnl|CDD|217451 pfam03249, TSA, Type specific antigen.  There are several antigenic
           variants in Rickettsia tsutsugamushi, and a
           type-specific antigen (TSA) of 56-kilodaltons located on
           the rickettsial surface is responsible for the
           variation. TSA proteins are probably integral membrane
           proteins.
          Length = 502

 Score = 27.5 bits (61), Expect = 4.0
 Identities = 17/59 (28%), Positives = 27/59 (45%), Gaps = 10/59 (16%)

Query: 97  LYED--------NIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
           LY+D         IKK  E+   ++EE+   +     + K+K+   EK K   + KE E
Sbjct: 346 LYKDLVKLQRHAGIKKAMEKLAAQQEEDAGNQG--GGDCKKKQGASEKSKEGGKGKETE 402


>gnl|CDD|240285 PTZ00135, PTZ00135, 60S acidic ribosomal protein P0; Provisional.
          Length = 310

 Score = 27.3 bits (61), Expect = 4.0
 Identities = 5/21 (23%), Positives = 9/21 (42%)

Query: 106 EEEKEEEEEEEVKEEEKEEEE 126
                       +EEE+EE++
Sbjct: 283 AAAAAAAAAAPAEEEEEEEDD 303



 Score = 27.3 bits (61), Expect = 4.0
 Identities = 6/15 (40%), Positives = 8/15 (53%)

Query: 103 KKEEEEKEEEEEEEV 117
                 +EEEEEE+ 
Sbjct: 289 AAAAPAEEEEEEEDD 303



 Score = 26.9 bits (60), Expect = 5.8
 Identities = 6/18 (33%), Positives = 8/18 (44%)

Query: 110 EEEEEEEVKEEEKEEEEE 127
                     EE+EEEE+
Sbjct: 285 AAAAAAAAPAEEEEEEED 302



 Score = 26.5 bits (59), Expect = 6.1
 Identities = 5/22 (22%), Positives = 9/22 (40%)

Query: 118 KEEEKEEEEEKEKKEEEEKEKM 139
                       ++EEEE++ M
Sbjct: 283 AAAAAAAAAAPAEEEEEEEDDM 304



 Score = 26.5 bits (59), Expect = 6.9
 Identities = 6/21 (28%), Positives = 9/21 (42%)

Query: 107 EEKEEEEEEEVKEEEKEEEEE 127
                       EEE+EEE++
Sbjct: 283 AAAAAAAAAAPAEEEEEEEDD 303



 Score = 26.1 bits (58), Expect = 8.7
 Identities = 6/21 (28%), Positives = 9/21 (42%)

Query: 128 KEKKEEEEKEKMEEEEKEEEE 148
                       EEEE+EE++
Sbjct: 283 AAAAAAAAAAPAEEEEEEEDD 303


>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
          Length = 701

 Score = 27.3 bits (60), Expect = 4.0
 Identities = 17/61 (27%), Positives = 36/61 (59%)

Query: 97  LYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDV 156
           L  +   ++E   E+EE+ +  +E+ EE + +E KE +E +K + ++ +E   +Y V++ 
Sbjct: 205 LMVEKTTEKEVTDEDEEDTKKADEDGEEPKVEEVKEGDEGKKKKTKKVKEVTKEYEVQNK 264

Query: 157 H 157
           H
Sbjct: 265 H 265



 Score = 26.9 bits (59), Expect = 6.2
 Identities = 18/56 (32%), Positives = 32/56 (57%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
           D  +++ ++ +E+ EE   EE KE +E K+KK ++ KE  +E E + +      RD
Sbjct: 217 DEDEEDTKKADEDGEEPKVEEVKEGDEGKKKKTKKVKEVTKEYEVQNKHKPLWTRD 272


>gnl|CDD|234235 TIGR03503, TIGR03503, TIGR03503 family protein.  This set of
           conserved hypothetical protein has a phylogenetic range
           that closely matches that of TIGR03501, a putative
           C-terminal protein targeting signal.
          Length = 374

 Score = 27.3 bits (61), Expect = 4.1
 Identities = 14/45 (31%), Positives = 22/45 (48%), Gaps = 1/45 (2%)

Query: 110 EEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEE-EEKEEEETKYLV 153
           EE  E E   EE   EE  + K E+E+  + E E K+ +    ++
Sbjct: 308 EEPPEAEPSAEEAATEEALKAKAEQEQALLVEREAKDSKFWIIII 352


>gnl|CDD|237622 PRK14140, PRK14140, heat shock protein GrpE; Provisional.
          Length = 191

 Score = 26.9 bits (60), Expect = 4.1
 Identities = 22/45 (48%), Positives = 29/45 (64%)

Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            EEK E+ EEEV+E E EE  E E +EE  +E+ E E  +EE+ K
Sbjct: 1   MEEKNEQVEEEVEETEVEEAVEDEVEEETVEEESEAELLDEEQAK 45



 Score = 26.5 bits (59), Expect = 5.6
 Identities = 23/55 (41%), Positives = 33/55 (60%), Gaps = 1/55 (1%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKME-EEEKEEEETKYL 152
           +   + EE E EE  E+EV+EE  EEE E E  +EE+ +  E E + +E E +YL
Sbjct: 7   QVEEEVEETEVEEAVEDEVEEETVEEESEAELLDEEQAKIAELEAKLDELEERYL 61


>gnl|CDD|224035 COG1110, COG1110, Reverse gyrase [DNA replication, recombination,
           and repair].
          Length = 1187

 Score = 27.3 bits (61), Expect = 4.2
 Identities = 13/31 (41%), Positives = 18/31 (58%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKE 132
           IK   +   E+  E V+EE +E E E+EKK 
Sbjct: 243 IKLRRKLYGEKRAERVREELREVEREREKKR 273


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 27.1 bits (60), Expect = 4.2
 Identities = 17/56 (30%), Positives = 32/56 (57%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHL 158
           KK ++++ +E  ++  EEE E E+E + +E  E +  +EEE++ +   Y   D  L
Sbjct: 156 KKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYDGML 211


>gnl|CDD|203489 pfam06644, ATP11, ATP11 protein.  This family consists of several
           eukaryotic ATP11 proteins. In Saccharomyces cerevisiae,
           expression of functional F1-ATPase requires two proteins
           encoded by the ATP11 and ATP12 genes. Atp11p is a
           molecular chaperone of the mitochondrial matrix that
           participates in the biogenesis pathway to form F1, the
           catalytic unit of the ATP synthase.
          Length = 250

 Score = 26.9 bits (60), Expect = 4.3
 Identities = 10/53 (18%), Positives = 25/53 (47%)

Query: 98  YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           Y   + ++ +E   E  E +K+  K++ E+KE   ++      ++  + +  K
Sbjct: 4   YRSKLLQKAKESGLEFIERLKKALKDKIEKKEFSAKKPPTGPSKQASKFKTLK 56


>gnl|CDD|233503 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, large subunit,
           splicing factor.  These splicing factors consist of an
           N-terminal arginine-rich low complexity domain followed
           by three tandem RNA recognition motifs (pfam00076). The
           well-characterized members of this family are auxilliary
           components of the U2 small nuclear ribonuclearprotein
           splicing factor (U2AF). These proteins are closely
           related to the CC1-like subfamily of splicing factors
           (TIGR01622). Members of this subfamily are found in
           plants, metazoa and fungi.
          Length = 509

 Score = 27.2 bits (60), Expect = 4.3
 Identities = 10/42 (23%), Positives = 20/42 (47%)

Query: 121 EKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLESCE 162
           ++E + E+EK    ++++  E  +     +   RD H  S E
Sbjct: 2   DEEPDREREKSRGRDRDRSSERPRRRSRDRSRFRDRHRRSRE 43


>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein.  This
           family includes archaebacterial L12, eukaryotic P0, P1
           and P2.
          Length = 88

 Score = 26.0 bits (58), Expect = 4.3
 Identities = 11/20 (55%), Positives = 13/20 (65%)

Query: 108 EKEEEEEEEVKEEEKEEEEE 127
                 EEE KEEE+EEEE+
Sbjct: 61  AAAAAAEEEKKEEEEEEEED 80



 Score = 25.3 bits (56), Expect = 8.7
 Identities = 9/22 (40%), Positives = 15/22 (68%)

Query: 118 KEEEKEEEEEKEKKEEEEKEKM 139
                EEE+++E++EEEE + M
Sbjct: 62  AAAAAEEEKKEEEEEEEEDDDM 83


>gnl|CDD|221179 pfam11711, Tim54, Inner membrane protein import complex subunit
           Tim54.  Mitochondrial function depends on the import of
           hundreds of different proteins synthesised in the
           cytosol. Protein import is a multi-step pathway which
           includes the binding of precursor proteins to surface
           receptors, translocation of the precursor across one or
           both mitochondrial membranes, and folding and assembly
           of the imported protein inside the mitochondrion. Most
           precursor proteins carry amino-terminal targeting
           signals, called pre-sequences, and are imported into
           mitochondria via import complexes located in both the
           outer and the inner membrane (IM). The IM complex, TIM,
           is made up of at least two proteins which mediate
           translocation of proteins into the matrix by removing
           their signal peptide and another pair of proteins, Tim54
           and Tim22, that insert the polytopic proteins, that
           carry internal targetting information, into the inner
           membrane.
          Length = 377

 Score = 27.4 bits (61), Expect = 4.4
 Identities = 11/44 (25%), Positives = 17/44 (38%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
             +E   E E E     +   E  E+  E   E+ E+  +EE  
Sbjct: 204 TVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENN 247



 Score = 27.0 bits (60), Expect = 5.5
 Identities = 11/46 (23%), Positives = 17/46 (36%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            +E   E E E     E   E  +E  E   +E  +  E+E  +  
Sbjct: 205 VDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENNKPV 250



 Score = 26.6 bits (59), Expect = 6.4
 Identities = 13/46 (28%), Positives = 17/46 (36%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           E    E   E EV+     E   +  +E  E    E E+  EEE  
Sbjct: 202 EPTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENN 247



 Score = 26.2 bits (58), Expect = 9.1
 Identities = 12/48 (25%), Positives = 19/48 (39%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
           E  + +   E E E     +   +  EE  E   EE ++  EEE  + 
Sbjct: 202 EPTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENNKP 249


>gnl|CDD|224009 COG1084, COG1084, Predicted GTPase [General function prediction
           only].
          Length = 346

 Score = 27.3 bits (61), Expect = 4.4
 Identities = 11/41 (26%), Positives = 20/41 (48%)

Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
            +  +EE+ EE++    EE  E+  K    K    ++ +EE
Sbjct: 290 IDIADEEKLEEIEASVLEEGGEEPLKISATKGCGLDKLREE 330


>gnl|CDD|234271 TIGR03591, polynuc_phos, polyribonucleotide nucleotidyltransferase.
            Members of this protein family are polyribonucleotide
           nucleotidyltransferase, also called polynucleotide
           phosphorylase. Some members have been shown also to have
           additional functions as guanosine pentaphosphate
           synthetase and as poly(A) polymerase (see model
           TIGR02696 for an exception clade, within this family)
           [Transcription, Degradation of RNA].
          Length = 684

 Score = 27.5 bits (62), Expect = 4.4
 Identities = 18/46 (39%), Positives = 28/46 (60%), Gaps = 2/46 (4%)

Query: 103 KKEEEEKEEEEEEEVKEE--EKEEEEEKEKKEEEEKEKMEEEEKEE 146
           K+E     +  +EEV E   E+EEEEE+  +E+E KE  ++ EK+ 
Sbjct: 255 KQERYAALDAIKEEVLEALAEEEEEEEEADREKEIKEAFKDLEKKI 300


>gnl|CDD|216689 pfam01765, RRF, Ribosome recycling factor.  The ribosome recycling
           factor (RRF / ribosome release factor) dissociates the
           ribosome from the mRNA after termination of translation,
           and is essential bacterial growth. Thus ribosomes are
           "recycled" and ready for another round of protein
           synthesis.
          Length = 165

 Score = 26.7 bits (60), Expect = 4.5
 Identities = 14/45 (31%), Positives = 30/45 (66%), Gaps = 1/45 (2%)

Query: 86  DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKE-EEKEEEEEKE 129
           D+ K+ EK   + ED +K+ E+E ++  ++ +K+ +E  +++EKE
Sbjct: 119 DKLKKLEKDKEISEDEVKRAEKEIQKLTDKYIKKIDELLKKKEKE 163


>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family.  This
           family consists of the 116kDa V-type ATPase (vacuolar
           (H+)-ATPases) subunits, as well as V-type ATP synthase
           subunit i. The V-type ATPases family are proton pumps
           that acidify intracellular compartments in eukaryotic
           cells for example yeast central vacuoles,
           clathrin-coated and synaptic vesicles. They have
           important roles in membrane trafficking processes. The
           116kDa subunit (subunit a) in the V-type ATPase is part
           of the V0 functional domain responsible for proton
           transport. The a subunit is a transmembrane glycoprotein
           with multiple putative transmembrane helices it has a
           hydrophilic amino terminal and a hydrophobic carboxy
           terminal. It has roles in proton transport and assembly
           of the V-type ATPase complex. This subunit is encoded by
           two homologous gene in yeast VPH1 and STV1.
          Length = 707

 Score = 27.3 bits (61), Expect = 4.6
 Identities = 21/66 (31%), Positives = 30/66 (45%), Gaps = 1/66 (1%)

Query: 84  LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKE-EEKEEEEEKEKKEEEEKEKMEEE 142
           L    K+T     +       + EE+  + E E+KE EE  E  EKE  E EE   + +E
Sbjct: 50  LGIPLKDTGGKPDVPPSKEFLDLEEEILDLEAEIKEVEENLESLEKEINELEEWLNVLDE 109

Query: 143 EKEEEE 148
           EK   +
Sbjct: 110 EKSFLD 115


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 26.9 bits (60), Expect = 4.7
 Identities = 14/36 (38%), Positives = 21/36 (58%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
           K+E EE  E  E   +++ K +EE K+KK +E K  
Sbjct: 234 KRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGV 269


>gnl|CDD|205200 pfam13019, Telomere_Sde2, Telomere stability and silencing.  Sde2
           has been identified in fission yeast as an important
           factor in telomere formation and maintenance. This is a
           more N-terminal domain on these nuclear proteins, and is
           essential for telomeric silencing and genomic stability.
          Length = 163

 Score = 26.7 bits (59), Expect = 4.7
 Identities = 10/36 (27%), Positives = 17/36 (47%)

Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
           E K   E   +K E +++E+E  +K  E   +  E 
Sbjct: 128 EAKALAEYLAIKPEMEKKEKEARRKRWESVVEALER 163



 Score = 26.7 bits (59), Expect = 5.0
 Identities = 10/29 (34%), Positives = 12/29 (41%)

Query: 120 EEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           E K   E    K E EK++ E   K  E 
Sbjct: 128 EAKALAEYLAIKPEMEKKEKEARRKRWES 156


>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3.  This family
           consists of several radial spoke protein 3 (RSP3)
           sequences. Eukaryotic cilia and flagella present in
           diverse types of cells perform motile, sensory, and
           developmental functions in organisms from protists to
           humans. They are centred by precisely organised,
           microtubule-based structures, the axonemes. The axoneme
           consists of two central singlet microtubules, called the
           central pair, and nine outer doublet microtubules. These
           structures are well-conserved during evolution. The
           outer doublet microtubules, each composed of A and B
           sub-fibres, are connected to each other by nexin links,
           while the central pair is held at the centre of the
           axoneme by radial spokes. The radial spokes are T-shaped
           structures extending from the A-tubule of each outer
           doublet microtubule to the centre of the axoneme. Radial
           spoke protein 3 (RSP3), is present at the proximal end
           of the spoke stalk and helps in anchoring the radial
           spoke to the outer doublet. It is thought that radial
           spokes regulate the activity of inner arm dynein through
           protein phosphorylation and dephosphorylation.
          Length = 288

 Score = 26.9 bits (60), Expect = 4.7
 Identities = 12/49 (24%), Positives = 33/49 (67%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
             +++++ + E+    E+ E ++ EE E+ ++EE+E+ K +++E+++ E
Sbjct: 155 AELRQQQRQFEQRRNAELAETQRLEEAERRRREEKERRKKQDKERKQRE 203


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 27.2 bits (60), Expect = 4.8
 Identities = 16/52 (30%), Positives = 27/52 (51%), Gaps = 6/52 (11%)

Query: 87  QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
            AK+ +K       +++  E+  + E +  V  E++  EE+K KKEEEE   
Sbjct: 469 SAKQKDK------QSMQSMEKRLKSEADSRVNAEKQLAEEKKRKKEEEETAA 514


>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
           This family includes NS2 proteins from other members of
           the Orbivirus genus. NS2 is a non-specific
           single-stranded RNA-binding protein that forms large
           homomultimers and accumulates in viral inclusion bodies
           of infected cells. Three RNA binding regions have been
           identified in Bluetongue virus serotype 17 at residues
           2-11, 153-166 and 274-286. NS2 multimers also possess
           nucleotidyl phosphatase activity. The precise function
           of NS2 is not known, but it may be involved in the
           transport and condensation of viral mRNAs.
          Length = 363

 Score = 27.2 bits (60), Expect = 4.9
 Identities = 11/35 (31%), Positives = 21/35 (60%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
           ++ +  KE+E EE V+ +  +E+E  E  E+ E +
Sbjct: 238 RRRDLRKEQELEENVERDSDDEDEHGEDSEDGETK 272


>gnl|CDD|131743 TIGR02696, pppGpp_PNP, guanosine pentaphosphate synthetase
           I/polynucleotide phosphorylase.  Sohlberg, et al present
           characterization of two proteins from Streptomyces
           coelicolor. The protein in this family was shown to have
           poly(A) polymerase activity and may be responsible for
           polyadenylating RNA in this species. Reference 2 showed
           that a nearly identical plasmid-encoded protein from
           Streptomyces antibioticus is a bifunctional enzyme that
           acts also as a guanosine pentaphosphate synthetase.
          Length = 719

 Score = 27.1 bits (60), Expect = 4.9
 Identities = 21/80 (26%), Positives = 34/80 (42%), Gaps = 13/80 (16%)

Query: 88  AKETEKAGVL--YEDNIKKEEEEKEEEEEEEV------KEEEKEEEEEKEKKEEEEKEKM 139
           AK T +  +   Y+D++ +  E   ++E          +E E+  +E K     +  E+ 
Sbjct: 248 AKPTGEFPLFPDYQDDVYEAVEGAVKDELSAALTIAGKQEREEALDEVKALVAAKLAEQF 307

Query: 140 EEEEKE-----EEETKYLVR 154
           E  EKE        TK LVR
Sbjct: 308 EGREKEISAAYRAVTKKLVR 327


>gnl|CDD|236449 PRK09282, PRK09282, pyruvate carboxylase subunit B; Validated.
          Length = 592

 Score = 27.1 bits (61), Expect = 5.0
 Identities = 17/67 (25%), Positives = 30/67 (44%), Gaps = 10/67 (14%)

Query: 100 DNIKKEEEEKEEEEEE---------EVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           +  +KE EE  + E+E         ++ ++  EE E  E K E E ++      E   T+
Sbjct: 408 EKARKEAEELGKSEKEDVLTYALFPQIAKKFLEEREAGELKPEPEPKEAAAAGAEGIPTE 467

Query: 151 YLVRDVH 157
           + V +V 
Sbjct: 468 FKV-EVD 473


>gnl|CDD|236498 PRK09401, PRK09401, reverse gyrase; Reviewed.
          Length = 1176

 Score = 27.2 bits (61), Expect = 5.2
 Identities = 12/40 (30%), Positives = 20/40 (50%)

Query: 91  TEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEK 130
             + G   E   +  E+E+ E+    + E EK EE+E+E 
Sbjct: 840 KLENGGGLELEGEFSEKEEAEKFYNNLIEVEKVEEKEEEL 879


>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
           with the myosin-like domain [Function unknown].
          Length = 499

 Score = 26.9 bits (59), Expect = 5.2
 Identities = 11/61 (18%), Positives = 23/61 (37%), Gaps = 2/61 (3%)

Query: 88  AKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
                ++GV   D+I+ +      E      + EK   E + +    E +K  +E +   
Sbjct: 65  LNRNLRSGVFQLDDIRPQLRALRTELG--TAQGEKRAAETEREAARSELQKARQEREAVR 122

Query: 148 E 148
           +
Sbjct: 123 Q 123


>gnl|CDD|219405 pfam07418, PCEMA1, Acidic phosphoprotein precursor PCEMA1.  This
           family consists of several acidic phosphoprotein
           precursor PCEMA1 sequences which appear to be found
           exclusively in Plasmodium chabaudi. PCEMA1 is an antigen
           that is associated with the membrane of the infected
           erythrocyte throughout the entire intraerythrocytic
           cycle. The exact function of this family is unclear.
          Length = 286

 Score = 26.8 bits (59), Expect = 5.2
 Identities = 18/52 (34%), Positives = 28/52 (53%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
            DN  KE+  +E  E +  K    E E E E ++E E+E  EE+E E++  +
Sbjct: 235 NDNEVKEKIRREYREWKGDKANTNETEIEDESEDEYEEEAGEEQENEDKGEE 286


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 26.8 bits (60), Expect = 5.4
 Identities = 15/46 (32%), Positives = 29/46 (63%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
           +     ++EE  E++E+E+  +EK  +E++E+ EE   E+ EE E+
Sbjct: 95  KMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELER 140


>gnl|CDD|234701 PRK00252, alaS, alanyl-tRNA synthetase; Reviewed.
          Length = 865

 Score = 27.0 bits (61), Expect = 5.5
 Identities = 15/42 (35%), Positives = 19/42 (45%)

Query: 115 EEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDV 156
           EE+KE EKE E+ K K        +  + KE    K L   V
Sbjct: 726 EELKELEKELEQLKAKLAAAAAGDLLAQAKEVNGVKVLAAQV 767


>gnl|CDD|227891 COG5604, COG5604, Uncharacterized conserved protein [Function
           unknown].
          Length = 523

 Score = 27.2 bits (60), Expect = 5.6
 Identities = 11/74 (14%), Positives = 28/74 (37%)

Query: 84  LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
             +   E +      ++N  KE+  K   + +    EE+E+  E   K     +K+   +
Sbjct: 37  NRNSHTENKMESGTNDNNKNKEKLSKLYSDVDSSSSEEEEDGSESISKLNVNSKKISLNQ 96

Query: 144 KEEEETKYLVRDVH 157
              ++ +  +  + 
Sbjct: 97  VSTQKWRKELDLLA 110


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 26.9 bits (60), Expect = 5.7
 Identities = 6/33 (18%), Positives = 19/33 (57%)

Query: 103  KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEE 135
             ++++     E+++  E +  E+E+ E  E+++
Sbjct: 1356 PRKKKSDSSSEDDDDSEVDDSEDEDDEDDEDDD 1388


>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
           [Transcription].
          Length = 392

 Score = 27.0 bits (59), Expect = 5.7
 Identities = 18/55 (32%), Positives = 32/55 (58%)

Query: 94  AGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
            G    +N +  E +KE+++EE    E  +EE + ++ +E  +EK E++E EE E
Sbjct: 279 VGAAEIENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENEENE 333



 Score = 26.6 bits (58), Expect = 8.1
 Identities = 16/65 (24%), Positives = 32/65 (49%)

Query: 86  DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
            +  + ++   +      KEE + +  +E   ++EE +E EE E+  E   +++ E EK 
Sbjct: 290 SEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENEENERHTELLADELNELEKG 349

Query: 146 EEETK 150
            EE +
Sbjct: 350 IEEKR 354


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 27.0 bits (60), Expect = 5.7
 Identities = 21/53 (39%), Positives = 32/53 (60%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
           +KE  EKE +E +E + + KE+ +  EK+ E    K EE E+E EE +  +RD
Sbjct: 827 EKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRD 879


>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
           includes archaeal L12p, the protein that is functionally
           equivalent to L7/L12 in bacteria and the P1 and P2
           proteins in eukaryotes. L12p is homologous to P1 and P2
           but is not homologous to bacterial L7/L12. It is located
           in the L12 stalk, with proteins L10, L11, and 23S rRNA.
           L12p is the only protein in the ribosome to occur as
           multimers, always appearing as sets of dimers. Recent
           data indicate that most archaeal species contain six
           copies of L12p (three homodimers), while eukaryotes have
           four copies (two heterodimers), and bacteria may have
           four or six copies (two or three homodimers), depending
           on the species. The organization of proteins within the
           stalk has been characterized primarily in bacteria,
           where L7/L12 forms either two or three homodimers and
           each homodimer binds to the extended C-terminal helix of
           L10. L7/L12 is attached to the ribosome through L10 and
           is the only ribosomal protein that does not directly
           interact with rRNA. Archaeal L12p is believed to
           function in a similar fashion. However, hybrid ribosomes
           containing the large subunit from E. coli with an
           archaeal stalk are able to bind archaeal and eukaryotic
           elongation factors but not bacterial elongation factors.
           In several mesophilic and thermophilic archaeal species,
           the binding of 23S rRNA to protein L11 and to the
           L10/L12p pentameric complex was found to be
           temperature-dependent and cooperative.
          Length = 106

 Score = 25.9 bits (57), Expect = 5.7
 Identities = 17/25 (68%), Positives = 21/25 (84%)

Query: 118 KEEEKEEEEEKEKKEEEEKEKMEEE 142
             EEK EE+E+EKK+EEEKE+ EEE
Sbjct: 73  AAEEKAEEKEEEKKKEEEKEEEEEE 97



 Score = 25.2 bits (55), Expect = 9.6
 Identities = 16/25 (64%), Positives = 21/25 (84%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKE 129
             EEK EE+EEE K+EE++EEEE+E
Sbjct: 73  AAEEKAEEKEEEKKKEEEKEEEEEE 97


>gnl|CDD|237628 PRK14156, PRK14156, heat shock protein GrpE; Provisional.
          Length = 177

 Score = 26.3 bits (58), Expect = 5.8
 Identities = 23/50 (46%), Positives = 29/50 (58%), Gaps = 2/50 (4%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEE--EEKEEEETKYL 152
            EE K EE EEEV+  E EE  E+  +E  EK ++E   E  +E E KYL
Sbjct: 2   SEEIKNEEVEEEVEATETEETVEEVVEETPEKSELELANERADEFENKYL 51


>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
           This family consists of several Nucleopolyhedrovirus
           capsid protein P87 sequences. P87 is expressed late in
           infection and concentrated in infected cell nuclei.
          Length = 606

 Score = 26.8 bits (59), Expect = 5.9
 Identities = 10/33 (30%), Positives = 22/33 (66%)

Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEK 136
              E   +E+E++  E+E + E+E++++ EE+K
Sbjct: 365 PLPEYSSDEDEDDSDEDEVDYEKERKRRREEDK 397


>gnl|CDD|223097 COG0018, ArgS, Arginyl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 577

 Score = 26.8 bits (60), Expect = 6.0
 Identities = 12/55 (21%), Positives = 26/55 (47%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLV 153
              +  E +    E   ++ ++ +E+    E++  EE EK+E  ++E E  +  V
Sbjct: 184 ALGLTPEPDGYLGEYYVKIAKDLEEDPGNDEEEAREEVEKLESGDEEAELWRKFV 238


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 26.6 bits (59), Expect = 6.2
 Identities = 16/48 (33%), Positives = 33/48 (68%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           K E    +EE ++ ++E E ++ +++++K EEE+ K  ++++EEEE K
Sbjct: 170 KPERNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEEEERK 217


>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
          Length = 306

 Score = 26.7 bits (59), Expect = 6.2
 Identities = 21/63 (33%), Positives = 33/63 (52%)

Query: 86  DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
           D A E  +     +     +   +++E   EV+  E  EEEE E+ EE+E+E+ EEE +E
Sbjct: 42  DDAIELAEPETSDDPYGNPDPFGEDDEGRIEVRISEDGEEEEVEEGEEDEEEEGEEESEE 101

Query: 146 EEE 148
            E 
Sbjct: 102 FEP 104


>gnl|CDD|184860 PRK14858, tatA, twin arginine translocase protein A; Provisional.
          Length = 108

 Score = 26.0 bits (57), Expect = 6.3
 Identities = 18/60 (30%), Positives = 31/60 (51%)

Query: 90  ETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
           E +KA   ++ ++++E    EE+E+ E   E K+E E  E K EE++    +   E   T
Sbjct: 38  EFKKATDDFKQSMQEESRTAEEKEKAEKLAETKKEAEAPEAKAEEDQAPKPKGAGEPPAT 97


>gnl|CDD|235668 PRK06007, fliF, flagellar MS-ring protein; Reviewed.
          Length = 542

 Score = 26.8 bits (60), Expect = 6.6
 Identities = 17/63 (26%), Positives = 30/63 (47%)

Query: 84  LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
            +++  E E A         +E   + E  ++E  EEEK EEE K +   +   ++ +E+
Sbjct: 467 PEEELAEEEAAAEEAALEEDEEAALEVELSDDEELEEEKAEEELKYEDLLKRLRELAKED 526

Query: 144 KEE 146
            EE
Sbjct: 527 PEE 529


>gnl|CDD|237046 PRK12297, obgE, GTPase CgtA; Reviewed.
          Length = 424

 Score = 26.6 bits (60), Expect = 6.6
 Identities = 17/48 (35%), Positives = 21/48 (43%), Gaps = 1/48 (2%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
           D +     E  EE  E   EEE E EEE   K EEE++       E+ 
Sbjct: 316 DELLYAVAELLEETPEFPLEEE-EVEEEVYYKFEEEEKDFTITRDEDG 362


>gnl|CDD|129416 TIGR00316, cdhC, CO dehydrogenase/CO-methylating acetyl-CoA
           synthase complex, beta subunit.  Nomenclature follows
           the description for Methanosarcina thermophila. The
           CO-methylating acetyl-CoA synthase is considered the
           defining enzyme of the Wood-Ljungdahl pathway, used for
           acetate catabolism by sulfate reducing bacteria but for
           acetate biosynthesis by acetogenic bacteria such as
           oorella thermoacetica (f. Clostridium thermoaceticum)
           [Energy metabolism, Chemoautotrophy].
          Length = 458

 Score = 26.7 bits (59), Expect = 6.7
 Identities = 13/31 (41%), Positives = 18/31 (58%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKE 132
           +K+   E +EEE EE +E  + EE E E  E
Sbjct: 396 VKRVVREVDEEEIEEEEEAMQPEEMEMEGFE 426


>gnl|CDD|184539 PRK14150, PRK14150, heat shock protein GrpE; Provisional.
          Length = 193

 Score = 26.4 bits (59), Expect = 6.7
 Identities = 18/60 (30%), Positives = 32/60 (53%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHL 158
            +  K  +E+  EE E EV+++E+ +E E E + +E   ++ E E +  E +   RD  L
Sbjct: 3   NEEQKTPQEQVSEEIEAEVEQQEEADEAELEDELDEADARIAELEAQLAEAQAEERDSVL 62


>gnl|CDD|227900 COG5613, COG5613, Uncharacterized conserved protein [Function
           unknown].
          Length = 400

 Score = 26.9 bits (59), Expect = 6.8
 Identities = 17/59 (28%), Positives = 28/59 (47%), Gaps = 2/59 (3%)

Query: 85  DDQAKETEKAGVLYED-NIKKEEEEKEEEEEEEVKEEEK-EEEEEKEKKEEEEKEKMEE 141
             Q  E E A +L    + +K  +     E    +  +K E  ++K KK+EEE +K E+
Sbjct: 110 SPQQFEIELAKLLSLVESAQKVSKLLNSNEIYMDQNTKKMENNQQKIKKQEEEAKKAEK 168


>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2.  Transcripts
           harbouring premature signals for translation termination
           are recognised and rapidly degraded by eukaryotic cells
           through a pathway known as nonsense-mediated mRNA decay.
           In Saccharomyces cerevisiae, three trans-acting factors
           (Upf1 to Upf3) are required for nonsense-mediated mRNA
           decay.
          Length = 171

 Score = 26.2 bits (58), Expect = 6.8
 Identities = 16/50 (32%), Positives = 31/50 (62%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           ++ + +E+E+ E  +EEEV   + E++EE + +EE+     +EEE + E 
Sbjct: 13  DEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEEVDPEA 62


>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
           Members of this family are associated with type VII
           secretion of WXG100 family targets in the Firmicutes,
           but not in the Actinobacteria. This highly divergent
           protein family consists largely of a central region of
           highly polar low-complexity sequence containing
           occasional LF motifs in weak repeats about 17 residues
           in length, flanked by hydrophobic N- and C-terminal
           regions [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 150

 Score = 26.2 bits (58), Expect = 6.9
 Identities = 15/48 (31%), Positives = 27/48 (56%), Gaps = 2/48 (4%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEK--EKMEEEEKEEEETK 150
             E+K+ E   +  +EE E ++E    EE++K   +  +E+ E+EE K
Sbjct: 30  RYEKKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELK 77



 Score = 25.8 bits (57), Expect = 8.3
 Identities = 10/46 (21%), Positives = 25/46 (54%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
           + +I+   +  +EE E + +    EE+++   ++ +EK + EE + 
Sbjct: 33  KKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKN 78


>gnl|CDD|220611 pfam10169, Laps, Learning-associated protein.  This is a family of
           121-amino acid secretory proteins. Laps functions in the
           regulation of neuronal cell adhesion and/or movement and
           synapse attachment. Laps binds to the ApC/EBP (Aplysia
           CCAAT/enhancer binding protein) promoter and activates
           the transcription of ApC/EBP mRNA.
          Length = 124

 Score = 26.0 bits (57), Expect = 7.1
 Identities = 11/38 (28%), Positives = 21/38 (55%)

Query: 113 EEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
           +   + ++ K+    K  +E +EK+K E  E EE++ K
Sbjct: 35  DGNVLMKDVKDIATVKPAEEVKEKKKKEGTESEEDDEK 72


>gnl|CDD|220112 pfam09110, HAND, HAND.  The HAND domain adopts a secondary
           structure consisting of four alpha helices, three of
           which (H2, H3, H4) form an L-like configuration. Helix
           H2 runs antiparallel to helices H3 and H4, packing
           closely against helix H4, whilst helix H1 reposes in the
           concave surface formed by these three helices and runs
           perpendicular to them. The domain confers DNA and
           nucleosome binding properties to the protein.
          Length = 109

 Score = 25.7 bits (57), Expect = 7.1
 Identities = 16/45 (35%), Positives = 27/45 (60%)

Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
           D++   +EE EEE E + K E+++ +  +   EEEE+EK    E+
Sbjct: 64  DDVGDGDEEDEEEREAKRKLEQEKIDNAEPLTEEEEEEKQRLLEE 108


>gnl|CDD|237478 PRK13709, PRK13709, conjugal transfer nickase/helicase TraI;
            Provisional.
          Length = 1747

 Score = 26.7 bits (59), Expect = 7.4
 Identities = 15/56 (26%), Positives = 26/56 (46%), Gaps = 5/56 (8%)

Query: 106  EEEKEEEEEEEVKEEEKEEEE---EKEKKEEEEKEKMEEEEKE--EEETKYLVRDV 156
                 E    EV  + + EE    E E++ +E   KM E + +  + +T+  VRD+
Sbjct: 1632 GAGNGEPVTAEVLAQRQAEEAIRRETERRADEIVRKMAENKPDLPDGKTEQAVRDI 1687


>gnl|CDD|216546 pfam01516, Orbi_VP6, Orbivirus helicase VP6.  The VP6 protein a
           minor protein in the core of the virion is probably the
           viral helicase.
          Length = 322

 Score = 26.6 bits (58), Expect = 7.5
 Identities = 15/55 (27%), Positives = 31/55 (56%)

Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLE 159
           +  E  E +E+E K+E+ +E+   + + E++++  +EE  +E E   + R  H E
Sbjct: 26  DWFESGESKEKEPKDEDGQEQRISDGEGEQKQKGGKEESDKETEDASVDRRTHTE 80


>gnl|CDD|184732 PRK14539, PRK14539, 50S ribosomal protein L11/unknown domain fusion
           protein; Provisional.
          Length = 196

 Score = 26.1 bits (57), Expect = 7.7
 Identities = 16/63 (25%), Positives = 29/63 (46%), Gaps = 9/63 (14%)

Query: 95  GVLYE--DNIKKEEEEKEEEEEEEVKEEEKEE-------EEEKEKKEEEEKEKMEEEEKE 145
           GVL E  D++ K +EE +   +     + KE        E +  K +  E   + EE++ 
Sbjct: 133 GVLVEGYDDVVKAKEEAKAAAKAAALAKAKEASLKSAEAELKASKGKSIEVNVIGEEDEG 192

Query: 146 EEE 148
           E++
Sbjct: 193 EKD 195


>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
            This family consists of several Trypanosoma brucei
           procyclic acidic repetitive protein (PARP) like
           sequences. The procyclic acidic repetitive protein
           (parp) genes of Trypanosoma brucei encode a small family
           of abundant surface proteins whose expression is
           restricted to the procyclic form of the parasite. They
           are found at two unlinked loci, parpA and parpB;
           transcription of both loci is developmentally regulated.
          Length = 145

 Score = 26.1 bits (56), Expect = 7.8
 Identities = 24/61 (39%), Positives = 34/61 (55%)

Query: 87  QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
           + K  +   V  +D    + +++ EEEEE   EEE EEE E E++ EEE E  E  E+E 
Sbjct: 41  KGKGEKGTKVGADDTNGTDPDDEPEEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGEEEP 100

Query: 147 E 147
           E
Sbjct: 101 E 101


>gnl|CDD|216295 pfam01093, Clusterin, Clusterin. 
          Length = 434

 Score = 26.6 bits (59), Expect = 7.8
 Identities = 18/38 (47%), Positives = 21/38 (55%)

Query: 111 EEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
           E  EEE K      E+ K++KEE  K   E EEK EEE
Sbjct: 33  ERTEEEHKNLMSTLEKTKKEKEEALKLANEVEEKLEEE 70


>gnl|CDD|203444 pfam06424, PRP1_N, PRP1 splicing factor, N-terminal.  This domain
           is specific to the N-terminal part of the prp1 splicing
           factor, which is involved in mRNA splicing (and possibly
           also poly(A)+ RNA nuclear export and cell cycle
           progression). This domain is specific to the N terminus
           of the RNA splicing factor encoded by prp1. It is
           involved in mRNA splicing and possibly also poly(A)and
           RNA nuclear export and cell cycle progression.
          Length = 131

 Score = 25.7 bits (57), Expect = 8.1
 Identities = 13/40 (32%), Positives = 29/40 (72%), Gaps = 2/40 (5%)

Query: 108 EKEEEEEEEVKE--EEKEEEEEKEKKEEEEKEKMEEEEKE 145
           + E+EE + + E  +E+ +E  K+++E++EKE++E+  +E
Sbjct: 57  DDEDEEADRIYESIDERMDERRKKRREQKEKEEIEKYREE 96


>gnl|CDD|214923 smart00937, PCRF, This domain is found in peptide chain release
           factors. 
          Length = 116

 Score = 25.5 bits (57), Expect = 8.1
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
           E  EEEE++E++E  +EE EE E + EEE E++ + 
Sbjct: 1   ELLEEEEDDEMRELAEEELEELEAELEEELEELLKL 36


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 26.2 bits (58), Expect = 8.2
 Identities = 13/65 (20%), Positives = 30/65 (46%)

Query: 87  QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
           +  E +K   +  +  + +E  +   EE +   EE+ ++E   +  E E E + +  + +
Sbjct: 217 ERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEPINKPVRPK 276

Query: 147 EETKY 151
            +TK 
Sbjct: 277 RKTKA 281


>gnl|CDD|223619 COG0545, FkpA, FKBP-type peptidyl-prolyl cis-trans isomerases 1
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 205

 Score = 26.2 bits (58), Expect = 8.3
 Identities = 9/50 (18%), Positives = 20/50 (40%), Gaps = 1/50 (2%)

Query: 98  YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKK-EEEEKEKMEEEEKEE 146
                K+   E  +  E+ +++  K +  +       E K  +E+  KE+
Sbjct: 45  LALVQKEAALEALQRAEQRLQQAAKAKRAKDAAANAAEGKAFLEKNAKEK 94


>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
           cryoelectron microscopy indicates that VP5 is a trimer
           implying that there are 360 copies of VP5 per virion.
          Length = 507

 Score = 26.5 bits (59), Expect = 8.5
 Identities = 16/53 (30%), Positives = 31/53 (58%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
           +K +E + E++EEEV+E+  ++  EK  ++ EE  K  + E + EE +    +
Sbjct: 91  RKLKELEREQKEEEVREKHNKKIIEKFGEDLEEVYKFMKGEAKVEEEEEKQME 143


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 26.6 bits (59), Expect = 8.5
 Identities = 21/67 (31%), Positives = 32/67 (47%)

Query: 84  LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
           L++   E E+     E+  +K E  KEE EE E   EE E+   + ++ +EE E+     
Sbjct: 311 LEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSAL 370

Query: 144 KEEEETK 150
            EE E  
Sbjct: 371 LEELEEL 377


>gnl|CDD|224553 COG1638, DctP, TRAP-type C4-dicarboxylate transport system,
           periplasmic component [Carbohydrate transport and
           metabolism].
          Length = 332

 Score = 26.1 bits (58), Expect = 8.9
 Identities = 12/45 (26%), Positives = 21/45 (46%)

Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
           + +  +E  EE+ + V+E E E  E+ ++   E  E    E   E
Sbjct: 258 LLEAAKEAAEEQRKLVEELEDELLEKLKEAGVEVVEPDAAEAFRE 302


>gnl|CDD|240377 PTZ00352, PTZ00352, 60S ribosomal protein L13; Provisional.
          Length = 212

 Score = 26.2 bits (58), Expect = 8.9
 Identities = 15/88 (17%), Positives = 34/88 (38%), Gaps = 8/88 (9%)

Query: 71  QLVEGKP-NGLLCVLDDQAKETEKAGVLYEDNI---KKEEEEKEEEEEEEVKEEEKEEEE 126
                KP  G   +  D +KE   A  + ++        +   + E+   + +EE+    
Sbjct: 125 PRKGKKPKKGFAGIPADTSKEEVVALPVKQNKNSEVIPFQRTPKREKARVITKEERAFNA 184

Query: 127 ----EKEKKEEEEKEKMEEEEKEEEETK 150
                + K   +   K E++++ +E+ K
Sbjct: 185 YRTLRQAKLNAKFVGKKEKKKQAKEKKK 212


>gnl|CDD|189037 cd09867, PIN_FEN1, PIN domain of Flap Endonuclease-1, a
           structure-specific, divalent-metal-ion dependent, 5'
           nuclease and homologs.  Flap endonuclease-1 (FEN1) is
           involved in multiple DNA metabolic pathways, including
           DNA replication processes (5' flap DNA endonuclease
           activity and double stranded DNA 5'-exonuclease
           activity) and DNA repair processes (long-patch base
           excision repair) in eukaryotes and archaea. Interaction
           between FEN1 and PCNA (Proliferating cell nuclear
           antigen) is an essential prerequisite to FEN1's DNA
           replication functionality and stimulates FEN1 nuclease
           activity by 10-50 fold. FEN1 belongs to the
           FEN1-EXO1-like family of structure-specific, 5'
           nucleases. These nucleases contain a PIN (PilT N
           terminus) domain with a helical arch/clamp region (I
           domain) of variable length (approximately 45 residues in
           FEN1 PIN domains) and a H3TH (helix-3-turn-helix)
           domain, an atypical helix-hairpin-helix-2-like region.
           Both the H3TH domain (not included here) and the helical
           arch/clamp region are involved in DNA binding. Nucleases
           within this group also have a carboxylate-rich active
           site that is involved in binding essential divalent
           metal ion cofactors (Mg2+/Mn2+).  FEN1 has a C-terminal
           extension containing residues forming the consensus
           PIP-box - Qxx(M/L/I)xxF(Y/F) which serves to anchor FEN1
           to PCNA.
          Length = 261

 Score = 25.9 bits (58), Expect = 9.0
 Identities = 16/45 (35%), Positives = 25/45 (55%)

Query: 116 EVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLES 160
           E+K  E E+  E+ ++ EE+ E+ +EE   EE  KY  R V +  
Sbjct: 86  ELKSGELEKRRERREEAEEKLEEAKEEGDAEEARKYAKRTVRVTK 130


>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34.  This family represents
           herpes virus protein U79 and cytomegalovirus early
           phosphoprotein P34 (UL112).
          Length = 238

 Score = 26.0 bits (57), Expect = 9.3
 Identities = 13/61 (21%), Positives = 30/61 (49%)

Query: 99  EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHL 158
           ED+ K +E+ ++++EE+   +E+K           +     ++E  +E+  K+   +  L
Sbjct: 173 EDSQKHKEDRRKKQEEKRRNDEDKRPGGGGGSSGGQSGLSTKDEPPKEKRQKHHDPERRL 232

Query: 159 E 159
           E
Sbjct: 233 E 233


>gnl|CDD|237549 PRK13897, PRK13897, type IV secretion system component VirD4;
           Provisional.
          Length = 606

 Score = 26.3 bits (58), Expect = 9.3
 Identities = 7/30 (23%), Positives = 14/30 (46%)

Query: 98  YEDNIKKEEEEKEEEEEEEVKEEEKEEEEE 127
             +   +E E     E  E  EE+ +E+++
Sbjct: 577 NNNEASEETETITVPENNEEDEEDDDEDDD 606


>gnl|CDD|235185 PRK03980, PRK03980, flap endonuclease-1; Provisional.
          Length = 292

 Score = 25.9 bits (58), Expect = 9.5
 Identities = 18/43 (41%), Positives = 24/43 (55%)

Query: 116 EVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHL 158
           E+K EE EE  E  ++ EE+ E+ +EE   EE  KY  R   L
Sbjct: 38  ELKAEEIEERREVREEAEEKYEEAKEEGDLEEARKYAQRSSRL 80


>gnl|CDD|143630 cd07892, PolyPPase_VTC2-3_like, Polyphosphate(polyP) polymerase
           domain of yeast vacuolar transport chaperone (VTC)
           proteins VTC-2, and -3 , and similar proteins.
           Saccharomyces cerevisiae VTC-1, -2, -3, and -4 comprise
           the membrane-integral VTC complex. VTC-2, -3, and -4
           contain polyP polymerase domains. S. cerevisiae
           VTC-2,and -3 belong to this subgroup. For VTC4 it has
           been shown that this domain generates polyP from ATP by
           a phosphotransfer reaction releasing ADP. This activity
           is metal ion-dependent. The ATP gamma phosphate may be
           cleaved and then transferred to an acceptor phosphate to
           form polyP. PolyP is ubiquitous. In prokaryotes, it is a
           store of phosphate and energy. In eukaryotes, polyPs
           have roles in  bone calcification, and osmoregulation,
           and in phosphate transport in the symbiosis of
           mycorrhizal fungi and plants. This subgroup belongs to
           the CYTH/triphosphate tunnel metalloenzyme (TTM)-like
           superfamily, whose enzymes have a unique active site
           located within an eight-stranded beta barrel.
          Length = 303

 Score = 26.2 bits (58), Expect = 9.6
 Identities = 18/78 (23%), Positives = 36/78 (46%), Gaps = 7/78 (8%)

Query: 89  KETEKAGVLYEDNIKKEEEEKEEEEEEEVKE-------EEKEEEEEKEKKEEEEKEKMEE 141
           K ++K  +  E     E     EE++ ++KE         K + E+K +K E+    +E 
Sbjct: 89  KLSDKPDIFVEKKTFDENTSSFEEDKLQLKEKYINGFIFGKYKFEKKLQKMEKRGADLEN 148

Query: 142 EEKEEEETKYLVRDVHLE 159
            +K+ E  +  +R+  L+
Sbjct: 149 LKKDVENIQDFIRENKLQ 166


>gnl|CDD|224124 COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense
           mechanisms].
          Length = 733

 Score = 26.3 bits (58), Expect = 9.9
 Identities = 13/49 (26%), Positives = 22/49 (44%)

Query: 98  YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
                +  E  K   +E+E   + KE  + ++  +EE  E + EE KE 
Sbjct: 392 LGKGREVVENAKFCPKEDEPGLKRKERVDVEDGPQEELIELISEEVKEG 440


>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
          Length = 553

 Score = 26.1 bits (58), Expect = 10.0
 Identities = 11/50 (22%), Positives = 19/50 (38%)

Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYL 152
           K+ EEEK  +EE +        +    K    + E M+  +      K +
Sbjct: 20  KQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADDETMDPTQYRANRLKKV 69


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 26.2 bits (57), Expect = 10.0
 Identities = 13/40 (32%), Positives = 26/40 (65%)

Query: 108 EKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
           EK +    E K ++ +++E+KEK++E +K+K +E E  + 
Sbjct: 188 EKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFKS 227


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.302    0.127    0.343 

Gapped
Lambda     K      H
   0.267   0.0812    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 8,720,894
Number of extensions: 896263
Number of successful extensions: 27656
Number of sequences better than 10.0: 1
Number of HSP's gapped: 13840
Number of HSP's successfully gapped: 4275
Length of query: 163
Length of database: 10,937,602
Length adjustment: 89
Effective length of query: 74
Effective length of database: 6,990,096
Effective search space: 517267104
Effective search space used: 517267104
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 17 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (21.7 bits)
S2: 55 (24.8 bits)