RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy10931
         (139 letters)



>gnl|CDD|238681 cd01385, MYSc_type_IX, Myosin motor domain, type IX myosins. Myosin
           IX is a processive single-headed motor, which might play
           a role in signalling. This catalytic (head) domain has
           ATPase activity and belongs to the larger group of
           P-loop NTPases. Myosins are actin-dependent molecular
           motors that play important roles in muscle contraction,
           cell motility, and organelle transport. The head domain
           is a molecular motor, which utilizes ATP hydrolysis to
           generate directed movement toward the plus end along
           actin filaments. A cyclical interaction between myosin
           and actin provides the driving force. Rates of ATP
           hydrolysis and consequently the speed of movement along
           actin filaments vary widely, from about 0.04 micrometer
           per second for myosin I to 4.5 micrometer per second for
           myosin II in skeletal muscle. Myosin II moves in
           discrete steps about 5-10 nm long and generates 1-5
           piconewtons of force. Upon ATP binding, the myosin head
           dissociates from an actin filament. ATP hydrolysis
           causes the head to pivot and associate with a new actin
           subunit. The release of Pi causes the head to pivot and
           move the filament (power stroke). Release of ADP
           completes the cycle.
          Length = 692

 Score =  118 bits (298), Expect = 5e-32
 Identities = 43/67 (64%), Positives = 54/67 (80%)

Query: 2   TFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLL 61
           +FEQLCINYANE LQYYFNQH+F+ EQEEY  EGI W +IE++DN  C+QL   KP GLL
Sbjct: 394 SFEQLCINYANEQLQYYFNQHIFKLEQEEYQGEGITWTNIEYTDNVGCIQLFSKKPTGLL 453

Query: 62  CVLDDQA 68
            +LD+++
Sbjct: 454 YLLDEES 460


>gnl|CDD|214580 smart00242, MYSc, Myosin. Large ATPases.  ATPase; molecular motor.
           Muscle contraction consists of a cyclical interaction
           between myosin and actin. The core of the myosin
           structure is similar in fold to that of kinesin.
          Length = 677

 Score =  118 bits (297), Expect = 8e-32
 Identities = 41/68 (60%), Positives = 53/68 (77%)

Query: 2   TFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLL 61
           +FEQLCINYANE LQ +FNQHVF+ EQEEY +EGI W  I+F DN  C+ L+E KP G+L
Sbjct: 386 SFEQLCINYANEKLQQFFNQHVFKLEQEEYEREGIDWTFIDFFDNQDCIDLIEKKPPGIL 445

Query: 62  CVLDDQAK 69
            +LD++ +
Sbjct: 446 SLLDEECR 453


>gnl|CDD|238071 cd00124, MYSc, Myosin motor domain. This catalytic (head) domain
           has ATPase activity and belongs to the larger group of
           P-loop NTPases. Myosins are actin-dependent molecular
           motors that play important roles in muscle contraction,
           cell motility, and organelle transport. The head domain
           is a molecular motor, which utilizes ATP hydrolysis to
           generate directed movement toward the plus end along
           actin filaments. A cyclical interaction between myosin
           and actin provides the driving force. Rates of ATP
           hydrolysis and consequently the speed of movement along
           actin filaments vary widely, from about 0.04 micrometer
           per second for myosin I to 4.5 micrometer per second for
           myosin II in skeletal muscle. Myosin II moves in
           discrete steps about 5-10 nm long and generates 1-5
           piconewtons of force. Upon ATP binding, the myosin head
           dissociates from an actin filament. ATP hydrolysis
           causes the head to pivot and associate with a new actin
           subunit. The release of Pi causes the head to pivot and
           move the filament (power stroke). Release of ADP
           completes the cycle.
          Length = 679

 Score =  111 bits (281), Expect = 1e-29
 Identities = 44/76 (57%), Positives = 55/76 (72%), Gaps = 2/76 (2%)

Query: 2   TFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLL 61
           +FEQLCINYANE LQ +FNQHVF+ EQEEY +EGI W  I+F+DN   + L+E KP GLL
Sbjct: 379 SFEQLCINYANEKLQQFFNQHVFKLEQEEYQEEGIDWESIDFTDNQEVIDLIEKKPGGLL 438

Query: 62  CVLDDQ--AKKRRDEK 75
            +LD++    K  DE 
Sbjct: 439 SLLDEECLFPKGTDET 454


>gnl|CDD|238677 cd01381, MYSc_type_VII, Myosin motor domain, type VII myosins.
           Myosins in this group have been associated with
           functions in sensory systems such as vision and hearing.
           This catalytic (head) domain has ATPase activity and
           belongs to the larger group of P-loop NTPases. Myosins
           are actin-dependent molecular motors that play important
           roles in muscle contraction, cell motility, and
           organelle transport. The head domain is a molecular
           motor, which utilizes ATP hydrolysis to generate
           directed movement toward the plus end along actin
           filaments. A cyclical interaction between myosin and
           actin provides the driving force. Rates of ATP
           hydrolysis and consequently the speed of movement along
           actin filaments vary widely, from about 0.04 micrometer
           per second for myosin I to 4.5 micrometer per second for
           myosin II in skeletal muscle. Myosin II moves in
           discrete steps about 5-10 nm long and generates 1-5
           piconewtons of force. Upon ATP binding, the myosin head
           dissociates from an actin filament. ATP hydrolysis
           causes the head to pivot and associate with a new actin
           subunit. The release of Pi causes the head to pivot and
           move the filament (power stroke). Release of ADP
           completes the cycle.
          Length = 671

 Score =  103 bits (258), Expect = 1e-26
 Identities = 37/68 (54%), Positives = 52/68 (76%)

Query: 2   TFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLL 61
           +FEQLCIN+ANE+LQ +F QH+F+ EQEEYN E I W+HIEF DN   L L+  KP  ++
Sbjct: 379 SFEQLCINFANENLQQFFVQHIFKLEQEEYNLEHINWQHIEFVDNQDALDLIAIKPLNIM 438

Query: 62  CVLDDQAK 69
            ++D+++K
Sbjct: 439 SLIDEESK 446


>gnl|CDD|215687 pfam00063, Myosin_head, Myosin head (motor domain). 
          Length = 679

 Score =  103 bits (258), Expect = 1e-26
 Identities = 38/69 (55%), Positives = 52/69 (75%), Gaps = 1/69 (1%)

Query: 2   TFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFS-DNTLCLQLVEGKPNGL 60
           +FEQLCINYANE LQ +FN H+F+ EQEEY +EGI W  I++  DN  C+ L+E KP G+
Sbjct: 380 SFEQLCINYANEKLQQFFNHHMFKLEQEEYVREGIAWTFIDYGLDNQACIDLIEKKPPGI 439

Query: 61  LCVLDDQAK 69
           L +LD++ +
Sbjct: 440 LSLLDEECR 448


>gnl|CDD|238680 cd01384, MYSc_type_XI, Myosin motor domain, plant-specific type XI
           myosin, involved in organelle transport. This catalytic
           (head) domain has ATPase activity and belongs to the
           larger group of P-loop NTPases. Myosins are
           actin-dependent molecular motors that play important
           roles in muscle contraction, cell motility, and
           organelle transport. The head domain is a molecular
           motor, which utilizes ATP hydrolysis to generate
           directed movement toward the plus end along actin
           filaments. A cyclical interaction between myosin and
           actin provides the driving force. Rates of ATP
           hydrolysis and consequently the speed of movement along
           actin filaments vary widely, from about 0.04 micrometer
           per second for myosin I to 4.5 micrometer per second for
           myosin II in skeletal muscle. Myosin II moves in
           discrete steps about 5-10 nm long and generates 1-5
           piconewtons of force. Upon ATP binding, the myosin head
           dissociates from an actin filament. ATP hydrolysis
           causes the head to pivot and associate with a new actin
           subunit. The release of Pi causes the head to pivot and
           move the filament (power stroke). Release of ADP
           completes the cycle.
          Length = 674

 Score =  102 bits (256), Expect = 3e-26
 Identities = 38/64 (59%), Positives = 46/64 (71%)

Query: 3   FEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLLC 62
           FEQ CIN  NE LQ +FNQHVF+ EQEEY KE I W +IEF DN   L L+E KP G++ 
Sbjct: 385 FEQFCINLTNEKLQQHFNQHVFKMEQEEYTKEEIDWSYIEFVDNQDVLDLIEKKPGGIIA 444

Query: 63  VLDD 66
           +LD+
Sbjct: 445 LLDE 448


>gnl|CDD|238676 cd01380, MYSc_type_V, Myosin motor domain, type V myosins. Myosins
           V transport a variety of intracellular cargo
           processively along actin filaments, such as membraneous
           organelles and mRNA. This catalytic (head) domain has
           ATPase activity and belongs to the larger group of
           P-loop NTPases. Myosins are actin-dependent molecular
           motors that play important roles in muscle contraction,
           cell motility, and organelle transport. The head domain
           is a molecular motor, which utilizes ATP hydrolysis to
           generate directed movement toward the plus end along
           actin filaments. A cyclical interaction between myosin
           and actin provides the driving force. Rates of ATP
           hydrolysis and consequently the speed of movement along
           actin filaments vary widely, from about 0.04 micrometer
           per second for myosin I to 4.5 micrometer per second for
           myosin II in skeletal muscle. Myosin II moves in
           discrete steps about 5-10 nm long and generates 1-5
           piconewtons of force. Upon ATP binding, the myosin head
           dissociates from an actin filament. ATP hydrolysis
           causes the head to pivot and associate with a new actin
           subunit. The release of Pi causes the head to pivot and
           move the filament (power stroke). Release of ADP
           completes the cycle.
          Length = 691

 Score =  102 bits (255), Expect = 4e-26
 Identities = 40/67 (59%), Positives = 49/67 (73%), Gaps = 1/67 (1%)

Query: 3   FEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLLC 62
           FEQ CINYANE LQ  FNQHVF+ EQEEY KEGI W  I+F DN  C+ L+E K  G+L 
Sbjct: 386 FEQFCINYANEKLQQQFNQHVFKLEQEEYLKEGIEWTFIDFYDNQPCIDLIESKL-GILS 444

Query: 63  VLDDQAK 69
           +LD++ +
Sbjct: 445 LLDEECR 451


>gnl|CDD|238683 cd01387, MYSc_type_XV, Myosin motor domain, type XV myosins. In
           vertebrates, myosin XV appears to be expressed in
           sensory tissue and play a role in hearing. This
           catalytic (head) domain has ATPase activity and belongs
           to the larger group of P-loop NTPases. Myosins are
           actin-dependent molecular motors that play important
           roles in muscle contraction, cell motility, and
           organelle transport. The head domain is a molecular
           motor, which utilizes ATP hydrolysis to generate
           directed movement toward the plus end along actin
           filaments. A cyclical interaction between myosin and
           actin provides the driving force. Rates of ATP
           hydrolysis and consequently the speed of movement along
           actin filaments vary widely, from about 0.04 micrometer
           per second for myosin I to 4.5 micrometer per second for
           myosin II in skeletal muscle. Myosin II moves in
           discrete steps about 5-10 nm long and generates 1-5
           piconewtons of force. Upon ATP binding, the myosin head
           dissociates from an actin filament. ATP hydrolysis
           causes the head to pivot and associate with a new actin
           subunit. The release of Pi causes the head to pivot and
           move the filament (power stroke). Release of ADP
           completes the cycle.
          Length = 677

 Score = 97.3 bits (242), Expect = 2e-24
 Identities = 39/66 (59%), Positives = 49/66 (74%)

Query: 2   TFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLL 61
           +FEQLCINYANE+LQY FN+ VFQ EQEEY +E + W  I F+DN   + L+  KP G+L
Sbjct: 378 SFEQLCINYANENLQYLFNKIVFQEEQEEYIREQLDWTEIAFADNQPVINLISLKPYGIL 437

Query: 62  CVLDDQ 67
            +LDDQ
Sbjct: 438 RILDDQ 443


>gnl|CDD|238679 cd01383, MYSc_type_VIII, Myosin motor domain, plant-specific type
           VIII myosins, a subgroup which has been associated with
           endocytosis, cytokinesis, cell-to-cell coupling and
           gating at plasmodesmata. This catalytic (head) domain
           has ATPase activity and belongs to the larger group of
           P-loop NTPases. Myosins are actin-dependent molecular
           motors that play important roles in muscle contraction,
           cell motility, and organelle transport. The head domain
           is a molecular motor, which utilizes ATP hydrolysis to
           generate directed movement toward the plus end along
           actin filaments. A cyclical interaction between myosin
           and actin provides the driving force. Rates of ATP
           hydrolysis and consequently the speed of movement along
           actin filaments vary widely, from about 0.04 micrometer
           per second for myosin I to 4.5 micrometer per second for
           myosin II in skeletal muscle. Myosin II moves in
           discrete steps about 5-10 nm long and generates 1-5
           piconewtons of force. Upon ATP binding, the myosin head
           dissociates from an actin filament. ATP hydrolysis
           causes the head to pivot and associate with a new actin
           subunit. The release of Pi causes the head to pivot and
           move the filament (power stroke). Release of ADP
           completes the cycle.
          Length = 677

 Score = 94.5 bits (235), Expect = 2e-23
 Identities = 39/67 (58%), Positives = 51/67 (76%)

Query: 2   TFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLL 61
           +FEQ CINYANE LQ +FN+H+F+ EQEEY ++GI W  +EF DN  CL L E KP GLL
Sbjct: 382 SFEQFCINYANERLQQHFNRHLFKLEQEEYEEDGIDWTKVEFEDNQECLDLFEKKPLGLL 441

Query: 62  CVLDDQA 68
            +LD+++
Sbjct: 442 SLLDEES 448


>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
          Length = 1463

 Score = 88.2 bits (219), Expect = 3e-21
 Identities = 39/67 (58%), Positives = 50/67 (74%), Gaps = 1/67 (1%)

Query: 3   FEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGK-PNGLL 61
           FEQLCINY NE LQ +FNQH+F+ EQEEY KEGI W  I++ DN  C+ L+E K P G+L
Sbjct: 446 FEQLCINYTNEKLQQFFNQHMFKLEQEEYVKEGIEWSFIDYFDNQPCIDLIEKKNPLGIL 505

Query: 62  CVLDDQA 68
            +LD++ 
Sbjct: 506 SLLDEEC 512


>gnl|CDD|238675 cd01379, MYSc_type_III, Myosin motor domain, type III myosins.
           Myosin III has been shown to play a role in  the vision
           process in insects and in hearing in mammals. Myosin
           III, an unconventional myosin, does not form dimers.
           This catalytic (head) domain has ATPase activity and
           belongs to the larger group of P-loop NTPases. Myosins
           are actin-dependent molecular motors that play important
           roles in muscle contraction, cell motility, and
           organelle transport. The head domain is a molecular
           motor, which utilizes ATP hydrolysis to generate
           directed movement toward the plus end along actin
           filaments. A cyclical interaction between myosin and
           actin provides the driving force. Rates of ATP
           hydrolysis and consequently the speed of movement along
           actin filaments vary widely, from about 0.04 micrometer
           per second for myosin I to 4.5 micrometer per second for
           myosin II in skeletal muscle. Myosin II moves in
           discrete steps about 5-10 nm long and generates 1-5
           piconewtons of force. Upon ATP binding, the myosin head
           dissociates from an actin filament. ATP hydrolysis
           causes the head to pivot and associate with a new actin
           subunit. The release of Pi causes the head to pivot and
           move the filament (power stroke). Release of ADP
           completes the cycle.
          Length = 653

 Score = 88.0 bits (218), Expect = 3e-21
 Identities = 36/68 (52%), Positives = 50/68 (73%)

Query: 2   TFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLL 61
           +FEQLCIN ANE +QYYFNQH+F +EQ+EY  EG+  R +E+ DN   L +   KP GLL
Sbjct: 390 SFEQLCINIANEQIQYYFNQHIFAWEQQEYLNEGVDARLVEYEDNRPLLDMFLQKPLGLL 449

Query: 62  CVLDDQAK 69
            +LD++++
Sbjct: 450 ALLDEESR 457


>gnl|CDD|238673 cd01377, MYSc_type_II, Myosin motor domain, type II myosins. Myosin
           II mediates cortical contraction in cell motility, and
           is the motor in smooth and skeletal muscle. This
           catalytic (head) domain has ATPase activity and belongs
           to the larger group of P-loop NTPases. Myosins are
           actin-dependent molecular motors that play important
           roles in muscle contraction, cell motility, and
           organelle transport. The head domain is a molecular
           motor, which utilizes ATP hydrolysis to generate
           directed movement toward the plus end along actin
           filaments. A cyclical interaction between myosin and
           actin provides the driving force. Rates of ATP
           hydrolysis and consequently the speed of movement along
           actin filaments vary widely, from about 0.04 micrometer
           per second for myosin I to 4.5 micrometer per second for
           myosin II in skeletal muscle. Myosin II moves in
           discrete steps about 5-10 nm long and generates 1-5
           piconewtons of force. Upon ATP binding, the myosin head
           dissociates from an actin filament. ATP hydrolysis
           causes the head to pivot and associate with a new actin
           subunit. The release of Pi causes the head to pivot and
           move the filament (power stroke). Release of ADP
           completes the cycle.
          Length = 693

 Score = 85.7 bits (213), Expect = 2e-20
 Identities = 35/66 (53%), Positives = 45/66 (68%), Gaps = 1/66 (1%)

Query: 3   FEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFS-DNTLCLQLVEGKPNGLL 61
           FEQLCINY NE LQ +FN H+F  EQEEY +EGI W  I+F  D    + L+E  P G+L
Sbjct: 392 FEQLCINYTNEKLQQFFNHHMFVLEQEEYQREGIEWTFIDFGLDLQPTIDLIEKNPMGIL 451

Query: 62  CVLDDQ 67
            +LD++
Sbjct: 452 SLLDEE 457


>gnl|CDD|238674 cd01378, MYSc_type_I, Myosin motor domain, type I myosins. Myosin I
           generates movement at the leading edge in cell motility,
           and class I myosins have been implicated in phagocytosis
           and vesicle transport. Myosin I, an unconventional
           myosin, does not form dimers. This catalytic (head)
           domain has ATPase activity and belongs to the larger
           group of P-loop NTPases. Myosins are actin-dependent
           molecular motors that play important roles in muscle
           contraction, cell motility, and organelle transport. The
           head domain is a molecular motor, which utilizes ATP
           hydrolysis to generate directed movement toward the plus
           end along actin filaments. A cyclical interaction
           between myosin and actin provides the driving force.
           Rates of ATP hydrolysis and consequently the speed of
           movement along actin filaments vary widely, from about
           0.04 micrometer per second for myosin I to 4.5
           micrometer per second for myosin II in skeletal muscle.
           Myosin II moves in discrete steps about 5-10 nm long and
           generates 1-5 piconewtons of force. Upon ATP binding,
           the myosin head dissociates from an actin filament. ATP
           hydrolysis causes the head to pivot and associate with a
           new actin subunit. The release of Pi causes the head to
           pivot and move the filament (power stroke). Release of
           ADP completes the cycle.
          Length = 674

 Score = 84.6 bits (210), Expect = 4e-20
 Identities = 33/65 (50%), Positives = 43/65 (66%), Gaps = 1/65 (1%)

Query: 3   FEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGK-PNGLL 61
           FEQ CINY NE LQ  F +   + EQEEY +EGI+W  IE+ +N +   L+EGK P G+ 
Sbjct: 384 FEQFCINYVNEKLQQIFIELTLKAEQEEYVREGIKWTPIEYFNNKIVCDLIEGKRPPGIF 443

Query: 62  CVLDD 66
            +LDD
Sbjct: 444 SILDD 448


>gnl|CDD|238678 cd01382, MYSc_type_VI, Myosin motor domain, type VI myosins. Myosin
           VI is a monomeric myosin, which moves towards the
           minus-end of actin filaments, in contrast to most other
           myosins. It has been implicated in endocytosis,
           secretion, and cell migration. This catalytic (head)
           domain has ATPase activity and belongs to the larger
           group of P-loop NTPases. Myosins are actin-dependent
           molecular motors that play important roles in muscle
           contraction, cell motility, and organelle transport. The
           head domain is a molecular motor, which utilizes ATP
           hydrolysis to generate directed movement toward the
           minus end along actin filaments. A cyclical interaction
           between myosin and actin provides the driving force.
           Rates of ATP hydrolysis and consequently the speed of
           movement along actin filaments vary widely, from about
           0.04 micrometer per second for myosin I to 4.5
           micrometer per second for myosin II in skeletal muscle.
           Myosin II moves in discrete steps about 5-10 nm long and
           generates 1-5 piconewtons of force. Upon ATP binding,
           the myosin head dissociates from an actin filament. ATP
           hydrolysis causes the head to pivot and associate with a
           new actin subunit. The release of Pi causes the head to
           pivot and move the filament (power stroke). Release of
           ADP completes the cycle.
          Length = 717

 Score = 81.8 bits (202), Expect = 4e-19
 Identities = 30/68 (44%), Positives = 46/68 (67%)

Query: 2   TFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLL 61
           +FEQ CINY NE LQ +FN+ + + EQE Y +EG+    + + DN  C+ L+E K NG+L
Sbjct: 414 SFEQFCINYCNEKLQQFFNERILKEEQELYQREGLGVNEVHYVDNQDCIDLIEAKLNGIL 473

Query: 62  CVLDDQAK 69
            +LD++ +
Sbjct: 474 DILDEENR 481


>gnl|CDD|240229 PTZ00014, PTZ00014, myosin-A; Provisional.
          Length = 821

 Score = 57.0 bits (138), Expect = 2e-10
 Identities = 30/74 (40%), Positives = 40/74 (54%), Gaps = 2/74 (2%)

Query: 4   EQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLLCV 63
           EQL IN  NE LQ  F   VF+ E + Y  EGI    +E++ N   + L+ GK   +L +
Sbjct: 481 EQLFINITNEMLQKNFVDIVFERESKLYKDEGISTEELEYTSNESVIDLLCGKGKSVLSI 540

Query: 64  LDDQ--AKKRRDEK 75
           L+DQ  A    DEK
Sbjct: 541 LEDQCLAPGGTDEK 554


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 46.1 bits (109), Expect = 1e-06
 Identities = 29/96 (30%), Positives = 49/96 (51%), Gaps = 5/96 (5%)

Query: 27  EQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLLCVLDDQAKKRRDEKEEKEGGREGGE 86
           E E   + G +  H   S+       V+ +        ++Q + ++DEK     G +GG 
Sbjct: 802 EHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEK-----GVDGGG 856

Query: 87  EMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEE 122
              G +   EE+E EE EEEEE+++EE+ +++E EE
Sbjct: 857 GSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 45.4 bits (107), Expect = 2e-06
 Identities = 28/84 (33%), Positives = 37/84 (44%), Gaps = 1/84 (1%)

Query: 56  KPNGLLCVLDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKA 115
           +P   +  L D +K    E E   G R G E  +  E  GE  E   GE E+E + E K 
Sbjct: 619 RPVAKVMALGDLSKGDVAEAEHT-GERTGEEGERPTEAEGENGEESGGEAEQEGETETKG 677

Query: 116 KKKEQEEEGGGGGGEQEKEEEREG 139
           + + + E      GEQE E E E 
Sbjct: 678 ENESEGEIPAERKGEQEGEGEIEA 701



 Score = 44.6 bits (105), Expect = 4e-06
 Identities = 22/76 (28%), Positives = 38/76 (50%), Gaps = 2/76 (2%)

Query: 65  DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQ--EE 122
           +    K   E E + G ++  E     + +  E + E GE+E   + + +AK+ E+  + 
Sbjct: 795 EGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDG 854

Query: 123 EGGGGGGEQEKEEERE 138
            GG  GG+ E+EEE E
Sbjct: 855 GGGSDGGDSEEEEEEE 870



 Score = 41.9 bits (98), Expect = 3e-05
 Identities = 26/78 (33%), Positives = 39/78 (50%), Gaps = 3/78 (3%)

Query: 65  DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
           + +  +   E E  E G E  +E +G+ E   E E E   +E E + E +A+ KE E+EG
Sbjct: 722 EAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEG 781

Query: 125 ---GGGGGEQEKEEEREG 139
               G  GE + +E  EG
Sbjct: 782 EIQAGEDGEMKGDEGAEG 799



 Score = 41.5 bits (97), Expect = 4e-05
 Identities = 23/79 (29%), Positives = 35/79 (44%), Gaps = 8/79 (10%)

Query: 66  DQAKKRRDEKE-----EKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQ 120
           +   +R+ E+E     E +     GE    + E+  E E E  E+E E +  E+ ++ E 
Sbjct: 684 EIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVED 743

Query: 121 EEEGGGGGGEQEKEEEREG 139
           E EG   G     E E EG
Sbjct: 744 EGEGEAEGKH---EVETEG 759



 Score = 39.6 bits (92), Expect = 2e-04
 Identities = 28/77 (36%), Positives = 39/77 (50%), Gaps = 2/77 (2%)

Query: 65  DDQAKKRRDEKEEKEGGRE--GGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEE 122
           + +AK+   + E +    E  G  E +G E+ GE +  EEGEE E++ + E   K E E 
Sbjct: 698 EIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVET 757

Query: 123 EGGGGGGEQEKEEEREG 139
           EG     E E E E EG
Sbjct: 758 EGDRKETEHEGETEAEG 774



 Score = 36.9 bits (85), Expect = 0.002
 Identities = 20/74 (27%), Positives = 35/74 (47%)

Query: 66  DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG 125
            +  +R  E E + G   GGE  +  E   + +   EGE   E+K E++ + + + +E  
Sbjct: 646 GEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEAD 705

Query: 126 GGGGEQEKEEEREG 139
             G  + +E E EG
Sbjct: 706 HKGETEAEEVEHEG 719



 Score = 36.9 bits (85), Expect = 0.002
 Identities = 20/56 (35%), Positives = 34/56 (60%), Gaps = 7/56 (12%)

Query: 83  EGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEERE 138
           E  ++ KG +  G  D  +  EEEEE+++EE+ +++E+EE       E+E+EE  E
Sbjct: 844 EAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEE-------EEEEEENEE 892



 Score = 36.5 bits (84), Expect = 0.002
 Identities = 23/71 (32%), Positives = 33/71 (46%), Gaps = 1/71 (1%)

Query: 67  QAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVE-EGEEEEEKKKEEKAKKKEQEEEGG 125
           +A+    E+   E  +EG  E KG+ E+  E   E +GE+E E + E K    + E E  
Sbjct: 654 EAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAE 713

Query: 126 GGGGEQEKEEE 136
               E E E E
Sbjct: 714 EVEHEGETEAE 724



 Score = 36.5 bits (84), Expect = 0.002
 Identities = 19/75 (25%), Positives = 34/75 (45%)

Query: 65  DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
           D + +   +E E +      G E +G+ E GEE E  E E E E + + + + +   +E 
Sbjct: 705 DHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKET 764

Query: 125 GGGGGEQEKEEEREG 139
              G  + + +E E 
Sbjct: 765 EHEGETEAEGKEDED 779



 Score = 36.1 bits (83), Expect = 0.004
 Identities = 18/65 (27%), Positives = 26/65 (40%)

Query: 74  EKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEK 133
           E E K      G+  + + E   E E +E E+E E +  E  + K  E   G    E E 
Sbjct: 748 EAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGET 807

Query: 134 EEERE 138
           E   +
Sbjct: 808 EAGEK 812



 Score = 35.7 bits (82), Expect = 0.005
 Identities = 20/74 (27%), Positives = 33/74 (44%)

Query: 66  DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG 125
            +A++  + + + E   EG    + K E   E E+E  E + + + E +  + E E E  
Sbjct: 665 GEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAE 724

Query: 126 GGGGEQEKEEEREG 139
           G   E E E   EG
Sbjct: 725 GTEDEGEIETGEEG 738



 Score = 35.4 bits (81), Expect = 0.006
 Identities = 21/65 (32%), Positives = 31/65 (47%), Gaps = 1/65 (1%)

Query: 74  EKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEK 133
           E + KE   EG  E +GKE+  E  E++ GE+ E K  E    K E E E   G  ++ +
Sbjct: 758 EGDRKETEHEGETEAEGKEDEDE-GEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHE 816

Query: 134 EEERE 138
            +   
Sbjct: 817 GQSET 821



 Score = 35.4 bits (81), Expect = 0.007
 Identities = 17/62 (27%), Positives = 32/62 (51%)

Query: 74  EKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEK 133
           +++E EG  + GE+ + K + G E +VE   E E  +K+E   + E + +      E  +
Sbjct: 775 KEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGE 834

Query: 134 EE 135
           +E
Sbjct: 835 QE 836



 Score = 34.2 bits (78), Expect = 0.013
 Identities = 23/64 (35%), Positives = 31/64 (48%), Gaps = 2/64 (3%)

Query: 75  KEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKE 134
           + E EG      E KG++E   E E +E + + E + EE   + E E EG    GE E  
Sbjct: 678 ENESEGEIPA--ERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETG 735

Query: 135 EERE 138
           EE E
Sbjct: 736 EEGE 739



 Score = 32.7 bits (74), Expect = 0.046
 Identities = 19/74 (25%), Positives = 38/74 (51%)

Query: 65  DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
             + +    E +++ G +E   E +G+ +  E+     G  +    +EE+ +++E+EEE 
Sbjct: 818 QSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEE 877

Query: 125 GGGGGEQEKEEERE 138
                E+E+EEE E
Sbjct: 878 EEEEEEEEEEEENE 891



 Score = 30.3 bits (68), Expect = 0.31
 Identities = 21/68 (30%), Positives = 33/68 (48%), Gaps = 3/68 (4%)

Query: 74  EKEEKEGGREGGEEMKGKEE---NGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGE 130
            +E +E   EG  E +GK E    G+  E E   E E + KE++ + + Q  E G   G+
Sbjct: 735 GEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGD 794

Query: 131 QEKEEERE 138
           +  E + E
Sbjct: 795 EGAEGKVE 802


>gnl|CDD|238682 cd01386, MYSc_type_XVIII, Myosin motor domain, type XVIII myosins.
           This catalytic (head) domain has ATPase activity and
           belongs to the larger group of P-loop NTPases. Myosins
           are actin-dependent molecular motors that play important
           roles in muscle contraction, cell motility, and
           organelle transport. The head domain is a molecular
           motor, which utilizes ATP hydrolysis to generate
           directed movement toward the plus end along actin
           filaments. A cyclical interaction between myosin and
           actin provides the driving force. Rates of ATP
           hydrolysis and consequently the speed of movement along
           actin filaments vary widely, from about 0.04 micrometer
           per second for myosin I to 4.5 micrometer per second for
           myosin II in skeletal muscle. Myosin II moves in
           discrete steps about 5-10 nm long and generates 1-5
           piconewtons of force. Upon ATP binding, the myosin head
           dissociates from an actin filament. ATP hydrolysis
           causes the head to pivot and associate with a new actin
           subunit. The release of Pi causes the head to pivot and
           move the filament (power stroke). Release of ADP
           completes the cycle.
          Length = 767

 Score = 43.3 bits (102), Expect = 1e-05
 Identities = 25/83 (30%), Positives = 34/83 (40%), Gaps = 13/83 (15%)

Query: 2   TFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLL 61
           TFE+LC NY  E LQ  F+   F    E Y +EG+    +EF        L E  P   +
Sbjct: 396 TFEELCHNYLQERLQLLFHHRTFVQPLERYAEEGVE---VEFD-------LAEPSPGTTV 445

Query: 62  CVLD---DQAKKRRDEKEEKEGG 81
            ++D    Q       + E   G
Sbjct: 446 ALVDQAPQQVVVPAGLRAEDARG 468


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 40.7 bits (96), Expect = 9e-05
 Identities = 22/62 (35%), Positives = 39/62 (62%), Gaps = 2/62 (3%)

Query: 64  LDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
           ++   KKR +EK+EK+         K +EE  EE E +E E+EEE+++ E+ K++E+E++
Sbjct: 416 VEKAEKKREEEKKEKKKKAF--AGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKK 473

Query: 124 GG 125
             
Sbjct: 474 KK 475



 Score = 39.5 bits (93), Expect = 2e-04
 Identities = 18/63 (28%), Positives = 37/63 (58%)

Query: 67  QAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGG 126
           +    + EK+ +E  +E  ++    ++  EE+E E+ ++EEEK++EE+  ++E+EEE   
Sbjct: 413 KKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEK 472

Query: 127 GGG 129
              
Sbjct: 473 KKK 475



 Score = 34.9 bits (81), Expect = 0.008
 Identities = 26/66 (39%), Positives = 41/66 (62%), Gaps = 3/66 (4%)

Query: 69  KKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGG 128
           KK  ++ E+K    E  +E K K   G++ E EE EEE+EKK+EEK +++E+ EE     
Sbjct: 413 KKIVEKAEKKR--EEEKKEKKKKAFAGKKKEEEE-EEEKEKKEEEKEEEEEEAEEEKEEE 469

Query: 129 GEQEKE 134
            E++K+
Sbjct: 470 EEKKKK 475



 Score = 34.5 bits (80), Expect = 0.012
 Identities = 21/66 (31%), Positives = 37/66 (56%)

Query: 74  EKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEK 133
           +  +K        E K +EE  E+ +     +++E+++EE+ +KKE+E+E      E+EK
Sbjct: 407 KATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEK 466

Query: 134 EEEREG 139
           EEE E 
Sbjct: 467 EEEEEK 472



 Score = 33.7 bits (78), Expect = 0.020
 Identities = 21/67 (31%), Positives = 35/67 (52%), Gaps = 1/67 (1%)

Query: 73  DEKEEKE-GGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQ 131
            E+E +   G +   +   K     E + EE ++E++KK     KK+E+EEE      E+
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454

Query: 132 EKEEERE 138
           ++EEE E
Sbjct: 455 KEEEEEE 461


>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
          Length = 746

 Score = 38.2 bits (89), Expect = 7e-04
 Identities = 29/99 (29%), Positives = 46/99 (46%), Gaps = 21/99 (21%)

Query: 48  LC--LQLVEG---KPNGL-----LCVLD---DQAKKRRDEKEEKEGGREGG-------EE 87
           LC  L LVEG   + N L     L +LD      +   +++E+  G            EE
Sbjct: 181 LCLDLDLVEGLLERKNELPHLKKLIILDTLIKSKEININKEEKNNGSNVNNNGNKNNKEE 240

Query: 88  MKGKEENGEEDEVEEGEEEEEKKKEEKAKK-KEQEEEGG 125
            KG + + E +++  G  E +K+K EK K  KE+ ++ G
Sbjct: 241 QKGNDLSNELEDISLGPLEYDKEKLEKIKDLKEKAKKLG 279


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
           Provisional.
          Length = 2849

 Score = 36.2 bits (83), Expect = 0.004
 Identities = 16/60 (26%), Positives = 34/60 (56%)

Query: 69  KKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGG 128
           ++R     E++        +   +E+ +ED+ +E ++EEE+++EE+ K  + E+E   GG
Sbjct: 134 RRRARHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGG 193



 Score = 31.9 bits (72), Expect = 0.10
 Identities = 18/57 (31%), Positives = 32/57 (56%)

Query: 82  REGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEERE 138
           R   EE     +N   D+ +E E+E++  +E+  +++E+EEE  G   E E++E  E
Sbjct: 138 RHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGE 194



 Score = 27.3 bits (60), Expect = 3.9
 Identities = 21/80 (26%), Positives = 36/80 (45%)

Query: 57  PNGLLCVLDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAK 116
           P     + DD   +  D+ +E++   E  EE + K  + E++E E GE+   +K E    
Sbjct: 147 PRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYEKSEVDKT 206

Query: 117 KKEQEEEGGGGGGEQEKEEE 136
              +  E G G    +K +E
Sbjct: 207 DCFKFIEAGAGDDVFDKIDE 226


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 35.3 bits (81), Expect = 0.006
 Identities = 27/75 (36%), Positives = 41/75 (54%), Gaps = 5/75 (6%)

Query: 69  KKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG--- 125
              ++ +EE+    E  EE + +EE  E +E EEGE+EEE+++ E     E+E EG    
Sbjct: 434 MASQESEEEESVEEEEEEEEEEEEEEQESEE-EEGEDEEEEEEVEADNGSEEEMEGSSEG 492

Query: 126 -GGGGEQEKEEEREG 139
            G G E E++ ER  
Sbjct: 493 DGDGEEPEEDAERRN 507



 Score = 31.8 bits (72), Expect = 0.095
 Identities = 23/72 (31%), Positives = 34/72 (47%)

Query: 67  QAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGG 126
           Q    R     K     G       +E+ EE+ VEE EEEEE+++EE+ + +E+E E   
Sbjct: 412 QGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEE 471

Query: 127 GGGEQEKEEERE 138
              E E +   E
Sbjct: 472 EEEEVEADNGSE 483



 Score = 26.8 bits (59), Expect = 4.4
 Identities = 19/74 (25%), Positives = 38/74 (51%), Gaps = 1/74 (1%)

Query: 67  QAKKRR-DEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG 125
           Q ++R+       +  +      +      +E E EE  EEEE+++EE+ +++++ EE  
Sbjct: 407 QERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEE 466

Query: 126 GGGGEQEKEEEREG 139
           G   E+E+E E + 
Sbjct: 467 GEDEEEEEEVEADN 480



 Score = 26.4 bits (58), Expect = 5.6
 Identities = 16/74 (21%), Positives = 38/74 (51%)

Query: 65  DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
           +++ ++  +++ E+E G +  EE + + +NG E+E+E   E +   +E +   + +  E 
Sbjct: 451 EEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEM 510

Query: 125 GGGGGEQEKEEERE 138
            G     E ++ R 
Sbjct: 511 AGISRMSEGQQPRG 524


>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho;
           Provisional.
          Length = 672

 Score = 34.1 bits (79), Expect = 0.013
 Identities = 17/75 (22%), Positives = 31/75 (41%)

Query: 65  DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
             QA++RR+  E    G        G++   E         EEE++ E + +   ++ + 
Sbjct: 124 AAQARERRERGEAARRGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQA 183

Query: 125 GGGGGEQEKEEEREG 139
               GE+ + EER  
Sbjct: 184 EAERGERGRREERGR 198


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 32.6 bits (75), Expect = 0.038
 Identities = 16/55 (29%), Positives = 30/55 (54%), Gaps = 3/55 (5%)

Query: 66  DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQ 120
           ++   +  E+E +E  +E  EE K +E   +  ++     EE++K EEK +KK+ 
Sbjct: 269 EEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSP---EEQRKLEEKERKKQA 320



 Score = 32.6 bits (75), Expect = 0.048
 Identities = 16/38 (42%), Positives = 24/38 (63%)

Query: 101 EEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEERE 138
           EE +EE ++KKEEK K++ + +       EQ K EE+E
Sbjct: 278 EERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKE 315


>gnl|CDD|223079 PHA03419, PHA03419, E4 protein; Provisional.
          Length = 200

 Score = 32.6 bits (74), Expect = 0.042
 Identities = 22/73 (30%), Positives = 32/73 (43%), Gaps = 5/73 (6%)

Query: 69  KKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKA--KKKEQEEEGGG 126
           KK+  +K+E E   +GGE     ++  E     EG E E+   E+       E E EGG 
Sbjct: 93  KKKEKKKKETEKPAQGGE---KPDQGPEAKGEGEGHEPEDPPPEDTPPPPGGEGEVEGGP 149

Query: 127 GGGEQEKEEEREG 139
             G      ++EG
Sbjct: 150 SPGPGPGPLDQEG 162


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 32.7 bits (74), Expect = 0.045
 Identities = 22/56 (39%), Positives = 27/56 (48%), Gaps = 4/56 (7%)

Query: 70  KRRDEKEEKEGGREGGEEMKGKEENGEEDEVE-EGEEEEEKKKEEKAKKKEQEEEG 124
           K+R+E  EK   RE   E K +EE   E E E E E E E++ E  AK      E 
Sbjct: 579 KKREEAVEK-AKREA--EQKAREEREREKEKEKEREREREREAERAAKASSSSHES 631



 Score = 32.4 bits (73), Expect = 0.063
 Identities = 17/43 (39%), Positives = 29/43 (67%), Gaps = 8/43 (18%)

Query: 97  EDEVEEGEEE-EEKKKEEKAKKKEQEEEGGGGGGEQEKEEERE 138
           E+ VE+ + E E+K +EE+ ++KE+E+E       +E+E ERE
Sbjct: 582 EEAVEKAKREAEQKAREEREREKEKEKE-------RERERERE 617



 Score = 31.2 bits (70), Expect = 0.15
 Identities = 17/49 (34%), Positives = 29/49 (59%), Gaps = 7/49 (14%)

Query: 89  KGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEER 137
           K +EE  E+ + E  ++  E+++ EK K+KE+E        E+E+E ER
Sbjct: 579 KKREEAVEKAKREAEQKAREEREREKEKEKERER-------EREREAER 620


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score = 32.0 bits (73), Expect = 0.069
 Identities = 13/48 (27%), Positives = 22/48 (45%)

Query: 92  EENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEEREG 139
            E  +EDE EE E+  ++ +EE+ +  E+E       G  E   +   
Sbjct: 367 FEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGS 414



 Score = 30.1 bits (68), Expect = 0.30
 Identities = 16/69 (23%), Positives = 31/69 (44%)

Query: 69  KKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGG 128
            KRR   +  +      +E + +E+  +E E EEGE+ EE+  + +     +     G  
Sbjct: 356 DKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSD 415

Query: 129 GEQEKEEER 137
            E + ++E 
Sbjct: 416 SESKADKES 424



 Score = 29.3 bits (66), Expect = 0.57
 Identities = 16/71 (22%), Positives = 26/71 (36%), Gaps = 5/71 (7%)

Query: 66  DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG 125
           D  +   DE EE+E         + +EE GE+ E E  +  E+   E  +      E   
Sbjct: 366 DFEEVDEDEDEEEE-----QRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKA 420

Query: 126 GGGGEQEKEEE 136
                 + + E
Sbjct: 421 DKESASDSDSE 431



 Score = 28.9 bits (65), Expect = 0.84
 Identities = 14/52 (26%), Positives = 23/52 (44%)

Query: 87  EMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEERE 138
             +    + EE + +E EEEE++  E + ++ E  EE G    E    E   
Sbjct: 359 RARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSS 410



 Score = 27.8 bits (62), Expect = 2.1
 Identities = 17/72 (23%), Positives = 32/72 (44%), Gaps = 1/72 (1%)

Query: 64  LDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
           + D+ + R D  + +E   +  EE + + +  EE+E  E  EEE  +  E    +   + 
Sbjct: 354 MRDKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEE-GEDSEEEGSQSREDGSSESSSDV 412

Query: 124 GGGGGGEQEKEE 135
           G     + +KE 
Sbjct: 413 GSDSESKADKES 424



 Score = 26.2 bits (58), Expect = 6.4
 Identities = 15/59 (25%), Positives = 30/59 (50%), Gaps = 1/59 (1%)

Query: 64  LDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEV-EEGEEEEEKKKEEKAKKKEQE 121
            +D+ +++R ++ E+E G +  EE     E+G  +   + G + E K  +E A   + E
Sbjct: 373 DEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSDSE 431


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 32.2 bits (73), Expect = 0.070
 Identities = 15/35 (42%), Positives = 21/35 (60%)

Query: 88  MKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEE 122
               E   E+  + E E E +KKKEEKAK+KE ++
Sbjct: 1   GSRTESEAEKKILTEEELERKKKKEEKAKEKELKK 35



 Score = 26.8 bits (59), Expect = 5.3
 Identities = 19/64 (29%), Positives = 33/64 (51%), Gaps = 3/64 (4%)

Query: 74  EKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEK 133
           E E+K    E  E  K KEE  +E E+++    +  +KE KAK + Q+   G    ++ +
Sbjct: 7   EAEKKILTEEELERKKKKEEKAKEKELKK---LKAAQKEAKAKLQAQQASDGTNVPKKSE 63

Query: 134 EEER 137
           ++ R
Sbjct: 64  KKSR 67


>gnl|CDD|240327 PTZ00243, PTZ00243, ABC transporter; Provisional.
          Length = 1560

 Score = 32.4 bits (74), Expect = 0.071
 Identities = 10/52 (19%), Positives = 18/52 (34%), Gaps = 2/52 (3%)

Query: 79  EGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGE 130
                  +E K  +E   + EV E +           +    ++EG   GG+
Sbjct: 875 ATLAAELKENKDSKEGDADAEVAEVDAAPG--GAVDHEPPVAKQEGNAEGGD 924



 Score = 27.1 bits (60), Expect = 3.6
 Identities = 13/59 (22%), Positives = 21/59 (35%), Gaps = 8/59 (13%)

Query: 68  AKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGG 126
           A    + KE K+         K  + + E  EV+         +   AK++   E G G
Sbjct: 875 ATLAAELKENKDS--------KEGDADAEVAEVDAAPGGAVDHEPPVAKQEGNAEGGDG 925


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 32.0 bits (73), Expect = 0.076
 Identities = 20/48 (41%), Positives = 30/48 (62%)

Query: 76  EEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
           EE+E  +E  +E + K  + EE+  EE E+EE+KKK +K K+   E E
Sbjct: 38  EEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWE 85



 Score = 28.2 bits (63), Expect = 1.6
 Identities = 16/41 (39%), Positives = 28/41 (68%)

Query: 97  EDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEER 137
           E E E  +EEEE++KEEK +++E+  +      E+E++EE+
Sbjct: 30  EVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEK 70



 Score = 26.3 bits (58), Expect = 7.7
 Identities = 17/47 (36%), Positives = 28/47 (59%)

Query: 91  KEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEER 137
           KE   E  + EE EE+EEKK+EE+    ++EE       E++K++ +
Sbjct: 29  KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTK 75


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 31.8 bits (73), Expect = 0.083
 Identities = 14/45 (31%), Positives = 22/45 (48%), Gaps = 3/45 (6%)

Query: 85  GEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGG 129
            EE+K               EEEE+++EE+ +++  EEE   G G
Sbjct: 285 DEELKEVLSA---QAQAAAAEEEEEEEEEEEEEEPSEEEAAAGLG 326


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 31.7 bits (72), Expect = 0.10
 Identities = 14/54 (25%), Positives = 31/54 (57%)

Query: 64  LDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKK 117
              +  + +DE +  E      +E K K+E  E+ + E+ +E++EK+++++ KK
Sbjct: 381 YTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434



 Score = 29.8 bits (67), Expect = 0.40
 Identities = 12/52 (23%), Positives = 27/52 (51%)

Query: 75  KEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGG 126
           K+  E   E     + + +  EE   +E  E+++K++ ++ K+K Q++E   
Sbjct: 383 KKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434



 Score = 27.5 bits (61), Expect = 3.0
 Identities = 13/62 (20%), Positives = 31/62 (50%)

Query: 77  EKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEE 136
               G +  E +K   +  ++   + GE ++E    E+A+ K +EE+      E++++E+
Sbjct: 360 TDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQ 419

Query: 137 RE 138
            +
Sbjct: 420 AD 421


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 31.4 bits (72), Expect = 0.11
 Identities = 20/73 (27%), Positives = 32/73 (43%), Gaps = 1/73 (1%)

Query: 64  LDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
           L D+      E+EE E G     E K K    +    ++ E+EE +K   K K +   ++
Sbjct: 387 LHDKPLLAEGEEEEGENGNLSPAERK-KLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKK 445

Query: 124 GGGGGGEQEKEEE 136
             G  GE +K + 
Sbjct: 446 AKGPDGETKKVDP 458


>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
          Length = 701

 Score = 31.6 bits (71), Expect = 0.13
 Identities = 21/47 (44%), Positives = 30/47 (63%)

Query: 77  EKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
           EKE   E  E+ K  +E+GEE +VEE +E +E KK++  K KE  +E
Sbjct: 212 EKEVTDEDEEDTKKADEDGEEPKVEEVKEGDEGKKKKTKKVKEVTKE 258


>gnl|CDD|212055 cd11486, SLC5sbd_SGLT1, Na(+)/glucose cotransporter SGLT1;solute
           binding domain.  Human SGLT1 (hSGLT1) is a
           high-affinity/low-capacity glucose transporter, which
           can also transport galactose. In the transport
           mechanism, two Na+ ions first bind to the extracellular
           side of the transporter and induce a conformational
           change in the glucose binding site. This results in an
           increased affinity for glucose. A second conformational
           change in the transporter follows, bringing the Na+ and
           glucose binding sites to the inner surface of the
           membrane. Glucose is then released, followed by the Na+
           ions. In the process, hSGLT1 is also able to transport
           water and urea and may be a major pathway for transport
           of these across the intestinal brush-border membrane.
           hSGLT1 is encoded by the SLC5A1 gene and expressed
           mostly in the intestine, but also in the trachea,
           kidney, heart, brain, testis, and prostate. The
           WHO/UNICEF oral rehydration solution (ORS) for the
           treatment of secretory diarrhea contains salt and
           glucose. The glucose, along with sodium ions, is
           transported by hSGLT1 and water is either co-transported
           along with these or follows by osmosis. Mutations in
           SGLT1 are associated with intestinal glucose galactose
           malabsorption (GGM). Up-regulation of intestinal SGLT1
           may protect against enteric infections. SGLT1 is
           expressed in colorectal, head and neck, and prostate
           tumors. Epidermal growth factor receptor (EGFR)
           functions in cell survival by stabilizing SGLT1, and
           thereby maintaining intracellular glucose levels. SGLT1
           is predicted to have 14 membrane-spanning regions. This
           subgroup belongs to the solute carrier 5
           (SLC5)transporter family.
          Length = 635

 Score = 31.4 bits (71), Expect = 0.14
 Identities = 17/66 (25%), Positives = 26/66 (39%), Gaps = 21/66 (31%)

Query: 72  RDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKK--------------EEKAKK 117
           R+  EE+        ++   +   +EDE E   +EE KK               + KA K
Sbjct: 538 RNSTEERI-------DLDADDWTEDEDENEMETDEERKKPGCCRKAYNWFCGFDQGKAPK 590

Query: 118 KEQEEE 123
             +EEE
Sbjct: 591 LTEEEE 596



 Score = 30.6 bits (69), Expect = 0.26
 Identities = 10/59 (16%), Positives = 22/59 (37%), Gaps = 14/59 (23%)

Query: 94  NGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG--------------GGGGEQEKEEERE 138
           + EE    + ++  E + E + +  E+ ++ G              G   +  +EEE  
Sbjct: 540 STEERIDLDADDWTEDEDENEMETDEERKKPGCCRKAYNWFCGFDQGKAPKLTEEEEAA 598


>gnl|CDD|218747 pfam05786, Cnd2, Condensin complex subunit 2.  This family consists
           of several Barren protein homologues from several
           eukaryotic organisms. In Drosophila Barren (barr) is
           required for sister-chromatid segregation in mitosis.
           barr encodes a novel protein that is present in
           proliferating cells and has homologues in yeast and
           human. Mitotic defects in barr embryos become apparent
           during cycle 16, resulting in a loss of PNS and CNS
           neurons. Centromeres move apart at the
           metaphase-anaphase transition and Cyclin B is degraded,
           but sister chromatids remain connected, resulting in
           chromatin bridging. Barren protein localises to
           chromatin throughout mitosis. Colocalisation and
           biochemical experiments indicate that Barren associates
           with Topoisomerase II throughout mitosis and alters the
           activity of Topoisomerase II. It has been suggested that
           this association is required for proper chromosomal
           segregation by facilitating the decatenation of
           chromatids at anaphase. This family forms one of the
           three non-structural maintenance of chromosomes (SMC)
           subunits of the mitotic condensation complex along with
           Cnd1 and Cnd3.
          Length = 719

 Score = 31.3 bits (71), Expect = 0.15
 Identities = 21/48 (43%), Positives = 25/48 (52%), Gaps = 9/48 (18%)

Query: 73  DEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQ 120
           D KE  E GREG            ED+ EEG + EE   ++KAKKK Q
Sbjct: 153 DNKEAPETGREGD---------DGEDDDEEGSDGEEDGAKKKAKKKRQ 191


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 30.9 bits (70), Expect = 0.16
 Identities = 17/59 (28%), Positives = 30/59 (50%)

Query: 67  QAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG 125
             ++  + +E KE  +E       +++   E E EE EEE E++ EE + + E+E E  
Sbjct: 55  DDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEK 113



 Score = 30.1 bits (68), Expect = 0.25
 Identities = 22/73 (30%), Positives = 37/73 (50%), Gaps = 1/73 (1%)

Query: 65  DDQAKKRRDEKEEKEGGREGGEEMK-GKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
               +   DE+E K+   +   E++  KEE  E    E+ E++ + +KE++  ++E EEE
Sbjct: 39  SPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEE 98

Query: 124 GGGGGGEQEKEEE 136
                 E EKE E
Sbjct: 99  DEESSDENEKETE 111



 Score = 28.2 bits (63), Expect = 1.2
 Identities = 18/70 (25%), Positives = 32/70 (45%)

Query: 69  KKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGG 128
              +   +E+E  +   +E    EE  EE++     E++E K + + + +E EEE     
Sbjct: 40  PSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEED 99

Query: 129 GEQEKEEERE 138
            E   E E+E
Sbjct: 100 EESSDENEKE 109



 Score = 26.6 bits (59), Expect = 4.6
 Identities = 17/60 (28%), Positives = 30/60 (50%)

Query: 65  DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
            ++ K+   E    E   + G+  K  EE+ EE+E E+ E  +E +KE + K +   E+ 
Sbjct: 62  IEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVEKE 121


>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 926

 Score = 31.1 bits (70), Expect = 0.17
 Identities = 11/66 (16%), Positives = 22/66 (33%)

Query: 58  NGLLCVLDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKK 117
                +        R +++ KE G    E          E E    ++ EEK  +  A  
Sbjct: 782 VNTAVLSTTIKAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAIL 841

Query: 118 KEQEEE 123
             ++++
Sbjct: 842 NVKKKK 847


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 30.8 bits (70), Expect = 0.17
 Identities = 21/79 (26%), Positives = 39/79 (49%), Gaps = 7/79 (8%)

Query: 65  DDQAKKRRDEKEEKEGG--REGGEEMKGKEENGEEDEVEEGEEEE-----EKKKEEKAKK 117
           +D+ K++R E++ +                ++  +D+  E E        EK K+E+A++
Sbjct: 102 EDEIKRKRIEEDARNSDADDSDSSSDSDSSDDDSDDDDSEDETAALLRELEKIKKERAEE 161

Query: 118 KEQEEEGGGGGGEQEKEEE 136
           KE+EEE      E+ +EEE
Sbjct: 162 KEREEEEKAAEEEKAREEE 180


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 31.1 bits (71), Expect = 0.17
 Identities = 12/80 (15%), Positives = 33/80 (41%), Gaps = 3/80 (3%)

Query: 58  NGLLCVLDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKK 117
           +  + + DD      +E+ + E   +       ++++ +E+  EE EE  + + +E    
Sbjct: 95  SDQVVIFDDG---DIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDD 151

Query: 118 KEQEEEGGGGGGEQEKEEER 137
            + +++        E+   R
Sbjct: 152 DDDDDDIATRERSLERRRRR 171



 Score = 30.0 bits (68), Expect = 0.36
 Identities = 10/65 (15%), Positives = 32/65 (49%), Gaps = 6/65 (9%)

Query: 65  DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEE------KKKEEKAKKK 118
            D A +  +E ++ +   +  +E   K E+ E+D+ ++ +++        +++  + + +
Sbjct: 116 YDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLERRRRRREWE 175

Query: 119 EQEEE 123
           E+  E
Sbjct: 176 EKRAE 180


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 30.9 bits (69), Expect = 0.20
 Identities = 24/74 (32%), Positives = 44/74 (59%)

Query: 65   DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
            +++ K  + +K+E E  ++  E  K +EEN  +   E  + EE+KKK E+AKK E++E+ 
Sbjct: 1630 EEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKK 1689

Query: 125  GGGGGEQEKEEERE 138
                 ++E EE ++
Sbjct: 1690 AAEALKKEAEEAKK 1703



 Score = 30.5 bits (68), Expect = 0.31
 Identities = 22/74 (29%), Positives = 43/74 (58%)

Query: 65   DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
            ++  KK  + K+ +E  ++  E +K + E  ++ E  + +E EEKKK E+ KK E+E + 
Sbjct: 1671 EEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKI 1730

Query: 125  GGGGGEQEKEEERE 138
                 ++E EE+++
Sbjct: 1731 KAEEAKKEAEEDKK 1744



 Score = 29.7 bits (66), Expect = 0.50
 Identities = 22/74 (29%), Positives = 41/74 (55%)

Query: 65   DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
            +++ K    +K E+E   +  EE K  EE+ ++ E  +  EE+EKK  E  KK+ +E + 
Sbjct: 1644 EEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKK 1703

Query: 125  GGGGGEQEKEEERE 138
                 ++E EE+++
Sbjct: 1704 AEELKKKEAEEKKK 1717



 Score = 29.7 bits (66), Expect = 0.55
 Identities = 19/71 (26%), Positives = 35/71 (49%)

Query: 66   DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG 125
            D+AKK+ +E ++ +  ++  EE K K +  ++   E  +  E  K E +A   E E    
Sbjct: 1305 DEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEE 1364

Query: 126  GGGGEQEKEEE 136
                 ++K+EE
Sbjct: 1365 KAEAAEKKKEE 1375



 Score = 29.0 bits (64), Expect = 0.88
 Identities = 22/68 (32%), Positives = 37/68 (54%)

Query: 70   KRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGG 129
            K  +E ++ E  ++  EE K  EE+ ++      +E EE KK E+ KKKE EE+      
Sbjct: 1662 KAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEEL 1721

Query: 130  EQEKEEER 137
            ++ +EE +
Sbjct: 1722 KKAEEENK 1729



 Score = 29.0 bits (64), Expect = 1.1
 Identities = 23/88 (26%), Positives = 49/88 (55%)

Query: 51   QLVEGKPNGLLCVLDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKK 110
            +  E +   ++ + +++ K + +E ++ E  +   EE+K  EE  ++ E  + +E EEKK
Sbjct: 1588 KAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKK 1647

Query: 111  KEEKAKKKEQEEEGGGGGGEQEKEEERE 138
            K E+ KK E+E +       ++ EE+++
Sbjct: 1648 KAEELKKAEEENKIKAAEEAKKAEEDKK 1675



 Score = 28.6 bits (63), Expect = 1.1
 Identities = 22/86 (25%), Positives = 41/86 (47%)

Query: 53   VEGKPNGLLCVLDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKE 112
             + K   L     ++ KK  + K+ +E  +    E   K E  ++   E  + EE++KK 
Sbjct: 1631 EKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKA 1690

Query: 113  EKAKKKEQEEEGGGGGGEQEKEEERE 138
             +A KKE EE       ++++ EE++
Sbjct: 1691 AEALKKEAEEAKKAEELKKKEAEEKK 1716



 Score = 28.6 bits (63), Expect = 1.4
 Identities = 24/78 (30%), Positives = 42/78 (53%), Gaps = 4/78 (5%)

Query: 65   DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKK----EQ 120
             ++AKK  ++K++ E  ++  E+ K   E  +++  E  + EE KKKE + KKK    ++
Sbjct: 1664 AEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKK 1723

Query: 121  EEEGGGGGGEQEKEEERE 138
             EE      E+ K+E  E
Sbjct: 1724 AEEENKIKAEEAKKEAEE 1741



 Score = 28.2 bits (62), Expect = 1.9
 Identities = 21/74 (28%), Positives = 42/74 (56%)

Query: 65   DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
            +D+ K     K+E E  ++  E  K + E  ++ E  +  EEE K K E+AKK+ +E++ 
Sbjct: 1685 EDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKK 1744

Query: 125  GGGGGEQEKEEERE 138
                 ++++EE+++
Sbjct: 1745 KAEEAKKDEEEKKK 1758



 Score = 27.8 bits (61), Expect = 2.2
 Identities = 18/56 (32%), Positives = 32/56 (57%)

Query: 66   DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQE 121
            ++AKK+ D  ++K   ++  +E K K E  ++   E  +    KKK ++AKKK +E
Sbjct: 1374 EEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEE 1429



 Score = 27.0 bits (59), Expect = 3.9
 Identities = 19/73 (26%), Positives = 44/73 (60%)

Query: 66   DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG 125
            ++AKK+ DE ++    ++  +E K  EE  + DE ++ EE ++  + +KA++K++ +E  
Sbjct: 1493 EEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELK 1552

Query: 126  GGGGEQEKEEERE 138
                 ++ EE+++
Sbjct: 1553 KAEELKKAEEKKK 1565



 Score = 27.0 bits (59), Expect = 4.1
 Identities = 22/70 (31%), Positives = 39/70 (55%)

Query: 69   KKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGG 128
            KK  +EK++ E  ++  EE K K E  +++  E+ ++ EE KK+E+ KKK    +     
Sbjct: 1709 KKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEK 1768

Query: 129  GEQEKEEERE 138
              +E  +E+E
Sbjct: 1769 KAEEIRKEKE 1778



 Score = 27.0 bits (59), Expect = 4.5
 Identities = 18/71 (25%), Positives = 37/71 (52%)

Query: 66   DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG 125
            D+AKK+ +E ++ +  ++  EE K K +  ++    + + +E KK EE  K  E ++   
Sbjct: 1473 DEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEE 1532

Query: 126  GGGGEQEKEEE 136
                ++ K+ E
Sbjct: 1533 AKKADEAKKAE 1543



 Score = 27.0 bits (59), Expect = 4.7
 Identities = 18/71 (25%), Positives = 35/71 (49%)

Query: 66   DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG 125
            ++AKK+ D  ++K    +   E    E     DE E  EE+ E  +++K + K++ +   
Sbjct: 1325 EEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAK 1384

Query: 126  GGGGEQEKEEE 136
                E++K +E
Sbjct: 1385 KKAEEKKKADE 1395



 Score = 26.6 bits (58), Expect = 6.7
 Identities = 20/71 (28%), Positives = 37/71 (52%)

Query: 66   DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG 125
            ++AKK  +++++     +   E   K E  ++ E EE ++ EE KK E+  K + EE   
Sbjct: 1678 EEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKK 1737

Query: 126  GGGGEQEKEEE 136
                +++K EE
Sbjct: 1738 EAEEDKKKAEE 1748



 Score = 26.3 bits (57), Expect = 7.3
 Identities = 18/55 (32%), Positives = 33/55 (60%)

Query: 66   DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQ 120
            D+AKK+ +E ++ E  ++  EE K  +E  ++ E  +  +E +KK EE  KK ++
Sbjct: 1447 DEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADE 1501


>gnl|CDD|221775 pfam12794, MscS_TM, Mechanosensitive ion channel inner membrane
           domain 1.  The small mechanosensitive channel, MscS, is
           a part of the turgor-driven solute efflux system that
           protects bacteria from lysis in the event of osmotic
           shock. The MscS protein alone is sufficient to form a
           functional mechanosensitive channel gated directly by
           tension in the lipid bilayer. The MscS proteins are
           heptamers of three transmembrane subunits with seven
           converging M3 domains, and this domain is one of the
           inner membrane domains.
          Length = 339

 Score = 30.7 bits (70), Expect = 0.21
 Identities = 8/28 (28%), Positives = 13/28 (46%)

Query: 109 KKKEEKAKKKEQEEEGGGGGGEQEKEEE 136
            K+ E   ++ +EEE    G  +  EE 
Sbjct: 265 AKRAEILAQRAEEEEESSEGAAETIEEP 292


>gnl|CDD|237626 PRK14154, PRK14154, heat shock protein GrpE; Provisional.
          Length = 208

 Score = 30.5 bits (68), Expect = 0.21
 Identities = 18/35 (51%), Positives = 22/35 (62%)

Query: 105 EEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEEREG 139
           E + K K EK  + E+E+E GGG G QE E  REG
Sbjct: 14  ETKAKNKWEKVMEAEEEQEEGGGDGSQEMEPHREG 48


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 30.4 bits (69), Expect = 0.26
 Identities = 20/65 (30%), Positives = 32/65 (49%)

Query: 73  DEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQE 132
           DE  + EG     E  K  E +  EDE E+ E  ++ K++   +  E++EE      E E
Sbjct: 109 DEDSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAE 168

Query: 133 KEEER 137
            E+E+
Sbjct: 169 AEKEK 173


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 30.4 bits (69), Expect = 0.31
 Identities = 18/70 (25%), Positives = 32/70 (45%), Gaps = 2/70 (2%)

Query: 69  KKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKK--KEEKAKKKEQEEEGGG 126
           KK ++  E +E   E   E++   E     + +EG  EE+     EEK    + E+    
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKL 283

Query: 127 GGGEQEKEEE 136
              +++K+EE
Sbjct: 284 EILKEKKDEE 293



 Score = 28.4 bits (64), Expect = 1.4
 Identities = 11/33 (33%), Positives = 16/33 (48%)

Query: 106 EEEKKKEEKAKKKEQEEEGGGGGGEQEKEEERE 138
           E E+++EE   + E   E  G   EQE   E +
Sbjct: 231 ETEEREEETDVEIETTSETKGTKQEQEGSTEED 263



 Score = 28.0 bits (63), Expect = 1.7
 Identities = 15/71 (21%), Positives = 30/71 (42%), Gaps = 5/71 (7%)

Query: 66  DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG 125
            +  +  + +EE +   E   E K     G + E E   EE+     E+ +  ++ E+  
Sbjct: 227 KETSETEEREEETDVEIETTSETK-----GTKQEQEGSTEEDPSLFSEEKEDPDKTEDLD 281

Query: 126 GGGGEQEKEEE 136
                +EK++E
Sbjct: 282 KLEILKEKKDE 292



 Score = 27.7 bits (62), Expect = 2.3
 Identities = 16/46 (34%), Positives = 22/46 (47%), Gaps = 2/46 (4%)

Query: 95  GEEDEVEEGEEEEEKKKE--EKAKKKEQEEEGGGGGGEQEKEEERE 138
            E  E EE EEE + + E   + K  +QE+EG          EE+E
Sbjct: 227 KETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKE 272



 Score = 26.1 bits (58), Expect = 7.7
 Identities = 12/46 (26%), Positives = 21/46 (45%)

Query: 93  ENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEERE 138
           E  EE+   E E   E K  ++ ++   EE+      E+E  ++ E
Sbjct: 233 EEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTE 278


>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT.  This family
           consists of several bacterial cobalamin biosynthesis
           (CobT) proteins. CobT is involved in the transformation
           of precorrin-3 into cobyrinic acid.
          Length = 282

 Score = 29.8 bits (67), Expect = 0.40
 Identities = 9/52 (17%), Positives = 21/52 (40%)

Query: 87  EMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEERE 138
           +M    +  EE   E    + E  ++E   K++++++ G        +   E
Sbjct: 201 DMLSSMDMAEELGDEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSE 252



 Score = 27.1 bits (60), Expect = 3.2
 Identities = 9/42 (21%), Positives = 20/42 (47%)

Query: 65  DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEE 106
           D+   K  ++ ++ E    G  +   ++ +   +E+E GE E
Sbjct: 226 DEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEME 267


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 29.3 bits (66), Expect = 0.45
 Identities = 11/50 (22%), Positives = 24/50 (48%), Gaps = 1/50 (2%)

Query: 89  KGKEENGE-EDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEER 137
              +E  E E+E  +  E+E+  + +   K++++ E      E+EK +  
Sbjct: 93  SSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTE 142



 Score = 26.3 bits (58), Expect = 4.7
 Identities = 15/55 (27%), Positives = 27/55 (49%)

Query: 65  DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKE 119
            D  ++  +E+   E  +E   E K + +  ++ EV + + E+EK K E  K K 
Sbjct: 94  SDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKP 148


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 29.6 bits (67), Expect = 0.47
 Identities = 19/67 (28%), Positives = 38/67 (56%)

Query: 73  DEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQE 132
           +  ++   G   GEE + +EE+G +DE EE ++++ +++EE     ++EE+      + E
Sbjct: 307 EPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDE 366

Query: 133 KEEEREG 139
            +EE E 
Sbjct: 367 DDEEEEE 373



 Score = 29.2 bits (66), Expect = 0.66
 Identities = 15/52 (28%), Positives = 28/52 (53%)

Query: 73  DEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
           D+  E+E       + +  EE+ + D+ ++ EEEEE+K+++K K  E     
Sbjct: 339 DDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSE 390



 Score = 28.8 bits (65), Expect = 0.96
 Identities = 17/51 (33%), Positives = 30/51 (58%)

Query: 73  DEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
           D+ ++ E   E  +    +E+  +ED  +E +EEEE++++EK KKK  E  
Sbjct: 337 DDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAEST 387



 Score = 28.4 bits (64), Expect = 1.5
 Identities = 15/56 (26%), Positives = 27/56 (48%)

Query: 64  LDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKE 119
           +D     R D+  + +   +   + +     G ED+ E+ EE+ E+KK +K   KE
Sbjct: 126 IDKDDDVRDDDDFDDDDLGDLASDDRAAHFGGGEDDEEDEEEQPERKKSKKEVMKE 181



 Score = 28.0 bits (63), Expect = 1.9
 Identities = 15/51 (29%), Positives = 29/51 (56%)

Query: 73  DEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
           D+ + +E   +     + ++E  E+ + E+ EEEEE++KE+K KK  +   
Sbjct: 338 DDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTR 388



 Score = 27.7 bits (62), Expect = 2.4
 Identities = 22/74 (29%), Positives = 36/74 (48%), Gaps = 3/74 (4%)

Query: 65  DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
           D+      D     +G  +  EE  G ++  EED+ ++ EEEEE   +     +E++EE 
Sbjct: 304 DEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEE---DVDLSDEEEDEED 360

Query: 125 GGGGGEQEKEEERE 138
                E ++EEE E
Sbjct: 361 EDSDDEDDEEEEEE 374



 Score = 27.3 bits (61), Expect = 3.0
 Identities = 18/58 (31%), Positives = 36/58 (62%)

Query: 65  DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEE 122
           +D+     D +EE+E      EE   ++E+ ++++ EE EEEE++KK++K+ +  + E
Sbjct: 333 EDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSE 390



 Score = 25.7 bits (57), Expect = 9.9
 Identities = 18/74 (24%), Positives = 31/74 (41%)

Query: 63  VLDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEE 122
                  +  +E+EE     E  E+     E  EED     EEE+E+ ++   +  E+EE
Sbjct: 313 NFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEE 372

Query: 123 EGGGGGGEQEKEEE 136
           E      +++  E 
Sbjct: 373 EEEKEKKKKKSAES 386


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 29.8 bits (68), Expect = 0.47
 Identities = 9/49 (18%), Positives = 23/49 (46%)

Query: 73  DEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQE 121
            +   +E     G E++  +++ +E+E E+  ++     E +  +K  E
Sbjct: 174 VDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEKVLE 222



 Score = 26.3 bits (59), Expect = 6.0
 Identities = 13/53 (24%), Positives = 22/53 (41%)

Query: 69  KKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQE 121
           ++R  E  +        E+        EE + +E EEEEE + ++     E E
Sbjct: 163 ERRLRELIDGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESE 215


>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3.  This family of
           proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). SH3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of SH3, AAPs are retained in the ER.
          Length = 196

 Score = 29.2 bits (66), Expect = 0.51
 Identities = 10/38 (26%), Positives = 18/38 (47%), Gaps = 2/38 (5%)

Query: 96  EEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEK 133
           +   +EE   EE +     A K+E+  EG     +++K
Sbjct: 161 DAKLLEEFAAEEAEA--AAAAKEEESAEGEKKESKKKK 196



 Score = 26.9 bits (60), Expect = 3.7
 Identities = 9/34 (26%), Positives = 15/34 (44%)

Query: 100 VEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEK 133
            +    EE   +E +A    +EEE   G  ++ K
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESK 193



 Score = 26.5 bits (59), Expect = 4.8
 Identities = 12/41 (29%), Positives = 19/41 (46%), Gaps = 3/41 (7%)

Query: 72  RDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKE 112
             + +  E   E   E        +E+E  EGE++E KKK+
Sbjct: 159 WKDAKLLE---EFAAEEAEAAAAAKEEESAEGEKKESKKKK 196


>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
           (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
           phosphoribosyltransferase) [Coenzyme metabolism].
          Length = 620

 Score = 29.4 bits (66), Expect = 0.52
 Identities = 17/74 (22%), Positives = 33/74 (44%)

Query: 65  DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
            D A++  D+  E++   E G++ +       E   EE E  +E +++E      + EEG
Sbjct: 216 MDMAEETGDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEG 275

Query: 125 GGGGGEQEKEEERE 138
                E  ++ E +
Sbjct: 276 EMDAAEASEDSESD 289



 Score = 27.9 bits (62), Expect = 1.9
 Identities = 16/75 (21%), Positives = 32/75 (42%), Gaps = 4/75 (5%)

Query: 65  DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
           DD  ++  DE++  +   +  E+     E G E+     E EE++ +    + +E E + 
Sbjct: 224 DDGIEEDADEEDGDDDQPDNNED----SEAGREESEGSDESEEDEAEATDGEGEEGEMDA 279

Query: 125 GGGGGEQEKEEEREG 139
                + E +E  E 
Sbjct: 280 AEASEDSESDESDED 294


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 29.3 bits (66), Expect = 0.53
 Identities = 14/55 (25%), Positives = 35/55 (63%)

Query: 64  LDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKK 118
           L ++ +K + E EEK+  +   ++ K K++  ++ + ++ +++ EKK E++A+ K
Sbjct: 67  LAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDK 121



 Score = 28.5 bits (64), Expect = 0.98
 Identities = 10/82 (12%), Positives = 37/82 (45%)

Query: 40  HIEFSDNTLCLQLVEGKPNGLLCVLDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDE 99
                D      + + +         + A++    K+E E  ++   + K  ++  ++D+
Sbjct: 40  PSHLQDRHFATPIYDAEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDK 99

Query: 100 VEEGEEEEEKKKEEKAKKKEQE 121
            ++ +++++K +++  K+ E +
Sbjct: 100 DKKDDKKDDKSEKKDEKEAEDK 121



 Score = 27.3 bits (61), Expect = 2.1
 Identities = 14/53 (26%), Positives = 32/53 (60%)

Query: 86  EEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEERE 138
           EE++  ++  EE +  + ++++ KKK++K K K+ +++      + EKE E +
Sbjct: 69  EEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDK 121



 Score = 26.2 bits (58), Expect = 5.7
 Identities = 9/56 (16%), Positives = 28/56 (50%)

Query: 67  QAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEE 122
           + K++   K++K   ++  ++ K  ++  ++ E ++ +E E+K ++      E   
Sbjct: 79  EEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLS 134



 Score = 26.2 bits (58), Expect = 5.8
 Identities = 10/54 (18%), Positives = 33/54 (61%)

Query: 69  KKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEE 122
           KK  +EK++ +  ++  ++ K K+++ ++D+ ++  E++++K+ E   +   + 
Sbjct: 75  KKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKS 128


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 29.6 bits (66), Expect = 0.64
 Identities = 15/55 (27%), Positives = 28/55 (50%), Gaps = 3/55 (5%)

Query: 73   DEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEE---KKKEEKAKKKEQEEEG 124
            DE  +++   +  E+ +   E+G E+ V+E EE  E   K  EE  + +  E++ 
Sbjct: 4044 DEDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQA 4098



 Score = 29.6 bits (66), Expect = 0.68
 Identities = 14/51 (27%), Positives = 25/51 (49%), Gaps = 3/51 (5%)

Query: 65   DDQAKKRRDEKEEKEGGREGGEEMKG--KEENGEEDEVEEGEEEEEKKKEE 113
            +++    + + +E E  RE   +M G  K+    E+E  + EEE +   EE
Sbjct: 3842 NEEDTANQSDLDESEA-RELESDMNGVTKDSVVSENENSDSEEENQDLDEE 3891



 Score = 29.2 bits (65), Expect = 0.77
 Identities = 17/87 (19%), Positives = 31/87 (35%), Gaps = 16/87 (18%)

Query: 64   LDDQAKKRRDEKEEKEGGREGGEEMKGKEENG----------------EEDEVEEGEEEE 107
            +D +A     E+ + E      +E   +E N                 +E   E+G EE 
Sbjct: 4011 MDMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEEN 4070

Query: 108  EKKKEEKAKKKEQEEEGGGGGGEQEKE 134
             ++ EE  +   + +E    G   E +
Sbjct: 4071 VQENEESTEDGVKSDEELEQGEVPEDQ 4097



 Score = 27.3 bits (60), Expect = 3.1
 Identities = 20/81 (24%), Positives = 33/81 (40%), Gaps = 6/81 (7%)

Query: 65   DDQAKKRRDEKEEK----EGGREGGEEMKGKEENGEEDEVE--EGEEEEEKKKEEKAKKK 118
            +D  K   D  EE     E   E G +   + E GE  E +  +   + + K    + + 
Sbjct: 4057 EDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQAIDNHPKMDAKSTFASAEA 4116

Query: 119  EQEEEGGGGGGEQEKEEEREG 139
            ++E    G  GE E+  E +G
Sbjct: 4117 DEENTDKGIVGENEELGEEDG 4137



 Score = 26.1 bits (57), Expect = 7.9
 Identities = 12/69 (17%), Positives = 28/69 (40%), Gaps = 4/69 (5%)

Query: 74   EKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKE----EKAKKKEQEEEGGGGGG 129
            ++E ++G     + +    +   +      E +EE   +    E  +  E++   G G  
Sbjct: 4085 DEELEQGEVPEDQAIDNHPKMDAKSTFASAEADEENTDKGIVGENEELGEEDGVRGNGTA 4144

Query: 130  EQEKEEERE 138
            + E E+ +E
Sbjct: 4145 DGEFEQVQE 4153


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 28.4 bits (64), Expect = 0.65
 Identities = 11/30 (36%), Positives = 19/30 (63%)

Query: 101 EEGEEEEEKKKEEKAKKKEQEEEGGGGGGE 130
               EE+++++EE+ +K+E EEE   G G 
Sbjct: 74  AAAAEEKKEEEEEEEEKEESEEEAAAGLGA 103



 Score = 27.2 bits (61), Expect = 1.4
 Identities = 10/28 (35%), Positives = 17/28 (60%)

Query: 101 EEGEEEEEKKKEEKAKKKEQEEEGGGGG 128
                EE+K++EE+ ++KE+ EE    G
Sbjct: 73  AAAAAEEKKEEEEEEEEKEESEEEAAAG 100


>gnl|CDD|215443 PLN02829, PLN02829, Probable galacturonosyltransferase.
          Length = 639

 Score = 29.4 bits (66), Expect = 0.68
 Identities = 16/85 (18%), Positives = 34/85 (40%)

Query: 39  RHIEFSDNTLCLQLVEGKPNGLLCVLDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEED 98
           + IE     L L         +L   DD    + D+  ++   + G ++   +E+N +  
Sbjct: 81  KTIEPDSQDLLLDKRGEHKARVLSATDDDTHSQTDDIIKQVTQKAGQDDSDQQEKNSQSQ 140

Query: 99  EVEEGEEEEEKKKEEKAKKKEQEEE 123
              + E  E  ++  +  +K  E+E
Sbjct: 141 SASQAESLEHVQQSAQTSEKVDEKE 165


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 29.2 bits (66), Expect = 0.77
 Identities = 13/60 (21%), Positives = 36/60 (60%)

Query: 64  LDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
           + +QA    D+ ++ +   +  ++    E++ E+D+ ++ ++E+E+KKE K  +K  +++
Sbjct: 126 VLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDD 185



 Score = 26.1 bits (58), Expect = 8.4
 Identities = 10/71 (14%), Positives = 36/71 (50%)

Query: 65  DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
           D     + D+ ++ +   +  ++    +++ E+D+ ++ +++ + + EEK + KE E+  
Sbjct: 123 DIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLS 182

Query: 125 GGGGGEQEKEE 135
                  ++++
Sbjct: 183 DDDDFVWDEDD 193



 Score = 25.7 bits (57), Expect = 9.5
 Identities = 8/75 (10%), Positives = 33/75 (44%)

Query: 64  LDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
              +     ++ ++ +   +  +      ++ ++DE ++ +++++   +E  +KKE +E 
Sbjct: 119 NYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKEL 178

Query: 124 GGGGGGEQEKEEERE 138
                 +    +E +
Sbjct: 179 EKLSDDDDFVWDEDD 193


>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 186

 Score = 28.6 bits (64), Expect = 0.80
 Identities = 18/74 (24%), Positives = 31/74 (41%), Gaps = 1/74 (1%)

Query: 66  DQAKKRRDE-KEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
            QA+ + +  KEEKEG     ++   K +   +    E E++  + + E   +K+  E  
Sbjct: 97  AQAEAKFEAWKEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAA 156

Query: 125 GGGGGEQEKEEERE 138
                E    EE E
Sbjct: 157 AVAAEEAAAAEEEE 170


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
           [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 28.8 bits (64), Expect = 0.81
 Identities = 16/77 (20%), Positives = 31/77 (40%)

Query: 62  CVLDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQE 121
            ++ D  +   +    +       EE   + +  EE+E EE   +   +   +  ++E+E
Sbjct: 155 NIVWDNDEDNDEAPPAQPDVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEE 214

Query: 122 EEGGGGGGEQEKEEERE 138
           E  G   GE   + E E
Sbjct: 215 EAEGSDDGEDVVDYEGE 231


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
           represents the L12P protein of the large (50S) subunit
           of the archaeal ribosome.
          Length = 105

 Score = 28.1 bits (63), Expect = 0.82
 Identities = 13/30 (43%), Positives = 20/30 (66%)

Query: 101 EEGEEEEEKKKEEKAKKKEQEEEGGGGGGE 130
              EEEEE+++EE+ +++E EEE   G G 
Sbjct: 73  AAAEEEEEEEEEEEEEEEESEEEAMAGLGA 102



 Score = 26.2 bits (58), Expect = 3.2
 Identities = 10/28 (35%), Positives = 18/28 (64%)

Query: 101 EEGEEEEEKKKEEKAKKKEQEEEGGGGG 128
               EEEE+++EE+ +++E+ EE    G
Sbjct: 72  AAAAEEEEEEEEEEEEEEEESEEEAMAG 99


>gnl|CDD|237496 PRK13766, PRK13766, Hef nuclease; Provisional.
          Length = 773

 Score = 29.1 bits (66), Expect = 0.89
 Identities = 21/73 (28%), Positives = 38/73 (52%), Gaps = 4/73 (5%)

Query: 71  RRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKK----EEKAKKKEQEEEGGG 126
           RR EK+ KE  +     +  K +  +E++  E EE++E+       +   K+E+EEE   
Sbjct: 490 RRKEKKMKEELKNLKGILNKKLQELDEEQKGEEEEKDEQLSLDDFVKSKGKEEEEEEEKE 549

Query: 127 GGGEQEKEEEREG 139
              ++ +E+E EG
Sbjct: 550 EKDKETEEDEPEG 562


>gnl|CDD|215595 PLN03130, PLN03130, ABC transporter C family member; Provisional.
          Length = 1622

 Score = 28.9 bits (65), Expect = 0.93
 Identities = 14/54 (25%), Positives = 26/54 (48%), Gaps = 2/54 (3%)

Query: 73  DEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAK--KKEQEEEG 124
           +E  E+ G  E  +       NG  + +++    ++K KE K+   K+E+ E G
Sbjct: 844 EEYVEENGEEEDDQTSSKPVANGNANNLKKDSSSKKKSKEGKSVLIKQEERETG 897



 Score = 26.6 bits (59), Expect = 6.3
 Identities = 15/62 (24%), Positives = 21/62 (33%), Gaps = 4/62 (6%)

Query: 77  EKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEE 136
           E  G  E   E    E   EED+    +           K    +++   G     K+EE
Sbjct: 838 ENAGKMEEYVE----ENGEEEDDQTSSKPVANGNANNLKKDSSSKKKSKEGKSVLIKQEE 893

Query: 137 RE 138
           RE
Sbjct: 894 RE 895


>gnl|CDD|235396 PRK05299, rpsB, 30S ribosomal protein S2; Provisional.
          Length = 258

 Score = 28.6 bits (65), Expect = 0.94
 Identities = 12/34 (35%), Positives = 21/34 (61%)

Query: 90  GKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
           G++    E   EE EE EE+++EE+ ++ E+ E 
Sbjct: 224 GRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEA 257



 Score = 26.3 bits (59), Expect = 5.3
 Identities = 15/35 (42%), Positives = 20/35 (57%)

Query: 81  GREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKA 115
           GR+G      +EE  E +E EE EEEEE ++ E  
Sbjct: 224 GRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258


>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34.  This family represents
           herpes virus protein U79 and cytomegalovirus early
           phosphoprotein P34 (UL112).
          Length = 238

 Score = 28.7 bits (64), Expect = 0.96
 Identities = 21/100 (21%), Positives = 42/100 (42%), Gaps = 7/100 (7%)

Query: 45  DNTLCLQLVEGKPNGLLCVLDDQAKKRRDEKEEKEG-----GREGGEEMK--GKEENGEE 97
           +  L +  V          + +    +  EK EKE       +   E  K  GK++    
Sbjct: 112 NGKLQVTYVTHSKVAHEAEIRNLGDVKNAEKFEKECRALSRKKSDDEHRKRSGKQKEKRR 171

Query: 98  DEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEER 137
            E  +  +E+ +KK+E+ ++ ++++  GGGGG    +   
Sbjct: 172 VEDSQKHKEDRRKKQEEKRRNDEDKRPGGGGGSSGGQSGL 211


>gnl|CDD|218556 pfam05327, RRN3, RNA polymerase I specific transcription initiation
           factor RRN3.  This family consists of several eukaryotic
           proteins which are homologous to the yeast RRN3 protein.
           RRN3 is one of the RRN genes specifically required for
           the transcription of rDNA by RNA polymerase I (Pol I) in
           Saccharomyces cerevisiae.
          Length = 554

 Score = 28.8 bits (65), Expect = 1.0
 Identities = 10/55 (18%), Positives = 27/55 (49%)

Query: 64  LDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKK 118
           LD + +   D+ +++E  R   +E    E++  + + ++ EE + + +     K+
Sbjct: 214 LDVEIQNELDDIDDEEEERVLADEDDDDEDDMFDMDDDDEEESDPEVERTSTIKE 268



 Score = 26.1 bits (58), Expect = 7.2
 Identities = 12/46 (26%), Positives = 22/46 (47%), Gaps = 1/46 (2%)

Query: 73  DEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEE-KKKEEKAKK 117
           DE+EE+    E  ++     +  ++DE E   E E     +E ++K
Sbjct: 227 DEEEERVLADEDDDDEDDMFDMDDDDEEESDPEVERTSTIKEVSEK 272


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 28.8 bits (64), Expect = 1.0
 Identities = 18/73 (24%), Positives = 35/73 (47%)

Query: 66  DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG 125
            + KK     EE +  +   +E   + E       E+ ++ EE +K+ + ++K+QEE+  
Sbjct: 80  QRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQAR 139

Query: 126 GGGGEQEKEEERE 138
               EQ+K+ E  
Sbjct: 140 KAAAEQKKKAEAA 152


>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
          Length = 413

 Score = 28.8 bits (64), Expect = 1.0
 Identities = 5/66 (7%), Positives = 11/66 (16%)

Query: 74  EKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEK 133
                    E    +  +  +G   E                  +    E       Q+ 
Sbjct: 104 TPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQP 163

Query: 134 EEEREG 139
               + 
Sbjct: 164 SSFLQP 169



 Score = 28.0 bits (62), Expect = 2.0
 Identities = 8/63 (12%), Positives = 10/63 (15%)

Query: 77  EKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEE 136
             E       E               G  E    +       +Q         E   EE 
Sbjct: 119 SPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEP 178

Query: 137 REG 139
              
Sbjct: 179 EPP 181



 Score = 27.6 bits (61), Expect = 2.2
 Identities = 8/72 (11%), Positives = 14/72 (19%)

Query: 67  QAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGG 126
           Q +   ++   +        E     E  E  +        E             EE   
Sbjct: 57  QVRAVAEQGHRQTESDTETAEESRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELAS 116

Query: 127 GGGEQEKEEERE 138
           G   +       
Sbjct: 117 GLSPENTSGSSP 128



 Score = 27.6 bits (61), Expect = 2.7
 Identities = 7/70 (10%), Positives = 13/70 (18%)

Query: 69  KKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGG 128
           K+ R +      G E             E+       E       ++             
Sbjct: 84  KEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSH 143

Query: 129 GEQEKEEERE 138
               +    E
Sbjct: 144 PGPHEPAPPE 153



 Score = 26.9 bits (59), Expect = 4.6
 Identities = 6/67 (8%), Positives = 12/67 (17%)

Query: 73  DEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQE 132
               E                  E    E       ++     +   ++          E
Sbjct: 125 GSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSE 184

Query: 133 KEEEREG 139
            E +  G
Sbjct: 185 PEPDSPG 191



 Score = 26.5 bits (58), Expect = 5.2
 Identities = 14/74 (18%), Positives = 15/74 (20%)

Query: 66  DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG 125
           D          EKE   +GG    G E  G       G  EE               E  
Sbjct: 72  DTETAEESRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESP 131

Query: 126 GGGGEQEKEEEREG 139
                        G
Sbjct: 132 ASHSPPPSPPSHPG 145



 Score = 26.1 bits (57), Expect = 7.0
 Identities = 13/78 (16%), Positives = 22/78 (28%), Gaps = 1/78 (1%)

Query: 63  VLDDQAKKRRDEKEEKEGGREGGEEMKGK-EENGEEDEVEEGEEEEEKKKEEKAKKKEQE 121
           V +   ++   + E  E  R G +E +G+   +G   E             E+       
Sbjct: 61  VAEQGHRQTESDTETAEESRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSP 120

Query: 122 EEGGGGGGEQEKEEEREG 139
           E   G   E         
Sbjct: 121 ENTSGSSPESPASHSPPP 138


>gnl|CDD|177447 PHA02664, PHA02664, hypothetical protein; Provisional.
          Length = 534

 Score = 28.8 bits (64), Expect = 1.0
 Identities = 18/75 (24%), Positives = 33/75 (44%), Gaps = 7/75 (9%)

Query: 71  RRDEKEEKEGGREGGEEMKGKEENGEEDEV-------EEGEEEEEKKKEEKAKKKEQEEE 123
           R D  E+    +E G+E    E++ +           +E +  ++   +E     E ++ 
Sbjct: 449 RADSDEDDMDEQESGDERADGEDDSDSSYSYSTTSSEDESDSADDSWGDESDSGIEHDDG 508

Query: 124 GGGGGGEQEKEEERE 138
           G G   E+E+EEER 
Sbjct: 509 GVGQAIEEEEEEERA 523


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 28.8 bits (64), Expect = 1.1
 Identities = 17/55 (30%), Positives = 28/55 (50%), Gaps = 2/55 (3%)

Query: 67  QAKKRRDEKEEKEGGREGGEEMKGKEENGEE--DEVEEGEEEEEKKKEEKAKKKE 119
           + +KR  EK + E  +   E +K K+    +   E +  E+E E K+ E  KK+E
Sbjct: 277 ENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKRE 331



 Score = 27.3 bits (60), Expect = 3.4
 Identities = 19/82 (23%), Positives = 34/82 (41%), Gaps = 9/82 (10%)

Query: 66  DQAKKRRDEKEEK--------EGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKK 117
           D A K+RDE  +K        +       +   +    ++ E+E+ + E +K  EE  K 
Sbjct: 241 DNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKA 300

Query: 118 KEQE-EEGGGGGGEQEKEEERE 138
           K+ +  +        EKE E +
Sbjct: 301 KDHKAFDLKQESKASEKEAEDK 322


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 28.7 bits (64), Expect = 1.1
 Identities = 20/72 (27%), Positives = 41/72 (56%)

Query: 67  QAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGG 126
           +  K    KEE++   +  EE K K+E  +E+  +   +EE K+K    +K++++E+   
Sbjct: 96  KEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVE 155

Query: 127 GGGEQEKEEERE 138
              ++E+E++RE
Sbjct: 156 EPRDREEEKKRE 167



 Score = 26.0 bits (57), Expect = 7.5
 Identities = 19/72 (26%), Positives = 41/72 (56%)

Query: 65  DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
           + + +  ++E++EKE  +E  ++ K K +   +D   + E +E++  +EK K+KE++ E 
Sbjct: 97  EPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEE 156

Query: 125 GGGGGEQEKEEE 136
                E++K E 
Sbjct: 157 PRDREEEKKRER 168


>gnl|CDD|240285 PTZ00135, PTZ00135, 60S acidic ribosomal protein P0; Provisional.
          Length = 310

 Score = 28.4 bits (64), Expect = 1.1
 Identities = 6/25 (24%), Positives = 11/25 (44%)

Query: 104 EEEEEKKKEEKAKKKEQEEEGGGGG 128
                      A+++E+EE+  G G
Sbjct: 283 AAAAAAAAAAPAEEEEEEEDDMGFG 307



 Score = 26.5 bits (59), Expect = 5.8
 Identities = 6/25 (24%), Positives = 9/25 (36%)

Query: 105 EEEEKKKEEKAKKKEQEEEGGGGGG 129
                       ++E+EEE   G G
Sbjct: 283 AAAAAAAAAAPAEEEEEEEDDMGFG 307


>gnl|CDD|240331 PTZ00254, PTZ00254, 40S ribosomal protein SA; Provisional.
          Length = 249

 Score = 28.1 bits (63), Expect = 1.2
 Identities = 12/42 (28%), Positives = 16/42 (38%)

Query: 72  RDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEE 113
           RD +E +E      E    ++      E EEGEE       E
Sbjct: 205 RDPEEAEEKEEAAAETAGVEDAAAAAAEAEEGEEWVAAANNE 246



 Score = 28.1 bits (63), Expect = 1.5
 Identities = 11/39 (28%), Positives = 14/39 (35%)

Query: 96  EEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKE 134
           EE E +E    E    E+ A    + EEG         E
Sbjct: 208 EEAEEKEEAAAETAGVEDAAAAAAEAEEGEEWVAAANNE 246


>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
          Length = 259

 Score = 28.2 bits (63), Expect = 1.4
 Identities = 11/32 (34%), Positives = 18/32 (56%)

Query: 91  KEENGEEDEVEEGEEEEEKKKEEKAKKKEQEE 122
           ++E        E E+ ++K++ E  KKKE EE
Sbjct: 75  EDEKELSASSLEAEQAKKKEEAEAKKKKEMEE 106



 Score = 27.0 bits (60), Expect = 3.5
 Identities = 9/31 (29%), Positives = 19/31 (61%)

Query: 92  EENGEEDEVEEGEEEEEKKKEEKAKKKEQEE 122
           +   E++      E E+ KK+E+A+ K+++E
Sbjct: 73  KPEDEKELSASSLEAEQAKKKEEAEAKKKKE 103


>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
          Length = 813

 Score = 28.2 bits (63), Expect = 1.4
 Identities = 15/55 (27%), Positives = 31/55 (56%), Gaps = 1/55 (1%)

Query: 67  QAKKRRDEKEEKEGGREGGEEMK-GKEENGEEDEVEEGEEEEEKKKEEKAKKKEQ 120
             K  R++ ++ + G++GG+  + GK+ N E D    GE++ + K  +K  +K +
Sbjct: 734 VGKTAREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAK 788


>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
            This model family represents the major capsid protein
           component of the heads (capsids) of bacteriophage HK97,
           phi-105, P27, and related phage. This model represents
           one of several analogous families lacking detectable
           sequence similarity. The gene encoding this component is
           typically located in an operon encoding the small and
           large terminase subunits, the portal protein and the
           prohead or maturation protease [Mobile and
           extrachromosomal element functions, Prophage functions].
          Length = 384

 Score = 28.1 bits (63), Expect = 1.5
 Identities = 23/80 (28%), Positives = 36/80 (45%), Gaps = 15/80 (18%)

Query: 74  EKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAK---------------KK 118
            KE +E   E  +E++   E+ +  E EE + E +  KEE  K               + 
Sbjct: 1   LKELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEA 60

Query: 119 EQEEEGGGGGGEQEKEEERE 138
           +    G GGGGE+E+EE + 
Sbjct: 61  KPAASGEGGGGEEEEEEAKA 80


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 28.1 bits (63), Expect = 1.7
 Identities = 11/32 (34%), Positives = 21/32 (65%)

Query: 91  KEENGEEDEVEEGEEEEEKKKEEKAKKKEQEE 122
           + E  E++ ++E +   + KK+E+ KKKE E+
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKKKKELEK 585



 Score = 26.9 bits (60), Expect = 4.8
 Identities = 13/34 (38%), Positives = 21/34 (61%)

Query: 105 EEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEERE 138
           EE +++KEEK   KEQ+        E++K++E E
Sbjct: 551 EELQREKEEKEALKEQKRLRKLKKQEEKKKKELE 584


>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit.  This model
           describes Pseudomonas denitrificans CobT gene product,
           which is a cobalt chelatase subunit that functions in
           cobalamin biosynthesis. Cobalamin (vitamin B12) can be
           synthesized via several pathways, including an aerobic
           pathway (found in Pseudomonas denitrificans) and an
           anaerobic pathway (found in P. shermanii and Salmonella
           typhimurium). These pathways differ in the point of
           cobalt insertion during corrin ring formation. There are
           apparently a number of variations on these two pathways,
           where the major differences seem to be concerned with
           the process of ring contraction. Confusion regarding the
           functions of enzymes found in the aerobic vs. anaerobic
           pathways has arisen because nonhomologous genes in these
           different pathways were given the same gene symbols.
           Thus, cobT in the aerobic pathway (P. denitrificans) is
           not a homolog of cobT in the anaerobic pathway (S.
           typhimurium). It should be noted that E. coli
           synthesizes cobalamin only when it is supplied with the
           precursor cobinamide, which is a complex intermediate.
           Additionally, all E. coli cobalamin synthesis genes
           (cobU, cobS and cobT) were named after their Salmonella
           typhimurium homologs which function in the anaerobic
           cobalamin synthesis pathway. This model describes the
           aerobic cobalamin pathway Pseudomonas denitrificans CobT
           gene product, which is a cobalt chelatase subunit, with
           a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
           heterotrimeric, ATP-dependent enzyme that catalyzes
           cobalt insertion during cobalamin biosynthesis. The
           other two subunits are the P. denitrificans CobS
           (TIGR01650) and CobN (pfam02514 CobN/Magnesium
           Chelatase) proteins. To avoid potential confusion with
           the nonhomologous Salmonella typhimurium/E.coli cobT
           gene product, the P. denitrificans gene symbol is not
           used in the name of this model [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Heme,
           porphyrin, and cobalamin].
          Length = 600

 Score = 28.0 bits (62), Expect = 1.7
 Identities = 17/55 (30%), Positives = 30/55 (54%)

Query: 82  REGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEE 136
           RE    M+  EE G++ E E+ E+ ++ +  E  ++++ E EG G  G   +E E
Sbjct: 191 REMLRSMELAEEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESE 245



 Score = 27.2 bits (60), Expect = 3.7
 Identities = 15/73 (20%), Positives = 33/73 (45%), Gaps = 2/73 (2%)

Query: 66  DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG 125
           + A++  D+ E ++   E G++ +  E   EE    EGE +E    +E      + E G 
Sbjct: 198 ELAEEMGDDTESED--EEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGE 255

Query: 126 GGGGEQEKEEERE 138
               + ++++  +
Sbjct: 256 EEMVQSDQDDLPD 268



 Score = 26.8 bits (59), Expect = 4.1
 Identities = 14/58 (24%), Positives = 28/58 (48%), Gaps = 1/58 (1%)

Query: 66  DQAKKRRDEKEEKEGGREGGEEMKGKEEN-GEEDEVEEGEEEEEKKKEEKAKKKEQEE 122
           D  +   +E+EE+  G   G+E    +E+   + E E GEEE  +  ++    +  ++
Sbjct: 216 DDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQSDQDDLPDESDDD 273


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 27.8 bits (62), Expect = 1.7
 Identities = 12/51 (23%), Positives = 23/51 (45%)

Query: 86  EEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEE 136
           E+ K  ++ G  +   +   +++  ++ K   K Q  E       QEK+EE
Sbjct: 19  EKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEE 69



 Score = 26.3 bits (58), Expect = 6.4
 Identities = 13/51 (25%), Positives = 23/51 (45%), Gaps = 3/51 (5%)

Query: 70  KRRDEKEEKEGGR-EGGEEMKGKEENG--EEDEVEEGEEEEEKKKEEKAKK 117
           K   E   K   + +  E+ K   +    E  +    +E++E+KK+ K KK
Sbjct: 27  KGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKK 77


>gnl|CDD|184724 PRK14520, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 155

 Score = 27.4 bits (61), Expect = 1.9
 Identities = 12/62 (19%), Positives = 17/62 (27%), Gaps = 3/62 (4%)

Query: 75  KEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKE 134
            E K    E       + + G      E    ++KK   +A   E              E
Sbjct: 96  AEPKPSKLELFNAALAEADGGPT---AEATTPKKKKAAAEAAAAEAAAPAAEAAAAAAAE 152

Query: 135 EE 136
           EE
Sbjct: 153 EE 154


>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
           chain; Provisional.
          Length = 1033

 Score = 27.8 bits (62), Expect = 2.0
 Identities = 19/66 (28%), Positives = 34/66 (51%), Gaps = 7/66 (10%)

Query: 64  LDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEE-------EKKKEEKAK 116
           ++ QA +  DE+E +   R  G +    E   E+++ +E ++EE       E  K EKA+
Sbjct: 5   VNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKREKAR 64

Query: 117 KKEQEE 122
            KE ++
Sbjct: 65  LKELKK 70


>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
           [Transcription / DNA replication, recombination, and
           repair / Chromatin structure and dynamics].
          Length = 1001

 Score = 28.1 bits (62), Expect = 2.1
 Identities = 21/85 (24%), Positives = 39/85 (45%), Gaps = 8/85 (9%)

Query: 54  EGKPNGLLCVLDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEE 113
           +G  + L+   DD++ +  +E  E E   +        +E+ E DE EE +E  E   E+
Sbjct: 917 DGGWSFLMVGSDDESDESEEEVSEYEASSD--------DESDETDEDEESDESSEDLSED 968

Query: 114 KAKKKEQEEEGGGGGGEQEKEEERE 138
           +++    +EE G    E E +   +
Sbjct: 969 ESENDSSDEEDGEDWDELESKAAYD 993


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 27.9 bits (62), Expect = 2.1
 Identities = 17/71 (23%), Positives = 33/71 (46%)

Query: 68  AKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGG 127
            +K+  ++  +   +E  EE  GK +   EDE  E    +E + E++ K   +  +    
Sbjct: 227 KEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEE 286

Query: 128 GGEQEKEEERE 138
             E+EKE+ + 
Sbjct: 287 TEEKEKEKRKR 297


>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO.  Bacillus
           spores are protected by a protein shell consisting of
           over 50 different polypeptides, known as the coat. This
           family of proteins has an important morphogenetic role
           in coat assembly, it is involved in the assembly of at
           least 5 different coat proteins including CotB, CotG,
           CotS, CotSA and CotW. It is likely to act at a late
           stage of coat assembly.
          Length = 185

 Score = 27.5 bits (61), Expect = 2.3
 Identities = 21/70 (30%), Positives = 42/70 (60%)

Query: 69  KKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGG 128
            K+ DEKEE++   +   + K +E+  E +E E+ +E  E ++E+ A+++E+EE      
Sbjct: 33  IKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEE 92

Query: 129 GEQEKEEERE 138
            E+E E+ ++
Sbjct: 93  KEEEAEDVKQ 102


>gnl|CDD|227504 COG5177, COG5177, Uncharacterized conserved protein [Function
           unknown].
          Length = 769

 Score = 27.7 bits (61), Expect = 2.4
 Identities = 15/64 (23%), Positives = 28/64 (43%)

Query: 60  LLCVLDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKE 119
            +  +DD   K  D +E         ++ +G EE   E+E  +  E  + +KE++    E
Sbjct: 409 TMSAIDDDDPKENDNEEVAGDEESAIDDNEGFEELSPEEEERQLREFRDMEKEDREFPDE 468

Query: 120 QEEE 123
            E +
Sbjct: 469 AELQ 472


>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
           This family consists of several Nucleopolyhedrovirus
           capsid protein P87 sequences. P87 is expressed late in
           infection and concentrated in infected cell nuclei.
          Length = 606

 Score = 27.5 bits (61), Expect = 2.4
 Identities = 10/60 (16%), Positives = 22/60 (36%)

Query: 64  LDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
                     E  +    R+              DE E+  +E+E   E++ K++ +E++
Sbjct: 338 AISSLPDSDSEASDSGPTRKRKRRRVPPLPEYSSDEDEDDSDEDEVDYEKERKRRREEDK 397


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 27.4 bits (61), Expect = 2.9
 Identities = 13/38 (34%), Positives = 24/38 (63%)

Query: 89  KGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGG 126
           K K+E+   +  ++ E++++KKKE+K  K E E + G 
Sbjct: 56  KDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGF 93


>gnl|CDD|165587 PHA03343, PHA03343, US22 family homolog; Provisional.
          Length = 578

 Score = 27.3 bits (60), Expect = 3.2
 Identities = 18/83 (21%), Positives = 26/83 (31%), Gaps = 12/83 (14%)

Query: 69  KKRRDEKEEKEGGREGGEEMKGKEENGEEDEVE------------EGEEEEEKKKEEKAK 116
           +K+    E            KG+EE    D++E              E  E       A 
Sbjct: 99  RKKGPGGETAAAAAAAAAAEKGEEEEEAGDKIEPPAVRGSVSPIRGHETVEGAAAAATAA 158

Query: 117 KKEQEEEGGGGGGEQEKEEEREG 139
                 +G  GGGE   E+  +G
Sbjct: 159 TSAAACDGDDGGGEDGGEDGGKG 181


>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
           [Translation, ribosomal structure and biogenesis].
          Length = 109

 Score = 26.2 bits (58), Expect = 3.3
 Identities = 9/29 (31%), Positives = 16/29 (55%)

Query: 101 EEGEEEEEKKKEEKAKKKEQEEEGGGGGG 129
           E   E +E ++EEK ++ E+E +     G
Sbjct: 78  EAAAEADEAEEEEKEEEAEEESDDDMLFG 106



 Score = 25.4 bits (56), Expect = 7.8
 Identities = 6/29 (20%), Positives = 14/29 (48%)

Query: 101 EEGEEEEEKKKEEKAKKKEQEEEGGGGGG 129
            E   E ++ +EE+ +++ +EE       
Sbjct: 77  AEAAAEADEAEEEEKEEEAEEESDDDMLF 105


>gnl|CDD|236277 PRK08493, PRK08493, NADH dehydrogenase subunit G; Validated.
          Length = 819

 Score = 27.4 bits (61), Expect = 3.5
 Identities = 15/62 (24%), Positives = 30/62 (48%), Gaps = 2/62 (3%)

Query: 64  LDDQAKKRRDEKEEKEGGREGGEEMKG--KEENGEEDEVEEGEEEEEKKKEEKAKKKEQE 121
           L+++A  +  E+ ++   +E    +    +E+  E+ E     +EE KK+  K  KK  E
Sbjct: 443 LEEEAILKSLEEFKQSIVKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKVKKIPE 502

Query: 122 EE 123
            +
Sbjct: 503 VD 504


>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
           [Transcription].
          Length = 392

 Score = 27.0 bits (59), Expect = 3.6
 Identities = 12/52 (23%), Positives = 28/52 (53%)

Query: 72  RDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
             +KE+++   E  E  K + ++   DE+ E +EE+++ +E +   +   +E
Sbjct: 291 EGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENEENERHTELLADE 342


>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
          Length = 1627

 Score = 27.1 bits (60), Expect = 3.7
 Identities = 23/62 (37%), Positives = 34/62 (54%), Gaps = 4/62 (6%)

Query: 66  DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGE--EEEEKKKEEKAKKKEQEEE 123
           D  K+ +++KEEK+   E  EE+K +E + E +E E+G   E  EK   E  KK  +E  
Sbjct: 274 DWLKELKEKKEEKD--EEKSEEVKTEEVDEEFEEEEKGFYYELYEKVNIEANKKFIKEVI 331

Query: 124 GG 125
            G
Sbjct: 332 AG 333


>gnl|CDD|234064 TIGR02936, fdxN_nitrog, ferredoxin III, nif-specific.  Members of
           this family are homodimeric ferredoxins from nitrogen
           fixation regions of many nitrogen-fixing bacteria. As
           characterized in Rhodobacter capsulatus, these proteins
           are homodimeric, with two 4Fe-4S clusters bound per
           monomer. Although nif-specific, this protein family is
           not usiveral, as other nitrogenase systems may
           substitute flavodoxins, or different types of ferredoxin
           [Central intermediary metabolism, Nitrogen fixation].
          Length = 91

 Score = 25.8 bits (57), Expect = 3.7
 Identities = 8/24 (33%), Positives = 14/24 (58%)

Query: 87  EMKGKEENGEEDEVEEGEEEEEKK 110
            +KG  E GE    ++ ++E E+K
Sbjct: 39  TLKGINEEGELVASDDDDDEIERK 62


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 27.2 bits (60), Expect = 3.7
 Identities = 19/67 (28%), Positives = 35/67 (52%)

Query: 73  DEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQE 132
           + + EK GG  G +      E  +ED+ +E ++  E   EEK+KKK+++        + +
Sbjct: 199 ETEGEKGGGGRGKDLKIKDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDDD 258

Query: 133 KEEEREG 139
           K+ +R G
Sbjct: 259 KKGKRGG 265



 Score = 26.1 bits (57), Expect = 8.5
 Identities = 16/65 (24%), Positives = 30/65 (46%)

Query: 74  EKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEK 133
           + EE+E         K + E  E+ E  E E+ EE+    K  KK ++ +G   G +++ 
Sbjct: 296 DPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDD 355

Query: 134 EEERE 138
            +  +
Sbjct: 356 SDSGD 360



 Score = 26.1 bits (57), Expect = 9.2
 Identities = 16/66 (24%), Positives = 33/66 (50%)

Query: 73  DEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQE 132
           +E+E+K       +    ++E+ EE E E+ EEE    K+ K  KK + ++ G    + +
Sbjct: 298 EEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSD 357

Query: 133 KEEERE 138
             ++ +
Sbjct: 358 SGDDSD 363


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 27.1 bits (60), Expect = 3.9
 Identities = 12/59 (20%), Positives = 32/59 (54%)

Query: 65  DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
             +  + + +K+E++   E  ++   ++E  ++ E E    +E+KK+ E+A K+   ++
Sbjct: 73  SAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQ 131


>gnl|CDD|233870 TIGR02442, Cob-chelat-sub, cobaltochelatase subunit.
           Cobaltochelatase is responsible for the insertion of
           cobalt into the corrin ring of coenzyme B12 during its
           biosynthesis. Two versions have been well described.
           CbiK/CbiX is a monomeric, anaerobic version which acts
           early in the biosynthesis (pfam06180). CobNST is a
           trimeric, ATP-dependent, aerobic version which acts late
           in the biosynthesis (TIGR02257/TIGR01650/TIGR01651). A
           number of genomes (actinobacteria, cyanobacteria,
           betaproteobacteria and pseudomonads) which apparently
           biosynthesize B12, encode a cobN gene but are
           demonstrably lacking cobS and cobT. These genomes do,
           however contain a homolog (modelled here) of the
           magnesium chelatase subunits BchI/BchD family. Aside
           from the cyanobacteria (which have a separate magnesium
           chelatase trimer), these species do not make chlorins,
           so do not have any use for a magnesium chelatase.
           Furthermore, in nearly all cases the members of this
           family are proximal to either CobN itself or other genes
           involved in cobalt transport or B12 biosynthesis.
          Length = 633

 Score = 27.0 bits (60), Expect = 3.9
 Identities = 7/50 (14%), Positives = 18/50 (36%)

Query: 66  DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKA 115
           ++  +      EK    E   E  G   +      +E E + +++ + + 
Sbjct: 326 EEKPEEPGPDPEKPDEGEDDAEQSGPRGHPTPGNDDEKEPDPQEEADGQG 375


>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein.  Hsp70 chaperones help to fold
           many proteins. Hsp70 assisted folding involves repeated
           cycles of substrate binding and release. Hsp70 activity
           is ATP dependent. Hsp70 proteins are made up of two
           regions: the amino terminus is the ATPase domain and the
           carboxyl terminus is the substrate binding region.
          Length = 598

 Score = 26.8 bits (60), Expect = 4.0
 Identities = 9/30 (30%), Positives = 16/30 (53%)

Query: 93  ENGEEDEVEEGEEEEEKKKEEKAKKKEQEE 122
               +DE+E   ++ E+   E  K+KE+ E
Sbjct: 499 SGLSDDEIERMVKDAEEYAAEDKKRKERIE 528


>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein.  This family
           consists of various plant development proteins which are
           homologues of floricaula (FLO) and Leafy (LFY) proteins
           which are floral meristem identity proteins. Mutations
           in the sequences of these proteins affect flower and
           leaf development.
          Length = 382

 Score = 26.9 bits (60), Expect = 4.3
 Identities = 20/72 (27%), Positives = 34/72 (47%), Gaps = 9/72 (12%)

Query: 76  EEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQ---------EEEGGG 126
           +EKE    GG         G      + E+++++KK+ + + KE          E++ G 
Sbjct: 154 QEKEAAGSGGGGGGTWGLVGVPGHSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDDDGE 213

Query: 127 GGGEQEKEEERE 138
           GGGE  +E +RE
Sbjct: 214 GGGEGGEERQRE 225


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 26.9 bits (60), Expect = 4.6
 Identities = 17/49 (34%), Positives = 25/49 (51%), Gaps = 9/49 (18%)

Query: 91  KEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEEREG 139
           +E   EE+E E GE +EE + EE+ +++           E  KEEE  G
Sbjct: 168 REGEEEEEEEEVGEADEEDEGEEEEEEEP---------EEVPKEEEGTG 207


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 26.6 bits (59), Expect = 4.6
 Identities = 12/56 (21%), Positives = 26/56 (46%)

Query: 67  QAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEE 122
            A  ++ +K++KE   E  ++ +  ++  EE+E +    EE    +E A      +
Sbjct: 62  PATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQESASANSLSD 117


>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
           component YidC; Validated.
          Length = 429

 Score = 26.7 bits (59), Expect = 5.3
 Identities = 11/37 (29%), Positives = 17/37 (45%), Gaps = 5/37 (13%)

Query: 101 EEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEER 137
            E  E + +KKE   K++  E E       +E  +ER
Sbjct: 338 AEKNEAKARKKEIAQKRRAAERE-----INREARQER 369



 Score = 26.4 bits (58), Expect = 5.5
 Identities = 11/39 (28%), Positives = 17/39 (43%), Gaps = 8/39 (20%)

Query: 100 VEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEERE 138
            E  +    +K E KA+KKE  ++         +  ERE
Sbjct: 330 AEIKKTRTAEKNEAKARKKEIAQK--------RRAAERE 360



 Score = 26.0 bits (57), Expect = 7.8
 Identities = 13/73 (17%), Positives = 22/73 (30%), Gaps = 4/73 (5%)

Query: 66  DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG 125
           +  K R  EK E    R+     K +     E E+     +E      +A+ +    +  
Sbjct: 331 EIKKTRTAEKNE-AKARKKEIAQKRRAA---EREINREARQERAAAMARARARRAAVKAK 386

Query: 126 GGGGEQEKEEERE 138
             G       E  
Sbjct: 387 KKGLIDASPNEDT 399


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
           Genome duplication is precisely regulated by
           cyclin-dependent kinases CDKs, which bring about the
           onset of S phase by activating replication origins and
           then prevent relicensing of origins until mitosis is
           completed. The optimum sequence motif for CDK
           phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
           to have at least 11 potential phosphorylation sites.
           Drc1 is required for DNA synthesis and S-M replication
           checkpoint control. Drc1 associates with Cdc2 and is
           phosphorylated at the onset of S phase when Cdc2 is
           activated. Thus Cdc2 promotes DNA replication by
           phosphorylating Drc1 and regulating its association with
           Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
           substrates required for DNA replication.
          Length = 397

 Score = 26.7 bits (59), Expect = 5.3
 Identities = 13/54 (24%), Positives = 27/54 (50%), Gaps = 4/54 (7%)

Query: 65  DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKK 118
           ++  K       E  G   G +E    ++  E+DE  + E E+++K ++K +K+
Sbjct: 318 EEIPKLDEKSLSEFLGYMGGIDE----DDEDEDDEESKEEVEKKQKVKKKPRKR 367


>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
           mRNA processing and transport [Signal transduction
           mechanisms / RNA processing and modification].
          Length = 388

 Score = 26.4 bits (58), Expect = 5.4
 Identities = 12/50 (24%), Positives = 20/50 (40%)

Query: 74  EKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
           E  EK+ G    EE   +E   E +  E   EE  + +  + +  +  E 
Sbjct: 333 EVVEKQEGDVVTEESTDEESEDEVEIDESVIEEVAEMELLEVQVDDLAER 382


>gnl|CDD|220776 pfam10483, Hap2_elong, Histone acetylation protein 2.  Hap2 is one
           of three histone acetyltransferases proteins that, in
           yeasts, are found associated with elongating forms of
           RNA polymerase II (Elongator). The Haps can be isolated
           in two forms, as a six-subunit complex with Elongator
           and as a complex of the three proteins on their own. The
           role of the Hap complex in transcription is still
           speculative, being possibly to keep the HAT activity of
           free Elongator in check, allowing histone acetylation
           only in the presence of a transcribing polymerase, or
           the interaction with Haps might render Elongator
           susceptible to modifications thereby altering its
           activity.
          Length = 280

 Score = 26.5 bits (59), Expect = 5.5
 Identities = 14/57 (24%), Positives = 24/57 (42%), Gaps = 17/57 (29%)

Query: 90  GKEENGEEDEVEEGEEEEEK----------KKEEKAKKK-------EQEEEGGGGGG 129
            KE+   + E  E + E  +          +K+  A+++        QE+ GGG GG
Sbjct: 219 LKEKPEGDAEAAEEDPESLEDLTTFNLGLSEKQRLAREQVELPFFDAQEKLGGGEGG 275


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 26.2 bits (58), Expect = 5.5
 Identities = 16/48 (33%), Positives = 27/48 (56%)

Query: 76  EEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
            E E   +       KE   EE+E +E ++++E KKE+K KK ++E+ 
Sbjct: 130 SESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKM 177



 Score = 25.8 bits (57), Expect = 6.6
 Identities = 14/55 (25%), Positives = 30/55 (54%), Gaps = 1/55 (1%)

Query: 65  DDQAKKRRDEKEEKEGGR-EGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKK 118
                ++  E EE+E    +  +E+K +++  ++ + +  E +  KKK++K KKK
Sbjct: 139 TTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 26.3 bits (58), Expect = 5.8
 Identities = 13/72 (18%), Positives = 35/72 (48%)

Query: 65  DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
             + KK + ++ ++   ++  EE + ++E+  E+  E+  ++EE++  +     + +   
Sbjct: 152 KKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYDGML 211

Query: 125 GGGGGEQEKEEE 136
                E+E EE 
Sbjct: 212 VDSSDEEEGEEA 223


>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32.  This family
           consists of several mammalian specific proacrosin
           binding protein sp32 sequences. sp32 is a sperm specific
           protein which is known to bind with with 55- and 53-kDa
           proacrosins and the 49-kDa acrosin intermediate. The
           exact function of sp32 is unclear, it is thought however
           that the binding of sp32 to proacrosin may be involved
           in packaging the acrosin zymogen into the acrosomal
           matrix.
          Length = 243

 Score = 26.2 bits (57), Expect = 5.8
 Identities = 15/54 (27%), Positives = 25/54 (46%)

Query: 66  DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKE 119
            Q K  + ++E+     +   +    EE   ++E EE E EEE K+EE     +
Sbjct: 190 VQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQGTDD 243



 Score = 26.2 bits (57), Expect = 7.2
 Identities = 16/59 (27%), Positives = 27/59 (45%), Gaps = 2/59 (3%)

Query: 78  KEGGREGGEEMKGKEENGEEDEVEEGEE--EEEKKKEEKAKKKEQEEEGGGGGGEQEKE 134
             GG    +  K K+E       E  +E   EEK+ +E+ +++E EEE     G+   +
Sbjct: 185 SLGGSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQGTDD 243


>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein.  This
           family includes archaebacterial L12, eukaryotic P0, P1
           and P2.
          Length = 88

 Score = 25.3 bits (56), Expect = 6.0
 Identities = 6/26 (23%), Positives = 17/26 (65%)

Query: 102 EGEEEEEKKKEEKAKKKEQEEEGGGG 127
                 E++K+E+ +++E++++ G G
Sbjct: 61  AAAAAAEEEKKEEEEEEEEDDDMGFG 86



 Score = 25.3 bits (56), Expect = 6.0
 Identities = 7/28 (25%), Positives = 15/28 (53%)

Query: 102 EGEEEEEKKKEEKAKKKEQEEEGGGGGG 129
                   ++E+K +++E+EE+   G G
Sbjct: 59  AAAAAAAAEEEKKEEEEEEEEDDDMGFG 86


>gnl|CDD|130366 TIGR01299, synapt_SV2, synaptic vesicle protein SV2.  This model
           describes a tightly conserved subfamily of the larger
           family of sugar (and other) transporters described by
           PFAM model pfam00083. Members of this subfamily include
           closely related forms SV2A and SV2B of synaptic vesicle
           protein from vertebrates and a more distantly related
           homolog (below trusted cutoff) from Drosophila
           melanogaster. Members are predicted to have two sets of
           six transmembrane helices.
          Length = 742

 Score = 26.5 bits (58), Expect = 6.0
 Identities = 17/64 (26%), Positives = 29/64 (45%), Gaps = 1/64 (1%)

Query: 76  EEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEE 135
           E  +   EG       E + E+DE+ EGE +   + E    K E   +G    G +++ +
Sbjct: 69  EAADDEEEGEASSDATEGHDEDDEIYEGEYQGIPRAENGG-KDEIMADGAPKAGVRDELK 127

Query: 136 EREG 139
           + EG
Sbjct: 128 DGEG 131


>gnl|CDD|184860 PRK14858, tatA, twin arginine translocase protein A; Provisional.
          Length = 108

 Score = 25.6 bits (56), Expect = 6.0
 Identities = 12/52 (23%), Positives = 22/52 (42%)

Query: 72  RDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
            +EKE+ E   E  +E +  E   EED+  + +   E      +K  +  + 
Sbjct: 57  AEEKEKAEKLAETKKEAEAPEAKAEEDQAPKPKGAGEPPATVASKAGDGAKA 108


>gnl|CDD|233223 TIGR00990, 3a0801s09, mitochondrial precursor proteins import
           receptor (72 kDa mitochondrial outermembrane protein)
           (mitochondrial import receptor for the ADP/ATP carrier)
           (translocase of outermembrane tom70).  [Transport and
           binding proteins, Amino acids, peptides and amines].
          Length = 615

 Score = 26.5 bits (58), Expect = 6.1
 Identities = 18/61 (29%), Positives = 28/61 (45%), Gaps = 7/61 (11%)

Query: 66  DQAKKRRDEKEEKEGGREGGEEMKGK-------EENGEEDEVEEGEEEEEKKKEEKAKKK 118
            + +KR+  ++E EG  E  +    K       +E  E DE       EE++K+  AK K
Sbjct: 72  KERRKRKQAEKETEGKTEEKKSTAPKNAPVEPADELPEIDESSVANLSEEERKKYAAKLK 131

Query: 119 E 119
           E
Sbjct: 132 E 132


>gnl|CDD|227891 COG5604, COG5604, Uncharacterized conserved protein [Function
           unknown].
          Length = 523

 Score = 26.4 bits (58), Expect = 6.2
 Identities = 13/57 (22%), Positives = 23/57 (40%), Gaps = 8/57 (14%)

Query: 77  EKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKK--------EQEEEGG 125
            K+  R           +  E+++E G  +  K KE+ +K          E+EE+G 
Sbjct: 23  RKQLARSKKVYGTKNRNSHTENKMESGTNDNNKNKEKLSKLYSDVDSSSSEEEEDGS 79


>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57.  Eukaryotic
           cilia and flagella are specialised organelles found at
           the periphery of cells of diverse organisms.
           Intra-flagellar transport (IFT) is required for the
           assembly and maintenance of eukaryotic cilia and
           flagella, and consists of the bidirectional movement of
           large protein particles between the base and the distal
           tip of the organelle. IFT particles contain multiple
           copies of two distinct protein complexes, A and B, which
           contain at least 6 and 11 protein subunits. IFT57 is
           part of complex B but is not, however, required for the
           core subunits to stay associated. This protein is known
           as Huntington-interacting protein-1 in humans.
          Length = 355

 Score = 26.2 bits (58), Expect = 6.2
 Identities = 15/77 (19%), Positives = 35/77 (45%)

Query: 60  LLCVLDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKE 119
           +L  L D A K++    ++       +E +  +E+  E  +EE EEE E ++ +  +  +
Sbjct: 105 VLDDLADAALKKKGFSFKRPKYPNEEDEEENVDEDDAEIILEEVEEEVEIEEVDDDEGTQ 164

Query: 120 QEEEGGGGGGEQEKEEE 136
           + +   G      + ++
Sbjct: 165 ETKYKRGDTSLTPQAKD 181


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 26.6 bits (59), Expect = 6.2
 Identities = 14/55 (25%), Positives = 29/55 (52%), Gaps = 3/55 (5%)

Query: 70   KRRDEKEEKEGGREGGEEMKGKEENGE---EDEVEEGEEEEEKKKEEKAKKKEQE 121
            + R+++E++  G+   E  K K+ N E    ++ +  EE+  +   +K K K+Q 
Sbjct: 1521 ENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNKKQY 1575



 Score = 26.6 bits (59), Expect = 6.5
 Identities = 12/31 (38%), Positives = 20/31 (64%)

Query: 93  ENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
               E ++ +  EE+ KKK++K KKKE+E +
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKKEKKKEEEYK 760



 Score = 26.2 bits (58), Expect = 7.7
 Identities = 15/44 (34%), Positives = 27/44 (61%), Gaps = 6/44 (13%)

Query: 94  NGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEER 137
           NG++ E +  +  EEK K++K K+K++EEE       + +E+ R
Sbjct: 729 NGKDAEFKISDSVEEKTKKKKKKEKKKEEE------YKREEKAR 766



 Score = 26.2 bits (58), Expect = 8.8
 Identities = 17/63 (26%), Positives = 27/63 (42%), Gaps = 9/63 (14%)

Query: 69   KKRRDEKEEKEGGREGGEEMKGKE------ENGEEDEVEEGEEEEEKKKEEKAK--KKEQ 120
            + R  E++E  G  E   + + K        N E++ +EE   E + KK +  K  K   
Sbjct: 1521 ENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKN-IEEDYAESDIKKRKNKKQYKSNT 1579

Query: 121  EEE 123
            E E
Sbjct: 1580 EAE 1582


>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
           Reviewed.
          Length = 460

 Score = 26.5 bits (58), Expect = 6.4
 Identities = 6/73 (8%), Positives = 18/73 (24%), Gaps = 1/73 (1%)

Query: 66  DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG 125
            +A   R++  +  G      ++  +       E       E   + E    ++      
Sbjct: 367 SEADIEREQPGDLAGQAPAAHQVDAEAA-SAAPEEPAALASEAHDETEPEVPEKAAPIPD 425

Query: 126 GGGGEQEKEEERE 138
               ++       
Sbjct: 426 PAKPDELAVAGPG 438


>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
           subunit.  The alpha-subunit of the TRAP complex (TRAP
           alpha) is a single-spanning membrane protein of the
           endoplasmic reticulum (ER) which is found in proximity
           of nascent polypeptide chains translocating across the
           membrane.
          Length = 281

 Score = 26.3 bits (58), Expect = 6.5
 Identities = 21/64 (32%), Positives = 25/64 (39%), Gaps = 7/64 (10%)

Query: 52  LVEGKPNGLLCVLDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKK 111
           L  G   G      D       E EE E      +E        EEDE E  EEEE+++ 
Sbjct: 19  LTFGNGKGPFASAQDLT-----EDEEAEDDVVDEDEEDEAVV--EEDENELTEEEEDEEG 71

Query: 112 EEKA 115
           E KA
Sbjct: 72  EVKA 75


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 25.7 bits (57), Expect = 6.7
 Identities = 13/38 (34%), Positives = 23/38 (60%)

Query: 77  EKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEK 114
            +E   +  EE + +E++ E+D+ +E EEEE   K+ K
Sbjct: 109 SEEDESDDDEEDEEEEDDEEDDDEDESEEEESPVKKVK 146


>gnl|CDD|218581 pfam05416, Peptidase_C37, Southampton virus-type processing
           peptidase.  Corresponds to Merops family C37.
           Norwalk-like viruses (NLVs), including the Southampton
           virus, cause acute non-bacterial gastroenteritis in
           humans. The NLV genome encodes three open reading frames
           (ORFs). ORF1 encodes a polyprotein, which is processed
           by the viral protease into six proteins.
          Length = 535

 Score = 26.4 bits (58), Expect = 6.7
 Identities = 23/72 (31%), Positives = 32/72 (44%), Gaps = 5/72 (6%)

Query: 66  DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG 125
           D+ KK R+E+    GG+   +E     E  EE E+ E +  E    EE+  K  Q   G 
Sbjct: 259 DEYKKIREER----GGKYSIQEYLEDRERYEE-ELAERQATEADFCEEEEAKIRQRIFGL 313

Query: 126 GGGGEQEKEEER 137
               +Q KEE  
Sbjct: 314 RKTRKQRKEERA 325


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
           effective cell-replication that replication is not
           stalled at any point by, for instance, damaged bases.
           Rtf2 stabilizes the replication fork stalled at the
           site-specific replication barrier RTS1 by preventing
           replication restart until completion of DNA synthesis by
           a converging replication fork initiated at a flanking
           origin. The RTS1 element terminates replication forks
           that are moving in the cen2-distal direction while
           allowing forks moving in the cen2-proximal direction to
           pass through the region. Rtf2 contains a C2HC2 motif
           related to the C3HC4 RING-finger motif, and would appear
           to fold up, creating a RING finger-like structure but
           forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 26.2 bits (58), Expect = 7.0
 Identities = 15/30 (50%), Positives = 18/30 (60%), Gaps = 4/30 (13%)

Query: 94  NGEEDEVE----EGEEEEEKKKEEKAKKKE 119
           N  E+EVE      EEE  KKK++K KKK 
Sbjct: 163 NPTEEEVELLKARLEEERAKKKKKKKKKKT 192


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 26.2 bits (58), Expect = 7.1
 Identities = 11/64 (17%), Positives = 36/64 (56%), Gaps = 3/64 (4%)

Query: 65  DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEE---DEVEEGEEEEEKKKEEKAKKKEQE 121
            +   K++ ++ +++G ++  ++ K   ++ +    + + E +E E K+  +K KKK+++
Sbjct: 6   GENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKK 65

Query: 122 EEGG 125
           ++  
Sbjct: 66  KKKK 69



 Score = 26.2 bits (58), Expect = 7.2
 Identities = 10/63 (15%), Positives = 29/63 (46%)

Query: 74  EKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEK 133
            K++K+  ++ G + K K+   + D+ +    E   + +E   K+  +++      +++K
Sbjct: 10  VKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKKK 69

Query: 134 EEE 136
              
Sbjct: 70  NLG 72


>gnl|CDD|185638 PTZ00459, PTZ00459, mucin-associated surface protein (MASP);
           Provisional.
          Length = 291

 Score = 26.3 bits (57), Expect = 7.3
 Identities = 17/55 (30%), Positives = 27/55 (49%)

Query: 65  DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKE 119
           D+      +E ++ +GG E  EE K + ++G+E  V  G    EKK     K+ E
Sbjct: 78  DEDVDDDSEEGDDDDGGAEDEEEEKVRGQSGQEGTVALGSGSTEKKLIGSEKQTE 132


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 25.7 bits (57), Expect = 7.3
 Identities = 15/53 (28%), Positives = 27/53 (50%)

Query: 69  KKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQE 121
           KK R      +  +E  EE + + E  +E+E  +   E+E  K ++ K++E E
Sbjct: 88  KKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENE 140


>gnl|CDD|217830 pfam03986, Autophagy_N, Autophagocytosis associated protein (Atg3),
           N-terminal domain.  Autophagocytosis is a
           starvation-induced process responsible for transport of
           cytoplasmic proteins to the lysosome/vacuole. Atg3 is a
           ubiquitin like modifier that is topologically similar to
           the canonical E2 enzyme. It catalyzes the conjugation of
           Atg8 and phosphatidylethanolamine.
          Length = 146

 Score = 25.8 bits (57), Expect = 7.5
 Identities = 8/52 (15%), Positives = 25/52 (48%)

Query: 72  RDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
            DE E+       G   K K++  +E+++ E  ++++   +     ++ +++
Sbjct: 93  EDEDEDDGWVTTHGNRDKQKDDIADEEDIPEIGDDDDDVVDSSDADEDDDDD 144


>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi.  This family includes
           the fifth essential DNA polymerase in yeast EC:2.7.7.7.
           Pol5p is localised exclusively to the nucleolus and
           binds near or at the enhancer region of rRNA-encoding
           DNA repeating units.
          Length = 784

 Score = 26.0 bits (57), Expect = 7.8
 Identities = 15/56 (26%), Positives = 28/56 (50%)

Query: 82  REGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEER 137
            E     +   E  EEDE +  E ++++ + E  +  E E E  G  GE++++E+ 
Sbjct: 635 DENKSRHQQLFEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDD 690



 Score = 26.0 bits (57), Expect = 8.4
 Identities = 18/68 (26%), Positives = 33/68 (48%), Gaps = 2/68 (2%)

Query: 57  PNGLLCVLDD-QAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKA 115
             GL  +LD  +A + +   ++   G E  E+    E + +EDE E  E+ E + + +  
Sbjct: 622 NEGLQLILDVLKADENKSRHQQLFEGEEEDEDDLE-ETDDDEDECEAIEDSESESESDGE 680

Query: 116 KKKEQEEE 123
             +E E+E
Sbjct: 681 DGEEDEQE 688


>gnl|CDD|222918 PHA02687, PHA02687, ORF061 late transcription factor VLTF-4;
           Provisional.
          Length = 231

 Score = 25.7 bits (56), Expect = 7.9
 Identities = 17/60 (28%), Positives = 30/60 (50%), Gaps = 2/60 (3%)

Query: 70  KRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGG 129
           KRR +K+ K    E  +  K  E+    D+ ++  EE+E  +E +  ++  + EGG  G 
Sbjct: 87  KRRTKKKAKADKPE--KSPKAVEKLCPPDDRDDKNEEKEPTEEAQRNEESGDAEGGASGR 144


>gnl|CDD|203437 pfam06405, RCC_reductase, Red chlorophyll catabolite reductase (RCC
           reductase).  This family consists of several red
           chlorophyll catabolite reductase (RCC reductase)
           proteins. Red chlorophyll catabolite (RCC) reductase
           (RCCR) and pheophorbide (Pheide) a oxygenase (PaO)
           catalyze the key reaction of chlorophyll catabolism,
           porphyrin macrocycle cleavage of Pheide a to a primary
           fluorescent catabolite (pFCC).
          Length = 255

 Score = 25.9 bits (57), Expect = 8.0
 Identities = 11/27 (40%), Positives = 16/27 (59%)

Query: 95  GEEDEVEEGEEEEEKKKEEKAKKKEQE 121
           G E EVEE E E  +K++   ++K  E
Sbjct: 202 GGEREVEESEREALEKRDLIVRRKSIE 228


>gnl|CDD|167284 PRK01833, tatA, twin arginine translocase protein A; Provisional.
          Length = 74

 Score = 24.8 bits (54), Expect = 8.0
 Identities = 16/45 (35%), Positives = 26/45 (57%), Gaps = 5/45 (11%)

Query: 81  GREGGEEMKG-----KEENGEEDEVEEGEEEEEKKKEEKAKKKEQ 120
           G + GE +KG      ++  ++ E E+ E +E    E+KAK+KEQ
Sbjct: 29  GTDLGESVKGFKKAMADDKPKDAEFEKVEAKEAASTEQKAKEKEQ 73


>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein.  This
           protein is covalently attached to the terminii of
           replicating DNA in vivo.
          Length = 548

 Score = 26.1 bits (58), Expect = 8.1
 Identities = 16/40 (40%), Positives = 20/40 (50%)

Query: 69  KKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEE 108
           ++RR      E   E  EE +  EE  EE+E EE   EEE
Sbjct: 294 RRRRPPPSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333


>gnl|CDD|221303 pfam11902, DUF3422, Protein of unknown function (DUF3422).  This
          family of proteins are functionally uncharacterized.
          This protein is found in bacteria, archaea and
          eukaryotes. Proteins in this family are typically
          between 426 to 444 amino acids in length.
          Length = 419

 Score = 26.0 bits (58), Expect = 8.2
 Identities = 9/24 (37%), Positives = 11/24 (45%), Gaps = 1/24 (4%)

Query: 35 GIRW-RHIEFSDNTLCLQLVEGKP 57
           +RW RH EFS  T        +P
Sbjct: 74 RLRWERHTEFSTYTFFRPGPGDEP 97


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 25.9 bits (57), Expect = 8.3
 Identities = 12/40 (30%), Positives = 22/40 (55%)

Query: 74  EKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEE 113
              EK+      E++  ++E  EE+E EE EE+E+   ++
Sbjct: 154 SMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDD 193


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 26.0 bits (57), Expect = 8.3
 Identities = 13/58 (22%), Positives = 28/58 (48%), Gaps = 1/58 (1%)

Query: 66  DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
            +A ++  +KE ++      +  K  E+  ++ E ++ + EE K K+    K + E E
Sbjct: 83  QRAAEQARQKELEQRAAAE-KAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAE 139



 Score = 26.0 bits (57), Expect = 8.7
 Identities = 13/58 (22%), Positives = 32/58 (55%)

Query: 66  DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
             AKK ++ +++ E   E  E+ +  E+  +++  +    E+  K+ E+A K+ +E++
Sbjct: 61  PAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQ 118


>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component.  This
           family is a region of the Myb-Related Cdc5p/Cef1
           proteins, in fungi, and is part of the pre-mRNA splicing
           factor complex.
          Length = 363

 Score = 25.8 bits (57), Expect = 8.4
 Identities = 11/32 (34%), Positives = 18/32 (56%)

Query: 91  KEENGEEDEVEEGEEEEEKKKEEKAKKKEQEE 122
           +E    E+E+EE   + + +K    + KEQEE
Sbjct: 155 EEPEEMEEELEEDAADRDARKRAAEEAKEQEE 186


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 25.8 bits (56), Expect = 8.7
 Identities = 22/62 (35%), Positives = 35/62 (56%)

Query: 74  EKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEK 133
           E+EEKE  RE  EE++  E   + ++  +  + EE +KEEK  + E+EE+   G  E+  
Sbjct: 112 EEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGSLEENN 171

Query: 134 EE 135
            E
Sbjct: 172 GE 173


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
          Length = 191

 Score = 25.5 bits (56), Expect = 8.9
 Identities = 13/56 (23%), Positives = 24/56 (42%)

Query: 67  QAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEE 122
           +    R +K  K+  ++  E+ +  E   EE    E  EE +KK + +  K +   
Sbjct: 54  KKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQNTA 109


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 25.3 bits (56), Expect = 9.0
 Identities = 18/39 (46%), Positives = 28/39 (71%), Gaps = 2/39 (5%)

Query: 100 VEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEERE 138
           +EE  EE+E++KEEK +K+E+E+E   G G  +KEE  +
Sbjct: 14  IEEKREEKEREKEEKERKEEKEKE--WGKGLVQKEEREK 50


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 25.5 bits (56), Expect = 9.1
 Identities = 11/33 (33%), Positives = 21/33 (63%)

Query: 92  EENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
           EE+ +E +    E EE++  + K +KKE++E+ 
Sbjct: 93  EESEKEQKEVSKETEEKEAIKAKKEKKEKKEKK 125


>gnl|CDD|178680 PLN03134, PLN03134, glycine-rich RNA-binding protein 4;
           Provisional.
          Length = 144

 Score = 25.4 bits (55), Expect = 9.1
 Identities = 13/44 (29%), Positives = 15/44 (34%)

Query: 86  EEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGG 129
            EM GKE NG    V    +     +            GG GGG
Sbjct: 94  SEMDGKELNGRHIRVNPANDRPSAPRAYGGGGGYSGGGGGYGGG 137


>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
          Length = 1036

 Score = 26.0 bits (57), Expect = 9.1
 Identities = 13/38 (34%), Positives = 23/38 (60%), Gaps = 6/38 (15%)

Query: 100 VEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEER 137
           +EE   E EK  +E+A+++ Q EE      ++ +EEE+
Sbjct: 251 LEEKRRELEKLAKEEAERERQAEE------QRRREEEK 282


>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 187 and 201 amino
           acids in length. There is a single completely conserved
           residue Q that may be functionally important.
          Length = 158

 Score = 25.2 bits (56), Expect = 9.3
 Identities = 14/73 (19%), Positives = 28/73 (38%), Gaps = 13/73 (17%)

Query: 67  QAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEE-EEKKKEEKAKKKEQEEEGG 125
           QAK        K+   E   ++   E      E++      +++K   +++ KE E +  
Sbjct: 45  QAKS------IKQHLAELAADLLELEAAAPRAELQAKIARYKKEKARYRSEAKELEAKA- 97

Query: 126 GGGGEQEKEEERE 138
                +E E E +
Sbjct: 98  -----KEAEAESD 105


>gnl|CDD|221429 pfam12118, SprA-related, SprA-related family.  This protein is
           found in bacteria. Proteins in this family are typically
           between 234 to 465 amino acids in length. There is a
           conserved GEV sequence motif.Most members are annotated
           as being SprA-related.
          Length = 261

 Score = 25.5 bits (56), Expect = 9.4
 Identities = 16/63 (25%), Positives = 33/63 (52%), Gaps = 7/63 (11%)

Query: 77  EKEGGREGGEEMKGKEENGEEDEV-EEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEE 135
               G    E  K   E+ +E+ V EE  E+++ +  ++  +K+QE++      +QE+++
Sbjct: 59  STAAGSASQELQKQASESNDEEVVGEEEPEKDQAEAGKQLSQKQQEQQ------QQEEKQ 112

Query: 136 ERE 138
            RE
Sbjct: 113 VRE 115


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.303    0.130    0.363 

Gapped
Lambda     K      H
   0.267   0.0812    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 7,335,791
Number of extensions: 704784
Number of successful extensions: 7451
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4520
Number of HSP's successfully gapped: 1441
Length of query: 139
Length of database: 10,937,602
Length adjustment: 87
Effective length of query: 52
Effective length of database: 7,078,804
Effective search space: 368097808
Effective search space used: 368097808
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 17 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (21.7 bits)
S2: 54 (24.4 bits)