RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy6822
         (107 letters)



>gnl|CDD|214580 smart00242, MYSc, Myosin. Large ATPases.  ATPase; molecular motor.
           Muscle contraction consists of a cyclical interaction
           between myosin and actin. The core of the myosin
           structure is similar in fold to that of kinesin.
          Length = 677

 Score = 84.9 bits (211), Expect = 1e-20
 Identities = 32/55 (58%), Positives = 46/55 (83%), Gaps = 2/55 (3%)

Query: 1   MQYLAAVNKSPS--NLITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHFKE 53
           MQYLA+V+ S +    + +QILE++P+LE+FGNAKT+RN+NSSRFGKF+++HF  
Sbjct: 113 MQYLASVSGSNTEVGSVEDQILESNPILEAFGNAKTLRNNNSSRFGKFIEIHFDA 167


>gnl|CDD|238071 cd00124, MYSc, Myosin motor domain. This catalytic (head) domain
           has ATPase activity and belongs to the larger group of
           P-loop NTPases. Myosins are actin-dependent molecular
           motors that play important roles in muscle contraction,
           cell motility, and organelle transport. The head domain
           is a molecular motor, which utilizes ATP hydrolysis to
           generate directed movement toward the plus end along
           actin filaments. A cyclical interaction between myosin
           and actin provides the driving force. Rates of ATP
           hydrolysis and consequently the speed of movement along
           actin filaments vary widely, from about 0.04 micrometer
           per second for myosin I to 4.5 micrometer per second for
           myosin II in skeletal muscle. Myosin II moves in
           discrete steps about 5-10 nm long and generates 1-5
           piconewtons of force. Upon ATP binding, the myosin head
           dissociates from an actin filament. ATP hydrolysis
           causes the head to pivot and associate with a new actin
           subunit. The release of Pi causes the head to pivot and
           move the filament (power stroke). Release of ADP
           completes the cycle.
          Length = 679

 Score = 83.9 bits (208), Expect = 3e-20
 Identities = 31/51 (60%), Positives = 42/51 (82%)

Query: 1   MQYLAAVNKSPSNLITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHF 51
           M+YLA++  S    I E+IL A+P+LE+FGNAKTVRN+NSSRFGKF+++ F
Sbjct: 107 MKYLASLAGSNDTGIEEKILAANPILEAFGNAKTVRNNNSSRFGKFIELQF 157


>gnl|CDD|238683 cd01387, MYSc_type_XV, Myosin motor domain, type XV myosins. In
           vertebrates, myosin XV appears to be expressed in
           sensory tissue and play a role in hearing. This
           catalytic (head) domain has ATPase activity and belongs
           to the larger group of P-loop NTPases. Myosins are
           actin-dependent molecular motors that play important
           roles in muscle contraction, cell motility, and
           organelle transport. The head domain is a molecular
           motor, which utilizes ATP hydrolysis to generate
           directed movement toward the plus end along actin
           filaments. A cyclical interaction between myosin and
           actin provides the driving force. Rates of ATP
           hydrolysis and consequently the speed of movement along
           actin filaments vary widely, from about 0.04 micrometer
           per second for myosin I to 4.5 micrometer per second for
           myosin II in skeletal muscle. Myosin II moves in
           discrete steps about 5-10 nm long and generates 1-5
           piconewtons of force. Upon ATP binding, the myosin head
           dissociates from an actin filament. ATP hydrolysis
           causes the head to pivot and associate with a new actin
           subunit. The release of Pi causes the head to pivot and
           move the filament (power stroke). Release of ADP
           completes the cycle.
          Length = 677

 Score = 83.4 bits (206), Expect = 3e-20
 Identities = 36/52 (69%), Positives = 47/52 (90%)

Query: 1   MQYLAAVNKSPSNLITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHFK 52
           ++YLAA+N+  S +ITEQILEA+PLLE+FGNAKTVRNDNSSRFGKF+++  +
Sbjct: 108 LRYLAAMNQGGSAVITEQILEATPLLEAFGNAKTVRNDNSSRFGKFVEIFLE 159


>gnl|CDD|238673 cd01377, MYSc_type_II, Myosin motor domain, type II myosins. Myosin
           II mediates cortical contraction in cell motility, and
           is the motor in smooth and skeletal muscle. This
           catalytic (head) domain has ATPase activity and belongs
           to the larger group of P-loop NTPases. Myosins are
           actin-dependent molecular motors that play important
           roles in muscle contraction, cell motility, and
           organelle transport. The head domain is a molecular
           motor, which utilizes ATP hydrolysis to generate
           directed movement toward the plus end along actin
           filaments. A cyclical interaction between myosin and
           actin provides the driving force. Rates of ATP
           hydrolysis and consequently the speed of movement along
           actin filaments vary widely, from about 0.04 micrometer
           per second for myosin I to 4.5 micrometer per second for
           myosin II in skeletal muscle. Myosin II moves in
           discrete steps about 5-10 nm long and generates 1-5
           piconewtons of force. Upon ATP binding, the myosin head
           dissociates from an actin filament. ATP hydrolysis
           causes the head to pivot and associate with a new actin
           subunit. The release of Pi causes the head to pivot and
           move the filament (power stroke). Release of ADP
           completes the cycle.
          Length = 693

 Score = 79.2 bits (196), Expect = 1e-18
 Identities = 32/59 (54%), Positives = 43/59 (72%), Gaps = 9/59 (15%)

Query: 2   QYLAAV---------NKSPSNLITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHF 51
           QYLA+V         +      + +QIL+A+P+LE+FGNAKTVRNDNSSRFGKF+++HF
Sbjct: 113 QYLASVAASSKKKKQSGKGQGTLEDQILQANPILEAFGNAKTVRNDNSSRFGKFIRIHF 171


>gnl|CDD|215687 pfam00063, Myosin_head, Myosin head (motor domain). 
          Length = 679

 Score = 77.7 bits (192), Expect = 4e-18
 Identities = 30/54 (55%), Positives = 43/54 (79%), Gaps = 3/54 (5%)

Query: 1   MQYLAAV---NKSPSNLITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHF 51
           MQYLA+V     S    + +QIL+++P+LE+FGNAKT+RN+NSSRFGKF+++ F
Sbjct: 106 MQYLASVSGSTPSAVGPLEDQILQSNPILEAFGNAKTLRNNNSSRFGKFIEIQF 159


>gnl|CDD|238674 cd01378, MYSc_type_I, Myosin motor domain, type I myosins. Myosin I
           generates movement at the leading edge in cell motility,
           and class I myosins have been implicated in phagocytosis
           and vesicle transport. Myosin I, an unconventional
           myosin, does not form dimers. This catalytic (head)
           domain has ATPase activity and belongs to the larger
           group of P-loop NTPases. Myosins are actin-dependent
           molecular motors that play important roles in muscle
           contraction, cell motility, and organelle transport. The
           head domain is a molecular motor, which utilizes ATP
           hydrolysis to generate directed movement toward the plus
           end along actin filaments. A cyclical interaction
           between myosin and actin provides the driving force.
           Rates of ATP hydrolysis and consequently the speed of
           movement along actin filaments vary widely, from about
           0.04 micrometer per second for myosin I to 4.5
           micrometer per second for myosin II in skeletal muscle.
           Myosin II moves in discrete steps about 5-10 nm long and
           generates 1-5 piconewtons of force. Upon ATP binding,
           the myosin head dissociates from an actin filament. ATP
           hydrolysis causes the head to pivot and associate with a
           new actin subunit. The release of Pi causes the head to
           pivot and move the filament (power stroke). Release of
           ADP completes the cycle.
          Length = 674

 Score = 77.2 bits (191), Expect = 5e-18
 Identities = 28/53 (52%), Positives = 41/53 (77%), Gaps = 2/53 (3%)

Query: 1   MQYLAAV--NKSPSNLITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHF 51
           MQY+AAV         + + IL+++PLLE+FGNAKT+RN+NSSRFGK++++ F
Sbjct: 107 MQYIAAVSGGGQKVERVKDVILQSNPLLEAFGNAKTLRNNNSSRFGKYMEIQF 159


>gnl|CDD|238677 cd01381, MYSc_type_VII, Myosin motor domain, type VII myosins.
           Myosins in this group have been associated with
           functions in sensory systems such as vision and hearing.
           This catalytic (head) domain has ATPase activity and
           belongs to the larger group of P-loop NTPases. Myosins
           are actin-dependent molecular motors that play important
           roles in muscle contraction, cell motility, and
           organelle transport. The head domain is a molecular
           motor, which utilizes ATP hydrolysis to generate
           directed movement toward the plus end along actin
           filaments. A cyclical interaction between myosin and
           actin provides the driving force. Rates of ATP
           hydrolysis and consequently the speed of movement along
           actin filaments vary widely, from about 0.04 micrometer
           per second for myosin I to 4.5 micrometer per second for
           myosin II in skeletal muscle. Myosin II moves in
           discrete steps about 5-10 nm long and generates 1-5
           piconewtons of force. Upon ATP binding, the myosin head
           dissociates from an actin filament. ATP hydrolysis
           causes the head to pivot and associate with a new actin
           subunit. The release of Pi causes the head to pivot and
           move the filament (power stroke). Release of ADP
           completes the cycle.
          Length = 671

 Score = 74.0 bits (182), Expect = 7e-17
 Identities = 33/53 (62%), Positives = 45/53 (84%), Gaps = 1/53 (1%)

Query: 1   MQYLAAVNKSPSNLITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHFKE 53
           +QYLAA++   S  I +QILEA+P+LE+FGNAKT+RNDNSSRFGK++ +HF +
Sbjct: 107 LQYLAAISGKHS-WIEQQILEANPILEAFGNAKTIRNDNSSRFGKYIDIHFNK 158


>gnl|CDD|238676 cd01380, MYSc_type_V, Myosin motor domain, type V myosins. Myosins
           V transport a variety of intracellular cargo
           processively along actin filaments, such as membraneous
           organelles and mRNA. This catalytic (head) domain has
           ATPase activity and belongs to the larger group of
           P-loop NTPases. Myosins are actin-dependent molecular
           motors that play important roles in muscle contraction,
           cell motility, and organelle transport. The head domain
           is a molecular motor, which utilizes ATP hydrolysis to
           generate directed movement toward the plus end along
           actin filaments. A cyclical interaction between myosin
           and actin provides the driving force. Rates of ATP
           hydrolysis and consequently the speed of movement along
           actin filaments vary widely, from about 0.04 micrometer
           per second for myosin I to 4.5 micrometer per second for
           myosin II in skeletal muscle. Myosin II moves in
           discrete steps about 5-10 nm long and generates 1-5
           piconewtons of force. Upon ATP binding, the myosin head
           dissociates from an actin filament. ATP hydrolysis
           causes the head to pivot and associate with a new actin
           subunit. The release of Pi causes the head to pivot and
           move the filament (power stroke). Release of ADP
           completes the cycle.
          Length = 691

 Score = 72.0 bits (177), Expect = 4e-16
 Identities = 28/61 (45%), Positives = 42/61 (68%), Gaps = 5/61 (8%)

Query: 1   MQYLAAVNKSPSNL-----ITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHFKELN 55
           M+Y A+V  S S       + E++L ++P++E+FGNAKT RNDNSSRFGK++Q+ F +  
Sbjct: 107 MRYFASVGGSDSREVSETQVEEKVLASNPIMEAFGNAKTTRNDNSSRFGKYIQILFDKRG 166

Query: 56  T 56
            
Sbjct: 167 R 167


>gnl|CDD|238680 cd01384, MYSc_type_XI, Myosin motor domain, plant-specific type XI
           myosin, involved in organelle transport. This catalytic
           (head) domain has ATPase activity and belongs to the
           larger group of P-loop NTPases. Myosins are
           actin-dependent molecular motors that play important
           roles in muscle contraction, cell motility, and
           organelle transport. The head domain is a molecular
           motor, which utilizes ATP hydrolysis to generate
           directed movement toward the plus end along actin
           filaments. A cyclical interaction between myosin and
           actin provides the driving force. Rates of ATP
           hydrolysis and consequently the speed of movement along
           actin filaments vary widely, from about 0.04 micrometer
           per second for myosin I to 4.5 micrometer per second for
           myosin II in skeletal muscle. Myosin II moves in
           discrete steps about 5-10 nm long and generates 1-5
           piconewtons of force. Upon ATP binding, the myosin head
           dissociates from an actin filament. ATP hydrolysis
           causes the head to pivot and associate with a new actin
           subunit. The release of Pi causes the head to pivot and
           move the filament (power stroke). Release of ADP
           completes the cycle.
          Length = 674

 Score = 71.2 bits (175), Expect = 7e-16
 Identities = 28/56 (50%), Positives = 42/56 (75%), Gaps = 3/56 (5%)

Query: 1   MQYLAAVNKSPSNL---ITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHFKE 53
           M+YLA +          + +Q+LE++P+LE+FGNAKTVRN+NSSRFGKF+++ F +
Sbjct: 109 MRYLAYMGGRAGVEGRTVEQQVLESNPVLEAFGNAKTVRNNNSSRFGKFVEIQFDD 164


>gnl|CDD|238678 cd01382, MYSc_type_VI, Myosin motor domain, type VI myosins. Myosin
           VI is a monomeric myosin, which moves towards the
           minus-end of actin filaments, in contrast to most other
           myosins. It has been implicated in endocytosis,
           secretion, and cell migration. This catalytic (head)
           domain has ATPase activity and belongs to the larger
           group of P-loop NTPases. Myosins are actin-dependent
           molecular motors that play important roles in muscle
           contraction, cell motility, and organelle transport. The
           head domain is a molecular motor, which utilizes ATP
           hydrolysis to generate directed movement toward the
           minus end along actin filaments. A cyclical interaction
           between myosin and actin provides the driving force.
           Rates of ATP hydrolysis and consequently the speed of
           movement along actin filaments vary widely, from about
           0.04 micrometer per second for myosin I to 4.5
           micrometer per second for myosin II in skeletal muscle.
           Myosin II moves in discrete steps about 5-10 nm long and
           generates 1-5 piconewtons of force. Upon ATP binding,
           the myosin head dissociates from an actin filament. ATP
           hydrolysis causes the head to pivot and associate with a
           new actin subunit. The release of Pi causes the head to
           pivot and move the filament (power stroke). Release of
           ADP completes the cycle.
          Length = 717

 Score = 69.0 bits (169), Expect = 3e-15
 Identities = 29/41 (70%), Positives = 38/41 (92%)

Query: 15  ITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHFKELN 55
           I ++I+EA+PLLE+FGNAKTVRN+NSSRFGKF+++HF E N
Sbjct: 125 IDDRIVEANPLLEAFGNAKTVRNNNSSRFGKFVEIHFNEKN 165


>gnl|CDD|238679 cd01383, MYSc_type_VIII, Myosin motor domain, plant-specific type
           VIII myosins, a subgroup which has been associated with
           endocytosis, cytokinesis, cell-to-cell coupling and
           gating at plasmodesmata. This catalytic (head) domain
           has ATPase activity and belongs to the larger group of
           P-loop NTPases. Myosins are actin-dependent molecular
           motors that play important roles in muscle contraction,
           cell motility, and organelle transport. The head domain
           is a molecular motor, which utilizes ATP hydrolysis to
           generate directed movement toward the plus end along
           actin filaments. A cyclical interaction between myosin
           and actin provides the driving force. Rates of ATP
           hydrolysis and consequently the speed of movement along
           actin filaments vary widely, from about 0.04 micrometer
           per second for myosin I to 4.5 micrometer per second for
           myosin II in skeletal muscle. Myosin II moves in
           discrete steps about 5-10 nm long and generates 1-5
           piconewtons of force. Upon ATP binding, the myosin head
           dissociates from an actin filament. ATP hydrolysis
           causes the head to pivot and associate with a new actin
           subunit. The release of Pi causes the head to pivot and
           move the filament (power stroke). Release of ADP
           completes the cycle.
          Length = 677

 Score = 65.6 bits (160), Expect = 7e-14
 Identities = 30/53 (56%), Positives = 41/53 (77%), Gaps = 2/53 (3%)

Query: 1   MQYLAAVNKSPSNLITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHFKE 53
           MQYLA++     + I  +IL+ +P+LE+FGNAKT RNDNSSRFGK +++HF E
Sbjct: 113 MQYLASLGGG--SGIEYEILQTNPILEAFGNAKTSRNDNSSRFGKLIEIHFSE 163


>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
          Length = 1463

 Score = 64.7 bits (158), Expect = 1e-13
 Identities = 33/56 (58%), Positives = 44/56 (78%), Gaps = 3/56 (5%)

Query: 1   MQYLAAVNKSPSNLIT---EQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHFKE 53
           MQYLA+V  S +  I+   +QIL  +P+LE+FGNAKTVRNDNSSRFGK++++ F E
Sbjct: 173 MQYLASVTSSSTVEISSIEKQILATNPILEAFGNAKTVRNDNSSRFGKYIKIEFDE 228


>gnl|CDD|240229 PTZ00014, PTZ00014, myosin-A; Provisional.
          Length = 821

 Score = 58.5 bits (142), Expect = 2e-11
 Identities = 26/50 (52%), Positives = 38/50 (76%), Gaps = 1/50 (2%)

Query: 1   MQYLAAVNKSPSNL-ITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQV 49
           M+Y A+      +L I   I+ A+P+LE+FGNAKT+RN+NSSRFG+F+Q+
Sbjct: 204 MRYFASSKSGNMDLKIQNAIMAANPVLEAFGNAKTIRNNNSSRFGRFMQL 253


>gnl|CDD|238675 cd01379, MYSc_type_III, Myosin motor domain, type III myosins.
           Myosin III has been shown to play a role in  the vision
           process in insects and in hearing in mammals. Myosin
           III, an unconventional myosin, does not form dimers.
           This catalytic (head) domain has ATPase activity and
           belongs to the larger group of P-loop NTPases. Myosins
           are actin-dependent molecular motors that play important
           roles in muscle contraction, cell motility, and
           organelle transport. The head domain is a molecular
           motor, which utilizes ATP hydrolysis to generate
           directed movement toward the plus end along actin
           filaments. A cyclical interaction between myosin and
           actin provides the driving force. Rates of ATP
           hydrolysis and consequently the speed of movement along
           actin filaments vary widely, from about 0.04 micrometer
           per second for myosin I to 4.5 micrometer per second for
           myosin II in skeletal muscle. Myosin II moves in
           discrete steps about 5-10 nm long and generates 1-5
           piconewtons of force. Upon ATP binding, the myosin head
           dissociates from an actin filament. ATP hydrolysis
           causes the head to pivot and associate with a new actin
           subunit. The release of Pi causes the head to pivot and
           move the filament (power stroke). Release of ADP
           completes the cycle.
          Length = 653

 Score = 54.1 bits (130), Expect = 7e-10
 Identities = 24/51 (47%), Positives = 38/51 (74%)

Query: 1   MQYLAAVNKSPSNLITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHF 51
           +Q L  + K+ +  + E+IL+ + L+E+FGNA+T  NDNSSRFGK+L++ F
Sbjct: 107 VQQLTVLGKANNRTLQEKILQVNSLVEAFGNARTGINDNSSRFGKYLEMKF 157


>gnl|CDD|238681 cd01385, MYSc_type_IX, Myosin motor domain, type IX myosins. Myosin
           IX is a processive single-headed motor, which might play
           a role in signalling. This catalytic (head) domain has
           ATPase activity and belongs to the larger group of
           P-loop NTPases. Myosins are actin-dependent molecular
           motors that play important roles in muscle contraction,
           cell motility, and organelle transport. The head domain
           is a molecular motor, which utilizes ATP hydrolysis to
           generate directed movement toward the plus end along
           actin filaments. A cyclical interaction between myosin
           and actin provides the driving force. Rates of ATP
           hydrolysis and consequently the speed of movement along
           actin filaments vary widely, from about 0.04 micrometer
           per second for myosin I to 4.5 micrometer per second for
           myosin II in skeletal muscle. Myosin II moves in
           discrete steps about 5-10 nm long and generates 1-5
           piconewtons of force. Upon ATP binding, the myosin head
           dissociates from an actin filament. ATP hydrolysis
           causes the head to pivot and associate with a new actin
           subunit. The release of Pi causes the head to pivot and
           move the filament (power stroke). Release of ADP
           completes the cycle.
          Length = 692

 Score = 53.4 bits (128), Expect = 1e-09
 Identities = 26/39 (66%), Positives = 33/39 (84%), Gaps = 1/39 (2%)

Query: 16  TEQ-ILEASPLLESFGNAKTVRNDNSSRFGKFLQVHFKE 53
            EQ IL A P+LE+FGNAKT  N+NSSRFGKF+QV+++E
Sbjct: 131 VEQTILSAGPVLEAFGNAKTAHNNNSSRFGKFIQVNYRE 169


>gnl|CDD|238659 cd01363, Motor_domain, Myosin and Kinesin motor domain. These
           ATPases belong to the P-loop NTPase family and provide
           the driving force in myosin and kinesin mediated
           processes.
          Length = 186

 Score = 46.4 bits (110), Expect = 2e-07
 Identities = 11/52 (21%), Positives = 20/52 (38%), Gaps = 1/52 (1%)

Query: 15  ITEQILEASPLLESFG-NAKTVRNDNSSRFGKFLQVHFKELNTEEEEKEEGK 65
               +++      +    A T  N++SSR     ++HF   N      E+ K
Sbjct: 54  TVTDVIDLMDKGNANRTTAATAMNEHSSRSHSVFRIHFGGKNALASATEQPK 105


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 43.4 bits (102), Expect = 4e-06
 Identities = 26/89 (29%), Positives = 48/89 (53%)

Query: 17  EQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQ 76
           E   EA    E  G ++T  +D   +     Q    E   E ++ E+G +          
Sbjct: 804 EGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDS 863

Query: 77  QQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
           ++++E+EE++E+EE++E++EEEE++E EE
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 39.2 bits (91), Expect = 1e-04
 Identities = 21/57 (36%), Positives = 39/57 (68%), Gaps = 1/57 (1%)

Query: 52  KELNTEEE-EKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           +ELN E + E ++ ++  +          +E+EE++E+EE++E++EEEE++EEEE E
Sbjct: 835 QELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENE 891



 Score = 35.0 bits (80), Expect = 0.003
 Identities = 15/44 (34%), Positives = 35/44 (79%)

Query: 55  NTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEE 98
           ++EEEE+EE +E+EEEE+++++++++E+ E+    E  E ++++
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQ 905



 Score = 33.4 bits (76), Expect = 0.014
 Identities = 15/44 (34%), Positives = 33/44 (75%)

Query: 58  EEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKK 101
           + E+EE +E+EEEE+++++++++E+EE+ E+    E  E  +K+
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQ 905



 Score = 32.3 bits (73), Expect = 0.032
 Identities = 19/59 (32%), Positives = 36/59 (61%), Gaps = 4/59 (6%)

Query: 53  ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKE----EQQEKEEQQEKKEEEEKKEEEEKE 107
           E+  E  E+E   E + E ++ ++           + +E+EE++E++EEEE++EEEE+E
Sbjct: 827 EVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEE 885



 Score = 30.3 bits (68), Expect = 0.15
 Identities = 13/52 (25%), Positives = 29/52 (55%)

Query: 56  TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
            E E++ E + K E E + +   +++ E++ E E + ++ + + + E EE E
Sbjct: 665 GEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVE 716



 Score = 30.0 bits (67), Expect = 0.19
 Identities = 17/55 (30%), Positives = 27/55 (49%)

Query: 53  ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           E   E +E +   E E EE + + + + E  E + + E  E+ EE E + E E E
Sbjct: 696 EGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAE 750



 Score = 29.6 bits (66), Expect = 0.25
 Identities = 15/55 (27%), Positives = 26/55 (47%)

Query: 50  HFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
           H  E   EE E+    E E  E+   + +Q+ + E + + E + +   E K E+E
Sbjct: 640 HTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQE 694



 Score = 29.6 bits (66), Expect = 0.27
 Identities = 15/40 (37%), Positives = 27/40 (67%)

Query: 53  ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQ 92
           E   EEEE+EE +E+EEEE+++++ ++    E  E  ++Q
Sbjct: 866 EEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQ 905



 Score = 29.6 bits (66), Expect = 0.27
 Identities = 15/42 (35%), Positives = 30/42 (71%)

Query: 53  ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEK 94
           E   EEEE+EE +E+EEEE+++++++ +E    +  E +Q++
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQ 905



 Score = 28.4 bits (63), Expect = 0.65
 Identities = 18/59 (30%), Positives = 32/59 (54%)

Query: 48  QVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
           +    E   E E  EEG+E E+E + + + + + + E   KE + E + E E KE+E++
Sbjct: 722 EAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDE 780



 Score = 28.4 bits (63), Expect = 0.82
 Identities = 15/38 (39%), Positives = 26/38 (68%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKE 89
           +E   EEEE+EE +E+EEEE++ ++    E  E ++K+
Sbjct: 868 EEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQ 905



 Score = 28.0 bits (62), Expect = 1.0
 Identities = 14/40 (35%), Positives = 28/40 (70%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQ 91
           +E   EEEE+EE +E+EEEE++++ ++    E  + +++Q
Sbjct: 866 EEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQ 905



 Score = 28.0 bits (62), Expect = 1.1
 Identities = 17/55 (30%), Positives = 29/55 (52%)

Query: 53  ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           E   E E + +G+ + E E   +++ +QE E + E +E   K E E ++ E E E
Sbjct: 666 EAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGE 720



 Score = 27.7 bits (61), Expect = 1.2
 Identities = 15/52 (28%), Positives = 27/52 (51%)

Query: 56  TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
            E E + E KE + + + + ++ + E E + E  E + + E  E+ EE E E
Sbjct: 693 QEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDE 744



 Score = 27.7 bits (61), Expect = 1.3
 Identities = 16/55 (29%), Positives = 29/55 (52%)

Query: 53  ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           E   E E KE   + E E ++ + + + E E  +++ E +  +E EE ++E E E
Sbjct: 694 EGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGE 748



 Score = 27.3 bits (60), Expect = 1.7
 Identities = 15/56 (26%), Positives = 28/56 (50%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
                E+E + E K + E E +   +++ E+E + E E ++   + E + EE E E
Sbjct: 663 SGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHE 718



 Score = 26.9 bits (59), Expect = 2.2
 Identities = 17/52 (32%), Positives = 29/52 (55%)

Query: 53  ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
           E+  E E + EG E E E +  ++ ++ E E + E E + E + E ++KE E
Sbjct: 714 EVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETE 765



 Score = 26.9 bits (59), Expect = 2.4
 Identities = 16/53 (30%), Positives = 26/53 (49%)

Query: 55  NTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           +T E   EEG+   E E +  ++   E E++ E E + E + E E   E + E
Sbjct: 640 HTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGE 692



 Score = 26.9 bits (59), Expect = 2.4
 Identities = 16/55 (29%), Positives = 30/55 (54%)

Query: 53  ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           E   +E E E G+E EE E + + + + + E + E + ++ + E E + E +E E
Sbjct: 724 EGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDE 778



 Score = 26.5 bits (58), Expect = 3.1
 Identities = 18/57 (31%), Positives = 31/57 (54%), Gaps = 2/57 (3%)

Query: 53  ELNTEEEEKEEGKEKEEEEQKQQQQQQQ--EKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           E+ T EE +E   E E E + + + + +   KE + E E + E KE+E++ E +  E
Sbjct: 731 EIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGE 787



 Score = 26.5 bits (58), Expect = 3.1
 Identities = 19/69 (27%), Positives = 33/69 (47%), Gaps = 14/69 (20%)

Query: 53  ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEE-------------- 98
           E  TE  EK+E + + E +    + + +  E++   E Q E K++E              
Sbjct: 804 EGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDS 863

Query: 99  EKKEEEEKE 107
           E++EEEE+E
Sbjct: 864 EEEEEEEEE 872



 Score = 26.5 bits (58), Expect = 3.4
 Identities = 12/55 (21%), Positives = 29/55 (52%)

Query: 53  ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           E   E E + +  + + E + ++ + + E E +  ++E + +  EE ++ E+E E
Sbjct: 692 EQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGE 746



 Score = 26.5 bits (58), Expect = 3.5
 Identities = 14/48 (29%), Positives = 24/48 (50%)

Query: 60  EKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           E E   E+  EE ++  + + E  E+   E +QE + E + + E E E
Sbjct: 637 EAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGE 684



 Score = 25.7 bits (56), Expect = 5.6
 Identities = 18/56 (32%), Positives = 29/56 (51%), Gaps = 3/56 (5%)

Query: 55  NTEEEEKEEGKEKEEEEQKQQQQQ---QQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           N EE   E  +E E E + + + +     E++ +QE E + E KE + K E E +E
Sbjct: 659 NGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEE 714



 Score = 25.7 bits (56), Expect = 6.6
 Identities = 15/57 (26%), Positives = 24/57 (42%), Gaps = 1/57 (1%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQ-QQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           K    E E   E   +E E   + + +  +E   + E+E + E K E E + E   E
Sbjct: 632 KGDVAEAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAE 688


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 37.2 bits (87), Expect = 6e-04
 Identities = 15/55 (27%), Positives = 46/55 (83%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
           K+   E++EK++     +++++++++++++KEE++E+EE++ ++E+EE++E+++K
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475



 Score = 36.8 bits (86), Expect = 7e-04
 Identities = 23/52 (44%), Positives = 44/52 (84%)

Query: 56  TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
             EEEK+E K+K    +K+++++++EKE+++E++E++E++ EEEK+EEEEK+
Sbjct: 422 KREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKK 473



 Score = 36.1 bits (84), Expect = 0.002
 Identities = 24/51 (47%), Positives = 43/51 (84%)

Query: 57  EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           E+ EK+  +EK+E+++K    +++E+EE++EKE+++E+KEEEE++ EEEKE
Sbjct: 417 EKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKE 467



 Score = 34.9 bits (81), Expect = 0.004
 Identities = 15/56 (26%), Positives = 47/56 (83%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           ++   EE+++++ K    +++++++++++EK+E++++EE++E +EE+E++EE++K+
Sbjct: 420 EKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475



 Score = 34.1 bits (79), Expect = 0.006
 Identities = 19/49 (38%), Positives = 40/49 (81%)

Query: 59  EEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
             ++  K++EEE+++++++    K++++E+EE++EKKEEE+++EEEE E
Sbjct: 415 IVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463



 Score = 33.7 bits (78), Expect = 0.010
 Identities = 19/56 (33%), Positives = 45/56 (80%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           K++    E+ E+ +E+E++E+K++    ++KEE++E+E++++++E+EE++EE E+E
Sbjct: 410 KKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEE 465



 Score = 33.0 bits (76), Expect = 0.017
 Identities = 22/63 (34%), Positives = 46/63 (73%), Gaps = 1/63 (1%)

Query: 45  KFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
           +FL    K+   + ++  E  EK+ EE+K++++++    +++E+EE++EK+++EE+KEEE
Sbjct: 400 EFL-TGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEE 458

Query: 105 EKE 107
           E+E
Sbjct: 459 EEE 461



 Score = 31.4 bits (72), Expect = 0.062
 Identities = 22/60 (36%), Positives = 42/60 (70%), Gaps = 5/60 (8%)

Query: 53  ELNTEEEE-----KEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           EL  EE E     K+  K+ ++  +K ++++++EK+E+++K    +KKEEEE++E+E+KE
Sbjct: 393 ELTEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKE 452



 Score = 31.0 bits (71), Expect = 0.073
 Identities = 15/56 (26%), Positives = 35/56 (62%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
             L   EEE E     ++  +K ++  ++ +++++E++++++KK    KK+EEE+E
Sbjct: 390 AFLELTEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEE 445



 Score = 27.2 bits (61), Expect = 1.6
 Identities = 18/42 (42%), Positives = 35/42 (83%)

Query: 51  FKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQ 92
           F     EEEE+EE ++KEEE+++++++ ++EKEE++EK+++Q
Sbjct: 435 FAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQ 476


>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
          Length = 1437

 Score = 36.7 bits (86), Expect = 8e-04
 Identities = 14/64 (21%), Positives = 33/64 (51%)

Query: 43  FGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKE 102
           + KF     K     ++ KEE ++ E +++++ ++  +E  E  +K E ++KK+ +    
Sbjct: 151 YEKFGFGILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDP 210

Query: 103 EEEK 106
           +E  
Sbjct: 211 KEGP 214



 Score = 33.3 bits (77), Expect = 0.015
 Identities = 15/60 (25%), Positives = 30/60 (50%), Gaps = 1/60 (1%)

Query: 43  FGKFLQVHFKELNTEEEEK-EEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKK 101
           FG        + + EE EK E  KE+E+E+  ++  +  +K E ++K++ +    +E   
Sbjct: 156 FGILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPV 215



 Score = 32.9 bits (76), Expect = 0.017
 Identities = 25/113 (22%), Positives = 51/113 (45%), Gaps = 18/113 (15%)

Query: 13  NLITEQILEASPLLESFGNAKTVRNDNSS----RFGKFLQVHFKELNTEEEEKE------ 62
           N I E+  + SPL +S    + V  + +        +  + H K+ +  +  K+      
Sbjct: 97  NEIIEKAKKNSPLFKSLLKKQKVEVEGNKLIIKVNNEIERDHLKKKHLPKLIKQYEKFGF 156

Query: 63  -------EGKEKEEEEQKQQQQQQQEKEEQ-QEKEEQQEKKEEEEKKEEEEKE 107
                  E  + +EE +K + Q+++E E+  +E  E  +K E E+KK+ +  +
Sbjct: 157 GILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFD 209


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 36.5 bits (85), Expect = 9e-04
 Identities = 16/56 (28%), Positives = 30/56 (53%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           +   TEE E+E   E E   + +  +Q+QE   +++     E+KE+ +K E+ +K 
Sbjct: 228 ETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKL 283



 Score = 31.9 bits (73), Expect = 0.038
 Identities = 16/59 (27%), Positives = 29/59 (49%), Gaps = 4/59 (6%)

Query: 53  ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQ----EKEEQQEKKEEEEKKEEEEKE 107
           E    EEE +   E   E +  +Q+Q+   EE      E++E  +K E+ +K E  +++
Sbjct: 231 ETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEILKEK 289



 Score = 29.6 bits (67), Expect = 0.29
 Identities = 14/57 (24%), Positives = 32/57 (56%), Gaps = 1/57 (1%)

Query: 52  KELNTEEEEKEEGKE-KEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           +E + E E   E K  K+E+E   ++      EE+++ ++ ++  + E  KE++++E
Sbjct: 237 EETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEILKEKKDEE 293



 Score = 29.2 bits (66), Expect = 0.40
 Identities = 13/49 (26%), Positives = 26/49 (53%)

Query: 59  EEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           +E  E +E+EEE   + +   + K  +QE+E   E+      +E+E+ +
Sbjct: 227 KETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPD 275



 Score = 26.5 bits (59), Expect = 2.8
 Identities = 10/51 (19%), Positives = 28/51 (54%)

Query: 56  TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
            +E  + E +E+E + + +   + +  +++QE   +++     E+KE+ +K
Sbjct: 226 LKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDK 276


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 36.4 bits (85), Expect = 0.001
 Identities = 16/41 (39%), Positives = 26/41 (63%)

Query: 64  GKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
            K+  +EE K+    Q +    +E+EE++E++EEEE  EEE
Sbjct: 280 DKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320



 Score = 34.5 bits (80), Expect = 0.005
 Identities = 13/41 (31%), Positives = 25/41 (60%)

Query: 63  EGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEE 103
           +    +EE ++    Q Q    ++E+EE++E++EEE  +EE
Sbjct: 280 DKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320



 Score = 32.1 bits (74), Expect = 0.028
 Identities = 14/46 (30%), Positives = 26/46 (56%)

Query: 54  LNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEE 99
           L     +K+   E+ +E    Q Q    +EE++E+EE++E++  EE
Sbjct: 274 LAAALADKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEE 319



 Score = 30.2 bits (69), Expect = 0.15
 Identities = 14/41 (34%), Positives = 25/41 (60%)

Query: 59  EEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEE 99
           ++    +E +E    Q Q    E+EE++E+EE++E+  EEE
Sbjct: 280 DKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 35.8 bits (83), Expect = 0.002
 Identities = 15/50 (30%), Positives = 37/50 (74%), Gaps = 2/50 (4%)

Query: 59  EEKEEGKE--KEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
           E+K +G    K +++++ Q+++++++  +++K  ++ KK+EE+KK+E EK
Sbjct: 536 EDKPDGPSVWKLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEK 585



 Score = 33.8 bits (78), Expect = 0.008
 Identities = 11/38 (28%), Positives = 31/38 (81%)

Query: 70  EEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           +++++ Q++++EKE  +E++  ++ K++EEKK++E ++
Sbjct: 548 DDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEK 585



 Score = 33.5 bits (77), Expect = 0.010
 Identities = 12/41 (29%), Positives = 33/41 (80%)

Query: 58  EEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEE 98
           ++++E  +EKEE+E  ++Q++ ++ ++Q+EK++++ +K E+
Sbjct: 548 DDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588



 Score = 31.5 bits (72), Expect = 0.049
 Identities = 12/43 (27%), Positives = 33/43 (76%)

Query: 59  EEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKK 101
           ++KEE + ++EE++  ++Q++  K ++QE+++++E ++ E+ K
Sbjct: 548 DDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAK 590



 Score = 31.2 bits (71), Expect = 0.070
 Identities = 18/55 (32%), Positives = 36/55 (65%), Gaps = 2/55 (3%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
           +EL  E+EEKE    KE++  ++ ++Q+++K+++ EK E+ +    E  K +E+K
Sbjct: 551 EELQREKEEKEA--LKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAEFFKRQEDK 603



 Score = 29.6 bits (67), Expect = 0.25
 Identities = 12/46 (26%), Positives = 34/46 (73%), Gaps = 2/46 (4%)

Query: 51  FKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKE 96
           +K  + EE ++E  KE++E  ++Q++ ++ +K+E+++K+E ++ ++
Sbjct: 545 WKLDDKEELQRE--KEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588



 Score = 28.8 bits (65), Expect = 0.43
 Identities = 8/35 (22%), Positives = 26/35 (74%)

Query: 73  KQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           K   +++ ++E+++++  +++K+  + KK+EE+K+
Sbjct: 546 KLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKK 580


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 34.7 bits (80), Expect = 0.003
 Identities = 15/56 (26%), Positives = 37/56 (66%)

Query: 51  FKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
           F+EL++ ++E+EE +E+  +E +Q+   + + E +++K+ +  K + E++K + E 
Sbjct: 88  FRELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEP 143



 Score = 26.3 bits (58), Expect = 2.5
 Identities = 14/49 (28%), Positives = 31/49 (63%)

Query: 53  ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKK 101
           E  TEEE  +E ++++  E K + ++++++E  + K E+++ K E +K 
Sbjct: 98  EEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKP 146


>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3.  This family of
           proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). SH3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of SH3, AAPs are retained in the ER.
          Length = 196

 Score = 33.8 bits (78), Expect = 0.007
 Identities = 10/39 (25%), Positives = 21/39 (53%), Gaps = 2/39 (5%)

Query: 71  EQKQQQQQQQ--EKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           E K  +  ++   +E +     ++E+  E EKKE ++K+
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 30.8 bits (70), Expect = 0.092
 Identities = 10/40 (25%), Positives = 22/40 (55%)

Query: 56  TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKK 95
            E ++ +  +E   EE +     ++E+  + EK+E ++KK
Sbjct: 157 AEWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 29.6 bits (67), Expect = 0.23
 Identities = 9/39 (23%), Positives = 22/39 (56%), Gaps = 3/39 (7%)

Query: 68  EEEEQKQQQQ---QQQEKEEQQEKEEQQEKKEEEEKKEE 103
           E ++ K  ++   ++ E     ++EE  E +++E KK++
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 28.1 bits (63), Expect = 0.66
 Identities = 10/39 (25%), Positives = 22/39 (56%), Gaps = 1/39 (2%)

Query: 60  EKEEGKEKEE-EEQKQQQQQQQEKEEQQEKEEQQEKKEE 97
           E ++ K  EE   ++ +     ++EE  E E+++ KK++
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 33.8 bits (78), Expect = 0.008
 Identities = 18/60 (30%), Positives = 36/60 (60%)

Query: 48  QVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           ++   +L+ E   K +   +EEEE+  +  +++ +EE QEK+E+++K+E E K  +   E
Sbjct: 247 KLANLKLSPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPE 306



 Score = 32.2 bits (74), Expect = 0.029
 Identities = 20/57 (35%), Positives = 38/57 (66%), Gaps = 7/57 (12%)

Query: 57  EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKE-------EQQEKKEEEEKKEEEEK 106
           EEEE++  K  EEE Q++ Q++++EK++++ +        E+Q K EE+E+K++  K
Sbjct: 266 EEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322


>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
          Length = 1036

 Score = 33.7 bits (77), Expect = 0.010
 Identities = 24/86 (27%), Positives = 47/86 (54%), Gaps = 14/86 (16%)

Query: 29  FGNAKTV-RNDNSSRFGKFLQVH-------FKELNTEEEEKEEGK-EKEEEEQKQQQQQQ 79
           F N + V  N++   F   +++        F++   EE+ +E  K  KEE E+++Q ++Q
Sbjct: 218 FFNGQNVYDNNDHKDF--CVEIEGGMDEHSFEDFLLEEKRRELEKLAKEEAERERQAEEQ 275

Query: 80  QEKEEQQ---EKEEQQEKKEEEEKKE 102
           + +EE++   E +  Q K E E+++E
Sbjct: 276 RRREEEKAAMEADRAQAKAEVEKRRE 301



 Score = 27.9 bits (62), Expect = 1.0
 Identities = 10/38 (26%), Positives = 27/38 (71%)

Query: 69  EEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
           EE++++ ++  +E+ E++ + E+Q ++EEE+   E ++
Sbjct: 252 EEKRRELEKLAKEEAERERQAEEQRRREEEKAAMEADR 289



 Score = 25.2 bits (55), Expect = 8.5
 Identities = 11/41 (26%), Positives = 24/41 (58%), Gaps = 2/41 (4%)

Query: 68  EEEEQ--KQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
           EEE Q  ++ Q++++ +EE    + ++  + + E KE+  K
Sbjct: 430 EEEHQIYRKLQEERRLREEAIRAKAEKTARMKAEMKEKTLK 470


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 33.5 bits (76), Expect = 0.011
 Identities = 19/56 (33%), Positives = 36/56 (64%), Gaps = 1/56 (1%)

Query: 46  FLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKK 101
           F  +   +L  + EE  E K K E EQK ++++++EKE+++E+E ++E++ E   K
Sbjct: 569 FTPLASSKLAKKREEAVE-KAKREAEQKAREEREREKEKEKEREREREREAERAAK 623



 Score = 33.1 bits (75), Expect = 0.019
 Identities = 14/42 (33%), Positives = 32/42 (76%)

Query: 65  KEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
           K++EE  +K +++ +Q+  E++E+E+++EK+ E E++ E E+
Sbjct: 579 KKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAER 620



 Score = 30.4 bits (68), Expect = 0.13
 Identities = 13/41 (31%), Positives = 30/41 (73%)

Query: 67  KEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           K+ EE  ++ +++ E++ ++E+E ++EK++E E++ E E E
Sbjct: 579 KKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAE 619


>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
          Length = 1068

 Score = 33.5 bits (77), Expect = 0.012
 Identities = 20/59 (33%), Positives = 32/59 (54%), Gaps = 6/59 (10%)

Query: 55  NTEEEEKEEGKEKEEEEQKQQQQQQQEKE--EQQEKEE-QQEKKE---EEEKKEEEEKE 107
            T E ++ E  EK   + +QQQ  ++E++     EK + QQE K    EE+  +E E+E
Sbjct: 652 ETRESQQAEVTEKARTQDEQQQAPRRERQRRRNDEKRQAQQEAKALNVEEQSVQETEQE 710



 Score = 26.9 bits (60), Expect = 2.6
 Identities = 14/53 (26%), Positives = 28/53 (52%)

Query: 55  NTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           N    E  E +E+    ++Q QQQ  E  E Q+ E  ++ + ++E+++   +E
Sbjct: 626 NRTRREGRENREENRRNRRQAQQQTAETRESQQAEVTEKARTQDEQQQAPRRE 678



 Score = 26.2 bits (58), Expect = 4.0
 Identities = 16/65 (24%), Positives = 32/65 (49%), Gaps = 3/65 (4%)

Query: 41  SRFGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEK 100
           SRF   L+  F   +  EE K + +   + E K ++QQ + K  Q  + ++ E+++  + 
Sbjct: 570 SRFFGALKALF---SGGEETKPQEQPAPKAEAKPERQQDRRKPRQNNRRDRNERRDTRDN 626

Query: 101 KEEEE 105
           +   E
Sbjct: 627 RTRRE 631


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score = 32.8 bits (75), Expect = 0.018
 Identities = 13/56 (23%), Positives = 26/56 (46%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
            E   EEEE+   + +EEE +  +++  Q +E+   +       + E K ++E   
Sbjct: 371 DEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESAS 426



 Score = 28.9 bits (65), Expect = 0.46
 Identities = 10/52 (19%), Positives = 28/52 (53%)

Query: 56  TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
             +E+++E +E+  +E ++++ +  E+E  Q +E+   +   +   + E K 
Sbjct: 369 EVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKA 420



 Score = 28.5 bits (64), Expect = 0.65
 Identities = 9/71 (12%), Positives = 32/71 (45%)

Query: 37  NDNSSRFGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKE 96
           +   S+     +     ++ EE +++E +E+E+   + ++++ ++ EE+  +  +    E
Sbjct: 348 STKESKMRDKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSE 407

Query: 97  EEEKKEEEEKE 107
                  + + 
Sbjct: 408 SSSDVGSDSES 418



 Score = 26.6 bits (59), Expect = 3.0
 Identities = 11/53 (20%), Positives = 22/53 (41%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
           ++ + E EE+E    +EE  Q ++    +   +     E +  KE     + E
Sbjct: 379 EQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSDSE 431


>gnl|CDD|238682 cd01386, MYSc_type_XVIII, Myosin motor domain, type XVIII myosins.
           This catalytic (head) domain has ATPase activity and
           belongs to the larger group of P-loop NTPases. Myosins
           are actin-dependent molecular motors that play important
           roles in muscle contraction, cell motility, and
           organelle transport. The head domain is a molecular
           motor, which utilizes ATP hydrolysis to generate
           directed movement toward the plus end along actin
           filaments. A cyclical interaction between myosin and
           actin provides the driving force. Rates of ATP
           hydrolysis and consequently the speed of movement along
           actin filaments vary widely, from about 0.04 micrometer
           per second for myosin I to 4.5 micrometer per second for
           myosin II in skeletal muscle. Myosin II moves in
           discrete steps about 5-10 nm long and generates 1-5
           piconewtons of force. Upon ATP binding, the myosin head
           dissociates from an actin filament. ATP hydrolysis
           causes the head to pivot and associate with a new actin
           subunit. The release of Pi causes the head to pivot and
           move the filament (power stroke). Release of ADP
           completes the cycle.
          Length = 767

 Score = 32.5 bits (74), Expect = 0.027
 Identities = 17/52 (32%), Positives = 29/52 (55%), Gaps = 1/52 (1%)

Query: 1   MQYLAAVNKSPSNLIT-EQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHF 51
           ++YLA    S    ++ E++     +LE+FGN  T  N N++RF + L + F
Sbjct: 107 LEYLALAAGSVDGRVSVEKVRALFTILEAFGNVSTALNGNATRFTQILSLDF 158


>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3.  This family
           of proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). Shr3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of Shr3, AAPs are retained in the ER.
          Length = 196

 Score = 32.0 bits (73), Expect = 0.027
 Identities = 8/39 (20%), Positives = 21/39 (53%), Gaps = 3/39 (7%)

Query: 72  QKQQQQQQQEKEEQQEKEE---QQEKKEEEEKKEEEEKE 107
           +++  +Q++E   ++ KE      +K    +K E ++K+
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196



 Score = 30.0 bits (68), Expect = 0.13
 Identities = 10/37 (27%), Positives = 19/37 (51%)

Query: 70  EEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
           E +  +Q+++   EE++E      KK    +K E +K
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKK 194



 Score = 27.7 bits (62), Expect = 0.96
 Identities = 11/44 (25%), Positives = 23/44 (52%), Gaps = 7/44 (15%)

Query: 60  EKEEGKEKEE--EEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKK 101
           E+++ K+KEE   E++     ++      +K    +K E ++KK
Sbjct: 158 ERKDAKQKEEFAAEER-----KEALAAAAKKSATPQKVETKKKK 196


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 32.5 bits (74), Expect = 0.029
 Identities = 14/54 (25%), Positives = 33/54 (61%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
           K+   + EE ++ +  E+E  KQ ++++   +EQ+++ E+  K+   ++K+ EE
Sbjct: 83  KKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEE 136



 Score = 31.3 bits (71), Expect = 0.072
 Identities = 10/48 (20%), Positives = 29/48 (60%)

Query: 57  EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
           +++  E+ + K+ E+++   Q+Q+++ E+  K+   ++K+ EE   + 
Sbjct: 94  QKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKA 141



 Score = 30.5 bits (69), Expect = 0.12
 Identities = 15/55 (27%), Positives = 39/55 (70%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
           ++ N ++++++  K  EE+ +K++QQQ +E +++Q  E+++ K+ E+E+   +E+
Sbjct: 62  EQYNRQQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQ 116



 Score = 28.6 bits (64), Expect = 0.48
 Identities = 15/51 (29%), Positives = 34/51 (66%)

Query: 57  EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           E+ +K+E ++ EE +QKQ  +Q++ K+ ++E+   QE+K++ E+  ++   
Sbjct: 79  EQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAAL 129



 Score = 28.6 bits (64), Expect = 0.53
 Identities = 14/47 (29%), Positives = 34/47 (72%)

Query: 61  KEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
            EE ++K+E++Q ++ QQ+Q  E+++ K+ ++E+   +E+K++ E+ 
Sbjct: 77  AEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEA 123



 Score = 27.1 bits (60), Expect = 2.0
 Identities = 13/50 (26%), Positives = 35/50 (70%)

Query: 58  EEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           EE++++ ++++ EE +Q+Q  +QE+ +Q EKE    ++++++ +E  ++ 
Sbjct: 78  EEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQA 127



 Score = 26.7 bits (59), Expect = 2.5
 Identities = 10/51 (19%), Positives = 26/51 (50%)

Query: 57  EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
             ++ E+ +   +E++KQ ++  ++   +Q++ E+   K     K + E E
Sbjct: 102 RLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAE 152


>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
          Length = 746

 Score = 32.4 bits (74), Expect = 0.033
 Identities = 16/59 (27%), Positives = 24/59 (40%), Gaps = 6/59 (10%)

Query: 49  VHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           +  KE+N  +EEK  G        K        KEEQ+  +   E ++      E +KE
Sbjct: 211 IKSKEININKEEKNNGSNVNNNGNKN------NKEEQKGNDLSNELEDISLGPLEYDKE 263


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 32.2 bits (74), Expect = 0.034
 Identities = 17/61 (27%), Positives = 35/61 (57%)

Query: 46  FLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
           +L++H K L  E EE+E         ++++ +++Q K E++ ++E+ EK   ++K E   
Sbjct: 384 YLKLHDKPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAA 443

Query: 106 K 106
           K
Sbjct: 444 K 444



 Score = 30.7 bits (70), Expect = 0.10
 Identities = 16/55 (29%), Positives = 28/55 (50%), Gaps = 4/55 (7%)

Query: 57  EEEEKEEGKEKEEEE----QKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           EEEE E G     E     +KQ++ +++ ++E+ EK   ++K E   KK +    
Sbjct: 397 EEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDG 451



 Score = 28.0 bits (63), Expect = 1.0
 Identities = 10/39 (25%), Positives = 20/39 (51%)

Query: 61  KEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEE 99
           K E K ++EE +K   +++ E   ++ K    E K+ + 
Sbjct: 420 KAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDP 458



 Score = 25.7 bits (57), Expect = 6.1
 Identities = 7/39 (17%), Positives = 19/39 (48%)

Query: 59  EEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEE 97
            EK+  KE+ E+   +++ +   K+ +    E ++   +
Sbjct: 421 AEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPD 459


>gnl|CDD|173487 PTZ00236, PTZ00236, mitochondrial import inner membrane translocase
           subunit tim17; Provisional.
          Length = 164

 Score = 31.6 bits (72), Expect = 0.037
 Identities = 9/25 (36%), Positives = 17/25 (68%)

Query: 68  EEEEQKQQQQQQQEKEEQQEKEEQQ 92
               Q+ QQQQ+ EK+ ++E +++Q
Sbjct: 139 RTPRQQFQQQQEMEKQLEEEDKKEQ 163



 Score = 30.0 bits (68), Expect = 0.14
 Identities = 12/29 (41%), Positives = 18/29 (62%), Gaps = 4/29 (13%)

Query: 70  EEQKQQQQQQQEKEEQQEKEEQQEKKEEE 98
              +QQ QQQQE     EK+ ++E K+E+
Sbjct: 139 RTPRQQFQQQQE----MEKQLEEEDKKEQ 163



 Score = 28.5 bits (64), Expect = 0.48
 Identities = 9/30 (30%), Positives = 21/30 (70%)

Query: 75  QQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
            ++Q +   +Q +++++ EK+ EEE K+E+
Sbjct: 134 NRRQMRTPRQQFQQQQEMEKQLEEEDKKEQ 163



 Score = 28.5 bits (64), Expect = 0.57
 Identities = 9/25 (36%), Positives = 19/25 (76%)

Query: 67  KEEEEQKQQQQQQQEKEEQQEKEEQ 91
           +   +Q QQQQ+ +++ E+++K+EQ
Sbjct: 139 RTPRQQFQQQQEMEKQLEEEDKKEQ 163



 Score = 27.7 bits (62), Expect = 0.83
 Identities = 9/30 (30%), Positives = 23/30 (76%)

Query: 74  QQQQQQQEKEEQQEKEEQQEKKEEEEKKEE 103
            ++Q +  +++ Q+++E +++ EEE+KKE+
Sbjct: 134 NRRQMRTPRQQFQQQQEMEKQLEEEDKKEQ 163



 Score = 27.3 bits (61), Expect = 1.3
 Identities = 8/26 (30%), Positives = 20/26 (76%)

Query: 72  QKQQQQQQQEKEEQQEKEEQQEKKEE 97
            +  +QQ Q+++E +++ E+++KKE+
Sbjct: 138 MRTPRQQFQQQQEMEKQLEEEDKKEQ 163



 Score = 26.5 bits (59), Expect = 2.3
 Identities = 9/29 (31%), Positives = 20/29 (68%)

Query: 67  KEEEEQKQQQQQQQEKEEQQEKEEQQEKK 95
           + +    +QQ QQQ++ E+Q +EE ++++
Sbjct: 135 RRQMRTPRQQFQQQQEMEKQLEEEDKKEQ 163



 Score = 26.5 bits (59), Expect = 2.6
 Identities = 8/30 (26%), Positives = 20/30 (66%)

Query: 76  QQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
            ++Q     QQ +++Q+ +K+ EE+ ++E+
Sbjct: 134 NRRQMRTPRQQFQQQQEMEKQLEEEDKKEQ 163



 Score = 26.2 bits (58), Expect = 3.1
 Identities = 9/28 (32%), Positives = 22/28 (78%)

Query: 73  KQQQQQQQEKEEQQEKEEQQEKKEEEEK 100
           +Q +  +Q+ ++QQE E+Q E+++++E+
Sbjct: 136 RQMRTPRQQFQQQQEMEKQLEEEDKKEQ 163



 Score = 25.0 bits (55), Expect = 8.6
 Identities = 6/29 (20%), Positives = 20/29 (68%)

Query: 61  KEEGKEKEEEEQKQQQQQQQEKEEQQEKE 89
           + + +   ++ Q+QQ+ ++Q +EE ++++
Sbjct: 135 RRQMRTPRQQFQQQQEMEKQLEEEDKKEQ 163


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 32.1 bits (73), Expect = 0.040
 Identities = 15/51 (29%), Positives = 31/51 (60%)

Query: 55  NTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
                EKE  ++ E  E+  ++Q++ +KE +QE EE +E++  ++ + E+E
Sbjct: 519 KLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKE 569



 Score = 28.2 bits (63), Expect = 0.91
 Identities = 13/45 (28%), Positives = 31/45 (68%)

Query: 58  EEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKE 102
           E++ E  ++  +E++K +++ +QE EE +E+E  ++ + E+E +E
Sbjct: 528 EQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQE 572



 Score = 25.9 bits (57), Expect = 4.7
 Identities = 15/52 (28%), Positives = 33/52 (63%)

Query: 55  NTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
           + E+  KE+ K K+E EQ+ ++ +++E+ ++ E E++ ++  +  KKE E  
Sbjct: 533 HLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKEVESI 584


>gnl|CDD|233199 TIGR00939, 2a57, Equilibrative Nucleoside Transporter (ENT).
           [Transport and binding proteins, Nucleosides, purines
           and pyrimidines].
          Length = 437

 Score = 32.0 bits (73), Expect = 0.041
 Identities = 14/66 (21%), Positives = 26/66 (39%), Gaps = 4/66 (6%)

Query: 39  NSSRFGKFLQVHF-KELNT---EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEK 94
                  F + +  K+L+    E+E K E + K E+        Q       + E++ E 
Sbjct: 193 LLLPKLPFARYYLQKKLDKGAGEDETKGELRSKAEQNGIPHGGDQPSPTLVLDWEKEPES 252

Query: 95  KEEEEK 100
            +E +K
Sbjct: 253 PDEPQK 258


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 31.8 bits (72), Expect = 0.045
 Identities = 20/51 (39%), Positives = 40/51 (78%)

Query: 57  EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           +E E+EE  E+EEEE++++++++QE EE++ ++E++E++ E +   EEE E
Sbjct: 437 QESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEME 487



 Score = 31.4 bits (71), Expect = 0.073
 Identities = 18/55 (32%), Positives = 37/55 (67%)

Query: 50  HFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
                 +EEEE  E +E+EEEE+++++Q+ +E+E + E+EE++ + +   ++E E
Sbjct: 433 SMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEME 487



 Score = 28.0 bits (62), Expect = 0.95
 Identities = 20/51 (39%), Positives = 35/51 (68%)

Query: 57  EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           EEE  EE +E+EEEE++++Q+ ++E+ E +E+EE+ E     E++ E   E
Sbjct: 441 EEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSE 491



 Score = 28.0 bits (62), Expect = 1.0
 Identities = 17/52 (32%), Positives = 36/52 (69%)

Query: 56  TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
            E  E+EE +E+EEEE++Q+ ++++ ++E++E+E + +   EEE +   E +
Sbjct: 442 EESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGD 493



 Score = 27.2 bits (60), Expect = 1.7
 Identities = 16/55 (29%), Positives = 27/55 (49%)

Query: 53  ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           E   EE+E EE + ++EEE+++ +     +EE +   E     EE E+  E    
Sbjct: 454 EEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNS 508



 Score = 26.4 bits (58), Expect = 3.3
 Identities = 16/51 (31%), Positives = 34/51 (66%)

Query: 57  EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
                 E      +E ++++  ++E+EE++E+EE++++ EEEE ++EEE+E
Sbjct: 424 ASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEE 474



 Score = 26.0 bits (57), Expect = 4.1
 Identities = 17/41 (41%), Positives = 33/41 (80%)

Query: 67  KEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
             +E ++++  +++E+EE++E+EE+QE +EEE + EEEE+E
Sbjct: 435 ASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEE 475



 Score = 26.0 bits (57), Expect = 4.8
 Identities = 17/51 (33%), Positives = 26/51 (50%)

Query: 57  EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           E+E +EE  E EEEE++ +     E+E +   E   + +E EE  E    E
Sbjct: 459 EQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSE 509



 Score = 25.6 bits (56), Expect = 5.9
 Identities = 15/53 (28%), Positives = 30/53 (56%)

Query: 53  ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
           E   EEE++ E +E E+EE++++ +     EE+ E   + +   EE +++ E 
Sbjct: 453 EEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAER 505



 Score = 25.3 bits (55), Expect = 8.7
 Identities = 20/54 (37%), Positives = 37/54 (68%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
           +E  + EEE+EE +E+EEEEQ+ ++++ +++EE++E E     +EE E   E +
Sbjct: 440 EEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGD 493



 Score = 25.3 bits (55), Expect = 8.8
 Identities = 16/43 (37%), Positives = 35/43 (81%)

Query: 65  KEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
             +E EE++  +++++E+EE++E+E++ E++E E+++EEEE E
Sbjct: 435 ASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVE 477


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 31.7 bits (73), Expect = 0.050
 Identities = 15/49 (30%), Positives = 34/49 (69%)

Query: 58  EEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
           EE + E ++K EE +   ++ ++ KEE +EK+E+ +++E++  +E E++
Sbjct: 526 EELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKE 574



 Score = 30.2 bits (69), Expect = 0.15
 Identities = 17/55 (30%), Positives = 35/55 (63%)

Query: 53  ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           EL  E E+K E  E   +E ++ +++ +EK+E+ ++EE +  +E E++ ++  KE
Sbjct: 527 ELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKE 581



 Score = 30.2 bits (69), Expect = 0.17
 Identities = 19/54 (35%), Positives = 33/54 (61%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
           K    E   KE  K KEE E+K+++ Q++E +  +E E++ ++  +E KKE +E
Sbjct: 535 KAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADE 588



 Score = 27.5 bits (62), Expect = 1.4
 Identities = 11/37 (29%), Positives = 26/37 (70%)

Query: 71  EQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           E+ +++ +Q+ +E +   +E ++ KEE E+K+E+ +E
Sbjct: 526 EELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQE 562



 Score = 27.5 bits (62), Expect = 1.5
 Identities = 15/57 (26%), Positives = 36/57 (63%), Gaps = 1/57 (1%)

Query: 52  KELNTEEEEKE-EGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
            E   E++ +E E   KE E+ K++ ++++EK +++E +  +E ++E ++  +E K+
Sbjct: 528 LERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKK 584



 Score = 26.7 bits (60), Expect = 2.6
 Identities = 14/59 (23%), Positives = 36/59 (61%), Gaps = 3/59 (5%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQE---KEEQQEKEEQQEKKEEEEKKEEEEKE 107
           K +  ++E+  E     EE +++ +Q+ +E     ++ EK +++ ++++E+ +EEE+K 
Sbjct: 509 KLIGEDKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKL 567



 Score = 25.6 bits (57), Expect = 6.1
 Identities = 17/51 (33%), Positives = 31/51 (60%)

Query: 57  EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           EE +K  G++KE+  +     ++ E+E +Q+ EE +   +E EK +EE +E
Sbjct: 505 EEAKKLIGEDKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEE 555


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 31.3 bits (71), Expect = 0.052
 Identities = 15/51 (29%), Positives = 28/51 (54%), Gaps = 2/51 (3%)

Query: 51  FKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKK 101
             E+  +   +++  EK +   K Q  ++ +K   QEK+E  EKK+ ++KK
Sbjct: 29  NPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKE--EKKKPKKKK 77


>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
          Length = 746

 Score = 31.7 bits (73), Expect = 0.052
 Identities = 15/58 (25%), Positives = 25/58 (43%)

Query: 50  HFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           H   +  EE  KEE K + EEE K + ++    E ++E + +       E+  E    
Sbjct: 34  HSSTVEEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAA 91


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 31.8 bits (72), Expect = 0.053
 Identities = 14/56 (25%), Positives = 32/56 (57%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           +EL  +++++E+ KEKE ++ K  Q++ + K + Q+  +     ++ EKK  +   
Sbjct: 16  EELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRKRDV 71



 Score = 31.0 bits (70), Expect = 0.084
 Identities = 11/48 (22%), Positives = 26/48 (54%)

Query: 57  EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
           +E +K +  +KE + + Q QQ        ++ E++  K++ E++  E+
Sbjct: 31  KELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRKRDVEDENPED 78



 Score = 29.9 bits (67), Expect = 0.26
 Identities = 14/47 (29%), Positives = 30/47 (63%)

Query: 56  TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKE 102
             E EK+   E+E E +K+++++ +EKE ++ K  Q+E K + + ++
Sbjct: 5   ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQ 51



 Score = 28.3 bits (63), Expect = 0.80
 Identities = 12/31 (38%), Positives = 23/31 (74%)

Query: 77  QQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           + + EK+   E+E +++KK+EE+ KE+E K+
Sbjct: 5   ESEAEKKILTEEELERKKKKEEKAKEKELKK 35



 Score = 27.2 bits (60), Expect = 2.1
 Identities = 10/31 (32%), Positives = 23/31 (74%)

Query: 76  QQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
           + + ++K   +E+ E+++KKEE+ K++E +K
Sbjct: 5   ESEAEKKILTEEELERKKKKEEKAKEKELKK 35



 Score = 27.2 bits (60), Expect = 2.1
 Identities = 10/38 (26%), Positives = 27/38 (71%)

Query: 70  EEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           E + +++   +E+ E+++K+E++ K++E +K +  +KE
Sbjct: 5   ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKE 42



 Score = 26.0 bits (57), Expect = 5.1
 Identities = 16/58 (27%), Positives = 31/58 (53%), Gaps = 3/58 (5%)

Query: 53  ELNTEEEEKEEGKEKEEEEQKQQQQQQQ---EKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           E     EE+ E K+K+EE+ K+++ ++    +KE + + + QQ        K+ E+K 
Sbjct: 9   EKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKS 66



 Score = 25.6 bits (56), Expect = 7.4
 Identities = 10/50 (20%), Positives = 32/50 (64%)

Query: 58  EEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           E E E+    EEE +++++++++ KE++ +K +  +K+ + + + ++  +
Sbjct: 5   ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASD 54


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 31.7 bits (73), Expect = 0.055
 Identities = 15/48 (31%), Positives = 35/48 (72%)

Query: 58  EEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
           + + E  +++EEE +K++++ +Q+++E ++KEE+ E+  EE+ +E E 
Sbjct: 99  DRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELER 146



 Score = 31.3 bits (72), Expect = 0.073
 Identities = 20/63 (31%), Positives = 35/63 (55%)

Query: 45  KFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
             L+   +      E ++E +E+  E QK +++  Q++E    K E  EK+EEE +K+E+
Sbjct: 58  ALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEK 117

Query: 105 EKE 107
           E E
Sbjct: 118 ELE 120



 Score = 31.3 bits (72), Expect = 0.074
 Identities = 17/60 (28%), Positives = 41/60 (68%), Gaps = 2/60 (3%)

Query: 50  HFKELNTEEEEKEEGKEKEEE--EQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
             ++L     +KEE  +++ E  E+++++ +++EKE +Q+++E ++K+EE E+  EE+ +
Sbjct: 83  ELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQ 142



 Score = 25.9 bits (58), Expect = 4.2
 Identities = 13/56 (23%), Positives = 38/56 (67%)

Query: 51  FKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
            +E   E ++ E+   ++EE   ++ +  +++EE+ EK+E++ +++++E +++EE+
Sbjct: 77  LRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEE 132



 Score = 25.9 bits (58), Expect = 5.1
 Identities = 18/60 (30%), Positives = 29/60 (48%), Gaps = 11/60 (18%)

Query: 57  EEEEKEEGKEKEEEEQKQQQQQQQEKEEQ-----------QEKEEQQEKKEEEEKKEEEE 105
           E  +KE   E +EE  K + + ++E  E+            +KEE  ++K E  +K EEE
Sbjct: 52  EAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEE 111


>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
           protein TraG; Provisional.
          Length = 942

 Score = 31.3 bits (71), Expect = 0.084
 Identities = 13/56 (23%), Positives = 26/56 (46%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
            E NT + +  E +   + E   Q  +  E++  QE+    +  EE  K+ +E ++
Sbjct: 883 GEENTVKGQYSELQNHHKTEALSQNNKYNEEKSAQERMPGADSPEELMKRAKEYQD 938



 Score = 30.5 bits (69), Expect = 0.14
 Identities = 11/80 (13%), Positives = 27/80 (33%), Gaps = 10/80 (12%)

Query: 26  LESFGNAKTVRNDNSSRFGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQ 85
           +E       +RND          V  +  N   E +    + +   + ++   + +  E 
Sbjct: 846 IEQRTQDSGIRND----------VKHQVDNMVTEYEGNIGDTQNSIRGEENTVKGQYSEL 895

Query: 86  QEKEEQQEKKEEEEKKEEEE 105
           Q   + +   +  +  EE+ 
Sbjct: 896 QNHHKTEALSQNNKYNEEKS 915



 Score = 30.5 bits (69), Expect = 0.14
 Identities = 16/66 (24%), Positives = 26/66 (39%), Gaps = 8/66 (12%)

Query: 30  GNAKTVRNDNSSRFGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKE 89
           G   TV+   S      LQ H K   TE   +     +E+  Q++       +E  +  +
Sbjct: 883 GEENTVKGQYSE-----LQNHHK---TEALSQNNKYNEEKSAQERMPGADSPEELMKRAK 934

Query: 90  EQQEKK 95
           E Q+K 
Sbjct: 935 EYQDKH 940



 Score = 28.2 bits (63), Expect = 0.81
 Identities = 8/51 (15%), Positives = 19/51 (37%)

Query: 57  EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
            E E   G  +     ++   + Q  E Q   + +   +  +  +E+  +E
Sbjct: 868 TEYEGNIGDTQNSIRGEENTVKGQYSELQNHHKTEALSQNNKYNEEKSAQE 918



 Score = 28.2 bits (63), Expect = 0.90
 Identities = 9/49 (18%), Positives = 17/49 (34%)

Query: 55  NTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEE 103
                   +   + EE   + Q  + +   + E   Q  K  EE+  +E
Sbjct: 870 YEGNIGDTQNSIRGEENTVKGQYSELQNHHKTEALSQNNKYNEEKSAQE 918



 Score = 26.6 bits (59), Expect = 2.9
 Identities = 7/45 (15%), Positives = 19/45 (42%)

Query: 57  EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKK 101
           +   K E   +  +  +++  Q++       +E  +  KE ++K 
Sbjct: 896 QNHHKTEALSQNNKYNEEKSAQERMPGADSPEELMKRAKEYQDKH 940


>gnl|CDD|162338 TIGR01398, FlhA, flagellar biosynthesis protein FlhA.  This model
           describes flagellar biosynthesis protein FlhA, one of a
           large number of genes associated with the biosynthesis
           of functional bacterial flagella. Homologs of many such
           proteins, including FlhA, function in type III protein
           secretion systems. A separate model describes InvA
           (Salmonella enterica), LcrD (Yersinia enterocolitica),
           HrcV (Xanthomonas), etc., all of which score below the
           noise cutoff for this model [Cellular processes,
           Chemotaxis and motility].
          Length = 678

 Score = 31.1 bits (71), Expect = 0.087
 Identities = 9/31 (29%), Positives = 19/31 (61%)

Query: 75  QQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
           +++ +QE+E   E  + QE+  EEE++   +
Sbjct: 310 RRRSKQEEEAAAEAAKAQEEAAEEEEESIND 340



 Score = 28.8 bits (65), Expect = 0.44
 Identities = 9/26 (34%), Positives = 16/26 (61%)

Query: 82  KEEQQEKEEQQEKKEEEEKKEEEEKE 107
           +  +QE+E   E  + +E+  EEE+E
Sbjct: 311 RRSKQEEEAAAEAAKAQEEAAEEEEE 336



 Score = 28.4 bits (64), Expect = 0.66
 Identities = 6/31 (19%), Positives = 19/31 (61%)

Query: 72  QKQQQQQQQEKEEQQEKEEQQEKKEEEEKKE 102
           +++ +Q+++   E  + +E+  ++EEE   +
Sbjct: 310 RRRSKQEEEAAAEAAKAQEEAAEEEEESIND 340



 Score = 28.0 bits (63), Expect = 1.1
 Identities = 8/33 (24%), Positives = 19/33 (57%)

Query: 72  QKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
             +++ +Q+E+   +  + Q+E  EEEE+   +
Sbjct: 308 YLRRRSKQEEEAAAEAAKAQEEAAEEEEESIND 340



 Score = 27.2 bits (61), Expect = 1.8
 Identities = 7/31 (22%), Positives = 19/31 (61%)

Query: 77  QQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           +++ ++EE+   E  + ++E  E++EE   +
Sbjct: 310 RRRSKQEEEAAAEAAKAQEEAAEEEEESIND 340


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 31.0 bits (71), Expect = 0.088
 Identities = 14/41 (34%), Positives = 27/41 (65%)

Query: 59  EEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEE 99
           ++KEE  EK+E+E   +++   EKEE+ E+   ++++E E 
Sbjct: 100 DKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELER 140



 Score = 29.1 bits (66), Expect = 0.37
 Identities = 13/48 (27%), Positives = 29/48 (60%)

Query: 58  EEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
           + + E   +KEE  +K++++   +++   EKEE+ E+   E+++E E 
Sbjct: 93  DRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELER 140



 Score = 28.7 bits (65), Expect = 0.52
 Identities = 17/48 (35%), Positives = 28/48 (58%)

Query: 58  EEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
            E + E KE+  E Q+ +++  Q +E    K E  +KKEE  +K+E+E
Sbjct: 65  AELERELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKE 112


>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32.  This family
           consists of several mammalian specific proacrosin
           binding protein sp32 sequences. sp32 is a sperm specific
           protein which is known to bind with with 55- and 53-kDa
           proacrosins and the 49-kDa acrosin intermediate. The
           exact function of sp32 is unclear, it is thought however
           that the binding of sp32 to proacrosin may be involved
           in packaging the acrosin zymogen into the acrosomal
           matrix.
          Length = 243

 Score = 30.8 bits (69), Expect = 0.099
 Identities = 18/60 (30%), Positives = 36/60 (60%)

Query: 45  KFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
           + LQ       + + +  + K+++   + Q+  Q+ + EE+Q +EEQ+E++ EEE K+EE
Sbjct: 178 ELLQSSLSLGGSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEE 237



 Score = 26.9 bits (59), Expect = 2.1
 Identities = 12/47 (25%), Positives = 28/47 (59%)

Query: 61  KEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
              G  + +  + +Q+Q   + +E  ++ + +EK+ +EE++EEE +E
Sbjct: 185 SLGGSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEE 231



 Score = 25.8 bits (56), Expect = 5.2
 Identities = 12/45 (26%), Positives = 31/45 (68%)

Query: 60  EKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
           +++   + +E  Q+ + +++Q +EEQ+E+E ++E K+EE +  ++
Sbjct: 199 QEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQGTDD 243



 Score = 25.4 bits (55), Expect = 7.6
 Identities = 12/54 (22%), Positives = 34/54 (62%)

Query: 54  LNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           L+     + +  + ++E+   + Q+  ++ + +EK+ Q+E++EEE ++E +++E
Sbjct: 184 LSLGGSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEE 237


>gnl|CDD|221775 pfam12794, MscS_TM, Mechanosensitive ion channel inner membrane
           domain 1.  The small mechanosensitive channel, MscS, is
           a part of the turgor-driven solute efflux system that
           protects bacteria from lysis in the event of osmotic
           shock. The MscS protein alone is sufficient to form a
           functional mechanosensitive channel gated directly by
           tension in the lipid bilayer. The MscS proteins are
           heptamers of three transmembrane subunits with seven
           converging M3 domains, and this domain is one of the
           inner membrane domains.
          Length = 339

 Score = 30.7 bits (70), Expect = 0.10
 Identities = 8/29 (27%), Positives = 15/29 (51%)

Query: 71  EQKQQQQQQQEKEEQQEKEEQQEKKEEEE 99
            ++ +   Q+ +EE++  E   E  EE E
Sbjct: 265 AKRAEILAQRAEEEEESSEGAAETIEEPE 293



 Score = 29.1 bits (66), Expect = 0.39
 Identities = 9/32 (28%), Positives = 16/32 (50%), Gaps = 2/32 (6%)

Query: 76  QQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           + ++ E   Q+ +EE  E+  E   +  EE E
Sbjct: 264 KAKRAEILAQRAEEE--EESSEGAAETIEEPE 293


>gnl|CDD|218311 pfam04888, SseC, Secretion system effector C (SseC) like family.
           SseC is a secreted protein that forms a complex together
           with SecB and SecD on the surface of Salmonella. All
           these proteins are secreted by the type III secretion
           system. Many mucosal pathogens use type III secretion
           systems for the injection of effector proteins into
           target cells. SecB, SseC and SecD are inserted into the
           target cell membrane. where they form a small pore or
           translocon. In addition to SseC, this family includes
           the bacterial secreted proteins PopB, PepB, YopB and
           EspD which are thought to be directly involved in pore
           formation, and type III secretion system translocon.
          Length = 303

 Score = 30.1 bits (68), Expect = 0.15
 Identities = 14/41 (34%), Positives = 29/41 (70%)

Query: 66  EKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
           EK+ + + QQ ++ ++K+E++ +E Q++ K+  EK EE +K
Sbjct: 14  EKQAKSKLQQLERARDKQEKKAEEYQEQIKKAIEKAEEAKK 54



 Score = 28.6 bits (64), Expect = 0.60
 Identities = 11/41 (26%), Positives = 26/41 (63%)

Query: 67  KEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           K  E+Q + + QQ E+   +++++ +E +E+ +K  E+ +E
Sbjct: 11  KLAEKQAKSKLQQLERARDKQEKKAEEYQEQIKKAIEKAEE 51



 Score = 27.4 bits (61), Expect = 1.2
 Identities = 14/44 (31%), Positives = 29/44 (65%), Gaps = 1/44 (2%)

Query: 64  GKEKEEEEQKQQQQQQQEKEEQQEK-EEQQEKKEEEEKKEEEEK 106
            K  E++ + + QQ ++ +++Q++K EE QE+ ++  +K EE K
Sbjct: 10  SKLAEKQAKSKLQQLERARDKQEKKAEEYQEQIKKAIEKAEEAK 53



 Score = 27.4 bits (61), Expect = 1.4
 Identities = 9/35 (25%), Positives = 23/35 (65%)

Query: 67  KEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKK 101
            ++ E+ + +Q+++ +E Q++ ++  EK EE +K 
Sbjct: 21  LQQLERARDKQEKKAEEYQEQIKKAIEKAEEAKKS 55



 Score = 27.4 bits (61), Expect = 1.6
 Identities = 12/37 (32%), Positives = 25/37 (67%)

Query: 65  KEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKK 101
           K K ++ ++ + +Q+++ EE QE+ ++  +K EE KK
Sbjct: 18  KSKLQQLERARDKQEKKAEEYQEQIKKAIEKAEEAKK 54



 Score = 25.9 bits (57), Expect = 5.2
 Identities = 7/40 (17%), Positives = 24/40 (60%)

Query: 56 TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKK 95
            E++ +   ++ E  + +Q+++ +E +EQ +K  ++ ++
Sbjct: 12 LAEKQAKSKLQQLERARDKQEKKAEEYQEQIKKAIEKAEE 51


>gnl|CDD|212055 cd11486, SLC5sbd_SGLT1, Na(+)/glucose cotransporter SGLT1;solute
           binding domain.  Human SGLT1 (hSGLT1) is a
           high-affinity/low-capacity glucose transporter, which
           can also transport galactose. In the transport
           mechanism, two Na+ ions first bind to the extracellular
           side of the transporter and induce a conformational
           change in the glucose binding site. This results in an
           increased affinity for glucose. A second conformational
           change in the transporter follows, bringing the Na+ and
           glucose binding sites to the inner surface of the
           membrane. Glucose is then released, followed by the Na+
           ions. In the process, hSGLT1 is also able to transport
           water and urea and may be a major pathway for transport
           of these across the intestinal brush-border membrane.
           hSGLT1 is encoded by the SLC5A1 gene and expressed
           mostly in the intestine, but also in the trachea,
           kidney, heart, brain, testis, and prostate. The
           WHO/UNICEF oral rehydration solution (ORS) for the
           treatment of secretory diarrhea contains salt and
           glucose. The glucose, along with sodium ions, is
           transported by hSGLT1 and water is either co-transported
           along with these or follows by osmosis. Mutations in
           SGLT1 are associated with intestinal glucose galactose
           malabsorption (GGM). Up-regulation of intestinal SGLT1
           may protect against enteric infections. SGLT1 is
           expressed in colorectal, head and neck, and prostate
           tumors. Epidermal growth factor receptor (EGFR)
           functions in cell survival by stabilizing SGLT1, and
           thereby maintaining intracellular glucose levels. SGLT1
           is predicted to have 14 membrane-spanning regions. This
           subgroup belongs to the solute carrier 5
           (SLC5)transporter family.
          Length = 635

 Score = 30.2 bits (68), Expect = 0.15
 Identities = 11/63 (17%), Positives = 26/63 (41%), Gaps = 14/63 (22%)

Query: 55  NTEEEEKEEGKEKEEEEQKQQQQQQQEKEE--------------QQEKEEQQEKKEEEEK 100
           +TEE    +  +  E+E + + +  +E+++               Q K  +  ++EE   
Sbjct: 540 STEERIDLDADDWTEDEDENEMETDEERKKPGCCRKAYNWFCGFDQGKAPKLTEEEEAAL 599

Query: 101 KEE 103
           K +
Sbjct: 600 KMK 602



 Score = 29.5 bits (66), Expect = 0.33
 Identities = 6/32 (18%), Positives = 17/32 (53%)

Query: 76  QQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           +   +E+ +    +  +++ E E + +EE K+
Sbjct: 538 RNSTEERIDLDADDWTEDEDENEMETDEERKK 569


>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 187 and 201 amino
           acids in length. There is a single completely conserved
           residue Q that may be functionally important.
          Length = 158

 Score = 29.9 bits (68), Expect = 0.16
 Identities = 11/52 (21%), Positives = 25/52 (48%)

Query: 53  ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
           EL  +  E E    + E + K  + ++++   + E +E + K +E E + + 
Sbjct: 55  ELAADLLELEAAAPRAELQAKIARYKKEKARYRSEAKELEAKAKEAEAESDH 106


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 30.1 bits (68), Expect = 0.17
 Identities = 20/54 (37%), Positives = 40/54 (74%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
           KE+  EEEE+E+ ++KEEEE+   ++++ ++EE++E+++++ KK +E   E E 
Sbjct: 33  KEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWEL 86



 Score = 29.0 bits (65), Expect = 0.40
 Identities = 18/52 (34%), Positives = 42/52 (80%)

Query: 56  TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
            +E EKE   E+EEEE++++++++++  +++E+ +++E+KEE++KK ++ KE
Sbjct: 28  EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKE 79


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.  This
           domain is found to the N-terminus of bacterial signal
           peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 29.4 bits (67), Expect = 0.21
 Identities = 9/33 (27%), Positives = 22/33 (66%)

Query: 69  EEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKK 101
           ++E K  ++ +++ E+ + K E+++ K+EE K 
Sbjct: 65  KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKP 97


>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 925

 Score = 29.6 bits (67), Expect = 0.24
 Identities = 14/69 (20%), Positives = 36/69 (52%)

Query: 39  NSSRFGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEE 98
           NS    +  ++  + + +EEE  +  + + EE   + ++ +   EEQ+  +++ + +  E
Sbjct: 797 NSDINDRPQELIDQIIESEEERLKALRIQREEMLMRPEELELINEEQKNLKQEIKLELSE 856

Query: 99  EKKEEEEKE 107
            ++ EEE +
Sbjct: 857 IQEAEEEIQ 865



 Score = 27.7 bits (62), Expect = 1.2
 Identities = 11/70 (15%), Positives = 29/70 (41%), Gaps = 3/70 (4%)

Query: 37  NDNSSRFGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQ---QQQQQQEKEEQQEKEEQQE 93
           ++      +   ++  +    E + +  K  +    K    +     ++ +Q EK  +++
Sbjct: 855 SEIQEAEEEIQNINENKNEFVEFKNDPKKLNKLIIAKDVLIKLVISSDEIKQDEKTTKKK 914

Query: 94  KKEEEEKKEE 103
           KK+ E+  EE
Sbjct: 915 KKDLEKTDEE 924



 Score = 27.7 bits (62), Expect = 1.3
 Identities = 18/100 (18%), Positives = 40/100 (40%), Gaps = 7/100 (7%)

Query: 8   NKSPSNLITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHFKELNTEEEEKEEGKEK 67
           N +   ++    + +  +  S  N +    +   +  +  +   K L  + EE       
Sbjct: 778 NPNAEKILIITEILSDGINNSDINDRP--QELIDQIIESEEERLKALRIQREEML--MRP 833

Query: 68  EEEEQKQQQQQ---QQEKEEQQEKEEQQEKKEEEEKKEEE 104
           EE E   ++Q+   Q+ K E  E +E +E+ +   + + E
Sbjct: 834 EELELINEEQKNLKQEIKLELSEIQEAEEEIQNINENKNE 873


>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
           Reviewed.
          Length = 460

 Score = 29.6 bits (66), Expect = 0.26
 Identities = 11/52 (21%), Positives = 18/52 (34%), Gaps = 2/52 (3%)

Query: 58  EEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEE--EKKEEEEKE 107
           EE  E   E+E+      Q     + + +      E+      E  +E E E
Sbjct: 364 EETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPE 415



 Score = 28.0 bits (62), Expect = 0.90
 Identities = 8/49 (16%), Positives = 17/49 (34%), Gaps = 1/49 (2%)

Query: 60  EKEEGKEKE-EEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           E+E+  +   +     Q   +      +E      +  +E + E  EK 
Sbjct: 372 EREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKA 420


>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit.  This model
           describes Pseudomonas denitrificans CobT gene product,
           which is a cobalt chelatase subunit that functions in
           cobalamin biosynthesis. Cobalamin (vitamin B12) can be
           synthesized via several pathways, including an aerobic
           pathway (found in Pseudomonas denitrificans) and an
           anaerobic pathway (found in P. shermanii and Salmonella
           typhimurium). These pathways differ in the point of
           cobalt insertion during corrin ring formation. There are
           apparently a number of variations on these two pathways,
           where the major differences seem to be concerned with
           the process of ring contraction. Confusion regarding the
           functions of enzymes found in the aerobic vs. anaerobic
           pathways has arisen because nonhomologous genes in these
           different pathways were given the same gene symbols.
           Thus, cobT in the aerobic pathway (P. denitrificans) is
           not a homolog of cobT in the anaerobic pathway (S.
           typhimurium). It should be noted that E. coli
           synthesizes cobalamin only when it is supplied with the
           precursor cobinamide, which is a complex intermediate.
           Additionally, all E. coli cobalamin synthesis genes
           (cobU, cobS and cobT) were named after their Salmonella
           typhimurium homologs which function in the anaerobic
           cobalamin synthesis pathway. This model describes the
           aerobic cobalamin pathway Pseudomonas denitrificans CobT
           gene product, which is a cobalt chelatase subunit, with
           a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
           heterotrimeric, ATP-dependent enzyme that catalyzes
           cobalt insertion during cobalamin biosynthesis. The
           other two subunits are the P. denitrificans CobS
           (TIGR01650) and CobN (pfam02514 CobN/Magnesium
           Chelatase) proteins. To avoid potential confusion with
           the nonhomologous Salmonella typhimurium/E.coli cobT
           gene product, the P. denitrificans gene symbol is not
           used in the name of this model [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Heme,
           porphyrin, and cobalamin].
          Length = 600

 Score = 29.5 bits (66), Expect = 0.29
 Identities = 15/70 (21%), Positives = 36/70 (51%)

Query: 38  DNSSRFGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEE 97
           D+   F + ++   + +   EE  ++ + ++EE+    Q  + E+EEQ E E + ++   
Sbjct: 181 DDQQAFARVVREMLRSMELAEEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSA 240

Query: 98  EEKKEEEEKE 107
            ++ E  ++E
Sbjct: 241 PQESEATDRE 250



 Score = 29.1 bits (65), Expect = 0.43
 Identities = 15/52 (28%), Positives = 27/52 (51%)

Query: 56  TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           +E+EE  +  +  E EQ++Q + + E +E    +E +    E E  EEE  +
Sbjct: 209 SEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQ 260



 Score = 28.4 bits (63), Expect = 0.69
 Identities = 17/53 (32%), Positives = 31/53 (58%), Gaps = 1/53 (1%)

Query: 55  NTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           +TE E++E+G + +  E +Q++Q + E  E QE    QE +  + + E  E+E
Sbjct: 206 DTESEDEEDGDDDQPTENEQEEQGEGE-GEGQEGSAPQESEATDRESESGEEE 257



 Score = 27.2 bits (60), Expect = 1.8
 Identities = 10/56 (17%), Positives = 31/56 (55%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           +E+  + E ++E    +++  + +Q++Q E E + ++    ++ E  +++ E  +E
Sbjct: 201 EEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEE 256



 Score = 26.4 bits (58), Expect = 3.1
 Identities = 10/55 (18%), Positives = 30/55 (54%)

Query: 53  ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           E + ++++  E +++E+ E + + Q+    +E +  + + E  EEE  + +++  
Sbjct: 212 EEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQSDQDDL 266



 Score = 26.4 bits (58), Expect = 3.5
 Identities = 10/52 (19%), Positives = 29/52 (55%)

Query: 56  TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           TE E++E+G+ + E ++    Q+ +  + + E  E++  + +++   +E  +
Sbjct: 221 TENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQSDQDDLPDESDD 272


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
          Length = 330

 Score = 29.0 bits (66), Expect = 0.32
 Identities = 5/41 (12%), Positives = 24/41 (58%)

Query: 67  KEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           ++ +E K++ +     +++ +   + +KK+E+++ +  + +
Sbjct: 47  EQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAK 87



 Score = 27.9 bits (63), Expect = 0.97
 Identities = 11/42 (26%), Positives = 25/42 (59%)

Query: 66  EKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           E+ +E +++ +    +K+E +   + Q+KKE++E K  + K 
Sbjct: 47  EQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKS 88



 Score = 25.2 bits (56), Expect = 7.7
 Identities = 8/27 (29%), Positives = 17/27 (62%)

Query: 69 EEEQKQQQQQQQEKEEQQEKEEQQEKK 95
          ++E K   + Q++KE+Q+ K  + + K
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSK 89


>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682).  This
           domain family is found in eukaryotes, and is typically
           between 125 and 136 amino acids in length.
          Length = 133

 Score = 29.0 bits (65), Expect = 0.32
 Identities = 14/40 (35%), Positives = 24/40 (60%)

Query: 58  EEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEE 97
              ++E +E+EEE +KQQQ  + + ++ Q+ E   E  EE
Sbjct: 92  GHTRQEEEEEEEENEKQQQSDEAQVQQHQQHEHPAENGEE 131



 Score = 25.9 bits (57), Expect = 3.1
 Identities = 12/40 (30%), Positives = 25/40 (62%)

Query: 63  EGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKE 102
               +EEEE++++ ++QQ+ +E Q ++ QQ +   E  +E
Sbjct: 92  GHTRQEEEEEEEENEKQQQSDEAQVQQHQQHEHPAENGEE 131



 Score = 25.5 bits (56), Expect = 4.6
 Identities = 14/38 (36%), Positives = 22/38 (57%)

Query: 50  HFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQE 87
           H ++   EEEE+ E +++ +E Q QQ QQ +   E  E
Sbjct: 93  HTRQEEEEEEEENEKQQQSDEAQVQQHQQHEHPAENGE 130


>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
           (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
           phosphoribosyltransferase) [Coenzyme metabolism].
          Length = 620

 Score = 29.4 bits (66), Expect = 0.33
 Identities = 11/53 (20%), Positives = 27/53 (50%)

Query: 53  ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
           +   E+ ++E+G + + +  +  +  ++E E   E EE + +  + E +E E 
Sbjct: 225 DGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEM 277



 Score = 26.7 bits (59), Expect = 3.0
 Identities = 10/51 (19%), Positives = 25/51 (49%)

Query: 55  NTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
           N E+ E    + +  +E ++ + +  + E ++ + +  E  E+ E  E +E
Sbjct: 243 NNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSESDESDE 293



 Score = 26.3 bits (58), Expect = 3.3
 Identities = 10/53 (18%), Positives = 25/53 (47%)

Query: 55  NTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           N + E   E  E  +E ++ + +    + E+ E +  +  ++ E  + +E+ E
Sbjct: 244 NEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSESDESDEDTE 296



 Score = 25.6 bits (56), Expect = 6.7
 Identities = 10/51 (19%), Positives = 27/51 (52%), Gaps = 1/51 (1%)

Query: 57  EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           +EE+ ++  + +  E  +  +++ E  ++ E++E +    E E+ E +  E
Sbjct: 232 DEEDGDDD-QPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAE 281



 Score = 25.6 bits (56), Expect = 7.2
 Identities = 12/51 (23%), Positives = 25/51 (49%), Gaps = 2/51 (3%)

Query: 59  EEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKK--EEEEKE 107
           ++  E    EE+    Q    ++ E  +E+ E  ++ EE+E +  + E +E
Sbjct: 224 DDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEE 274


>gnl|CDD|235229 PRK04156, gltX, glutamyl-tRNA synthetase; Provisional.
          Length = 567

 Score = 29.0 bits (66), Expect = 0.37
 Identities = 14/27 (51%), Positives = 19/27 (70%), Gaps = 2/27 (7%)

Query: 83  EEQQEK--EEQQEKKEEEEKKEEEEKE 107
           EEQ+E+  E   E  EEEE+K+EE+K 
Sbjct: 64  EEQRERLEELAPELLEEEEEKKEEKKG 90



 Score = 25.2 bits (56), Expect = 7.8
 Identities = 9/26 (34%), Positives = 19/26 (73%)

Query: 76  QQQQQEKEEQQEKEEQQEKKEEEEKK 101
           +Q+++ +E   E  E++E+K+EE+K 
Sbjct: 65  EQRERLEELAPELLEEEEEKKEEKKG 90


>gnl|CDD|215595 PLN03130, PLN03130, ABC transporter C family member; Provisional.
          Length = 1622

 Score = 29.3 bits (66), Expect = 0.39
 Identities = 16/67 (23%), Positives = 27/67 (40%)

Query: 39  NSSRFGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEE 98
           N   F K ++   K     EE  EE  ++   +          K++   K++ +E K   
Sbjct: 829 NGPLFQKLMENAGKMEEYVEENGEEEDDQTSSKPVANGNANNLKKDSSSKKKSKEGKSVL 888

Query: 99  EKKEEEE 105
            K+EE E
Sbjct: 889 IKQEERE 895


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 28.5 bits (64), Expect = 0.45
 Identities = 18/56 (32%), Positives = 34/56 (60%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           +E    EE KEE KE    E K+ +   ++++E+ E+E ++E +E  ++ E+E +E
Sbjct: 57  QETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEE 112



 Score = 27.8 bits (62), Expect = 0.83
 Identities = 14/53 (26%), Positives = 34/53 (64%)

Query: 55  NTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
             E ++ ++ +  E EE K+++++    E++++K + +++ EE E++ EEE E
Sbjct: 48  EQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDE 100



 Score = 25.9 bits (57), Expect = 4.6
 Identities = 15/52 (28%), Positives = 34/52 (65%)

Query: 56  TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
            +EEEKE    +++E++   +++ +E EE+ E+E+++   E E++ EE+ + 
Sbjct: 65  VKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTES 116


>gnl|CDD|221429 pfam12118, SprA-related, SprA-related family.  This protein is
           found in bacteria. Proteins in this family are typically
           between 234 to 465 amino acids in length. There is a
           conserved GEV sequence motif.Most members are annotated
           as being SprA-related.
          Length = 261

 Score = 28.6 bits (64), Expect = 0.48
 Identities = 21/83 (25%), Positives = 35/83 (42%)

Query: 10  SPSNLITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHFKELNTEEEEKEEGKEKEE 69
           S S         +    +  G A T     S    K       E    EEE E+ + +  
Sbjct: 36  SSSEASNAGSGSSEQKAKLKGQASTAAGSASQELQKQASESNDEEVVGEEEPEKDQAEAG 95

Query: 70  EEQKQQQQQQQEKEEQQEKEEQQ 92
           ++  Q+QQ+QQ++EE+Q +E + 
Sbjct: 96  KQLSQKQQEQQQQEEKQVRELKA 118



 Score = 26.3 bits (58), Expect = 2.9
 Identities = 8/46 (17%), Positives = 31/46 (67%)

Query: 57  EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKE 102
           +E +K+  +  +EE   +++ ++ + E  ++  ++Q++++++E+K+
Sbjct: 67  QELQKQASESNDEEVVGEEEPEKDQAEAGKQLSQKQQEQQQQEEKQ 112



 Score = 25.5 bits (56), Expect = 6.9
 Identities = 10/53 (18%), Positives = 29/53 (54%)

Query: 53  ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
                   +E  K+  E   ++   +++ +++Q E  +Q  +K++E++++EE+
Sbjct: 59  STAAGSASQELQKQASESNDEEVVGEEEPEKDQAEAGKQLSQKQQEQQQQEEK 111



 Score = 25.5 bits (56), Expect = 7.2
 Identities = 7/51 (13%), Positives = 31/51 (60%)

Query: 55  NTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
                ++ + +  E  +++   +++ EK++ +  ++  +K++E++++EE++
Sbjct: 62  AGSASQELQKQASESNDEEVVGEEEPEKDQAEAGKQLSQKQQEQQQQEEKQ 112


>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
          Length = 1560

 Score = 28.9 bits (64), Expect = 0.48
 Identities = 23/102 (22%), Positives = 39/102 (38%), Gaps = 2/102 (1%)

Query: 6   AVNKSPSNLITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHFKELNTEEEEKEEGK 65
           A N+  +NL T     A+    + GN  +V N  +SR             T +      +
Sbjct: 492 AANQPAANLPTANQPAANNFHGAAGN--SVGNPFASRPFGSAPYGGNAATTADPNGIAKR 549

Query: 66  EKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           E   E    +Q+ +Q  EE  E    +   E E +  ++ +E
Sbjct: 550 EDHPEGGTNRQKYEQSDEESVESSSSENSSENENEVTDKGEE 591


>gnl|CDD|237147 PRK12586, PRK12586, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 145

 Score = 28.2 bits (63), Expect = 0.52
 Identities = 6/40 (15%), Positives = 16/40 (40%)

Query: 65  KEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
           ++ +            E+   +  + + +K+EE  KK  +
Sbjct: 104 RKNDAHTHASILLSSNEQNSTEALQLRAKKREEHRKKWYQ 143



 Score = 25.9 bits (57), Expect = 4.1
 Identities = 6/29 (20%), Positives = 15/29 (51%)

Query: 65  KEKEEEEQKQQQQQQQEKEEQQEKEEQQE 93
              E+   +  Q + +++EE ++K  Q +
Sbjct: 117 SSNEQNSTEALQLRAKKREEHRKKWYQND 145


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 28.5 bits (64), Expect = 0.57
 Identities = 13/63 (20%), Positives = 30/63 (47%), Gaps = 9/63 (14%)

Query: 51   FKELNTEEEEKEEGKEKEEEEQKQ---------QQQQQQEKEEQQEKEEQQEKKEEEEKK 101
             +  N EE+E     E E +++K+         Q++  +E   + + ++++ KK+ +   
Sbjct: 1520 IENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNKKQYKSNT 1579

Query: 102  EEE 104
            E E
Sbjct: 1580 EAE 1582



 Score = 27.4 bits (61), Expect = 1.5
 Identities = 12/24 (50%), Positives = 20/24 (83%)

Query: 83  EEQQEKEEQQEKKEEEEKKEEEEK 106
           EE+ +K++++EKK+EEE K EE+ 
Sbjct: 742 EEKTKKKKKKEKKKEEEYKREEKA 765



 Score = 25.8 bits (57), Expect = 6.3
 Identities = 11/42 (26%), Positives = 29/42 (69%)

Query: 64  GKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
           GK+ E +     +++ ++K+++++K+E++ K+EE+ + E  E
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIEIAE 771


>gnl|CDD|236842 PRK11091, PRK11091, aerobic respiration control sensor protein
           ArcB; Provisional.
          Length = 779

 Score = 28.8 bits (65), Expect = 0.58
 Identities = 14/46 (30%), Positives = 26/46 (56%), Gaps = 7/46 (15%)

Query: 47  LQVHFKELNTE-------EEEKEEGKEKEEEEQKQQQQQQQEKEEQ 85
           L+ +  +LN E       EE ++E  E+ + E K++++ Q E E+Q
Sbjct: 108 LKDNIAQLNQEIAEREKAEEARQEAFEQLKNEIKEREETQIELEQQ 153


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 28.4 bits (64), Expect = 0.62
 Identities = 19/96 (19%), Positives = 37/96 (38%), Gaps = 10/96 (10%)

Query: 15  ITEQILEASPLLESFGNAKTVRNDNSSRF------GKFLQVHFKELNTEEEEK----EEG 64
           + E  LE   L     N + +  D    +       +      +EL     E+    EEG
Sbjct: 152 VPEDKLEELKLESDVENVEYISTDKGYVYVVVVVLKELSDEVEEELKKLGFERLELEEEG 211

Query: 65  KEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEK 100
              E   + +++ ++ EKE +   EE +E  ++  +
Sbjct: 212 TPSELIREIKEELEEIEKERESLLEELKELAKKYLE 247


>gnl|CDD|217050 pfam02460, Patched, Patched family.  The transmembrane protein
           Patched is a receptor for the morphogene Sonic Hedgehog.
           This protein associates with the smoothened protein to
           transduce hedgehog signals.
          Length = 801

 Score = 28.5 bits (64), Expect = 0.62
 Identities = 16/61 (26%), Positives = 26/61 (42%), Gaps = 4/61 (6%)

Query: 6   AVNKSPSNLITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQ--VHFKELNTEEEEKEE 63
            VN  P   I E     + +++ F +       NS+ F  +L+   +F + N  E E EE
Sbjct: 498 FVNNPPDLTIPENRDRLNEMVDEFESTPYSMGPNSTSF--WLREYENFYKTNGSELEDEE 555

Query: 64  G 64
            
Sbjct: 556 K 556


>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
          Length = 449

 Score = 28.6 bits (64), Expect = 0.63
 Identities = 14/56 (25%), Positives = 30/56 (53%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           K+    ++E E+ K  E   Q+++++  Q + E++ + EQ E+ EE+ +  E    
Sbjct: 371 KDSAEYDQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPL 426



 Score = 25.5 bits (56), Expect = 6.4
 Identities = 13/52 (25%), Positives = 27/52 (51%)

Query: 56  TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
            EE  + E +E+ E EQ ++ ++     E     E + + E+E ++ + E+E
Sbjct: 394 EEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEEE 445



 Score = 25.1 bits (55), Expect = 7.9
 Identities = 10/52 (19%), Positives = 26/52 (50%)

Query: 56  TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
            E+ +  E   + EEE+  Q++ ++  E +Q +  +++ +  E     E++ 
Sbjct: 380 AEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEF 431


>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
           Provisional.
          Length = 1123

 Score = 28.4 bits (64), Expect = 0.63
 Identities = 11/39 (28%), Positives = 25/39 (64%)

Query: 63  EGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKK 101
           E K++E E Q +Q Q++  +  Q+ K++++E  ++  K+
Sbjct: 190 EEKQQELEAQLEQLQEKAAETSQERKQKRKEITDQAAKR 228



 Score = 27.2 bits (61), Expect = 1.7
 Identities = 10/50 (20%), Positives = 28/50 (56%)

Query: 53  ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKE 102
               +++E    +    E +++QQ+ + + E+ QEK  +  ++ ++++KE
Sbjct: 171 LAEAQQQELVALEGLAAELEEKQQELEAQLEQLQEKAAETSQERKQKRKE 220



 Score = 25.7 bits (57), Expect = 5.6
 Identities = 10/40 (25%), Positives = 25/40 (62%)

Query: 56  TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKK 95
            EE+++E   + E+ ++K  +  Q+ K++++E  +Q  K+
Sbjct: 189 LEEKQQELEAQLEQLQEKAAETSQERKQKRKEITDQAAKR 228



 Score = 25.3 bits (56), Expect = 9.4
 Identities = 15/55 (27%), Positives = 30/55 (54%), Gaps = 1/55 (1%)

Query: 53  ELNTEEEEKEEGKEKEEEEQKQQQQQQQEK-EEQQEKEEQQEKKEEEEKKEEEEK 106
           E   +E    EG   E EE++Q+ + Q E+ +E+  +  Q+ K++ +E  ++  K
Sbjct: 173 EAQQQELVALEGLAAELEEKQQELEAQLEQLQEKAAETSQERKQKRKEITDQAAK 227



 Score = 24.9 bits (55), Expect = 9.9
 Identities = 11/49 (22%), Positives = 29/49 (59%)

Query: 57  EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
            E E+++ + + + EQ Q++  +  +E +Q+++E  ++  +  +  EEE
Sbjct: 187 AELEEKQQELEAQLEQLQEKAAETSQERKQKRKEITDQAAKRLELSEEE 235


>gnl|CDD|237869 PRK14962, PRK14962, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 472

 Score = 28.6 bits (64), Expect = 0.64
 Identities = 15/47 (31%), Positives = 28/47 (59%)

Query: 57  EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEE 103
                +E   +E+ +    QQ++++KEE + KEE+QE  E E++ +E
Sbjct: 335 SSPNVQENDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRFKE 381


>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
           binding fragment.  Topoisomerase I promotes the
           relaxation of DNA superhelical tension by introducing a
           transient single-stranded break in duplex DNA and are
           vital for the processes of replication, transcription,
           and recombination. This family may be more than one
           structural domain.
          Length = 215

 Score = 28.3 bits (64), Expect = 0.67
 Identities = 11/28 (39%), Positives = 20/28 (71%)

Query: 77  QQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
           + ++EK++   KEE++  KEE++K EE 
Sbjct: 96  EAEKEKKKAMSKEEKKAIKEEKDKLEEP 123


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 28.4 bits (63), Expect = 0.69
 Identities = 10/51 (19%), Positives = 28/51 (54%)

Query: 57   EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
            E+ ++++  +  E+++K  +   +E  ++ E+  +   K +EE ++ E  E
Sbjct: 4045 EDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPE 4095



 Score = 26.9 bits (59), Expect = 2.6
 Identities = 14/52 (26%), Positives = 25/52 (48%), Gaps = 3/52 (5%)

Query: 59   EEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQ---EKKEEEEKKEEEEKE 107
            +E  E    E E++  +Q     + +   KE+     E K+ +EK++EEE  
Sbjct: 3909 DEPNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMS 3960



 Score = 26.1 bits (57), Expect = 4.6
 Identities = 10/49 (20%), Positives = 26/49 (53%)

Query: 57   EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
            EE+  E  ++  E+     +     KE+  +  E ++++E+E+++E  +
Sbjct: 3913 EEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSD 3961



 Score = 26.1 bits (57), Expect = 4.9
 Identities = 11/50 (22%), Positives = 24/50 (48%)

Query: 58   EEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
            EE+  E ++K  E+     +     +E   K  + + ++E+E +EE   +
Sbjct: 3913 EEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDD 3962



 Score = 26.1 bits (57), Expect = 5.3
 Identities = 10/53 (18%), Positives = 26/53 (49%), Gaps = 2/53 (3%)

Query: 55   NTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKK--EEEEKKEEEE 105
              +E+  E+G E+  +E ++  +   + +E+ E+ E  E +  +   K + + 
Sbjct: 4057 EDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQAIDNHPKMDAKS 4109


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 28.2 bits (63), Expect = 0.70
 Identities = 11/55 (20%), Positives = 39/55 (70%)

Query: 47  LQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKK 101
           LQ + K+L   ++E +  +E E + ++++ +Q++ +++Q+E+ ++ ++K +++++
Sbjct: 378 LQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDER 432



 Score = 27.8 bits (62), Expect = 0.92
 Identities = 7/50 (14%), Positives = 34/50 (68%)

Query: 58  EEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           ++  E   E +  E+ + + ++++ ++++ +++Q+E+ +E+++K ++++ 
Sbjct: 383 KKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDER 432



 Score = 27.8 bits (62), Expect = 1.0
 Identities = 9/56 (16%), Positives = 33/56 (58%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           + L    ++ ++  +K  E + +    ++ + + +E++ +QE+ E+++K++ +E +
Sbjct: 369 ELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDK 424



 Score = 27.1 bits (60), Expect = 2.2
 Identities = 14/82 (17%), Positives = 43/82 (52%)

Query: 25  LLESFGNAKTVRNDNSSRFGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEE 84
           L E   +   +  D      K      ++   +  E ++  +  EE + + ++++ ++EE
Sbjct: 352 LYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEE 411

Query: 85  QQEKEEQQEKKEEEEKKEEEEK 106
            ++K+++Q  +++E+++++E K
Sbjct: 412 NEKKQKEQADEDKEKRQKDERK 433



 Score = 26.7 bits (59), Expect = 2.5
 Identities = 11/51 (21%), Positives = 31/51 (60%)

Query: 57  EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           +E  KE  K+ ++  +K  + + +    ++ + + +E+K ++E+ E+++KE
Sbjct: 368 QELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKE 418



 Score = 26.7 bits (59), Expect = 2.6
 Identities = 14/50 (28%), Positives = 29/50 (58%)

Query: 58  EEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           ++ +E  KE  ++ Q   ++  + K+E    EE + K +EE+ K+EE ++
Sbjct: 365 DKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEK 414



 Score = 25.5 bits (56), Expect = 6.5
 Identities = 11/53 (20%), Positives = 31/53 (58%)

Query: 55  NTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           +   ++++E  ++  ++ +   ++  E +++ +  E+ E K +EEK ++EE E
Sbjct: 361 DLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENE 413


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 28.2 bits (63), Expect = 0.71
 Identities = 13/51 (25%), Positives = 37/51 (72%)

Query: 57  EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           EEE+ +E     E++ K+ + +  ++E+++++EE++E++EE+E  ++++ +
Sbjct: 145 EEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDD 195



 Score = 27.0 bits (60), Expect = 1.7
 Identities = 16/54 (29%), Positives = 35/54 (64%)

Query: 54  LNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           L TEEEE  + K    E++ ++ + +   EE ++ EE++E++EEE++  +++ +
Sbjct: 141 LFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDD 194



 Score = 25.9 bits (57), Expect = 3.9
 Identities = 12/52 (23%), Positives = 35/52 (67%)

Query: 56  TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
            EE+  E+    E++ ++ + +   E++E+ E+EE++E++E+E+  ++++ +
Sbjct: 145 EEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDD 196


>gnl|CDD|237205 PRK12792, flhA, flagellar biosynthesis protein FlhA; Reviewed.
          Length = 694

 Score = 28.2 bits (63), Expect = 0.73
 Identities = 9/27 (33%), Positives = 11/27 (40%)

Query: 57  EEEEKEEGKEKEEEEQKQQQQQQQEKE 83
                E  K K EEE  Q + +   KE
Sbjct: 329 ARAAAEAAKVKREEESAQAEAKDSVKE 355



 Score = 25.8 bits (57), Expect = 6.2
 Identities = 7/35 (20%), Positives = 14/35 (40%)

Query: 70  EEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
                +  + + +EE  + E +   KE+    E E
Sbjct: 329 ARAAAEAAKVKREEESAQAEAKDSVKEQLRTAEIE 363



 Score = 25.5 bits (56), Expect = 7.1
 Identities = 6/35 (17%), Positives = 16/35 (45%)

Query: 64  GKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEE 98
            +   E  + +++++  + E +   +EQ    E E
Sbjct: 329 ARAAAEAAKVKREEESAQAEAKDSVKEQLRTAEIE 363


>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT.  This family
           consists of several bacterial cobalamin biosynthesis
           (CobT) proteins. CobT is involved in the transformation
           of precorrin-3 into cobyrinic acid.
          Length = 282

 Score = 27.9 bits (62), Expect = 0.92
 Identities = 9/49 (18%), Positives = 24/49 (48%)

Query: 57  EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
           +E E  + ++ E+E+  ++ +   + EE++        ++ +   EE E
Sbjct: 214 DEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEME 262



 Score = 25.9 bits (57), Expect = 3.9
 Identities = 9/52 (17%), Positives = 24/52 (46%)

Query: 56  TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
            E  + E+ +++++ ++ +   Q +E+E        ++     E+ E  E E
Sbjct: 216 PESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEME 267


>gnl|CDD|215544 PLN03029, PLN03029, type-a response regulator protein; Provisional.
          Length = 222

 Score = 27.7 bits (61), Expect = 0.92
 Identities = 15/54 (27%), Positives = 31/54 (57%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
           K    E +EK+E  E+ E + ++Q+Q  Q+ + Q + ++Q ++    ++K  EE
Sbjct: 151 KNQKQENQEKQEKLEESEIQSEKQEQPSQQPQSQPQPQQQPQQPNNNKRKAMEE 204


>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the monomeric yeast and human topo I
           and heterodimeric topo I from Leishmania donvanni. Topo
           I enzymes are divided into:  topo type IA (bacterial)
           and type IB (eukaryotic). Topo I relaxes superhelical
           tension in duplex DNA by creating a single-strand nick,
           the broken strand can then rotate around the unbroken
           strand to remove DNA supercoils and, the nick is
           religated, liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit re-ligation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I play putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 27.6 bits (62), Expect = 1.0
 Identities = 13/32 (40%), Positives = 22/32 (68%)

Query: 73  KQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
            Q  ++++EK++   KEE++  KEE+EK EE 
Sbjct: 91  YQYFEEEKEKKKAMSKEEKKAIKEEKEKLEEP 122


>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
          Length = 259

 Score = 27.8 bits (62), Expect = 1.0
 Identities = 8/51 (15%), Positives = 32/51 (62%)

Query: 56  TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
            + E+++E      E ++ +++++ E ++++E EE +  +++ ++  +E++
Sbjct: 72  QKPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQ 122


>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
           production and conversion].
          Length = 660

 Score = 27.7 bits (62), Expect = 1.0
 Identities = 15/56 (26%), Positives = 24/56 (42%)

Query: 51  FKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
            K L        E  EK E E K  ++  +  E+   + E+  +K EE   E +E+
Sbjct: 73  VKGLEGRLFILPEEVEKLEAELKSLEEVIKPAEKFSSEVEELTRKLEERLSELDEE 128


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 27.7 bits (61), Expect = 1.0
 Identities = 17/61 (27%), Positives = 39/61 (63%)

Query: 45  KFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
           K L +  +++    +E+EE +E+ E E+++++  ++E +E+QE+E +++  EE+    E 
Sbjct: 119 KSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEH 178

Query: 105 E 105
           E
Sbjct: 179 E 179



 Score = 27.0 bits (59), Expect = 1.7
 Identities = 15/51 (29%), Positives = 38/51 (74%)

Query: 57  EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           +E+ ++  +E+EE E++ + ++++EK +++E +E+QE++ EE+  EE+  +
Sbjct: 125 QEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEEQSDD 175



 Score = 26.6 bits (58), Expect = 2.6
 Identities = 16/58 (27%), Positives = 38/58 (65%)

Query: 50  HFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
             + +  ++ +K++     +E+ ++ +Q+++E EE+ E E ++EK +E E +EE+E+E
Sbjct: 106 QLRSVKKDDIKKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQERE 163


>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein.  Rotavirus particles
           consist of three concentric proteinaceous capsid layers.
           The innermost capsid (core) is made of VP2. The genomic
           RNA and the two minor proteins VP1 and VP3 are
           encapsidated within this layer. The N-terminus of
           rotavirus VP2 is necessary for the encapsidation of VP1
           and VP3.
          Length = 887

 Score = 28.0 bits (62), Expect = 1.1
 Identities = 12/50 (24%), Positives = 24/50 (48%), Gaps = 2/50 (4%)

Query: 60  EKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEE--EEKKEEEEKE 107
            +E      +  Q++  ++Q +K   + KE+  +KKEE   +  +   KE
Sbjct: 6   RREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKE 55



 Score = 26.4 bits (58), Expect = 3.0
 Identities = 10/39 (25%), Positives = 21/39 (53%)

Query: 65  KEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEE 103
             +E       + Q+++ E+Q +K   + K++  +KKEE
Sbjct: 5   NRREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEE 43


>gnl|CDD|222374 pfam13779, DUF4175, Domain of unknown function (DUF4175). 
          Length = 820

 Score = 27.9 bits (63), Expect = 1.1
 Identities = 10/56 (17%), Positives = 30/56 (53%), Gaps = 2/56 (3%)

Query: 51  FKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
           F++L  +E+   +  E++ ++ +  Q Q Q  ++ Q+ + QQ+ ++ +  +  +  
Sbjct: 608 FRDL--QEQFNAQRGEQQGQQGQGGQGQGQPGQQGQQGQGQQQGQQGQGGQGGQGS 661



 Score = 27.6 bits (62), Expect = 1.3
 Identities = 7/44 (15%), Positives = 23/44 (52%)

Query: 62  EEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
           ++G+  + + Q  QQ QQ + ++Q ++ +  +  +    + ++ 
Sbjct: 625 QQGQGGQGQGQPGQQGQQGQGQQQGQQGQGGQGGQGSLAERQQA 668



 Score = 26.4 bits (59), Expect = 3.0
 Identities = 6/38 (15%), Positives = 14/38 (36%)

Query: 62  EEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEE 99
           + G+  +    ++QQ  + E   Q+        +  E 
Sbjct: 653 QGGQGGQGSLAERQQALRDELGRQRGGLPGMGGEAGEA 690



 Score = 26.0 bits (58), Expect = 4.3
 Identities = 11/60 (18%), Positives = 29/60 (48%)

Query: 45  KFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
           + LQ  F     E++ ++    + + +  QQ QQ Q +++ Q+ +  Q  +    ++++ 
Sbjct: 609 RDLQEQFNAQRGEQQGQQGQGGQGQGQPGQQGQQGQGQQQGQQGQGGQGGQGSLAERQQA 668



 Score = 26.0 bits (58), Expect = 4.3
 Identities = 9/41 (21%), Positives = 22/41 (53%)

Query: 62  EEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKE 102
            +G+  ++ +Q Q QQQ Q+ +  Q  +    ++++  + E
Sbjct: 632 GQGQPGQQGQQGQGQQQGQQGQGGQGGQGSLAERQQALRDE 672


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 27.3 bits (61), Expect = 1.1
 Identities = 17/55 (30%), Positives = 34/55 (61%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
            +   +EEE+EE  E EE ++++Q  +  EKE  + K E++ + E ++K+  +E+
Sbjct: 96  LDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILKEQ 150



 Score = 24.9 bits (55), Expect = 8.1
 Identities = 18/56 (32%), Positives = 36/56 (64%), Gaps = 2/56 (3%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           K L  +++EKEE +E+E E ++  +++Q ++    EKE  + K+E+  + E ++KE
Sbjct: 92  KLLGLDKKEKEEEEEEEVEVEELDEEEQIDEL--LEKELAKLKREKRRENERKQKE 145


>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
          Length = 584

 Score = 27.7 bits (62), Expect = 1.1
 Identities = 8/43 (18%), Positives = 14/43 (32%), Gaps = 3/43 (6%)

Query: 57  EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEE 99
              +  E +        +      E EE+Q   +  E +  EE
Sbjct: 260 APPQPPEEEPPPPPPPPEDDDDPPEDEEEQ---DDAEDRALEE 299


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 27.7 bits (62), Expect = 1.3
 Identities = 15/58 (25%), Positives = 31/58 (53%)

Query: 50  HFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
             K  N   +EK E  + ++   + +++ ++E ++ + K E+QE + EE +K  EE  
Sbjct: 263 LEKLFNEALDEKFERDKIKQLASELEKKLEKELKKLENKLEKQEDELEELEKAAEELR 320



 Score = 25.8 bits (57), Expect = 5.1
 Identities = 10/47 (21%), Positives = 30/47 (63%)

Query: 57  EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEE 103
           E ++ ++   + E++ +++ ++ + K E+QE E ++ +K  EE +++
Sbjct: 276 ERDKIKQLASELEKKLEKELKKLENKLEKQEDELEELEKAAEELRQK 322


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 27.5 bits (61), Expect = 1.3
 Identities = 16/56 (28%), Positives = 34/56 (60%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           K+L  + EE E+ +  E+  QK+ +Q+   ++  ++ E+  ++ EE++K+ EE K 
Sbjct: 71  KKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKA 126



 Score = 26.7 bits (59), Expect = 2.5
 Identities = 13/50 (26%), Positives = 31/50 (62%)

Query: 56  TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
               +KE+ ++K+ E+Q ++ ++Q+  E+ ++KE +Q    E+  K+ E+
Sbjct: 60  KPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQ 109



 Score = 26.7 bits (59), Expect = 2.6
 Identities = 16/55 (29%), Positives = 35/55 (63%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
           ++ N  +++K+   +KE+E QK+ +QQ +E E+Q+  E+ ++K+ E+    E+  
Sbjct: 50  QQANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAA 104



 Score = 25.2 bits (55), Expect = 9.1
 Identities = 12/53 (22%), Positives = 32/53 (60%)

Query: 55  NTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
             +E +K+  ++ EE E+++  +Q ++KE +Q    ++  K+ E+  ++ E++
Sbjct: 65  KEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEK 117


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 27.8 bits (61), Expect = 1.3
 Identities = 19/49 (38%), Positives = 35/49 (71%)

Query: 57   EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
            E+++K E  +K EE++K+  +  +++ E+ +K E+ +KKE EEKK+ EE
Sbjct: 1672 EDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEE 1720



 Score = 27.4 bits (60), Expect = 1.8
 Identities = 16/56 (28%), Positives = 34/56 (60%)

Query: 52   KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
            K     + +  E  +K EE++K+ ++ ++ +E++++  E  +K+ EE KK EE K+
Sbjct: 1654 KAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKK 1709



 Score = 27.0 bits (59), Expect = 2.6
 Identities = 20/49 (40%), Positives = 36/49 (73%)

Query: 57   EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
            E ++ EE K+KE EE+K+ ++ ++ +EE + K E+ +K+ EE+KK+ EE
Sbjct: 1700 EAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEE 1748



 Score = 26.6 bits (58), Expect = 3.5
 Identities = 19/55 (34%), Positives = 35/55 (63%)

Query: 52   KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
            K+   EE++K E  +K EEE K +  ++ +K E+ +K+ ++ KK EE++K+  E 
Sbjct: 1639 KKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEA 1693



 Score = 26.3 bits (57), Expect = 4.5
 Identities = 20/54 (37%), Positives = 38/54 (70%)

Query: 52   KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
            K+   E+++ E+ K+KE EE+K+ ++ ++ +EE + K  ++ KK EE+KK+ EE
Sbjct: 1626 KKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEE 1679



 Score = 25.1 bits (54), Expect = 9.5
 Identities = 18/56 (32%), Positives = 37/56 (66%)

Query: 52   KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
             E    EE+ E  ++K+EE +K+    +++ EE+++ +E ++K EE++KK +E K+
Sbjct: 1357 DEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKK 1412


>gnl|CDD|224217 COG1298, FlhA, Flagellar biosynthesis pathway, component FlhA [Cell
           motility and secretion / Intracellular trafficking and
           secretion].
          Length = 696

 Score = 27.6 bits (62), Expect = 1.3
 Identities = 12/29 (41%), Positives = 23/29 (79%)

Query: 71  EQKQQQQQQQEKEEQQEKEEQQEKKEEEE 99
             K+++QQ +EKE+  E+ +++E++EEEE
Sbjct: 323 LSKRKEQQAEEKEKPAEEAKKEEEEEEEE 351



 Score = 25.7 bits (57), Expect = 6.9
 Identities = 10/31 (32%), Positives = 25/31 (80%)

Query: 74  QQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
           ++++QQ E++E+  +E ++E++EEEE+  ++
Sbjct: 325 KRKEQQAEEKEKPAEEAKKEEEEEEEESVDD 355


>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
           [Intracellular trafficking and secretion].
          Length = 776

 Score = 27.6 bits (61), Expect = 1.3
 Identities = 14/56 (25%), Positives = 24/56 (42%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           K  N E E K+   +      ++Q       E+ + +    E+ E EE+ E+ E E
Sbjct: 340 KWGNEEAERKDYILDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEEEVEDYEDE 395


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 27.3 bits (61), Expect = 1.4
 Identities = 19/59 (32%), Positives = 32/59 (54%), Gaps = 3/59 (5%)

Query: 51  FKELNTEEEEKEEGKEKEEE---EQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
             E   E ++K   K +E+E   E+ +Q   ++ KE   + EE Q+K E+ EK+ E+ K
Sbjct: 311 LNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLEKRLEKLK 369



 Score = 26.5 bits (59), Expect = 2.5
 Identities = 15/54 (27%), Positives = 31/54 (57%)

Query: 53  ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
           E + +E  +    E+E E +K+  ++ +EKE++ E+ EQ   +E +E   + E+
Sbjct: 300 EPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELNSKLEE 353


>gnl|CDD|236356 PRK08944, motB, flagellar motor protein MotB; Reviewed.
          Length = 302

 Score = 27.3 bits (61), Expect = 1.4
 Identities = 14/49 (28%), Positives = 26/49 (53%)

Query: 57  EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
           ++  K+ G E   E + ++   Q++ E+  +K  QQ +KE E+   E E
Sbjct: 119 QQRGKQRGGEASAEVKAEKAAAQEQIEDLVKKLAQQLEKEIEDGAIEIE 167



 Score = 25.7 bits (57), Expect = 4.6
 Identities = 11/49 (22%), Positives = 24/49 (48%), Gaps = 3/49 (6%)

Query: 58  EEEKEEGKEKEEEEQ---KQQQQQQQEKEEQQEKEEQQEKKEEEEKKEE 103
           E +   G +++  +Q   +   + + EK   QE+ E   KK  ++ ++E
Sbjct: 110 ESDSAGGTQQQRGKQRGGEASAEVKAEKAAAQEQIEDLVKKLAQQLEKE 158



 Score = 25.4 bits (56), Expect = 6.4
 Identities = 11/49 (22%), Positives = 24/49 (48%)

Query: 56  TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
           T+++  ++   +   E K ++   QE+ E   K+  Q+ ++E E    E
Sbjct: 117 TQQQRGKQRGGEASAEVKAEKAAAQEQIEDLVKKLAQQLEKEIEDGAIE 165


>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32.  This family
           consists of several Mycoplasma species specific
           Cytadhesin P32 and P30 proteins. P30 has been found to
           be membrane associated and localised on the tip
           organelle. It is thought that it is important in
           cytadherence and virulence.
          Length = 279

 Score = 27.3 bits (60), Expect = 1.4
 Identities = 11/48 (22%), Positives = 28/48 (58%)

Query: 60  EKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           +++E +  EE+E+++Q  +Q ++  +Q +++  E    EE   +E  +
Sbjct: 96  KRKEKRLLEEKERQEQMAEQLQRISEQNEQQAIEIDPTEEVNTQEPTQ 143



 Score = 27.0 bits (59), Expect = 2.3
 Identities = 12/48 (25%), Positives = 26/48 (54%)

Query: 57  EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
           + +EK   +EKE +EQ  +Q Q+  ++ +Q+  E    +E   ++  +
Sbjct: 96  KRKEKRLLEEKERQEQMAEQLQRISEQNEQQAIEIDPTEEVNTQEPTQ 143


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 27.3 bits (61), Expect = 1.5
 Identities = 13/30 (43%), Positives = 23/30 (76%)

Query: 78  QQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           Q+ E+EEQ+++EE++E++E EE + EE   
Sbjct: 346 QKDEEEEQEDEEEEEEEEEPEEPEPEEGPP 375


>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein.  Hsp70 chaperones help to fold
           many proteins. Hsp70 assisted folding involves repeated
           cycles of substrate binding and release. Hsp70 activity
           is ATP dependent. Hsp70 proteins are made up of two
           regions: the amino terminus is the ATPase domain and the
           carboxyl terminus is the substrate binding region.
          Length = 598

 Score = 27.2 bits (61), Expect = 1.5
 Identities = 15/62 (24%), Positives = 31/62 (50%), Gaps = 9/62 (14%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQE------KKEEEEKKEEEE 105
             L+ +E E+     K+ EE   + ++++E+ E + + E+         KEE +K  E +
Sbjct: 499 SGLSDDEIER---MVKDAEEYAAEDKKRKERIEAKNEAEEYVYSLEKSLKEEGDKLPEAD 555

Query: 106 KE 107
           K+
Sbjct: 556 KK 557



 Score = 27.2 bits (61), Expect = 1.7
 Identities = 10/52 (19%), Positives = 24/52 (46%), Gaps = 2/52 (3%)

Query: 54  LNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
           L    ++K  GKE  ++          + E ++  ++ +E   E++K++E  
Sbjct: 478 LTVSAKDKGTGKE--QKITITASSGLSDDEIERMVKDAEEYAAEDKKRKERI 527


>gnl|CDD|227084 COG4741, COG4741, Predicted secreted endonuclease distantly related
           to archaeal Holliday junction resolvase [Nucleotide
           transport and metabolism].
          Length = 175

 Score = 27.0 bits (60), Expect = 1.6
 Identities = 16/46 (34%), Positives = 26/46 (56%)

Query: 62  EEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           E    + EE  ++ ++++   E Q  KEE+ + KE  EKK EE +E
Sbjct: 32  ESKARELEETLQKAERERLVNEAQARKEEEWKLKEWIEKKIEEARE 77


>gnl|CDD|224114 COG1193, COG1193, Mismatch repair ATPase (MutS family) [DNA
           replication, recombination, and repair].
          Length = 753

 Score = 27.3 bits (61), Expect = 1.6
 Identities = 11/51 (21%), Positives = 16/51 (31%)

Query: 55  NTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
             EE  KE  +E EE E+   + +         K    E K +        
Sbjct: 512 KLEEVRKELEEELEEVEKLLDEVELLTGANSGGKTSLLELKAQIVVLAHMG 562


>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 926

 Score = 27.2 bits (60), Expect = 1.7
 Identities = 11/43 (25%), Positives = 20/43 (46%)

Query: 65  KEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           K     +QK +++   +KE + E    + + E    K+ EEK 
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKG 834



 Score = 26.8 bits (59), Expect = 2.8
 Identities = 10/51 (19%), Positives = 22/51 (43%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKE 102
           KE    ++E +      E E ++   +Q+E++          KK++  K +
Sbjct: 802 KEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYKVD 852



 Score = 26.8 bits (59), Expect = 2.9
 Identities = 11/52 (21%), Positives = 24/52 (46%)

Query: 54  LNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
           L+T  +     K+K +E+    ++ + E    + + E+   K+ EEK  +  
Sbjct: 787 LSTTIKAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAP 838



 Score = 25.3 bits (55), Expect = 8.8
 Identities = 14/83 (16%), Positives = 35/83 (42%)

Query: 24  PLLESFGNAKTVRNDNSSRFGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKE 83
           P +    + K+VR  N++     ++   +     +E+    KE + E    + + ++   
Sbjct: 768 PRMYEEASGKSVRKVNTAVLSTTIKAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTI 827

Query: 84  EQQEKEEQQEKKEEEEKKEEEEK 106
           +Q+E++          KK++  K
Sbjct: 828 KQREEKGIDAPAILNVKKKKPYK 850


>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
           Provisional.
          Length = 695

 Score = 27.2 bits (61), Expect = 1.7
 Identities = 11/54 (20%), Positives = 27/54 (50%), Gaps = 9/54 (16%)

Query: 60  EKEEGKEKEEEEQK--------QQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
            K E +  E+E++K        + +Q + E+E+   +E + +K  E    ++++
Sbjct: 434 AKAEIRAIEQEKKKAEEAKARFEARQARLEREKA-AREARHKKAAEARAAKDKD 486


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 26.9 bits (60), Expect = 1.7
 Identities = 12/51 (23%), Positives = 34/51 (66%)

Query: 57  EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           EEE++E+ +EK +   K + ++  + + +++++ ++EK+E+  ++ EE+  
Sbjct: 39  EEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTP 89


>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
            This model family represents the major capsid protein
           component of the heads (capsids) of bacteriophage HK97,
           phi-105, P27, and related phage. This model represents
           one of several analogous families lacking detectable
           sequence similarity. The gene encoding this component is
           typically located in an operon encoding the small and
           large terminase subunits, the portal protein and the
           prohead or maturation protease [Mobile and
           extrachromosomal element functions, Prophage functions].
          Length = 384

 Score = 27.3 bits (61), Expect = 1.7
 Identities = 15/66 (22%), Positives = 28/66 (42%), Gaps = 5/66 (7%)

Query: 47  LQVHFKELNTEEEEKEEGKEKEEEEQKQQ-----QQQQQEKEEQQEKEEQQEKKEEEEKK 101
           L    ++L  +E+  E  +EK E +  ++       +    EE  ++ E +     E   
Sbjct: 11  LAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASGEGGG 70

Query: 102 EEEEKE 107
            EEE+E
Sbjct: 71  GEEEEE 76



 Score = 26.2 bits (58), Expect = 3.9
 Identities = 13/61 (21%), Positives = 27/61 (44%), Gaps = 4/61 (6%)

Query: 51  FKELNTEEEEKEEGKEKEE----EEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
            K    EEE+ E    KEE    + +  + ++  ++ E +     +    EEE++E + +
Sbjct: 22  EKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASGEGGGGEEEEEEAKAE 81

Query: 107 E 107
            
Sbjct: 82  A 82


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
            biogenesis [Translation, ribosomal structure and
            biogenesis].
          Length = 1077

 Score = 27.4 bits (60), Expect = 1.8
 Identities = 16/46 (34%), Positives = 30/46 (65%)

Query: 62   EEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
             E +EK E + +  +++ +++EE++  E  Q  KEEE  K+E+E+E
Sbjct: 1009 PECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKEKERE 1054



 Score = 27.0 bits (59), Expect = 2.5
 Identities = 11/43 (25%), Positives = 27/43 (62%)

Query: 65   KEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
            + K++EE+++ +  Q+ KEE+  K+E++ ++   +   +  KE
Sbjct: 1025 RIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKE 1067



 Score = 26.2 bits (57), Expect = 4.4
 Identities = 19/93 (20%), Positives = 37/93 (39%)

Query: 15  ITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQ 74
           +  +  + + L    G  + +++D    F           N  EE +      E EE   
Sbjct: 576 LKSRFQKDATLDSIEGEEELIQDDEKGNFEDLEDEENSSDNEMEESRGSSVTAENEESAD 635

Query: 75  QQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           +   + E+EE   K+E+     E E++ + EK+
Sbjct: 636 EVDYETEREENARKKEELRGNFELEERGDPEKK 668



 Score = 25.9 bits (56), Expect = 5.5
 Identities = 13/42 (30%), Positives = 28/42 (66%), Gaps = 1/42 (2%)

Query: 61   KEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKE 102
            KE  K++EE+E+ +  Q+ +E EE  +KE+++E++  +   +
Sbjct: 1023 KERIKDQEEKERMESLQRAKE-EEIGKKEKEREQRIRKTIHD 1063


>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
           ATPase domains [General function prediction only].
          Length = 530

 Score = 27.2 bits (61), Expect = 1.8
 Identities = 12/63 (19%), Positives = 25/63 (39%), Gaps = 13/63 (20%)

Query: 58  EEEKEEGKEKEEEEQKQQQQQQQE-------------KEEQQEKEEQQEKKEEEEKKEEE 104
            E+K E   +E    ++QQ++  +             K ++ +   ++ +K E    EE 
Sbjct: 239 LEQKAERLRQEAAAYEKQQKELAKEQEWIRRGKAAASKAKKAKSRIKRLEKLEARLAEER 298

Query: 105 EKE 107
             E
Sbjct: 299 PVE 301


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 27.3 bits (61), Expect = 1.9
 Identities = 16/56 (28%), Positives = 30/56 (53%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           KE   E   K E +     E K+   ++Q + E + ++E+  KK EE+ ++E E++
Sbjct: 195 KEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEEKLRQELERQ 250


>gnl|CDD|237560 PRK13922, PRK13922, rod shape-determining protein MreC;
           Provisional.
          Length = 276

 Score = 26.9 bits (60), Expect = 1.9
 Identities = 9/50 (18%), Positives = 26/50 (52%)

Query: 41  SRFGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEE 90
           +   +F+   F+ L +  + +EE +E ++E  + + + Q+ ++ + E   
Sbjct: 55  NAPREFVSGVFESLASLFDLREENEELKKELLELESRLQELEQLEAENAR 104


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 27.0 bits (60), Expect = 1.9
 Identities = 12/48 (25%), Positives = 23/48 (47%)

Query: 58  EEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
             E+ E      E++ QQ +  Q  EE+++   Q   +   ++K+ EE
Sbjct: 181 AAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEE 228


>gnl|CDD|221268 pfam11853, DUF3373, Protein of unknown function (DUF3373).  This
           family of proteins are functionally uncharacterized.
           This protein is found in bacteria. Proteins in this
           family are typically between 472 to 574 amino acids in
           length.
          Length = 485

 Score = 27.1 bits (60), Expect = 1.9
 Identities = 6/34 (17%), Positives = 18/34 (52%)

Query: 70  EEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEE 103
           +    Q+ +  +KE  + K + ++ ++  +K E+
Sbjct: 26  DIDLLQKIEALKKELAELKAQLKDLQKRVDKTEK 59


>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
          Length = 1627

 Score = 27.1 bits (60), Expect = 1.9
 Identities = 11/33 (33%), Positives = 25/33 (75%)

Query: 75  QQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           ++ +++++E+ +EK E+ + +E +E+ EEEEK 
Sbjct: 277 KELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKG 309


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 27.1 bits (60), Expect = 2.0
 Identities = 18/55 (32%), Positives = 30/55 (54%)

Query: 53  ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
             N E  + E  + K+EEE+   + ++ EKE+     E  E +EE+E+ E EE +
Sbjct: 62  ISNYEALDSELDELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLENEELQ 116


>gnl|CDD|234172 TIGR03345, VI_ClpV1, type VI secretion ATPase, ClpV1 family.
           Members of this protein family are homologs of ClpB, an
           ATPase associated with chaperone-related functions.
           These ClpB homologs, designated ClpV1, are a key
           component of the bacterial pathogenicity-associated type
           VI secretion system [Protein fate, Protein and peptide
           secretion and trafficking, Cellular processes,
           Pathogenesis].
          Length = 852

 Score = 27.2 bits (61), Expect = 2.0
 Identities = 10/49 (20%), Positives = 19/49 (38%)

Query: 58  EEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
            E + E    E E    + + QQEKE  +     + + E +     ++ 
Sbjct: 453 AELRAELAALEAELAALEARWQQEKELVEAILALRAELEADADAPADDD 501


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 27.0 bits (60), Expect = 2.1
 Identities = 14/69 (20%), Positives = 39/69 (56%), Gaps = 1/69 (1%)

Query: 40  SSRFGKFLQVHFKELNTEEEEKEEGKEKE-EEEQKQQQQQQQEKEEQQEKEEQQEKKEEE 98
           S  +   L +  + L+T  EE+ +      ++++K+ +  + +K+ +++K++++EKKE +
Sbjct: 25  SYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPK 84

Query: 99  EKKEEEEKE 107
            + E +   
Sbjct: 85  SEGETKLGF 93



 Score = 25.4 bits (56), Expect = 8.0
 Identities = 14/75 (18%), Positives = 36/75 (48%), Gaps = 14/75 (18%)

Query: 47  LQVHFKELNTEEEEKE--------------EGKEKEEEEQKQQQQQQQEKEEQQEKEEQQ 92
           + + F    T++ + +              E      EE+ +      +K+++++K  + 
Sbjct: 7   MILSFFSGTTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNES 66

Query: 93  EKKEEEEKKEEEEKE 107
           +KK E++KK+++EK+
Sbjct: 67  KKKSEKKKKKKKEKK 81


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 26.7 bits (59), Expect = 2.3
 Identities = 8/56 (14%), Positives = 26/56 (46%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           +     E    + KEK + E+K ++ + + K + + K + + + + ++   +   +
Sbjct: 72  EPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTAAK 127



 Score = 25.5 bits (56), Expect = 5.7
 Identities = 10/51 (19%), Positives = 24/51 (47%)

Query: 56  TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
            E E  EE  +   E +   +    + +E+ + E++ +K + + K + + K
Sbjct: 58  PEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPK 108


>gnl|CDD|224340 COG1422, COG1422, Predicted membrane protein [Function unknown].
          Length = 201

 Score = 26.5 bits (59), Expect = 2.5
 Identities = 13/42 (30%), Positives = 24/42 (57%), Gaps = 2/42 (4%)

Query: 64  GKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
            +EK +E QK  ++ Q+E  E Q  E    KK ++ ++++ E
Sbjct: 70  DQEKMKELQKMMKEFQKEFREAQ--ESGDMKKLKKLQEKQME 109



 Score = 25.4 bits (56), Expect = 6.2
 Identities = 12/37 (32%), Positives = 22/37 (59%), Gaps = 2/37 (5%)

Query: 73  KQQQQQQQEKEEQQEKEEQQEKKEEE--EKKEEEEKE 107
           K ++ Q+  KE Q+E  E QE  + +  +K +E++ E
Sbjct: 73  KMKELQKMMKEFQKEFREAQESGDMKKLKKLQEKQME 109


>gnl|CDD|215443 PLN02829, PLN02829, Probable galacturonosyltransferase.
          Length = 639

 Score = 26.7 bits (59), Expect = 2.5
 Identities = 9/42 (21%), Positives = 18/42 (42%)

Query: 55  NTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKE 96
             +  + +   + E  E  QQ  Q  EK +++E    +  K+
Sbjct: 133 QEKNSQSQSASQAESLEHVQQSAQTSEKVDEKEPLLTKTDKQ 174



 Score = 26.0 bits (57), Expect = 5.2
 Identities = 9/40 (22%), Positives = 22/40 (55%)

Query: 66  EKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
            +++ +Q+++  Q Q   + +  E  Q+  +  EK +E+E
Sbjct: 126 GQDDSDQQEKNSQSQSASQAESLEHVQQSAQTSEKVDEKE 165


>gnl|CDD|222948 PHA02941, PHA02941, hypothetical protein; Provisional.
          Length = 356

 Score = 26.9 bits (59), Expect = 2.5
 Identities = 15/43 (34%), Positives = 28/43 (65%)

Query: 64  GKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
           G E++E +Q+ Q+Q        +E  E+ ++++EEE+ EEEE+
Sbjct: 307 GGEEKEPKQESQEQLFNPFAIDEEMLEETQEQQEEEENEEEEE 349



 Score = 26.1 bits (57), Expect = 4.8
 Identities = 22/62 (35%), Positives = 34/62 (54%), Gaps = 9/62 (14%)

Query: 43  FGKFLQVHFKELNTEEEEKEEGKEKEE---------EEQKQQQQQQQEKEEQQEKEEQQE 93
           F + ++V  K+L    EEKE  +E +E         EE  ++ Q+QQE+EE +E+EE   
Sbjct: 293 FQEMVKVAAKDLILGGEEKEPKQESQEQLFNPFAIDEEMLEETQEQQEEEENEEEEENDT 352

Query: 94  KK 95
            K
Sbjct: 353 VK 354


>gnl|CDD|236154 PRK08119, PRK08119, flagellar motor switch protein; Validated.
          Length = 382

 Score = 26.4 bits (59), Expect = 2.9
 Identities = 15/51 (29%), Positives = 27/51 (52%), Gaps = 3/51 (5%)

Query: 57  EEEEKEEGKEKEEEEQKQQQ---QQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
           EEEE+EE  E+EE +          Q     +QE+++   +++E EK+ + 
Sbjct: 226 EEEEEEEEVEEEEAQASPAAEPATAQAAPAPKQEQQQAPPQRQEPEKEAQP 276



 Score = 26.0 bits (58), Expect = 4.7
 Identities = 9/43 (20%), Positives = 20/43 (46%)

Query: 56  TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEE 98
            EEEE +     E    +     +QE+++   + ++ EK+ + 
Sbjct: 234 VEEEEAQASPAAEPATAQAAPAPKQEQQQAPPQRQEPEKEAQP 276


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
           Transcription of the anti-viral guanylate-binding
           protein (GBP) is induced by interferon-gamma during
           macrophage induction. This family contains GBP1 and
           GPB2, both GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 26.5 bits (59), Expect = 2.9
 Identities = 14/55 (25%), Positives = 32/55 (58%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
           + L  +E+  E  + K E  + +Q+  +++++E+++  E QE+  +E  K+  EK
Sbjct: 197 QALTAKEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEK 251



 Score = 26.1 bits (58), Expect = 3.6
 Identities = 16/50 (32%), Positives = 29/50 (58%)

Query: 57  EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
            E  K E  E E+E  +++Q+++++  E QE+  Q+  K+  EK E E +
Sbjct: 208 AERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAERE 257


>gnl|CDD|218738 pfam05766, NinG, Bacteriophage Lambda NinG protein.  NinG or Rap is
           involved in recombination. Rap (recombination adept with
           plasmid) increases lambda-by-plasmid recombination
           catalyzed by Escherichia coli's RecBCD pathway.
          Length = 188

 Score = 26.1 bits (58), Expect = 2.9
 Identities = 9/36 (25%), Positives = 24/36 (66%)

Query: 72  QKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           +++ Q+++++ E Q E+ E + +KE+ + + +  KE
Sbjct: 36  REKAQEKKRKAEAQAERRELKARKEKLKTRSDWLKE 71


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 26.5 bits (58), Expect = 3.0
 Identities = 8/51 (15%), Positives = 31/51 (60%)

Query: 57  EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           +E E +E  ++ ++ +++  ++Q + ++ Q+K +  +   ++++ E  +K+
Sbjct: 204 KERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQ 254


>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB proteins similar to the
           monomeric yeast and human topo I.  Topo I enzymes are
           divided into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 26.1 bits (58), Expect = 3.1
 Identities = 13/51 (25%), Positives = 29/51 (56%)

Query: 57  EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
            +EEK   K+  + +  Q     + ++E+++   ++EKK  + +KE+ E+E
Sbjct: 72  TKEEKVIIKDFSKCDFTQMFAYFKAQKEEKKAMSKEEKKAIKAEKEKLEEE 122


>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
           component YidC; Validated.
          Length = 429

 Score = 26.4 bits (58), Expect = 3.1
 Identities = 10/35 (28%), Positives = 17/35 (48%), Gaps = 1/35 (2%)

Query: 74  QQQQQQQEKEEQQEK-EEQQEKKEEEEKKEEEEKE 107
             +  + +K    EK E +  KKE  +K+   E+E
Sbjct: 326 HAENAEIKKTRTAEKNEAKARKKEIAQKRRAAERE 360



 Score = 26.4 bits (58), Expect = 3.6
 Identities = 8/51 (15%), Positives = 26/51 (50%)

Query: 56  TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
           T E+ + + ++KE  ++++  +++  +E +QE+     +        + +K
Sbjct: 337 TAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKK 387



 Score = 25.6 bits (56), Expect = 6.0
 Identities = 5/45 (11%), Positives = 21/45 (46%)

Query: 59  EEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEE 103
           E K+    ++ E + ++++  Q++   + +  ++ ++E       
Sbjct: 331 EIKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMAR 375



 Score = 25.2 bits (55), Expect = 9.5
 Identities = 8/40 (20%), Positives = 22/40 (55%)

Query: 68  EEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           E  E K+ +  ++ + + ++KE  Q+++  E +   E ++
Sbjct: 328 ENAEIKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQ 367


>gnl|CDD|178635 PLN03086, PLN03086, PRLI-interacting factor K; Provisional.
          Length = 567

 Score = 26.4 bits (58), Expect = 3.3
 Identities = 12/39 (30%), Positives = 25/39 (64%)

Query: 60 EKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEE 98
          E    +EK E EQ++++Q+ + K E++ K +++  K+ E
Sbjct: 4  ELRRAREKLEREQRERKQRAKLKLERERKAKEEAAKQRE 42


>gnl|CDD|227441 COG5110, RPN1, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 881

 Score = 26.4 bits (58), Expect = 3.3
 Identities = 16/41 (39%), Positives = 26/41 (63%), Gaps = 2/41 (4%)

Query: 67  KEEEEQKQQ--QQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
            +E ++KQQ   +Q Q   E+Q   ++ +KKEEEE+  EE+
Sbjct: 2   VDESDKKQQTIDEQSQISPEKQTPNKKDKKKEEEEQLSEED 42


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
           Provisional.
          Length = 2849

 Score = 26.5 bits (58), Expect = 3.3
 Identities = 10/39 (25%), Positives = 25/39 (64%)

Query: 68  EEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
            EE+   +     + +++ E E+  +++++EE++EEEE+
Sbjct: 141 AEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEE 179



 Score = 26.2 bits (57), Expect = 4.4
 Identities = 11/53 (20%), Positives = 35/53 (66%)

Query: 54  LNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
           ++ ++E+++E  + EE+++++++++++ K    E EE +  ++   +K E +K
Sbjct: 153 IDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYEKSEVDK 205



 Score = 25.8 bits (56), Expect = 6.9
 Identities = 13/56 (23%), Positives = 30/56 (53%)

Query: 50  HFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
           H  E +    +     + +E+E +    ++ ++EE++E+EE +   +E+E+ E  E
Sbjct: 139 HLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGE 194



 Score = 25.4 bits (55), Expect = 8.2
 Identities = 16/50 (32%), Positives = 29/50 (58%)

Query: 53  ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKE 102
           E   E+++ EE  E+EEEE+++ +    E EE +  E+   +K E +K +
Sbjct: 158 EDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYEKSEVDKTD 207


>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
          Length = 135

 Score = 25.8 bits (57), Expect = 3.3
 Identities = 12/35 (34%), Positives = 27/35 (77%), Gaps = 2/35 (5%)

Query: 70  EEQK--QQQQQQQEKEEQQEKEEQQEKKEEEEKKE 102
           EEQK  +Q  +++ K+++Q+K+++++KK++  KK 
Sbjct: 96  EEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKA 130


>gnl|CDD|224712 COG1799, COG1799, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 167

 Score = 26.2 bits (58), Expect = 3.4
 Identities = 9/57 (15%), Positives = 22/57 (38%), Gaps = 4/57 (7%)

Query: 51  FKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           F      ++E++      EE+ +Q+ +    + E+    +   +K E  + E     
Sbjct: 10  FFGYFPLDDEEDY----YEEDPRQEPRDPAVEPEEYAPPKYNFRKREPARMEMRGNV 62


>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
           (DUF2058).  This domain, found in various prokaryotic
           proteins, has no known function.
          Length = 177

 Score = 26.0 bits (58), Expect = 3.6
 Identities = 10/51 (19%), Positives = 31/51 (60%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKE 102
           KE   + ++  +G +  ++E KQ  ++ + ++ ++++E  ++++ E E+K 
Sbjct: 22  KEKRKQRKQARKGADDGDDELKQAAEEAKAEKAERDRELNRQRQAEAEQKA 72


>gnl|CDD|234032 TIGR02855, spore_yabG, sporulation peptidase YabG.  Members of this
           family are the protein YabG, demonstrated for Bacillus
           subtilis to be an endopeptidase able to release
           N-terminal peptides from a number of sporulation
           proteins, including CotT, CotF, and SpoIVA. It appears
           to be expressed under control of sigma-K [Cellular
           processes, Sporulation and germination].
          Length = 283

 Score = 26.2 bits (58), Expect = 3.6
 Identities = 11/40 (27%), Positives = 24/40 (60%)

Query: 68  EEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           +E E+K+++++ +E+ E   +  +Q+ K  +EKKE     
Sbjct: 53  DESERKKRKKRFKERMETSYRLFRQDYKLMKEKKEYRATG 92


>gnl|CDD|224705 COG1792, MreC, Cell shape-determining protein [Cell envelope
           biogenesis, outer membrane].
          Length = 284

 Score = 26.2 bits (58), Expect = 3.7
 Identities = 14/57 (24%), Positives = 25/57 (43%), Gaps = 3/57 (5%)

Query: 43  FGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEE 99
             +FL+         EE K+E  E    EQ  ++ +  E+E ++ KE    K+   +
Sbjct: 61  VLEFLKSLKDLALENEELKKELAEL---EQLLEEVESLEEENKRLKELLDFKESSSD 114


>gnl|CDD|236544 PRK09506, mrcB, bifunctional glycosyl transferase/transpeptidase;
           Reviewed.
          Length = 830

 Score = 26.3 bits (58), Expect = 3.7
 Identities = 6/28 (21%), Positives = 17/28 (60%)

Query: 71  EQKQQQQQQQEKEEQQEKEEQQEKKEEE 98
           +Q + QQQ  + ++Q +++  ++K  + 
Sbjct: 792 QQSEMQQQPSQPQQQPQQQPAEQKDSDG 819



 Score = 25.9 bits (57), Expect = 5.9
 Identities = 8/28 (28%), Positives = 16/28 (57%)

Query: 62  EEGKEKEEEEQKQQQQQQQEKEEQQEKE 89
           ++ + +++  Q QQQ QQQ  E++    
Sbjct: 792 QQSEMQQQPSQPQQQPQQQPAEQKDSDG 819



 Score = 25.5 bits (56), Expect = 6.8
 Identities = 7/27 (25%), Positives = 17/27 (62%)

Query: 74  QQQQQQQEKEEQQEKEEQQEKKEEEEK 100
           QQ + QQ+  + Q++ +QQ  ++++  
Sbjct: 792 QQSEMQQQPSQPQQQPQQQPAEQKDSD 818


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 26.2 bits (58), Expect = 3.8
 Identities = 15/56 (26%), Positives = 35/56 (62%)

Query: 52   KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
            ++L+  EE  EE +E EE+E  ++Q+ + + + +  K  + + K++E+KK++   +
Sbjct: 1132 EDLDKFEEALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSAD 1187


>gnl|CDD|222259 pfam13608, Potyvirid-P3, Protein P3 of Potyviral polyprotein.  This
           is the P3 protein section of the Potyviridae
           polyproteins. The function is not known except that the
           protein is essential to viral survival.
          Length = 445

 Score = 26.0 bits (58), Expect = 4.0
 Identities = 10/36 (27%), Positives = 21/36 (58%), Gaps = 1/36 (2%)

Query: 70  EEQKQQQQQQQEKEEQQ-EKEEQQEKKEEEEKKEEE 104
           + +++  ++++EKEE++  K     K E +   EEE
Sbjct: 274 KLKREAAEEKEEKEEKEIRKLYLYSKLEGKLPTEEE 309


>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1.  All
           proteins in this family for which functions are known
           are cyclin dependent protein kinases that are components
           of TFIIH, a complex that is involved in nucleotide
           excision repair and transcription initiation. Also known
           as MAT1 (menage a trois 1). This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 309

 Score = 25.9 bits (57), Expect = 4.0
 Identities = 11/41 (26%), Positives = 28/41 (68%)

Query: 67  KEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           +E+EE ++  + ++E+EEQ+    Q+E++E++  K + ++ 
Sbjct: 142 REQEELEEALEFEKEEEEQRRLLLQKEEEEQQMNKRKNKQA 182


>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
           [Transcription / DNA replication, recombination, and
           repair / Chromatin structure and dynamics].
          Length = 1001

 Score = 26.1 bits (57), Expect = 4.1
 Identities = 13/54 (24%), Positives = 23/54 (42%)

Query: 54  LNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
             + ++E +E  E EE ++  +   + E E     EE  E  +E E K   +  
Sbjct: 942 EASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDELESKAAYDSR 995


>gnl|CDD|218660 pfam05620, DUF788, Protein of unknown function (DUF788).  This
           family consists of several eukaryotic proteins of
           unknown function.
          Length = 166

 Score = 25.8 bits (57), Expect = 4.1
 Identities = 7/22 (31%), Positives = 14/22 (63%)

Query: 74  QQQQQQQEKEEQQEKEEQQEKK 95
           Q  +  + K ++QEK E++ +K
Sbjct: 145 QGAETNETKSKRQEKLEKRGEK 166


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 26.1 bits (58), Expect = 4.6
 Identities = 12/61 (19%), Positives = 31/61 (50%), Gaps = 4/61 (6%)

Query: 51  FKELNTEEEEKEEGKEKEEEEQKQQQQQQ----QEKEEQQEKEEQQEKKEEEEKKEEEEK 106
           +++L  ++++ EE  E++EE  K +  +      + ++   +E   E++    + EE+  
Sbjct: 120 YRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLERRRRRREWEEKRA 179

Query: 107 E 107
           E
Sbjct: 180 E 180



 Score = 25.7 bits (57), Expect = 6.6
 Identities = 13/55 (23%), Positives = 28/55 (50%), Gaps = 2/55 (3%)

Query: 53  ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           E + E+EE  + ++ E+++          +E   E+  ++ ++E EEK+ E E  
Sbjct: 132 ESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLER--RRRRREWEEKRAELEFY 184


>gnl|CDD|235449 PRK05415, PRK05415, hypothetical protein; Provisional.
          Length = 341

 Score = 26.0 bits (58), Expect = 4.7
 Identities = 14/40 (35%), Positives = 22/40 (55%)

Query: 66  EKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
           E  EEE++ + + QQ  +EQ+        +EEEE+ E E 
Sbjct: 14  EPLEEEEEPELRAQQAFDEQEAFAPAAPDEEEEEEGELEA 53


>gnl|CDD|235396 PRK05299, rpsB, 30S ribosomal protein S2; Provisional.
          Length = 258

 Score = 25.9 bits (58), Expect = 4.8
 Identities = 12/36 (33%), Positives = 26/36 (72%)

Query: 63  EGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEE 98
           EG++    E  ++++++ E+EE++E+EE+ E+ E E
Sbjct: 223 EGRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258


>gnl|CDD|237028 PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed.
          Length = 648

 Score = 25.9 bits (58), Expect = 5.3
 Identities = 10/28 (35%), Positives = 16/28 (57%)

Query: 80  QEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           +E    +E+ E    KE EEK+++ EK 
Sbjct: 514 EEIAYIKEQMEGSAPKEPEEKEKKPEKP 541



 Score = 25.5 bits (57), Expect = 5.8
 Identities = 9/32 (28%), Positives = 19/32 (59%)

Query: 59  EEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEE 90
           + +EE    +E+ +    ++ +EKE++ EK E
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542



 Score = 25.5 bits (57), Expect = 6.4
 Identities = 8/32 (25%), Positives = 19/32 (59%)

Query: 66  EKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEE 97
           + EEE    ++Q +    ++ E++E++ +K E
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542


>gnl|CDD|150747 pfam10107, Endonuc_Holl, Endonuclease related to archaeal Holliday
           junction resolvase.  This domain is found in various
           predicted bacterial endonucleases which are distantly
           related to archaeal Holliday junction resolvases.
          Length = 153

 Score = 25.4 bits (56), Expect = 5.5
 Identities = 9/45 (20%), Positives = 26/45 (57%)

Query: 60  EKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
            K E + +E  EQ +  +++ +   + E + ++ +K+EE++  ++
Sbjct: 18  GKVEKRARELFEQWRNAEREAQAMREAEAKFEEWQKKEEKEIRKD 62


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 25.5 bits (56), Expect = 5.7
 Identities = 11/48 (22%), Positives = 34/48 (70%)

Query: 57  EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
           ++ + ++ +++     ++ ++ ++E+ E++E+EE+++  EEE+ +EEE
Sbjct: 133 DDSDDDDSEDETAALLRELEKIKKERAEEKEREEEEKAAEEEKAREEE 180


>gnl|CDD|179867 PRK04598, tatA, twin arginine translocase protein A; Provisional.
          Length = 81

 Score = 24.8 bits (54), Expect = 5.8
 Identities = 12/42 (28%), Positives = 25/42 (59%)

Query: 51 FKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQ 92
          FK+  +EEE  +  K+  + E K  +Q + E   +++K+++Q
Sbjct: 39 FKKAMSEEESAKANKKDADFEPKNLEQAKTEAAAEKKKDKEQ 80


>gnl|CDD|223100 COG0021, TktA, Transketolase [Carbohydrate transport and
           metabolism].
          Length = 663

 Score = 25.6 bits (57), Expect = 6.0
 Identities = 11/51 (21%), Positives = 19/51 (37%)

Query: 52  KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKE 102
           K L  E E  E  +E     +  +++  + +    E     +KK  E   E
Sbjct: 277 KALGWEPEPFEVPEEVYAAFRAVEERGAKAEAAWNELFAAYKKKYPELAAE 327


>gnl|CDD|130727 TIGR01666, YCCS, TIGR01666 family membrane protein.  This model
           represents a clade of sequences from gamma and beta
           proteobacteria. These proteins are >700 amino acids long
           and many have been annotated as putative membrane
           proteins. The gene from Salmonella has been annotated as
           a putative efflux transporter. The gene from E. coli has
           the name yccS [Cell envelope, Other].
          Length = 704

 Score = 25.6 bits (56), Expect = 6.1
 Identities = 12/35 (34%), Positives = 15/35 (42%)

Query: 66  EKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEK 100
           E  EE  +    QQQE  E  E  +Q+   EE   
Sbjct: 645 EHIEEIPEAIFNQQQESIETLELRKQEMTAEERAV 679


>gnl|CDD|236836 PRK11072, PRK11072, bifunctional glutamine-synthetase
           adenylyltransferase/deadenyltransferase; Reviewed.
          Length = 943

 Score = 25.6 bits (57), Expect = 6.1
 Identities = 9/50 (18%), Positives = 20/50 (40%)

Query: 49  VHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEE 98
             F +L  ++EE+ E +   E+ ++  Q    E++      E       +
Sbjct: 430 RVFNQLIGDDEEETEEEAASEQWRELWQDALDEEDATPLLAELGFDDPAQ 479


>gnl|CDD|224739 COG1826, TatA, Sec-independent protein secretion pathway components
           [Intracellular trafficking and secretion].
          Length = 94

 Score = 25.0 bits (55), Expect = 6.1
 Identities = 11/48 (22%), Positives = 27/48 (56%)

Query: 57  EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
           + E  EE K +E +++K++   + +  +++  +   E KE+ +KK + 
Sbjct: 47  KNELDEELKLEELDDKKKELTAELQATKEELDQLASELKEDLKKKAKP 94


>gnl|CDD|236277 PRK08493, PRK08493, NADH dehydrogenase subunit G; Validated.
          Length = 819

 Score = 25.8 bits (57), Expect = 6.4
 Identities = 9/49 (18%), Positives = 26/49 (53%)

Query: 57  EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
           ++   +E      EE +++  +Q E+  + ++E ++E  ++ +K  E +
Sbjct: 456 KQSIVKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKVKKIPEVD 504


>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
           represents the eukaryotic large ribosomal protein P1.
           Eukaryotic P1 and P2 are functionally equivalent to the
           bacterial protein L7/L12, but are not homologous to
           L7/L12. P1 is located in the L12 stalk, with proteins
           P2, P0, L11, and 28S rRNA. P1 and P2 are the only
           proteins in the ribosome to occur as multimers, always
           appearing as sets of heterodimers. Recent data indicate
           that eukaryotes have four copies (two heterodimers),
           while most archaeal species contain six copies of L12p
           (three homodimers) and bacteria may have four or six
           copies (two or three homodimers), depending on the
           species. Experiments using S. cerevisiae P1 and P2
           indicate that P1 proteins are positioned more internally
           with limited reactivity in the C-terminal domains, while
           P2 proteins seem to be more externally located and are
           more likely to interact with other cellular components.
           In lower eukaryotes, P1 and P2 are further subdivided
           into P1A, P1B, P2A, and P2B, which form P1A/P2B and
           P1B/P2A heterodimers. Some plant species have a third
           P-protein, called P3, which is not homologous to P1 and
           P2. In humans, P1 and P2 are strongly autoimmunogenic.
           They play a significant role in the etiology and
           pathogenesis of systemic lupus erythema (SLE). In
           addition, the ribosome-inactivating protein
           trichosanthin (TCS) interacts with human P0, P1, and P2,
           with its primary binding site located in the C-terminal
           region of P2. TCS inactivates the ribosome by
           depurinating a specific adenine in the sarcin-ricin loop
           of 28S rRNA.
          Length = 103

 Score = 25.0 bits (55), Expect = 6.4
 Identities = 9/20 (45%), Positives = 13/20 (65%)

Query: 88  KEEQQEKKEEEEKKEEEEKE 107
                 + ++EEKKEEEE+E
Sbjct: 74  AAAAAAEAKKEEKKEEEEEE 93


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 25.8 bits (57), Expect = 6.5
 Identities = 19/60 (31%), Positives = 27/60 (45%)

Query: 47  LQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
           LQ     L  E    E+ K+   E     ++Q +E E Q E+ E +  +  EE  E EEK
Sbjct: 286 LQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEK 345



 Score = 25.4 bits (56), Expect = 9.0
 Identities = 19/67 (28%), Positives = 35/67 (52%)

Query: 41  SRFGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEK 100
           SR  +  Q+  + L   E + EE + + EE + +  +  +E  E +EK E+ +++ E  +
Sbjct: 298 SRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLE 357

Query: 101 KEEEEKE 107
            E EE E
Sbjct: 358 AELEELE 364


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 25.3 bits (56), Expect = 6.6
 Identities = 13/45 (28%), Positives = 34/45 (75%)

Query: 56  TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEK 100
             EE++EE + ++EE++++++++++  +   +KEE++++ EE EK
Sbjct: 13  DIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEK 57


>gnl|CDD|225470 COG2918, GshA, Gamma-glutamylcysteine synthetase [Coenzyme
           metabolism].
          Length = 518

 Score = 25.5 bits (56), Expect = 6.7
 Identities = 14/55 (25%), Positives = 24/55 (43%)

Query: 32  AKTVRNDNSSRFGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQ 86
            + +++    RFG      ++EL  +E      +E  E E+    Q+QQE E   
Sbjct: 452 LRQMKDTGIGRFGLAFAEEYRELLRQEPLFLLIQEDFEAERLDSNQRQQELEAAD 506


>gnl|CDD|233870 TIGR02442, Cob-chelat-sub, cobaltochelatase subunit.
           Cobaltochelatase is responsible for the insertion of
           cobalt into the corrin ring of coenzyme B12 during its
           biosynthesis. Two versions have been well described.
           CbiK/CbiX is a monomeric, anaerobic version which acts
           early in the biosynthesis (pfam06180). CobNST is a
           trimeric, ATP-dependent, aerobic version which acts late
           in the biosynthesis (TIGR02257/TIGR01650/TIGR01651). A
           number of genomes (actinobacteria, cyanobacteria,
           betaproteobacteria and pseudomonads) which apparently
           biosynthesize B12, encode a cobN gene but are
           demonstrably lacking cobS and cobT. These genomes do,
           however contain a homolog (modelled here) of the
           magnesium chelatase subunits BchI/BchD family. Aside
           from the cyanobacteria (which have a separate magnesium
           chelatase trimer), these species do not make chlorins,
           so do not have any use for a magnesium chelatase.
           Furthermore, in nearly all cases the members of this
           family are proximal to either CobN itself or other genes
           involved in cobalt transport or B12 biosynthesis.
          Length = 633

 Score = 25.4 bits (56), Expect = 6.8
 Identities = 10/32 (31%), Positives = 19/32 (59%)

Query: 76  QQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
            +Q Q K+E+  +E+ +E   + EK +E E +
Sbjct: 314 FEQPQGKDEKDLEEKPEEPGPDPEKPDEGEDD 345


>gnl|CDD|234352 TIGR03779, Bac_Flav_CT_M, Bacteroides conjugative transposon TraM
           protein.  Members of this protein family are designated
           TraM and are found in a proposed transfer region of a
           class of conjugative transposon found in the Bacteroides
           lineage [Cellular processes, DNA transformation].
          Length = 410

 Score = 25.5 bits (56), Expect = 7.0
 Identities = 5/36 (13%), Positives = 16/36 (44%)

Query: 69  EEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
            E  K  ++++  +E ++ +     +     + EE+
Sbjct: 137 YEYPKTDEEKELLREVEELESRLATEPSPAPELEEQ 172


>gnl|CDD|163696 cd08065, MPN_eIF3h, Mpr1p, Pad1p N-terminal (MPN) domains without
           catalytic isopeptidase activity, found in eIF2h.
           Eukaryotic translation initiation factor 3 (eIF3)
           subunit h (eIF3h; eIF3 subunit 3; eIF3S3; eIF3-gamma;
           eIF3-p40) is an evolutionarily non-conserved subunit of
           the functional core that comprises eIF3a, eIF3b, eIF3c,
           eIF3e, eIF3f, and eIF3h, and contains the MPN domain.
           However, it lacks the canonical JAMM motif, and
           therefore does not show catalytic isopeptidase
           activity.Together with eIF3e and eIF3f, eIF3h stabilizes
           the eIF3 complex. Results suggest that eIF3h regulates
           cell growth and viability, and that over-expression of
           the gene may provide growth advantage to prostate,
           breast, and liver cancer cells. For example, EIF3h gene
           amplification is common in late-stage prostate cancer
           suggesting that it may be functionally involved in the
           progression of the disease. It has been shown that
           coamplification of MYC, a well characterized oncogene
           involved in cell growth, differentiation, and apoptosis,
           and EIF3h in patients with non-small cell lung cancer
           (NSCLC) improves survival if treated with the Epidermal
           Growth Factor Receptor Tyrosine Kinase Inhibitor
           (EGFR-TKI), Gefitinib. Plant eIF3h is implicated in
           translation of specific mRNAs.
          Length = 266

 Score = 25.3 bits (56), Expect = 7.1
 Identities = 8/38 (21%), Positives = 18/38 (47%)

Query: 70  EEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           E+ K    Q+    +Q + ++  +K++ E  + E   E
Sbjct: 222 EQGKFNYYQRNLARQQAQIQQWLQKRKAENAQREARGE 259



 Score = 25.3 bits (56), Expect = 7.4
 Identities = 11/33 (33%), Positives = 20/33 (60%)

Query: 72  QKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
            +QQ Q QQ  ++++ +  Q+E + EE   EE+
Sbjct: 234 ARQQAQIQQWLQKRKAENAQREARGEEPLPEED 266


>gnl|CDD|235751 PRK06241, PRK06241, phosphoenolpyruvate synthase; Validated.
          Length = 871

 Score = 25.2 bits (56), Expect = 7.4
 Identities = 10/47 (21%), Positives = 25/47 (53%)

Query: 55  NTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKK 101
           N +  E    K K E+ +++  ++++E   + ++    E+K +E K+
Sbjct: 611 NIKNFEPGASKRKFEQGRQEALEKEEELLSRLQQLPDGEQKAKETKR 657


>gnl|CDD|235551 PRK05667, dnaG, DNA primase; Validated.
          Length = 580

 Score = 25.2 bits (56), Expect = 7.5
 Identities = 19/93 (20%), Positives = 36/93 (38%), Gaps = 4/93 (4%)

Query: 15  ITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHFKELNTEEEEKEEGKEKEEE-EQK 73
           + E IL    L       + +R+         L+        EE  +E+    E+E +  
Sbjct: 478 LLEAILAQPGLTTGSQLLEHLRDAGLEELAALLE---SLAVWEEISEEDIAALEKELKDA 534

Query: 74  QQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
            ++ + Q  EE+ E+   +E+  E      EE+
Sbjct: 535 LEKLRDQLLEERLEELIAKERLLEGHGLSSEER 567


>gnl|CDD|224849 COG1938, COG1938, Archaeal enzymes of ATP-grasp superfamily
           [General function prediction only].
          Length = 244

 Score = 25.4 bits (56), Expect = 7.6
 Identities = 14/30 (46%), Positives = 21/30 (70%)

Query: 76  QQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
           +++ +E EEQ EK  +Q +KEEE  + EEE
Sbjct: 210 EKEAEEIEEQLEKLAEQLEKEEERVEREEE 239


>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
           Members of this family contain a region found
           exclusively in eukaryotic sodium channels or their
           subunits, many of which are voltage-gated. Members very
           often also contain between one and four copies of
           pfam00520 and, less often, one copy of pfam00612.
          Length = 230

 Score = 25.0 bits (55), Expect = 7.7
 Identities = 17/62 (27%), Positives = 29/62 (46%), Gaps = 5/62 (8%)

Query: 51  FKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEE-----QQEKKEEEEKKEEEE 105
            + LN E+   E     +EE +K   +  + +E+     E      +  +EEEE+  EEE
Sbjct: 114 LENLNDEDTSSESSYGFKEESKKGSAETLKLEEKDSSSSEGSTVDLEPPEEEEEEIAEEE 173

Query: 106 KE 107
           +E
Sbjct: 174 EE 175


>gnl|CDD|240253 PTZ00068, PTZ00068, 60S ribosomal protein L13a; Provisional.
          Length = 202

 Score = 25.0 bits (55), Expect = 8.0
 Identities = 8/42 (19%), Positives = 20/42 (47%)

Query: 66  EKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
            K EE++K++     +K+ +  K  ++ +K+   K  +    
Sbjct: 153 AKLEEKRKERAAAYYKKKVKLRKAWKEARKKALAKLPKAIVA 194


>gnl|CDD|179580 PRK03449, PRK03449, putative inner membrane protein translocase
           component YidC; Provisional.
          Length = 304

 Score = 25.0 bits (55), Expect = 8.1
 Identities = 5/17 (29%), Positives = 12/17 (70%)

Query: 81  EKEEQQEKEEQQEKKEE 97
           +KEE+ +K+ + E++  
Sbjct: 271 DKEEEAKKQAKAERRAA 287


>gnl|CDD|239571 cd03489, Topoisomer_IB_N_LdtopoI_like,
           Topoisomer_IB_N_LdtopoI_like: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the heterodimeric topo I from
           Leishmania donvanni. Topo I enzymes are divided into:
           topo type IA (bacterial) and type IB (eukaryotic). Topo
           I relaxes superhelical tension in duplex DNA by creating
           a single-strand nick, the broken strand can then rotate
           around the unbroken strand to remove DNA supercoils and,
           the nick is religated, liberating topo I. These enzymes
           regulate the topological changes that accompany DNA
           replication, transcription and other nuclear processes. 
           Human topo I is the target of a diverse set of
           anticancer drugs including camptothecins (CPTs). CPTs
           bind to the topo I-DNA complex and inhibit re-ligation
           of the single-strand nick, resulting in the accumulation
           of topo I-DNA adducts. In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topo I play putative roles in
           organizing the kinetoplast DNA network unique to these
           parasites.  This family may represent more than one
           structural domain.
          Length = 212

 Score = 25.2 bits (55), Expect = 8.1
 Identities = 10/30 (33%), Positives = 22/30 (73%)

Query: 74  QQQQQQQEKEEQQEKEEQQEKKEEEEKKEE 103
           +   +++EK++ + KEE++  KEE++K+ E
Sbjct: 89  EWHLREKEKKKSRTKEEKKALKEEKDKEAE 118


>gnl|CDD|221203 pfam11748, DUF3306, Protein of unknown function (DUF3306).  This
           family of proteobacterial species proteins has no known
           function.
          Length = 115

 Score = 24.6 bits (54), Expect = 8.1
 Identities = 10/41 (24%), Positives = 18/41 (43%)

Query: 67  KEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
           K     ++  +  +  EE+          EEEE+ E E++E
Sbjct: 10  KLAVRAEEPAEPAETAEEEAAAAAPAPAPEEEEEAELEDEE 50


>gnl|CDD|236781 PRK10869, PRK10869, recombination and repair protein; Provisional.
          Length = 553

 Score = 25.3 bits (56), Expect = 8.3
 Identities = 13/46 (28%), Positives = 22/46 (47%), Gaps = 8/46 (17%)

Query: 69  EEEQKQQQQQQQEKEEQQEKEEQQEKK--------EEEEKKEEEEK 106
             +  Q QQQ QE+  +++  + Q K+         E E+ +EE K
Sbjct: 170 CRDLAQHQQQSQERAARKQLLQYQLKELNEFAPQPGEFEQIDEEYK 215


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
           represents the L12P protein of the large (50S) subunit
           of the archaeal ribosome.
          Length = 105

 Score = 24.6 bits (54), Expect = 8.8
 Identities = 11/23 (47%), Positives = 18/23 (78%)

Query: 83  EEQQEKEEQQEKKEEEEKKEEEE 105
               E+EE++E++EEEE++E EE
Sbjct: 72  AAAAEEEEEEEEEEEEEEEESEE 94


>gnl|CDD|221179 pfam11711, Tim54, Inner membrane protein import complex subunit
           Tim54.  Mitochondrial function depends on the import of
           hundreds of different proteins synthesised in the
           cytosol. Protein import is a multi-step pathway which
           includes the binding of precursor proteins to surface
           receptors, translocation of the precursor across one or
           both mitochondrial membranes, and folding and assembly
           of the imported protein inside the mitochondrion. Most
           precursor proteins carry amino-terminal targeting
           signals, called pre-sequences, and are imported into
           mitochondria via import complexes located in both the
           outer and the inner membrane (IM). The IM complex, TIM,
           is made up of at least two proteins which mediate
           translocation of proteins into the matrix by removing
           their signal peptide and another pair of proteins, Tim54
           and Tim22, that insert the polytopic proteins, that
           carry internal targetting information, into the inner
           membrane.
          Length = 377

 Score = 25.1 bits (55), Expect = 8.9
 Identities = 11/52 (21%), Positives = 19/52 (36%)

Query: 56  TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
            E  E    +   E E +     +   E  +E  E   ++ E+  +EE  K 
Sbjct: 198 PEPPEPTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENNKP 249


>gnl|CDD|239572 cd03490, Topoisomer_IB_N_1, Topoisomer_IB_N_1: A subgroup of the
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB. Topo IB proteins include the
           monomeric yeast and human topo I and heterodimeric topo
           I from Leishmania donvanni. Topo I enzymes are divided
           into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topos I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I have putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 217

 Score = 24.9 bits (54), Expect = 8.9
 Identities = 9/26 (34%), Positives = 20/26 (76%)

Query: 58  EEEKEEGKEKEEEEQKQQQQQQQEKE 83
           EEEKE+ K   +EE++ +++++ ++E
Sbjct: 94  EEEKEKKKNLNKEEKEAKKKERAKRE 119


>gnl|CDD|178439 PLN02847, PLN02847, triacylglycerol lipase.
          Length = 633

 Score = 25.2 bits (55), Expect = 9.0
 Identities = 19/44 (43%), Positives = 26/44 (59%), Gaps = 3/44 (6%)

Query: 58  EEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKE--EEE 99
           EEE  EG E   E +K+ Q+Q+ E + Q ++EE    KE  EEE
Sbjct: 491 EEEVTEG-ELWYELEKELQRQETEVDAQAQEEEAAAAKEITEEE 533


>gnl|CDD|237200 PRK12778, PRK12778, putative bifunctional 2-polyprenylphenol
           hydroxylase/glutamate synthase subunit beta;
           Provisional.
          Length = 752

 Score = 25.1 bits (55), Expect = 9.0
 Identities = 11/59 (18%), Positives = 23/59 (38%), Gaps = 3/59 (5%)

Query: 42  RFGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEK---EEQQEKEEQQEKKEE 97
           R G +  +  +EL   EE     + +  +  K +++   E+    E   +     + EE
Sbjct: 256 RMGAYKTIEGEELLKLEERTAAWRAELRKSMKPKERTAIERVPMPELDPEYRAHNRFEE 314


>gnl|CDD|240282 PTZ00126, PTZ00126, tyrosyl-tRNA synthetase; Provisional.
          Length = 383

 Score = 25.0 bits (55), Expect = 9.5
 Identities = 13/56 (23%), Positives = 19/56 (33%), Gaps = 4/56 (7%)

Query: 53  ELNTEEEEKEEGKEK----EEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
               EE +      K    EE E   +    Q K   +E+ +      EE  + EE
Sbjct: 1   TNPGEEFKSGSCSIKVGLREELETGFRGSPPQSKLSLEERVKLCLSIGEECIQPEE 56


>gnl|CDD|240400 PTZ00396, PTZ00396, Casein kinase II subunit beta; Provisional.
          Length = 251

 Score = 25.0 bits (55), Expect = 9.9
 Identities = 6/38 (15%), Positives = 19/38 (50%)

Query: 67  KEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
           K+  ++ + Q  +  K   ++  ++  K   E +++E+
Sbjct: 209 KKSSKEYKLQNGEFGKSAIEQFLKKSPKGNIELQEQEQ 246


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.291    0.114    0.275 

Gapped
Lambda     K      H
   0.267   0.0812    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 5,218,211
Number of extensions: 472602
Number of successful extensions: 18256
Number of sequences better than 10.0: 1
Number of HSP's gapped: 10248
Number of HSP's successfully gapped: 4020
Length of query: 107
Length of database: 10,937,602
Length adjustment: 72
Effective length of query: 35
Effective length of database: 7,744,114
Effective search space: 271043990
Effective search space used: 271043990
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 17 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 44 (21.6 bits)
S2: 53 (24.0 bits)