RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy6822
(107 letters)
>gnl|CDD|214580 smart00242, MYSc, Myosin. Large ATPases. ATPase; molecular motor.
Muscle contraction consists of a cyclical interaction
between myosin and actin. The core of the myosin
structure is similar in fold to that of kinesin.
Length = 677
Score = 84.9 bits (211), Expect = 1e-20
Identities = 32/55 (58%), Positives = 46/55 (83%), Gaps = 2/55 (3%)
Query: 1 MQYLAAVNKSPS--NLITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHFKE 53
MQYLA+V+ S + + +QILE++P+LE+FGNAKT+RN+NSSRFGKF+++HF
Sbjct: 113 MQYLASVSGSNTEVGSVEDQILESNPILEAFGNAKTLRNNNSSRFGKFIEIHFDA 167
>gnl|CDD|238071 cd00124, MYSc, Myosin motor domain. This catalytic (head) domain
has ATPase activity and belongs to the larger group of
P-loop NTPases. Myosins are actin-dependent molecular
motors that play important roles in muscle contraction,
cell motility, and organelle transport. The head domain
is a molecular motor, which utilizes ATP hydrolysis to
generate directed movement toward the plus end along
actin filaments. A cyclical interaction between myosin
and actin provides the driving force. Rates of ATP
hydrolysis and consequently the speed of movement along
actin filaments vary widely, from about 0.04 micrometer
per second for myosin I to 4.5 micrometer per second for
myosin II in skeletal muscle. Myosin II moves in
discrete steps about 5-10 nm long and generates 1-5
piconewtons of force. Upon ATP binding, the myosin head
dissociates from an actin filament. ATP hydrolysis
causes the head to pivot and associate with a new actin
subunit. The release of Pi causes the head to pivot and
move the filament (power stroke). Release of ADP
completes the cycle.
Length = 679
Score = 83.9 bits (208), Expect = 3e-20
Identities = 31/51 (60%), Positives = 42/51 (82%)
Query: 1 MQYLAAVNKSPSNLITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHF 51
M+YLA++ S I E+IL A+P+LE+FGNAKTVRN+NSSRFGKF+++ F
Sbjct: 107 MKYLASLAGSNDTGIEEKILAANPILEAFGNAKTVRNNNSSRFGKFIELQF 157
>gnl|CDD|238683 cd01387, MYSc_type_XV, Myosin motor domain, type XV myosins. In
vertebrates, myosin XV appears to be expressed in
sensory tissue and play a role in hearing. This
catalytic (head) domain has ATPase activity and belongs
to the larger group of P-loop NTPases. Myosins are
actin-dependent molecular motors that play important
roles in muscle contraction, cell motility, and
organelle transport. The head domain is a molecular
motor, which utilizes ATP hydrolysis to generate
directed movement toward the plus end along actin
filaments. A cyclical interaction between myosin and
actin provides the driving force. Rates of ATP
hydrolysis and consequently the speed of movement along
actin filaments vary widely, from about 0.04 micrometer
per second for myosin I to 4.5 micrometer per second for
myosin II in skeletal muscle. Myosin II moves in
discrete steps about 5-10 nm long and generates 1-5
piconewtons of force. Upon ATP binding, the myosin head
dissociates from an actin filament. ATP hydrolysis
causes the head to pivot and associate with a new actin
subunit. The release of Pi causes the head to pivot and
move the filament (power stroke). Release of ADP
completes the cycle.
Length = 677
Score = 83.4 bits (206), Expect = 3e-20
Identities = 36/52 (69%), Positives = 47/52 (90%)
Query: 1 MQYLAAVNKSPSNLITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHFK 52
++YLAA+N+ S +ITEQILEA+PLLE+FGNAKTVRNDNSSRFGKF+++ +
Sbjct: 108 LRYLAAMNQGGSAVITEQILEATPLLEAFGNAKTVRNDNSSRFGKFVEIFLE 159
>gnl|CDD|238673 cd01377, MYSc_type_II, Myosin motor domain, type II myosins. Myosin
II mediates cortical contraction in cell motility, and
is the motor in smooth and skeletal muscle. This
catalytic (head) domain has ATPase activity and belongs
to the larger group of P-loop NTPases. Myosins are
actin-dependent molecular motors that play important
roles in muscle contraction, cell motility, and
organelle transport. The head domain is a molecular
motor, which utilizes ATP hydrolysis to generate
directed movement toward the plus end along actin
filaments. A cyclical interaction between myosin and
actin provides the driving force. Rates of ATP
hydrolysis and consequently the speed of movement along
actin filaments vary widely, from about 0.04 micrometer
per second for myosin I to 4.5 micrometer per second for
myosin II in skeletal muscle. Myosin II moves in
discrete steps about 5-10 nm long and generates 1-5
piconewtons of force. Upon ATP binding, the myosin head
dissociates from an actin filament. ATP hydrolysis
causes the head to pivot and associate with a new actin
subunit. The release of Pi causes the head to pivot and
move the filament (power stroke). Release of ADP
completes the cycle.
Length = 693
Score = 79.2 bits (196), Expect = 1e-18
Identities = 32/59 (54%), Positives = 43/59 (72%), Gaps = 9/59 (15%)
Query: 2 QYLAAV---------NKSPSNLITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHF 51
QYLA+V + + +QIL+A+P+LE+FGNAKTVRNDNSSRFGKF+++HF
Sbjct: 113 QYLASVAASSKKKKQSGKGQGTLEDQILQANPILEAFGNAKTVRNDNSSRFGKFIRIHF 171
>gnl|CDD|215687 pfam00063, Myosin_head, Myosin head (motor domain).
Length = 679
Score = 77.7 bits (192), Expect = 4e-18
Identities = 30/54 (55%), Positives = 43/54 (79%), Gaps = 3/54 (5%)
Query: 1 MQYLAAV---NKSPSNLITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHF 51
MQYLA+V S + +QIL+++P+LE+FGNAKT+RN+NSSRFGKF+++ F
Sbjct: 106 MQYLASVSGSTPSAVGPLEDQILQSNPILEAFGNAKTLRNNNSSRFGKFIEIQF 159
>gnl|CDD|238674 cd01378, MYSc_type_I, Myosin motor domain, type I myosins. Myosin I
generates movement at the leading edge in cell motility,
and class I myosins have been implicated in phagocytosis
and vesicle transport. Myosin I, an unconventional
myosin, does not form dimers. This catalytic (head)
domain has ATPase activity and belongs to the larger
group of P-loop NTPases. Myosins are actin-dependent
molecular motors that play important roles in muscle
contraction, cell motility, and organelle transport. The
head domain is a molecular motor, which utilizes ATP
hydrolysis to generate directed movement toward the plus
end along actin filaments. A cyclical interaction
between myosin and actin provides the driving force.
Rates of ATP hydrolysis and consequently the speed of
movement along actin filaments vary widely, from about
0.04 micrometer per second for myosin I to 4.5
micrometer per second for myosin II in skeletal muscle.
Myosin II moves in discrete steps about 5-10 nm long and
generates 1-5 piconewtons of force. Upon ATP binding,
the myosin head dissociates from an actin filament. ATP
hydrolysis causes the head to pivot and associate with a
new actin subunit. The release of Pi causes the head to
pivot and move the filament (power stroke). Release of
ADP completes the cycle.
Length = 674
Score = 77.2 bits (191), Expect = 5e-18
Identities = 28/53 (52%), Positives = 41/53 (77%), Gaps = 2/53 (3%)
Query: 1 MQYLAAV--NKSPSNLITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHF 51
MQY+AAV + + IL+++PLLE+FGNAKT+RN+NSSRFGK++++ F
Sbjct: 107 MQYIAAVSGGGQKVERVKDVILQSNPLLEAFGNAKTLRNNNSSRFGKYMEIQF 159
>gnl|CDD|238677 cd01381, MYSc_type_VII, Myosin motor domain, type VII myosins.
Myosins in this group have been associated with
functions in sensory systems such as vision and hearing.
This catalytic (head) domain has ATPase activity and
belongs to the larger group of P-loop NTPases. Myosins
are actin-dependent molecular motors that play important
roles in muscle contraction, cell motility, and
organelle transport. The head domain is a molecular
motor, which utilizes ATP hydrolysis to generate
directed movement toward the plus end along actin
filaments. A cyclical interaction between myosin and
actin provides the driving force. Rates of ATP
hydrolysis and consequently the speed of movement along
actin filaments vary widely, from about 0.04 micrometer
per second for myosin I to 4.5 micrometer per second for
myosin II in skeletal muscle. Myosin II moves in
discrete steps about 5-10 nm long and generates 1-5
piconewtons of force. Upon ATP binding, the myosin head
dissociates from an actin filament. ATP hydrolysis
causes the head to pivot and associate with a new actin
subunit. The release of Pi causes the head to pivot and
move the filament (power stroke). Release of ADP
completes the cycle.
Length = 671
Score = 74.0 bits (182), Expect = 7e-17
Identities = 33/53 (62%), Positives = 45/53 (84%), Gaps = 1/53 (1%)
Query: 1 MQYLAAVNKSPSNLITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHFKE 53
+QYLAA++ S I +QILEA+P+LE+FGNAKT+RNDNSSRFGK++ +HF +
Sbjct: 107 LQYLAAISGKHS-WIEQQILEANPILEAFGNAKTIRNDNSSRFGKYIDIHFNK 158
>gnl|CDD|238676 cd01380, MYSc_type_V, Myosin motor domain, type V myosins. Myosins
V transport a variety of intracellular cargo
processively along actin filaments, such as membraneous
organelles and mRNA. This catalytic (head) domain has
ATPase activity and belongs to the larger group of
P-loop NTPases. Myosins are actin-dependent molecular
motors that play important roles in muscle contraction,
cell motility, and organelle transport. The head domain
is a molecular motor, which utilizes ATP hydrolysis to
generate directed movement toward the plus end along
actin filaments. A cyclical interaction between myosin
and actin provides the driving force. Rates of ATP
hydrolysis and consequently the speed of movement along
actin filaments vary widely, from about 0.04 micrometer
per second for myosin I to 4.5 micrometer per second for
myosin II in skeletal muscle. Myosin II moves in
discrete steps about 5-10 nm long and generates 1-5
piconewtons of force. Upon ATP binding, the myosin head
dissociates from an actin filament. ATP hydrolysis
causes the head to pivot and associate with a new actin
subunit. The release of Pi causes the head to pivot and
move the filament (power stroke). Release of ADP
completes the cycle.
Length = 691
Score = 72.0 bits (177), Expect = 4e-16
Identities = 28/61 (45%), Positives = 42/61 (68%), Gaps = 5/61 (8%)
Query: 1 MQYLAAVNKSPSNL-----ITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHFKELN 55
M+Y A+V S S + E++L ++P++E+FGNAKT RNDNSSRFGK++Q+ F +
Sbjct: 107 MRYFASVGGSDSREVSETQVEEKVLASNPIMEAFGNAKTTRNDNSSRFGKYIQILFDKRG 166
Query: 56 T 56
Sbjct: 167 R 167
>gnl|CDD|238680 cd01384, MYSc_type_XI, Myosin motor domain, plant-specific type XI
myosin, involved in organelle transport. This catalytic
(head) domain has ATPase activity and belongs to the
larger group of P-loop NTPases. Myosins are
actin-dependent molecular motors that play important
roles in muscle contraction, cell motility, and
organelle transport. The head domain is a molecular
motor, which utilizes ATP hydrolysis to generate
directed movement toward the plus end along actin
filaments. A cyclical interaction between myosin and
actin provides the driving force. Rates of ATP
hydrolysis and consequently the speed of movement along
actin filaments vary widely, from about 0.04 micrometer
per second for myosin I to 4.5 micrometer per second for
myosin II in skeletal muscle. Myosin II moves in
discrete steps about 5-10 nm long and generates 1-5
piconewtons of force. Upon ATP binding, the myosin head
dissociates from an actin filament. ATP hydrolysis
causes the head to pivot and associate with a new actin
subunit. The release of Pi causes the head to pivot and
move the filament (power stroke). Release of ADP
completes the cycle.
Length = 674
Score = 71.2 bits (175), Expect = 7e-16
Identities = 28/56 (50%), Positives = 42/56 (75%), Gaps = 3/56 (5%)
Query: 1 MQYLAAVNKSPSNL---ITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHFKE 53
M+YLA + + +Q+LE++P+LE+FGNAKTVRN+NSSRFGKF+++ F +
Sbjct: 109 MRYLAYMGGRAGVEGRTVEQQVLESNPVLEAFGNAKTVRNNNSSRFGKFVEIQFDD 164
>gnl|CDD|238678 cd01382, MYSc_type_VI, Myosin motor domain, type VI myosins. Myosin
VI is a monomeric myosin, which moves towards the
minus-end of actin filaments, in contrast to most other
myosins. It has been implicated in endocytosis,
secretion, and cell migration. This catalytic (head)
domain has ATPase activity and belongs to the larger
group of P-loop NTPases. Myosins are actin-dependent
molecular motors that play important roles in muscle
contraction, cell motility, and organelle transport. The
head domain is a molecular motor, which utilizes ATP
hydrolysis to generate directed movement toward the
minus end along actin filaments. A cyclical interaction
between myosin and actin provides the driving force.
Rates of ATP hydrolysis and consequently the speed of
movement along actin filaments vary widely, from about
0.04 micrometer per second for myosin I to 4.5
micrometer per second for myosin II in skeletal muscle.
Myosin II moves in discrete steps about 5-10 nm long and
generates 1-5 piconewtons of force. Upon ATP binding,
the myosin head dissociates from an actin filament. ATP
hydrolysis causes the head to pivot and associate with a
new actin subunit. The release of Pi causes the head to
pivot and move the filament (power stroke). Release of
ADP completes the cycle.
Length = 717
Score = 69.0 bits (169), Expect = 3e-15
Identities = 29/41 (70%), Positives = 38/41 (92%)
Query: 15 ITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHFKELN 55
I ++I+EA+PLLE+FGNAKTVRN+NSSRFGKF+++HF E N
Sbjct: 125 IDDRIVEANPLLEAFGNAKTVRNNNSSRFGKFVEIHFNEKN 165
>gnl|CDD|238679 cd01383, MYSc_type_VIII, Myosin motor domain, plant-specific type
VIII myosins, a subgroup which has been associated with
endocytosis, cytokinesis, cell-to-cell coupling and
gating at plasmodesmata. This catalytic (head) domain
has ATPase activity and belongs to the larger group of
P-loop NTPases. Myosins are actin-dependent molecular
motors that play important roles in muscle contraction,
cell motility, and organelle transport. The head domain
is a molecular motor, which utilizes ATP hydrolysis to
generate directed movement toward the plus end along
actin filaments. A cyclical interaction between myosin
and actin provides the driving force. Rates of ATP
hydrolysis and consequently the speed of movement along
actin filaments vary widely, from about 0.04 micrometer
per second for myosin I to 4.5 micrometer per second for
myosin II in skeletal muscle. Myosin II moves in
discrete steps about 5-10 nm long and generates 1-5
piconewtons of force. Upon ATP binding, the myosin head
dissociates from an actin filament. ATP hydrolysis
causes the head to pivot and associate with a new actin
subunit. The release of Pi causes the head to pivot and
move the filament (power stroke). Release of ADP
completes the cycle.
Length = 677
Score = 65.6 bits (160), Expect = 7e-14
Identities = 30/53 (56%), Positives = 41/53 (77%), Gaps = 2/53 (3%)
Query: 1 MQYLAAVNKSPSNLITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHFKE 53
MQYLA++ + I +IL+ +P+LE+FGNAKT RNDNSSRFGK +++HF E
Sbjct: 113 MQYLASLGGG--SGIEYEILQTNPILEAFGNAKTSRNDNSSRFGKLIEIHFSE 163
>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
Length = 1463
Score = 64.7 bits (158), Expect = 1e-13
Identities = 33/56 (58%), Positives = 44/56 (78%), Gaps = 3/56 (5%)
Query: 1 MQYLAAVNKSPSNLIT---EQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHFKE 53
MQYLA+V S + I+ +QIL +P+LE+FGNAKTVRNDNSSRFGK++++ F E
Sbjct: 173 MQYLASVTSSSTVEISSIEKQILATNPILEAFGNAKTVRNDNSSRFGKYIKIEFDE 228
>gnl|CDD|240229 PTZ00014, PTZ00014, myosin-A; Provisional.
Length = 821
Score = 58.5 bits (142), Expect = 2e-11
Identities = 26/50 (52%), Positives = 38/50 (76%), Gaps = 1/50 (2%)
Query: 1 MQYLAAVNKSPSNL-ITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQV 49
M+Y A+ +L I I+ A+P+LE+FGNAKT+RN+NSSRFG+F+Q+
Sbjct: 204 MRYFASSKSGNMDLKIQNAIMAANPVLEAFGNAKTIRNNNSSRFGRFMQL 253
>gnl|CDD|238675 cd01379, MYSc_type_III, Myosin motor domain, type III myosins.
Myosin III has been shown to play a role in the vision
process in insects and in hearing in mammals. Myosin
III, an unconventional myosin, does not form dimers.
This catalytic (head) domain has ATPase activity and
belongs to the larger group of P-loop NTPases. Myosins
are actin-dependent molecular motors that play important
roles in muscle contraction, cell motility, and
organelle transport. The head domain is a molecular
motor, which utilizes ATP hydrolysis to generate
directed movement toward the plus end along actin
filaments. A cyclical interaction between myosin and
actin provides the driving force. Rates of ATP
hydrolysis and consequently the speed of movement along
actin filaments vary widely, from about 0.04 micrometer
per second for myosin I to 4.5 micrometer per second for
myosin II in skeletal muscle. Myosin II moves in
discrete steps about 5-10 nm long and generates 1-5
piconewtons of force. Upon ATP binding, the myosin head
dissociates from an actin filament. ATP hydrolysis
causes the head to pivot and associate with a new actin
subunit. The release of Pi causes the head to pivot and
move the filament (power stroke). Release of ADP
completes the cycle.
Length = 653
Score = 54.1 bits (130), Expect = 7e-10
Identities = 24/51 (47%), Positives = 38/51 (74%)
Query: 1 MQYLAAVNKSPSNLITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHF 51
+Q L + K+ + + E+IL+ + L+E+FGNA+T NDNSSRFGK+L++ F
Sbjct: 107 VQQLTVLGKANNRTLQEKILQVNSLVEAFGNARTGINDNSSRFGKYLEMKF 157
>gnl|CDD|238681 cd01385, MYSc_type_IX, Myosin motor domain, type IX myosins. Myosin
IX is a processive single-headed motor, which might play
a role in signalling. This catalytic (head) domain has
ATPase activity and belongs to the larger group of
P-loop NTPases. Myosins are actin-dependent molecular
motors that play important roles in muscle contraction,
cell motility, and organelle transport. The head domain
is a molecular motor, which utilizes ATP hydrolysis to
generate directed movement toward the plus end along
actin filaments. A cyclical interaction between myosin
and actin provides the driving force. Rates of ATP
hydrolysis and consequently the speed of movement along
actin filaments vary widely, from about 0.04 micrometer
per second for myosin I to 4.5 micrometer per second for
myosin II in skeletal muscle. Myosin II moves in
discrete steps about 5-10 nm long and generates 1-5
piconewtons of force. Upon ATP binding, the myosin head
dissociates from an actin filament. ATP hydrolysis
causes the head to pivot and associate with a new actin
subunit. The release of Pi causes the head to pivot and
move the filament (power stroke). Release of ADP
completes the cycle.
Length = 692
Score = 53.4 bits (128), Expect = 1e-09
Identities = 26/39 (66%), Positives = 33/39 (84%), Gaps = 1/39 (2%)
Query: 16 TEQ-ILEASPLLESFGNAKTVRNDNSSRFGKFLQVHFKE 53
EQ IL A P+LE+FGNAKT N+NSSRFGKF+QV+++E
Sbjct: 131 VEQTILSAGPVLEAFGNAKTAHNNNSSRFGKFIQVNYRE 169
>gnl|CDD|238659 cd01363, Motor_domain, Myosin and Kinesin motor domain. These
ATPases belong to the P-loop NTPase family and provide
the driving force in myosin and kinesin mediated
processes.
Length = 186
Score = 46.4 bits (110), Expect = 2e-07
Identities = 11/52 (21%), Positives = 20/52 (38%), Gaps = 1/52 (1%)
Query: 15 ITEQILEASPLLESFG-NAKTVRNDNSSRFGKFLQVHFKELNTEEEEKEEGK 65
+++ + A T N++SSR ++HF N E+ K
Sbjct: 54 TVTDVIDLMDKGNANRTTAATAMNEHSSRSHSVFRIHFGGKNALASATEQPK 105
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 43.4 bits (102), Expect = 4e-06
Identities = 26/89 (29%), Positives = 48/89 (53%)
Query: 17 EQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQ 76
E EA E G ++T +D + Q E E ++ E+G +
Sbjct: 804 EGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDS 863
Query: 77 QQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
++++E+EE++E+EE++E++EEEE++E EE
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 39.2 bits (91), Expect = 1e-04
Identities = 21/57 (36%), Positives = 39/57 (68%), Gaps = 1/57 (1%)
Query: 52 KELNTEEE-EKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+ELN E + E ++ ++ + +E+EE++E+EE++E++EEEE++EEEE E
Sbjct: 835 QELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENE 891
Score = 35.0 bits (80), Expect = 0.003
Identities = 15/44 (34%), Positives = 35/44 (79%)
Query: 55 NTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEE 98
++EEEE+EE +E+EEEE+++++++++E+ E+ E E ++++
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQ 905
Score = 33.4 bits (76), Expect = 0.014
Identities = 15/44 (34%), Positives = 33/44 (75%)
Query: 58 EEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKK 101
+ E+EE +E+EEEE+++++++++E+EE+ E+ E E +K+
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQ 905
Score = 32.3 bits (73), Expect = 0.032
Identities = 19/59 (32%), Positives = 36/59 (61%), Gaps = 4/59 (6%)
Query: 53 ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKE----EQQEKEEQQEKKEEEEKKEEEEKE 107
E+ E E+E E + E ++ ++ + +E+EE++E++EEEE++EEEE+E
Sbjct: 827 EVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEE 885
Score = 30.3 bits (68), Expect = 0.15
Identities = 13/52 (25%), Positives = 29/52 (55%)
Query: 56 TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E E++ E + K E E + + +++ E++ E E + ++ + + + E EE E
Sbjct: 665 GEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVE 716
Score = 30.0 bits (67), Expect = 0.19
Identities = 17/55 (30%), Positives = 27/55 (49%)
Query: 53 ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E E +E + E E EE + + + + E E + + E E+ EE E + E E E
Sbjct: 696 EGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAE 750
Score = 29.6 bits (66), Expect = 0.25
Identities = 15/55 (27%), Positives = 26/55 (47%)
Query: 50 HFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
H E EE E+ E E E+ + +Q+ + E + + E + + E K E+E
Sbjct: 640 HTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQE 694
Score = 29.6 bits (66), Expect = 0.27
Identities = 15/40 (37%), Positives = 27/40 (67%)
Query: 53 ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQ 92
E EEEE+EE +E+EEEE+++++ ++ E E ++Q
Sbjct: 866 EEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQ 905
Score = 29.6 bits (66), Expect = 0.27
Identities = 15/42 (35%), Positives = 30/42 (71%)
Query: 53 ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEK 94
E EEEE+EE +E+EEEE+++++++ +E + E +Q++
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQ 905
Score = 28.4 bits (63), Expect = 0.65
Identities = 18/59 (30%), Positives = 32/59 (54%)
Query: 48 QVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
+ E E E EEG+E E+E + + + + + + E KE + E + E E KE+E++
Sbjct: 722 EAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDE 780
Score = 28.4 bits (63), Expect = 0.82
Identities = 15/38 (39%), Positives = 26/38 (68%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKE 89
+E EEEE+EE +E+EEEE++ ++ E E ++K+
Sbjct: 868 EEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQ 905
Score = 28.0 bits (62), Expect = 1.0
Identities = 14/40 (35%), Positives = 28/40 (70%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQ 91
+E EEEE+EE +E+EEEE++++ ++ E + +++Q
Sbjct: 866 EEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQ 905
Score = 28.0 bits (62), Expect = 1.1
Identities = 17/55 (30%), Positives = 29/55 (52%)
Query: 53 ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E E E + +G+ + E E +++ +QE E + E +E K E E ++ E E E
Sbjct: 666 EAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGE 720
Score = 27.7 bits (61), Expect = 1.2
Identities = 15/52 (28%), Positives = 27/52 (51%)
Query: 56 TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E E + E KE + + + + ++ + E E + E E + + E E+ EE E E
Sbjct: 693 QEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDE 744
Score = 27.7 bits (61), Expect = 1.3
Identities = 16/55 (29%), Positives = 29/55 (52%)
Query: 53 ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E E E KE + E E ++ + + + E E +++ E + +E EE ++E E E
Sbjct: 694 EGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGE 748
Score = 27.3 bits (60), Expect = 1.7
Identities = 15/56 (26%), Positives = 28/56 (50%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E+E + E K + E E + +++ E+E + E E ++ + E + EE E E
Sbjct: 663 SGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHE 718
Score = 26.9 bits (59), Expect = 2.2
Identities = 17/52 (32%), Positives = 29/52 (55%)
Query: 53 ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
E+ E E + EG E E E + ++ ++ E E + E E + E + E ++KE E
Sbjct: 714 EVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETE 765
Score = 26.9 bits (59), Expect = 2.4
Identities = 16/53 (30%), Positives = 26/53 (49%)
Query: 55 NTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+T E EEG+ E E + ++ E E++ E E + E + E E E + E
Sbjct: 640 HTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGE 692
Score = 26.9 bits (59), Expect = 2.4
Identities = 16/55 (29%), Positives = 30/55 (54%)
Query: 53 ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E +E E E G+E EE E + + + + + E + E + ++ + E E + E +E E
Sbjct: 724 EGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDE 778
Score = 26.5 bits (58), Expect = 3.1
Identities = 18/57 (31%), Positives = 31/57 (54%), Gaps = 2/57 (3%)
Query: 53 ELNTEEEEKEEGKEKEEEEQKQQQQQQQ--EKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E+ T EE +E E E E + + + + + KE + E E + E KE+E++ E + E
Sbjct: 731 EIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGE 787
Score = 26.5 bits (58), Expect = 3.1
Identities = 19/69 (27%), Positives = 33/69 (47%), Gaps = 14/69 (20%)
Query: 53 ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEE-------------- 98
E TE EK+E + + E + + + + E++ E Q E K++E
Sbjct: 804 EGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDS 863
Query: 99 EKKEEEEKE 107
E++EEEE+E
Sbjct: 864 EEEEEEEEE 872
Score = 26.5 bits (58), Expect = 3.4
Identities = 12/55 (21%), Positives = 29/55 (52%)
Query: 53 ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E E E + + + + E + ++ + + E E + ++E + + EE ++ E+E E
Sbjct: 692 EQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGE 746
Score = 26.5 bits (58), Expect = 3.5
Identities = 14/48 (29%), Positives = 24/48 (50%)
Query: 60 EKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E E E+ EE ++ + + E E+ E +QE + E + + E E E
Sbjct: 637 EAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGE 684
Score = 25.7 bits (56), Expect = 5.6
Identities = 18/56 (32%), Positives = 29/56 (51%), Gaps = 3/56 (5%)
Query: 55 NTEEEEKEEGKEKEEEEQKQQQQQ---QQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
N EE E +E E E + + + + E++ +QE E + E KE + K E E +E
Sbjct: 659 NGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEE 714
Score = 25.7 bits (56), Expect = 6.6
Identities = 15/57 (26%), Positives = 24/57 (42%), Gaps = 1/57 (1%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQ-QQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
K E E E +E E + + + +E + E+E + E K E E + E E
Sbjct: 632 KGDVAEAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAE 688
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 37.2 bits (87), Expect = 6e-04
Identities = 15/55 (27%), Positives = 46/55 (83%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
K+ E++EK++ +++++++++++++KEE++E+EE++ ++E+EE++E+++K
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475
Score = 36.8 bits (86), Expect = 7e-04
Identities = 23/52 (44%), Positives = 44/52 (84%)
Query: 56 TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
EEEK+E K+K +K+++++++EKE+++E++E++E++ EEEK+EEEEK+
Sbjct: 422 KREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKK 473
Score = 36.1 bits (84), Expect = 0.002
Identities = 24/51 (47%), Positives = 43/51 (84%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E+ EK+ +EK+E+++K +++E+EE++EKE+++E+KEEEE++ EEEKE
Sbjct: 417 EKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKE 467
Score = 34.9 bits (81), Expect = 0.004
Identities = 15/56 (26%), Positives = 47/56 (83%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
++ EE+++++ K +++++++++++EK+E++++EE++E +EE+E++EE++K+
Sbjct: 420 EKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475
Score = 34.1 bits (79), Expect = 0.006
Identities = 19/49 (38%), Positives = 40/49 (81%)
Query: 59 EEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
++ K++EEE+++++++ K++++E+EE++EKKEEE+++EEEE E
Sbjct: 415 IVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463
Score = 33.7 bits (78), Expect = 0.010
Identities = 19/56 (33%), Positives = 45/56 (80%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
K++ E+ E+ +E+E++E+K++ ++KEE++E+E++++++E+EE++EE E+E
Sbjct: 410 KKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEE 465
Score = 33.0 bits (76), Expect = 0.017
Identities = 22/63 (34%), Positives = 46/63 (73%), Gaps = 1/63 (1%)
Query: 45 KFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
+FL K+ + ++ E EK+ EE+K++++++ +++E+EE++EK+++EE+KEEE
Sbjct: 400 EFL-TGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEE 458
Query: 105 EKE 107
E+E
Sbjct: 459 EEE 461
Score = 31.4 bits (72), Expect = 0.062
Identities = 22/60 (36%), Positives = 42/60 (70%), Gaps = 5/60 (8%)
Query: 53 ELNTEEEE-----KEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
EL EE E K+ K+ ++ +K ++++++EK+E+++K +KKEEEE++E+E+KE
Sbjct: 393 ELTEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKE 452
Score = 31.0 bits (71), Expect = 0.073
Identities = 15/56 (26%), Positives = 35/56 (62%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
L EEE E ++ +K ++ ++ +++++E++++++KK KK+EEE+E
Sbjct: 390 AFLELTEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEE 445
Score = 27.2 bits (61), Expect = 1.6
Identities = 18/42 (42%), Positives = 35/42 (83%)
Query: 51 FKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQ 92
F EEEE+EE ++KEEE+++++++ ++EKEE++EK+++Q
Sbjct: 435 FAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQ 476
>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
Length = 1437
Score = 36.7 bits (86), Expect = 8e-04
Identities = 14/64 (21%), Positives = 33/64 (51%)
Query: 43 FGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKE 102
+ KF K ++ KEE ++ E +++++ ++ +E E +K E ++KK+ +
Sbjct: 151 YEKFGFGILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDP 210
Query: 103 EEEK 106
+E
Sbjct: 211 KEGP 214
Score = 33.3 bits (77), Expect = 0.015
Identities = 15/60 (25%), Positives = 30/60 (50%), Gaps = 1/60 (1%)
Query: 43 FGKFLQVHFKELNTEEEEK-EEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKK 101
FG + + EE EK E KE+E+E+ ++ + +K E ++K++ + +E
Sbjct: 156 FGILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPV 215
Score = 32.9 bits (76), Expect = 0.017
Identities = 25/113 (22%), Positives = 51/113 (45%), Gaps = 18/113 (15%)
Query: 13 NLITEQILEASPLLESFGNAKTVRNDNSS----RFGKFLQVHFKELNTEEEEKE------ 62
N I E+ + SPL +S + V + + + + H K+ + + K+
Sbjct: 97 NEIIEKAKKNSPLFKSLLKKQKVEVEGNKLIIKVNNEIERDHLKKKHLPKLIKQYEKFGF 156
Query: 63 -------EGKEKEEEEQKQQQQQQQEKEEQ-QEKEEQQEKKEEEEKKEEEEKE 107
E + +EE +K + Q+++E E+ +E E +K E E+KK+ + +
Sbjct: 157 GILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFD 209
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 36.5 bits (85), Expect = 9e-04
Identities = 16/56 (28%), Positives = 30/56 (53%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+ TEE E+E E E + + +Q+QE +++ E+KE+ +K E+ +K
Sbjct: 228 ETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKL 283
Score = 31.9 bits (73), Expect = 0.038
Identities = 16/59 (27%), Positives = 29/59 (49%), Gaps = 4/59 (6%)
Query: 53 ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQ----EKEEQQEKKEEEEKKEEEEKE 107
E EEE + E E + +Q+Q+ EE E++E +K E+ +K E +++
Sbjct: 231 ETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEILKEK 289
Score = 29.6 bits (67), Expect = 0.29
Identities = 14/57 (24%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 52 KELNTEEEEKEEGKE-KEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+E + E E E K K+E+E ++ EE+++ ++ ++ + E KE++++E
Sbjct: 237 EETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEILKEKKDEE 293
Score = 29.2 bits (66), Expect = 0.40
Identities = 13/49 (26%), Positives = 26/49 (53%)
Query: 59 EEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+E E +E+EEE + + + K +QE+E E+ +E+E+ +
Sbjct: 227 KETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPD 275
Score = 26.5 bits (59), Expect = 2.8
Identities = 10/51 (19%), Positives = 28/51 (54%)
Query: 56 TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
+E + E +E+E + + + + + +++QE +++ E+KE+ +K
Sbjct: 226 LKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDK 276
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 36.4 bits (85), Expect = 0.001
Identities = 16/41 (39%), Positives = 26/41 (63%)
Query: 64 GKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
K+ +EE K+ Q + +E+EE++E++EEEE EEE
Sbjct: 280 DKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320
Score = 34.5 bits (80), Expect = 0.005
Identities = 13/41 (31%), Positives = 25/41 (60%)
Query: 63 EGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEE 103
+ +EE ++ Q Q ++E+EE++E++EEE +EE
Sbjct: 280 DKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320
Score = 32.1 bits (74), Expect = 0.028
Identities = 14/46 (30%), Positives = 26/46 (56%)
Query: 54 LNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEE 99
L +K+ E+ +E Q Q +EE++E+EE++E++ EE
Sbjct: 274 LAAALADKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEE 319
Score = 30.2 bits (69), Expect = 0.15
Identities = 14/41 (34%), Positives = 25/41 (60%)
Query: 59 EEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEE 99
++ +E +E Q Q E+EE++E+EE++E+ EEE
Sbjct: 280 DKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 35.8 bits (83), Expect = 0.002
Identities = 15/50 (30%), Positives = 37/50 (74%), Gaps = 2/50 (4%)
Query: 59 EEKEEGKE--KEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
E+K +G K +++++ Q+++++++ +++K ++ KK+EE+KK+E EK
Sbjct: 536 EDKPDGPSVWKLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEK 585
Score = 33.8 bits (78), Expect = 0.008
Identities = 11/38 (28%), Positives = 31/38 (81%)
Query: 70 EEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+++++ Q++++EKE +E++ ++ K++EEKK++E ++
Sbjct: 548 DDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEK 585
Score = 33.5 bits (77), Expect = 0.010
Identities = 12/41 (29%), Positives = 33/41 (80%)
Query: 58 EEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEE 98
++++E +EKEE+E ++Q++ ++ ++Q+EK++++ +K E+
Sbjct: 548 DDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588
Score = 31.5 bits (72), Expect = 0.049
Identities = 12/43 (27%), Positives = 33/43 (76%)
Query: 59 EEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKK 101
++KEE + ++EE++ ++Q++ K ++QE+++++E ++ E+ K
Sbjct: 548 DDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAK 590
Score = 31.2 bits (71), Expect = 0.070
Identities = 18/55 (32%), Positives = 36/55 (65%), Gaps = 2/55 (3%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
+EL E+EEKE KE++ ++ ++Q+++K+++ EK E+ + E K +E+K
Sbjct: 551 EELQREKEEKEA--LKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAEFFKRQEDK 603
Score = 29.6 bits (67), Expect = 0.25
Identities = 12/46 (26%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 51 FKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKE 96
+K + EE ++E KE++E ++Q++ ++ +K+E+++K+E ++ ++
Sbjct: 545 WKLDDKEELQRE--KEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588
Score = 28.8 bits (65), Expect = 0.43
Identities = 8/35 (22%), Positives = 26/35 (74%)
Query: 73 KQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
K +++ ++E+++++ +++K+ + KK+EE+K+
Sbjct: 546 KLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKK 580
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 34.7 bits (80), Expect = 0.003
Identities = 15/56 (26%), Positives = 37/56 (66%)
Query: 51 FKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
F+EL++ ++E+EE +E+ +E +Q+ + + E +++K+ + K + E++K + E
Sbjct: 88 FRELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEP 143
Score = 26.3 bits (58), Expect = 2.5
Identities = 14/49 (28%), Positives = 31/49 (63%)
Query: 53 ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKK 101
E TEEE +E ++++ E K + ++++++E + K E+++ K E +K
Sbjct: 98 EEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKP 146
>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3. This family of
proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). SH3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of SH3, AAPs are retained in the ER.
Length = 196
Score = 33.8 bits (78), Expect = 0.007
Identities = 10/39 (25%), Positives = 21/39 (53%), Gaps = 2/39 (5%)
Query: 71 EQKQQQQQQQ--EKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E K + ++ +E + ++E+ E EKKE ++K+
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 30.8 bits (70), Expect = 0.092
Identities = 10/40 (25%), Positives = 22/40 (55%)
Query: 56 TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKK 95
E ++ + +E EE + ++E+ + EK+E ++KK
Sbjct: 157 AEWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 29.6 bits (67), Expect = 0.23
Identities = 9/39 (23%), Positives = 22/39 (56%), Gaps = 3/39 (7%)
Query: 68 EEEEQKQQQQ---QQQEKEEQQEKEEQQEKKEEEEKKEE 103
E ++ K ++ ++ E ++EE E +++E KK++
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 28.1 bits (63), Expect = 0.66
Identities = 10/39 (25%), Positives = 22/39 (56%), Gaps = 1/39 (2%)
Query: 60 EKEEGKEKEE-EEQKQQQQQQQEKEEQQEKEEQQEKKEE 97
E ++ K EE ++ + ++EE E E+++ KK++
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 33.8 bits (78), Expect = 0.008
Identities = 18/60 (30%), Positives = 36/60 (60%)
Query: 48 QVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
++ +L+ E K + +EEEE+ + +++ +EE QEK+E+++K+E E K + E
Sbjct: 247 KLANLKLSPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPE 306
Score = 32.2 bits (74), Expect = 0.029
Identities = 20/57 (35%), Positives = 38/57 (66%), Gaps = 7/57 (12%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKE-------EQQEKKEEEEKKEEEEK 106
EEEE++ K EEE Q++ Q++++EK++++ + E+Q K EE+E+K++ K
Sbjct: 266 EEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 33.7 bits (77), Expect = 0.010
Identities = 24/86 (27%), Positives = 47/86 (54%), Gaps = 14/86 (16%)
Query: 29 FGNAKTV-RNDNSSRFGKFLQVH-------FKELNTEEEEKEEGK-EKEEEEQKQQQQQQ 79
F N + V N++ F +++ F++ EE+ +E K KEE E+++Q ++Q
Sbjct: 218 FFNGQNVYDNNDHKDF--CVEIEGGMDEHSFEDFLLEEKRRELEKLAKEEAERERQAEEQ 275
Query: 80 QEKEEQQ---EKEEQQEKKEEEEKKE 102
+ +EE++ E + Q K E E+++E
Sbjct: 276 RRREEEKAAMEADRAQAKAEVEKRRE 301
Score = 27.9 bits (62), Expect = 1.0
Identities = 10/38 (26%), Positives = 27/38 (71%)
Query: 69 EEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
EE++++ ++ +E+ E++ + E+Q ++EEE+ E ++
Sbjct: 252 EEKRRELEKLAKEEAERERQAEEQRRREEEKAAMEADR 289
Score = 25.2 bits (55), Expect = 8.5
Identities = 11/41 (26%), Positives = 24/41 (58%), Gaps = 2/41 (4%)
Query: 68 EEEEQ--KQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
EEE Q ++ Q++++ +EE + ++ + + E KE+ K
Sbjct: 430 EEEHQIYRKLQEERRLREEAIRAKAEKTARMKAEMKEKTLK 470
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 33.5 bits (76), Expect = 0.011
Identities = 19/56 (33%), Positives = 36/56 (64%), Gaps = 1/56 (1%)
Query: 46 FLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKK 101
F + +L + EE E K K E EQK ++++++EKE+++E+E ++E++ E K
Sbjct: 569 FTPLASSKLAKKREEAVE-KAKREAEQKAREEREREKEKEKEREREREREAERAAK 623
Score = 33.1 bits (75), Expect = 0.019
Identities = 14/42 (33%), Positives = 32/42 (76%)
Query: 65 KEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
K++EE +K +++ +Q+ E++E+E+++EK+ E E++ E E+
Sbjct: 579 KKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAER 620
Score = 30.4 bits (68), Expect = 0.13
Identities = 13/41 (31%), Positives = 30/41 (73%)
Query: 67 KEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
K+ EE ++ +++ E++ ++E+E ++EK++E E++ E E E
Sbjct: 579 KKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAE 619
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 33.5 bits (77), Expect = 0.012
Identities = 20/59 (33%), Positives = 32/59 (54%), Gaps = 6/59 (10%)
Query: 55 NTEEEEKEEGKEKEEEEQKQQQQQQQEKE--EQQEKEE-QQEKKE---EEEKKEEEEKE 107
T E ++ E EK + +QQQ ++E++ EK + QQE K EE+ +E E+E
Sbjct: 652 ETRESQQAEVTEKARTQDEQQQAPRRERQRRRNDEKRQAQQEAKALNVEEQSVQETEQE 710
Score = 26.9 bits (60), Expect = 2.6
Identities = 14/53 (26%), Positives = 28/53 (52%)
Query: 55 NTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
N E E +E+ ++Q QQQ E E Q+ E ++ + ++E+++ +E
Sbjct: 626 NRTRREGRENREENRRNRRQAQQQTAETRESQQAEVTEKARTQDEQQQAPRRE 678
Score = 26.2 bits (58), Expect = 4.0
Identities = 16/65 (24%), Positives = 32/65 (49%), Gaps = 3/65 (4%)
Query: 41 SRFGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEK 100
SRF L+ F + EE K + + + E K ++QQ + K Q + ++ E+++ +
Sbjct: 570 SRFFGALKALF---SGGEETKPQEQPAPKAEAKPERQQDRRKPRQNNRRDRNERRDTRDN 626
Query: 101 KEEEE 105
+ E
Sbjct: 627 RTRRE 631
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of
the RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 32.8 bits (75), Expect = 0.018
Identities = 13/56 (23%), Positives = 26/56 (46%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E EEEE+ + +EEE + +++ Q +E+ + + E K ++E
Sbjct: 371 DEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESAS 426
Score = 28.9 bits (65), Expect = 0.46
Identities = 10/52 (19%), Positives = 28/52 (53%)
Query: 56 TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+E+++E +E+ +E ++++ + E+E Q +E+ + + + E K
Sbjct: 369 EVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKA 420
Score = 28.5 bits (64), Expect = 0.65
Identities = 9/71 (12%), Positives = 32/71 (45%)
Query: 37 NDNSSRFGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKE 96
+ S+ + ++ EE +++E +E+E+ + ++++ ++ EE+ + + E
Sbjct: 348 STKESKMRDKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSE 407
Query: 97 EEEKKEEEEKE 107
+ +
Sbjct: 408 SSSDVGSDSES 418
Score = 26.6 bits (59), Expect = 3.0
Identities = 11/53 (20%), Positives = 22/53 (41%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
++ + E EE+E +EE Q ++ + + E + KE + E
Sbjct: 379 EQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSDSE 431
>gnl|CDD|238682 cd01386, MYSc_type_XVIII, Myosin motor domain, type XVIII myosins.
This catalytic (head) domain has ATPase activity and
belongs to the larger group of P-loop NTPases. Myosins
are actin-dependent molecular motors that play important
roles in muscle contraction, cell motility, and
organelle transport. The head domain is a molecular
motor, which utilizes ATP hydrolysis to generate
directed movement toward the plus end along actin
filaments. A cyclical interaction between myosin and
actin provides the driving force. Rates of ATP
hydrolysis and consequently the speed of movement along
actin filaments vary widely, from about 0.04 micrometer
per second for myosin I to 4.5 micrometer per second for
myosin II in skeletal muscle. Myosin II moves in
discrete steps about 5-10 nm long and generates 1-5
piconewtons of force. Upon ATP binding, the myosin head
dissociates from an actin filament. ATP hydrolysis
causes the head to pivot and associate with a new actin
subunit. The release of Pi causes the head to pivot and
move the filament (power stroke). Release of ADP
completes the cycle.
Length = 767
Score = 32.5 bits (74), Expect = 0.027
Identities = 17/52 (32%), Positives = 29/52 (55%), Gaps = 1/52 (1%)
Query: 1 MQYLAAVNKSPSNLIT-EQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHF 51
++YLA S ++ E++ +LE+FGN T N N++RF + L + F
Sbjct: 107 LEYLALAAGSVDGRVSVEKVRALFTILEAFGNVSTALNGNATRFTQILSLDF 158
>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3. This family
of proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). Shr3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of Shr3, AAPs are retained in the ER.
Length = 196
Score = 32.0 bits (73), Expect = 0.027
Identities = 8/39 (20%), Positives = 21/39 (53%), Gaps = 3/39 (7%)
Query: 72 QKQQQQQQQEKEEQQEKEE---QQEKKEEEEKKEEEEKE 107
+++ +Q++E ++ KE +K +K E ++K+
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
Score = 30.0 bits (68), Expect = 0.13
Identities = 10/37 (27%), Positives = 19/37 (51%)
Query: 70 EEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
E + +Q+++ EE++E KK +K E +K
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKK 194
Score = 27.7 bits (62), Expect = 0.96
Identities = 11/44 (25%), Positives = 23/44 (52%), Gaps = 7/44 (15%)
Query: 60 EKEEGKEKEE--EEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKK 101
E+++ K+KEE E++ ++ +K +K E ++KK
Sbjct: 158 ERKDAKQKEEFAAEER-----KEALAAAAKKSATPQKVETKKKK 196
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 32.5 bits (74), Expect = 0.029
Identities = 14/54 (25%), Positives = 33/54 (61%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
K+ + EE ++ + E+E KQ ++++ +EQ+++ E+ K+ ++K+ EE
Sbjct: 83 KKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEE 136
Score = 31.3 bits (71), Expect = 0.072
Identities = 10/48 (20%), Positives = 29/48 (60%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
+++ E+ + K+ E+++ Q+Q+++ E+ K+ ++K+ EE +
Sbjct: 94 QKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKA 141
Score = 30.5 bits (69), Expect = 0.12
Identities = 15/55 (27%), Positives = 39/55 (70%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
++ N ++++++ K EE+ +K++QQQ +E +++Q E+++ K+ E+E+ +E+
Sbjct: 62 EQYNRQQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQ 116
Score = 28.6 bits (64), Expect = 0.48
Identities = 15/51 (29%), Positives = 34/51 (66%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E+ +K+E ++ EE +QKQ +Q++ K+ ++E+ QE+K++ E+ ++
Sbjct: 79 EQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAAL 129
Score = 28.6 bits (64), Expect = 0.53
Identities = 14/47 (29%), Positives = 34/47 (72%)
Query: 61 KEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
EE ++K+E++Q ++ QQ+Q E+++ K+ ++E+ +E+K++ E+
Sbjct: 77 AEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEA 123
Score = 27.1 bits (60), Expect = 2.0
Identities = 13/50 (26%), Positives = 35/50 (70%)
Query: 58 EEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
EE++++ ++++ EE +Q+Q +QE+ +Q EKE ++++++ +E ++
Sbjct: 78 EEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQA 127
Score = 26.7 bits (59), Expect = 2.5
Identities = 10/51 (19%), Positives = 26/51 (50%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
++ E+ + +E++KQ ++ ++ +Q++ E+ K K + E E
Sbjct: 102 RLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAE 152
>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
Length = 746
Score = 32.4 bits (74), Expect = 0.033
Identities = 16/59 (27%), Positives = 24/59 (40%), Gaps = 6/59 (10%)
Query: 49 VHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+ KE+N +EEK G K KEEQ+ + E ++ E +KE
Sbjct: 211 IKSKEININKEEKNNGSNVNNNGNKN------NKEEQKGNDLSNELEDISLGPLEYDKE 263
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 32.2 bits (74), Expect = 0.034
Identities = 17/61 (27%), Positives = 35/61 (57%)
Query: 46 FLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
+L++H K L E EE+E ++++ +++Q K E++ ++E+ EK ++K E
Sbjct: 384 YLKLHDKPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAA 443
Query: 106 K 106
K
Sbjct: 444 K 444
Score = 30.7 bits (70), Expect = 0.10
Identities = 16/55 (29%), Positives = 28/55 (50%), Gaps = 4/55 (7%)
Query: 57 EEEEKEEGKEKEEEE----QKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
EEEE E G E +KQ++ +++ ++E+ EK ++K E KK +
Sbjct: 397 EEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDG 451
Score = 28.0 bits (63), Expect = 1.0
Identities = 10/39 (25%), Positives = 20/39 (51%)
Query: 61 KEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEE 99
K E K ++EE +K +++ E ++ K E K+ +
Sbjct: 420 KAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDP 458
Score = 25.7 bits (57), Expect = 6.1
Identities = 7/39 (17%), Positives = 19/39 (48%)
Query: 59 EEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEE 97
EK+ KE+ E+ +++ + K+ + E ++ +
Sbjct: 421 AEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPD 459
>gnl|CDD|173487 PTZ00236, PTZ00236, mitochondrial import inner membrane translocase
subunit tim17; Provisional.
Length = 164
Score = 31.6 bits (72), Expect = 0.037
Identities = 9/25 (36%), Positives = 17/25 (68%)
Query: 68 EEEEQKQQQQQQQEKEEQQEKEEQQ 92
Q+ QQQQ+ EK+ ++E +++Q
Sbjct: 139 RTPRQQFQQQQEMEKQLEEEDKKEQ 163
Score = 30.0 bits (68), Expect = 0.14
Identities = 12/29 (41%), Positives = 18/29 (62%), Gaps = 4/29 (13%)
Query: 70 EEQKQQQQQQQEKEEQQEKEEQQEKKEEE 98
+QQ QQQQE EK+ ++E K+E+
Sbjct: 139 RTPRQQFQQQQE----MEKQLEEEDKKEQ 163
Score = 28.5 bits (64), Expect = 0.48
Identities = 9/30 (30%), Positives = 21/30 (70%)
Query: 75 QQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
++Q + +Q +++++ EK+ EEE K+E+
Sbjct: 134 NRRQMRTPRQQFQQQQEMEKQLEEEDKKEQ 163
Score = 28.5 bits (64), Expect = 0.57
Identities = 9/25 (36%), Positives = 19/25 (76%)
Query: 67 KEEEEQKQQQQQQQEKEEQQEKEEQ 91
+ +Q QQQQ+ +++ E+++K+EQ
Sbjct: 139 RTPRQQFQQQQEMEKQLEEEDKKEQ 163
Score = 27.7 bits (62), Expect = 0.83
Identities = 9/30 (30%), Positives = 23/30 (76%)
Query: 74 QQQQQQQEKEEQQEKEEQQEKKEEEEKKEE 103
++Q + +++ Q+++E +++ EEE+KKE+
Sbjct: 134 NRRQMRTPRQQFQQQQEMEKQLEEEDKKEQ 163
Score = 27.3 bits (61), Expect = 1.3
Identities = 8/26 (30%), Positives = 20/26 (76%)
Query: 72 QKQQQQQQQEKEEQQEKEEQQEKKEE 97
+ +QQ Q+++E +++ E+++KKE+
Sbjct: 138 MRTPRQQFQQQQEMEKQLEEEDKKEQ 163
Score = 26.5 bits (59), Expect = 2.3
Identities = 9/29 (31%), Positives = 20/29 (68%)
Query: 67 KEEEEQKQQQQQQQEKEEQQEKEEQQEKK 95
+ + +QQ QQQ++ E+Q +EE ++++
Sbjct: 135 RRQMRTPRQQFQQQQEMEKQLEEEDKKEQ 163
Score = 26.5 bits (59), Expect = 2.6
Identities = 8/30 (26%), Positives = 20/30 (66%)
Query: 76 QQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
++Q QQ +++Q+ +K+ EE+ ++E+
Sbjct: 134 NRRQMRTPRQQFQQQQEMEKQLEEEDKKEQ 163
Score = 26.2 bits (58), Expect = 3.1
Identities = 9/28 (32%), Positives = 22/28 (78%)
Query: 73 KQQQQQQQEKEEQQEKEEQQEKKEEEEK 100
+Q + +Q+ ++QQE E+Q E+++++E+
Sbjct: 136 RQMRTPRQQFQQQQEMEKQLEEEDKKEQ 163
Score = 25.0 bits (55), Expect = 8.6
Identities = 6/29 (20%), Positives = 20/29 (68%)
Query: 61 KEEGKEKEEEEQKQQQQQQQEKEEQQEKE 89
+ + + ++ Q+QQ+ ++Q +EE ++++
Sbjct: 135 RRQMRTPRQQFQQQQEMEKQLEEEDKKEQ 163
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 32.1 bits (73), Expect = 0.040
Identities = 15/51 (29%), Positives = 31/51 (60%)
Query: 55 NTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
EKE ++ E E+ ++Q++ +KE +QE EE +E++ ++ + E+E
Sbjct: 519 KLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKE 569
Score = 28.2 bits (63), Expect = 0.91
Identities = 13/45 (28%), Positives = 31/45 (68%)
Query: 58 EEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKE 102
E++ E ++ +E++K +++ +QE EE +E+E ++ + E+E +E
Sbjct: 528 EQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQE 572
Score = 25.9 bits (57), Expect = 4.7
Identities = 15/52 (28%), Positives = 33/52 (63%)
Query: 55 NTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
+ E+ KE+ K K+E EQ+ ++ +++E+ ++ E E++ ++ + KKE E
Sbjct: 533 HLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKEVESI 584
>gnl|CDD|233199 TIGR00939, 2a57, Equilibrative Nucleoside Transporter (ENT).
[Transport and binding proteins, Nucleosides, purines
and pyrimidines].
Length = 437
Score = 32.0 bits (73), Expect = 0.041
Identities = 14/66 (21%), Positives = 26/66 (39%), Gaps = 4/66 (6%)
Query: 39 NSSRFGKFLQVHF-KELNT---EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEK 94
F + + K+L+ E+E K E + K E+ Q + E++ E
Sbjct: 193 LLLPKLPFARYYLQKKLDKGAGEDETKGELRSKAEQNGIPHGGDQPSPTLVLDWEKEPES 252
Query: 95 KEEEEK 100
+E +K
Sbjct: 253 PDEPQK 258
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 31.8 bits (72), Expect = 0.045
Identities = 20/51 (39%), Positives = 40/51 (78%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+E E+EE E+EEEE++++++++QE EE++ ++E++E++ E + EEE E
Sbjct: 437 QESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEME 487
Score = 31.4 bits (71), Expect = 0.073
Identities = 18/55 (32%), Positives = 37/55 (67%)
Query: 50 HFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
+EEEE E +E+EEEE+++++Q+ +E+E + E+EE++ + + ++E E
Sbjct: 433 SMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEME 487
Score = 28.0 bits (62), Expect = 0.95
Identities = 20/51 (39%), Positives = 35/51 (68%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
EEE EE +E+EEEE++++Q+ ++E+ E +E+EE+ E E++ E E
Sbjct: 441 EEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSE 491
Score = 28.0 bits (62), Expect = 1.0
Identities = 17/52 (32%), Positives = 36/52 (69%)
Query: 56 TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E E+EE +E+EEEE++Q+ ++++ ++E++E+E + + EEE + E +
Sbjct: 442 EESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGD 493
Score = 27.2 bits (60), Expect = 1.7
Identities = 16/55 (29%), Positives = 27/55 (49%)
Query: 53 ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E EE+E EE + ++EEE+++ + +EE + E EE E+ E
Sbjct: 454 EEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNS 508
Score = 26.4 bits (58), Expect = 3.3
Identities = 16/51 (31%), Positives = 34/51 (66%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E +E ++++ ++E+EE++E+EE++++ EEEE ++EEE+E
Sbjct: 424 ASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEE 474
Score = 26.0 bits (57), Expect = 4.1
Identities = 17/41 (41%), Positives = 33/41 (80%)
Query: 67 KEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+E ++++ +++E+EE++E+EE+QE +EEE + EEEE+E
Sbjct: 435 ASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEE 475
Score = 26.0 bits (57), Expect = 4.8
Identities = 17/51 (33%), Positives = 26/51 (50%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E+E +EE E EEEE++ + E+E + E + +E EE E E
Sbjct: 459 EQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSE 509
Score = 25.6 bits (56), Expect = 5.9
Identities = 15/53 (28%), Positives = 30/53 (56%)
Query: 53 ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
E EEE++ E +E E+EE++++ + EE+ E + + EE +++ E
Sbjct: 453 EEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAER 505
Score = 25.3 bits (55), Expect = 8.7
Identities = 20/54 (37%), Positives = 37/54 (68%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
+E + EEE+EE +E+EEEEQ+ ++++ +++EE++E E +EE E E +
Sbjct: 440 EEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGD 493
Score = 25.3 bits (55), Expect = 8.8
Identities = 16/43 (37%), Positives = 35/43 (81%)
Query: 65 KEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+E EE++ +++++E+EE++E+E++ E++E E+++EEEE E
Sbjct: 435 ASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVE 477
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 31.7 bits (73), Expect = 0.050
Identities = 15/49 (30%), Positives = 34/49 (69%)
Query: 58 EEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
EE + E ++K EE + ++ ++ KEE +EK+E+ +++E++ +E E++
Sbjct: 526 EELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKE 574
Score = 30.2 bits (69), Expect = 0.15
Identities = 17/55 (30%), Positives = 35/55 (63%)
Query: 53 ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
EL E E+K E E +E ++ +++ +EK+E+ ++EE + +E E++ ++ KE
Sbjct: 527 ELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKE 581
Score = 30.2 bits (69), Expect = 0.17
Identities = 19/54 (35%), Positives = 33/54 (61%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
K E KE K KEE E+K+++ Q++E + +E E++ ++ +E KKE +E
Sbjct: 535 KAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADE 588
Score = 27.5 bits (62), Expect = 1.4
Identities = 11/37 (29%), Positives = 26/37 (70%)
Query: 71 EQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E+ +++ +Q+ +E + +E ++ KEE E+K+E+ +E
Sbjct: 526 EELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQE 562
Score = 27.5 bits (62), Expect = 1.5
Identities = 15/57 (26%), Positives = 36/57 (63%), Gaps = 1/57 (1%)
Query: 52 KELNTEEEEKE-EGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E E++ +E E KE E+ K++ ++++EK +++E + +E ++E ++ +E K+
Sbjct: 528 LERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKK 584
Score = 26.7 bits (60), Expect = 2.6
Identities = 14/59 (23%), Positives = 36/59 (61%), Gaps = 3/59 (5%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQE---KEEQQEKEEQQEKKEEEEKKEEEEKE 107
K + ++E+ E EE +++ +Q+ +E ++ EK +++ ++++E+ +EEE+K
Sbjct: 509 KLIGEDKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKL 567
Score = 25.6 bits (57), Expect = 6.1
Identities = 17/51 (33%), Positives = 31/51 (60%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
EE +K G++KE+ + ++ E+E +Q+ EE + +E EK +EE +E
Sbjct: 505 EEAKKLIGEDKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEE 555
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 31.3 bits (71), Expect = 0.052
Identities = 15/51 (29%), Positives = 28/51 (54%), Gaps = 2/51 (3%)
Query: 51 FKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKK 101
E+ + +++ EK + K Q ++ +K QEK+E EKK+ ++KK
Sbjct: 29 NPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKE--EKKKPKKKK 77
>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
Length = 746
Score = 31.7 bits (73), Expect = 0.052
Identities = 15/58 (25%), Positives = 25/58 (43%)
Query: 50 HFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
H + EE KEE K + EEE K + ++ E ++E + + E+ E
Sbjct: 34 HSSTVEEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAA 91
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 31.8 bits (72), Expect = 0.053
Identities = 14/56 (25%), Positives = 32/56 (57%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+EL +++++E+ KEKE ++ K Q++ + K + Q+ + ++ EKK +
Sbjct: 16 EELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRKRDV 71
Score = 31.0 bits (70), Expect = 0.084
Identities = 11/48 (22%), Positives = 26/48 (54%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
+E +K + +KE + + Q QQ ++ E++ K++ E++ E+
Sbjct: 31 KELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRKRDVEDENPED 78
Score = 29.9 bits (67), Expect = 0.26
Identities = 14/47 (29%), Positives = 30/47 (63%)
Query: 56 TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKE 102
E EK+ E+E E +K+++++ +EKE ++ K Q+E K + + ++
Sbjct: 5 ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQ 51
Score = 28.3 bits (63), Expect = 0.80
Identities = 12/31 (38%), Positives = 23/31 (74%)
Query: 77 QQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+ + EK+ E+E +++KK+EE+ KE+E K+
Sbjct: 5 ESEAEKKILTEEELERKKKKEEKAKEKELKK 35
Score = 27.2 bits (60), Expect = 2.1
Identities = 10/31 (32%), Positives = 23/31 (74%)
Query: 76 QQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
+ + ++K +E+ E+++KKEE+ K++E +K
Sbjct: 5 ESEAEKKILTEEELERKKKKEEKAKEKELKK 35
Score = 27.2 bits (60), Expect = 2.1
Identities = 10/38 (26%), Positives = 27/38 (71%)
Query: 70 EEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E + +++ +E+ E+++K+E++ K++E +K + +KE
Sbjct: 5 ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKE 42
Score = 26.0 bits (57), Expect = 5.1
Identities = 16/58 (27%), Positives = 31/58 (53%), Gaps = 3/58 (5%)
Query: 53 ELNTEEEEKEEGKEKEEEEQKQQQQQQQ---EKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E EE+ E K+K+EE+ K+++ ++ +KE + + + QQ K+ E+K
Sbjct: 9 EKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKS 66
Score = 25.6 bits (56), Expect = 7.4
Identities = 10/50 (20%), Positives = 32/50 (64%)
Query: 58 EEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E E E+ EEE +++++++++ KE++ +K + +K+ + + + ++ +
Sbjct: 5 ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASD 54
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 31.7 bits (73), Expect = 0.055
Identities = 15/48 (31%), Positives = 35/48 (72%)
Query: 58 EEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
+ + E +++EEE +K++++ +Q+++E ++KEE+ E+ EE+ +E E
Sbjct: 99 DRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELER 146
Score = 31.3 bits (72), Expect = 0.073
Identities = 20/63 (31%), Positives = 35/63 (55%)
Query: 45 KFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
L+ + E ++E +E+ E QK +++ Q++E K E EK+EEE +K+E+
Sbjct: 58 ALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEK 117
Query: 105 EKE 107
E E
Sbjct: 118 ELE 120
Score = 31.3 bits (72), Expect = 0.074
Identities = 17/60 (28%), Positives = 41/60 (68%), Gaps = 2/60 (3%)
Query: 50 HFKELNTEEEEKEEGKEKEEE--EQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
++L +KEE +++ E E+++++ +++EKE +Q+++E ++K+EE E+ EE+ +
Sbjct: 83 ELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQ 142
Score = 25.9 bits (58), Expect = 4.2
Identities = 13/56 (23%), Positives = 38/56 (67%)
Query: 51 FKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
+E E ++ E+ ++EE ++ + +++EE+ EK+E++ +++++E +++EE+
Sbjct: 77 LRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEE 132
Score = 25.9 bits (58), Expect = 5.1
Identities = 18/60 (30%), Positives = 29/60 (48%), Gaps = 11/60 (18%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQ-----------QEKEEQQEKKEEEEKKEEEE 105
E +KE E +EE K + + ++E E+ +KEE ++K E +K EEE
Sbjct: 52 EAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEE 111
>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
protein TraG; Provisional.
Length = 942
Score = 31.3 bits (71), Expect = 0.084
Identities = 13/56 (23%), Positives = 26/56 (46%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E NT + + E + + E Q + E++ QE+ + EE K+ +E ++
Sbjct: 883 GEENTVKGQYSELQNHHKTEALSQNNKYNEEKSAQERMPGADSPEELMKRAKEYQD 938
Score = 30.5 bits (69), Expect = 0.14
Identities = 11/80 (13%), Positives = 27/80 (33%), Gaps = 10/80 (12%)
Query: 26 LESFGNAKTVRNDNSSRFGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQ 85
+E +RND V + N E + + + + ++ + + E
Sbjct: 846 IEQRTQDSGIRND----------VKHQVDNMVTEYEGNIGDTQNSIRGEENTVKGQYSEL 895
Query: 86 QEKEEQQEKKEEEEKKEEEE 105
Q + + + + EE+
Sbjct: 896 QNHHKTEALSQNNKYNEEKS 915
Score = 30.5 bits (69), Expect = 0.14
Identities = 16/66 (24%), Positives = 26/66 (39%), Gaps = 8/66 (12%)
Query: 30 GNAKTVRNDNSSRFGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKE 89
G TV+ S LQ H K TE + +E+ Q++ +E + +
Sbjct: 883 GEENTVKGQYSE-----LQNHHK---TEALSQNNKYNEEKSAQERMPGADSPEELMKRAK 934
Query: 90 EQQEKK 95
E Q+K
Sbjct: 935 EYQDKH 940
Score = 28.2 bits (63), Expect = 0.81
Identities = 8/51 (15%), Positives = 19/51 (37%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E E G + ++ + Q E Q + + + + +E+ +E
Sbjct: 868 TEYEGNIGDTQNSIRGEENTVKGQYSELQNHHKTEALSQNNKYNEEKSAQE 918
Score = 28.2 bits (63), Expect = 0.90
Identities = 9/49 (18%), Positives = 17/49 (34%)
Query: 55 NTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEE 103
+ + EE + Q + + + E Q K EE+ +E
Sbjct: 870 YEGNIGDTQNSIRGEENTVKGQYSELQNHHKTEALSQNNKYNEEKSAQE 918
Score = 26.6 bits (59), Expect = 2.9
Identities = 7/45 (15%), Positives = 19/45 (42%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKK 101
+ K E + + +++ Q++ +E + KE ++K
Sbjct: 896 QNHHKTEALSQNNKYNEEKSAQERMPGADSPEELMKRAKEYQDKH 940
>gnl|CDD|162338 TIGR01398, FlhA, flagellar biosynthesis protein FlhA. This model
describes flagellar biosynthesis protein FlhA, one of a
large number of genes associated with the biosynthesis
of functional bacterial flagella. Homologs of many such
proteins, including FlhA, function in type III protein
secretion systems. A separate model describes InvA
(Salmonella enterica), LcrD (Yersinia enterocolitica),
HrcV (Xanthomonas), etc., all of which score below the
noise cutoff for this model [Cellular processes,
Chemotaxis and motility].
Length = 678
Score = 31.1 bits (71), Expect = 0.087
Identities = 9/31 (29%), Positives = 19/31 (61%)
Query: 75 QQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
+++ +QE+E E + QE+ EEE++ +
Sbjct: 310 RRRSKQEEEAAAEAAKAQEEAAEEEEESIND 340
Score = 28.8 bits (65), Expect = 0.44
Identities = 9/26 (34%), Positives = 16/26 (61%)
Query: 82 KEEQQEKEEQQEKKEEEEKKEEEEKE 107
+ +QE+E E + +E+ EEE+E
Sbjct: 311 RRSKQEEEAAAEAAKAQEEAAEEEEE 336
Score = 28.4 bits (64), Expect = 0.66
Identities = 6/31 (19%), Positives = 19/31 (61%)
Query: 72 QKQQQQQQQEKEEQQEKEEQQEKKEEEEKKE 102
+++ +Q+++ E + +E+ ++EEE +
Sbjct: 310 RRRSKQEEEAAAEAAKAQEEAAEEEEESIND 340
Score = 28.0 bits (63), Expect = 1.1
Identities = 8/33 (24%), Positives = 19/33 (57%)
Query: 72 QKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
+++ +Q+E+ + + Q+E EEEE+ +
Sbjct: 308 YLRRRSKQEEEAAAEAAKAQEEAAEEEEESIND 340
Score = 27.2 bits (61), Expect = 1.8
Identities = 7/31 (22%), Positives = 19/31 (61%)
Query: 77 QQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+++ ++EE+ E + ++E E++EE +
Sbjct: 310 RRRSKQEEEAAAEAAKAQEEAAEEEEESIND 340
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 31.0 bits (71), Expect = 0.088
Identities = 14/41 (34%), Positives = 27/41 (65%)
Query: 59 EEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEE 99
++KEE EK+E+E +++ EKEE+ E+ ++++E E
Sbjct: 100 DKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELER 140
Score = 29.1 bits (66), Expect = 0.37
Identities = 13/48 (27%), Positives = 29/48 (60%)
Query: 58 EEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
+ + E +KEE +K++++ +++ EKEE+ E+ E+++E E
Sbjct: 93 DRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELER 140
Score = 28.7 bits (65), Expect = 0.52
Identities = 17/48 (35%), Positives = 28/48 (58%)
Query: 58 EEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
E + E KE+ E Q+ +++ Q +E K E +KKEE +K+E+E
Sbjct: 65 AELERELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKE 112
>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32. This family
consists of several mammalian specific proacrosin
binding protein sp32 sequences. sp32 is a sperm specific
protein which is known to bind with with 55- and 53-kDa
proacrosins and the 49-kDa acrosin intermediate. The
exact function of sp32 is unclear, it is thought however
that the binding of sp32 to proacrosin may be involved
in packaging the acrosin zymogen into the acrosomal
matrix.
Length = 243
Score = 30.8 bits (69), Expect = 0.099
Identities = 18/60 (30%), Positives = 36/60 (60%)
Query: 45 KFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
+ LQ + + + + K+++ + Q+ Q+ + EE+Q +EEQ+E++ EEE K+EE
Sbjct: 178 ELLQSSLSLGGSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEE 237
Score = 26.9 bits (59), Expect = 2.1
Identities = 12/47 (25%), Positives = 28/47 (59%)
Query: 61 KEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
G + + + +Q+Q + +E ++ + +EK+ +EE++EEE +E
Sbjct: 185 SLGGSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEE 231
Score = 25.8 bits (56), Expect = 5.2
Identities = 12/45 (26%), Positives = 31/45 (68%)
Query: 60 EKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
+++ + +E Q+ + +++Q +EEQ+E+E ++E K+EE + ++
Sbjct: 199 QEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQGTDD 243
Score = 25.4 bits (55), Expect = 7.6
Identities = 12/54 (22%), Positives = 34/54 (62%)
Query: 54 LNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
L+ + + + ++E+ + Q+ ++ + +EK+ Q+E++EEE ++E +++E
Sbjct: 184 LSLGGSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEE 237
>gnl|CDD|221775 pfam12794, MscS_TM, Mechanosensitive ion channel inner membrane
domain 1. The small mechanosensitive channel, MscS, is
a part of the turgor-driven solute efflux system that
protects bacteria from lysis in the event of osmotic
shock. The MscS protein alone is sufficient to form a
functional mechanosensitive channel gated directly by
tension in the lipid bilayer. The MscS proteins are
heptamers of three transmembrane subunits with seven
converging M3 domains, and this domain is one of the
inner membrane domains.
Length = 339
Score = 30.7 bits (70), Expect = 0.10
Identities = 8/29 (27%), Positives = 15/29 (51%)
Query: 71 EQKQQQQQQQEKEEQQEKEEQQEKKEEEE 99
++ + Q+ +EE++ E E EE E
Sbjct: 265 AKRAEILAQRAEEEEESSEGAAETIEEPE 293
Score = 29.1 bits (66), Expect = 0.39
Identities = 9/32 (28%), Positives = 16/32 (50%), Gaps = 2/32 (6%)
Query: 76 QQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+ ++ E Q+ +EE E+ E + EE E
Sbjct: 264 KAKRAEILAQRAEEE--EESSEGAAETIEEPE 293
>gnl|CDD|218311 pfam04888, SseC, Secretion system effector C (SseC) like family.
SseC is a secreted protein that forms a complex together
with SecB and SecD on the surface of Salmonella. All
these proteins are secreted by the type III secretion
system. Many mucosal pathogens use type III secretion
systems for the injection of effector proteins into
target cells. SecB, SseC and SecD are inserted into the
target cell membrane. where they form a small pore or
translocon. In addition to SseC, this family includes
the bacterial secreted proteins PopB, PepB, YopB and
EspD which are thought to be directly involved in pore
formation, and type III secretion system translocon.
Length = 303
Score = 30.1 bits (68), Expect = 0.15
Identities = 14/41 (34%), Positives = 29/41 (70%)
Query: 66 EKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
EK+ + + QQ ++ ++K+E++ +E Q++ K+ EK EE +K
Sbjct: 14 EKQAKSKLQQLERARDKQEKKAEEYQEQIKKAIEKAEEAKK 54
Score = 28.6 bits (64), Expect = 0.60
Identities = 11/41 (26%), Positives = 26/41 (63%)
Query: 67 KEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
K E+Q + + QQ E+ +++++ +E +E+ +K E+ +E
Sbjct: 11 KLAEKQAKSKLQQLERARDKQEKKAEEYQEQIKKAIEKAEE 51
Score = 27.4 bits (61), Expect = 1.2
Identities = 14/44 (31%), Positives = 29/44 (65%), Gaps = 1/44 (2%)
Query: 64 GKEKEEEEQKQQQQQQQEKEEQQEK-EEQQEKKEEEEKKEEEEK 106
K E++ + + QQ ++ +++Q++K EE QE+ ++ +K EE K
Sbjct: 10 SKLAEKQAKSKLQQLERARDKQEKKAEEYQEQIKKAIEKAEEAK 53
Score = 27.4 bits (61), Expect = 1.4
Identities = 9/35 (25%), Positives = 23/35 (65%)
Query: 67 KEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKK 101
++ E+ + +Q+++ +E Q++ ++ EK EE +K
Sbjct: 21 LQQLERARDKQEKKAEEYQEQIKKAIEKAEEAKKS 55
Score = 27.4 bits (61), Expect = 1.6
Identities = 12/37 (32%), Positives = 25/37 (67%)
Query: 65 KEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKK 101
K K ++ ++ + +Q+++ EE QE+ ++ +K EE KK
Sbjct: 18 KSKLQQLERARDKQEKKAEEYQEQIKKAIEKAEEAKK 54
Score = 25.9 bits (57), Expect = 5.2
Identities = 7/40 (17%), Positives = 24/40 (60%)
Query: 56 TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKK 95
E++ + ++ E + +Q+++ +E +EQ +K ++ ++
Sbjct: 12 LAEKQAKSKLQQLERARDKQEKKAEEYQEQIKKAIEKAEE 51
>gnl|CDD|212055 cd11486, SLC5sbd_SGLT1, Na(+)/glucose cotransporter SGLT1;solute
binding domain. Human SGLT1 (hSGLT1) is a
high-affinity/low-capacity glucose transporter, which
can also transport galactose. In the transport
mechanism, two Na+ ions first bind to the extracellular
side of the transporter and induce a conformational
change in the glucose binding site. This results in an
increased affinity for glucose. A second conformational
change in the transporter follows, bringing the Na+ and
glucose binding sites to the inner surface of the
membrane. Glucose is then released, followed by the Na+
ions. In the process, hSGLT1 is also able to transport
water and urea and may be a major pathway for transport
of these across the intestinal brush-border membrane.
hSGLT1 is encoded by the SLC5A1 gene and expressed
mostly in the intestine, but also in the trachea,
kidney, heart, brain, testis, and prostate. The
WHO/UNICEF oral rehydration solution (ORS) for the
treatment of secretory diarrhea contains salt and
glucose. The glucose, along with sodium ions, is
transported by hSGLT1 and water is either co-transported
along with these or follows by osmosis. Mutations in
SGLT1 are associated with intestinal glucose galactose
malabsorption (GGM). Up-regulation of intestinal SGLT1
may protect against enteric infections. SGLT1 is
expressed in colorectal, head and neck, and prostate
tumors. Epidermal growth factor receptor (EGFR)
functions in cell survival by stabilizing SGLT1, and
thereby maintaining intracellular glucose levels. SGLT1
is predicted to have 14 membrane-spanning regions. This
subgroup belongs to the solute carrier 5
(SLC5)transporter family.
Length = 635
Score = 30.2 bits (68), Expect = 0.15
Identities = 11/63 (17%), Positives = 26/63 (41%), Gaps = 14/63 (22%)
Query: 55 NTEEEEKEEGKEKEEEEQKQQQQQQQEKEE--------------QQEKEEQQEKKEEEEK 100
+TEE + + E+E + + + +E+++ Q K + ++EE
Sbjct: 540 STEERIDLDADDWTEDEDENEMETDEERKKPGCCRKAYNWFCGFDQGKAPKLTEEEEAAL 599
Query: 101 KEE 103
K +
Sbjct: 600 KMK 602
Score = 29.5 bits (66), Expect = 0.33
Identities = 6/32 (18%), Positives = 17/32 (53%)
Query: 76 QQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+ +E+ + + +++ E E + +EE K+
Sbjct: 538 RNSTEERIDLDADDWTEDEDENEMETDEERKK 569
>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 187 and 201 amino
acids in length. There is a single completely conserved
residue Q that may be functionally important.
Length = 158
Score = 29.9 bits (68), Expect = 0.16
Identities = 11/52 (21%), Positives = 25/52 (48%)
Query: 53 ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
EL + E E + E + K + ++++ + E +E + K +E E + +
Sbjct: 55 ELAADLLELEAAAPRAELQAKIARYKKEKARYRSEAKELEAKAKEAEAESDH 106
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 30.1 bits (68), Expect = 0.17
Identities = 20/54 (37%), Positives = 40/54 (74%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
KE+ EEEE+E+ ++KEEEE+ ++++ ++EE++E+++++ KK +E E E
Sbjct: 33 KEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWEL 86
Score = 29.0 bits (65), Expect = 0.40
Identities = 18/52 (34%), Positives = 42/52 (80%)
Query: 56 TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+E EKE E+EEEE++++++++++ +++E+ +++E+KEE++KK ++ KE
Sbjct: 28 EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKE 79
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal. This
domain is found to the N-terminus of bacterial signal
peptidases of the S49 family (pfam01343).
Length = 154
Score = 29.4 bits (67), Expect = 0.21
Identities = 9/33 (27%), Positives = 22/33 (66%)
Query: 69 EEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKK 101
++E K ++ +++ E+ + K E+++ K+EE K
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKP 97
>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
Length = 925
Score = 29.6 bits (67), Expect = 0.24
Identities = 14/69 (20%), Positives = 36/69 (52%)
Query: 39 NSSRFGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEE 98
NS + ++ + + +EEE + + + EE + ++ + EEQ+ +++ + + E
Sbjct: 797 NSDINDRPQELIDQIIESEEERLKALRIQREEMLMRPEELELINEEQKNLKQEIKLELSE 856
Query: 99 EKKEEEEKE 107
++ EEE +
Sbjct: 857 IQEAEEEIQ 865
Score = 27.7 bits (62), Expect = 1.2
Identities = 11/70 (15%), Positives = 29/70 (41%), Gaps = 3/70 (4%)
Query: 37 NDNSSRFGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQ---QQQQQQEKEEQQEKEEQQE 93
++ + ++ + E + + K + K + ++ +Q EK +++
Sbjct: 855 SEIQEAEEEIQNINENKNEFVEFKNDPKKLNKLIIAKDVLIKLVISSDEIKQDEKTTKKK 914
Query: 94 KKEEEEKKEE 103
KK+ E+ EE
Sbjct: 915 KKDLEKTDEE 924
Score = 27.7 bits (62), Expect = 1.3
Identities = 18/100 (18%), Positives = 40/100 (40%), Gaps = 7/100 (7%)
Query: 8 NKSPSNLITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHFKELNTEEEEKEEGKEK 67
N + ++ + + + S N + + + + + K L + EE
Sbjct: 778 NPNAEKILIITEILSDGINNSDINDRP--QELIDQIIESEEERLKALRIQREEML--MRP 833
Query: 68 EEEEQKQQQQQ---QQEKEEQQEKEEQQEKKEEEEKKEEE 104
EE E ++Q+ Q+ K E E +E +E+ + + + E
Sbjct: 834 EELELINEEQKNLKQEIKLELSEIQEAEEEIQNINENKNE 873
>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
Reviewed.
Length = 460
Score = 29.6 bits (66), Expect = 0.26
Identities = 11/52 (21%), Positives = 18/52 (34%), Gaps = 2/52 (3%)
Query: 58 EEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEE--EKKEEEEKE 107
EE E E+E+ Q + + + E+ E +E E E
Sbjct: 364 EETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPE 415
Score = 28.0 bits (62), Expect = 0.90
Identities = 8/49 (16%), Positives = 17/49 (34%), Gaps = 1/49 (2%)
Query: 60 EKEEGKEKE-EEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E+E+ + + Q + +E + +E + E EK
Sbjct: 372 EREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKA 420
>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit. This model
describes Pseudomonas denitrificans CobT gene product,
which is a cobalt chelatase subunit that functions in
cobalamin biosynthesis. Cobalamin (vitamin B12) can be
synthesized via several pathways, including an aerobic
pathway (found in Pseudomonas denitrificans) and an
anaerobic pathway (found in P. shermanii and Salmonella
typhimurium). These pathways differ in the point of
cobalt insertion during corrin ring formation. There are
apparently a number of variations on these two pathways,
where the major differences seem to be concerned with
the process of ring contraction. Confusion regarding the
functions of enzymes found in the aerobic vs. anaerobic
pathways has arisen because nonhomologous genes in these
different pathways were given the same gene symbols.
Thus, cobT in the aerobic pathway (P. denitrificans) is
not a homolog of cobT in the anaerobic pathway (S.
typhimurium). It should be noted that E. coli
synthesizes cobalamin only when it is supplied with the
precursor cobinamide, which is a complex intermediate.
Additionally, all E. coli cobalamin synthesis genes
(cobU, cobS and cobT) were named after their Salmonella
typhimurium homologs which function in the anaerobic
cobalamin synthesis pathway. This model describes the
aerobic cobalamin pathway Pseudomonas denitrificans CobT
gene product, which is a cobalt chelatase subunit, with
a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
heterotrimeric, ATP-dependent enzyme that catalyzes
cobalt insertion during cobalamin biosynthesis. The
other two subunits are the P. denitrificans CobS
(TIGR01650) and CobN (pfam02514 CobN/Magnesium
Chelatase) proteins. To avoid potential confusion with
the nonhomologous Salmonella typhimurium/E.coli cobT
gene product, the P. denitrificans gene symbol is not
used in the name of this model [Biosynthesis of
cofactors, prosthetic groups, and carriers, Heme,
porphyrin, and cobalamin].
Length = 600
Score = 29.5 bits (66), Expect = 0.29
Identities = 15/70 (21%), Positives = 36/70 (51%)
Query: 38 DNSSRFGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEE 97
D+ F + ++ + + EE ++ + ++EE+ Q + E+EEQ E E + ++
Sbjct: 181 DDQQAFARVVREMLRSMELAEEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSA 240
Query: 98 EEKKEEEEKE 107
++ E ++E
Sbjct: 241 PQESEATDRE 250
Score = 29.1 bits (65), Expect = 0.43
Identities = 15/52 (28%), Positives = 27/52 (51%)
Query: 56 TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+E+EE + + E EQ++Q + + E +E +E + E E EEE +
Sbjct: 209 SEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQ 260
Score = 28.4 bits (63), Expect = 0.69
Identities = 17/53 (32%), Positives = 31/53 (58%), Gaps = 1/53 (1%)
Query: 55 NTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+TE E++E+G + + E +Q++Q + E E QE QE + + + E E+E
Sbjct: 206 DTESEDEEDGDDDQPTENEQEEQGEGE-GEGQEGSAPQESEATDRESESGEEE 257
Score = 27.2 bits (60), Expect = 1.8
Identities = 10/56 (17%), Positives = 31/56 (55%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+E+ + E ++E +++ + +Q++Q E E + ++ ++ E +++ E +E
Sbjct: 201 EEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEE 256
Score = 26.4 bits (58), Expect = 3.1
Identities = 10/55 (18%), Positives = 30/55 (54%)
Query: 53 ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E + ++++ E +++E+ E + + Q+ +E + + + E EEE + +++
Sbjct: 212 EEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQSDQDDL 266
Score = 26.4 bits (58), Expect = 3.5
Identities = 10/52 (19%), Positives = 29/52 (55%)
Query: 56 TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
TE E++E+G+ + E ++ Q+ + + + E E++ + +++ +E +
Sbjct: 221 TENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQSDQDDLPDESDD 272
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
Length = 330
Score = 29.0 bits (66), Expect = 0.32
Identities = 5/41 (12%), Positives = 24/41 (58%)
Query: 67 KEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
++ +E K++ + +++ + + +KK+E+++ + + +
Sbjct: 47 EQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAK 87
Score = 27.9 bits (63), Expect = 0.97
Identities = 11/42 (26%), Positives = 25/42 (59%)
Query: 66 EKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E+ +E +++ + +K+E + + Q+KKE++E K + K
Sbjct: 47 EQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKS 88
Score = 25.2 bits (56), Expect = 7.7
Identities = 8/27 (29%), Positives = 17/27 (62%)
Query: 69 EEEQKQQQQQQQEKEEQQEKEEQQEKK 95
++E K + Q++KE+Q+ K + + K
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSK 89
>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682). This
domain family is found in eukaryotes, and is typically
between 125 and 136 amino acids in length.
Length = 133
Score = 29.0 bits (65), Expect = 0.32
Identities = 14/40 (35%), Positives = 24/40 (60%)
Query: 58 EEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEE 97
++E +E+EEE +KQQQ + + ++ Q+ E E EE
Sbjct: 92 GHTRQEEEEEEEENEKQQQSDEAQVQQHQQHEHPAENGEE 131
Score = 25.9 bits (57), Expect = 3.1
Identities = 12/40 (30%), Positives = 25/40 (62%)
Query: 63 EGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKE 102
+EEEE++++ ++QQ+ +E Q ++ QQ + E +E
Sbjct: 92 GHTRQEEEEEEEENEKQQQSDEAQVQQHQQHEHPAENGEE 131
Score = 25.5 bits (56), Expect = 4.6
Identities = 14/38 (36%), Positives = 22/38 (57%)
Query: 50 HFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQE 87
H ++ EEEE+ E +++ +E Q QQ QQ + E E
Sbjct: 93 HTRQEEEEEEEENEKQQQSDEAQVQQHQQHEHPAENGE 130
>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
(nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
phosphoribosyltransferase) [Coenzyme metabolism].
Length = 620
Score = 29.4 bits (66), Expect = 0.33
Identities = 11/53 (20%), Positives = 27/53 (50%)
Query: 53 ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
+ E+ ++E+G + + + + + ++E E E EE + + + E +E E
Sbjct: 225 DGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEM 277
Score = 26.7 bits (59), Expect = 3.0
Identities = 10/51 (19%), Positives = 25/51 (49%)
Query: 55 NTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
N E+ E + + +E ++ + + + E ++ + + E E+ E E +E
Sbjct: 243 NNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSESDESDE 293
Score = 26.3 bits (58), Expect = 3.3
Identities = 10/53 (18%), Positives = 25/53 (47%)
Query: 55 NTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
N + E E E +E ++ + + + E+ E + + ++ E + +E+ E
Sbjct: 244 NEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSESDESDEDTE 296
Score = 25.6 bits (56), Expect = 6.7
Identities = 10/51 (19%), Positives = 27/51 (52%), Gaps = 1/51 (1%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+EE+ ++ + + E + +++ E ++ E++E + E E+ E + E
Sbjct: 232 DEEDGDDD-QPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAE 281
Score = 25.6 bits (56), Expect = 7.2
Identities = 12/51 (23%), Positives = 25/51 (49%), Gaps = 2/51 (3%)
Query: 59 EEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKK--EEEEKE 107
++ E EE+ Q ++ E +E+ E ++ EE+E + + E +E
Sbjct: 224 DDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEE 274
>gnl|CDD|235229 PRK04156, gltX, glutamyl-tRNA synthetase; Provisional.
Length = 567
Score = 29.0 bits (66), Expect = 0.37
Identities = 14/27 (51%), Positives = 19/27 (70%), Gaps = 2/27 (7%)
Query: 83 EEQQEK--EEQQEKKEEEEKKEEEEKE 107
EEQ+E+ E E EEEE+K+EE+K
Sbjct: 64 EEQRERLEELAPELLEEEEEKKEEKKG 90
Score = 25.2 bits (56), Expect = 7.8
Identities = 9/26 (34%), Positives = 19/26 (73%)
Query: 76 QQQQQEKEEQQEKEEQQEKKEEEEKK 101
+Q+++ +E E E++E+K+EE+K
Sbjct: 65 EQRERLEELAPELLEEEEEKKEEKKG 90
>gnl|CDD|215595 PLN03130, PLN03130, ABC transporter C family member; Provisional.
Length = 1622
Score = 29.3 bits (66), Expect = 0.39
Identities = 16/67 (23%), Positives = 27/67 (40%)
Query: 39 NSSRFGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEE 98
N F K ++ K EE EE ++ + K++ K++ +E K
Sbjct: 829 NGPLFQKLMENAGKMEEYVEENGEEEDDQTSSKPVANGNANNLKKDSSSKKKSKEGKSVL 888
Query: 99 EKKEEEE 105
K+EE E
Sbjct: 889 IKQEERE 895
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 28.5 bits (64), Expect = 0.45
Identities = 18/56 (32%), Positives = 34/56 (60%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+E EE KEE KE E K+ + ++++E+ E+E ++E +E ++ E+E +E
Sbjct: 57 QETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEE 112
Score = 27.8 bits (62), Expect = 0.83
Identities = 14/53 (26%), Positives = 34/53 (64%)
Query: 55 NTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E ++ ++ + E EE K+++++ E++++K + +++ EE E++ EEE E
Sbjct: 48 EQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDE 100
Score = 25.9 bits (57), Expect = 4.6
Identities = 15/52 (28%), Positives = 34/52 (65%)
Query: 56 TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+EEEKE +++E++ +++ +E EE+ E+E+++ E E++ EE+ +
Sbjct: 65 VKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTES 116
>gnl|CDD|221429 pfam12118, SprA-related, SprA-related family. This protein is
found in bacteria. Proteins in this family are typically
between 234 to 465 amino acids in length. There is a
conserved GEV sequence motif.Most members are annotated
as being SprA-related.
Length = 261
Score = 28.6 bits (64), Expect = 0.48
Identities = 21/83 (25%), Positives = 35/83 (42%)
Query: 10 SPSNLITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHFKELNTEEEEKEEGKEKEE 69
S S + + G A T S K E EEE E+ + +
Sbjct: 36 SSSEASNAGSGSSEQKAKLKGQASTAAGSASQELQKQASESNDEEVVGEEEPEKDQAEAG 95
Query: 70 EEQKQQQQQQQEKEEQQEKEEQQ 92
++ Q+QQ+QQ++EE+Q +E +
Sbjct: 96 KQLSQKQQEQQQQEEKQVRELKA 118
Score = 26.3 bits (58), Expect = 2.9
Identities = 8/46 (17%), Positives = 31/46 (67%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKE 102
+E +K+ + +EE +++ ++ + E ++ ++Q++++++E+K+
Sbjct: 67 QELQKQASESNDEEVVGEEEPEKDQAEAGKQLSQKQQEQQQQEEKQ 112
Score = 25.5 bits (56), Expect = 6.9
Identities = 10/53 (18%), Positives = 29/53 (54%)
Query: 53 ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
+E K+ E ++ +++ +++Q E +Q +K++E++++EE+
Sbjct: 59 STAAGSASQELQKQASESNDEEVVGEEEPEKDQAEAGKQLSQKQQEQQQQEEK 111
Score = 25.5 bits (56), Expect = 7.2
Identities = 7/51 (13%), Positives = 31/51 (60%)
Query: 55 NTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
++ + + E +++ +++ EK++ + ++ +K++E++++EE++
Sbjct: 62 AGSASQELQKQASESNDEEVVGEEEPEKDQAEAGKQLSQKQQEQQQQEEKQ 112
>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
Length = 1560
Score = 28.9 bits (64), Expect = 0.48
Identities = 23/102 (22%), Positives = 39/102 (38%), Gaps = 2/102 (1%)
Query: 6 AVNKSPSNLITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHFKELNTEEEEKEEGK 65
A N+ +NL T A+ + GN +V N +SR T + +
Sbjct: 492 AANQPAANLPTANQPAANNFHGAAGN--SVGNPFASRPFGSAPYGGNAATTADPNGIAKR 549
Query: 66 EKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E E +Q+ +Q EE E + E E + ++ +E
Sbjct: 550 EDHPEGGTNRQKYEQSDEESVESSSSENSSENENEVTDKGEE 591
>gnl|CDD|237147 PRK12586, PRK12586, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 145
Score = 28.2 bits (63), Expect = 0.52
Identities = 6/40 (15%), Positives = 16/40 (40%)
Query: 65 KEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
++ + E+ + + + +K+EE KK +
Sbjct: 104 RKNDAHTHASILLSSNEQNSTEALQLRAKKREEHRKKWYQ 143
Score = 25.9 bits (57), Expect = 4.1
Identities = 6/29 (20%), Positives = 15/29 (51%)
Query: 65 KEKEEEEQKQQQQQQQEKEEQQEKEEQQE 93
E+ + Q + +++EE ++K Q +
Sbjct: 117 SSNEQNSTEALQLRAKKREEHRKKWYQND 145
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 28.5 bits (64), Expect = 0.57
Identities = 13/63 (20%), Positives = 30/63 (47%), Gaps = 9/63 (14%)
Query: 51 FKELNTEEEEKEEGKEKEEEEQKQ---------QQQQQQEKEEQQEKEEQQEKKEEEEKK 101
+ N EE+E E E +++K+ Q++ +E + + ++++ KK+ +
Sbjct: 1520 IENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNKKQYKSNT 1579
Query: 102 EEE 104
E E
Sbjct: 1580 EAE 1582
Score = 27.4 bits (61), Expect = 1.5
Identities = 12/24 (50%), Positives = 20/24 (83%)
Query: 83 EEQQEKEEQQEKKEEEEKKEEEEK 106
EE+ +K++++EKK+EEE K EE+
Sbjct: 742 EEKTKKKKKKEKKKEEEYKREEKA 765
Score = 25.8 bits (57), Expect = 6.3
Identities = 11/42 (26%), Positives = 29/42 (69%)
Query: 64 GKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
GK+ E + +++ ++K+++++K+E++ K+EE+ + E E
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIEIAE 771
>gnl|CDD|236842 PRK11091, PRK11091, aerobic respiration control sensor protein
ArcB; Provisional.
Length = 779
Score = 28.8 bits (65), Expect = 0.58
Identities = 14/46 (30%), Positives = 26/46 (56%), Gaps = 7/46 (15%)
Query: 47 LQVHFKELNTE-------EEEKEEGKEKEEEEQKQQQQQQQEKEEQ 85
L+ + +LN E EE ++E E+ + E K++++ Q E E+Q
Sbjct: 108 LKDNIAQLNQEIAEREKAEEARQEAFEQLKNEIKEREETQIELEQQ 153
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 28.4 bits (64), Expect = 0.62
Identities = 19/96 (19%), Positives = 37/96 (38%), Gaps = 10/96 (10%)
Query: 15 ITEQILEASPLLESFGNAKTVRNDNSSRF------GKFLQVHFKELNTEEEEK----EEG 64
+ E LE L N + + D + + +EL E+ EEG
Sbjct: 152 VPEDKLEELKLESDVENVEYISTDKGYVYVVVVVLKELSDEVEEELKKLGFERLELEEEG 211
Query: 65 KEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEK 100
E + +++ ++ EKE + EE +E ++ +
Sbjct: 212 TPSELIREIKEELEEIEKERESLLEELKELAKKYLE 247
>gnl|CDD|217050 pfam02460, Patched, Patched family. The transmembrane protein
Patched is a receptor for the morphogene Sonic Hedgehog.
This protein associates with the smoothened protein to
transduce hedgehog signals.
Length = 801
Score = 28.5 bits (64), Expect = 0.62
Identities = 16/61 (26%), Positives = 26/61 (42%), Gaps = 4/61 (6%)
Query: 6 AVNKSPSNLITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQ--VHFKELNTEEEEKEE 63
VN P I E + +++ F + NS+ F +L+ +F + N E E EE
Sbjct: 498 FVNNPPDLTIPENRDRLNEMVDEFESTPYSMGPNSTSF--WLREYENFYKTNGSELEDEE 555
Query: 64 G 64
Sbjct: 556 K 556
>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
Length = 449
Score = 28.6 bits (64), Expect = 0.63
Identities = 14/56 (25%), Positives = 30/56 (53%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
K+ ++E E+ K E Q+++++ Q + E++ + EQ E+ EE+ + E
Sbjct: 371 KDSAEYDQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPL 426
Score = 25.5 bits (56), Expect = 6.4
Identities = 13/52 (25%), Positives = 27/52 (51%)
Query: 56 TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
EE + E +E+ E EQ ++ ++ E E + + E+E ++ + E+E
Sbjct: 394 EEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEEE 445
Score = 25.1 bits (55), Expect = 7.9
Identities = 10/52 (19%), Positives = 26/52 (50%)
Query: 56 TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E+ + E + EEE+ Q++ ++ E +Q + +++ + E E++
Sbjct: 380 AEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEF 431
>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
Provisional.
Length = 1123
Score = 28.4 bits (64), Expect = 0.63
Identities = 11/39 (28%), Positives = 25/39 (64%)
Query: 63 EGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKK 101
E K++E E Q +Q Q++ + Q+ K++++E ++ K+
Sbjct: 190 EEKQQELEAQLEQLQEKAAETSQERKQKRKEITDQAAKR 228
Score = 27.2 bits (61), Expect = 1.7
Identities = 10/50 (20%), Positives = 28/50 (56%)
Query: 53 ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKE 102
+++E + E +++QQ+ + + E+ QEK + ++ ++++KE
Sbjct: 171 LAEAQQQELVALEGLAAELEEKQQELEAQLEQLQEKAAETSQERKQKRKE 220
Score = 25.7 bits (57), Expect = 5.6
Identities = 10/40 (25%), Positives = 25/40 (62%)
Query: 56 TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKK 95
EE+++E + E+ ++K + Q+ K++++E +Q K+
Sbjct: 189 LEEKQQELEAQLEQLQEKAAETSQERKQKRKEITDQAAKR 228
Score = 25.3 bits (56), Expect = 9.4
Identities = 15/55 (27%), Positives = 30/55 (54%), Gaps = 1/55 (1%)
Query: 53 ELNTEEEEKEEGKEKEEEEQKQQQQQQQEK-EEQQEKEEQQEKKEEEEKKEEEEK 106
E +E EG E EE++Q+ + Q E+ +E+ + Q+ K++ +E ++ K
Sbjct: 173 EAQQQELVALEGLAAELEEKQQELEAQLEQLQEKAAETSQERKQKRKEITDQAAK 227
Score = 24.9 bits (55), Expect = 9.9
Identities = 11/49 (22%), Positives = 29/49 (59%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
E E+++ + + + EQ Q++ + +E +Q+++E ++ + + EEE
Sbjct: 187 AELEEKQQELEAQLEQLQEKAAETSQERKQKRKEITDQAAKRLELSEEE 235
>gnl|CDD|237869 PRK14962, PRK14962, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 472
Score = 28.6 bits (64), Expect = 0.64
Identities = 15/47 (31%), Positives = 28/47 (59%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEE 103
+E +E+ + QQ++++KEE + KEE+QE E E++ +E
Sbjct: 335 SSPNVQENDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRFKE 381
>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
binding fragment. Topoisomerase I promotes the
relaxation of DNA superhelical tension by introducing a
transient single-stranded break in duplex DNA and are
vital for the processes of replication, transcription,
and recombination. This family may be more than one
structural domain.
Length = 215
Score = 28.3 bits (64), Expect = 0.67
Identities = 11/28 (39%), Positives = 20/28 (71%)
Query: 77 QQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
+ ++EK++ KEE++ KEE++K EE
Sbjct: 96 EAEKEKKKAMSKEEKKAIKEEKDKLEEP 123
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 28.4 bits (63), Expect = 0.69
Identities = 10/51 (19%), Positives = 28/51 (54%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E+ ++++ + E+++K + +E ++ E+ + K +EE ++ E E
Sbjct: 4045 EDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPE 4095
Score = 26.9 bits (59), Expect = 2.6
Identities = 14/52 (26%), Positives = 25/52 (48%), Gaps = 3/52 (5%)
Query: 59 EEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQ---EKKEEEEKKEEEEKE 107
+E E E E++ +Q + + KE+ E K+ +EK++EEE
Sbjct: 3909 DEPNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMS 3960
Score = 26.1 bits (57), Expect = 4.6
Identities = 10/49 (20%), Positives = 26/49 (53%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
EE+ E ++ E+ + KE+ + E ++++E+E+++E +
Sbjct: 3913 EEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSD 3961
Score = 26.1 bits (57), Expect = 4.9
Identities = 11/50 (22%), Positives = 24/50 (48%)
Query: 58 EEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
EE+ E ++K E+ + +E K + + ++E+E +EE +
Sbjct: 3913 EEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDD 3962
Score = 26.1 bits (57), Expect = 5.3
Identities = 10/53 (18%), Positives = 26/53 (49%), Gaps = 2/53 (3%)
Query: 55 NTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKK--EEEEKKEEEE 105
+E+ E+G E+ +E ++ + + +E+ E+ E E + + K + +
Sbjct: 4057 EDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQAIDNHPKMDAKS 4109
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 28.2 bits (63), Expect = 0.70
Identities = 11/55 (20%), Positives = 39/55 (70%)
Query: 47 LQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKK 101
LQ + K+L ++E + +E E + ++++ +Q++ +++Q+E+ ++ ++K +++++
Sbjct: 378 LQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDER 432
Score = 27.8 bits (62), Expect = 0.92
Identities = 7/50 (14%), Positives = 34/50 (68%)
Query: 58 EEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
++ E E + E+ + + ++++ ++++ +++Q+E+ +E+++K ++++
Sbjct: 383 KKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDER 432
Score = 27.8 bits (62), Expect = 1.0
Identities = 9/56 (16%), Positives = 33/56 (58%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+ L ++ ++ +K E + + ++ + + +E++ +QE+ E+++K++ +E +
Sbjct: 369 ELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDK 424
Score = 27.1 bits (60), Expect = 2.2
Identities = 14/82 (17%), Positives = 43/82 (52%)
Query: 25 LLESFGNAKTVRNDNSSRFGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEE 84
L E + + D K ++ + E ++ + EE + + ++++ ++EE
Sbjct: 352 LYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEE 411
Query: 85 QQEKEEQQEKKEEEEKKEEEEK 106
++K+++Q +++E+++++E K
Sbjct: 412 NEKKQKEQADEDKEKRQKDERK 433
Score = 26.7 bits (59), Expect = 2.5
Identities = 11/51 (21%), Positives = 31/51 (60%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+E KE K+ ++ +K + + + ++ + + +E+K ++E+ E+++KE
Sbjct: 368 QELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKE 418
Score = 26.7 bits (59), Expect = 2.6
Identities = 14/50 (28%), Positives = 29/50 (58%)
Query: 58 EEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
++ +E KE ++ Q ++ + K+E EE + K +EE+ K+EE ++
Sbjct: 365 DKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEK 414
Score = 25.5 bits (56), Expect = 6.5
Identities = 11/53 (20%), Positives = 31/53 (58%)
Query: 55 NTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+ ++++E ++ ++ + ++ E +++ + E+ E K +EEK ++EE E
Sbjct: 361 DLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENE 413
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 28.2 bits (63), Expect = 0.71
Identities = 13/51 (25%), Positives = 37/51 (72%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
EEE+ +E E++ K+ + + ++E+++++EE++E++EE+E ++++ +
Sbjct: 145 EEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDD 195
Score = 27.0 bits (60), Expect = 1.7
Identities = 16/54 (29%), Positives = 35/54 (64%)
Query: 54 LNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
L TEEEE + K E++ ++ + + EE ++ EE++E++EEE++ +++ +
Sbjct: 141 LFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDD 194
Score = 25.9 bits (57), Expect = 3.9
Identities = 12/52 (23%), Positives = 35/52 (67%)
Query: 56 TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
EE+ E+ E++ ++ + + E++E+ E+EE++E++E+E+ ++++ +
Sbjct: 145 EEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDD 196
>gnl|CDD|237205 PRK12792, flhA, flagellar biosynthesis protein FlhA; Reviewed.
Length = 694
Score = 28.2 bits (63), Expect = 0.73
Identities = 9/27 (33%), Positives = 11/27 (40%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKE 83
E K K EEE Q + + KE
Sbjct: 329 ARAAAEAAKVKREEESAQAEAKDSVKE 355
Score = 25.8 bits (57), Expect = 6.2
Identities = 7/35 (20%), Positives = 14/35 (40%)
Query: 70 EEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
+ + + +EE + E + KE+ E E
Sbjct: 329 ARAAAEAAKVKREEESAQAEAKDSVKEQLRTAEIE 363
Score = 25.5 bits (56), Expect = 7.1
Identities = 6/35 (17%), Positives = 16/35 (45%)
Query: 64 GKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEE 98
+ E + +++++ + E + +EQ E E
Sbjct: 329 ARAAAEAAKVKREEESAQAEAKDSVKEQLRTAEIE 363
>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT. This family
consists of several bacterial cobalamin biosynthesis
(CobT) proteins. CobT is involved in the transformation
of precorrin-3 into cobyrinic acid.
Length = 282
Score = 27.9 bits (62), Expect = 0.92
Identities = 9/49 (18%), Positives = 24/49 (48%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
+E E + ++ E+E+ ++ + + EE++ ++ + EE E
Sbjct: 214 DEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEME 262
Score = 25.9 bits (57), Expect = 3.9
Identities = 9/52 (17%), Positives = 24/52 (46%)
Query: 56 TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E + E+ +++++ ++ + Q +E+E ++ E+ E E E
Sbjct: 216 PESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEME 267
>gnl|CDD|215544 PLN03029, PLN03029, type-a response regulator protein; Provisional.
Length = 222
Score = 27.7 bits (61), Expect = 0.92
Identities = 15/54 (27%), Positives = 31/54 (57%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
K E +EK+E E+ E + ++Q+Q Q+ + Q + ++Q ++ ++K EE
Sbjct: 151 KNQKQENQEKQEKLEESEIQSEKQEQPSQQPQSQPQPQQQPQQPNNNKRKAMEE 204
>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the monomeric yeast and human topo I
and heterodimeric topo I from Leishmania donvanni. Topo
I enzymes are divided into: topo type IA (bacterial)
and type IB (eukaryotic). Topo I relaxes superhelical
tension in duplex DNA by creating a single-strand nick,
the broken strand can then rotate around the unbroken
strand to remove DNA supercoils and, the nick is
religated, liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit re-ligation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I play putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 215
Score = 27.6 bits (62), Expect = 1.0
Identities = 13/32 (40%), Positives = 22/32 (68%)
Query: 73 KQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
Q ++++EK++ KEE++ KEE+EK EE
Sbjct: 91 YQYFEEEKEKKKAMSKEEKKAIKEEKEKLEEP 122
>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
Length = 259
Score = 27.8 bits (62), Expect = 1.0
Identities = 8/51 (15%), Positives = 32/51 (62%)
Query: 56 TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
+ E+++E E ++ +++++ E ++++E EE + +++ ++ +E++
Sbjct: 72 QKPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQ 122
>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
production and conversion].
Length = 660
Score = 27.7 bits (62), Expect = 1.0
Identities = 15/56 (26%), Positives = 24/56 (42%)
Query: 51 FKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
K L E EK E E K ++ + E+ + E+ +K EE E +E+
Sbjct: 73 VKGLEGRLFILPEEVEKLEAELKSLEEVIKPAEKFSSEVEELTRKLEERLSELDEE 128
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 197
Score = 27.7 bits (61), Expect = 1.0
Identities = 17/61 (27%), Positives = 39/61 (63%)
Query: 45 KFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
K L + +++ +E+EE +E+ E E+++++ ++E +E+QE+E +++ EE+ E
Sbjct: 119 KSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEH 178
Query: 105 E 105
E
Sbjct: 179 E 179
Score = 27.0 bits (59), Expect = 1.7
Identities = 15/51 (29%), Positives = 38/51 (74%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+E+ ++ +E+EE E++ + ++++EK +++E +E+QE++ EE+ EE+ +
Sbjct: 125 QEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEEQSDD 175
Score = 26.6 bits (58), Expect = 2.6
Identities = 16/58 (27%), Positives = 38/58 (65%)
Query: 50 HFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+ + ++ +K++ +E+ ++ +Q+++E EE+ E E ++EK +E E +EE+E+E
Sbjct: 106 QLRSVKKDDIKKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQERE 163
>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein. Rotavirus particles
consist of three concentric proteinaceous capsid layers.
The innermost capsid (core) is made of VP2. The genomic
RNA and the two minor proteins VP1 and VP3 are
encapsidated within this layer. The N-terminus of
rotavirus VP2 is necessary for the encapsidation of VP1
and VP3.
Length = 887
Score = 28.0 bits (62), Expect = 1.1
Identities = 12/50 (24%), Positives = 24/50 (48%), Gaps = 2/50 (4%)
Query: 60 EKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEE--EEKKEEEEKE 107
+E + Q++ ++Q +K + KE+ +KKEE + + KE
Sbjct: 6 RREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKE 55
Score = 26.4 bits (58), Expect = 3.0
Identities = 10/39 (25%), Positives = 21/39 (53%)
Query: 65 KEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEE 103
+E + Q+++ E+Q +K + K++ +KKEE
Sbjct: 5 NRREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEE 43
>gnl|CDD|222374 pfam13779, DUF4175, Domain of unknown function (DUF4175).
Length = 820
Score = 27.9 bits (63), Expect = 1.1
Identities = 10/56 (17%), Positives = 30/56 (53%), Gaps = 2/56 (3%)
Query: 51 FKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
F++L +E+ + E++ ++ + Q Q Q ++ Q+ + QQ+ ++ + + +
Sbjct: 608 FRDL--QEQFNAQRGEQQGQQGQGGQGQGQPGQQGQQGQGQQQGQQGQGGQGGQGS 661
Score = 27.6 bits (62), Expect = 1.3
Identities = 7/44 (15%), Positives = 23/44 (52%)
Query: 62 EEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
++G+ + + Q QQ QQ + ++Q ++ + + + + ++
Sbjct: 625 QQGQGGQGQGQPGQQGQQGQGQQQGQQGQGGQGGQGSLAERQQA 668
Score = 26.4 bits (59), Expect = 3.0
Identities = 6/38 (15%), Positives = 14/38 (36%)
Query: 62 EEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEE 99
+ G+ + ++QQ + E Q+ + E
Sbjct: 653 QGGQGGQGSLAERQQALRDELGRQRGGLPGMGGEAGEA 690
Score = 26.0 bits (58), Expect = 4.3
Identities = 11/60 (18%), Positives = 29/60 (48%)
Query: 45 KFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
+ LQ F E++ ++ + + + QQ QQ Q +++ Q+ + Q + ++++
Sbjct: 609 RDLQEQFNAQRGEQQGQQGQGGQGQGQPGQQGQQGQGQQQGQQGQGGQGGQGSLAERQQA 668
Score = 26.0 bits (58), Expect = 4.3
Identities = 9/41 (21%), Positives = 22/41 (53%)
Query: 62 EEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKE 102
+G+ ++ +Q Q QQQ Q+ + Q + ++++ + E
Sbjct: 632 GQGQPGQQGQQGQGQQQGQQGQGGQGGQGSLAERQQALRDE 672
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 27.3 bits (61), Expect = 1.1
Identities = 17/55 (30%), Positives = 34/55 (61%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
+ +EEE+EE E EE ++++Q + EKE + K E++ + E ++K+ +E+
Sbjct: 96 LDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILKEQ 150
Score = 24.9 bits (55), Expect = 8.1
Identities = 18/56 (32%), Positives = 36/56 (64%), Gaps = 2/56 (3%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
K L +++EKEE +E+E E ++ +++Q ++ EKE + K+E+ + E ++KE
Sbjct: 92 KLLGLDKKEKEEEEEEEVEVEELDEEEQIDEL--LEKELAKLKREKRRENERKQKE 145
>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
Length = 584
Score = 27.7 bits (62), Expect = 1.1
Identities = 8/43 (18%), Positives = 14/43 (32%), Gaps = 3/43 (6%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEE 99
+ E + + E EE+Q + E + EE
Sbjct: 260 APPQPPEEEPPPPPPPPEDDDDPPEDEEEQ---DDAEDRALEE 299
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 27.7 bits (62), Expect = 1.3
Identities = 15/58 (25%), Positives = 31/58 (53%)
Query: 50 HFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
K N +EK E + ++ + +++ ++E ++ + K E+QE + EE +K EE
Sbjct: 263 LEKLFNEALDEKFERDKIKQLASELEKKLEKELKKLENKLEKQEDELEELEKAAEELR 320
Score = 25.8 bits (57), Expect = 5.1
Identities = 10/47 (21%), Positives = 30/47 (63%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEE 103
E ++ ++ + E++ +++ ++ + K E+QE E ++ +K EE +++
Sbjct: 276 ERDKIKQLASELEKKLEKELKKLENKLEKQEDELEELEKAAEELRQK 322
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 27.5 bits (61), Expect = 1.3
Identities = 16/56 (28%), Positives = 34/56 (60%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
K+L + EE E+ + E+ QK+ +Q+ ++ ++ E+ ++ EE++K+ EE K
Sbjct: 71 KKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKA 126
Score = 26.7 bits (59), Expect = 2.5
Identities = 13/50 (26%), Positives = 31/50 (62%)
Query: 56 TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
+KE+ ++K+ E+Q ++ ++Q+ E+ ++KE +Q E+ K+ E+
Sbjct: 60 KPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQ 109
Score = 26.7 bits (59), Expect = 2.6
Identities = 16/55 (29%), Positives = 35/55 (63%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
++ N +++K+ +KE+E QK+ +QQ +E E+Q+ E+ ++K+ E+ E+
Sbjct: 50 QQANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAA 104
Score = 25.2 bits (55), Expect = 9.1
Identities = 12/53 (22%), Positives = 32/53 (60%)
Query: 55 NTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+E +K+ ++ EE E+++ +Q ++KE +Q ++ K+ E+ ++ E++
Sbjct: 65 KEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEK 117
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 27.8 bits (61), Expect = 1.3
Identities = 19/49 (38%), Positives = 35/49 (71%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
E+++K E +K EE++K+ + +++ E+ +K E+ +KKE EEKK+ EE
Sbjct: 1672 EDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEE 1720
Score = 27.4 bits (60), Expect = 1.8
Identities = 16/56 (28%), Positives = 34/56 (60%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
K + + E +K EE++K+ ++ ++ +E++++ E +K+ EE KK EE K+
Sbjct: 1654 KAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKK 1709
Score = 27.0 bits (59), Expect = 2.6
Identities = 20/49 (40%), Positives = 36/49 (73%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
E ++ EE K+KE EE+K+ ++ ++ +EE + K E+ +K+ EE+KK+ EE
Sbjct: 1700 EAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEE 1748
Score = 26.6 bits (58), Expect = 3.5
Identities = 19/55 (34%), Positives = 35/55 (63%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
K+ EE++K E +K EEE K + ++ +K E+ +K+ ++ KK EE++K+ E
Sbjct: 1639 KKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEA 1693
Score = 26.3 bits (57), Expect = 4.5
Identities = 20/54 (37%), Positives = 38/54 (70%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
K+ E+++ E+ K+KE EE+K+ ++ ++ +EE + K ++ KK EE+KK+ EE
Sbjct: 1626 KKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEE 1679
Score = 25.1 bits (54), Expect = 9.5
Identities = 18/56 (32%), Positives = 37/56 (66%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E EE+ E ++K+EE +K+ +++ EE+++ +E ++K EE++KK +E K+
Sbjct: 1357 DEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKK 1412
>gnl|CDD|224217 COG1298, FlhA, Flagellar biosynthesis pathway, component FlhA [Cell
motility and secretion / Intracellular trafficking and
secretion].
Length = 696
Score = 27.6 bits (62), Expect = 1.3
Identities = 12/29 (41%), Positives = 23/29 (79%)
Query: 71 EQKQQQQQQQEKEEQQEKEEQQEKKEEEE 99
K+++QQ +EKE+ E+ +++E++EEEE
Sbjct: 323 LSKRKEQQAEEKEKPAEEAKKEEEEEEEE 351
Score = 25.7 bits (57), Expect = 6.9
Identities = 10/31 (32%), Positives = 25/31 (80%)
Query: 74 QQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
++++QQ E++E+ +E ++E++EEEE+ ++
Sbjct: 325 KRKEQQAEEKEKPAEEAKKEEEEEEEESVDD 355
>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
[Intracellular trafficking and secretion].
Length = 776
Score = 27.6 bits (61), Expect = 1.3
Identities = 14/56 (25%), Positives = 24/56 (42%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
K N E E K+ + ++Q E+ + + E+ E EE+ E+ E E
Sbjct: 340 KWGNEEAERKDYILDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEEEVEDYEDE 395
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 27.3 bits (61), Expect = 1.4
Identities = 19/59 (32%), Positives = 32/59 (54%), Gaps = 3/59 (5%)
Query: 51 FKELNTEEEEKEEGKEKEEE---EQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
E E ++K K +E+E E+ +Q ++ KE + EE Q+K E+ EK+ E+ K
Sbjct: 311 LNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLEKRLEKLK 369
Score = 26.5 bits (59), Expect = 2.5
Identities = 15/54 (27%), Positives = 31/54 (57%)
Query: 53 ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
E + +E + E+E E +K+ ++ +EKE++ E+ EQ +E +E + E+
Sbjct: 300 EPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELNSKLEE 353
>gnl|CDD|236356 PRK08944, motB, flagellar motor protein MotB; Reviewed.
Length = 302
Score = 27.3 bits (61), Expect = 1.4
Identities = 14/49 (28%), Positives = 26/49 (53%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
++ K+ G E E + ++ Q++ E+ +K QQ +KE E+ E E
Sbjct: 119 QQRGKQRGGEASAEVKAEKAAAQEQIEDLVKKLAQQLEKEIEDGAIEIE 167
Score = 25.7 bits (57), Expect = 4.6
Identities = 11/49 (22%), Positives = 24/49 (48%), Gaps = 3/49 (6%)
Query: 58 EEEKEEGKEKEEEEQ---KQQQQQQQEKEEQQEKEEQQEKKEEEEKKEE 103
E + G +++ +Q + + + EK QE+ E KK ++ ++E
Sbjct: 110 ESDSAGGTQQQRGKQRGGEASAEVKAEKAAAQEQIEDLVKKLAQQLEKE 158
Score = 25.4 bits (56), Expect = 6.4
Identities = 11/49 (22%), Positives = 24/49 (48%)
Query: 56 TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
T+++ ++ + E K ++ QE+ E K+ Q+ ++E E E
Sbjct: 117 TQQQRGKQRGGEASAEVKAEKAAAQEQIEDLVKKLAQQLEKEIEDGAIE 165
>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32. This family
consists of several Mycoplasma species specific
Cytadhesin P32 and P30 proteins. P30 has been found to
be membrane associated and localised on the tip
organelle. It is thought that it is important in
cytadherence and virulence.
Length = 279
Score = 27.3 bits (60), Expect = 1.4
Identities = 11/48 (22%), Positives = 28/48 (58%)
Query: 60 EKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+++E + EE+E+++Q +Q ++ +Q +++ E EE +E +
Sbjct: 96 KRKEKRLLEEKERQEQMAEQLQRISEQNEQQAIEIDPTEEVNTQEPTQ 143
Score = 27.0 bits (59), Expect = 2.3
Identities = 12/48 (25%), Positives = 26/48 (54%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
+ +EK +EKE +EQ +Q Q+ ++ +Q+ E +E ++ +
Sbjct: 96 KRKEKRLLEEKERQEQMAEQLQRISEQNEQQAIEIDPTEEVNTQEPTQ 143
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 27.3 bits (61), Expect = 1.5
Identities = 13/30 (43%), Positives = 23/30 (76%)
Query: 78 QQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
Q+ E+EEQ+++EE++E++E EE + EE
Sbjct: 346 QKDEEEEQEDEEEEEEEEEPEEPEPEEGPP 375
>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein. Hsp70 chaperones help to fold
many proteins. Hsp70 assisted folding involves repeated
cycles of substrate binding and release. Hsp70 activity
is ATP dependent. Hsp70 proteins are made up of two
regions: the amino terminus is the ATPase domain and the
carboxyl terminus is the substrate binding region.
Length = 598
Score = 27.2 bits (61), Expect = 1.5
Identities = 15/62 (24%), Positives = 31/62 (50%), Gaps = 9/62 (14%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQE------KKEEEEKKEEEE 105
L+ +E E+ K+ EE + ++++E+ E + + E+ KEE +K E +
Sbjct: 499 SGLSDDEIER---MVKDAEEYAAEDKKRKERIEAKNEAEEYVYSLEKSLKEEGDKLPEAD 555
Query: 106 KE 107
K+
Sbjct: 556 KK 557
Score = 27.2 bits (61), Expect = 1.7
Identities = 10/52 (19%), Positives = 24/52 (46%), Gaps = 2/52 (3%)
Query: 54 LNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
L ++K GKE ++ + E ++ ++ +E E++K++E
Sbjct: 478 LTVSAKDKGTGKE--QKITITASSGLSDDEIERMVKDAEEYAAEDKKRKERI 527
>gnl|CDD|227084 COG4741, COG4741, Predicted secreted endonuclease distantly related
to archaeal Holliday junction resolvase [Nucleotide
transport and metabolism].
Length = 175
Score = 27.0 bits (60), Expect = 1.6
Identities = 16/46 (34%), Positives = 26/46 (56%)
Query: 62 EEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E + EE ++ ++++ E Q KEE+ + KE EKK EE +E
Sbjct: 32 ESKARELEETLQKAERERLVNEAQARKEEEWKLKEWIEKKIEEARE 77
>gnl|CDD|224114 COG1193, COG1193, Mismatch repair ATPase (MutS family) [DNA
replication, recombination, and repair].
Length = 753
Score = 27.3 bits (61), Expect = 1.6
Identities = 11/51 (21%), Positives = 16/51 (31%)
Query: 55 NTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
EE KE +E EE E+ + + K E K +
Sbjct: 512 KLEEVRKELEEELEEVEKLLDEVELLTGANSGGKTSLLELKAQIVVLAHMG 562
>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 926
Score = 27.2 bits (60), Expect = 1.7
Identities = 11/43 (25%), Positives = 20/43 (46%)
Query: 65 KEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
K +QK +++ +KE + E + + E K+ EEK
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKG 834
Score = 26.8 bits (59), Expect = 2.8
Identities = 10/51 (19%), Positives = 22/51 (43%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKE 102
KE ++E + E E ++ +Q+E++ KK++ K +
Sbjct: 802 KEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYKVD 852
Score = 26.8 bits (59), Expect = 2.9
Identities = 11/52 (21%), Positives = 24/52 (46%)
Query: 54 LNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
L+T + K+K +E+ ++ + E + + E+ K+ EEK +
Sbjct: 787 LSTTIKAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAP 838
Score = 25.3 bits (55), Expect = 8.8
Identities = 14/83 (16%), Positives = 35/83 (42%)
Query: 24 PLLESFGNAKTVRNDNSSRFGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKE 83
P + + K+VR N++ ++ + +E+ KE + E + + ++
Sbjct: 768 PRMYEEASGKSVRKVNTAVLSTTIKAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTI 827
Query: 84 EQQEKEEQQEKKEEEEKKEEEEK 106
+Q+E++ KK++ K
Sbjct: 828 KQREEKGIDAPAILNVKKKKPYK 850
>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
Provisional.
Length = 695
Score = 27.2 bits (61), Expect = 1.7
Identities = 11/54 (20%), Positives = 27/54 (50%), Gaps = 9/54 (16%)
Query: 60 EKEEGKEKEEEEQK--------QQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
K E + E+E++K + +Q + E+E+ +E + +K E ++++
Sbjct: 434 AKAEIRAIEQEKKKAEEAKARFEARQARLEREKA-AREARHKKAAEARAAKDKD 486
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 26.9 bits (60), Expect = 1.7
Identities = 12/51 (23%), Positives = 34/51 (66%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
EEE++E+ +EK + K + ++ + + +++++ ++EK+E+ ++ EE+
Sbjct: 39 EEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTP 89
>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
This model family represents the major capsid protein
component of the heads (capsids) of bacteriophage HK97,
phi-105, P27, and related phage. This model represents
one of several analogous families lacking detectable
sequence similarity. The gene encoding this component is
typically located in an operon encoding the small and
large terminase subunits, the portal protein and the
prohead or maturation protease [Mobile and
extrachromosomal element functions, Prophage functions].
Length = 384
Score = 27.3 bits (61), Expect = 1.7
Identities = 15/66 (22%), Positives = 28/66 (42%), Gaps = 5/66 (7%)
Query: 47 LQVHFKELNTEEEEKEEGKEKEEEEQKQQ-----QQQQQEKEEQQEKEEQQEKKEEEEKK 101
L ++L +E+ E +EK E + ++ + EE ++ E + E
Sbjct: 11 LAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASGEGGG 70
Query: 102 EEEEKE 107
EEE+E
Sbjct: 71 GEEEEE 76
Score = 26.2 bits (58), Expect = 3.9
Identities = 13/61 (21%), Positives = 27/61 (44%), Gaps = 4/61 (6%)
Query: 51 FKELNTEEEEKEEGKEKEE----EEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
K EEE+ E KEE + + + ++ ++ E + + EEE++E + +
Sbjct: 22 EKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASGEGGGGEEEEEEAKAE 81
Query: 107 E 107
Sbjct: 82 A 82
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 27.4 bits (60), Expect = 1.8
Identities = 16/46 (34%), Positives = 30/46 (65%)
Query: 62 EEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E +EK E + + +++ +++EE++ E Q KEEE K+E+E+E
Sbjct: 1009 PECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKEKERE 1054
Score = 27.0 bits (59), Expect = 2.5
Identities = 11/43 (25%), Positives = 27/43 (62%)
Query: 65 KEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+ K++EE+++ + Q+ KEE+ K+E++ ++ + + KE
Sbjct: 1025 RIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKE 1067
Score = 26.2 bits (57), Expect = 4.4
Identities = 19/93 (20%), Positives = 37/93 (39%)
Query: 15 ITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQ 74
+ + + + L G + +++D F N EE + E EE
Sbjct: 576 LKSRFQKDATLDSIEGEEELIQDDEKGNFEDLEDEENSSDNEMEESRGSSVTAENEESAD 635
Query: 75 QQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+ + E+EE K+E+ E E++ + EK+
Sbjct: 636 EVDYETEREENARKKEELRGNFELEERGDPEKK 668
Score = 25.9 bits (56), Expect = 5.5
Identities = 13/42 (30%), Positives = 28/42 (66%), Gaps = 1/42 (2%)
Query: 61 KEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKE 102
KE K++EE+E+ + Q+ +E EE +KE+++E++ + +
Sbjct: 1023 KERIKDQEEKERMESLQRAKE-EEIGKKEKEREQRIRKTIHD 1063
>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
ATPase domains [General function prediction only].
Length = 530
Score = 27.2 bits (61), Expect = 1.8
Identities = 12/63 (19%), Positives = 25/63 (39%), Gaps = 13/63 (20%)
Query: 58 EEEKEEGKEKEEEEQKQQQQQQQE-------------KEEQQEKEEQQEKKEEEEKKEEE 104
E+K E +E ++QQ++ + K ++ + ++ +K E EE
Sbjct: 239 LEQKAERLRQEAAAYEKQQKELAKEQEWIRRGKAAASKAKKAKSRIKRLEKLEARLAEER 298
Query: 105 EKE 107
E
Sbjct: 299 PVE 301
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 27.3 bits (61), Expect = 1.9
Identities = 16/56 (28%), Positives = 30/56 (53%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
KE E K E + E K+ ++Q + E + ++E+ KK EE+ ++E E++
Sbjct: 195 KEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEEKLRQELERQ 250
>gnl|CDD|237560 PRK13922, PRK13922, rod shape-determining protein MreC;
Provisional.
Length = 276
Score = 26.9 bits (60), Expect = 1.9
Identities = 9/50 (18%), Positives = 26/50 (52%)
Query: 41 SRFGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEE 90
+ +F+ F+ L + + +EE +E ++E + + + Q+ ++ + E
Sbjct: 55 NAPREFVSGVFESLASLFDLREENEELKKELLELESRLQELEQLEAENAR 104
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 27.0 bits (60), Expect = 1.9
Identities = 12/48 (25%), Positives = 23/48 (47%)
Query: 58 EEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
E+ E E++ QQ + Q EE+++ Q + ++K+ EE
Sbjct: 181 AAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEE 228
>gnl|CDD|221268 pfam11853, DUF3373, Protein of unknown function (DUF3373). This
family of proteins are functionally uncharacterized.
This protein is found in bacteria. Proteins in this
family are typically between 472 to 574 amino acids in
length.
Length = 485
Score = 27.1 bits (60), Expect = 1.9
Identities = 6/34 (17%), Positives = 18/34 (52%)
Query: 70 EEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEE 103
+ Q+ + +KE + K + ++ ++ +K E+
Sbjct: 26 DIDLLQKIEALKKELAELKAQLKDLQKRVDKTEK 59
>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
Length = 1627
Score = 27.1 bits (60), Expect = 1.9
Identities = 11/33 (33%), Positives = 25/33 (75%)
Query: 75 QQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
++ +++++E+ +EK E+ + +E +E+ EEEEK
Sbjct: 277 KELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKG 309
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 27.1 bits (60), Expect = 2.0
Identities = 18/55 (32%), Positives = 30/55 (54%)
Query: 53 ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
N E + E + K+EEE+ + ++ EKE+ E E +EE+E+ E EE +
Sbjct: 62 ISNYEALDSELDELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLENEELQ 116
>gnl|CDD|234172 TIGR03345, VI_ClpV1, type VI secretion ATPase, ClpV1 family.
Members of this protein family are homologs of ClpB, an
ATPase associated with chaperone-related functions.
These ClpB homologs, designated ClpV1, are a key
component of the bacterial pathogenicity-associated type
VI secretion system [Protein fate, Protein and peptide
secretion and trafficking, Cellular processes,
Pathogenesis].
Length = 852
Score = 27.2 bits (61), Expect = 2.0
Identities = 10/49 (20%), Positives = 19/49 (38%)
Query: 58 EEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
E + E E E + + QQEKE + + + E + ++
Sbjct: 453 AELRAELAALEAELAALEARWQQEKELVEAILALRAELEADADAPADDD 501
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 27.0 bits (60), Expect = 2.1
Identities = 14/69 (20%), Positives = 39/69 (56%), Gaps = 1/69 (1%)
Query: 40 SSRFGKFLQVHFKELNTEEEEKEEGKEKE-EEEQKQQQQQQQEKEEQQEKEEQQEKKEEE 98
S + L + + L+T EE+ + ++++K+ + + +K+ +++K++++EKKE +
Sbjct: 25 SYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPK 84
Query: 99 EKKEEEEKE 107
+ E +
Sbjct: 85 SEGETKLGF 93
Score = 25.4 bits (56), Expect = 8.0
Identities = 14/75 (18%), Positives = 36/75 (48%), Gaps = 14/75 (18%)
Query: 47 LQVHFKELNTEEEEKE--------------EGKEKEEEEQKQQQQQQQEKEEQQEKEEQQ 92
+ + F T++ + + E EE+ + +K+++++K +
Sbjct: 7 MILSFFSGTTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNES 66
Query: 93 EKKEEEEKKEEEEKE 107
+KK E++KK+++EK+
Sbjct: 67 KKKSEKKKKKKKEKK 81
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 26.7 bits (59), Expect = 2.3
Identities = 8/56 (14%), Positives = 26/56 (46%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+ E + KEK + E+K ++ + + K + + K + + + + ++ + +
Sbjct: 72 EPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTAAK 127
Score = 25.5 bits (56), Expect = 5.7
Identities = 10/51 (19%), Positives = 24/51 (47%)
Query: 56 TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
E E EE + E + + + +E+ + E++ +K + + K + + K
Sbjct: 58 PEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPK 108
>gnl|CDD|224340 COG1422, COG1422, Predicted membrane protein [Function unknown].
Length = 201
Score = 26.5 bits (59), Expect = 2.5
Identities = 13/42 (30%), Positives = 24/42 (57%), Gaps = 2/42 (4%)
Query: 64 GKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
+EK +E QK ++ Q+E E Q E KK ++ ++++ E
Sbjct: 70 DQEKMKELQKMMKEFQKEFREAQ--ESGDMKKLKKLQEKQME 109
Score = 25.4 bits (56), Expect = 6.2
Identities = 12/37 (32%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 73 KQQQQQQQEKEEQQEKEEQQEKKEEE--EKKEEEEKE 107
K ++ Q+ KE Q+E E QE + + +K +E++ E
Sbjct: 73 KMKELQKMMKEFQKEFREAQESGDMKKLKKLQEKQME 109
>gnl|CDD|215443 PLN02829, PLN02829, Probable galacturonosyltransferase.
Length = 639
Score = 26.7 bits (59), Expect = 2.5
Identities = 9/42 (21%), Positives = 18/42 (42%)
Query: 55 NTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKE 96
+ + + + E E QQ Q EK +++E + K+
Sbjct: 133 QEKNSQSQSASQAESLEHVQQSAQTSEKVDEKEPLLTKTDKQ 174
Score = 26.0 bits (57), Expect = 5.2
Identities = 9/40 (22%), Positives = 22/40 (55%)
Query: 66 EKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
+++ +Q+++ Q Q + + E Q+ + EK +E+E
Sbjct: 126 GQDDSDQQEKNSQSQSASQAESLEHVQQSAQTSEKVDEKE 165
>gnl|CDD|222948 PHA02941, PHA02941, hypothetical protein; Provisional.
Length = 356
Score = 26.9 bits (59), Expect = 2.5
Identities = 15/43 (34%), Positives = 28/43 (65%)
Query: 64 GKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
G E++E +Q+ Q+Q +E E+ ++++EEE+ EEEE+
Sbjct: 307 GGEEKEPKQESQEQLFNPFAIDEEMLEETQEQQEEEENEEEEE 349
Score = 26.1 bits (57), Expect = 4.8
Identities = 22/62 (35%), Positives = 34/62 (54%), Gaps = 9/62 (14%)
Query: 43 FGKFLQVHFKELNTEEEEKEEGKEKEE---------EEQKQQQQQQQEKEEQQEKEEQQE 93
F + ++V K+L EEKE +E +E EE ++ Q+QQE+EE +E+EE
Sbjct: 293 FQEMVKVAAKDLILGGEEKEPKQESQEQLFNPFAIDEEMLEETQEQQEEEENEEEEENDT 352
Query: 94 KK 95
K
Sbjct: 353 VK 354
>gnl|CDD|236154 PRK08119, PRK08119, flagellar motor switch protein; Validated.
Length = 382
Score = 26.4 bits (59), Expect = 2.9
Identities = 15/51 (29%), Positives = 27/51 (52%), Gaps = 3/51 (5%)
Query: 57 EEEEKEEGKEKEEEEQKQQQ---QQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
EEEE+EE E+EE + Q +QE+++ +++E EK+ +
Sbjct: 226 EEEEEEEEVEEEEAQASPAAEPATAQAAPAPKQEQQQAPPQRQEPEKEAQP 276
Score = 26.0 bits (58), Expect = 4.7
Identities = 9/43 (20%), Positives = 20/43 (46%)
Query: 56 TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEE 98
EEEE + E + +QE+++ + ++ EK+ +
Sbjct: 234 VEEEEAQASPAAEPATAQAAPAPKQEQQQAPPQRQEPEKEAQP 276
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding
protein (GBP) is induced by interferon-gamma during
macrophage induction. This family contains GBP1 and
GPB2, both GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 26.5 bits (59), Expect = 2.9
Identities = 14/55 (25%), Positives = 32/55 (58%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
+ L +E+ E + K E + +Q+ +++++E+++ E QE+ +E K+ EK
Sbjct: 197 QALTAKEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEK 251
Score = 26.1 bits (58), Expect = 3.6
Identities = 16/50 (32%), Positives = 29/50 (58%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
E K E E E+E +++Q+++++ E QE+ Q+ K+ EK E E +
Sbjct: 208 AERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAERE 257
>gnl|CDD|218738 pfam05766, NinG, Bacteriophage Lambda NinG protein. NinG or Rap is
involved in recombination. Rap (recombination adept with
plasmid) increases lambda-by-plasmid recombination
catalyzed by Escherichia coli's RecBCD pathway.
Length = 188
Score = 26.1 bits (58), Expect = 2.9
Identities = 9/36 (25%), Positives = 24/36 (66%)
Query: 72 QKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+++ Q+++++ E Q E+ E + +KE+ + + + KE
Sbjct: 36 REKAQEKKRKAEAQAERRELKARKEKLKTRSDWLKE 71
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 26.5 bits (58), Expect = 3.0
Identities = 8/51 (15%), Positives = 31/51 (60%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+E E +E ++ ++ +++ ++Q + ++ Q+K + + ++++ E +K+
Sbjct: 204 KERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQ 254
>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB proteins similar to the
monomeric yeast and human topo I. Topo I enzymes are
divided into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. This family may represent more than
one structural domain.
Length = 215
Score = 26.1 bits (58), Expect = 3.1
Identities = 13/51 (25%), Positives = 29/51 (56%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+EEK K+ + + Q + ++E+++ ++EKK + +KE+ E+E
Sbjct: 72 TKEEKVIIKDFSKCDFTQMFAYFKAQKEEKKAMSKEEKKAIKAEKEKLEEE 122
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 26.4 bits (58), Expect = 3.1
Identities = 10/35 (28%), Positives = 17/35 (48%), Gaps = 1/35 (2%)
Query: 74 QQQQQQQEKEEQQEK-EEQQEKKEEEEKKEEEEKE 107
+ + +K EK E + KKE +K+ E+E
Sbjct: 326 HAENAEIKKTRTAEKNEAKARKKEIAQKRRAAERE 360
Score = 26.4 bits (58), Expect = 3.6
Identities = 8/51 (15%), Positives = 26/51 (50%)
Query: 56 TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
T E+ + + ++KE ++++ +++ +E +QE+ + + +K
Sbjct: 337 TAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKK 387
Score = 25.6 bits (56), Expect = 6.0
Identities = 5/45 (11%), Positives = 21/45 (46%)
Query: 59 EEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEE 103
E K+ ++ E + ++++ Q++ + + ++ ++E
Sbjct: 331 EIKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMAR 375
Score = 25.2 bits (55), Expect = 9.5
Identities = 8/40 (20%), Positives = 22/40 (55%)
Query: 68 EEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E E K+ + ++ + + ++KE Q+++ E + E ++
Sbjct: 328 ENAEIKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQ 367
>gnl|CDD|178635 PLN03086, PLN03086, PRLI-interacting factor K; Provisional.
Length = 567
Score = 26.4 bits (58), Expect = 3.3
Identities = 12/39 (30%), Positives = 25/39 (64%)
Query: 60 EKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEE 98
E +EK E EQ++++Q+ + K E++ K +++ K+ E
Sbjct: 4 ELRRAREKLEREQRERKQRAKLKLERERKAKEEAAKQRE 42
>gnl|CDD|227441 COG5110, RPN1, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 881
Score = 26.4 bits (58), Expect = 3.3
Identities = 16/41 (39%), Positives = 26/41 (63%), Gaps = 2/41 (4%)
Query: 67 KEEEEQKQQ--QQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
+E ++KQQ +Q Q E+Q ++ +KKEEEE+ EE+
Sbjct: 2 VDESDKKQQTIDEQSQISPEKQTPNKKDKKKEEEEQLSEED 42
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 26.5 bits (58), Expect = 3.3
Identities = 10/39 (25%), Positives = 25/39 (64%)
Query: 68 EEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
EE+ + + +++ E E+ +++++EE++EEEE+
Sbjct: 141 AEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEE 179
Score = 26.2 bits (57), Expect = 4.4
Identities = 11/53 (20%), Positives = 35/53 (66%)
Query: 54 LNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
++ ++E+++E + EE+++++++++++ K E EE + ++ +K E +K
Sbjct: 153 IDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYEKSEVDK 205
Score = 25.8 bits (56), Expect = 6.9
Identities = 13/56 (23%), Positives = 30/56 (53%)
Query: 50 HFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
H E + + + +E+E + ++ ++EE++E+EE + +E+E+ E E
Sbjct: 139 HLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGE 194
Score = 25.4 bits (55), Expect = 8.2
Identities = 16/50 (32%), Positives = 29/50 (58%)
Query: 53 ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKE 102
E E+++ EE E+EEEE+++ + E EE + E+ +K E +K +
Sbjct: 158 EDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYEKSEVDKTD 207
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 25.8 bits (57), Expect = 3.3
Identities = 12/35 (34%), Positives = 27/35 (77%), Gaps = 2/35 (5%)
Query: 70 EEQK--QQQQQQQEKEEQQEKEEQQEKKEEEEKKE 102
EEQK +Q +++ K+++Q+K+++++KK++ KK
Sbjct: 96 EEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKA 130
>gnl|CDD|224712 COG1799, COG1799, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 167
Score = 26.2 bits (58), Expect = 3.4
Identities = 9/57 (15%), Positives = 22/57 (38%), Gaps = 4/57 (7%)
Query: 51 FKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
F ++E++ EE+ +Q+ + + E+ + +K E + E
Sbjct: 10 FFGYFPLDDEEDY----YEEDPRQEPRDPAVEPEEYAPPKYNFRKREPARMEMRGNV 62
>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
(DUF2058). This domain, found in various prokaryotic
proteins, has no known function.
Length = 177
Score = 26.0 bits (58), Expect = 3.6
Identities = 10/51 (19%), Positives = 31/51 (60%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKE 102
KE + ++ +G + ++E KQ ++ + ++ ++++E ++++ E E+K
Sbjct: 22 KEKRKQRKQARKGADDGDDELKQAAEEAKAEKAERDRELNRQRQAEAEQKA 72
>gnl|CDD|234032 TIGR02855, spore_yabG, sporulation peptidase YabG. Members of this
family are the protein YabG, demonstrated for Bacillus
subtilis to be an endopeptidase able to release
N-terminal peptides from a number of sporulation
proteins, including CotT, CotF, and SpoIVA. It appears
to be expressed under control of sigma-K [Cellular
processes, Sporulation and germination].
Length = 283
Score = 26.2 bits (58), Expect = 3.6
Identities = 11/40 (27%), Positives = 24/40 (60%)
Query: 68 EEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+E E+K+++++ +E+ E + +Q+ K +EKKE
Sbjct: 53 DESERKKRKKRFKERMETSYRLFRQDYKLMKEKKEYRATG 92
>gnl|CDD|224705 COG1792, MreC, Cell shape-determining protein [Cell envelope
biogenesis, outer membrane].
Length = 284
Score = 26.2 bits (58), Expect = 3.7
Identities = 14/57 (24%), Positives = 25/57 (43%), Gaps = 3/57 (5%)
Query: 43 FGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEE 99
+FL+ EE K+E E EQ ++ + E+E ++ KE K+ +
Sbjct: 61 VLEFLKSLKDLALENEELKKELAEL---EQLLEEVESLEEENKRLKELLDFKESSSD 114
>gnl|CDD|236544 PRK09506, mrcB, bifunctional glycosyl transferase/transpeptidase;
Reviewed.
Length = 830
Score = 26.3 bits (58), Expect = 3.7
Identities = 6/28 (21%), Positives = 17/28 (60%)
Query: 71 EQKQQQQQQQEKEEQQEKEEQQEKKEEE 98
+Q + QQQ + ++Q +++ ++K +
Sbjct: 792 QQSEMQQQPSQPQQQPQQQPAEQKDSDG 819
Score = 25.9 bits (57), Expect = 5.9
Identities = 8/28 (28%), Positives = 16/28 (57%)
Query: 62 EEGKEKEEEEQKQQQQQQQEKEEQQEKE 89
++ + +++ Q QQQ QQQ E++
Sbjct: 792 QQSEMQQQPSQPQQQPQQQPAEQKDSDG 819
Score = 25.5 bits (56), Expect = 6.8
Identities = 7/27 (25%), Positives = 17/27 (62%)
Query: 74 QQQQQQQEKEEQQEKEEQQEKKEEEEK 100
QQ + QQ+ + Q++ +QQ ++++
Sbjct: 792 QQSEMQQQPSQPQQQPQQQPAEQKDSD 818
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 26.2 bits (58), Expect = 3.8
Identities = 15/56 (26%), Positives = 35/56 (62%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
++L+ EE EE +E EE+E ++Q+ + + + + K + + K++E+KK++ +
Sbjct: 1132 EDLDKFEEALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSAD 1187
>gnl|CDD|222259 pfam13608, Potyvirid-P3, Protein P3 of Potyviral polyprotein. This
is the P3 protein section of the Potyviridae
polyproteins. The function is not known except that the
protein is essential to viral survival.
Length = 445
Score = 26.0 bits (58), Expect = 4.0
Identities = 10/36 (27%), Positives = 21/36 (58%), Gaps = 1/36 (2%)
Query: 70 EEQKQQQQQQQEKEEQQ-EKEEQQEKKEEEEKKEEE 104
+ +++ ++++EKEE++ K K E + EEE
Sbjct: 274 KLKREAAEEKEEKEEKEIRKLYLYSKLEGKLPTEEE 309
>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1. All
proteins in this family for which functions are known
are cyclin dependent protein kinases that are components
of TFIIH, a complex that is involved in nucleotide
excision repair and transcription initiation. Also known
as MAT1 (menage a trois 1). This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 309
Score = 25.9 bits (57), Expect = 4.0
Identities = 11/41 (26%), Positives = 28/41 (68%)
Query: 67 KEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+E+EE ++ + ++E+EEQ+ Q+E++E++ K + ++
Sbjct: 142 REQEELEEALEFEKEEEEQRRLLLQKEEEEQQMNKRKNKQA 182
>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
[Transcription / DNA replication, recombination, and
repair / Chromatin structure and dynamics].
Length = 1001
Score = 26.1 bits (57), Expect = 4.1
Identities = 13/54 (24%), Positives = 23/54 (42%)
Query: 54 LNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+ ++E +E E EE ++ + + E E EE E +E E K +
Sbjct: 942 EASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDELESKAAYDSR 995
>gnl|CDD|218660 pfam05620, DUF788, Protein of unknown function (DUF788). This
family consists of several eukaryotic proteins of
unknown function.
Length = 166
Score = 25.8 bits (57), Expect = 4.1
Identities = 7/22 (31%), Positives = 14/22 (63%)
Query: 74 QQQQQQQEKEEQQEKEEQQEKK 95
Q + + K ++QEK E++ +K
Sbjct: 145 QGAETNETKSKRQEKLEKRGEK 166
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 26.1 bits (58), Expect = 4.6
Identities = 12/61 (19%), Positives = 31/61 (50%), Gaps = 4/61 (6%)
Query: 51 FKELNTEEEEKEEGKEKEEEEQKQQQQQQ----QEKEEQQEKEEQQEKKEEEEKKEEEEK 106
+++L ++++ EE E++EE K + + + ++ +E E++ + EE+
Sbjct: 120 YRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLERRRRRREWEEKRA 179
Query: 107 E 107
E
Sbjct: 180 E 180
Score = 25.7 bits (57), Expect = 6.6
Identities = 13/55 (23%), Positives = 28/55 (50%), Gaps = 2/55 (3%)
Query: 53 ELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E + E+EE + ++ E+++ +E E+ ++ ++E EEK+ E E
Sbjct: 132 ESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLER--RRRRREWEEKRAELEFY 184
>gnl|CDD|235449 PRK05415, PRK05415, hypothetical protein; Provisional.
Length = 341
Score = 26.0 bits (58), Expect = 4.7
Identities = 14/40 (35%), Positives = 22/40 (55%)
Query: 66 EKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
E EEE++ + + QQ +EQ+ +EEEE+ E E
Sbjct: 14 EPLEEEEEPELRAQQAFDEQEAFAPAAPDEEEEEEGELEA 53
>gnl|CDD|235396 PRK05299, rpsB, 30S ribosomal protein S2; Provisional.
Length = 258
Score = 25.9 bits (58), Expect = 4.8
Identities = 12/36 (33%), Positives = 26/36 (72%)
Query: 63 EGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEE 98
EG++ E ++++++ E+EE++E+EE+ E+ E E
Sbjct: 223 EGRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
>gnl|CDD|237028 PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed.
Length = 648
Score = 25.9 bits (58), Expect = 5.3
Identities = 10/28 (35%), Positives = 16/28 (57%)
Query: 80 QEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+E +E+ E KE EEK+++ EK
Sbjct: 514 EEIAYIKEQMEGSAPKEPEEKEKKPEKP 541
Score = 25.5 bits (57), Expect = 5.8
Identities = 9/32 (28%), Positives = 19/32 (59%)
Query: 59 EEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEE 90
+ +EE +E+ + ++ +EKE++ EK E
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542
Score = 25.5 bits (57), Expect = 6.4
Identities = 8/32 (25%), Positives = 19/32 (59%)
Query: 66 EKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEE 97
+ EEE ++Q + ++ E++E++ +K E
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542
>gnl|CDD|150747 pfam10107, Endonuc_Holl, Endonuclease related to archaeal Holliday
junction resolvase. This domain is found in various
predicted bacterial endonucleases which are distantly
related to archaeal Holliday junction resolvases.
Length = 153
Score = 25.4 bits (56), Expect = 5.5
Identities = 9/45 (20%), Positives = 26/45 (57%)
Query: 60 EKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
K E + +E EQ + +++ + + E + ++ +K+EE++ ++
Sbjct: 18 GKVEKRARELFEQWRNAEREAQAMREAEAKFEEWQKKEEKEIRKD 62
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 25.5 bits (56), Expect = 5.7
Identities = 11/48 (22%), Positives = 34/48 (70%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
++ + ++ +++ ++ ++ ++E+ E++E+EE+++ EEE+ +EEE
Sbjct: 133 DDSDDDDSEDETAALLRELEKIKKERAEEKEREEEEKAAEEEKAREEE 180
>gnl|CDD|179867 PRK04598, tatA, twin arginine translocase protein A; Provisional.
Length = 81
Score = 24.8 bits (54), Expect = 5.8
Identities = 12/42 (28%), Positives = 25/42 (59%)
Query: 51 FKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQ 92
FK+ +EEE + K+ + E K +Q + E +++K+++Q
Sbjct: 39 FKKAMSEEESAKANKKDADFEPKNLEQAKTEAAAEKKKDKEQ 80
>gnl|CDD|223100 COG0021, TktA, Transketolase [Carbohydrate transport and
metabolism].
Length = 663
Score = 25.6 bits (57), Expect = 6.0
Identities = 11/51 (21%), Positives = 19/51 (37%)
Query: 52 KELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKE 102
K L E E E +E + +++ + + E +KK E E
Sbjct: 277 KALGWEPEPFEVPEEVYAAFRAVEERGAKAEAAWNELFAAYKKKYPELAAE 327
>gnl|CDD|130727 TIGR01666, YCCS, TIGR01666 family membrane protein. This model
represents a clade of sequences from gamma and beta
proteobacteria. These proteins are >700 amino acids long
and many have been annotated as putative membrane
proteins. The gene from Salmonella has been annotated as
a putative efflux transporter. The gene from E. coli has
the name yccS [Cell envelope, Other].
Length = 704
Score = 25.6 bits (56), Expect = 6.1
Identities = 12/35 (34%), Positives = 15/35 (42%)
Query: 66 EKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEK 100
E EE + QQQE E E +Q+ EE
Sbjct: 645 EHIEEIPEAIFNQQQESIETLELRKQEMTAEERAV 679
>gnl|CDD|236836 PRK11072, PRK11072, bifunctional glutamine-synthetase
adenylyltransferase/deadenyltransferase; Reviewed.
Length = 943
Score = 25.6 bits (57), Expect = 6.1
Identities = 9/50 (18%), Positives = 20/50 (40%)
Query: 49 VHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEE 98
F +L ++EE+ E + E+ ++ Q E++ E +
Sbjct: 430 RVFNQLIGDDEEETEEEAASEQWRELWQDALDEEDATPLLAELGFDDPAQ 479
>gnl|CDD|224739 COG1826, TatA, Sec-independent protein secretion pathway components
[Intracellular trafficking and secretion].
Length = 94
Score = 25.0 bits (55), Expect = 6.1
Identities = 11/48 (22%), Positives = 27/48 (56%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
+ E EE K +E +++K++ + + +++ + E KE+ +KK +
Sbjct: 47 KNELDEELKLEELDDKKKELTAELQATKEELDQLASELKEDLKKKAKP 94
>gnl|CDD|236277 PRK08493, PRK08493, NADH dehydrogenase subunit G; Validated.
Length = 819
Score = 25.8 bits (57), Expect = 6.4
Identities = 9/49 (18%), Positives = 26/49 (53%)
Query: 57 EEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
++ +E EE +++ +Q E+ + ++E ++E ++ +K E +
Sbjct: 456 KQSIVKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKVKKIPEVD 504
>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
represents the eukaryotic large ribosomal protein P1.
Eukaryotic P1 and P2 are functionally equivalent to the
bacterial protein L7/L12, but are not homologous to
L7/L12. P1 is located in the L12 stalk, with proteins
P2, P0, L11, and 28S rRNA. P1 and P2 are the only
proteins in the ribosome to occur as multimers, always
appearing as sets of heterodimers. Recent data indicate
that eukaryotes have four copies (two heterodimers),
while most archaeal species contain six copies of L12p
(three homodimers) and bacteria may have four or six
copies (two or three homodimers), depending on the
species. Experiments using S. cerevisiae P1 and P2
indicate that P1 proteins are positioned more internally
with limited reactivity in the C-terminal domains, while
P2 proteins seem to be more externally located and are
more likely to interact with other cellular components.
In lower eukaryotes, P1 and P2 are further subdivided
into P1A, P1B, P2A, and P2B, which form P1A/P2B and
P1B/P2A heterodimers. Some plant species have a third
P-protein, called P3, which is not homologous to P1 and
P2. In humans, P1 and P2 are strongly autoimmunogenic.
They play a significant role in the etiology and
pathogenesis of systemic lupus erythema (SLE). In
addition, the ribosome-inactivating protein
trichosanthin (TCS) interacts with human P0, P1, and P2,
with its primary binding site located in the C-terminal
region of P2. TCS inactivates the ribosome by
depurinating a specific adenine in the sarcin-ricin loop
of 28S rRNA.
Length = 103
Score = 25.0 bits (55), Expect = 6.4
Identities = 9/20 (45%), Positives = 13/20 (65%)
Query: 88 KEEQQEKKEEEEKKEEEEKE 107
+ ++EEKKEEEE+E
Sbjct: 74 AAAAAAEAKKEEKKEEEEEE 93
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 25.8 bits (57), Expect = 6.5
Identities = 19/60 (31%), Positives = 27/60 (45%)
Query: 47 LQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
LQ L E E+ K+ E ++Q +E E Q E+ E + + EE E EEK
Sbjct: 286 LQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEK 345
Score = 25.4 bits (56), Expect = 9.0
Identities = 19/67 (28%), Positives = 35/67 (52%)
Query: 41 SRFGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEK 100
SR + Q+ + L E + EE + + EE + + + +E E +EK E+ +++ E +
Sbjct: 298 SRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLE 357
Query: 101 KEEEEKE 107
E EE E
Sbjct: 358 AELEELE 364
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 25.3 bits (56), Expect = 6.6
Identities = 13/45 (28%), Positives = 34/45 (75%)
Query: 56 TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEK 100
EE++EE + ++EE++++++++++ + +KEE++++ EE EK
Sbjct: 13 DIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEK 57
>gnl|CDD|225470 COG2918, GshA, Gamma-glutamylcysteine synthetase [Coenzyme
metabolism].
Length = 518
Score = 25.5 bits (56), Expect = 6.7
Identities = 14/55 (25%), Positives = 24/55 (43%)
Query: 32 AKTVRNDNSSRFGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQ 86
+ +++ RFG ++EL +E +E E E+ Q+QQE E
Sbjct: 452 LRQMKDTGIGRFGLAFAEEYRELLRQEPLFLLIQEDFEAERLDSNQRQQELEAAD 506
>gnl|CDD|233870 TIGR02442, Cob-chelat-sub, cobaltochelatase subunit.
Cobaltochelatase is responsible for the insertion of
cobalt into the corrin ring of coenzyme B12 during its
biosynthesis. Two versions have been well described.
CbiK/CbiX is a monomeric, anaerobic version which acts
early in the biosynthesis (pfam06180). CobNST is a
trimeric, ATP-dependent, aerobic version which acts late
in the biosynthesis (TIGR02257/TIGR01650/TIGR01651). A
number of genomes (actinobacteria, cyanobacteria,
betaproteobacteria and pseudomonads) which apparently
biosynthesize B12, encode a cobN gene but are
demonstrably lacking cobS and cobT. These genomes do,
however contain a homolog (modelled here) of the
magnesium chelatase subunits BchI/BchD family. Aside
from the cyanobacteria (which have a separate magnesium
chelatase trimer), these species do not make chlorins,
so do not have any use for a magnesium chelatase.
Furthermore, in nearly all cases the members of this
family are proximal to either CobN itself or other genes
involved in cobalt transport or B12 biosynthesis.
Length = 633
Score = 25.4 bits (56), Expect = 6.8
Identities = 10/32 (31%), Positives = 19/32 (59%)
Query: 76 QQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
+Q Q K+E+ +E+ +E + EK +E E +
Sbjct: 314 FEQPQGKDEKDLEEKPEEPGPDPEKPDEGEDD 345
>gnl|CDD|234352 TIGR03779, Bac_Flav_CT_M, Bacteroides conjugative transposon TraM
protein. Members of this protein family are designated
TraM and are found in a proposed transfer region of a
class of conjugative transposon found in the Bacteroides
lineage [Cellular processes, DNA transformation].
Length = 410
Score = 25.5 bits (56), Expect = 7.0
Identities = 5/36 (13%), Positives = 16/36 (44%)
Query: 69 EEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
E K ++++ +E ++ + + + EE+
Sbjct: 137 YEYPKTDEEKELLREVEELESRLATEPSPAPELEEQ 172
>gnl|CDD|163696 cd08065, MPN_eIF3h, Mpr1p, Pad1p N-terminal (MPN) domains without
catalytic isopeptidase activity, found in eIF2h.
Eukaryotic translation initiation factor 3 (eIF3)
subunit h (eIF3h; eIF3 subunit 3; eIF3S3; eIF3-gamma;
eIF3-p40) is an evolutionarily non-conserved subunit of
the functional core that comprises eIF3a, eIF3b, eIF3c,
eIF3e, eIF3f, and eIF3h, and contains the MPN domain.
However, it lacks the canonical JAMM motif, and
therefore does not show catalytic isopeptidase
activity.Together with eIF3e and eIF3f, eIF3h stabilizes
the eIF3 complex. Results suggest that eIF3h regulates
cell growth and viability, and that over-expression of
the gene may provide growth advantage to prostate,
breast, and liver cancer cells. For example, EIF3h gene
amplification is common in late-stage prostate cancer
suggesting that it may be functionally involved in the
progression of the disease. It has been shown that
coamplification of MYC, a well characterized oncogene
involved in cell growth, differentiation, and apoptosis,
and EIF3h in patients with non-small cell lung cancer
(NSCLC) improves survival if treated with the Epidermal
Growth Factor Receptor Tyrosine Kinase Inhibitor
(EGFR-TKI), Gefitinib. Plant eIF3h is implicated in
translation of specific mRNAs.
Length = 266
Score = 25.3 bits (56), Expect = 7.1
Identities = 8/38 (21%), Positives = 18/38 (47%)
Query: 70 EEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E+ K Q+ +Q + ++ +K++ E + E E
Sbjct: 222 EQGKFNYYQRNLARQQAQIQQWLQKRKAENAQREARGE 259
Score = 25.3 bits (56), Expect = 7.4
Identities = 11/33 (33%), Positives = 20/33 (60%)
Query: 72 QKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
+QQ Q QQ ++++ + Q+E + EE EE+
Sbjct: 234 ARQQAQIQQWLQKRKAENAQREARGEEPLPEED 266
>gnl|CDD|235751 PRK06241, PRK06241, phosphoenolpyruvate synthase; Validated.
Length = 871
Score = 25.2 bits (56), Expect = 7.4
Identities = 10/47 (21%), Positives = 25/47 (53%)
Query: 55 NTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKK 101
N + E K K E+ +++ ++++E + ++ E+K +E K+
Sbjct: 611 NIKNFEPGASKRKFEQGRQEALEKEEELLSRLQQLPDGEQKAKETKR 657
>gnl|CDD|235551 PRK05667, dnaG, DNA primase; Validated.
Length = 580
Score = 25.2 bits (56), Expect = 7.5
Identities = 19/93 (20%), Positives = 36/93 (38%), Gaps = 4/93 (4%)
Query: 15 ITEQILEASPLLESFGNAKTVRNDNSSRFGKFLQVHFKELNTEEEEKEEGKEKEEE-EQK 73
+ E IL L + +R+ L+ EE +E+ E+E +
Sbjct: 478 LLEAILAQPGLTTGSQLLEHLRDAGLEELAALLE---SLAVWEEISEEDIAALEKELKDA 534
Query: 74 QQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEK 106
++ + Q EE+ E+ +E+ E EE+
Sbjct: 535 LEKLRDQLLEERLEELIAKERLLEGHGLSSEER 567
>gnl|CDD|224849 COG1938, COG1938, Archaeal enzymes of ATP-grasp superfamily
[General function prediction only].
Length = 244
Score = 25.4 bits (56), Expect = 7.6
Identities = 14/30 (46%), Positives = 21/30 (70%)
Query: 76 QQQQQEKEEQQEKEEQQEKKEEEEKKEEEE 105
+++ +E EEQ EK +Q +KEEE + EEE
Sbjct: 210 EKEAEEIEEQLEKLAEQLEKEEERVEREEE 239
>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
Members of this family contain a region found
exclusively in eukaryotic sodium channels or their
subunits, many of which are voltage-gated. Members very
often also contain between one and four copies of
pfam00520 and, less often, one copy of pfam00612.
Length = 230
Score = 25.0 bits (55), Expect = 7.7
Identities = 17/62 (27%), Positives = 29/62 (46%), Gaps = 5/62 (8%)
Query: 51 FKELNTEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEE-----QQEKKEEEEKKEEEE 105
+ LN E+ E +EE +K + + +E+ E + +EEEE+ EEE
Sbjct: 114 LENLNDEDTSSESSYGFKEESKKGSAETLKLEEKDSSSSEGSTVDLEPPEEEEEEIAEEE 173
Query: 106 KE 107
+E
Sbjct: 174 EE 175
>gnl|CDD|240253 PTZ00068, PTZ00068, 60S ribosomal protein L13a; Provisional.
Length = 202
Score = 25.0 bits (55), Expect = 8.0
Identities = 8/42 (19%), Positives = 20/42 (47%)
Query: 66 EKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
K EE++K++ +K+ + K ++ +K+ K +
Sbjct: 153 AKLEEKRKERAAAYYKKKVKLRKAWKEARKKALAKLPKAIVA 194
>gnl|CDD|179580 PRK03449, PRK03449, putative inner membrane protein translocase
component YidC; Provisional.
Length = 304
Score = 25.0 bits (55), Expect = 8.1
Identities = 5/17 (29%), Positives = 12/17 (70%)
Query: 81 EKEEQQEKEEQQEKKEE 97
+KEE+ +K+ + E++
Sbjct: 271 DKEEEAKKQAKAERRAA 287
>gnl|CDD|239571 cd03489, Topoisomer_IB_N_LdtopoI_like,
Topoisomer_IB_N_LdtopoI_like: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the heterodimeric topo I from
Leishmania donvanni. Topo I enzymes are divided into:
topo type IA (bacterial) and type IB (eukaryotic). Topo
I relaxes superhelical tension in duplex DNA by creating
a single-strand nick, the broken strand can then rotate
around the unbroken strand to remove DNA supercoils and,
the nick is religated, liberating topo I. These enzymes
regulate the topological changes that accompany DNA
replication, transcription and other nuclear processes.
Human topo I is the target of a diverse set of
anticancer drugs including camptothecins (CPTs). CPTs
bind to the topo I-DNA complex and inhibit re-ligation
of the single-strand nick, resulting in the accumulation
of topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topo I play putative roles in
organizing the kinetoplast DNA network unique to these
parasites. This family may represent more than one
structural domain.
Length = 212
Score = 25.2 bits (55), Expect = 8.1
Identities = 10/30 (33%), Positives = 22/30 (73%)
Query: 74 QQQQQQQEKEEQQEKEEQQEKKEEEEKKEE 103
+ +++EK++ + KEE++ KEE++K+ E
Sbjct: 89 EWHLREKEKKKSRTKEEKKALKEEKDKEAE 118
>gnl|CDD|221203 pfam11748, DUF3306, Protein of unknown function (DUF3306). This
family of proteobacterial species proteins has no known
function.
Length = 115
Score = 24.6 bits (54), Expect = 8.1
Identities = 10/41 (24%), Positives = 18/41 (43%)
Query: 67 KEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
K ++ + + EE+ EEEE+ E E++E
Sbjct: 10 KLAVRAEEPAEPAETAEEEAAAAAPAPAPEEEEEAELEDEE 50
>gnl|CDD|236781 PRK10869, PRK10869, recombination and repair protein; Provisional.
Length = 553
Score = 25.3 bits (56), Expect = 8.3
Identities = 13/46 (28%), Positives = 22/46 (47%), Gaps = 8/46 (17%)
Query: 69 EEEQKQQQQQQQEKEEQQEKEEQQEKK--------EEEEKKEEEEK 106
+ Q QQQ QE+ +++ + Q K+ E E+ +EE K
Sbjct: 170 CRDLAQHQQQSQERAARKQLLQYQLKELNEFAPQPGEFEQIDEEYK 215
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 24.6 bits (54), Expect = 8.8
Identities = 11/23 (47%), Positives = 18/23 (78%)
Query: 83 EEQQEKEEQQEKKEEEEKKEEEE 105
E+EE++E++EEEE++E EE
Sbjct: 72 AAAAEEEEEEEEEEEEEEEESEE 94
>gnl|CDD|221179 pfam11711, Tim54, Inner membrane protein import complex subunit
Tim54. Mitochondrial function depends on the import of
hundreds of different proteins synthesised in the
cytosol. Protein import is a multi-step pathway which
includes the binding of precursor proteins to surface
receptors, translocation of the precursor across one or
both mitochondrial membranes, and folding and assembly
of the imported protein inside the mitochondrion. Most
precursor proteins carry amino-terminal targeting
signals, called pre-sequences, and are imported into
mitochondria via import complexes located in both the
outer and the inner membrane (IM). The IM complex, TIM,
is made up of at least two proteins which mediate
translocation of proteins into the matrix by removing
their signal peptide and another pair of proteins, Tim54
and Tim22, that insert the polytopic proteins, that
carry internal targetting information, into the inner
membrane.
Length = 377
Score = 25.1 bits (55), Expect = 8.9
Identities = 11/52 (21%), Positives = 19/52 (36%)
Query: 56 TEEEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEEEKE 107
E E + E E + + E +E E ++ E+ +EE K
Sbjct: 198 PEPPEPTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENNKP 249
>gnl|CDD|239572 cd03490, Topoisomer_IB_N_1, Topoisomer_IB_N_1: A subgroup of the
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB. Topo IB proteins include the
monomeric yeast and human topo I and heterodimeric topo
I from Leishmania donvanni. Topo I enzymes are divided
into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topos I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I have putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 217
Score = 24.9 bits (54), Expect = 8.9
Identities = 9/26 (34%), Positives = 20/26 (76%)
Query: 58 EEEKEEGKEKEEEEQKQQQQQQQEKE 83
EEEKE+ K +EE++ +++++ ++E
Sbjct: 94 EEEKEKKKNLNKEEKEAKKKERAKRE 119
>gnl|CDD|178439 PLN02847, PLN02847, triacylglycerol lipase.
Length = 633
Score = 25.2 bits (55), Expect = 9.0
Identities = 19/44 (43%), Positives = 26/44 (59%), Gaps = 3/44 (6%)
Query: 58 EEEKEEGKEKEEEEQKQQQQQQQEKEEQQEKEEQQEKKE--EEE 99
EEE EG E E +K+ Q+Q+ E + Q ++EE KE EEE
Sbjct: 491 EEEVTEG-ELWYELEKELQRQETEVDAQAQEEEAAAAKEITEEE 533
>gnl|CDD|237200 PRK12778, PRK12778, putative bifunctional 2-polyprenylphenol
hydroxylase/glutamate synthase subunit beta;
Provisional.
Length = 752
Score = 25.1 bits (55), Expect = 9.0
Identities = 11/59 (18%), Positives = 23/59 (38%), Gaps = 3/59 (5%)
Query: 42 RFGKFLQVHFKELNTEEEEKEEGKEKEEEEQKQQQQQQQEK---EEQQEKEEQQEKKEE 97
R G + + +EL EE + + + K +++ E+ E + + EE
Sbjct: 256 RMGAYKTIEGEELLKLEERTAAWRAELRKSMKPKERTAIERVPMPELDPEYRAHNRFEE 314
>gnl|CDD|240282 PTZ00126, PTZ00126, tyrosyl-tRNA synthetase; Provisional.
Length = 383
Score = 25.0 bits (55), Expect = 9.5
Identities = 13/56 (23%), Positives = 19/56 (33%), Gaps = 4/56 (7%)
Query: 53 ELNTEEEEKEEGKEK----EEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
EE + K EE E + Q K +E+ + EE + EE
Sbjct: 1 TNPGEEFKSGSCSIKVGLREELETGFRGSPPQSKLSLEERVKLCLSIGEECIQPEE 56
>gnl|CDD|240400 PTZ00396, PTZ00396, Casein kinase II subunit beta; Provisional.
Length = 251
Score = 25.0 bits (55), Expect = 9.9
Identities = 6/38 (15%), Positives = 19/38 (50%)
Query: 67 KEEEEQKQQQQQQQEKEEQQEKEEQQEKKEEEEKKEEE 104
K+ ++ + Q + K ++ ++ K E +++E+
Sbjct: 209 KKSSKEYKLQNGEFGKSAIEQFLKKSPKGNIELQEQEQ 246
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.291 0.114 0.275
Gapped
Lambda K H
0.267 0.0812 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 5,218,211
Number of extensions: 472602
Number of successful extensions: 18256
Number of sequences better than 10.0: 1
Number of HSP's gapped: 10248
Number of HSP's successfully gapped: 4020
Length of query: 107
Length of database: 10,937,602
Length adjustment: 72
Effective length of query: 35
Effective length of database: 7,744,114
Effective search space: 271043990
Effective search space used: 271043990
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 17 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 44 (21.6 bits)
S2: 53 (24.0 bits)