RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy5329
(163 letters)
>gnl|CDD|238681 cd01385, MYSc_type_IX, Myosin motor domain, type IX myosins. Myosin
IX is a processive single-headed motor, which might play
a role in signalling. This catalytic (head) domain has
ATPase activity and belongs to the larger group of
P-loop NTPases. Myosins are actin-dependent molecular
motors that play important roles in muscle contraction,
cell motility, and organelle transport. The head domain
is a molecular motor, which utilizes ATP hydrolysis to
generate directed movement toward the plus end along
actin filaments. A cyclical interaction between myosin
and actin provides the driving force. Rates of ATP
hydrolysis and consequently the speed of movement along
actin filaments vary widely, from about 0.04 micrometer
per second for myosin I to 4.5 micrometer per second for
myosin II in skeletal muscle. Myosin II moves in
discrete steps about 5-10 nm long and generates 1-5
piconewtons of force. Upon ATP binding, the myosin head
dissociates from an actin filament. ATP hydrolysis
causes the head to pivot and associate with a new actin
subunit. The release of Pi causes the head to pivot and
move the filament (power stroke). Release of ADP
completes the cycle.
Length = 692
Score = 115 bits (290), Expect = 1e-30
Identities = 44/74 (59%), Positives = 55/74 (74%)
Query: 15 EEEEEEEDFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVE 74
E+ FEQLCINYANE LQYYFNQH+F+ EQEEY EGI W +IE++DN C+QL
Sbjct: 387 EDFGRCNSFEQLCINYANEQLQYYFNQHIFKLEQEEYQGEGITWTNIEYTDNVGCIQLFS 446
Query: 75 GKPNGLLCVLDDQA 88
KP GLL +LD+++
Sbjct: 447 KKPTGLLYLLDEES 460
>gnl|CDD|214580 smart00242, MYSc, Myosin. Large ATPases. ATPase; molecular motor.
Muscle contraction consists of a cyclical interaction
between myosin and actin. The core of the myosin
structure is similar in fold to that of kinesin.
Length = 677
Score = 114 bits (288), Expect = 3e-30
Identities = 41/67 (61%), Positives = 52/67 (77%)
Query: 23 FEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLLC 82
FEQLCINYANE LQ +FNQHVF+ EQEEY +EGI W I+F DN C+ L+E KP G+L
Sbjct: 387 FEQLCINYANEKLQQFFNQHVFKLEQEEYEREGIDWTFIDFFDNQDCIDLIEKKPPGILS 446
Query: 83 VLDDQAK 89
+LD++ +
Sbjct: 447 LLDEECR 453
>gnl|CDD|238071 cd00124, MYSc, Myosin motor domain. This catalytic (head) domain
has ATPase activity and belongs to the larger group of
P-loop NTPases. Myosins are actin-dependent molecular
motors that play important roles in muscle contraction,
cell motility, and organelle transport. The head domain
is a molecular motor, which utilizes ATP hydrolysis to
generate directed movement toward the plus end along
actin filaments. A cyclical interaction between myosin
and actin provides the driving force. Rates of ATP
hydrolysis and consequently the speed of movement along
actin filaments vary widely, from about 0.04 micrometer
per second for myosin I to 4.5 micrometer per second for
myosin II in skeletal muscle. Myosin II moves in
discrete steps about 5-10 nm long and generates 1-5
piconewtons of force. Upon ATP binding, the myosin head
dissociates from an actin filament. ATP hydrolysis
causes the head to pivot and associate with a new actin
subunit. The release of Pi causes the head to pivot and
move the filament (power stroke). Release of ADP
completes the cycle.
Length = 679
Score = 107 bits (269), Expect = 1e-27
Identities = 41/65 (63%), Positives = 51/65 (78%)
Query: 23 FEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLLC 82
FEQLCINYANE LQ +FNQHVF+ EQEEY +EGI W I+F+DN + L+E KP GLL
Sbjct: 380 FEQLCINYANEKLQQFFNQHVFKLEQEEYQEEGIDWESIDFTDNQEVIDLIEKKPGGLLS 439
Query: 83 VLDDQ 87
+LD++
Sbjct: 440 LLDEE 444
>gnl|CDD|238677 cd01381, MYSc_type_VII, Myosin motor domain, type VII myosins.
Myosins in this group have been associated with
functions in sensory systems such as vision and hearing.
This catalytic (head) domain has ATPase activity and
belongs to the larger group of P-loop NTPases. Myosins
are actin-dependent molecular motors that play important
roles in muscle contraction, cell motility, and
organelle transport. The head domain is a molecular
motor, which utilizes ATP hydrolysis to generate
directed movement toward the plus end along actin
filaments. A cyclical interaction between myosin and
actin provides the driving force. Rates of ATP
hydrolysis and consequently the speed of movement along
actin filaments vary widely, from about 0.04 micrometer
per second for myosin I to 4.5 micrometer per second for
myosin II in skeletal muscle. Myosin II moves in
discrete steps about 5-10 nm long and generates 1-5
piconewtons of force. Upon ATP binding, the myosin head
dissociates from an actin filament. ATP hydrolysis
causes the head to pivot and associate with a new actin
subunit. The release of Pi causes the head to pivot and
move the filament (power stroke). Release of ADP
completes the cycle.
Length = 671
Score = 100 bits (251), Expect = 2e-25
Identities = 38/74 (51%), Positives = 53/74 (71%)
Query: 16 EEEEEEDFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEG 75
E + FEQLCIN+ANE+LQ +F QH+F+ EQEEYN E I W+HIEF DN L L+
Sbjct: 373 ENFDVNSFEQLCINFANENLQQFFVQHIFKLEQEEYNLEHINWQHIEFVDNQDALDLIAI 432
Query: 76 KPNGLLCVLDDQAK 89
KP ++ ++D+++K
Sbjct: 433 KPLNIMSLIDEESK 446
>gnl|CDD|238680 cd01384, MYSc_type_XI, Myosin motor domain, plant-specific type XI
myosin, involved in organelle transport. This catalytic
(head) domain has ATPase activity and belongs to the
larger group of P-loop NTPases. Myosins are
actin-dependent molecular motors that play important
roles in muscle contraction, cell motility, and
organelle transport. The head domain is a molecular
motor, which utilizes ATP hydrolysis to generate
directed movement toward the plus end along actin
filaments. A cyclical interaction between myosin and
actin provides the driving force. Rates of ATP
hydrolysis and consequently the speed of movement along
actin filaments vary widely, from about 0.04 micrometer
per second for myosin I to 4.5 micrometer per second for
myosin II in skeletal muscle. Myosin II moves in
discrete steps about 5-10 nm long and generates 1-5
piconewtons of force. Upon ATP binding, the myosin head
dissociates from an actin filament. ATP hydrolysis
causes the head to pivot and associate with a new actin
subunit. The release of Pi causes the head to pivot and
move the filament (power stroke). Release of ADP
completes the cycle.
Length = 674
Score = 100 bits (251), Expect = 3e-25
Identities = 39/71 (54%), Positives = 48/71 (67%)
Query: 16 EEEEEEDFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEG 75
E + FEQ CIN NE LQ +FNQHVF+ EQEEY KE I W +IEF DN L L+E
Sbjct: 378 ESFKTNSFEQFCINLTNEKLQQHFNQHVFKMEQEEYTKEEIDWSYIEFVDNQDVLDLIEK 437
Query: 76 KPNGLLCVLDD 86
KP G++ +LD+
Sbjct: 438 KPGGIIALLDE 448
>gnl|CDD|215687 pfam00063, Myosin_head, Myosin head (motor domain).
Length = 679
Score = 100 bits (251), Expect = 3e-25
Identities = 38/68 (55%), Positives = 51/68 (75%), Gaps = 1/68 (1%)
Query: 23 FEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFS-DNTLCLQLVEGKPNGLL 81
FEQLCINYANE LQ +FN H+F+ EQEEY +EGI W I++ DN C+ L+E KP G+L
Sbjct: 381 FEQLCINYANEKLQQFFNHHMFKLEQEEYVREGIAWTFIDYGLDNQACIDLIEKKPPGIL 440
Query: 82 CVLDDQAK 89
+LD++ +
Sbjct: 441 SLLDEECR 448
>gnl|CDD|238676 cd01380, MYSc_type_V, Myosin motor domain, type V myosins. Myosins
V transport a variety of intracellular cargo
processively along actin filaments, such as membraneous
organelles and mRNA. This catalytic (head) domain has
ATPase activity and belongs to the larger group of
P-loop NTPases. Myosins are actin-dependent molecular
motors that play important roles in muscle contraction,
cell motility, and organelle transport. The head domain
is a molecular motor, which utilizes ATP hydrolysis to
generate directed movement toward the plus end along
actin filaments. A cyclical interaction between myosin
and actin provides the driving force. Rates of ATP
hydrolysis and consequently the speed of movement along
actin filaments vary widely, from about 0.04 micrometer
per second for myosin I to 4.5 micrometer per second for
myosin II in skeletal muscle. Myosin II moves in
discrete steps about 5-10 nm long and generates 1-5
piconewtons of force. Upon ATP binding, the myosin head
dissociates from an actin filament. ATP hydrolysis
causes the head to pivot and associate with a new actin
subunit. The release of Pi causes the head to pivot and
move the filament (power stroke). Release of ADP
completes the cycle.
Length = 691
Score = 98.2 bits (245), Expect = 2e-24
Identities = 40/67 (59%), Positives = 49/67 (73%), Gaps = 1/67 (1%)
Query: 23 FEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLLC 82
FEQ CINYANE LQ FNQHVF+ EQEEY KEGI W I+F DN C+ L+E K G+L
Sbjct: 386 FEQFCINYANEKLQQQFNQHVFKLEQEEYLKEGIEWTFIDFYDNQPCIDLIESKL-GILS 444
Query: 83 VLDDQAK 89
+LD++ +
Sbjct: 445 LLDEECR 451
>gnl|CDD|238683 cd01387, MYSc_type_XV, Myosin motor domain, type XV myosins. In
vertebrates, myosin XV appears to be expressed in
sensory tissue and play a role in hearing. This
catalytic (head) domain has ATPase activity and belongs
to the larger group of P-loop NTPases. Myosins are
actin-dependent molecular motors that play important
roles in muscle contraction, cell motility, and
organelle transport. The head domain is a molecular
motor, which utilizes ATP hydrolysis to generate
directed movement toward the plus end along actin
filaments. A cyclical interaction between myosin and
actin provides the driving force. Rates of ATP
hydrolysis and consequently the speed of movement along
actin filaments vary widely, from about 0.04 micrometer
per second for myosin I to 4.5 micrometer per second for
myosin II in skeletal muscle. Myosin II moves in
discrete steps about 5-10 nm long and generates 1-5
piconewtons of force. Upon ATP binding, the myosin head
dissociates from an actin filament. ATP hydrolysis
causes the head to pivot and associate with a new actin
subunit. The release of Pi causes the head to pivot and
move the filament (power stroke). Release of ADP
completes the cycle.
Length = 677
Score = 96.9 bits (241), Expect = 6e-24
Identities = 40/72 (55%), Positives = 50/72 (69%)
Query: 16 EEEEEEDFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEG 75
E+ FEQLCINYANE+LQY FN+ VFQ EQEEY +E + W I F+DN + L+
Sbjct: 372 EDLSFNSFEQLCINYANENLQYLFNKIVFQEEQEEYIREQLDWTEIAFADNQPVINLISL 431
Query: 76 KPNGLLCVLDDQ 87
KP G+L +LDDQ
Sbjct: 432 KPYGILRILDDQ 443
>gnl|CDD|238679 cd01383, MYSc_type_VIII, Myosin motor domain, plant-specific type
VIII myosins, a subgroup which has been associated with
endocytosis, cytokinesis, cell-to-cell coupling and
gating at plasmodesmata. This catalytic (head) domain
has ATPase activity and belongs to the larger group of
P-loop NTPases. Myosins are actin-dependent molecular
motors that play important roles in muscle contraction,
cell motility, and organelle transport. The head domain
is a molecular motor, which utilizes ATP hydrolysis to
generate directed movement toward the plus end along
actin filaments. A cyclical interaction between myosin
and actin provides the driving force. Rates of ATP
hydrolysis and consequently the speed of movement along
actin filaments vary widely, from about 0.04 micrometer
per second for myosin I to 4.5 micrometer per second for
myosin II in skeletal muscle. Myosin II moves in
discrete steps about 5-10 nm long and generates 1-5
piconewtons of force. Upon ATP binding, the myosin head
dissociates from an actin filament. ATP hydrolysis
causes the head to pivot and associate with a new actin
subunit. The release of Pi causes the head to pivot and
move the filament (power stroke). Release of ADP
completes the cycle.
Length = 677
Score = 91.8 bits (228), Expect = 3e-22
Identities = 39/66 (59%), Positives = 50/66 (75%)
Query: 23 FEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLLC 82
FEQ CINYANE LQ +FN+H+F+ EQEEY ++GI W +EF DN CL L E KP GLL
Sbjct: 383 FEQFCINYANERLQQHFNRHLFKLEQEEYEEDGIDWTKVEFEDNQECLDLFEKKPLGLLS 442
Query: 83 VLDDQA 88
+LD+++
Sbjct: 443 LLDEES 448
>gnl|CDD|238675 cd01379, MYSc_type_III, Myosin motor domain, type III myosins.
Myosin III has been shown to play a role in the vision
process in insects and in hearing in mammals. Myosin
III, an unconventional myosin, does not form dimers.
This catalytic (head) domain has ATPase activity and
belongs to the larger group of P-loop NTPases. Myosins
are actin-dependent molecular motors that play important
roles in muscle contraction, cell motility, and
organelle transport. The head domain is a molecular
motor, which utilizes ATP hydrolysis to generate
directed movement toward the plus end along actin
filaments. A cyclical interaction between myosin and
actin provides the driving force. Rates of ATP
hydrolysis and consequently the speed of movement along
actin filaments vary widely, from about 0.04 micrometer
per second for myosin I to 4.5 micrometer per second for
myosin II in skeletal muscle. Myosin II moves in
discrete steps about 5-10 nm long and generates 1-5
piconewtons of force. Upon ATP binding, the myosin head
dissociates from an actin filament. ATP hydrolysis
causes the head to pivot and associate with a new actin
subunit. The release of Pi causes the head to pivot and
move the filament (power stroke). Release of ADP
completes the cycle.
Length = 653
Score = 86.8 bits (215), Expect = 2e-20
Identities = 43/92 (46%), Positives = 62/92 (67%), Gaps = 4/92 (4%)
Query: 16 EEEEEEDFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEG 75
E ++ FEQLCIN ANE +QYYFNQH+F +EQ+EY EG+ R +E+ DN L +
Sbjct: 384 ENFKKNSFEQLCINIANEQIQYYFNQHIFAWEQQEYLNEGVDARLVEYEDNRPLLDMFLQ 443
Query: 76 KPNGLLCVLDDQA---KETEKAGV-LYEDNIK 103
KP GLL +LD+++ + T++ V +EDN+K
Sbjct: 444 KPLGLLALLDEESRFPQATDQTLVEKFEDNLK 475
>gnl|CDD|238673 cd01377, MYSc_type_II, Myosin motor domain, type II myosins. Myosin
II mediates cortical contraction in cell motility, and
is the motor in smooth and skeletal muscle. This
catalytic (head) domain has ATPase activity and belongs
to the larger group of P-loop NTPases. Myosins are
actin-dependent molecular motors that play important
roles in muscle contraction, cell motility, and
organelle transport. The head domain is a molecular
motor, which utilizes ATP hydrolysis to generate
directed movement toward the plus end along actin
filaments. A cyclical interaction between myosin and
actin provides the driving force. Rates of ATP
hydrolysis and consequently the speed of movement along
actin filaments vary widely, from about 0.04 micrometer
per second for myosin I to 4.5 micrometer per second for
myosin II in skeletal muscle. Myosin II moves in
discrete steps about 5-10 nm long and generates 1-5
piconewtons of force. Upon ATP binding, the myosin head
dissociates from an actin filament. ATP hydrolysis
causes the head to pivot and associate with a new actin
subunit. The release of Pi causes the head to pivot and
move the filament (power stroke). Release of ADP
completes the cycle.
Length = 693
Score = 85.3 bits (212), Expect = 6e-20
Identities = 38/75 (50%), Positives = 49/75 (65%), Gaps = 4/75 (5%)
Query: 23 FEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFS-DNTLCLQLVEGKPNGLL 81
FEQLCINY NE LQ +FN H+F EQEEY +EGI W I+F D + L+E P G+L
Sbjct: 392 FEQLCINYTNEKLQQFFNHHMFVLEQEEYQREGIEWTFIDFGLDLQPTIDLIEKNPMGIL 451
Query: 82 CVLDDQA---KETEK 93
+LD++ K T+K
Sbjct: 452 SLLDEECVFPKATDK 466
>gnl|CDD|238674 cd01378, MYSc_type_I, Myosin motor domain, type I myosins. Myosin I
generates movement at the leading edge in cell motility,
and class I myosins have been implicated in phagocytosis
and vesicle transport. Myosin I, an unconventional
myosin, does not form dimers. This catalytic (head)
domain has ATPase activity and belongs to the larger
group of P-loop NTPases. Myosins are actin-dependent
molecular motors that play important roles in muscle
contraction, cell motility, and organelle transport. The
head domain is a molecular motor, which utilizes ATP
hydrolysis to generate directed movement toward the plus
end along actin filaments. A cyclical interaction
between myosin and actin provides the driving force.
Rates of ATP hydrolysis and consequently the speed of
movement along actin filaments vary widely, from about
0.04 micrometer per second for myosin I to 4.5
micrometer per second for myosin II in skeletal muscle.
Myosin II moves in discrete steps about 5-10 nm long and
generates 1-5 piconewtons of force. Upon ATP binding,
the myosin head dissociates from an actin filament. ATP
hydrolysis causes the head to pivot and associate with a
new actin subunit. The release of Pi causes the head to
pivot and move the filament (power stroke). Release of
ADP completes the cycle.
Length = 674
Score = 83.0 bits (206), Expect = 3e-19
Identities = 33/65 (50%), Positives = 43/65 (66%), Gaps = 1/65 (1%)
Query: 23 FEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGK-PNGLL 81
FEQ CINY NE LQ F + + EQEEY +EGI+W IE+ +N + L+EGK P G+
Sbjct: 384 FEQFCINYVNEKLQQIFIELTLKAEQEEYVREGIKWTPIEYFNNKIVCDLIEGKRPPGIF 443
Query: 82 CVLDD 86
+LDD
Sbjct: 444 SILDD 448
>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
Length = 1463
Score = 82.0 bits (203), Expect = 7e-19
Identities = 40/71 (56%), Positives = 52/71 (73%), Gaps = 1/71 (1%)
Query: 19 EEEDFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGK-P 77
E+ FEQLCINY NE LQ +FNQH+F+ EQEEY KEGI W I++ DN C+ L+E K P
Sbjct: 442 EKNSFEQLCINYTNEKLQQFFNQHMFKLEQEEYVKEGIEWSFIDYFDNQPCIDLIEKKNP 501
Query: 78 NGLLCVLDDQA 88
G+L +LD++
Sbjct: 502 LGILSLLDEEC 512
>gnl|CDD|238678 cd01382, MYSc_type_VI, Myosin motor domain, type VI myosins. Myosin
VI is a monomeric myosin, which moves towards the
minus-end of actin filaments, in contrast to most other
myosins. It has been implicated in endocytosis,
secretion, and cell migration. This catalytic (head)
domain has ATPase activity and belongs to the larger
group of P-loop NTPases. Myosins are actin-dependent
molecular motors that play important roles in muscle
contraction, cell motility, and organelle transport. The
head domain is a molecular motor, which utilizes ATP
hydrolysis to generate directed movement toward the
minus end along actin filaments. A cyclical interaction
between myosin and actin provides the driving force.
Rates of ATP hydrolysis and consequently the speed of
movement along actin filaments vary widely, from about
0.04 micrometer per second for myosin I to 4.5
micrometer per second for myosin II in skeletal muscle.
Myosin II moves in discrete steps about 5-10 nm long and
generates 1-5 piconewtons of force. Upon ATP binding,
the myosin head dissociates from an actin filament. ATP
hydrolysis causes the head to pivot and associate with a
new actin subunit. The release of Pi causes the head to
pivot and move the filament (power stroke). Release of
ADP completes the cycle.
Length = 717
Score = 81.0 bits (200), Expect = 2e-18
Identities = 31/71 (43%), Positives = 46/71 (64%)
Query: 19 EEEDFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPN 78
E FEQ CINY NE LQ +FN+ + + EQE Y +EG+ + + DN C+ L+E K N
Sbjct: 411 EHNSFEQFCINYCNEKLQQFFNERILKEEQELYQREGLGVNEVHYVDNQDCIDLIEAKLN 470
Query: 79 GLLCVLDDQAK 89
G+L +LD++ +
Sbjct: 471 GILDILDEENR 481
>gnl|CDD|240229 PTZ00014, PTZ00014, myosin-A; Provisional.
Length = 821
Score = 54.3 bits (131), Expect = 4e-09
Identities = 26/64 (40%), Positives = 36/64 (56%)
Query: 24 EQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLLCV 83
EQL IN NE LQ F VF+ E + Y EGI +E++ N + L+ GK +L +
Sbjct: 481 EQLFINITNEMLQKNFVDIVFERESKLYKDEGISTEELEYTSNESVIDLLCGKGKSVLSI 540
Query: 84 LDDQ 87
L+DQ
Sbjct: 541 LEDQ 544
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 51.5 bits (124), Expect = 2e-08
Identities = 28/62 (45%), Positives = 46/62 (74%)
Query: 89 KETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
K T+K + E KK EEEK+E++++ ++KEEEEE+EK+++EE+++ EEEE EEE+
Sbjct: 407 KATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEK 466
Query: 149 TK 150
+
Sbjct: 467 EE 468
Score = 45.7 bits (109), Expect = 3e-06
Identities = 24/63 (38%), Positives = 46/63 (73%)
Query: 88 AKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
K EKA E+ K+++++ +++E +EEE++E++E+EK+EEEE+ + E+EE+EE+
Sbjct: 413 KKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEK 472
Query: 148 ETK 150
+ K
Sbjct: 473 KKK 475
Score = 45.3 bits (108), Expect = 4e-06
Identities = 26/55 (47%), Positives = 38/55 (69%), Gaps = 1/55 (1%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE-KMEEEEKEEEETKYLVRD 155
IKK E+ E++ EEE KE++K+ K+K+EEEE+E + +EEEKEEEE +
Sbjct: 412 IKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEK 466
Score = 41.8 bits (99), Expect = 5e-05
Identities = 20/58 (34%), Positives = 40/58 (68%)
Query: 84 LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEE 141
++ K+ E+ + ++++EEEEEE+ K+EE++EEEE+E +EE+E+E+ ++
Sbjct: 416 VEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKK 473
Score = 41.8 bits (99), Expect = 6e-05
Identities = 26/67 (38%), Positives = 46/67 (68%), Gaps = 4/67 (5%)
Query: 83 VLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
V + K E+ ++ KK K++EEEEE ++E+KEEE+E+E++E EE+++ EEE
Sbjct: 416 VEKAEKKREEEK----KEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEE 471
Query: 143 EKEEEET 149
+K+++ T
Sbjct: 472 KKKKQAT 478
Score = 39.5 bits (93), Expect = 3e-04
Identities = 17/46 (36%), Positives = 33/46 (71%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
K+ +K ++ E+ +++ +EE++EK+KK K+K EEEE+E+E+
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEK 450
Score = 39.5 bits (93), Expect = 4e-04
Identities = 16/51 (31%), Positives = 35/51 (68%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
++ +K E+ E+ +EEEK+E+++K ++++E+ EEE++++EE K
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEK 455
Score = 35.7 bits (83), Expect = 0.007
Identities = 17/49 (34%), Positives = 33/49 (67%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
++ K+ ++ E E+++EEE++++KK+ +K EEEE+EE+E K
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKK 451
Score = 34.5 bits (80), Expect = 0.015
Identities = 16/54 (29%), Positives = 34/54 (62%)
Query: 97 LYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
L E+ I+ K+ ++ + E+ E++ E+EKKE+++K ++++EEEE +
Sbjct: 394 LTEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEE 447
Score = 31.4 bits (72), Expect = 0.20
Identities = 12/47 (25%), Positives = 27/47 (57%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
EEE + ++ ++ K+ E+ EKK EEEK++ +++ ++ +
Sbjct: 396 EEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEE 442
Score = 27.2 bits (61), Expect = 4.4
Identities = 13/50 (26%), Positives = 30/50 (60%)
Query: 101 NIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
+ +EE E ++ K+ +K E+ ++K+EEE+KEK ++ +++ +
Sbjct: 393 ELTEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEE 442
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 50.8 bits (121), Expect = 6e-08
Identities = 45/152 (29%), Positives = 73/152 (48%), Gaps = 6/152 (3%)
Query: 4 NECFTIRGEEEEEEEEEEDFEQLCINYANEHLQYYFNQHVFQY-EQEEYNKEGIRWRHIE 62
+ GE E E +E+ED ++ A E + ++ E E + G + H
Sbjct: 761 RKETEHEGETEAEGKEDEDEGEI---QAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEG 817
Query: 63 FSDNTLCLQLVEGKPNGLLCVLDDQ--AKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEE 120
S+ V+ + ++Q AK+ EK + + EE+EEEEEEE +EE
Sbjct: 818 QSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEE 877
Query: 121 EKEEEEEKEKKEEEEKEKMEEEEKEEEETKYL 152
E+EEEEE+E++E EE +E E +++ YL
Sbjct: 878 EEEEEEEEEEEENEEPLSLEWPETRQKQAIYL 909
Score = 33.8 bits (77), Expect = 0.031
Identities = 22/61 (36%), Positives = 32/61 (52%)
Query: 90 ETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
E E ++ + EE E E E E E E+E E E +E +E E++ + E E K E ET
Sbjct: 698 EIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVET 757
Query: 150 K 150
+
Sbjct: 758 E 758
Score = 33.4 bits (76), Expect = 0.038
Identities = 23/59 (38%), Positives = 32/59 (54%)
Query: 90 ETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
ETE G E I+ EE +E E+E E + E K E E + ++E E E E E +E+E
Sbjct: 720 ETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDE 778
Score = 33.0 bits (75), Expect = 0.067
Identities = 21/64 (32%), Positives = 32/64 (50%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
+ +AKE + G + ++ E E + E E+E + E EE EE E + E E E E E
Sbjct: 698 EIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVET 757
Query: 145 EEEE 148
E +
Sbjct: 758 EGDR 761
Score = 31.5 bits (71), Expect = 0.20
Identities = 27/72 (37%), Positives = 36/72 (50%), Gaps = 2/72 (2%)
Query: 86 DQAKETEKAGVLYEDNIKKE--EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
D ETE V +E + E E+E E E EE +E E E E E E K E E E +E
Sbjct: 705 DHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKET 764
Query: 144 KEEEETKYLVRD 155
+ E ET+ ++
Sbjct: 765 EHEGETEAEGKE 776
Score = 30.7 bits (69), Expect = 0.33
Identities = 14/46 (30%), Positives = 23/46 (50%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
+ E ++E E E E E E E E+K E+E E E ++ + +
Sbjct: 662 ESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHK 707
Score = 30.3 bits (68), Expect = 0.50
Identities = 21/50 (42%), Positives = 26/50 (52%), Gaps = 4/50 (8%)
Query: 103 KKEEEEKEEEEEEEVKEE---EKEEEEEKEKKEEEEKE-KMEEEEKEEEE 148
EE E+ E E E EE E E+E E E K E E E ++ E K E+E
Sbjct: 645 TGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQE 694
Score = 30.0 bits (67), Expect = 0.61
Identities = 16/44 (36%), Positives = 18/44 (40%)
Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
E EE E E E EE E ++E E E E E E E
Sbjct: 641 TGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGE 684
Score = 29.2 bits (65), Expect = 1.1
Identities = 18/49 (36%), Positives = 25/49 (51%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
E I+ +E + + E E E E E E E E + E E + E EE E+E
Sbjct: 696 EGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDE 744
Score = 28.8 bits (64), Expect = 1.3
Identities = 17/50 (34%), Positives = 27/50 (54%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
E + E +E + + E E +E E E E E E E+E + + EE +E E+
Sbjct: 694 EGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVED 743
Score = 28.4 bits (63), Expect = 1.7
Identities = 17/66 (25%), Positives = 32/66 (48%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
+ + + + + E ++E E + E +E + K E + EE E E + E E + E E +
Sbjct: 674 ETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIE 733
Query: 145 EEEETK 150
EE +
Sbjct: 734 TGEEGE 739
Score = 28.4 bits (63), Expect = 2.0
Identities = 22/63 (34%), Positives = 32/63 (50%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
+ +A+E E G + + E E + EE EEV++E + E E K + E E K E E
Sbjct: 709 ETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEG 768
Query: 145 EEE 147
E E
Sbjct: 769 ETE 771
Score = 28.4 bits (63), Expect = 2.2
Identities = 16/46 (34%), Positives = 22/46 (47%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
+ EE E E+E E + E E E E E + E+ E E E +E
Sbjct: 658 ENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKE 703
Score = 28.4 bits (63), Expect = 2.3
Identities = 14/43 (32%), Positives = 22/43 (51%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
E E K E E E E++ E+E E + E ++ + E + EE
Sbjct: 672 ETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEE 714
Score = 28.0 bits (62), Expect = 2.5
Identities = 20/61 (32%), Positives = 31/61 (50%)
Query: 90 ETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
E E G + + E + EE E E + E + E+E E + EE E++E+E + E E
Sbjct: 692 EQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEG 751
Query: 150 K 150
K
Sbjct: 752 K 752
Score = 28.0 bits (62), Expect = 2.5
Identities = 18/61 (29%), Positives = 30/61 (49%)
Query: 87 QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
+ + E++G E + E + + E E E E + E+E E E + +E K E E +E
Sbjct: 656 EGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEV 715
Query: 147 E 147
E
Sbjct: 716 E 716
Score = 27.7 bits (61), Expect = 4.1
Identities = 19/62 (30%), Positives = 32/62 (51%)
Query: 86 DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
+Q ETE G + E + E+E E E++ +E + + E E +E E + + E E E
Sbjct: 668 EQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTE 727
Query: 146 EE 147
+E
Sbjct: 728 DE 729
Score = 26.9 bits (59), Expect = 6.1
Identities = 14/46 (30%), Positives = 22/46 (47%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
E E E EE + + E E E+ E +++ E E K E E++
Sbjct: 637 EAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESE 682
Score = 26.5 bits (58), Expect = 7.7
Identities = 16/63 (25%), Positives = 31/63 (49%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
+ + E E E+ + E+E + E E + E E + +E + + E E + K +E+E
Sbjct: 722 EAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEG 781
Query: 145 EEE 147
E +
Sbjct: 782 EIQ 784
Score = 26.5 bits (58), Expect = 7.9
Identities = 14/64 (21%), Positives = 27/64 (42%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHL 158
++E E + + E E E E + E+E + E E ++ + + + E E +
Sbjct: 664 GGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEA 723
Query: 159 ESCE 162
E E
Sbjct: 724 EGTE 727
Score = 26.5 bits (58), Expect = 9.7
Identities = 16/48 (33%), Positives = 20/48 (41%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
D K + E E E +E E+ E E E EE E +E E E
Sbjct: 628 GDLSKGDVAEAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETET 675
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 45.1 bits (107), Expect = 1e-06
Identities = 17/46 (36%), Positives = 27/46 (58%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
EE E+E +E E ++ + + E KEKK+ E + E+EK + E K
Sbjct: 99 EETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPK 144
Score = 37.0 bits (86), Expect = 0.001
Identities = 14/42 (33%), Positives = 24/42 (57%)
Query: 109 KEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
++EEE EEE +E E+E E + E E++++E + K
Sbjct: 92 SSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPK 133
Score = 35.9 bits (83), Expect = 0.003
Identities = 13/37 (35%), Positives = 23/37 (62%)
Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKME 140
+E E+E+ E + + +EK++ E + K E+EK K E
Sbjct: 106 TDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTE 142
Score = 35.1 bits (81), Expect = 0.006
Identities = 13/43 (30%), Positives = 24/43 (55%)
Query: 108 EKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
+EEEE E + ++ E+E+ + + E KEK + E + + K
Sbjct: 94 SDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEK 136
Score = 34.3 bits (79), Expect = 0.010
Identities = 8/46 (17%), Positives = 27/46 (58%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
E+ ++E ++ E+E+ + E +E++++E + + +++ + E
Sbjct: 98 EEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEP 143
Score = 32.4 bits (74), Expect = 0.055
Identities = 13/51 (25%), Positives = 25/51 (49%)
Query: 90 ETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKME 140
E E ++ +++ E + E +E+ K E + + EKEK + E K+
Sbjct: 98 EEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKP 148
Score = 27.8 bits (62), Expect = 2.2
Identities = 11/54 (20%), Positives = 29/54 (53%), Gaps = 6/54 (11%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
++ ETE+ + + + E +E+++ EV + + E+E+ K + ++ + K
Sbjct: 103 EESTDETEQE------DPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSK 150
Score = 27.4 bits (61), Expect = 2.2
Identities = 12/42 (28%), Positives = 23/42 (54%)
Query: 119 EEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLES 160
++E+EE EE+ E E+++ E + + +E+ K V E
Sbjct: 95 DDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEK 136
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 42.0 bits (98), Expect = 6e-05
Identities = 20/41 (48%), Positives = 27/41 (65%), Gaps = 2/41 (4%)
Query: 103 KKEEE--EKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEE 141
KK EE EK + E E+ EE+E E+EKEK+ E E+E+ E
Sbjct: 579 KKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAE 619
Score = 41.2 bits (96), Expect = 9e-05
Identities = 17/44 (38%), Positives = 31/44 (70%)
Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
K +++EE E+ +E E++ EE+E+++E+EKE+ E E+E E
Sbjct: 576 KLAKKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAE 619
Score = 38.5 bits (89), Expect = 9e-04
Identities = 16/40 (40%), Positives = 29/40 (72%), Gaps = 1/40 (2%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
E+ ++K + E E++ EE +E EKE+E+E+E++ E E E+
Sbjct: 582 EEAVEKAKREAEQKAREE-REREKEKEKEREREREREAER 620
Score = 36.2 bits (83), Expect = 0.005
Identities = 17/42 (40%), Positives = 27/42 (64%), Gaps = 1/42 (2%)
Query: 109 KEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
K ++ EE E+ K E E+K +EE E+EK +E+E+E E +
Sbjct: 576 KLAKKREEAVEKAKREAEQK-AREEREREKEKEKERERERER 616
Score = 30.4 bits (68), Expect = 0.38
Identities = 13/28 (46%), Positives = 21/28 (75%), Gaps = 1/28 (3%)
Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKK 131
+EE E+E+E+E+E +E E+E E E+ K
Sbjct: 597 REEREREKEKEKE-REREREREAERAAK 623
Score = 29.3 bits (65), Expect = 1.0
Identities = 15/50 (30%), Positives = 26/50 (52%)
Query: 92 EKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEE 141
EKA E ++E E ++E+E+E +E E+E E + + +M E
Sbjct: 586 EKAKREAEQKAREEREREKEKEKEREREREREAERAAKASSSSHESRMSE 635
Score = 28.9 bits (64), Expect = 1.3
Identities = 14/36 (38%), Positives = 23/36 (63%), Gaps = 3/36 (8%)
Query: 120 EEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
+++EE EK K+E E+K + EE+E E+ K R+
Sbjct: 579 KKREEAVEKAKREAEQKAR---EEREREKEKEKERE 611
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 40.7 bits (96), Expect = 1e-04
Identities = 22/54 (40%), Positives = 37/54 (68%), Gaps = 2/54 (3%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKE--EEEKEKMEEEEKEEEETK 150
E+ I K EE+ +EE +E KEE+K+EE E + + EE+ K+EE+E++++ K
Sbjct: 269 EEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322
Score = 35.3 bits (82), Expect = 0.009
Identities = 23/60 (38%), Positives = 40/60 (66%), Gaps = 4/60 (6%)
Query: 97 LYEDNIKKEEEEKEEEEEEEVKEEEKEEEEE-KEKKEEEEKEKMEEEEKE---EEETKYL 152
L + ++K ++ +EEEEE+ +K E+E +EE +EKKEE++KE+ E + + EE+ K
Sbjct: 253 LSPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLE 312
Score = 30.3 bits (69), Expect = 0.42
Identities = 15/55 (27%), Positives = 34/55 (61%), Gaps = 3/55 (5%)
Query: 89 KETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEE---EEKEKKEEEEKEKME 140
+E E+ + + ++EE ++++EE+++ + E K + EE+ K EE+E++K
Sbjct: 266 EEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQA 320
>gnl|CDD|238682 cd01386, MYSc_type_XVIII, Myosin motor domain, type XVIII myosins.
This catalytic (head) domain has ATPase activity and
belongs to the larger group of P-loop NTPases. Myosins
are actin-dependent molecular motors that play important
roles in muscle contraction, cell motility, and
organelle transport. The head domain is a molecular
motor, which utilizes ATP hydrolysis to generate
directed movement toward the plus end along actin
filaments. A cyclical interaction between myosin and
actin provides the driving force. Rates of ATP
hydrolysis and consequently the speed of movement along
actin filaments vary widely, from about 0.04 micrometer
per second for myosin I to 4.5 micrometer per second for
myosin II in skeletal muscle. Myosin II moves in
discrete steps about 5-10 nm long and generates 1-5
piconewtons of force. Upon ATP binding, the myosin head
dissociates from an actin filament. ATP hydrolysis
causes the head to pivot and associate with a new actin
subunit. The release of Pi causes the head to pivot and
move the filament (power stroke). Release of ADP
completes the cycle.
Length = 767
Score = 39.8 bits (93), Expect = 3e-04
Identities = 21/63 (33%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 23 FEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLLC 82
FE+LC NY E LQ F+ F E Y +EG+ +EF L E P +
Sbjct: 397 FEELCHNYLQERLQLLFHHRTFVQPLERYAEEGVE---VEFD-------LAEPSPGTTVA 446
Query: 83 VLD 85
++D
Sbjct: 447 LVD 449
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 39.0 bits (91), Expect = 6e-04
Identities = 25/68 (36%), Positives = 40/68 (58%), Gaps = 2/68 (2%)
Query: 88 AKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
AKETE+ +L D E E E ++ +EE++EEEEE+++ E KE ++EE EE+
Sbjct: 364 AKETEQDYIL--DAFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEED 421
Query: 148 ETKYLVRD 155
+ + D
Sbjct: 422 DVESKYED 429
Score = 30.5 bits (69), Expect = 0.44
Identities = 19/52 (36%), Positives = 32/52 (61%)
Query: 87 QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
A E E A +D +++EEE+EEE+E+E +E ++EE E+ + E K +
Sbjct: 377 SALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYE 428
Score = 28.2 bits (63), Expect = 2.5
Identities = 20/58 (34%), Positives = 29/58 (50%), Gaps = 4/58 (6%)
Query: 95 GVLYED-NIKKEEEEKEEEE---EEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
L+E N +K + KE E+ + E E+ + EEEE E+ EEEE E+E
Sbjct: 350 QALWETLNKQKWTKAKETEQDYILDAFSALEIEDANTERDDEEEEDEEEEEEEDEDEG 407
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 38.7 bits (90), Expect = 6e-04
Identities = 21/65 (32%), Positives = 34/65 (52%)
Query: 84 LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
L+ QA+E EK + K+ E+ E+ + E+ ++ EEK+K+ EE K K E
Sbjct: 73 LEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEA 132
Query: 144 KEEEE 148
K + E
Sbjct: 133 KAKAE 137
Score = 34.8 bits (80), Expect = 0.012
Identities = 15/59 (25%), Positives = 31/59 (52%)
Query: 86 DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
+A E + L + ++ ++ E+ ++ +E++K+ EE K K+ E K K E E +
Sbjct: 83 QRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAE 141
Score = 32.9 bits (75), Expect = 0.059
Identities = 15/66 (22%), Positives = 36/66 (54%)
Query: 86 DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
AK+ ++ E ++ E+++ E+ + + E++ E+ K+ E+ ++ EE++K+
Sbjct: 61 PAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQ 120
Query: 146 EEETKY 151
EE K
Sbjct: 121 AEEAKA 126
Score = 31.0 bits (70), Expect = 0.25
Identities = 20/62 (32%), Positives = 28/62 (45%), Gaps = 3/62 (4%)
Query: 87 QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
E E E+ K+ EEE + + E K K+ E K+K E E K K E + K +
Sbjct: 134 AKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAK---KKAAEAKKKAEAEAKAKAEAKAKAK 190
Query: 147 EE 148
E
Sbjct: 191 AE 192
Score = 27.9 bits (62), Expect = 2.4
Identities = 21/80 (26%), Positives = 35/80 (43%), Gaps = 14/80 (17%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEK--------------EEEEEKEK 130
+ AK+ E+A E+ K+ EE K ++ E + E EEE + +
Sbjct: 101 EKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKA 160
Query: 131 KEEEEKEKMEEEEKEEEETK 150
E +K+ E ++K E E K
Sbjct: 161 AAEAKKKAAEAKKKAEAEAK 180
Score = 27.9 bits (62), Expect = 2.7
Identities = 16/66 (24%), Positives = 34/66 (51%), Gaps = 1/66 (1%)
Query: 86 DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKK-EEEEKEKMEEEEK 144
++ K+ E+ E E+ ++E E+ E+ ++ E+ K+ EE++K+ E + K
Sbjct: 68 ERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAK 127
Query: 145 EEEETK 150
+ E K
Sbjct: 128 QAAEAK 133
Score = 26.7 bits (59), Expect = 6.8
Identities = 17/52 (32%), Positives = 24/52 (46%), Gaps = 3/52 (5%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
E+ K E +++ E K K E E K K E + K K EE + + E K
Sbjct: 154 EEAKAKAAAEAKKKAAEAKK---KAEAEAKAKAEAKAKAKAEEAKAKAEAAK 202
Score = 26.0 bits (57), Expect = 9.9
Identities = 11/51 (21%), Positives = 33/51 (64%), Gaps = 2/51 (3%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKK--EEEEKEKMEEEEKEEEETK 150
I+++++ ++E+E K+ E++ EE ++++ E+ ++++E+ E+ K
Sbjct: 55 IQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAK 105
>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
Reviewed.
Length = 460
Score = 38.8 bits (90), Expect = 6e-04
Identities = 7/50 (14%), Positives = 17/50 (34%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
D +++ + + + + + E+ E +E E E E
Sbjct: 370 DIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEK 419
Score = 35.0 bits (80), Expect = 0.012
Identities = 9/52 (17%), Positives = 16/52 (30%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
D + + + E E+ E +E E E E+ + K
Sbjct: 377 GDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPIPDPAK 428
Score = 35.0 bits (80), Expect = 0.013
Identities = 10/59 (16%), Positives = 18/59 (30%)
Query: 90 ETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
ET +A + E + + + EE E ++ + E EK
Sbjct: 365 ETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPI 423
Score = 34.2 bits (78), Expect = 0.024
Identities = 8/50 (16%), Positives = 18/50 (36%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
E+ + + E ++ + + + E EE + E +E E
Sbjct: 364 EETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETE 413
Score = 32.3 bits (73), Expect = 0.10
Identities = 10/45 (22%), Positives = 16/45 (35%), Gaps = 1/45 (2%)
Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
+ + E EE E +E E E EK + + +E
Sbjct: 388 QVDAEAASAAPEEPAALASEAHDETEP-EVPEKAAPIPDPAKPDE 431
Score = 28.4 bits (63), Expect = 2.1
Identities = 9/52 (17%), Positives = 16/52 (30%), Gaps = 1/52 (1%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEE-EKEKKEEEEKEKMEEEEKEEEET 149
+ E EE + E +E E E +K + + +E
Sbjct: 386 AHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPIPDPAKPDELAVAGP 437
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 38.2 bits (89), Expect = 8e-04
Identities = 24/63 (38%), Positives = 39/63 (61%)
Query: 86 DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
DQA E+ +D E EE +EEE+E E+KE++ + EK++EE +E+ EEE++E
Sbjct: 42 DQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEE 101
Query: 146 EEE 148
+
Sbjct: 102 SSD 104
Score = 33.2 bits (76), Expect = 0.037
Identities = 21/65 (32%), Positives = 35/65 (53%)
Query: 86 DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
D + E V E+ E+KE++ + E ++EE EEE E+E +E ++ + E EEK
Sbjct: 55 DDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKT 114
Query: 146 EEETK 150
E +
Sbjct: 115 ESNVE 119
Score = 32.0 bits (73), Expect = 0.089
Identities = 25/59 (42%), Positives = 34/59 (57%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
++ KE EK ED K + EKE+EE EE EEE EE ++ +KE EEK + E+
Sbjct: 62 IEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVEK 120
Score = 30.5 bits (69), Expect = 0.28
Identities = 17/52 (32%), Positives = 30/52 (57%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
+ ++E +K +++E EE KEEE+E E++E + E+E EE E +
Sbjct: 43 QAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEE 94
Score = 30.1 bits (68), Expect = 0.34
Identities = 18/67 (26%), Positives = 37/67 (55%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
D + E E+ ++ E++E + + E+E +E E+E EEE E+ +E +++ EE+ +
Sbjct: 56 DQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTE 115
Query: 145 EEEETKY 151
E +
Sbjct: 116 SNVEKEI 122
Score = 30.1 bits (68), Expect = 0.37
Identities = 19/52 (36%), Positives = 33/52 (63%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
D +E+E ++ +++E E E+ +EEEKE E+KE + EKE+EE++
Sbjct: 41 SDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESE 92
Score = 29.3 bits (66), Expect = 0.78
Identities = 19/48 (39%), Positives = 30/48 (62%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
KE++ E+E+EE +EE +EE+EE + E+E E+ E E+E T
Sbjct: 77 DKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVEKEITN 124
Score = 27.0 bits (60), Expect = 4.1
Identities = 19/52 (36%), Positives = 34/52 (65%), Gaps = 5/52 (9%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEE-----EKEKKEEEEKEKMEEEEKEEEET 149
++E ++ +++E E++E ++EE+E +KE K + EKE E EE+ EEE
Sbjct: 48 EQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEED 99
Score = 25.9 bits (57), Expect = 9.3
Identities = 13/53 (24%), Positives = 28/53 (52%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
+ + KE + ++ ++E EE++EE +E ++E +E+ E +KE
Sbjct: 74 NSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVEKEITNPS 126
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 37.3 bits (87), Expect = 0.002
Identities = 16/48 (33%), Positives = 28/48 (58%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYL 152
EE E+E + E E E K ++E+E EE+ EE+++ ++T+ L
Sbjct: 233 EEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDL 280
Score = 36.9 bits (86), Expect = 0.003
Identities = 19/67 (28%), Positives = 32/67 (47%), Gaps = 4/67 (5%)
Query: 86 DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEE----KEEEEEKEKKEEEEKEKMEE 141
+ ETE+ + I+ E K ++E+E EE EE+E K E+ +
Sbjct: 227 KETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEIL 286
Query: 142 EEKEEEE 148
+EK++EE
Sbjct: 287 KEKKDEE 293
Score = 35.0 bits (81), Expect = 0.011
Identities = 17/61 (27%), Positives = 29/61 (47%)
Query: 91 TEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
T+K E ++EE + E E E K ++E+E E+ E+ E+ +K E+ K
Sbjct: 223 TKKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDK 282
Query: 151 Y 151
Sbjct: 283 L 283
Score = 30.0 bits (68), Expect = 0.54
Identities = 13/56 (23%), Positives = 25/56 (44%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHL 158
E E E + +E+E EE+ EE+++ + E+ ++ E +D L
Sbjct: 239 TDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEILKEKKDEEL 294
Score = 30.0 bits (68), Expect = 0.61
Identities = 16/38 (42%), Positives = 27/38 (71%)
Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
E+K+E + + EE+ ++EE+KEK + EEKE++E E
Sbjct: 648 EKKKEFKILDYTEEKTKKEEKKEKNKREEKERIEIAET 685
Score = 29.2 bits (66), Expect = 0.98
Identities = 14/32 (43%), Positives = 25/32 (78%), Gaps = 1/32 (3%)
Query: 102 IKKEEEEKE-EEEEEEVKEEEKEEEEEKEKKE 132
I+K++E K + EE+ K+EEK+E+ ++E+KE
Sbjct: 647 IEKKKEFKILDYTEEKTKKEEKKEKNKREEKE 678
Score = 27.3 bits (61), Expect = 5.0
Identities = 14/31 (45%), Positives = 22/31 (70%)
Query: 115 EEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
E+ KE + + E++ K+EE+KEK + EEKE
Sbjct: 648 EKKKEFKILDYTEEKTKKEEKKEKNKREEKE 678
>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein. This
protein is covalently attached to the terminii of
replicating DNA in vivo.
Length = 548
Score = 36.9 bits (86), Expect = 0.002
Identities = 22/31 (70%), Positives = 26/31 (83%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEE 135
E E+EEEEEEEV EEE+EEEEE+E+ EEE
Sbjct: 303 PEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333
Score = 32.7 bits (75), Expect = 0.066
Identities = 17/33 (51%), Positives = 23/33 (69%)
Query: 110 EEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
E EE +EEE+E EE+E++EEEE+ EEE
Sbjct: 301 SPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333
Score = 32.7 bits (75), Expect = 0.078
Identities = 21/38 (55%), Positives = 24/38 (63%), Gaps = 4/38 (10%)
Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
E EEEEE +EE EEEEE EEEE+E+ EEE
Sbjct: 300 PSPPEPEEEEEEEEEVPEEEEE----EEEEEERTFEEE 333
Score = 31.2 bits (71), Expect = 0.24
Identities = 17/34 (50%), Positives = 23/34 (67%)
Query: 114 EEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
+ EE+EEEEE+ +EEEE+E+ EE EEE
Sbjct: 300 PSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333
Score = 29.6 bits (67), Expect = 0.72
Identities = 19/34 (55%), Positives = 23/34 (67%)
Query: 115 EEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
E E+EEEEE+E EEEE+E+ EEE EEE
Sbjct: 300 PSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333
Score = 29.6 bits (67), Expect = 0.85
Identities = 16/34 (47%), Positives = 22/34 (64%)
Query: 113 EEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
EEE+EEEEE ++EEEE+E+ E +EE
Sbjct: 300 PSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333
Score = 28.8 bits (65), Expect = 1.3
Identities = 15/32 (46%), Positives = 21/32 (65%)
Query: 121 EKEEEEEKEKKEEEEKEKMEEEEKEEEETKYL 152
E E+E++EEEE + EEEE+EEEE +
Sbjct: 300 PSPPEPEEEEEEEEEVPEEEEEEEEEEERTFE 331
Score = 28.8 bits (65), Expect = 1.4
Identities = 15/34 (44%), Positives = 22/34 (64%)
Query: 123 EEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDV 156
E +E++EEEE+ EEEE+EEEE + +V
Sbjct: 301 SPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEEV 334
Score = 28.5 bits (64), Expect = 1.7
Identities = 16/30 (53%), Positives = 21/30 (70%)
Query: 120 EEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
E EE+E++EEE E+ EEEE+EEE T
Sbjct: 300 PSPPEPEEEEEEEEEVPEEEEEEEEEEERT 329
Score = 28.1 bits (63), Expect = 2.4
Identities = 13/32 (40%), Positives = 20/32 (62%)
Query: 116 EVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
+ EEEE+E++E E+E+ EEEE+E
Sbjct: 298 PPPSPPEPEEEEEEEEEVPEEEEEEEEEEERT 329
>gnl|CDD|219956 pfam08658, Rad54_N, Rad54 N terminal. This is the N terminal of
the DNA repair protein Rad54.
Length = 191
Score = 36.6 bits (85), Expect = 0.003
Identities = 23/60 (38%), Positives = 31/60 (51%), Gaps = 7/60 (11%)
Query: 92 EKAGVLYE---DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEE----KEKMEEEEK 144
E A VLY+ D+ K EEEK E+++E + E K K K E K+K+EE K
Sbjct: 131 EFAIVLYDPTVDDKPKIEEEKAEKDQEPEESETKLSNGPKVHKSLAEILGIKKKVEERPK 190
Score = 28.1 bits (63), Expect = 1.7
Identities = 8/28 (28%), Positives = 16/28 (57%)
Query: 123 EEEEEKEKKEEEEKEKMEEEEKEEEETK 150
++ + +E+K E+++E E E K K
Sbjct: 143 DKPKIEEEKAEKDQEPEESETKLSNGPK 170
>gnl|CDD|212055 cd11486, SLC5sbd_SGLT1, Na(+)/glucose cotransporter SGLT1;solute
binding domain. Human SGLT1 (hSGLT1) is a
high-affinity/low-capacity glucose transporter, which
can also transport galactose. In the transport
mechanism, two Na+ ions first bind to the extracellular
side of the transporter and induce a conformational
change in the glucose binding site. This results in an
increased affinity for glucose. A second conformational
change in the transporter follows, bringing the Na+ and
glucose binding sites to the inner surface of the
membrane. Glucose is then released, followed by the Na+
ions. In the process, hSGLT1 is also able to transport
water and urea and may be a major pathway for transport
of these across the intestinal brush-border membrane.
hSGLT1 is encoded by the SLC5A1 gene and expressed
mostly in the intestine, but also in the trachea,
kidney, heart, brain, testis, and prostate. The
WHO/UNICEF oral rehydration solution (ORS) for the
treatment of secretory diarrhea contains salt and
glucose. The glucose, along with sodium ions, is
transported by hSGLT1 and water is either co-transported
along with these or follows by osmosis. Mutations in
SGLT1 are associated with intestinal glucose galactose
malabsorption (GGM). Up-regulation of intestinal SGLT1
may protect against enteric infections. SGLT1 is
expressed in colorectal, head and neck, and prostate
tumors. Epidermal growth factor receptor (EGFR)
functions in cell survival by stabilizing SGLT1, and
thereby maintaining intracellular glucose levels. SGLT1
is predicted to have 14 membrane-spanning regions. This
subgroup belongs to the solute carrier 5
(SLC5)transporter family.
Length = 635
Score = 37.2 bits (86), Expect = 0.003
Identities = 14/64 (21%), Positives = 29/64 (45%), Gaps = 14/64 (21%)
Query: 101 NIKKEEEEKE-EEEEEEVKEEEKEEEEEKEKK-------------EEEEKEKMEEEEKEE 146
N +E + + ++ E+ E E E +EE++K ++ + K+ EEE+
Sbjct: 539 NSTEERIDLDADDWTEDEDENEMETDEERKKPGCCRKAYNWFCGFDQGKAPKLTEEEEAA 598
Query: 147 EETK 150
+ K
Sbjct: 599 LKMK 602
>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
[Transcription / DNA replication, recombination, and
repair / Chromatin structure and dynamics].
Length = 1001
Score = 36.9 bits (85), Expect = 0.003
Identities = 23/74 (31%), Positives = 42/74 (56%), Gaps = 1/74 (1%)
Query: 74 EGKPNGLLCVLDDQAKETEKAGVLYEDNIKKEEEEKEEEEE-EEVKEEEKEEEEEKEKKE 132
+G + L+ DD++ E+E+ YE + E +E +E+EE +E E+ E+E E + +
Sbjct: 917 DGGWSFLMVGSDDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSD 976
Query: 133 EEEKEKMEEEEKEE 146
EE+ E +E E +
Sbjct: 977 EEDGEDWDELESKA 990
Score = 27.7 bits (61), Expect = 3.7
Identities = 14/50 (28%), Positives = 29/50 (58%)
Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
+++ +E EEEV E E ++E ++ +E+E+ E+ E+E++ D
Sbjct: 927 SDDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSD 976
Score = 27.3 bits (60), Expect = 4.8
Identities = 14/47 (29%), Positives = 25/47 (53%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
++E +E EEE + E ++E E E+EE ++ E+ E+E
Sbjct: 927 SDDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESEND 973
>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
Length = 449
Score = 36.3 bits (84), Expect = 0.004
Identities = 19/63 (30%), Positives = 31/63 (49%)
Query: 86 DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
DQ E K L ++E ++E EE E ++ E+ EE+ + ++ E E+E E
Sbjct: 377 DQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDE 436
Query: 146 EEE 148
EE
Sbjct: 437 LEE 439
Score = 32.8 bits (75), Expect = 0.054
Identities = 16/53 (30%), Positives = 25/53 (47%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
ED E + EEEE +E E+ E E+ ++ EE+ E E+E +
Sbjct: 381 EDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFED 433
Score = 31.3 bits (71), Expect = 0.18
Identities = 16/52 (30%), Positives = 28/52 (53%)
Query: 97 LYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
L + ++ E E+ E EE+ + E E E ++E+E E+ + EE+EE
Sbjct: 398 LQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEEEEEAR 449
Score = 30.9 bits (70), Expect = 0.31
Identities = 17/65 (26%), Positives = 31/65 (47%)
Query: 83 VLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
V + ++ E+ + E + E E+ E EE+ E E++ + E+E +E EE
Sbjct: 385 VAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEE 444
Query: 143 EKEEE 147
E+E
Sbjct: 445 EEEAR 449
Score = 30.5 bits (69), Expect = 0.32
Identities = 19/63 (30%), Positives = 29/63 (46%)
Query: 101 NIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLES 160
+IK E +E E+ +V E + EEE+ + E E+ E+ + EE L L
Sbjct: 369 DIKDSAEYDQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPE 428
Query: 161 CEF 163
EF
Sbjct: 429 DEF 431
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 36.3 bits (83), Expect = 0.005
Identities = 24/62 (38%), Positives = 38/62 (61%)
Query: 89 KETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
K E+ + + KK EE+K++ EE + EE++++ E KKE EE +K EE +K+E E
Sbjct: 1654 KAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAE 1713
Query: 149 TK 150
K
Sbjct: 1714 EK 1715
Score = 35.5 bits (81), Expect = 0.010
Identities = 30/78 (38%), Positives = 46/78 (58%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
++AK+ E+ E+ K EE+EK+ E + + EE ++ EE +KKE EEK+K EE +K
Sbjct: 1664 AEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKK 1723
Query: 145 EEEETKYLVRDVHLESCE 162
EEE K + E+ E
Sbjct: 1724 AEEENKIKAEEAKKEAEE 1741
Score = 34.7 bits (79), Expect = 0.019
Identities = 23/64 (35%), Positives = 41/64 (64%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
+++ K+ E+ + KK EE K+ EEE ++K E+ ++ E++KK+ EE +K EE+EK
Sbjct: 1629 EEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEK 1688
Query: 145 EEEE 148
+ E
Sbjct: 1689 KAAE 1692
Score = 33.6 bits (76), Expect = 0.042
Identities = 27/78 (34%), Positives = 41/78 (52%)
Query: 73 VEGKPNGLLCVLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKE 132
+ K L ++ K+ E+ E+N K EE ++ EE++ K EE ++ EE EKK
Sbjct: 1631 EKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKA 1690
Query: 133 EEEKEKMEEEEKEEEETK 150
E +K EE K+ EE K
Sbjct: 1691 AEALKKEAEEAKKAEELK 1708
Score = 33.6 bits (76), Expect = 0.047
Identities = 28/67 (41%), Positives = 42/67 (62%)
Query: 84 LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
+++ K E+ + + KK EE K + EE + EEEK++ E+ +KKE EEK+K EE +
Sbjct: 1594 IEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELK 1653
Query: 144 KEEEETK 150
K EEE K
Sbjct: 1654 KAEEENK 1660
Score = 32.8 bits (74), Expect = 0.082
Identities = 24/64 (37%), Positives = 37/64 (57%)
Query: 87 QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
A E E A E KK+EE K++ + + K EEK++ +E +KK EE+K+K +E +K
Sbjct: 1355 AADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAA 1414
Query: 147 EETK 150
K
Sbjct: 1415 AAKK 1418
Score = 32.4 bits (73), Expect = 0.093
Identities = 24/62 (38%), Positives = 39/62 (62%)
Query: 89 KETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
K E+ + + K+ EE+K+ EE ++ +EE K + E+ KK EE+K+K EE +K EE+
Sbjct: 1627 KAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEED 1686
Query: 149 TK 150
K
Sbjct: 1687 EK 1688
Score = 32.0 bits (72), Expect = 0.12
Identities = 27/62 (43%), Positives = 36/62 (58%)
Query: 89 KETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
K E E K+ EE K+ EE ++ + EEK++ EE +K EEE K K EE +KE EE
Sbjct: 1682 KAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEE 1741
Query: 149 TK 150
K
Sbjct: 1742 DK 1743
Score = 32.0 bits (72), Expect = 0.14
Identities = 28/62 (45%), Positives = 42/62 (67%)
Query: 89 KETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
KE E+A E K+ EE+K+ EE ++ +EE K + EE +K+ EE+K+K EE +K+EEE
Sbjct: 1696 KEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEE 1755
Query: 149 TK 150
K
Sbjct: 1756 KK 1757
Score = 32.0 bits (72), Expect = 0.15
Identities = 26/66 (39%), Positives = 42/66 (63%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
++AK+ E+ + KK EE K+ EEE ++K EE ++E E++KK+ EE +K EEE+K
Sbjct: 1698 AEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKK 1757
Query: 145 EEEETK 150
+ K
Sbjct: 1758 KIAHLK 1763
Score = 31.6 bits (71), Expect = 0.19
Identities = 19/51 (37%), Positives = 32/51 (62%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
+ KK+ EE ++ +E + K EE ++ +E +KK EE K+K +E +K E K
Sbjct: 1460 EEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKK 1510
Score = 30.5 bits (68), Expect = 0.48
Identities = 26/61 (42%), Positives = 35/61 (57%)
Query: 90 ETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
E +K L + K+ EE EE ++ EEEK+ + E+ KK EE K K EE +K EEE
Sbjct: 1573 EEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEK 1632
Query: 150 K 150
K
Sbjct: 1633 K 1633
Score = 29.7 bits (66), Expect = 0.68
Identities = 26/66 (39%), Positives = 43/66 (65%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
+++ + E+A E IK EE +K EEE+++V++ +K+E EEK+K EE +K + E + K
Sbjct: 1603 EEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIK 1662
Query: 145 EEEETK 150
EE K
Sbjct: 1663 AAEEAK 1668
Score = 29.7 bits (66), Expect = 0.79
Identities = 23/65 (35%), Positives = 38/65 (58%)
Query: 86 DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
D+AK+ + D +KK K++ +E + K EEK++ +E +KK EE K+ E ++K
Sbjct: 1394 DEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKA 1453
Query: 146 EEETK 150
EE K
Sbjct: 1454 EEAKK 1458
Score = 29.3 bits (65), Expect = 0.95
Identities = 23/64 (35%), Positives = 40/64 (62%)
Query: 87 QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
+A E +KA + + K+ EEK++ +E + K EE ++ +E +KK EE K+K + +K+
Sbjct: 1279 KADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKA 1338
Query: 147 EETK 150
EE K
Sbjct: 1339 EEAK 1342
Score = 29.3 bits (65), Expect = 1.1
Identities = 19/51 (37%), Positives = 32/51 (62%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
D KK+ EE ++ +E + K EE +++ ++ KK E K+K +E +K EE K
Sbjct: 1473 DEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKK 1523
Score = 29.0 bits (64), Expect = 1.5
Identities = 19/51 (37%), Positives = 33/51 (64%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
D KK+ EEK++ +E + K EE +++ ++ KK K+K +E +K+ EE K
Sbjct: 1381 DAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKK 1431
Score = 28.6 bits (63), Expect = 2.1
Identities = 21/65 (32%), Positives = 39/65 (60%)
Query: 86 DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
++AK+ A + KK +E K++ EE++ K +E ++ +KK +E K+K EE++K
Sbjct: 1374 EEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKA 1433
Query: 146 EEETK 150
+E K
Sbjct: 1434 DEAKK 1438
Score = 28.2 bits (62), Expect = 2.2
Identities = 20/62 (32%), Positives = 38/62 (61%)
Query: 89 KETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
E E A E +K E ++++EE + K + +++ E++KK +E K+K EE++K+ +E
Sbjct: 1350 AEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADE 1409
Query: 149 TK 150
K
Sbjct: 1410 LK 1411
Score = 27.8 bits (61), Expect = 3.0
Identities = 21/51 (41%), Positives = 32/51 (62%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
D KK+ EE ++ +E + K EE ++ EE +KK EE K+ E ++K EE K
Sbjct: 1434 DEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKK 1484
Score = 27.8 bits (61), Expect = 3.0
Identities = 21/51 (41%), Positives = 32/51 (62%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
D KK+ EE ++ EE + K EE ++ +E +KK EE K+ E ++K EE K
Sbjct: 1447 DEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKK 1497
Score = 27.8 bits (61), Expect = 3.6
Identities = 21/51 (41%), Positives = 33/51 (64%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
D KK+ EEK++ +E + K EE ++ +E +KK EE K+ E ++K EE K
Sbjct: 1421 DEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKK 1471
Score = 27.8 bits (61), Expect = 3.7
Identities = 18/51 (35%), Positives = 29/51 (56%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
D KK+ EE ++ +E + K EE +++ + KK+ EE +K E K E E
Sbjct: 1305 DEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAA 1355
Score = 27.4 bits (60), Expect = 4.4
Identities = 20/62 (32%), Positives = 38/62 (61%)
Query: 89 KETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
K+ E+A + KK EE+K+ +E ++ EE+K++ +E +K +K+ E ++K EE+
Sbjct: 1371 KKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEK 1430
Query: 149 TK 150
K
Sbjct: 1431 KK 1432
Score = 27.0 bits (59), Expect = 5.7
Identities = 21/57 (36%), Positives = 39/57 (68%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
E+ IK EE +KE EE+++ EE K++EEEK+K +KE+ ++ E+ +E + ++ +
Sbjct: 1727 ENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEE 1783
Score = 27.0 bits (59), Expect = 5.8
Identities = 23/69 (33%), Positives = 41/69 (59%), Gaps = 4/69 (5%)
Query: 86 DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEE----EEKEKMEE 141
++AK+ ++A E+ K +E +K+ EE ++ +E K+ E K+K +E EE +K +E
Sbjct: 1467 EEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADE 1526
Query: 142 EEKEEEETK 150
+K EE K
Sbjct: 1527 AKKAEEAKK 1535
Score = 27.0 bits (59), Expect = 5.9
Identities = 17/48 (35%), Positives = 27/48 (56%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
KK EE K+ +E ++ EE K++ + +KK EE K+ E + E E
Sbjct: 1309 KKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAA 1356
Score = 27.0 bits (59), Expect = 6.4
Identities = 22/66 (33%), Positives = 37/66 (56%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
++ K E A E + E +E+ E E K+EE +++ + KK+ EEK+K +E +K
Sbjct: 1339 EEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKK 1398
Query: 145 EEEETK 150
+ EE K
Sbjct: 1399 KAEEDK 1404
Score = 26.6 bits (58), Expect = 7.3
Identities = 22/70 (31%), Positives = 42/70 (60%), Gaps = 5/70 (7%)
Query: 86 DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEE-----EEKEKME 140
++ K+ ++A E++ KK +E K+ ++ +E K++ EEK+K +E EE +K +
Sbjct: 1388 EEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKAD 1447
Query: 141 EEEKEEEETK 150
E +K+ EE K
Sbjct: 1448 EAKKKAEEAK 1457
Score = 26.6 bits (58), Expect = 7.5
Identities = 22/57 (38%), Positives = 39/57 (68%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
ED K EE +K+EEE++++ +KEEE++ E+ +E++ +EEE EE+E + + D
Sbjct: 1741 EDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVD 1797
Score = 26.3 bits (57), Expect = 9.6
Identities = 19/64 (29%), Positives = 34/64 (53%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
D+ K+ E+A + KK EE K+ E + + E +E E +++ E EK +EE K
Sbjct: 1318 DEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAK 1377
Query: 145 EEEE 148
++ +
Sbjct: 1378 KKAD 1381
>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740). This
family consists of several uncharacterized plant
proteins of unknown function.
Length = 565
Score = 36.2 bits (83), Expect = 0.005
Identities = 22/53 (41%), Positives = 35/53 (66%), Gaps = 2/53 (3%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEE--EEEKEKKEEEEKEKMEEEEKEEEETK 150
D + +EEEE E EE+EE E+E + +E+ E++EEEE + M++ E +TK
Sbjct: 137 DLVLEEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEELKTMKDFIDLESQTK 189
Score = 34.2 bits (78), Expect = 0.023
Identities = 23/50 (46%), Positives = 31/50 (62%), Gaps = 3/50 (6%)
Query: 111 EEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLES 160
EEEEEV+ EE EE EKE + +++ EE+EEEE K + + LES
Sbjct: 140 LEEEEEVEMEEDEEYYEKEPGKVVDEKS---EEEEEEELKTMKDFIDLES 186
Score = 30.0 bits (67), Expect = 0.53
Identities = 17/46 (36%), Positives = 29/46 (63%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
E+ +E+EE E+E +V +E+ EEEEE+E K ++ +E + K
Sbjct: 144 EEVEMEEDEEYYEKEPGKVVDEKSEEEEEEELKTMKDFIDLESQTK 189
Score = 27.3 bits (60), Expect = 4.6
Identities = 16/43 (37%), Positives = 23/43 (53%), Gaps = 3/43 (6%)
Query: 90 ETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKE 132
E E+ + ED E+E + +E K EE+EEEE K K+
Sbjct: 141 EEEEEVEMEEDEEYYEKEPGKVVDE---KSEEEEEEELKTMKD 180
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 36.0 bits (83), Expect = 0.005
Identities = 24/45 (53%), Positives = 33/45 (73%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
+ EEE+EEEEEEE +E+E EEEE ++++EEEE E E+E E
Sbjct: 443 ESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEME 487
Score = 36.0 bits (83), Expect = 0.005
Identities = 24/50 (48%), Positives = 36/50 (72%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
+++ ++E E+EEEEEEE +EEE+E EEE+ + EEEE+E + EEE
Sbjct: 437 QESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEM 486
Score = 34.9 bits (80), Expect = 0.013
Identities = 24/48 (50%), Positives = 38/48 (79%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
++EE +EEEEEEE +EEE++E EE+E ++EEE+E++E + EEE +
Sbjct: 440 EEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEME 487
Score = 34.9 bits (80), Expect = 0.014
Identities = 24/48 (50%), Positives = 33/48 (68%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
+E +EEE EE +EEE+EEEEE+++ EEEE E EEEE+ E +
Sbjct: 434 MASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNG 481
Score = 34.1 bits (78), Expect = 0.023
Identities = 26/64 (40%), Positives = 40/64 (62%)
Query: 87 QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
Q E E++ E+ ++EEEE++E EEEE ++EE+EEE E + EEE E E + +
Sbjct: 437 QESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDG 496
Query: 147 EETK 150
EE +
Sbjct: 497 EEPE 500
Score = 31.8 bits (72), Expect = 0.13
Identities = 25/67 (37%), Positives = 34/67 (50%)
Query: 89 KETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
+E E E+ ++EEEE+E EEEE EEE+EE E EEE + E + EE
Sbjct: 440 EEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEP 499
Query: 149 TKYLVRD 155
+ R
Sbjct: 500 EEDAERR 506
Score = 30.3 bits (68), Expect = 0.51
Identities = 19/69 (27%), Positives = 33/69 (47%)
Query: 82 CVLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEE 141
++ +E E+ E+ ++ EEE+ E+EEEE + E EE+ + E EE
Sbjct: 439 SEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEE 498
Query: 142 EEKEEEETK 150
E++ E
Sbjct: 499 PEEDAERRN 507
Score = 28.7 bits (64), Expect = 1.7
Identities = 21/44 (47%), Positives = 31/44 (70%), Gaps = 2/44 (4%)
Query: 112 EEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
+E EE + E+EEEEE+E++EEE++ EEEE E+EE + V
Sbjct: 437 QESEEEESVEEEEEEEEEEEEEEQES--EEEEGEDEEEEEEVEA 478
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 36.3 bits (84), Expect = 0.005
Identities = 26/48 (54%), Positives = 36/48 (75%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
K+ E+E +EEEEE KEE+KEEEE+ KEEE E+ E+EEK+++ K
Sbjct: 29 KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKK 76
Score = 35.5 bits (82), Expect = 0.008
Identities = 24/60 (40%), Positives = 39/60 (65%), Gaps = 2/60 (3%)
Query: 89 KETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
KE EK E+ +KEE+++EEE+ + +EE EEEE++EKK ++ +K++E E E
Sbjct: 29 KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKK--KKTKKVKETTTEWEL 86
Score = 34.0 bits (78), Expect = 0.026
Identities = 20/43 (46%), Positives = 33/43 (76%)
Query: 108 EKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
EKE E+E +EEE+E+EE+KE++E+ ++ E +E+EE+E K
Sbjct: 28 EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEK 70
Score = 32.0 bits (73), Expect = 0.12
Identities = 21/46 (45%), Positives = 37/46 (80%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
E ++KE ++EEEEE+E K+EE+E+ +KE++ +EE+EK E+++K
Sbjct: 28 EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKK 73
Score = 26.7 bits (59), Expect = 8.0
Identities = 19/54 (35%), Positives = 30/54 (55%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
+++ +E E+ E KEEE EEEE+EE K++ K+ +E + E K K
Sbjct: 38 EEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLNKTK 91
>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
Length = 746
Score = 36.2 bits (84), Expect = 0.005
Identities = 28/96 (29%), Positives = 41/96 (42%), Gaps = 13/96 (13%)
Query: 68 LC--LQLVEG---KPNGL-----LCVLDDQAKETEKAGVLYEDNIKKEEE---EKEEEEE 114
LC L LVEG + N L L +LD K E E N K +EE
Sbjct: 181 LCLDLDLVEGLLERKNELPHLKKLIILDTLIKSKEININKEEKNNGSNVNNNGNKNNKEE 240
Query: 115 EEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
++ + E E+ E +KEK+E+ + +E+ K
Sbjct: 241 QKGNDLSNELEDISLGPLEYDKEKLEKIKDLKEKAK 276
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 35.6 bits (82), Expect = 0.006
Identities = 15/62 (24%), Positives = 33/62 (53%)
Query: 87 QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
+ KE ++A L + ++E K+ E+E +E+K++ EE K+ ++++ EE +
Sbjct: 82 KKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKA 141
Query: 147 EE 148
Sbjct: 142 AA 143
Score = 35.6 bits (82), Expect = 0.007
Identities = 13/46 (28%), Positives = 33/46 (71%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
+E+ +K+E+++ E ++++ E+E+ K+ E+E+ +E++K+ EE
Sbjct: 77 AEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEE 122
Score = 34.4 bits (79), Expect = 0.016
Identities = 15/49 (30%), Positives = 30/49 (61%), Gaps = 3/49 (6%)
Query: 105 EEEEKEEEEEEEVKEEEKE---EEEEKEKKEEEEKEKMEEEEKEEEETK 150
E ++K+ E+E +K+ EKE +E+K++ EE K+ ++++ EE
Sbjct: 91 ELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAA 139
Score = 33.6 bits (77), Expect = 0.035
Identities = 18/63 (28%), Positives = 31/63 (49%), Gaps = 1/63 (1%)
Query: 87 QAKETEKAGVLYEDNIKKEEEEKE-EEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
QA E E+ L ++ + +E++K+ EE ++ ++K+ EE K K K E E K
Sbjct: 96 QAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKR 155
Query: 146 EEE 148
Sbjct: 156 AAA 158
Score = 30.2 bits (68), Expect = 0.51
Identities = 14/52 (26%), Positives = 25/52 (48%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
+ KK E E ++ E K++ + E K E ++K + E ++K E K
Sbjct: 165 AEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAK 216
Score = 30.2 bits (68), Expect = 0.53
Identities = 14/61 (22%), Positives = 26/61 (42%)
Query: 90 ETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
E + KK E +++ E E + E ++K + E ++K E ++K E
Sbjct: 164 AAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEA 223
Query: 150 K 150
K
Sbjct: 224 K 224
Score = 29.4 bits (66), Expect = 0.87
Identities = 8/48 (16%), Positives = 30/48 (62%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
++ +++++++ EE +++ E+E K+ ++E + ++++ EE +
Sbjct: 79 EQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQ 126
Score = 28.6 bits (64), Expect = 1.7
Identities = 17/67 (25%), Positives = 26/67 (38%), Gaps = 4/67 (5%)
Query: 86 DQAKETEKAGVLYEDNIKKEEEE---KEEEEEEEVKEEEKEEEEEKEKKEEEE-KEKMEE 141
+ AK+ E+ K K E E + K+ E +KK E E +K
Sbjct: 122 EAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAA 181
Query: 142 EEKEEEE 148
E K++ E
Sbjct: 182 EAKKKAE 188
Score = 28.6 bits (64), Expect = 1.7
Identities = 12/40 (30%), Positives = 26/40 (65%)
Query: 109 KEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
K EE+ + KE+++ EE ++++ E+E+ K E+E+ +
Sbjct: 75 KRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQ 114
Score = 27.1 bits (60), Expect = 5.0
Identities = 11/48 (22%), Positives = 32/48 (66%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
+++ ++ EE+ ++ ++++ EE ++K+ E+E +++E+E +E K
Sbjct: 70 QQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQK 117
Score = 26.3 bits (58), Expect = 8.4
Identities = 15/65 (23%), Positives = 25/65 (38%), Gaps = 3/65 (4%)
Query: 87 QAKETEKAGVLYEDNIKKEEEEKEEEEEE---EVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
Q K+ E+A K+ EE + + + E K +K E K+K E E
Sbjct: 116 QKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEA 175
Query: 144 KEEEE 148
++
Sbjct: 176 AKKAA 180
Score = 26.3 bits (58), Expect = 8.4
Identities = 14/50 (28%), Positives = 20/50 (40%), Gaps = 2/50 (4%)
Query: 103 KKEEEEKEEEEEEEVKEEE--KEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
+ EE K+ +++ EE K K K E E K +K E K
Sbjct: 119 QAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAK 168
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 36.2 bits (83), Expect = 0.006
Identities = 15/49 (30%), Positives = 37/49 (75%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
ED ++ ++++E+E ++++ EE++E+E++EEEE + ++E++E+E
Sbjct: 143 EDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDE 191
Score = 33.1 bits (75), Expect = 0.061
Identities = 16/52 (30%), Positives = 37/52 (71%)
Query: 97 LYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
L E+++ + ++++E+ E++ +EE+++E++EEEE+ K ++E EE+E
Sbjct: 140 LAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDE 191
Score = 32.7 bits (74), Expect = 0.085
Identities = 18/54 (33%), Positives = 37/54 (68%), Gaps = 5/54 (9%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEK-----KEEEEKEKMEEEEKEEEE 148
DN ++++++E+E+++ +E+++EEEEE+E+ E+EE E E+ E+ E
Sbjct: 149 DNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYEKSE 202
Score = 31.9 bits (72), Expect = 0.15
Identities = 18/51 (35%), Positives = 34/51 (66%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
+E+E E++++EE EEE+EEEEE + ++E++E E+ E+++ D
Sbjct: 157 DEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYEKSEVDKTD 207
Score = 29.6 bits (66), Expect = 0.89
Identities = 15/58 (25%), Positives = 32/58 (55%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLE 159
IK+ EE+ ++ +++++++ E+++ E+ +EEE+EEEE D E
Sbjct: 132 IKRRRARHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEE 189
Score = 29.6 bits (66), Expect = 0.94
Identities = 20/68 (29%), Positives = 39/68 (57%), Gaps = 6/68 (8%)
Query: 84 LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKE------EEEEKEKKEEEEKE 137
L ++ V+ +D+ ++E++ +EE++EE +EEE+E E+EE E E+ E
Sbjct: 140 LAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYE 199
Query: 138 KMEEEEKE 145
K E ++ +
Sbjct: 200 KSEVDKTD 207
Score = 26.9 bits (59), Expect = 7.0
Identities = 12/47 (25%), Positives = 27/47 (57%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
I K + EE+ + +++ E ++E++ ++ ++EE+EEEE
Sbjct: 131 IIKRRRARHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEE 177
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 35.7 bits (83), Expect = 0.007
Identities = 12/65 (18%), Positives = 39/65 (60%)
Query: 84 LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
+ +QA + + + + +++ ++E+++E +++ ++E++EKKE +E EK+ +++
Sbjct: 126 VLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDD 185
Query: 144 KEEEE 148
+
Sbjct: 186 DFVWD 190
Score = 34.6 bits (80), Expect = 0.015
Identities = 14/64 (21%), Positives = 39/64 (60%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
D + + L +D+I ++++++++E+++ + + E+EE+KE KE E+ ++
Sbjct: 130 ADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVW 189
Query: 145 EEEE 148
+E++
Sbjct: 190 DEDD 193
Score = 31.5 bits (72), Expect = 0.16
Identities = 8/64 (12%), Positives = 36/64 (56%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHL 158
+D+ ++++ ++++ ++ ++E ++E++ + ++E E+ +E ++ E+ +
Sbjct: 132 DDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDE 191
Query: 159 ESCE 162
+ E
Sbjct: 192 DDSE 195
Score = 31.1 bits (71), Expect = 0.23
Identities = 15/67 (22%), Positives = 36/67 (53%)
Query: 84 LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
LDD + + +++ ++ + E+EE++E KE EK +++ +E++ E + +
Sbjct: 142 LDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQAR 201
Query: 144 KEEEETK 150
K+ + T
Sbjct: 202 KDAKLTA 208
Score = 30.7 bits (70), Expect = 0.27
Identities = 11/63 (17%), Positives = 29/63 (46%)
Query: 83 VLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
DD + ED+ + ++++ ++E+EE KE ++ E+ + +++ E
Sbjct: 138 DDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEAL 197
Query: 143 EKE 145
+
Sbjct: 198 RQA 200
Score = 30.7 bits (70), Expect = 0.28
Identities = 12/56 (21%), Positives = 34/56 (60%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHL 158
+E++ E++++++V +E++E++E KE ++ + + +E + E + +D L
Sbjct: 151 DDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDAKL 206
Score = 30.3 bits (69), Expect = 0.38
Identities = 11/55 (20%), Positives = 33/55 (60%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLV 153
+D+ ++++E +++++ + ++EEK+E +E EK +++ +E++ E
Sbjct: 149 DDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKD 203
Score = 30.0 bits (68), Expect = 0.51
Identities = 9/66 (13%), Positives = 36/66 (54%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
D K + ++++ +++ +++ +++ ++E++ E ++++ + +EE+K
Sbjct: 114 KDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKK 173
Query: 145 EEEETK 150
E +E +
Sbjct: 174 EAKELE 179
Score = 29.6 bits (67), Expect = 0.67
Identities = 6/64 (9%), Positives = 37/64 (57%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
D+ + + N ++++ +++++ + + + ++++E + +++++ + +E+E+
Sbjct: 112 LDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEE 171
Query: 145 EEEE 148
++E
Sbjct: 172 KKEA 175
Score = 28.8 bits (65), Expect = 1.4
Identities = 9/63 (14%), Positives = 36/63 (57%)
Query: 86 DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
D + +D+ +++ ++++++ ++E +++++ + ++EE+KE E E+
Sbjct: 123 DIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLS 182
Query: 146 EEE 148
+++
Sbjct: 183 DDD 185
Score = 27.3 bits (61), Expect = 5.0
Identities = 11/71 (15%), Positives = 33/71 (46%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
DD + + +D+ ++E EE++E + E+ ++++ +++ E + ++
Sbjct: 145 DDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDA 204
Query: 145 EEEETKYLVRD 155
+ T V+
Sbjct: 205 KLTATADPVKA 215
Score = 26.9 bits (60), Expect = 5.5
Identities = 10/59 (16%), Positives = 35/59 (59%)
Query: 84 LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
DD + + + +D+ + ++E+ ++++ ++ EE+KE +E ++ ++++ E++
Sbjct: 135 DDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDD 193
Score = 26.5 bits (59), Expect = 8.6
Identities = 8/64 (12%), Positives = 40/64 (62%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
+ K+ + V D + + +++ ++++++++ +++ +++++ E +E++ + ++E
Sbjct: 110 NALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDED 169
Query: 145 EEEE 148
EE++
Sbjct: 170 EEKK 173
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 35.8 bits (83), Expect = 0.007
Identities = 15/56 (26%), Positives = 34/56 (60%)
Query: 93 KAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
G+ ED K +++EE +E+E++E +++K+ + K++ E+++KE E+
Sbjct: 530 NLGIRIEDKPDGPSVWKLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEK 585
Score = 28.8 bits (65), Expect = 1.2
Identities = 12/43 (27%), Positives = 25/43 (58%)
Query: 89 KETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKK 131
K +K + E K+ +E++ + + +EE+K++E EK +K
Sbjct: 546 KLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 35.2 bits (82), Expect = 0.009
Identities = 15/41 (36%), Positives = 26/41 (63%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
K+ ++E +E + + EEE+E++EEEE+E+ EEE
Sbjct: 280 DKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320
Score = 34.5 bits (80), Expect = 0.013
Identities = 15/40 (37%), Positives = 25/40 (62%)
Query: 109 KEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
++ +EE+KE + + +EEEE+E+ EEEE+ EE
Sbjct: 280 DKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEE 319
Score = 34.5 bits (80), Expect = 0.016
Identities = 14/35 (40%), Positives = 22/35 (62%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEE 134
+ +K+ + + E +EEE+EEEEE+E EEE
Sbjct: 286 EELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320
Score = 33.3 bits (77), Expect = 0.034
Identities = 14/39 (35%), Positives = 25/39 (64%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
+D + +E +E + + EE+EEEEE+E++EE +E
Sbjct: 281 KDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEE 319
Score = 32.9 bits (76), Expect = 0.046
Identities = 17/43 (39%), Positives = 28/43 (65%), Gaps = 2/43 (4%)
Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
+++ +EE +EV + + +E++EEEE+E EEEE EEE
Sbjct: 280 DKDALDEELKEVLSAQAQAAAAEEEEEEEEEE--EEEEPSEEE 320
Score = 31.4 bits (72), Expect = 0.15
Identities = 15/44 (34%), Positives = 25/44 (56%)
Query: 94 AGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
A L + + EE ++ + + E+EEEEE+E++EEE E
Sbjct: 275 AAALADKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSE 318
Score = 27.1 bits (61), Expect = 4.8
Identities = 17/37 (45%), Positives = 23/37 (62%)
Query: 84 LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEE 120
LD++ KE A ++EEEE+EEEEEE +EE
Sbjct: 284 LDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320
Score = 26.8 bits (60), Expect = 5.6
Identities = 14/38 (36%), Positives = 23/38 (60%), Gaps = 1/38 (2%)
Query: 113 EEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
+++ + EE KE + + E+E+ EEEE+EEEE
Sbjct: 280 DKDALDEELKEVLSAQAQAAAAEEEE-EEEEEEEEEEP 316
>gnl|CDD|233870 TIGR02442, Cob-chelat-sub, cobaltochelatase subunit.
Cobaltochelatase is responsible for the insertion of
cobalt into the corrin ring of coenzyme B12 during its
biosynthesis. Two versions have been well described.
CbiK/CbiX is a monomeric, anaerobic version which acts
early in the biosynthesis (pfam06180). CobNST is a
trimeric, ATP-dependent, aerobic version which acts late
in the biosynthesis (TIGR02257/TIGR01650/TIGR01651). A
number of genomes (actinobacteria, cyanobacteria,
betaproteobacteria and pseudomonads) which apparently
biosynthesize B12, encode a cobN gene but are
demonstrably lacking cobS and cobT. These genomes do,
however contain a homolog (modelled here) of the
magnesium chelatase subunits BchI/BchD family. Aside
from the cyanobacteria (which have a separate magnesium
chelatase trimer), these species do not make chlorins,
so do not have any use for a magnesium chelatase.
Furthermore, in nearly all cases the members of this
family are proximal to either CobN itself or other genes
involved in cobalt transport or B12 biosynthesis.
Length = 633
Score = 35.4 bits (82), Expect = 0.010
Identities = 13/52 (25%), Positives = 25/52 (48%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
E K+E++ EE+ EE + EK +E E + ++ + +E+E
Sbjct: 315 EQPQGKDEKDLEEKPEEPGPDPEKPDEGEDDAEQSGPRGHPTPGNDDEKEPD 366
>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
Length = 746
Score = 35.2 bits (82), Expect = 0.010
Identities = 16/52 (30%), Positives = 25/52 (48%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
E+ +KEE ++E EEE + + EE E +E+ + E EE E
Sbjct: 40 EEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAA 91
Score = 32.5 bits (75), Expect = 0.074
Identities = 19/39 (48%), Positives = 22/39 (56%)
Query: 112 EEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
EEEE KEE K E EE+ K E EE E EE+ + E
Sbjct: 39 EEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAA 77
Score = 32.1 bits (74), Expect = 0.12
Identities = 19/45 (42%), Positives = 22/45 (48%), Gaps = 1/45 (2%)
Query: 105 EEEEKE-EEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
EEEE EE + E +EE K E EE E EE+ K E E
Sbjct: 39 EEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAE 83
Score = 30.6 bits (70), Expect = 0.31
Identities = 15/40 (37%), Positives = 21/40 (52%)
Query: 111 EEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
EEEE +E ++E EEE + + EE EEE + E
Sbjct: 39 EEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAA 78
Score = 29.4 bits (67), Expect = 0.77
Identities = 15/41 (36%), Positives = 21/41 (51%)
Query: 110 EEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
EEEE + + + + EEE K + EE + EEE K E
Sbjct: 39 EEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAA 79
Score = 29.0 bits (66), Expect = 1.1
Identities = 15/34 (44%), Positives = 20/34 (58%)
Query: 117 VKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
V+EEE +EE K + EEE K + EE E E +
Sbjct: 38 VEEEEARKEEAKREAEEEAKAEAEEAAAAEAEEE 71
>gnl|CDD|218688 pfam05680, ATP-synt_E, ATP synthase E chain. This family consists
of several ATP synthase E chain sequences which are
components of the CF(0) subunit.
Length = 83
Score = 33.2 bits (76), Expect = 0.012
Identities = 16/58 (27%), Positives = 32/58 (55%), Gaps = 1/58 (1%)
Query: 94 AGVLYEDNIKKEEEEKEE-EEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
G ++ ++KK+EE+ E E +E++ E+ K +K++ ++E K E +ET
Sbjct: 24 YGAKHQRSLKKKEEKIREYEAQEKLIEKAKAAYAKKKQAKQEAKSLAEGSSAPFDETP 81
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 35.2 bits (81), Expect = 0.012
Identities = 22/59 (37%), Positives = 32/59 (54%), Gaps = 5/59 (8%)
Query: 97 LYEDNIKKE-----EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
+Y++N K+ E +E E+ EEK E EK KEE E+E+ EE++ EE K
Sbjct: 224 VYDNNDHKDFCVEIEGGMDEHSFEDFLLEEKRRELEKLAKEEAERERQAEEQRRREEEK 282
Score = 31.4 bits (71), Expect = 0.20
Identities = 16/56 (28%), Positives = 29/56 (51%), Gaps = 3/56 (5%)
Query: 106 EEEKEEEEEEEVK---EEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHL 158
EE EEE ++ +EE+ EE + + E+ +M+ E KE+ +L+ H+
Sbjct: 424 EELYWVEEEHQIYRKLQEERRLREEAIRAKAEKTARMKAEMKEKTLKMFLLSQKHI 479
Score = 29.8 bits (67), Expect = 0.68
Identities = 19/55 (34%), Positives = 34/55 (61%), Gaps = 5/55 (9%)
Query: 107 EEKEEEEEEEVKEE-EKEEEEEKEKKEEEEKEKME----EEEKEEEETKYLVRDV 156
EEK E E+ KEE E+E + E++++ EEEK ME + + E E+ + ++++
Sbjct: 252 EEKRRELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRREKLQNL 306
Score = 28.3 bits (63), Expect = 1.9
Identities = 19/48 (39%), Positives = 24/48 (50%), Gaps = 3/48 (6%)
Query: 103 KKEEEEK---EEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
K+ E EK EE E E EE++ EEEK E + + E EK E
Sbjct: 254 KRRELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRRE 301
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 34.3 bits (79), Expect = 0.014
Identities = 14/59 (23%), Positives = 36/59 (61%)
Query: 93 KAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
+ + K EK+ +E E +E++E++E+E++EEEE+++ +++ ++++ Y
Sbjct: 142 FTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDDDDY 200
Score = 33.2 bits (76), Expect = 0.038
Identities = 19/49 (38%), Positives = 32/49 (65%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
+ EEEE +E+ ++++ KE E E +E+E+ E+ EEEE+EE+E
Sbjct: 139 VGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDE 187
Score = 32.8 bits (75), Expect = 0.050
Identities = 16/62 (25%), Positives = 36/62 (58%)
Query: 87 QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
+ ++ L+ + + +E+ E++ + E ++ +EE EK EEEE+E+ EE+E +
Sbjct: 131 ALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFD 190
Query: 147 EE 148
++
Sbjct: 191 DD 192
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 33.9 bits (78), Expect = 0.018
Identities = 17/48 (35%), Positives = 36/48 (75%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
++E E EEEE++E K++++ ++E+KEKK+++EK + K++++ K
Sbjct: 143 VEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKK 190
Score = 33.9 bits (78), Expect = 0.019
Identities = 18/48 (37%), Positives = 33/48 (68%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
+ E ++E + KE E EEEE+KEKK+++E +K ++E+K+++E
Sbjct: 130 SESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKM 177
Score = 32.4 bits (74), Expect = 0.058
Identities = 20/53 (37%), Positives = 35/53 (66%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
E + E++ E EEEE KE++K++E +KEKKE+++K++ E K ++ K
Sbjct: 136 EKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKK 188
Score = 32.4 bits (74), Expect = 0.071
Identities = 17/46 (36%), Positives = 29/46 (63%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
E E E E+ + E+E E E++E++EK+K +E +KE++E K
Sbjct: 126 SELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKK 171
Score = 32.0 bits (73), Expect = 0.083
Identities = 19/64 (29%), Positives = 44/64 (68%)
Query: 83 VLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
+ + E E + ++ +EEE+KE+++++EVK+E+KE++++KEK E + K +++
Sbjct: 129 GSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKK 188
Query: 143 EKEE 146
+K++
Sbjct: 189 KKKK 192
Score = 32.0 bits (73), Expect = 0.085
Identities = 17/48 (35%), Positives = 36/48 (75%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
K+ E E+EE++E++ K+E K+E++EK+ K+E+ E ++K++++ K
Sbjct: 145 KEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKK 192
Score = 31.6 bits (72), Expect = 0.099
Identities = 17/49 (34%), Positives = 36/49 (73%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
+ E E E+E +V++E + EEEEK++K+++++ K E++EK++++ K
Sbjct: 128 LGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEK 176
Score = 29.7 bits (67), Expect = 0.57
Identities = 15/53 (28%), Positives = 29/53 (54%)
Query: 98 YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
Y E E E E + K E+E + ++EE++++K ++E K+E++ K
Sbjct: 118 YGAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEK 170
Score = 28.5 bits (64), Expect = 1.2
Identities = 18/48 (37%), Positives = 32/48 (66%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
+ E E E+E + EKE E E+E+K+E++K+K ++EK+E++ K
Sbjct: 126 SELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDK 173
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 34.0 bits (78), Expect = 0.024
Identities = 24/69 (34%), Positives = 40/69 (57%), Gaps = 5/69 (7%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEK-----EEEEEKEKKEEEEKEKM 139
++ +E+ K V+ ED + E+E+E+E K + EE EEKEK++ + +KM
Sbjct: 242 EESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKKM 301
Query: 140 EEEEKEEEE 148
E+E E+EE
Sbjct: 302 MEDEDEDEE 310
Score = 30.6 bits (69), Expect = 0.34
Identities = 26/76 (34%), Positives = 41/76 (53%), Gaps = 3/76 (3%)
Query: 74 EGKPNGLLCVLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEE 133
E KP+G +++ +E EK +KK E+++E+EE E+ E EEEE E+ E
Sbjct: 273 EPKPSGERSDSEEETEEKEKEK---RKRLKKMMEDEDEDEEMEIVPESPVEEEESEEPEP 329
Query: 134 EEKEKMEEEEKEEEET 149
K EEE++E +
Sbjct: 330 PPLPKKEEEKEEVTVS 345
Score = 29.0 bits (65), Expect = 0.99
Identities = 20/64 (31%), Positives = 36/64 (56%)
Query: 84 LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
LD+ E E + ++E EEKE+E+ + +K+ ++E+E++E + E EEE
Sbjct: 265 LDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKKMMEDEDEDEEMEIVPESPVEEEES 324
Query: 144 KEEE 147
+E E
Sbjct: 325 EEPE 328
Score = 27.9 bits (62), Expect = 2.6
Identities = 15/48 (31%), Positives = 25/48 (52%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
KKE++E E +E EEE + + + E E ++E+E E+E
Sbjct: 229 KKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKP 276
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This is
a family of uncharacterized proteins.
Length = 449
Score = 33.8 bits (77), Expect = 0.025
Identities = 14/57 (24%), Positives = 39/57 (68%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
E+ I+K +E EEE EE ++E++++++ ++ E+++ + +E++++E++ + D
Sbjct: 35 ENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHD 91
Score = 33.0 bits (75), Expect = 0.049
Identities = 15/53 (28%), Positives = 34/53 (64%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
K +E +EE EEE +EE +++++ E +++++ + E++E E+++ L D
Sbjct: 40 KLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDD 92
Score = 30.0 bits (67), Expect = 0.51
Identities = 16/61 (26%), Positives = 38/61 (62%), Gaps = 3/61 (4%)
Query: 100 DNIKKE---EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDV 156
D +KKE + +E EEE ++EE+ +EE++ + +E+E + +++++++E+ +
Sbjct: 30 DTMKKENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTL 89
Query: 157 H 157
H
Sbjct: 90 H 90
>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3. This family of
proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). SH3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of SH3, AAPs are retained in the ER.
Length = 196
Score = 33.5 bits (77), Expect = 0.025
Identities = 13/39 (33%), Positives = 21/39 (53%)
Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
E K+ + EE EE E +++E E EK E ++K+
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 32.7 bits (75), Expect = 0.049
Identities = 14/35 (40%), Positives = 19/35 (54%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
K EE EE E ++EE E EKKE ++K+
Sbjct: 162 AKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 32.3 bits (74), Expect = 0.072
Identities = 14/37 (37%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 104 KEEEEKEE--EEEEEVKEEEKEEEEEKEKKEEEEKEK 138
K+ + EE EE E KEEE + +K+E +K+K
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 27.7 bits (62), Expect = 2.6
Identities = 12/39 (30%), Positives = 20/39 (51%)
Query: 112 EEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
E ++ EE EE + +E+E E E+KE ++ K
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
subunit. The alpha-subunit of the TRAP complex (TRAP
alpha) is a single-spanning membrane protein of the
endoplasmic reticulum (ER) which is found in proximity
of nascent polypeptide chains translocating across the
membrane.
Length = 281
Score = 33.6 bits (77), Expect = 0.026
Identities = 18/44 (40%), Positives = 25/44 (56%)
Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
+ E+EE + E+EE++ EE+E E EEEE EE E K
Sbjct: 31 AQDLTEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGEVK 74
Score = 32.8 bits (75), Expect = 0.049
Identities = 16/60 (26%), Positives = 31/60 (51%)
Query: 77 PNGLLCVLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEK 136
P LL + + + ++ + + +++EE+E V+E+E E EE+E +E E K
Sbjct: 15 PAVLLTFGNGKGPFASAQDLTEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGEVK 74
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of
the RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 33.9 bits (78), Expect = 0.027
Identities = 13/48 (27%), Positives = 25/48 (52%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
E+E++EEE+ + EEE+ E+ E+E + E E +++
Sbjct: 370 VDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSE 417
Score = 32.8 bits (75), Expect = 0.057
Identities = 12/51 (23%), Positives = 27/51 (52%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
+ + ++E+E+EE+ +E +EEE E+ EE+ + E+ + +
Sbjct: 367 FEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSE 417
Score = 32.8 bits (75), Expect = 0.070
Identities = 16/49 (32%), Positives = 30/49 (61%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
D I EE +++E+EEEE + +E EEEE ++ +EE + + + + +
Sbjct: 363 DPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSD 411
Score = 32.0 bits (73), Expect = 0.10
Identities = 15/50 (30%), Positives = 24/50 (48%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
ED +++ ++ EEEE E EEE + E E + E K ++E
Sbjct: 374 EDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKE 423
Score = 32.0 bits (73), Expect = 0.12
Identities = 14/54 (25%), Positives = 30/54 (55%)
Query: 97 LYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
L + ++ +E+++EEEE+ E E+EE E+ E++ + +E E + +
Sbjct: 362 LDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSD 415
Score = 30.1 bits (68), Expect = 0.53
Identities = 12/50 (24%), Positives = 23/50 (46%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
+ + + EEEE E+ EEE + E E + + K ++E + +
Sbjct: 380 QRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSD 429
Score = 29.3 bits (66), Expect = 0.98
Identities = 16/46 (34%), Positives = 25/46 (54%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
K + + EEV E+E EEEE++ + EEE+ + EEE +
Sbjct: 356 DKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSR 401
Score = 28.9 bits (65), Expect = 1.1
Identities = 15/61 (24%), Positives = 32/61 (52%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLE 159
D + + + EE +E ++EE+E+ ++ ++EE E + E + E+ + DV +
Sbjct: 356 DKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSD 415
Query: 160 S 160
S
Sbjct: 416 S 416
Score = 27.4 bits (61), Expect = 3.6
Identities = 12/44 (27%), Positives = 26/44 (59%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
+ +K + + EE +E+E++E+++ ++ + EE E EEE
Sbjct: 353 KMRDKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEE 396
Score = 27.4 bits (61), Expect = 4.0
Identities = 12/46 (26%), Positives = 19/46 (41%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
++EE E EEE + +E+ E + E K E + E
Sbjct: 386 EEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSDSE 431
Score = 27.0 bits (60), Expect = 4.8
Identities = 14/49 (28%), Positives = 26/49 (53%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
K +++ + EE E+E+E+E++ +E E+ E E+ EEE
Sbjct: 349 TKESKMRDKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEG 397
>gnl|CDD|218311 pfam04888, SseC, Secretion system effector C (SseC) like family.
SseC is a secreted protein that forms a complex together
with SecB and SecD on the surface of Salmonella. All
these proteins are secreted by the type III secretion
system. Many mucosal pathogens use type III secretion
systems for the injection of effector proteins into
target cells. SecB, SseC and SecD are inserted into the
target cell membrane. where they form a small pore or
translocon. In addition to SseC, this family includes
the bacterial secreted proteins PopB, PepB, YopB and
EspD which are thought to be directly involved in pore
formation, and type III secretion system translocon.
Length = 303
Score = 33.6 bits (77), Expect = 0.028
Identities = 13/43 (30%), Positives = 28/43 (65%), Gaps = 1/43 (2%)
Query: 109 KEEEEEEEVKEEEKEE-EEEKEKKEEEEKEKMEEEEKEEEETK 150
K E++ + K ++ E +++EKK EE +E++++ ++ EE K
Sbjct: 11 KLAEKQAKSKLQQLERARDKQEKKAEEYQEQIKKAIEKAEEAK 53
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 33.0 bits (76), Expect = 0.030
Identities = 24/55 (43%), Positives = 36/55 (65%), Gaps = 1/55 (1%)
Query: 97 LYEDNIKKEEEEKEEEEEEEVKEEEKEEEE-EKEKKEEEEKEKMEEEEKEEEETK 150
L D +KEEEE+EE E EE+ EEE+ +E EKE + + +++ E E K++E K
Sbjct: 94 LGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILK 148
Score = 28.4 bits (64), Expect = 1.3
Identities = 18/50 (36%), Positives = 32/50 (64%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYL 152
++EE E EE +EEE +E E+E K K+E+ + + +++E +E+ K L
Sbjct: 105 EEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILKEQMKML 154
Score = 26.5 bits (59), Expect = 5.1
Identities = 15/48 (31%), Positives = 29/48 (60%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
KK + +++E+ +EEE+E E E+ +EE+ E +E+E + + K
Sbjct: 88 KKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREK 135
>gnl|CDD|235396 PRK05299, rpsB, 30S ribosomal protein S2; Provisional.
Length = 258
Score = 33.2 bits (77), Expect = 0.032
Identities = 17/36 (47%), Positives = 25/36 (69%)
Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
E ++ E +EEE+E EEE+E++EEEE E+ E E
Sbjct: 223 EGRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
Score = 30.9 bits (71), Expect = 0.22
Identities = 17/34 (50%), Positives = 23/34 (67%)
Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
++ E EEEE + EE+EEEEE+E+ EE E E
Sbjct: 225 RQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
Score = 30.5 bits (70), Expect = 0.29
Identities = 16/34 (47%), Positives = 23/34 (67%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEK 136
++ + EEEEE EEE+EEEEE+E +E E +
Sbjct: 225 RQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
Score = 30.1 bits (69), Expect = 0.44
Identities = 14/33 (42%), Positives = 22/33 (66%)
Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
+ E EEE +E E+EEEEE+E++ EE + +
Sbjct: 226 QGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
Score = 29.7 bits (68), Expect = 0.62
Identities = 17/39 (43%), Positives = 25/39 (64%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
D I + + + E EE +EE +EEEEE+E++E EE E
Sbjct: 219 DAILEGRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEA 257
Score = 29.0 bits (66), Expect = 1.0
Identities = 16/31 (51%), Positives = 22/31 (70%)
Query: 118 KEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
+ E EEEE+E +EEEE+E+ EE E+ E E
Sbjct: 228 RLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
Score = 28.2 bits (64), Expect = 1.8
Identities = 17/31 (54%), Positives = 20/31 (64%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKE 129
EEEE+E EEEEE +EEE+ EE E E
Sbjct: 228 RLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
Score = 28.2 bits (64), Expect = 2.1
Identities = 15/33 (45%), Positives = 23/33 (69%)
Query: 114 EEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
+ + E +EEEEE E++EEEE+E+ EE + E
Sbjct: 226 QGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
Score = 27.8 bits (63), Expect = 2.1
Identities = 14/33 (42%), Positives = 21/33 (63%)
Query: 112 EEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
+ EE+EEE E+E++EEEE+E E E +
Sbjct: 226 QGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
Score = 27.8 bits (63), Expect = 2.2
Identities = 14/35 (40%), Positives = 22/35 (62%)
Query: 116 EVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
E ++ E E+E++E EE+E+ EEEE+ EE
Sbjct: 223 EGRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEA 257
Score = 27.8 bits (63), Expect = 2.3
Identities = 16/33 (48%), Positives = 22/33 (66%)
Query: 115 EEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
+ E EEEEE+ ++EEEE+E+ E EE E E
Sbjct: 226 QGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
Score = 27.4 bits (62), Expect = 2.8
Identities = 15/33 (45%), Positives = 22/33 (66%)
Query: 111 EEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
+ E EEE+EE EE+E++EEEE+ + E E
Sbjct: 226 QGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
Score = 27.4 bits (62), Expect = 3.2
Identities = 15/33 (45%), Positives = 22/33 (66%)
Query: 118 KEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
+ E EE+E++ EEE+E+ EEEE EE E +
Sbjct: 226 QGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
Score = 26.7 bits (60), Expect = 6.2
Identities = 17/34 (50%), Positives = 23/34 (67%)
Query: 93 KAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEE 126
+ G L E ++EEE +EEEEEEE +E E+ E E
Sbjct: 225 RQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 33.7 bits (77), Expect = 0.034
Identities = 21/60 (35%), Positives = 34/60 (56%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLE 159
D +KKEEE +E EE KE++ + E E +EE+E+ + EE + E + ++ LE
Sbjct: 73 DELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLENEELQYLREYNLFDRNNLQLE 132
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 33.6 bits (77), Expect = 0.038
Identities = 16/44 (36%), Positives = 30/44 (68%)
Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
K+E + EE E + KEE+ ++EE ++K++E+ E E+ +K+E
Sbjct: 389 KDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDER 432
Score = 33.2 bits (76), Expect = 0.044
Identities = 18/54 (33%), Positives = 36/54 (66%), Gaps = 1/54 (1%)
Query: 98 YEDNIKKEEEEKEEEEEEEVK-EEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
Y + + ++E + EE E K +EEK ++EE EKK++E+ ++ +E+ +++E K
Sbjct: 381 YTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434
Score = 29.8 bits (67), Expect = 0.70
Identities = 14/46 (30%), Positives = 27/46 (58%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
++ ++ ++ ++E + EE E K +EEK K EE EK+++E
Sbjct: 375 NKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQA 420
Score = 29.0 bits (65), Expect = 1.3
Identities = 12/51 (23%), Positives = 28/51 (54%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
K++E KE ++ + ++ E +++ EE + K +EE+ ++EE +
Sbjct: 363 SGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENE 413
>gnl|CDD|236277 PRK08493, PRK08493, NADH dehydrogenase subunit G; Validated.
Length = 819
Score = 33.5 bits (77), Expect = 0.040
Identities = 22/103 (21%), Positives = 47/103 (45%), Gaps = 2/103 (1%)
Query: 63 FSDNTLCLQLVEGKPNGLLCVLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEK 122
S N C+ G +L L + E E E+ + +E EE++E+
Sbjct: 417 LSKNFFCITHEVGAEEIILYFLLKKFLEEEAILKSLEEFKQSIVKEAALSILEEIREKVL 476
Query: 123 EEEEEKEKKEEEEKEKMEEEEKE--EEETKYLVRDVHLESCEF 163
E+ E+ + +EE K+++ ++ K+ E +T L+ ++ + +
Sbjct: 477 EQAEQGCENQEEVKKEVPKKVKKIPEVDTYLLLEELGINEETY 519
>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein. Hsp70 chaperones help to fold
many proteins. Hsp70 assisted folding involves repeated
cycles of substrate binding and release. Hsp70 activity
is ATP dependent. Hsp70 proteins are made up of two
regions: the amino terminus is the ATPase domain and the
carboxyl terminus is the substrate binding region.
Length = 598
Score = 33.4 bits (77), Expect = 0.045
Identities = 27/89 (30%), Positives = 44/89 (49%), Gaps = 15/89 (16%)
Query: 77 PNGLLCV------LDDQAKETEKA-GVLYEDNIKK--EEEEKEEEEEEEVKE--EEKEEE 125
NG+L V + K T A L +D I++ ++ E+ E+++ KE E K E
Sbjct: 474 ANGILTVSAKDKGTGKEQKITITASSGLSDDEIERMVKDAEEYAAEDKKRKERIEAKNEA 533
Query: 126 EE----KEKKEEEEKEKMEEEEKEEEETK 150
EE EK +EE +K+ E +K++ E
Sbjct: 534 EEYVYSLEKSLKEEGDKLPEADKKKVEEA 562
>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
TAF4 family. This region of similarity is found in
Transcription initiation factor TFIID component TAF4.
Length = 255
Score = 33.1 bits (76), Expect = 0.046
Identities = 16/40 (40%), Positives = 25/40 (62%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEE 141
+KEEEE+ E E+ E+ E+ K+K +EE+K + EE
Sbjct: 121 KQKEEEERRVERRRELGLEDPEQLRLKQKAKEEQKAESEE 160
Score = 30.1 bits (68), Expect = 0.46
Identities = 16/43 (37%), Positives = 25/43 (58%), Gaps = 3/43 (6%)
Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEE-KEKMEEEEKEEEE 148
+K++EEEE + E+ E E E+ K+K +EE+K E E
Sbjct: 119 AQKQKEEEE--RRVERRRELGLEDPEQLRLKQKAKEEQKAESE 159
Score = 28.5 bits (64), Expect = 1.7
Identities = 14/43 (32%), Positives = 22/43 (51%), Gaps = 1/43 (2%)
Query: 101 NIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
K++EEE+ E E E+ ++K +EE +K E EE
Sbjct: 119 AQKQKEEEERRVERRRELGLEDPEQLRLKQKAKEE-QKAESEE 160
>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3. This family
of proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). Shr3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of Shr3, AAPs are retained in the ER.
Length = 196
Score = 32.7 bits (75), Expect = 0.047
Identities = 11/39 (28%), Positives = 23/39 (58%)
Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
E K+ +++EE EE++E K+ +K+E ++K+
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
Score = 32.0 bits (73), Expect = 0.095
Identities = 10/39 (25%), Positives = 19/39 (48%)
Query: 112 EEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
E ++ ++EE EE KE K+ ++ E ++ K
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
Score = 31.2 bits (71), Expect = 0.18
Identities = 13/54 (24%), Positives = 22/54 (40%), Gaps = 12/54 (22%)
Query: 94 AGVL----------YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
GVL +D +KEE EE +E ++ +K E ++K+
Sbjct: 145 VGVLVLQAGLWYAERKDAKQKEEFAAEERKEA--LAAAAKKSATPQKVETKKKK 196
Score = 28.1 bits (63), Expect = 1.9
Identities = 10/36 (27%), Positives = 19/36 (52%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
+K+ ++KEE EE KE ++ ++ E +K
Sbjct: 159 RKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKK 194
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 33.2 bits (76), Expect = 0.049
Identities = 15/55 (27%), Positives = 33/55 (60%)
Query: 96 VLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
VL ++ + EE+ + K+++KE++ + KK+ E+K+K ++E+KE +
Sbjct: 33 VLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEG 87
Score = 32.8 bits (75), Expect = 0.066
Identities = 19/58 (32%), Positives = 33/58 (56%), Gaps = 8/58 (13%)
Query: 101 NIKKEEEEKEEEEEEEV--------KEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
+ KE EEE +V K+E+K E +K+ +++++K+K ++E K E ETK
Sbjct: 33 VLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETK 90
Score = 28.9 bits (65), Expect = 1.1
Identities = 12/55 (21%), Positives = 32/55 (58%)
Query: 90 ETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
E + A + + K+++ + +++ E+ K+++KE++E K + E + K ++ K
Sbjct: 46 ENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSK 100
Score = 28.9 bits (65), Expect = 1.3
Identities = 14/51 (27%), Positives = 33/51 (64%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
K ++E+K E +++ ++++K+++E+KE K E E + + K+ ++TK
Sbjct: 53 STKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTK 103
Score = 27.4 bits (61), Expect = 3.9
Identities = 12/50 (24%), Positives = 30/50 (60%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYL 152
K+E++ E +++ K+++K++E+++ K E E K + +K ++ K
Sbjct: 57 DKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKP 106
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 32.8 bits (76), Expect = 0.058
Identities = 12/62 (19%), Positives = 26/62 (41%)
Query: 86 DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
+ E + I + EE+ V E +E +++++++EEE++ E
Sbjct: 154 EWYDRLENGERRLRELIDGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADE 213
Query: 146 EE 147
E
Sbjct: 214 SE 215
Score = 30.5 bits (70), Expect = 0.39
Identities = 11/42 (26%), Positives = 21/42 (50%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
EE+ E + ++ E+EEE+E + ++ E E E+
Sbjct: 178 AEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEK 219
Score = 28.2 bits (64), Expect = 1.9
Identities = 15/52 (28%), Positives = 26/52 (50%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
ED E +E +++E+ +EEE E ++ E E EK+ E+ K +
Sbjct: 180 EDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEKVLEKFKALAKQY 231
Score = 26.7 bits (60), Expect = 6.9
Identities = 17/72 (23%), Positives = 34/72 (47%), Gaps = 6/72 (8%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEK------KEEEEKEK 138
D E + A V E ++E++EEEE+E +E E EK ++ +K
Sbjct: 174 VDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEKVLEKFKALAKQYKK 233
Query: 139 MEEEEKEEEETK 150
+ + ++++ E +
Sbjct: 234 LRKAQEKKVEGR 245
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 33.0 bits (75), Expect = 0.060
Identities = 15/47 (31%), Positives = 24/47 (51%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
+ + E++EK E+ E +E EE E K +EE E+ E E +
Sbjct: 4051 DFSDLAEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQ 4097
Score = 31.9 bits (72), Expect = 0.15
Identities = 19/77 (24%), Positives = 37/77 (48%), Gaps = 2/77 (2%)
Query: 86 DQAKETEKAGVLYEDNIKKEEEEKEEE--EEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
D+ + + L ED+ K E+ EE E EE E+ + +EE E+ E E + ++
Sbjct: 4044 DEDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQAIDNHP 4103
Query: 144 KEEEETKYLVRDVHLES 160
K + ++ + + E+
Sbjct: 4104 KMDAKSTFASAEADEEN 4120
Score = 31.5 bits (71), Expect = 0.23
Identities = 13/50 (26%), Positives = 30/50 (60%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
E+N +E+ ++++ + +++EK E+ E+ +E +E E+ K +EE
Sbjct: 4038 EENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTEDGVKSDEE 4087
Score = 28.8 bits (64), Expect = 1.6
Identities = 18/67 (26%), Positives = 33/67 (49%), Gaps = 7/67 (10%)
Query: 90 ETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEE-------EKEKKEEEEKEKMEEE 142
E++ GV + + + E EEE +++ EE + E EK E E++ +E E
Sbjct: 3861 ESDMNGVTKDSVVSENENSDSEEENQDLDEEVNDIPEDLSNSLNEKLWDEPNEEDLLETE 3920
Query: 143 EKEEEET 149
+K E++
Sbjct: 3921 QKSNEQS 3927
Score = 28.8 bits (64), Expect = 1.7
Identities = 11/45 (24%), Positives = 23/45 (51%)
Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
E E+K E+ E + +E+ K E++ + +E+E+E +
Sbjct: 3917 LETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSD 3961
Score = 28.4 bits (63), Expect = 1.8
Identities = 16/68 (23%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 84 LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEV-KEEEKEEEEEKEKKEEEEKEKMEEE 142
L+++ + L E K E+ E + V KE++ + E+K+++E+E++E+M ++
Sbjct: 3903 LNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDD 3962
Query: 143 EKEEEETK 150
++E +
Sbjct: 3963 VGIDDEIQ 3970
Score = 28.0 bits (62), Expect = 2.5
Identities = 11/64 (17%), Positives = 34/64 (53%), Gaps = 2/64 (3%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
++ ETE+ E + E + +E++ E+K+ +E+++++E + +++E +
Sbjct: 3913 EEDLLETEQKS--NEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQ 3970
Query: 145 EEEE 148
+ +
Sbjct: 3971 PDIQ 3974
Score = 28.0 bits (62), Expect = 2.7
Identities = 19/83 (22%), Positives = 41/83 (49%), Gaps = 4/83 (4%)
Query: 84 LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
L+ + K E++ E ++ +E++ + E+++ +E+E EEE + ++E + +E
Sbjct: 3917 LETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQEN 3976
Query: 144 KE----EEETKYLVRDVHLESCE 162
E E L D+ L+ E
Sbjct: 3977 NSQPPPENEDLDLPEDLKLDEKE 3999
Score = 27.7 bits (61), Expect = 4.4
Identities = 17/71 (23%), Positives = 42/71 (59%), Gaps = 6/71 (8%)
Query: 84 LDDQAKETEKAGVLYEDNIKKEEEEKEE---EEEEEVKEEEKEEEE---EKEKKEEEEKE 137
LD++ + K L + +++ +E KEE E++E +++E+ EE +++ ++++ +
Sbjct: 3995 LDEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSD 4054
Query: 138 KMEEEEKEEEE 148
E++EK E+
Sbjct: 4055 LAEDDEKMNED 4065
Score = 27.3 bits (60), Expect = 5.4
Identities = 38/166 (22%), Positives = 68/166 (40%), Gaps = 29/166 (17%)
Query: 12 EEEEEEEEEEDFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQ 71
E + EEE +D ++ +N E L N+ ++ EE E + + + + N
Sbjct: 3877 ENSDSEEENQDLDEE-VNDIPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSAANNESDL 3935
Query: 72 LVEGKPNGLLCVLDDQAKETEKA---GVLYEDNIKKEEEEKEEEE---------EEEVKE 119
+ + N L D Q KE E+ V +D I+ + +E + E++K
Sbjct: 3936 VSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPEDLKL 3995
Query: 120 EEKEEE----------------EEKEKKEEEEKEKMEEEEKEEEET 149
+EKE + E KE+ + E+ E M++E+ EE
Sbjct: 3996 DEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENN 4041
Score = 26.9 bits (59), Expect = 6.7
Identities = 12/46 (26%), Positives = 23/46 (50%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
EE +E+ ++ + E++EK ++ E+ E EE E+ K
Sbjct: 4038 EENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTEDGVK 4083
Score = 26.5 bits (58), Expect = 9.2
Identities = 14/59 (23%), Positives = 34/59 (57%), Gaps = 7/59 (11%)
Query: 99 EDNIKKEEEEKEEEE-----EEEVKEEEKEEEEEK--EKKEEEEKEKMEEEEKEEEETK 150
ED ++E+E++EE ++E++ + +E + E ++ + E ++ +EKE + +K
Sbjct: 3946 EDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPEDLKLDEKEGDVSK 4004
Score = 26.5 bits (58), Expect = 9.4
Identities = 11/63 (17%), Positives = 26/63 (41%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
+D + L E +++ + + +E E +E E + + E E + EE+
Sbjct: 3825 EDITNTLNEDDDLEELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEE 3884
Query: 145 EEE 147
++
Sbjct: 3885 NQD 3887
>gnl|CDD|111553 pfam02667, SCFA_trans, Short chain fatty acid transporter. This
family consists of two sequences annotated as short
chain fatty acid transporters, however, there are no
references giving details of experimental
characterisation of this function.
Length = 453
Score = 32.9 bits (75), Expect = 0.061
Identities = 11/35 (31%), Positives = 22/35 (62%)
Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEE 141
E + + + +K+E++EE++ K KK+ EK+E
Sbjct: 216 GELKSIDPKLLKKEKEEEKKLKSKKDATIAEKLEN 250
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 32.4 bits (74), Expect = 0.063
Identities = 13/40 (32%), Positives = 23/40 (57%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
KK++E+ EEE++ +++K EE E + EE +E
Sbjct: 70 KKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQES 109
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 32.6 bits (75), Expect = 0.064
Identities = 14/78 (17%), Positives = 37/78 (47%), Gaps = 3/78 (3%)
Query: 78 NGLLCVLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
+ + + DD E E + E + EE+++++ + +E++EE K + +E++ +
Sbjct: 95 SDQVVIFDDGDIEEE---LQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDD 151
Query: 138 KMEEEEKEEEETKYLVRD 155
++++ E R
Sbjct: 152 DDDDDDIATRERSLERRR 169
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 32.7 bits (75), Expect = 0.066
Identities = 19/66 (28%), Positives = 39/66 (59%)
Query: 83 VLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
DD G E+ + ++++EE++++ EEE+E+ + +++E+EE E ++E
Sbjct: 307 EPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDE 366
Query: 143 EKEEEE 148
+ EEEE
Sbjct: 367 DDEEEE 372
Score = 32.7 bits (75), Expect = 0.084
Identities = 17/53 (32%), Positives = 36/53 (67%)
Query: 98 YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
+ + EE++ ++ EEEE + +EEE++E ++ ++++ EEEE+E+E+ K
Sbjct: 328 DGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKK 380
Score = 31.5 bits (72), Expect = 0.17
Identities = 16/47 (34%), Positives = 31/47 (65%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
E+ + +EEE+E E+ +E++E+E++EE+EK+K + E E +
Sbjct: 347 EDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSELPF 393
Score = 31.5 bits (72), Expect = 0.21
Identities = 17/65 (26%), Positives = 39/65 (60%)
Query: 84 LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
LDD+ + + + EEEE++ ++E+ ++++ + EEE+E + ++E+ EE+E
Sbjct: 302 LDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDE 361
Query: 144 KEEEE 148
++E
Sbjct: 362 DSDDE 366
Score = 28.8 bits (65), Expect = 1.4
Identities = 20/44 (45%), Positives = 31/44 (70%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
ED +EEE EE+E+ + +++E+EEEEEKEKK+++ E E
Sbjct: 347 EDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSE 390
Score = 27.3 bits (61), Expect = 5.3
Identities = 15/64 (23%), Positives = 36/64 (56%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
DD+ +E K D+ + +++ + ++EE+EE+ ++ EE++ + +EEEE+
Sbjct: 288 DDEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEE 347
Query: 145 EEEE 148
+ +
Sbjct: 348 DVDL 351
Score = 26.5 bits (59), Expect = 7.4
Identities = 18/54 (33%), Positives = 32/54 (59%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
DD E E+ V D + EE+E ++E++E +EEE++E+++K+ E E
Sbjct: 338 DDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSEL 391
>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
[Intracellular trafficking and secretion].
Length = 776
Score = 32.7 bits (74), Expect = 0.066
Identities = 26/100 (26%), Positives = 48/100 (48%), Gaps = 2/100 (2%)
Query: 61 IEFSDNTLCLQLVEGKPNGLLCVLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEE 120
+ F DN LQ + L L+++ E+A +D I E++ +++
Sbjct: 313 LRFLDNLDFLQFLSKYIGCLWRNLNNEKWGNEEAE--RKDYILDSSSVPLEKQFDDILYF 370
Query: 121 EKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLES 160
EK E E + +E E +E++E+ E E + +K + D LE+
Sbjct: 371 EKMEIENRNPEESEHEEEVEDYEDENDHSKRICDDDELEN 410
Score = 27.3 bits (60), Expect = 4.9
Identities = 17/66 (25%), Positives = 28/66 (42%)
Query: 83 VLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
+LD + EK K E E + EE E +E E E+E K + +++E
Sbjct: 352 ILDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEEEVEDYEDENDHSKRICDDDELENH 411
Query: 143 EKEEEE 148
+ +E
Sbjct: 412 FRAADE 417
>gnl|CDD|224340 COG1422, COG1422, Predicted membrane protein [Function unknown].
Length = 201
Score = 32.3 bits (74), Expect = 0.067
Identities = 12/38 (31%), Positives = 26/38 (68%)
Query: 108 EKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
EK +E ++ +KE +KE E +E + ++ +K++E++ E
Sbjct: 72 EKMKELQKMMKEFQKEFREAQESGDMKKLKKLQEKQME 109
Score = 29.2 bits (66), Expect = 0.68
Identities = 9/38 (23%), Positives = 27/38 (71%), Gaps = 1/38 (2%)
Query: 112 EEEEEVKEEEKE-EEEEKEKKEEEEKEKMEEEEKEEEE 148
E+ +E+++ KE ++E +E +E + +K+++ ++++ E
Sbjct: 72 EKMKELQKMMKEFQKEFREAQESGDMKKLKKLQEKQME 109
>gnl|CDD|162338 TIGR01398, FlhA, flagellar biosynthesis protein FlhA. This model
describes flagellar biosynthesis protein FlhA, one of a
large number of genes associated with the biosynthesis
of functional bacterial flagella. Homologs of many such
proteins, including FlhA, function in type III protein
secretion systems. A separate model describes InvA
(Salmonella enterica), LcrD (Yersinia enterocolitica),
HrcV (Xanthomonas), etc., all of which score below the
noise cutoff for this model [Cellular processes,
Chemotaxis and motility].
Length = 678
Score = 32.6 bits (75), Expect = 0.070
Identities = 12/43 (27%), Positives = 20/43 (46%), Gaps = 2/43 (4%)
Query: 117 VKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLE 159
+ ++EEE E + +E+ EEEE + D+ LE
Sbjct: 310 RRRSKQEEEAAAEAAKAQEEAAEEEEESINDILAL--DDLELE 350
Score = 30.7 bits (70), Expect = 0.32
Identities = 9/27 (33%), Positives = 16/27 (59%)
Query: 109 KEEEEEEEVKEEEKEEEEEKEKKEEEE 135
+ ++EEE E + +EE ++EEE
Sbjct: 311 RRSKQEEEAAAEAAKAQEEAAEEEEES 337
Score = 30.7 bits (70), Expect = 0.35
Identities = 12/39 (30%), Positives = 22/39 (56%)
Query: 109 KEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
+ ++EEE E K +EE E++EE + + ++ E E
Sbjct: 312 RSKQEEEAAAEAAKAQEEAAEEEEESINDILALDDLELE 350
Score = 30.7 bits (70), Expect = 0.37
Identities = 9/27 (33%), Positives = 15/27 (55%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKE 129
K+EEE E + + + E+EEE +
Sbjct: 314 KQEEEAAAEAAKAQEEAAEEEEESIND 340
Score = 28.8 bits (65), Expect = 1.5
Identities = 9/32 (28%), Positives = 19/32 (59%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEE 133
+++ +++EE E K +E+ EEE+E +
Sbjct: 309 LRRRSKQEEEAAAEAAKAQEEAAEEEEESIND 340
Score = 27.2 bits (61), Expect = 4.8
Identities = 8/29 (27%), Positives = 15/29 (51%)
Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEE 135
K+EEE + +EE E+E++ +
Sbjct: 312 RSKQEEEAAAEAAKAQEEAAEEEEESIND 340
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 32.8 bits (75), Expect = 0.077
Identities = 14/44 (31%), Positives = 29/44 (65%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
N K E + + EE+ K+++K+E++++E+ + EEK ++E E
Sbjct: 728 WNGKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIEIAE 771
Score = 30.5 bits (69), Expect = 0.39
Identities = 13/39 (33%), Positives = 26/39 (66%)
Query: 108 EKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
K+ E + EEK ++++K++K++EE+ K EE+ + E
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIE 768
Score = 29.7 bits (67), Expect = 0.84
Identities = 13/43 (30%), Positives = 26/43 (60%)
Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
+ E + + EE+ +++++KEKK+EEE ++ E+ E E
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIEIAEA 772
Score = 28.9 bits (65), Expect = 1.5
Identities = 14/45 (31%), Positives = 24/45 (53%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKE 129
+ K+ E + K++++KE+++EEE K EEK E E
Sbjct: 727 NWNGKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIEIAE 771
Score = 28.1 bits (63), Expect = 2.2
Identities = 17/60 (28%), Positives = 27/60 (45%), Gaps = 1/60 (1%)
Query: 84 LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEE-EKEKKEEEEKEKMEEE 142
+ + KE G L D KK E +E+ EE+ E + +K K +++ K E E
Sbjct: 1523 RNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNKKQYKSNTEAE 1582
Score = 27.0 bits (60), Expect = 6.6
Identities = 12/46 (26%), Positives = 27/46 (58%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
N ++ E + + E K ++K+++E+K+++E + +EK E E
Sbjct: 726 RNWNGKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIEIAE 771
>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component. This
family is a region of the Myb-Related Cdc5p/Cef1
proteins, in fungi, and is part of the pre-mRNA splicing
factor complex.
Length = 363
Score = 32.3 bits (74), Expect = 0.078
Identities = 16/36 (44%), Positives = 24/36 (66%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKME 140
EEEE+E EE EE EE+ + + +++ EE KE+ E
Sbjct: 151 EEEEEEPEEMEEELEEDAADRDARKRAAEEAKEQEE 186
Score = 30.8 bits (70), Expect = 0.25
Identities = 15/40 (37%), Positives = 23/40 (57%)
Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
EEE+EE EE E + EE + + K+ EE ++ EE +
Sbjct: 151 EEEEEEPEEMEEELEEDAADRDARKRAAEEAKEQEELRRR 190
Score = 29.6 bits (67), Expect = 0.61
Identities = 17/45 (37%), Positives = 27/45 (60%)
Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
E E EEEEEE +E E+E EE+ ++ ++ E +E+EE +
Sbjct: 146 ELELPEEEEEEPEEMEEELEEDAADRDARKRAAEEAKEQEELRRR 190
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 32.1 bits (73), Expect = 0.079
Identities = 12/48 (25%), Positives = 23/48 (47%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
+ E E EE+ + + E E + KE+ + EK ++ K + + K
Sbjct: 57 QPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPK 104
Score = 31.3 bits (71), Expect = 0.15
Identities = 10/48 (20%), Positives = 21/48 (43%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
EE + E E E + + E E + + ++ + EK+ ++ K
Sbjct: 51 APTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPK 98
Score = 31.3 bits (71), Expect = 0.18
Identities = 14/46 (30%), Positives = 22/46 (47%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
E +E + E E EE+ + E E E K +E+ K E++ K
Sbjct: 50 EAPTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPK 95
Score = 29.4 bits (66), Expect = 0.64
Identities = 9/44 (20%), Positives = 19/44 (43%)
Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
+ EE + + E EE+ + + E E + KE+ + +
Sbjct: 48 VLEAPTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPE 91
Score = 27.8 bits (62), Expect = 2.2
Identities = 13/69 (18%), Positives = 27/69 (39%), Gaps = 7/69 (10%)
Query: 87 QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEE-----KEKKEEEEKEKMEE 141
+ + E E+ K E + E K +EK + E+ K K + + K K +
Sbjct: 54 EEPQPEPEPP--EEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKV 111
Query: 142 EEKEEEETK 150
+ + + +
Sbjct: 112 KPQPKPKKP 120
>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 926
Score = 32.6 bits (74), Expect = 0.082
Identities = 12/57 (21%), Positives = 25/57 (43%), Gaps = 3/57 (5%)
Query: 87 QAKETEKAGVLYEDNIKKEEEEKEEEEEE---EVKEEEKEEEEEKEKKEEEEKEKME 140
+AK+ K + IK E E E E + +EE+ + ++++ K++
Sbjct: 796 RAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYKVD 852
Score = 30.7 bits (69), Expect = 0.32
Identities = 14/66 (21%), Positives = 24/66 (36%), Gaps = 1/66 (1%)
Query: 88 AKETEKAGVLY-EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
K K IK K++ +E+ ++E + E + E E + EEK
Sbjct: 776 GKSVRKVNTAVLSTTIKAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGI 835
Query: 147 EETKYL 152
+ L
Sbjct: 836 DAPAIL 841
>gnl|CDD|226096 COG3566, COG3566, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 379
Score = 32.2 bits (73), Expect = 0.086
Identities = 6/66 (9%), Positives = 26/66 (39%)
Query: 83 VLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
+ ++ K +E + E+ + E++ + + + +++ + +K ++ +
Sbjct: 197 LSLEETKASEVEHLSASLKTATEKVDALEKDLHAAQAKLDSGQALTKEELDAKKAELSKA 256
Query: 143 EKEEEE 148
E
Sbjct: 257 LAALEA 262
>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
Length = 259
Score = 32.0 bits (73), Expect = 0.087
Identities = 14/53 (26%), Positives = 32/53 (60%)
Query: 108 EKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLES 160
+ E+E+E E E+ ++KE+ E ++K++MEE + +++ +++ L S
Sbjct: 73 KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQLSS 125
Score = 27.0 bits (60), Expect = 4.3
Identities = 11/34 (32%), Positives = 19/34 (55%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKE 132
+ +KE E E+ K+EE E +++KE +E
Sbjct: 73 KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEE 106
Score = 27.0 bits (60), Expect = 5.2
Identities = 19/61 (31%), Positives = 34/61 (55%), Gaps = 2/61 (3%)
Query: 78 NGLLCVLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
G+ L DQ E EK +++ E+ +K+EE E + K+E +E + ++K ++ KE
Sbjct: 63 TGVKDFLSDQKPEDEKELS--ASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKE 120
Query: 138 K 138
K
Sbjct: 121 K 121
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 32.3 bits (74), Expect = 0.090
Identities = 20/59 (33%), Positives = 36/59 (61%), Gaps = 2/59 (3%)
Query: 90 ETEKAGVLY--EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
E E GV E+NI+ E + +E ++E + EE+ E+ E + +K+EEE EK + + ++
Sbjct: 77 EGEIKGVFTLGEENIEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDK 135
>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
Length = 584
Score = 32.3 bits (74), Expect = 0.091
Identities = 10/40 (25%), Positives = 19/40 (47%)
Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
+ EEE E+++ ++EEE++ E+ EE
Sbjct: 260 APPQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEE 299
Score = 30.8 bits (70), Expect = 0.30
Identities = 9/39 (23%), Positives = 18/39 (46%)
Query: 110 EEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
+ EE E+ + E+E+E+ + E++ EE
Sbjct: 261 PPQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEE 299
Score = 30.0 bits (68), Expect = 0.64
Identities = 8/39 (20%), Positives = 19/39 (48%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
+ EEE E +++ ++++E+++ E EE
Sbjct: 261 PPQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEE 299
Score = 29.2 bits (66), Expect = 0.92
Identities = 10/40 (25%), Positives = 18/40 (45%)
Query: 108 EKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
+ EEE E++ + E+EE++ E+ EE
Sbjct: 260 APPQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEE 299
Score = 29.2 bits (66), Expect = 1.1
Identities = 10/35 (28%), Positives = 18/35 (51%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKM 139
EEE E ++ E+EEE++ E+ E++
Sbjct: 266 EEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEEI 300
>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
This model family represents the major capsid protein
component of the heads (capsids) of bacteriophage HK97,
phi-105, P27, and related phage. This model represents
one of several analogous families lacking detectable
sequence similarity. The gene encoding this component is
typically located in an operon encoding the small and
large terminase subunits, the portal protein and the
prohead or maturation protease [Mobile and
extrachromosomal element functions, Prophage functions].
Length = 384
Score = 32.3 bits (74), Expect = 0.094
Identities = 16/50 (32%), Positives = 27/50 (54%), Gaps = 4/50 (8%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKE----EEEKEKMEEEEKEEEE 148
+E + +E ++ E+EK E E+EK E +EE +K++ E EE
Sbjct: 4 LREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEE 53
Score = 31.6 bits (72), Expect = 0.17
Identities = 9/49 (18%), Positives = 28/49 (57%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
+++ E +E + ++E+ E EE++ + + KE++++ + E + +
Sbjct: 4 LREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLE 52
Score = 31.6 bits (72), Expect = 0.17
Identities = 13/55 (23%), Positives = 27/55 (49%), Gaps = 3/55 (5%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEK---KEEEEKEKMEEEEKEEEETK 150
++K E+++ E EE K E +EE +K + + +E ++E E + +
Sbjct: 12 AKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASG 66
Score = 29.6 bits (67), Expect = 0.62
Identities = 14/43 (32%), Positives = 23/43 (53%)
Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
+E E E KE K E+EK + EEEK + + ++E ++
Sbjct: 1 LKELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDK 43
Score = 29.2 bits (66), Expect = 0.88
Identities = 14/47 (29%), Positives = 23/47 (48%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
+K+ E E +E K E E+ E E+++ E EE +K + E
Sbjct: 1 LKELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAE 47
Score = 28.9 bits (65), Expect = 1.4
Identities = 13/37 (35%), Positives = 19/37 (51%)
Query: 114 EEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
+E++E E +E K E+EK EEEK E +
Sbjct: 1 LKELREALAELAKELRKLTEDEKLAEAEEEKAEYDAL 37
>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
Length = 1437
Score = 32.5 bits (75), Expect = 0.096
Identities = 19/56 (33%), Positives = 31/56 (55%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
+K + E + +EE E E +KEEE+EK KE E K E EK+++ + ++
Sbjct: 157 GILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKE 212
Score = 32.1 bits (74), Expect = 0.10
Identities = 16/52 (30%), Positives = 31/52 (59%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
D+ K+E E+ E ++EEE ++ KE E +K E E+K++ + + +E +
Sbjct: 166 DDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQI 217
Score = 29.8 bits (68), Expect = 0.70
Identities = 12/46 (26%), Positives = 28/46 (60%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
++ ++K E +KEEE+E+ KE + ++ + +K+++ K +E
Sbjct: 169 KEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGP 214
>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
inhibitor of apoptosis (IAP)-associated factor (VIAF)
subfamily; VIAF is a Phd-like protein that functions in
caspase activation during apoptosis. It was identified
as an IAP binding protein through a screen of a human
B-cell library using a prototype IAP. VIAF lacks a
consensus IAP binding motif and while it does not
function as an IAP antagonist, it still plays a
regulatory role in the complete activation of caspases.
VIAF itself is a substrate for IAP-mediated
ubiquitination, suggesting that it may be a target of
IAPs in the prevention of cell death. The similarity of
VIAF to Phd points to a potential role distinct from
apoptosis regulation. Phd functions as a cytosolic
regulator of G protein by specifically binding to G
protein betagamma (Gbg)-subunits. The C-terminal domain
of Phd adopts a thioredoxin fold, but it does not
contain a CXXC motif. Phd interacts with G protein beta
mostly through the N-terminal helical domain.
Length = 192
Score = 31.9 bits (73), Expect = 0.099
Identities = 14/45 (31%), Positives = 23/45 (51%)
Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
K KEEEEE ++ E EKK +E ++ +EE+++
Sbjct: 16 KPPSPKEEEEEALELAIQEAHENALEKKLLDELDEELDEEEDDRF 60
Score = 26.8 bits (60), Expect = 5.2
Identities = 15/48 (31%), Positives = 26/48 (54%), Gaps = 2/48 (4%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
KEEEE+ E + E E++ ++ +EE E EE+++ EE +
Sbjct: 20 PKEEEEEALELAIQEAHENALEKKLLDELDEELDE--EEDDRFLEEYR 65
>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
Length = 1627
Score = 32.1 bits (73), Expect = 0.11
Identities = 21/45 (46%), Positives = 29/45 (64%)
Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
KE +EK+EE++EE EE K EE ++E +EEE+ E EK E
Sbjct: 277 KELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYELYEKVNIE 321
Score = 32.1 bits (73), Expect = 0.11
Identities = 20/51 (39%), Positives = 34/51 (66%), Gaps = 2/51 (3%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYL 152
+K+++EEK+EE+ EEVK EE +EE E+E E+ ++ E+ E K++
Sbjct: 279 LKEKKEEKDEEKSEEVKTEEVDEEFEEE--EKGFYYELYEKVNIEANKKFI 327
Score = 32.1 bits (73), Expect = 0.14
Identities = 15/46 (32%), Positives = 31/46 (67%)
Query: 115 EEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLES 160
+E+KE+++E++EEK ++ + E+ E EE+E+ L V++E+
Sbjct: 277 KELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYELYEKVNIEA 322
Score = 28.7 bits (64), Expect = 1.5
Identities = 15/35 (42%), Positives = 25/35 (71%)
Query: 117 VKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
+KE ++++EE+ E+K EE K + +EE EEEE +
Sbjct: 276 LKELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGF 310
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 31.6 bits (72), Expect = 0.11
Identities = 14/51 (27%), Positives = 38/51 (74%), Gaps = 3/51 (5%)
Query: 102 IKKEEEEKE---EEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
+KKE EEK+ ++++ K+++K+++++ +KK+++ ++K E+E +++ E
Sbjct: 74 VKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLED 124
Score = 29.7 bits (67), Expect = 0.43
Identities = 13/67 (19%), Positives = 37/67 (55%)
Query: 92 EKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
EK ++ K++++K+++++++ K+++ E+++EKE +++ E E ++
Sbjct: 80 EKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSEL 139
Query: 152 LVRDVHL 158
R L
Sbjct: 140 KPRKYAL 146
Score = 29.7 bits (67), Expect = 0.52
Identities = 11/49 (22%), Positives = 37/49 (75%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
+ E+ +KE EE+++ K ++K+ +++K+K ++++ +K +++ ++++E +
Sbjct: 70 EIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEA 118
Score = 28.9 bits (65), Expect = 0.95
Identities = 12/47 (25%), Positives = 34/47 (72%)
Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
E ++K++E EE+++ +KE EE+++ K +++K K ++++ ++++
Sbjct: 58 TEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDD 104
Score = 28.5 bits (64), Expect = 1.2
Identities = 14/57 (24%), Positives = 36/57 (63%)
Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLES 160
EE EK ++E EE ++ + ++++ K+KK++++ +K ++++ + E+ + LE
Sbjct: 68 AEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLED 124
Score = 27.7 bits (62), Expect = 2.1
Identities = 17/41 (41%), Positives = 27/41 (65%)
Query: 110 EEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
E E + K++E EE EK KKE EEK+K + ++K+ ++ K
Sbjct: 55 AEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKK 95
Score = 27.0 bits (60), Expect = 4.4
Identities = 13/48 (27%), Positives = 39/48 (81%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
KKE E+ E+ ++E +E++K + ++K+ K++++K+K ++++K++++++
Sbjct: 64 KKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSE 111
Score = 27.0 bits (60), Expect = 4.7
Identities = 11/48 (22%), Positives = 36/48 (75%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
++ E+ K+E EE++ + +K++ ++K+ K++++K+ ++++ E+++ K
Sbjct: 69 EEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEK 116
>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2. PPP4R2 (protein phosphatase 4 core
regulatory subunit R2) is the regulatory subunit of the
histone H2A phosphatase complex. It has been shown to
confer resistance to the anticancer drug cisplatin in
yeast, and may confer resistance in higher eukaryotes.
Length = 285
Score = 31.7 bits (72), Expect = 0.11
Identities = 23/61 (37%), Positives = 40/61 (65%)
Query: 89 KETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
KE+ A +D K + + ++ ++EE +++ + EEKE KE+EE+E+ EEEE+EE+E
Sbjct: 224 KESSDAAASQDDGPKGSDVKNKKSDDEEDDDQDGDYVEEKELKEDEEEEETEEEEEEEDE 283
Query: 149 T 149
Sbjct: 284 D 284
Score = 26.7 bits (59), Expect = 6.1
Identities = 15/56 (26%), Positives = 31/56 (55%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKME 140
+ K + EE++ ++ + E KE +++EEEE+ ++EEEE+++ E
Sbjct: 230 AASQDDGPKGSDVKNKKSDDEEDDDQDGDYVEEKELKEDEEEEETEEEEEEEDEDE 285
>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
subunit 1; Provisional.
Length = 319
Score = 31.9 bits (73), Expect = 0.11
Identities = 19/46 (41%), Positives = 30/46 (65%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
+EE EE E+ +EEE+E++ + + E+EE E EEEE ++E K
Sbjct: 274 GDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319
>gnl|CDD|221429 pfam12118, SprA-related, SprA-related family. This protein is
found in bacteria. Proteins in this family are typically
between 234 to 465 amino acids in length. There is a
conserved GEV sequence motif.Most members are annotated
as being SprA-related.
Length = 261
Score = 31.7 bits (72), Expect = 0.12
Identities = 15/67 (22%), Positives = 39/67 (58%)
Query: 86 DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
+AK +A ++ +++ E +EEV EE+ E+++ E ++ +++ E++++E
Sbjct: 50 QKAKLKGQASTAAGSASQELQKQASESNDEEVVGEEEPEKDQAEAGKQLSQKQQEQQQQE 109
Query: 146 EEETKYL 152
E++ + L
Sbjct: 110 EKQVREL 116
>gnl|CDD|237875 PRK14974, PRK14974, cell division protein FtsY; Provisional.
Length = 336
Score = 31.9 bits (73), Expect = 0.12
Identities = 31/69 (44%), Positives = 42/69 (60%), Gaps = 5/69 (7%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEK-----KEEEEKEKMEEEEKEEEETKYLV 153
E+ I++EEEE+ E EEE +EE++EE++EK K E KEK E+ EE E + L
Sbjct: 17 EEKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITEIKEKDIEDLLEELELELLE 76
Query: 154 RDVHLESCE 162
DV LE E
Sbjct: 77 SDVALEVAE 85
Score = 27.2 bits (61), Expect = 4.7
Identities = 17/31 (54%), Positives = 25/31 (80%)
Query: 118 KEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
K EEK EEEE+E+ E E+E+ EE+E+E++E
Sbjct: 15 KVEEKIEEEEEEEAPEAEEEEEEEDEEEKKE 45
Score = 26.1 bits (58), Expect = 9.8
Identities = 17/32 (53%), Positives = 27/32 (84%)
Query: 115 EEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
E+V+E+ +EEEEE+ + EEE+E+ +EEEK+E
Sbjct: 14 EKVEEKIEEEEEEEAPEAEEEEEEEDEEEKKE 45
>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
Length = 186
Score = 31.3 bits (71), Expect = 0.13
Identities = 16/63 (25%), Positives = 25/63 (39%)
Query: 86 DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
+AK+ K L + E + E++ + EE +EEE +E EE
Sbjct: 122 SKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAAEEEEAEEAPAEEAPA 181
Query: 146 EEE 148
EE
Sbjct: 182 EES 184
Score = 30.5 bits (69), Expect = 0.24
Identities = 15/47 (31%), Positives = 18/47 (38%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
K ++ K+ E E K E E EKK E EE EE
Sbjct: 123 KAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAAEEE 169
Score = 28.2 bits (63), Expect = 1.5
Identities = 15/47 (31%), Positives = 20/47 (42%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
KK ++ E E++V E E EK+ E E EEEE
Sbjct: 125 KKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAAEEEEA 171
Score = 26.7 bits (59), Expect = 5.8
Identities = 8/52 (15%), Positives = 21/52 (40%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
E + ++++ + ++ K + E++ E + E EK E +
Sbjct: 111 EGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEE 162
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 31.7 bits (72), Expect = 0.13
Identities = 10/57 (17%), Positives = 27/57 (47%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLE 159
+ +++ E+ K K + E+ KK +++K E+++ ++++ V L
Sbjct: 32 IYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKKVPLQVNPAQLF 88
Score = 31.3 bits (71), Expect = 0.15
Identities = 14/74 (18%), Positives = 32/74 (43%), Gaps = 5/74 (6%)
Query: 74 EGKPNGLLCVLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEE 133
+ K N D E + I+K + + + E VK+ +E++E++KK +
Sbjct: 20 KYKANK-----DKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPK 74
Query: 134 EEKEKMEEEEKEEE 147
++K ++ +
Sbjct: 75 KKKVPLQVNPAQLF 88
Score = 31.3 bits (71), Expect = 0.15
Identities = 15/66 (22%), Positives = 31/66 (46%), Gaps = 3/66 (4%)
Query: 97 LYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKK---EEEEKEKMEEEEKEEEETKYLV 153
L + K +++ E ++ ++K E+ K E +K+ +EK+EE+ K
Sbjct: 16 LESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKK 75
Query: 154 RDVHLE 159
+ V L+
Sbjct: 76 KKVPLQ 81
>gnl|CDD|234352 TIGR03779, Bac_Flav_CT_M, Bacteroides conjugative transposon TraM
protein. Members of this protein family are designated
TraM and are found in a proposed transfer region of a
class of conjugative transposon found in the Bacteroides
lineage [Cellular processes, DNA transformation].
Length = 410
Score = 31.6 bits (72), Expect = 0.13
Identities = 14/49 (28%), Positives = 23/49 (46%)
Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVR 154
+ ++E+E EV+E E E E E++ E+ E KY+ R
Sbjct: 141 KTDEEKELLREVEELESRLATEPSPAPELEEQLALMEKSYELAAKYMPR 189
Score = 29.3 bits (66), Expect = 1.0
Identities = 14/55 (25%), Positives = 22/55 (40%)
Query: 96 VLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
Y D ++ E + +E KE +E EE + + E E EE+ K
Sbjct: 124 AAYRDINRELGSFYEYPKTDEEKELLREVEELESRLATEPSPAPELEEQLALMEK 178
>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1). This family
consists of several rhoptry-associated protein 1 (RAP-1)
sequences which appear to be specific to Plasmodium
falciparum.
Length = 790
Score = 32.0 bits (72), Expect = 0.14
Identities = 19/45 (42%), Positives = 30/45 (66%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
D + +EEE + +EE EE +E++KE E+ + +EEK K +EE K
Sbjct: 238 DKVAEEEEFELDEEHEEAEEDKKEALEKIGAEGDEEKFKFDEEIK 282
Score = 29.7 bits (66), Expect = 0.71
Identities = 25/60 (41%), Positives = 36/60 (60%), Gaps = 2/60 (3%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEE--EEEKEKKEEEEKEKMEEEEKEEEETKYLVRDV 156
E +I K + +K EEEE +EE EE E++KE E+ E EE+ K +EE K++ DV
Sbjct: 229 EIDILKNDGDKVAEEEEFELDEEHEEAEEDKKEALEKIGAEGDEEKFKFDEEIKFIEHDV 288
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 31.7 bits (73), Expect = 0.16
Identities = 24/62 (38%), Positives = 43/62 (69%), Gaps = 2/62 (3%)
Query: 89 KETEKAGVLYEDNIKKEEE--EKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
++ EK + E+N+ ++ E EK EEE E+ ++E +++++E EKKEEE +E +EE+ +E
Sbjct: 85 QKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQEL 144
Query: 147 EE 148
E
Sbjct: 145 ER 146
Score = 30.1 bits (69), Expect = 0.51
Identities = 21/66 (31%), Positives = 34/66 (51%), Gaps = 3/66 (4%)
Query: 87 QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKE--EEEEKEKKEEEEK-EKMEEEE 143
+ KE E+ + KKE E ++E E KEE + E EKE +E + +K+E+
Sbjct: 32 KIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRL 91
Query: 144 KEEEET 149
++EE
Sbjct: 92 LQKEEN 97
Score = 28.6 bits (65), Expect = 1.5
Identities = 22/69 (31%), Positives = 33/69 (47%), Gaps = 3/69 (4%)
Query: 83 VLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKE---EEEEKEKKEEEEKEKM 139
+L++ KE E K+E + E E+E++E E E+ +KEE K+
Sbjct: 43 ILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKL 102
Query: 140 EEEEKEEEE 148
E EK EEE
Sbjct: 103 ELLEKREEE 111
Score = 27.8 bits (63), Expect = 2.7
Identities = 17/61 (27%), Positives = 32/61 (52%), Gaps = 1/61 (1%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLESC 161
+ + +E EEE + ++E +KE E K++ E KE++ + + E E + R L+
Sbjct: 29 AEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEI-HKLRNEFEKELRERRNELQKL 87
Query: 162 E 162
E
Sbjct: 88 E 88
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
Genome duplication is precisely regulated by
cyclin-dependent kinases CDKs, which bring about the
onset of S phase by activating replication origins and
then prevent relicensing of origins until mitosis is
completed. The optimum sequence motif for CDK
phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
to have at least 11 potential phosphorylation sites.
Drc1 is required for DNA synthesis and S-M replication
checkpoint control. Drc1 associates with Cdc2 and is
phosphorylated at the onset of S phase when Cdc2 is
activated. Thus Cdc2 promotes DNA replication by
phosphorylating Drc1 and regulating its association with
Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
substrates required for DNA replication.
Length = 397
Score = 31.7 bits (72), Expect = 0.16
Identities = 11/46 (23%), Positives = 24/46 (52%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
EE + +E+ + E E E+E+ E+ +EE +++++ K
Sbjct: 316 IHEEIPKLDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVK 361
Score = 27.5 bits (61), Expect = 4.0
Identities = 10/30 (33%), Positives = 16/30 (53%)
Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEE 135
EEE ++E + ++E E EE +EE
Sbjct: 242 EEESIDDELDVLREIEAEEAGIGPIEEEVV 271
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 31.5 bits (72), Expect = 0.17
Identities = 17/51 (33%), Positives = 32/51 (62%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
++ K+ E E+EEE+ + +K +E+ E+ EE++E+ EEE+ E E +
Sbjct: 122 ESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKE 172
Score = 30.0 bits (68), Expect = 0.45
Identities = 19/52 (36%), Positives = 34/52 (65%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
E + + E + E+EEE++ ++ +E+ ++E EE+E+E EEEE E E+ K
Sbjct: 122 ESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEK 173
Score = 29.6 bits (67), Expect = 0.71
Identities = 21/65 (32%), Positives = 38/65 (58%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
+D E E V + I+ + E EEE++E K+ +++ +EE +++EEE + EE E
Sbjct: 110 EDSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEA 169
Query: 145 EEEET 149
E+E+
Sbjct: 170 EKEKA 174
Score = 29.2 bits (66), Expect = 0.95
Identities = 15/46 (32%), Positives = 26/46 (56%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
E + E ++E E + E EEE+++ K+ +E E E++EEE
Sbjct: 117 EWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAA 162
Score = 28.5 bits (64), Expect = 1.6
Identities = 17/42 (40%), Positives = 25/42 (59%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEE 141
+ K E +K +E+ +E EE EEE +E++ E EKEK E
Sbjct: 135 EEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASE 176
Score = 28.1 bits (63), Expect = 1.9
Identities = 18/52 (34%), Positives = 30/52 (57%)
Query: 98 YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
E+ KK+E E+ E +++ EEE+ E EE E ++E + E +KE E +
Sbjct: 79 KEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEGEWIDVESDKEIESS 130
Score = 27.7 bits (62), Expect = 2.5
Identities = 14/46 (30%), Positives = 29/46 (63%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
+ E ++E E E+E+E++E +K +E+ E++ EE++EE +
Sbjct: 119 IDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEE 164
Score = 27.7 bits (62), Expect = 2.9
Identities = 15/52 (28%), Positives = 30/52 (57%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
D+ + + + ++E E E++EE++E KK +E+ ++ EE EEE +
Sbjct: 112 SDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAE 163
>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
Length = 455
Score = 31.4 bits (71), Expect = 0.17
Identities = 25/78 (32%), Positives = 37/78 (47%), Gaps = 13/78 (16%)
Query: 92 EKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEE-------------EEKEKKEEEEKEK 138
EK E I KE + EEE E+EV++ KEEE E ++ +E KE
Sbjct: 367 EKDVEEGEKTIVKEARQIEEELEKEVEKLGKEEESLFKRVALEEGLKELEQDEENFLKEL 426
Query: 139 MEEEEKEEEETKYLVRDV 156
+EE++ E+ K +V
Sbjct: 427 SKEEKELLEKLKMEASEV 444
Score = 28.3 bits (63), Expect = 2.1
Identities = 15/48 (31%), Positives = 28/48 (58%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
+ ++K+ EE E+ +E ++ E+E E+E EK +EE+ + EE
Sbjct: 364 ERLEKDVEEGEKTIVKEARQIEEELEKEVEKLGKEEESLFKRVALEEG 411
Score = 27.1 bits (60), Expect = 4.9
Identities = 18/48 (37%), Positives = 28/48 (58%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
DN E E E++V+E EK +E + EEE ++++E+ KEEE
Sbjct: 353 DNTCGPEPPLLERLEKDVEEGEKTIVKEARQIEEELEKEVEKLGKEEE 400
>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
(nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
phosphoribosyltransferase) [Coenzyme metabolism].
Length = 620
Score = 31.4 bits (71), Expect = 0.18
Identities = 19/64 (29%), Positives = 27/64 (42%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
DD +E +D E+ + EE E +E +E+E E E EE E E
Sbjct: 224 DDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEAS 283
Query: 145 EEEE 148
E+ E
Sbjct: 284 EDSE 287
Score = 29.8 bits (67), Expect = 0.59
Identities = 18/70 (25%), Positives = 35/70 (50%), Gaps = 4/70 (5%)
Query: 83 VLDDQAKETEKAGVLYEDNIKKEEEEKEEEE----EEEVKEEEKEEEEEKEKKEEEEKEK 138
V+ D + A +D I+++ +E++ ++ E E +EE E ++ EE+E E
Sbjct: 208 VVRDMLGSMDMAEETGDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEA 267
Query: 139 MEEEEKEEEE 148
+ E +E E
Sbjct: 268 TDGEGEEGEM 277
Score = 29.0 bits (65), Expect = 1.3
Identities = 18/65 (27%), Positives = 30/65 (46%), Gaps = 5/65 (7%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
DDQ E + ++E E +E EE+E + + E EE + E ++ +E
Sbjct: 238 DDQPDNNED-----SEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSESDESD 292
Query: 145 EEEET 149
E+ ET
Sbjct: 293 EDTET 297
Score = 28.7 bits (64), Expect = 1.4
Identities = 15/51 (29%), Positives = 23/51 (45%), Gaps = 5/51 (9%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKK-----EEEEKEKMEEEEKEEEETK 150
EE E +E EE+ E E EE E E+ E ++ +E+ + E
Sbjct: 252 EESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSESDESDEDTETPGEDA 302
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 30.8 bits (70), Expect = 0.20
Identities = 17/57 (29%), Positives = 27/57 (47%), Gaps = 5/57 (8%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEE-----EKEKKEEEEKEKMEEEEKEEEETK 150
D+ + +++ + + E E EK KKE E+++ EEEEK EE K
Sbjct: 119 ADDSDSSSDSDSSDDDSDDDDSEDETAALLRELEKIKKERAEEKEREEEEKAAEEEK 175
Score = 29.7 bits (67), Expect = 0.62
Identities = 17/51 (33%), Positives = 32/51 (62%), Gaps = 2/51 (3%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEE--KEEEEEKEKKEEEEKEKMEEEEKEEE 147
+ ++++ E+E ++E E K+E E++++EEEEK EE+ +EEE
Sbjct: 130 SSDDDSDDDDSEDETAALLRELEKIKKERAEEKEREEEEKAAEEEKAREEE 180
Score = 28.9 bits (65), Expect = 0.96
Identities = 21/45 (46%), Positives = 28/45 (62%), Gaps = 1/45 (2%)
Query: 85 DDQAKETEKAGVLYE-DNIKKEEEEKEEEEEEEVKEEEKEEEEEK 128
DD E E A +L E + IKKE E++E EEEE EE++ EE+
Sbjct: 136 DDDDSEDETAALLRELEKIKKERAEEKEREEEEKAAEEEKAREEE 180
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 30.4 bits (69), Expect = 0.22
Identities = 20/51 (39%), Positives = 35/51 (68%)
Query: 101 NIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
+I+++ EEKE E+EE+ ++EEKE+E K ++EE+EK EE ++ +
Sbjct: 13 DIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPL 63
Score = 26.1 bits (58), Expect = 7.3
Identities = 19/46 (41%), Positives = 30/46 (65%)
Query: 115 EEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLES 160
EE +EE++ E+EEKE+KEE+EKE + ++EE K L ++
Sbjct: 15 EEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKN 60
>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
protein TraG; Provisional.
Length = 942
Score = 31.3 bits (71), Expect = 0.22
Identities = 14/66 (21%), Positives = 23/66 (34%), Gaps = 8/66 (12%)
Query: 98 YEDNIKKEEEEKEEEEEE------EVKEEEKEEEEEKEKK--EEEEKEKMEEEEKEEEET 149
YE NI + EE E++ K E + K EE+ ++ EE
Sbjct: 870 YEGNIGDTQNSIRGEENTVKGQYSELQNHHKTEALSQNNKYNEEKSAQERMPGADSPEEL 929
Query: 150 KYLVRD 155
++
Sbjct: 930 MKRAKE 935
Score = 30.9 bits (70), Expect = 0.35
Identities = 15/65 (23%), Positives = 28/65 (43%), Gaps = 2/65 (3%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEE--KEKMEEE 142
D+ E E +++I+ EE + + E + E + K EE+ +E+M
Sbjct: 864 DNMVTEYEGNIGDTQNSIRGEENTVKGQYSELQNHHKTEALSQNNKYNEEKSAQERMPGA 923
Query: 143 EKEEE 147
+ EE
Sbjct: 924 DSPEE 928
Score = 28.6 bits (64), Expect = 1.9
Identities = 10/40 (25%), Positives = 19/40 (47%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
E + + +E+ +E + + EE K KE ++K K
Sbjct: 902 EALSQNNKYNEEKSAQERMPGADSPEELMKRAKEYQDKHK 941
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 197
Score = 30.8 bits (69), Expect = 0.23
Identities = 19/53 (35%), Positives = 36/53 (67%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
++E EE EE E E +EE+ +E E++E++E E +E+ EE+ ++ E + + +D
Sbjct: 132 RQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQD 184
Score = 28.5 bits (63), Expect = 1.4
Identities = 16/45 (35%), Positives = 32/45 (71%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
++ EE+ +E E++E +E E+EE+ +E+ ++ E E +E++E E E
Sbjct: 145 ERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQDESETE 189
Score = 27.7 bits (61), Expect = 2.5
Identities = 19/64 (29%), Positives = 41/64 (64%)
Query: 87 QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
+ ++ EKA E+ ++ E E+ EE+ +E +++E++E E +E+ EE+ + E E E+
Sbjct: 124 RQEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQ 183
Query: 147 EETK 150
+E++
Sbjct: 184 DESE 187
Score = 27.7 bits (61), Expect = 2.8
Identities = 18/64 (28%), Positives = 39/64 (60%)
Query: 86 DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
DQ + +K + + ++ +E+ E+ +E + EE+ E E +E+K +E +++ E+E +
Sbjct: 105 DQLRSVKKDDIKKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQERER 164
Query: 146 EEET 149
EE+T
Sbjct: 165 EEQT 168
Score = 26.6 bits (58), Expect = 6.8
Identities = 15/45 (33%), Positives = 30/45 (66%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
++E++E++E E EE EE+ ++ E E E++E E +++K E+
Sbjct: 153 EREDQEEQEREREEQTIEEQSDDSEHEIIEQDESETESDDDKTEK 197
>gnl|CDD|220605 pfam10156, Med17, Subunit 17 of Mediator complex. This Mediator
complex subunit was formerly known as Srb4 in yeasts or
Trap80 in Drosophila and human. The Med17 subunit is
located within the head domain and is essential for cell
viability to the extent that a mutant strain of
cerevisiae lacking it shows all RNA polymerase
II-dependent transcription ceasing at non-permissive
temperatures.
Length = 454
Score = 31.2 bits (71), Expect = 0.24
Identities = 12/42 (28%), Positives = 25/42 (59%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKME 140
++ K + E+ +EEE++ +++ + EE K+ EE K + E
Sbjct: 60 KEAAKIDFSEESDEEEDDEEDDNDDSEENKDTVEEFPKARDE 101
Score = 30.0 bits (68), Expect = 0.46
Identities = 16/44 (36%), Positives = 25/44 (56%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
EE +EE +E K + EE +E+E EE++ + EE + EE
Sbjct: 51 EESLREEIAKEAAKIDFSEESDEEEDDEEDDNDDSEENKDTVEE 94
Score = 28.1 bits (63), Expect = 2.6
Identities = 12/50 (24%), Positives = 24/50 (48%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
+ EE EE+ +E + + +E EEE+ E+ + ++ EE +
Sbjct: 43 RGSFRDLTEESLREEIAKEAAKIDFSEESDEEEDDEEDDNDDSEENKDTV 92
>gnl|CDD|216831 pfam01988, VIT1, VIT family. This family includes the vacuolar
Fe2+/Mn2+ uptake transporter, Ccc1 and the vacuolar iron
transporter VIT1.
Length = 209
Score = 30.7 bits (70), Expect = 0.25
Identities = 13/39 (33%), Positives = 21/39 (53%)
Query: 121 EKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLE 159
E++ EE + ++E+ E E E E+EE Y R + E
Sbjct: 59 ERDLEEAELEREKRELENDPEGEREELAEIYRARGLSEE 97
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 30.8 bits (70), Expect = 0.26
Identities = 23/78 (29%), Positives = 41/78 (52%), Gaps = 11/78 (14%)
Query: 84 LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEK------- 136
L Q K +K +L+E + + K + E + K + + +E++KEKK+EE+K
Sbjct: 289 LKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERL 348
Query: 137 ----EKMEEEEKEEEETK 150
EK+E + ++EE K
Sbjct: 349 EERIEKLEVQATDKEENK 366
>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
binding fragment. Topoisomerase I promotes the
relaxation of DNA superhelical tension by introducing a
transient single-stranded break in duplex DNA and are
vital for the processes of replication, transcription,
and recombination. This family may be more than one
structural domain.
Length = 215
Score = 30.6 bits (70), Expect = 0.26
Identities = 15/39 (38%), Positives = 23/39 (58%)
Query: 121 EKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLE 159
E E+E++K +EE+K EE++K EE Y + D E
Sbjct: 96 EAEKEKKKAMSKEEKKAIKEEKDKLEEPYGYCLVDGRKE 134
Score = 27.2 bits (61), Expect = 3.8
Identities = 13/33 (39%), Positives = 22/33 (66%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEE 134
I + E ++E+++ KEE+K +EEK+K EE
Sbjct: 91 IYEYFEAEKEKKKAMSKEEKKAIKEEKDKLEEP 123
>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
Provisional.
Length = 695
Score = 31.1 bits (71), Expect = 0.28
Identities = 13/46 (28%), Positives = 25/46 (54%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
I+ E+EK++ EE + + E ++ E+EK E + K E + +
Sbjct: 438 IRAIEQEKKKAEEAKARFEARQARLEREKAAREARHKKAAEARAAK 483
Score = 26.4 bits (59), Expect = 8.5
Identities = 13/38 (34%), Positives = 19/38 (50%), Gaps = 5/38 (13%)
Query: 118 KEEEKEEEEEKEKKEE-----EEKEKMEEEEKEEEETK 150
K E + E+EK+K EE E ++ E EK E +
Sbjct: 435 KAEIRAIEQEKKKAEEAKARFEARQARLEREKAAREAR 472
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 30.7 bits (70), Expect = 0.28
Identities = 11/41 (26%), Positives = 23/41 (56%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
+K E++ E+EE E+ ++K E K+ K + + K + +
Sbjct: 419 RKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPD 459
Score = 30.7 bits (70), Expect = 0.32
Identities = 11/37 (29%), Positives = 19/37 (51%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKM 139
K+ + EK+ E+EE K K++ E KK + +
Sbjct: 417 KQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGET 453
Score = 29.1 bits (66), Expect = 1.0
Identities = 17/51 (33%), Positives = 28/51 (54%), Gaps = 1/51 (1%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
D E E+EE E + E+++ +K++K E++ EK EE EK + K
Sbjct: 389 DKPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEK-EEAEKAAAKKK 438
Score = 28.0 bits (63), Expect = 2.5
Identities = 15/60 (25%), Positives = 28/60 (46%), Gaps = 6/60 (10%)
Query: 97 LYEDNIKKEEEEKEEEEEEEVKEEEKE------EEEEKEKKEEEEKEKMEEEEKEEEETK 150
L++ + E EE+E E E K+ + E+K +KEE EK +++ + +
Sbjct: 387 LHDKPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKA 446
Score = 27.2 bits (61), Expect = 4.1
Identities = 10/51 (19%), Positives = 29/51 (56%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
N+ E +K +++ + +++ ++EE EK +++ + ++ + + ETK
Sbjct: 404 GNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETK 454
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
Length = 191
Score = 30.5 bits (69), Expect = 0.28
Identities = 13/45 (28%), Positives = 22/45 (48%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
KK+++EKEE E +E EE E+ K+ + + + E
Sbjct: 68 KKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQNTAAEV 112
Score = 29.4 bits (66), Expect = 0.60
Identities = 14/44 (31%), Positives = 24/44 (54%)
Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
+K+++E+EEV E EE + E+ EE +K+ E K +
Sbjct: 67 TKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQNTAA 110
Score = 27.1 bits (60), Expect = 4.2
Identities = 13/48 (27%), Positives = 26/48 (54%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
K +K+ ++ + K++EKEE E +E + E+ EE +K+ +
Sbjct: 55 KTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYEL 102
Score = 25.9 bits (57), Expect = 8.4
Identities = 17/57 (29%), Positives = 25/57 (43%)
Query: 93 KAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
K KKE+EE E EE+ + E+ EE +K+ E K + E E+
Sbjct: 61 KKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQNTAAEVNHEDV 117
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 31.0 bits (70), Expect = 0.29
Identities = 16/37 (43%), Positives = 23/37 (62%)
Query: 112 EEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
E E E K +EE E K+KKEE+ KEK ++ K ++
Sbjct: 5 ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQK 41
Score = 29.9 bits (67), Expect = 0.64
Identities = 14/44 (31%), Positives = 28/44 (63%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
E EK+ EEE++ ++K+EE+ KEK+ ++ K +E + + +
Sbjct: 6 SEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQA 49
Score = 29.5 bits (66), Expect = 0.97
Identities = 14/42 (33%), Positives = 27/42 (64%)
Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
E E E++ E + E K+++EEK K++E +K K ++E + +
Sbjct: 5 ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAK 46
Score = 27.2 bits (60), Expect = 5.4
Identities = 14/45 (31%), Positives = 29/45 (64%), Gaps = 1/45 (2%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEK-KEEEEKEKMEEEEKEE 146
KK E+E E +++ +E+ KE+E +K K ++E K K++ ++ +
Sbjct: 10 KKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASD 54
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 30.4 bits (69), Expect = 0.29
Identities = 20/57 (35%), Positives = 33/57 (57%), Gaps = 2/57 (3%)
Query: 105 EEEEKE--EEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLE 159
+++ K+ +EEE+E KEEEK + K K ++ K K+EE+EK + E + E
Sbjct: 30 DDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEE 86
Score = 28.5 bits (64), Expect = 1.6
Identities = 19/64 (29%), Positives = 36/64 (56%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
+D+ KE EKA V + KK + K EE+E+ +E+E++ E E+ E++ + +
Sbjct: 41 EDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRLR 100
Query: 145 EEEE 148
+ +E
Sbjct: 101 KLQE 104
>gnl|CDD|184724 PRK14520, rpsP, 30S ribosomal protein S16; Provisional.
Length = 155
Score = 30.0 bits (68), Expect = 0.29
Identities = 8/61 (13%), Positives = 19/61 (31%), Gaps = 2/61 (3%)
Query: 87 QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
+ K ++ L+ + + + E K++ E E + +EE
Sbjct: 97 EPKPSKLE--LFNAALAEADGGPTAEATTPKKKKAAAEAAAAEAAAPAAEAAAAAAAEEE 154
Query: 147 E 147
Sbjct: 155 A 155
Score = 26.6 bits (59), Expect = 5.5
Identities = 8/39 (20%), Positives = 9/39 (23%)
Query: 110 EEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
E E K+KK E E E
Sbjct: 107 NAALAEADGGPTAEATTPKKKKAAAEAAAAEAAAPAAEA 145
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 30.8 bits (70), Expect = 0.30
Identities = 16/34 (47%), Positives = 21/34 (61%)
Query: 96 VLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKE 129
K EEEE+E+EEEEE +EE +E E E+
Sbjct: 340 TWVNPEQKDEEEEQEDEEEEEEEEEPEEPEPEEG 373
Score = 29.6 bits (67), Expect = 0.67
Identities = 15/31 (48%), Positives = 23/31 (74%)
Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEE 134
E++ EEEE+E+ +EEE+EEE E+ + EE
Sbjct: 343 NPEQKDEEEEQEDEEEEEEEEEPEEPEPEEG 373
Score = 29.6 bits (67), Expect = 0.77
Identities = 13/31 (41%), Positives = 24/31 (77%)
Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
++++EEEE ++EE+EEEEE+ ++ E E+
Sbjct: 343 NPEQKDEEEEQEDEEEEEEEEEPEEPEPEEG 373
Score = 29.3 bits (66), Expect = 0.88
Identities = 17/33 (51%), Positives = 24/33 (72%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
E+K+EEEE+E +EEE+EEEE +E + EE
Sbjct: 343 NPEQKDEEEEQEDEEEEEEEEEPEEPEPEEGPP 375
Score = 28.9 bits (65), Expect = 1.4
Identities = 15/30 (50%), Positives = 25/30 (83%)
Query: 101 NIKKEEEEKEEEEEEEVKEEEKEEEEEKEK 130
N ++++EE+E+E+EEE +EEE+ EE E E+
Sbjct: 343 NPEQKDEEEEQEDEEEEEEEEEPEEPEPEE 372
Score = 28.5 bits (64), Expect = 1.7
Identities = 17/32 (53%), Positives = 24/32 (75%)
Query: 117 VKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
V E+K+EEEE+E +EEEE+E+ EE + EE
Sbjct: 342 VNPEQKDEEEEQEDEEEEEEEEEPEEPEPEEG 373
Score = 27.7 bits (62), Expect = 2.7
Identities = 15/42 (35%), Positives = 27/42 (64%)
Query: 101 NIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
+I + E++ +EEE+E+EEE+E++EE E+ + EE
Sbjct: 332 HILPQGRCTWVNPEQKDEEEEQEDEEEEEEEEEPEEPEPEEG 373
Score = 27.3 bits (61), Expect = 4.1
Identities = 19/35 (54%), Positives = 26/35 (74%)
Query: 114 EEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
E E++E E+EEEEE+ + +EE E EEEE+E EE
Sbjct: 164 EVELREGEEEEEEEEVGEADEEDEGEEEEEEEPEE 198
Score = 26.9 bits (60), Expect = 5.8
Identities = 21/43 (48%), Positives = 28/43 (65%), Gaps = 1/43 (2%)
Query: 108 EKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE-EKEEEET 149
E E E EE +EEE+ E ++E + EEE+E+ EE KEEE T
Sbjct: 164 EVELREGEEEEEEEEVGEADEEDEGEEEEEEEPEEVPKEEEGT 206
Score = 26.6 bits (59), Expect = 7.1
Identities = 12/33 (36%), Positives = 23/33 (69%)
Query: 117 VKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
++++EE+E+++EEE+E+ EE E+ E E
Sbjct: 340 TWVNPEQKDEEEEQEDEEEEEEEEEPEEPEPEE 372
Score = 26.2 bits (58), Expect = 9.7
Identities = 20/42 (47%), Positives = 26/42 (61%)
Query: 95 GVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEK 136
L E ++EEEE E +EE+ EEE+EEE E+ KEEE
Sbjct: 165 VELREGEEEEEEEEVGEADEEDEGEEEEEEEPEEVPKEEEGT 206
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 30.7 bits (69), Expect = 0.30
Identities = 15/46 (32%), Positives = 24/46 (52%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
+ +E E++ + K E E+ E EE E+EK EEE ++ K
Sbjct: 294 GNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGK 339
Score = 29.5 bits (66), Expect = 0.83
Identities = 16/57 (28%), Positives = 29/57 (50%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
E+ K E + E E E+ +E EEEK ++E +K ++ +K + + L +D
Sbjct: 298 EEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKD 354
Score = 28.4 bits (63), Expect = 2.1
Identities = 11/48 (22%), Positives = 30/48 (62%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
I+++E+ +E EEE+ +E ++ +K KK + +K +++++ + +
Sbjct: 314 IEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDD 361
Score = 28.0 bits (62), Expect = 2.6
Identities = 17/64 (26%), Positives = 32/64 (50%), Gaps = 2/64 (3%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
E+ L + K E E++E+ EE EEEK EEE K+ ++ +K++ ++
Sbjct: 292 ASGNDPEEREDKLSPEIPAKPEIEQDEDSEE--SEEEKNEEEGGLSKKGKKLKKLKGKKN 349
Query: 145 EEEE 148
++
Sbjct: 350 GLDK 353
Score = 26.5 bits (58), Expect = 8.8
Identities = 11/45 (24%), Positives = 25/45 (55%)
Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
EE +EE+ EEE +K ++ +K K ++ +K + + ++ +
Sbjct: 319 DSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSD 363
>gnl|CDD|234655 PRK00133, metG, methionyl-tRNA synthetase; Reviewed.
Length = 673
Score = 30.9 bits (71), Expect = 0.30
Identities = 10/61 (16%), Positives = 21/61 (34%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLESCE 162
+ E+++ E E KE + +E + E ++ K +R + E
Sbjct: 528 TRIEDKQIEALIEASKEAAAAKAAAAAAAAPLAEEPIAETISFDDFAKVDLRVAKIVEAE 587
Query: 163 F 163
Sbjct: 588 K 588
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 30.8 bits (70), Expect = 0.31
Identities = 17/53 (32%), Positives = 30/53 (56%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVR 154
+ K +EE+ E+E +EEE+ E EK+ ++ +E +E E+E E K +
Sbjct: 394 LSKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELE 446
Score = 28.1 bits (63), Expect = 2.5
Identities = 13/37 (35%), Positives = 19/37 (51%)
Query: 115 EEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
E + KEEE +EK+ EE+E+ E E+ K
Sbjct: 391 AEALSKVKEEERPREKEGTEEEERREITVYEKRIKKL 427
Score = 27.7 bits (62), Expect = 3.4
Identities = 23/76 (30%), Positives = 38/76 (50%)
Query: 87 QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
+ + EK G E+ + EK ++ EE E +EE E +++ EE K ++E+ E E
Sbjct: 400 EERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEKLESEL 459
Query: 147 EETKYLVRDVHLESCE 162
E + VRD + E
Sbjct: 460 ERFRREVRDKVRKDRE 475
Score = 27.0 bits (60), Expect = 6.5
Identities = 18/51 (35%), Positives = 26/51 (50%)
Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDV 156
+ KEEE E + E+EE E E+ K+ E E+ EEE L R++
Sbjct: 395 SKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKREL 445
>gnl|CDD|227441 COG5110, RPN1, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 881
Score = 30.7 bits (69), Expect = 0.31
Identities = 15/55 (27%), Positives = 33/55 (60%), Gaps = 2/55 (3%)
Query: 104 KEEEEKEEE--EEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDV 156
+E +K+++ +E+ EK+ +K+KK+EEE++ EE+ + + + LV +
Sbjct: 2 VDESDKKQQTIDEQSQISPEKQTPNKKDKKKEEEEQLSEEDAMLKGDLELLVERI 56
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 30.8 bits (70), Expect = 0.31
Identities = 18/53 (33%), Positives = 28/53 (52%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
E+ +++ +EK EE +++EE E KE E++ E EKEE KY
Sbjct: 187 EEELERALKEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKY 239
Score = 29.6 bits (67), Expect = 0.72
Identities = 16/46 (34%), Positives = 24/46 (52%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
+K EEEE+ E +E +EE + EE+ E KE E++ E
Sbjct: 182 LKAEEEEELERALKEKREELLSKLEEELLARLESKEAALEKQLRLE 227
Score = 28.8 bits (65), Expect = 1.1
Identities = 15/58 (25%), Positives = 27/58 (46%)
Query: 92 EKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
EK L ++ E +E K+ E E EKE+ ++ +EK+ +E + + E
Sbjct: 196 EKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEEKLRQELERQAEA 253
Score = 26.1 bits (58), Expect = 9.1
Identities = 13/49 (26%), Positives = 27/49 (55%), Gaps = 1/49 (2%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKE-EEEKEKMEEEEKEEEETK 150
++E K EEE E ++ E++ + E E EKE++ ++ +E+ +
Sbjct: 198 REELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEEKLRQE 246
Score = 26.1 bits (58), Expect = 9.3
Identities = 14/45 (31%), Positives = 22/45 (48%)
Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
K EEE+E E + K EE + E+E E ++ E++ E
Sbjct: 183 KAEEEEELERALKEKREELLSKLEEELLARLESKEAALEKQLRLE 227
Score = 26.1 bits (58), Expect = 9.4
Identities = 18/46 (39%), Positives = 26/46 (56%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
KK E K EEEEE + +++ EE K EEE ++E +E E+
Sbjct: 177 KKLAELKAEEEEELERALKEKREELLSKLEEELLARLESKEAALEK 222
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 30.6 bits (70), Expect = 0.33
Identities = 21/44 (47%), Positives = 30/44 (68%), Gaps = 2/44 (4%)
Query: 107 EEKEEEEEEEVKE-EEKEEEEEKEKKE-EEEKEKMEEEEKEEEE 148
EE E E E++ +E E +E EK K+E EE+KEK++EEE + E
Sbjct: 526 EELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLE 569
Score = 29.0 bits (66), Expect = 1.1
Identities = 20/65 (30%), Positives = 41/65 (63%), Gaps = 8/65 (12%)
Query: 84 LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
L+ +A+E E + KE E+ +EE EE+ ++ ++EE++ E+ E+E ++ ++E +
Sbjct: 532 LEQKAEEAEA--------LLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAK 583
Query: 144 KEEEE 148
KE +E
Sbjct: 584 KEADE 588
>gnl|CDD|232975 TIGR00437, feoB, ferrous iron transporter FeoB. FeoB (773 amino
acids in E. coli), a cytoplasmic membrane protein
required for iron(II) update, is encoded in an operon
with FeoA (75 amino acids), which is also required, and
is regulated by Fur. There appear to be two copies in
Archaeoglobus fulgidus and Clostridium acetobutylicum
[Transport and binding proteins, Cations and iron
carrying compounds].
Length = 591
Score = 30.5 bits (69), Expect = 0.35
Identities = 14/80 (17%), Positives = 29/80 (36%), Gaps = 2/80 (2%)
Query: 49 EEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLLCVLDDQAKETEKAGVLYEDNIKKEEEE 108
E E + + + + + LL +L+D + E EK + + KE
Sbjct: 164 IEIVPEAYQVVEVVEGLIEIIYSISKRGLEILLGLLEDLSLEIEKIERNLAEVVIKESPS 223
Query: 109 KEEEEEEEVKEEEKEEEEEK 128
E+ +E++ E+
Sbjct: 224 N--LSPTEIADEDRVLVEKS 241
>gnl|CDD|237028 PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed.
Length = 648
Score = 30.5 bits (70), Expect = 0.37
Identities = 9/32 (28%), Positives = 19/32 (59%)
Query: 116 EVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
+V+EE +E+ E +E E+ E++ ++ E
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542
Score = 30.2 bits (69), Expect = 0.55
Identities = 10/32 (31%), Positives = 20/32 (62%)
Query: 112 EEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
+ EEE+ +++ E K+ EE+++K E+ E
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542
Score = 29.8 bits (68), Expect = 0.59
Identities = 9/32 (28%), Positives = 18/32 (56%)
Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
+ E+E +E E +E E+++K+ E+ E
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542
Score = 29.4 bits (67), Expect = 0.78
Identities = 11/30 (36%), Positives = 18/30 (60%)
Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEE 133
+EE +E+ E +E EE+E+K +K E
Sbjct: 513 EEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542
Score = 29.4 bits (67), Expect = 0.96
Identities = 11/32 (34%), Positives = 18/32 (56%)
Query: 110 EEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEE 141
+ EEE +E+ E KE +E+E+K + E
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542
Score = 29.0 bits (66), Expect = 1.2
Identities = 11/33 (33%), Positives = 20/33 (60%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEE 135
+ + EE+ +E+++ +E EEKEKK E+
Sbjct: 509 RIDVEEEIAYIKEQMEGSAPKEPEEKEKKPEKP 541
Score = 27.8 bits (63), Expect = 2.9
Identities = 11/33 (33%), Positives = 19/33 (57%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEE 134
I EEE +E+ E ++ EE+EK+ ++ E
Sbjct: 510 IDVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542
Score = 27.8 bits (63), Expect = 3.0
Identities = 7/31 (22%), Positives = 16/31 (51%)
Query: 120 EEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
+ +EE +++ E K EE++++ E
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKP 541
Score = 27.5 bits (62), Expect = 4.2
Identities = 7/32 (21%), Positives = 17/32 (53%)
Query: 114 EEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
+ E + +E+ E +E E+++ + E+ E
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542
Score = 27.1 bits (61), Expect = 5.2
Identities = 11/31 (35%), Positives = 17/31 (54%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEE 135
EEE +E+ KE EE+++K E+ E
Sbjct: 512 VEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542
Score = 27.1 bits (61), Expect = 5.6
Identities = 10/30 (33%), Positives = 18/30 (60%)
Query: 119 EEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
+ E+E KE+ E ++ EE+EK+ E+
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEK 540
Score = 26.7 bits (60), Expect = 7.7
Identities = 8/31 (25%), Positives = 17/31 (54%)
Query: 108 EKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
+ EEE ++ E +E E+KE++ ++
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKP 541
>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32. This family
consists of several mammalian specific proacrosin
binding protein sp32 sequences. sp32 is a sperm specific
protein which is known to bind with with 55- and 53-kDa
proacrosins and the 49-kDa acrosin intermediate. The
exact function of sp32 is unclear, it is thought however
that the binding of sp32 to proacrosin may be involved
in packaging the acrosin zymogen into the acrosomal
matrix.
Length = 243
Score = 30.4 bits (68), Expect = 0.38
Identities = 16/45 (35%), Positives = 30/45 (66%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
+++ K +E +E K EEK+ +EE+E++E EE+ K EE + ++
Sbjct: 199 QEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQGTDD 243
Score = 26.5 bits (58), Expect = 6.7
Identities = 14/47 (29%), Positives = 32/47 (68%)
Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
K + K+E+ +++E +E + E+++ +EE++E+ EEE ++EE +
Sbjct: 193 KAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQ 239
Score = 26.5 bits (58), Expect = 7.1
Identities = 13/46 (28%), Positives = 27/46 (58%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
+ K + ++E + +E ++ K EE++ ++ +EEE+ EEE K
Sbjct: 189 SVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAK 234
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 30.4 bits (68), Expect = 0.39
Identities = 22/72 (30%), Positives = 42/72 (58%), Gaps = 1/72 (1%)
Query: 83 VLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
V ++Q +E EKA + + N +E + ++ + ++K+E K E+E E KE E ++K E
Sbjct: 275 VAENQKREIEKAQIEIKKN-DEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPV 333
Query: 143 EKEEEETKYLVR 154
++ ++TK V
Sbjct: 334 AEDLQKTKPQVE 345
Score = 29.2 bits (65), Expect = 1.1
Identities = 12/67 (17%), Positives = 31/67 (46%), Gaps = 4/67 (5%)
Query: 84 LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
D ++ + A +DN K+ +E ++++E + + ++ ++ + + E E
Sbjct: 229 ADKAQQKADFA----QDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIE 284
Query: 144 KEEEETK 150
K + E K
Sbjct: 285 KAQIEIK 291
Score = 28.0 bits (62), Expect = 2.2
Identities = 13/59 (22%), Positives = 28/59 (47%)
Query: 92 EKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
E V + E+ + E ++ +E + K+ + K+E + EK E+++ E + K
Sbjct: 271 EDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKK 329
>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit. This model
describes Pseudomonas denitrificans CobT gene product,
which is a cobalt chelatase subunit that functions in
cobalamin biosynthesis. Cobalamin (vitamin B12) can be
synthesized via several pathways, including an aerobic
pathway (found in Pseudomonas denitrificans) and an
anaerobic pathway (found in P. shermanii and Salmonella
typhimurium). These pathways differ in the point of
cobalt insertion during corrin ring formation. There are
apparently a number of variations on these two pathways,
where the major differences seem to be concerned with
the process of ring contraction. Confusion regarding the
functions of enzymes found in the aerobic vs. anaerobic
pathways has arisen because nonhomologous genes in these
different pathways were given the same gene symbols.
Thus, cobT in the aerobic pathway (P. denitrificans) is
not a homolog of cobT in the anaerobic pathway (S.
typhimurium). It should be noted that E. coli
synthesizes cobalamin only when it is supplied with the
precursor cobinamide, which is a complex intermediate.
Additionally, all E. coli cobalamin synthesis genes
(cobU, cobS and cobT) were named after their Salmonella
typhimurium homologs which function in the anaerobic
cobalamin synthesis pathway. This model describes the
aerobic cobalamin pathway Pseudomonas denitrificans CobT
gene product, which is a cobalt chelatase subunit, with
a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
heterotrimeric, ATP-dependent enzyme that catalyzes
cobalt insertion during cobalamin biosynthesis. The
other two subunits are the P. denitrificans CobS
(TIGR01650) and CobN (pfam02514 CobN/Magnesium
Chelatase) proteins. To avoid potential confusion with
the nonhomologous Salmonella typhimurium/E.coli cobT
gene product, the P. denitrificans gene symbol is not
used in the name of this model [Biosynthesis of
cofactors, prosthetic groups, and carriers, Heme,
porphyrin, and cobalamin].
Length = 600
Score = 30.7 bits (69), Expect = 0.39
Identities = 16/64 (25%), Positives = 31/64 (48%), Gaps = 3/64 (4%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
DD E E+ G +D+ E E++E+ E E +E +E + E E + E +
Sbjct: 205 DDTESEDEEDG---DDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQS 261
Query: 145 EEEE 148
++++
Sbjct: 262 DQDD 265
Score = 28.4 bits (63), Expect = 1.9
Identities = 13/51 (25%), Positives = 28/51 (54%)
Query: 98 YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
D+ + E+EE ++++ E+E++ E E E +E ++ E ++E E
Sbjct: 203 MGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESES 253
Score = 28.4 bits (63), Expect = 2.2
Identities = 19/79 (24%), Positives = 39/79 (49%), Gaps = 6/79 (7%)
Query: 78 NGLLCVLDDQA------KETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKK 131
+ L +DDQ +E ++ L E+ E E EE+ +++ E ++EE+ + +
Sbjct: 174 DRLSAAIDDQQAFARVVREMLRSMELAEEMGDDTESEDEEDGDDDQPTENEQEEQGEGEG 233
Query: 132 EEEEKEKMEEEEKEEEETK 150
E +E +E E + E++
Sbjct: 234 EGQEGSAPQESEATDRESE 252
>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
Length = 1021
Score = 30.5 bits (68), Expect = 0.41
Identities = 22/69 (31%), Positives = 35/69 (50%), Gaps = 2/69 (2%)
Query: 89 KETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEE--EKEKMEEEEKEE 146
KE L ++K+ E+ E EE E E E+ E E+E+ E E E+E++E + E
Sbjct: 442 KENAHRKALEMKILEKKRIERLEREERERLERERMERIERERLERERLERERLERDRLER 501
Query: 147 EETKYLVRD 155
+ L R+
Sbjct: 502 DRLDRLERE 510
Score = 28.9 bits (64), Expect = 1.5
Identities = 16/52 (30%), Positives = 32/52 (61%)
Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
++E + E ++ E+++ E E+E++E E+E+ME E+E E + L R+
Sbjct: 441 EKENAHRKALEMKILEKKRIERLEREERERLERERMERIERERLERERLERE 492
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 29.7 bits (67), Expect = 0.41
Identities = 19/55 (34%), Positives = 35/55 (63%), Gaps = 1/55 (1%)
Query: 97 LYEDNIKKEEEEKEEE-EEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
L+ + +KE++E +E EE+E + +KE++E+KEKK E+ K + + + TK
Sbjct: 90 LFPEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTK 144
Score = 27.8 bits (62), Expect = 1.8
Identities = 18/48 (37%), Positives = 32/48 (66%)
Query: 101 NIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
N+ EE EKE++E + EE++ + +KEKKE++EK+ E+ K++
Sbjct: 89 NLFPEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKST 136
>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK. Members of this
family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
chaperone system. All members of the seed alignment were
taken from completely sequenced bacterial or archaeal
genomes and (except for Mycoplasma sequence) found
clustered with other genes of this systems. This model
excludes DnaK homologs that are not DnaK itself, such as
the heat shock cognate protein HscA (TIGR01991).
However, it is not designed to distinguish among DnaK
paralogs in eukaryotes. Note that a number of dnaK genes
have shadow ORFs in the same reverse (relative to dnaK)
reading frame, a few of which have been assigned
glutamate dehydrogenase activity. The significance of
this observation is unclear; lengths of such shadow ORFs
are highly variable as if the presumptive protein
product is not conserved [Protein fate, Protein folding
and stabilization].
Length = 595
Score = 30.4 bits (69), Expect = 0.41
Identities = 18/51 (35%), Positives = 26/51 (50%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
ED +KEE E + + EK +E +K EEKEK+E+ E +E
Sbjct: 516 EDKKRKEEIEARNNADSLAYQAEKTLKEAGDKLPAEEKEKIEKAVAELKEA 566
Score = 27.7 bits (62), Expect = 3.3
Identities = 21/58 (36%), Positives = 27/58 (46%), Gaps = 19/58 (32%)
Query: 110 EEEEEEEVKEEEKEEEEEKEKKEE-------------------EEKEKMEEEEKEEEE 148
EEE E VKE E EE+K++KEE E +K+ EEKE+ E
Sbjct: 500 EEEIERMVKEAEANAEEDKKRKEEIEARNNADSLAYQAEKTLKEAGDKLPAEEKEKIE 557
>gnl|CDD|236842 PRK11091, PRK11091, aerobic respiration control sensor protein
ArcB; Provisional.
Length = 779
Score = 30.3 bits (69), Expect = 0.43
Identities = 15/40 (37%), Positives = 22/40 (55%), Gaps = 5/40 (12%)
Query: 100 DNIKK-----EEEEKEEEEEEEVKEEEKEEEEEKEKKEEE 134
DNI + E EK EE +E E+ K E +E+E+ + E
Sbjct: 110 DNIAQLNQEIAEREKAEEARQEAFEQLKNEIKEREETQIE 149
>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT. This family
consists of several bacterial cobalamin biosynthesis
(CobT) proteins. CobT is involved in the transformation
of precorrin-3 into cobyrinic acid.
Length = 282
Score = 30.2 bits (68), Expect = 0.43
Identities = 13/49 (26%), Positives = 24/49 (48%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
D+ E+E+ +E+E++ + EE+E E+ + E E E E
Sbjct: 219 ADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEME 267
Score = 27.5 bits (61), Expect = 2.9
Identities = 9/47 (19%), Positives = 26/47 (55%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
+ E + E+ E+E+ +E++++++ +E++ E+ + EE
Sbjct: 215 EPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEM 261
Score = 27.5 bits (61), Expect = 3.6
Identities = 11/37 (29%), Positives = 23/37 (62%)
Query: 112 EEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
+ EE+ +E + + E + E++ KE ++++ EEEE
Sbjct: 207 DMAEELGDEPESADSEDNEDEDDPKEDEDDDQGEEEE 243
Score = 27.1 bits (60), Expect = 4.6
Identities = 15/51 (29%), Positives = 27/51 (52%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
+++ KE+E+ ++ EEEE + E+ EE E +ME E ++T
Sbjct: 226 DEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAAEASADDT 276
Score = 26.3 bits (58), Expect = 7.9
Identities = 11/54 (20%), Positives = 26/54 (48%)
Query: 98 YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
D + + E E+E++ ++E+ ++ EE+E + + + EE E+
Sbjct: 212 LGDEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGE 265
Score = 26.3 bits (58), Expect = 8.1
Identities = 15/55 (27%), Positives = 28/55 (50%), Gaps = 5/55 (9%)
Query: 99 EDNIKKEEEEKEEEEEEE-----VKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
ED+ K++E++ + EEEE + E+ EE E E E E ++ + ++
Sbjct: 227 EDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAAEASADDTPDSDD 281
>gnl|CDD|222648 pfam14283, DUF4366, Domain of unknown function (DUF4366). This
family of proteins is found in bacteria and eukaryotes.
Proteins in this family are typically between 227 and
387 amino acids in length.
Length = 213
Score = 30.0 bits (68), Expect = 0.44
Identities = 9/40 (22%), Positives = 13/40 (32%)
Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
EE++ + E E E EEE E+
Sbjct: 115 EEEDAAAPTCSCCPVCSVNMTECTGPEPEPEPEPEEEPEK 154
Score = 29.6 bits (67), Expect = 0.61
Identities = 9/40 (22%), Positives = 13/40 (32%)
Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
EEE E E E + + EEE ++
Sbjct: 115 EEEDAAAPTCSCCPVCSVNMTECTGPEPEPEPEPEEEPEK 154
Score = 29.2 bits (66), Expect = 0.72
Identities = 11/40 (27%), Positives = 12/40 (30%)
Query: 111 EEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
EEE+ E E E E EEE K
Sbjct: 115 EEEDAAAPTCSCCPVCSVNMTECTGPEPEPEPEPEEEPEK 154
Score = 28.8 bits (65), Expect = 1.1
Identities = 11/40 (27%), Positives = 13/40 (32%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
EEE+ E + E E E EE EK
Sbjct: 115 EEEDAAAPTCSCCPVCSVNMTECTGPEPEPEPEPEEEPEK 154
Score = 28.5 bits (64), Expect = 1.3
Identities = 8/41 (19%), Positives = 13/41 (31%)
Query: 109 KEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
+EE+ + E E E E+E E+
Sbjct: 115 EEEDAAAPTCSCCPVCSVNMTECTGPEPEPEPEPEEEPEKK 155
Score = 26.9 bits (60), Expect = 4.6
Identities = 8/41 (19%), Positives = 15/41 (36%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
+++E+ V E E + E E E+ E+
Sbjct: 114 MEEEDAAAPTCSCCPVCSVNMTECTGPEPEPEPEPEEEPEK 154
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 30.3 bits (68), Expect = 0.44
Identities = 19/58 (32%), Positives = 37/58 (63%)
Query: 87 QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
QA E E+ L ++ +K +E++K+ EE E+ + E++++EE+ +K E++K E K
Sbjct: 96 QAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAK 153
Score = 29.5 bits (66), Expect = 0.66
Identities = 15/52 (28%), Positives = 34/52 (65%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
+ KKEE+ EE + ++ E+E+ ++ EKE+ + +E++K EE +++ + +
Sbjct: 79 QQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLE 130
Score = 27.6 bits (61), Expect = 3.5
Identities = 16/65 (24%), Positives = 38/65 (58%), Gaps = 3/65 (4%)
Query: 87 QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKE---EEEEKEKKEEEEKEKMEEEE 143
A+E + ++ +K+ E+E+ + +E++ + EE E + E+K+++E+ K E+++
Sbjct: 88 VAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKK 147
Query: 144 KEEEE 148
K E
Sbjct: 148 KAEAA 152
Score = 26.8 bits (59), Expect = 5.2
Identities = 20/66 (30%), Positives = 26/66 (39%), Gaps = 2/66 (3%)
Query: 87 QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEE--EEEKEKKEEEEKEKMEEEEK 144
+A + KK EE + EE + K E + E E K EK K E E K
Sbjct: 155 KAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAK 214
Query: 145 EEEETK 150
+ E K
Sbjct: 215 AKAEKK 220
Score = 26.5 bits (58), Expect = 8.5
Identities = 14/64 (21%), Positives = 24/64 (37%)
Query: 87 QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
K+ E + K E E K + E++ E++ EK+K + K +
Sbjct: 192 AKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAAAAKA 251
Query: 147 EETK 150
E K
Sbjct: 252 AERK 255
Score = 26.5 bits (58), Expect = 9.1
Identities = 16/57 (28%), Positives = 39/57 (68%), Gaps = 2/57 (3%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEE--EEEKEKKEEEEKEKMEEEEKEEEETKYLV 153
+ + KK E++++++EE+ +E + ++ E+E+ K+ E+E+ K +E++K+ EE +
Sbjct: 71 QSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQA 127
>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682). This
domain family is found in eukaryotes, and is typically
between 125 and 136 amino acids in length.
Length = 133
Score = 29.4 bits (66), Expect = 0.45
Identities = 13/38 (34%), Positives = 23/38 (60%)
Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEE 141
+EE+EEEEE E +++ E + ++ ++ E E EE
Sbjct: 94 TRQEEEEEEEENEKQQQSDEAQVQQHQQHEHPAENGEE 131
Score = 27.5 bits (61), Expect = 1.9
Identities = 12/38 (31%), Positives = 24/38 (63%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
+E+EEEEEE K+++ +E + ++ ++ E + EE
Sbjct: 94 TRQEEEEEEEENEKQQQSDEAQVQQHQQHEHPAENGEE 131
Score = 26.3 bits (58), Expect = 5.4
Identities = 10/29 (34%), Positives = 24/29 (82%)
Query: 118 KEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
++EE+EEEEE EK+++ ++ ++++ ++ E
Sbjct: 95 RQEEEEEEEENEKQQQSDEAQVQQHQQHE 123
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 30.4 bits (69), Expect = 0.46
Identities = 19/60 (31%), Positives = 33/60 (55%)
Query: 83 VLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
LD Q E ++A YE E+ + ++ EE++EE EE K KK++ +K++ E+
Sbjct: 381 NLDRQLSELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEK 440
Score = 27.7 bits (62), Expect = 3.6
Identities = 12/51 (23%), Positives = 32/51 (62%)
Query: 98 YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
++ ++++ ++ E E+ E+E ++ E K +K+E+E E++E+ +E +
Sbjct: 271 LDEKFERDKIKQLASELEKKLEKELKKLENKLEKQEDELEELEKAAEELRQ 321
Score = 26.6 bits (59), Expect = 8.5
Identities = 18/53 (33%), Positives = 31/53 (58%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYL 152
D IK+ E E++ E+E+K+ E + E+++++ EE EK E +K E L
Sbjct: 278 DKIKQLASELEKKLEKELKKLENKLEKQEDELEELEKAAEELRQKGELLYANL 330
>gnl|CDD|236278 PRK08506, PRK08506, replicative DNA helicase; Provisional.
Length = 472
Score = 30.0 bits (68), Expect = 0.49
Identities = 16/41 (39%), Positives = 26/41 (63%), Gaps = 1/41 (2%)
Query: 97 LYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
+Y D++ KE EEKE+E++ + KE ++E + K EE E
Sbjct: 384 VYRDDVYKEREEKEKEKKAK-KEGKEERRIHFQNKSIEEAE 423
>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6. RPL6
contains KOW motif that has an extra ribosomal role as
an oncogenic. KOW domain is known as an RNA-binding
motif that is shared so far among some families of
ribosomal proteins, the essential bacterial
transcriptional elongation factor NusG, the eukaryotic
chromatin elongation factor Spt5, the higher eukaryotic
KIN17 proteins and Mtr4. .
Length = 152
Score = 29.4 bits (67), Expect = 0.50
Identities = 13/56 (23%), Positives = 29/56 (51%), Gaps = 1/56 (1%)
Query: 98 YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEE-EKEKMEEEEKEEEETKYL 152
Y KK++++K+E E E K+++ EE+++ ++ + + +K YL
Sbjct: 80 YFKRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKAVDAALLAAIKKVPLLKGYL 135
>gnl|CDD|184899 PRK14906, PRK14906, DNA-directed RNA polymerase subunit beta'/alpha
domain fusion protein; Provisional.
Length = 1460
Score = 30.2 bits (68), Expect = 0.52
Identities = 17/61 (27%), Positives = 28/61 (45%)
Query: 98 YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVH 157
+ D+I +E EE EV + +E E K + E M+ E K+ + L R++
Sbjct: 189 FVDDIGDDERLTAEEVRAEVADIYEEYNERKALRREAFDAFMQIEPKQLISDEALYREMR 248
Query: 158 L 158
L
Sbjct: 249 L 249
Score = 27.5 bits (61), Expect = 4.0
Identities = 20/81 (24%), Positives = 39/81 (48%)
Query: 57 RWRHIEFSDNTLCLQLVEGKPNGLLCVLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEE 116
R HIE + + +G P+ L +LD + K+ EK I ++E EE+ ++
Sbjct: 93 RMGHIELAAPVSHIWYFKGSPSRLGYLLDIKPKDLEKVLYFASYIITSVDKEAREEDADD 152
Query: 117 VKEEEKEEEEEKEKKEEEEKE 137
+++E + EE + + + E
Sbjct: 153 LRDELAADLEELDAERDRLIE 173
>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the monomeric yeast and human topo I
and heterodimeric topo I from Leishmania donvanni. Topo
I enzymes are divided into: topo type IA (bacterial)
and type IB (eukaryotic). Topo I relaxes superhelical
tension in duplex DNA by creating a single-strand nick,
the broken strand can then rotate around the unbroken
strand to remove DNA supercoils and, the nick is
religated, liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit re-ligation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I play putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 215
Score = 29.5 bits (67), Expect = 0.52
Identities = 17/39 (43%), Positives = 25/39 (64%)
Query: 121 EKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLE 159
E+E+E++K +EE+K EE+EK EE Y + D H E
Sbjct: 95 EEEKEKKKAMSKEEKKAIKEEKEKLEEPYGYCLVDGHKE 133
Score = 27.2 bits (61), Expect = 3.4
Identities = 15/33 (45%), Positives = 23/33 (69%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEE 134
I + EE++E+++ KEE+K +EEKEK EE
Sbjct: 90 IYQYFEEEKEKKKAMSKEEKKAIKEEKEKLEEP 122
>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi. This family includes
the fifth essential DNA polymerase in yeast EC:2.7.7.7.
Pol5p is localised exclusively to the nucleolus and
binds near or at the enhancer region of rRNA-encoding
DNA repeating units.
Length = 784
Score = 30.2 bits (68), Expect = 0.54
Identities = 16/43 (37%), Positives = 25/43 (58%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEE 141
ED EE + +E+E E +++ E E E + E EE+E+E E
Sbjct: 650 EDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDDAE 692
Score = 29.8 bits (67), Expect = 0.58
Identities = 14/43 (32%), Positives = 29/43 (67%), Gaps = 1/43 (2%)
Query: 108 EKEEEEEEEVKEEEKEEEEEKEKKE-EEEKEKMEEEEKEEEET 149
E EEE+E++++E + +E+E + ++ E E E E+ +E+E+
Sbjct: 646 EGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQE 688
>gnl|CDD|224337 COG1419, FlhF, Flagellar GTP-binding protein [Cell motility and
secretion].
Length = 407
Score = 30.0 bits (68), Expect = 0.56
Identities = 16/57 (28%), Positives = 27/57 (47%), Gaps = 1/57 (1%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE-EKEEEETKYLVR 154
E + K+ EK+ EE + K+ E+ E+K +E K+ EE E +E +
Sbjct: 76 EAQLLKDPAEKKREERKAAKKIERSTPSLIERKTQEVKDSGEEIAEMMRDEKVPIRE 132
>gnl|CDD|233459 TIGR01541, tape_meas_lam_C, phage tail tape measure protein, lambda
family. This model represents a relatively
well-conserved region near the C-terminus of the tape
measure protein of a lambda and related phage. This
protein, which controls phage tail length, is typically
about 1000 residues in length. Both low-complexity
sequence and insertion/deletion events appear common in
this family. Mutational studies suggest a ruler or
template role in the determination of phage tail length.
Similar behavior is attributed to proteins from
distantly related or unrelated families in other phage
[Mobile and extrachromosomal element functions, Prophage
functions].
Length = 332
Score = 29.8 bits (67), Expect = 0.57
Identities = 12/66 (18%), Positives = 24/66 (36%), Gaps = 2/66 (3%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEE--EEEKEKKEEEEKEKMEEE 142
+ T++ + +E+ + +E+ K E EE E+K E +K+ E
Sbjct: 1 NQLLLLTQQIADRKLKKLNTADEKSLQSRSDEIIALIKLEKLLEEAERKALEALKKLAEA 60
Query: 143 EKEEEE 148
Sbjct: 61 TASIRA 66
>gnl|CDD|180536 PRK06347, PRK06347, autolysin; Reviewed.
Length = 592
Score = 30.0 bits (67), Expect = 0.57
Identities = 15/59 (25%), Positives = 25/59 (42%), Gaps = 4/59 (6%)
Query: 86 DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKK----EEEEKEKME 140
D+A ++ +A E E E E + E KE+ + E+K ++ EK E
Sbjct: 59 DEASKSAEANTTKEAPATATPENTTEPTVEPKQTETKEQTKTPEEKQPAAKQVEKAPAE 117
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 29.5 bits (67), Expect = 0.59
Identities = 23/58 (39%), Positives = 32/58 (55%), Gaps = 4/58 (6%)
Query: 98 YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
E IKK EEEKEE E+ + E K E EK EE+E+ EE++ +E +L +
Sbjct: 125 LEQEIKKLEEEKEELEKRVAELEAKLEAIEKR----EEEERQIEEKRHADEIAFLKKQ 178
Score = 26.4 bits (59), Expect = 6.6
Identities = 20/61 (32%), Positives = 34/61 (55%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLESCE 162
K + E+ + E E+ ++ +EE+EE EK+ E + K+E EK EEE + + H +
Sbjct: 114 KALQAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADEIA 173
Query: 163 F 163
F
Sbjct: 174 F 174
>gnl|CDD|235672 PRK06012, flhA, flagellar biosynthesis protein FlhA; Validated.
Length = 697
Score = 30.1 bits (69), Expect = 0.60
Identities = 11/25 (44%), Positives = 15/25 (60%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEE 127
K+E++ E EE +EE E EEE
Sbjct: 328 KREKKAAELAAEEAEEEEAAEPEEE 352
Score = 29.3 bits (67), Expect = 1.0
Identities = 11/28 (39%), Positives = 17/28 (60%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKE 129
++K E++ E EE +EEE E EE+
Sbjct: 326 LRKREKKAAELAAEEAEEEEAAEPEEES 353
>gnl|CDD|235229 PRK04156, gltX, glutamyl-tRNA synthetase; Provisional.
Length = 567
Score = 29.8 bits (68), Expect = 0.62
Identities = 12/28 (42%), Positives = 19/28 (67%)
Query: 110 EEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
EE+ E ++E E EE+E+K+EE+K
Sbjct: 63 LEEQRERLEELAPELLEEEEEKKEEKKG 90
Score = 27.1 bits (61), Expect = 5.5
Identities = 16/40 (40%), Positives = 25/40 (62%), Gaps = 1/40 (2%)
Query: 114 EEEVKEEEKEEEEEKEKK-EEEEKEKMEEEEKEEEETKYL 152
+E V+E EE+ ++ EE E +EEEE+++EE K L
Sbjct: 52 KEVVEEVNSLSLEEQRERLEELAPELLEEEEEKKEEKKGL 91
Score = 26.7 bits (60), Expect = 6.2
Identities = 13/39 (33%), Positives = 22/39 (56%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
EE++E EE + E+EEE+++EKK E+ +
Sbjct: 63 LEEQRERLEELAPELLEEEEEKKEEKKGLPPLPNAEKGK 101
>gnl|CDD|234715 PRK00290, dnaK, molecular chaperone DnaK; Provisional.
Length = 627
Score = 29.7 bits (68), Expect = 0.64
Identities = 17/60 (28%), Positives = 26/60 (43%), Gaps = 19/60 (31%)
Query: 110 EEEEEEEVKEEEKEEEEEKEKKEE-------------------EEKEKMEEEEKEEEETK 150
+EE E VK+ E EE+K++KE E +K+ +EKE+ E
Sbjct: 502 DEEIERMVKDAEANAEEDKKRKELVEARNQADSLIYQTEKTLKELGDKVPADEKEKIEAA 561
Score = 27.8 bits (63), Expect = 3.2
Identities = 19/68 (27%), Positives = 33/68 (48%), Gaps = 16/68 (23%)
Query: 97 LYEDNIKK--------EEEEKEEEEEEEVKEE--------EKEEEEEKEKKEEEEKEKME 140
L ++ I++ EE+K+ +E E + + EK +E +K +EKEK+E
Sbjct: 500 LSDEEIERMVKDAEANAEEDKKRKELVEARNQADSLIYQTEKTLKELGDKVPADEKEKIE 559
Query: 141 EEEKEEEE 148
KE +E
Sbjct: 560 AAIKELKE 567
>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890). This
family is conserved in dsDNA adenoviruses of
vertebrates. The function is not known.
Length = 172
Score = 29.5 bits (66), Expect = 0.64
Identities = 17/42 (40%), Positives = 30/42 (71%)
Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
K+EEE + + EEV+E+E+E E+ ++ +EE++E E EE+
Sbjct: 18 KDEEEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVEEET 59
Score = 26.0 bits (57), Expect = 7.8
Identities = 15/39 (38%), Positives = 25/39 (64%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
E++ + EE EE+EEE E+ +EE++E +E EE+
Sbjct: 21 EEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVEEET 59
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 28.9 bits (65), Expect = 0.68
Identities = 12/39 (30%), Positives = 29/39 (74%)
Query: 112 EEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
EE++ VK+ KE+ ++K++K++++K+K ++ K+ + K
Sbjct: 96 EEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134
>gnl|CDD|223626 COG0552, FtsY, Signal recognition particle GTPase [Intracellular
trafficking and secretion].
Length = 340
Score = 29.5 bits (67), Expect = 0.68
Identities = 15/36 (41%), Positives = 23/36 (63%)
Query: 111 EEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
E+++EE +E E EKEK EEE++E+ +E E
Sbjct: 1 MGFFEKLEEELEELEAEKEKIEEEDEEEEKEGWFER 36
Score = 29.1 bits (66), Expect = 1.1
Identities = 17/35 (48%), Positives = 24/35 (68%)
Query: 110 EEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
E+ EEE++E E E+E+ +E+ EEEEKE E K
Sbjct: 4 FEKLEEELEELEAEKEKIEEEDEEEEKEGWFERLK 38
Score = 28.8 bits (65), Expect = 1.2
Identities = 15/36 (41%), Positives = 24/36 (66%)
Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKM 139
E+ EEE EE E+EK EEE++E+++E E++
Sbjct: 2 GFFEKLEEELEELEAEKEKIEEEDEEEEKEGWFERL 37
Score = 28.4 bits (64), Expect = 1.8
Identities = 26/79 (32%), Positives = 36/79 (45%), Gaps = 22/79 (27%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEE----------------------EEKEKME 140
K EEE +E E E+E EEE EEEE++ E + KEK++
Sbjct: 6 KLEEELEELEAEKEKIEEEDEEEEKEGWFERLKQGLSKTKKNFGKGIKGLFLKKIKEKLD 65
Query: 141 EEEKEEEETKYLVRDVHLE 159
E+ EE E + DV +E
Sbjct: 66 EDLLEELEELLIEADVGVE 84
Score = 27.6 bits (62), Expect = 3.1
Identities = 12/36 (33%), Positives = 24/36 (66%)
Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKM 139
++ EE+ EE E E ++ E+E+EEE+++ E ++
Sbjct: 5 EKLEEELEELEAEKEKIEEEDEEEEKEGWFERLKQG 40
Score = 27.6 bits (62), Expect = 3.2
Identities = 14/30 (46%), Positives = 22/30 (73%)
Query: 119 EEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
E+ EE+ ++ E EKEK+EEE++EEE+
Sbjct: 1 MGFFEKLEEELEELEAEKEKIEEEDEEEEK 30
Score = 27.2 bits (61), Expect = 3.7
Identities = 14/35 (40%), Positives = 22/35 (62%)
Query: 110 EEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
E+ E + EE E E+EK ++E+EE+EK E+
Sbjct: 2 GFFEKLEEELEELEAEKEKIEEEDEEEEKEGWFER 36
>gnl|CDD|237001 PRK11856, PRK11856, branched-chain alpha-keto acid dehydrogenase
subunit E2; Reviewed.
Length = 411
Score = 29.8 bits (68), Expect = 0.71
Identities = 7/44 (15%), Positives = 7/44 (15%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
EE E E E E E
Sbjct: 78 EEGEAEAAAAAEAAPEAPAPEPAPAAAAAAAAAPAAAAAPAAPA 121
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 29.5 bits (67), Expect = 0.71
Identities = 21/73 (28%), Positives = 34/73 (46%), Gaps = 7/73 (9%)
Query: 84 LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
D+ +E +K G + E +EE E+ E KEE EE EK+ E E+++E
Sbjct: 190 SDEVEEELKKLGF-------ERLELEEEGTPSELIREIKEELEEIEKERESLLEELKELA 242
Query: 144 KEEEETKYLVRDV 156
K+ E + +
Sbjct: 243 KKYLEELLALYEY 255
Score = 29.1 bits (66), Expect = 1.2
Identities = 15/56 (26%), Positives = 26/56 (46%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHL 158
KEE E+ E+E E + EE KE ++ ++ E +E E + E ++
Sbjct: 220 IKEELEEIEKERESLLEELKELAKKYLEELLALYEYLEIELERAEALSKFLKTDKT 275
Score = 28.4 bits (64), Expect = 1.7
Identities = 18/63 (28%), Positives = 32/63 (50%), Gaps = 2/63 (3%)
Query: 92 EKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEE--EKEKKEEEEKEKMEEEEKEEEET 149
K L E+ K + EE ++ +E EK E+E E E++ E + +++E E+E E
Sbjct: 67 PKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIERL 126
Query: 150 KYL 152
+
Sbjct: 127 EPW 129
Score = 28.0 bits (63), Expect = 3.0
Identities = 27/135 (20%), Positives = 56/135 (41%), Gaps = 11/135 (8%)
Query: 7 FTIRGEEEEEEEEEEDFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDN 66
T+ ++ EE + E D E + + + Y V +E ++ + + F
Sbjct: 150 GTVPEDKLEELKLESDVENV-EYISTDKGYVYV---VVVVLKELSDEVEEELKKLGFERL 205
Query: 67 TLCLQLVEGKPNGLLCVLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEE 126
L EG P+ L+ + ++ +E EK +++ +E +E ++ EE+ + E
Sbjct: 206 EL---EEEGTPSELIREIKEELEEIEK----ERESLLEELKELAKKYLEELLALYEYLEI 258
Query: 127 EKEKKEEEEKEKMEE 141
E E+ E K +
Sbjct: 259 ELERAEALSKFLKTD 273
>gnl|CDD|219038 pfam06459, RR_TM4-6, Ryanodine Receptor TM 4-6. This region covers
TM regions 4-6 of the ryanodine receptor 1 family.
Length = 217
Score = 29.3 bits (66), Expect = 0.72
Identities = 19/51 (37%), Positives = 26/51 (50%), Gaps = 1/51 (1%)
Query: 105 EEEEKEEEEEEEVKEEEKEE-EEEKEKKEEEEKEKMEEEEKEEEETKYLVR 154
E E +E E E++E EE +E K E EK +E+ EK + YL R
Sbjct: 72 EVEPEEPPTPEGAPAEKRELKEEGEEPKSEPEKADLEDGEKRTKILNYLAR 122
>gnl|CDD|227447 COG5117, NOC3, Protein involved in the nuclear export of
pre-ribosomes [Translation, ribosomal structure and
biogenesis / Intracellular trafficking and secretion].
Length = 657
Score = 29.6 bits (66), Expect = 0.72
Identities = 12/47 (25%), Positives = 27/47 (57%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
E+ + + E E E + ++K+E+++ EE++ E ++ + EKE
Sbjct: 92 EEFLSESESEASLEIDSDIKDEKQKSLEEQKIAPEIPVKQQIDSEKE 138
Score = 28.1 bits (62), Expect = 2.2
Identities = 14/48 (29%), Positives = 24/48 (50%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
++ EE E E E E + + ++EK+K EE+K E K++
Sbjct: 85 VADVNNGEEFLSESESEASLEIDSDIKDEKQKSLEEQKIAPEIPVKQQ 132
Score = 26.9 bits (59), Expect = 6.1
Identities = 14/47 (29%), Positives = 20/47 (42%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
K EE + E E E E + ++EK+K EE+K E
Sbjct: 80 TKDNVVADVNNGEEFLSESESEASLEIDSDIKDEKQKSLEEQKIAPE 126
Score = 26.5 bits (58), Expect = 7.2
Identities = 10/41 (24%), Positives = 21/41 (51%)
Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
EE E E E E + +++++K EE++ E +++
Sbjct: 92 EEFLSESESEASLEIDSDIKDEKQKSLEEQKIAPEIPVKQQ 132
>gnl|CDD|240327 PTZ00243, PTZ00243, ABC transporter; Provisional.
Length = 1560
Score = 29.7 bits (67), Expect = 0.75
Identities = 5/41 (12%), Positives = 15/41 (36%)
Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
K+ +E + + E + + E +++ E +
Sbjct: 885 KDSKEGDADAEVAEVDAAPGGAVDHEPPVAKQEGNAEGGDG 925
Score = 29.7 bits (67), Expect = 0.86
Identities = 8/46 (17%), Positives = 13/46 (28%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
+ + E K+E E + + M EEK
Sbjct: 896 VAEVDAAPGGAVDHEPPVAKQEGNAEGGDGAALDAAAGRLMTREEK 941
>gnl|CDD|236267 PRK08451, PRK08451, DNA polymerase III subunits gamma and tau;
Validated.
Length = 535
Score = 29.6 bits (67), Expect = 0.79
Identities = 15/50 (30%), Positives = 25/50 (50%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
IK ++ K E + +E K E+K+ E E +EEE K+++E
Sbjct: 447 GAKIKIQKALKSAENPLQSLKEFKPSNEKKKIDTESTAEMLEEEAKKDDE 496
>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413). This is
a family of proteins conserved in fungi. The function is
not known.
Length = 436
Score = 29.4 bits (66), Expect = 0.79
Identities = 11/47 (23%), Positives = 17/47 (36%)
Query: 101 NIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
+ E EE+ + E K + + EEEE+ EE
Sbjct: 69 SKPTESSAASSEEKPAKPRKSAESTRSSHPKSKAPSTESEEEEEPEE 115
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 29.8 bits (67), Expect = 0.82
Identities = 15/47 (31%), Positives = 25/47 (53%)
Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
+ EE++EEE E V + ++ E E+E E E++E+ E K
Sbjct: 8 QANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAK 54
Score = 28.6 bits (64), Expect = 1.5
Identities = 15/44 (34%), Positives = 22/44 (50%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
EEE+ + EE EEE E + ++ E E+E E+EE
Sbjct: 1 EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEE 44
Score = 27.1 bits (60), Expect = 5.4
Identities = 12/44 (27%), Positives = 24/44 (54%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
+ EEE+EEE++ + + + E +++ E+EE +EE
Sbjct: 6 NTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEA 49
Score = 27.1 bits (60), Expect = 6.3
Identities = 14/60 (23%), Positives = 30/60 (50%), Gaps = 2/60 (3%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEK--EKKEEEEKEKMEEE 142
D Q ++ LYE ++ + ++ + ++ + + EE +E+EEKE++ EE
Sbjct: 763 DFQFFNVQRLTELYEKEVRYLMQAHQKGQLKDTIDVAEPEEPGDPLTAEEQEEKEQLLEE 822
>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
[Energy production and conversion].
Length = 470
Score = 29.4 bits (66), Expect = 0.85
Identities = 16/33 (48%), Positives = 19/33 (57%)
Query: 108 EKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKME 140
E+ EEEEE +EEE+EE E E EE E
Sbjct: 399 ERWAEEEEEEEEEEEEEAAEAEAPMEEPVPGFE 431
Score = 29.0 bits (65), Expect = 1.2
Identities = 17/42 (40%), Positives = 26/42 (61%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEE 141
D +++ +EK E EEE+EEEEE+E++ E + MEE
Sbjct: 384 DELREFLKEKGHPVVERWAEEEEEEEEEEEEEAAEAEAPMEE 425
Score = 27.9 bits (62), Expect = 3.2
Identities = 18/49 (36%), Positives = 29/49 (59%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
E++ +E +E +E+ E+ EEE+E++EEEE+E E E EE
Sbjct: 377 EEDATTIDELREFLKEKGHPVVERWAEEEEEEEEEEEEEAAEAEAPMEE 425
Score = 27.9 bits (62), Expect = 3.2
Identities = 15/25 (60%), Positives = 16/25 (64%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKE 129
EEEE+EEEEEEE E E EE
Sbjct: 404 EEEEEEEEEEEEAAEAEAPMEEPVP 428
Score = 27.5 bits (61), Expect = 3.3
Identities = 12/31 (38%), Positives = 19/31 (61%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKE 132
+++ EE+EEEEEEE +E + E +E
Sbjct: 398 VERWAEEEEEEEEEEEEEAAEAEAPMEEPVP 428
>gnl|CDD|215637 PLN03223, PLN03223, Polycystin cation channel protein; Provisional.
Length = 1634
Score = 29.5 bits (66), Expect = 0.85
Identities = 30/141 (21%), Positives = 62/141 (43%), Gaps = 23/141 (16%)
Query: 10 RGEEEEEEEEEEDFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIR--WRH--IEFSD 65
+GE +EEEEE E+ + VF Y +E ++ G++ R IE
Sbjct: 1473 KGENPDEEEEEAFREE--------------KEKVFTYLNKELDEAGLKRVLRRCVIETYQ 1518
Query: 66 NT-----LCLQLVEGKPNGLLCVLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEE 120
T L V G A E ++A + D + + ++++++E+ +V E+
Sbjct: 1519 KTDKSKVLLRLRVGNVGKGKKKETLATADEIDQAAHMLMDQVGQVPDDEDDDEDGDVLEK 1578
Query: 121 EKEEEEEKEKKEEEEKEKMEE 141
E ++ ++ ++ E + ++ E
Sbjct: 1579 EVDQLQQSLERLAEVQRELAE 1599
>gnl|CDD|178493 PLN02905, PLN02905, beta-amylase.
Length = 702
Score = 29.6 bits (66), Expect = 0.87
Identities = 12/29 (41%), Positives = 18/29 (62%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEK 130
I EE+ EEE + +VKEE+ + +EK
Sbjct: 9 IGTSEEDDEEEMDMDVKEEDDGDRRNREK 37
>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
(TAF4) is one of several TAFs that bind TBP and is
involved in forming Transcription Factor IID (TFIID)
complex. The TATA Binding Protein (TBP) Associated
Factor 4 (TAF4) is one of several TAFs that bind TBP and
are involved in forming the Transcription Factor IID
(TFIID) complex. TFIID is one of seven General
Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
TFIIF, and TFIID) that are involved in accurate
initiation of transcription by RNA polymerase II in
eukaryote. TFIID plays an important role in the
recognition of promoter DNA and assembly of the
pre-initiation complex. TFIID complex is composed of the
TBP and at least 13 TAFs. TAFs from various species were
originally named by their predicted molecular weight or
their electrophoretic mobility in polyacrylamide gels. A
new, unified nomenclature for the pol II TAFs has been
suggested to show the relationship between TAF orthologs
and paralogs. Several hypotheses are proposed for TAFs
functions such as serving as activator-binding sites,
core-promoter recognition or a role in essential
catalytic activity. Each TAF, with the help of a
specific activator, is required only for the expression
of subset of genes and is not universally involved for
transcription as are GTFs. In yeast and human cells,
TAFs have been found as components of other complexes
besides TFIID. Several TAFs interact via histone-fold
(HFD) motifs; HFD is the interaction motif involved in
heterodimerization of the core histones and their
assembly into nucleosome octamers. The minimal HFD
contains three alpha-helices linked by two loops and is
found in core histones, TAFS and many other
transcription factors. TFIID has a histone octamer-like
substructure. TAF4 domain interacts with TAF12 and makes
a novel histone-like heterodimer that binds DNA and has
a core promoter function of a subset of genes.
Length = 212
Score = 28.8 bits (65), Expect = 0.91
Identities = 17/49 (34%), Positives = 30/49 (61%), Gaps = 1/49 (2%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
+++EEEEK +EEE E + E+ + +++ KE M++EE EE +
Sbjct: 122 LEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKE-MQKEEDEEMRHR 169
Score = 26.9 bits (60), Expect = 4.1
Identities = 13/38 (34%), Positives = 24/38 (63%)
Query: 108 EKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
E+ E EEEE ++EE+ E + K E+ +++++ KE
Sbjct: 120 EQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKE 157
Score = 26.9 bits (60), Expect = 4.9
Identities = 13/32 (40%), Positives = 20/32 (62%)
Query: 121 EKEEEEEKEKKEEEEKEKMEEEEKEEEETKYL 152
E+ E EE+EK++EEE+E++ K E L
Sbjct: 120 EQLEREEEEKRDEEERERLLRAAKSRSEQSRL 151
Score = 26.1 bits (58), Expect = 8.2
Identities = 17/45 (37%), Positives = 29/45 (64%), Gaps = 1/45 (2%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEE-KEKMEEEEKEEEE 148
E+ E+EEEE+ + +E E+ K + E+ K+K +E +KEE+E
Sbjct: 120 EQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDE 164
>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32. This family
consists of several Mycoplasma species specific
Cytadhesin P32 and P30 proteins. P30 has been found to
be membrane associated and localised on the tip
organelle. It is thought that it is important in
cytadherence and virulence.
Length = 279
Score = 29.3 bits (65), Expect = 0.91
Identities = 12/45 (26%), Positives = 24/45 (53%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
+KE+ EE+E +E E+ + E+ +++ E + EE +E
Sbjct: 97 RKEKRLLEEKERQEQMAEQLQRISEQNEQQAIEIDPTEEVNTQEP 141
>gnl|CDD|240331 PTZ00254, PTZ00254, 40S ribosomal protein SA; Provisional.
Length = 249
Score = 29.3 bits (66), Expect = 0.91
Identities = 12/45 (26%), Positives = 17/45 (37%), Gaps = 3/45 (6%)
Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
++ EE EE+EE E E+ E +E E E
Sbjct: 205 RDPEEAEEKEEA---AAETAGVEDAAAAAAEAEEGEEWVAAANNE 246
Score = 27.3 bits (61), Expect = 3.4
Identities = 8/31 (25%), Positives = 12/31 (38%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKE 129
+ +++EE E E E EE E
Sbjct: 207 PEEAEEKEEAAAETAGVEDAAAAAAEAEEGE 237
>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
production and conversion].
Length = 660
Score = 29.3 bits (66), Expect = 0.92
Identities = 13/54 (24%), Positives = 21/54 (38%)
Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVH 157
+ E EEV E ++E E + E EK+ EE E + + +
Sbjct: 224 DGGPSELISELEEVIAEIQDELESLRSELEALAEKIAEELLAVREILEIEKALG 277
Score = 28.9 bits (65), Expect = 1.4
Identities = 19/70 (27%), Positives = 29/70 (41%), Gaps = 1/70 (1%)
Query: 87 QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
E + E EE K E+ EE + EE+ + +EE E +E+ +E
Sbjct: 81 FILPEEVEKLEAELK-SLEEVIKPAEKFSSEVEELTRKLEERLSELDEELEDLEDLLEEL 139
Query: 147 EETKYLVRDV 156
E YL D+
Sbjct: 140 EPLAYLDFDL 149
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 29.3 bits (66), Expect = 0.94
Identities = 19/55 (34%), Positives = 29/55 (52%), Gaps = 2/55 (3%)
Query: 98 YEDNIKKEEEEKEEEEEEEVKE--EEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
KKEEEE+EEE+E +K E EE++ + ++E+ E EE+ K
Sbjct: 176 LFRREKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDDEEDNDNTPSPK 230
Score = 26.6 bits (59), Expect = 6.7
Identities = 18/49 (36%), Positives = 27/49 (55%), Gaps = 6/49 (12%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEE------KEKKEEEEKEKMEEEEKEEE 147
E +K +EE+EE E+E+EEE E + + K +ME E+ EE
Sbjct: 108 YEADKLDEEQEERVEKEREEELAGDAMKKLENRTADSKREMEVLERLEE 156
>gnl|CDD|165442 PHA03171, PHA03171, UL37 tegument protein; Provisional.
Length = 499
Score = 29.3 bits (65), Expect = 0.98
Identities = 18/56 (32%), Positives = 31/56 (55%), Gaps = 4/56 (7%)
Query: 83 VLDDQAKETEKA----GVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEE 134
V+D + ETE+ L+ + ++ EEE + E + + EE++EEEE E + E
Sbjct: 60 VIDGENSETERPRDLTAALFAEAGEEAEEEDNDRECPDTEAEEEDEEEEIEAPDPE 115
Score = 28.1 bits (62), Expect = 2.5
Identities = 15/35 (42%), Positives = 20/35 (57%)
Query: 108 EKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
E EE EEE + E + E +E+ EEEE E + E
Sbjct: 81 EAGEEAEEEDNDRECPDTEAEEEDEEEEIEAPDPE 115
Score = 27.3 bits (60), Expect = 4.7
Identities = 13/30 (43%), Positives = 20/30 (66%)
Query: 116 EVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
E EE +EE+ ++E + E +E+ EEEE E
Sbjct: 81 EAGEEAEEEDNDRECPDTEAEEEDEEEEIE 110
Score = 26.6 bits (58), Expect = 8.3
Identities = 14/35 (40%), Positives = 19/35 (54%)
Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEE 141
E EE EEE+ E + E E+E +EEE + E
Sbjct: 81 EAGEEAEEEDNDRECPDTEAEEEDEEEEIEAPDPE 115
>gnl|CDD|237047 PRK12298, obgE, GTPase CgtA; Reviewed.
Length = 390
Score = 29.1 bits (66), Expect = 0.99
Identities = 13/50 (26%), Positives = 32/50 (64%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
E+N ++E EE E E+ E ++ E+ +E +EE++ + ++ +++++E
Sbjct: 333 EENPREEAEEAEAPEKVEFMWDDYHREQLEEVEEEDDDDWDDDWDEDDDE 382
Score = 26.4 bits (59), Expect = 9.1
Identities = 11/51 (21%), Positives = 27/51 (52%)
Query: 98 YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
+ + +EE E+ E E+ + E+ E+ EEE+ + +++ E+++
Sbjct: 331 FIEENPREEAEEAEAPEKVEFMWDDYHREQLEEVEEEDDDDWDDDWDEDDD 381
>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D. This
model represents one of two ATPase subunits of the
trimeric magnesium chelatase responsible for insertion
of magnesium ion into protoporphyrin IX. This is an
essential step in the biosynthesis of both chlorophyll
and bacteriochlorophyll. This subunit is found in green
plants, photosynthetic algae, cyanobacteria and other
photosynthetic bacteria. Unlike subunit I (TIGR02030),
this subunit is not found in archaea [Biosynthesis of
cofactors, prosthetic groups, and carriers, Chlorophyll
and bacteriochlorphyll].
Length = 589
Score = 29.4 bits (66), Expect = 1.00
Identities = 10/43 (23%), Positives = 20/43 (46%)
Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
E + + E E EEE ++ ++ + + EE ++ EE
Sbjct: 267 PEPEPQPPPPPPPPEPPEPEEEPDEPDQTDPDDGEETDQIPEE 309
Score = 27.8 bits (62), Expect = 3.2
Identities = 11/48 (22%), Positives = 17/48 (35%), Gaps = 3/48 (6%)
Query: 105 EEEEKEE---EEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
E + E E +EE E ++ EE + EE + E
Sbjct: 270 EPQPPPPPPPPEPPEPEEEPDEPDQTDPDDGEETDQIPEELMFDAVEA 317
>gnl|CDD|172358 PRK13831, PRK13831, conjugal transfer protein TrbI; Provisional.
Length = 432
Score = 29.3 bits (66), Expect = 1.0
Identities = 10/43 (23%), Positives = 23/43 (53%), Gaps = 3/43 (6%)
Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKK---EEEEKEKMEEEEKEE 146
E ++EE+ + + EE+ EEE + +++ E++E+
Sbjct: 97 ETQQEEKAVNPFTPQPGQREERRPTLESEEEWRARLKREQEEQ 139
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 29.1 bits (65), Expect = 1.0
Identities = 21/45 (46%), Positives = 33/45 (73%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
KK+E+ KEE ++ + KEE KE+ KEK++E+EK+ E ++EEE
Sbjct: 119 KKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEE 163
Score = 27.5 bits (61), Expect = 3.4
Identities = 17/65 (26%), Positives = 38/65 (58%)
Query: 86 DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
+ ++ G + KE + + +EEE+ KE+ KEE+++K++K +EE + + +E+
Sbjct: 78 KRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEA 137
Query: 146 EEETK 150
+E+
Sbjct: 138 KEKRP 142
Score = 26.4 bits (58), Expect = 9.8
Identities = 20/63 (31%), Positives = 38/63 (60%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
D+ K EK G K +E + E +E ++E+++ +EEK+KK+E+ KE+ ++ +
Sbjct: 74 DEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKP 133
Query: 145 EEE 147
+EE
Sbjct: 134 KEE 136
>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 187 and 201 amino
acids in length. There is a single completely conserved
residue Q that may be functionally important.
Length = 158
Score = 28.7 bits (65), Expect = 1.0
Identities = 15/64 (23%), Positives = 29/64 (45%), Gaps = 5/64 (7%)
Query: 87 QAKETEKAGVLYE---DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
QAK + K L E D ++ E E + ++ +KE+ + + +E E + E E
Sbjct: 45 QAK-SIK-QHLAELAADLLELEAAAPRAELQAKIARYKKEKARYRSEAKELEAKAKEAEA 102
Query: 144 KEEE 147
+ +
Sbjct: 103 ESDH 106
Score = 27.2 bits (61), Expect = 3.4
Identities = 8/41 (19%), Positives = 18/41 (43%), Gaps = 2/41 (4%)
Query: 111 EEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
+ + K+E+ E ++ E + KE E E + ++
Sbjct: 73 QAKIARYKKEKARYRSEAKELEAKAKEA--EAESDHALHQH 111
>gnl|CDD|237431 PRK13570, PRK13570, anthranilate synthase component I; Provisional.
Length = 455
Score = 29.2 bits (66), Expect = 1.1
Identities = 16/62 (25%), Positives = 27/62 (43%), Gaps = 2/62 (3%)
Query: 96 VLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE--EETKYLV 153
Y D ++E E+ EE+K+ + E E E + K + +EE E+ K +
Sbjct: 149 NRYSDRSEEELEKALNVVLEELKQPAEAEHELIELSKLSFKSNITKEEFCGMVEKAKEYI 208
Query: 154 RD 155
R
Sbjct: 209 RA 210
>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein. Rotavirus particles
consist of three concentric proteinaceous capsid layers.
The innermost capsid (core) is made of VP2. The genomic
RNA and the two minor proteins VP1 and VP3 are
encapsidated within this layer. The N-terminus of
rotavirus VP2 is necessary for the encapsidation of VP1
and VP3.
Length = 887
Score = 29.1 bits (65), Expect = 1.1
Identities = 10/54 (18%), Positives = 27/54 (50%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
+ E + ++E++ E++++K + E +EK ++EE + V++
Sbjct: 2 AYRNRREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKE 55
Score = 29.1 bits (65), Expect = 1.1
Identities = 15/67 (22%), Positives = 33/67 (49%)
Query: 87 QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
+ EK + + E +EK +++EEV + + +++ +E K E ++ +
Sbjct: 14 NDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKIADEVKKSTK 73
Query: 147 EETKYLV 153
EE+K L+
Sbjct: 74 EESKQLL 80
Score = 28.4 bits (63), Expect = 2.2
Identities = 14/58 (24%), Positives = 31/58 (53%), Gaps = 1/58 (1%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEE-KEKKEEEEKEKMEEEEKEEEETKYLVRD 155
E NI + +E+++E++ ++ E +E+ +KKEE + ++ KE+ + L
Sbjct: 8 EANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKIA 65
>gnl|CDD|184416 PRK13955, mscL, large-conductance mechanosensitive channel;
Provisional.
Length = 130
Score = 28.2 bits (63), Expect = 1.1
Identities = 14/28 (50%), Positives = 17/28 (60%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKE 129
K +KEEE+EEE+ E KEEE E
Sbjct: 90 FNKLTSKKEEEKEEEIPEPTKEEELLGE 117
Score = 27.5 bits (61), Expect = 2.3
Identities = 11/20 (55%), Positives = 15/20 (75%)
Query: 128 KEKKEEEEKEKMEEEEKEEE 147
KKEEE++E++ E KEEE
Sbjct: 94 TSKKEEEKEEEIPEPTKEEE 113
Score = 25.9 bits (57), Expect = 8.2
Identities = 12/17 (70%), Positives = 13/17 (76%)
Query: 118 KEEEKEEEEEKEKKEEE 134
KEEEKEEE + KEEE
Sbjct: 97 KEEEKEEEIPEPTKEEE 113
Score = 25.9 bits (57), Expect = 8.2
Identities = 13/35 (37%), Positives = 20/35 (57%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
KKEEE++EE E +EE E + K++ K+
Sbjct: 96 KKEEEKEEEIPEPTKEEELLGEIRDLLKQQNSSKD 130
>gnl|CDD|224217 COG1298, FlhA, Flagellar biosynthesis pathway, component FlhA [Cell
motility and secretion / Intracellular trafficking and
secretion].
Length = 696
Score = 29.1 bits (66), Expect = 1.2
Identities = 12/31 (38%), Positives = 23/31 (74%)
Query: 118 KEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
K +E++ EE+++ EE +KE+ EEEE+ ++
Sbjct: 325 KRKEQQAEEKEKPAEEAKKEEEEEEEESVDD 355
Score = 28.4 bits (64), Expect = 1.9
Identities = 15/38 (39%), Positives = 24/38 (63%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKME 140
+++ EEKE+ EE KEEE+EEEE + + ++E
Sbjct: 328 EQQAEEKEKPAEEAKKEEEEEEEESVDDVLLIDPIELE 365
Score = 28.0 bits (63), Expect = 2.5
Identities = 16/51 (31%), Positives = 29/51 (56%), Gaps = 4/51 (7%)
Query: 101 NIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
+ K +E++ EE+E+ +E +KEE E++EEE + + + E E Y
Sbjct: 322 LLSKRKEQQAEEKEKPAEEAKKEE----EEEEEESVDDVLLIDPIELELGY 368
>gnl|CDD|218744 pfam05781, MRVI1, MRVI1 protein. This family consists of mammalian
MRVI1 proteins which are related to the
lymphoid-restricted membrane protein (JAW1) and the IP3
receptor associated cGMP kinase substrates A and B
(IRAGA and IRAGB). The function of MRVI1 is unknown
although mutations in the Mrvi1 gene induces myeloid
leukaemia by altering the expression of a gene important
for myeloid cell growth and/or differentiation so it has
been speculated that Mrvi1 is a tumour suppressor gene.
IRAG is very similar in sequence to MRVI1 and is an
essential NO/cGKI-dependent regulator of IP3-induced
calcium release. Activation of cGKI decreases
IP3-stimulated elevations in intracellular calcium,
induces smooth muscle relaxation and contributes to the
antiproliferative and pro-apoptotic effects of NO/cGMP.
Jaw1 is a member of a class of proteins with
COOH-terminal hydrophobic membrane anchors and is
structurally similar to proteins involved in vesicle
targeting and fusion. This suggests that the function
and/or the structure of the ER in lymphocytes may be
modified by lymphoid-restricted resident ER proteins.
Length = 538
Score = 28.8 bits (64), Expect = 1.2
Identities = 15/43 (34%), Positives = 28/43 (65%)
Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
EEEK+ ++ ++++E E EE E+ +K +E E EE ++E+
Sbjct: 415 EEEKKTKKLQDLREPEGEEAVERTRKPSLSEEVAETEEWDKEQ 457
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal. This
domain is found to the N-terminus of bacterial signal
peptidases of the S49 family (pfam01343).
Length = 154
Score = 28.3 bits (64), Expect = 1.2
Identities = 15/45 (33%), Positives = 28/45 (62%)
Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
++ KE E + ++E + E+ EKK E+ K K E+++ ++EE K
Sbjct: 52 KDLKESLEAALLDKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPK 96
>gnl|CDD|236154 PRK08119, PRK08119, flagellar motor switch protein; Validated.
Length = 382
Score = 28.7 bits (65), Expect = 1.3
Identities = 14/48 (29%), Positives = 24/48 (50%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
+EEEE+EE EEEE + E + ++E+ + + +E K
Sbjct: 225 GEEEEEEEEVEEEEAQASPAAEPATAQAAPAPKQEQQQAPPQRQEPEK 272
Score = 28.3 bits (64), Expect = 1.7
Identities = 15/60 (25%), Positives = 30/60 (50%)
Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLESCEF 163
EEEE+EEE EEE + E + K++ ++ + +E + + V+++ +F
Sbjct: 225 GEEEEEEEEVEEEEAQASPAAEPATAQAAPAPKQEQQQAPPQRQEPEKEAQPVNVQPAQF 284
>gnl|CDD|223945 COG1013, PorB, Pyruvate:ferredoxin oxidoreductase and related
2-oxoacid:ferredoxin oxidoreductases, beta subunit
[Energy production and conversion].
Length = 294
Score = 28.8 bits (65), Expect = 1.3
Identities = 14/68 (20%), Positives = 26/68 (38%), Gaps = 15/68 (22%)
Query: 96 VLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEE-----------EEKEKMEEEEK 144
LY E + EEE+ +++ EE + EE ++ ++ ++
Sbjct: 230 PLYRY----EPGKAEEEKGKDIPIGIFYPVEEYLEYEERFKHLTKSNPELIQKLQKDVDE 285
Query: 145 EEEETKYL 152
EE K L
Sbjct: 286 RWEELKRL 293
>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen. This
family consists of several Theileria P67 surface
antigens. A stage specific surface antigen of Theileria
parva, p67, is the basis for the development of an
anti-sporozoite vaccine for the control of East Coast
fever (ECF) in cattle. The antigen has been shown to
contain five distinct linear peptide sequences
recognised by sporozoite-neutralising murine monoclonal
antibodies.
Length = 727
Score = 28.9 bits (64), Expect = 1.3
Identities = 9/40 (22%), Positives = 22/40 (55%)
Query: 98 YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
+DN ++ ++ K + + E +++ EEE+ K ++ K
Sbjct: 99 VQDNTEQNQDTKGSKTDSEEDDDDSEEEDNKSTSSKDGKG 138
>gnl|CDD|227084 COG4741, COG4741, Predicted secreted endonuclease distantly related
to archaeal Holliday junction resolvase [Nucleotide
transport and metabolism].
Length = 175
Score = 28.2 bits (63), Expect = 1.3
Identities = 19/54 (35%), Positives = 30/54 (55%), Gaps = 2/54 (3%)
Query: 96 VLYEDNIKKEEEEKEEEEEEEVKEEEKEEE--EEKEKKEEEEKEKMEEEEKEEE 147
Y +++ + E K E EE +++ E+E E + +KEEE K K E+K EE
Sbjct: 21 RAYIRSLQGKVESKARELEETLQKAERERLVNEAQARKEEEWKLKEWIEKKIEE 74
Score = 28.2 bits (63), Expect = 1.5
Identities = 16/39 (41%), Positives = 24/39 (61%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
E+ ++K E E+ E + KEEE + +E EKK EE +E
Sbjct: 39 EETLQKAERERLVNEAQARKEEEWKLKEWIEKKIEEARE 77
>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
mRNA processing and transport [Signal transduction
mechanisms / RNA processing and modification].
Length = 388
Score = 28.7 bits (64), Expect = 1.4
Identities = 19/72 (26%), Positives = 34/72 (47%)
Query: 81 LCVLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKME 140
L ++ KE G YED + E +++E + +E EE E++ + +E E++
Sbjct: 308 LERNGNRIKELADFGDYYEDIFEVVEVVEKQEGDVVTEESTDEESEDEVEIDESVIEEVA 367
Query: 141 EEEKEEEETKYL 152
E E E + L
Sbjct: 368 EMELLEVQVDDL 379
Score = 27.6 bits (61), Expect = 3.3
Identities = 14/47 (29%), Positives = 23/47 (48%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
D + +E ++E E+E E+ E EE E E E + + E + E
Sbjct: 341 DVVTEESTDEESEDEVEIDESVIEEVAEMELLEVQVDDLAERLAETE 387
Score = 27.2 bits (60), Expect = 4.5
Identities = 15/52 (28%), Positives = 24/52 (46%)
Query: 92 EKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
EK +EE ++E E +E EE E E E + ++ E++ E E
Sbjct: 336 EKQEGDVVTEESTDEESEDEVEIDESVIEEVAEMELLEVQVDDLAERLAETE 387
>gnl|CDD|233990 TIGR02740, TraF-like, TraF-like protein. This protein is related
to the F-type conjugation system pilus assembly proteins
TraF (TIGR02739)and TrbB (TIGR02738) both of which
exhibit a thioredoxin fold. The protein represented by
this model has the same length and architecture as TraF,
but lacks the CXXC-motif found in TrbB and believed to
be responsible for the disulfide isomerase activity of
that protein.
Length = 271
Score = 28.5 bits (64), Expect = 1.5
Identities = 14/53 (26%), Positives = 21/53 (39%)
Query: 89 KETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEE 141
K+ ++ YED KEEEE E ++ EE +E + E
Sbjct: 36 KDEKRGWFWYEDPPPKEEEEPLEPAPPAPPAQQPAREELAPFSVAWLRENLPE 88
>gnl|CDD|236410 PRK09202, nusA, transcription elongation factor NusA; Validated.
Length = 470
Score = 28.7 bits (65), Expect = 1.5
Identities = 15/64 (23%), Positives = 26/64 (40%)
Query: 86 DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
+E V I+ +EE EE E KE + E +E+K ++ +E ++E
Sbjct: 378 SSLEELAYVPVEELLEIEGFDEETVEELRERAKEALETEALAQEEKLADDLLSLEGLDRE 437
Query: 146 EEET 149
Sbjct: 438 LAFK 441
Score = 27.1 bits (61), Expect = 4.7
Identities = 21/63 (33%), Positives = 29/63 (46%), Gaps = 4/63 (6%)
Query: 94 AGVLYEDNIKKEEE----EKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
A +L E+ EE EE E E +EE EE + KE E E + +EEK ++
Sbjct: 369 AQLLVEEGFSSLEELAYVPVEELLEIEGFDEETVEELRERAKEALETEALAQEEKLADDL 428
Query: 150 KYL 152
L
Sbjct: 429 LSL 431
Score = 26.4 bits (59), Expect = 7.9
Identities = 8/47 (17%), Positives = 24/47 (51%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
++ ++ E +EE+ +++ E + E K E+ + +E+ ++
Sbjct: 410 KEALETEALAQEEKLADDLLSLEGLDRELAFKLAEKGIKTLEDLAEQ 456
>gnl|CDD|215094 PLN00188, PLN00188, enhanced disease resistance protein (EDR2);
Provisional.
Length = 719
Score = 28.6 bits (64), Expect = 1.5
Identities = 13/41 (31%), Positives = 25/41 (60%), Gaps = 3/41 (7%)
Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
+E+ +++EE ++ E E+E E K E + MEEE +++
Sbjct: 442 DEDSDDDEEFQIPESEQEPE---TTKNETKDTAMEEEPQDK 479
>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB proteins similar to the
monomeric yeast and human topo I. Topo I enzymes are
divided into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. This family may represent more than
one structural domain.
Length = 215
Score = 28.5 bits (64), Expect = 1.5
Identities = 15/39 (38%), Positives = 24/39 (61%)
Query: 121 EKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLE 159
+ ++EE+K +EE+K E+EK EEE + + D H E
Sbjct: 95 KAQKEEKKAMSKEEKKAIKAEKEKLEEEYGFCILDGHKE 133
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 28.4 bits (64), Expect = 1.5
Identities = 19/48 (39%), Positives = 31/48 (64%)
Query: 98 YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
E ++K+ E + EEE E+K++ E+ EKEK+ EE ++ + EE KE
Sbjct: 299 GEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKE 346
Score = 28.1 bits (63), Expect = 2.2
Identities = 19/71 (26%), Positives = 39/71 (54%), Gaps = 3/71 (4%)
Query: 86 DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE---KMEEE 142
+ ++E + +EE E +++ E+++E+EK EE ++ EE KE K+EE
Sbjct: 295 LKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELNSKLEEI 354
Query: 143 EKEEEETKYLV 153
+K+ E+ + +
Sbjct: 355 QKKLEDLEKRL 365
Score = 27.3 bits (61), Expect = 3.6
Identities = 15/43 (34%), Positives = 27/43 (62%), Gaps = 2/43 (4%)
Query: 104 KEEEEKEEEEEEEVKEE--EKEEEEEKEKKEEEEKEKMEEEEK 144
+E+E++ EE E+ + EE E + E+ +K+ E+ EK E+ K
Sbjct: 327 REKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLEKRLEKLK 369
>gnl|CDD|191675 pfam07047, OPA3, Optic atrophy 3 protein (OPA3). This family
consists of several optic atrophy 3 (OPA3) proteins.
OPA3 deficiency causes type III 3-methylglutaconic
aciduria (MGA) in humans. This disease manifests with
early bilateral optic atrophy, spasticity,
extrapyramidal dysfunction, ataxia, and cognitive
deficits, but normal longevity.
Length = 134
Score = 28.0 bits (63), Expect = 1.6
Identities = 14/30 (46%), Positives = 19/30 (63%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKE 132
KKEEE ++E EE E + E E E E++K
Sbjct: 105 KKEEELQQELEELEARVGELELEIERQKAR 134
Score = 26.5 bits (59), Expect = 5.5
Identities = 13/43 (30%), Positives = 24/43 (55%)
Query: 96 VLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
++YE + +E K+EEE ++ EE + E E + E +K +
Sbjct: 92 LVYEYQRQSRKEAKKEEELQQELEELEARVGELELEIERQKAR 134
>gnl|CDD|237869 PRK14962, PRK14962, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 472
Score = 28.6 bits (64), Expect = 1.6
Identities = 14/51 (27%), Positives = 31/51 (60%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYL 152
+E EE+ + ++KE+++E+ K +EE++E +E E++ +E + L
Sbjct: 336 SPNVQENDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRFKELMEEL 386
>gnl|CDD|237192 PRK12758, PRK12758, DNA topoisomerase IV subunit A; Provisional.
Length = 869
Score = 28.4 bits (64), Expect = 1.7
Identities = 13/43 (30%), Positives = 21/43 (48%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
I + E EE EEE EE +E + E K E++ ++ +
Sbjct: 823 KTINELEPLPYEEPEEETAEEPEEVDPEDVKSEDDIDDEDTGQ 865
>gnl|CDD|240253 PTZ00068, PTZ00068, 60S ribosomal protein L13a; Provisional.
Length = 202
Score = 28.1 bits (63), Expect = 1.7
Identities = 17/71 (23%), Positives = 30/71 (42%), Gaps = 4/71 (5%)
Query: 76 KPNGLLCVLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEE 135
KP VL D + G Y D + K EE+++E K++ K + KE + ++
Sbjct: 129 KPERPYTVLGDLSAHV---GWKYADVVAKLEEKRKERAAAYYKKKVKLRKAWKEAR-KKA 184
Query: 136 KEKMEEEEKEE 146
K+ +
Sbjct: 185 LAKLPKAIVAV 195
>gnl|CDD|224739 COG1826, TatA, Sec-independent protein secretion pathway components
[Intracellular trafficking and secretion].
Length = 94
Score = 27.3 bits (61), Expect = 1.7
Identities = 13/49 (26%), Positives = 26/49 (53%), Gaps = 2/49 (4%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
KK + + E +EE+K EE ++++++ E + EE ++ E K
Sbjct: 39 FKKAASDVKNELDEELKLEELDDKKKELT--AELQATKEELDQLASELK 85
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This
family of bacterial proteins has no known function. The
proteins are in the region of 500-600 amino acid
residues in length.
Length = 450
Score = 28.4 bits (64), Expect = 1.7
Identities = 20/76 (26%), Positives = 40/76 (52%)
Query: 84 LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
L+++ KE E+ + +KK ++ + E +E+ ++ E KEK EE +E EE +
Sbjct: 339 LEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKEKLSEELEELEEELK 398
Query: 144 KEEEETKYLVRDVHLE 159
+ +EE + L + +
Sbjct: 399 ELKEELESLYSEGKIS 414
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 27.2 bits (61), Expect = 1.9
Identities = 14/29 (48%), Positives = 18/29 (62%)
Query: 112 EEEEEVKEEEKEEEEEKEKKEEEEKEKME 140
EE+KEEEEE+E+KEE E+E
Sbjct: 71 AAAAAAAEEKKEEEEEEEEKEESEEEAAA 99
Score = 26.5 bits (59), Expect = 3.3
Identities = 14/25 (56%), Positives = 17/25 (68%)
Query: 110 EEEEEEEVKEEEKEEEEEKEKKEEE 134
E K+EE+EEEEEKE+ EEE
Sbjct: 72 AAAAAAEEKKEEEEEEEEKEESEEE 96
Score = 26.5 bits (59), Expect = 3.5
Identities = 11/25 (44%), Positives = 17/25 (68%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKE 129
EE++EE +EEE++EE E+E
Sbjct: 72 AAAAAAEEKKEEEEEEEEKEESEEE 96
Score = 26.1 bits (58), Expect = 4.4
Identities = 12/24 (50%), Positives = 18/24 (75%)
Query: 119 EEEKEEEEEKEKKEEEEKEKMEEE 142
EE+++E++EEEEKE+ EEE
Sbjct: 73 AAAAAEEKKEEEEEEEEKEESEEE 96
Score = 26.1 bits (58), Expect = 4.8
Identities = 11/25 (44%), Positives = 17/25 (68%)
Query: 119 EEEKEEEEEKEKKEEEEKEKMEEEE 143
EE+KE++EEEE+++ EEE
Sbjct: 72 AAAAAAEEKKEEEEEEEEKEESEEE 96
Score = 26.1 bits (58), Expect = 5.3
Identities = 14/22 (63%), Positives = 17/22 (77%)
Query: 104 KEEEEKEEEEEEEVKEEEKEEE 125
EEK+EEEEEE ++EE EEE
Sbjct: 75 AAAEEKKEEEEEEEEKEESEEE 96
Score = 25.3 bits (56), Expect = 9.0
Identities = 11/25 (44%), Positives = 17/25 (68%)
Query: 111 EEEEEEVKEEEKEEEEEKEKKEEEE 135
+E+++EEEEE+EK+E EE
Sbjct: 71 AAAAAAAEEKKEEEEEEEEKEESEE 95
Score = 25.3 bits (56), Expect = 9.4
Identities = 9/26 (34%), Positives = 17/26 (65%)
Query: 110 EEEEEEEVKEEEKEEEEEKEKKEEEE 135
+++E+EEEEE++++ EEE
Sbjct: 71 AAAAAAAEEKKEEEEEEEEKEESEEE 96
>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812). This is
a family of fungal proteins whose function is not known.
Length = 126
Score = 27.6 bits (62), Expect = 1.9
Identities = 12/36 (33%), Positives = 25/36 (69%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
++ E ++ +EE+++ EEE + + E+ K+ E EK+K
Sbjct: 91 ERAEAQRARDEEKKLDEEEAKRQHEEAKEREREKKK 126
Score = 27.2 bits (61), Expect = 2.8
Identities = 13/33 (39%), Positives = 20/33 (60%)
Query: 119 EEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
E ++ +EEK+ EEE K + EE ++ E E K
Sbjct: 94 EAQRARDEEKKLDEEEAKRQHEEAKEREREKKK 126
>gnl|CDD|224403 COG1486, CelF, Alpha-galactosidases/6-phospho-beta-glucosidases,
family 4 of glycosyl hydrolases [Carbohydrate transport
and metabolism].
Length = 442
Score = 28.4 bits (64), Expect = 2.1
Identities = 14/50 (28%), Positives = 27/50 (54%), Gaps = 3/50 (6%)
Query: 97 LYEDNIKKEEEEKEEEEE---EEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
Y ++K+++ ++ E+ EV + EKE E +K E +EK + E+
Sbjct: 262 YYPYDMKEKKPDEIEKFGTRANEVMKREKELFELYKKPELKEKPEELEKR 311
>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
Length = 925
Score = 28.5 bits (64), Expect = 2.1
Identities = 14/68 (20%), Positives = 36/68 (52%)
Query: 83 VLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
+++ + + + + E+ + + EE + EE++ ++E + E + ++ EEE + + E
Sbjct: 811 IIESEEERLKALRIQREEMLMRPEELELINEEQKNLKQEIKLELSEIQEAEEEIQNINEN 870
Query: 143 EKEEEETK 150
+ E E K
Sbjct: 871 KNEFVEFK 878
Score = 27.7 bits (62), Expect = 3.6
Identities = 15/74 (20%), Positives = 35/74 (47%), Gaps = 7/74 (9%)
Query: 84 LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK----- 138
++++ K ++ L I++ EEE + E + + E + + +K K K+
Sbjct: 839 INEEQKNLKQEIKLELSEIQEAEEEIQNINENKNEFVEFKNDPKKLNKLIIAKDVLIKLV 898
Query: 139 --MEEEEKEEEETK 150
+E +++E+ TK
Sbjct: 899 ISSDEIKQDEKTTK 912
>gnl|CDD|224143 COG1222, RPT1, ATP-dependent 26S proteasome regulatory subunit
[Posttranslational modification, protein turnover,
chaperones].
Length = 406
Score = 28.0 bits (63), Expect = 2.1
Identities = 11/59 (18%), Positives = 28/59 (47%)
Query: 95 GVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLV 153
N ++ + K E+E+ + E++ E + + + E +++ EE + +E +V
Sbjct: 17 EPQEYLNKLEDTKLKLLEKEKRLLLLEEQRLEAEGLRLKREVDRLREEIERLKEPPLIV 75
Score = 26.8 bits (60), Expect = 6.2
Identities = 14/54 (25%), Positives = 23/54 (42%)
Query: 98 YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKY 151
D I + E E +E E+ K + EKEK+ +E+ E E + +
Sbjct: 5 ALDEILGDLESYEPQEYLNKLEDTKLKLLEKEKRLLLLEEQRLEAEGLRLKREV 58
>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
(DUF874). This family consists of several hypothetical
proteins specific to Helicobacter pylori. The function
of this family is unknown.
Length = 417
Score = 28.3 bits (62), Expect = 2.2
Identities = 18/49 (36%), Positives = 34/49 (69%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
I+ E+EE++ E+E++ E+E E + K E+EK+K E+E+++ E+ K
Sbjct: 149 IELEQEEQKTEQEKQKTEKEGIELANSQIKAEQEKQKTEQEKQKTEQEK 197
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 28.2 bits (63), Expect = 2.2
Identities = 22/66 (33%), Positives = 41/66 (62%), Gaps = 3/66 (4%)
Query: 90 ETEKAGVLYEDNIKKEEEEKEEEE--EEEVKEEEKEEEE-EKEKKEEEEKEKMEEEEKEE 146
E+ VL E E+E +++ E E+ +KE+EK ++E E+E +E +E+E+ ++ E E+
Sbjct: 509 FKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEK 568
Query: 147 EETKYL 152
E + L
Sbjct: 569 EAQEAL 574
Score = 26.3 bits (58), Expect = 9.8
Identities = 21/63 (33%), Positives = 35/63 (55%), Gaps = 11/63 (17%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEE-----------EKEKKEEEEKEKMEEEEKEEEETKY 151
+KE E+K E E+ +KE+EK ++E E+ KK E EKE E + ++E +
Sbjct: 524 EKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKEVES 583
Query: 152 LVR 154
++R
Sbjct: 584 IIR 586
Score = 26.3 bits (58), Expect = 10.0
Identities = 23/74 (31%), Positives = 37/74 (50%), Gaps = 3/74 (4%)
Query: 84 LDDQAKETEKAGVLYEDNIKKEEE---EKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKME 140
L+ KE EK E +++ +E K+ E E+E +E K ++E E E KEK
Sbjct: 534 LEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKEVESIIRELKEKKI 593
Query: 141 EEEKEEEETKYLVR 154
+ KE + + LV+
Sbjct: 594 HKAKEIKSIEDLVK 607
>gnl|CDD|223616 COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ
chaperones [Posttranslational modification, protein
turnover, chaperones].
Length = 786
Score = 28.4 bits (64), Expect = 2.2
Identities = 12/49 (24%), Positives = 31/49 (63%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
D +++E + E E+E +E++++E++ ++ + ++ ++ E EKE E
Sbjct: 404 DELERELAQLEIEKEALEREQDEKEKKLIDEIIKLKEGRIPELEKELEA 452
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 28.3 bits (63), Expect = 2.2
Identities = 7/53 (13%), Positives = 19/53 (35%), Gaps = 1/53 (1%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEK-KEEEEKEKMEEEEKEEEETK 150
+ E+ + + ++E+ ++ + E E + +E M K
Sbjct: 332 IKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVK 384
Score = 27.9 bits (62), Expect = 2.9
Identities = 12/63 (19%), Positives = 23/63 (36%), Gaps = 7/63 (11%)
Query: 87 QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
K KA KKE +K E E+ E ++E + + ++ ++K
Sbjct: 338 AEKNEAKAR-------KKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKKKGL 390
Query: 147 EET 149
+
Sbjct: 391 IDA 393
>gnl|CDD|236335 PRK08724, fliD, flagellar capping protein; Validated.
Length = 673
Score = 28.3 bits (63), Expect = 2.2
Identities = 12/47 (25%), Positives = 26/47 (55%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
+ E E+K +E+ ++ ++ E E+ ++ + K+E EE+E E
Sbjct: 364 AQAEIEQKLAQEKAQLDAAVEKGELTPEQAKQIARAKLEPEERERLE 410
>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987). A
family of uncharacterized proteins found by clustering
human gut metagenomic sequences.
Length = 379
Score = 28.0 bits (63), Expect = 2.3
Identities = 20/44 (45%), Positives = 27/44 (61%)
Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
EE EEE EE +EE KE E EKE E E+K ++ +K ++ K
Sbjct: 64 EEIEEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGK 107
Score = 27.2 bits (61), Expect = 4.2
Identities = 21/53 (39%), Positives = 32/53 (60%), Gaps = 6/53 (11%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKE------EEEEKEKKEEEEKEKMEEEEKEEEE 148
+++ EEE EE EEE+KE E E E++ EKK ++ +K ++EE EE
Sbjct: 63 LEEIEEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEE 115
Score = 27.2 bits (61), Expect = 4.3
Identities = 18/53 (33%), Positives = 29/53 (54%)
Query: 98 YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
YE+ +K+ E EKE E E+ E+K ++ K+ K+EE + E + EE
Sbjct: 74 YEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEEPEP 126
>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
represents the eukaryotic large ribosomal protein P1.
Eukaryotic P1 and P2 are functionally equivalent to the
bacterial protein L7/L12, but are not homologous to
L7/L12. P1 is located in the L12 stalk, with proteins
P2, P0, L11, and 28S rRNA. P1 and P2 are the only
proteins in the ribosome to occur as multimers, always
appearing as sets of heterodimers. Recent data indicate
that eukaryotes have four copies (two heterodimers),
while most archaeal species contain six copies of L12p
(three homodimers) and bacteria may have four or six
copies (two or three homodimers), depending on the
species. Experiments using S. cerevisiae P1 and P2
indicate that P1 proteins are positioned more internally
with limited reactivity in the C-terminal domains, while
P2 proteins seem to be more externally located and are
more likely to interact with other cellular components.
In lower eukaryotes, P1 and P2 are further subdivided
into P1A, P1B, P2A, and P2B, which form P1A/P2B and
P1B/P2A heterodimers. Some plant species have a third
P-protein, called P3, which is not homologous to P1 and
P2. In humans, P1 and P2 are strongly autoimmunogenic.
They play a significant role in the etiology and
pathogenesis of systemic lupus erythema (SLE). In
addition, the ribosome-inactivating protein
trichosanthin (TCS) interacts with human P0, P1, and P2,
with its primary binding site located in the C-terminal
region of P2. TCS inactivates the ribosome by
depurinating a specific adenine in the sarcin-ricin loop
of 28S rRNA.
Length = 103
Score = 26.9 bits (60), Expect = 2.3
Identities = 9/18 (50%), Positives = 14/18 (77%)
Query: 110 EEEEEEEVKEEEKEEEEE 127
E ++EE KEEE+EE ++
Sbjct: 79 AEAKKEEKKEEEEEESDD 96
Score = 25.4 bits (56), Expect = 9.3
Identities = 8/23 (34%), Positives = 16/23 (69%)
Query: 120 EEKEEEEEKEKKEEEEKEKMEEE 142
E +KE+K+EEE+E+ +++
Sbjct: 75 AAAAAEAKKEEKKEEEEEESDDD 97
Score = 25.4 bits (56), Expect = 9.3
Identities = 9/24 (37%), Positives = 15/24 (62%)
Query: 116 EVKEEEKEEEEEKEKKEEEEKEKM 139
E ++EE+KE++EEE + M
Sbjct: 75 AAAAAEAKKEEKKEEEEEESDDDM 98
Score = 25.4 bits (56), Expect = 9.7
Identities = 8/21 (38%), Positives = 13/21 (61%)
Query: 107 EEKEEEEEEEVKEEEKEEEEE 127
E ++E K+EE+EEE +
Sbjct: 75 AAAAAEAKKEEKKEEEEEESD 95
>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
Length = 393
Score = 28.1 bits (63), Expect = 2.3
Identities = 17/42 (40%), Positives = 23/42 (54%)
Query: 116 EVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVH 157
E+K E E+ E+ ++ EEE EK EE +EE K R V
Sbjct: 93 ELKSGELEKRRERREEAEEELEKAIEEGDDEEIKKQSKRTVR 134
>gnl|CDD|215059 PLN00111, PLN00111, accumulation of photosystem one; Provisional.
Length = 399
Score = 28.1 bits (63), Expect = 2.5
Identities = 11/46 (23%), Positives = 17/46 (36%)
Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYL 152
+ + E+E +E + E E + E EEE K L
Sbjct: 221 GGRIVDFEDESEEPDPPPEGPSSPLLTELDDSEIEAPSSEEELKEL 266
>gnl|CDD|216337 pfam01159, Ribosomal_L6e, Ribosomal protein L6e.
Length = 108
Score = 26.8 bits (60), Expect = 2.5
Identities = 11/35 (31%), Positives = 26/35 (74%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
K+E+++K+++ E E E+KE++E E+++ ++K
Sbjct: 38 KREKKKKKKKSEGEFFAEKKEKKEVSEQRKADQKA 72
>gnl|CDD|237147 PRK12586, PRK12586, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 145
Score = 27.4 bits (61), Expect = 2.5
Identities = 8/41 (19%), Positives = 17/41 (41%)
Query: 88 AKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEK 128
A K +I E+ E +++ +++EE +K
Sbjct: 100 AYMYRKNDAHTHASILLSSNEQNSTEALQLRAKKREEHRKK 140
Score = 26.3 bits (58), Expect = 5.4
Identities = 8/44 (18%), Positives = 17/44 (38%), Gaps = 5/44 (11%)
Query: 97 LYEDNIKKEE-----EEKEEEEEEEVKEEEKEEEEEKEKKEEEE 135
+Y N E+ E ++ K+ EE ++K + +
Sbjct: 102 MYRKNDAHTHASILLSSNEQNSTEALQLRAKKREEHRKKWYQND 145
>gnl|CDD|221203 pfam11748, DUF3306, Protein of unknown function (DUF3306). This
family of proteobacterial species proteins has no known
function.
Length = 115
Score = 26.9 bits (60), Expect = 2.5
Identities = 13/46 (28%), Positives = 17/46 (36%), Gaps = 4/46 (8%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
+K EE E EE+ E+ EE E E+EE
Sbjct: 9 RKLAVRAEEPAEPAETAEEEAAAAA----PAPAPEEEEEAELEDEE 50
Score = 26.5 bits (59), Expect = 4.2
Identities = 13/55 (23%), Positives = 23/55 (41%), Gaps = 3/55 (5%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKM---EEEEKEEEETKYLVRDV 156
E E+E E+EEE E E +E E+ + + + + +L + V
Sbjct: 23 ETAEEEAAAAAPAPAPEEEEEAELEDEELLEELDLPDPDTLTPGSDFSAFLAKGV 77
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 27.8 bits (62), Expect = 2.6
Identities = 18/50 (36%), Positives = 31/50 (62%)
Query: 98 YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
+ K E E++ +E E KEE+K +EE+ E +E E++E +E+ E+ E
Sbjct: 4 ECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNE 53
Score = 26.3 bits (58), Expect = 7.1
Identities = 17/47 (36%), Positives = 31/47 (65%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
K+ ++ K E EE+ +E + +EE+K K+E+ E E++E+EE E+
Sbjct: 2 EKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDS 48
Score = 26.3 bits (58), Expect = 7.4
Identities = 18/65 (27%), Positives = 36/65 (55%)
Query: 86 DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
+ + + E++ KE E KEE++ +E E +E E+E+ ++ EE +++ EE +
Sbjct: 1 MEKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELK 60
Query: 146 EEETK 150
+E K
Sbjct: 61 DENNK 65
>gnl|CDD|218556 pfam05327, RRN3, RNA polymerase I specific transcription initiation
factor RRN3. This family consists of several eukaryotic
proteins which are homologous to the yeast RRN3 protein.
RRN3 is one of the RRN genes specifically required for
the transcription of rDNA by RNA polymerase I (Pol I) in
Saccharomyces cerevisiae.
Length = 554
Score = 28.0 bits (63), Expect = 2.6
Identities = 10/52 (19%), Positives = 30/52 (57%), Gaps = 1/52 (1%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
E + ++ + +EEEE + +E+ ++E++ +++++E+ + E + K
Sbjct: 217 EIQNELDDID-DEEEERVLADEDDDDEDDMFDMDDDDEEESDPEVERTSTIK 267
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 28.1 bits (63), Expect = 2.6
Identities = 18/61 (29%), Positives = 34/61 (55%)
Query: 84 LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
L + + E +G+L +++ E K+EE + E++E +E E E+E EEE +E ++
Sbjct: 361 LKEGEDDEENSGLLSMKFMQRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNV 420
Query: 144 K 144
Sbjct: 421 G 421
Score = 26.9 bits (60), Expect = 5.6
Identities = 22/66 (33%), Positives = 38/66 (57%)
Query: 88 AKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
E++K ++ K+++E EEEE E+ +E + E+ K K E+ +K EEEE+ +E
Sbjct: 433 EAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDE 492
Query: 148 ETKYLV 153
E +L
Sbjct: 493 ENPWLK 498
Score = 26.2 bits (58), Expect = 9.9
Identities = 18/70 (25%), Positives = 32/70 (45%)
Query: 87 QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
+ K + ++ K + E K+EE + E E+ ++E E +E+ +EEE EE
Sbjct: 355 RKKLGKLKEGEDDEENSGLLSMKFMQRAEARKKEENDAEIEELRRELEGEEESDEEENEE 414
Query: 147 EETKYLVRDV 156
K + R
Sbjct: 415 PSKKNVGRRK 424
>gnl|CDD|218055 pfam04386, SspB, Stringent starvation protein B. Escherichia coli
stringent starvation protein B (SspB), is thought to
enhance the specificity of degradation of tmRNA-tagged
proteins by the ClpXP protease. The tmRNA tag, also
known as ssrA, is an 11-aa peptide added to the C
terminus of proteins stalled during translation, targets
proteins for degradation by ClpXP and ClpAP. SspB a
cytoplasmic protein that specifically binds to residues
1-4 and 7 of the tag. Binding of SspB enhances
degradation of tagged proteins by ClpX, and masks
sequence elements important for ClpA interactions,
inhibiting degradation by ClpA. However, more recent
work has cast doubt on the importance of SspB in
wild-type cells. SspB is encoded in an operon whose
synthesis is stimulated by carbon, amino acid, and
phosphate starvation. SspB may play a special role
during nutrient stress, for example by ensuring rapid
degradation of the products of stalled translation,
without causing a global increase in degradation of all
ClpXP substrates.
Length = 153
Score = 27.2 bits (61), Expect = 2.6
Identities = 13/34 (38%), Positives = 20/34 (58%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
E EE +EEEEEE + + E+E + ++ K K
Sbjct: 111 EPEEADEEEEEEDDDADDSPEDEDPEPKDPPKTK 144
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 28.2 bits (62), Expect = 2.6
Identities = 12/40 (30%), Positives = 27/40 (67%)
Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
E K++EE+E ++ ++ +EEE KKE+E ++++ + +
Sbjct: 1024 ERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHD 1063
Score = 27.8 bits (61), Expect = 3.7
Identities = 13/60 (21%), Positives = 34/60 (56%)
Query: 91 TEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
+ + + +++ E K+ +E +K++E++E E ++ +EE+ +E+E+E+ K
Sbjct: 1000 SRRIELPVPPECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRK 1059
Score = 27.4 bits (60), Expect = 4.8
Identities = 20/51 (39%), Positives = 31/51 (60%), Gaps = 1/51 (1%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
E EK E ++ VKE K++EE KE+ E ++ K EE K+E+E + +R
Sbjct: 1010 ECREKHEIKDRIVKERIKDQEE-KERMESLQRAKEEEIGKKEKEREQRIRK 1059
Score = 27.0 bits (59), Expect = 5.2
Identities = 11/40 (27%), Positives = 24/40 (60%)
Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
E +++EE+E E + +EE+ K+E+E+E+ + +
Sbjct: 1024 ERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHD 1063
Score = 26.6 bits (58), Expect = 9.2
Identities = 19/66 (28%), Positives = 29/66 (43%), Gaps = 3/66 (4%)
Query: 85 DDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
D + +E + E E EE +E E E+EE +K+EE + E EE+
Sbjct: 606 DLEDEENSSDNEMEESRGSSVTAENEESADEVDYETEREE---NARKKEELRGNFELEER 662
Query: 145 EEEETK 150
+ E K
Sbjct: 663 GDPEKK 668
>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
Members of this family contain a region found
exclusively in eukaryotic sodium channels or their
subunits, many of which are voltage-gated. Members very
often also contain between one and four copies of
pfam00520 and, less often, one copy of pfam00612.
Length = 230
Score = 27.7 bits (62), Expect = 2.6
Identities = 17/50 (34%), Positives = 27/50 (54%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
E++ K E + EE++ E + E ++EEEE + EEE KE E+
Sbjct: 132 EESKKGSAETLKLEEKDSSSSEGSTVDLEPPEEEEEEIAEEEEEVKEPED 181
>gnl|CDD|235604 PRK05778, PRK05778, 2-oxoglutarate ferredoxin oxidoreductase
subunit beta; Validated.
Length = 301
Score = 27.9 bits (63), Expect = 2.7
Identities = 12/55 (21%), Positives = 20/55 (36%), Gaps = 5/55 (9%)
Query: 84 LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
L+++ GV Y K E EE E++ E E + +E +
Sbjct: 250 LEEELGGKIPIGVFY-----KNERPTFEERLEKLIEPLLELPPAALRPGKEALDT 299
Score = 27.1 bits (61), Expect = 3.8
Identities = 12/49 (24%), Positives = 20/49 (40%), Gaps = 1/49 (2%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
+ K E ++ V + K EE+ ++ EKM EEE +
Sbjct: 211 NTSTKSPAYMREYYKKRVYKL-KLEEDYDPTDRDKAAEKMLEEELGGKI 258
>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
ribosomal biogenesis [Translation, ribosomal structure
and biogenesis].
Length = 821
Score = 28.1 bits (62), Expect = 2.8
Identities = 21/71 (29%), Positives = 33/71 (46%), Gaps = 4/71 (5%)
Query: 84 LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEK----KEEEEKEKM 139
D + E +D K E +E ++EE + E + E++ E E+++
Sbjct: 717 FSDSTSDDEPKLDAIDDEDAKSEGSQESDQEEGLDEIFYSFDGEQDNSDSFAESSEEDES 776
Query: 140 EEEEKEEEETK 150
EEEKEEEE K
Sbjct: 777 SEEEKEEEENK 787
Score = 26.5 bits (58), Expect = 9.5
Identities = 16/29 (55%), Positives = 19/29 (65%)
Query: 110 EEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
E EE+E EEEKEEEE KE + K+K
Sbjct: 769 ESSEEDESSEEEKEEEENKEVSAKRAKKK 797
>gnl|CDD|221733 pfam12720, DUF3807, Protein of unknown function (DUF3807). This is
a family of conserved fungal proteins of unknown
function.
Length = 169
Score = 27.4 bits (61), Expect = 2.8
Identities = 13/61 (21%), Positives = 28/61 (45%)
Query: 90 ETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
+E +L E +K+E E +EE E + + E ++E E++ ++ K +
Sbjct: 59 HSEIHALLRERELKEEAEAEEEGEVDASPDAGAVAGESSADRKEAEQQGAAQKRKSCRDK 118
Query: 150 K 150
+
Sbjct: 119 E 119
>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1. All
proteins in this family for which functions are known
are cyclin dependent protein kinases that are components
of TFIIH, a complex that is involved in nucleotide
excision repair and transcription initiation. Also known
as MAT1 (menage a trois 1). This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 309
Score = 27.8 bits (62), Expect = 2.8
Identities = 20/46 (43%), Positives = 31/46 (67%), Gaps = 3/46 (6%)
Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEK---EKKEEEEKEKMEEEEKEE 146
KE+ +E+EE EE E EKEEEE++ +KEEEE++ + + K+
Sbjct: 137 KEKSTREQEELEEALEFEKEEEEQRRLLLQKEEEEQQMNKRKNKQA 182
>gnl|CDD|116948 pfam08367, M16C_assoc, Peptidase M16C associated. This domain
appears in eukaryotes as well as bacteria and tends to
be found near the C-terminus of the metalloprotease M16C
(pfam05193).
Length = 248
Score = 27.6 bits (62), Expect = 2.9
Identities = 16/47 (34%), Positives = 26/47 (55%), Gaps = 1/47 (2%)
Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKE-EEEKEKMEEEEKEEEETKYL 152
EEK EEE E E++K E++K++ E ++EE + E+ L
Sbjct: 8 EEKLNEEEREKLEKKKSSLSEEDKEKIIERGLELEELQSTPEDLSCL 54
Score = 26.0 bits (58), Expect = 9.3
Identities = 16/40 (40%), Positives = 24/40 (60%)
Query: 109 KEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
E EE+ +EE ++ E++K EE+KEK+ E E EE
Sbjct: 4 DETLEEKLNEEEREKLEKKKSSLSEEDKEKIIERGLELEE 43
>gnl|CDD|215283 PLN02518, PLN02518, pheophorbide a oxygenase.
Length = 539
Score = 27.9 bits (62), Expect = 2.9
Identities = 8/26 (30%), Positives = 14/26 (53%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEK 130
E +++E EE+ E+E +E K
Sbjct: 59 EAALQQDEGEEQRVEQELGQESSDSK 84
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 28.1 bits (63), Expect = 2.9
Identities = 13/58 (22%), Positives = 27/58 (46%), Gaps = 9/58 (15%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEE----EEEK-----EKKEEEEKEKMEEEEKEEEE 148
++ + E EK ++E+ + +E +EK E K +E+ +E ++EE
Sbjct: 655 ESQQAEVTEKARTQDEQQQAPRRERQRRRNDEKRQAQQEAKALNVEEQSVQETEQEER 712
Score = 27.3 bits (61), Expect = 4.9
Identities = 9/49 (18%), Positives = 29/49 (59%)
Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
E +E ++ E ++ ++E+++ + E ++ + +E+ + ++E K L +
Sbjct: 652 ETRESQQAEVTEKARTQDEQQQAPRRERQRRRNDEKRQAQQEAKALNVE 700
>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain. DBINO is a DNA-binding
domain found on global transcription activator SNF2L1
proteins and chromatin re-modelling proteins.
Length = 140
Score = 27.2 bits (61), Expect = 2.9
Identities = 15/28 (53%), Positives = 18/28 (64%), Gaps = 1/28 (3%)
Query: 110 EEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
E+EE E K EKE E+ KKEEE +E
Sbjct: 82 EKEERELRKRAEKEALEQA-KKEEELRE 108
Score = 26.1 bits (58), Expect = 6.6
Identities = 13/27 (48%), Positives = 20/27 (74%)
Query: 122 KEEEEEKEKKEEEEKEKMEEEEKEEEE 148
K E+EE+E ++ EKE +E+ +KEEE
Sbjct: 80 KNEKEERELRKRAEKEALEQAKKEEEL 106
>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region. The myc family
belongs to the basic helix-loop-helix leucine zipper
class of transcription factors, see pfam00010. Myc forms
a heterodimer with Max, and this complex regulates cell
growth through direct activation of genes involved in
cell replication. Mutations in the C-terminal 20
residues of this domain cause unique changes in the
induction of apoptosis, transformation, and G2 arrest.
Length = 329
Score = 27.6 bits (61), Expect = 2.9
Identities = 16/42 (38%), Positives = 21/42 (50%), Gaps = 6/42 (14%)
Query: 106 EEEKEEEEEEEVKEEEKEEEE------EKEKKEEEEKEKMEE 141
+ E EE+EE +EEE+EEEE EK + K E
Sbjct: 223 GSDSESEEDEEEEEEEEEEEEIDVVTVEKRRSSSNRKASTSE 264
Score = 26.4 bits (58), Expect = 7.2
Identities = 14/34 (41%), Positives = 18/34 (52%), Gaps = 4/34 (11%)
Query: 105 EEEEKEEEEEEEVKEEEKE----EEEEKEKKEEE 134
E EE EEEEEEE +EEE + E+ +
Sbjct: 227 ESEEDEEEEEEEEEEEEIDVVTVEKRRSSSNRKA 260
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 26.9 bits (60), Expect = 2.9
Identities = 14/25 (56%), Positives = 18/25 (72%)
Query: 110 EEEEEEEVKEEEKEEEEEKEKKEEE 134
EE +EEE+EEEEE+E+ EEE
Sbjct: 71 AAAAAEEEEEEEEEEEEEEEESEEE 95
Score = 26.2 bits (58), Expect = 4.1
Identities = 13/25 (52%), Positives = 18/25 (72%)
Query: 111 EEEEEEVKEEEKEEEEEKEKKEEEE 135
E +EEE+EEEEE+E++ EEE
Sbjct: 71 AAAAAEEEEEEEEEEEEEEEESEEE 95
Score = 26.2 bits (58), Expect = 4.3
Identities = 14/28 (50%), Positives = 20/28 (71%)
Query: 113 EEEEVKEEEKEEEEEKEKKEEEEKEKME 140
+EEE+EEEEE+E++EE E+E M
Sbjct: 71 AAAAAEEEEEEEEEEEEEEEESEEEAMA 98
Score = 25.8 bits (57), Expect = 6.4
Identities = 13/25 (52%), Positives = 18/25 (72%)
Query: 119 EEEKEEEEEKEKKEEEEKEKMEEEE 143
EEEE+E++EEEE+E+ EEE
Sbjct: 71 AAAAAEEEEEEEEEEEEEEEESEEE 95
>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
Length = 663
Score = 28.0 bits (62), Expect = 2.9
Identities = 14/50 (28%), Positives = 27/50 (54%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYL 152
+K E +EE E +K +K +EE++ +E E + E K+ E+++
Sbjct: 468 RKRGGELKEELIETLKLVKKLKEEQESLAKELEATAHKSEIKKIAESEFK 517
>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region. The domain
is found in the primary vegetative sigma factor. The
function of this domain is unclear and can be removed
without loss of function.
Length = 211
Score = 27.5 bits (62), Expect = 3.0
Identities = 8/47 (17%), Positives = 24/47 (51%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
+ E E + ++ E ++++++++ E++E+E + EE
Sbjct: 33 NAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGPDPEE 79
Score = 27.2 bits (61), Expect = 4.1
Identities = 7/39 (17%), Positives = 20/39 (51%)
Query: 110 EEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
+ E +E++ +++++ ++ E+E+ EEE
Sbjct: 33 NAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEA 71
Score = 26.0 bits (58), Expect = 9.1
Identities = 9/49 (18%), Positives = 21/49 (42%)
Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVR 154
++ E EE+ E + ++E++++ +EEE + R
Sbjct: 33 NAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGPDPEEAR 81
>gnl|CDD|227504 COG5177, COG5177, Uncharacterized conserved protein [Function
unknown].
Length = 769
Score = 27.7 bits (61), Expect = 3.1
Identities = 16/49 (32%), Positives = 27/49 (55%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
DN + +E+ ++ E EE EEEE++ +E + EK + E +E E
Sbjct: 422 DNEEVAGDEESAIDDNEGFEELSPEEEERQLREFRDMEKEDREFPDEAE 470
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding
protein (GBP) is induced by interferon-gamma during
macrophage induction. This family contains GBP1 and
GPB2, both GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 27.6 bits (62), Expect = 3.1
Identities = 16/59 (27%), Positives = 31/59 (52%)
Query: 91 TEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
T K + + K E E E+E E ++EE++ E +E+ +E +++ E+ + E E
Sbjct: 200 TAKEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREK 258
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 27.7 bits (62), Expect = 3.2
Identities = 21/78 (26%), Positives = 40/78 (51%)
Query: 83 VLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
+L ++ E+ E +++ E + +E EE + EEK EE ++E + E + + E
Sbjct: 306 ILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEA 365
Query: 143 EKEEEETKYLVRDVHLES 160
E EE E++ + LE+
Sbjct: 366 ELEELESRLEELEEQLET 383
Score = 26.6 bits (59), Expect = 7.9
Identities = 20/69 (28%), Positives = 31/69 (44%), Gaps = 1/69 (1%)
Query: 83 VLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEE-EVKEEEKEEEEEKEKKEEEEKEKMEE 141
L++Q +E + I++ EE EE E E E E+ EE E E++ E
Sbjct: 842 DLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSE 901
Query: 142 EEKEEEETK 150
E +E E +
Sbjct: 902 ELRELESKR 910
>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
Length = 246
Score = 27.3 bits (61), Expect = 3.3
Identities = 15/53 (28%), Positives = 24/53 (45%)
Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDV 156
E E + E E KE + + K + + K K + +K EE+ K V+ V
Sbjct: 72 VEPEPEPEPIPEPPKEAPVVIPKPEPKPKPKPKPKPKPVKKVEEQPKREVKPV 124
>gnl|CDD|234306 TIGR03674, fen_arch, flap structure-specific endonuclease.
Endonuclease that cleaves the 5'-overhanging flap
structure that is generated by displacement synthesis
when DNA polymerase encounters the 5'-end of a
downstream Okazaki fragment. Has 5'-endo-/exonuclease
and 5'-pseudo-Y-endonuclease activities. Cleaves the
junction between single and double-stranded regions of
flap DNA.
Length = 338
Score = 27.6 bits (62), Expect = 3.4
Identities = 17/45 (37%), Positives = 23/45 (51%)
Query: 116 EVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLES 160
E+K E EE E ++ EE+ E+ E+ EE KY R L S
Sbjct: 85 ELKAETLEERREIREEAEEKWEEALEKGDLEEARKYAQRSSRLTS 129
>gnl|CDD|214360 CHL00094, dnaK, heat shock protein 70.
Length = 621
Score = 27.8 bits (62), Expect = 3.4
Identities = 19/60 (31%), Positives = 30/60 (50%), Gaps = 13/60 (21%)
Query: 110 EEEEEEEVKEEEKEEEEEKEKKE-------------EEEKEKMEEEEKEEEETKYLVRDV 156
++E E VKE EK E+KEK+E + EK+ E ++K EE K + ++
Sbjct: 504 KDEVERMVKEAEKNAAEDKEKREKIDLKNQAESLCYQAEKQLKELKDKISEEKKEKIENL 563
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 26.9 bits (60), Expect = 3.6
Identities = 15/34 (44%), Positives = 24/34 (70%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKE 132
ED +EE++EEE++EE +E++ EEEE K+
Sbjct: 111 EDESDDDEEDEEEEDDEEDDDEDESEEEESPVKK 144
Score = 26.9 bits (60), Expect = 4.0
Identities = 12/32 (37%), Positives = 26/32 (81%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEK 136
EE++ +++EE+ +EE+ EE++++++ EEEE
Sbjct: 109 SEEDESDDDEEDEEEEDDEEDDDEDESEEEES 140
Score = 26.5 bits (59), Expect = 4.5
Identities = 14/34 (41%), Positives = 27/34 (79%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
EE+E +++EE+E +E+++E+++E E +EEE K
Sbjct: 110 EEDESDDDEEDEEEEDDEEDDDEDESEEEESPVK 143
>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO. Bacillus
spores are protected by a protein shell consisting of
over 50 different polypeptides, known as the coat. This
family of proteins has an important morphogenetic role
in coat assembly, it is involved in the assembly of at
least 5 different coat proteins including CotB, CotG,
CotS, CotSA and CotW. It is likely to act at a late
stage of coat assembly.
Length = 185
Score = 27.1 bits (60), Expect = 3.6
Identities = 21/56 (37%), Positives = 36/56 (64%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVR 154
E I+ EE EKE+E E +E+ E++E++E +EEEKE+ E+ K++E + +
Sbjct: 56 EQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEKEEEAEDVKQQEVFSFKRK 111
Score = 26.7 bits (59), Expect = 5.6
Identities = 24/63 (38%), Positives = 43/63 (68%)
Query: 88 AKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
A E E+ ++++K +EEE++ E EE KE+E E E ++ E++EKE++ +EE++EE
Sbjct: 36 ADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEKEE 95
Query: 148 ETK 150
E +
Sbjct: 96 EAE 98
>gnl|CDD|226115 COG3587, COG3587, Restriction endonuclease [Defense mechanisms].
Length = 985
Score = 27.4 bits (61), Expect = 3.6
Identities = 16/54 (29%), Positives = 29/54 (53%)
Query: 109 KEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLESCE 162
KE +E +K+ ++EEE K +E+ K+ E E+ L++D +L+ E
Sbjct: 563 KEINDESFIKQGFAKDEEELSKLILKERGKVLPETLEKLLFGDLIKDDYLDLNE 616
>gnl|CDD|223649 COG0576, GrpE, Molecular chaperone GrpE (heat shock protein)
[Posttranslational modification, protein turnover,
chaperones].
Length = 193
Score = 27.3 bits (61), Expect = 3.7
Identities = 22/49 (44%), Positives = 30/49 (61%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
+ K EE + EE EE E EEE+ EEEE E++ E E+E+ E E E +
Sbjct: 3 DKEQKTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQQEIAELEAQ 51
>gnl|CDD|224849 COG1938, COG1938, Archaeal enzymes of ATP-grasp superfamily
[General function prediction only].
Length = 244
Score = 27.3 bits (61), Expect = 3.7
Identities = 13/38 (34%), Positives = 26/38 (68%)
Query: 115 EEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYL 152
+++++E +E EE+ EK E+ +++ E E+EEE Y+
Sbjct: 207 DKLEKEAEEIEEQLEKLAEQLEKEEERVEREEEPPMYM 244
Score = 26.1 bits (58), Expect = 8.8
Identities = 18/46 (39%), Positives = 27/46 (58%), Gaps = 3/46 (6%)
Query: 97 LYEDNIKKEEEEKEEEEEEEVKE---EEKEEEEEKEKKEEEEKEKM 139
+ N+ ++ EKE EE EE E E+ E+EEE+ ++EEE M
Sbjct: 199 MLGLNVDTDKLEKEAEEIEEQLEKLAEQLEKEEERVEREEEPPMYM 244
>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 257 to 277 amino acids in length. This domain is
found associated with pfam00004. This domain has a
conserved LER sequence motif.
Length = 276
Score = 27.4 bits (61), Expect = 3.7
Identities = 18/76 (23%), Positives = 40/76 (52%)
Query: 84 LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
L+ Q ++ E+ + E+++ ++E + EEE ++ + EEE E + E + K+E E
Sbjct: 124 LEQQRRQNEELLKMQEESVLRQEAMRRATEEEILEMRRETIEEEAELERENIRAKIEAEA 183
Query: 144 KEEEETKYLVRDVHLE 159
+ + + D++ E
Sbjct: 184 RGRAKEERENEDINRE 199
>gnl|CDD|215618 PLN03184, PLN03184, chloroplast Hsp70; Provisional.
Length = 673
Score = 27.5 bits (61), Expect = 3.7
Identities = 17/62 (27%), Positives = 33/62 (53%), Gaps = 2/62 (3%)
Query: 84 LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
++ +E EK ED K++ + + + + V + EK+ +E +K + KEK+E +
Sbjct: 544 VERMVQEAEKFA--KEDKEKRDAVDTKNQADSVVYQTEKQLKELGDKVPADVKEKVEAKL 601
Query: 144 KE 145
KE
Sbjct: 602 KE 603
>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor. The
spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
U5) and proteins, catalyzes the excision of introns from
pre-mRNAs in two successive trans-esterification
reactions. Step 2 depends upon integral spliceosome
constituents such as U5 snRNA and Prp8 and
non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
conformational change in the spliceosome that leads to
protection of the 3'ss from targeted RNase H cleavage.
This change, which probably reflects binding of the 3'ss
PyAG in the catalytic centre of the spliceosome,
requires the ordered recruitment of Slu7, Prp18, and
Prp22 to the spliceosome. There is a close functional
relationship between Prp8, Prp18, and Slu7, and Prp18
interacts with Slu7, so that together they recruit Prp22
to the spliceosome. Most members of the family carry a
zinc-finger of the CCHC-type upstream of this domain.
Length = 236
Score = 27.4 bits (61), Expect = 3.8
Identities = 15/42 (35%), Positives = 26/42 (61%)
Query: 111 EEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYL 152
E +E V+E EK EE K+ K ++E + + + ++E+E YL
Sbjct: 17 SEYKEVVEEYEKLEELRKKLKAKKEDDSDDTDYEDEDEELYL 58
>gnl|CDD|223092 COG0013, AlaS, Alanyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 879
Score = 27.6 bits (62), Expect = 3.8
Identities = 20/57 (35%), Positives = 25/57 (43%), Gaps = 2/57 (3%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE--ETKYLVRDVH 157
E K E EE+KE EKE E K+K E + K EE K L ++V
Sbjct: 727 PPELLPKVERLLEELKELEKELERLKKKLAAAELADLLSNAKAEEIGGVKVLAKEVD 783
>gnl|CDD|236356 PRK08944, motB, flagellar motor protein MotB; Reviewed.
Length = 302
Score = 27.3 bits (61), Expect = 3.8
Identities = 15/60 (25%), Positives = 32/60 (53%), Gaps = 4/60 (6%)
Query: 83 VLDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
L+ QA E++ AG ++ +++ E EVK E+ +E+ E ++ +++E+E
Sbjct: 103 TLEFQAGESDSAG----GTQQQRGKQRGGEASAEVKAEKAAAQEQIEDLVKKLAQQLEKE 158
>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
Length = 1560
Score = 27.7 bits (61), Expect = 3.9
Identities = 11/41 (26%), Positives = 17/41 (41%)
Query: 108 EKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
K E+ E +K E+ ++E E E E E E +
Sbjct: 547 AKREDHPEGGTNRQKYEQSDEESVESSSSENSSENENEVTD 587
>gnl|CDD|236168 PRK08156, PRK08156, type III secretion system protein SpaS;
Validated.
Length = 361
Score = 27.3 bits (61), Expect = 4.0
Identities = 14/35 (40%), Positives = 22/35 (62%), Gaps = 3/35 (8%)
Query: 113 EEEEVKEEEKEEE---EEKEKKEEEEKEKMEEEEK 144
+++EVK E KE+E E K K+ E +E + E+ K
Sbjct: 211 DKQEVKREYKEQEGNPEIKSKRREAHQEILSEQVK 245
>gnl|CDD|217451 pfam03249, TSA, Type specific antigen. There are several antigenic
variants in Rickettsia tsutsugamushi, and a
type-specific antigen (TSA) of 56-kilodaltons located on
the rickettsial surface is responsible for the
variation. TSA proteins are probably integral membrane
proteins.
Length = 502
Score = 27.5 bits (61), Expect = 4.0
Identities = 17/59 (28%), Positives = 27/59 (45%), Gaps = 10/59 (16%)
Query: 97 LYED--------NIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
LY+D IKK E+ ++EE+ + + K+K+ EK K + KE E
Sbjct: 346 LYKDLVKLQRHAGIKKAMEKLAAQQEEDAGNQG--GGDCKKKQGASEKSKEGGKGKETE 402
>gnl|CDD|240285 PTZ00135, PTZ00135, 60S acidic ribosomal protein P0; Provisional.
Length = 310
Score = 27.3 bits (61), Expect = 4.0
Identities = 5/21 (23%), Positives = 9/21 (42%)
Query: 106 EEEKEEEEEEEVKEEEKEEEE 126
+EEE+EE++
Sbjct: 283 AAAAAAAAAAPAEEEEEEEDD 303
Score = 27.3 bits (61), Expect = 4.0
Identities = 6/15 (40%), Positives = 8/15 (53%)
Query: 103 KKEEEEKEEEEEEEV 117
+EEEEEE+
Sbjct: 289 AAAAPAEEEEEEEDD 303
Score = 26.9 bits (60), Expect = 5.8
Identities = 6/18 (33%), Positives = 8/18 (44%)
Query: 110 EEEEEEEVKEEEKEEEEE 127
EE+EEEE+
Sbjct: 285 AAAAAAAAPAEEEEEEED 302
Score = 26.5 bits (59), Expect = 6.1
Identities = 5/22 (22%), Positives = 9/22 (40%)
Query: 118 KEEEKEEEEEKEKKEEEEKEKM 139
++EEEE++ M
Sbjct: 283 AAAAAAAAAAPAEEEEEEEDDM 304
Score = 26.5 bits (59), Expect = 6.9
Identities = 6/21 (28%), Positives = 9/21 (42%)
Query: 107 EEKEEEEEEEVKEEEKEEEEE 127
EEE+EEE++
Sbjct: 283 AAAAAAAAAAPAEEEEEEEDD 303
Score = 26.1 bits (58), Expect = 8.7
Identities = 6/21 (28%), Positives = 9/21 (42%)
Query: 128 KEKKEEEEKEKMEEEEKEEEE 148
EEEE+EE++
Sbjct: 283 AAAAAAAAAAPAEEEEEEEDD 303
>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
Length = 701
Score = 27.3 bits (60), Expect = 4.0
Identities = 17/61 (27%), Positives = 36/61 (59%)
Query: 97 LYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDV 156
L + ++E E+EE+ + +E+ EE + +E KE +E +K + ++ +E +Y V++
Sbjct: 205 LMVEKTTEKEVTDEDEEDTKKADEDGEEPKVEEVKEGDEGKKKKTKKVKEVTKEYEVQNK 264
Query: 157 H 157
H
Sbjct: 265 H 265
Score = 26.9 bits (59), Expect = 6.2
Identities = 18/56 (32%), Positives = 32/56 (57%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
D +++ ++ +E+ EE EE KE +E K+KK ++ KE +E E + + RD
Sbjct: 217 DEDEEDTKKADEDGEEPKVEEVKEGDEGKKKKTKKVKEVTKEYEVQNKHKPLWTRD 272
>gnl|CDD|234235 TIGR03503, TIGR03503, TIGR03503 family protein. This set of
conserved hypothetical protein has a phylogenetic range
that closely matches that of TIGR03501, a putative
C-terminal protein targeting signal.
Length = 374
Score = 27.3 bits (61), Expect = 4.1
Identities = 14/45 (31%), Positives = 22/45 (48%), Gaps = 1/45 (2%)
Query: 110 EEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEE-EEKEEEETKYLV 153
EE E E EE EE + K E+E+ + E E K+ + ++
Sbjct: 308 EEPPEAEPSAEEAATEEALKAKAEQEQALLVEREAKDSKFWIIII 352
>gnl|CDD|237622 PRK14140, PRK14140, heat shock protein GrpE; Provisional.
Length = 191
Score = 26.9 bits (60), Expect = 4.1
Identities = 22/45 (48%), Positives = 29/45 (64%)
Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
EEK E+ EEEV+E E EE E E +EE +E+ E E +EE+ K
Sbjct: 1 MEEKNEQVEEEVEETEVEEAVEDEVEEETVEEESEAELLDEEQAK 45
Score = 26.5 bits (59), Expect = 5.6
Identities = 23/55 (41%), Positives = 33/55 (60%), Gaps = 1/55 (1%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKME-EEEKEEEETKYL 152
+ + EE E EE E+EV+EE EEE E E +EE+ + E E + +E E +YL
Sbjct: 7 QVEEEVEETEVEEAVEDEVEEETVEEESEAELLDEEQAKIAELEAKLDELEERYL 61
>gnl|CDD|224035 COG1110, COG1110, Reverse gyrase [DNA replication, recombination,
and repair].
Length = 1187
Score = 27.3 bits (61), Expect = 4.2
Identities = 13/31 (41%), Positives = 18/31 (58%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKE 132
IK + E+ E V+EE +E E E+EKK
Sbjct: 243 IKLRRKLYGEKRAERVREELREVEREREKKR 273
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 27.1 bits (60), Expect = 4.2
Identities = 17/56 (30%), Positives = 32/56 (57%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHL 158
KK ++++ +E ++ EEE E E+E + +E E + +EEE++ + Y D L
Sbjct: 156 KKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYDGML 211
>gnl|CDD|203489 pfam06644, ATP11, ATP11 protein. This family consists of several
eukaryotic ATP11 proteins. In Saccharomyces cerevisiae,
expression of functional F1-ATPase requires two proteins
encoded by the ATP11 and ATP12 genes. Atp11p is a
molecular chaperone of the mitochondrial matrix that
participates in the biogenesis pathway to form F1, the
catalytic unit of the ATP synthase.
Length = 250
Score = 26.9 bits (60), Expect = 4.3
Identities = 10/53 (18%), Positives = 25/53 (47%)
Query: 98 YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
Y + ++ +E E E +K+ K++ E+KE ++ ++ + + K
Sbjct: 4 YRSKLLQKAKESGLEFIERLKKALKDKIEKKEFSAKKPPTGPSKQASKFKTLK 56
>gnl|CDD|233503 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, large subunit,
splicing factor. These splicing factors consist of an
N-terminal arginine-rich low complexity domain followed
by three tandem RNA recognition motifs (pfam00076). The
well-characterized members of this family are auxilliary
components of the U2 small nuclear ribonuclearprotein
splicing factor (U2AF). These proteins are closely
related to the CC1-like subfamily of splicing factors
(TIGR01622). Members of this subfamily are found in
plants, metazoa and fungi.
Length = 509
Score = 27.2 bits (60), Expect = 4.3
Identities = 10/42 (23%), Positives = 20/42 (47%)
Query: 121 EKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLESCE 162
++E + E+EK ++++ E + + RD H S E
Sbjct: 2 DEEPDREREKSRGRDRDRSSERPRRRSRDRSRFRDRHRRSRE 43
>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein. This
family includes archaebacterial L12, eukaryotic P0, P1
and P2.
Length = 88
Score = 26.0 bits (58), Expect = 4.3
Identities = 11/20 (55%), Positives = 13/20 (65%)
Query: 108 EKEEEEEEEVKEEEKEEEEE 127
EEE KEEE+EEEE+
Sbjct: 61 AAAAAAEEEKKEEEEEEEED 80
Score = 25.3 bits (56), Expect = 8.7
Identities = 9/22 (40%), Positives = 15/22 (68%)
Query: 118 KEEEKEEEEEKEKKEEEEKEKM 139
EEE+++E++EEEE + M
Sbjct: 62 AAAAAEEEKKEEEEEEEEDDDM 83
>gnl|CDD|221179 pfam11711, Tim54, Inner membrane protein import complex subunit
Tim54. Mitochondrial function depends on the import of
hundreds of different proteins synthesised in the
cytosol. Protein import is a multi-step pathway which
includes the binding of precursor proteins to surface
receptors, translocation of the precursor across one or
both mitochondrial membranes, and folding and assembly
of the imported protein inside the mitochondrion. Most
precursor proteins carry amino-terminal targeting
signals, called pre-sequences, and are imported into
mitochondria via import complexes located in both the
outer and the inner membrane (IM). The IM complex, TIM,
is made up of at least two proteins which mediate
translocation of proteins into the matrix by removing
their signal peptide and another pair of proteins, Tim54
and Tim22, that insert the polytopic proteins, that
carry internal targetting information, into the inner
membrane.
Length = 377
Score = 27.4 bits (61), Expect = 4.4
Identities = 11/44 (25%), Positives = 17/44 (38%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
+E E E E + E E+ E E+ E+ +EE
Sbjct: 204 TVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENN 247
Score = 27.0 bits (60), Expect = 5.5
Identities = 11/46 (23%), Positives = 17/46 (36%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
+E E E E E E +E E +E + E+E +
Sbjct: 205 VDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENNKPV 250
Score = 26.6 bits (59), Expect = 6.4
Identities = 13/46 (28%), Positives = 17/46 (36%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
E E E EV+ E + +E E E E+ EEE
Sbjct: 202 EPTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENN 247
Score = 26.2 bits (58), Expect = 9.1
Identities = 12/48 (25%), Positives = 19/48 (39%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
E + + E E E + + EE E EE ++ EEE +
Sbjct: 202 EPTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENNKP 249
>gnl|CDD|224009 COG1084, COG1084, Predicted GTPase [General function prediction
only].
Length = 346
Score = 27.3 bits (61), Expect = 4.4
Identities = 11/41 (26%), Positives = 20/41 (48%)
Query: 106 EEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
+ +EE+ EE++ EE E+ K K ++ +EE
Sbjct: 290 IDIADEEKLEEIEASVLEEGGEEPLKISATKGCGLDKLREE 330
>gnl|CDD|234271 TIGR03591, polynuc_phos, polyribonucleotide nucleotidyltransferase.
Members of this protein family are polyribonucleotide
nucleotidyltransferase, also called polynucleotide
phosphorylase. Some members have been shown also to have
additional functions as guanosine pentaphosphate
synthetase and as poly(A) polymerase (see model
TIGR02696 for an exception clade, within this family)
[Transcription, Degradation of RNA].
Length = 684
Score = 27.5 bits (62), Expect = 4.4
Identities = 18/46 (39%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 103 KKEEEEKEEEEEEEVKEE--EKEEEEEKEKKEEEEKEKMEEEEKEE 146
K+E + +EEV E E+EEEEE+ +E+E KE ++ EK+
Sbjct: 255 KQERYAALDAIKEEVLEALAEEEEEEEEADREKEIKEAFKDLEKKI 300
>gnl|CDD|216689 pfam01765, RRF, Ribosome recycling factor. The ribosome recycling
factor (RRF / ribosome release factor) dissociates the
ribosome from the mRNA after termination of translation,
and is essential bacterial growth. Thus ribosomes are
"recycled" and ready for another round of protein
synthesis.
Length = 165
Score = 26.7 bits (60), Expect = 4.5
Identities = 14/45 (31%), Positives = 30/45 (66%), Gaps = 1/45 (2%)
Query: 86 DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKE-EEKEEEEEKE 129
D+ K+ EK + ED +K+ E+E ++ ++ +K+ +E +++EKE
Sbjct: 119 DKLKKLEKDKEISEDEVKRAEKEIQKLTDKYIKKIDELLKKKEKE 163
>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family. This
family consists of the 116kDa V-type ATPase (vacuolar
(H+)-ATPases) subunits, as well as V-type ATP synthase
subunit i. The V-type ATPases family are proton pumps
that acidify intracellular compartments in eukaryotic
cells for example yeast central vacuoles,
clathrin-coated and synaptic vesicles. They have
important roles in membrane trafficking processes. The
116kDa subunit (subunit a) in the V-type ATPase is part
of the V0 functional domain responsible for proton
transport. The a subunit is a transmembrane glycoprotein
with multiple putative transmembrane helices it has a
hydrophilic amino terminal and a hydrophobic carboxy
terminal. It has roles in proton transport and assembly
of the V-type ATPase complex. This subunit is encoded by
two homologous gene in yeast VPH1 and STV1.
Length = 707
Score = 27.3 bits (61), Expect = 4.6
Identities = 21/66 (31%), Positives = 30/66 (45%), Gaps = 1/66 (1%)
Query: 84 LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKE-EEKEEEEEKEKKEEEEKEKMEEE 142
L K+T + + EE+ + E E+KE EE E EKE E EE + +E
Sbjct: 50 LGIPLKDTGGKPDVPPSKEFLDLEEEILDLEAEIKEVEENLESLEKEINELEEWLNVLDE 109
Query: 143 EKEEEE 148
EK +
Sbjct: 110 EKSFLD 115
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 26.9 bits (60), Expect = 4.7
Identities = 14/36 (38%), Positives = 21/36 (58%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
K+E EE E E +++ K +EE K+KK +E K
Sbjct: 234 KRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGV 269
>gnl|CDD|205200 pfam13019, Telomere_Sde2, Telomere stability and silencing. Sde2
has been identified in fission yeast as an important
factor in telomere formation and maintenance. This is a
more N-terminal domain on these nuclear proteins, and is
essential for telomeric silencing and genomic stability.
Length = 163
Score = 26.7 bits (59), Expect = 4.7
Identities = 10/36 (27%), Positives = 17/36 (47%)
Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
E K E +K E +++E+E +K E + E
Sbjct: 128 EAKALAEYLAIKPEMEKKEKEARRKRWESVVEALER 163
Score = 26.7 bits (59), Expect = 5.0
Identities = 10/29 (34%), Positives = 12/29 (41%)
Query: 120 EEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
E K E K E EK++ E K E
Sbjct: 128 EAKALAEYLAIKPEMEKKEKEARRKRWES 156
>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3. This family
consists of several radial spoke protein 3 (RSP3)
sequences. Eukaryotic cilia and flagella present in
diverse types of cells perform motile, sensory, and
developmental functions in organisms from protists to
humans. They are centred by precisely organised,
microtubule-based structures, the axonemes. The axoneme
consists of two central singlet microtubules, called the
central pair, and nine outer doublet microtubules. These
structures are well-conserved during evolution. The
outer doublet microtubules, each composed of A and B
sub-fibres, are connected to each other by nexin links,
while the central pair is held at the centre of the
axoneme by radial spokes. The radial spokes are T-shaped
structures extending from the A-tubule of each outer
doublet microtubule to the centre of the axoneme. Radial
spoke protein 3 (RSP3), is present at the proximal end
of the spoke stalk and helps in anchoring the radial
spoke to the outer doublet. It is thought that radial
spokes regulate the activity of inner arm dynein through
protein phosphorylation and dephosphorylation.
Length = 288
Score = 26.9 bits (60), Expect = 4.7
Identities = 12/49 (24%), Positives = 33/49 (67%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
+++++ + E+ E+ E ++ EE E+ ++EE+E+ K +++E+++ E
Sbjct: 155 AELRQQQRQFEQRRNAELAETQRLEEAERRRREEKERRKKQDKERKQRE 203
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 27.2 bits (60), Expect = 4.8
Identities = 16/52 (30%), Positives = 27/52 (51%), Gaps = 6/52 (11%)
Query: 87 QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEK 138
AK+ +K +++ E+ + E + V E++ EE+K KKEEEE
Sbjct: 469 SAKQKDK------QSMQSMEKRLKSEADSRVNAEKQLAEEKKRKKEEEETAA 514
>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
This family includes NS2 proteins from other members of
the Orbivirus genus. NS2 is a non-specific
single-stranded RNA-binding protein that forms large
homomultimers and accumulates in viral inclusion bodies
of infected cells. Three RNA binding regions have been
identified in Bluetongue virus serotype 17 at residues
2-11, 153-166 and 274-286. NS2 multimers also possess
nucleotidyl phosphatase activity. The precise function
of NS2 is not known, but it may be involved in the
transport and condensation of viral mRNAs.
Length = 363
Score = 27.2 bits (60), Expect = 4.9
Identities = 11/35 (31%), Positives = 21/35 (60%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKE 137
++ + KE+E EE V+ + +E+E E E+ E +
Sbjct: 238 RRRDLRKEQELEENVERDSDDEDEHGEDSEDGETK 272
>gnl|CDD|131743 TIGR02696, pppGpp_PNP, guanosine pentaphosphate synthetase
I/polynucleotide phosphorylase. Sohlberg, et al present
characterization of two proteins from Streptomyces
coelicolor. The protein in this family was shown to have
poly(A) polymerase activity and may be responsible for
polyadenylating RNA in this species. Reference 2 showed
that a nearly identical plasmid-encoded protein from
Streptomyces antibioticus is a bifunctional enzyme that
acts also as a guanosine pentaphosphate synthetase.
Length = 719
Score = 27.1 bits (60), Expect = 4.9
Identities = 21/80 (26%), Positives = 34/80 (42%), Gaps = 13/80 (16%)
Query: 88 AKETEKAGVL--YEDNIKKEEEEKEEEEEEEV------KEEEKEEEEEKEKKEEEEKEKM 139
AK T + + Y+D++ + E ++E +E E+ +E K + E+
Sbjct: 248 AKPTGEFPLFPDYQDDVYEAVEGAVKDELSAALTIAGKQEREEALDEVKALVAAKLAEQF 307
Query: 140 EEEEKE-----EEETKYLVR 154
E EKE TK LVR
Sbjct: 308 EGREKEISAAYRAVTKKLVR 327
>gnl|CDD|236449 PRK09282, PRK09282, pyruvate carboxylase subunit B; Validated.
Length = 592
Score = 27.1 bits (61), Expect = 5.0
Identities = 17/67 (25%), Positives = 30/67 (44%), Gaps = 10/67 (14%)
Query: 100 DNIKKEEEEKEEEEEE---------EVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
+ +KE EE + E+E ++ ++ EE E E K E E ++ E T+
Sbjct: 408 EKARKEAEELGKSEKEDVLTYALFPQIAKKFLEEREAGELKPEPEPKEAAAAGAEGIPTE 467
Query: 151 YLVRDVH 157
+ V +V
Sbjct: 468 FKV-EVD 473
>gnl|CDD|236498 PRK09401, PRK09401, reverse gyrase; Reviewed.
Length = 1176
Score = 27.2 bits (61), Expect = 5.2
Identities = 12/40 (30%), Positives = 20/40 (50%)
Query: 91 TEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEK 130
+ G E + E+E+ E+ + E EK EE+E+E
Sbjct: 840 KLENGGGLELEGEFSEKEEAEKFYNNLIEVEKVEEKEEEL 879
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
with the myosin-like domain [Function unknown].
Length = 499
Score = 26.9 bits (59), Expect = 5.2
Identities = 11/61 (18%), Positives = 23/61 (37%), Gaps = 2/61 (3%)
Query: 88 AKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
++GV D+I+ + E + EK E + + E +K +E +
Sbjct: 65 LNRNLRSGVFQLDDIRPQLRALRTELG--TAQGEKRAAETEREAARSELQKARQEREAVR 122
Query: 148 E 148
+
Sbjct: 123 Q 123
>gnl|CDD|219405 pfam07418, PCEMA1, Acidic phosphoprotein precursor PCEMA1. This
family consists of several acidic phosphoprotein
precursor PCEMA1 sequences which appear to be found
exclusively in Plasmodium chabaudi. PCEMA1 is an antigen
that is associated with the membrane of the infected
erythrocyte throughout the entire intraerythrocytic
cycle. The exact function of this family is unclear.
Length = 286
Score = 26.8 bits (59), Expect = 5.2
Identities = 18/52 (34%), Positives = 28/52 (53%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
DN KE+ +E E + K E E E E ++E E+E EE+E E++ +
Sbjct: 235 NDNEVKEKIRREYREWKGDKANTNETEIEDESEDEYEEEAGEEQENEDKGEE 286
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 26.8 bits (60), Expect = 5.4
Identities = 15/46 (32%), Positives = 29/46 (63%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
+ ++EE E++E+E+ +EK +E++E+ EE E+ EE E+
Sbjct: 95 KMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELER 140
>gnl|CDD|234701 PRK00252, alaS, alanyl-tRNA synthetase; Reviewed.
Length = 865
Score = 27.0 bits (61), Expect = 5.5
Identities = 15/42 (35%), Positives = 19/42 (45%)
Query: 115 EEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDV 156
EE+KE EKE E+ K K + + KE K L V
Sbjct: 726 EELKELEKELEQLKAKLAAAAAGDLLAQAKEVNGVKVLAAQV 767
>gnl|CDD|227891 COG5604, COG5604, Uncharacterized conserved protein [Function
unknown].
Length = 523
Score = 27.2 bits (60), Expect = 5.6
Identities = 11/74 (14%), Positives = 28/74 (37%)
Query: 84 LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
+ E + ++N KE+ K + + EE+E+ E K +K+ +
Sbjct: 37 NRNSHTENKMESGTNDNNKNKEKLSKLYSDVDSSSSEEEEDGSESISKLNVNSKKISLNQ 96
Query: 144 KEEEETKYLVRDVH 157
++ + + +
Sbjct: 97 VSTQKWRKELDLLA 110
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 26.9 bits (60), Expect = 5.7
Identities = 6/33 (18%), Positives = 19/33 (57%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEE 135
++++ E+++ E + E+E+ E E+++
Sbjct: 1356 PRKKKSDSSSEDDDDSEVDDSEDEDDEDDEDDD 1388
>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
[Transcription].
Length = 392
Score = 27.0 bits (59), Expect = 5.7
Identities = 18/55 (32%), Positives = 32/55 (58%)
Query: 94 AGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
G +N + E +KE+++EE E +EE + ++ +E +EK E++E EE E
Sbjct: 279 VGAAEIENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENEENE 333
Score = 26.6 bits (58), Expect = 8.1
Identities = 16/65 (24%), Positives = 32/65 (49%)
Query: 86 DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
+ + ++ + KEE + + +E ++EE +E EE E+ E +++ E EK
Sbjct: 290 SEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENEENERHTELLADELNELEKG 349
Query: 146 EEETK 150
EE +
Sbjct: 350 IEEKR 354
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 27.0 bits (60), Expect = 5.7
Identities = 21/53 (39%), Positives = 32/53 (60%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
+KE EKE +E +E + + KE+ + EK+ E K EE E+E EE + +RD
Sbjct: 827 EKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRD 879
>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
includes archaeal L12p, the protein that is functionally
equivalent to L7/L12 in bacteria and the P1 and P2
proteins in eukaryotes. L12p is homologous to P1 and P2
but is not homologous to bacterial L7/L12. It is located
in the L12 stalk, with proteins L10, L11, and 23S rRNA.
L12p is the only protein in the ribosome to occur as
multimers, always appearing as sets of dimers. Recent
data indicate that most archaeal species contain six
copies of L12p (three homodimers), while eukaryotes have
four copies (two heterodimers), and bacteria may have
four or six copies (two or three homodimers), depending
on the species. The organization of proteins within the
stalk has been characterized primarily in bacteria,
where L7/L12 forms either two or three homodimers and
each homodimer binds to the extended C-terminal helix of
L10. L7/L12 is attached to the ribosome through L10 and
is the only ribosomal protein that does not directly
interact with rRNA. Archaeal L12p is believed to
function in a similar fashion. However, hybrid ribosomes
containing the large subunit from E. coli with an
archaeal stalk are able to bind archaeal and eukaryotic
elongation factors but not bacterial elongation factors.
In several mesophilic and thermophilic archaeal species,
the binding of 23S rRNA to protein L11 and to the
L10/L12p pentameric complex was found to be
temperature-dependent and cooperative.
Length = 106
Score = 25.9 bits (57), Expect = 5.7
Identities = 17/25 (68%), Positives = 21/25 (84%)
Query: 118 KEEEKEEEEEKEKKEEEEKEKMEEE 142
EEK EE+E+EKK+EEEKE+ EEE
Sbjct: 73 AAEEKAEEKEEEKKKEEEKEEEEEE 97
Score = 25.2 bits (55), Expect = 9.6
Identities = 16/25 (64%), Positives = 21/25 (84%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKE 129
EEK EE+EEE K+EE++EEEE+E
Sbjct: 73 AAEEKAEEKEEEKKKEEEKEEEEEE 97
>gnl|CDD|237628 PRK14156, PRK14156, heat shock protein GrpE; Provisional.
Length = 177
Score = 26.3 bits (58), Expect = 5.8
Identities = 23/50 (46%), Positives = 29/50 (58%), Gaps = 2/50 (4%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEE--EEKEEEETKYL 152
EE K EE EEEV+ E EE E+ +E EK ++E E +E E KYL
Sbjct: 2 SEEIKNEEVEEEVEATETEETVEEVVEETPEKSELELANERADEFENKYL 51
>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
This family consists of several Nucleopolyhedrovirus
capsid protein P87 sequences. P87 is expressed late in
infection and concentrated in infected cell nuclei.
Length = 606
Score = 26.8 bits (59), Expect = 5.9
Identities = 10/33 (30%), Positives = 22/33 (66%)
Query: 104 KEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEK 136
E +E+E++ E+E + E+E++++ EE+K
Sbjct: 365 PLPEYSSDEDEDDSDEDEVDYEKERKRRREEDK 397
>gnl|CDD|223097 COG0018, ArgS, Arginyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 577
Score = 26.8 bits (60), Expect = 6.0
Identities = 12/55 (21%), Positives = 26/55 (47%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLV 153
+ E + E ++ ++ +E+ E++ EE EK+E ++E E + V
Sbjct: 184 ALGLTPEPDGYLGEYYVKIAKDLEEDPGNDEEEAREEVEKLESGDEEAELWRKFV 238
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 26.6 bits (59), Expect = 6.2
Identities = 16/48 (33%), Positives = 33/48 (68%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
K E +EE ++ ++E E ++ +++++K EEE+ K ++++EEEE K
Sbjct: 170 KPERNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEEEERK 217
>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
Length = 306
Score = 26.7 bits (59), Expect = 6.2
Identities = 21/63 (33%), Positives = 33/63 (52%)
Query: 86 DQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKE 145
D A E + + + +++E EV+ E EEEE E+ EE+E+E+ EEE +E
Sbjct: 42 DDAIELAEPETSDDPYGNPDPFGEDDEGRIEVRISEDGEEEEVEEGEEDEEEEGEEESEE 101
Query: 146 EEE 148
E
Sbjct: 102 FEP 104
>gnl|CDD|184860 PRK14858, tatA, twin arginine translocase protein A; Provisional.
Length = 108
Score = 26.0 bits (57), Expect = 6.3
Identities = 18/60 (30%), Positives = 31/60 (51%)
Query: 90 ETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEET 149
E +KA ++ ++++E EE+E+ E E K+E E E K EE++ + E T
Sbjct: 38 EFKKATDDFKQSMQEESRTAEEKEKAEKLAETKKEAEAPEAKAEEDQAPKPKGAGEPPAT 97
>gnl|CDD|235668 PRK06007, fliF, flagellar MS-ring protein; Reviewed.
Length = 542
Score = 26.8 bits (60), Expect = 6.6
Identities = 17/63 (26%), Positives = 30/63 (47%)
Query: 84 LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
+++ E E A +E + E ++E EEEK EEE K + + ++ +E+
Sbjct: 467 PEEELAEEEAAAEEAALEEDEEAALEVELSDDEELEEEKAEEELKYEDLLKRLRELAKED 526
Query: 144 KEE 146
EE
Sbjct: 527 PEE 529
>gnl|CDD|237046 PRK12297, obgE, GTPase CgtA; Reviewed.
Length = 424
Score = 26.6 bits (60), Expect = 6.6
Identities = 17/48 (35%), Positives = 21/48 (43%), Gaps = 1/48 (2%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
D + E EE E EEE E EEE K EEE++ E+
Sbjct: 316 DELLYAVAELLEETPEFPLEEE-EVEEEVYYKFEEEEKDFTITRDEDG 362
>gnl|CDD|129416 TIGR00316, cdhC, CO dehydrogenase/CO-methylating acetyl-CoA
synthase complex, beta subunit. Nomenclature follows
the description for Methanosarcina thermophila. The
CO-methylating acetyl-CoA synthase is considered the
defining enzyme of the Wood-Ljungdahl pathway, used for
acetate catabolism by sulfate reducing bacteria but for
acetate biosynthesis by acetogenic bacteria such as
oorella thermoacetica (f. Clostridium thermoaceticum)
[Energy metabolism, Chemoautotrophy].
Length = 458
Score = 26.7 bits (59), Expect = 6.7
Identities = 13/31 (41%), Positives = 18/31 (58%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKE 132
+K+ E +EEE EE +E + EE E E E
Sbjct: 396 VKRVVREVDEEEIEEEEEAMQPEEMEMEGFE 426
>gnl|CDD|184539 PRK14150, PRK14150, heat shock protein GrpE; Provisional.
Length = 193
Score = 26.4 bits (59), Expect = 6.7
Identities = 18/60 (30%), Positives = 32/60 (53%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHL 158
+ K +E+ EE E EV+++E+ +E E E + +E ++ E E + E + RD L
Sbjct: 3 NEEQKTPQEQVSEEIEAEVEQQEEADEAELEDELDEADARIAELEAQLAEAQAEERDSVL 62
>gnl|CDD|227900 COG5613, COG5613, Uncharacterized conserved protein [Function
unknown].
Length = 400
Score = 26.9 bits (59), Expect = 6.8
Identities = 17/59 (28%), Positives = 28/59 (47%), Gaps = 2/59 (3%)
Query: 85 DDQAKETEKAGVLYED-NIKKEEEEKEEEEEEEVKEEEK-EEEEEKEKKEEEEKEKMEE 141
Q E E A +L + +K + E + +K E ++K KK+EEE +K E+
Sbjct: 110 SPQQFEIELAKLLSLVESAQKVSKLLNSNEIYMDQNTKKMENNQQKIKKQEEEAKKAEK 168
>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2. Transcripts
harbouring premature signals for translation termination
are recognised and rapidly degraded by eukaryotic cells
through a pathway known as nonsense-mediated mRNA decay.
In Saccharomyces cerevisiae, three trans-acting factors
(Upf1 to Upf3) are required for nonsense-mediated mRNA
decay.
Length = 171
Score = 26.2 bits (58), Expect = 6.8
Identities = 16/50 (32%), Positives = 31/50 (62%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
++ + +E+E+ E +EEEV + E++EE + +EE+ +EEE + E
Sbjct: 13 DEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEEVDPEA 62
>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
Members of this family are associated with type VII
secretion of WXG100 family targets in the Firmicutes,
but not in the Actinobacteria. This highly divergent
protein family consists largely of a central region of
highly polar low-complexity sequence containing
occasional LF motifs in weak repeats about 17 residues
in length, flanked by hydrophobic N- and C-terminal
regions [Protein fate, Protein and peptide secretion and
trafficking].
Length = 150
Score = 26.2 bits (58), Expect = 6.9
Identities = 15/48 (31%), Positives = 27/48 (56%), Gaps = 2/48 (4%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEK--EKMEEEEKEEEETK 150
E+K+ E + +EE E ++E EE++K + +E+ E+EE K
Sbjct: 30 RYEKKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELK 77
Score = 25.8 bits (57), Expect = 8.3
Identities = 10/46 (21%), Positives = 25/46 (54%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
+ +I+ + +EE E + + EE+++ ++ +EK + EE +
Sbjct: 33 KKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKN 78
>gnl|CDD|220611 pfam10169, Laps, Learning-associated protein. This is a family of
121-amino acid secretory proteins. Laps functions in the
regulation of neuronal cell adhesion and/or movement and
synapse attachment. Laps binds to the ApC/EBP (Aplysia
CCAAT/enhancer binding protein) promoter and activates
the transcription of ApC/EBP mRNA.
Length = 124
Score = 26.0 bits (57), Expect = 7.1
Identities = 11/38 (28%), Positives = 21/38 (55%)
Query: 113 EEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETK 150
+ + ++ K+ K +E +EK+K E E EE++ K
Sbjct: 35 DGNVLMKDVKDIATVKPAEEVKEKKKKEGTESEEDDEK 72
>gnl|CDD|220112 pfam09110, HAND, HAND. The HAND domain adopts a secondary
structure consisting of four alpha helices, three of
which (H2, H3, H4) form an L-like configuration. Helix
H2 runs antiparallel to helices H3 and H4, packing
closely against helix H4, whilst helix H1 reposes in the
concave surface formed by these three helices and runs
perpendicular to them. The domain confers DNA and
nucleosome binding properties to the protein.
Length = 109
Score = 25.7 bits (57), Expect = 7.1
Identities = 16/45 (35%), Positives = 27/45 (60%)
Query: 100 DNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEK 144
D++ +EE EEE E + K E+++ + + EEEE+EK E+
Sbjct: 64 DDVGDGDEEDEEEREAKRKLEQEKIDNAEPLTEEEEEEKQRLLEE 108
>gnl|CDD|237478 PRK13709, PRK13709, conjugal transfer nickase/helicase TraI;
Provisional.
Length = 1747
Score = 26.7 bits (59), Expect = 7.4
Identities = 15/56 (26%), Positives = 26/56 (46%), Gaps = 5/56 (8%)
Query: 106 EEEKEEEEEEEVKEEEKEEEE---EKEKKEEEEKEKMEEEEKE--EEETKYLVRDV 156
E EV + + EE E E++ +E KM E + + + +T+ VRD+
Sbjct: 1632 GAGNGEPVTAEVLAQRQAEEAIRRETERRADEIVRKMAENKPDLPDGKTEQAVRDI 1687
>gnl|CDD|216546 pfam01516, Orbi_VP6, Orbivirus helicase VP6. The VP6 protein a
minor protein in the core of the virion is probably the
viral helicase.
Length = 322
Score = 26.6 bits (58), Expect = 7.5
Identities = 15/55 (27%), Positives = 31/55 (56%)
Query: 105 EEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLE 159
+ E E +E+E K+E+ +E+ + + E++++ +EE +E E + R H E
Sbjct: 26 DWFESGESKEKEPKDEDGQEQRISDGEGEQKQKGGKEESDKETEDASVDRRTHTE 80
>gnl|CDD|184732 PRK14539, PRK14539, 50S ribosomal protein L11/unknown domain fusion
protein; Provisional.
Length = 196
Score = 26.1 bits (57), Expect = 7.7
Identities = 16/63 (25%), Positives = 29/63 (46%), Gaps = 9/63 (14%)
Query: 95 GVLYE--DNIKKEEEEKEEEEEEEVKEEEKEE-------EEEKEKKEEEEKEKMEEEEKE 145
GVL E D++ K +EE + + + KE E + K + E + EE++
Sbjct: 133 GVLVEGYDDVVKAKEEAKAAAKAAALAKAKEASLKSAEAELKASKGKSIEVNVIGEEDEG 192
Query: 146 EEE 148
E++
Sbjct: 193 EKD 195
>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
This family consists of several Trypanosoma brucei
procyclic acidic repetitive protein (PARP) like
sequences. The procyclic acidic repetitive protein
(parp) genes of Trypanosoma brucei encode a small family
of abundant surface proteins whose expression is
restricted to the procyclic form of the parasite. They
are found at two unlinked loci, parpA and parpB;
transcription of both loci is developmentally regulated.
Length = 145
Score = 26.1 bits (56), Expect = 7.8
Identities = 24/61 (39%), Positives = 34/61 (55%)
Query: 87 QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
+ K + V +D + +++ EEEEE EEE EEE E E++ EEE E E E+E
Sbjct: 41 KGKGEKGTKVGADDTNGTDPDDEPEEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGEEEP 100
Query: 147 E 147
E
Sbjct: 101 E 101
>gnl|CDD|216295 pfam01093, Clusterin, Clusterin.
Length = 434
Score = 26.6 bits (59), Expect = 7.8
Identities = 18/38 (47%), Positives = 21/38 (55%)
Query: 111 EEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEE 148
E EEE K E+ K++KEE K E EEK EEE
Sbjct: 33 ERTEEEHKNLMSTLEKTKKEKEEALKLANEVEEKLEEE 70
>gnl|CDD|203444 pfam06424, PRP1_N, PRP1 splicing factor, N-terminal. This domain
is specific to the N-terminal part of the prp1 splicing
factor, which is involved in mRNA splicing (and possibly
also poly(A)+ RNA nuclear export and cell cycle
progression). This domain is specific to the N terminus
of the RNA splicing factor encoded by prp1. It is
involved in mRNA splicing and possibly also poly(A)and
RNA nuclear export and cell cycle progression.
Length = 131
Score = 25.7 bits (57), Expect = 8.1
Identities = 13/40 (32%), Positives = 29/40 (72%), Gaps = 2/40 (5%)
Query: 108 EKEEEEEEEVKE--EEKEEEEEKEKKEEEEKEKMEEEEKE 145
+ E+EE + + E +E+ +E K+++E++EKE++E+ +E
Sbjct: 57 DDEDEEADRIYESIDERMDERRKKRREQKEKEEIEKYREE 96
>gnl|CDD|214923 smart00937, PCRF, This domain is found in peptide chain release
factors.
Length = 116
Score = 25.5 bits (57), Expect = 8.1
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 107 EEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEE 142
E EEEE++E++E +EE EE E + EEE E++ +
Sbjct: 1 ELLEEEEDDEMRELAEEELEELEAELEEELEELLKL 36
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 26.2 bits (58), Expect = 8.2
Identities = 13/65 (20%), Positives = 30/65 (46%)
Query: 87 QAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
+ E +K + + + +E + EE + EE+ ++E + E E E + + + +
Sbjct: 217 ERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEPINKPVRPK 276
Query: 147 EETKY 151
+TK
Sbjct: 277 RKTKA 281
>gnl|CDD|223619 COG0545, FkpA, FKBP-type peptidyl-prolyl cis-trans isomerases 1
[Posttranslational modification, protein turnover,
chaperones].
Length = 205
Score = 26.2 bits (58), Expect = 8.3
Identities = 9/50 (18%), Positives = 20/50 (40%), Gaps = 1/50 (2%)
Query: 98 YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKK-EEEEKEKMEEEEKEE 146
K+ E + E+ +++ K + + E K +E+ KE+
Sbjct: 45 LALVQKEAALEALQRAEQRLQQAAKAKRAKDAAANAAEGKAFLEKNAKEK 94
>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
cryoelectron microscopy indicates that VP5 is a trimer
implying that there are 360 copies of VP5 per virion.
Length = 507
Score = 26.5 bits (59), Expect = 8.5
Identities = 16/53 (30%), Positives = 31/53 (58%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRD 155
+K +E + E++EEEV+E+ ++ EK ++ EE K + E + EE + +
Sbjct: 91 RKLKELEREQKEEEVREKHNKKIIEKFGEDLEEVYKFMKGEAKVEEEEEKQME 143
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 26.6 bits (59), Expect = 8.5
Identities = 21/67 (31%), Positives = 32/67 (47%)
Query: 84 LDDQAKETEKAGVLYEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEE 143
L++ E E+ E+ +K E KEE EE E EE E+ + ++ +EE E+
Sbjct: 311 LEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSAL 370
Query: 144 KEEEETK 150
EE E
Sbjct: 371 LEELEEL 377
>gnl|CDD|224553 COG1638, DctP, TRAP-type C4-dicarboxylate transport system,
periplasmic component [Carbohydrate transport and
metabolism].
Length = 332
Score = 26.1 bits (58), Expect = 8.9
Identities = 12/45 (26%), Positives = 21/45 (46%)
Query: 102 IKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
+ + +E EE+ + V+E E E E+ ++ E E E E
Sbjct: 258 LLEAAKEAAEEQRKLVEELEDELLEKLKEAGVEVVEPDAAEAFRE 302
>gnl|CDD|240377 PTZ00352, PTZ00352, 60S ribosomal protein L13; Provisional.
Length = 212
Score = 26.2 bits (58), Expect = 8.9
Identities = 15/88 (17%), Positives = 34/88 (38%), Gaps = 8/88 (9%)
Query: 71 QLVEGKP-NGLLCVLDDQAKETEKAGVLYEDNI---KKEEEEKEEEEEEEVKEEEKEEEE 126
KP G + D +KE A + ++ + + E+ + +EE+
Sbjct: 125 PRKGKKPKKGFAGIPADTSKEEVVALPVKQNKNSEVIPFQRTPKREKARVITKEERAFNA 184
Query: 127 ----EKEKKEEEEKEKMEEEEKEEEETK 150
+ K + K E++++ +E+ K
Sbjct: 185 YRTLRQAKLNAKFVGKKEKKKQAKEKKK 212
>gnl|CDD|189037 cd09867, PIN_FEN1, PIN domain of Flap Endonuclease-1, a
structure-specific, divalent-metal-ion dependent, 5'
nuclease and homologs. Flap endonuclease-1 (FEN1) is
involved in multiple DNA metabolic pathways, including
DNA replication processes (5' flap DNA endonuclease
activity and double stranded DNA 5'-exonuclease
activity) and DNA repair processes (long-patch base
excision repair) in eukaryotes and archaea. Interaction
between FEN1 and PCNA (Proliferating cell nuclear
antigen) is an essential prerequisite to FEN1's DNA
replication functionality and stimulates FEN1 nuclease
activity by 10-50 fold. FEN1 belongs to the
FEN1-EXO1-like family of structure-specific, 5'
nucleases. These nucleases contain a PIN (PilT N
terminus) domain with a helical arch/clamp region (I
domain) of variable length (approximately 45 residues in
FEN1 PIN domains) and a H3TH (helix-3-turn-helix)
domain, an atypical helix-hairpin-helix-2-like region.
Both the H3TH domain (not included here) and the helical
arch/clamp region are involved in DNA binding. Nucleases
within this group also have a carboxylate-rich active
site that is involved in binding essential divalent
metal ion cofactors (Mg2+/Mn2+). FEN1 has a C-terminal
extension containing residues forming the consensus
PIP-box - Qxx(M/L/I)xxF(Y/F) which serves to anchor FEN1
to PCNA.
Length = 261
Score = 25.9 bits (58), Expect = 9.0
Identities = 16/45 (35%), Positives = 25/45 (55%)
Query: 116 EVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHLES 160
E+K E E+ E+ ++ EE+ E+ +EE EE KY R V +
Sbjct: 86 ELKSGELEKRRERREEAEEKLEEAKEEGDAEEARKYAKRTVRVTK 130
>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34. This family represents
herpes virus protein U79 and cytomegalovirus early
phosphoprotein P34 (UL112).
Length = 238
Score = 26.0 bits (57), Expect = 9.3
Identities = 13/61 (21%), Positives = 30/61 (49%)
Query: 99 EDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHL 158
ED+ K +E+ ++++EE+ +E+K + ++E +E+ K+ + L
Sbjct: 173 EDSQKHKEDRRKKQEEKRRNDEDKRPGGGGGSSGGQSGLSTKDEPPKEKRQKHHDPERRL 232
Query: 159 E 159
E
Sbjct: 233 E 233
>gnl|CDD|237549 PRK13897, PRK13897, type IV secretion system component VirD4;
Provisional.
Length = 606
Score = 26.3 bits (58), Expect = 9.3
Identities = 7/30 (23%), Positives = 14/30 (46%)
Query: 98 YEDNIKKEEEEKEEEEEEEVKEEEKEEEEE 127
+ +E E E E EE+ +E+++
Sbjct: 577 NNNEASEETETITVPENNEEDEEDDDEDDD 606
>gnl|CDD|235185 PRK03980, PRK03980, flap endonuclease-1; Provisional.
Length = 292
Score = 25.9 bits (58), Expect = 9.5
Identities = 18/43 (41%), Positives = 24/43 (55%)
Query: 116 EVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYLVRDVHL 158
E+K EE EE E ++ EE+ E+ +EE EE KY R L
Sbjct: 38 ELKAEEIEERREVREEAEEKYEEAKEEGDLEEARKYAQRSSRL 80
>gnl|CDD|143630 cd07892, PolyPPase_VTC2-3_like, Polyphosphate(polyP) polymerase
domain of yeast vacuolar transport chaperone (VTC)
proteins VTC-2, and -3 , and similar proteins.
Saccharomyces cerevisiae VTC-1, -2, -3, and -4 comprise
the membrane-integral VTC complex. VTC-2, -3, and -4
contain polyP polymerase domains. S. cerevisiae
VTC-2,and -3 belong to this subgroup. For VTC4 it has
been shown that this domain generates polyP from ATP by
a phosphotransfer reaction releasing ADP. This activity
is metal ion-dependent. The ATP gamma phosphate may be
cleaved and then transferred to an acceptor phosphate to
form polyP. PolyP is ubiquitous. In prokaryotes, it is a
store of phosphate and energy. In eukaryotes, polyPs
have roles in bone calcification, and osmoregulation,
and in phosphate transport in the symbiosis of
mycorrhizal fungi and plants. This subgroup belongs to
the CYTH/triphosphate tunnel metalloenzyme (TTM)-like
superfamily, whose enzymes have a unique active site
located within an eight-stranded beta barrel.
Length = 303
Score = 26.2 bits (58), Expect = 9.6
Identities = 18/78 (23%), Positives = 36/78 (46%), Gaps = 7/78 (8%)
Query: 89 KETEKAGVLYEDNIKKEEEEKEEEEEEEVKE-------EEKEEEEEKEKKEEEEKEKMEE 141
K ++K + E E EE++ ++KE K + E+K +K E+ +E
Sbjct: 89 KLSDKPDIFVEKKTFDENTSSFEEDKLQLKEKYINGFIFGKYKFEKKLQKMEKRGADLEN 148
Query: 142 EEKEEEETKYLVRDVHLE 159
+K+ E + +R+ L+
Sbjct: 149 LKKDVENIQDFIRENKLQ 166
>gnl|CDD|224124 COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense
mechanisms].
Length = 733
Score = 26.3 bits (58), Expect = 9.9
Identities = 13/49 (26%), Positives = 22/49 (44%)
Query: 98 YEDNIKKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEE 146
+ E K +E+E + KE + ++ +EE E + EE KE
Sbjct: 392 LGKGREVVENAKFCPKEDEPGLKRKERVDVEDGPQEELIELISEEVKEG 440
>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
Length = 553
Score = 26.1 bits (58), Expect = 10.0
Identities = 11/50 (22%), Positives = 19/50 (38%)
Query: 103 KKEEEEKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEEETKYL 152
K+ EEEK +EE + + K + E M+ + K +
Sbjct: 20 KQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADDETMDPTQYRANRLKKV 69
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 26.2 bits (57), Expect = 10.0
Identities = 13/40 (32%), Positives = 26/40 (65%)
Query: 108 EKEEEEEEEVKEEEKEEEEEKEKKEEEEKEKMEEEEKEEE 147
EK + E K ++ +++E+KEK++E +K+K +E E +
Sbjct: 188 EKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFKS 227
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.302 0.127 0.343
Gapped
Lambda K H
0.267 0.0812 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 8,720,894
Number of extensions: 896263
Number of successful extensions: 27656
Number of sequences better than 10.0: 1
Number of HSP's gapped: 13840
Number of HSP's successfully gapped: 4275
Length of query: 163
Length of database: 10,937,602
Length adjustment: 89
Effective length of query: 74
Effective length of database: 6,990,096
Effective search space: 517267104
Effective search space used: 517267104
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 17 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (21.7 bits)
S2: 55 (24.8 bits)