RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy10931
(139 letters)
>gnl|CDD|238681 cd01385, MYSc_type_IX, Myosin motor domain, type IX myosins. Myosin
IX is a processive single-headed motor, which might play
a role in signalling. This catalytic (head) domain has
ATPase activity and belongs to the larger group of
P-loop NTPases. Myosins are actin-dependent molecular
motors that play important roles in muscle contraction,
cell motility, and organelle transport. The head domain
is a molecular motor, which utilizes ATP hydrolysis to
generate directed movement toward the plus end along
actin filaments. A cyclical interaction between myosin
and actin provides the driving force. Rates of ATP
hydrolysis and consequently the speed of movement along
actin filaments vary widely, from about 0.04 micrometer
per second for myosin I to 4.5 micrometer per second for
myosin II in skeletal muscle. Myosin II moves in
discrete steps about 5-10 nm long and generates 1-5
piconewtons of force. Upon ATP binding, the myosin head
dissociates from an actin filament. ATP hydrolysis
causes the head to pivot and associate with a new actin
subunit. The release of Pi causes the head to pivot and
move the filament (power stroke). Release of ADP
completes the cycle.
Length = 692
Score = 118 bits (298), Expect = 5e-32
Identities = 43/67 (64%), Positives = 54/67 (80%)
Query: 2 TFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLL 61
+FEQLCINYANE LQYYFNQH+F+ EQEEY EGI W +IE++DN C+QL KP GLL
Sbjct: 394 SFEQLCINYANEQLQYYFNQHIFKLEQEEYQGEGITWTNIEYTDNVGCIQLFSKKPTGLL 453
Query: 62 CVLDDQA 68
+LD+++
Sbjct: 454 YLLDEES 460
>gnl|CDD|214580 smart00242, MYSc, Myosin. Large ATPases. ATPase; molecular motor.
Muscle contraction consists of a cyclical interaction
between myosin and actin. The core of the myosin
structure is similar in fold to that of kinesin.
Length = 677
Score = 118 bits (297), Expect = 8e-32
Identities = 41/68 (60%), Positives = 53/68 (77%)
Query: 2 TFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLL 61
+FEQLCINYANE LQ +FNQHVF+ EQEEY +EGI W I+F DN C+ L+E KP G+L
Sbjct: 386 SFEQLCINYANEKLQQFFNQHVFKLEQEEYEREGIDWTFIDFFDNQDCIDLIEKKPPGIL 445
Query: 62 CVLDDQAK 69
+LD++ +
Sbjct: 446 SLLDEECR 453
>gnl|CDD|238071 cd00124, MYSc, Myosin motor domain. This catalytic (head) domain
has ATPase activity and belongs to the larger group of
P-loop NTPases. Myosins are actin-dependent molecular
motors that play important roles in muscle contraction,
cell motility, and organelle transport. The head domain
is a molecular motor, which utilizes ATP hydrolysis to
generate directed movement toward the plus end along
actin filaments. A cyclical interaction between myosin
and actin provides the driving force. Rates of ATP
hydrolysis and consequently the speed of movement along
actin filaments vary widely, from about 0.04 micrometer
per second for myosin I to 4.5 micrometer per second for
myosin II in skeletal muscle. Myosin II moves in
discrete steps about 5-10 nm long and generates 1-5
piconewtons of force. Upon ATP binding, the myosin head
dissociates from an actin filament. ATP hydrolysis
causes the head to pivot and associate with a new actin
subunit. The release of Pi causes the head to pivot and
move the filament (power stroke). Release of ADP
completes the cycle.
Length = 679
Score = 111 bits (281), Expect = 1e-29
Identities = 44/76 (57%), Positives = 55/76 (72%), Gaps = 2/76 (2%)
Query: 2 TFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLL 61
+FEQLCINYANE LQ +FNQHVF+ EQEEY +EGI W I+F+DN + L+E KP GLL
Sbjct: 379 SFEQLCINYANEKLQQFFNQHVFKLEQEEYQEEGIDWESIDFTDNQEVIDLIEKKPGGLL 438
Query: 62 CVLDDQ--AKKRRDEK 75
+LD++ K DE
Sbjct: 439 SLLDEECLFPKGTDET 454
>gnl|CDD|238677 cd01381, MYSc_type_VII, Myosin motor domain, type VII myosins.
Myosins in this group have been associated with
functions in sensory systems such as vision and hearing.
This catalytic (head) domain has ATPase activity and
belongs to the larger group of P-loop NTPases. Myosins
are actin-dependent molecular motors that play important
roles in muscle contraction, cell motility, and
organelle transport. The head domain is a molecular
motor, which utilizes ATP hydrolysis to generate
directed movement toward the plus end along actin
filaments. A cyclical interaction between myosin and
actin provides the driving force. Rates of ATP
hydrolysis and consequently the speed of movement along
actin filaments vary widely, from about 0.04 micrometer
per second for myosin I to 4.5 micrometer per second for
myosin II in skeletal muscle. Myosin II moves in
discrete steps about 5-10 nm long and generates 1-5
piconewtons of force. Upon ATP binding, the myosin head
dissociates from an actin filament. ATP hydrolysis
causes the head to pivot and associate with a new actin
subunit. The release of Pi causes the head to pivot and
move the filament (power stroke). Release of ADP
completes the cycle.
Length = 671
Score = 103 bits (258), Expect = 1e-26
Identities = 37/68 (54%), Positives = 52/68 (76%)
Query: 2 TFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLL 61
+FEQLCIN+ANE+LQ +F QH+F+ EQEEYN E I W+HIEF DN L L+ KP ++
Sbjct: 379 SFEQLCINFANENLQQFFVQHIFKLEQEEYNLEHINWQHIEFVDNQDALDLIAIKPLNIM 438
Query: 62 CVLDDQAK 69
++D+++K
Sbjct: 439 SLIDEESK 446
>gnl|CDD|215687 pfam00063, Myosin_head, Myosin head (motor domain).
Length = 679
Score = 103 bits (258), Expect = 1e-26
Identities = 38/69 (55%), Positives = 52/69 (75%), Gaps = 1/69 (1%)
Query: 2 TFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFS-DNTLCLQLVEGKPNGL 60
+FEQLCINYANE LQ +FN H+F+ EQEEY +EGI W I++ DN C+ L+E KP G+
Sbjct: 380 SFEQLCINYANEKLQQFFNHHMFKLEQEEYVREGIAWTFIDYGLDNQACIDLIEKKPPGI 439
Query: 61 LCVLDDQAK 69
L +LD++ +
Sbjct: 440 LSLLDEECR 448
>gnl|CDD|238680 cd01384, MYSc_type_XI, Myosin motor domain, plant-specific type XI
myosin, involved in organelle transport. This catalytic
(head) domain has ATPase activity and belongs to the
larger group of P-loop NTPases. Myosins are
actin-dependent molecular motors that play important
roles in muscle contraction, cell motility, and
organelle transport. The head domain is a molecular
motor, which utilizes ATP hydrolysis to generate
directed movement toward the plus end along actin
filaments. A cyclical interaction between myosin and
actin provides the driving force. Rates of ATP
hydrolysis and consequently the speed of movement along
actin filaments vary widely, from about 0.04 micrometer
per second for myosin I to 4.5 micrometer per second for
myosin II in skeletal muscle. Myosin II moves in
discrete steps about 5-10 nm long and generates 1-5
piconewtons of force. Upon ATP binding, the myosin head
dissociates from an actin filament. ATP hydrolysis
causes the head to pivot and associate with a new actin
subunit. The release of Pi causes the head to pivot and
move the filament (power stroke). Release of ADP
completes the cycle.
Length = 674
Score = 102 bits (256), Expect = 3e-26
Identities = 38/64 (59%), Positives = 46/64 (71%)
Query: 3 FEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLLC 62
FEQ CIN NE LQ +FNQHVF+ EQEEY KE I W +IEF DN L L+E KP G++
Sbjct: 385 FEQFCINLTNEKLQQHFNQHVFKMEQEEYTKEEIDWSYIEFVDNQDVLDLIEKKPGGIIA 444
Query: 63 VLDD 66
+LD+
Sbjct: 445 LLDE 448
>gnl|CDD|238676 cd01380, MYSc_type_V, Myosin motor domain, type V myosins. Myosins
V transport a variety of intracellular cargo
processively along actin filaments, such as membraneous
organelles and mRNA. This catalytic (head) domain has
ATPase activity and belongs to the larger group of
P-loop NTPases. Myosins are actin-dependent molecular
motors that play important roles in muscle contraction,
cell motility, and organelle transport. The head domain
is a molecular motor, which utilizes ATP hydrolysis to
generate directed movement toward the plus end along
actin filaments. A cyclical interaction between myosin
and actin provides the driving force. Rates of ATP
hydrolysis and consequently the speed of movement along
actin filaments vary widely, from about 0.04 micrometer
per second for myosin I to 4.5 micrometer per second for
myosin II in skeletal muscle. Myosin II moves in
discrete steps about 5-10 nm long and generates 1-5
piconewtons of force. Upon ATP binding, the myosin head
dissociates from an actin filament. ATP hydrolysis
causes the head to pivot and associate with a new actin
subunit. The release of Pi causes the head to pivot and
move the filament (power stroke). Release of ADP
completes the cycle.
Length = 691
Score = 102 bits (255), Expect = 4e-26
Identities = 40/67 (59%), Positives = 49/67 (73%), Gaps = 1/67 (1%)
Query: 3 FEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLLC 62
FEQ CINYANE LQ FNQHVF+ EQEEY KEGI W I+F DN C+ L+E K G+L
Sbjct: 386 FEQFCINYANEKLQQQFNQHVFKLEQEEYLKEGIEWTFIDFYDNQPCIDLIESKL-GILS 444
Query: 63 VLDDQAK 69
+LD++ +
Sbjct: 445 LLDEECR 451
>gnl|CDD|238683 cd01387, MYSc_type_XV, Myosin motor domain, type XV myosins. In
vertebrates, myosin XV appears to be expressed in
sensory tissue and play a role in hearing. This
catalytic (head) domain has ATPase activity and belongs
to the larger group of P-loop NTPases. Myosins are
actin-dependent molecular motors that play important
roles in muscle contraction, cell motility, and
organelle transport. The head domain is a molecular
motor, which utilizes ATP hydrolysis to generate
directed movement toward the plus end along actin
filaments. A cyclical interaction between myosin and
actin provides the driving force. Rates of ATP
hydrolysis and consequently the speed of movement along
actin filaments vary widely, from about 0.04 micrometer
per second for myosin I to 4.5 micrometer per second for
myosin II in skeletal muscle. Myosin II moves in
discrete steps about 5-10 nm long and generates 1-5
piconewtons of force. Upon ATP binding, the myosin head
dissociates from an actin filament. ATP hydrolysis
causes the head to pivot and associate with a new actin
subunit. The release of Pi causes the head to pivot and
move the filament (power stroke). Release of ADP
completes the cycle.
Length = 677
Score = 97.3 bits (242), Expect = 2e-24
Identities = 39/66 (59%), Positives = 49/66 (74%)
Query: 2 TFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLL 61
+FEQLCINYANE+LQY FN+ VFQ EQEEY +E + W I F+DN + L+ KP G+L
Sbjct: 378 SFEQLCINYANENLQYLFNKIVFQEEQEEYIREQLDWTEIAFADNQPVINLISLKPYGIL 437
Query: 62 CVLDDQ 67
+LDDQ
Sbjct: 438 RILDDQ 443
>gnl|CDD|238679 cd01383, MYSc_type_VIII, Myosin motor domain, plant-specific type
VIII myosins, a subgroup which has been associated with
endocytosis, cytokinesis, cell-to-cell coupling and
gating at plasmodesmata. This catalytic (head) domain
has ATPase activity and belongs to the larger group of
P-loop NTPases. Myosins are actin-dependent molecular
motors that play important roles in muscle contraction,
cell motility, and organelle transport. The head domain
is a molecular motor, which utilizes ATP hydrolysis to
generate directed movement toward the plus end along
actin filaments. A cyclical interaction between myosin
and actin provides the driving force. Rates of ATP
hydrolysis and consequently the speed of movement along
actin filaments vary widely, from about 0.04 micrometer
per second for myosin I to 4.5 micrometer per second for
myosin II in skeletal muscle. Myosin II moves in
discrete steps about 5-10 nm long and generates 1-5
piconewtons of force. Upon ATP binding, the myosin head
dissociates from an actin filament. ATP hydrolysis
causes the head to pivot and associate with a new actin
subunit. The release of Pi causes the head to pivot and
move the filament (power stroke). Release of ADP
completes the cycle.
Length = 677
Score = 94.5 bits (235), Expect = 2e-23
Identities = 39/67 (58%), Positives = 51/67 (76%)
Query: 2 TFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLL 61
+FEQ CINYANE LQ +FN+H+F+ EQEEY ++GI W +EF DN CL L E KP GLL
Sbjct: 382 SFEQFCINYANERLQQHFNRHLFKLEQEEYEEDGIDWTKVEFEDNQECLDLFEKKPLGLL 441
Query: 62 CVLDDQA 68
+LD+++
Sbjct: 442 SLLDEES 448
>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
Length = 1463
Score = 88.2 bits (219), Expect = 3e-21
Identities = 39/67 (58%), Positives = 50/67 (74%), Gaps = 1/67 (1%)
Query: 3 FEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGK-PNGLL 61
FEQLCINY NE LQ +FNQH+F+ EQEEY KEGI W I++ DN C+ L+E K P G+L
Sbjct: 446 FEQLCINYTNEKLQQFFNQHMFKLEQEEYVKEGIEWSFIDYFDNQPCIDLIEKKNPLGIL 505
Query: 62 CVLDDQA 68
+LD++
Sbjct: 506 SLLDEEC 512
>gnl|CDD|238675 cd01379, MYSc_type_III, Myosin motor domain, type III myosins.
Myosin III has been shown to play a role in the vision
process in insects and in hearing in mammals. Myosin
III, an unconventional myosin, does not form dimers.
This catalytic (head) domain has ATPase activity and
belongs to the larger group of P-loop NTPases. Myosins
are actin-dependent molecular motors that play important
roles in muscle contraction, cell motility, and
organelle transport. The head domain is a molecular
motor, which utilizes ATP hydrolysis to generate
directed movement toward the plus end along actin
filaments. A cyclical interaction between myosin and
actin provides the driving force. Rates of ATP
hydrolysis and consequently the speed of movement along
actin filaments vary widely, from about 0.04 micrometer
per second for myosin I to 4.5 micrometer per second for
myosin II in skeletal muscle. Myosin II moves in
discrete steps about 5-10 nm long and generates 1-5
piconewtons of force. Upon ATP binding, the myosin head
dissociates from an actin filament. ATP hydrolysis
causes the head to pivot and associate with a new actin
subunit. The release of Pi causes the head to pivot and
move the filament (power stroke). Release of ADP
completes the cycle.
Length = 653
Score = 88.0 bits (218), Expect = 3e-21
Identities = 36/68 (52%), Positives = 50/68 (73%)
Query: 2 TFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLL 61
+FEQLCIN ANE +QYYFNQH+F +EQ+EY EG+ R +E+ DN L + KP GLL
Sbjct: 390 SFEQLCINIANEQIQYYFNQHIFAWEQQEYLNEGVDARLVEYEDNRPLLDMFLQKPLGLL 449
Query: 62 CVLDDQAK 69
+LD++++
Sbjct: 450 ALLDEESR 457
>gnl|CDD|238673 cd01377, MYSc_type_II, Myosin motor domain, type II myosins. Myosin
II mediates cortical contraction in cell motility, and
is the motor in smooth and skeletal muscle. This
catalytic (head) domain has ATPase activity and belongs
to the larger group of P-loop NTPases. Myosins are
actin-dependent molecular motors that play important
roles in muscle contraction, cell motility, and
organelle transport. The head domain is a molecular
motor, which utilizes ATP hydrolysis to generate
directed movement toward the plus end along actin
filaments. A cyclical interaction between myosin and
actin provides the driving force. Rates of ATP
hydrolysis and consequently the speed of movement along
actin filaments vary widely, from about 0.04 micrometer
per second for myosin I to 4.5 micrometer per second for
myosin II in skeletal muscle. Myosin II moves in
discrete steps about 5-10 nm long and generates 1-5
piconewtons of force. Upon ATP binding, the myosin head
dissociates from an actin filament. ATP hydrolysis
causes the head to pivot and associate with a new actin
subunit. The release of Pi causes the head to pivot and
move the filament (power stroke). Release of ADP
completes the cycle.
Length = 693
Score = 85.7 bits (213), Expect = 2e-20
Identities = 35/66 (53%), Positives = 45/66 (68%), Gaps = 1/66 (1%)
Query: 3 FEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFS-DNTLCLQLVEGKPNGLL 61
FEQLCINY NE LQ +FN H+F EQEEY +EGI W I+F D + L+E P G+L
Sbjct: 392 FEQLCINYTNEKLQQFFNHHMFVLEQEEYQREGIEWTFIDFGLDLQPTIDLIEKNPMGIL 451
Query: 62 CVLDDQ 67
+LD++
Sbjct: 452 SLLDEE 457
>gnl|CDD|238674 cd01378, MYSc_type_I, Myosin motor domain, type I myosins. Myosin I
generates movement at the leading edge in cell motility,
and class I myosins have been implicated in phagocytosis
and vesicle transport. Myosin I, an unconventional
myosin, does not form dimers. This catalytic (head)
domain has ATPase activity and belongs to the larger
group of P-loop NTPases. Myosins are actin-dependent
molecular motors that play important roles in muscle
contraction, cell motility, and organelle transport. The
head domain is a molecular motor, which utilizes ATP
hydrolysis to generate directed movement toward the plus
end along actin filaments. A cyclical interaction
between myosin and actin provides the driving force.
Rates of ATP hydrolysis and consequently the speed of
movement along actin filaments vary widely, from about
0.04 micrometer per second for myosin I to 4.5
micrometer per second for myosin II in skeletal muscle.
Myosin II moves in discrete steps about 5-10 nm long and
generates 1-5 piconewtons of force. Upon ATP binding,
the myosin head dissociates from an actin filament. ATP
hydrolysis causes the head to pivot and associate with a
new actin subunit. The release of Pi causes the head to
pivot and move the filament (power stroke). Release of
ADP completes the cycle.
Length = 674
Score = 84.6 bits (210), Expect = 4e-20
Identities = 33/65 (50%), Positives = 43/65 (66%), Gaps = 1/65 (1%)
Query: 3 FEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGK-PNGLL 61
FEQ CINY NE LQ F + + EQEEY +EGI+W IE+ +N + L+EGK P G+
Sbjct: 384 FEQFCINYVNEKLQQIFIELTLKAEQEEYVREGIKWTPIEYFNNKIVCDLIEGKRPPGIF 443
Query: 62 CVLDD 66
+LDD
Sbjct: 444 SILDD 448
>gnl|CDD|238678 cd01382, MYSc_type_VI, Myosin motor domain, type VI myosins. Myosin
VI is a monomeric myosin, which moves towards the
minus-end of actin filaments, in contrast to most other
myosins. It has been implicated in endocytosis,
secretion, and cell migration. This catalytic (head)
domain has ATPase activity and belongs to the larger
group of P-loop NTPases. Myosins are actin-dependent
molecular motors that play important roles in muscle
contraction, cell motility, and organelle transport. The
head domain is a molecular motor, which utilizes ATP
hydrolysis to generate directed movement toward the
minus end along actin filaments. A cyclical interaction
between myosin and actin provides the driving force.
Rates of ATP hydrolysis and consequently the speed of
movement along actin filaments vary widely, from about
0.04 micrometer per second for myosin I to 4.5
micrometer per second for myosin II in skeletal muscle.
Myosin II moves in discrete steps about 5-10 nm long and
generates 1-5 piconewtons of force. Upon ATP binding,
the myosin head dissociates from an actin filament. ATP
hydrolysis causes the head to pivot and associate with a
new actin subunit. The release of Pi causes the head to
pivot and move the filament (power stroke). Release of
ADP completes the cycle.
Length = 717
Score = 81.8 bits (202), Expect = 4e-19
Identities = 30/68 (44%), Positives = 46/68 (67%)
Query: 2 TFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLL 61
+FEQ CINY NE LQ +FN+ + + EQE Y +EG+ + + DN C+ L+E K NG+L
Sbjct: 414 SFEQFCINYCNEKLQQFFNERILKEEQELYQREGLGVNEVHYVDNQDCIDLIEAKLNGIL 473
Query: 62 CVLDDQAK 69
+LD++ +
Sbjct: 474 DILDEENR 481
>gnl|CDD|240229 PTZ00014, PTZ00014, myosin-A; Provisional.
Length = 821
Score = 57.0 bits (138), Expect = 2e-10
Identities = 30/74 (40%), Positives = 40/74 (54%), Gaps = 2/74 (2%)
Query: 4 EQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLLCV 63
EQL IN NE LQ F VF+ E + Y EGI +E++ N + L+ GK +L +
Sbjct: 481 EQLFINITNEMLQKNFVDIVFERESKLYKDEGISTEELEYTSNESVIDLLCGKGKSVLSI 540
Query: 64 LDDQ--AKKRRDEK 75
L+DQ A DEK
Sbjct: 541 LEDQCLAPGGTDEK 554
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 46.1 bits (109), Expect = 1e-06
Identities = 29/96 (30%), Positives = 49/96 (51%), Gaps = 5/96 (5%)
Query: 27 EQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLLCVLDDQAKKRRDEKEEKEGGREGGE 86
E E + G + H S+ V+ + ++Q + ++DEK G +GG
Sbjct: 802 EHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEK-----GVDGGG 856
Query: 87 EMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEE 122
G + EE+E EE EEEEE+++EE+ +++E EE
Sbjct: 857 GSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 45.4 bits (107), Expect = 2e-06
Identities = 28/84 (33%), Positives = 37/84 (44%), Gaps = 1/84 (1%)
Query: 56 KPNGLLCVLDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKA 115
+P + L D +K E E G R G E + E GE E GE E+E + E K
Sbjct: 619 RPVAKVMALGDLSKGDVAEAEHT-GERTGEEGERPTEAEGENGEESGGEAEQEGETETKG 677
Query: 116 KKKEQEEEGGGGGGEQEKEEEREG 139
+ + + E GEQE E E E
Sbjct: 678 ENESEGEIPAERKGEQEGEGEIEA 701
Score = 44.6 bits (105), Expect = 4e-06
Identities = 22/76 (28%), Positives = 38/76 (50%), Gaps = 2/76 (2%)
Query: 65 DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQ--EE 122
+ K E E + G ++ E + + E + E GE+E + + +AK+ E+ +
Sbjct: 795 EGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDG 854
Query: 123 EGGGGGGEQEKEEERE 138
GG GG+ E+EEE E
Sbjct: 855 GGGSDGGDSEEEEEEE 870
Score = 41.9 bits (98), Expect = 3e-05
Identities = 26/78 (33%), Positives = 39/78 (50%), Gaps = 3/78 (3%)
Query: 65 DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
+ + + E E E G E +E +G+ E E E E +E E + E +A+ KE E+EG
Sbjct: 722 EAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEG 781
Query: 125 ---GGGGGEQEKEEEREG 139
G GE + +E EG
Sbjct: 782 EIQAGEDGEMKGDEGAEG 799
Score = 41.5 bits (97), Expect = 4e-05
Identities = 23/79 (29%), Positives = 35/79 (44%), Gaps = 8/79 (10%)
Query: 66 DQAKKRRDEKE-----EKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQ 120
+ +R+ E+E E + GE + E+ E E E E+E E + E+ ++ E
Sbjct: 684 EIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVED 743
Query: 121 EEEGGGGGGEQEKEEEREG 139
E EG G E E EG
Sbjct: 744 EGEGEAEGKH---EVETEG 759
Score = 39.6 bits (92), Expect = 2e-04
Identities = 28/77 (36%), Positives = 39/77 (50%), Gaps = 2/77 (2%)
Query: 65 DDQAKKRRDEKEEKEGGRE--GGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEE 122
+ +AK+ + E + E G E +G E+ GE + EEGEE E++ + E K E E
Sbjct: 698 EIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVET 757
Query: 123 EGGGGGGEQEKEEEREG 139
EG E E E E EG
Sbjct: 758 EGDRKETEHEGETEAEG 774
Score = 36.9 bits (85), Expect = 0.002
Identities = 20/74 (27%), Positives = 35/74 (47%)
Query: 66 DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG 125
+ +R E E + G GGE + E + + EGE E+K E++ + + + +E
Sbjct: 646 GEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEAD 705
Query: 126 GGGGEQEKEEEREG 139
G + +E E EG
Sbjct: 706 HKGETEAEEVEHEG 719
Score = 36.9 bits (85), Expect = 0.002
Identities = 20/56 (35%), Positives = 34/56 (60%), Gaps = 7/56 (12%)
Query: 83 EGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEERE 138
E ++ KG + G D + EEEEE+++EE+ +++E+EE E+E+EE E
Sbjct: 844 EAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEE-------EEEEEENEE 892
Score = 36.5 bits (84), Expect = 0.002
Identities = 23/71 (32%), Positives = 33/71 (46%), Gaps = 1/71 (1%)
Query: 67 QAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVE-EGEEEEEKKKEEKAKKKEQEEEGG 125
+A+ E+ E +EG E KG+ E+ E E +GE+E E + E K + E E
Sbjct: 654 EAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAE 713
Query: 126 GGGGEQEKEEE 136
E E E E
Sbjct: 714 EVEHEGETEAE 724
Score = 36.5 bits (84), Expect = 0.002
Identities = 19/75 (25%), Positives = 34/75 (45%)
Query: 65 DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
D + + +E E + G E +G+ E GEE E E E E E + + + + + +E
Sbjct: 705 DHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKET 764
Query: 125 GGGGGEQEKEEEREG 139
G + + +E E
Sbjct: 765 EHEGETEAEGKEDED 779
Score = 36.1 bits (83), Expect = 0.004
Identities = 18/65 (27%), Positives = 26/65 (40%)
Query: 74 EKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEK 133
E E K G+ + + E E E +E E+E E + E + K E G E E
Sbjct: 748 EAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGET 807
Query: 134 EEERE 138
E +
Sbjct: 808 EAGEK 812
Score = 35.7 bits (82), Expect = 0.005
Identities = 20/74 (27%), Positives = 33/74 (44%)
Query: 66 DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG 125
+A++ + + + E EG + K E E E+E E + + + E + + E E E
Sbjct: 665 GEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAE 724
Query: 126 GGGGEQEKEEEREG 139
G E E E EG
Sbjct: 725 GTEDEGEIETGEEG 738
Score = 35.4 bits (81), Expect = 0.006
Identities = 21/65 (32%), Positives = 31/65 (47%), Gaps = 1/65 (1%)
Query: 74 EKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEK 133
E + KE EG E +GKE+ E E++ GE+ E K E K E E E G ++ +
Sbjct: 758 EGDRKETEHEGETEAEGKEDEDE-GEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHE 816
Query: 134 EEERE 138
+
Sbjct: 817 GQSET 821
Score = 35.4 bits (81), Expect = 0.007
Identities = 17/62 (27%), Positives = 32/62 (51%)
Query: 74 EKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEK 133
+++E EG + GE+ + K + G E +VE E E +K+E + E + + E +
Sbjct: 775 KEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGE 834
Query: 134 EE 135
+E
Sbjct: 835 QE 836
Score = 34.2 bits (78), Expect = 0.013
Identities = 23/64 (35%), Positives = 31/64 (48%), Gaps = 2/64 (3%)
Query: 75 KEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKE 134
+ E EG E KG++E E E +E + + E + EE + E E EG GE E
Sbjct: 678 ENESEGEIPA--ERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETG 735
Query: 135 EERE 138
EE E
Sbjct: 736 EEGE 739
Score = 32.7 bits (74), Expect = 0.046
Identities = 19/74 (25%), Positives = 38/74 (51%)
Query: 65 DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
+ + E +++ G +E E +G+ + E+ G + +EE+ +++E+EEE
Sbjct: 818 QSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEE 877
Query: 125 GGGGGEQEKEEERE 138
E+E+EEE E
Sbjct: 878 EEEEEEEEEEEENE 891
Score = 30.3 bits (68), Expect = 0.31
Identities = 21/68 (30%), Positives = 33/68 (48%), Gaps = 3/68 (4%)
Query: 74 EKEEKEGGREGGEEMKGKEE---NGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGE 130
+E +E EG E +GK E G+ E E E E + KE++ + + Q E G G+
Sbjct: 735 GEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGD 794
Query: 131 QEKEEERE 138
+ E + E
Sbjct: 795 EGAEGKVE 802
>gnl|CDD|238682 cd01386, MYSc_type_XVIII, Myosin motor domain, type XVIII myosins.
This catalytic (head) domain has ATPase activity and
belongs to the larger group of P-loop NTPases. Myosins
are actin-dependent molecular motors that play important
roles in muscle contraction, cell motility, and
organelle transport. The head domain is a molecular
motor, which utilizes ATP hydrolysis to generate
directed movement toward the plus end along actin
filaments. A cyclical interaction between myosin and
actin provides the driving force. Rates of ATP
hydrolysis and consequently the speed of movement along
actin filaments vary widely, from about 0.04 micrometer
per second for myosin I to 4.5 micrometer per second for
myosin II in skeletal muscle. Myosin II moves in
discrete steps about 5-10 nm long and generates 1-5
piconewtons of force. Upon ATP binding, the myosin head
dissociates from an actin filament. ATP hydrolysis
causes the head to pivot and associate with a new actin
subunit. The release of Pi causes the head to pivot and
move the filament (power stroke). Release of ADP
completes the cycle.
Length = 767
Score = 43.3 bits (102), Expect = 1e-05
Identities = 25/83 (30%), Positives = 34/83 (40%), Gaps = 13/83 (15%)
Query: 2 TFEQLCINYANEHLQYYFNQHVFQYEQEEYNKEGIRWRHIEFSDNTLCLQLVEGKPNGLL 61
TFE+LC NY E LQ F+ F E Y +EG+ +EF L E P +
Sbjct: 396 TFEELCHNYLQERLQLLFHHRTFVQPLERYAEEGVE---VEFD-------LAEPSPGTTV 445
Query: 62 CVLD---DQAKKRRDEKEEKEGG 81
++D Q + E G
Sbjct: 446 ALVDQAPQQVVVPAGLRAEDARG 468
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 40.7 bits (96), Expect = 9e-05
Identities = 22/62 (35%), Positives = 39/62 (62%), Gaps = 2/62 (3%)
Query: 64 LDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
++ KKR +EK+EK+ K +EE EE E +E E+EEE+++ E+ K++E+E++
Sbjct: 416 VEKAEKKREEEKKEKKKKAF--AGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKK 473
Query: 124 GG 125
Sbjct: 474 KK 475
Score = 39.5 bits (93), Expect = 2e-04
Identities = 18/63 (28%), Positives = 37/63 (58%)
Query: 67 QAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGG 126
+ + EK+ +E +E ++ ++ EE+E E+ ++EEEK++EE+ ++E+EEE
Sbjct: 413 KKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEK 472
Query: 127 GGG 129
Sbjct: 473 KKK 475
Score = 34.9 bits (81), Expect = 0.008
Identities = 26/66 (39%), Positives = 41/66 (62%), Gaps = 3/66 (4%)
Query: 69 KKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGG 128
KK ++ E+K E +E K K G++ E EE EEE+EKK+EEK +++E+ EE
Sbjct: 413 KKIVEKAEKKR--EEEKKEKKKKAFAGKKKEEEE-EEEKEKKEEEKEEEEEEAEEEKEEE 469
Query: 129 GEQEKE 134
E++K+
Sbjct: 470 EEKKKK 475
Score = 34.5 bits (80), Expect = 0.012
Identities = 21/66 (31%), Positives = 37/66 (56%)
Query: 74 EKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEK 133
+ +K E K +EE E+ + +++E+++EE+ +KKE+E+E E+EK
Sbjct: 407 KATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEK 466
Query: 134 EEEREG 139
EEE E
Sbjct: 467 EEEEEK 472
Score = 33.7 bits (78), Expect = 0.020
Identities = 21/67 (31%), Positives = 35/67 (52%), Gaps = 1/67 (1%)
Query: 73 DEKEEKE-GGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQ 131
E+E + G + + K E + EE ++E++KK KK+E+EEE E+
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454
Query: 132 EKEEERE 138
++EEE E
Sbjct: 455 KEEEEEE 461
>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
Length = 746
Score = 38.2 bits (89), Expect = 7e-04
Identities = 29/99 (29%), Positives = 46/99 (46%), Gaps = 21/99 (21%)
Query: 48 LC--LQLVEG---KPNGL-----LCVLD---DQAKKRRDEKEEKEGGREGG-------EE 87
LC L LVEG + N L L +LD + +++E+ G EE
Sbjct: 181 LCLDLDLVEGLLERKNELPHLKKLIILDTLIKSKEININKEEKNNGSNVNNNGNKNNKEE 240
Query: 88 MKGKEENGEEDEVEEGEEEEEKKKEEKAKK-KEQEEEGG 125
KG + + E +++ G E +K+K EK K KE+ ++ G
Sbjct: 241 QKGNDLSNELEDISLGPLEYDKEKLEKIKDLKEKAKKLG 279
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 36.2 bits (83), Expect = 0.004
Identities = 16/60 (26%), Positives = 34/60 (56%)
Query: 69 KKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGG 128
++R E++ + +E+ +ED+ +E ++EEE+++EE+ K + E+E GG
Sbjct: 134 RRRARHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGG 193
Score = 31.9 bits (72), Expect = 0.10
Identities = 18/57 (31%), Positives = 32/57 (56%)
Query: 82 REGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEERE 138
R EE +N D+ +E E+E++ +E+ +++E+EEE G E E++E E
Sbjct: 138 RHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGE 194
Score = 27.3 bits (60), Expect = 3.9
Identities = 21/80 (26%), Positives = 36/80 (45%)
Query: 57 PNGLLCVLDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAK 116
P + DD + D+ +E++ E EE + K + E++E E GE+ +K E
Sbjct: 147 PRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYEKSEVDKT 206
Query: 117 KKEQEEEGGGGGGEQEKEEE 136
+ E G G +K +E
Sbjct: 207 DCFKFIEAGAGDDVFDKIDE 226
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 35.3 bits (81), Expect = 0.006
Identities = 27/75 (36%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 69 KKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG--- 125
++ +EE+ E EE + +EE E +E EEGE+EEE+++ E E+E EG
Sbjct: 434 MASQESEEEESVEEEEEEEEEEEEEEQESEE-EEGEDEEEEEEVEADNGSEEEMEGSSEG 492
Query: 126 -GGGGEQEKEEEREG 139
G G E E++ ER
Sbjct: 493 DGDGEEPEEDAERRN 507
Score = 31.8 bits (72), Expect = 0.095
Identities = 23/72 (31%), Positives = 34/72 (47%)
Query: 67 QAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGG 126
Q R K G +E+ EE+ VEE EEEEE+++EE+ + +E+E E
Sbjct: 412 QGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEE 471
Query: 127 GGGEQEKEEERE 138
E E + E
Sbjct: 472 EEEEVEADNGSE 483
Score = 26.8 bits (59), Expect = 4.4
Identities = 19/74 (25%), Positives = 38/74 (51%), Gaps = 1/74 (1%)
Query: 67 QAKKRR-DEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG 125
Q ++R+ + + + +E E EE EEEE+++EE+ +++++ EE
Sbjct: 407 QERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEE 466
Query: 126 GGGGEQEKEEEREG 139
G E+E+E E +
Sbjct: 467 GEDEEEEEEVEADN 480
Score = 26.4 bits (58), Expect = 5.6
Identities = 16/74 (21%), Positives = 38/74 (51%)
Query: 65 DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
+++ ++ +++ E+E G + EE + + +NG E+E+E E + +E + + + E
Sbjct: 451 EEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEM 510
Query: 125 GGGGGEQEKEEERE 138
G E ++ R
Sbjct: 511 AGISRMSEGQQPRG 524
>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho;
Provisional.
Length = 672
Score = 34.1 bits (79), Expect = 0.013
Identities = 17/75 (22%), Positives = 31/75 (41%)
Query: 65 DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
QA++RR+ E G G++ E EEE++ E + + ++ +
Sbjct: 124 AAQARERRERGEAARRGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQA 183
Query: 125 GGGGGEQEKEEEREG 139
GE+ + EER
Sbjct: 184 EAERGERGRREERGR 198
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 32.6 bits (75), Expect = 0.038
Identities = 16/55 (29%), Positives = 30/55 (54%), Gaps = 3/55 (5%)
Query: 66 DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQ 120
++ + E+E +E +E EE K +E + ++ EE++K EEK +KK+
Sbjct: 269 EEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSP---EEQRKLEEKERKKQA 320
Score = 32.6 bits (75), Expect = 0.048
Identities = 16/38 (42%), Positives = 24/38 (63%)
Query: 101 EEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEERE 138
EE +EE ++KKEEK K++ + + EQ K EE+E
Sbjct: 278 EERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKE 315
>gnl|CDD|223079 PHA03419, PHA03419, E4 protein; Provisional.
Length = 200
Score = 32.6 bits (74), Expect = 0.042
Identities = 22/73 (30%), Positives = 32/73 (43%), Gaps = 5/73 (6%)
Query: 69 KKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKA--KKKEQEEEGGG 126
KK+ +K+E E +GGE ++ E EG E E+ E+ E E EGG
Sbjct: 93 KKKEKKKKETEKPAQGGE---KPDQGPEAKGEGEGHEPEDPPPEDTPPPPGGEGEVEGGP 149
Query: 127 GGGEQEKEEEREG 139
G ++EG
Sbjct: 150 SPGPGPGPLDQEG 162
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 32.7 bits (74), Expect = 0.045
Identities = 22/56 (39%), Positives = 27/56 (48%), Gaps = 4/56 (7%)
Query: 70 KRRDEKEEKEGGREGGEEMKGKEENGEEDEVE-EGEEEEEKKKEEKAKKKEQEEEG 124
K+R+E EK RE E K +EE E E E E E E E++ E AK E
Sbjct: 579 KKREEAVEK-AKREA--EQKAREEREREKEKEKEREREREREAERAAKASSSSHES 631
Score = 32.4 bits (73), Expect = 0.063
Identities = 17/43 (39%), Positives = 29/43 (67%), Gaps = 8/43 (18%)
Query: 97 EDEVEEGEEE-EEKKKEEKAKKKEQEEEGGGGGGEQEKEEERE 138
E+ VE+ + E E+K +EE+ ++KE+E+E +E+E ERE
Sbjct: 582 EEAVEKAKREAEQKAREEREREKEKEKE-------RERERERE 617
Score = 31.2 bits (70), Expect = 0.15
Identities = 17/49 (34%), Positives = 29/49 (59%), Gaps = 7/49 (14%)
Query: 89 KGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEER 137
K +EE E+ + E ++ E+++ EK K+KE+E E+E+E ER
Sbjct: 579 KKREEAVEKAKREAEQKAREEREREKEKEKERER-------EREREAER 620
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of
the RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 32.0 bits (73), Expect = 0.069
Identities = 13/48 (27%), Positives = 22/48 (45%)
Query: 92 EENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEEREG 139
E +EDE EE E+ ++ +EE+ + E+E G E +
Sbjct: 367 FEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGS 414
Score = 30.1 bits (68), Expect = 0.30
Identities = 16/69 (23%), Positives = 31/69 (44%)
Query: 69 KKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGG 128
KRR + + +E + +E+ +E E EEGE+ EE+ + + + G
Sbjct: 356 DKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSD 415
Query: 129 GEQEKEEER 137
E + ++E
Sbjct: 416 SESKADKES 424
Score = 29.3 bits (66), Expect = 0.57
Identities = 16/71 (22%), Positives = 26/71 (36%), Gaps = 5/71 (7%)
Query: 66 DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG 125
D + DE EE+E + +EE GE+ E E + E+ E + E
Sbjct: 366 DFEEVDEDEDEEEE-----QRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKA 420
Query: 126 GGGGEQEKEEE 136
+ + E
Sbjct: 421 DKESASDSDSE 431
Score = 28.9 bits (65), Expect = 0.84
Identities = 14/52 (26%), Positives = 23/52 (44%)
Query: 87 EMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEERE 138
+ + EE + +E EEEE++ E + ++ E EE G E E
Sbjct: 359 RARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSS 410
Score = 27.8 bits (62), Expect = 2.1
Identities = 17/72 (23%), Positives = 32/72 (44%), Gaps = 1/72 (1%)
Query: 64 LDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
+ D+ + R D + +E + EE + + + EE+E E EEE + E + +
Sbjct: 354 MRDKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEE-GEDSEEEGSQSREDGSSESSSDV 412
Query: 124 GGGGGGEQEKEE 135
G + +KE
Sbjct: 413 GSDSESKADKES 424
Score = 26.2 bits (58), Expect = 6.4
Identities = 15/59 (25%), Positives = 30/59 (50%), Gaps = 1/59 (1%)
Query: 64 LDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEV-EEGEEEEEKKKEEKAKKKEQE 121
+D+ +++R ++ E+E G + EE E+G + + G + E K +E A + E
Sbjct: 373 DEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSDSE 431
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 32.2 bits (73), Expect = 0.070
Identities = 15/35 (42%), Positives = 21/35 (60%)
Query: 88 MKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEE 122
E E+ + E E E +KKKEEKAK+KE ++
Sbjct: 1 GSRTESEAEKKILTEEELERKKKKEEKAKEKELKK 35
Score = 26.8 bits (59), Expect = 5.3
Identities = 19/64 (29%), Positives = 33/64 (51%), Gaps = 3/64 (4%)
Query: 74 EKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEK 133
E E+K E E K KEE +E E+++ + +KE KAK + Q+ G ++ +
Sbjct: 7 EAEKKILTEEELERKKKKEEKAKEKELKK---LKAAQKEAKAKLQAQQASDGTNVPKKSE 63
Query: 134 EEER 137
++ R
Sbjct: 64 KKSR 67
>gnl|CDD|240327 PTZ00243, PTZ00243, ABC transporter; Provisional.
Length = 1560
Score = 32.4 bits (74), Expect = 0.071
Identities = 10/52 (19%), Positives = 18/52 (34%), Gaps = 2/52 (3%)
Query: 79 EGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGE 130
+E K +E + EV E + + ++EG GG+
Sbjct: 875 ATLAAELKENKDSKEGDADAEVAEVDAAPG--GAVDHEPPVAKQEGNAEGGD 924
Score = 27.1 bits (60), Expect = 3.6
Identities = 13/59 (22%), Positives = 21/59 (35%), Gaps = 8/59 (13%)
Query: 68 AKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGG 126
A + KE K+ K + + E EV+ + AK++ E G G
Sbjct: 875 ATLAAELKENKDS--------KEGDADAEVAEVDAAPGGAVDHEPPVAKQEGNAEGGDG 925
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 32.0 bits (73), Expect = 0.076
Identities = 20/48 (41%), Positives = 30/48 (62%)
Query: 76 EEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
EE+E +E +E + K + EE+ EE E+EE+KKK +K K+ E E
Sbjct: 38 EEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWE 85
Score = 28.2 bits (63), Expect = 1.6
Identities = 16/41 (39%), Positives = 28/41 (68%)
Query: 97 EDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEER 137
E E E +EEEE++KEEK +++E+ + E+E++EE+
Sbjct: 30 EVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEK 70
Score = 26.3 bits (58), Expect = 7.7
Identities = 17/47 (36%), Positives = 28/47 (59%)
Query: 91 KEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEER 137
KE E + EE EE+EEKK+EE+ ++EE E++K++ +
Sbjct: 29 KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTK 75
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 31.8 bits (73), Expect = 0.083
Identities = 14/45 (31%), Positives = 22/45 (48%), Gaps = 3/45 (6%)
Query: 85 GEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGG 129
EE+K EEEE+++EE+ +++ EEE G G
Sbjct: 285 DEELKEVLSA---QAQAAAAEEEEEEEEEEEEEEPSEEEAAAGLG 326
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 31.7 bits (72), Expect = 0.10
Identities = 14/54 (25%), Positives = 31/54 (57%)
Query: 64 LDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKK 117
+ + +DE + E +E K K+E E+ + E+ +E++EK+++++ KK
Sbjct: 381 YTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434
Score = 29.8 bits (67), Expect = 0.40
Identities = 12/52 (23%), Positives = 27/52 (51%)
Query: 75 KEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGG 126
K+ E E + + + EE +E E+++K++ ++ K+K Q++E
Sbjct: 383 KKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434
Score = 27.5 bits (61), Expect = 3.0
Identities = 13/62 (20%), Positives = 31/62 (50%)
Query: 77 EKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEE 136
G + E +K + ++ + GE ++E E+A+ K +EE+ E++++E+
Sbjct: 360 TDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQ 419
Query: 137 RE 138
+
Sbjct: 420 AD 421
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 31.4 bits (72), Expect = 0.11
Identities = 20/73 (27%), Positives = 32/73 (43%), Gaps = 1/73 (1%)
Query: 64 LDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
L D+ E+EE E G E K K + ++ E+EE +K K K + ++
Sbjct: 387 LHDKPLLAEGEEEEGENGNLSPAERK-KLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKK 445
Query: 124 GGGGGGEQEKEEE 136
G GE +K +
Sbjct: 446 AKGPDGETKKVDP 458
>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
Length = 701
Score = 31.6 bits (71), Expect = 0.13
Identities = 21/47 (44%), Positives = 30/47 (63%)
Query: 77 EKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
EKE E E+ K +E+GEE +VEE +E +E KK++ K KE +E
Sbjct: 212 EKEVTDEDEEDTKKADEDGEEPKVEEVKEGDEGKKKKTKKVKEVTKE 258
>gnl|CDD|212055 cd11486, SLC5sbd_SGLT1, Na(+)/glucose cotransporter SGLT1;solute
binding domain. Human SGLT1 (hSGLT1) is a
high-affinity/low-capacity glucose transporter, which
can also transport galactose. In the transport
mechanism, two Na+ ions first bind to the extracellular
side of the transporter and induce a conformational
change in the glucose binding site. This results in an
increased affinity for glucose. A second conformational
change in the transporter follows, bringing the Na+ and
glucose binding sites to the inner surface of the
membrane. Glucose is then released, followed by the Na+
ions. In the process, hSGLT1 is also able to transport
water and urea and may be a major pathway for transport
of these across the intestinal brush-border membrane.
hSGLT1 is encoded by the SLC5A1 gene and expressed
mostly in the intestine, but also in the trachea,
kidney, heart, brain, testis, and prostate. The
WHO/UNICEF oral rehydration solution (ORS) for the
treatment of secretory diarrhea contains salt and
glucose. The glucose, along with sodium ions, is
transported by hSGLT1 and water is either co-transported
along with these or follows by osmosis. Mutations in
SGLT1 are associated with intestinal glucose galactose
malabsorption (GGM). Up-regulation of intestinal SGLT1
may protect against enteric infections. SGLT1 is
expressed in colorectal, head and neck, and prostate
tumors. Epidermal growth factor receptor (EGFR)
functions in cell survival by stabilizing SGLT1, and
thereby maintaining intracellular glucose levels. SGLT1
is predicted to have 14 membrane-spanning regions. This
subgroup belongs to the solute carrier 5
(SLC5)transporter family.
Length = 635
Score = 31.4 bits (71), Expect = 0.14
Identities = 17/66 (25%), Positives = 26/66 (39%), Gaps = 21/66 (31%)
Query: 72 RDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKK--------------EEKAKK 117
R+ EE+ ++ + +EDE E +EE KK + KA K
Sbjct: 538 RNSTEERI-------DLDADDWTEDEDENEMETDEERKKPGCCRKAYNWFCGFDQGKAPK 590
Query: 118 KEQEEE 123
+EEE
Sbjct: 591 LTEEEE 596
Score = 30.6 bits (69), Expect = 0.26
Identities = 10/59 (16%), Positives = 22/59 (37%), Gaps = 14/59 (23%)
Query: 94 NGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG--------------GGGGEQEKEEERE 138
+ EE + ++ E + E + + E+ ++ G G + +EEE
Sbjct: 540 STEERIDLDADDWTEDEDENEMETDEERKKPGCCRKAYNWFCGFDQGKAPKLTEEEEAA 598
>gnl|CDD|218747 pfam05786, Cnd2, Condensin complex subunit 2. This family consists
of several Barren protein homologues from several
eukaryotic organisms. In Drosophila Barren (barr) is
required for sister-chromatid segregation in mitosis.
barr encodes a novel protein that is present in
proliferating cells and has homologues in yeast and
human. Mitotic defects in barr embryos become apparent
during cycle 16, resulting in a loss of PNS and CNS
neurons. Centromeres move apart at the
metaphase-anaphase transition and Cyclin B is degraded,
but sister chromatids remain connected, resulting in
chromatin bridging. Barren protein localises to
chromatin throughout mitosis. Colocalisation and
biochemical experiments indicate that Barren associates
with Topoisomerase II throughout mitosis and alters the
activity of Topoisomerase II. It has been suggested that
this association is required for proper chromosomal
segregation by facilitating the decatenation of
chromatids at anaphase. This family forms one of the
three non-structural maintenance of chromosomes (SMC)
subunits of the mitotic condensation complex along with
Cnd1 and Cnd3.
Length = 719
Score = 31.3 bits (71), Expect = 0.15
Identities = 21/48 (43%), Positives = 25/48 (52%), Gaps = 9/48 (18%)
Query: 73 DEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQ 120
D KE E GREG ED+ EEG + EE ++KAKKK Q
Sbjct: 153 DNKEAPETGREGD---------DGEDDDEEGSDGEEDGAKKKAKKKRQ 191
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 30.9 bits (70), Expect = 0.16
Identities = 17/59 (28%), Positives = 30/59 (50%)
Query: 67 QAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG 125
++ + +E KE +E +++ E E EE EEE E++ EE + + E+E E
Sbjct: 55 DDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEK 113
Score = 30.1 bits (68), Expect = 0.25
Identities = 22/73 (30%), Positives = 37/73 (50%), Gaps = 1/73 (1%)
Query: 65 DDQAKKRRDEKEEKEGGREGGEEMK-GKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
+ DE+E K+ + E++ KEE E E+ E++ + +KE++ ++E EEE
Sbjct: 39 SPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEE 98
Query: 124 GGGGGGEQEKEEE 136
E EKE E
Sbjct: 99 DEESSDENEKETE 111
Score = 28.2 bits (63), Expect = 1.2
Identities = 18/70 (25%), Positives = 32/70 (45%)
Query: 69 KKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGG 128
+ +E+E + +E EE EE++ E++E K + + + +E EEE
Sbjct: 40 PSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEED 99
Query: 129 GEQEKEEERE 138
E E E+E
Sbjct: 100 EESSDENEKE 109
Score = 26.6 bits (59), Expect = 4.6
Identities = 17/60 (28%), Positives = 30/60 (50%)
Query: 65 DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
++ K+ E E + G+ K EE+ EE+E E+ E +E +KE + K + E+
Sbjct: 62 IEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVEKE 121
>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 926
Score = 31.1 bits (70), Expect = 0.17
Identities = 11/66 (16%), Positives = 22/66 (33%)
Query: 58 NGLLCVLDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKK 117
+ R +++ KE G E E E ++ EEK + A
Sbjct: 782 VNTAVLSTTIKAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAIL 841
Query: 118 KEQEEE 123
++++
Sbjct: 842 NVKKKK 847
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 30.8 bits (70), Expect = 0.17
Identities = 21/79 (26%), Positives = 39/79 (49%), Gaps = 7/79 (8%)
Query: 65 DDQAKKRRDEKEEKEGG--REGGEEMKGKEENGEEDEVEEGEEEE-----EKKKEEKAKK 117
+D+ K++R E++ + ++ +D+ E E EK K+E+A++
Sbjct: 102 EDEIKRKRIEEDARNSDADDSDSSSDSDSSDDDSDDDDSEDETAALLRELEKIKKERAEE 161
Query: 118 KEQEEEGGGGGGEQEKEEE 136
KE+EEE E+ +EEE
Sbjct: 162 KEREEEEKAAEEEKAREEE 180
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 31.1 bits (71), Expect = 0.17
Identities = 12/80 (15%), Positives = 33/80 (41%), Gaps = 3/80 (3%)
Query: 58 NGLLCVLDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKK 117
+ + + DD +E+ + E + ++++ +E+ EE EE + + +E
Sbjct: 95 SDQVVIFDDG---DIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDD 151
Query: 118 KEQEEEGGGGGGEQEKEEER 137
+ +++ E+ R
Sbjct: 152 DDDDDDIATRERSLERRRRR 171
Score = 30.0 bits (68), Expect = 0.36
Identities = 10/65 (15%), Positives = 32/65 (49%), Gaps = 6/65 (9%)
Query: 65 DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEE------KKKEEKAKKK 118
D A + +E ++ + + +E K E+ E+D+ ++ +++ +++ + + +
Sbjct: 116 YDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLERRRRRREWE 175
Query: 119 EQEEE 123
E+ E
Sbjct: 176 EKRAE 180
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 30.9 bits (69), Expect = 0.20
Identities = 24/74 (32%), Positives = 44/74 (59%)
Query: 65 DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
+++ K + +K+E E ++ E K +EEN + E + EE+KKK E+AKK E++E+
Sbjct: 1630 EEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKK 1689
Query: 125 GGGGGEQEKEEERE 138
++E EE ++
Sbjct: 1690 AAEALKKEAEEAKK 1703
Score = 30.5 bits (68), Expect = 0.31
Identities = 22/74 (29%), Positives = 43/74 (58%)
Query: 65 DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
++ KK + K+ +E ++ E +K + E ++ E + +E EEKKK E+ KK E+E +
Sbjct: 1671 EEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKI 1730
Query: 125 GGGGGEQEKEEERE 138
++E EE+++
Sbjct: 1731 KAEEAKKEAEEDKK 1744
Score = 29.7 bits (66), Expect = 0.50
Identities = 22/74 (29%), Positives = 41/74 (55%)
Query: 65 DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
+++ K +K E+E + EE K EE+ ++ E + EE+EKK E KK+ +E +
Sbjct: 1644 EEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKK 1703
Query: 125 GGGGGEQEKEEERE 138
++E EE+++
Sbjct: 1704 AEELKKKEAEEKKK 1717
Score = 29.7 bits (66), Expect = 0.55
Identities = 19/71 (26%), Positives = 35/71 (49%)
Query: 66 DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG 125
D+AKK+ +E ++ + ++ EE K K + ++ E + E K E +A E E
Sbjct: 1305 DEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEE 1364
Query: 126 GGGGEQEKEEE 136
++K+EE
Sbjct: 1365 KAEAAEKKKEE 1375
Score = 29.0 bits (64), Expect = 0.88
Identities = 22/68 (32%), Positives = 37/68 (54%)
Query: 70 KRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGG 129
K +E ++ E ++ EE K EE+ ++ +E EE KK E+ KKKE EE+
Sbjct: 1662 KAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEEL 1721
Query: 130 EQEKEEER 137
++ +EE +
Sbjct: 1722 KKAEEENK 1729
Score = 29.0 bits (64), Expect = 1.1
Identities = 23/88 (26%), Positives = 49/88 (55%)
Query: 51 QLVEGKPNGLLCVLDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKK 110
+ E + ++ + +++ K + +E ++ E + EE+K EE ++ E + +E EEKK
Sbjct: 1588 KAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKK 1647
Query: 111 KEEKAKKKEQEEEGGGGGGEQEKEEERE 138
K E+ KK E+E + ++ EE+++
Sbjct: 1648 KAEELKKAEEENKIKAAEEAKKAEEDKK 1675
Score = 28.6 bits (63), Expect = 1.1
Identities = 22/86 (25%), Positives = 41/86 (47%)
Query: 53 VEGKPNGLLCVLDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKE 112
+ K L ++ KK + K+ +E + E K E ++ E + EE++KK
Sbjct: 1631 EKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKA 1690
Query: 113 EKAKKKEQEEEGGGGGGEQEKEEERE 138
+A KKE EE ++++ EE++
Sbjct: 1691 AEALKKEAEEAKKAEELKKKEAEEKK 1716
Score = 28.6 bits (63), Expect = 1.4
Identities = 24/78 (30%), Positives = 42/78 (53%), Gaps = 4/78 (5%)
Query: 65 DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKK----EQ 120
++AKK ++K++ E ++ E+ K E +++ E + EE KKKE + KKK ++
Sbjct: 1664 AEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKK 1723
Query: 121 EEEGGGGGGEQEKEEERE 138
EE E+ K+E E
Sbjct: 1724 AEEENKIKAEEAKKEAEE 1741
Score = 28.2 bits (62), Expect = 1.9
Identities = 21/74 (28%), Positives = 42/74 (56%)
Query: 65 DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
+D+ K K+E E ++ E K + E ++ E + EEE K K E+AKK+ +E++
Sbjct: 1685 EDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKK 1744
Query: 125 GGGGGEQEKEEERE 138
++++EE+++
Sbjct: 1745 KAEEAKKDEEEKKK 1758
Score = 27.8 bits (61), Expect = 2.2
Identities = 18/56 (32%), Positives = 32/56 (57%)
Query: 66 DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQE 121
++AKK+ D ++K ++ +E K K E ++ E + KKK ++AKKK +E
Sbjct: 1374 EEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEE 1429
Score = 27.0 bits (59), Expect = 3.9
Identities = 19/73 (26%), Positives = 44/73 (60%)
Query: 66 DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG 125
++AKK+ DE ++ ++ +E K EE + DE ++ EE ++ + +KA++K++ +E
Sbjct: 1493 EEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELK 1552
Query: 126 GGGGEQEKEEERE 138
++ EE+++
Sbjct: 1553 KAEELKKAEEKKK 1565
Score = 27.0 bits (59), Expect = 4.1
Identities = 22/70 (31%), Positives = 39/70 (55%)
Query: 69 KKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGG 128
KK +EK++ E ++ EE K K E +++ E+ ++ EE KK+E+ KKK +
Sbjct: 1709 KKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEK 1768
Query: 129 GEQEKEEERE 138
+E +E+E
Sbjct: 1769 KAEEIRKEKE 1778
Score = 27.0 bits (59), Expect = 4.5
Identities = 18/71 (25%), Positives = 37/71 (52%)
Query: 66 DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG 125
D+AKK+ +E ++ + ++ EE K K + ++ + + +E KK EE K E ++
Sbjct: 1473 DEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEE 1532
Query: 126 GGGGEQEKEEE 136
++ K+ E
Sbjct: 1533 AKKADEAKKAE 1543
Score = 27.0 bits (59), Expect = 4.7
Identities = 18/71 (25%), Positives = 35/71 (49%)
Query: 66 DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG 125
++AKK+ D ++K + E E DE E EE+ E +++K + K++ +
Sbjct: 1325 EEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAK 1384
Query: 126 GGGGEQEKEEE 136
E++K +E
Sbjct: 1385 KKAEEKKKADE 1395
Score = 26.6 bits (58), Expect = 6.7
Identities = 20/71 (28%), Positives = 37/71 (52%)
Query: 66 DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG 125
++AKK +++++ + E K E ++ E EE ++ EE KK E+ K + EE
Sbjct: 1678 EEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKK 1737
Query: 126 GGGGEQEKEEE 136
+++K EE
Sbjct: 1738 EAEEDKKKAEE 1748
Score = 26.3 bits (57), Expect = 7.3
Identities = 18/55 (32%), Positives = 33/55 (60%)
Query: 66 DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQ 120
D+AKK+ +E ++ E ++ EE K +E ++ E + +E +KK EE KK ++
Sbjct: 1447 DEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADE 1501
>gnl|CDD|221775 pfam12794, MscS_TM, Mechanosensitive ion channel inner membrane
domain 1. The small mechanosensitive channel, MscS, is
a part of the turgor-driven solute efflux system that
protects bacteria from lysis in the event of osmotic
shock. The MscS protein alone is sufficient to form a
functional mechanosensitive channel gated directly by
tension in the lipid bilayer. The MscS proteins are
heptamers of three transmembrane subunits with seven
converging M3 domains, and this domain is one of the
inner membrane domains.
Length = 339
Score = 30.7 bits (70), Expect = 0.21
Identities = 8/28 (28%), Positives = 13/28 (46%)
Query: 109 KKKEEKAKKKEQEEEGGGGGGEQEKEEE 136
K+ E ++ +EEE G + EE
Sbjct: 265 AKRAEILAQRAEEEEESSEGAAETIEEP 292
>gnl|CDD|237626 PRK14154, PRK14154, heat shock protein GrpE; Provisional.
Length = 208
Score = 30.5 bits (68), Expect = 0.21
Identities = 18/35 (51%), Positives = 22/35 (62%)
Query: 105 EEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEEREG 139
E + K K EK + E+E+E GGG G QE E REG
Sbjct: 14 ETKAKNKWEKVMEAEEEQEEGGGDGSQEMEPHREG 48
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 30.4 bits (69), Expect = 0.26
Identities = 20/65 (30%), Positives = 32/65 (49%)
Query: 73 DEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQE 132
DE + EG E K E + EDE E+ E ++ K++ + E++EE E E
Sbjct: 109 DEDSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAE 168
Query: 133 KEEER 137
E+E+
Sbjct: 169 AEKEK 173
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 30.4 bits (69), Expect = 0.31
Identities = 18/70 (25%), Positives = 32/70 (45%), Gaps = 2/70 (2%)
Query: 69 KKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKK--KEEKAKKKEQEEEGGG 126
KK ++ E +E E E++ E + +EG EE+ EEK + E+
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKL 283
Query: 127 GGGEQEKEEE 136
+++K+EE
Sbjct: 284 EILKEKKDEE 293
Score = 28.4 bits (64), Expect = 1.4
Identities = 11/33 (33%), Positives = 16/33 (48%)
Query: 106 EEEKKKEEKAKKKEQEEEGGGGGGEQEKEEERE 138
E E+++EE + E E G EQE E +
Sbjct: 231 ETEEREEETDVEIETTSETKGTKQEQEGSTEED 263
Score = 28.0 bits (63), Expect = 1.7
Identities = 15/71 (21%), Positives = 30/71 (42%), Gaps = 5/71 (7%)
Query: 66 DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG 125
+ + + +EE + E E K G + E E EE+ E+ + ++ E+
Sbjct: 227 KETSETEEREEETDVEIETTSETK-----GTKQEQEGSTEEDPSLFSEEKEDPDKTEDLD 281
Query: 126 GGGGEQEKEEE 136
+EK++E
Sbjct: 282 KLEILKEKKDE 292
Score = 27.7 bits (62), Expect = 2.3
Identities = 16/46 (34%), Positives = 22/46 (47%), Gaps = 2/46 (4%)
Query: 95 GEEDEVEEGEEEEEKKKE--EKAKKKEQEEEGGGGGGEQEKEEERE 138
E E EE EEE + + E + K +QE+EG EE+E
Sbjct: 227 KETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKE 272
Score = 26.1 bits (58), Expect = 7.7
Identities = 12/46 (26%), Positives = 21/46 (45%)
Query: 93 ENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEERE 138
E EE+ E E E K ++ ++ EE+ E+E ++ E
Sbjct: 233 EEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTE 278
>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT. This family
consists of several bacterial cobalamin biosynthesis
(CobT) proteins. CobT is involved in the transformation
of precorrin-3 into cobyrinic acid.
Length = 282
Score = 29.8 bits (67), Expect = 0.40
Identities = 9/52 (17%), Positives = 21/52 (40%)
Query: 87 EMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEERE 138
+M + EE E + E ++E K++++++ G + E
Sbjct: 201 DMLSSMDMAEELGDEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSE 252
Score = 27.1 bits (60), Expect = 3.2
Identities = 9/42 (21%), Positives = 20/42 (47%)
Query: 65 DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEE 106
D+ K ++ ++ E G + ++ + +E+E GE E
Sbjct: 226 DEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEME 267
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 29.3 bits (66), Expect = 0.45
Identities = 11/50 (22%), Positives = 24/50 (48%), Gaps = 1/50 (2%)
Query: 89 KGKEENGE-EDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEER 137
+E E E+E + E+E+ + + K++++ E E+EK +
Sbjct: 93 SSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTE 142
Score = 26.3 bits (58), Expect = 4.7
Identities = 15/55 (27%), Positives = 27/55 (49%)
Query: 65 DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKE 119
D ++ +E+ E +E E K + + ++ EV + + E+EK K E K K
Sbjct: 94 SDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKP 148
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 29.6 bits (67), Expect = 0.47
Identities = 19/67 (28%), Positives = 38/67 (56%)
Query: 73 DEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQE 132
+ ++ G GEE + +EE+G +DE EE ++++ +++EE ++EE+ + E
Sbjct: 307 EPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDE 366
Query: 133 KEEEREG 139
+EE E
Sbjct: 367 DDEEEEE 373
Score = 29.2 bits (66), Expect = 0.66
Identities = 15/52 (28%), Positives = 28/52 (53%)
Query: 73 DEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
D+ E+E + + EE+ + D+ ++ EEEEE+K+++K K E
Sbjct: 339 DDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSE 390
Score = 28.8 bits (65), Expect = 0.96
Identities = 17/51 (33%), Positives = 30/51 (58%)
Query: 73 DEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
D+ ++ E E + +E+ +ED +E +EEEE++++EK KKK E
Sbjct: 337 DDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAEST 387
Score = 28.4 bits (64), Expect = 1.5
Identities = 15/56 (26%), Positives = 27/56 (48%)
Query: 64 LDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKE 119
+D R D+ + + + + + G ED+ E+ EE+ E+KK +K KE
Sbjct: 126 IDKDDDVRDDDDFDDDDLGDLASDDRAAHFGGGEDDEEDEEEQPERKKSKKEVMKE 181
Score = 28.0 bits (63), Expect = 1.9
Identities = 15/51 (29%), Positives = 29/51 (56%)
Query: 73 DEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
D+ + +E + + ++E E+ + E+ EEEEE++KE+K KK +
Sbjct: 338 DDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTR 388
Score = 27.7 bits (62), Expect = 2.4
Identities = 22/74 (29%), Positives = 36/74 (48%), Gaps = 3/74 (4%)
Query: 65 DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
D+ D +G + EE G ++ EED+ ++ EEEEE + +E++EE
Sbjct: 304 DEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEE---DVDLSDEEEDEED 360
Query: 125 GGGGGEQEKEEERE 138
E ++EEE E
Sbjct: 361 EDSDDEDDEEEEEE 374
Score = 27.3 bits (61), Expect = 3.0
Identities = 18/58 (31%), Positives = 36/58 (62%)
Query: 65 DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEE 122
+D+ D +EE+E EE ++E+ ++++ EE EEEE++KK++K+ + + E
Sbjct: 333 EDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSE 390
Score = 25.7 bits (57), Expect = 9.9
Identities = 18/74 (24%), Positives = 31/74 (41%)
Query: 63 VLDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEE 122
+ +E+EE E E+ E EED EEE+E+ ++ + E+EE
Sbjct: 313 NFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEE 372
Query: 123 EGGGGGGEQEKEEE 136
E +++ E
Sbjct: 373 EEEKEKKKKKSAES 386
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 29.8 bits (68), Expect = 0.47
Identities = 9/49 (18%), Positives = 23/49 (46%)
Query: 73 DEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQE 121
+ +E G E++ +++ +E+E E+ ++ E + +K E
Sbjct: 174 VDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEKVLE 222
Score = 26.3 bits (59), Expect = 6.0
Identities = 13/53 (24%), Positives = 22/53 (41%)
Query: 69 KKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQE 121
++R E + E+ EE + +E EEEEE + ++ E E
Sbjct: 163 ERRLRELIDGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESE 215
>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3. This family of
proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). SH3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of SH3, AAPs are retained in the ER.
Length = 196
Score = 29.2 bits (66), Expect = 0.51
Identities = 10/38 (26%), Positives = 18/38 (47%), Gaps = 2/38 (5%)
Query: 96 EEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEK 133
+ +EE EE + A K+E+ EG +++K
Sbjct: 161 DAKLLEEFAAEEAEA--AAAAKEEESAEGEKKESKKKK 196
Score = 26.9 bits (60), Expect = 3.7
Identities = 9/34 (26%), Positives = 15/34 (44%)
Query: 100 VEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEK 133
+ EE +E +A +EEE G ++ K
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESK 193
Score = 26.5 bits (59), Expect = 4.8
Identities = 12/41 (29%), Positives = 19/41 (46%), Gaps = 3/41 (7%)
Query: 72 RDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKE 112
+ + E E E +E+E EGE++E KKK+
Sbjct: 159 WKDAKLLE---EFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
(nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
phosphoribosyltransferase) [Coenzyme metabolism].
Length = 620
Score = 29.4 bits (66), Expect = 0.52
Identities = 17/74 (22%), Positives = 33/74 (44%)
Query: 65 DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
D A++ D+ E++ E G++ + E EE E +E +++E + EEG
Sbjct: 216 MDMAEETGDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEG 275
Query: 125 GGGGGEQEKEEERE 138
E ++ E +
Sbjct: 276 EMDAAEASEDSESD 289
Score = 27.9 bits (62), Expect = 1.9
Identities = 16/75 (21%), Positives = 32/75 (42%), Gaps = 4/75 (5%)
Query: 65 DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
DD ++ DE++ + + E+ E G E+ E EE++ + + +E E +
Sbjct: 224 DDGIEEDADEEDGDDDQPDNNED----SEAGREESEGSDESEEDEAEATDGEGEEGEMDA 279
Query: 125 GGGGGEQEKEEEREG 139
+ E +E E
Sbjct: 280 AEASEDSESDESDED 294
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 29.3 bits (66), Expect = 0.53
Identities = 14/55 (25%), Positives = 35/55 (63%)
Query: 64 LDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKK 118
L ++ +K + E EEK+ + ++ K K++ ++ + ++ +++ EKK E++A+ K
Sbjct: 67 LAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDK 121
Score = 28.5 bits (64), Expect = 0.98
Identities = 10/82 (12%), Positives = 37/82 (45%)
Query: 40 HIEFSDNTLCLQLVEGKPNGLLCVLDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDE 99
D + + + + A++ K+E E ++ + K ++ ++D+
Sbjct: 40 PSHLQDRHFATPIYDAEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDK 99
Query: 100 VEEGEEEEEKKKEEKAKKKEQE 121
++ +++++K +++ K+ E +
Sbjct: 100 DKKDDKKDDKSEKKDEKEAEDK 121
Score = 27.3 bits (61), Expect = 2.1
Identities = 14/53 (26%), Positives = 32/53 (60%)
Query: 86 EEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEERE 138
EE++ ++ EE + + ++++ KKK++K K K+ +++ + EKE E +
Sbjct: 69 EEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDK 121
Score = 26.2 bits (58), Expect = 5.7
Identities = 9/56 (16%), Positives = 28/56 (50%)
Query: 67 QAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEE 122
+ K++ K++K ++ ++ K ++ ++ E ++ +E E+K ++ E
Sbjct: 79 EEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLS 134
Score = 26.2 bits (58), Expect = 5.8
Identities = 10/54 (18%), Positives = 33/54 (61%)
Query: 69 KKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEE 122
KK +EK++ + ++ ++ K K+++ ++D+ ++ E++++K+ E + +
Sbjct: 75 KKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKS 128
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 29.6 bits (66), Expect = 0.64
Identities = 15/55 (27%), Positives = 28/55 (50%), Gaps = 3/55 (5%)
Query: 73 DEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEE---KKKEEKAKKKEQEEEG 124
DE +++ + E+ + E+G E+ V+E EE E K EE + + E++
Sbjct: 4044 DEDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQA 4098
Score = 29.6 bits (66), Expect = 0.68
Identities = 14/51 (27%), Positives = 25/51 (49%), Gaps = 3/51 (5%)
Query: 65 DDQAKKRRDEKEEKEGGREGGEEMKG--KEENGEEDEVEEGEEEEEKKKEE 113
+++ + + +E E RE +M G K+ E+E + EEE + EE
Sbjct: 3842 NEEDTANQSDLDESEA-RELESDMNGVTKDSVVSENENSDSEEENQDLDEE 3891
Score = 29.2 bits (65), Expect = 0.77
Identities = 17/87 (19%), Positives = 31/87 (35%), Gaps = 16/87 (18%)
Query: 64 LDDQAKKRRDEKEEKEGGREGGEEMKGKEENG----------------EEDEVEEGEEEE 107
+D +A E+ + E +E +E N +E E+G EE
Sbjct: 4011 MDMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEEN 4070
Query: 108 EKKKEEKAKKKEQEEEGGGGGGEQEKE 134
++ EE + + +E G E +
Sbjct: 4071 VQENEESTEDGVKSDEELEQGEVPEDQ 4097
Score = 27.3 bits (60), Expect = 3.1
Identities = 20/81 (24%), Positives = 33/81 (40%), Gaps = 6/81 (7%)
Query: 65 DDQAKKRRDEKEEK----EGGREGGEEMKGKEENGEEDEVE--EGEEEEEKKKEEKAKKK 118
+D K D EE E E G + + E GE E + + + + K + +
Sbjct: 4057 EDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQAIDNHPKMDAKSTFASAEA 4116
Query: 119 EQEEEGGGGGGEQEKEEEREG 139
++E G GE E+ E +G
Sbjct: 4117 DEENTDKGIVGENEELGEEDG 4137
Score = 26.1 bits (57), Expect = 7.9
Identities = 12/69 (17%), Positives = 28/69 (40%), Gaps = 4/69 (5%)
Query: 74 EKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKE----EKAKKKEQEEEGGGGGG 129
++E ++G + + + + E +EE + E + E++ G G
Sbjct: 4085 DEELEQGEVPEDQAIDNHPKMDAKSTFASAEADEENTDKGIVGENEELGEEDGVRGNGTA 4144
Query: 130 EQEKEEERE 138
+ E E+ +E
Sbjct: 4145 DGEFEQVQE 4153
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 28.4 bits (64), Expect = 0.65
Identities = 11/30 (36%), Positives = 19/30 (63%)
Query: 101 EEGEEEEEKKKEEKAKKKEQEEEGGGGGGE 130
EE+++++EE+ +K+E EEE G G
Sbjct: 74 AAAAEEKKEEEEEEEEKEESEEEAAAGLGA 103
Score = 27.2 bits (61), Expect = 1.4
Identities = 10/28 (35%), Positives = 17/28 (60%)
Query: 101 EEGEEEEEKKKEEKAKKKEQEEEGGGGG 128
EE+K++EE+ ++KE+ EE G
Sbjct: 73 AAAAAEEKKEEEEEEEEKEESEEEAAAG 100
>gnl|CDD|215443 PLN02829, PLN02829, Probable galacturonosyltransferase.
Length = 639
Score = 29.4 bits (66), Expect = 0.68
Identities = 16/85 (18%), Positives = 34/85 (40%)
Query: 39 RHIEFSDNTLCLQLVEGKPNGLLCVLDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEED 98
+ IE L L +L DD + D+ ++ + G ++ +E+N +
Sbjct: 81 KTIEPDSQDLLLDKRGEHKARVLSATDDDTHSQTDDIIKQVTQKAGQDDSDQQEKNSQSQ 140
Query: 99 EVEEGEEEEEKKKEEKAKKKEQEEE 123
+ E E ++ + +K E+E
Sbjct: 141 SASQAESLEHVQQSAQTSEKVDEKE 165
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 29.2 bits (66), Expect = 0.77
Identities = 13/60 (21%), Positives = 36/60 (60%)
Query: 64 LDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
+ +QA D+ ++ + + ++ E++ E+D+ ++ ++E+E+KKE K +K +++
Sbjct: 126 VLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDD 185
Score = 26.1 bits (58), Expect = 8.4
Identities = 10/71 (14%), Positives = 36/71 (50%)
Query: 65 DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
D + D+ ++ + + ++ +++ E+D+ ++ +++ + + EEK + KE E+
Sbjct: 123 DIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLS 182
Query: 125 GGGGGEQEKEE 135
++++
Sbjct: 183 DDDDFVWDEDD 193
Score = 25.7 bits (57), Expect = 9.5
Identities = 8/75 (10%), Positives = 33/75 (44%)
Query: 64 LDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
+ ++ ++ + + + ++ ++DE ++ +++++ +E +KKE +E
Sbjct: 119 NYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKEL 178
Query: 124 GGGGGGEQEKEEERE 138
+ +E +
Sbjct: 179 EKLSDDDDFVWDEDD 193
>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
Length = 186
Score = 28.6 bits (64), Expect = 0.80
Identities = 18/74 (24%), Positives = 31/74 (41%), Gaps = 1/74 (1%)
Query: 66 DQAKKRRDE-KEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
QA+ + + KEEKEG ++ K + + E E++ + + E +K+ E
Sbjct: 97 AQAEAKFEAWKEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAA 156
Query: 125 GGGGGEQEKEEERE 138
E EE E
Sbjct: 157 AVAAEEAAAAEEEE 170
>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
[Transcription / Chromatin structure and dynamics].
Length = 279
Score = 28.8 bits (64), Expect = 0.81
Identities = 16/77 (20%), Positives = 31/77 (40%)
Query: 62 CVLDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQE 121
++ D + + + EE + + EE+E EE + + + ++E+E
Sbjct: 155 NIVWDNDEDNDEAPPAQPDVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEE 214
Query: 122 EEGGGGGGEQEKEEERE 138
E G GE + E E
Sbjct: 215 EAEGSDDGEDVVDYEGE 231
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 28.1 bits (63), Expect = 0.82
Identities = 13/30 (43%), Positives = 20/30 (66%)
Query: 101 EEGEEEEEKKKEEKAKKKEQEEEGGGGGGE 130
EEEEE+++EE+ +++E EEE G G
Sbjct: 73 AAAEEEEEEEEEEEEEEEESEEEAMAGLGA 102
Score = 26.2 bits (58), Expect = 3.2
Identities = 10/28 (35%), Positives = 18/28 (64%)
Query: 101 EEGEEEEEKKKEEKAKKKEQEEEGGGGG 128
EEEE+++EE+ +++E+ EE G
Sbjct: 72 AAAAEEEEEEEEEEEEEEEESEEEAMAG 99
>gnl|CDD|237496 PRK13766, PRK13766, Hef nuclease; Provisional.
Length = 773
Score = 29.1 bits (66), Expect = 0.89
Identities = 21/73 (28%), Positives = 38/73 (52%), Gaps = 4/73 (5%)
Query: 71 RRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKK----EEKAKKKEQEEEGGG 126
RR EK+ KE + + K + +E++ E EE++E+ + K+E+EEE
Sbjct: 490 RRKEKKMKEELKNLKGILNKKLQELDEEQKGEEEEKDEQLSLDDFVKSKGKEEEEEEEKE 549
Query: 127 GGGEQEKEEEREG 139
++ +E+E EG
Sbjct: 550 EKDKETEEDEPEG 562
>gnl|CDD|215595 PLN03130, PLN03130, ABC transporter C family member; Provisional.
Length = 1622
Score = 28.9 bits (65), Expect = 0.93
Identities = 14/54 (25%), Positives = 26/54 (48%), Gaps = 2/54 (3%)
Query: 73 DEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAK--KKEQEEEG 124
+E E+ G E + NG + +++ ++K KE K+ K+E+ E G
Sbjct: 844 EEYVEENGEEEDDQTSSKPVANGNANNLKKDSSSKKKSKEGKSVLIKQEERETG 897
Score = 26.6 bits (59), Expect = 6.3
Identities = 15/62 (24%), Positives = 21/62 (33%), Gaps = 4/62 (6%)
Query: 77 EKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEE 136
E G E E E EED+ + K +++ G K+EE
Sbjct: 838 ENAGKMEEYVE----ENGEEEDDQTSSKPVANGNANNLKKDSSSKKKSKEGKSVLIKQEE 893
Query: 137 RE 138
RE
Sbjct: 894 RE 895
>gnl|CDD|235396 PRK05299, rpsB, 30S ribosomal protein S2; Provisional.
Length = 258
Score = 28.6 bits (65), Expect = 0.94
Identities = 12/34 (35%), Positives = 21/34 (61%)
Query: 90 GKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
G++ E EE EE EE+++EE+ ++ E+ E
Sbjct: 224 GRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEA 257
Score = 26.3 bits (59), Expect = 5.3
Identities = 15/35 (42%), Positives = 20/35 (57%)
Query: 81 GREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKA 115
GR+G +EE E +E EE EEEEE ++ E
Sbjct: 224 GRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34. This family represents
herpes virus protein U79 and cytomegalovirus early
phosphoprotein P34 (UL112).
Length = 238
Score = 28.7 bits (64), Expect = 0.96
Identities = 21/100 (21%), Positives = 42/100 (42%), Gaps = 7/100 (7%)
Query: 45 DNTLCLQLVEGKPNGLLCVLDDQAKKRRDEKEEKEG-----GREGGEEMK--GKEENGEE 97
+ L + V + + + EK EKE + E K GK++
Sbjct: 112 NGKLQVTYVTHSKVAHEAEIRNLGDVKNAEKFEKECRALSRKKSDDEHRKRSGKQKEKRR 171
Query: 98 DEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEER 137
E + +E+ +KK+E+ ++ ++++ GGGGG +
Sbjct: 172 VEDSQKHKEDRRKKQEEKRRNDEDKRPGGGGGSSGGQSGL 211
>gnl|CDD|218556 pfam05327, RRN3, RNA polymerase I specific transcription initiation
factor RRN3. This family consists of several eukaryotic
proteins which are homologous to the yeast RRN3 protein.
RRN3 is one of the RRN genes specifically required for
the transcription of rDNA by RNA polymerase I (Pol I) in
Saccharomyces cerevisiae.
Length = 554
Score = 28.8 bits (65), Expect = 1.0
Identities = 10/55 (18%), Positives = 27/55 (49%)
Query: 64 LDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKK 118
LD + + D+ +++E R +E E++ + + ++ EE + + + K+
Sbjct: 214 LDVEIQNELDDIDDEEEERVLADEDDDDEDDMFDMDDDDEEESDPEVERTSTIKE 268
Score = 26.1 bits (58), Expect = 7.2
Identities = 12/46 (26%), Positives = 22/46 (47%), Gaps = 1/46 (2%)
Query: 73 DEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEE-KKKEEKAKK 117
DE+EE+ E ++ + ++DE E E E +E ++K
Sbjct: 227 DEEEERVLADEDDDDEDDMFDMDDDDEEESDPEVERTSTIKEVSEK 272
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 28.8 bits (64), Expect = 1.0
Identities = 18/73 (24%), Positives = 35/73 (47%)
Query: 66 DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG 125
+ KK EE + + +E + E E+ ++ EE +K+ + ++K+QEE+
Sbjct: 80 QRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQAR 139
Query: 126 GGGGEQEKEEERE 138
EQ+K+ E
Sbjct: 140 KAAAEQKKKAEAA 152
>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
Length = 413
Score = 28.8 bits (64), Expect = 1.0
Identities = 5/66 (7%), Positives = 11/66 (16%)
Query: 74 EKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEK 133
E + + +G E + E Q+
Sbjct: 104 TPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQP 163
Query: 134 EEEREG 139
+
Sbjct: 164 SSFLQP 169
Score = 28.0 bits (62), Expect = 2.0
Identities = 8/63 (12%), Positives = 10/63 (15%)
Query: 77 EKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEE 136
E E G E + +Q E EE
Sbjct: 119 SPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEP 178
Query: 137 REG 139
Sbjct: 179 EPP 181
Score = 27.6 bits (61), Expect = 2.2
Identities = 8/72 (11%), Positives = 14/72 (19%)
Query: 67 QAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGG 126
Q + ++ + E E E + E EE
Sbjct: 57 QVRAVAEQGHRQTESDTETAEESRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELAS 116
Query: 127 GGGEQEKEEERE 138
G +
Sbjct: 117 GLSPENTSGSSP 128
Score = 27.6 bits (61), Expect = 2.7
Identities = 7/70 (10%), Positives = 13/70 (18%)
Query: 69 KKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGG 128
K+ R + G E E+ E ++
Sbjct: 84 KEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSH 143
Query: 129 GEQEKEEERE 138
+ E
Sbjct: 144 PGPHEPAPPE 153
Score = 26.9 bits (59), Expect = 4.6
Identities = 6/67 (8%), Positives = 12/67 (17%)
Query: 73 DEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQE 132
E E E ++ + ++ E
Sbjct: 125 GSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSE 184
Query: 133 KEEEREG 139
E + G
Sbjct: 185 PEPDSPG 191
Score = 26.5 bits (58), Expect = 5.2
Identities = 14/74 (18%), Positives = 15/74 (20%)
Query: 66 DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG 125
D EKE +GG G E G G EE E
Sbjct: 72 DTETAEESRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESP 131
Query: 126 GGGGEQEKEEEREG 139
G
Sbjct: 132 ASHSPPPSPPSHPG 145
Score = 26.1 bits (57), Expect = 7.0
Identities = 13/78 (16%), Positives = 22/78 (28%), Gaps = 1/78 (1%)
Query: 63 VLDDQAKKRRDEKEEKEGGREGGEEMKGK-EENGEEDEVEEGEEEEEKKKEEKAKKKEQE 121
V + ++ + E E R G +E +G+ +G E E+
Sbjct: 61 VAEQGHRQTESDTETAEESRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSP 120
Query: 122 EEGGGGGGEQEKEEEREG 139
E G E
Sbjct: 121 ENTSGSSPESPASHSPPP 138
>gnl|CDD|177447 PHA02664, PHA02664, hypothetical protein; Provisional.
Length = 534
Score = 28.8 bits (64), Expect = 1.0
Identities = 18/75 (24%), Positives = 33/75 (44%), Gaps = 7/75 (9%)
Query: 71 RRDEKEEKEGGREGGEEMKGKEENGEEDEV-------EEGEEEEEKKKEEKAKKKEQEEE 123
R D E+ +E G+E E++ + +E + ++ +E E ++
Sbjct: 449 RADSDEDDMDEQESGDERADGEDDSDSSYSYSTTSSEDESDSADDSWGDESDSGIEHDDG 508
Query: 124 GGGGGGEQEKEEERE 138
G G E+E+EEER
Sbjct: 509 GVGQAIEEEEEEERA 523
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 28.8 bits (64), Expect = 1.1
Identities = 17/55 (30%), Positives = 28/55 (50%), Gaps = 2/55 (3%)
Query: 67 QAKKRRDEKEEKEGGREGGEEMKGKEENGEE--DEVEEGEEEEEKKKEEKAKKKE 119
+ +KR EK + E + E +K K+ + E + E+E E K+ E KK+E
Sbjct: 277 ENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKRE 331
Score = 27.3 bits (60), Expect = 3.4
Identities = 19/82 (23%), Positives = 34/82 (41%), Gaps = 9/82 (10%)
Query: 66 DQAKKRRDEKEEK--------EGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKK 117
D A K+RDE +K + + + ++ E+E+ + E +K EE K
Sbjct: 241 DNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKA 300
Query: 118 KEQE-EEGGGGGGEQEKEEERE 138
K+ + + EKE E +
Sbjct: 301 KDHKAFDLKQESKASEKEAEDK 322
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 28.7 bits (64), Expect = 1.1
Identities = 20/72 (27%), Positives = 41/72 (56%)
Query: 67 QAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGG 126
+ K KEE++ + EE K K+E +E+ + +EE K+K +K++++E+
Sbjct: 96 KEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVE 155
Query: 127 GGGEQEKEEERE 138
++E+E++RE
Sbjct: 156 EPRDREEEKKRE 167
Score = 26.0 bits (57), Expect = 7.5
Identities = 19/72 (26%), Positives = 41/72 (56%)
Query: 65 DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
+ + + ++E++EKE +E ++ K K + +D + E +E++ +EK K+KE++ E
Sbjct: 97 EPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEE 156
Query: 125 GGGGGEQEKEEE 136
E++K E
Sbjct: 157 PRDREEEKKRER 168
>gnl|CDD|240285 PTZ00135, PTZ00135, 60S acidic ribosomal protein P0; Provisional.
Length = 310
Score = 28.4 bits (64), Expect = 1.1
Identities = 6/25 (24%), Positives = 11/25 (44%)
Query: 104 EEEEEKKKEEKAKKKEQEEEGGGGG 128
A+++E+EE+ G G
Sbjct: 283 AAAAAAAAAAPAEEEEEEEDDMGFG 307
Score = 26.5 bits (59), Expect = 5.8
Identities = 6/25 (24%), Positives = 9/25 (36%)
Query: 105 EEEEKKKEEKAKKKEQEEEGGGGGG 129
++E+EEE G G
Sbjct: 283 AAAAAAAAAAPAEEEEEEEDDMGFG 307
>gnl|CDD|240331 PTZ00254, PTZ00254, 40S ribosomal protein SA; Provisional.
Length = 249
Score = 28.1 bits (63), Expect = 1.2
Identities = 12/42 (28%), Positives = 16/42 (38%)
Query: 72 RDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEE 113
RD +E +E E ++ E EEGEE E
Sbjct: 205 RDPEEAEEKEEAAAETAGVEDAAAAAAEAEEGEEWVAAANNE 246
Score = 28.1 bits (63), Expect = 1.5
Identities = 11/39 (28%), Positives = 14/39 (35%)
Query: 96 EEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKE 134
EE E +E E E+ A + EEG E
Sbjct: 208 EEAEEKEEAAAETAGVEDAAAAAAEAEEGEEWVAAANNE 246
>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
Length = 259
Score = 28.2 bits (63), Expect = 1.4
Identities = 11/32 (34%), Positives = 18/32 (56%)
Query: 91 KEENGEEDEVEEGEEEEEKKKEEKAKKKEQEE 122
++E E E+ ++K++ E KKKE EE
Sbjct: 75 EDEKELSASSLEAEQAKKKEEAEAKKKKEMEE 106
Score = 27.0 bits (60), Expect = 3.5
Identities = 9/31 (29%), Positives = 19/31 (61%)
Query: 92 EENGEEDEVEEGEEEEEKKKEEKAKKKEQEE 122
+ E++ E E+ KK+E+A+ K+++E
Sbjct: 73 KPEDEKELSASSLEAEQAKKKEEAEAKKKKE 103
>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
Length = 813
Score = 28.2 bits (63), Expect = 1.4
Identities = 15/55 (27%), Positives = 31/55 (56%), Gaps = 1/55 (1%)
Query: 67 QAKKRRDEKEEKEGGREGGEEMK-GKEENGEEDEVEEGEEEEEKKKEEKAKKKEQ 120
K R++ ++ + G++GG+ + GK+ N E D GE++ + K +K +K +
Sbjct: 734 VGKTAREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAK 788
>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
This model family represents the major capsid protein
component of the heads (capsids) of bacteriophage HK97,
phi-105, P27, and related phage. This model represents
one of several analogous families lacking detectable
sequence similarity. The gene encoding this component is
typically located in an operon encoding the small and
large terminase subunits, the portal protein and the
prohead or maturation protease [Mobile and
extrachromosomal element functions, Prophage functions].
Length = 384
Score = 28.1 bits (63), Expect = 1.5
Identities = 23/80 (28%), Positives = 36/80 (45%), Gaps = 15/80 (18%)
Query: 74 EKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAK---------------KK 118
KE +E E +E++ E+ + E EE + E + KEE K +
Sbjct: 1 LKELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEA 60
Query: 119 EQEEEGGGGGGEQEKEEERE 138
+ G GGGGE+E+EE +
Sbjct: 61 KPAASGEGGGGEEEEEEAKA 80
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 28.1 bits (63), Expect = 1.7
Identities = 11/32 (34%), Positives = 21/32 (65%)
Query: 91 KEENGEEDEVEEGEEEEEKKKEEKAKKKEQEE 122
+ E E++ ++E + + KK+E+ KKKE E+
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKKKKELEK 585
Score = 26.9 bits (60), Expect = 4.8
Identities = 13/34 (38%), Positives = 21/34 (61%)
Query: 105 EEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEERE 138
EE +++KEEK KEQ+ E++K++E E
Sbjct: 551 EELQREKEEKEALKEQKRLRKLKKQEEKKKKELE 584
>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit. This model
describes Pseudomonas denitrificans CobT gene product,
which is a cobalt chelatase subunit that functions in
cobalamin biosynthesis. Cobalamin (vitamin B12) can be
synthesized via several pathways, including an aerobic
pathway (found in Pseudomonas denitrificans) and an
anaerobic pathway (found in P. shermanii and Salmonella
typhimurium). These pathways differ in the point of
cobalt insertion during corrin ring formation. There are
apparently a number of variations on these two pathways,
where the major differences seem to be concerned with
the process of ring contraction. Confusion regarding the
functions of enzymes found in the aerobic vs. anaerobic
pathways has arisen because nonhomologous genes in these
different pathways were given the same gene symbols.
Thus, cobT in the aerobic pathway (P. denitrificans) is
not a homolog of cobT in the anaerobic pathway (S.
typhimurium). It should be noted that E. coli
synthesizes cobalamin only when it is supplied with the
precursor cobinamide, which is a complex intermediate.
Additionally, all E. coli cobalamin synthesis genes
(cobU, cobS and cobT) were named after their Salmonella
typhimurium homologs which function in the anaerobic
cobalamin synthesis pathway. This model describes the
aerobic cobalamin pathway Pseudomonas denitrificans CobT
gene product, which is a cobalt chelatase subunit, with
a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
heterotrimeric, ATP-dependent enzyme that catalyzes
cobalt insertion during cobalamin biosynthesis. The
other two subunits are the P. denitrificans CobS
(TIGR01650) and CobN (pfam02514 CobN/Magnesium
Chelatase) proteins. To avoid potential confusion with
the nonhomologous Salmonella typhimurium/E.coli cobT
gene product, the P. denitrificans gene symbol is not
used in the name of this model [Biosynthesis of
cofactors, prosthetic groups, and carriers, Heme,
porphyrin, and cobalamin].
Length = 600
Score = 28.0 bits (62), Expect = 1.7
Identities = 17/55 (30%), Positives = 30/55 (54%)
Query: 82 REGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEE 136
RE M+ EE G++ E E+ E+ ++ + E ++++ E EG G G +E E
Sbjct: 191 REMLRSMELAEEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESE 245
Score = 27.2 bits (60), Expect = 3.7
Identities = 15/73 (20%), Positives = 33/73 (45%), Gaps = 2/73 (2%)
Query: 66 DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG 125
+ A++ D+ E ++ E G++ + E EE EGE +E +E + E G
Sbjct: 198 ELAEEMGDDTESED--EEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGE 255
Query: 126 GGGGEQEKEEERE 138
+ ++++ +
Sbjct: 256 EEMVQSDQDDLPD 268
Score = 26.8 bits (59), Expect = 4.1
Identities = 14/58 (24%), Positives = 28/58 (48%), Gaps = 1/58 (1%)
Query: 66 DQAKKRRDEKEEKEGGREGGEEMKGKEEN-GEEDEVEEGEEEEEKKKEEKAKKKEQEE 122
D + +E+EE+ G G+E +E+ + E E GEEE + ++ + ++
Sbjct: 216 DDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQSDQDDLPDESDDD 273
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 27.8 bits (62), Expect = 1.7
Identities = 12/51 (23%), Positives = 23/51 (45%)
Query: 86 EEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEE 136
E+ K ++ G + + +++ ++ K K Q E QEK+EE
Sbjct: 19 EKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEE 69
Score = 26.3 bits (58), Expect = 6.4
Identities = 13/51 (25%), Positives = 23/51 (45%), Gaps = 3/51 (5%)
Query: 70 KRRDEKEEKEGGR-EGGEEMKGKEENG--EEDEVEEGEEEEEKKKEEKAKK 117
K E K + + E+ K + E + +E++E+KK+ K KK
Sbjct: 27 KGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKK 77
>gnl|CDD|184724 PRK14520, rpsP, 30S ribosomal protein S16; Provisional.
Length = 155
Score = 27.4 bits (61), Expect = 1.9
Identities = 12/62 (19%), Positives = 17/62 (27%), Gaps = 3/62 (4%)
Query: 75 KEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKE 134
E K E + + G E ++KK +A E E
Sbjct: 96 AEPKPSKLELFNAALAEADGGPT---AEATTPKKKKAAAEAAAAEAAAPAAEAAAAAAAE 152
Query: 135 EE 136
EE
Sbjct: 153 EE 154
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 27.8 bits (62), Expect = 2.0
Identities = 19/66 (28%), Positives = 34/66 (51%), Gaps = 7/66 (10%)
Query: 64 LDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEE-------EKKKEEKAK 116
++ QA + DE+E + R G + E E+++ +E ++EE E K EKA+
Sbjct: 5 VNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKREKAR 64
Query: 117 KKEQEE 122
KE ++
Sbjct: 65 LKELKK 70
>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
[Transcription / DNA replication, recombination, and
repair / Chromatin structure and dynamics].
Length = 1001
Score = 28.1 bits (62), Expect = 2.1
Identities = 21/85 (24%), Positives = 39/85 (45%), Gaps = 8/85 (9%)
Query: 54 EGKPNGLLCVLDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEE 113
+G + L+ DD++ + +E E E + +E+ E DE EE +E E E+
Sbjct: 917 DGGWSFLMVGSDDESDESEEEVSEYEASSD--------DESDETDEDEESDESSEDLSED 968
Query: 114 KAKKKEQEEEGGGGGGEQEKEEERE 138
+++ +EE G E E + +
Sbjct: 969 ESENDSSDEEDGEDWDELESKAAYD 993
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 27.9 bits (62), Expect = 2.1
Identities = 17/71 (23%), Positives = 33/71 (46%)
Query: 68 AKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGG 127
+K+ ++ + +E EE GK + EDE E +E + E++ K + +
Sbjct: 227 KEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEE 286
Query: 128 GGEQEKEEERE 138
E+EKE+ +
Sbjct: 287 TEEKEKEKRKR 297
>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO. Bacillus
spores are protected by a protein shell consisting of
over 50 different polypeptides, known as the coat. This
family of proteins has an important morphogenetic role
in coat assembly, it is involved in the assembly of at
least 5 different coat proteins including CotB, CotG,
CotS, CotSA and CotW. It is likely to act at a late
stage of coat assembly.
Length = 185
Score = 27.5 bits (61), Expect = 2.3
Identities = 21/70 (30%), Positives = 42/70 (60%)
Query: 69 KKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGG 128
K+ DEKEE++ + + K +E+ E +E E+ +E E ++E+ A+++E+EE
Sbjct: 33 IKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEE 92
Query: 129 GEQEKEEERE 138
E+E E+ ++
Sbjct: 93 KEEEAEDVKQ 102
>gnl|CDD|227504 COG5177, COG5177, Uncharacterized conserved protein [Function
unknown].
Length = 769
Score = 27.7 bits (61), Expect = 2.4
Identities = 15/64 (23%), Positives = 28/64 (43%)
Query: 60 LLCVLDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKE 119
+ +DD K D +E ++ +G EE E+E + E + +KE++ E
Sbjct: 409 TMSAIDDDDPKENDNEEVAGDEESAIDDNEGFEELSPEEEERQLREFRDMEKEDREFPDE 468
Query: 120 QEEE 123
E +
Sbjct: 469 AELQ 472
>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
This family consists of several Nucleopolyhedrovirus
capsid protein P87 sequences. P87 is expressed late in
infection and concentrated in infected cell nuclei.
Length = 606
Score = 27.5 bits (61), Expect = 2.4
Identities = 10/60 (16%), Positives = 22/60 (36%)
Query: 64 LDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
E + R+ DE E+ +E+E E++ K++ +E++
Sbjct: 338 AISSLPDSDSEASDSGPTRKRKRRRVPPLPEYSSDEDEDDSDEDEVDYEKERKRRREEDK 397
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 27.4 bits (61), Expect = 2.9
Identities = 13/38 (34%), Positives = 24/38 (63%)
Query: 89 KGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGG 126
K K+E+ + ++ E++++KKKE+K K E E + G
Sbjct: 56 KDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGF 93
>gnl|CDD|165587 PHA03343, PHA03343, US22 family homolog; Provisional.
Length = 578
Score = 27.3 bits (60), Expect = 3.2
Identities = 18/83 (21%), Positives = 26/83 (31%), Gaps = 12/83 (14%)
Query: 69 KKRRDEKEEKEGGREGGEEMKGKEENGEEDEVE------------EGEEEEEKKKEEKAK 116
+K+ E KG+EE D++E E E A
Sbjct: 99 RKKGPGGETAAAAAAAAAAEKGEEEEEAGDKIEPPAVRGSVSPIRGHETVEGAAAAATAA 158
Query: 117 KKEQEEEGGGGGGEQEKEEEREG 139
+G GGGE E+ +G
Sbjct: 159 TSAAACDGDDGGGEDGGEDGGKG 181
>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
[Translation, ribosomal structure and biogenesis].
Length = 109
Score = 26.2 bits (58), Expect = 3.3
Identities = 9/29 (31%), Positives = 16/29 (55%)
Query: 101 EEGEEEEEKKKEEKAKKKEQEEEGGGGGG 129
E E +E ++EEK ++ E+E + G
Sbjct: 78 EAAAEADEAEEEEKEEEAEEESDDDMLFG 106
Score = 25.4 bits (56), Expect = 7.8
Identities = 6/29 (20%), Positives = 14/29 (48%)
Query: 101 EEGEEEEEKKKEEKAKKKEQEEEGGGGGG 129
E E ++ +EE+ +++ +EE
Sbjct: 77 AEAAAEADEAEEEEKEEEAEEESDDDMLF 105
>gnl|CDD|236277 PRK08493, PRK08493, NADH dehydrogenase subunit G; Validated.
Length = 819
Score = 27.4 bits (61), Expect = 3.5
Identities = 15/62 (24%), Positives = 30/62 (48%), Gaps = 2/62 (3%)
Query: 64 LDDQAKKRRDEKEEKEGGREGGEEMKG--KEENGEEDEVEEGEEEEEKKKEEKAKKKEQE 121
L+++A + E+ ++ +E + +E+ E+ E +EE KK+ K KK E
Sbjct: 443 LEEEAILKSLEEFKQSIVKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKVKKIPE 502
Query: 122 EE 123
+
Sbjct: 503 VD 504
>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
[Transcription].
Length = 392
Score = 27.0 bits (59), Expect = 3.6
Identities = 12/52 (23%), Positives = 28/52 (53%)
Query: 72 RDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
+KE+++ E E K + ++ DE+ E +EE+++ +E + + +E
Sbjct: 291 EGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENEENERHTELLADE 342
>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
Length = 1627
Score = 27.1 bits (60), Expect = 3.7
Identities = 23/62 (37%), Positives = 34/62 (54%), Gaps = 4/62 (6%)
Query: 66 DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGE--EEEEKKKEEKAKKKEQEEE 123
D K+ +++KEEK+ E EE+K +E + E +E E+G E EK E KK +E
Sbjct: 274 DWLKELKEKKEEKD--EEKSEEVKTEEVDEEFEEEEKGFYYELYEKVNIEANKKFIKEVI 331
Query: 124 GG 125
G
Sbjct: 332 AG 333
>gnl|CDD|234064 TIGR02936, fdxN_nitrog, ferredoxin III, nif-specific. Members of
this family are homodimeric ferredoxins from nitrogen
fixation regions of many nitrogen-fixing bacteria. As
characterized in Rhodobacter capsulatus, these proteins
are homodimeric, with two 4Fe-4S clusters bound per
monomer. Although nif-specific, this protein family is
not usiveral, as other nitrogenase systems may
substitute flavodoxins, or different types of ferredoxin
[Central intermediary metabolism, Nitrogen fixation].
Length = 91
Score = 25.8 bits (57), Expect = 3.7
Identities = 8/24 (33%), Positives = 14/24 (58%)
Query: 87 EMKGKEENGEEDEVEEGEEEEEKK 110
+KG E GE ++ ++E E+K
Sbjct: 39 TLKGINEEGELVASDDDDDEIERK 62
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 27.2 bits (60), Expect = 3.7
Identities = 19/67 (28%), Positives = 35/67 (52%)
Query: 73 DEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQE 132
+ + EK GG G + E +ED+ +E ++ E EEK+KKK+++ + +
Sbjct: 199 ETEGEKGGGGRGKDLKIKDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDDD 258
Query: 133 KEEEREG 139
K+ +R G
Sbjct: 259 KKGKRGG 265
Score = 26.1 bits (57), Expect = 8.5
Identities = 16/65 (24%), Positives = 30/65 (46%)
Query: 74 EKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEK 133
+ EE+E K + E E+ E E E+ EE+ K KK ++ +G G +++
Sbjct: 296 DPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDD 355
Query: 134 EEERE 138
+ +
Sbjct: 356 SDSGD 360
Score = 26.1 bits (57), Expect = 9.2
Identities = 16/66 (24%), Positives = 33/66 (50%)
Query: 73 DEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQE 132
+E+E+K + ++E+ EE E E+ EEE K+ K KK + ++ G + +
Sbjct: 298 EEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSD 357
Query: 133 KEEERE 138
++ +
Sbjct: 358 SGDDSD 363
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 27.1 bits (60), Expect = 3.9
Identities = 12/59 (20%), Positives = 32/59 (54%)
Query: 65 DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
+ + + +K+E++ E ++ ++E ++ E E +E+KK+ E+A K+ ++
Sbjct: 73 SAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQ 131
>gnl|CDD|233870 TIGR02442, Cob-chelat-sub, cobaltochelatase subunit.
Cobaltochelatase is responsible for the insertion of
cobalt into the corrin ring of coenzyme B12 during its
biosynthesis. Two versions have been well described.
CbiK/CbiX is a monomeric, anaerobic version which acts
early in the biosynthesis (pfam06180). CobNST is a
trimeric, ATP-dependent, aerobic version which acts late
in the biosynthesis (TIGR02257/TIGR01650/TIGR01651). A
number of genomes (actinobacteria, cyanobacteria,
betaproteobacteria and pseudomonads) which apparently
biosynthesize B12, encode a cobN gene but are
demonstrably lacking cobS and cobT. These genomes do,
however contain a homolog (modelled here) of the
magnesium chelatase subunits BchI/BchD family. Aside
from the cyanobacteria (which have a separate magnesium
chelatase trimer), these species do not make chlorins,
so do not have any use for a magnesium chelatase.
Furthermore, in nearly all cases the members of this
family are proximal to either CobN itself or other genes
involved in cobalt transport or B12 biosynthesis.
Length = 633
Score = 27.0 bits (60), Expect = 3.9
Identities = 7/50 (14%), Positives = 18/50 (36%)
Query: 66 DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKA 115
++ + EK E E G + +E E + +++ + +
Sbjct: 326 EEKPEEPGPDPEKPDEGEDDAEQSGPRGHPTPGNDDEKEPDPQEEADGQG 375
>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein. Hsp70 chaperones help to fold
many proteins. Hsp70 assisted folding involves repeated
cycles of substrate binding and release. Hsp70 activity
is ATP dependent. Hsp70 proteins are made up of two
regions: the amino terminus is the ATPase domain and the
carboxyl terminus is the substrate binding region.
Length = 598
Score = 26.8 bits (60), Expect = 4.0
Identities = 9/30 (30%), Positives = 16/30 (53%)
Query: 93 ENGEEDEVEEGEEEEEKKKEEKAKKKEQEE 122
+DE+E ++ E+ E K+KE+ E
Sbjct: 499 SGLSDDEIERMVKDAEEYAAEDKKRKERIE 528
>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein. This family
consists of various plant development proteins which are
homologues of floricaula (FLO) and Leafy (LFY) proteins
which are floral meristem identity proteins. Mutations
in the sequences of these proteins affect flower and
leaf development.
Length = 382
Score = 26.9 bits (60), Expect = 4.3
Identities = 20/72 (27%), Positives = 34/72 (47%), Gaps = 9/72 (12%)
Query: 76 EEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQ---------EEEGGG 126
+EKE GG G + E+++++KK+ + + KE E++ G
Sbjct: 154 QEKEAAGSGGGGGGTWGLVGVPGHSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDDDGE 213
Query: 127 GGGEQEKEEERE 138
GGGE +E +RE
Sbjct: 214 GGGEGGEERQRE 225
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 26.9 bits (60), Expect = 4.6
Identities = 17/49 (34%), Positives = 25/49 (51%), Gaps = 9/49 (18%)
Query: 91 KEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEEREG 139
+E EE+E E GE +EE + EE+ +++ E KEEE G
Sbjct: 168 REGEEEEEEEEVGEADEEDEGEEEEEEEP---------EEVPKEEEGTG 207
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 26.6 bits (59), Expect = 4.6
Identities = 12/56 (21%), Positives = 26/56 (46%)
Query: 67 QAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEE 122
A ++ +K++KE E ++ + ++ EE+E + EE +E A +
Sbjct: 62 PATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQESASANSLSD 117
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 26.7 bits (59), Expect = 5.3
Identities = 11/37 (29%), Positives = 17/37 (45%), Gaps = 5/37 (13%)
Query: 101 EEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEER 137
E E + +KKE K++ E E +E +ER
Sbjct: 338 AEKNEAKARKKEIAQKRRAAERE-----INREARQER 369
Score = 26.4 bits (58), Expect = 5.5
Identities = 11/39 (28%), Positives = 17/39 (43%), Gaps = 8/39 (20%)
Query: 100 VEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEERE 138
E + +K E KA+KKE ++ + ERE
Sbjct: 330 AEIKKTRTAEKNEAKARKKEIAQK--------RRAAERE 360
Score = 26.0 bits (57), Expect = 7.8
Identities = 13/73 (17%), Positives = 22/73 (30%), Gaps = 4/73 (5%)
Query: 66 DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG 125
+ K R EK E R+ K + E E+ +E +A+ + +
Sbjct: 331 EIKKTRTAEKNE-AKARKKEIAQKRRAA---EREINREARQERAAAMARARARRAAVKAK 386
Query: 126 GGGGEQEKEEERE 138
G E
Sbjct: 387 KKGLIDASPNEDT 399
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
Genome duplication is precisely regulated by
cyclin-dependent kinases CDKs, which bring about the
onset of S phase by activating replication origins and
then prevent relicensing of origins until mitosis is
completed. The optimum sequence motif for CDK
phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
to have at least 11 potential phosphorylation sites.
Drc1 is required for DNA synthesis and S-M replication
checkpoint control. Drc1 associates with Cdc2 and is
phosphorylated at the onset of S phase when Cdc2 is
activated. Thus Cdc2 promotes DNA replication by
phosphorylating Drc1 and regulating its association with
Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
substrates required for DNA replication.
Length = 397
Score = 26.7 bits (59), Expect = 5.3
Identities = 13/54 (24%), Positives = 27/54 (50%), Gaps = 4/54 (7%)
Query: 65 DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKK 118
++ K E G G +E ++ E+DE + E E+++K ++K +K+
Sbjct: 318 EEIPKLDEKSLSEFLGYMGGIDE----DDEDEDDEESKEEVEKKQKVKKKPRKR 367
>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
mRNA processing and transport [Signal transduction
mechanisms / RNA processing and modification].
Length = 388
Score = 26.4 bits (58), Expect = 5.4
Identities = 12/50 (24%), Positives = 20/50 (40%)
Query: 74 EKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
E EK+ G EE +E E + E EE + + + + + E
Sbjct: 333 EVVEKQEGDVVTEESTDEESEDEVEIDESVIEEVAEMELLEVQVDDLAER 382
>gnl|CDD|220776 pfam10483, Hap2_elong, Histone acetylation protein 2. Hap2 is one
of three histone acetyltransferases proteins that, in
yeasts, are found associated with elongating forms of
RNA polymerase II (Elongator). The Haps can be isolated
in two forms, as a six-subunit complex with Elongator
and as a complex of the three proteins on their own. The
role of the Hap complex in transcription is still
speculative, being possibly to keep the HAT activity of
free Elongator in check, allowing histone acetylation
only in the presence of a transcribing polymerase, or
the interaction with Haps might render Elongator
susceptible to modifications thereby altering its
activity.
Length = 280
Score = 26.5 bits (59), Expect = 5.5
Identities = 14/57 (24%), Positives = 24/57 (42%), Gaps = 17/57 (29%)
Query: 90 GKEENGEEDEVEEGEEEEEK----------KKEEKAKKK-------EQEEEGGGGGG 129
KE+ + E E + E + +K+ A+++ QE+ GGG GG
Sbjct: 219 LKEKPEGDAEAAEEDPESLEDLTTFNLGLSEKQRLAREQVELPFFDAQEKLGGGEGG 275
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 26.2 bits (58), Expect = 5.5
Identities = 16/48 (33%), Positives = 27/48 (56%)
Query: 76 EEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
E E + KE EE+E +E ++++E KKE+K KK ++E+
Sbjct: 130 SESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKM 177
Score = 25.8 bits (57), Expect = 6.6
Identities = 14/55 (25%), Positives = 30/55 (54%), Gaps = 1/55 (1%)
Query: 65 DDQAKKRRDEKEEKEGGR-EGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKK 118
++ E EE+E + +E+K +++ ++ + + E + KKK++K KKK
Sbjct: 139 TTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 26.3 bits (58), Expect = 5.8
Identities = 13/72 (18%), Positives = 35/72 (48%)
Query: 65 DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
+ KK + ++ ++ ++ EE + ++E+ E+ E+ ++EE++ + + +
Sbjct: 152 KKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYDGML 211
Query: 125 GGGGGEQEKEEE 136
E+E EE
Sbjct: 212 VDSSDEEEGEEA 223
>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32. This family
consists of several mammalian specific proacrosin
binding protein sp32 sequences. sp32 is a sperm specific
protein which is known to bind with with 55- and 53-kDa
proacrosins and the 49-kDa acrosin intermediate. The
exact function of sp32 is unclear, it is thought however
that the binding of sp32 to proacrosin may be involved
in packaging the acrosin zymogen into the acrosomal
matrix.
Length = 243
Score = 26.2 bits (57), Expect = 5.8
Identities = 15/54 (27%), Positives = 25/54 (46%)
Query: 66 DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKE 119
Q K + ++E+ + + EE ++E EE E EEE K+EE +
Sbjct: 190 VQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQGTDD 243
Score = 26.2 bits (57), Expect = 7.2
Identities = 16/59 (27%), Positives = 27/59 (45%), Gaps = 2/59 (3%)
Query: 78 KEGGREGGEEMKGKEENGEEDEVEEGEE--EEEKKKEEKAKKKEQEEEGGGGGGEQEKE 134
GG + K K+E E +E EEK+ +E+ +++E EEE G+ +
Sbjct: 185 SLGGSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQGTDD 243
>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein. This
family includes archaebacterial L12, eukaryotic P0, P1
and P2.
Length = 88
Score = 25.3 bits (56), Expect = 6.0
Identities = 6/26 (23%), Positives = 17/26 (65%)
Query: 102 EGEEEEEKKKEEKAKKKEQEEEGGGG 127
E++K+E+ +++E++++ G G
Sbjct: 61 AAAAAAEEEKKEEEEEEEEDDDMGFG 86
Score = 25.3 bits (56), Expect = 6.0
Identities = 7/28 (25%), Positives = 15/28 (53%)
Query: 102 EGEEEEEKKKEEKAKKKEQEEEGGGGGG 129
++E+K +++E+EE+ G G
Sbjct: 59 AAAAAAAAEEEKKEEEEEEEEDDDMGFG 86
>gnl|CDD|130366 TIGR01299, synapt_SV2, synaptic vesicle protein SV2. This model
describes a tightly conserved subfamily of the larger
family of sugar (and other) transporters described by
PFAM model pfam00083. Members of this subfamily include
closely related forms SV2A and SV2B of synaptic vesicle
protein from vertebrates and a more distantly related
homolog (below trusted cutoff) from Drosophila
melanogaster. Members are predicted to have two sets of
six transmembrane helices.
Length = 742
Score = 26.5 bits (58), Expect = 6.0
Identities = 17/64 (26%), Positives = 29/64 (45%), Gaps = 1/64 (1%)
Query: 76 EEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEE 135
E + EG E + E+DE+ EGE + + E K E +G G +++ +
Sbjct: 69 EAADDEEEGEASSDATEGHDEDDEIYEGEYQGIPRAENGG-KDEIMADGAPKAGVRDELK 127
Query: 136 EREG 139
+ EG
Sbjct: 128 DGEG 131
>gnl|CDD|184860 PRK14858, tatA, twin arginine translocase protein A; Provisional.
Length = 108
Score = 25.6 bits (56), Expect = 6.0
Identities = 12/52 (23%), Positives = 22/52 (42%)
Query: 72 RDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
+EKE+ E E +E + E EED+ + + E +K + +
Sbjct: 57 AEEKEKAEKLAETKKEAEAPEAKAEEDQAPKPKGAGEPPATVASKAGDGAKA 108
>gnl|CDD|233223 TIGR00990, 3a0801s09, mitochondrial precursor proteins import
receptor (72 kDa mitochondrial outermembrane protein)
(mitochondrial import receptor for the ADP/ATP carrier)
(translocase of outermembrane tom70). [Transport and
binding proteins, Amino acids, peptides and amines].
Length = 615
Score = 26.5 bits (58), Expect = 6.1
Identities = 18/61 (29%), Positives = 28/61 (45%), Gaps = 7/61 (11%)
Query: 66 DQAKKRRDEKEEKEGGREGGEEMKGK-------EENGEEDEVEEGEEEEEKKKEEKAKKK 118
+ +KR+ ++E EG E + K +E E DE EE++K+ AK K
Sbjct: 72 KERRKRKQAEKETEGKTEEKKSTAPKNAPVEPADELPEIDESSVANLSEEERKKYAAKLK 131
Query: 119 E 119
E
Sbjct: 132 E 132
>gnl|CDD|227891 COG5604, COG5604, Uncharacterized conserved protein [Function
unknown].
Length = 523
Score = 26.4 bits (58), Expect = 6.2
Identities = 13/57 (22%), Positives = 23/57 (40%), Gaps = 8/57 (14%)
Query: 77 EKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKK--------EQEEEGG 125
K+ R + E+++E G + K KE+ +K E+EE+G
Sbjct: 23 RKQLARSKKVYGTKNRNSHTENKMESGTNDNNKNKEKLSKLYSDVDSSSSEEEEDGS 79
>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57. Eukaryotic
cilia and flagella are specialised organelles found at
the periphery of cells of diverse organisms.
Intra-flagellar transport (IFT) is required for the
assembly and maintenance of eukaryotic cilia and
flagella, and consists of the bidirectional movement of
large protein particles between the base and the distal
tip of the organelle. IFT particles contain multiple
copies of two distinct protein complexes, A and B, which
contain at least 6 and 11 protein subunits. IFT57 is
part of complex B but is not, however, required for the
core subunits to stay associated. This protein is known
as Huntington-interacting protein-1 in humans.
Length = 355
Score = 26.2 bits (58), Expect = 6.2
Identities = 15/77 (19%), Positives = 35/77 (45%)
Query: 60 LLCVLDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKE 119
+L L D A K++ ++ +E + +E+ E +EE EEE E ++ + + +
Sbjct: 105 VLDDLADAALKKKGFSFKRPKYPNEEDEEENVDEDDAEIILEEVEEEVEIEEVDDDEGTQ 164
Query: 120 QEEEGGGGGGEQEKEEE 136
+ + G + ++
Sbjct: 165 ETKYKRGDTSLTPQAKD 181
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 26.6 bits (59), Expect = 6.2
Identities = 14/55 (25%), Positives = 29/55 (52%), Gaps = 3/55 (5%)
Query: 70 KRRDEKEEKEGGREGGEEMKGKEENGE---EDEVEEGEEEEEKKKEEKAKKKEQE 121
+ R+++E++ G+ E K K+ N E ++ + EE+ + +K K K+Q
Sbjct: 1521 ENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNKKQY 1575
Score = 26.6 bits (59), Expect = 6.5
Identities = 12/31 (38%), Positives = 20/31 (64%)
Query: 93 ENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
E ++ + EE+ KKK++K KKKE+E +
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKKEKKKEEEYK 760
Score = 26.2 bits (58), Expect = 7.7
Identities = 15/44 (34%), Positives = 27/44 (61%), Gaps = 6/44 (13%)
Query: 94 NGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEER 137
NG++ E + + EEK K++K K+K++EEE + +E+ R
Sbjct: 729 NGKDAEFKISDSVEEKTKKKKKKEKKKEEE------YKREEKAR 766
Score = 26.2 bits (58), Expect = 8.8
Identities = 17/63 (26%), Positives = 27/63 (42%), Gaps = 9/63 (14%)
Query: 69 KKRRDEKEEKEGGREGGEEMKGKE------ENGEEDEVEEGEEEEEKKKEEKAK--KKEQ 120
+ R E++E G E + + K N E++ +EE E + KK + K K
Sbjct: 1521 ENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKN-IEEDYAESDIKKRKNKKQYKSNT 1579
Query: 121 EEE 123
E E
Sbjct: 1580 EAE 1582
>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
Reviewed.
Length = 460
Score = 26.5 bits (58), Expect = 6.4
Identities = 6/73 (8%), Positives = 18/73 (24%), Gaps = 1/73 (1%)
Query: 66 DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG 125
+A R++ + G ++ + E E + E ++
Sbjct: 367 SEADIEREQPGDLAGQAPAAHQVDAEAA-SAAPEEPAALASEAHDETEPEVPEKAAPIPD 425
Query: 126 GGGGEQEKEEERE 138
++
Sbjct: 426 PAKPDELAVAGPG 438
>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
subunit. The alpha-subunit of the TRAP complex (TRAP
alpha) is a single-spanning membrane protein of the
endoplasmic reticulum (ER) which is found in proximity
of nascent polypeptide chains translocating across the
membrane.
Length = 281
Score = 26.3 bits (58), Expect = 6.5
Identities = 21/64 (32%), Positives = 25/64 (39%), Gaps = 7/64 (10%)
Query: 52 LVEGKPNGLLCVLDDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKK 111
L G G D E EE E +E EEDE E EEEE+++
Sbjct: 19 LTFGNGKGPFASAQDLT-----EDEEAEDDVVDEDEEDEAVV--EEDENELTEEEEDEEG 71
Query: 112 EEKA 115
E KA
Sbjct: 72 EVKA 75
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 25.7 bits (57), Expect = 6.7
Identities = 13/38 (34%), Positives = 23/38 (60%)
Query: 77 EKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEK 114
+E + EE + +E++ E+D+ +E EEEE K+ K
Sbjct: 109 SEEDESDDDEEDEEEEDDEEDDDEDESEEEESPVKKVK 146
>gnl|CDD|218581 pfam05416, Peptidase_C37, Southampton virus-type processing
peptidase. Corresponds to Merops family C37.
Norwalk-like viruses (NLVs), including the Southampton
virus, cause acute non-bacterial gastroenteritis in
humans. The NLV genome encodes three open reading frames
(ORFs). ORF1 encodes a polyprotein, which is processed
by the viral protease into six proteins.
Length = 535
Score = 26.4 bits (58), Expect = 6.7
Identities = 23/72 (31%), Positives = 32/72 (44%), Gaps = 5/72 (6%)
Query: 66 DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGG 125
D+ KK R+E+ GG+ +E E EE E+ E + E EE+ K Q G
Sbjct: 259 DEYKKIREER----GGKYSIQEYLEDRERYEE-ELAERQATEADFCEEEEAKIRQRIFGL 313
Query: 126 GGGGEQEKEEER 137
+Q KEE
Sbjct: 314 RKTRKQRKEERA 325
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 26.2 bits (58), Expect = 7.0
Identities = 15/30 (50%), Positives = 18/30 (60%), Gaps = 4/30 (13%)
Query: 94 NGEEDEVE----EGEEEEEKKKEEKAKKKE 119
N E+EVE EEE KKK++K KKK
Sbjct: 163 NPTEEEVELLKARLEEERAKKKKKKKKKKT 192
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 26.2 bits (58), Expect = 7.1
Identities = 11/64 (17%), Positives = 36/64 (56%), Gaps = 3/64 (4%)
Query: 65 DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEE---DEVEEGEEEEEKKKEEKAKKKEQE 121
+ K++ ++ +++G ++ ++ K ++ + + + E +E E K+ +K KKK+++
Sbjct: 6 GENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKK 65
Query: 122 EEGG 125
++
Sbjct: 66 KKKK 69
Score = 26.2 bits (58), Expect = 7.2
Identities = 10/63 (15%), Positives = 29/63 (46%)
Query: 74 EKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEK 133
K++K+ ++ G + K K+ + D+ + E + +E K+ +++ +++K
Sbjct: 10 VKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKKK 69
Query: 134 EEE 136
Sbjct: 70 NLG 72
>gnl|CDD|185638 PTZ00459, PTZ00459, mucin-associated surface protein (MASP);
Provisional.
Length = 291
Score = 26.3 bits (57), Expect = 7.3
Identities = 17/55 (30%), Positives = 27/55 (49%)
Query: 65 DDQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKE 119
D+ +E ++ +GG E EE K + ++G+E V G EKK K+ E
Sbjct: 78 DEDVDDDSEEGDDDDGGAEDEEEEKVRGQSGQEGTVALGSGSTEKKLIGSEKQTE 132
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 25.7 bits (57), Expect = 7.3
Identities = 15/53 (28%), Positives = 27/53 (50%)
Query: 69 KKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQE 121
KK R + +E EE + + E +E+E + E+E K ++ K++E E
Sbjct: 88 KKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENE 140
>gnl|CDD|217830 pfam03986, Autophagy_N, Autophagocytosis associated protein (Atg3),
N-terminal domain. Autophagocytosis is a
starvation-induced process responsible for transport of
cytoplasmic proteins to the lysosome/vacuole. Atg3 is a
ubiquitin like modifier that is topologically similar to
the canonical E2 enzyme. It catalyzes the conjugation of
Atg8 and phosphatidylethanolamine.
Length = 146
Score = 25.8 bits (57), Expect = 7.5
Identities = 8/52 (15%), Positives = 25/52 (48%)
Query: 72 RDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
DE E+ G K K++ +E+++ E ++++ + ++ +++
Sbjct: 93 EDEDEDDGWVTTHGNRDKQKDDIADEEDIPEIGDDDDDVVDSSDADEDDDDD 144
>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi. This family includes
the fifth essential DNA polymerase in yeast EC:2.7.7.7.
Pol5p is localised exclusively to the nucleolus and
binds near or at the enhancer region of rRNA-encoding
DNA repeating units.
Length = 784
Score = 26.0 bits (57), Expect = 7.8
Identities = 15/56 (26%), Positives = 28/56 (50%)
Query: 82 REGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEER 137
E + E EEDE + E ++++ + E + E E E G GE++++E+
Sbjct: 635 DENKSRHQQLFEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDD 690
Score = 26.0 bits (57), Expect = 8.4
Identities = 18/68 (26%), Positives = 33/68 (48%), Gaps = 2/68 (2%)
Query: 57 PNGLLCVLDD-QAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKA 115
GL +LD +A + + ++ G E E+ E + +EDE E E+ E + + +
Sbjct: 622 NEGLQLILDVLKADENKSRHQQLFEGEEEDEDDLE-ETDDDEDECEAIEDSESESESDGE 680
Query: 116 KKKEQEEE 123
+E E+E
Sbjct: 681 DGEEDEQE 688
>gnl|CDD|222918 PHA02687, PHA02687, ORF061 late transcription factor VLTF-4;
Provisional.
Length = 231
Score = 25.7 bits (56), Expect = 7.9
Identities = 17/60 (28%), Positives = 30/60 (50%), Gaps = 2/60 (3%)
Query: 70 KRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGG 129
KRR +K+ K E + K E+ D+ ++ EE+E +E + ++ + EGG G
Sbjct: 87 KRRTKKKAKADKPE--KSPKAVEKLCPPDDRDDKNEEKEPTEEAQRNEESGDAEGGASGR 144
>gnl|CDD|203437 pfam06405, RCC_reductase, Red chlorophyll catabolite reductase (RCC
reductase). This family consists of several red
chlorophyll catabolite reductase (RCC reductase)
proteins. Red chlorophyll catabolite (RCC) reductase
(RCCR) and pheophorbide (Pheide) a oxygenase (PaO)
catalyze the key reaction of chlorophyll catabolism,
porphyrin macrocycle cleavage of Pheide a to a primary
fluorescent catabolite (pFCC).
Length = 255
Score = 25.9 bits (57), Expect = 8.0
Identities = 11/27 (40%), Positives = 16/27 (59%)
Query: 95 GEEDEVEEGEEEEEKKKEEKAKKKEQE 121
G E EVEE E E +K++ ++K E
Sbjct: 202 GGEREVEESEREALEKRDLIVRRKSIE 228
>gnl|CDD|167284 PRK01833, tatA, twin arginine translocase protein A; Provisional.
Length = 74
Score = 24.8 bits (54), Expect = 8.0
Identities = 16/45 (35%), Positives = 26/45 (57%), Gaps = 5/45 (11%)
Query: 81 GREGGEEMKG-----KEENGEEDEVEEGEEEEEKKKEEKAKKKEQ 120
G + GE +KG ++ ++ E E+ E +E E+KAK+KEQ
Sbjct: 29 GTDLGESVKGFKKAMADDKPKDAEFEKVEAKEAASTEQKAKEKEQ 73
>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein. This
protein is covalently attached to the terminii of
replicating DNA in vivo.
Length = 548
Score = 26.1 bits (58), Expect = 8.1
Identities = 16/40 (40%), Positives = 20/40 (50%)
Query: 69 KKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEE 108
++RR E E EE + EE EE+E EE EEE
Sbjct: 294 RRRRPPPSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333
>gnl|CDD|221303 pfam11902, DUF3422, Protein of unknown function (DUF3422). This
family of proteins are functionally uncharacterized.
This protein is found in bacteria, archaea and
eukaryotes. Proteins in this family are typically
between 426 to 444 amino acids in length.
Length = 419
Score = 26.0 bits (58), Expect = 8.2
Identities = 9/24 (37%), Positives = 11/24 (45%), Gaps = 1/24 (4%)
Query: 35 GIRW-RHIEFSDNTLCLQLVEGKP 57
+RW RH EFS T +P
Sbjct: 74 RLRWERHTEFSTYTFFRPGPGDEP 97
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 25.9 bits (57), Expect = 8.3
Identities = 12/40 (30%), Positives = 22/40 (55%)
Query: 74 EKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEE 113
EK+ E++ ++E EE+E EE EE+E+ ++
Sbjct: 154 SMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDD 193
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 26.0 bits (57), Expect = 8.3
Identities = 13/58 (22%), Positives = 28/58 (48%), Gaps = 1/58 (1%)
Query: 66 DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
+A ++ +KE ++ + K E+ ++ E ++ + EE K K+ K + E E
Sbjct: 83 QRAAEQARQKELEQRAAAE-KAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAE 139
Score = 26.0 bits (57), Expect = 8.7
Identities = 13/58 (22%), Positives = 32/58 (55%)
Query: 66 DQAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEE 123
AKK ++ +++ E E E+ + E+ +++ + E+ K+ E+A K+ +E++
Sbjct: 61 PAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQ 118
>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component. This
family is a region of the Myb-Related Cdc5p/Cef1
proteins, in fungi, and is part of the pre-mRNA splicing
factor complex.
Length = 363
Score = 25.8 bits (57), Expect = 8.4
Identities = 11/32 (34%), Positives = 18/32 (56%)
Query: 91 KEENGEEDEVEEGEEEEEKKKEEKAKKKEQEE 122
+E E+E+EE + + +K + KEQEE
Sbjct: 155 EEPEEMEEELEEDAADRDARKRAAEEAKEQEE 186
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 25.8 bits (56), Expect = 8.7
Identities = 22/62 (35%), Positives = 35/62 (56%)
Query: 74 EKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEK 133
E+EEKE RE EE++ E + ++ + + EE +KEEK + E+EE+ G E+
Sbjct: 112 EEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGSLEENN 171
Query: 134 EE 135
E
Sbjct: 172 GE 173
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
Length = 191
Score = 25.5 bits (56), Expect = 8.9
Identities = 13/56 (23%), Positives = 24/56 (42%)
Query: 67 QAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEE 122
+ R +K K+ ++ E+ + E EE E EE +KK + + K +
Sbjct: 54 KKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQNTA 109
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 25.3 bits (56), Expect = 9.0
Identities = 18/39 (46%), Positives = 28/39 (71%), Gaps = 2/39 (5%)
Query: 100 VEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEERE 138
+EE EE+E++KEEK +K+E+E+E G G +KEE +
Sbjct: 14 IEEKREEKEREKEEKERKEEKEKE--WGKGLVQKEEREK 50
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 25.5 bits (56), Expect = 9.1
Identities = 11/33 (33%), Positives = 21/33 (63%)
Query: 92 EENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEG 124
EE+ +E + E EE++ + K +KKE++E+
Sbjct: 93 EESEKEQKEVSKETEEKEAIKAKKEKKEKKEKK 125
>gnl|CDD|178680 PLN03134, PLN03134, glycine-rich RNA-binding protein 4;
Provisional.
Length = 144
Score = 25.4 bits (55), Expect = 9.1
Identities = 13/44 (29%), Positives = 15/44 (34%)
Query: 86 EEMKGKEENGEEDEVEEGEEEEEKKKEEKAKKKEQEEEGGGGGG 129
EM GKE NG V + + GG GGG
Sbjct: 94 SEMDGKELNGRHIRVNPANDRPSAPRAYGGGGGYSGGGGGYGGG 137
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 26.0 bits (57), Expect = 9.1
Identities = 13/38 (34%), Positives = 23/38 (60%), Gaps = 6/38 (15%)
Query: 100 VEEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEEER 137
+EE E EK +E+A+++ Q EE ++ +EEE+
Sbjct: 251 LEEKRRELEKLAKEEAERERQAEE------QRRREEEK 282
>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 187 and 201 amino
acids in length. There is a single completely conserved
residue Q that may be functionally important.
Length = 158
Score = 25.2 bits (56), Expect = 9.3
Identities = 14/73 (19%), Positives = 28/73 (38%), Gaps = 13/73 (17%)
Query: 67 QAKKRRDEKEEKEGGREGGEEMKGKEENGEEDEVEEGEEE-EEKKKEEKAKKKEQEEEGG 125
QAK K+ E ++ E E++ +++K +++ KE E +
Sbjct: 45 QAKS------IKQHLAELAADLLELEAAAPRAELQAKIARYKKEKARYRSEAKELEAKA- 97
Query: 126 GGGGEQEKEEERE 138
+E E E +
Sbjct: 98 -----KEAEAESD 105
>gnl|CDD|221429 pfam12118, SprA-related, SprA-related family. This protein is
found in bacteria. Proteins in this family are typically
between 234 to 465 amino acids in length. There is a
conserved GEV sequence motif.Most members are annotated
as being SprA-related.
Length = 261
Score = 25.5 bits (56), Expect = 9.4
Identities = 16/63 (25%), Positives = 33/63 (52%), Gaps = 7/63 (11%)
Query: 77 EKEGGREGGEEMKGKEENGEEDEV-EEGEEEEEKKKEEKAKKKEQEEEGGGGGGEQEKEE 135
G E K E+ +E+ V EE E+++ + ++ +K+QE++ +QE+++
Sbjct: 59 STAAGSASQELQKQASESNDEEVVGEEEPEKDQAEAGKQLSQKQQEQQ------QQEEKQ 112
Query: 136 ERE 138
RE
Sbjct: 113 VRE 115
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.303 0.130 0.363
Gapped
Lambda K H
0.267 0.0812 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 7,335,791
Number of extensions: 704784
Number of successful extensions: 7451
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4520
Number of HSP's successfully gapped: 1441
Length of query: 139
Length of database: 10,937,602
Length adjustment: 87
Effective length of query: 52
Effective length of database: 7,078,804
Effective search space: 368097808
Effective search space used: 368097808
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 17 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (21.7 bits)
S2: 54 (24.4 bits)