RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy5721
(290 letters)
>gnl|CDD|238665 cd01369, KISc_KHC_KIF5, Kinesin motor domain, kinesin heavy chain
(KHC) or KIF5-like subgroup. Members of this group have
been associated with organelle transport. This catalytic
(head) domain has ATPase activity and belongs to the
larger group of P-loop NTPases. Kinesins are
microtubule-dependent molecular motors that play
important roles in intracellular transport and in cell
division. In most kinesins, the motor domain is found at
the N-terminus (N-type). N-type kinesins are (+)
end-directed motors, i.e. they transport cargo towards
the (+) end of the microtubule. Kinesin motor domains
hydrolyze ATP at a rate of about 80 per second, and move
along the microtubule at a speed of about 6400 Angstroms
per second. To achieve that, kinesin head groups work in
pairs. Upon replacing ADP with ATP, a kinesin motor
domain increases its affinity for microtubule binding
and locks in place. Also, the neck linker binds to the
motor domain, which repositions the other head domain
through the coiled-coil domain close to a second tubulin
dimer, about 80 Angstroms along the microtubule.
Meanwhile, ATP hydrolysis takes place, and when the
second head domain binds to the microtubule, the first
domain again replaces ADP with ATP, triggering a
conformational change that pulls the first domain
forward.
Length = 325
Score = 150 bits (380), Expect = 3e-43
Identities = 54/67 (80%), Positives = 60/67 (89%)
Query: 8 ALRNVISALADGNKTHIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESETKSTLDF 67
AL NVI+AL DG THIPYRDSKLTRILQ+SLGGN+RTT+IICCSP+S+NESET STL F
Sbjct: 259 ALGNVINALTDGKSTHIPYRDSKLTRILQDSLGGNSRTTLIICCSPSSYNESETLSTLRF 318
Query: 68 GRRAKTI 74
G RAKTI
Sbjct: 319 GARAKTI 325
>gnl|CDD|214526 smart00129, KISc, Kinesin motor, catalytic domain. ATPase.
Microtubule-dependent molecular motors that play
important roles in intracellular transport of organelles
and in cell division.
Length = 335
Score = 123 bits (311), Expect = 4e-33
Identities = 47/75 (62%), Positives = 55/75 (73%), Gaps = 1/75 (1%)
Query: 8 ALRNVISALADGNK-THIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESETKSTLD 66
AL NVI+ALA +K HIPYRDSKLTR+LQ+SLGGN++T +I SP+S N ET STL
Sbjct: 261 ALGNVINALAQHSKSRHIPYRDSKLTRLLQDSLGGNSKTLMIANVSPSSSNLEETLSTLR 320
Query: 67 FGRRAKTIKNVVTVN 81
F RAK IKN VN
Sbjct: 321 FASRAKEIKNKPIVN 335
>gnl|CDD|215803 pfam00225, Kinesin, Kinesin motor domain.
Length = 326
Score = 122 bits (309), Expect = 7e-33
Identities = 41/67 (61%), Positives = 50/67 (74%)
Query: 8 ALRNVISALADGNKTHIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESETKSTLDF 67
AL NVI+ALA+ +HIPYRDSKLTR+LQ+SLGGN++T +I SP+ N ET STL F
Sbjct: 260 ALGNVINALAENQSSHIPYRDSKLTRLLQDSLGGNSKTLMIANISPSDSNYEETLSTLRF 319
Query: 68 GRRAKTI 74
RAK I
Sbjct: 320 ASRAKNI 326
>gnl|CDD|238054 cd00106, KISc, Kinesin motor domain. This catalytic (head) domain
has ATPase activity and belongs to the larger group of
P-loop NTPases. Kinesins are microtubule-dependent
molecular motors that play important roles in
intracellular transport and in cell division. In most
kinesins, the motor domain is found at the N-terminus
(N-type), in some its is found in the middle (M-type),
or C-terminal (C-type). N-type and M-type kinesins are
(+) end-directed motors, while C-type kinesins are (-)
end-directed motors, i.e. they transport cargo towards
the (-) end of the microtubule. Kinesin motor domains
hydrolyze ATP at a rate of about 80 per second, and move
along the microtubule at a speed of about 6400 Angstroms
per second. To achieve that, kinesin head groups work in
pairs. Upon replacing ADP with ATP, a kinesin motor
domain increases its affinity for microtubule binding
and locks in place. Also, the neck linker binds to the
motor domain, which repositions the other head domain
through the coiled-coil domain close to a second tubulin
dimer, about 80 Angstroms along the microtubule.
Meanwhile, ATP hydrolysis takes place, and when the
second head domain binds to the microtubule, the first
domain again replaces ADP with ATP, triggering a
conformational change that pulls the first domain
forward.
Length = 328
Score = 113 bits (285), Expect = 2e-29
Identities = 43/66 (65%), Positives = 50/66 (75%), Gaps = 1/66 (1%)
Query: 8 ALRNVISALADGNK-THIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESETKSTLD 66
AL NVISAL+ G K HIPYRDSKLTR+LQ+SLGGN++T +I SP+S N ET STL
Sbjct: 263 ALGNVISALSSGQKKKHIPYRDSKLTRLLQDSLGGNSKTLMIANISPSSENYDETLSTLR 322
Query: 67 FGRRAK 72
F RAK
Sbjct: 323 FASRAK 328
>gnl|CDD|238668 cd01372, KISc_KIF4, Kinesin motor domain, KIF4-like subfamily.
Members of this group seem to perform a variety of
functions, and have been implicated in neuronal
organelle transport and chromosome segregation during
mitosis. This catalytic (head) domain has ATPase
activity and belongs to the larger group of P-loop
NTPases. Kinesins are microtubule-dependent molecular
motors that play important roles in intracellular
transport and in cell division. In most kinesins, the
motor domain is found at the N-terminus (N-type). N-type
kinesins are (+) end-directed motors, i.e. they
transport cargo towards the (+) end of the microtubule.
Kinesin motor domains hydrolyze ATP at a rate of about
80 per second, and move along the microtubule at a speed
of about 6400 Angstroms per second. To achieve that,
kinesin head groups work in pairs. Upon replacing ADP
with ATP, a kinesin motor domain increases its affinity
for microtubule binding and locks in place. Also, the
neck linker binds to the motor domain, which repositions
the other head domain through the coiled-coil domain
close to a second tubulin dimer, about 80 Angstroms
along the microtubule. Meanwhile, ATP hydrolysis takes
place, and when the second head domain binds to the
microtubule, the first domain again replaces ADP with
ATP, triggering a conformational change that pulls the
first domain forward.
Length = 341
Score = 111 bits (279), Expect = 2e-28
Identities = 42/70 (60%), Positives = 52/70 (74%), Gaps = 2/70 (2%)
Query: 8 ALRNVISALADGNK--THIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESETKSTL 65
AL NVISAL D +K +H+PYRDSKLTR+LQ+SLGGN+ T +I C SPA N ET +TL
Sbjct: 272 ALGNVISALGDESKKGSHVPYRDSKLTRLLQDSLGGNSHTLMIACVSPADSNFEETLNTL 331
Query: 66 DFGRRAKTIK 75
+ RA+ IK
Sbjct: 332 KYANRARNIK 341
>gnl|CDD|238667 cd01371, KISc_KIF3, Kinesin motor domain, kinesins II or KIF3_like
proteins. Subgroup of kinesins, which form heterotrimers
composed of 2 kinesins and one non-motor accessory
subunit. Kinesins II play important roles in ciliary
transport, and have been implicated in neuronal
transport, melanosome transport, the secretory pathway,
and mitosis. This catalytic (head) domain has ATPase
activity and belongs to the larger group of P-loop
NTPases. Kinesins are microtubule-dependent molecular
motors that play important roles in intracellular
transport and in cell division. In this group the motor
domain is found at the N-terminus (N-type). N-type
kinesins are (+) end-directed motors, i.e. they
transport cargo towards the (+) end of the microtubule.
Kinesin motor domains hydrolyze ATP at a rate of about
80 per second, and move along the microtubule at a speed
of about 6400 Angstroms per second. To achieve that,
kinesin head groups work in pairs. Upon replacing ADP
with ATP, a kinesin motor domain increases its affinity
for microtubule binding and locks in place. Also, the
neck linker binds to the motor domain, which repositions
the other head domain through the coiled-coil domain
close to a second tubulin dimer, about 80 Angstroms
along the microtubule. Meanwhile, ATP hydrolysis takes
place, and when the second head domain binds to the
microtubule, the first domain again replaces ADP with
ATP, triggering a conformational change that pulls the
first domain forward.
Length = 333
Score = 107 bits (268), Expect = 5e-27
Identities = 42/67 (62%), Positives = 49/67 (73%)
Query: 8 ALRNVISALADGNKTHIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESETKSTLDF 67
AL NVISAL DG THIPYRDSKLTR+LQ+SLGGN++T + PA +N ET STL +
Sbjct: 267 ALGNVISALVDGKSTHIPYRDSKLTRLLQDSLGGNSKTVMCANIGPADYNYDETLSTLRY 326
Query: 68 GRRAKTI 74
RAK I
Sbjct: 327 ANRAKNI 333
>gnl|CDD|238660 cd01364, KISc_BimC_Eg5, Kinesin motor domain, BimC/Eg5 spindle pole
proteins, participate in spindle assembly and chromosome
segregation during cell division. This catalytic (head)
domain has ATPase activity and belongs to the larger
group of P-loop NTPases. Kinesins are
microtubule-dependent molecular motors that play
important roles in intracellular transport and in cell
division. In most kinesins, the motor domain is found at
the N-terminus (N-type), N-type kinesins are (+)
end-directed motors, i.e. they transport cargo towards
the (+) end of the microtubule. Kinesin motor domains
hydrolyze ATP at a rate of about 80 per second, and move
along the microtubule at a speed of about 6400 Angstroms
per second. To achieve that, kinesin head groups work in
pairs. Upon replacing ADP with ATP, a kinesin motor
domain increases its affinity for microtubule binding
and locks in place. Also, the neck linker binds to the
motor domain, which repositions the other head domain
through the coiled-coil domain close to a second tubulin
dimer, about 80 Angstroms along the microtubule.
Meanwhile, ATP hydrolysis takes place, and when the
second head domain binds to the microtubule, the first
domain again replaces ADP with ATP, triggering a
conformational change that pulls the first domain
forward.
Length = 352
Score = 98.1 bits (245), Expect = 1e-23
Identities = 41/70 (58%), Positives = 50/70 (71%), Gaps = 1/70 (1%)
Query: 12 VISALADGNKTHIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESETKSTLDFGRRA 71
VI+AL + HIPYR+SKLTR+LQ+SLGG +T+II SPAS N ET STL++ RA
Sbjct: 282 VINALVE-KSPHIPYRESKLTRLLQDSLGGRTKTSIIATISPASINLEETLSTLEYAHRA 340
Query: 72 KTIKNVVTVN 81
K IKN VN
Sbjct: 341 KNIKNKPEVN 350
>gnl|CDD|238661 cd01365, KISc_KIF1A_KIF1B, Kinesin motor domain, KIF1_like
proteins. KIF1A (Unc104) transports synaptic vesicles to
the nerve terminal, KIF1B has been implicated in
transport of mitochondria. Both proteins are expressed
in neurons. This catalytic (head) domain has ATPase
activity and belongs to the larger group of P-loop
NTPases. Kinesins are microtubule-dependent molecular
motors that play important roles in intracellular
transport and in cell division. In most kinesins, the
motor domain is found at the N-terminus (N-type). N-type
kinesins are (+) end-directed motors, i.e. they
transport cargo towards the (+) end of the microtubule.
In contrast to the majority of dimeric kinesins, most
KIF1A/Unc104 kinesins are monomeric motors. A
lysine-rich loop in KIF1A binds to the negatively
charged C-terminus of tubulin and compensates for the
lack of a second motor domain, allowing KIF1A to move
processively.
Length = 356
Score = 97.8 bits (244), Expect = 2e-23
Identities = 41/80 (51%), Positives = 49/80 (61%), Gaps = 7/80 (8%)
Query: 9 LRNVISALAD-------GNKTHIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESET 61
L VISALAD + IPYRDS LT +L+E+LGGN++T +I SPA N ET
Sbjct: 277 LGKVISALADNSSAKSKKKSSFIPYRDSVLTWLLKENLGGNSKTAMIATISPADINYEET 336
Query: 62 KSTLDFGRRAKTIKNVVTVN 81
STL + RAK I NV VN
Sbjct: 337 LSTLRYADRAKKIVNVAVVN 356
>gnl|CDD|238670 cd01374, KISc_CENP_E, Kinesin motor domain, CENP-E/KIP2-like
subgroup, involved in chromosome movement and/or spindle
elongation during mitosis. This catalytic (head) domain
has ATPase activity and belongs to the larger group of
P-loop NTPases. Kinesins are microtubule-dependent
molecular motors that play important roles in
intracellular transport and in cell division. In most
kinesins, the motor domain is found at the N-terminus
(N-type). N-type kinesins are (+) end-directed motors,
i.e. they transport cargo towards the (+) end of the
microtubule. Kinesin motor domains hydrolyze ATP at a
rate of about 80 per second, and move along the
microtubule at a speed of about 6400 Angstroms per
second. To achieve that, kinesin head groups work in
pairs. Upon replacing ADP with ATP, a kinesin motor
domain increases its affinity for microtubule binding
and locks in place. Also, the neck linker binds to the
motor domain, which repositions the other head domain
through the coiled-coil domain close to a second tubulin
dimer, about 80 Angstroms along the microtubule.
Meanwhile, ATP hydrolysis takes place, and when the
second head domain binds to the microtubule, the first
domain again replaces ADP with ATP, triggering a
conformational change that pulls the first domain
forward.
Length = 321
Score = 94.3 bits (235), Expect = 2e-22
Identities = 42/68 (61%), Positives = 47/68 (69%), Gaps = 1/68 (1%)
Query: 8 ALRNVISALADG-NKTHIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESETKSTLD 66
L VIS L++G N HIPYRDSKLTRILQ SL GNART II SPAS + ET +TL
Sbjct: 254 TLGTVISKLSEGKNSGHIPYRDSKLTRILQPSLSGNARTAIICTISPASSHVEETLNTLK 313
Query: 67 FGRRAKTI 74
F RAK +
Sbjct: 314 FASRAKKV 321
>gnl|CDD|238666 cd01370, KISc_KIP3_like, Kinesin motor domain, KIP3-like subgroup.
The yeast kinesin KIP3 plays a role in positioning the
mitotic spindle. This catalytic (head) domain has ATPase
activity and belongs to the larger group of P-loop
NTPases. Kinesins are microtubule-dependent molecular
motors that play important roles in intracellular
transport and in cell division. In most kinesins, the
motor domain is found at the N-terminus (N-type). N-type
kinesins are (+) end-directed motors, i.e. they
transport cargo towards the (+) end of the microtubule.
Kinesin motor domains hydrolyze ATP at a rate of about
80 per second, and move along the microtubule at a speed
of about 6400 Angstroms per second. To achieve that,
kinesin head groups work in pairs. Upon replacing ADP
with ATP, a kinesin motor domain increases its affinity
for microtubule binding and locks in place. Also, the
neck linker binds to the motor domain, which repositions
the other head domain through the coiled-coil domain
close to a second tubulin dimer, about 80 Angstroms
along the microtubule. Meanwhile, ATP hydrolysis takes
place, and when the second head domain binds to the
microtubule, the first domain again replaces ADP with
ATP, triggering a conformational change that pulls the
first domain forward.
Length = 338
Score = 93.9 bits (234), Expect = 4e-22
Identities = 39/69 (56%), Positives = 49/69 (71%), Gaps = 2/69 (2%)
Query: 8 ALRNVISALADGNK--THIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESETKSTL 65
AL N I+AL DG K HIPYRDSKLTR+L++SLGGN +T +I SP+S + ET +TL
Sbjct: 270 ALGNCINALVDGKKKNKHIPYRDSKLTRLLKDSLGGNCKTVMIANISPSSSHYEETHNTL 329
Query: 66 DFGRRAKTI 74
+ RAK I
Sbjct: 330 KYANRAKNI 338
>gnl|CDD|227392 COG5059, KIP1, Kinesin-like protein [Cytoskeleton].
Length = 568
Score = 94.4 bits (235), Expect = 1e-21
Identities = 65/254 (25%), Positives = 104/254 (40%), Gaps = 33/254 (12%)
Query: 8 ALRNVISAL-ADGNKTHIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESETKSTLD 66
L NVI+AL HIPYR+SKLTR+LQ+SLGGN T +I SP+S + ET +TL
Sbjct: 269 TLGNVINALGDKKKSGHIPYRESKLTRLLQDSLGGNCNTRVICTISPSSNSFEETINTLK 328
Query: 67 FGRRAKTIKNVVTVNE----ELTAEEWKRRYEKEKEKSSRYKGKLEKLELELSR---WRN 119
F RAK+IKN + VN EE K +++ + + + + S +
Sbjct: 329 FASRAKSIKNKIQVNSSSDSSREIEEIKFDLSEDRSEIEILVFREQSQLSQSSLSGIFAY 388
Query: 120 GETVKEDEQVNLADPVDMAASIAPTPESAPASILPAIPGSGLMLGSLSNEERQKLEEERE 179
+++K+ E L +D+ + S + E ++ L+EE
Sbjct: 389 MQSLKK-ETETLKSRIDLIMK---------------------SIISGTFERKKLLKEEGW 426
Query: 180 RLYQQLD-EKDEEINQQSQYAEKLKEQIMEQEEL--ITTARRDYEQLQGEMTRLTQENES 236
+ L + E E+L ++ + +L + E T E+E
Sbjct: 427 KYKSTLQFLRIEIDRLLLLREEELSKKKTKIHKLNKLRHDLSSLLSSIPEETSDRVESEK 486
Query: 237 AKEEVKEVLQALEE 250
A + L
Sbjct: 487 ASKLRSSASTKLNL 500
>gnl|CDD|238662 cd01366, KISc_C_terminal, Kinesin motor domain,
KIFC2/KIFC3/ncd-like carboxy-terminal kinesins. Ncd is a
spindle motor protein necessary for chromosome
segregation in meiosis. KIFC2/KIFC3-like kinesins have
been implicated in motility of the Golgi apparatus as
well as dentritic and axonal transport in neurons. This
catalytic (head) domain has ATPase activity and belongs
to the larger group of P-loop NTPases. Kinesins are
microtubule-dependent molecular motors that play
important roles in intracellular transport and in cell
division. In this subgroup the motor domain is found at
the C-terminus (C-type). C-type kinesins are (-)
end-directed motors, i.e. they transport cargo towards
the (-) end of the microtubule. Kinesin motor domains
hydrolyze ATP at a rate of about 80 per second, and move
along the microtubule at a speed of about 6400 Angstroms
per second. To achieve that, kinesin head groups work in
pairs. Upon replacing ADP with ATP, a kinesin motor
domain increases its affinity for microtubule binding
and locks in place. Also, the neck linker binds to the
motor domain, which repositions the other head domain
through the coiled-coil domain close to a second tubulin
dimer, about 80 Angstroms along the microtubule.
Meanwhile, ATP hydrolysis takes place, and when the
second head domain binds to the microtubule, the first
domain again replaces ADP with ATP, triggering a
conformational change that pulls the first domain
forward.
Length = 329
Score = 84.5 bits (210), Expect = 7e-19
Identities = 32/69 (46%), Positives = 47/69 (68%), Gaps = 1/69 (1%)
Query: 8 ALRNVISALADGNKTHIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESETKSTLDF 67
AL +VISAL + H+PYR+SKLT +LQ+SLGGN++T + + SP N SET +L F
Sbjct: 261 ALGDVISALRSKDS-HVPYRNSKLTYLLQDSLGGNSKTLMFVNISPLESNLSETLCSLRF 319
Query: 68 GRRAKTIKN 76
R ++++
Sbjct: 320 ASRVRSVEL 328
>gnl|CDD|238669 cd01373, KISc_KLP2_like, Kinesin motor domain, KLP2-like subgroup.
Members of this subgroup seem to play a role in mitosis
and meiosis. This catalytic (head) domain has ATPase
activity and belongs to the larger group of P-loop
NTPases. Kinesins are microtubule-dependent molecular
motors that play important roles in intracellular
transport and in cell division. In most kinesins, the
motor domain is found at the N-terminus (N-type). N-type
kinesins are (+) end-directed motors, i.e. they
transport cargo towards the (+) end of the microtubule.
Kinesin motor domains hydrolyze ATP at a rate of about
80 per second, and move along the microtubule at a speed
of about 6400 Angstroms per second. To achieve that,
kinesin head groups work in pairs. Upon replacing ADP
with ATP, a kinesin motor domain increases its affinity
for microtubule binding and locks in place. Also, the
neck linker binds to the motor domain, which repositions
the other head domain through the coiled-coil domain
close to a second tubulin dimer, about 80 Angstroms
along the microtubule. Meanwhile, ATP hydrolysis takes
place, and when the second head domain binds to the
microtubule, the first domain again replaces ADP with
ATP, triggering a conformational change that pulls the
first domain forward.
Length = 337
Score = 81.4 bits (201), Expect = 9e-18
Identities = 40/69 (57%), Positives = 49/69 (71%), Gaps = 3/69 (4%)
Query: 9 LRNVISALAD---GNKTHIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESETKSTL 65
L +VI AL D G + H+PYRDSKLT +L++SLGGNA+TTII SP+S ET STL
Sbjct: 269 LGHVIMALVDVAHGKQRHVPYRDSKLTFLLRDSLGGNAKTTIIANVSPSSKCFGETLSTL 328
Query: 66 DFGRRAKTI 74
F +RAK I
Sbjct: 329 KFAQRAKLI 337
>gnl|CDD|215621 PLN03188, PLN03188, kinesin-12 family protein; Provisional.
Length = 1320
Score = 80.0 bits (197), Expect = 1e-16
Identities = 42/80 (52%), Positives = 55/80 (68%), Gaps = 4/80 (5%)
Query: 9 LRNVISALAD----GNKTHIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESETKST 64
L N+I+ LA+ G + HIPYRDS+LT +LQESLGGNA+ ++ SP+ +SET ST
Sbjct: 365 LGNLINILAEISQTGKQRHIPYRDSRLTFLLQESLGGNAKLAMVCAISPSQSCKSETFST 424
Query: 65 LDFGRRAKTIKNVVTVNEEL 84
L F +RAK IKN VNE +
Sbjct: 425 LRFAQRAKAIKNKAVVNEVM 444
>gnl|CDD|238663 cd01367, KISc_KIF2_like, Kinesin motor domain, KIF2-like group.
KIF2 is a protein expressed in neurons, which has been
associated with axonal transport and neuron development;
alternative splice forms have been implicated in
lysosomal translocation. This catalytic (head) domain
has ATPase activity and belongs to the larger group of
P-loop NTPases. Kinesins are microtubule-dependent
molecular motors that play important roles in
intracellular transport and in cell division. In this
subgroup the motor domain is found in the middle
(M-type) of the protein chain. M-type kinesins are (+)
end-directed motors, i.e. they transport cargo towards
the (+) end of the microtubule. Kinesin motor domains
hydrolyze ATP at a rate of about 80 per second, and move
along the microtubule at a speed of about 6400 Angstroms
per second (KIF2 may be slower). To achieve that,
kinesin head groups work in pairs. Upon replacing ADP
with ATP, a kinesin motor domain increases its affinity
for microtubule binding and locks in place. Also, the
neck linker binds to the motor domain, which repositions
the other head domain through the coiled-coil domain
close to a second tubulin dimer, about 80 Angstroms
along the microtubule. Meanwhile, ATP hydrolysis takes
place, and when the second head domain binds to the
microtubule, the first domain again replaces ADP with
ATP, triggering a conformational change that pulls the
first domain forward.
Length = 322
Score = 71.5 bits (176), Expect = 2e-14
Identities = 28/65 (43%), Positives = 43/65 (66%), Gaps = 1/65 (1%)
Query: 8 ALRNVISALADGNKTHIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESETKSTLDF 67
AL+ I ALA NK H+P+R SKLT++L++S GN++T +I SP++ + T +TL +
Sbjct: 259 ALKECIRALAS-NKAHVPFRGSKLTQVLRDSFIGNSKTVMIATISPSASSCEHTLNTLRY 317
Query: 68 GRRAK 72
R K
Sbjct: 318 ADRVK 322
>gnl|CDD|238659 cd01363, Motor_domain, Myosin and Kinesin motor domain. These
ATPases belong to the P-loop NTPase family and provide
the driving force in myosin and kinesin mediated
processes.
Length = 186
Score = 68.3 bits (167), Expect = 7e-14
Identities = 29/46 (63%), Positives = 39/46 (84%), Gaps = 1/46 (2%)
Query: 8 ALRNVISALADGNKTHIPYRDSKLTRILQESLGGNARTTIIICCSP 53
L NVISALA+ + +H+PYR+SKLTR+LQ+SLGGN+RT ++ C SP
Sbjct: 142 TLGNVISALAERD-SHVPYRESKLTRLLQDSLGGNSRTLMVACISP 186
>gnl|CDD|238671 cd01375, KISc_KIF9_like, Kinesin motor domain, KIF9-like subgroup;
might play a role in cell shape remodeling. This
catalytic (head) domain has ATPase activity and belongs
to the larger group of P-loop NTPases. Kinesins are
microtubule-dependent molecular motors that play
important roles in intracellular transport and in cell
division. In most kinesins, the motor domain is found at
the N-terminus (N-type). N-type kinesins are (+)
end-directed motors, i.e. they transport cargo towards
the (+) end of the microtubule. Kinesin motor domains
hydrolyze ATP at a rate of about 80 per second, and move
along the microtubule at a speed of about 6400 Angstroms
per second. To achieve that, kinesin head groups work in
pairs. Upon replacing ADP with ATP, a kinesin motor
domain increases its affinity for microtubule binding
and locks in place. Also, the neck linker binds to the
motor domain, which repositions the other head domain
through the coiled-coil domain close to a second tubulin
dimer, about 80 Angstroms along the microtubule.
Meanwhile, ATP hydrolysis takes place, and when the
second head domain binds to the microtubule, the first
domain again replaces ADP with ATP, triggering a
conformational change that pulls the first domain
forward.
Length = 334
Score = 69.8 bits (171), Expect = 1e-13
Identities = 29/64 (45%), Positives = 42/64 (65%)
Query: 9 LRNVISALADGNKTHIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESETKSTLDFG 68
L VI+AL++ +TH+PYR+SKLT +L++SLGGN +T ++ N ET STL F
Sbjct: 271 LEQVINALSEKARTHVPYRNSKLTHVLRDSLGGNCKTVMLATIWVEPSNLDETLSTLRFA 330
Query: 69 RRAK 72
+R
Sbjct: 331 QRVA 334
>gnl|CDD|238672 cd01376, KISc_KID_like, Kinesin motor domain, KIF22/Kid-like
subgroup. Members of this group might play a role in
regulating chromosomal movement along microtubules in
mitosis. This catalytic (head) domain has ATPase
activity and belongs to the larger group of P-loop
NTPases. Kinesins are microtubule-dependent molecular
motors that play important roles in intracellular
transport and in cell division. In most kinesins, the
motor domain is found at the N-terminus (N-type). N-type
kinesins are (+) end-directed motors, i.e. they
transport cargo towards the (+) end of the microtubule.
Kinesin motor domains hydrolyze ATP at a rate of about
80 per second, and move along the microtubule at a speed
of about 6400 Angstroms per second. To achieve that,
kinesin head groups work in pairs. Upon replacing ADP
with ATP, a kinesin motor domain increases its affinity
for microtubule binding and locks in place. Also, the
neck linker binds to the motor domain, which repositions
the other head domain through the coiled-coil domain
close to a second tubulin dimer, about 80 Angstroms
along the microtubule. Meanwhile, ATP hydrolysis takes
place, and when the second head domain binds to the
microtubule, the first domain again replaces ADP with
ATP, triggering a conformational change that pulls the
first domain forward.
Length = 319
Score = 66.7 bits (163), Expect = 1e-12
Identities = 29/65 (44%), Positives = 40/65 (61%), Gaps = 1/65 (1%)
Query: 8 ALRNVISALADGNKTHIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESETKSTLDF 67
L V+ AL G IPYR+SKLTR+LQ+SLGG +R ++ +P +T STL+F
Sbjct: 256 VLSKVVDALNKGLP-RIPYRESKLTRLLQDSLGGGSRCIMVANIAPERSFYQDTLSTLNF 314
Query: 68 GRRAK 72
R+K
Sbjct: 315 ASRSK 319
>gnl|CDD|238664 cd01368, KISc_KIF23_like, Kinesin motor domain, KIF23-like
subgroup. Members of this group may play a role in
mitosis. This catalytic (head) domain has ATPase
activity and belongs to the larger group of P-loop
NTPases. Kinesins are microtubule-dependent molecular
motors that play important roles in intracellular
transport and in cell division. In most kinesins, the
motor domain is found at the N-terminus (N-type). N-type
kinesins are (+) end-directed motors, i.e. they
transport cargo towards the (+) end of the microtubule.
Kinesin motor domains hydrolyze ATP at a rate of about
80 per second, and move along the microtubule at a speed
of about 6400 Angstroms per second. To achieve that,
kinesin head groups work in pairs. Upon replacing ADP
with ATP, a kinesin motor domain increases its affinity
for microtubule binding and locks in place. Also, the
neck linker binds to the motor domain, which repositions
the other head domain through the coiled-coil domain
close to a second tubulin dimer, about 80 Angstroms
along the microtubule. Meanwhile, ATP hydrolysis takes
place, and when the second head domain binds to the
microtubule, the first domain again replaces ADP with
ATP, triggering a conformational change that pulls the
first domain forward.
Length = 345
Score = 48.5 bits (116), Expect = 2e-06
Identities = 19/65 (29%), Positives = 31/65 (47%), Gaps = 2/65 (3%)
Query: 8 ALRNVISALADGNKTHIPYRDSKLTRILQESLGGNARTTIIICCSPASFNESETKSTLDF 67
LR + L+ +PYRDSKLT + Q G + +I+ +P + + ET + F
Sbjct: 283 VLRE--NQLSGSTNKMVPYRDSKLTHLFQNYFDGEGKARMIVNVNPCASDYDETLHVMKF 340
Query: 68 GRRAK 72
A+
Sbjct: 341 SAIAQ 345
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 45.1 bits (107), Expect = 3e-05
Identities = 24/82 (29%), Positives = 46/82 (56%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
EE ++LE E L ++L+E + E+ + + E+LKE+I +E + E+L+ +
Sbjct: 295 EEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLA 354
Query: 229 RLTQENESAKEEVKEVLQALEE 250
L + E +E++ +L+ LEE
Sbjct: 355 ELEEAKEELEEKLSALLEELEE 376
Score = 42.8 bits (101), Expect = 2e-04
Identities = 41/180 (22%), Positives = 78/180 (43%), Gaps = 11/180 (6%)
Query: 81 NEELTAEEWKRRYEKEK-----EKSSRYKGKLEKLELELSRWRNGETVKEDEQVNLADPV 135
EE+ E KR+ +E+ E+ + +L+ LE EL E E L + +
Sbjct: 778 KEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEI 837
Query: 136 DMAASIAPTPESAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEINQQ 195
+ E + + L L E+++LE+E + L ++ +E +EE+ +
Sbjct: 838 EELEEKLDELEEELEELEKELEELKEELEEL-EAEKEELEDELKELEEEKEELEEELREL 896
Query: 196 SQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENES-----AKEEVKEVLQALEE 250
+LKE+I + E + E+L+ E+ L +E E + E++ ++ LEE
Sbjct: 897 ESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLEE 956
Score = 41.6 bits (98), Expect = 3e-04
Identities = 24/88 (27%), Positives = 43/88 (48%)
Query: 170 ERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTR 229
E ++LEEE L QL++ +EE+ L++ + E + R E+L+ E+
Sbjct: 668 ELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAA 727
Query: 230 LTQENESAKEEVKEVLQALEEPTRTLLN 257
L +E E + ++E+ + LEE L
Sbjct: 728 LEEELEQLQSRLEELEEELEELEEELEE 755
Score = 41.2 bits (97), Expect = 5e-04
Identities = 28/82 (34%), Positives = 53/82 (64%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
EE +LEEE E L ++L+E ++EI + E+L+E++ E +E + + + E+L+GE++
Sbjct: 246 EELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEIS 305
Query: 229 RLTQENESAKEEVKEVLQALEE 250
L + E + E++E+ + LEE
Sbjct: 306 LLRERLEELENELEELEERLEE 327
Score = 41.2 bits (97), Expect = 6e-04
Identities = 26/86 (30%), Positives = 51/86 (59%)
Query: 165 SLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQ 224
E LEEE E+L +L+E +EE+ + + E+L+E++ E EE + + +L+
Sbjct: 719 EELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLK 778
Query: 225 GEMTRLTQENESAKEEVKEVLQALEE 250
E+ L ++ ++ +EE++E+ + LEE
Sbjct: 779 EEIEELEEKRQALQEELEELEEELEE 804
Score = 40.1 bits (94), Expect = 0.001
Identities = 37/164 (22%), Positives = 73/164 (44%), Gaps = 15/164 (9%)
Query: 87 EEWKRRYEKEKEKSSRYKGKLEKLELELSRWRNGETVKEDEQVNLADPVDMAASIAPTPE 146
EE +R+ E+ + + K +L LE EL + ++ E+E L + ++ E
Sbjct: 705 EELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELE 764
Query: 147 SAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQI 206
S+ E KL+EE E L ++ EE+ + + E+ + ++
Sbjct: 765 EELESL---------------EEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRL 809
Query: 207 MEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
E + + + E+L+ E+ L +E E +E++ E+ + LEE
Sbjct: 810 DALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEE 853
Score = 37.4 bits (87), Expect = 0.008
Identities = 21/83 (25%), Positives = 42/83 (50%)
Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
E + LEE ERL ++L++ EE+ + E+L+ ++ E E + E+L+ +
Sbjct: 403 KREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRL 462
Query: 228 TRLTQENESAKEEVKEVLQALEE 250
L +E +EE++ + + L
Sbjct: 463 KELERELAELQEELQRLEKELSS 485
Score = 35.8 bits (83), Expect = 0.026
Identities = 24/83 (28%), Positives = 46/83 (55%)
Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
+ +KLEEE + L +L ++ + + + E+L+ Q+ E + + + EQLQ +
Sbjct: 680 EAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRL 739
Query: 228 TRLTQENESAKEEVKEVLQALEE 250
L +E E +EE++E+ + LEE
Sbjct: 740 EELEEELEELEEELEELQERLEE 762
Score = 35.5 bits (82), Expect = 0.031
Identities = 28/82 (34%), Positives = 48/82 (58%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
EE ++ E+E E L +L+E EE+ + + +LKE+I E E I+ R E+L+ E+
Sbjct: 260 EELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELE 319
Query: 229 RLTQENESAKEEVKEVLQALEE 250
L + E KE+++ + + LEE
Sbjct: 320 ELEERLEELKEKIEALKEELEE 341
Score = 35.5 bits (82), Expect = 0.038
Identities = 23/88 (26%), Positives = 46/88 (52%)
Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
EE ++L+EE L ++++E + EI+ + E+L+ ++ E EE + + E L+ E+
Sbjct: 280 REELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEEL 339
Query: 228 TRLTQENESAKEEVKEVLQALEEPTRTL 255
E ++ + E+ +A EE L
Sbjct: 340 EERETLLEELEQLLAELEEAKEELEEKL 367
Score = 34.3 bits (79), Expect = 0.074
Identities = 24/82 (29%), Positives = 44/82 (53%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
E ++LE E E L ++L+E E+I + E+ + + E E+L+ E+L+ +++
Sbjct: 309 ERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLS 368
Query: 229 RLTQENESAKEEVKEVLQALEE 250
L +E E E ++E L LE
Sbjct: 369 ALLEELEELFEALREELAELEA 390
Score = 34.3 bits (79), Expect = 0.086
Identities = 22/93 (23%), Positives = 46/93 (49%), Gaps = 11/93 (11%)
Query: 169 EERQKLEEERERLYQQLDEKD---EEINQQSQYAEKLKEQIME----QEEL----ITTAR 217
E +++ E + ER + L+ + EE+ +Q + E+ E+ + EL +
Sbjct: 172 ERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALLL 231
Query: 218 RDYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
++L+ E+ L +E +EE++E+ + LEE
Sbjct: 232 AKLKELRKELEELEEELSRLEEELEELQEELEE 264
Score = 33.1 bits (76), Expect = 0.17
Identities = 24/93 (25%), Positives = 49/93 (52%), Gaps = 12/93 (12%)
Query: 169 EERQKLEEERERLY------------QQLDEKDEEINQQSQYAEKLKEQIMEQEELITTA 216
E Q+L+ E L ++L+E +EE+++ + E+L+E++ E E+ I
Sbjct: 213 ERYQELKAELRELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEEL 272
Query: 217 RRDYEQLQGEMTRLTQENESAKEEVKEVLQALE 249
+ + E+L+ E+ L +E KEE++E+ +
Sbjct: 273 KSELEELREELEELQEELLELKEEIEELEGEIS 305
Score = 32.4 bits (74), Expect = 0.33
Identities = 44/192 (22%), Positives = 78/192 (40%), Gaps = 12/192 (6%)
Query: 69 RRAKTIKNVVTVNEELTAEEWKRRYEKEKEKSSRYKGKLEKLELELSRWR-------NGE 121
+ K + + EL EE + E+ +E+ K ++E+LE E+S R N
Sbjct: 261 ELEEAEKEIEELKSEL--EELREELEELQEELLELKEEIEELEGEISLLRERLEELENEL 318
Query: 122 TVKEDEQVNLADPVDMAASIAPTPESAPASILPAIPGSGLMLGSLSNEERQKLEEERER- 180
E+ L + ++ E+ + + L + LEE E
Sbjct: 319 EELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELF 378
Query: 181 --LYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAK 238
L ++L E + E+ + E+LK +I EE + E L+ E+ L E E +
Sbjct: 379 EALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQ 438
Query: 239 EEVKEVLQALEE 250
E++E+ + LEE
Sbjct: 439 TELEELNEELEE 450
Score = 30.5 bits (69), Expect = 1.2
Identities = 25/94 (26%), Positives = 48/94 (51%), Gaps = 2/94 (2%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQ-SQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
+++L EERE L ++L++ E+ + + EKL + E EEL R + +L+ E+
Sbjct: 334 ALKEEL-EERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAEL 392
Query: 228 TRLTQENESAKEEVKEVLQALEEPTRTLLNFFRQ 261
+ E E K E++ + + LE + L + +
Sbjct: 393 AEIRNELEELKREIESLEERLERLSERLEDLKEE 426
Score = 30.5 bits (69), Expect = 1.2
Identities = 37/165 (22%), Positives = 66/165 (40%), Gaps = 18/165 (10%)
Query: 86 AEEWKRRYEKEKEKSSRYKGKLEKLELELSRWRNGETVKEDEQVNLADPVDMAASIAPTP 145
EE + E+ +E+ K K+E L+ EL E L + +
Sbjct: 311 LEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSAL 370
Query: 146 ESAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQ 205
+ A+ EE +LE E + +L+E EI + E+L E+
Sbjct: 371 LEELEELFEAL-----------REELAELEAELAEIRNELEELKREIESLEERLERLSER 419
Query: 206 IMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
+ + +E + ++L+ E+ L E E EE++E+ + LEE
Sbjct: 420 LEDLKEEL-------KELEAELEELQTELEELNEELEELEEQLEE 457
Score = 29.7 bits (67), Expect = 2.2
Identities = 25/87 (28%), Positives = 49/87 (56%), Gaps = 4/87 (4%)
Query: 168 NEERQKLEEERERL--YQQLDEKDEEINQQSQYAE--KLKEQIMEQEELITTARRDYEQL 223
++ +KLE + E+ YQ+L + E+ A+ +L++++ E EE ++ + E+L
Sbjct: 199 EKQLEKLERQAEKAERYQELKAELRELELALLLAKLKELRKELEELEEELSRLEEELEEL 258
Query: 224 QGEMTRLTQENESAKEEVKEVLQALEE 250
Q E+ +E E K E++E+ + LEE
Sbjct: 259 QEELEEAEKEIEELKSELEELREELEE 285
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 40.4 bits (95), Expect = 0.001
Identities = 23/101 (22%), Positives = 46/101 (45%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
E ++LEE +L ++L E + EI + + E+ +E++ E E I EQL+ E+
Sbjct: 740 AEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELK 799
Query: 229 RLTQENESAKEEVKEVLQALEEPTRTLLNFFRQSSPKGTLL 269
L + + + E+ + + L + R+ + L
Sbjct: 800 ALREALDELRAELTLLNEEAANLRERLESLERRIAATERRL 840
Score = 40.0 bits (94), Expect = 0.001
Identities = 24/95 (25%), Positives = 50/95 (52%)
Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
EE ++L+EE + ++L+E E+ + + E+L+ ++ E EE I +++ L E+
Sbjct: 238 REELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEI 297
Query: 228 TRLTQENESAKEEVKEVLQALEEPTRTLLNFFRQS 262
+RL Q+ + +E + + + LEE L +
Sbjct: 298 SRLEQQKQILRERLANLERQLEELEAQLEELESKL 332
Score = 38.9 bits (91), Expect = 0.002
Identities = 21/86 (24%), Positives = 49/86 (56%)
Query: 165 SLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQ 224
S ++ +LE E E+L +++ + +E+ + E+L+E++ E EE + A + E+L+
Sbjct: 729 SALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELE 788
Query: 225 GEMTRLTQENESAKEEVKEVLQALEE 250
++ +L +E ++ +E + E+ L
Sbjct: 789 AQIEQLKEELKALREALDELRAELTL 814
Score = 38.9 bits (91), Expect = 0.003
Identities = 22/88 (25%), Positives = 39/88 (44%)
Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
E ++LEE E ++L E + EI + E+LKE++ E + R + L E
Sbjct: 760 EAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEA 819
Query: 228 TRLTQENESAKEEVKEVLQALEEPTRTL 255
L + ES + + + LE+ +
Sbjct: 820 ANLRERLESLERRIAATERRLEDLEEQI 847
Score = 37.7 bits (88), Expect = 0.006
Identities = 19/83 (22%), Positives = 43/83 (51%)
Query: 167 SNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGE 226
+ ++LE + E L +LDE EE+ + + E+LKE++ E + + E+L+
Sbjct: 314 LERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESR 373
Query: 227 MTRLTQENESAKEEVKEVLQALE 249
+ L ++ E+ + +V ++ +
Sbjct: 374 LEELEEQLETLRSKVAQLELQIA 396
Score = 37.7 bits (88), Expect = 0.007
Identities = 35/175 (20%), Positives = 64/175 (36%), Gaps = 8/175 (4%)
Query: 81 NEELTAEEWKRRYEKEKEKSSRYKGKLEKLELELSRWRNGETVKEDEQVNLADPVDMAAS 140
E E+ + R + ++ + + ++E+LE L E E L ++
Sbjct: 737 RLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKE 796
Query: 141 IAPTPESAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAE 200
A + + +L NEE L E E L +++ + + + E
Sbjct: 797 ELKALREALDELRAEL--------TLLNEEAANLRERLESLERRIAATERRLEDLEEQIE 848
Query: 201 KLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEEPTRTL 255
+L E I I E+L+ E+ L E S +E + + LEE + L
Sbjct: 849 ELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEEL 903
Score = 37.0 bits (86), Expect = 0.011
Identities = 21/83 (25%), Positives = 44/83 (53%)
Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
+++Q L E L +QL+E + ++ + ++L E++ E EE + + + E L+ E+
Sbjct: 301 EQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAEL 360
Query: 228 TRLTQENESAKEEVKEVLQALEE 250
L E E + ++E+ + LE
Sbjct: 361 EELEAELEELESRLEELEEQLET 383
Score = 36.2 bits (84), Expect = 0.018
Identities = 22/101 (21%), Positives = 45/101 (44%), Gaps = 7/101 (6%)
Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQ--- 224
E+ ++L E+ E L +++E +E I + E L + EE + R + E+L
Sbjct: 844 EEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEEL 903
Query: 225 ----GEMTRLTQENESAKEEVKEVLQALEEPTRTLLNFFRQ 261
+ + L +E E +E++ ++ LE + N +
Sbjct: 904 RELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQER 944
Score = 36.2 bits (84), Expect = 0.020
Identities = 44/204 (21%), Positives = 79/204 (38%), Gaps = 20/204 (9%)
Query: 69 RRAKTIKNVVTVNEELT-----AEEWKR---RYEKEKEKSSRYKGK---LEKLELELS-- 115
RR +T + + E L E +R E++ EK+ RYK L +LEL L
Sbjct: 173 RRKETERKLERTRENLDRLEDILNELERQLKSLERQAEKAERYKELKAELRELELALLVL 232
Query: 116 RWRNGETVKEDEQVNLADPVDMAASIAPTPESAPASILPAIPGSGLMLGSLSNEER-QKL 174
R E+ Q L + + + + + L L EE ++L
Sbjct: 233 RLEELREELEELQEELKEAEEELEELTAELQELEEKL------EELRLEVSELEEEIEEL 286
Query: 175 EEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQEN 234
++E L ++ +++ + L+ Q+ E E + ++L E+ L ++
Sbjct: 287 QKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKL 346
Query: 235 ESAKEEVKEVLQALEEPTRTLLNF 258
E KEE++ + LEE L
Sbjct: 347 EELKEELESLEAELEELEAELEEL 370
Score = 35.8 bits (83), Expect = 0.027
Identities = 27/90 (30%), Positives = 47/90 (52%), Gaps = 3/90 (3%)
Query: 165 SLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQ 224
L E ++L+ E E L ++L+E EE+ + + E+L+E++ E E+ + A R+ QLQ
Sbjct: 429 KLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQ 488
Query: 225 GEMT---RLTQENESAKEEVKEVLQALEEP 251
+ RL + E E VK +L+
Sbjct: 489 ARLDSLERLQENLEGFSEGVKALLKNQSGL 518
Score = 35.8 bits (83), Expect = 0.029
Identities = 21/76 (27%), Positives = 36/76 (47%)
Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
E ++LE E E L +L+E +E++ +L+ QI I E+L+
Sbjct: 357 EAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRR 416
Query: 228 TRLTQENESAKEEVKE 243
RL QE E ++++E
Sbjct: 417 ERLQQEIEELLKKLEE 432
Score = 35.0 bits (81), Expect = 0.045
Identities = 21/74 (28%), Positives = 40/74 (54%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
EE +LEE+ E L ++L+ + E+ + E+L+ ++ E EE + T R QL+ ++
Sbjct: 337 EELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIA 396
Query: 229 RLTQENESAKEEVK 242
L E E + ++
Sbjct: 397 SLNNEIERLEARLE 410
Score = 32.7 bits (75), Expect = 0.27
Identities = 21/84 (25%), Positives = 42/84 (50%), Gaps = 1/84 (1%)
Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIME-QEELITTARRDYEQLQGE 226
+EE ++LE +R L ++L+E E++ Q E L+ +I QE L E+ +
Sbjct: 900 SEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEAL 959
Query: 227 MTRLTQENESAKEEVKEVLQALEE 250
++ + E A+ +K + ++E
Sbjct: 960 ENKIEDDEEEARRRLKRLENKIKE 983
Score = 28.9 bits (65), Expect = 4.6
Identities = 22/94 (23%), Positives = 42/94 (44%), Gaps = 9/94 (9%)
Query: 166 LSNEERQKLEEER--ERLYQQLDEKDE-------EINQQSQYAEKLKEQIMEQEELITTA 216
L + ER+ ER E L +Q++E E EI + + E+L+ ++ +
Sbjct: 826 LESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASL 885
Query: 217 RRDYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
L+ E+ L++E + + E+ + LEE
Sbjct: 886 EEALALLRSELEELSEELRELESKRSELRRELEE 919
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 39.8 bits (94), Expect = 0.001
Identities = 24/97 (24%), Positives = 51/97 (52%), Gaps = 13/97 (13%)
Query: 167 SNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGE 226
E R +L++ +RL Q K+E ++++ + EK +E++ ++E+ + +++ E+ + E
Sbjct: 77 LRERRNELQKLEKRLLQ----KEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEE 132
Query: 227 MTRLTQENESA--------KEEVKE-VLQALEEPTRT 254
+ L +E EE KE +L+ +EE R
Sbjct: 133 LEELIEEQLQELERISGLTAEEAKEILLEKVEEEARH 169
Score = 32.1 bits (74), Expect = 0.31
Identities = 17/83 (20%), Positives = 50/83 (60%)
Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
+++ L E +E +++ +E ++E+ ++ +KL+++++++EE + E+ + E+
Sbjct: 53 AIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEEL 112
Query: 228 TRLTQENESAKEEVKEVLQALEE 250
+ +E E ++E+++ + LEE
Sbjct: 113 EKKEKELEQKQQELEKKEEELEE 135
>gnl|CDD|217933 pfam04156, IncA, IncA protein. Chlamydia trachomatis is an
obligate intracellular bacterium that develops within a
parasitophorous vacuole termed an inclusion. The
inclusion is non-fusogenic with lysosomes but intercepts
lipids from a host cell exocytic pathway. Initiation of
chlamydial development is concurrent with modification
of the inclusion membrane by a set of C.
trachomatis-encoded proteins collectively designated
Incs. One of these Incs, IncA, is functionally
associated with the homotypic fusion of inclusions. This
family probably includes members of the wider Inc family
rather than just IncA.
Length = 186
Score = 38.2 bits (89), Expect = 0.002
Identities = 31/153 (20%), Positives = 56/153 (36%), Gaps = 32/153 (20%)
Query: 137 MAASIAPTPESAPASILPAIPGSGLMLGSL---------------SNEERQKLEEERERL 181
A + A I + + L LG + + QKLEE + L
Sbjct: 22 GIALLVLFGAGLGALISTLLGIALLALGLVLLALGLLCLLLKAPVQSVRPQKLEELQGEL 81
Query: 182 ---YQQLDEKDEEINQQSQYAEKLKEQIMEQEEL--------------ITTARRDYEQLQ 224
QQL E EE+ + +L+ ++ + +E + + ++L
Sbjct: 82 SELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRELLKSLEERLESLEESIKELA 141
Query: 225 GEMTRLTQENESAKEEVKEVLQALEEPTRTLLN 257
E+ L Q+ EE++E L+ L+E + L
Sbjct: 142 KELRELRQDLREEVEELREELERLQENLQRLQE 174
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 39.3 bits (92), Expect = 0.002
Identities = 42/176 (23%), Positives = 75/176 (42%), Gaps = 8/176 (4%)
Query: 82 EELTAEEWKRRYEKEKEKSSRYKGKLEKLELELSRWRNGETVKEDEQVNLADPVDMAASI 141
E+L E K+ + +E+ R K K+ +LE E++ KE E + + + +
Sbjct: 271 EQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAE 330
Query: 142 APTPESAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEI----NQQSQ 197
+ + I L+ EE +L+EE E L +L+E D+E ++
Sbjct: 331 IDKLLAEIEELEREIEEERKRRDKLT-EEYAELKEELEDLRAELEEVDKEFAETRDELKD 389
Query: 198 YAEKLKEQIMEQEELITTARRDYEQLQ---GEMTRLTQENESAKEEVKEVLQALEE 250
Y EKL++ E EL R E+LQ E+ L + ++ E+ + E+
Sbjct: 390 YREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKED 445
Score = 37.4 bits (87), Expect = 0.008
Identities = 22/87 (25%), Positives = 47/87 (54%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
E KLE E ++L +++E + EI ++ + +KL E+ E +E + R + E++ E
Sbjct: 322 ERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFA 381
Query: 229 RLTQENESAKEEVKEVLQALEEPTRTL 255
E + +E+++++ + + E R L
Sbjct: 382 ETRDELKDYREKLEKLKREINELKREL 408
Score = 37.0 bits (86), Expect = 0.013
Identities = 37/178 (20%), Positives = 66/178 (37%), Gaps = 23/178 (12%)
Query: 92 RYEKEKEKSSRYKGKLEKLE-----LELSRWRNGET--------------VKEDEQVNLA 132
R +E+EK+ RY+ L++ L E E ++
Sbjct: 202 RLRREREKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEIS 261
Query: 133 DPVDMAASIAPTPESAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEI 192
+ I E I L + E+ +LE E L + + EK+ E+
Sbjct: 262 ELEKRLEEIEQLLEELNKKIKDLGEEEQLRV----KEKIGELEAEIASLERSIAEKEREL 317
Query: 193 NQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
+ KL+ +I + I R+ E+ + +LT+E KEE++++ LEE
Sbjct: 318 EDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEE 375
Score = 34.3 bits (79), Expect = 0.074
Identities = 21/86 (24%), Positives = 36/86 (41%), Gaps = 3/86 (3%)
Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
E + L+ E L +L + +++ SQ +I E E+ I ++ E+L+ +
Sbjct: 680 RERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERL 739
Query: 228 TRLT---QENESAKEEVKEVLQALEE 250
L E E VK L+ LE
Sbjct: 740 EELEEDLSSLEQEIENVKSELKELEA 765
Score = 34.3 bits (79), Expect = 0.086
Identities = 21/89 (23%), Positives = 42/89 (47%), Gaps = 2/89 (2%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
L++ER+ L QL E + +I + EK ++++ E + + + +++
Sbjct: 882 SRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKG 941
Query: 229 RLTQ--ENESAKEEVKEVLQALEEPTRTL 255
+ E E + E+V+ LQ +EE R L
Sbjct: 942 EDEEIPEEELSLEDVQAELQRVEEEIRAL 970
Score = 33.5 bits (77), Expect = 0.14
Identities = 20/82 (24%), Positives = 40/82 (48%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
EE Q+L EE L + + +IN+ + E +I +QE + D + + E+
Sbjct: 413 EELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELY 472
Query: 229 RLTQENESAKEEVKEVLQALEE 250
L +E + ++E+ ++ + L E
Sbjct: 473 DLKEEYDRVEKELSKLQRELAE 494
Score = 33.1 bits (76), Expect = 0.17
Identities = 24/91 (26%), Positives = 50/91 (54%), Gaps = 4/91 (4%)
Query: 165 SLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQ 224
+ +E Q+L + +++ E ++EI Q Q EKLKE++ E EE +++ ++ E ++
Sbjct: 702 NRLDELSQELSDAS----RKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVK 757
Query: 225 GEMTRLTQENESAKEEVKEVLQALEEPTRTL 255
E+ L E +E++ ++ +AL + L
Sbjct: 758 SELKELEARIEELEEDLHKLEEALNDLEARL 788
Score = 32.7 bits (75), Expect = 0.28
Identities = 19/81 (23%), Positives = 40/81 (49%)
Query: 170 ERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTR 229
E++ LE+E + L +Q + E+I + E L + E EE + L+ +
Sbjct: 827 EKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGD 886
Query: 230 LTQENESAKEEVKEVLQALEE 250
L +E + + +++E+ + +EE
Sbjct: 887 LKKERDELEAQLRELERKIEE 907
Score = 32.3 bits (74), Expect = 0.34
Identities = 18/89 (20%), Positives = 43/89 (48%)
Query: 165 SLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQ 224
+ E + EE +L ++D+ EI + + E+ +++ + E + + E L+
Sbjct: 311 AEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLR 370
Query: 225 GEMTRLTQENESAKEEVKEVLQALEEPTR 253
E+ + +E ++E+K+ + LE+ R
Sbjct: 371 AELEEVDKEFAETRDELKDYREKLEKLKR 399
Score = 32.0 bits (73), Expect = 0.43
Identities = 38/164 (23%), Positives = 75/164 (45%), Gaps = 10/164 (6%)
Query: 87 EEWKRRYEKEKEKSSRYKGKLEKLELELSRWRNGETVKEDEQVNLADPVDMAASIAPTPE 146
E ++ E+ +++ + K +LE+LE +LS + E L ++ E
Sbjct: 719 GEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLE 778
Query: 147 SAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEIN---QQSQYAEK-- 201
A + + S + E KLEEE R+ +L E ++++N + +Y EK
Sbjct: 779 EALNDLEARLSHSRI---PEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEI 835
Query: 202 --LKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKE 243
L+EQ ++ +E I + ++ E L G+ L +E E + +++
Sbjct: 836 QELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRD 879
Score = 31.6 bits (72), Expect = 0.52
Identities = 17/86 (19%), Positives = 40/86 (46%), Gaps = 3/86 (3%)
Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
E +L+ E +RL ++L EE+ + ++ +I E EE + ++ + ++
Sbjct: 398 KREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKL 457
Query: 228 TRLTQENESAKEE---VKEVLQALEE 250
+L + ++E +KE +E+
Sbjct: 458 EQLAADLSKYEQELYDLKEEYDRVEK 483
Score = 30.4 bits (69), Expect = 1.3
Identities = 20/86 (23%), Positives = 41/86 (47%)
Query: 166 LSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQG 225
N ++++LEEE E L L + + + + ++L+ Q+ E E I E+ +
Sbjct: 858 NLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRK 917
Query: 226 EMTRLTQENESAKEEVKEVLQALEEP 251
++ L + E+ +EE+ E+ E
Sbjct: 918 RLSELKAKLEALEEELSEIEDPKGED 943
Score = 29.7 bits (67), Expect = 2.1
Identities = 24/90 (26%), Positives = 41/90 (45%), Gaps = 14/90 (15%)
Query: 168 NEERQKLEEERERL--YQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQG 225
++ ++L ERE+ YQ L ++ E + E LKE R E ++
Sbjct: 197 RQQLERLRREREKAERYQALLKEKREY----EGYELLKE--------KEALERQKEAIER 244
Query: 226 EMTRLTQENESAKEEVKEVLQALEEPTRTL 255
++ L +E E EE+ E+ + LEE + L
Sbjct: 245 QLASLEEELEKLTEEISELEKRLEEIEQLL 274
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This
family of bacterial proteins has no known function. The
proteins are in the region of 500-600 amino acid
residues in length.
Length = 450
Score = 38.8 bits (91), Expect = 0.003
Identities = 19/78 (24%), Positives = 40/78 (51%), Gaps = 2/78 (2%)
Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
EE ++LEEE + L ++L++ + + + + A + E+ E + E+L E+
Sbjct: 333 KEELKELEEELKELEEELEKIKKLLKKLPKKAR--GQLPPEKREQLEKLLETKEKLSEEL 390
Query: 228 TRLTQENESAKEEVKEVL 245
L +E + KEE++ +
Sbjct: 391 EELEEELKELKEELESLY 408
>gnl|CDD|219176 pfam06785, UPF0242, Uncharacterized protein family (UPF0242).
Length = 401
Score = 38.3 bits (89), Expect = 0.003
Identities = 20/92 (21%), Positives = 42/92 (45%), Gaps = 3/92 (3%)
Query: 164 GSLSNEERQKLEEERERLYQ---QLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDY 220
G L + +KL E+ E L ++E+ +E +Q +KL Q+ + + Y
Sbjct: 70 GQLLQTKIRKLTEQDEGLRNIRESIEERQKESDQLKLQNQKLVNQLGHARGVFIKTKGRY 129
Query: 221 EQLQGEMTRLTQENESAKEEVKEVLQALEEPT 252
++L+ L +EN+ + ++ ++Q E
Sbjct: 130 QKLEELSRHLKEENQCLQIQLDALVQECNEKI 161
>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
Length = 562
Score = 38.5 bits (90), Expect = 0.003
Identities = 18/84 (21%), Positives = 40/84 (47%), Gaps = 1/84 (1%)
Query: 172 QKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLT 231
LE+ +L+E +E N+QS+ +LK +I ++ + T ++++ + L
Sbjct: 313 HSLEKLDTA-IDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQ 371
Query: 232 QENESAKEEVKEVLQALEEPTRTL 255
E EE+ ++ L++ +T
Sbjct: 372 AEFVDNAEELAKLQDELDKIVKTK 395
Score = 30.0 bits (68), Expect = 1.8
Identities = 19/79 (24%), Positives = 36/79 (45%), Gaps = 4/79 (5%)
Query: 181 LYQQLDEKDEEINQQSQYAEKLKEQIMEQEEL----ITTARRDYEQLQGEMTRLTQENES 236
L QQ+ D +I+ Q + + I EQ + I + Y++L E + E E
Sbjct: 179 LNQQIQTLDMKIDHIQQQIKTYNKNIEEQRKKNGENIARKQNKYDELVEEAKTIKAEIEE 238
Query: 237 AKEEVKEVLQALEEPTRTL 255
+E+ ++ +E+P+ L
Sbjct: 239 LTDELLNLVMDIEDPSAAL 257
>gnl|CDD|179877 PRK04778, PRK04778, septation ring formation regulator EzrA;
Provisional.
Length = 569
Score = 38.3 bits (90), Expect = 0.004
Identities = 23/88 (26%), Positives = 45/88 (51%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
E ++LE++ E L +Q DE E I +Q +L+E++ E + + ++ E+L +
Sbjct: 348 ESVRQLEKQLESLEKQYDEITERIAEQEIAYSELQEELEEILKQLEEIEKEQEKLSEMLQ 407
Query: 229 RLTQENESAKEEVKEVLQALEEPTRTLL 256
L ++ A+E+++ L E R L
Sbjct: 408 GLRKDELEAREKLERYRNKLHEIKRYLE 435
Score = 32.5 bits (75), Expect = 0.25
Identities = 19/73 (26%), Positives = 36/73 (49%), Gaps = 5/73 (6%)
Query: 189 DEEINQQSQYAEKL-----KEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKE 243
+E++ + + +K K +I E E L+ D EQ+ E+ L + E +EEV++
Sbjct: 85 EEQLFEAEELNDKFRFRKAKHEINEIESLLDLIEEDIEQILEELQELLESEEKNREEVEQ 144
Query: 244 VLQALEEPTRTLL 256
+ E ++LL
Sbjct: 145 LKDLYRELRKSLL 157
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 38.3 bits (90), Expect = 0.004
Identities = 21/86 (24%), Positives = 47/86 (54%), Gaps = 4/86 (4%)
Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
EE + L +E E+L ++L+EK E+ Q + + L+E E ++ I A+++ +++ E+
Sbjct: 536 AEEAEALLKEAEKLKEELEEKKEK--LQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKEL 593
Query: 228 TRLTQENESA--KEEVKEVLQALEEP 251
+L + ++ E+ E + L +
Sbjct: 594 RQLQKGGYASVKAHELIEARKRLNKA 619
Score = 34.8 bits (81), Expect = 0.054
Identities = 22/89 (24%), Positives = 45/89 (50%), Gaps = 14/89 (15%)
Query: 162 MLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYE 221
++ SL ER +LE++ E L E ++ + + EKL+E E+++L+ A ++ +
Sbjct: 521 LIASLEELER-ELEQKAEEAEALLKEAEKLKEELEEKKEKLQE---EEDKLLEEAEKEAQ 576
Query: 222 QLQGEMTRLTQENESAKEEVKEVLQALEE 250
Q + AK+E E+++ L +
Sbjct: 577 QAI----------KEAKKEADEIIKELRQ 595
Score = 28.3 bits (64), Expect = 5.9
Identities = 20/73 (27%), Positives = 38/73 (52%), Gaps = 7/73 (9%)
Query: 184 QLDEKDEEINQ-----QSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAK 238
+ E E++N+ + E L+++ E E L+ A + E+L+ + +L +E +
Sbjct: 510 LIGEDKEKLNELIASLEELERE-LEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLL 568
Query: 239 EEV-KEVLQALEE 250
EE KE QA++E
Sbjct: 569 EEAEKEAQQAIKE 581
>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
Provisional.
Length = 1123
Score = 38.0 bits (89), Expect = 0.005
Identities = 27/119 (22%), Positives = 49/119 (41%), Gaps = 21/119 (17%)
Query: 148 APASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIM 207
+ P P + L L++E L QQL+ + E Q AE +++++
Sbjct: 131 PGPFVPPEDPENLL----------HALQQEVLTLKQQLELQAREKAQSQALAEAQQQELV 180
Query: 208 EQEELITTARRDYEQLQGEMTRLTQENESA----KEEVKEV-LQAL------EEPTRTL 255
E L ++L+ ++ +L ++ K++ KE+ QA EE TR L
Sbjct: 181 ALEGLAAELEEKQQELEAQLEQLQEKAAETSQERKQKRKEITDQAAKRLELSEEETRIL 239
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 37.4 bits (87), Expect = 0.008
Identities = 20/84 (23%), Positives = 42/84 (50%), Gaps = 5/84 (5%)
Query: 167 SNEERQKLEEERERLYQ---QLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQL 223
+ ER++L+ E ERL Q QLD + E+++ E+ ++ + +E + + +
Sbjct: 72 ARREREELQREEERLVQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNE 131
Query: 224 QGEMTRLTQENESAKEEVKEVLQA 247
+ LT E A++ + ++L A
Sbjct: 132 LYRVAGLTP--EQARKLLLKLLDA 153
Score = 30.4 bits (69), Expect = 1.2
Identities = 25/91 (27%), Positives = 42/91 (46%), Gaps = 2/91 (2%)
Query: 171 RQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRL 230
RQ+L +E ER+ Q+ ++ EE + + E + + E+ + ARR+ E+LQ E RL
Sbjct: 28 RQRLAKEAERILQEAQKEAEEKLEAAL-LEAKELLLRERNQQRQEARREREELQREEERL 86
Query: 231 TQENESAKEEVKEVLQALEEPTRTLLNFFRQ 261
Q+ E ++ L LE
Sbjct: 87 VQKEEQLDARAEK-LDNLENQLEEREKALSA 116
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 37.0 bits (86), Expect = 0.010
Identities = 39/180 (21%), Positives = 75/180 (41%), Gaps = 22/180 (12%)
Query: 74 IKNVVTVNEELTAEEWKRRYEKEKEKSSRYKGKLEKLELELSRWRNGETVKEDEQVNLAD 133
I++++ EE E K +E+++ + ++E LE L+ + E+ + L +
Sbjct: 201 IEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARLLE 260
Query: 134 PVDMAASIAPTPESAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEIN 193
+ E E ++L EE E ++L+E + EI
Sbjct: 261 IESLELEALKIREEE-------------------LRELERLLEELEEKIERLEELEREIE 301
Query: 194 QQSQYAEKLKEQIMEQEELITTARR---DYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
+ + E L+ + E EEL+ + E+L+ ++ +L E E EE E+ + LEE
Sbjct: 302 ELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEE 361
Score = 33.6 bits (77), Expect = 0.12
Identities = 19/89 (21%), Positives = 43/89 (48%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
E +KL+ ERL + ++ ++ ++ + AE+ E EE + E+L+ E+
Sbjct: 319 ELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELE 378
Query: 229 RLTQENESAKEEVKEVLQALEEPTRTLLN 257
+ + + +E ++E+ + L E + L
Sbjct: 379 KALERLKQLEEAIQELKEELAELSAALEE 407
Score = 32.0 bits (73), Expect = 0.37
Identities = 33/164 (20%), Positives = 61/164 (37%), Gaps = 10/164 (6%)
Query: 87 EEWKRRYEKEKEKSSRYKGKLEKLELELSRWRNGETVKEDEQVNLADPVDMAASIAPTPE 146
E++++ E KE K K+E+LE +LS ED L + + + E
Sbjct: 167 EKYEKLSELLKEVIKEAKAKIEELEGQLSE---LLEDIEDLLEALEEELKELKKLEEIQE 223
Query: 147 SAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQI 206
L E +LEEE+ERL + E + + + + +E++
Sbjct: 224 EQEEEEL-------EQEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEEL 276
Query: 207 MEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
E E L+ E+L+ + + E + + + E
Sbjct: 277 RELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEEL 320
Score = 29.7 bits (67), Expect = 2.0
Identities = 18/56 (32%), Positives = 33/56 (58%)
Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQL 223
EE ++LEEE E+L ++L+E +++ + Q E+L+ + E EEL + + L
Sbjct: 687 LEELEQLEEELEQLREELEELLKKLGEIEQLIEELESRKAELEELKKELEKLEKAL 742
Score = 29.7 bits (67), Expect = 2.2
Identities = 41/171 (23%), Positives = 77/171 (45%), Gaps = 14/171 (8%)
Query: 86 AEEWKRRYEKEKEKSSRY---KGKLEKLELELSRWRNGETVKEDEQVNLADPVDMAASIA 142
E +R E+ +EK R + ++E+LE EL R E+ L + +
Sbjct: 276 LRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLE 335
Query: 143 PTPESAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKL 202
E + + L NE + LEE + L ++L+E ++E+ + + ++L
Sbjct: 336 EKLEKLESELE--------ELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQL 387
Query: 203 KEQIMEQEELITTARRDYEQLQ---GEMTRLTQENESAKEEVKEVLQALEE 250
+E I E +E + E++Q E+ + +E E EE++E ++ LEE
Sbjct: 388 EEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEEEIKKLEE 438
Score = 29.0 bits (65), Expect = 3.7
Identities = 24/85 (28%), Positives = 48/85 (56%), Gaps = 2/85 (2%)
Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
+E +L EE E L ++L E +EE+ + + E LKE++ E+ E + + E+L+ ++
Sbjct: 493 EKEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKL 552
Query: 228 TR--LTQENESAKEEVKEVLQALEE 250
L +E ++ ++E+ + LEE
Sbjct: 553 QLQQLKEELRQLEDRLQELKELLEE 577
Score = 29.0 bits (65), Expect = 3.8
Identities = 19/88 (21%), Positives = 36/88 (40%)
Query: 175 EEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQEN 234
EE + L + + + EE+ ++ ++ E E EEL R E+L +
Sbjct: 469 EEHEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALK 528
Query: 235 ESAKEEVKEVLQALEEPTRTLLNFFRQS 262
E +E+++++ LEE Q
Sbjct: 529 EELEEKLEKLENLLEELEELKEKLQLQQ 556
Score = 28.2 bits (63), Expect = 6.9
Identities = 16/81 (19%), Positives = 34/81 (41%)
Query: 170 ERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTR 229
E + + E +L E + E+ + + E E++ Q EL + E+L+ ++
Sbjct: 606 EERLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKVEE 665
Query: 230 LTQENESAKEEVKEVLQALEE 250
L E + ++ Q E+
Sbjct: 666 LEAEIRRELQRIENEEQLEEK 686
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 36.1 bits (84), Expect = 0.011
Identities = 26/92 (28%), Positives = 47/92 (51%), Gaps = 1/92 (1%)
Query: 165 SLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQ 224
S ++ KL+ E+L ++L+E + I + EKLK+ E EE T + +QL+
Sbjct: 58 SFPSQALNKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEE-RTELLEELKQLE 116
Query: 225 GEMTRLTQENESAKEEVKEVLQALEEPTRTLL 256
E+ +L E E ++ E ++ L+E T+
Sbjct: 117 KELKKLKAELEKYEKNDPERIEKLKEETKVAK 148
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 37.0 bits (86), Expect = 0.012
Identities = 24/89 (26%), Positives = 55/89 (61%), Gaps = 2/89 (2%)
Query: 167 SNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGE 226
+ +E ++ E+E ++L ++LD+ EE+ + + E+L++++ E E+ + +YE+L+ E
Sbjct: 610 AEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYS--EEEYEELREE 667
Query: 227 MTRLTQENESAKEEVKEVLQALEEPTRTL 255
L++E + E++E+ + EE +TL
Sbjct: 668 YLELSRELAGLRAELEELEKRREEIKKTL 696
Score = 28.9 bits (65), Expect = 4.1
Identities = 26/91 (28%), Positives = 42/91 (46%)
Query: 163 LGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQ 222
L + EE +K EE E+L ++L + EI + EKL+E + EL E+
Sbjct: 512 LKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEE 571
Query: 223 LQGEMTRLTQENESAKEEVKEVLQALEEPTR 253
L + L + + EE++E L+ LE
Sbjct: 572 LAELLKELEELGFESVEELEERLKELEPFYN 602
Score = 28.5 bits (64), Expect = 5.9
Identities = 17/71 (23%), Positives = 33/71 (46%)
Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
E + K EE + E++ ++ + EK KE+I E+ IT + ++ E+
Sbjct: 365 EEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKEL 424
Query: 228 TRLTQENESAK 238
+ +E + AK
Sbjct: 425 KKAIEELKKAK 435
Score = 28.1 bits (63), Expect = 6.4
Identities = 26/87 (29%), Positives = 49/87 (56%), Gaps = 6/87 (6%)
Query: 175 EEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEEL---ITTARRDYEQLQGEMTRLT 231
E+E E + ++++E E+ + + EKL++++ E EEL I ++ E L+G +L
Sbjct: 199 EKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLE 258
Query: 232 ---QENESAKEEVKEVLQALEEPTRTL 255
+E E EE+K+ ++ LEE + L
Sbjct: 259 EKIRELEERIEELKKEIEELEEKVKEL 285
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 35.8 bits (83), Expect = 0.014
Identities = 19/80 (23%), Positives = 38/80 (47%), Gaps = 3/80 (3%)
Query: 167 SNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQ---IMEQEELITTARRDYEQL 223
EE +L+E+ L Q+ E +E++ E+L+++ I + + +L
Sbjct: 78 LQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIELDEENREL 137
Query: 224 QGEMTRLTQENESAKEEVKE 243
+ E+ L QENE+ + E +
Sbjct: 138 REELAELKQENEALEAENER 157
Score = 33.4 bits (77), Expect = 0.070
Identities = 25/87 (28%), Positives = 48/87 (55%), Gaps = 6/87 (6%)
Query: 166 LSNEE--RQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQL 223
LS+ R++L E L Q+L E EE+ + + +L+++ E ++ ++T + E+L
Sbjct: 58 LSDTPSARERLPE----LQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERL 113
Query: 224 QGEMTRLTQENESAKEEVKEVLQALEE 250
Q E+ R+ Q + +A E +E + EE
Sbjct: 114 QKELARIKQLSANAIELDEENRELREE 140
>gnl|CDD|215124 PLN02195, PLN02195, cellulose synthase A.
Length = 977
Score = 36.5 bits (84), Expect = 0.018
Identities = 25/93 (26%), Positives = 43/93 (46%), Gaps = 7/93 (7%)
Query: 59 SETKSTLDFGRRAKTIKNVVTVNEELTAEE----WKRRYEKEKEKSSRYKGKLEKLELEL 114
S T D G A+ I +V TV+ EL E WK R E K+K ++ K +K E
Sbjct: 80 SHLNDTQDVGIHARHISSVSTVDSELNDEYGNPIWKNRVESWKDKKNKKKKSAKKKEAHK 139
Query: 115 SRWRNGETVKEDEQVNLADPVDMAASIAPTPES 147
++ + ++E + +P+ + + P P +
Sbjct: 140 AQIPPEQQMEEKPSADAYEPL---SRVIPIPRN 169
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 35.8 bits (83), Expect = 0.027
Identities = 26/118 (22%), Positives = 46/118 (38%), Gaps = 29/118 (24%)
Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
NEE +LE RL Q+LD + + EQ+ +E + A+ + EQ + +
Sbjct: 441 NEEEYELELRLGRLKQRLD--------SATATPEELEQLEINDEALEKAQEEQEQAEANV 492
Query: 228 TRLTQENESAK-----------------EEVKEVLQALEE----PTRTLLNFFRQSSP 264
+L E + ++++ L LE +LL+F R +P
Sbjct: 493 EQLQSELRQLRKRRDEALEALQRAERRLLQLRQALDELELQLSPQAGSLLHFLRNEAP 550
Score = 33.9 bits (78), Expect = 0.098
Identities = 14/76 (18%), Positives = 34/76 (44%)
Query: 172 QKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLT 231
+L E ++ + L + Q + + ++ EQ+ AR +Q + ++ RL
Sbjct: 603 TELRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEARTALKQARLDLQRLQ 662
Query: 232 QENESAKEEVKEVLQA 247
E +S K++++ +
Sbjct: 663 NEQQSLKDKLELAIAE 678
Score = 30.0 bits (68), Expect = 1.8
Identities = 24/87 (27%), Positives = 39/87 (44%), Gaps = 6/87 (6%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDY-----EQL 223
E + +L ++ L QL E +E+NQ+ A + EL+ + + EQL
Sbjct: 288 EAKNRLRQQLRTLEDQLKEARDELNQELSAANAKLAADRSELELLEDQKGAFEDADIEQL 347
Query: 224 QGEMTRLTQENESAKEEVKEVLQALEE 250
Q ++ +L S EEV+ L AL
Sbjct: 348 QADLDQLPSI-RSELEEVEARLDALTG 373
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 35.7 bits (83), Expect = 0.028
Identities = 21/86 (24%), Positives = 39/86 (45%), Gaps = 20/86 (23%)
Query: 168 NEERQKLEEERERLYQQ---LDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQ 224
E R +L+ RL Q+ LD K E +++ KE+ +E++E ++L
Sbjct: 72 KERRNELQRLERRLLQREETLDRKMESLDK--------KEENLEKKE---------KELS 114
Query: 225 GEMTRLTQENESAKEEVKEVLQALEE 250
+ L ++ E +E + E + LE
Sbjct: 115 NKEKNLDEKEEELEELIAEQREELER 140
>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
region. This family includes the N-terminal regions of
the junctin, junctate and aspartyl beta-hydroxylase
proteins. Junctate is an integral ER/SR membrane calcium
binding protein, which comes from an alternatively
spliced form of the same gene that generates aspartyl
beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
catalyzes the post-translational hydroxylation of
aspartic acid or asparagine residues contained within
epidermal growth factor (EGF) domains of proteins.
Length = 240
Score = 34.9 bits (80), Expect = 0.031
Identities = 22/86 (25%), Positives = 38/86 (44%), Gaps = 1/86 (1%)
Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
EE +E + + L +D + Q++ A K+ E+ + + TA Y+Q E
Sbjct: 121 QEEDGPGKEPQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASEQYKQDMKEK 180
Query: 228 TRLTQENESAKEEVKEVLQALEEPTR 253
QENE +KE V++ + E
Sbjct: 181 AS-EQENEDSKEPVEKAERTKAETDD 205
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 35.1 bits (81), Expect = 0.036
Identities = 32/160 (20%), Positives = 57/160 (35%), Gaps = 20/160 (12%)
Query: 102 RYKGKLEKLELELSRWRNGETVKEDEQVNLADPVDMAASIAPTPESAPASILP-----AI 156
YK KLEK+E +L E + + V +A P
Sbjct: 356 AYKPKLEKVERKLP-----ELGIWKDVERIKALVIRGYPLAEALSKVKEEERPREKEGTE 410
Query: 157 PGSGLMLGSLSNEER------QKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQE 210
+ + ++LEEE L ++L+E EI + E+ + + + +
Sbjct: 411 EEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEKLESELERFRRE-VRDK 469
Query: 211 ELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
R+ + RL +E E K+ V+E+ + L E
Sbjct: 470 VRK---DREIRARDRRIERLEKELEEKKKRVEELERKLAE 506
>gnl|CDD|236942 PRK11637, PRK11637, AmiB activator; Provisional.
Length = 428
Score = 35.1 bits (81), Expect = 0.041
Identities = 23/101 (22%), Positives = 46/101 (45%), Gaps = 11/101 (10%)
Query: 140 SIAPTPESAPASILPAIPGSGLMLGSL-------SNEERQKLEEERERLYQQLDEKDEEI 192
++ + +I P + S L G L +++ R +L+ + Q + K++ +
Sbjct: 8 TMTRAVKPRRFAIRPILYASVLSAGVLLCAFSAHASDNRDQLKS----IQQDIAAKEKSV 63
Query: 193 NQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQE 233
QQ Q L Q+ +QEE I+ A R + Q + +L ++
Sbjct: 64 RQQQQQRASLLAQLKKQEEAISQASRKLRETQNTLNQLNKQ 104
>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 321
Score = 34.6 bits (80), Expect = 0.048
Identities = 21/88 (23%), Positives = 41/88 (46%)
Query: 163 LGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQ 222
+ S+ + R+KL+ +E + DE EK ++++ I+ R+ E+
Sbjct: 173 INSIKPKLRKKLQALKEEIASLRQLADELNLCDPLELEKARQELRSLSVKISEKRKQLEE 232
Query: 223 LQGEMTRLTQENESAKEEVKEVLQALEE 250
LQ E+ LT E+ + E+L+ + E
Sbjct: 233 LQQELQELTIAIEALTNKKSELLEEIAE 260
Score = 31.9 bits (73), Expect = 0.38
Identities = 25/106 (23%), Positives = 44/106 (41%), Gaps = 13/106 (12%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
+E + L + +QL+E +E+ + + E L + E E I A + E+ +G
Sbjct: 214 QELRSLSVKISEKRKQLEELQQELQELTIAIEALTNKKSELLEEIAEAEKIREECRG--- 270
Query: 229 RLTQENESAKE--EVKEVLQALEEPTRTLLNFFRQSSPKGTLLYNH 272
SAKE ++K + L+ T F S ++ YN
Sbjct: 271 ------WSAKEISKLKAKVSLLQ--KLTGWKFLSLSGSTLSMSYNR 308
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 34.9 bits (81), Expect = 0.052
Identities = 35/227 (15%), Positives = 71/227 (31%), Gaps = 52/227 (22%)
Query: 29 SKLTRILQESLGGNARTTIIICCSPASFNESET---KSTLDFGRRAKTIKNVVTVNEELT 85
KL L + + S E + + K IK + EL
Sbjct: 60 DKLRSYLPK------LNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELE 113
Query: 86 AEEWKRRYEKEKEKSSRYKGKLEKLELELSRWRNGETVK---------EDEQVNLADPVD 136
E + E+E E+ + +L+LS + V + E++ L V+
Sbjct: 114 NE--IKELEQEIERLEPWGN----FDLDLSLLLGFKYVSVFVGTVPEDKLEELKLESDVE 167
Query: 137 MAASIAPTPESAPASILPA--------------------IPGSGLMLGSLSNEERQKLEE 176
I+ ++ + G +E ++++E
Sbjct: 168 NVEYISTDKGYVYVVVVVLKELSDEVEEELKKLGFERLELEEEGT-----PSELIREIKE 222
Query: 177 ERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQL 223
E E + ++ + EE+ + A+K E+++ E + E L
Sbjct: 223 ELEEIEKERESLLEELK---ELAKKYLEELLALYEYLEIELERAEAL 266
Score = 30.7 bits (70), Expect = 0.93
Identities = 24/91 (26%), Positives = 42/91 (46%), Gaps = 9/91 (9%)
Query: 165 SLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQ-----EELITTARRD 219
LSNE +KL +L + LD+ + L+E+ + EELI +
Sbjct: 39 ELSNERLRKLRSLLTKLSEALDK----LRSYLPKLNPLREEKKKVSVKSLEELIKDVEEE 94
Query: 220 YEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
E+++ E+ L +E + E+KE+ Q +E
Sbjct: 95 LEKIEKEIKELEEEISELENEIKELEQEIER 125
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
with the myosin-like domain [Function unknown].
Length = 499
Score = 34.6 bits (79), Expect = 0.052
Identities = 16/70 (22%), Positives = 26/70 (37%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
Q EE R + + I Q+ + +QI + E I R ++L+
Sbjct: 210 NAAQARTEELARRAAAAQQTAQAIQQRDAQISQKAQQIAARAEQIRERERQLQRLETAQA 269
Query: 229 RLTQENESAK 238
RL QE +
Sbjct: 270 RLEQEVAQLE 279
Score = 30.8 bits (69), Expect = 1.00
Identities = 15/80 (18%), Positives = 35/80 (43%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
EE + ++ Q + ++D +I+Q++Q EQI E+E + +L+ E+
Sbjct: 217 EELARRAAAAQQTAQAIQQRDAQISQKAQQIAARAEQIRERERQLQRLETAQARLEQEVA 276
Query: 229 RLTQENESAKEEVKEVLQAL 248
+L ++ ++
Sbjct: 277 QLEAYYQAYVRLRQQAAATQ 296
Score = 29.2 bits (65), Expect = 3.0
Identities = 15/84 (17%), Positives = 37/84 (44%), Gaps = 4/84 (4%)
Query: 163 LGSLSNEERQKLEEERERLY---QQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRD 219
L +L+ E+R++LE + + L +QL ++ Q + QI ++ + + T
Sbjct: 153 LKTLA-EQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQIEQEAQNLATRANA 211
Query: 220 YEQLQGEMTRLTQENESAKEEVKE 243
+ E+ R + + +++
Sbjct: 212 AQARTEELARRAAAAQQTAQAIQQ 235
Score = 28.1 bits (62), Expect = 5.9
Identities = 15/89 (16%), Positives = 37/89 (41%), Gaps = 3/89 (3%)
Query: 172 QKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLT 231
+ L E + + E +K +++ + + AR++ + Q E+ RLT
Sbjct: 84 RALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAKAQQELARLT 143
Query: 232 QENESAKEEVKEVL---QALEEPTRTLLN 257
++ + + +K + + LE ++L
Sbjct: 144 KQAQDLQTRLKTLAEQRRQLEAQAQSLQA 172
>gnl|CDD|132364 TIGR03321, alt_F1F0_F0_B, alternate F1F0 ATPase, F0 subunit B. A
small number of taxonomically diverse prokaryotic
species, including Methanosarcina barkeri, have what
appears to be a second ATP synthase, in addition to the
normal F1F0 ATPase in bacteria and A1A0 ATPase in
archaea. These enzymes use ion gradients to synthesize
ATP, CC and in principle may run in either direction.
This model represents the F0 subunit B of this apparent
second ATP synthase.
Length = 246
Score = 34.3 bits (79), Expect = 0.056
Identities = 24/92 (26%), Positives = 48/92 (52%), Gaps = 10/92 (10%)
Query: 164 GSLSNEERQKLEEERERLYQQLDEKDEEINQQ-SQYAEKLKEQI-MEQEELITTARRDYE 221
G L++ + +K E E+ER ++ +EK+EE++QQ K KE+ E++ L+ AR + +
Sbjct: 43 GELADADTKKREAEQER--REYEEKNEELDQQREVLLTKAKEEAQAERQRLLDEAREEAD 100
Query: 222 QLQGEMTRLTQENESAKEEVKEVLQALEEPTR 253
+++ + E+ + E + L T
Sbjct: 101 EIRE------KWQEALRREQAALSDELRRRTG 126
>gnl|CDD|224241 COG1322, COG1322, Predicted nuclease of restriction
endonuclease-like fold, RmuC family [General function
prediction only].
Length = 448
Score = 33.9 bits (78), Expect = 0.087
Identities = 22/93 (23%), Positives = 40/93 (43%), Gaps = 16/93 (17%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
E Q+L E + RL QQL + E++ + +L + ++L E+
Sbjct: 70 ELIQELNELKARLQQQLLQSREQLQLLIESLAQLSSEF--------------QELANEIF 115
Query: 229 RLTQENESAKEEVKEVLQALEEPTRTLLNFFRQ 261
+ N E ++ L+ L +P R +L FR+
Sbjct: 116 --EELNRRLAELNQQNLKQLLKPLREVLEKFRE 146
>gnl|CDD|114855 pfam06160, EzrA, Septation ring formation regulator, EzrA. During
the bacterial cell cycle, the tubulin-like cell-division
protein FtsZ polymerises into a ring structure that
establishes the location of the nascent division site.
EzrA modulates the frequency and position of FtsZ ring
formation.
Length = 559
Score = 33.3 bits (77), Expect = 0.13
Identities = 26/90 (28%), Positives = 49/90 (54%), Gaps = 2/90 (2%)
Query: 169 EERQKLEEERERLYQQLDEKDEEI-NQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
E ++LE++ E L Q D+ E I ++ Y+E L+E++ E E+ + ++ E+L +
Sbjct: 344 ETVRELEKQLEELEAQYDQLVERIAEKKVAYSE-LQEELEEIEKQLEEIEKEQEELSESL 402
Query: 228 TRLTQENESAKEEVKEVLQALEEPTRTLLN 257
L ++ A+E+++E Q L E R L
Sbjct: 403 QGLRKDELEAREKLQEYRQKLHEIKRYLEK 432
Score = 29.5 bits (67), Expect = 2.2
Identities = 21/56 (37%), Positives = 31/56 (55%)
Query: 201 KLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEEPTRTLL 256
K K+ I E EEL+ T D EQ+ E+ L + E ++EV+E+ E +TLL
Sbjct: 98 KAKKAIKEIEELLDTIEEDIEQILEELNELKESEEKNRKEVEELKDKYRELRKTLL 153
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 33.3 bits (76), Expect = 0.16
Identities = 20/100 (20%), Positives = 40/100 (40%), Gaps = 7/100 (7%)
Query: 170 ERQKLEEERERLYQQLDEKDEEI--NQQSQYAEKLKEQIMEQEELITTARR-----DYEQ 222
E++KL++E E+ ++L E++ + + E LK E E +I + E
Sbjct: 540 EQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKEVESIIRELKEKKIHKAKEI 599
Query: 223 LQGEMTRLTQENESAKEEVKEVLQALEEPTRTLLNFFRQS 262
E +E + + QA + + + +F Q
Sbjct: 600 KSIEDLVKLKETKQKIPQKPTNFQADKIGDKVRIRYFGQK 639
>gnl|CDD|236795 PRK10920, PRK10920, putative uroporphyrinogen III
C-methyltransferase; Provisional.
Length = 390
Score = 32.8 bits (75), Expect = 0.17
Identities = 20/121 (16%), Positives = 41/121 (33%), Gaps = 18/121 (14%)
Query: 127 EQVNLADPVDMAASIAPTPESAPASILPAIPGSGLMLGSLS---------------NEER 171
EQ + V+ T A+ + +GL+L +++ ++
Sbjct: 3 EQEKSSAVVEETREAVETTSQPVATEKKSKNRTGLVLSAVAIAIALAAGAGLYYHGKQQA 62
Query: 172 QKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLT 231
Q + L QL Q ++L+ + +Q + + A R L ++ L
Sbjct: 63 QNQTATNDALANQLTALQ---KAQESQKQELEGILKQQAKALDQANRQQAALAKQLDELQ 119
Query: 232 Q 232
Q
Sbjct: 120 Q 120
>gnl|CDD|114015 pfam05266, DUF724, Protein of unknown function (DUF724). This
family contains several uncharacterized proteins found
in Arabidopsis thaliana and other plants. This region is
often found associated with Agenet domains and may
contain coiled-coil.
Length = 190
Score = 32.0 bits (73), Expect = 0.22
Identities = 20/79 (25%), Positives = 36/79 (45%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
EER+ LE+E E D EI + + +L+ Q +E ++ +L+ E +
Sbjct: 110 EERKGLEKEIAEKEISRQELDSEIAELERKILELQRQAALLKEKKEAEDKEIARLKSEAS 169
Query: 229 RLTQENESAKEEVKEVLQA 247
++ QE A+ E + A
Sbjct: 170 KIKQELVDAELEFQTTAAA 188
Score = 29.7 bits (67), Expect = 1.4
Identities = 17/76 (22%), Positives = 36/76 (47%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
+++ K EER+ L +++ EK+ + +L+ +I+E + + E E+
Sbjct: 103 DDQTKKLEERKGLEKEIAEKEISRQELDSEIAELERKILELQRQAALLKEKKEAEDKEIA 162
Query: 229 RLTQENESAKEEVKEV 244
RL E K+E+ +
Sbjct: 163 RLKSEASKIKQELVDA 178
>gnl|CDD|223571 COG0497, RecN, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 557
Score = 32.6 bits (75), Expect = 0.24
Identities = 11/44 (25%), Positives = 26/44 (59%), Gaps = 1/44 (2%)
Query: 167 SNEERQKLEEERERLYQQLDEKDEEINQQSQ-YAEKLKEQIMEQ 209
S E + LE+E ++L +L E E ++ + A++L++++ +
Sbjct: 340 SEESLEALEKEVKKLKAELLEAAEALSAIRKKAAKELEKEVTAE 383
Score = 30.3 bits (69), Expect = 1.3
Identities = 19/80 (23%), Positives = 36/80 (45%), Gaps = 9/80 (11%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDY-----EQL 223
E K++EE QLD +E + + +KLK +++E E ++ R+ +++
Sbjct: 325 EYLDKIKEE----LAQLDNSEESLEALEKEVKKLKAELLEAAEALSAIRKKAAKELEKEV 380
Query: 224 QGEMTRLTQENESAKEEVKE 243
E+ L E E+K
Sbjct: 381 TAELKALAMEKARFTVELKP 400
Score = 28.7 bits (65), Expect = 4.3
Identities = 23/90 (25%), Positives = 37/90 (41%), Gaps = 4/90 (4%)
Query: 162 MLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYE 221
L S E R L+E E +L+E +E + A K I + E + + +
Sbjct: 277 ELEEASEELRAYLDE-LEFDPNRLEEVEERLFALKSLARKYGVTIEDLLEYLDKIKEELA 335
Query: 222 QLQG---EMTRLTQENESAKEEVKEVLQAL 248
QL + L +E + K E+ E +AL
Sbjct: 336 QLDNSEESLEALEKEVKKLKAELLEAAEAL 365
Score = 27.9 bits (63), Expect = 7.8
Identities = 18/92 (19%), Positives = 40/92 (43%), Gaps = 11/92 (11%)
Query: 164 GSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKE----QIMEQEELITTARRD 219
L ++ +E + ++Q + E++ ++ + + + Q+ E EEL +
Sbjct: 149 AGLEELAQEAYQEAYQA-WKQARRELEDLQEKERERAQRADLLQFQLEELEEL---NLQP 204
Query: 220 --YEQLQGEMTRLTQENESAKEEVKEVLQALE 249
E+L+ E RL +E E ++ L+ L
Sbjct: 205 GEDEELEEERKRL-SNSEKLAEAIQNALELLS 235
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 32.7 bits (75), Expect = 0.25
Identities = 41/181 (22%), Positives = 67/181 (37%), Gaps = 25/181 (13%)
Query: 92 RYEKEKEKSSRYKGK--------------LEKLELELSRWRNGETVKEDEQVNLADPVDM 137
RYE+++E++ + + LE LE E+ R E E+ LA+ V
Sbjct: 224 RYEEQREQARETRDEADEVLEEHEERREELETLEAEIEDLRETIAETEREREELAEEVRD 283
Query: 138 AASIAPTPESAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQ 197
E +L + L E +E RE L + +E + + +
Sbjct: 284 LRERLEELEEERDDLL-----AEAGLDDADAE---AVEARREELEDRDEELRDRLEECRV 335
Query: 198 YAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEV---KEVLQALEEPTRT 254
A+ E+ E E+L+ E L E E A+E V +E ++ LEE
Sbjct: 336 AAQAHNEEAESLREDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEE 395
Query: 255 L 255
L
Sbjct: 396 L 396
Score = 31.9 bits (73), Expect = 0.38
Identities = 28/109 (25%), Positives = 51/109 (46%), Gaps = 10/109 (9%)
Query: 163 LGSLSNEERQKLEEERERLYQQLDEKD----EEINQQSQYAEKLKEQIMEQEELITTARR 218
L L++E R++L E+RER + E D EE + + AE+ EQ+ E+ + + R
Sbjct: 618 LAELNDERRERLAEKRERKRELEAEFDEARIEEAREDKERAEEYLEQVEEKLDELREERD 677
Query: 219 DYEQLQGEMTRLTQENESAKEEVKEV------LQALEEPTRTLLNFFRQ 261
D + G + +E E +E + + L+AL + L + +
Sbjct: 678 DLQAEIGAVENELEELEELRERREALENRVEALEALYDEAEELESMYGD 726
Score = 31.9 bits (73), Expect = 0.38
Identities = 25/81 (30%), Positives = 42/81 (51%), Gaps = 7/81 (8%)
Query: 170 ERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTR 229
++LEE RE L + + E+ E I ++ + AE+L+E+ E E + E+ +
Sbjct: 510 RIERLEERREDLEELIAERRETIEEKRERAEELRERAAELEA-------EAEEKREAAAE 562
Query: 230 LTQENESAKEEVKEVLQALEE 250
+E E A+EEV E+ L E
Sbjct: 563 AEEEAEEAREEVAELNSKLAE 583
Score = 31.9 bits (73), Expect = 0.45
Identities = 21/82 (25%), Positives = 37/82 (45%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
E ++L EE L +L+E E + + + E+L+E+I E E A D + +
Sbjct: 356 ERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAEDFLE 415
Query: 229 RLTQENESAKEEVKEVLQALEE 250
L +E + +E E+ L
Sbjct: 416 ELREERDELREREAELEATLRT 437
Score = 31.2 bits (71), Expect = 0.74
Identities = 25/92 (27%), Positives = 42/92 (45%), Gaps = 11/92 (11%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAE------KLKEQIMEQEELITTAR----R 218
E ++LE E E L ++++E +E + + E +L+E+ + EELI R
Sbjct: 475 ERVEELEAELEDLEEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAERRETIEE 534
Query: 219 DYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
E+ + R E E+ EE +E EE
Sbjct: 535 KRERAEELRER-AAELEAEAEEKREAAAEAEE 565
Score = 30.4 bits (69), Expect = 1.3
Identities = 25/84 (29%), Positives = 43/84 (51%), Gaps = 3/84 (3%)
Query: 169 EERQKLEEERERLYQQLDEKD---EEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQG 225
EE ++ EE+RE+ + DE D EE ++ + E L+ +I + E I R+ E+L
Sbjct: 220 EEIERYEEQREQARETRDEADEVLEEHEERREELETLEAEIEDLRETIAETEREREELAE 279
Query: 226 EMTRLTQENESAKEEVKEVLQALE 249
E+ L + E +EE ++L
Sbjct: 280 EVRDLRERLEELEEERDDLLAEAG 303
Score = 29.6 bits (67), Expect = 2.2
Identities = 23/87 (26%), Positives = 38/87 (43%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
E+ LEE E L ++ E + E+ + + E +E+I E EE I R + ++
Sbjct: 349 EDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLG 408
Query: 229 RLTQENESAKEEVKEVLQALEEPTRTL 255
E +EE E+ + E TL
Sbjct: 409 NAEDFLEELREERDELREREAELEATL 435
>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 429
Score = 32.6 bits (75), Expect = 0.26
Identities = 15/76 (19%), Positives = 38/76 (50%), Gaps = 3/76 (3%)
Query: 173 KLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQ 232
+L+EER +L ++L+E E N+ S+ ++ + E+ + ++L+ ++ L
Sbjct: 33 ELDEERRKLLRELEELQAERNELSK---EIGRALKRGEDDAEELIAEVKELKEKLKELEA 89
Query: 233 ENESAKEEVKEVLQAL 248
+ + E+ +L +
Sbjct: 90 ALDELEAELDTLLLTI 105
>gnl|CDD|144876 pfam01442, Apolipoprotein, Apolipoprotein A1/A4/E domain. These
proteins contain several 22 residue repeats which form a
pair of alpha helices. This family includes:
Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein
E.
Length = 191
Score = 31.4 bits (72), Expect = 0.33
Identities = 23/101 (22%), Positives = 47/101 (46%), Gaps = 14/101 (13%)
Query: 163 LGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQ-YAEKLKEQIMEQ------------ 209
LG + E +L +E E L ++L + E++ + Q Y ++LK ++ +
Sbjct: 21 LGPYAQEFWARLSKETEALREELQKDLEDVRARLQPYLDELKAKVGQNLEELRQRLAPYA 80
Query: 210 EELITTARRDYEQLQGEMTRLTQE-NESAKEEVKEVLQALE 249
EEL RD E+L+ ++ +E + ++ V+ + L
Sbjct: 81 EELRKRLNRDAEELRRKLAPYAEELRDRLRQNVEALRARLG 121
Score = 31.4 bits (72), Expect = 0.35
Identities = 27/125 (21%), Positives = 48/125 (38%), Gaps = 28/125 (22%)
Query: 166 LSNEERQKLEEERERLYQQLDEKDEEINQQSQ--------YAEKLKEQIMEQ-------- 209
L E ++ LE+ R RL LDE ++ Q + YAE+L++++
Sbjct: 39 LREELQKDLEDVRARLQPYLDELKAKVGQNLEELRQRLAPYAEELRKRLNRDAEELRRKL 98
Query: 210 ----EELITTARRDYEQLQGEM--------TRLTQENESAKEEVKEVLQALEEPTRTLLN 257
EEL R++ E L+ + +L + E KE V + + +
Sbjct: 99 APYAEELRDRLRQNVEALRARLGPYVEELRQKLAERLEELKESVGPYAEEYKAQLSEQVE 158
Query: 258 FFRQS 262
R+
Sbjct: 159 ELREK 163
>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
Length = 343
Score = 31.9 bits (73), Expect = 0.33
Identities = 17/88 (19%), Positives = 35/88 (39%), Gaps = 5/88 (5%)
Query: 163 LGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQ 222
+ +L E ER RL L E A +L +++ ++++ A E
Sbjct: 83 VANLRASLSAA-EAERSRLQALLAELAGAGAAAEGRAGELAQELDSEKQVSARALAQVEL 141
Query: 223 LQGEMTRLTQENESAKEEVKEVLQALEE 250
L ++ L ++ + ++ L A E+
Sbjct: 142 LNQQIAALRRQLAA----LEAALDASEK 165
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 157
Score = 31.1 bits (71), Expect = 0.33
Identities = 13/84 (15%), Positives = 44/84 (52%), Gaps = 3/84 (3%)
Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQ--SQYAEKLKEQIMEQEELITTARRDYEQLQG 225
+E +KL+ E ++ ++L ++++++ +Q + E K + E ++ ++ + Q
Sbjct: 41 EKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQKQQELQQKQQAAQQ 100
Query: 226 EMTRLTQE-NESAKEEVKEVLQAL 248
E+ + QE + +++ + ++ +
Sbjct: 101 ELQQKQQELLQPIYDKIDKAIKEV 124
>gnl|CDD|222290 pfam13654, AAA_32, AAA domain. This family includes a wide variety
of AAA domains including some that have lost essential
nucleotide binding residues in the P-loop.
Length = 509
Score = 32.1 bits (74), Expect = 0.35
Identities = 22/104 (21%), Positives = 46/104 (44%), Gaps = 28/104 (26%)
Query: 168 NEERQKLEEERERLYQQLDE---------------------KDEEINQQSQYAEKLKEQI 206
E ++ +E+RE +++L+E KD E + ++ E L E+
Sbjct: 118 EEIEEEFQEKREEAFEELEEEAKEKGFALVRTPGGFVFAPLKDGEPLTEEEF-EALPEE- 175
Query: 207 MEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
E+EEL ++L+ E+ + ++ + E +E L+ L+
Sbjct: 176 -EREEL----EEKIDELEEELQEILRQLRELEREAREKLRELDR 214
Score = 29.7 bits (68), Expect = 2.0
Identities = 21/92 (22%), Positives = 41/92 (44%), Gaps = 21/92 (22%)
Query: 165 SLSNEERQKLEEERERLYQQLDEKDEEINQ-QSQYAEKLKE---QIMEQ--EELITTARR 218
+L EER++LEE+ + L ++L E ++ + + + EKL+E ++ L+ +
Sbjct: 171 ALPEEEREELEEKIDELEEELQEILRQLRELEREAREKLRELDREVALFAVGPLLDELKE 230
Query: 219 DYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
Y L +V L A++E
Sbjct: 231 KYADL---------------PKVLAYLDAVQE 247
>gnl|CDD|234780 PRK00488, pheS, phenylalanyl-tRNA synthetase subunit alpha;
Validated.
Length = 339
Score = 31.6 bits (73), Expect = 0.37
Identities = 14/48 (29%), Positives = 22/48 (45%), Gaps = 4/48 (8%)
Query: 163 LGSLSNEER----QKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQI 206
LG L EER + E ++ + L+E+ EE+ + A E I
Sbjct: 46 LGKLPPEERKEAGALINELKQAIEAALEERKEELEAAALNARLAAETI 93
>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
HlyD family. Type I secretion is an ABC transport
process that exports proteins, without cleavage of any
signal sequence, from the cytosol to extracellular
medium across both inner and outer membranes. The
secretion signal is found in the C-terminus of the
transported protein. This model represents the adaptor
protein between the ATP-binding cassette (ABC) protein
of the inner membrane and the outer membrane protein,
and is called the membrane fusion protein. This model
selects a subfamily closely related to HlyD; it is
defined narrowly and excludes, for example, colicin V
secretion protein CvaA and multidrug efflux proteins
[Protein fate, Protein and peptide secretion and
trafficking].
Length = 423
Score = 31.9 bits (73), Expect = 0.39
Identities = 30/161 (18%), Positives = 58/161 (36%), Gaps = 25/161 (15%)
Query: 107 LEKLELELSRWRNGETVKEDEQVNLADPVDMAASIAPTPESAPASILPAIPGSGLMLGSL 166
+LE ++ R E + A P+ ++ PA+P SL
Sbjct: 83 AAELESQVLRLEAEVARLRAEADSQAAIEF--------PDDLLSAEDPAVPELIKGQQSL 134
Query: 167 SNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQI-MEQEELI------------ 213
+ L + E + Q+ + + E+ + L++Q+ + EEL
Sbjct: 135 FESRKSTLRAQLELILAQIKQLEAELAGLQAQLQALRQQLEVISEELEARRKLKEKGLVS 194
Query: 214 ----TTARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
R+ + QGE+ RL E E K ++ E+ ++
Sbjct: 195 RLELLELERERAEAQGELGRLEAELEVLKRQIDELQLERQQ 235
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 32.0 bits (73), Expect = 0.39
Identities = 16/82 (19%), Positives = 37/82 (45%)
Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
R ++ E+ L L E+ + + +Q E+ K+ + + ++ ++ E+L+
Sbjct: 174 AAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEELRANE 233
Query: 228 TRLTQENESAKEEVKEVLQALE 249
+RL E SA+ + +A
Sbjct: 234 SRLKNEIASAEAAAAKAREAAA 255
Score = 29.7 bits (67), Expect = 1.9
Identities = 11/84 (13%), Positives = 36/84 (42%)
Query: 172 QKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLT 231
+++ +++ +++ +++I +Q KL++Q+ E I + + ++ +L
Sbjct: 34 AADDKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLR 93
Query: 232 QENESAKEEVKEVLQALEEPTRTL 255
++ + + E R L
Sbjct: 94 KQIADLNARLNALEVQEREQRRRL 117
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 31.8 bits (72), Expect = 0.39
Identities = 22/89 (24%), Positives = 39/89 (43%), Gaps = 2/89 (2%)
Query: 167 SNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDY--EQLQ 224
+ E + K E+ERL Q E+ + QQ Q E K+ +EQ++ AR+ ++ +
Sbjct: 89 AEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKK 148
Query: 225 GEMTRLTQENESAKEEVKEVLQALEEPTR 253
E + E+AK + + E
Sbjct: 149 AEAAKAKAAAEAAKLKAAAEAKKKAEEAA 177
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found
in cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 31.5 bits (72), Expect = 0.43
Identities = 25/104 (24%), Positives = 45/104 (43%), Gaps = 9/104 (8%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
E+ +KL +E ++L+E +EE+ + E L + E I A + EQ +G T
Sbjct: 211 EKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIAEAEKKLEQCRG-FT 269
Query: 229 RLTQENESAKEEVKEVLQALEEPTRTLLNFFRQSSPKGTLLYNH 272
E++KE L+ L+ T + S ++ Y+
Sbjct: 270 F------KEIEKLKEQLKLLQS--LTGWKITKLSGNTLSMTYDR 305
Score = 30.4 bits (69), Expect = 0.99
Identities = 19/100 (19%), Positives = 41/100 (41%), Gaps = 11/100 (11%)
Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAE-----------KLKEQIMEQEELITTA 216
+E + L + +L + D +EE+ Q Q + + KE++ + + I
Sbjct: 164 MKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDCDPTELDRAKEKLKKLLQEIMIK 223
Query: 217 RRDYEQLQGEMTRLTQENESAKEEVKEVLQALEEPTRTLL 256
+ E+L+ E+ L + E + E+ + E + L
Sbjct: 224 VKKLEELEEELQELESKIEDLTNKKSELNTEIAEAEKKLE 263
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 31.9 bits (73), Expect = 0.44
Identities = 28/90 (31%), Positives = 42/90 (46%), Gaps = 14/90 (15%)
Query: 170 ERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT- 228
ERQK +EE E L ++LD+ +++ + + + M EE +Y+Q E+T
Sbjct: 192 ERQKAKEEDEDLREELDDDFKDLMSLLRTVKPPPKPPMTPEEK----DDEYDQRVRELTF 247
Query: 229 --------RLTQENESAKEEVKEVLQALEE 250
R E E AKEE E L+ LE
Sbjct: 248 DRRAQPTDRTKTEEELAKEE-AERLKKLEA 276
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 31.5 bits (72), Expect = 0.48
Identities = 8/40 (20%), Positives = 23/40 (57%)
Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIM 207
E+ ++LEE + L ++ E + ++ + + E L++++
Sbjct: 327 REKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLEKRLE 366
>gnl|CDD|220484 pfam09943, DUF2175, Uncharacterized protein conserved in archaea
(DUF2175). This domain, found in various hypothetical
archaeal proteins, has no known function.
Length = 101
Score = 29.7 bits (67), Expect = 0.48
Identities = 17/71 (23%), Positives = 38/71 (53%), Gaps = 4/71 (5%)
Query: 166 LSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEEL---ITTARRDYEQ 222
L + +K++ ++ L ++ +++ A++L EQ+ E EE+ I R+D E+
Sbjct: 31 LVEDASKKIKGDKPEDLVALLRLEDYLHEGIVLAKEL-EQVAESEEIKKVIRQVRKDLEK 89
Query: 223 LQGEMTRLTQE 233
L ++TR ++
Sbjct: 90 LAAKLTRKIEK 100
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 31.6 bits (72), Expect = 0.49
Identities = 20/96 (20%), Positives = 51/96 (53%), Gaps = 7/96 (7%)
Query: 166 LSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQG 225
L++E +KLE+E ++L +L+++++E+ + EK E++ ++ EL+ + ++
Sbjct: 283 LASELEKKLEKELKKLENKLEKQEDELEE----LEKAAEELRQKGELLYAN---LQLIEE 335
Query: 226 EMTRLTQENESAKEEVKEVLQALEEPTRTLLNFFRQ 261
+ + + EE+K L + P+ +F++
Sbjct: 336 GLKSVRLADFYGNEEIKIELDKSKTPSENAQRYFKK 371
Score = 27.7 bits (62), Expect = 8.2
Identities = 28/172 (16%), Positives = 57/172 (33%), Gaps = 27/172 (15%)
Query: 74 IKNVVTVNEELTAEEWKRRYEKEKEKSSRYKGKLEKLELELSRWRNGETVKEDEQVNLAD 133
++ + E E+ + E+ ++ + + K E L L G V LAD
Sbjct: 291 LEKELKKLEN-KLEKQEDELEELEKAAEELRQKGELLYANLQLIEEGLK-----SVRLAD 344
Query: 134 PVDMAASIAPTPESAPASILPAIPGSGLMLGSLSNEER-----QKLEEERERLYQQLDEK 188
E + + S N +R +KL+ + L +QL E
Sbjct: 345 FYGN--------EEIKIELDKSKTPS-------ENAQRYFKKYKKLKGAKVNLDRQLSEL 389
Query: 189 DEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEE 240
E I A+ E+ ++ + E+ + + ++ + E+
Sbjct: 390 KEAIAYYES-AKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEK 440
>gnl|CDD|221533 pfam12329, TMF_DNA_bd, TATA element modulatory factor 1 DNA
binding. This is the middle region of a family of TATA
element modulatory factor 1 proteins conserved in
eukaryotes that contains at its N-terminal section a
number of leucine zippers that could potentially form
coiled coil structures. The whole proteins bind to the
TATA element of some RNA polymerase II promoters and
repress their activity. by competing with the binding of
TATA binding protein. TMFs are evolutionarily conserved
golgins that bind Rab6, a ubiquitous ras-like
GTP-binding Golgi protein, and contribute to Golgi
organisation in animal and plant cells.
Length = 74
Score = 29.2 bits (66), Expect = 0.50
Identities = 18/68 (26%), Positives = 39/68 (57%)
Query: 183 QQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVK 242
++L EKDE+I Q + EKL ++ ++ I R ++L+ E+ L ++ E ++E++
Sbjct: 5 KKLAEKDEQIAQLMEEGEKLSKKELKHNNTIKKLRAKNKELEKEIAELKKKLEKLEKELE 64
Query: 243 EVLQALEE 250
+ + L+
Sbjct: 65 NLEERLKR 72
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 31.5 bits (72), Expect = 0.51
Identities = 20/80 (25%), Positives = 36/80 (45%), Gaps = 3/80 (3%)
Query: 162 MLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQ--YAEKLKEQIMEQEELITTARRD 219
+L L E +LE + L +QL + E ++ + Y EKL++++ Q E ++
Sbjct: 201 LLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEEKLRQELERQAEAHEQKLKN 260
Query: 220 YEQLQG-EMTRLTQENESAK 238
LQ E+ R + K
Sbjct: 261 ELALQAIELQREFNKEIKEK 280
>gnl|CDD|226883 COG4477, EzrA, Negative regulator of septation ring formation [Cell
division and chromosome partitioning].
Length = 570
Score = 31.6 bits (72), Expect = 0.51
Identities = 23/85 (27%), Positives = 43/85 (50%)
Query: 172 QKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLT 231
+K E+E + L LDE E I Q +L++ + E E+ +T + E++Q +T L
Sbjct: 350 RKFEKELKELESVLDEILENIEAQEVAYSELQDNLEEIEKALTDIEDEQEKVQEHLTSLR 409
Query: 232 QENESAKEEVKEVLQALEEPTRTLL 256
++ A+E ++ + L E R +
Sbjct: 410 KDELEARENLERLKSKLHEIKRYME 434
>gnl|CDD|237256 PRK12901, secA, preprotein translocase subunit SecA; Reviewed.
Length = 1112
Score = 31.5 bits (72), Expect = 0.55
Identities = 18/55 (32%), Positives = 32/55 (58%), Gaps = 5/55 (9%)
Query: 176 EERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELIT--TARRDYEQLQGEMT 228
+ERE +Y Q+D+ ++E + EK+ ++I+ + I TARR E + E+T
Sbjct: 78 DEREDIYAQIDKLEKEAYEIL---EKVLDEILPEAFAIVKETARRFAENEEIEVT 129
>gnl|CDD|109486 pfam00430, ATP-synt_B, ATP synthase B/B' CF(0). Part of the CF(0)
(base unit) of the ATP synthase. The base unit is
thought to translocate protons through membrane (inner
membrane in mitochondria, thylakoid membrane in plants,
cytoplasmic membrane in bacteria). The B subunits are
thought to interact with the stalk of the CF(1)
subunits. This domain should not be confused with the ab
CF(1) proteins (in the head of the ATP synthase) which
are found in pfam00006.
Length = 132
Score = 30.3 bits (69), Expect = 0.58
Identities = 25/84 (29%), Positives = 40/84 (47%), Gaps = 7/84 (8%)
Query: 166 LSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQG 225
+ L E ++L Q E E IN + A+KLKE+I + A++D E+L
Sbjct: 45 RLKQAAALLAEAEQQLAQARAEASEIINNAKKEAQKLKEEI------LAEAQKDAERLLE 98
Query: 226 E-MTRLTQENESAKEEVKEVLQAL 248
+ QE E A E+++ + AL
Sbjct: 99 SARAEIEQEKEQALAELRQQVAAL 122
>gnl|CDD|226396 COG3879, COG3879, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 247
Score = 30.8 bits (70), Expect = 0.66
Identities = 11/62 (17%), Positives = 26/62 (41%)
Query: 194 QQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEEPTR 253
Q S+ + + ++ + + + ++ L E+ L + +S + V ALE+
Sbjct: 40 QTSKGESVRRARDLDLVKELRSLQKKVNTLAAEVEDLENKLDSVRRSVLTDDAALEDRLE 99
Query: 254 TL 255
L
Sbjct: 100 KL 101
>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
interacts with SMC proteins [Cell division and
chromosome partitioning].
Length = 622
Score = 31.1 bits (70), Expect = 0.73
Identities = 21/87 (24%), Positives = 45/87 (51%), Gaps = 4/87 (4%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
E + EEE + L +DE +++ +Q E+ + E+E+L R+ +++ +
Sbjct: 337 SEIELKEEEIKALQSNIDELHKQLRKQGISTEQFELMNQEREKLT----RELDKINIQSD 392
Query: 229 RLTQENESAKEEVKEVLQALEEPTRTL 255
+LT+ +S K E + + ++LE+ R
Sbjct: 393 KLTKSVKSRKLEAQGIFKSLEKTLRQY 419
>gnl|CDD|153280 cd07596, BAR_SNX, The Bin/Amphiphysin/Rvs (BAR) domain of Sorting
Nexins. BAR domains are dimerization, lipid binding and
curvature sensing modules found in many different
proteins with diverse functions. Sorting nexins (SNXs)
are Phox homology (PX) domain containing proteins that
are involved in regulating membrane traffic and protein
sorting in the endosomal system. SNXs differ from each
other in their lipid-binding specificity, subcellular
localization and specific function in the endocytic
pathway. A subset of SNXs also contain BAR domains. The
PX-BAR structural unit determines the specific membrane
targeting of SNXs. BAR domains form dimers that bind to
membranes, induce membrane bending and curvature, and
may also be involved in protein-protein interactions.
Length = 218
Score = 30.4 bits (69), Expect = 0.73
Identities = 13/75 (17%), Positives = 28/75 (37%)
Query: 176 EERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENE 235
++R L +++ + EKLK + + + E+ + + + E
Sbjct: 110 DDRADALLTLQSLKKDLASKKAQLEKLKAAPGIKPAKVEELEEELEEAESALEEARKRYE 169
Query: 236 SAKEEVKEVLQALEE 250
E +KE L+ E
Sbjct: 170 EISERLKEELKRFHE 184
>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812). This
family consists of several eukaryotic proteins of
unknown function.
Length = 536
Score = 30.9 bits (70), Expect = 0.75
Identities = 17/63 (26%), Positives = 30/63 (47%)
Query: 171 RQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRL 230
R L +E L ++ K++E +Q +KL+ +I E E + T + Y+QL E
Sbjct: 363 RTPLIDEYRSLKEKNRNKEDETQRQLDEIKKLRNKIEELESELQTKEQLYKQLLDEYENA 422
Query: 231 TQE 233
+
Sbjct: 423 PKS 425
Score = 29.0 bits (65), Expect = 3.8
Identities = 13/61 (21%), Positives = 32/61 (52%)
Query: 183 QQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVK 242
++D +++ ++ E L+EQI E E I + + L ++ ++ +E + ++E +
Sbjct: 256 AEMDTEEDRTKEREAELEALQEQIDELESSIEEVLSEIKALASKIKQVNEELTTVRQENE 315
Query: 243 E 243
E
Sbjct: 316 E 316
Score = 27.4 bits (61), Expect = 9.3
Identities = 24/104 (23%), Positives = 41/104 (39%), Gaps = 13/104 (12%)
Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
E E+ + +L+ E+I++ E++ +I I +Q+ E+
Sbjct: 255 PAEMDTEEDRTKEREAELEALQEQIDELESSIEEVLSEIKALASKI-------KQVNEEL 307
Query: 228 TRLTQENESAKEEVK------EVLQALEEPTRTLLNFFRQSSPK 265
T + QENE +EE K E+L E L SS +
Sbjct: 308 TTVRQENEELEEEYKIKKRTVELLPDAENNVAKLQALVVASSER 351
>gnl|CDD|219186 pfam06810, Phage_GP20, Phage minor structural protein GP20. This
family consists of several phage minor structural
protein GP20 sequences of around 180 residues in length.
The function of this family is unknown.
Length = 156
Score = 30.0 bits (68), Expect = 0.79
Identities = 23/68 (33%), Positives = 39/68 (57%), Gaps = 18/68 (26%)
Query: 173 KLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQ 232
K EER+ L +QL +D++I E LK+++ +D E+LQ ++ +L Q
Sbjct: 24 KANEERDSLKKQLKNRDKQI-------EDLKKKV-----------KDNEELQKKIEKLKQ 65
Query: 233 ENESAKEE 240
+N++AKEE
Sbjct: 66 QNKTAKEE 73
>gnl|CDD|240376 PTZ00350, PTZ00350, adenylosuccinate synthetase; Provisional.
Length = 436
Score = 30.8 bits (70), Expect = 0.83
Identities = 19/67 (28%), Positives = 37/67 (55%), Gaps = 6/67 (8%)
Query: 159 SGLMLGSLSNEE------RQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEEL 212
+GL +G L N E R+ +E+ +E+ + + +EE+ + YAEKLK+ I++
Sbjct: 157 TGLRVGDLLNFETFEKKYRKLVEKLQEQYNIEEYDAEEELERYKGYAEKLKDMIVDTVYF 216
Query: 213 ITTARRD 219
+ A ++
Sbjct: 217 MNKAIKE 223
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 31.3 bits (70), Expect = 0.84
Identities = 38/184 (20%), Positives = 79/184 (42%), Gaps = 22/184 (11%)
Query: 69 RRAKTIKNVVTVNEELTAEEWKRRYEKEKEKSSRYKGKLEKLELELSRWRNGETVKEDEQ 128
R + + + +++ AEE K+ E + + K + EK ++E + + E K+ E+
Sbjct: 1592 ARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEE 1651
Query: 129 VNLADPVDM--AASIAPTPESAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLD 186
+ A+ + AA A E EE +K EE+ ++ + L
Sbjct: 1652 LKKAEEENKIKAAEEAKKAEEDKKKA----------------EEAKKAEEDEKKAAEALK 1695
Query: 187 EKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQ 246
++ EE + + +K E+ + EEL ++ E+ + + +E E K++ +E +
Sbjct: 1696 KEAEEAKKAEELKKKEAEEKKKAEEL----KKAEEENKIKAEEAKKEAEEDKKKAEEAKK 1751
Query: 247 ALEE 250
EE
Sbjct: 1752 DEEE 1755
>gnl|CDD|221101 pfam11382, DUF3186, Protein of unknown function (DUF3186). This
bacterial family of proteins has no known function.
Length = 307
Score = 30.8 bits (70), Expect = 0.85
Identities = 39/156 (25%), Positives = 56/156 (35%), Gaps = 36/156 (23%)
Query: 158 GSGL----MLGSLSNE------ERQKLEEERERLYQQLDEKDEEINQ--QSQYAEKLKE- 204
GSGL +L L +E E +L ERE L +Q D+ I A +L
Sbjct: 25 GSGLLQENLLSGLEDEFSDLRTENDRLRAEREALNEQASAADQFIAALAPRLVAGRLTGK 84
Query: 205 QIMEQEELITTARRDY-------EQLQ-------GEMTRLTQE--NESAKEEVKEVLQAL 248
+ L+ T D E L G +T LT + + +++K V+
Sbjct: 85 SVA----LVRTPDADDDDVDAVSEALTQAGATVVGRIT-LTDNFVDPESADKLKSVVNND 139
Query: 249 EEPTRTLLNF--FRQSSPKGTLLYNHYLLGWPNQSE 282
+P L+ S G LL LL P S
Sbjct: 140 LKPAGDQLSTAAADPGSQAGDLLGAALLLNPPTAST 175
>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein. Rotavirus particles
consist of three concentric proteinaceous capsid layers.
The innermost capsid (core) is made of VP2. The genomic
RNA and the two minor proteins VP1 and VP3 are
encapsidated within this layer. The N-terminus of
rotavirus VP2 is necessary for the encapsidation of VP1
and VP3.
Length = 887
Score = 31.1 bits (70), Expect = 0.86
Identities = 16/80 (20%), Positives = 34/80 (42%), Gaps = 2/80 (2%)
Query: 177 ERERLYQQLDEKDEEINQQSQYAEK--LKEQIMEQEELITTARRDYEQLQGEMTRLTQEN 234
RE D E+ +++ + LKE++++++E + T D + +
Sbjct: 6 RREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKIA 65
Query: 235 ESAKEEVKEVLQALEEPTRT 254
+ K+ KE + L E +T
Sbjct: 66 DEVKKSTKEESKQLLEVLKT 85
Score = 28.4 bits (63), Expect = 6.1
Identities = 17/103 (16%), Positives = 43/103 (41%), Gaps = 1/103 (0%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
++++++ + + E + + + KE+ + E++ T +++Q E+
Sbjct: 39 DKKEEVVTDNVDSPVKEQSSQENLKIADEVKKSTKEESKQLLEVLKTKEEHQKEIQYEIL 98
Query: 229 RLTQENESAKEEV-KEVLQALEEPTRTLLNFFRQSSPKGTLLY 270
+ T + KE + K++ E + FR PK +Y
Sbjct: 99 QKTIPSFEPKESILKKLEDIKPEQAKKQTKLFRLFEPKQLPIY 141
>gnl|CDD|221489 pfam12252, SidE, Dot/Icm substrate protein. This family of proteins
is found in bacteria. Proteins in this family are
typically between 397 and 1543 amino acids in length.
This family is the SidE protein in the Dot/Icm pathway of
Legionella pneumophila bacteria. There is little
literature describing the family.
Length = 1443
Score = 31.0 bits (70), Expect = 0.87
Identities = 19/85 (22%), Positives = 44/85 (51%), Gaps = 6/85 (7%)
Query: 165 SLSNEERQKLEEERERLYQQLDEKDEEIN--QQSQYAEKLKEQIMEQEELITTARRDYEQ 222
SL+ + + E + + + Q + + +S AEKL+E I + ++L+ ++ Y +
Sbjct: 1268 SLNQMKPKTFEMQEKEIQQNFELLAKLEKTLDKSDTAEKLREDIPKLKDLLIAKQKAYPE 1327
Query: 223 LQGEMTRLTQENESAKEEVKEVLQA 247
M +L ++E+ +++E+ QA
Sbjct: 1328 ----MVQLQYKSEALITQLRELCQA 1348
>gnl|CDD|153311 cd07627, BAR_Vps5p, The Bin/Amphiphysin/Rvs (BAR) domain of yeast
Sorting Nexin Vps5p. BAR domains are dimerization,
lipid binding and curvature sensing modules found in
many different proteins with diverse functions. Sorting
nexins (SNXs) are Phox homology (PX) domain containing
proteins that are involved in regulating membrane
traffic and protein sorting in the endosomal system.
SNXs differ from each other in their lipid-binding
specificity, subcellular localization and specific
function in the endocytic pathway. A subset of SNXs also
contain BAR domains. The PX-BAR structural unit
determines the specific membrane targeting of SNXs.
Vsp5p is the yeast counterpart of human SNX1 and is part
of the retromer complex, which functions in the
endosome-to-Golgi retrieval of vacuolar protein sorting
receptor Vps10p, the Golgi-resident membrane protein
A-ALP, and endopeptidase Kex2. BAR domains form dimers
that bind to membranes, induce membrane bending and
curvature, and may also be involved in protein-protein
interactions.
Length = 216
Score = 30.4 bits (69), Expect = 0.91
Identities = 19/73 (26%), Positives = 37/73 (50%)
Query: 178 RERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESA 237
R++L+Q + E++++ EKLK Q Q+E + + + E+ + + L +E E
Sbjct: 110 RQKLWQYWQSAESELSKKKAQLEKLKRQGKTQQEKLNSLLSELEEAERRASELKKEFEEV 169
Query: 238 KEEVKEVLQALEE 250
E +K L+ E
Sbjct: 170 SELIKSELERFER 182
>gnl|CDD|203371 pfam06005, DUF904, Protein of unknown function (DUF904). This
family consists of several bacterial and archaeal
hypothetical proteins of unknown function.
Length = 72
Score = 28.4 bits (64), Expect = 0.95
Identities = 16/69 (23%), Positives = 35/69 (50%)
Query: 182 YQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEV 241
+ L+E + +I Q + L+ +I E +E + E+L+ E +L +E + + +
Sbjct: 3 LELLEELESKIQQAVETIALLQMEIEELKEENEQLSEENEELEEENKKLKEERNAWQSRL 62
Query: 242 KEVLQALEE 250
+ +L L+E
Sbjct: 63 RALLGKLDE 71
>gnl|CDD|165311 PHA03011, PHA03011, hypothetical protein; Provisional.
Length = 120
Score = 29.1 bits (65), Expect = 1.0
Identities = 12/39 (30%), Positives = 20/39 (51%)
Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQI 206
+E +E E + L + + D+EI+ +KLKE I
Sbjct: 77 LDEYNLIENEIKDLEIIIQDNDDEIHFLRAEIDKLKENI 115
>gnl|CDD|237616 PRK14127, PRK14127, cell division protein GpsB; Provisional.
Length = 109
Score = 28.8 bits (65), Expect = 1.1
Identities = 13/50 (26%), Positives = 18/50 (36%), Gaps = 1/50 (2%)
Query: 218 RDYEQLQGEMTRLTQENESAKEEVKEVLQALEEPTRTLLNFFRQSSPKGT 267
+DYE Q E+ L QEN K +V E L + +
Sbjct: 37 KDYEAFQKEIEELQQENARLKAQVDE-LTKQVSVGASSSSVATTQPSSSA 85
>gnl|CDD|227570 COG5245, DYN1, Dynein, heavy chain [Cytoskeleton].
Length = 3164
Score = 30.7 bits (69), Expect = 1.1
Identities = 24/103 (23%), Positives = 43/103 (41%), Gaps = 9/103 (8%)
Query: 172 QKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQE-ELITTARRDYEQLQGE-MTR 229
++L+ L + + + N+ + +LKEQ+ L + + L G R
Sbjct: 2088 EELDCTNVNLVEGV----RKYNEYGRGMGELKEQLSNTVVILGVKEKNADDALSGTPGER 2143
Query: 230 LTQENES-AKEEVKEVLQALEEPTRTLLNFFRQ--SSPKGTLL 269
L +E +S E +++L LEE R + SS K +L
Sbjct: 2144 LEREVKSVFVEAPRDMLFLLEEEVRKRKGSVMKFKSSKKPAVL 2186
>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
Provisional.
Length = 389
Score = 30.2 bits (69), Expect = 1.2
Identities = 13/46 (28%), Positives = 23/46 (50%)
Query: 190 EEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENE 235
E + + +L+EQI + E + + E+L+ E+ RL E E
Sbjct: 1 ERLEALEERNSELEEQIRQLELKLRDLEAENEKLERELERLKSELE 46
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 30.2 bits (68), Expect = 1.3
Identities = 21/83 (25%), Positives = 35/83 (42%), Gaps = 4/83 (4%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQ---- 224
E +QK E+ERL Q E+ Q+ Q E K+ ++Q++ A + +
Sbjct: 91 ELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAE 150
Query: 225 GEMTRLTQENESAKEEVKEVLQA 247
E R + A E K+ +A
Sbjct: 151 AEAKRAAAAAKKAAAEAKKKAEA 173
>gnl|CDD|218006 pfam04286, DUF445, Protein of unknown function (DUF445). Predicted
to be a membrane protein.
Length = 366
Score = 29.9 bits (68), Expect = 1.4
Identities = 17/81 (20%), Positives = 34/81 (41%), Gaps = 5/81 (6%)
Query: 171 RQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRL 230
RQ+ ++ RL L E + EKLK +++ + +E L+ +
Sbjct: 206 RQEFDKALRRLIDDLLNDPELAAE----VEKLKRRLLADPAVQDYLEGLWESLKDALLDD 261
Query: 231 TQENESA-KEEVKEVLQALEE 250
+ +S ++ + E+L L E
Sbjct: 262 LADPDSLLRQRISELLAELGE 282
>gnl|CDD|163064 TIGR02894, DNA_bind_RsfA, transcription factor, RsfA family. In a
subset of endospore-forming members of the Firmcutes,
members of this protein family are found, several to a
genome. Two very strongly conserved sequences regions
are separated by a highly variable linker region. Much
of the linker region was excised from the seed alignment
for this model. A characterized member is the
prespore-specific transcription RsfA from Bacillus
subtilis, previously called YwfN, which is controlled by
sigma factor F and seems to fine-tune expression of some
genes in the sigma-F regulon. A paralog in Bacillus
subtilis is designated YlbO [Regulatory functions, DNA
interactions, Cellular processes, Sporulation and
germination].
Length = 161
Score = 29.3 bits (66), Expect = 1.4
Identities = 18/63 (28%), Positives = 30/63 (47%), Gaps = 7/63 (11%)
Query: 167 SNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGE 226
+N Q L++E ERL Q + + + + EKL++++ EE DY+ L
Sbjct: 95 TNPSDQALQKENERLKNQNESLQKRNEELEKELEKLRQRLSTIEE-------DYQTLIDI 147
Query: 227 MTR 229
M R
Sbjct: 148 MDR 150
>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
Length = 425
Score = 30.0 bits (69), Expect = 1.4
Identities = 22/78 (28%), Positives = 37/78 (47%), Gaps = 11/78 (14%)
Query: 173 KLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQ 232
+L+EER L +L+E E N S+ + K + + E LI E+ L +
Sbjct: 32 ELDEERRELQTELEELQAERNALSKEIGQAKRKGEDAEALIA-----------EVKELKE 80
Query: 233 ENESAKEEVKEVLQALEE 250
E ++ + E+ E+ LEE
Sbjct: 81 EIKALEAELDELEAELEE 98
>gnl|CDD|183108 PRK11377, PRK11377, dihydroxyacetone kinase subunit M; Provisional.
Length = 473
Score = 30.1 bits (68), Expect = 1.4
Identities = 26/129 (20%), Positives = 49/129 (37%), Gaps = 17/129 (13%)
Query: 134 PVDMAASIAPTPE--SAPASI-LPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDE 190
++ P P S A P + ++ + +EEE+ERL Q +D
Sbjct: 243 EEVAPPTLRPVPSPVSGKAFYYQPVLC-------TVQAKSTLTVEEEQERLRQAIDFTLL 295
Query: 191 EINQQSQYAEKLKEQIMEQEELITTARR----DYEQLQGEMTRLTQENESAKEEVKEVLQ 246
++ K + ++ I + D E L RL E+ +A+ ++VL+
Sbjct: 296 DLM---TLTAKAEASGLDDIAAIFSGHHTLLDDPELLAAASERLQHEHCTAEYAWQQVLK 352
Query: 247 ALEEPTRTL 255
L + + L
Sbjct: 353 ELSQQYQQL 361
>gnl|CDD|220719 pfam10368, YkyA, Putative cell-wall binding lipoprotein. YkyA is a
family of proteins containing a lipoprotein signal and a
hydrolase domain. It is similar to cell wall binding
proteins and might also be recognisable by a host immune
defence system. It is thus likely to belong to pathways
important for pathogenicity.
Length = 205
Score = 29.6 bits (67), Expect = 1.5
Identities = 13/88 (14%), Positives = 46/88 (52%), Gaps = 6/88 (6%)
Query: 169 EERQKLEEERERLYQQ-LDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
++ ++LE++ + LY Q + ++ + +++ + ++E+L+ + E+ + E
Sbjct: 40 KKLEELEKKEQELYDQIIQLGKDDNEAIKKLSDQALANVDKREKLLKKEKESIEKSEEEF 99
Query: 228 TRLTQ-----ENESAKEEVKEVLQALEE 250
+ E++ K++ K++++ ++E
Sbjct: 100 KSAKKYIEKIEDKKLKKKAKQLVKVMKE 127
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 29.8 bits (67), Expect = 1.5
Identities = 25/104 (24%), Positives = 41/104 (39%), Gaps = 12/104 (11%)
Query: 179 ERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAK 238
ERL +L + ++E + QY KL+ Q +YE L E+ L +E E
Sbjct: 32 ERLDSELRDAEKERDTYKQYLSKLESQ--------NVEISNYEALDSELDELKKEEERLL 83
Query: 239 EEVKEVLQALEEPTRTLLNFFRQSSPKGTLLYNHYLLGWPNQSE 282
+E++E LE+ L + + L N L +
Sbjct: 84 DELEE----LEKEDDDLDGELVELQEEKEQLENEELQYLREYNL 123
>gnl|CDD|192773 pfam11559, ADIP, Afadin- and alpha -actinin-Binding. This family
is found in mammals where it is localised at cell-cell
adherens junctions, and in Sch. pombe and other fungi
where it anchors spindle-pole bodies to spindle
microtubules. It is a coiled-coil structure, and in
pombe, it is required for anchoring the minus end of
spindle microtubules to the centrosome equivalent, the
spindle-pole body. The name ADIP derives from the family
being composed of Afadin- and alpha -Actinin-Binding
Proteins Localised at Cell-Cell Adherens Junctions.
Length = 149
Score = 29.2 bits (66), Expect = 1.5
Identities = 21/71 (29%), Positives = 37/71 (52%)
Query: 171 RQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRL 230
R + E RE L + L + + EI + E+LK Q+ + E + + QL+ ++ L
Sbjct: 45 RDRDLEFRESLEETLRKLEAEIERLQNTIERLKTQLEDLERELALLQAKERQLEKKLKTL 104
Query: 231 TQENESAKEEV 241
Q+ ++ KEEV
Sbjct: 105 EQKLKNEKEEV 115
>gnl|CDD|202101 pfam02050, FliJ, Flagellar FliJ protein.
Length = 122
Score = 28.8 bits (65), Expect = 1.6
Identities = 17/83 (20%), Positives = 33/83 (39%), Gaps = 15/83 (18%)
Query: 171 RQKLEEERERLYQQLDEKDEEINQQS---------------QYAEKLKEQIMEQEELITT 215
+Q+LE+ E+L + + E Q S + L E I +Q++ +
Sbjct: 11 QQELEQAEEKLEELQQYRLEYRQQLSGSGQGISAAELRNYQAFISALDEAIAQQQQELEQ 70
Query: 216 ARRDYEQLQGEMTRLTQENESAK 238
A + EQ + + QE + +
Sbjct: 71 AEKQVEQAREQWQEANQERKKLE 93
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 29.6 bits (67), Expect = 1.8
Identities = 17/57 (29%), Positives = 29/57 (50%), Gaps = 2/57 (3%)
Query: 70 RAKTIKNVVTVNEELTAEEWKRRYEKEKEKSSRYKGKLEKLELELSRWRNGETVKED 126
+ K NE+L AE + +KEK+K + K ++E+LE + + T KE+
Sbjct: 310 KRKLKSKFERDNEKLDAE--VKEKKKEKKKEEKKKKQIERLEERIEKLEVQATDKEE 364
>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
Length = 281
Score = 29.6 bits (67), Expect = 1.8
Identities = 19/117 (16%), Positives = 55/117 (47%), Gaps = 7/117 (5%)
Query: 141 IAPTPESAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAE 200
+ A I A + ++ + E +++L ++ + +++ +I ++ + E
Sbjct: 50 REEANDEAKEIIEEAEEDAFEIVEAAEEEAKEELLKKTDEASSIIEKLQMQIEREQEEWE 109
Query: 201 KLKEQIMEQ------EELITTARRD-YEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
+ E+++E+ EE R + E+++ + +L + E ++ +E+L++ EE
Sbjct: 110 EELERLIEEAKAEGYEEGYEKGREEGLEEVRELIEQLNKIIEKLIKKREEILESSEE 166
>gnl|CDD|215969 pfam00521, DNA_topoisoIV, DNA gyrase/topoisomerase IV, subunit A.
Length = 427
Score = 29.8 bits (68), Expect = 1.8
Identities = 11/25 (44%), Positives = 17/25 (68%)
Query: 163 LGSLSNEERQKLEEERERLYQQLDE 187
L L+ EE +KLE+E E L +++ E
Sbjct: 386 LRRLTKEEIEKLEKEIEELEKEIAE 410
>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
(CALCOCO1) like. Proteins found in this family are
similar to the coiled-coil transcriptional coactivator
protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
This protein binds to a highly conserved N-terminal
domain of p160 coactivators, such as GRIP1, and thus
enhances transcriptional activation by a number of
nuclear receptors. CALCOCO1 has a central coiled-coil
region with three leucine zipper motifs, which is
required for its interaction with GRIP1 and may regulate
the autonomous transcriptional activation activity of
the C-terminal region.
Length = 546
Score = 29.6 bits (66), Expect = 1.9
Identities = 23/104 (22%), Positives = 47/104 (45%), Gaps = 3/104 (2%)
Query: 169 EERQKLEEERERLYQ---QLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQG 225
ER L +R Q +L++ + + Q+ Q +++ E + E + + + +Q
Sbjct: 206 AERNALLAQRAENQQRILELEQDIQTLTQKKQENDRVLEGTQDIEAELERMKGELKQRLK 265
Query: 226 EMTRLTQENESAKEEVKEVLQALEEPTRTLLNFFRQSSPKGTLL 269
+MT ++ E+ + +++ + L E RTL S K LL
Sbjct: 266 KMTIQRRDEETERIDLQLENEQLHEDLRTLQERLESSQQKAGLL 309
>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 265
Score = 29.3 bits (66), Expect = 2.0
Identities = 14/46 (30%), Positives = 27/46 (58%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELIT 214
E + L+ + E + ++DE +EI+Q +KL+++I E +E I
Sbjct: 52 NEIESLDNQIEEIQSKIDELQKEIDQSKAEIKKLQKEIAELKENIV 97
>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
PFL_4705 family. Members of this protein family are
found occasionally on plasmids such as the Pseudomonas
putida toluene catabolic TOL plasmid pWWO_p085. Usually,
however, they are found on the bacterial main chromosome
in regions flanked by markers of conjugative transfer
and/or transposition [Mobile and extrachromosomal
element functions, Plasmid functions].
Length = 472
Score = 29.6 bits (67), Expect = 2.0
Identities = 15/72 (20%), Positives = 31/72 (43%), Gaps = 1/72 (1%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQ-YAEKLKEQIMEQEELITTARRDYEQLQGEM 227
E + L+ E ERL ++ D++I Q Q ++L ++I + + + +QLQ +
Sbjct: 80 SENEALKAENERLQKREQSIDQQIQQAVQSETQELTKEIEQLKSERQQLQGLIDQLQRRL 139
Query: 228 TRLTQENESAKE 239
+
Sbjct: 140 AGVLTGPSGGGS 151
>gnl|CDD|215505 PLN02935, PLN02935, Bifunctional NADH kinase/NAD(+) kinase.
Length = 508
Score = 29.8 bits (67), Expect = 2.0
Identities = 11/49 (22%), Positives = 19/49 (38%)
Query: 86 AEEWKRRYEKEKEKSSRYKGKLEKLELELSRWRNGETVKEDEQVNLADP 134
A EWKR+YE E+ ++ + + K + Q L +
Sbjct: 64 AAEWKRKYELERARNQQLEHKELSSGECNEESNDQRLENLANQPMLYNE 112
>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 140
Score = 28.7 bits (65), Expect = 2.1
Identities = 12/85 (14%), Positives = 42/85 (49%), Gaps = 7/85 (8%)
Query: 171 RQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRL 230
+++LE+E ++ +L++ ++E+ + + +K + + ++ ++ E R
Sbjct: 20 QKQLEKEFKKRQAELEKLEKELQKLKEKLQK-DAATLSEAA-REKKEKELQKKVQEFQRK 77
Query: 231 TQE-----NESAKEEVKEVLQALEE 250
Q+ + +EE++++L + +
Sbjct: 78 QQKLQQDLQKRQQEELQKILDKINK 102
>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD. This
model describes the DndB protein encoded by an operon
associated with a sulfur-containing modification to DNA.
The operon is sporadically distributed in bacteria, much
like some restriction enzyme operons. DndD is described
as a putative ATPase. The small number of examples known
so far include species from among the Firmicutes,
Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
metabolism, Restriction/modification].
Length = 650
Score = 29.7 bits (67), Expect = 2.2
Identities = 28/151 (18%), Positives = 59/151 (39%), Gaps = 15/151 (9%)
Query: 107 LEKLELELSRWRNGETVKEDEQVNLADPVDMAASIAPTPESAPASILPAIPGSGLMLGSL 166
LE+ + EL + + +A ++A P + + I + GS L +
Sbjct: 325 LEERDKELLESLPKLALPAEHVKEIAA--ELAEIDKPA--TTDSEIPHRLSGSEL---TQ 377
Query: 167 SNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGE 226
Q+++ E + QL ++ E+ ++ +K I +E+ QL E
Sbjct: 378 LEVLIQQVKRELQDAKSQLLKELRELEEELAEVDKKISTIPSEEQ--------IAQLLEE 429
Query: 227 MTRLTQENESAKEEVKEVLQALEEPTRTLLN 257
+ E ++ E++E+L+ LE +
Sbjct: 430 LGEAQNELFRSEAEIEELLRQLETLKEAIEA 460
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 29.3 bits (66), Expect = 2.3
Identities = 19/92 (20%), Positives = 40/92 (43%), Gaps = 9/92 (9%)
Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTAR---------R 218
+ +K + E E L + L+ + E+ +L+ +I E E I A R
Sbjct: 30 RKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAVKDER 89
Query: 219 DYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
+ L E+ + S ++E+ E+++ +E+
Sbjct: 90 ELRALNIEIQIAKERINSLEDELAELMEEIEK 121
Score = 28.9 bits (65), Expect = 2.6
Identities = 21/83 (25%), Positives = 40/83 (48%), Gaps = 3/83 (3%)
Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEM 227
N E Q +E L +L E EEI + + E LKE++ E+ + A +L+ E+
Sbjct: 95 NIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAEAE---ARLEEEV 151
Query: 228 TRLTQENESAKEEVKEVLQALEE 250
+ +E + + +E+ + L+
Sbjct: 152 AEIREEGQELSSKREELKEKLDP 174
Score = 27.7 bits (62), Expect = 5.7
Identities = 19/85 (22%), Positives = 39/85 (45%), Gaps = 2/85 (2%)
Query: 168 NEERQKLEEERERLYQQLDE--KDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQG 225
E Q++ E +R ++L + E+ + + KE+I E+ + + E+L+
Sbjct: 65 ESEIQEIRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAELMEEIEKLEK 124
Query: 226 EMTRLTQENESAKEEVKEVLQALEE 250
E+ L + E ++ + E LEE
Sbjct: 125 EIEDLKERLERLEKNLAEAEARLEE 149
>gnl|CDD|225803 COG3264, COG3264, Small-conductance mechanosensitive channel [Cell
envelope biogenesis, outer membrane].
Length = 835
Score = 29.7 bits (67), Expect = 2.3
Identities = 12/51 (23%), Positives = 25/51 (49%), Gaps = 4/51 (7%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRD 219
+E+ ++ +ER+ L + IN AE+L ++ + + I RR+
Sbjct: 136 QEQFEVTQERDALQAE----KAYINALEGQAEQLTAEVRDILDQILDTRRE 182
Score = 28.5 bits (64), Expect = 4.9
Identities = 23/94 (24%), Positives = 43/94 (45%), Gaps = 1/94 (1%)
Query: 163 LGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQ 222
L L K +EE E ++ + + E I Q+ ++L + + +Q E + D Q
Sbjct: 31 LQLLQEAVNSKRQEEAEPAAEEAELQAELIQQELAINDQLSQALNQQTERLNALASDDRQ 90
Query: 223 LQGEMTRLTQENESAKEEVKEVLQALEEPTRTLL 256
L + +L Q + + +E+ VL+ +R LL
Sbjct: 91 LANLLLQLLQSSRTIREQ-IAVLRGSLLLSRILL 123
>gnl|CDD|237551 PRK13902, alaS, alanyl-tRNA synthetase; Provisional.
Length = 900
Score = 29.4 bits (67), Expect = 2.4
Identities = 15/75 (20%), Positives = 37/75 (49%), Gaps = 8/75 (10%)
Query: 171 RQKLEEERERLYQQLDEKDEEINQ-QSQYAEKLKEQIMEQEE-------LITTARRDYEQ 222
++L + ER +++ E+ +EI + + + AE L +++ + E ++ D ++
Sbjct: 756 PEQLPKTVERFFEEWKEQKKEIEKLRKELAELLASELLSKAEEVGGVKVVVEEVDADEDE 815
Query: 223 LQGEMTRLTQENESA 237
L+ L +E + A
Sbjct: 816 LRKIANELAKEGKVA 830
>gnl|CDD|234038 TIGR02865, spore_II_E, stage II sporulation protein E. Stage II
sporulation protein E (SpoIIE) is a multiple membrane
spanning protein with two separable functions. It plays
a role in the switch to polar cell division during
sporulation. By means of it protein phosphatase
activity, located in the C-terminal region, it activates
sigma-F. All proteins that score above the trusted
cutoff to this model are found in endospore-forming
Gram-positive bacteria. Surprisingly, a sequence from
the Cyanobacterium-like (and presumably
non-spore-forming) photosynthesizer Heliobacillus
mobilis is homologous, and scores between the trusted
and noise cutoffs [Cellular processes, Sporulation and
germination].
Length = 764
Score = 29.3 bits (66), Expect = 2.6
Identities = 23/98 (23%), Positives = 43/98 (43%), Gaps = 12/98 (12%)
Query: 171 RQKLEEERERLYQQL-------DEKDEEINQQSQYAEKLKEQIME--QEELITTARRD-Y 220
R++LEE R + +QL ++ +EIN + + + L+E+I+ + I Y
Sbjct: 428 RKRLEEGRRLVAEQLKGVAESVEDIAKEINLEIVFHQLLEEKIIRALNKNGIPYEDVLAY 487
Query: 221 EQLQGEMT-RLTQENESAKEE-VKEVLQALEEPTRTLL 256
G + LT + E K++ + E T L+
Sbjct: 488 NTEGGNIDVELTIAACGGRGECEKKIAPIISEVTGELM 525
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 28.7 bits (65), Expect = 2.7
Identities = 17/65 (26%), Positives = 38/65 (58%), Gaps = 5/65 (7%)
Query: 169 EERQKLEEERERLYQQ---LDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQG 225
+E +KLEEE+E L ++ L+ K E I ++ + +++E+ E I ++ +QL+
Sbjct: 127 QEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADE--IAFLKKQNQQLKS 184
Query: 226 EMTRL 230
++ ++
Sbjct: 185 QLEQI 189
>gnl|CDD|234368 TIGR03835, termin_org_DnaJ, terminal organelle assembly protein
TopJ. This model describes TopJ (MG_200, CbpA), a DnaJ
homolog and probable assembly protein of the Mycoplasma
terminal organelle. The terminal organelle is involved
in both cytadherence and gliding motility [Cellular
processes, Chemotaxis and motility].
Length = 871
Score = 29.4 bits (65), Expect = 2.7
Identities = 45/201 (22%), Positives = 67/201 (33%), Gaps = 24/201 (11%)
Query: 86 AEEWKRRYEKEKEKSSRYKGKLEKLELELSRWRNGETVKEDEQVNLADPVDM--AASIAP 143
W E E + S + E+ S KE EQ +PVD A
Sbjct: 231 DGNWVWTRELEPQDDSEDDYVIPDAEIISSPTLEVTAPKEVEQPLQPEPVDEETVAETKA 290
Query: 144 TPESAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLK 203
E P + P S + EE + + Q E + K K
Sbjct: 291 EEEPQPTQTVETKPTSAP---ESTVEENLPEINQPTQAVQPTSETISTTPVEPTDQLKPK 347
Query: 204 EQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKEVL------------------ 245
E QEEL T + E+L + L + N EE+K L
Sbjct: 348 EVDQIQEELKKTKEIEVEELPTKKNDLVEINFDDLEELKFELVQTNQEKEPEKAVENWAT 407
Query: 246 -QALEEPTRTLLNFFRQSSPK 265
L+EPT++ +++++Q PK
Sbjct: 408 DYQLDEPTQSNIDWYKQEDPK 428
>gnl|CDD|233327 TIGR01241, FtsH_fam, ATP-dependent metalloprotease FtsH.
HflB(FtsH) is a pleiotropic protein required for correct
cell division in bacteria. It has ATP-dependent zinc
metalloprotease activity. It was formerly designated
cell division protein FtsH [Cellular processes, Cell
division, Protein fate, Degradation of proteins,
peptides, and glycopeptides].
Length = 495
Score = 29.2 bits (66), Expect = 2.8
Identities = 13/40 (32%), Positives = 21/40 (52%), Gaps = 8/40 (20%)
Query: 234 NESAKEEVKEVLQALEEPTRTLLNFFRQ---SSPKGTLLY 270
+ AKEE+ E++ L+ P++ F + PKG LL
Sbjct: 60 IDEAKEELMEIVDFLKNPSK-----FTKLGAKIPKGVLLV 94
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 29.1 bits (65), Expect = 2.9
Identities = 16/72 (22%), Positives = 37/72 (51%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
E+ ++L++E+E L ++L+E + E + + ++L+ + EE++ + L+
Sbjct: 142 EKLEELQKEKEELLKELEELEAEYEEVQERLKRLEVENSRLEEMLKKLPGEVYDLKKRWD 201
Query: 229 RLTQENESAKEE 240
L E +EE
Sbjct: 202 ELEPGVELPEEE 213
>gnl|CDD|149160 pfam07926, TPR_MLP1_2, TPR/MLP1/MLP2-like protein. The sequences
featured in this family are similar to a region of human
TPR protein and to yeast myosin-like proteins 1 (MLP1)
and 2 (MLP2). These proteins share a number of features;
for example, they all have coiled-coil regions and all
three are associated with nuclear pores. TPR is thought
to be a component of nuclear pore complex- attached
intra-nuclear filaments, and is implicated in nuclear
protein import. Moreover, its N-terminal region is
involved in the activation of oncogenic kinases,
possibly by mediating the dimerisation of kinase domains
or by targeting these kinases to the nuclear pore
complex. MLP1 and MLP2 are involved in the process of
telomere length regulation, where they are thought to
interact with proteins such as Tel1p and modulate their
activity.
Length = 132
Score = 28.0 bits (63), Expect = 3.0
Identities = 21/80 (26%), Positives = 36/80 (45%), Gaps = 3/80 (3%)
Query: 174 LEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQ---LQGEMTRL 230
LEEE+ L +L E+ + +KL+E + EQ E+ A++ YE+ E
Sbjct: 1 LEEEKSSLQAELKRLLEKKEDAEEKIQKLQEDLEEQAEIANEAQQKYERELVKHAEDIEE 60
Query: 231 TQENESAKEEVKEVLQALEE 250
Q E+K+ + L+
Sbjct: 61 LQALRKQLNELKKEIAQLKA 80
Score = 26.9 bits (60), Expect = 7.8
Identities = 21/89 (23%), Positives = 50/89 (56%), Gaps = 7/89 (7%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYA----EKLKEQIMEQEEL---ITTARRDYE 221
E ++L E++E +++ + E++ +Q++ A +K + ++++ E + R+
Sbjct: 10 AELKRLLEKKEDAEEKIQKLQEDLEEQAEIANEAQQKYERELVKHAEDIEELQALRKQLN 69
Query: 222 QLQGEMTRLTQENESAKEEVKEVLQALEE 250
+L+ E+ +L E ESA+ E+ E ++ EE
Sbjct: 70 ELKKEIAQLKAEAESAQAELSEAEESWEE 98
>gnl|CDD|129807 TIGR00724, urea_amlyse_rel, biotin-dependent carboxylase
uncharacterized domain. Urea amidolyase of
Saccharomyces cerevisiae is a 1835 amino acid protein
with an amidase domain, a biotin/lipoyl cofactor
attachment domain, a carbamoyl-phosphate synthase L
chain-like domain, and uncharacterized regions. It has
both urea carboxylase and allophanate hydrolase
activities. This model models a domain that represents
uncharacterized prokaryotic proteins of about 300 amino
acids, regions of prokaryotic urea carboxylase and of
the urea carboxylase region of yeast urea amidolyase,
and regions of other biotin-containing proteins [Unknown
function, General].
Length = 314
Score = 29.0 bits (65), Expect = 3.0
Identities = 32/121 (26%), Positives = 44/121 (36%), Gaps = 23/121 (19%)
Query: 82 EELTAEEWKRRYEKEKEKSSRYKGKLEKLELELSRWRNGETVKED-----EQVNL-ADPV 135
E EWK S R +L+ +L+ N E + QV P+
Sbjct: 194 EAFWRSEWKVSSN-----SDRMGYRLQGPKLKH-ARPNRELLTHGIVYGSIQVPPNGQPI 247
Query: 136 DMAASIAPTPESAP--ASI----LPAI----PGSGLMLGSLSNEERQKLEEERERLYQQL 185
+ A A T P A + L + PG + LS EE KL E +ER +QL
Sbjct: 248 ILMAD-AQTTGGYPKIAVVIEADLWKVAQVRPGQSIKFVPLSLEEALKLRESQERYIKQL 306
Query: 186 D 186
Sbjct: 307 R 307
>gnl|CDD|233588 TIGR01823, PabB-fungal, aminodeoxychorismate synthase, fungal
clade. This model represents the fungal clade of a
para-aminobenzoate synthesis enzyme,
aminodeoxychorismate synthase, which acts on chorismate
in a pathway that yields PABA, a precursor of folate.
Length = 742
Score = 29.1 bits (65), Expect = 3.1
Identities = 12/52 (23%), Positives = 20/52 (38%), Gaps = 4/52 (7%)
Query: 47 IIICCSPASFNESETKSTLDFGRRAKTIKNVVTVNEELTAEEWKRRYEKEKE 98
I + SP F E + + IK V ++ E+ +R + KE
Sbjct: 534 IFLSTSPEKFLEVGMDTHAKL----RPIKGTVKKGPQMNLEKARRILKTPKE 581
>gnl|CDD|240169 cd05146, RIO3_euk, RIO kinase family; eukaryotic RIO3, catalytic
domain. The RIO kinase catalytic domain family is part
of a larger superfamily, that includes the catalytic
domains of other kinases such as the typical
serine/threonine/tyrosine protein kinases (PKs),
aminoglycoside phosphotransferase, choline kinase, and
phosphoinositide 3-kinase (PI3K). RIO kinases are
atypical protein serine kinases containing a kinase
catalytic signature, but otherwise show very little
sequence similarity to typical PKs. Serine kinases
catalyze the transfer of the gamma-phosphoryl group from
ATP to serine residues in protein substrates. The RIO
catalytic domain is truncated compared to the catalytic
domains of typical PKs, with deletions of the loops
responsible for substrate binding. RIO3 is present only
in multicellular eukaryotes. Its function is still
unknown.
Length = 197
Score = 28.6 bits (64), Expect = 3.1
Identities = 12/42 (28%), Positives = 19/42 (45%), Gaps = 4/42 (9%)
Query: 192 INQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQE 233
I A KLK+ + EE+ + Y Q+ M +L +E
Sbjct: 106 IGDDQVPAPKLKDAKLNDEEM----KNAYYQVLSMMKQLYKE 143
>gnl|CDD|226873 COG4467, COG4467, Regulator of replication initiation timing
[Replication, recombination, and repair].
Length = 114
Score = 27.8 bits (62), Expect = 3.2
Identities = 16/73 (21%), Positives = 34/73 (46%)
Query: 177 ERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENES 236
+++ ++ Q+D +E++ LK+ + E T R + E+L+ + T E +
Sbjct: 2 DKKEIFDQVDNLEEQLGVLLAELGGLKQHLGSLVEENTALRLENEKLRERLGEPTLEKTA 61
Query: 237 AKEEVKEVLQALE 249
K+E V + E
Sbjct: 62 VKKEKPAVKKKGE 74
>gnl|CDD|225747 COG3206, GumC, Uncharacterized protein involved in
exopolysaccharide biosynthesis [Cell envelope
biogenesis, outer membrane].
Length = 458
Score = 28.9 bits (65), Expect = 3.3
Identities = 14/80 (17%), Positives = 37/80 (46%)
Query: 171 RQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRL 230
+L E R+++ +L + + + E+ + + ++ + +LQ ++ L
Sbjct: 319 EAQLAELRQQIAAELRQILASLPNELALLEQQEAALEKELAQLKGRLSKLPKLQVQLREL 378
Query: 231 TQENESAKEEVKEVLQALEE 250
+E E+A+ + +LQ +E
Sbjct: 379 EREAEAARSLYETLLQRYQE 398
>gnl|CDD|218434 pfam05103, DivIVA, DivIVA protein. The Bacillus subtilis divIVA1
mutation causes misplacement of the septum during cell
division, resulting in the formation of small, circular,
anucleate mini-cells. Inactivation of divIVA produces a
mini-cell phenotype, whereas overproduction of DivIVA
results in a filamentation phenotype. These proteins
appear to contain coiled-coils.
Length = 131
Score = 28.0 bits (63), Expect = 3.4
Identities = 13/39 (33%), Positives = 22/39 (56%)
Query: 218 RDYEQLQGEMTRLTQENESAKEEVKEVLQALEEPTRTLL 256
+DYE L E L +E E +E+++E + E +TL+
Sbjct: 32 KDYEALYKENEELKEEIERLEEKLEEYKELEETLQKTLV 70
>gnl|CDD|222707 pfam14357, DUF4404, Domain of unknown function (DUF4404). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are approximately 90 amino acids in length.
There are two completely conserved residues (P and G)
that may be functionally important.
Length = 85
Score = 27.2 bits (61), Expect = 3.4
Identities = 13/47 (27%), Positives = 20/47 (42%)
Query: 165 SLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEE 211
SL E R L E + + QL E+D + A++L + E
Sbjct: 18 SLDEETRALLSELIDDIEAQLAEEDATEAEDESLADRLNLAVERFEA 64
>gnl|CDD|224755 COG1842, PspA, Phage shock protein A (IM30), suppresses
sigma54-dependent transcription [Transcription / Signal
transduction mechanisms].
Length = 225
Score = 28.4 bits (64), Expect = 3.4
Identities = 18/70 (25%), Positives = 31/70 (44%), Gaps = 5/70 (7%)
Query: 176 EERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENE 235
EE++ L + E+ Q + EKLK+Q+ E+ I R + E + +
Sbjct: 92 EEKQSLEDLAKALEAELQQAEEQVEKLKKQLAALEQKIAELRA-----KKEALKARKAAA 146
Query: 236 SAKEEVKEVL 245
A+E+V L
Sbjct: 147 KAQEKVNRSL 156
>gnl|CDD|241262 cd01228, PH_BCR-related, Breakpoint Cluster Region-related
pleckstrin homology (PH) domain. The BCR gene is one of
the two genes in the BCR-ABL complex, which is
associated with the Philadelphia chromosome, a product
of a reciprocal translocation between chromosomes 22 and
9. BCR is a GTPase-activating protein (GAP) for RAC1
(primarily) and CDC42. The Dbl region of BCR has the
most RhoGEF activity for Cdc42, and less activity
towards Rac and Rho. Since BCR possesses both GAP and
GEF activities, it may function to temporally regulate
the activity of these GTPases. It also displays
serine/threonine kinase activity. The BCR protein
contains multiple domains including an N-terminal kinase
domain, a RhoGEF domain, a PH domain, a C1 domain, a C2
domain, and a C-terminal RhoGAP domain. ABR, a related
smaller protein, is structurally similar to BCR, but
lacks the N-terminal kinase domain and has GAP activity
for both Rac and Cdc42. PH domains have diverse
functions, but in general are involved in targeting
proteins to the appropriate cellular location or in the
interaction with a binding partner. They share little
sequence conservation, but all have a common fold, which
is electrostatically polarized. Less than 10% of PH
domains bind phosphoinositide phosphates (PIPs) with
high affinity and specificity. PH domains are
distinguished from other PIP-binding domains by their
specific high-affinity binding to PIPs with two vicinal
phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
PtdIns(3,4,5)P3 which results in targeting some PH
domain proteins to the plasma membrane. A few display
strong specificity in lipid binding. Any specificity is
usually determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved across
all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 179
Score = 28.1 bits (63), Expect = 3.5
Identities = 15/77 (19%), Positives = 31/77 (40%), Gaps = 6/77 (7%)
Query: 140 SIAPTPESAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYA 199
S+ P ES A + L+ + ++ + E+ + + S+
Sbjct: 70 SLHPKDESEAAPDVVPKSSEELVQ--MKKAISALKQKI----KRDSAERKKAASSGSRAI 123
Query: 200 EKLKEQIMEQEELITTA 216
EKL++++ EQE + A
Sbjct: 124 EKLRKKLAEQEAALLLA 140
>gnl|CDD|217135 pfam02601, Exonuc_VII_L, Exonuclease VII, large subunit. This
family consist of exonuclease VII, large subunit
EC:3.1.11.6 This enzyme catalyzes exonucleolytic
cleavage in either 5'->3' or 3'->5' direction to yield
5'-phosphomononucleotides. This exonuclease VII enzyme
is composed of one large subunit and 4 small ones.
Length = 295
Score = 28.8 bits (65), Expect = 3.6
Identities = 30/134 (22%), Positives = 49/134 (36%), Gaps = 25/134 (18%)
Query: 136 DMAASI-APTPESAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEINQ 194
D+ A + A TP +A ++P E QKLE +RL + L + E+
Sbjct: 124 DLVADVRAATPTAAAELLVPD-----------RTELLQKLEGLEQRLSRALKNRLEKEQD 172
Query: 195 Q-SQYAEKLKEQIMEQ------------EELITTARRDYEQLQGEMTRLTQENESAKEEV 241
+ + E+LK + LI++ + + Q + RL E K +
Sbjct: 173 RLNLLRERLKSLSRRLLEQHEERLAELRDRLISSIQNLLSRKQSRLERLILNRELEKNSL 232
Query: 242 KEVLQALEEPTRTL 255
E A T L
Sbjct: 233 LENKLATANLTAQL 246
>gnl|CDD|149105 pfam07851, TMPIT, TMPIT-like protein. A number of members of this
family are annotated as being transmembrane proteins
induced by tumour necrosis factor alpha, but no
literature was found to support this.
Length = 330
Score = 28.6 bits (64), Expect = 3.6
Identities = 34/111 (30%), Positives = 51/111 (45%), Gaps = 13/111 (11%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
EE ++LE+E +QQL E Q+ + EKL+EQ I R+ +QL +
Sbjct: 4 EEWKELEKE----FQQLQETHRLYKQKLEEVEKLQEQCTSA---IARQRKRLKQLIVSLK 56
Query: 229 RLTQENESAKEEVKEVLQALEEPTRTLLNFFR---QSSPKGTLLYNHYLLG 276
+L +S E E+++ LEE + N F PK LY +LG
Sbjct: 57 KL---KKSLTPEDSELVEQLEEQIKERKNQFFDMESFLPKKNGLYLRLVLG 104
>gnl|CDD|215364 PLN02678, PLN02678, seryl-tRNA synthetase.
Length = 448
Score = 28.9 bits (65), Expect = 3.6
Identities = 25/90 (27%), Positives = 41/90 (45%), Gaps = 12/90 (13%)
Query: 175 EEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQEN 234
+E R+R ++ LD +E N+ ++ KLK A+ D +L E L +E
Sbjct: 40 KEWRQRQFE-LDSLRKEFNKLNKEVAKLK-----------IAKEDATELIAETKELKKEI 87
Query: 235 ESAKEEVKEVLQALEEPTRTLLNFFRQSSP 264
+ EV+E AL+ +T+ N S P
Sbjct: 88 TEKEAEVQEAKAALDAKLKTIGNLVHDSVP 117
>gnl|CDD|236842 PRK11091, PRK11091, aerobic respiration control sensor protein
ArcB; Provisional.
Length = 779
Score = 28.8 bits (65), Expect = 3.7
Identities = 12/37 (32%), Positives = 22/37 (59%), Gaps = 3/37 (8%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQ 205
ER+K EE R+ ++QL EI ++ + +L++Q
Sbjct: 120 AEREKAEEARQEAFEQL---KNEIKEREETQIELEQQ 153
Score = 28.4 bits (64), Expect = 5.3
Identities = 25/88 (28%), Positives = 37/88 (42%), Gaps = 25/88 (28%)
Query: 166 LSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQG 225
LS Q LEE R+RL S+ KL+E M + +L QL+
Sbjct: 73 LSVVVEQ-LEESRQRL--------------SRLVAKLEE--MRERDLELN-----VQLKD 110
Query: 226 EMTRLTQE---NESAKEEVKEVLQALEE 250
+ +L QE E A+E +E + L+
Sbjct: 111 NIAQLNQEIAEREKAEEARQEAFEQLKN 138
>gnl|CDD|198290 cd03181, GST_C_EF1Bgamma_like, Glutathione S-transferase
C-terminal-like, alpha helical domain of the Gamma
subunit of Elongation Factor 1B and similar proteins.
Glutathione S-transferase (GST) C-terminal domain
family, Gamma subunit of Elongation Factor 1B
(EF1Bgamma) subfamily; EF1Bgamma is part of the
eukaryotic translation elongation factor-1 (EF1) complex
which plays a central role in the elongation cycle
during protein biosynthesis. EF1 consists of two
functionally distinct units, EF1A and EF1B. EF1A
catalyzes the GTP-dependent binding of aminoacyl-tRNA to
the ribosomal A site concomitant with the hydrolysis of
GTP. The resulting inactive EF1A:GDP complex is recycled
to the active GTP form by the guanine-nucleotide
exchange factor EF1B, a complex composed of at least two
subunits, alpha and gamma. Metazoan EFB1 contain a third
subunit, beta. The EF1B gamma subunit contains a GST
fold consisting of an N-terminal thioredoxin-fold domain
and a C-terminal alpha helical domain. The GST-like
domain of EF1Bgamma is believed to mediate the
dimerization of the EF1 complex, which in yeast is a
dimer of the heterotrimer EF1A:EF1Balpha:EF1Bgamma. In
addition to its role in protein biosynthesis, EF1Bgamma
may also display other functions. The recombinant rice
protein has been shown to possess GSH conjugating
activity. The yeast EF1Bgamma binds to membranes in a
calcium dependent manner and is also part of a complex
that binds to the msrA (methionine sulfoxide reductase)
promoter suggesting a function in the regulation of its
gene expression. Also included in this subfamily is the
GST_C-like domain at the N-terminus of human valyl-tRNA
synthetase (ValRS) and its homologs. Metazoan ValRS
forms a stable complex with Elongation Factor-1H
(EF-1H), and together, they catalyze consecutive steps
in protein biosynthesis, tRNA aminoacylation and its
transfer to EF.
Length = 123
Score = 27.5 bits (62), Expect = 3.8
Identities = 12/27 (44%), Positives = 16/27 (59%), Gaps = 2/27 (7%)
Query: 232 QENESAKEEVKEVLQALEEP--TRTLL 256
+ + AKE++K L LEE TRT L
Sbjct: 36 KAVDKAKEDLKRALGVLEEHLLTRTYL 62
>gnl|CDD|224644 COG1730, GIM5, Predicted prefoldin, molecular chaperone implicated
in de novo protein folding [Posttranslational
modification, protein turnover, chaperones].
Length = 145
Score = 27.7 bits (62), Expect = 3.8
Identities = 12/44 (27%), Positives = 25/44 (56%)
Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEE 211
+ ++LE+ E+L Q L E + I Q Q A++L+++ ++
Sbjct: 100 KKRIEELEKAIEKLQQALAELAQRIEQLEQEAQQLQQKQAAAQK 143
>gnl|CDD|205692 pfam13514, AAA_27, AAA domain. This domain is found in a number of
double-strand DNA break proteins. This domain contains a
P-loop motif.
Length = 1118
Score = 29.0 bits (65), Expect = 3.9
Identities = 35/187 (18%), Positives = 61/187 (32%), Gaps = 30/187 (16%)
Query: 85 TAEEWKRRYEKEKEKSSRYKGK-------LEKLELELSRWRNGETVKEDEQVNLA----- 132
+ + + E+ + + + + LE LEL+L V +E+V A
Sbjct: 453 QVQAFLKEAEEIMQAKRQKRDRLLELEEDLETLELQLRHLDAAGAVPTEEEVAAARARRD 512
Query: 133 ----DPVDMAASIAPTPESAPASILPAIPGSGLMLGSLS-----NEERQKLEEERERLYQ 183
D A + PT + ++ A + +L RQ+ E R RL Q
Sbjct: 513 ALWQDIRRGYALLEPTASAYEEAVESADQLADRLLREAQLVGELQSLRQQEEAARRRLEQ 572
Query: 184 QLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKE 243
E + + E + Q EM E +A+E+V+
Sbjct: 573 LEKELEVLELALAALREAWQAQWAA---------AGLPLTPAEMEDWLAERATAREQVRA 623
Query: 244 VLQALEE 250
+A E
Sbjct: 624 YFKARAE 630
>gnl|CDD|113902 pfam05149, Flagellar_rod, Paraflagellar rod protein. This family
consists of several eukaryotic paraflagellar rod
component proteins. The eukaryotic flagellum represents
one of the most complex macromolecular structures found
in any organism and contains more than 250 proteins. In
addition to its locomotive role, the flagellum is
probably involved in nutrient uptake since receptors for
host low-density lipoproteins are localised on the
flagellar membrane as well as on the flagellar pocket
membrane.
Length = 289
Score = 28.5 bits (64), Expect = 3.9
Identities = 15/77 (19%), Positives = 33/77 (42%)
Query: 179 ERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAK 238
+ + ++ K + + + E+ KE+ E++ R+ Y + E + Q+N +
Sbjct: 2 KDISAVIEAKHRQKSACEEDLERCKEKREEEDAADAKQRKRYSAQRRESEKFLQQNVEQQ 61
Query: 239 EEVKEVLQALEEPTRTL 255
+ +Q LE R L
Sbjct: 62 QACWRAIQELERELRDL 78
>gnl|CDD|212684 cd11734, Ssq1_like_NBD, Nucleotide-binding domain of Saccharomyces
cerevisiae Ssq1 and similar proteins. Ssq1p (also
called Stress-seventy subfamily Q protein 1, Ssc2p,
Ssh1p, mtHSP70 homolog) belongs to the heat shock
protein 70 (HSP70) family of chaperones that assist in
protein folding and assembly, and can direct incompetent
"client" proteins towards degradation. Typically, HSP70s
have a nucleotide-binding domain (NBD) and a
substrate-binding domain (SBD). The nucleotide sits in a
deep cleft formed between the two lobes of the NBD. The
two subdomains of each lobe change conformation between
ATP-bound, ADP-bound, and nucleotide-free states. ATP
binding opens up the substrate-binding site;
substrate-binding increases the rate of ATP hydrolysis.
Hsp70 chaperone activity is regulated by various
co-chaperones: J-domain proteins and nucleotide exchange
factors (NEFs). S. cerevisiae Ssq1p is a mitochondrial
chaperone that is involved in iron-sulfur (Fe/S) center
biogenesis. Ssq1p plays a role in the maturation of
Yfh1p, a nucleus-encoded mitochondrial protein involved
in iron homeostasis (and a homolog of human frataxin,
implicated in the neurodegenerative disease,
Friedreich's ataxia).
Length = 373
Score = 28.6 bits (64), Expect = 3.9
Identities = 18/58 (31%), Positives = 27/58 (46%), Gaps = 4/58 (6%)
Query: 29 SKLTRILQESLGGNARTTIIICCSPASFNESETKSTLDFGRRAKTIKNVVTVNEELTA 86
KL + + LG +I PA FN+S+ ++T D G A V+ + E TA
Sbjct: 120 KKLKKTAEAYLGKRVDEAVITV--PAYFNDSQRQATKDAGTLAGL--KVLRIINEPTA 173
>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase. This model represents the
seryl-tRNA synthetase found in most organisms. This
protein is a class II tRNA synthetase, and is recognized
by the pfam model tRNA-synt_2b. The seryl-tRNA
synthetases of two archaeal species, Methanococcus
jannaschii and Methanobacterium thermoautotrophicum,
differ considerably and are included in a different
model [Protein synthesis, tRNA aminoacylation].
Length = 418
Score = 28.5 bits (64), Expect = 3.9
Identities = 10/68 (14%), Positives = 41/68 (60%), Gaps = 1/68 (1%)
Query: 179 ERLYQQLDEKDEEINQ-QSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESA 237
+ ++L + EE+ +++ ++++ + ++++ I +++ ++L+ E+T L+ ++
Sbjct: 36 DDERKKLLSEIEELQAKRNELSKQIGKAKGQKKDKIEEIKKELKELKEELTELSAALKAL 95
Query: 238 KEEVKEVL 245
+ E+++ L
Sbjct: 96 EAELQDKL 103
Score = 28.1 bits (63), Expect = 6.4
Identities = 19/115 (16%), Positives = 44/115 (38%), Gaps = 18/115 (15%)
Query: 171 RQKLEEERERLYQQLDEK----DEEINQQSQYAEKLKEQIMEQEEL-------------- 212
R+ L + + + L + D ++ + ++ K+ + E EEL
Sbjct: 4 RKLLRNNPDLVKESLKARGLSVDIDLEKLIALDDERKKLLSEIEELQAKRNELSKQIGKA 63
Query: 213 ITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEEPTRTLLNFFRQSSPKGT 267
+ E+++ E+ L +E +K + L++ ++ N +S P G
Sbjct: 64 KGQKKDKIEEIKKELKELKEELTELSAALKALEAELQDKLLSIPNIPHESVPVGK 118
>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family. This family of
proteins contain a band 4.1 domain (pfam00373), at their
amino terminus. This family represents the rest of these
proteins.
Length = 244
Score = 28.6 bits (64), Expect = 4.0
Identities = 20/96 (20%), Positives = 42/96 (43%), Gaps = 7/96 (7%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
EE Q LE++ + L ++ +EE + E+L+ ++ E + +L+ E
Sbjct: 47 EEAQLLEKKADELEEENRRLEEEAAASEEERERLEAEVDEATAEV-------AKLEEERE 99
Query: 229 RLTQENESAKEEVKEVLQALEEPTRTLLNFFRQSSP 264
+ E ++E++E +A E + LL +
Sbjct: 100 KKEAETRQLQQELREAQEAHERARQELLEAAAAPTA 135
>gnl|CDD|223541 COG0465, HflB, ATP-dependent Zn proteases [Posttranslational
modification, protein turnover, chaperones].
Length = 596
Score = 28.8 bits (65), Expect = 4.1
Identities = 12/40 (30%), Positives = 20/40 (50%), Gaps = 6/40 (15%)
Query: 234 NESAKEEVKEVLQALEEPTRTLLNFFRQSS--PKGTLLYN 271
+ AKEE+ E++ L+ P + + + PKG LL
Sbjct: 155 VDEAKEELSELVDFLKNPKK----YQALGAKIPKGVLLVG 190
>gnl|CDD|223910 COG0840, Tar, Methyl-accepting chemotaxis protein [Cell motility
and secretion / Signal transduction mechanisms].
Length = 408
Score = 28.8 bits (64), Expect = 4.1
Identities = 21/95 (22%), Positives = 36/95 (37%), Gaps = 1/95 (1%)
Query: 163 LGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQ 222
E ++ E + L ++ EI + +L +I E T +
Sbjct: 308 AVEHMEESASEVSEGVK-LVEETGSSLGEIAAAIEEVSQLISEIAAATEEQTAVLEEINA 366
Query: 223 LQGEMTRLTQENESAKEEVKEVLQALEEPTRTLLN 257
E+ +TQEN +A EE+ + L+E LL
Sbjct: 367 SIEELDDVTQENAAAVEELAAASEELKELAEKLLE 401
>gnl|CDD|223520 COG0443, DnaK, Molecular chaperone [Posttranslational modification,
protein turnover, chaperones].
Length = 579
Score = 28.8 bits (65), Expect = 4.1
Identities = 17/62 (27%), Positives = 27/62 (43%), Gaps = 1/62 (1%)
Query: 189 DEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQAL 248
DEEI + + AE + EL+ AR + E L + + +E EE KE ++
Sbjct: 485 DEEIERMVEDAEANAALDKKFRELV-EARNEAESLIYSLEKALKEIVKVSEEEKEKIEEA 543
Query: 249 EE 250
Sbjct: 544 IT 545
>gnl|CDD|221919 pfam13094, CENP-Q, CENP-A-nucleosome distal (CAD) centromere
subunit. CENP-Q is one of the components that assembles
onto the CENP-A-nucleosome distal (CAD) centromere. The
centromere, which is the basic element of chromosome
inheritance, is epigenetically determined in mammals.
CENP-A, the centromere-specific histone H3 variant,
assembles an array of nucleosomes and it is this that
seems to be the prime candidate for specifying
centromere identity. CENP-A nucleosomes directly recruit
a proximal CENP-A nucleosome associated complex (NAC)
comprised of CENP-M, CENP-N and CENP-T, CENP-U(50),
CENP-C and CENP-H. Assembly of the CENP-A NAC at
centromeres is dependent on CENP-M, CENP-N and CENP-T.
Additionally, there are seven other subunits which make
up the CENP-A-nucleosome distal (CAD) centromere,
CENP-K, CENP-L, CENP-O, CENP-P, CENP-Q, CENP-R and
CENP-S, also assembling on the CENP-A NAC. Fta7 is the
equivalent component of the fission yeast Sim4 complex.
Length = 159
Score = 28.1 bits (63), Expect = 4.2
Identities = 24/107 (22%), Positives = 44/107 (41%), Gaps = 9/107 (8%)
Query: 154 PAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELI 213
P L L +R LE + +QL+ +EEI ++ EK E + E E+
Sbjct: 13 PPTKEDSLDYDQLLRRKRA-LERQLAAELKQLELLEEEIRREEAELEKDLEYLQELEKNA 71
Query: 214 TTARRDYEQ--------LQGEMTRLTQENESAKEEVKEVLQALEEPT 252
R+ E+ L+ + + +ENE ++ + E+ + P
Sbjct: 72 KALEREREEESKNLHPVLRLLESEVLEENELLQDSLLELSERNFSPN 118
>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC. All proteins in this family for
which functions are known are part of an exonuclease
complex with sbcD homologs. This complex is involved in
the initiation of recombination to regulate the levels
of palindromic sequences in DNA. This family is based on
the phylogenomic analysis of JA Eisen (1999, Ph.D.
Thesis, Stanford University) [DNA metabolism, DNA
replication, recombination, and repair].
Length = 1042
Score = 28.8 bits (64), Expect = 4.2
Identities = 17/92 (18%), Positives = 31/92 (33%), Gaps = 10/92 (10%)
Query: 166 LSNEERQKLEEERERLYQQLDEKDEEINQ-------QSQYAEKLKEQIMEQEELITTARR 218
RQ ++ + +QL E + Q + E+ + E E ++
Sbjct: 673 ELLASRQLALQKMQSEKEQLTYWKEMLAQCQTLLRELETHIEEYDREFNEIENASSSLGS 732
Query: 219 DYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
D L L Q + + + VL+A E
Sbjct: 733 D---LAAREDALNQSLKELMHQARTVLKARTE 761
>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
[Function unknown].
Length = 294
Score = 28.5 bits (64), Expect = 4.2
Identities = 17/75 (22%), Positives = 38/75 (50%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
E Q+ EE +L+++ DE +E ++ + +L ++I E E + + +L+ ++
Sbjct: 186 NEAQEYHEEMIKLFEEADELRKEADELHEEFVELSKKIDELHEEFRNLQNELRELEKKIK 245
Query: 229 RLTQENESAKEEVKE 243
L + ++AK K
Sbjct: 246 ALRAKEKAAKRREKR 260
>gnl|CDD|145958 pfam03084, Sigma_1_2, Reoviral Sigma1/Sigma2 family. Reoviruses
are double-stranded RNA viruses. They lack a membrane
envelope and their capsid is organised in two concentric
icosahedral layers: an inner core and an outer capsid
layer. The sigma1 protein is found in the outer capsid,
and the sigma2 protein is found in the core. There are
four other kinds of protein (besides sigma2) in the
core, termed lambda 1-3, mu2. Interactions between
sigma2 and lambda 1 and lambda 3 are thought to initiate
core formation, followed by mu2 and lambda2. Sigma1 is a
trimeric protein, and is positioned at the 12 vertices
of the icosahedral outer capsid layer. Its N-terminal
fibrous tail, arranged as a triple coiled coil, anchors
it in the virion, and a C-terminal globular head
interacts with the cellular receptor. These two parts
form by separate trimerisation events. The N-terminal
fibrous tail forms on the polysome, without the
involvement of ATP or chaperones. The post-
translational assembly of the C-terminal globular head
involves the chaperone activity of Hsp90, which is
associated with phosphorylation of Hsp90 during the
process. Sigma1 protein acts as a cell attachment
protein, and determines viral virulence, pathways of
spread, and tropism. Junctional adhesion molecule has
been identified as a receptor for sigma1. In type 3
reoviruses, a small region, predicted to form a beta
sheet, in the N-terminal tail was found to bind target
cell surface sialic acid (i.e. sialic acid acts as a
co-receptor) and promote apoptosis. The sigma1 protein
also binds to the lambda2 core protein.
Length = 415
Score = 28.7 bits (64), Expect = 4.3
Identities = 13/50 (26%), Positives = 20/50 (40%), Gaps = 8/50 (16%)
Query: 162 MLGSLSNEERQKLEEE--------RERLYQQLDEKDEEINQQSQYAEKLK 203
G +S + Q+L + R +L E D NQ YA ++K
Sbjct: 343 RAGLISQAKGQQLTQNADDTQQFKRAKLEAIKAEDDALYNQNPNYARRIK 392
>gnl|CDD|240403 PTZ00400, PTZ00400, DnaK-type molecular chaperone; Provisional.
Length = 663
Score = 28.6 bits (64), Expect = 4.3
Identities = 18/67 (26%), Positives = 39/67 (58%), Gaps = 6/67 (8%)
Query: 189 DEEINQQSQYAEKLKEQIMEQEELITTARRDYEQL----QGEMTRLT-QENESAKEEVKE 243
DEEI + + AE+ KEQ E+++ + A+ + E L + +++ L + +++ K+E+K+
Sbjct: 543 DEEIEKMVKEAEEYKEQ-DEKKKELVDAKNEAETLIYSVEKQLSDLKDKISDADKDELKQ 601
Query: 244 VLQALEE 250
+ L
Sbjct: 602 KITKLRS 608
>gnl|CDD|217436 pfam03226, Yippee, Yippee putative zinc-binding protein.
Length = 109
Score = 27.2 bits (61), Expect = 4.4
Identities = 12/24 (50%), Positives = 15/24 (62%), Gaps = 5/24 (20%)
Query: 89 WKRRYEKEKEKSSRYK-GK--LEK 109
WK YE E+S +YK GK LE+
Sbjct: 82 WK--YEFAYEESQKYKEGKFILER 103
>gnl|CDD|217331 pfam03036, Perilipin, Perilipin family. The perilipin family
includes lipid droplet-associated protein (perilipin)
and adipose differentiation-related protein
(adipophilin).
Length = 390
Score = 28.2 bits (63), Expect = 4.8
Identities = 32/112 (28%), Positives = 40/112 (35%), Gaps = 16/112 (14%)
Query: 139 ASIAPTPESAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQY 198
A A E + P + LGSLS KL R R YQ K + Q+ Q
Sbjct: 194 APDATKAEGFEVGSVEQAPSYFVRLGSLSA----KL---RRRAYQHSLAKLRQARQRGQE 246
Query: 199 AEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
A Q+ ELI +AR+ Q +L E KE Q E
Sbjct: 247 AIH---QLPSTAELIESARKGVHQ------KLGGAQEKLKELWLSWSQGQPE 289
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 28.2 bits (63), Expect = 4.9
Identities = 15/52 (28%), Positives = 28/52 (53%), Gaps = 4/52 (7%)
Query: 164 GSLSNEERQKLEEERERLYQQLDEKDEEINQQS----QYAEKLKEQIMEQEE 211
LS ++RQ+L +E + Q +K E+ ++ + K KE+ ++QEE
Sbjct: 360 TDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEE 411
>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein. This family consists
of several eukaryotic mitotic checkpoint (Mitotic arrest
deficient or MAD) proteins. The mitotic spindle
checkpoint monitors proper attachment of the bipolar
spindle to the kinetochores of aligned sister chromatids
and causes a cell cycle arrest in prometaphase when
failures occur. Multiple components of the mitotic
spindle checkpoint have been identified in yeast and
higher eukaryotes. In S.cerevisiae, the existence of a
Mad1-dependent complex containing Mad2, Mad3, Bub3 and
Cdc20 has been demonstrated.
Length = 722
Score = 28.4 bits (63), Expect = 5.0
Identities = 37/171 (21%), Positives = 74/171 (43%), Gaps = 27/171 (15%)
Query: 74 IKNVVTVNEELTAEEWKR-RYEKEKEKSSRYKGKLEKLELELSRWRNGETVKEDEQVNLA 132
++ + EEL + + R+EK +EK + + + EKLE EL W++ + +D +NL
Sbjct: 280 KEDNELLKEELEDLQSRLERFEKMREKLADLELEKEKLENELKSWKS---LLQDIGLNLR 336
Query: 133 DPVDMAASIAPTPESAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEI 192
P D++ I L NEE Q L+E+ + + + +
Sbjct: 337 TPDDLSRRIV----------------------VLQNEELQ-LKEKNGSISSSAKQLETTL 373
Query: 193 NQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKE 243
Q +K +I+E ++ + + +LQ +T +T+E + + +
Sbjct: 374 QQLQLERQKAVSEILELKKKLEALKALVRRLQRRLTLVTKERDGLRAILNS 424
>gnl|CDD|178588 PLN03014, PLN03014, carbonic anhydrase.
Length = 347
Score = 28.2 bits (62), Expect = 5.1
Identities = 15/49 (30%), Positives = 26/49 (53%)
Query: 200 EKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQAL 248
E LK+ ++E+EEL T A EQ+ + T ++ A + V+ + Q
Sbjct: 88 EALKKLLIEKEELKTVAAAKVEQITAALQTGTSSDKKAFDPVETIKQGF 136
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 28.5 bits (64), Expect = 5.1
Identities = 11/38 (28%), Positives = 21/38 (55%)
Query: 82 EELTAEEWKRRYEKEKEKSSRYKGKLEKLELELSRWRN 119
EEL E+ ++ KE+++ + K + EK + EL +
Sbjct: 551 EELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588
>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
This is a family of proteins that form part of the CAZ
(cytomatrix at the active zone) complex which is
involved in determining the site of synaptic vesicle
fusion. The C-terminus is a PDZ-binding motif that binds
directly to RIM (a small G protein Rab-3A effector). The
family also contains four coiled-coil domains.
Length = 774
Score = 28.4 bits (63), Expect = 5.2
Identities = 16/82 (19%), Positives = 40/82 (48%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
+E+ L+ E + L +L+ K + +++ + +E+ I R YE+ + ++
Sbjct: 336 QEKAILQTEVDALRYELERKHNTLTKKTASLQAAQEEKATYAGEIEDMRDRYEKTERKLR 395
Query: 229 RLTQENESAKEEVKEVLQALEE 250
L ++ E+ +E + + L+E
Sbjct: 396 VLQKKIENLQETFRRKERRLKE 417
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 28.1 bits (63), Expect = 5.3
Identities = 11/42 (26%), Positives = 25/42 (59%), Gaps = 5/42 (11%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQE 210
+E+ K E+E + L + ++ E+ + AEKL+ + +++E
Sbjct: 69 KEKAKREKEEKGLRELEEDTPED-----ELAEKLRLRKLQEE 105
>gnl|CDD|204405 pfam10146, zf-C4H2, Zinc finger-containing protein. This is a
family of proteins which appears to have a highly
conserved zinc finger domain at the C terminal end,
described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is
predicted to contain a coiled coil. Members are
annotated as being tumour-associated antigen HCA127 in
humans but this could not confirmed.
Length = 215
Score = 27.9 bits (62), Expect = 5.3
Identities = 16/63 (25%), Positives = 30/63 (47%), Gaps = 4/63 (6%)
Query: 200 EKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEEPTRTLLNFF 259
EKLK++I +EE + + + ++ EM L +E K + E L+ + +
Sbjct: 14 EKLKDEIKAEEEALESEEKHLKEYDKEMEELLEE----KMQHVEELRQIHADINDMETEI 69
Query: 260 RQS 262
+QS
Sbjct: 70 KQS 72
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 28.3 bits (63), Expect = 5.4
Identities = 22/74 (29%), Positives = 36/74 (48%), Gaps = 3/74 (4%)
Query: 179 ERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLT---QENE 235
ERL Q + + E+ Q Q +L+ QI L + + D QL+ E L
Sbjct: 409 ERLEQDIKKLQAELQQARQNESELRNQISLLTSLERSLKSDLGQLKKENDMLQTKLNSMV 468
Query: 236 SAKEEVKEVLQALE 249
SAK++ K+ +Q++E
Sbjct: 469 SAKQKDKQSMQSME 482
>gnl|CDD|216073 pfam00709, Adenylsucc_synt, Adenylosuccinate synthetase.
Length = 420
Score = 28.3 bits (64), Expect = 5.4
Identities = 26/83 (31%), Positives = 43/83 (51%), Gaps = 15/83 (18%)
Query: 160 GLMLGSLSNEE--RQKLEE-------ERERLYQQLDEKDEEINQQ-SQYAEKLKEQIMEQ 209
G+ +G L +EE R+KLE+ E+LY EEI ++ +YAE+L+ I +
Sbjct: 143 GIRVGDLLDEEVLREKLEQLLEEKNAILEKLYGAEPLDVEEILEEYLEYAERLRPYITDT 202
Query: 210 EELITTARRD-----YEQLQGEM 227
EL+ A ++ +E QG +
Sbjct: 203 SELLNRALKEGKKVLFEGAQGTL 225
>gnl|CDD|149682 pfam08702, Fib_alpha, Fibrinogen alpha/beta chain family.
Fibrinogen is a protein involved in platelet aggregation
and is essential for the coagulation of blood. This
domain forms part of the central coiled coiled region of
the protein which is formed from two sets of three
non-identical chains (alpha, beta and gamma).
Length = 146
Score = 27.3 bits (61), Expect = 5.5
Identities = 10/63 (15%), Positives = 20/63 (31%), Gaps = 5/63 (7%)
Query: 170 ERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTR 229
QK + + +Y + L+ +I QE I + + ++ R
Sbjct: 68 GDQKQAQPNDNIYNAYSKSLR-----KMIEYILETKINTQESQIRVLQEVLRSNRSKIQR 122
Query: 230 LTQ 232
L
Sbjct: 123 LEV 125
>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family. This
family consists of the 116kDa V-type ATPase (vacuolar
(H+)-ATPases) subunits, as well as V-type ATP synthase
subunit i. The V-type ATPases family are proton pumps
that acidify intracellular compartments in eukaryotic
cells for example yeast central vacuoles,
clathrin-coated and synaptic vesicles. They have
important roles in membrane trafficking processes. The
116kDa subunit (subunit a) in the V-type ATPase is part
of the V0 functional domain responsible for proton
transport. The a subunit is a transmembrane glycoprotein
with multiple putative transmembrane helices it has a
hydrophilic amino terminal and a hydrophobic carboxy
terminal. It has roles in proton transport and assembly
of the V-type ATPase complex. This subunit is encoded by
two homologous gene in yeast VPH1 and STV1.
Length = 707
Score = 28.5 bits (64), Expect = 5.5
Identities = 22/106 (20%), Positives = 39/106 (36%), Gaps = 13/106 (12%)
Query: 167 SNEERQKLEEERERLY----------QQLDEKDEEINQQSQYAEKLKEQIMEQEELITTA 216
NE R+ E ER+ D + S+ L+E+I++ E I
Sbjct: 27 VNEVRRCDEVERKLRKLESKIKKLGIPLKDTGGKPDVPPSKEFLDLEEEILDLEAEIKEV 86
Query: 217 RRDYEQLQGE---MTRLTQENESAKEEVKEVLQALEEPTRTLLNFF 259
+ E L+ E + + K + E L+ L E + ++F
Sbjct: 87 EENLESLEKEINELEEWLNVLDEEKSFLDENLEELSELSNLDIDFK 132
>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200). This
family is found in eukaryotes. It is a coiled-coil
domain of unknwon function.
Length = 126
Score = 27.2 bits (61), Expect = 5.7
Identities = 19/87 (21%), Positives = 45/87 (51%), Gaps = 4/87 (4%)
Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKL----KEQIMEQEELITTARRDYEQL 223
EE ++ EE ++ ++L++K+EE+ + +K + + E+ ++ ++
Sbjct: 20 REEFERREELLKQREEELEKKEEELQESLIKFDKFLKENEAKRRRAEKKAEEEKKLRKEK 79
Query: 224 QGEMTRLTQENESAKEEVKEVLQALEE 250
+ E+ L E E K E++++ + LEE
Sbjct: 80 EEEIKELKAELEELKAEIEKLEEKLEE 106
>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
autophagy-related subunit 14. The Atg14 or Apg14
proteins are hydrophilic proteins with a predicted
molecular mass of 40.5 kDa, and have a coiled-coil motif
at the N terminus region. Yeast cells with mutant Atg14
are defective not only in autophagy but also in sorting
of carboxypeptidase Y (CPY), a vacuolar-soluble
hydrolase, to the vacuole. Subcellular fractionation
indicate that Apg14p and Apg6p are peripherally
associated with a membrane structure(s). Apg14p was
co-immunoprecipitated with Apg6p, suggesting that they
form a stable protein complex. These results imply that
Apg6/Vps30p has two distinct functions: in the
autophagic process and in the vacuolar protein sorting
pathway. Apg14p may be a component specifically required
for the function of Apg6/Vps30p through the autophagic
pathway. There are 17 auto-phagosomal component proteins
which are categorized into six functional units, one of
which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
AS-PI3K complex and the Atg2-Atg18 complex are essential
for nucleation, and the specific function of the AS-PI3K
apparently is to produce phosphatidylinositol
3-phosphate (PtdIns(3)P) at the pre-autophagosomal
structure (PAS). The localisation of this complex at the
PAS is controlled by Atg14. Autophagy mediates the
cellular response to nutrient deprivation, protein
aggregation, and pathogen invasion in humans, and
malfunction of autophagy has been implicated in multiple
human diseases including cancer. This effect seems to be
mediated through direct interaction of the human Atg14
with Beclin 1 in the human phosphatidylinositol 3-kinase
class III complex.
Length = 307
Score = 28.1 bits (63), Expect = 5.7
Identities = 24/93 (25%), Positives = 42/93 (45%), Gaps = 2/93 (2%)
Query: 165 SLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQ 224
+ E KL + +L K E +NQ +LKE+I ++ E I +R Q +
Sbjct: 48 EGATNEDGKLAADLLKLEVA--RKKERLNQIRARISQLKEEIEQKRERIEELKRALAQRR 105
Query: 225 GEMTRLTQENESAKEEVKEVLQALEEPTRTLLN 257
+++ + + E + E LQ + TR+ LN
Sbjct: 106 SDLSSASYQLEKRRASQLEKLQDEIKRTRSKLN 138
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 27.7 bits (62), Expect = 5.8
Identities = 17/47 (36%), Positives = 29/47 (61%), Gaps = 3/47 (6%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKL---KEQIMEQEEL 212
EER+KLEE+RE ++ +E +EE ++ + E+ ++ EQEE
Sbjct: 29 EERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEY 75
>gnl|CDD|238111 cd00187, TOP4c, DNA Topoisomerase, subtype IIA; domain A';
bacterial DNA topoisomerase IV (C subunit, ParC),
bacterial DNA gyrases (A subunit, GyrA),mammalian DNA
toposiomerases II. DNA topoisomerases are essential
enzymes that regulate the conformational changes in DNA
topology by catalysing the concerted breakage and
rejoining of DNA strands during normal cellular growth.
Length = 445
Score = 27.9 bits (63), Expect = 6.2
Identities = 8/28 (28%), Positives = 17/28 (60%)
Query: 160 GLMLGSLSNEERQKLEEERERLYQQLDE 187
+ L L+ ER+KL +E + L ++++
Sbjct: 396 DMRLRRLTKLEREKLLKELKELEAEIED 423
>gnl|CDD|212667 cd10170, HSP70_NBD, Nucleotide-binding domain of the HSP70 family.
HSP70 (70-kDa heat shock protein) family chaperones
assist in protein folding and assembly and can direct
incompetent "client" proteins towards degradation.
Typically, HSP70s have a nucleotide-binding domain (NBD)
and a substrate-binding domain (SBD). The nucleotide
sits in a deep cleft formed between the two lobes of the
NBD. The two subdomains of each lobe change conformation
between ATP-bound, ADP-bound, and nucleotide-free
states. ATP binding opens up the substrate-binding site;
substrate-binding increases the rate of ATP hydrolysis.
HSP70 chaperone activity is regulated by various
co-chaperones: J-domain proteins and nucleotide exchange
factors (NEFs). Some HSP70 family members are not
chaperones but instead, function as NEFs to remove ADP
from their HSP70 chaperone partners during the ATP
hydrolysis cycle, some may function as both chaperones
and NEFs.
Length = 369
Score = 28.0 bits (63), Expect = 6.2
Identities = 14/53 (26%), Positives = 24/53 (45%), Gaps = 3/53 (5%)
Query: 30 KLTRILQESLGGNARTTIIICCSPASFNESETKSTLDFGRRAKTIKNVVTVNE 82
KL + LG +I PA FN+++ ++T + A + V +NE
Sbjct: 118 KLKEDAEAYLGEPVTEAVITV--PAYFNDAQREATKEAAEIAG-LNVVRLINE 167
>gnl|CDD|224006 COG1080, PtsA, Phosphoenolpyruvate-protein kinase (PTS system EI
component in bacteria) [Carbohydrate transport and
metabolism].
Length = 574
Score = 27.9 bits (63), Expect = 6.2
Identities = 21/81 (25%), Positives = 31/81 (38%), Gaps = 7/81 (8%)
Query: 174 LEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARR----DYEQLQGEMTR 229
+E E ERL L E+ EK E E++ I A D E +
Sbjct: 37 VEAEIERLDAALAAARAELEAL---KEKAAEDGGEEKAAIFEAHLMLLEDPELTEEAEEL 93
Query: 230 LTQENESAKEEVKEVLQALEE 250
+ Q SA+ +KEV++
Sbjct: 94 IRQGKVSAEYALKEVIEEFAA 114
>gnl|CDD|237560 PRK13922, PRK13922, rod shape-determining protein MreC;
Provisional.
Length = 276
Score = 28.0 bits (63), Expect = 6.3
Identities = 10/40 (25%), Positives = 17/40 (42%), Gaps = 3/40 (7%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAE---KLKEQ 205
L EE E L ++L E + + + Q +L+E
Sbjct: 69 ASLFDLREENEELKKELLELESRLQELEQLEAENARLREL 108
>gnl|CDD|220947 pfam11026, DUF2721, Protein of unknown function (DUF2721). This
family is conserved in bacteria. The function is not
known.
Length = 130
Score = 27.1 bits (61), Expect = 6.3
Identities = 15/53 (28%), Positives = 22/53 (41%), Gaps = 17/53 (32%)
Query: 148 APASILPAIPGSGLMLGSLSN--------------EERQKLEEERERLYQQLD 186
APA +L AI GL+L +N + + ER R+ +QL
Sbjct: 4 APAFLLSAI---GLLLLVYTNRLARIADRIRQLEARLEETDDPERARIRRQLR 53
>gnl|CDD|220180 pfam09325, Vps5, Vps5 C terminal like. Vps5 is a sorting nexin
that functions in membrane trafficking. This is the C
terminal dimerisation domain.
Length = 236
Score = 27.6 bits (62), Expect = 6.6
Identities = 13/75 (17%), Positives = 40/75 (53%)
Query: 176 EERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENE 235
+R + +QQ +++++++ KLK +++ + A+++ ++ + ++ + +E E
Sbjct: 128 NQRVKAWQQWQNAEQDLSKKRAQLAKLKGAGKSRQDKLQQAKKEVDEAERKVQQAKKEFE 187
Query: 236 SAKEEVKEVLQALEE 250
E +K+ L+ E
Sbjct: 188 DISETIKKELERFET 202
>gnl|CDD|217163 pfam02646, RmuC, RmuC family. This family contains several
bacterial RmuC DNA recombination proteins. The function
of the RMUC protein is unknown but it is suspected that
it is either a structural protein that protects DNA
against nuclease action, or is itself involved in DNA
cleavage at the regions of DNA secondary structures.
Length = 301
Score = 27.6 bits (62), Expect = 6.7
Identities = 16/81 (19%), Positives = 36/81 (44%), Gaps = 17/81 (20%)
Query: 171 RQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRL 230
RQ+L++ + + + L + ++++ E E + + E+LQ + L
Sbjct: 1 RQRLDQMLKPVDETL--------------QTFEQRVEESFEKVA---EELERLQEGLGEL 43
Query: 231 TQENESAKEEVKEVLQALEEP 251
+ N+ +EV + AL+ P
Sbjct: 44 QELNQQLAKEVGNLTAALKNP 64
>gnl|CDD|151173 pfam10669, Phage_Gp23, Protein gp23 (Bacteriophage A118). This is
the highly conserved family of the major tail subunit
protein.
Length = 121
Score = 27.0 bits (59), Expect = 6.8
Identities = 15/59 (25%), Positives = 34/59 (57%), Gaps = 4/59 (6%)
Query: 169 EERQKLEEERERLYQQLDEKDEE----INQQSQYAEKLKEQIMEQEELITTARRDYEQL 223
EER K+E ERE+ ++ E+ ++ +N+Q + +K + + +Q++ I + + +L
Sbjct: 50 EERDKMETEREKRDKESKEERDKFISTMNEQQRLMDKQNDMMGQQQQSIDSLSKSVGKL 108
>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
Length = 1486
Score = 28.0 bits (63), Expect = 7.1
Identities = 17/71 (23%), Positives = 28/71 (39%), Gaps = 7/71 (9%)
Query: 169 EERQKLEEERERLYQQL-------DEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYE 221
++Q+ E ++L DE ++ + E L E + E E R+ E
Sbjct: 530 RQQQRAERLLAEFCKRLGKNLDDEDELEQLQEELEARLESLSESVSEARERRMALRQQLE 589
Query: 222 QLQGEMTRLTQ 232
QLQ + RL
Sbjct: 590 QLQARIQRLAA 600
>gnl|CDD|180307 PRK05892, PRK05892, nucleoside diphosphate kinase regulator;
Provisional.
Length = 158
Score = 27.0 bits (60), Expect = 7.5
Identities = 10/31 (32%), Positives = 14/31 (45%)
Query: 165 SLSNEERQKLEEERERLYQQLDEKDEEINQQ 195
L+ R LE E RL + D E+N +
Sbjct: 7 GLAPAARDHLEAELARLRARRDRLAVEVNDR 37
>gnl|CDD|234263 TIGR03568, NeuC_NnaA, UDP-N-acetyl-D-glucosamine 2-epimerase,
UDP-hydrolysing. This family of enzymes catalyzes the
combined epimerization and UDP-hydrolysis of
UDP-N-acetylglucosamine to N-acetylmannosamine. This is
in contrast to the related enzyme WecB (TIGR00236) which
retains the UDP moiety. NeuC acts in concert with NeuA
and NeuB to synthesize CMP-N5-acetyl-neuraminate.
Length = 365
Score = 27.9 bits (63), Expect = 7.6
Identities = 12/20 (60%), Positives = 18/20 (90%)
Query: 231 TQENESAKEEVKEVLQALEE 250
T E ESA+E++KE+L+AL+E
Sbjct: 212 TLEKESAEEQIKELLKALDE 231
>gnl|CDD|218915 pfam06156, DUF972, Protein of unknown function (DUF972). This
family consists of several hypothetical bacterial
sequences. The function of this family is unknown.
Length = 106
Score = 26.5 bits (59), Expect = 7.8
Identities = 19/70 (27%), Positives = 32/70 (45%), Gaps = 14/70 (20%)
Query: 181 LYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENESAKEE 240
L+ L E ++++NQ +LK+Q+ E E E L ENE +E
Sbjct: 6 LFDALSELEQQLNQLLAELGELKQQLAELLE--------------ENAELRIENEHLRER 51
Query: 241 VKEVLQALEE 250
++E+ Q +E
Sbjct: 52 LEELEQEQKE 61
>gnl|CDD|218010 pfam04293, SpoVR, SpoVR like protein. Bacillus subtilis stage V
sporulation protein R is involved in spore cortex
formation. Little is known about cortex biosynthesis,
except that it depends on several sigma E controlled
genes, including spoVR.
Length = 427
Score = 27.5 bits (62), Expect = 7.9
Identities = 18/67 (26%), Positives = 29/67 (43%), Gaps = 7/67 (10%)
Query: 201 KLKEQIMEQEELITTARRDYEQLQGE--MTRLTQENESAKEEVKEVLQALEEPTRTLLNF 258
L+E+ QEE R +Y Q Q + + E +EE + EEP +L F
Sbjct: 176 SLEEEKARQEE-----REEYLQSQVNDLWRTIPPQREKLEEERARRKRFPEEPQEDILYF 230
Query: 259 FRQSSPK 265
+++P
Sbjct: 231 LEKNAPL 237
>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57. Eukaryotic
cilia and flagella are specialised organelles found at
the periphery of cells of diverse organisms.
Intra-flagellar transport (IFT) is required for the
assembly and maintenance of eukaryotic cilia and
flagella, and consists of the bidirectional movement of
large protein particles between the base and the distal
tip of the organelle. IFT particles contain multiple
copies of two distinct protein complexes, A and B, which
contain at least 6 and 11 protein subunits. IFT57 is
part of complex B but is not, however, required for the
core subunits to stay associated. This protein is known
as Huntington-interacting protein-1 in humans.
Length = 355
Score = 27.7 bits (62), Expect = 8.1
Identities = 25/102 (24%), Positives = 48/102 (47%), Gaps = 15/102 (14%)
Query: 151 SILPAIPGSGLMLGSLSNEERQKLE--EERERLYQQLDEKDEEINQQSQYAEKLKEQIME 208
+I ++P + L LS + + LE RE + L+ + E++ Q+ Y E +
Sbjct: 227 AISKSLPETKQYLKKLSQDLSKALEKISSRE---KHLNNQLEQLVQE--YREARRT---- 277
Query: 209 QEELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
++ + Y Q ++ LT+E EE+++V Q +EE
Sbjct: 278 ----LSQVQEKYNQASQGVSELTRELNEISEELEQVKQEMEE 315
>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
Length = 1113
Score = 28.0 bits (63), Expect = 8.3
Identities = 16/82 (19%), Positives = 31/82 (37%), Gaps = 8/82 (9%)
Query: 169 EERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMT 228
++ + +EE E+L QQL + ++ Q E LK+ R L +
Sbjct: 73 DKIDRQKEETEQLKQQLAQAPAKLRQAQAELEALKDD------NDEETRETLSTL--SLR 124
Query: 229 RLTQENESAKEEVKEVLQALEE 250
+L ++++ L E
Sbjct: 125 QLESRLAQTLDQLQNAQNDLAE 146
>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 257 to 277 amino acids in length. This domain is
found associated with pfam00004. This domain has a
conserved LER sequence motif.
Length = 276
Score = 27.4 bits (61), Expect = 8.3
Identities = 25/93 (26%), Positives = 44/93 (47%), Gaps = 11/93 (11%)
Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITT----------AR 217
EER+K +E+ + QQ + +E+ ++ +Y ++L++Q + EEL+ R
Sbjct: 91 AEERRKTLQEQTQQEQQRAQYQDELARK-RYQKELEQQRRQNEELLKMQEESVLRQEAMR 149
Query: 218 RDYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
R E+ EM R T E E+ E + E
Sbjct: 150 RATEEEILEMRRETIEEEAELERENIRAKIEAE 182
>gnl|CDD|204784 pfam11932, DUF3450, Protein of unknown function (DUF3450). This
family of proteins are functionally uncharacterized.
This protein is found in bacteria and eukaryotes.
Proteins in this family are about 260 amino acids in
length.
Length = 250
Score = 27.6 bits (62), Expect = 8.5
Identities = 12/78 (15%), Positives = 31/78 (39%), Gaps = 10/78 (12%)
Query: 184 QLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQEN-------ES 236
Q ++ + ++K ++ +Q + + QL E+ L N +
Sbjct: 26 QALAAAQQTVAAAAQSQKKIDKWADQTQ---ELLAEIRQLLKEIENLRVYNDQLQRLVAN 82
Query: 237 AKEEVKEVLQALEEPTRT 254
++E+ + Q +E+ +T
Sbjct: 83 QQQEIASLQQQIEQIEKT 100
>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
Length = 535
Score = 27.5 bits (61), Expect = 8.5
Identities = 21/82 (25%), Positives = 44/82 (53%), Gaps = 12/82 (14%)
Query: 170 ERQKLEEERERLYQQ---LDEKDEEINQQSQYAEKLKEQIMEQ-EELITTARRDYEQLQG 225
ERQ+L++ RL ++ LD KDE ++ KE+ +E E+ +T + ++ +
Sbjct: 95 ERQELKQIESRLTERATSLDRKDENLSS--------KEKTLESKEQSLTDKSKHIDEREE 146
Query: 226 EMTRLTQENESAKEEVKEVLQA 247
++ +L ++ ++ E V + QA
Sbjct: 147 QVEKLEEQKKAELERVAALSQA 168
>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein. Members of this
protein family belong to a conserved gene four-gene
neighborhood found sporadically in a phylogenetically
broad range of bacteria: Nocardia farcinica,
Symbiobacterium thermophilum, and Streptomyces
avermitilis (Actinobacteria), Geobacillus kaustophilus
(Firmicutes), Azoarcus sp. EbN1 and Ralstonia
solanacearum (Betaproteobacteria). Proteins in this
family average over 1400 amino acids in length
[Hypothetical proteins, Conserved].
Length = 1353
Score = 27.9 bits (62), Expect = 8.6
Identities = 19/79 (24%), Positives = 37/79 (46%), Gaps = 4/79 (5%)
Query: 174 LEEERERL--YQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLT 231
L+E R+ L + L+ Q+ Y + + + + +A+ Y+QL ++ R
Sbjct: 232 LDEYRDELERLEALERALRNFLQR--YRRYARTMLRRRATRLRSAQTQYDQLSRDLGRAR 289
Query: 232 QENESAKEEVKEVLQALEE 250
E E+A+EE +E+ E
Sbjct: 290 DELETAREEERELDARTEA 308
>gnl|CDD|236797 PRK10927, PRK10927, essential cell division protein FtsN;
Provisional.
Length = 319
Score = 27.3 bits (60), Expect = 8.7
Identities = 24/111 (21%), Positives = 49/111 (44%), Gaps = 9/111 (8%)
Query: 154 PAIPGSGLMLGSLSNEERQKLEEERERLYQQ--------LDEKDEEINQQSQYAEKLKEQ 205
P+ G L+ E+RQ LE+ + + QQ +E+ E QQ+ ++ +Q
Sbjct: 102 PSAGGEVKTPEQLTPEQRQLLEQMQADMRQQPTQLVEVPWNEQTPEQRQQTLQRQRQAQQ 161
Query: 206 IMEQEELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEEPTRTLL 256
+ +++ + R EQ + TR +Q + + + ++P + LL
Sbjct: 162 L-AEQQRLAQQSRTTEQSWQQQTRTSQAAPVQAQPRQSKPASTQQPYQDLL 211
>gnl|CDD|220897 pfam10883, DUF2681, Protein of unknown function (DUF2681). This
family of proteins is found in bacteria. Proteins in
this family are typically between 81 and 117 amino acids
in length.
Length = 87
Score = 25.8 bits (57), Expect = 8.7
Identities = 18/42 (42%), Positives = 24/42 (57%), Gaps = 2/42 (4%)
Query: 216 ARRDYEQLQGEMTRLTQENESAKEEVK--EVLQALEEPTRTL 255
A+R+ +LQ E +L E A+ EVK +V Q EE TR L
Sbjct: 28 AQRENRKLQAENEQLATEKAVAETEVKNAKVRQKNEENTRRL 69
>gnl|CDD|233056 TIGR00619, sbcd, exonuclease SbcD. All proteins in this family for
which functions are known are double-stranded DNA
exonuclease (as part of a complex with SbcC homologs).
This complex functions in the initiation of
recombination and recombinational repair and is
particularly important in regulating the stability of
DNA sections that can form secondary structures. This
family is likely homologous to the MRE11 family. This
family is based on the phylogenomic analysis of JA Eisen
(1999, Ph.D. Thesis, Stanford University) [DNA
metabolism, DNA replication, recombination, and repair].
Length = 253
Score = 27.4 bits (61), Expect = 8.8
Identities = 16/66 (24%), Positives = 20/66 (30%), Gaps = 5/66 (7%)
Query: 125 EDEQVNLADPVDMAASIAPTPESAPASILPA-----IPGSGLMLGSLSNEERQKLEEERE 179
D Q+ L + P L G L+L L + RQ LE
Sbjct: 111 GDPQILLLKDTANGELLIVGLPLLPREALLTRAGLDGFGLELLLLHLDVKLRQALEGLIR 170
Query: 180 RLYQQL 185
RL L
Sbjct: 171 RLDPDL 176
>gnl|CDD|153370 cd07686, F-BAR_Fer, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Fer (Fes related)
tyrosine kinase. F-BAR domains are dimerization modules
that bind and bend membranes and are found in proteins
involved in membrane dynamics and actin reorganization.
Fer (Fes related) is a cytoplasmic (or nonreceptor)
tyrosine kinase expressed in a wide variety of tissues,
and is found to reside in both the cytoplasm and the
nucleus. It plays important roles in neuronal
polarization and neurite development, cytoskeletal
reorganization, cell migration, growth factor signaling,
and the regulation of cell-cell interactions mediated by
adherens junctions and focal adhesions. Fer kinase also
regulates cell cycle progression in malignant cells. It
contains an N-terminal F-BAR domain, an SH2 domain, and
a C-terminal catalytic kinase domain. F-BAR domains form
banana-shaped dimers with a positively-charged concave
surface that binds to negatively-charged lipid
membranes. They can induce membrane deformation in the
form of long tubules.
Length = 234
Score = 27.3 bits (60), Expect = 8.8
Identities = 18/67 (26%), Positives = 37/67 (55%), Gaps = 1/67 (1%)
Query: 209 QEELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEEPTRTLLNFFRQSSPKGTL 268
+ E+ + + E+L+ +LT+E SAKE+ K+ + +E T + +++ K +
Sbjct: 118 EAEMYKVTKTELEKLKCSYRQLTKEVNSAKEKYKDAVAKGKE-TEKARERYDKATMKLHM 176
Query: 269 LYNHYLL 275
L+N Y+L
Sbjct: 177 LHNQYVL 183
>gnl|CDD|227365 COG5032, TEL1, Phosphatidylinositol kinase and protein kinases of the
PI-3 kinase family [Signal transduction mechanisms / Cell
division and chromosome partitioning / Chromatin
structure and dynamics / DNA replication, recombination,
and repair / Intracellular trafficking and secretion].
Length = 2105
Score = 27.8 bits (62), Expect = 8.9
Identities = 34/220 (15%), Positives = 64/220 (29%), Gaps = 52/220 (23%)
Query: 103 YKGKLEKLELELSRWRNGETV-------------------KEDEQVNLADPVDMAASIAP 143
L++ L+R+ + K + +LA+ +DM S+
Sbjct: 1089 LVKYLQRWYEALNRYFELLSKGDRLFAISFTKLRNVDALGKLELYSSLAE-IDMFLSLHR 1147
Query: 144 TPESAPASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLK 203
+ + A G + Q+L E Q + +E QY
Sbjct: 1148 RRKLLETLVATAYEQVGEW------YKAQQLYEV-----AQRKARSKEFPFSLQYLYWHI 1196
Query: 204 EQIMEQEELITTARRD------YEQLQGEMTRLTQEN------ESAKEEVKEVL--QALE 249
I ++L + E L E R + ES EE+ + + +
Sbjct: 1197 NDIDCADKLQSVLAELSLVTGISELLLEESWRRALFSNIKDSLESELEEIIDGMYKSNED 1256
Query: 250 EPTRTLLNFFRQSSPKGTLLYNHYLLGWPNQSERVSLWYR 289
LL+ + + L G + S+ + L
Sbjct: 1257 FGALMLLSLSAELW-------DKILEGRSSCSKSIKLSLN 1289
>gnl|CDD|221190 pfam11727, ISG65-75, Invariant surface glycoprotein. This family
is found in Trypanosome species, and appears to be one
of two invariant surface glycoproteins, ISG65 and ISG75.
that are found in the mammalian stage of the parasitic
protozoan. the sequence suggests the two families are
polypeptides with N-terminal signal sequences,
hydrophilic extracellular domains, single trans-membrane
alpha-helices and short cytoplasmic domains. they are
both expressed in the bloodstream form but not in the
midgut stage. Both polypeptides are distributed over the
entire surface of the parasite.
Length = 286
Score = 27.5 bits (61), Expect = 8.9
Identities = 5/42 (11%), Positives = 22/42 (52%)
Query: 175 EEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTA 216
+ +++++ + ++ ++I++ A+K + + E + A
Sbjct: 95 DSDKKKIKEICEKAKDKIDEALPEAKKAGREAEKLAEEVKEA 136
>gnl|CDD|221049 pfam11262, Tho2, Transcription factor/nuclear export subunit
protein 2. THO and TREX form a eukaryotic complex which
functions in messenger ribonucleoprotein metabolism and
plays a role in preventing the transcription-associated
genetic instability. Tho2, along with four other
subunits forms THO.
Length = 296
Score = 27.3 bits (61), Expect = 8.9
Identities = 13/62 (20%), Positives = 31/62 (50%), Gaps = 6/62 (9%)
Query: 195 QSQYA---EKLKEQIMEQE---ELITTARRDYEQLQGEMTRLTQENESAKEEVKEVLQAL 248
+ Y E+L++QI E + I +++ ++L+ + +L +E + E ++ + L
Sbjct: 26 KELYDKEIERLEKQIKELDSSSSGIDKKKKEKKRLKSLIKKLEEELKKHIEHNEKTKKRL 85
Query: 249 EE 250
E
Sbjct: 86 SE 87
>gnl|CDD|214660 smart00434, TOP4c, DNA Topoisomerase IV. Bacterial DNA
topoisomerase IV, GyrA, ParC.
Length = 444
Score = 27.5 bits (62), Expect = 9.0
Identities = 23/100 (23%), Positives = 37/100 (37%), Gaps = 17/100 (17%)
Query: 98 EKSSRY-KGKLEKLELELSRWRNGETVKEDEQVNL----ADPVDMA-----ASIAPTPES 147
+ Y GKLE L + DE + L D A +
Sbjct: 340 TRRKEYLLGKLEAERLHILEGLFIALSIIDEIIVLIRSSKDLAKEAKEKLMERFELSEIQ 399
Query: 148 APASILPAIPGSGLMLGSLSNEERQKLEEERERLYQQLDE 187
A A IL + L L+ E +KLE+E + L +++++
Sbjct: 400 ADA-IL------DMRLRRLTKLEVEKLEKELKELEKEIED 432
>gnl|CDD|205988 pfam13815, Dzip-like_N, Iguana/Dzip1-like DAZ-interacting protein
N-terminal. The DAZ gene-product - Deleted in
Azoospermia - and a closely related sequence are
required early in germ-cell development in order to
maintain germ-cell populations. This family is the
N-terminal region that is the only part of the protein
in some fungi and lower metazoa.
Length = 118
Score = 26.5 bits (59), Expect = 9.0
Identities = 12/45 (26%), Positives = 25/45 (55%)
Query: 174 LEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARR 218
L +E L Q E +E + + Q AE+ ++++ +Q E + + +R
Sbjct: 71 LLHSQECLTQSNSELEERLQEALQEAEEREQELSKQSEELKSLKR 115
>gnl|CDD|217020 pfam02403, Seryl_tRNA_N, Seryl-tRNA synthetase N-terminal domain.
This domain is found associated with the Pfam tRNA
synthetase class II domain (pfam00587) and represents
the N-terminal domain of seryl-tRNA synthetase.
Length = 108
Score = 26.4 bits (59), Expect = 9.2
Identities = 22/85 (25%), Positives = 42/85 (49%), Gaps = 11/85 (12%)
Query: 173 KLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRDYEQLQGEMTRLTQ 232
+L+EER L +L+E E N+ S+ K K++ + E LI E+ L
Sbjct: 33 ELDEERRELQVELEELQAERNELSKEIGKAKKKKEDAEALI-----------AEVKELKD 81
Query: 233 ENESAKEEVKEVLQALEEPTRTLLN 257
E ++ + E++E+ L++ ++ N
Sbjct: 82 ELKALEAELRELEAELDKLLLSIPN 106
>gnl|CDD|218760 pfam05812, Herpes_BLRF2, Herpesvirus BLRF2 protein. This family
consists of several Herpesvirus BLRF2 proteins.
Length = 118
Score = 26.5 bits (59), Expect = 9.3
Identities = 10/39 (25%), Positives = 22/39 (56%)
Query: 219 DYEQLQGEMTRLTQENESAKEEVKEVLQALEEPTRTLLN 257
E+L E+ +L EN++ K+++++ + E P +L
Sbjct: 4 TVEELAAELQKLQMENKALKKKLRQSVGPGESPDDEILT 42
>gnl|CDD|223266 COG0188, GyrA, Type IIA topoisomerase (DNA gyrase/topo II,
topoisomerase IV), A subunit [DNA replication,
recombination, and repair].
Length = 804
Score = 27.6 bits (62), Expect = 9.3
Identities = 12/58 (20%), Positives = 34/58 (58%)
Query: 161 LMLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARR 218
L L L+ E +K+E+E + L +++ + ++ + + + + +K++++E ++ RR
Sbjct: 426 LRLRRLTGLEEEKIEKELKELEKEIADLEKILASEERLLDIIKKELLEIKKKFGDERR 483
>gnl|CDD|218623 pfam05524, PEP-utilisers_N, PEP-utilising enzyme, N-terminal.
Length = 123
Score = 26.4 bits (59), Expect = 9.4
Identities = 25/79 (31%), Positives = 37/79 (46%), Gaps = 6/79 (7%)
Query: 175 EEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARR---DYEQLQGEMTRLT 231
E E RL L++ EE+ AE+ E + E+E I A D +L E+ L
Sbjct: 34 EAEIARLEAALEKAREELEAL---AERAAESLGEEEAAIFEAHLMLLDDPELLEEVEELI 90
Query: 232 QENESAKEEVKEVLQALEE 250
+E SA+ VKEV++
Sbjct: 91 REGLSAEAAVKEVIEEYAA 109
>gnl|CDD|222709 pfam14362, DUF4407, Domain of unknown function (DUF4407). This
family of proteins is found in bacteria. Proteins in
this family are typically between 366 and 597 amino
acids in length. There is a single completely conserved
residue R that may be functionally important.
Length = 297
Score = 27.2 bits (61), Expect = 9.4
Identities = 17/97 (17%), Positives = 28/97 (28%), Gaps = 23/97 (23%)
Query: 168 NEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITT------------ 215
+ L E L ++DE E+N Q E T
Sbjct: 134 RPKIAALTAEIAALQAEIDEAQAEVNAA---------YQEAQCEAEGTGGTGVAGKGPVY 184
Query: 216 --ARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
R + Q + L ++A +++ ALE
Sbjct: 185 KEKREKLDAAQARLETLKARLDAAIAQLEAQKAALER 221
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 27.7 bits (62), Expect = 9.4
Identities = 19/84 (22%), Positives = 44/84 (52%), Gaps = 4/84 (4%)
Query: 164 GSLSNEERQKLE----EERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEELITTARRD 219
+++ E + +LE EE + L +QL+E ++ + A + E I++ E+ + + D
Sbjct: 262 QTITEERKAELEAHFDEEYQELIEQLEELIDKYESHIEKALEELESILDTEKENSEFKLD 321
Query: 220 YEQLQGEMTRLTQENESAKEEVKE 243
E+L+ + L + E ++++E
Sbjct: 322 VEELKALLEALEEILEKNLQKLEE 345
>gnl|CDD|203138 pfam04977, DivIC, Septum formation initiator. DivIC from B.
subtilis is necessary for both vegetative and
sporulation septum formation. These proteins are mainly
composed of an amino terminal coiled-coil.
Length = 80
Score = 25.6 bits (57), Expect = 9.6
Identities = 14/45 (31%), Positives = 23/45 (51%), Gaps = 3/45 (6%)
Query: 217 RRDYEQLQGEMTRLTQENESAKEEVKEVLQ---ALEEPTRTLLNF 258
++ LQ E+ +L ENE + EVK++ +EE R+ L
Sbjct: 23 NQEIAALQAELAKLKAENEELEAEVKDLKSDPDYIEERARSELGL 67
>gnl|CDD|220373 pfam09738, DUF2051, Double stranded RNA binding protein (DUF2051).
This is a novel protein identified as interacting with
the leucine-rich repeat domain of human flightless-I,
FliI protein.
Length = 294
Score = 27.2 bits (60), Expect = 9.8
Identities = 19/97 (19%), Positives = 45/97 (46%), Gaps = 12/97 (12%)
Query: 165 SLSNEERQKLEEERERLYQQLDEK-----------DEEINQQSQYAEKLKEQIMEQEELI 213
S S + + E ++ L +++EK D E + LK+++ E EE +
Sbjct: 69 SSSVDTEASIRELKDSL-IEVEEKVKKAMVSNAQLDNEKTNLMYEVDLLKDKLEELEESL 127
Query: 214 TTARRDYEQLQGEMTRLTQENESAKEEVKEVLQALEE 250
+R++E+ ++ L + + + ++KE+ + L +
Sbjct: 128 AELQREFEEKTRDLEALKRAIDILQHQLKELKEQLRQ 164
>gnl|CDD|214016 cd12923, iSH2_PI3K_IA_R, Inter-Src homology 2 (iSH2) helical domain
of Class IA Phosphoinositide 3-kinase Regulatory
subunits. PI3Ks catalyze the transfer of the
gamma-phosphoryl group from ATP to the 3-hydroxyl of the
inositol ring of D-myo-phosphatidylinositol (PtdIns) or
its derivatives. They play an important role in a
variety of fundamental cellular processes, including
cell motility, the Ras pathway, vesicle trafficking and
secretion, immune cell activation, and apoptosis. They
are classified according to their substrate specificity,
regulation, and domain structure. Class IA PI3Ks are
heterodimers of a p110 catalytic (C) subunit and a
p85-related regulatory (R) subunit. The R subunit
down-regulates PI3K basal activity, stabilizes the C
subunit, and plays a role in the activation downstream
of tyrosine kinases. All R subunits contain two SH2
domains that flank an intervening helical domain (iSH2),
which binds to the N-terminal adaptor-binding domain
(ABD) of the catalytic subunit. In vertebrates, there
are three genes (PIK3R1, PIK3R2, and PIK3R3) that encode
for different Class IA PI3K R subunits.
Length = 152
Score = 26.8 bits (60), Expect = 9.8
Identities = 19/69 (27%), Positives = 36/69 (52%), Gaps = 3/69 (4%)
Query: 200 EKLKEQIMEQEELITTARRDYEQLQGEMTRLTQENES---AKEEVKEVLQALEEPTRTLL 256
EKL +++ E + R+Y++L + +L+QE + A E +E ++ EE RT
Sbjct: 4 EKLAKKLKEINKEYLDKSREYDELYEKYNKLSQEIQLKRQALEAFEEAVKMFEEQLRTQE 63
Query: 257 NFFRQSSPK 265
F +++ P
Sbjct: 64 KFQKEAQPH 72
>gnl|CDD|218311 pfam04888, SseC, Secretion system effector C (SseC) like family.
SseC is a secreted protein that forms a complex together
with SecB and SecD on the surface of Salmonella. All
these proteins are secreted by the type III secretion
system. Many mucosal pathogens use type III secretion
systems for the injection of effector proteins into
target cells. SecB, SseC and SecD are inserted into the
target cell membrane. where they form a small pore or
translocon. In addition to SseC, this family includes
the bacterial secreted proteins PopB, PepB, YopB and
EspD which are thought to be directly involved in pore
formation, and type III secretion system translocon.
Length = 303
Score = 27.4 bits (61), Expect = 10.0
Identities = 12/53 (22%), Positives = 27/53 (50%), Gaps = 6/53 (11%)
Query: 160 GLMLGSLSNEERQKLEEERERLYQQLDEKDEEINQQSQYAEKLKEQIMEQEEL 212
++ L+ ++ + ++ ER + ++K EE Y E++K+ I + EE
Sbjct: 6 TELISKLAEKQAKSKLQQLERARDKQEKKAEE------YQEQIKKAIEKAEEA 52
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.310 0.127 0.351
Gapped
Lambda K H
0.267 0.0788 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 14,611,958
Number of extensions: 1412506
Number of successful extensions: 5085
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4184
Number of HSP's successfully gapped: 1010
Length of query: 290
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 194
Effective length of database: 6,679,618
Effective search space: 1295845892
Effective search space used: 1295845892
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 58 (26.0 bits)