RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy7226
(279 letters)
>gnl|CDD|215803 pfam00225, Kinesin, Kinesin motor domain.
Length = 326
Score = 164 bits (417), Expect = 7e-49
Identities = 78/242 (32%), Positives = 116/242 (47%), Gaps = 48/242 (19%)
Query: 24 DHQVFIFDNIFGPNDSNETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTYTVS----- 78
+ F FD +F P + E ++ E PL+ + G N T+ AYGQTG GKTYT+
Sbjct: 38 REKTFTFDRVFDPEATQEFVYEETAKPLVESVLEGYNVTIFAYGQTGSGKTYTMEGDPPE 97
Query: 79 ---AMIMKTLQHVMQRCNKD-------DVYMSYLQLYSEKCYDLLNGNKEVTLKNWIFNI 128
+I + L+ + +R + V +SYL++Y+EK YDLL+ +
Sbjct: 98 EEPGIIPRALEDLFERIEERKERKWSFSVRVSYLEIYNEKIYDLLSPSS----------- 146
Query: 129 PQSDQRVQGPPDPQTGPASPGNGRSAASLTVKKLDSLNSAVQLIVTGNENKVTAVTKMNA 188
+ R++ DP+ G G LT ++ S ++L+ GN+N+ A T MN
Sbjct: 147 KKKKLRIR--EDPKKGVYVKG-------LTEVEVTSAEEVLELLELGNKNRTVASTNMNE 197
Query: 189 QSSRSHTICTIYL----------GAMAKLHLVDLAGSEQLFS--LSDNYLLRNEARKINL 236
+SSRSH I TI + KL+LVDLAGSE+ + L+ EA IN
Sbjct: 198 ESSRSHAIFTITVEQRNRDTDGSVKTGKLNLVDLAGSERASKTGAAGGRRLK-EAANINK 256
Query: 237 SL 238
SL
Sbjct: 257 SL 258
>gnl|CDD|238054 cd00106, KISc, Kinesin motor domain. This catalytic (head) domain
has ATPase activity and belongs to the larger group of
P-loop NTPases. Kinesins are microtubule-dependent
molecular motors that play important roles in
intracellular transport and in cell division. In most
kinesins, the motor domain is found at the N-terminus
(N-type), in some its is found in the middle (M-type),
or C-terminal (C-type). N-type and M-type kinesins are
(+) end-directed motors, while C-type kinesins are (-)
end-directed motors, i.e. they transport cargo towards
the (-) end of the microtubule. Kinesin motor domains
hydrolyze ATP at a rate of about 80 per second, and move
along the microtubule at a speed of about 6400 Angstroms
per second. To achieve that, kinesin head groups work in
pairs. Upon replacing ADP with ATP, a kinesin motor
domain increases its affinity for microtubule binding
and locks in place. Also, the neck linker binds to the
motor domain, which repositions the other head domain
through the coiled-coil domain close to a second tubulin
dimer, about 80 Angstroms along the microtubule.
Meanwhile, ATP hydrolysis takes place, and when the
second head domain binds to the microtubule, the first
domain again replaces ADP with ATP, triggering a
conformational change that pulls the first domain
forward.
Length = 328
Score = 164 bits (417), Expect = 7e-49
Identities = 76/240 (31%), Positives = 111/240 (46%), Gaps = 44/240 (18%)
Query: 28 FIFDNIFGPNDSNETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTYTVSA------MI 81
F FD++F PN + E ++ PL+ + G N T+ AYGQTG GKTYT+ +I
Sbjct: 47 FTFDHVFDPNSTQEDVYETTAKPLVESVLEGYNGTIFAYGQTGSGKTYTMFGSPKDPGII 106
Query: 82 MKTLQHVMQRCNKDD-------VYMSYLQLYSEKCYDLLNGNKEVTLKNWIFNIPQSDQR 134
+ L+ + ++ V +SYL++Y+EK YDLL+ P
Sbjct: 107 PRALEDLFNLIDERKEKNKSFSVSVSYLEIYNEKVYDLLSP-----------EPPSKPLS 155
Query: 135 VQGPPDPQTGPASPGNGRSAASLTVKKLDSLNSAVQLIVTGNENKVTAVTKMNAQSSRSH 194
++ P G LT ++ S A+ L+ G +N+ TA T MN +SSRSH
Sbjct: 156 LRE---------DPKGGVYVKGLTEVEVGSAEDALSLLQKGLKNRTTASTAMNERSSRSH 206
Query: 195 TICTIYL-----------GAMAKLHLVDLAGSEQLFSLSDNYLLRNEARKINLSLHYLEQ 243
I TI++ +KL+LVDLAGSE+ EA+ IN SL L
Sbjct: 207 AIFTIHVEQRNTTNDGRSIKSSKLNLVDLAGSERAKKTGAEGDRLKEAKNINKSLSALGN 266
>gnl|CDD|214526 smart00129, KISc, Kinesin motor, catalytic domain. ATPase.
Microtubule-dependent molecular motors that play
important roles in intracellular transport of organelles
and in cell division.
Length = 335
Score = 155 bits (393), Expect = 3e-45
Identities = 81/242 (33%), Positives = 115/242 (47%), Gaps = 47/242 (19%)
Query: 24 DHQVFIFDNIFGPNDSNETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTYTVSA---- 79
+ F FD +F S E +F E PL++ + G NAT+ AYGQTG GKTYT+
Sbjct: 44 GEKKFTFDKVFDATASQEDVFEETAAPLVDSVLEGYNATIFAYGQTGSGKTYTMIGTPDS 103
Query: 80 --MIMKTLQHVMQRCNKD------DVYMSYLQLYSEKCYDLLNGNKEVTLKNWIFNIPQS 131
+I + L+ + ++ +K V +SYL++Y+EK DLLN + L+ I +
Sbjct: 104 PGIIPRALKDLFEKIDKREEGWQFSVKVSYLEIYNEKIRDLLNPSS-KKLE--IREDEKG 160
Query: 132 DQRVQGPPDPQTGPASPGNGRSAASLTVKKLDSLNSAVQLIVTGNENKVTAVTKMNAQSS 191
V+G LT + S L+ GN+N+ A TKMN +SS
Sbjct: 161 GVYVKG-------------------LTEISVSSFEEVYNLLEKGNKNRTVAATKMNEESS 201
Query: 192 RSHTICTIYL-----------GAMAKLHLVDLAGSEQLFSLSDNYLLRN-EARKINLSLH 239
RSH + TI + G +KL+LVDLAGSE+ + R EA IN SL
Sbjct: 202 RSHAVFTITVEQKIKNSSSGSGKASKLNLVDLAGSER-AKKTGAEGDRLKEAGNINKSLS 260
Query: 240 YL 241
L
Sbjct: 261 AL 262
>gnl|CDD|238671 cd01375, KISc_KIF9_like, Kinesin motor domain, KIF9-like subgroup;
might play a role in cell shape remodeling. This
catalytic (head) domain has ATPase activity and belongs
to the larger group of P-loop NTPases. Kinesins are
microtubule-dependent molecular motors that play
important roles in intracellular transport and in cell
division. In most kinesins, the motor domain is found at
the N-terminus (N-type). N-type kinesins are (+)
end-directed motors, i.e. they transport cargo towards
the (+) end of the microtubule. Kinesin motor domains
hydrolyze ATP at a rate of about 80 per second, and move
along the microtubule at a speed of about 6400 Angstroms
per second. To achieve that, kinesin head groups work in
pairs. Upon replacing ADP with ATP, a kinesin motor
domain increases its affinity for microtubule binding
and locks in place. Also, the neck linker binds to the
motor domain, which repositions the other head domain
through the coiled-coil domain close to a second tubulin
dimer, about 80 Angstroms along the microtubule.
Meanwhile, ATP hydrolysis takes place, and when the
second head domain binds to the microtubule, the first
domain again replaces ADP with ATP, triggering a
conformational change that pulls the first domain
forward.
Length = 334
Score = 145 bits (367), Expect = 1e-41
Identities = 73/245 (29%), Positives = 111/245 (45%), Gaps = 43/245 (17%)
Query: 24 DHQVFIFDNIFGPNDSNETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTYTVS----- 78
D F FD +F S E ++ V P+++ +G N T+ AYGQTG GKT+T++
Sbjct: 47 DFS-FKFDGVFHNA-SQEEVYETVAKPVVDSALDGYNGTIFAYGQTGAGKTFTMTGGTES 104
Query: 79 ----AMIMKTLQHVMQRCNKD-----DVYMSYLQLYSEKCYDLLNGNKEVTLKNWIFNIP 129
+I + L+ V + V++SYL++Y+E+ YDLL
Sbjct: 105 YKDRGLIPRALEQVFREVAMRATKTYTVHVSYLEIYNEQLYDLLG--------------- 149
Query: 130 QSDQRVQGPPDPQTGPASPGNGRSAASLTVKKLDSLNSAVQLIVTGNENKVTAVTKMNAQ 189
T L++ + A+ L+ G N+ A T MN
Sbjct: 150 -DTPEALESLPAVTILEDSEQNIHVKGLSLHSATTEEEALNLLFLGETNRTIAETSMNQA 208
Query: 190 SSRSHTICTIYLGA-----------MAKLHLVDLAGSEQLFSLSDNYLLRNEARKINLSL 238
SSRSH I TI+L + ++KL+LVDLAGSE++ + + EA+ IN SL
Sbjct: 209 SSRSHCIFTIHLESRSREAGSEVVRLSKLNLVDLAGSERVSKTGVSGQVLKEAKYINKSL 268
Query: 239 HYLEQ 243
+LEQ
Sbjct: 269 SFLEQ 273
>gnl|CDD|238663 cd01367, KISc_KIF2_like, Kinesin motor domain, KIF2-like group.
KIF2 is a protein expressed in neurons, which has been
associated with axonal transport and neuron development;
alternative splice forms have been implicated in
lysosomal translocation. This catalytic (head) domain
has ATPase activity and belongs to the larger group of
P-loop NTPases. Kinesins are microtubule-dependent
molecular motors that play important roles in
intracellular transport and in cell division. In this
subgroup the motor domain is found in the middle
(M-type) of the protein chain. M-type kinesins are (+)
end-directed motors, i.e. they transport cargo towards
the (+) end of the microtubule. Kinesin motor domains
hydrolyze ATP at a rate of about 80 per second, and move
along the microtubule at a speed of about 6400 Angstroms
per second (KIF2 may be slower). To achieve that,
kinesin head groups work in pairs. Upon replacing ADP
with ATP, a kinesin motor domain increases its affinity
for microtubule binding and locks in place. Also, the
neck linker binds to the motor domain, which repositions
the other head domain through the coiled-coil domain
close to a second tubulin dimer, about 80 Angstroms
along the microtubule. Meanwhile, ATP hydrolysis takes
place, and when the second head domain binds to the
microtubule, the first domain again replaces ADP with
ATP, triggering a conformational change that pulls the
first domain forward.
Length = 322
Score = 139 bits (352), Expect = 2e-39
Identities = 73/235 (31%), Positives = 109/235 (46%), Gaps = 42/235 (17%)
Query: 22 FFDHQVFIFDNIFGPNDSNETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTYTVS--- 78
+ + F FD +F +NE ++ + PLI H+F G AT AYGQTG GKTYT+
Sbjct: 47 YIEKHTFRFDYVFDEAVTNEEVYRSTVKPLIPHVFEGGVATCFAYGQTGSGKTYTMLGDE 106
Query: 79 ------AMIMKTLQHVMQRCNKD-DVYMSYLQLYSEKCYDLLNGNKEVTLKNWIFNIPQS 131
A+ + + ++ + N D V +S+ ++Y K +DLLN K +++
Sbjct: 107 NQEGLYALAARDIFRLLAQPNDDLGVTVSFFEIYGGKLFDLLNDRKRLSVLE------DG 160
Query: 132 DQRVQGPPDPQTGPASPGNGRSAASLTVKKLDSLNSAVQLIVTGNENKVTAVTKMNAQSS 191
VQ LT K + S++ ++LI +GN + T T N QSS
Sbjct: 161 KGNVQ-----------------IVGLTEKPVTSVDELLELIESGNSLRTTGSTGANDQSS 203
Query: 192 RSHTICTIYL------GAMAKLHLVDLAGSEQLFS--LSDNYLLRNEARKINLSL 238
RSH I I L + KL +DLAGSE+ + + R E +IN SL
Sbjct: 204 RSHAILQIILKNKKLNKLLGKLSFIDLAGSER-GADTSEHDRQTRKEGAEINKSL 257
>gnl|CDD|238666 cd01370, KISc_KIP3_like, Kinesin motor domain, KIP3-like subgroup.
The yeast kinesin KIP3 plays a role in positioning the
mitotic spindle. This catalytic (head) domain has ATPase
activity and belongs to the larger group of P-loop
NTPases. Kinesins are microtubule-dependent molecular
motors that play important roles in intracellular
transport and in cell division. In most kinesins, the
motor domain is found at the N-terminus (N-type). N-type
kinesins are (+) end-directed motors, i.e. they
transport cargo towards the (+) end of the microtubule.
Kinesin motor domains hydrolyze ATP at a rate of about
80 per second, and move along the microtubule at a speed
of about 6400 Angstroms per second. To achieve that,
kinesin head groups work in pairs. Upon replacing ADP
with ATP, a kinesin motor domain increases its affinity
for microtubule binding and locks in place. Also, the
neck linker binds to the motor domain, which repositions
the other head domain through the coiled-coil domain
close to a second tubulin dimer, about 80 Angstroms
along the microtubule. Meanwhile, ATP hydrolysis takes
place, and when the second head domain binds to the
microtubule, the first domain again replaces ADP with
ATP, triggering a conformational change that pulls the
first domain forward.
Length = 338
Score = 128 bits (323), Expect = 6e-35
Identities = 72/260 (27%), Positives = 117/260 (45%), Gaps = 56/260 (21%)
Query: 5 SEVGFQNRERQQTKIWLFFDHQVFIFDNIFGPNDSNETIFTEVLVPLINHMFNGINATLL 64
E F+N ++ K + FD +F + E ++ PL++ + NG NAT+
Sbjct: 39 EEDAFRNLRARRNK------ELKYSFDRVFDETSTQEEVYENTTKPLVDGVLNGYNATVF 92
Query: 65 AYGQTGGGKTYTVSA------MIMKTLQHVMQRCNKD------DVYMSYLQLYSEKCYDL 112
AYG TG GKT+T+ +++ T++ + + + +V +SYL++Y+E DL
Sbjct: 93 AYGATGAGKTHTMLGTDSDPGLMVLTMKDLFDKIEERKDDKEFEVSLSYLEIYNETIRDL 152
Query: 113 LN-GNKEVTLKNWIFNIPQSDQRVQGPPDPQTGPASPGNGRSAASLTVKKLDSLNSAVQL 171
L+ + + L+ DP G A LT + S ++L
Sbjct: 153 LSPSSGPLELRE----------------DPN-------QGIVVAGLTEHQPKSAEEILEL 189
Query: 172 IVTGNENKVTAVTKMNAQSSRSHTICTIYLGA------------MAKLHLVDLAGSEQLF 219
++ GN N+ T+ NA SSRSH + I + + KL L+DLAGSE+
Sbjct: 190 LMKGNRNRTQEPTEANATSSRSHAVLQITVRQKDRTASINQQVRIGKLSLIDLAGSERA- 248
Query: 220 SLSDNYLLR-NEARKINLSL 238
S ++N R E IN SL
Sbjct: 249 SATNNRGQRLKEGANINRSL 268
>gnl|CDD|238662 cd01366, KISc_C_terminal, Kinesin motor domain,
KIFC2/KIFC3/ncd-like carboxy-terminal kinesins. Ncd is a
spindle motor protein necessary for chromosome
segregation in meiosis. KIFC2/KIFC3-like kinesins have
been implicated in motility of the Golgi apparatus as
well as dentritic and axonal transport in neurons. This
catalytic (head) domain has ATPase activity and belongs
to the larger group of P-loop NTPases. Kinesins are
microtubule-dependent molecular motors that play
important roles in intracellular transport and in cell
division. In this subgroup the motor domain is found at
the C-terminus (C-type). C-type kinesins are (-)
end-directed motors, i.e. they transport cargo towards
the (-) end of the microtubule. Kinesin motor domains
hydrolyze ATP at a rate of about 80 per second, and move
along the microtubule at a speed of about 6400 Angstroms
per second. To achieve that, kinesin head groups work in
pairs. Upon replacing ADP with ATP, a kinesin motor
domain increases its affinity for microtubule binding
and locks in place. Also, the neck linker binds to the
motor domain, which repositions the other head domain
through the coiled-coil domain close to a second tubulin
dimer, about 80 Angstroms along the microtubule.
Meanwhile, ATP hydrolysis takes place, and when the
second head domain binds to the microtubule, the first
domain again replaces ADP with ATP, triggering a
conformational change that pulls the first domain
forward.
Length = 329
Score = 126 bits (318), Expect = 2e-34
Identities = 72/235 (30%), Positives = 105/235 (44%), Gaps = 44/235 (18%)
Query: 27 VFIFDNIFGPNDSNETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTYTVSA------M 80
F FD +F P+ S E +F EV PL+ +G N + AYGQTG GKTYT+ +
Sbjct: 46 SFSFDRVFDPDASQEDVFEEV-SPLVQSALDGYNVCIFAYGQTGSGKTYTMEGPPENPGI 104
Query: 81 IMKTLQH---VMQRCNKD----DVYMSYLQLYSEKCYDLLNGNKEVTLKNWIFNIPQSDQ 133
I + L+ + + + S L++Y+E DLL K I + + +
Sbjct: 105 IPRALEQLFNTAEELKEKGWSYTITASMLEIYNETIRDLLATKPAPKKKLEIKHDSKGET 164
Query: 134 RVQGPPDPQTGPASPGNGRSAASLTVKKLDSLNSAVQLIVTGNENKVTAVTKMNAQSSRS 193
V LT + S +L+ G++N+ A T MN SSRS
Sbjct: 165 YVTN-------------------LTEVPVSSPEEVTRLLNLGSKNRSVASTNMNEHSSRS 205
Query: 194 HTICTIYL-------GAMA--KLHLVDLAGSEQLF-SLSDNYLLRNEARKINLSL 238
H + + + G KL+LVDLAGSE+L S + L+ EA+ IN SL
Sbjct: 206 HAVFQLKIRGTNLQTGEQTRGKLNLVDLAGSERLKKSGATGDRLK-EAQAINKSL 259
>gnl|CDD|238668 cd01372, KISc_KIF4, Kinesin motor domain, KIF4-like subfamily.
Members of this group seem to perform a variety of
functions, and have been implicated in neuronal
organelle transport and chromosome segregation during
mitosis. This catalytic (head) domain has ATPase
activity and belongs to the larger group of P-loop
NTPases. Kinesins are microtubule-dependent molecular
motors that play important roles in intracellular
transport and in cell division. In most kinesins, the
motor domain is found at the N-terminus (N-type). N-type
kinesins are (+) end-directed motors, i.e. they
transport cargo towards the (+) end of the microtubule.
Kinesin motor domains hydrolyze ATP at a rate of about
80 per second, and move along the microtubule at a speed
of about 6400 Angstroms per second. To achieve that,
kinesin head groups work in pairs. Upon replacing ADP
with ATP, a kinesin motor domain increases its affinity
for microtubule binding and locks in place. Also, the
neck linker binds to the motor domain, which repositions
the other head domain through the coiled-coil domain
close to a second tubulin dimer, about 80 Angstroms
along the microtubule. Meanwhile, ATP hydrolysis takes
place, and when the second head domain binds to the
microtubule, the first domain again replaces ADP with
ATP, triggering a conformational change that pulls the
first domain forward.
Length = 341
Score = 126 bits (319), Expect = 3e-34
Identities = 72/230 (31%), Positives = 110/230 (47%), Gaps = 56/230 (24%)
Query: 26 QVFIFDNIFGPNDSNETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTYT--------- 76
+ F FD +F P+ S E ++ + PL++ +F G NAT+LAYGQTG GKTYT
Sbjct: 40 KSFTFDYVFDPSTSQEEVYNTCVAPLVDGLFEGYNATVLAYGQTGSGKTYTMGTAFTASE 99
Query: 77 -VSAM--IMKTLQHVMQRC--NKDD----VYMSYLQLYSEKCYDLLNGNKEVTLKNWIFN 127
+ I + +QH+ ++ KD+ + +S+L+LY+E+ DLL + + K+
Sbjct: 100 DEEEVGIIPRAIQHIFKKIDEKKDEPDFQLKVSFLELYNEEVRDLL--SPSTSEKS---- 153
Query: 128 IPQSDQRVQGPPDPQTGPASPGNGRSAASLTVKKLDSLNSAVQLIVTGNENKVTAVTKMN 187
+Q D + G LT ++S + + G+ ++ TA T MN
Sbjct: 154 ------PIQIREDSKGNIIIVG-------LTEVTVNSAQEVMSCLEQGSLSRTTASTAMN 200
Query: 188 AQSSRSHTICTIYLGAM-------------------AKLHLVDLAGSEQL 218
+QSSRSH I TI L +K H VDLAGSE+L
Sbjct: 201 SQSSRSHAIFTITLEQTRKNGPIAPMSGDDKNSTLTSKFHFVDLAGSERL 250
>gnl|CDD|238665 cd01369, KISc_KHC_KIF5, Kinesin motor domain, kinesin heavy chain
(KHC) or KIF5-like subgroup. Members of this group have
been associated with organelle transport. This catalytic
(head) domain has ATPase activity and belongs to the
larger group of P-loop NTPases. Kinesins are
microtubule-dependent molecular motors that play
important roles in intracellular transport and in cell
division. In most kinesins, the motor domain is found at
the N-terminus (N-type). N-type kinesins are (+)
end-directed motors, i.e. they transport cargo towards
the (+) end of the microtubule. Kinesin motor domains
hydrolyze ATP at a rate of about 80 per second, and move
along the microtubule at a speed of about 6400 Angstroms
per second. To achieve that, kinesin head groups work in
pairs. Upon replacing ADP with ATP, a kinesin motor
domain increases its affinity for microtubule binding
and locks in place. Also, the neck linker binds to the
motor domain, which repositions the other head domain
through the coiled-coil domain close to a second tubulin
dimer, about 80 Angstroms along the microtubule.
Meanwhile, ATP hydrolysis takes place, and when the
second head domain binds to the microtubule, the first
domain again replaces ADP with ATP, triggering a
conformational change that pulls the first domain
forward.
Length = 325
Score = 122 bits (309), Expect = 5e-33
Identities = 78/239 (32%), Positives = 107/239 (44%), Gaps = 46/239 (19%)
Query: 24 DHQVFIFDNIFGPNDSNETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTYT-VSAMIM 82
D + F FD +F PN + E ++ V P+++ + NG N T+ AYGQTG GKTYT
Sbjct: 41 DGKTFSFDRVFPPNTTQEDVYNFVAKPIVDDVLNGYNGTIFAYGQTGSGKTYTMEGPPGD 100
Query: 83 KTLQHVMQRCNKD--------------DVYMSYLQLYSEKCYDLLNGNKEVTLKNWIFNI 128
L+ ++ R D V +SYL++Y EK DLL+
Sbjct: 101 PELKGIIPRIVHDIFEHISSMDENLEFHVKVSYLEIYMEKIRDLLD-------------- 146
Query: 129 PQSDQRVQGPPDPQTGPASPGNGRSAASLTVKKLDSLNSAVQLIVTGNENKVTAVTKMNA 188
S +Q D G G LT + + S +++I G N+ A T MN
Sbjct: 147 -VSKDNLQVHEDKNRGVYVKG-------LTERFVSSPEEVLEVINEGKSNRAVASTNMNE 198
Query: 189 QSSRSHTICTIYLGA---------MAKLHLVDLAGSEQLFSLSDNYLLRNEARKINLSL 238
+SSRSH+I I L KL LVDLAGSE++ EA+KIN SL
Sbjct: 199 ESSRSHSIFLITLKQENVETGSKKRGKLFLVDLAGSEKVSKTGAEGQTLEEAKKINKSL 257
>gnl|CDD|238660 cd01364, KISc_BimC_Eg5, Kinesin motor domain, BimC/Eg5 spindle pole
proteins, participate in spindle assembly and chromosome
segregation during cell division. This catalytic (head)
domain has ATPase activity and belongs to the larger
group of P-loop NTPases. Kinesins are
microtubule-dependent molecular motors that play
important roles in intracellular transport and in cell
division. In most kinesins, the motor domain is found at
the N-terminus (N-type), N-type kinesins are (+)
end-directed motors, i.e. they transport cargo towards
the (+) end of the microtubule. Kinesin motor domains
hydrolyze ATP at a rate of about 80 per second, and move
along the microtubule at a speed of about 6400 Angstroms
per second. To achieve that, kinesin head groups work in
pairs. Upon replacing ADP with ATP, a kinesin motor
domain increases its affinity for microtubule binding
and locks in place. Also, the neck linker binds to the
motor domain, which repositions the other head domain
through the coiled-coil domain close to a second tubulin
dimer, about 80 Angstroms along the microtubule.
Meanwhile, ATP hydrolysis takes place, and when the
second head domain binds to the microtubule, the first
domain again replaces ADP with ATP, triggering a
conformational change that pulls the first domain
forward.
Length = 352
Score = 122 bits (309), Expect = 9e-33
Identities = 73/251 (29%), Positives = 116/251 (46%), Gaps = 62/251 (24%)
Query: 27 VFIFDNIFGPNDSNETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTYTVS-------- 78
+ FD +FGP ++++V+ P+++ + G N T+ AYGQTG GKTYT+
Sbjct: 49 TYTFDKVFGPEADQIEVYSQVVSPILDEVLMGYNCTIFAYGQTGTGKTYTMEGDRTDNKG 108
Query: 79 ---------AMIMKTLQHV---MQRCNKD-DVYMSYLQLYSEKCYDLLNGNKEVTLKNWI 125
+I + L + ++ N + V +SYL+LY+E+ +DLL+ ++ I
Sbjct: 109 STWELSPHAGIIPRALYQLFEKLESQNTEYSVKVSYLELYNEELFDLLSSESDLNKPLRI 168
Query: 126 FNIPQSDQRVQGPPDPQTGPASPGNGRSAASLTVKKL-----DSLNSAVQLIVTGNENKV 180
F+ N + ++ L ++ N ++L+ G+ +
Sbjct: 169 FD-------------------DTNNKG---GVVIQGLEEITVNNANEGLKLLEKGSAKRK 206
Query: 181 TAVTKMNAQSSRSHTICTIY------------LGAMAKLHLVDLAGSEQLFSLSDNYLLR 228
TA T MN QSSRSH+I +I L + KL+LVDLAGSE + S R
Sbjct: 207 TAATLMNDQSSRSHSIFSITIHIKETTISGEELVKIGKLNLVDLAGSENI-GRSGAENKR 265
Query: 229 -NEARKINLSL 238
EA IN SL
Sbjct: 266 AREAGNINQSL 276
>gnl|CDD|238667 cd01371, KISc_KIF3, Kinesin motor domain, kinesins II or KIF3_like
proteins. Subgroup of kinesins, which form heterotrimers
composed of 2 kinesins and one non-motor accessory
subunit. Kinesins II play important roles in ciliary
transport, and have been implicated in neuronal
transport, melanosome transport, the secretory pathway,
and mitosis. This catalytic (head) domain has ATPase
activity and belongs to the larger group of P-loop
NTPases. Kinesins are microtubule-dependent molecular
motors that play important roles in intracellular
transport and in cell division. In this group the motor
domain is found at the N-terminus (N-type). N-type
kinesins are (+) end-directed motors, i.e. they
transport cargo towards the (+) end of the microtubule.
Kinesin motor domains hydrolyze ATP at a rate of about
80 per second, and move along the microtubule at a speed
of about 6400 Angstroms per second. To achieve that,
kinesin head groups work in pairs. Upon replacing ADP
with ATP, a kinesin motor domain increases its affinity
for microtubule binding and locks in place. Also, the
neck linker binds to the motor domain, which repositions
the other head domain through the coiled-coil domain
close to a second tubulin dimer, about 80 Angstroms
along the microtubule. Meanwhile, ATP hydrolysis takes
place, and when the second head domain binds to the
microtubule, the first domain again replaces ADP with
ATP, triggering a conformational change that pulls the
first domain forward.
Length = 333
Score = 121 bits (306), Expect = 1e-32
Identities = 75/243 (30%), Positives = 115/243 (47%), Gaps = 49/243 (20%)
Query: 26 QVFIFDNIFGPNDSNETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTYTVS------- 78
+VF FD ++ PN + E ++ E PL++ + G N T+ AYGQTG GKT+T+
Sbjct: 48 KVFTFDAVYDPNSTQEDVYNETARPLVDSVLEGYNGTIFAYGQTGTGKTFTMEGVREPPE 107
Query: 79 --AMIMKTLQHVMQRCNKDD-----VYMSYLQLYSEKCYDLLNGNKEVTLKNWIFNIPQS 131
+I + H+ K + V +SYL++Y+E+ DLL +
Sbjct: 108 LRGIIPNSFAHIFGHIAKAENVQFLVRVSYLEIYNEEVRDLLGKD--------------Q 153
Query: 132 DQRVQGPPDPQTGPASPGNGRSAASLTVKKLDSLNSAVQLIVTGNENKVTAVTKMNAQSS 191
++++ P G + VK + ++ + L GN+N+ T MN SS
Sbjct: 154 KKKLELKERPDRGVYVKD----LSMFVVKNAEEMDKLMTL---GNKNRSVGATNMNEDSS 206
Query: 192 RSHTICTIYLGA------------MAKLHLVDLAGSE-QLFSLSDNYLLRNEARKINLSL 238
RSH+I TI + + KL+LVDLAGSE Q + + L+ EA KINLSL
Sbjct: 207 RSHSIFTITIECSEKGEDGENHIRVGKLNLVDLAGSERQSKTGATGDRLK-EATKINLSL 265
Query: 239 HYL 241
L
Sbjct: 266 SAL 268
>gnl|CDD|238670 cd01374, KISc_CENP_E, Kinesin motor domain, CENP-E/KIP2-like
subgroup, involved in chromosome movement and/or spindle
elongation during mitosis. This catalytic (head) domain
has ATPase activity and belongs to the larger group of
P-loop NTPases. Kinesins are microtubule-dependent
molecular motors that play important roles in
intracellular transport and in cell division. In most
kinesins, the motor domain is found at the N-terminus
(N-type). N-type kinesins are (+) end-directed motors,
i.e. they transport cargo towards the (+) end of the
microtubule. Kinesin motor domains hydrolyze ATP at a
rate of about 80 per second, and move along the
microtubule at a speed of about 6400 Angstroms per
second. To achieve that, kinesin head groups work in
pairs. Upon replacing ADP with ATP, a kinesin motor
domain increases its affinity for microtubule binding
and locks in place. Also, the neck linker binds to the
motor domain, which repositions the other head domain
through the coiled-coil domain close to a second tubulin
dimer, about 80 Angstroms along the microtubule.
Meanwhile, ATP hydrolysis takes place, and when the
second head domain binds to the microtubule, the first
domain again replaces ADP with ATP, triggering a
conformational change that pulls the first domain
forward.
Length = 321
Score = 118 bits (298), Expect = 2e-31
Identities = 68/234 (29%), Positives = 107/234 (45%), Gaps = 46/234 (19%)
Query: 28 FIFDNIFGPNDSNETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTYTVSAM-----IM 82
F FD +FG +N ++ + P++ G N T+ AYGQT GKT+T+S I+
Sbjct: 42 FTFDRVFGGESTNREVYERIAKPVVRSALEGYNGTIFAYGQTSSGKTFTMSGDEQEPGII 101
Query: 83 K-----TLQHVMQRCNKD-DVYMSYLQLYSEKCYDLLNGNKEVTLKNWIFNIPQSDQRVQ 136
Q + +++ + +SYL++Y+EK DLL+ + Q ++
Sbjct: 102 PLAVRDIFQRIQDTPDREFLLRVSYLEIYNEKIKDLLSPS---------------PQELR 146
Query: 137 GPPDPQTGPASPGNGRSAASLTVKKLDSLNSAVQLIVTGNENKVTAVTKMNAQSSRSHTI 196
DP G A LT + + S +QLI G +N+ T N +SSRSHTI
Sbjct: 147 IREDPNKGVV-------VAGLTEEIVTSPEHLLQLIARGEKNRHVGETDFNERSSRSHTI 199
Query: 197 CTIYLGA------------MAKLHLVDLAGSEQLFSLSDNYLLRNEARKINLSL 238
+ + + ++ L+L+DLAGSE+ S + R E IN SL
Sbjct: 200 FQLTIESRERGDSESGTVRVSTLNLIDLAGSERA-SQTGAGERRKEGSFINKSL 252
>gnl|CDD|238672 cd01376, KISc_KID_like, Kinesin motor domain, KIF22/Kid-like
subgroup. Members of this group might play a role in
regulating chromosomal movement along microtubules in
mitosis. This catalytic (head) domain has ATPase
activity and belongs to the larger group of P-loop
NTPases. Kinesins are microtubule-dependent molecular
motors that play important roles in intracellular
transport and in cell division. In most kinesins, the
motor domain is found at the N-terminus (N-type). N-type
kinesins are (+) end-directed motors, i.e. they
transport cargo towards the (+) end of the microtubule.
Kinesin motor domains hydrolyze ATP at a rate of about
80 per second, and move along the microtubule at a speed
of about 6400 Angstroms per second. To achieve that,
kinesin head groups work in pairs. Upon replacing ADP
with ATP, a kinesin motor domain increases its affinity
for microtubule binding and locks in place. Also, the
neck linker binds to the motor domain, which repositions
the other head domain through the coiled-coil domain
close to a second tubulin dimer, about 80 Angstroms
along the microtubule. Meanwhile, ATP hydrolysis takes
place, and when the second head domain binds to the
microtubule, the first domain again replaces ADP with
ATP, triggering a conformational change that pulls the
first domain forward.
Length = 319
Score = 117 bits (296), Expect = 2e-31
Identities = 69/234 (29%), Positives = 109/234 (46%), Gaps = 41/234 (17%)
Query: 28 FIFDNIFGPNDSNETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTYTVS------AMI 81
+ FD +G + E IF+ + P++ H+ +G NAT+ AYG TG GKT+T+ +I
Sbjct: 49 YQFDAFYGTECTQEDIFSREVKPIVPHLLSGQNATVFAYGSTGAGKTHTMLGDPNEPGLI 108
Query: 82 MKTLQHVMQRCNKD----DVYMSYLQLYSEKCYDLLNG-NKEVTLKNWIFNIPQSDQRVQ 136
+TL +++ K MSY ++Y+EK YDLL KE+ ++ D +
Sbjct: 109 PRTLSDLLRMGRKQAWTGAFSMSYYEIYNEKVYDLLEPAKKELPIRE------DKDGNIL 162
Query: 137 GPPDPQTGPASPGNGRSAASLTVKKLDSLNSAVQLIVTGNENKVTAVTKMNAQSSRSHTI 196
LT K + S+ + + ++N+ A TK+N SSRSH +
Sbjct: 163 IV-----------------GLTSKPIKSMAEFEEAYIPASKNRTVAATKLNDNSSRSHAV 205
Query: 197 CTIYLGAMA-------KLHLVDLAGSEQLFSLSDNYLLRNEARKINLSLHYLEQ 243
I + A KL+L+DLAGSE + + E+ IN SL L +
Sbjct: 206 LRIKVTQPASNIQLEGKLNLIDLAGSEDNRRTGNEGIRLKESAAINSSLFVLSK 259
>gnl|CDD|238661 cd01365, KISc_KIF1A_KIF1B, Kinesin motor domain, KIF1_like
proteins. KIF1A (Unc104) transports synaptic vesicles to
the nerve terminal, KIF1B has been implicated in
transport of mitochondria. Both proteins are expressed
in neurons. This catalytic (head) domain has ATPase
activity and belongs to the larger group of P-loop
NTPases. Kinesins are microtubule-dependent molecular
motors that play important roles in intracellular
transport and in cell division. In most kinesins, the
motor domain is found at the N-terminus (N-type). N-type
kinesins are (+) end-directed motors, i.e. they
transport cargo towards the (+) end of the microtubule.
In contrast to the majority of dimeric kinesins, most
KIF1A/Unc104 kinesins are monomeric motors. A
lysine-rich loop in KIF1A binds to the negatively
charged C-terminus of tubulin and compensates for the
lack of a second motor domain, allowing KIF1A to move
processively.
Length = 356
Score = 110 bits (278), Expect = 3e-28
Identities = 73/244 (29%), Positives = 109/244 (44%), Gaps = 52/244 (21%)
Query: 28 FIFDNIF---GPNDSN----ETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTYTV--- 77
F FD+ + D + E +F ++ L++H F G N L AYGQTG GK+YT+
Sbjct: 50 FSFDHSYWSHDSEDPHYASQEDVFEDLGRELLDHAFEGYNVCLFAYGQTGSGKSYTMMGY 109
Query: 78 ---SAMIMKTLQHVMQRCNKD-------DVYMSYLQLYSEKCYDLLNGNKEVTLKNWIFN 127
+I + + + QR +V +SY+++Y+EK DLLN K+
Sbjct: 110 KEEKGIIPRLCEELFQRIESKKEQNLSYEVEVSYMEIYNEKVRDLLNPKKK--------- 160
Query: 128 IPQSDQRVQGPPDPQTGPASPGNGRSAASLTVKKLDSLNSAVQLIVTGNENKVTAVTKMN 187
+ + +V+ P GP L+ + S L+ GN+++ TA T MN
Sbjct: 161 -NKGNLKVR--EHPVLGPYVED-------LSKVAVTSYEDIQNLLEEGNKSRTTASTNMN 210
Query: 188 AQSSRSHTICTIYL-------------GAMAKLHLVDLAGSEQLFSLSDNYLLRNEARKI 234
SSRSH + TI L ++K+ LVDLAGSE+ S E I
Sbjct: 211 DTSSRSHAVFTIVLTQKKLDKETDLTTEKVSKISLVDLAGSERASSTGAEGDRLKEGSNI 270
Query: 235 NLSL 238
N SL
Sbjct: 271 NKSL 274
>gnl|CDD|227392 COG5059, KIP1, Kinesin-like protein [Cytoskeleton].
Length = 568
Score = 107 bits (268), Expect = 3e-26
Identities = 76/242 (31%), Positives = 111/242 (45%), Gaps = 43/242 (17%)
Query: 23 FDHQVFIFDNIFGPNDSNETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTYTVSA--- 79
+ FD +FGP+ + E ++ E + PLI+ + G N T+ AYGQTG GKTYT+S
Sbjct: 53 SKEGTYAFDKVFGPSATQEDVYEETIKPLIDSLLLGYNCTVFAYGQTGSGKTYTMSGTEE 112
Query: 80 ------MIMKTLQHVMQRCNKDDVY---MSYLQLYSEKCYDLLNGNKEVTLKNWIFNIPQ 130
+ +K L ++ + + +SYL++Y+EK YDLL+ N+E
Sbjct: 113 EPGIIPLSLKELFSKLEDLSMTKDFAVSISYLEIYNEKIYDLLSPNEE------------ 160
Query: 131 SDQRVQGPPDPQTGPASPGNGRSAASLTVKKLDSLNSAVQLIVTGNENKVTAVTKMNAQS 190
G A LT K + S + L+ G +N+ TA T++N +S
Sbjct: 161 ----------SLNIREDSLLGVKVAGLTEKHVSSKEEILDLLRKGEKNRTTASTEINDES 210
Query: 191 SRSHTICTIYLG---------AMAKLHLVDLAGSEQLFSLSDNYLLRNEARKINLSLHYL 241
SRSH+I I L +KL LVDLAGSE+ + E IN SL L
Sbjct: 211 SRSHSIFQIELASKNKVSGTSETSKLSLVDLAGSERAARTGNRGTRLKEGASINKSLLTL 270
Query: 242 EQ 243
Sbjct: 271 GN 272
>gnl|CDD|238669 cd01373, KISc_KLP2_like, Kinesin motor domain, KLP2-like subgroup.
Members of this subgroup seem to play a role in mitosis
and meiosis. This catalytic (head) domain has ATPase
activity and belongs to the larger group of P-loop
NTPases. Kinesins are microtubule-dependent molecular
motors that play important roles in intracellular
transport and in cell division. In most kinesins, the
motor domain is found at the N-terminus (N-type). N-type
kinesins are (+) end-directed motors, i.e. they
transport cargo towards the (+) end of the microtubule.
Kinesin motor domains hydrolyze ATP at a rate of about
80 per second, and move along the microtubule at a speed
of about 6400 Angstroms per second. To achieve that,
kinesin head groups work in pairs. Upon replacing ADP
with ATP, a kinesin motor domain increases its affinity
for microtubule binding and locks in place. Also, the
neck linker binds to the motor domain, which repositions
the other head domain through the coiled-coil domain
close to a second tubulin dimer, about 80 Angstroms
along the microtubule. Meanwhile, ATP hydrolysis takes
place, and when the second head domain binds to the
microtubule, the first domain again replaces ADP with
ATP, triggering a conformational change that pulls the
first domain forward.
Length = 337
Score = 99.1 bits (247), Expect = 4e-24
Identities = 67/263 (25%), Positives = 113/263 (42%), Gaps = 57/263 (21%)
Query: 16 QTKIWLFFDHQVFIFDNIFGPNDSNETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTY 75
T +W ++F FD++ N + E +F V PL+ +G N ++ AYGQTG GKTY
Sbjct: 31 DTLVWHSHPPRMFTFDHVADSNTNQEDVFQSVGKPLVEDCLSGYNGSIFAYGQTGSGKTY 90
Query: 76 TV--------------SAMIMKTLQHVMQRCNKDD----------VYMSYLQLYSEKCYD 111
T+ +I + +++ +++ S+L++Y+E+ D
Sbjct: 91 TMMGPSSSDDESPHGLQGVIPRIFEYLFSLIQREEEKRGDGLKFLCKCSFLEIYNEQITD 150
Query: 112 LLNGNKEVTLKNWIFNIPQSDQRVQGPPDPQTGPASPGNGRSAASLTVKKLDSLNSAVQL 171
LL+ + + ++ D G +LT + + S Q+
Sbjct: 151 LLD---------------PTSRNLKIRED-------IKKGVYVENLTEEYVSSYEDVYQV 188
Query: 172 IVTGNENKVTAVTKMNAQSSRSHTICTIYLGAMAK-----------LHLVDLAGSEQLFS 220
++ G N+ A T MN++SSRSH + T + + K L+LVDLAGSE+
Sbjct: 189 LLKGLSNRKVAATSMNSESSRSHAVFTCTIESWEKKASSTNIRTSRLNLVDLAGSERQKD 248
Query: 221 LSDNYLLRNEARKINLSLHYLEQ 243
+ EA+ IN SL L
Sbjct: 249 DGAEGVRLKEAKNINKSLSTLGH 271
>gnl|CDD|238664 cd01368, KISc_KIF23_like, Kinesin motor domain, KIF23-like
subgroup. Members of this group may play a role in
mitosis. This catalytic (head) domain has ATPase
activity and belongs to the larger group of P-loop
NTPases. Kinesins are microtubule-dependent molecular
motors that play important roles in intracellular
transport and in cell division. In most kinesins, the
motor domain is found at the N-terminus (N-type). N-type
kinesins are (+) end-directed motors, i.e. they
transport cargo towards the (+) end of the microtubule.
Kinesin motor domains hydrolyze ATP at a rate of about
80 per second, and move along the microtubule at a speed
of about 6400 Angstroms per second. To achieve that,
kinesin head groups work in pairs. Upon replacing ADP
with ATP, a kinesin motor domain increases its affinity
for microtubule binding and locks in place. Also, the
neck linker binds to the motor domain, which repositions
the other head domain through the coiled-coil domain
close to a second tubulin dimer, about 80 Angstroms
along the microtubule. Meanwhile, ATP hydrolysis takes
place, and when the second head domain binds to the
microtubule, the first domain again replaces ADP with
ATP, triggering a conformational change that pulls the
first domain forward.
Length = 345
Score = 99.0 bits (247), Expect = 5e-24
Identities = 72/246 (29%), Positives = 110/246 (44%), Gaps = 43/246 (17%)
Query: 23 FDHQVFIFDNIFGPNDSNETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTYTVS---- 78
F F +FGPN + + F +PL+ + G N+ L YG T GKTYT+
Sbjct: 52 QKETKFSFSKVFGPNTTQKEFFEGTALPLVQDLLKGKNSLLFTYGVTNSGKTYTMQGSPG 111
Query: 79 --AMIMKTLQHVMQRCNKDDVYMSYLQLYSEKCYDLLNGNKEVTLKNWIFNIPQSDQRVQ 136
++ ++L + V++SY+++Y+ YDLL + T K + + D
Sbjct: 112 DGGILPRSLDVIFNSIGGYSVFVSYVEIYNNYIYDLLEDSPSSTKKRQSLRL-REDH--- 167
Query: 137 GPPDPQTGPASPGNGRS-AASLTVKKLDSLNSAVQLIVTGNENKVTAVTKMNAQSSRSHT 195
NG A LT ++ S A ++ G +N+ A TK+N +SSRSH+
Sbjct: 168 -------------NGNMYVAGLTEVEVSSTEEAREVFKRGQKNRRVAGTKLNRESSRSHS 214
Query: 196 ICTIYLG-----------------AMAKLHLVDLAGSEQLFSLSDNYLLR-NEARKINLS 237
+ TI L +++L LVDLAGSE+ S + N R EA IN S
Sbjct: 215 VFTIKLVQAPGDSDGDVDQDKDQITVSQLSLVDLAGSERT-SRTQNTGERLKEAGNINTS 273
Query: 238 LHYLEQ 243
L L +
Sbjct: 274 LMTLGK 279
>gnl|CDD|215621 PLN03188, PLN03188, kinesin-12 family protein; Provisional.
Length = 1320
Score = 76.5 bits (188), Expect = 1e-15
Identities = 70/256 (27%), Positives = 112/256 (43%), Gaps = 62/256 (24%)
Query: 26 QVFIFDNIFGPNDSNETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTYTV-------- 77
Q F FD+I P + E IF V PL+ + G N+++ AYGQTG GKTYT+
Sbjct: 132 QTFTFDSIADPESTQEDIFQLVGAPLVENCLAGFNSSVFAYGQTGSGKTYTMWGPANGLL 191
Query: 78 --------SAMIMKTLQHVMQRCNKDDV-----------YMSYLQLYSEKCYDLLNGNKE 118
+ + + + R N++ + S+L++Y+E+ DLL+
Sbjct: 192 EEHLSGDQQGLTPRVFERLFARINEEQIKHADRQLKYQCRCSFLEIYNEQITDLLD---- 247
Query: 119 VTLKNWIFNIPQSDQRVQGPPDPQTGPASPGNGRSAASLTVKKLDSLNSAVQLIVTGNEN 178
S + +Q D ++G +LT + + ++ QL++ G N
Sbjct: 248 -----------PSQKNLQIREDVKSGV-------YVENLTEEYVKTMKDVTQLLIKGLSN 289
Query: 179 KVTAVTKMNAQSSRSHTICTIYLGAMAK-------------LHLVDLAGSEQLFSLSDNY 225
+ T T +NA+SSRSH++ T + + K ++LVDLAGSE+
Sbjct: 290 RRTGATSINAESSRSHSVFTCVVESRCKSVADGLSSFKTSRINLVDLAGSERQKLTGAAG 349
Query: 226 LLRNEARKINLSLHYL 241
EA IN SL L
Sbjct: 350 DRLKEAGNINRSLSQL 365
>gnl|CDD|238659 cd01363, Motor_domain, Myosin and Kinesin motor domain. These
ATPases belong to the P-loop NTPase family and provide
the driving force in myosin and kinesin mediated
processes.
Length = 186
Score = 51.8 bits (124), Expect = 4e-08
Identities = 31/84 (36%), Positives = 41/84 (48%), Gaps = 12/84 (14%)
Query: 170 QLIVTGNENKVTAVTKMNAQSSRSHTICTIYLGA------------MAKLHLVDLAGSEQ 217
L+ GN N+ TA T MN SSRSH++ I+ G + K++LVDLAGSE+
Sbjct: 60 DLMDKGNANRTTAATAMNEHSSRSHSVFRIHFGGKNALASATEQPKVGKINLVDLAGSER 119
Query: 218 LFSLSDNYLLRNEARKINLSLHYL 241
+ E IN SL L
Sbjct: 120 IDFSGAEGSRLTETANINKSLSTL 143
Score = 47.9 bits (114), Expect = 9e-07
Identities = 20/54 (37%), Positives = 31/54 (57%), Gaps = 7/54 (12%)
Query: 43 IFTEVLVPLINHMFNGINATLLAYGQTGGGKTYTV------SAMIMKTLQHVMQ 90
+F +V PL+ +G N + AYGQTG GKTYT+ + +I +T+ V+
Sbjct: 8 VFRDV-GPLLQSALDGYNVCIFAYGQTGSGKTYTMEGKREGAGIIPRTVTDVID 60
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 40.7 bits (95), Expect = 7e-04
Identities = 20/36 (55%), Positives = 29/36 (80%), Gaps = 3/36 (8%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
+ EEEEE+ +EEEEEEE+ ++++E+EEEE EE L
Sbjct: 862 DSEEEEEEEEEEEEEEEE---EEEEEEEEEENEEPL 894
Score = 36.5 bits (84), Expect = 0.013
Identities = 14/35 (40%), Positives = 26/35 (74%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
++ + G EEEEEE+++ ++++E+EEEE+EE
Sbjct: 852 VDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEE 886
Score = 28.4 bits (63), Expect = 4.9
Identities = 14/62 (22%), Positives = 34/62 (54%)
Query: 214 GSEQLFSLSDNYLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
Q + +D+ +++E + L+ + +++EKG + + ++++E+EEEE+
Sbjct: 815 HEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEE 874
Query: 274 EE 275
EE
Sbjct: 875 EE 876
Score = 27.7 bits (61), Expect = 8.2
Identities = 15/37 (40%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 242 EQE-EEEEKGKEEEEEEEKK-KGKKKKEKEEEEKEEN 276
E E E E+G+E E+E E + +GK + E E + KE
Sbjct: 729 EGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETE 765
Score = 27.7 bits (61), Expect = 9.0
Identities = 11/33 (33%), Positives = 18/33 (54%)
Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
+ E E KG+ E E E + K ++E E E + +
Sbjct: 670 EGETETKGENESEGEIPAERKGEQEGEGEIEAK 702
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 39.1 bits (92), Expect = 0.002
Identities = 18/79 (22%), Positives = 36/79 (45%), Gaps = 9/79 (11%)
Query: 200 YLGAMAKLHLVDLAGSEQLFSLSD---NYLLRNEARKINLSLHYLEQEEEEEKGKEEEEE 256
+L + K + A L++ +L ++ ++ E+ K+ EEE
Sbjct: 373 FLSIIFKHNPELAARLAAFLELTEEEIEFLTGSKKATK------KIKKIVEKAEKKREEE 426
Query: 257 EEKKKGKKKKEKEEEEKEE 275
+++KK K K++EE+EE
Sbjct: 427 KKEKKKKAFAGKKKEEEEE 445
Score = 36.8 bits (86), Expect = 0.009
Identities = 18/34 (52%), Positives = 29/34 (85%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
E++++ GK++EEEEE++K KK++EKEEEE+E
Sbjct: 429 EKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462
Score = 36.1 bits (84), Expect = 0.016
Identities = 13/35 (37%), Positives = 30/35 (85%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
+++EEE+ +E+E++EE+K+ ++++ +EE+E+EE
Sbjct: 437 GKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEE 471
Score = 35.3 bits (82), Expect = 0.026
Identities = 19/38 (50%), Positives = 28/38 (73%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
+++ K KEEEEEEEK+K +++KE+EEEE EE +
Sbjct: 431 KKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEE 468
Score = 34.9 bits (81), Expect = 0.039
Identities = 14/34 (41%), Positives = 27/34 (79%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
++++ K+EEEEEE+K+ K+++++EEEE+ E
Sbjct: 430 KKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463
Score = 33.7 bits (78), Expect = 0.10
Identities = 18/35 (51%), Positives = 31/35 (88%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
E+EEEEEK K+EEE+EE+++ +++++EEEEK++
Sbjct: 441 EEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475
Score = 33.0 bits (76), Expect = 0.19
Identities = 15/34 (44%), Positives = 31/34 (91%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
++EEEEE+ +++EEE+E+++ + ++EKEEEE+++
Sbjct: 440 KEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKK 473
Score = 32.6 bits (75), Expect = 0.23
Identities = 15/34 (44%), Positives = 31/34 (91%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
++EEEEE+ KE++EEE++++ ++ +E++EEE+E+
Sbjct: 439 KKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEK 472
>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
domain. The centromere protein B (CENP-B) dimerisation
domain is composed of two alpha-helices, which are
folded into an antiparallel configuration. Dimerisation
of CENP-B is mediated by this domain, in which monomers
dimerise to form a symmetrical, antiparallel, four-helix
bundle structure with a large hydrophobic patch in which
23 residues of one monomer form van der Waals contacts
with the other monomer. This CENP-B dimer configuration
may be suitable for capturing two distant CENP-B boxes
during centromeric heterochromatin formation.
Length = 101
Score = 36.3 bits (83), Expect = 0.003
Identities = 12/39 (30%), Positives = 26/39 (66%)
Query: 237 SLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
+LH+LE EE+ + +EEE+++ + + E ++E+ +E
Sbjct: 2 TLHFLEGEEDSDSDSDEEEDDDDEDEEDDDEDDDEDDDE 40
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 38.0 bits (89), Expect = 0.004
Identities = 20/71 (28%), Positives = 33/71 (46%), Gaps = 9/71 (12%)
Query: 207 LHLVDLAGSEQLFSLSDNYLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKK 266
L+LVD N L E + E+E+ + +EE +EE ++K ++KK
Sbjct: 242 LYLVDKLA---------NLKLSPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKK 292
Query: 267 EKEEEEKEENL 277
++E E K L
Sbjct: 293 KEEREAKLAKL 303
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 37.6 bits (87), Expect = 0.006
Identities = 17/38 (44%), Positives = 27/38 (71%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
+E + E GKEEE+E+E+ K +KKK+KE+ ++E K
Sbjct: 95 AKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRK 132
Score = 32.9 bits (75), Expect = 0.16
Identities = 17/38 (44%), Positives = 26/38 (68%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
E +EEEK KE+ +EE+KKK +K KE+ ++ K + K
Sbjct: 101 ESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAK 138
Score = 31.4 bits (71), Expect = 0.49
Identities = 14/37 (37%), Positives = 24/37 (64%)
Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
+EE +EK +E+E+EK+K ++ EEEK+ V+
Sbjct: 134 KEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVR 170
Score = 31.0 bits (70), Expect = 0.71
Identities = 10/35 (28%), Positives = 24/35 (68%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
+ +E K + +EEE++K+ K+++K+++EK +
Sbjct: 91 KTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPK 125
Score = 30.6 bits (69), Expect = 0.83
Identities = 14/34 (41%), Positives = 23/34 (67%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
EQ +EE+K K+E+ +EE K K K+E +E+ +
Sbjct: 111 EQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPK 144
Score = 30.2 bits (68), Expect = 1.4
Identities = 12/35 (34%), Positives = 23/35 (65%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
+ +EE++K KE+ +EE K + K++ KE+ +E
Sbjct: 112 QVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEK 146
Score = 29.5 bits (66), Expect = 2.1
Identities = 11/34 (32%), Positives = 25/34 (73%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
+++ +EEK K++E+ +E+ K +K KE+ +E++
Sbjct: 110 KEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPP 143
Score = 29.5 bits (66), Expect = 2.1
Identities = 11/34 (32%), Positives = 24/34 (70%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
+EE +++ +EE +E++ +K+KEKE++ +E
Sbjct: 124 PKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEP 157
Score = 29.5 bits (66), Expect = 2.2
Identities = 11/38 (28%), Positives = 22/38 (57%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
+ + K +E + E K+ +K+KE+ +EEK++ K
Sbjct: 86 KGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEK 123
Score = 29.1 bits (65), Expect = 2.7
Identities = 12/35 (34%), Positives = 25/35 (71%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
+ +EE + K +EE +EK+ K+K++++E++ EE
Sbjct: 123 KPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEP 157
Score = 28.7 bits (64), Expect = 3.6
Identities = 12/35 (34%), Positives = 28/35 (80%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
++E +E++ +E+E+E++KK ++ +++EEE+K E
Sbjct: 134 KEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRER 168
Score = 28.7 bits (64), Expect = 3.7
Identities = 15/35 (42%), Positives = 26/35 (74%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
E+ +EE K ++ +EE ++K+ K+KEKE+E+K E
Sbjct: 122 EKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEE 156
Score = 28.3 bits (63), Expect = 4.8
Identities = 12/35 (34%), Positives = 24/35 (68%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
E+ +E+ KE+E+E+EKK + + +EE+++E
Sbjct: 135 EEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERV 169
Score = 27.9 bits (62), Expect = 6.2
Identities = 11/35 (31%), Positives = 16/35 (45%)
Query: 239 HYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
++ E ++ +KK KKKE EEEK
Sbjct: 161 EEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEK 195
Score = 27.9 bits (62), Expect = 6.4
Identities = 13/35 (37%), Positives = 22/35 (62%), Gaps = 1/35 (2%)
Query: 242 EQEEEEEKGKEEE-EEEEKKKGKKKKEKEEEEKEE 275
E+E+E+EK EE + EE+KK ++ + K +K
Sbjct: 145 EKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPP 179
Score = 27.9 bits (62), Expect = 6.6
Identities = 11/34 (32%), Positives = 26/34 (76%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
E+E+E+ K ++++++E+ K+ K ++ +EE KE+
Sbjct: 107 EKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEK 140
Score = 27.5 bits (61), Expect = 9.7
Identities = 9/34 (26%), Positives = 18/34 (52%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
E+++ E + ++ KK K+KE E+E+
Sbjct: 162 EEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEK 195
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 37.4 bits (87), Expect = 0.006
Identities = 19/32 (59%), Positives = 26/32 (81%)
Query: 244 EEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
EEEEE+ +E++EEEEK K+++ EEEEKEE
Sbjct: 38 EEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEE 69
Score = 37.1 bits (86), Expect = 0.008
Identities = 15/34 (44%), Positives = 24/34 (70%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
E+E+ +K +E +EEEEK++ KKK +K +E E
Sbjct: 50 EEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTE 83
Score = 36.3 bits (84), Expect = 0.014
Identities = 18/38 (47%), Positives = 24/38 (63%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
++EEE +EEE+EE+KKK KK KE E + N K
Sbjct: 54 TTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLNKTK 91
Score = 35.9 bits (83), Expect = 0.021
Identities = 18/36 (50%), Positives = 28/36 (77%)
Query: 244 EEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
+EEEE+ KEE++EEE+K K++E +EEE++E K
Sbjct: 37 DEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKK 72
Score = 34.4 bits (79), Expect = 0.068
Identities = 15/32 (46%), Positives = 23/32 (71%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
E+EE+ +EE +EEE+K+ KKKK K+ +E
Sbjct: 49 EEEEKTTDKEEEVDEEEEKEEKKKKTKKVKET 80
Score = 34.0 bits (78), Expect = 0.072
Identities = 15/38 (39%), Positives = 28/38 (73%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
E +EEE+ ++EE++EE++K K+E+ +EE+E+ K
Sbjct: 34 EVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKK 71
Score = 34.0 bits (78), Expect = 0.094
Identities = 18/36 (50%), Positives = 27/36 (75%)
Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
YL E+E EK +EEEEE+K+ KK++E++ +KEE
Sbjct: 24 YLWVEKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEE 59
Score = 32.8 bits (75), Expect = 0.17
Identities = 13/34 (38%), Positives = 26/34 (76%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
++EEEE+ +EEE +E+++ ++KK+K ++ KE
Sbjct: 47 KKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKET 80
Score = 32.0 bits (73), Expect = 0.34
Identities = 21/49 (42%), Positives = 28/49 (57%), Gaps = 11/49 (22%)
Query: 242 EQEEEEEKGKE-----------EEEEEEKKKGKKKKEKEEEEKEENLVK 279
E+E+EE+K +E EEEE+E+KK K KK KE + E L K
Sbjct: 41 EEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLNK 89
Score = 32.0 bits (73), Expect = 0.35
Identities = 18/37 (48%), Positives = 25/37 (67%), Gaps = 3/37 (8%)
Query: 242 EQEEEEEKGKEEEE---EEEKKKGKKKKEKEEEEKEE 275
E+E E+E EEEE EE+K++ +K +KEEE EE
Sbjct: 28 EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEE 64
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 36.9 bits (86), Expect = 0.006
Identities = 14/34 (41%), Positives = 23/34 (67%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
++EE+EEK +E+ + K K KK + + EEKE+
Sbjct: 38 DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEK 71
Score = 33.9 bits (78), Expect = 0.060
Identities = 12/37 (32%), Positives = 21/37 (56%)
Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
EEE+E+ +EE+ + K KK K + E++E +
Sbjct: 38 DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKR 74
Score = 28.1 bits (63), Expect = 5.2
Identities = 14/53 (26%), Positives = 25/53 (47%), Gaps = 15/53 (28%)
Query: 242 EQEEEEEKGKE----EEEEEE-----------KKKGKKKKEKEEEEKEENLVK 279
+ E+E++ K+ EE+EE+ K K K + EE+EK + +
Sbjct: 25 DDEDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKE 77
Score = 27.3 bits (61), Expect = 9.3
Identities = 12/35 (34%), Positives = 21/35 (60%)
Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
+ + EEK K + E+EEK + +++ E+E E L
Sbjct: 63 KAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKL 97
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 37.0 bits (86), Expect = 0.006
Identities = 12/32 (37%), Positives = 23/32 (71%)
Query: 246 EEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
E EK K+E EE++++ ++K +EE+ +EE +
Sbjct: 150 ELEKIKKERAEEKEREEEEKAAEEEKAREEEI 181
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 35.7 bits (83), Expect = 0.008
Identities = 19/46 (41%), Positives = 30/46 (65%), Gaps = 1/46 (2%)
Query: 232 RKINLSLHYLEQE-EEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
R I++ E+E E+EEK ++EE+E+E KG +KE+ E+ EE
Sbjct: 10 RIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEEL 55
>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
TAF4 family. This region of similarity is found in
Transcription initiation factor TFIID component TAF4.
Length = 255
Score = 36.2 bits (84), Expect = 0.010
Identities = 18/60 (30%), Positives = 31/60 (51%), Gaps = 7/60 (11%)
Query: 221 LSDNYLLRNEARKINLSLHYLEQEEEE-------EKGKEEEEEEEKKKGKKKKEKEEEEK 273
NY +R++ R+ L ++EEEE E G E+ E+ K+ K+++K E E+
Sbjct: 101 SDPNYEIRSDVRRQLRFLAQKQKEEEERRVERRRELGLEDPEQLRLKQKAKEEQKAESEE 160
>gnl|CDD|177433 PHA02608, 67, prohead core protein; Provisional.
Length = 80
Score = 34.0 bits (78), Expect = 0.011
Identities = 8/47 (17%), Positives = 26/47 (55%)
Query: 230 EARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
E K+ ++ + + EE E ++E++++ K + ++++ E++
Sbjct: 33 EEEKVEIARSVMIEGEEPEDDDDDEDDDDDDDKDDKDDDDDDDDEDD 79
Score = 25.9 bits (57), Expect = 9.1
Identities = 4/30 (13%), Positives = 19/30 (63%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEE 271
E ++++E +++++++K ++++E
Sbjct: 51 EDDDDDEDDDDDDDKDDKDDDDDDDDEDDE 80
>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
eukaryotic RNase H complex which cleaves RNA-DNA
hybrids. Ribonuclease H2B is one of the three proteins
of eukaryotic RNase H2 complex that is required for
nucleic acid binding and hydrolysis. RNase H is
classified into two families, type I (prokaryotic RNase
HI, eukaryotic RNase H1 and viral RNase H) and type II
(prokaryotic RNase HII and HIII, and eukaryotic RNase
H2/HII). RNase H endonucleolytically hydrolyzes an RNA
strand when it is annealed to a complementary DNA strand
in the presence of divalent cations, in DNA replication
and repair. The enzyme can be found in bacteria,
archaea, and eukaryotes. Most prokaryotic and eukaryotic
genomes contain multiple RNase H genes. Despite a lack
of evidence for homology from sequence comparisons, type
I and type II RNase H share a common fold and similar
steric configurations of the four acidic active-site
residues, suggesting identical or very similar catalytic
mechanisms. Eukaryotic RNase HII is active during
replication and is believed to play a role in removal of
Okazaki fragment primers and single ribonucleotides in
DNA-DNA duplexes. Eukaryotic RNase HII is functional
when it forms a complex with RNase H2B and RNase H2C
proteins. It is speculated that the two accessory
subunits are required for correct folding of the
catalytic subunit of RNase HII. Mutations in the three
subunits of human RNase HII cause neurological disorder.
Length = 211
Score = 35.0 bits (81), Expect = 0.024
Identities = 27/88 (30%), Positives = 42/88 (47%), Gaps = 8/88 (9%)
Query: 197 CTIYLGAMAKLHLVDLAGSEQLFSLSDN----YLLR--NEARKINLSLHYLEQEEEEEKG 250
I ++AK+ V G ++ + SD +LL+ +K L + E E +K
Sbjct: 116 LPILESSLAKICDVKEEGDDKFYKYSDEKLLAWLLKKVERLKKKELDIK--EAEAARDKK 173
Query: 251 KEEEEEEEKKKGKKKKEKEEEEKEENLV 278
K EEEKKK KKK K+++ K+ V
Sbjct: 174 KSNNAEEEKKKKKKKSAKKKKLKKVAAV 201
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 34.4 bits (79), Expect = 0.030
Identities = 17/45 (37%), Positives = 29/45 (64%), Gaps = 3/45 (6%)
Query: 230 EARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
E INL + E+ E+E+K +E EE++ KK++KE++EK+
Sbjct: 84 EGYSINL---FPEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKK 125
Score = 34.0 bits (78), Expect = 0.032
Identities = 13/32 (40%), Positives = 24/32 (75%)
Query: 244 EEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
EE E++ KE +E E+K+ K K++++E+KE+
Sbjct: 93 EESEKEQKEVSKETEEKEAIKAKKEKKEKKEK 124
Score = 30.5 bits (69), Expect = 0.60
Identities = 17/38 (44%), Positives = 26/38 (68%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
++E++E KE EE+E K K+KKEK+E++ E L K
Sbjct: 95 SEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAK 132
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 34.7 bits (80), Expect = 0.031
Identities = 12/37 (32%), Positives = 25/37 (67%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
L E++ K E E+ +E+ + +++E+EEEE++E+
Sbjct: 153 LSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDF 189
Score = 34.3 bits (79), Expect = 0.042
Identities = 14/36 (38%), Positives = 24/36 (66%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
+ E+K KE E E+ ++ +K +E+EEEE+EE+
Sbjct: 151 EKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEED 186
Score = 27.4 bits (61), Expect = 8.0
Identities = 12/38 (31%), Positives = 21/38 (55%), Gaps = 6/38 (15%)
Query: 244 EEEEEKGKEEEEEEEKKKG------KKKKEKEEEEKEE 275
EEE+ K E++ K+ ++ ++ EEEE+EE
Sbjct: 145 EEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEE 182
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 33.0 bits (76), Expect = 0.035
Identities = 9/27 (33%), Positives = 18/27 (66%)
Query: 251 KEEEEEEEKKKGKKKKEKEEEEKEENL 277
EE++E+++ +++KE+ EEE L
Sbjct: 75 AAAEEKKEEEEEEEEKEESEEEAAAGL 101
Score = 33.0 bits (76), Expect = 0.040
Identities = 9/24 (37%), Positives = 16/24 (66%)
Query: 252 EEEEEEEKKKGKKKKEKEEEEKEE 275
E+KK ++++E+E+EE EE
Sbjct: 72 AAAAAAEEKKEEEEEEEEKEESEE 95
Score = 31.5 bits (72), Expect = 0.12
Identities = 9/26 (34%), Positives = 17/26 (65%)
Query: 246 EEEKGKEEEEEEEKKKGKKKKEKEEE 271
EE++EE+++ ++K+E EEE
Sbjct: 71 AAAAAAAEEKKEEEEEEEEKEESEEE 96
Score = 31.5 bits (72), Expect = 0.13
Identities = 9/24 (37%), Positives = 17/24 (70%)
Query: 252 EEEEEEEKKKGKKKKEKEEEEKEE 275
EEKK+ ++++E++EE +EE
Sbjct: 73 AAAAAEEKKEEEEEEEEKEESEEE 96
Score = 28.8 bits (65), Expect = 1.2
Identities = 10/17 (58%), Positives = 15/17 (88%)
Query: 242 EQEEEEEKGKEEEEEEE 258
E++EEEE+ +E+EE EE
Sbjct: 79 EKKEEEEEEEEKEESEE 95
Score = 28.8 bits (65), Expect = 1.3
Identities = 11/16 (68%), Positives = 14/16 (87%)
Query: 242 EQEEEEEKGKEEEEEE 257
++EEEEE+ KEE EEE
Sbjct: 81 KEEEEEEEEKEESEEE 96
Score = 28.4 bits (64), Expect = 1.4
Identities = 10/17 (58%), Positives = 15/17 (88%)
Query: 242 EQEEEEEKGKEEEEEEE 258
++EEEEE+ ++EE EEE
Sbjct: 80 KKEEEEEEEEKEESEEE 96
Score = 28.4 bits (64), Expect = 1.5
Identities = 9/17 (52%), Positives = 15/17 (88%)
Query: 242 EQEEEEEKGKEEEEEEE 258
E+++EEE+ +EE+EE E
Sbjct: 78 EEKKEEEEEEEEKEESE 94
Score = 28.0 bits (63), Expect = 1.9
Identities = 6/24 (25%), Positives = 15/24 (62%)
Query: 252 EEEEEEEKKKGKKKKEKEEEEKEE 275
++K ++++E+EE+E+ E
Sbjct: 71 AAAAAAAEEKKEEEEEEEEKEESE 94
Score = 27.6 bits (62), Expect = 3.0
Identities = 10/17 (58%), Positives = 12/17 (70%)
Query: 242 EQEEEEEKGKEEEEEEE 258
E+EEEEEK + EEE
Sbjct: 83 EEEEEEEKEESEEEAAA 99
Score = 27.2 bits (61), Expect = 3.8
Identities = 8/17 (47%), Positives = 14/17 (82%)
Query: 242 EQEEEEEKGKEEEEEEE 258
+E++EE+ +EEE+EE
Sbjct: 77 AEEKKEEEEEEEEKEES 93
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 35.0 bits (81), Expect = 0.040
Identities = 24/73 (32%), Positives = 42/73 (57%), Gaps = 4/73 (5%)
Query: 207 LHLVDLAGSEQLFSLSDNYLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKK 266
L L D E L +L +R E + S+ L+ +EE ++ KEE+E +++K +K
Sbjct: 517 LQLCDKLRDEWLPNLG----IRIEDKPDGPSVWKLDDKEELQREKEEKEALKEQKRLRKL 572
Query: 267 EKEEEEKEENLVK 279
+K+EE+K++ L K
Sbjct: 573 KKQEEKKKKELEK 585
Score = 34.2 bits (79), Expect = 0.063
Identities = 13/37 (35%), Positives = 26/37 (70%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
L++E+EE++ +E++ K K +++K+K+E EK E
Sbjct: 553 LQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKA 589
Score = 30.8 bits (70), Expect = 0.99
Identities = 13/36 (36%), Positives = 23/36 (63%)
Query: 238 LHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
L ++E+E K ++ + +K++ KKKKE E+ EK
Sbjct: 553 LQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 33.9 bits (78), Expect = 0.046
Identities = 9/33 (27%), Positives = 23/33 (69%)
Query: 245 EEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
+++K K+++++++ KK KK +K E++ E+
Sbjct: 86 WKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEA 118
Score = 33.5 bits (77), Expect = 0.055
Identities = 8/32 (25%), Positives = 24/32 (75%)
Query: 245 EEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
+++K K+++++++KK K +K+ E+E +++
Sbjct: 91 SKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKL 122
Score = 32.7 bits (75), Expect = 0.10
Identities = 12/34 (35%), Positives = 27/34 (79%)
Query: 244 EEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
+++++K K+++++++ K +KK EKE E+K E+L
Sbjct: 92 KKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDL 125
Score = 32.4 bits (74), Expect = 0.15
Identities = 17/34 (50%), Positives = 26/34 (76%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
E EE EK K+E EE++K K KKKK K++++K++
Sbjct: 66 ELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDK 99
Score = 32.4 bits (74), Expect = 0.16
Identities = 8/33 (24%), Positives = 23/33 (69%)
Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
++++ K K+++++++K K K ++++EKE
Sbjct: 86 WKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEA 118
Score = 31.6 bits (72), Expect = 0.31
Identities = 13/36 (36%), Positives = 26/36 (72%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
L +E E+ K + EE+++ K K KK K+K++++K++
Sbjct: 67 LAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKK 102
Score = 31.2 bits (71), Expect = 0.34
Identities = 10/35 (28%), Positives = 27/35 (77%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
+++E EE++ + ++++ KKK K K+K++++K++
Sbjct: 74 VKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDD 108
Score = 30.4 bits (69), Expect = 0.62
Identities = 10/31 (32%), Positives = 24/31 (77%)
Query: 245 EEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
EE++K K ++++ +KKK K K +K++++ ++
Sbjct: 79 EEKQKWKWKKKKSKKKKDKDKDKKDDKKDDK 109
Score = 30.4 bits (69), Expect = 0.67
Identities = 14/34 (41%), Positives = 25/34 (73%)
Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
+E EE K ++E EEK+K K KK+K +++K+++
Sbjct: 65 KELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKD 98
Score = 29.7 bits (67), Expect = 1.1
Identities = 8/34 (23%), Positives = 24/34 (70%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
++ + ++K +++++++K K KK+ + E+K+E
Sbjct: 82 QKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDE 115
Score = 29.7 bits (67), Expect = 1.3
Identities = 8/34 (23%), Positives = 26/34 (76%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
++++ ++K +++++++ KK K ++K+E+E E+
Sbjct: 87 KKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAED 120
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 34.9 bits (80), Expect = 0.052
Identities = 17/37 (45%), Positives = 29/37 (78%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLV 278
E+EE E+ +EEEEEEE+++ + ++E+ E+E+EE V
Sbjct: 440 EEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEV 476
Score = 33.7 bits (77), Expect = 0.11
Identities = 15/34 (44%), Positives = 28/34 (82%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
++ EEEE +EEEEEEE+++ ++++ +EEE ++E
Sbjct: 437 QESEEEESVEEEEEEEEEEEEEEQESEEEEGEDE 470
Score = 33.0 bits (75), Expect = 0.17
Identities = 18/34 (52%), Positives = 26/34 (76%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
E EEEE +EEEEEEE+++ +++ E+EE E EE
Sbjct: 438 ESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEE 471
Score = 32.6 bits (74), Expect = 0.25
Identities = 15/34 (44%), Positives = 27/34 (79%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
+EEE + +EEEEEEE+++ ++ +E+E E++EE
Sbjct: 439 SEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEE 472
Score = 32.2 bits (73), Expect = 0.36
Identities = 19/41 (46%), Positives = 28/41 (68%)
Query: 235 NLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
S QE EEE+ EEEEEEE+++ ++++E EEEE E+
Sbjct: 429 GESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGED 469
Score = 28.7 bits (64), Expect = 3.9
Identities = 16/34 (47%), Positives = 27/34 (79%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
EEEEE+ +EEEEEE++ + ++ +++EEEE+ E
Sbjct: 444 SVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVE 477
Score = 28.0 bits (62), Expect = 7.0
Identities = 13/34 (38%), Positives = 25/34 (73%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
E+EEEE++ +EEE E+E+++ + + + EE+ E
Sbjct: 454 EEEEEEQESEEEEGEDEEEEEEVEADNGSEEEME 487
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 33.9 bits (78), Expect = 0.053
Identities = 13/52 (25%), Positives = 31/52 (59%)
Query: 228 RNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
+ ++EEE E+ +E+++EEE++K ++++ ++E+E+ E L
Sbjct: 29 EERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLKS 80
Score = 33.1 bits (76), Expect = 0.085
Identities = 15/36 (41%), Positives = 26/36 (72%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
E+EE EE+ K EE+ E ++K +++ E+E E+K+E
Sbjct: 23 AEEEEREERKKLEEKREGERKEEEELEEEREKKKEE 58
Score = 33.1 bits (76), Expect = 0.094
Identities = 15/36 (41%), Positives = 25/36 (69%)
Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
E+E EE K EE+ E E+K+ ++ +E+ E++KEE
Sbjct: 23 AEEEEREERKKLEEKREGERKEEEELEEEREKKKEE 58
Score = 30.1 bits (68), Expect = 1.0
Identities = 12/34 (35%), Positives = 25/34 (73%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
E EEEE + +++ EE+ + + K+++E EEE +++
Sbjct: 22 EAEEEEREERKKLEEKREGERKEEEELEEEREKK 55
Score = 29.3 bits (66), Expect = 1.4
Identities = 15/37 (40%), Positives = 28/37 (75%)
Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
++E EEE+ +E ++ EEK++G++K+E+E EE+ E
Sbjct: 18 RQQREAEEEEREERKKLEEKREGERKEEEELEEEREK 54
Score = 29.3 bits (66), Expect = 1.5
Identities = 15/35 (42%), Positives = 25/35 (71%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
+Q +++ EEEE EE+KK ++K+E E +E+EE
Sbjct: 13 EKQARRQQREAEEEEREERKKLEEKREGERKEEEE 47
Score = 27.7 bits (62), Expect = 4.8
Identities = 13/37 (35%), Positives = 21/37 (56%)
Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
E++E EE+ ++E+EE EK K E+E +K
Sbjct: 58 EEERKEREEQARKEQEEYEKLKSSFVVEEEGTDKLSA 94
Score = 27.4 bits (61), Expect = 6.1
Identities = 15/52 (28%), Positives = 30/52 (57%)
Query: 228 RNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
E R+ +EE E+K +EEE +E +++ +K++E+ E+ K +V+
Sbjct: 34 LEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLKSSFVVE 85
Score = 27.0 bits (60), Expect = 8.2
Identities = 12/36 (33%), Positives = 26/36 (72%)
Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
E+ EE +K +E+ E E K++ + ++E+E++++EE
Sbjct: 24 EEEEREERKKLEEKREGERKEEEELEEEREKKKEEE 59
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 32.7 bits (75), Expect = 0.055
Identities = 11/27 (40%), Positives = 19/27 (70%)
Query: 251 KEEEEEEEKKKGKKKKEKEEEEKEENL 277
EEEEEE+++ ++++E+ EEE L
Sbjct: 74 AAEEEEEEEEEEEEEEEESEEEAMAGL 100
Score = 32.3 bits (74), Expect = 0.065
Identities = 9/25 (36%), Positives = 17/25 (68%)
Query: 252 EEEEEEEKKKGKKKKEKEEEEKEEN 276
EE+++ ++++E+EEEE EE
Sbjct: 71 AAAAAEEEEEEEEEEEEEEEESEEE 95
Score = 30.8 bits (70), Expect = 0.24
Identities = 9/23 (39%), Positives = 17/23 (73%)
Query: 250 GKEEEEEEEKKKGKKKKEKEEEE 272
EEEEE+++ ++++E+E EE
Sbjct: 72 AAAAEEEEEEEEEEEEEEEESEE 94
Score = 30.4 bits (69), Expect = 0.29
Identities = 11/25 (44%), Positives = 18/25 (72%)
Query: 247 EEKGKEEEEEEEKKKGKKKKEKEEE 271
EEEEEEE+++ ++++E EEE
Sbjct: 71 AAAAAEEEEEEEEEEEEEEEESEEE 95
Score = 30.0 bits (68), Expect = 0.47
Identities = 9/25 (36%), Positives = 18/25 (72%)
Query: 250 GKEEEEEEEKKKGKKKKEKEEEEKE 274
EEEE+++ ++++E+EE E+E
Sbjct: 71 AAAAAEEEEEEEEEEEEEEEESEEE 95
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 33.5 bits (77), Expect = 0.059
Identities = 18/37 (48%), Positives = 24/37 (64%), Gaps = 2/37 (5%)
Query: 241 LEQEEEEEKGKE--EEEEEEKKKGKKKKEKEEEEKEE 275
E EEEE+K K+ +E ++EKK+ K KKEK E K
Sbjct: 147 AEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGS 183
Score = 32.0 bits (73), Expect = 0.21
Identities = 15/35 (42%), Positives = 26/35 (74%)
Query: 245 EEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
EEEEK ++++++E KK+ K+KK+K+E+ E K
Sbjct: 150 EEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSK 184
Score = 31.6 bits (72), Expect = 0.30
Identities = 14/34 (41%), Positives = 24/34 (70%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
E E+E K E+E E +++ KK+K+K++E K+E
Sbjct: 133 ETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKE 166
Score = 30.8 bits (70), Expect = 0.47
Identities = 17/34 (50%), Positives = 25/34 (73%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
E+E + KE E EEE+KK KKKK++ ++EK+E
Sbjct: 136 EKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKE 169
Score = 30.5 bits (69), Expect = 0.68
Identities = 15/34 (44%), Positives = 25/34 (73%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
++ + + + E EEEEKK+ KKKKE ++E+KE+
Sbjct: 137 KETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEK 170
Score = 30.5 bits (69), Expect = 0.68
Identities = 15/33 (45%), Positives = 27/33 (81%)
Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
E+E E +EE++E++KKK KK++KE+++K+E
Sbjct: 143 VEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKE 175
Score = 29.3 bits (66), Expect = 1.6
Identities = 14/35 (40%), Positives = 27/35 (77%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
+ E+E + +EEE++E+KKK + KKEK+E++ ++
Sbjct: 141 AKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKE 175
Score = 28.9 bits (65), Expect = 2.2
Identities = 12/33 (36%), Positives = 16/33 (48%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
E G E E E++ K +KE E EE+E
Sbjct: 121 PDGPPSELGSESETSEKETTAKVEKEAEVEEEE 153
Score = 28.5 bits (64), Expect = 3.2
Identities = 13/34 (38%), Positives = 26/34 (76%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
+ E++ + EEEE+++KK KK+ +KE++EK++
Sbjct: 139 TTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKD 172
Score = 28.1 bits (63), Expect = 4.1
Identities = 11/32 (34%), Positives = 26/32 (81%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
E ++E+++ K+++E+ + KG KKK+K++++K
Sbjct: 162 EVKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193
Score = 27.8 bits (62), Expect = 5.2
Identities = 8/34 (23%), Positives = 27/34 (79%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
++++E +K K+E++++++K + K K++++K++
Sbjct: 158 KKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKK 191
Score = 27.0 bits (60), Expect = 8.2
Identities = 10/33 (30%), Positives = 17/33 (51%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
E + + E+E K K+ + +EEE+KE
Sbjct: 124 PPSELGSESETSEKETTAKVEKEAEVEEEEKKE 156
>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3. This family of
proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). SH3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of SH3, AAPs are retained in the ER.
Length = 196
Score = 33.5 bits (77), Expect = 0.063
Identities = 12/38 (31%), Positives = 19/38 (50%)
Query: 238 LHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
L Y E ++ + + EE E K++E E EK+E
Sbjct: 154 LWYAEWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKE 191
Score = 28.8 bits (65), Expect = 2.7
Identities = 12/28 (42%), Positives = 15/28 (53%)
Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKE 267
E+ E KEEE E +KK KKK+
Sbjct: 169 AAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 27.3 bits (61), Expect = 7.5
Identities = 10/29 (34%), Positives = 16/29 (55%)
Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEK 268
+ +E E +EEE E +K + KK+K
Sbjct: 168 FAAEEAEAAAAAKEEESAEGEKKESKKKK 196
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 33.2 bits (76), Expect = 0.066
Identities = 10/32 (31%), Positives = 20/32 (62%)
Query: 244 EEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
+EEEE +E +E E++ + K + +E+K+
Sbjct: 96 DEEEETEEESTDETEQEDPPETKTESKEKKKR 127
Score = 32.4 bits (74), Expect = 0.10
Identities = 10/32 (31%), Positives = 17/32 (53%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
E E + +E+++ E K K +KEK + E
Sbjct: 112 EDPPETKTESKEKKKREVPKPKTEKEKPKTEP 143
Score = 32.0 bits (73), Expect = 0.14
Identities = 14/43 (32%), Positives = 24/43 (55%)
Query: 233 KINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
KI L + EEEE + + +E E++ + K E +E++K E
Sbjct: 86 KIFRELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKRE 128
Score = 29.3 bits (66), Expect = 1.2
Identities = 12/34 (35%), Positives = 22/34 (64%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
++ E+E+ + + E +EKKK + K K E+EK +
Sbjct: 107 DETEQEDPPETKTESKEKKKREVPKPKTEKEKPK 140
Score = 29.0 bits (65), Expect = 1.9
Identities = 10/35 (28%), Positives = 20/35 (57%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
E+ EEE + E+E+ + K + K++K+ E +
Sbjct: 99 EETEEESTDETEQEDPPETKTESKEKKKREVPKPK 133
Score = 28.6 bits (64), Expect = 2.6
Identities = 12/34 (35%), Positives = 20/34 (58%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
E+E EEE E E+E+ + + KEK++ E +
Sbjct: 98 EEETEEESTDETEQEDPPETKTESKEKKKREVPK 131
>gnl|CDD|235229 PRK04156, gltX, glutamyl-tRNA synthetase; Provisional.
Length = 567
Score = 34.1 bits (79), Expect = 0.070
Identities = 16/38 (42%), Positives = 19/38 (50%), Gaps = 1/38 (2%)
Query: 242 EQEEE-EEKGKEEEEEEEKKKGKKKKEKEEEEKEENLV 278
EQ E EE E EEEE+KK +KK E+ V
Sbjct: 65 EQRERLEELAPELLEEEEEKKEEKKGLPPLPNAEKGKV 102
>gnl|CDD|239572 cd03490, Topoisomer_IB_N_1, Topoisomer_IB_N_1: A subgroup of the
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB. Topo IB proteins include the
monomeric yeast and human topo I and heterodimeric topo
I from Leishmania donvanni. Topo I enzymes are divided
into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topos I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I have putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 217
Score = 33.3 bits (76), Expect = 0.076
Identities = 17/48 (35%), Positives = 31/48 (64%), Gaps = 3/48 (6%)
Query: 223 DNYLLRN-EARKINLSL--HYLEQEEEEEKGKEEEEEEEKKKGKKKKE 267
D+ +R + + SL ++LE+E+E++K +EE+E KKK + K+E
Sbjct: 72 DHKFIRRCKLSDADFSLIKNHLEEEKEKKKNLNKEEKEAKKKERAKRE 119
Score = 29.1 bits (65), Expect = 1.9
Identities = 13/22 (59%), Positives = 17/22 (77%)
Query: 254 EEEEEKKKGKKKKEKEEEEKEE 275
EEE+EKKK K+EKE ++KE
Sbjct: 94 EEEKEKKKNLNKEEKEAKKKER 115
>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1. All
proteins in this family for which functions are known
are cyclin dependent protein kinases that are components
of TFIIH, a complex that is involved in nucleotide
excision repair and transcription initiation. Also known
as MAT1 (menage a trois 1). This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 309
Score = 33.6 bits (77), Expect = 0.083
Identities = 11/30 (36%), Positives = 21/30 (70%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEE 270
E+EEEE++ ++EEE+++ K+K K+
Sbjct: 153 FEKEEEEQRRLLLQKEEEEQQMNKRKNKQA 182
Score = 29.0 bits (65), Expect = 2.3
Identities = 15/32 (46%), Positives = 24/32 (75%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
EQEE EE + E+EEEE+++ +KE+EE++
Sbjct: 143 EQEELEEALEFEKEEEEQRRLLLQKEEEEQQM 174
>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain. DBINO is a DNA-binding
domain found on global transcription activator SNF2L1
proteins and chromatin re-modelling proteins.
Length = 140
Score = 32.6 bits (75), Expect = 0.088
Identities = 13/35 (37%), Positives = 19/35 (54%), Gaps = 6/35 (17%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
E+EE E + + E+E E+ K KEEE +E
Sbjct: 82 EKEERELRKRAEKEALEQAK------KEEELREAK 110
Score = 28.8 bits (65), Expect = 1.5
Identities = 10/23 (43%), Positives = 15/23 (65%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKK 264
E+E E+ KEEE E K++ +K
Sbjct: 93 EKEALEQAKKEEELREAKRQQRK 115
Score = 27.2 bits (61), Expect = 5.8
Identities = 13/42 (30%), Positives = 22/42 (52%), Gaps = 2/42 (4%)
Query: 224 NYLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKK 265
+ +NE K L ++E E+ K+EEE E K+ ++K
Sbjct: 76 LFWKKNE--KEERELRKRAEKEALEQAKKEEELREAKRQQRK 115
>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
(TAF4) is one of several TAFs that bind TBP and is
involved in forming Transcription Factor IID (TFIID)
complex. The TATA Binding Protein (TBP) Associated
Factor 4 (TAF4) is one of several TAFs that bind TBP and
are involved in forming the Transcription Factor IID
(TFIID) complex. TFIID is one of seven General
Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
TFIIF, and TFIID) that are involved in accurate
initiation of transcription by RNA polymerase II in
eukaryote. TFIID plays an important role in the
recognition of promoter DNA and assembly of the
pre-initiation complex. TFIID complex is composed of the
TBP and at least 13 TAFs. TAFs from various species were
originally named by their predicted molecular weight or
their electrophoretic mobility in polyacrylamide gels. A
new, unified nomenclature for the pol II TAFs has been
suggested to show the relationship between TAF orthologs
and paralogs. Several hypotheses are proposed for TAFs
functions such as serving as activator-binding sites,
core-promoter recognition or a role in essential
catalytic activity. Each TAF, with the help of a
specific activator, is required only for the expression
of subset of genes and is not universally involved for
transcription as are GTFs. In yeast and human cells,
TAFs have been found as components of other complexes
besides TFIID. Several TAFs interact via histone-fold
(HFD) motifs; HFD is the interaction motif involved in
heterodimerization of the core histones and their
assembly into nucleosome octamers. The minimal HFD
contains three alpha-helices linked by two loops and is
found in core histones, TAFS and many other
transcription factors. TFIID has a histone octamer-like
substructure. TAF4 domain interacts with TAF12 and makes
a novel histone-like heterodimer that binds DNA and has
a core promoter function of a subset of genes.
Length = 212
Score = 33.1 bits (76), Expect = 0.092
Identities = 22/65 (33%), Positives = 34/65 (52%), Gaps = 15/65 (23%)
Query: 223 DNYLLRNEARKINLSLHYLEQEEEEEKGKEEEEE------------EEKKKGKKKKEKEE 270
+ Y + ++ RK L +LEQ E EE+ K +EEE E+ + +K KE ++
Sbjct: 104 ERYEITSDVRK---QLRFLEQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQK 160
Query: 271 EEKEE 275
EE EE
Sbjct: 161 EEDEE 165
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 32.3 bits (74), Expect = 0.100
Identities = 13/34 (38%), Positives = 25/34 (73%)
Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
Y E +EK K+E E+EE ++ +++K++++EEK
Sbjct: 49 YERLELMDEKWKKETEDEEFQQKREEKKRKDEEK 82
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 32.6 bits (75), Expect = 0.10
Identities = 11/35 (31%), Positives = 22/35 (62%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
E+EE E + +EEE+ ++ K+ + + E++ EN
Sbjct: 105 EEEEVEVEELDEEEQIDELLEKELAKLKREKRREN 139
Score = 31.9 bits (73), Expect = 0.20
Identities = 14/39 (35%), Positives = 26/39 (66%), Gaps = 1/39 (2%)
Query: 242 EQEEEEEKGKEEEE-EEEKKKGKKKKEKEEEEKEENLVK 279
E EE +E+ + +E E+E K K++K +E E K++ ++K
Sbjct: 110 EVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILK 148
Score = 30.7 bits (70), Expect = 0.47
Identities = 11/33 (33%), Positives = 18/33 (54%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
E E EE +E+ +E +K+ K K ++ E E
Sbjct: 108 EVEVEELDEEEQIDELLEKELAKLKREKRRENE 140
Score = 30.3 bits (69), Expect = 0.56
Identities = 15/42 (35%), Positives = 24/42 (57%), Gaps = 7/42 (16%)
Query: 242 EQEEEEEKGKEEEEEEE-------KKKGKKKKEKEEEEKEEN 276
E+E E E+ EEE+ +E K K +K++E E ++KE
Sbjct: 106 EEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEIL 147
Score = 29.9 bits (68), Expect = 0.71
Identities = 12/52 (23%), Positives = 27/52 (51%), Gaps = 5/52 (9%)
Query: 228 RNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
R + RK ++E+E+ +EEE E E+ +++ ++ E++ L +
Sbjct: 87 RKKVRK-----LLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKR 133
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 34.0 bits (77), Expect = 0.11
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 5/44 (11%)
Query: 241 LEQEEEEEKGKEEE---EEEEKKKGK--KKKEKEEEEKEENLVK 279
++ EE+EK E E EE KK + KKKE EE++K E L K
Sbjct: 1680 AKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKK 1723
Score = 33.2 bits (75), Expect = 0.17
Identities = 17/51 (33%), Positives = 28/51 (54%)
Query: 229 NEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
N+ + + E +++ E+ K+ EE+E+K KKE EE +K E L K
Sbjct: 1659 NKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKK 1709
Score = 32.8 bits (74), Expect = 0.22
Identities = 19/53 (35%), Positives = 27/53 (50%), Gaps = 3/53 (5%)
Query: 230 EARKINLSLHYLEQEEE---EEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
E +K L E EE+ EE K EEE + K + KK +E+++K E K
Sbjct: 1630 EEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKK 1682
Score = 32.4 bits (73), Expect = 0.27
Identities = 18/36 (50%), Positives = 25/36 (69%), Gaps = 3/36 (8%)
Query: 243 QEEEEEKGKEEE---EEEEKKKGKKKKEKEEEEKEE 275
++ EE K K EE EEEKKK ++ K+KE EEK++
Sbjct: 1613 KKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKK 1648
Score = 32.4 bits (73), Expect = 0.32
Identities = 19/52 (36%), Positives = 30/52 (57%), Gaps = 3/52 (5%)
Query: 227 LRNEARKINLSLHYLEQEEEEEKGKEEE---EEEEKKKGKKKKEKEEEEKEE 275
L+ + + ++E EE+K K EE +EEEKKK K++EE++ EE
Sbjct: 1721 LKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEE 1772
Score = 31.3 bits (70), Expect = 0.66
Identities = 12/37 (32%), Positives = 20/37 (54%)
Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
+ E E E E EEK + +KK++E ++K + K
Sbjct: 1349 KAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKK 1385
Score = 31.3 bits (70), Expect = 0.80
Identities = 19/50 (38%), Positives = 30/50 (60%)
Query: 230 EARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
E KI E EE+++K +E +++EE+KK +KEEE+K E + K
Sbjct: 1726 EENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRK 1775
Score = 29.7 bits (66), Expect = 2.2
Identities = 15/39 (38%), Positives = 24/39 (61%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
E EE+ + E+++EE KKK K+K EE+K+ + K
Sbjct: 1359 AEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAK 1397
Score = 29.3 bits (65), Expect = 2.7
Identities = 12/38 (31%), Positives = 21/38 (55%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
+ E ++K E ++ EE KK + K+ EE +K + K
Sbjct: 1504 KAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKK 1541
Score = 29.3 bits (65), Expect = 2.8
Identities = 20/59 (33%), Positives = 31/59 (52%), Gaps = 5/59 (8%)
Query: 226 LLRNEARKINLSLHYLEQEEEEEKGKEEE---EEEEKKKGK--KKKEKEEEEKEENLVK 279
L + EA + + + EEE + EE EE+KKK + KK E++E++ E L K
Sbjct: 1638 LKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKK 1696
Score = 29.3 bits (65), Expect = 3.0
Identities = 14/38 (36%), Positives = 24/38 (63%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
E+++ EE K EEE + K + KK+ +E+++K E K
Sbjct: 1714 EKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKK 1751
Score = 29.0 bits (64), Expect = 3.6
Identities = 14/39 (35%), Positives = 24/39 (61%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
++ EE +K E ++ EE KK + K+ EE++K + L K
Sbjct: 1515 AKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKK 1553
Score = 28.6 bits (63), Expect = 4.5
Identities = 14/42 (33%), Positives = 22/42 (52%), Gaps = 3/42 (7%)
Query: 241 LEQEEEEEKGKEEE---EEEEKKKGKKKKEKEEEEKEENLVK 279
+++ EE K K +E E KKK + K+ EE +K + K
Sbjct: 1488 AKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKK 1529
Score = 28.6 bits (63), Expect = 4.9
Identities = 18/36 (50%), Positives = 24/36 (66%), Gaps = 3/36 (8%)
Query: 241 LEQEEEEEKGKEE---EEEEEKKKGKKKKEKEEEEK 273
L++E EE K EE +E EEKKK ++ K+ EEE K
Sbjct: 1694 LKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENK 1729
Score = 28.6 bits (63), Expect = 5.0
Identities = 14/50 (28%), Positives = 29/50 (58%)
Query: 230 EARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
EA+K + + E+ ++ ++ K++ EE +KK + KK E ++K + K
Sbjct: 1468 EAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKK 1517
Score = 28.6 bits (63), Expect = 5.6
Identities = 16/39 (41%), Positives = 26/39 (66%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
L++ EE++K E ++ EEKKK + K+K EE K+ + K
Sbjct: 1283 LKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAK 1321
Score = 28.2 bits (62), Expect = 5.7
Identities = 12/37 (32%), Positives = 20/37 (54%)
Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
+ E E +E E E+K +K+KEE +K+ + K
Sbjct: 1348 AKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAK 1384
Score = 28.2 bits (62), Expect = 6.5
Identities = 17/36 (47%), Positives = 25/36 (69%), Gaps = 3/36 (8%)
Query: 241 LEQEEEEEKGKEE---EEEEEKKKGKKKKEKEEEEK 273
L++ EEE+K E+ +E EEKKK ++ K+ EEE K
Sbjct: 1625 LKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENK 1660
Score = 28.2 bits (62), Expect = 7.3
Identities = 14/37 (37%), Positives = 27/37 (72%)
Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
++ EEEK K E+ ++++ + KKK E+ ++ +EEN +K
Sbjct: 1626 KKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIK 1662
Score = 27.8 bits (61), Expect = 8.2
Identities = 10/39 (25%), Positives = 17/39 (43%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
+++ EE K E + E + + E EE+ E K
Sbjct: 1334 AKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKK 1372
Score = 27.8 bits (61), Expect = 8.3
Identities = 14/35 (40%), Positives = 24/35 (68%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
++ EE++K E ++ EE KK ++KK+ EE +K E
Sbjct: 1539 AKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAE 1573
Score = 27.8 bits (61), Expect = 9.3
Identities = 13/43 (30%), Positives = 24/43 (55%), Gaps = 4/43 (9%)
Query: 241 LEQEEEEEKGKEEE----EEEEKKKGKKKKEKEEEEKEENLVK 279
+++ EE+K K +E +KK + KK+ EE++K + K
Sbjct: 1396 AKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKK 1438
>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
includes archaeal L12p, the protein that is functionally
equivalent to L7/L12 in bacteria and the P1 and P2
proteins in eukaryotes. L12p is homologous to P1 and P2
but is not homologous to bacterial L7/L12. It is located
in the L12 stalk, with proteins L10, L11, and 23S rRNA.
L12p is the only protein in the ribosome to occur as
multimers, always appearing as sets of dimers. Recent
data indicate that most archaeal species contain six
copies of L12p (three homodimers), while eukaryotes have
four copies (two heterodimers), and bacteria may have
four or six copies (two or three homodimers), depending
on the species. The organization of proteins within the
stalk has been characterized primarily in bacteria,
where L7/L12 forms either two or three homodimers and
each homodimer binds to the extended C-terminal helix of
L10. L7/L12 is attached to the ribosome through L10 and
is the only ribosomal protein that does not directly
interact with rRNA. Archaeal L12p is believed to
function in a similar fashion. However, hybrid ribosomes
containing the large subunit from E. coli with an
archaeal stalk are able to bind archaeal and eukaryotic
elongation factors but not bacterial elongation factors.
In several mesophilic and thermophilic archaeal species,
the binding of 23S rRNA to protein L11 and to the
L10/L12p pentameric complex was found to be
temperature-dependent and cooperative.
Length = 106
Score = 31.7 bits (72), Expect = 0.11
Identities = 14/27 (51%), Positives = 20/27 (74%)
Query: 245 EEEEKGKEEEEEEEKKKGKKKKEKEEE 271
+ K EE+EEEKKK ++K+E+EEE
Sbjct: 71 AAAAEEKAEEKEEEKKKEEEKEEEEEE 97
Score = 31.7 bits (72), Expect = 0.11
Identities = 15/27 (55%), Positives = 22/27 (81%)
Query: 251 KEEEEEEEKKKGKKKKEKEEEEKEENL 277
EE+ EEK++ KKK+E++EEE+EE L
Sbjct: 73 AAEEKAEEKEEEKKKEEEKEEEEEEAL 99
Score = 31.3 bits (71), Expect = 0.16
Identities = 14/36 (38%), Positives = 22/36 (61%), Gaps = 6/36 (16%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
EE+ E+ +EE+++EE +KE+EEEE L
Sbjct: 73 AAEEKAEEKEEEKKKEE------EKEEEEEEALAGL 102
Score = 27.8 bits (62), Expect = 2.4
Identities = 10/24 (41%), Positives = 19/24 (79%)
Query: 252 EEEEEEEKKKGKKKKEKEEEEKEE 275
EE+ ++K ++KK++EE+E+EE
Sbjct: 72 AAAEEKAEEKEEEKKKEEEKEEEE 95
Score = 27.5 bits (61), Expect = 3.4
Identities = 9/24 (37%), Positives = 18/24 (75%)
Query: 252 EEEEEEEKKKGKKKKEKEEEEKEE 275
EE+ ++ +++K+KEEE++EE
Sbjct: 71 AAAAEEKAEEKEEEKKKEEEKEEE 94
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 32.9 bits (75), Expect = 0.17
Identities = 16/31 (51%), Positives = 21/31 (67%)
Query: 244 EEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
E E EK EEE E+KK K++K KE+E K+
Sbjct: 5 ESEAEKKILTEEELERKKKKEEKAKEKELKK 35
Score = 32.6 bits (74), Expect = 0.29
Identities = 15/31 (48%), Positives = 21/31 (67%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEE 272
E E E++ EEE E +KKK +K KEKE ++
Sbjct: 5 ESEAEKKILTEEELERKKKKEEKAKEKELKK 35
Score = 28.7 bits (64), Expect = 4.2
Identities = 14/38 (36%), Positives = 22/38 (57%), Gaps = 3/38 (7%)
Query: 241 LEQEEEEEKGKEEE---EEEEKKKGKKKKEKEEEEKEE 275
L +EE E K K+EE E+E KK +KE + + + +
Sbjct: 13 LTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQ 50
>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the monomeric yeast and human topo I
and heterodimeric topo I from Leishmania donvanni. Topo
I enzymes are divided into: topo type IA (bacterial)
and type IB (eukaryotic). Topo I relaxes superhelical
tension in duplex DNA by creating a single-strand nick,
the broken strand can then rotate around the unbroken
strand to remove DNA supercoils and, the nick is
religated, liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit re-ligation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I play putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 215
Score = 31.9 bits (73), Expect = 0.23
Identities = 15/34 (44%), Positives = 22/34 (64%), Gaps = 3/34 (8%)
Query: 239 HYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEE 272
Y E+E+E++K +EEKK K++KEK EE
Sbjct: 92 QYFEEEKEKKKAM---SKEEKKAIKEEKEKLEEP 122
Score = 29.5 bits (67), Expect = 1.4
Identities = 15/26 (57%), Positives = 19/26 (73%), Gaps = 3/26 (11%)
Query: 253 EEEEEEKK---KGKKKKEKEEEEKEE 275
EEE+E+KK K +KK KEE+EK E
Sbjct: 95 EEEKEKKKAMSKEEKKAIKEEKEKLE 120
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 32.5 bits (74), Expect = 0.24
Identities = 14/34 (41%), Positives = 20/34 (58%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
++ E K K E E E+K K + KK+ EEE K +
Sbjct: 126 AKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAK 159
Score = 31.7 bits (72), Expect = 0.34
Identities = 12/39 (30%), Positives = 20/39 (51%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
+++ EEE + E +KK + KK+ E E K + K
Sbjct: 148 AKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAK 186
Score = 29.0 bits (65), Expect = 2.7
Identities = 9/35 (25%), Positives = 20/35 (57%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
LEQ+ EE + + E+ +K+ +++ E+ K+
Sbjct: 73 LEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQA 107
Score = 28.6 bits (64), Expect = 3.2
Identities = 9/35 (25%), Positives = 19/35 (54%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
++ E+ + KE E+ +K K+ E+ ++ EE
Sbjct: 83 QRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEK 117
Score = 27.9 bits (62), Expect = 5.6
Identities = 9/32 (28%), Positives = 17/32 (53%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
E E+ K ++ E+ K ++K+K+ EE
Sbjct: 93 ELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEA 124
Score = 27.9 bits (62), Expect = 5.8
Identities = 7/34 (20%), Positives = 19/34 (55%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
+ ++E + + E+ K+ + K+ EE++K+
Sbjct: 88 QARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQA 121
Score = 27.9 bits (62), Expect = 6.2
Identities = 9/34 (26%), Positives = 19/34 (55%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
EQ ++ + K+++ EE K K + + + E + E
Sbjct: 108 EQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAE 141
Score = 27.9 bits (62), Expect = 6.5
Identities = 12/34 (35%), Positives = 21/34 (61%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
+Q EE++K EE + ++ + K K E E E+K +
Sbjct: 112 KQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAK 145
>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain. This region covers
both the Brf homology II and III regions. This region is
involved in binding TATA binding protein.
Length = 95
Score = 30.3 bits (69), Expect = 0.24
Identities = 18/59 (30%), Positives = 33/59 (55%), Gaps = 6/59 (10%)
Query: 223 DNYLLRNEARKI------NLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
D+Y+L E K+ L+ YLE++EE+E ++ +E K KKK++ +++ E
Sbjct: 5 DSYILSEEEVKLKERIWDELNKDYLEEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEA 63
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 32.1 bits (73), Expect = 0.29
Identities = 10/34 (29%), Positives = 23/34 (67%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
+Q+ EE + K+ E+E K+ +K++ +E+K++
Sbjct: 86 QQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQ 119
Score = 30.2 bits (68), Expect = 1.1
Identities = 11/34 (32%), Positives = 21/34 (61%), Gaps = 1/34 (2%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
+Q E+E+ K+ E+E + ++KK+ EE K+
Sbjct: 95 KQAAEQERLKQLEKERLAAQ-EQKKQAEEAAKQA 127
Score = 29.4 bits (66), Expect = 2.3
Identities = 10/32 (31%), Positives = 14/32 (43%)
Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
+ E K K E E +K + KK+ E E
Sbjct: 162 KAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAA 193
Score = 27.8 bits (62), Expect = 6.6
Identities = 9/33 (27%), Positives = 19/33 (57%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
LE+E + ++++ EE K+ K+++ EE
Sbjct: 106 LEKERLAAQEQKKQAEEAAKQAALKQKQAEEAA 138
Score = 27.8 bits (62), Expect = 7.5
Identities = 10/35 (28%), Positives = 19/35 (54%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
EQE ++ KE +E+KK ++ K+ K++
Sbjct: 99 EQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQ 133
>gnl|CDD|240577 cd12950, RRP7_Rrp7p, RRP7 domain ribosomal RNA-processing protein 7
(Rrp7p) and similar proteins. This CD corresponds to
the RRP7 domain of Rrp7p. Rrp7p is encoded by YCL031C
gene from Saccharomyces cerevisiae. It is an essential
yeast protein involved in pre-rRNA processing and
ribosome assembly. Rrp7p contains an N-terminal RNA
recognition motif (RRM), also termed RBD (RNA binding
domain) or RNP (ribonucleoprotein domain), and a
C-terminal RRP7 domain.
Length = 128
Score = 30.7 bits (70), Expect = 0.30
Identities = 19/40 (47%), Positives = 25/40 (62%), Gaps = 8/40 (20%)
Query: 244 EEEEEKGKEEEEEEEKKKGKKKKEKE--------EEEKEE 275
EE K EEE++E++KK KKKKE E E++KEE
Sbjct: 67 GEEAGKAAEEEKKEKEKKKKKKKELEDFYRFQLREKKKEE 106
>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase. Grn1 (yeast) and
GNL3L (human) are putative GTPases which are required
for growth and play a role in processing of nucleolar
pre-rRNA. This family contains a potential nuclear
localisation signal.
Length = 80
Score = 29.9 bits (68), Expect = 0.31
Identities = 14/33 (42%), Positives = 22/33 (66%), Gaps = 1/33 (3%)
Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
+E EE+K K+EEE+E +K+ +K E+ E K
Sbjct: 49 EEIEEKKRKQEEEKERRKE-ARKAERAEARKRG 80
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal. This
domain is found to the N-terminus of bacterial signal
peptidases of the S49 family (pfam01343).
Length = 154
Score = 31.0 bits (71), Expect = 0.31
Identities = 18/55 (32%), Positives = 27/55 (49%), Gaps = 5/55 (9%)
Query: 220 SLSDNY-LLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
L++ Y L+ L L+ E+ EK + E+ K K +KKK K+EE K
Sbjct: 46 DLNEEYKDLKESLEAALLDKKELKAWEKAEK----KAEKAKAKAEKKKAKKEEPK 96
>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
Prothymosin alpha and parathymosin are two ubiquitous
small acidic nuclear proteins that are thought to be
involved in cell cycle progression, proliferation, and
cell differentiation.
Length = 106
Score = 30.3 bits (68), Expect = 0.33
Identities = 15/38 (39%), Positives = 27/38 (71%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
+ E EEE+ +E++EEE+ +G++++ +EEEE E K
Sbjct: 48 DDEMEEEEEVDEDDEEEEGEGEEEEGEEEEETEGATGK 85
Score = 27.6 bits (61), Expect = 2.7
Identities = 13/34 (38%), Positives = 24/34 (70%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
QE ++E +EEE +E+ ++ + + E+EE E+EE
Sbjct: 44 AQEGDDEMEEEEEVDEDDEEEEGEGEEEEGEEEE 77
>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
Length = 259
Score = 31.6 bits (72), Expect = 0.37
Identities = 11/34 (32%), Positives = 21/34 (61%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
+ E+E+E E E+ KK ++ + K+++E EE
Sbjct: 73 KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEE 106
Score = 27.4 bits (61), Expect = 7.8
Identities = 9/32 (28%), Positives = 18/32 (56%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
++ E+ +++E+ + KKKKE EE +
Sbjct: 78 KELSASSLEAEQAKKKEEAEAKKKKEMEELKA 109
>gnl|CDD|216546 pfam01516, Orbi_VP6, Orbivirus helicase VP6. The VP6 protein a
minor protein in the core of the virion is probably the
viral helicase.
Length = 322
Score = 31.6 bits (71), Expect = 0.37
Identities = 16/64 (25%), Positives = 35/64 (54%), Gaps = 2/64 (3%)
Query: 212 LAGSEQLFSLSDNYLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEE 271
LA + + S+ L+ +INL + E +E++ K+E+ +E++ + ++K++
Sbjct: 2 LAPGDVIKRSSEE--LKQRQIQINLVDWFESGESKEKEPKDEDGQEQRISDGEGEQKQKG 59
Query: 272 EKEE 275
KEE
Sbjct: 60 GKEE 63
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 31.5 bits (72), Expect = 0.40
Identities = 12/33 (36%), Positives = 19/33 (57%)
Query: 244 EEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
E+EEEK + ++ +E + +E EEE EE
Sbjct: 133 EDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEE 165
Score = 30.4 bits (69), Expect = 0.89
Identities = 10/38 (26%), Positives = 24/38 (63%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
E EEE+++ ++ +E+ ++ ++ E+E E+EE +
Sbjct: 133 EDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAE 170
Score = 29.2 bits (66), Expect = 2.6
Identities = 14/35 (40%), Positives = 20/35 (57%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
E ++E E E+EEE+ + KK KE +EE E
Sbjct: 122 ESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEE 156
Score = 28.1 bits (63), Expect = 5.9
Identities = 11/35 (31%), Positives = 20/35 (57%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
++E E ++EEE++E K K+ EE +E+
Sbjct: 124 DKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDE 158
Score = 27.7 bits (62), Expect = 6.8
Identities = 7/29 (24%), Positives = 19/29 (65%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEE 270
+ +E+ ++ EE+EEE + ++ + ++E
Sbjct: 144 KAKEDSDEELSEEDEEEAAEEEEAEAEKE 172
Score = 27.7 bits (62), Expect = 7.7
Identities = 12/34 (35%), Positives = 19/34 (55%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
E E + + +EE++E KK + E+ EE EE
Sbjct: 126 EIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEE 159
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 31.7 bits (72), Expect = 0.41
Identities = 16/38 (42%), Positives = 24/38 (63%), Gaps = 4/38 (10%)
Query: 242 EQEEEEEKGKEE----EEEEEKKKGKKKKEKEEEEKEE 275
EE E K KEE EE E+K+K + ++KE+ +K+E
Sbjct: 394 ASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDE 431
Score = 31.7 bits (72), Expect = 0.42
Identities = 12/34 (35%), Positives = 24/34 (70%), Gaps = 1/34 (2%)
Query: 242 EQEEEEEKGK-EEEEEEEKKKGKKKKEKEEEEKE 274
E + +EEK K EE E+++K++ + KEK ++++
Sbjct: 399 EAKAKEEKLKQEENEKKQKEQADEDKEKRQKDER 432
Score = 30.9 bits (70), Expect = 0.69
Identities = 12/47 (25%), Positives = 25/47 (53%)
Query: 230 EARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
++ N L ++ E K + + EE + K K++K K+EE +++
Sbjct: 370 LLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQ 416
Score = 29.0 bits (65), Expect = 3.1
Identities = 9/36 (25%), Positives = 24/36 (66%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
++ E + + E + +EEK K ++ ++K++E+ +E+
Sbjct: 388 VKDETDASEEAEAKAKEEKLKQEENEKKQKEQADED 423
Score = 28.6 bits (64), Expect = 4.4
Identities = 11/34 (32%), Positives = 21/34 (61%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
E ++E EE E + K+ K K+E+ E++++E
Sbjct: 385 LGEVKDETDASEEAEAKAKEEKLKQEENEKKQKE 418
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 31.9 bits (72), Expect = 0.42
Identities = 10/34 (29%), Positives = 19/34 (55%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
+E E++ E + E ++ + +E EEE+ EE
Sbjct: 297 PEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEE 330
Score = 30.3 bits (68), Expect = 1.2
Identities = 13/28 (46%), Positives = 15/28 (53%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKE 269
+ EE EE+ EEE KK K KK K
Sbjct: 319 DSEESEEEKNEEEGGLSKKGKKLKKLKG 346
Score = 29.9 bits (67), Expect = 1.6
Identities = 10/35 (28%), Positives = 19/35 (54%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
+ EE + K E K + ++ ++ EE E+E+N
Sbjct: 294 GNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKN 328
Score = 29.2 bits (65), Expect = 2.7
Identities = 7/33 (21%), Positives = 18/33 (54%)
Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
E K + E++E+ ++ ++K E+E ++
Sbjct: 305 SPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKK 337
Score = 28.8 bits (64), Expect = 3.9
Identities = 14/38 (36%), Positives = 19/38 (50%), Gaps = 5/38 (13%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKK-----KKEKEEEEKE 274
E EE E+ K EEE KKGKK K+ ++ +
Sbjct: 318 EDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDD 355
Score = 28.4 bits (63), Expect = 5.0
Identities = 12/26 (46%), Positives = 18/26 (69%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKE 267
EQ+E+ E+ +EE+ EEE KK K+
Sbjct: 315 EQDEDSEESEEEKNEEEGGLSKKGKK 340
Score = 28.4 bits (63), Expect = 5.1
Identities = 10/38 (26%), Positives = 19/38 (50%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
+ E E+ EE EEEK + + K+ ++ ++ K
Sbjct: 310 AKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGK 347
Score = 28.0 bits (62), Expect = 6.2
Identities = 5/34 (14%), Positives = 19/34 (55%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
E + E ++E+ EE +++ +++ ++ ++
Sbjct: 307 EIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKK 340
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 31.2 bits (71), Expect = 0.43
Identities = 12/25 (48%), Positives = 15/25 (60%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKK 266
+ EEE EK + EE +KKK K K
Sbjct: 69 DDEEEGEKELQREERLKKKKRVKTK 93
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 31.2 bits (71), Expect = 0.46
Identities = 16/39 (41%), Positives = 20/39 (51%), Gaps = 3/39 (7%)
Query: 244 EEEEEKGK---EEEEEEEKKKGKKKKEKEEEEKEENLVK 279
EEE E K EEE ++KKK KKKK K+ +
Sbjct: 166 EEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEA 204
>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM). This
family consists of several Plasmodium falciparum SPAM
(secreted polymorphic antigen associated with
merozoites) proteins. Variation among SPAM alleles is
the result of deletions and amino acid substitutions in
non-repetitive sequences within and flanking the alanine
heptad-repeat domain. Heptad repeats in which the a and
d position contain hydrophobic residues generate
amphipathic alpha-helices which give rise to helical
bundles or coiled-coil structures in proteins. SPAM is
an example of a P. falciparum antigen in which a
repetitive sequence has features characteristic of a
well-defined structural element.
Length = 164
Score = 30.6 bits (69), Expect = 0.47
Identities = 14/35 (40%), Positives = 21/35 (60%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
E E+EE+ E+EEEEE+ + K+ E+K N
Sbjct: 66 EDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKKNIN 100
Score = 29.1 bits (65), Expect = 1.5
Identities = 14/35 (40%), Positives = 29/35 (82%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
E+EE+EE+ +E E+ E++++ + +E+EEE++E+N
Sbjct: 54 EEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDN 88
>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein. This family consists of
several eukaryotic synaphin 1 and 2 proteins.
Synaphin/complexin is a cytosolic protein that
preferentially binds to syntaxin within the SNARE
complex. Synaphin promotes SNAREs to form precomplexes
that oligomerise into higher order structures. A peptide
from the central, syntaxin binding domain of synaphin
competitively inhibits these two proteins from
interacting and prevents SNARE complexes from
oligomerising. It is thought that oligomerisation of
SNARE complexes into a higher order structure creates a
SNARE scaffold for efficient, regulated fusion of
synaptic vesicles. Synaphin promotes neuronal exocytosis
by promoting interaction between the complementary
syntaxin and synaptobrevin transmembrane regions that
reside in opposing membranes prior to fusion.
Length = 139
Score = 30.6 bits (69), Expect = 0.47
Identities = 13/32 (40%), Positives = 20/32 (62%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
+EE+EE + E EE++K K +K +EE E
Sbjct: 32 AEEEDEEIQEALREAEEERKAKHRKMEEEREV 63
>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB proteins similar to the
monomeric yeast and human topo I. Topo I enzymes are
divided into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. This family may represent more than
one structural domain.
Length = 215
Score = 31.2 bits (71), Expect = 0.47
Identities = 15/34 (44%), Positives = 21/34 (61%), Gaps = 3/34 (8%)
Query: 239 HYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEE 272
Y + ++EE+K +EEKK K +KEK EEE
Sbjct: 92 AYFKAQKEEKKAM---SKEEKKAIKAEKEKLEEE 122
Score = 28.8 bits (65), Expect = 2.4
Identities = 12/26 (46%), Positives = 17/26 (65%), Gaps = 3/26 (11%)
Query: 253 EEEEEEKK---KGKKKKEKEEEEKEE 275
+ ++EEKK K +KK K E+EK E
Sbjct: 95 KAQKEEKKAMSKEEKKAIKAEKEKLE 120
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 31.6 bits (72), Expect = 0.47
Identities = 16/33 (48%), Positives = 22/33 (66%)
Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
++ E K + EE+ KKK KK+K+KEEE K E
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKKEKKKEEEYKRE 762
Score = 30.1 bits (68), Expect = 1.8
Identities = 13/37 (35%), Positives = 24/37 (64%)
Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
+ ++ E + EE+ +KKK K+KK++EE ++EE
Sbjct: 728 WNGKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEK 764
Score = 28.5 bits (64), Expect = 5.1
Identities = 10/33 (30%), Positives = 16/33 (48%)
Query: 245 EEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
+EK EE+ E K +K K++ + E L
Sbjct: 1551 SNQEKNIEEDYAESDIKKRKNKKQYKSNTEAEL 1583
Score = 28.1 bits (63), Expect = 6.6
Identities = 12/24 (50%), Positives = 19/24 (79%), Gaps = 1/24 (4%)
Query: 252 EEEEEEEKKKGKKKKEKEE-EEKE 274
EE+ +++KKK KKK+E+ + EEK
Sbjct: 742 EEKTKKKKKKEKKKEEEYKREEKA 765
Score = 27.8 bits (62), Expect = 7.6
Identities = 14/49 (28%), Positives = 27/49 (55%), Gaps = 2/49 (4%)
Query: 233 KINLSLHYLEQEEEEEKGKEEEEEEEKKKG--KKKKEKEEEEKEENLVK 279
K ++ + QEE+E G+ E E +++KKG + +E+ EE+ +
Sbjct: 1515 KKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAE 1563
Score = 27.8 bits (62), Expect = 7.7
Identities = 13/48 (27%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 223 DNYLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEE 270
N+ ++ KI S E+ ++++K ++++EEE K++ K + E E
Sbjct: 726 RNWNGKDAEFKI--SDSVEEKTKKKKKKEKKKEEEYKREEKARIEIAE 771
>gnl|CDD|197773 smart00527, HMG17, domain in high mobilty group proteins HMG14 and
HMG 17.
Length = 88
Score = 29.5 bits (66), Expect = 0.47
Identities = 14/29 (48%), Positives = 16/29 (55%)
Query: 249 KGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
K K E + KKG K K+ EE KEEN
Sbjct: 42 KKKSENVAAKGKKGAKGKQTEEAGKEENN 70
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 31.4 bits (72), Expect = 0.51
Identities = 13/50 (26%), Positives = 21/50 (42%), Gaps = 9/50 (18%)
Query: 227 LRNEARKINLSLHYLEQ--EEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
LR + RK E+ E+EE + +++ E K K E +K
Sbjct: 414 LRKKQRK-------AEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKV 456
Score = 29.9 bits (68), Expect = 1.8
Identities = 22/51 (43%), Positives = 28/51 (54%), Gaps = 5/51 (9%)
Query: 230 EARKINLSLH--YLEQEEEEEKGK-EEEEEEEKKKGKKKKEKEE--EEKEE 275
A +I L LH L E EEE+G+ E+KK +KK+ K E EKEE
Sbjct: 379 GAIEIYLKLHDKPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEE 429
>gnl|CDD|218258 pfam04774, HABP4_PAI-RBP1, Hyaluronan / mRNA binding family. This
family includes the HABP4 family of hyaluronan-binding
proteins, and the PAI-1 mRNA-binding protein, PAI-RBP1.
HABP4 has been observed to bind hyaluronan (a
glucosaminoglycan), but it is not known whether this is
its primary role in vivo. It has also been observed to
bind RNA, but with a lower affinity than that for
hyaluronan. PAI-1 mRNA-binding protein specifically
binds the mRNA of type-1 plasminogen activator inhibitor
(PAI-1), and is thought to be involved in regulation of
mRNA stability. However, in both cases, the sequence
motifs predicted to be important for ligand binding are
not conserved throughout the family, so it is not known
whether members of this family share a common function.
Length = 106
Score = 29.7 bits (67), Expect = 0.51
Identities = 13/33 (39%), Positives = 22/33 (66%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
E+ E EEK EEE ++ ++++ +EEE+KE
Sbjct: 46 EKNEVEEKQAVEEEANKEGVVEEEEVEEEEDKE 78
Score = 28.9 bits (65), Expect = 1.1
Identities = 18/37 (48%), Positives = 23/37 (62%), Gaps = 3/37 (8%)
Query: 242 EQEEEEEKGKEEE---EEEEKKKGKKKKEKEEEEKEE 275
EQ EEEEK + EE EEE K +E+E EE+E+
Sbjct: 40 EQGEEEEKNEVEEKQAVEEEANKEGVVEEEEVEEEED 76
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 31.6 bits (72), Expect = 0.52
Identities = 16/38 (42%), Positives = 27/38 (71%), Gaps = 2/38 (5%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEE--KEEN 276
L + EEEE+ +E E +E+ +G++++E+E EE KEE
Sbjct: 167 LREGEEEEEEEEVGEADEEDEGEEEEEEEPEEVPKEEE 204
Score = 30.8 bits (70), Expect = 0.71
Identities = 14/37 (37%), Positives = 24/37 (64%), Gaps = 5/37 (13%)
Query: 239 HYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
++++EE+ +E+EEEEE +++E EE E EE
Sbjct: 341 WVNPEQKDEEEEQEDEEEEE-----EEEEPEEPEPEE 372
Score = 27.3 bits (61), Expect = 9.2
Identities = 10/21 (47%), Positives = 17/21 (80%)
Query: 242 EQEEEEEKGKEEEEEEEKKKG 262
+++EEEE+ +EE EE E ++G
Sbjct: 353 QEDEEEEEEEEEPEEPEPEEG 373
Score = 27.3 bits (61), Expect = 9.6
Identities = 13/32 (40%), Positives = 19/32 (59%), Gaps = 8/32 (25%)
Query: 245 EEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
E+K +EEE+E+E E+EEEE+E
Sbjct: 343 NPEQKDEEEEQEDE--------EEEEEEEEPE 366
>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein. This
family includes archaebacterial L12, eukaryotic P0, P1
and P2.
Length = 88
Score = 29.1 bits (66), Expect = 0.56
Identities = 7/21 (33%), Positives = 14/21 (66%)
Query: 255 EEEEKKKGKKKKEKEEEEKEE 275
+ +KK+E+EEEE+++
Sbjct: 61 AAAAAAEEEKKEEEEEEEEDD 81
Score = 28.4 bits (64), Expect = 1.1
Identities = 10/17 (58%), Positives = 11/17 (64%)
Query: 242 EQEEEEEKGKEEEEEEE 258
EE+ KEEEEEEE
Sbjct: 62 AAAAAEEEKKEEEEEEE 78
Score = 28.4 bits (64), Expect = 1.4
Identities = 5/20 (25%), Positives = 13/20 (65%)
Query: 253 EEEEEEKKKGKKKKEKEEEE 272
+++ K+++E+EEE+
Sbjct: 61 AAAAAAEEEKKEEEEEEEED 80
Score = 27.6 bits (62), Expect = 2.3
Identities = 11/17 (64%), Positives = 12/17 (70%)
Query: 242 EQEEEEEKGKEEEEEEE 258
EEEK +EEEEEEE
Sbjct: 63 AAAAEEEKKEEEEEEEE 79
Score = 27.2 bits (61), Expect = 3.0
Identities = 7/25 (28%), Positives = 13/25 (52%)
Query: 250 GKEEEEEEEKKKGKKKKEKEEEEKE 274
++ KK++E+EEEE +
Sbjct: 57 AAAAAAAAAAEEEKKEEEEEEEEDD 81
Score = 27.2 bits (61), Expect = 3.5
Identities = 10/15 (66%), Positives = 13/15 (86%)
Query: 244 EEEEEKGKEEEEEEE 258
EEE+K +EEEEEE+
Sbjct: 66 AEEEKKEEEEEEEED 80
Score = 26.4 bits (59), Expect = 5.0
Identities = 8/17 (47%), Positives = 14/17 (82%)
Query: 242 EQEEEEEKGKEEEEEEE 258
EEE+++ +EEEEE++
Sbjct: 65 AAEEEKKEEEEEEEEDD 81
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 31.6 bits (71), Expect = 0.58
Identities = 13/34 (38%), Positives = 25/34 (73%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
++EE EK K E E++ +++ +++KEKE+E + E
Sbjct: 580 KREEAVEKAKREAEQKAREEREREKEKEKERERE 613
Score = 30.8 bits (69), Expect = 0.98
Identities = 11/29 (37%), Positives = 22/29 (75%)
Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEE 271
+ E E+K +EE E E++K+ ++++E+E E
Sbjct: 589 KREAEQKAREEREREKEKEKERERERERE 617
Score = 28.9 bits (64), Expect = 3.4
Identities = 13/40 (32%), Positives = 26/40 (65%)
Query: 236 LSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
L+ L ++ EE K + E E+K + ++++EKE+E++ E
Sbjct: 572 LASSKLAKKREEAVEKAKREAEQKAREEREREKEKEKERE 611
>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
This family contains a number of defence peptides
secreted from the skin of amphibians, including the
opiate-like dermorphins and deltorphins, and the
antimicrobial dermoseptins and temporins. The alignment
for this family includes the signal peptide.
Length = 46
Score = 28.1 bits (63), Expect = 0.60
Identities = 13/31 (41%), Positives = 23/31 (74%)
Query: 234 INLSLHYLEQEEEEEKGKEEEEEEEKKKGKK 264
++LSL E+ E+EE+ ++EEE EE+ + K+
Sbjct: 16 VSLSLCEEEKREDEEENEDEEEGEEQSEVKR 46
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 31.5 bits (71), Expect = 0.60
Identities = 13/54 (24%), Positives = 24/54 (44%), Gaps = 2/54 (3%)
Query: 226 LLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
L+ L + E + +E K +E+ ++ + + KEK E E+E L
Sbjct: 179 LIEETENLAELIIDLEELKLQELK--LKEQAKKALEYYQLKEKLELEEENLLYL 230
Score = 30.3 bits (68), Expect = 1.2
Identities = 16/64 (25%), Positives = 32/64 (50%)
Query: 216 EQLFSLSDNYLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
E++ L + E + + E+E + KE +EEE++KK ++++ K ++EE
Sbjct: 238 ERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEE 297
Query: 276 NLVK 279
L
Sbjct: 298 ELKS 301
Score = 30.3 bits (68), Expect = 1.5
Identities = 24/64 (37%), Positives = 36/64 (56%)
Query: 216 EQLFSLSDNYLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
E+ L +YL NE R L +++EE E K+E E+EE+ + KE +EEEKE+
Sbjct: 223 EEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEK 282
Query: 276 NLVK 279
L +
Sbjct: 283 KLQE 286
Score = 29.2 bits (65), Expect = 2.9
Identities = 17/35 (48%), Positives = 27/35 (77%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
LE+E ++EK + EE E+E K+ + K+E EEEE+E+
Sbjct: 327 LEKELKKEKEEIEELEKELKELEIKREAEEEEEEQ 361
Score = 28.4 bits (63), Expect = 5.3
Identities = 18/39 (46%), Positives = 28/39 (71%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
LE+E +E + K E EEEE+++ +K +EK E+ +EE L K
Sbjct: 341 LEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAK 379
Score = 28.4 bits (63), Expect = 5.3
Identities = 20/49 (40%), Positives = 27/49 (55%)
Query: 227 LRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
E K LSL E EEEEK ++ + EEEK++ K +E+E EE
Sbjct: 761 EEEEEEKSELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEE 809
Score = 28.4 bits (63), Expect = 5.8
Identities = 13/55 (23%), Positives = 21/55 (38%)
Query: 225 YLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
LL K + E+EE ++ +EE E EE+E+ N +
Sbjct: 941 LLLEEADEKEKEEDNKEEEEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNKDE 995
>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6. The surfeit locus
protein SURF-6 is shown to be a component of the
nucleolar matrix and has a strong binding capacity for
nucleic acids.
Length = 206
Score = 30.7 bits (70), Expect = 0.61
Identities = 8/33 (24%), Positives = 23/33 (69%)
Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
+ E++ ++ E+ + +KK K+K+ K++E+ ++
Sbjct: 5 EALLEQRRRKREQRKARKKQKRKEAKKKEDAQK 37
Score = 27.7 bits (62), Expect = 5.6
Identities = 11/34 (32%), Positives = 20/34 (58%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
+ E+ + K+++ +E KKK +K + EE K E
Sbjct: 13 RKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNE 46
Score = 26.9 bits (60), Expect = 9.4
Identities = 10/37 (27%), Positives = 24/37 (64%)
Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
LEQ + + ++ +++++K+ KKK++ ++ E EE
Sbjct: 7 LLEQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEV 43
>gnl|CDD|237028 PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed.
Length = 648
Score = 31.3 bits (72), Expect = 0.62
Identities = 8/32 (25%), Positives = 18/32 (56%)
Query: 244 EEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
+ EEE +E+ E + ++++++ EK E
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542
Score = 28.2 bits (64), Expect = 5.2
Identities = 10/32 (31%), Positives = 19/32 (59%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEE 272
++ EEE KE+ E K+ ++K++K E+
Sbjct: 510 IDVEEEIAYIKEQMEGSAPKEPEEKEKKPEKP 541
Score = 27.5 bits (62), Expect = 9.7
Identities = 8/31 (25%), Positives = 17/31 (54%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEE 271
+E+E K + E ++ + K+KK ++ E
Sbjct: 512 VEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 31.0 bits (71), Expect = 0.64
Identities = 10/30 (33%), Positives = 18/30 (60%)
Query: 246 EEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
EE K + + ++++E+EEEE+EE
Sbjct: 286 EELKEVLSAQAQAAAAEEEEEEEEEEEEEE 315
Score = 31.0 bits (71), Expect = 0.71
Identities = 10/31 (32%), Positives = 19/31 (61%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEE 272
E + + EEEEE+++ ++++E EEE
Sbjct: 290 EVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320
Score = 30.2 bits (69), Expect = 1.2
Identities = 9/34 (26%), Positives = 21/34 (61%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
E+ +E + + E+++ ++++E+EEE EE
Sbjct: 286 EELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEE 319
Score = 29.4 bits (67), Expect = 1.7
Identities = 8/32 (25%), Positives = 19/32 (59%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEE 272
L++ + EEEE+++ ++++E+ EE
Sbjct: 288 LKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEE 319
Score = 28.7 bits (65), Expect = 3.1
Identities = 9/35 (25%), Positives = 20/35 (57%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
E +E + EE+++ ++++E+EE +EE
Sbjct: 286 EELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 30.5 bits (69), Expect = 0.68
Identities = 16/38 (42%), Positives = 24/38 (63%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
E+E++ EEE EEE ++ + EKE EEK E+ V+
Sbjct: 82 GDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVE 119
Score = 30.1 bits (68), Expect = 0.84
Identities = 13/33 (39%), Positives = 22/33 (66%)
Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
+EEE+E E++E++ K+ +E EEE +EE
Sbjct: 66 KEEEKEAANSEDKEDKGDAEKEDEESEEENEEE 98
Score = 30.1 bits (68), Expect = 1.1
Identities = 13/35 (37%), Positives = 22/35 (62%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
E EE +E+ KE E+K+ ++++EE +EEN
Sbjct: 61 EIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEEN 95
Score = 28.9 bits (65), Expect = 2.2
Identities = 11/35 (31%), Positives = 23/35 (65%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
++E++ + KE+EE EE+ + + ++ +E EKE
Sbjct: 77 DKEDKGDAEKEDEESEEENEEEDEESSDENEKETE 111
Score = 28.9 bits (65), Expect = 2.6
Identities = 11/35 (31%), Positives = 23/35 (65%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
E+E + KE++ + EK+ + ++E EEE++E +
Sbjct: 69 EKEAANSEDKEDKGDAEKEDEESEEENEEEDEESS 103
Score = 28.5 bits (64), Expect = 3.3
Identities = 13/35 (37%), Positives = 21/35 (60%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
++ E E+ KEEE+E + K+ K E+E EE+
Sbjct: 57 QETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEES 91
>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740). This
family consists of several uncharacterized plant
proteins of unknown function.
Length = 565
Score = 31.2 bits (70), Expect = 0.70
Identities = 17/35 (48%), Positives = 23/35 (65%), Gaps = 2/35 (5%)
Query: 244 EEEEEKGKEEEEEEEKKKGKK--KKEKEEEEKEEN 276
EEEEE EE+EE +K+ K ++ EEEE+EE
Sbjct: 141 EEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEEL 175
Score = 30.4 bits (68), Expect = 1.2
Identities = 19/36 (52%), Positives = 25/36 (69%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
+EEEE + +E+EE EK+ GK EK EEE+EE L
Sbjct: 140 LEEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEEL 175
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 30.8 bits (70), Expect = 0.73
Identities = 11/36 (30%), Positives = 20/36 (55%)
Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
L++E E+E E++ +E ++ +KK EK E
Sbjct: 198 LLQEEYEKEVKAEKKRQELERVEEKKLEKMAPEASR 233
Score = 28.1 bits (63), Expect = 5.6
Identities = 10/33 (30%), Positives = 14/33 (42%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
+ E E + E E K K+K K + KE
Sbjct: 254 DDESAWEGFESEYEPINKPVRPKRKTKAQRNKE 286
>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are approximately 90 amino acids in length.
There are two completely conserved L residues that may
be functionally important.
Length = 70
Score = 28.5 bits (64), Expect = 0.74
Identities = 12/28 (42%), Positives = 18/28 (64%)
Query: 248 EKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
K KE + EEEK++ +++ K EE KE
Sbjct: 24 AKKKELKAEEEKREEEEEARKREERKER 51
Score = 28.1 bits (63), Expect = 1.1
Identities = 17/37 (45%), Positives = 24/37 (64%), Gaps = 4/37 (10%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKE---EEEKEE 275
E + EEEK +EEEEE K++ +K++EK EE E
Sbjct: 28 ELKAEEEK-REEEEEARKREERKEREKNKSFEELLNE 63
Score = 26.1 bits (58), Expect = 5.5
Identities = 13/35 (37%), Positives = 25/35 (71%)
Query: 238 LHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEE 272
L L+ +++E K +EE+ EEE++ K+++ KE E+
Sbjct: 19 LAKLKAKKKELKAEEEKREEEEEARKREERKEREK 53
>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3. This family
of proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). Shr3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of Shr3, AAPs are retained in the ER.
Length = 196
Score = 30.4 bits (69), Expect = 0.77
Identities = 19/109 (17%), Positives = 38/109 (34%), Gaps = 28/109 (25%)
Query: 168 AVQLIVTGNENKVTAVTKMNAQSSRSHTICTIYLGAMAKLHLVDLAGSEQLFSLSDNYLL 227
++ GN +V + ++ S+TI L L G L+
Sbjct: 112 GLRSASAGNWGEVDRNQGLAVLAA-SNTIL-----------LFVLVG----------VLV 149
Query: 228 RNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
+ L Y E+++ ++K + EE ++ K+ +K E
Sbjct: 150 ------LQAGLWYAERKDAKQKEEFAAEERKEALAAAAKKSATPQKVET 192
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 30.0 bits (68), Expect = 0.80
Identities = 12/25 (48%), Positives = 17/25 (68%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKK 266
E+E++EE E+E EEE+ KK K
Sbjct: 122 EEEDDEEDDDEDESEEEESPVKKVK 146
Score = 29.6 bits (67), Expect = 0.99
Identities = 9/27 (33%), Positives = 20/27 (74%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEK 268
E++EEEE +E+++E+E ++ + +K
Sbjct: 118 EEDEEEEDDEEDDDEDESEEEESPVKK 144
Score = 27.3 bits (61), Expect = 5.3
Identities = 16/42 (38%), Positives = 26/42 (61%), Gaps = 2/42 (4%)
Query: 238 LHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
H + EE+E EE+EEEE ++ +++E E+EE+ VK
Sbjct: 104 QHLVASEEDESDDDEEDEEEED--DEEDDDEDESEEEESPVK 143
Score = 26.9 bits (60), Expect = 9.1
Identities = 10/25 (40%), Positives = 18/25 (72%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKK 266
E+EE++E+ +E+E EE++ KK
Sbjct: 121 EEEEDDEEDDDEDESEEEESPVKKV 145
>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
binding fragment. Topoisomerase I promotes the
relaxation of DNA superhelical tension by introducing a
transient single-stranded break in duplex DNA and are
vital for the processes of replication, transcription,
and recombination. This family may be more than one
structural domain.
Length = 215
Score = 30.2 bits (69), Expect = 0.83
Identities = 14/33 (42%), Positives = 21/33 (63%), Gaps = 3/33 (9%)
Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEE 272
Y E E+E++K +EE KK K++K+K EE
Sbjct: 94 YFEAEKEKKKAMSKEE---KKAIKEEKDKLEEP 123
>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
[Translation, ribosomal structure and biogenesis].
Length = 109
Score = 29.3 bits (66), Expect = 0.88
Identities = 8/35 (22%), Positives = 17/35 (48%)
Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
+ E +E ++ +K++E EEE ++ L
Sbjct: 70 AAAAAAGAEAAAEADEAEEEEKEEEAEEESDDDML 104
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of
the RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 30.5 bits (69), Expect = 0.91
Identities = 10/34 (29%), Positives = 20/34 (58%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
E E+EEE+ + +E EEE+ + +++ + E
Sbjct: 372 EDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGS 405
Score = 28.9 bits (65), Expect = 3.0
Identities = 12/35 (34%), Positives = 25/35 (71%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
L+ + EE ++E+EEEE++ + ++E+ E+ +EE
Sbjct: 362 LDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEE 396
Score = 28.5 bits (64), Expect = 4.1
Identities = 8/34 (23%), Positives = 23/34 (67%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
E+E+E+ ++ +E E+++G+ +E+ + +E+
Sbjct: 370 VDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSRED 403
Score = 28.5 bits (64), Expect = 4.2
Identities = 9/34 (26%), Positives = 20/34 (58%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
E+E+ ++ +EEE E+ +++G + +E E
Sbjct: 377 EEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSS 410
Score = 27.8 bits (62), Expect = 6.4
Identities = 10/34 (29%), Positives = 19/34 (55%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
++EEE+ + EEEE E + + + +E+ E
Sbjct: 375 DEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSES 408
Score = 27.8 bits (62), Expect = 8.3
Identities = 8/36 (22%), Positives = 22/36 (61%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
E +E+E++ +E+ +E +++ + E+E + E+
Sbjct: 369 EVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDG 404
Score = 27.4 bits (61), Expect = 8.4
Identities = 7/38 (18%), Positives = 15/38 (39%)
Query: 239 HYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
H E+ E+ E+ + E+ + + E K +
Sbjct: 385 HEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADK 422
Score = 27.4 bits (61), Expect = 9.8
Identities = 10/35 (28%), Positives = 17/35 (48%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
E EEEE + EEE + ++ G + + E+
Sbjct: 384 EHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSES 418
>gnl|CDD|218688 pfam05680, ATP-synt_E, ATP synthase E chain. This family consists
of several ATP synthase E chain sequences which are
components of the CF(0) subunit.
Length = 83
Score = 28.6 bits (64), Expect = 1.0
Identities = 10/33 (30%), Positives = 19/33 (57%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
E++ E + +E+ E+ K KKK+ ++E K
Sbjct: 36 EEKIREYEAQEKLIEKAKAAYAKKKQAKQEAKS 68
>gnl|CDD|221122 pfam11490, DNA_pol3_alph_N, DNA polymerase III polC-type
N-terminus. This is an N-terminal domain of DNA
polymerase III polC subunit A that is found only in
Firmicutes. DNA polymerase polC-type III enzyme
functions as the 'replicase' in low G + C Gram-positive
bacteria. Purine asymmetry is a characteristic of
organisms with a heterodimeric DNA polymerase III
alpha-subunit constituted by polC which probably plays a
direct role in the maintenance of strand-biased gene
distribution; since, among prokaryotic genomes, the
distribution of genes on the leading and lagging strands
of the replication fork is known to be biased. The
domain is associated with DNA_pol3_alpha pfam07733.
Length = 180
Score = 29.6 bits (67), Expect = 1.1
Identities = 15/33 (45%), Positives = 22/33 (66%)
Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEE 272
+ EQ+EEEE EE E KK + +K+K+E+E
Sbjct: 148 FEEQKEEEEAKLAEEALEALKKKEAEKKKKEKE 180
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 30.7 bits (70), Expect = 1.1
Identities = 14/34 (41%), Positives = 25/34 (73%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
++E+E+ +++EEEEE++K KKKK+ E + E
Sbjct: 357 DEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSE 390
Score = 30.0 bits (68), Expect = 1.7
Identities = 13/33 (39%), Positives = 21/33 (63%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
+++ ++E +EEEEEE++KK KK E E
Sbjct: 359 EDEDSDDEDDEEEEEEEKEKKKKKSAESTRSEL 391
Score = 28.8 bits (65), Expect = 3.7
Identities = 15/36 (41%), Positives = 24/36 (66%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
E EE+E+ E++EEEE+++ +KKK+K E L
Sbjct: 356 EDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSEL 391
Score = 27.7 bits (62), Expect = 9.2
Identities = 12/35 (34%), Positives = 23/35 (65%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
E EEEEE G ++E+EE+ +++E++ + +E
Sbjct: 320 EEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDE 354
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 30.8 bits (69), Expect = 1.1
Identities = 11/33 (33%), Positives = 25/33 (75%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
+++E+++ +++EEEEE+++ K + E+EE E
Sbjct: 159 DEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDE 191
Score = 29.6 bits (66), Expect = 2.4
Identities = 12/34 (35%), Positives = 26/34 (76%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
E E+E++ +E++EEEE+++ + K +E+E++E
Sbjct: 158 EDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDE 191
Score = 28.5 bits (63), Expect = 5.3
Identities = 15/34 (44%), Positives = 25/34 (73%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
E +++EE +EEEEEEE+ KG +++E+E E+
Sbjct: 162 EDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGED 195
>gnl|CDD|224023 COG1098, VacB, Predicted RNA binding protein (contains ribosomal
protein S1 domain) [Translation, ribosomal structure and
biogenesis].
Length = 129
Score = 29.2 bits (66), Expect = 1.1
Identities = 15/51 (29%), Positives = 27/51 (52%), Gaps = 1/51 (1%)
Query: 228 RNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKG-KKKKEKEEEEKEENL 277
+E KI+LS+ LE+E E++ K + K+G + + + EE+L
Sbjct: 63 IDENGKISLSIRKLEEEPEKQHRKPRFSKSRPKEGFETLLSRLLKWIEESL 113
>gnl|CDD|217451 pfam03249, TSA, Type specific antigen. There are several antigenic
variants in Rickettsia tsutsugamushi, and a
type-specific antigen (TSA) of 56-kilodaltons located on
the rickettsial surface is responsible for the
variation. TSA proteins are probably integral membrane
proteins.
Length = 502
Score = 30.2 bits (68), Expect = 1.2
Identities = 20/65 (30%), Positives = 35/65 (53%), Gaps = 2/65 (3%)
Query: 212 LAGSEQLFSL-SDNYLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEE 270
L G+EQ+ L D L+ A I ++ L ++EE+ G + + +KK+G +K KE
Sbjct: 337 LNGNEQIIQLYKDLVKLQRHA-GIKKAMEKLAAQQEEDAGNQGGGDCKKKQGASEKSKEG 395
Query: 271 EEKEE 275
+ +E
Sbjct: 396 GKGKE 400
>gnl|CDD|223630 COG0556, UvrB, Helicase subunit of the DNA excision repair complex
[DNA replication, recombination, and repair].
Length = 663
Score = 30.3 bits (69), Expect = 1.2
Identities = 20/66 (30%), Positives = 30/66 (45%), Gaps = 20/66 (30%)
Query: 52 INHMFNGINA-----TLLAYGQTGGGKTYTVSAMIMKTLQHVMQRCNKDDVYMSY----- 101
I + GI TLL G TG GKT+T++ +I K + + +++
Sbjct: 21 IAELVEGIENGLKHQTLL--GVTGSGKTFTMANVIAKV--------QRPTLVLAHNKTLA 70
Query: 102 LQLYSE 107
QLYSE
Sbjct: 71 AQLYSE 76
>gnl|CDD|214818 smart00784, SPT2, SPT2 chromatin protein. This entry includes the
Saccharomyces cerevisiae protein SPT2 which is a
chromatin protein involved in transcriptional
regulation.
Length = 106
Score = 28.9 bits (65), Expect = 1.2
Identities = 12/23 (52%), Positives = 14/23 (60%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGK 263
LE EEE KEEE E+ +K K
Sbjct: 84 LEDREEERLEKEEEREKRARKRK 106
Score = 28.1 bits (63), Expect = 1.8
Identities = 11/24 (45%), Positives = 16/24 (66%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKK 265
E+ EE+ E+EEE EK+ K+K
Sbjct: 83 RLEDREEERLEKEEEREKRARKRK 106
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 30.1 bits (68), Expect = 1.3
Identities = 16/62 (25%), Positives = 25/62 (40%), Gaps = 2/62 (3%)
Query: 210 VDLAGSEQLFSLSDNYL-LRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEK 268
V+L + Y A + +E E EE +E + +KKK KKK+
Sbjct: 380 VNLDRQLSELKEAIAYYESAKTALEKAEGKKAIE-EIREELIEEGLLKSKKKKRKKKEWF 438
Query: 269 EE 270
E+
Sbjct: 439 EK 440
Score = 30.1 bits (68), Expect = 1.3
Identities = 17/53 (32%), Positives = 26/53 (49%), Gaps = 3/53 (5%)
Query: 226 LLRNEARKINLSLHYLEQEEEE---EKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
L+ + LE+ E + E+ +EE EE K KKKK K++E E+
Sbjct: 388 ELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEK 440
>gnl|CDD|237631 PRK14162, PRK14162, heat shock protein GrpE; Provisional.
Length = 194
Score = 29.8 bits (67), Expect = 1.3
Identities = 12/35 (34%), Positives = 22/35 (62%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
E E++ +EE +E KK K+ KEE+++++N
Sbjct: 5 EFPSEKDLPQEETTDEAPKKEAKEAPKEEDQEKQN 39
>gnl|CDD|221244 pfam11818, DUF3340, C-terminal domain of tail specific protease
(DUF3340). This presumed domain is found at the
C-terminus of tail specific proteases. Its function is
unknown. This family is found in bacteria and
eukaryotes. This presumed domain is typically between 88
to 187 amino acids in length.
Length = 139
Score = 29.1 bits (66), Expect = 1.3
Identities = 9/38 (23%), Positives = 20/38 (52%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
E+EE+E+K + E E K G++ + ++ ++
Sbjct: 83 EREEDEKKRLKRENERRKALGEEPLKSLDDLPKDKEEP 120
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 29.7 bits (67), Expect = 1.3
Identities = 14/47 (29%), Positives = 30/47 (63%)
Query: 223 DNYLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKE 269
+N+L +A++ LE+++++E+ KE EE E +++ K+K +E
Sbjct: 90 ENWLSAKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEE 136
Score = 29.3 bits (66), Expect = 2.2
Identities = 13/34 (38%), Positives = 29/34 (85%), Gaps = 1/34 (2%)
Query: 243 QEEEEEKGKEEEEE-EEKKKGKKKKEKEEEEKEE 275
QE E +K K+++++ EE+++ ++KK++EEEE+++
Sbjct: 185 QEWELKKLKQQQQKREEERRKQRKKQQEEEERKQ 218
Score = 27.8 bits (62), Expect = 6.5
Identities = 13/40 (32%), Positives = 26/40 (65%)
Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
+L ++ + + K ++ EEK+K +++KE+EE E + L K
Sbjct: 92 WLSAKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAK 131
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 30.1 bits (69), Expect = 1.3
Identities = 17/39 (43%), Positives = 24/39 (61%), Gaps = 2/39 (5%)
Query: 241 LEQEEEEEKGKEEEEEE--EKKKGKKKKEKEEEEKEENL 277
LE +EE K + E E+E E++ +K EK +KEENL
Sbjct: 60 LEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENL 98
Score = 29.4 bits (67), Expect = 2.3
Identities = 11/49 (22%), Positives = 29/49 (59%)
Query: 227 LRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
L + + LE++E+E + K++E E+++++ ++ E++ +E E
Sbjct: 98 LDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELER 146
Score = 28.6 bits (65), Expect = 3.7
Identities = 24/68 (35%), Positives = 34/68 (50%), Gaps = 14/68 (20%)
Query: 216 EQLFSLSDNYLLRNEARKINLSLHYLEQEEE------EEKGKEEEEEEEKKKGKKKKEKE 269
E+ N L + E R L Q+EE E K EEE E+K+K ++K++E
Sbjct: 74 EKELRERRNELQKLEKR--------LLQKEENLDRKLELLEKREEELEKKEKELEQKQQE 125
Query: 270 EEEKEENL 277
E+KEE L
Sbjct: 126 LEKKEEEL 133
Score = 28.6 bits (65), Expect = 4.3
Identities = 21/54 (38%), Positives = 34/54 (62%), Gaps = 3/54 (5%)
Query: 228 RNEARKINLS--LHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
R ++ NL L LE+ EEE + K+E+E E+K++ +KKE+E EE E ++
Sbjct: 90 RLLQKEENLDRKLELLEKREEELE-KKEKELEQKQQELEKKEEELEELIEEQLQ 142
>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
(DUF874). This family consists of several hypothetical
proteins specific to Helicobacter pylori. The function
of this family is unknown.
Length = 417
Score = 30.3 bits (67), Expect = 1.3
Identities = 17/48 (35%), Positives = 28/48 (58%)
Query: 228 RNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
R+ A K + L EQ+ E+EK K E+E E + K E+E+++ E+
Sbjct: 141 RDRANKSGIELEQEEQKTEQEKQKTEKEGIELANSQIKAEQEKQKTEQ 188
>gnl|CDD|147685 pfam05663, DUF809, Protein of unknown function (DUF809). This
family consists of several proteins of unknown function
Raphanus sativus (Radish) and Brassica napus (Rape).
Length = 138
Score = 29.0 bits (64), Expect = 1.3
Identities = 18/35 (51%), Positives = 28/35 (80%), Gaps = 1/35 (2%)
Query: 241 LEQEEEEEKGK-EEEEEEEKKKGKKKKEKEEEEKE 274
+E +EE+++GK E E +EEKK+GK + E +EE+KE
Sbjct: 97 IEGKEEKKEGKGEIEGKEEKKEGKGEIEGKEEKKE 131
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 30.1 bits (68), Expect = 1.4
Identities = 11/33 (33%), Positives = 17/33 (51%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
EE+EE+ ++E EEE KK E + +
Sbjct: 113 LDEEQEERVEKEREEELAGDAMKKLENRTADSK 145
Score = 29.3 bits (66), Expect = 2.4
Identities = 11/34 (32%), Positives = 19/34 (55%)
Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
+E++EE+ +EEE+E K E EE+ +
Sbjct: 176 LFRREKKEEEEEEEEDEALIKSLSFGPETEEDRR 209
Score = 28.2 bits (63), Expect = 5.0
Identities = 16/42 (38%), Positives = 20/42 (47%), Gaps = 6/42 (14%)
Query: 242 EQEEEEEKGKEEEEEE------EKKKGKKKKEKEEEEKEENL 277
EQEE EK +EEE E + K+E E E+ E L
Sbjct: 116 EQEERVEKEREEELAGDAMKKLENRTADSKREMEVLERLEEL 157
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 30.1 bits (68), Expect = 1.4
Identities = 11/39 (28%), Positives = 24/39 (61%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
L + +EEE+ +E+E EE+++ + ++ +K E V+
Sbjct: 394 LSKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVE 432
>gnl|CDD|216991 pfam02357, NusG, Transcription termination factor nusG.
Length = 90
Score = 28.0 bits (63), Expect = 1.4
Identities = 14/45 (31%), Positives = 23/45 (51%)
Query: 225 YLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKE 269
Y+LR ++ + LE++ E EE E +K G+KKK +
Sbjct: 4 YVLRTKSGQEKKVAENLERQGIESFLPPEEVVEVRKNGRKKKVER 48
>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32. This family
consists of several mammalian specific proacrosin
binding protein sp32 sequences. sp32 is a sperm specific
protein which is known to bind with with 55- and 53-kDa
proacrosins and the 49-kDa acrosin intermediate. The
exact function of sp32 is unclear, it is thought however
that the binding of sp32 to proacrosin may be involved
in packaging the acrosin zymogen into the acrosomal
matrix.
Length = 243
Score = 29.6 bits (66), Expect = 1.5
Identities = 12/33 (36%), Positives = 22/33 (66%)
Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
QE+ K +E +E + ++ + ++E+EEEE EE
Sbjct: 199 QEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEE 231
Score = 28.9 bits (64), Expect = 2.4
Identities = 12/35 (34%), Positives = 24/35 (68%)
Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
YL++ + EEK +EE+EEE+ + + K+E+ + +
Sbjct: 209 YLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQGTDD 243
>gnl|CDD|235395 PRK05298, PRK05298, excinuclease ABC subunit B; Provisional.
Length = 652
Score = 30.0 bits (69), Expect = 1.5
Identities = 24/64 (37%), Positives = 29/64 (45%), Gaps = 26/64 (40%)
Query: 57 NGINA-----TLLAYGQTGGGKTYTVSAMIMKT------LQHVMQRCNKDDVYMSYL--Q 103
GI A TLL G TG GKT+T++ +I + L H NK L Q
Sbjct: 26 EGIEAGEKHQTLL--GVTGSGKTFTMANVIARLQRPTLVLAH-----NKT------LAAQ 72
Query: 104 LYSE 107
LYSE
Sbjct: 73 LYSE 76
>gnl|CDD|236277 PRK08493, PRK08493, NADH dehydrogenase subunit G; Validated.
Length = 819
Score = 30.1 bits (68), Expect = 1.5
Identities = 17/73 (23%), Positives = 33/73 (45%), Gaps = 2/73 (2%)
Query: 209 LVDLAGSEQLFSLSDNYLLRNEARKINLSLHYLEQEEEEEKGK--EEEEEEEKKKGKKKK 266
L+ E+ S ++ ++ LS+ +E+ E+ + E +EE KK+ KK
Sbjct: 438 LLKKFLEEEAILKSLEEFKQSIVKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKV 497
Query: 267 EKEEEEKEENLVK 279
+K E L++
Sbjct: 498 KKIPEVDTYLLLE 510
>gnl|CDD|219761 pfam08243, SPT2, SPT2 chromatin protein. This family includes the
Saccharomyces cerevisiae protein SPT2 which is a
chromatin protein involved in transcriptional
regulation.
Length = 116
Score = 28.7 bits (64), Expect = 1.5
Identities = 13/24 (54%), Positives = 16/24 (66%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKK 265
E+E E +EEEEE+ KKK K K
Sbjct: 93 RLEDERELAREEEEEKRKKKKKNK 116
Score = 26.8 bits (59), Expect = 6.0
Identities = 12/23 (52%), Positives = 16/23 (69%)
Query: 244 EEEEEKGKEEEEEEEKKKGKKKK 266
E+E + EEEEE++KK KK K
Sbjct: 94 LEDERELAREEEEEKRKKKKKNK 116
>gnl|CDD|224237 COG1318, COG1318, Predicted transcriptional regulators
[Transcription].
Length = 182
Score = 29.5 bits (66), Expect = 1.5
Identities = 15/46 (32%), Positives = 24/46 (52%)
Query: 227 LRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEE 272
L E KI ++ L++ + E ++E EEK KG+ K + EE
Sbjct: 114 LEKEGLKIRWAVEVLKKIKGEHFPMDKELLEEKLKGEVIKGEPLEE 159
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 29.6 bits (67), Expect = 1.6
Identities = 19/55 (34%), Positives = 28/55 (50%), Gaps = 6/55 (10%)
Query: 227 LRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEE--EEKEENLVK 279
L S +E E + EEE E +KK +K +EKE+ EE E+NL++
Sbjct: 292 LSGLKNSGEPS----LKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIE 342
>gnl|CDD|183737 PRK12773, flhB, flagellar biosynthesis protein FlhB; Reviewed.
Length = 646
Score = 30.0 bits (67), Expect = 1.7
Identities = 19/48 (39%), Positives = 22/48 (45%), Gaps = 2/48 (4%)
Query: 224 NYLLRNEAR--KINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKE 269
N L EA KI+L L E E E G E EE++KG K E
Sbjct: 276 NICLSTEAADFKIDLQLFAAEDEGRTEPGSERRRREEREKGNVPKSPE 323
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 29.3 bits (66), Expect = 1.7
Identities = 13/33 (39%), Positives = 21/33 (63%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
+ +EE+KGKEE+ E E+ + ++ E EE E
Sbjct: 21 NENKEEDKGKEEDLEFEEIEKEEIIEDSEESNE 53
Score = 29.3 bits (66), Expect = 1.9
Identities = 10/38 (26%), Positives = 21/38 (55%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
+EE++ K ++ E EE +K+ + +E E + +K
Sbjct: 23 NKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELK 60
Score = 28.6 bits (64), Expect = 3.1
Identities = 8/38 (21%), Positives = 21/38 (55%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
++ +EE+ EE E+EE + ++ + + E+ ++
Sbjct: 27 DKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDENN 64
Score = 28.6 bits (64), Expect = 3.2
Identities = 11/47 (23%), Positives = 27/47 (57%)
Query: 229 NEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
+A+ N+ ++ E +E+ K +EE+ E ++ +K++ E+ E+
Sbjct: 6 KDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESN 52
Score = 28.2 bits (63), Expect = 4.1
Identities = 12/38 (31%), Positives = 22/38 (57%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
++E E+ + E + E+ K + K KEE +K EN ++
Sbjct: 41 KEEIIEDSEESNEVKIEELKDENNKLKEENKKLENELE 78
Score = 28.2 bits (63), Expect = 4.3
Identities = 15/33 (45%), Positives = 19/33 (57%)
Query: 244 EEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
EE+ KE E +EE K ++ E EE EKEE
Sbjct: 12 NMEEDCCKENENKEEDKGKEEDLEFEEIEKEEI 44
Score = 27.8 bits (62), Expect = 5.9
Identities = 13/34 (38%), Positives = 19/34 (55%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
E EE + K EE ++E K K++ +K E E E
Sbjct: 46 EDSEESNEVKIEELKDENNKLKEENKKLENELEA 79
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 30.0 bits (68), Expect = 1.8
Identities = 14/36 (38%), Positives = 22/36 (61%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
LE EEE + K + ++ + +K+E+EEE EEN
Sbjct: 459 LEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEEN 494
Score = 27.3 bits (61), Expect = 9.9
Identities = 13/35 (37%), Positives = 22/35 (62%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
++EEE K ++ + K+ K +KE+EEEE +E
Sbjct: 458 ELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDE 492
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 29.6 bits (66), Expect = 1.8
Identities = 16/40 (40%), Positives = 24/40 (60%), Gaps = 6/40 (15%)
Query: 242 EQEEEEEKGKEEE------EEEEKKKGKKKKEKEEEEKEE 275
EEEE++ K+EE EEEEK++ K++ E+ E E
Sbjct: 229 VLEEEEQRRKQEEADRKSREEEEKRRLKEEIERRRAEAAE 268
Score = 29.6 bits (66), Expect = 1.9
Identities = 13/35 (37%), Positives = 20/35 (57%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
E+EE+EE +E EE EE + K ++K + E
Sbjct: 112 EEEEKEESREEREEVEETEGVTKSEQKNDWRDAEE 146
Score = 29.6 bits (66), Expect = 2.2
Identities = 16/39 (41%), Positives = 21/39 (53%), Gaps = 5/39 (12%)
Query: 242 EQEEEEEKGKEE-----EEEEEKKKGKKKKEKEEEEKEE 275
E EE E K E + EE +K +K+ E EEEEK +
Sbjct: 125 EVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKPK 163
Score = 27.3 bits (60), Expect = 9.8
Identities = 13/35 (37%), Positives = 23/35 (65%)
Query: 245 EEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
+E+ G E E EE++K + ++E+EE E+ E + K
Sbjct: 100 MQEDSGAENETVEEEEKEESREEREEVEETEGVTK 134
>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
glycoprotein C-terminal domain. The trypanosome
parasite expresses these proteins to evade the immune
response.
Length = 98
Score = 28.2 bits (63), Expect = 1.9
Identities = 9/40 (22%), Positives = 20/40 (50%)
Query: 237 SLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
+ + E +EKG + +++E+ K K K+ K ++
Sbjct: 2 NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAPV 41
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 28.4 bits (64), Expect = 1.9
Identities = 12/34 (35%), Positives = 20/34 (58%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
E+ + E+K +E++ E EK +EK E E E+
Sbjct: 20 EKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEK 53
Score = 28.4 bits (64), Expect = 2.3
Identities = 11/35 (31%), Positives = 18/35 (51%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
E E+ + EE+ E E +K +K+ E+ E E
Sbjct: 34 EAEKLAKMSAEEKAEYELEKLEKELEELEAELARR 68
Score = 28.0 bits (63), Expect = 2.6
Identities = 10/37 (27%), Positives = 21/37 (56%)
Query: 239 HYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
+ +E+ + + K+EE++ E +K K +E+ E E
Sbjct: 15 KAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYEL 51
Score = 27.2 bits (61), Expect = 5.8
Identities = 10/35 (28%), Positives = 20/35 (57%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
+ E++++ K+ E E+ K ++K + E EK E
Sbjct: 21 KAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLE 55
>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890). This
family is conserved in dsDNA adenoviruses of
vertebrates. The function is not known.
Length = 172
Score = 29.1 bits (65), Expect = 1.9
Identities = 15/56 (26%), Positives = 28/56 (50%), Gaps = 8/56 (14%)
Query: 228 RNEARKINLSLHYLEQEEEE--------EKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
+ A+K+ + + EEE+ E+ +EE E+ E ++ +E EE E+E
Sbjct: 4 KGNAKKLKVRPPPTKDEEEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVEEET 59
>gnl|CDD|200031 TIGR00586, mutt, mutator mutT protein. All proteins in this family
for which functions are known are involved in repairing
oxidative damage to dGTP (they are 8-oxo-dGTPases). This
family is based on the phylogenomic analysis of JA Eisen
(1999, Ph.D. Thesis, Stanford University). Lowering the
threshold picks up members of MutT superfamily well [DNA
metabolism, DNA replication, recombination, and repair].
Length = 128
Score = 28.7 bits (64), Expect = 1.9
Identities = 15/44 (34%), Positives = 18/44 (40%)
Query: 214 GSEQLFSLSDNYLLRNEARKINLSLHYLEQEEEEEKGKEEEEEE 257
G Q FS + R I L LE+ E GKE + EE
Sbjct: 59 GIPQHFSEFEKLEYEFYPRHITLWFWLLERWEGGPPGKEGQPEE 102
>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
Length = 1437
Score = 29.8 bits (68), Expect = 2.0
Identities = 12/58 (20%), Positives = 29/58 (50%)
Query: 219 FSLSDNYLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
F + +++++ ++EE+E+ KE E +K + +KKK+ + + +E
Sbjct: 156 FGILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEG 213
>gnl|CDD|184724 PRK14520, rpsP, 30S ribosomal protein S16; Provisional.
Length = 155
Score = 28.9 bits (65), Expect = 2.0
Identities = 7/34 (20%), Positives = 9/34 (26%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
E + G E KKK + E
Sbjct: 108 AALAEADGGPTAEATTPKKKKAAAEAAAAEAAAP 141
Score = 28.1 bits (63), Expect = 3.3
Identities = 9/36 (25%), Positives = 10/36 (27%)
Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
LE + E KKKK E E
Sbjct: 102 KLELFNAALAEADGGPTAEATTPKKKKAAAEAAAAE 137
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 29.7 bits (67), Expect = 2.0
Identities = 16/64 (25%), Positives = 33/64 (51%), Gaps = 2/64 (3%)
Query: 216 EQLFSLSDNYLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
++L L + + E + LE++ E+ + EE EE K+K + ++ KEE + E
Sbjct: 508 KELRELEEELIELLELEEALKEE--LEEKLEKLENLLEELEELKEKLQLQQLKEELRQLE 565
Query: 276 NLVK 279
+ ++
Sbjct: 566 DRLQ 569
Score = 28.6 bits (64), Expect = 4.5
Identities = 18/49 (36%), Positives = 25/49 (51%), Gaps = 3/49 (6%)
Query: 227 LRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
LR E ++ L LE+E E E EE K++ ++K EK E EE
Sbjct: 499 LREEIEELEKELRELEEELIE---LLELEEALKEELEEKLEKLENLLEE 544
Score = 28.2 bits (63), Expect = 6.5
Identities = 12/34 (35%), Positives = 19/34 (55%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
+E+ EE E+ EE EEE + +E EE ++
Sbjct: 290 IERLEELEREIEELEEELEGLRALLEELEELLEK 323
>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component. This
family is a region of the Myb-Related Cdc5p/Cef1
proteins, in fungi, and is part of the pre-mRNA splicing
factor complex.
Length = 363
Score = 29.3 bits (66), Expect = 2.0
Identities = 12/39 (30%), Positives = 22/39 (56%), Gaps = 3/39 (7%)
Query: 242 EQEEEEEKGKE---EEEEEEKKKGKKKKEKEEEEKEENL 277
E+EEEE + E EE+ ++ K+ E+ +E++E
Sbjct: 151 EEEEEEPEEMEEELEEDAADRDARKRAAEEAKEQEELRR 189
Score = 27.7 bits (62), Expect = 7.8
Identities = 14/35 (40%), Positives = 21/35 (60%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
L +EEEEE + EEE EE + +++ EE +E
Sbjct: 149 LPEEEEEEPEEMEEELEEDAADRDARKRAAEEAKE 183
>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812). This is
a family of fungal proteins whose function is not known.
Length = 126
Score = 28.4 bits (64), Expect = 2.2
Identities = 8/33 (24%), Positives = 21/33 (63%)
Query: 244 EEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
E + + +E++ +EE+ K + ++ KE E +++
Sbjct: 94 EAQRARDEEKKLDEEEAKRQHEEAKEREREKKK 126
Score = 26.9 bits (60), Expect = 6.6
Identities = 10/38 (26%), Positives = 21/38 (55%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
E +E E + +EE++ + + K++ EE ++ E K
Sbjct: 88 EIDERAEAQRARDEEKKLDEEEAKRQHEEAKEREREKK 125
Score = 26.5 bits (59), Expect = 9.3
Identities = 11/33 (33%), Positives = 23/33 (69%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
E + ++ K+ +EEE K++ ++ KE+E E+K+
Sbjct: 94 EAQRARDEEKKLDEEEAKRQHEEAKEREREKKK 126
>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
Length = 981
Score = 29.8 bits (67), Expect = 2.2
Identities = 17/66 (25%), Positives = 34/66 (51%)
Query: 210 VDLAGSEQLFSLSDNYLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKE 269
++++ S + LS L + ++ L L E +E+ EE ++E KK K+KK+
Sbjct: 154 LEMSPSTPVEKLSGWDTLSDSDQEAVLPLVKKSPSEAKEEKAEERKQESKKGAKRKKDAS 213
Query: 270 EEEKEE 275
++K +
Sbjct: 214 GDDKSK 219
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 29.7 bits (67), Expect = 2.2
Identities = 16/33 (48%), Positives = 24/33 (72%), Gaps = 3/33 (9%)
Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
+EEE+E EEEEEE++ +G K+ ++EE EE
Sbjct: 391 EEEEDE---EEEEEEDEDEGPSKEHSDDEEFEE 420
Score = 29.3 bits (66), Expect = 2.5
Identities = 13/36 (36%), Positives = 25/36 (69%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
+ EEEE++ +EEEE+E++ K+ + EE E+++
Sbjct: 388 RDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDV 423
Score = 28.9 bits (65), Expect = 3.8
Identities = 13/36 (36%), Positives = 23/36 (63%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
E+E+EEE+ +E+E+E K+ +E EE++ E
Sbjct: 391 EEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESK 426
>gnl|CDD|214692 smart00487, DEXDc, DEAD-like helicases superfamily.
Length = 201
Score = 29.0 bits (65), Expect = 2.2
Identities = 10/36 (27%), Positives = 17/36 (47%), Gaps = 1/36 (2%)
Query: 52 INHMFNGINATLLAYGQTGGGKTYTVSAMIMKTLQH 87
I + +G+ +LA TG GKT ++ L+
Sbjct: 17 IEALLSGLRDVILA-APTGSGKTLAALLPALEALKR 51
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 29.7 bits (66), Expect = 2.3
Identities = 11/47 (23%), Positives = 30/47 (63%), Gaps = 1/47 (2%)
Query: 228 RNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
++E + + ++ +EE+E+ + + +E++ GKK+KE+E+ ++
Sbjct: 1014 KHEIKDR-IVKERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRK 1059
>gnl|CDD|148491 pfam06898, YqfD, Putative stage IV sporulation protein YqfD. This
family consists of several putative bacterial stage IV
sporulation (SpoIV) proteins. YqfD of Bacillus subtilis
is known to be essential for efficient sporulation
although its exact function is unknown.
Length = 383
Score = 29.3 bits (66), Expect = 2.3
Identities = 15/37 (40%), Positives = 20/37 (54%)
Query: 233 KINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKE 269
I +Y Q++ + KEE E+ KK KKK EKE
Sbjct: 312 SIVRETYYEVQDKVVKLTKEEAVEKGKKLAKKKLEKE 348
>gnl|CDD|220654 pfam10254, Pacs-1, PACS-1 cytosolic sorting protein. PACS-1 is a
cytosolic sorting protein that directs the localisation
of membrane proteins in the trans-Golgi network
(TGN)/endosomal system. PACS-1 connects the clathrin
adaptor AP-1 to acidic cluster sorting motifs contained
in the cytoplasmic domain of cargo proteins such as
furin, the cation-independent mannose-6-phosphate
receptor and in viral proteins such as human
immunodeficiency virus type 1 Nef.
Length = 413
Score = 29.4 bits (66), Expect = 2.4
Identities = 12/26 (46%), Positives = 15/26 (57%), Gaps = 6/26 (23%)
Query: 256 EEEKKK------GKKKKEKEEEEKEE 275
+E+ KK GKK KEKE E K +
Sbjct: 332 KEKNKKVPVMFLGKKPKEKEVESKSQ 357
>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
Length = 746
Score = 29.4 bits (67), Expect = 2.4
Identities = 11/31 (35%), Positives = 18/31 (58%)
Query: 245 EEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
EEEE KEE + E +++ K + E+ + E
Sbjct: 39 EEEEARKEEAKREAEEEAKAEAEEAAAAEAE 69
>gnl|CDD|221466 pfam12220, U1snRNP70_N, U1 small nuclear ribonucleoprotein of 70kDa
MW N terminal. This domain is found in eukaryotes. This
domain is about 90 amino acids in length. This domain is
found associated with pfam00076. This domain is part of
U1 snRNP, which is the pre-mRNA binding protein of the
penta-snRNP spliceosome complex. It extends over a
distance of 180 A from its RNA binding domain, wraps
around the core domain of U1 snRNP consisting of the
seven Sm proteins and finally contacts U1-C, which is
crucial for 5'-splice-site recognition.
Length = 94
Score = 27.6 bits (62), Expect = 2.4
Identities = 8/33 (24%), Positives = 22/33 (66%)
Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
++ ++E E E +K+ ++K+EK+E+ +++
Sbjct: 48 KDYKDEPPPEPTETWLEKREREKREKKEKLEKK 80
Score = 27.2 bits (61), Expect = 3.1
Identities = 10/37 (27%), Positives = 22/37 (59%)
Query: 238 LHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
L + ++E + E EK++ +K+++KE+ EK+
Sbjct: 44 LSEFKDYKDEPPPEPTETWLEKREREKREKKEKLEKK 80
>gnl|CDD|220223 pfam09405, Btz, CASC3/Barentsz eIF4AIII binding. This domain is
found on CASC3 (cancer susceptibility candidate gene 3
protein) which is also known as Barentsz (Btz). CASC3 is
a component of the EJC (exon junction complex) which is
a complex that is involved in post-transcriptional
regulation of mRNA in metazoa. The complex is formed by
the association of four proteins (eIF4AIII, Barentsz,
Mago, and Y14), mRNA, and ATP. This domain wraps around
eIF4AIII and stacks against the 5' nucleotide.
Length = 116
Score = 28.2 bits (63), Expect = 2.5
Identities = 10/36 (27%), Positives = 22/36 (61%), Gaps = 1/36 (2%)
Query: 242 EQEEEEEKGKE-EEEEEEKKKGKKKKEKEEEEKEEN 276
E++ E E +E+K++ +++ EK ++E +EN
Sbjct: 5 ERQSGRTPSAEPTEPKEDKERKRREHEKYDDEDDEN 40
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 29.4 bits (66), Expect = 2.6
Identities = 14/53 (26%), Positives = 27/53 (50%)
Query: 227 LRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
L + + L E+ ++E + + EE +E ++ K + EKE +E + L K
Sbjct: 527 LEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKK 579
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 29.2 bits (66), Expect = 2.7
Identities = 12/30 (40%), Positives = 16/30 (53%)
Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEE 272
+E EEE E E E K K+++E EE
Sbjct: 233 EEREEETDVEIETTSETKGTKQEQEGSTEE 262
Score = 28.8 bits (65), Expect = 4.3
Identities = 12/37 (32%), Positives = 21/37 (56%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
E EE EE+ E E + KG K++++ E++ +L
Sbjct: 230 SETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSL 266
Score = 27.7 bits (62), Expect = 7.8
Identities = 12/30 (40%), Positives = 21/30 (70%)
Query: 244 EEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
+++E K + EE+ KK+ KK+K K EE++
Sbjct: 649 KKKEFKILDYTEEKTKKEEKKEKNKREEKE 678
Score = 27.7 bits (62), Expect = 8.0
Identities = 10/37 (27%), Positives = 19/37 (51%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
L++ E E+ +EE + E + + K K+E+E
Sbjct: 226 LKETSETEEREEETDVEIETTSETKGTKQEQEGSTEE 262
>gnl|CDD|219014 pfam06414, Zeta_toxin, Zeta toxin. This family consists of several
bacterial zeta toxin proteins. Zeta toxin is thought to
be part of a postregulational killing system in
bacteria. It relies on antitoxin/toxin systems that
secure stable inheritance of low and medium copy number
plasmids during cell division and kill cells that have
lost the plasmid.
Length = 191
Score = 28.4 bits (64), Expect = 2.8
Identities = 15/47 (31%), Positives = 22/47 (46%), Gaps = 2/47 (4%)
Query: 61 ATLLAYGQTGGGKTYTVSAMIMKTLQHVMQRCNKDDVYMSYLQLYSE 107
A LL GQ G GKT A++ + + R + D+ +Y Y E
Sbjct: 14 AVLLG-GQPGAGKTELARALLEELGGGNVVRIDPDE-LRTYHPDYDE 58
>gnl|CDD|238549 cd01129, PulE-GspE, PulE/GspE The type II secretory pathway is the
main terminal branch of the general secretory pathway
(GSP). It is responsible for the export the majority of
Gram-negative bacterial exoenzymes and toxins. PulE is a
cytoplasmic protein of the GSP, which contains an ATP
binding site and a tetracysteine motif. This subgroup
also includes PillB and HofB.
Length = 264
Score = 28.8 bits (65), Expect = 2.9
Identities = 18/62 (29%), Positives = 27/62 (43%), Gaps = 9/62 (14%)
Query: 23 FDHQVFIFD-NIFGPNDSNETIFTEVLVPLINHMFNGINATLLAYGQTGGGKTYTVSAMI 81
D + I D G N IF ++L +GI +L G TG GKT T+ + +
Sbjct: 50 LDKKNQILDLEKLGLKPENLEIFRKLL-----EKPHGI---ILVTGPTGSGKTTTLYSAL 101
Query: 82 MK 83
+
Sbjct: 102 SE 103
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 29.2 bits (65), Expect = 3.0
Identities = 14/53 (26%), Positives = 28/53 (52%)
Query: 223 DNYLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
D + N+ R+I + +++ +EE ++ + + K+ K EKE E+KE
Sbjct: 272 DKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKEL 324
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 29.0 bits (66), Expect = 3.0
Identities = 12/34 (35%), Positives = 15/34 (44%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
E EE E+EEEEE + E E E+
Sbjct: 186 GSELEELDDDEDEEEEEDENDDSLAADESELPEK 219
>gnl|CDD|184536 PRK14145, PRK14145, heat shock protein GrpE; Provisional.
Length = 196
Score = 28.3 bits (63), Expect = 3.0
Identities = 8/32 (25%), Positives = 19/32 (59%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEE 272
+ ++ + + + +E EE K+ ++KE E +E
Sbjct: 32 EQAQQNQPQQQTVDEIEELKQKLQQKEVEAQE 63
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
Length = 330
Score = 29.0 bits (66), Expect = 3.0
Identities = 15/47 (31%), Positives = 26/47 (55%), Gaps = 1/47 (2%)
Query: 234 INLSLHYLEQEEE-EEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
NL+ Y E +EE + +++E + K +KKKEK+E + + K
Sbjct: 43 TNLNEQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSK 89
Score = 27.1 bits (61), Expect = 9.7
Identities = 11/51 (21%), Positives = 26/51 (50%), Gaps = 1/51 (1%)
Query: 217 QLFSLSDNY-LLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKK 266
++ +L++ Y ++ E + L L+ + +K KE++E + K K +
Sbjct: 41 EVTNLNEQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSKPR 91
>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
Length = 112
Score = 27.6 bits (61), Expect = 3.1
Identities = 10/23 (43%), Positives = 16/23 (69%)
Query: 254 EEEEEKKKGKKKKEKEEEEKEEN 276
K + KK+++KEEEE+EE+
Sbjct: 82 ATAGAKAEAKKEEKKEEEEEEED 104
>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family. The organisation of
microtubules varies with the cell type and is presumably
controlled by tissue-specific microtubule-associated
proteins (MAPs). The 115-kDa epithelial MAP
(E-MAP-115/MAP7) has been identified as a
microtubule-stabilising protein predominantly expressed
in cell lines of epithelial origin. The binding of this
microtubule associated protein is nucleotide
independent.
Length = 171
Score = 28.1 bits (62), Expect = 3.3
Identities = 11/34 (32%), Positives = 23/34 (67%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
EQ E+EE+ + E+EE+++ + ++ K + EE+
Sbjct: 37 EQREQEEQERREQEEQDRLEREELKRRAAEERLR 70
>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
[Transcription / Chromatin structure and dynamics].
Length = 279
Score = 28.8 bits (64), Expect = 3.4
Identities = 16/39 (41%), Positives = 24/39 (61%), Gaps = 5/39 (12%)
Query: 242 EQEEEEEKGKEEEEEEEKK-----KGKKKKEKEEEEKEE 275
E+ EE G+EEEE+EE +G ++ +EEEE+ E
Sbjct: 179 EERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAE 217
Score = 27.7 bits (61), Expect = 7.9
Identities = 10/33 (30%), Positives = 14/33 (42%)
Query: 244 EEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
E E +EEEEE E + E E+ +
Sbjct: 203 EGNRELNEEEEEEAEGSDDGEDVVDYEGERIDK 235
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 28.7 bits (65), Expect = 3.4
Identities = 11/53 (20%), Positives = 23/53 (43%), Gaps = 5/53 (9%)
Query: 226 LLRNEARKINLSLH-----YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
L + ++ L + + +EE + E+E E ++ K+ +K EE
Sbjct: 197 LKKLGFERLELEEEGTPSELIREIKEELEEIEKERESLLEELKELAKKYLEEL 249
>gnl|CDD|224114 COG1193, COG1193, Mismatch repair ATPase (MutS family) [DNA
replication, recombination, and repair].
Length = 753
Score = 28.8 bits (65), Expect = 3.6
Identities = 11/56 (19%), Positives = 21/56 (37%), Gaps = 4/56 (7%)
Query: 228 RNEARKINLSLHYLEQEEEEEKGKEEEEEEE----KKKGKKKKEKEEEEKEENLVK 279
+ ++ L + +E EEE + E+ +E K E K + +V
Sbjct: 503 KELLEELIEKLEEVRKELEEELEEVEKLLDEVELLTGANSGGKTSLLELKAQIVVL 558
>gnl|CDD|130727 TIGR01666, YCCS, TIGR01666 family membrane protein. This model
represents a clade of sequences from gamma and beta
proteobacteria. These proteins are >700 amino acids long
and many have been annotated as putative membrane
proteins. The gene from Salmonella has been annotated as
a putative efflux transporter. The gene from E. coli has
the name yccS [Cell envelope, Other].
Length = 704
Score = 28.7 bits (64), Expect = 3.7
Identities = 17/109 (15%), Positives = 40/109 (36%), Gaps = 22/109 (20%)
Query: 181 TAVTKMNAQSSR--------------SHTICTIYLGAMAKLHLVDLAGSEQLFSLSDNYL 226
T V+ MN + + +H++ Y+ A+ A ++L +L
Sbjct: 577 TTVSNMNNEPVKYKAYLQKGFRLLKLNHSL-LSYISALG-------AHRDRLKNLQQTAQ 628
Query: 227 LRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
+ + L Y + EE +++E + + +++E +E
Sbjct: 629 FLDGFYPVAKKLIYTLEHIEEIPEAIFNQQQESIETLELRKQEMTAEER 677
>gnl|CDD|220377 pfam09747, DUF2052, Coiled-coil domain containing protein
(DUF2052). This entry is of sequences of two conserved
domains separated by a region of low complexity,
spanning some 200 residues. The function is unknown.
Length = 178
Score = 28.2 bits (63), Expect = 3.7
Identities = 19/50 (38%), Positives = 28/50 (56%)
Query: 226 LLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
L R EAR L E+ EEE+ EEE EE+++ ++ EEE+E+
Sbjct: 64 LDREEARLELLEQQEEEESAEEEEEDPEEENEEEEEEYQRGPFGEEEEED 113
>gnl|CDD|218303 pfam04874, Mak16, Mak16 protein C-terminal region. The precise
function of this eukaryotic protein family is unknown.
The yeast orthologues have been implicated in cell cycle
progression and biogenesis of 60S ribosomal subunits.
The Schistosoma mansoni Mak16 has been shown to target
protein transport to the nucleolus.
Length = 97
Score = 27.1 bits (60), Expect = 3.7
Identities = 15/35 (42%), Positives = 23/35 (65%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
LE EE EE +EEEEEEE++ + + ++E+ E
Sbjct: 55 LEAEESEENDEEEEEEEEEEDEGEIEYVSDDEELE 89
Score = 26.0 bits (57), Expect = 9.2
Identities = 14/35 (40%), Positives = 23/35 (65%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
E EE +E+ +EEEEEE++ + + + EE E+E
Sbjct: 59 ESEENDEEEEEEEEEEDEGEIEYVSDDEELEEEIE 93
>gnl|CDD|131051 TIGR01996, PTS-II-BC-sucr, PTS system, sucrose-specific IIBC
component. This model represents the fused enzyme II B
and C components of the sucrose-specific PTS sugar
transporter system. Sucrose is converted to
sucrose-6-phosphate in the process of translocation into
the cell. Some of these transporters lack their own IIA
domains and instead use the glucose IIA protein (IIAglc
or Crr). The exceptions to this rule are Staphylococci,
Streptococci, Lactococci, Lactobacilli, etc. which
contain their own A domain as a C-terminal fusion. This
family is closely related to the trehalose transporting
PTS IIBC enzymes and the B and C domains of each are
described by subfamily-domain level TIGRFAMs models
(TIGR00826 and TIGR00852, respectively).
Length = 461
Score = 28.8 bits (65), Expect = 3.7
Identities = 11/29 (37%), Positives = 18/29 (62%)
Query: 52 INHMFNGINATLLAYGQTGGGKTYTVSAM 80
I+H F+ I LLA +TGG + +++M
Sbjct: 318 IHHSFHAIETQLLANTKTGGNFLFPIASM 346
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 28.9 bits (64), Expect = 3.8
Identities = 10/33 (30%), Positives = 22/33 (66%)
Query: 245 EEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
EEE + ++ E+++++K K++KEK + +L
Sbjct: 85 EEERRHRQRLEKDKREKKKREKEKRGRRRHHSL 117
Score = 28.5 bits (63), Expect = 4.2
Identities = 12/57 (21%), Positives = 31/57 (54%), Gaps = 8/57 (14%)
Query: 221 LSDNYLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
+SD Y+ E R+ ++ E+ +E+++ E++K+G+++ E +E++
Sbjct: 77 MSDQYVKLEEERR--------HRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDI 125
Score = 28.1 bits (62), Expect = 6.7
Identities = 11/31 (35%), Positives = 22/31 (70%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEE 272
E++ ++ K KE++E+E+++ KKKE E +
Sbjct: 196 EKKSKKPKKKEKKEKEKERDKDKKKEVEGFK 226
>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
Members of this family are associated with type VII
secretion of WXG100 family targets in the Firmicutes,
but not in the Actinobacteria. This highly divergent
protein family consists largely of a central region of
highly polar low-complexity sequence containing
occasional LF motifs in weak repeats about 17 residues
in length, flanked by hydrophobic N- and C-terminal
regions [Protein fate, Protein and peptide secretion and
trafficking].
Length = 150
Score = 27.7 bits (62), Expect = 3.9
Identities = 15/38 (39%), Positives = 24/38 (63%)
Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
YL++E E +K EE++K +K KEK E+E+ +N
Sbjct: 42 YLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQ 79
>gnl|CDD|176652 cd06140, DNA_polA_I_Bacillus_like_exo, inactive DEDDy 3'-5'
exonuclease domain of Bacillus stearothermophilus DNA
polymerase I and similar family-A DNA polymerases.
Bacillus stearothermophilus-like Polymerase I (Pol I), a
subgroup of the family-A DNA polymerases, contains an
inactive DnaQ-like 3'-5' exonuclease domain in the same
polypeptide chain as the polymerase region. The
exonuclease-like domain of these proteins possess the
same fold as the Klenow fragment (KF) of Escherichia
coli Pol I, but does not contain the four critical
metal-binding residues necessary for activity. The
function of this domain is unknown. It might act as a
spacer between the polymerase and the 5'-3' exonuclease
domains. Some members of this subgroup, such as those
from Bacillus sphaericus and Thermus aquaticus, are
thermostable DNA polymerases.
Length = 178
Score = 28.0 bits (63), Expect = 3.9
Identities = 16/58 (27%), Positives = 23/58 (39%), Gaps = 10/58 (17%)
Query: 225 YLLRNEARKINLS---LHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
YLL +L+ YL +E + EE KG K +EE E+L +
Sbjct: 90 YLLDPTRSSYDLADLAKRYLGRELPSD-------EEVYGKGAKFAVPDEEVLAEHLAR 140
>gnl|CDD|165339 PHA03046, PHA03046, Hypothetical protein; Provisional.
Length = 142
Score = 27.8 bits (62), Expect = 3.9
Identities = 12/36 (33%), Positives = 20/36 (55%)
Query: 244 EEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
++EE E E+ ++ G KK + EE+E NL+
Sbjct: 12 DDEENDRNEHREKTSEEDGHYKKRLDVEEEEPNLIN 47
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 28.3 bits (64), Expect = 3.9
Identities = 12/34 (35%), Positives = 24/34 (70%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
E+ E E + ++EE+E E + + ++E+ E+E+EE
Sbjct: 167 EEREAERRERKEEKEREVARLRAQQEEAEDEREE 200
Score = 28.0 bits (63), Expect = 6.7
Identities = 15/40 (37%), Positives = 23/40 (57%), Gaps = 6/40 (15%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGK------KKKEKEEEEKEE 275
Q EE+++ K EE+EEE++ + K EEEE+E
Sbjct: 24 AQIEEKKRIKAEEKEEERRIDEMMEEERLKALAEEEERER 63
Score = 27.6 bits (62), Expect = 7.2
Identities = 14/36 (38%), Positives = 22/36 (61%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
E++EEE + E EEE K +++E+E + KEE
Sbjct: 35 EEKEEERRIDEMMEEERLKALAEEEERERKRKEERR 70
>gnl|CDD|215349 PLN02647, PLN02647, acyl-CoA thioesterase.
Length = 437
Score = 28.6 bits (64), Expect = 3.9
Identities = 12/35 (34%), Positives = 19/35 (54%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
E EEE+ +E E + +K K+ ++K E E E
Sbjct: 225 ETEEEKLLFEEAEARNKLRKKKRGEQKREFENGEA 259
Score = 27.8 bits (62), Expect = 7.0
Identities = 15/34 (44%), Positives = 19/34 (55%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
L E EEEK EE E K KKK+ +++ E E
Sbjct: 222 LSPETEEEKLLFEEAEARNKLRKKKRGEQKREFE 255
>gnl|CDD|216584 pfam01580, FtsK_SpoIIIE, FtsK/SpoIIIE family. FtsK has extensive
sequence similarity to wide variety of proteins from
prokaryotes and plasmids, termed the FtsK/SpoIIIE
family. This domain contains a putative ATP binding
P-loop motif. It is found in the FtsK cell division
protein from E. coli and the stage III sporulation
protein E SpoIIIE, which has roles in regulation of
prespore specific gene expression in B. subtilis. A
mutation in FtsK causes a temperature sensitive block
in cell division and it is involved in peptidoglycan
synthesis or modification. The SpoIIIE protein is
implicated in intercellular chromosomal DNA transfer.
Length = 201
Score = 28.1 bits (63), Expect = 3.9
Identities = 8/23 (34%), Positives = 13/23 (56%)
Query: 60 NATLLAYGQTGGGKTYTVSAMIM 82
LL G TG GK+ ++ +I+
Sbjct: 38 MPHLLIAGATGSGKSTFLNTLIL 60
>gnl|CDD|236278 PRK08506, PRK08506, replicative DNA helicase; Provisional.
Length = 472
Score = 28.4 bits (64), Expect = 4.0
Identities = 14/37 (37%), Positives = 21/37 (56%), Gaps = 5/37 (13%)
Query: 240 YLEQEEEE--EKGKEEEEEEEKKKGKKKKEKEEEEKE 274
Y E+EE+E +K K+E +EE + + K EE E
Sbjct: 390 YKEREEKEKEKKAKKEGKEERR---IHFQNKSIEEAE 423
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 28.4 bits (64), Expect = 4.1
Identities = 13/43 (30%), Positives = 20/43 (46%), Gaps = 7/43 (16%)
Query: 240 YLEQ-------EEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
Y+E+ +E K K E++ K KK K E+KE+
Sbjct: 363 YIEEPLKARVIDELRPKTKAPSEKKTGKPSKKVLAKRAEKKEK 405
>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
Length = 1627
Score = 28.7 bits (64), Expect = 4.2
Identities = 14/36 (38%), Positives = 25/36 (69%)
Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
+L++ +E+++ K+EE+ EE K + +E EEEEK
Sbjct: 275 WLKELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGF 310
Score = 28.7 bits (64), Expect = 4.9
Identities = 10/29 (34%), Positives = 21/29 (72%)
Query: 251 KEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
KE++EE++++K ++ K +E +E+ E K
Sbjct: 280 KEKKEEKDEEKSEEVKTEEVDEEFEEEEK 308
>gnl|CDD|163125 TIGR03071, couple_hipA, HipA N-terminal domain. Although Pfam
models pfam07805 and pfam07804 currently are called
HipA-like N-terminal domain and HipA-like C-terminal
domain, respectively, those models hit the central and
C-terminal regions of E. coli HipA but not the
N-terminal region. This model hits the N-terminal region
of HipA and its homologs, and also identifies proteins
that lack match regions for pfam07804 and pfam07805.
Length = 101
Score = 26.9 bits (60), Expect = 4.4
Identities = 13/38 (34%), Positives = 19/38 (50%), Gaps = 1/38 (2%)
Query: 208 HLVDLAGSEQLFSLSDNYLLRNEARKINLSLHYLEQEE 245
L + G + F D+YL A ++LSL L+ EE
Sbjct: 13 ILTEDEGGKLSFRYDDDYLGSPSAPPLSLSLP-LQAEE 49
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 197
Score = 28.1 bits (62), Expect = 4.5
Identities = 13/38 (34%), Positives = 24/38 (63%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
++ EE E+ E E EEK ++ +E++E E+EE ++
Sbjct: 133 QEREELEERMEWERREEKIDEREDQEEQEREREEQTIE 170
>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
Med19 represents a family of conserved proteins which
are members of the multi-protein co-activator Mediator
complex. Mediator is required for activation of RNA
polymerase II transcription by DNA binding
transactivators.
Length = 178
Score = 27.9 bits (62), Expect = 4.6
Identities = 12/27 (44%), Positives = 24/27 (88%)
Query: 247 EEKGKEEEEEEEKKKGKKKKEKEEEEK 273
E+K K+++ E++K++ KKKKEK++++K
Sbjct: 141 EKKHKKKKHEDDKERKKKKKEKKKKKK 167
>gnl|CDD|236289 PRK08563, PRK08563, DNA-directed RNA polymerase subunit E';
Provisional.
Length = 187
Score = 27.9 bits (63), Expect = 4.6
Identities = 17/46 (36%), Positives = 21/46 (45%), Gaps = 12/46 (26%)
Query: 231 ARKINLSLHYLEQEEEEEK----------GKEEEEEEEKKKGKKKK 266
AR + +SL E+ K GK E EEEKKK KK+
Sbjct: 144 ARIVAVSL--KERRPRGSKIGLTMRQPGLGKLEWIEEEKKKAKKEA 187
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 28.0 bits (63), Expect = 4.7
Identities = 14/42 (33%), Positives = 23/42 (54%)
Query: 230 EARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEE 271
K+ L L++E EE K + E + + +K KK +E+ EE
Sbjct: 63 ALNKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEE 104
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 28.5 bits (64), Expect = 4.7
Identities = 11/42 (26%), Positives = 20/42 (47%)
Query: 234 INLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
+N LE+E EE + + E KK++++ E + E
Sbjct: 859 LNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRE 900
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 28.5 bits (64), Expect = 4.9
Identities = 19/49 (38%), Positives = 30/49 (61%)
Query: 227 LRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
L + ++ L LE+E EE K + EE E EK++ + + ++ EEEKEE
Sbjct: 840 LEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEE 888
Score = 28.1 bits (63), Expect = 5.9
Identities = 12/52 (23%), Positives = 23/52 (44%)
Query: 228 RNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
R L E EE E+ KE+ E +++ +++ EE E+ ++
Sbjct: 306 LLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELE 357
Score = 27.8 bits (62), Expect = 8.4
Identities = 12/39 (30%), Positives = 19/39 (48%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
LEQ E + +EE EE+ ++ E+ E E L +
Sbjct: 349 LEQLLAELEEAKEELEEKLSALLEELEELFEALREELAE 387
>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing. This is a family of
proteins that are involved in rRNA processing. In a
localisation study they were found to localise to the
nucleus and nucleolus. The family also includes other
metazoa members from plants to mammals where the protein
has been named BR22 and is associated with TTF-1,
thyroid transcription factor 1. In the lungs, the family
binds TTF-1 to form a complex which influences the
expression of the key lung surfactant protein-B (SP-B)
and -C (SP-C), the small hydrophobic surfactant proteins
that maintain surface tension in alveoli.
Length = 150
Score = 27.5 bits (61), Expect = 4.9
Identities = 13/37 (35%), Positives = 24/37 (64%)
Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
+E+ E++ + ++E EK + KKK+KE E + + L K
Sbjct: 86 REQREKELAKRQKELEKIELSKKKQKERERRRKKLTK 122
>gnl|CDD|216212 pfam00955, HCO3_cotransp, HCO3- transporter family. This family
contains Band 3 anion exchange proteins that exchange
CL-/HCO3-. This family also includes cotransporters of
Na+/HCO3-.
Length = 501
Score = 28.4 bits (64), Expect = 4.9
Identities = 9/52 (17%), Positives = 14/52 (26%)
Query: 99 MSYLQLYSEKCYDLLNGNKEVTLKNWIFNIPQSDQRVQGPPDPQTGPASPGN 150
Y+ YS C + N T +W + G + P
Sbjct: 181 SHYITNYSCVCVPPITPNNHSTEIDWTNLTKSECENYGGLLIGTSCGPHPPP 232
>gnl|CDD|234055 TIGR02907, spore_VI_D, stage VI sporulation protein D. SpoVID, the
stage VI sporulation protein D, is restricted to
endospore-forming members of the bacteria, all of which
are found among the Firmicutes. It is widely distributed
but not quite universal in this group. Between
well-conserved N-terminal and C-terminal domains is a
poorly conserved, low-complexity region of variable
length, rich enough in glutamic acid to cause spurious
BLAST search results unless a filter is used. The seed
alignment for this model was trimmed, in effect, by
choosing member sequences in which these regions are
relatively short. SpoVID is involved in spore coat
assembly by the mother cell compartment late in the
process of sporulation [Cellular processes, Sporulation
and germination].
Length = 338
Score = 28.3 bits (63), Expect = 4.9
Identities = 11/37 (29%), Positives = 20/37 (54%)
Query: 239 HYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
LE E+EE+ E+ E+ +++ K+ E+EE
Sbjct: 230 TELEVEDEEKALDEQTEDPQQEDALAGDAKKALEEEE 266
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 28.4 bits (63), Expect = 4.9
Identities = 12/37 (32%), Positives = 16/37 (43%)
Query: 243 QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
+ EE K K E +KK + K E+ K E K
Sbjct: 178 KAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAK 214
Score = 28.0 bits (62), Expect = 6.0
Identities = 12/35 (34%), Positives = 23/35 (65%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
EQ EE K K+ E+E K+ +K++ K +E++++
Sbjct: 85 EEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQ 119
>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
represents the eukaryotic large ribosomal protein P1.
Eukaryotic P1 and P2 are functionally equivalent to the
bacterial protein L7/L12, but are not homologous to
L7/L12. P1 is located in the L12 stalk, with proteins
P2, P0, L11, and 28S rRNA. P1 and P2 are the only
proteins in the ribosome to occur as multimers, always
appearing as sets of heterodimers. Recent data indicate
that eukaryotes have four copies (two heterodimers),
while most archaeal species contain six copies of L12p
(three homodimers) and bacteria may have four or six
copies (two or three homodimers), depending on the
species. Experiments using S. cerevisiae P1 and P2
indicate that P1 proteins are positioned more internally
with limited reactivity in the C-terminal domains, while
P2 proteins seem to be more externally located and are
more likely to interact with other cellular components.
In lower eukaryotes, P1 and P2 are further subdivided
into P1A, P1B, P2A, and P2B, which form P1A/P2B and
P1B/P2A heterodimers. Some plant species have a third
P-protein, called P3, which is not homologous to P1 and
P2. In humans, P1 and P2 are strongly autoimmunogenic.
They play a significant role in the etiology and
pathogenesis of systemic lupus erythema (SLE). In
addition, the ribosome-inactivating protein
trichosanthin (TCS) interacts with human P0, P1, and P2,
with its primary binding site located in the C-terminal
region of P2. TCS inactivates the ribosome by
depurinating a specific adenine in the sarcin-ricin loop
of 28S rRNA.
Length = 103
Score = 26.9 bits (60), Expect = 4.9
Identities = 8/23 (34%), Positives = 13/23 (56%)
Query: 250 GKEEEEEEEKKKGKKKKEKEEEE 272
E KK+ KK++E+EE +
Sbjct: 73 AAAAAAAEAKKEEKKEEEEEESD 95
>gnl|CDD|234655 PRK00133, metG, methionyl-tRNA synthetase; Reviewed.
Length = 673
Score = 28.6 bits (65), Expect = 4.9
Identities = 7/32 (21%), Positives = 8/32 (25%)
Query: 244 EEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
E E KE + EE E
Sbjct: 536 EALIEASKEAAAAKAAAAAAAAPLAEEPIAET 567
>gnl|CDD|241384 cd13230, PH1_SSRP1-like, Structure Specific Recognition protein 1
(SSRP1) Pleckstrin homology (PH) domain, repeat 1.
SSRP1 is a component of FACT (facilitator of chromatin
transcription), an essential chromatin reorganizing
factor. In yeast FACT (yFACT) is composed of three
proteins: Spt16/Cdc68, Pob3, and Nhp6. In metazoans the
Pob3 and Nhp6 orthologs are fused to form SSRP1/T160 in
human and mouse, respectively. The middle domain of the
Pob3 subunit (Pob3-M) has an unusual double pleckstrin
homology (PH) architecture. yFACT interacts in a
physiologically important way with the central
single-strand DNA binding factor RPA to promote a step
in DNA Replication. Coordinated function by yFACT and
RPA is important during nucleosome deposition. These
results support the model that the FACT family has an
essential role in constructing nucleosomes during DNA
replication, and suggest that RPA contributes to this
process. Members of this cd are composed of the first
PH-like repeat. PH domains have diverse functions, but
in general are involved in targeting proteins to the
appropriate cellular location or in the interaction with
a binding partner. They share little sequence
conservation, but all have a common fold, which is
electrostatically polarized. Less than 10% of PH domains
bind phosphoinositide phosphates (PIPs) with high
affinity and specificity. PH domains are distinguished
from other PIP-binding domains by their specific
high-affinity binding to PIPs with two vicinal phosphate
groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
which results in targeting some PH domain proteins to
the plasma membrane. A few display strong specificity in
lipid binding. Any specificity is usually determined by
loop regions or insertions in the N-terminus of the
domain, which are not conserved across all PH domains.
PH domains are found in cellular signaling proteins such
as serine/threonine kinase, tyrosine kinases, regulators
of G-proteins, endocytotic GTPases, adaptors, as well as
cytoskeletal associated molecules and in lipid
associated enzymes.
Length = 137
Score = 27.5 bits (62), Expect = 5.0
Identities = 14/38 (36%), Positives = 20/38 (52%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLV 278
+++EE E EEE ++K K K EKE E +V
Sbjct: 84 FDKDEEIELELNLTEEELEEKYKGKLEKEYEGPLYEVV 121
>gnl|CDD|199812 cd10488, MH1_R-SMAD, N-terminal Mad Homology 1 (MH1) domain of
receptor regulated SMADs. The MH1 is a small
DNA-binding domain present in SMAD (small mothers
against decapentaplegic) family of proteins, which are
signal transducers and transcriptional modulators that
mediate multiple signaling pathways. It binds to the
major groove in an unusual manner via a beta hairpin
structure. It negatively regulates the functions of the
MH2 domain, the C-terminal domain of SMAD. This MH1
domain is found in all receptor regulated SMADs
(R-SMADs) including SMAD1, SMAD2, SMAD3, SMAD5 and
SMAD9. SMAD1 plays an essential role in bone development
and postnatal bone formation through activation by bone
morphogenetic protein (BMP) type 1 receptor kinase.
SMAD2 regulates multiple cellular processes, such as
cell proliferation, apoptosis and differentiation, while
SMAD3 modulates signals of activin and TGF-beta. SMAD4,
a common mediator SMAD (co-SMAD) binds R-SMADs, forming
an oligomeric complex that binds to DNA and serves as a
transcription factor. SMAD5 is involved in bone
morphogenetic proteins (BMP) signal modulation, possibly
playing a role in the pathway involving inhibition of
hematopoietic progenitor cells by TGF-beta. SMAD9 (also
known as SMAD8) can mediate the differentiation of
mesenchymal stem cells (MSCs) into tendon-like cells by
inhibiting the osteogenic pathway.
Length = 123
Score = 27.2 bits (60), Expect = 5.0
Identities = 16/32 (50%), Positives = 19/32 (59%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
EQ EEEK E+ + KK KKK + EE EK
Sbjct: 13 EQNGEEEKWAEKAVKSLVKKLKKKGQLEELEK 44
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 27.3 bits (61), Expect = 5.2
Identities = 12/34 (35%), Positives = 25/34 (73%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
EQ+ ++ KE+ +++++KK KKKK+K++ K+
Sbjct: 97 EQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKA 130
Score = 26.6 bits (59), Expect = 9.5
Identities = 9/28 (32%), Positives = 22/28 (78%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEK 268
L+++ +++K K+++++++KKK KK K
Sbjct: 105 LKEKAKQKKQKKKKKKKKKKKTSKKAAK 132
>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT. This family
consists of several bacterial cobalamin biosynthesis
(CobT) proteins. CobT is involved in the transformation
of precorrin-3 into cobyrinic acid.
Length = 282
Score = 27.9 bits (62), Expect = 5.4
Identities = 9/34 (26%), Positives = 15/34 (44%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
+E+E++ EEEE + + EE E
Sbjct: 230 PKEDEDDDQGEEEESGSSDSLSEDSDASSEEMES 263
>gnl|CDD|222117 pfam13422, DUF4110, Domain of unknown function (DUF4110). This is
a family that is found predominantly at the C-terminus
of Kelch-containing proteins. However, the exact
function of this region is not known.
Length = 99
Score = 26.9 bits (60), Expect = 5.4
Identities = 12/28 (42%), Positives = 14/28 (50%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEK 268
LE EEEE+ G EE K+K E
Sbjct: 69 LEDEEEEQAGIEEVVSLAKEKDWGGGEG 96
>gnl|CDD|235871 PRK06835, PRK06835, DNA replication protein DnaC; Validated.
Length = 329
Score = 27.9 bits (63), Expect = 5.5
Identities = 11/20 (55%), Positives = 13/20 (65%)
Query: 56 FNGINATLLAYGQTGGGKTY 75
F+ N LL YG TG GKT+
Sbjct: 179 FDKNNENLLFYGNTGTGKTF 198
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 28.1 bits (63), Expect = 5.6
Identities = 20/48 (41%), Positives = 26/48 (54%), Gaps = 1/48 (2%)
Query: 230 EARKINLSLHYLEQE-EEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
+ R L+ LE+E EE EK KEE EEE K + E ++E KE
Sbjct: 378 KKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELK 425
Score = 27.7 bits (62), Expect = 8.8
Identities = 23/53 (43%), Positives = 32/53 (60%), Gaps = 2/53 (3%)
Query: 227 LRNEARKINLSLHYLEQEEE--EEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
+E R+I L LE+E EE+ KE EE+EE+ + KKK KE E++ E L
Sbjct: 305 YLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEEL 357
>gnl|CDD|216796 pfam01935, DUF87, Domain of unknown function DUF87. The function
of this prokaryotic domain is unknown. It contains
several conserved aspartates and histidines that could
be metal ligands.
Length = 218
Score = 27.7 bits (62), Expect = 5.6
Identities = 7/16 (43%), Positives = 11/16 (68%)
Query: 66 YGQTGGGKTYTVSAMI 81
G TG GK+ TV+ ++
Sbjct: 29 LGSTGSGKSNTVAVLL 44
>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family. The yeast member of this family
(Kri1p) is found to be required for 40S ribosome
biogenesis in the nucleolus.
Length = 99
Score = 26.8 bits (60), Expect = 5.7
Identities = 14/32 (43%), Positives = 20/32 (62%), Gaps = 3/32 (9%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
++ +EEEK + EEE K+ K K +E EEK
Sbjct: 1 KERKEEEK---AQREEELKRLKNLKREEIEEK 29
>gnl|CDD|205495 pfam13315, DUF4085, Protein of unknown function (DUF4085). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 101 and 269 amino
acids in length.
Length = 208
Score = 27.8 bits (62), Expect = 5.9
Identities = 16/40 (40%), Positives = 22/40 (55%)
Query: 238 LHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
L + E +EE E+ EEEE + KE+ EE KE+ L
Sbjct: 17 LSFPESQEEWEEELREEEESGNDYLQSLKEEIEEIKEDLL 56
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 28.0 bits (63), Expect = 5.9
Identities = 11/49 (22%), Positives = 27/49 (55%)
Query: 227 LRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
L + ++ LE++E+E KE+ +E++++ ++ ++ EE E
Sbjct: 92 LDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELER 140
>gnl|CDD|219956 pfam08658, Rad54_N, Rad54 N terminal. This is the N terminal of
the DNA repair protein Rad54.
Length = 191
Score = 27.7 bits (62), Expect = 5.9
Identities = 9/28 (32%), Positives = 16/28 (57%)
Query: 252 EEEEEEEKKKGKKKKEKEEEEKEENLVK 279
++ + EE+K K ++ +E E K N K
Sbjct: 143 DKPKIEEEKAEKDQEPEESETKLSNGPK 170
>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
This is a family of fungal proteins whose function is
unknown.
Length = 130
Score = 27.2 bits (61), Expect = 6.0
Identities = 15/38 (39%), Positives = 21/38 (55%)
Query: 232 RKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKE 269
R+I L L +E EK K + EKK +++KEKE
Sbjct: 92 RRIALRLRRERTKERAEKEKRTRKNREKKFKRRQKEKE 129
Score = 26.5 bits (59), Expect = 9.2
Identities = 17/52 (32%), Positives = 28/52 (53%), Gaps = 10/52 (19%)
Query: 232 RKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKK---------KKEKEEEEKE 274
R I+L H + E+E + KE++++ K+ GKK ++ KE EKE
Sbjct: 60 RVIDLKKHNAKVEKELLREKEKKKKR-KRPGKKRRIALRLRRERTKERAEKE 110
>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
subunit 1; Provisional.
Length = 319
Score = 28.1 bits (63), Expect = 6.0
Identities = 10/32 (31%), Positives = 24/32 (75%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
E+EEE++ + E+E+EE + +++++ +E +K
Sbjct: 288 EEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319
>gnl|CDD|205480 pfam13300, DUF4078, Domain of unknown function (DUF4078). This
family is found from fungi to humans, but its exact
function is not known.
Length = 88
Score = 26.5 bits (59), Expect = 6.2
Identities = 17/38 (44%), Positives = 24/38 (63%), Gaps = 1/38 (2%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
EQ EE EK +EE E E K++ +++KEK + EE K
Sbjct: 47 EQMEELEKAREETERERKER-EERKEKRKRAIEERRKK 83
>gnl|CDD|236325 PRK08654, PRK08654, pyruvate carboxylase subunit A; Validated.
Length = 499
Score = 28.0 bits (63), Expect = 6.3
Identities = 12/51 (23%), Positives = 25/51 (49%), Gaps = 1/51 (1%)
Query: 226 LLRNEA-RKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
++ NE + NL H++E+E + + EE+++ K EK ++
Sbjct: 424 VMENENFVRGNLHTHFIEEETTILEEMKRYALEEEEREKTLSEKFFPGNKK 474
>gnl|CDD|225805 COG3266, DamX, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 292
Score = 28.0 bits (62), Expect = 6.3
Identities = 15/57 (26%), Positives = 25/57 (43%)
Query: 129 PQSDQRVQGPPDPQTGPASPGNGRSAASLTVKKLDSLNSAVQLIVTGNENKVTAVTK 185
++ +VQ QT P +AAS +K S + +QL +G+ + V K
Sbjct: 174 TATEAKVQTASPAQTPATPPAGKGAAASGQLKSAPSSHYTLQLSASGSYDNVNGFAK 230
>gnl|CDD|225364 COG2805, PilT, Tfp pilus assembly protein, pilus retraction ATPase
PilT [Cell motility and secretion / Intracellular
trafficking and secretion].
Length = 353
Score = 28.0 bits (63), Expect = 6.4
Identities = 10/18 (55%), Positives = 13/18 (72%)
Query: 64 LAYGQTGGGKTYTVSAMI 81
L G TG GK+ T++AMI
Sbjct: 129 LVTGPTGSGKSTTLAAMI 146
>gnl|CDD|235396 PRK05299, rpsB, 30S ribosomal protein S2; Provisional.
Length = 258
Score = 27.8 bits (63), Expect = 6.4
Identities = 12/30 (40%), Positives = 20/30 (66%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEE 271
E EE+ +E EEEEE+++ ++ +E E E
Sbjct: 229 LAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
>gnl|CDD|148072 pfam06244, DUF1014, Protein of unknown function (DUF1014). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 122
Score = 27.0 bits (60), Expect = 6.4
Identities = 12/39 (30%), Positives = 20/39 (51%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
+E+ E+E+ K E EE+ KK ++ EE N +
Sbjct: 5 IEERLEKEQRKLEAPEEKSKKNAAREVPLEENLNVNRNR 43
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 27.9 bits (63), Expect = 6.5
Identities = 12/37 (32%), Positives = 25/37 (67%), Gaps = 1/37 (2%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEE-EEKEENL 277
E E++ E+ + +E EK K + +++KE+ +E+E+ L
Sbjct: 531 ELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKL 567
>gnl|CDD|219905 pfam08564, CDC37_C, Cdc37 C terminal domain. Cdc37 is a protein
required for the activity of numerous eukaryotic protein
kinases. This domains corresponds to the C terminal
domain whose function is unclear. It is found C terminal
to the Hsp90 chaperone (Heat shocked protein 90) binding
domain pfam08565 and the N terminal kinase binding
domain of Cdc37 pfam03234.
Length = 89
Score = 26.2 bits (58), Expect = 6.6
Identities = 10/32 (31%), Positives = 22/32 (68%)
Query: 245 EEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
+ +G++E +E E++ +++ EKEE E+E+
Sbjct: 58 NAKIEGEKEFKELEEEYNEEEAEKEEIEEEDE 89
Score = 26.2 bits (58), Expect = 8.5
Identities = 12/33 (36%), Positives = 20/33 (60%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
+ E EK +E EEE ++ +K+E EEE++
Sbjct: 57 PNAKIEGEKEFKELEEEYNEEEAEKEEIEEEDE 89
>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi. This family includes
the fifth essential DNA polymerase in yeast EC:2.7.7.7.
Pol5p is localised exclusively to the nucleolus and
binds near or at the enhancer region of rRNA-encoding
DNA repeating units.
Length = 784
Score = 27.9 bits (62), Expect = 6.6
Identities = 7/33 (21%), Positives = 21/33 (63%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
E+E+E++ + +++E+E + + + + E + E
Sbjct: 648 EEEDEDDLEETDDDEDECEAIEDSESESESDGE 680
>gnl|CDD|216833 pfam01991, vATP-synt_E, ATP synthase (E/31 kDa) subunit. This
family includes the vacuolar ATP synthase E subunit, as
well as the archaebacterial ATP synthase E subunit.
Length = 195
Score = 27.3 bits (61), Expect = 6.7
Identities = 15/33 (45%), Positives = 21/33 (63%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
E EEE E K E EE +KK ++ EK+E++ E
Sbjct: 16 EAEEEFEIEKAEAVEEAEKKIEEIYEKKEKQAE 48
>gnl|CDD|234340 TIGR03744, traC_PFL_4706, conjugative transfer ATPase, PFL_4706
family. Members of this protein family are predicted
ATP-binding proteins apparently associated with DNA
conjugal transfer. Members are found both in plasmids
and in bacterial chromosomal regions that appear to
derive from integrative elements such as conjugative
transposons. More distant homologs, outside the scope of
this family, include type IV secretion/conjugal transfer
proteins such as TraC, VirB4 and TrsE. The granularity
of this protein family definition is chosen so as to
represent one distinctive clade and act as a marker
through which to define and recognize the class of
mobile element it serves [Mobile and extrachromosomal
element functions, Plasmid functions].
Length = 893
Score = 28.1 bits (63), Expect = 6.8
Identities = 11/26 (42%), Positives = 18/26 (69%)
Query: 60 NATLLAYGQTGGGKTYTVSAMIMKTL 85
NA LL G TG GK+ T++ ++M+ +
Sbjct: 475 NAHLLILGPTGAGKSATLTNLLMQVM 500
>gnl|CDD|131972 TIGR02926, AhaH, ATP synthase archaeal, H subunit. he A1/A0 ATP
synthase is homologous to the V-type (V1/V0, vacuolar)
ATPase, but functions in the ATP synthetic direction as
does the F1/F0 ATPase of bacteria. The hydrophilic A1
"stalk" complex (AhaABCDEFG) is the site of ATP
generation and is coupled to the membrane-embedded
proton translocating A0 complex. It is unclear precisely
where AhaH fits into these complexes.
Length = 85
Score = 26.3 bits (58), Expect = 6.8
Identities = 20/49 (40%), Positives = 28/49 (57%), Gaps = 7/49 (14%)
Query: 228 RNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEK--EEEEKE 274
R EAR++ E EEE K EE +E +++ +K+ EK EE EKE
Sbjct: 30 REEAREL-----LEEAEEEASKLGEEIIKEAEEEIEKEAEKIREEGEKE 73
>gnl|CDD|216848 pfam02018, CBM_4_9, Carbohydrate binding domain. This family
includes diverse carbohydrate binding domains.
Length = 134
Score = 27.0 bits (60), Expect = 6.9
Identities = 17/79 (21%), Positives = 25/79 (31%), Gaps = 16/79 (20%)
Query: 72 GKTYTVSAMIMKTLQHVMQRCNKDDVYMSYLQLYSEKCYDLLNGNKEVTLKNW-----IF 126
G TYTVS + + ++ + YD + K V W F
Sbjct: 62 GTTYTVSFWVK----ASSGPQ---TISVTLQITDASGNYDTVADEKVVLTGEWTKLEGTF 114
Query: 127 NIPQSDQR----VQGPPDP 141
IP + V+ PD
Sbjct: 115 TIPGTASTVELYVELGPDS 133
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 27.8 bits (62), Expect = 6.9
Identities = 13/30 (43%), Positives = 21/30 (70%)
Query: 247 EEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
E+K E +++ EKKK KKK++KE + + E
Sbjct: 60 EDKNNESKKKSEKKKKKKKEKKEPKSEGET 89
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 27.8 bits (62), Expect = 7.0
Identities = 10/39 (25%), Positives = 18/39 (46%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
E +E E ++ K KKKK+K++++K
Sbjct: 37 DAFLAELISENQEAENKQNNKKKKKKKKKKKKKNLGEAY 75
>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
proteins-interacting protein 1. Members of this family
of proteins act as negative regulators of G1 to S cell
cycle phase progression by inhibiting cyclin-dependent
kinases. Inhibitory effects are additive with GADD45
proteins but occur also in the absence of GADD45
proteins. Furthermore, they act as a repressor of the
orphan nuclear receptor NR4A1 by inhibiting AB
domain-mediated transcriptional activity.
Length = 217
Score = 27.5 bits (61), Expect = 7.0
Identities = 12/22 (54%), Positives = 19/22 (86%)
Query: 252 EEEEEEEKKKGKKKKEKEEEEK 273
+++E+EEKKK K+ K +E+EEK
Sbjct: 185 QQKEKEEKKKVKEAKRREKEEK 206
>gnl|CDD|225435 COG2880, COG2880, Uncharacterized protein conserved in archaea
[Function unknown].
Length = 67
Score = 25.9 bits (57), Expect = 7.1
Identities = 12/47 (25%), Positives = 23/47 (48%), Gaps = 4/47 (8%)
Query: 232 RKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLV 278
+K++L E E+ + + +E+ E KG K+ KE E+ +
Sbjct: 20 KKVDLK----EGEKVKIVIRVKEKIYEILKGSLKEIKEILEEILEEI 62
>gnl|CDD|150531 pfam09871, DUF2098, Uncharacterized protein conserved in archaea
(DUF2098). This domain, found in various hypothetical
prokaryotic proteins, has no known function.
Length = 91
Score = 26.1 bits (58), Expect = 7.3
Identities = 14/39 (35%), Positives = 24/39 (61%), Gaps = 1/39 (2%)
Query: 239 HYLE-QEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
YLE ++ +EK +E EE++E+ + KKE+E E +
Sbjct: 44 DYLEVTDKVKEKKEEREEDKEELIERIKKEEETFEDVDL 82
>gnl|CDD|221473 pfam12230, PRP21_like_P, Pre-mRNA splicing factor PRP21 like
protein. This domain family is found in eukaryotes, and
is typically between 212 and 238 amino acids in length.
The family is found in association with pfam01805. There
are two completely conserved residues (W and H) that may
be functionally important. PRP21 is required for
assembly of the prespliceosome and it interacts with U2
snRNP and/or pre-mRNA in the prespliceosome. This family
also contains proteins similar to PRP21, such as the
mammalian SF3a. SF3a also interacts with U2 snRNP from
the prespliceosome, converting it to its active form.
Length = 230
Score = 27.4 bits (61), Expect = 7.3
Identities = 15/41 (36%), Positives = 24/41 (58%), Gaps = 3/41 (7%)
Query: 220 SLSDNYLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKK 260
S+ D Y + A K + + +++EE+ KEEEEEEE+
Sbjct: 12 SVKDKYYILERAYKR---AEWEKHQKKEEQKKEEEEEEERI 49
>gnl|CDD|233063 TIGR00631, uvrb, excinuclease ABC, B subunit. All proteins in this
family for wich functions are known are DNA helicases
that function in the nucleotide excision repair and are
endonucleases that make the 3' incision next to DNA
damage. They are part of a pathway requiring UvrA, UvrB,
UvrC, and UvrD homologs. This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 655
Score = 27.6 bits (62), Expect = 7.7
Identities = 16/51 (31%), Positives = 28/51 (54%), Gaps = 15/51 (29%)
Query: 62 TLLAYGQTGGGKTYTVSAMIMKTLQHVMQRCNKDDVYMSY-----LQLYSE 107
TLL G TG GKT+T++ +V+ + N+ + +++ QLY+E
Sbjct: 33 TLL--GVTGSGKTFTMA--------NVIAQVNRPTLVIAHNKTLAAQLYNE 73
>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
domain. This domain is found in a number of different
types of plant proteins including NAM-like proteins.
Length = 147
Score = 27.0 bits (60), Expect = 7.7
Identities = 11/35 (31%), Positives = 22/35 (62%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
++E E+EK KEE + + +K++ + E++K E
Sbjct: 80 KKEEAEKEKEKEERFMKALAEAEKERAELEKKKAE 114
>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
complex aNOP56 subunit; Provisional.
Length = 414
Score = 27.6 bits (62), Expect = 8.1
Identities = 7/29 (24%), Positives = 22/29 (75%)
Query: 246 EEEKGKEEEEEEEKKKGKKKKEKEEEEKE 274
+ E+ K ++ +++KK+ KK K+++++ ++
Sbjct: 386 KREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
>gnl|CDD|203903 pfam08312, cwf21, cwf21 domain. The cwf21 family is involved in
mRNA splicing. It has been isolated as a subcomplex of
the splicosome in Schizosaccharomyces pombe. The
function of the cwf21 domain is to bind directly to the
spliceosomal protein Prp8. Mutations in the cwf21 domain
prevent Prp8 from binding. The structure of this domain
has recently been solved which shows this domain to be
composed of two alpha helices.
Length = 46
Score = 24.8 bits (55), Expect = 8.3
Identities = 16/41 (39%), Positives = 23/41 (56%), Gaps = 1/41 (2%)
Query: 232 RKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEE 272
R+I + +++ EE+G EEE EEK +KK EE E
Sbjct: 7 REIEVKC-LELRDKLEEQGLSEEEIEEKVDELRKKLLEELE 46
>gnl|CDD|214697 smart00498, FH2, Formin Homology 2 Domain. FH proteins control
rearrangements of the actin cytoskeleton, especially in
the context of cytokinesis and cell polarisation.
Members of this family have been found to interact with
Rho-GTPases, profilin and other actin-assoziated
proteins. These interactions are mediated by the
proline-rich FH1 domain, usually located in front of FH2
(but not listed in SMART). Despite this cytosolic
function, vertebrate formins have been assigned
functions within the nucleus. A set of Formin-Binding
Proteins (FBPs) has been shown to bind FH1 with their WW
domain.
Length = 392
Score = 27.7 bits (62), Expect = 8.3
Identities = 13/30 (43%), Positives = 17/30 (56%)
Query: 240 YLEQEEEEEKGKEEEEEEEKKKGKKKKEKE 269
+ + EE K +EEEEE KK K+ E E
Sbjct: 330 FSKAAEENIKKEEEEEERRKKLVKETTEYE 359
>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region. The myc family
belongs to the basic helix-loop-helix leucine zipper
class of transcription factors, see pfam00010. Myc forms
a heterodimer with Max, and this complex regulates cell
growth through direct activation of genes involved in
cell replication. Mutations in the C-terminal 20
residues of this domain cause unique changes in the
induction of apoptosis, transformation, and G2 arrest.
Length = 329
Score = 27.6 bits (61), Expect = 8.7
Identities = 14/34 (41%), Positives = 19/34 (55%), Gaps = 6/34 (17%)
Query: 242 EQEEEEEKGKEEEEEEE------KKKGKKKKEKE 269
E EE+EE+ +EEEEEEE +K+ K
Sbjct: 227 ESEEDEEEEEEEEEEEEIDVVTVEKRRSSSNRKA 260
>gnl|CDD|218391 pfam05029, TIMELESS_C, Timeless protein C terminal region. The
timeless (tim) gene is essential for circadian function
in Drosophila. Putative homologues of Drosophila tim
have been identified in both mice and humans (mTim and
hTIM, respectively). Mammalian TIM is not the true
orthologue of Drosophila TIM, but is the likely
orthologue of a fly gene, timeout (also called tim-2).
mTim has been shown to be essential for embryonic
development, but does not have substantiated circadian
function. Some family members contain a SANT domain in
this region.
Length = 507
Score = 27.8 bits (61), Expect = 8.7
Identities = 12/38 (31%), Positives = 19/38 (50%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
+ EEEE+ + + EE EK+ + E ENL +
Sbjct: 251 DPEEEEQLPESKPEETEKRVSAFQVEGSTLISAENLRQ 288
>gnl|CDD|237875 PRK14974, PRK14974, cell division protein FtsY; Provisional.
Length = 336
Score = 27.2 bits (61), Expect = 8.9
Identities = 15/36 (41%), Positives = 24/36 (66%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
E EEE+ +E+EEE+++K G K K E KE+++
Sbjct: 28 APEAEEEEEEEDEEEKKEKPGFFDKAKITEIKEKDI 63
>gnl|CDD|236410 PRK09202, nusA, transcription elongation factor NusA; Validated.
Length = 470
Score = 27.5 bits (62), Expect = 9.1
Identities = 20/82 (24%), Positives = 32/82 (39%), Gaps = 8/82 (9%)
Query: 204 MAKLHLVDLAGSEQLFSLSDN------YLLRNEARKINLSLHYLEQEEEEEKGKEEEE-- 255
++ + LF + + LL E L Y+ EE E +EE
Sbjct: 343 ASEKRQAEFDAILDLFMEALDIDEEIAQLLVEEGFSSLEELAYVPVEELLEIEGFDEETV 402
Query: 256 EEEKKKGKKKKEKEEEEKEENL 277
EE +++ K+ E E +EE L
Sbjct: 403 EELRERAKEALETEALAQEEKL 424
>gnl|CDD|219111 pfam06625, DUF1151, Protein of unknown function (DUF1151). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 122
Score = 26.3 bits (58), Expect = 9.1
Identities = 12/37 (32%), Positives = 25/37 (67%)
Query: 241 LEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENL 277
LE+ + ++ KE++EEEE K+ + + E+E ++ + L
Sbjct: 57 LEKRKRDQVLKEQKEEEEAKRLQSELERELMKRAQKL 93
>gnl|CDD|146486 pfam03879, Cgr1, Cgr1 family. Members of this family are
coiled-coil proteins that are involved in pre-rRNA
processing.
Length = 105
Score = 26.2 bits (58), Expect = 9.2
Identities = 15/54 (27%), Positives = 29/54 (53%), Gaps = 3/54 (5%)
Query: 225 YLLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKE-KEEEEKEENL 277
+ R E R ++ E+E ++E KE E + + K+++ KEE+E+ E +
Sbjct: 30 WEKRMEKRLEQQAIKAREKELKDE--KEAERQRRIQAIKERRAAKEEKERYEKM 81
>gnl|CDD|224337 COG1419, FlhF, Flagellar GTP-binding protein [Cell motility and
secretion].
Length = 407
Score = 27.3 bits (61), Expect = 9.3
Identities = 8/32 (25%), Positives = 14/32 (43%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEK 273
+ E E + +E K G++ E +EK
Sbjct: 96 KIERSTPSLIERKTQEVKDSGEEIAEMMRDEK 127
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 27.4 bits (61), Expect = 9.3
Identities = 10/38 (26%), Positives = 19/38 (50%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEENLVK 279
+ E K +EE E K+K ++ +EK + +E+
Sbjct: 35 DAAAYENWKKRQEEAEAKRKREELREKIAKAREKRERN 72
>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
(DUF1754). This is a eukaryotic protein family of
unknown function.
Length = 90
Score = 25.8 bits (57), Expect = 9.5
Identities = 9/17 (52%), Positives = 13/17 (76%)
Query: 259 KKKGKKKKEKEEEEKEE 275
KKK KKKK+K + ++E
Sbjct: 20 KKKKKKKKKKNKSKEEV 36
>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2. PPP4R2 (protein phosphatase 4 core
regulatory subunit R2) is the regulatory subunit of the
histone H2A phosphatase complex. It has been shown to
confer resistance to the anticancer drug cisplatin in
yeast, and may confer resistance in higher eukaryotes.
Length = 285
Score = 27.1 bits (60), Expect = 9.7
Identities = 11/35 (31%), Positives = 24/35 (68%)
Query: 242 EQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
++ ++EE ++ + E+K+ K+ +E+EE E+EE
Sbjct: 245 KKSDDEEDDDQDGDYVEEKELKEDEEEEETEEEEE 279
>gnl|CDD|240331 PTZ00254, PTZ00254, 40S ribosomal protein SA; Provisional.
Length = 249
Score = 26.9 bits (60), Expect = 9.9
Identities = 14/50 (28%), Positives = 21/50 (42%)
Query: 226 LLRNEARKINLSLHYLEQEEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEE 275
L R+E + + L + EE E+ +E E + E EE EE
Sbjct: 189 LPRDEEWDVMVDLFFWRDPEEAEEKEEAAAETAGVEDAAAAAAEAEEGEE 238
>gnl|CDD|216186 pfam00910, RNA_helicase, RNA helicase. This family includes RNA
helicases thought to be involved in duplex unwinding
during viral RNA replication. Members of this family
are found in a variety of single stranded RNA viruses.
Length = 105
Score = 26.0 bits (58), Expect = 9.9
Identities = 11/36 (30%), Positives = 16/36 (44%), Gaps = 1/36 (2%)
Query: 63 LLAYGQTGGGKTYTVSAMIMKTLQHVMQRCNKDDVY 98
+ YG G GK+ + L+H + KD VY
Sbjct: 1 IWLYGPPGCGKSTLAKYLARALLKH-LGLPKKDSVY 35
>gnl|CDD|203444 pfam06424, PRP1_N, PRP1 splicing factor, N-terminal. This domain
is specific to the N-terminal part of the prp1 splicing
factor, which is involved in mRNA splicing (and possibly
also poly(A)+ RNA nuclear export and cell cycle
progression). This domain is specific to the N terminus
of the RNA splicing factor encoded by prp1. It is
involved in mRNA splicing and possibly also poly(A)and
RNA nuclear export and cell cycle progression.
Length = 131
Score = 26.5 bits (59), Expect = 10.0
Identities = 12/33 (36%), Positives = 22/33 (66%)
Query: 244 EEEEEKGKEEEEEEEKKKGKKKKEKEEEEKEEN 276
+E+EE + E +E+ ++KK +E++EKEE
Sbjct: 58 DEDEEADRIYESIDERMDERRKKRREQKEKEEI 90
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.312 0.130 0.362
Gapped
Lambda K H
0.267 0.0812 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 13,822,769
Number of extensions: 1317678
Number of successful extensions: 10795
Number of sequences better than 10.0: 1
Number of HSP's gapped: 8134
Number of HSP's successfully gapped: 1167
Length of query: 279
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 183
Effective length of database: 6,679,618
Effective search space: 1222370094
Effective search space used: 1222370094
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 58 (26.0 bits)