RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy11229
(705 letters)
>gnl|CDD|238667 cd01371, KISc_KIF3, Kinesin motor domain, kinesins II or KIF3_like
proteins. Subgroup of kinesins, which form heterotrimers
composed of 2 kinesins and one non-motor accessory
subunit. Kinesins II play important roles in ciliary
transport, and have been implicated in neuronal
transport, melanosome transport, the secretory pathway,
and mitosis. This catalytic (head) domain has ATPase
activity and belongs to the larger group of P-loop
NTPases. Kinesins are microtubule-dependent molecular
motors that play important roles in intracellular
transport and in cell division. In this group the motor
domain is found at the N-terminus (N-type). N-type
kinesins are (+) end-directed motors, i.e. they
transport cargo towards the (+) end of the microtubule.
Kinesin motor domains hydrolyze ATP at a rate of about
80 per second, and move along the microtubule at a speed
of about 6400 Angstroms per second. To achieve that,
kinesin head groups work in pairs. Upon replacing ADP
with ATP, a kinesin motor domain increases its affinity
for microtubule binding and locks in place. Also, the
neck linker binds to the motor domain, which repositions
the other head domain through the coiled-coil domain
close to a second tubulin dimer, about 80 Angstroms
along the microtubule. Meanwhile, ATP hydrolysis takes
place, and when the second head domain binds to the
microtubule, the first domain again replaces ADP with
ATP, triggering a conformational change that pulls the
first domain forward.
Length = 333
Score = 545 bits (1407), Expect = 0.0
Identities = 239/394 (60%), Positives = 280/394 (71%), Gaps = 61/394 (15%)
Query: 16 ENVRVFVRVRPLSKKESDAGHVDIATVDLLNGIITVKNPAGSAHEPPKTFTFDTVTPYST 75
ENV+V VR RPL+K+E G +I VD G +TV NP A EPPK FTFD V
Sbjct: 1 ENVKVVVRCRPLNKREKSEGAPEIVGVDENRGQVTVHNPKADAKEPPKVFTFDAV----- 55
Query: 76 VDLLNGIITVKNPAGSAHEPPKTFTFDTVFDANSKQLDIYNETARPIVDKVLSGFNGTIF 135
+D NS Q D+YNETARP+VD VL G+NGTIF
Sbjct: 56 -----------------------------YDPNSTQEDVYNETARPLVDSVLEGYNGTIF 86
Query: 136 AYGQTGTGKTFTMEGVNNVPELKGIIPNSFAHIFGHIAKADESVKFLVRVSYFEIYNEEI 195
AYGQTGTGKTFTMEGV PEL+GIIPNSFAHIFGHIAKA E+V+FLVRVSY EIYNEE+
Sbjct: 87 AYGQTGTGKTFTMEGVREPPELRGIIPNSFAHIFGHIAKA-ENVQFLVRVSYLEIYNEEV 145
Query: 196 RDLLSKNKDQCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANNADDMEKIMSLGNK 255
RDLL K++ + LE+KERPD GVY VKDLS ++ NA++M+K+M+LGNK
Sbjct: 146 RDLLGKDQKKKLELKERPDRGVY-------------VKDLSMFVVKNAEEMDKLMTLGNK 192
Query: 256 NRSVGATAMNEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQGSVQMGRLHLVDL 315
NRSVGAT MNE SSRSH+IF+I IE S GE+ +++G+L+LVDL
Sbjct: 193 NRSVGATNMNEDSSRSHSIFTITIECS-------EKGEDGE------NHIRVGKLNLVDL 239
Query: 316 AGSERQAKTGASGQRLKEASKINLSLSTLGNVISALVDGKCTHIPYRNSKLTRILQDSLG 375
AGSERQ+KTGA+G RLKEA+KINLSLS LGNVISALVDGK THIPYR+SKLTR+LQDSLG
Sbjct: 240 AGSERQSKTGATGDRLKEATKINLSLSALGNVISALVDGKSTHIPYRDSKLTRLLQDSLG 299
Query: 376 GNSKTVMCATVGPASYNYEETISTLRYASRVKKI 409
GNSKTVMCA +GPA YNY+ET+STLRYA+R K I
Sbjct: 300 GNSKTVMCANIGPADYNYDETLSTLRYANRAKNI 333
>gnl|CDD|215803 pfam00225, Kinesin, Kinesin motor domain.
Length = 326
Score = 445 bits (1147), Expect = e-153
Identities = 172/389 (44%), Positives = 224/389 (57%), Gaps = 65/389 (16%)
Query: 23 RVRPLSKKESDAGHVDIATVDLLNGIITVKNPAGSAHEPPKTFTFDTVTPYSTVDLLNGI 82
RVRPL+++E G DI VD +
Sbjct: 1 RVRPLNEREKSRGSSDIVNVDETDSEDKESVV---------------------------- 32
Query: 83 ITVKNPAGSAHEPPKTFTFDTVFDANSKQLDIYNETARPIVDKVLSGFNGTIFAYGQTGT 142
+ KTFTFD VFD + Q +Y ETA+P+V+ VL G+N TIFAYGQTG+
Sbjct: 33 -------ITNKGREKTFTFDRVFDPEATQEFVYEETAKPLVESVLEGYNVTIFAYGQTGS 85
Query: 143 GKTFTMEGVNNVPELKGIIPNSFAHIFGHIAK-ADESVKFLVRVSYFEIYNEEIRDLLSK 201
GKT+TMEG + E GIIP + +F I + + F VRVSY EIYNE+I DLLS
Sbjct: 86 GKTYTMEG-DPPEEEPGIIPRALEDLFERIEERKERKWSFSVRVSYLEIYNEKIYDLLSP 144
Query: 202 NKDQCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANNADDMEKIMSLGNKNRSVGA 261
+ + L ++E P GVYVK L+ +A+++ +++ LGNKNR+V +
Sbjct: 145 SSKKKK------------LRIREDPKKGVYVKGLTEVEVTSAEEVLELLELGNKNRTVAS 192
Query: 262 TAMNEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQGSVQMGRLHLVDLAGSERQ 321
T MNE SSRSHAIF+I +E + G+L+LVDLAGSER
Sbjct: 193 TNMNEESSRSHAIFTITVEQRNRDTDGSVK---------------TGKLNLVDLAGSERA 237
Query: 322 AKTG-ASGQRLKEASKINLSLSTLGNVISALVDGKCTHIPYRNSKLTRILQDSLGGNSKT 380
+KTG A G+RLKEA+ IN SLS LGNVI+AL + + +HIPYR+SKLTR+LQDSLGGNSKT
Sbjct: 238 SKTGAAGGRRLKEAANINKSLSALGNVINALAENQSSHIPYRDSKLTRLLQDSLGGNSKT 297
Query: 381 VMCATVGPASYNYEETISTLRYASRVKKI 409
+M A + P+ NYEET+STLR+ASR K I
Sbjct: 298 LMIANISPSDSNYEETLSTLRFASRAKNI 326
>gnl|CDD|214526 smart00129, KISc, Kinesin motor, catalytic domain. ATPase.
Microtubule-dependent molecular motors that play
important roles in intracellular transport of organelles
and in cell division.
Length = 335
Score = 444 bits (1144), Expect = e-152
Identities = 182/401 (45%), Positives = 230/401 (57%), Gaps = 67/401 (16%)
Query: 17 NVRVFVRVRPLSKKESDAGHVDIATVDLLNGIITVKNPAGSAHEPPKTFTFDTVTPYSTV 76
N+RV VRVRPL+K+E P F
Sbjct: 1 NIRVVVRVRPLNKREKSRKS-------------------------PSVVPFPD------- 28
Query: 77 DLLNGIITVKNPAGSAHEPPKTFTFDTVFDANSKQLDIYNETARPIVDKVLSGFNGTIFA 136
T+ + + K FTFD VFDA + Q D++ ETA P+VD VL G+N TIFA
Sbjct: 29 ---KVGKTLTVRSPKNRQGEKKFTFDKVFDATASQEDVFEETAAPLVDSVLEGYNATIFA 85
Query: 137 YGQTGTGKTFTMEGVNNVPELKGIIPNSFAHIFGHIAKADESVKFLVRVSYFEIYNEEIR 196
YGQTG+GKT+TM G P+ GIIP + +F I K +E +F V+VSY EIYNE+IR
Sbjct: 86 YGQTGSGKTYTMIGT---PDSPGIIPRALKDLFEKIDKREEGWQFSVKVSYLEIYNEKIR 142
Query: 197 DLLSKNKDQCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANNADDMEKIMSLGNKN 256
DLL+ + LE++E GVYVK L+ ++ +++ ++ GNKN
Sbjct: 143 DLLNPSSK--------------KLEIREDEKGGVYVKGLTEISVSSFEEVYNLLEKGNKN 188
Query: 257 RSVGATAMNEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQGSVQMGRLHLVDLA 316
R+V AT MNE SSRSHA+F+I +E K + GS + +L+LVDLA
Sbjct: 189 RTVAATKMNEESSRSHAVFTITVE------QKIKNSSS--------GSGKASKLNLVDLA 234
Query: 317 GSERQAKTGASGQRLKEASKINLSLSTLGNVISALVD-GKCTHIPYRNSKLTRILQDSLG 375
GSER KTGA G RLKEA IN SLS LGNVI+AL K HIPYR+SKLTR+LQDSLG
Sbjct: 235 GSERAKKTGAEGDRLKEAGNINKSLSALGNVINALAQHSKSRHIPYRDSKLTRLLQDSLG 294
Query: 376 GNSKTVMCATVGPASYNYEETISTLRYASRVKKICNKARVN 416
GNSKT+M A V P+S N EET+STLR+ASR K+I NK VN
Sbjct: 295 GNSKTLMIANVSPSSSNLEETLSTLRFASRAKEIKNKPIVN 335
>gnl|CDD|238054 cd00106, KISc, Kinesin motor domain. This catalytic (head) domain
has ATPase activity and belongs to the larger group of
P-loop NTPases. Kinesins are microtubule-dependent
molecular motors that play important roles in
intracellular transport and in cell division. In most
kinesins, the motor domain is found at the N-terminus
(N-type), in some its is found in the middle (M-type),
or C-terminal (C-type). N-type and M-type kinesins are
(+) end-directed motors, while C-type kinesins are (-)
end-directed motors, i.e. they transport cargo towards
the (-) end of the microtubule. Kinesin motor domains
hydrolyze ATP at a rate of about 80 per second, and move
along the microtubule at a speed of about 6400 Angstroms
per second. To achieve that, kinesin head groups work in
pairs. Upon replacing ADP with ATP, a kinesin motor
domain increases its affinity for microtubule binding
and locks in place. Also, the neck linker binds to the
motor domain, which repositions the other head domain
through the coiled-coil domain close to a second tubulin
dimer, about 80 Angstroms along the microtubule.
Meanwhile, ATP hydrolysis takes place, and when the
second head domain binds to the microtubule, the first
domain again replaces ADP with ATP, triggering a
conformational change that pulls the first domain
forward.
Length = 328
Score = 420 bits (1081), Expect = e-143
Identities = 180/393 (45%), Positives = 225/393 (57%), Gaps = 67/393 (17%)
Query: 17 NVRVFVRVRPLSKKESDAGHVDIATVDLLNGIITVKNPAGSAHEPPKTFTFDTVTPYSTV 76
N+RV VR+RPL+ +ES T D TV
Sbjct: 1 NIRVVVRIRPLNGRESK--------------------------SEESCITVD---DNKTV 31
Query: 77 DLLNGIITVKNPAGSAHEPPKTFTFDTVFDANSKQLDIYNETARPIVDKVLSGFNGTIFA 136
T+ P PK+FTFD VFD NS Q D+Y TA+P+V+ VL G+NGTIFA
Sbjct: 32 -------TLTPPKDGRKAGPKSFTFDHVFDPNSTQEDVYETTAKPLVESVLEGYNGTIFA 84
Query: 137 YGQTGTGKTFTMEGVNNVPELKGIIPNSFAHIFGHIAKADES-VKFLVRVSYFEIYNEEI 195
YGQTG+GKT+TM G P+ GIIP + +F I + E F V VSY EIYNE++
Sbjct: 85 YGQTGSGKTYTMFG---SPKDPGIIPRALEDLFNLIDERKEKNKSFSVSVSYLEIYNEKV 141
Query: 196 RDLLSKNKDQCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANNADDMEKIMSLGNK 255
DLLS E +P L ++E P GVYVK L+ +A+D ++ G K
Sbjct: 142 YDLLSP------EPPSKP------LSLREDPKGGVYVKGLTEVEVGSAEDALSLLQKGLK 189
Query: 256 NRSVGATAMNEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQGSVQMGRLHLVDL 315
NR+ +TAMNE SSRSHAIF+I +E + R+ + +L+LVDL
Sbjct: 190 NRTTASTAMNERSSRSHAIFTIHVEQRNTTNDGRSI--------------KSSKLNLVDL 235
Query: 316 AGSERQAKTGASGQRLKEASKINLSLSTLGNVISALVDGKC-THIPYRNSKLTRILQDSL 374
AGSER KTGA G RLKEA IN SLS LGNVISAL G+ HIPYR+SKLTR+LQDSL
Sbjct: 236 AGSERAKKTGAEGDRLKEAKNINKSLSALGNVISALSSGQKKKHIPYRDSKLTRLLQDSL 295
Query: 375 GGNSKTVMCATVGPASYNYEETISTLRYASRVK 407
GGNSKT+M A + P+S NY+ET+STLR+ASR K
Sbjct: 296 GGNSKTLMIANISPSSENYDETLSTLRFASRAK 328
>gnl|CDD|238665 cd01369, KISc_KHC_KIF5, Kinesin motor domain, kinesin heavy chain
(KHC) or KIF5-like subgroup. Members of this group have
been associated with organelle transport. This catalytic
(head) domain has ATPase activity and belongs to the
larger group of P-loop NTPases. Kinesins are
microtubule-dependent molecular motors that play
important roles in intracellular transport and in cell
division. In most kinesins, the motor domain is found at
the N-terminus (N-type). N-type kinesins are (+)
end-directed motors, i.e. they transport cargo towards
the (+) end of the microtubule. Kinesin motor domains
hydrolyze ATP at a rate of about 80 per second, and move
along the microtubule at a speed of about 6400 Angstroms
per second. To achieve that, kinesin head groups work in
pairs. Upon replacing ADP with ATP, a kinesin motor
domain increases its affinity for microtubule binding
and locks in place. Also, the neck linker binds to the
motor domain, which repositions the other head domain
through the coiled-coil domain close to a second tubulin
dimer, about 80 Angstroms along the microtubule.
Meanwhile, ATP hydrolysis takes place, and when the
second head domain binds to the microtubule, the first
domain again replaces ADP with ATP, triggering a
conformational change that pulls the first domain
forward.
Length = 325
Score = 389 bits (1001), Expect = e-131
Identities = 175/395 (44%), Positives = 228/395 (57%), Gaps = 70/395 (17%)
Query: 15 IENVRVFVRVRPLSKKESDAGHVDIATVDLLNGIITVKNPAGSAHEPPKTFTFDTVTPYS 74
N++V R RPL++KE G I T +
Sbjct: 1 ECNIKVVCRFRPLNEKEELRGSKSIVKFP-----------------GEDTVSI------- 36
Query: 75 TVDLLNGIITVKNPAGSAHEPPKTFTFDTVFDANSKQLDIYNETARPIVDKVLSGFNGTI 134
+ + KTF+FD VF N+ Q D+YN A+PIVD VL+G+NGTI
Sbjct: 37 ----------------AGSDDGKTFSFDRVFPPNTTQEDVYNFVAKPIVDDVLNGYNGTI 80
Query: 135 FAYGQTGTGKTFTMEGVNNVPELKGIIPNSFAHIFGHIAKADESVKFLVRVSYFEIYNEE 194
FAYGQTG+GKT+TMEG PELKGIIP IF HI+ DE+++F V+VSY EIY E+
Sbjct: 81 FAYGQTGSGKTYTMEGPPGDPELKGIIPRIVHDIFEHISSMDENLEFHVKVSYLEIYMEK 140
Query: 195 IRDLLSKNKDQCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANNADDMEKIMSLGN 254
IRDLL +KD L+V E + GVYVK L+ ++ +++ ++++ G
Sbjct: 141 IRDLLDVSKDN--------------LQVHEDKNRGVYVKGLTERFVSSPEEVLEVINEGK 186
Query: 255 KNRSVGATAMNEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQGSVQMGRLHLVD 314
NR+V +T MNE SSRSH+IF I ++ + G + G +L LVD
Sbjct: 187 SNRAVASTNMNEESSRSHSIFLITLKQENVETGSKKRG----------------KLFLVD 230
Query: 315 LAGSERQAKTGASGQRLKEASKINLSLSTLGNVISALVDGKCTHIPYRNSKLTRILQDSL 374
LAGSE+ +KTGA GQ L+EA KIN SLS LGNVI+AL DGK THIPYR+SKLTRILQDSL
Sbjct: 231 LAGSEKVSKTGAEGQTLEEAKKINKSLSALGNVINALTDGKSTHIPYRDSKLTRILQDSL 290
Query: 375 GGNSKTVMCATVGPASYNYEETISTLRYASRVKKI 409
GGNS+T + P+SYN ET+STLR+ +R K I
Sbjct: 291 GGNSRTTLIICCSPSSYNESETLSTLRFGARAKTI 325
>gnl|CDD|238662 cd01366, KISc_C_terminal, Kinesin motor domain,
KIFC2/KIFC3/ncd-like carboxy-terminal kinesins. Ncd is a
spindle motor protein necessary for chromosome
segregation in meiosis. KIFC2/KIFC3-like kinesins have
been implicated in motility of the Golgi apparatus as
well as dentritic and axonal transport in neurons. This
catalytic (head) domain has ATPase activity and belongs
to the larger group of P-loop NTPases. Kinesins are
microtubule-dependent molecular motors that play
important roles in intracellular transport and in cell
division. In this subgroup the motor domain is found at
the C-terminus (C-type). C-type kinesins are (-)
end-directed motors, i.e. they transport cargo towards
the (-) end of the microtubule. Kinesin motor domains
hydrolyze ATP at a rate of about 80 per second, and move
along the microtubule at a speed of about 6400 Angstroms
per second. To achieve that, kinesin head groups work in
pairs. Upon replacing ADP with ATP, a kinesin motor
domain increases its affinity for microtubule binding
and locks in place. Also, the neck linker binds to the
motor domain, which repositions the other head domain
through the coiled-coil domain close to a second tubulin
dimer, about 80 Angstroms along the microtubule.
Meanwhile, ATP hydrolysis takes place, and when the
second head domain binds to the microtubule, the first
domain again replaces ADP with ATP, triggering a
conformational change that pulls the first domain
forward.
Length = 329
Score = 367 bits (945), Expect = e-122
Identities = 155/395 (39%), Positives = 213/395 (53%), Gaps = 73/395 (18%)
Query: 17 NVRVFVRVRPLSKKESDAGHVDIATVDLLNGIITVKNPAGSAHEPPKTFTFDTVTPYSTV 76
N+RVF RVRPL ES E +F
Sbjct: 3 NIRVFCRVRPLLPSEST--------------------------EYSSVISFPDED----- 31
Query: 77 DLLNGIITVKNPAGSAHEPPKTFTFDTVFDANSKQLDIYNETARPIVDKVLSGFNGTIFA 136
G I + G K+F+FD VFD ++ Q D++ E + P+V L G+N IFA
Sbjct: 32 ---GGTIELSKGTGK----KKSFSFDRVFDPDASQEDVFEEVS-PLVQSALDGYNVCIFA 83
Query: 137 YGQTGTGKTFTMEGVNNVPELKGIIPNSFAHIFGHIAKADESV--KFLVRVSYFEIYNEE 194
YGQTG+GKT+TMEG PE GIIP + +F + A+ + + + S EIYNE
Sbjct: 84 YGQTGSGKTYTMEGP---PENPGIIPRALEQLF-NTAEELKEKGWSYTITASMLEIYNET 139
Query: 195 IRDLLSKNKDQCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANNADDMEKIMSLGN 254
IRDLL+ + LE+K YV +L+ ++ +++ ++++LG+
Sbjct: 140 IRDLLATKPAPKKK-----------LEIKHDSKGETYVTNLTEVPVSSPEEVTRLLNLGS 188
Query: 255 KNRSVGATAMNEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQGSVQMGRLHLVD 314
KNRSV +T MNE SSRSHA+F + I ++ G++T G +L+LVD
Sbjct: 189 KNRSVASTNMNEHSSRSHAVFQLKIRGTNLQTGEQTRG----------------KLNLVD 232
Query: 315 LAGSERQAKTGASGQRLKEASKINLSLSTLGNVISALVDGKCTHIPYRNSKLTRILQDSL 374
LAGSER K+GA+G RLKEA IN SLS LG+VISAL +H+PYRNSKLT +LQDSL
Sbjct: 233 LAGSERLKKSGATGDRLKEAQAINKSLSALGDVISALRSKD-SHVPYRNSKLTYLLQDSL 291
Query: 375 GGNSKTVMCATVGPASYNYEETISTLRYASRVKKI 409
GGNSKT+M + P N ET+ +LR+ASRV+ +
Sbjct: 292 GGNSKTLMFVNISPLESNLSETLCSLRFASRVRSV 326
>gnl|CDD|238661 cd01365, KISc_KIF1A_KIF1B, Kinesin motor domain, KIF1_like
proteins. KIF1A (Unc104) transports synaptic vesicles to
the nerve terminal, KIF1B has been implicated in
transport of mitochondria. Both proteins are expressed
in neurons. This catalytic (head) domain has ATPase
activity and belongs to the larger group of P-loop
NTPases. Kinesins are microtubule-dependent molecular
motors that play important roles in intracellular
transport and in cell division. In most kinesins, the
motor domain is found at the N-terminus (N-type). N-type
kinesins are (+) end-directed motors, i.e. they
transport cargo towards the (+) end of the microtubule.
In contrast to the majority of dimeric kinesins, most
KIF1A/Unc104 kinesins are monomeric motors. A
lysine-rich loop in KIF1A binds to the negatively
charged C-terminus of tubulin and compensates for the
lack of a second motor domain, allowing KIF1A to move
processively.
Length = 356
Score = 359 bits (923), Expect = e-118
Identities = 162/416 (38%), Positives = 231/416 (55%), Gaps = 75/416 (18%)
Query: 16 ENVRVFVRVRPLSKKESDAGHVDIATVDLLNGIITVKNP--AGSAHEPPKTFTFDTVTPY 73
NV+V VRVRP + +E + G I V + + T+KNP A + + PK+F+FD Y
Sbjct: 1 ANVKVAVRVRPFNSREKNRGSKCI--VQMPGKVTTLKNPKAADATRKKPKSFSFDHS--Y 56
Query: 74 STVDLLNGIITVKNPAGSAHEPPKTFTFDTVFDANSKQLDIYNETARPIVDKVLSGFNGT 133
+ D + + + Q D++ + R ++D G+N
Sbjct: 57 WSHD------------------SEDPHYAS-------QEDVFEDLGRELLDHAFEGYNVC 91
Query: 134 IFAYGQTGTGKTFTMEGVNNVPELKGIIPNSFAHIFGHIAKA-DESVKFLVRVSYFEIYN 192
+FAYGQTG+GK++TM G E KGIIP +F I ++++ + V VSY EIYN
Sbjct: 92 LFAYGQTGSGKSYTMMG---YKEEKGIIPRLCEELFQRIESKKEQNLSYEVEVSYMEIYN 148
Query: 193 EEIRDLLSKNKDQCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANNADDMEKIMSL 252
E++RDLL+ K L+V+E P +G YV+DLS + +D++ ++
Sbjct: 149 EKVRDLLNPKKKN-----------KGNLKVREHPVLGPYVEDLSKVAVTSYEDIQNLLEE 197
Query: 253 GNKNRSVGATAMNEVSSRSHAIFSII-----IETSSGLGGKRTTGENWRLWAEAQGSVQM 307
GNK+R+ +T MN+ SSRSHA+F+I+ ++ + L ++ +
Sbjct: 198 GNKSRTTASTNMNDTSSRSHAVFTIVLTQKKLDKETDLTTEKVS---------------- 241
Query: 308 GRLHLVDLAGSERQAKTGASGQRLKEASKINLSLSTLGNVISALVD-------GKCTHIP 360
++ LVDLAGSER + TGA G RLKE S IN SL+TLG VISAL D K + IP
Sbjct: 242 -KISLVDLAGSERASSTGAEGDRLKEGSNINKSLTTLGKVISALADNSSAKSKKKSSFIP 300
Query: 361 YRNSKLTRILQDSLGGNSKTVMCATVGPASYNYEETISTLRYASRVKKICNKARVN 416
YR+S LT +L+++LGGNSKT M AT+ PA NYEET+STLRYA R KKI N A VN
Sbjct: 301 YRDSVLTWLLKENLGGNSKTAMIATISPADINYEETLSTLRYADRAKKIVNVAVVN 356
>gnl|CDD|238668 cd01372, KISc_KIF4, Kinesin motor domain, KIF4-like subfamily.
Members of this group seem to perform a variety of
functions, and have been implicated in neuronal
organelle transport and chromosome segregation during
mitosis. This catalytic (head) domain has ATPase
activity and belongs to the larger group of P-loop
NTPases. Kinesins are microtubule-dependent molecular
motors that play important roles in intracellular
transport and in cell division. In most kinesins, the
motor domain is found at the N-terminus (N-type). N-type
kinesins are (+) end-directed motors, i.e. they
transport cargo towards the (+) end of the microtubule.
Kinesin motor domains hydrolyze ATP at a rate of about
80 per second, and move along the microtubule at a speed
of about 6400 Angstroms per second. To achieve that,
kinesin head groups work in pairs. Upon replacing ADP
with ATP, a kinesin motor domain increases its affinity
for microtubule binding and locks in place. Also, the
neck linker binds to the motor domain, which repositions
the other head domain through the coiled-coil domain
close to a second tubulin dimer, about 80 Angstroms
along the microtubule. Meanwhile, ATP hydrolysis takes
place, and when the second head domain binds to the
microtubule, the first domain again replaces ADP with
ATP, triggering a conformational change that pulls the
first domain forward.
Length = 341
Score = 357 bits (918), Expect = e-118
Identities = 155/397 (39%), Positives = 210/397 (52%), Gaps = 64/397 (16%)
Query: 18 VRVFVRVRPLSKKESDAGHVDIATVDLLNGIITVKNPAGSAHEPPKTFTFDTVTPYSTVD 77
VRV VRVRPL KE G + + P TV
Sbjct: 3 VRVAVRVRPLLPKELLEGC-QVCV------------------------SVVPGEPQVTV- 36
Query: 78 LLNGIITVKNPAGSAHEPPKTFTFDTVFDANSKQLDIYNETARPIVDKVLSGFNGTIFAY 137
K+FTFD VFD ++ Q ++YN P+VD + G+N T+ AY
Sbjct: 37 ----------------GTDKSFTFDYVFDPSTSQEEVYNTCVAPLVDGLFEGYNATVLAY 80
Query: 138 GQTGTGKTFTMEGVNNVPEL---KGIIPNSFAHIFGHIAKADESVKFLVRVSYFEIYNEE 194
GQTG+GKT+TM E GIIP + HIF I + + F ++VS+ E+YNEE
Sbjct: 81 GQTGSGKTYTMGTAFTASEDEEEVGIIPRAIQHIFKKIDEKKDEPDFQLKVSFLELYNEE 140
Query: 195 IRDLLSKNKDQCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANNADDMEKIMSLGN 254
+RDLLS + E+ I +++E + + L+ N+A ++ + G+
Sbjct: 141 VRDLLSPSTS------EKSPI-----QIREDSKGNIIIVGLTEVTVNSAQEVMSCLEQGS 189
Query: 255 KNRSVGATAMNEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQGSVQMGRLHLVD 314
+R+ +TAMN SSRSHAIF+I +E R G + + + S + H VD
Sbjct: 190 LSRTTASTAMNSQSSRSHAIFTITLEQ------TRKNGPIAPMSGDDKNSTLTSKFHFVD 243
Query: 315 LAGSERQAKTGASGQRLKEASKINLSLSTLGNVISALVD--GKCTHIPYRNSKLTRILQD 372
LAGSER KTGA+G RLKE IN L LGNVISAL D K +H+PYR+SKLTR+LQD
Sbjct: 244 LAGSERLKKTGATGDRLKEGISINSGLLALGNVISALGDESKKGSHVPYRDSKLTRLLQD 303
Query: 373 SLGGNSKTVMCATVGPASYNYEETISTLRYASRVKKI 409
SLGGNS T+M A V PA N+EET++TL+YA+R + I
Sbjct: 304 SLGGNSHTLMIACVSPADSNFEETLNTLKYANRARNI 340
>gnl|CDD|238666 cd01370, KISc_KIP3_like, Kinesin motor domain, KIP3-like subgroup.
The yeast kinesin KIP3 plays a role in positioning the
mitotic spindle. This catalytic (head) domain has ATPase
activity and belongs to the larger group of P-loop
NTPases. Kinesins are microtubule-dependent molecular
motors that play important roles in intracellular
transport and in cell division. In most kinesins, the
motor domain is found at the N-terminus (N-type). N-type
kinesins are (+) end-directed motors, i.e. they
transport cargo towards the (+) end of the microtubule.
Kinesin motor domains hydrolyze ATP at a rate of about
80 per second, and move along the microtubule at a speed
of about 6400 Angstroms per second. To achieve that,
kinesin head groups work in pairs. Upon replacing ADP
with ATP, a kinesin motor domain increases its affinity
for microtubule binding and locks in place. Also, the
neck linker binds to the motor domain, which repositions
the other head domain through the coiled-coil domain
close to a second tubulin dimer, about 80 Angstroms
along the microtubule. Meanwhile, ATP hydrolysis takes
place, and when the second head domain binds to the
microtubule, the first domain again replaces ADP with
ATP, triggering a conformational change that pulls the
first domain forward.
Length = 338
Score = 346 bits (889), Expect = e-114
Identities = 155/395 (39%), Positives = 219/395 (55%), Gaps = 59/395 (14%)
Query: 17 NVRVFVRVRPLSKKESDAGHVDIATVDLLNGIITVKNPAGSAHEPPKTFTFDTVTPYSTV 76
++ V VRVRP ++KE G + V +++ + V FD P
Sbjct: 1 SLTVAVRVRPFNEKEKQEGTRRV--VKVVDDRMLV---------------FD---PKDEE 40
Query: 77 DLLNGIITVKNPAGSAHEPPKTFTFDTVFDANSKQLDIYNETARPIVDKVLSGFNGTIFA 136
D + +N ++FD VFD S Q ++Y T +P+VD VL+G+N T+FA
Sbjct: 41 DAFRNLRARRNK-------ELKYSFDRVFDETSTQEEVYENTTKPLVDGVLNGYNATVFA 93
Query: 137 YGQTGTGKTFTMEGVNNVPELKGIIPNSFAHIFGHIAKADESVKFLVRVSYFEIYNEEIR 196
YG TG GKT TM G ++ P G++ + +F I + + +F V +SY EIYNE IR
Sbjct: 94 YGATGAGKTHTMLGTDSDP---GLMVLTMKDLFDKIEERKDDKEFEVSLSYLEIYNETIR 150
Query: 197 DLLSKNKDQCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANNADDMEKIMSLGNKN 256
DLLS + LE++E P+ G+ V L+ + +A+++ +++ GN+N
Sbjct: 151 DLLSPSSGP--------------LELREDPNQGIVVAGLTEHQPKSAEEILELLMKGNRN 196
Query: 257 RSVGATAMNEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQGSVQMGRLHLVDLA 316
R+ T N SSRSHA+ I + RT N V++G+L L+DLA
Sbjct: 197 RTQEPTEANATSSRSHAVLQITVRQKD-----RTASIN--------QQVRIGKLSLIDLA 243
Query: 317 GSERQAKTGASGQRLKEASKINLSLSTLGNVISALVDGKC--THIPYRNSKLTRILQDSL 374
GSER + T GQRLKE + IN SL LGN I+ALVDGK HIPYR+SKLTR+L+DSL
Sbjct: 244 GSERASATNNRGQRLKEGANINRSLLALGNCINALVDGKKKNKHIPYRDSKLTRLLKDSL 303
Query: 375 GGNSKTVMCATVGPASYNYEETISTLRYASRVKKI 409
GGN KTVM A + P+S +YEET +TL+YA+R K I
Sbjct: 304 GGNCKTVMIANISPSSSHYEETHNTLKYANRAKNI 338
>gnl|CDD|238660 cd01364, KISc_BimC_Eg5, Kinesin motor domain, BimC/Eg5 spindle pole
proteins, participate in spindle assembly and chromosome
segregation during cell division. This catalytic (head)
domain has ATPase activity and belongs to the larger
group of P-loop NTPases. Kinesins are
microtubule-dependent molecular motors that play
important roles in intracellular transport and in cell
division. In most kinesins, the motor domain is found at
the N-terminus (N-type), N-type kinesins are (+)
end-directed motors, i.e. they transport cargo towards
the (+) end of the microtubule. Kinesin motor domains
hydrolyze ATP at a rate of about 80 per second, and move
along the microtubule at a speed of about 6400 Angstroms
per second. To achieve that, kinesin head groups work in
pairs. Upon replacing ADP with ATP, a kinesin motor
domain increases its affinity for microtubule binding
and locks in place. Also, the neck linker binds to the
motor domain, which repositions the other head domain
through the coiled-coil domain close to a second tubulin
dimer, about 80 Angstroms along the microtubule.
Meanwhile, ATP hydrolysis takes place, and when the
second head domain binds to the microtubule, the first
domain again replaces ADP with ATP, triggering a
conformational change that pulls the first domain
forward.
Length = 352
Score = 344 bits (885), Expect = e-113
Identities = 162/411 (39%), Positives = 216/411 (52%), Gaps = 70/411 (17%)
Query: 17 NVRVFVRVRPLSKKESDAG-HVDIATVDLLNGIITVKNPAGSAHEPPKTFTFDTVTPYST 75
N++V VR RP + +E V + II A ST
Sbjct: 3 NIQVVVRCRPRNSRERKEKSSVVVEVSGSSKEIIVSTGGADK---------------QST 47
Query: 76 VDLLNGIITVKNPAGSAHEPPKTFTFDTVFDANSKQLDIYNETARPIVDKVLSGFNGTIF 135
KT+TFD VF + Q+++Y++ PI+D+VL G+N TIF
Sbjct: 48 ---------------------KTYTFDKVFGPEADQIEVYSQVVSPILDEVLMGYNCTIF 86
Query: 136 AYGQTGTGKTFTMEGVNN--------VPELKGIIPNSFAHIFGHIAKADESVKFLVRVSY 187
AYGQTGTGKT+TMEG + GIIP + +F + + ++ V+VSY
Sbjct: 87 AYGQTGTGKTYTMEGDRTDNKGSTWELSPHAGIIPRALYQLFEKLESQNT--EYSVKVSY 144
Query: 188 FEIYNEEIRDLLSKNKDQCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANNADDME 247
E+YNEE+ DLLS D + ++ + GV ++ L NNA++
Sbjct: 145 LELYNEELFDLLSSESD------LNKPLRIFDDTNNKG---GVVIQGLEEITVNNANEGL 195
Query: 248 KIMSLGNKNRSVGATAMNEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQGSVQM 307
K++ G+ R AT MN+ SSRSH+IFSI I K TT L V++
Sbjct: 196 KLLEKGSAKRKTAATLMNDQSSRSHSIFSITIHI------KETTISGEEL-------VKI 242
Query: 308 GRLHLVDLAGSERQAKTGASGQRLKEASKINLSLSTLGNVISALVDGKCTHIPYRNSKLT 367
G+L+LVDLAGSE ++GA +R +EA IN SL TLG VI+ALV+ K HIPYR SKLT
Sbjct: 243 GKLNLVDLAGSENIGRSGAENKRAREAGNINQSLLTLGRVINALVE-KSPHIPYRESKLT 301
Query: 368 RILQDSLGGNSKTVMCATVGPASYNYEETISTLRYASRVKKICNKARVNQD 418
R+LQDSLGG +KT + AT+ PAS N EET+STL YA R K I NK VNQ
Sbjct: 302 RLLQDSLGGRTKTSIIATISPASINLEETLSTLEYAHRAKNIKNKPEVNQK 352
>gnl|CDD|238670 cd01374, KISc_CENP_E, Kinesin motor domain, CENP-E/KIP2-like
subgroup, involved in chromosome movement and/or spindle
elongation during mitosis. This catalytic (head) domain
has ATPase activity and belongs to the larger group of
P-loop NTPases. Kinesins are microtubule-dependent
molecular motors that play important roles in
intracellular transport and in cell division. In most
kinesins, the motor domain is found at the N-terminus
(N-type). N-type kinesins are (+) end-directed motors,
i.e. they transport cargo towards the (+) end of the
microtubule. Kinesin motor domains hydrolyze ATP at a
rate of about 80 per second, and move along the
microtubule at a speed of about 6400 Angstroms per
second. To achieve that, kinesin head groups work in
pairs. Upon replacing ADP with ATP, a kinesin motor
domain increases its affinity for microtubule binding
and locks in place. Also, the neck linker binds to the
motor domain, which repositions the other head domain
through the coiled-coil domain close to a second tubulin
dimer, about 80 Angstroms along the microtubule.
Meanwhile, ATP hydrolysis takes place, and when the
second head domain binds to the microtubule, the first
domain again replaces ADP with ATP, triggering a
conformational change that pulls the first domain
forward.
Length = 321
Score = 331 bits (850), Expect = e-108
Identities = 157/394 (39%), Positives = 215/394 (54%), Gaps = 74/394 (18%)
Query: 17 NVRVFVRVRPLSKKESDAGHVDIATVDLLNGIITVKNPAGSAHEPPKTFTFDTVTPYSTV 76
++V VRVRPL+ +ESD V + + +T++
Sbjct: 1 KIKVSVRVRPLNPRESDNEQVAWSIDN-----------------------DNTISL---- 33
Query: 77 DLLNGIITVKNPAGSAHEPPKTFTFDTVFDANSKQLDIYNETARPIVDKVLSGFNGTIFA 136
P ++FTFD VF S ++Y A+P+V L G+NGTIFA
Sbjct: 34 --------------EESTPGQSFTFDRVFGGESTNREVYERIAKPVVRSALEGYNGTIFA 79
Query: 137 YGQTGTGKTFTMEGVNNVPELKGIIPNSFAHIFGHIAKADESVKFLVRVSYFEIYNEEIR 196
YGQT +GKTFTM G P GIIP + IF I + +FL+RVSY EIYNE+I+
Sbjct: 80 YGQTSSGKTFTMSGDEQEP---GIIPLAVRDIFQRIQDTPDR-EFLLRVSYLEIYNEKIK 135
Query: 197 DLLSKNKDQCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANNADDMEKIMSLGNKN 256
DLLS + + L ++E P+ GV V L+ I + + + ++++ G KN
Sbjct: 136 DLLSPSPQE--------------LRIREDPNKGVVVAGLTEEIVTSPEHLLQLIARGEKN 181
Query: 257 RSVGATAMNEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQGSVQMGRLHLVDLA 316
R VG T NE SSRSH IF + IE+ G+V++ L+L+DLA
Sbjct: 182 RHVGETDFNERSSRSHTIFQLTIESRERG-------------DSESGTVRVSTLNLIDLA 228
Query: 317 GSERQAKTGASGQRLKEASKINLSLSTLGNVISALVDGKC-THIPYRNSKLTRILQDSLG 375
GSER ++TGA G+R KE S IN SL TLG VIS L +GK HIPYR+SKLTRILQ SL
Sbjct: 229 GSERASQTGA-GERRKEGSFINKSLLTLGTVISKLSEGKNSGHIPYRDSKLTRILQPSLS 287
Query: 376 GNSKTVMCATVGPASYNYEETISTLRYASRVKKI 409
GN++T + T+ PAS + EET++TL++ASR KK+
Sbjct: 288 GNARTAIICTISPASSHVEETLNTLKFASRAKKV 321
>gnl|CDD|238669 cd01373, KISc_KLP2_like, Kinesin motor domain, KLP2-like subgroup.
Members of this subgroup seem to play a role in mitosis
and meiosis. This catalytic (head) domain has ATPase
activity and belongs to the larger group of P-loop
NTPases. Kinesins are microtubule-dependent molecular
motors that play important roles in intracellular
transport and in cell division. In most kinesins, the
motor domain is found at the N-terminus (N-type). N-type
kinesins are (+) end-directed motors, i.e. they
transport cargo towards the (+) end of the microtubule.
Kinesin motor domains hydrolyze ATP at a rate of about
80 per second, and move along the microtubule at a speed
of about 6400 Angstroms per second. To achieve that,
kinesin head groups work in pairs. Upon replacing ADP
with ATP, a kinesin motor domain increases its affinity
for microtubule binding and locks in place. Also, the
neck linker binds to the motor domain, which repositions
the other head domain through the coiled-coil domain
close to a second tubulin dimer, about 80 Angstroms
along the microtubule. Meanwhile, ATP hydrolysis takes
place, and when the second head domain binds to the
microtubule, the first domain again replaces ADP with
ATP, triggering a conformational change that pulls the
first domain forward.
Length = 337
Score = 295 bits (756), Expect = 3e-94
Identities = 157/410 (38%), Positives = 221/410 (53%), Gaps = 91/410 (22%)
Query: 17 NVRVFVRVRPLSKKESDAGHVDIATVDLLNGIITVKNPAGSAHEPPKTFTFDTVTPYSTV 76
V+V VR+RP ++ E+D G + ++ +P
Sbjct: 2 AVKVVVRIRPPNEIEADGGQGQCLKKLSSDTLVWHSHP---------------------- 39
Query: 77 DLLNGIITVKNPAGSAHEPPKTFTFDTVFDANSKQLDIYNETARPIVDKVLSGFNGTIFA 136
P+ FTFD V D+N+ Q D++ +P+V+ LSG+NG+IFA
Sbjct: 40 -------------------PRMFTFDHVADSNTNQEDVFQSVGKPLVEDCLSGYNGSIFA 80
Query: 137 YGQTGTGKTFTMEG-----VNNVPELKGIIPNSFAHIFGHI----AKADESVKFLVRVSY 187
YGQTG+GKT+TM G + L+G+IP F ++F I K + +KFL + S+
Sbjct: 81 YGQTGSGKTYTMMGPSSSDDESPHGLQGVIPRIFEYLFSLIQREEEKRGDGLKFLCKCSF 140
Query: 188 FEIYNEEIRDLL---SKNKDQCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANNAD 244
EIYNE+I DLL S+N L+++E GVYV++L+ ++ +
Sbjct: 141 LEIYNEQITDLLDPTSRN-----------------LKIREDIKKGVYVENLTEEYVSSYE 183
Query: 245 DMEKIMSLGNKNRSVGATAMNEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQGS 304
D+ +++ G NR V AT+MN SSRSHA+F+ IE+ W + S
Sbjct: 184 DVYQVLLKGLSNRKVAATSMNSESSRSHAVFTCTIES----------------WEKKASS 227
Query: 305 V--QMGRLHLVDLAGSERQAKTGASGQRLKEASKINLSLSTLGNVISALVD---GKCTHI 359
+ RL+LVDLAGSERQ GA G RLKEA IN SLSTLG+VI ALVD GK H+
Sbjct: 228 TNIRTSRLNLVDLAGSERQKDDGAEGVRLKEAKNINKSLSTLGHVIMALVDVAHGKQRHV 287
Query: 360 PYRNSKLTRILQDSLGGNSKTVMCATVGPASYNYEETISTLRYASRVKKI 409
PYR+SKLT +L+DSLGGN+KT + A V P+S + ET+STL++A R K I
Sbjct: 288 PYRDSKLTFLLRDSLGGNAKTTIIANVSPSSKCFGETLSTLKFAQRAKLI 337
>gnl|CDD|238671 cd01375, KISc_KIF9_like, Kinesin motor domain, KIF9-like subgroup;
might play a role in cell shape remodeling. This
catalytic (head) domain has ATPase activity and belongs
to the larger group of P-loop NTPases. Kinesins are
microtubule-dependent molecular motors that play
important roles in intracellular transport and in cell
division. In most kinesins, the motor domain is found at
the N-terminus (N-type). N-type kinesins are (+)
end-directed motors, i.e. they transport cargo towards
the (+) end of the microtubule. Kinesin motor domains
hydrolyze ATP at a rate of about 80 per second, and move
along the microtubule at a speed of about 6400 Angstroms
per second. To achieve that, kinesin head groups work in
pairs. Upon replacing ADP with ATP, a kinesin motor
domain increases its affinity for microtubule binding
and locks in place. Also, the neck linker binds to the
motor domain, which repositions the other head domain
through the coiled-coil domain close to a second tubulin
dimer, about 80 Angstroms along the microtubule.
Meanwhile, ATP hydrolysis takes place, and when the
second head domain binds to the microtubule, the first
domain again replaces ADP with ATP, triggering a
conformational change that pulls the first domain
forward.
Length = 334
Score = 287 bits (735), Expect = 4e-91
Identities = 151/390 (38%), Positives = 205/390 (52%), Gaps = 57/390 (14%)
Query: 18 VRVFVRVRPLSKKESDAGHVDIATVDLLNGIITVKNPAGSAHEPPKTFTFDTVTPYSTVD 77
++VFVRVRP T I + S + P V
Sbjct: 2 IQVFVRVRP--------------TPTKQGSSIKLGPDGKSV---------SSNLPKDLVR 38
Query: 78 LLNGIITVKNPAGSAHEPPKTFTFDTVFDANSKQLDIYNETARPIVDKVLSGFNGTIFAY 137
V N +F FD VF N+ Q ++Y A+P+VD L G+NGTIFAY
Sbjct: 39 ------GVVNNQQEDF----SFKFDGVFH-NASQEEVYETVAKPVVDSALDGYNGTIFAY 87
Query: 138 GQTGTGKTFTMEGVNNVPELKGIIPNSFAHIFGHIAKADESVKFLVRVSYFEIYNEEIRD 197
GQTG GKTFTM G + +G+IP + +F +A + + V VSY EIYNE++ D
Sbjct: 88 GQTGAGKTFTMTGGTESYKDRGLIPRALEQVFREVAM-RATKTYTVHVSYLEIYNEQLYD 146
Query: 198 LLSKNKDQCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANNADDMEKIMSLGNKNR 257
LL + + + + E + ++VK LS + A ++ ++ LG NR
Sbjct: 147 LLGDTPEALESLPA--------VTILEDSEQNIHVKGLSLHSATTEEEALNLLFLGETNR 198
Query: 258 SVGATAMNEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQGSVQMGRLHLVDLAG 317
++ T+MN+ SSRSH IF+I +E+ S G V++ +L+LVDLAG
Sbjct: 199 TIAETSMNQASSRSHCIFTIHLESRSREAGSEV--------------VRLSKLNLVDLAG 244
Query: 318 SERQAKTGASGQRLKEASKINLSLSTLGNVISALVDGKCTHIPYRNSKLTRILQDSLGGN 377
SER +KTG SGQ LKEA IN SLS L VI+AL + TH+PYRNSKLT +L+DSLGGN
Sbjct: 245 SERVSKTGVSGQVLKEAKYINKSLSFLEQVINALSEKARTHVPYRNSKLTHVLRDSLGGN 304
Query: 378 SKTVMCATVGPASYNYEETISTLRYASRVK 407
KTVM AT+ N +ET+STLR+A RV
Sbjct: 305 CKTVMLATIWVEPSNLDETLSTLRFAQRVA 334
>gnl|CDD|227392 COG5059, KIP1, Kinesin-like protein [Cytoskeleton].
Length = 568
Score = 281 bits (720), Expect = 5e-86
Identities = 168/508 (33%), Positives = 257/508 (50%), Gaps = 64/508 (12%)
Query: 68 DTVTPYSTVDLLNGIITVKNPAGSAHE-----PPKTFTFDTVFDANSKQLDIYNETARPI 122
+ + + N + +H T+ FD VF ++ Q D+Y ET +P+
Sbjct: 22 SDIKSTIRIIPGELGERLINTSKKSHVSLEKSKEGTYAFDKVFGPSATQEDVYEETIKPL 81
Query: 123 VDKVLSGFNGTIFAYGQTGTGKTFTMEGVNNVPELKGIIPNSFAHIFGHIAKADESVKFL 182
+D +L G+N T+FAYGQTG+GKT+TM G E GIIP S +F + + F
Sbjct: 82 IDSLLLGYNCTVFAYGQTGSGKTYTMSGT---EEEPGIIPLSLKELFSKLEDLSMTKDFA 138
Query: 183 VRVSYFEIYNEEIRDLLSKNKDQCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANN 242
V +SY EIYNE+I DLLS N++ L ++E +GV V L+ ++
Sbjct: 139 VSISYLEIYNEKIYDLLSPNEES--------------LNIREDSLLGVKVAGLTEKHVSS 184
Query: 243 ADDMEKIMSLGNKNRSVGATAMNEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQ 302
+++ ++ G KNR+ +T +N+ SSRSH+IF I + + + + G T
Sbjct: 185 KEEILDLLRKGEKNRTTASTEINDESSRSHSIFQIELASKNKVSGTSETS---------- 234
Query: 303 GSVQMGRLHLVDLAGSERQAKTGASGQRLKEASKINLSLSTLGNVISALVD-GKCTHIPY 361
+L LVDLAGSER A+TG G RLKE + IN SL TLGNVI+AL D K HIPY
Sbjct: 235 ------KLSLVDLAGSERAARTGNRGTRLKEGASINKSLLTLGNVINALGDKKKSGHIPY 288
Query: 362 RNSKLTRILQDSLGGNSKTVMCATVGPASYNYEETISTLRYASRVKKICNKARVNQDPKD 421
R SKLTR+LQDSLGGN T + T+ P+S ++EETI+TL++ASR K I NK +VN
Sbjct: 289 RESKLTRLLQDSLGGNCNTRVICTISPSSNSFEETINTLKFASRAKSIKNKIQVNS---- 344
Query: 422 ALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKR 481
+EI E+K L E E E+ + S+ A ++
Sbjct: 345 --SSDSSREIEEIKFDLSED----RSEI-------EILVFREQSQLSQSSLSGIFAYMQS 391
Query: 482 AQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEH--------RKQKEE 533
+ + +++ +++L+ K ++ G + + + + Q L R+++
Sbjct: 392 LKKETETLKSRIDLIMKSIISGTFERKKLLKEEGWKYKSTLQFLRIEIDRLLLLREEELS 451
Query: 534 ELKKNIQEKDAERIDMEEKYSSLQEESE 561
+ K I + + R D+ SS+ EE+
Sbjct: 452 KKKTKIHKLNKLRHDLSSLLSSIPEETS 479
Score = 31.2 bits (71), Expect = 2.0
Identities = 36/101 (35%), Positives = 51/101 (50%), Gaps = 20/101 (19%)
Query: 254 NKNRSVGATAMNEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQGSVQMGRLHLV 313
+K RS +T +N SSRSH+ F + S+ +T E L+ V
Sbjct: 488 SKLRSSASTKLNLRSSRSHSKFRDHLNGSN-----SSTKEL--------------SLNQV 528
Query: 314 DLAGSERQAKTGASGQRLKEASKINLSLSTLGNVISALVDG 354
DLAGSER+ + G+ L+E +N SLS+LG+VI AL
Sbjct: 529 DLAGSERKVSQ-SVGELLRETQSLNKSLSSLGDVIHALGSK 568
>gnl|CDD|238663 cd01367, KISc_KIF2_like, Kinesin motor domain, KIF2-like group.
KIF2 is a protein expressed in neurons, which has been
associated with axonal transport and neuron development;
alternative splice forms have been implicated in
lysosomal translocation. This catalytic (head) domain
has ATPase activity and belongs to the larger group of
P-loop NTPases. Kinesins are microtubule-dependent
molecular motors that play important roles in
intracellular transport and in cell division. In this
subgroup the motor domain is found in the middle
(M-type) of the protein chain. M-type kinesins are (+)
end-directed motors, i.e. they transport cargo towards
the (+) end of the microtubule. Kinesin motor domains
hydrolyze ATP at a rate of about 80 per second, and move
along the microtubule at a speed of about 6400 Angstroms
per second (KIF2 may be slower). To achieve that,
kinesin head groups work in pairs. Upon replacing ADP
with ATP, a kinesin motor domain increases its affinity
for microtubule binding and locks in place. Also, the
neck linker binds to the motor domain, which repositions
the other head domain through the coiled-coil domain
close to a second tubulin dimer, about 80 Angstroms
along the microtubule. Meanwhile, ATP hydrolysis takes
place, and when the second head domain binds to the
microtubule, the first domain again replaces ADP with
ATP, triggering a conformational change that pulls the
first domain forward.
Length = 322
Score = 272 bits (699), Expect = 6e-86
Identities = 136/395 (34%), Positives = 191/395 (48%), Gaps = 76/395 (19%)
Query: 16 ENVRVFVRVRPLSKKESDAGHVDIATVDLLNGIITVKNPAGSAHEP-PKTFTFDTVTPYS 74
+ V VR RPL+ KE G D+ + + N +TV HEP K
Sbjct: 1 MKITVAVRKRPLNDKELSKGETDVVSCES-NPTVTV-------HEPKTK----------- 41
Query: 75 TVDLLNGIITVKNPAGSAHEPPKTFTFDTVFDANSKQLDIYNETARPIVDKVLSGFNGTI 134
VDL I TF FD VFD ++Y T +P++ V G T
Sbjct: 42 -VDL-TKYIE-----------KHTFRFDYVFDEAVTNEEVYRSTVKPLIPHVFEGGVATC 88
Query: 135 FAYGQTGTGKTFTMEGVNNVPELKGIIPNSFAHIFGHIAKADESVKFLVRVSYFEIYNEE 194
FAYGQTG+GKT+TM G N G+ + IF +A+ ++ V VS+FEIY +
Sbjct: 89 FAYGQTGSGKTYTMLGDENQE---GLYALAARDIFRLLAQPND--DLGVTVSFFEIYGGK 143
Query: 195 IRDLLSKNKD-QCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANNADDMEKIMSLG 253
+ DLL+ K LE + V + L+ + D++ +++ G
Sbjct: 144 LFDLLNDRKRLSVLEDGKG----------------NVQIVGLTEKPVTSVDELLELIESG 187
Query: 254 NKNRSVGATAMNEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQGSVQMGRLHLV 313
N R+ G+T N+ SSRSHAI II++ + + +G+L +
Sbjct: 188 NSLRTTGSTGANDQSSRSHAILQIILKN-------------------KKLNKLLGKLSFI 228
Query: 314 DLAGSERQAKTGASG-QRLKEASKINLSLSTLGNVISALVDGKCTHIPYRNSKLTRILQD 372
DLAGSER A T Q KE ++IN SL L I AL K H+P+R SKLT++L+D
Sbjct: 229 DLAGSERGADTSEHDRQTRKEGAEINKSLLALKECIRALASNK-AHVPFRGSKLTQVLRD 287
Query: 373 SLGGNSKTVMCATVGPASYNYEETISTLRYASRVK 407
S GNSKTVM AT+ P++ + E T++TLRYA RVK
Sbjct: 288 SFIGNSKTVMIATISPSASSCEHTLNTLRYADRVK 322
>gnl|CDD|238672 cd01376, KISc_KID_like, Kinesin motor domain, KIF22/Kid-like
subgroup. Members of this group might play a role in
regulating chromosomal movement along microtubules in
mitosis. This catalytic (head) domain has ATPase
activity and belongs to the larger group of P-loop
NTPases. Kinesins are microtubule-dependent molecular
motors that play important roles in intracellular
transport and in cell division. In most kinesins, the
motor domain is found at the N-terminus (N-type). N-type
kinesins are (+) end-directed motors, i.e. they
transport cargo towards the (+) end of the microtubule.
Kinesin motor domains hydrolyze ATP at a rate of about
80 per second, and move along the microtubule at a speed
of about 6400 Angstroms per second. To achieve that,
kinesin head groups work in pairs. Upon replacing ADP
with ATP, a kinesin motor domain increases its affinity
for microtubule binding and locks in place. Also, the
neck linker binds to the motor domain, which repositions
the other head domain through the coiled-coil domain
close to a second tubulin dimer, about 80 Angstroms
along the microtubule. Meanwhile, ATP hydrolysis takes
place, and when the second head domain binds to the
microtubule, the first domain again replaces ADP with
ATP, triggering a conformational change that pulls the
first domain forward.
Length = 319
Score = 238 bits (610), Expect = 5e-73
Identities = 134/392 (34%), Positives = 189/392 (48%), Gaps = 74/392 (18%)
Query: 17 NVRVFVRVRPLSKKESDAGHVDIATVDLLNGIITVKNPAGSAHEPPKTFTFDTVTPYSTV 76
NVRV VRVRP E D + + GI + + A S
Sbjct: 1 NVRVVVRVRPFLDCEED-------SSSCVRGIDSDQGQAKS------------------- 34
Query: 77 DLLNGIITVKNPAGSAHEPPKTFTFDTVFDANSKQLDIYNETARPIVDKVLSGFNGTIFA 136
+ ++NP K + FD + Q DI++ +PIV +LSG N T+FA
Sbjct: 35 ------VEIENP--RNRGETKKYQFDAFYGTECTQEDIFSREVKPIVPHLLSGQNATVFA 86
Query: 137 YGQTGTGKTFTMEGVNNVPELKGIIPNSFAHIFGHIAKADESVKFLVRVSYFEIYNEEIR 196
YG TG GKT TM G P G+IP + + + K + F +SY+EIYNE++
Sbjct: 87 YGSTGAGKTHTMLG---DPNEPGLIPRTLSDLLRMGRKQAWTGAFS--MSYYEIYNEKVY 141
Query: 197 DLLS-KNKDQCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANNADDMEKIMSLGNK 255
DLL K+ L ++E D + + L++ + + E+ +K
Sbjct: 142 DLLEPAKKE---------------LPIREDKDGNILIVGLTSKPIKSMAEFEEAYIPASK 186
Query: 256 NRSVGATAMNEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQGSVQMGRLHLVDL 315
NR+V AT +N+ SSRSHA+ I + T + G+L+L+DL
Sbjct: 187 NRTVAATKLNDNSSRSHAVLRIKV-TQPASNIQLE-----------------GKLNLIDL 228
Query: 316 AGSERQAKTGASGQRLKEASKINLSLSTLGNVISALVDGKCTHIPYRNSKLTRILQDSLG 375
AGSE +TG G RLKE++ IN SL L V+ AL G IPYR SKLTR+LQDSLG
Sbjct: 229 AGSEDNRRTGNEGIRLKESAAINSSLFVLSKVVDALNKGL-PRIPYRESKLTRLLQDSLG 287
Query: 376 GNSKTVMCATVGPASYNYEETISTLRYASRVK 407
G S+ +M A + P Y++T+STL +ASR K
Sbjct: 288 GGSRCIMVANIAPERSFYQDTLSTLNFASRSK 319
>gnl|CDD|215621 PLN03188, PLN03188, kinesin-12 family protein; Provisional.
Length = 1320
Score = 249 bits (636), Expect = 4e-70
Identities = 140/341 (41%), Positives = 197/341 (57%), Gaps = 42/341 (12%)
Query: 97 KTFTFDTVFDANSKQLDIYNETARPIVDKVLSGFNGTIFAYGQTGTGKTFTMEGVNNV-- 154
+TFTFD++ D S Q DI+ P+V+ L+GFN ++FAYGQTG+GKT+TM G N
Sbjct: 132 QTFTFDSIADPESTQEDIFQLVGAPLVENCLAGFNSSVFAYGQTGSGKTYTMWGPANGLL 191
Query: 155 -----PELKGIIPNSFAHIFGHIAK-----ADESVKFLVRVSYFEIYNEEIRDLLSKNKD 204
+ +G+ P F +F I + AD +K+ R S+ EIYNE+I DLL
Sbjct: 192 EEHLSGDQQGLTPRVFERLFARINEEQIKHADRQLKYQCRCSFLEIYNEQITDLL----- 246
Query: 205 QCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANNADDMEKIMSLGNKNRSVGATAM 264
D L+++E GVYV++L+ D+ +++ G NR GAT++
Sbjct: 247 ---------DPSQKNLQIREDVKSGVYVENLTEEYVKTMKDVTQLLIKGLSNRRTGATSI 297
Query: 265 NEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQGSVQMGRLHLVDLAGSERQAKT 324
N SSRSH++F+ ++E+ + A+ S + R++LVDLAGSERQ T
Sbjct: 298 NAESSRSHSVFTCVVESRC------------KSVADGLSSFKTSRINLVDLAGSERQKLT 345
Query: 325 GASGQRLKEASKINLSLSTLGNVISALVD----GKCTHIPYRNSKLTRILQDSLGGNSKT 380
GA+G RLKEA IN SLS LGN+I+ L + GK HIPYR+S+LT +LQ+SLGGN+K
Sbjct: 346 GAAGDRLKEAGNINRSLSQLGNLINILAEISQTGKQRHIPYRDSRLTFLLQESLGGNAKL 405
Query: 381 VMCATVGPASYNYEETISTLRYASRVKKICNKARVNQDPKD 421
M + P+ ET STLR+A R K I NKA VN+ +D
Sbjct: 406 AMVCAISPSQSCKSETFSTLRFAQRAKAIKNKAVVNEVMQD 446
>gnl|CDD|238664 cd01368, KISc_KIF23_like, Kinesin motor domain, KIF23-like
subgroup. Members of this group may play a role in
mitosis. This catalytic (head) domain has ATPase
activity and belongs to the larger group of P-loop
NTPases. Kinesins are microtubule-dependent molecular
motors that play important roles in intracellular
transport and in cell division. In most kinesins, the
motor domain is found at the N-terminus (N-type). N-type
kinesins are (+) end-directed motors, i.e. they
transport cargo towards the (+) end of the microtubule.
Kinesin motor domains hydrolyze ATP at a rate of about
80 per second, and move along the microtubule at a speed
of about 6400 Angstroms per second. To achieve that,
kinesin head groups work in pairs. Upon replacing ADP
with ATP, a kinesin motor domain increases its affinity
for microtubule binding and locks in place. Also, the
neck linker binds to the motor domain, which repositions
the other head domain through the coiled-coil domain
close to a second tubulin dimer, about 80 Angstroms
along the microtubule. Meanwhile, ATP hydrolysis takes
place, and when the second head domain binds to the
microtubule, the first domain again replaces ADP with
ATP, triggering a conformational change that pulls the
first domain forward.
Length = 345
Score = 220 bits (563), Expect = 7e-66
Identities = 119/393 (30%), Positives = 185/393 (47%), Gaps = 57/393 (14%)
Query: 16 ENVRVFVRVRPLSKKESDAGHVDIATVDLLNGIITVKNPAGSAHEPPKTFTFDTVTPYST 75
+ V+V++RVRPLSK E ++ + I + + H P +
Sbjct: 1 DPVKVYLRVRPLSKDELES--------EDEGCIEVINSTTIQLHPPKGSAAR-------- 44
Query: 76 VDLLNGIITVKNPAGSAHEPPKTFTFDTVFDANSKQLDIYNETARPIVDKVLSGFNGTIF 135
K+ + F+F VF N+ Q + + TA P+V +L G N +F
Sbjct: 45 ----------KSERNGG-QKETKFSFSKVFGPNTTQKEFFEGTALPLVQDLLKGKNSLLF 93
Query: 136 AYGQTGTGKTFTMEGVNNVPELKGIIPNSFAHIFGHIAKADESVKFLVRVSYFEIYNEEI 195
YG T +GKT+TM+G P GI+P S IF I + V VSY EIYN I
Sbjct: 94 TYGVTNSGKTYTMQGS---PGDGGILPRSLDVIFNSIGG------YSVFVSYVEIYNNYI 144
Query: 196 RDLLSKNKDQCLEVKERPDIGVYCLEVKERPDIGVYVKDLSAYIANNADDMEKIMSLGNK 255
DLL + + + L ++E + +YV L+ ++ ++ ++ G K
Sbjct: 145 YDLLEDSPSSTKKRQS--------LRLREDHNGNMYVAGLTEVEVSSTEEAREVFKRGQK 196
Query: 256 NRSVGATAMNEVSSRSHAIFSIIIETSSGLGGKRTTGENWRLWAEAQGSVQMGRLHLVDL 315
NR V T +N SSRSH++F+I + + G + + + + +L LVDL
Sbjct: 197 NRRVAGTKLNRESSRSHSVFTIKLVQAPGDSDGDVDQD--------KDQITVSQLSLVDL 248
Query: 316 AGSERQAKTGASGQRLKEASKINLSLSTLGNVISALVD-----GKCTHIPYRNSKLTRIL 370
AGSER ++T +G+RLKEA IN SL TLG I L + +PYR+SKLT +
Sbjct: 249 AGSERTSRTQNTGERLKEAGNINTSLMTLGKCIEVLRENQLSGSTNKMVPYRDSKLTHLF 308
Query: 371 QDSLGGNSKTVMCATVGPASYNYEETISTLRYA 403
Q+ G K M V P + +Y+ET+ ++++
Sbjct: 309 QNYFDGEGKARMIVNVNPCASDYDETLHVMKFS 341
>gnl|CDD|238659 cd01363, Motor_domain, Myosin and Kinesin motor domain. These
ATPases belong to the P-loop NTPase family and provide
the driving force in myosin and kinesin mediated
processes.
Length = 186
Score = 151 bits (383), Expect = 2e-42
Identities = 77/157 (49%), Positives = 99/157 (63%), Gaps = 14/157 (8%)
Query: 232 VKDLSAYIANNADDMEKIMSLGNKNRSVGATAMNEVSSRSHAIFSIIIETSSGLGGKRTT 291
++ + I D+ +M GN NR+ ATAMNE SSRSH++F I GGK
Sbjct: 44 KREGAGIIPRTVTDVIDLMDKGNANRTTAATAMNEHSSRSHSVFRIH------FGGKNAL 97
Query: 292 GENWRLWAEAQGSVQMGRLHLVDLAGSERQAKTGASGQRLKEASKINLSLSTLGNVISAL 351
A A ++G+++LVDLAGSER +GA G RL E + IN SLSTLGNVISAL
Sbjct: 98 -------ASATEQPKVGKINLVDLAGSERIDFSGAEGSRLTETANINKSLSTLGNVISAL 150
Query: 352 VDGKCTHIPYRNSKLTRILQDSLGGNSKTVMCATVGP 388
+ +H+PYR SKLTR+LQDSLGGNS+T+M A + P
Sbjct: 151 AERD-SHVPYRESKLTRLLQDSLGGNSRTLMVACISP 186
Score = 67.9 bits (166), Expect = 5e-13
Identities = 24/59 (40%), Positives = 33/59 (55%), Gaps = 5/59 (8%)
Query: 114 IYNETARPIVDKVLSGFNGTIFAYGQTGTGKTFTMEGVNNVPELKGIIPNSFAHIFGHI 172
++ + P++ L G+N IFAYGQTG+GKT+TMEG E GIIP +
Sbjct: 8 VFRDVG-PLLQSALDGYNVCIFAYGQTGSGKTYTMEG---KREGAGIIP-RTVTDVIDL 61
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 52.8 bits (127), Expect = 5e-07
Identities = 44/215 (20%), Positives = 98/215 (45%), Gaps = 27/215 (12%)
Query: 431 ISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSD----- 485
+S+ K++ EEA E +L+ TE +E E + +E + +L++ +
Sbjct: 167 VSKYKERKEEA-------------ERKLERTEENLERLEDLLEELEKQLEKLERQAEKAE 213
Query: 486 -HQSMVEKLELLQKKVLVGG--ENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEK 542
+Q + +L L+ +L+ E E E++++L + E +++ EE K+ I+E
Sbjct: 214 RYQELKAELRELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKE-IEEL 272
Query: 543 DAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQI 602
+E ++ E+ LQEE L ++++++ + +E + E E+E + + ++
Sbjct: 273 KSELEELREELEELQEELLELKEEIEELEGEISLLRERLE----ELENELEELEERLEEL 328
Query: 603 TKEYSYQQAIIQKFIPTKYLELMTQNATWNEDIGE 637
++ + +++ T EL A E E
Sbjct: 329 KEKIEALKEELEE-RETLLEELEQLLAELEEAKEE 362
Score = 48.6 bits (116), Expect = 1e-05
Identities = 50/203 (24%), Positives = 101/203 (49%), Gaps = 17/203 (8%)
Query: 394 EETISTLRYASRVKKICNKARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEPTPN 453
+ + L A + K+ + R + + L + ++E+ EL+++LEEA
Sbjct: 219 KAELRELELALLLAKL-KELRKELEELEEELSRLEEELEELQEELEEAEK---------- 267
Query: 454 PEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEI 513
E++E + +E +E EL + + + + ++ LL++++ L E E
Sbjct: 268 ---EIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEER 324
Query: 514 QQQL---LDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKV 570
++L ++A +ELE R+ EEL++ + E + + ++EEK S+L EE E L + L++
Sbjct: 325 LEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREE 384
Query: 571 SSMLLAAKEERKDVQRENRKEIE 593
+ L A E ++ E ++EIE
Sbjct: 385 LAELEAELAEIRNELEELKREIE 407
Score = 48.2 bits (115), Expect = 2e-05
Identities = 43/156 (27%), Positives = 84/156 (53%), Gaps = 10/156 (6%)
Query: 428 QKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQ 487
Q+E+ EL+++LEEA E E EL+ E E E +E + E++ +
Sbjct: 792 QEELEELEEELEEA------ERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLD 845
Query: 488 SMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERI 547
+ E+LE L+K++ E L E+ E + + EL+ ++++EEL++ ++E ++E
Sbjct: 846 ELEEELEELEKEL----EELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELA 901
Query: 548 DMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKD 583
+++E+ L+E E L KL+++ L +EE ++
Sbjct: 902 ELKEEIEKLRERLEELEAKLERLEVELPELEEELEE 937
Score = 45.1 bits (107), Expect = 1e-04
Identities = 28/144 (19%), Positives = 72/144 (50%), Gaps = 7/144 (4%)
Query: 472 EDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQK 531
E E E ++A + +E+ ++ L + L + E +Q + QE+E +++
Sbjct: 780 EIEELEEKRQALQEELEELEEELEEAERRL---DALERELESLEQRRERLEQEIEELEEE 836
Query: 532 EEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKE 591
EEL++ + E + E ++E++ L+EE E L + +++ L +EE+++++ +E
Sbjct: 837 IEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELE----EE 892
Query: 592 IEGFLDNVRQITKEYSYQQAIIQK 615
+ + ++ +E + +++
Sbjct: 893 LRELESELAELKEEIEKLRERLEE 916
Score = 44.7 bits (106), Expect = 2e-04
Identities = 31/124 (25%), Positives = 67/124 (54%), Gaps = 11/124 (8%)
Query: 474 ETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEE 533
+ + ELK +++ +S+ + LE L+++ LE+ E Q + L +EL +++ E
Sbjct: 685 KLEEELKSLKNELRSLEDLLEELRRQ--------LEELERQLEELK---RELAALEEELE 733
Query: 534 ELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIE 593
+L+ ++E + E ++EE+ LQE E L ++L+ + L KEE ++++ + + E
Sbjct: 734 QLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQE 793
Query: 594 GFLD 597
+
Sbjct: 794 ELEE 797
Score = 43.9 bits (104), Expect = 3e-04
Identities = 35/161 (21%), Positives = 80/161 (49%), Gaps = 5/161 (3%)
Query: 457 ELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQ 516
EL E E +E E EL+ + + + +LE L++++ E L + ++
Sbjct: 675 ELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQL----EELKRELAALEE 730
Query: 517 LLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLA 576
L+ LE +++ EEL++ ++E ++EE+ SL+E L ++++++ A
Sbjct: 731 ELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQA 790
Query: 577 AKEERKDVQRENRKEIEGFLDNVRQITKEYSYQQAIIQKFI 617
+EE ++++ E +E E LD + + + ++ +++ I
Sbjct: 791 LQEELEELEEE-LEEAERRLDALERELESLEQRRERLEQEI 830
Score = 43.9 bits (104), Expect = 3e-04
Identities = 33/145 (22%), Positives = 71/145 (48%), Gaps = 3/145 (2%)
Query: 473 DETDAELKRAQSDHQSMVEKLELLQKKVLVGG---ENLLEKAEIQQQLLDAAAQELEHRK 529
+E + E++ +S+ + + E+LE LQ+++L E L + + ++ L+ ELE +
Sbjct: 263 EEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELE 322
Query: 530 QKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENR 589
++ EELK+ I+ E + E L++ L + +++ L A EE +++ R
Sbjct: 323 ERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALR 382
Query: 590 KEIEGFLDNVRQITKEYSYQQAIIQ 614
+E+ + +I E + I+
Sbjct: 383 EELAELEAELAEIRNELEELKREIE 407
Score = 43.2 bits (102), Expect = 6e-04
Identities = 50/229 (21%), Positives = 100/229 (43%), Gaps = 21/229 (9%)
Query: 394 EETISTLRYASRVKKICNKARVNQDPKDALLIKYQK---EISELKKKLEEASVPPPREEP 450
E+ I L+ S ++++ + Q+ L + ++ EIS L+++LEE E
Sbjct: 266 EKEIEELK--SELEELREELEELQEELLELKEEIEELEGEISLLRERLEELEN---ELEE 320
Query: 451 TPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEK 510
EL E ++ + EL++ ++ + E+LE +L E L E
Sbjct: 321 LEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEA 380
Query: 511 AEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKV 570
+ L EL + + EELK+ I+ EE+ L E E L ++LK++
Sbjct: 381 LR---EELAELEAELAEIRNELEELKREIESL-------EERLERLSERLEDLKEELKEL 430
Query: 571 SSMLLAAKEERKDVQRENRK---EIEGFLDNVRQITKEYSYQQAIIQKF 616
+ L + E +++ E + ++E D ++++ +E + Q +Q+
Sbjct: 431 EAELEELQTELEELNEELEELEEQLEELRDRLKELERELAELQEELQRL 479
Score = 39.7 bits (93), Expect = 0.006
Identities = 38/179 (21%), Positives = 89/179 (49%), Gaps = 4/179 (2%)
Query: 428 QKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQ 487
++++ ELK++L E + EL+E +E E EL+ +
Sbjct: 715 ERQLEELKRELAALEE---ELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLE 771
Query: 488 SMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERI 547
+ KL+ +++ + L E+ E ++ L+ A + L+ +++ E L++ + + E
Sbjct: 772 EALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIE 831
Query: 548 DMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKEY 606
++EE+ L+E+ + L ++L+++ L KEE ++++ E ++E+E L + + +E
Sbjct: 832 ELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAE-KEELEDELKELEEEKEEL 889
Score = 38.9 bits (91), Expect = 0.010
Identities = 39/182 (21%), Positives = 92/182 (50%), Gaps = 10/182 (5%)
Query: 426 KYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSD 485
++ +++LK+++EE + EL+E E +E +E D + EL+ +
Sbjct: 769 SLEEALAKLKEEIEELEEKRQALQE------ELEELEEELEEAERRLDALERELESLEQR 822
Query: 486 HQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAE 545
+ + +++E L++++ E L EK + ++ L+ +ELE K++ EEL+ +E + E
Sbjct: 823 RERLEQEIEELEEEI----EELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDE 878
Query: 546 RIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKE 605
++EE+ L+EE L +L ++ + +E ++++ + + + ++ +E
Sbjct: 879 LKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEE 938
Query: 606 YS 607
Y
Sbjct: 939 YE 940
Score = 37.8 bits (88), Expect = 0.026
Identities = 28/144 (19%), Positives = 76/144 (52%), Gaps = 7/144 (4%)
Query: 476 DAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEEL 535
+ EL++ QS + + E+LE L+++ L + LE+ E + + L+ A +L+ ++ EE
Sbjct: 729 EEELEQLQSRLEELEEELEELEEE-LEELQERLEELEEELESLEEALAKLKEEIEELEEK 787
Query: 536 KKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRK---EI 592
++ +QE E ++EE+ + + L ++L+ + ++E ++++ E + ++
Sbjct: 788 RQALQE---ELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKL 844
Query: 593 EGFLDNVRQITKEYSYQQAIIQKF 616
+ + + ++ KE + +++
Sbjct: 845 DELEEELEELEKELEELKEELEEL 868
Score = 37.4 bits (87), Expect = 0.030
Identities = 19/106 (17%), Positives = 52/106 (49%), Gaps = 3/106 (2%)
Query: 505 ENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLS 564
+ L E+ + L+ +EL+ K + L+ ++E + ++E + L+ E L
Sbjct: 670 KELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALE 729
Query: 565 KKLKKVSSMLLAAKEERKDVQR---ENRKEIEGFLDNVRQITKEYS 607
++L+++ S L +EE ++++ E ++ +E + + + + +
Sbjct: 730 EELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALA 775
Score = 34.3 bits (79), Expect = 0.28
Identities = 40/186 (21%), Positives = 80/186 (43%), Gaps = 28/186 (15%)
Query: 428 QKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQ 487
++E+ EL+ + EE EL E E E E E ++EL + + +
Sbjct: 862 KEELEELEAEKEELED-------------ELKELEEEKEELEEELRELESELAELKEEIE 908
Query: 488 SMVEKLELLQKKVLVGGENLLEKAE-IQQQLLDAAAQELEHRKQKEEELKKNIQEKDAER 546
+ E+LE L+ K+ L E E ++++ D ELE ++ EE + + +
Sbjct: 909 KLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLEEEIEALGPVNLRA 968
Query: 547 IDM----EEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQI 602
I+ EE+Y L+ + E L + +K+ ++ +E+++ F + +I
Sbjct: 969 IEEYEEVEERYEELKSQREDLEEAKEKLLEVIEELDKEKRER----------FKETFDKI 1018
Query: 603 TKEYSY 608
+ +S
Sbjct: 1019 NENFSE 1024
Score = 33.9 bits (78), Expect = 0.32
Identities = 32/141 (22%), Positives = 64/141 (45%), Gaps = 18/141 (12%)
Query: 424 LIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQ 483
L + ++E EL+++L E E EL E + +E +E +A+L+R +
Sbjct: 879 LKELEEEKEELEEELREL-------------ESELAELKEEIEKLRERLEELEAKLERLE 925
Query: 484 SDHQSMVEKL-ELLQKKVLVGGENLLEKAEIQQQLLD----AAAQELEHRKQKEEELKKN 538
+ + E+L E + + E +E+ E + + L A +E E +++ EELK
Sbjct: 926 VELPELEEELEEEYEDTLETELEREIERLEEEIEALGPVNLRAIEEYEEVEERYEELKSQ 985
Query: 539 IQEKDAERIDMEEKYSSLQEE 559
++ + + + E L +E
Sbjct: 986 REDLEEAKEKLLEVIEELDKE 1006
Score = 29.7 bits (67), Expect = 6.2
Identities = 17/101 (16%), Positives = 46/101 (45%), Gaps = 11/101 (10%)
Query: 508 LEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKL 567
E E++++L +LE +++ + LK ++ + ++ + L+ + E L ++L
Sbjct: 667 RELKELEEEL-AELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKREL 725
Query: 568 KKVSSMLLAAKEERKDVQRENR---KEIEGFLDNVRQITKE 605
A +EE + +Q +E+E + + ++ +
Sbjct: 726 A-------ALEEELEQLQSRLEELEEELEELEEELEELQER 759
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 48.1 bits (115), Expect = 2e-05
Identities = 33/156 (21%), Positives = 68/156 (43%), Gaps = 10/156 (6%)
Query: 457 ELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKV------LVGGENLLEK 510
E++E E +E E E + L + + + + E+LE L+K++ + L +
Sbjct: 678 EIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLAR 737
Query: 511 AEIQQQLLDAAAQELEHR----KQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKK 566
E + + L+ +L + + EEL++ ++E + E + E + L+ + E L ++
Sbjct: 738 LEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEE 797
Query: 567 LKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQI 602
LK + L + E + E E R+I
Sbjct: 798 LKALREALDELRAELTLLNEEAANLRERLESLERRI 833
Score = 42.7 bits (101), Expect = 7e-04
Identities = 50/254 (19%), Positives = 107/254 (42%), Gaps = 42/254 (16%)
Query: 396 TISTLRYASRVKKICNKARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPE 455
+ LR +++ + L + ++E+ EL +L+E E
Sbjct: 228 ALLVLRLEELREEL--------EELQEELKEAEEELEELTAELQEL-------------E 266
Query: 456 PELDETEPCVEPSEVIEDETDA---------ELKRAQSDHQSMVEKLELLQKKVLVGGEN 506
+L+E +E SE+ E+ + E+ R + Q + E+L L++++
Sbjct: 267 EKLEELR--LEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQ 324
Query: 507 LLE---KAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGL 563
L E K + + L ++LE K++ E L+ ++E +AE ++E + L+E+ E L
Sbjct: 325 LEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETL 384
Query: 564 SKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKEYSYQQAIIQKFIPTKYLE 623
KV+ + L +++R +E D ++ +E + +++K + E
Sbjct: 385 R---SKVAQLELQIASLNNEIER-LEARLERLEDRRERLQQE---IEELLKKLEEAELKE 437
Query: 624 LMTQNATWNEDIGE 637
L + E++ E
Sbjct: 438 LQAELEELEEELEE 451
Score = 41.6 bits (98), Expect = 0.002
Identities = 44/235 (18%), Positives = 90/235 (38%), Gaps = 22/235 (9%)
Query: 392 NYEETISTLRYASRVKKICNKARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEPT 451
EE + LR + + + L EIS L+++ + E
Sbjct: 264 ELEEKLEELRLEVSELE----EEIEE--LQKELYALANEISRLEQQKQILRERLANLERQ 317
Query: 452 PNP-EPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEK 510
E +L+E E ++ E + +L+ + + +S+ +LE L+ ++ E L +
Sbjct: 318 LEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAEL----EELESR 373
Query: 511 AEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKV 570
E ++ L+ ++ + + L I+ +A +E++ LQ+E E L KKL++
Sbjct: 374 LEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEA 433
Query: 571 -----------SSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKEYSYQQAIIQ 614
L +E + E +E+ L+ Q + A +Q
Sbjct: 434 ELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQ 488
Score = 40.8 bits (96), Expect = 0.002
Identities = 46/240 (19%), Positives = 107/240 (44%), Gaps = 7/240 (2%)
Query: 403 ASRVKKICNKARVNQDPKDALLIKYQKEISELKKKLEEASVPPPR-EEPTPNPEPELDET 461
+R++ + L + + EI EL+++LEEA E E ++++
Sbjct: 735 LARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQL 794
Query: 462 EPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGG---ENLLEKAEIQQQLL 518
+ ++ DE AEL + ++ E+LE L++++ E+L E+ E + +
Sbjct: 795 KEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDI 854
Query: 519 DAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAK 578
++ A E+E ++ EEL+ ++ ER +EE + L+ E E LS++L+++ S +
Sbjct: 855 ESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELR 914
Query: 579 EERKDVQREN---RKEIEGFLDNVRQITKEYSYQQAIIQKFIPTKYLELMTQNATWNEDI 635
E ++++ + +EG + + + S + ++ + ++ +
Sbjct: 915 RELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRRL 974
Score = 32.0 bits (73), Expect = 1.2
Identities = 25/102 (24%), Positives = 55/102 (53%), Gaps = 11/102 (10%)
Query: 514 QQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSM 573
+++ ++ +++E ++K EL+K + E E ++EE+ L++E E LS+++
Sbjct: 675 RRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQI------ 728
Query: 574 LLAAKEERKDVQRENRKEIEGFLDNVRQITKEYSYQQAIIQK 615
RKD+ R E+E + + Q++KE + +A I++
Sbjct: 729 ----SALRKDLARL-EAEVEQLEERIAQLSKELTELEAEIEE 765
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 46.6 bits (111), Expect = 4e-05
Identities = 34/186 (18%), Positives = 80/186 (43%), Gaps = 9/186 (4%)
Query: 421 DALLIKYQKEISELKKKLEEAS--VPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAE 478
+ + +K + E+++ LEE + + EE + ++ E E + E E + E
Sbjct: 257 TEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERE 316
Query: 479 LKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKN 538
L+ A+ + +++ L ++ E L + E +++ D +E K++ E+L+
Sbjct: 317 LEDAEERLAKLEAEIDKLLAEI----EELEREIEEERKRRDKLTEEYAELKEELEDLRAE 372
Query: 539 IQEKDAERIDMEEKYSSLQEESEGLSKK---LKKVSSMLLAAKEERKDVQRENRKEIEGF 595
++E D E + ++ +E+ E L ++ LK+ L + + + I G
Sbjct: 373 LEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGI 432
Query: 596 LDNVRQ 601
+ +
Sbjct: 433 EAKINE 438
Score = 46.6 bits (111), Expect = 4e-05
Identities = 42/197 (21%), Positives = 83/197 (42%), Gaps = 34/197 (17%)
Query: 418 DPKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDA 477
+ + L K ++ +++L+ +L P + EL + E V E E +
Sbjct: 768 EELEEDLHKLEEALNDLEARL--------SHSRIPEIQAELSKLEEEVSRIEARLREIEQ 819
Query: 478 ELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKE----- 532
+L R + E L+K E E+Q+Q +D Q K+ E
Sbjct: 820 KLNRLTLE-------KEYLEK----------EIQELQEQRIDLKEQIKSIEKEIENLNGK 862
Query: 533 -EELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSML--LAAKEERKDVQRENR 589
EEL++ ++E +A D+E + L++E + L +L+++ + L A+ E+K +
Sbjct: 863 KEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSEL 922
Query: 590 KE-IEGFLDNVRQITKE 605
K +E + + +I
Sbjct: 923 KAKLEALEEELSEIEDP 939
Score = 46.6 bits (111), Expect = 5e-05
Identities = 46/190 (24%), Positives = 91/190 (47%), Gaps = 33/190 (17%)
Query: 426 KYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSD 485
K +EISEL+K+LEE E L+E ++ I+D + E R +
Sbjct: 255 KLTEEISELEKRLEEI-------------EQLLEEL------NKKIKDLGEEEQLRVKEK 295
Query: 486 HQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAE 545
+ ++ L++ + L E AE + L+A E++ + EEL++ I+E+
Sbjct: 296 IGELEAEIASLERSIAEKEREL-EDAEERLAKLEA---EIDKLLAEIEELEREIEEERKR 351
Query: 546 RIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQR----------ENRKEIEGF 595
R + E+Y+ L+EE E L +L++V ++E KD + E ++E++
Sbjct: 352 RDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRL 411
Query: 596 LDNVRQITKE 605
+ ++++++E
Sbjct: 412 QEELQRLSEE 421
Score = 44.3 bits (105), Expect = 2e-04
Identities = 51/220 (23%), Positives = 92/220 (41%), Gaps = 34/220 (15%)
Query: 421 DALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELK 480
+A + K EI EL++++EE R + E E + +E +E D E
Sbjct: 328 EAEIDKLLAEIEELEREIEEERK---RRDKLTE---EYAELKEELEDLRAELEEVDKEFA 381
Query: 481 RAQSDHQSMVEKLELLQKKV--LVGGENLL---------EKAEIQQQLLDAAAQ--ELEH 527
+ + + EKLE L++++ L + L E A++ + A+ ELE
Sbjct: 382 ETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEE 441
Query: 528 RKQ-KEEELKK---NIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKD 583
K+ K E+KK +++ A+ E++ L+EE + + K+L K+ L A+ + +
Sbjct: 442 EKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARA 501
Query: 584 VQRENR----------KEIEGFLDNVRQITK-EYSYQQAI 612
+ R I+G V Q+ Y AI
Sbjct: 502 SEERVRGGRAVEEVLKASIQGVHGTVAQLGSVGERYATAI 541
Score = 41.6 bits (98), Expect = 0.002
Identities = 41/197 (20%), Positives = 87/197 (44%), Gaps = 20/197 (10%)
Query: 428 QKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQ 487
+KEI +L+++ E+ E E +L E +E + E +A ++ + D
Sbjct: 722 EKEIEQLEQEEEKLK------ERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLH 775
Query: 488 SMVEKLELLQKKVLVGG----ENLLEKAEIQQQLLDAAAQELE----HRKQKEEELKKNI 539
+ E L L+ ++ + L K E + ++A +E+E ++E L+K I
Sbjct: 776 KLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEI 835
Query: 540 QEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQ-RENRKEIEGFLDN 598
QE +RID++E+ S+++E E L+ K ++ L + E + R+ +
Sbjct: 836 QELQEQRIDLKEQIKSIEKEIENLNGKKEE-----LEEELEELEAALRDLESRLGDLKKE 890
Query: 599 VRQITKEYSYQQAIIQK 615
++ + + I++
Sbjct: 891 RDELEAQLRELERKIEE 907
Score = 38.5 bits (90), Expect = 0.013
Identities = 38/169 (22%), Positives = 80/169 (47%), Gaps = 14/169 (8%)
Query: 457 ELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLL-------- 508
EL+E E +E ++I DE +L+R + + + LL++K G LL
Sbjct: 178 ELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALER 237
Query: 509 EKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSL-QEESEGLSKKL 567
+K I++QL + +ELE ++ EL+K ++E + ++ +K L +EE + +K+
Sbjct: 238 QKEAIERQL-ASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKI 296
Query: 568 KKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKEYSYQQAIIQKF 616
++ + + + E +E+E + + ++ E A I++
Sbjct: 297 GELEAEI----ASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEEL 341
Score = 35.4 bits (82), Expect = 0.13
Identities = 38/171 (22%), Positives = 78/171 (45%), Gaps = 17/171 (9%)
Query: 465 VEPSEVIED-----ETDAELKRAQSDHQSMVEKLELLQKKVLVGGENL------LEKAEI 513
VE ++I++ E D + ++A + + + E +E L + + L EKAE
Sbjct: 153 VERRKIIDEIAGVAEFDRKKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAER 212
Query: 514 QQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSM 573
Q LL E + E +K E+ E I+ + +SL+EE E L++++ ++
Sbjct: 213 YQALLK----EKREYEGYELLKEKEALERQKEAIERQ--LASLEEELEKLTEEISELEKR 266
Query: 574 LLAAKEERKDVQRENRKEIEGFLDNVRQITKEYSYQQAIIQKFIPTKYLEL 624
L ++ +++ ++ + E V++ E + A +++ I K EL
Sbjct: 267 LEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKEREL 317
Score = 33.9 bits (78), Expect = 0.40
Identities = 25/132 (18%), Positives = 61/132 (46%), Gaps = 15/132 (11%)
Query: 465 VEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQE 524
E S + + A + +S +L+ L+++ LE + + L + E
Sbjct: 649 FEKSGAMTGGSRAP-RGGILFSRSEPAELQRLRER--------LEGLKRE---LSSLQSE 696
Query: 525 LEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDV 584
L + + +EL + + + + ++E++ L++E E L ++L+++ L + ++E ++V
Sbjct: 697 LRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENV 756
Query: 585 QRE---NRKEIE 593
+ E IE
Sbjct: 757 KSELKELEARIE 768
Score = 30.8 bits (70), Expect = 2.9
Identities = 24/142 (16%), Positives = 70/142 (49%), Gaps = 6/142 (4%)
Query: 474 ETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEE 533
E + EL+ ++ + + +L L+K+ + L + ++ ++ ++E ++++
Sbjct: 865 ELEEELEELEAALRDLESRLGDLKKER----DELEAQLRELERKIEELEAQIEKKRKRLS 920
Query: 534 ELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIE 593
ELK ++ + E ++E+ +E E L+ V + L +EE + ++ N I+
Sbjct: 921 ELKAKLEALEEELSEIEDPKGEDEEIPEEEL-SLEDVQAELQRVEEEIRALEPVNMLAIQ 979
Query: 594 GFLDNVRQITKEYSYQQAIIQK 615
+ + ++++ E ++A +++
Sbjct: 980 EYEEVLKRL-DELKEKRAKLEE 1000
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 45.8 bits (109), Expect = 8e-05
Identities = 31/132 (23%), Positives = 67/132 (50%), Gaps = 2/132 (1%)
Query: 455 EPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQ 514
P ++ E E E +E E +L+ + + +E+ E L + E L E+ E
Sbjct: 464 SPHVETIEEDRERVEELEAEL-EDLEEEVEEVEERLERAEDL-VEAEDRIERLEERREDL 521
Query: 515 QQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSML 574
++L+ + +E ++++ EEL++ E +AE + E + +EE+E +++ +++S L
Sbjct: 522 EELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREEVAELNSKL 581
Query: 575 LAAKEERKDVQR 586
KE + ++R
Sbjct: 582 AELKERIESLER 593
Score = 31.2 bits (71), Expect = 2.3
Identities = 29/146 (19%), Positives = 56/146 (38%), Gaps = 23/146 (15%)
Query: 473 DETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQL--LDAAAQELEHRK- 529
E D E++R + + E + E L E E +++L L+A ++L
Sbjct: 216 AELDEEIERYEEQREQARETRDEAD-------EVLEEHEERREELETLEAEIEDLRETIA 268
Query: 530 ---QKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVS---SMLLAAKEERKD 583
++ EEL + +++ ++EE+ L E+ + V L EE +D
Sbjct: 269 ETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRDEELRD 328
Query: 584 VQRENR-------KEIEGFLDNVRQI 602
E R +E E ++ +
Sbjct: 329 RLEECRVAAQAHNEEAESLREDADDL 354
Score = 31.2 bits (71), Expect = 2.6
Identities = 33/152 (21%), Positives = 61/152 (40%), Gaps = 8/152 (5%)
Query: 472 EDETDAELKRAQSDH--QSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRK 529
E AE R ++ EK E + E E AE+ +L + +E
Sbjct: 534 EKRERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREEVAELNSKL-AELKERIE-SL 591
Query: 530 QKEEELKKNIQEKDAERIDMEEKYSSLQE----ESEGLSKKLKKVSSMLLAAKEERKDVQ 585
++ L I + + E + EK +L E E L++K ++ + E R +
Sbjct: 592 ERIRTLLAAIADAEDEIERLREKREALAELNDERRERLAEKRERKRELEAEFDEARIEEA 651
Query: 586 RENRKEIEGFLDNVRQITKEYSYQQAIIQKFI 617
RE+++ E +L+ V + E ++ +Q I
Sbjct: 652 REDKERAEEYLEQVEEKLDELREERDDLQAEI 683
Score = 29.2 bits (66), Expect = 8.3
Identities = 36/164 (21%), Positives = 69/164 (42%), Gaps = 25/164 (15%)
Query: 457 ELDETEPCVEPSE--VIEDETD-AELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEI 513
EL+ E +E + E E + EL D + +E+LE ++LL +A +
Sbjct: 252 ELETLEAEIEDLRETIAETEREREELAEEVRDLRERLEELE-------EERDDLLAEAGL 304
Query: 514 QQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSM 573
+A E + ++EEL+ ++E R+ + EE+E L + +
Sbjct: 305 DDADAEAVEARREELEDRDEELRDRLEE---CRVAAQAH----NEEAESLREDADDL--- 354
Query: 574 LLAAKEERKDVQRENRKEIEGFLDNVRQITKEYSYQQAIIQKFI 617
EER + RE E+E L+ R+ ++ + +++ I
Sbjct: 355 -----EERAEELREEAAELESELEEAREAVEDRREEIEELEEEI 393
Score = 29.2 bits (66), Expect = 8.5
Identities = 30/108 (27%), Positives = 50/108 (46%), Gaps = 15/108 (13%)
Query: 505 ENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLS 564
E+L E A+ L+ A+EL +++ EL+ ++E D E+ L+EE E L
Sbjct: 345 ESLREDADD----LEERAEEL---REEAAELESELEEAREAVEDRREEIEELEEEIEELR 397
Query: 565 KK-------LKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKE 605
++ L L +EER ++ RE E+E L R+ +E
Sbjct: 398 ERFGDAPVDLGNAEDFLEELREERDEL-REREAELEATLRTARERVEE 444
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 44.7 bits (105), Expect = 2e-04
Identities = 44/221 (19%), Positives = 92/221 (41%), Gaps = 7/221 (3%)
Query: 400 LRYASRVKKICNKARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELD 459
L+ A KK + +D AL + +E + ++ E + EE E
Sbjct: 1557 LKKAEEKKKAEEAKKAEEDKNMAL--RKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKK 1614
Query: 460 ETEPCVEPSEVIEDETD----AELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQ 515
E ++ E+ + E + +LK+ +++ + E+L+ +++ + +KAE +
Sbjct: 1615 AEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDK 1674
Query: 516 QLLDAAAQELEHRKQKEEELKKNIQE-KDAERIDMEEKYSSLQEESEGLSKKLKKVSSML 574
+ + A + E K+ E LKK +E K AE + +E + E +++ K+ +
Sbjct: 1675 KKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEE 1734
Query: 575 LAAKEERKDVQRENRKEIEGFLDNVRQITKEYSYQQAIIQK 615
+ E + E K+ E + + KE + I+K
Sbjct: 1735 AKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRK 1775
Score = 36.7 bits (84), Expect = 0.059
Identities = 46/201 (22%), Positives = 85/201 (42%), Gaps = 11/201 (5%)
Query: 400 LRYASRVKKICNKARVNQDPKDALLIK-----YQKEISELKKKLEEASVPPPREEPTPNP 454
+ A KK + ++ K A K +K+ KKK EEA E
Sbjct: 1295 AKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAK--KAAEAAKAEA 1352
Query: 455 EPELDETEPCVEPSEVIEDETDAELKRAQSDHQSM--VEKLELLQKKVLVGGENL--LEK 510
E DE E E +E E + + K+A + + +K + +KK + L+K
Sbjct: 1353 EAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKK 1412
Query: 511 AEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKV 570
A ++ D A ++ E +K+ +E KK + K A+ + + + EE++ +++ KK
Sbjct: 1413 AAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKA 1472
Query: 571 SSMLLAAKEERKDVQRENRKE 591
A+E +K + + + E
Sbjct: 1473 DEAKKKAEEAKKADEAKKKAE 1493
Score = 34.0 bits (77), Expect = 0.35
Identities = 47/203 (23%), Positives = 88/203 (43%), Gaps = 21/203 (10%)
Query: 400 LRYASRVKKICNKARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELD 459
+ A+ KK ++A+ ++ K A K +E KK +EA ++ EL
Sbjct: 1502 AKKAAEAKKKADEAKKAEEAKKADEAKKAEE----AKKADEA-----KKAEEKKKADELK 1552
Query: 460 ETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLD 519
+ E + E + E E K+A+ D + K E +K E +++ E ++++
Sbjct: 1553 KAEELKKAEEKKKAE---EAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKA 1609
Query: 520 AAAQELEHRKQKEEELKK---------NIQEKDAERIDMEEKYSSLQEESEGLSKKLKKV 570
A++ E K K EELKK +++K+AE E+ +EE++ + + K
Sbjct: 1610 EEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKK 1669
Query: 571 SSMLLAAKEERKDVQRENRKEIE 593
+ EE K + + +K E
Sbjct: 1670 AEEDKKKAEEAKKAEEDEKKAAE 1692
Score = 33.2 bits (75), Expect = 0.64
Identities = 41/214 (19%), Positives = 83/214 (38%), Gaps = 15/214 (7%)
Query: 391 YNYEETISTLRYASRVKKICNKARVNQDPKDALLIKYQKEISELKKKLEEASVPPPR--- 447
+ ++ A KK +A+ ++ ++ I+ +E +A++
Sbjct: 1220 AEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARK 1279
Query: 448 --EEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVE----KLELLQKKVL 501
E + + DE + E + E + AE + + + E K + +KK
Sbjct: 1280 ADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKK-- 1337
Query: 502 VGGENLLEKAEIQQQLLDAAAQELEHRKQKEE--ELKKNIQEKDAERIDMEEKYSSLQEE 559
E + AE + +AAA E E ++K E E KK +K A+ + + +E
Sbjct: 1338 --AEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADE 1395
Query: 560 SEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIE 593
++ +++ KK + L A +K +K E
Sbjct: 1396 AKKKAEEDKKKADELKKAAAAKKKADEAKKKAEE 1429
Score = 32.8 bits (74), Expect = 0.90
Identities = 48/228 (21%), Positives = 97/228 (42%), Gaps = 23/228 (10%)
Query: 401 RYASRVKKI--CNKARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPEL 458
+ A +KK NK + ++ K A K + E E KK E+ E E
Sbjct: 1647 KKAEELKKAEEENKIKAAEEAKKAEEDKKKAE--EAKKAEEDEK---KAAEALKKEAEEA 1701
Query: 459 DETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKL----ELLQKKVLVGGENLLEKAEIQ 514
+ E ++ E E + ELK+A+ +++ E+ E +KK ++ EK +I
Sbjct: 1702 KKAEE-LKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIA 1760
Query: 515 QQLLDAAAQELEHRKQKEEELKKNIQEKDAE-RIDMEEKYSSLQEESEGLSKKLKKVSSM 573
+ + E RK+KE +++ + E+D + R+++++K + + + + K+ + +
Sbjct: 1761 HLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNFANIIEGGKEGNLV 1820
Query: 574 LLAAKE----------ERKDVQRENRKEIEGFLDNVRQITKEYSYQQA 611
+ +KE + K++Q E E N E ++A
Sbjct: 1821 INDSKEMEDSAIKEVADSKNMQLEEADAFEKHKFNKNNENGEDGNKEA 1868
Score = 29.7 bits (66), Expect = 7.0
Identities = 32/184 (17%), Positives = 74/184 (40%), Gaps = 5/184 (2%)
Query: 412 KARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVI 471
KA + +A + + E + E+A ++E + + ++
Sbjct: 1337 KAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEA 1396
Query: 472 EDETDAELKRAQSDHQSMVEK--LELLQKKV--LVGGENLLEKAEIQQQLLDAAAQELEH 527
+ + + + K+A ++ K + +KK + +KAE + + D A ++ E
Sbjct: 1397 KKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAE-EAKKADEAKKKAEE 1455
Query: 528 RKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRE 587
K+ EE KK + K A+ + + + +E++ +++ KK + A E +K
Sbjct: 1456 AKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEA 1515
Query: 588 NRKE 591
+ E
Sbjct: 1516 KKAE 1519
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 43.2 bits (102), Expect = 5e-04
Identities = 47/213 (22%), Positives = 92/213 (43%), Gaps = 33/213 (15%)
Query: 422 ALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKR 481
LL ++E+ EL+++L+E EL+E S++ E EL
Sbjct: 579 RLLRTRKEELEELRERLKELK----------KKLKELEERL-----SQLEELLQSLELSE 623
Query: 482 AQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQE 541
A+++ + E+LE + LEK +Q +L + LE ++K EEL+ I+
Sbjct: 624 AENELEEAEEELE-----------SELEKLNLQAELEELLQAALEELEEKVEELEAEIRR 672
Query: 542 KDAERIDMEEKYSSLQEESEGLSKKLKKVSSML--LAAKEERKDVQRENRKEIEGFLDNV 599
+ +RI+ EE+ EE E L ++L+++ L L K + E + + L+ +
Sbjct: 673 EL-QRIENEEQLEEKLEELEQLEEELEQLREELEELLKKLGEIEQLIEELESRKAELEEL 731
Query: 600 RQ----ITKEYSYQQAIIQKFIPTKYLELMTQN 628
++ + K + + +K + +N
Sbjct: 732 KKELEKLEKALELLEELREKLGKAGLRADILRN 764
Score = 40.5 bits (95), Expect = 0.003
Identities = 29/123 (23%), Positives = 58/123 (47%), Gaps = 3/123 (2%)
Query: 474 ETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLE---KAEIQQQLLDAAAQELEHRKQ 530
E EL+R + + +E+LE L++++ E L E ++LL+ E ++
Sbjct: 274 EELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEK 333
Query: 531 KEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRK 590
EE+L+K E + + E L+E + L ++L+++ L A E K ++ ++
Sbjct: 334 LEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQE 393
Query: 591 EIE 593
E
Sbjct: 394 LKE 396
Score = 38.2 bits (89), Expect = 0.018
Identities = 29/113 (25%), Positives = 55/113 (48%), Gaps = 13/113 (11%)
Query: 478 ELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQL---LDAAAQELEHRKQKEEE 534
+LK + + + EKLE L+ ++ E E A++ ++ L+ +ELE +K E
Sbjct: 323 KLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALE 382
Query: 535 LKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRE 587
K ++E E L+EE LS L+++ L ++E ++++RE
Sbjct: 383 RLKQLEEAIQE----------LKEELAELSAALEEIQEELEELEKELEELERE 425
Score = 36.7 bits (85), Expect = 0.043
Identities = 39/181 (21%), Positives = 78/181 (43%), Gaps = 22/181 (12%)
Query: 418 DPKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETD- 476
+ L + ++ I ELK++L E L+E + E+ E E +
Sbjct: 378 EKALERLKQLEEAIQELKEELAEL-------------SAALEEIQ-----EELEELEKEL 419
Query: 477 AELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELK 536
EL+R + + ++KLE ++ + E A ++ + E +++ EL
Sbjct: 420 EELERELEELEEEIKKLEEQINQLESKELMIAELAGAGEKCPVCGQELPEEHEKELLELY 479
Query: 537 KNIQEKDAERIDMEEKYSSLQEESEGLSKKL---KKVSSMLLAAKEERKDVQRENRKEIE 593
+ E+ E + E++ + L+EE E L K+L ++ LL +E K+ E +++E
Sbjct: 480 ELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLE 539
Query: 594 G 594
Sbjct: 540 N 540
Score = 34.7 bits (80), Expect = 0.19
Identities = 31/176 (17%), Positives = 76/176 (43%), Gaps = 15/176 (8%)
Query: 474 ETDAELKRAQSDHQSMVEKL---ELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQ 530
E DA LK + + ++++L E +K + E ++++A+ + + L+ EL +
Sbjct: 144 EFDAFLKSKPKERKEILDELFGLEKYEKLSELLKE-VIKEAKAKIEELEGQLSELL--ED 200
Query: 531 KEEELKKNIQEKDAER----IDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQR 586
E+ L+ +E + I E++ L++E E L ++L ++ +E + +
Sbjct: 201 IEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARLLE 260
Query: 587 ENRKEIE---GFLDNVRQITKEYSYQQAIIQKF--IPTKYLELMTQNATWNEDIGE 637
E+E + +R++ + + I++ + + EL + + E
Sbjct: 261 IESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEE 316
Score = 30.5 bits (69), Expect = 3.8
Identities = 47/238 (19%), Positives = 100/238 (42%), Gaps = 27/238 (11%)
Query: 392 NYEETISTLRYASRVKKICNKARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEPT 451
+ E I+ L A +C + ++ + LL Y+ E+ EL+++L E
Sbjct: 445 SKELMIAELAGAGEKCPVCGQEL-PEEHEKELLELYELELEELEEELSREKEEAELREEI 503
Query: 452 PNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKA 511
E EL E E + +E+ EL+ ++++E+LE L++K+ + + L E+
Sbjct: 504 EELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQL--QQLKEEL 561
Query: 512 EIQQQL------LDAAAQELEHRKQKEEELKKNIQEKDA----------------ERIDM 549
+ L + L RK++ EEL++ ++E + +++
Sbjct: 562 RQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELEERLSQLEELLQSLEL 621
Query: 550 EEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQ--RENRKEIEGFLDNVRQITKE 605
E + L+E E L +L+K++ + + ++ E +E+E + Q +
Sbjct: 622 SEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRIEN 679
Score = 30.1 bits (68), Expect = 4.9
Identities = 35/189 (18%), Positives = 80/189 (42%), Gaps = 29/189 (15%)
Query: 423 LLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRA 482
+ K +++I++L+ K + E P EL E E + + + + EL+
Sbjct: 432 EIKKLEEQINQLESKELMIAELAGAGEKCPVCGQELPE-----EHEKELLELYELELEEL 486
Query: 483 Q------SDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELK 536
+ + + E++E L+K++ E L+E E+++ L + ++LE + EEL+
Sbjct: 487 EEELSREKEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELE 546
Query: 537 KNIQEKDAERIDMEEKYSS------------------LQEESEGLSKKLKKVSSMLLAAK 578
+ ++ +++ E + +EE E L ++LK++ L +
Sbjct: 547 ELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELE 606
Query: 579 EERKDVQRE 587
E ++
Sbjct: 607 ERLSQLEEL 615
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 41.7 bits (99), Expect = 0.001
Identities = 23/86 (26%), Positives = 50/86 (58%), Gaps = 4/86 (4%)
Query: 474 ETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEE 533
E + EL+ +++ Q + ++L LQK+ + + LE E +++ L+ +ELE ++Q+ E
Sbjct: 72 EFEKELRERRNELQKLEKRL--LQKEENL--DRKLELLEKREEELEKKEKELEQKQQELE 127
Query: 534 ELKKNIQEKDAERIDMEEKYSSLQEE 559
+ ++ ++E E++ E+ S L E
Sbjct: 128 KKEEELEELIEEQLQELERISGLTAE 153
Score = 41.3 bits (98), Expect = 0.002
Identities = 30/150 (20%), Positives = 74/150 (49%), Gaps = 18/150 (12%)
Query: 471 IEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAA--QELEHR 528
E+E L+ A+ + +++ ++ L K+ E + E +++L + Q+LE R
Sbjct: 36 AEEEAKRILEEAKKEAEAIKKEALLEAKE-----EIHKLRNEFEKELRERRNELQKLEKR 90
Query: 529 -KQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDV--- 584
QKEE L + ++ + ++E+K L+++ + L KK +++ ++ +E + +
Sbjct: 91 LLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELERISGL 150
Query: 585 -QRENRKEIEGFLDNVRQITKEYSYQQAII 613
E ++ + L+ V +E ++ A++
Sbjct: 151 TAEEAKEIL---LEKVE---EEARHEAAVL 174
>gnl|CDD|217933 pfam04156, IncA, IncA protein. Chlamydia trachomatis is an
obligate intracellular bacterium that develops within a
parasitophorous vacuole termed an inclusion. The
inclusion is non-fusogenic with lysosomes but intercepts
lipids from a host cell exocytic pathway. Initiation of
chlamydial development is concurrent with modification
of the inclusion membrane by a set of C.
trachomatis-encoded proteins collectively designated
Incs. One of these Incs, IncA, is functionally
associated with the homotypic fusion of inclusions. This
family probably includes members of the wider Inc family
rather than just IncA.
Length = 186
Score = 40.1 bits (94), Expect = 0.001
Identities = 27/126 (21%), Positives = 61/126 (48%), Gaps = 8/126 (6%)
Query: 490 VEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDM 549
+ K + + E E +E++QQL EL+ + EE I E ++E D+
Sbjct: 61 LLKAPVQSVRPQKLEELQGELSELKQQL-----SELQEELEDLEER---IAELESELEDL 112
Query: 550 EEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKEYSYQ 609
+E L+E + L ++L+ + + +E ++++++ R+E+E + + ++ +
Sbjct: 113 KEDLQLLRELLKSLEERLESLEESIKELAKELRELRQDLREEVEELREELERLQENLQRL 172
Query: 610 QAIIQK 615
Q IQ+
Sbjct: 173 QEAIQE 178
Score = 32.8 bits (75), Expect = 0.32
Identities = 24/110 (21%), Positives = 49/110 (44%), Gaps = 7/110 (6%)
Query: 472 EDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEH---- 527
+E EL + + E+LE L++++ L + E Q L + E
Sbjct: 74 LEELQGELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRELLKSLEERLESL 133
Query: 528 ---RKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSML 574
K+ +EL++ Q+ E ++ E+ LQE + L + ++++ S+L
Sbjct: 134 EESIKELAKELRELRQDLREEVEELREELERLQENLQRLQEAIQELQSLL 183
Score = 29.7 bits (67), Expect = 3.4
Identities = 25/140 (17%), Positives = 59/140 (42%), Gaps = 13/140 (9%)
Query: 397 ISTLRYASRVKKICNKARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEP 456
L + + + L + ++++SEL+++LE+ EE
Sbjct: 54 ALGLLCLLLKAPVQSVRPQKLEELQGELSELKQQLSELQEELED------LEE------- 100
Query: 457 ELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQ 516
+ E E +E + LK + +S+ E ++ L K++ ++L E+ E ++
Sbjct: 101 RIAELESELEDLKEDLQLLRELLKSLEERLESLEESIKELAKELRELRQDLREEVEELRE 160
Query: 517 LLDAAAQELEHRKQKEEELK 536
L+ + L+ ++ +EL+
Sbjct: 161 ELERLQENLQRLQEAIQELQ 180
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 41.1 bits (96), Expect = 0.002
Identities = 34/165 (20%), Positives = 68/165 (41%), Gaps = 2/165 (1%)
Query: 431 ISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMV 490
S K+K +E E E E + + E + + +
Sbjct: 165 GSREKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEE 224
Query: 491 EKLELLQKKVLVGG--ENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERID 548
E L L L + L E +Q+ ++++ QELE ++ ++ K +E++ E+
Sbjct: 225 ENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKL 284
Query: 549 MEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIE 593
EE+ L +E E L +L K+ + +E+ K+ ++E +K +
Sbjct: 285 QEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEK 329
Score = 33.4 bits (76), Expect = 0.58
Identities = 33/191 (17%), Positives = 74/191 (38%), Gaps = 17/191 (8%)
Query: 391 YNYEETISTLRYASRVKKICNKARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEP 450
EE + L Y ++ + + + + ++E+ + ++ L +
Sbjct: 221 ELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLK------- 273
Query: 451 TPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEK 510
E E E + E E + +S+ + + ++K L E L+K
Sbjct: 274 ------ENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEK-LKESEKELKK 326
Query: 511 AEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKV 570
E + L +E+E +++ +EL+ + ++ E +E+ L++ E L K K
Sbjct: 327 LEKE---LKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLE 383
Query: 571 SSMLLAAKEER 581
S L +A + +
Sbjct: 384 SERLSSAAKLK 394
Score = 31.1 bits (70), Expect = 2.6
Identities = 26/145 (17%), Positives = 65/145 (44%), Gaps = 3/145 (2%)
Query: 472 EDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQK 531
+ EL R + + ++ ++++L ++ E +++L + + L K++
Sbjct: 239 RIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLA--KEE 296
Query: 532 EEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKE 591
EE + ++ + + D EEK ++E + L K+LKK + ++E K+++ + E
Sbjct: 297 EELKSELLKLERRKVDD-EEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAE 355
Query: 592 IEGFLDNVRQITKEYSYQQAIIQKF 616
E + K ++ ++ K
Sbjct: 356 EEEEEQLEKLQEKLEQLEEELLAKK 380
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 40.4 bits (95), Expect = 0.003
Identities = 53/222 (23%), Positives = 105/222 (47%), Gaps = 30/222 (13%)
Query: 424 LIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVI----------ED 473
LIK + EI LKK+LE+ ++ E +LDE E E +E++ +
Sbjct: 534 LIKLKGEIKSLKKELEKLE---ELKKKLAELEKKLDELEE--ELAELLKELEELGFESVE 588
Query: 474 ETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEE 533
E + LK + + +E + ++ E ++ + ++ LD A +EL +++ E
Sbjct: 589 ELEERLKELEPFYNEYLELKDAEKEL-----EREEKELKKLEEELDKAFEELAETEKRLE 643
Query: 534 ELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQR------E 587
EL+K ++E E+ EE+Y L+EE LS++L + + L ++ R+++++ E
Sbjct: 644 ELRKELEE--LEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKE 701
Query: 588 NRKEIEGFLDNVRQITKEYSYQQAIIQKFIPTKYLELMTQNA 629
+E E + ++ K + + +K KY L+ + A
Sbjct: 702 ELEEREKAKKELEKLEKALERVEELREKV--KKYKALLKERA 741
Score = 37.0 bits (86), Expect = 0.036
Identities = 41/198 (20%), Positives = 89/198 (44%), Gaps = 25/198 (12%)
Query: 423 LLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRA 482
L + ++EI+ ++++++E R E EL++ +E E E K
Sbjct: 315 RLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERH----ELYEEAKAK 370
Query: 483 QSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHR----KQKEEELKKN 538
+ + + + ++L L + L E LE+ E ++ ++ ++ R K++ +ELKK
Sbjct: 371 KEELERLKKRLTGLTPEKL---EKELEELEKAKEEIEEEISKITARIGELKKEIKELKKA 427
Query: 539 IQE-KDAER--------IDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENR 589
I+E K A+ + E + L+E + +LK++ L +E+ + + R+
Sbjct: 428 IEELKKAKGKCPVCGRELTEEHRKELLEE----YTAELKRIEKELKEIEEKERKL-RKEL 482
Query: 590 KEIEGFLDNVRQITKEYS 607
+E+E L ++ K
Sbjct: 483 RELEKVLKKESELIKLKE 500
Score = 35.0 bits (81), Expect = 0.17
Identities = 42/220 (19%), Positives = 92/220 (41%), Gaps = 47/220 (21%)
Query: 394 EETISTLRYASRVKKICNKARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEPTPN 453
++ I L+ A +C + + ++ + LL +Y E+ ++K+L+E
Sbjct: 425 KKAIEELKKAKGKCPVCGR-ELTEEHRKELLEEYTAELKRIEKELKEI------------ 471
Query: 454 PEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEI 513
E E + EL+ +EK+ + +++ E + E+
Sbjct: 472 ---------------EEKERKLRKELRE--------LEKVLKKESELIKLKELAEQLKEL 508
Query: 514 QQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSM 573
+++L +ELE + ++ E+LK+ + + E ++++ L+E L KKL ++
Sbjct: 509 EEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEE----LKKKLAELEKK 564
Query: 574 LLAAKEERKDVQRENR-------KEIEGFLDNVRQITKEY 606
L +EE ++ +E +E+E L + EY
Sbjct: 565 LDELEEELAELLKELEELGFESVEELEERLKELEPFYNEY 604
Score = 33.5 bits (77), Expect = 0.48
Identities = 25/100 (25%), Positives = 57/100 (57%), Gaps = 5/100 (5%)
Query: 508 LEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKL 567
LEK E + + L+ +E+E +++ E L+ + ++ + + ++EE+ L++E E L +K+
Sbjct: 223 LEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKV 282
Query: 568 KKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKEYS 607
K++ + +E+ + + + E +LD +R+I K S
Sbjct: 283 KELKEL-----KEKAEEYIKLSEFYEEYLDELREIEKRLS 317
Score = 30.8 bits (70), Expect = 3.1
Identities = 25/93 (26%), Positives = 52/93 (55%), Gaps = 5/93 (5%)
Query: 477 AELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELK 536
ELK+ + + V++L+ L++K E ++ +E ++ LD +E+E R + EE
Sbjct: 269 EELKKEIEELEEKVKELKELKEKA----EEYIKLSEFYEEYLDEL-REIEKRLSRLEEEI 323
Query: 537 KNIQEKDAERIDMEEKYSSLQEESEGLSKKLKK 569
I+E+ E + EE+ L+++ + L K+L++
Sbjct: 324 NGIEERIKELEEKEERLEELKKKLKELEKRLEE 356
Score = 30.4 bits (69), Expect = 4.0
Identities = 21/98 (21%), Positives = 47/98 (47%), Gaps = 1/98 (1%)
Query: 505 ENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLS 564
EN + + + + LE ++ E +++ I+EK+ E ++ + + + E L
Sbjct: 161 ENAYKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELR 220
Query: 565 KKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQI 602
++L+K+ + EE K+ E KE+E + R++
Sbjct: 221 EELEKLEKEVKEL-EELKEEIEELEKELESLEGSKRKL 257
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 39.8 bits (94), Expect = 0.005
Identities = 25/103 (24%), Positives = 47/103 (45%), Gaps = 12/103 (11%)
Query: 505 ENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLS 564
L+ E ++ L+ A+E E ++ E+LK+ ++EEK LQEE + L
Sbjct: 519 NELIASLEELERELEQKAEEAEALLKEAEKLKE----------ELEEKKEKLQEEEDKLL 568
Query: 565 KKL-KKVSSMLLAAKEERKDVQRENR-KEIEGFLDNVRQITKE 605
++ K+ + AK+E ++ +E R + G+ E
Sbjct: 569 EEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIE 611
Score = 30.6 bits (70), Expect = 3.8
Identities = 34/128 (26%), Positives = 57/128 (44%), Gaps = 36/128 (28%)
Query: 474 ETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQ-QQLLDAA----------- 521
E +A LK A+ + + EK E LQ++ + LLE+AE + QQ + A
Sbjct: 538 EAEALLKEAEKLKEELEEKKEKLQEEE----DKLLEEAEKEAQQAIKEAKKEADEIIKEL 593
Query: 522 ------------AQELE--HRK------QKEEELKKNIQEKDAERIDMEEKYSSLQEESE 561
A EL ++ +KE++ KK ++++ ++ E KY SL ++ E
Sbjct: 594 RQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELKVGDEVKYLSLGQKGE 653
Query: 562 GLSKKLKK 569
LS K
Sbjct: 654 VLSIPDDK 661
>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
Length = 1486
Score = 39.9 bits (94), Expect = 0.005
Identities = 39/166 (23%), Positives = 72/166 (43%), Gaps = 20/166 (12%)
Query: 426 KYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSD 485
+YQ ++ EL+++LEE + E + + +E E +E E+E D ELK +D
Sbjct: 352 RYQADLEELEERLEEQN------EVVEEADEQQEENE---ARAEAAEEEVD-ELKSQLAD 401
Query: 486 HQSMVEKLELLQKKVLV--GGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKD 543
+Q + L++ Q + + LE+A +QL ++ + EE + QE
Sbjct: 402 YQ---QALDVQQTRAIQYQQAVQALERA---KQLCGLPDLTADNAEDWLEEFQAKEQEAT 455
Query: 544 AERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENR 589
E + +E+K S Q + + V +A + R + R
Sbjct: 456 EELLSLEQKLSVAQAAHSQFEQAYQLVRK--IAGEVSRSEAWDVAR 499
Score = 36.1 bits (84), Expect = 0.074
Identities = 24/114 (21%), Positives = 48/114 (42%), Gaps = 1/114 (0%)
Query: 469 EVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHR 528
E +L++AQ+ + L L+ + L E + Q L A E R
Sbjct: 998 EQERTRAREQLRQAQAQLAQYNQVLASLKSSYDAKRQMLQELKQELQDLGVPADSGAEER 1057
Query: 529 -KQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEER 581
+ + +EL + + R +E++ + + E + L+KKL+K+ +E+
Sbjct: 1058 ARARRDELHARLSANRSRRNQLEKQLTFCEAEMDNLTKKLRKLERDYHEMREQV 1111
Score = 31.1 bits (71), Expect = 2.3
Identities = 30/149 (20%), Positives = 57/149 (38%), Gaps = 43/149 (28%)
Query: 478 ELKRAQSDHQSMVEKLELLQKKVLVGGENLLEK--------------AEIQQQLLDAAAQ 523
+L Q M +L L E+ LE+ A QQ+ ++
Sbjct: 301 QLAAEQYRLVEMARELAEL-----NEAESDLEQDYQAASDHLNLVQTALRQQEKIERYQA 355
Query: 524 ELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKD 583
+LE +++ EE + ++E D ++ + E + + +EE + L +L ++ D
Sbjct: 356 DLEELEERLEEQNEVVEEADEQQEENEARAEAAEEEVDELKSQLADY--------QQALD 407
Query: 584 VQRENRKEIEGFLDNVRQITKEYSYQQAI 612
VQ+ T+ YQQA+
Sbjct: 408 VQQ----------------TRAIQYQQAV 420
>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family. This family of
proteins contain a band 4.1 domain (pfam00373), at their
amino terminus. This family represents the rest of these
proteins.
Length = 244
Score = 38.6 bits (90), Expect = 0.007
Identities = 47/209 (22%), Positives = 84/209 (40%), Gaps = 12/209 (5%)
Query: 401 RYASRVKKICNKARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDE 460
Y ++ K + ++ L K E+ E ++LEE + EE E E+DE
Sbjct: 30 EYEETALELEEKLKQEEEEAQLLEKK-ADELEEENRRLEEEAAA--SEEERERLEAEVDE 86
Query: 461 TEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDA 520
V E ++ +AE ++ Q + + E E ++++L E
Sbjct: 87 ATAEVAKLEEEREKKEAETRQLQQELREAQEAHERARQELL---EAAAAPTAPPHVAAPV 143
Query: 521 AAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEE 580
++LE EE + D + EE+ + E++E L +L+ + S L AA++E
Sbjct: 144 NGEQLEPDDNGEEASADLETDPDMKDRSEEERVTYA-EKNERLQTQLQALKSELAAARDE 202
Query: 581 RK----DVQ-RENRKEIEGFLDNVRQITK 604
K D+ EN + +RQI
Sbjct: 203 SKETANDILHEENVRAGRDKYKTLRQIRS 231
Score = 28.9 bits (65), Expect = 8.5
Identities = 30/189 (15%), Positives = 79/189 (41%), Gaps = 26/189 (13%)
Query: 426 KYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSD 485
+ ++E EL++++E+ E ++ + +E E E + +LK+ + +
Sbjct: 2 EAEREQQELEERMEQM-------------EEDMRRAQKELEEYEETALELEEKLKQEEEE 48
Query: 486 HQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAE 545
Q LE ++ L E+A ++ + E++ + +L++ ++K+AE
Sbjct: 49 AQ----LLEKKADELEEENRRLEEEAAASEEERERLEAEVDEATAEVAKLEEEREKKEAE 104
Query: 546 RIDMEEKYSSLQEESEGLSKKLKKV------SSMLLAAKEERKDVQRENRKEIEGFLD-- 597
++++ QE E ++L + + A + +N +E L+
Sbjct: 105 TRQLQQELREAQEAHERARQELLEAAAAPTAPPHVAAPVNGEQLEPDDNGEEASADLETD 164
Query: 598 -NVRQITKE 605
+++ ++E
Sbjct: 165 PDMKDRSEE 173
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 38.1 bits (89), Expect = 0.007
Identities = 19/76 (25%), Positives = 43/76 (56%), Gaps = 3/76 (3%)
Query: 516 QLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSK---KLKKVSS 572
+ L QEL +++ EL++ + E E +++++ S+L+ E E L K ++K++S+
Sbjct: 66 ERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSA 125
Query: 573 MLLAAKEERKDVQREN 588
+ EE ++++ E
Sbjct: 126 NAIELDEENRELREEL 141
Score = 36.1 bits (84), Expect = 0.030
Identities = 24/91 (26%), Positives = 47/91 (51%), Gaps = 5/91 (5%)
Query: 479 LKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQL--LDAAAQELEHRKQKEEELK 536
L Q + + E+L LQ+++ E E E++Q+L L+A + L+ + ++L
Sbjct: 68 LPELQQELAELQEELAELQEQL---AELQQENQELKQELSTLEAELERLQKELARIKQLS 124
Query: 537 KNIQEKDAERIDMEEKYSSLQEESEGLSKKL 567
N E D E ++ E+ + L++E+E L +
Sbjct: 125 ANAIELDEENRELREELAELKQENEALEAEN 155
Score = 29.6 bits (67), Expect = 3.9
Identities = 23/89 (25%), Positives = 41/89 (46%), Gaps = 8/89 (8%)
Query: 473 DETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKE 532
E +L Q ++Q + ++L L+ ++ E A I+Q L A A EL+ +
Sbjct: 83 AELQEQLAELQQENQELKQELSTLEAEL---ERLQKELARIKQ--LSANAIELDEENR-- 135
Query: 533 EELKKNIQEKDAERIDMEEKYSSLQEESE 561
EL++ + E E +E + LQE +
Sbjct: 136 -ELREELAELKQENEALEAENERLQENEQ 163
>gnl|CDD|234013 TIGR02785, addA_Gpos, helicase-exonuclease AddAB, AddA subunit,
Firmicutes type. AddAB, also called RexAB, substitutes
for RecBCD in several bacterial lineages. These DNA
recombination proteins act before synapse and are
particularly important for DNA repair of double-stranded
breaks by homologous recombination. The term AddAB is
used broadly, with AddA homologous between the
Firmicutes (as modeled here) and the
alphaproteobacteria, while the partner AddB proteins
show no strong homology across the two groups of species
[DNA metabolism, DNA replication, recombination, and
repair].
Length = 1230
Score = 39.3 bits (92), Expect = 0.008
Identities = 37/173 (21%), Positives = 69/173 (39%), Gaps = 22/173 (12%)
Query: 451 TPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEK 510
TPNPE L+ E EV E T LK Q + + +LE LQ+K L++
Sbjct: 180 TPNPEKWLNNL---AEAYEVKEKFTIESLKLQQQIKELLKNELEGLQEK--------LQR 228
Query: 511 AEIQQQLLDAAAQELEHRKQKEEELKKNIQEKD--AERIDMEEKYSSLQEESEGLSKKLK 568
A D A LE+ + + + + IQE A+ ++ + ++ + + + K
Sbjct: 229 ALELFMAEDGLAPRLENFQLDLQNIDELIQESLAQADWNELRKAVAAFKFK----NLKAA 284
Query: 569 KVSSMLLAAKEERKDVQRENRKEIEG-----FLDNVRQITKEYSYQQAIIQKF 616
K EE ++ E +K++E F + + + +++
Sbjct: 285 KGDEEDADLLEEADKLREEAKKQLEKLKTDYFTRSEEDHLRIMQEMKPVVKTL 337
>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase. This model represents the
seryl-tRNA synthetase found in most organisms. This
protein is a class II tRNA synthetase, and is recognized
by the pfam model tRNA-synt_2b. The seryl-tRNA
synthetases of two archaeal species, Methanococcus
jannaschii and Methanobacterium thermoautotrophicum,
differ considerably and are included in a different
model [Protein synthesis, tRNA aminoacylation].
Length = 418
Score = 38.9 bits (91), Expect = 0.009
Identities = 22/97 (22%), Positives = 44/97 (45%), Gaps = 6/97 (6%)
Query: 491 EKLELLQKKVLVGG---ENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERI 547
+L+++ + G + LEK E+E + K EL K I + ++
Sbjct: 9 NNPDLVKESLKARGLSVDIDLEKLIALDDERKKLLSEIEELQAKRNELSKQIGKAKGQKK 68
Query: 548 DMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDV 584
D E +++E + L ++L ++S+ L A + E +D
Sbjct: 69 DKIE---EIKKELKELKEELTELSAALKALEAELQDK 102
Score = 29.3 bits (66), Expect = 8.2
Identities = 20/93 (21%), Positives = 43/93 (46%), Gaps = 4/93 (4%)
Query: 524 ELEHRKQKEEELKKNIQEKDAER-IDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERK 582
+L K + + D E+ I ++++ L E E L K ++S + AK ++K
Sbjct: 12 DLVKESLKA---RGLSVDIDLEKLIALDDERKKLLSEIEELQAKRNELSKQIGKAKGQKK 68
Query: 583 DVQRENRKEIEGFLDNVRQITKEYSYQQAIIQK 615
D E +KE++ + + +++ +A +Q
Sbjct: 69 DKIEEIKKELKELKEELTELSAALKALEAELQD 101
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 157
Score = 36.8 bits (86), Expect = 0.011
Identities = 16/97 (16%), Positives = 50/97 (51%), Gaps = 12/97 (12%)
Query: 519 DAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAK 578
A + +++ ++L+ +Q+K+ E ++++ LQ+++ LS++ +K L K
Sbjct: 31 PAGKAAQKQLEKEFKKLQAELQKKEKE---LQKEEQKLQKQAATLSEEARKAKQQELQQK 87
Query: 579 EE---------RKDVQRENRKEIEGFLDNVRQITKEY 606
++ ++++Q++ ++ ++ D + + KE
Sbjct: 88 QQELQQKQQAAQQELQQKQQELLQPIYDKIDKAIKEV 124
>gnl|CDD|227250 COG4913, COG4913, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 1104
Score = 38.5 bits (89), Expect = 0.013
Identities = 30/183 (16%), Positives = 62/183 (33%), Gaps = 15/183 (8%)
Query: 399 TLR-YASRVKKICNKARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPE 457
L +R+KK + V Q + K + E ++++ + +
Sbjct: 287 ALPTVGNRIKKEEQETLVRQFTVEQTQAKSKVESAKIETDRAREMETLAHDNVKQIVGAQ 346
Query: 458 LDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQL 517
E + + + A++ ++V+ L G E AE +L
Sbjct: 347 HGILSAKREGAV----DKRRTISTARAGLDALVKGL----------GGAAPESAEELLEL 392
Query: 518 LDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAA 577
+AA ++ E L+ Q +R +++ + +E LSK + LL
Sbjct: 393 NNAARLTVDEYPAAREALESAGQRNVEDRTRAVDEFKAADQELSSLSKGSSNIEYRLLQV 452
Query: 578 KEE 580
+E
Sbjct: 453 REN 455
Score = 34.2 bits (78), Expect = 0.29
Identities = 37/207 (17%), Positives = 74/207 (35%), Gaps = 20/207 (9%)
Query: 428 QKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVI---EDETDAELKRAQS 484
Q++I+EL+ +LE + + LD + + E E A LK+
Sbjct: 673 QRQIAELQARLERLTHTQ---SDIAIAKAALDAAQTRQKVLERQYQQEVTECAGLKKDLK 729
Query: 485 DHQSMVEKLELLQKKVLVGG-ENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKD 543
+ K+ + K+ + G + L Q R + +L+K I +
Sbjct: 730 RAAMLSRKVHSIAKQGMTGALQALGAAHFPQVAPEQHDDIVDIERIEHRRQLQKRIDAVN 789
Query: 544 AERIDMEEK----YSSLQEESEG----LSKKLKKVSSMLLAAKEERKDVQRENRKEIEGF 595
A + E+ S ++E + +L + L + +D E +
Sbjct: 790 ARLRRLREEIIGRMSDAKKEDTAALSEVGAELDDIPEYLARLQTLTEDALPEFLARFQEL 849
Query: 596 L-----DNVRQITKEYSYQQAIIQKFI 617
L D V Q+ +++A+I++ I
Sbjct: 850 LNRSSDDGVTQLLSHLDHERALIEERI 876
Score = 32.3 bits (73), Expect = 1.3
Identities = 23/111 (20%), Positives = 43/111 (38%), Gaps = 1/111 (0%)
Query: 465 VEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENL-LEKAEIQQQLLDAAAQ 523
V+ ED ++ R Q ++ + E+ L + ++ L A+ Q L A +
Sbjct: 625 VKAMLSREDFYMIKIMRQQGEYIKLQEQANALAHIQALNFASIDLPSAQRQIAELQARLE 684
Query: 524 ELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSML 574
L H + K + + +E +Y E GL K LK+ + +
Sbjct: 685 RLTHTQSDIAIAKAALDAAQTRQKVLERQYQQEVTECAGLKKDLKRAAMLS 735
>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 140
Score = 36.4 bits (85), Expect = 0.013
Identities = 21/96 (21%), Positives = 44/96 (45%), Gaps = 7/96 (7%)
Query: 512 EIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKL-KKV 570
Q+QL + ++ E+EL+K ++ + + E E K+L KKV
Sbjct: 18 AAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLSE------AAREKKEKELQKKV 71
Query: 571 SSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKEY 606
++ ++D+Q+ ++E++ LD + + KE
Sbjct: 72 QEFQRKQQKLQQDLQKRQQEELQKILDKINKAIKEV 107
>gnl|CDD|222469 pfam13949, ALIX_LYPXL_bnd, ALIX V-shaped domain binding to HIV.
The binding of the LYPxL motif of late HIV p6Gag and
EIAV p9Gag to this domain is necessary for viral
budding.This domain is generally central between an
N-terminal Bro1 domain, pfam03097 and a C-terminal
proline-rich domain. The retroviruses thus used this
domain to hijack the ESCRT system of the cell.
Length = 292
Score = 37.7 bits (88), Expect = 0.014
Identities = 46/227 (20%), Positives = 87/227 (38%), Gaps = 45/227 (19%)
Query: 412 KARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVI 471
R + P L + ++E+ + + LE+AS
Sbjct: 67 GERWTRPPSSELTAELREELRKYRGYLEQAS----------------------------- 97
Query: 472 EDETDAELKRAQSDHQSMVEKL----ELLQKKVLVGGENLLEKAEIQQQLLDAAAQELE- 526
E+DA+L+ +H+ +E L E L+ + + + Q L +L
Sbjct: 98 --ESDAQLRSKLEEHEENLELLSGPEEELEALLPSSSPSKTPEVSEQISRLRELLNKLNE 155
Query: 527 ---HRKQKEEELKKNIQEKDAERIDME-EKYSSLQEE--SEGLSKKLKKVSSMLLAAKEE 580
R++ EELK+ Q+ D ++ + K S EE E KK + + +
Sbjct: 156 LKAEREKLLEELKEKAQDDDISKLLIALNKLGSSNEEQLFEEELKKFDPLQERVEQNLSK 215
Query: 581 RKDVQRENRKEIEGFLDNVRQITKEYSYQQAIIQKF--IPTKYLELM 625
++++ +E ++ E FL R+ E ++ ++QK KY EL
Sbjct: 216 QEELLKEIQEANEEFL-QERKSDSEQKEREKVLQKLENAYDKYKELS 261
>gnl|CDD|218704 pfam05701, DUF827, Plant protein of unknown function (DUF827).
This family consists of several plant proteins of
unknown function. Several sequences in this family are
described as being "myosin heavy chain-like".
Length = 484
Score = 37.7 bits (87), Expect = 0.023
Identities = 51/229 (22%), Positives = 105/229 (45%), Gaps = 25/229 (10%)
Query: 419 PKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAE 478
K+ L+ Q+E ++LKK+LE A E+ EL+ T+ VE ++ ++ + E
Sbjct: 33 VKETELMLAQEEANKLKKELEVA------EKEKLQVLKELESTKRTVEDLKLKLEKAEKE 86
Query: 479 LKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKN 538
++A+ D + + E L++ + E +E+ LD+ +EL +Q+ + L
Sbjct: 87 EQQAKQDSELAKLRAEELEQGI---QELEVERYITATAELDSVKEELRKIRQEYDAL--- 140
Query: 539 IQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSM----------LLAAKEERKDVQREN 588
++E+DA EE + + + + + K++ +M L A+EER E
Sbjct: 141 VEERDAALKRAEEAICASKVNEKKVEELTKEIIAMKESLERAHAAHLEAEEERIGAALEK 200
Query: 589 RKEIEGFLDNVRQITKEYSYQQAIIQKFIPTKYLELMTQNATWNEDIGE 637
++ E + +++ KE + + Q P K LE + + + E + +
Sbjct: 201 DQDRETYEKELKEAEKE---LERLKQDLDPEKDLEKLAEASAELESLQK 246
Score = 29.6 bits (66), Expect = 6.2
Identities = 44/176 (25%), Positives = 79/176 (44%), Gaps = 14/176 (7%)
Query: 428 QKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIE--DETDAELKRAQSD 485
+ E + LE+ E+ E EL+ + ++P + +E E AEL+ Q +
Sbjct: 188 EAEEERIGAALEKDQDRETYEKELKEAEKELERLKQDLDPEKDLEKLAEASAELESLQKE 247
Query: 486 HQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAA---QELEHRKQKEEELKKNIQEK 542
M + + K++ NL + AE ++ L + QELE K++ EEL++ E
Sbjct: 248 ISIM-ASVASVLKELEEAKANLEKAAEEEKSLRNLVESLKQELEEEKKELEELREKEGEA 306
Query: 543 DAERIDME--------EKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRK 590
+ +E EK S +E+ + KL++ SS A++E + + E RK
Sbjct: 307 EEAASSLEAELNRTKSEKESKAREKMVEIPLKLQQASSEAEEARKEAEAAREELRK 362
Score = 29.2 bits (65), Expect = 8.9
Identities = 40/155 (25%), Positives = 71/155 (45%), Gaps = 15/155 (9%)
Query: 444 PPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVG 503
P R E E EL + E +++ ++ AE ++ Q +++LE ++ V
Sbjct: 24 LPQRSERVLVKETELMLAQ--EEANKLKKELEVAEKEKLQV-----LKELESTKRTV--- 73
Query: 504 GENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAER-IDMEEKYSSLQEESEG 562
E+L K E ++ A Q+ E K + EEL++ IQE + ER I + S++EE
Sbjct: 74 -EDLKLKLEKAEKEEQQAKQDSELAKLRAEELEQGIQELEVERYITATAELDSVKEELRK 132
Query: 563 LSKKLKKVSSMLLAA---KEERKDVQRENRKEIEG 594
+ ++ + AA EE + N K++E
Sbjct: 133 IRQEYDALVEERDAALKRAEEAICASKVNEKKVEE 167
>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding. This
family is the highly conserved central region of a
number of metazoan proteins referred to as growth-arrest
proteins. In mouse, Gas8 is predominantly a testicular
protein, whose expression is developmentally regulated
during puberty and spermatogenesis. In humans, it is
absent in infertile males who lack the ability to
generate gametes. The localisation of Gas8 in the
motility apparatus of post-meiotic gametocytes and
mature spermatozoa, together with the detection of Gas8
also in cilia at the apical surfaces of epithelial cells
lining the pulmonary bronchi and Fallopian tubes
suggests that the Gas8 protein may have a role in the
functioning of motile cellular appendages. Gas8 is a
microtubule-binding protein localised to regions of
dynein regulation in mammalian cells.
Length = 201
Score = 36.4 bits (85), Expect = 0.028
Identities = 21/67 (31%), Positives = 31/67 (46%), Gaps = 3/67 (4%)
Query: 524 ELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLK---KVSSMLLAAKEE 580
E+ K+ EE +K + E E + E +EE E L KKLK K L K
Sbjct: 35 EIAEMKKNEEHNEKLMAEIAQENKRLVEPLKKAEEEVEELRKKLKDYEKDKQSLKNLKAR 94
Query: 581 RKDVQRE 587
K++++E
Sbjct: 95 LKELEKE 101
Score = 34.1 bits (79), Expect = 0.13
Identities = 28/116 (24%), Positives = 55/116 (47%), Gaps = 21/116 (18%)
Query: 487 QSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQ--EKD- 543
+S+ E++ ++K N AEI Q+ + L+ +++ EEL+K ++ EKD
Sbjct: 30 KSLKEEIAEMKKNEE---HNEKLMAEIAQENKRLV-EPLKKAEEEVEELRKKLKDYEKDK 85
Query: 544 -------AERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEI 592
A ++E++ +L+ ESE L ++ +KV ER ++ + I
Sbjct: 86 QSLKNLKARLKELEKELKNLKWESEVLEQRFEKVE-------RERDELYDKFEAAI 134
>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200). This
family is found in eukaryotes. It is a coiled-coil
domain of unknwon function.
Length = 126
Score = 35.3 bits (82), Expect = 0.028
Identities = 30/112 (26%), Positives = 57/112 (50%), Gaps = 20/112 (17%)
Query: 515 QQLLDAAAQELEHR----KQKEEELKKN--------------IQEKDAERIDMEEKYSSL 556
Q LDA +E E R KQ+EEEL+K ++E +A+R E+K
Sbjct: 13 QLALDAKREEFERREELLKQREEELEKKEEELQESLIKFDKFLKENEAKRRRAEKKAEEE 72
Query: 557 QEESEGLSKKLKKVSSML--LAAKEERKDVQRENRKEIEGFLDNVRQITKEY 606
++ + +++K++ + L L A+ E+ + + E + E FL++V + + E+
Sbjct: 73 KKLRKEKEEEIKELKAELEELKAEIEKLEEKLEEYQPYEEFLESVVERSPEF 124
>gnl|CDD|148614 pfam07106, TBPIP, Tat binding protein 1(TBP-1)-interacting protein
(TBPIP). This family consists of several eukaryotic
TBP-1 interacting protein (TBPIP) sequences. TBP-1 has
been demonstrated to interact with the human
immunodeficiency virus type 1 (HIV-1) viral protein Tat,
then modulate the essential replication process of HIV.
In addition, TBP-1 has been shown to be a component of
the 26S proteasome, a basic multiprotein complex that
degrades ubiquitinated proteins in an ATP-dependent
fashion. Human TBPIP interacts with human TBP-1 then
modulates the inhibitory action of human TBP-1 on
HIV-Tat-mediated transactivation.
Length = 169
Score = 35.7 bits (83), Expect = 0.029
Identities = 19/84 (22%), Positives = 39/84 (46%), Gaps = 7/84 (8%)
Query: 529 KQKEEELKKNIQEK--DAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAK-----EER 581
Q + EL + + D E ++ E+ L+++ L +LK ++S L + +E
Sbjct: 62 NQDQFELPSDEELNKLDMEIEELREEVQLLKQDCSTLEIELKSLTSDLTTEELQEEIQEL 121
Query: 582 KDVQRENRKEIEGFLDNVRQITKE 605
K RE +++E + + +T E
Sbjct: 122 KKEVREIEEKLESLEEGWKPVTPE 145
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 36.4 bits (85), Expect = 0.047
Identities = 29/122 (23%), Positives = 56/122 (45%), Gaps = 16/122 (13%)
Query: 473 DETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKE 532
+E + + + +L+ L++++L E L K E LD + LE ++++
Sbjct: 58 EEVHKLRAELERELKERRNELQRLERRLLQREETLDRKME----SLDKKEENLEKKEKEL 113
Query: 533 EELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLA-AKEE-RKDVQRENRK 590
+KN+ EK EE+ L E ++L+++S + AKE ++V+ E R
Sbjct: 114 SNKEKNLDEK-------EEELEELIAEQR---EELERISGLTQEEAKEILLEEVEEEARH 163
Query: 591 EI 592
E
Sbjct: 164 EA 165
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 36.1 bits (84), Expect = 0.060
Identities = 32/182 (17%), Positives = 79/182 (43%), Gaps = 28/182 (15%)
Query: 393 YEETISTLRYASRVKKICNKARVNQDPKD------ALLIKYQKEISELKKKLEEASVPPP 446
YE I + ++ I + + N + K ALL ++ + + +KLEE
Sbjct: 294 YESHIE--KALEELESILDTEKENSEFKLDVEELKALLEALEEILEKNLQKLEE-----K 346
Query: 447 REEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGEN 506
++P+ + E E T+ +++I+ + +H ++ L+ + K +
Sbjct: 347 LKDPSTSIELE-SITDLIESINDIIDAINELI-----REHNEKIDNLKKEKNKA----KK 396
Query: 507 LLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKK 566
L + + ++++ +++++ L+K I + E +E + +L++E + L K+
Sbjct: 397 KLWLHLVAE-----LKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIKALEKEIKELEKQ 451
Query: 567 LK 568
L
Sbjct: 452 LT 453
>gnl|CDD|191413 pfam05970, PIF1, PIF1-like helicase. This family includes
homologues of the PIF1 helicase, which inhibits
telomerase activity and is cell cycle regulated. This
family includes a large number of largely
uncharacterized plant proteins. This family includes a
P-loop motif that is involved in nucleotide binding.
Length = 364
Score = 35.8 bits (83), Expect = 0.062
Identities = 13/39 (33%), Positives = 22/39 (56%), Gaps = 4/39 (10%)
Query: 108 NSKQLDIYNETARPIVDKVLSGFNGTIFAYGQTGTGKTF 146
N +Q ++++ I++ V + G F YG GTGKT+
Sbjct: 3 NDEQKSVFDQ----IIESVDNNLGGVFFVYGYGGTGKTY 37
>gnl|CDD|202101 pfam02050, FliJ, Flagellar FliJ protein.
Length = 122
Score = 34.2 bits (79), Expect = 0.063
Identities = 31/122 (25%), Positives = 59/122 (48%), Gaps = 24/122 (19%)
Query: 478 ELKRAQSDHQSMVEKLELLQK------KVLVGGENLLEKAEIQ--QQLLDAAAQELEHRK 529
L AQ + + EKLE LQ+ + L G + AE++ Q + A + + ++
Sbjct: 6 NLAEAQQELEQAEEKLEELQQYRLEYRQQLSGSGQGISAAELRNYQAFISALDEAIAQQQ 65
Query: 530 QKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENR 589
Q+ E+ +K +++ E++ +E KKL+K L +E++K+ + ENR
Sbjct: 66 QELEQAEKQVEQA-------REQWQEANQE----RKKLEK-----LLEREKKKEQKEENR 109
Query: 590 KE 591
+E
Sbjct: 110 RE 111
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 34.9 bits (81), Expect = 0.068
Identities = 22/91 (24%), Positives = 44/91 (48%), Gaps = 6/91 (6%)
Query: 515 QQLLDAAAQELEHRKQKEEELKKNIQ------EKDAERIDMEEKYSSLQEESEGLSKKLK 568
+ L+ +ELE KQ+ EL+ I+ E+ ER ++ E+ L++E + L +L+
Sbjct: 68 KTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELKKLKAELE 127
Query: 569 KVSSMLLAAKEERKDVQRENRKEIEGFLDNV 599
K E+ K+ + ++ + DN+
Sbjct: 128 KYEKNDPERIEKLKEETKVAKEAANRWTDNI 158
>gnl|CDD|191111 pfam04849, HAP1_N, HAP1 N-terminal conserved region. This family
represents an N-terminal conserved region found in
several huntingtin-associated protein 1 (HAP1)
homologues. HAP1 binds to huntingtin in a polyglutamine
repeat-length-dependent manner. However, its possible
role in the pathogenesis of Huntington's disease is
unclear. This family also includes a similar N-terminal
conserved region from hypothetical protein products of
ALS2CR3 genes found in the human juvenile amyotrophic
lateral sclerosis critical region 2q33-2q34.
Length = 307
Score = 35.6 bits (82), Expect = 0.073
Identities = 34/143 (23%), Positives = 61/143 (42%), Gaps = 21/143 (14%)
Query: 466 EPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKV-LVGGEN--LLEKA----------- 511
E SE E+ + S +LE LQ+K+ L+ EN L +A
Sbjct: 136 EESEDESSESTPLRPQESSSSSHGCFQLEALQEKLKLLEEENEHLRSEASHLKTETVTYE 195
Query: 512 EIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLK--- 568
E +QQL++ ++L + L + + +K + +E+ + L + L KK K
Sbjct: 196 EKEQQLVNDCVKQLREANDQIASLSEELAKKTEDLERQQEEITHLLSQIVDLQKKCKSYA 255
Query: 569 ----KVSSMLLAAKEERKDVQRE 587
++ L AAK+ ++ +Q E
Sbjct: 256 LENEELQQHLAAAKDAQRQLQAE 278
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found
in cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 35.4 bits (82), Expect = 0.079
Identities = 23/130 (17%), Positives = 58/130 (44%), Gaps = 7/130 (5%)
Query: 476 DAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEEL 535
EL+ S + ++ + L++++ + E + LD A ++L+ Q+
Sbjct: 164 MKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDCDPTELDRAKEKLKKLLQEIMIK 223
Query: 536 KKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGF 595
K ++E EE+ L+ + E L+ K ++++ + A+++ + + KEIE
Sbjct: 224 VKKLEEL-------EEELQELESKIEDLTNKKSELNTEIAEAEKKLEQCRGFTFKEIEKL 276
Query: 596 LDNVRQITKE 605
+ ++ +
Sbjct: 277 KEQLKLLQSL 286
Score = 34.6 bits (80), Expect = 0.13
Identities = 19/92 (20%), Positives = 46/92 (50%), Gaps = 9/92 (9%)
Query: 522 AQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKV-----SSMLLA 576
+ LE K+ + L K ++ ++ + + ++ +L+EE L K+L+ + L
Sbjct: 150 DENLEGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQL-KQLEDELEDCDPTELDR 208
Query: 577 AKEERKDVQREN---RKEIEGFLDNVRQITKE 605
AKE+ K + +E K++E + ++++ +
Sbjct: 209 AKEKLKKLLQEIMIKVKKLEELEEELQELESK 240
>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
autophagy-related subunit 14. The Atg14 or Apg14
proteins are hydrophilic proteins with a predicted
molecular mass of 40.5 kDa, and have a coiled-coil motif
at the N terminus region. Yeast cells with mutant Atg14
are defective not only in autophagy but also in sorting
of carboxypeptidase Y (CPY), a vacuolar-soluble
hydrolase, to the vacuole. Subcellular fractionation
indicate that Apg14p and Apg6p are peripherally
associated with a membrane structure(s). Apg14p was
co-immunoprecipitated with Apg6p, suggesting that they
form a stable protein complex. These results imply that
Apg6/Vps30p has two distinct functions: in the
autophagic process and in the vacuolar protein sorting
pathway. Apg14p may be a component specifically required
for the function of Apg6/Vps30p through the autophagic
pathway. There are 17 auto-phagosomal component proteins
which are categorized into six functional units, one of
which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
AS-PI3K complex and the Atg2-Atg18 complex are essential
for nucleation, and the specific function of the AS-PI3K
apparently is to produce phosphatidylinositol
3-phosphate (PtdIns(3)P) at the pre-autophagosomal
structure (PAS). The localisation of this complex at the
PAS is controlled by Atg14. Autophagy mediates the
cellular response to nutrient deprivation, protein
aggregation, and pathogen invasion in humans, and
malfunction of autophagy has been implicated in multiple
human diseases including cancer. This effect seems to be
mediated through direct interaction of the human Atg14
with Beclin 1 in the human phosphatidylinositol 3-kinase
class III complex.
Length = 307
Score = 35.0 bits (81), Expect = 0.096
Identities = 22/108 (20%), Positives = 44/108 (40%), Gaps = 9/108 (8%)
Query: 485 DHQSMVEKLELLQKKV---LVGGENLLEKAEIQQQLLDAAAQE--LEHRKQKEEELKKNI 539
D ++ + E L++KV L G N K L+ A ++ L + + +LK+ I
Sbjct: 28 DLARLLLENEELKQKVEEALEGATNEDGKLAADLLKLEVARKKERLNQIRARISQLKEEI 87
Query: 540 QEK----DAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKD 583
++K + + + ++ S L S L K+ L + +
Sbjct: 88 EQKRERIEELKRALAQRRSDLSSASYQLEKRRASQLEKLQDEIKRTRS 135
>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3. This family
consists of several radial spoke protein 3 (RSP3)
sequences. Eukaryotic cilia and flagella present in
diverse types of cells perform motile, sensory, and
developmental functions in organisms from protists to
humans. They are centred by precisely organised,
microtubule-based structures, the axonemes. The axoneme
consists of two central singlet microtubules, called the
central pair, and nine outer doublet microtubules. These
structures are well-conserved during evolution. The
outer doublet microtubules, each composed of A and B
sub-fibres, are connected to each other by nexin links,
while the central pair is held at the centre of the
axoneme by radial spokes. The radial spokes are T-shaped
structures extending from the A-tubule of each outer
doublet microtubule to the centre of the axoneme. Radial
spoke protein 3 (RSP3), is present at the proximal end
of the spoke stalk and helps in anchoring the radial
spoke to the outer doublet. It is thought that radial
spokes regulate the activity of inner arm dynein through
protein phosphorylation and dephosphorylation.
Length = 288
Score = 35.0 bits (81), Expect = 0.10
Identities = 53/223 (23%), Positives = 91/223 (40%), Gaps = 58/223 (26%)
Query: 405 RVKKICNKARVNQDPKDALLIKYQKEISEL---KKKLEEASVPPPREEPTPNPEPELD-- 459
RV + A+ +D I+ Q+E + KK+ + + P+ TP P
Sbjct: 11 RVVRGNTYAQEIIPIRDGDEIQRQQEARQRALMKKRTKR--IQNPKIMGTPPPVEGRKHI 68
Query: 460 --ETEPCVE------PSEVIEDETDAELKRAQS----------DHQSMVEKLEL------ 495
+TEP +E P E +TD L R + D ++ +E+ EL
Sbjct: 69 DVQTEPYLEELTDKIPEADAETQTDFFLDRPPTPLYVPAKTGIDVETQIEEGELFDFDLE 128
Query: 496 ----LQKKVLVGGENLLEKA--EIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDM 549
L+ VLVG LE+A E+ ++ +EL +Q++ + ++ + AE +
Sbjct: 129 VEPILE--VLVG--KTLEQALLEVLEE------EELAELRQQQRQFEQRRNAELAETQRL 178
Query: 550 EEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEI 592
EE +EE E K+ K ERK ++E ++I
Sbjct: 179 EEAERRRREEKERRKKQDK-----------ERKQREKETAEKI 210
>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 438
Score = 35.2 bits (81), Expect = 0.11
Identities = 32/140 (22%), Positives = 61/140 (43%), Gaps = 12/140 (8%)
Query: 468 SEVIEDETDAELKRAQSDHQSMV-EKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELE 526
S ++ + E + + Q +K EL Q + E L+ + + Q+ L + ++ E
Sbjct: 38 SRILNTLEEFEKEANEKRAQYRSAKKKELSQLE-----EQLINQKKEQKNLFNEQIKQFE 92
Query: 527 HRKQKE----EELKKNIQEKDAERIDMEEKYSSLQEE-SEGLSKKLKKVSSMLLAAKEER 581
Q E E L+ EKD E +E++ L +E + L + + K E
Sbjct: 93 LALQDEIAKLEALELLNLEKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNKNEE 152
Query: 582 KDVQRENRKEIEGFLDNVRQ 601
+ ++ EN K++E L+ R+
Sbjct: 153 R-LKFENEKKLEESLELERE 171
>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
Length = 925
Score = 35.4 bits (82), Expect = 0.11
Identities = 25/128 (19%), Positives = 50/128 (39%), Gaps = 4/128 (3%)
Query: 469 EVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHR 528
I D + + + ++ L + ++++L+ E L E Q+ L EL
Sbjct: 798 SDINDRPQELIDQIIESEEERLKALRIQREEMLMRPEELELINEEQKNLKQEIKLELSE- 856
Query: 529 KQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEE---RKDVQ 585
Q+ EE +NI E E ++ + L + +K V S ++E +K +
Sbjct: 857 IQEAEEEIQNINENKNEFVEFKNDPKKLNKLIIAKDVLIKLVISSDEIKQDEKTTKKKKK 916
Query: 586 RENRKEIE 593
+ + E
Sbjct: 917 DLEKTDEE 924
>gnl|CDD|221371 pfam12004, DUF3498, Domain of unknown function (DUF3498). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 433 to 538 amino acids in length. This domain is
found associated with pfam00616, pfam00168. This domain
has two conserved sequence motifs: DLQ and PLSFQNP.
Length = 489
Score = 35.5 bits (81), Expect = 0.11
Identities = 35/144 (24%), Positives = 64/144 (44%), Gaps = 22/144 (15%)
Query: 444 PPPREEPTPNPEPELDETEPCV-EPSEVIEDETDAELKRAQSDHQSMVE-KLELLQKKVL 501
++P+P L+ T V S+ + E + E R + E ++ L++++
Sbjct: 321 AGHLQQPSPVNMSALERTAAWVLMNSQYEDIEYETEENREEGTQAEKYEQEIARLKERLR 380
Query: 502 VGGENLLEKAEIQQQLLDAAAQELEHR-KQKEEELKKNIQEKDAERIDMEEKYSSLQEES 560
V L +E E R +E++++K +QE A D EE+ QEE
Sbjct: 381 VSVRRL---------------EEYERRLLGQEQQMQKLLQEYQARLEDSEERLRRQQEEK 425
Query: 561 EGLSKKLKKVSSMLLAAKEE-RKD 583
+ ++K + S L+A +EE +KD
Sbjct: 426 DS---QMKSIISRLMAVEEELKKD 446
>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
Provisional.
Length = 1123
Score = 35.3 bits (82), Expect = 0.12
Identities = 17/89 (19%), Positives = 42/89 (47%), Gaps = 10/89 (11%)
Query: 505 ENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLS 564
L ++ E+Q + + E ++Q+ L+ E + ++ ++E + LQE++ S
Sbjct: 152 LTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQLEQLQEKAAETS 211
Query: 565 KKLKKVSSMLLAAKEERKDVQRENRKEIE 593
++ K++RK++ + K +E
Sbjct: 212 QE----------RKQKRKEITDQAAKRLE 230
Score = 29.9 bits (68), Expect = 5.5
Identities = 20/109 (18%), Positives = 51/109 (46%), Gaps = 4/109 (3%)
Query: 444 PPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVG 503
P P +PE L + V + + E ++Q+ ++ ++L L+
Sbjct: 130 KPGPFVPPEDPENLLHALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAEL 189
Query: 504 GENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEK 552
E ++ E++ QL + E ++++++ K+ I ++ A+R+++ E+
Sbjct: 190 EE---KQQELEAQLEQLQEKAAETSQERKQKRKE-ITDQAAKRLELSEE 234
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 34.6 bits (80), Expect = 0.13
Identities = 23/120 (19%), Positives = 51/120 (42%), Gaps = 14/120 (11%)
Query: 505 ENLLEKAEIQQQLLDAAAQELEHRK------QKEEELKKNIQEKDAERIDMEEKYSSLQE 558
++LL ++ + + E RK + E L K ++ + E D+E + S L+
Sbjct: 7 KSLLAIQKLDLEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLES 66
Query: 559 ESEGLSKKLKKVSSMLLAAKEER--------KDVQRENRKEIEGFLDNVRQITKEYSYQQ 610
E + + +++K+ L A K+ER + +E +E L + + ++ +
Sbjct: 67 EIQEIRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEI 126
Score = 31.2 bits (71), Expect = 1.4
Identities = 38/210 (18%), Positives = 93/210 (44%), Gaps = 28/210 (13%)
Query: 472 EDETDAELKRAQSDHQSMVEKLELLQKKVL-VGGENLLEKAEIQ---------QQLLDAA 521
E LK+A+++ +++ + LE L+ ++ + + ++EIQ ++ L A
Sbjct: 26 IKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAV 85
Query: 522 --AQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKE 579
+EL + + K+ I + E ++ E+ L++E E L ++L++
Sbjct: 86 KDERELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLER---------- 135
Query: 580 ERKDVQRENRKEIEGFLDNVRQITKEYSYQQAIIQKFIPTKYLELMTQNATWNEDIGEWE 639
K++ E +E + +R+ +E S ++ +++ + + L + + +G
Sbjct: 136 LEKNLA-EAEARLEEEVAEIREEGQELSSKREELKEKLDPELLSEYERIRKNKKGVGVVP 194
Query: 640 LKCVAYTGNNMRKIVPLPEDTTATQIAQPD 669
L+ G +M+ LP T +++ + D
Sbjct: 195 LEGRVCGGCHMK----LP-SQTLSKVRKKD 219
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 34.3 bits (79), Expect = 0.15
Identities = 27/153 (17%), Positives = 61/153 (39%), Gaps = 16/153 (10%)
Query: 460 ETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLL------EKAEI 513
++E + + A S + +++++L +++ EN L + ++
Sbjct: 46 DSESSSNSVPSLSLSSTASSLSDSSTYSRSLKEVKL-ERQAQEAYENWLSAKQAQRQKKL 104
Query: 514 QQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKK-----LK 568
Q+ L + QE E +++ E ++ +EK E + + Q + K
Sbjct: 105 QKLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQ----QAAKQRTPKHKKEAAES 160
Query: 569 KVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQ 601
SS+ +AK ER Q E +K ++ + +
Sbjct: 161 ASSSLSGSAKPERNVSQEEAKKRLQEWELKKLK 193
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 34.5 bits (80), Expect = 0.15
Identities = 25/131 (19%), Positives = 68/131 (51%), Gaps = 6/131 (4%)
Query: 469 EVIEDETDAELKRAQSDHQSMVEK--LELLQKKVLVGGENLLEKA---EIQQQLLDAAAQ 523
E+E + + K + + ++++++ E +++ E L E+ EI +++ +
Sbjct: 55 LAEEEERERKRKEERREGRAVLQEQIEEREKRRQEEYEERLQEREQMDEIIERIQEEDEA 114
Query: 524 ELEHRKQKEEELKKNIQEKDAERIDM-EEKYSSLQEESEGLSKKLKKVSSMLLAAKEERK 582
E + +++K+++L++ I E + ERI+ EE+ +EE + + ++ + + ER+
Sbjct: 115 EAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEELKILEYQREKAEREEEREAERR 174
Query: 583 DVQRENRKEIE 593
+ + E +E+
Sbjct: 175 ERKEEKEREVA 185
Score = 33.0 bits (76), Expect = 0.57
Identities = 28/137 (20%), Positives = 58/137 (42%), Gaps = 17/137 (12%)
Query: 477 AELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQE---LEHRKQKEE 533
AE K + M+E+ L E E+ +++ A + E K+++E
Sbjct: 34 AEEKEEERRIDEMMEEERLKALAE----EEERERKRKEERREGRAVLQEQIEEREKRRQE 89
Query: 534 ELKKNIQEKDA-----ERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQREN 588
E ++ +QE++ ERI E+ + E++ +K KK+ + EER + + E
Sbjct: 90 EYEERLQEREQMDEIIERIQEED-----EAEAQEKREKQKKLREEIDEFNEERIERKEEE 144
Query: 589 RKEIEGFLDNVRQITKE 605
++ + + +E
Sbjct: 145 KEREREEELKILEYQRE 161
Score = 29.1 bits (66), Expect = 7.9
Identities = 24/120 (20%), Positives = 50/120 (41%), Gaps = 20/120 (16%)
Query: 476 DAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEEL 535
D ++R Q + + E E +K+ + EI + + ++ E ++++ EE
Sbjct: 102 DEIIERIQEEDE--AEAQEKREKQ-------KKLREEIDEFNEERIERKEEEKEREREEE 152
Query: 536 KKNIQ---EKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEI 592
K ++ EK + E + +EE E + L A++E + +RE E+
Sbjct: 153 LKILEYQREKAEREEEREAERRERKEEKEREVAR--------LRAQQEEAEDEREELDEL 204
>gnl|CDD|235638 PRK05896, PRK05896, DNA polymerase III subunits gamma and tau;
Validated.
Length = 605
Score = 34.8 bits (80), Expect = 0.15
Identities = 36/207 (17%), Positives = 84/207 (40%), Gaps = 24/207 (11%)
Query: 434 LKKKLEEASVPPPREEPTPNPEP---ELDETEPCVEPSEVIEDETDAEL-----KRAQSD 485
L+ K E S+ P ++ + +++ E + S + ++ K Q++
Sbjct: 368 LENKKIEISIKPEKKTINNSQIITTKKINIVEESNKNSVHFDTLYKTKIFYHKKKINQNN 427
Query: 486 HQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAE 545
+ ++K ELL+K+ + E + + E+ L + A Q+ K+ EL KN+ +K +
Sbjct: 428 KEQDIKKEELLEKEFVKKSEKIPKNDELLDNL-ELAKQKF---FNKDIELSKNMLQKFNK 483
Query: 546 RIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKE 605
+ E +S + + + VS L E G ++ + ++ +
Sbjct: 484 FKNEESAENSF-IDIIKQADLVLWVSPNALILGVED-----------LGLIELINKVNES 531
Query: 606 YSYQQAIIQKFIPTKYLELMTQNATWN 632
+ + + I +KF K + +++
Sbjct: 532 FEFIKTIFKKFNKNKLVIAISKKQATK 558
>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
system, R (restriction) subunit and related helicases
[Defense mechanisms].
Length = 962
Score = 35.1 bits (81), Expect = 0.16
Identities = 42/229 (18%), Positives = 84/229 (36%), Gaps = 30/229 (13%)
Query: 392 NYEETISTLRYASRVKKICNKARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEPT 451
E TL + + KAR+ +D + L +++ ++ ++ S ++
Sbjct: 744 KLYEFAYTLEEEIQYLEDLLKARLEKDYQSKYLDLI-DNLNDAREIKDKESKAIDLDDID 802
Query: 452 PNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKL-ELLQKKVLVGGENLLEK 510
E E + I D L+ D E L EL+++ + E+L +K
Sbjct: 803 -------FELELIGKQEINI-DYILELLQTFN-DKNGAYESLKELIERIIKEWIEDLRQK 853
Query: 511 AEIQQQLLDAAAQ-------------ELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQ 557
++ ++L++A Q EL +KEEE K+ +E E ++ EE
Sbjct: 854 KKLIERLIEAINQYRAKKLDTAEKLEELYILAKKEEEFKQFAEE---EGLNEEELAFYDD 910
Query: 558 EESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKEY 606
G + AKE+ + ++ + + + V K +
Sbjct: 911 LALNGGKLPENGTELVEKLAKEKSLREKNKDDWKAK---EEVEAKLKAF 956
>gnl|CDD|114855 pfam06160, EzrA, Septation ring formation regulator, EzrA. During
the bacterial cell cycle, the tubulin-like cell-division
protein FtsZ polymerises into a ring structure that
establishes the location of the nascent division site.
EzrA modulates the frequency and position of FtsZ ring
formation.
Length = 559
Score = 34.8 bits (81), Expect = 0.16
Identities = 37/158 (23%), Positives = 71/158 (44%), Gaps = 31/158 (19%)
Query: 424 LIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEP--------SEVIEDET 475
L +++I + LEE +LDE E E +++E E
Sbjct: 254 LQDLKEQIDQNLALLEEL---------------DLDEAEEENEEIEERIDTLYDILEKEV 298
Query: 476 DAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEEL 535
A+ K + + + + LE +++ + LL + + QQ ELE ++ E++L
Sbjct: 299 KAK-KFVEKNIDKLTDFLEHAREQN----KQLLLELDRLQQSYTLNEDELETVRELEKQL 353
Query: 536 KK--NIQEKDAERIDMEEK-YSSLQEESEGLSKKLKKV 570
++ ++ ERI ++ YS LQEE E + K+L+++
Sbjct: 354 EELEAQYDQLVERIAEKKVAYSELQEELEEIEKQLEEI 391
>gnl|CDD|217443 pfam03234, CDC37_N, Cdc37 N terminal kinase binding. Cdc37 is a
molecular chaperone required for the activity of
numerous eukaryotic protein kinases. This domain
corresponds to the N terminal domain which binds
predominantly to protein kinases and is found N terminal
to the Hsp (Heat shocked protein) 90-binding domain
pfam08565. Expression of a construct consisting of only
the N-terminal domain of Saccharomyces pombe Cdc37
results in cellular viability. This indicates that
interactions with the cochaperone Hsp90 may not be
essential for Cdc37 function.
Length = 172
Score = 33.6 bits (77), Expect = 0.18
Identities = 34/129 (26%), Positives = 51/129 (39%), Gaps = 18/129 (13%)
Query: 523 QELEHRKQKEEELKKNIQEKDA--ERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEE 580
+ E RKQ+ EEL+K D ER+D + S L+EES S+ + K + KE
Sbjct: 39 ERREQRKQEIEELEKERIMNDRLLERVD--KLLSELKEESLDSSQAVMKSLNENFTDKEN 96
Query: 581 RKDVQRENRKEIEGFLDNVRQITKEYSYQQAI------------IQKFIPTKYLELMTQN 628
+ Q + +E D V+ E + I QK + K EL +
Sbjct: 97 VEPEQPTYNEMVEDLFDQVKDEVDEKNGAALIEELQKHRDKLKKEQKELLKKLDELEKEE 156
Query: 629 AT--WNEDI 635
W+ D
Sbjct: 157 KKKIWSVDT 165
>gnl|CDD|129694 TIGR00606, rad50, rad50. All proteins in this family for which
functions are known are involvedin recombination,
recombinational repair, and/or non-homologous end
joining.They are components of an exonuclease complex
with MRE11 homologs. This family is distantly related to
the SbcC family of bacterial proteins.This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University).
Length = 1311
Score = 35.0 bits (80), Expect = 0.19
Identities = 29/144 (20%), Positives = 64/144 (44%), Gaps = 12/144 (8%)
Query: 474 ETDAELKRAQSDHQSMV-------EKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELE 526
+T+AEL+ SD QS + + E KK + +L A +Q ++D +E+
Sbjct: 688 QTEAELQEFISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAPGRQSIIDLKEKEIP 747
Query: 527 HRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQR 586
+ K +++ ++IQ + D+EE+ + L + ++ KV + E + +
Sbjct: 748 ELRNKLQKVNRDIQR---LKNDIEEQETLL--GTIMPEEESAKVCLTDVTIMERFQMELK 802
Query: 587 ENRKEIEGFLDNVRQITKEYSYQQ 610
+ ++I ++ + + QQ
Sbjct: 803 DVERKIAQQAAKLQGSDLDRTVQQ 826
Score = 30.0 bits (67), Expect = 5.1
Identities = 36/163 (22%), Positives = 64/163 (39%), Gaps = 4/163 (2%)
Query: 428 QKEISELKKKLEEASVPPPREEPT-PNPEPELDETEPCVEPSEVIEDETDAELKRAQSDH 486
+KEI EL+ KL++ + R + E L P E ++V TD +
Sbjct: 743 EKEIPELRNKLQKVNRDIQRLKNDIEEQETLLGTIMPEEESAKVCL--TDVTIMERFQME 800
Query: 487 QSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAER 546
VE+ Q L G + ++ Q+ + EL+ K E +K IQ++ +
Sbjct: 801 LKDVERKIAQQAAKLQGSDLDRTVQQVNQEKQEKQ-HELDTVVSKIELNRKLIQDQQEQI 859
Query: 547 IDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENR 589
++ K + L+ E + L++ E +VQ R
Sbjct: 860 QHLKSKTNELKSEKLQIGTNLQRRQQFEEQLVELSTEVQSLIR 902
>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
(CALCOCO1) like. Proteins found in this family are
similar to the coiled-coil transcriptional coactivator
protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
This protein binds to a highly conserved N-terminal
domain of p160 coactivators, such as GRIP1, and thus
enhances transcriptional activation by a number of
nuclear receptors. CALCOCO1 has a central coiled-coil
region with three leucine zipper motifs, which is
required for its interaction with GRIP1 and may regulate
the autonomous transcriptional activation activity of
the C-terminal region.
Length = 546
Score = 34.6 bits (79), Expect = 0.19
Identities = 22/131 (16%), Positives = 53/131 (40%), Gaps = 4/131 (3%)
Query: 510 KAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKK 569
KA I Q L+ +E +K L+ ++ + + +E E++E L ++ K+
Sbjct: 137 KATILQNQLEGCQRERNDLMKKFLALEGDVSDMRSRIEQLETALRHSTEKTEELEEQHKE 196
Query: 570 VSSMLLAAKEERKDVQR---ENRKEIEGFLDNVRQITKEYSYQQAIIQKFIPTKYLELMT 626
S + ER + EN++ I +++ +T++ +++ + EL
Sbjct: 197 AQSSSESMSAERNALLAQRAENQQRILELEQDIQTLTQKKQENDRVLEGTQDIEA-ELER 255
Query: 627 QNATWNEDIGE 637
+ + +
Sbjct: 256 MKGELKQRLKK 266
Score = 34.2 bits (78), Expect = 0.25
Identities = 19/129 (14%), Positives = 61/129 (47%), Gaps = 9/129 (6%)
Query: 491 EKLELLQKKVLVGGENLLEKAEIQQ--QLLDAAAQELEHRKQKEEELKKNIQEKDAER-- 546
E+ +L++K + + G+ ++ I+Q L + ++ E +++ +E + + + AER
Sbjct: 151 ERNDLMKKFLALEGDVSDMRSRIEQLETALRHSTEKTEELEEQHKEAQSSSESMSAERNA 210
Query: 547 -----IDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQ 601
+ +++ L+++ + L++K ++ +L ++ +++R + + Q
Sbjct: 211 LLAQRAENQQRILELEQDIQTLTQKKQENDRVLEGTQDIEAELERMKGELKQRLKKMTIQ 270
Query: 602 ITKEYSYQQ 610
E + +
Sbjct: 271 RRDEETERI 279
>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
[Function unknown].
Length = 294
Score = 34.3 bits (79), Expect = 0.19
Identities = 36/191 (18%), Positives = 86/191 (45%), Gaps = 22/191 (11%)
Query: 402 YASRVKKICNKARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDET 461
Y +K N+ + +L ++EI L+KK + + + P E ++ E
Sbjct: 92 YRELKEKR-NEFNLGGRSIKSL----EREIERLEKKQQTSVLTPEEEREL---VQKIKEL 143
Query: 462 EPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKA-EIQQQLLDA 520
+E ++ +E + +LK +++ + +K + +K+ + L +A E ++++
Sbjct: 144 RKELEDAKKALEE-NEKLKELKAEIDELKKKAREIHEKI----QELANEAQEYHEEMIKL 198
Query: 521 AAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEE 580
++ +EL+K E E +++ +K L EE L +L+++ + A + +
Sbjct: 199 --------FEEADELRKEADELHEEFVELSKKIDELHEEFRNLQNELRELEKKIKALRAK 250
Query: 581 RKDVQRENRKE 591
K +R ++E
Sbjct: 251 EKAAKRREKRE 261
>gnl|CDD|233065 TIGR00634, recN, DNA repair protein RecN. All proteins in this
family for which functions are known are ATP binding
proteins involved in the initiation of recombination and
recombinational repair [DNA metabolism, DNA replication,
recombination, and repair].
Length = 563
Score = 34.3 bits (79), Expect = 0.21
Identities = 32/114 (28%), Positives = 50/114 (43%), Gaps = 11/114 (9%)
Query: 482 AQSDHQSMVEKLELLQKKVLVGGEN-LLEKAEIQQQLLDAAAQELEHRKQKEEE------ 534
Q D Q + E Q G N ++ Q A Q+L+ R+QKE+E
Sbjct: 133 GQHDQQLLFRPDEQRQLLDTFAGANEKVKAYRELYQAWLKARQQLKDRQQKEQELAQRLD 192
Query: 535 -LKKNIQEKDAERIDMEEKYSSLQEESEGLSK--KLKKVSSMLLAAKEERKDVQ 585
L+ ++E + + E +L+ E + LS KL+++S LAA DVQ
Sbjct: 193 FLQFQLEELEEADLQPGE-DEALEAEQQRLSNLEKLRELSQNALAALRGDVDVQ 245
>gnl|CDD|225381 COG2825, HlpA, Outer membrane protein [Cell envelope biogenesis,
outer membrane].
Length = 170
Score = 33.1 bits (76), Expect = 0.23
Identities = 18/99 (18%), Positives = 45/99 (45%), Gaps = 17/99 (17%)
Query: 520 AAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEE-----------SEGLSKKLK 568
+ +LE +K ++ + +Q+ +++ K + LQ++ +E KK K
Sbjct: 44 KVSADLESEFKKRQKELQKMQK------ELKAKEAKLQDDGKMEALSDRAKAEAEIKKEK 97
Query: 569 KVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKEYS 607
V++ +E KD+ R +E + L+ +++ + +
Sbjct: 98 LVNAFNKKQQEYEKDLNRREAEEEQKLLEKIQRAIESVA 136
>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD. This
model describes the DndB protein encoded by an operon
associated with a sulfur-containing modification to DNA.
The operon is sporadically distributed in bacteria, much
like some restriction enzyme operons. DndD is described
as a putative ATPase. The small number of examples known
so far include species from among the Firmicutes,
Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
metabolism, Restriction/modification].
Length = 650
Score = 34.3 bits (79), Expect = 0.28
Identities = 21/98 (21%), Positives = 37/98 (37%), Gaps = 14/98 (14%)
Query: 518 LDAAAQELEHRKQKE--------EELKKNIQE--KDAERIDMEEKYSSLQEESEGLSKKL 567
L+ Q+++ Q EL++ + E K I EE+ + L EE +L
Sbjct: 378 LEVLIQQVKRELQDAKSQLLKELRELEEELAEVDKKISTIPSEEQIAQLLEELGEAQNEL 437
Query: 568 KKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKE 605
+ A EE ++ IE + + TK+
Sbjct: 438 FRSE----AEIEELLRQLETLKEAIEALRKTLDEKTKQ 471
>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 321
Score = 33.8 bits (78), Expect = 0.29
Identities = 29/143 (20%), Positives = 52/143 (36%), Gaps = 25/143 (17%)
Query: 472 EDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEK-----AEIQQQLLDAAAQELE 526
E+ + +L S + +KL+ L++++ L + L+ A QEL
Sbjct: 163 EELLNKDLNLINSIKPKLRKKLQALKEEI-----ASLRQLADELNLCDPLELEKARQELR 217
Query: 527 HRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQR 586
K E +K ++E E ++ +L + S LL E + ++
Sbjct: 218 SLSVKISEKRKQLEELQQELQELTIAIEALTNK-----------KSELLEEIAEAEKIRE 266
Query: 587 ENR----KEIEGFLDNVRQITKE 605
E R KEI V + K
Sbjct: 267 ECRGWSAKEISKLKAKVSLLQKL 289
Score = 30.0 bits (68), Expect = 4.1
Identities = 21/99 (21%), Positives = 47/99 (47%), Gaps = 9/99 (9%)
Query: 514 QQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKV--- 570
+ L + + LE K+ EE L K++ ++ + + +K +L+EE L ++L
Sbjct: 145 LEGLKEGLEENLEGMKRDEELLNKDLNLINSIKPKLRKKLQALKEEIASL-RQLADELNL 203
Query: 571 --SSMLLAAKEERKDVQRE---NRKEIEGFLDNVRQITK 604
L A++E + + + RK++E ++++T
Sbjct: 204 CDPLELEKARQELRSLSVKISEKRKQLEELQQELQELTI 242
>gnl|CDD|227606 COG5281, COG5281, Phage-related minor tail protein [Function
unknown].
Length = 833
Score = 33.8 bits (77), Expect = 0.32
Identities = 31/179 (17%), Positives = 58/179 (32%), Gaps = 28/179 (15%)
Query: 413 ARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIE 472
A+ D YQ++I + + + + R + T
Sbjct: 448 AKAKILQADKAAKAYQEDILQREAQSRGKTAAAERSQEQ--------MTA---------A 490
Query: 473 DETDAELKRAQSDHQSMVEKLELLQKKVL---VGGENLLEKAEIQQQLLDAAAQELEHRK 529
+ ++ +D EK +K L + L K E +Q+ L LEH K
Sbjct: 491 LKALLAFQQQIADLSGAKEKAS--DQKSLLWKAEEQYALLKEEAKQRQLQEQKALLEH-K 547
Query: 530 QKEEELKKNIQEKDAERIDMEEK-----YSSLQEESEGLSKKLKKVSSMLLAAKEERKD 583
++ E + E ++ D E S + S+ + L + ++ L A E
Sbjct: 548 KETLEYTSQLAELLDQQADRFELSAQAAGSQKERGSDLYREALAQNAAALNKALNELAA 606
>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC. All proteins in this family for
which functions are known are part of an exonuclease
complex with sbcD homologs. This complex is involved in
the initiation of recombination to regulate the levels
of palindromic sequences in DNA. This family is based on
the phylogenomic analysis of JA Eisen (1999, Ph.D.
Thesis, Stanford University) [DNA metabolism, DNA
replication, recombination, and repair].
Length = 1042
Score = 34.2 bits (78), Expect = 0.33
Identities = 33/211 (15%), Positives = 79/211 (37%), Gaps = 7/211 (3%)
Query: 426 KYQKEISELKKKLEEASVPPPREEPTPNPEPELD-ETEPCVEPSEVIEDETDAELKRAQS 484
K Q+ + +L+ ++EE E T + P + + + + + +R +
Sbjct: 257 KKQQLLKQLRARIEELRAQEAVLEET-QERINRARKAAPLAAHIKAVT-QIEQQAQRIHT 314
Query: 485 DHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDA 544
+ QS + L K + E ++ L +QE+ R E +I+E
Sbjct: 315 ELQSKMRSRAKLLMKRAAHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVA--TSIREISC 372
Query: 545 ERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEE--RKDVQRENRKEIEGFLDNVRQI 602
++ + + +LQ++ L++KL+ + L + E D + ++++G L + ++
Sbjct: 373 QQHTLTQHIHTLQQQKTTLTQKLQSLCKELDILQREQATIDTRTSAFRDLQGQLAHAKKQ 432
Query: 603 TKEYSYQQAIIQKFIPTKYLELMTQNATWNE 633
+ + I + E
Sbjct: 433 QELQQRYAELCAAAITCTAQCEKLEKIHLQE 463
Score = 32.6 bits (74), Expect = 0.76
Identities = 23/166 (13%), Positives = 48/166 (28%), Gaps = 27/166 (16%)
Query: 467 PSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELE 526
+ ++D T+ + L LQ + +L + QQ A +L
Sbjct: 592 ITVRLQDLTEKLSEAEDMLACEQHALLRKLQPEQ-----DLQDVRLHLQQCSQELALKLT 646
Query: 527 HRKQKEEELKKNIQEKDAERI----------------DMEEKYSSLQEESEGLSKKLKKV 570
+ L + + A I M+ + L E L++ +
Sbjct: 647 ALHALQLTLTQERVREHALSIRVLPKELLASRQLALQKMQSEKEQLTYWKEMLAQCQTLL 706
Query: 571 SSMLLAAKEERKDVQREN------RKEIEGFLDNVRQITKEYSYQQ 610
+ +E ++ ++ D + Q KE +Q
Sbjct: 707 RELETHIEEYDREFNEIENASSSLGSDLAAREDALNQSLKELMHQA 752
Score = 31.5 bits (71), Expect = 2.0
Identities = 25/131 (19%), Positives = 53/131 (40%), Gaps = 3/131 (2%)
Query: 485 DHQSMVEKLELLQKKVLVGGENLL-EKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKD 543
D + + +E +KK L G LL ++++ RKQ E+ K+++E
Sbjct: 176 DQYTQLALMEFAKKKSLHGKAELLTLRSQLLTLCTPCMPDTYHERKQVLEKELKHLREAL 235
Query: 544 AERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQIT 603
+ + +E E KK + + + A+ E Q +E + ++ R+
Sbjct: 236 QQTQQSHAYLTQKREAQEEQLKKQQLLKQLR--ARIEELRAQEAVLEETQERINRARKAA 293
Query: 604 KEYSYQQAIIQ 614
++ +A+ Q
Sbjct: 294 PLAAHIKAVTQ 304
>gnl|CDD|226883 COG4477, EzrA, Negative regulator of septation ring formation [Cell
division and chromosome partitioning].
Length = 570
Score = 33.9 bits (78), Expect = 0.33
Identities = 41/204 (20%), Positives = 82/204 (40%), Gaps = 10/204 (4%)
Query: 425 IKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEP-SEVIEDETDAELKRAQ 483
+ + LK++L E S + E E EL + +E +++E E +A+ +
Sbjct: 251 VNIDSRLERLKEQLVENSELLTQLE-LDEAEEELGLIQEKIESLYDLLEREVEAK-NVVE 308
Query: 484 SDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELK---KNIQ 540
+ + + LE ++ E+L E+ E ++ A EL ++ E+ELK +
Sbjct: 309 ENLPILPDYLEKAKEN----NEHLKEEIERVKESYRLAETELGSVRKFEKELKELESVLD 364
Query: 541 EKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVR 600
E E YS LQ+ E + K L + +E ++++ + E
Sbjct: 365 EILENIEAQEVAYSELQDNLEEIEKALTDIEDEQEKVQEHLTSLRKDELEARENLERLKS 424
Query: 601 QITKEYSYQQAIIQKFIPTKYLEL 624
++ + Y + +P +L L
Sbjct: 425 KLHEIKRYMEKSNLPGLPETFLSL 448
>gnl|CDD|206243 pfam14073, Cep57_CLD, Centrosome localisation domain of Cep57. The
CLD or centrosome localisation domain of Cep57 is found
at the N-terminus, and lies approximately between
residues 58 and 239. This region lies within the first
alpha-helical coiled-coil segment of Cep57, and
localises to the centrosome internally to gamma-tubulin,
suggesting that it is either on both centrioles or on a
centromatrix component. This N-terminal region can also
multimerise with the N-terminus of other Cep57
molecules. The C-terminal part, Family Cep57_MT_bd,
pfam06657, is the microtubule-binding region of Cep57.
Length = 178
Score = 32.9 bits (75), Expect = 0.35
Identities = 24/74 (32%), Positives = 39/74 (52%), Gaps = 8/74 (10%)
Query: 472 EDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQK 531
+ E + Q D Q+ +EKLE+L+K+ L + Q L + Q+LE + Q+
Sbjct: 101 QVSLQREKEHDQMDVQAQLEKLEVLEKEYL--------RLTRTQSLAETKIQQLEEKLQE 152
Query: 532 EEELKKNIQEKDAE 545
EE +K +Q+K AE
Sbjct: 153 EEHQRKLVQDKAAE 166
>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
region. This family includes the N-terminal regions of
the junctin, junctate and aspartyl beta-hydroxylase
proteins. Junctate is an integral ER/SR membrane calcium
binding protein, which comes from an alternatively
spliced form of the same gene that generates aspartyl
beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
catalyzes the post-translational hydroxylation of
aspartic acid or asparagine residues contained within
epidermal growth factor (EGF) domains of proteins.
Length = 240
Score = 33.0 bits (75), Expect = 0.38
Identities = 34/169 (20%), Positives = 60/169 (35%), Gaps = 15/169 (8%)
Query: 433 ELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEK 492
LK+K PP E E + + + +E+E +L QS + +V
Sbjct: 66 GLKEKSTSEPTVPPEEAEPH---AEEEGQLAVRKTKQKVEEEVKEQL---QSLLEKIVVS 119
Query: 493 LELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAE---RIDM 549
+ ++ L E + + D + LE K EE E+ A + DM
Sbjct: 120 KQ--EEDGPGKEPQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASEQYKQDM 177
Query: 550 EEKYSSLQEESEGLSKK----LKKVSSMLLAAKEERKDVQRENRKEIEG 594
+EK S + E + K + + + +D E+ K I+
Sbjct: 178 KEKASEQENEDSKEPVEKAERTKAETDDVTEEDYDEEDNPVEDSKAIKE 226
Score = 32.6 bits (74), Expect = 0.58
Identities = 30/150 (20%), Positives = 56/150 (37%), Gaps = 16/150 (10%)
Query: 417 QDPKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELD--------------ETE 462
Q K ++E+ E + L E V +EE P EP+LD ET
Sbjct: 91 QLAVRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGPGKEPQLDEDKFLLAEDSDDRQETL 150
Query: 463 PCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAA 522
+ E ED E ++ Q M EK +++ E + + + + D
Sbjct: 151 EAGKVHEETEDSYHVEETASEQYKQDMKEKAS--EQENEDSKEPVEKAERTKAETDDVTE 208
Query: 523 QELEHRKQKEEELKKNIQEKDAERIDMEEK 552
++ + E+ K +E E ++ +++
Sbjct: 209 EDYDEEDNPVEDSKAIKEELAKEPVEEQQE 238
>gnl|CDD|219934 pfam08614, ATG16, Autophagy protein 16 (ATG16). Autophagy is a
ubiquitous intracellular degradation system for
eukaryotic cells. During autophagy, cytoplasmic
components are enclosed in autophagosomes and delivered
to lysosomes/vacuoles. ATG16 (also known as Apg16) has
been shown to be bind to Apg5 and is required for the
function of the Apg12p-Apg5p conjugate in the yeast
autophagy pathway.
Length = 194
Score = 32.9 bits (75), Expect = 0.40
Identities = 31/128 (24%), Positives = 65/128 (50%), Gaps = 11/128 (8%)
Query: 481 RAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQ 540
+ + M +KL L++++ E ++ E+ Q+LL ELE +++ ++L+K I
Sbjct: 64 GSDAAIAEMEQKLAKLREEL---TELHKKRGELAQRLLLLN-DELEQLRREIQQLEKTIA 119
Query: 541 EKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLA------AKEERK-DVQRENRKEIE 593
E +E +E + L+EE + K + + L++ A EE+ +Q+EN++ +E
Sbjct: 120 ELRSEITSLETEIRDLREELQEKEKDNETLQDELISLNIELNALEEKLRKLQKENQELVE 179
Query: 594 GFLDNVRQ 601
++ Q
Sbjct: 180 RWMAKKGQ 187
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 33.6 bits (77), Expect = 0.40
Identities = 26/102 (25%), Positives = 54/102 (52%), Gaps = 7/102 (6%)
Query: 467 PSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELE 526
P +IE + + ++EKL L+K++ E+ LEK +Q+ L +ELE
Sbjct: 495 PHFIIEQAKT-FYGEFKEEINVLIEKLSALEKELEQKNEH-LEKLLKEQEKLK---KELE 549
Query: 527 HRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLK 568
++ +E ++N + + E+ + +E +L++E E + ++LK
Sbjct: 550 QEMEELKERERNKKLE-LEK-EAQEALKALKKEVESIIRELK 589
Score = 32.1 bits (73), Expect = 1.1
Identities = 25/87 (28%), Positives = 45/87 (51%), Gaps = 3/87 (3%)
Query: 509 EKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQE-ESEGLSKKL 567
K EI + +A E E +QK E L+K ++E++ + ++E++ L+E E +
Sbjct: 509 FKEEINVLIEKLSALEKEL-EQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELE 567
Query: 568 KKVSSMLLAAKEERKDVQRE-NRKEIE 593
K+ L A K+E + + RE K+I
Sbjct: 568 KEAQEALKALKKEVESIIRELKEKKIH 594
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding
protein (GBP) is induced by interferon-gamma during
macrophage induction. This family contains GBP1 and
GPB2, both GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 33.0 bits (76), Expect = 0.43
Identities = 23/107 (21%), Positives = 39/107 (36%), Gaps = 11/107 (10%)
Query: 499 KVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQE 558
+ L E +E + + +A + L E+ K+ Q +A+ +E L E
Sbjct: 197 QALTAKEKAIEAERAKAEAAEAEQELLR------EKQKEEEQMMEAQERSYQEHVKQLIE 250
Query: 559 ESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKE 605
+ E +KL +L K Q + EGF + KE
Sbjct: 251 KMEAEREKLLAEQERMLEHKL-----QEQEELLKEGFKTEAESLQKE 292
>gnl|CDD|235206 PRK04031, PRK04031, DNA primase; Provisional.
Length = 408
Score = 33.2 bits (77), Expect = 0.44
Identities = 27/120 (22%), Positives = 45/120 (37%), Gaps = 21/120 (17%)
Query: 419 PKDALLIKYQKEISELKKKLEEASV-PPPREEPTPNPEPELDETEPCVEPSEVIE----- 472
P + L + K+ + +K++E P P PE + P + V E
Sbjct: 265 PVEQYLEELGKKAQKAAEKVKEEEEKPEKEPAEQPEPEEKEPAPVPAEKEETVREHIKEL 324
Query: 473 ---------DETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQ 523
DE +K +V+KLE + KV ++ I Q+LLD A++
Sbjct: 325 KGTLEARLLDENWNVIKEVPVR--DLVDKLEEAEDKV----YAIVFDGIITQRLLDLASE 378
>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
ATPase domains [General function prediction only].
Length = 530
Score = 33.4 bits (77), Expect = 0.44
Identities = 16/108 (14%), Positives = 39/108 (36%), Gaps = 12/108 (11%)
Query: 514 QQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSM 573
Q+ ++QKE K+ E I + +S ++++ K+L+K+ +
Sbjct: 241 QKAERLRQEAAAYEKQQKE-------LAKEQEWIRRGKAAASKAKKAKSRIKRLEKLEAR 293
Query: 574 LLAAKEERKDVQRENRKEIEG-----FLDNVRQITKEYSYQQAIIQKF 616
L + + R G + ++K Y + +++
Sbjct: 294 LAEERPVEEGKPLAFRFPPPGKRLGKLVLEFENVSKGYDGGRLLLKDL 341
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 32.9 bits (75), Expect = 0.48
Identities = 30/152 (19%), Positives = 62/152 (40%), Gaps = 23/152 (15%)
Query: 464 CVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQ 523
C E ++++ + D+EL+ A+ + + + L L+ + + LD+
Sbjct: 23 CEECADILLERLDSELRDAEKERDTYKQYLSKLESQ---------NVEISNYEALDS--- 70
Query: 524 ELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKD 583
EL+ K++EE L ++E + E D++ + LQEE E L + +
Sbjct: 71 ELDELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLE-----------NEELQYLR 119
Query: 584 VQRENRKEIEGFLDNVRQITKEYSYQQAIIQK 615
+ DN++ + +Y Y + K
Sbjct: 120 EYNLFDRNNLQLEDNLQSLELQYEYSLNQLDK 151
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 33.4 bits (76), Expect = 0.51
Identities = 23/125 (18%), Positives = 41/125 (32%), Gaps = 4/125 (3%)
Query: 445 PPREEPTPNPEPELDETEPCVEPSEVIEDE----TDAELKRAQSDHQSMVEKLELLQKKV 500
P E N E E E E E+E AE K Q + K + +
Sbjct: 651 RPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGET 710
Query: 501 LVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEES 560
+ E + + + E ++ E+E + + K + + K + + E+
Sbjct: 711 EAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGET 770
Query: 561 EGLSK 565
E K
Sbjct: 771 EAEGK 775
>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 877
Score = 33.4 bits (77), Expect = 0.53
Identities = 33/165 (20%), Positives = 64/165 (38%), Gaps = 28/165 (16%)
Query: 452 PNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQ-SMVEKLELLQKKVLVGGENLLEK 510
P + EL + E E + E + ++ +++ S L++ VLVG E L ++
Sbjct: 709 PEVDEELIDEEAEKEFELLKE--IISAIRNLRAEMNLSPSAPLKV----VLVGSEELEDR 762
Query: 511 AEIQQQLLD--AAAQELEHRKQKEEELKKNIQE---------KDAERIDMEEKYSSLQEE 559
E + + A +ELE EE + A ID+ + + L++E
Sbjct: 763 LEANEDDIKGLANLEELEILSPDPEEPPVEVTAVVGGAELFLPLAGLIDLAAELARLEKE 822
Query: 560 SEGLSKKLKKVSSMLLAAK----------EERKDVQRENRKEIEG 594
E L K++ ++ L E+ K+ E + ++
Sbjct: 823 LEKLEKEIDRIEKKLSNEGFVAKAPEEVVEKEKEKLAEYQVKLAK 867
>gnl|CDD|225803 COG3264, COG3264, Small-conductance mechanosensitive channel [Cell
envelope biogenesis, outer membrane].
Length = 835
Score = 33.1 bits (76), Expect = 0.56
Identities = 33/182 (18%), Positives = 69/182 (37%), Gaps = 18/182 (9%)
Query: 443 VPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQ-------SDHQSMVE-KLE 494
V R+E E + ++ I D+ L + SD + + L+
Sbjct: 38 VNSKRQEEAEPAAEEAELQAELIQQELAINDQLSQALNQQTERLNALASDDRQLANLLLQ 97
Query: 495 LLQ-------KKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERI 547
LLQ + ++ G LL + +QQ A + + + + +E EK A
Sbjct: 98 LLQSSRTIREQIAVLRGSLLLSRILLQQLGPLPEAGQPQEQFEVTQERDALQAEK-AYIN 156
Query: 548 DMEEKYSSLQEESEGLSKKLKKVSSMLLAAK-EERKDVQRE-NRKEIEGFLDNVRQITKE 605
+E + L E + ++ LL + +R+ + + N++++ D +R + +
Sbjct: 157 ALEGQAEQLTAEVRDILDQILDTRRELLNSLLSQREAISLQLNQQQLSAASDELRSLLHQ 216
Query: 606 YS 607
S
Sbjct: 217 QS 218
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 33.2 bits (75), Expect = 0.57
Identities = 25/127 (19%), Positives = 51/127 (40%), Gaps = 15/127 (11%)
Query: 469 EVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHR 528
E+I+D+ + D ++ + + V EN E+ + E E
Sbjct: 594 ELIQDDEKGNFED-LEDEENSSDNEMEESRGSSVTAENEESADEVDYE------TEREEN 646
Query: 529 KQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQREN 588
+K+EEL+ N + ++ ++ EE + ++LK + E + + E+
Sbjct: 647 ARKKEELRGNFELEERGD-PEKKDVDWYTEEKRKIEEQLK-------INRSEFETMVPES 698
Query: 589 RKEIEGF 595
R IEG+
Sbjct: 699 RVVIEGY 705
>gnl|CDD|222812 PHA00727, PHA00727, hypothetical protein.
Length = 278
Score = 32.5 bits (74), Expect = 0.61
Identities = 25/72 (34%), Positives = 34/72 (47%), Gaps = 15/72 (20%)
Query: 522 AQELEHRKQKEEELKKNIQE--------KDAERIDMEEKYSSLQEESEGLSKKLKKVSSM 573
AQ LE KQK EE +K I + K E+ + E K ++ LSKK KK
Sbjct: 17 AQSLEELKQKYEEAQKQIADGKTLKRLYKVYEKREFELKKQQFEQLKAELSKKKKKF--- 73
Query: 574 LLAAKEERKDVQ 585
K+E+ DV+
Sbjct: 74 ----KKEKVDVR 81
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 32.2 bits (74), Expect = 0.64
Identities = 15/80 (18%), Positives = 39/80 (48%), Gaps = 10/80 (12%)
Query: 527 HRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQR 586
+Q + EL++ I++ + E+ ++E++ + L+ + E + K +EER+ ++
Sbjct: 117 QAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEK----------REEEERQIEEK 166
Query: 587 ENRKEIEGFLDNVRQITKEY 606
+ EI +Q+ +
Sbjct: 167 RHADEIAFLKKQNQQLKSQL 186
Score = 30.3 bits (69), Expect = 2.2
Identities = 16/69 (23%), Positives = 40/69 (57%), Gaps = 7/69 (10%)
Query: 509 EKAEIQQQL--LDAAAQELEHR----KQKEEELKKNIQE-KDAERIDMEEKYSSLQEESE 561
K+E++Q++ L+ +ELE R + K E ++K +E + E ++ + L+++++
Sbjct: 121 GKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADEIAFLKKQNQ 180
Query: 562 GLSKKLKKV 570
L +L+++
Sbjct: 181 QLKSQLEQI 189
>gnl|CDD|221776 pfam12795, MscS_porin, Mechanosensitive ion channel porin domain.
The small mechanosensitive channel, MscS, is a part of
the turgor-driven solute efflux system that protects
bacteria from lysis in the event of osmotic shock. The
MscS protein alone is sufficient to form a functional
mechanosensitive channel gated directly by tension in
the lipid bilayer. The MscS proteins are heptamers of
three transmembrane subunits with seven converging M3
domains, and this MscS_porin is towards the N-terminal
of the molecules. The high concentration of negative
charges at the extracellular entrance of the pore helps
select the cations for efflux.
Length = 239
Score = 32.2 bits (74), Expect = 0.65
Identities = 19/125 (15%), Positives = 52/125 (41%), Gaps = 5/125 (4%)
Query: 477 AELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELK 536
+L + E+ L+Q L N L++ + +Q D ++++ ++ EL+
Sbjct: 1 NQLDQLNKRKLPTPEQKALIQ--DLEQALNFLDEIDKSKQKADQYQKQIDDAPKELRELR 58
Query: 537 KNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFL 596
+ ++ + + ++L L ++L + S L +E+ + + E++
Sbjct: 59 QELEALKKTDAPVFPELANLSLSQ--LEQRLAQTLSQLQELQEQLQQENSQLI-ELQTRP 115
Query: 597 DNVRQ 601
+ +Q
Sbjct: 116 ERAQQ 120
>gnl|CDD|200948 pfam00038, Filament, Intermediate filament protein.
Length = 312
Score = 32.6 bits (75), Expect = 0.67
Identities = 53/206 (25%), Positives = 87/206 (42%), Gaps = 51/206 (24%)
Query: 426 KYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEP-SEVIEDE----TDAE-- 478
Y++EI EL+K+L+E + R + E+D E E EDE AE
Sbjct: 51 LYEQEIRELRKQLDELTNERARL------QLEIDNLRLAAEDFREKYEDELNLRQSAEND 104
Query: 479 ---LKR-------AQSDHQSMVEKL-------------ELLQKKVLVGGENLLEKAEIQQ 515
L++ A+ D + +E L E+ + + + G+ +E ++
Sbjct: 105 IVGLRKDLDEATLARVDLEMKIESLQEELAFLKKNHEEEVRELQSQIQGQVNVEMDAARK 164
Query: 516 QLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLL 575
L A E+ R Q EE +KN QE EE Y S + E L + + L
Sbjct: 165 LDLTKALAEI--RAQYEELAEKNRQE-------AEEWYKS---KLEELQQAAARNGDALR 212
Query: 576 AAKEERKDVQRE-NRKEIEGFLDNVR 600
+AKEE +++R+ EIE L +++
Sbjct: 213 SAKEEITELRRQIQSLEIE--LQSLK 236
>gnl|CDD|237868 PRK14960, PRK14960, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 702
Score = 32.7 bits (74), Expect = 0.71
Identities = 46/205 (22%), Positives = 76/205 (37%), Gaps = 34/205 (16%)
Query: 440 EASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKK 499
P P EP P PEPE E EP EP E + + +L +H ++
Sbjct: 410 VEPEPEPEPEPEPEPEPE-PEPEPEPEPEPEPEPQPNQDLMVFDPNHHELIG-------- 460
Query: 500 VLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEE 559
LE A +Q+ + + E + E++L + E ++I+ E ++ E
Sbjct: 461 --------LESAVVQETV---SVLEEDFIPVPEQKLVQVQAETQVKQIEPEPASTA---E 506
Query: 560 SEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKEYSYQQAIIQKFIPT 619
GL + S LA D+ E E + + E A++++ T
Sbjct: 507 PIGLFEASSAEFS--LAQDTSAYDLVSEPVIEQQSLVQ------AEIVETVAVVKEPNAT 558
Query: 620 KYLELMTQNA---TWNEDIGEWELK 641
+LM Q+ GEW L+
Sbjct: 559 DNSQLMPQDILKLPSQTLEGEWTLE 583
>gnl|CDD|204784 pfam11932, DUF3450, Protein of unknown function (DUF3450). This
family of proteins are functionally uncharacterized.
This protein is found in bacteria and eukaryotes.
Proteins in this family are about 260 amino acids in
length.
Length = 250
Score = 32.2 bits (74), Expect = 0.75
Identities = 14/90 (15%), Positives = 36/90 (40%), Gaps = 12/90 (13%)
Query: 516 QLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLL 575
Q L AA Q + Q +K I + + ++ + L +E E L ++ ++
Sbjct: 26 QALAAAQQTVAAAAQ----SQKKIDKWADQTQELLAEIRQLLKEIENLRVYNDQLQRLVA 81
Query: 576 AAKEERKDVQRENRKEIEGFLDNVRQITKE 605
++E +Q++ ++ + + +
Sbjct: 82 NQQQEIASLQQQ--------IEQIEKTRQG 103
>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
cryoelectron microscopy indicates that VP5 is a trimer
implying that there are 360 copies of VP5 per virion.
Length = 507
Score = 32.7 bits (75), Expect = 0.75
Identities = 33/105 (31%), Positives = 52/105 (49%), Gaps = 13/105 (12%)
Query: 515 QQLLDAAAQELEHRKQKEEELK----KNIQEKDAERID-----MEEKYSSLQEESEGLSK 565
+Q L +ELE R+QKEEE++ K I EK E ++ M+ + +EE + +
Sbjct: 86 EQGLQRKLKELE-REQKEEEVREKHNKKIIEKFGEDLEEVYKFMKGEAKVEEEEEKQMEI 144
Query: 566 KLKKVSSMLLAAKEERKDVQRENR---KEIEGFLDNVRQITKEYS 607
K + S L KEE K +QR + KE E + ++ +EY
Sbjct: 145 LEKALKSYLKIVKEENKSLQRLAKALQKESEERTQDETKMIEEYR 189
>gnl|CDD|205692 pfam13514, AAA_27, AAA domain. This domain is found in a number of
double-strand DNA break proteins. This domain contains a
P-loop motif.
Length = 1118
Score = 32.8 bits (75), Expect = 0.77
Identities = 27/116 (23%), Positives = 50/116 (43%), Gaps = 3/116 (2%)
Query: 503 GGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEG 562
GGE +LE Q L +AA + +EL+K E R E +L+E E
Sbjct: 103 GGEEILEAQGELGQYLFSAAAGVGSLGSVRDELEKEADELWKPRGRKPEINVALKELKE- 161
Query: 563 LSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKEYSYQQAIIQKFIP 618
L ++++V + ++ E +E+ +RQ+ KE + +++ +P
Sbjct: 162 LEAEIREV-QLKTRTWKDLVKALDEAEEELANLRKELRQLEKEKQ-RLERLRRLLP 215
Score = 29.3 bits (66), Expect = 8.0
Identities = 21/136 (15%), Positives = 49/136 (36%), Gaps = 15/136 (11%)
Query: 473 DETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKE 532
DE + +RA + + ++ + + + L +A + + A A + Q
Sbjct: 686 DERLRDAERALEEAEERHDEAQSALEAWQEQWYDALLEAGLGGRASPAGALDALELLQNI 745
Query: 533 EELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKL-------------KKVSSMLLAAKE 579
+E + + ME + +EE E L++ + + + L A++
Sbjct: 746 KEKLQAADDLRQRIAAMERDLARFEEEVEALAEAVAPEMLGTPADETARALKQRLKRARD 805
Query: 580 ERKDVQR--ENRKEIE 593
++ E +E E
Sbjct: 806 TAAAAEKLAEEIEEAE 821
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 32.2 bits (73), Expect = 0.77
Identities = 26/113 (23%), Positives = 49/113 (43%), Gaps = 4/113 (3%)
Query: 469 EVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHR 528
E++ E L RA V + L++ E L E + +++LL ELE
Sbjct: 106 ELVRKELKNALVRAGLKTLQRVPEYMDLKEDYEELKEKLEELQKEKEELLK----ELEEL 161
Query: 529 KQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEER 581
+ + EE+++ ++ + E +EE L E L K+ ++ + +EE
Sbjct: 162 EAEYEEVQERLKRLEVENSRLEEMLKKLPGEVYDLKKRWDELEPGVELPEEEL 214
>gnl|CDD|131526 TIGR02473, flagell_FliJ, flagellar export protein FliJ. Members of
this family are the FliJ protein found, in nearly every
case, in the midst of other flagellar biosynthesis genes
in bacgterial genomes. Typically the fliJ gene is found
adjacent to the gene for the flagellum-specific ATPase
FliI. Sequence scoring in the gray zone between trusted
and noise cutoffs include both probable FliJ proteins
and components of bacterial type III secretion systems.
Length = 141
Score = 31.1 bits (71), Expect = 0.78
Identities = 19/91 (20%), Positives = 35/91 (38%), Gaps = 16/91 (17%)
Query: 472 EDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKA----------------EIQQ 515
E++ EL +AQ++ + + +L+ L K + LEK
Sbjct: 15 EEQAKLELAKAQAEFERLETQLQQLIKYREEYEQQALEKVGAGTSALELSNYQRFIRQLD 74
Query: 516 QLLDAAAQELEHRKQKEEELKKNIQEKDAER 546
Q + QEL +Q+ E ++ + E E
Sbjct: 75 QRIQQQQQELALLQQEVEAKRERLLEARREL 105
>gnl|CDD|150420 pfam09744, Jnk-SapK_ap_N, JNK_SAPK-associated protein-1. This is
the N-terminal 200 residues of a set of proteins
conserved from yeasts to humans. Most of the proteins in
this entry have an RhoGEF pfam00621 domain at their
C-terminal end.
Length = 158
Score = 31.2 bits (71), Expect = 0.85
Identities = 25/106 (23%), Positives = 51/106 (48%), Gaps = 2/106 (1%)
Query: 505 ENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLS 564
E L + + ++ L +E E RKQ E++L + E + E+ ++++K L+E L
Sbjct: 51 ELELLREDNER-LSTQYEREKELRKQAEQKLLEFEDELEQEKKELQKKIEDLEENVRQLE 109
Query: 565 KKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKEYSYQQ 610
K K +S + +E ++++E E + + + EY +Q
Sbjct: 110 LKAKNLSDQVSRLEERETELKKEYNSLHERYT-KLLKNYVEYVERQ 154
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 32.3 bits (74), Expect = 0.94
Identities = 19/83 (22%), Positives = 36/83 (43%), Gaps = 15/83 (18%)
Query: 510 KAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEE-------KYSSLQE---- 558
+ E +++ + L K++EE+ KK +++ + +I E KYS+ E
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAEFFKRQEDKYSAFDETGLP 613
Query: 559 ----ESEGLSKKLKKVSSMLLAA 577
+ E +SKK +K S
Sbjct: 614 THDADGEEISKKERKKLSKEYDK 636
>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
domain. This domain is found in a number of different
types of plant proteins including NAM-like proteins.
Length = 147
Score = 31.2 bits (71), Expect = 0.96
Identities = 15/44 (34%), Positives = 26/44 (59%)
Query: 509 EKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEK 552
K ++++ L A +E E K+KEE K + E + ER ++E+K
Sbjct: 68 AKEKLRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEKK 111
>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein. This family consists
of several eukaryotic mitotic checkpoint (Mitotic arrest
deficient or MAD) proteins. The mitotic spindle
checkpoint monitors proper attachment of the bipolar
spindle to the kinetochores of aligned sister chromatids
and causes a cell cycle arrest in prometaphase when
failures occur. Multiple components of the mitotic
spindle checkpoint have been identified in yeast and
higher eukaryotes. In S.cerevisiae, the existence of a
Mad1-dependent complex containing Mad2, Mad3, Bub3 and
Cdc20 has been demonstrated.
Length = 722
Score = 32.2 bits (73), Expect = 1.00
Identities = 21/105 (20%), Positives = 41/105 (39%), Gaps = 3/105 (2%)
Query: 478 ELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKK 537
EL + + +H+ L++K EN + + +L E K+ E E +
Sbjct: 76 ELMQKELEHK---RAQIELERKASTLAENYERELDRNLELEVRLKALEELEKKAENEAAE 132
Query: 538 NIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERK 582
+E + ++ + LQ E E K+ K+ S + E +
Sbjct: 133 AEEEAKLLKDKLDAESLKLQNEKEDQLKEAKESISRIKNDLSEMQ 177
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 32.2 bits (73), Expect = 1.1
Identities = 20/68 (29%), Positives = 35/68 (51%), Gaps = 10/68 (14%)
Query: 523 QELEHRKQKEEELKKNIQEKDAERIDMEEK--YSSLQEESEGLSKKLKKVSSMLLAAKEE 580
+ELE +K+KEE+ K EK+ +++ +K + LQ + + K S K
Sbjct: 16 EELERKKKKEEKAK----EKELKKLKAAQKEAKAKLQAQQASDGTNVPKKS----EKKSR 67
Query: 581 RKDVQREN 588
++DV+ EN
Sbjct: 68 KRDVEDEN 75
>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein. Members of this
protein family belong to a conserved gene four-gene
neighborhood found sporadically in a phylogenetically
broad range of bacteria: Nocardia farcinica,
Symbiobacterium thermophilum, and Streptomyces
avermitilis (Actinobacteria), Geobacillus kaustophilus
(Firmicutes), Azoarcus sp. EbN1 and Ralstonia
solanacearum (Betaproteobacteria). Proteins in this
family average over 1400 amino acids in length
[Hypothetical proteins, Conserved].
Length = 1353
Score = 32.5 bits (74), Expect = 1.1
Identities = 21/92 (22%), Positives = 39/92 (42%), Gaps = 4/92 (4%)
Query: 472 EDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKA----EIQQQLLDAAAQELEH 527
E E DA + + + ++ +LE LQ LE+A E Q A Q +
Sbjct: 299 ERELDARTEALEREADALRTRLEALQGSPAYQDAEELERARADAEALQAAAADARQAIRE 358
Query: 528 RKQKEEELKKNIQEKDAERIDMEEKYSSLQEE 559
+ + EE ++ + E+ D E + + +E+
Sbjct: 359 AESRLEEERRRLDEEAGRLDDAERELRAAREQ 390
>gnl|CDD|226025 COG3494, COG3494, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 279
Score = 31.6 bits (72), Expect = 1.1
Identities = 17/87 (19%), Positives = 30/87 (34%), Gaps = 16/87 (18%)
Query: 120 RPIVDKVLSGFNGTIFAYGQTGT-GKTFTMEGVNNV---------PELKGIIPNSFA--- 166
R D L F + G+ G K EGV+ V P + + P+
Sbjct: 37 RGEADPELKEFEYKEVSIGEVGKLIKLLKTEGVDRVVLAGGVERRPNFRDLRPDKIGLAV 96
Query: 167 --HIFGHIAKADESVKFLVRVSYFEIY 191
I + + D+++ V + + E
Sbjct: 97 LPKIVEALIRGDDALLKAV-IDFIESR 122
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 32.0 bits (73), Expect = 1.2
Identities = 23/140 (16%), Positives = 60/140 (42%), Gaps = 6/140 (4%)
Query: 468 SEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEH 527
+E + + E R + + + + +K + E +E+ E + L ELE
Sbjct: 391 AEALSKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKR---ELEE 447
Query: 528 RKQKEEELKKNIQE-KDAERIDM--EEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDV 584
K++ E+L+ ++ + R + + + + E L K+L++ + + + ++
Sbjct: 448 LKREIEKLESELERFRREVRDKVRKDREIRARDRRIERLEKELEEKKKRVEELERKLAEL 507
Query: 585 QRENRKEIEGFLDNVRQITK 604
++ + E+ G V+ + K
Sbjct: 508 RKMRKLELSGKGTPVKVVEK 527
>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
HlyD family. Type I secretion is an ABC transport
process that exports proteins, without cleavage of any
signal sequence, from the cytosol to extracellular
medium across both inner and outer membranes. The
secretion signal is found in the C-terminus of the
transported protein. This model represents the adaptor
protein between the ATP-binding cassette (ABC) protein
of the inner membrane and the outer membrane protein,
and is called the membrane fusion protein. This model
selects a subfamily closely related to HlyD; it is
defined narrowly and excludes, for example, colicin V
secretion protein CvaA and multidrug efflux proteins
[Protein fate, Protein and peptide secretion and
trafficking].
Length = 423
Score = 31.9 bits (73), Expect = 1.2
Identities = 23/93 (24%), Positives = 45/93 (48%), Gaps = 3/93 (3%)
Query: 469 EVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQL--LDAAAQELE 526
EVI +E +A K + S +E LEL +++ GE +AE++ +D E +
Sbjct: 175 EVISEELEARRKLKEKGLVSRLELLELERERAEAQGELGRLEAELEVLKRQIDELQLERQ 234
Query: 527 HRKQK-EEELKKNIQEKDAERIDMEEKYSSLQE 558
+Q EE+ + + E A ++ E+ + ++
Sbjct: 235 QIEQTFREEVLEELTEAQARLAELRERLNKARD 267
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 32.2 bits (74), Expect = 1.2
Identities = 49/344 (14%), Positives = 113/344 (32%), Gaps = 80/344 (23%)
Query: 307 MGRLHLVDLAGSERQAKTGASGQRLKEASKINLSLSTLGNVISALVDGKCTHIPYRNSKL 366
+G +H+ DL + L + S+ L L + + L + L
Sbjct: 28 LGVVHIEDLKEELSNERLRKLRSLLTKLSEA---LDKLRSYLPKLNPLREEKKKVSVKSL 84
Query: 367 TRILQDSLGGNSKTVMCATVGPASYNYEETISTLRYASRVKKICNKARVNQDPKDALLIK 426
+++D EE + + +K++ + ++ L
Sbjct: 85 EELIKDV--------------------EEELEKI--EKEIKELEEEISELENEIKEL--- 119
Query: 427 YQKEISELKKKLEEASVPPPREEPTPN--------PEPELDETEPCVEPSEVIEDETDAE 478
++EI L+ + PE +L+E + + V TD
Sbjct: 120 -EQEIERLEP-WGNFDLDLSLLLGFKYVSVFVGTVPEDKLEELKLESDVENVEYISTDKG 177
Query: 479 --------LKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQ 530
LK + + ++KL + ++ G +L+ +ELE ++
Sbjct: 178 YVYVVVVVLKELSDEVEEELKKLGFERLELEEEG--------TPSELIREIKEELEEIEK 229
Query: 531 KEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRK 590
+ E L + ++E ++ L E L +L++ ++ K ++
Sbjct: 230 ERESLLEELKELA------KKYLEELLALYEYLEIELERAEALSKFLKTDKTFA------ 277
Query: 591 EIEGFL-----DNVRQITKEYSYQQAIIQK--------FIPTKY 621
IEG++ ++++ + + A ++ +PTK
Sbjct: 278 -IEGWVPEDRVKKLKELIDKATGGSAYVEFVEPDEEEEEVPTKL 320
>gnl|CDD|148617 pfam07111, HCR, Alpha helical coiled-coil rod protein (HCR). This
family consists of several mammalian alpha helical
coiled-coil rod HCR proteins. The function of HCR is
unknown but it has been implicated in psoriasis in
humans and is thought to affect keratinocyte
proliferation.
Length = 739
Score = 31.9 bits (72), Expect = 1.2
Identities = 28/110 (25%), Positives = 50/110 (45%), Gaps = 12/110 (10%)
Query: 479 LKRAQSDHQSMVEKL-------ELLQKKVLVGGENLLEKAEIQQQ-----LLDAAAQELE 526
L A+ ++ E L E ++K + GG+ LE+A+ Q L A + L
Sbjct: 106 LAVAEKAGRAEAEGLRAALAGAEEVRKNLEEGGQQELEEAQRLHQEQLSSLTQAHLEALS 165
Query: 527 HRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLA 576
+ K E L+K++Q + R + ++ Q E++ L ++L K L A
Sbjct: 166 SLRSKAEGLEKSLQSLETRRAGEAKALAAAQAEADTLREQLSKTQEELEA 215
>gnl|CDD|177776 PLN00181, PLN00181, protein SPA1-RELATED; Provisional.
Length = 793
Score = 32.0 bits (72), Expect = 1.2
Identities = 33/148 (22%), Positives = 64/148 (43%), Gaps = 18/148 (12%)
Query: 449 EPTPNPE-PELDETEPCVEPSEVIED-ETDAELKRAQSDHQSMVEKLELLQKKVLVGGEN 506
EP+ P EL ++E EP E +E+ E EL+ + + ++E L L+Q++ +
Sbjct: 252 EPSCRPSMSELLQSEFINEPRENLEEREAAMELRDRIEEQELLLEFLFLIQQRKQEAADK 311
Query: 507 L----------LEKAEIQQQLLDAAAQEL------EHRKQKEEELKKNIQEKDAERIDME 550
L +++ +Q +L ++ R ++ E +E D ++
Sbjct: 312 LQDTISLLSSDIDQVVKRQLVLQQKGSDVRSFLASRKRIRQGAETLAAEEENDDNSSKLD 371
Query: 551 EKYSSLQEESEGLSKKLKKVSSMLLAAK 578
+ S ES L + LKK+ S+ A +
Sbjct: 372 DTLESTLLESSRLMRNLKKLESVYFATR 399
>gnl|CDD|115071 pfam06390, NESP55, Neuroendocrine-specific golgi protein P55
(NESP55). This family consists of several mammalian
neuroendocrine-specific golgi protein P55 (NESP55)
sequences. NESP55 is a novel member of the chromogranin
family and is a soluble, acidic, heat-stable secretory
protein that is expressed exclusively in endocrine and
nervous tissues, although less widely than
chromogranins.
Length = 261
Score = 31.4 bits (70), Expect = 1.3
Identities = 32/137 (23%), Positives = 57/137 (41%), Gaps = 14/137 (10%)
Query: 426 KYQKEISELKKKLEEASVPPPREEPTPNPEPELD-ETEPCVEPSEVIEDETDAELKRAQS 484
+Y ++ + + E E EPE + +T P EP EDE + + +
Sbjct: 106 EYDEDDFDTETDSETEPESDIESETEFETEPETEPDTAPTTEPETEPEDEPGPVVPKGAT 165
Query: 485 DHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDA 544
HQS+ E+L L+ L+ A+ + + QE E ++ EE + + D
Sbjct: 166 FHQSLTERLHALK----------LQSADASPRRAPPSTQEPESAREGEEPERGPL---DK 212
Query: 545 ERIDMEEKYSSLQEESE 561
+ D EE+ +EE +
Sbjct: 213 DPRDPEEEEEEKEEEKQ 229
>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
Length = 1463
Score = 32.0 bits (73), Expect = 1.3
Identities = 34/212 (16%), Positives = 80/212 (37%), Gaps = 33/212 (15%)
Query: 420 KDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAEL 479
K+ + ++ + + +++L+E L + +++ E ++E+
Sbjct: 866 KETIYLQSAQRVELAERQLQE-----------------LKIDVKSISSLKLVNLELESEI 908
Query: 480 KRAQSDHQS-MVEKLELLQK-----KVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEE 533
+ S ++E LE + K L+ +L E I+ L K
Sbjct: 909 IELKKSLSSDLIENLEFKTELIARLKKLLNNIDLEEGPSIEYVKLP--------ELNKLH 960
Query: 534 ELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENR--KE 591
E++ ++E E D+ +K + L E + +LK L ++ +Q + KE
Sbjct: 961 EVESKLKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELSKQYGALQESTKQLKE 1020
Query: 592 IEGFLDNVRQITKEYSYQQAIIQKFIPTKYLE 623
+ + ++ +K S + + P + L+
Sbjct: 1021 LPVEVAELQSASKIISSESTELSILKPLQKLK 1052
Score = 30.0 bits (68), Expect = 5.5
Identities = 47/299 (15%), Positives = 103/299 (34%), Gaps = 58/299 (19%)
Query: 414 RVNQDPKDALLIKYQ---KEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEV 470
+ + L+ + + I+ LKK L + E + V+ E+
Sbjct: 910 ELKKSLSSDLIENLEFKTELIARLKKLLNNIDL-------------EEGPSIEYVKLPEL 956
Query: 471 IED-ETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLL----EKAEIQQQL--LDAAAQ 523
+ E +++LK +++ +++K +L ++ L E AE+ +Q L + +
Sbjct: 957 NKLHEVESKLKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELSKQYGALQESTK 1016
Query: 524 ELEHRKQKEEELKKNIQEKDAE-------------RIDMEEKYSSLQEESEGLS------ 564
+L+ + EL+ + +E + + + + LQ + L
Sbjct: 1017 QLKELPVEVAELQSASKIISSESTELSILKPLQKLKGLLLLENNQLQARYKALKLRRENS 1076
Query: 565 -------KKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKEYSYQQAIIQKFI 617
+L+ ++L + +V N + L + + + Q I KF+
Sbjct: 1077 LLDDKQLYQLESTENLLKTINVKDLEVTNRNLVKPANVLQFIVAQMIKLNLLQ-EISKFL 1135
Query: 618 PTK--YLELMTQNATWNEDIGEWELKCVAYTGNNMRKIVPLPEDTTATQIAQPDFSHLY 674
LE + Q + + EL + + N P P + + S LY
Sbjct: 1136 SQLVNTLEPVFQKLSVLQL----ELDGLFWEANLEALPSPPPFAALSEKR--LYQSALY 1188
>gnl|CDD|205039 pfam12718, Tropomyosin_1, Tropomyosin like. This family is a set
of eukaryotic tropomyosins. Within the yeast Tmp1 and
Tmp2, biochemical and sequence analyses indicate that
Tpm2p spans four actin monomers along a filament,
whereas Tpmlp spans five. Despite its shorter length,
Tpm2p can compete with Tpm1p for binding to F-actin.
Over-expression of Tpm2p in vivo alters the axial
budding of haploids to a bipolar pattern, and this can
be partially suppressed by co-over-expression of Tpm1p.
This suggests distinct functions for the two
tropomyosins, and indicates that the ratio between them
is important for correct morphogenesis. The family also
contains higher eukaryote Tmp3 members.
Length = 143
Score = 30.6 bits (70), Expect = 1.3
Identities = 31/141 (21%), Positives = 67/141 (47%), Gaps = 35/141 (24%)
Query: 430 EISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSM 489
EI+ L+KK ++ E E+++ E ++ ++ +E++ + ++ +++
Sbjct: 36 EITSLQKKNQQL-------------EEEVEKLEEQLKEAKEKLEESE----KLATNAEAL 78
Query: 490 VEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDM 549
+++LL E LE++E + L ++L +K EE ++ ++ + ER +
Sbjct: 79 TRRIQLL--------EEELEESE---KRLKETTEKLREADKKAEESERKVKALENERDEW 127
Query: 550 EEKYSSLQEESEGLSKKLKKV 570
EEKY E L KK K+
Sbjct: 128 EEKY-------EELEKKYKEA 141
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 32.1 bits (74), Expect = 1.3
Identities = 20/90 (22%), Positives = 38/90 (42%), Gaps = 7/90 (7%)
Query: 452 PNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKA 511
PN E + +E + EDE + E + S+ L +KVL E A
Sbjct: 176 PNAEEDPAHVGSELEELDDDEDEEEEE----DENDDSLAADESELPEKVL---EKFKALA 228
Query: 512 EIQQQLLDAAAQELEHRKQKEEELKKNIQE 541
+ ++L A +++E R + ++ K ++
Sbjct: 229 KQYKKLRKAQEKKVEGRLAQHKKYAKLREK 258
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 31.3 bits (71), Expect = 1.3
Identities = 21/84 (25%), Positives = 40/84 (47%), Gaps = 7/84 (8%)
Query: 523 QELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERK 582
++ + +EE K ++ + E I+ EE +E +E ++LK ++ L KEE K
Sbjct: 15 EDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDENNKL---KEENK 71
Query: 583 DVQRENRKEIEGFLDNVRQITKEY 606
++ E+E D + + EY
Sbjct: 72 KLE----NELEALKDRLLRTVAEY 91
>gnl|CDD|233787 TIGR02223, ftsN, cell division protein FtsN. FtsN is a poorly
conserved protein active in cell division in a number of
Proteobacteria. The N-terminal 30 residue region tends
to by Lys/Arg-rich, and is followed by a
membrane-spanning region. This is followed by an acidic
low-complexity region of variable length and a
well-conserved C-terminal domain of two tandem regions
matched by pfam05036 (Sporulation related repeat), found
in several cell division and sporulation proteins. The
role of FtsN as a suppressor for other cell division
mutations is poorly understood; it may involve cell wall
hydrolysis [Cellular processes, Cell division].
Length = 298
Score = 31.6 bits (71), Expect = 1.3
Identities = 26/170 (15%), Positives = 52/170 (30%), Gaps = 6/170 (3%)
Query: 428 QKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQ 487
Q E + + + P PE E +I D + +
Sbjct: 53 QANEPETLQPKNQTENGETAADLPPKPEERWSYIEELEAREVLINDPEEPSNGGGVEESA 112
Query: 488 SMVE----KLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKD 543
+ LE +Q + + L Q ++A Q E + QK + +
Sbjct: 113 QLTAEQRQLLEQMQADMRAAEKVLATAPSEQTVAVEARKQTAEKKPQKARTAEAQKTPVE 172
Query: 544 AERID--MEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKE 591
E+I ++E + + ++ + A K ++ D + KE
Sbjct: 173 TEKIASKVKEAKQKQKALPKQTAETQSNSKPIETAPKADKADKTKPKPKE 222
>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
Length = 333
Score = 31.5 bits (72), Expect = 1.3
Identities = 18/128 (14%), Positives = 34/128 (26%), Gaps = 10/128 (7%)
Query: 414 RVNQDPKDALLIKYQKEISE------LKKKLEEASVPPPREEP---TPNPEPELDETEPC 464
RVN P +A + + + V P E P P +P
Sbjct: 70 RVNHAPANAQEHEAARPSPQHQYQPPYASAQPRQPVQQPPEAQVPPQHAPRPAQPAPQP- 128
Query: 465 VEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQE 524
V+ + + + + + A Q + + A
Sbjct: 129 VQQPAYQPQPEQPLQQPVSPQVAPAPQPVHSAPQPAQQAFQPAEPVAAPQPEPVAEPAPV 188
Query: 525 LEHRKQKE 532
++ K+KE
Sbjct: 189 MDKPKRKE 196
>gnl|CDD|178770 PLN03231, PLN03231, putative alpha-galactosidase; Provisional.
Length = 357
Score = 31.5 bits (71), Expect = 1.3
Identities = 21/79 (26%), Positives = 32/79 (40%), Gaps = 12/79 (15%)
Query: 289 RTTGENWRLWAEAQGSVQMGRLHLVDLAGSERQAKTGASGQRLKEASKINLSLSTLGNVI 348
R TG++W W + R D A + A G + S ++L + G
Sbjct: 236 RVTGDDWDDWKYLVKHFDVAR----DFAAAGLIAIPSVVGGK----SWVDLDMLPFGR-- 285
Query: 349 SALVDGKCTHIPYRNSKLT 367
L D + PYRNS+L+
Sbjct: 286 --LTDPAAAYGPYRNSRLS 302
>gnl|CDD|153311 cd07627, BAR_Vps5p, The Bin/Amphiphysin/Rvs (BAR) domain of yeast
Sorting Nexin Vps5p. BAR domains are dimerization,
lipid binding and curvature sensing modules found in
many different proteins with diverse functions. Sorting
nexins (SNXs) are Phox homology (PX) domain containing
proteins that are involved in regulating membrane
traffic and protein sorting in the endosomal system.
SNXs differ from each other in their lipid-binding
specificity, subcellular localization and specific
function in the endocytic pathway. A subset of SNXs also
contain BAR domains. The PX-BAR structural unit
determines the specific membrane targeting of SNXs.
Vsp5p is the yeast counterpart of human SNX1 and is part
of the retromer complex, which functions in the
endosome-to-Golgi retrieval of vacuolar protein sorting
receptor Vps10p, the Golgi-resident membrane protein
A-ALP, and endopeptidase Kex2. BAR domains form dimers
that bind to membranes, induce membrane bending and
curvature, and may also be involved in protein-protein
interactions.
Length = 216
Score = 31.1 bits (71), Expect = 1.5
Identities = 23/97 (23%), Positives = 45/97 (46%), Gaps = 13/97 (13%)
Query: 516 QLLDAAAQELEHRKQKEEELKKN-------IQEKDAERIDMEEKYSSLQEESEGLSKKLK 568
Q +A EL +K + E+LK+ + +E + E + S L++E E +S+ +K
Sbjct: 115 QYWQSAESELSKKKAQLEKLKRQGKTQQEKLNSLLSELEEAERRASELKKEFEEVSELIK 174
Query: 569 KVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKE 605
+ ER + + R +E +L++ + KE
Sbjct: 175 SELERF---ERERVE---DFRNSVEIYLESAIESQKE 205
>gnl|CDD|224477 COG1561, COG1561, Uncharacterized stress-induced protein [Function
unknown].
Length = 290
Score = 31.4 bits (72), Expect = 1.6
Identities = 19/105 (18%), Positives = 41/105 (39%), Gaps = 21/105 (20%)
Query: 500 VLVGGENLLEKAEIQQQLLDAAAQELEH----RKQKEEELKKNIQEKDAERIDMEEKYSS 555
VL E + ++ +L A + L+ R+++ LK ++ +R+D
Sbjct: 116 VLAIEEEEEDLEALEALILAALEEALDDLIEMREREGAALKADLL----QRLD------- 164
Query: 556 LQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVR 600
+ + ++KV S++ E ++ E + LD R
Sbjct: 165 ------AIEELVEKVESLMPEILEWYRERLVARLNEAQDQLDEDR 203
>gnl|CDD|218661 pfam05622, HOOK, HOOK protein. This family consists of several
HOOK1, 2 and 3 proteins from different eukaryotic
organisms. The different members of the human gene
family are HOOK1, HOOK2 and HOOK3. Different domains
have been identified in the three human HOOK proteins,
and it was demonstrated that the highly conserved
NH2-domain mediates attachment to microtubules, whereas
the central coiled-coil motif mediates homodimerisation
and the more divergent C-terminal domains are involved
in binding to specific organelles (organelle-binding
domains). It has been demonstrated that endogenous HOOK3
binds to Golgi membranes, whereas both HOOK1 and HOOK2
are localised to discrete but unidentified cellular
structures. In mice the Hook1 gene is predominantly
expressed in the testis. Hook1 function is necessary for
the correct positioning of microtubular structures
within the haploid germ cell. Disruption of Hook1
function in mice causes abnormal sperm head shape and
fragile attachment of the flagellum to the sperm head.
Length = 713
Score = 31.7 bits (72), Expect = 1.6
Identities = 21/80 (26%), Positives = 41/80 (51%), Gaps = 4/80 (5%)
Query: 478 ELKRAQSDHQSMVEKLELLQKKVLVGGENLLE----KAEIQQQLLDAAAQELEHRKQKEE 533
+L++A + S E LL+ K+ E L E + ++Q+ + + ++ +K
Sbjct: 520 DLQKALQEQGSKSEDSSLLKSKLEEHLEQLHEANEELQKKREQIEELEPDQDQNLSRKIA 579
Query: 534 ELKKNIQEKDAERIDMEEKY 553
EL+ +Q+KD + MEE+Y
Sbjct: 580 ELEAALQKKDEDMRAMEERY 599
Score = 29.8 bits (67), Expect = 6.3
Identities = 20/86 (23%), Positives = 42/86 (48%)
Query: 505 ENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLS 564
+ ++ E Q ++ A QEL ++Q +++ D + E QEE + L+
Sbjct: 133 QQIMTLEESVQHVVMTAIQELMSKEQGSSPSRESAGNLDQQLKKALEDLKEAQEEKDELA 192
Query: 565 KKLKKVSSMLLAAKEERKDVQRENRK 590
++ ++ +L +EE+ +Q+EN K
Sbjct: 193 QRCHELDKQVLLLQEEKNSLQQENEK 218
>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
and conversion].
Length = 161
Score = 30.3 bits (69), Expect = 1.7
Identities = 21/81 (25%), Positives = 38/81 (46%), Gaps = 1/81 (1%)
Query: 473 DETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKA-EIQQQLLDAAAQELEHRKQK 531
DE A++ ++ + + E+ + L + E E+A EI +Q A Q E K +
Sbjct: 36 DERQAKIADDLAEAERLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKAE 95
Query: 532 EEELKKNIQEKDAERIDMEEK 552
EE + I+E I+ E++
Sbjct: 96 AEEELERIKEAAEAEIEAEKE 116
>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
Length = 455
Score = 31.4 bits (71), Expect = 1.7
Identities = 24/100 (24%), Positives = 47/100 (47%), Gaps = 12/100 (12%)
Query: 493 LELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEK 552
LE L+K V G + ++++A ++E +KE E +E +R+ +EE
Sbjct: 363 LERLEKDVEEGEKTIVKEAR-----------QIEEELEKEVEKLGKEEESLFKRVALEEG 411
Query: 553 YSSLQEESEGLSKKL-KKVSSMLLAAKEERKDVQRENRKE 591
L+++ E K+L K+ +L K E +V++ +
Sbjct: 412 LKELEQDEENFLKELSKEEKELLEKLKMEASEVEKLFGRA 451
>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
Length = 584
Score = 31.5 bits (72), Expect = 1.8
Identities = 13/34 (38%), Positives = 17/34 (50%)
Query: 441 ASVPPPREEPTPNPEPELDETEPCVEPSEVIEDE 474
A PP EEP P P P D+ +P + E + E
Sbjct: 260 APPQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAE 293
>gnl|CDD|223182 COG0104, PurA, Adenylosuccinate synthase [Nucleotide transport and
metabolism].
Length = 430
Score = 31.4 bits (72), Expect = 1.9
Identities = 17/71 (23%), Positives = 30/71 (42%), Gaps = 8/71 (11%)
Query: 522 AQELEHRKQKEEELKKNIQEK--------DAERIDMEEKYSSLQEESEGLSKKLKKVSSM 573
+L + E+L++ ++ K AE +D E+ E +E L + VS +
Sbjct: 149 VGDLLDPETLREKLERLLEYKNFQLVKYYGAEAVDFEDILDEYYEYAERLKPYVTDVSVL 208
Query: 574 LLAAKEERKDV 584
L A + K V
Sbjct: 209 LNDALDAGKRV 219
>gnl|CDD|218380 pfam05010, TACC, Transforming acidic coiled-coil-containing protein
(TACC). This family contains the proteins TACC 1, 2 and
3 the genes for which are found concentrated in the
centrosomes of eukaryotic and may play a conserved role
in organising centrosomal microtubules. The human TACC
proteins have been linked to cancer and TACC2 has been
identified as a possible tumour suppressor (AZU-1). The
functional homologue (Alp7) in Schizosaccharomyces pombe
has been shown to be required for organisation of
bipolar spindles.
Length = 207
Score = 30.5 bits (69), Expect = 2.1
Identities = 32/135 (23%), Positives = 63/135 (46%), Gaps = 4/135 (2%)
Query: 472 EDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQK 531
+ + DA LK+ + + S + E L+KK LE +I + AQ +E +++
Sbjct: 4 QKDMDAALKKIKREVISSELESEELKKKYEKLRSENLEMGKIVDEFEKTIAQMIEESQKQ 63
Query: 532 EEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKE 591
+E K+ +QE AE+ +SL+ L K+ +K ++ E K + +K
Sbjct: 64 KELSKRELQEVLAEKDQAYADLNSLETSFSDLFKRYEKYKEVI----EGYKKNEETLKKC 119
Query: 592 IEGFLDNVRQITKEY 606
+ +LD +++ + Y
Sbjct: 120 AQEYLDRLKKEEQRY 134
Score = 29.7 bits (67), Expect = 3.7
Identities = 44/200 (22%), Positives = 91/200 (45%), Gaps = 22/200 (11%)
Query: 416 NQDPKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPEL-DETEPCVEPSEVIEDE 474
+Q DA L K ++E+ + + EE + ++ DE E + +++IE+
Sbjct: 3 SQKDMDAALKKIKREVISSELESEELKKKYEKLRSENLEMGKIVDEFEKTI--AQMIEES 60
Query: 475 TDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEK--AEIQQQLLDAAAQELEHRKQKE 532
++ + + ++++ L +K N LE +++ ++ + + +E K+ E
Sbjct: 61 -----QKQKELSKRELQEV-LAEKDQAYADLNSLETSFSDLFKRY-EKYKEVIEGYKKNE 113
Query: 533 EELKKNIQEKDAERIDMEEKYSSLQ----EESEGLSKKLKKVSSMLLAAKEERKDVQREN 588
E LKK QE E++Y +L+ E+ E ++++ +V S AK E +Q
Sbjct: 114 ETLKKCAQEYLDRLKKEEQRYQALKAHAEEKLEIANEEIAQVRS---KAKAETAALQASL 170
Query: 589 RKE---IEGFLDNVRQITKE 605
RKE ++ + + Q KE
Sbjct: 171 RKEQMKVQSLEETLEQKNKE 190
>gnl|CDD|234366 TIGR03825, FliH_bacil, flagellar assembly protein FliH. This
bacillus clade of FliH proteins is not found by the Pfam
FliH model pfam02108, but is closely related to the
sequences identified by that model. Sequences identified
by this model are observed in flagellar operons in an
analogous position relative to other flagellar operon
genes.
Length = 255
Score = 30.9 bits (70), Expect = 2.1
Identities = 15/76 (19%), Positives = 38/76 (50%), Gaps = 4/76 (5%)
Query: 497 QKKVLVGGENLLEKAEIQQQLLDAAAQE---LEHRKQKEEEL-KKNIQEKDAERIDMEEK 552
Q++ ++ + + E +Q+L D + LE + + ++ ++ + A R +E++
Sbjct: 15 QERGIIPLRQVTDVPETEQELADEEQEFEQILEKAEAEAAQIIEQAEAQAAAIREQIEQE 74
Query: 553 YSSLQEESEGLSKKLK 568
+ +EE E L ++ K
Sbjct: 75 RAQWEEERERLIQEAK 90
>gnl|CDD|237965 PRK15429, PRK15429, formate hydrogenlyase transcriptional activator
FhlA; Provisional.
Length = 686
Score = 31.3 bits (71), Expect = 2.1
Identities = 26/71 (36%), Positives = 33/71 (46%), Gaps = 18/71 (25%)
Query: 314 DLAGSERQAKTGASGQRLKE---ASKINLSLSTLGNVISALVDGKCTHIPYRNSKLTRIL 370
DL G ER A TGAS QR+ A K +L L +G++ L KL R+L
Sbjct: 446 DLFGHERGAFTGASAQRIGRFELADKSSLFLDEVGDMPLEL-----------QPKLLRVL 494
Query: 371 QDS----LGGN 377
Q+ LG N
Sbjct: 495 QEQEFERLGSN 505
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 31.0 bits (70), Expect = 2.2
Identities = 27/147 (18%), Positives = 57/147 (38%), Gaps = 5/147 (3%)
Query: 436 KKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLEL 495
+++ P + E + + + +E E + AE R + Q +
Sbjct: 45 PGAVAQQANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAA 104
Query: 496 LQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSS 555
Q + + + E Q+Q +A A++ K K E + +++A++ EE +
Sbjct: 105 KQAE-----QAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAK 159
Query: 556 LQEESEGLSKKLKKVSSMLLAAKEERK 582
E++ + + KK + AK E K
Sbjct: 160 AAAEAKKKAAEAKKKAEAEAKAKAEAK 186
>gnl|CDD|216249 pfam01025, GrpE, GrpE.
Length = 165
Score = 30.3 bits (69), Expect = 2.2
Identities = 13/61 (21%), Positives = 32/61 (52%)
Query: 541 EKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVR 600
E+ E ++E++ +L+EE E L ++++++ LL E ++ ++ +E E
Sbjct: 1 EEKEEEEELEDEEEALEEELEELEEEIEELKDRLLRLLAEFENYRKRTEREREEAKKYAI 60
Query: 601 Q 601
+
Sbjct: 61 E 61
Score = 28.3 bits (64), Expect = 8.1
Identities = 9/41 (21%), Positives = 24/41 (58%)
Query: 529 KQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKK 569
+ +EE L++ ++E + E +++++ L E E K+ ++
Sbjct: 10 EDEEEALEEELEELEEEIEELKDRLLRLLAEFENYRKRTER 50
>gnl|CDD|234605 PRK00059, prsA, peptidylprolyl isomerase; Provisional.
Length = 336
Score = 30.8 bits (70), Expect = 2.3
Identities = 14/92 (15%), Positives = 41/92 (44%), Gaps = 9/92 (9%)
Query: 504 GENLLEKAEIQQQLLDAAAQELEHR------KQKEEELKKNIQEKDAE-RIDMEEKYSSL 556
G+N + ++++Q+ Q L+ QK +ELK + + E ++++K + +
Sbjct: 69 GDNYEKNEQVKEQIKQQKEQILDSLITEKVLLQKAKELK--LIPSEEELNKEVDKKINEI 126
Query: 557 QEESEGLSKKLKKVSSMLLAAKEERKDVQREN 588
+++ ++ ++ +E K+ +
Sbjct: 127 KKQFNNDEEQFEEALKATGFTEETFKEYLKNQ 158
>gnl|CDD|227367 COG5034, TNG2, Chromatin remodeling protein, contains PhD zinc
finger [Chromatin structure and dynamics].
Length = 271
Score = 30.7 bits (69), Expect = 2.3
Identities = 24/100 (24%), Positives = 36/100 (36%), Gaps = 9/100 (9%)
Query: 464 CVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQ 523
+ + + K D Q+ E + L K++ LL KA Q+ A
Sbjct: 35 VCDIIKNLRQMISILKKIIDLDSQTYEEVEDGLLKEI----RELLLKAIYIQKEKSDLAD 90
Query: 524 ELE-----HRKQKEEELKKNIQEKDAERIDMEEKYSSLQE 558
E HRK ++ + K EK A RI+ S E
Sbjct: 91 RAEKLLRRHRKLLDDRIAKRPHEKVAARIENCHDAVSRLE 130
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 31.2 bits (70), Expect = 2.3
Identities = 31/114 (27%), Positives = 45/114 (39%), Gaps = 7/114 (6%)
Query: 446 PREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGE 505
E EPE +E E S + K +++ E Q V
Sbjct: 144 AEECQKEEKEPEPEEEEKPKRGSLEENNGEFMTHKLKHTENTFSRGGAEGAQ----VEAG 199
Query: 506 NLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEE 559
EK + +QQ AA ELE K+K EE +K ++E++ R E S +EE
Sbjct: 200 KEFEKLKQKQQ---EAALELEELKKKREERRKVLEEEEQRRKQEEADRKSREEE 250
>gnl|CDD|149160 pfam07926, TPR_MLP1_2, TPR/MLP1/MLP2-like protein. The sequences
featured in this family are similar to a region of human
TPR protein and to yeast myosin-like proteins 1 (MLP1)
and 2 (MLP2). These proteins share a number of features;
for example, they all have coiled-coil regions and all
three are associated with nuclear pores. TPR is thought
to be a component of nuclear pore complex- attached
intra-nuclear filaments, and is implicated in nuclear
protein import. Moreover, its N-terminal region is
involved in the activation of oncogenic kinases,
possibly by mediating the dimerisation of kinase domains
or by targeting these kinases to the nuclear pore
complex. MLP1 and MLP2 are involved in the process of
telomere length regulation, where they are thought to
interact with proteins such as Tel1p and modulate their
activity.
Length = 132
Score = 29.6 bits (67), Expect = 2.4
Identities = 26/118 (22%), Positives = 55/118 (46%), Gaps = 4/118 (3%)
Query: 476 DAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEEL 535
+ E Q++ + ++EK E ++K+ + L E E Q ++ + A Q+ E K E
Sbjct: 2 EEEKSSLQAELKRLLEKKEDAEEKI----QKLQEDLEEQAEIANEAQQKYERELVKHAED 57
Query: 536 KKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIE 593
+ +Q + +++++ + L+ E+E +L + K+ +D E K IE
Sbjct: 58 IEELQALRKQLNELKKEIAQLKAEAESAQAELSEAEESWEEQKKMLEDELSELEKRIE 115
Score = 29.2 bits (66), Expect = 3.3
Identities = 25/125 (20%), Positives = 52/125 (41%), Gaps = 22/125 (17%)
Query: 473 DETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKE 532
AELKR + EK++ LQ+ + E Q ++ + A Q+ E K
Sbjct: 6 SSLQAELKRLLEKKEDAEEKIQKLQEDL-----------EEQAEIANEAQQKYERELVKH 54
Query: 533 EELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAK----EERKDVQREN 588
E + +Q + ++ + L++E L + + + L A+ E++K ++ E
Sbjct: 55 AEDIEELQA-------LRKQLNELKKEIAQLKAEAESAQAELSEAEESWEEQKKMLEDEL 107
Query: 589 RKEIE 593
+ +
Sbjct: 108 SELEK 112
>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail. The myosin molecule is a
multi-subunit complex made up of two heavy chains and
four light chains it is a fundamental contractile
protein found in all eukaryote cell types. This family
consists of the coiled-coil myosin heavy chain tail
region. The coiled-coil is composed of the tail from two
molecules of myosin. These can then assemble into the
macromolecular thick filament. The coiled-coil region
provides the structural backbone the thick filament.
Length = 859
Score = 31.2 bits (71), Expect = 2.5
Identities = 24/87 (27%), Positives = 46/87 (52%), Gaps = 11/87 (12%)
Query: 508 LEKAEIQQQLLDAAAQELEHRKQKEE----ELKKNIQEKDAERIDMEEKYSSLQEESEGL 563
LE +I+ + +AAA ELE +++ + E K+ + E AE + Q E+ L
Sbjct: 358 LEDLQIELERANAAASELEKKQKNFDKILAEWKRKVDELQAEL-------DTAQREARNL 410
Query: 564 SKKLKKVSSMLLAAKEERKDVQRENRK 590
S +L ++ + L K++ + ++REN+
Sbjct: 411 STELFRLKNELEELKDQVEALRRENKN 437
Score = 30.0 bits (68), Expect = 5.5
Identities = 29/114 (25%), Positives = 54/114 (47%), Gaps = 10/114 (8%)
Query: 477 AELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELK 536
A+L++ + D S+ E+LE E KAE+++QL A A+ + R + E E
Sbjct: 264 AQLRQLEHDLDSLREQLE----------EESEAKAELERQLSKANAEIQQWRSKFESEGA 313
Query: 537 KNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRK 590
+E + + + +K S L+E +E + K + + E +D+Q E +
Sbjct: 314 LRAEELEELKKKLNQKISELEEAAEAANAKCDSLEKTKSRLQSELEDLQIELER 367
>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
Length = 535
Score = 31.0 bits (70), Expect = 2.5
Identities = 25/86 (29%), Positives = 44/86 (51%), Gaps = 2/86 (2%)
Query: 510 KAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKK 569
KA ++ LL+A + ++R++ E+E K QE + E+ +SL + E LS K K
Sbjct: 67 KALKKELLLEAKEEARKYREEIEQEFKSERQELKQIESRLTERATSLDRKDENLSSKEKT 126
Query: 570 VSS--MLLAAKEERKDVQRENRKEIE 593
+ S L K + D + E +++E
Sbjct: 127 LESKEQSLTDKSKHIDEREEQVEKLE 152
>gnl|CDD|163611 pfam10046, BLOC1_2, Biogenesis of lysosome-related organelles
complex-1 subunit 2. Members of this family of proteins
play a role in cellular proliferation, as well as in the
biogenesis of specialized organelles of the
endosomal-lysosomal system.
Length = 99
Score = 28.8 bits (65), Expect = 2.7
Identities = 29/105 (27%), Positives = 49/105 (46%), Gaps = 11/105 (10%)
Query: 471 IEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQ--QLLDAAAQELEHR 528
E + K +++ + VE L+LL+K N + ++ Q E+E
Sbjct: 1 AEKMFNKISKYVEAELEISVEDLKLLEK------MNENTALKYKKMRQSAKGLRVEIEAL 54
Query: 529 KQKEEELKKNIQEKDA--ERID-MEEKYSSLQEESEGLSKKLKKV 570
QK EELK +Q+ DA +++ +EE L E S+ L K+K +
Sbjct: 55 NQKYEELKPYLQQIDAIEQQVTTLEEAVYELDEYSKELESKVKSL 99
>gnl|CDD|203446 pfam06428, Sec2p, GDP/GTP exchange factor Sec2p. In Saccharomyces
cerevisiae, Sec2p is a GDP/GTP exchange factor for
Sec4p, which is required for vesicular transport at the
post-Golgi stage of yeast secretion.
Length = 90
Score = 28.8 bits (65), Expect = 2.7
Identities = 15/55 (27%), Positives = 32/55 (58%), Gaps = 1/55 (1%)
Query: 537 KNIQEKDAERIDMEEKYSSLQEESEGLSKKL-KKVSSMLLAAKEERKDVQRENRK 590
++ E+ +R+ E++ L++E E L+ L + + M+ A+ ER+ V+ +N K
Sbjct: 1 ADLAEEKKKRLRAEKEKKKLEQEVEDLTASLFDEANKMVADARREREAVEIKNEK 55
>gnl|CDD|223571 COG0497, RecN, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 557
Score = 31.0 bits (71), Expect = 2.7
Identities = 28/109 (25%), Positives = 48/109 (44%), Gaps = 16/109 (14%)
Query: 482 AQSDHQSMVEKLELLQKKVL--VGGENLLEKAEIQQ--QLLDAAAQELEHRKQKEEELKK 537
Q +HQS+++ LQ+++L G L + Q+ Q A +ELE ++KE E +
Sbjct: 128 GQHEHQSLLK--PELQRQLLDAFAGLEELAQEAYQEAYQAWKQARRELEDLQEKERERAQ 185
Query: 538 -------NIQEKDAERIDMEEKYSSLQEESEGLSK--KLKKVSSMLLAA 577
++E + + E L+EE + LS KL + L
Sbjct: 186 RADLLQFQLEELEELNLQPGE-DEELEEERKRLSNSEKLAEAIQNALEL 233
>gnl|CDD|112890 pfam04094, DUF390, Protein of unknown function (DUF390). This is a
family of long proteins currently only found in the rice
genome. They have no known function. However they may be
some kind of transposable element.
Length = 843
Score = 30.9 bits (69), Expect = 2.7
Identities = 40/174 (22%), Positives = 75/174 (43%), Gaps = 25/174 (14%)
Query: 415 VNQDPKDALLIKYQKEISELKKKLEEA----SVPPPREEPTPNPEPELDETEPCVEPSEV 470
V + +D L ++Y +EL+KKL+ A RE+ E + E +E +
Sbjct: 475 VLNEAQDDLRLQYGSRAAELEKKLDAAQGVLDAAAAREQRAAENEAASRQREEALEARAM 534
Query: 471 IEDETDAELKRAQSDHQSMVEKLEL-----------------LQKKVLVGGENLLEKAEI 513
+E +R +D ++ V E L++ L E LE+AE
Sbjct: 535 ALEERACAKERDLADREAAVAIREATLAAHEAACAEEEFALRLREDALTERERALEEAEA 594
Query: 514 QQQLLDAAAQELEHRKQ-KEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKK 566
Q L A L R+ +EE+ ++N++ AER ++++ + L+ ++ L +
Sbjct: 595 AAQQL---ADSLFLREAAQEEQARRNLEGARAERAALDQRAAELEARAKELDAR 645
>gnl|CDD|227318 COG4985, COG4985, ABC-type phosphate transport system, auxiliary
component [Inorganic ion transport and metabolism].
Length = 289
Score = 30.7 bits (69), Expect = 2.7
Identities = 18/107 (16%), Positives = 52/107 (48%), Gaps = 12/107 (11%)
Query: 469 EVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHR 528
E+ D + EL++ + +++ ++++ + ++ + + LE+ ++ + R
Sbjct: 156 ELDGDPLERELQKRLLEVETLRDQVDKMVEQQVRVINSQLERLRLE-----------KRR 204
Query: 529 KQKEEELKKNIQEK-DAERIDMEEKYSSLQEESEGLSKKLKKVSSML 574
Q +L Q+ AE+ +++++ + LQ E + L +L++ L
Sbjct: 205 LQLNGQLDDEFQQHYVAEKSELQKRLAQLQTELDALRAELERQFLYL 251
>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 429
Score = 30.6 bits (70), Expect = 2.8
Identities = 14/88 (15%), Positives = 40/88 (45%), Gaps = 1/88 (1%)
Query: 531 KEEELKKNIQEKDAERI-DMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENR 589
+E+ K+ D +++ +++E+ L E E L + ++S + A + +D E
Sbjct: 15 REKLKKRGGDALDVDKLLELDEERRKLLRELEELQAERNELSKEIGRALKRGEDDAEELI 74
Query: 590 KEIEGFLDNVRQITKEYSYQQAIIQKFI 617
E++ + ++++ +A + +
Sbjct: 75 AEVKELKEKLKELEAALDELEAELDTLL 102
>gnl|CDD|179877 PRK04778, PRK04778, septation ring formation regulator EzrA;
Provisional.
Length = 569
Score = 31.0 bits (71), Expect = 2.8
Identities = 28/103 (27%), Positives = 54/103 (52%), Gaps = 8/103 (7%)
Query: 469 EVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHR 528
+++E E A K + + ++ + LE +++ + L E+ + +Q ELE
Sbjct: 296 DILEREVKAR-KYVEKNSDTLPDFLEHAKEQN----KELKEEIDRVKQSYTLNESELESV 350
Query: 529 KQKEEELK--KNIQEKDAERIDMEEK-YSSLQEESEGLSKKLK 568
+Q E++L+ + ++ ERI +E YS LQEE E + K+L+
Sbjct: 351 RQLEKQLESLEKQYDEITERIAEQEIAYSELQEELEEILKQLE 393
>gnl|CDD|146486 pfam03879, Cgr1, Cgr1 family. Members of this family are
coiled-coil proteins that are involved in pre-rRNA
processing.
Length = 105
Score = 28.9 bits (65), Expect = 2.9
Identities = 22/69 (31%), Positives = 35/69 (50%), Gaps = 9/69 (13%)
Query: 523 QELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERK 582
E + K +E+ELK EK+AER + + +++E K +K +AAK K
Sbjct: 38 LEQQAIKAREKELK---DEKEAER---QRRIQAIKERRA---AKEEKERYEKMAAKMHAK 88
Query: 583 DVQRENRKE 591
V+R R+E
Sbjct: 89 KVERLKRRE 97
>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
This family consists of several Trypanosoma brucei
procyclic acidic repetitive protein (PARP) like
sequences. The procyclic acidic repetitive protein
(parp) genes of Trypanosoma brucei encode a small family
of abundant surface proteins whose expression is
restricted to the procyclic form of the parasite. They
are found at two unlinked loci, parpA and parpB;
transcription of both loci is developmentally regulated.
Length = 145
Score = 29.5 bits (65), Expect = 2.9
Identities = 18/40 (45%), Positives = 21/40 (52%), Gaps = 1/40 (2%)
Query: 439 EEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAE 478
EE P P EE PEPE +E E EP E E+E + E
Sbjct: 65 EEEEEPEPEEEGEEEPEPE-EEGEEEPEPEETGEEEPEPE 103
Score = 28.0 bits (61), Expect = 9.7
Identities = 12/40 (30%), Positives = 13/40 (32%), Gaps = 3/40 (7%)
Query: 439 EEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAE 478
EE P E EP E E E E E + E
Sbjct: 73 EEEGEEEPEPEEEGEEEP---EPEETGEEEPEPEPEPEPE 109
>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 265
Score = 30.5 bits (69), Expect = 3.0
Identities = 16/79 (20%), Positives = 38/79 (48%), Gaps = 4/79 (5%)
Query: 466 EPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQEL 525
D++L Q + +++ ++E L ++ E + K + Q+ +D + E+
Sbjct: 27 ALLSDKIQNQDSKLSELQKEKKNIQNEIESLDNQI----EEIQSKIDELQKEIDQSKAEI 82
Query: 526 EHRKQKEEELKKNIQEKDA 544
+ +++ ELK+NI E+
Sbjct: 83 KKLQKEIAELKENIVERQE 101
Score = 29.3 bits (66), Expect = 6.7
Identities = 14/78 (17%), Positives = 38/78 (48%), Gaps = 3/78 (3%)
Query: 510 KAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKK 569
A + + +L +++++ ++ I+ D + +++ K LQ+E + ++KK
Sbjct: 25 FAALLSDKIQNQDSKLSELQKEKKNIQNEIESLDNQIEEIQSKIDELQKEIDQSKAEIKK 84
Query: 570 VSSMLLAAKE---ERKDV 584
+ + KE ER+++
Sbjct: 85 LQKEIAELKENIVERQEL 102
>gnl|CDD|238103 cd00176, SPEC, Spectrin repeats, found in several proteins involved
in cytoskeletal structure; family members include
spectrin, alpha-actinin and dystrophin; the spectrin
repeat forms a three helix bundle with the second helix
interrupted by proline in some sequences; the repeats
are independent folding units; tandem repeats are found
in differing numbers and arrange in an antiparallel
manner to form dimers; the repeats are defined by a
characteristic tryptophan (W) residue in helix A and a
leucine (L) at the carboxyl end of helix C and separated
by a linker of 5 residues; two copies of the repeat are
present here.
Length = 213
Score = 30.1 bits (68), Expect = 3.1
Identities = 26/139 (18%), Positives = 62/139 (44%), Gaps = 5/139 (3%)
Query: 458 LDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQ---KKVLVGGENLLEKAEIQ 514
L E E + ++ +D E + H+++ +L + + + GE L+E+
Sbjct: 16 LSEKEELLSSTDYGDDLESVEALLKK--HEALEAELAAHEERVEALNELGEQLIEEGHPD 73
Query: 515 QQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSML 574
+ + +EL R ++ EL + +++ E +D+++ + + + L +K ++S
Sbjct: 74 AEEIQERLEELNQRWEELRELAEERRQRLEEALDLQQFFRDADDLEQWLEEKEAALASED 133
Query: 575 LAAKEERKDVQRENRKEIE 593
L E + + KE+E
Sbjct: 134 LGKDLESVEELLKKHKELE 152
>gnl|CDD|220381 pfam09753, Use1, Membrane fusion protein Use1. This entry is of a
family of proteins all approximately 300 residues in
length. The proteins have a single C-terminal
trans-membrane domain and a SNARE [soluble NSF
(N-ethylmaleimide-sensitive fusion protein) attachment
protein receptor] domain of approximately 60 residues.
The SNARE domains are essential for membrane fusion and
are conserved from yeasts to humans. Use1 is one of the
three protein subunits that make up the SNARE complex
and it is specifically required for Golgi-endoplasmic
reticulum retrograde transport.
Length = 251
Score = 30.1 bits (68), Expect = 3.2
Identities = 32/155 (20%), Positives = 61/155 (39%), Gaps = 16/155 (10%)
Query: 419 PKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAE 478
P +L +Y++ + LK LE P E + ++ V E + A
Sbjct: 50 PSVDVLTEYRRRVDFLKGLLEAEKPSSPSE------KALANQKLSRGRAPTVDEPRSPAS 103
Query: 479 LK-RAQSDHQSMVE-KLELLQKKVLVGGENLLEKAEIQQQLLD--AAAQELEHRKQKEEE 534
+ Q+ + E + ELL + + + + D A+A ++ Q +
Sbjct: 104 KEIHQQTKSRYTSELRKELLSGSTSLRRSTDAKDRDSSGPINDDEASAASIDQVLQYHQ- 162
Query: 535 LKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKK 569
N+QEK AE +M +L+E+S + +K+
Sbjct: 163 ---NLQEKLAE--EMLLLARNLKEQSLAAQQIIKE 192
>gnl|CDD|151039 pfam10473, Cenp-F_leu_zip, Leucine-rich repeats of kinetochore
protein Cenp-F/LEK1. Cenp-F, a centromeric kinetochore,
microtubule-binding protein consisting of two
1,600-amino acid-long coils, is essential for the full
functioning of the mitotic checkpoint pathway. There are
several leucine-rich repeats along the sequence of LEK1
that are considered to be zippers, though they do not
appear to be binding DNA directly in this instance.
Length = 140
Score = 29.4 bits (66), Expect = 3.3
Identities = 25/93 (26%), Positives = 49/93 (52%), Gaps = 11/93 (11%)
Query: 477 AELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELK 536
+LK ++ + + +K+E L+++ LE +E Q+L+ E E+ K + E LK
Sbjct: 10 EKLKESEREADLLKDKVENLERE--------LEMSEENQELVIL---EAENSKAEVETLK 58
Query: 537 KNIQEKDAERIDMEEKYSSLQEESEGLSKKLKK 569
I+E D+E +++ E E L+K+L++
Sbjct: 59 TKIEEMAESLKDLELDLVTVRSEKENLTKQLQE 91
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
Length = 330
Score = 30.2 bits (69), Expect = 3.3
Identities = 20/82 (24%), Positives = 34/82 (41%), Gaps = 22/82 (26%)
Query: 510 KAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAER-----------IDMEEKYSSLQE 558
K E++ LLD + H+ QK++E ++ K + ID E SL+E
Sbjct: 53 KEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSKPRLFVLDFKGDIDASE-VESLRE 111
Query: 559 ESEGLSKKLKKVSSMLLAAKEE 580
E ++++L AK
Sbjct: 112 E----------ITAILAVAKPG 123
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 30.5 bits (69), Expect = 3.4
Identities = 27/122 (22%), Positives = 55/122 (45%), Gaps = 10/122 (8%)
Query: 466 EPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQEL 525
E + E + EL++ Q+ Q +++LE +++ L E ++Q +AA Q
Sbjct: 79 EQRKKKEQQQAEELQQKQAAEQERLKQLE--KER--------LAAQEQKKQAEEAAKQAA 128
Query: 526 EHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQ 585
+KQ EE K A+ ++ ++ +++ +KK + + AA E +K +
Sbjct: 129 LKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAE 188
Query: 586 RE 587
E
Sbjct: 189 AE 190
Score = 29.8 bits (67), Expect = 5.2
Identities = 29/126 (23%), Positives = 52/126 (41%), Gaps = 18/126 (14%)
Query: 465 VEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQE 524
V+P V+E + R Q +S E +KK E L +K +Q+ L ++
Sbjct: 55 VDPGAVVE-----QYNRQQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERL----KQ 105
Query: 525 LEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDV 584
LE + +E KK Q ++A + + Q+++E + K + A+ E K
Sbjct: 106 LEKERLAAQEQKK--QAEEAAKQAALK-----QKQAEEAAAKAAAAAK--AKAEAEAKRA 156
Query: 585 QRENRK 590
+K
Sbjct: 157 AAAAKK 162
>gnl|CDD|227780 COG5493, COG5493, Uncharacterized conserved protein containing a
coiled-coil domain [Function unknown].
Length = 231
Score = 30.2 bits (68), Expect = 3.4
Identities = 20/96 (20%), Positives = 41/96 (42%), Gaps = 6/96 (6%)
Query: 507 LLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKK 566
LLE EI +L + L K ++ ++D E + E + + E L +
Sbjct: 11 LLENPEILVDVLTQRPEILYEVLAKLTPWQQLATKQDVEELRKETEQRQKELADEKLEVR 70
Query: 567 LKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQI 602
+K + KE+ K +QR +E ++++++
Sbjct: 71 KQKAT------KEDLKLLQRFQEEEFRATKEDIKRL 100
>gnl|CDD|221533 pfam12329, TMF_DNA_bd, TATA element modulatory factor 1 DNA
binding. This is the middle region of a family of TATA
element modulatory factor 1 proteins conserved in
eukaryotes that contains at its N-terminal section a
number of leucine zippers that could potentially form
coiled coil structures. The whole proteins bind to the
TATA element of some RNA polymerase II promoters and
repress their activity. by competing with the binding of
TATA binding protein. TMFs are evolutionarily conserved
golgins that bind Rab6, a ubiquitous ras-like
GTP-binding Golgi protein, and contribute to Golgi
organisation in animal and plant cells.
Length = 74
Score = 28.0 bits (63), Expect = 3.6
Identities = 18/63 (28%), Positives = 31/63 (49%), Gaps = 10/63 (15%)
Query: 507 LLEKAEIQQQLL----DAAAQELEHR------KQKEEELKKNIQEKDAERIDMEEKYSSL 556
L EK E QL+ + +EL+H + K +EL+K I E + +E++ +L
Sbjct: 7 LAEKDEQIAQLMEEGEKLSKKELKHNNTIKKLRAKNKELEKEIAELKKKLEKLEKELENL 66
Query: 557 QEE 559
+E
Sbjct: 67 EER 69
>gnl|CDD|235395 PRK05298, PRK05298, excinuclease ABC subunit B; Provisional.
Length = 652
Score = 30.4 bits (70), Expect = 3.9
Identities = 9/11 (81%), Positives = 10/11 (90%)
Query: 138 GQTGTGKTFTM 148
G TG+GKTFTM
Sbjct: 39 GVTGSGKTFTM 49
>gnl|CDD|225747 COG3206, GumC, Uncharacterized protein involved in
exopolysaccharide biosynthesis [Cell envelope
biogenesis, outer membrane].
Length = 458
Score = 30.1 bits (68), Expect = 3.9
Identities = 30/155 (19%), Positives = 63/155 (40%), Gaps = 16/155 (10%)
Query: 468 SEVIEDETDAELKRAQSDHQSMVE-KLELLQKKVLVGGENLLEKAEIQQ-QLLDAAAQEL 525
+++ +A R + ++ + + + Q + + + A+ Q L+A EL
Sbjct: 266 LQLLPLGREAAALREVLESPTIQDLRQQYAQVRQQIADLSTELGAKHPQLVALEAQLAEL 325
Query: 526 EHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQ 585
R+Q EL++ + E +E++ ++L++E L +L K+ L + + +
Sbjct: 326 --RQQIAAELRQILASLPNELALLEQQEAALEKELAQLKGRLSKLPK--LQVQLRELERE 381
Query: 586 RE-NRKEIEGFLDNVRQITKEYSYQQAIIQKFIPT 619
E R E L YQ+ IQ+ P
Sbjct: 382 AEAARSLYETLLQ---------RYQELSIQEASPI 407
>gnl|CDD|227666 COG5374, COG5374, Uncharacterized conserved protein [Function
unknown].
Length = 192
Score = 29.8 bits (67), Expect = 4.0
Identities = 19/69 (27%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 494 ELLQKKVLVGGENLLEKAEIQQQLLDAA---AQELEHRKQKEEELKKNIQEKDAERIDME 550
E+L++ GG+ +A+ L A AQ L QK +E + +K E +
Sbjct: 126 EMLEENAKKGGKIDKMEADSTD--LKARLRKAQILLEGLQKNQEELFKLLDKYNELREQV 183
Query: 551 EKYSSLQEE 559
+K SS ++E
Sbjct: 184 QKESSKKKE 192
>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812). This
family consists of several eukaryotic proteins of
unknown function.
Length = 536
Score = 30.1 bits (68), Expect = 4.1
Identities = 28/142 (19%), Positives = 59/142 (41%), Gaps = 20/142 (14%)
Query: 473 DETDAELKRAQSDHQSMVEKLELLQ------KKVLVGGENLLEKAEIQQQLLDAAAQELE 526
+ + + + Q+ + E+L L + L+ + E + D ++L
Sbjct: 332 PDAENNVAKLQALVVASSERLLELAQQWEAHRTPLI--DEYRSLKEKNRNKEDETQRQL- 388
Query: 527 HRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKK---VSSMLLAAKEERKD 583
+ ++L+ I+E ++E E+ Y L +E E K + + +L K +K
Sbjct: 389 ---DEIKKLRNKIEELESELQTKEQLYKQLLDEYENAPKSVSRSAYTRRILEIIKNIKK- 444
Query: 584 VQRENRKEIEGFLDNVRQITKE 605
Q+E +I+ L + R + KE
Sbjct: 445 -QKE---DIDKILSDTRSLQKE 462
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
with the myosin-like domain [Function unknown].
Length = 499
Score = 30.4 bits (68), Expect = 4.2
Identities = 23/113 (20%), Positives = 52/113 (46%), Gaps = 7/113 (6%)
Query: 509 EKAEIQQQLLDA------AAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEG 562
E+ ++Q+L A A QEL ++ ++L+ ++ +R +E + SLQ +
Sbjct: 117 EREAVRQELAAARQNLAKAQQELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQ 176
Query: 563 LSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKEYSYQQAIIQK 615
L ++ S +L K +++E + + + + T+E + + A Q+
Sbjct: 177 LQASATQLKSQVLDLKLRSAQIEQEAQ-NLATRANAAQARTEELARRAAAAQQ 228
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 30.1 bits (68), Expect = 4.3
Identities = 30/172 (17%), Positives = 59/172 (34%), Gaps = 35/172 (20%)
Query: 435 KKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEK-- 492
+ +EE S +E + + + D V+ E E+E + + + V
Sbjct: 416 DESVEEISDAEEDDEDEEDEDGDGDVEMSAVDNDEEKEEEDKEAIPSTILEEEPTVGGGL 475
Query: 493 ---LELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRK-QKEEELKKNIQEKDAERID 548
L+LL+ + +L E E R+ KE+E K + E
Sbjct: 476 AAALKLLK----------------SRGILKKNQLERERREFLKEKERLKLLAEIRE---- 515
Query: 549 MEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVR 600
++ E + K + M +EE + + R + E + +V+
Sbjct: 516 ------RIERERDRNDGKYSR---MSAREREEYARPENDQRDKEEAYKPDVK 558
>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
interacts with SMC proteins [Cell division and
chromosome partitioning].
Length = 622
Score = 30.3 bits (68), Expect = 4.3
Identities = 36/204 (17%), Positives = 79/204 (38%), Gaps = 23/204 (11%)
Query: 412 KARVNQDPKDALLIKYQKEISELKKKLEEASVPPP--------REEPTPNPE---PELDE 460
K + + K+ + Q I EL K+L + + RE+ T + + D+
Sbjct: 334 KLKSEIELKEEEIKALQSNIDELHKQLRKQGISTEQFELMNQEREKLTRELDKINIQSDK 393
Query: 461 TEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQ----- 515
V+ ++ L++ + S+++ + + ++ + K I+Q
Sbjct: 394 LTKSVKSRKLEAQGIFKSLEKTLRQYDSLIQNITRSRSQIGHNVNDSSLKINIEQLFPKG 453
Query: 516 ----QLLDAAAQELEHRKQKE--EELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKK 569
+ + + EL Q+ E K+I ++ +++ + L + E L +L +
Sbjct: 454 SGINESIKKSILELNDEIQERIKTEENKSITLEED-IKNLKHDINELTQILEKLELELSE 512
Query: 570 VSSMLLAAKEERKDVQRENRKEIE 593
+S +KEE + R EIE
Sbjct: 513 ANSKFELSKEENERELVAQRIEIE 536
>gnl|CDD|192773 pfam11559, ADIP, Afadin- and alpha -actinin-Binding. This family
is found in mammals where it is localised at cell-cell
adherens junctions, and in Sch. pombe and other fungi
where it anchors spindle-pole bodies to spindle
microtubules. It is a coiled-coil structure, and in
pombe, it is required for anchoring the minus end of
spindle microtubules to the centrosome equivalent, the
spindle-pole body. The name ADIP derives from the family
being composed of Afadin- and alpha -Actinin-Binding
Proteins Localised at Cell-Cell Adherens Junctions.
Length = 149
Score = 29.2 bits (66), Expect = 4.4
Identities = 19/67 (28%), Positives = 34/67 (50%), Gaps = 8/67 (11%)
Query: 476 DAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEE-- 533
+AE++R Q+ + + +LE L++ E L +A+ +Q E + + +KEE
Sbjct: 63 EAEIERLQNTIERLKTQLEDLER------ELALLQAKERQLEKKLKTLEQKLKNEKEEVQ 116
Query: 534 ELKKNIQ 540
LK IQ
Sbjct: 117 RLKNIIQ 123
>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D. This
model represents one of two ATPase subunits of the
trimeric magnesium chelatase responsible for insertion
of magnesium ion into protoporphyrin IX. This is an
essential step in the biosynthesis of both chlorophyll
and bacteriochlorophyll. This subunit is found in green
plants, photosynthetic algae, cyanobacteria and other
photosynthetic bacteria. Unlike subunit I (TIGR02030),
this subunit is not found in archaea [Biosynthesis of
cofactors, prosthetic groups, and carriers, Chlorophyll
and bacteriochlorphyll].
Length = 589
Score = 30.2 bits (68), Expect = 4.4
Identities = 13/62 (20%), Positives = 25/62 (40%), Gaps = 4/62 (6%)
Query: 416 NQDPKDALLIKYQKEISELKKKLEEASVPPPREEPTP----NPEPELDETEPCVEPSEVI 471
+D K A+ + + L + + PPP EP EP+ + + E ++
Sbjct: 248 EEDLKLAVELVLLPRATRLPEPEPQPPPPPPPPEPPEPEEEPDEPDQTDPDDGEETDQIP 307
Query: 472 ED 473
E+
Sbjct: 308 EE 309
Score = 29.4 bits (66), Expect = 7.5
Identities = 22/86 (25%), Positives = 35/86 (40%), Gaps = 6/86 (6%)
Query: 422 ALLIKYQKEISELKKKLE-----EASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETD 476
AL + + +LK +E A+ P E P P P + EP EP E + + D
Sbjct: 239 ALHGRTEVTEEDLKLAVELVLLPRATRLPEPEPQPPPPPPPPEPPEPEEEPDEPDQTDPD 298
Query: 477 AELKRAQSDHQSMVEKLE-LLQKKVL 501
+ Q + M + +E L +L
Sbjct: 299 DGEETDQIPEELMFDAVEADLPDNIL 324
>gnl|CDD|215918 pfam00435, Spectrin, Spectrin repeat. Spectrin repeat-domains are
found in several proteins involved in cytoskeletal
structure. These include spectrin, alpha-actinin and
dystrophin. The sequence repeat used in this family is
taken from the structural repeat in reference. The
spectrin domain- repeat forms a three helix bundle. The
second helix is interrupted by proline in some
sequences. The repeats are defined by a characteristic
tryptophan (W) residue at position 17 in helix A and a
leucine (L) at 2 residues from the carboxyl end of helix
C. Although the domain occurs in ultiple repeats along
sequences, the domains are actually stable on their own
- ie they act, biophysically, like domains rather than
repeats that along function when aggregated.
Length = 105
Score = 28.4 bits (64), Expect = 4.4
Identities = 21/127 (16%), Positives = 49/127 (38%), Gaps = 32/127 (25%)
Query: 423 LLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRA 482
LL ++ ++ +L+ +EE E L + + V
Sbjct: 2 LLQQFFRDADDLESWIEEK-------------EALLSSEDYGKDLESV------------ 36
Query: 483 QSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELK---KNI 539
Q++++K + L+ ++ + + E+ ++L+ E +++ EEL + +
Sbjct: 37 ----QALLKKHKALEAELAAHQDRVEALNELAEKLIAEGHYASEEIQERLEELNERWEQL 92
Query: 540 QEKDAER 546
E AER
Sbjct: 93 LELAAER 99
>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
Length = 562
Score = 30.0 bits (68), Expect = 4.5
Identities = 38/182 (20%), Positives = 73/182 (40%), Gaps = 39/182 (21%)
Query: 473 DETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQ-- 530
E + + + + + LE L + E + E E ++LL+ + + KQ
Sbjct: 295 SEGPDRITKIKDKLKELQHSLEKLDTAIDELEEIMDEFNEQSKKLLELKNK-ISTNKQSL 353
Query: 531 -----KEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQ 585
K +++K I+E AE +D E+ + LQ+E + + K ++ + K
Sbjct: 354 ITLVDKAKKVKAAIEELQAEFVDNAEELAKLQDELDKIVKTKSEL--------VKEKY-- 403
Query: 586 RENRKEIEGFLDNVRQITKEYSYQQAIIQKFIP------TKYLELM------TQNATWNE 633
V + K+ + +II+K+IP YL++M T + +NE
Sbjct: 404 ---------HRGIVTDLLKDSGIKASIIKKYIPYFNKQINHYLQIMEADYNFTLDEEFNE 454
Query: 634 DI 635
I
Sbjct: 455 TI 456
>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
Length = 1355
Score = 30.4 bits (68), Expect = 4.5
Identities = 17/51 (33%), Positives = 22/51 (43%), Gaps = 5/51 (9%)
Query: 439 EEASVPPPREEPTP--NPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQ 487
E P E P PEP ++ET+P P E + E KRA+ Q
Sbjct: 475 EPLYQQPQPVEQQPVVEPEPVVEETKPARPPLYYFE---EVEEKRAREREQ 522
>gnl|CDD|224755 COG1842, PspA, Phage shock protein A (IM30), suppresses
sigma54-dependent transcription [Transcription / Signal
transduction mechanisms].
Length = 225
Score = 29.6 bits (67), Expect = 4.5
Identities = 28/129 (21%), Positives = 63/129 (48%), Gaps = 12/129 (9%)
Query: 474 ETDAELKRAQSDHQSMVEKLEL--------LQKKVLVGGENLLEKAEIQQQLLDAAAQEL 525
+ + +L+ AQ+ + + EK EL L ++ L ++L + A+ + L A +++
Sbjct: 56 QLERKLEEAQARAEKLEEKAELALQAGNEDLAREALEEKQSLEDLAKALEAELQQAEEQV 115
Query: 526 EHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSK---KLKKVSSMLLAAKEERK 582
E K++ L++ I E A++ ++ + ++ + E +++ S+M + E K
Sbjct: 116 EKLKKQLAALEQKIAELRAKKEALKARKAA-AKAQEKVNRSLGGGSSSSAMAAFERMEEK 174
Query: 583 DVQRENRKE 591
+RE R E
Sbjct: 175 IEEREARAE 183
>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK. Members of this
family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
chaperone system. All members of the seed alignment were
taken from completely sequenced bacterial or archaeal
genomes and (except for Mycoplasma sequence) found
clustered with other genes of this systems. This model
excludes DnaK homologs that are not DnaK itself, such as
the heat shock cognate protein HscA (TIGR01991).
However, it is not designed to distinguish among DnaK
paralogs in eukaryotes. Note that a number of dnaK genes
have shadow ORFs in the same reverse (relative to dnaK)
reading frame, a few of which have been assigned
glutamate dehydrogenase activity. The significance of
this observation is unclear; lengths of such shadow ORFs
are highly variable as if the presumptive protein
product is not conserved [Protein fate, Protein folding
and stabilization].
Length = 595
Score = 30.0 bits (68), Expect = 4.6
Identities = 23/94 (24%), Positives = 48/94 (51%), Gaps = 7/94 (7%)
Query: 508 LEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKL 567
L + EI++ + +A A E +K+KEE +N D+ E+ +L+E + L +
Sbjct: 498 LSEEEIERMVKEAEANAEEDKKRKEEIEARN--NADSLAYQAEK---TLKEAGDKLPAEE 552
Query: 568 KKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQ 601
K+ A E ++ ++ E+ +EI+ + ++Q
Sbjct: 553 KEKIEK--AVAELKEALKGEDVEEIKAKTEELQQ 584
>gnl|CDD|189476 pfam00261, Tropomyosin, Tropomyosin.
Length = 237
Score = 29.5 bits (67), Expect = 4.7
Identities = 42/139 (30%), Positives = 64/139 (46%), Gaps = 31/139 (22%)
Query: 473 DETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKE 532
++ +A+LK A+ + K E + +K LV E LE+AE + + ++ EL E
Sbjct: 95 EQLEAQLKEAKEIAEEADRKYEEVARK-LVVVEGDLERAEERAEAAESKIVEL------E 147
Query: 533 EELK---KNIQ------EKDAERID-MEEKYSSLQ----------EESEGLSKKLKKVSS 572
EELK N++ EK ++R D EEK L E +E +KL+K
Sbjct: 148 EELKVVGNNLKSLEVSEEKASQREDSYEEKIRDLTEKLKEAETRAEFAERSVQKLEKEVD 207
Query: 573 ML----LAAKEERKDVQRE 587
L LA KE+ K + E
Sbjct: 208 RLEDELLAEKEKYKAISEE 226
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 30.1 bits (68), Expect = 4.8
Identities = 21/110 (19%), Positives = 40/110 (36%), Gaps = 18/110 (16%)
Query: 477 AELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELK 536
AEL S+ ++ KL L ++ +++ L EL ++K EEL+
Sbjct: 185 AELTTLLSEQRAQQAKLAQLLEE--------------RKKTLAQLNSELSADQKKLEELR 230
Query: 537 KNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQR 586
N E E + +E + + ++ AA+ +R
Sbjct: 231 ANESRLKNEIASAEAAAAKAREAAA----AAEAAAARARAAEAKRTGETY 276
Score = 28.9 bits (65), Expect = 9.8
Identities = 13/85 (15%), Positives = 35/85 (41%), Gaps = 6/85 (7%)
Query: 512 EIQQQL--LDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKK 569
+++ L A ++L + + + + +E+ + K + L EE + +L
Sbjct: 158 ARAERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNS 217
Query: 570 VSSMLLAAKEERKDVQRENRKEIEG 594
L+A +++ + R N ++
Sbjct: 218 ----ELSADQKKLEELRANESRLKN 238
>gnl|CDD|180481 PRK06231, PRK06231, F0F1 ATP synthase subunit B; Validated.
Length = 205
Score = 29.4 bits (66), Expect = 4.8
Identities = 27/105 (25%), Positives = 51/105 (48%), Gaps = 15/105 (14%)
Query: 506 NLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAER----------IDMEEKYSS 555
LLE A+ + + A A+E+ + E K+ EK+A R ++E++
Sbjct: 100 QLLENAKQRHENALAQAKEIIDQANYEALQLKSELEKEANRQANLIIFQARQEIEKERRE 159
Query: 556 LQEESEGLSKKLKKVSSMLLAAKEERKDVQRE-NRKEIEGFLDNV 599
L+E+ + S +L +ML A + +K V RE + K ++ F+ +
Sbjct: 160 LKEQLQKESVEL----AMLAAEELIKKKVDREDDDKLVDEFIREL 200
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 30.0 bits (68), Expect = 4.8
Identities = 29/139 (20%), Positives = 61/139 (43%), Gaps = 9/139 (6%)
Query: 475 TDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLD----AAAQEL----- 525
+L+R Q++ QS+ +KLEL + E L + + Q + L A + L
Sbjct: 654 ARLDLQRLQNEQQSLKDKLELAIAERKQQAETQLRQLDAQLKQLLEQQQAFLEALKDDFR 713
Query: 526 EHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQ 585
E R ++ + + E D + + + + +++ K+LKK LA+ + +
Sbjct: 714 ELRTERLAKWQVVEGELDNQLAQLSAAIEAARTQAKARLKELKKQYDRELASLDVDPNTV 773
Query: 586 RENRKEIEGFLDNVRQITK 604
+E +++IE + +I
Sbjct: 774 KELKRQIEELETTIERIAV 792
>gnl|CDD|232931 TIGR00344, alaS, alanine--tRNA ligase. The model describes
alanine--tRNA ligase. This enzyme catalyzes the reaction
(tRNAala + L-alanine + ATP = L-alanyl-tRNAala +
pyrophosphate + AMP) [Protein synthesis, tRNA
aminoacylation].
Length = 851
Score = 30.0 bits (68), Expect = 4.9
Identities = 26/106 (24%), Positives = 43/106 (40%), Gaps = 12/106 (11%)
Query: 487 QSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAER 546
+KL+ L + V L +K E + A +ELE K K + K + + AE
Sbjct: 704 NEQEDKLKELADILKVTPNELPKKIERFFEEWKALQKELESLKLKIADQKIDELKDKAET 763
Query: 547 I----------DMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERK 582
I D ++ SL+ ++ L KL + +LA+ K
Sbjct: 764 INGVKVLVEVVDAKDM-KSLKTMADRLKSKLGS-AIYVLASFANGK 807
>gnl|CDD|204444 pfam10319, 7TM_GPCR_Srj, Serpentine type 7TM GPCR chemoreceptor
Srj. Chemoreception is mediated in Caenorhabditis
elegans by members of the seven-transmembrane
G-protein-coupled receptor class (7TM GPCRs) of proteins
which are of the serpentine type. Srj is part of the Str
superfamily of chemoreceptors. The srj family is
designated as the out-group based on its location in
preliminary phylogenetic analyses of the entire
superfamily. Chemoperception is one of the central
senses of soil nematodes like C. elegans which are
otherwise 'blind' and 'deaf'.
Length = 310
Score = 29.8 bits (68), Expect = 4.9
Identities = 8/45 (17%), Positives = 16/45 (35%), Gaps = 10/45 (22%)
Query: 164 SFAHIFGHIA----------KADESVKFLVRVSYFEIYNEEIRDL 198
S H + AD+ ++ +R + E Y + D+
Sbjct: 135 SILFFLFHGSVWTAICYFCLYADDEIRDYIREDFRETYGVDSMDI 179
>gnl|CDD|221623 pfam12531, DUF3731, DNA-K related protein. This domain family is
found in bacteria, and is approximately 250 amino acids
in length. There are two conserved sequence motifs: RPG
and WRR. The proteins in this family are frequently
annotated as DNA-K related proteins however there is
little accompanying literature to confirm this.
Length = 249
Score = 29.5 bits (67), Expect = 5.0
Identities = 20/69 (28%), Positives = 30/69 (43%), Gaps = 14/69 (20%)
Query: 511 AEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDA----------ERIDMEEKYSSLQEES 560
QQQLLD A L+ Q+ +L K + ER+ +E+K E
Sbjct: 65 EGQQQQLLDDLAPYLQPGAQRNLKLPKGPTPQSYDEMVRLAASLERLPVEDK----VELG 120
Query: 561 EGLSKKLKK 569
E L ++L+K
Sbjct: 121 EWLLERLQK 129
>gnl|CDD|221232 pfam11802, CENP-K, Centromere-associated protein K. CENP-K is one
of seven new CENP-A-nucleosome distal (CAD) centromere
components (the others being CENP-L, CENP-O, CENP-P,
CENP-Q, CENP-R and CENP-S) that are identified as
assembling on the CENP-A nucleosome associated complex,
NAC. The CENP-A NAC is essential, as disruption of the
complex causes errors of chromosome alignment and
segregation that preclude cell survival despite
continued centromere-derived mitotic checkpoint
signalling. CENP-K is centromere-associated through its
interaction with one or more components of the CENP-A
NAC.
Length = 273
Score = 29.5 bits (66), Expect = 5.1
Identities = 36/191 (18%), Positives = 69/191 (36%), Gaps = 37/191 (19%)
Query: 439 EEASVPPPREEPTPNP----EPELDETEPCVEPSEVIED-----------ETDAELKRAQ 483
E+A + P +P P E L E E + E + E+DA+L
Sbjct: 4 EQAELFPDISTDSPAPIDAEEELLRECENIWKDMEECQSKLTLIGTETLPESDAQLSLLI 63
Query: 484 SDHQSMVEKLELLQKKV-----------LVGGENLLEKAEIQQQLLDAAAQELEHRKQKE 532
+++ + E QK+ L G+ L+K ++Q L L + K
Sbjct: 64 MQMKALTAEYEQWQKRTPEIISLNEDVLLTLGKEELQK--LRQDL----EMVLSSVQSKN 117
Query: 533 EELKKN---IQEKDAERIDMEEKYSSLQEESEGLSKKLKK--VSSMLLAAKEERKDVQRE 587
E+LK++ Q+ E+ + + + Q E + + + L K+ + +
Sbjct: 118 EKLKEDLEREQQWLDEQQQILDSLNERQSELKNQVVTFSESRIFQELKTKMLRIKEYKEK 177
Query: 588 NRKEIEGFLDN 598
+ FL+
Sbjct: 178 LLSALGEFLEE 188
>gnl|CDD|153289 cd07605, I-BAR_IMD, Inverse (I)-BAR, also known as the IRSp53/MIM
homology Domain (IMD), a dimerization module that binds
and bends membranes. Inverse (I)-BAR (or IMD) is a
member of the Bin/Amphiphysin/Rvs (BAR) domain family.
It is a dimerization and lipid-binding module that bends
membranes and induces membrane protrusions in the
opposite direction compared to classical BAR and F-BAR
domains, which produce membrane invaginations. IMD
domains are found in Insulin Receptor tyrosine kinase
Substrate p53 (IRSp53), Missing in Metastasis (MIM), and
Brain-specific Angiogenesis Inhibitor 1-Associated
Protein 2-like (BAIAP2L) proteins. These are
multi-domain proteins that act as scaffolding proteins
and transducers of a variety of signaling pathways that
link membrane dynamics and the underlying actin
cytoskeleton. Most members contain an N-terminal IMD, an
SH3 domain, and a WASP homology 2 (WH2) actin-binding
motif at the C-terminus, exccept for MIM which does not
carry an SH3 domain. Some members contain additional
domains and motifs. The IMD domain binds and bundles
actin filaments, binds membranes and produces membrane
protrusions, and interacts with the small GTPase Rac.
Length = 223
Score = 29.3 bits (66), Expect = 5.2
Identities = 24/128 (18%), Positives = 53/128 (41%), Gaps = 16/128 (12%)
Query: 504 GENLLEKAEIQQQL---LDAAAQELEHRKQKEEELKKNIQEKDAERI-DMEEKYSSLQEE 559
GE L + + + + L+ A+ + L+K + E D + I E+ Y E
Sbjct: 67 GEALKQIVDTHKSIEASLEQVAKAF--HGELILPLEKKL-ELDQKVINKFEKDYKK---E 120
Query: 560 SEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIEGFLDNVRQITKEYSYQQAIIQKFIPT 619
+ + L K S L K+ +K Q+ + + + + Q +E + +Q ++ F+
Sbjct: 121 YKQKREDLDKARSEL---KKLQKKSQKSGTGKYQ---EKLDQALEELNDKQKELEAFVSQ 174
Query: 620 KYLELMTQ 627
+ + +
Sbjct: 175 GLRDALLE 182
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 30.2 bits (68), Expect = 5.2
Identities = 19/68 (27%), Positives = 36/68 (52%), Gaps = 2/68 (2%)
Query: 505 ENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLS 564
E+ L E +++ L+ A+E R+++ EE ++ +EK A D + + +++ E L
Sbjct: 247 EDFL--LEEKRRELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRREKLQ 304
Query: 565 KKLKKVSS 572
LKK S
Sbjct: 305 NLLKKASR 312
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 30.3 bits (68), Expect = 5.2
Identities = 12/53 (22%), Positives = 19/53 (35%)
Query: 441 ASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKL 493
AS PPP P P+ EP D+ + T+ + ++ L
Sbjct: 425 ASAPPPPATPLPSAEPGSDDGPAPPPERQPPAPATEPAPDDPDDATRKALDAL 477
>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
Length = 1113
Score = 29.9 bits (68), Expect = 5.3
Identities = 30/128 (23%), Positives = 57/128 (44%), Gaps = 21/128 (16%)
Query: 487 QSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEH-------------RKQKEE 533
Q+++ LQ+K L G L + + Q+ L A Q LEH E+
Sbjct: 201 QALLNAQNDLQRKSLEGNTQLQDLLQKQRDYLTARIQRLEHQLQLLQEAINSKRLTLSEK 260
Query: 534 ELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIE 593
+++ + +A RI + +E E ++ +L S LL A E+ + ++N + ++
Sbjct: 261 TVQEAQSQDEAARIQAN---PLVAQELE-INLQL---SQRLLKATEKLNTLTQQNLR-VK 312
Query: 594 GFLDNVRQ 601
+LD + Q
Sbjct: 313 NWLDRLTQ 320
Score = 29.5 bits (67), Expect = 7.4
Identities = 23/106 (21%), Positives = 53/106 (50%), Gaps = 16/106 (15%)
Query: 492 KLELLQKKVLVGGENLLEKAEIQQ--QLLDAAAQELEHRKQKEEELKKNIQEKDAERIDM 549
+L+ L K+ L+ E+ L + +++Q LLD ++ K++ E+LK+ + + A+
Sbjct: 44 QLDALNKQKLLEAEDKLVQQDLEQTLALLDKIDRQ----KEETEQLKQQLAQAPAKLRQA 99
Query: 550 EEKYSSLQ--------EESEGLSKKLKKVSSMLLAAKEERKDVQRE 587
+ + +L+ E LS L+++ S L ++ ++ Q +
Sbjct: 100 QAELEALKDDNDEETRETLSTLS--LRQLESRLAQTLDQLQNAQND 143
>gnl|CDD|235871 PRK06835, PRK06835, DNA replication protein DnaC; Validated.
Length = 329
Score = 29.9 bits (68), Expect = 5.3
Identities = 10/16 (62%), Positives = 11/16 (68%)
Query: 131 NGTIFAYGQTGTGKTF 146
N + YG TGTGKTF
Sbjct: 183 NENLLFYGNTGTGKTF 198
>gnl|CDD|234547 TIGR04330, cas_Cpf1, CRISPR-associated protein Cpf1, subtype
PREFRAN. This family is the long protein of a novel
CRISPR subtype, PREFRAN, which is most common in
Prevotella and Francisella, although widely distributed.
The PREFRAN type has Cas1, Cas2, and Cas4, but lacks the
helicase Cas3 and endonuclease Cas3-HD.
Length = 1287
Score = 30.2 bits (68), Expect = 5.6
Identities = 25/106 (23%), Positives = 51/106 (48%), Gaps = 7/106 (6%)
Query: 513 IQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSS 572
I+ L + +++E RK K + K+ +EK +++ ++ Y S+Q ++ L K +K+ S
Sbjct: 373 IRTALEEYIKKQIEPRKGKSK--KEKYEEK-VKKVVKKQDYLSIQFINDLLEKYIKEFDS 429
Query: 573 ---MLLAAKEERKDVQRENRKEIEGFLDNVRQITKEYSYQQAIIQK 615
+ ++ ++ R E DN+ I Y + I+QK
Sbjct: 430 KDRLFENIRDYFAEISLM-RNEDFQTNDNIANIKTAYQTFERILQK 474
>gnl|CDD|221473 pfam12230, PRP21_like_P, Pre-mRNA splicing factor PRP21 like
protein. This domain family is found in eukaryotes, and
is typically between 212 and 238 amino acids in length.
The family is found in association with pfam01805. There
are two completely conserved residues (W and H) that may
be functionally important. PRP21 is required for
assembly of the prespliceosome and it interacts with U2
snRNP and/or pre-mRNA in the prespliceosome. This family
also contains proteins similar to PRP21, such as the
mammalian SF3a. SF3a also interacts with U2 snRNP from
the prespliceosome, converting it to its active form.
Length = 230
Score = 29.3 bits (66), Expect = 5.7
Identities = 10/38 (26%), Positives = 21/38 (55%)
Query: 514 QQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEE 551
+ +L+ A + E K +++E +K +E++ ERI
Sbjct: 16 KYYILERAYKRAEWEKHQKKEEQKKEEEEEEERIAFAS 53
>gnl|CDD|151608 pfam11166, DUF2951, Protein of unknown function (DUF2951). This
family of proteins has no known function. It has a
highly conserved sequence.
Length = 97
Score = 27.8 bits (62), Expect = 6.2
Identities = 18/69 (26%), Positives = 39/69 (56%), Gaps = 6/69 (8%)
Query: 474 ETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQL-LDAAAQELEHRKQKE 532
ET+ ++R + + +++ E L+ ++ G++ E + L LD+ +E E K+ +
Sbjct: 7 ETERRIQRLEENDKTIFESLDEIKD-----GQHTQELVNQKMDLTLDSINRERELDKENK 61
Query: 533 EELKKNIQE 541
EE +KNI++
Sbjct: 62 EENRKNIKD 70
>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 257 to 277 amino acids in length. This domain is
found associated with pfam00004. This domain has a
conserved LER sequence motif.
Length = 276
Score = 29.3 bits (66), Expect = 6.3
Identities = 27/146 (18%), Positives = 61/146 (41%), Gaps = 21/146 (14%)
Query: 460 ETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLD 519
E E ++ E + + E R +++ + + + Q++ ++ L + Q++L
Sbjct: 67 ELEAKIKEYEAQQAQAKLERARVEAEERRKTLQEQTQQEQQRAQYQDELARKRYQKELEQ 126
Query: 520 AAAQELEH--------------RKQKEEELKKNIQEKDAERIDMEEKY--SSLQEESEGL 563
Q E R+ EEE+ + +E E ++E + + ++ E+ G
Sbjct: 127 QRRQNEELLKMQEESVLRQEAMRRATEEEILEMRRETIEEEAELERENIRAKIEAEARGR 186
Query: 564 SKKLKK-----VSSMLLAAKEERKDV 584
+K+ ++ + L A EER+ V
Sbjct: 187 AKEERENEDINREMLKLKANEERETV 212
>gnl|CDD|225177 COG2268, COG2268, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 548
Score = 29.8 bits (67), Expect = 6.3
Identities = 27/166 (16%), Positives = 61/166 (36%), Gaps = 23/166 (13%)
Query: 472 EDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEK----AEIQQQLLDAAAQELEH 527
E E + E+ A+++ + + +LE+ Q+ E E AE + ++ A+
Sbjct: 229 EAEKETEIAIAEANRDAKLVELEVEQQPAGKTAEQTREVKIILAETEAEVAAWKAETRRE 288
Query: 528 RKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLK---------KVSSMLLAAK 578
+Q E ++ IQE+ A+ + E +++ E E ++ + S + A
Sbjct: 289 AEQAEILAEQAIQEEKAQA-EQEVQHAKALEAREMRVGLIERQKETELEPQERSYFINAA 347
Query: 579 EERKDVQRENRKEIE---------GFLDNVRQITKEYSYQQAIIQK 615
+ + + + I E + Q A++
Sbjct: 348 QRQAQEEAKAAANIAEAIGAQAEAAVETARETEEAERAEQAALVAA 393
>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740). This
family consists of several uncharacterized plant
proteins of unknown function.
Length = 565
Score = 29.6 bits (66), Expect = 6.3
Identities = 11/44 (25%), Positives = 20/44 (45%)
Query: 440 EASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQ 483
+ VP E E E DE EP +V++++++ E +
Sbjct: 132 KPIVPDLVLEEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEEL 175
>gnl|CDD|218967 pfam06273, eIF-4B, Plant specific eukaryotic initiation factor 4B.
This family consists of several plant specific
eukaryotic initiation factor 4B proteins.
Length = 496
Score = 29.7 bits (66), Expect = 6.6
Identities = 32/153 (20%), Positives = 58/153 (37%), Gaps = 11/153 (7%)
Query: 442 SVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVL 501
PREE E LD + E T ++ R S S E
Sbjct: 265 GAARPREEVLA--EKGLDWKKLDSEIEAKTSRPTSSQSSRPSSAQSSRSES------PGS 316
Query: 502 VGGENLLEKAEIQQQLLDAAAQE--LEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEE 559
G E +++ DA +E LE + + ++ ++ + +R + EE+ +L+EE
Sbjct: 317 QGSEGVVKPRPKVNPFGDAKPREVVLEEKGKDWRKIDLELEHRRVDRPETEEE-KNLKEE 375
Query: 560 SEGLSKKLKKVSSMLLAAKEERKDVQRENRKEI 592
L LKK ++ +K ++ + E+
Sbjct: 376 INLLRVDLKKEEAIAPESKGSGQEQSHKGLSEL 408
>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
(DUF2040). This entry is a conserved domain of
approximately 130 residues of proteins conserved from
fungi to humans. The proteins do contain a coiled-coil
domain, but the function is unknown.
Length = 128
Score = 28.1 bits (63), Expect = 6.7
Identities = 25/106 (23%), Positives = 47/106 (44%), Gaps = 19/106 (17%)
Query: 529 KQKEEELKKNIQEKDAERIDMEEKYSSLQ----------EESEGLSKKLKKVSSMLLAAK 578
K+ +E +K ++E D D +E Y S++ ++K K + S+L AA+
Sbjct: 1 KKARKEAQKALEE-DPSIYDYDEVYDSIKAAKEEKKQAKLSERKENRKPKYIGSLLEAAE 59
Query: 579 EERKD--------VQRENRKEIEGFLDNVRQITKEYSYQQAIIQKF 616
+++ +Q+E KE + F D + +T Y Q +K
Sbjct: 60 RRKREREIAEERKLQKEREKEGDEFADKEKFVTSAYKKQLEENRKL 105
>gnl|CDD|221919 pfam13094, CENP-Q, CENP-A-nucleosome distal (CAD) centromere
subunit. CENP-Q is one of the components that assembles
onto the CENP-A-nucleosome distal (CAD) centromere. The
centromere, which is the basic element of chromosome
inheritance, is epigenetically determined in mammals.
CENP-A, the centromere-specific histone H3 variant,
assembles an array of nucleosomes and it is this that
seems to be the prime candidate for specifying
centromere identity. CENP-A nucleosomes directly recruit
a proximal CENP-A nucleosome associated complex (NAC)
comprised of CENP-M, CENP-N and CENP-T, CENP-U(50),
CENP-C and CENP-H. Assembly of the CENP-A NAC at
centromeres is dependent on CENP-M, CENP-N and CENP-T.
Additionally, there are seven other subunits which make
up the CENP-A-nucleosome distal (CAD) centromere,
CENP-K, CENP-L, CENP-O, CENP-P, CENP-Q, CENP-R and
CENP-S, also assembling on the CENP-A NAC. Fta7 is the
equivalent component of the fission yeast Sim4 complex.
Length = 159
Score = 28.5 bits (64), Expect = 6.8
Identities = 30/135 (22%), Positives = 56/135 (41%), Gaps = 18/135 (13%)
Query: 484 SDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDA----AAQELEHRKQKEEELKKNI 539
+ ++L L++K + E L Q +LL+ ELE + +EL+KN
Sbjct: 16 KEDSLDYDQL--LRRKRAL--ERQLAAELKQLELLEEEIRREEAELEKDLEYLQELEKNA 71
Query: 540 QEKDAERIDMEEKY--------SSLQEESEGLSKKLKKVSSMLLAAK--EERKDVQRENR 589
+ + ER + + S + EE+E L L ++S + E D+ ++
Sbjct: 72 KALEREREEESKNLHPVLRLLESEVLEENELLQDSLLELSERNFSPNLDPELLDLLKQLN 131
Query: 590 KEIEGFLDNVRQITK 604
K +E N+ Q+
Sbjct: 132 KHLESLQGNLEQLAG 146
>gnl|CDD|219083 pfam06548, Kinesin-related, Kinesin-related. This family
represents a region within kinesin-related proteins from
higher plants. Many family members also contain the
pfam00225 domain. Kinesins are ATP-driven microtubule
motor proteins that produce directed force. Some family
members are associated with the phragmoplast, a
structure composed mainly of microtubules that executes
cytokinesis in higher plants.
Length = 488
Score = 29.5 bits (66), Expect = 6.9
Identities = 35/161 (21%), Positives = 63/161 (39%), Gaps = 11/161 (6%)
Query: 410 CNKARVNQDPKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSE 469
+ K+ LL +EI +LK +L+ + T L + S
Sbjct: 214 FKNFYFDMGEKEVLL----EEIQDLKSQLQCMLSS---SKSTARTRSSLLARSFQLRKSA 266
Query: 470 VIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRK 529
+ E + K + + + E + K + E L + E + L + ELE K
Sbjct: 267 IPESGDENAEKTLEKERERWTE----AESKWISLTEELRVELEANRSLAEKLEMELESEK 322
Query: 530 QKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKV 570
+ EELK +Q + E+Y+ LQE+ L +K +++
Sbjct: 323 KCTEELKDALQRAMQGHARIIEQYAELQEKYNDLLEKHRRI 363
>gnl|CDD|180700 PRK06800, fliH, flagellar assembly protein H; Validated.
Length = 228
Score = 29.1 bits (65), Expect = 7.0
Identities = 15/72 (20%), Positives = 34/72 (47%), Gaps = 5/72 (6%)
Query: 482 AQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKN-IQ 540
+ + Q E+L QK + L + +QQ L+ Q+L +++ +E + ++
Sbjct: 32 VEEEIQKDHEELLAQQKSL----HKELNQLRQEQQKLERERQQLLADREQFQEHVQQQMK 87
Query: 541 EKDAERIDMEEK 552
E +A R +++
Sbjct: 88 EIEAARQQFQKE 99
>gnl|CDD|220716 pfam10359, Fmp27_WPPW, RNA pol II promoter Fmp27 protein domain.
Fmp27_WPPW is a conserved domain of a family of proteins
involved in RNA polymerase II transcription initiation.
It contains characteristic HQR and WPPW sequence motifs.
and is towards the C-terminal in members which contain
Fmp27_SW pfam10305.
Length = 470
Score = 29.6 bits (67), Expect = 7.1
Identities = 16/63 (25%), Positives = 36/63 (57%), Gaps = 2/63 (3%)
Query: 512 EIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVS 571
+Q++LL+ ++LE + +K EE +++ D E +++ L+EE L ++L+ +
Sbjct: 160 RVQRELLEERLKQLEEQIKKLEEKLDDLELNDTE--ELQSDLEELEEELSVLKERLEFLE 217
Query: 572 SML 574
+L
Sbjct: 218 KLL 220
>gnl|CDD|234063 TIGR02928, TIGR02928, orc1/cdc6 family replication initiation
protein. Members of this protein family are found
exclusively in the archaea. This set of DNA binding
proteins shows homology to the origin recognition
complex subunit 1/cell division control protein 6 family
in eukaryotes. Several members may be found in genome
and interact with each other [DNA metabolism, DNA
replication, recombination, and repair].
Length = 365
Score = 29.5 bits (67), Expect = 7.1
Identities = 9/12 (75%), Positives = 11/12 (91%)
Query: 134 IFAYGQTGTGKT 145
+F YG+TGTGKT
Sbjct: 43 VFIYGKTGTGKT 54
>gnl|CDD|224241 COG1322, COG1322, Predicted nuclease of restriction
endonuclease-like fold, RmuC family [General function
prediction only].
Length = 448
Score = 29.3 bits (66), Expect = 7.1
Identities = 20/132 (15%), Positives = 46/132 (34%), Gaps = 7/132 (5%)
Query: 474 ETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEE 533
+L + M+ +L + +++L LL + + L A+ L + E
Sbjct: 18 AFIRQLLLRLGRLEQMLGELAAVLEQLL-----LLLAFRAEAEQLRTFARSL--QALNLE 70
Query: 534 ELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKEIE 593
+++ + K + + + LQ E L++ + + EE E ++
Sbjct: 71 LIQELNELKARLQQQLLQSREQLQLLIESLAQLSSEFQELANEIFEELNRRLAELNQQNL 130
Query: 594 GFLDNVRQITKE 605
L + E
Sbjct: 131 KQLLKPLREVLE 142
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 29.3 bits (66), Expect = 7.2
Identities = 32/157 (20%), Positives = 66/157 (42%), Gaps = 11/157 (7%)
Query: 421 DALLIK-YQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAEL 479
LL + Y+KE+ KK+ E V + E LDE + + E +++
Sbjct: 196 QELLQEEYEKEVKAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDD 255
Query: 480 KRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEI-QQQLLDAAAQELEHRKQKEEEL--- 535
+S + + E + K V + ++ + +++ L+ A+E + K+K +L
Sbjct: 256 ---ESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQLARL 312
Query: 536 ---KKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKK 569
K + +K+ R +E+ E+ + +KL K
Sbjct: 313 KEIAKEVAQKEKARARKKEQRKERGEKKKLKRRKLGK 349
>gnl|CDD|236272 PRK08475, PRK08475, F0F1 ATP synthase subunit B; Validated.
Length = 167
Score = 28.4 bits (64), Expect = 7.3
Identities = 7/32 (21%), Positives = 19/32 (59%)
Query: 556 LQEESEGLSKKLKKVSSMLLAAKEERKDVQRE 587
+ +SK+L+++ L +KE+++D ++
Sbjct: 51 YKSRINKISKRLEEIQEKLKESKEKKEDALKK 82
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 29.6 bits (67), Expect = 7.3
Identities = 44/211 (20%), Positives = 85/211 (40%), Gaps = 45/211 (21%)
Query: 425 IKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPS---EVIEDETDAELKR 481
K I EL + LEE +E + EL + S + ++D+ ++ +
Sbjct: 108 ATTPKSIQELVEALEELLEELLKETASDPVVQELVSIFNDLIDSIKEDNLKDDLESLIAS 167
Query: 482 AQSDHQSMVEKL--------ELLQKKVLVGGENLLEKAEIQ-------------QQLLDA 520
A+ + + +KL E L++ + E LL K E + +QL
Sbjct: 168 AKEELDQLSKKLAELKAEEEEELERALKEKREELLSKLEEELLARLESKEAALEKQLRLE 227
Query: 521 AAQELEH-RKQKEEELKKNIQEKDAE------------RIDMEEKY-----SSLQEESEG 562
+E E RK+ EE+L++ ++ + I+++ ++ ++EE G
Sbjct: 228 FEREKEELRKKYEEKLRQELERQAEAHEQKLKNELALQAIELQREFNKEIKEKVEEERNG 287
Query: 563 LSKKLKKVSSMLLA---AKEERKDVQRENRK 590
KL +++S L A + R + + EN K
Sbjct: 288 RLAKLAELNSRLKGLEKALDSRSEAEDENHK 318
>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein. This
protein is covalently attached to the terminii of
replicating DNA in vivo.
Length = 548
Score = 29.6 bits (67), Expect = 7.3
Identities = 14/52 (26%), Positives = 23/52 (44%), Gaps = 1/52 (1%)
Query: 447 REEPTPNPEPELD-ETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQ 497
R P P+P + E E P E E+E + E + ++ E + LL+
Sbjct: 295 RRRPPPSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEEVRATVAEAIRLLE 346
>gnl|CDD|233050 TIGR00609, recB, exodeoxyribonuclease V, beta subunit. The RecBCD
holoenzyme is a multifunctional nuclease with potent
ATP-dependent exodeoxyribonuclease activity. Ejection of
RecD, as occurs at chi recombinational hotspots,
cripples exonuclease activity in favor of recombinagenic
helicase activity. All proteins in this family for which
functions are known are DNA-DNA helicases that are used
as part of an exonuclease-helicase complex (made up of
RecBCD homologs) that function to generate substrates
for the initiation of recombination and recombinational
repair. This family is based on the phylogenomic
analysis of JA Eisen (1999, Ph.D. Thesis, Stanford
University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 1087
Score = 29.7 bits (67), Expect = 7.4
Identities = 10/22 (45%), Positives = 12/22 (54%)
Query: 130 FNGTIFAYGQTGTGKTFTMEGV 151
NGT GTGKTFT+ +
Sbjct: 8 LNGTFLIEASAGTGKTFTIAQL 29
>gnl|CDD|237320 PRK13258, PRK13258, 7-cyano-7-deazaguanine reductase; Provisional.
Length = 114
Score = 27.8 bits (63), Expect = 7.5
Identities = 7/17 (41%), Positives = 10/17 (58%), Gaps = 1/17 (5%)
Query: 667 QPDFSHLY-RYYPSEKV 682
QPDF+ + Y P +K
Sbjct: 34 QPDFATIVIDYIPDKKC 50
>gnl|CDD|181465 PRK08557, PRK08557, hypothetical protein; Provisional.
Length = 417
Score = 29.3 bits (66), Expect = 7.7
Identities = 20/77 (25%), Positives = 34/77 (44%), Gaps = 9/77 (11%)
Query: 143 GKTFTMEGVNNVPELKGIIPNSFAHI-----FGHIAKADESVKF--LVRVSYFEIYNEEI 195
GK E + N EL I+ N + + G K ++ +K L + E+I
Sbjct: 91 GKYIKEELIENPEELNEILENDYVGVEIGNFLGVGVKKEDRIKIKDLSLKKELDF--EKI 148
Query: 196 RDLLSKNKDQCLEVKER 212
D L KNK++ +++E
Sbjct: 149 EDYLEKNKERIEKLEEN 165
>gnl|CDD|233459 TIGR01541, tape_meas_lam_C, phage tail tape measure protein, lambda
family. This model represents a relatively
well-conserved region near the C-terminus of the tape
measure protein of a lambda and related phage. This
protein, which controls phage tail length, is typically
about 1000 residues in length. Both low-complexity
sequence and insertion/deletion events appear common in
this family. Mutational studies suggest a ruler or
template role in the determination of phage tail length.
Similar behavior is attributed to proteins from
distantly related or unrelated families in other phage
[Mobile and extrachromosomal element functions, Prophage
functions].
Length = 332
Score = 29.0 bits (65), Expect = 7.8
Identities = 27/143 (18%), Positives = 49/143 (34%), Gaps = 38/143 (26%)
Query: 514 QQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSM 573
++L A + L+ R + I K L EE+E +K
Sbjct: 15 LKKLNTADEKSLQSRSDE---------------IIALIKLEKLLEEAE------RKALEA 53
Query: 574 LLAAKEERKDVQRENRKEIEGFLDNVRQ---------ITKEYSYQQAIIQKFIPTKYL-- 622
L E ++ +N+++++ F +Q I + + QQ + K + K L
Sbjct: 54 LKKLAEATASIRAQNKRQLDRFGLGDKQRERLDARLQIDRTFRKQQRDLNKAMTAKGLAG 113
Query: 623 ------ELMTQNATWNEDIGEWE 639
+L A NE + E
Sbjct: 114 SDLYKEQLAAIKAALNEALAELH 136
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 28.9 bits (65), Expect = 7.8
Identities = 25/133 (18%), Positives = 45/133 (33%), Gaps = 15/133 (11%)
Query: 416 NQDPKDALLIKYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCV-------EPS 468
+D + L + S+ + KL N E D+ E + +
Sbjct: 56 KRDLRAELEEAERAHKSKKENKLAIEDADKSTNLDASNEGDEDDDEEDEIKRKRIEEDAR 115
Query: 469 EVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHR 528
D++D+ SD S + E L E +I+++ A +E E
Sbjct: 116 NSDADDSDSSSDSDSSDDDSDDDDSEDETA------ALLRELEKIKKE--RAEEKEREEE 167
Query: 529 KQKEEELKKNIQE 541
++ EE K +E
Sbjct: 168 EKAAEEEKAREEE 180
>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
This model family represents the major capsid protein
component of the heads (capsids) of bacteriophage HK97,
phi-105, P27, and related phage. This model represents
one of several analogous families lacking detectable
sequence similarity. The gene encoding this component is
typically located in an operon encoding the small and
large terminase subunits, the portal protein and the
prohead or maturation protease [Mobile and
extrachromosomal element functions, Prophage functions].
Length = 384
Score = 29.2 bits (66), Expect = 8.0
Identities = 22/87 (25%), Positives = 38/87 (43%), Gaps = 7/87 (8%)
Query: 515 QQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKL-----KK 569
++ L A+EL RK E+E +E+ AE ++E+ L E + L + L K
Sbjct: 5 REALAELAKEL--RKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKP 62
Query: 570 VSSMLLAAKEERKDVQRENRKEIEGFL 596
+S EE ++ + E +L
Sbjct: 63 AASGEGGGGEEEEEEAKAEAAEFRAYL 89
>gnl|CDD|153327 cd07643, I-BAR_IMD_MIM, Inverse (I)-BAR, also known as the
IRSp53/MIM homology Domain (IMD), of Missing In
Metastasis. The IMD domain, also called
Inverse-Bin/Amphiphysin/Rvs (I-BAR) domain, is a
dimerization and lipid-binding module that bends
membranes and induces membrane protrusions. Members of
this subfamily include missing in metastasis (MIM) or
metastasis suppressor 1 (MTSS1), metastasis suppressor
1-like (MTSSL) or ABBA (Actin-Bundling protein with
BAIAP2 homology), and similar proteins. They contain an
N-terminal IMD and a WASP homology 2 (WH2) actin-binding
motif at the C-terminus. MIM was originally identified
as a missing transcript from metastatic bladder and
prostate cancer cells. It is a scaffold protein that
functions in a signaling pathway between the PDGF
receptor, Src kinases, and actin assembly. It may also
function as a cofactor of the Sonic hedgehog (Shh)
transcriptional pathway and may participate in tumor
development and progression via this pathway. ABBA
regulates actin and plasma membrane dynamics to promote
the extension of radial glia, which is important in
neuronal migration, axon guidance and neurogenesis. The
IMD domain of MIM binds and bundles actin filaments,
binds membranes, and interacts with the small GTPase
Rac.
Length = 231
Score = 29.0 bits (65), Expect = 8.2
Identities = 17/61 (27%), Positives = 27/61 (44%), Gaps = 7/61 (11%)
Query: 478 ELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKK 537
E K+A+ + + LQKK G +L Q LD+A Q++ + EE +K
Sbjct: 123 EYKKARQEIKKKSSDTIRLQKKARKGKGDL-------QPQLDSAMQDVNDKYLLLEETEK 175
Query: 538 N 538
Sbjct: 176 K 176
>gnl|CDD|218434 pfam05103, DivIVA, DivIVA protein. The Bacillus subtilis divIVA1
mutation causes misplacement of the septum during cell
division, resulting in the formation of small, circular,
anucleate mini-cells. Inactivation of divIVA produces a
mini-cell phenotype, whereas overproduction of DivIVA
results in a filamentation phenotype. These proteins
appear to contain coiled-coils.
Length = 131
Score = 28.0 bits (63), Expect = 8.3
Identities = 20/83 (24%), Positives = 40/83 (48%), Gaps = 2/83 (2%)
Query: 505 ENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLS 564
E L ++ E ++ ++ ++LE K+ EE L+K + A+ E K ++ +E +
Sbjct: 35 EALYKENEELKEEIERLEEKLEEYKELEETLQKTLVV--AQETAEEVKANAQKEAELIIK 92
Query: 565 KKLKKVSSMLLAAKEERKDVQRE 587
+ K ++ A EE K + E
Sbjct: 93 EAEAKAERIVNDANEEAKKLATE 115
>gnl|CDD|221198 pfam11740, KfrA_N, Plasmid replication region DNA-binding N-term.
The broad host-range plasmid RK2 is able to replicate in
and be inherited in a stable manner in diverse
Gram-negative bacterial species. It encodes a number of
co-ordinately regulated operons including a central
control korF1 operon that represses the kfrA operon. The
KfrA polypeptide is a site-specific DNA-binding protein
whose operator overlaps the kfrA promoter. The
N-terminus, containing an helix-turn-helix motif, is
essential for function. Downstream from this family is
an extended coiled-coil domain containing a heptad
repeat segment which is probably responsible for
formation of multimers, and may provide an example of a
bridge to host structures required for plasmid
partitioning.
Length = 120
Score = 28.0 bits (63), Expect = 8.4
Identities = 9/49 (18%), Positives = 19/49 (38%), Gaps = 3/49 (6%)
Query: 511 AEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEE 559
A + + + A + L + E +AE + + +L+EE
Sbjct: 72 AALWEAAQEEAEERLA---AARAAAEAERAELEAELAEAAAEAEALEEE 117
>gnl|CDD|197544 smart00150, SPEC, Spectrin repeats.
Length = 101
Score = 27.3 bits (61), Expect = 8.7
Identities = 17/111 (15%), Positives = 48/111 (43%), Gaps = 23/111 (20%)
Query: 458 LDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELLQKKVLVGGENLLEKAEIQQQL 517
L+E E + ++ +D +++++K E + ++ E + +
Sbjct: 14 LEEKEQLLASEDLGKDLESV---------EALLKKHEAFEAELEAHEERV--------EA 56
Query: 518 LDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLK 568
L+ ++L ++ + I+E+ ++ E++ L+E +E +KL+
Sbjct: 57 LNELGEQL---IEEGHPDAEEIEER---LEELNERWEELKELAEERRQKLE 101
>gnl|CDD|219405 pfam07418, PCEMA1, Acidic phosphoprotein precursor PCEMA1. This
family consists of several acidic phosphoprotein
precursor PCEMA1 sequences which appear to be found
exclusively in Plasmodium chabaudi. PCEMA1 is an antigen
that is associated with the membrane of the infected
erythrocyte throughout the entire intraerythrocytic
cycle. The exact function of this family is unclear.
Length = 286
Score = 29.1 bits (65), Expect = 8.8
Identities = 26/123 (21%), Positives = 43/123 (34%), Gaps = 8/123 (6%)
Query: 437 KLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEKLELL 496
K +E V + L E + E +E D E+ +D ++ EL
Sbjct: 164 KKQELPVEETLSTIHEDDASTLHEDDELDEEVTSYLNELDDEVTSYFNDGENEENDDELE 223
Query: 497 QKKV--LVGGENLLEKAEIQQQLLD------AAAQELEHRKQKEEELKKNIQEKDAERID 548
+ + L GEN E E ++ A E E + E+E ++ E+
Sbjct: 224 AEVISYLKDGENDNEVKEKIRREYREWKGDKANTNETEIEDESEDEYEEEAGEEQENEDK 283
Query: 549 MEE 551
EE
Sbjct: 284 GEE 286
>gnl|CDD|237292 PRK13182, racA, polar chromosome segregation protein; Reviewed.
Length = 175
Score = 28.5 bits (64), Expect = 8.8
Identities = 29/148 (19%), Positives = 57/148 (38%), Gaps = 30/148 (20%)
Query: 433 ELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSDHQSMVEK 492
+K ++EE ++P+ N E S V D + + +
Sbjct: 48 YVKSQIEEGQNMQDTQKPSSNDVEETQVNTIVQNISSV--------------DFEQLEAQ 93
Query: 493 LELLQKKVLVGGENLLEKAEIQQQLLDAA-----AQELEHRKQKEEELKKNIQEKDAERI 547
L + +++ E+++QL A Q L+HR++ EE L++ +Q+ +A
Sbjct: 94 LNTITRRL----------DELERQLQQKADDVVSYQLLQHRREMEEMLER-LQKLEARLK 142
Query: 548 DMEEKYSSLQEESEGLSKKLKKVSSMLL 575
+E Y + E +K K M+
Sbjct: 143 KLEPIYITPDTEPTYEREKKPKRRKMIF 170
>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
unknown].
Length = 984
Score = 29.4 bits (66), Expect = 9.0
Identities = 28/150 (18%), Positives = 59/150 (39%), Gaps = 24/150 (16%)
Query: 491 EKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHR-KQKEEELKKNIQEKDAERIDM 549
+++ V E E+A + A + R K+ +EL+ ++ E +D+
Sbjct: 664 LSTLFCVQRLRVAAELQKEEARL-------ALEGNIERTKELNDELRAELELHRKEILDL 716
Query: 550 EEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENR-KEIEGFLDNVRQITKEYSY 608
+ + AA+EE++ QRE+R + +E L+ V + Y
Sbjct: 717 FD------------CGTADTEDAFREAAREEQQLTQRESRLESLEAQLEGVA--AEAYEL 762
Query: 609 QQAIIQKFIPTKYL-ELMTQNATWNEDIGE 637
++ Q+ + + L L +E++ E
Sbjct: 763 SASLDQRELKEEELALLEEAIDALDEEVEE 792
>gnl|CDD|222426 pfam13870, DUF4201, Domain of unknown function (DUF4201). This is
a family of coiled-coil proteins from eukaryotes. The
function is not known.
Length = 177
Score = 28.3 bits (64), Expect = 9.0
Identities = 21/86 (24%), Positives = 37/86 (43%), Gaps = 15/86 (17%)
Query: 523 QELEHRKQKE-------EELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSM-- 573
L H K+K E LKK ++EK+ + ++ ++ E + L K+ KK+
Sbjct: 77 HILTHLKEKLHMLEEELERLKKELKEKEERLAKLRKELYKVKLERDKLRKQNKKLRQQGG 136
Query: 574 ------LLAAKEERKDVQRENRKEIE 593
LL ++ + E RK +E
Sbjct: 137 LLHVPALLYDYDKTVEKVEEKRKSVE 162
>gnl|CDD|177020 CHL00081, chlI, Mg-protoporyphyrin IX chelatase.
Length = 350
Score = 28.8 bits (65), Expect = 9.1
Identities = 28/79 (35%), Positives = 38/79 (48%), Gaps = 8/79 (10%)
Query: 500 VLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEK----DAERIDMEEKYSS 555
VLVG N E+ E++ QLLD E R K+ EL+ I E+ D + EKY
Sbjct: 189 VLVGSGNP-EEGELRPQLLDRFGMHAEIRTVKDPELRVKIVEQRTSFDKNPQEFREKYEE 247
Query: 556 LQEESEGLSKKLKKVSSML 574
QEE L K+ ++L
Sbjct: 248 SQEE---LRSKIVAAQNLL 263
>gnl|CDD|220222 pfam09403, FadA, Adhesion protein FadA. FadA (Fusobacterium
adhesin A) is an adhesin which forms two alpha helices.
Length = 126
Score = 28.0 bits (62), Expect = 9.1
Identities = 19/79 (24%), Positives = 35/79 (44%), Gaps = 9/79 (11%)
Query: 515 QQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEES---------EGLSK 565
Q L+ E KQ++E +K +QE ++ EE Y+ L+EES + L K
Sbjct: 37 QALMQKEEARFEEEKQEKETAEKEVQELKERQLGREELYAKLKEESKTRFHKDEYKKLLK 96
Query: 566 KLKKVSSMLLAAKEERKDV 584
+ + + L +++
Sbjct: 97 RYDEYLNKLEQKITDKEQQ 115
>gnl|CDD|223024 PHA03252, PHA03252, DNA packaging tegument protein UL25;
Provisional.
Length = 589
Score = 28.9 bits (65), Expect = 9.7
Identities = 16/85 (18%), Positives = 36/85 (42%), Gaps = 5/85 (5%)
Query: 512 EIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVS 571
E ++L D +A L ++ E+L ++ + ++++ LQ E E L ++ +
Sbjct: 29 EDLRRLRDDSALRL---RRYREDLLRDRLLRRRLGEELDDLQKRLQTECEDLRSRVSEAE 85
Query: 572 SMLLAAKE--ERKDVQRENRKEIEG 594
++LL E ++G
Sbjct: 86 ALLLHDASGGEGGGATNGGEVNVDG 110
>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552). This
presumed domain is functionally uncharacterized. This
domain is found in bacteria, archaea and eukaryotes.
This domain is about 200 amino acids in length. This
domain is found associated with pfam00013, pfam01966.
This domain has a single completely conserved residue A
that may be functionally important.
Length = 201
Score = 28.3 bits (64), Expect = 9.7
Identities = 31/120 (25%), Positives = 56/120 (46%), Gaps = 28/120 (23%)
Query: 474 ETDAELKRAQSDHQSMVEKLE--LLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQK 531
E + ELK +++ Q + E LLQK E L++ + E ++K
Sbjct: 68 EAERELKERRNELQ----RQEKRLLQK------EETLDR-------------KDESLEKK 104
Query: 532 EEELKKNIQEKDAERIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDVQRENRKE 591
EE L++ +E A + +EEK L+E E ++L+++S + EE K++ E +
Sbjct: 105 EESLEEKEKELAARQQQLEEKEEELEELIEEQQQELERISGL---TAEEAKEILLEEVEA 161
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 29.0 bits (65), Expect = 9.8
Identities = 32/159 (20%), Positives = 54/159 (33%), Gaps = 29/159 (18%)
Query: 426 KYQKEISELKKKLEEASVPPPREEPTPNPEPELDETEPCVEPSEVIEDETDAELKRAQSD 485
K +S KK + + E E +E +VI ++ AE D
Sbjct: 213 KKSNIMSSFFKKKTKEKKEKKEASESTVKE-ESEEESG---KRDVILEDESAEPTGLDED 268
Query: 486 HQSMVEKLELLQKKVLVGGENLLEKAEIQQQLLDAAAQELEHRKQKEEELKKNIQEKDAE 545
E + + + D+ + E K+K + LKK + E + E
Sbjct: 269 -----EDEDEPKPSG---------------ERSDSEEETEEKEKEKRKRLKK-MMEDEDE 307
Query: 546 RIDMEEKYSSLQEESEGLSKKLKKVSSMLLAAKEERKDV 584
+ME S EE E + L +EE+++V
Sbjct: 308 DEEMEIVPESPVEEEESEEPE----PPPLPKKEEEKEEV 342
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.311 0.129 0.358
Gapped
Lambda K H
0.267 0.0724 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 35,556,769
Number of extensions: 3572205
Number of successful extensions: 6230
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5124
Number of HSP's successfully gapped: 712
Length of query: 705
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 601
Effective length of database: 6,324,786
Effective search space: 3801196386
Effective search space used: 3801196386
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 63 (28.1 bits)