RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy15183
(1973 letters)
>gnl|CDD|99850 cd06147, Rrp6p_like_exo, DEDDy 3'-5' exonuclease domain of yeast
Rrp6p, human polymyositis/scleroderma autoantigen
100kDa, and similar proteins. Yeast Rrp6p and its human
homolog, the polymyositis/scleroderma autoantigen 100kDa
(PM/Scl-100), are exosome-associated proteins involved
in the degradation and processing of precursors to
stable RNAs. Both proteins contain a DEDDy-type
DnaQ-like 3'-5' exonuclease domain possessing three
conserved sequence motifs termed ExoI, ExoII and ExoIII,
with a specific YX(3)D pattern at ExoIII. The motifs are
clustered around the active site and contain four
conserved acidic residues that serve as ligands for the
two metal ions required for catalysis. PM/Scl-100, an
autoantigen present in the nucleolar compartment of the
cell, reacts with autoantibodies produced by about 50%
of patients with polymyositis-scleroderma overlap
syndrome.
Length = 192
Score = 328 bits (844), Expect = e-103
Identities = 118/192 (61%), Positives = 149/192 (77%), Gaps = 1/192 (0%)
Query: 317 SDTPLMMITEPEQVTQLVSELKQQQEIAIDLEYHNYRSYQGYTCLMQISTRDKDYIVDTL 376
+TPL + E++ +LV +LK +EIA+DLE+H+YRSY G+TCLMQISTR++DYIVDTL
Sbjct: 1 DETPLTFVDTEEKLEELVEKLKNCKEIAVDLEHHSYRSYLGFTCLMQISTREEDYIVDTL 60
Query: 377 KLREDLEVLNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDTHQACKFLPMPRQSL 436
KLR+D+ +LNEV TD NI+KVFHGADSDI WLQ+DFGLYVV +FDT QA + L +PR SL
Sbjct: 61 KLRDDMHILNEVFTDPNILKVFHGADSDIIWLQRDFGLYVVNLFDTGQAARVLNLPRHSL 120
Query: 437 AYLLKHYCDVDSDKTFQLFDWRHRPLPEPAIQYARTDTHYLLYVYDCMKLDL-SAAAHGK 495
AYLL+ YC+VD+DK +QL DWR RPLPE I+YAR DTHYLLY+YD ++ +L A
Sbjct: 121 AYLLQKYCNVDADKKYQLADWRIRPLPEEMIKYAREDTHYLLYIYDRLRNELLERANALA 180
Query: 496 QNLVLSTFTNSR 507
NL+ S SR
Sbjct: 181 PNLLESVLNCSR 192
Score = 328 bits (844), Expect = e-103
Identities = 118/192 (61%), Positives = 149/192 (77%), Gaps = 1/192 (0%)
Query: 1762 SDTPLMMITEPEQVTQLVSELKQQQEIAIDLEYHNYRSYQGYTCLMQISTRDKDYIVDTL 1821
+TPL + E++ +LV +LK +EIA+DLE+H+YRSY G+TCLMQISTR++DYIVDTL
Sbjct: 1 DETPLTFVDTEEKLEELVEKLKNCKEIAVDLEHHSYRSYLGFTCLMQISTREEDYIVDTL 60
Query: 1822 KLREDLEVLNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDTHQACKFLPMPRQSL 1881
KLR+D+ +LNEV TD NI+KVFHGADSDI WLQ+DFGLYVV +FDT QA + L +PR SL
Sbjct: 61 KLRDDMHILNEVFTDPNILKVFHGADSDIIWLQRDFGLYVVNLFDTGQAARVLNLPRHSL 120
Query: 1882 AYLLKHYCDVDSDKTFQLFDWRHRPLPEPAIQYARTDTHYLLYVYDCMKLDL-SAAAHGK 1940
AYLL+ YC+VD+DK +QL DWR RPLPE I+YAR DTHYLLY+YD ++ +L A
Sbjct: 121 AYLLQKYCNVDADKKYQLADWRIRPLPEEMIKYAREDTHYLLYIYDRLRNELLERANALA 180
Query: 1941 QNLVLSTFTNSR 1952
NL+ S SR
Sbjct: 181 PNLLESVLNCSR 192
>gnl|CDD|223426 COG0349, Rnd, Ribonuclease D [Translation, ribosomal structure and
biogenesis].
Length = 361
Score = 194 bits (496), Expect = 2e-54
Identities = 86/287 (29%), Positives = 141/287 (49%), Gaps = 9/287 (3%)
Query: 324 ITEPEQVTQLVSELKQQQEIAIDLEYHNYRSYQGYTCLMQISTRDKDYIVDTLKLREDLE 383
IT + + + L+ + IAID E+ R+Y CL+QIS + ++D L DL
Sbjct: 1 ITTGDLLAAACALLRGSKAIAIDTEFMRLRTYYPRLCLIQISDGEGASLIDPLAGILDLP 60
Query: 384 VLNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDTHQACKFLPMP-RQSLAYLLKH 442
L +L D N+VK+FH A D++ L FGL +FDT A K LA L++
Sbjct: 61 PLVALLADPNVVKIFHAARFDLEVLLNLFGLLPTPLFDTQIAAKLAGFGTSHGLADLVEE 120
Query: 443 YCDVDSDKTFQLFDWRHRPLPEPAIQYARTDTHYLLYVYDCMKLDLSAAAHGKQNLVLST 502
V+ DK+ Q DW RPL E ++YA D YLL +YD + +L+ G+
Sbjct: 121 LLGVELDKSEQRSDWLARPLSEAQLEYAAADVEYLLPLYDKLTEELARE--GRLEWAEDE 178
Query: 503 FTNSRNICKLKYEKPVFNEEGYMNIFRSHALLNNQQKYALRELYKWRDRIARDKDESTGY 562
F + + V E+ + I +H L+ ++ LREL WR+R AR++D + +
Sbjct: 179 FR----LLPTRRTYKVLPEDAWREIKIAH-SLDPRELAVLRELAAWREREARERDLARNF 233
Query: 563 VLPNHMLLQMAQSIPRDIQGIFACCNPVPQTVKEHVLDIHAIILKAR 609
VL + L ++A+ P++++ +P+ + H + A++ A
Sbjct: 234 VLKDEALWELARYTPKNLKE-LDALGLIPKERRRHGKLLLALLANAL 279
Score = 155 bits (395), Expect = 5e-41
Identities = 62/182 (34%), Positives = 90/182 (49%), Gaps = 3/182 (1%)
Query: 1769 ITEPEQVTQLVSELKQQQEIAIDLEYHNYRSYQGYTCLMQISTRDKDYIVDTLKLREDLE 1828
IT + + + L+ + IAID E+ R+Y CL+QIS + ++D L DL
Sbjct: 1 ITTGDLLAAACALLRGSKAIAIDTEFMRLRTYYPRLCLIQISDGEGASLIDPLAGILDLP 60
Query: 1829 VLNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDTHQACKFLPMP-RQSLAYLLKH 1887
L +L D N+VK+FH A D++ L FGL +FDT A K LA L++
Sbjct: 61 PLVALLADPNVVKIFHAARFDLEVLLNLFGLLPTPLFDTQIAAKLAGFGTSHGLADLVEE 120
Query: 1888 YCDVDSDKTFQLFDWRHRPLPEPAIQYARTDTHYLLYVYDCMKLDLSAAAHGKQNLVLST 1947
V+ DK+ Q DW RPL E ++YA D YLL +YD + +L+ G+
Sbjct: 121 LLGVELDKSEQRSDWLARPLSEAQLEYAAADVEYLLPLYDKLTEELARE--GRLEWAEDE 178
Query: 1948 FT 1949
F
Sbjct: 179 FR 180
>gnl|CDD|176654 cd06142, RNaseD_exo, DEDDy 3'-5' exonuclease domain of Ribonuclease
D and similar proteins. Ribonuclease (RNase) D is a
bacterial enzyme involved in the maturation of small
stable RNAs and the 3' maturation of tRNA. It contains a
DEDDy-type DnaQ-like 3'-5' exonuclease domain possessing
three conserved sequence motifs termed ExoI, ExoII and
ExoIII, with a specific YX(3)D pattern at ExoIII. These
motifs are clustered around the active site and contain
four conserved acidic residues that serve as ligands for
the two metal ions required for catalysis. In vivo,
RNase D only becomes essential upon removal of other
ribonucleases. Eukaryotic RNase D homologs include yeast
Rrp6p, human PM/Scl-100, and the Drosophila melanogaster
egalitarian protein.
Length = 178
Score = 180 bits (458), Expect = 1e-51
Identities = 61/164 (37%), Positives = 90/164 (54%), Gaps = 2/164 (1%)
Query: 329 QVTQLVSELKQQQEIAIDLEYHNYRSYQGYTCLMQISTRDKDYIVDTLKLREDLEVLNEV 388
++ L L IA+D E+ +Y CL+QIST + Y++D L + DL L E+
Sbjct: 1 ELEDLCERLASAGVIAVDTEFMRLNTYYPRLCLIQISTGGEVYLIDPLAI-GDLSPLKEL 59
Query: 389 LTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDTHQACKFLPMPRQ-SLAYLLKHYCDVD 447
L D NIVKVFH A D++ L++DFG+ +FDT A + L + LA L++ V+
Sbjct: 60 LADPNIVKVFHAAREDLELLKRDFGILPQNLFDTQIAARLLGLGDSVGLAALVEELLGVE 119
Query: 448 SDKTFQLFDWRHRPLPEPAIQYARTDTHYLLYVYDCMKLDLSAA 491
DK Q DW RPL + ++YA D YLL +Y+ +K +L
Sbjct: 120 LDKGEQRSDWSKRPLTDEQLEYAALDVRYLLPLYEKLKEELEEE 163
Score = 180 bits (458), Expect = 1e-51
Identities = 61/164 (37%), Positives = 90/164 (54%), Gaps = 2/164 (1%)
Query: 1774 QVTQLVSELKQQQEIAIDLEYHNYRSYQGYTCLMQISTRDKDYIVDTLKLREDLEVLNEV 1833
++ L L IA+D E+ +Y CL+QIST + Y++D L + DL L E+
Sbjct: 1 ELEDLCERLASAGVIAVDTEFMRLNTYYPRLCLIQISTGGEVYLIDPLAI-GDLSPLKEL 59
Query: 1834 LTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDTHQACKFLPMPRQ-SLAYLLKHYCDVD 1892
L D NIVKVFH A D++ L++DFG+ +FDT A + L + LA L++ V+
Sbjct: 60 LADPNIVKVFHAAREDLELLKRDFGILPQNLFDTQIAARLLGLGDSVGLAALVEELLGVE 119
Query: 1893 SDKTFQLFDWRHRPLPEPAIQYARTDTHYLLYVYDCMKLDLSAA 1936
DK Q DW RPL + ++YA D YLL +Y+ +K +L
Sbjct: 120 LDKGEQRSDWSKRPLTDEQLEYAALDVRYLLPLYEKLKEELEEE 163
>gnl|CDD|216604 pfam01612, DNA_pol_A_exo1, 3'-5' exonuclease. This domain is
responsible for the 3'-5' exonuclease proofreading
activity of E. coli DNA polymerase I (polI) and other
enzymes, it catalyzes the hydrolysis of unpaired or
mismatched nucleotides. This domain consists of the
amino-terminal half of the Klenow fragment in E. coli
polI it is also found in the Werner syndrome helicase
(WRN), focus forming activity 1 protein (FFA-1) and
ribonuclease D (RNase D). Werner syndrome is a human
genetic disorder causing premature aging; the WRN
protein has helicase activity in the 3'-5' direction.
The FFA-1 protein is required for formation of a
replication foci and also has helicase activity; it is a
homologue of the WRN protein. RNase D is a 3'-5'
exonuclease involved in tRNA processing. Also found in
this family is the autoantigen PM/Scl thought to be
involved in polymyositis-scleroderma overlap syndrome.
Length = 172
Score = 174 bits (442), Expect = 1e-49
Identities = 67/169 (39%), Positives = 98/169 (57%), Gaps = 3/169 (1%)
Query: 323 MITEPEQVTQLVSELKQQQEIAIDLEYHNYRSYQGY--TCLMQISTRDKDYIVDTLKLRE 380
++T +++ L+ EL + +A+D E + +Y CL+QIST + YI+D L L +
Sbjct: 3 IVTTEDELEDLIEELLNAKVVAVDTETTSLDTYSYTLVGCLIQISTGEGAYIIDPLALGD 62
Query: 381 DLEVLNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDTHQACKFLPMPRQ-SLAYL 439
DLE L +L D NI KV H A D++ L +DFG+ + +FDT A L PR SL L
Sbjct: 63 DLEGLKRLLEDPNITKVGHNAKFDLEVLARDFGIKLENLFDTMLAAYLLGYPRSHSLDDL 122
Query: 440 LKHYCDVDSDKTFQLFDWRHRPLPEPAIQYARTDTHYLLYVYDCMKLDL 488
+ Y V+ DK+ Q DWR RPL E ++YA D YLL +YD ++ +L
Sbjct: 123 AEKYLGVELDKSEQCADWRARPLSEEQLRYAAEDADYLLRLYDKLRKEL 171
Score = 174 bits (442), Expect = 1e-49
Identities = 67/169 (39%), Positives = 98/169 (57%), Gaps = 3/169 (1%)
Query: 1768 MITEPEQVTQLVSELKQQQEIAIDLEYHNYRSYQGY--TCLMQISTRDKDYIVDTLKLRE 1825
++T +++ L+ EL + +A+D E + +Y CL+QIST + YI+D L L +
Sbjct: 3 IVTTEDELEDLIEELLNAKVVAVDTETTSLDTYSYTLVGCLIQISTGEGAYIIDPLALGD 62
Query: 1826 DLEVLNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDTHQACKFLPMPRQ-SLAYL 1884
DLE L +L D NI KV H A D++ L +DFG+ + +FDT A L PR SL L
Sbjct: 63 DLEGLKRLLEDPNITKVGHNAKFDLEVLARDFGIKLENLFDTMLAAYLLGYPRSHSLDDL 122
Query: 1885 LKHYCDVDSDKTFQLFDWRHRPLPEPAIQYARTDTHYLLYVYDCMKLDL 1933
+ Y V+ DK+ Q DWR RPL E ++YA D YLL +YD ++ +L
Sbjct: 123 AEKYLGVELDKSEQCADWRARPLSEEQLRYAAEDADYLLRLYDKLRKEL 171
>gnl|CDD|176650 cd06129, RNaseD_like, DEDDy 3'-5' exonuclease domain of RNase D,
WRN, and similar proteins. The RNase D-like group is
composed of RNase D, WRN, and similar proteins. They
contain a DEDDy-type, DnaQ-like, 3'-5' exonuclease
domain that contains three conserved sequence motifs
termed ExoI, ExoII and ExoIII, with a specific YX(3)D
pattern at ExoIII. These motifs are clustered around the
active site and contain four conserved acidic residues
that serve as ligands for the two metal ions required
for catalysis. RNase D is involved in the 3'-end
processing of tRNA precursors. RNase D-like proteins in
eukaryotes include yeast Rrp6p, human PM/Scl-100 and
Drosophila melanogaster egalitarian (Egl) protein. WRN
is a unique DNA helicase possessing exonuclease
activity. Mutation in the WRN gene is implicated in
Werner syndrome, a disease associated with premature
aging and increased predisposition to cancer. Yeast
Rrp6p and the human Polymyositis/scleroderma autoantigen
100kDa (PM/Scl-100) are exosome-associated proteins
involved in the degradation and processing of precursors
to stable RNAs. Egl is a component of an mRNA-binding
complex which is required for oocyte specification. The
Egl subfamily does not possess a completely conserved
YX(3)D pattern at the ExoIII motif.
Length = 161
Score = 155 bits (394), Expect = 2e-43
Identities = 52/146 (35%), Positives = 77/146 (52%), Gaps = 2/146 (1%)
Query: 342 EIAIDLEYHNYRSYQGYTCLMQISTRD-KDYIVDTLKLREDLEVLNEVLTDKNIVKVFHG 400
IA D+E+ R Y G L+Q+ + K Y+ D L L D + L +L + +IVK HG
Sbjct: 15 VIAFDMEWPPGRRYYGEVALIQLCVSEEKCYLFDPLSLSVDWQGLKMLLENPSIVKALHG 74
Query: 401 ADSDIKWLQKDFGLYVVGMFDTHQACKFLPMP-RQSLAYLLKHYCDVDSDKTFQLFDWRH 459
+ D+ L +DFG + +FDT A +P R SLA L++H+ DK+ DW +
Sbjct: 75 IEGDLWKLLRDFGEKLQRLFDTTIAANLKGLPERWSLASLVEHFLGKTLDKSISCADWSY 134
Query: 460 RPLPEPAIQYARTDTHYLLYVYDCMK 485
RPL E YA D + LL +Y ++
Sbjct: 135 RPLTEDQKLYAAADVYALLIIYTKLR 160
Score = 155 bits (394), Expect = 2e-43
Identities = 52/146 (35%), Positives = 77/146 (52%), Gaps = 2/146 (1%)
Query: 1787 EIAIDLEYHNYRSYQGYTCLMQISTRD-KDYIVDTLKLREDLEVLNEVLTDKNIVKVFHG 1845
IA D+E+ R Y G L+Q+ + K Y+ D L L D + L +L + +IVK HG
Sbjct: 15 VIAFDMEWPPGRRYYGEVALIQLCVSEEKCYLFDPLSLSVDWQGLKMLLENPSIVKALHG 74
Query: 1846 ADSDIKWLQKDFGLYVVGMFDTHQACKFLPMP-RQSLAYLLKHYCDVDSDKTFQLFDWRH 1904
+ D+ L +DFG + +FDT A +P R SLA L++H+ DK+ DW +
Sbjct: 75 IEGDLWKLLRDFGEKLQRLFDTTIAANLKGLPERWSLASLVEHFLGKTLDKSISCADWSY 134
Query: 1905 RPLPEPAIQYARTDTHYLLYVYDCMK 1930
RPL E YA D + LL +Y ++
Sbjct: 135 RPLTEDQKLYAAADVYALLIIYTKLR 160
>gnl|CDD|214681 smart00474, 35EXOc, 3'-5' exonuclease. 3\' -5' exonuclease
proofreading domain present in DNA polymerase I, Werner
syndrome helicase, RNase D and other enzymes.
Length = 172
Score = 138 bits (349), Expect = 3e-37
Identities = 63/171 (36%), Positives = 97/171 (56%), Gaps = 7/171 (4%)
Query: 323 MITEPEQVTQLVSELKQQQ-EIAIDLEYHNYRSYQGYTCLMQIS-TRDKDYIVDTLKLRE 380
++T+ E + +L+ +L+ E+A+D E SY G L+QIS T + +I+D L L +
Sbjct: 3 VVTDSETLEELLEKLRAAGGEVALDTETTGLDSYSGKLVLIQISVTGEGAFIIDPLALGD 62
Query: 381 DLEVLNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDTHQACKFLPMPRQS---LA 437
DLE+L ++L D+ I KV H A D+ L + FG+ + +FDT A +L + S LA
Sbjct: 63 DLEILKDLLEDETITKVGHNAKFDLHVLAR-FGIELENIFDTMLA-AYLLLGGPSKHGLA 120
Query: 438 YLLKHYCDVDSDKTFQLFDWRHRPLPEPAIQYARTDTHYLLYVYDCMKLDL 488
LL Y V+ DK Q DW RPL E ++YA D LL +Y+ ++ +L
Sbjct: 121 TLLLGYLGVELDKEEQKSDWGARPLSEEQLEYAAEDADALLRLYEKLEKEL 171
Score = 138 bits (349), Expect = 3e-37
Identities = 63/171 (36%), Positives = 97/171 (56%), Gaps = 7/171 (4%)
Query: 1768 MITEPEQVTQLVSELKQQQ-EIAIDLEYHNYRSYQGYTCLMQIS-TRDKDYIVDTLKLRE 1825
++T+ E + +L+ +L+ E+A+D E SY G L+QIS T + +I+D L L +
Sbjct: 3 VVTDSETLEELLEKLRAAGGEVALDTETTGLDSYSGKLVLIQISVTGEGAFIIDPLALGD 62
Query: 1826 DLEVLNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDTHQACKFLPMPRQS---LA 1882
DLE+L ++L D+ I KV H A D+ L + FG+ + +FDT A +L + S LA
Sbjct: 63 DLEILKDLLEDETITKVGHNAKFDLHVLAR-FGIELENIFDTMLA-AYLLLGGPSKHGLA 120
Query: 1883 YLLKHYCDVDSDKTFQLFDWRHRPLPEPAIQYARTDTHYLLYVYDCMKLDL 1933
LL Y V+ DK Q DW RPL E ++YA D LL +Y+ ++ +L
Sbjct: 121 TLLLGYLGVELDKEEQKSDWGARPLSEEQLEYAAEDADALLRLYEKLEKEL 171
>gnl|CDD|227602 COG5277, COG5277, Actin and related proteins [Cytoskeleton].
Length = 444
Score = 141 bits (357), Expect = 2e-35
Identities = 103/413 (24%), Positives = 171/413 (41%), Gaps = 51/413 (12%)
Query: 870 PIVFDNGAWCCRVGWASCEKPNLIFKNLIAKPRKERGKKDGETQ----VGNDISNIEAV- 924
IV DNG+ + G+A + P +F +++ + R E + + VGN+ N
Sbjct: 8 TIVIDNGSGTTKAGFAGNDTPTTVFPSIVGRRRDEDSVMEDTEEKDTYVGNEAQNDRDNS 67
Query: 925 RFQLKTQFDKNVVTHYDIQEQIFDYAFS---HLSINTEGNVNHPIVLTEPFLNPNYSRSL 981
+L+ + ++ ++D EQI+DY F L + E HP++LTEP LNP +R
Sbjct: 68 LLELRYPIENGIILNWDAMEQIWDYTFFNKGDLLPSPEE---HPLLLTEPPLNPPSNREK 124
Query: 982 MSELLFECYQVPSVCYGIDSLFSYQYNGWEGQSGVIISCGYQCTHVIPVINGCIDASKAK 1041
++ELLFE VP++ I ++ S +G ++G++I G THVIPV++G + K
Sbjct: 125 ITELLFETLNVPALYLAIQAVLSLYASGSSDETGLVIDSGDSVTHVIPVVDGIVLPKAVK 184
Query: 1042 RIDLGGFSVIHHLHKLLQLKYPSHINS------------ITPSRSEELL---WDYGFVAT 1086
RID+GG + +L KLL+ KYP I EE+ + +V+
Sbjct: 185 RIDIGGRDITDYLKKLLREKYPPSRGYNLKSELVEYSSEIVNEIKEEVCETDDESAYVSL 244
Query: 1087 DYREHLRKWLDAEFYDSNVVKVQL-PYAVPVPNLTTEQQKDRRKELAKKLVEMNAKKREE 1145
D E + + S QL ++ +E+ + K E
Sbjct: 245 DAEEEFEEEEEKPAEKSTESTFQLSKETSIAKE---SKELPDGEEI--EFGNEERFKAPE 299
Query: 1146 RLVDDERHLNELLELREIVELTPSDHSHAREAFKSMGINNIQDLNKSINQ-LQQKIEKTK 1204
L E ++ L E +I E + E + N+I L + + Q +Q E +
Sbjct: 300 ILFKPELPISGLEEAGKIDESKQELVAENYEISPTNLGNDIAGLPELVYQSIQICDEDVR 359
Query: 1205 AKIIAYNN-----GEDLTEEPKAKLSKEI-----------AVPESEAEFKAWL 1241
Y+N G +L KE+ +P + AWL
Sbjct: 360 KS--LYSNIVLTGGTSKIPGFAERLQKELTSLAPSIWKVSVIPPPDPSLDAWL 410
Score = 76.7 bits (189), Expect = 4e-14
Identities = 46/190 (24%), Positives = 82/190 (43%), Gaps = 2/190 (1%)
Query: 1282 ELARKEKRDDDFGMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQ 1341
E +E+ + + + K + A ++ + E + E + PE +
Sbjct: 248 EEFEEEEEKPAEKSTESTFQLSKETS-IAKESKELPDGEEIEFGNEERFKA-PEILFKPE 305
Query: 1342 EQELSPKEANQLHIGVERMCGPECLFQPSMLGSIQAGISETLNFVLNSYPQHIAQSLANN 1401
+EA ++ + + P+ LG+ AG+ E + + + + +SL +N
Sbjct: 306 LPISGLEEAGKIDESKQELVAENYEISPTNLGNDIAGLPELVYQSIQICDEDVRKSLYSN 365
Query: 1402 IFVTGSLCQLPGFVERLNKDLLENRPFQSHFSVSLAENPELDAWSGARQFSLSENFHDFA 1461
I +TG ++PGF ERL K+L P SV +P LDAW GA + E F
Sbjct: 366 IVLTGGTSKIPGFAERLQKELTSLAPSIWKVSVIPPPDPSLDAWLGASILASLETFQQLW 425
Query: 1462 VTQSDYQEKG 1471
+T+ +Y+E G
Sbjct: 426 ITKEEYEEHG 435
>gnl|CDD|214592 smart00268, ACTIN, Actin. ACTIN subfamily of
ACTIN/mreB/sugarkinase/Hsp70 superfamily.
Length = 373
Score = 131 bits (332), Expect = 1e-32
Identities = 70/271 (25%), Positives = 125/271 (46%), Gaps = 40/271 (14%)
Query: 870 PIVFDNGAWCCRVGWASCEKPNLIFKNLIAKPRKERGKK--DGETQVGNDISNIEAVRFQ 927
IV DNG+ + G+A + P ++F +++ +P+ +G + VG++ +
Sbjct: 3 AIVIDNGSGTIKAGFAGEDFPQVVFPSIVGRPKDGKGMVGDAKDIFVGDEAQEKRGG-LE 61
Query: 928 LKTQFDKNVVTHYDIQEQIFDYAF-SHLSINTEGNVNHPIVLTEPFLNPNYSRSLMSELL 986
LK + +V ++D E+I+DY F + L + E HP++LTEP +NP +R + E++
Sbjct: 62 LKYPIENGIVENWDDMEKIWDYTFFNELRVEPE---EHPVLLTEPPMNPKSNREKILEIM 118
Query: 987 FECYQVPSVCYGIDSLFSYQYNGWEGQSGVIISCGYQCTHVIPVINGCIDASKAKRIDLG 1046
FE + P++ I ++ S +G +G++I G THV+PV++G + KRID+
Sbjct: 119 FETFNFPALYIAIQAVLSLYASG--RTTGLVIDSGDGVTHVVPVVDGYVLPHAIKRIDIA 176
Query: 1047 GFSVIHHLHKLLQLKYPSHINSITPSRSEELLWDYGFVATDYREHLRKWLDAEFYDSNVV 1106
G + +L +LL + NS AEF +
Sbjct: 177 GRDITDYLKELLS-ERGYQFNSS----------------------------AEFEIVREI 207
Query: 1107 KVQLPYAVPVPNLTTEQQKDRRKELAKKLVE 1137
K +L Y V + E + R + KL +
Sbjct: 208 KEKLCY-VA-EDFEKEMKLARESSESSKLEK 236
Score = 86.9 bits (216), Expect = 1e-17
Identities = 39/146 (26%), Positives = 64/146 (43%)
Query: 1326 EEILRQHDPEFTSLNQEQELSPKEANQLHIGVERMCGPECLFQPSMLGSIQAGISETLNF 1385
E+ ++ S E+ + N + +G ER PE LF P ++G Q GI E +
Sbjct: 219 EKEMKLARESSESSKLEKTYELPDGNTIKVGNERFRIPEILFSPELIGLEQKGIHELVYE 278
Query: 1386 VLNSYPQHIAQSLANNIFVTGSLCQLPGFVERLNKDLLENRPFQSHFSVSLAENPELDAW 1445
+ + + L NI ++G +PGF ERL K+L + P + V + W
Sbjct: 279 SIQKCDIDVRKDLYENIVLSGGSTLIPGFGERLEKELKQLAPKKLKVKVIAPPERKYSVW 338
Query: 1446 SGARQFSLSENFHDFAVTQSDYQEKG 1471
G + F D +T+ +Y+E G
Sbjct: 339 LGGSILASLSTFEDMWITKKEYEESG 364
>gnl|CDD|130455 TIGR01388, rnd, ribonuclease D. This model describes ribonuclease
D, a 3'-exonuclease shown to act on tRNA both in vitro
and when overexpressed in vivo. Trusted members of this
family are restricted to the Proteobacteria; Aquifex,
Mycobacterial, and eukaryotic homologs are not
full-length homologs. Ribonuclease D is not essential in
E. coli and is deleterious when overexpressed. Its
precise biological role is still unknown [Transcription,
RNA processing].
Length = 367
Score = 123 bits (311), Expect = 4e-30
Identities = 79/307 (25%), Positives = 140/307 (45%), Gaps = 15/307 (4%)
Query: 323 MITEPEQVTQLVSELKQQQEIAIDLEYHNYRSYQGYTCLMQISTRDKDYIVDTLKLREDL 382
IT +++ + ++ +A+D E+ R++ L+Q++ ++ ++D L + D
Sbjct: 1 WITTDDELATVCEAVRTFPFVALDTEFVRERTFWPQLGLIQVADGEQLALIDPLVI-IDW 59
Query: 383 EVLNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDTHQACKFLPMPRQ-SLAYLLK 441
L E+L D+++VKV H A D++ FG +FDT A F A L++
Sbjct: 60 SPLKELLRDESVVKVLHAASEDLEVFLNLFGELPQPLFDTQIAAAFCGFGMSMGYAKLVQ 119
Query: 442 HYCDVDSDKTFQLFDWRHRPLPEPAIQYARTDTHYLLYVYDCMKLDLSAAAHGKQNLVLS 501
V+ DK+ DW RPL + ++YA D YLL +Y KL G+ +
Sbjct: 120 EVLGVELDKSESRTDWLARPLTDAQLEYAAADVTYLLPLYA--KLMERLEESGRLAWLEE 177
Query: 502 TFTNSRNICKLKYEKPVFNEEGYMNIFRSHALLNNQQKYALRELYKWRDRIARDKDESTG 561
T + + V E+ + +I ++ L QQ L+ L WR+R AR++D
Sbjct: 178 ECTLLTD----RRTYVVNPEDAWRDI-KNAWQLRPQQLAVLQALAAWREREARERDLPRN 232
Query: 562 YVLPNHMLLQMAQSIPRDIQGIFACCNPVPQTVKEHVLDIHAIILKARLQSLTKPVEKL- 620
+VL L ++A+ P ++ A P +++H + A++ A L P + L
Sbjct: 233 FVLKEEALWELARQAPGNL-TELASLGPKGSEIRKHGDTLLALVKTA----LALPEDALP 287
Query: 621 QPSLDGM 627
Q L+ M
Sbjct: 288 QAPLNLM 294
Score = 94.1 bits (234), Expect = 4e-20
Identities = 46/170 (27%), Positives = 82/170 (48%), Gaps = 2/170 (1%)
Query: 1768 MITEPEQVTQLVSELKQQQEIAIDLEYHNYRSYQGYTCLMQISTRDKDYIVDTLKLREDL 1827
IT +++ + ++ +A+D E+ R++ L+Q++ ++ ++D L + D
Sbjct: 1 WITTDDELATVCEAVRTFPFVALDTEFVRERTFWPQLGLIQVADGEQLALIDPLVI-IDW 59
Query: 1828 EVLNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDTHQACKFLPMPRQ-SLAYLLK 1886
L E+L D+++VKV H A D++ FG +FDT A F A L++
Sbjct: 60 SPLKELLRDESVVKVLHAASEDLEVFLNLFGELPQPLFDTQIAAAFCGFGMSMGYAKLVQ 119
Query: 1887 HYCDVDSDKTFQLFDWRHRPLPEPAIQYARTDTHYLLYVYDCMKLDLSAA 1936
V+ DK+ DW RPL + ++YA D YLL +Y + L +
Sbjct: 120 EVLGVELDKSESRTDWLARPLTDAQLEYAAADVTYLLPLYAKLMERLEES 169
>gnl|CDD|200935 pfam00022, Actin, Actin.
Length = 367
Score = 106 bits (267), Expect = 3e-24
Identities = 51/203 (25%), Positives = 99/203 (48%), Gaps = 23/203 (11%)
Query: 870 PIVFDNGAWCCRVGWASCEKPNLIFKNLIAKPRKERGKKD---GETQVGNDISNIEAVRF 926
+V DNG+ + G+A + P + +++ +PR G+ + + +E
Sbjct: 5 ALVIDNGSGTTKAGFAGEDAPRAVIPSVVGRPRGRGVMVKYYVGDEAL-SKRPGLE---- 59
Query: 927 QLKTQFDKNVVTHYDIQEQIFDYAF-SHLSINTEGNVNHPIVLTEPFLNPNYSRSLMSEL 985
++ + +V ++D E+I+++ F L ++ E HP++LTEP LNP +R +E+
Sbjct: 60 -VRYPIEDGIVENWDAMEKIWEHTFFEELRVDPE---EHPLLLTEPPLNPPANREKATEI 115
Query: 986 LFECYQVPSVCYGID---SLFSYQYNGWEGQS-GVIISCGYQCTHVIPVINGCIDASKAK 1041
+FE + VP++ S ++ G++ G+++ G T V+PV G + +
Sbjct: 116 MFETFGVPALYLAKQAVLSAYAS------GRTTGLVVDSGAGVTSVVPVYEGYVLQKAIR 169
Query: 1042 RIDLGGFSVIHHLHKLLQLKYPS 1064
R DL G + +L KLL + S
Sbjct: 170 RSDLAGDDLTDYLRKLLSSRTYS 192
Score = 85.0 bits (211), Expect = 5e-17
Identities = 41/118 (34%), Positives = 57/118 (48%), Gaps = 1/118 (0%)
Query: 1355 IGVERMCGPECLFQPSMLGSIQAGISETLNFVLNSYPQHIAQSLANNIFVTGSLCQLPGF 1414
+G ER PE LF PS++GS AGI E + +N+ + SL NI VTG PGF
Sbjct: 241 LGNERFRVPEILFNPSLIGSESAGIPELIYDSINACDVDLRPSLLANIVVTGGTTLFPGF 300
Query: 1415 VERLNKDLLENRPFQSHFSVSLAEN-PELDAWSGARQFSLSENFHDFAVTQSDYQEKG 1471
ERL K+L + P + N + AW G + F V++ +Y+E G
Sbjct: 301 TERLEKELAQLAPSGVKVKIIAPPNERKYSAWIGGSILASLGTFQQMWVSKQEYEEHG 358
>gnl|CDD|240225 PTZ00004, PTZ00004, actin-2; Provisional.
Length = 378
Score = 100 bits (251), Expect = 4e-22
Identities = 59/249 (23%), Positives = 121/249 (48%), Gaps = 20/249 (8%)
Query: 871 IVFDNGAWCCRVGWASCEKPNLIFKNLIAKPRKER---GKKDGETQVGNDISNIEAVRFQ 927
V DNG+ + G+A + P +F +++ +P+ G ++ + VG++ + +
Sbjct: 9 AVVDNGSGMVKAGFAGDDAPRCVFPSIVGRPKNPGIMVGMEEKDCYVGDEAQDKRGI-LT 67
Query: 928 LKTQFDKNVVTHYDIQEQIFDYAF-SHLSINTEGNVNHPIVLTEPFLNPNYSRSLMSELL 986
LK + +VT++D E+I+ + F + L + E HP++LTE LNP +R M++++
Sbjct: 68 LKYPIEHGIVTNWDDMEKIWHHTFYNELRVAPE---EHPVLLTEAPLNPKANREKMTQIM 124
Query: 987 FECYQVPSVCYGIDSLFSYQYNGWEGQSGVIISCGYQCTHVIPVINGCIDASKAKRIDLG 1046
FE + VP++ I ++ S +G +G+++ G +H +P+ G R+D+
Sbjct: 125 FETHNVPAMYVAIQAVLSLYASGR--TTGIVLDSGDGVSHTVPIYEGYSLPHAIHRLDVA 182
Query: 1047 GFSVIHHLHKLLQLKYPSHINSITPSRSEELLWD----YGFVATDYREHLRKWLDAEFYD 1102
G + ++ K+L + T + +E++ D ++A D+ E + A D
Sbjct: 183 GRDLTEYMMKIL----HERGTTFTTTAEKEIVRDIKEKLCYIALDFDEEMGN--SAGSSD 236
Query: 1103 SNVVKVQLP 1111
+LP
Sbjct: 237 KYEESYELP 245
Score = 60.9 bits (148), Expect = 3e-09
Identities = 37/122 (30%), Positives = 50/122 (40%), Gaps = 1/122 (0%)
Query: 1351 NQLHIGVERMCGPECLFQPSMLGSIQA-GISETLNFVLNSYPQHIAQSLANNIFVTGSLC 1409
+ +G ER PE LFQPS++G + GI E +N I + L NI ++G
Sbjct: 248 TIITVGSERFRCPEALFQPSLIGKEEPPGIHELTFQSINKCDIDIRKDLYGNIVLSGGTT 307
Query: 1410 QLPGFVERLNKDLLENRPFQSHFSVSLAENPELDAWSGARQFSLSENFHDFAVTQSDYQE 1469
G ERL K+L P V + W G S F VT+ +Y E
Sbjct: 308 MYRGLPERLTKELTTLAPSTMKIKVVAPPERKYSVWIGGSILSSLPTFQQMWVTKEEYDE 367
Query: 1470 KG 1471
G
Sbjct: 368 SG 369
>gnl|CDD|145428 pfam02268, TFIIA_gamma_N, Transcription initiation factor IIA,
gamma subunit, helical domain. Accurate transcription
in vivo requires at least six general transcription
initiation factors, in addition to RNA polymerase II.
Transcription initiation factor IIA (TFIIA) is a
multimeric protein which facilitates the binding of
TFIID to the TATA box. The N-terminal domain of the
gamma subunit is a 4 helix bundle.
Length = 49
Score = 90.4 bits (225), Expect = 5e-22
Identities = 34/49 (69%), Positives = 38/49 (77%)
Query: 1 MSYQLYRNTTLGNTLQESLDELIQYGTITPTLAMKVLLQFDKSINGALP 49
M Y+LYR +TLG L ++LDELI G ITP LAMKVLLQFDKSIN AL
Sbjct: 1 MYYELYRRSTLGMALTDALDELISDGRITPQLAMKVLLQFDKSINEALA 49
>gnl|CDD|199901 cd10145, TFIIA_gamma_N, Gamma subunit of transcription initiation
factor IIA, N-terminal helical domain. Transcription
factor II A (TFIIA) is one of the general transcription
factors for RNA polymerase II. TFIIA increases the
affinity of the TATA-binding protein (TBP) for DNA, in
order to assemble the initiation complex. TFIIA also
functions as an activator during development and
differentiation, and is involved in transcription from
TATA-less promoters. TFIIA is composed of more than one
subunit in various organisms. Mammalian TFIIA large
subunits (TFIIA alpha and beta), and the smaller
subunit (TFIIA gamma) form a heterotrimer. TFIIA alpha
and beta are encoded by a single TFIIA_alpha_beta gene
and post-translationally processed and cleaved. TOA1
and TOA2 are the two subunits of Yeast TFIIA which
correspond to Mammalian TFIIA_alpha_beta and TFIIA
gamma, respectively. TOA1 and TOA2 form a heterodimeric
protein complex. The TFIIA gamma subunit is highly
conserved between humans, Drosophila and yeast and it
is required for TFIIA function. The N-terminal domain
of the gamma subunit forms a 4-helix bundle together
with the alpha subunit.
Length = 49
Score = 87.6 bits (218), Expect = 5e-21
Identities = 37/48 (77%), Positives = 40/48 (83%)
Query: 2 SYQLYRNTTLGNTLQESLDELIQYGTITPTLAMKVLLQFDKSINGALP 49
Y+LYRNTTLG LQ++LDELIQ G ITP LAMKVLLQFDKSIN AL
Sbjct: 1 YYELYRNTTLGIALQDALDELIQDGQITPQLAMKVLLQFDKSINEALA 48
>gnl|CDD|190407 pfam02751, TFIIA_gamma_C, Transcription initiation factor IIA, gamma
subunit. Accurate transcription in vivo requires at
least six general transcription initiation factors, in
addition to RNA polymerase II. Transcription initiation
factor IIA (TFIIA) is a multimeric protein which
facilitates the binding of TFIID to the TATA box. The
C-terminal domain of the gamma subunit is a 12 stranded
beta-barrel.
Length = 51
Score = 86.2 bits (214), Expect = 1e-20
Identities = 31/51 (60%), Positives = 40/51 (78%), Gaps = 2/51 (3%)
Query: 1484 KVKSRLTFKSGKLNTYRFCDNVWTFVLTDVEFRE-VAEIARVNKLKIVACD 1533
KVKS+LTFK G L+TYRFCDNVWTF++ + +F+ E V+K+KIVACD
Sbjct: 1 KVKSKLTFK-GHLDTYRFCDNVWTFIVKNAQFKLDDQETVTVDKVKIVACD 50
>gnl|CDD|240426 PTZ00466, PTZ00466, actin-like protein; Provisional.
Length = 380
Score = 91.5 bits (227), Expect = 4e-19
Identities = 59/206 (28%), Positives = 106/206 (51%), Gaps = 14/206 (6%)
Query: 859 EYLDELRDSSLPIVFDNGAWCCRVGWASCEKPNLIFKNLIAKPRKER---GKKDGETQVG 915
EY +L S+ PI+ DNG + G+A + PNL+F + + +P+ +R G +G VG
Sbjct: 4 EYAKQLY-SNQPIIIDNGTGYIKAGFAGEDVPNLVFPSYVGRPKYKRVMAGAVEGNIFVG 62
Query: 916 NDISNIEAVRFQLKTQFDKN--VVTHYDIQEQIFDYAFSHLSINTEGNVNHPIVLTEPFL 973
N E R LK + N ++ +++ E I+ + ++ + IN+E HP++LTE L
Sbjct: 63 N---KAEEYRGLLKVTYPINHGIIENWNDMENIWIHVYNSMKINSE---EHPVLLTEAPL 116
Query: 974 NPNYSRSLMSELLFECYQVPSVCYGIDSLFSYQYNGWEGQSGVIISCGYQCTHVIPVING 1033
NP ++ ++E+ FE + VP++ I ++ S G +G ++ CG H + + G
Sbjct: 117 NPQKNKEKIAEVFFETFNVPALFISIQAILSLYSCG--KTNGTVLDCGDGVCHCVSIYEG 174
Query: 1034 CIDASKAKRIDLGGFSVIHHLHKLLQ 1059
+ R D+ G + +L LL+
Sbjct: 175 YSITNTITRTDVAGRDITTYLGYLLR 200
Score = 46.9 bits (111), Expect = 7e-05
Identities = 29/123 (23%), Positives = 56/123 (45%)
Query: 1349 EANQLHIGVERMCGPECLFQPSMLGSIQAGISETLNFVLNSYPQHIAQSLANNIFVTGSL 1408
+ +Q+ IG ER PE LF PS+LG G+SE + + + ++L ++I ++G
Sbjct: 249 DGSQILIGSERYRAPEVLFNPSILGLEYLGLSELIVTSITRADMDLRRTLYSHIVLSGGT 308
Query: 1409 CQLPGFVERLNKDLLENRPFQSHFSVSLAENPELDAWSGARQFSLSENFHDFAVTQSDYQ 1468
GF +RL ++ + P +S + + G + F +++ ++
Sbjct: 309 TMFHGFGDRLLNEIRKFAPKDITIRISAPPERKFSTFIGGSILASLATFKKIWISKQEFD 368
Query: 1469 EKG 1471
E G
Sbjct: 369 EYG 371
>gnl|CDD|173506 PTZ00281, PTZ00281, actin; Provisional.
Length = 376
Score = 91.3 bits (226), Expect = 4e-19
Identities = 57/229 (24%), Positives = 115/229 (50%), Gaps = 14/229 (6%)
Query: 871 IVFDNGAWCCRVGWASCEKPNLIFKNLIAKPRKE-----RGKKDGETQVGNDISNIEAVR 925
+V DNG+ C+ G+A + P +F +++ +PR G+KD + VG++ + +
Sbjct: 9 LVIDNGSGMCKAGFAGDDAPRAVFPSIVGRPRHTGVMVGMGQKD--SYVGDEAQSKRGI- 65
Query: 926 FQLKTQFDKNVVTHYDIQEQIFDYAF-SHLSINTEGNVNHPIVLTEPFLNPNYSRSLMSE 984
LK + +VT++D E+I+ + F + L + E HP++LTE LNP +R M++
Sbjct: 66 LTLKYPIEHGIVTNWDDMEKIWHHTFYNELRVAPE---EHPVLLTEAPLNPKANREKMTQ 122
Query: 985 LLFECYQVPSVCYGIDSLFSYQYNGWEGQSGVIISCGYQCTHVIPVINGCIDASKAKRID 1044
++FE + P++ I ++ S +G +G+++ G +H +P+ G R+D
Sbjct: 123 IMFETFNTPAMYVAIQAVLSLYASG--RTTGIVMDSGDGVSHTVPIYEGYALPHAILRLD 180
Query: 1045 LGGFSVIHHLHKLLQLKYPSHINSITPSRSEELLWDYGFVATDYREHLR 1093
L G + ++ K+L + S + ++ +VA D+ ++
Sbjct: 181 LAGRDLTDYMMKILTERGYSFTTTAEREIVRDIKEKLAYVALDFEAEMQ 229
Score = 60.9 bits (147), Expect = 3e-09
Identities = 41/137 (29%), Positives = 61/137 (44%), Gaps = 6/137 (4%)
Query: 1337 TSLNQEQELSPKEANQLHIGVERMCGPECLFQPSMLGSIQAGISETLNFVLNSYPQHIAQ 1396
++L + EL + + IG ER PE LFQPS LG AGI ET + I +
Sbjct: 235 SALEKSYELP--DGQVITIGNERFRCPEALFQPSFLGMESAGIHETTYNSIMKCDVDIRK 292
Query: 1397 SLANNIFVTGSLCQLPGFVERLNKDLLENRPFQSHFSVSLAENPE--LDAWSGARQFSLS 1454
L N+ ++G PG +R+NK+L P S + + PE W G +
Sbjct: 293 DLYGNVVLSGGTTMFPGIADRMNKELTALAP--STMKIKIIAPPERKYSVWIGGSILASL 350
Query: 1455 ENFHDFAVTQSDYQEKG 1471
F +++ +Y E G
Sbjct: 351 STFQQMWISKEEYDESG 367
>gnl|CDD|199900 cd10014, TFIIA_gamma_C, Gamma subunit of transcription initiation
factor IIA, C-terminal domain. Transcription factor II A
(TFIIA) is one of the general transcription factors for
RNA polymerase II. TFIIA increases the affinity of the
TATA-binding protein (TBP) for DNA, in order to assemble
the initiation complex. TFIIA also functions as an
activator during development and differentiation, and is
involved in transcription from TATA-less promoters. TFIIA
is composed of more than one subunit in various
organisms. Mammalian TFIIA large subunits (TFIIA alpha
and beta), and the smaller subunit (TFIIA gamma) form a
heterotrimer. TFIIA alpha and beta are encoded by a
single TFIIA_alpha_beta gene and post-translationally
processed and cleaved. TOA1 and TOA2 are the two subunits
of Yeast TFIIA which correspond to Mammalian
TFIIA_alpha_beta and TFIIA gamma, respectively. TOA1 and
TOA2 form a heterodimeric protein complex. The TFIIA
gamma subunit is highly conserved between humans,
Drosophila and yeast and it is required for TFIIA
function. The C-terminal domain of the gamma
(TFIIA_gamma_C) subunit forms a beta-barrel structure
together with TFIIA beta.
Length = 47
Score = 80.0 bits (198), Expect = 2e-18
Identities = 32/46 (69%), Positives = 37/46 (80%), Gaps = 2/46 (4%)
Query: 1489 LTFKSGKLNTYRFCDNVWTFVLTDVEFREV-AEIARVNKLKIVACD 1533
LTFK G L TYRFCDNVWTF+L DVEF+E E+ V+K+KIVACD
Sbjct: 1 LTFK-GHLKTYRFCDNVWTFILNDVEFKESNTEVVTVDKVKIVACD 45
>gnl|CDD|236771 PRK10829, PRK10829, ribonuclease D; Provisional.
Length = 373
Score = 84.7 bits (210), Expect = 5e-17
Identities = 71/259 (27%), Positives = 112/259 (43%), Gaps = 15/259 (5%)
Query: 323 MITEPEQVTQLVSELKQQQEIAIDLEYHNYRSYQGYTCLMQISTRDKDYIVDTLKLREDL 382
MIT + + + + IA+D E+ R+Y L+Q+ ++ ++D L + D
Sbjct: 5 MITTDDALASVCEAARAFPAIALDTEFVRTRTYYPQLGLIQLYDGEQLSLIDPLGIT-DW 63
Query: 383 EVLNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDTHQACKFLPMPRQS-LAYLLK 441
+L D + K H D++ FG + DT F P A +++
Sbjct: 64 SPFKALLRDPQVTKFLHAGSEDLEVFLNAFGELPQPLIDTQILAAFCGRPLSCGFASMVE 123
Query: 442 HYCDVDSDKTFQLFDWRHRPLPEPAIQYARTDTHYLLYVYDCMKLDLSAAAHGKQNLVLS 501
Y V DK+ DW RPL E +YA D YLL + KL A G L
Sbjct: 124 EYTGVTLDKSESRTDWLARPLSERQCEYAAADVFYLLPIAA--KLMAETEAAGWLPAALD 181
Query: 502 TFTNSRNICKLKYE--KPVFNEEGYMNIFRSHAL-LNNQQKYALRELYKWRDRIARDKDE 558
R +C+ + E P EE Y +I ++A L +Q L+ L WR R AR++D
Sbjct: 182 ---ECRLLCQRRQEVLAP---EEAYRDI--TNAWQLRTRQLACLQLLADWRLRKARERDL 233
Query: 559 STGYVLPNHMLLQMAQSIP 577
+ +V+ L Q+A+ +P
Sbjct: 234 AVNFVVREEHLWQVARYMP 252
Score = 69.3 bits (170), Expect = 6e-12
Identities = 42/157 (26%), Positives = 66/157 (42%), Gaps = 2/157 (1%)
Query: 1768 MITEPEQVTQLVSELKQQQEIAIDLEYHNYRSYQGYTCLMQISTRDKDYIVDTLKLREDL 1827
MIT + + + + IA+D E+ R+Y L+Q+ ++ ++D L + D
Sbjct: 5 MITTDDALASVCEAARAFPAIALDTEFVRTRTYYPQLGLIQLYDGEQLSLIDPLGIT-DW 63
Query: 1828 EVLNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDTHQACKFLPMPRQS-LAYLLK 1886
+L D + K H D++ FG + DT F P A +++
Sbjct: 64 SPFKALLRDPQVTKFLHAGSEDLEVFLNAFGELPQPLIDTQILAAFCGRPLSCGFASMVE 123
Query: 1887 HYCDVDSDKTFQLFDWRHRPLPEPAIQYARTDTHYLL 1923
Y V DK+ DW RPL E +YA D YLL
Sbjct: 124 EYTGVTLDKSESRTDWLARPLSERQCEYAAADVFYLL 160
>gnl|CDD|99851 cd06148, Egl_like_exo, DEDDy 3'-5' exonuclease domain of Drosophila
Egalitarian (Egl) and similar proteins. The Egalitarian
(Egl) protein subfamily is composed of Drosophila Egl
and similar proteins. Egl is a component of an
mRNA-binding complex which is required for oocyte
specification. Egl contains a DEDDy-type DnaQ-like 3'-5'
exonuclease domain possessing three conserved sequence
motifs termed ExoI, ExoII and ExoIII, with a specific
YX(3)D pattern at ExoIII. The motifs are clustered
around the active site and contain four conserved acidic
residues that serve as ligands for the two metal ions
required for catalysis. The conservation of this
subfamily throughout eukaryotes suggests that its
members may be part of ancient RNA processing complexes
that are likely to participate in the regulated
processing of specific mRNAs. Some members of this
subfamily do not have a completely conserved YX(3)D
pattern at the ExoIII motif.
Length = 197
Score = 76.2 bits (188), Expect = 3e-15
Identities = 56/181 (30%), Positives = 76/181 (41%), Gaps = 29/181 (16%)
Query: 332 QLVSELKQQQEIAIDLEYHNYRSYQGYTCLMQISTRDKD-YIVDTLKLREDLEVLN---- 386
+ + LK+Q+ I +D E N +G CL+QI+TR Y+ D LKL V
Sbjct: 2 EAIIHLKKQKVIGLDCEGVNL-GRKGKLCLVQIATRTGQIYLFDILKL--GSIVFINGLK 58
Query: 387 EVLTDKNIVKVFHGA--DSDIKWLQKDFGLYVVGMFDTHQA---------CKFLPMPRQS 435
++L K I+KV H DSD L +G+ + +FDT A F P S
Sbjct: 59 DILESKKILKVIHDCRRDSDA--LYHQYGIKLNNVFDTQVADALLQEQETGGFNPDRVIS 116
Query: 436 LAYLLKHYCDVDSDKTFQLFD--------WRHRPLPEPAIQYARTDTHYLLYVYDCMKLD 487
L LL Y + + W RPL E I+YA D LL +Y M
Sbjct: 117 LVQLLDKYLYISISLKEDVKKLMREDPKFWALRPLTEDMIRYAALDVLCLLPLYYAMLDA 176
Query: 488 L 488
L
Sbjct: 177 L 177
Score = 76.2 bits (188), Expect = 3e-15
Identities = 56/181 (30%), Positives = 76/181 (41%), Gaps = 29/181 (16%)
Query: 1777 QLVSELKQQQEIAIDLEYHNYRSYQGYTCLMQISTRDKD-YIVDTLKLREDLEVLN---- 1831
+ + LK+Q+ I +D E N +G CL+QI+TR Y+ D LKL V
Sbjct: 2 EAIIHLKKQKVIGLDCEGVNL-GRKGKLCLVQIATRTGQIYLFDILKL--GSIVFINGLK 58
Query: 1832 EVLTDKNIVKVFHGA--DSDIKWLQKDFGLYVVGMFDTHQA---------CKFLPMPRQS 1880
++L K I+KV H DSD L +G+ + +FDT A F P S
Sbjct: 59 DILESKKILKVIHDCRRDSDA--LYHQYGIKLNNVFDTQVADALLQEQETGGFNPDRVIS 116
Query: 1881 LAYLLKHYCDVDSDKTFQLFD--------WRHRPLPEPAIQYARTDTHYLLYVYDCMKLD 1932
L LL Y + + W RPL E I+YA D LL +Y M
Sbjct: 117 LVQLLDKYLYISISLKEDVKKLMREDPKFWALRPLTEDMIRYAALDVLCLLPLYYAMLDA 176
Query: 1933 L 1933
L
Sbjct: 177 L 177
>gnl|CDD|227452 COG5123, TOA2, Transcription initiation factor IIA, gamma subunit
[Transcription].
Length = 113
Score = 71.1 bits (174), Expect = 2e-14
Identities = 35/67 (52%), Positives = 45/67 (67%), Gaps = 1/67 (1%)
Query: 3 YQLYRNTTLGNTLQESLDELIQYGTITPTLAMKVLLQFDKSINGALPSKVKSRLTFKSGK 62
Y+LYR + +G L+++LDELI G I+P LAM VL FDKS+ L V S LTFK GK
Sbjct: 5 YELYRRSMIGKVLEDALDELISAGVISPNLAMHVLETFDKSVPEVLNKSVTSSLTFK-GK 63
Query: 63 LNTFLFI 69
L+T+ F
Sbjct: 64 LDTYNFC 70
Score = 53.8 bits (129), Expect = 2e-08
Identities = 26/58 (44%), Positives = 32/58 (55%), Gaps = 8/58 (13%)
Query: 1482 NNKVKSRLTFKSGKLNTYRFCDNVWTFVLT------DVEFREVAEIA-RVNKLKIVAC 1532
N V S LTFK GKL+TY FCD VWTF+ + E+ E+ K +IVAC
Sbjct: 51 NKSVTSSLTFK-GKLDTYNFCDGVWTFITKNFVMSLNNEYFNAGEVNIVACKARIVAC 107
>gnl|CDD|176653 cd06141, WRN_exo, DEDDy 3'-5' exonuclease domain of WRN and similar
proteins. WRN is a unique RecQ DNA helicase exhibiting
an exonuclease activity. It contains a DEDDy-type
DnaQ-like 3'-5' exonuclease domain possessing three
conserved sequence motifs termed ExoI, ExoII and ExoIII,
with a specific YX(3)D pattern at ExoIII. These motifs
are clustered around the active site and contain four
conserved acidic residues that serve as ligands for the
two metal ions required for catalysis. Mutations in the
WRN gene cause Werner syndrome, an autosomal recessive
disorder associated with premature aging and increased
susceptibility to cancer and type II diabetes. WRN
interacts with key proteins involved in DNA replication,
recombination, and repair. It is believed to maintain
genomic stability and life span by participating in DNA
processes. WRN is stimulated by Ku70/80, an important
regulator of genomic stability.
Length = 170
Score = 73.0 bits (180), Expect = 2e-14
Identities = 41/169 (24%), Positives = 75/169 (44%), Gaps = 7/169 (4%)
Query: 324 ITEPEQVTQLVSELKQQQE-IAIDLEY-HNYRSYQGY-TCLMQISTRDKDYIVDTLKLRE 380
+ + V EL +++ + D E+ ++R + L+Q++T + + + +
Sbjct: 1 TDSAQDAEEAVKELLGKEKVVGFDTEWRPSFRKGKRNKVALLQLATESRCLLFQLAHMDK 60
Query: 381 DLEVLNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDTHQACK--FLPMPRQSLAY 438
L ++L D +I+KV G D + L +DFG+ V G+ D K SLA
Sbjct: 61 LPPSLKQLLEDPSILKVGVGIKGDARKLARDFGIEVRGVVDLSHLAKRVGPRRKLVSLAR 120
Query: 439 LLKHY--CDVDSDKTFQLFDWRHRPLPEPAIQYARTDTHYLLYVYDCMK 485
L++ + K + +W RPL + I YA TD + L +Y +
Sbjct: 121 LVEEVLGLPLSKPKKVRCSNWEARPLSKEQILYAATDAYASLELYRKLL 169
Score = 73.0 bits (180), Expect = 2e-14
Identities = 41/169 (24%), Positives = 75/169 (44%), Gaps = 7/169 (4%)
Query: 1769 ITEPEQVTQLVSELKQQQE-IAIDLEY-HNYRSYQGY-TCLMQISTRDKDYIVDTLKLRE 1825
+ + V EL +++ + D E+ ++R + L+Q++T + + + +
Sbjct: 1 TDSAQDAEEAVKELLGKEKVVGFDTEWRPSFRKGKRNKVALLQLATESRCLLFQLAHMDK 60
Query: 1826 DLEVLNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDTHQACK--FLPMPRQSLAY 1883
L ++L D +I+KV G D + L +DFG+ V G+ D K SLA
Sbjct: 61 LPPSLKQLLEDPSILKVGVGIKGDARKLARDFGIEVRGVVDLSHLAKRVGPRRKLVSLAR 120
Query: 1884 LLKHY--CDVDSDKTFQLFDWRHRPLPEPAIQYARTDTHYLLYVYDCMK 1930
L++ + K + +W RPL + I YA TD + L +Y +
Sbjct: 121 LVEEVLGLPLSKPKKVRCSNWEARPLSKEQILYAATDAYASLELYRKLL 169
>gnl|CDD|201312 pfam00570, HRDC, HRDC domain. The HRDC (Helicase and RNase D
C-terminal) domain has a putative role in nucleic acid
binding. Mutations in the HRDC domain cause human
disease. It is interesting to note that the RecQ
helicase in Deinococcus radiodurans has three tandem
HRDC domains.
Length = 68
Score = 69.1 bits (170), Expect = 3e-14
Identities = 18/68 (26%), Positives = 37/68 (54%)
Query: 538 QKYALRELYKWRDRIARDKDESTGYVLPNHMLLQMAQSIPRDIQGIFACCNPVPQTVKEH 597
Q L+ L +WRD +AR++D + + LL++A+ +PR ++ + P+ ++ +
Sbjct: 1 QLALLKALREWRDELAREEDVPPYVIFSDATLLEIAEKLPRTLEELLRIPGVGPRKLERY 60
Query: 598 VLDIHAII 605
+I II
Sbjct: 61 GEEILEII 68
>gnl|CDD|176655 cd06146, mut-7_like_exo, DEDDy 3'-5' exonuclease domain of
Caenorhabditis elegans mut-7 and similar proteins. The
mut-7 subfamily is composed of Caenorhabditis elegans
mut-7 and similar proteins found in plants and
metazoans. Mut-7 is implicated in posttranscriptional
gene silencing. It contains a DEDDy-type DnaQ-like 3'-5'
exonuclease domain possessing three conserved sequence
motifs, termed ExoI, ExoII and ExoIII, with a specific
YX(3)D pattern at ExoIII. These motifs are clustered
around the active site and contain four conserved acidic
residues that serve as ligands for the two metal ions
required for catalysis.
Length = 193
Score = 73.1 bits (180), Expect = 3e-14
Identities = 43/187 (22%), Positives = 76/187 (40%), Gaps = 27/187 (14%)
Query: 323 MITEPEQVTQLVSE--LKQQQEIAIDLEYHNYRSYQGYT--CLMQISTRDKDYIVDTLKL 378
++ E++ L+ L+ + + ID E+ ++Q++T D+ +++D L L
Sbjct: 3 IVDSEEELEALLLALSLEAGRVVGIDSEWKPSFLGDSDPRVAILQLATEDEVFLLDLLAL 62
Query: 379 REDL-----EVLNEVLTDKNIVKVFHGADSDIKWLQKDFGLY------VVGMFDTHQACK 427
+L + D +++K+ G D+K L + V + D K
Sbjct: 63 ENLESEDWDRLLKRLFEDPDVLKLGFGFKQDLKALSASYPALKCMFERVQNVLDLQNLAK 122
Query: 428 ------------FLPMPRQSLAYLLKHYCDVDSDKTFQLFDWRHRPLPEPAIQYARTDTH 475
LP + LA L++ DK+ Q +W RPL E I YA D +
Sbjct: 123 ELQKSDMGRLKGNLPSKTKGLADLVQEVLGKPLDKSEQCSNWERRPLREEQILYAALDAY 182
Query: 476 YLLYVYD 482
LL V+D
Sbjct: 183 CLLEVFD 189
Score = 73.1 bits (180), Expect = 3e-14
Identities = 43/187 (22%), Positives = 76/187 (40%), Gaps = 27/187 (14%)
Query: 1768 MITEPEQVTQLVSE--LKQQQEIAIDLEYHNYRSYQGYT--CLMQISTRDKDYIVDTLKL 1823
++ E++ L+ L+ + + ID E+ ++Q++T D+ +++D L L
Sbjct: 3 IVDSEEELEALLLALSLEAGRVVGIDSEWKPSFLGDSDPRVAILQLATEDEVFLLDLLAL 62
Query: 1824 REDL-----EVLNEVLTDKNIVKVFHGADSDIKWLQKDFGLY------VVGMFDTHQACK 1872
+L + D +++K+ G D+K L + V + D K
Sbjct: 63 ENLESEDWDRLLKRLFEDPDVLKLGFGFKQDLKALSASYPALKCMFERVQNVLDLQNLAK 122
Query: 1873 ------------FLPMPRQSLAYLLKHYCDVDSDKTFQLFDWRHRPLPEPAIQYARTDTH 1920
LP + LA L++ DK+ Q +W RPL E I YA D +
Sbjct: 123 ELQKSDMGRLKGNLPSKTKGLADLVQEVLGKPLDKSEQCSNWERRPLREEQILYAALDAY 182
Query: 1921 YLLYVYD 1927
LL V+D
Sbjct: 183 CLLEVFD 189
>gnl|CDD|128635 smart00341, HRDC, Helicase and RNase D C-terminal. Hypothetical
role in nucleic acid binding. Mutations in the HRDC
domain cause human disease.
Length = 81
Score = 68.5 bits (168), Expect = 6e-14
Identities = 23/81 (28%), Positives = 41/81 (50%)
Query: 535 NNQQKYALRELYKWRDRIARDKDESTGYVLPNHMLLQMAQSIPRDIQGIFACCNPVPQTV 594
+Q LR L +WRD IAR +D +VLP+ L++MA ++P ++ + A +
Sbjct: 1 RERQLRLLRRLRQWRDEIARREDVPPYFVLPDETLIKMAAALPTNVSELLAIDGVGEEKA 60
Query: 595 KEHVLDIHAIILKARLQSLTK 615
+ + D+ A+I +A
Sbjct: 61 RRYGKDLLAVIQEASDSPSEA 81
>gnl|CDD|185631 PTZ00452, PTZ00452, actin; Provisional.
Length = 375
Score = 70.6 bits (172), Expect = 2e-12
Identities = 65/270 (24%), Positives = 122/270 (45%), Gaps = 26/270 (9%)
Query: 871 IVFDNGAWCCRVGWASCEKPNLIFKNLIAKPRK---ERGKKDGETQVGNDISNIEAVRFQ 927
+V DNG+ C++G A + P F ++ + ++ + E VG + V
Sbjct: 8 VVIDNGSGYCKIGIAGDDAPTSCFPAIVGRSKQNDGIFSTFNKEYYVGEEAQAKRGV-LA 66
Query: 928 LKTQFDKNVVTHYDIQEQIFDYAF-SHLSINTEGNVNHPIVLTEPFLNPNYSRSLMSELL 986
+K ++ +D E I+ +AF + L ++ E + P+ +T+ +N ++R M++++
Sbjct: 67 IKEPIQNGIINSWDDIEIIWHHAFYNELCMSPE---DQPVFMTDAPMNSKFNRERMTQIM 123
Query: 987 FECYQVPSVCYGIDSLFSYQYNGWEGQSGVIISCGYQCTHVIPVINGCIDASKAKRIDLG 1046
FE + P + +++ S +G G+++ G TH +PV G +I+L
Sbjct: 124 FETFNTPCLYISNEAVLSLYTSG--KTIGLVVDSGEGVTHCVPVFEGHQIPQAITKINLA 181
Query: 1047 GFSVIHHLHKLLQ-LKYP---SHINSITPSRSEELLWDYGFVATDYREHLRKWLDAEFYD 1102
G +L ++LQ L Y H I + E L + A D ++ R + ++ D
Sbjct: 182 GRLCTDYLTQILQELGYSLTEPHQRIIVKNIKERLC----YTALDPQDEKRIYKESNSQD 237
Query: 1103 SNVVKVQLPYAVPVPN-LTTEQQKDRRKEL 1131
S PY +P N LT + QK R E+
Sbjct: 238 S-------PYKLPDGNILTIKSQKFRCSEI 260
Score = 46.3 bits (109), Expect = 1e-04
Identities = 36/135 (26%), Positives = 58/135 (42%), Gaps = 2/135 (1%)
Query: 1338 SLNQEQELSPKEANQLHIGVERMCGPECLFQPSMLGSIQAGISETLNFVLNSYPQHIAQS 1397
S +Q+ + N L I ++ E LFQP ++G AGI + + Q
Sbjct: 233 SNSQDSPYKLPDGNILTIKSQKFRCSEILFQPKLIGLEVAGIHHLAYSSIKKCDLDLRQE 292
Query: 1398 LANNIFVTGSLCQLPGFVERLNKDLLENRPFQSHFSVSLAENPELDAW-SGARQFSLSEN 1456
L NI ++G PG RL+ +L P Q V+ + AW G+ Q +LS
Sbjct: 293 LCRNIVLSGGTTLFPGIANRLSNELTNLVPSQLKIQVAAPPDRRFSAWIGGSIQCTLSTQ 352
Query: 1457 FHDFAVTQSDYQEKG 1471
+ + + +Y E+G
Sbjct: 353 QPQW-IKRQEYDEQG 366
>gnl|CDD|240343 PTZ00280, PTZ00280, Actin-related protein 3; Provisional.
Length = 414
Score = 69.8 bits (171), Expect = 5e-12
Identities = 64/280 (22%), Positives = 120/280 (42%), Gaps = 40/280 (14%)
Query: 867 SSLPI-VFDNGAWCCRVGWASCEKPNLIFKNLIA------KPRKERGKKDGETQVGNDIS 919
S+LP+ V DNG ++G+A +P I LIA + R ++G +D + +G++
Sbjct: 2 STLPVVVIDNGTGYTKMGYAGNTEPTYIIPTLIADNSKQSRRRSKKGFEDLDFYIGDEAL 61
Query: 920 NIEAVRFQLKTQFDKNVVTHYDIQEQIFDYA-FSHLSINTEGNVNHPIVLTEPFLNPNYS 978
+ + L +V +D+ E+ ++ F +L E H +LTEP +NP +
Sbjct: 62 A-ASKSYTLTYPMKHGIVEDWDLMEKFWEQCIFKYLRCEPE---EHYFILTEPPMNPPEN 117
Query: 979 RSLMSELLFECYQVPSVCYGID---SLFSYQYNGWEGQ-----SGVIISCGYQCTHVIPV 1030
R +E++FE + V + + +L + + + +G +I G THVIPV
Sbjct: 118 REYTAEIMFETFNVKGLYIAVQAVLALRASWTSKKAKELGGTLTGTVIDSGDGVTHVIPV 177
Query: 1031 INGCIDASKAKRIDLGGFSVIHHLHKLLQLKYPSHINSITPSRSEELLWDYGFVATDYRE 1090
++G + S K I L G + + + ++L+ + ++ + Y +VA D
Sbjct: 178 VDGYVIGSSIKHIPLAGRDITNFIQQMLRERGEPIPAEDILLLAQRIKEKYCYVAPD--- 234
Query: 1091 HLRKWLDAEF--YDSNVVKVQLPYAVPVPNLTTEQQKDRR 1128
+ EF YDS+ P + +
Sbjct: 235 -----IAKEFEKYDSD----------PKNHFKKYTAVNSV 259
Score = 42.0 bits (99), Expect = 0.002
Identities = 33/153 (21%), Positives = 56/153 (36%), Gaps = 18/153 (11%)
Query: 1336 FTSLNQEQELSPKEANQLHIGVERMCGPECLFQPSML-GSIQAGISETLNFVLNSYPQHI 1394
F ++ K+ + +G ER GPE F P + + E ++ + S P
Sbjct: 250 FKKYTAVNSVT-KKPYTVDVGYERFLGPEMFFHPEIFSSEWTTPLPEVVDDAIQSCPIDC 308
Query: 1395 AQSLANNIFVTGSLCQLPGFVERLNKD--------------LLENRPFQSHFSVSLAENP 1440
+ L NI ++G GF +RL +D L + V++ +P
Sbjct: 309 RRPLYKNIVLSGGSTMFKGFDKRLQRDVRKRVDRRLKKAEELSGGKLKPIPIDVNVVSHP 368
Query: 1441 --ELDAWSGARQFSLSENFHDFAVTQSDYQEKG 1471
W G + S F T+++Y E G
Sbjct: 369 RQRYAVWYGGSMLASSPEFEKVCHTKAEYDEYG 401
>gnl|CDD|176656 cd09018, DEDDy_polA_RNaseD_like_exo, DEDDy 3'-5' exonuclease domain
of family-A DNA polymerases, RNase D, WRN, and similar
proteins. DEDDy exonucleases, part of the DnaQ-like (or
DEDD) exonuclease superfamily, catalyze the excision of
nucleoside monophosphates at the DNA or RNA termini in
the 3'-5' direction. They contain four invariant acidic
residues in three conserved sequence motifs termed ExoI,
ExoII and ExoIII. DEDDy exonucleases are classified as
such because of the presence of a specific YX(3)D
pattern at ExoIII. The four conserved acidic residues
serve as ligands for the two metal ions required for
catalysis. This family of DEDDy exonucleases includes
the proofreading domains of family A DNA polymerases, as
well as RNases such as RNase D and yeast Rrp6p. The
Egalitarian (Egl) and Bacillus-like DNA Polymerase I
subfamilies do not possess a completely conserved YX(3)D
pattern at the ExoIII motif. In addition, the
Bacillus-like DNA polymerase I subfamily has inactive
3'-5' exonuclease domains which do not possess the
metal-binding residues necessary for activity.
Length = 150
Score = 58.0 bits (140), Expect = 2e-09
Identities = 29/144 (20%), Positives = 51/144 (35%), Gaps = 5/144 (3%)
Query: 343 IAIDLEYHNYRSYQGYTCLMQIS-TRDKDYIVDTLKLREDLEVLNEVLTDKNIVKVFHGA 401
A D E + + L+Q++ ++ LE+L +L D+ +KV
Sbjct: 2 FAFDTETDSLDNISANLVLIQLAIEPGVAALIPVAHDYLALELLKPLLEDEKALKVGQNL 61
Query: 402 DSDIKWLQKDFGLYVVGMFDTHQACKFL--PMPRQSLAYLLKHYCDV--DSDKTFQLFDW 457
D L F FDT L R + L++ + ++ W
Sbjct: 62 KYDRGILLNYFIELRGIAFDTMLEAYILNSVAGRWDMDSLVERWLGHKLIKFESIAGKLW 121
Query: 458 RHRPLPEPAIQYARTDTHYLLYVY 481
++PL E +YA D L ++
Sbjct: 122 FNQPLTEEQGRYAAEDADVTLQIH 145
Score = 58.0 bits (140), Expect = 2e-09
Identities = 29/144 (20%), Positives = 51/144 (35%), Gaps = 5/144 (3%)
Query: 1788 IAIDLEYHNYRSYQGYTCLMQIS-TRDKDYIVDTLKLREDLEVLNEVLTDKNIVKVFHGA 1846
A D E + + L+Q++ ++ LE+L +L D+ +KV
Sbjct: 2 FAFDTETDSLDNISANLVLIQLAIEPGVAALIPVAHDYLALELLKPLLEDEKALKVGQNL 61
Query: 1847 DSDIKWLQKDFGLYVVGMFDTHQACKFL--PMPRQSLAYLLKHYCDV--DSDKTFQLFDW 1902
D L F FDT L R + L++ + ++ W
Sbjct: 62 KYDRGILLNYFIELRGIAFDTMLEAYILNSVAGRWDMDSLVERWLGHKLIKFESIAGKLW 121
Query: 1903 RHRPLPEPAIQYARTDTHYLLYVY 1926
++PL E +YA D L ++
Sbjct: 122 FNQPLTEEQGRYAAEDADVTLQIH 145
>gnl|CDD|212657 cd00012, NBD_sugar-kinase_HSP70_actin, Nucleotide-Binding Domain of
the sugar kinase/HSP70/actin superfamily. This
superfamily includes the actin family, the HSP70 family
of molecular chaperones and nucleotide exchange factors,
the ROK (repressor, ORF, kinase) family, the hexokinase
family, the FGGY family (which includes glycerol kinase
and similar carbohydrate kinases such as rhamnulokinase
and xylulokinase), the exopolyphosphatase/guanosine
pentaphosphate phosphohydrolase/nucleoside triphosphate
diphosphohydrolase family, propionate kinase/acetate
kinase family, glycerol dehydratase reactivase,
2-hydroxyglutaryl-CoA dehydratase component A,
N-acetylglucosamine kinase, butyrate kinase 2,
Escherichia coli YeaZ and similar glycoproteases, the
cell shape-determining protein MreB, the plasmid DNA
segregation factor ParM, cell cycle proteins FtsA, Pili
assembly protein PilM, ethanolamine utilization protein
EutJ, and similar proteins. The nucleotide-binding site
residues are conserved; the nucleotide sits in a deep
cleft formed between the two lobes of the
nucleotide-binding domain (NBD). Substrate binding to
superfamily members is associated with closure of this
catalytic site cleft. The functional activities of
several members of the superfamily, including
hexokinases, actin, and HSP70s, are modulated by
allosteric effectors, which may act on the cleft closure.
Length = 185
Score = 48.7 bits (116), Expect = 5e-06
Identities = 27/168 (16%), Positives = 47/168 (27%), Gaps = 20/168 (11%)
Query: 913 QVGNDISNIEAVRFQLKTQFDKNVVTHYDIQEQIFDYAFSHLSINTEGNVNHPIVLTEPF 972
V + I VT D E+ + + + +TEP
Sbjct: 13 GVADLDGEILPEEIVPTPVGRPGAVTDLDELEEALRELLKEALRQLKSEI-DAVGITEPG 71
Query: 973 LNPNYSRSLMS---------ELLFECYQVPSVCYGID---SLFSYQYNGWEGQSGVIISC 1020
P +R ++ L E V D + + G E + +++
Sbjct: 72 GVPKENREVIILPNLLLIPLALALEDLGGVPVAVVNDAVAAALAEGLFGKEEDTVLVVDL 131
Query: 1021 GYQCTHVIPVINGCIDASKAKRIDLGGFSVIHHLHKLLQL--KYPSHI 1066
G T + V +G K G + L +L L P +
Sbjct: 132 GTGTTGIAIVEDG-----KGGVGAAGELGIAEALAAVLNLLDGLPDAV 174
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC, primarily
archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and C-terminal
domains of this protein are well conserved, but the
central hinge region is skewed in composition and highly
divergent [Cellular processes, Cell division, DNA
metabolism, Chromosome-associated proteins].
Length = 1164
Score = 47.0 bits (112), Expect = 1e-04
Identities = 43/263 (16%), Positives = 95/263 (36%), Gaps = 34/263 (12%)
Query: 1126 DRRKELAK-------------KLVEMNAKKREERLVDDERHLNELLELREIV-ELTPSDH 1171
DR+KE A L+ +++ ERL + L + E +
Sbjct: 169 DRKKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYEL 228
Query: 1172 SHAREAF---KSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNG-EDLTEEPKAKLSKE 1227
+EA K + L + + +L ++I + + ++ E+L ++ K +E
Sbjct: 229 LKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEE 288
Query: 1228 IAVPESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISE----- 1282
K + E + + A + +++ +D +R E +L++E
Sbjct: 289 QL------RVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELE 342
Query: 1283 --LARKEKRDDDFGMR-DEDWDVYKVINKDAG--DTDSEEEQERLIELEEILRQHDPEFT 1337
+ + KR D E + + + + D + E ++ L + E L + E
Sbjct: 343 REIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREIN 402
Query: 1338 SLNQEQELSPKEANQLHIGVERM 1360
L +E + +E +L + +
Sbjct: 403 ELKRELDRLQEELQRLSEELADL 425
Score = 42.7 bits (101), Expect = 0.002
Identities = 49/253 (19%), Positives = 99/253 (39%), Gaps = 29/253 (11%)
Query: 1122 EQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSM 1181
E+ + EL K+L E+ E + E L ++E + ++ + +
Sbjct: 254 EKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEK 313
Query: 1182 GINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEP--KAKLSKEIA---------- 1229
++D + + +L+ +I+K A+I ++ EE + KL++E A
Sbjct: 314 E-RELEDAEERLAKLEAEIDKLLAEIEELE--REIEEERKRRDKLTEEYAELKEELEDLR 370
Query: 1230 --VPESEAEFKAWLIETKKKRAYI---IDKKNARKQRRQDLAKRRTAAAQERMRLISELA 1284
+ E + EF E K R + + N K+ L + ++E L + +A
Sbjct: 371 AELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIA 430
Query: 1285 RKEKRDDDFGMRDEDWDVYKVINKDAGD-----TDSEEEQERLIELEEILRQHDPEFTSL 1339
E + ++ E D I K D + ++ L +L+E + + E + L
Sbjct: 431 GIEAKINELEEEKE--DKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKL 488
Query: 1340 NQEQELSPKEANQ 1352
++EL+ EA
Sbjct: 489 --QRELAEAEAQA 499
Score = 36.6 bits (85), Expect = 0.15
Identities = 45/248 (18%), Positives = 92/248 (37%), Gaps = 24/248 (9%)
Query: 1123 QQKDRRKELAKKLVEMNAKKREERLVDD----ERHLNELLEL-----REIVELTPSDHSH 1173
+ + L ++L + K+ L + E L+EL + R+I E+
Sbjct: 671 SEPAELQRLRERLEGL--KRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQL 728
Query: 1174 AREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPES 1233
+E K +++L + ++ L+Q+IE K+++ + EE KL + + E+
Sbjct: 729 EQEEEKLKER--LEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEA 786
Query: 1234 EAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDF 1293
+ +++ + R + R R+ L E KE ++
Sbjct: 787 RLSHSRIPEIQAELSK--LEE----EVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQE 840
Query: 1294 GMRDEDWDVYKVINK---DAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELSPKEA 1350
D + + + EE +E L ELE LR + L +E++ +
Sbjct: 841 QRI--DLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQL 898
Query: 1351 NQLHIGVE 1358
+L +E
Sbjct: 899 RELERKIE 906
Score = 34.3 bits (79), Expect = 0.83
Identities = 22/105 (20%), Positives = 45/105 (42%), Gaps = 6/105 (5%)
Query: 1125 KDRRKELAKKLVEMNAKKRE--ERLVDDERHLNELLELREIVELTPSDHSHAREAFKSMG 1182
K++ K + K++ +N KK E E L E L +L + + + +
Sbjct: 846 KEQIKSIEKEIENLNGKKEELEEEL---EELEAALRDLESRLGDLKKERDELEAQLRELE 902
Query: 1183 INNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKE 1227
I++L I + ++++ + KAK+ A E+PK + +
Sbjct: 903 -RKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEI 946
Score = 31.6 bits (72), Expect = 6.0
Identities = 32/123 (26%), Positives = 60/123 (48%), Gaps = 5/123 (4%)
Query: 1120 TTEQQKDRRKELAKKLVEMNAKKREE-RLVDDERHLN-ELLELREIVELTPSDHSHAREA 1177
T ++ KD R++L K E+N KRE RL ++ + L+ EL +L + + + E
Sbjct: 383 TRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEE 442
Query: 1178 FKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEF 1237
+ + I+ + QL + K + ++ D E+ +KL +E+A E+EA+
Sbjct: 443 KEDKAL-EIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELA--EAEAQA 499
Query: 1238 KAW 1240
+A
Sbjct: 500 RAS 502
>gnl|CDD|235591 PRK05755, PRK05755, DNA polymerase I; Provisional.
Length = 880
Score = 45.9 bits (110), Expect = 3e-04
Identities = 30/122 (24%), Positives = 49/122 (40%), Gaps = 4/122 (3%)
Query: 304 FLKCEEPKQALPLSDTPLMMITEPEQVTQLVSELKQQQEIAIDLEYHNYRSYQGYTCLMQ 363
L+ +A PL + I + E++ +++LK A D E + Q +
Sbjct: 279 LLRRAAAAEAAPLDEEDYETILDEEELEAWLAKLKAAGLFAFDTETTSLDPMQAELVGLS 338
Query: 364 ISTRDKD--YIVDTLKLREDLEVLNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGM-F 420
+ + YI RE L L +L D I KV D+ L +G+ + G+ F
Sbjct: 339 FAVEPGEAAYIPLDQLDREVLAALKPLLEDPAIKKVGQNLKYDLHVL-ARYGIELRGIAF 397
Query: 421 DT 422
DT
Sbjct: 398 DT 399
Score = 45.9 bits (110), Expect = 3e-04
Identities = 30/122 (24%), Positives = 49/122 (40%), Gaps = 4/122 (3%)
Query: 1749 FLKCEEPKQALPLSDTPLMMITEPEQVTQLVSELKQQQEIAIDLEYHNYRSYQGYTCLMQ 1808
L+ +A PL + I + E++ +++LK A D E + Q +
Sbjct: 279 LLRRAAAAEAAPLDEEDYETILDEEELEAWLAKLKAAGLFAFDTETTSLDPMQAELVGLS 338
Query: 1809 ISTRDKD--YIVDTLKLREDLEVLNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGM-F 1865
+ + YI RE L L +L D I KV D+ L +G+ + G+ F
Sbjct: 339 FAVEPGEAAYIPLDQLDREVLAALKPLLEDPAIKKVGQNLKYDLHVL-ARYGIELRGIAF 397
Query: 1866 DT 1867
DT
Sbjct: 398 DT 399
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 43.5 bits (103), Expect = 0.001
Identities = 39/237 (16%), Positives = 87/237 (36%), Gaps = 26/237 (10%)
Query: 1118 NLTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHL-NELLELREIVELTPSDHSHARE 1176
L + ++ R+ K ++ E+L L EL EL E +E E
Sbjct: 706 ELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEE 765
Query: 1177 AFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAE 1236
+ L++ + K K +I +E +L +E+ E +
Sbjct: 766 E---------------LESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLD 810
Query: 1237 FKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMR 1296
+E+ ++R ++++ + + + + +E + + + + K + + +
Sbjct: 811 ALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELE-ELE 869
Query: 1297 DEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELSPKEANQL 1353
E ++ + +E +E ELEE LR+ + E L +E E + +L
Sbjct: 870 AEKEELEDEL---------KELEEEKEELEEELRELESELAELKEEIEKLRERLEEL 917
Score = 39.3 bits (92), Expect = 0.027
Identities = 39/212 (18%), Positives = 87/212 (41%), Gaps = 12/212 (5%)
Query: 1122 EQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSD---HSHAREAF 1178
+++ + +E + L E K +E + + E L E E +E + A E
Sbjct: 757 QERLEELEEELESLEEA-LAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERE 815
Query: 1179 KSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFK 1238
+ L + I +L+++IE+ + K+ + E+ +L +E+ E EAE +
Sbjct: 816 LESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELE--ELEAEKE 873
Query: 1239 AWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDE 1298
+E + K ++K ++ ++L +E +L L E + + +
Sbjct: 874 E--LEDELKELE--EEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELP 929
Query: 1299 DWDVYKVINKDAGDTDSEEEQERLIELEEILR 1330
++ + + ++ DT E + + LEE +
Sbjct: 930 --ELEEELEEEYEDTLETELEREIERLEEEIE 959
Score = 37.0 bits (86), Expect = 0.12
Identities = 22/136 (16%), Positives = 51/136 (37%), Gaps = 9/136 (6%)
Query: 1122 EQQKDRRKELAKKLVEMNAKKRE-----ERLVDDERHLNELLELREIVELTPSDHSHARE 1176
E+ + ELA+ E+ KRE ERL L +L E + +E + E
Sbjct: 383 EELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELE 442
Query: 1177 AFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAE 1236
+++L + + +L+ ++++ + ++ E+ + L + E+E
Sbjct: 443 ELNE----ELEELEEQLEELRDRLKELERELAELQEELQRLEKELSSLEARLDRLEAEQR 498
Query: 1237 FKAWLIETKKKRAYII 1252
+ + +
Sbjct: 499 ASQGVRAVLEALESGL 514
Score = 33.1 bits (76), Expect = 1.7
Identities = 42/229 (18%), Positives = 84/229 (36%), Gaps = 48/229 (20%)
Query: 1125 KDRRKELAKKL--VEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSMG 1182
K+R++E +KL E N ++ E+ L + E+ L +L E E + RE ++
Sbjct: 171 KERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALL 230
Query: 1183 INNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLI 1242
+ +++L K + +L++++ + + ++ +E + KEI
Sbjct: 231 LAKLKELRKELEELEEELSRLEEELEEL-------QEELEEAEKEI-------------- 269
Query: 1243 ETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDEDWDV 1302
K ++L + +E + L E+ E R
Sbjct: 270 -------------EELKSELEELREELEELQEELLELKEEIEELEGEISLLRER------ 310
Query: 1303 YKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSL-NQEQELSPKEA 1350
+ + + EE +ERL EL+E + E +EL A
Sbjct: 311 -----LEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLA 354
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 42.6 bits (101), Expect = 0.002
Identities = 17/64 (26%), Positives = 34/64 (53%)
Query: 764 KSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKENDSS 823
K+ +K++ EK +K E ++EK+++ K K E+ EE EK + K + E + +
Sbjct: 407 KATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEK 466
Query: 824 KGKS 827
+ +
Sbjct: 467 EEEE 470
Score = 38.4 bits (90), Expect = 0.035
Identities = 18/74 (24%), Positives = 36/74 (48%), Gaps = 1/74 (1%)
Query: 752 KITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRH 811
K + + K+E+K E EK+++KK ++KEE+ +K K E+ +E E+
Sbjct: 406 KKATKKIKKIVEKAEKKREEEKKEKKK-KAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEE 464
Query: 812 KIKSEPKENDSSKG 825
+ + E ++
Sbjct: 465 EKEEEEEKKKKQAT 478
Score = 35.7 bits (83), Expect = 0.29
Identities = 14/66 (21%), Positives = 36/66 (54%)
Query: 761 KQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKEN 820
+ +++ ++ ++ KK E++E+K++ A KK + E+ EE +K + + + E E
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEE 464
Query: 821 DSSKGK 826
+ + +
Sbjct: 465 EKEEEE 470
Score = 31.4 bits (72), Expect = 6.1
Identities = 23/115 (20%), Positives = 52/115 (45%), Gaps = 6/115 (5%)
Query: 701 KTSRFET-PYQRFLKSK--EYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEP 757
+ R E P+ + E A + ++ +++I+ LT K + + +
Sbjct: 364 RKVRREVLPFLSIIFKHNPELAARLAAFLE---LTEEEIEFLTGSKKATKKIKKIVEKAE 420
Query: 758 VVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHK 812
++ K E+K + K+K+ E E+EK+E+ ++ + + E+ E+ K+ K
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 42.8 bits (100), Expect = 0.002
Identities = 32/117 (27%), Positives = 58/117 (49%), Gaps = 9/117 (7%)
Query: 713 LKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEME 772
K E AK E++ K E+KK++ L + E ++ ++ + K +EE + E
Sbjct: 1612 AKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAE 1671
Query: 773 KEKRK--KILREREEEKEEQPAMKKIKVE-------KPEESNEKTKRHKIKSEPKEN 820
++K+K + + E+EK+ A+KK E K +E+ EK K ++K +EN
Sbjct: 1672 EDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEEN 1728
Score = 42.4 bits (99), Expect = 0.003
Identities = 31/111 (27%), Positives = 54/111 (48%)
Query: 714 KSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEK 773
K++E K E + KE E KK + L + E ++ ++ + K E K E E+
Sbjct: 1682 KAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEE 1741
Query: 774 EKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKENDSSK 824
+K+K +++EE++++ A K + EK E K K I+ E E D +
Sbjct: 1742 DKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKR 1792
Score = 36.7 bits (84), Expect = 0.19
Identities = 36/145 (24%), Positives = 67/145 (46%), Gaps = 8/145 (5%)
Query: 686 DKMKIGSNKYQKIKLKTSRFETPYQRFLKSKEYAKAIQEKVDKENAEQKKIDALTPQVKT 745
D++K K + + K + + K E AK E K +KK DA K
Sbjct: 1281 DELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAA----KK 1336
Query: 746 EPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKI--KVEKPEE 803
+ EE K E + + ++ E +EK + +++EE K++ A KK + +K +E
Sbjct: 1337 KAEEAKK-AAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADE 1395
Query: 804 SNEKTKRHKIKSEP-KENDSSKGKS 827
+ +K + K K++ K+ ++K K+
Sbjct: 1396 AKKKAEEDKKKADELKKAAAAKKKA 1420
Score = 35.5 bits (81), Expect = 0.40
Identities = 33/138 (23%), Positives = 63/138 (45%), Gaps = 14/138 (10%)
Query: 694 KYQKIKLKTSRFETPYQRFLKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKI 753
K ++ K ++ ++K+ +A E+V K E+KK+ A + K E +K
Sbjct: 1565 KAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKA--EEAKKAEEAKIKA 1622
Query: 754 TQ---------EPVVLKQIKSEEKV---EMEKEKRKKILREREEEKEEQPAMKKIKVEKP 801
+ + LK+ ++EEK E++K + + ++ EE K+ + KK + K
Sbjct: 1623 EELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKK 1682
Query: 802 EESNEKTKRHKIKSEPKE 819
E +EK +K E +E
Sbjct: 1683 AEEDEKKAAEALKKEAEE 1700
Score = 34.0 bits (77), Expect = 1.2
Identities = 30/139 (21%), Positives = 60/139 (43%), Gaps = 7/139 (5%)
Query: 694 KYQKIKLKTSRFETPYQRFLKSKEYAKAIQEKVDKE---NAEQKKIDAL-TPQVKTEPEE 749
K ++ K K + + K+ E AKA E E E+ + + K + +
Sbjct: 1323 KAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADA 1382
Query: 750 NVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKI-KVEKPEESNEKT 808
K +E K ++++K E +K+K ++ + +K+ A KK + +K +E+ +K
Sbjct: 1383 AKKKAEE--KKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKA 1440
Query: 809 KRHKIKSEPKENDSSKGKS 827
+ K E K+ K+
Sbjct: 1441 EEAKKADEAKKKAEEAKKA 1459
Score = 33.6 bits (76), Expect = 1.5
Identities = 30/113 (26%), Positives = 57/113 (50%), Gaps = 4/113 (3%)
Query: 714 KSKEYAKAIQEKV--DKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEM 771
K++E KA ++K + + AE+ K AL + + E +I + + ++ K + E
Sbjct: 1553 KAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEA 1612
Query: 772 EKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKENDSSK 824
+K + KI + EE K+ + KK++ K +E+ EK K ++K +EN
Sbjct: 1613 KKAEEAKI--KAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKA 1663
Score = 33.2 bits (75), Expect = 1.9
Identities = 29/109 (26%), Positives = 51/109 (46%), Gaps = 10/109 (9%)
Query: 713 LKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEME 772
LK AK ++ K+ E+KK D + K + EE K + ++ K E+ + +
Sbjct: 1410 LKKAAAAKKKADEAKKKAEEKKKAD----EAKKKAEEAKKADEAKKKAEEAKKAEEAKKK 1465
Query: 773 KEKRKKI--LREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKE 819
E+ KK +++ EE ++ KK K EE+ +K K +E K+
Sbjct: 1466 AEEAKKADEAKKKAEEAKKADEAKK----KAEEAKKKADEAKKAAEAKK 1510
Score = 32.8 bits (74), Expect = 2.4
Identities = 32/113 (28%), Positives = 59/113 (52%), Gaps = 8/113 (7%)
Query: 713 LKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLK----QIKSEEK 768
LK + E++ K+ AE+KK ++K EEN KI E + + K+EE
Sbjct: 1694 LKKEAEEAKKAEELKKKEAEEKK---KAEELKKAEEEN-KIKAEEAKKEAEEDKKKAEEA 1749
Query: 769 VEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKEND 821
+ E+EK+K ++EEEK+ + K+ + EE +E+ ++ +++ + K D
Sbjct: 1750 KKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKD 1802
Score = 32.8 bits (74), Expect = 2.6
Identities = 31/123 (25%), Positives = 50/123 (40%), Gaps = 13/123 (10%)
Query: 714 KSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEK 773
K E AK + K+ E KK + E E K +E+K E K
Sbjct: 1322 KKAEEAKKKADAAKKKAEEAKKAA----EAAKAEAEAAADEAEAAEEKAEAAEKKKEEAK 1377
Query: 774 EKRKKILREREEEKEEQPAMKKI--------KVEKPEESNEKTKRHKIKSEPKEN-DSSK 824
+K ++ EE+K+ A KK +++K + +K K K+E K+ D +K
Sbjct: 1378 KKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAK 1437
Query: 825 GKS 827
K+
Sbjct: 1438 KKA 1440
Score = 32.4 bits (73), Expect = 3.2
Identities = 31/136 (22%), Positives = 57/136 (41%), Gaps = 9/136 (6%)
Query: 686 DKMKIGSNKYQKIKLKTSRFETPYQRFLKSK-EYAKAIQEKVDKENAEQKKIDALTPQV- 743
+ K K + K E K K E K +++ K A +KK D +
Sbjct: 1369 AEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAE 1428
Query: 744 -KTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVE--K 800
K + +E K +E + E K + E+ K+ + +++ EE ++ KK E K
Sbjct: 1429 EKKKADEAKKKAEE----AKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKK 1484
Query: 801 PEESNEKTKRHKIKSE 816
+E+ +K + K K++
Sbjct: 1485 ADEAKKKAEEAKKKAD 1500
Score = 32.4 bits (73), Expect = 3.8
Identities = 24/106 (22%), Positives = 49/106 (46%), Gaps = 2/106 (1%)
Query: 724 EKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILRER 783
E+ K+ E KK + + E ++ IK+EE + ++ K+ + ++
Sbjct: 1233 EEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKA 1292
Query: 784 EEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSE--PKENDSSKGKS 827
+E K+ + K + +K E +K K K+E K+ D++K K+
Sbjct: 1293 DEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKA 1338
Score = 32.0 bits (72), Expect = 4.1
Identities = 45/222 (20%), Positives = 85/222 (38%), Gaps = 13/222 (5%)
Query: 1119 LTTEQQKDRRKELAKKLVEM-NAKKREERLVDDERHLNELLELREIVELTPSDHSHAREA 1177
E+ K + KK E A + +++ +D++ +EL + + A E
Sbjct: 1371 KKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEK 1430
Query: 1178 FKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEF 1237
K+ + K ++ ++K E+ K A E+ + +AK E A EA+
Sbjct: 1431 KKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKK 1490
Query: 1238 KAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARK--------EKR 1289
KA E KK+A K K++ + K A + + E A+K EK+
Sbjct: 1491 KA---EEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEE-AKKADEAKKAEEKK 1546
Query: 1290 DDDFGMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQ 1331
D + E+ + K +EE++ + E ++
Sbjct: 1547 KADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKK 1588
Score = 31.6 bits (71), Expect = 5.2
Identities = 27/143 (18%), Positives = 59/143 (41%), Gaps = 3/143 (2%)
Query: 686 DKMKIGSNKYQKIKLKTSRFETPYQRFLKSKEYAKAIQEKVDKENAEQKKIDALTPQVKT 745
+ + + + + + K + KA + K+ E+KK D + +
Sbjct: 1345 AEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKA--DAAKKKAEEKKKADEAKKKAEE 1402
Query: 746 EPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKI-KVEKPEES 804
+ ++ ++ + K+ +K EK+K + ++ EE K+ A KK + +K EE+
Sbjct: 1403 DKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEA 1462
Query: 805 NEKTKRHKIKSEPKENDSSKGKS 827
+K + K E K+ K+
Sbjct: 1463 KKKAEEAKKADEAKKKAEEAKKA 1485
Score = 31.6 bits (71), Expect = 5.5
Identities = 25/109 (22%), Positives = 47/109 (43%), Gaps = 4/109 (3%)
Query: 714 KSKEYAKAIQEKVDKENA----EQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKV 769
K+ E KA + K + A E KK D + + + K +E ++K E++
Sbjct: 1498 KADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEEL 1557
Query: 770 EMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPK 818
+ +EK+K ++ EE + K + +K EE+ + + E K
Sbjct: 1558 KKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKK 1606
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 41.4 bits (97), Expect = 0.005
Identities = 25/108 (23%), Positives = 51/108 (47%), Gaps = 1/108 (0%)
Query: 724 EKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILRER 783
E +E E++++ + K +P+E K + K+ + ++ E EKEK+ + R+R
Sbjct: 101 ESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDR 160
Query: 784 EEEKEEQPAMKKIKVEKPEES-NEKTKRHKIKSEPKENDSSKGKSGGT 830
EEEK+ + K + +KP + K+ + E + + + G
Sbjct: 161 EEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKP 208
Score = 37.9 bits (88), Expect = 0.058
Identities = 38/161 (23%), Positives = 75/161 (46%), Gaps = 7/161 (4%)
Query: 693 NKYQKIKLKTSRFETPYQRFLKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVK 752
K +K K K E P R K KE AK + +KE ++KK++ + + + E V+
Sbjct: 113 VKEEKKKKKEKPKEEPKDR--KPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVR 170
Query: 753 ITQEPVVLKQIKSEEKVE--MEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKR 810
P + K K + E+EK+++ RE + K E+P + + + ++ ++ ++
Sbjct: 171 AKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGKDRETT 230
Query: 811 HKIKSEPKENDSSKGKSGGTISTVDFSKVNYNKYMAKPDTK 851
S +E++S + S+ K + + MA P+T+
Sbjct: 231 ---TSPMEEDESRQSSEISRRSSSSLKKPDPSPSMASPETR 268
Score = 37.2 bits (86), Expect = 0.10
Identities = 29/114 (25%), Positives = 54/114 (47%), Gaps = 4/114 (3%)
Query: 714 KSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEK 773
+SK + ++V+K ++ A T K E+ K +E +Q+K E+K + EK
Sbjct: 68 ESKLSSDEAVKRVEKGGSKGPA--AKTKPAKEPKNESGK--EEEKEKEQVKEEKKKKKEK 123
Query: 774 EKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKENDSSKGKS 827
K + R+ +EE +E+ K+ + EK ++ E R + K + S+ K
Sbjct: 124 PKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKK 177
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 41.1 bits (96), Expect = 0.006
Identities = 31/140 (22%), Positives = 57/140 (40%), Gaps = 5/140 (3%)
Query: 696 QKIKLKTSRFETPYQRFLKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQ 755
+ ++ + E+ Q K +E + ++ +E E+K + + E EE
Sbjct: 245 ELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELL 304
Query: 756 EPVVLKQIKSEEKVEMEKEKRKKI-----LREREEEKEEQPAMKKIKVEKPEESNEKTKR 810
+ K E+ E EKE +K +E EE E++ +IK E EE E+ ++
Sbjct: 305 KLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEK 364
Query: 811 HKIKSEPKENDSSKGKSGGT 830
+ K E E + K +
Sbjct: 365 LQEKLEQLEEELLAKKKLES 384
Score = 41.1 bits (96), Expect = 0.007
Identities = 25/160 (15%), Positives = 57/160 (35%), Gaps = 3/160 (1%)
Query: 664 NTHLSSYDPQDKKIAQIGLFFEDKMKIGSNKYQKIKLKTSRFETPYQRFLKSKEYAKAIQ 723
L+ + + +K+ E + E + E + I+
Sbjct: 774 EKELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAEL-LEEEQLLIEQEEKIK 832
Query: 724 EKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEME--KEKRKKILR 781
E+ +E A + K + ++ E E ++ L Q ++ E+E K K + +
Sbjct: 833 EEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESK 892
Query: 782 EREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKEND 821
E +E++E++ ++ + + E E +I E
Sbjct: 893 EEKEKEEKKELEEESQKDNLLEEKENEIEERIAEEAIILL 932
Score = 35.7 bits (82), Expect = 0.34
Identities = 30/249 (12%), Positives = 85/249 (34%), Gaps = 21/249 (8%)
Query: 1121 TEQQKDRRKELAKK------LVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHA 1174
+ +++ +++ L K L E+ E +L + + L +
Sbjct: 166 SREKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEE 225
Query: 1175 REAFKSMG---INNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVP 1231
+ I L + + Q++IE +K ++ + + KE +
Sbjct: 226 NLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQ 285
Query: 1232 ESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQ---ERMRLISELARKEK 1288
E E + A E K ++++ + + +++ + ++ + E KE
Sbjct: 286 EEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKEL 345
Query: 1289 RDDDFGMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELSPK 1348
++ ++ + ++ + + +++ EE+L + E L+ +L +
Sbjct: 346 ---------KELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEE 396
Query: 1349 EANQLHIGV 1357
E +
Sbjct: 397 ELELKNEEE 405
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 41.3 bits (97), Expect = 0.007
Identities = 37/188 (19%), Positives = 74/188 (39%), Gaps = 23/188 (12%)
Query: 1118 NLTTEQQKDRRKELAKKLVEMNAKKREERLVDD---------------ERHLNELLE-LR 1161
L +Q K+ ++L +L E+ E RL+ ++ L EL E L
Sbjct: 551 KLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELEERLS 610
Query: 1162 EIVELTPSDHSHAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPK 1221
++ EL S E ++L + +L + E + A E+ EE +
Sbjct: 611 QLEELLQSLELSEAENEL---EEAEEELESELEKLNLQAELEELLQAALEELEEKVEELE 667
Query: 1222 AKLSKEIAVPESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLIS 1281
A++ +E+ E+E + + E ++ + ++ ++ ++L K+ Q L S
Sbjct: 668 AEIRRELQRIENEEQLE----EKLEELEQLEEELEQLREELEELLKKLGEIEQLIEELES 723
Query: 1282 ELARKEKR 1289
A E+
Sbjct: 724 RKAELEEL 731
Score = 31.7 bits (72), Expect = 5.2
Identities = 22/146 (15%), Positives = 63/146 (43%), Gaps = 11/146 (7%)
Query: 684 FEDKMKIGSNKYQKIKLKTSRFETPYQRFLKSKEYAKAIQEKVDKENAEQKKIDALTPQV 743
E ++ K ++++ E + + + ++E ++K + +++++ L ++
Sbjct: 279 LERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKL 338
Query: 744 KTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREE--EKEEQPAMKKIKVEKP 801
+ E ++ +E + ++ E+R K L ER E EKE + A++++K +
Sbjct: 339 EKLESELEELAEE---------KNELAKLLEERLKELEERLEELEKELEKALERLKQLEE 389
Query: 802 EESNEKTKRHKIKSEPKENDSSKGKS 827
K + ++ + +E +
Sbjct: 390 AIQELKEELAELSAALEEIQEELEEL 415
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc gene
is often associated with scpB (TIGR00281) and scpA genes,
where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 40.4 bits (95), Expect = 0.011
Identities = 44/234 (18%), Positives = 93/234 (39%), Gaps = 29/234 (12%)
Query: 1122 EQQKDRRKELAKKLVEMNAKKREERLVDDERH--LNELLELREIVELTPSDHSHAREAFK 1179
E+ ++ +KEL E++ +++++++ + +L EL +E S E
Sbjct: 281 EEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELA 340
Query: 1180 SMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKA 1239
+ +++L + + L+ ++E+ +A++ + + EE L ++A E +
Sbjct: 341 ELE-EKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIA--- 396
Query: 1240 WLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDED 1299
+ + + R + L RR QE L+ +L E + E+
Sbjct: 397 ---SLNNEIERL-------EARLERLEDRRERLQQEIEELLKKLEEAEL--KELQAELEE 444
Query: 1300 WDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELSPKEANQL 1353
+ + +EE ERL E E LR+ E Q + + +E QL
Sbjct: 445 --------LEEELEELQEELERLEEALEELRE---ELEEAEQALDAAERELAQL 487
Score = 40.0 bits (94), Expect = 0.015
Identities = 48/243 (19%), Positives = 107/243 (44%), Gaps = 15/243 (6%)
Query: 1122 EQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELR-EIVELTPSDHSHAREAFKS 1180
E+ ++ +EL ++L E + E E +L ELR E+ EL + ++ +
Sbjct: 235 EELREELEELQEELKEAEEELEELTAELQELEE-KLEELRLEVSELE-EEIEELQKELYA 292
Query: 1181 MGI------NNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESE 1234
+ Q L + + L++++E+ +A++ + D E A+L +++ E +
Sbjct: 293 LANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLE--ELK 350
Query: 1235 AEFKAWLIETKKKRAYIIDKKNARKQRRQDL--AKRRTAAAQERMRLI-SELARKEKRDD 1291
E ++ E ++ A + + ++ ++ + L + + A + ++ + +E+ R E R +
Sbjct: 351 EELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLE 410
Query: 1292 DFGMRDEDWD-VYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELSPKEA 1350
R E + + K + + +E Q L ELEE L + E L + E +E
Sbjct: 411 RLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREEL 470
Query: 1351 NQL 1353
+
Sbjct: 471 EEA 473
Score = 38.9 bits (91), Expect = 0.035
Identities = 44/236 (18%), Positives = 84/236 (35%), Gaps = 36/236 (15%)
Query: 1122 EQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSM 1181
Q EL ++ E+ + E E E+ EL +E + REA +
Sbjct: 750 AQLSKELTELEAEIEELEERLEEAEEELAE-AEAEIEELEAQIEQLKEELKALREALDEL 808
Query: 1182 GINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIA------------ 1229
+ LN+ L++++E + +I A + EE +LS++I
Sbjct: 809 R-AELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEEL 867
Query: 1230 VPESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKR 1289
+ E E+E +A L E + + ++L++ +R L EL ++
Sbjct: 868 IEELESELEALLNERASLEEAL----ALLRSELEELSEELRELESKRSELRRELEELREK 923
Query: 1290 DDDFGMR------------DEDWDVYKVI------NKDAGDTDSEEEQERLIELEE 1327
+R + + Y + ++ + D EE + RL LE
Sbjct: 924 LAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRRLKRLEN 979
Score = 35.0 bits (81), Expect = 0.58
Identities = 43/242 (17%), Positives = 92/242 (38%), Gaps = 60/242 (24%)
Query: 1125 KDRRKELAKKLV--EMNAKKREERLVDDERHLNEL----------LELREIVELTPSDHS 1172
K+RRKE +KL N + E+ L + ER L L EL+ EL
Sbjct: 171 KERRKETERKLERTRENLDRLEDILNELERQLKSLERQAEKAERYKELKA--EL------ 222
Query: 1173 HAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPE 1232
RE ++ + +++L + + +LQ+++++ EE +L+ E+ E
Sbjct: 223 --RELELALLVLRLEELREELEELQEELKE--------------AEEELEELTAELQELE 266
Query: 1233 SEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDD 1292
+ +E + ++++ Q+ + +++ +++ E + +
Sbjct: 267 EK-------LEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRE---RLANLER 316
Query: 1293 FGMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELSPKEANQ 1352
+ + EE + +L EL E L + + + L +E E E +
Sbjct: 317 --------------QLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEE 362
Query: 1353 LH 1354
L
Sbjct: 363 LE 364
Score = 34.3 bits (79), Expect = 0.86
Identities = 49/247 (19%), Positives = 89/247 (36%), Gaps = 18/247 (7%)
Query: 1117 PNLTTEQQKDRRKELAKKLVEMNAKKRE-----ERLVDDERHLNELLELREIVELTPSDH 1171
N + +++ +EL +K+ E+ K E L + L E LE S
Sbjct: 668 TNSSILERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQ 727
Query: 1172 SHAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVP 1231
A + ++ L + I QL +++ + +A+I + EE A+ EI
Sbjct: 728 ISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIE-- 785
Query: 1232 ESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDD 1291
E EA+ + E K R +D+ A L + + L +A E+R +
Sbjct: 786 ELEAQIEQLKEELKALRE-ALDELRAELTL---LNEEAANLRERLESLERRIAATERRLE 841
Query: 1292 DF-----GMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELS 1346
D + ++ + I + + EE + L L + L E E
Sbjct: 842 DLEEQIEELSEDIESLAAEIEE--LEELIEELESELEALLNERASLEEALALLRSELEEL 899
Query: 1347 PKEANQL 1353
+E +L
Sbjct: 900 SEELREL 906
Score = 33.9 bits (78), Expect = 1.2
Identities = 32/213 (15%), Positives = 77/213 (36%), Gaps = 22/213 (10%)
Query: 1155 NELLELREIVELTPSDHSHAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGE 1214
E+ EL E +E + +A + +L K + +L++++E+ + ++ +
Sbjct: 677 REIEELEEKIEELEEKIAELEKA--------LAELRKELEELEEELEQLRKELEELSRQI 728
Query: 1215 DLTEEPKAKLSKEIAVPESE--------AEFKAWLIETKKKRAYIIDKKNARKQRRQDLA 1266
+ A+L E+ E E +A + E +++ ++ + ++L
Sbjct: 729 SALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELE 788
Query: 1267 KRRTAAAQERMRLISELARKEKRDDDFGMRDEDWDVYKVINKDAGDT-DSEEEQERLIEL 1325
+ +E L L + + + + + + RL +L
Sbjct: 789 AQIEQLKEELKALREALDELRAE-----LTLLNEEAANLRERLESLERRIAATERRLEDL 843
Query: 1326 EEILRQHDPEFTSLNQEQELSPKEANQLHIGVE 1358
EE + + + SL E E + +L +E
Sbjct: 844 EEQIEELSEDIESLAAEIEELEELIEELESELE 876
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 39.8 bits (94), Expect = 0.017
Identities = 40/143 (27%), Positives = 62/143 (43%), Gaps = 25/143 (17%)
Query: 1122 EQQKDRRKELAKKLVEMNAK--KREERLVDDERHLN---ELLELRE-IVELTPSDHSHAR 1175
E+ R E K+L E + K E+RL+ E +L+ ELLE RE +E + +
Sbjct: 64 EEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQ 123
Query: 1176 EAFKSMGINNIQDLNKSINQLQQKIEK----TK--AKIIAYNNGEDLTEEPKAKLSKEIA 1229
+ + ++L + I + Q++E+ T AK I E+ A L KEI
Sbjct: 124 QELEKK----EEELEELIEEQLQELERISGLTAEEAKEILLEKVEEEARHEAAVLIKEI- 178
Query: 1230 VPESEAEFKAWLIETKKKRAYII 1252
E EA+ E K+A I
Sbjct: 179 --EEEAK------EEADKKAKEI 193
Score = 32.1 bits (74), Expect = 3.6
Identities = 17/56 (30%), Positives = 27/56 (48%)
Query: 761 KQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSE 816
K+I E K E E K++ +L +EE + + +K E+ E + KR K E
Sbjct: 41 KRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEE 96
Score = 31.7 bits (73), Expect = 4.2
Identities = 27/143 (18%), Positives = 58/143 (40%), Gaps = 22/143 (15%)
Query: 1189 LNKSINQLQQKIEKTKAKII---AYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLIETK 1245
+ K I + + K + +AK I A E + +E + +EI +E E E +
Sbjct: 24 VRKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEK-----ELR 78
Query: 1246 KKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDEDWDVYKV 1305
++R + + Q+ ++L ++ + L + E++ + ++E+ +
Sbjct: 79 ERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELE---- 134
Query: 1306 INKDAGDTDSEEEQERLIELEEI 1328
E +E+L ELE I
Sbjct: 135 ----------ELIEEQLQELERI 147
Score = 31.3 bits (72), Expect = 6.7
Identities = 28/122 (22%), Positives = 59/122 (48%), Gaps = 15/122 (12%)
Query: 696 QKIKLKTSRFETPYQRFLKSKEYAKAIQEKVDKENAE-QKKIDALTPQVKTEPEENVKIT 754
++++ + + + +R L+ +E E ++K E +KK L + ++ ++
Sbjct: 75 KELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQK-----QQELEKK 129
Query: 755 QEPVVLKQIKSEEKVEME------KEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKT 808
+E L+++ E+ E+E E+ K+IL E+ EE+ A IK E EE+ E+
Sbjct: 130 EE--ELEELIEEQLQELERISGLTAEEAKEILLEKVEEEARHEAAVLIK-EIEEEAKEEA 186
Query: 809 KR 810
+
Sbjct: 187 DK 188
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 38.2 bits (89), Expect = 0.018
Identities = 20/63 (31%), Positives = 28/63 (44%)
Query: 765 SEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKENDSSK 824
E EKE K+ +E E E+EE+ KK K K E+ +K K+ K+ K
Sbjct: 129 GSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKK 188
Query: 825 GKS 827
K
Sbjct: 189 KKK 191
Score = 30.8 bits (70), Expect = 5.3
Identities = 20/67 (29%), Positives = 42/67 (62%), Gaps = 1/67 (1%)
Query: 743 VKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPE 802
+ +E E + K T V K+ + EE+ + EK+K+K++ +E++E+K+++ M + K K +
Sbjct: 128 LGSESETSEKETTAKVE-KEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKK 186
Query: 803 ESNEKTK 809
+ +K K
Sbjct: 187 KKKKKKK 193
>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
Members of this family are associated with type VII
secretion of WXG100 family targets in the Firmicutes,
but not in the Actinobacteria. This highly divergent
protein family consists largely of a central region of
highly polar low-complexity sequence containing
occasional LF motifs in weak repeats about 17 residues
in length, flanked by hydrophobic N- and C-terminal
regions [Protein fate, Protein and peptide secretion and
trafficking].
Length = 150
Score = 37.4 bits (87), Expect = 0.025
Identities = 22/108 (20%), Positives = 41/108 (37%), Gaps = 12/108 (11%)
Query: 677 IAQIGLFFEDKMKIGSNKYQK--IKLKTS--RFETPYQRFLKSKEYAKAIQEKVDKENAE 732
A + K+++ N+Y+K I++ T + ET + L + E K I + KE E
Sbjct: 13 SADSAAENDGKLQVKPNRYEKKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPE 72
Query: 733 QKKIDALTPQVKTEPEENVKITQ--------EPVVLKQIKSEEKVEME 772
Q+++ VK T+ E + E+ +
Sbjct: 73 QEELKNQLFSENATENNTVKATKKQLFSSEYEQTSSSSESTSEEETKK 120
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 38.2 bits (89), Expect = 0.026
Identities = 21/141 (14%), Positives = 40/141 (28%), Gaps = 6/141 (4%)
Query: 721 AIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEME-KEKRKKI 779
E V P + +PE Q + E + KEK K
Sbjct: 32 VGIELVPLAVFLLAAKVLEAPTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPE 91
Query: 780 LREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKENDSSKGKSGGTISTV--DFS 837
++ ++ + +P K K + + K K+ K + + + +
Sbjct: 92 --KKPKKPKPKPKPKPKPKPKVKPQPKPKKPPS-KTAAKAPAAPNQPARPPSAASASGAA 148
Query: 838 KVNYNKYMAKPDTKILPDPYY 858
Y++ I P Y
Sbjct: 149 TGPSASYLSGLRRAIRRAPRY 169
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 37.2 bits (87), Expect = 0.080
Identities = 23/100 (23%), Positives = 52/100 (52%), Gaps = 11/100 (11%)
Query: 710 QRFLKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKV 769
+R + +E +QE++ +E ++++ + + E E+ +I ++I+ E++
Sbjct: 64 KRKEERREGRAVLQEQI-EEREKRRQEEYE--ERLQEREQMDEII------ERIQEEDEA 114
Query: 770 EME--KEKRKKILREREEEKEEQPAMKKIKVEKPEESNEK 807
E + +EK+KK+ E +E EE+ K+ + E+ E K
Sbjct: 115 EAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEELK 154
Score = 31.8 bits (73), Expect = 3.7
Identities = 28/174 (16%), Positives = 70/174 (40%), Gaps = 13/174 (7%)
Query: 1122 EQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSM 1181
++K+ R+E L E ++ + R + E L E ++ EI+E + + +
Sbjct: 63 RKRKEERREGRAVLQEQIEEREKRRQEEYEERLQEREQMDEIIERIQEEDEAEAQEKREK 122
Query: 1182 GINNIQDLNKSINQLQQKIEKTKA-------KIIAYNNGEDLTEEPKAKLSKEIAVPESE 1234
+++++ + ++ E+ K KI+ Y + EE + +E +
Sbjct: 123 QKKLREEIDEFNEERIERKEEEKEREREEELKILEYQREKAEREEEREAERRER-----K 177
Query: 1235 AEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEK 1288
E + + + ++ D++ + R DL + +ER + E ++ +
Sbjct: 178 EEKEREVARLRAQQEEAEDEREELDELRADLYQEE-YERKERQKEKEEAEKRRR 230
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 36.9 bits (85), Expect = 0.094
Identities = 29/131 (22%), Positives = 57/131 (43%), Gaps = 16/131 (12%)
Query: 723 QEKVDKENAEQKKIDALTPQVK---------TEPEENVKITQEPVVLKQIKSEEKVEMEK 773
+EK + + E+ + +T ++K V+ +E LKQ + E +E+E+
Sbjct: 159 EEKPKRGSLEENNGEFMTHKLKHTENTFSRGGAEGAQVEAGKEFEKLKQKQQEAALELEE 218
Query: 774 EK-----RKKILREREEEKEEQPAMKKIK--VEKPEESNEKTKRHKIKSEPKENDSSKGK 826
K R+K+L E E+ ++++ A +K + EK E +R +E ++ G
Sbjct: 219 LKKKREERRKVLEEEEQRRKQEEADRKSREEEEKRRLKEEIERRRAEAAEKRQKVPEDGL 278
Query: 827 SGGTISTVDFS 837
S F+
Sbjct: 279 SEDKKPFKCFT 289
Score = 33.1 bits (75), Expect = 1.8
Identities = 26/104 (25%), Positives = 46/104 (44%), Gaps = 3/104 (2%)
Query: 728 KENAEQKKI-DALTPQVKTEPEENVKITQEPVVLKQIKS-EEKVEMEKEKRKKILREREE 785
+E ++ +AL Q + +P + EP Q S E +E+E++++ EREE
Sbjct: 66 REERRDERFSEALERQKEFKPTSTDQSLSEPSRRMQEDSGAENETVEEEEKEESREEREE 125
Query: 786 EKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKENDSSKGKSGG 829
+E + K + ++ E K K EP+E + K S
Sbjct: 126 VEETEGVTKSEQKNDWRDAEECQKEEKEP-EPEEEEKPKRGSLE 168
Score = 31.9 bits (72), Expect = 3.4
Identities = 28/135 (20%), Positives = 57/135 (42%), Gaps = 16/135 (11%)
Query: 673 QDKKIAQIGLFFEDKMKIGSNKYQKIKLKTSRFETPYQRFLKSKEYAKAIQEKVDKENAE 732
+ G F K+K N + + + ++ E + ++ +A E +++ +
Sbjct: 164 RGSLEENNGEFMTHKLKHTENTFSRGGAEGAQVEAGKEFEKLKQKQQEAALE-LEELKKK 222
Query: 733 QKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPA 792
++ + K EE Q KQ +++ K E+EKR+ L+E E + + A
Sbjct: 223 RE------ERRKVLEEE----EQRR---KQEEADRKSREEEEKRR--LKEEIERRRAEAA 267
Query: 793 MKKIKVEKPEESNEK 807
K+ KV + S +K
Sbjct: 268 EKRQKVPEDGLSEDK 282
>gnl|CDD|218744 pfam05781, MRVI1, MRVI1 protein. This family consists of mammalian
MRVI1 proteins which are related to the
lymphoid-restricted membrane protein (JAW1) and the IP3
receptor associated cGMP kinase substrates A and B
(IRAGA and IRAGB). The function of MRVI1 is unknown
although mutations in the Mrvi1 gene induces myeloid
leukaemia by altering the expression of a gene important
for myeloid cell growth and/or differentiation so it has
been speculated that Mrvi1 is a tumour suppressor gene.
IRAG is very similar in sequence to MRVI1 and is an
essential NO/cGKI-dependent regulator of IP3-induced
calcium release. Activation of cGKI decreases
IP3-stimulated elevations in intracellular calcium,
induces smooth muscle relaxation and contributes to the
antiproliferative and pro-apoptotic effects of NO/cGMP.
Jaw1 is a member of a class of proteins with
COOH-terminal hydrophobic membrane anchors and is
structurally similar to proteins involved in vesicle
targeting and fusion. This suggests that the function
and/or the structure of the ER in lymphocytes may be
modified by lymphoid-restricted resident ER proteins.
Length = 538
Score = 36.5 bits (84), Expect = 0.16
Identities = 24/84 (28%), Positives = 37/84 (44%), Gaps = 6/84 (7%)
Query: 757 PVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSE 816
P +L+ I + + E+EK+ K L++ E + E+ + K EE E +
Sbjct: 400 PSLLRFISTYSWADAEEEKKTKKLQDLREPEGEEAVERTRKPSLSEEVAETEE------W 453
Query: 817 PKENDSSKGKSGGTISTVDFSKVN 840
KE SSK S T V + K N
Sbjct: 454 DKEQRSSKLASWVTFLQVSYRKAN 477
>gnl|CDD|219714 pfam08066, PMC2NT, PMC2NT (NUC016) domain. This domain is found at
the N-terminus of 3'-5' exonucleases with HRDC domains,
and also in putative exosome components.
Length = 91
Score = 33.4 bits (77), Expect = 0.20
Identities = 20/90 (22%), Positives = 43/90 (47%), Gaps = 4/90 (4%)
Query: 105 IVKCTKTAQSFPSTHENSLLIASPNYISGVAGTSDKVMTLVDSLLKTQ----NISKSMSK 160
+V+ T+ A S + + P + + S ++++L++ LL++ I +
Sbjct: 1 LVQTTRAASSLAAQDIDFYRSFDPEFSQSLDEQSQRLLSLINDLLQSADSRGRIPDRSEE 60
Query: 161 LYLEGQKDILTEANDKLLESINTRIDVMAG 190
+E Q +++ ND LLE +T +D + G
Sbjct: 61 DDVEDQWKAVSDVNDSLLEKADTLLDELKG 90
Score = 33.4 bits (77), Expect = 0.20
Identities = 20/90 (22%), Positives = 43/90 (47%), Gaps = 4/90 (4%)
Query: 1550 IVKCTKTAQSFPSTHENSLLIASPNYISGVAGTSDKVMTLVDSLLKTQ----NISKSMSK 1605
+V+ T+ A S + + P + + S ++++L++ LL++ I +
Sbjct: 1 LVQTTRAASSLAAQDIDFYRSFDPEFSQSLDEQSQRLLSLINDLLQSADSRGRIPDRSEE 60
Query: 1606 LYLEGQKDILTEANDKLLESINTRIDVMAG 1635
+E Q +++ ND LLE +T +D + G
Sbjct: 61 DDVEDQWKAVSDVNDSLLEKADTLLDELKG 90
>gnl|CDD|180240 PRK05759, PRK05759, F0F1 ATP synthase subunit B; Validated.
Length = 156
Score = 34.4 bits (80), Expect = 0.23
Identities = 23/93 (24%), Positives = 43/93 (46%), Gaps = 12/93 (12%)
Query: 1195 QLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLIETKKKRAYIIDK 1254
+ + +E+ + KI + L +AK E+A + EA+ L E + + A II+
Sbjct: 28 PIMKALEERQKKIA-----DGLAAAERAKKELELAQAKYEAQ----LAEARAEAAEIIE- 77
Query: 1255 KNARKQRRQDLAKRRTAAAQERMRLISELARKE 1287
A+K+ Q + + + A E R+ A+ E
Sbjct: 78 -QAKKRAAQIIEEAKAEAEAEAARIK-AQAQAE 108
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino acids
in length. This family contains a P-loop motif suggesting
it is a nucleotide binding protein. It may be involved in
replication.
Length = 1198
Score = 36.2 bits (84), Expect = 0.23
Identities = 53/269 (19%), Positives = 97/269 (36%), Gaps = 49/269 (18%)
Query: 1119 LTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLE-LREIVELTPSDHSHAREA 1177
L EQQ + K + + ++ E +L + L +LLE + +E D R
Sbjct: 661 LQNEQQSLKDK--LELAIAERKQQAETQLRQLDAQLKQLLEQQQAFLEALKDDFRELRTE 718
Query: 1178 FKSMGINNIQDLNKSINQLQQKIEK--------TKAKIIAYNN---GEDLTEEPKAKLSK 1226
+ +L+ + QL IE K Y+ D+ +L +
Sbjct: 719 RLAKWQVVEGELDNQLAQLSAAIEAARTQAKARLKELKKQYDRELASLDVDPNTVKELKR 778
Query: 1227 EI----------AVPESEA-EFKAWLIETKKKRAYIIDKKNARKQR-----------RQD 1264
+I AV E E++A++ ET R + +++ + +Q+
Sbjct: 779 QIEELETTIERIAVRRPEVREYRAFMQETWLHRDSLREERPNLAIQLRELESSAEELQQE 838
Query: 1265 LAKRRTAAAQERMRLISELARKEKRDDDF-----GMRDEDWDVYKVINKDAGDTDSEEEQ 1319
L + R +L E EK+ D G+RDE + + + ++ + +
Sbjct: 839 LTRLIKDTKLRRKKLEQERKALEKQLDQLDELLRGLRDE----MRQLAELKEPANANQAE 894
Query: 1320 ----ERLIELEEILRQHDPEFTSLNQEQE 1344
ERL +LEE R+ L + E
Sbjct: 895 GSISERLDQLEEFKRKRKRLSGDLKKFLE 923
>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612. This
family includes several uncharacterized proteins from
Caenorhabditis elegans.
Length = 517
Score = 35.8 bits (81), Expect = 0.23
Identities = 28/115 (24%), Positives = 59/115 (51%), Gaps = 4/115 (3%)
Query: 706 ETPYQRFLKSKEYAKAIQEKVDKENAEQKKIDALTPQVK----TEPEENVKITQEPVVLK 761
E P Q K K Q+++ K++ ++K++A K E E + QE +
Sbjct: 62 EQPQQVEKVKKSEKKKAQKQIAKDHEAEQKVNAKKAAEKEARRAEAEAKKRAAQEEEHKQ 121
Query: 762 QIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSE 816
+E+++ E+EK++ L++ + EK+++ A+K K EK E++ + + ++ E
Sbjct: 122 WKAEQERIQKEQEKKEADLKKLQAEKKKEKAVKAEKAEKAEKTKKASTPAPVEEE 176
>gnl|CDD|193205 pfam12729, 4HB_MCP_1, Four helix bundle sensory module for signal
transduction. This family is a four helix bundle that
operates as a ubiquitous sensory module in prokaryotic
signal-transduction. The 4HB_MCP is always found between
two predicted transmembrane helices indicating that it
detects only extracellular signals. In many cases the
domain is associated with a cytoplasmic HAMP domain
suggesting that most proteins carrying the bundle might
share the mechanism of transmembrane signalling which is
well-characterized in E coli chemoreceptors.
Length = 181
Score = 34.5 bits (80), Expect = 0.26
Identities = 27/104 (25%), Positives = 39/104 (37%), Gaps = 27/104 (25%)
Query: 1183 INNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLI 1242
+ +I++L I++L +K EKT LTEE K EFK L
Sbjct: 81 LKDIEELRAEIDKLLKKYEKTI-----------LTEEEKKLF----------NEFKEQLK 119
Query: 1243 ETKKKRAYIID--KKN----ARKQRRQDLAKRRTAAAQERMRLI 1280
+K R ++D K A +L R A + LI
Sbjct: 120 AYRKVRNKVLDLAKAGKNDEAYALYLTELEPARDAVIEALDELI 163
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding
protein (GBP) is induced by interferon-gamma during
macrophage induction. This family contains GBP1 and
GPB2, both GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 35.3 bits (82), Expect = 0.27
Identities = 32/128 (25%), Positives = 56/128 (43%), Gaps = 24/128 (18%)
Query: 682 LFFEDKMKIGSNKYQKIKLKTSRFETPYQRFLKSKE---------------YAKAIQEKV 726
LF E++ K+ KY ++ K + E Q FL SKE KAI+ +
Sbjct: 152 LFLEERDKL-EAKYNQVPRKGVKAEEVLQEFLNSKEAVEEAILQTDQALTAKEKAIEAER 210
Query: 727 DKENAEQKKIDALTPQVKTEP---EENVKITQEPVVLKQIKS---EEKVEMEKEKRKKIL 780
K A + + + L + K E E + QE V KQ+ E+ ++ E+ + +
Sbjct: 211 AKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHV--KQLIEKMEAEREKLLAEQERMLE 268
Query: 781 REREEEKE 788
+ +E++E
Sbjct: 269 HKLQEQEE 276
>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 926
Score = 36.1 bits (83), Expect = 0.27
Identities = 29/132 (21%), Positives = 43/132 (32%), Gaps = 24/132 (18%)
Query: 777 KKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKENDSSKGKSGGTISTVDF 836
K R +++ KE+ P K+IK+E P E + + E K D+
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAP------------- 838
Query: 837 SKVNYNKYMAKPD-----TKILPD--PYYEYLDELRDSSLPIVFDNGAWCCRVGWASCEK 889
N KP T+ILP Y ++ D +P+ G E
Sbjct: 839 --AILNVKKKKPYKVDNMTRILPQQSRYISFI--KDDRFVPVRKFKGGVVVLRDREPKEP 894
Query: 890 PNLIFKNLIAKP 901
LI K
Sbjct: 895 VALIETVRQMKD 906
>gnl|CDD|219572 pfam07780, Spb1_C, Spb1 C-terminal domain. This presumed domain is
found at the C-terminus of a family of FtsJ-like
methyltransferases. Members of this family are involved
in 60S ribosomal biogenesis.
Length = 212
Score = 34.6 bits (80), Expect = 0.32
Identities = 27/94 (28%), Positives = 42/94 (44%), Gaps = 15/94 (15%)
Query: 1197 QQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLIETKKKRAYIIDKK- 1255
+++EK K K A ED++E KAK K++ KA + K+K+ Y++ KK
Sbjct: 125 LKRLEKAKKKAEAIAEDEDISEREKAKQIKKLYK-------KAKKKKKKRKKTYVVAKKG 177
Query: 1256 NARKQRRQDLAKRRTAAAQERMRLISELARKEKR 1289
N K R K + RM +K+ R
Sbjct: 178 NKGKAGRPKGVKGKYKVVDPRM-------KKDLR 204
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 35.2 bits (81), Expect = 0.35
Identities = 24/120 (20%), Positives = 43/120 (35%), Gaps = 5/120 (4%)
Query: 714 KSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIK----SEEKV 769
K ++ + ++ E K D + EP + E + EE
Sbjct: 229 KKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETE 288
Query: 770 EMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKENDSSKGKSGG 829
E EKEKRK+ L++ E+++E M+ + EE + + +E S
Sbjct: 289 EKEKEKRKR-LKKMMEDEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEVTVSPD 347
Score = 32.9 bits (75), Expect = 1.7
Identities = 19/94 (20%), Positives = 41/94 (43%), Gaps = 12/94 (12%)
Query: 731 AEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQ 790
++KK + + E + + V+L+ +S E ++++ E+E E +
Sbjct: 226 TKEKKEKKEASESTVKEESEEESGKRDVILED-ESAEPTGLDED---------EDEDEPK 275
Query: 791 PAMKKIKVEKPEESNEKTKRHKIKSEPKENDSSK 824
P+ ++ E+ E EK KR ++K D +
Sbjct: 276 PSGERSDSEEETEEKEKEKRKRLKK--MMEDEDE 307
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 35.3 bits (82), Expect = 0.40
Identities = 30/105 (28%), Positives = 50/105 (47%), Gaps = 13/105 (12%)
Query: 710 QRFLKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKV 769
++ ++E AK I E+ KE AE K +AL + K EE K+ E + +++K
Sbjct: 25 KKLGSAEELAKRIIEEAKKE-AETLKKEALL-EAK---EEVHKLRAE--LERELKERRNE 77
Query: 770 EMEKEKRKKILREREE---EKEEQPAMKKIKVEKPEESNEKTKRH 811
E+R L +REE K E K+ +EK E+ +++
Sbjct: 78 LQRLERR---LLQREETLDRKMESLDKKEENLEKKEKELSNKEKN 119
Score = 32.2 bits (74), Expect = 2.9
Identities = 34/130 (26%), Positives = 61/130 (46%), Gaps = 14/130 (10%)
Query: 1232 ESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDD 1291
+E K +IE KK A + KK A + ++++ K R +E +EL R E+R
Sbjct: 29 SAEELAKR-IIEEAKKEAETL-KKEALLEAKEEVHKLRAELERELKERRNELQRLERRLL 86
Query: 1292 DFGMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQE------- 1344
R+E D K+ + D + + E++++ L E+ L + + E L EQ
Sbjct: 87 Q---REETLD-RKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELERIS 142
Query: 1345 -LSPKEANQL 1353
L+ +EA ++
Sbjct: 143 GLTQEEAKEI 152
Score = 31.4 bits (72), Expect = 5.4
Identities = 24/89 (26%), Positives = 51/89 (57%), Gaps = 3/89 (3%)
Query: 713 LKSKEYAKAIQEKVDKENAEQ-KKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEM 771
L++KE ++ ++++E E+ ++ L ++ + EE + E + K+ ++ EK E
Sbjct: 54 LEAKEEVHKLRAELERELKERRNELQRLERRLL-QREETLDRKMESLDKKE-ENLEKKEK 111
Query: 772 EKEKRKKILREREEEKEEQPAMKKIKVEK 800
E ++K L E+EEE EE A ++ ++E+
Sbjct: 112 ELSNKEKNLDEKEEELEELIAEQREELER 140
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 34.8 bits (80), Expect = 0.45
Identities = 32/119 (26%), Positives = 45/119 (37%), Gaps = 10/119 (8%)
Query: 719 AKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKK 778
KA E K AE K A + K E E K E K+ ++E K + E +KK
Sbjct: 161 KKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAE--AKKKAEAEAKKKAAAEAKKK 218
Query: 779 ILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKEND------SSKG--KSGG 829
E + + A K EK + K K+ + +D S K K+GG
Sbjct: 219 AAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAEVDDLFGGLDSGKNAPKTGG 277
Score = 33.2 bits (76), Expect = 1.6
Identities = 26/113 (23%), Positives = 46/113 (40%), Gaps = 10/113 (8%)
Query: 710 QRFLKSKEYAKAIQEKVDKENAEQKKIDA---LTPQVKTEPEENVKITQEPVVLKQIKSE 766
QR K ++ A+ +Q+K E K+++ + K + EE K KQ ++E
Sbjct: 80 QRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAAL----KQKQAE 135
Query: 767 EKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKE 819
E K + E K A KK E +++ + + K +E K+
Sbjct: 136 EAAAKAAAAAK--AKAEAEAKRAAAAAKKAAAEAKKKAEAEAAK-KAAAEAKK 185
>gnl|CDD|223820 COG0749, PolA, DNA polymerase I - 3'-5' exonuclease and polymerase
domains [DNA replication, recombination, and repair].
Length = 593
Score = 34.9 bits (81), Expect = 0.51
Identities = 24/102 (23%), Positives = 40/102 (39%), Gaps = 4/102 (3%)
Query: 324 ITEPEQVTQLVSELKQQQEIAIDLEYHNYRSYQGYTCLMQISTRDKDYIVDTLKLREDLE 383
IT+ + +++L IA D E + + +++ ++ + L E L
Sbjct: 6 ITDLAVLNAWLTKLNAAANIAFDTETDGLDPHGADLVGLSVASEEEAAYIPLLHGPEQLN 65
Query: 384 V---LNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDT 422
V L +L D+ I KV D K L G+ FDT
Sbjct: 66 VLAALKPLLEDEGIKKVGQNLKYDYKVLAN-LGIEPGVAFDT 106
Score = 34.9 bits (81), Expect = 0.51
Identities = 24/102 (23%), Positives = 40/102 (39%), Gaps = 4/102 (3%)
Query: 1769 ITEPEQVTQLVSELKQQQEIAIDLEYHNYRSYQGYTCLMQISTRDKDYIVDTLKLREDLE 1828
IT+ + +++L IA D E + + +++ ++ + L E L
Sbjct: 6 ITDLAVLNAWLTKLNAAANIAFDTETDGLDPHGADLVGLSVASEEEAAYIPLLHGPEQLN 65
Query: 1829 V---LNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDT 1867
V L +L D+ I KV D K L G+ FDT
Sbjct: 66 VLAALKPLLEDEGIKKVGQNLKYDYKVLAN-LGIEPGVAFDT 106
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 157
Score = 33.4 bits (77), Expect = 0.53
Identities = 17/93 (18%), Positives = 49/93 (52%), Gaps = 7/93 (7%)
Query: 709 YQRFLKSKEYAKAIQEKVDKENAE-QKKIDALTPQVKTEPE----ENVKITQEPVVLKQI 763
Q+ L KA Q++++KE + Q ++ +++ E + + +++E KQ
Sbjct: 23 VQKVLSESPAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQ 82
Query: 764 KSEEKVEMEKEKRKKILREREEEKEEQPAMKKI 796
+ ++K + ++K++ ++E ++++Q ++ I
Sbjct: 83 ELQQKQQELQQKQQAA--QQELQQKQQELLQPI 113
>gnl|CDD|151681 pfam11239, DUF3040, Protein of unknown function (DUF3040). Some
members in this family of proteins with unknown function
are annotated as membrane proteins however this cannot be
confirmed.
Length = 82
Score = 31.8 bits (73), Expect = 0.54
Identities = 12/24 (50%), Positives = 15/24 (62%)
Query: 1315 SEEEQERLIELEEILRQHDPEFTS 1338
SE EQ RL E+E L DP+F +
Sbjct: 4 SEHEQRRLEEIERALYAEDPKFAA 27
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 35.0 bits (81), Expect = 0.55
Identities = 55/216 (25%), Positives = 99/216 (45%), Gaps = 33/216 (15%)
Query: 1122 EQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSM 1181
E++++R +EL KKL E+ +KR E L + E +E K +
Sbjct: 334 EEKEERLEELKKKLKEL--EKRLEELEERHELYEEAKAKKE----------ELERLKKRL 381
Query: 1182 GINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESE-AEFKAW 1240
+ L K + +++EK K +I EE +K++ I + E E K
Sbjct: 382 TGLTPEKLEKEL----EELEKAKEEI----------EEEISKITARIGELKKEIKELKKA 427
Query: 1241 LIETKK-KRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDED 1299
+ E KK K + + ++ R++L + TA E R+ EL E+++ +R E
Sbjct: 428 IEELKKAKGKCPVCGRELTEEHRKELLEEYTA---ELKRIEKELKEIEEKERK--LRKEL 482
Query: 1300 WDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPE 1335
++ KV+ K++ +E E+L ELEE L++++ E
Sbjct: 483 RELEKVLKKESELIKLKELAEQLKELEEKLKKYNLE 518
Score = 33.5 bits (77), Expect = 1.4
Identities = 26/134 (19%), Positives = 63/134 (47%), Gaps = 6/134 (4%)
Query: 689 KIGSNKYQKIKLKTSRFETPYQRFLKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPE 748
++G ++++ + E Y +L+ K+ K ++ + + ++++D ++ +
Sbjct: 581 ELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDKAFEELAETEK 640
Query: 749 ENVKITQEPVVLKQIKSEEKVE------MEKEKRKKILREREEEKEEQPAMKKIKVEKPE 802
++ +E L++ SEE+ E +E + LR EE E++ K +EK +
Sbjct: 641 RLEELRKELEELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLK 700
Query: 803 ESNEKTKRHKIKSE 816
E E+ ++ K + E
Sbjct: 701 EELEEREKAKKELE 714
Score = 33.5 bits (77), Expect = 1.6
Identities = 32/145 (22%), Positives = 59/145 (40%), Gaps = 10/145 (6%)
Query: 684 FEDKMKIGSNKYQKIKLKTSRFETPYQRFLKSKEYAKAIQEKVDKENAE-QKKIDALTPQ 742
F ++ + +K + + +K E + E++ K+ E +K+++ L +
Sbjct: 301 FYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEER 360
Query: 743 VKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEE-QPAMKKIKVEKP 801
+ E K + L+++K EK +K L E E+ KEE + + KI
Sbjct: 361 HELYEEAKAKKEE----LERLKKRLTGL-TPEKLEKELEELEKAKEEIEEEISKITARIG 415
Query: 802 EESNEKTKRHKIKSEPKENDSSKGK 826
E E + K E K +KGK
Sbjct: 416 ELKKEIKELKKAIEELK---KAKGK 437
Score = 33.1 bits (76), Expect = 1.8
Identities = 38/169 (22%), Positives = 79/169 (46%), Gaps = 20/169 (11%)
Query: 1129 KELAKKLVEMNAKKREERLVDDERHLNELL----ELREIVELTPSDHSHARE-----AFK 1179
E + +E + +K EE++ + E + EL EL E V+ A E F
Sbjct: 243 LEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFY 302
Query: 1180 SMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKA 1239
++ ++++ K +++L+++I + +I ++L EE + +L +E+ E E +
Sbjct: 303 EEYLDELREIEKRLSRLEEEINGIEERI------KEL-EEKEERL-EELKKKLKELEKR- 353
Query: 1240 WLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEK 1288
+E ++R + ++ A+K+ + L KR T E++ E K K
Sbjct: 354 --LEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAK 400
Score = 32.3 bits (74), Expect = 3.7
Identities = 52/227 (22%), Positives = 95/227 (41%), Gaps = 36/227 (15%)
Query: 1122 EQQKDRRKELAKKLVEMNAKKREERLVDDE-----RHLNELLELREIVELTPSDHSHARE 1176
E K++ KEL + L E+N E + +E + + EL EL+E +E +E
Sbjct: 193 ELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIE------ELEKE 246
Query: 1177 AFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAE 1236
G + + L + I +L+++IE+ K +I E+L E K K KE+ + +AE
Sbjct: 247 LESLEG--SKRKLEEKIRELEERIEELKKEI------EEL--EEKVKELKEL---KEKAE 293
Query: 1237 FKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMR 1296
L E ++ +D+ ++R L + I EL KE+R ++ +
Sbjct: 294 EYIKLSEFYEE---YLDELREIEKRLSRLEEE----INGIEERIKELEEKEERLEELKKK 346
Query: 1297 DEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQ 1343
+ ++ + + E E +E L + T L E+
Sbjct: 347 LK-----ELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEK 388
>gnl|CDD|234792 PRK00558, uvrC, excinuclease ABC subunit C; Validated.
Length = 598
Score = 34.7 bits (81), Expect = 0.58
Identities = 22/96 (22%), Positives = 39/96 (40%), Gaps = 20/96 (20%)
Query: 1098 AEFYDSNVV--KVQLPYAVPVPNLTTEQQKDRRKELA-------KKLVEM---NAK-KRE 1144
+FY + ++ +P+ + L R+ ++ K+L+E+ NA+ E
Sbjct: 298 GQFYLQKSIPKEILVPHELDDEELAEALLAGRKVKIRVPQRGDKKELLELAEKNAREALE 357
Query: 1145 ERLVDDERHLNELLELREIVEL--TPS-----DHSH 1173
+L D L EL E++ L P D SH
Sbjct: 358 RKLSDQSTQQQALEELAELLGLPEPPYRIECFDISH 393
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 34.6 bits (80), Expect = 0.65
Identities = 45/213 (21%), Positives = 91/213 (42%), Gaps = 25/213 (11%)
Query: 1139 NAKKRE-ERLVDDERHLNELLELRE-------IVELTPSDHSHAREAFKSMGINNI--QD 1188
NA + + ++DD L +L E RE VE SD + + K+ I +D
Sbjct: 145 NATPSDRQDMIDDLLQLGKLEEYRERASDARLGVERVLSDQRGSLDQLKAQ-IEEKEEKD 203
Query: 1189 LNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLS----------KEIAVPESE-AEF 1237
L++ +N L+ ++ + +I Y + E + + +E+ E+E +
Sbjct: 204 LHERLNGLESELAELDEEIERYEEQREQARETRDEADEVLEEHEERREELETLEAEIEDL 263
Query: 1238 KAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRD 1297
+ + ET+++R + ++ ++R ++L + R E ++ E R ++ RD
Sbjct: 264 RETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRD 323
Query: 1298 EDW-DVYKVINKDAGDTDSEEE--QERLIELEE 1327
E+ D + A + E E +E +LEE
Sbjct: 324 EELRDRLEECRVAAQAHNEEAESLREDADDLEE 356
Score = 30.8 bits (70), Expect = 9.7
Identities = 47/264 (17%), Positives = 87/264 (32%), Gaps = 53/264 (20%)
Query: 1120 TTEQQKDRRKELAKKLVEMNAKKREER------LVDDERHL-----------NELLELRE 1162
++ + +E A L E + REE L + + E+ ELRE
Sbjct: 339 AHNEEAESLREDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEELRE 398
Query: 1163 IVELTPSDHSHAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNN---------- 1212
P D +A E F +L + +L+ + +T + +
Sbjct: 399 RFGDAPVDLGNA-EDFLEELREERDELREREAELEATL-RTARERVEEAEALLEAGKCPE 456
Query: 1213 -GEDLTEEP-----------KAKLSKEIAVPESEAEFKAWLIETKKKRAYIIDKKNARKQ 1260
G+ + P +L E+ E E E +E + D+ ++
Sbjct: 457 CGQPVEGSPHVETIEEDRERVEELEAELEDLEEEVEEVEERLERAEDLVEAEDRIERLEE 516
Query: 1261 RRQDLAKR----RTAAAQERMRLISELARKEKRDDDFGMRDEDWDVYKVINKDAGDTDSE 1316
RR+DL + R ++R R R + + + + E + ++E
Sbjct: 517 RREDLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREA--------AAEAEEEAE 568
Query: 1317 EEQERLIELEEILRQHDPEFTSLN 1340
E +E + EL L + SL
Sbjct: 569 EAREEVAELNSKLAELKERIESLE 592
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division and
chromosome partitioning].
Length = 420
Score = 34.3 bits (79), Expect = 0.68
Identities = 30/117 (25%), Positives = 54/117 (46%), Gaps = 21/117 (17%)
Query: 1187 QDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESE-AEFKAWLIET- 1244
+K + Q+Q++I + KI ++ +AKL K++ E+E A +A LIET
Sbjct: 34 AADDKQLKQIQKEIAALEKKI-------REQQDQRAKLEKQLKSLETEIASLEAQLIETA 86
Query: 1245 ------KKKRAYIIDKKNA----RKQRRQDLAKRRTAAAQERMRLISELARKEKRDD 1291
+K+ A + + NA +++R+ LA+ AA +R A +D
Sbjct: 87 DDLKKLRKQIADLNARLNALEVQEREQRRRLAE--QLAALQRSGRNPPPALLVSPED 141
Score = 32.4 bits (74), Expect = 2.3
Identities = 29/175 (16%), Positives = 62/175 (35%), Gaps = 10/175 (5%)
Query: 1123 QQKDRRKELAKKLVEMNA--KKREERLVDDERHLNELLELREIVELTPS----DHSHARE 1176
+ D K+L K++ ++NA E + + R L E L + P +
Sbjct: 84 ETADDLKKLRKQIADLNARLNALEVQEREQRRRLAEQLAALQRSGRNPPPALLVSPEDAQ 143
Query: 1177 AFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAE 1236
+ I LN + + ++ T ++ A ++ E + +A+
Sbjct: 144 RSVRLAIY-YGALNPARAERIDALKATLKQLAAV--RAEIAAEQAELTTLLSEQRAQQAK 200
Query: 1237 FKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARK-EKRD 1290
L E KK A + + +A +++ ++L + E + A+ E
Sbjct: 201 LAQLLEERKKTLAQLNSELSADQKKLEELRANESRLKNEIASAEAAAAKAREAAA 255
Score = 30.5 bits (69), Expect = 9.5
Identities = 33/189 (17%), Positives = 66/189 (34%), Gaps = 21/189 (11%)
Query: 1113 AVPVPNLTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHS 1172
AV + + K++ K++ A +++ R D+R +L + + +E +
Sbjct: 25 AVLAAAFSAAADDKQLKQIQKEI---AALEKKIREQQDQR--AKLEKQLKSLETEIASLE 79
Query: 1173 HAREAFKSMGIN----NIQDLNKSINQLQQKIEKTK---AKIIAYNNGEDLTEEPKAKLS 1225
+ I DLN +N L+ + + + A+ +A P +S
Sbjct: 80 AQLIETA-DDLKKLRKQIADLNARLNALEVQEREQRRRLAEQLAALQRSGRNPPPALLVS 138
Query: 1226 KEIAVPESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRL---ISE 1282
PE I ++ +A K + LA R A E+ L +SE
Sbjct: 139 -----PEDAQRSVRLAIYYGALNPARAERIDALKATLKQLAAVRAEIAAEQAELTTLLSE 193
Query: 1283 LARKEKRDD 1291
++ +
Sbjct: 194 QRAQQAKLA 202
>gnl|CDD|218434 pfam05103, DivIVA, DivIVA protein. The Bacillus subtilis divIVA1
mutation causes misplacement of the septum during cell
division, resulting in the formation of small, circular,
anucleate mini-cells. Inactivation of divIVA produces a
mini-cell phenotype, whereas overproduction of DivIVA
results in a filamentation phenotype. These proteins
appear to contain coiled-coils.
Length = 131
Score = 32.6 bits (75), Expect = 0.68
Identities = 23/107 (21%), Positives = 44/107 (41%), Gaps = 11/107 (10%)
Query: 1183 INNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAE-FKAWL 1241
I + + L K +L+++IE+ + K+ Y E+ L K + V + AE KA
Sbjct: 31 IKDYEALYKENEELKEEIERLEEKLEEYKELEE-------TLQKTLVVAQETAEEVKA-- 81
Query: 1242 IETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEK 1288
+K+ II + A+ +R + A + +R +
Sbjct: 82 -NAQKEAELIIKEAEAKAERIVNDANEEAKKLATEYDELKRQSRLFR 127
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 33.8 bits (78), Expect = 0.72
Identities = 21/113 (18%), Positives = 42/113 (37%), Gaps = 23/113 (20%)
Query: 1123 QQKDRRKELAKKLVEMNA-----KKREERLVDD-----------ERHLNELLEL-REIVE 1165
+ ++ L + L + K E L E L +L +L E+ +
Sbjct: 141 LLEGLKEGLDENLEGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELED 200
Query: 1166 LTPSDHSHAREAFKSM------GINNIQDLNKSINQLQQKIEKTKAKIIAYNN 1212
P++ A+E K + + +++L + + +L+ KIE K N
Sbjct: 201 CDPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNT 253
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 33.7 bits (77), Expect = 0.78
Identities = 56/221 (25%), Positives = 94/221 (42%), Gaps = 43/221 (19%)
Query: 1042 RIDLGGFSVIHHLHKLLQ-----LKYPSHINSITPSRSEELLW------DYGFVATD--- 1087
RI + G S + + K LQ +Y I+SI P+ + E+ D +ATD
Sbjct: 10 RIIVEGASDVEVISKALQRLALGSEYNITISSIIPTTNVEIAKRAAEGADLVLIATDADR 69
Query: 1088 --------YREHLRKWL------------DAEFYDSNVVKVQLPYAVPVPNLTTEQQ--- 1124
+ E LR + D E D +V+ +L A+ L T Q+
Sbjct: 70 VGRELAEKFFEELRGMVGHIERMKIPIGHDVEHIDVELVRKELKNALVRAGLKTLQRVPE 129
Query: 1125 ----KDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKS 1180
K+ +EL +KL E+ +K EE L + E E E++E ++ ++S E K
Sbjct: 130 YMDLKEDYEELKEKLEELQKEK-EELLKELEELEAEYEEVQERLKRLEVENSRLEEMLKK 188
Query: 1181 MGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPK 1221
+ + DL K ++L+ +E + ++I+ E L PK
Sbjct: 189 L-PGEVYDLKKRWDELEPGVELPEEELISDLVKETLNLAPK 228
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 34.2 bits (79), Expect = 0.87
Identities = 22/133 (16%), Positives = 47/133 (35%), Gaps = 3/133 (2%)
Query: 1199 KIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLIETKKKRAYIIDKKNAR 1258
K + + + + A S A A KK+ + +
Sbjct: 46 KKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKK 105
Query: 1259 KQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDEDWDVYK---VINKDAGDTDS 1315
+++ L K + + ++++ + DDD + D+D D ++D D D
Sbjct: 106 AEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDV 165
Query: 1316 EEEQERLIELEEI 1328
++E E E +E+
Sbjct: 166 DDEDEEKKEAKEL 178
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria with
the myosin-like domain [Function unknown].
Length = 499
Score = 33.8 bits (77), Expect = 0.89
Identities = 46/228 (20%), Positives = 80/228 (35%), Gaps = 32/228 (14%)
Query: 1123 QQKDRRKELAKKLVEMNAKKREERLVDDERHL--NELLELREIVELTPSDHSHAREAFKS 1180
Q D R +L E+ + E+R + ER +EL + R+ E + + AR+
Sbjct: 75 QLDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAK 134
Query: 1181 MGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAW 1240
Q+L + Q Q + K L + ++ + + + S + K+
Sbjct: 135 A----QQELARLTKQAQDLQTRLKTLAEQR---RQLEAQAQSLQASQKQLQASATQLKSQ 187
Query: 1241 LIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDEDW 1300
+++ K + A I +Q Q+LA R AA L A ++ RD
Sbjct: 188 VLDLKLRSAQI-------EQEAQNLATRANAAQARTEELARRAAAAQQTAQAIQQRDA-- 238
Query: 1301 DVYKVINKD----AGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQE 1344
++ K A E + +L LE L QE
Sbjct: 239 ---QISQKAQQIAARAEQIRERERQLQRLET-------AQARLEQEVA 276
>gnl|CDD|215969 pfam00521, DNA_topoisoIV, DNA gyrase/topoisomerase IV, subunit A.
Length = 427
Score = 34.0 bits (79), Expect = 0.91
Identities = 31/132 (23%), Positives = 51/132 (38%), Gaps = 26/132 (19%)
Query: 1126 DRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSMGINN 1185
+ R E+ K+ E +K EERL E L L ++ ++E+ I
Sbjct: 315 EHRLEVYKRRKEYLLEKLEERLHILEGLLKALNKIDFVIEV----------------IRG 358
Query: 1186 IQDLNKSINQLQQKIEKTKAKIIA----YNNGEDLTEEPKAKLSKEIAVPESEAEFKAWL 1241
DL K+ +L +++ + +A + LT+E KL KEI E E E
Sbjct: 359 SIDLKKAKKELIEELSEIQADYLLDMRLRR----LTKEEIEKLEKEIE--ELEKEIAELE 412
Query: 1242 IETKKKRAYIID 1253
++ I
Sbjct: 413 KILASEKKLWIK 424
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 32.3 bits (74), Expect = 0.96
Identities = 21/99 (21%), Positives = 45/99 (45%), Gaps = 6/99 (6%)
Query: 720 KAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKI 779
+ I + +E E++K + K EE K + +V K+ + + E+EK K K +
Sbjct: 10 RIIDIEEKREEKEREK------EEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPL 63
Query: 780 LREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPK 818
R ++E ++ ++ + + P + K+ K + K
Sbjct: 64 ARYADDEDYDEELKEQERWDDPMAQFLRKKKEKTDKKGK 102
>gnl|CDD|237478 PRK13709, PRK13709, conjugal transfer nickase/helicase TraI;
Provisional.
Length = 1747
Score = 34.0 bits (78), Expect = 1.00
Identities = 18/78 (23%), Positives = 32/78 (41%), Gaps = 1/78 (1%)
Query: 740 TPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKR-KKILREREEEKEEQPAMKKIKV 798
P +P VL Q ++EE + E E+R +I+R+ E K + P K +
Sbjct: 1624 IPDNSVQPGAGNGEPVTAEVLAQRQAEEAIRRETERRADEIVRKMAENKPDLPDGKTEQA 1683
Query: 799 EKPEESNEKTKRHKIKSE 816
+ E+ + + E
Sbjct: 1684 VRDIAGQERDRAAISERE 1701
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 33.8 bits (77), Expect = 1.1
Identities = 20/112 (17%), Positives = 51/112 (45%), Gaps = 9/112 (8%)
Query: 713 LKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEME 772
LK + + + VD ++ KK+ ++ + E+ V ++ LK+ +S+E
Sbjct: 160 LKKNILSGNVSD-VDTDSISDKKVVE---ALREDNEKGVNFRRDMTDLKERESQEDA--- 212
Query: 773 KEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKENDSSK 824
KR + L+E ++K+ + K + +++ +K + + + + + K
Sbjct: 213 --KRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPK 262
Score = 31.5 bits (71), Expect = 4.8
Identities = 27/120 (22%), Positives = 46/120 (38%), Gaps = 4/120 (3%)
Query: 714 KSKEYAKAIQEKVDKE-NAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEME 772
K + KA Q+ + NA++++ + Q + + T P KQ+ +K E+E
Sbjct: 225 KQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIE 284
Query: 773 KEKRKKILREREEEKEEQPAMKKIKVEKPEESNE---KTKRHKIKSEPKENDSSKGKSGG 829
K + + + E K + +K E E K + K EP D K K
Sbjct: 285 KAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPVAEDLQKTKPQV 344
>gnl|CDD|223616 COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ
chaperones [Posttranslational modification, protein
turnover, chaperones].
Length = 786
Score = 33.8 bits (78), Expect = 1.2
Identities = 23/123 (18%), Positives = 51/123 (41%), Gaps = 19/123 (15%)
Query: 1215 DLTEEPKAKLSKEIAVPESEAEFKAWLIETKKKRAYIIDKK--NARKQRRQDLAKRRTAA 1272
DL +E A++ EI PE E + L + + ++ + ++ +K + + +
Sbjct: 384 DLLDEAGARVRLEIDKPEELDELERELAQLEIEKEALEREQDEKEKKLIDEIIKLKEGRI 443
Query: 1273 AQERMRLISELARKEKRDDDFGMRDEDWDVYKVINKDAG---DTDSEEEQERLIELEEIL 1329
+ L +E+ D+ +V+ + G E+E+E+L+ LE L
Sbjct: 444 PELEKELEAEVDED--------------DIAEVVARWTGIPVAKLLEDEKEKLLNLERRL 489
Query: 1330 RQH 1332
++
Sbjct: 490 KKR 492
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 33.2 bits (76), Expect = 1.2
Identities = 33/150 (22%), Positives = 64/150 (42%), Gaps = 11/150 (7%)
Query: 694 KYQKIKLKTSRFETPYQRFLKS--KEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENV 751
K QK+ + + E +R + AK E+ ++ A+Q K E
Sbjct: 103 KLQKLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPKHKKEAAESAS 162
Query: 752 KITQEPVVLKQIKSEEKV-----EMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNE 806
++ S+E+ E E +K K+ ++REEE+ +Q ++ + E+ +++ E
Sbjct: 163 SSLSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEEEERKQKAEE 222
Query: 807 KTKR--HKIKSEPKENDSSKGKSG--GTIS 832
++ +K PK ++G GTIS
Sbjct: 223 AWQKWMKNVKQRPKPVPLNQGMDSLRGTIS 252
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 197
Score = 32.7 bits (74), Expect = 1.4
Identities = 16/66 (24%), Positives = 38/66 (57%)
Query: 761 KQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKEN 820
+Q + E + ME E+R++ + ERE+++E++ ++ +E+ + +E + +SE + +
Sbjct: 132 RQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQDESETESD 191
Query: 821 DSSKGK 826
D K
Sbjct: 192 DDKTEK 197
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 33.5 bits (77), Expect = 1.5
Identities = 30/177 (16%), Positives = 69/177 (38%), Gaps = 20/177 (11%)
Query: 1114 VPVPNLTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHL-NELLEL-------REIVE 1165
+P + +E+ + + E K + LV+ L ELL+ +E+V
Sbjct: 84 IPRQSGLSEEVAEAEAKATSVAAEATTPKSIQELVEALEELLEELLKETASDPVVQELVS 143
Query: 1166 LTPS--DHSHAREAFKSMG------INNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLT 1217
+ D + + L+K + +L+ + E+ + + E L
Sbjct: 144 IFNDLIDSIKEDNLKDDLESLIASAKEELDQLSKKLAELKAEEEEELERALKEKREELL- 202
Query: 1218 EEPKAKLSKEIAVPESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQ 1274
+ + +L + E+ E K +E ++++ + +K ++ RQ+L ++ A Q
Sbjct: 203 SKLEEELLARLESKEAALE-KQLRLEFEREKEEL--RKKYEEKLRQELERQAEAHEQ 256
>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain. This
family represents the C-terminus (approximately 300
residues) of proteins that are involved as binding
partners for Prp19 as part of the nuclear pore complex.
The family in Drosophila is necessary for pre-mRNA
splicing, and the human protein has been found in
purifications of the spliceosome. In the past this
family was thought, erroneously, to be associated with
microfibrillin.
Length = 277
Score = 33.0 bits (75), Expect = 1.6
Identities = 32/123 (26%), Positives = 62/123 (50%), Gaps = 10/123 (8%)
Query: 714 KSKEYAKAIQ-EKVDKENAEQKKIDA---LTPQVKTEPEENVKITQEPVVLKQIKSE--- 766
+ +E AK E+ K AE++K + + +VK E E + T + + ++
Sbjct: 53 REREAAKEKALEEEAKRKAEERKRETLKIVEEEVKKELELKKRNTLLEANIDDVDTDDEN 112
Query: 767 EKVEMEKEKRKKILR-EREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPK--ENDSS 823
E+ E E K +++ R +R+ E+ E+ +K ++EK E+ +R +++ PK N +
Sbjct: 113 EEEEYEAWKLRELKRIKRDREEREEMEREKAEIEKMRNMTEEERRAELRKNPKVSTNKAK 172
Query: 824 KGK 826
KGK
Sbjct: 173 KGK 175
>gnl|CDD|176652 cd06140, DNA_polA_I_Bacillus_like_exo, inactive DEDDy 3'-5'
exonuclease domain of Bacillus stearothermophilus DNA
polymerase I and similar family-A DNA polymerases.
Bacillus stearothermophilus-like Polymerase I (Pol I), a
subgroup of the family-A DNA polymerases, contains an
inactive DnaQ-like 3'-5' exonuclease domain in the same
polypeptide chain as the polymerase region. The
exonuclease-like domain of these proteins possess the
same fold as the Klenow fragment (KF) of Escherichia
coli Pol I, but does not contain the four critical
metal-binding residues necessary for activity. The
function of this domain is unknown. It might act as a
spacer between the polymerase and the 5'-3' exonuclease
domains. Some members of this subgroup, such as those
from Bacillus sphaericus and Thermus aquaticus, are
thermostable DNA polymerases.
Length = 178
Score = 32.2 bits (74), Expect = 1.6
Identities = 27/101 (26%), Positives = 39/101 (38%), Gaps = 13/101 (12%)
Query: 341 QEIAIDLEYHNYRSYQGYTCLMQISTRDKDYIVDTLKLREDLEVLNEVLTDKNIVKVFHG 400
E+A+ +E + + ++ Y + DL L E L D+ I KV H
Sbjct: 4 DEVALYVELLGENYHTADIIGLALANGGGAYYIPLELALLDLAALKEWLEDEKIPKVGHD 63
Query: 401 ADSDIKWLQKDFGLYVVGM-FDTHQACKFLPMPRQSLAYLL 440
A L K G+ + G+ FDT A AYLL
Sbjct: 64 AKRAYVAL-KRHGIELAGVAFDTMLA-----------AYLL 92
Score = 32.2 bits (74), Expect = 1.6
Identities = 27/101 (26%), Positives = 39/101 (38%), Gaps = 13/101 (12%)
Query: 1786 QEIAIDLEYHNYRSYQGYTCLMQISTRDKDYIVDTLKLREDLEVLNEVLTDKNIVKVFHG 1845
E+A+ +E + + ++ Y + DL L E L D+ I KV H
Sbjct: 4 DEVALYVELLGENYHTADIIGLALANGGGAYYIPLELALLDLAALKEWLEDEKIPKVGHD 63
Query: 1846 ADSDIKWLQKDFGLYVVGM-FDTHQACKFLPMPRQSLAYLL 1885
A L K G+ + G+ FDT A AYLL
Sbjct: 64 AKRAYVAL-KRHGIELAGVAFDTMLA-----------AYLL 92
>gnl|CDD|203138 pfam04977, DivIC, Septum formation initiator. DivIC from B. subtilis
is necessary for both vegetative and sporulation septum
formation. These proteins are mainly composed of an amino
terminal coiled-coil.
Length = 80
Score = 30.2 bits (69), Expect = 1.7
Identities = 8/36 (22%), Positives = 17/36 (47%)
Query: 1184 NNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEE 1219
I L + +L+ + E+ +A++ + D EE
Sbjct: 24 QEIAALQAELAKLKAENEELEAEVKDLKSDPDYIEE 59
>gnl|CDD|218637 pfam05558, DREPP, DREPP plasma membrane polypeptide. This family
contains several plant plasma membrane proteins termed
DREPPs as they are developmentally regulated plasma
membrane polypeptides.
Length = 201
Score = 32.3 bits (73), Expect = 1.8
Identities = 21/72 (29%), Positives = 32/72 (44%), Gaps = 1/72 (1%)
Query: 721 AIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKIL 780
+ EKV E+KK + P +TE + VK +E VV ++ K EE V + K
Sbjct: 121 FVFEKVSTFIVEEKKPEEEAPAAETEKKPAVK-EKEIVVEEEKKEEEAVPVAAAPSTKAP 179
Query: 781 REREEEKEEQPA 792
+ EE +
Sbjct: 180 ETKVEETPAKAP 191
>gnl|CDD|236410 PRK09202, nusA, transcription elongation factor NusA; Validated.
Length = 470
Score = 32.9 bits (76), Expect = 1.8
Identities = 16/89 (17%), Positives = 38/89 (42%)
Query: 716 KEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEK 775
E+ + ++ + +++ L + + EE + E ++ + EE VE +E+
Sbjct: 349 AEFDAILDLFMEALDIDEEIAQLLVEEGFSSLEELAYVPVEELLEIEGFDEETVEELRER 408
Query: 776 RKKILREREEEKEEQPAMKKIKVEKPEES 804
K+ L +EE+ A + +E +
Sbjct: 409 AKEALETEALAQEEKLADDLLSLEGLDRE 437
>gnl|CDD|218852 pfam06008, Laminin_I, Laminin Domain I. coiled-coil structure. It
has been suggested that the domains I and II from laminin
A, B1 and B2 may come together to form a triple helical
coiled-coil structure.
Length = 263
Score = 32.4 bits (74), Expect = 1.9
Identities = 29/134 (21%), Positives = 60/134 (44%), Gaps = 13/134 (9%)
Query: 1113 AVPVPNLTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELL----ELREIVELTP 1168
A+P P ++ ++L+ +L ++A KR +L D E+ L++L L++
Sbjct: 14 ALPAPYKILYNIENLTQQLSGRLSPISATKR--QLEDLEKALSKLAQDAENLQKKANQAL 71
Query: 1169 SDHSHAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEI 1228
+ A + + + + L ++I L+ I++ K+ GE+ P + LS+ +
Sbjct: 72 AKAQKVNAAIERT-LGHAKTLAEAIKNLRDNIKEILEKLAGL--GENAFALPGSALSRML 128
Query: 1229 AVPESEAEFKAWLI 1242
A EA+ I
Sbjct: 129 A----EAQRMLGEI 138
>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
and conversion].
Length = 161
Score = 31.5 bits (72), Expect = 2.1
Identities = 24/99 (24%), Positives = 44/99 (44%), Gaps = 19/99 (19%)
Query: 1189 LNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLIETKKKR 1248
+ K++++ Q KI +DL E + K + + E E E L E +++
Sbjct: 31 ILKALDERQAKIA------------DDLAEAERLKEEAQALLAEYEQE----LEEAREQA 74
Query: 1249 AYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKE 1287
+ II+ A+K+ Q + + A +E R+ E A E
Sbjct: 75 SEIIE--QAKKEAEQIAEEIKAEAEEELERIK-EAAEAE 110
Score = 29.6 bits (67), Expect = 9.0
Identities = 21/99 (21%), Positives = 44/99 (44%), Gaps = 5/99 (5%)
Query: 714 KSKEYAKAIQEKVDKENAEQKK-----IDALTPQVKTEPEENVKITQEPVVLKQIKSEEK 768
+ KE A+A+ + ++E E ++ I+ + + EE +E + + +E +
Sbjct: 51 RLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKAEAEEELERIKEAAEAE 110
Query: 769 VEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEK 807
+E EKE+ + LR E A K + + E + +
Sbjct: 111 IEAEKERALEELRAEVAELAVAIAEKLLGKKVDEAAQKD 149
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This
family of bacterial proteins has no known function. The
proteins are in the region of 500-600 amino acid
residues in length.
Length = 450
Score = 32.6 bits (75), Expect = 2.1
Identities = 22/94 (23%), Positives = 44/94 (46%), Gaps = 8/94 (8%)
Query: 711 RFLKSKEYAKAIQEKVDKENAEQKKIDA----LTPQVKTEPEENVKITQEPVVLKQIKSE 766
F + KE K ++E++ + E +KI L + + + + E ++ + K
Sbjct: 328 DFPELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKEKLS 387
Query: 767 EKVEMEKEKRKKILREREEEKEEQPAMKKIKVEK 800
E++E +E+ K++ +EE E + KI V K
Sbjct: 388 EELEELEEELKEL----KEELESLYSEGKISVNK 417
>gnl|CDD|237358 PRK13351, PRK13351, elongation factor G; Reviewed.
Length = 687
Score = 33.0 bits (76), Expect = 2.2
Identities = 23/103 (22%), Positives = 33/103 (32%), Gaps = 9/103 (8%)
Query: 1291 DDFGMRDEDWDV----YKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELS 1346
D + + V + EE +E +L E L + D E L E E
Sbjct: 175 DLITEPELHFSEGDGGSTVEEGPIPEELLEEVEEAREKLIEALAEFDDELLELYLEGEEL 234
Query: 1347 PKEANQLHIGVERMCGPECLFQPSMLGSIQ--AGISETLNFVL 1387
E QL + P + GS GI L+ V+
Sbjct: 235 SAE--QLRAPLREGTR-SGHLVPVLFGSALKNIGIEPLLDAVV 274
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 32.5 bits (74), Expect = 2.2
Identities = 24/106 (22%), Positives = 43/106 (40%), Gaps = 5/106 (4%)
Query: 709 YQRFLKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEK 768
QR K +A E+ K+ E++K + K + K E K+ K E K
Sbjct: 95 EQRAAAEKAAKQA--EQAAKQAEEKQKQAE---EAKAKQAAEAKAKAEAEAEKKAKEEAK 149
Query: 769 VEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIK 814
+ E+E + K E +++ E + + + E+ K K + K
Sbjct: 150 KQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAK 195
Score = 31.3 bits (71), Expect = 5.5
Identities = 29/111 (26%), Positives = 39/111 (35%), Gaps = 9/111 (8%)
Query: 714 KSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEK 773
K E A E+ K+ E K A + K E E K +E K+ EE
Sbjct: 105 KQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEE---AKKQAEEEAKAKAA 161
Query: 774 EKRKKILRE-----REEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKE 819
+ KK E E K + A K K E+ + E K K +E
Sbjct: 162 AEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKA-KAAAEAAA 211
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 32.5 bits (74), Expect = 2.3
Identities = 26/127 (20%), Positives = 48/127 (37%), Gaps = 13/127 (10%)
Query: 717 EYAKAIQEKVDKENAEQKKIDA--LTP-QVKTEPEENVKITQEPVVLKQIKSEEKVEMEK 773
+Y + +EK ++ +TP + EN +I + K+E K ++
Sbjct: 293 QYREKQKEKKAFLWTLRRNRLRMIITPWRAPELHAENAEIKKT---RTAEKNEAKARKKE 349
Query: 774 EKRKKILREREEEKEE-QPAMKKIKVEKPEESNEKTKRHKIK------SEPKENDSSKGK 826
+K+ ERE +E Q + + + K K+ + P EN+ SKG
Sbjct: 350 IAQKRRAAEREINREARQERAAAMARARARRAAVKAKKKGLIDASPNEDTPSENEESKGS 409
Query: 827 SGGTIST 833
+T
Sbjct: 410 PPQVEAT 416
>gnl|CDD|234366 TIGR03825, FliH_bacil, flagellar assembly protein FliH. This
bacillus clade of FliH proteins is not found by the Pfam
FliH model pfam02108, but is closely related to the
sequences identified by that model. Sequences identified
by this model are observed in flagellar operons in an
analogous position relative to other flagellar operon
genes.
Length = 255
Score = 32.1 bits (73), Expect = 2.3
Identities = 33/149 (22%), Positives = 58/149 (38%), Gaps = 20/149 (13%)
Query: 1189 LNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLIETKKKR 1248
++ I + I + II D+ E + +L+ E E E + + +
Sbjct: 1 MSNIIKSTKSVIPAQERGIIPLRQVTDVPET-EQELADEEQEFEQILE------KAEAEA 53
Query: 1249 AYIIDKKNAR-KQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGM-RDEDWDVYKVI 1306
A II++ A+ R+ + + R +ER RLI E A++E + F E +Y+
Sbjct: 54 AQIIEQAEAQAAAIREQIEQERAQWEEERERLIQE-AKQEGYEAGFQAGESEALSIYQST 112
Query: 1307 NKDAGDTDS----------EEEQERLIEL 1325
+A E Q +IEL
Sbjct: 113 IDEANAIVEEAKDDYEEKIESAQPLIIEL 141
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 32.6 bits (74), Expect = 2.4
Identities = 15/105 (14%), Positives = 47/105 (44%), Gaps = 4/105 (3%)
Query: 716 KEYAKAIQEKVDKENAEQKKIDALTPQV----KTEPEENVKITQEPVVLKQIKSEEKVEM 771
++Y + ++ + EQ++ + E ++ Q + + ++K
Sbjct: 62 QQYGRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAE 121
Query: 772 EKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSE 816
E EK+ ++ ++++EE+ + A ++ K + ++ + K+K+
Sbjct: 122 EAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAA 166
Score = 30.7 bits (69), Expect = 9.2
Identities = 26/105 (24%), Positives = 39/105 (37%), Gaps = 12/105 (11%)
Query: 714 KSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEK 773
K E AKA A + K A + K + EE K E K + K + E
Sbjct: 147 KKAEAAKAKA----AAEAAKLKAAA---EAKKKAEEAAK-AAEEAKAKAEAAAAKKKAEA 198
Query: 774 EKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPK 818
E + + + E + + A KK + + EK K K+ K
Sbjct: 199 EAKAAAEKAKAEAEAKAKAEKKAE----AAAEEKAAAEKKKAAAK 239
>gnl|CDD|225123 COG2213, MtlA, Phosphotransferase system, mannitol-specific IIBC
component [Carbohydrate transport and metabolism].
Length = 472
Score = 32.7 bits (75), Expect = 2.5
Identities = 24/114 (21%), Positives = 49/114 (42%), Gaps = 14/114 (12%)
Query: 1119 LTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIV-----ELTPSDH-- 1171
+ + +D + K+ EM + + + N+L +++I+ + S
Sbjct: 337 TSKTESEDDLEAATAKMKEMKGESKGQSSALAADVKNDLSNVKKIIFACDAGMGSSAMGA 396
Query: 1172 SHAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLS 1225
S R+ K+ G+N+I N +IN L A ++ + +DLT+ K K+
Sbjct: 397 SILRKKLKNAGLNDISVTNYAINNL-----PADADLVITH--KDLTDRAKRKVP 443
>gnl|CDD|238288 cd00520, RRF, Ribosome recycling factor (RRF). Ribosome recycling
factor dissociates the posttermination complex, composed
of the ribosome, deacylated tRNA, and mRNA, after
termination of translation. Thus ribosomes are
"recycled" and ready for another round of protein
synthesis. RRF is believed to bind the ribosome at the
A-site in a manner that mimics tRNA, but the specific
mechanisms remain unclear. RRF is essential for
bacterial growth. It is not necessary for cell growth in
archaea or eukaryotes, but is found in mitochondria or
chloroplasts of some eukaryotic species.
Length = 179
Score = 31.5 bits (72), Expect = 2.5
Identities = 18/96 (18%), Positives = 41/96 (42%), Gaps = 13/96 (13%)
Query: 1114 VPVPNLTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSH 1173
V +P LT E RRKEL K ++ + + +++ +L + E++ +
Sbjct: 96 VNLPPLTEE----RRKELVKDAKKIAEEAKVAIRNIRRDANDKIKKLEKEKEISEDEVKK 151
Query: 1174 AREAFKSMGINNIQDLNK-SINQLQQKIEKTKAKII 1208
A E +Q L I ++ + ++ + +++
Sbjct: 152 AEED--------LQKLTDEYIKKIDELLKSKEKELL 179
>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 877
Score = 32.6 bits (75), Expect = 2.5
Identities = 31/147 (21%), Positives = 63/147 (42%), Gaps = 24/147 (16%)
Query: 1118 NLTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREA 1177
NL E L K+V + +++ E+RL +E + L L E+ L+P E
Sbjct: 736 NLRAEMNLSPSAPL--KVVLVGSEELEDRLEANEDDIKGLANLEELEILSPDPEEPPVEV 793
Query: 1178 FKSMGI--------------NNIQDLNKSINQLQQKIEKTKAKIIAYNNGED-LTEEPKA 1222
+G + L K + +L+++I++ + K+ + E + + P+
Sbjct: 794 TAVVGGAELFLPLAGLIDLAAELARLEKELEKLEKEIDRIEKKL----SNEGFVAKAPEE 849
Query: 1223 KLSKEIAVPESEAEFKAWLIETKKKRA 1249
+ KE E AE++ L + +++ A
Sbjct: 850 VVEKEK---EKLAEYQVKLAKLEERLA 873
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 31.6 bits (72), Expect = 2.5
Identities = 19/62 (30%), Positives = 37/62 (59%), Gaps = 2/62 (3%)
Query: 760 LKQIKSE--EKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEP 817
KQ + + E E E+E+RKK+ +RE E++E+ +++ + +K EE K + + + E
Sbjct: 13 EKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQ 72
Query: 818 KE 819
+E
Sbjct: 73 EE 74
Score = 30.4 bits (69), Expect = 6.2
Identities = 19/96 (19%), Positives = 42/96 (43%), Gaps = 12/96 (12%)
Query: 713 LKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEME 772
L+ K+ + +E ++E E+KK++ + E EE + EK + E
Sbjct: 11 LEEKQARRQQREAEEEEREERKKLEEKREGERKEEEE------------LEEEREKKKEE 58
Query: 773 KEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKT 808
+E++++ + R+E++E + VE+
Sbjct: 59 EERKEREEQARKEQEEYEKLKSSFVVEEEGTDKLSA 94
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 32.4 bits (74), Expect = 2.5
Identities = 26/172 (15%), Positives = 61/172 (35%), Gaps = 24/172 (13%)
Query: 696 QKIKLKTSRFETPYQRFLKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQ 755
+K K + + + + + ++ + + +P + T
Sbjct: 221 EKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEPINKPVRPKRKT- 279
Query: 756 EPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKS 815
K +++EK E E+ K ++ +++ + +K+I E ++ + ++ + +
Sbjct: 280 -----KAQRNKEKRRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKEKARARKKEQRK 334
Query: 816 EPKENDSSKGKSGGTISTVDFSKVNYNKYMAKPDTKILPDPYYEYLDELRDS 867
E E K + K+ +KY P +LP DEL S
Sbjct: 335 ERGEKKKLKRR-----------KLGKHKYPEPPLEVVLP-------DELSGS 368
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 31.9 bits (73), Expect = 2.9
Identities = 20/60 (33%), Positives = 34/60 (56%), Gaps = 2/60 (3%)
Query: 746 EPEENVKITQEPVVLKQI--KSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEE 803
+ EE K+ + K + K EEK + ++EK +K LRE EE+ E +K+++ K +E
Sbjct: 45 KEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRLRKLQE 104
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 31.6 bits (72), Expect = 3.0
Identities = 20/92 (21%), Positives = 37/92 (40%), Gaps = 3/92 (3%)
Query: 742 QVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKP 801
E E QE ++++K EEK E ++ +E++E + ++ + E
Sbjct: 44 AAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESE---EENEEEDE 100
Query: 802 EESNEKTKRHKIKSEPKENDSSKGKSGGTIST 833
E S+E K + K+E S + T
Sbjct: 101 ESSDENEKETEEKTESNVEKEITNPSWKPVGT 132
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 32.5 bits (74), Expect = 3.0
Identities = 31/130 (23%), Positives = 53/130 (40%), Gaps = 7/130 (5%)
Query: 715 SKEYAKAIQEKVDKE-NAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEK 773
+E I++ E EQK + E + ++ QE LK+ + +K+E+EK
Sbjct: 510 KEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKK-ELEQEMEELKERERNKKLELEK 568
Query: 774 EKRK--KILREREEEKEEQPAMKKIKVEKPEESNEKTKRH---KIKSEPKENDSSKGKSG 828
E ++ K L++ E + KKI K +S E + K K K + K G
Sbjct: 569 EAQEALKALKKEVESIIRELKEKKIHKAKEIKSIEDLVKLKETKQKIPQKPTNFQADKIG 628
Query: 829 GTISTVDFSK 838
+ F +
Sbjct: 629 DKVRIRYFGQ 638
>gnl|CDD|227570 COG5245, DYN1, Dynein, heavy chain [Cytoskeleton].
Length = 3164
Score = 32.7 bits (74), Expect = 3.0
Identities = 31/132 (23%), Positives = 47/132 (35%), Gaps = 18/132 (13%)
Query: 684 FEDKMKIGSNKYQKIK----LKT--SRFETPYQRFLKSKEYAKAIQEKVDKENAEQKKID 737
FE M GS KIK LK +R + + + E V K N + +
Sbjct: 2056 FEISMIEGSLGESKIKFIGGLKVYDARCVIYIEELDCTNVN---LVEGVRKYNEYGRGMG 2112
Query: 738 ALTPQVK------TEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEE--KEE 789
L Q+ E+N ++++ E K E + +L EEE K +
Sbjct: 2113 ELKEQLSNTVVILGVKEKNADDALSGTPGERLEREVKSVF-VEAPRDMLFLLEEEVRKRK 2171
Query: 790 QPAMKKIKVEKP 801
MK +KP
Sbjct: 2172 GSVMKFKSSKKP 2183
>gnl|CDD|227708 COG5421, COG5421, Transposase [DNA replication, recombination, and
repair].
Length = 480
Score = 32.4 bits (74), Expect = 3.1
Identities = 18/108 (16%), Positives = 39/108 (36%), Gaps = 14/108 (12%)
Query: 714 KSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEK 773
KS E KV Q + L+ +++ + E+ + ++E E
Sbjct: 249 KSDERGSPYGTKVVYGGVGQLWVFLLSYKLQKKEEQTL--------------RTRIEKEL 294
Query: 774 EKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKEND 821
EK +K L + + + +I EK + + ++ + K+
Sbjct: 295 EKAEKSLEKLKGREFNCEKDARIAAEKILKDYSSVEFLEVDFKSKKKR 342
>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812). This
family consists of several eukaryotic proteins of
unknown function.
Length = 536
Score = 32.4 bits (74), Expect = 3.2
Identities = 20/111 (18%), Positives = 45/111 (40%), Gaps = 10/111 (9%)
Query: 711 RFLKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKS-EEKV 769
RF ++++ +A + AE + T + + E E + +QI E +
Sbjct: 236 RFTRTEKVEQAATAAAEVPPAEMDTEEDRTKEREAELEA---------LQEQIDELESSI 286
Query: 770 EMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKEN 820
E + K + + ++ EE +++ E EE K + ++ + + N
Sbjct: 287 EEVLSEIKALASKIKQVNEELTTVRQENEELEEEYKIKKRTVELLPDAENN 337
>gnl|CDD|239776 cd04243, AAK_AK-HSDH-like, AAK_AK-HSDH-like: Amino Acid Kinase
Superfamily (AAK), AK-HSDH-like; this family includes the
N-terminal catalytic domain of aspartokinase (AK) of the
bifunctional enzyme AK- homoserine dehydrogenase (HSDH).
These aspartokinases are found in such bacteria as E.
coli (AKI-HSDHI, ThrA and AKII-HSDHII, MetL) and in
higher plants (Z. mays AK-HSDH). AK and HSDH are the
first and third enzymes in the biosynthetic pathway of
the aspartate family of amino acids. AK catalyzes the
phosphorylation of Asp to P-aspartyl phosphate. HSDH
catalyzes the NADPH-dependent conversion of Asp
3-semialdehyde to homoserine. ThrA and MetL are involved
in threonine and methionine biosynthesis, respectively.
In E. coli, ThrA is subject to allosteric regulation by
the end product L-threonine and the native enzyme is
reported to be tetrameric. As with bacteria, plant AK and
HSDH are feedback inhibited by pathway end products.
Maize AK-HSDH is a Thr-sensitive 180-kD enzyme.
Arabidopsis AK-HSDH is an alanine-activated,
threonine-sensitive enzyme whose ACT domains, located
C-terminal to the AK catalytic domain, were shown to be
involved in allosteric activation. Also included in this
CD is the catalytic domain of the aspartokinase (AK) of
the lysine-sensitive aspartokinase isoenzyme AKIII, a
monofunctional class enzyme (LysC) found in some bacteria
such as E. coli. In E. coli, LysC is reported to be a
homodimer of 50 kD subunits. Also included in this CD is
the catalytic domain of aspartokinase (AK) of the
bifunctional enzyme AK - DAP decarboxylase (DapDC) found
in some bacteria. DapDC, which is the lysA gene product,
catalyzes the decarboxylation of DAP to lysine.
Length = 293
Score = 31.8 bits (73), Expect = 3.2
Identities = 21/87 (24%), Positives = 41/87 (47%), Gaps = 8/87 (9%)
Query: 1130 ELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSMGINNIQDL 1189
LA+ + + ERHL+ + EL L+ + A S+ + ++DL
Sbjct: 48 ALAELAASGDDAQAIVLQEIRERHLDLIKEL-----LSGESAAELLAALDSL-LERLKDL 101
Query: 1190 NKSINQLQQKIEKTKAKIIAYNNGEDL 1216
+ I L + +KT+A+++++ GE L
Sbjct: 102 LEGIRLLGELSDKTRAEVLSF--GELL 126
>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564. This
family of proteins has no known function. However, one
of the members is annotated as an EF-hand family
protein.
Length = 349
Score = 32.1 bits (73), Expect = 3.4
Identities = 28/113 (24%), Positives = 47/113 (41%), Gaps = 12/113 (10%)
Query: 711 RFLKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIK-SEEKV 769
R KS +K +E E E++ + P + P+ + ++ +EE+
Sbjct: 19 REKKSIRQSKLEEELNKLEKKEEEAECKKKFKANPVP----ASVKLPLYEEIMEQNEERR 74
Query: 770 EMEKEKRKKILRERE-----EEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEP 817
E +EK K IL + E+EEQ K I K S + + K K++P
Sbjct: 75 EEVREKSKAILLSSQKPFSFYEREEQK--KAILPRKLRSSTSEREPKKFKAKP 125
>gnl|CDD|225368 COG2811, NtpF, Archaeal/vacuolar-type H+-ATPase subunit H [Energy
production and conversion].
Length = 108
Score = 30.1 bits (68), Expect = 3.4
Identities = 15/59 (25%), Positives = 32/59 (54%)
Query: 761 KQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKE 819
+I ++E++E KE+ ++I++E EE E + + EK + + R + + E +E
Sbjct: 15 AEISADEEIEEAKEEAEQIIKEAREEAREIIEEAEEEAEKLAQEILEEAREEAEEEAEE 73
>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM). This
family consists of several Plasmodium falciparum SPAM
(secreted polymorphic antigen associated with
merozoites) proteins. Variation among SPAM alleles is
the result of deletions and amino acid substitutions in
non-repetitive sequences within and flanking the alanine
heptad-repeat domain. Heptad repeats in which the a and
d position contain hydrophobic residues generate
amphipathic alpha-helices which give rise to helical
bundles or coiled-coil structures in proteins. SPAM is
an example of a P. falciparum antigen in which a
repetitive sequence has features characteristic of a
well-defined structural element.
Length = 164
Score = 31.0 bits (70), Expect = 3.5
Identities = 26/97 (26%), Positives = 46/97 (47%)
Query: 728 KENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEK 787
K+ E+KK + L VK + I +E +K K E+ E E+E ++I + E
Sbjct: 11 KKPEEEKKDENLLEHVKITSWDKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPEDIED 70
Query: 788 EEQPAMKKIKVEKPEESNEKTKRHKIKSEPKENDSSK 824
EE+ + + E+ EE N K + K+ +S++
Sbjct: 71 EEEIVEDEEEEEEDEEDNVDLKDIEKKNINDIFNSTQ 107
>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 257 to 277 amino acids in length. This domain is
found associated with pfam00004. This domain has a
conserved LER sequence motif.
Length = 276
Score = 31.7 bits (72), Expect = 3.8
Identities = 24/78 (30%), Positives = 33/78 (42%), Gaps = 6/78 (7%)
Query: 748 EENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVE-----KPE 802
EE +K QE VL+Q E E + ++ E E E E + KI+ E K E
Sbjct: 132 EELLK-MQEESVLRQEAMRRATEEEILEMRRETIEEEAELERENIRAKIEAEARGRAKEE 190
Query: 803 ESNEKTKRHKIKSEPKEN 820
NE R +K + E
Sbjct: 191 RENEDINREMLKLKANEE 208
>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
Length = 1463
Score = 32.0 bits (73), Expect = 3.9
Identities = 28/154 (18%), Positives = 52/154 (33%), Gaps = 32/154 (20%)
Query: 698 IKLKTSRFETPYQRFLKSKEYAKAIQEKVDKENAEQ----KKIDA--------------- 738
+ K R +RF K+ +Q E AE+ KID
Sbjct: 847 LIQKFGRSLKAKKRFSLLKKETIYLQSAQRVELAERQLQELKIDVKSISSLKLVNLELES 906
Query: 739 ----LTPQVKTEPEENVK-----ITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEE 789
L + ++ EN++ I + +L I EE +E K ++ + E E +
Sbjct: 907 EIIELKKSLSSDLIENLEFKTELIARLKKLLNNIDLEEGPSIEYVKLPELNKLHEVESKL 966
Query: 790 QPAMKK----IKVEKPEESNEKTKRHKIKSEPKE 819
+ ++ +K ++K+ KE
Sbjct: 967 KETSEEYEDLLKKSTILVREGNKANSELKNFKKE 1000
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 31.7 bits (72), Expect = 4.0
Identities = 22/86 (25%), Positives = 41/86 (47%), Gaps = 7/86 (8%)
Query: 693 NKYQKIKLKTSRFETPYQRFLKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVK 752
Y+++K T Q LK EY K +Q+ K + + DA + + + + +E
Sbjct: 351 KLYEEVKSNTDLSGDKRQELLK--EYNKKLQDYTKKLGEVKDETDA-SEEAEAKAKEEKL 407
Query: 753 ITQEPVVLKQIKSEEKVEMEKEKRKK 778
+E + K +E+ + +KEKR+K
Sbjct: 408 KQEE----NEKKQKEQADEDKEKRQK 429
>gnl|CDD|224755 COG1842, PspA, Phage shock protein A (IM30), suppresses
sigma54-dependent transcription [Transcription / Signal
transduction mechanisms].
Length = 225
Score = 31.1 bits (71), Expect = 4.0
Identities = 25/98 (25%), Positives = 42/98 (42%), Gaps = 11/98 (11%)
Query: 1120 TTEQQKDRRKELAKKLVEM--NAKKREER-----LVDDERHLNELLELREIVELTPSDHS 1172
Q R+K+L +KL E A+K EE+ +E E LE ++ +E
Sbjct: 46 ALAQAIARQKQLERKLEEAQARAEKLEEKAELALQAGNEDLAREALEEKQSLEDLAKALE 105
Query: 1173 HAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAY 1210
+ + ++ L K + L+QKI + +AK A
Sbjct: 106 AELQQAEEQ----VEKLKKQLAALEQKIAELRAKKEAL 139
>gnl|CDD|217630 pfam03592, Terminase_2, Terminase small subunit. Packaging of
double-stranded viral DNA concatemers requires
interaction of the prohead with virus DNA. This process
is mediated by a phage-encoded DNA recognition and
terminase protein. The terminase enzymes described so
far, which are hetero-oligomers composed of a small and a
large subunit, do not have a significant level of
sequence homology. The small terminase subunit is thought
to form a nucleoprotein structure that helps to position
the terminase large subunit at the packaging initiation
site.
Length = 141
Score = 30.3 bits (69), Expect = 4.2
Identities = 26/115 (22%), Positives = 40/115 (34%), Gaps = 24/115 (20%)
Query: 1245 KKKRAYIIDKKNARK--------QRRQDLAKRRTAAAQERMRLISELAR---KEKRDDDF 1293
KK A I + K + ++ KR A E + ++E+AR E
Sbjct: 28 SKKTARSIGSELLTKPKIKAYIDELMKEREKRTIITADEVLEELTEIARGDETELVLVGT 87
Query: 1294 GMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFT-SLNQEQELSP 1347
+ + KV KD RL LE + + H FT + E E+
Sbjct: 88 KEGEVEKIEKKVKLKD-----------RLKALELLGK-HYGLFTDKVEVEGEVKK 130
>gnl|CDD|237301 PRK13200, psaA, photosystem I P700 chlorophyll a apoprotein A1;
Provisional.
Length = 766
Score = 32.1 bits (73), Expect = 4.3
Identities = 15/48 (31%), Positives = 23/48 (47%), Gaps = 4/48 (8%)
Query: 1431 HFSVSLAENPELDAWSGARQFSLSENFHDFAVTQSDYQEKGGEFFRVH 1478
HF +LA+ P+ W ++L + HDF SD +E + F H
Sbjct: 33 HFDRTLAKGPKTTTWI----WNLHADAHDFDTHTSDLEEVSRKIFSAH 76
>gnl|CDD|218704 pfam05701, DUF827, Plant protein of unknown function (DUF827).
This family consists of several plant proteins of
unknown function. Several sequences in this family are
described as being "myosin heavy chain-like".
Length = 484
Score = 31.5 bits (71), Expect = 4.6
Identities = 25/108 (23%), Positives = 51/108 (47%), Gaps = 1/108 (0%)
Query: 713 LKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEME 772
L+ A A E + KE + + ++ +++ K +E L+ + K E+E
Sbjct: 231 LEKLAEASAELESLQKEISIMASVASVLKELEEAKANLEKAAEEEKSLRNLVESLKQELE 290
Query: 773 KEKRK-KILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKE 819
+EK++ + LRE+E E EE + + ++ + + E R K+ P +
Sbjct: 291 EEKKELEELREKEGEAEEAASSLEAELNRTKSEKESKAREKMVEIPLK 338
>gnl|CDD|234275 TIGR03599, YloV, DAK2 domain fusion protein YloV. This model
describes a protein family that contains an N-terminal
DAK2 domain (pfam02734), so named because of similarity
to the dihydroxyacetone kinase family family. The
GTP-binding protein CgtA (a member of the obg family) is
a bacterial GTPase associated with ribosome biogenesis,
and it has a characteristic extension (TIGR03595) in
certain lineages. This protein family described here was
found, by the method of partial phylognetic profiling, to
have a phylogenetic distribution strongly correlated to
that of TIGR03595. This correlation implies some form of
functional coupling.
Length = 530
Score = 31.8 bits (73), Expect = 4.6
Identities = 16/47 (34%), Positives = 23/47 (48%), Gaps = 6/47 (12%)
Query: 1174 AREAFKSMGINNI----QDLNKSINQLQQKIEKTKAK--IIAYNNGE 1214
E FKS+G + + Q +N S + + IEK AK I+ NN
Sbjct: 326 IAELFKSLGADVVIEGGQTMNPSTEDILKAIEKVNAKNVIVLPNNKN 372
>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa). Members of this
family of proteins are part of the yeast nuclear pore
complex-associated pre-60S ribosomal subunit. The family
functions as a highly conserved exonuclease that is
required for the 5'-end maturation of 5.8S and 25S
rRNAs, demonstrating that 5'-end processing also has a
redundant pathway. Nop25 binds late pre-60S ribosomes,
accompanying them from the nucleolus to the nuclear
periphery; and there is evidence for both physical and
functional links between late 60S subunit processing and
export.
Length = 134
Score = 30.0 bits (68), Expect = 4.9
Identities = 22/80 (27%), Positives = 39/80 (48%), Gaps = 4/80 (5%)
Query: 761 KQIKSEEKVEMEK--EKRKKILREREEEK-EEQPAMKKIKVEKPEESNEKTKRHK-IKSE 816
K+I+ E K E+EK ++RK+ L+ EEE +E+ A + + ++ E E
Sbjct: 55 KRIREERKQELEKQLKERKEALKLLEEENDDEEDAETEDTEDVEDDEWEGFPEPTVTDYE 114
Query: 817 PKENDSSKGKSGGTISTVDF 836
+ D K K+ T+ +D
Sbjct: 115 EEYIDEDKYKTTVTVEELDP 134
>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
Length = 746
Score = 31.7 bits (73), Expect = 5.0
Identities = 17/88 (19%), Positives = 37/88 (42%), Gaps = 2/88 (2%)
Query: 733 QKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPA 792
+ ++ L ++ +E ++ +E + ++KS E+E RK+ + EE+ + A
Sbjct: 3 KVRVYELAKELGVSSKELLEKLKELGI--EVKSHSSTVEEEEARKEEAKREAEEEAKAEA 60
Query: 793 MKKIKVEKPEESNEKTKRHKIKSEPKEN 820
+ E EE+ + E E
Sbjct: 61 EEAAAAEAEEEAKAEAAAAAPAEEAAEA 88
>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
Length = 925
Score = 31.6 bits (72), Expect = 5.0
Identities = 22/147 (14%), Positives = 47/147 (31%), Gaps = 15/147 (10%)
Query: 1118 NLTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREA 1177
+ DR +EL +++E ++ + + E L EL I E +
Sbjct: 794 GINNSDINDRPQELIDQIIESEEERLKALRIQREEMLMRPEELELINEEQ--------KN 845
Query: 1178 FKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEF 1237
K + ++ ++ ++Q E N + +PK IA
Sbjct: 846 LKQEIKLELSEIQEAEEEIQNINEN-------KNEFVEFKNDPKKLNKLIIAKDVLIKLV 898
Query: 1238 KAWLIETKKKRAYIIDKKNARKQRRQD 1264
+ + ++ KK+ K +
Sbjct: 899 ISSDEIKQDEKTTKKKKKDLEKTDEEA 925
>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
(DUF874). This family consists of several hypothetical
proteins specific to Helicobacter pylori. The function
of this family is unknown.
Length = 417
Score = 31.4 bits (70), Expect = 5.1
Identities = 31/101 (30%), Positives = 53/101 (52%), Gaps = 10/101 (9%)
Query: 723 QEKVDKENAEQKKIDA---LTPQVKTEPEENVKITQEPVVL--KQIKSE-EKVEMEKEKR 776
Q K + ENA + + L + + +E K +E + L QIK+E EK + E+EK+
Sbjct: 132 QAKKEAENARDRANKSGIELEQEEQKTEQEKQKTEKEGIELANSQIKAEQEKQKTEQEKQ 191
Query: 777 KKILREREEEKEEQPAMK-KIKVEKPEESNEKTKRHKIKSE 816
K E+E++K A K I++E+ ++ E K+ IK +
Sbjct: 192 KT---EQEKQKTSNIANKNAIELEQEKQKTENEKQDLIKEQ 229
>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
Length = 535
Score = 31.4 bits (71), Expect = 5.3
Identities = 29/129 (22%), Positives = 62/129 (48%), Gaps = 14/129 (10%)
Query: 700 LKTSRFETPYQR---FLKSKEYAKAIQEKVDKE----NAEQKKIDA-LTPQVKTEPEENV 751
KT++ E+ + L++KE A+ +E++++E E K+I++ LT + + ++
Sbjct: 59 KKTAKRESKALKKELLLEAKEEARKYREEIEQEFKSERQELKQIESRLTERATSLDRKDE 118
Query: 752 KITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRH 811
++ + L E E + K + EREE+ E+ KK ++E+ ++ R
Sbjct: 119 NLSSKEKTL------ESKEQSLTDKSKHIDEREEQVEKLEEQKKAELERVAALSQAEARE 172
Query: 812 KIKSEPKEN 820
I +E +
Sbjct: 173 IILAETENK 181
>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
Length = 1021
Score = 31.6 bits (71), Expect = 5.4
Identities = 31/125 (24%), Positives = 61/125 (48%), Gaps = 19/125 (15%)
Query: 725 KVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILRER- 783
+VDK++AE+ +I+ K +KI E +++++ EE+ +E+E+ ++I RER
Sbjct: 429 RVDKDHAERARIEKENAHRKAL---EMKIL-EKKRIERLEREERERLERERMERIERERL 484
Query: 784 -----EEEKEEQPAMKKIKVEKPEESN---------EKTKRHKIKSEPKENDSSKGKSGG 829
E E+ E+ +++ ++++ E EK +R+ + EN S G G
Sbjct: 485 ERERLERERLERDRLERDRLDRLERERVDRLERDRLEKARRNSYFLKGMENGLSAGGGPG 544
Query: 830 TISTV 834
V
Sbjct: 545 DGPGV 549
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 30.9 bits (70), Expect = 5.4
Identities = 19/71 (26%), Positives = 34/71 (47%), Gaps = 4/71 (5%)
Query: 709 YQRFLKSKEYAKAIQEKVDKENAE-QKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEE 767
+R +++ E K K DK N E KI++ ++ Q +K++ S+E
Sbjct: 9 AKRVVRALESEKYKANK-DKGNPEIYNKINSQDKAIEKFKLL--IKAQMAERVKKLHSQE 65
Query: 768 KVEMEKEKRKK 778
K E +K+ +KK
Sbjct: 66 KKEEKKKPKKK 76
>gnl|CDD|225629 COG3087, FtsN, Cell division protein [Cell division and chromosome
partitioning].
Length = 264
Score = 31.0 bits (70), Expect = 5.5
Identities = 14/110 (12%), Positives = 35/110 (31%), Gaps = 6/110 (5%)
Query: 699 KLKTSRFETPYQRFLKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPV 758
+ + + + + ++ + ++ VD++ L Q + E TQ
Sbjct: 80 PTEPAAVKDAERLTPEQRQLLEQME--VDQKAQPT----QLGEQPEQARIEEQPRTQSQK 133
Query: 759 VLKQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKT 808
Q + + ++ + R + + +P K K E K
Sbjct: 134 AQSQATTVQTQPVKPKPRPEKPQPVAPAPAPEPVEKAPKAEAAPPPKPKA 183
>gnl|CDD|212668 cd10225, MreB_like, MreB and similar proteins. MreB is a bacterial
protein which assembles into filaments resembling those
of eukaryotic F-actin. It is involved in determining the
shape of rod-like bacterial cells, by assembling into
large fibrous spirals beneath the cell membrane. MreB has
also been implicated in chromosome segregation;
specifically MreB is thought to bind to and segregate the
replication origin of bacterial chromosomes.
Length = 320
Score = 31.2 bits (72), Expect = 5.6
Identities = 11/47 (23%), Positives = 18/47 (38%), Gaps = 8/47 (17%)
Query: 1398 LANNIFVTGSLCQLPGFVERLNKDLLENRPFQSHFSVSLAENPELDA 1444
L I +TG L G E ++++ V +AE+P
Sbjct: 270 LDRGIVLTGGGALLRGLDELISEET--------GLPVRVAEDPLTCV 308
>gnl|CDD|226169 COG3643, COG3643, Glutamate formiminotransferase [Amino acid
transport and metabolism].
Length = 302
Score = 31.3 bits (71), Expect = 5.8
Identities = 24/105 (22%), Positives = 39/105 (37%), Gaps = 12/105 (11%)
Query: 758 VVLKQIKSEEKVEMEKEKRKKI---------LREREEEKEEQPAMKKIKVEKPEESNEKT 808
V LK +EE VE+ KE K++ L E + E+ + I+ + E EK
Sbjct: 94 VPLKDTTTEECVEISKELGKRVGEELGIPVYLYEDAATRPERQNLADIRKGQYEGLKEKI 153
Query: 809 KRHKIKSEPKENDSSKG-KSGGTISTVDFSKVNYNKYMAKPDTKI 852
K K P +G T + +N + + +I
Sbjct: 154 KEPKWA--PDFGPREVHPTAGVTAIGARMPLIAFNVNLDTSNMEI 196
>gnl|CDD|223994 COG1067, LonB, Predicted ATP-dependent protease [Posttranslational
modification, protein turnover, chaperones].
Length = 647
Score = 31.6 bits (72), Expect = 5.8
Identities = 20/90 (22%), Positives = 37/90 (41%), Gaps = 6/90 (6%)
Query: 737 DALTPQVKTEPEE--NVKITQEPVVLKQ-IKSEEKVEMEKEKRKKILREREEEKEEQPAM 793
D P VK+ + ++ + EP V K + + E+ + I + E E+P +
Sbjct: 33 DEAVPIVKSAAFQRRHILLIGEPGVGKSMLAQAMSFLLPLEELEDIRVKPNPEDPERPKV 92
Query: 794 KKIKVEKPEESNEKTKRHKIKSEPKENDSS 823
+ + +PE +E K+E K S
Sbjct: 93 ETVPAGRPESFSE---LLGQKAEEKAEYLS 119
>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 438
Score = 31.3 bits (71), Expect = 5.8
Identities = 21/97 (21%), Positives = 43/97 (44%), Gaps = 6/97 (6%)
Query: 732 EQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKI------LREREE 785
++++ + K E +K + + + K E + EK K++ L E +
Sbjct: 69 LEEQLINQKKEQKNLFNEQIKQFELALQDEIAKLEALELLNLEKDKELELLEKELDELSK 128
Query: 786 EKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKENDS 822
E ++Q +EK E+N+ +R K ++E K +S
Sbjct: 129 ELQKQLQNTAEIIEKKRENNKNEERLKFENEKKLEES 165
>gnl|CDD|224141 COG1220, HslU, ATP-dependent protease HslVU (ClpYQ), ATPase subunit
[Posttranslational modification, protein turnover,
chaperones].
Length = 444
Score = 31.5 bits (72), Expect = 5.8
Identities = 9/53 (16%), Positives = 23/53 (43%), Gaps = 7/53 (13%)
Query: 769 VEMEKEKRKKILREREEEKEEQ-------PAMKKIKVEKPEESNEKTKRHKIK 814
V++ +E++ + ++++ EE E+ P K + + R K +
Sbjct: 108 VKLVREEKIEKVKDKAEELAEERILDALVPPAKNFWGQSENKQESSATREKFR 160
>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family. The organisation of
microtubules varies with the cell type and is presumably
controlled by tissue-specific microtubule-associated
proteins (MAPs). The 115-kDa epithelial MAP
(E-MAP-115/MAP7) has been identified as a
microtubule-stabilising protein predominantly expressed
in cell lines of epithelial origin. The binding of this
microtubule associated protein is nucleotide
independent.
Length = 171
Score = 30.4 bits (68), Expect = 5.9
Identities = 18/77 (23%), Positives = 43/77 (55%), Gaps = 2/77 (2%)
Query: 742 QVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKP 801
Q + E EE ++ +E LK+ +EE++ E+E R++ E++E+ + + ++
Sbjct: 44 QERREQEEQDRLEREE--LKRRAAEERLRREEEARRQEEERAREKEEKAKRKAEEEEKQE 101
Query: 802 EESNEKTKRHKIKSEPK 818
+E E+ ++ K ++E +
Sbjct: 102 QEEQERIQKQKEEAEAR 118
Score = 29.7 bits (66), Expect = 9.7
Identities = 25/98 (25%), Positives = 51/98 (52%), Gaps = 1/98 (1%)
Query: 719 AKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKK 778
A+ +E+ ++E EQ++ D L + ++ +E +Q + + + EK KRK
Sbjct: 35 AREQREQEEQERREQEEQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKAKRKA 94
Query: 779 ILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSE 816
E++E +EEQ ++K K E + E+ +R +++ E
Sbjct: 95 EEEEKQE-QEEQERIQKQKEEAEARAREEAERMRLERE 131
>gnl|CDD|237561 PRK13923, PRK13923, putative spore coat protein regulator protein
YlbO; Provisional.
Length = 170
Score = 30.2 bits (68), Expect = 6.0
Identities = 14/62 (22%), Positives = 28/62 (45%), Gaps = 2/62 (3%)
Query: 734 KKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPAM 793
++I L + + EN + QE + ++ V M + +R IL E+E++ A
Sbjct: 111 EQIGKLQEEEEKLSWENQTLKQELAITEEDYRALIVIMNRARRMAIL--VEDERKSLAAF 168
Query: 794 KK 795
+
Sbjct: 169 EM 170
>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
autophagy-related subunit 14. The Atg14 or Apg14
proteins are hydrophilic proteins with a predicted
molecular mass of 40.5 kDa, and have a coiled-coil motif
at the N terminus region. Yeast cells with mutant Atg14
are defective not only in autophagy but also in sorting
of carboxypeptidase Y (CPY), a vacuolar-soluble
hydrolase, to the vacuole. Subcellular fractionation
indicate that Apg14p and Apg6p are peripherally
associated with a membrane structure(s). Apg14p was
co-immunoprecipitated with Apg6p, suggesting that they
form a stable protein complex. These results imply that
Apg6/Vps30p has two distinct functions: in the autophagic
process and in the vacuolar protein sorting pathway.
Apg14p may be a component specifically required for the
function of Apg6/Vps30p through the autophagic pathway.
There are 17 auto-phagosomal component proteins which are
categorized into six functional units, one of which is
the AS-PI3K complex (Vps30/Atg6 and Atg14). The AS-PI3K
complex and the Atg2-Atg18 complex are essential for
nucleation, and the specific function of the AS-PI3K
apparently is to produce phosphatidylinositol 3-phosphate
(PtdIns(3)P) at the pre-autophagosomal structure (PAS).
The localisation of this complex at the PAS is controlled
by Atg14. Autophagy mediates the cellular response to
nutrient deprivation, protein aggregation, and pathogen
invasion in humans, and malfunction of autophagy has been
implicated in multiple human diseases including cancer.
This effect seems to be mediated through direct
interaction of the human Atg14 with Beclin 1 in the human
phosphatidylinositol 3-kinase class III complex.
Length = 307
Score = 31.2 bits (71), Expect = 6.0
Identities = 15/47 (31%), Positives = 23/47 (48%)
Query: 1243 ETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKR 1289
E ++KR I + K A QRR DL+ + R + +L + KR
Sbjct: 86 EIEQKRERIEELKRALAQRRSDLSSASYQLEKRRASQLEKLQDEIKR 132
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 30.0 bits (68), Expect = 6.4
Identities = 18/71 (25%), Positives = 39/71 (54%)
Query: 760 LKQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKE 819
L+ + + K E E E+ ++ E++ + EE+ A K+ K +K ++ +K K+ K ++ +E
Sbjct: 52 LELMDEKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEE 111
Query: 820 NDSSKGKSGGT 830
+ SK +
Sbjct: 112 KEGSKSSEESS 122
>gnl|CDD|185654 PTZ00475, PTZ00475, RESA-like protein; Provisional.
Length = 282
Score = 30.8 bits (69), Expect = 6.4
Identities = 29/109 (26%), Positives = 48/109 (44%), Gaps = 9/109 (8%)
Query: 77 NQEIPKSD----EANKELSKTLDVLVQNAYKSIVKCTKTAQSFPSTHENSLLIASPNYIS 132
N I K D EAN+ + + VL Q + I T +P +N+ I Y+S
Sbjct: 126 NSGINKKDIYLKEANERMIRNSIVLRQCKSRFISIIT---NYYPFKEQNNPFIKQAQYVS 182
Query: 133 GVAGTSDKVMTLVDSLLKTQNISKSMSKLYLEGQKDILTEANDKLLESI 181
D ++ +D NI +++ LY E ++L E +++LE I
Sbjct: 183 SSNYVLDDIINNID--YSIDNIHRAIDNLYYEHILNLLEEEKNEILEEI 229
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 31.2 bits (70), Expect = 6.4
Identities = 34/140 (24%), Positives = 61/140 (43%), Gaps = 13/140 (9%)
Query: 691 GSNKYQKIKLK-TSRFETPYQRFLKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEE 749
G NK++ + + ++F K E A A+ E+ D+ A KK + + + +
Sbjct: 27 GFNKFRGERWRLQNKFLLAAGSR-KKIESALAVDEEPDENGAVSKKKPTRSVK-RATKKT 84
Query: 750 NVKITQEPVVLKQIKSEEKVEMEKEKRKKILRER----------EEEKEEQPAMKKIKVE 799
V+I++ ++ E ++KE +K R R EEEK E+ K+ KV+
Sbjct: 85 VVEISEPLEEGSELVVNEDAALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVK 144
Query: 800 KPEESNEKTKRHKIKSEPKE 819
K +E E S+ +E
Sbjct: 145 KMDEDVEDQGSESEVSDVEE 164
>gnl|CDD|237377 PRK13405, bchH, magnesium chelatase subunit H; Provisional.
Length = 1209
Score = 31.2 bits (71), Expect = 7.3
Identities = 12/25 (48%), Positives = 17/25 (68%)
Query: 406 KWLQKDFGLYVVGMFDTHQACKFLP 430
+WL++DFG V F TH A +F+P
Sbjct: 589 RWLREDFGADAVLHFGTHGALEFMP 613
Score = 31.2 bits (71), Expect = 7.3
Identities = 12/25 (48%), Positives = 17/25 (68%)
Query: 1851 KWLQKDFGLYVVGMFDTHQACKFLP 1875
+WL++DFG V F TH A +F+P
Sbjct: 589 RWLREDFGADAVLHFGTHGALEFMP 613
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 31.2 bits (71), Expect = 7.5
Identities = 27/106 (25%), Positives = 56/106 (52%), Gaps = 9/106 (8%)
Query: 716 KEYAKAIQEKVDKENAEQKKIDALTPQVKTE--PEENVKITQEPVVLKQIKSEEKVEMEK 773
++ AKA + K+ + ++K++ L +++ E P ++I +P K ++K E+++
Sbjct: 497 RDAAKA-EMKLISLDKKKKQLLQLCDKLRDEWLPNLGIRIEDKPDGPSVWKLDDKEELQR 555
Query: 774 EKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKE 819
EK E +E +EQ ++K+K ++ ++ E K K K P E
Sbjct: 556 EK------EEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAE 595
>gnl|CDD|233605 TIGR01865, cas_Csn1, CRISPR-associated protein Cas9/Csn1, subtype
II/NMEMI. CRISPR loci appear to be mobile elements with
a wide host range. This model represents a protein found
only in CRISPR-containing species, near other
CRISPR-associated proteins (cas), as part of the NMENI
subtype of CRISPR/Cas locus. The species range so far
for this protein is animal pathogens and commensals only
[Mobile and extrachromosomal element functions, Other].
Length = 805
Score = 31.2 bits (71), Expect = 7.7
Identities = 22/108 (20%), Positives = 40/108 (37%), Gaps = 15/108 (13%)
Query: 715 SKEYAKAIQEK---VDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKV-- 769
+ K N + K + L P VK + K+ E ++K+ +K+
Sbjct: 463 GNRNFMQNINDSQLLPKINITKAKDEILNPVVKRALLQARKVVNE--LVKKYGPPDKIVI 520
Query: 770 EMEKE--------KRKKILREREEEKEEQPAMKKIKVEKPEESNEKTK 809
EM +E + K ++ E+K ++ A K EE E +
Sbjct: 521 EMAREEQGTNFGKRNSKERYKKNEDKIKEFASALGKEILKEEPTENSS 568
>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6. The surfeit locus
protein SURF-6 is shown to be a component of the
nucleolar matrix and has a strong binding capacity for
nucleic acids.
Length = 206
Score = 30.4 bits (69), Expect = 7.8
Identities = 33/139 (23%), Positives = 66/139 (47%), Gaps = 17/139 (12%)
Query: 695 YQKIKLKTSRFETPYQRFLKSKEYAKA-IQEKVDKENAEQKKIDALTPQVKTEPEEN--- 750
+ K++ + K K+ K ++ + K A +KK++ L E EE
Sbjct: 66 FSKVEFADGEQAKKDLKLKKKKKKKKTDYKQLLKKLEARKKKLEELDEDKAAEIEEKEKW 125
Query: 751 ---------VKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEK- 800
VK+ + +LK K+ ++ E +K+K KK +ER+E+ E++ A ++ K E+
Sbjct: 126 TKALAKAEGVKVKDDEKLLK--KALKRKEKQKKKSKKEWKERKEKVEKKKAERQKKREEN 183
Query: 801 -PEESNEKTKRHKIKSEPK 818
+ ++K + K K++ K
Sbjct: 184 LKKRKDDKKNKKKKKAKKK 202
>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
Length = 1437
Score = 31.3 bits (72), Expect = 7.9
Identities = 23/105 (21%), Positives = 43/105 (40%), Gaps = 10/105 (9%)
Query: 732 EQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQP 791
+++++ Q + E E+ K E + + + +++ + K + ++ +KEE
Sbjct: 168 SKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKKIDKEEIT 227
Query: 792 AMKKIKVEKPEESNEKTKRHKI-KSEPKENDSSKGKSGGTISTVD 835
MK+I EE + K E KE KSG I T
Sbjct: 228 PMKEI----NEEERRVVVEGYVFKVEIKE-----LKSGRHILTFK 263
>gnl|CDD|233787 TIGR02223, ftsN, cell division protein FtsN. FtsN is a poorly
conserved protein active in cell division in a number of
Proteobacteria. The N-terminal 30 residue region tends
to by Lys/Arg-rich, and is followed by a
membrane-spanning region. This is followed by an acidic
low-complexity region of variable length and a
well-conserved C-terminal domain of two tandem regions
matched by pfam05036 (Sporulation related repeat), found
in several cell division and sporulation proteins. The
role of FtsN as a suppressor for other cell division
mutations is poorly understood; it may involve cell wall
hydrolysis [Cellular processes, Cell division].
Length = 298
Score = 30.4 bits (68), Expect = 8.2
Identities = 25/117 (21%), Positives = 43/117 (36%), Gaps = 10/117 (8%)
Query: 716 KEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENV------KITQEPVVLKQIKSE--- 766
+E A+ E+ Q + A + T P E K T E K +E
Sbjct: 109 EESAQLTAEQRQLLEQMQADMRAAEKVLATAPSEQTVAVEARKQTAEKKPQKARTAEAQK 168
Query: 767 EKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKENDSS 823
VE EK K +++++ + + KP E+ K K++PK + +
Sbjct: 169 TPVETEKIASKVKEAKQKQKALPKQTAETQSNSKPIETAPKA-DKADKTKPKPKEKA 224
>gnl|CDD|143435 cd07117, ALDH_StaphAldA1, Uncharacterized Staphylococcus aureus AldA1
(SACOL0154) aldehyde dehydrogenase-like. Uncharacterized
aldehyde dehydrogenase from Staphylococcus aureus (AldA1,
locus SACOL0154) and other similar sequences are present
in this CD.
Length = 475
Score = 30.9 bits (70), Expect = 8.3
Identities = 24/105 (22%), Positives = 50/105 (47%), Gaps = 15/105 (14%)
Query: 1160 LREIVELTPSDHSHA----REAFKSMGINNIQDLNKSINQLQQKIEKTK---AKIIAYNN 1212
L EI + T +D A +EAFK+ + + +N++ I++ K A + +N
Sbjct: 29 LSEITDATDADVDRAVKAAQEAFKTWRKTTVAERANILNKIADIIDENKELLAMVETLDN 88
Query: 1213 GEDLTEEPKAKLSKEIAVPESEAEFK--AWLIETKKKRAYIIDKK 1255
G+ + E ++ + +P + F+ A +I ++ A +ID+
Sbjct: 89 GKPIRE------TRAVDIPLAADHFRYFAGVIRAEEGSANMIDED 127
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 30.8 bits (70), Expect = 8.3
Identities = 19/99 (19%), Positives = 35/99 (35%), Gaps = 3/99 (3%)
Query: 746 EPEENVKITQEPVVLKQIKSEEKV---EMEKEKRKKILREREEEKEEQPAMKKIKVEKPE 802
NV + + ++ + E KV +K+K++ E +++ E++ KK K E
Sbjct: 26 YIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKS 85
Query: 803 ESNEKTKRHKIKSEPKENDSSKGKSGGTISTVDFSKVNY 841
E K K K F+K+
Sbjct: 86 EGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAE 124
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 30.0 bits (68), Expect = 8.3
Identities = 22/106 (20%), Positives = 51/106 (48%), Gaps = 5/106 (4%)
Query: 740 TPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVE 799
TP E E K +E L + + K E E++++ K ++++ K+++ K K +
Sbjct: 50 TPIYDAEYTEAKKKKKE---LAEEIEKVKKEYEEKQKWK--WKKKKSKKKKDKDKDKKDD 104
Query: 800 KPEESNEKTKRHKIKSEPKENDSSKGKSGGTISTVDFSKVNYNKYM 845
K ++ +EK + + + ++ S ++ T+S + K +K +
Sbjct: 105 KKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELKPRKYALHKDI 150
>gnl|CDD|236335 PRK08724, fliD, flagellar capping protein; Validated.
Length = 673
Score = 31.0 bits (70), Expect = 8.4
Identities = 22/93 (23%), Positives = 37/93 (39%), Gaps = 8/93 (8%)
Query: 719 AKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQI--KSEEKVEMEKEKR 776
AI + D LT T E +I Q+ K + EK E+ E+
Sbjct: 334 EAAIAKAPDVPGWSNTASGTLTDSYVTPKEAQAEIEQKLAQEKAQLDAAVEKGELTPEQA 393
Query: 777 KKILREREEEKEEQPAMKKIKVEKPEESNEKTK 809
K+I R + E +E + ++EK +++ K
Sbjct: 394 KQIARAKLEPEERE------RLEKIDKAQAALK 420
>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB proteins similar to the
monomeric yeast and human topo I. Topo I enzymes are
divided into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. This family may represent more than
one structural domain.
Length = 215
Score = 30.0 bits (68), Expect = 8.8
Identities = 12/28 (42%), Positives = 19/28 (67%)
Query: 762 QIKSEEKVEMEKEKRKKILREREEEKEE 789
+ + EEK M KE++K I E+E+ +EE
Sbjct: 95 KAQKEEKKAMSKEEKKAIKAEKEKLEEE 122
>gnl|CDD|234045 TIGR02876, spore_yqfD, sporulation protein YqfD. YqfD is part of
the sigma-E regulon in the sporulation program of
endospore-forming Gram-positive bacteria. Mutation
results in a sporulation defect in Bacillus subtilis.
Members are found in all currently known
endospore-forming bacteria, including the genera
Bacillus, Symbiobacterium, Carboxydothermus,
Clostridium, and Thermoanaerobacter [Cellular processes,
Sporulation and germination].
Length = 382
Score = 30.8 bits (70), Expect = 8.8
Identities = 25/131 (19%), Positives = 53/131 (40%), Gaps = 15/131 (11%)
Query: 675 KKIAQIGLFFEDKMKIGSNKYQKIKLKTSRFETPYQRFLKSKEYAKAIQEKVDKENAEQK 734
+K ++ L + +++ G K + L + ++ KE+AK E+ K+ +
Sbjct: 248 EKTVEVPLKTQREVRTG-KKIVRYYLNIGGKKIYLKK--PPKEFAKYDVEEKTKKFGLWR 304
Query: 735 KIDALTPQVKTEPEE--NVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPA 792
+ ++ E VK E V ++ + K + EKE +K++ + +
Sbjct: 305 W----SLPIEIVKETYYEVKEKVEKVTKEEAIEKAKEKAEKELKKEL------DPNAKIV 354
Query: 793 MKKIKVEKPEE 803
KI E+ E
Sbjct: 355 SDKILSERVEG 365
>gnl|CDD|235638 PRK05896, PRK05896, DNA polymerase III subunits gamma and tau;
Validated.
Length = 605
Score = 31.0 bits (70), Expect = 9.0
Identities = 13/91 (14%), Positives = 33/91 (36%), Gaps = 7/91 (7%)
Query: 729 ENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKE 788
NA K++ ++ +PE+ + + K+I E+ K + +
Sbjct: 361 INANPDKLENKKIEISIKPEKKTINNSQIITTKKIN-------IVEESNKNSVHFDTLYK 413
Query: 789 EQPAMKKIKVEKPEESNEKTKRHKIKSEPKE 819
+ K K+ + + + K ++ E +
Sbjct: 414 TKIFYHKKKINQNNKEQDIKKEELLEKEFVK 444
>gnl|CDD|220172 pfam09309, FCP1_C, FCP1, C-terminal. The C-terminal domain of
FCP-1 is required for interaction with the carboxy
terminal domain of RAP74. Interaction relies extensively
on van der Waals contacts between hydrophobic residues
situated within alpha-helices in both domains.
Length = 263
Score = 30.4 bits (68), Expect = 9.4
Identities = 15/54 (27%), Positives = 22/54 (40%), Gaps = 4/54 (7%)
Query: 763 IKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSE 816
I E + + EK+K E E+E+ QP K + P E + SE
Sbjct: 155 ILGEGSDDSDSEKKKPPGNEEEQEQAPQPR----KQQPPGPRREPDEGTPSSSE 204
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.316 0.132 0.383
Gapped
Lambda K H
0.267 0.0788 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 99,129,109
Number of extensions: 9908410
Number of successful extensions: 14188
Number of sequences better than 10.0: 1
Number of HSP's gapped: 13095
Number of HSP's successfully gapped: 578
Length of query: 1973
Length of database: 10,937,602
Length adjustment: 111
Effective length of query: 1862
Effective length of database: 6,014,308
Effective search space: 11198641496
Effective search space used: 11198641496
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 67 (29.5 bits)