RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy14859
(1097 letters)
>gnl|CDD|187699 cd09275, RNase_HI_RT_DIRS1, DIRS1 family of RNase HI in long-term
repeat retroelements. Ribonuclease H (RNase H) enzymes
are divided into two major families, Type 1 and Type 2,
based on amino acid sequence similarities and
biochemical properties. RNase H is an endonuclease that
cleaves the RNA strand of an RNA/DNA hybrid in a
sequence non-specific manner in the presence of divalent
cations. RNase H is widely present in various organisms,
including bacteria, archaea and eukaryotes. RNase HI
has also been observed as adjunct domains to the reverse
transcriptase gene in retroviruses, in long-term repeat
(LTR)-bearing retrotransposons and non-LTR
retrotransposons. RNase HI in LTR retrotransposons
perform degradation of the original RNA template,
generation of a polypurine tract (the primer for
plus-strand DNA synthesis), and final removal of RNA
primers from newly synthesized minus and plus strands.
The catalytic residues for RNase H enzymatic activity,
three aspartatic acids and one glutamatic acid residue
(DEDD), are unvaried across all RNase H domains.
Phylogenetic patterns of RNase HI of LTR retroelements
is classified into five major families, Ty3/Gypsy,
Ty1/Copia, Bel/Pao, DIRS1 and the vertebrate
retroviruses. The structural features of DIRS1-group
elements are different from typical LTR elements. RNase
H inhibitors have been explored as an anti-HIV drug
target because RNase H inactivation inhibits reverse
transcription.
Length = 120
Score = 117 bits (296), Expect = 6e-31
Identities = 51/116 (43%), Positives = 71/116 (61%)
Query: 774 ISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQS 833
+ TDAS GWG+ + S+ GLWS E++N HIN E+ AV AL L + V+V+S
Sbjct: 2 LFTDASLSGWGAVLSGSWAQGLWSAEERNKHINFLELLAVLLALQHWGARLSNRKVLVRS 61
Query: 834 DNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
DN T V+Y+ RQGGT+S LL+ ++ L ++ I + A+ IPG N AD LSR
Sbjct: 62 DNTTAVAYINRQGGTRSPELLALARQLVLWCEERNIWLRARHIPGVLNVAADRLSR 117
>gnl|CDD|239684 cd03714, RT_DIRS1, RT_DIRS1: Reverse transcriptases (RTs) occurring
in the DIRS1 group of retransposons. Members of the
subfamily include the Dictyostelium DIRS-1, Volvox
carteri kangaroo, and Panagrellus redivivus PAT
elements. These elements differ from LTR and
conventional non-LTR retrotransposons. They contain
split direct repeat (SDR) termini, and have been
proposed to integrate via double-stranded closed-circle
DNA intermediates assisted by an encoded recombinase
which is similar to gamma-site-specific integrase.
Length = 119
Score = 90.1 bits (224), Expect = 2e-21
Identities = 41/117 (35%), Positives = 66/117 (56%), Gaps = 1/117 (0%)
Query: 551 IDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRG 610
+DL AYFH+PI + L ++ G+ LPFGL+ AP+ F + + + LR G
Sbjct: 1 VDLKDAYFHIPILPRSRDLLGFAWQGETYQFKALPFGLSLAPRVFTKVVEALLAPLRLLG 60
Query: 611 MRVVVYLDDFLLVNQDPRILEIQGK-LAVSILGSLGWIVNLQKSSLSPAPVLQFLGI 666
+R+ YLDD L++ + E + L ++L +LG+ +NL+KS L P + FLG+
Sbjct: 61 VRIFSYLDDLLIIASSIKTSEAVLRHLRATLLANLGFTLNLEKSKLGPTQRITFLGL 117
>gnl|CDD|239685 cd03715, RT_ZFREV_like, RT_ZFREV_like: A subfamily of reverse
transcriptases (RTs) found in sequences similar to the
intact endogenous retrovirus ZFERV from zebrafish and to
Moloney murine leukemia virus RT. An RT gene is usually
indicative of a mobile element such as a retrotransposon
or retrovirus. RTs occur in a variety of mobile
elements, including retrotransposons, retroviruses,
group II introns, bacterial msDNAs, hepadnaviruses, and
caulimoviruses. These elements can be divided into two
major groups. One group contains retroviruses and DNA
viruses whose propagation involves an RNA intermediate.
They are grouped together with transposable elements
containing long terminal repeats (LTRs). The other
group, also called poly(A)-type retrotransposons,
contain fungal mitochondrial introns and transposable
elements that lack LTRs. Phylogenetic analysis suggests
that ZFERV belongs to a distinct group of retroviruses.
Length = 210
Score = 78.5 bits (194), Expect = 3e-16
Identities = 55/194 (28%), Positives = 91/194 (46%), Gaps = 7/194 (3%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGT-RPVLNLKGLNQFLSPKKFSLINHFR 537
HIQE+LE G+L + + + + V K G R V +L+ +NQ + P ++ N +
Sbjct: 20 HIQELLEAGILVP--CQSPWNTPILPVKKPGGNDYRMVQDLRLVNQAVLPIHPAVPNPYT 77
Query: 538 IPSFLQ-KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAF- 595
+ S L K + +DL+ A+F +P+ Q A + G T LP G +P F
Sbjct: 78 LLSLLPPKHQWYTVLDLANAFFSLPLAPDSQPLFAFEWEGQQYTFTRLPQGFKNSPTLFH 137
Query: 596 -ASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
A + L G ++ Y+DD LL ++ LG LG+ V+ +K+
Sbjct: 138 EALARDLAPFPLEHEGTILLQYVDDLLLAADSEEDCLKGTDALLTHLGELGYKVSPKKAQ 197
Query: 655 LSPAPVLQFLGIMW 668
+ A V +FLG++W
Sbjct: 198 ICRAEV-KFLGVVW 210
>gnl|CDD|238825 cd01647, RT_LTR, RT_LTR: Reverse transcriptases (RTs) from
retrotransposons and retroviruses which have long
terminal repeats (LTRs) in their DNA copies but not in
their RNA template. RT catalyzes DNA replication from an
RNA template, and is responsible for the replication of
retroelements. An RT gene is usually indicative of a
mobile element such as a retrotransposon or retrovirus.
RTs are present in a variety of mobile elements,
including retrotransposons, retroviruses, group II
introns, bacterial msDNAs, hepadnaviruses, and
Caulimoviruses.
Length = 177
Score = 72.6 bits (179), Expect = 2e-14
Identities = 34/135 (25%), Positives = 54/135 (40%), Gaps = 4/135 (2%)
Query: 487 GVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGD 546
G+++ S S + +V K +G R ++ + LN+ ++ L + L
Sbjct: 1 GIIEPSSSPYA--SPVVVVKKKDGKLRLCVDYRKLNKVTIKDRYPLPTIDELLEELAGAK 58
Query: 547 YMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLL 606
+DL Y +P+ + A + T +PFGL AP F L N + L
Sbjct: 59 VFSKLDLRSGYHQIPLAEESRPKTAFRTPFGLYEYTRMPFGLKNAPATFQRLMNKILGDL 118
Query: 607 RSRGMRVVVYLDDFL 621
V VYLDD L
Sbjct: 119 LGDF--VEVYLDDIL 131
>gnl|CDD|215698 pfam00078, RVT_1, Reverse transcriptase (RNA-dependent DNA
polymerase). A reverse transcriptase gene is usually
indicative of a mobile element such as a retrotransposon
or retrovirus. Reverse transcriptases occur in a variety
of mobile elements, including retrotransposons,
retroviruses, group II introns, bacterial msDNAs,
hepadnaviruses, and caulimoviruses.
Length = 194
Score = 69.3 bits (170), Expect = 3e-13
Identities = 41/192 (21%), Positives = 75/192 (39%), Gaps = 30/192 (15%)
Query: 505 VPKGNGGT-RPVLN----------LKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
+PK G RP+ + K L SP + + L+ + + +DL
Sbjct: 1 IPKKGKGKYRPISSVDYKILNKATKKRLEPIFSPPQPGFRPGRSTHNLLKGSKWFLKLDL 60
Query: 554 SQAYFHVPI--------------KTTHQRFLALSYNGD---VLAMTCLPFGLATAPQAFA 596
+A+ +P + R ++ NG+ LP G +P F
Sbjct: 61 KKAFDSIPHDPLDRPLTAFGLPQRFIRWRTFSVLVNGNPGGRYEWRGLPQGSVLSPLLFN 120
Query: 597 SLSNWVASLLRSR--GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
N + LR R G+ + Y DD L+ ++ P L+ + + L LG +N +K+
Sbjct: 121 LFMNELLRPLRKRFPGLTYLRYADDILIFSKSPEELQEILEEVLEFLKELGLKLNPEKTK 180
Query: 655 LSPAPVLQFLGI 666
++ + ++FLG
Sbjct: 181 ITHSDEVKFLGY 192
>gnl|CDD|187698 cd09274, RNase_HI_RT_Ty3, Ty3/Gypsy family of RNase HI in long-term
repeat retroelements. Ribonuclease H (RNase H) enzymes
are divided into two major families, Type 1 and Type 2,
based on amino acid sequence similarities and
biochemical properties. RNase H is an endonuclease that
cleaves the RNA strand of an RNA/DNA hybrid in a
sequence non-specific manner in the presence of divalent
cations. RNase H is widely present in various organisms,
including bacteria, archaea and eukaryotes. RNase HI
has also been observed as adjunct domains to the reverse
transcriptase gene in retroviruses, in long-term repeat
(LTR)-bearing retrotransposons and non-LTR
retrotransposons. RNase HI in LTR retrotransposons
perform degradation of the original RNA template,
generation of a polypurine tract (the primer for
plus-strand DNA synthesis), and final removal of RNA
primers from newly synthesized minus and plus strands.
The catalytic residues for RNase H enzymatic activity,
three aspartatic acids and one glutamatic acid residue
(DEDD), are unvaried across all RNase H domains.
Phylogenetic patterns of RNase HI of LTR retroelements
is classified into five major families, Ty3/Gypsy,
Ty1/Copia, Bel/Pao, DIRS1 and the vertebrate
retroviruses. Ty3/Gypsy family widely distributed among
the genomes of plants, fungi and animals. RNase H
inhibitors have been explored as an anti-HIV drug target
because RNase H inactivation inhibits reverse
transcription.
Length = 121
Score = 57.1 bits (139), Expect = 8e-10
Identities = 38/126 (30%), Positives = 62/126 (49%), Gaps = 17/126 (13%)
Query: 773 FISTDASDLGWG---SQVDS------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPL 823
+ TDASD G G SQ+D +F S + ++N+ +KE+ A+ AL
Sbjct: 1 VLETDASDFGIGAVLSQLDDGKERPIAFASRKLTPAERNYSTYEKELLAIVFALKKFRHY 60
Query: 824 LQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSV 883
L V +D++++ YL + K L+ + + LL Q++ I ++ PG N V
Sbjct: 61 LLGRKFTVYTDHKSLK-YLLTK---KDLN--PRLARWLLLLQEFDFEI--EYRPGKENVV 112
Query: 884 ADSLSR 889
AD+LSR
Sbjct: 113 ADALSR 118
>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
Validated.
Length = 824
Score = 51.5 bits (124), Expect = 2e-06
Identities = 35/157 (22%), Positives = 51/157 (32%), Gaps = 4/157 (2%)
Query: 49 DQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAP 108
+P V+ P P +A AA P+ A + + A P
Sbjct: 642 APAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQP 701
Query: 109 LPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKAS 168
P+ + P AA Q A AS P AA PLP +P + A
Sbjct: 702 APAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLP--PEPDDPPDP--AGAPAQ 757
Query: 169 RGQVAQPPPPSESVTPIPLSPVSSDQEDFSEEDEVVD 205
P P + P SP S ++E ++ +D
Sbjct: 758 PPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMD 794
Score = 48.8 bits (117), Expect = 1e-05
Identities = 29/155 (18%), Positives = 41/155 (26%), Gaps = 2/155 (1%)
Query: 49 DQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAP 108
V+ P + G A P A + A + + A P
Sbjct: 652 HHPKHVAVPDASDGGDGWPAKA--GGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATP 709
Query: 109 LPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKAS 168
+ P PP + A + P P P+ A A+
Sbjct: 710 PAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAA 769
Query: 169 RGQVAQPPPPSESVTPIPLSPVSSDQEDFSEEDEV 203
P PPSE S D ED + +EV
Sbjct: 770 PAAAPPPSPPSEEEEMAEDDAPSMDDEDRRDAEEV 804
Score = 45.0 bits (107), Expect = 2e-04
Identities = 38/203 (18%), Positives = 56/203 (27%), Gaps = 25/203 (12%)
Query: 39 AITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAP 98
A + PA A + ++APA A AP AA + A P
Sbjct: 602 APASSGPPEEAARPAAPAAPAAPAAPAPAGA-AAAPAEASAAPAPGVAAPEHHPKHVAVP 660
Query: 99 GSSASV--------LAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPF 150
+S AAP +P A + A AQPA + P A
Sbjct: 661 DASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQA------ 714
Query: 151 FCDPSYYGHYLQSAMKASRGQVAQPPPPSESVTPIPLSPVSSDQEDFSEEDEVVDCNPPA 210
A + + P + P+PL P D D + P
Sbjct: 715 ----------DDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAP 764
Query: 211 LFSFAPSTKEREPSIPDPDSELA 233
+ AP+ + +
Sbjct: 765 APAAAPAAAPPPSPPSEEEEMAE 787
Score = 40.7 bits (96), Expect = 0.005
Identities = 30/115 (26%), Positives = 45/115 (39%), Gaps = 3/115 (2%)
Query: 30 ERCLAQLVQAITARKS-STLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAH 88
ER L ++ + R + +PA + P A+ +P ++A A AP AA
Sbjct: 371 ERGLLARLERLERRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAP-AAAP 429
Query: 89 QPSATVQAAPGSSASVLAAPLPSS-SGQPFISPPAQSAAFLAQPASTASLPPSAA 142
QP+ AP + AP + S P +P AQ A A + P A
Sbjct: 430 QPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAP 484
Score = 37.7 bits (88), Expect = 0.037
Identities = 21/102 (20%), Positives = 30/102 (29%)
Query: 34 AQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSAT 93
A A A ++ A P A + P + S P AA
Sbjct: 408 AAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPA 467
Query: 94 VQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTA 135
A + AP P ++ P +P A +A A A
Sbjct: 468 PAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDA 509
Score = 36.9 bits (86), Expect = 0.076
Identities = 19/93 (20%), Positives = 31/93 (33%), Gaps = 4/93 (4%)
Query: 51 SPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLP 110
+PA + A+ + + A P A AP A + + + P
Sbjct: 405 APAAAPAPAAAAP--AAAAAPAPAAAPQPAPAP--APAPAPPSPAGNAPAGGAPSPPPAA 460
Query: 111 SSSGQPFISPPAQSAAFLAQPASTASLPPSAAH 143
+ S QP +P A A + + P AA
Sbjct: 461 APSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAA 493
Score = 36.5 bits (85), Expect = 0.092
Identities = 28/140 (20%), Positives = 45/140 (32%), Gaps = 13/140 (9%)
Query: 57 PGMASGVSDQSPPLS-SAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQ 115
+G + P + +A A A A + AAP + + AP P S
Sbjct: 387 VAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAG 446
Query: 116 PFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQP 175
+ A S A P++ + P+AA P A A+ A P
Sbjct: 447 NAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAP------------APAPPAAPAPAAAP 494
Query: 176 PPPSESVTPIPLSPVSSDQE 195
P+ P ++ +E
Sbjct: 495 AAPAAPAAPAGADDAATLRE 514
Score = 36.1 bits (84), Expect = 0.13
Identities = 20/108 (18%), Positives = 32/108 (29%), Gaps = 17/108 (15%)
Query: 49 DQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAP--------------VQAAHQPSATV 94
+ A + A + P ++ P A AP + +
Sbjct: 405 APAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSA 464
Query: 95 QAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAA 142
Q AP +A+ P+ + PA +AA A A A A
Sbjct: 465 QPAPAPAAAPEPTAAPAPAPPA---APAPAAAPAAPAAPAAPAGADDA 509
Score = 32.7 bits (75), Expect = 1.3
Identities = 24/165 (14%), Positives = 42/165 (25%), Gaps = 32/165 (19%)
Query: 73 APANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPA 132
P+ + + + A G+ A AAP +++ P +P +AA A
Sbjct: 364 LPSASDDERGLLARLERLERRLGVAGGAGAPAAAAP-SAAAAAPAAAPAPAAAA--PAAA 420
Query: 133 STASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPPPPSESVTPIPLSPVSS 192
+ + + A A PP P +P
Sbjct: 421 AAPAPAAAPQP-----------------------APAPAPAPAPPSPAGNAPAGGAPSPP 457
Query: 193 DQEDFSEEDEVVDCNPPALFSFAPSTKEREPSIPDPDSELASQGV 237
S + P T P+ P + A+
Sbjct: 458 PAAAPSAQPAPAPAAAP------EPTAAPAPAPPAAPAPAAAPAA 496
>gnl|CDD|187690 cd06222, RNase_H, RNase H is an endonuclease that cleaves the RNA
strand of an RNA/DNA hybrid in a sequence non-specific
manner. Ribonuclease H (RNase H) enzymes are divided
into two major families, Type 1 and Type 2, based on
amino acid sequence similarities and biochemical
properties. RNase H is an endonuclease that cleaves the
RNA strand of an RNA/DNA hybrid in a sequence
non-specific manner in the presence of divalent cations.
RNase H is widely present in various organisms,
including bacteria, archaea and eukaryotes. Most
prokaryotic and eukaryotic genomes contain multiple
RNase H genes. Despite the lack of amino acid sequence
homology, Type 1 and type 2 RNase H share a main-chain
fold and steric configurations of the four acidic
active-site residues and have the same catalytic
mechanism and functions in cells. RNase H is involved in
DNA replication, repair and transcription. One of the
important functions of RNase H is to remove Okazaki
fragments during DNA replication. RNase H inhibitors
have been explored as an anti-HIV drug target because
RNase H inactivation inhibits reverse transcription.
Length = 123
Score = 45.0 bits (107), Expect = 1e-05
Identities = 22/87 (25%), Positives = 42/87 (48%), Gaps = 7/87 (8%)
Query: 805 INKKEMFAVHQAL--SLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFL 862
N+ E+ A+ +AL +L+L L + + +D++ VV + + L + + L
Sbjct: 39 NNEAELLALLEALELALDLGLKKLIIE---TDSKYVVDLINSWSKGWKKNNLLLWDILLL 95
Query: 863 LSQDWRIHILAQFIPGAYNSVADSLSR 889
LS+ I + +P N VAD L++
Sbjct: 96 LSKFIDIRF--EHVPREGNEVADRLAK 120
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 48.4 bits (115), Expect = 3e-05
Identities = 38/210 (18%), Positives = 57/210 (27%), Gaps = 12/210 (5%)
Query: 39 AITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAP 98
A A + +P G A+ P A P + P A P +
Sbjct: 2728 ARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPA 2787
Query: 99 GSSASVLAAPLPS----SSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLP--FFC 152
+S S LPS + + PA + A PA P SA P P
Sbjct: 2788 VASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPP 2847
Query: 153 DPSYYGHYLQSAMKASRGQVAQPPPPSESVTP------IPLSPVSSDQEDFSEEDEVVDC 206
G + R ++ P + + VS E F+ + +
Sbjct: 2848 PSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPER 2907
Query: 207 NPPALFSFAPSTKEREPSIPDPDSELASQG 236
P P + + P P P
Sbjct: 2908 PPQPQAPPPPQPQPQPPPPPQPQPPPPPPP 2937
Score = 46.1 bits (109), Expect = 1e-04
Identities = 24/137 (17%), Positives = 40/137 (29%)
Query: 51 SPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLP 110
A P + SP L +APA P + P + P
Sbjct: 2716 VSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAP 2775
Query: 111 SSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASRG 170
++ ++ PA ++ ++ + + P+ P P S
Sbjct: 2776 AAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQ 2835
Query: 171 QVAQPPPPSESVTPIPL 187
A PPPP +PL
Sbjct: 2836 PTAPPPPPGPPPPSLPL 2852
Score = 42.6 bits (100), Expect = 0.001
Identities = 37/187 (19%), Positives = 49/187 (26%), Gaps = 45/187 (24%)
Query: 51 SPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQA-------------------AHQPS 91
+PA + P AS PP S+ P P P + P+
Sbjct: 2813 APAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPA 2872
Query: 92 ATVQAAPGSSASVLAAPLPSSSGQPFISPP--------------AQSAAFLAQPASTASL 137
A A LA P S S + F PP Q P
Sbjct: 2873 AKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPP 2932
Query: 138 PPSAAHHLYPLPFFCDPS------------YYGHYLQSAMKASRGQVAQPPPPSESVTPI 185
PP PL DP+ + G + + R +V QP P E+
Sbjct: 2933 PPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASS 2992
Query: 186 PLSPVSS 192
Sbjct: 2993 TPPLTGH 2999
Score = 35.7 bits (82), Expect = 0.18
Identities = 42/180 (23%), Positives = 61/180 (33%), Gaps = 23/180 (12%)
Query: 51 SPAVSDPGMASGVSDQSPPL-SSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPL 109
P P S P + QAS+P Q + +A + GS S+ P
Sbjct: 2646 VPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAA--RPTVGSLTSLADPPP 2703
Query: 110 PSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASR 169
P + +P +P A +A P A+ S A P P G A
Sbjct: 2704 PPPTPEP--APHALVSATPLPPGPAAARQASPALPAAPAP---PAVPAG----PATPGGP 2754
Query: 170 GQVAQPPPPSESVTPI-PLSPVSSDQEDFSEEDEVVDCNPPALFSFAPSTKEREPSIPDP 228
+ A+PP + P P +P + + PA+ S + S E PS DP
Sbjct: 2755 ARPARPPTTAGPPAPAPPAAPAAGPPRRLT---------RPAVASLSESR-ESLPSPWDP 2804
Score = 32.6 bits (74), Expect = 1.7
Identities = 24/80 (30%), Positives = 38/80 (47%), Gaps = 3/80 (3%)
Query: 57 PGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQP 116
P + SD +P ++ A + + A + A P +TV AA + VLA + + S P
Sbjct: 1115 PLLVQFFSDHAPRIARAVSEAINAGSAAVATASPESTVDAAVRAHG-VLADAVAALS--P 1171
Query: 117 FISPPAQSAAFLAQPASTAS 136
+ PA AFL A +A+
Sbjct: 1172 AVRDPACPLAFLVALADSAA 1191
Score = 32.2 bits (73), Expect = 2.2
Identities = 23/117 (19%), Positives = 34/117 (29%), Gaps = 6/117 (5%)
Query: 39 AITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAP 98
R+S+ +P G DQ+ P + PA+ + A P P
Sbjct: 381 PTRKRRSARHAATPF----ARGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLP 436
Query: 99 GSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPS 155
+ P P QP PA A +T + P P D +
Sbjct: 437 SAEPGSDDGPAPPPERQP--PAPATEPAPDDPDDATRKALDALRERRPPEPPGADLA 491
>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
Length = 1355
Score = 47.4 bits (112), Expect = 5e-05
Identities = 41/239 (17%), Positives = 66/239 (27%), Gaps = 20/239 (8%)
Query: 53 AVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAA----HQPSATVQAAPG-SSASVLAA 107
+++P + + + +AP PV + PV + QP+ Q PG + + A
Sbjct: 315 PITEPVAVAAAATTATQSWAAPVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPVIA 374
Query: 108 PLPSS--SGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAM 165
P P + P Q L QP + A P Y Q A
Sbjct: 375 PAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQ----PYYAPAPEQPAQ 430
Query: 166 KASRGQVAQPPPPSESVTPIPLSPVSSDQEDFSEEDEVVDCNPPALFSFAPSTKEREPSI 225
+ + P + + Q + E P +
Sbjct: 431 QPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQP------QPV 484
Query: 226 PDPDSELASQGVVCQKLGSPAWNRVRYTESQRAL---QAGGVFQPLAIPSEFPHSNSST 281
V K P E +RA Q +QP+ P + P S+
Sbjct: 485 EQQPVVEPEPVVEETKPARPPLYYFEEVEEKRAREREQLAAWYQPIPEPVKEPEPIKSS 543
Score = 32.0 bits (72), Expect = 2.5
Identities = 30/146 (20%), Positives = 39/146 (26%), Gaps = 8/146 (5%)
Query: 52 PAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPS 111
P A+ Q P + AP P Q A QP A S A
Sbjct: 403 PQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTY 462
Query: 112 SSGQPFISPPAQSAAFLAQPA---STASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKAS 168
+ Q + P AQ + P P P YY ++
Sbjct: 463 QTEQTYQQPAAQEPLYQQPQPVEQQPVVEPEPVVEETKPAR---PPLYYFEEVEEKRARE 519
Query: 169 RGQVAQ--PPPPSESVTPIPLSPVSS 192
R Q+A P P P P+
Sbjct: 520 REQLAAWYQPIPEPVKEPEPIKSSLK 545
Score = 31.2 bits (70), Expect = 4.0
Identities = 24/114 (21%), Positives = 41/114 (35%), Gaps = 10/114 (8%)
Query: 5 PKRDTSDPQPEAKR-------LCHLEEKTEKIERCLAQLVQAITARKSSTLDQSPAVSDP 57
++ +P+P + L + EE EK R QL A + + P
Sbjct: 485 EQQPVVEPEPVVEETKPARPPLYYFEEVEEKRAREREQL--AAWYQPIPEPVKEPEPIKS 542
Query: 58 GMASGVSDQSPPLSSAPA-NPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLP 110
+ + PP+ +A A +P+ + +A AAP S + P P
Sbjct: 543 SLKAPSVAAVPPVEAAAAVSPLASGVKKATLATGAAATVAAPVFSLANSGGPRP 596
>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 618
Score = 45.5 bits (108), Expect = 1e-04
Identities = 28/134 (20%), Positives = 47/134 (35%), Gaps = 5/134 (3%)
Query: 69 PLSSAPANPV-QASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAF 127
P ++A A + P + A A ++A AA +++ P P A A
Sbjct: 366 PAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAP 425
Query: 128 LAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPPPPSESVTPIPL 187
+A PA+ A P +A P P+ A +VA P + +
Sbjct: 426 VAAPAAAA--PAAAPAAA-PAAVALAPAPP-AQAAPETVAIPVRVAPEPAVASAAPAPAA 481
Query: 188 SPVSSDQEDFSEED 201
+P ++ E D
Sbjct: 482 APAAARLTPTEEGD 495
Score = 40.9 bits (96), Expect = 0.004
Identities = 24/104 (23%), Positives = 35/104 (33%), Gaps = 5/104 (4%)
Query: 38 QAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAA 97
A + +PA + AS + AP A+AP A A V A
Sbjct: 389 PAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVALA 448
Query: 98 PGSSASVLAAPLPSSSG---QPFISPPAQSAAFLAQPASTASLP 138
P + AAP + P + + + A A PA+ P
Sbjct: 449 P--APPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTP 490
Score = 40.5 bits (95), Expect = 0.005
Identities = 25/112 (22%), Positives = 38/112 (33%), Gaps = 4/112 (3%)
Query: 39 AITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQ-AAHQPSATVQAA 97
A A K + A + V+ + + A A ASAP A P A V A
Sbjct: 373 AAPAEKKTPARPEAAAPAA---APVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAP 429
Query: 98 PGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLP 149
++ + A P++ P + +A P A P A+ P
Sbjct: 430 AAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAA 481
Score = 35.1 bits (81), Expect = 0.21
Identities = 25/142 (17%), Positives = 43/142 (30%), Gaps = 29/142 (20%)
Query: 87 AHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLY 146
A +P+A +AA + A P ++ ++ A + A A PA+ AS P +
Sbjct: 363 AFKPAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPP---- 418
Query: 147 PLPFFCDPSYYGHYLQSAMKASRGQVAQPPPPSESVTPIPLSPVSSDQEDFSEEDEVVDC 206
A A+ A P+ + + L+P Q
Sbjct: 419 ------------AAAPPAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQ------------ 454
Query: 207 NPPALFSFAPSTKEREPSIPDP 228
P EP++
Sbjct: 455 -AAPETVAIPVRVAPEPAVASA 475
>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 700
Score = 44.5 bits (105), Expect = 3e-04
Identities = 42/195 (21%), Positives = 62/195 (31%), Gaps = 32/195 (16%)
Query: 34 AQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSAT 93
AQ A A ++ + + P A + + +++APA A + AA Q SA
Sbjct: 383 AQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASA- 441
Query: 94 VQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCD 153
+ G+ A AP P+++ P A A+ A+ P AA P P
Sbjct: 442 -RGPGGAPA---PAPAPAAAPAAAARPAAAGPR--PVAAAAAAAPARAAPAAAPAP---- 491
Query: 154 PSYYGHYLQSAMKASRGQVAQPPPPSESVTPIPLSPVSSDQEDFSEEDEVVDCNPPALFS 213
PPP E + P SP + + PA
Sbjct: 492 ------------------ADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPAT-- 531
Query: 214 FAPSTKEREPSIPDP 228
A E P P
Sbjct: 532 -ADPDDAFETLAPAP 545
Score = 33.7 bits (77), Expect = 0.65
Identities = 30/134 (22%), Positives = 42/134 (31%), Gaps = 26/134 (19%)
Query: 34 AQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSAT 93
A+ V A AR+S + A P + A P A+ P A +P A
Sbjct: 416 ARAVAAAPARRSPAPEALAAARQASARGPGG-APAPAPAPAAAPAAAARPAAAGPRPVAA 474
Query: 94 VQAAPGSSASVLAAPLPSSSGQP----------FISPPAQSAAFLAQ------------- 130
AA + A+ AAP P+ P +P AA
Sbjct: 475 AAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADP 534
Query: 131 --PASTASLPPSAA 142
T + P+AA
Sbjct: 535 DDAFETLAPAPAAA 548
Score = 32.9 bits (75), Expect = 1.1
Identities = 26/110 (23%), Positives = 36/110 (32%), Gaps = 18/110 (16%)
Query: 50 QSPAVSDPGMASGVSDQSPP----------------LSSAPANPVQASAPVQAAHQPSAT 93
+PA + P A +D PP +APA V S P A P
Sbjct: 478 AAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDA 537
Query: 94 VQA-APGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAA 142
+ AP +A+ ++ PP SA+ L P AA
Sbjct: 538 FETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLP-DMFDGDWPALAA 586
>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
Validated.
Length = 830
Score = 44.1 bits (104), Expect = 5e-04
Identities = 34/184 (18%), Positives = 52/184 (28%), Gaps = 11/184 (5%)
Query: 52 PAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSA---SVLAAP 108
PAV+ G G + A+A +A V A G++ + AA
Sbjct: 360 PAVTGGGAPGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAA 419
Query: 109 LPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKAS 168
+ P P +A A + P+ A+ CD S ++
Sbjct: 420 ATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASA 479
Query: 169 RGQVAQPPPPSESVTP--------IPLSPVSSDQEDFSEEDEVVDCNPPALFSFAPSTKE 220
A P E P + S ED PPA + P+
Sbjct: 480 PASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAA 539
Query: 221 REPS 224
P+
Sbjct: 540 AAPA 543
Score = 41.8 bits (98), Expect = 0.002
Identities = 20/107 (18%), Positives = 33/107 (30%), Gaps = 1/107 (0%)
Query: 37 VQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAP-ANPVQASAPVQAAHQPSATVQ 95
+ +P + + + + P A+ ASAP A +A
Sbjct: 433 PATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEP 492
Query: 96 AAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAA 142
A ++ S + P + + A A PA A P AA
Sbjct: 493 APRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAA 539
Score = 41.4 bits (97), Expect = 0.003
Identities = 33/156 (21%), Positives = 50/156 (32%), Gaps = 13/156 (8%)
Query: 49 DQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAA-PGSSASVLAA 107
A + + ++A A+A A T AA P + A AA
Sbjct: 492 PAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAAAAPAARAGGAAA 551
Query: 108 PLP---------SSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYG 158
L SS + A+ AA A A+ P A + P P +
Sbjct: 552 ALDVLRNAGMRVSSDRGARAAAAAKPAA--APAAAPKPAAPRVAVQV-PTPRARAATGDA 608
Query: 159 HYLQSAMKASRGQVAQPPPPSESVTPIPLSPVSSDQ 194
+A + PPP E + P P+S+D+
Sbjct: 609 PPNGAARAEQAAESRGAPPPWEDIPPDDYVPLSADE 644
Score = 41.0 bits (96), Expect = 0.003
Identities = 32/161 (19%), Positives = 49/161 (30%), Gaps = 7/161 (4%)
Query: 37 VQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQ-PSATVQ 95
A S+ + G A + ++ P A AP A + A
Sbjct: 386 RAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADG 445
Query: 96 AAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPS 155
AP + + A S + PPA S + A PAS A PP AA P +
Sbjct: 446 DAPVPAKANARASADSRCDERDAQPPADSGSASA-PASDA--PPDAAFEPAPRAAAPSAA 502
Query: 156 YYGHYLQSAMKASRGQVAQPPPPSESVTPIPLSPVSSDQED 196
+ A+ + P + P P + +
Sbjct: 503 TPAAVPDARAPAAASREDAPAAAAP---PAPEARPPTPAAA 540
Score = 36.0 bits (83), Expect = 0.13
Identities = 23/94 (24%), Positives = 29/94 (30%), Gaps = 7/94 (7%)
Query: 49 DQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAP 108
D D + S P S AP + AP AA + A A AA
Sbjct: 460 DSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAAT---PAAVPDARAPAA- 515
Query: 109 LPSSSGQPFISPPAQSAAFLAQPASTASLPPSAA 142
S P + A A P A+ P + A
Sbjct: 516 -ASREDAP--AAAAPPAPEARPPTPAAAAPAARA 546
Score = 30.6 bits (69), Expect = 5.8
Identities = 28/163 (17%), Positives = 46/163 (28%), Gaps = 24/163 (14%)
Query: 87 AHQPSATVQAAPGSSASVL-----------AAPLPSSSGQPFISPPAQSAAFLAQPASTA 135
A +P+ T APG AA +S P ++ +A P + A
Sbjct: 357 AFEPAVTGGGAPGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAA 416
Query: 136 SLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPPPPSESVTPIPLSPVSSDQE 195
+ + A P P +A + P P+++ +++
Sbjct: 417 AAAATRAE---APPAAPAPP------ATADRGDDAADGDAPVPAKANARASADSRCDERD 467
Query: 196 DFSEEDEVVDCNPPALFSFAPSTKEREPSIP-DPDSELASQGV 237
D P S AP EP+ S V
Sbjct: 468 AQPPADSGSASAPA---SDAPPDAAFEPAPRAAAPSAATPAAV 507
>gnl|CDD|238587 cd01182, INT_REC_C, DNA breaking-rejoining enzymes,
intergrase/recombinases, C-terminal catalytic domain. The
tyrosine recombinase/integrase family share the same
catalytic domain containing six conserved active site
residues. The best-studied members of this diverse family
include the bacteriophage lambda integrase, the
bacteriophage P1 Cre recombinase, the yeast Flp
recombinase and the bacterial XerD/C recombinases. Their
overall reaction mechanism is essentially identical and
involves cleavage of a single strand of a DNA duplex by
nucleophilic attack of a conserved tyrosine to give a 3'
phosphotyrosyl protein-DNA adduct. In the second
rejoining step, a terminal 5' hydroxyl attacks the
covalent adduct to release the enzyme and generate duplex
DNA. Many intergrase/recombinases also have N-terminal
domains, which show little sequence or structure
similarity.
Length = 162
Score = 40.6 bits (95), Expect = 0.001
Identities = 37/154 (24%), Positives = 55/154 (35%), Gaps = 22/154 (14%)
Query: 935 HVAILLLLSSGRRVHDLTLLSLDPDHFQELDDFVVFWPVFGSKTDSSSHLQSGWKIKENS 994
ILLLL +G RV +L L+L LD + +KT
Sbjct: 18 RALILLLLYTGLRVSEL--LALRWSDID-LDKGTITVR--RTKT--------------GK 58
Query: 995 SDPLFCIPTWIRHLST-LSQRRMGSRPLTSLFITTRGIVQPASRSVIAGWVKTALRGANI 1053
+ P L L RR +P LF + RG + +R + +K A + A I
Sbjct: 59 ERTVPLSPELAELLREYLELRRPAPKPDDYLFPSRRGGPKRLTRRAVRRLLKKAGKRAGI 118
Query: 1054 I--ASPGSMRSAVATFRYNSDLPLDSILRQGNWR 1085
+P +R AT + +PL+ I
Sbjct: 119 PERLTPHDLRHTFATRLLEAGVPLEVIQELLGHS 152
>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
Length = 1560
Score = 42.0 bits (98), Expect = 0.002
Identities = 30/102 (29%), Positives = 47/102 (46%), Gaps = 11/102 (10%)
Query: 45 SSTLDQSPAVSDPGMA----SGVSDQSPPLSSAPANPV-------QASAPVQAAHQPSAT 93
S+T +P S+P + S + PLS+AP + A+ +AA+QP+A
Sbjct: 440 SNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSNAPPSSAKDHHSAYHAAYQHRAANQPAAN 499
Query: 94 VQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTA 135
+ A +A+ +S G PF S P SA + A+TA
Sbjct: 500 LPTANQPAANNFHGAAGNSVGNPFASRPFGSAPYGGNAATTA 541
>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
Length = 1228
Score = 41.8 bits (99), Expect = 0.002
Identities = 21/79 (26%), Positives = 38/79 (48%), Gaps = 1/79 (1%)
Query: 64 SDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQ 123
+D P ++AP A+A +A + +A + + AA P++ +P + A
Sbjct: 34 ADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAA 93
Query: 124 SAAFLAQPASTASLPPSAA 142
+A A PA+ A+ P+AA
Sbjct: 94 AAPA-APPAAAAAAAPAAA 111
Score = 38.7 bits (91), Expect = 0.018
Identities = 17/82 (20%), Positives = 31/82 (37%)
Query: 49 DQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAP 108
D P + A+ + + + A A +A A A +A A P +A+ AA
Sbjct: 35 DYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAA 94
Query: 109 LPSSSGQPFISPPAQSAAFLAQ 130
P++ + +AA +
Sbjct: 95 APAAPPAAAAAAAPAAAAVEDE 116
Score = 37.2 bits (87), Expect = 0.063
Identities = 21/87 (24%), Positives = 35/87 (40%), Gaps = 3/87 (3%)
Query: 55 SDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSG 114
+D G S + + ++A A A+AP A A A + A A P P+++
Sbjct: 34 ADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPA---APPKPAAAA 90
Query: 115 QPFISPPAQSAAFLAQPASTASLPPSA 141
+P A AA A + A++
Sbjct: 91 AAAAAPAAPPAAAAAAAPAAAAVEDEV 117
Score = 33.3 bits (77), Expect = 0.90
Identities = 18/68 (26%), Positives = 28/68 (41%)
Query: 41 TARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGS 100
A ++ + A S P A + P + PA A+ P AA +A AAP +
Sbjct: 42 AAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPA 101
Query: 101 SASVLAAP 108
+A+ A
Sbjct: 102 AAAAAAPA 109
>gnl|CDD|236776 PRK10856, PRK10856, cytoskeletal protein RodZ; Provisional.
Length = 331
Score = 40.8 bits (96), Expect = 0.003
Identities = 21/108 (19%), Positives = 32/108 (29%), Gaps = 5/108 (4%)
Query: 38 QAITARKSSTLDQSPAVSDPGMASGVS---DQSPPLSSAPANPVQASAPVQAAHQPSATV 94
+A ++ DQS A V + ++ PA + P+
Sbjct: 137 KAQQEEITTMADQSSAELSQNSGQSVPLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVAT 196
Query: 95 QAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAA 142
AP A PS + + + A A P A LP A
Sbjct: 197 APAPAVDPQQNAVVAPSQANVD--TAATPAPAAPATPDGAAPLPTDQA 242
Score = 37.7 bits (88), Expect = 0.024
Identities = 17/106 (16%), Positives = 33/106 (31%), Gaps = 3/106 (2%)
Query: 41 TARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAP-VQAAHQPSATVQAAPG 99
+A S QS + + + P + P + P V A P+ Q
Sbjct: 151 SAELSQNSGQSVPLDT-STTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAV 209
Query: 100 SSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHL 145
+ S ++ + P +A A ++ P + + L
Sbjct: 210 VAPSQANVDTAATPAPAAPATPDGAAPLPTDQAGVST-PAADPNAL 254
Score = 33.8 bits (78), Expect = 0.46
Identities = 16/71 (22%), Positives = 25/71 (35%)
Query: 38 QAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAA 97
A + T Q+PAV+ + Q+ ++ + AN A+ P AA
Sbjct: 178 PAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPL 237
Query: 98 PGSSASVLAAP 108
P A V
Sbjct: 238 PTDQAGVSTPA 248
Score = 30.8 bits (70), Expect = 4.0
Identities = 16/104 (15%), Positives = 28/104 (26%), Gaps = 1/104 (0%)
Query: 34 AQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSAT 93
A+L Q + + + D +P S PA + V Q +
Sbjct: 152 AELSQNSGQSVPLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAV-DPQQNAVV 210
Query: 94 VQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASL 137
+ + AP ++ P A A +L
Sbjct: 211 APSQANVDTAATPAPAAPATPDGAAPLPTDQAGVSTPAADPNAL 254
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 40.4 bits (95), Expect = 0.004
Identities = 34/177 (19%), Positives = 54/177 (30%), Gaps = 15/177 (8%)
Query: 23 EEKTEKIERCLAQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASA 82
EE +KI+ + + A K P P +S + P
Sbjct: 122 EEVAQKIKYAKWKAARIHKALKEGE---DPNPGPPLDEEDEDADVATTNSDNSFP---GE 175
Query: 83 PVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAA 142
A + ++ S + P SS PPA P+S S P +
Sbjct: 176 DADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPA--------PSSFQSDTPPPS 227
Query: 143 HHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPPPPSESVTPIPLSPVSSDQEDFSE 199
P P + A+P PPS S TP P+ ++ D + ++
Sbjct: 228 PESPTNPSP-PPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGITLDDDAIAK 283
Score = 29.7 bits (67), Expect = 8.6
Identities = 24/139 (17%), Positives = 37/139 (26%), Gaps = 19/139 (13%)
Query: 98 PGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYY 157
P A ++S F A +++ S PPS++ + P +
Sbjct: 153 PLDEEDEDADVATTNSDNSFPGEDAD------PASASPSDPPSSSPGVPSFPSPPEDPSS 206
Query: 158 GHYLQSAMKASRGQVAQPPPPSESVTPIPLSPVSSDQEDFSEEDEVVDCNPPALFSFAPS 217
PP PS + P S PP S
Sbjct: 207 PSDSS-----------LPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLS 255
Query: 218 TKEREPSIPDPDSELASQG 236
T +P+ P + A G
Sbjct: 256 T--AKPTPPSASATPAPIG 272
>gnl|CDD|205634 pfam13456, RVT_3, Reverse transcriptase-like. This domain is found
in plants and appears to be part of a retrotransposon.
Length = 88
Score = 37.2 bits (87), Expect = 0.004
Identities = 19/63 (30%), Positives = 36/63 (57%), Gaps = 6/63 (9%)
Query: 829 VMVQSDNQTVVSYLRRQGGTKSL--SLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADS 886
++V+SD+Q VV ++ + +S +LL E+ K LL + + + +P N VAD+
Sbjct: 25 LIVESDSQLVVQQIQGEYEARSRLAALLREIRK--LLKKFDSVSV--SHVPRECNRVADA 80
Query: 887 LSR 889
L++
Sbjct: 81 LAK 83
>gnl|CDD|234383 TIGR03895, protease_PatA, cyanobactin maturation protease,
PatA/PatG family. This model describes a protease
domain associated with the maturation of various members
of the cyanobactin family of ribosomally produced,
heavily modified bioactive metabolites. Members include
the PatA protein and C-terminal domain of the PatG
protein of Prochloron didemni, TenA and a region of TenG
from Nostoc spongiaeforme var. tenue, etc.
Length = 602
Score = 40.9 bits (96), Expect = 0.004
Identities = 35/151 (23%), Positives = 47/151 (31%), Gaps = 25/151 (16%)
Query: 12 PQPEAKRLCHLE-------EKTEKIERCLA------QLVQAITARKSS----TLDQSPAV 54
P P+A R L E E+ ERCLA +Q +T S T +Q
Sbjct: 185 PDPQAVRTALLNSAIPCDPEDDEECERCLAGRLNIPGAIQLLTGETMSEGLVTSEQDGVE 244
Query: 55 SDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSG 114
G + +S S P APV + + A LP++
Sbjct: 245 EASGCGVQGTIESST-SVIPPGRAAEPAPVSIPVAAPGEGATPAAAQIELSAGVLPNA-- 301
Query: 115 QPFISPPAQSAAFLAQPASTASLPPSAAHHL 145
PA + T S PSA L
Sbjct: 302 ----ISPATPVRPASNGV-TPSQAPSAEPQL 327
>gnl|CDD|236382 PRK09111, PRK09111, DNA polymerase III subunits gamma and tau;
Validated.
Length = 598
Score = 40.7 bits (96), Expect = 0.005
Identities = 20/69 (28%), Positives = 31/69 (44%)
Query: 48 LDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAA 107
L + P G P + A A A+A AA P+A + A P ++A+ A
Sbjct: 387 LQEGPPSPGGGGGGPPGGGGAPGAPAAAAAPGAAAAAPAAGGPAAALAAVPDAAAAAAAP 446
Query: 108 PLPSSSGQP 116
P P+++ QP
Sbjct: 447 PAPAAAPQP 455
Score = 31.0 bits (71), Expect = 3.7
Identities = 13/75 (17%), Positives = 20/75 (26%)
Query: 36 LVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQ 95
++ + S G A G + +A A P A P A
Sbjct: 383 ALRRLQEGPPSPGGGGGGPPGGGGAPGAPAAAAAPGAAAAAPAAGGPAAALAAVPDAAAA 442
Query: 96 AAPGSSASVLAAPLP 110
AA + + P
Sbjct: 443 AAAPPAPAAAPQPAV 457
>gnl|CDD|236733 PRK10672, PRK10672, rare lipoprotein A; Provisional.
Length = 361
Score = 39.7 bits (93), Expect = 0.008
Identities = 31/104 (29%), Positives = 40/104 (38%), Gaps = 15/104 (14%)
Query: 54 VSDPGMA-SGVSDQS------PPLSSAPANPVQASAPV-QAAHQP-------SATVQAAP 98
+S PG A + V+ QS P LS P AP Q P S AP
Sbjct: 177 LSGPGTAGTTVAKQSYALPARPDLSGGMGTPSVQPAPAPQGDVLPVSNSTLKSEDPTGAP 236
Query: 99 GSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAA 142
+S+ L AP + G S P +A A + A+ P AA
Sbjct: 237 VTSSGFLGAPTTLAPGVLEGSEPTPTAPSSAPATAPAAAAPQAA 280
Score = 30.4 bits (69), Expect = 4.8
Identities = 19/80 (23%), Positives = 29/80 (36%), Gaps = 15/80 (18%)
Query: 34 AQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPL------SSAPANPVQASAPVQAA 87
V ++ +STL P +SG L S P +SAP
Sbjct: 216 QGDVLPVS---NSTLKSEDPTGAPVTSSGFLGAPTTLAPGVLEGSEPTPTAPSSAPA--- 269
Query: 88 HQPSATVQAAPGSSASVLAA 107
+A AAP ++A+ +A
Sbjct: 270 ---TAPAAAAPQAAATSSSA 286
>gnl|CDD|183756 PRK12799, motB, flagellar motor protein MotB; Reviewed.
Length = 421
Score = 38.5 bits (89), Expect = 0.015
Identities = 24/105 (22%), Positives = 38/105 (36%)
Query: 37 VQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQA 96
++ T K + V+ +S V+ S S+ A P A P Q + T QA
Sbjct: 284 IEKATGLKQIDTHGTVPVAAVTPSSAVTQSSAITPSSAAIPSPAVIPSSVTTQSATTTQA 343
Query: 97 APGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSA 141
+ + +S P + P A+ QP ST S+
Sbjct: 344 SAVALSSAGVLPSDVTLPGTVALPAAEPVNMQPQPMSTTETQQSS 388
Score = 37.4 bits (86), Expect = 0.042
Identities = 24/131 (18%), Positives = 48/131 (36%), Gaps = 4/131 (3%)
Query: 21 HLEEKTEKIERCLAQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQA 80
+L+ + IE+ A ++ I + + S +S ++ S + S P
Sbjct: 276 NLDNRALDIEK--ATGLKQIDTHGTVPVAAVTPSSAVTQSSAITPSSAAIPSPAVIPSSV 333
Query: 81 SAPVQAAHQPSATVQAAPGSSASVLAAPLPSS--SGQPFISPPAQSAAFLAQPASTASLP 138
+ Q SA ++ G S + P + + +P P + Q +ST ++
Sbjct: 334 TTQSATTTQASAVALSSAGVLPSDVTLPGTVALPAAEPVNMQPQPMSTTETQQSSTGNIT 393
Query: 139 PSAAHHLYPLP 149
+A LP
Sbjct: 394 STANGPTTSLP 404
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1. Members
of this family are necessary for accurate chromosome
transmission during cell division.
Length = 804
Score = 39.0 bits (91), Expect = 0.017
Identities = 25/123 (20%), Positives = 32/123 (26%), Gaps = 4/123 (3%)
Query: 65 DQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQS 124
Q+P L P + P + A P P Q F+ P+Q+
Sbjct: 167 QQAPQLPQPPQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQA 226
Query: 125 AAFLAQPAST-ASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPPPPSESVT 183
A P PP L P + Q PPP T
Sbjct: 227 PAQPPLPPQLPQQPPPLQQPQFPGLSQQMPP---PPPQPPQQQQQPPQPQAQPPPQNQPT 283
Query: 184 PIP 186
P P
Sbjct: 284 PHP 286
Score = 33.2 bits (76), Expect = 0.91
Identities = 21/85 (24%), Positives = 24/85 (28%), Gaps = 5/85 (5%)
Query: 62 GVSDQSPPLSSAPANPVQASAPVQAAHQ--PSATVQAAPGSSASVLAAPLPSSSGQPFIS 119
G+S Q PP P P Q P Q Q P PG APLP +
Sbjct: 250 GLSQQMPP--PPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQG-QNAPLPPPQQPQLLP 306
Query: 120 PPAQSAAFLAQPASTASLPPSAAHH 144
Q P L +
Sbjct: 307 LVQQPQGQQRGPQFREQLVQLSQQQ 331
Score = 29.7 bits (67), Expect = 9.7
Identities = 26/122 (21%), Positives = 33/122 (27%), Gaps = 6/122 (4%)
Query: 68 PPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQ---S 124
P L P + QP A S + A L P + P Q
Sbjct: 123 PSLWGTAPKPEPQPPQAPES-QPQPQTPAQKMLSLEEVEAQLQQRQQAPQLPQPPQQVLP 181
Query: 125 AAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPPPPSESVTP 184
+ A+ P YP P P Q + A AQPP P +
Sbjct: 182 QGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQP--QQFLPAPSQAPAQPPLPPQLPQQ 239
Query: 185 IP 186
P
Sbjct: 240 PP 241
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 39.0 bits (91), Expect = 0.019
Identities = 45/287 (15%), Positives = 70/287 (24%), Gaps = 46/287 (16%)
Query: 1 MASNPKRDTSDPQPEAKRLCHLEEKTEKIERCLAQLV--QAITARKSSTLDQSPAVSDPG 58
A + + P P E L + + + S
Sbjct: 60 AACDRFEPPTGPPPGPG---TEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDP 116
Query: 59 MASGVSDQSPPLSSAPA-------------NPVQASAPVQAAHQPSATVQAAPGSSASVL 105
SPP S AP P + A+ A+ A+ +A L
Sbjct: 117 PPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPL 176
Query: 106 AAPLPS--SSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCD--------PS 155
++P + + P PP + A P P +A P P S
Sbjct: 177 SSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASS 236
Query: 156 YYGHYLQSAMKASRGQVAQPPPP-----------------SESVTPIPLSPVSSDQEDFS 198
+S+ + P P S P P S SS +E
Sbjct: 237 SDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSP 296
Query: 199 EEDEVVDCNPPALFSFAPSTKEREPSIPDPDSELASQGVVCQKLGSP 245
+ PA S ++ S S +S +
Sbjct: 297 SPSPSSPGSGPAP-SSPRASSSSSSSRESSSSSTSSSSESSRGAAVS 342
Score = 34.8 bits (80), Expect = 0.30
Identities = 19/109 (17%), Positives = 33/109 (30%), Gaps = 6/109 (5%)
Query: 45 SSTLDQSPAVSDPGMASGV------SDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAP 98
SST S + ++ G S PP + P++P + P +A P+A+
Sbjct: 327 SSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPT 386
Query: 99 GSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYP 147
A A P + + + + L P
Sbjct: 387 RRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYPLLTP 435
Score = 30.9 bits (70), Expect = 4.6
Identities = 23/105 (21%), Positives = 36/105 (34%), Gaps = 10/105 (9%)
Query: 45 SSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASV 104
S T D S P G + + PP S+ ++ +P+A A G +
Sbjct: 823 SHTPDGGSESSGPARPPGAAARPPPARSSESSK----------SKPAAAGGRARGKNGRR 872
Query: 105 LAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLP 149
P + +PP +AA A P+ L P+P
Sbjct: 873 RPRPPEPRARPGAAAPPKAAAAAPPAGAPAPRPRPAPRVKLGPMP 917
>gnl|CDD|238823 cd01645, RT_Rtv, RT_Rtv: Reverse transcriptases (RTs) from
retroviruses (Rtvs). RTs catalyze the conversion of
single-stranded RNA into double-stranded viral DNA for
integration into host chromosomes. Proteins in this
subfamily contain long terminal repeats (LTRs) and are
multifunctional enzymes with RNA-directed DNA
polymerase, DNA directed DNA polymerase, and
ribonuclease hybrid (RNase H) activities. The viral RNA
genome enters the cytoplasm as part of a nucleoprotein
complex, and the process of reverse transcription
generates in the cytoplasm forming a linear DNA duplex
via an intricate series of steps. This duplex DNA is
colinear with its RNA template, but contains terminal
duplications known as LTRs that are not present in viral
RNA. It has been proposed that two specialized template
switches, known as strand-transfer reactions or "jumps",
are required to generate the LTRs.
Length = 213
Score = 37.6 bits (88), Expect = 0.019
Identities = 46/203 (22%), Positives = 82/203 (40%), Gaps = 20/203 (9%)
Query: 475 AMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLIN 534
A++ + E L+ G ++ ST+ + + +F++ K +G R + +L+ +N +L
Sbjct: 16 ALTELVTEQLKEGHIEP--STSPWNTPVFVIKKKSGKWRLLHDLRAVNAQTQDM-GALQP 72
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQR------FLALSYNGDVLAM----TCL 584
P+ L KG +I +DL +F +P+ H F S N A L
Sbjct: 73 GLPHPAALPKGWPLIVLDLKDCFFSIPL---HPDDRERFAFTVPSINNKGPAKRYQWKVL 129
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVV--YLDDFLLVNQDPRILEIQGKLAVSILG 642
P G+ +P S R + +V+ Y+DD L+ + L + L
Sbjct: 130 PQGMKNSPTICQSFVAQALEPFRKQYPDIVIYHYMDDILIASDLEGQLREIYEELRQTLL 189
Query: 643 SLGWIVNLQKSSLSPAPVLQFLG 665
G + +K P Q+LG
Sbjct: 190 RWGLTIPPEKVQKE--PPFQYLG 210
>gnl|CDD|200219 TIGR02927, SucB_Actino, 2-oxoglutarate dehydrogenase, E2 component,
dihydrolipoamide succinyltransferase. This model
represents an Actinobacterial clade of E2 enzyme, a
component of the 2-oxoglutarate dehydrogenase complex
involved in the TCA cycle. These proteins have multiple
domains including the catalytic domain (pfam00198), one
or two biotin domains (pfam00364) and an E3-component
binding domain (pfam02817).
Length = 579
Score = 38.5 bits (89), Expect = 0.021
Identities = 36/121 (29%), Positives = 51/121 (42%), Gaps = 17/121 (14%)
Query: 3 SNPKRDTSDPQPEAKRLCHL---EEKTEKIERCLAQLVQAITARKSSTLDQSPAVSDPGM 59
S K DT P P A L + E+ T ++ LA + A A +++PA S+ G
Sbjct: 164 STDKVDTEIPSPVAGTLLEIRAPEDDTVEVGTVLAIIGDANAAPAEPAEEEAPAPSEAG- 222
Query: 60 ASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFIS 119
S PA A AP AA P A A ++A AAP+ S P+++
Sbjct: 223 ------------SEPAPDPAARAP-HAAPDPPAPAPAPAKTAAPAAAAPVSSGDSGPYVT 269
Query: 120 P 120
P
Sbjct: 270 P 270
>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
Provisional.
Length = 357
Score = 38.0 bits (87), Expect = 0.024
Identities = 33/129 (25%), Positives = 55/129 (42%), Gaps = 9/129 (6%)
Query: 23 EEKTEKIERCLAQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASA 82
E+ E+ R A A + ++ +P+ A+ + + + A A P +A+A
Sbjct: 182 EKDRERARREDAAAAAAAKQKAAAKKAAAPSGKKSAKAAAPAKAAAAPAKAAAPPAKAAA 241
Query: 83 PVQAAHQPSATVQAAPGSSAS--VLAAPLPSSSGQP---FISPPAQSAAFLAQ----PAS 133
A A A P +A+ AA P+ + P +PPA++AA A+ PA
Sbjct: 242 APAKAAAAPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAAPAK 301
Query: 134 TASLPPSAA 142
A+ P AA
Sbjct: 302 AAAAPAKAA 310
Score = 36.5 bits (83), Expect = 0.064
Identities = 26/105 (24%), Positives = 42/105 (40%), Gaps = 1/105 (0%)
Query: 39 AITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPS-ATVQAA 97
A A+ ++ ++ A A+ +PP +A A+ P +AA P+ A A
Sbjct: 227 AAPAKAAAPPAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPA 286
Query: 98 PGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAA 142
++ AA P+ + A A A PA A+ P AA
Sbjct: 287 KAAAPPAKAAAAPAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKAA 331
Score = 34.2 bits (77), Expect = 0.40
Identities = 30/109 (27%), Positives = 47/109 (43%), Gaps = 9/109 (8%)
Query: 39 AITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAP 98
A A+ ++ ++ A A +PP +A A+ P +AA P A A P
Sbjct: 234 APPAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPP-AKAAAPP 292
Query: 99 GSSASV---LAAPLPSSSGQP--FISPPAQSAAFLAQPASTASLPPSAA 142
+A+ AA ++ P +PPA++A A PA A+ P AA
Sbjct: 293 AKAAAAPAKAAAAPAKAAAAPAKAAAPPAKAA---APPAKAATPPAKAA 338
>gnl|CDD|220944 pfam11018, Cuticle_3, Pupal cuticle protein C1. Insect cuticles
are composite structures whose mechanical properties are
optimised for biological function. The major components
are the chitin filament system and the cuticular
proteins, and the cuticle's properties are determined
largely by the interactions between these two sets of
molecules. The proteins can be ordered by species.
Length = 164
Score = 36.5 bits (84), Expect = 0.026
Identities = 22/91 (24%), Positives = 30/91 (32%), Gaps = 5/91 (5%)
Query: 61 SGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISP 120
S V +S A + A P+ AAP +A AAP + +
Sbjct: 34 SSVRKSDTRIS-NNAYQPAYAKTAYAYAAPAVYAAAAPVYAAHAYAAPAVHYAAAAHYAA 92
Query: 121 PAQSAAFLAQPASTA----SLPPSAAHHLYP 147
PA + A PA TA + P Y
Sbjct: 93 PAYAKYAYAAPAVTAKAAYAAPAPVYKTAYA 123
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 37.8 bits (88), Expect = 0.027
Identities = 30/179 (16%), Positives = 46/179 (25%), Gaps = 15/179 (8%)
Query: 59 MASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAP--LPSSSGQP 116
SGV++ S PV P+ +QA P P P+++
Sbjct: 40 SQSGVAEFPWDPSPQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPA 99
Query: 117 FISPPAQSAAFLAQPASTASLPPSAAH--HLYPLPFFCDPSYYGHYLQ---------SAM 165
+ PA A Q P P D LQ ++
Sbjct: 100 TFALPAGPAGPTIQTEPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPASG 159
Query: 166 KASRGQVAQPPPPSESVTPIPLSPVSSDQEDFSEEDEVVDC--NPPALFSFAPSTKERE 222
+ Q + + P Q D + +DE A + K E
Sbjct: 160 QLPSQQQSAQKNDESQLQQQPNGETPPQQTDGAGDDESEALVRLREADGTLEQRIKGAE 218
>gnl|CDD|236652 PRK10118, PRK10118, flagellar hook-length control protein;
Provisional.
Length = 408
Score = 37.9 bits (88), Expect = 0.029
Identities = 23/102 (22%), Positives = 32/102 (31%), Gaps = 5/102 (4%)
Query: 37 VQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQA 96
V T D A D + L+SA Q QP + A
Sbjct: 166 VLPAEKPTLLTKDMPSAPQDETHTLSSDEHEKGLTSAQLTTAQPDDAPGTPAQPLTPLAA 225
Query: 97 APGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLP 138
+ A V++ P P ++ SP QP TA+ P
Sbjct: 226 EAQAKAEVISTPSPVTAA---ASPTITPHQ--TQPLPTAAAP 262
>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
Length = 663
Score = 38.1 bits (88), Expect = 0.031
Identities = 18/107 (16%), Positives = 29/107 (27%), Gaps = 10/107 (9%)
Query: 84 VQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAH 143
++ A + + A + +V+AAP + P + PA +
Sbjct: 348 LKTASLTAPSRVLAAAAKVAVIAAPQTHTGPADRQRPQRPDGIPYSVPARSPMTAYPPVP 407
Query: 144 HLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPPPPSESVTPIPLSPV 190
P P S P P V P P +P
Sbjct: 408 QFCGDPGLVSPYNPQSPGTSY----------GPEPVGPVPPQPTNPY 444
>gnl|CDD|221404 pfam12067, Sox_C_TAD, Sox C-terminal transactivation domain. This
domain is found at the C-terminus of the Sox family of
transcription factors. It is found associated with
pfam00505. It binds to the Armadillo repeats (pfam00514)
in Catenin beta-1 (CTNNB1), which is involved in
transcriptional regulation. It functions as a
transactivating domain (TAD).
Length = 197
Score = 36.6 bits (85), Expect = 0.035
Identities = 19/76 (25%), Positives = 27/76 (35%), Gaps = 1/76 (1%)
Query: 107 APLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGH-YLQSAM 165
+P + A + A L Q A T ++P + YYG Y+
Sbjct: 49 QMMPYGYNSSYAPQNAHAPALLRQMAVTENIPQGSPAPSIMGCPTPPQMYYGQMYVPECA 108
Query: 166 KASRGQVAQPPPPSES 181
K Q+ Q PP ES
Sbjct: 109 KHHPVQLGQLSPPPES 124
>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
Validated.
Length = 647
Score = 37.5 bits (88), Expect = 0.040
Identities = 26/157 (16%), Positives = 43/157 (27%), Gaps = 25/157 (15%)
Query: 71 SSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQ 130
P P Q++AP +A +A A A + P S +P A L +
Sbjct: 365 LPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPAS-------APQQAPAVPLPE 417
Query: 131 PASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPPPPSES----VTPIP 186
S +A L + K S A P S + +
Sbjct: 418 TTSQLL---AARQQLQRAQ-----------GATKAKKSEPAAASRARPVNSALERLASVR 463
Query: 187 LSPVSSDQEDFSEEDEVVDCNPPALFSFAPSTKEREP 223
+P + ++ +E P P +
Sbjct: 464 PAPSALEKAPAKKEAYRWKATNPVEVKKEPVATPKAL 500
Score = 37.2 bits (87), Expect = 0.061
Identities = 24/136 (17%), Positives = 34/136 (25%), Gaps = 26/136 (19%)
Query: 77 PVQASAPVQAAHQPSATV--QAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPAST 134
P + Q +A A + + +A P + P S P Q PA
Sbjct: 361 PAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQ------APAVP 414
Query: 135 ASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPPPPSESVTPIPLSPVSSDQ 194
S L + + R Q A SE PV+S
Sbjct: 415 LPETTSQL------------------LAARQQLQRAQGATKAKKSEPAAASRARPVNSAL 456
Query: 195 EDFSEEDEVVDCNPPA 210
E + A
Sbjct: 457 ERLASVRPAPSALEKA 472
Score = 31.8 bits (73), Expect = 2.6
Identities = 18/123 (14%), Positives = 34/123 (27%), Gaps = 20/123 (16%)
Query: 34 AQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQ---- 89
A +A+ ++ + A +P + A P S + A Q
Sbjct: 373 QSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRA 432
Query: 90 ----------PSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPP 139
P+A +A P +SA A + + + A + + P
Sbjct: 433 QGATKAKKSEPAAASRARPVNSALERLASVRPA------PSALEKAPAKKEAYRWKATNP 486
Query: 140 SAA 142
Sbjct: 487 VEV 489
>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
Length = 333
Score = 36.9 bits (86), Expect = 0.045
Identities = 19/96 (19%), Positives = 27/96 (28%), Gaps = 1/96 (1%)
Query: 49 DQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAP 108
P + Q P P P+Q Q A P V +AP +
Sbjct: 113 VPPQHAPRPAQPAPQPVQQPAYQPQPEQPLQQPVSPQVAPAP-QPVHSAPQPAQQAFQPA 171
Query: 109 LPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHH 144
P ++ QP + + AAHH
Sbjct: 172 EPVAAPQPEPVAEPAPVMDKPKRKEAVIVMNVAAHH 207
Score = 33.5 bits (77), Expect = 0.62
Identities = 38/169 (22%), Positives = 46/169 (27%), Gaps = 18/169 (10%)
Query: 43 RKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSA 102
R S D D GV + N A+A A +PS Q P ++
Sbjct: 43 RMKSKRDDDSYDEDVEDDEGVGEVRVH----RVNHAPANAQEHEAARPSPQHQYQPPYAS 98
Query: 103 SVLAAPLPSSSGQPFISPPAQSAAFLAQPA-STASLPPSAAHHLYPLPFFCDPSYYGHYL 161
A P P P Q A AQPA P PL P
Sbjct: 99 ---AQPRQPVQQPPEAQVPPQHAPRPAQPAPQPVQQPAYQPQPEQPLQQPVSPQV----- 150
Query: 162 QSAMKASRGQVAQPPPPSESVTPIPLSPVSSDQEDFSEEDEVVDCNPPA 210
A Q P P PV++ Q + E V P
Sbjct: 151 -----APAPQPVHSAPQPAQQAFQPAEPVAAPQPEPVAEPAPVMDKPKR 194
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 37.4 bits (86), Expect = 0.058
Identities = 45/193 (23%), Positives = 65/193 (33%), Gaps = 20/193 (10%)
Query: 52 PAVSDPGMASGVSDQSPPLSSAPANPVQASA-PVQAAHQPSATVQAAPGSSASVLAAPLP 110
P++ P A+ PP SA A P Q S Q A QP + S+ S+ LP
Sbjct: 179 PSIQVPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPSPLSLISAPSLHPQRLP 238
Query: 111 SSSGQPFISPPAQSAAFLAQPASTASLP--------PSAAHHLYPLPFFCDPSYYGHYLQ 162
S P + P S PA ++ P P H L P F
Sbjct: 239 SP--HPPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPPMPHALQQGPVFLQHPSSNPPQP 296
Query: 163 SAMKASRGQVAQPPPPSESVTPIPLSPVSSDQEDFSEEDEVVDCNPPALFSFAPSTKERE 222
+ S+ P ++ + P S + + E + PP APS +
Sbjct: 297 FGLAQSQVPPLPLPSQAQPHSHTPPSQSALQPQQPPREQPL----PP-----APSMPHIK 347
Query: 223 PSIPDPDSELASQ 235
P P +L +Q
Sbjct: 348 PPPTTPIPQLPNQ 360
Score = 33.9 bits (77), Expect = 0.55
Identities = 32/134 (23%), Positives = 47/134 (35%), Gaps = 21/134 (15%)
Query: 66 QSPPLSSAPANP----VQASAPVQAAHQPSATVQAAPGSSA--------SVLAAPLPSSS 113
QS PL S PA P S P +A+ P + + + P S L A P S
Sbjct: 411 QSQPLQSVPAQPPVLTQSQSLPPKASTHPHSGLHSGPPQSPFAQHPFTSGGLPAIGPPPS 470
Query: 114 GQP-FISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASRGQV 172
+ P ++++ P S A PLP + + + +
Sbjct: 471 LPTSTPAAPPRASSGSQPPGSALPSSGGCAGPGPPLPPI-------QIKEEPLDEAE-EP 522
Query: 173 AQPPPPSESVTPIP 186
PPPP S +P P
Sbjct: 523 ESPPPPPRSPSPEP 536
Score = 33.1 bits (75), Expect = 1.1
Identities = 39/199 (19%), Positives = 59/199 (29%), Gaps = 30/199 (15%)
Query: 48 LDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAA 107
+DQ S P + S ++S SSA +Q P ++Q PG++ + A
Sbjct: 143 IDQDNRSSSPSIPSPQDNESDSDSSAQQQLLQPQGP--------PSIQVPPGAALAPSAP 194
Query: 108 PLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKA 167
P P + A Q + A+ P PL PS + L S
Sbjct: 195 P-----------PTPSAQAVPPQGSPIAAQPAPQPQQPSPLSLISAPSLHPQRLPSPHPP 243
Query: 168 SRGQVAQPPPPSESVTPIPLSPVSSDQEDFSEEDEVVDCNPPALFSFAPSTKEREPSIPD 227
+ Q A P + P P SS PP + + +
Sbjct: 244 LQPQTASQQSP-QPPAPSSRHPQSSHHGPG----------PPMPHALQQGPVFLQHPSSN 292
Query: 228 PDSELASQGVVCQKLGSPA 246
P L P+
Sbjct: 293 PPQPFGLAQSQVPPLPLPS 311
Score = 32.0 bits (72), Expect = 2.2
Identities = 65/277 (23%), Positives = 83/277 (29%), Gaps = 44/277 (15%)
Query: 39 AITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAP 98
+I + L S P A V Q P+++ PA Q +P+ PS Q P
Sbjct: 180 SIQVPPGAALAPSAPPPTPS-AQAVPPQGSPIAAQPAPQPQQPSPLSLISAPSLHPQRLP 238
Query: 99 G---------SSASVLAAPLPSSS---------GQPFISPPAQSAAFLAQPASTASLPPS 140
+S P PSS G P Q FL P+S P
Sbjct: 239 SPHPPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPPMPHALQQGPVFLQHPSSNPPQPFG 298
Query: 141 AAHHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPP-----PPSESV--------TPIPL 187
A P PS + + S Q QPP PP+ S+ TPIP
Sbjct: 299 LAQSQVPPLPL--PSQAQPHSHTPPSQSALQPQQPPREQPLPPAPSMPHIKPPPTTPIPQ 356
Query: 188 SPVSSDQE--DFSEEDEVVDCN-----PPALFSFAPSTKEREPSIPDPDSELASQGVVCQ 240
P S + PPAL + PS P +L Q Q
Sbjct: 357 LPNQSHKHPPHLQGPSPFPQMPSNLPPPPALKPLSSLPTHHPPSAHPPPLQLMPQS---Q 413
Query: 241 KLGSPAWNRVRYTESQRALQAGGVFQPLAIPSEFPHS 277
L S T+SQ + S P S
Sbjct: 414 PLQSVPAQPPVLTQSQSLPPKASTHPHSGLHSGPPQS 450
>gnl|CDD|235904 PRK06995, flhF, flagellar biosynthesis regulator FlhF; Validated.
Length = 484
Score = 36.9 bits (86), Expect = 0.061
Identities = 25/112 (22%), Positives = 32/112 (28%), Gaps = 8/112 (7%)
Query: 40 ITARKSSTLD--QSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAA 97
I A S L PA + P A +P S PA P AP H T Q
Sbjct: 40 IVALADSDLAALAPPAAAAPAAAQPPPAAAPAAVSRPAAPAAEPAPWLVEHAKRLTAQRE 99
Query: 98 P------GSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAH 143
+A AP + + + A A +AA
Sbjct: 100 QLVARAAAPAAPEAQAPAAPAERAAAENAARRLARAAAAAPRPRVPADAAAA 151
Score = 31.9 bits (73), Expect = 2.2
Identities = 14/97 (14%), Positives = 25/97 (25%), Gaps = 9/97 (9%)
Query: 52 PAVSDPGMASGVSDQS----PPLSSAPANPVQASAPV--QAAHQPSATVQAAPGSSASVL 105
P + P S + + P L + +AA + QA +
Sbjct: 65 PPAAAPAAVSRPAAPAAEPAPWLVEHAKRLTAQREQLVARAAAPAAPEAQAPAAPAERAA 124
Query: 106 AAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAA 142
A + + P + A+ A A
Sbjct: 125 AENAARRLARAAAAAP---RPRVPADAAAAVADAVKA 158
>gnl|CDD|237000 PRK11855, PRK11855, dihydrolipoamide acetyltransferase; Reviewed.
Length = 547
Score = 36.7 bits (86), Expect = 0.070
Identities = 15/55 (27%), Positives = 23/55 (41%)
Query: 78 VQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPA 132
V A+AP AA +A AA ++ + A + + + A A A PA
Sbjct: 194 VAAAAPAAAAAPAAAAPAAAAAAAPAPAPAAAAAPAAAAPAAAAAPGKAPHASPA 248
>gnl|CDD|215533 PLN02983, PLN02983, biotin carboxyl carrier protein of acetyl-CoA
carboxylase.
Length = 274
Score = 36.4 bits (84), Expect = 0.072
Identities = 30/90 (33%), Positives = 35/90 (38%), Gaps = 10/90 (11%)
Query: 33 LAQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSA 92
L QL + RK L Q P A V Q PP + P A+ P PSA
Sbjct: 126 LKQLDCELVIRKKEALPQPPPP-----APVVMMQPPPPHAMPPASPPAAQPA-----PSA 175
Query: 93 TVQAAPGSSASVLAAPLPSSSGQPFISPPA 122
+ P + AS A P SS P SP A
Sbjct: 176 PASSPPPTPASPPPAKAPKSSHPPLKSPMA 205
Score = 33.7 bits (77), Expect = 0.41
Identities = 16/61 (26%), Positives = 21/61 (34%), Gaps = 6/61 (9%)
Query: 79 QASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLP 138
APV P A+ A P+SS P +P + A A +S P
Sbjct: 145 PPPAPVVMMQPPPPHAMPPASPPAAQPAPSAPASSPPP--TPASPPPA----KAPKSSHP 198
Query: 139 P 139
P
Sbjct: 199 P 199
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 36.7 bits (85), Expect = 0.074
Identities = 29/111 (26%), Positives = 39/111 (35%), Gaps = 12/111 (10%)
Query: 33 LAQLVQAITARKSSTLDQ---SPAVSDPGMASGVSDQSPPLSSAPA-NPVQASAPVQAAH 88
LAQL Q + P + P + P ++A + +P Q+SA Q +
Sbjct: 359 LAQLTQKGDDASGGRGPKQHIKPVFTQP-----AAAPQPSAAAAASPSPSQSSAAAQPSA 413
Query: 89 QPSATVQAA---PGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTAS 136
SAT A S A P+ S P PAQ P S S
Sbjct: 414 PQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKKIPVSKVS 464
Score = 34.4 bits (79), Expect = 0.44
Identities = 31/173 (17%), Positives = 56/173 (32%), Gaps = 23/173 (13%)
Query: 29 IERCLAQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAH 88
+E L QL Q ++ + P + + + P ++A A+P + + A
Sbjct: 352 VELTLIQLAQLTQKGDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAA-- 409
Query: 89 QPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPL 148
QPSA A + + + P P +A +PA
Sbjct: 410 QPSAPQSATQPAGTPP-TVSVDPPAAVP--VNPPSTAPQAVRPAQFKEEKKIP------- 459
Query: 149 PFFCDPSYYGHYLQSAMKASRGQVAQPPPPSESVTPIPLSPVSSDQEDFSEED 201
S S ++ + + Q I +P + +E F+EED
Sbjct: 460 -----VSKVSSLGPSTLRPIQEKAEQAT------GNIKEAPTGTQKEIFTEED 501
>gnl|CDD|238822 cd01644, RT_pepA17, RT_pepA17: Reverse transcriptase (RTs) in
retrotransposons. This subfamily represents the RT
domain of a multifunctional enzyme. C-terminal to the RT
domain is a domain homologous to aspartic proteinases
(corresponding to Merops family A17) encoded by
retrotransposons and retroviruses. RT catalyzes DNA
replication from an RNA template and is responsible for
the replication of retroelements.
Length = 213
Score = 35.4 bits (82), Expect = 0.095
Identities = 34/172 (19%), Positives = 53/172 (30%), Gaps = 45/172 (26%)
Query: 535 HFRIPSFLQKGDYMISIDLSQAY--FHVPIKTTH-QRFLALSYNGD----VLAMTCLPFG 587
FR +G +S D+ + + V + RFL + MT +PFG
Sbjct: 54 RFR------QGKIAVSADIEKMFHQVKVRPEDRDVLRFLWRKDGDEPKPIEYRMTVVPFG 107
Query: 588 LATAPQAFASLSNWVASLLRSRGMRVVV--------YLDDFL-----------LVNQDPR 628
A+AP L+N Y+DD L + +
Sbjct: 108 AASAP----FLANRALKQHAEDHPHEAAAKIIKRNFYVDDILVSTDTLNEAVNVAKRLIA 163
Query: 629 ILEIQG-KLA------VSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLD 673
+L+ G L +L L L + LG+ W+P D
Sbjct: 164 LLKKGGFNLRKWASNSQEVLDDLPEERVLLDRDSDVT--EKTLGLRWNPKTD 213
>gnl|CDD|218597 pfam05466, BASP1, Brain acid soluble protein 1 (BASP1 protein).
This family consists of several brain acid soluble
protein 1 (BASP1) or neuronal axonal membrane protein
NAP-22. The BASP1 is a neuron enriched Ca(2+)-dependent
calmodulin-binding protein of unknown function.
Length = 233
Score = 35.6 bits (81), Expect = 0.10
Identities = 47/194 (24%), Positives = 73/194 (37%), Gaps = 9/194 (4%)
Query: 7 RDTSDPQPEAKRLCHLEEKTEKIERCLAQLVQAITARKSSTLDQSPAVSDPGMASGVSDQ 66
++ + Q A+ E K EK ++ AQ T K + + A + A ++
Sbjct: 37 KENEEAQAAAETTEVKEAKEEKPDK-DAQDTANKTEEKEGEKEAAAAKEEAPKAE--PEK 93
Query: 67 SPPLSSAPANPVQASAPVQ---AAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQ 123
+ + A A P +AS P Q AA P+A +A S AS A + + + S
Sbjct: 94 TEGAAEAKAEPPKASDPEQEPAAAPGPAAGGEAPKASEASSQPAESAAPAKEEEKSKEEG 153
Query: 124 SAAFLAQPASTASLPPS---AAHHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPPPPSE 180
A PA+ A S A P PS + +S + + P P+E
Sbjct: 154 EAKKTEAPAAAAQETKSDAAPASDSKPSSSEAAPSSKETPAATEAPSSTAKASAPAAPAE 213
Query: 181 SVTPIPLSPVSSDQ 194
V P +SDQ
Sbjct: 214 EVKPSEAPAANSDQ 227
>gnl|CDD|237863 PRK14949, PRK14949, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 944
Score = 35.9 bits (83), Expect = 0.13
Identities = 32/176 (18%), Positives = 56/176 (31%), Gaps = 13/176 (7%)
Query: 33 LAQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSA 92
LA VQA A + ++ +PA + + Q A+A A SA
Sbjct: 385 LAAAVQAPHANEPQFVNAAPAEKKTALTEQTTAQ--------QQVQAANAEAVAEADASA 436
Query: 93 TVQAAPGSSASVLAAPLPS-SSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLP-- 149
+ + L + ++ Q I AQS F A +S + + +
Sbjct: 437 EPADTVEQALDDESELLAALNAEQAVILSQAQSQGFEAS-SSLDADNSAVPEQIDSTAEQ 495
Query: 150 FFCDPSYYGHYLQSAMKASRGQVAQPPPPSESVTPIPLSPVSSDQEDFSEEDEVVD 205
+PS + AS A L D+ D++++ +D
Sbjct: 496 SVVNPSVTDTQVDD-TSASNNSAADNTVDDNYSAEDTLESNGLDEGDYAQDSAPLD 550
Score = 30.8 bits (70), Expect = 4.7
Identities = 35/167 (20%), Positives = 50/167 (29%), Gaps = 24/167 (14%)
Query: 2 ASNPKRDTSDPQPEAKRLCHLEEKTEKIERCLAQLVQAITARKSSTLDQS---PAVSDPG 58
A +PK + + A R + KT A L + ++ +S S +
Sbjct: 628 ALSPK-EGDGKKSSADR----KPKTPPSRAPPASLSKPASSPDASQTSASFDLDPDFELA 682
Query: 59 MASGVSDQSPPLSSAPANPVQA---------SAPVQAAHQPSATVQAAPGSSASVLAAPL 109
V + + SAPA P AP A+ A S SV A
Sbjct: 683 THQSVPEAALASGSAPAPPPVPDPYDRPPWEEAPEVASANDGPNNAAEGNLSESVEDASN 742
Query: 110 PSSSG-------QPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLP 149
QP + AQS A T+S +L L
Sbjct: 743 SELQAVEQQATHQPQVQAEAQSPASTTALTQTSSEVQDTELNLVLLS 789
>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 585
Score = 35.2 bits (81), Expect = 0.20
Identities = 16/108 (14%), Positives = 26/108 (24%), Gaps = 8/108 (7%)
Query: 36 LVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQ 95
+++A+ + P + P S +A A V
Sbjct: 355 VIEALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAAN-IPPKEPVRETATPPPVP 413
Query: 96 AAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAH 143
P +A P+P + A P T PP
Sbjct: 414 PRP------VAPPVPHTPESAPKLTRAAIPVDEK-PKYTPPAPPKEEE 454
>gnl|CDD|238828 cd01651, RT_G2_intron, RT_G2_intron: Reverse transcriptases (RTs)
with group II intron origin. RT transcribes DNA using
RNA as template. Proteins in this subfamily are found in
bacterial and mitochondrial group II introns. Their most
probable ancestor was a retrotransposable element with
both gag-like and pol-like genes. This subfamily of
proteins appears to have captured the RT sequences from
transposable elements, which lack long terminal repeats
(LTRs).
Length = 226
Score = 34.5 bits (80), Expect = 0.23
Identities = 18/60 (30%), Positives = 29/60 (48%), Gaps = 2/60 (3%)
Query: 608 SRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPV--LQFLG 665
R +R V Y DDF++ + P+ E +L L LG +N +K+ ++ FLG
Sbjct: 164 FRRLRYVRYADDFVIGVRGPKEAEEIKELIREFLEELGLELNPEKTRITHFKSEGFDFLG 223
>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
Length = 991
Score = 35.0 bits (80), Expect = 0.23
Identities = 32/138 (23%), Positives = 46/138 (33%), Gaps = 7/138 (5%)
Query: 52 PAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPS 111
P + + Q PP + P P A P A +P+A A +A+ A P+
Sbjct: 681 ANTMLPIQWAPGTMQPPPRAPTPMRPPAA--PPGRAQRPAAATGRARPPAAAPGRARPPA 738
Query: 112 SSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASRGQ 171
++ P A A+ P AA P P P + + RG
Sbjct: 739 AAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQ--PPPQAP---PAPQQRPRGA 793
Query: 172 VAQPPPPSESVTPIPLSP 189
PPP T + L P
Sbjct: 794 PTPQPPPQAGPTSMQLMP 811
Score = 30.4 bits (68), Expect = 6.2
Identities = 34/157 (21%), Positives = 49/157 (31%), Gaps = 22/157 (14%)
Query: 51 SPAVSDPGMASGVSDQSPPLSSAPANPVQASA----PVQAAHQPSATVQAAPGSSA---- 102
+ + P A+ + P + A P A+ P AA + AAPG+
Sbjct: 720 ATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPP 779
Query: 103 -SVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHY- 160
AP G P PP Q A P S +P +A P G
Sbjct: 780 PQAPPAPQQRPRGAPTPQPPPQ-----AGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVK 834
Query: 161 -------LQSAMKASRGQVAQPPPPSESVTPIPLSPV 190
+A++ P P S + I +PV
Sbjct: 835 RGRPSLKKPAALERQAAAGPTPSPGSGTSDKIVQAPV 871
>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
Length = 246
Score = 34.3 bits (79), Expect = 0.27
Identities = 15/56 (26%), Positives = 25/56 (44%), Gaps = 2/56 (3%)
Query: 69 PLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQS 124
P+ PA+P + +AP + + + P +S S + P S QP AQ+
Sbjct: 123 PVEPRPASPFENTAPARPTSSTATAAASKPVTSVS--SGPRALSRNQPQYPARAQA 176
>gnl|CDD|113398 pfam04625, DEC-1_N, DEC-1 protein, N-terminal region. The
defective chorion-1 gene (dec-1) in Drosophila encodes
follicle cell proteins necessary for proper eggshell
assembly. Multiple products of the dec-1 gene are formed
by alternative RNA splicing and proteolytic processing.
Cleavage products include S80 (80 kDa) which is
incorporated into the eggshell, and further proteolysis
of S80 gives S60 (60 kDa).
Length = 407
Score = 34.8 bits (79), Expect = 0.28
Identities = 27/87 (31%), Positives = 37/87 (42%), Gaps = 12/87 (13%)
Query: 66 QSPPLSSAP-----ANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQ----- 115
Q P + S P A PV A AP AA P+A AA +A + A P+ GQ
Sbjct: 92 QLPAMPSMPGLLGAAAPVPAPAPAPAAAPPAAPAPAADTPAAPIPDAVQPAILGQAALQN 151
Query: 116 --PFISPPAQSAAFLAQPASTASLPPS 140
F++P A+ L + PP+
Sbjct: 152 AFTFLNPSNFDASSLLGQSVPTFAPPN 178
>gnl|CDD|218181 pfam04621, ETS_PEA3_N, PEA3 subfamily ETS-domain transcription
factor N terminal domain. The N terminus of the PEA3
transcription factors is implicated in transactivation
and in inhibition of DNA binding. Transactivation is
potentiated by activation of the Ras/MAP kinase and
protein kinase A signalling cascades. The N terminal
region contains conserved MAP kinase phosphorylation
sites.
Length = 336
Score = 34.5 bits (79), Expect = 0.29
Identities = 49/237 (20%), Positives = 68/237 (28%), Gaps = 42/237 (17%)
Query: 6 KRDTSDPQPEAKRLCHLEE-KTEKIERCLAQLVQAITARKSSTLDQSPAVSDPGMASGVS 64
KR+ P E C ++ E+CL S D+ PA +
Sbjct: 88 KREPQSPSKELNSSCSQKQPPYPYGEKCLYS---------YSAYDRKPASGFKPPTPPST 138
Query: 65 DQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPS-----SSGQPF-- 117
SP +Q S P+ + PS P + S L P+ S S F
Sbjct: 139 PCSPVNPQETVRQLQPSGPLSNSSPPSPHT---PLPNQSPLPPPMSSPDSSYPSEHRFQR 195
Query: 118 -ISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPP 176
+S P S PP PL + + Y + G Q P
Sbjct: 196 QLSEPCLPFPPPPGRGSRDGRPPYHRQMSEPLVPYPPQGFKQEYHDP-LYEEAGVPNQGP 254
Query: 177 PPSESVTPIPLSPVSSDQE--DFSEEDEVVDCNPPALFSFAPSTKEREPSIPDPDSE 231
P P+ QE DF+ + EV C S+ R E
Sbjct: 255 FP---------HPMMIKQEPRDFTYDSEVPGC---------HSSYGRAEGFYSNRHE 293
>gnl|CDD|216011 pfam00589, Phage_integrase, Phage integrase family. Members of this
family cleave DNA substrates by a series of staggered
cuts, during which the protein becomes covalently linked
to the DNA through a catalytic tyrosine residue at the
carboxy end of the alignment. The catalytic site residues
in CRE recombinase are Arg-173, His-289, Arg-292 and
Tyr-324.
Length = 170
Score = 33.5 bits (77), Expect = 0.31
Identities = 39/169 (23%), Positives = 63/169 (37%), Gaps = 23/169 (13%)
Query: 919 ASRVSAVVPNHFQVSRHVAILLLLSSGRRVHDLTLLSLDPDHFQELDDFVVFWPVFGSKT 978
R+ A + + + LLL +G R+ +L LSL +LD+ + + +KT
Sbjct: 8 VERLLAALEEPLNIRDRALVELLLLTGLRISEL--LSLRWSDI-DLDNGTIR--IGQTKT 62
Query: 979 DSSSHLQSGWKIKENSSDPLFCIPTWIRHLSTLSQRRMGSRPLTSLFITTRGIVQPASRS 1038
S S L + W L R+ LF++ RG +P SRS
Sbjct: 63 -RKSR------TVPLSDAALEALKEW------LGDRKEAEES-EFLFVSRRG--KPLSRS 106
Query: 1039 VIAGWVKTALRGANIIA--SPGSMRSAVATFRYNSDLPLDSILRQGNWR 1085
+ + A + A I +P +R AT + PL I +
Sbjct: 107 TVNRAFRRAGKRAGIEKDLTPHDLRHTFATHLAENGAPLRVIQKLLGHS 155
>gnl|CDD|165527 PHA03269, PHA03269, envelope glycoprotein C; Provisional.
Length = 566
Score = 34.7 bits (79), Expect = 0.33
Identities = 30/103 (29%), Positives = 41/103 (39%), Gaps = 11/103 (10%)
Query: 52 PAVSDPGMASGVSDQS-PPLSSAPANPVQASAPVQAAHQ-----PSATVQAA----PGSS 101
PA + AS D + P S+A P A AP AA + P+ A+ P +
Sbjct: 42 PAPAPHQAASRAPDPAVAPTSAASRKPDLAQAPTPAASEKFDPAPAPHQAASRAPDPAVA 101
Query: 102 ASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHH 144
+ AAP P ++ + F S A S AS P A H
Sbjct: 102 PQLAAAPKPDAA-EAFTSAAQAHEAPADAGTSAASKKPDPAAH 143
>gnl|CDD|236333 PRK08691, PRK08691, DNA polymerase III subunits gamma and tau;
Validated.
Length = 709
Score = 34.7 bits (79), Expect = 0.35
Identities = 29/160 (18%), Positives = 45/160 (28%), Gaps = 18/160 (11%)
Query: 62 GVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAA-----PGSSASVLAAPLPSSSGQP 116
++ PP AP A+ Q + ++Q A P + + + P
Sbjct: 428 QENNDVPPWEDAPDEAQTAAGTAQT---SAKSIQTASEAETPPENQVSKNKAADNETDAP 484
Query: 117 FISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPP 176
P A P A + AH PF YG+ PP
Sbjct: 485 LSEVP-SENPIQATPNDEAVETETFAHEAPAEPF------YGYGFPDNDCPPEDGAEIPP 537
Query: 177 PPSESVTPIPLSPVSSDQE---DFSEEDEVVDCNPPALFS 213
P E P + +D+E + PP +
Sbjct: 538 PDWEHAAPADTAGGGADEEAEAGGIGGNNTPSAPPPEFST 577
Score = 30.8 bits (69), Expect = 5.4
Identities = 36/193 (18%), Positives = 59/193 (30%), Gaps = 16/193 (8%)
Query: 51 SPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLP 110
+P + A+ V + + S + + +A + +P A P +AS A P
Sbjct: 359 APLAAASCDANAVIENTELQSPSAQTAEKETAAKKPQPRPEAETAQTPVQTASAAAMPSE 418
Query: 111 SSSGQPF-------ISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQS 163
+ P + P + A TA + + S
Sbjct: 419 GKTAGPVSNQENNDVPPWEDAPDEAQTAAGTAQTSAKSIQTA------SEAETPPENQVS 472
Query: 164 AMKASRGQVAQPPPPSESVTPIPLSPVSSDQEDFSEEDEVVDCNPPALFSFAPST--KER 221
KA+ + P S PI +P E + E P + F + E
Sbjct: 473 KNKAADNETDAPLSEVPSENPIQATPNDEAVETETFAHE-APAEPFYGYGFPDNDCPPED 531
Query: 222 EPSIPDPDSELAS 234
IP PD E A+
Sbjct: 532 GAEIPPPDWEHAA 544
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 34.2 bits (79), Expect = 0.53
Identities = 39/204 (19%), Positives = 62/204 (30%), Gaps = 37/204 (18%)
Query: 50 QSP-----AVSDPGMASG----------VSDQS-------PPLSSAPANPVQASAPVQAA 87
QSP A + P MASG D + P A
Sbjct: 822 QSPMPLTVACASPEMASGKVWIRYPVVRPQDVQVEEQREAEEVQVQPVVAEVPVAAAVEP 881
Query: 88 HQPSATVQAAPGSSAS--VLAAPLPSSSG-QPFISPPAQSAAFLAQPA----STASLPPS 140
+ V+A V+A P P P +A QP S ++
Sbjct: 882 VVSAPVVEAVAEVVEEPVVVAEPQPEEVVVVETTHPEVIAAPVTEQPQVITESDVAVAQE 941
Query: 141 AAHHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPPPPSESVTPIPLSPVSSDQEDFSEE 200
A H P+ +P ++ A + + VA+P ++ P+ + + + E
Sbjct: 942 VAEHAEPV---VEPQDETADIEEAAETAEVVVAEPEVVAQPAAPVVAEVAAEVETVTAVE 998
Query: 201 DEVVDCNPPA-----LFSFAPSTK 219
EV P + AP T+
Sbjct: 999 PEVAPAQVPEATVEHNHATAPMTR 1022
>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 576
Score = 33.9 bits (78), Expect = 0.57
Identities = 19/80 (23%), Positives = 25/80 (31%), Gaps = 13/80 (16%)
Query: 57 PGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQP 116
P S P + A P A AA A + AP AP P ++
Sbjct: 382 PAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAARPAP--------APAPPAA--- 430
Query: 117 FISPPAQSAAFLAQPASTAS 136
+ P +A A AS
Sbjct: 431 --AAPPARSADPAAAASAGD 448
Score = 32.4 bits (74), Expect = 1.6
Identities = 16/73 (21%), Positives = 26/73 (35%), Gaps = 6/73 (8%)
Query: 70 LSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLA 129
+ P+ A P A P A P AAP ++ +P +P +AA
Sbjct: 381 APAPPSAAWGAPTPAAPAAPPPAAAPPVPP------AAPARPAAARPAPAPAPPAAAAPP 434
Query: 130 QPASTASLPPSAA 142
++ + SA
Sbjct: 435 ARSADPAAAASAG 447
Score = 30.1 bits (68), Expect = 7.7
Identities = 16/83 (19%), Positives = 26/83 (31%), Gaps = 6/83 (7%)
Query: 22 LEEKTEKIERCLAQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQAS 81
L ++ E +ER A + + + + P + PA A
Sbjct: 370 LLDRLEALERG-APAPPSAAWGAPTPAAPAAPPP-----AAAPPVPPAAPARPAAARPAP 423
Query: 82 APVQAAHQPSATVQAAPGSSASV 104
AP A A P ++AS
Sbjct: 424 APAPPAAAAPPARSADPAAAASA 446
>gnl|CDD|234630 PRK00095, mutL, DNA mismatch repair protein; Reviewed.
Length = 617
Score = 33.7 bits (78), Expect = 0.57
Identities = 26/119 (21%), Positives = 41/119 (34%), Gaps = 9/119 (7%)
Query: 35 QLVQAITARKSSTLDQSPAVSDPGMASGVSD----QSPPLSSAPANPVQASAPVQAAHQP 90
+VQAI L QS + A+ V + + PL P +S P +
Sbjct: 316 LIVQAI----QEALAQSGLIPAAAGANQVLEPAEPEPLPLQQTPLYASGSSPPASSPSSA 371
Query: 91 SATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSA-AHHLYPL 148
+S PL ++ Q + A + A A PA+ +A +PL
Sbjct: 372 PPEQSEESQEESSAEKNPLQPNASQSEAAAAASAEAAAAAPAAAPEPAEAAEEADSFPL 430
>gnl|CDD|237782 PRK14666, uvrC, excinuclease ABC subunit C; Provisional.
Length = 694
Score = 33.7 bits (77), Expect = 0.68
Identities = 18/86 (20%), Positives = 29/86 (33%), Gaps = 12/86 (13%)
Query: 56 DPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQ 115
+G+ +P L AP PV A S V P+S
Sbjct: 331 AVCTDAGLLPDTPLLPDAPEGSSDPVVPV-------AAATPVDASLPDVRTGTAPTS--- 380
Query: 116 PFISPPAQSAAFLAQPASTASLPPSA 141
++ + + +AQP A+L +A
Sbjct: 381 --LANVSHADPAVAQPTQAATLAGAA 404
Score = 30.6 bits (69), Expect = 4.9
Identities = 21/86 (24%), Positives = 30/86 (34%), Gaps = 8/86 (9%)
Query: 46 STLDQSP-AVSDPGMASGVSDQSPPLSSAPANPVQASA----PVQAAHQPSATVQAAPGS 100
S+ P A + P AS ++ ++ AN A P QAA A + A
Sbjct: 352 SSDPVVPVAAATPVDASLPDVRTGTAPTSLANVSHADPAVAQPTQAATLAGAAPKGATHL 411
Query: 101 SASVLAAPLPSSSGQPFISPPAQSAA 126
L L G P P ++ A
Sbjct: 412 M---LEETLADLRGGPVRIVPPRNPA 434
>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
Length = 333
Score = 33.3 bits (76), Expect = 0.71
Identities = 20/121 (16%), Positives = 44/121 (36%), Gaps = 5/121 (4%)
Query: 22 LEEKTEKIERCLAQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQAS 81
++E T +I R LA + A + + T + + + A V A
Sbjct: 180 IDEVTREIGRVLAAVGAANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAK 239
Query: 82 APVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSA 141
+ A + + + + A + ++ + A++AA A+ A+ A+ +
Sbjct: 240 KAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKK-----AAKAAAKAAKGAAKATKGKAK 294
Query: 142 A 142
A
Sbjct: 295 A 295
>gnl|CDD|240415 PTZ00429, PTZ00429, beta-adaptin; Provisional.
Length = 746
Score = 33.4 bits (76), Expect = 0.75
Identities = 32/135 (23%), Positives = 44/135 (32%), Gaps = 12/135 (8%)
Query: 49 DQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAP 108
D S P M G D SP S+APA P A GS+ + A P
Sbjct: 617 DAVELPSTPSM--GTQDGSPAPSAAPAGYDIFEFAGDGTGAPH---PVASGSNGAQHADP 671
Query: 109 LPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKAS 168
L + A S AF A S A P S P + + + S +
Sbjct: 672 LGDLFSGLPSTVGASSPAFQAASGSQA--PASP-----PTAASAIEDLFANGMGSGSQTV 724
Query: 169 RGQVAQPPPPSESVT 183
++ P ++ T
Sbjct: 725 PLPISAAPQSADRDT 739
>gnl|CDD|235826 PRK06549, PRK06549, acetyl-CoA carboxylase biotin carboxyl carrier
protein subunit; Validated.
Length = 130
Score = 31.7 bits (72), Expect = 0.79
Identities = 12/47 (25%), Positives = 21/47 (44%), Gaps = 3/47 (6%)
Query: 67 SPPLSSAPANPVQASAPV---QAAHQPSATVQAAPGSSASVLAAPLP 110
+P ++APA P PV + + Q A + A + +P+P
Sbjct: 23 APAQAAAPAQPASTPVPVPTEASPQVEAQAPQPAAAAGADAMPSPMP 69
Score = 30.2 bits (68), Expect = 2.9
Identities = 12/44 (27%), Positives = 16/44 (36%)
Query: 77 PVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISP 120
P QA+AP Q A P A + P ++ SP
Sbjct: 24 PAQAAAPAQPASTPVPVPTEASPQVEAQAPQPAAAAGADAMPSP 67
>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region. The myc family
belongs to the basic helix-loop-helix leucine zipper
class of transcription factors, see pfam00010. Myc forms
a heterodimer with Max, and this complex regulates cell
growth through direct activation of genes involved in
cell replication. Mutations in the C-terminal 20
residues of this domain cause unique changes in the
induction of apoptosis, transformation, and G2 arrest.
Length = 329
Score = 32.6 bits (74), Expect = 1.0
Identities = 31/121 (25%), Positives = 45/121 (37%), Gaps = 22/121 (18%)
Query: 92 ATVQAAP---GSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTAS--LPPSAAHHLY 146
A++QAA S S AP P P SA++L +++AS + PS Y
Sbjct: 134 ASLQAARKEGLPSDSPAPAPSPRGRPHPASGSGRLSASYLQDLSTSASECIDPSVVFP-Y 192
Query: 147 PLPFFCDPSYYGHYLQSAMKASRGQVAQPPP-PSESVTPIPLSPVSSDQEDFSEEDEVVD 205
PL ++ +VA P P P S SD E +E+E +
Sbjct: 193 PLN---------------ERSKSSKVASPTPRLGLRTPPNSSSSSGSDSESEEDEEEEEE 237
Query: 206 C 206
Sbjct: 238 E 238
>gnl|CDD|218108 pfam04487, CITED, CITED. CITED, CBP/p300-interacting
transactivator with ED-rich tail, are characterized by a
conserved 32-amino acid sequence at the C-terminus.
CITED proteins do not bind DNA directly and are thought
to function as transcriptional co-activators.
Length = 206
Score = 32.2 bits (73), Expect = 1.0
Identities = 33/160 (20%), Positives = 43/160 (26%), Gaps = 22/160 (13%)
Query: 59 MASGVSDQSPPLSSAPANPVQASAPVQAAHQPSA---TVQAAPGSSASVLAAPLPSSSGQ 115
MA A +AP H A T+ PG P + SG
Sbjct: 7 MAMNHGRFPDGAKGARMGMNSLAAPNHGPHDLRALMGTLLHYPGGGMDASGRPRSAMSGP 66
Query: 116 PFISPPAQSAAFLAQPASTASLPP---------SAAHHLYPLPFFCDPSYYGHYLQSAMK 166
P QS ++ P A+ L L + Y GH A
Sbjct: 67 MGGGHPHQSMPAYMMFNPSSKPQPFMLVPGPQLMASMQLQKL----NTQYQGHAGAPAGH 122
Query: 167 ASRGQVAQP------PPPSESVTPIPLSPVSSDQEDFSEE 200
G Q PP + + L P D + EE
Sbjct: 123 PGGGGPQQFRPGAGQPPGMQHMPAPALPPNVIDTDLIDEE 162
>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
Length = 943
Score = 33.1 bits (75), Expect = 1.1
Identities = 37/200 (18%), Positives = 54/200 (27%), Gaps = 27/200 (13%)
Query: 74 PANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPAS 133
P P + P A P + P S P PP Q + +P
Sbjct: 591 PEEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEG 650
Query: 134 TASLPPSAAHHLYPLPFFCDPSY----YGHYLQSAMKASRGQVAQPPPPSESVTPIPLSP 189
+ PF DP + Y YL +A K+ E+ T + L
Sbjct: 651 PKIIKSPKPPKSPKPPF--DPKFKEKFYDDYLDAAAKS-----------KETKTTVVLD- 696
Query: 190 VSSDQEDFSEEDEVVDCNPPALFSFAPSTKEREPSIP-----DPDSELASQGVVCQKLGS 244
S + E P P R+ P DPD+E +
Sbjct: 697 -ESFESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGDPDAEQPDD---IEFFTP 752
Query: 245 PAWNRVRYTESQRALQAGGV 264
P R + E+ +
Sbjct: 753 PEEERTFFHETPADTPLPDI 772
>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
Length = 141
Score = 31.3 bits (71), Expect = 1.1
Identities = 17/61 (27%), Positives = 21/61 (34%), Gaps = 1/61 (1%)
Query: 82 APVQAAHQPS-ATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPS 140
AP+ P T A + +V P S+ P A A A P S PP
Sbjct: 80 APLTPPAPPEPVTPPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPPPSDPPQPPR 139
Query: 141 A 141
A
Sbjct: 140 A 140
>gnl|CDD|182649 PRK10689, PRK10689, transcription-repair coupling factor;
Provisional.
Length = 1147
Score = 32.8 bits (75), Expect = 1.1
Identities = 20/50 (40%), Positives = 32/50 (64%), Gaps = 3/50 (6%)
Query: 1046 TALRGANIIASPGSMRSAVATF--RYNSDLPLDSILRQGNWRGGQNFFNY 1093
+ +R +IIA+P + R AV TF Y+S + ++ILR+ RGGQ ++ Y
Sbjct: 768 SGMRDLSIIATPPARRLAVKTFVREYDSLVVREAILRE-ILRGGQVYYLY 816
>gnl|CDD|216078 pfam00716, Peptidase_S21, Assemblin (Peptidase family S21).
Length = 326
Score = 32.4 bits (74), Expect = 1.2
Identities = 21/97 (21%), Positives = 27/97 (27%), Gaps = 17/97 (17%)
Query: 58 GMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAP--------- 108
G GVS + P A V S + P T A + ++ AA
Sbjct: 228 GNDRGVSPGTAPSFDA-TPSVSPSGQPLSPAAPPGTSSVAGTALSASPAALFGDMVYVPL 286
Query: 109 -------LPSSSGQPFISPPAQSAAFLAQPASTASLP 138
+ QP A LA PA P
Sbjct: 287 DAYNQLLAGQAFNQPPDPQGPAPPAELAPPAPAPPPP 323
>gnl|CDD|183558 PRK12495, PRK12495, hypothetical protein; Provisional.
Length = 226
Score = 32.1 bits (73), Expect = 1.2
Identities = 19/92 (20%), Positives = 34/92 (36%), Gaps = 1/92 (1%)
Query: 53 AVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSS 112
A SD G + D + P + A A Q++ P ++ + P ++A+ P P
Sbjct: 86 APSDAGSQASPDDDAQPAAEAEAAD-QSAPPEASSTSATDEAATDPPATAAARDGPTPDP 144
Query: 113 SGQPFISPPAQSAAFLAQPASTASLPPSAAHH 144
+ QP +S + P + H
Sbjct: 145 TAQPATPDERRSPRQRPPVSGEPPTPSTPDAH 176
>gnl|CDD|187703 cd09279, RNase_HI_archaeal_like, RNAse HI family that includes
Archaeal RNase HI. Ribonuclease H (RNase H) is
classified into two evolutionarily unrelated families,
type 1 (prokaryotic RNase HI, eukaryotic RNase H1 and
viral RNase H) and type 2 (prokaryotic RNase HII and
HIII, and eukaryotic RNase H2). RNase H is an
endonuclease that cleaves the RNA strand of an RNA/DNA
hybrid in a sequence non-specific manner. RNase H is
involved in DNA replication, repair and transcription.
RNase H is widely present in various organisms,
including bacteria, archaea and eukaryotes and most
prokaryotic and eukaryotic genomes contain multiple
RNase H genes. Despite the lack of amino acid sequence
homology, Type 1 and type 2 RNase H share a main-chain
fold and steric configurations of the four acidic
active-site (DEDD) residues and have the same catalytic
mechanism and functions in cells. One of the important
functions of RNase H is to remove Okazaki fragments
during DNA replication. Most archaeal genomes contain
only type 2 RNase H (RNase HII); however, a few contain
RNase HI as well. Although archaeal RNase HI sequences
conserve the DEDD active-site motif, they lack other
common features important for catalytic function, such
as the basic protrusion region. Archaeal RNase HI
homologs are more closely related to retroviral RNase HI
than bacterial and eukaryotic type I RNase H in
enzymatic properties.
Length = 128
Score = 30.9 bits (71), Expect = 1.4
Identities = 15/66 (22%), Positives = 32/66 (48%), Gaps = 1/66 (1%)
Query: 824 LQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSV 883
L + + D+Q VV+ ++ + K+ L +E+ L + + + ++IP N
Sbjct: 60 LGIKKLEIYGDSQLVVNQIQGEYEVKNERLAPYLEEARELLKKFEEVEI-KWIPREENKE 118
Query: 884 ADSLSR 889
AD+L+
Sbjct: 119 ADALAN 124
>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
subunit 62; Provisional.
Length = 576
Score = 32.6 bits (74), Expect = 1.4
Identities = 31/125 (24%), Positives = 41/125 (32%), Gaps = 13/125 (10%)
Query: 65 DQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQS 124
D PP S +P P S V + A AP ++A+ AAP P++ P
Sbjct: 446 DLKPPTSPSPTAPTGVSPSVSSTSSVPAVPDTAPATAATDAAAPPPAN------MRPLSP 499
Query: 125 AAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPPPPSESVTP 184
A S P+A + G+ A AQP P
Sbjct: 500 YAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALADEQHHAQPKPR------ 553
Query: 185 IPLSP 189
PLSP
Sbjct: 554 -PLSP 557
>gnl|CDD|153413 cd07959, Anticodon_Ia_Leu_AEc, Anticodon-binding domain of archaeal
and eukaryotic cytoplasmic leucyl tRNA synthetases.
This domain is found in leucyl tRNA synthetases (LeuRS),
which belong to the class Ia aminoacyl tRNA synthetases.
It lies C-terminal to the catalytic core domain. In
contrast to other class Ia enzymes, the anticodon is not
used as an identity element in LeuRS (with exceptions
such as Saccharomyces cerevisiae and some other
eukaryotes). No anticodon-binding site can be defined
for this family, which includes archaeal and eukaryotic
cytoplasmic members. LeuRS catalyzes the transfer of
leucine to the 3'-end of its tRNA.
Length = 117
Score = 30.6 bits (70), Expect = 1.5
Identities = 14/27 (51%), Positives = 17/27 (62%), Gaps = 1/27 (3%)
Query: 416 RIGAELVGGRLRRFVDAWIRLGAP-AP 441
R GA + LRRF++ W RL AP AP
Sbjct: 81 RGGAGMNKDLLRRFIEVWTRLLAPFAP 107
>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
Length = 401
Score = 32.2 bits (73), Expect = 1.5
Identities = 19/78 (24%), Positives = 28/78 (35%), Gaps = 4/78 (5%)
Query: 67 SPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAA 126
P A+P P PS T+ A + A+ A P + G P P A
Sbjct: 208 PRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPGGGEA 267
Query: 127 FLAQPASTASLPPSAAHH 144
P + A+ P A+ +
Sbjct: 268 ----PPANATPAPEASRY 281
>gnl|CDD|173156 PRK14693, PRK14693, hypothetical protein; Provisional.
Length = 552
Score = 32.4 bits (73), Expect = 1.5
Identities = 29/96 (30%), Positives = 42/96 (43%), Gaps = 15/96 (15%)
Query: 39 AITARKSSTLDQSPAVSDPGMASGVSDQSPPLSS-------APANPVQASAPVQ-AAHQP 90
A+T+ ST + V+ G A+G +PP + AP NP PV A P
Sbjct: 25 AMTSSAVSTTGTTNTVASGGAAAGAPSFAPPAADAFPQADGAPRNPAVLQFPVPGGAPAP 84
Query: 91 SATVQAAP------GSSASVLAAPLPSSS-GQPFIS 119
+ AAP G A+++ A L +S PF+S
Sbjct: 85 ADARVAAPVVYSAQGEQAAIMKAGLQQASWNNPFVS 120
>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
Length = 475
Score = 32.2 bits (73), Expect = 1.6
Identities = 23/106 (21%), Positives = 38/106 (35%), Gaps = 16/106 (15%)
Query: 62 GVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPP 121
G + P + P+ + P A AT AAP ++A P +P
Sbjct: 10 GKGEAEQPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKK---DKPRRERK 66
Query: 122 AQSAA------FLAQPASTAS------LPPSAAHHLYPLPF-FCDP 154
+ A+ F+ +P + L P H ++ L F +C P
Sbjct: 67 PKPASLWKLEDFVVEPQEGKTRFHDFNLAPELMHAIHDLGFPYCTP 112
>gnl|CDD|234022 TIGR02813, omega_3_PfaA, polyketide-type polyunsaturated fatty acid
synthase PfaA. Members of the seed for this alignment
are involved in omega-3 polyunsaturated fatty acid
biosynthesis, such as the protein PfaA from the
eicosapentaenoic acid biosynthesis operon in
Photobacterium profundum strain SS9. PfaA is encoded
together with PfaB, PfaC, and PfaD, and the functions of
the individual polypeptides have not yet been described.
More distant homologs of PfaA, also included with the
reach of this model, appear to be involved in
polyketide-like biosynthetic mechanisms of
polyunsaturated fatty acid biosynthesis, an alternative
to the more familiar iterated mechanism of chain
extension and desaturation, and in most cases are encoded
near genes for homologs of PfaB, PfaC, and/or PfaD.
Length = 2582
Score = 32.7 bits (74), Expect = 1.7
Identities = 19/90 (21%), Positives = 29/90 (32%), Gaps = 3/90 (3%)
Query: 50 QSPAVSDPGMASGVSD-QSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAP 108
Q+P + + V +AP P S PV +A + A A V P
Sbjct: 1125 QAPVIKSVVTQAPVVQVTISVAPAAPVLPAVVSPPVVSAAPAQSVATAVA--MAPVAEVP 1182
Query: 109 LPSSSGQPFISPPAQSAAFLAQPASTASLP 138
+ Q P+ + A Q + S
Sbjct: 1183 IAVPVQQSVDYMPSVAQAAAPQASVNDSAI 1212
>gnl|CDD|220840 pfam10667, DUF2486, Protein of unknown function (DUF2486). This
family is made up of members from various Burkholderia
spp. The function is unknown.
Length = 245
Score = 31.8 bits (72), Expect = 1.8
Identities = 20/82 (24%), Positives = 27/82 (32%)
Query: 63 VSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPA 122
V P + P + + P A H + A S LAA P + + A
Sbjct: 78 VPTPHVPAVALPGDTDAPAEPGAAPHVVAERAAAMQAPLPSALAADDPQAPPAGATAADA 137
Query: 123 QSAAFLAQPASTASLPPSAAHH 144
AA A P + P AA
Sbjct: 138 GDAAPDATPPAAGDASPPAAAQ 159
>gnl|CDD|233366 TIGR01348, PDHac_trf_long, pyruvate dehydrogenase complex
dihydrolipoamide acetyltransferase, long form. This
model describes a subset of pyruvate dehydrogenase
complex dihydrolipoamide acetyltransferase specifically
close by both phylogenetic and per cent identity (UPGMA)
trees. Members of this set include two or three copies
of the lipoyl-binding domain. E. coli AceF is a member
of this model, while mitochondrial and some other
bacterial forms belong to a separate model [Energy
metabolism, Pyruvate dehydrogenase].
Length = 546
Score = 32.2 bits (73), Expect = 1.8
Identities = 19/63 (30%), Positives = 28/63 (44%), Gaps = 3/63 (4%)
Query: 63 VSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPA 122
V+ +P + APA Q +A AA QP AAP ++ + AP + + P A
Sbjct: 191 VAGSTPATAPAPA-SAQPAAQSPAATQPEPA--AAPAAAKAQAPAPQQAGTQNPAKVDHA 247
Query: 123 QSA 125
A
Sbjct: 248 APA 250
Score = 30.2 bits (68), Expect = 7.0
Identities = 19/49 (38%), Positives = 24/49 (48%), Gaps = 1/49 (2%)
Query: 51 SPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQ-AAP 98
+PA + P S + Q P ++ A QA AP QA Q A V AAP
Sbjct: 201 APASAQPAAQSPAATQPEPAAAPAAAKAQAPAPQQAGTQNPAKVDHAAP 249
>gnl|CDD|237082 PRK12373, PRK12373, NADH dehydrogenase subunit E; Provisional.
Length = 400
Score = 31.7 bits (72), Expect = 2.1
Identities = 23/81 (28%), Positives = 31/81 (38%), Gaps = 2/81 (2%)
Query: 57 PGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQP 116
P +A D +P S A P A A AA + AT A + + A SG
Sbjct: 229 PLLAPWQGDAAPVPPSEAARPKSADAETNAALKTPATAPKAA--AKNAKAPEAQPVSGTA 286
Query: 117 FISPPAQSAAFLAQPASTASL 137
P + AA A A+ +L
Sbjct: 287 AAEPAPKEAAKAAAAAAKPAL 307
Score = 30.9 bits (70), Expect = 3.6
Identities = 30/145 (20%), Positives = 44/145 (30%), Gaps = 10/145 (6%)
Query: 7 RDTSDPQPEAKRLCHLEEKTEK----IERCLAQLVQAITARKSST--LDQSPAVSDPGMA 60
R S+P A L L E+ K + LA+ + R T +P D
Sbjct: 185 RYASEP---AGGLTSLTEEAGKARYNASKALAEDIGDTVKRIDGTEVPLLAPWQGDAAPV 241
Query: 61 SGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISP 120
S+ + P S+ P A + +A S AA P+ +
Sbjct: 242 PP-SEAARPKSADAETNAALKTPATAPKAAAKNAKAPEAQPVSGTAAAEPAPKEAAKAAA 300
Query: 121 PAQSAAFLAQPASTASLPPSAAHHL 145
A A +P P A L
Sbjct: 301 AAAKPALEDKPRPLGIARPGGADDL 325
>gnl|CDD|237182 PRK12727, PRK12727, flagellar biosynthesis regulator FlhF;
Provisional.
Length = 559
Score = 31.9 bits (72), Expect = 2.1
Identities = 23/146 (15%), Positives = 37/146 (25%), Gaps = 14/146 (9%)
Query: 71 SSAPANPVQASAPVQAAHQPS-ATVQAAPGSSASVLAAP---LPSSSGQPF----ISPPA 122
+ A+APV+AA PS A A ++ L + Q F + P
Sbjct: 117 PVSVPRQAPAAAPVRAASIPSPAAQALAHAAAVRTAPRQEHALSAVPEQLFADFLTTAPV 176
Query: 123 QSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPPPPSESV 182
A A + + P+ A L + + P
Sbjct: 177 PRAPVQAPVVAAPAPVPAIAA---ALAAHAAY---AQDDDEQLDDDGFDLDDALPQILPP 230
Query: 183 TPIPLSPVSSDQEDFSEEDEVVDCNP 208
+P V+ P
Sbjct: 231 AALPPIVVAPAAPAALAAVAAAAPAP 256
>gnl|CDD|178035 PLN02414, PLN02414, glycine dehydrogenase (decarboxylating).
Length = 993
Score = 32.0 bits (73), Expect = 2.2
Identities = 19/52 (36%), Positives = 23/52 (44%), Gaps = 5/52 (9%)
Query: 57 PGMAS-GVSDQSPPLSSAPANPV--QASAPVQAAHQPSATVQAAPGSSASVL 105
PGM GV P P++PV P QP T+ AAP SA +L
Sbjct: 740 PGMGPIGVKKHLAPF--LPSHPVVPTGGIPRPEKTQPLGTISAAPWGSALIL 789
>gnl|CDD|233787 TIGR02223, ftsN, cell division protein FtsN. FtsN is a poorly
conserved protein active in cell division in a number of
Proteobacteria. The N-terminal 30 residue region tends
to by Lys/Arg-rich, and is followed by a
membrane-spanning region. This is followed by an acidic
low-complexity region of variable length and a
well-conserved C-terminal domain of two tandem regions
matched by pfam05036 (Sporulation related repeat), found
in several cell division and sporulation proteins. The
role of FtsN as a suppressor for other cell division
mutations is poorly understood; it may involve cell wall
hydrolysis [Cellular processes, Cell division].
Length = 298
Score = 31.6 bits (71), Expect = 2.3
Identities = 22/132 (16%), Positives = 40/132 (30%), Gaps = 12/132 (9%)
Query: 4 NPKRDTSDPQPEAKRLCHLEEKTEKIERCLAQLVQAITARKSSTLDQSPAVSDPGMASGV 63
N + S+ + E+ + +E+ A + A ++ +Q+ AV
Sbjct: 97 NDPEEPSNGGGVEESAQLTAEQRQLLEQMQADMRAAEKVLATAPSEQTVAVEAR------ 150
Query: 64 SDQSPPLSSAPANPVQASA---PVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISP 120
P A A PV+ S +A A S+ +P +
Sbjct: 151 ---KQTAEKKPQKARTAEAQKTPVETEKIASKVKEAKQKQKALPKQTAETQSNSKPIETA 207
Query: 121 PAQSAAFLAQPA 132
P A +P
Sbjct: 208 PKADKADKTKPK 219
>gnl|CDD|237537 PRK13875, PRK13875, conjugal transfer protein TrbL; Provisional.
Length = 440
Score = 31.4 bits (72), Expect = 2.7
Identities = 16/66 (24%), Positives = 30/66 (45%)
Query: 57 PGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQP 116
G+A+G+ + +SA A+P++ +A A S+ A + A +++G
Sbjct: 335 AGVAAGLGGVARAGASAAASPLRRAASRAAESMKSSFRAGARSTGGGAGGAAAAAAAGAA 394
Query: 117 FISPPA 122
PPA
Sbjct: 395 AAGPPA 400
>gnl|CDD|238231 cd00397, DNA_BRE_C, DNA breaking-rejoining enzymes, C-terminal
catalytic domain. The DNA breaking-rejoining enzyme
superfamily includes type IB topoisomerases and tyrosine
recombinases that share the same fold in their catalytic
domain containing six conserved active site residues. The
best-studied members of this diverse superfamily include
human topoisomerase I, the bacteriophage lambda
integrase, the bacteriophage P1 Cre recombinase, the
yeast Flp recombinase and the bacterial XerD/C
recombinases. Their overall reaction mechanism is
essentially identical and involves cleavage of a single
strand of a DNA duplex by nucleophilic attack of a
conserved tyrosine to give a 3' phosphotyrosyl
protein-DNA adduct. In the second rejoining step, a
terminal 5' hydroxyl attacks the covalent adduct to
release the enzyme and generate duplex DNA. The enzymes
differ in that topoisomerases cleave and then rejoin the
same 5' and 3' termini, whereas a site-specific
recombinase transfers a 5' hydroxyl generated by
recombinase cleavage to a new 3' phosphate partner
located in a different duplex region. Many DNA
breaking-rejoining enzymes also have N-terminal domains,
which show little sequence or structure similarity.
Length = 164
Score = 30.5 bits (69), Expect = 2.8
Identities = 37/167 (22%), Positives = 60/167 (35%), Gaps = 30/167 (17%)
Query: 935 HVAILLLLSSGRRVHDLTLLSLDPDHFQELDDFVVFWPVFGSKTDSSSHLQSGWKIKENS 994
++A+LLLL++G R+ +L +L LD V + G+KT
Sbjct: 18 YLALLLLLATGLRISEL--CALRWSDID-LDKRV--IHITGTKT-KKERT---------- 61
Query: 995 SDPLFCIPTWIRHLSTLSQ--RRMGSRPLTS--LFITTRGIVQP-ASRSVIAGWVKTALR 1049
+P L L + ++ LF + RG SR + K A R
Sbjct: 62 ------VPLSEEALKLLKEYLKKRRPANGDEEYLFPSRRGGPAATLSRRNVKRIFKAAGR 115
Query: 1050 GANII---ASPGSMRSAVATFRYNSDLPLDSILRQGNWRGGQNFFNY 1093
A I +P S+R A+ N+ L L+++ Y
Sbjct: 116 RAGIDKEGLTPHSLRHTFASALLNAGLDLEAVQDLLGHSSIAMTMRY 162
>gnl|CDD|233367 TIGR01349, PDHac_trf_mito, pyruvate dehydrogenase complex
dihydrolipoamide acetyltransferase, long form. This
model represents one of several closely related clades
of the dihydrolipoamide acetyltransferase subunit of the
pyruvate dehydrogenase complex. It includes sequences
from mitochondria and from alpha and beta branches of
the proteobacteria, as well as from some other bacteria.
Sequences from Gram-positive bacteria are not included.
The non-enzymatic homolog protein X, which serves as an
E3 component binding protein, falls within the clade
phylogenetically but is rejected by its low score
[Energy metabolism, Pyruvate dehydrogenase].
Length = 436
Score = 31.3 bits (71), Expect = 2.8
Identities = 17/66 (25%), Positives = 27/66 (40%), Gaps = 5/66 (7%)
Query: 49 DQSPAVSDPGMASGVSDQSPPLSSAPANPVQAS----APVQAAHQPSATVQAAPGSSAS- 103
D + A + + S S P AP P A AP + + +PS+ + S
Sbjct: 80 DVADAFKNYKLESSASPAPKPSEIAPTAPPSAPKPSPAPQKQSPEPSSPAPLSDKESGDR 139
Query: 104 VLAAPL 109
+ A+PL
Sbjct: 140 IFASPL 145
>gnl|CDD|177614 PHA03377, PHA03377, EBNA-3C; Provisional.
Length = 1000
Score = 31.6 bits (71), Expect = 2.9
Identities = 42/182 (23%), Positives = 61/182 (33%), Gaps = 22/182 (12%)
Query: 64 SDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPL---PSSSGQPFISP 120
SD+ PP +S P ++ P A + AP S+ A P +SG P
Sbjct: 558 SDRGPPKASPPVMAPPSTGPRVMATPSTGPRDMAPPSTGPRQQAKCKDGPPASGPHEKQP 617
Query: 121 PAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYY--------GHYLQSAMKASRGQV 172
P+ + +A L L P + G +Q + R
Sbjct: 618 PSSAPRDMAPSVVRMFL----RERLLEQSTGPKPKSFWEMRAGRDGSGIQQEPSSRRQPA 673
Query: 173 AQPPPPSESVTP--IPLSPVSSDQEDFSEEDEVVDCNPPALFSFAPSTKEREPSIPDPDS 230
Q PP S P L V + + SEE + +P P + E +P DPD
Sbjct: 674 TQSTPPRPSWLPSVFVLPSVDAGRAQPSEESHLSSMSPT-----QPISHEEQPRYEDPDD 728
Query: 231 EL 232
L
Sbjct: 729 PL 730
>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein.
Length = 753
Score = 31.7 bits (72), Expect = 2.9
Identities = 21/98 (21%), Positives = 27/98 (27%), Gaps = 2/98 (2%)
Query: 52 PAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPS 111
P + G A G P PA+ + S + AA A P
Sbjct: 646 PDYNRGGEAGGPGVPGPVPVGMPAHTARPSRVARGDPVRPTAHHAAL--RAPQAPRPGGP 703
Query: 112 SSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLP 149
G + PP A +SL S P P
Sbjct: 704 PGGGGGLPPPPDLPAAAGPAPCGSSLIASPTAPPEPEP 741
>gnl|CDD|239981 cd04608, CBS_pair_PALP_assoc, This cd contains two tandem repeats
of the cystathionine beta-synthase (CBS pair) domains
associated with the pyridoxal-phosphate (PALP)
dependent enzyme domain upstream. The vitamin B6
complex comprises pyridoxine, pyridoxal, and
pyridoxamine, as well as the 5'-phosphate esters of
pyridoxal (PALP) and pyridoxamine, the last two being
the biologically active coenzyme derivatives. The
members of the PALP family are principally involved in
the biosynthesis of amino acids and amino acid-derived
metabolites, but they are also found in the
biosynthetic pathways of amino sugars and other
amine-containing compounds. CBS is a small domain
originally identified in cystathionine beta-synthase
and subsequently found in a wide range of different
proteins. CBS domains usually come in tandem repeats,
which associate to form a so-called Bateman domain or a
CBS pair which is reflected in this model. The
interface between the two CBS domains forms a cleft
that is a potential ligand binding site. The CBS pair
coexists with a variety of other functional domains.
It has been proposed that the CBS domain may play a
regulatory role, although its exact function is
unknown.
Length = 124
Score = 29.6 bits (67), Expect = 3.3
Identities = 18/50 (36%), Positives = 22/50 (44%)
Query: 38 QAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAA 87
+AI K DQ P V + G G+ LSS + VQ S PV A
Sbjct: 15 EAIEILKEKGFDQLPVVDESGKILGMVTLGNLLSSLSSGKVQPSDPVSKA 64
>gnl|CDD|217495 pfam03326, Herpes_TAF50, Herpesvirus transcription activation
factor (transactivator). This family includes EBV BRLF1
and similar ORF 50 proteins from other herpesviruses.
Length = 500
Score = 31.3 bits (71), Expect = 3.3
Identities = 24/179 (13%), Positives = 40/179 (22%), Gaps = 20/179 (11%)
Query: 35 QLVQAITARKSSTLDQSPAVSD------------PGMASGVSDQSPPLSSAPANPVQASA 82
Q +ST S P A + P VQ+
Sbjct: 240 QSESFSDVWPASTQSPREETSAEPLAPASPSSRRPSTAQEEQIACSSPQAEPEQGVQSYV 299
Query: 83 PVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAA 142
P + +PS AP ++ P + + P + A + +
Sbjct: 300 PQSSDSRPSCFP--APSTTQPTFLPPNTNKKAKRDRRPQMVTPKQEGGAAVSQNHDGGTV 357
Query: 143 HHLYPLPFFCDPSY-YGHYLQSAMKASRGQ-----VAQPPPPSESVTPIPLSPVSSDQE 195
P S L S++ + + PP S
Sbjct: 358 RAPRGRPSGSGQSPPSNSPLLSSLADTPSGAAHQPASLLPPAVVQQQLEDASDKQPPTP 416
Score = 31.3 bits (71), Expect = 3.6
Identities = 29/167 (17%), Positives = 48/167 (28%), Gaps = 13/167 (7%)
Query: 22 LEEKTEKIERCLAQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQAS 81
LE + + + + S L SP+ + P + SD P + +P + S
Sbjct: 203 LETRASPLSQFHGFTPHPSLPQPQSPLKPSPSSARPQQSESFSDVWPASTQSPRE--ETS 260
Query: 82 APVQAAHQPSATVQAAP--GSSASVLAAPLPSSSGQPFISPPAQSAAFLAQ-PASTASLP 138
A A PS+ + A P Q ++ P S + + PA + + P
Sbjct: 261 AEPLAPASPSSRRPSTAQEEQIACSSPQAEPEQGVQSYV--PQSSDSRPSCFPAPSTTQP 318
Query: 139 PSAAHHLYPLPFFCDPSYYGHYLQSAMKA-SRGQVAQPPPPSESVTP 184
P K V+Q P
Sbjct: 319 T-----FLPPNTNKKAKRDRRPQMVTPKQEGGAAVSQNHDGGTVRAP 360
Score = 30.1 bits (68), Expect = 8.2
Identities = 23/79 (29%), Positives = 31/79 (39%), Gaps = 5/79 (6%)
Query: 64 SDQSPPLSSAPANPVQASAPVQAAHQPSATVQAA----PGSSASVLAAPLPSSSGQPFIS 119
S QSPP S++P A P AAHQP++ + A AS P P SS P
Sbjct: 367 SGQSPP-SNSPLLSSLADTPSGAAHQPASLLPPAVVQQQLEDASDKQPPTPGSSLVPQPD 425
Query: 120 PPAQSAAFLAQPASTASLP 138
+ +A +
Sbjct: 426 EQELGPSVMALLDRDQGIL 444
>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein.
Length = 582
Score = 31.4 bits (71), Expect = 3.3
Identities = 20/85 (23%), Positives = 30/85 (35%), Gaps = 1/85 (1%)
Query: 58 GMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPF 117
G + Q PP SS P P S+ + + + + S L P P+ P
Sbjct: 179 GGNIPGAIQPPPPSSLPGLPP-GSSSLAPSASSTPGNRLPRVSFNPFLPGPSPAQPSAPP 237
Query: 118 ISPPAQSAAFLAQPASTASLPPSAA 142
S PA + Q + +PP
Sbjct: 238 ASIPAPPIPPVIQYVAPPPVPPPQP 262
>gnl|CDD|234994 PRK01973, PRK01973, septum formation inhibitor; Reviewed.
Length = 271
Score = 30.8 bits (70), Expect = 3.3
Identities = 16/67 (23%), Positives = 26/67 (38%), Gaps = 9/67 (13%)
Query: 58 GMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPL-------- 109
A+ +D++ ++ A P A+A AA P+A P + + A
Sbjct: 108 APAAKAADEAAAAAAEAAAPAAAAAAEAAAAAPAAAAAPEPPPAPAPEAVAAQSQTLVID 167
Query: 110 -PSSSGQ 115
P SGQ
Sbjct: 168 RPLRSGQ 174
>gnl|CDD|237001 PRK11856, PRK11856, branched-chain alpha-keto acid dehydrogenase
subunit E2; Reviewed.
Length = 411
Score = 31.3 bits (72), Expect = 3.4
Identities = 13/50 (26%), Positives = 21/50 (42%)
Query: 73 APANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPA 122
A A +AP A +P+ AA ++ + AAP ++ SP
Sbjct: 82 AEAAAAAEAAPEAPAPEPAPAAAAAAAAAPAAAAAPAAPAAAAAKASPAV 131
>gnl|CDD|225950 COG3416, COG3416, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 233
Score = 30.6 bits (69), Expect = 3.5
Identities = 28/117 (23%), Positives = 42/117 (35%), Gaps = 16/117 (13%)
Query: 16 AKRLCHLEEKTEKIERCLAQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPA 75
A+R+ E+ +K + +L + I + L A S S P PA
Sbjct: 47 AQRVLIQEQALKKASTQIKELEKRI-----AILQAGEAGS-GSFLSNAFKWGTP--QEPA 98
Query: 76 NPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPS---SSGQPFISPPAQSAAFLA 129
P A P + A PS S A P P+ + F+ Q+AA +A
Sbjct: 99 PPANAPPPKEPAAPPS-----WRSSPAGPTTQPSPAAANTRSSSFLGGALQTAAGVA 150
>gnl|CDD|178806 PRK00030, minC, septum formation inhibitor; Provisional.
Length = 292
Score = 30.8 bits (69), Expect = 3.6
Identities = 20/85 (23%), Positives = 31/85 (36%), Gaps = 15/85 (17%)
Query: 163 SAMKASRGQVAQPPPPSESVTPIPLSPVSS-----------DQEDFSEEDEVVDCNPPAL 211
S A QV PP++ TP+P P ++ D+ D + DE NP ++
Sbjct: 101 STPVARAPQVIDTAPPNDVATPVPSVPEATAEAAAKAGPQDDEADGEQADEAPAHNPESV 160
Query: 212 FSFA----PSTKEREPSIPDPDSEL 232
+ A + P S L
Sbjct: 161 PTRAARETTEANRPTATPPQSSSAL 185
Score = 29.7 bits (66), Expect = 8.6
Identities = 17/75 (22%), Positives = 27/75 (36%), Gaps = 4/75 (5%)
Query: 68 PPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAF 127
P A P + PV + P AT +AA + A + P +P +
Sbjct: 107 APQVIDTAPPNDVATPVPSV--PEATAEAAAKAGPQDDEADGEQADEAPAHNPESVPTR- 163
Query: 128 LAQPASTASLPPSAA 142
A +T + P+A
Sbjct: 164 -AARETTEANRPTAT 177
>gnl|CDD|227361 COG5028, COG5028, Vesicle coat complex COPII, subunit SEC24/subunit
SFB2/subunit SFB3 [Intracellular trafficking and
secretion].
Length = 861
Score = 31.3 bits (71), Expect = 3.6
Identities = 19/154 (12%), Positives = 37/154 (24%), Gaps = 9/154 (5%)
Query: 42 ARKSSTLDQSPAVSDPGMASGVSDQSPPLS--SAPANPVQASAPVQAAHQPSATVQAAPG 99
+ QS + + + + SA + Q + A
Sbjct: 5 KKGVYPQAQSQVHTGAASSKKSARPHRAYANFSAGQMGMPPYTTPPLQQQSRRQIDQAAT 64
Query: 100 SSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGH 159
+ + A S P + ++ + + P + L P+ F D
Sbjct: 65 AMHNTGANNPAPSVMSPAFQSQQKFSSPYGGSMADGTAPK-PTNPLVPVDLFEDQPPPIS 123
Query: 160 YLQSAMKASRGQVAQPPPPSESVTPIPLSPVSSD 193
L PP + + S S
Sbjct: 124 DLFLPPP------PIVPPLTTNFVGSEQSNCSPK 151
>gnl|CDD|223400 COG0323, MutL, DNA mismatch repair enzyme (predicted ATPase) [DNA
replication, recombination, and repair].
Length = 638
Score = 31.2 bits (71), Expect = 3.7
Identities = 30/195 (15%), Positives = 54/195 (27%), Gaps = 24/195 (12%)
Query: 85 QAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHH 144
Q P A+V+A +S + A PS + I + + ++ + + P++
Sbjct: 332 QQGLIPPASVEAPKSASQPLPAFQEPSPLPESRIQKSKVAKSGSSKSDAPSIAEPASGA- 390
Query: 145 LYPLPFFCDPSYYGHYLQSAMKASRGQVAQPPPPSESVTPIPLSPVSSDQEDFSEEDEVV 204
P P + + P + E + ++V
Sbjct: 391 -SPSPASPSIRPLSKNILPESSPGSLKNEDRSYDDLLEEPAESEDKQEEAEQKAISEDV- 448
Query: 205 DCNPPALFSFAPSTK-----EREPSIPDPDSELASQGVVCQKLGSPAWNRVRYTESQRAL 259
P + E E + L Q R+ Y + + L
Sbjct: 449 ---FPLGEAIGQVHGTYILAEHEDGL-----VLVDQHAAH--------ERILYEKLKNEL 492
Query: 260 QAGGVFQPLAIPSEF 274
G QPL IP
Sbjct: 493 GNVGELQPLLIPIRL 507
>gnl|CDD|218601 pfam05477, SURF2, Surfeit locus protein 2 (SURF2). Surfeit locus
protein 2 is part of a group of at least six sequence
unrelated genes (Surf-1 to Surf-6). The six Surfeit
genes have been classified as housekeeping genes, being
expressed in all tissue types tested and not containing
a TATA box in their promoter region. The exact function
of SURF2 is unknown.
Length = 244
Score = 30.7 bits (69), Expect = 3.8
Identities = 19/57 (33%), Positives = 26/57 (45%), Gaps = 1/57 (1%)
Query: 171 QVAQPPPPSESVTPIPLSPVSSDQEDFSEEDEVVDCNPPALFSFAPSTKEREPSIPD 227
V QP +S + P SSD++D ED + D PP LF+ KE+ D
Sbjct: 140 GVRQPGRTEKSGSDF-WEPPSSDEDDSDSEDSMSDLYPPELFTLKNPGKEQNGDEDD 195
>gnl|CDD|235777 PRK06302, PRK06302, acetyl-CoA carboxylase biotin carboxyl carrier
protein subunit; Validated.
Length = 155
Score = 29.8 bits (68), Expect = 4.1
Identities = 19/74 (25%), Positives = 28/74 (37%), Gaps = 17/74 (22%)
Query: 63 VSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAA---------PL---- 109
+S + + A A APV AA +A AAP ++ + AA P+
Sbjct: 32 ISRAAAAPVAPVAQQAAA-APVAAAPAAAAAAAAAPAAAPAAAAAEAEGHVVTSPMVGTF 90
Query: 110 ---PSSSGQPFISP 120
PS PF+
Sbjct: 91 YRAPSPDAPPFVEV 104
>gnl|CDD|236797 PRK10927, PRK10927, essential cell division protein FtsN;
Provisional.
Length = 319
Score = 30.8 bits (69), Expect = 4.1
Identities = 20/80 (25%), Positives = 33/80 (41%), Gaps = 8/80 (10%)
Query: 66 QSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPF---ISPPA 122
Q+ L+ Q+ Q+ Q + T QAAP + P+S+ QP+ + PA
Sbjct: 158 QAQQLAEQQRLAQQSRTTEQSWQQQTRTSQAAPVQAQP--RQSKPASTQQPYQDLLQTPA 215
Query: 123 QSAAFLAQPASTASLPPSAA 142
+ AQ + P + A
Sbjct: 216 HTT---AQSKPQQAAPVTRA 232
>gnl|CDD|238824 cd01646, RT_Bac_retron_I, RT_Bac_retron_I: Reverse transcriptases
(RTs) in bacterial retrotransposons or retrons. The
polymerase reaction of this enzyme leads to the
production of a unique RNA-DNA complex called msDNA
(multicopy single-stranded (ss)DNA) in which a small
ssDNA branches out from a small ssRNA molecule via a
2'-5'phosphodiester linkage. Bacterial retron RTs
produce cDNA corresponding to only a small portion of
the retron genome.
Length = 158
Score = 30.0 bits (68), Expect = 4.2
Identities = 16/60 (26%), Positives = 23/60 (38%), Gaps = 6/60 (10%)
Query: 616 YLDDFLLV-NQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDR 674
Y+DD + + EI +L L LG +NL K+ + P P G L
Sbjct: 88 YVDDIRIFADSKEEAEEILEELK-EFLAELGLSLNLSKTEILPLP----EGTASKDFLGY 142
>gnl|CDD|212508 cd11649, RsmI_like, Uncharacterized subfamily of the tetrapyrrole
methylase family similar to ribosomal RNA small subunit
methyltransferase I (RsmI). Tetrapyrrole methylase uses
S-AdoMet (S-adenosyl-L-methionine or SAM) in the
methylation of diverse substrates. This uncharacterized
subfamily exhibits sequence similarity to the ribosomal
RNA small subunit methyltransferase I (RsmI), which
catalyzes the 2-O-methylation of the ribose of cytidine
1402 (C1402) in 16S rRNA.
Length = 229
Score = 30.2 bits (69), Expect = 4.7
Identities = 20/68 (29%), Positives = 25/68 (36%), Gaps = 26/68 (38%)
Query: 52 PAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPS 111
P V+DPG + V AHQ V G S S+L A L +
Sbjct: 91 PGVADPG----------------------AELVALAHQLGIRVVPLVGPS-SILLA-LMA 126
Query: 112 S--SGQPF 117
S +GQ F
Sbjct: 127 SGLNGQSF 134
>gnl|CDD|148635 pfam07139, DUF1387, Protein of unknown function (DUF1387). This
family represents a conserved region approximately 300
residues long within a number of hypothetical proteins
of unknown function that seem to be restricted to
mammals.
Length = 301
Score = 30.4 bits (68), Expect = 4.9
Identities = 27/133 (20%), Positives = 45/133 (33%), Gaps = 19/133 (14%)
Query: 40 ITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQ---------- 89
+T +K + +S + S +++ P A P V H
Sbjct: 3 VTGKKKNKKKKSKPKPEAPAKSASKEETTPE--EQAAPGDEKDEVNGFHANGSADDTESV 60
Query: 90 -------PSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAA 142
SA++ A + ++ A P PSSS +S + P S+A PS+
Sbjct: 61 DSLSEGLDSASLDAREPEAVTLDAPPSPSSSLTNGLSDLQSKLELQSSPHSSAKPHPSSD 120
Query: 143 HHLYPLPFFCDPS 155
H + PS
Sbjct: 121 QHKNAKKYVSKPS 133
>gnl|CDD|221179 pfam11711, Tim54, Inner membrane protein import complex subunit
Tim54. Mitochondrial function depends on the import of
hundreds of different proteins synthesised in the
cytosol. Protein import is a multi-step pathway which
includes the binding of precursor proteins to surface
receptors, translocation of the precursor across one or
both mitochondrial membranes, and folding and assembly
of the imported protein inside the mitochondrion. Most
precursor proteins carry amino-terminal targeting
signals, called pre-sequences, and are imported into
mitochondria via import complexes located in both the
outer and the inner membrane (IM). The IM complex, TIM,
is made up of at least two proteins which mediate
translocation of proteins into the matrix by removing
their signal peptide and another pair of proteins, Tim54
and Tim22, that insert the polytopic proteins, that
carry internal targetting information, into the inner
membrane.
Length = 377
Score = 30.5 bits (69), Expect = 5.0
Identities = 20/86 (23%), Positives = 30/86 (34%), Gaps = 6/86 (6%)
Query: 65 DQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQS 124
D++ P + A P S A T + + P+ +P+ISP
Sbjct: 206 DEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENNKPVKPPVPKPYISPDEYP 265
Query: 125 AAFLAQPASTASLPPSAAHHLYPLPF 150
+A L P L PS +PF
Sbjct: 266 SAPLP-PELPQLLQPS-----LVIPF 285
>gnl|CDD|235899 PRK06975, PRK06975, bifunctional uroporphyrinogen-III
synthetase/uroporphyrin-III C-methyltransferase;
Reviewed.
Length = 656
Score = 30.8 bits (70), Expect = 5.2
Identities = 14/76 (18%), Positives = 21/76 (27%), Gaps = 5/76 (6%)
Query: 30 ERCLAQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQ 89
ER + + A +PA S + + PA A AP
Sbjct: 256 ERIVRAFLTWADAAAQP-ATAAPAPSRMTDTNDSKSVTS----QPAAAAAAPAPPPNPPA 310
Query: 90 PSATVQAAPGSSASVL 105
A G ++ L
Sbjct: 311 TPPEPPARRGRGSAAL 326
>gnl|CDD|235737 PRK06197, PRK06197, short chain dehydrogenase; Provisional.
Length = 306
Score = 30.4 bits (69), Expect = 5.3
Identities = 17/54 (31%), Positives = 20/54 (37%), Gaps = 4/54 (7%)
Query: 85 QAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLP 138
AA + V A PG S + LA LP A LAQ +LP
Sbjct: 197 AAAGATTIAVAAHPGVSNTELARNLPR----ALRPVATVLAPLLAQSPEMGALP 246
>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
Reviewed.
Length = 460
Score = 30.3 bits (68), Expect = 5.6
Identities = 29/166 (17%), Positives = 51/166 (30%), Gaps = 12/166 (7%)
Query: 73 APANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPA 132
AP + V A + AA +++ A P G+P + P A +
Sbjct: 281 APGALRGSEYVVDEALEREPAELAAAAVASAASAVG-PVGPGEP--NQPDDVAEAVKAEV 337
Query: 133 STASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPPPPSESVTPIPLSPVSS 192
+ + +A + G + + S + + P + P +
Sbjct: 338 AEVTDEVAAESVVQV------ADRDGESTPAVEETSEADIEREQP-GDLAGQAPAAHQVD 390
Query: 193 DQEDFSEEDEVVDCNPPALFSFAPSTKEREPSIPDP--DSELASQG 236
+ + +E A P E+ IPDP ELA G
Sbjct: 391 AEAASAAPEEPAALASEAHDETEPEVPEKAAPIPDPAKPDELAVAG 436
>gnl|CDD|131471 TIGR02418, acolac_catab, acetolactate synthase, catabolic.
Acetolactate synthase (EC 2.2.1.6) combines two
molecules of pyruvate to yield 2-acetolactate with the
release of CO2. This reaction may be involved in either
valine biosynthesis (biosynthetic) or conversion of
pyruvate to acetoin and possibly to 2,3-butanediol
(catabolic). The biosynthetic type, described by
TIGR00118, is also capable of forming
acetohydroxybutyrate from pyruvate and 2-oxobutyrate for
isoleucine biosynthesis. The family described here, part
of the same larger family of thiamine
pyrophosphate-dependent enzymes (pfam00205, pfam02776)
is the catabolic form, generally found associated with
in species with acetolactate decarboxylase and usually
found in the same operon. The model may not encompass
all catabolic acetolactate synthases, but rather one
particular clade in the larger TPP-dependent enzyme
family [Energy metabolism, Fermentation].
Length = 539
Score = 30.5 bits (69), Expect = 5.6
Identities = 37/136 (27%), Positives = 54/136 (39%), Gaps = 24/136 (17%)
Query: 33 LAQLVQAITARKSSTLDQS-PAVSD--PGMASGVSDQSPPLSSAPANPVQASAPVQAAHQ 89
+AQ V IT + L S P S+ G+A+ S+ P + A V+ + ++ HQ
Sbjct: 51 MAQAVGRITGKPGVALVTSGPGCSNLVTGLATANSEGDPVV--AIGGQVKRADLLKLTHQ 108
Query: 90 P-------------SATVQAAPGSSASVLAAPLPSSSGQP---FISPPAQSAAFLAQPAS 133
SA VQ S V A + SG+P F+S P + P S
Sbjct: 109 SMDNVALFRPITKYSAEVQDPDALSEVVANAFRAAESGKPGAAFVSLPQD---VVDSPVS 165
Query: 134 TASLPPSAAHHLYPLP 149
++P S A L P
Sbjct: 166 VKAIPASYAPKLGAAP 181
>gnl|CDD|178145 PLN02530, PLN02530, histidine-tRNA ligase.
Length = 487
Score = 30.5 bits (69), Expect = 5.8
Identities = 21/106 (19%), Positives = 28/106 (26%), Gaps = 23/106 (21%)
Query: 96 AAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPS 155
A SSA+ ++A P S A SAA
Sbjct: 1 TAIASSATRVSASAIRRPSLP--LSSRCSFLLSASSPRGGRCAASAA------------- 45
Query: 156 YYGHYLQSAMKASRGQVAQPPPPSESVTPIPLSPVSSDQEDFSEED 201
+ S G A P + I ++P DF ED
Sbjct: 46 -------AGGGRSGGTTAPPSVQEDGKPKIDVNP-PKGTRDFPPED 83
>gnl|CDD|235540 PRK05641, PRK05641, putative acetyl-CoA carboxylase biotin carboxyl
carrier protein subunit; Validated.
Length = 153
Score = 29.4 bits (66), Expect = 5.9
Identities = 17/50 (34%), Positives = 22/50 (44%), Gaps = 5/50 (10%)
Query: 72 SAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPP 121
SA V AP A PS AP A AP P+S+G+ ++ P
Sbjct: 46 SAVQEQVPTPAPAPAPAVPS-----APTPVAPAAPAPAPASAGENVVTAP 90
Score = 29.4 bits (66), Expect = 6.2
Identities = 15/55 (27%), Positives = 23/55 (41%), Gaps = 4/55 (7%)
Query: 60 ASGVSDQSPPLSSAPANPVQASAPVQ-AAHQPSATVQAAPGSSA---SVLAAPLP 110
A G+ + P A AP +A P A AP ++ +V+ AP+P
Sbjct: 38 AKGLGIDLSAVQEQVPTPAPAPAPAVPSAPTPVAPAAPAPAPASAGENVVTAPMP 92
>gnl|CDD|184918 PRK14954, PRK14954, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 30.3 bits (68), Expect = 6.4
Identities = 30/116 (25%), Positives = 44/116 (37%), Gaps = 6/116 (5%)
Query: 71 SSAPANPVQASAPVQAAHQPSA-TVQAAPGSSASVLA-APLPSSSGQPFISPPAQSAAFL 128
S A + V+ AP QP A P + + A P P+S+ P PP +A L
Sbjct: 383 SPAGSPDVKKKAPEPDLPQPDRHPGPAKPEAPGARPAELPSPASAPTPEQQPPVARSAPL 442
Query: 129 AQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPPPPSESVTP 184
P+ AS P + A + S+ G ++ SR Q Q + T
Sbjct: 443 -PPSPQASAPRNVASGKPGVDL---GSWQGKFMNFTRNGSRKQPVQASSSDAAQTG 494
>gnl|CDD|233595 TIGR01845, outer_NodT, efflux transporter, outer membrane factor
(OMF) lipoprotein, NodT family. Members of This model
comprise a subfamily of the Outer Membrane Factor (TCDB
1.B.17) porins. OMF proteins operate in conjunction with
a primary transporter of the RND, MFS, ABC, or PET
systems, and a MFP (membrane fusion protein) to tranport
substrates across membranes. The complex thus formed
allows transport (export) of various solutes (heavy
metal cations; drugs, oligosaccharides, proteins, etc.)
across the two envelopes of the Gram-negative bacterial
cell envelope in a single energy-coupled step. Current
data suggest that the OMF (and not the MFP) is largely
responsible for the formation of both the trans-outer
membrane and trans-periplasmic channels. The roles
played by the MFP have yet to be determined [Cellular
processes, Detoxification, Transport and binding
proteins, Porins].
Length = 454
Score = 30.5 bits (69), Expect = 6.5
Identities = 27/107 (25%), Positives = 43/107 (40%), Gaps = 15/107 (14%)
Query: 29 IERCLAQLVQAITA------------RKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPAN 76
++ +AQ A+ A + LDQ P + S + + P + +A
Sbjct: 227 LDVQIAQARNALAALLGKGPSRGLAIARPLLLDQLPPDLPLSLPSDLLRRRPDIRAAERR 286
Query: 77 PVQASAPV---QAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISP 120
A+A + +AA PS T+ A+ G SAS L+ S I P
Sbjct: 287 LAAANAQIGVAKAAFFPSITLSASIGLSASQLSRLFDGGSRFWSIGP 333
>gnl|CDD|117527 pfam08961, DUF1875, Domain of unknown function (DUF1875). The MIT
domain, found in Nuclear receptor-binding factor 2, has
no known function.
Length = 243
Score = 29.8 bits (66), Expect = 7.3
Identities = 19/71 (26%), Positives = 27/71 (38%), Gaps = 8/71 (11%)
Query: 348 KRAECLESRRRLVEPRDPHLASLLL------RARRGKKSSSPQNLEPPGRVSLKVQTLQK 401
K E ++ L RD H+ LLL RA+R ++ + Q E L QT
Sbjct: 2 KLTESEQAHLSLELQRDSHMKQLLLIQERWKRAKREERLKAQQATEQDAARHL--QTSHP 59
Query: 402 PQRCSSPVNPP 412
P + P
Sbjct: 60 PSVEDAEGQSP 70
>gnl|CDD|152960 pfam12526, DUF3729, Protein of unknown function (DUF3729). This
family of proteins is found in viruses. Proteins in this
family are typically between 145 and 1707 amino acids in
length. The family is found in association with
pfam01443, pfam01661, pfam05417, pfam01660, pfam00978.
There is a single completely conserved residue L that
may be functionally important.
Length = 115
Score = 28.5 bits (64), Expect = 7.8
Identities = 23/93 (24%), Positives = 25/93 (26%), Gaps = 13/93 (13%)
Query: 64 SDQSPP--LSSAPANPVQASAPVQAAHQPSAT---VQAAPGSSASVLAAPLPSSSGQPFI 118
S SPP PV P P + V P A+ P P G
Sbjct: 27 SCFSPPESAHPDDPPPVGDPRPPVVDTPPPVSAVWVLPPPSEPAAPPPDPEPPVPG---- 82
Query: 119 SPPAQSAAFLAQPASTASLPPSAAH----HLYP 147
S PA LPP H YP
Sbjct: 83 PAGPPSPLAPPAPARKPPLPPPRPQRRLLHTYP 115
>gnl|CDD|222843 PHA02030, PHA02030, hypothetical protein.
Length = 336
Score = 29.9 bits (67), Expect = 8.2
Identities = 20/64 (31%), Positives = 24/64 (37%), Gaps = 5/64 (7%)
Query: 65 DQSPPLSSAPANPVQA-SAPVQAAHQPSATV-QAAPGSSASVLAAPLPSSSGQPFISPPA 122
L + P A SA A +A V QAAP A LP P ++P A
Sbjct: 261 AAGSNLPAVPNVAADAGSAAAPAVPAAAAAVAQAAPSVPQVPNVAVLPDV---PQVAPVA 317
Query: 123 QSAA 126
AA
Sbjct: 318 APAA 321
>gnl|CDD|223009 PHA03211, PHA03211, serine/threonine kinase US3; Provisional.
Length = 461
Score = 29.9 bits (67), Expect = 8.3
Identities = 18/101 (17%), Positives = 28/101 (27%), Gaps = 20/101 (19%)
Query: 2 ASNPKRDTSDPQPEAKRLCHLEEKTEKIERCLAQLVQAITARKSSTLDQSPAVSDPGMAS 61
+ + EA RLC ++E LA++ + S
Sbjct: 51 PEHDPPSPHGARDEAARLCQIQE-------LLAEMRS----------SEEYPDSGAEAED 93
Query: 62 GVSDQSPPLSSAP---ANPVQASAPVQAAHQPSATVQAAPG 99
D +P + P A H P+ A PG
Sbjct: 94 DDDDDAPDDVAYPDEYAEDDFLPGDGAPDHDPAPCGPAPPG 134
>gnl|CDD|215184 PLN02325, PLN02325, nudix hydrolase.
Length = 144
Score = 28.7 bits (64), Expect = 8.4
Identities = 12/28 (42%), Positives = 16/28 (57%)
Query: 361 EPRDPHLASLLLRARRGKKSSSPQNLEP 388
EP+ H ++ +RA S PQNLEP
Sbjct: 81 EPKPSHYVTVFMRAVLADPSQVPQNLEP 108
>gnl|CDD|234818 PRK00708, PRK00708, sec-independent translocase; Provisional.
Length = 209
Score = 29.4 bits (66), Expect = 8.6
Identities = 21/109 (19%), Positives = 40/109 (36%), Gaps = 2/109 (1%)
Query: 34 AQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSAT 93
+ L +A + + +T ++ V+ P G+ + P + P A+A QAA P A
Sbjct: 94 SDLQKATSMSEPATENKPAEVTTPVEPMGLPETPPAVPVPAPAPAVAAAAAQAAAAPKAP 153
Query: 94 VQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAA 142
+ S A + +AA +P + + P+
Sbjct: 154 AKPRAKSPRPAAKAAPKPTETITAKKAKKTAAA--PKPTADKTATPAKK 200
>gnl|CDD|225805 COG3266, DamX, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 292
Score = 29.5 bits (66), Expect = 9.4
Identities = 29/107 (27%), Positives = 42/107 (39%), Gaps = 3/107 (2%)
Query: 36 LVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQ 95
L Q +STL PA P + V P A PV+A A + A +P A
Sbjct: 116 LSQLNNVAVTSTLPTEPATVAPVRNASVPTAERP---AITRPVRAQAVSEPAVEPKAAKT 172
Query: 96 AAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAA 142
A + A+P + + P A S + P+S +L SA+
Sbjct: 173 ATATEAKVQTASPAQTPATPPAGKGAAASGQLKSAPSSHYTLQLSAS 219
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.133 0.399
Gapped
Lambda K H
0.267 0.0647 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 55,937,345
Number of extensions: 5552964
Number of successful extensions: 6571
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5864
Number of HSP's successfully gapped: 327
Length of query: 1097
Length of database: 10,937,602
Length adjustment: 107
Effective length of query: 990
Effective length of database: 6,191,724
Effective search space: 6129806760
Effective search space used: 6129806760
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 64 (28.6 bits)