RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= 029905
(185 letters)
>gnl|CDD|238013 cd00056, ENDO3c, endonuclease III; includes endonuclease III
(DNA-(apurinic or apyrimidinic site) lyase), alkylbase
DNA glycosidases (Alka-family) and other DNA
glycosidases.
Length = 158
Score = 34.1 bits (79), Expect = 0.014
Identities = 15/31 (48%), Positives = 16/31 (51%), Gaps = 1/31 (3%)
Query: 106 KAVSELTVLKGVGPATASAVLA-AYAPDLAP 135
A EL L GVG TA+ VL A PD P
Sbjct: 80 DAREELLALPGVGRKTANVVLLFALGPDAFP 110
>gnl|CDD|109681 pfam00633, HHH, Helix-hairpin-helix motif. The helix-hairpin-helix
DNA-binding motif is found to be duplicated in the
central domain of RuvA. The HhH domain of DisA, a
bacterial checkpoint control protein, is a DNA-binding
domain.
Length = 30
Score = 30.1 bits (69), Expect = 0.055
Identities = 11/26 (42%), Positives = 15/26 (57%)
Query: 103 DLTKAVSELTVLKGVGPATASAVLAA 128
+ + EL L GVGP TA A+L+
Sbjct: 5 LIPASREELLALPGVGPKTAEAILSY 30
>gnl|CDD|197622 smart00278, HhH1, Helix-hairpin-helix DNA-binding motif class 1.
Length = 20
Score = 27.6 bits (63), Expect = 0.33
Identities = 9/20 (45%), Positives = 12/20 (60%)
Query: 109 SELTVLKGVGPATASAVLAA 128
EL + G+GP TA +L A
Sbjct: 1 EELLKVPGIGPKTAEKILEA 20
>gnl|CDD|214684 smart00478, ENDO3c, endonuclease III. includes endonuclease III
(DNA-(apurinic or apyrimidinic site) lyase), alkylbase
DNA glycosidases (Alka-family) and other DNA
glycosidases.
Length = 149
Score = 29.9 bits (68), Expect = 0.37
Identities = 11/22 (50%), Positives = 13/22 (59%)
Query: 106 KAVSELTVLKGVGPATASAVLA 127
EL L GVG TA+AVL+
Sbjct: 69 DDREELLKLPGVGRKTANAVLS 90
>gnl|CDD|237034 PRK12278, PRK12278, 50S ribosomal protein L21/unknown domain fusion
protein; Provisional.
Length = 221
Score = 30.2 bits (68), Expect = 0.54
Identities = 18/76 (23%), Positives = 25/76 (32%), Gaps = 11/76 (14%)
Query: 91 KSASEKAFKSLPDLTKAVSELTVLKGVGPATAS----------AVLAAYAPDLAPFMSDE 140
A A +LT + GVGPA A A +AA + DE
Sbjct: 140 APAPAAAAAPPAAAAAGADDLTKITGVGPALAKKLNEAGVTTFAQIAALTDADIAKI-DE 198
Query: 141 AMGAALGHSKDYSLKQ 156
+ KD ++Q
Sbjct: 199 KLSFKGRIEKDGWIEQ 214
>gnl|CDD|234792 PRK00558, uvrC, excinuclease ABC subunit C; Validated.
Length = 598
Score = 30.5 bits (70), Expect = 0.55
Identities = 12/38 (31%), Positives = 17/38 (44%), Gaps = 1/38 (2%)
Query: 96 KAFKSLPDLTKA-VSELTVLKGVGPATASAVLAAYAPD 132
K F SL + +A V EL + G+ A A+ A
Sbjct: 561 KHFGSLKAIKEASVEELAKVPGISKKLAEAIYEALHKK 598
>gnl|CDD|215487 PLN02900, PLN02900, alanyl-tRNA synthetase.
Length = 936
Score = 30.0 bits (68), Expect = 0.74
Identities = 14/57 (24%), Positives = 20/57 (35%), Gaps = 10/57 (17%)
Query: 80 VFVSSLDD----SSVKSASEKAFKSLPDLTKAVSELTVLKGVGPATASAVLAAYAPD 132
V V+ LD +++K A+EK L D V+ L A P
Sbjct: 832 VLVARLDVGVDAAALKEAAEKVIAKLGDPA------AVVLSSDEEKGKVSLVAAVPP 882
>gnl|CDD|214836 smart00825, PKS_KS, Beta-ketoacyl synthase. The structure of
beta-ketoacyl synthase is similar to that of the
thiolase family and also chalcone synthase. The active
site of beta-ketoacyl synthase is located between the N
and C-terminal domains.
Length = 298
Score = 29.2 bits (67), Expect = 0.99
Identities = 16/50 (32%), Positives = 21/50 (42%), Gaps = 9/50 (18%)
Query: 40 RKELPSLIHQRNPNPHINTTE-----LSKLVRWKLTRGKWRPRLLVFVSS 84
+P +H PNPHI+ E ++L W RPR VSS
Sbjct: 239 HGVIPPTLHFETPNPHIDLEESPLRVPTELTPW---PPPGRPR-RAGVSS 284
>gnl|CDD|205093 pfam12826, HHH_2, Helix-hairpin-helix motif. The HhH domain of
DisA, a bacterial checkpoint control protein, is a
DNA-binding domain.
Length = 64
Score = 27.0 bits (61), Expect = 1.3
Identities = 12/31 (38%), Positives = 16/31 (51%), Gaps = 1/31 (3%)
Query: 96 KAFKSLPDLTKA-VSELTVLKGVGPATASAV 125
K F SL L A V EL + +GP A ++
Sbjct: 21 KHFGSLDALANASVEELLEIDDIGPIVAQSI 51
>gnl|CDD|198403 cd10456, GIY-YIG_UPF0213, The GIY-YIG domain of uncharacterized
protein family UPF0213 related to structure-specific
endonuclease SLX1. This family contains a group of
uncharacterized proteins found mainly in bacteria and
several in dsDNA viruses. Although their function roles
have not been recognized, these proteins show
significant sequence similarities with the N-terminal
GIY-YIG endonuclease domain of structure-specific
endonuclease subunit SLX1, which binds another
structure-specific endonuclease subunit SLX4 to form an
active heterodimeric SLX1-SLX4 complex. This complex
functions as a 5' flap endonuclease in yeast, and has
also been identified as a Holliday junction resolvase in
human.
Length = 68
Score = 27.4 bits (62), Expect = 1.3
Identities = 14/35 (40%), Positives = 16/35 (45%), Gaps = 1/35 (2%)
Query: 68 KLTRGKWRPRLLVFVSSLDDSSVKSASEKAFKSLP 102
K TRG+ RP LV+ DD S E K L
Sbjct: 35 KYTRGR-RPVKLVYSEEFDDRSEALKREYRIKKLS 68
>gnl|CDD|178801 PRK00024, PRK00024, hypothetical protein; Reviewed.
Length = 224
Score = 28.9 bits (66), Expect = 1.3
Identities = 15/32 (46%), Positives = 20/32 (62%), Gaps = 1/32 (3%)
Query: 98 FKSLPDLTKA-VSELTVLKGVGPATASAVLAA 128
F SL L A + EL +KG+GPA A+ + AA
Sbjct: 54 FGSLRGLLDASLEELQSIKGIGPAKAAQLKAA 85
>gnl|CDD|204791 pfam11945, WASH_WAHD, WAHD domain of WASH complex. This domain
forms part of the WASH-complex of domains and proteins
that activates the Arp2/3 complex, see pfam04062. The
Arp2/3 complex regulates endocytosis, sorting, and
trafficking within the cell. The WAHD domain attaches to
the FAM21 proteins via its N-terminal residues and to
the microtubules via its C-terminal residues.
Length = 297
Score = 29.0 bits (65), Expect = 1.4
Identities = 14/52 (26%), Positives = 23/52 (44%), Gaps = 1/52 (1%)
Query: 100 SLPDLTKAVSELTVLKGVGPATASAVLAAYAPDLAPFMSDEAMGAALGHSKD 151
LPDL +L +GP A + + P+L P +A+ A S++
Sbjct: 241 YLPDLPGIADDLMYSADLGPGIAPSAPSVIIPEL-PSFETDAIEVADHDSQE 291
>gnl|CDD|234014 TIGR02786, addB_alphas, double-strand break repair protein AddB,
alphaproteobacterial type. AddAB is a system well
described in the Firmicutes as a replacement for RecBCD
in many prokaryotes for the repair of double stranded
break DNA damage. More recently, a distantly related
gene pair conserved in many alphaproteobacteria was
shown also to function in double-stranded break repair
in Rhizobium etli. This family consists of AddB proteins
of the alphaproteobacteial type [DNA metabolism, DNA
replication, recombination, and repair].
Length = 1021
Score = 29.3 bits (66), Expect = 1.5
Identities = 25/115 (21%), Positives = 41/115 (35%), Gaps = 8/115 (6%)
Query: 50 RNPNPHINTTELSK-LVRWKLTRGKWRPRLLVFVSSLDDSSVKSASEKAFKSLPD-LTKA 107
R P L R L G P + + S+L + S+++A + L + L A
Sbjct: 461 RGGTPRAGILALPAFFERRLLALG-DDPEVPAWHSALPEQSIEAA-----RPLCELLEIA 514
Query: 108 VSELTVLKGVGPATASAVLAAYAPDLAPFMSDEAMGAALGHSKDYSLKQYLLFAD 162
+ L L G + + A + DE A + + K LFA+
Sbjct: 515 IEPLESLAGRREELDAELAEATVELMEALSRDEEGWLAALYEGEAGHKLASLFAE 569
>gnl|CDD|233862 TIGR02424, TF_pcaQ, pca operon transcription factor PcaQ. Members
of this family are LysR-family transcription factors
associated with operons for catabolism of
protocatechuate. Members occur only in Proteobacteria
[Energy metabolism, Other, Regulatory functions, DNA
interactions].
Length = 300
Score = 28.9 bits (65), Expect = 1.5
Identities = 17/48 (35%), Positives = 24/48 (50%), Gaps = 1/48 (2%)
Query: 70 TRGKWRPRLLVFVSSLDDSSVKSASEKAFKSLPDLTKAVSELTVLKGV 117
TR K+R L FV SVK A+E + P ++K + EL + G
Sbjct: 1 TRIKFR-HLQCFVEVARQGSVKRAAEALHITQPAVSKTLRELEEILGT 47
>gnl|CDD|238685 cd01389, MATA_HMG-box, MATA_HMG-box, class I member of the
HMG-box superfamily of DNA-binding proteins. These
proteins contain a single HMG box, and bind the minor
groove of DNA in a highly sequence-specific manner.
Members include the fungal mating type gene products
MC, MATA1 and Ste11.
Length = 77
Score = 27.2 bits (61), Expect = 1.5
Identities = 7/43 (16%), Positives = 20/43 (46%), Gaps = 8/43 (18%)
Query: 39 YRKELPSLIHQRNPNPHINTTELSKLV--RWKL----TRGKWR 75
YR++ + + NP + E+S+++ W+ + ++
Sbjct: 12 YRQDKHAQLKT--ENPGLTNNEISRIIGRMWRSESPEVKAYYK 52
>gnl|CDD|237395 PRK13482, PRK13482, DNA integrity scanning protein DisA;
Provisional.
Length = 352
Score = 28.6 bits (65), Expect = 1.7
Identities = 10/31 (32%), Positives = 17/31 (54%), Gaps = 1/31 (3%)
Query: 96 KAFKSLPDLTKA-VSELTVLKGVGPATASAV 125
+ F SL L A + +L ++G+G A A+
Sbjct: 305 EHFGSLQGLLAASIEDLDEVEGIGEVRARAI 335
>gnl|CDD|180330 PRK05965, PRK05965, hypothetical protein; Provisional.
Length = 459
Score = 28.5 bits (64), Expect = 2.1
Identities = 12/42 (28%), Positives = 15/42 (35%), Gaps = 9/42 (21%)
Query: 121 TASAVLAAYAPDLAPFMSDE---------AMGAALGHSKDYS 153
A + + Y P A MSD A +GH YS
Sbjct: 282 VAKGLTSGYVPMGAVLMSDHVYQGIADGAGAAAPVGHGYTYS 323
>gnl|CDD|237496 PRK13766, PRK13766, Hef nuclease; Provisional.
Length = 773
Score = 28.7 bits (65), Expect = 2.1
Identities = 14/44 (31%), Positives = 20/44 (45%), Gaps = 3/44 (6%)
Query: 82 VSSLDDSSVKSASE--KAFKSLPDLTKA-VSELTVLKGVGPATA 122
V SL D A + F S+ + A EL ++G+G TA
Sbjct: 717 VESLPDVGPVLARNLLEHFGSVEAVMTASEEELMEVEGIGEKTA 760
>gnl|CDD|217002 pfam02371, Transposase_20, Transposase IS116/IS110/IS902 family.
Transposases are needed for efficient transposition of
the insertion sequence or transposon DNA. This family
includes transposases for IS116, IS110 and IS902. This
region is often found with pfam01548. The exact function
of this region is uncertain. This family contains a HHH
motif suggesting a DNA-binding function.
Length = 87
Score = 27.1 bits (61), Expect = 2.1
Identities = 14/37 (37%), Positives = 21/37 (56%), Gaps = 1/37 (2%)
Query: 111 LTVLKGVGPATASAVLAAYAPDLAPFMSDEAMGAALG 147
L + G+GP TA+A+LA D++ F S + A G
Sbjct: 4 LLSIPGIGPITAAALLAE-IGDISRFKSARQLAAYAG 39
>gnl|CDD|144128 pfam00416, Ribosomal_S13, Ribosomal protein S13/S18. This family
includes ribosomal protein S13 from prokaryotes and S18
from eukaryotes.
Length = 106
Score = 27.2 bits (61), Expect = 2.5
Identities = 9/28 (32%), Positives = 13/28 (46%)
Query: 101 LPDLTKAVSELTVLKGVGPATASAVLAA 128
L K LT +KG+G A+ +L
Sbjct: 7 LDGNKKIEIALTYIKGIGRRKANQILKK 34
>gnl|CDD|224115 COG1194, MutY, A/G-specific DNA glycosylase [DNA replication,
recombination, and repair].
Length = 342
Score = 28.1 bits (63), Expect = 2.9
Identities = 11/20 (55%), Positives = 13/20 (65%)
Query: 108 VSELTVLKGVGPATASAVLA 127
EL L GVGP TA A+L+
Sbjct: 112 EEELAALPGVGPYTAGAILS 131
>gnl|CDD|224914 COG2003, RadC, DNA repair proteins [DNA replication, recombination,
and repair].
Length = 224
Score = 27.6 bits (62), Expect = 3.1
Identities = 18/44 (40%), Positives = 26/44 (59%), Gaps = 4/44 (9%)
Query: 89 SVKSASE---KAFKSLPDLTKA-VSELTVLKGVGPATASAVLAA 128
SV ++ + F SL +L KA V EL+ +KG+G A A + AA
Sbjct: 42 SVLDLAKELLQEFGSLAELLKASVEELSSVKGIGLAKAIQIKAA 85
>gnl|CDD|223177 COG0099, RpsM, Ribosomal protein S13 [Translation, ribosomal
structure and biogenesis].
Length = 121
Score = 27.1 bits (61), Expect = 3.2
Identities = 10/47 (21%), Positives = 19/47 (40%), Gaps = 3/47 (6%)
Query: 101 LPDLTKAVSELTVLKGVGPATASAVLAA--YAPDL-APFMSDEAMGA 144
+P + V LT + G+G A + PD +++E +
Sbjct: 9 IPGNKRVVIALTYIYGIGRRRAKEICKKAGIDPDKRVGELTEEEIER 55
>gnl|CDD|233251 TIGR01054, rgy, reverse gyrase. This model describes reverse
gyrase, found in both archaeal and bacterial
thermophiles. This enzyme, a fusion of a type I
topoisomerase domain and a helicase domain, introduces
positive supercoiling to increase the melting
temperature of DNA double strands. Generally, these
gyrases are encoded as a single polypeptide. An
exception was found in Methanopyrus kandleri, where
enzyme is split within the topoisomerase domain,
yielding a heterodimer of gene products designated RgyB
and RgyA [DNA metabolism, DNA replication,
recombination, and repair].
Length = 1171
Score = 28.3 bits (63), Expect = 3.2
Identities = 25/109 (22%), Positives = 45/109 (41%), Gaps = 11/109 (10%)
Query: 77 RLLVFVSSLDDSSVKSASEKAFKSLPDLTKAV--SELTVLKGVGPAT-ASAVLAAYAPDL 133
RLL+ +S L + ++S SEKA K L L + + E + + ++ L
Sbjct: 423 RLLLLLSILQEERIESLSEKARKLLGYLKRYIDLIERALEEVERHRGEVREIIEEVMKFL 482
Query: 134 APFMSDEAMGAALGHSKDYSL-----KQYLLFADK---LQAKAKVSDIF 174
+ E + + S+D L YL+ D +QA + S ++
Sbjct: 483 REQLKREDVVEKILESEDLVLVARGGIPYLVVPDARTYIQASGRTSRLY 531
>gnl|CDD|190277 pfam02320, UCR_hinge, Ubiquinol-cytochrome C reductase hinge
protein. The ubiquinol-cytochrome C reductase complex
(cytochrome bc1 complex) is a respiratory multienzyme
complex. This Pfam family represents the 'hinge'
protein of the complex which is thought to mediate
formation of the cytochrome c1 and cytochrome c
complex.
Length = 65
Score = 26.1 bits (58), Expect = 3.3
Identities = 11/28 (39%), Positives = 14/28 (50%)
Query: 2 QLEFECSNVNKWKEALASYEACVESLNK 29
L EC N K +A Y+ CVE +N
Sbjct: 6 TLREECKNKEKCVKAKHEYQECVERVNS 33
>gnl|CDD|173858 cd08493, PBP2_DppA_like, The substrate-binding component of an
ABC-type dipeptide import system contains the type 2
periplasmic binding fold. This family represents the
substrate-binding domain of an ATP-binding cassette
(ABC)-type dipeptide import system. The DppA binds
dipeptides and some tripeptides and is involved in
chemotaxis toward dipeptides, whereas the OppA binds
peptides of a wide range of lengths (2-35 amino acid
residues) and plays a role in recycling of cell wall
peptides, which precludes any involvement in chemotaxis.
Most of other periplasmic binding proteins are
comprised of only two globular subdomains corresponding
to domains I and III of the dipeptide/oligopeptide
binding proteins. The structural topology of these
domains is most similar to that of the type 2
periplasmic binding proteins (PBP2), which are
responsible for the uptake of a variety of substrates
such as phosphate, sulfate, polysaccharides,
lysine/arginine/ornithine, and histidine. The PBP2 bind
their ligand in the cleft between these domains in a
manner resembling a Venus flytrap. After binding their
specific ligand with high affinity, they can interact
with a cognate membrane transport complex comprised of
two integral membrane domains and two cytoplasmically
located ATPase domains. This interaction triggers the
ligand translocation across the cytoplasmic membrane
energized by ATP hydrolysis. Besides transport
proteins, the PBP2 superfamily includes the
ligand-binding domains from ionotropic glutamate
receptors, LysR-type transcriptional regulators, and
unorthodox sensor proteins involved in signal
transduction.
Length = 482
Score = 27.9 bits (63), Expect = 3.5
Identities = 18/57 (31%), Positives = 24/57 (42%), Gaps = 11/57 (19%)
Query: 117 VGPATASAVLAAYAPDLAPFMSDEAMGAALGHSKDYSLKQYLLFADKLQAKAKVSDI 173
V T L PD APF+++ AM A S +Y AD+L A K +
Sbjct: 122 VDDYTVKFTLTR--PD-APFLANLAMPFASILSPEY--------ADQLLAAGKPEQL 167
>gnl|CDD|233858 TIGR02414, pepN_proteo, aminopeptidase N, Escherichia coli type.
The M1 family of zinc metallopeptidases contains a
number of distinct, well-separated clades of proteins
with aminopeptidase activity. Several are designated
aminopeptidase N, EC 3.4.11.2, after the Escherichia
coli enzyme, suggesting a similar activity profile (see
SP|P04825 for a description of catalytic activity). This
family consists of all aminopeptidases closely related
to E. coli PepN and presumed to have similar (not
identical) function. Nearly all are found in
Proteobacteria, but members are found also in
Cyanobacteria, plants, and apicomplexan parasites. This
family differs greatly in sequence from the family of
aminopeptidases typified by Streptomyces lividans PepN
(TIGR02412), from the membrane bound aminopeptidase N
family in animals, etc [Protein fate, Degradation of
proteins, peptides, and glycopeptides].
Length = 863
Score = 27.7 bits (62), Expect = 3.9
Identities = 14/56 (25%), Positives = 29/56 (51%)
Query: 75 RPRLLVFVSSLDDSSVKSASEKAFKSLPDLTKAVSELTVLKGVGPATASAVLAAYA 130
R L ++S+ DD+ +++ + + FKS ++T ++ L+ L LAA+
Sbjct: 675 RNACLSYLSAADDAEIRNLALEQFKSADNMTDRLAALSALVHFESDFRERALAAFY 730
>gnl|CDD|235607 PRK05785, PRK05785, hypothetical protein; Provisional.
Length = 226
Score = 27.3 bits (61), Expect = 4.1
Identities = 12/46 (26%), Positives = 20/46 (43%), Gaps = 3/46 (6%)
Query: 12 KWKEALASYEACVESLNKPN-LISL--DDYYRKELPSLIHQRNPNP 54
W+E +Y ++ ++ N IS D +R EL I + P
Sbjct: 7 TWEELQEAYNKIPKAYDRANRFISFNQDVRWRAELVKTILKYCGRP 52
>gnl|CDD|236091 PRK07768, PRK07768, long-chain-fatty-acid--CoA ligase; Validated.
Length = 545
Score = 27.7 bits (62), Expect = 4.2
Identities = 11/20 (55%), Positives = 13/20 (65%)
Query: 116 GVGPATASAVLAAYAPDLAP 135
GVGP A AVLA ++AP
Sbjct: 50 GVGPGDAVAVLAGAPVEIAP 69
>gnl|CDD|178588 PLN03014, PLN03014, carbonic anhydrase.
Length = 347
Score = 27.4 bits (60), Expect = 4.3
Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 18/88 (20%)
Query: 88 SSVKSASEKAFKSLPDLTKAVSELTVLKGVGPATASAVLAAYAPDLAPFMSDEAMGAALG 147
SS S+S + +S+P L + V AA AP +AP+ S+E MG
Sbjct: 40 SSSSSSSSSSSRSVPTLIRN---------------EPVFAAPAPIIAPYWSEE-MGTEAY 83
Query: 148 HSKDYSLKQYLLFADKLQ--AKAKVSDI 173
+LK+ L+ ++L+ A AKV I
Sbjct: 84 DEAIEALKKLLIEKEELKTVAAAKVEQI 111
>gnl|CDD|130156 TIGR01084, mutY, A/G-specific adenine glycosylase. This equivalog
model identifies mutY members of the pfam00730
superfamily (HhH-GPD: Helix-hairpin-helix and Gly/Pro
rich loop followed by a conserved aspartate). The major
members of the superfamily are nth and mutY [DNA
metabolism, DNA replication, recombination, and repair].
Length = 275
Score = 27.4 bits (61), Expect = 4.7
Identities = 9/20 (45%), Positives = 12/20 (60%)
Query: 108 VSELTVLKGVGPATASAVLA 127
+L L GVG TA A+L+
Sbjct: 104 FEDLAALPGVGRYTAGAILS 123
>gnl|CDD|143386 cd00141, NT_POLXc, Nucleotidyltransferase (NT) domain of family X
DNA Polymerases. X family polymerases fill in short
gaps during DNA repair. They are relatively inaccurate
enzymes and play roles in base excision repair, in
non-homologous end joining (NHEJ) which acts mainly to
repair damage due to ionizing radiation, and in V(D)J
recombination. This family includes eukaryotic Pol beta,
Pol lambda, Pol mu, and terminal deoxyribonucleotidyl
transferase (TdT). Pol beta and Pol lambda are primarily
DNA template-dependent polymerases. TdT is a DNA
template-independent polymerase. Pol mu has both
template dependent and template independent activities.
This subgroup belongs to the Pol beta-like NT
superfamily. In the majority of enzymes in this
superfamily, two carboxylates, Dx[D/E], together with a
third more distal carboxylate, coordinate two divalent
metal cations involved in a two-metal ion mechanism of
nucleotide addition. These three carboxylate residues
are fairly well conserved in this family.
Length = 307
Score = 27.2 bits (61), Expect = 5.5
Identities = 9/29 (31%), Positives = 13/29 (44%)
Query: 94 SEKAFKSLPDLTKAVSELTVLKGVGPATA 122
K + D+ + L + GVGP TA
Sbjct: 70 LRKLEELREDVPPGLLLLLRVPGVGPKTA 98
>gnl|CDD|223474 COG0397, COG0397, Uncharacterized conserved protein [Function
unknown].
Length = 488
Score = 27.3 bits (61), Expect = 5.9
Identities = 13/40 (32%), Positives = 17/40 (42%), Gaps = 1/40 (2%)
Query: 126 LAAYAPDLAPFMSDEAMGAALGHSKDYSLKQYL-LFADKL 164
L A L+P + A+ AAL D Y L D+L
Sbjct: 302 LQRLAGALSPLIDQAALEAALNSFSDLYRAAYRGLMRDRL 341
>gnl|CDD|216088 pfam00730, HhH-GPD, HhH-GPD superfamily base excision DNA repair
protein. This family contains a diverse range of
structurally related DNA repair proteins. The
superfamily is called the HhH-GPD family after its
hallmark Helix-hairpin-helix and Gly/Pro rich loop
followed by a conserved aspartate. This includes
endonuclease III, EC:4.2.99.18 and MutY an A/G-specific
adenine glycosylase, both have a C terminal 4Fe-4S
cluster. The family also includes 8-oxoguanine DNA
glycosylases. The methyl-CPG binding protein MBD4 also
contains a related domain that is a thymine DNA
glycosylase. The family also includes
DNA-3-methyladenine glycosylase II EC:3.2.2.21 and other
members of the AlkA family.
Length = 144
Score = 26.2 bits (58), Expect = 6.9
Identities = 14/36 (38%), Positives = 17/36 (47%), Gaps = 2/36 (5%)
Query: 102 PDLTKAVSELTVLKGVGPATASAVLAAY--APDLAP 135
P + + L L GVG TA AVL PD+ P
Sbjct: 74 PLDLEELEALLALPGVGRWTAEAVLLFALGRPDVFP 109
>gnl|CDD|224859 COG1948, MUS81, ERCC4-type nuclease [DNA replication,
recombination, and repair].
Length = 254
Score = 26.5 bits (59), Expect = 7.3
Identities = 15/54 (27%), Positives = 23/54 (42%), Gaps = 3/54 (5%)
Query: 75 RPRLLVFVSSLDDSSVKSASE--KAFKSLPDLTKA-VSELTVLKGVGPATASAV 125
+ L + S+ K A K F S+ D+ A EL +KG+G A +
Sbjct: 177 KELQLYILESIPGIGPKLAERLLKKFGSVEDVLTASEEELMKVKGIGEKKAREI 230
>gnl|CDD|240614 cd12794, Hsm3_like, Hsm3 is a yeast Proteasome chaperone of the
19S regulatory particle and related proteins. This
group contains proteins related to the Hsm3 protein
(Yeast Proteasome Interacting Protein) of Saccharomyces
cerevisiae. S. cerevisiae Hsm3 is a chaperone of
regulatory particles involved in proteasome assembly.
The 26S Proteasome is a large, 2.5 MDa complex comprised
of at least 33 subunits, and relies on chaperones to
facilitate correct assembly. The proteasome contains a
cylindrical 20S core particle and 1-2 19S regulatory
particles, comprised of AAA-ATPase and non-ATPase
subunits. The proteasome acts in ubiquitin-dependent
proteolysis. The 19S RP targets and opens the the
ubiquitin-tagged substrate and releases ubiquitin. Hsm3
acts as a 19S chaperone, binding to the C-terminal
domain of Rpt1 (the 6 ATPase subunits of the 19 S
regulatory particle(s). Hsm3 has a C-shape composed of
11 HEAT repeats. Mutations in the Hsm3-Rpt interface
disrupt formation of the 26 S Proteasome complex.
Length = 455
Score = 26.8 bits (60), Expect = 7.7
Identities = 6/31 (19%), Positives = 13/31 (41%)
Query: 155 KQYLLFADKLQAKAKVSDIFFFLNICVYFSS 185
+F + K+ DI F+ + Y++
Sbjct: 204 DDLFIFTTEEIFKSLEKDILLFILLVQYYTK 234
>gnl|CDD|223255 COG0177, Nth, Predicted EndoIII-related endonuclease [DNA
replication, recombination, and repair].
Length = 211
Score = 26.4 bits (59), Expect = 8.0
Identities = 10/22 (45%), Positives = 12/22 (54%)
Query: 106 KAVSELTVLKGVGPATASAVLA 127
EL L GVG TA+ VL+
Sbjct: 106 DTREELLSLPGVGRKTANVVLS 127
>gnl|CDD|223200 COG0122, AlkA, 3-methyladenine DNA glycosylase/8-oxoguanine DNA
glycosylase [DNA replication, recombination, and
repair].
Length = 285
Score = 26.6 bits (59), Expect = 8.2
Identities = 12/20 (60%), Positives = 14/20 (70%)
Query: 107 AVSELTVLKGVGPATASAVL 126
A+ ELT LKG+GP TA L
Sbjct: 196 AIEELTALKGIGPWTAEMFL 215
>gnl|CDD|236389 PRK09134, PRK09134, short chain dehydrogenase; Provisional.
Length = 258
Score = 26.4 bits (59), Expect = 8.2
Identities = 12/28 (42%), Positives = 14/28 (50%), Gaps = 2/28 (7%)
Query: 46 LIHQR--NPNPHINTTELSKLVRWKLTR 71
+I QR N NP + LSK W TR
Sbjct: 144 MIDQRVWNLNPDFLSYTLSKAALWTATR 171
>gnl|CDD|234691 PRK00220, PRK00220, putative glycerol-3-phosphate acyltransferase
PlsY; Provisional.
Length = 198
Score = 26.3 bits (59), Expect = 8.3
Identities = 11/23 (47%), Positives = 13/23 (56%), Gaps = 2/23 (8%)
Query: 115 KGVGPATASAVLAAYAPDLAPFM 137
KGV ATA+ VL P LA +
Sbjct: 106 KGV--ATAAGVLLGIGPLLALIL 126
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.317 0.131 0.380
Gapped
Lambda K H
0.267 0.0940 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,070,363
Number of extensions: 805045
Number of successful extensions: 942
Number of sequences better than 10.0: 1
Number of HSP's gapped: 941
Number of HSP's successfully gapped: 57
Length of query: 185
Length of database: 10,937,602
Length adjustment: 91
Effective length of query: 94
Effective length of database: 6,901,388
Effective search space: 648730472
Effective search space used: 648730472
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 56 (25.0 bits)