RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy9547
(363 letters)
>gnl|CDD|224115 COG1194, MutY, A/G-specific DNA glycosylase [DNA replication,
recombination, and repair].
Length = 342
Score = 386 bits (994), Expect = e-134
Identities = 142/351 (40%), Positives = 202/351 (57%), Gaps = 15/351 (4%)
Query: 6 SAKEILAFQESILTWYKQNARQLPWRESNNPYYVWISEVMLQQTQVKTVLPYYEKFIKTY 65
S +I FQE++L WY +N R LPWRE+ +PY VW+SE+MLQQTQV TV+PYYE+F++ +
Sbjct: 4 SDGDIEKFQEALLDWYDKNGRDLPWRETKDPYRVWVSEIMLQQTQVATVIPYYERFLERF 63
Query: 66 PTIKDFAFDTEDNVLKMWEGLGYYSRVRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGD 125
PTIK A ED VLK WEGLGYYSR RN ++V+E+ GGE P D+++L ++ GVG
Sbjct: 64 PTIKALAAAPEDEVLKAWEGLGYYSRARNLHKAAQEVVERHGGEFPDDEEELAALPGVGP 123
Query: 126 YTAGALASICYNIPTPAVDGNVFRIYGRLFEIEDDISKGKTKVVYETLVSKTMSQTN-AR 184
YTAGA+ S +N P P +DGNV R+ RLF I DI K KTK L + ++
Sbjct: 124 YTAGAILSFAFNQPEPVLDGNVKRVLSRLFAISGDIGKPKTKKELWELAEQLLTPDRRPG 183
Query: 185 EFNQALMDLGATVCLFKNPKCKECPLSRFCSAYKNNTIENFPVKLGKTKVKDVFLLTVVV 244
+FNQA+MDLGAT+C K PKC CPL C+AY+N T E +PVK K K+ F +++
Sbjct: 184 DFNQAMMDLGATICTAKKPKCSLCPLRDNCAAYRNGTPEKYPVKKPKKKLPRRFAAFLIL 243
Query: 245 KTDTNKYLIQKRPTTGLLSNFYMFLSFESDTPYTSQVEFLSENLPFKVNLNEKCLGNVKH 304
D + L++KRP GLL + F FE + + L + LG +H
Sbjct: 244 NRD-GEVLLEKRPEKGLLGGLWCFPQFEDEADLLDWL--------AADGLAAEPLGAFRH 294
Query: 305 VFSHLKWNMDVYSGTAKEKTIPANKTYKLITETQMKKYAFPVPYQKVWKLF 355
F+H + +++ + + + ++ P P +K+ +
Sbjct: 295 TFTHFRLTIELRA-----SASLVLSDGRWYNLSDLESIGLPAPVKKLLQQL 340
>gnl|CDD|130156 TIGR01084, mutY, A/G-specific adenine glycosylase. This equivalog
model identifies mutY members of the pfam00730
superfamily (HhH-GPD: Helix-hairpin-helix and Gly/Pro
rich loop followed by a conserved aspartate). The major
members of the superfamily are nth and mutY [DNA
metabolism, DNA replication, recombination, and repair].
Length = 275
Score = 315 bits (809), Expect = e-107
Identities = 119/263 (45%), Positives = 164/263 (62%), Gaps = 1/263 (0%)
Query: 12 AFQESILTWYKQNARQ-LPWRESNNPYYVWISEVMLQQTQVKTVLPYYEKFIKTYPTIKD 70
F E +L+WY + R+ LPWR++ PY VW+SEVMLQQTQV TV+PY+E+F++ +PT++
Sbjct: 1 QFSEDLLSWYDKYGRKTLPWRQNKTPYRVWLSEVMLQQTQVATVIPYFERFLERFPTVQA 60
Query: 71 FAFDTEDNVLKMWEGLGYYSRVRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGA 130
A +D VLK+WEGLGYY+R RN ++V+E+FGGE P+D + L ++ GVG YTAGA
Sbjct: 61 LANAPQDEVLKLWEGLGYYARARNLHKAAQEVVEEFGGEFPQDFEDLAALPGVGRYTAGA 120
Query: 131 LASICYNIPTPAVDGNVFRIYGRLFEIEDDISKGKTKVVYETLVSKTMSQTNAREFNQAL 190
+ S N P P +DGNV R+ RLF +E K K + TL + + + FNQAL
Sbjct: 121 ILSFALNKPYPILDGNVKRVLSRLFAVEGWPGKKKVENRLWTLAESLLPKADPEAFNQAL 180
Query: 191 MDLGATVCLFKNPKCKECPLSRFCSAYKNNTIENFPVKLGKTKVKDVFLLTVVVKTDTNK 250
MDLGA +C K PKC CPL FC AY+ T E +PVK K + +V++ +
Sbjct: 181 MDLGAMICTRKKPKCDLCPLQDFCLAYQQGTWEEYPVKKPKAAPPERTTYFLVLQNYDGE 240
Query: 251 YLIQKRPTTGLLSNFYMFLSFES 273
L+++RP GL Y F FE
Sbjct: 241 VLLEQRPEKGLWGGLYCFPQFED 263
>gnl|CDD|182805 PRK10880, PRK10880, adenine DNA glycosylase; Provisional.
Length = 350
Score = 212 bits (542), Expect = 4e-66
Identities = 105/268 (39%), Positives = 145/268 (54%), Gaps = 14/268 (5%)
Query: 13 FQESILTWYKQNARQ-LPWRESNNPYYVWISEVMLQQTQVKTVLPYYEKFIKTYPTIKDF 71
F +L WY + R+ LPW+ PY VW+SEVMLQQTQV TV+PY+E+F+ +PT+ D
Sbjct: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65
Query: 72 AFDTEDNVLKMWEGLGYYSRVRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGAL 131
A D VL +W GLGYY+R RN +QV GGE P +++ ++ GVG TAGA+
Sbjct: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGEFPETFEEVAALPGVGRSTAGAI 125
Query: 132 ASICYNIPTPAVDGNVFRIYGRLFEIEDDISKGKTKVVYETLVSKTMSQTNARE---FNQ 188
S+ P +DGNV R+ R + + GK K V L + T A FNQ
Sbjct: 126 LSLSLGKHFPILDGNVKRVLARCYAVSG--WPGK-KEVENRLWQLSEQVTPAVGVERFNQ 182
Query: 189 ALMDLGATVCLFKNPKCKECPLSRFCSAYKNNTIENFPVKLGKTKVKD---VFLLTVVVK 245
A+MDLGA VC PKC+ CPL C AY N++ +P K K + + FLL
Sbjct: 183 AMMDLGAMVCTRSKPKCELCPLQNGCIAYANHSWALYPGKKPKQTLPERTGYFLLL---- 238
Query: 246 TDTNKYLIQKRPTTGLLSNFYMFLSFES 273
++ +++RP +GL + F F
Sbjct: 239 QHGDEVWLEQRPPSGLWGGLFCFPQFAD 266
>gnl|CDD|238013 cd00056, ENDO3c, endonuclease III; includes endonuclease III
(DNA-(apurinic or apyrimidinic site) lyase), alkylbase
DNA glycosidases (Alka-family) and other DNA
glycosidases.
Length = 158
Score = 150 bits (380), Expect = 2e-44
Identities = 53/162 (32%), Positives = 81/162 (50%), Gaps = 8/162 (4%)
Query: 37 YYVWISEVMLQQTQVKTVLPYYEKFIKTY-PTIKDFAFDTEDNVLKMWEGLGYYSRVRNF 95
+ V +SE++ QQT K V YE+ + Y PT + A E+ + ++ LGY + +
Sbjct: 1 FEVLVSEILSQQTTDKAVNKAYERLFERYGPTPEALAAADEEELRELIRSLGYRRKAKYL 60
Query: 96 QAGCRQVIEQFGGEV---PRDKKQLLSIKGVGDYTAGALASICYNIPTPAVDGNVFRIYG 152
+ R ++E FGG V P +++LL++ GVG TA + VD +V R+
Sbjct: 61 KELARAIVEGFGGLVLDDPDAREELLALPGVGRKTANVVLLFALGPDAFPVDTHVRRVLK 120
Query: 153 RLFEIEDDISKGKTKVVYETLVSKTMSQTNAREFNQALMDLG 194
RL I K KT E L+ + + + E NQALMDLG
Sbjct: 121 RLG----LIPKKKTPEELEELLEELLPKPYWGEANQALMDLG 158
>gnl|CDD|214684 smart00478, ENDO3c, endonuclease III. includes endonuclease III
(DNA-(apurinic or apyrimidinic site) lyase), alkylbase
DNA glycosidases (Alka-family) and other DNA
glycosidases.
Length = 149
Score = 131 bits (332), Expect = 2e-37
Identities = 49/153 (32%), Positives = 80/153 (52%), Gaps = 5/153 (3%)
Query: 45 MLQQTQVKTVLPYYEKFIKTYPTIKDFAFDTEDNVLKMWEGLG-YYSRVRNFQAGCRQVI 103
+ QQT + V E+ + +PT +D A E+ + ++ GLG Y + R R ++
Sbjct: 1 LSQQTTDERVNKATERLFEKFPTPEDLAAADEEELEELIRGLGFYRRKARYLIELARILV 60
Query: 104 EQFGGEVPRDKKQLLSIKGVGDYTAGALASICYNIPTPAVDGNVFRIYGRLFEIEDDISK 163
E++GGEVP D+++LL + GVG TA A+ S P VD +V RI RL ++ +
Sbjct: 61 EEYGGEVPDDREELLKLPGVGRKTANAVLSFALGKPFIPVDTHVLRIAKRLGLVDKKSTP 120
Query: 164 GKTKVVYETLVSKTMSQTNAREFNQALMDLGAT 196
+ + + E L+ + + RE N L+D G T
Sbjct: 121 EEVEKLLEKLLPE----EDWRELNLLLIDFGRT 149
>gnl|CDD|172427 PRK13910, PRK13910, DNA glycosylase MutY; Provisional.
Length = 289
Score = 126 bits (317), Expect = 7e-34
Identities = 85/285 (29%), Positives = 142/285 (49%), Gaps = 28/285 (9%)
Query: 45 MLQQTQVKTVLP-YYEKFIKTYPTIKDFAFDTEDNVLKMWEGLGYYSRVRNFQAGCRQVI 103
M QQTQ+ TV+ +Y F++ +PT+KD A + VL +W GLGYYSR +N + +
Sbjct: 1 MSQQTQINTVVERFYSPFLEAFPTLKDLANAPLEEVLLLWRGLGYYSRAKNLKKSAEICV 60
Query: 104 EQFGGEVPRDKKQLLSIKGVGDYTAGALASICYNIPTPAVDGNVFRIYGRLFEIEDDISK 163
++ ++P D + LL + G+G YTA A+ + + VD N+ R+ RLF ++ +I
Sbjct: 61 KEHHSQLPNDYQSLLKLPGIGAYTANAILCFGFREKSACVDANIKRVLLRLFGLDPNIHA 120
Query: 164 GKTKVVYETLVSKTMSQTNAREFNQALMDLGATVCLFKNPKCKECPLSRFCSAYKNNTIE 223
++ ++ S + NQAL+DLGA +C K PKC CPL+ +C N E
Sbjct: 121 KDLQIKANDFLNLNESFNH----NQALIDLGALICSPK-PKCAICPLNPYCLG--KNNPE 173
Query: 224 NFPVKLGKTKVKDVFLLTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDTPYTSQVEF 283
+K + V++ L VV++ N+ ++K + Y+ + +
Sbjct: 174 KHTLKKKQEIVQEERYLGVVIQ--NNQIALEK-----IEQKLYLGMHHFPN--------- 217
Query: 284 LSENLPFKVNLNEKCLGNVKHVFSHLKWNMDVYSGTAKEKTIPAN 328
L ENL +K+ LG +KH + K N+++Y K+ P
Sbjct: 218 LKENLEYKLPF----LGAIKHSHTKFKLNLNLYLAAIKDLKNPIR 258
>gnl|CDD|223255 COG0177, Nth, Predicted EndoIII-related endonuclease [DNA
replication, recombination, and repair].
Length = 211
Score = 121 bits (306), Expect = 7e-33
Identities = 56/206 (27%), Positives = 101/206 (49%), Gaps = 7/206 (3%)
Query: 15 ESILTWY-KQNARQLPWRESNNPYYVWISEVMLQQTQVKTVLPYYEKFIKTYPTIKDFAF 73
IL + + +P+ + ++ ++ QT + V K YPT +D
Sbjct: 7 LEILDRLRELYPEPKTELDFKDPFELLVAVILSAQTTDEVVNKATPALFKRYPTPEDLLN 66
Query: 74 DTEDNVLKMWEGLGYY-SRVRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGALA 132
E+ + ++ + +G Y ++ +N + R ++E+FGGEVP +++LLS+ GVG TA +
Sbjct: 67 ADEEELEELIKSIGLYRNKAKNIKELARILLEKFGGEVPDTREELLSLPGVGRKTANVVL 126
Query: 133 SICYNIPTPAVDGNVFRIYGRLFEIEDDISKGKTKVVYETLVSKTMSQTNAREFNQALMD 192
S + IP AVD +V R+ RL + GKT E + K + + + + L+
Sbjct: 127 SFAFGIPAIAVDTHVHRVSNRL-----GLVPGKTPEEVEEALMKLIPKELWTDLHHWLIL 181
Query: 193 LGATVCLFKNPKCKECPLSRFCSAYK 218
G +C + P+C+ECPL+ C +
Sbjct: 182 HGRYICKARKPRCEECPLADLCPSAG 207
>gnl|CDD|216088 pfam00730, HhH-GPD, HhH-GPD superfamily base excision DNA repair
protein. This family contains a diverse range of
structurally related DNA repair proteins. The
superfamily is called the HhH-GPD family after its
hallmark Helix-hairpin-helix and Gly/Pro rich loop
followed by a conserved aspartate. This includes
endonuclease III, EC:4.2.99.18 and MutY an A/G-specific
adenine glycosylase, both have a C terminal 4Fe-4S
cluster. The family also includes 8-oxoguanine DNA
glycosylases. The methyl-CPG binding protein MBD4 also
contains a related domain that is a thymine DNA
glycosylase. The family also includes
DNA-3-methyladenine glycosylase II EC:3.2.2.21 and other
members of the AlkA family.
Length = 144
Score = 108 bits (271), Expect = 1e-28
Identities = 39/135 (28%), Positives = 68/135 (50%), Gaps = 8/135 (5%)
Query: 41 ISEVMLQQTQVKTVLPYYEKFIKTY--PTIKDFAFDTEDNVLKMWEGLGYY-SRVRNFQA 97
+S ++ QQT K ++ + Y PT +D A E+ + ++ +GLG+Y + + +
Sbjct: 1 VSAILSQQTSDKAANKITKRLFERYGFPTPEDLAEADEEELRELIKGLGFYRRKAKYIKE 60
Query: 98 GCRQVIEQFGGEVPRDKK---QLLSIKGVGDYTAGALASIC--YNIPTPAVDGNVFRIYG 152
R ++E + G VP D + LL++ GVG +TA A+ PAVD +V R+
Sbjct: 61 LARILVEGYLGLVPLDLEELEALLALPGVGRWTAEAVLLFALGRPDVFPAVDTHVRRVAK 120
Query: 153 RLFEIEDDISKGKTK 167
RL I+ K + +
Sbjct: 121 RLGLIDTKPPKKEVE 135
>gnl|CDD|130155 TIGR01083, nth, endonuclease III. This equivalog model identifes
nth members of the pfam00730 superfamily (HhH-GPD:
Helix-hairpin-helix and Gly/Pro rich loop followed by a
conserved aspartate). The major members of the
superfamily are nth and mutY [DNA metabolism, DNA
replication, recombination, and repair].
Length = 191
Score = 100 bits (250), Expect = 4e-25
Identities = 51/173 (29%), Positives = 92/173 (53%), Gaps = 6/173 (3%)
Query: 34 NNPYYVWISEVMLQQTQVKTVLPYYEKFIKTYPTIKDFAFDTEDNVLKMWEGLGYY-SRV 92
NNP+ + ++ ++ Q K+V +K + YPT + A + + + + +G Y ++
Sbjct: 24 NNPFELLVATILSAQATDKSVNKATKKLFEVYPTPQALAQAGLEELEEYIKSIGLYRNKA 83
Query: 93 RNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGALASICYNIPTPAVDGNVFRIYG 152
+N A CR ++E++GGEVP D+++L+ + GVG TA + ++ + IP AVD +VFR+
Sbjct: 84 KNIIALCRILVERYGGEVPEDREELVKLPGVGRKTANVVLNVAFGIPAIAVDTHVFRVSN 143
Query: 153 RLFEIEDDISKGKTKVVYETLVSKTMSQTNAREFNQALMDLGATVCLFKNPKC 205
RL +SKGK E + K + + + + L+ G C + P C
Sbjct: 144 RL-----GLSKGKDPDKVEEELLKLIPREFWTKLHHWLILHGRYTCKARKPLC 191
>gnl|CDD|239523 cd03431, DNA_Glycosylase_C, DNA glycosylase (MutY in bacteria and
hMYH in humans) is responsible for repairing misread
A*oxoG residues to C*G by removing the inappropriately
paired adenine base from the DNA backbone. It belongs to
the Nudix hydrolase superfamily and is important for the
repair of various genotoxic lesions. Enzymes belonging
to this superfamily requires a divalent cation, such as
Mg2+ or Mn2+ for their activity. They are also
recognized by a highly conserved 23-residue nudix motif
(GX5EX7REUXEEXGU, where U = I, L or V). However, DNA
glycosylase does not seem to contain this signature
motif. DNA glycosylase consists of 2 domains: the
N-terminal domain contains the catalytic properties of
the enzyme and the C-terminal domain affects substrate
(oxoG) binding and enzymatic turnover. The C-terminal
domain is highly similar to MutT, based on secondary
structure and topology, despite low sequence identity.
MutT sanitizes the nucleotide precursor pool by
hydrolyzing oxo-dGTP to oxo-dGMO and inorganic
pyrophosphate. The similarity strongly suggests that the
two proteins share a common evolutionary origin.
Length = 118
Score = 96.6 bits (241), Expect = 1e-24
Identities = 33/121 (27%), Positives = 59/121 (48%), Gaps = 3/121 (2%)
Query: 235 KDVFLLTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDTPYTSQVEFLSENLPFKVNL 294
K + VV++ D + L++KRP GLL+ + F S E + + E L L + L
Sbjct: 1 KRRGIAVVVIRND-GRVLLEKRPEKGLLAGLWEFPSVEWEEEADGE-EALLSALKKALRL 58
Query: 295 NEKCLGNVKHVFSHLKWNMDVYSGTAKEKTIPANKTYKLITETQMKKYAFPVPYQKVWKL 354
+ + LG VKH F+H + + VY + + A + + ++ +YA P +K+ +L
Sbjct: 59 SLEPLGTVKHTFTHFRLTLHVYLARLEG-DLLAPDEGRWVPLEELDEYALPTVMRKILEL 117
Query: 355 F 355
Sbjct: 118 L 118
>gnl|CDD|182661 PRK10702, PRK10702, endonuclease III; Provisional.
Length = 211
Score = 60.8 bits (147), Expect = 8e-11
Identities = 37/127 (29%), Positives = 59/127 (46%), Gaps = 5/127 (3%)
Query: 88 YYSRVRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGALASICYNIPTPAVDGNV 147
Y S+ N CR ++EQ GEVP D+ L ++ GVG TA + + + PT AVD ++
Sbjct: 82 YNSKAENVIKTCRILLEQHNGEVPEDRAALEALPGVGRKTANVVLNTAFGWPTIAVDTHI 141
Query: 148 FRIYGRLFEIEDDISKGKTKVVYETLVSKTMSQTNAREFNQALMDLGATVCLFKNPKCKE 207
FR+ R + GK E + K + + + L+ G C+ + P+C
Sbjct: 142 FRVCNR-----TQFAPGKNVEQVEEKLLKVVPAEFKVDCHHWLILHGRYTCIARKPRCGS 196
Query: 208 CPLSRFC 214
C + C
Sbjct: 197 CIIEDLC 203
>gnl|CDD|109681 pfam00633, HHH, Helix-hairpin-helix motif. The helix-hairpin-helix
DNA-binding motif is found to be duplicated in the
central domain of RuvA. The HhH domain of DisA, a
bacterial checkpoint control protein, is a DNA-binding
domain.
Length = 30
Score = 38.2 bits (90), Expect = 1e-04
Identities = 11/29 (37%), Positives = 18/29 (62%)
Query: 106 FGGEVPRDKKQLLSIKGVGDYTAGALASI 134
G +P +++LL++ GVG TA A+ S
Sbjct: 2 LEGLIPASREELLALPGVGPKTAEAILSY 30
>gnl|CDD|197771 smart00525, FES, iron-sulpphur binding domain in DNA-(apurinic or
apyrimidinic site) lyase (subfamily of ENDO3).
Length = 21
Score = 35.6 bits (83), Expect = 0.001
Identities = 10/21 (47%), Positives = 13/21 (61%)
Query: 197 VCLFKNPKCKECPLSRFCSAY 217
+C + P+C ECPL C AY
Sbjct: 1 ICTARKPRCDECPLKDLCPAY 21
>gnl|CDD|225140 COG2231, COG2231, Uncharacterized protein related to Endonuclease
III [DNA replication, recombination, and repair].
Length = 215
Score = 38.9 bits (91), Expect = 0.002
Identities = 32/104 (30%), Positives = 52/104 (50%), Gaps = 4/104 (3%)
Query: 114 KKQLLSIKGVGDYTAGALASICYNIPTPAVDGNVFRIYGRLFEIEDDISKGKTKVVYETL 173
+++LLSIKG+G TA ++ + P VD R+ RL IE+ + K ++E
Sbjct: 114 REELLSIKGIGKETADSILLYALDRPVFVVDKYTRRLLSRLGGIEEK-KYDEIKELFEEN 172
Query: 174 VSKTMSQTNAREFNQALMDLGATVCLFKNPKCKECPLSRFCSAY 217
+ + + +EF+ AL+ A K P C++CPL C Y
Sbjct: 173 LPE--NLRLYQEFH-ALIVEHAKHFCKKKPLCEKCPLKEKCKKY 213
>gnl|CDD|204516 pfam10576, EndIII_4Fe-2S, Iron-sulfur binding domain of
endonuclease III. Escherichia coli endonuclease III (EC
4.2.99.18) is a DNA repair enzyme that acts both as a
DNA N-glycosylase, removing oxidized pyrimidines from
DNA, and as an apurinic/apyrimidinic (AP) endonuclease,
introducing a single-strand nick at the site from which
the damaged base was removed. Endonuclease III is an
iron-sulfur protein that binds a single 4Fe-4S cluster.
The 4Fe-4S cluster does not seem to be important for
catalytic activity, but is probably involved in the
proper positioning of the enzyme along the DNA strand.
The 4Fe-4S cluster is bound by four cysteines which are
all located in a 17 amino acid region at the C-terminal
end of endonuclease III. A similar region is also
present in the central section of mutY and in the
C-terminus of ORF-10 and of the Micro-coccus UV
endonuclease.
Length = 17
Score = 31.6 bits (73), Expect = 0.029
Identities = 8/14 (57%), Positives = 11/14 (78%)
Query: 201 KNPKCKECPLSRFC 214
+ PKC+ECPL+ C
Sbjct: 4 RKPKCEECPLADLC 17
>gnl|CDD|151922 pfam11485, DUF3211, Protein of unknown function (DUF3211). This
archaeal family of proteins has no known function.
Length = 136
Score = 30.0 bits (68), Expect = 0.80
Identities = 15/46 (32%), Positives = 20/46 (43%), Gaps = 4/46 (8%)
Query: 143 VDGNVFRIYGRL----FEIEDDISKGKTKVVYETLVSKTMSQTNAR 184
V+GN FR GR FEI+ + +V Y VS N +
Sbjct: 39 VEGNSFRAEGRFGAFSFEIKGRVYVSNNEVTYIFYVSGGGPNGNGK 84
>gnl|CDD|237519 PRK13812, PRK13812, orotate phosphoribosyltransferase; Provisional.
Length = 176
Score = 28.6 bits (64), Expect = 3.3
Identities = 17/47 (36%), Positives = 24/47 (51%), Gaps = 3/47 (6%)
Query: 151 YGRLFEIEDDISKGKTKVVYETLVSKTMSQTNAREFNQALMDLGATV 197
YG IE + +G+ VV E + + S +A E AL + GATV
Sbjct: 94 YGTGNRIEGRLDEGEEVVVLEDIATTGQSAVDAVE---ALREAGATV 137
>gnl|CDD|140318 PTZ00297, PTZ00297, pantothenate kinase; Provisional.
Length = 1452
Score = 29.1 bits (65), Expect = 4.9
Identities = 15/47 (31%), Positives = 20/47 (42%), Gaps = 3/47 (6%)
Query: 210 LSRFCSAYKNNTIENFPVKLGKTKVKDVFLLTVVVKTDTNKYLIQKR 256
L R C +++ IE F L DV LL V Y +QK+
Sbjct: 21 LPRGCGGFQHERIETF---LASVDAYDVVLLQEVYAASVLPYFLQKQ 64
>gnl|CDD|149881 pfam08950, DUF1861, Protein of unknown function (DUF1861). This
hypothetical protein, found in bacteria and in the
eukaryote Leishmania, has no known function.
Length = 298
Score = 28.4 bits (64), Expect = 5.1
Identities = 13/40 (32%), Positives = 15/40 (37%), Gaps = 7/40 (17%)
Query: 114 KKQLLSIKGVGDYTAGALASICYNIPTPAVDGNVFRIYGR 153
+ LL+ KGVG Y YN P I GR
Sbjct: 12 ESVLLTFKGVGGYDV-------YNCSAPFTYKGKTHIAGR 44
>gnl|CDD|218638 pfam05559, DUF763, Protein of unknown function (DUF763). This
family consists of several uncharacterized bacterial and
archaeal proteins of unknown function.
Length = 319
Score = 28.4 bits (64), Expect = 5.7
Identities = 14/33 (42%), Positives = 18/33 (54%), Gaps = 3/33 (9%)
Query: 111 PRDKKQLLSIKGVGDYTAGAL---ASICYNIPT 140
P D ++LL +KGVG T AL A + Y P
Sbjct: 265 PEDFEELLLLKGVGPSTLRALALVAEVIYGTPP 297
>gnl|CDD|215183 PLN02323, PLN02323, probable fructokinase.
Length = 330
Score = 28.4 bits (64), Expect = 6.3
Identities = 23/92 (25%), Positives = 35/92 (38%), Gaps = 28/92 (30%)
Query: 76 EDNVLKMW----------EGLGYYSRVRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGD 125
+D V+K+W EG + GCR + F G V K + + G GD
Sbjct: 219 DDTVVKLWHPNLKLLLVTEG----------EEGCRYYTKDFKGRVEGFKVKAVDTTGAGD 268
Query: 126 -YTAGALASICYNIPTPAVDGNVFRIYGRLFE 156
+ G L+ + A D ++ RL E
Sbjct: 269 AFVGGLLSQL-------AKDLSLLEDEERLRE 293
>gnl|CDD|215922 pfam00443, UCH, Ubiquitin carboxyl-terminal hydrolase.
Length = 313
Score = 28.1 bits (63), Expect = 7.1
Identities = 22/160 (13%), Positives = 46/160 (28%), Gaps = 34/160 (21%)
Query: 177 TMSQTNAREFNQALMD-----LGATVCLFKNPKCKECPLSRFCSAYKNNTIENFPVKLGK 231
Q +A EF L+D L + + E +++ + ++ K
Sbjct: 88 GYMQQDAHEFLLFLLDQLHEDLNSLKKRKSHAAENESLITKLFQGQLESRLKCLKCKKES 147
Query: 232 TKVKDVFLLTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDTPYTSQVEFLSENLPFK 291
+ + L++ ++ ++ + T ++E L + +
Sbjct: 148 STPEPFSDLSLPIEDSSSV--------------------LKLLTGNFLKLEELEKEEKYY 187
Query: 292 V-NLNEKCLGNVKHVFS--------HLKWNMDVYSGTAKE 322
K K S HLK D T K+
Sbjct: 188 CPKCGGKQNAIKKLDISRLPPVLIIHLKRFEDFNRETEKK 227
>gnl|CDD|224333 COG1415, COG1415, Uncharacterized conserved protein [Function
unknown].
Length = 373
Score = 28.1 bits (63), Expect = 7.2
Identities = 19/72 (26%), Positives = 29/72 (40%), Gaps = 19/72 (26%)
Query: 111 PRDKKQLLSIKGVGDYTAGAL---ASICYNIPT----PAV--------DGNVFRI----Y 151
P D ++LL + G+G T AL A + Y P P DG F + Y
Sbjct: 274 PDDFEELLLVPGIGPSTVRALALVAEVIYGEPPSFRDPVKYAFAVGGKDGVPFPVNRKTY 333
Query: 152 GRLFEIEDDISK 163
L E +++ +
Sbjct: 334 DELIEFLEELVE 345
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.134 0.401
Gapped
Lambda K H
0.267 0.0806 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 17,885,499
Number of extensions: 1675941
Number of successful extensions: 1450
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1420
Number of HSP's successfully gapped: 33
Length of query: 363
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 265
Effective length of database: 6,590,910
Effective search space: 1746591150
Effective search space used: 1746591150
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 60 (26.7 bits)