RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy9547
         (363 letters)



>gnl|CDD|224115 COG1194, MutY, A/G-specific DNA glycosylase [DNA replication,
           recombination, and repair].
          Length = 342

 Score =  386 bits (994), Expect = e-134
 Identities = 142/351 (40%), Positives = 202/351 (57%), Gaps = 15/351 (4%)

Query: 6   SAKEILAFQESILTWYKQNARQLPWRESNNPYYVWISEVMLQQTQVKTVLPYYEKFIKTY 65
           S  +I  FQE++L WY +N R LPWRE+ +PY VW+SE+MLQQTQV TV+PYYE+F++ +
Sbjct: 4   SDGDIEKFQEALLDWYDKNGRDLPWRETKDPYRVWVSEIMLQQTQVATVIPYYERFLERF 63

Query: 66  PTIKDFAFDTEDNVLKMWEGLGYYSRVRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGD 125
           PTIK  A   ED VLK WEGLGYYSR RN     ++V+E+ GGE P D+++L ++ GVG 
Sbjct: 64  PTIKALAAAPEDEVLKAWEGLGYYSRARNLHKAAQEVVERHGGEFPDDEEELAALPGVGP 123

Query: 126 YTAGALASICYNIPTPAVDGNVFRIYGRLFEIEDDISKGKTKVVYETLVSKTMSQTN-AR 184
           YTAGA+ S  +N P P +DGNV R+  RLF I  DI K KTK     L  + ++      
Sbjct: 124 YTAGAILSFAFNQPEPVLDGNVKRVLSRLFAISGDIGKPKTKKELWELAEQLLTPDRRPG 183

Query: 185 EFNQALMDLGATVCLFKNPKCKECPLSRFCSAYKNNTIENFPVKLGKTKVKDVFLLTVVV 244
           +FNQA+MDLGAT+C  K PKC  CPL   C+AY+N T E +PVK  K K+   F   +++
Sbjct: 184 DFNQAMMDLGATICTAKKPKCSLCPLRDNCAAYRNGTPEKYPVKKPKKKLPRRFAAFLIL 243

Query: 245 KTDTNKYLIQKRPTTGLLSNFYMFLSFESDTPYTSQVEFLSENLPFKVNLNEKCLGNVKH 304
             D  + L++KRP  GLL   + F  FE +      +            L  + LG  +H
Sbjct: 244 NRD-GEVLLEKRPEKGLLGGLWCFPQFEDEADLLDWL--------AADGLAAEPLGAFRH 294

Query: 305 VFSHLKWNMDVYSGTAKEKTIPANKTYKLITETQMKKYAFPVPYQKVWKLF 355
            F+H +  +++ +              +    + ++    P P +K+ +  
Sbjct: 295 TFTHFRLTIELRA-----SASLVLSDGRWYNLSDLESIGLPAPVKKLLQQL 340


>gnl|CDD|130156 TIGR01084, mutY, A/G-specific adenine glycosylase.  This equivalog
           model identifies mutY members of the pfam00730
           superfamily (HhH-GPD: Helix-hairpin-helix and Gly/Pro
           rich loop followed by a conserved aspartate). The major
           members of the superfamily are nth and mutY [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 275

 Score =  315 bits (809), Expect = e-107
 Identities = 119/263 (45%), Positives = 164/263 (62%), Gaps = 1/263 (0%)

Query: 12  AFQESILTWYKQNARQ-LPWRESNNPYYVWISEVMLQQTQVKTVLPYYEKFIKTYPTIKD 70
            F E +L+WY +  R+ LPWR++  PY VW+SEVMLQQTQV TV+PY+E+F++ +PT++ 
Sbjct: 1   QFSEDLLSWYDKYGRKTLPWRQNKTPYRVWLSEVMLQQTQVATVIPYFERFLERFPTVQA 60

Query: 71  FAFDTEDNVLKMWEGLGYYSRVRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGA 130
            A   +D VLK+WEGLGYY+R RN     ++V+E+FGGE P+D + L ++ GVG YTAGA
Sbjct: 61  LANAPQDEVLKLWEGLGYYARARNLHKAAQEVVEEFGGEFPQDFEDLAALPGVGRYTAGA 120

Query: 131 LASICYNIPTPAVDGNVFRIYGRLFEIEDDISKGKTKVVYETLVSKTMSQTNAREFNQAL 190
           + S   N P P +DGNV R+  RLF +E    K K +    TL    + + +   FNQAL
Sbjct: 121 ILSFALNKPYPILDGNVKRVLSRLFAVEGWPGKKKVENRLWTLAESLLPKADPEAFNQAL 180

Query: 191 MDLGATVCLFKNPKCKECPLSRFCSAYKNNTIENFPVKLGKTKVKDVFLLTVVVKTDTNK 250
           MDLGA +C  K PKC  CPL  FC AY+  T E +PVK  K    +     +V++    +
Sbjct: 181 MDLGAMICTRKKPKCDLCPLQDFCLAYQQGTWEEYPVKKPKAAPPERTTYFLVLQNYDGE 240

Query: 251 YLIQKRPTTGLLSNFYMFLSFES 273
            L+++RP  GL    Y F  FE 
Sbjct: 241 VLLEQRPEKGLWGGLYCFPQFED 263


>gnl|CDD|182805 PRK10880, PRK10880, adenine DNA glycosylase; Provisional.
          Length = 350

 Score =  212 bits (542), Expect = 4e-66
 Identities = 105/268 (39%), Positives = 145/268 (54%), Gaps = 14/268 (5%)

Query: 13  FQESILTWYKQNARQ-LPWRESNNPYYVWISEVMLQQTQVKTVLPYYEKFIKTYPTIKDF 71
           F   +L WY +  R+ LPW+    PY VW+SEVMLQQTQV TV+PY+E+F+  +PT+ D 
Sbjct: 6   FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65

Query: 72  AFDTEDNVLKMWEGLGYYSRVRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGAL 131
           A    D VL +W GLGYY+R RN     +QV    GGE P   +++ ++ GVG  TAGA+
Sbjct: 66  ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGEFPETFEEVAALPGVGRSTAGAI 125

Query: 132 ASICYNIPTPAVDGNVFRIYGRLFEIEDDISKGKTKVVYETLVSKTMSQTNARE---FNQ 188
            S+      P +DGNV R+  R + +      GK K V   L   +   T A     FNQ
Sbjct: 126 LSLSLGKHFPILDGNVKRVLARCYAVSG--WPGK-KEVENRLWQLSEQVTPAVGVERFNQ 182

Query: 189 ALMDLGATVCLFKNPKCKECPLSRFCSAYKNNTIENFPVKLGKTKVKD---VFLLTVVVK 245
           A+MDLGA VC    PKC+ CPL   C AY N++   +P K  K  + +    FLL     
Sbjct: 183 AMMDLGAMVCTRSKPKCELCPLQNGCIAYANHSWALYPGKKPKQTLPERTGYFLLL---- 238

Query: 246 TDTNKYLIQKRPTTGLLSNFYMFLSFES 273
              ++  +++RP +GL    + F  F  
Sbjct: 239 QHGDEVWLEQRPPSGLWGGLFCFPQFAD 266


>gnl|CDD|238013 cd00056, ENDO3c, endonuclease III; includes endonuclease III
           (DNA-(apurinic or apyrimidinic site) lyase), alkylbase
           DNA glycosidases (Alka-family) and other DNA
           glycosidases.
          Length = 158

 Score =  150 bits (380), Expect = 2e-44
 Identities = 53/162 (32%), Positives = 81/162 (50%), Gaps = 8/162 (4%)

Query: 37  YYVWISEVMLQQTQVKTVLPYYEKFIKTY-PTIKDFAFDTEDNVLKMWEGLGYYSRVRNF 95
           + V +SE++ QQT  K V   YE+  + Y PT +  A   E+ + ++   LGY  + +  
Sbjct: 1   FEVLVSEILSQQTTDKAVNKAYERLFERYGPTPEALAAADEEELRELIRSLGYRRKAKYL 60

Query: 96  QAGCRQVIEQFGGEV---PRDKKQLLSIKGVGDYTAGALASICYNIPTPAVDGNVFRIYG 152
           +   R ++E FGG V   P  +++LL++ GVG  TA  +           VD +V R+  
Sbjct: 61  KELARAIVEGFGGLVLDDPDAREELLALPGVGRKTANVVLLFALGPDAFPVDTHVRRVLK 120

Query: 153 RLFEIEDDISKGKTKVVYETLVSKTMSQTNAREFNQALMDLG 194
           RL      I K KT    E L+ + + +    E NQALMDLG
Sbjct: 121 RLG----LIPKKKTPEELEELLEELLPKPYWGEANQALMDLG 158


>gnl|CDD|214684 smart00478, ENDO3c, endonuclease III.  includes endonuclease III
           (DNA-(apurinic or apyrimidinic site) lyase), alkylbase
           DNA glycosidases (Alka-family) and other DNA
           glycosidases.
          Length = 149

 Score =  131 bits (332), Expect = 2e-37
 Identities = 49/153 (32%), Positives = 80/153 (52%), Gaps = 5/153 (3%)

Query: 45  MLQQTQVKTVLPYYEKFIKTYPTIKDFAFDTEDNVLKMWEGLG-YYSRVRNFQAGCRQVI 103
           + QQT  + V    E+  + +PT +D A   E+ + ++  GLG Y  + R      R ++
Sbjct: 1   LSQQTTDERVNKATERLFEKFPTPEDLAAADEEELEELIRGLGFYRRKARYLIELARILV 60

Query: 104 EQFGGEVPRDKKQLLSIKGVGDYTAGALASICYNIPTPAVDGNVFRIYGRLFEIEDDISK 163
           E++GGEVP D+++LL + GVG  TA A+ S     P   VD +V RI  RL  ++   + 
Sbjct: 61  EEYGGEVPDDREELLKLPGVGRKTANAVLSFALGKPFIPVDTHVLRIAKRLGLVDKKSTP 120

Query: 164 GKTKVVYETLVSKTMSQTNAREFNQALMDLGAT 196
            + + + E L+ +     + RE N  L+D G T
Sbjct: 121 EEVEKLLEKLLPE----EDWRELNLLLIDFGRT 149


>gnl|CDD|172427 PRK13910, PRK13910, DNA glycosylase MutY; Provisional.
          Length = 289

 Score =  126 bits (317), Expect = 7e-34
 Identities = 85/285 (29%), Positives = 142/285 (49%), Gaps = 28/285 (9%)

Query: 45  MLQQTQVKTVLP-YYEKFIKTYPTIKDFAFDTEDNVLKMWEGLGYYSRVRNFQAGCRQVI 103
           M QQTQ+ TV+  +Y  F++ +PT+KD A    + VL +W GLGYYSR +N +      +
Sbjct: 1   MSQQTQINTVVERFYSPFLEAFPTLKDLANAPLEEVLLLWRGLGYYSRAKNLKKSAEICV 60

Query: 104 EQFGGEVPRDKKQLLSIKGVGDYTAGALASICYNIPTPAVDGNVFRIYGRLFEIEDDISK 163
           ++   ++P D + LL + G+G YTA A+    +   +  VD N+ R+  RLF ++ +I  
Sbjct: 61  KEHHSQLPNDYQSLLKLPGIGAYTANAILCFGFREKSACVDANIKRVLLRLFGLDPNIHA 120

Query: 164 GKTKVVYETLVSKTMSQTNAREFNQALMDLGATVCLFKNPKCKECPLSRFCSAYKNNTIE 223
              ++     ++   S  +    NQAL+DLGA +C  K PKC  CPL+ +C     N  E
Sbjct: 121 KDLQIKANDFLNLNESFNH----NQALIDLGALICSPK-PKCAICPLNPYCLG--KNNPE 173

Query: 224 NFPVKLGKTKVKDVFLLTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDTPYTSQVEF 283
              +K  +  V++   L VV++   N+  ++K     +    Y+ +    +         
Sbjct: 174 KHTLKKKQEIVQEERYLGVVIQ--NNQIALEK-----IEQKLYLGMHHFPN--------- 217

Query: 284 LSENLPFKVNLNEKCLGNVKHVFSHLKWNMDVYSGTAKEKTIPAN 328
           L ENL +K+      LG +KH  +  K N+++Y    K+   P  
Sbjct: 218 LKENLEYKLPF----LGAIKHSHTKFKLNLNLYLAAIKDLKNPIR 258


>gnl|CDD|223255 COG0177, Nth, Predicted EndoIII-related endonuclease [DNA
           replication, recombination, and repair].
          Length = 211

 Score =  121 bits (306), Expect = 7e-33
 Identities = 56/206 (27%), Positives = 101/206 (49%), Gaps = 7/206 (3%)

Query: 15  ESILTWY-KQNARQLPWRESNNPYYVWISEVMLQQTQVKTVLPYYEKFIKTYPTIKDFAF 73
             IL    +         +  +P+ + ++ ++  QT  + V        K YPT +D   
Sbjct: 7   LEILDRLRELYPEPKTELDFKDPFELLVAVILSAQTTDEVVNKATPALFKRYPTPEDLLN 66

Query: 74  DTEDNVLKMWEGLGYY-SRVRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGALA 132
             E+ + ++ + +G Y ++ +N +   R ++E+FGGEVP  +++LLS+ GVG  TA  + 
Sbjct: 67  ADEEELEELIKSIGLYRNKAKNIKELARILLEKFGGEVPDTREELLSLPGVGRKTANVVL 126

Query: 133 SICYNIPTPAVDGNVFRIYGRLFEIEDDISKGKTKVVYETLVSKTMSQTNAREFNQALMD 192
           S  + IP  AVD +V R+  RL      +  GKT    E  + K + +    + +  L+ 
Sbjct: 127 SFAFGIPAIAVDTHVHRVSNRL-----GLVPGKTPEEVEEALMKLIPKELWTDLHHWLIL 181

Query: 193 LGATVCLFKNPKCKECPLSRFCSAYK 218
            G  +C  + P+C+ECPL+  C +  
Sbjct: 182 HGRYICKARKPRCEECPLADLCPSAG 207


>gnl|CDD|216088 pfam00730, HhH-GPD, HhH-GPD superfamily base excision DNA repair
           protein.  This family contains a diverse range of
           structurally related DNA repair proteins. The
           superfamily is called the HhH-GPD family after its
           hallmark Helix-hairpin-helix and Gly/Pro rich loop
           followed by a conserved aspartate. This includes
           endonuclease III, EC:4.2.99.18 and MutY an A/G-specific
           adenine glycosylase, both have a C terminal 4Fe-4S
           cluster. The family also includes 8-oxoguanine DNA
           glycosylases. The methyl-CPG binding protein MBD4 also
           contains a related domain that is a thymine DNA
           glycosylase. The family also includes
           DNA-3-methyladenine glycosylase II EC:3.2.2.21 and other
           members of the AlkA family.
          Length = 144

 Score =  108 bits (271), Expect = 1e-28
 Identities = 39/135 (28%), Positives = 68/135 (50%), Gaps = 8/135 (5%)

Query: 41  ISEVMLQQTQVKTVLPYYEKFIKTY--PTIKDFAFDTEDNVLKMWEGLGYY-SRVRNFQA 97
           +S ++ QQT  K      ++  + Y  PT +D A   E+ + ++ +GLG+Y  + +  + 
Sbjct: 1   VSAILSQQTSDKAANKITKRLFERYGFPTPEDLAEADEEELRELIKGLGFYRRKAKYIKE 60

Query: 98  GCRQVIEQFGGEVPRDKK---QLLSIKGVGDYTAGALASIC--YNIPTPAVDGNVFRIYG 152
             R ++E + G VP D +    LL++ GVG +TA A+           PAVD +V R+  
Sbjct: 61  LARILVEGYLGLVPLDLEELEALLALPGVGRWTAEAVLLFALGRPDVFPAVDTHVRRVAK 120

Query: 153 RLFEIEDDISKGKTK 167
           RL  I+    K + +
Sbjct: 121 RLGLIDTKPPKKEVE 135


>gnl|CDD|130155 TIGR01083, nth, endonuclease III.  This equivalog model identifes
           nth members of the pfam00730 superfamily (HhH-GPD:
           Helix-hairpin-helix and Gly/Pro rich loop followed by a
           conserved aspartate). The major members of the
           superfamily are nth and mutY [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 191

 Score =  100 bits (250), Expect = 4e-25
 Identities = 51/173 (29%), Positives = 92/173 (53%), Gaps = 6/173 (3%)

Query: 34  NNPYYVWISEVMLQQTQVKTVLPYYEKFIKTYPTIKDFAFDTEDNVLKMWEGLGYY-SRV 92
           NNP+ + ++ ++  Q   K+V    +K  + YPT +  A    + + +  + +G Y ++ 
Sbjct: 24  NNPFELLVATILSAQATDKSVNKATKKLFEVYPTPQALAQAGLEELEEYIKSIGLYRNKA 83

Query: 93  RNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGALASICYNIPTPAVDGNVFRIYG 152
           +N  A CR ++E++GGEVP D+++L+ + GVG  TA  + ++ + IP  AVD +VFR+  
Sbjct: 84  KNIIALCRILVERYGGEVPEDREELVKLPGVGRKTANVVLNVAFGIPAIAVDTHVFRVSN 143

Query: 153 RLFEIEDDISKGKTKVVYETLVSKTMSQTNAREFNQALMDLGATVCLFKNPKC 205
           RL      +SKGK     E  + K + +    + +  L+  G   C  + P C
Sbjct: 144 RL-----GLSKGKDPDKVEEELLKLIPREFWTKLHHWLILHGRYTCKARKPLC 191


>gnl|CDD|239523 cd03431, DNA_Glycosylase_C, DNA glycosylase (MutY in bacteria and
           hMYH in humans) is responsible for repairing misread
           A*oxoG residues to C*G by removing the inappropriately
           paired adenine base from the DNA backbone. It belongs to
           the Nudix hydrolase superfamily and is important for the
           repair of various genotoxic lesions. Enzymes belonging
           to this superfamily requires a divalent cation, such as
           Mg2+ or Mn2+ for their activity. They are also
           recognized by a highly conserved 23-residue nudix motif
           (GX5EX7REUXEEXGU, where U = I, L or V). However, DNA
           glycosylase does not seem to contain this signature
           motif. DNA glycosylase consists of 2 domains: the
           N-terminal domain contains the catalytic properties of
           the enzyme and the C-terminal domain affects substrate
           (oxoG) binding and enzymatic turnover. The C-terminal
           domain is highly similar to MutT, based on secondary
           structure and topology, despite low sequence identity.
           MutT sanitizes the nucleotide precursor pool by
           hydrolyzing oxo-dGTP to oxo-dGMO and inorganic
           pyrophosphate. The similarity strongly suggests that the
           two proteins share a common evolutionary origin.
          Length = 118

 Score = 96.6 bits (241), Expect = 1e-24
 Identities = 33/121 (27%), Positives = 59/121 (48%), Gaps = 3/121 (2%)

Query: 235 KDVFLLTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDTPYTSQVEFLSENLPFKVNL 294
           K   +  VV++ D  + L++KRP  GLL+  + F S E +     + E L   L   + L
Sbjct: 1   KRRGIAVVVIRND-GRVLLEKRPEKGLLAGLWEFPSVEWEEEADGE-EALLSALKKALRL 58

Query: 295 NEKCLGNVKHVFSHLKWNMDVYSGTAKEKTIPANKTYKLITETQMKKYAFPVPYQKVWKL 354
           + + LG VKH F+H +  + VY    +   + A    + +   ++ +YA P   +K+ +L
Sbjct: 59  SLEPLGTVKHTFTHFRLTLHVYLARLEG-DLLAPDEGRWVPLEELDEYALPTVMRKILEL 117

Query: 355 F 355
            
Sbjct: 118 L 118


>gnl|CDD|182661 PRK10702, PRK10702, endonuclease III; Provisional.
          Length = 211

 Score = 60.8 bits (147), Expect = 8e-11
 Identities = 37/127 (29%), Positives = 59/127 (46%), Gaps = 5/127 (3%)

Query: 88  YYSRVRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGDYTAGALASICYNIPTPAVDGNV 147
           Y S+  N    CR ++EQ  GEVP D+  L ++ GVG  TA  + +  +  PT AVD ++
Sbjct: 82  YNSKAENVIKTCRILLEQHNGEVPEDRAALEALPGVGRKTANVVLNTAFGWPTIAVDTHI 141

Query: 148 FRIYGRLFEIEDDISKGKTKVVYETLVSKTMSQTNAREFNQALMDLGATVCLFKNPKCKE 207
           FR+  R        + GK     E  + K +      + +  L+  G   C+ + P+C  
Sbjct: 142 FRVCNR-----TQFAPGKNVEQVEEKLLKVVPAEFKVDCHHWLILHGRYTCIARKPRCGS 196

Query: 208 CPLSRFC 214
           C +   C
Sbjct: 197 CIIEDLC 203


>gnl|CDD|109681 pfam00633, HHH, Helix-hairpin-helix motif.  The helix-hairpin-helix
           DNA-binding motif is found to be duplicated in the
           central domain of RuvA. The HhH domain of DisA, a
           bacterial checkpoint control protein, is a DNA-binding
           domain.
          Length = 30

 Score = 38.2 bits (90), Expect = 1e-04
 Identities = 11/29 (37%), Positives = 18/29 (62%)

Query: 106 FGGEVPRDKKQLLSIKGVGDYTAGALASI 134
             G +P  +++LL++ GVG  TA A+ S 
Sbjct: 2   LEGLIPASREELLALPGVGPKTAEAILSY 30


>gnl|CDD|197771 smart00525, FES, iron-sulpphur binding domain in DNA-(apurinic or
           apyrimidinic site) lyase (subfamily of ENDO3). 
          Length = 21

 Score = 35.6 bits (83), Expect = 0.001
 Identities = 10/21 (47%), Positives = 13/21 (61%)

Query: 197 VCLFKNPKCKECPLSRFCSAY 217
           +C  + P+C ECPL   C AY
Sbjct: 1   ICTARKPRCDECPLKDLCPAY 21


>gnl|CDD|225140 COG2231, COG2231, Uncharacterized protein related to Endonuclease
           III [DNA replication, recombination, and repair].
          Length = 215

 Score = 38.9 bits (91), Expect = 0.002
 Identities = 32/104 (30%), Positives = 52/104 (50%), Gaps = 4/104 (3%)

Query: 114 KKQLLSIKGVGDYTAGALASICYNIPTPAVDGNVFRIYGRLFEIEDDISKGKTKVVYETL 173
           +++LLSIKG+G  TA ++     + P   VD    R+  RL  IE+     + K ++E  
Sbjct: 114 REELLSIKGIGKETADSILLYALDRPVFVVDKYTRRLLSRLGGIEEK-KYDEIKELFEEN 172

Query: 174 VSKTMSQTNAREFNQALMDLGATVCLFKNPKCKECPLSRFCSAY 217
           + +  +    +EF+ AL+   A     K P C++CPL   C  Y
Sbjct: 173 LPE--NLRLYQEFH-ALIVEHAKHFCKKKPLCEKCPLKEKCKKY 213


>gnl|CDD|204516 pfam10576, EndIII_4Fe-2S, Iron-sulfur binding domain of
           endonuclease III.  Escherichia coli endonuclease III (EC
           4.2.99.18) is a DNA repair enzyme that acts both as a
           DNA N-glycosylase, removing oxidized pyrimidines from
           DNA, and as an apurinic/apyrimidinic (AP) endonuclease,
           introducing a single-strand nick at the site from which
           the damaged base was removed. Endonuclease III is an
           iron-sulfur protein that binds a single 4Fe-4S cluster.
           The 4Fe-4S cluster does not seem to be important for
           catalytic activity, but is probably involved in the
           proper positioning of the enzyme along the DNA strand.
           The 4Fe-4S cluster is bound by four cysteines which are
           all located in a 17 amino acid region at the C-terminal
           end of endonuclease III. A similar region is also
           present in the central section of mutY and in the
           C-terminus of ORF-10 and of the Micro-coccus UV
           endonuclease.
          Length = 17

 Score = 31.6 bits (73), Expect = 0.029
 Identities = 8/14 (57%), Positives = 11/14 (78%)

Query: 201 KNPKCKECPLSRFC 214
           + PKC+ECPL+  C
Sbjct: 4   RKPKCEECPLADLC 17


>gnl|CDD|151922 pfam11485, DUF3211, Protein of unknown function (DUF3211).  This
           archaeal family of proteins has no known function.
          Length = 136

 Score = 30.0 bits (68), Expect = 0.80
 Identities = 15/46 (32%), Positives = 20/46 (43%), Gaps = 4/46 (8%)

Query: 143 VDGNVFRIYGRL----FEIEDDISKGKTKVVYETLVSKTMSQTNAR 184
           V+GN FR  GR     FEI+  +     +V Y   VS      N +
Sbjct: 39  VEGNSFRAEGRFGAFSFEIKGRVYVSNNEVTYIFYVSGGGPNGNGK 84


>gnl|CDD|237519 PRK13812, PRK13812, orotate phosphoribosyltransferase; Provisional.
          Length = 176

 Score = 28.6 bits (64), Expect = 3.3
 Identities = 17/47 (36%), Positives = 24/47 (51%), Gaps = 3/47 (6%)

Query: 151 YGRLFEIEDDISKGKTKVVYETLVSKTMSQTNAREFNQALMDLGATV 197
           YG    IE  + +G+  VV E + +   S  +A E   AL + GATV
Sbjct: 94  YGTGNRIEGRLDEGEEVVVLEDIATTGQSAVDAVE---ALREAGATV 137


>gnl|CDD|140318 PTZ00297, PTZ00297, pantothenate kinase; Provisional.
          Length = 1452

 Score = 29.1 bits (65), Expect = 4.9
 Identities = 15/47 (31%), Positives = 20/47 (42%), Gaps = 3/47 (6%)

Query: 210 LSRFCSAYKNNTIENFPVKLGKTKVKDVFLLTVVVKTDTNKYLIQKR 256
           L R C  +++  IE F   L      DV LL  V       Y +QK+
Sbjct: 21  LPRGCGGFQHERIETF---LASVDAYDVVLLQEVYAASVLPYFLQKQ 64


>gnl|CDD|149881 pfam08950, DUF1861, Protein of unknown function (DUF1861).  This
           hypothetical protein, found in bacteria and in the
           eukaryote Leishmania, has no known function.
          Length = 298

 Score = 28.4 bits (64), Expect = 5.1
 Identities = 13/40 (32%), Positives = 15/40 (37%), Gaps = 7/40 (17%)

Query: 114 KKQLLSIKGVGDYTAGALASICYNIPTPAVDGNVFRIYGR 153
           +  LL+ KGVG Y         YN   P        I GR
Sbjct: 12  ESVLLTFKGVGGYDV-------YNCSAPFTYKGKTHIAGR 44


>gnl|CDD|218638 pfam05559, DUF763, Protein of unknown function (DUF763).  This
           family consists of several uncharacterized bacterial and
           archaeal proteins of unknown function.
          Length = 319

 Score = 28.4 bits (64), Expect = 5.7
 Identities = 14/33 (42%), Positives = 18/33 (54%), Gaps = 3/33 (9%)

Query: 111 PRDKKQLLSIKGVGDYTAGAL---ASICYNIPT 140
           P D ++LL +KGVG  T  AL   A + Y  P 
Sbjct: 265 PEDFEELLLLKGVGPSTLRALALVAEVIYGTPP 297


>gnl|CDD|215183 PLN02323, PLN02323, probable fructokinase.
          Length = 330

 Score = 28.4 bits (64), Expect = 6.3
 Identities = 23/92 (25%), Positives = 35/92 (38%), Gaps = 28/92 (30%)

Query: 76  EDNVLKMW----------EGLGYYSRVRNFQAGCRQVIEQFGGEVPRDKKQLLSIKGVGD 125
           +D V+K+W          EG          + GCR   + F G V   K + +   G GD
Sbjct: 219 DDTVVKLWHPNLKLLLVTEG----------EEGCRYYTKDFKGRVEGFKVKAVDTTGAGD 268

Query: 126 -YTAGALASICYNIPTPAVDGNVFRIYGRLFE 156
            +  G L+ +       A D ++     RL E
Sbjct: 269 AFVGGLLSQL-------AKDLSLLEDEERLRE 293


>gnl|CDD|215922 pfam00443, UCH, Ubiquitin carboxyl-terminal hydrolase. 
          Length = 313

 Score = 28.1 bits (63), Expect = 7.1
 Identities = 22/160 (13%), Positives = 46/160 (28%), Gaps = 34/160 (21%)

Query: 177 TMSQTNAREFNQALMD-----LGATVCLFKNPKCKECPLSRFCSAYKNNTIENFPVKLGK 231
              Q +A EF   L+D     L +      +    E  +++       + ++    K   
Sbjct: 88  GYMQQDAHEFLLFLLDQLHEDLNSLKKRKSHAAENESLITKLFQGQLESRLKCLKCKKES 147

Query: 232 TKVKDVFLLTVVVKTDTNKYLIQKRPTTGLLSNFYMFLSFESDTPYTSQVEFLSENLPFK 291
           +  +    L++ ++  ++                      +  T    ++E L +   + 
Sbjct: 148 STPEPFSDLSLPIEDSSSV--------------------LKLLTGNFLKLEELEKEEKYY 187

Query: 292 V-NLNEKCLGNVKHVFS--------HLKWNMDVYSGTAKE 322
                 K     K   S        HLK   D    T K+
Sbjct: 188 CPKCGGKQNAIKKLDISRLPPVLIIHLKRFEDFNRETEKK 227


>gnl|CDD|224333 COG1415, COG1415, Uncharacterized conserved protein [Function
           unknown].
          Length = 373

 Score = 28.1 bits (63), Expect = 7.2
 Identities = 19/72 (26%), Positives = 29/72 (40%), Gaps = 19/72 (26%)

Query: 111 PRDKKQLLSIKGVGDYTAGAL---ASICYNIPT----PAV--------DGNVFRI----Y 151
           P D ++LL + G+G  T  AL   A + Y  P     P          DG  F +    Y
Sbjct: 274 PDDFEELLLVPGIGPSTVRALALVAEVIYGEPPSFRDPVKYAFAVGGKDGVPFPVNRKTY 333

Query: 152 GRLFEIEDDISK 163
             L E  +++ +
Sbjct: 334 DELIEFLEELVE 345


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0806    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 17,885,499
Number of extensions: 1675941
Number of successful extensions: 1450
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1420
Number of HSP's successfully gapped: 33
Length of query: 363
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 265
Effective length of database: 6,590,910
Effective search space: 1746591150
Effective search space used: 1746591150
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 60 (26.7 bits)