RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= 017995
         (362 letters)



>gnl|CDD|178528 PLN02940, PLN02940, riboflavin kinase.
          Length = 382

 Score =  676 bits (1747), Expect = 0.0
 Identities = 286/354 (80%), Positives = 316/354 (89%)

Query: 2   TDGMFSEVLKTFLVKYGKEWDGREKHKIVGKTPLEEAAIIVEDYGLPCAKHEFVNEVYSM 61
           TDG+ S+VLK FLVKYGK+WDGRE  KIVGKTPLE AA +VEDYGLPC+  EF +E+  +
Sbjct: 25  TDGIVSDVLKAFLVKYGKQWDGREAQKIVGKTPLEAAATVVEDYGLPCSTDEFNSEITPL 84

Query: 62  FSDHLCKVKALPGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSD 121
            S+  C +KALPGANRLIKHL  HGVPMALASNS RA IE+KIS   GW ESFSVIVG D
Sbjct: 85  LSEQWCNIKALPGANRLIKHLKSHGVPMALASNSPRANIEAKISCHQGWKESFSVIVGGD 144

Query: 122 EVRTGKPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAVPSLPKQTHRYT 181
           EV  GKPSPDIFLEAAKRLN+EPS+ LVIEDS+ GV+AGKAAGMEV+AVPS+PKQTH Y+
Sbjct: 145 EVEKGKPSPDIFLEAAKRLNVEPSNCLVIEDSLPGVMAGKAAGMEVIAVPSIPKQTHLYS 204

Query: 182 AADEVINSLLDLRPEKWGLPPFQDWIEGTLPSEPWYIGGPVVKGLGRGSKVLGIPTANLS 241
           +ADEVINSLLDL+PEKWGLPPF DWIEGTLP EPW+IGGPV+KG GRGSKVLGIPTANLS
Sbjct: 205 SADEVINSLLDLQPEKWGLPPFNDWIEGTLPIEPWHIGGPVIKGFGRGSKVLGIPTANLS 264

Query: 242 TEGYSDVLSEHPSGVYFGWAGLSTRGVYKMVMSIGWNPYFDNAEKTIEPWLLHEFDEDFY 301
           TE YSDVLSEHPSGVYFGWAGLSTRGVYKMVMSIGWNPYF+N EKTIEPWLLH+F EDFY
Sbjct: 265 TENYSDVLSEHPSGVYFGWAGLSTRGVYKMVMSIGWNPYFNNTEKTIEPWLLHDFGEDFY 324

Query: 302 DEELHLVIVGYIRPEANFPSLETLIAKIHEDRKVAERALDLPLYSKYRDDPYLK 355
            EEL LVIVGYIRPEANFPSLE+LIAKIHEDR++AE+ALDLPLY+KY+DDPYL 
Sbjct: 325 GEELRLVIVGYIRPEANFPSLESLIAKIHEDRRIAEKALDLPLYAKYKDDPYLT 378


>gnl|CDD|178407 PLN02811, PLN02811, hydrolase.
          Length = 220

 Score =  176 bits (447), Expect = 1e-53
 Identities = 92/216 (42%), Positives = 129/216 (59%), Gaps = 17/216 (7%)

Query: 2   TDGMFSEVLKTFLVKYGKEWDGREKHKIVGKTPLEEAAIIVEDYGLP--CAKHEFVNEVY 59
           T+  ++EV +  L +YGK +D   K K++GK  +E A I VE+ GL    +  +F+ E  
Sbjct: 8   TEKFYTEVQEKILARYGKTFDWSLKAKMMGKKAIEAARIFVEESGLSDSLSPEDFLVERE 67

Query: 60  SMFSDHLCKVKALPGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFS---- 115
           +M  D       +PGA RL++HL   G+P+A+A+ SH+   + K   +HG  E FS    
Sbjct: 68  AMLQDLFPTSDLMPGAERLVRHLHAKGIPIAIATGSHKRHFDLKTQ-RHG--ELFSLMHH 124

Query: 116 VIVGSD-EVRTGKPSPDIFLEAAKRL---NMEPSSSLVIEDSVIGVVAGKAAGMEVVAVP 171
           V+ G D EV+ GKP+PDIFL AA+R     ++P   LV ED+  GV A K AGM VV VP
Sbjct: 125 VVTGDDPEVKQGKPAPDIFLAAARRFEDGPVDPGKVLVFEDAPSGVEAAKNAGMSVVMVP 184

Query: 172 --SLPKQTHRYTAADEVINSLLDLRPEKWGLPPFQD 205
              L K   +   AD+V++SLLD +PE+WGLPPF D
Sbjct: 185 DPRLDKSYCK--GADQVLSSLLDFKPEEWGLPPFPD 218


>gnl|CDD|190069 pfam01687, Flavokinase, Riboflavin kinase.  This family represents
           the C-terminal region of the bifunctional riboflavin
           biosynthesis protein known as RibC in Bacillus subtilis.
           The RibC protein from Bacillus subtilis has both
           flavokinase and flavin adenine dinucleotide synthetase
           (FAD-synthetase) activities. RibC plays an essential
           role in the flavin metabolism. This domain is thought to
           have kinase activity.
          Length = 125

 Score =  138 bits (350), Expect = 2e-40
 Identities = 51/126 (40%), Positives = 70/126 (55%), Gaps = 6/126 (4%)

Query: 215 PWYIGGPVVKGLGRGSKVLGIPTANLSTEGYSDVLSEHPSGVYFGWAGLSTRGVYKMVMS 274
           P+ I G VV G GRG   LG PTANLS      +     +GVY     +  + VY  V +
Sbjct: 5   PYSISGTVVHGKGRGRT-LGFPTANLSLPKDKLL---PKNGVYAVRVKIDGK-VYPGVAN 59

Query: 275 IGWNPYFDNAEKTIEPWLLHEFDEDFYDEELHLVIVGYIRPEANFPSLETLIAKIHEDRK 334
           IG+NP F   + TIE  +L +FD D Y EE+ +  + ++RPE  F SLE L A+I +D +
Sbjct: 60  IGYNPTFGGKKPTIEVHIL-DFDGDLYGEEIRVEFLKFLRPEKKFDSLEELKAQIKKDIE 118

Query: 335 VAERAL 340
            A + L
Sbjct: 119 QARKIL 124


>gnl|CDD|214901 smart00904, Flavokinase, Riboflavin kinase.  Riboflavin is
           converted into catalytically active cofactors (FAD and
           FMN) by the actions of riboflavin kinase, which converts
           it into FMN, and FAD synthetase, which adenylates FMN to
           FAD. Eukaryotes usually have two separate enzymes, while
           most prokaryotes have a single bifunctional protein that
           can carry out both catalyses, although exceptions occur
           in both cases. While eukaryotic monofunctional
           riboflavin kinase is orthologous to the bifunctional
           prokaryotic enzyme. the monofunctional FAD synthetase
           differs from its prokaryotic counterpart, and is instead
           related to the PAPS-reductase family. The bacterial FAD
           synthetase that is part of the bifunctional enzyme has
           remote similarity to nucleotidyl transferases and,
           hence, it may be involved in the adenylylation reaction
           of FAD synthetases. This entry represents riboflavin
           kinase, which occurs as part of a bifunctional enzyme or
           a stand-alone enzyme.
          Length = 124

 Score =  134 bits (339), Expect = 8e-39
 Identities = 43/127 (33%), Positives = 62/127 (48%), Gaps = 7/127 (5%)

Query: 215 PWYIGGPVVKGLGRGSKVLGIPTANLSTEGYSDVLSEHPSGVYFGWAGLSTRGVYKMVMS 274
           P+ I G VV G  RG   LG PTANL  +    +     +GVY     +    +Y  V +
Sbjct: 5   PYSISGRVVHGDKRGRT-LGFPTANLPLDDRLLLP---KNGVYAVRVRV-DGKIYPGVAN 59

Query: 275 IGWNPYFDNAEKTIEPWLLHEFDEDFYDEELHLVIVGYIRPEANFPSLETLIAKIHEDRK 334
           IG  P F   ++++E  +L  F  D Y EE+ +  + +IR E  F SL+ L A+I  D +
Sbjct: 60  IGTRPTFGG-DRSVEVHILD-FSGDLYGEEIEVEFLKFIRDEQKFDSLDELKAQISRDIE 117

Query: 335 VAERALD 341
            A   L 
Sbjct: 118 EAREYLA 124


>gnl|CDD|223710 COG0637, COG0637, Predicted phosphatase/phosphohexomutase [General
           function prediction only].
          Length = 221

 Score =  118 bits (298), Expect = 1e-31
 Identities = 60/191 (31%), Positives = 96/191 (50%), Gaps = 3/191 (1%)

Query: 8   EVLKTFLVKYGKEWDGREKHKIVGKTPLEEAAIIVEDY-GLPCAKHEFV-NEVYSMFSDH 65
                 L +YG E    E  ++ G        ++ +   G   A    +   +Y   +  
Sbjct: 22  RAWLEALKEYGIEISDEEIRELHGGGIARIIDLLRKLAAGEDPADLAELERLLYEAEALE 81

Query: 66  LCKVKALPGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEVRT 125
           L  +K +PG   L++ L   G+P+A+AS+S R   E ++  + G  + F VIV +D+V  
Sbjct: 82  LEGLKPIPGVVELLEQLKARGIPLAVASSSPRRAAE-RVLARLGLLDYFDVIVTADDVAR 140

Query: 126 GKPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAVPSLPKQTHRYTAADE 185
           GKP+PDI+L AA+RL ++P   +V+EDS  G+ A KAAGM VV VP+   + H       
Sbjct: 141 GKPAPDIYLLAAERLGVDPEECVVVEDSPAGIQAAKAAGMRVVGVPAGHDRPHLDPLDAH 200

Query: 186 VINSLLDLRPE 196
             +++L    E
Sbjct: 201 GADTVLLDLAE 211


>gnl|CDD|223274 COG0196, RibF, FAD synthase [Coenzyme metabolism].
          Length = 304

 Score =  109 bits (275), Expect = 2e-27
 Identities = 44/128 (34%), Positives = 63/128 (49%), Gaps = 10/128 (7%)

Query: 215 PWYIGGPVVKGLGRGSKVLGIPTANLSTEGYSDVLSEHPSGVYFGWAGLSTRGVYKMVMS 274
           P+ I G VV G   G   LG PTAN+  +           GVY     L   GVY  V +
Sbjct: 185 PYSIEGKVVHGQKLGRT-LGFPTANIYLKDNVLP----AFGVYAVRVKL-DGGVYPGVAN 238

Query: 275 IGWNPYFDNAEKTIEPWLLHEFDEDFYDEELHLVIVGYIRPEANFPSLETLIAKIHEDRK 334
           +G+ P  D +E+++E  +L  F+ D Y E + +  + +IR E  F SL+ L  +I +D  
Sbjct: 239 VGYRPTVDGSERSLEVHILD-FNGDLYGERVKVRFLKFIRDEKKFDSLDELKEQIEKD-- 295

Query: 335 VAERALDL 342
             ERA  L
Sbjct: 296 -IERARKL 302


>gnl|CDD|213672 TIGR01990, bPGM, beta-phosphoglucomutase.  This model represents
           the beta-phosphoglucomutase enzyme which catalyzes the
           interconverison of beta-D-glucose-1-phosphate and
           beta-D-glucose-6-phosphate. The 6-phosphate is capable
           of non-enzymatic anomerization (alpha <-> beta) while
           the 1-phosphate is not. A separate enzyme is responsible
           for the isomerization of the alpha anomers.
           Beta-D-glucose-1-phosphate results from the
           phosphorylysis of maltose (2.4.1.8), trehalose
           (2.4.1.64) or trehalose-6-phosphate (2.4.1.216).
           Alternatively, these reactions can be run in the
           synthetic direction to create the disaccharides. All
           sequenced genomes which contain a member of this family
           also appear to contain at least one putative maltose or
           trehalose phosphorylase. Three species, Lactococcus,
           Enterococcus and Neisseria appear to contain a pair of
           paralogous beta-PGM's. Beta-phosphoglucomutase is a
           member of the haloacid dehalogenase superfamily of
           hydrolase enzymes. These enzymes are characterized by a
           series of three catalytic motifs positioned within an
           alpha-beta (Rossman) fold. beta-PGM contains an inserted
           alpha helical domain in between the first and second
           conserved motifs and thus is a member of subfamily IA of
           the superfamily. The third catalytic motif comes in
           three variants, the third of which, containing a
           conserved DD or ED, is the only one found here as well
           as in several other related enzymes (TIGR01509). The
           enzyme from L. lactis has been extensively characterized
           including a remarkable crystal structure which traps the
           pentacoordinate transition state [Energy metabolism,
           Biosynthesis and degradation of polysaccharides].
          Length = 185

 Score = 92.4 bits (230), Expect = 3e-22
 Identities = 45/116 (38%), Positives = 60/116 (51%), Gaps = 5/116 (4%)

Query: 56  NEVYSMFSDHLCKVKALPGANRLIKHLSCHGVPMALASNSHRA-TIESKISYQHGWNESF 114
           N+ Y      L     LPG   L+  L  + + +ALAS S  A TI  K+       + F
Sbjct: 73  NDYYVELLKELTPADVLPGIKSLLADLKKNNIKIALASASKNAPTILEKL----ELIDYF 128

Query: 115 SVIVGSDEVRTGKPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAV 170
             IV   E++ GKP P+IFL AA+ L + PS  + IED+  G+ A KAAGM  V V
Sbjct: 129 DAIVDPAELKKGKPDPEIFLAAAEGLGVSPSECIGIEDAQAGIEAIKAAGMFAVGV 184


>gnl|CDD|213673 TIGR02009, PGMB-YQAB-SF, beta-phosphoglucomutase family hydrolase. 
           This subfamily model groups together three clades: the
           characterized beta-phosphoglucomutases (including those
           from E.coli, B.subtilus and L.lactis, TIGR01990), a
           clade of putative bPGM's from mycobacteria and a clade
           including the uncharacterized E.coli and H.influenzae
           yqaB genes which may prove to be beta-mutases of a
           related 1-phosphosugar. All of these are members of the
           larger Haloacid dehalogenase (HAD) subfamily IA and
           include the "variant 3" glu-asp version of the third
           conserved HAD domain (TIGR01509).
          Length = 185

 Score = 92.0 bits (229), Expect = 4e-22
 Identities = 58/176 (32%), Positives = 87/176 (49%), Gaps = 12/176 (6%)

Query: 2   TDGMFSEVLKTFLVKYGKEWDGREKHKIVGKTPLEEAAIIVEDYGLPCA---KH---EFV 55
           T  + ++  K    KYG  +D +    + G +  +    I++  G   +    H   E  
Sbjct: 15  TAPLHAQAWKHIAAKYGISFDKQYNESLKGLSREDILRAILKLRGDGLSLEEIHQLAERK 74

Query: 56  NEVYSMFSDHLCKVKALPGANRLIKHLSCHGVPMALASNSHRA-TIESKISYQHGWNESF 114
           NE+Y      L  V  LPG   L+K L   G+ + L S+S  A  I +K+    G  + F
Sbjct: 75  NELYRE-LLRLTGVAVLPGIRNLLKRLKAKGIAVGLGSSSKNAPRILAKL----GLRDYF 129

Query: 115 SVIVGSDEVRTGKPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAV 170
             IV + EV+ GKP P+ FL AA+ L + P+  +V ED++ GV A +AAGM  VAV
Sbjct: 130 DAIVDASEVKNGKPHPETFLLAAELLGVPPNECIVFEDALAGVQAARAAGMFAVAV 185


>gnl|CDD|235536 PRK05627, PRK05627, bifunctional riboflavin kinase/FMN
           adenylyltransferase; Reviewed.
          Length = 305

 Score = 90.6 bits (226), Expect = 1e-20
 Identities = 41/124 (33%), Positives = 56/124 (45%), Gaps = 9/124 (7%)

Query: 218 IGGPVVKGLGRGSKVLGIPTANLSTEGYSDVLSEHP-SGVYFGWAGLSTRGVYKMVMSIG 276
           I G VV G   G   LG PTANL            P  GVY     +  +  Y  V +IG
Sbjct: 188 ISGRVVHGQKLGRT-LGFPTANLPLPDRV-----LPADGVYAVRVKVDGK-PYPGVANIG 240

Query: 277 WNPYFDNAEKTIEPWLLHEFDEDFYDEELHLVIVGYIRPEANFPSLETLIAKIHEDRKVA 336
             P  D   + +E  LL +F+ D Y E + +  +  +R E  F SL+ L A+I +D + A
Sbjct: 241 TRPTVDGGRQLLEVHLL-DFNGDLYGEHITVEFLKKLRDEQKFDSLDELKAQIAKDIETA 299

Query: 337 ERAL 340
              L
Sbjct: 300 RAFL 303


>gnl|CDD|233443 TIGR01509, HAD-SF-IA-v3, haloacid dehalogenase superfamily,
           subfamily IA, variant 3 with third motif having DD or
           ED.  This model represents part of one structural
           subfamily of the Haloacid Dehalogenase (HAD) superfamily
           of aspartate-nucleophile hydrolases. The superfamily is
           defined by the presence of three short catalytic motifs.
           The subfamilies are defined based on the location and
           the observed or predicted fold of a so-called "capping
           domain", or the absence of such a domain. Subfamily I
           consists of sequences in which the capping domain is
           found in between the first and second catalytic motifs.
           Subfamily II consists of sequences in which the capping
           domain is found between the second and third motifs.
           Subfamily III sequences have no capping domain in either
           of these positions.The Subfamily IA and IB capping
           domains are predicted by PSI-PRED to consist of an alpha
           helical bundle. Subfamily I encompasses such a wide
           region of sequence space (the sequences are highly
           divergent) that representing it with a single model is
           impossible, resulting in an overly broad description
           which allows in many unrelated sequences. Subfamily IA
           and IB are separated based on an aparrent phylogenetic
           bifurcation. Subfamily IA is still too broad to model,
           but cannot be further subdivided into large chunks based
           on phylogenetic trees. Of the three motifs defining the
           HAD superfamily, the third has three variant forms : (1)
           hhhhsDxxx(x)D, (2) hhhhssxxx(x)D and (3) hhhhDDxxx(x)s
           where _s_ refers to a small amino acid and _h_ to a
           hydrophobic one. All three of these variants are found
           in subfamily IA. Individual models were made based on
           seeds exhibiting only one of the variants each. Variant
           3 (this model) is found in the enzymes
           beta-phosphoglucomutase (TIGR01990) and
           deoxyglucose-6-phosphatase, while many other enzymes of
           subfamily IA exhibit this variant as well as variant 1
           (TIGR01549). These three variant models were created
           withthe knowledge that there will be overlap among them
           - this is by design and serves the purpose of
           eliminating the overlap with models of more distantly
           relatedHAD subfamilies caused by an overly broad single
           model [Unknown function, Enzymes of unknown
           specificity].
          Length = 177

 Score = 85.2 bits (211), Expect = 1e-19
 Identities = 48/150 (32%), Positives = 71/150 (47%), Gaps = 4/150 (2%)

Query: 23  GREKHKIVGKTPLEEAAIIVEDYGLPCAKHEFVNEVYSMFSDHLCK--VKALPGANRLIK 80
             E       +           YG   +  +       +F + L K  +K LPG   L++
Sbjct: 30  PDELGVSEVGSLELALRRWKAKYGRTMSAEDAQLLYKQLFYEALEKEGLKPLPGVRALLE 89

Query: 81  HLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEVRTGKPSPDIFLEAAKRL 140
            L   G  +AL +NS RA  +  +    G    F V++ S +V  GKP PDI+L+A K+L
Sbjct: 90  ALRARGKKLALLTNSPRADAKLVLE--LGLRALFDVVIDSSDVGLGKPDPDIYLQALKKL 147

Query: 141 NMEPSSSLVIEDSVIGVVAGKAAGMEVVAV 170
            ++PS  L ++DS  G+ A KAAGM  V V
Sbjct: 148 GLKPSECLFVDDSPAGIDAAKAAGMHTVLV 177


>gnl|CDD|222115 pfam13419, HAD_2, Haloacid dehalogenase-like hydrolase. 
          Length = 176

 Score = 79.7 bits (197), Expect = 1e-17
 Identities = 37/156 (23%), Positives = 64/156 (41%), Gaps = 7/156 (4%)

Query: 16  KYGKEWDGREKHKIVGKTPLEEAAIIVEDYGLPCA-KHEFVNEVYSMFSDHLCKVKALPG 74
           + G +    E  +  G    E  A ++ ++ +      E + E               P 
Sbjct: 27  RLGLDISAEELREAGGLPFDEALADLLREHPIDPDEILEALLEYNLESRLEP-----FPD 81

Query: 75  ANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEVRTGKPSPDIFL 134
              L++ L   GV + + SN  R  +E  +    G  + F  +  SD+V   KP P+ + 
Sbjct: 82  VVELLRRLKAKGVKLVILSNGSREAVERLLEK-LGLLDLFDAVFTSDDVGARKPDPEAYE 140

Query: 135 EAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAV 170
              +RL + P   L I+DS   + A +AAG++ V V
Sbjct: 141 RVLERLGLPPEEILFIDDSPEDLEAARAAGIKTVHV 176


>gnl|CDD|223620 COG0546, Gph, Predicted phosphatases [General function prediction
           only].
          Length = 220

 Score = 79.8 bits (197), Expect = 2e-17
 Identities = 48/199 (24%), Positives = 80/199 (40%), Gaps = 14/199 (7%)

Query: 6   FSEVLKTFLVKYGKEWDGREKHKIVGKTPLEEAAIIVEDYGLPCAKHEFVNEVYSMFSDH 65
                   L + G      E+ + +    L+E    +       A  E V  +   F   
Sbjct: 22  ILRAFNAALAELGLPPLDEEEIRQLIGLGLDELIERLLGEADEEAAAELVERLREEFLTA 81

Query: 66  ---LCKVKALPGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDE 122
              L + +  PG   L+  L   G  + + +N     ++  +    G  + F VIVG D+
Sbjct: 82  YAELLESRLFPGVKELLAALKSAGYKLGIVTNKPERELD-ILLKALGLADYFDVIVGGDD 140

Query: 123 VRTGKPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAV------PSLPKQ 176
           V   KP P+  L   ++L ++P  +L++ DS+  ++A KAAG+  V V           Q
Sbjct: 141 VPPPKPDPEPLLLLLEKLGLDPEEALMVGDSLNDILAAKAAGVPAVGVTWGYNSREELAQ 200

Query: 177 THRYTAADEVINSLLDLRP 195
                 AD VI+SL +L  
Sbjct: 201 AG----ADVVIDSLAELLA 215


>gnl|CDD|236770 PRK10826, PRK10826, 2-deoxyglucose-6-phosphatase; Provisional.
          Length = 222

 Score = 77.3 bits (191), Expect = 1e-16
 Identities = 36/128 (28%), Positives = 65/128 (50%), Gaps = 4/128 (3%)

Query: 71  ALPGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEVRTGKPSP 130
            LPG    +      G+ + LAS S    +E  +       + F  +  ++++   KP P
Sbjct: 93  LLPGVREALALCKAQGLKIGLASASPLHMLE-AVLTMFDLRDYFDALASAEKLPYSKPHP 151

Query: 131 DIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAVPSLPKQTH--RYTAADEVIN 188
           +++L  A +L ++P + + +EDS  G++A KAA M  + VP+ P+Q +  R+  AD  + 
Sbjct: 152 EVYLNCAAKLGVDPLTCVALEDSFNGMIAAKAARMRSIVVPA-PEQQNDPRWALADVKLE 210

Query: 189 SLLDLRPE 196
           SL +L   
Sbjct: 211 SLTELTAA 218


>gnl|CDD|215416 PLN02779, PLN02779, haloacid dehalogenase-like hydrolase family
           protein.
          Length = 286

 Score = 78.2 bits (193), Expect = 2e-16
 Identities = 44/134 (32%), Positives = 68/134 (50%), Gaps = 3/134 (2%)

Query: 69  VKALPGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWN--ESFSVIVGSDEVRTG 126
           +   PG  RL+      G+ +A+ S S+   +   ++   G    +   V  G D+V   
Sbjct: 143 LPLRPGVLRLMDEALAAGIKVAVCSTSNEKAVSKIVNTLLGPERAQGLDVFAG-DDVPKK 201

Query: 127 KPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAVPSLPKQTHRYTAADEV 186
           KP PDI+  AA+ L ++PS  +V+EDSVIG+ A KAAGM  +   S       ++ AD V
Sbjct: 202 KPDPDIYNLAAETLGVDPSRCVVVEDSVIGLQAAKAAGMRCIVTKSSYTADEDFSGADAV 261

Query: 187 INSLLDLRPEKWGL 200
            + L D+  E + L
Sbjct: 262 FDCLGDVPLEDFDL 275


>gnl|CDD|183215 PRK11587, PRK11587, putative phosphatase; Provisional.
          Length = 218

 Score = 74.3 bits (183), Expect = 2e-15
 Identities = 49/137 (35%), Positives = 70/137 (51%), Gaps = 19/137 (13%)

Query: 69  VKALPGANRLIKHLSCHGVPMA--------LASNSHRATIESKISYQHGWNESFSVIVGS 120
           + ALPGA  L+ HL+  G+P A        +AS  H+A          G      V V +
Sbjct: 82  ITALPGAIALLNHLNKLGIPWAIVTSGSVPVASARHKAA---------GLPAP-EVFVTA 131

Query: 121 DEVRTGKPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAVPSLPKQTHRY 180
           + V+ GKP PD +L  A+ L + P   +V+ED+  GV++G AAG  V+AV + P  T R 
Sbjct: 132 ERVKRGKPEPDAYLLGAQLLGLAPQECVVVEDAPAGVLSGLAAGCHVIAV-NAPADTPRL 190

Query: 181 TAADEVINSLLDLRPEK 197
              D V++SL  L   K
Sbjct: 191 DEVDLVLHSLEQLTVTK 207


>gnl|CDD|232818 TIGR00083, ribF, riboflavin kinase/FMN adenylyltransferase.
           multifunctional enzyme: riboflavin kinase (EC 2.7.1.26)
           (flavokinase) / FMN adenylyltransferase (EC 2.7.7.2)
           (FAD pyrophosphorylase) (FAD synthetase) [Biosynthesis
           of cofactors, prosthetic groups, and carriers,
           Riboflavin, FMN, and FAD].
          Length = 288

 Score = 73.6 bits (181), Expect = 8e-15
 Identities = 41/127 (32%), Positives = 60/127 (47%), Gaps = 6/127 (4%)

Query: 214 EPWYIGGPVVKGLGRGSKVLGIPTANLSTEGYSDVLSEHPSGVYFGWAGLSTRGVYKMVM 273
            P++I G V+ G   G   LG PTAN+  +     L     G Y     L+    Y  V 
Sbjct: 167 RPYFICGTVIHGQKLGRT-LGFPTANIKLKNQVLPL---KGGYYVVVVLLNGE-PYPGVG 221

Query: 274 SIGWNPYFDNAEKTIEPWLLHEFDEDFYDEELHLVIVGYIRPEANFPSLETLIAKIHEDR 333
           +IG  P F   +  IE  LL +F  + Y +E+ + +V  IRPE  F SL+ L  +I +D 
Sbjct: 222 NIGNRPTFIGQQLVIEVHLL-DFSGELYGQEIKVTLVKKIRPEQKFSSLDELKNQIQQDI 280

Query: 334 KVAERAL 340
             A++  
Sbjct: 281 LQAKKWF 287


>gnl|CDD|119389 cd01427, HAD_like, Haloacid dehalogenase-like hydrolases. The
           haloacid dehalogenase-like (HAD) superfamily includes
           L-2-haloacid dehalogenase, epoxide hydrolase,
           phosphoserine phosphatase, phosphomannomutase,
           phosphoglycolate phosphatase, P-type ATPase, and many
           others, all of which use a nucleophilic aspartate in
           their phosphoryl transfer reaction. All members possess
           a highly conserved alpha/beta core domain, and many also
           possess a small cap domain, the fold and function of
           which is variable. Members of this superfamily are
           sometimes referred to as belonging to the DDDD
           superfamily of phosphohydrolases.
          Length = 139

 Score = 70.1 bits (172), Expect = 1e-14
 Identities = 33/119 (27%), Positives = 55/119 (46%), Gaps = 17/119 (14%)

Query: 68  KVKALPGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEV---- 123
           +++  PG    +K L   G+ +ALA+N  R  +   +  + G ++ F  ++ S+      
Sbjct: 22  ELELYPGVKEALKELKEKGIKLALATNKSRREVLELLE-ELGLDDYFDPVITSNGAAIYY 80

Query: 124 ------------RTGKPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAV 170
                         GKP+PD  L A K L ++P   L++ DS+  +   KAAG   VAV
Sbjct: 81  PKEGLFLGGGPFDIGKPNPDKLLAALKLLGVDPEEVLMVGDSLNDIEMAKAAGGLGVAV 139


>gnl|CDD|223943 COG1011, COG1011, Predicted hydrolase (HAD superfamily) [General
           function prediction only].
          Length = 229

 Score = 71.5 bits (175), Expect = 2e-14
 Identities = 43/183 (23%), Positives = 73/183 (39%), Gaps = 6/183 (3%)

Query: 13  FLVKYGKEWDGREKHKIVGKTPLEEAAIIVEDYGLPCAKHEFVNEVYSMFSDHLCKVKAL 72
            L    K      + +  G+  L    ++     L     E    V  + +     +   
Sbjct: 44  LLKLIEKLEARFLRGEYTGEYGLTLERLLELLERLL--GDEDAELVEELLAALAKLLPDY 101

Query: 73  PGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEVRTGKPSPDI 132
           P A   +K L      + + +N  R   E K+  Q G  + F  +  S++V   KP P+I
Sbjct: 102 PEALEALKELG-KKYKLGILTNGARPHQERKLR-QLGLLDYFDAVFISEDVGVAKPDPEI 159

Query: 133 FLEAAKRLNMEPSSSLVIEDSVIGVVAG-KAAGMEVVAVPSLPKQT-HRYTAADEVINSL 190
           F  A ++L + P  +L + DS+   + G +A GM+ V +    K       A D  I+SL
Sbjct: 160 FEYALEKLGVPPEEALFVGDSLENDILGARALGMKTVWINRGGKPLPDALEAPDYEISSL 219

Query: 191 LDL 193
            +L
Sbjct: 220 AEL 222


>gnl|CDD|215497 PLN02919, PLN02919, haloacid dehalogenase-like hydrolase family
           protein.
          Length = 1057

 Score = 74.1 bits (182), Expect = 2e-14
 Identities = 39/98 (39%), Positives = 57/98 (58%)

Query: 73  PGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEVRTGKPSPDI 132
           PGA  LI      G+ +A+AS++ R  +++ ++        F  IV +D     KP+PDI
Sbjct: 164 PGALELITQCKNKGLKVAVASSADRIKVDANLAAAGLPLSMFDAIVSADAFENLKPAPDI 223

Query: 133 FLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAV 170
           FL AAK L +  S  +VIED++ GV A +AAGM  +AV
Sbjct: 224 FLAAAKILGVPTSECVVIEDALAGVQAARAAGMRCIAV 261


>gnl|CDD|182679 PRK10725, PRK10725, fructose-1-P/6-phosphogluconate phosphatase;
           Provisional.
          Length = 188

 Score = 68.6 bits (168), Expect = 1e-13
 Identities = 32/87 (36%), Positives = 51/87 (58%), Gaps = 3/87 (3%)

Query: 88  PMALASNSHRATIESKISYQH-GWNESFSVIVGSDEVRTGKPSPDIFLEAAKRLNMEPSS 146
           PMA+ + S  A  E+ +   H G    F  +V +D+V+  KP+PD FL  A+ + ++P+ 
Sbjct: 104 PMAVGTGSESAIAEALL--AHLGLRRYFDAVVAADDVQHHKPAPDTFLRCAQLMGVQPTQ 161

Query: 147 SLVIEDSVIGVVAGKAAGMEVVAVPSL 173
            +V ED+  G+ A +AAGM+ V V  L
Sbjct: 162 CVVFEDADFGIQAARAAGMDAVDVRLL 188


>gnl|CDD|237310 PRK13222, PRK13222, phosphoglycolate phosphatase; Provisional.
          Length = 226

 Score = 68.7 bits (169), Expect = 2e-13
 Identities = 36/132 (27%), Positives = 56/132 (42%), Gaps = 15/132 (11%)

Query: 73  PGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEVRTGKPSPDI 132
           PG    +  L   G P+A+ +N     +   +    G  + FSV++G D +   KP P  
Sbjct: 96  PGVKETLAALKAAGYPLAVVTNKPTPFVA-PLLEALGIADYFSVVIGGDSLPNKKPDPAP 154

Query: 133 FLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAVPSLPKQTHRYT--------AAD 184
            L A ++L ++P   L + DS   + A +AAG   V V      T+ Y           D
Sbjct: 155 LLLACEKLGLDPEEMLFVGDSRNDIQAARAAGCPSVGV------TYGYNYGEPIALSEPD 208

Query: 185 EVINSLLDLRPE 196
            VI+   +L P 
Sbjct: 209 VVIDHFAELLPL 220


>gnl|CDD|216069 pfam00702, Hydrolase, haloacid dehalogenase-like hydrolase.  This
           family is structurally different from the alpha/beta
           hydrolase family (pfam00561). This family includes
           L-2-haloacid dehalogenase, epoxide hydrolases and
           phosphatases. The structure of the family consists of
           two domains. One is an inserted four helix bundle, which
           is the least well conserved region of the alignment,
           between residues 16 and 96 of Pseudomonas sp.
           (S)-2-haloacid dehalogenase 1. The rest of the fold is
           composed of the core alpha/beta domain. Those members
           with the characteristic DxD triad at the N-terminus are
           probably phosphatidylglycerolphosphate (PGP)
           phosphatases involved in cardiolipin biosynthesis in the
           mitochondria.
          Length = 187

 Score = 67.3 bits (164), Expect = 3e-13
 Identities = 33/150 (22%), Positives = 55/150 (36%), Gaps = 10/150 (6%)

Query: 18  GKEWDGREKHKIVGKTPLEEAAIIVEDYGLPCAKHEFVNEVYSMFSDHLCKVKALPGANR 77
            KE       +++ +    E  +                 V  + +         PGA  
Sbjct: 45  TKEGREELVRRLLLRALAGEELLEELLRAGATVVAVLDLVVLGLIALTD---PLYPGARE 101

Query: 78  LIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEVRT---GKPSPDIFL 134
            +K L   G+ +A+ +  +R T  +           F  +V +D       GKP P IF 
Sbjct: 102 ALKELKEAGIKLAILTGDNRLTANAIARLLG----LFDALVSADLYGLVGVGKPDPKIFE 157

Query: 135 EAAKRLNMEPSSSLVIEDSVIGVVAGKAAG 164
            A + L ++P   L++ D V  + A KAAG
Sbjct: 158 LALEELGVKPEEVLMVGDGVNDIPAAKAAG 187


>gnl|CDD|130516 TIGR01449, PGP_bact, 2-phosphoglycolate phosphatase, prokaryotic.
           PGP is an essential enzyme in the glycolate salvage
           pathway in higher organisms (photorespiration in
           plants). Phosphoglycolate results from the oxidase
           activity of RubisCO in the Calvin cycle when
           concentrations of carbon dioxide are low relative to
           oxygen. In Ralstonia (Alcaligenes) eutropha and
           Rhodobacter sphaeroides, the PGP gene (CbbZ) is located
           on an operon along with other Calvin cycle enzymes
           including RubisCO. The only other pertinent experimental
           evidence concerns the gene from E. coli. The in vitro
           activity of the Ralstonia and Escherichia enzymes was
           determined with crude cell extracts of strains
           containing PGP on expression plasmids and compared to
           controls. In E. coli, however, there does not appear to
           be a functional Calvin cycle (RubisCO is absent),
           although the E. coli PGP gene (gph) is on the same
           operon (dam) with ribulose-5-phosphate-3-epimerase
           (rpe), a gene in the pentose-phosphate pathway (along
           with other, unrelated genes). The E. coli enzyme is not
           expressed under normal laboratory conditions; the
           pathway to which it belongs has not been determined. In
           fact, the possibility exists, although unlikely, that
           the E. coli enzyme and others within this equivalog have
           as their physiological substrate another, closely
           related molecule. The other seed chosen for this model,
           from Xylella fastidiosa has no experimental evidence,
           but is a plant pathogen and thus may obtain
           phosphoglycolate from its host. This model has been
           restricted to encompass only proteobacteria as no
           related PGP has been verified outside of this clade.
           Sequences from Aquifex aeolicus and Treponema pallidum
           fall between the trusted and noise cutoffs. Just below
           the noise cutoff is a gene which is part of the operon
           for the biosynthesis of the blue pigment, indigoidine,
           from Erwinia (Pectobacterium) chrysanthemi, a plant
           pathogen. It does not seem likely, considering the
           proposed biosynthetic mechanism, that the
           dephosphorylation of phosphoglycolate or a closely
           related compound is required. Possibly, this gene is
           fortuitously located in this operon, or has an indirect
           relationship to the necessity for the biosynthesis of
           this compound. Sequences from 11 species have been
           annotated as PGP or putative PGP but fall below the
           noise cutoff. None of these have experimental
           validation. This enzyme is a member of the Haloacid
           Dehalogenase (HAD) superfamily of aspartate-nucleophile
           hydrolase enzymes (pfam00702) [Energy metabolism,
           Sugars].
          Length = 213

 Score = 66.8 bits (163), Expect = 6e-13
 Identities = 29/101 (28%), Positives = 46/101 (45%), Gaps = 1/101 (0%)

Query: 70  KALPGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEVRTGKPS 129
              PG    +  L   G+ + L +N         +    G  + FSV++G D +   KP 
Sbjct: 85  SVFPGVEATLGALRAKGLRLGLVTNKPTPLARPLLELL-GLAKYFSVLIGGDSLAQRKPH 143

Query: 130 PDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAV 170
           PD  L AA+RL + P   + + DS + + A +AAG   V +
Sbjct: 144 PDPLLLAAERLGVAPQQMVYVGDSRVDIQAARAAGCPSVLL 184


>gnl|CDD|215313 PLN02575, PLN02575, haloacid dehalogenase-like hydrolase.
          Length = 381

 Score = 65.7 bits (160), Expect = 6e-12
 Identities = 44/120 (36%), Positives = 63/120 (52%), Gaps = 2/120 (1%)

Query: 74  GANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEVRTGKPSPDIF 133
           G+   +  L  + +PMAL S   R T+E+ I    G    FSVIV +++V  GKP P++F
Sbjct: 220 GSQEFVNVLMNYKIPMALVSTRPRKTLENAIG-SIGIRGFFSVIVAAEDVYRGKPDPEMF 278

Query: 134 LEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAVPSLPKQTHRYTAADEVINSLLDL 193
           + AA+ LN  P   +V  +S   V A   A M+ VAV S     +   AAD V+  L +L
Sbjct: 279 IYAAQLLNFIPERCIVFGNSNQTVEAAHDARMKCVAVAS-KHPIYELGAADLVVRRLDEL 337


>gnl|CDD|215413 PLN02770, PLN02770, haloacid dehalogenase-like hydrolase family
           protein.
          Length = 248

 Score = 64.1 bits (156), Expect = 1e-11
 Identities = 36/103 (34%), Positives = 54/103 (52%), Gaps = 1/103 (0%)

Query: 68  KVKALPGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEVRTGK 127
           ++K L G  +L K +   G+  A  +N+ R   E  IS   G ++ F  ++   E    K
Sbjct: 106 QLKPLNGLYKLKKWIEDRGLKRAAVTNAPRENAELMISLL-GLSDFFQAVIIGSECEHAK 164

Query: 128 PSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAV 170
           P PD +L+A + L +    + V EDSV G+ AG AAGM VV +
Sbjct: 165 PHPDPYLKALEVLKVSKDHTFVFEDSVSGIKAGVAAGMPVVGL 207


>gnl|CDD|215644 PLN03243, PLN03243, haloacid dehalogenase-like hydrolase;
           Provisional.
          Length = 260

 Score = 63.5 bits (154), Expect = 2e-11
 Identities = 40/124 (32%), Positives = 63/124 (50%), Gaps = 2/124 (1%)

Query: 70  KALPGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEVRTGKPS 129
           +  PG+   ++ L  H +P+A+AS   R  +E  I    G    FSV++ +++V  GKP 
Sbjct: 109 RLRPGSREFVQALKKHEIPIAVASTRPRRYLERAIE-AVGMEGFFSVVLAAEDVYRGKPD 167

Query: 130 PDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAVPSLPKQTHRYTAADEVINS 189
           P++F+ AA+RL   P   +V  +S   V A     M+ VAV       +  +A D V+  
Sbjct: 168 PEMFMYAAERLGFIPERCIVFGNSNSSVEAAHDGCMKCVAVAG-KHPVYELSAGDLVVRR 226

Query: 190 LLDL 193
           L DL
Sbjct: 227 LDDL 230


>gnl|CDD|130521 TIGR01454, AHBA_synth_RP, 3-amino-5-hydroxybenoic acid synthesis
           related protein.  The enzymes in this equivalog are all
           located in the operons for the biosynthesis of
           3-amino-5-hydroxybenoic acid (AHBA), which is a
           precursor of several antibiotics including ansatrienin ,
           naphthomycin , rifamycin and mitomycin. The role that
           this enzyme plays in this biosynthesis has not been
           elucidated. This enzyme is a member of the Haloacid
           dehalogenase superfamily (pfam00702) of
           aspartate-nucleophile hydrolases. This enzyme is closely
           related to phosphoglycolate phosphatase (TIGR01449), but
           it is unclear what purpose a PGPase or PGPase-like
           activity would serve in these biosyntheses. This model
           is limited to the Gram positive Actinobacteria. The most
           closely related enzyme below the noise cutoff is IndB
           which is involved in the biosynthesis of Indigoidine in
           Pectobacterium (Erwinia) chrysanthemi, a gamma
           proteobacter. This enzyme is similarly related to PGP.
           In this case, too it is unclear what role would be be
           played by a PGPase activity.
          Length = 205

 Score = 62.2 bits (151), Expect = 2e-11
 Identities = 42/129 (32%), Positives = 65/129 (50%), Gaps = 14/129 (10%)

Query: 46  GLPCAKHE-FVNEVYSMFSDHLCKVKALPGANRLIKHLSCHGVPMALASNSH----RATI 100
           GLP    E FV E Y +      +V+  PG   L+  L   GV  A+A+       R+ +
Sbjct: 54  GLPLEMEEPFVRESYRLAG----EVEVFPGVPELLAELRADGVGTAIATGKSGPRARSLL 109

Query: 101 ESKISYQHGWNESFSVIVGSDEVRTGKPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAG 160
           E+      G    F  ++GSDEV   KP+PDI  EA + L++ P  ++++ D+V  + + 
Sbjct: 110 EAL-----GLLPLFDHVIGSDEVPRPKPAPDIVREALRLLDVPPEDAVMVGDAVTDLASA 164

Query: 161 KAAGMEVVA 169
           +AAG   VA
Sbjct: 165 RAAGTATVA 173


>gnl|CDD|222003 pfam13242, Hydrolase_like, HAD-hyrolase-like. 
          Length = 74

 Score = 58.5 bits (142), Expect = 3e-11
 Identities = 25/77 (32%), Positives = 39/77 (50%), Gaps = 14/77 (18%)

Query: 126 GKPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAVPSLPKQTHRYTA--- 182
           GKP+P +   A +RL ++P   ++I DS   ++A +AAG+  + V      T   TA   
Sbjct: 3   GKPNPGMLRAALERLGVDPEECVMIGDSDTDILAARAAGIRTILVL-----TGVTTAEDL 57

Query: 183 ------ADEVINSLLDL 193
                  D V++SL DL
Sbjct: 58  ERAPGRPDYVVDSLADL 74


>gnl|CDD|200170 TIGR02252, DREG-2, REG-2-like, HAD superfamily (subfamily IA)
           hydrolase.  This family of proteins includes
           uncharacterized sequences from eukaryotes, cyanobacteria
           and Leptospira as well as the DREG-2 protein from
           Drosophila melanogaster which has been identified as a
           rhythmically (diurnally) regulated gene. This family is
           a member of the Haloacid Dehalogenase (HAD) superfamily
           of aspartate-nucleophile hydrolases. The superfamily is
           defined by the presence of three short catalytic motifs.
           The subfamilies are defined based on the location and
           the observed or predicted fold of a so-called 'capping
           domain', or the absence of such a domain. This family is
           a member of subfamily 1A in which the cap domain
           consists of a predicted alpha helical bundle found in
           between the first and second catalytic motifs. A
           distinctive feature of this family is a conserved tandem
           pair of tryptophan residues in the cap domain. The most
           divergent sequences included within the scope of this
           model are from plants and have "FW" at this position
           instead. Most likely, these sequences, like the vast
           majority of HAD sequences, represent phosphatase
           enzymes.
          Length = 203

 Score = 50.7 bits (122), Expect = 2e-07
 Identities = 38/122 (31%), Positives = 56/122 (45%), Gaps = 10/122 (8%)

Query: 48  PCAKHEFVNEVYSMFSDHLCKVKALPGANRLIKHLSCHGVPMALASN---SHRATIESKI 104
           P +  +   E+YS F+      +  P A +L+K L   G+ + + SN     R  +E+  
Sbjct: 84  PESFEKIFEELYSYFATPEP-WQVYPDAIKLLKDLRERGLILGVISNFDSRLRGLLEAL- 141

Query: 105 SYQHGWNESFSVIVGSDEVRTGKPSPDIFLEAAKRLNMEPSSSLVIEDSVI-GVVAGKAA 163
               G  E F  +V S EV   KP P IF EA +R  + P  +L I DS+       +AA
Sbjct: 142 ----GLLEYFDFVVTSYEVGAEKPDPKIFQEALERAGISPEEALHIGDSLRNDYQGARAA 197

Query: 164 GM 165
           G 
Sbjct: 198 GW 199


>gnl|CDD|182552 PRK10563, PRK10563, 6-phosphogluconate phosphatase; Provisional.
          Length = 221

 Score = 50.8 bits (122), Expect = 2e-07
 Identities = 40/138 (28%), Positives = 68/138 (49%), Gaps = 20/138 (14%)

Query: 39  AIIVEDYGLPCAKHE----FVNEVYSMFSDHLCKVKALPGANRLIKHLSCHGVPMALASN 94
            II +++G+  AK E    +  EV  +F   L   + + GAN L++ ++   VPM + SN
Sbjct: 56  DIISKEHGVTLAKAELEPVYRAEVARLFDSEL---EPIAGANALLESIT---VPMCVVSN 109

Query: 95  SHRATIESKISYQHGWNESFS-----VIVGSDEVRTGKPSPDIFLEAAKRLNMEPSSSLV 149
                  SK+ +  G           +  G D ++  KP P +   AA+ +N+   + ++
Sbjct: 110 GP----VSKMQHSLGKTGMLHYFPDKLFSGYD-IQRWKPDPALMFHAAEAMNVNVENCIL 164

Query: 150 IEDSVIGVVAGKAAGMEV 167
           ++DS  G  +G AAGMEV
Sbjct: 165 VDDSSAGAQSGIAAGMEV 182


>gnl|CDD|171912 PRK13223, PRK13223, phosphoglycolate phosphatase; Provisional.
          Length = 272

 Score = 50.2 bits (120), Expect = 5e-07
 Identities = 54/179 (30%), Positives = 71/179 (39%), Gaps = 35/179 (19%)

Query: 27  HKIVGKTPLEEA-AIIVEDYGLPCAKHEFVNEVYSMFSDHLCKVKALPGANRLIKHLSCH 85
           H  V     E+A A+ +E Y      HE    VY             PG    +K L   
Sbjct: 74  HDGVDDELAEQALALFMEAYA---DSHEL-TVVY-------------PGVRDTLKWLKKQ 116

Query: 86  GVPMALASNSHRATI-----ESKISYQHGWNESFSVIVGSDEVRTGKPSPDIFLEAAKRL 140
           GV MAL +N     +     + KI     W      I+G D +   KP P   L   K  
Sbjct: 117 GVEMALITNKPERFVAPLLDQMKIGRYFRW------IIGGDTLPQKKPDPAALLFVMKMA 170

Query: 141 NMEPSSSLVIEDSVIGVVAGKAAGMEVVAVPSLPKQTHRYTAADE----VINSLLDLRP 195
            + PS SL + DS   V+A KAAG++ VA+       H    A+E    VI+ L  L P
Sbjct: 171 GVPPSQSLFVGDSRSDVLAAKAAGVQCVALSY--GYNHGRPIAEESPALVIDDLRALLP 227


>gnl|CDD|237311 PRK13226, PRK13226, phosphoglycolate phosphatase; Provisional.
          Length = 229

 Score = 49.5 bits (118), Expect = 6e-07
 Identities = 22/64 (34%), Positives = 36/64 (56%)

Query: 107 QHGWNESFSVIVGSDEVRTGKPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGME 166
           Q GW +  +V++G D +   KP P   L AA+R+ + P+  + + D    ++A +AAGM 
Sbjct: 131 QLGWEQRCAVLIGGDTLAERKPHPLPLLVAAERIGVAPTDCVYVGDDERDILAARAAGMP 190

Query: 167 VVAV 170
            VA 
Sbjct: 191 SVAA 194


>gnl|CDD|237336 PRK13288, PRK13288, pyrophosphatase PpaX; Provisional.
          Length = 214

 Score = 48.9 bits (117), Expect = 8e-07
 Identities = 29/104 (27%), Positives = 51/104 (49%), Gaps = 5/104 (4%)

Query: 69  VKALPGANRLIKHLSCHGVPMALASNSHRATIES--KISYQHGWNESFSVIVGSDEVRTG 126
           V         +K L   G  + + +   R T+E   K++   G +E F V++  D+V   
Sbjct: 81  VTEYETVYETLKTLKKQGYKLGIVTTKMRDTVEMGLKLT---GLDEFFDVVITLDDVEHA 137

Query: 127 KPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAV 170
           KP P+  L+A + L  +P  +L++ D+   ++AGK AG +   V
Sbjct: 138 KPDPEPVLKALELLGAKPEEALMVGDNHHDILAGKNAGTKTAGV 181


>gnl|CDD|233463 TIGR01549, HAD-SF-IA-v1, haloacid dehalogenase superfamily,
           subfamily IA, variant 1 with third motif having Dx(3-4)D
           or Dx(3-4)E.  This model represents part of one
           structural subfamily of the Haloacid Dehalogenase (HAD)
           superfamily of aspartate-nucleophile hydrolases. The
           superfamily is defined by the presence of three short
           catalytic motifs. The subfamilies are defined based on
           the location and the observed or predicted fold of a
           so-called "capping domain", or the absence of such a
           domain. Subfamily I consists of sequences in which the
           capping domain is found in between the first and second
           catalytic motifs. Subfamily II consists of sequences in
           which the capping domain is found between the second and
           third motifs. Subfamily III sequences have no capping
           domain in either of these positions.The Subfamily IA and
           IB capping domains are predicted by PSI-PRED to consist
           of an alpha helical bundle. Subfamily I encompasses such
           a wide region of sequence space (the sequences are
           highly divergent) that modelling it with a single
           representation is impossible, resulting in an overly
           broad description which allows in many unrelated
           sequences. Subfamily IA and IB are separated based on an
           aparrent phylogenetic bifurcation. Subfamily IA is still
           too broad to model, but cannot be further subdivided
           into large chunks based on phylogenetic trees. Of the
           three motifs defining the HAD superfamily, the third has
           three variant forms : (1) hhhhsDxxx(x)(D/E), (2)
           hhhhssxxx(x)D and (3) hhhhDDxxx(x)s where _s_ refers to
           a small amino acid and _h_ to a hydrophobic one. All
           three of these variants are found in subfamily IA.
           Individual models were made based on seeds exhibiting
           only one of the variants each. Variant 1 (this model) is
           found in the enzymes phosphoglycolate phosphatase
           (TIGR01449) and enolase-phosphatase. These three variant
           models (see also TIGR01493 and TIGR01509) were created
           withthe knowledge that there will be overlap among them
           - this is by design and serves the purpose of
           eliminating the overlap with models of more distantly
           relatedHAD subfamilies caused by an overly broad single
           model [Unknown function, Enzymes of unknown
           specificity].
          Length = 162

 Score = 48.1 bits (115), Expect = 9e-07
 Identities = 29/95 (30%), Positives = 49/95 (51%), Gaps = 3/95 (3%)

Query: 70  KALPGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEVRTGKPS 129
             +PGA  L+  L   G+ + + SN      +  +   HG  + F +I+GSDE+ + KP 
Sbjct: 71  AYIPGAADLLPRLKEAGIKLGIISNGSLRAQKLLLRK-HGLGDYFELILGSDEIGS-KPE 128

Query: 130 PDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAG 164
           P+IFL A + L + P   L + D++  +   + AG
Sbjct: 129 PEIFLAALESLGVPP-EVLHVGDNLSDIKGARNAG 162


>gnl|CDD|234176 TIGR03351, PhnX-like, phosphonatase-like hydrolase.  This clade of
           sequences are the closest homologs to the PhnX enzyme,
           phosphonoacetaldehyde (Pald) hydrolase (phosphonatase,
           TIGR01422). This phosphonatase-like enzyme and PhnX
           itself are members of the haloacid dehalogenase (HAD)
           superfamily (pfam00702) having a a number of distinctive
           features that set them apart from typical HAD enzymes.
           The typical HAD N-terminal motif DxDx(T/V) here is DxAGT
           and the usual conserved lysine prior to the C-terminal
           motif is instead an arginine. Also distinctive of
           phosphonatase, and particular to its bi-catalytic
           mechanism is a conserved lysine in the variable "cap"
           domain. This lysine forms a Schiff base with the
           aldehyde of phosphonoacetaldehyde, providing, through
           the resulting positive charge, a polarization of the C-P
           bond necesary for cleavage as well as a route to the
           initial product of cleavage, an ene-amine. The
           conservation of these elements in this
           phosphonatase-like enzyme suggests that the substrate is
           also, like Pald, a 2-oxo-ethylphosphonate. Despite this,
           the genomic context of members of this family are quite
           distinct from PhnX, which is almost invariably
           associated with the 2-aminoethylphosphonate transaminase
           PhnW (TIGR02326), the source of the substrate Pald.
           Members of this clade are never associated with PhnW,
           but rather associate with families of FAD-dependent
           oxidoreductases related to deaminating amino acid
           oxidases (pfam01266) as well as zinc-dependent
           dehydrogenases (pfam00107). Notably, family members from
           Arthrobacter aurescens TC1 and Nocardia farcinica IFM
           10152 are adjacent to the PhnCDE ABC cassette
           phosphonates transporter (GenProp0236) typically found
           in association with the phosphonates C-P lyase system
           (GenProp0232). These observations suggest two
           possibilities. First, the substrate for this enzyme
           family is also Pald, the non-association with PhnW not
           withstanding. Alternatively, the substrate is something
           very closely related such as
           hydroxyphosphonoacetaldehyde (Hpald). Hpald could come
           from oxidative deamination of
           1-hydroxy-2-aminoethylphosphonate (HAEP) by the
           associated oxidase. HAEP would not be a substrate for
           PhnW due to its high specificity for AEP. HAEP has been
           shown to be a constituent of the sphingophosphonolipid
           of Bacteriovorax stolpii, and presumably has other
           natural sources. If Hpald is the substrate, the product
           would be glycoaldehyde (hydroxyacetaldehyde), and the
           associated alcohol dehydrogenase may serve to convert
           this to glycol.
          Length = 220

 Score = 48.6 bits (116), Expect = 1e-06
 Identities = 38/160 (23%), Positives = 61/160 (38%), Gaps = 21/160 (13%)

Query: 19  KEWDGREKHKIV------GKTPLEEAAIIVEDYGLPCAKHEFVNEVYSMFSDHLCKVKAL 72
             W G+ K + +            EA     D+           E  +   D      AL
Sbjct: 41  SAWMGQSKIEAIRALLAADGADEAEAQAAFADF----------EERLAEAYDDG-PPVAL 89

Query: 73  PGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGW--NESFSVIVGSDEVRTGKPSP 130
           PGA    + L   G+ +AL +   R T E ++  + GW   +    +V   +V  G+P+P
Sbjct: 90  PGAEEAFRSLRSSGIKVALTTGFDRDTAE-RLLEKLGWTVGDDVDAVVCPSDVAAGRPAP 148

Query: 131 DIFLEAAKRLNMEPSSS-LVIEDSVIGVVAGKAAGMEVVA 169
           D+ L A +   ++   S  V  D+   + AG  AG   V 
Sbjct: 149 DLILRAMELTGVQDVQSVAVAGDTPNDLEAGINAGAGAVV 188


>gnl|CDD|162787 TIGR02253, CTE7, HAD superfamily (subfamily IA) hydrolase,
           TIGR02253.  This family of sequences from archaea and
           metazoans includes the human uncharacterized protein
           CTE7. Pyrococcus species appear to have three different
           forms of this enzyme, so it is unclear whether all
           members of this family have the same function. This
           family is a member of the haloacid dehalogenase (HAD)
           superfamily of hydrolases which are characterized by
           three conserved sequence motifs. By virtue of an alpha
           helical domain in-between the first and second conserved
           motif, this family is a member of subfamily IA
           (TIGR01549).
          Length = 221

 Score = 47.8 bits (114), Expect = 2e-06
 Identities = 32/142 (22%), Positives = 58/142 (40%), Gaps = 5/142 (3%)

Query: 56  NEVYSMFSDHLCKVKALPGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFS 115
             VY+        ++  PG    +  L   G  + + ++        K+  + G  + F 
Sbjct: 80  AFVYAYHKLKFAYLRVYPGVRDTLMELRESGYRLGIITDGLPVKQWEKLE-RLGVRDFFD 138

Query: 116 VIVGSDEVRTGKPSPDIFLEAAKRLNMEPSSSLVIEDSVI-GVVAGKAAGMEVVAVPSLP 174
            ++ S+E    KP P IF  A KRL ++P  ++++ D +   +   K  GM+ V +    
Sbjct: 139 AVITSEEEGVEKPHPKIFYAALKRLGVKPEEAVMVGDRLDKDIKGAKNLGMKTVWINQGK 198

Query: 175 KQTHR---YTAADEVINSLLDL 193
                   Y   D  I+SL +L
Sbjct: 199 SSKMEDDVYPYPDYEISSLREL 220


>gnl|CDD|129317 TIGR00213, GmhB_yaeD, D,D-heptose 1,7-bisphosphate phosphatase.
           This family of proteins formerly designated yaeD
           resembles the histidinol phosphatase domain of the
           bifunctional protein HisB. The member from E. coli has
           been characterized as D,D-heptose 1,7-bisphosphate
           phosphatase, GmhB, involved in inner core LPS assembly
           (PMID:11751812) [Cell envelope, Biosynthesis and
           degradation of surface polysaccharides and
           lipopolysaccharides].
          Length = 176

 Score = 45.3 bits (107), Expect = 9e-06
 Identities = 23/69 (33%), Positives = 35/69 (50%), Gaps = 2/69 (2%)

Query: 127 KPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAA--GMEVVAVPSLPKQTHRYTAAD 184
           KP P + L+A K L+++ + S ++ D +  + AG AA     V+     P        AD
Sbjct: 106 KPKPGMLLQARKELHIDMAQSYMVGDKLEDMQAGVAAKVKTNVLVRTGKPITPEAENIAD 165

Query: 185 EVINSLLDL 193
            V+NSL DL
Sbjct: 166 WVLNSLADL 174


>gnl|CDD|233512 TIGR01656, Histidinol-ppas, histidinol-phosphate phosphatase family
           domain.  This domain is found in authentic
           histidinol-phosphate phosphatases which are sometimes
           found as stand-alone entities and sometimes as fusions
           with imidazoleglycerol-phosphate dehydratase
           (TIGR01261). Additionally, a family of proteins
           including YaeD from E. coli (TIGR00213) and various
           other proteins are closely related but may not have the
           same substrate specificity. This domain is a member of
           the haloacid-dehalogenase (HAD) superfamily of
           aspartate-nucleophile hydrolases. This superfamily is
           distinguished by the presence of three motifs: an
           N-terminal motif containing the nucleophilic aspartate,
           a central motif containing an conserved serine or
           threonine, and a C-terminal motif containing a conserved
           lysine (or arginine) and conserved aspartates. More
           specifically, the domian modelled here is a member of
           subfamily III of the HAD-superfamily by virtue of
           lacking a "capping" domain in either of the two common
           positions, between motifs 1 and 2, or between motifs 2
           and 3.
          Length = 147

 Score = 42.8 bits (101), Expect = 5e-05
 Identities = 18/46 (39%), Positives = 27/46 (58%)

Query: 127 KPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAVPS 172
           KP P + LEA KRL ++ S SLV+ D +  + A + AG+  V +  
Sbjct: 101 KPKPGLILEALKRLGVDASRSLVVGDRLRDLQAARNAGLAAVLLVD 146


>gnl|CDD|188140 TIGR01422, phosphonatase, phosphonoacetaldehyde hydrolase.  This
           enzyme catalyzes the cleavage of the carbon phosphorous
           bond of a phosphonate. The mechanism depends on the
           substrate having a carbonyl one carbon away from the
           cleavage position. This enzyme is a member of the
           Haloacid Dehalogenase (HAD) superfamily of
           aspartate-nucleophile hydrolases (pfam00702), and
           contains a modified version of the conserved catalytic
           motifs of that superfamily: the first motif is usually
           DxDx(T/V), here it is DxAxT, and in the third motif the
           normal conserved lysine is instead an arginine.
           Additionally, the enzyme contains a unique conserved
           catalytic lysine (B. cereus pos. 53) which is involved
           in the binding and activation of the substrate through
           the formation of a Schiff base. The substrate of this
           enzyme is the product of 2-aminoethylphosphonate (AEP)
           transaminase, phosphonoacetaldehyde. This degradation
           pathway for AEP may be related to its toxic properties
           which are utilized by microorganisms as a chemical
           warfare agent [Central intermediary metabolism, Other].
          Length = 253

 Score = 43.1 bits (102), Expect = 1e-04
 Identities = 26/101 (25%), Positives = 51/101 (50%), Gaps = 5/101 (4%)

Query: 73  PGANRLIKHLSCHGVPMALASNSHRATIE--SKISYQHGWNESFSVIVGSDEVRTGKPSP 130
           PGA  +I +L   G+ +   +   R  ++  +  +   G+   +   V +D+V  G+P+P
Sbjct: 102 PGAIEVIAYLRARGIKIGSTTGYTREMMDVVAPEAAAQGYRPDY--NVTADDVPAGRPAP 159

Query: 131 DIFLEAAKRLNMEPSSSLV-IEDSVIGVVAGKAAGMEVVAV 170
            + L+ A  L +   +++V + D+V  +  G+ AGM  V V
Sbjct: 160 WMALKNATELGVYDPAAVVKVGDTVPDIEEGRNAGMWTVGV 200


>gnl|CDD|223319 COG0241, HisB, Histidinol phosphatase and related phosphatases
           [Amino acid transport and metabolism].
          Length = 181

 Score = 41.1 bits (97), Expect = 2e-04
 Identities = 20/68 (29%), Positives = 34/68 (50%), Gaps = 1/68 (1%)

Query: 127 KPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGME-VVAVPSLPKQTHRYTAADE 185
           KP P + L A K  N++ S S V+ D +  + A + AG++ V+ +  +   T     A  
Sbjct: 105 KPKPGMLLSALKEYNIDLSRSYVVGDRLTDLQAAENAGIKGVLVLTGIGVTTDGAGRAKW 164

Query: 186 VINSLLDL 193
           V +SL + 
Sbjct: 165 VFDSLAEF 172


>gnl|CDD|162372 TIGR01458, HAD-SF-IIA-hyp3, HAD-superfamily subfamily IIA
           hydrolase, TIGR01458.  This hypothetical equivalog is a
           member of the IIA subfamily (TIGR01460) of the haloacid
           dehalogenase superfamily of aspartate-nucleophile
           hydrolases. One sequence (GP|10716807) has been
           annotated as a "phospholysine phosphohistidine inorganic
           pyrophosphatase," probably in reference to studies on
           similarly described (but unsequenced) enzymes from
           bovine and rat tissues. However, the supporting
           information for this annotation has never been published
           [Unknown function, Enzymes of unknown specificity].
          Length = 257

 Score = 40.6 bits (95), Expect = 6e-04
 Identities = 21/61 (34%), Positives = 30/61 (49%), Gaps = 6/61 (9%)

Query: 126 GKPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAG-KAAGMEVVAVPSLPKQTHRYTAAD 184
           GKPS   FLEA +    EP  +++I D     V G +  GM  + V     +T +Y  +D
Sbjct: 178 GKPSKTFFLEALRATGCEPEEAVMIGDDCRDDVGGAQDCGMRGIQV-----RTGKYRPSD 232

Query: 185 E 185
           E
Sbjct: 233 E 233


>gnl|CDD|223720 COG0647, NagD, Predicted sugar phosphatases of the HAD superfamily
           [Carbohydrate transport and metabolism].
          Length = 269

 Score = 39.9 bits (94), Expect = 0.001
 Identities = 16/46 (34%), Positives = 29/46 (63%), Gaps = 1/46 (2%)

Query: 126 GKPSPDIFLEAAKRLNMEPSSSLVIEDSVIG-VVAGKAAGMEVVAV 170
           GKPSP I+  A ++L ++ S  L++ D +   ++  KAAG++ + V
Sbjct: 189 GKPSPAIYEAALEKLGLDRSEVLMVGDRLDTDILGAKAAGLDTLLV 234


>gnl|CDD|236354 PRK08942, PRK08942, D,D-heptose 1,7-bisphosphate phosphatase;
           Validated.
          Length = 181

 Score = 39.0 bits (92), Expect = 0.001
 Identities = 15/38 (39%), Positives = 24/38 (63%)

Query: 127 KPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAG 164
           KP P + L  A+RLN++ + S ++ DS+  + A  AAG
Sbjct: 103 KPKPGMLLSIAERLNIDLAGSPMVGDSLRDLQAAAAAG 140


>gnl|CDD|184075 PRK13478, PRK13478, phosphonoacetaldehyde hydrolase; Provisional.
          Length = 267

 Score = 38.7 bits (91), Expect = 0.003
 Identities = 19/56 (33%), Positives = 32/56 (57%), Gaps = 1/56 (1%)

Query: 116 VIVGSDEVRTGKPSPDIFLEAAKRLNMEPSSSLV-IEDSVIGVVAGKAAGMEVVAV 170
            +V +D+V  G+P P + L+ A  L +   ++ V ++D+V G+  G  AGM  V V
Sbjct: 147 HVVTTDDVPAGRPYPWMALKNAIELGVYDVAACVKVDDTVPGIEEGLNAGMWTVGV 202


>gnl|CDD|233675 TIGR01993, Pyr-5-nucltdase, pyrimidine 5'-nucleotidase.  This
           family of proteins includes the SDT1/SSM1 gene from
           yeast which has been shown to code for a pyrimidine
           (UMP/CMP) 5'nucleotidase. The family spans plants, fungi
           and a small number of bacteria. These enzymes are
           members of the haloacid dehalogenase (HAD) superfamily
           of hydrolases, specifically the IA subfamily (variant 3,
           TIGR01509).
          Length = 183

 Score = 37.7 bits (88), Expect = 0.004
 Identities = 16/44 (36%), Positives = 26/44 (59%)

Query: 127 KPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAV 170
           KPSP  + +A +   ++P  ++  +DS   + AGKA GM+ V V
Sbjct: 140 KPSPQAYEKALREAGVDPERAIFFDDSARNIAAGKALGMKTVLV 183


>gnl|CDD|182828 PRK10907, PRK10907, intramembrane serine protease GlpG;
           Provisional.
          Length = 276

 Score = 36.5 bits (85), Expect = 0.013
 Identities = 25/63 (39%), Positives = 33/63 (52%), Gaps = 10/63 (15%)

Query: 216 WYIGGPVVKGLGRGSK-VLGIPTANLSTEGYSDVLSEHPSGVYFGWAGLSTRGVYKMVMS 274
           WY+GG V K LG G   V+ + +A LS  G+   +    SG +FG  GLS  GV   +M 
Sbjct: 159 WYLGGAVEKRLGSGKLIVITLISALLS--GW---VQSKFSGPWFG--GLS--GVVYALMG 209

Query: 275 IGW 277
             W
Sbjct: 210 YVW 212


>gnl|CDD|233462 TIGR01548, HAD-SF-IA-hyp1, haloacid dehalogenase superfamily,
           subfamily IA hydrolase, TIGR01548.  This model
           represents a small and phylogenetically curious clade of
           sequences. Sequences are found from Halobacterium (an
           archaeon), Nostoc and Synechococcus (cyanobacteria) and
           Phytophthora (a stramenophile eukaryote). These appear
           to be members of the haloacid dehalogenase (HAD)
           superfamily of aspartate-nucleophile hydrolases by
           general homology and the conservation of all of the
           recognized catalytic motifs. The variable domain is
           found in between motifs 1 and 2, indicating membership
           in subfamily I and phylogeny and prediction of the alpha
           helical nature of the variable domain (by PSI-PRED)
           indicate membership in subfamily IA. All but the
           Halobacterium sequence currently found are annotated as
           "Imidazoleglycerol-phosphate dehydratase", however, the
           source of the annotation could not be traced and
           significant homology could not be found between any of
           these sequences and known IGPD's.
          Length = 197

 Score = 35.3 bits (81), Expect = 0.028
 Identities = 24/100 (24%), Positives = 44/100 (44%), Gaps = 2/100 (2%)

Query: 64  DHLCKVKALPGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEV 123
             L + + L     L++ L      MA+ +   R     K    HG    F V +  ++ 
Sbjct: 100 LGLIEDETLLTPKGLLRELHRAPKGMAVVTGRPRKDAA-KFLTTHGLEILFPVQIWMEDC 158

Query: 124 RTGKPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAA 163
              KP+P+  + AAK L +E   + ++ D+V  ++ G+ A
Sbjct: 159 -PPKPNPEPLILAAKALGVEACHAAMVGDTVDDIITGRKA 197


>gnl|CDD|106187 PRK13225, PRK13225, phosphoglycolate phosphatase; Provisional.
          Length = 273

 Score = 35.1 bits (80), Expect = 0.037
 Identities = 30/152 (19%), Positives = 61/152 (40%), Gaps = 18/152 (11%)

Query: 19  KEWDGREKHKIVGKTPLEEAAIIVEDYGLPCAKHEFVNEVYSMFSDHLCKVKALPGANRL 78
           ++W  R   +  G +P ++A ++                V     D L  ++  PG   L
Sbjct: 105 RQWSSRTIVRRAGLSPWQQARLL--------------QRVQRQLGDCLPALQLFPGVADL 150

Query: 79  IKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEVRTGKPSPDIFLEAAK 138
           +  L    + + + S++ R  IE+ +  Q G    FSV+     + + + +     +   
Sbjct: 151 LAQLRSRSLCLGILSSNSRQNIEAFLQRQ-GLRSLFSVVQAGTPILSKRRA---LSQLVA 206

Query: 139 RLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAV 170
           R   +P++ + + D    V A +  G+  VAV
Sbjct: 207 REGWQPAAVMYVGDETRDVEAARQVGLIAVAV 238


>gnl|CDD|130495 TIGR01428, HAD_type_II, 2-haloalkanoic acid dehalogenase, type II. 
           Catalyzes the hydrolytic dehalogenation of small
           L-2-haloalkanoic acids to yield the corresponding
           D-2-hydroxyalkanoic acids. Belongs to the Haloacid
           Dehalogenase (HAD) superfamily of aspartate-nucleophile
           hydrolases (pfam00702), class (subfamily) I. Note that
           the Type I HAD enzymes have not yet been fully
           characterized, but clearly utilize a substantially
           different catalytic mechanism and are thus unlikely to
           be related.
          Length = 198

 Score = 34.6 bits (80), Expect = 0.039
 Identities = 22/92 (23%), Positives = 44/92 (47%), Gaps = 1/92 (1%)

Query: 79  IKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEVRTGKPSPDIFLEAAK 138
           ++ L   G  +A+ SN   A ++S + +  G ++ F  ++ +D VR  KP+P ++  A +
Sbjct: 101 LRALKERGYRLAILSNGSPAMLKSLVKHA-GLDDPFDAVLSADAVRAYKPAPQVYQLALE 159

Query: 139 RLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAV 170
            L + P   L +  +   +   K  G +   V
Sbjct: 160 ALGVPPDEVLFVASNPWDLGGAKKFGFKTAWV 191


>gnl|CDD|225090 COG2179, COG2179, Predicted hydrolase of the HAD superfamily
           [General function prediction only].
          Length = 175

 Score = 33.0 bits (76), Expect = 0.13
 Identities = 16/52 (30%), Positives = 24/52 (46%), Gaps = 1/52 (1%)

Query: 124 RTGKPSPDIFLEAAKRLNMEPSSSLVIEDSVI-GVVAGKAAGMEVVAVPSLP 174
           R  KP    F  A K +N+ P   +++ D +   V+ G  AGM  + V  L 
Sbjct: 90  RAKKPFGRAFRRALKEMNLPPEEVVMVGDQLFTDVLGGNRAGMRTILVEPLV 141


>gnl|CDD|233519 TIGR01668, YqeG_hyp_ppase, HAD superfamily (subfamily IIIA)
           phosphatase, TIGR01668.  This family of hypothetical
           proteins is a member of the IIIA subfamily of the
           haloacid dehalogenase (HAD) superfamily of hydrolases.
           All characterized members of this subfamily (TIGR01662)
           and most characterized members of the HAD superfamily
           are phosphatases. HAD superfamily phosphatases contain
           active site residues in several conserved catalytic
           motifs, all of which are found conserved here. This
           family consists of sequences from fungi, plants,
           cyanobacteria, gram-positive bacteria and Deinococcus.
           There is presently no characterization of any sequence
           in this family.
          Length = 170

 Score = 32.0 bits (73), Expect = 0.23
 Identities = 28/153 (18%), Positives = 47/153 (30%), Gaps = 29/153 (18%)

Query: 71  ALPGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEVRTGKPSP 130
           A P     I+ L   G  + + SN+     E +        ++  + V    V   KP  
Sbjct: 44  AYPALRDWIEELKAAGRKLLIVSNNAG---EQRAKA---VEKALGIPVLPHAV---KPPG 94

Query: 131 DIFLEAAKRLNMEPSSSLVIEDSVI-GVVAGKAAGMEVVAVPSLPKQTHRYTAADEVINS 189
             F  A   + +      V+ D +   V+ G   G   + V  L                
Sbjct: 95  CAFRRAHPEMGLTSEQVAVVGDRLFTDVMGGNRNGSYTILVEPLV--------------- 139

Query: 190 LLDLRPEKWGLPPFQDWIEGTLPSEPWYIGGPV 222
                P++W +      +E T+       GGP 
Sbjct: 140 ----HPDQWFIKRIWRRVERTVLKFLVSRGGPA 168


>gnl|CDD|233517 TIGR01662, HAD-SF-IIIA, HAD-superfamily hydrolase, subfamily IIIA. 
           This subfamily falls within the Haloacid Dehalogenase
           (HAD) superfamily of aspartate-nucleophile hydrolases.
           The Class III subfamilies are characterized by the lack
           of any domains located between either between the first
           and second conserved catalytic motifs (as in the Class I
           subfamilies, TIGR01493, TIGR01509, TIGR01488 and
           TIGR01494) or between the second and third conserved
           catalytic motifs (as in the Class II subfamilies,
           TIGR01460 and TIGR01484) of the superfamily domain. The
           IIIA subfamily contains five major clades:
           histidinol-phosphatase (TIGR01261) and
           histidinol-phosphatase-related protein (TIGR00213) which
           together form a subfamily (TIGR01656), DNA
           3'-phosphatase (TIGR01663, TIGR01664), YqeG (TIGR01668)
           and YrbI (TIGR01670). In the case of histidinol
           phosphatase and PNK-3'-phosphatase, this model
           represents a domain of a bifunctional system. In the
           histidinol phosphatase HisB, a C-terminal domain is an
           imidazoleglycerol-phosphate dehydratase which catalyzes
           a related step in histidine biosynthesis. In
           PNK-3'-phosphatase, N- and C-terminal domains constitute
           the polynucleotide kinase and DNA-binding components of
           the enzyme [Unknown function, Enzymes of unknown
           specificity].
          Length = 132

 Score = 31.2 bits (71), Expect = 0.31
 Identities = 23/106 (21%), Positives = 42/106 (39%), Gaps = 9/106 (8%)

Query: 73  PGANRLIKHLSCHGVPMALASNS----HRATIESKISYQ-HGWNESFSVIVGSDEVRTGK 127
           P     +  L   G  + + +N            +++ +         ++      R  K
Sbjct: 28  PEVPDALAELKEAGYKVVIVTNQSGIGRGKFSSGRVARRLEELGVPIDILYACPHCR--K 85

Query: 128 PSPDIFLEAAKRLN-MEPSSSLVIEDSVI-GVVAGKAAGMEVVAVP 171
           P P +FLEA KR N ++P  S+ + D  +  + A K AG+  + V 
Sbjct: 86  PKPGMFLEALKRFNEIDPEESVYVGDQDLTDLQAAKRAGLAFILVA 131


>gnl|CDD|233800 TIGR02247, HAD-1A3-hyp, epoxide hydrolase N-terminal domain-like
           phosphatase.  This model represents a small clade of
           sequences including C. elegans and mammalian sequences
           as well as a small number of bacteria. In eukaryotes,
           this domain exists as an N-terminal fusion to the
           soluble epoxide hydrolase enzyme and has recently been
           shown to be an active phosphatase, although the nature
           of the biological substrate is unclear. These appear to
           be members of the haloacid dehalogenase (HAD)
           superfamily of aspartate-nucleophile hydrolases by
           general homology and the conservation of all of the
           recognized catalytic motifs (although the first motif is
           unusual in the replacement of the more common aspartate
           with glycine...). The variable domain is found in
           between motifs 1 and 2, indicating membership in
           subfamily I and phylogeny and prediction of the alpha
           helical nature of the variable domain (by PSI-PRED)
           indicate membership in subfamily IA.
          Length = 211

 Score = 31.7 bits (72), Expect = 0.44
 Identities = 24/113 (21%), Positives = 40/113 (35%), Gaps = 5/113 (4%)

Query: 69  VKALPGANRLIKHLSCHGVPMALASNS---HRATIESKISYQHGWNESFSVIVGSDEVRT 125
            K  P     IK L   G   A  +N+     +  E+ +         F  +V S     
Sbjct: 93  TKLRPSMMAAIKTLRAKGFKTACITNNFPTDHSAEEALLPGDIM--ALFDAVVESCLEGL 150

Query: 126 GKPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAVPSLPKQTH 178
            KP P I+    +RL + P   + ++D    +    A G+  + V    +  H
Sbjct: 151 RKPDPRIYQLMLERLGVAPEECVFLDDLGSNLKPAAALGITTIKVSDEEQAIH 203


>gnl|CDD|162788 TIGR02254, YjjG/YfnB, HAD superfamily (subfamily IA) hydrolase,
           TIGR02254.  This family consists of uncharacterized
           proteobacterial and gram positive bacterial sequences
           including YjjG from E. coli and YfnB from B. subtilis.
           This family is a member of the haloacid dehalogenase
           (HAD) superfamily of hydrolases which are characterized
           by three conserved sequence motifs. By virtue of an
           alpha helical domain in-between the first and second
           conserved motif, this family is a member of subfamily IA
           (TIGR01549). Most likely, these enzymes are
           phosphatases.
          Length = 224

 Score = 31.7 bits (72), Expect = 0.44
 Identities = 26/100 (26%), Positives = 46/100 (46%), Gaps = 6/100 (6%)

Query: 72  LPGANRLIKHLSCHG-VPMALASNSHRATIESKISYQHGWNESFSVIVGSDEVRTGKPSP 130
           LPGA  L+++L       + + +N  R T   ++  + G    F  I  S++    KP  
Sbjct: 99  LPGAFELMENL--QQKFRLYIVTNGVRETQYKRLR-KSGLFPFFDDIFVSEDAGIQKPDK 155

Query: 131 DIFLEAAKRL-NMEPSSSLVIEDSVIG-VVAGKAAGMEVV 168
           +IF  A +R+        L+I DS+   +  G+ AG++  
Sbjct: 156 EIFNYALERMPKFSKEEVLMIGDSLTADIKGGQNAGLDTC 195


>gnl|CDD|130524 TIGR01457, HAD-SF-IIA-hyp2, HAD-superfamily subfamily IIA
           hydrolase, TIGR01457.  This hypothetical equivalog is a
           member of the Class IIA subfamily of the haloacid
           dehalogenase superfamily of aspartate-nucleophile
           hydrolases. The sequences modelled by this equivalog are
           all gram positive (low-GC) bacteria. Sequences found in
           This model are annotated variously as related to NagD or
           4-nitrophenyl phosphatase, and this hypothetical
           equivalog, of all of those within the Class IIA
           subfamily, is most closely related to the E. coli NagD
           enzyme and the PGP_euk equivalog (TIGR01452). However,
           there is presently no evidence that this hypothetical
           equivalog has the same function of either those [Unknown
           function, Enzymes of unknown specificity].
          Length = 249

 Score = 31.7 bits (72), Expect = 0.47
 Identities = 16/57 (28%), Positives = 29/57 (50%), Gaps = 1/57 (1%)

Query: 115 SVIVGSDEVRTGKPSPDIFLEAAKRLNMEPSSSLVIEDS-VIGVVAGKAAGMEVVAV 170
           +V  G   V  GKP   I  +A + L  +   +L++ D+    ++AG  AG++ + V
Sbjct: 166 TVSTGVKPVFIGKPESIIMEQAMRVLGTDVEETLMVGDNYATDIMAGINAGIDTLLV 222


>gnl|CDD|215296 PLN02540, PLN02540, methylenetetrahydrofolate reductase.
          Length = 565

 Score = 31.6 bits (72), Expect = 0.62
 Identities = 25/105 (23%), Positives = 39/105 (37%), Gaps = 32/105 (30%)

Query: 176 QTHRYTAADEVINSLLDLRPEKWGLPP---------FQDWIEGTLPSEPWYIGGPVVKGL 226
           Q  R  A D+       L+ E WG+P          F  +  G L S PW      + GL
Sbjct: 352 QFMRPRARDK------KLQAE-WGVPLKSVEDVYEVFAKYCLGKLKSSPW----SELDGL 400

Query: 227 GRGSKVLGIPTANLSTEGY---------SDVLSEHPSGVYFGWAG 262
              +K++      ++ +G+         +   S+ PS    GW G
Sbjct: 401 QPETKIINEQLVKINRKGFLTINSQPAVNGEKSDSPS---VGWGG 442


>gnl|CDD|180686 PRK06769, PRK06769, hypothetical protein; Validated.
          Length = 173

 Score = 30.9 bits (70), Expect = 0.68
 Identities = 12/44 (27%), Positives = 21/44 (47%)

Query: 127 KPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAV 170
           KPS  + L+AA++  ++ +   VI D    +VA        + V
Sbjct: 93  KPSTGMLLQAAEKHGLDLTQCAVIGDRWTDIVAAAKVNATTILV 136


>gnl|CDD|233420 TIGR01452, PGP_euk, phosphoglycolate/pyridoxal phosphate
           phosphatase family.  PGP is an essential enzyme in the
           glycolate salvage pathway in higher organisms
           (photorespiration in plants). Phosphoglycolate results
           from the oxidase activity of RubisCO in the Calvin cycle
           when concentrations of carbon dioxide are low relative
           to oxygen. In mammals, PGP is found in many tissues,
           notably in red blood cells where P-glycolate is and
           important activator of the hydrolysis of
           2,3-bisphosphoglycerate, a major modifier of the oxygen
           affinity of hemoglobin. Pyridoxal phosphate (PLP,
           Vitamin B6) phosphatase is involved in the degradation
           of PLP in mammals and is widely distributed in human
           tissues including erythrocyes. The enzymes described
           here are members of the Haloacid dehalogenase
           superfamily of hydrolase enzymes (pfam00702). Unlike the
           bacterial PGP equivalog (TIGR01449), which is a member
           of class (subfamily) I, these enzymes are members of
           class (subfamily) II. These two families have almost
           certainly arisen from convergent evolution (although
           these two ancestors may themselves have diverged from a
           more distant HAD superfamily progenitor). The primary
           seed sequence for this model comes from Chlamydomonas
           reinhardtii, a photosynthetic alga. The enzyme has been
           purified and characterized and these data are fully
           consistent with the assignment of function as a PGPase
           involved in photorespiration. The second seed, from Homo
           sapiens chromosome 22 has been characterized as a
           pyridoxal phosphatase. Biochemical characterization of
           partially purified PGP's from various tissues including
           red blood cells have been performed while one gene for
           PGP has been localized to chromosome 16p13.3. The
           sequence used here maps to chromosome 22. There is
           indeed a related gene on chromosome 16 (and it is
           expressed, since EST's are found) which shows 46%
           identity and 59% positives by BLAST2 (E=1e-66). The
           chromosome 16 gene is not in evidence in nraa but
           translated from the genomic sequence would score 372.4
           (E=7.9e-113) versus This model, well above trusted. The
           third seed, from C. elegans, is only supported by
           sequence similarity. This model is limited to eukaryotic
           species including S. pombe and S. cerevisiae, although
           several archaea score between the trusted and noise
           cutoffs. This model is closely related to a family of
           bacterial sequences including the E. coli NagD and B.
           subtilus AraL genes which are characterized by the
           ability to hydrolyze para-nitrophenylphosphate (pNPPases
           or NPPases). The chlamydomonas PGPase d.
          Length = 279

 Score = 31.0 bits (70), Expect = 0.96
 Identities = 19/72 (26%), Positives = 34/72 (47%), Gaps = 4/72 (5%)

Query: 119 GSDEVRTGKPSPDIFLEAAKRLNMEPSSSLVIEDSV-IGVVAGKAAGMEVVAVPSLPKQT 177
           G   +  GKPSP +F    +  +++P+ +L++ D +   ++ G   GM  V V S     
Sbjct: 194 GRQPLVVGKPSPYMFECITENFSIDPARTLMVGDRLETDILFGHRCGMTTVLVLS---GV 250

Query: 178 HRYTAADEVINS 189
            R   A E + +
Sbjct: 251 SRLEEAQEYLAA 262


>gnl|CDD|233422 TIGR01460, HAD-SF-IIA, Haloacid Dehalogenase Superfamily Class
           (subfamily) IIA.  This model represents one structural
           subclass of the Haloacid Dehalogenase (HAD) superfamily
           of aspartate-nucleophile hydrolases. The superfamily is
           defined by the presence of three short catalytic motifs.
           The classes are defined based on the location and the
           observed or predicted fold of a so-called "capping
           domain", or the absence of such a domain. Class I
           consists of sequences in which the capping domain is
           found in between the first and second catalytic motifs.
           Class II consists of sequences in which the capping
           domain is found between the second and third motifs.
           Class III sequences have no capping domain in iether of
           these positions. The Class IIA capping domain is
           predicted by PSI-PRED to consist of a mixed alpha-beta
           fold with the basic pattern:
           Helix-Helix-Helix-Sheet-Helix-Loop-Sheet-Helix-Sheet-
           Helix. Presently, this subfamily encompasses a single
           equivalog model (TIGR01452) for the eukaryotic
           phosphoglycolate phosphatase, as well as four
           hypothetical equivalogs covering closely related
           sequences (TIGR01456 and TIGR01458 in eukaryotes,
           TIGR01457 in gram positive bacteria and TIGR01459 in
           gram negative bacteria). The Escherishia coli NagD gene
           and the Bacillus subtilus AraL gene are members of this
           subfamily but are not members of the any of the
           presently defined equivalogs within it. NagD is part of
           the NAG operon responsible for N-acetylglucosamine
           metabolism. The function of this gene is unknown. Genes
           from several organisms have been annotated as NagD, or
           NagD-like. However, without data on the presence of
           other members of this pathway, (such as in the case of
           Yersinia pestis) these assignments should not be given
           great weight. The AraL gene is similar: it is part of
           the L-arabinose operon, but the function is unknown. A
           gene from Halobacterium has been annotated as AraL, but
           no other Ara operon genes have been annotated. Many of
           the genes in this subfamily have been annotated as
           "pNPPase" "4-nitrophenyl phosphatase" or "NPPase". These
           all refer to the same activity versus a common lab test
           compound used to determine phosphatase activity. There
           is no evidence that this activity is physiologically
           relevant [Unknown function, Enzymes of unknown
           specificity].
          Length = 236

 Score = 30.8 bits (70), Expect = 1.0
 Identities = 15/54 (27%), Positives = 24/54 (44%), Gaps = 2/54 (3%)

Query: 119 GSDEVRTGKPSPDIFLEAAKRLNMEPSSSLVI--EDSVIGVVAGKAAGMEVVAV 170
           G +    GKPSP I+  A   L   P    V+  ++    ++  K AG + + V
Sbjct: 180 GREPTVVGKPSPAIYRAALNLLQARPERRDVMVGDNLRTDILGAKNAGFDTLLV 233


>gnl|CDD|151335 pfam10886, DUF2685, Protein of unknown function (DUF2685).
          Members in this family of proteins are annotated as
          uvdY.-2 which is an open reading frame within uvsY.
          However currently there is no known function.
          Length = 54

 Score = 27.5 bits (61), Expect = 1.6
 Identities = 10/31 (32%), Positives = 17/31 (54%), Gaps = 4/31 (12%)

Query: 26 KHKIVGKTPLEEAAIIVEDYGL----PCAKH 52
             +V K P+E+A  +  +YG     PCA++
Sbjct: 2  AICVVCKQPVEKALAVDTEYGPVHPGPCAQY 32


>gnl|CDD|182466 PRK10444, PRK10444, UMP phosphatase; Provisional.
          Length = 248

 Score = 29.8 bits (67), Expect = 2.2
 Identities = 17/57 (29%), Positives = 29/57 (50%), Gaps = 1/57 (1%)

Query: 117 IVGSDEVRTGKPSPDIFLEAAKRLNMEPSSSLVIEDSV-IGVVAGKAAGMEVVAVPS 172
           I G      GKPSP I   A  ++      ++++ D++   ++AG  AG+E + V S
Sbjct: 164 ISGRKPFYVGKPSPWIIRAALNKMQAHSEETVIVGDNLRTDILAGFQAGLETILVLS 220


>gnl|CDD|219943 pfam08631, SPO22, Meiosis protein SPO22/ZIP4 like.  SPO22/ZIP4 in
           yeast is a meiosis specific protein involved in
           sporulation. It has been shown to regulate crossover
           distribution by promoting synaptonemal complex
           formation.
          Length = 280

 Score = 29.7 bits (67), Expect = 2.4
 Identities = 14/61 (22%), Positives = 27/61 (44%), Gaps = 1/61 (1%)

Query: 288 IEPWLLHEFDEDFYDEELHLVIVGYIRPEANFPSLETLIAKIHE-DRKVAERALDLPLYS 346
           +E        E+ Y++ L  +I      E+NF    + I K+ +     A + LD  L++
Sbjct: 128 LEILKKRPGPEEEYEDVLMRMIKSVDVTESNFELAISHINKLSDKAPASAAKCLDYLLFN 187

Query: 347 K 347
           +
Sbjct: 188 R 188


>gnl|CDD|181865 PRK09449, PRK09449, dUMP phosphatase; Provisional.
          Length = 224

 Score = 29.1 bits (66), Expect = 3.4
 Identities = 20/66 (30%), Positives = 31/66 (46%), Gaps = 6/66 (9%)

Query: 70  KALPGANRLIKHLSCHG-VPMALASNSHRATIESKISYQH-GWNESFSVIVGSDEVRTGK 127
             LPGA  L+  L   G V M + +N    T   ++  +  G  + F ++V S++V   K
Sbjct: 95  TPLPGAVELLNAL--RGKVKMGIITNGF--TELQQVRLERTGLRDYFDLLVISEQVGVAK 150

Query: 128 PSPDIF 133
           P   IF
Sbjct: 151 PDVAIF 156


>gnl|CDD|233980 TIGR02712, urea_carbox, urea carboxylase.  Members of this family are
            ATP-dependent urea carboxylase, including characterized
            members from Oleomonas sagaranensis (alpha class
            Proteobacterium) and yeasts such as Saccharomyces
            cerevisiae. The allophanate hydrolase domain of the yeast
            enzyme is not included in this model and is represented
            by an adjacent gene in Oleomonas sagaranensis. The fusion
            of urea carboxylase and allophanate hydrolase is
            designated urea amidolyase. The enzyme from Oleomonas
            sagaranensis was shown to be highly active on acetamide
            and formamide as well as urea [Central intermediary
            metabolism, Nitrogen metabolism].
          Length = 1201

 Score = 29.2 bits (66), Expect = 4.2
 Identities = 14/33 (42%), Positives = 16/33 (48%), Gaps = 4/33 (12%)

Query: 277  WNPYFDNAEKTIEPWLLHEFDE-DFY---DEEL 305
            WN Y         PWLL  FD+  FY   +EEL
Sbjct: 1019 WNRYRLGGAFQDGPWLLRFFDQIRFYPVSEEEL 1051


>gnl|CDD|223343 COG0265, DegQ, Trypsin-like serine proteases, typically
           periplasmic, contain C-terminal PDZ domain
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 347

 Score = 28.7 bits (64), Expect = 4.4
 Identities = 13/47 (27%), Positives = 22/47 (46%), Gaps = 3/47 (6%)

Query: 198 WGLPPFQDWIEGTLPSEPWYIGGPVVKGLGRGSKVLGIPTANLSTEG 244
                + ++I+      P   GGP+V   G   +V+GI TA ++  G
Sbjct: 176 GSAGGYVNFIQTDAAINPGNSGGPLVNIDG---EVVGINTAIIAPSG 219


>gnl|CDD|236761 PRK10795, PRK10795, penicillin-binding protein 2; Provisional.
          Length = 634

 Score = 28.6 bits (64), Expect = 6.4
 Identities = 13/39 (33%), Positives = 22/39 (56%), Gaps = 1/39 (2%)

Query: 191 LDLRPEKWGLPPFQDWIEGTLPSEPWYIGGPVVKGLGRG 229
           +DL  E+ G  P ++W +     +PWY G  +  G+G+G
Sbjct: 420 IDLAEERSGNMPTREWKQKRF-KKPWYQGDTIPVGIGQG 457


>gnl|CDD|224221 COG1302, COG1302, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 131

 Score = 27.2 bits (61), Expect = 6.9
 Identities = 13/33 (39%), Positives = 16/33 (48%), Gaps = 1/33 (3%)

Query: 142 MEPSSSLVIEDSVIGVVAGKAAGMEVVAVPSLP 174
            E    + I D VI V+AG AA  EV  V  + 
Sbjct: 6   NEELGKIEISDEVIAVIAGIAA-EEVEGVVGMA 37


>gnl|CDD|132589 TIGR03550, F420_cofG, 7,8-didemethyl-8-hydroxy-5-deazariboflavin
           synthase, CofG subunit.  This model represents either a
           subunit or a domain, depending on whether or not the
           genes are fused, of a bifunctional protein that
           completes the synthesis of
           7,8-didemethyl-8-hydroxy-5-deazariboflavin, or FO. FO is
           the chromophore of coenzyme F(420), involved in
           methanogenesis in methanogenic archaea but found in
           certain other lineages as well. The chromophore also
           occurs as a cofactor in DNA photolyases in Cyanobacteria
           [Biosynthesis of cofactors, prosthetic groups, and
           carriers, Other].
          Length = 322

 Score = 28.0 bits (63), Expect = 7.2
 Identities = 14/46 (30%), Positives = 25/46 (54%), Gaps = 6/46 (13%)

Query: 312 YIRPEANFPSLETLIAKIHEDRKV--AERALDLPLYSKYRDDPYLK 355
           ++ PEA +P ++ L A+  E+      ER   LP+Y +Y  + +L 
Sbjct: 273 HVNPEAPWPEIDEL-ARATEEAGFTLKER---LPVYPEYVREGWLS 314


>gnl|CDD|232841 TIGR00131, gal_kin, galactokinase.  Galactokinase is a member of
           the GHMP kinases (Galactokinase, Homoserine kinase,
           Mevalonate kinase, Phosphomevalonate kinase) and shares
           with them an amino-terminal domain probably related to
           ATP binding.The galactokinases found by This model are
           divided into two sets. Prokaryotic forms are generally
           shorter. The eukaryotic forms are longer because of
           additional central regions and in some cases are known
           to be bifunctional, with regulatory activities that are
           independent of galactokinase activity [Energy
           metabolism, Sugars].
          Length = 386

 Score = 27.9 bits (62), Expect = 9.6
 Identities = 11/20 (55%), Positives = 12/20 (60%)

Query: 332 DRKVAERALDLPLYSKYRDD 351
           D K AER+LDLPL      D
Sbjct: 69  DNKFAERSLDLPLDGSEVSD 88


>gnl|CDD|180485 PRK06245, cofG, FO synthase subunit 1; Reviewed.
          Length = 336

 Score = 27.6 bits (62), Expect = 9.9
 Identities = 16/45 (35%), Positives = 24/45 (53%), Gaps = 6/45 (13%)

Query: 312 YIRPEANFPSLETLIAKIHEDRKVA--ERALDLPLYSKYRDDPYL 354
           Y+ PE  +P +E L  +I E+      ER   LP+Y KY  + +L
Sbjct: 277 YVNPEYPWPDIEEL-REILEEAGWPLKER---LPVYPKYIKEGWL 317


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.316    0.136    0.414 

Gapped
Lambda     K      H
   0.267   0.0872    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 19,117,862
Number of extensions: 1888366
Number of successful extensions: 1772
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1729
Number of HSP's successfully gapped: 81
Length of query: 362
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 264
Effective length of database: 6,590,910
Effective search space: 1740000240
Effective search space used: 1740000240
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 60 (26.6 bits)