RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= 017995
(362 letters)
>gnl|CDD|178528 PLN02940, PLN02940, riboflavin kinase.
Length = 382
Score = 676 bits (1747), Expect = 0.0
Identities = 286/354 (80%), Positives = 316/354 (89%)
Query: 2 TDGMFSEVLKTFLVKYGKEWDGREKHKIVGKTPLEEAAIIVEDYGLPCAKHEFVNEVYSM 61
TDG+ S+VLK FLVKYGK+WDGRE KIVGKTPLE AA +VEDYGLPC+ EF +E+ +
Sbjct: 25 TDGIVSDVLKAFLVKYGKQWDGREAQKIVGKTPLEAAATVVEDYGLPCSTDEFNSEITPL 84
Query: 62 FSDHLCKVKALPGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSD 121
S+ C +KALPGANRLIKHL HGVPMALASNS RA IE+KIS GW ESFSVIVG D
Sbjct: 85 LSEQWCNIKALPGANRLIKHLKSHGVPMALASNSPRANIEAKISCHQGWKESFSVIVGGD 144
Query: 122 EVRTGKPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAVPSLPKQTHRYT 181
EV GKPSPDIFLEAAKRLN+EPS+ LVIEDS+ GV+AGKAAGMEV+AVPS+PKQTH Y+
Sbjct: 145 EVEKGKPSPDIFLEAAKRLNVEPSNCLVIEDSLPGVMAGKAAGMEVIAVPSIPKQTHLYS 204
Query: 182 AADEVINSLLDLRPEKWGLPPFQDWIEGTLPSEPWYIGGPVVKGLGRGSKVLGIPTANLS 241
+ADEVINSLLDL+PEKWGLPPF DWIEGTLP EPW+IGGPV+KG GRGSKVLGIPTANLS
Sbjct: 205 SADEVINSLLDLQPEKWGLPPFNDWIEGTLPIEPWHIGGPVIKGFGRGSKVLGIPTANLS 264
Query: 242 TEGYSDVLSEHPSGVYFGWAGLSTRGVYKMVMSIGWNPYFDNAEKTIEPWLLHEFDEDFY 301
TE YSDVLSEHPSGVYFGWAGLSTRGVYKMVMSIGWNPYF+N EKTIEPWLLH+F EDFY
Sbjct: 265 TENYSDVLSEHPSGVYFGWAGLSTRGVYKMVMSIGWNPYFNNTEKTIEPWLLHDFGEDFY 324
Query: 302 DEELHLVIVGYIRPEANFPSLETLIAKIHEDRKVAERALDLPLYSKYRDDPYLK 355
EEL LVIVGYIRPEANFPSLE+LIAKIHEDR++AE+ALDLPLY+KY+DDPYL
Sbjct: 325 GEELRLVIVGYIRPEANFPSLESLIAKIHEDRRIAEKALDLPLYAKYKDDPYLT 378
>gnl|CDD|178407 PLN02811, PLN02811, hydrolase.
Length = 220
Score = 176 bits (447), Expect = 1e-53
Identities = 92/216 (42%), Positives = 129/216 (59%), Gaps = 17/216 (7%)
Query: 2 TDGMFSEVLKTFLVKYGKEWDGREKHKIVGKTPLEEAAIIVEDYGLP--CAKHEFVNEVY 59
T+ ++EV + L +YGK +D K K++GK +E A I VE+ GL + +F+ E
Sbjct: 8 TEKFYTEVQEKILARYGKTFDWSLKAKMMGKKAIEAARIFVEESGLSDSLSPEDFLVERE 67
Query: 60 SMFSDHLCKVKALPGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFS---- 115
+M D +PGA RL++HL G+P+A+A+ SH+ + K +HG E FS
Sbjct: 68 AMLQDLFPTSDLMPGAERLVRHLHAKGIPIAIATGSHKRHFDLKTQ-RHG--ELFSLMHH 124
Query: 116 VIVGSD-EVRTGKPSPDIFLEAAKRL---NMEPSSSLVIEDSVIGVVAGKAAGMEVVAVP 171
V+ G D EV+ GKP+PDIFL AA+R ++P LV ED+ GV A K AGM VV VP
Sbjct: 125 VVTGDDPEVKQGKPAPDIFLAAARRFEDGPVDPGKVLVFEDAPSGVEAAKNAGMSVVMVP 184
Query: 172 --SLPKQTHRYTAADEVINSLLDLRPEKWGLPPFQD 205
L K + AD+V++SLLD +PE+WGLPPF D
Sbjct: 185 DPRLDKSYCK--GADQVLSSLLDFKPEEWGLPPFPD 218
>gnl|CDD|190069 pfam01687, Flavokinase, Riboflavin kinase. This family represents
the C-terminal region of the bifunctional riboflavin
biosynthesis protein known as RibC in Bacillus subtilis.
The RibC protein from Bacillus subtilis has both
flavokinase and flavin adenine dinucleotide synthetase
(FAD-synthetase) activities. RibC plays an essential
role in flavin metabolism. This domain is thought to
have kinase activity.
Length = 125
Score = 138 bits (350), Expect = 2e-40
Identities = 51/126 (40%), Positives = 70/126 (55%), Gaps = 6/126 (4%)
Query: 215 PWYIGGPVVKGLGRGSKVLGIPTANLSTEGYSDVLSEHPSGVYFGWAGLSTRGVYKMVMS 274
P+ I G VV G GRG LG PTANLS + +GVY + + VY V +
Sbjct: 5 PYSISGTVVHGKGRGRT-LGFPTANLSLPKDKLL---PKNGVYAVRVKIDGK-VYPGVAN 59
Query: 275 IGWNPYFDNAEKTIEPWLLHEFDEDFYDEELHLVIVGYIRPEANFPSLETLIAKIHEDRK 334
IG+NP F + TIE +L +FD D Y EE+ + + ++RPE F SLE L A+I +D +
Sbjct: 60 IGYNPTFGGKKPTIEVHIL-DFDGDLYGEEIRVEFLKFLRPEKKFDSLEELKAQIKKDIE 118
Query: 335 VAERAL 340
A + L
Sbjct: 119 QARKIL 124
>gnl|CDD|214901 smart00904, Flavokinase, Riboflavin kinase. Riboflavin is
converted into catalytically active cofactors (FAD and
FMN) by the actions of riboflavin kinase, which converts
it into FMN, and FAD synthetase, which adenylates FMN to
FAD. Eukaryotes usually have two separate enzymes, while
most prokaryotes have a single bifunctional protein that
can carry out both catalyses, although exceptions occur
in both cases. While eukaryotic monofunctional
riboflavin kinase is orthologous to the bifunctional
prokaryotic enzyme, the monofunctional FAD synthetase
differs from its prokaryotic counterpart, and is instead
related to the PAPS-reductase family. The bacterial FAD
synthetase that is part of the bifunctional enzyme has
remote similarity to nucleotidyl transferases and,
hence, it may be involved in the adenylylation reaction
of FAD synthetases. This entry represents riboflavin
kinase, which occurs as part of a bifunctional enzyme or
a stand-alone enzyme.
Length = 124
Score = 134 bits (339), Expect = 8e-39
Identities = 43/127 (33%), Positives = 62/127 (48%), Gaps = 7/127 (5%)
Query: 215 PWYIGGPVVKGLGRGSKVLGIPTANLSTEGYSDVLSEHPSGVYFGWAGLSTRGVYKMVMS 274
P+ I G VV G RG LG PTANL + + +GVY + +Y V +
Sbjct: 5 PYSISGRVVHGDKRGRT-LGFPTANLPLDDRLLLP---KNGVYAVRVRV-DGKIYPGVAN 59
Query: 275 IGWNPYFDNAEKTIEPWLLHEFDEDFYDEELHLVIVGYIRPEANFPSLETLIAKIHEDRK 334
IG P F ++++E +L F D Y EE+ + + +IR E F SL+ L A+I D +
Sbjct: 60 IGTRPTFGG-DRSVEVHILD-FSGDLYGEEIEVEFLKFIRDEQKFDSLDELKAQISRDIE 117
Query: 335 VAERALD 341
A L
Sbjct: 118 EAREYLA 124
>gnl|CDD|223710 COG0637, COG0637, Predicted phosphatase/phosphohexomutase [General
function prediction only].
Length = 221
Score = 118 bits (298), Expect = 1e-31
Identities = 60/191 (31%), Positives = 96/191 (50%), Gaps = 3/191 (1%)
Query: 8 EVLKTFLVKYGKEWDGREKHKIVGKTPLEEAAIIVEDY-GLPCAKHEFV-NEVYSMFSDH 65
L +YG E E ++ G ++ + G A + +Y +
Sbjct: 22 RAWLEALKEYGIEISDEEIRELHGGGIARIIDLLRKLAAGEDPADLAELERLLYEAEALE 81
Query: 66 LCKVKALPGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEVRT 125
L +K +PG L++ L G+P+A+AS+S R E ++ + G + F VIV +D+V
Sbjct: 82 LEGLKPIPGVVELLEQLKARGIPLAVASSSPRRAAE-RVLARLGLLDYFDVIVTADDVAR 140
Query: 126 GKPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAVPSLPKQTHRYTAADE 185
GKP+PDI+L AA+RL ++P +V+EDS G+ A KAAGM VV VP+ + H
Sbjct: 141 GKPAPDIYLLAAERLGVDPEECVVVEDSPAGIQAAKAAGMRVVGVPAGHDRPHLDPLDAH 200
Query: 186 VINSLLDLRPE 196
+++L E
Sbjct: 201 GADTVLLDLAE 211
>gnl|CDD|223274 COG0196, RibF, FAD synthase [Coenzyme metabolism].
Length = 304
Score = 109 bits (275), Expect = 2e-27
Identities = 44/128 (34%), Positives = 63/128 (49%), Gaps = 10/128 (7%)
Query: 215 PWYIGGPVVKGLGRGSKVLGIPTANLSTEGYSDVLSEHPSGVYFGWAGLSTRGVYKMVMS 274
P+ I G VV G G LG PTAN+ + GVY L GVY V +
Sbjct: 185 PYSIEGKVVHGQKLGRT-LGFPTANIYLKDNVLP----AFGVYAVRVKL-DGGVYPGVAN 238
Query: 275 IGWNPYFDNAEKTIEPWLLHEFDEDFYDEELHLVIVGYIRPEANFPSLETLIAKIHEDRK 334
+G+ P D +E+++E +L F+ D Y E + + + +IR E F SL+ L +I +D
Sbjct: 239 VGYRPTVDGSERSLEVHILD-FNGDLYGERVKVRFLKFIRDEKKFDSLDELKEQIEKD-- 295
Query: 335 VAERALDL 342
ERA L
Sbjct: 296 -IERARKL 302
>gnl|CDD|213672 TIGR01990, bPGM, beta-phosphoglucomutase. This model represents
the beta-phosphoglucomutase enzyme which catalyzes the
interconversion of beta-D-glucose-1-phosphate and
beta-D-glucose-6-phosphate. The 6-phosphate is capable
of non-enzymatic anomerization (alpha <-> beta) while
the 1-phosphate is not. A separate enzyme is responsible
for the isomerization of the alpha anomers.
Beta-D-glucose-1-phosphate results from the
phosphorolysis of maltose (2.4.1.8), trehalose
(2.4.1.64) or trehalose-6-phosphate (2.4.1.216).
Alternatively, these reactions can be run in the
synthetic direction to create the disaccharides. All
sequenced genomes which contain a member of this family
also appear to contain at least one putative maltose or
trehalose phosphorylase. Three species, Lactococcus,
Enterococcus and Neisseria appear to contain a pair of
paralogous beta-PGM's. Beta-phosphoglucomutase is a
member of the haloacid dehalogenase superfamily of
hydrolase enzymes. These enzymes are characterized by a
series of three catalytic motifs positioned within an
alpha-beta (Rossmann) fold. beta-PGM contains an inserted
alpha helical domain in between the first and second
conserved motifs and thus is a member of subfamily IA of
the superfamily. The third catalytic motif comes in
three variants, the third of which, containing a
conserved DD or ED, is the only one found here as well
as in several other related enzymes (TIGR01509). The
enzyme from L. lactis has been extensively characterized
including a remarkable crystal structure which traps the
pentacoordinate transition state [Energy metabolism,
Biosynthesis and degradation of polysaccharides].
Length = 185
Score = 92.4 bits (230), Expect = 3e-22
Identities = 45/116 (38%), Positives = 60/116 (51%), Gaps = 5/116 (4%)
Query: 56 NEVYSMFSDHLCKVKALPGANRLIKHLSCHGVPMALASNSHRA-TIESKISYQHGWNESF 114
N+ Y L LPG L+ L + + +ALAS S A TI K+ + F
Sbjct: 73 NDYYVELLKELTPADVLPGIKSLLADLKKNNIKIALASASKNAPTILEKL----ELIDYF 128
Query: 115 SVIVGSDEVRTGKPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAV 170
IV E++ GKP P+IFL AA+ L + PS + IED+ G+ A KAAGM V V
Sbjct: 129 DAIVDPAELKKGKPDPEIFLAAAEGLGVSPSECIGIEDAQAGIEAIKAAGMFAVGV 184
>gnl|CDD|213673 TIGR02009, PGMB-YQAB-SF, beta-phosphoglucomutase family hydrolase.
This subfamily model groups together three clades: the
characterized beta-phosphoglucomutases (including those
from E. coli, B. subtilis and L. lactis, TIGR01990), a
clade of putative bPGM's from mycobacteria and a clade
including the uncharacterized E. coli and H. influenzae
yqaB genes which may prove to be beta-mutases of a
related 1-phosphosugar. All of these are members of the
larger Haloacid dehalogenase (HAD) subfamily IA and
include the "variant 3" glu-asp version of the third
conserved HAD domain (TIGR01509).
Length = 185
Score = 92.0 bits (229), Expect = 4e-22
Identities = 58/176 (32%), Positives = 87/176 (49%), Gaps = 12/176 (6%)
Query: 2 TDGMFSEVLKTFLVKYGKEWDGREKHKIVGKTPLEEAAIIVEDYGLPCA---KH---EFV 55
T + ++ K KYG +D + + G + + I++ G + H E
Sbjct: 15 TAPLHAQAWKHIAAKYGISFDKQYNESLKGLSREDILRAILKLRGDGLSLEEIHQLAERK 74
Query: 56 NEVYSMFSDHLCKVKALPGANRLIKHLSCHGVPMALASNSHRA-TIESKISYQHGWNESF 114
NE+Y L V LPG L+K L G+ + L S+S A I +K+ G + F
Sbjct: 75 NELYRE-LLRLTGVAVLPGIRNLLKRLKAKGIAVGLGSSSKNAPRILAKL----GLRDYF 129
Query: 115 SVIVGSDEVRTGKPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAV 170
IV + EV+ GKP P+ FL AA+ L + P+ +V ED++ GV A +AAGM VAV
Sbjct: 130 DAIVDASEVKNGKPHPETFLLAAELLGVPPNECIVFEDALAGVQAARAAGMFAVAV 185
>gnl|CDD|235536 PRK05627, PRK05627, bifunctional riboflavin kinase/FMN
adenylyltransferase; Reviewed.
Length = 305
Score = 90.6 bits (226), Expect = 1e-20
Identities = 41/124 (33%), Positives = 56/124 (45%), Gaps = 9/124 (7%)
Query: 218 IGGPVVKGLGRGSKVLGIPTANLSTEGYSDVLSEHP-SGVYFGWAGLSTRGVYKMVMSIG 276
I G VV G G LG PTANL P GVY + + Y V +IG
Sbjct: 188 ISGRVVHGQKLGRT-LGFPTANLPLPDRV-----LPADGVYAVRVKVDGK-PYPGVANIG 240
Query: 277 WNPYFDNAEKTIEPWLLHEFDEDFYDEELHLVIVGYIRPEANFPSLETLIAKIHEDRKVA 336
P D + +E LL +F+ D Y E + + + +R E F SL+ L A+I +D + A
Sbjct: 241 TRPTVDGGRQLLEVHLL-DFNGDLYGEHITVEFLKKLRDEQKFDSLDELKAQIAKDIETA 299
Query: 337 ERAL 340
L
Sbjct: 300 RAFL 303
>gnl|CDD|233443 TIGR01509, HAD-SF-IA-v3, haloacid dehalogenase superfamily,
subfamily IA, variant 3 with third motif having DD or
ED. This model represents part of one structural
subfamily of the Haloacid Dehalogenase (HAD) superfamily
of aspartate-nucleophile hydrolases. The superfamily is
defined by the presence of three short catalytic motifs.
The subfamilies are defined based on the location and
the observed or predicted fold of a so-called "capping
domain", or the absence of such a domain. Subfamily I
consists of sequences in which the capping domain is
found in between the first and second catalytic motifs.
Subfamily II consists of sequences in which the capping
domain is found between the second and third motifs.
Subfamily III sequences have no capping domain in either
of these positions. The Subfamily IA and IB capping
domains are predicted by PSI-PRED to consist of an alpha
helical bundle. Subfamily I encompasses such a wide
region of sequence space (the sequences are highly
divergent) that representing it with a single model is
impossible, resulting in an overly broad description
which allows in many unrelated sequences. Subfamily IA
and IB are separated based on an apparent phylogenetic
bifurcation. Subfamily IA is still too broad to model,
but cannot be further subdivided into large chunks based
on phylogenetic trees. Of the three motifs defining the
HAD superfamily, the third has three variant forms: (1)
hhhhsDxxx(x)D, (2) hhhhssxxx(x)D and (3) hhhhDDxxx(x)s
where _s_ refers to a small amino acid and _h_ to a
hydrophobic one. All three of these variants are found
in subfamily IA. Individual models were made based on
seeds exhibiting only one of the variants each. Variant
3 (this model) is found in the enzymes
beta-phosphoglucomutase (TIGR01990) and
deoxyglucose-6-phosphatase, while many other enzymes of
subfamily IA exhibit this variant as well as variant 1
(TIGR01549). These three variant models were created
with the knowledge that there will be overlap among them
- this is by design and serves the purpose of
eliminating the overlap with models of more distantly
related HAD subfamilies caused by an overly broad single
model [Unknown function, Enzymes of unknown
specificity].
Length = 177
Score = 85.2 bits (211), Expect = 1e-19
Identities = 48/150 (32%), Positives = 71/150 (47%), Gaps = 4/150 (2%)
Query: 23 GREKHKIVGKTPLEEAAIIVEDYGLPCAKHEFVNEVYSMFSDHLCK--VKALPGANRLIK 80
E + YG + + +F + L K +K LPG L++
Sbjct: 30 PDELGVSEVGSLELALRRWKAKYGRTMSAEDAQLLYKQLFYEALEKEGLKPLPGVRALLE 89
Query: 81 HLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEVRTGKPSPDIFLEAAKRL 140
L G +AL +NS RA + + G F V++ S +V GKP PDI+L+A K+L
Sbjct: 90 ALRARGKKLALLTNSPRADAKLVLE--LGLRALFDVVIDSSDVGLGKPDPDIYLQALKKL 147
Query: 141 NMEPSSSLVIEDSVIGVVAGKAAGMEVVAV 170
++PS L ++DS G+ A KAAGM V V
Sbjct: 148 GLKPSECLFVDDSPAGIDAAKAAGMHTVLV 177
>gnl|CDD|222115 pfam13419, HAD_2, Haloacid dehalogenase-like hydrolase.
Length = 176
Score = 79.7 bits (197), Expect = 1e-17
Identities = 37/156 (23%), Positives = 64/156 (41%), Gaps = 7/156 (4%)
Query: 16 KYGKEWDGREKHKIVGKTPLEEAAIIVEDYGLPCA-KHEFVNEVYSMFSDHLCKVKALPG 74
+ G + E + G E A ++ ++ + E + E P
Sbjct: 27 RLGLDISAEELREAGGLPFDEALADLLREHPIDPDEILEALLEYNLESRLEP-----FPD 81
Query: 75 ANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEVRTGKPSPDIFL 134
L++ L GV + + SN R +E + G + F + SD+V KP P+ +
Sbjct: 82 VVELLRRLKAKGVKLVILSNGSREAVERLLEK-LGLLDLFDAVFTSDDVGARKPDPEAYE 140
Query: 135 EAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAV 170
+RL + P L I+DS + A +AAG++ V V
Sbjct: 141 RVLERLGLPPEEILFIDDSPEDLEAARAAGIKTVHV 176
>gnl|CDD|223620 COG0546, Gph, Predicted phosphatases [General function prediction
only].
Length = 220
Score = 79.8 bits (197), Expect = 2e-17
Identities = 48/199 (24%), Positives = 80/199 (40%), Gaps = 14/199 (7%)
Query: 6 FSEVLKTFLVKYGKEWDGREKHKIVGKTPLEEAAIIVEDYGLPCAKHEFVNEVYSMFSDH 65
L + G E+ + + L+E + A E V + F
Sbjct: 22 ILRAFNAALAELGLPPLDEEEIRQLIGLGLDELIERLLGEADEEAAAELVERLREEFLTA 81
Query: 66 ---LCKVKALPGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDE 122
L + + PG L+ L G + + +N ++ + G + F VIVG D+
Sbjct: 82 YAELLESRLFPGVKELLAALKSAGYKLGIVTNKPERELD-ILLKALGLADYFDVIVGGDD 140
Query: 123 VRTGKPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAV------PSLPKQ 176
V KP P+ L ++L ++P +L++ DS+ ++A KAAG+ V V Q
Sbjct: 141 VPPPKPDPEPLLLLLEKLGLDPEEALMVGDSLNDILAAKAAGVPAVGVTWGYNSREELAQ 200
Query: 177 THRYTAADEVINSLLDLRP 195
AD VI+SL +L
Sbjct: 201 AG----ADVVIDSLAELLA 215
>gnl|CDD|236770 PRK10826, PRK10826, 2-deoxyglucose-6-phosphatase; Provisional.
Length = 222
Score = 77.3 bits (191), Expect = 1e-16
Identities = 36/128 (28%), Positives = 65/128 (50%), Gaps = 4/128 (3%)
Query: 71 ALPGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEVRTGKPSP 130
LPG + G+ + LAS S +E + + F + ++++ KP P
Sbjct: 93 LLPGVREALALCKAQGLKIGLASASPLHMLE-AVLTMFDLRDYFDALASAEKLPYSKPHP 151
Query: 131 DIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAVPSLPKQTH--RYTAADEVIN 188
+++L A +L ++P + + +EDS G++A KAA M + VP+ P+Q + R+ AD +
Sbjct: 152 EVYLNCAAKLGVDPLTCVALEDSFNGMIAAKAARMRSIVVPA-PEQQNDPRWALADVKLE 210
Query: 189 SLLDLRPE 196
SL +L
Sbjct: 211 SLTELTAA 218
>gnl|CDD|215416 PLN02779, PLN02779, haloacid dehalogenase-like hydrolase family
protein.
Length = 286
Score = 78.2 bits (193), Expect = 2e-16
Identities = 44/134 (32%), Positives = 68/134 (50%), Gaps = 3/134 (2%)
Query: 69 VKALPGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWN--ESFSVIVGSDEVRTG 126
+ PG RL+ G+ +A+ S S+ + ++ G + V G D+V
Sbjct: 143 LPLRPGVLRLMDEALAAGIKVAVCSTSNEKAVSKIVNTLLGPERAQGLDVFAG-DDVPKK 201
Query: 127 KPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAVPSLPKQTHRYTAADEV 186
KP PDI+ AA+ L ++PS +V+EDSVIG+ A KAAGM + S ++ AD V
Sbjct: 202 KPDPDIYNLAAETLGVDPSRCVVVEDSVIGLQAAKAAGMRCIVTKSSYTADEDFSGADAV 261
Query: 187 INSLLDLRPEKWGL 200
+ L D+ E + L
Sbjct: 262 FDCLGDVPLEDFDL 275
>gnl|CDD|183215 PRK11587, PRK11587, putative phosphatase; Provisional.
Length = 218
Score = 74.3 bits (183), Expect = 2e-15
Identities = 49/137 (35%), Positives = 70/137 (51%), Gaps = 19/137 (13%)
Query: 69 VKALPGANRLIKHLSCHGVPMA--------LASNSHRATIESKISYQHGWNESFSVIVGS 120
+ ALPGA L+ HL+ G+P A +AS H+A G V V +
Sbjct: 82 ITALPGAIALLNHLNKLGIPWAIVTSGSVPVASARHKAA---------GLPAP-EVFVTA 131
Query: 121 DEVRTGKPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAVPSLPKQTHRY 180
+ V+ GKP PD +L A+ L + P +V+ED+ GV++G AAG V+AV + P T R
Sbjct: 132 ERVKRGKPEPDAYLLGAQLLGLAPQECVVVEDAPAGVLSGLAAGCHVIAV-NAPADTPRL 190
Query: 181 TAADEVINSLLDLRPEK 197
D V++SL L K
Sbjct: 191 DEVDLVLHSLEQLTVTK 207
>gnl|CDD|232818 TIGR00083, ribF, riboflavin kinase/FMN adenylyltransferase.
multifunctional enzyme: riboflavin kinase (EC 2.7.1.26)
(flavokinase) / FMN adenylyltransferase (EC 2.7.7.2)
(FAD pyrophosphorylase) (FAD synthetase) [Biosynthesis
of cofactors, prosthetic groups, and carriers,
Riboflavin, FMN, and FAD].
Length = 288
Score = 73.6 bits (181), Expect = 8e-15
Identities = 41/127 (32%), Positives = 60/127 (47%), Gaps = 6/127 (4%)
Query: 214 EPWYIGGPVVKGLGRGSKVLGIPTANLSTEGYSDVLSEHPSGVYFGWAGLSTRGVYKMVM 273
P++I G V+ G G LG PTAN+ + L G Y L+ Y V
Sbjct: 167 RPYFICGTVIHGQKLGRT-LGFPTANIKLKNQVLPL---KGGYYVVVVLLNGE-PYPGVG 221
Query: 274 SIGWNPYFDNAEKTIEPWLLHEFDEDFYDEELHLVIVGYIRPEANFPSLETLIAKIHEDR 333
+IG P F + IE LL +F + Y +E+ + +V IRPE F SL+ L +I +D
Sbjct: 222 NIGNRPTFIGQQLVIEVHLL-DFSGELYGQEIKVTLVKKIRPEQKFSSLDELKNQIQQDI 280
Query: 334 KVAERAL 340
A++
Sbjct: 281 LQAKKWF 287
>gnl|CDD|119389 cd01427, HAD_like, Haloacid dehalogenase-like hydrolases. The
haloacid dehalogenase-like (HAD) superfamily includes
L-2-haloacid dehalogenase, epoxide hydrolase,
phosphoserine phosphatase, phosphomannomutase,
phosphoglycolate phosphatase, P-type ATPase, and many
others, all of which use a nucleophilic aspartate in
their phosphoryl transfer reaction. All members possess
a highly conserved alpha/beta core domain, and many also
possess a small cap domain, the fold and function of
which is variable. Members of this superfamily are
sometimes referred to as belonging to the DDDD
superfamily of phosphohydrolases.
Length = 139
Score = 70.1 bits (172), Expect = 1e-14
Identities = 33/119 (27%), Positives = 55/119 (46%), Gaps = 17/119 (14%)
Query: 68 KVKALPGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEV---- 123
+++ PG +K L G+ +ALA+N R + + + G ++ F ++ S+
Sbjct: 22 ELELYPGVKEALKELKEKGIKLALATNKSRREVLELLE-ELGLDDYFDPVITSNGAAIYY 80
Query: 124 ------------RTGKPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAV 170
GKP+PD L A K L ++P L++ DS+ + KAAG VAV
Sbjct: 81 PKEGLFLGGGPFDIGKPNPDKLLAALKLLGVDPEEVLMVGDSLNDIEMAKAAGGLGVAV 139
>gnl|CDD|223943 COG1011, COG1011, Predicted hydrolase (HAD superfamily) [General
function prediction only].
Length = 229
Score = 71.5 bits (175), Expect = 2e-14
Identities = 43/183 (23%), Positives = 73/183 (39%), Gaps = 6/183 (3%)
Query: 13 FLVKYGKEWDGREKHKIVGKTPLEEAAIIVEDYGLPCAKHEFVNEVYSMFSDHLCKVKAL 72
L K + + G+ L ++ L E V + + +
Sbjct: 44 LLKLIEKLEARFLRGEYTGEYGLTLERLLELLERLL--GDEDAELVEELLAALAKLLPDY 101
Query: 73 PGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEVRTGKPSPDI 132
P A +K L + + +N R E K+ Q G + F + S++V KP P+I
Sbjct: 102 PEALEALKELG-KKYKLGILTNGARPHQERKLR-QLGLLDYFDAVFISEDVGVAKPDPEI 159
Query: 133 FLEAAKRLNMEPSSSLVIEDSVIGVVAG-KAAGMEVVAVPSLPKQT-HRYTAADEVINSL 190
F A ++L + P +L + DS+ + G +A GM+ V + K A D I+SL
Sbjct: 160 FEYALEKLGVPPEEALFVGDSLENDILGARALGMKTVWINRGGKPLPDALEAPDYEISSL 219
Query: 191 LDL 193
+L
Sbjct: 220 AEL 222
>gnl|CDD|215497 PLN02919, PLN02919, haloacid dehalogenase-like hydrolase family
protein.
Length = 1057
Score = 74.1 bits (182), Expect = 2e-14
Identities = 39/98 (39%), Positives = 57/98 (58%)
Query: 73 PGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEVRTGKPSPDI 132
PGA LI G+ +A+AS++ R +++ ++ F IV +D KP+PDI
Sbjct: 164 PGALELITQCKNKGLKVAVASSADRIKVDANLAAAGLPLSMFDAIVSADAFENLKPAPDI 223
Query: 133 FLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAV 170
FL AAK L + S +VIED++ GV A +AAGM +AV
Sbjct: 224 FLAAAKILGVPTSECVVIEDALAGVQAARAAGMRCIAV 261
>gnl|CDD|182679 PRK10725, PRK10725, fructose-1-P/6-phosphogluconate phosphatase;
Provisional.
Length = 188
Score = 68.6 bits (168), Expect = 1e-13
Identities = 32/87 (36%), Positives = 51/87 (58%), Gaps = 3/87 (3%)
Query: 88 PMALASNSHRATIESKISYQH-GWNESFSVIVGSDEVRTGKPSPDIFLEAAKRLNMEPSS 146
PMA+ + S A E+ + H G F +V +D+V+ KP+PD FL A+ + ++P+
Sbjct: 104 PMAVGTGSESAIAEALL--AHLGLRRYFDAVVAADDVQHHKPAPDTFLRCAQLMGVQPTQ 161
Query: 147 SLVIEDSVIGVVAGKAAGMEVVAVPSL 173
+V ED+ G+ A +AAGM+ V V L
Sbjct: 162 CVVFEDADFGIQAARAAGMDAVDVRLL 188
>gnl|CDD|237310 PRK13222, PRK13222, phosphoglycolate phosphatase; Provisional.
Length = 226
Score = 68.7 bits (169), Expect = 2e-13
Identities = 36/132 (27%), Positives = 56/132 (42%), Gaps = 15/132 (11%)
Query: 73 PGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEVRTGKPSPDI 132
PG + L G P+A+ +N + + G + FSV++G D + KP P
Sbjct: 96 PGVKETLAALKAAGYPLAVVTNKPTPFVA-PLLEALGIADYFSVVIGGDSLPNKKPDPAP 154
Query: 133 FLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAVPSLPKQTHRYT--------AAD 184
L A ++L ++P L + DS + A +AAG V V T+ Y D
Sbjct: 155 LLLACEKLGLDPEEMLFVGDSRNDIQAARAAGCPSVGV------TYGYNYGEPIALSEPD 208
Query: 185 EVINSLLDLRPE 196
VI+ +L P
Sbjct: 209 VVIDHFAELLPL 220
>gnl|CDD|216069 pfam00702, Hydrolase, haloacid dehalogenase-like hydrolase. This
family is structurally different from the alpha/beta
hydrolase family (pfam00561). This family includes
L-2-haloacid dehalogenase, epoxide hydrolases and
phosphatases. The structure of the family consists of
two domains. One is an inserted four helix bundle, which
is the least well conserved region of the alignment,
between residues 16 and 96 of Pseudomonas sp.
(S)-2-haloacid dehalogenase 1. The rest of the fold is
composed of the core alpha/beta domain. Those members
with the characteristic DxD triad at the N-terminus are
probably phosphatidylglycerolphosphate (PGP)
phosphatases involved in cardiolipin biosynthesis in the
mitochondria.
Length = 187
Score = 67.3 bits (164), Expect = 3e-13
Identities = 33/150 (22%), Positives = 55/150 (36%), Gaps = 10/150 (6%)
Query: 18 GKEWDGREKHKIVGKTPLEEAAIIVEDYGLPCAKHEFVNEVYSMFSDHLCKVKALPGANR 77
KE +++ + E + V + + PGA
Sbjct: 45 TKEGREELVRRLLLRALAGEELLEELLRAGATVVAVLDLVVLGLIALTD---PLYPGARE 101
Query: 78 LIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEVRT---GKPSPDIFL 134
+K L G+ +A+ + +R T + F +V +D GKP P IF
Sbjct: 102 ALKELKEAGIKLAILTGDNRLTANAIARLLG----LFDALVSADLYGLVGVGKPDPKIFE 157
Query: 135 EAAKRLNMEPSSSLVIEDSVIGVVAGKAAG 164
A + L ++P L++ D V + A KAAG
Sbjct: 158 LALEELGVKPEEVLMVGDGVNDIPAAKAAG 187
>gnl|CDD|130516 TIGR01449, PGP_bact, 2-phosphoglycolate phosphatase, prokaryotic.
PGP is an essential enzyme in the glycolate salvage
pathway in higher organisms (photorespiration in
plants). Phosphoglycolate results from the oxidase
activity of RubisCO in the Calvin cycle when
concentrations of carbon dioxide are low relative to
oxygen. In Ralstonia (Alcaligenes) eutropha and
Rhodobacter sphaeroides, the PGP gene (CbbZ) is located
on an operon along with other Calvin cycle enzymes
including RubisCO. The only other pertinent experimental
evidence concerns the gene from E. coli. The in vitro
activity of the Ralstonia and Escherichia enzymes was
determined with crude cell extracts of strains
containing PGP on expression plasmids and compared to
controls. In E. coli, however, there does not appear to
be a functional Calvin cycle (RubisCO is absent),
although the E. coli PGP gene (gph) is on the same
operon (dam) with ribulose-5-phosphate-3-epimerase
(rpe), a gene in the pentose-phosphate pathway (along
with other, unrelated genes). The E. coli enzyme is not
expressed under normal laboratory conditions; the
pathway to which it belongs has not been determined. In
fact, the possibility exists, although unlikely, that
the E. coli enzyme and others within this equivalog have
as their physiological substrate another, closely
related molecule. The other seed chosen for this model,
from Xylella fastidiosa has no experimental evidence,
but is a plant pathogen and thus may obtain
phosphoglycolate from its host. This model has been
restricted to encompass only proteobacteria as no
related PGP has been verified outside of this clade.
Sequences from Aquifex aeolicus and Treponema pallidum
fall between the trusted and noise cutoffs. Just below
the noise cutoff is a gene which is part of the operon
for the biosynthesis of the blue pigment, indigoidine,
from Erwinia (Pectobacterium) chrysanthemi, a plant
pathogen. It does not seem likely, considering the
proposed biosynthetic mechanism, that the
dephosphorylation of phosphoglycolate or a closely
related compound is required. Possibly, this gene is
fortuitously located in this operon, or has an indirect
relationship to the necessity for the biosynthesis of
this compound. Sequences from 11 species have been
annotated as PGP or putative PGP but fall below the
noise cutoff. None of these have experimental
validation. This enzyme is a member of the Haloacid
Dehalogenase (HAD) superfamily of aspartate-nucleophile
hydrolase enzymes (pfam00702) [Energy metabolism,
Sugars].
Length = 213
Score = 66.8 bits (163), Expect = 6e-13
Identities = 29/101 (28%), Positives = 46/101 (45%), Gaps = 1/101 (0%)
Query: 70 KALPGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEVRTGKPS 129
PG + L G+ + L +N + G + FSV++G D + KP
Sbjct: 85 SVFPGVEATLGALRAKGLRLGLVTNKPTPLARPLLELL-GLAKYFSVLIGGDSLAQRKPH 143
Query: 130 PDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAV 170
PD L AA+RL + P + + DS + + A +AAG V +
Sbjct: 144 PDPLLLAAERLGVAPQQMVYVGDSRVDIQAARAAGCPSVLL 184
>gnl|CDD|215313 PLN02575, PLN02575, haloacid dehalogenase-like hydrolase.
Length = 381
Score = 65.7 bits (160), Expect = 6e-12
Identities = 44/120 (36%), Positives = 63/120 (52%), Gaps = 2/120 (1%)
Query: 74 GANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEVRTGKPSPDIF 133
G+ + L + +PMAL S R T+E+ I G FSVIV +++V GKP P++F
Sbjct: 220 GSQEFVNVLMNYKIPMALVSTRPRKTLENAIG-SIGIRGFFSVIVAAEDVYRGKPDPEMF 278
Query: 134 LEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAVPSLPKQTHRYTAADEVINSLLDL 193
+ AA+ LN P +V +S V A A M+ VAV S + AAD V+ L +L
Sbjct: 279 IYAAQLLNFIPERCIVFGNSNQTVEAAHDARMKCVAVAS-KHPIYELGAADLVVRRLDEL 337
>gnl|CDD|215413 PLN02770, PLN02770, haloacid dehalogenase-like hydrolase family
protein.
Length = 248
Score = 64.1 bits (156), Expect = 1e-11
Identities = 36/103 (34%), Positives = 54/103 (52%), Gaps = 1/103 (0%)
Query: 68 KVKALPGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEVRTGK 127
++K L G +L K + G+ A +N+ R E IS G ++ F ++ E K
Sbjct: 106 QLKPLNGLYKLKKWIEDRGLKRAAVTNAPRENAELMISLL-GLSDFFQAVIIGSECEHAK 164
Query: 128 PSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAV 170
P PD +L+A + L + + V EDSV G+ AG AAGM VV +
Sbjct: 165 PHPDPYLKALEVLKVSKDHTFVFEDSVSGIKAGVAAGMPVVGL 207
>gnl|CDD|215644 PLN03243, PLN03243, haloacid dehalogenase-like hydrolase;
Provisional.
Length = 260
Score = 63.5 bits (154), Expect = 2e-11
Identities = 40/124 (32%), Positives = 63/124 (50%), Gaps = 2/124 (1%)
Query: 70 KALPGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEVRTGKPS 129
+ PG+ ++ L H +P+A+AS R +E I G FSV++ +++V GKP
Sbjct: 109 RLRPGSREFVQALKKHEIPIAVASTRPRRYLERAIE-AVGMEGFFSVVLAAEDVYRGKPD 167
Query: 130 PDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAVPSLPKQTHRYTAADEVINS 189
P++F+ AA+RL P +V +S V A M+ VAV + +A D V+
Sbjct: 168 PEMFMYAAERLGFIPERCIVFGNSNSSVEAAHDGCMKCVAVAG-KHPVYELSAGDLVVRR 226
Query: 190 LLDL 193
L DL
Sbjct: 227 LDDL 230
>gnl|CDD|130521 TIGR01454, AHBA_synth_RP, 3-amino-5-hydroxybenzoic acid synthesis
related protein. The enzymes in this equivalog are all
located in the operons for the biosynthesis of
3-amino-5-hydroxybenzoic acid (AHBA), which is a
precursor of several antibiotics including ansatrienin,
naphthomycin, rifamycin and mitomycin. The role that
this enzyme plays in this biosynthesis has not been
elucidated. This enzyme is a member of the Haloacid
dehalogenase superfamily (pfam00702) of
aspartate-nucleophile hydrolases. This enzyme is closely
related to phosphoglycolate phosphatase (TIGR01449), but
it is unclear what purpose a PGPase or PGPase-like
activity would serve in these biosyntheses. This model
is limited to the Gram positive Actinobacteria. The most
closely related enzyme below the noise cutoff is IndB
which is involved in the biosynthesis of Indigoidine in
Pectobacterium (Erwinia) chrysanthemi, a gamma
proteobacterium. This enzyme is similarly related to PGP.
In this case, too, it is unclear what role would be
played by a PGPase activity.
Length = 205
Score = 62.2 bits (151), Expect = 2e-11
Identities = 42/129 (32%), Positives = 65/129 (50%), Gaps = 14/129 (10%)
Query: 46 GLPCAKHE-FVNEVYSMFSDHLCKVKALPGANRLIKHLSCHGVPMALASNSH----RATI 100
GLP E FV E Y + +V+ PG L+ L GV A+A+ R+ +
Sbjct: 54 GLPLEMEEPFVRESYRLAG----EVEVFPGVPELLAELRADGVGTAIATGKSGPRARSLL 109
Query: 101 ESKISYQHGWNESFSVIVGSDEVRTGKPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAG 160
E+ G F ++GSDEV KP+PDI EA + L++ P ++++ D+V + +
Sbjct: 110 EAL-----GLLPLFDHVIGSDEVPRPKPAPDIVREALRLLDVPPEDAVMVGDAVTDLASA 164
Query: 161 KAAGMEVVA 169
+AAG VA
Sbjct: 165 RAAGTATVA 173
>gnl|CDD|222003 pfam13242, Hydrolase_like, HAD-hyrolase-like.
Length = 74
Score = 58.5 bits (142), Expect = 3e-11
Identities = 25/77 (32%), Positives = 39/77 (50%), Gaps = 14/77 (18%)
Query: 126 GKPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAVPSLPKQTHRYTA--- 182
GKP+P + A +RL ++P ++I DS ++A +AAG+ + V T TA
Sbjct: 3 GKPNPGMLRAALERLGVDPEECVMIGDSDTDILAARAAGIRTILVL-----TGVTTAEDL 57
Query: 183 ------ADEVINSLLDL 193
D V++SL DL
Sbjct: 58 ERAPGRPDYVVDSLADL 74
>gnl|CDD|200170 TIGR02252, DREG-2, REG-2-like, HAD superfamily (subfamily IA)
hydrolase. This family of proteins includes
uncharacterized sequences from eukaryotes, cyanobacteria
and Leptospira as well as the DREG-2 protein from
Drosophila melanogaster which has been identified as a
rhythmically (diurnally) regulated gene. This family is
a member of the Haloacid Dehalogenase (HAD) superfamily
of aspartate-nucleophile hydrolases. The superfamily is
defined by the presence of three short catalytic motifs.
The subfamilies are defined based on the location and
the observed or predicted fold of a so-called 'capping
domain', or the absence of such a domain. This family is
a member of subfamily 1A in which the cap domain
consists of a predicted alpha helical bundle found in
between the first and second catalytic motifs. A
distinctive feature of this family is a conserved tandem
pair of tryptophan residues in the cap domain. The most
divergent sequences included within the scope of this
model are from plants and have "FW" at this position
instead. Most likely, these sequences, like the vast
majority of HAD sequences, represent phosphatase
enzymes.
Length = 203
Score = 50.7 bits (122), Expect = 2e-07
Identities = 38/122 (31%), Positives = 56/122 (45%), Gaps = 10/122 (8%)
Query: 48 PCAKHEFVNEVYSMFSDHLCKVKALPGANRLIKHLSCHGVPMALASN---SHRATIESKI 104
P + + E+YS F+ + P A +L+K L G+ + + SN R +E+
Sbjct: 84 PESFEKIFEELYSYFATPEP-WQVYPDAIKLLKDLRERGLILGVISNFDSRLRGLLEAL- 141
Query: 105 SYQHGWNESFSVIVGSDEVRTGKPSPDIFLEAAKRLNMEPSSSLVIEDSVI-GVVAGKAA 163
G E F +V S EV KP P IF EA +R + P +L I DS+ +AA
Sbjct: 142 ----GLLEYFDFVVTSYEVGAEKPDPKIFQEALERAGISPEEALHIGDSLRNDYQGARAA 197
Query: 164 GM 165
G
Sbjct: 198 GW 199
>gnl|CDD|182552 PRK10563, PRK10563, 6-phosphogluconate phosphatase; Provisional.
Length = 221
Score = 50.8 bits (122), Expect = 2e-07
Identities = 40/138 (28%), Positives = 68/138 (49%), Gaps = 20/138 (14%)
Query: 39 AIIVEDYGLPCAKHE----FVNEVYSMFSDHLCKVKALPGANRLIKHLSCHGVPMALASN 94
II +++G+ AK E + EV +F L + + GAN L++ ++ VPM + SN
Sbjct: 56 DIISKEHGVTLAKAELEPVYRAEVARLFDSEL---EPIAGANALLESIT---VPMCVVSN 109
Query: 95 SHRATIESKISYQHGWNESFS-----VIVGSDEVRTGKPSPDIFLEAAKRLNMEPSSSLV 149
SK+ + G + G D ++ KP P + AA+ +N+ + ++
Sbjct: 110 GP----VSKMQHSLGKTGMLHYFPDKLFSGYD-IQRWKPDPALMFHAAEAMNVNVENCIL 164
Query: 150 IEDSVIGVVAGKAAGMEV 167
++DS G +G AAGMEV
Sbjct: 165 VDDSSAGAQSGIAAGMEV 182
>gnl|CDD|171912 PRK13223, PRK13223, phosphoglycolate phosphatase; Provisional.
Length = 272
Score = 50.2 bits (120), Expect = 5e-07
Identities = 54/179 (30%), Positives = 71/179 (39%), Gaps = 35/179 (19%)
Query: 27 HKIVGKTPLEEA-AIIVEDYGLPCAKHEFVNEVYSMFSDHLCKVKALPGANRLIKHLSCH 85
H V E+A A+ +E Y HE VY PG +K L
Sbjct: 74 HDGVDDELAEQALALFMEAYA---DSHEL-TVVY-------------PGVRDTLKWLKKQ 116
Query: 86 GVPMALASNSHRATI-----ESKISYQHGWNESFSVIVGSDEVRTGKPSPDIFLEAAKRL 140
GV MAL +N + + KI W I+G D + KP P L K
Sbjct: 117 GVEMALITNKPERFVAPLLDQMKIGRYFRW------IIGGDTLPQKKPDPAALLFVMKMA 170
Query: 141 NMEPSSSLVIEDSVIGVVAGKAAGMEVVAVPSLPKQTHRYTAADE----VINSLLDLRP 195
+ PS SL + DS V+A KAAG++ VA+ H A+E VI+ L L P
Sbjct: 171 GVPPSQSLFVGDSRSDVLAAKAAGVQCVALSY--GYNHGRPIAEESPALVIDDLRALLP 227
>gnl|CDD|237311 PRK13226, PRK13226, phosphoglycolate phosphatase; Provisional.
Length = 229
Score = 49.5 bits (118), Expect = 6e-07
Identities = 22/64 (34%), Positives = 36/64 (56%)
Query: 107 QHGWNESFSVIVGSDEVRTGKPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGME 166
Q GW + +V++G D + KP P L AA+R+ + P+ + + D ++A +AAGM
Sbjct: 131 QLGWEQRCAVLIGGDTLAERKPHPLPLLVAAERIGVAPTDCVYVGDDERDILAARAAGMP 190
Query: 167 VVAV 170
VA
Sbjct: 191 SVAA 194
>gnl|CDD|237336 PRK13288, PRK13288, pyrophosphatase PpaX; Provisional.
Length = 214
Score = 48.9 bits (117), Expect = 8e-07
Identities = 29/104 (27%), Positives = 51/104 (49%), Gaps = 5/104 (4%)
Query: 69 VKALPGANRLIKHLSCHGVPMALASNSHRATIES--KISYQHGWNESFSVIVGSDEVRTG 126
V +K L G + + + R T+E K++ G +E F V++ D+V
Sbjct: 81 VTEYETVYETLKTLKKQGYKLGIVTTKMRDTVEMGLKLT---GLDEFFDVVITLDDVEHA 137
Query: 127 KPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAV 170
KP P+ L+A + L +P +L++ D+ ++AGK AG + V
Sbjct: 138 KPDPEPVLKALELLGAKPEEALMVGDNHHDILAGKNAGTKTAGV 181
>gnl|CDD|233463 TIGR01549, HAD-SF-IA-v1, haloacid dehalogenase superfamily,
subfamily IA, variant 1 with third motif having Dx(3-4)D
or Dx(3-4)E. This model represents part of one
structural subfamily of the Haloacid Dehalogenase (HAD)
superfamily of aspartate-nucleophile hydrolases. The
superfamily is defined by the presence of three short
catalytic motifs. The subfamilies are defined based on
the location and the observed or predicted fold of a
so-called "capping domain", or the absence of such a
domain. Subfamily I consists of sequences in which the
capping domain is found in between the first and second
catalytic motifs. Subfamily II consists of sequences in
which the capping domain is found between the second and
third motifs. Subfamily III sequences have no capping
domain in either of these positions. The Subfamily IA and
IB capping domains are predicted by PSI-PRED to consist
of an alpha helical bundle. Subfamily I encompasses such
a wide region of sequence space (the sequences are
highly divergent) that modelling it with a single
representation is impossible, resulting in an overly
broad description which allows in many unrelated
sequences. Subfamily IA and IB are separated based on an
apparent phylogenetic bifurcation. Subfamily IA is still
too broad to model, but cannot be further subdivided
into large chunks based on phylogenetic trees. Of the
three motifs defining the HAD superfamily, the third has
three variant forms: (1) hhhhsDxxx(x)(D/E), (2)
hhhhssxxx(x)D and (3) hhhhDDxxx(x)s where _s_ refers to
a small amino acid and _h_ to a hydrophobic one. All
three of these variants are found in subfamily IA.
Individual models were made based on seeds exhibiting
only one of the variants each. Variant 1 (this model) is
found in the enzymes phosphoglycolate phosphatase
(TIGR01449) and enolase-phosphatase. These three variant
models (see also TIGR01493 and TIGR01509) were created
with the knowledge that there will be overlap among them
- this is by design and serves the purpose of
eliminating the overlap with models of more distantly
related HAD subfamilies caused by an overly broad single
model [Unknown function, Enzymes of unknown
specificity].
Length = 162
Score = 48.1 bits (115), Expect = 9e-07
Identities = 29/95 (30%), Positives = 49/95 (51%), Gaps = 3/95 (3%)
Query: 70 KALPGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEVRTGKPS 129
+PGA L+ L G+ + + SN + + HG + F +I+GSDE+ + KP
Sbjct: 71 AYIPGAADLLPRLKEAGIKLGIISNGSLRAQKLLLRK-HGLGDYFELILGSDEIGS-KPE 128
Query: 130 PDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAG 164
P+IFL A + L + P L + D++ + + AG
Sbjct: 129 PEIFLAALESLGVPP-EVLHVGDNLSDIKGARNAG 162
>gnl|CDD|234176 TIGR03351, PhnX-like, phosphonatase-like hydrolase. This clade of
sequences are the closest homologs to the PhnX enzyme,
phosphonoacetaldehyde (Pald) hydrolase (phosphonatase,
TIGR01422). This phosphonatase-like enzyme and PhnX
itself are members of the haloacid dehalogenase (HAD)
superfamily (pfam00702) having a number of distinctive
features that set them apart from typical HAD enzymes.
The typical HAD N-terminal motif DxDx(T/V) here is DxAGT
and the usual conserved lysine prior to the C-terminal
motif is instead an arginine. Also distinctive of
phosphonatase, and particular to its bi-catalytic
mechanism is a conserved lysine in the variable "cap"
domain. This lysine forms a Schiff base with the
aldehyde of phosphonoacetaldehyde, providing, through
the resulting positive charge, a polarization of the C-P
bond necessary for cleavage as well as a route to the
initial product of cleavage, an ene-amine. The
conservation of these elements in this
phosphonatase-like enzyme suggests that the substrate is
also, like Pald, a 2-oxo-ethylphosphonate. Despite this,
the genomic context of members of this family is quite
distinct from PhnX, which is almost invariably
associated with the 2-aminoethylphosphonate transaminase
PhnW (TIGR02326), the source of the substrate Pald.
Members of this clade are never associated with PhnW,
but rather associate with families of FAD-dependent
oxidoreductases related to deaminating amino acid
oxidases (pfam01266) as well as zinc-dependent
dehydrogenases (pfam00107). Notably, family members from
Arthrobacter aurescens TC1 and Nocardia farcinica IFM
10152 are adjacent to the PhnCDE ABC cassette
phosphonates transporter (GenProp0236) typically found
in association with the phosphonates C-P lyase system
(GenProp0232). These observations suggest two
possibilities. First, the substrate for this enzyme
family is also Pald, the non-association with PhnW
notwithstanding. Alternatively, the substrate is something
very closely related such as
hydroxyphosphonoacetaldehyde (Hpald). Hpald could come
from oxidative deamination of
1-hydroxy-2-aminoethylphosphonate (HAEP) by the
associated oxidase. HAEP would not be a substrate for
PhnW due to its high specificity for AEP. HAEP has been
shown to be a constituent of the sphingophosphonolipid
of Bacteriovorax stolpii, and presumably has other
natural sources. If Hpald is the substrate, the product
would be glycolaldehyde (hydroxyacetaldehyde), and the
associated alcohol dehydrogenase may serve to convert
this to glycol.
Length = 220
Score = 48.6 bits (116), Expect = 1e-06
Identities = 38/160 (23%), Positives = 61/160 (38%), Gaps = 21/160 (13%)
Query: 19 KEWDGREKHKIV------GKTPLEEAAIIVEDYGLPCAKHEFVNEVYSMFSDHLCKVKAL 72
W G+ K + + EA D+ E + D AL
Sbjct: 41 SAWMGQSKIEAIRALLAADGADEAEAQAAFADF----------EERLAEAYDDG-PPVAL 89
Query: 73 PGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGW--NESFSVIVGSDEVRTGKPSP 130
PGA + L G+ +AL + R T E ++ + GW + +V +V G+P+P
Sbjct: 90 PGAEEAFRSLRSSGIKVALTTGFDRDTAE-RLLEKLGWTVGDDVDAVVCPSDVAAGRPAP 148
Query: 131 DIFLEAAKRLNMEPSSS-LVIEDSVIGVVAGKAAGMEVVA 169
D+ L A + ++ S V D+ + AG AG V
Sbjct: 149 DLILRAMELTGVQDVQSVAVAGDTPNDLEAGINAGAGAVV 188
>gnl|CDD|162787 TIGR02253, CTE7, HAD superfamily (subfamily IA) hydrolase,
TIGR02253. This family of sequences from archaea and
metazoans includes the human uncharacterized protein
CTE7. Pyrococcus species appear to have three different
forms of this enzyme, so it is unclear whether all
members of this family have the same function. This
family is a member of the haloacid dehalogenase (HAD)
superfamily of hydrolases which are characterized by
three conserved sequence motifs. By virtue of an alpha
helical domain in-between the first and second conserved
motif, this family is a member of subfamily IA
(TIGR01549).
Length = 221
Score = 47.8 bits (114), Expect = 2e-06
Identities = 32/142 (22%), Positives = 58/142 (40%), Gaps = 5/142 (3%)
Query: 56 NEVYSMFSDHLCKVKALPGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFS 115
VY+ ++ PG + L G + + ++ K+ + G + F
Sbjct: 80 AFVYAYHKLKFAYLRVYPGVRDTLMELRESGYRLGIITDGLPVKQWEKLE-RLGVRDFFD 138
Query: 116 VIVGSDEVRTGKPSPDIFLEAAKRLNMEPSSSLVIEDSVI-GVVAGKAAGMEVVAVPSLP 174
++ S+E KP P IF A KRL ++P ++++ D + + K GM+ V +
Sbjct: 139 AVITSEEEGVEKPHPKIFYAALKRLGVKPEEAVMVGDRLDKDIKGAKNLGMKTVWINQGK 198
Query: 175 KQTHR---YTAADEVINSLLDL 193
Y D I+SL +L
Sbjct: 199 SSKMEDDVYPYPDYEISSLREL 220
>gnl|CDD|129317 TIGR00213, GmhB_yaeD, D,D-heptose 1,7-bisphosphate phosphatase.
This family of proteins formerly designated yaeD
resembles the histidinol phosphatase domain of the
bifunctional protein HisB. The member from E. coli has
been characterized as D,D-heptose 1,7-bisphosphate
phosphatase, GmhB, involved in inner core LPS assembly
(PMID:11751812) [Cell envelope, Biosynthesis and
degradation of surface polysaccharides and
lipopolysaccharides].
Length = 176
Score = 45.3 bits (107), Expect = 9e-06
Identities = 23/69 (33%), Positives = 35/69 (50%), Gaps = 2/69 (2%)
Query: 127 KPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAA--GMEVVAVPSLPKQTHRYTAAD 184
KP P + L+A K L+++ + S ++ D + + AG AA V+ P AD
Sbjct: 106 KPKPGMLLQARKELHIDMAQSYMVGDKLEDMQAGVAAKVKTNVLVRTGKPITPEAENIAD 165
Query: 185 EVINSLLDL 193
V+NSL DL
Sbjct: 166 WVLNSLADL 174
>gnl|CDD|233512 TIGR01656, Histidinol-ppas, histidinol-phosphate phosphatase family
domain. This domain is found in authentic
histidinol-phosphate phosphatases which are sometimes
found as stand-alone entities and sometimes as fusions
with imidazoleglycerol-phosphate dehydratase
(TIGR01261). Additionally, a family of proteins
including YaeD from E. coli (TIGR00213) and various
other proteins are closely related but may not have the
same substrate specificity. This domain is a member of
the haloacid-dehalogenase (HAD) superfamily of
aspartate-nucleophile hydrolases. This superfamily is
distinguished by the presence of three motifs: an
N-terminal motif containing the nucleophilic aspartate,
a central motif containing a conserved serine or
threonine, and a C-terminal motif containing a conserved
lysine (or arginine) and conserved aspartates. More
specifically, the domain modelled here is a member of
subfamily III of the HAD-superfamily by virtue of
lacking a "capping" domain in either of the two common
positions, between motifs 1 and 2, or between motifs 2
and 3.
Length = 147
Score = 42.8 bits (101), Expect = 5e-05
Identities = 18/46 (39%), Positives = 27/46 (58%)
Query: 127 KPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAVPS 172
KP P + LEA KRL ++ S SLV+ D + + A + AG+ V +
Sbjct: 101 KPKPGLILEALKRLGVDASRSLVVGDRLRDLQAARNAGLAAVLLVD 146
>gnl|CDD|188140 TIGR01422, phosphonatase, phosphonoacetaldehyde hydrolase. This
enzyme catalyzes the cleavage of the carbon-phosphorus
bond of a phosphonate. The mechanism depends on the
substrate having a carbonyl one carbon away from the
cleavage position. This enzyme is a member of the
Haloacid Dehalogenase (HAD) superfamily of
aspartate-nucleophile hydrolases (pfam00702), and
contains a modified version of the conserved catalytic
motifs of that superfamily: the first motif is usually
DxDx(T/V), here it is DxAxT, and in the third motif the
normal conserved lysine is instead an arginine.
Additionally, the enzyme contains a unique conserved
catalytic lysine (B. cereus pos. 53) which is involved
in the binding and activation of the substrate through
the formation of a Schiff base. The substrate of this
enzyme is the product of 2-aminoethylphosphonate (AEP)
transaminase, phosphonoacetaldehyde. This degradation
pathway for AEP may be related to its toxic properties
which are utilized by microorganisms as a chemical
warfare agent [Central intermediary metabolism, Other].
Length = 253
Score = 43.1 bits (102), Expect = 1e-04
Identities = 26/101 (25%), Positives = 51/101 (50%), Gaps = 5/101 (4%)
Query: 73 PGANRLIKHLSCHGVPMALASNSHRATIE--SKISYQHGWNESFSVIVGSDEVRTGKPSP 130
PGA +I +L G+ + + R ++ + + G+ + V +D+V G+P+P
Sbjct: 102 PGAIEVIAYLRARGIKIGSTTGYTREMMDVVAPEAAAQGYRPDY--NVTADDVPAGRPAP 159
Query: 131 DIFLEAAKRLNMEPSSSLV-IEDSVIGVVAGKAAGMEVVAV 170
+ L+ A L + +++V + D+V + G+ AGM V V
Sbjct: 160 WMALKNATELGVYDPAAVVKVGDTVPDIEEGRNAGMWTVGV 200
>gnl|CDD|223319 COG0241, HisB, Histidinol phosphatase and related phosphatases
[Amino acid transport and metabolism].
Length = 181
Score = 41.1 bits (97), Expect = 2e-04
Identities = 20/68 (29%), Positives = 34/68 (50%), Gaps = 1/68 (1%)
Query: 127 KPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGME-VVAVPSLPKQTHRYTAADE 185
KP P + L A K N++ S S V+ D + + A + AG++ V+ + + T A
Sbjct: 105 KPKPGMLLSALKEYNIDLSRSYVVGDRLTDLQAAENAGIKGVLVLTGIGVTTDGAGRAKW 164
Query: 186 VINSLLDL 193
V +SL +
Sbjct: 165 VFDSLAEF 172
>gnl|CDD|162372 TIGR01458, HAD-SF-IIA-hyp3, HAD-superfamily subfamily IIA
hydrolase, TIGR01458. This hypothetical equivalog is a
member of the IIA subfamily (TIGR01460) of the haloacid
dehalogenase superfamily of aspartate-nucleophile
hydrolases. One sequence (GP|10716807) has been
annotated as a "phospholysine phosphohistidine inorganic
pyrophosphatase," probably in reference to studies on
similarly described (but unsequenced) enzymes from
bovine and rat tissues. However, the supporting
information for this annotation has never been published
[Unknown function, Enzymes of unknown specificity].
Length = 257
Score = 40.6 bits (95), Expect = 6e-04
Identities = 21/61 (34%), Positives = 30/61 (49%), Gaps = 6/61 (9%)
Query: 126 GKPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAG-KAAGMEVVAVPSLPKQTHRYTAAD 184
GKPS FLEA + EP +++I D V G + GM + V +T +Y +D
Sbjct: 178 GKPSKTFFLEALRATGCEPEEAVMIGDDCRDDVGGAQDCGMRGIQV-----RTGKYRPSD 232
Query: 185 E 185
E
Sbjct: 233 E 233
>gnl|CDD|223720 COG0647, NagD, Predicted sugar phosphatases of the HAD superfamily
[Carbohydrate transport and metabolism].
Length = 269
Score = 39.9 bits (94), Expect = 0.001
Identities = 16/46 (34%), Positives = 29/46 (63%), Gaps = 1/46 (2%)
Query: 126 GKPSPDIFLEAAKRLNMEPSSSLVIEDSVIG-VVAGKAAGMEVVAV 170
GKPSP I+ A ++L ++ S L++ D + ++ KAAG++ + V
Sbjct: 189 GKPSPAIYEAALEKLGLDRSEVLMVGDRLDTDILGAKAAGLDTLLV 234
>gnl|CDD|236354 PRK08942, PRK08942, D,D-heptose 1,7-bisphosphate phosphatase;
Validated.
Length = 181
Score = 39.0 bits (92), Expect = 0.001
Identities = 15/38 (39%), Positives = 24/38 (63%)
Query: 127 KPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAG 164
KP P + L A+RLN++ + S ++ DS+ + A AAG
Sbjct: 103 KPKPGMLLSIAERLNIDLAGSPMVGDSLRDLQAAAAAG 140
>gnl|CDD|184075 PRK13478, PRK13478, phosphonoacetaldehyde hydrolase; Provisional.
Length = 267
Score = 38.7 bits (91), Expect = 0.003
Identities = 19/56 (33%), Positives = 32/56 (57%), Gaps = 1/56 (1%)
Query: 116 VIVGSDEVRTGKPSPDIFLEAAKRLNMEPSSSLV-IEDSVIGVVAGKAAGMEVVAV 170
+V +D+V G+P P + L+ A L + ++ V ++D+V G+ G AGM V V
Sbjct: 147 HVVTTDDVPAGRPYPWMALKNAIELGVYDVAACVKVDDTVPGIEEGLNAGMWTVGV 202
>gnl|CDD|233675 TIGR01993, Pyr-5-nucltdase, pyrimidine 5'-nucleotidase. This
family of proteins includes the SDT1/SSM1 gene from
yeast which has been shown to code for a pyrimidine
(UMP/CMP) 5'nucleotidase. The family spans plants, fungi
and a small number of bacteria. These enzymes are
members of the haloacid dehalogenase (HAD) superfamily
of hydrolases, specifically the IA subfamily (variant 3,
TIGR01509).
Length = 183
Score = 37.7 bits (88), Expect = 0.004
Identities = 16/44 (36%), Positives = 26/44 (59%)
Query: 127 KPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAV 170
KPSP + +A + ++P ++ +DS + AGKA GM+ V V
Sbjct: 140 KPSPQAYEKALREAGVDPERAIFFDDSARNIAAGKALGMKTVLV 183
>gnl|CDD|182828 PRK10907, PRK10907, intramembrane serine protease GlpG;
Provisional.
Length = 276
Score = 36.5 bits (85), Expect = 0.013
Identities = 25/63 (39%), Positives = 33/63 (52%), Gaps = 10/63 (15%)
Query: 216 WYIGGPVVKGLGRGSK-VLGIPTANLSTEGYSDVLSEHPSGVYFGWAGLSTRGVYKMVMS 274
WY+GG V K LG G V+ + +A LS G+ + SG +FG GLS GV +M
Sbjct: 159 WYLGGAVEKRLGSGKLIVITLISALLS--GW---VQSKFSGPWFG--GLS--GVVYALMG 209
Query: 275 IGW 277
W
Sbjct: 210 YVW 212
>gnl|CDD|233462 TIGR01548, HAD-SF-IA-hyp1, haloacid dehalogenase superfamily,
subfamily IA hydrolase, TIGR01548. This model
represents a small and phylogenetically curious clade of
sequences. Sequences are found from Halobacterium (an
archaeon), Nostoc and Synechococcus (cyanobacteria) and
Phytophthora (a stramenopile eukaryote). These appear
to be members of the haloacid dehalogenase (HAD)
superfamily of aspartate-nucleophile hydrolases by
general homology and the conservation of all of the
recognized catalytic motifs. The variable domain is
found in between motifs 1 and 2, indicating membership
in subfamily I and phylogeny and prediction of the alpha
helical nature of the variable domain (by PSI-PRED)
indicate membership in subfamily IA. All but the
Halobacterium sequence currently found are annotated as
"Imidazoleglycerol-phosphate dehydratase", however, the
source of the annotation could not be traced and
significant homology could not be found between any of
these sequences and known IGPD's.
Length = 197
Score = 35.3 bits (81), Expect = 0.028
Identities = 24/100 (24%), Positives = 44/100 (44%), Gaps = 2/100 (2%)
Query: 64 DHLCKVKALPGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEV 123
L + + L L++ L MA+ + R K HG F V + ++
Sbjct: 100 LGLIEDETLLTPKGLLRELHRAPKGMAVVTGRPRKDAA-KFLTTHGLEILFPVQIWMEDC 158
Query: 124 RTGKPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAA 163
KP+P+ + AAK L +E + ++ D+V ++ G+ A
Sbjct: 159 -PPKPNPEPLILAAKALGVEACHAAMVGDTVDDIITGRKA 197
>gnl|CDD|106187 PRK13225, PRK13225, phosphoglycolate phosphatase; Provisional.
Length = 273
Score = 35.1 bits (80), Expect = 0.037
Identities = 30/152 (19%), Positives = 61/152 (40%), Gaps = 18/152 (11%)
Query: 19 KEWDGREKHKIVGKTPLEEAAIIVEDYGLPCAKHEFVNEVYSMFSDHLCKVKALPGANRL 78
++W R + G +P ++A ++ V D L ++ PG L
Sbjct: 105 RQWSSRTIVRRAGLSPWQQARLL--------------QRVQRQLGDCLPALQLFPGVADL 150
Query: 79 IKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEVRTGKPSPDIFLEAAK 138
+ L + + + S++ R IE+ + Q G FSV+ + + + + +
Sbjct: 151 LAQLRSRSLCLGILSSNSRQNIEAFLQRQ-GLRSLFSVVQAGTPILSKRRA---LSQLVA 206
Query: 139 RLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAV 170
R +P++ + + D V A + G+ VAV
Sbjct: 207 REGWQPAAVMYVGDETRDVEAARQVGLIAVAV 238
>gnl|CDD|130495 TIGR01428, HAD_type_II, 2-haloalkanoic acid dehalogenase, type II.
Catalyzes the hydrolytic dehalogenation of small
L-2-haloalkanoic acids to yield the corresponding
D-2-hydroxyalkanoic acids. Belongs to the Haloacid
Dehalogenase (HAD) superfamily of aspartate-nucleophile
hydrolases (pfam00702), class (subfamily) I. Note that
the Type I HAD enzymes have not yet been fully
characterized, but clearly utilize a substantially
different catalytic mechanism and are thus unlikely to
be related.
Length = 198
Score = 34.6 bits (80), Expect = 0.039
Identities = 22/92 (23%), Positives = 44/92 (47%), Gaps = 1/92 (1%)
Query: 79 IKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEVRTGKPSPDIFLEAAK 138
++ L G +A+ SN A ++S + + G ++ F ++ +D VR KP+P ++ A +
Sbjct: 101 LRALKERGYRLAILSNGSPAMLKSLVKHA-GLDDPFDAVLSADAVRAYKPAPQVYQLALE 159
Query: 139 RLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAV 170
L + P L + + + K G + V
Sbjct: 160 ALGVPPDEVLFVASNPWDLGGAKKFGFKTAWV 191
>gnl|CDD|225090 COG2179, COG2179, Predicted hydrolase of the HAD superfamily
[General function prediction only].
Length = 175
Score = 33.0 bits (76), Expect = 0.13
Identities = 16/52 (30%), Positives = 24/52 (46%), Gaps = 1/52 (1%)
Query: 124 RTGKPSPDIFLEAAKRLNMEPSSSLVIEDSVI-GVVAGKAAGMEVVAVPSLP 174
R KP F A K +N+ P +++ D + V+ G AGM + V L
Sbjct: 90 RAKKPFGRAFRRALKEMNLPPEEVVMVGDQLFTDVLGGNRAGMRTILVEPLV 141
>gnl|CDD|233519 TIGR01668, YqeG_hyp_ppase, HAD superfamily (subfamily IIIA)
phosphatase, TIGR01668. This family of hypothetical
proteins is a member of the IIIA subfamily of the
haloacid dehalogenase (HAD) superfamily of hydrolases.
All characterized members of this subfamily (TIGR01662)
and most characterized members of the HAD superfamily
are phosphatases. HAD superfamily phosphatases contain
active site residues in several conserved catalytic
motifs, all of which are found conserved here. This
family consists of sequences from fungi, plants,
cyanobacteria, gram-positive bacteria and Deinococcus.
There is presently no characterization of any sequence
in this family.
Length = 170
Score = 32.0 bits (73), Expect = 0.23
Identities = 28/153 (18%), Positives = 47/153 (30%), Gaps = 29/153 (18%)
Query: 71 ALPGANRLIKHLSCHGVPMALASNSHRATIESKISYQHGWNESFSVIVGSDEVRTGKPSP 130
A P I+ L G + + SN+ E + ++ + V V KP
Sbjct: 44 AYPALRDWIEELKAAGRKLLIVSNNAG---EQRAKA---VEKALGIPVLPHAV---KPPG 94
Query: 131 DIFLEAAKRLNMEPSSSLVIEDSVI-GVVAGKAAGMEVVAVPSLPKQTHRYTAADEVINS 189
F A + + V+ D + V+ G G + V L
Sbjct: 95 CAFRRAHPEMGLTSEQVAVVGDRLFTDVMGGNRNGSYTILVEPLV--------------- 139
Query: 190 LLDLRPEKWGLPPFQDWIEGTLPSEPWYIGGPV 222
P++W + +E T+ GGP
Sbjct: 140 ----HPDQWFIKRIWRRVERTVLKFLVSRGGPA 168
>gnl|CDD|233517 TIGR01662, HAD-SF-IIIA, HAD-superfamily hydrolase, subfamily IIIA.
This subfamily falls within the Haloacid Dehalogenase
(HAD) superfamily of aspartate-nucleophile hydrolases.
The Class III subfamilies are characterized by the lack
of any domains located either between the first
and second conserved catalytic motifs (as in the Class I
subfamilies, TIGR01493, TIGR01509, TIGR01488 and
TIGR01494) or between the second and third conserved
catalytic motifs (as in the Class II subfamilies,
TIGR01460 and TIGR01484) of the superfamily domain. The
IIIA subfamily contains five major clades:
histidinol-phosphatase (TIGR01261) and
histidinol-phosphatase-related protein (TIGR00213) which
together form a subfamily (TIGR01656), DNA
3'-phosphatase (TIGR01663, TIGR01664), YqeG (TIGR01668)
and YrbI (TIGR01670). In the case of histidinol
phosphatase and PNK-3'-phosphatase, this model
represents a domain of a bifunctional system. In the
histidinol phosphatase HisB, a C-terminal domain is an
imidazoleglycerol-phosphate dehydratase which catalyzes
a related step in histidine biosynthesis. In
PNK-3'-phosphatase, N- and C-terminal domains constitute
the polynucleotide kinase and DNA-binding components of
the enzyme [Unknown function, Enzymes of unknown
specificity].
Length = 132
Score = 31.2 bits (71), Expect = 0.31
Identities = 23/106 (21%), Positives = 42/106 (39%), Gaps = 9/106 (8%)
Query: 73 PGANRLIKHLSCHGVPMALASNS----HRATIESKISYQ-HGWNESFSVIVGSDEVRTGK 127
P + L G + + +N +++ + ++ R K
Sbjct: 28 PEVPDALAELKEAGYKVVIVTNQSGIGRGKFSSGRVARRLEELGVPIDILYACPHCR--K 85
Query: 128 PSPDIFLEAAKRLN-MEPSSSLVIEDSVI-GVVAGKAAGMEVVAVP 171
P P +FLEA KR N ++P S+ + D + + A K AG+ + V
Sbjct: 86 PKPGMFLEALKRFNEIDPEESVYVGDQDLTDLQAAKRAGLAFILVA 131
>gnl|CDD|233800 TIGR02247, HAD-1A3-hyp, epoxide hydrolase N-terminal domain-like
phosphatase. This model represents a small clade of
sequences including C. elegans and mammalian sequences
as well as a small number of bacteria. In eukaryotes,
this domain exists as an N-terminal fusion to the
soluble epoxide hydrolase enzyme and has recently been
shown to be an active phosphatase, although the nature
of the biological substrate is unclear. These appear to
be members of the haloacid dehalogenase (HAD)
superfamily of aspartate-nucleophile hydrolases by
general homology and the conservation of all of the
recognized catalytic motifs (although the first motif is
unusual in the replacement of the more common aspartate
with glycine...). The variable domain is found in
between motifs 1 and 2, indicating membership in
subfamily I and phylogeny and prediction of the alpha
helical nature of the variable domain (by PSI-PRED)
indicate membership in subfamily IA.
Length = 211
Score = 31.7 bits (72), Expect = 0.44
Identities = 24/113 (21%), Positives = 40/113 (35%), Gaps = 5/113 (4%)
Query: 69 VKALPGANRLIKHLSCHGVPMALASNS---HRATIESKISYQHGWNESFSVIVGSDEVRT 125
K P IK L G A +N+ + E+ + F +V S
Sbjct: 93 TKLRPSMMAAIKTLRAKGFKTACITNNFPTDHSAEEALLPGDIM--ALFDAVVESCLEGL 150
Query: 126 GKPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAVPSLPKQTH 178
KP P I+ +RL + P + ++D + A G+ + V + H
Sbjct: 151 RKPDPRIYQLMLERLGVAPEECVFLDDLGSNLKPAAALGITTIKVSDEEQAIH 203
>gnl|CDD|162788 TIGR02254, YjjG/YfnB, HAD superfamily (subfamily IA) hydrolase,
TIGR02254. This family consists of uncharacterized
proteobacterial and gram positive bacterial sequences
including YjjG from E. coli and YfnB from B. subtilis.
This family is a member of the haloacid dehalogenase
(HAD) superfamily of hydrolases which are characterized
by three conserved sequence motifs. By virtue of an
alpha helical domain in-between the first and second
conserved motif, this family is a member of subfamily IA
(TIGR01549). Most likely, these enzymes are
phosphatases.
Length = 224
Score = 31.7 bits (72), Expect = 0.44
Identities = 26/100 (26%), Positives = 46/100 (46%), Gaps = 6/100 (6%)
Query: 72 LPGANRLIKHLSCHG-VPMALASNSHRATIESKISYQHGWNESFSVIVGSDEVRTGKPSP 130
LPGA L+++L + + +N R T ++ + G F I S++ KP
Sbjct: 99 LPGAFELMENL--QQKFRLYIVTNGVRETQYKRLR-KSGLFPFFDDIFVSEDAGIQKPDK 155
Query: 131 DIFLEAAKRL-NMEPSSSLVIEDSVIG-VVAGKAAGMEVV 168
+IF A +R+ L+I DS+ + G+ AG++
Sbjct: 156 EIFNYALERMPKFSKEEVLMIGDSLTADIKGGQNAGLDTC 195
>gnl|CDD|130524 TIGR01457, HAD-SF-IIA-hyp2, HAD-superfamily subfamily IIA
hydrolase, TIGR01457. This hypothetical equivalog is a
member of the Class IIA subfamily of the haloacid
dehalogenase superfamily of aspartate-nucleophile
hydrolases. The sequences modelled by this equivalog are
all gram positive (low-GC) bacteria. Sequences found in
this model are annotated variously as related to NagD or
4-nitrophenyl phosphatase, and this hypothetical
equivalog, of all of those within the Class IIA
subfamily, is most closely related to the E. coli NagD
enzyme and the PGP_euk equivalog (TIGR01452). However,
there is presently no evidence that this hypothetical
equivalog has the same function as either of those [Unknown
function, Enzymes of unknown specificity].
Length = 249
Score = 31.7 bits (72), Expect = 0.47
Identities = 16/57 (28%), Positives = 29/57 (50%), Gaps = 1/57 (1%)
Query: 115 SVIVGSDEVRTGKPSPDIFLEAAKRLNMEPSSSLVIEDS-VIGVVAGKAAGMEVVAV 170
+V G V GKP I +A + L + +L++ D+ ++AG AG++ + V
Sbjct: 166 TVSTGVKPVFIGKPESIIMEQAMRVLGTDVEETLMVGDNYATDIMAGINAGIDTLLV 222
>gnl|CDD|215296 PLN02540, PLN02540, methylenetetrahydrofolate reductase.
Length = 565
Score = 31.6 bits (72), Expect = 0.62
Identities = 25/105 (23%), Positives = 39/105 (37%), Gaps = 32/105 (30%)
Query: 176 QTHRYTAADEVINSLLDLRPEKWGLPP---------FQDWIEGTLPSEPWYIGGPVVKGL 226
Q R A D+ L+ E WG+P F + G L S PW + GL
Sbjct: 352 QFMRPRARDK------KLQAE-WGVPLKSVEDVYEVFAKYCLGKLKSSPW----SELDGL 400
Query: 227 GRGSKVLGIPTANLSTEGY---------SDVLSEHPSGVYFGWAG 262
+K++ ++ +G+ + S+ PS GW G
Sbjct: 401 QPETKIINEQLVKINRKGFLTINSQPAVNGEKSDSPS---VGWGG 442
>gnl|CDD|180686 PRK06769, PRK06769, hypothetical protein; Validated.
Length = 173
Score = 30.9 bits (70), Expect = 0.68
Identities = 12/44 (27%), Positives = 21/44 (47%)
Query: 127 KPSPDIFLEAAKRLNMEPSSSLVIEDSVIGVVAGKAAGMEVVAV 170
KPS + L+AA++ ++ + VI D +VA + V
Sbjct: 93 KPSTGMLLQAAEKHGLDLTQCAVIGDRWTDIVAAAKVNATTILV 136
>gnl|CDD|233420 TIGR01452, PGP_euk, phosphoglycolate/pyridoxal phosphate
phosphatase family. PGP is an essential enzyme in the
glycolate salvage pathway in higher organisms
(photorespiration in plants). Phosphoglycolate results
from the oxidase activity of RubisCO in the Calvin cycle
when concentrations of carbon dioxide are low relative
to oxygen. In mammals, PGP is found in many tissues,
notably in red blood cells where P-glycolate is an
important activator of the hydrolysis of
2,3-bisphosphoglycerate, a major modifier of the oxygen
affinity of hemoglobin. Pyridoxal phosphate (PLP,
Vitamin B6) phosphatase is involved in the degradation
of PLP in mammals and is widely distributed in human
tissues including erythrocytes. The enzymes described
here are members of the Haloacid dehalogenase
superfamily of hydrolase enzymes (pfam00702). Unlike the
bacterial PGP equivalog (TIGR01449), which is a member
of class (subfamily) I, these enzymes are members of
class (subfamily) II. These two families have almost
certainly arisen from convergent evolution (although
these two ancestors may themselves have diverged from a
more distant HAD superfamily progenitor). The primary
seed sequence for this model comes from Chlamydomonas
reinhardtii, a photosynthetic alga. The enzyme has been
purified and characterized and these data are fully
consistent with the assignment of function as a PGPase
involved in photorespiration. The second seed, from Homo
sapiens chromosome 22 has been characterized as a
pyridoxal phosphatase. Biochemical characterization of
partially purified PGPs from various tissues including
red blood cells has been performed while one gene for
PGP has been localized to chromosome 16p13.3. The
sequence used here maps to chromosome 22. There is
indeed a related gene on chromosome 16 (and it is
expressed, since EST's are found) which shows 46%
identity and 59% positives by BLAST2 (E=1e-66). The
chromosome 16 gene is not in evidence in nraa but
translated from the genomic sequence would score 372.4
(E=7.9e-113) versus this model, well above trusted. The
third seed, from C. elegans, is only supported by
sequence similarity. This model is limited to eukaryotic
species including S. pombe and S. cerevisiae, although
several archaea score between the trusted and noise
cutoffs. This model is closely related to a family of
bacterial sequences including the E. coli NagD and B.
subtilis AraL genes which are characterized by the
ability to hydrolyze para-nitrophenylphosphate (pNPPases
or NPPases). The chlamydomonas PGPase d.
Length = 279
Score = 31.0 bits (70), Expect = 0.96
Identities = 19/72 (26%), Positives = 34/72 (47%), Gaps = 4/72 (5%)
Query: 119 GSDEVRTGKPSPDIFLEAAKRLNMEPSSSLVIEDSV-IGVVAGKAAGMEVVAVPSLPKQT 177
G + GKPSP +F + +++P+ +L++ D + ++ G GM V V S
Sbjct: 194 GRQPLVVGKPSPYMFECITENFSIDPARTLMVGDRLETDILFGHRCGMTTVLVLS---GV 250
Query: 178 HRYTAADEVINS 189
R A E + +
Sbjct: 251 SRLEEAQEYLAA 262
>gnl|CDD|233422 TIGR01460, HAD-SF-IIA, Haloacid Dehalogenase Superfamily Class
(subfamily) IIA. This model represents one structural
subclass of the Haloacid Dehalogenase (HAD) superfamily
of aspartate-nucleophile hydrolases. The superfamily is
defined by the presence of three short catalytic motifs.
The classes are defined based on the location and the
observed or predicted fold of a so-called "capping
domain", or the absence of such a domain. Class I
consists of sequences in which the capping domain is
found in between the first and second catalytic motifs.
Class II consists of sequences in which the capping
domain is found between the second and third motifs.
Class III sequences have no capping domain in either of
these positions. The Class IIA capping domain is
predicted by PSI-PRED to consist of a mixed alpha-beta
fold with the basic pattern:
Helix-Helix-Helix-Sheet-Helix-Loop-Sheet-Helix-Sheet-
Helix. Presently, this subfamily encompasses a single
equivalog model (TIGR01452) for the eukaryotic
phosphoglycolate phosphatase, as well as four
hypothetical equivalogs covering closely related
sequences (TIGR01456 and TIGR01458 in eukaryotes,
TIGR01457 in gram positive bacteria and TIGR01459 in
gram negative bacteria). The Escherichia coli NagD gene
and the Bacillus subtilis AraL gene are members of this
subfamily but are not members of any of the
presently defined equivalogs within it. NagD is part of
the NAG operon responsible for N-acetylglucosamine
metabolism. The function of this gene is unknown. Genes
from several organisms have been annotated as NagD, or
NagD-like. However, without data on the presence of
other members of this pathway, (such as in the case of
Yersinia pestis) these assignments should not be given
great weight. The AraL gene is similar: it is part of
the L-arabinose operon, but the function is unknown. A
gene from Halobacterium has been annotated as AraL, but
no other Ara operon genes have been annotated. Many of
the genes in this subfamily have been annotated as
"pNPPase", "4-nitrophenyl phosphatase" or "NPPase". These
all refer to the same activity versus a common lab test
compound used to determine phosphatase activity. There
is no evidence that this activity is physiologically
relevant [Unknown function, Enzymes of unknown
specificity].
Length = 236
Score = 30.8 bits (70), Expect = 1.0
Identities = 15/54 (27%), Positives = 24/54 (44%), Gaps = 2/54 (3%)
Query: 119 GSDEVRTGKPSPDIFLEAAKRLNMEPSSSLVI--EDSVIGVVAGKAAGMEVVAV 170
G + GKPSP I+ A L P V+ ++ ++ K AG + + V
Sbjct: 180 GREPTVVGKPSPAIYRAALNLLQARPERRDVMVGDNLRTDILGAKNAGFDTLLV 233
>gnl|CDD|151335 pfam10886, DUF2685, Protein of unknown function (DUF2685).
Members in this family of proteins are annotated as
uvdY.-2 which is an open reading frame within uvsY.
However, there is currently no known function.
Length = 54
Score = 27.5 bits (61), Expect = 1.6
Identities = 10/31 (32%), Positives = 17/31 (54%), Gaps = 4/31 (12%)
Query: 26 KHKIVGKTPLEEAAIIVEDYGL----PCAKH 52
+V K P+E+A + +YG PCA++
Sbjct: 2 AICVVCKQPVEKALAVDTEYGPVHPGPCAQY 32
>gnl|CDD|182466 PRK10444, PRK10444, UMP phosphatase; Provisional.
Length = 248
Score = 29.8 bits (67), Expect = 2.2
Identities = 17/57 (29%), Positives = 29/57 (50%), Gaps = 1/57 (1%)
Query: 117 IVGSDEVRTGKPSPDIFLEAAKRLNMEPSSSLVIEDSV-IGVVAGKAAGMEVVAVPS 172
I G GKPSP I A ++ ++++ D++ ++AG AG+E + V S
Sbjct: 164 ISGRKPFYVGKPSPWIIRAALNKMQAHSEETVIVGDNLRTDILAGFQAGLETILVLS 220
>gnl|CDD|219943 pfam08631, SPO22, Meiosis protein SPO22/ZIP4 like. SPO22/ZIP4 in
yeast is a meiosis specific protein involved in
sporulation. It has been shown to regulate crossover
distribution by promoting synaptonemal complex
formation.
Length = 280
Score = 29.7 bits (67), Expect = 2.4
Identities = 14/61 (22%), Positives = 27/61 (44%), Gaps = 1/61 (1%)
Query: 288 IEPWLLHEFDEDFYDEELHLVIVGYIRPEANFPSLETLIAKIHE-DRKVAERALDLPLYS 346
+E E+ Y++ L +I E+NF + I K+ + A + LD L++
Sbjct: 128 LEILKKRPGPEEEYEDVLMRMIKSVDVTESNFELAISHINKLSDKAPASAAKCLDYLLFN 187
Query: 347 K 347
+
Sbjct: 188 R 188
>gnl|CDD|181865 PRK09449, PRK09449, dUMP phosphatase; Provisional.
Length = 224
Score = 29.1 bits (66), Expect = 3.4
Identities = 20/66 (30%), Positives = 31/66 (46%), Gaps = 6/66 (9%)
Query: 70 KALPGANRLIKHLSCHG-VPMALASNSHRATIESKISYQH-GWNESFSVIVGSDEVRTGK 127
LPGA L+ L G V M + +N T ++ + G + F ++V S++V K
Sbjct: 95 TPLPGAVELLNAL--RGKVKMGIITNGF--TELQQVRLERTGLRDYFDLLVISEQVGVAK 150
Query: 128 PSPDIF 133
P IF
Sbjct: 151 PDVAIF 156
>gnl|CDD|233980 TIGR02712, urea_carbox, urea carboxylase. Members of this family are
ATP-dependent urea carboxylase, including characterized
members from Oleomonas sagaranensis (alpha class
Proteobacterium) and yeasts such as Saccharomyces
cerevisiae. The allophanate hydrolase domain of the yeast
enzyme is not included in this model and is represented
by an adjacent gene in Oleomonas sagaranensis. The fusion
of urea carboxylase and allophanate hydrolase is
designated urea amidolyase. The enzyme from Oleomonas
sagaranensis was shown to be highly active on acetamide
and formamide as well as urea [Central intermediary
metabolism, Nitrogen metabolism].
Length = 1201
Score = 29.2 bits (66), Expect = 4.2
Identities = 14/33 (42%), Positives = 16/33 (48%), Gaps = 4/33 (12%)
Query: 277 WNPYFDNAEKTIEPWLLHEFDE-DFY---DEEL 305
WN Y PWLL FD+ FY +EEL
Sbjct: 1019 WNRYRLGGAFQDGPWLLRFFDQIRFYPVSEEEL 1051
>gnl|CDD|223343 COG0265, DegQ, Trypsin-like serine proteases, typically
periplasmic, contain C-terminal PDZ domain
[Posttranslational modification, protein turnover,
chaperones].
Length = 347
Score = 28.7 bits (64), Expect = 4.4
Identities = 13/47 (27%), Positives = 22/47 (46%), Gaps = 3/47 (6%)
Query: 198 WGLPPFQDWIEGTLPSEPWYIGGPVVKGLGRGSKVLGIPTANLSTEG 244
+ ++I+ P GGP+V G +V+GI TA ++ G
Sbjct: 176 GSAGGYVNFIQTDAAINPGNSGGPLVNIDG---EVVGINTAIIAPSG 219
>gnl|CDD|236761 PRK10795, PRK10795, penicillin-binding protein 2; Provisional.
Length = 634
Score = 28.6 bits (64), Expect = 6.4
Identities = 13/39 (33%), Positives = 22/39 (56%), Gaps = 1/39 (2%)
Query: 191 LDLRPEKWGLPPFQDWIEGTLPSEPWYIGGPVVKGLGRG 229
+DL E+ G P ++W + +PWY G + G+G+G
Sbjct: 420 IDLAEERSGNMPTREWKQKRF-KKPWYQGDTIPVGIGQG 457
>gnl|CDD|224221 COG1302, COG1302, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 131
Score = 27.2 bits (61), Expect = 6.9
Identities = 13/33 (39%), Positives = 16/33 (48%), Gaps = 1/33 (3%)
Query: 142 MEPSSSLVIEDSVIGVVAGKAAGMEVVAVPSLP 174
E + I D VI V+AG AA EV V +
Sbjct: 6 NEELGKIEISDEVIAVIAGIAA-EEVEGVVGMA 37
>gnl|CDD|132589 TIGR03550, F420_cofG, 7,8-didemethyl-8-hydroxy-5-deazariboflavin
synthase, CofG subunit. This model represents either a
subunit or a domain, depending on whether or not the
genes are fused, of a bifunctional protein that
completes the synthesis of
7,8-didemethyl-8-hydroxy-5-deazariboflavin, or FO. FO is
the chromophore of coenzyme F(420), involved in
methanogenesis in methanogenic archaea but found in
certain other lineages as well. The chromophore also
occurs as a cofactor in DNA photolyases in Cyanobacteria
[Biosynthesis of cofactors, prosthetic groups, and
carriers, Other].
Length = 322
Score = 28.0 bits (63), Expect = 7.2
Identities = 14/46 (30%), Positives = 25/46 (54%), Gaps = 6/46 (13%)
Query: 312 YIRPEANFPSLETLIAKIHEDRKV--AERALDLPLYSKYRDDPYLK 355
++ PEA +P ++ L A+ E+ ER LP+Y +Y + +L
Sbjct: 273 HVNPEAPWPEIDEL-ARATEEAGFTLKER---LPVYPEYVREGWLS 314
>gnl|CDD|232841 TIGR00131, gal_kin, galactokinase. Galactokinase is a member of
the GHMP kinases (Galactokinase, Homoserine kinase,
Mevalonate kinase, Phosphomevalonate kinase) and shares
with them an amino-terminal domain probably related to
ATP binding. The galactokinases found by this model are
divided into two sets. Prokaryotic forms are generally
shorter. The eukaryotic forms are longer because of
additional central regions and in some cases are known
to be bifunctional, with regulatory activities that are
independent of galactokinase activity [Energy
metabolism, Sugars].
Length = 386
Score = 27.9 bits (62), Expect = 9.6
Identities = 11/20 (55%), Positives = 12/20 (60%)
Query: 332 DRKVAERALDLPLYSKYRDD 351
D K AER+LDLPL D
Sbjct: 69 DNKFAERSLDLPLDGSEVSD 88
>gnl|CDD|180485 PRK06245, cofG, FO synthase subunit 1; Reviewed.
Length = 336
Score = 27.6 bits (62), Expect = 9.9
Identities = 16/45 (35%), Positives = 24/45 (53%), Gaps = 6/45 (13%)
Query: 312 YIRPEANFPSLETLIAKIHEDRKVA--ERALDLPLYSKYRDDPYL 354
Y+ PE +P +E L +I E+ ER LP+Y KY + +L
Sbjct: 277 YVNPEYPWPDIEEL-REILEEAGWPLKER---LPVYPKYIKEGWL 317
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.316 0.136 0.414
Gapped
Lambda K H
0.267 0.0872 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 19,117,862
Number of extensions: 1888366
Number of successful extensions: 1772
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1729
Number of HSP's successfully gapped: 81
Length of query: 362
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 264
Effective length of database: 6,590,910
Effective search space: 1740000240
Effective search space used: 1740000240
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 60 (26.6 bits)