RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy6288
(166 letters)
>gnl|CDD|178407 PLN02811, PLN02811, hydrolase.
Length = 220
Score = 202 bits (516), Expect = 6e-67
Identities = 86/138 (62%), Positives = 97/138 (70%), Gaps = 2/138 (1%)
Query: 29 GALRLINHLHKHNIPFAIATSSAKESFELKTSRHKDTLKLFHHVVLGSADPEVKQGKPAP 88
GA RL+ HLH IP AIAT S K F+LKT RH + L HHVV G DPEVKQGKPAP
Sbjct: 82 GAERLVRHLHAKGIPIAIATGSHKRHFDLKTQRHGELFSLMHHVVTGD-DPEVKQGKPAP 140
Query: 89 DVFLVAAKRF-DEKPQPSKCLVFEDAPNGVLGAKAAGMSCVMVPDPTVPKHRTEAADLVL 147
D+FL AA+RF D P K LVFEDAP+GV AK AGMS VMVPDP + K + AD VL
Sbjct: 141 DIFLAAARRFEDGPVDPGKVLVFEDAPSGVEAAKNAGMSVVMVPDPRLDKSYCKGADQVL 200
Query: 148 NSLEEFKPELYGLPPFED 165
+SL +FKPE +GLPPF D
Sbjct: 201 SSLLDFKPEEWGLPPFPD 218
>gnl|CDD|178528 PLN02940, PLN02940, riboflavin kinase.
Length = 382
Score = 140 bits (355), Expect = 8e-41
Identities = 65/139 (46%), Positives = 90/139 (64%), Gaps = 9/139 (6%)
Query: 29 GALRLINHLHKHNIPFAIATSSAKESFELKTSRHKDTLKLFHHVVLGSADPEVKQGKPAP 88
GA RLI HL H +P A+A++S + + E K S H+ + F +V G EV++GKP+P
Sbjct: 97 GANRLIKHLKSHGVPMALASNSPRANIEAKISCHQGWKESFSVIVGGD---EVEKGKPSP 153
Query: 89 DVFLVAAKRFDEKPQPSKCLVFEDAPNGVLGAKAAGMSCVMVPDPTVPK--HRTEAADLV 146
D+FL AAKR + +P S CLV ED+ GV+ KAAGM + VP ++PK H +AD V
Sbjct: 154 DIFLEAAKRLNVEP--SNCLVIEDSLPGVMAGKAAGMEVIAVP--SIPKQTHLYSSADEV 209
Query: 147 LNSLEEFKPELYGLPPFED 165
+NSL + +PE +GLPPF D
Sbjct: 210 INSLLDLQPEKWGLPPFND 228
>gnl|CDD|223710 COG0637, COG0637, Predicted phosphatase/phosphohexomutase [General
function prediction only].
Length = 221
Score = 101 bits (254), Expect = 2e-27
Identities = 44/129 (34%), Positives = 65/129 (50%), Gaps = 6/129 (4%)
Query: 29 GALRLINHLHKHNIPFAIATSSAKESFELKTSRHKDTLKLFHHVVLGSADPEVKQGKPAP 88
G + L+ L IP A+A+SS + + E +R L F +V +V +GKPAP
Sbjct: 90 GVVELLEQLKARGIPLAVASSSPRRAAERVLARLGL-LDYFDVIVTAD---DVARGKPAP 145
Query: 89 DVFLVAAKRFDEKPQPSKCLVFEDAPNGVLGAKAAGMSCVMVPDPTVPKHRTEAADLVLN 148
D++L+AA+R P +C+V ED+P G+ AKAAGM V VP H +
Sbjct: 146 DIYLLAAERLGV--DPEECVVVEDSPAGIQAAKAAGMRVVGVPAGHDRPHLDPLDAHGAD 203
Query: 149 SLEEFKPEL 157
++ EL
Sbjct: 204 TVLLDLAEL 212
Score = 27.0 bits (60), Expect = 4.1
Identities = 8/10 (80%), Positives = 9/10 (90%)
Query: 13 VIFDMDGLLL 22
VIFDMDG L+
Sbjct: 5 VIFDMDGTLV 14
>gnl|CDD|183215 PRK11587, PRK11587, putative phosphatase; Provisional.
Length = 218
Score = 85.8 bits (213), Expect = 3e-21
Identities = 54/132 (40%), Positives = 71/132 (53%), Gaps = 24/132 (18%)
Query: 29 GALRLINHLHKHNIPFAIATSSAKESFELKTSRHKDTLKLFHHVVLGSADPE-------V 81
GA+ L+NHL+K IP+AI TS S + ++RHK G PE V
Sbjct: 87 GAIALLNHLNKLGIPWAIVTSG---SVPVASARHK---------AAGLPAPEVFVTAERV 134
Query: 82 KQGKPAPDVFLVAAKRFDEKPQPSKCLVFEDAPNGVLGAKAAGMSCVMVPDPT-VPKHRT 140
K+GKP PD +L+ A+ PQ +C+V EDAP GVL AAG + V P P R
Sbjct: 135 KRGKPEPDAYLLGAQLLGLAPQ--ECVVVEDAPAGVLSGLAAGCHVIAVNAPADTP--RL 190
Query: 141 EAADLVLNSLEE 152
+ DLVL+SLE+
Sbjct: 191 DEVDLVLHSLEQ 202
>gnl|CDD|233443 TIGR01509, HAD-SF-IA-v3, haloacid dehalogenase superfamily,
subfamily IA, variant 3 with third motif having DD or
ED. This model represents part of one structural
subfamily of the Haloacid Dehalogenase (HAD) superfamily
of aspartate-nucleophile hydrolases. The superfamily is
defined by the presence of three short catalytic motifs.
The subfamilies are defined based on the location and
the observed or predicted fold of a so-called "capping
domain", or the absence of such a domain. Subfamily I
consists of sequences in which the capping domain is
found in between the first and second catalytic motifs.
Subfamily II consists of sequences in which the capping
domain is found between the second and third motifs.
Subfamily III sequences have no capping domain in either
of these positions.The Subfamily IA and IB capping
domains are predicted by PSI-PRED to consist of an alpha
helical bundle. Subfamily I encompasses such a wide
region of sequence space (the sequences are highly
divergent) that representing it with a single model is
impossible, resulting in an overly broad description
which allows in many unrelated sequences. Subfamily IA
and IB are separated based on an aparrent phylogenetic
bifurcation. Subfamily IA is still too broad to model,
but cannot be further subdivided into large chunks based
on phylogenetic trees. Of the three motifs defining the
HAD superfamily, the third has three variant forms : (1)
hhhhsDxxx(x)D, (2) hhhhssxxx(x)D and (3) hhhhDDxxx(x)s
where _s_ refers to a small amino acid and _h_ to a
hydrophobic one. All three of these variants are found
in subfamily IA. Individual models were made based on
seeds exhibiting only one of the variants each. Variant
3 (this model) is found in the enzymes
beta-phosphoglucomutase (TIGR01990) and
deoxyglucose-6-phosphatase, while many other enzymes of
subfamily IA exhibit this variant as well as variant 1
(TIGR01549). These three variant models were created
withthe knowledge that there will be overlap among them
- this is by design and serves the purpose of
eliminating the overlap with models of more distantly
relatedHAD subfamilies caused by an overly broad single
model [Unknown function, Enzymes of unknown
specificity].
Length = 177
Score = 77.1 bits (190), Expect = 3e-18
Identities = 37/113 (32%), Positives = 52/113 (46%), Gaps = 9/113 (7%)
Query: 20 LLLGYNLAI--GALRLINHLHKHNIPFAIATSSAKESFELKTSRHKDTLKLFHHVVLGSA 77
L L G L+ L A+ T+S + +L LF V+ S
Sbjct: 72 ALEKEGLKPLPGVRALLEALRARGKKLALLTNSPRADAKLV--LELGLRALFDVVIDSS- 128
Query: 78 DPEVKQGKPAPDVFLVAAKRFDEKPQPSKCLVFEDAPNGVLGAKAAGMSCVMV 130
+V GKP PD++L A K+ KP +CL +D+P G+ AKAAGM V+V
Sbjct: 129 --DVGLGKPDPDIYLQALKKLGLKPS--ECLFVDDSPAGIDAAKAAGMHTVLV 177
>gnl|CDD|213673 TIGR02009, PGMB-YQAB-SF, beta-phosphoglucomutase family hydrolase.
This subfamily model groups together three clades: the
characterized beta-phosphoglucomutases (including those
from E.coli, B.subtilus and L.lactis, TIGR01990), a
clade of putative bPGM's from mycobacteria and a clade
including the uncharacterized E.coli and H.influenzae
yqaB genes which may prove to be beta-mutases of a
related 1-phosphosugar. All of these are members of the
larger Haloacid dehalogenase (HAD) subfamily IA and
include the "variant 3" glu-asp version of the third
conserved HAD domain (TIGR01509).
Length = 185
Score = 76.2 bits (188), Expect = 7e-18
Identities = 37/102 (36%), Positives = 48/102 (47%), Gaps = 8/102 (7%)
Query: 29 GALRLINHLHKHNIPFAIATSSAKESFELKTSRHKDTLKLFHHVVLGSADPEVKQGKPAP 88
G L+ L I + +SS L F +V S EVK GKP P
Sbjct: 92 GIRNLLKRLKAKGIAVGLGSSSKNAPRILAKL---GLRDYFDAIVDAS---EVKNGKPHP 145
Query: 89 DVFLVAAKRFDEKPQPSKCLVFEDAPNGVLGAKAAGMSCVMV 130
+ FL+AA+ P++C+VFEDA GV A+AAGM V V
Sbjct: 146 ETFLLAAELL--GVPPNECIVFEDALAGVQAARAAGMFAVAV 185
>gnl|CDD|216069 pfam00702, Hydrolase, haloacid dehalogenase-like hydrolase. This
family is structurally different from the alpha/beta
hydrolase family (pfam00561). This family includes
L-2-haloacid dehalogenase, epoxide hydrolases and
phosphatases. The structure of the family consists of
two domains. One is an inserted four helix bundle, which
is the least well conserved region of the alignment,
between residues 16 and 96 of Pseudomonas sp.
(S)-2-haloacid dehalogenase 1. The rest of the fold is
composed of the core alpha/beta domain. Those members
with the characteristic DxD triad at the N-terminus are
probably phosphatidylglycerolphosphate (PGP)
phosphatases involved in cardiolipin biosynthesis in the
mitochondria.
Length = 187
Score = 69.6 bits (170), Expect = 2e-15
Identities = 28/96 (29%), Positives = 40/96 (41%), Gaps = 6/96 (6%)
Query: 29 GALRLINHLHKHNIPFAIATSSAKESFELKTSRHKDTLKLFHHVVLGSADPEVKQGKPAP 88
GA + L + I AI T + + + L LF +V V GKP P
Sbjct: 98 GAREALKELKEAGIKLAILTGDNRLT----ANAIARLLGLFDALVSADLYGLVGVGKPDP 153
Query: 89 DVFLVAAKRFDEKPQPSKCLVFEDAPNGVLGAKAAG 124
+F +A + K P + L+ D N + AKAAG
Sbjct: 154 KIFELALEELGVK--PEEVLMVGDGVNDIPAAKAAG 187
Score = 29.6 bits (66), Expect = 0.55
Identities = 15/121 (12%), Positives = 38/121 (31%), Gaps = 5/121 (4%)
Query: 10 VTHVIFDMDGLLLGYNLAIGALRLINHLHKHNIPFAIATSSAKESFELKTSRHKDTLKLF 69
+ V+FD+DG L + + A+ + + E T ++ L
Sbjct: 1 IKAVVFDLDGTLTDGEPVVPEAEALLEAAA-----ALGVAIVIAAGENLTKEGREELVRR 55
Query: 70 HHVVLGSADPEVKQGKPAPDVFLVAAKRFDEKPQPSKCLVFEDAPNGVLGAKAAGMSCVM 129
+ + + +++ A + ++ A + K AG+ +
Sbjct: 56 LLLRALAGEELLEELLRAGATVVAVLDLVVLGLIALTDPLYPGAREALKELKEAGIKLAI 115
Query: 130 V 130
+
Sbjct: 116 L 116
>gnl|CDD|213672 TIGR01990, bPGM, beta-phosphoglucomutase. This model represents
the beta-phosphoglucomutase enzyme which catalyzes the
interconverison of beta-D-glucose-1-phosphate and
beta-D-glucose-6-phosphate. The 6-phosphate is capable
of non-enzymatic anomerization (alpha <-> beta) while
the 1-phosphate is not. A separate enzyme is responsible
for the isomerization of the alpha anomers.
Beta-D-glucose-1-phosphate results from the
phosphorylysis of maltose (2.4.1.8), trehalose
(2.4.1.64) or trehalose-6-phosphate (2.4.1.216).
Alternatively, these reactions can be run in the
synthetic direction to create the disaccharides. All
sequenced genomes which contain a member of this family
also appear to contain at least one putative maltose or
trehalose phosphorylase. Three species, Lactococcus,
Enterococcus and Neisseria appear to contain a pair of
paralogous beta-PGM's. Beta-phosphoglucomutase is a
member of the haloacid dehalogenase superfamily of
hydrolase enzymes. These enzymes are characterized by a
series of three catalytic motifs positioned within an
alpha-beta (Rossman) fold. beta-PGM contains an inserted
alpha helical domain in between the first and second
conserved motifs and thus is a member of subfamily IA of
the superfamily. The third catalytic motif comes in
three variants, the third of which, containing a
conserved DD or ED, is the only one found here as well
as in several other related enzymes (TIGR01509). The
enzyme from L. lactis has been extensively characterized
including a remarkable crystal structure which traps the
pentacoordinate transition state [Energy metabolism,
Biosynthesis and degradation of polysaccharides].
Length = 185
Score = 68.9 bits (169), Expect = 3e-15
Identities = 39/106 (36%), Positives = 55/106 (51%), Gaps = 16/106 (15%)
Query: 29 GALRLINHLHKHNIPFAIATSSAKESFELKTSRHKDTLKL---FHHVVLGSADP-EVKQG 84
G L+ L K+NI A+A++S L+ L+L F +V DP E+K+G
Sbjct: 91 GIKSLLADLKKNNIKIALASASKNAPTILEK------LELIDYFDAIV----DPAELKKG 140
Query: 85 KPAPDVFLVAAKRFDEKPQPSKCLVFEDAPNGVLGAKAAGMSCVMV 130
KP P++FL AA+ P +C+ EDA G+ KAAGM V V
Sbjct: 141 KPDPEIFLAAAEGLGVSPS--ECIGIEDAQAGIEAIKAAGMFAVGV 184
>gnl|CDD|119389 cd01427, HAD_like, Haloacid dehalogenase-like hydrolases. The
haloacid dehalogenase-like (HAD) superfamily includes
L-2-haloacid dehalogenase, epoxide hydrolase,
phosphoserine phosphatase, phosphomannomutase,
phosphoglycolate phosphatase, P-type ATPase, and many
others, all of which use a nucleophilic aspartate in
their phosphoryl transfer reaction. All members possess
a highly conserved alpha/beta core domain, and many also
possess a small cap domain, the fold and function of
which is variable. Members of this superfamily are
sometimes referred to as belonging to the DDDD
superfamily of phosphohydrolases.
Length = 139
Score = 67.8 bits (166), Expect = 4e-15
Identities = 37/142 (26%), Positives = 53/142 (37%), Gaps = 28/142 (19%)
Query: 13 VIFDMDGLLLGYNLAI----------GALRLINHLHKHNIPFAIATSSAKESFELKTSRH 62
V+FD+DG LL I G + L + I A+AT+ ++ E+
Sbjct: 2 VLFDLDGTLLDSEPGIAEIEELELYPGVKEALKELKEKGIKLALATNKSRR--EVLELLE 59
Query: 63 KDTLKLFHHVVLGSAD--------------PEVKQGKPAPDVFLVAAKRFDEKPQPSKCL 108
+ L + V+ S GKP PD L A K P + L
Sbjct: 60 ELGLDDYFDPVITSNGAAIYYPKEGLFLGGGPFDIGKPNPDKLLAALKLLGVDP--EEVL 117
Query: 109 VFEDAPNGVLGAKAAGMSCVMV 130
+ D+ N + AKAAG V V
Sbjct: 118 MVGDSLNDIEMAKAAGGLGVAV 139
>gnl|CDD|215416 PLN02779, PLN02779, haloacid dehalogenase-like hydrolase family
protein.
Length = 286
Score = 63.2 bits (154), Expect = 2e-12
Identities = 39/140 (27%), Positives = 62/140 (44%), Gaps = 21/140 (15%)
Query: 29 GALRLINHLHKHNIPFAIATSSAKESFE------LKTSRHKDTLKLFHHVVLGSADPEVK 82
G LRL++ I A+ ++S +++ L R L +F A +V
Sbjct: 148 GVLRLMDEALAAGIKVAVCSTSNEKAVSKIVNTLLGPER-AQGLDVF-------AGDDVP 199
Query: 83 QGKPAPDVFLVAAKRFDEKPQPSKCLVFEDAPNGVLGAKAAGMSCVMVPDPTVPKHRTEA 142
+ KP PD++ +AA+ PS+C+V ED+ G+ AKAAGM C++
Sbjct: 200 KKKPDPDIYNLAAETL--GVDPSRCVVVEDSVIGLQAAKAAGMRCIVTKSSYTADEDFSG 257
Query: 143 ADLVLNSL-----EEFKPEL 157
AD V + L E+F
Sbjct: 258 ADAVFDCLGDVPLEDFDLLF 277
>gnl|CDD|215497 PLN02919, PLN02919, haloacid dehalogenase-like hydrolase family
protein.
Length = 1057
Score = 62.6 bits (152), Expect = 5e-12
Identities = 39/102 (38%), Positives = 54/102 (52%), Gaps = 5/102 (4%)
Query: 29 GALRLINHLHKHNIPFAIATSSAKESFELKTSRHKDTLKLFHHVVLGSADPEVKQGKPAP 88
GAL LI + A+A+S+ + + + L +F +V SAD + KPAP
Sbjct: 165 GALELITQCKNKGLKVAVASSADRIKVDANLAAAGLPLSMFDAIV--SADA-FENLKPAP 221
Query: 89 DVFLVAAKRFDEKPQPSKCLVFEDAPNGVLGAKAAGMSCVMV 130
D+FL AAK S+C+V EDA GV A+AAGM C+ V
Sbjct: 222 DIFLAAAKILGVPT--SECVVIEDALAGVQAARAAGMRCIAV 261
>gnl|CDD|222115 pfam13419, HAD_2, Haloacid dehalogenase-like hydrolase.
Length = 176
Score = 59.3 bits (144), Expect = 1e-11
Identities = 24/102 (23%), Positives = 43/102 (42%), Gaps = 6/102 (5%)
Query: 29 GALRLINHLHKHNIPFAIATSSAKESFELKTSRHKDTLKLFHHVVLGSADPEVKQGKPAP 88
+ L+ L + I ++ ++E+ E + L LF V +V KP P
Sbjct: 81 DVVELLRRLKAKGVKLVILSNGSREAVERLLEKLGL-LDLFDAVFTSD---DVGARKPDP 136
Query: 89 DVFLVAAKRFDEKPQPSKCLVFEDAPNGVLGAKAAGMSCVMV 130
+ + +R P + L +D+P + A+AAG+ V V
Sbjct: 137 EAYERVLERLG--LPPEEILFIDDSPEDLEAARAAGIKTVHV 176
>gnl|CDD|215644 PLN03243, PLN03243, haloacid dehalogenase-like hydrolase;
Provisional.
Length = 260
Score = 59.3 bits (143), Expect = 3e-11
Identities = 41/130 (31%), Positives = 66/130 (50%), Gaps = 7/130 (5%)
Query: 24 YNLAIGALRLINHLHKHNIPFAIATSSAKESFELKTSRHKDTLKLFHHVVLGSADPEVKQ 83
Y L G+ + L KH IP A+A++ + E + ++ F VVL + D V +
Sbjct: 108 YRLRPGSREFVQALKKHEIPIAVASTRPRRYLE--RAIEAVGMEGFFSVVLAAED--VYR 163
Query: 84 GKPAPDVFLVAAKRFDEKPQPSKCLVFEDAPNGVLGAKAAGMSCVMVPDPTVPKHRTEAA 143
GKP P++F+ AA+R P+ +C+VF ++ + V A M CV V P + A
Sbjct: 164 GKPDPEMFMYAAERLGFIPE--RCIVFGNSNSSVEAAHDGCMKCVAVAG-KHPVYELSAG 220
Query: 144 DLVLNSLEEF 153
DLV+ L++
Sbjct: 221 DLVVRRLDDL 230
>gnl|CDD|236770 PRK10826, PRK10826, 2-deoxyglucose-6-phosphatase; Provisional.
Length = 222
Score = 58.8 bits (143), Expect = 3e-11
Identities = 26/78 (33%), Positives = 39/78 (50%), Gaps = 9/78 (11%)
Query: 84 GKPAPDVFLVAAKRFDEKPQPSKCLVFEDAPNGVLGAKAAGMSCVMVPDPTVPKHRTEA- 142
KP P+V+L A + P C+ ED+ NG++ AKAA M ++VP P+ + +
Sbjct: 147 SKPHPEVYLNCAAKLG--VDPLTCVALEDSFNGMIAAKAARMRSIVVPA---PEQQNDPR 201
Query: 143 ---ADLVLNSLEEFKPEL 157
AD+ L SL E
Sbjct: 202 WALADVKLESLTELTAAD 219
Score = 28.0 bits (63), Expect = 2.0
Identities = 8/9 (88%), Positives = 8/9 (88%)
Query: 13 VIFDMDGLL 21
IFDMDGLL
Sbjct: 10 AIFDMDGLL 18
>gnl|CDD|215313 PLN02575, PLN02575, haloacid dehalogenase-like hydrolase.
Length = 381
Score = 59.1 bits (143), Expect = 7e-11
Identities = 39/129 (30%), Positives = 68/129 (52%), Gaps = 7/129 (5%)
Query: 24 YNLAIGALRLINHLHKHNIPFAIATSSAKESFELKTSRHKDTLKLFHHVVLGSADPEVKQ 83
Y L G+ +N L + IP A+ ++ +++ E + ++ F V++ + D V +
Sbjct: 215 YRLRTGSQEFVNVLMNYKIPMALVSTRPRKTLE--NAIGSIGIRGFFSVIVAAED--VYR 270
Query: 84 GKPAPDVFLVAAKRFDEKPQPSKCLVFEDAPNGVLGAKAAGMSCVMVPDPTVPKHRTEAA 143
GKP P++F+ AA+ + P+ +C+VF ++ V A A M CV V P + AA
Sbjct: 271 GKPDPEMFIYAAQLLNFIPE--RCIVFGNSNQTVEAAHDARMKCVAVASKH-PIYELGAA 327
Query: 144 DLVLNSLEE 152
DLV+ L+E
Sbjct: 328 DLVVRRLDE 336
>gnl|CDD|182679 PRK10725, PRK10725, fructose-1-P/6-phosphogluconate phosphatase;
Provisional.
Length = 188
Score = 55.5 bits (134), Expect = 4e-10
Identities = 31/70 (44%), Positives = 41/70 (58%), Gaps = 5/70 (7%)
Query: 61 RHKDTLKLFHHVVLGSADPEVKQGKPAPDVFLVAAKRFDEKPQPSKCLVFEDAPNGVLGA 120
H + F VV +AD +V+ KPAPD FL A+ QP++C+VFEDA G+ A
Sbjct: 121 AHLGLRRYFDAVV--AAD-DVQHHKPAPDTFLRCAQLM--GVQPTQCVVFEDADFGIQAA 175
Query: 121 KAAGMSCVMV 130
+AAGM V V
Sbjct: 176 RAAGMDAVDV 185
>gnl|CDD|222003 pfam13242, Hydrolase_like, HAD-hyrolase-like.
Length = 74
Score = 50.0 bits (120), Expect = 5e-09
Identities = 19/73 (26%), Positives = 34/73 (46%), Gaps = 6/73 (8%)
Query: 84 GKPAPDVFLVAAKRFDEKPQPSKCLVFEDAPNGVLGAKAAGMSCVMVP----DPTVPKHR 139
GKP P + A +R P +C++ D+ +L A+AAG+ ++V +
Sbjct: 3 GKPNPGMLRAALERLGVDP--EECVMIGDSDTDILAARAAGIRTILVLTGVTTAEDLERA 60
Query: 140 TEAADLVLNSLEE 152
D V++SL +
Sbjct: 61 PGRPDYVVDSLAD 73
>gnl|CDD|223620 COG0546, Gph, Predicted phosphatases [General function prediction
only].
Length = 220
Score = 49.4 bits (118), Expect = 1e-07
Identities = 35/127 (27%), Positives = 53/127 (41%), Gaps = 10/127 (7%)
Query: 29 GALRLINHLHKHNIPFAIATSSAKESFEL-KTSRHKDTLKLFHHVVLGSADPEVKQGKPA 87
G L+ L I T+ K EL + F V++G D V KP
Sbjct: 93 GVKELLAALKSAGYKLGIVTN--KPERELDILLKALGLADYFD-VIVGGDD--VPPPKPD 147
Query: 88 PDVFLVAAKRFDEKPQPSKCLVFEDAPNGVLGAKAAGMSCVMVPDPTVPKH--RTEAADL 145
P+ L+ ++ P + L+ D+ N +L AKAAG+ V V + AD+
Sbjct: 148 PEPLLLLLEKL--GLDPEEALMVGDSLNDILAAKAAGVPAVGVTWGYNSREELAQAGADV 205
Query: 146 VLNSLEE 152
V++SL E
Sbjct: 206 VIDSLAE 212
>gnl|CDD|223943 COG1011, COG1011, Predicted hydrolase (HAD superfamily) [General
function prediction only].
Length = 229
Score = 43.7 bits (103), Expect = 9e-06
Identities = 24/87 (27%), Positives = 38/87 (43%), Gaps = 7/87 (8%)
Query: 68 LFHHVVLGSADPEVKQGKPAPDVFLVAAKRFDEKPQPSKCLVFED-APNGVLGAKAAGM- 125
F V + +V KP P++F A ++ P+ + L D N +LGA+A GM
Sbjct: 140 YFDAVFISE---DVGVAKPDPEIFEYALEKLGVPPE--EALFVGDSLENDILGARALGMK 194
Query: 126 SCVMVPDPTVPKHRTEAADLVLNSLEE 152
+ + EA D ++SL E
Sbjct: 195 TVWINRGGKPLPDALEAPDYEISSLAE 221
>gnl|CDD|162787 TIGR02253, CTE7, HAD superfamily (subfamily IA) hydrolase,
TIGR02253. This family of sequences from archaea and
metazoans includes the human uncharacterized protein
CTE7. Pyrococcus species appear to have three different
forms of this enzyme, so it is unclear whether all
members of this family have the same function. This
family is a member of the haloacid dehalogenase (HAD)
superfamily of hydrolases which are characterized by
three conserved sequence motifs. By virtue of an alpha
helical domain in-between the first and second conserved
motif, this family is a member of subfamily IA
(TIGR01549).
Length = 221
Score = 43.2 bits (102), Expect = 1e-05
Identities = 23/89 (25%), Positives = 34/89 (38%), Gaps = 9/89 (10%)
Query: 68 LFHHVVLGSADPEVKQGKPAPDVFLVAAKRFDEKPQPSKCLVFEDAP-NGVLGAKAAGMS 126
F V+ E KP P +F A KR KP+ + ++ D + GAK GM
Sbjct: 136 FFDAVITSE---EEGVEKPHPKIFYAALKRLGVKPE--EAVMVGDRLDKDIKGAKNLGMK 190
Query: 127 CVMVP---DPTVPKHRTEAADLVLNSLEE 152
V + + D ++SL E
Sbjct: 191 TVWINQGKSSKMEDDVYPYPDYEISSLRE 219
>gnl|CDD|237310 PRK13222, PRK13222, phosphoglycolate phosphatase; Provisional.
Length = 226
Score = 37.9 bits (89), Expect = 0.001
Identities = 31/135 (22%), Positives = 47/135 (34%), Gaps = 16/135 (11%)
Query: 29 GALRLINHLHKHNIPFAIATSSAKESFELKTSRHKDTLKLFHHVVLGSADPEVKQGKPAP 88
G + L P A+ T+ F F VV+G + KP P
Sbjct: 97 GVKETLAALKAAGYPLAVVTNKP-TPFVAPLLEALGIADYFS-VVIGGDS--LPNKKPDP 152
Query: 89 DVFLVAAKRFDEKPQPSKCLVFEDAPNGVLGAKAAGMSCVMV------PDPTVPKHRTEA 142
L+A ++ P + L D+ N + A+AAG V V +P
Sbjct: 153 APLLLACEKLGL--DPEEMLFVGDSRNDIQAARAAGCPSVGVTYGYNYGEP----IALSE 206
Query: 143 ADLVLNSLEEFKPEL 157
D+V++ E P L
Sbjct: 207 PDVVIDHFAELLPLL 221
>gnl|CDD|215413 PLN02770, PLN02770, haloacid dehalogenase-like hydrolase family
protein.
Length = 248
Score = 37.1 bits (86), Expect = 0.002
Identities = 36/112 (32%), Positives = 53/112 (47%), Gaps = 17/112 (15%)
Query: 26 LAIGALRLINHLHK-------HNIPFAIATSSAKESFELKTSRHKDTLKLFHHVVLGSAD 78
LA L+ +N L+K + A T++ +E+ EL S + F V++GS
Sbjct: 102 LASEQLKPLNGLYKLKKWIEDRGLKRAAVTNAPRENAELMISLLGLS-DFFQAVIIGS-- 158
Query: 79 PEVKQGKPAPDVFLVAAKRFDEKPQPSK--CLVFEDAPNGVLGAKAAGMSCV 128
E + KP PD +L A E + SK VFED+ +G+ AAGM V
Sbjct: 159 -ECEHAKPHPDPYLKAL----EVLKVSKDHTFVFEDSVSGIKAGVAAGMPVV 205
>gnl|CDD|223720 COG0647, NagD, Predicted sugar phosphatases of the HAD superfamily
[Carbohydrate transport and metabolism].
Length = 269
Score = 37.2 bits (87), Expect = 0.002
Identities = 22/86 (25%), Positives = 36/86 (41%), Gaps = 21/86 (24%)
Query: 84 GKPAPDVFLVAAKRFDEKPQPSKCLVFEDAPNG-VLGAKAAGMSCVMV-----------P 131
GKP+P ++ A ++ S+ L+ D + +LGAKAAG+ ++V
Sbjct: 189 GKPSPAIYEAALEKL--GLDRSEVLMVGDRLDTDILGAKAAGLDTLLVLTGVSSAEDLDR 246
Query: 132 DPTVPKHRTEAADLVLNSLEEFKPEL 157
P V++SL E L
Sbjct: 247 AEVKP-------TYVVDSLAELITAL 265
Score = 33.3 bits (77), Expect = 0.034
Identities = 16/50 (32%), Positives = 25/50 (50%), Gaps = 1/50 (2%)
Query: 13 VIFDMDGLLLGYNLAI-GALRLINHLHKHNIPFAIATSSAKESFELKTSR 61
+FD+DG+L N AI GA + L P T+++ S E+ +R
Sbjct: 11 FLFDLDGVLYRGNEAIPGAAEALKRLKAAGKPVIFLTNNSTRSREVVAAR 60
>gnl|CDD|205524 pfam13344, Hydrolase_6, Haloacid dehalogenase-like hydrolase.
This family is part of the HAD superfamily.
Length = 101
Score = 35.5 bits (83), Expect = 0.002
Identities = 13/43 (30%), Positives = 19/43 (44%), Gaps = 1/43 (2%)
Query: 13 VIFDMDG-LLLGYNLAIGALRLINHLHKHNIPFAIATSSAKES 54
+FD+DG L G GA +N L P T+++ S
Sbjct: 1 FLFDVDGVLWRGGEPIPGAAEALNALRAAGKPVVFVTNNSSRS 43
>gnl|CDD|130521 TIGR01454, AHBA_synth_RP, 3-amino-5-hydroxybenoic acid synthesis
related protein. The enzymes in this equivalog are all
located in the operons for the biosynthesis of
3-amino-5-hydroxybenoic acid (AHBA), which is a
precursor of several antibiotics including ansatrienin ,
naphthomycin , rifamycin and mitomycin. The role that
this enzyme plays in this biosynthesis has not been
elucidated. This enzyme is a member of the Haloacid
dehalogenase superfamily (pfam00702) of
aspartate-nucleophile hydrolases. This enzyme is closely
related to phosphoglycolate phosphatase (TIGR01449), but
it is unclear what purpose a PGPase or PGPase-like
activity would serve in these biosyntheses. This model
is limited to the Gram positive Actinobacteria. The most
closely related enzyme below the noise cutoff is IndB
which is involved in the biosynthesis of Indigoidine in
Pectobacterium (Erwinia) chrysanthemi, a gamma
proteobacter. This enzyme is similarly related to PGP.
In this case, too it is unclear what role would be be
played by a PGPase activity.
Length = 205
Score = 36.8 bits (85), Expect = 0.002
Identities = 33/101 (32%), Positives = 47/101 (46%), Gaps = 8/101 (7%)
Query: 29 GALRLINHLHKHNIPFAIAT-SSAKESFELKTSRHKDTLKLFHHVVLGSADPEVKQGKPA 87
G L+ L + AIAT S + L + L LF HV+ GS EV + KPA
Sbjct: 79 GVPELLAELRADGVGTAIATGKSGPRARSLLEAL--GLLPLFDHVI-GS--DEVPRPKPA 133
Query: 88 PDVFLVAAKRFDEKPQPSKCLVFEDAPNGVLGAKAAGMSCV 128
PD+ A + D P+ ++ DA + A+AAG + V
Sbjct: 134 PDIVREALRLLDVPPE--DAVMVGDAVTDLASARAAGTATV 172
>gnl|CDD|233675 TIGR01993, Pyr-5-nucltdase, pyrimidine 5'-nucleotidase. This
family of proteins includes the SDT1/SSM1 gene from
yeast which has been shown to code for a pyrimidine
(UMP/CMP) 5'nucleotidase. The family spans plants, fungi
and a small number of bacteria. These enzymes are
members of the haloacid dehalogenase (HAD) superfamily
of hydrolases, specifically the IA subfamily (variant 3,
TIGR01509).
Length = 183
Score = 35.8 bits (83), Expect = 0.004
Identities = 13/46 (28%), Positives = 23/46 (50%), Gaps = 2/46 (4%)
Query: 85 KPAPDVFLVAAKRFDEKPQPSKCLVFEDAPNGVLGAKAAGMSCVMV 130
KP+P + A + P+ + + F+D+ + KA GM V+V
Sbjct: 140 KPSPQAYEKALREAGVDPE--RAIFFDDSARNIAAGKALGMKTVLV 183
>gnl|CDD|233517 TIGR01662, HAD-SF-IIIA, HAD-superfamily hydrolase, subfamily IIIA.
This subfamily falls within the Haloacid Dehalogenase
(HAD) superfamily of aspartate-nucleophile hydrolases.
The Class III subfamilies are characterized by the lack
of any domains located between either between the first
and second conserved catalytic motifs (as in the Class I
subfamilies, TIGR01493, TIGR01509, TIGR01488 and
TIGR01494) or between the second and third conserved
catalytic motifs (as in the Class II subfamilies,
TIGR01460 and TIGR01484) of the superfamily domain. The
IIIA subfamily contains five major clades:
histidinol-phosphatase (TIGR01261) and
histidinol-phosphatase-related protein (TIGR00213) which
together form a subfamily (TIGR01656), DNA
3'-phosphatase (TIGR01663, TIGR01664), YqeG (TIGR01668)
and YrbI (TIGR01670). In the case of histidinol
phosphatase and PNK-3'-phosphatase, this model
represents a domain of a bifunctional system. In the
histidinol phosphatase HisB, a C-terminal domain is an
imidazoleglycerol-phosphate dehydratase which catalyzes
a related step in histidine biosynthesis. In
PNK-3'-phosphatase, N- and C-terminal domains constitute
the polynucleotide kinase and DNA-binding components of
the enzyme [Unknown function, Enzymes of unknown
specificity].
Length = 132
Score = 33.9 bits (78), Expect = 0.013
Identities = 31/133 (23%), Positives = 53/133 (39%), Gaps = 16/133 (12%)
Query: 13 VIFDMDGLL---LGYN-------LAIGALRLINHLHKHNIPFAIATSSAKESFELKTSRH 62
V+ D+DG L + Y L + L + I T+ + +S
Sbjct: 3 VVLDLDGTLTDDVPYVDDEDERILYPEVPDALAELKEAGYKVVIVTNQSGIGRGKFSSGR 62
Query: 63 --KDTLKLFHHVVLGSADPEVKQGKPAPDVFLVAAKRFDEKPQPSKCLVFEDAPN-GVLG 119
+ +L + + A P + KP P +FL A KRF+E P + + D +
Sbjct: 63 VARRLEELGVPIDILYACPHCR--KPKPGMFLEALKRFNEI-DPEESVYVGDQDLTDLQA 119
Query: 120 AKAAGMSCVMVPD 132
AK AG++ ++V
Sbjct: 120 AKRAGLAFILVAP 132
>gnl|CDD|130516 TIGR01449, PGP_bact, 2-phosphoglycolate phosphatase, prokaryotic.
PGP is an essential enzyme in the glycolate salvage
pathway in higher organisms (photorespiration in
plants). Phosphoglycolate results from the oxidase
activity of RubisCO in the Calvin cycle when
concentrations of carbon dioxide are low relative to
oxygen. In Ralstonia (Alcaligenes) eutropha and
Rhodobacter sphaeroides, the PGP gene (CbbZ) is located
on an operon along with other Calvin cycle enzymes
including RubisCO. The only other pertinent experimental
evidence concerns the gene from E. coli. The in vitro
activity of the Ralstonia and Escherichia enzymes was
determined with crude cell extracts of strains
containing PGP on expression plasmids and compared to
controls. In E. coli, however, there does not appear to
be a functional Calvin cycle (RubisCO is absent),
although the E. coli PGP gene (gph) is on the same
operon (dam) with ribulose-5-phosphate-3-epimerase
(rpe), a gene in the pentose-phosphate pathway (along
with other, unrelated genes). The E. coli enzyme is not
expressed under normal laboratory conditions; the
pathway to which it belongs has not been determined. In
fact, the possibility exists, although unlikely, that
the E. coli enzyme and others within this equivalog have
as their physiological substrate another, closely
related molecule. The other seed chosen for this model,
from Xylella fastidiosa has no experimental evidence,
but is a plant pathogen and thus may obtain
phosphoglycolate from its host. This model has been
restricted to encompass only proteobacteria as no
related PGP has been verified outside of this clade.
Sequences from Aquifex aeolicus and Treponema pallidum
fall between the trusted and noise cutoffs. Just below
the noise cutoff is a gene which is part of the operon
for the biosynthesis of the blue pigment, indigoidine,
from Erwinia (Pectobacterium) chrysanthemi, a plant
pathogen. It does not seem likely, considering the
proposed biosynthetic mechanism, that the
dephosphorylation of phosphoglycolate or a closely
related compound is required. Possibly, this gene is
fortuitously located in this operon, or has an indirect
relationship to the necessity for the biosynthesis of
this compound. Sequences from 11 species have been
annotated as PGP or putative PGP but fall below the
noise cutoff. None of these have experimental
validation. This enzyme is a member of the Haloacid
Dehalogenase (HAD) superfamily of aspartate-nucleophile
hydrolase enzymes (pfam00702) [Energy metabolism,
Sugars].
Length = 213
Score = 34.0 bits (78), Expect = 0.021
Identities = 25/88 (28%), Positives = 41/88 (46%), Gaps = 6/88 (6%)
Query: 72 VVLGSADPEVKQGKPAPDVFLVAAKRFDEKPQPSKCLVFEDAPNGVLGAKAAGM-SCVMV 130
V++G + Q KP PD L+AA+R PQ + + D+ + A+AAG S ++
Sbjct: 130 VLIGGDS--LAQRKPHPDPLLLAAERLGVAPQ--QMVYVGDSRVDIQAARAAGCPSVLLT 185
Query: 131 PDPTVPK-HRTEAADLVLNSLEEFKPEL 157
+ D++ +SL E P L
Sbjct: 186 YGYRYGEAIDLLPPDVLYDSLNELPPLL 213
>gnl|CDD|223635 COG0561, Cof, Predicted hydrolases of the HAD superfamily
[General function prediction only].
Length = 264
Score = 33.2 bits (76), Expect = 0.041
Identities = 11/38 (28%), Positives = 17/38 (44%), Gaps = 2/38 (5%)
Query: 13 VIFDMDGLLLGYNLAIG--ALRLINHLHKHNIPFAIAT 48
+ FD+DG LL N I + L + + +AT
Sbjct: 6 LAFDLDGTLLDSNKTISPETKEALARLREKGVKVVLAT 43
>gnl|CDD|130495 TIGR01428, HAD_type_II, 2-haloalkanoic acid dehalogenase, type II.
Catalyzes the hydrolytic dehalogenation of small
L-2-haloalkanoic acids to yield the corresponding
D-2-hydroxyalkanoic acids. Belongs to the Haloacid
Dehalogenase (HAD) superfamily of aspartate-nucleophile
hydrolases (pfam00702), class (subfamily) I. Note that
the Type I HAD enzymes have not yet been fully
characterized, but clearly utilize a substantially
different catalytic mechanism and are thus unlikely to
be related.
Length = 198
Score = 32.7 bits (75), Expect = 0.056
Identities = 31/130 (23%), Positives = 44/130 (33%), Gaps = 22/130 (16%)
Query: 12 HVIFDM---DGLLLGYNL------AIGALRLINHLHKHNIPFAIATSSAKESFELKTSRH 62
+ D D L Y LR L + AI ++ + + +H
Sbjct: 73 GLEDDESAADRLAEAYLRLPPHPDVPAGLRA---LKERGYRLAILSNGSPAMLK-SLVKH 128
Query: 63 KDTLKLFHHVVLGSADPEVKQGKPAPDVFLVAAKRFDEKPQPSKCLVFEDAPNG--VLGA 120
F V+ SAD V+ KPAP V+ +A + P V A N + GA
Sbjct: 129 AGLDDPFDAVL--SADA-VRAYKPAPQVYQLALEALGVPPD-EVLFV---ASNPWDLGGA 181
Query: 121 KAAGMSCVMV 130
K G V
Sbjct: 182 KKFGFKTAWV 191
>gnl|CDD|236232 PRK08309, PRK08309, short chain dehydrogenase; Provisional.
Length = 177
Score = 32.0 bits (73), Expect = 0.068
Identities = 27/79 (34%), Positives = 39/79 (49%), Gaps = 13/79 (16%)
Query: 20 LLLGYNLAIGALRL-INHLHKHNIPFAIAT----SSAKESFEL---KTSRHKDTLKLFHH 71
L L Y+ AL+L I + N PF +A SSAK++ + + +T +LFH
Sbjct: 52 LPLDYHDD-DALKLAIKSTIEKNGPFDLAVAWIHSSAKDALSVVCRELDGSSETYRLFH- 109
Query: 72 VVLGS--ADPEVKQGKPAP 88
VLGS +DP + K P
Sbjct: 110 -VLGSAASDPRIPSEKIGP 127
>gnl|CDD|171912 PRK13223, PRK13223, phosphoglycolate phosphatase; Provisional.
Length = 272
Score = 32.1 bits (73), Expect = 0.095
Identities = 30/98 (30%), Positives = 41/98 (41%), Gaps = 18/98 (18%)
Query: 37 LHKHNIPFAIATSSAKESF------ELKTSRHKDTLKLFHHVVLGSADPEVKQGKPAPDV 90
L K + A+ T+ E F ++K R+ F ++ G P Q KP P
Sbjct: 113 LKKQGVEMALITNKP-ERFVAPLLDQMKIGRY------FRWIIGGDTLP---QKKPDPAA 162
Query: 91 FLVAAKRFDEKPQPSKCLVFEDAPNGVLGAKAAGMSCV 128
L K P S L D+ + VL AKAAG+ CV
Sbjct: 163 LLFVMKMAGVPPSQS--LFVGDSRSDVLAAKAAGVQCV 198
>gnl|CDD|219777 pfam08282, Hydrolase_3, haloacid dehalogenase-like hydrolase.
This family contains haloacid dehalogenase-like
hydrolase enzymes.
Length = 254
Score = 31.8 bits (73), Expect = 0.11
Identities = 12/38 (31%), Positives = 16/38 (42%), Gaps = 2/38 (5%)
Query: 13 VIFDMDGLLLGYNLAIG--ALRLINHLHKHNIPFAIAT 48
+ D+DG LL + I I L + I IAT
Sbjct: 1 IASDLDGTLLNSDKKISERTKEAIKKLQEKGIKVVIAT 38
>gnl|CDD|237311 PRK13226, PRK13226, phosphoglycolate phosphatase; Provisional.
Length = 229
Score = 29.0 bits (65), Expect = 1.0
Identities = 18/46 (39%), Positives = 22/46 (47%), Gaps = 2/46 (4%)
Query: 85 KPAPDVFLVAAKRFDEKPQPSKCLVFEDAPNGVLGAKAAGMSCVMV 130
KP P LVAA+R P C+ D +L A+AAGM V
Sbjct: 151 KPHPLPLLVAAERIGVAPT--DCVYVGDDERDILAARAAGMPSVAA 194
>gnl|CDD|200170 TIGR02252, DREG-2, REG-2-like, HAD superfamily (subfamily IA)
hydrolase. This family of proteins includes
uncharacterized sequences from eukaryotes, cyanobacteria
and Leptospira as well as the DREG-2 protein from
Drosophila melanogaster which has been identified as a
rhythmically (diurnally) regulated gene. This family is
a member of the Haloacid Dehalogenase (HAD) superfamily
of aspartate-nucleophile hydrolases. The superfamily is
defined by the presence of three short catalytic motifs.
The subfamilies are defined based on the location and
the observed or predicted fold of a so-called 'capping
domain', or the absence of such a domain. This family is
a member of subfamily 1A in which the cap domain
consists of a predicted alpha helical bundle found in
between the first and second catalytic motifs. A
distinctive feature of this family is a conserved tandem
pair of tryptophan residues in the cap domain. The most
divergent sequences included within the scope of this
model are from plants and have "FW" at this position
instead. Most likely, these sequences, like the vast
majority of HAD sequences, represent phosphatase
enzymes.
Length = 203
Score = 28.4 bits (64), Expect = 1.4
Identities = 20/60 (33%), Positives = 25/60 (41%), Gaps = 6/60 (10%)
Query: 68 LFHHVVLGSADPEVKQGKPAPDVFLVAAKRFDEKPQPSKCLVFEDAP-NGVLGAKAAGMS 126
F VV EV KP P +F A +R P + L D+ N GA+AAG
Sbjct: 146 YFDFVVT---SYEVGAEKPDPKIFQEALERAGI--SPEEALHIGDSLRNDYQGARAAGWR 200
>gnl|CDD|181872 PRK09456, PRK09456, ?-D-glucose-1-phosphatase; Provisional.
Length = 199
Score = 28.1 bits (63), Expect = 1.8
Identities = 11/53 (20%), Positives = 25/53 (47%), Gaps = 6/53 (11%)
Query: 85 KPAPDVF--LVAAKRFDEKPQPSKCLVFEDAPNGVLGAKAAGMSCVMVPDPTV 135
KP ++ ++ A+ F + + F+D + + A A G++ ++V D
Sbjct: 141 KPEARIYQHVLQAEGFS----AADAVFFDDNADNIEAANALGITSILVTDKQT 189
>gnl|CDD|200549 cd10923, CE4_COG5298, Putative NodB-like catalytic domain of
uncharacterized proteins found in bacteria. This
family corresponds to a group of uncharacterized
bacterial proteins with high sequence similarity to the
catalytic domain of the six-stranded barrel rhizobial
NodB-like proteins, which remove N-linked or O-linked
acetyl groups from cell wall polysaccharides and belong
to the larger carbohydrate esterase 4 (CE4)
superfamily. Some family members contain an additional
copper amine oxidase N-terminal domain.
Length = 250
Score = 28.1 bits (63), Expect = 2.2
Identities = 6/17 (35%), Positives = 11/17 (64%)
Query: 31 LRLINHLHKHNIPFAIA 47
+ ++L+ NIPF +A
Sbjct: 20 KEIADYLYSENIPFHVA 36
>gnl|CDD|233800 TIGR02247, HAD-1A3-hyp, epoxide hydrolase N-terminal domain-like
phosphatase. This model represents a small clade of
sequences including C. elegans and mammalian sequences
as well as a small number of bacteria. In eukaryotes,
this domain exists as an N-terminal fusion to the
soluble epoxide hydrolase enzyme and has recently been
shown to be an active phosphatase, although the nature
of the biological substrate is unclear. These appear to
be members of the haloacid dehalogenase (HAD)
superfamily of aspartate-nucleophile hydrolases by
general homology and the conservation of all of the
recognized catalytic motifs (although the first motif is
unusual in the replacement of the more common aspartate
with glycine...). The variable domain is found in
between motifs 1 and 2, indicating membership in
subfamily I and phylogeny and prediction of the alpha
helical nature of the variable domain (by PSI-PRED)
indicate membership in subfamily IA.
Length = 211
Score = 27.9 bits (62), Expect = 2.6
Identities = 15/59 (25%), Positives = 27/59 (45%), Gaps = 2/59 (3%)
Query: 85 KPAPDVFLVAAKRFDEKPQPSKCLVFEDAPNGVLGAKAAGMSCVMVPDPTVPKHRTEAA 143
KP P ++ + +R P +C+ +D + + A A G++ + V D H E A
Sbjct: 152 KPDPRIYQLMLERL--GVAPEECVFLDDLGSNLKPAAALGITTIKVSDEEQAIHDLEKA 208
>gnl|CDD|176184 cd05281, TDH, Threonine dehydrogenase. L-threonine dehydrogenase
(TDH) catalyzes the zinc-dependent formation of
2-amino-3-ketobutyrate from L-threonine via NAD(H)-
dependent oxidation. THD is a member of the
zinc-requiring, medium chain NAD(H)-dependent alcohol
dehydrogenase family (MDR). MDRs have a
NAD(P)(H)-binding domain in a Rossmann fold of a
beta-alpha form. NAD(P)(H)-dependent oxidoreductases are
the major enzymes in the interconversion of alcohols and
aldehydes, or ketones. The N-terminal region typically
has an all-beta catalytic domain. These proteins
typically form dimers (typically higher plants, mammals)
or tetramers (yeast, bacteria) and have 2 tightly bound
zinc atoms per subunit. Sorbitol and aldose reductase
are NAD(+) binding proteins of the polyol pathway, which
interconverts glucose and fructose.
Length = 341
Score = 27.6 bits (62), Expect = 3.0
Identities = 19/46 (41%), Positives = 24/46 (52%), Gaps = 8/46 (17%)
Query: 114 PNGVLG---AKAAGMSCVMVPDPTVPKHRTEA----ADLVLNSLEE 152
P G++ AKAAG S V+ DP P A AD+V+N EE
Sbjct: 174 PIGLMAIAVAKAAGASLVIASDPN-PYRLELAKKMGADVVINPREE 218
>gnl|CDD|119314 pfam10794, DUF2606, Protein of unknown function (DUF2606). Family
of bacterial proteins with unknown function. These
proteins have been classified as membrane proteins.
Length = 131
Score = 26.8 bits (59), Expect = 3.7
Identities = 11/27 (40%), Positives = 14/27 (51%)
Query: 80 EVKQGKPAPDVFLVAAKRFDEKPQPSK 106
E +G+P V + K D PQPSK
Sbjct: 49 EDAEGQPIKGVEVTLMKAADSDPQPSK 75
>gnl|CDD|223164 COG0086, RpoC, DNA-directed RNA polymerase, beta' subunit/160 kD
subunit [Transcription].
Length = 808
Score = 27.2 bits (61), Expect = 5.3
Identities = 8/35 (22%), Positives = 14/35 (40%), Gaps = 10/35 (28%)
Query: 128 VMVPDPTVPKHRT----EAADLVLNSLEEFKPELY 158
V+ P + ++ + A LE FKP +
Sbjct: 297 VISVGPNLKLNQCGVPKKMA------LELFKPFVV 325
>gnl|CDD|238606 cd01275, FHIT, FHIT (fragile histidine family): FHIT proteins,
related to the HIT family carry a motif HxHxH/Qxx (x, is
a hydrophobic amino acid), On the basis of sequence,
substrate specificity, structure, evolution and
mechanism, HIT proteins are classified into three
branches: the Hint branch, which consists of adenosine
5' -monophosphoramide hydrolases, the Fhit branch, that
consists of diadenosine polyphosphate hydrolases, and
the GalT branch consisting of specific nucloside
monophosphate transferases. Fhit plays a very important
role in the development of tumours. Infact, Fhit
deletions are among the earliest and most frequent
genetic alterations in the development of tumours.
Length = 126
Score = 26.1 bits (58), Expect = 6.4
Identities = 10/43 (23%), Positives = 19/43 (44%)
Query: 1 MNISQVLNYVTHVIFDMDGLLLGYNLAIGALRLINHLHKHNIP 43
+ Q+ V++ DG +G N ++ H+H H +P
Sbjct: 60 FKLVQLAMKALKVVYKPDGFNIGINDGKAGGGIVPHVHIHIVP 102
>gnl|CDD|226572 COG4087, COG4087, Soluble P-type ATPase [General function
prediction only].
Length = 152
Score = 26.0 bits (57), Expect = 8.1
Identities = 23/106 (21%), Positives = 42/106 (39%), Gaps = 16/106 (15%)
Query: 63 KDTLKLFH---HVVLGSADPE--VKQ-----GKPAPDVFLVA---AKR---FDEKPQPSK 106
+T++ H + + S D + + Q G P VF A K + K + K
Sbjct: 36 SETIQELHDMVDIYIASGDRKGSLVQLAEFVGIPVERVFAGADPEMKAKIIRELKKRYEK 95
Query: 107 CLVFEDAPNGVLGAKAAGMSCVMVPDPTVPKHRTEAADLVLNSLEE 152
++ + N +L + A + + VP+ AD+VL + E
Sbjct: 96 VVMVGNGANDILALREADLGICTIQQEGVPERLLLTADVVLKEIAE 141
>gnl|CDD|233434 TIGR01484, HAD-SF-IIB, HAD-superfamily hydrolase, subfamily IIB.
This subfamily falls within the Haloacid Dehalogenase
(HAD) superfamily of aspartate-nucleophile hydrolases.
The Class II subfamilies are characterized by a domain
that is located between the second and third conserved
catalytic motifs of the superfamily domain. The IIB
subfamily is distinguished from the IIA subfamily
(TIGR01460) by homology and the predicted secondary
structure of this domain by PSI-PRED. The IIB
subfamily's Class II domain has the following predicted
structure: Helix-Sheet-Sheet-(Helix or
Sheet)-Helix-Sheet-(variable)-Helix-Sheet-Sheet. The
IIB subfamily consists of Trehalose-6-phosphatase
(TIGR00685), plant and cyanobacterial
Sucrose-phosphatase and a closely related group of
bacterial and archaeal sequences, eukaryotic
phosphomannomutase (pfam03332), a large subfamily
("Cof-like hydrolases", TIGR00099) containing many
closely related bacterial sequences, a hypothetical
equivalog containing the E. coli YedP protein, as well
as two small clusters containing OMNI|TC0379 and
OMNI|SA2196 whose relationship to the other groups is
unclear [Unknown function, Enzymes of unknown
specificity].
Length = 204
Score = 26.2 bits (58), Expect = 8.7
Identities = 9/42 (21%), Positives = 16/42 (38%), Gaps = 3/42 (7%)
Query: 12 HVIFDMDGLLLGYNLAI---GALRLINHLHKHNIPFAIATSS 50
+ FD+DG LL N + + L + + + T
Sbjct: 1 LLFFDLDGTLLDPNAHELSPETIEALERLREAGVKVVLVTGR 42
>gnl|CDD|173848 cd01159, NcnH, Naphthocyclinone hydroxylase. Naphthocyclinone is
an aromatic polyketide and an antibiotic, which is
active against Gram-positive bacteria. Polyketides are
secondary metabolites, which have important biological
functions such as antitumor, immunosupressive or
antibiotic activities. NcnH is a hydroxylase involved in
the biosynthesis of naphthocyclinone and possibly other
polyketides.
Length = 370
Score = 26.2 bits (58), Expect = 9.9
Identities = 20/71 (28%), Positives = 28/71 (39%), Gaps = 11/71 (15%)
Query: 74 LGSADPEVKQGKPAPDVFLVAAKRFDEKPQPSKCLVFEDAPNGVLGAKAAGMSCVMVPDP 133
L A E G P P F+V ++ V+G + G + V+V D
Sbjct: 139 LVGAIVEDDDGGPLPRAFVVPRAEYEIVDTWH-----------VVGLRGTGSNTVVVDDV 187
Query: 134 TVPKHRTEAAD 144
VP+HRT A
Sbjct: 188 FVPEHRTLTAG 198
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.137 0.404
Gapped
Lambda K H
0.267 0.0716 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 8,626,173
Number of extensions: 780075
Number of successful extensions: 871
Number of sequences better than 10.0: 1
Number of HSP's gapped: 834
Number of HSP's successfully gapped: 66
Length of query: 166
Length of database: 10,937,602
Length adjustment: 90
Effective length of query: 76
Effective length of database: 6,945,742
Effective search space: 527876392
Effective search space used: 527876392
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 55 (25.0 bits)