RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy5844
(442 letters)
>gnl|CDD|143459 cd07141, ALDH_F1AB_F2_RALDH1, NAD+-dependent retinal dehydrogenase
1, ALDH families 1A, 1B, and 2-like. NAD+-dependent
retinal dehydrogenase 1 (RALDH 1, ALDH1, EC=1.2.1.36)
also known as aldehyde dehydrogenase family 1 member A1
(ALDH1A1) in humans, is a homotetrameric, cytosolic
enzyme that catalyzes the oxidation of retinaldehyde to
retinoic acid. Human ALDH1B1 and ALDH2 are also in this
cluster; both are mitochrondrial homotetramers which
play important roles in acetaldehyde oxidation; ALDH1B1
in response to UV light exposure and ALDH2 during
ethanol metabolism.
Length = 481
Score = 689 bits (1779), Expect = 0.0
Identities = 258/348 (74%), Positives = 299/348 (85%), Gaps = 1/348 (0%)
Query: 50 SSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVC 109
+SLETLDNGKP++ SY VD+ +++ LRYYAG+ADKIHGKTIP++GDFF YTRHEPVGVC
Sbjct: 90 ASLETLDNGKPFSKSYLVDLPGAIKVLRYYAGWADKIHGKTIPMDGDFFTYTRHEPVGVC 149
Query: 110 GQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVP 169
GQIIPWNFPLLM AWK+APALA GNT+VLKPAEQTPLTAL + +LI EAG P GVVN+VP
Sbjct: 150 GQIIPWNFPLLMAAWKLAPALACGNTVVLKPAEQTPLTALYLASLIKEAGFPPGVVNVVP 209
Query: 170 GYGD-AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADAD 228
GYG AGAA+ HP +DKVAFTGSTEVGKL+QQ +G SNLKR TLELGGKSPNI+FADAD
Sbjct: 210 GYGPTAGAAISSHPDIDKVAFTGSTEVGKLIQQAAGKSNLKRVTLELGGKSPNIVFADAD 269
Query: 229 LDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQG 288
LD AVE AH LFFNMGQCCCAGSRTFVQ++IYDEFV RS ERAK+R VG+PFD EQG
Sbjct: 270 LDYAVEQAHEALFFNMGQCCCAGSRTFVQESIYDEFVKRSVERAKKRVVGNPFDPKTEQG 329
Query: 289 PQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFG 348
PQID+EQ KILE IESGK +GA+L GG R GDKG+++QPTVF++V DDM+IA+EEIFG
Sbjct: 330 PQIDEEQFKKILELIESGKKEGAKLECGGKRHGDKGYFIQPTVFSDVTDDMRIAKEEIFG 389
Query: 349 PVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
PVQQ+ +F +I+EVIERANN+ YGLAAAVFTKD+DK + LRAGT
Sbjct: 390 PVQQIFKFKTIDEVIERANNTTYGLAAAVFTKDIDKAITFSNALRAGT 437
Score = 38.5 bits (90), Expect = 0.007
Identities = 9/20 (45%), Positives = 17/20 (85%)
Query: 31 KPKIEHTQLFINNEFVNAQS 50
P+I++T++FINNE+ ++ S
Sbjct: 1 NPEIKYTKIFINNEWHDSVS 20
>gnl|CDD|143410 cd07091, ALDH_F1-2_Ald2-like, ALDH subfamily: ALDH families 1and 2,
including 10-formyltetrahydrofolate dehydrogenase,
NAD+-dependent retinal dehydrogenase 1 and related
proteins. ALDH subfamily which includes the
NAD+-dependent retinal dehydrogenase 1 (RALDH 1, ALDH1,
EC=1.2.1.36), also known as aldehyde dehydrogenase
family 1 member A1 (ALDH1A1), in humans, a
homotetrameric, cytosolic enzyme that catalyzes the
oxidation of retinaldehyde to retinoic acid. Human
ALDH1B1 and ALDH2 are also in this cluster; both are
mitochrondrial homotetramers which play important roles
in acetaldehyde oxidation; ALDH1B1 in response to UV
light exposure and ALDH2 during ethanol metabolism.
10-formyltetrahydrofolate dehydrogenase (FTHFDH,
EC=1.5.1.6), also known as aldehyde dehydrogenase family
1 member L1 (ALDH1L1), in humans, a multi-domain
homotetramer with an N-terminal formyl transferase
domain and a C-terminal ALDH domain. FTHFDH catalyzes an
NADP+-dependent dehydrogenase reaction resulting in the
conversion of 10-formyltetrahydrofolate to
tetrahydrofolate and CO2. Also included in this
subfamily is the Arabidosis aldehyde dehydrogenase
family 2 members B4 and B7 (EC=1.2.1.3), which are
mitochondrial, homotetramers that oxidize acetaldehyde
and glycolaldehyde, as well as, the Arabidosis
cytosolic, homotetramer ALDH2C4 (EC=1.2.1.3), an enzyme
involved in the oxidation of sinapalehyde and
coniferaldehyde. Also included is the AldA aldehyde
dehydrogenase of Aspergillus nidulans (locus AN0554),
the aldehyde dehydrogenase 2 (YMR170c, ALD5, EC=1.2.1.5)
of Saccharomyces cerevisiae, and other similar
sequences.
Length = 476
Score = 642 bits (1659), Expect = 0.0
Identities = 230/348 (66%), Positives = 282/348 (81%), Gaps = 1/348 (0%)
Query: 50 SSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVC 109
++LE+LDNGKP S DV S++ LRYYAG+ADKI GKTIPI+G+F AYTR EP+GVC
Sbjct: 86 AALESLDNGKPLEESAKGDVALSIKCLRYYAGWADKIQGKTIPIDGNFLAYTRREPIGVC 145
Query: 110 GQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVP 169
GQIIPWNFPLLMLAWK+APALA GNT+VLKPAEQTPL+AL + LI EAG P GVVNIVP
Sbjct: 146 GQIIPWNFPLLMLAWKLAPALAAGNTVVLKPAEQTPLSALYLAELIKEAGFPPGVVNIVP 205
Query: 170 GYGD-AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADAD 228
G+G AGAA+ H VDK+AFTGST VG+ + + + SNLK+ TLELGGKSPNI+F DAD
Sbjct: 206 GFGPTAGAAISSHMDVDKIAFTGSTAVGRTIMEAAAKSNLKKVTLELGGKSPNIVFDDAD 265
Query: 229 LDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQG 288
LD AVE A FG+FFN GQCCCAGSR FVQ++IYDEFV + RA++R VGDPFD + QG
Sbjct: 266 LDKAVEWAAFGIFFNQGQCCCAGSRIFVQESIYDEFVEKFKARAEKRVVGDPFDPDTFQG 325
Query: 289 PQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFG 348
PQ+ K Q DKIL +IESGK +GA L+ GG R G KG+++QPTVF +V+DDMKIA+EEIFG
Sbjct: 326 PQVSKAQFDKILSYIESGKKEGATLLTGGERHGSKGYFIQPTVFTDVKDDMKIAKEEIFG 385
Query: 349 PVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
PV +++F + +EVIERAN+++YGLAA VFTKD++K V++ L+AGT
Sbjct: 386 PVVTILKFKTEDEVIERANDTEYGLAAGVFTKDINKALRVSRALKAGT 433
Score = 34.1 bits (79), Expect = 0.13
Identities = 11/21 (52%), Positives = 14/21 (66%), Gaps = 1/21 (4%)
Query: 36 HTQLFINNEFVNAQSSLETLD 56
T LFINNEFV++ S +T
Sbjct: 3 PTGLFINNEFVDSVSG-KTFP 22
>gnl|CDD|143460 cd07142, ALDH_F2BC, Arabidosis aldehyde dehydrogenase family 2 B4,
B7, C4-like. Included in this CD is the Arabidosis
aldehyde dehydrogenase family 2 members B4 and B7
(EC=1.2.1.3), which are mitochondrial homotetramers
that oxidize acetaldehyde and glycolaldehyde, but not
L-lactaldehyde. Also in this group, is the Arabidosis
cytosolic, homotetramer ALDH2C4 (EC=1.2.1.3), an enzyme
involved in the oxidation of sinapalehyde and
coniferaldehyde.
Length = 476
Score = 560 bits (1446), Expect = 0.0
Identities = 223/349 (63%), Positives = 276/349 (79%), Gaps = 1/349 (0%)
Query: 50 SSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVC 109
++LET DNGKPY + +V + R RYYAG+ADKIHG T+P +G YT HEP+GV
Sbjct: 86 AALETWDNGKPYEQARYAEVPLAARLFRYYAGWADKIHGMTLPADGPHHVYTLHEPIGVV 145
Query: 110 GQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVP 169
GQIIPWNFPLLM AWK+ PALA GNTIVLKPAEQTPL+AL L AEAG+P+GV+NIV
Sbjct: 146 GQIIPWNFPLLMFAWKVGPALACGNTIVLKPAEQTPLSALLAAKLAAEAGLPDGVLNIVT 205
Query: 170 GYGD-AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADAD 228
G+G AGAA+ H VDKVAFTGSTEVGK++ Q + SNLK TLELGGKSP I+ DAD
Sbjct: 206 GFGPTAGAAIASHMDVDKVAFTGSTEVGKIIMQLAAKSNLKPVTLELGGKSPFIVCEDAD 265
Query: 229 LDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQG 288
+D AVE AHF LFFN GQCCCAGSRTFV ++IYDEFV ++ RA +R VGDPF VEQG
Sbjct: 266 VDKAVELAHFALFFNQGQCCCAGSRTFVHESIYDEFVEKAKARALKRVVGDPFRKGVEQG 325
Query: 289 PQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFG 348
PQ+DKEQ +KIL +IE GK +GA L+ GG R G KG+Y+QPT+F++V+DDMKIAR+EIFG
Sbjct: 326 PQVDKEQFEKILSYIEHGKEEGATLITGGDRIGSKGYYIQPTIFSDVKDDMKIARDEIFG 385
Query: 349 PVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
PVQ +++F +++EVI+RANNS YGLAA VF+K++D N +++ L+AGT+
Sbjct: 386 PVQSILKFKTVDEVIKRANNSKYGLAAGVFSKNIDTANTLSRALKAGTV 434
Score = 33.2 bits (76), Expect = 0.25
Identities = 12/25 (48%), Positives = 19/25 (76%), Gaps = 2/25 (8%)
Query: 34 IEHTQLFINNEFVNAQS--SLETLD 56
++HT+LFIN +FV+A S + T+D
Sbjct: 1 VKHTKLFINGQFVDAASGKTFPTID 25
>gnl|CDD|215259 PLN02466, PLN02466, aldehyde dehydrogenase family 2 member.
Length = 538
Score = 528 bits (1362), Expect = 0.0
Identities = 215/349 (61%), Positives = 263/349 (75%), Gaps = 1/349 (0%)
Query: 50 SSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVC 109
++LET DNGKPY S ++ R RYYAG+ADKIHG T+P +G T HEP+GV
Sbjct: 140 AALETWDNGKPYEQSAKAELPMFARLFRYYAGWADKIHGLTVPADGPHHVQTLHEPIGVA 199
Query: 110 GQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVP 169
GQIIPWNFPLLM AWK+ PALA GNTIVLK AEQTPL+AL L+ EAG+P GV+N+V
Sbjct: 200 GQIIPWNFPLLMFAWKVGPALACGNTIVLKTAEQTPLSALYAAKLLHEAGLPPGVLNVVS 259
Query: 170 GYG-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADAD 228
G+G AGAAL H VDK+AFTGST+ GK+V + + SNLK TLELGGKSP I+ DAD
Sbjct: 260 GFGPTAGAALASHMDVDKLAFTGSTDTGKIVLELAAKSNLKPVTLELGGKSPFIVCEDAD 319
Query: 229 LDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQG 288
+D AVE AHF LFFN GQCCCAGSRTFV + +YDEFV ++ RA +R VGDPF VEQG
Sbjct: 320 VDKAVELAHFALFFNQGQCCCAGSRTFVHERVYDEFVEKAKARALKRVVGDPFKKGVEQG 379
Query: 289 PQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFG 348
PQID EQ +KIL +I+SG GA L GG R G KG+Y+QPTVF+NV+DDM IA++EIFG
Sbjct: 380 PQIDSEQFEKILRYIKSGVESGATLECGGDRFGSKGYYIQPTVFSNVQDDMLIAQDEIFG 439
Query: 349 PVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
PVQ +++F ++EVI RANN+ YGLAA VFT++LD N +++ LR GT+
Sbjct: 440 PVQSILKFKDLDEVIRRANNTRYGLAAGVFTQNLDTANTLSRALRVGTV 488
Score = 42.9 bits (101), Expect = 2e-04
Identities = 23/58 (39%), Positives = 34/58 (58%), Gaps = 5/58 (8%)
Query: 4 LRNLALKSSTLGSLKNYS--AAAVPKPITKP-KIEHTQLFINNEFVNAQS--SLETLD 56
LR+ ++ +S AAAV +PIT P ++ +TQL IN +FV+A S + TLD
Sbjct: 22 LRSRGRNGGRGRGIRRFSTAAAAVEEPITPPVQVSYTQLLINGQFVDAASGKTFPTLD 79
>gnl|CDD|215767 pfam00171, Aldedh, Aldehyde dehydrogenase family. This family of
dehydrogenases act on aldehyde substrates. Members use
NADP as a cofactor. The family includes the following
members: The prototypical members are the aldehyde
dehydrogenases EC:1.2.1.3. Succinate-semialdehyde
dehydrogenase EC:1.2.1.16. Lactaldehyde dehydrogenase
EC:1.2.1.22. Benzaldehyde dehydrogenase EC:1.2.1.28.
Methylmalonate-semialdehyde dehydrogenase EC:1.2.1.27.
Glyceraldehyde-3-phosphate dehydrogenase EC:1.2.1.9.
Delta-1-pyrroline-5-carboxylate dehydrogenase EC:
1.5.1.12. Acetaldehyde dehydrogenase EC:1.2.1.10.
Glutamate-5-semialdehyde dehydrogenase EC:1.2.1.41. This
family also includes omega crystallin, an eye lens
protein from squid and octopus that has little aldehyde
dehydrogenase activity.
Length = 459
Score = 523 bits (1350), Expect = 0.0
Identities = 196/347 (56%), Positives = 250/347 (72%), Gaps = 3/347 (0%)
Query: 52 LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQ 111
LETL+ GKP + +V ++ TLRYYAG A K+ G+T+P + AYTR EP+GV G
Sbjct: 74 LETLETGKPLAEAR-GEVPRAIDTLRYYAGLARKLEGETLPSDPGVLAYTRREPLGVVGA 132
Query: 112 IIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGY 171
I PWNFPLL+ AWKIAPALA GNT+VLKP+E TPLTAL + L EAG+P GV+N+V G
Sbjct: 133 ITPWNFPLLLAAWKIAPALAAGNTVVLKPSELTPLTALLLAELFEEAGLPPGVLNVVTGS 192
Query: 172 G-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLD 230
G + G ALV+HP VDKV+FTGSTEVG+ + + + A NLKR TLELGGK+P I+F DADLD
Sbjct: 193 GSEVGDALVEHPDVDKVSFTGSTEVGRRIAK-AAAKNLKRVTLELGGKNPLIVFDDADLD 251
Query: 231 AAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQ 290
AAVEGA FG F N GQ C AGSR V ++IYDEFV R E AK VGDP D + + GP
Sbjct: 252 AAVEGAVFGAFGNAGQVCTAGSRLLVHESIYDEFVERLVEAAKSLKVGDPLDPDTDIGPL 311
Query: 291 IDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPV 350
I K+Q +++L +IE K +GA+L+ GG +KG++V+PTV A+V DM+IA+EEIFGPV
Sbjct: 312 ISKKQRERVLSYIEDAKEEGAKLLCGGEAGLEKGYFVEPTVLADVTPDMRIAQEEIFGPV 371
Query: 351 QQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
+I F +E IE AN+++YGLAA VFT DL++ V + L AG +
Sbjct: 372 LSVIPFKDEDEAIELANDTEYGLAAGVFTNDLERALRVARRLEAGMV 418
>gnl|CDD|143461 cd07143, ALDH_AldA_AN0554, Aspergillus nidulans aldehyde
dehydrogenase, AldA (AN0554)-like. NAD(P)+-dependent
aldehyde dehydrogenase (AldA) of Aspergillus nidulans
(locus AN0554), and other similar sequences, are present
in this CD.
Length = 481
Score = 498 bits (1284), Expect = e-175
Identities = 201/348 (57%), Positives = 260/348 (74%), Gaps = 1/348 (0%)
Query: 50 SSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVC 109
+S+E LDNGK + + VDV+ S T RYY G+ADKIHG+ I + YTRHEP+GVC
Sbjct: 89 ASIEALDNGKTFGTAKRVDVQASADTFRYYGGWADKIHGQVIETDIKKLTYTRHEPIGVC 148
Query: 110 GQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVP 169
GQIIPWNFPLLM AWKIAPALA GNTIVLKP+E TPL+AL + LI EAG P GV+N+V
Sbjct: 149 GQIIPWNFPLLMCAWKIAPALAAGNTIVLKPSELTPLSALYMTKLIPEAGFPPGVINVVS 208
Query: 170 GYGD-AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADAD 228
GYG G A+ H +DKVAFTGST VG+ V + + SNLK+ TLELGGKSPNI+F DAD
Sbjct: 209 GYGRTCGNAISSHMDIDKVAFTGSTLVGRKVMEAAAKSNLKKVTLELGGKSPNIVFDDAD 268
Query: 229 LDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQG 288
L++AV +G+FFN GQ CCAGSR +VQ+ IYD+FV R E+AK+ VGDPF + QG
Sbjct: 269 LESAVVWTAYGIFFNHGQVCCAGSRIYVQEGIYDKFVKRFKEKAKKLKVGDPFAEDTFQG 328
Query: 289 PQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFG 348
PQ+ + Q ++I+ +IESGK++GA + GG R G++G++++PT+F +V +DMKI +EEIFG
Sbjct: 329 PQVSQIQYERIMSYIESGKAEGATVETGGKRHGNEGYFIEPTIFTDVTEDMKIVKEEIFG 388
Query: 349 PVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
PV +I+F + EE I+RAN+S YGLAAAVFT +++ V L+AGT
Sbjct: 389 PVVAVIKFKTEEEAIKRANDSTYGLAAAVFTNNINNAIRVANALKAGT 436
>gnl|CDD|143397 cd07078, ALDH, NAD(P)+ dependent aldehyde dehydrogenase family.
The aldehyde dehydrogenase family (ALDH) of NAD(P)+
dependent enzymes, in general, oxidize a wide range of
endogenous and exogenous aliphatic and aromatic
aldehydes to their corresponding carboxylic acids and
play an important role in detoxification. Besides
aldehyde detoxification, many ALDH isozymes possess
multiple additional catalytic and non-catalytic
functions such as participating in metabolic pathways,
or as binding proteins, or as osmoregulants, to mention
a few. The enzyme has three domains, a NAD(P)+
cofactor-binding domain, a catalytic domain, and a
bridging domain; and the active enzyme is generally
either homodimeric or homotetrameric. The catalytic
mechanism is proposed to involve cofactor binding,
resulting in a conformational change and activation of
an invariant catalytic cysteine nucleophile. The
cysteine and aldehyde substrate form an oxyanion
thiohemiacetal intermediate resulting in hydride
transfer to the cofactor and formation of a
thioacylenzyme intermediate. Hydrolysis of the
thioacylenzyme and release of the carboxylic acid
product occurs, and in most cases, the reduced cofactor
dissociates from the enzyme. The evolutionary
phylogenetic tree of ALDHs appears to have an initial
bifurcation between what has been characterized as the
classical aldehyde dehydrogenases, the ALDH family
(ALDH) and extended family members or aldehyde
dehydrogenase-like (ALDH-like) proteins. The ALDH
proteins are represented by enzymes which share a number
of highly conserved residues necessary for catalysis and
cofactor binding and they include such proteins as
retinal dehydrogenase, 10-formyltetrahydrofolate
dehydrogenase, non-phosphorylating glyceraldehyde
3-phosphate dehydrogenase,
delta(1)-pyrroline-5-carboxylate dehydrogenases,
alpha-ketoglutaric semialdehyde dehydrogenase,
alpha-aminoadipic semialdehyde dehydrogenase, coniferyl
aldehyde dehydrogenase and succinate-semialdehyde
dehydrogenase. Included in this larger group are all
human, Arabidopsis, Tortula, fungal, protozoan, and
Drosophila ALDHs identified in families ALDH1 through
ALDH22 with the exception of families ALDH18, ALDH19,
and ALDH20 which are present in the ALDH-like group.
Length = 432
Score = 495 bits (1276), Expect = e-174
Identities = 194/348 (55%), Positives = 244/348 (70%), Gaps = 5/348 (1%)
Query: 52 LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPI-NGDFFAYTRHEPVGVCG 110
LETL+ GKP + +V + T RYYAG A ++HG+ IP + A R EP+GV G
Sbjct: 43 LETLETGKPIEEAL-GEVARAADTFRYYAGLARRLHGEVIPSPDPGELAIVRREPLGVVG 101
Query: 111 QIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPG 170
I PWNFPLL+ AWK+APALA GNT+VLKP+E TPLTAL + L+AEAG+P GV+N+V G
Sbjct: 102 AITPWNFPLLLAAWKLAPALAAGNTVVLKPSELTPLTALLLAELLAEAGLPPGVLNVVTG 161
Query: 171 YGD-AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADL 229
GD GAAL HP VDK++FTGST VGK + + + A NLKR TLELGGKSP I+F DADL
Sbjct: 162 DGDEVGAALASHPRVDKISFTGSTAVGKAIMR-AAAENLKRVTLELGGKSPLIVFDDADL 220
Query: 230 DAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGP 289
DAAV+GA FG F N GQ C A SR V ++IYDEFV R ER K VG+P D + + GP
Sbjct: 221 DAAVKGAVFGAFGNAGQVCTAASRLLVHESIYDEFVERLVERVKALKVGNPLDPDTDMGP 280
Query: 290 QIDKEQMDKILEFIESGKSQGAQLVAGGGR-AGDKGFYVQPTVFANVRDDMKIAREEIFG 348
I Q+D++L +IE K++GA+L+ GG R G KG++V PTV +V DM IA+EEIFG
Sbjct: 281 LISAAQLDRVLAYIEDAKAEGAKLLCGGKRLEGGKGYFVPPTVLTDVDPDMPIAQEEIFG 340
Query: 349 PVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
PV +I F EE IE AN+++YGLAA VFT+DL++ V + L AGT
Sbjct: 341 PVLPVIPFKDEEEAIELANDTEYGLAAGVFTRDLERALRVAERLEAGT 388
>gnl|CDD|143430 cd07112, ALDH_GABALDH-PuuC, Escherichia coli NADP+-dependent
gamma-glutamyl-gamma-aminobutyraldehyde dehydrogenase
PuuC-like. NADP+-dependent,
gamma-glutamyl-gamma-aminobutyraldehyde dehydrogenase
(GABALDH) PuuC of Escherichia coli which catalyzes the
conversion of putrescine to 4-aminobutanoate and other
similar sequences are present in this CD.
Length = 462
Score = 483 bits (1246), Expect = e-169
Identities = 191/349 (54%), Positives = 240/349 (68%), Gaps = 4/349 (1%)
Query: 52 LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQ 111
LETLD GKP + + +VDV + T R+YA DK++G+ P D A EP+GV G
Sbjct: 71 LETLDMGKPISDALAVDVPSAANTFRWYAEAIDKVYGEVAPTGPDALALITREPLGVVGA 130
Query: 112 IIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGY 171
++PWNFPLLM AWKIAPALA GN++VLKPAEQ+PLTAL + L EAG+P GV+N+VPG+
Sbjct: 131 VVPWNFPLLMAAWKIAPALAAGNSVVLKPAEQSPLTALRLAELALEAGLPAGVLNVVPGF 190
Query: 172 GD-AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADA-DL 229
G AG AL H VD +AFTGSTEVG+ + SG SNLKR LE GGKSPNI+FADA DL
Sbjct: 191 GHTAGEALGLHMDVDALAFTGSTEVGRRFLEYSGQSNLKRVWLECGGKSPNIVFADAPDL 250
Query: 230 DAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGP 289
DAA E A G+F+N G+ C AGSR V ++I DEF+ + A+ GDP D G
Sbjct: 251 DAAAEAAAAGIFWNQGEVCSAGSRLLVHESIKDEFLEKVVAAAREWKPGDPLDPATRMGA 310
Query: 290 QIDKEQMDKILEFIESGKSQGAQLVAGGGR--AGDKGFYVQPTVFANVRDDMKIAREEIF 347
+ + DK+L +IESGK++GA+LVAGG R GF+V+PTVF V DM+IAREEIF
Sbjct: 311 LVSEAHFDKVLGYIESGKAEGARLVAGGKRVLTETGGFFVEPTVFDGVTPDMRIAREEIF 370
Query: 348 GPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
GPV +I F S EE + AN+S YGLAA+V+T DL + + V + LRAGT
Sbjct: 371 GPVLSVITFDSEEEAVALANDSVYGLAASVWTSDLSRAHRVARRLRAGT 419
>gnl|CDD|143462 cd07144, ALDH_ALD2-YMR170C, Saccharomyces cerevisiae aldehyde
dehydrogenase 2 (YMR170c)-like. NAD(P)+-dependent
Saccharomyces cerevisiae aldehyde dehydrogenase 2
(YMR170c, ALD5, EC=1.2.1.5) and other similar sequences,
are present in this CD.
Length = 484
Score = 472 bits (1217), Expect = e-165
Identities = 195/372 (52%), Positives = 260/372 (69%), Gaps = 14/372 (3%)
Query: 51 SLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCG 110
++E LD+GKPY+ + D++ + +RYYAG+ADKI GKTIP + + AYT HEP GVCG
Sbjct: 90 AIEALDSGKPYHSNALGDLDEIIAVIRYYAGWADKIQGKTIPTSPNKLAYTLHEPYGVCG 149
Query: 111 QIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPG 170
QIIPWN+PL M AWK+APALA GNT+V+KPAE TPL+ L L+ EAG P GVVNI+PG
Sbjct: 150 QIIPWNYPLAMAAWKLAPALAAGNTVVIKPAENTPLSLLYFANLVKEAGFPPGVVNIIPG 209
Query: 171 YGD-AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADL 229
YG AG+AL +HP VDK+AFTGST G+LV + + A NLK TLE GGKSP ++F DADL
Sbjct: 210 YGAVAGSALAEHPDVDKIAFTGSTATGRLVMKAA-AQNLKAVTLECGGKSPALVFEDADL 268
Query: 230 DAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRT-VGDPFDLNVEQG 288
D AV+ A G+ +N GQ C A SR +VQ++IYD+FV + E K+ VG PFD + G
Sbjct: 269 DQAVKWAAAGIMYNSGQNCTATSRIYVQESIYDKFVEKFVEHVKQNYKVGSPFDDDTVVG 328
Query: 289 PQIDKEQMDKILEFIESGKSQGAQLVAGG---GRAGDKGFYVQPTVFANVRDDMKIAREE 345
PQ+ K Q D++L +IE GK +GA+LV GG KG+++ PT+F +V DM+I +EE
Sbjct: 329 PQVSKTQYDRVLSYIEKGKKEGAKLVYGGEKAPEGLGKGYFIPPTIFTDVPQDMRIVKEE 388
Query: 346 IFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTLIRFSSIEE 405
IFGPV + +F + EE I++AN++ YGLAAAVFTKD+ + + V + L AG +
Sbjct: 389 IFGPVVVISKFKTYEEAIKKANDTTYGLAAAVFTKDIRRAHRVARELEAGMVW------- 441
Query: 406 VIERANNSDYGL 417
I +N+SD G+
Sbjct: 442 -INSSNDSDVGV 452
>gnl|CDD|143432 cd07114, ALDH_DhaS, Uncharacterized Candidatus pelagibacter
aldehyde dehydrogenase, DhaS-like. Uncharacterized
aldehyde dehydrogenase from Candidatus pelagibacter
(DhaS) and other related sequences are present in this
CD.
Length = 457
Score = 470 bits (1212), Expect = e-164
Identities = 175/352 (49%), Positives = 230/352 (65%), Gaps = 10/352 (2%)
Query: 52 LETLDNGKPYN-MSYSVDVEYSLRTLRYYAGFADKIHGKTIPIN-GDFFAYTRHEPVGVC 109
LET DNGK V Y RYYAG ADKI G IP++ GD+ +TR EP+GV
Sbjct: 66 LETRDNGKLIRETRAQV--RYLAEWYRYYAGLADKIEGAVIPVDKGDYLNFTRREPLGVV 123
Query: 110 GQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVP 169
I PWN PLL+LA K+APALA GNT+VLKP+E TP + L + L EAG P GVVN+V
Sbjct: 124 AAITPWNSPLLLLAKKLAPALAAGNTVVLKPSEHTPASTLELAKLAEEAGFPPGVVNVVT 183
Query: 170 GYG-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADAD 228
G+G + G ALV+HP V K+AFTG TE G+ + + + A NL TLELGGKSPNI+F DAD
Sbjct: 184 GFGPETGEALVEHPLVAKIAFTGGTETGRHIARAA-AENLAPVTLELGGKSPNIVFDDAD 242
Query: 229 LDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQG 288
LDAAV G G+F GQ C AGSR VQ +IYDEFV R RA+ VGDP D + G
Sbjct: 243 LDAAVNGVVAGIFAAAGQTCVAGSRLLVQRSIYDEFVERLVARARAIRVGDPLDPETQMG 302
Query: 289 PQIDKEQMDKILEFIESGKSQGAQLVAGGGRAG----DKGFYVQPTVFANVRDDMKIARE 344
P + Q++K+ ++ + +GA+++ GG R G++ +PT+ A+V +DM+IA+E
Sbjct: 303 PLATERQLEKVERYVARAREEGARVLTGGERPSGADLGAGYFFEPTILADVTNDMRIAQE 362
Query: 345 EIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
E+FGPV +I F EE I AN+S+YGLAA ++T+DL + + V + + AGT
Sbjct: 363 EVFGPVLSVIPFDDEEEAIALANDSEYGLAAGIWTRDLARAHRVARAIEAGT 414
>gnl|CDD|143437 cd07119, ALDH_BADH-GbsA, Bacillus subtilis NAD+-dependent betaine
aldehyde dehydrogenase-like. Included in this CD is the
NAD+-dependent, betaine aldehyde dehydrogenase (BADH,
GbsA, EC=1.2.1.8) of Bacillus subtilis involved in the
synthesis of the osmoprotectant glycine betaine from
choline or glycine betaine aldehyde.
Length = 482
Score = 466 bits (1202), Expect = e-162
Identities = 174/351 (49%), Positives = 234/351 (66%), Gaps = 7/351 (1%)
Query: 52 LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQ 111
LETL+ GK S D++ RYYAG A K G+ + + T EPVGVCG
Sbjct: 82 LETLNTGKTLRESEI-DIDDVANCFRYYAGLATKETGEVYDVPPHVISRTVREPVGVCGL 140
Query: 112 IIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGY 171
I PWN+PLL AWK+APALA GNT+V+KP+E TPLT +A+ LI EAG+P GVVN+V G
Sbjct: 141 ITPWNYPLLQAAWKLAPALAAGNTVVIKPSEVTPLTTIALFELIEEAGLPAGVVNLVTGS 200
Query: 172 GD-AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLD 230
G GA L + P VD V+FTG T G+ + + + A N+K+ LELGGK+PNI+FADAD +
Sbjct: 201 GATVGAELAESPDVDLVSFTGGTATGRSIMR-AAAGNVKKVALELGGKNPNIVFADADFE 259
Query: 231 AAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQ 290
AV+ A G+FFN GQ C AGSR V+++I+D+FVA ERAK+ +G+ D + E GP
Sbjct: 260 TAVDQALNGVFFNAGQVCSAGSRLLVEESIHDKFVAALAERAKKIKLGNGLDADTEMGPL 319
Query: 291 IDKEQMDKILEFIESGKSQGAQLVAGGGRAGD----KGFYVQPTVFANVRDDMKIAREEI 346
+ E +K+L +I+ GK +GA+LV GG R KG++V+PT+F +V M+I +EEI
Sbjct: 320 VSAEHREKVLSYIQLGKEEGARLVCGGKRPTGDELAKGYFVEPTIFDDVDRTMRIVQEEI 379
Query: 347 FGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
FGPV + RF + EE I AN++ YGLA AV+TKD+ + N V + LRAGT+
Sbjct: 380 FGPVLTVERFDTEEEAIRLANDTPYGLAGAVWTKDIARANRVARRLRAGTV 430
>gnl|CDD|215410 PLN02766, PLN02766, coniferyl-aldehyde dehydrogenase.
Length = 501
Score = 465 bits (1198), Expect = e-162
Identities = 191/349 (54%), Positives = 257/349 (73%), Gaps = 1/349 (0%)
Query: 50 SSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVC 109
++L+T+D GK + + +VD+ + LRYYAG ADKIHG+T+ ++ YT EP+GV
Sbjct: 103 AALDTIDAGKLFALGKAVDIPAAAGLLRYYAGAADKIHGETLKMSRQLQGYTLKEPIGVV 162
Query: 110 GQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVP 169
G IIPWNFP M K+APALA G T+V+KPAEQTPL+AL L AG+P+GV+N+V
Sbjct: 163 GHIIPWNFPSTMFFMKVAPALAAGCTMVVKPAEQTPLSALFYAHLAKLAGVPDGVINVVT 222
Query: 170 GYG-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADAD 228
G+G AGAA+ H VDKV+FTGSTEVG+ + Q + SNLK+ +LELGGKSP +IF DAD
Sbjct: 223 GFGPTAGAAIASHMDVDKVSFTGSTEVGRKIMQAAATSNLKQVSLELGGKSPLLIFDDAD 282
Query: 229 LDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQG 288
+D AV+ A G+F+N G+ C A SR +VQ+ IYDEFV + E+AK VGDPFD QG
Sbjct: 283 VDMAVDLALLGIFYNKGEICVASSRVYVQEGIYDEFVKKLVEKAKDWVVGDPFDPRARQG 342
Query: 289 PQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFG 348
PQ+DK+Q +KIL +IE GK +GA L+ GG GDKG+Y++PT+F +V +DMKIA++EIFG
Sbjct: 343 PQVDKQQFEKILSYIEHGKREGATLLTGGKPCGDKGYYIEPTIFTDVTEDMKIAQDEIFG 402
Query: 349 PVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
PV L++F ++EE I++ANN+ YGLAA + TKDLD N V++ +RAGT+
Sbjct: 403 PVMSLMKFKTVEEAIKKANNTKYGLAAGIVTKDLDVANTVSRSIRAGTI 451
Score = 34.0 bits (78), Expect = 0.16
Identities = 15/32 (46%), Positives = 21/32 (65%), Gaps = 2/32 (6%)
Query: 27 KPITKPKIEHTQLFINNEFVNAQS--SLETLD 56
+ P+I+ T+LFIN EFV+A S + ET D
Sbjct: 11 SGVKVPEIKFTKLFINGEFVDAASGKTFETRD 42
>gnl|CDD|143433 cd07115, ALDH_HMSADH_HapE, Pseudomonas fluorescens 4-hydroxymuconic
semialdehyde dehydrogenase-like. 4-hydroxymuconic
semialdehyde dehydrogenase (HapE, EC=1.2.1.61) of
Pseudomonas fluorescens ACB involved in
4-hydroxyacetophenone degradation, and putative
hydroxycaproate semialdehyde dehydrogenase (ChnE) of
Brachymonas petroleovorans involved in cyclohexane
metabolism, and other similar sequences, are present in
this CD.
Length = 453
Score = 460 bits (1186), Expect = e-160
Identities = 191/348 (54%), Positives = 253/348 (72%), Gaps = 2/348 (0%)
Query: 50 SSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVC 109
+ LE+LD GKP + +DV + T RYYAG+ADKI G+ IP+ G F YT EPVGV
Sbjct: 62 ARLESLDTGKPIRAARRLDVPRAADTFRYYAGWADKIEGEVIPVRGPFLNYTVREPVGVV 121
Query: 110 GQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVP 169
G I+PWNFPL+ AWK+APALA GNT+VLKPAE TPL+AL + L+AEAG P GV+N+V
Sbjct: 122 GAIVPWNFPLMFAAWKVAPALAAGNTVVLKPAELTPLSALRIAELMAEAGFPAGVLNVVT 181
Query: 170 GYGD-AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADAD 228
G+G+ AGAALV+HP VDK+ FTGST VG+ + QG+ A NLKR +LELGGKS NI+FADAD
Sbjct: 182 GFGEVAGAALVEHPDVDKITFTGSTAVGRKIMQGA-AGNLKRVSLELGGKSANIVFADAD 240
Query: 229 LDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQG 288
LDAAV A G+F+N GQ C AGSR V ++IYDEF+ R A+ GDP D + G
Sbjct: 241 LDAAVRAAATGIFYNQGQMCTAGSRLLVHESIYDEFLERFTSLARSLRPGDPLDPKTQMG 300
Query: 289 PQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFG 348
P + + Q D++L++++ G+ +GA+L+ GG R G +GF+V+PT+FA V +M+IA+EEIFG
Sbjct: 301 PLVSQAQFDRVLDYVDVGREEGARLLTGGKRPGARGFFVEPTIFAAVPPEMRIAQEEIFG 360
Query: 349 PVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
PV ++RF EE + AN ++YGLAA V+T+DL + + V L+AGT
Sbjct: 361 PVVSVMRFRDEEEALRIANGTEYGLAAGVWTRDLGRAHRVAAALKAGT 408
>gnl|CDD|223944 COG1012, PutA, NAD-dependent aldehyde dehydrogenases [Energy
production and conversion].
Length = 472
Score = 452 bits (1166), Expect = e-157
Identities = 183/346 (52%), Positives = 240/346 (69%), Gaps = 5/346 (1%)
Query: 52 LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQ 111
L TL+ GKP + + ++ + +RYYA A ++ G+TIP + A R EP+GV G
Sbjct: 81 LITLETGKPISEAR-GEIARAADFIRYYAEEARRLEGETIPTDKGSKALVRREPLGVVGA 139
Query: 112 IIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGY 171
I PWNFPL + AWK+APALA GNT+VLKP+EQTPL+ALA+ L AEAG+P GV+N+V G
Sbjct: 140 ITPWNFPLALAAWKLAPALAAGNTVVLKPSEQTPLSALALAELAAEAGLPAGVLNVVTGG 199
Query: 172 G-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLD 230
G + G ALV HP VD ++FTGST VG+ + + A+NLK TLELGGKSP I+ DADLD
Sbjct: 200 GAEVGDALVAHPDVDAISFTGSTAVGRAIAAAA-AANLKPVTLELGGKSPAIVLEDADLD 258
Query: 231 AAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQ 290
AAV+ A FG FFN GQ C A SR V +++YDEFV R RA VGDP D + + GP
Sbjct: 259 AAVDAAVFGAFFNAGQRCTAASRLIVHESVYDEFVERLVARAASLKVGDPLDPSTDLGPL 318
Query: 291 IDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPV 350
I +EQ+D++ +IE ++GA+L+AGG R G G++V+PT+ V DM+IAREEIFGPV
Sbjct: 319 ISEEQLDRVEGYIEDAVAEGARLLAGGKRPG--GYFVEPTILEGVTPDMRIAREEIFGPV 376
Query: 351 QQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
+IRF EE IE AN+++YGLAAA+FT+DL + V + L AG
Sbjct: 377 LPVIRFKDEEEAIELANDTEYGLAAAIFTRDLARAFRVARRLEAGM 422
>gnl|CDD|143412 cd07093, ALDH_F8_HMSADH, Human aldehyde dehydrogenase family 8
member A1-like. In humans, the aldehyde dehydrogenase
family 8 member A1 (ALDH8A1) protein functions to
convert 9-cis-retinal to 9-cis-retinoic acid and has a
preference for NAD+. Also included in this CD is the
2-hydroxymuconic semialdehyde dehydrogenase (HMSADH)
which catalyzes the conversion of 2-hydroxymuconic
semialdehyde to 4-oxalocrotonate, a step in the meta
cleavage pathway of aromatic hydrocarbons in bacteria.
Such HMSADHs seen here are: XylG of the TOL plasmid pWW0
of Pseudomonas putida, TomC of Burkholderia cepacia G4,
and AphC of Comamonas testosterone.
Length = 455
Score = 442 bits (1139), Expect = e-153
Identities = 173/351 (49%), Positives = 239/351 (68%), Gaps = 6/351 (1%)
Query: 52 LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQ 111
LE+LD GKP ++ + D+ + R++A + ++ G++ P +G Y +PVGV G
Sbjct: 64 LESLDTGKPITLARTRDIPRAAANFRFFADYILQLDGESYPQDGGALNYVLRQPVGVAGL 123
Query: 112 IIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGY 171
I PWN PL++L WKIAPALA GNT+VLKP+E TPLTA + L EAG+P GVVN+V G+
Sbjct: 124 ITPWNLPLMLLTWKIAPALAFGNTVVLKPSEWTPLTAWLLAELANEAGLPPGVVNVVHGF 183
Query: 172 G-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLD 230
G +AGAALV HP VD ++FTG T G+ + + + A NLK +LELGGK+PNI+FADADLD
Sbjct: 184 GPEAGAALVAHPDVDLISFTGETATGRTIMR-AAAPNLKPVSLELGGKNPNIVFADADLD 242
Query: 231 AAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQ 290
AV+ A F N G+ C AGSR VQ +IYDEF+ R ERAK VGDP D + E GP
Sbjct: 243 RAVDAAVRSSFSNNGEVCLAGSRILVQRSIYDEFLERFVERAKALKVGDPLDPDTEVGPL 302
Query: 291 IDKEQMDKILEFIESGKSQGAQLVAGGGRAG----DKGFYVQPTVFANVRDDMKIAREEI 346
I KE ++K+L ++E +++GA ++ GGGR + G++V+PTV + +D ++A+EEI
Sbjct: 303 ISKEHLEKVLGYVELARAEGATILTGGGRPELPDLEGGYFVEPTVITGLDNDSRVAQEEI 362
Query: 347 FGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
FGPV +I F EE IE AN++ YGLAA V+T+DL + + V + L AGT+
Sbjct: 363 FGPVVTVIPFDDEEEAIELANDTPYGLAAYVWTRDLGRAHRVARRLEAGTV 413
>gnl|CDD|143427 cd07109, ALDH_AAS00426, Uncharacterized Saccharopolyspora spinosa
aldehyde dehydrogenase (AAS00426)-like. Uncharacterized
aldehyde dehydrogenase of Saccharopolyspora spinosa
(AAS00426) and other similar sequences, are present in
this CD.
Length = 454
Score = 434 bits (1118), Expect = e-150
Identities = 166/350 (47%), Positives = 224/350 (64%), Gaps = 7/350 (2%)
Query: 50 SSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVC 109
+ LE+LD GKP + + DVE + R YY G ADK+HG+TIP+ +F YT EP GV
Sbjct: 63 ARLESLDTGKPLTQARA-DVEAAARYFEYYGGAADKLHGETIPLGPGYFVYTVREPHGVT 121
Query: 110 GQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVP 169
G IIPWN+PL + +APALA GN +V+KPAE PLTAL + L EAG+P G +N+V
Sbjct: 122 GHIIPWNYPLQITGRSVAPALAAGNAVVVKPAEDAPLTALRLAELAEEAGLPAGALNVVT 181
Query: 170 GYG-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADAD 228
G G +AGAALV HPGVD ++FTGS E G V + + A N+ TLELGGKSP I+FADAD
Sbjct: 182 GLGAEAGAALVAHPGVDHISFTGSVETGIAVMR-AAAENVVPVTLELGGKSPQIVFADAD 240
Query: 229 LDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQG 288
L+AA+ + N GQ C AGSR V +IYDE + R ER + VG + + + G
Sbjct: 241 LEAALPVVVNAIIQNAGQTCSAGSRLLVHRSIYDEVLERLVERFRALRVGPGLE-DPDLG 299
Query: 289 PQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGD---KGFYVQPTVFANVRDDMKIAREE 345
P I +Q+D++ F+ +++GA++VAGG A G++V PT+ +V D ++A+EE
Sbjct: 300 PLISAKQLDRVEGFVARARARGARIVAGGRIAEGAPAGGYFVAPTLLDDVPPDSRLAQEE 359
Query: 346 IFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
IFGPV ++ F E I AN +DYGL A V+T+D D+ V + LRAG
Sbjct: 360 IFGPVLAVMPFDDEAEAIALANGTDYGLVAGVWTRDGDRALRVARRLRAG 409
>gnl|CDD|183918 PRK13252, PRK13252, betaine aldehyde dehydrogenase; Provisional.
Length = 488
Score = 435 bits (1120), Expect = e-150
Identities = 169/352 (48%), Positives = 218/352 (61%), Gaps = 5/352 (1%)
Query: 50 SSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVC 109
++LETLD GKP + VD+ L YYAG A + G+ IP+ G F YTR EP+GVC
Sbjct: 87 AALETLDTGKPIQETSVVDIVTGADVLEYYAGLAPALEGEQIPLRGGSFVYTRREPLGVC 146
Query: 110 GQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVP 169
I WN+P+ + WK APALA GN ++ KP+E TPLTAL + + EAG+P+GV N+V
Sbjct: 147 AGIGAWNYPIQIACWKSAPALAAGNAMIFKPSEVTPLTALKLAEIYTEAGLPDGVFNVVQ 206
Query: 170 GYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADL 229
G G GA L +HP + KV+FTG GK V + AS LK T+ELGGKSP I+F DADL
Sbjct: 207 GDGRVGAWLTEHPDIAKVSFTGGVPTGKKVMAAAAAS-LKEVTMELGGKSPLIVFDDADL 265
Query: 230 DAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGP 289
D A + A F++ GQ C G+R FVQ +I F AR ER +R +GDP D GP
Sbjct: 266 DRAADIAMLANFYSSGQVCTNGTRVFVQKSIKAAFEARLLERVERIRIGDPMDPATNFGP 325
Query: 290 QIDKEQMDKILEFIESGKSQGAQLVAGGGR----AGDKGFYVQPTVFANVRDDMKIAREE 345
+ DK+L +IE GK++GA+L+ GG R G +V PTVF + DDM I REE
Sbjct: 326 LVSFAHRDKVLGYIEKGKAEGARLLCGGERLTEGGFANGAFVAPTVFTDCTDDMTIVREE 385
Query: 346 IFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
IFGPV ++ F +EVI RAN+++YGLAA VFT DL + + V L AG
Sbjct: 386 IFGPVMSVLTFDDEDEVIARANDTEYGLAAGVFTADLSRAHRVIHQLEAGIC 437
>gnl|CDD|143409 cd07090, ALDH_F9_TMBADH, NAD+-dependent
4-trimethylaminobutyraldehyde dehydrogenase, ALDH family
9A1. NAD+-dependent, 4-trimethylaminobutyraldehyde
dehydrogenase (TMABADH, EC=1.2.1.47), also known as
aldehyde dehydrogenase family 9 member A1 (ALDH9A1) in
humans, is a cytosolic tetramer which catalyzes the
oxidation of gamma-aminobutyraldehyde involved in
4-aminobutyric acid (GABA) biosynthesis and also
oxidizes betaine aldehyde
(gamma-trimethylaminobutyraldehyde) which is involved in
carnitine biosynthesis.
Length = 457
Score = 425 bits (1094), Expect = e-146
Identities = 174/351 (49%), Positives = 228/351 (64%), Gaps = 7/351 (1%)
Query: 51 SLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCG 110
LET+DNGKP + VD++ S L YYAG A + G+ +P+ G FAYTR EP+GVC
Sbjct: 63 RLETIDNGKPIEEA-RVDIDSSADCLEYYAGLAPTLSGEHVPLPGGSFAYTRREPLGVCA 121
Query: 111 QIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPG 170
I WN+P+ + +WK APALA GN +V KP+ TPLTAL + ++ EAG+P+GV N+V G
Sbjct: 122 GIGAWNYPIQIASWKSAPALACGNAMVYKPSPFTPLTALLLAEILTEAGLPDGVFNVVQG 181
Query: 171 YGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLD 230
G+ G L +HP V KV+FTGS GK V + A +K TLELGGKSP IIF DADL+
Sbjct: 182 GGETGQLLCEHPDVAKVSFTGSVPTGKKVMS-AAAKGIKHVTLELGGKSPLIIFDDADLE 240
Query: 231 AAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQ 290
AV GA F + GQ C G+R FVQ +I DEF R ER K+ +GDP D + + G
Sbjct: 241 NAVNGAMMANFLSQGQVCSNGTRVFVQRSIKDEFTERLVERTKKIRIGDPLDEDTQMGAL 300
Query: 291 IDKEQMDKILEFIESGKSQGAQLVAGGGRAG-----DKGFYVQPTVFANVRDDMKIAREE 345
I +E ++K+L +IES K +GA+++ GG R + GFYV P V + DDM I REE
Sbjct: 301 ISEEHLEKVLGYIESAKQEGAKVLCGGERVVPEDGLENGFYVSPCVLTDCTDDMTIVREE 360
Query: 346 IFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
IFGPV ++ F + EEVI RAN++ YGLAA VFT+DL + + V L+AGT
Sbjct: 361 IFGPVMSILPFDTEEEVIRRANDTTYGLAAGVFTRDLQRAHRVIAQLQAGT 411
>gnl|CDD|143421 cd07103, ALDH_F5_SSADH_GabD, Mitochondrial succinate-semialdehyde
dehydrogenase and ALDH family members 5A1 and 5F1-like.
Succinate-semialdehyde dehydrogenase, mitochondrial
(SSADH, GabD, EC=1.2.1.24) catalyzes the NAD+-dependent
oxidation of succinate semialdehyde (SSA) to succinate.
This group includes the human aldehyde dehydrogenase
family 5 member A1 (ALDH5A1) which is a mitochondrial
homotetramer that converts SSA to succinate in the last
step of 4-aminobutyric acid (GABA) catabolism. This CD
also includes the Arabidopsis SSADH gene product
ALDH5F1. Mutations in this gene result in the
accumulation of H2O2, suggesting a role in plant defense
against the environmental stress of elevated reactive
oxygen species.
Length = 451
Score = 416 bits (1071), Expect = e-143
Identities = 159/344 (46%), Positives = 219/344 (63%), Gaps = 4/344 (1%)
Query: 54 TLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPI-NGDFFAYTRHEPVGVCGQI 112
TL+ GKP + +V+Y+ L ++A A +I+G+TIP +PVGV I
Sbjct: 66 TLEQGKPLAEAR-GEVDYAASFLEWFAEEARRIYGRTIPSPAPGKRILVIKQPVGVVAAI 124
Query: 113 IPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPG-Y 171
PWNFP M+ KIAPALA G T+VLKPAE+TPL+ALA+ L EAG+P GV+N+V G
Sbjct: 125 TPWNFPAAMITRKIAPALAAGCTVVLKPAEETPLSALALAELAEEAGLPAGVLNVVTGSP 184
Query: 172 GDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDA 231
+ G AL P V K++FTGST VGKL+ + A +KR +LELGG +P I+F DADLD
Sbjct: 185 AEIGEALCASPRVRKISFTGSTAVGKLLMAQA-ADTVKRVSLELGGNAPFIVFDDADLDK 243
Query: 232 AVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQI 291
AV+GA F N GQ C +R +V ++IYDEFV + ER K+ VG+ D + GP I
Sbjct: 244 AVDGAIASKFRNAGQTCVCANRIYVHESIYDEFVEKLVERVKKLKVGNGLDEGTDMGPLI 303
Query: 292 DKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQ 351
++ ++K+ +E ++GA+++ GG R G G++ +PTV +V DDM I EE FGPV
Sbjct: 304 NERAVEKVEALVEDAVAKGAKVLTGGKRLGLGGYFYEPTVLTDVTDDMLIMNEETFGPVA 363
Query: 352 QLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
+I F + +EVI RAN++ YGLAA VFT+DL + V + L AG
Sbjct: 364 PIIPFDTEDEVIARANDTPYGLAAYVFTRDLARAWRVAEALEAG 407
>gnl|CDD|200131 TIGR01804, BADH, glycine betaine aldehyde dehydrogenase. Under
osmotic stress, betaine aldehyde dehydrogenase oxidizes
glycine betaine aldehyde into the osmoprotectant glycine
betaine, via the second of two oxidation steps from
exogenously supplied choline or betaine aldehyde. This
choline-glycine betaine synthesis pathway can be found
in gram-positive and gram-negative bacteria. In
Escherichia coli, betaine aldehyde dehydrogenase (betB)
is osmotically co-induced with choline dehydrogenase
(betA) in the presence of choline. These dehydrogenases
are located in a betaine gene cluster with the upstream
choline transporter (betT) and transcriptional regulator
(betI). Similar to E.coli, betaine synthesis in
Staphylococcus xylosus is also influenced by osmotic
stress and the presence of choline with genes localized
in a functionally equivalent gene cluster. Organization
of the betaine gene cluster in Sinorhizobium meliloti
and Bacillus subtilis differs from that of E.coli by the
absence of upstream choline transporter and
transcriptional regulator homologues. Additionally,
B.subtilis co-expresses a type II alcohol dehydrogenase
with betaine aldehyde dehydrogenase instead of choline
dehydrogenase as in E.coli, St.xylosus, and Si.meliloti.
Betaine aldehyde dehydrogenase is a member of the
aldehyde dehydrogenase family (pfam00171) [Cellular
processes, Adaptations to atypical conditions].
Length = 467
Score = 416 bits (1070), Expect = e-143
Identities = 176/353 (49%), Positives = 229/353 (64%), Gaps = 6/353 (1%)
Query: 50 SSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVC 109
+ LETLD GK + D++ ++AG A ++G+ IP+ G FAYT EP+GVC
Sbjct: 78 AKLETLDTGKTLQETIVADMDSGADVFEFFAGLAPALNGEIIPLGGPSFAYTIREPLGVC 137
Query: 110 GQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVP 169
I WN+PL + +WKIAPALA GN +V KP+E TPLTAL V ++ EAG+P+GV N+V
Sbjct: 138 VGIGAWNYPLQIASWKIAPALAAGNAMVFKPSENTPLTALKVAEIMEEAGLPKGVFNVVQ 197
Query: 170 GYG-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADAD 228
G G + G LV HP V KV+FTG GK + + A +LK T+ELGGKSP I+F DAD
Sbjct: 198 GDGAEVGPLLVNHPDVAKVSFTGGVPTGKKI-MAAAAGHLKHVTMELGGKSPLIVFDDAD 256
Query: 229 LDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQG 288
L++AV+GA G FF+ GQ C G+R FV I + F+AR ER +R +GDPFD E G
Sbjct: 257 LESAVDGAMLGNFFSAGQVCSNGTRVFVHKKIKERFLARLVERTERIKLGDPFDEATEMG 316
Query: 289 PQIDKEQMDKILEFIESGKSQGAQLVAGGGR----AGDKGFYVQPTVFANVRDDMKIARE 344
P I DK+L +IE GK++GA L GGGR GF+V+PTVFA+ DDM I RE
Sbjct: 317 PLISAAHRDKVLSYIEKGKAEGATLATGGGRPENVGLQNGFFVEPTVFADCTDDMTIVRE 376
Query: 345 EIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
EIFGPV ++ FS +EVI RAN+++YGLA VFT DL + + V L AGT+
Sbjct: 377 EIFGPVMTVLTFSDEDEVIARANDTEYGLAGGVFTADLGRAHRVADQLEAGTV 429
>gnl|CDD|143471 cd07559, ALDH_ACDHII_AcoD-like, Ralstonia eutrophus NAD+-dependent
acetaldehyde dehydrogenase II and Staphylococcus aureus
AldA1 (SACOL0154)-like. Included in this CD is the
NAD+-dependent, acetaldehyde dehydrogenase II (AcDHII,
AcoD, EC=1.2.1.3) from Ralstonia (Alcaligenes) eutrophus
H16 involved in the catabolism of acetoin and ethanol,
and similar proteins, such as, the dimeric
dihydrolipoamide dehydrogenase of the acetoin
dehydrogenase enzyme system of Klebsiella pneumonia.
Also included are sequences similar to the
NAD+-dependent chloroacetaldehyde dehydrogenases (AldA
and AldB) of Xanthobacter autotrophicus GJ10 which are
involved in the degradation of 1,2-dichloroethane, as
well as, the uncharacterized aldehyde dehydrogenase from
Staphylococcus aureus (AldA1, locus SACOL0154) and other
similar sequences.
Length = 480
Score = 413 bits (1064), Expect = e-142
Identities = 167/354 (47%), Positives = 227/354 (64%), Gaps = 12/354 (3%)
Query: 52 LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQ 111
ETLDNGKP + + D+ ++ RY+AG G I+ D +Y HEP+GV GQ
Sbjct: 83 AETLDNGKPIRETLAADIPLAIDHFRYFAGVIRAQEGSLSEIDEDTLSYHFHEPLGVVGQ 142
Query: 112 IIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGY 171
IIPWNFPLLM AWK+APALA GNT+VLKPA QTPL+ L + LI + +P+GVVN+V G+
Sbjct: 143 IIPWNFPLLMAAWKLAPALAAGNTVVLKPASQTPLSILVLMELIGDL-LPKGVVNVVTGF 201
Query: 172 G-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADA--- 227
G +AG L HP + K+AFTGST VG+L+ Q A NL TLELGGKSPNI F DA
Sbjct: 202 GSEAGKPLASHPRIAKLAFTGSTTVGRLIMQ-YAAENLIPVTLELGGKSPNIFFDDAMDA 260
Query: 228 --DLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNV 285
D D E G FN G+ C SR VQ++IYDEF+ R+ ER + VG+P D
Sbjct: 261 DDDFDDKAEEGQLGFAFNQGEVCTCPSRALVQESIYDEFIERAVERFEAIKVGNPLDPET 320
Query: 286 EQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRA----GDKGFYVQPTVFANVRDDMKI 341
G Q+ K+Q++KIL +++ GK +GA+++ GG R DKG++ +PT+ +DM+I
Sbjct: 321 MMGAQVSKDQLEKILSYVDIGKEEGAEVLTGGERLTLGGLDKGYFYEPTLIKGGNNDMRI 380
Query: 342 AREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
+EEIFGPV +I F EE I AN+++YGL V+T+D+++ V +G++ G
Sbjct: 381 FQEEIFGPVLAVITFKDEEEAIAIANDTEYGLGGGVWTRDINRALRVARGIQTG 434
>gnl|CDD|143458 cd07140, ALDH_F1L_FTFDH, 10-formyltetrahydrofolate dehydrogenase,
ALDH family 1L. 10-formyltetrahydrofolate dehydrogenase
(FTHFDH, EC=1.5.1.6), also known as aldehyde
dehydrogenase family 1 member L1 (ALDH1L1) in humans, is
a multi-domain homotetramer with an N-terminal formyl
transferase domain and a C-terminal ALDH domain. FTHFDH
catalyzes an NADP+-dependent dehydrogenase reaction
resulting in the conversion of 10-formyltetrahydrofolate
to tetrahydrofolate and CO2. The ALDH domain is also
capable of the oxidation of short chain aldehydes to
their corresponding acids.
Length = 486
Score = 408 bits (1050), Expect = e-139
Identities = 175/355 (49%), Positives = 240/355 (67%), Gaps = 7/355 (1%)
Query: 50 SSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPIN----GDFFAYTRHEP 105
+++E+LD+G Y ++ V S++T RY+AG+ DKI GKTIPIN T+ EP
Sbjct: 88 ATIESLDSGAVYTLALKTHVGMSIQTFRYFAGWCDKIQGKTIPINQARPNRNLTLTKREP 147
Query: 106 VGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVV 165
+GVCG +IPWN+PL+MLAWK+A LA GNT+VLKPA+ TPLTAL L +AG P+GV+
Sbjct: 148 IGVCGIVIPWNYPLMMLAWKMAACLAAGNTVVLKPAQVTPLTALKFAELTVKAGFPKGVI 207
Query: 166 NIVPGYG-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIF 224
NI+PG G G L HP V K+ FTGST +GK + + SNLK+ +LELGGKSP IIF
Sbjct: 208 NILPGSGSLVGQRLSDHPDVRKLGFTGSTPIGKHIMKSCAVSNLKKVSLELGGKSPLIIF 267
Query: 225 ADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLN 284
AD D+D AV +FFN G+ C A R FV+++I+DEFV R E K+ +GDP D +
Sbjct: 268 ADCDMDKAVRMGMSSVFFNKGENCIAAGRLFVEESIHDEFVRRVVEEVKKMKIGDPLDRS 327
Query: 285 VEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIARE 344
+ GPQ K +DK++E+ E G +GA LV GG + GF+ +PTVF +V D M IA+E
Sbjct: 328 TDHGPQNHKAHLDKLVEYCERGVKEGATLVYGGKQVDRPGFFFEPTVFTDVEDHMFIAKE 387
Query: 345 EIFGPVQQLIRF--SSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
E FGP+ + +F ++ V++RAN+++YGLA+ VFTKD++K YV+ L AGT+
Sbjct: 388 ESFGPIMIISKFDDGDVDGVLQRANDTEYGLASGVFTKDINKALYVSDKLEAGTV 442
>gnl|CDD|143435 cd07117, ALDH_StaphAldA1, Uncharacterized Staphylococcus aureus
AldA1 (SACOL0154) aldehyde dehydrogenase-like.
Uncharacterized aldehyde dehydrogenase from
Staphylococcus aureus (AldA1, locus SACOL0154) and other
similar sequences are present in this CD.
Length = 475
Score = 406 bits (1044), Expect = e-139
Identities = 173/349 (49%), Positives = 239/349 (68%), Gaps = 7/349 (2%)
Query: 52 LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQ 111
+ETLDNGKP + +VD+ + RY+AG G I+ D + EP+GV GQ
Sbjct: 83 VETLDNGKPIRETRAVDIPLAADHFRYFAGVIRAEEGSANMIDEDTLSIVLREPIGVVGQ 142
Query: 112 IIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGY 171
IIPWNFP LM AWK+APALA GNT+V+KP+ T L+ L + +I + +P+GVVNIV G
Sbjct: 143 IIPWNFPFLMAAWKLAPALAAGNTVVIKPSSTTSLSLLELAKIIQDV-LPKGVVNIVTGK 201
Query: 172 G-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLD 230
G +G L+ HPG+DK+AFTGSTEVG+ V + A L TLELGGKS NIIF DA+ D
Sbjct: 202 GSKSGEYLLNHPGLDKLAFTGSTEVGRDVAI-AAAKKLIPATLELGGKSANIIFDDANWD 260
Query: 231 AAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQ 290
A+EGA G+ FN GQ CCAGSR FVQ+ IYDEFVA+ E+ + VG+P D + + G Q
Sbjct: 261 KALEGAQLGILFNQGQVCCAGSRIFVQEGIYDEFVAKLKEKFENVKVGNPLDPDTQMGAQ 320
Query: 291 IDKEQMDKILEFIESGKSQGAQLVAGGGR----AGDKGFYVQPTVFANVRDDMKIAREEI 346
++K+Q+DKIL +++ K +GA+++ GG R DKGF+++PT+ NV +DM++A+EEI
Sbjct: 321 VNKDQLDKILSYVDIAKEEGAKILTGGHRLTENGLDKGFFIEPTLIVNVTNDMRVAQEEI 380
Query: 347 FGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
FGPV +I+F + +EVI+ AN+S+YGL VFTKD+++ V + + G
Sbjct: 381 FGPVATVIKFKTEDEVIDMANDSEYGLGGGVFTKDINRALRVARAVETG 429
>gnl|CDD|143436 cd07118, ALDH_SNDH, Gluconobacter oxydans L-sorbosone
dehydrogenase-like. Included in this CD is the
L-sorbosone dehydrogenase (SNDH) from Gluconobacter
oxydans UV10. In G. oxydans, D-sorbitol is converted to
2-keto-L-gulonate (a precursor of L-ascorbic acid) in
sequential oxidation steps catalyzed by a FAD-dependent,
L-sorbose dehydrogenase and an NAD(P)+-dependent,
L-sorbosone dehydrogenase.
Length = 454
Score = 396 bits (1020), Expect = e-135
Identities = 161/348 (46%), Positives = 230/348 (66%), Gaps = 5/348 (1%)
Query: 52 LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTI-PINGDFFAYTRHEPVGVCG 110
+ETL++GKP + + ++E + RY A A +HG + + D EP+GV G
Sbjct: 66 IETLESGKPISQARG-EIEGAADLWRYAASLARTLHGDSYNNLGDDMLGLVLREPIGVVG 124
Query: 111 QIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPG 170
I PWNFP L+L+ K+ ALA G T+V+KP+E T T L + L+ EAG+P GVVNIV G
Sbjct: 125 IITPWNFPFLILSQKLPFALAAGCTVVVKPSEFTSGTTLMLAELLIEAGLPAGVVNIVTG 184
Query: 171 YGD-AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADL 229
YG G A+ +HP VD V+FTGST VGK + + A NLK+ +LELGGK+P I+FADADL
Sbjct: 185 YGATVGQAMTEHPDVDMVSFTGSTRVGKAIA-AAAARNLKKVSLELGGKNPQIVFADADL 243
Query: 230 DAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGP 289
DAA + FG++FN G+CC +GSR V ++I D FVA R+++ VGDP D + G
Sbjct: 244 DAAADAVVFGVYFNAGECCNSGSRLLVHESIADAFVAAVVARSRKVRVGDPLDPETKVGA 303
Query: 290 QIDKEQMDKILEFIESGKSQGAQLVAGGGR-AGDKGFYVQPTVFANVRDDMKIAREEIFG 348
I++ Q+ KI +++++G+++GA L+ GG R A G + QPT+F +V DM IAREEIFG
Sbjct: 304 IINEAQLAKITDYVDAGRAEGATLLLGGERLASAAGLFYQPTIFTDVTPDMAIAREEIFG 363
Query: 349 PVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
PV ++ F +++E I AN++ YGL+A V++KD+D V + +RAGT
Sbjct: 364 PVLSVLTFDTVDEAIALANDTVYGLSAGVWSKDIDTALTVARRIRAGT 411
>gnl|CDD|143428 cd07110, ALDH_F10_BADH, Arabidopsis betaine aldehyde dehydrogenase
1 and 2, ALDH family 10A8 and 10A9-like. Present in
this CD are the Arabidopsis betaine aldehyde
dehydrogenase (BADH) 1 (chloroplast) and 2
(mitochondria), also known as, aldehyde dehydrogenase
family 10 member A8 and aldehyde dehydrogenase family 10
member A9, respectively, and are putative dehydration-
and salt-inducible BADHs (EC 1.2.1.8) that catalyze the
oxidation of betaine aldehyde to the compatible solute
glycine betaine.
Length = 456
Score = 395 bits (1016), Expect = e-135
Identities = 174/351 (49%), Positives = 232/351 (66%), Gaps = 9/351 (2%)
Query: 52 LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGK---TIPI-NGDFFAYTRHEPVG 107
LE DNGKP + + + DV+ YYA A+++ K +P+ + DF A R EPVG
Sbjct: 64 LEARDNGKPLDEA-AWDVDDVAGCFEYYADLAEQLDAKAERAVPLPSEDFKARVRREPVG 122
Query: 108 VCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNI 167
V G I PWNFPLLM AWK+APALA G T+VLKP+E T LT L + + AEAG+P GV+N+
Sbjct: 123 VVGLITPWNFPLLMAAWKVAPALAAGCTVVLKPSELTSLTELELAEIAAEAGLPPGVLNV 182
Query: 168 VPGYGD-AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFAD 226
V G GD AGA L HPG+DK++FTGST G V Q + A ++K +LELGGKSP I+F D
Sbjct: 183 VTGTGDEAGAPLAAHPGIDKISFTGSTATGSQVMQ-AAAQDIKPVSLELGGKSPIIVFDD 241
Query: 227 ADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVE 286
ADL+ AVE A FG F+N GQ C A SR V ++I D F+ R A+ VGDP + V
Sbjct: 242 ADLEKAVEWAMFGCFWNNGQICSATSRLLVHESIADAFLERLATAAEAIRVGDPLEEGVR 301
Query: 287 QGPQIDKEQMDKILEFIESGKSQGAQLVAGGGR--AGDKGFYVQPTVFANVRDDMKIARE 344
GP + + Q +K+L FI GK +GA+L+ GG R +KG+++ PTVFA+V D +I RE
Sbjct: 302 LGPLVSQAQYEKVLSFIARGKEEGARLLCGGRRPAHLEKGYFIAPTVFADVPTDSRIWRE 361
Query: 345 EIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
EIFGPV + F++ +E I AN+S+YGLAAAV ++D ++ + V + L AG
Sbjct: 362 EIFGPVLCVRSFATEDEAIALANDSEYGLAAAVISRDAERCDRVAEALEAG 412
>gnl|CDD|237391 PRK13473, PRK13473, gamma-aminobutyraldehyde dehydrogenase;
Provisional.
Length = 475
Score = 388 bits (999), Expect = e-132
Identities = 149/349 (42%), Positives = 211/349 (60%), Gaps = 5/349 (1%)
Query: 51 SLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGK-TIPINGDFFAYTRHEPVGVC 109
LE+L+ GKP +++ + ++ + R++AG A + GK + R +PVGV
Sbjct: 83 RLESLNCGKPLHLALNDEIPAIVDVFRFFAGAARCLEGKAAGEYLEGHTSMIRRDPVGVV 142
Query: 110 GQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVP 169
I PWN+PL+M AWK+APALA GNT+VLKP+E TPLTAL + L A+ +P GV+N+V
Sbjct: 143 ASIAPWNYPLMMAAWKLAPALAAGNTVVLKPSEITPLTALKLAELAADI-LPPGVLNVVT 201
Query: 170 GYG-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADAD 228
G G G ALV HP V V+ TGS GK V + A ++KRT LELGGK+P I+F DAD
Sbjct: 202 GRGATVGDALVGHPKVRMVSLTGSIATGKHVLS-AAADSVKRTHLELGGKAPVIVFDDAD 260
Query: 229 LDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQG 288
LDA VEG ++N GQ C A R + Q IYD+ VA+ VGDP D + E G
Sbjct: 261 LDAVVEGIRTFGYYNAGQDCTAACRIYAQRGIYDDLVAKLAAAVATLKVGDPDDEDTELG 320
Query: 289 PQIDKEQMDKILEFIESGKSQG-AQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIF 347
P I D++ F+E K+ G ++V GG KG+Y +PT+ A R D +I + E+F
Sbjct: 321 PLISAAHRDRVAGFVERAKALGHIRVVTGGEAPDGKGYYYEPTLLAGARQDDEIVQREVF 380
Query: 348 GPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
GPV + F ++ + AN+SDYGLA++V+T+D+ + + V+ L+ G
Sbjct: 381 GPVVSVTPFDDEDQAVRWANDSDYGLASSVWTRDVGRAHRVSARLQYGC 429
>gnl|CDD|143429 cd07111, ALDH_F16, Aldehyde dehydrogenase family 16A1-like.
Uncharacterized aldehyde dehydrogenase family 16 member
A1 (ALDH16A1) and other related sequences are present in
this CD. The active site cysteine and glutamate residues
are not conserved in the human ALDH16A1 protein
sequence.
Length = 480
Score = 385 bits (992), Expect = e-131
Identities = 167/345 (48%), Positives = 223/345 (64%), Gaps = 11/345 (3%)
Query: 52 LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQ 111
LE+LDNGKP S D+ R ++AG+A + +PVGV GQ
Sbjct: 104 LESLDNGKPIRESRDCDIPLVARHFYHHAGWAQLLD----------TELAGWKPVGVVGQ 153
Query: 112 IIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGY 171
I+PWNFPLLMLAWKI PALA GNT+VLKPAE TPLTAL + AEAG+P GV+NIV G
Sbjct: 154 IVPWNFPLLMLAWKICPALAMGNTVVLKPAEYTPLTALLFAEICAEAGLPPGVLNIVTGN 213
Query: 172 GDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDA 231
G G+AL HPGVDKVAFTGSTEVG+ +++ + A K+ +LELGGKSP I+F DADLD+
Sbjct: 214 GSFGSALANHPGVDKVAFTGSTEVGRALRR-ATAGTGKKLSLELGGKSPFIVFDDADLDS 272
Query: 232 AVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQI 291
AVEG ++FN GQ CCAGSR VQ+++ +E + + ER VGDP D ++ G +
Sbjct: 273 AVEGIVDAIWFNQGQVCCAGSRLLVQESVAEELIRKLKERMSHLRVGDPLDKAIDMGAIV 332
Query: 292 DKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQ 351
D Q+ +I E +E G+++GA + G KG + PT+F NV +IA+EEIFGPV
Sbjct: 333 DPAQLKRIRELVEEGRAEGADVFQPGADLPSKGPFYPPTLFTNVPPASRIAQEEIFGPVL 392
Query: 352 QLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
++ F + +E + ANN+ YGLAA+V++++L V L+AG
Sbjct: 393 VVLTFRTAKEAVALANNTPYGLAASVWSENLSLALEVALSLKAGV 437
>gnl|CDD|143411 cd07092, ALDH_ABALDH-YdcW, Escherichia coli NAD+-dependent
gamma-aminobutyraldehyde dehydrogenase YdcW-like.
NAD+-dependent, tetrameric, gamma-aminobutyraldehyde
dehydrogenase (ABALDH), YdcW of Escherichia coli K12,
catalyzes the oxidation of gamma-aminobutyraldehyde to
gamma-aminobutyric acid. ABALDH can also oxidize n-alkyl
medium-chain aldehydes, but with a lower catalytic
efficiency.
Length = 450
Score = 382 bits (983), Expect = e-130
Identities = 156/350 (44%), Positives = 213/350 (60%), Gaps = 11/350 (3%)
Query: 52 LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFA----YTRHEPVG 107
LE+ + GKP ++ ++ ++ R++AG A + G P G++ R EP+G
Sbjct: 64 LESRNTGKPLHLVRDDELPGAVDNFRFFAGAARTLEG---PAAGEYLPGHTSMIRREPIG 120
Query: 108 VCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNI 167
V QI PWN+PL+M AWKIAPALA GNT+VLKP+E TPLT L + L AE +P GVVN+
Sbjct: 121 VVAQIAPWNYPLMMAAWKIAPALAAGNTVVLKPSETTPLTTLLLAELAAE-VLPPGVVNV 179
Query: 168 VPGYGD-AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFAD 226
V G G AG ALV HP V V+ TGS GK V + + A LKR LELGGK+P I+F D
Sbjct: 180 VCGGGASAGDALVAHPRVRMVSLTGSVRTGKKVAR-AAADTLKRVHLELGGKAPVIVFDD 238
Query: 227 ADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVE 286
ADLDAAV G ++N GQ C A R +V +++YDEFVA E VGDP D + E
Sbjct: 239 ADLDAAVAGIATAGYYNAGQDCTAACRVYVHESVYDEFVAALVEAVSAIRVGDPDDEDTE 298
Query: 287 QGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEI 346
GP Q +++ F+E A+++ GG RA G++ +PTV A V D +I +EEI
Sbjct: 299 MGPLNSAAQRERVAGFVE-RAPAHARVLTGGRRAEGPGYFYEPTVVAGVAQDDEIVQEEI 357
Query: 347 FGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
FGPV + F +E IE AN+ +YGLA++V+T+D+ + ++ L GT
Sbjct: 358 FGPVVTVQPFDDEDEAIELANDVEYGLASSVWTRDVGRAMRLSARLDFGT 407
>gnl|CDD|143408 cd07089, ALDH_CddD-AldA-like, Rhodococcus ruber 6-oxolauric acid
dehydrogenase-like and related proteins. The
6-oxolauric acid dehydrogenase (CddD) from Rhodococcus
ruber SC1 which converts 6-oxolauric acid to
dodecanedioic acid; and the aldehyde dehydrogenase
(locus SSP0762) from Staphylococcus saprophyticus subsp.
saprophyticus ATCC 15305 and also, the Mycobacterium
tuberculosis H37Rv ALDH AldA (locus Rv0768) sequence;
and other similar sequences, are included in this CD.
Length = 459
Score = 379 bits (976), Expect = e-129
Identities = 151/329 (45%), Positives = 197/329 (59%), Gaps = 9/329 (2%)
Query: 76 LRYYAGFADKIHGK-TIPINGDFFAYT----RHEPVGVCGQIIPWNFPLLMLAWKIAPAL 130
LRY+A AD + +P+ R EPVGV I PWNFP + K+APAL
Sbjct: 89 LRYFADLADSFPWEFDLPVPALRGGPGRRVVRREPVGVVAAITPWNFPFFLNLAKLAPAL 148
Query: 131 ATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYGDA-GAALVQHPGVDKVAF 189
A GNT+VLKPA TPL+AL +G +IAE +P GVVN+V G +A G AL P VD V+F
Sbjct: 149 AAGNTVVLKPAPDTPLSALLLGEIIAETDLPAGVVNVVTGSDNAVGEALTTDPRVDMVSF 208
Query: 190 TGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCC 249
TGST VG+ + A+ LKR LELGGKS NI+ DADL AA A N GQ C
Sbjct: 209 TGSTAVGRRIMA-QAAATLKRVLLELGGKSANIVLDDADLAAAAPAAVGVCMHNAGQGCA 267
Query: 250 AGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQ 309
+R V + YDE V + VGDP D GP I Q D++ +I G+ +
Sbjct: 268 LTTRLLVPRSRYDEVVEALAAAFEALPVGDPADPGTVMGPLISAAQRDRVEGYIARGRDE 327
Query: 310 GAQLVAGGGRA--GDKGFYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERAN 367
GA+LV GGGR DKGFYV+PT+FA+V +DM+IA+EEIFGPV +I + +E + AN
Sbjct: 328 GARLVTGGGRPAGLDKGFYVEPTLFADVDNDMRIAQEEIFGPVLVVIPYDDDDEAVRIAN 387
Query: 368 NSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
+SDYGL+ V++ D+D+ V + +R G+
Sbjct: 388 DSDYGLSGGVWSADVDRAYRVARRIRTGS 416
>gnl|CDD|143457 cd07139, ALDH_AldA-Rv0768, Mycobacterium tuberculosis aldehyde
dehydrogenase AldA-like. The Mycobacterium
tuberculosis NAD+-dependent, aldehyde dehydrogenase PDB
structure, 3B4W, and the Mycobacterium tuberculosis
H37Rv aldehyde dehydrogenase AldA (locus Rv0768)
sequence, as well as the Rhodococcus rhodochrous ALDH
involved in haloalkane catabolism, and other similar
sequences, are included in this CD.
Length = 471
Score = 379 bits (975), Expect = e-128
Identities = 154/348 (44%), Positives = 205/348 (58%), Gaps = 8/348 (2%)
Query: 54 TLDNGKPYNMSYSVDVEYSLRTLRYYAGFADK---IHGKTIPINGDFFAYTRHEPVGVCG 110
T +NG P + S LRYYA A + G R EPVGV
Sbjct: 85 TAENGMPISWSRRAQGPGPAALLRYYAALARDFPFEERRPGSGGGH--VLVRREPVGVVA 142
Query: 111 QIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPG 170
I+PWN PL + A KIAPALA G T+VLKP+ +TPL A + EAG+P GVVN+VP
Sbjct: 143 AIVPWNAPLFLAALKIAPALAAGCTVVLKPSPETPLDAYLLAEAAEEAGLPPGVVNVVPA 202
Query: 171 YGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLD 230
+ G LV+HPGVDKV+FTGST G+ + G L R TLELGGKS I+ DADLD
Sbjct: 203 DREVGEYLVRHPGVDKVSFTGSTAAGRRIAAVCG-ERLARVTLELGGKSAAIVLDDADLD 261
Query: 231 AAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQ 290
AAV G N GQ C A +R V + YDE V VGDP D + GP
Sbjct: 262 AAVPGLVPASLMNNGQVCVALTRILVPRSRYDEVVEALAAAVAALKVGDPLDPATQIGPL 321
Query: 291 IDKEQMDKILEFIESGKSQGAQLVAGGGR-AG-DKGFYVQPTVFANVRDDMKIAREEIFG 348
Q +++ +I G+++GA+LV GGGR AG D+G++V+PT+FA+V +DM+IA+EEIFG
Sbjct: 322 ASARQRERVEGYIAKGRAEGARLVTGGGRPAGLDRGWFVEPTLFADVDNDMRIAQEEIFG 381
Query: 349 PVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
PV +I + ++ + AN+SDYGL+ +V+T D+++ V + +R GT
Sbjct: 382 PVLSVIPYDDEDDAVRIANDSDYGLSGSVWTADVERGLAVARRIRTGT 429
>gnl|CDD|143456 cd07138, ALDH_CddD_SSP0762, Rhodococcus ruber 6-oxolauric acid
dehydrogenase-like. The 6-oxolauric acid dehydrogenase
(CddD) from Rhodococcus ruber SC1 which converts
6-oxolauric acid to dodecanedioic acid, and the aldehyde
dehydrogenase (locus SSP0762) from Staphylococcus
saprophyticus subsp. saprophyticus ATCC 15305 and other
similar sequences, are included in this CD.
Length = 466
Score = 378 bits (974), Expect = e-128
Identities = 150/353 (42%), Positives = 202/353 (57%), Gaps = 21/353 (5%)
Query: 54 TLDNGKPYNMSYSVDVEYSLRTLRYYAG------FADKIHGKTIPINGDFFAYTRHEPVG 107
TL+ G P ++ + V + LR A F ++ + EP+G
Sbjct: 83 TLEMGAPITLARAAQVGLGIGHLRAAADALKDFEFEERRGNSLV----------VREPIG 132
Query: 108 VCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNI 167
VCG I PWN+PL + K+APALA G T+VLKP+E PL+A+ + ++ EAG+P GV N+
Sbjct: 133 VCGLITPWNWPLNQIVLKVAPALAAGCTVVLKPSEVAPLSAIILAEILDEAGLPAGVFNL 192
Query: 168 VPGYG-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFAD 226
V G G G AL HP VD V+FTGST GK V + + A +KR LELGGKS NII D
Sbjct: 193 VNGDGPVVGEALSAHPDVDMVSFTGSTRAGKRVAE-AAADTVKRVALELGGKSANIILDD 251
Query: 227 ADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVE 286
ADL+ AV F N GQ C A +R V + Y E + A+ VGDP D
Sbjct: 252 ADLEKAVPRGVAACFANSGQSCNAPTRMLVPRSRYAEAEEIAAAAAEAYVVGDPRDPATT 311
Query: 287 QGPQIDKEQMDKILEFIESGKSQGAQLVAGG-GR-AG-DKGFYVQPTVFANVRDDMKIAR 343
GP Q D++ +I+ G +GA+LVAGG GR G ++G++V+PTVFA+V DM IAR
Sbjct: 312 LGPLASAAQFDRVQGYIQKGIEEGARLVAGGPGRPEGLERGYFVKPTVFADVTPDMTIAR 371
Query: 344 EEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
EEIFGPV +I + +E I AN++ YGLA V++ D ++ V + LRAG
Sbjct: 372 EEIFGPVLSIIPYDDEDEAIAIANDTPYGLAGYVWSADPERARAVARRLRAGQ 424
>gnl|CDD|143415 cd07097, ALDH_KGSADH-YcbD, Bacillus subtilis NADP+-dependent
alpha-ketoglutaric semialdehyde dehydrogenase ycbD-like.
Kinetic studies of the Bacillus subtilis ALDH-like ycbD
protein, which is involved in d-glucarate/d-galactarate
utilization, reveal that it is a NADP+-dependent,
alpha-ketoglutaric semialdehyde dehydrogenase (KGSADH).
KGSADHs (EC 1.2.1.26) catalyze the NAD(P)+-dependent
conversion of KGSA to alpha-ketoglutarate.
Interestingly, the NADP+-dependent, tetrameric,
2,5-dioxopentanoate dehydrogenase (EC=1.2.1.26), an
enzyme involved in the catabolic pathway for D-arabinose
in Sulfolobus solfataricus, also clusters in this group.
This CD shows a distant phylogenetic relationship to the
Azospirillum brasilense KGSADH-II (-III) group.
Length = 473
Score = 378 bits (972), Expect = e-128
Identities = 157/347 (45%), Positives = 217/347 (62%), Gaps = 6/347 (1%)
Query: 54 TLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIP-INGDFFAYTRHEPVGVCGQI 112
T + GK + +V + + RYYAG A ++ G+T+P T EP+GV G I
Sbjct: 84 TREEGKTLPEARG-EVTRAGQIFRYYAGEALRLSGETLPSTRPGVEVETTREPLGVVGLI 142
Query: 113 IPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYG 172
PWNFP+ + AWKIAPALA GNT+V KPAE TP +A A+ ++ EAG+P GV N+V G G
Sbjct: 143 TPWNFPIAIPAWKIAPALAYGNTVVFKPAELTPASAWALVEILEEAGLPAGVFNLVMGSG 202
Query: 173 -DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDA 231
+ G ALV+HP VD V+FTGST VG+ + + A+ R LE+GGK+P ++ DADLD
Sbjct: 203 SEVGQALVEHPDVDAVSFTGSTAVGRRIAA-AAAARGARVQLEMGGKNPLVVLDDADLDL 261
Query: 232 AVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQI 291
AVE A G FF+ GQ C A SR V + I+D FV ER K VGD D V+ GP +
Sbjct: 262 AVECAVQGAFFSTGQRCTASSRLIVTEGIHDRFVEALVERTKALKVGDALDEGVDIGPVV 321
Query: 292 DKEQMDKILEFIESGKSQGAQLVAGGGR--AGDKGFYVQPTVFANVRDDMKIAREEIFGP 349
+ Q++K L +IE +S+GA+LV GG R D+G+Y+ P +FA V +DM+IAREEIFGP
Sbjct: 322 SERQLEKDLRYIEIARSEGAKLVYGGERLKRPDEGYYLAPALFAGVTNDMRIAREEIFGP 381
Query: 350 VQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
V +IR +E + AN++++GL+A + T L + + + AG
Sbjct: 382 VAAVIRVRDYDEALAIANDTEFGLSAGIVTTSLKHATHFKRRVEAGV 428
>gnl|CDD|143395 cd06534, ALDH-SF, NAD(P)+-dependent aldehyde dehydrogenase
superfamily. The aldehyde dehydrogenase superfamily
(ALDH-SF) of NAD(P)+-dependent enzymes, in general,
oxidize a wide range of endogenous and exogenous
aliphatic and aromatic aldehydes to their corresponding
carboxylic acids and play an important role in
detoxification. Besides aldehyde detoxification, many
ALDH isozymes possess multiple additional catalytic and
non-catalytic functions such as participating in
metabolic pathways, or as binding proteins, or
osmoregulants, to mention a few. The enzyme has three
domains, a NAD(P)+ cofactor-binding domain, a catalytic
domain, and a bridging domain; and the active enzyme is
generally either homodimeric or homotetrameric. The
catalytic mechanism is proposed to involve cofactor
binding, resulting in a conformational change and
activation of an invariant catalytic cysteine
nucleophile. The cysteine and aldehyde substrate form an
oxyanion thiohemiacetal intermediate resulting in
hydride transfer to the cofactor and formation of a
thioacylenzyme intermediate. Hydrolysis of the
thioacylenzyme and release of the carboxylic acid
product occurs, and in most cases, the reduced cofactor
dissociates from the enzyme. The evolutionary
phylogenetic tree of ALDHs appears to have an initial
bifurcation between what has been characterized as the
classical aldehyde dehydrogenases, the ALDH family
(ALDH) and extended family members or aldehyde
dehydrogenase-like (ALDH-L) proteins. The ALDH proteins
are represented by enzymes which share a number of
highly conserved residues necessary for catalysis and
cofactor binding and they include such proteins as
retinal dehydrogenase, 10-formyltetrahydrofolate
dehydrogenase, non-phosphorylating glyceraldehyde
3-phosphate dehydrogenase,
delta(1)-pyrroline-5-carboxylate dehydrogenases,
alpha-ketoglutaric semialdehyde dehydrogenase,
alpha-aminoadipic semialdehyde dehydrogenase, coniferyl
aldehyde dehydrogenase and succinate-semialdehyde
dehydrogenase. Included in this larger group are all
human, Arabidopsis, Tortula, fungal, protozoan, and
Drosophila ALDHs identified in families ALDH1 through
ALDH22 with the exception of families ALDH18, ALDH19,
and ALDH20 which are present in the ALDH-like group. The
ALDH-like group is represented by such proteins as
gamma-glutamyl phosphate reductase, LuxC-like acyl-CoA
reductase, and coenzyme A acylating aldehyde
dehydrogenase. All of these proteins have a conserved
cysteine that aligns with the catalytic cysteine of the
ALDH group.
Length = 367
Score = 366 bits (942), Expect = e-125
Identities = 165/348 (47%), Positives = 205/348 (58%), Gaps = 64/348 (18%)
Query: 51 SLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIP-INGDFFAYTRHEPVGVC 109
+LETL+ GKP + +V ++ T RY AG ADK+ G +P + AY R EP+GV
Sbjct: 38 ALETLETGKPIEEALG-EVARAIDTFRYAAGLADKLGGPELPSPDPGGEAYVRREPLGVV 96
Query: 110 GQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVP 169
G I PWNFPLL+ AWK+APALA GNT+VLKP+E TPLTALA+ L+ EAG+P GVVN+VP
Sbjct: 97 GVITPWNFPLLLAAWKLAPALAAGNTVVLKPSELTPLTALALAELLQEAGLPPGVVNVVP 156
Query: 170 GYGD-AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADAD 228
G GD GAAL+ HP VDK++FTGST VGK + + + A NLK TLELGGKSP I+ DAD
Sbjct: 157 GGGDEVGAALLSHPRVDKISFTGSTAVGKAIMK-AAAENLKPVTLELGGKSPVIVDEDAD 215
Query: 229 LDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQG 288
LDAAVEGA FG FFN GQ C A SR
Sbjct: 216 LDAAVEGAVFGAFFNAGQICTAASRLL--------------------------------- 242
Query: 289 PQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFG 348
+ + D+ +E +LV TV +V DM IA+EEIFG
Sbjct: 243 --VHESIYDEFVE----------KLV---------------TVLVDVDPDMPIAQEEIFG 275
Query: 349 PVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
PV +IRF EE I AN+++YGL A VFT+DL++ V + LRAGT
Sbjct: 276 PVLPVIRFKDEEEAIALANDTEYGLTAGVFTRDLNRALRVAERLRAGT 323
>gnl|CDD|143424 cd07106, ALDH_AldA-AAD23400, Streptomyces aureofaciens putative
aldehyde dehydrogenase AldA (AAD23400)-like. Putative
aldehyde dehydrogenase, AldA, from Streptomyces
aureofaciens (locus AAD23400) and other similar
sequences are present in this CD.
Length = 446
Score = 367 bits (944), Expect = e-124
Identities = 145/343 (42%), Positives = 200/343 (58%), Gaps = 5/343 (1%)
Query: 54 TLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQII 113
TL+ GKP + +V ++ LRY A + I + R +P+GV I+
Sbjct: 66 TLEQGKPLAEAQ-FEVGGAVAWLRYTASLDLPD--EVIEDDDTRRVELRRKPLGVVAAIV 122
Query: 114 PWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYGD 173
PWNFPLL+ AWKIAPAL GNT+VLKP+ TPL L +G L E +P GV+N+V G +
Sbjct: 123 PWNFPLLLAAWKIAPALLAGNTVVLKPSPFTPLCTLKLGELAQEV-LPPGVLNVVSGGDE 181
Query: 174 AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAAV 233
G AL HP + K++FTGST GK V S A LKR TLELGG I+ D D+DA
Sbjct: 182 LGPALTSHPDIRKISFTGSTATGKKVMA-SAAKTLKRVTLELGGNDAAIVLPDVDIDAVA 240
Query: 234 EGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDK 293
+G F N GQ C A R +V ++IYDEF AK VGD D GP +K
Sbjct: 241 PKLFWGAFINSGQVCAAIKRLYVHESIYDEFCEALVALAKAAVVGDGLDPGTTLGPVQNK 300
Query: 294 EQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQQL 353
Q DK+ E +E K++GA+++AGG G+++ PT+ + + +I EE FGPV +
Sbjct: 301 MQYDKVKELVEDAKAKGAKVLAGGEPLDGPGYFIPPTIVDDPPEGSRIVDEEQFGPVLPV 360
Query: 354 IRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
+++S +EVI RAN+S+YGL A+V++ DL++ V + L AGT
Sbjct: 361 LKYSDEDEVIARANDSEYGLGASVWSSDLERAEAVARRLEAGT 403
>gnl|CDD|215260 PLN02467, PLN02467, betaine aldehyde dehydrogenase.
Length = 503
Score = 362 bits (930), Expect = e-121
Identities = 157/353 (44%), Positives = 218/353 (61%), Gaps = 9/353 (2%)
Query: 50 SSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGK-TIPIN---GDFFAYTRHEP 105
+ LETLD GKP + + + D++ YYA A+ + K P++ F Y EP
Sbjct: 93 AKLETLDCGKPLDEA-AWDMDDVAGCFEYYADLAEALDAKQKAPVSLPMETFKGYVLKEP 151
Query: 106 VGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVV 165
+GV G I PWN+PLLM WK+APALA G T VLKP+E +T L + + E G+P GV+
Sbjct: 152 LGVVGLITPWNYPLLMATWKVAPALAAGCTAVLKPSELASVTCLELADICREVGLPPGVL 211
Query: 166 NIVPGYG-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIF 224
N+V G G +AGA L HPGVDK+AFTGST G+ + + A +K +LELGGKSP I+F
Sbjct: 212 NVVTGLGTEAGAPLASHPGVDKIAFTGSTATGRKIMT-AAAQMVKPVSLELGGKSPIIVF 270
Query: 225 ADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLN 284
D DLD AVE A FG F+ GQ C A SR V + I EF+ + + AK + DP +
Sbjct: 271 DDVDLDKAVEWAMFGCFWTNGQICSATSRLLVHERIASEFLEKLVKWAKNIKISDPLEEG 330
Query: 285 VEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGD--KGFYVQPTVFANVRDDMKIA 342
GP + + Q +K+L+FI + KS+GA ++ GG R KGF+++PT+ +V M+I
Sbjct: 331 CRLGPVVSEGQYEKVLKFISTAKSEGATILCGGKRPEHLKKGFFIEPTIITDVTTSMQIW 390
Query: 343 REEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
REE+FGPV + FS+ +E IE AN+S YGLA AV + DL++ V++ +AG
Sbjct: 391 REEVFGPVLCVKTFSTEDEAIELANDSHYGLAGAVISNDLERCERVSEAFQAG 443
>gnl|CDD|143425 cd07107, ALDH_PhdK-like, Nocardioides 2-carboxybenzaldehyde
dehydrogenase, PhdK-like. Nocardioides sp. strain
KP72-carboxybenzaldehyde dehydrogenase (PhdK), an enzyme
involved in phenanthrene degradation, and other similar
sequences, are present in this CD.
Length = 456
Score = 360 bits (925), Expect = e-121
Identities = 155/350 (44%), Positives = 219/350 (62%), Gaps = 9/350 (2%)
Query: 52 LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQ 111
++ LD G P + DV + L Y+AG ++ G+TIP+ G YT EP GV +
Sbjct: 64 IDALDCGNPVSAMLG-DVMVAAALLDYFAGLVTELKGETIPVGGRNLHYTLREPYGVVAR 122
Query: 112 IIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGY 171
I+ +N PL+ A KIA LA GNT+V+KP EQ PL+AL + L E +P GV NI+PG
Sbjct: 123 IVAFNHPLMFAAAKIAAPLAAGNTVVVKPPEQAPLSALRLAELAREV-LPPGVFNILPGD 181
Query: 172 GD-AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLD 230
G AGAALV+HP V ++A GS G+ + + + A +K TLELGGK+ I+F DAD +
Sbjct: 182 GATAGAALVRHPDVKRIALIGSVPTGRAIMR-AAAEGIKHVTLELGGKNALIVFPDADPE 240
Query: 231 AAVEGAHFGLFFN-MGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGP 289
AA + A G+ F GQ C + SR FV ++IYDE +AR ER VGDP D GP
Sbjct: 241 AAADAAVAGMNFTWCGQSCGSTSRLFVHESIYDEVLARVVERVAAIKVGDPTDPATTMGP 300
Query: 290 QIDKEQMDKILEFIESGKSQGAQLVAGGGR----AGDKGFYVQPTVFANVRDDMKIAREE 345
+ ++Q D+++ +I+S K +GA+LV GGGR A + GFYV+PTVFA+V M+IAREE
Sbjct: 301 LVSRQQYDRVMHYIDSAKREGARLVTGGGRPEGPALEGGFYVEPTVFADVTPGMRIAREE 360
Query: 346 IFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
IFGPV ++R+ E++ +AN +YGL AA++T D+ + + + + AG
Sbjct: 361 IFGPVLSVLRWRDEAEMVAQANGVEYGLTAAIWTNDISQAHRTARRVEAG 410
>gnl|CDD|143422 cd07104, ALDH_BenzADH-like, ALDH subfamily: NAD(P)+-dependent
benzaldehyde dehydrogenase II, vanillin dehydrogenase,
p-hydroxybenzaldehyde dehydrogenase and related
proteins. ALDH subfamily which includes the
NAD(P)+-dependent, benzaldehyde dehydrogenase II (XylC,
BenzADH, EC=1.2.1.28) involved in the oxidation of
benzyl alcohol to benzoate; p-hydroxybenzaldehyde
dehydrogenase (PchA, HBenzADH) which catalyzes the
oxidation of p-hydroxybenzaldehyde to p-hydroxybenzoic
acid; vanillin dehydrogenase (Vdh, VaniDH) involved in
the metabolism of ferulic acid as seen in Pseudomonas
putida KT2440; and other related sequences.
Length = 431
Score = 355 bits (913), Expect = e-119
Identities = 150/335 (44%), Positives = 210/335 (62%), Gaps = 17/335 (5%)
Query: 69 VEYSLRTLRYYAGFADKIHGKTIP--INGDFFAYTRHEPVGVCGQIIPWNFPLLMLAWK- 125
V ++ LR AG + G+ +P + G + R P+GV G I P+NFPL+ LA +
Sbjct: 61 VGAAIAILREAAGLPRRPEGEILPSDVPGKE-SMVRRVPLGVVGVISPFNFPLI-LAMRS 118
Query: 126 IAPALATGNTIVLKPAEQTPLTALAVGALIA----EAGIPEGVVNIVPGYGD-AGAALVQ 180
+APALA GN +VLKP +TP+T G LIA EAG+P+GV+N+VPG G G ALV+
Sbjct: 119 VAPALALGNAVVLKPDSRTPVTG---GLLIAEIFEEAGLPKGVLNVVPGGGSEIGDALVE 175
Query: 181 HPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAAVEGAHFGL 240
HP V ++FTGST VG+ + + +G +LK+ LELGG +P I+ DADLD AV A FG
Sbjct: 176 HPRVRMISFTGSTAVGRHIGELAGR-HLKKVALELGGNNPLIVLDDADLDLAVSAAAFGA 234
Query: 241 FFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKIL 300
F + GQ C A R V +++YDEFV + +AK VGDP D + GP I++ Q+D++
Sbjct: 235 FLHQGQICMAAGRILVHESVYDEFVEKLVAKAKALPVGDPRDPDTVIGPLINERQVDRVH 294
Query: 301 EFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIE 360
+E + GA+L+ GG +G + QPTV ++V DM I REEIFGPV +I F E
Sbjct: 295 AIVEDAVAAGARLLTGGTY---EGLFYQPTVLSDVTPDMPIFREEIFGPVAPVIPFDDDE 351
Query: 361 EVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
E +E AN+++YGL+AAVFT+DL++ + L G
Sbjct: 352 EAVELANDTEYGLSAAVFTRDLERAMAFAERLETG 386
>gnl|CDD|143407 cd07088, ALDH_LactADH-AldA, Escherichia coli lactaldehyde
dehydrogenase AldA-like. Lactaldehyde dehydrogenase
from Escherichia coli (AldA, LactADH, EC=1.2.1.22), an
NAD(+)-dependent enzyme involved in the metabolism of
L-fucose and L-rhamnose, and other similar sequences are
present in this CD.
Length = 468
Score = 355 bits (914), Expect = e-119
Identities = 150/345 (43%), Positives = 218/345 (63%), Gaps = 5/345 (1%)
Query: 54 TLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPI-NGDFFAYTRHEPVGVCGQI 112
+ GK +++ V+VE++ + Y A +A +I G+ IP + + P+GV I
Sbjct: 82 VEEQGKTLSLAR-VEVEFTADYIDYMAEWARRIEGEIIPSDRPNENIFIFKVPIGVVAGI 140
Query: 113 IPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYG 172
+PWNFP ++A K+APAL TGNTIV+KP+E+TPL AL L+ EAG+P GV+NIV G G
Sbjct: 141 LPWNFPFFLIARKLAPALVTGNTIVIKPSEETPLNALEFAELVDEAGLPAGVLNIVTGRG 200
Query: 173 -DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDA 231
G ALV HP V ++ TGSTE G+ + + + A N+ + +LELGGK+P I+ DADLD
Sbjct: 201 SVVGDALVAHPKVGMISLTGSTEAGQKIMEAA-AENITKVSLELGGKAPAIVMKDADLDL 259
Query: 232 AVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQI 291
AV+ N GQ C R +V + IYDEF+ + E+ K VGDPFD + GP +
Sbjct: 260 AVKAIVDSRIINCGQVCTCAERVYVHEDIYDEFMEKLVEKMKAVKVGDPFDAATDMGPLV 319
Query: 292 DKEQMDKILEFIESGKSQGAQLVAGGGRA-GDKGFYVQPTVFANVRDDMKIAREEIFGPV 350
++ +DK+ E +E GA L+ GG R G+KG++ +PTV NVR DM+I +EEIFGPV
Sbjct: 320 NEAALDKVEEMVERAVEAGATLLTGGKRPEGEKGYFYEPTVLTNVRQDMEIVQEEIFGPV 379
Query: 351 QQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
+++FSS++E IE AN+S+YGL + ++T++L+ T L G
Sbjct: 380 LPVVKFSSLDEAIELANDSEYGLTSYIYTENLNTAMRATNELEFG 424
>gnl|CDD|143449 cd07131, ALDH_AldH-CAJ73105, Uncharacterized Candidatus kuenenia
aldehyde dehydrogenase AldH (CAJ73105)-like.
Uncharacterized aldehyde dehydrogenase of Candidatus
kuenenia AldH (locus CAJ73105) and similar sequences
with similarity to alpha-aminoadipic semialdehyde
dehydrogenase (AASADH, human ALDH7A1, EC=1.2.1.31),
Arabidopsis ALDH7B4, and Streptomyces clavuligerus
delta-1-piperideine-6-carboxylate dehydrogenase (P6CDH)
are included in this CD.
Length = 478
Score = 353 bits (907), Expect = e-118
Identities = 168/352 (47%), Positives = 231/352 (65%), Gaps = 8/352 (2%)
Query: 50 SSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPIN-GDFFAYTRHEPVGV 108
+ L T + GKP DV+ ++ +Y AG ++ G+T+P + A TR +P+GV
Sbjct: 80 ARLVTREMGKPLAEGRG-DVQEAIDMAQYAAGEGRRLFGETVPSELPNKDAMTRRQPIGV 138
Query: 109 CGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIV 168
I PWNFP+ + +WKI PAL GNT+V KPAE TP AL + L AEAG+P GVVN+V
Sbjct: 139 VALITPWNFPVAIPSWKIFPALVCGNTVVFKPAEDTPACALKLVELFAEAGLPPGVVNVV 198
Query: 169 PGYGD-AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADA 227
G G+ G ALV+HP VD V+FTGSTEVG+ + + N KR LE+GGK+P I+ DA
Sbjct: 199 HGRGEEVGEALVEHPDVDVVSFTGSTEVGERIGETCARPN-KRVALEMGGKNPIIVMDDA 257
Query: 228 DLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQ 287
DLD A+EGA + F GQ C A SR V +++YDEF+ R ERAKR VGD D +
Sbjct: 258 DLDLALEGALWSAFGTTGQRCTATSRLIVHESVYDEFLKRFVERAKRLRVGDGLDEETDM 317
Query: 288 GPQIDKEQMDKILEFIESGKSQGAQLVAGGGRA----GDKGFYVQPTVFANVRDDMKIAR 343
GP I++ Q++K+L + E GK +GA L+ GG R +KG++V+PTVF +V DM+IA+
Sbjct: 318 GPLINEAQLEKVLNYNEIGKEEGATLLLGGERLTGGGYEKGYFVEPTVFTDVTPDMRIAQ 377
Query: 344 EEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
EEIFGPV LI SS+EE IE AN+++YGL++A++T+D++K + L AG
Sbjct: 378 EEIFGPVVALIEVSSLEEAIEIANDTEYGLSSAIYTEDVNKAFRARRDLEAG 429
>gnl|CDD|143426 cd07108, ALDH_MGR_2402, Magnetospirillum NAD(P)+-dependent aldehyde
dehydrogenase MSR-1-like. NAD(P)+-dependent aldehyde
dehydrogenase of Magnetospirillum gryphiswaldense MSR-1
(MGR_2402) , and other similar sequences, are present in
this CD.
Length = 457
Score = 351 bits (903), Expect = e-118
Identities = 157/351 (44%), Positives = 213/351 (60%), Gaps = 9/351 (2%)
Query: 52 LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQ 111
L L+ G + RY+ G A ++ G+T+P D YT EP+GV G
Sbjct: 64 LLALETGNALRTQARPEAAVLADLFRYFGGLAGELKGETLPFGPDVLTYTVREPLGVVGA 123
Query: 112 IIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGY 171
I+PWN PL++ A KIAPAL GNT+VLK AE PL L + ++A+ +P GV+N++ GY
Sbjct: 124 ILPWNAPLMLAALKIAPALVAGNTVVLKAAEDAPLAVLLLAEILAQV-LPAGVLNVITGY 182
Query: 172 G-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLD 230
G + GAALV HP VDKV FTGSTEVGK++ + + A L +LELGGKSP I+F DADLD
Sbjct: 183 GEECGAALVDHPDVDKVTFTGSTEVGKIIYR-AAADRLIPVSLELGGKSPMIVFPDADLD 241
Query: 231 AAVEGAHFGL-FFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGP 289
AV+GA G+ F GQ C AGSR FV + IYD F+ + + + +GDP D + G
Sbjct: 242 DAVDGAIAGMRFTRQGQSCTAGSRLFVHEDIYDAFLEKLVAKLSKLKIGDPLDEATDIGA 301
Query: 290 QIDKEQMDKILEFIESGKS-QGAQLVAGG----GRAGDKGFYVQPTVFANVRDDMKIARE 344
I ++Q K+ +I+ G S GA ++ GG GF+VQPT+F+ V ++ ++ARE
Sbjct: 302 IISEKQFAKVCGYIDLGLSTSGATVLRGGPLPGEGPLADGFFVQPTIFSGVDNEWRLARE 361
Query: 345 EIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
EIFGPV I + +EVI AN+S YGLAA V+T+DL + L AG
Sbjct: 362 EIFGPVLCAIPWKDEDEVIAMANDSHYGLAAYVWTRDLGRALRAAHALEAG 412
>gnl|CDD|182108 PRK09847, PRK09847, gamma-glutamyl-gamma-aminobutyraldehyde
dehydrogenase; Provisional.
Length = 494
Score = 350 bits (899), Expect = e-116
Identities = 151/348 (43%), Positives = 222/348 (63%), Gaps = 4/348 (1%)
Query: 52 LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQ 111
LETLD GKP S D+ + R +R+YA DK++G+ + A EPVGV
Sbjct: 104 LETLDTGKPIRHSLRDDIPGAARAIRWYAEAIDKVYGEVATTSSHELAMIVREPVGVIAA 163
Query: 112 IIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGY 171
I+PWNFPLL+ WK+ PALA GN+++LKP+E++PL+A+ + L EAG+P+GV+N+V G+
Sbjct: 164 IVPWNFPLLLTCWKLGPALAAGNSVILKPSEKSPLSAIRLAGLAKEAGLPDGVLNVVTGF 223
Query: 172 G-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADA-DL 229
G +AG AL +H +D +AFTGST GK + + +G SN+KR LE GGKS NI+FAD DL
Sbjct: 224 GHEAGQALSRHNDIDAIAFTGSTRTGKQLLKDAGDSNMKRVWLEAGGKSANIVFADCPDL 283
Query: 230 DAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGP 289
A G+F+N GQ C AG+R ++++I DEF+A ++A+ G P D G
Sbjct: 284 QQAASATAAGIFYNQGQVCIAGTRLLLEESIADEFLALLKQQAQNWQPGHPLDPATTMGT 343
Query: 290 QIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGP 349
ID D + FI G+S+G L+ GR + PT+F +V + ++REEIFGP
Sbjct: 344 LIDCAHADSVHSFIREGESKGQLLL--DGRNAGLAAAIGPTIFVDVDPNASLSREEIFGP 401
Query: 350 VQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
V + RF+S E+ ++ AN+S YGL AAV+T+DL + + +++ L+AG++
Sbjct: 402 VLVVTRFTSEEQALQLANDSQYGLGAAVWTRDLSRAHRMSRRLKAGSV 449
>gnl|CDD|143467 cd07149, ALDH_y4uC, Uncharacterized ALDH (y4uC) with similarity to
Tortula ruralis aldehyde dehydrogenase ALDH21A1.
Uncharacterized aldehyde dehydrogenase (ORF name y4uC)
with sequence similarity to the moss Tortula ruralis
aldehyde dehydrogenase ALDH21A1 (RNP123) believed to
play an important role in the detoxification of
aldehydes generated in response to desiccation- and
salinity-stress, and similar sequences are included in
this CD.
Length = 453
Score = 344 bits (884), Expect = e-115
Identities = 146/347 (42%), Positives = 207/347 (59%), Gaps = 13/347 (3%)
Query: 58 GKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPING-----DFFAYTRHEPVGVCGQI 112
GKP + +V+ ++ TLR A A ++ G+TIP + +T EP+GV I
Sbjct: 72 GKPIKDAR-KEVDRAIETLRLSAEEAKRLAGETIPFDASPGGEGRIGFTIREPIGVVAAI 130
Query: 113 IPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYG 172
P+NFPL ++A K+ PA+A GN +VLKPA QTPL+AL + L+ EAG+P+G +N+V G G
Sbjct: 131 TPFNFPLNLVAHKVGPAIAAGNAVVLKPASQTPLSALKLAELLLEAGLPKGALNVVTGSG 190
Query: 173 DA-GAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDA 231
+ G ALV P V ++FTGS VG+ + + +G LK+ TLELG + I+ ADADL+
Sbjct: 191 ETVGDALVTDPRVRMISFTGSPAVGEAIARKAG---LKKVTLELGSNAAVIVDADADLEK 247
Query: 232 AVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQI 291
AVE G F N GQ C + R FV + IYDEF+ R K+ VGDP D + + GP I
Sbjct: 248 AVERCVSGAFANAGQVCISVQRIFVHEDIYDEFLERFVAATKKLVVGDPLDEDTDVGPMI 307
Query: 292 DKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQ 351
+ + ++I E++E GA+L+ GG R G ++PTV +V DMK+ EE+F PV
Sbjct: 308 SEAEAERIEEWVEEAVEGGARLLTGGKRDGA---ILEPTVLTDVPPDMKVVCEEVFAPVV 364
Query: 352 QLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTLI 398
L F +++E I AN+S YGL A VFT DL K + L G ++
Sbjct: 365 SLNPFDTLDEAIAMANDSPYGLQAGVFTNDLQKALKAARELEVGGVM 411
>gnl|CDD|143463 cd07145, ALDH_LactADH_F420-Bios, Methanocaldococcus jannaschii
NAD+-dependent lactaldehyde dehydrogenase-like.
NAD+-dependent, lactaldehyde dehydrogenase (EC=1.2.1.22)
involved the biosynthesis of coenzyme F(420) in
Methanocaldococcus jannaschii through the oxidation of
lactaldehyde to lactate and generation of NAPH, and
similar sequences are included in this CD.
Length = 456
Score = 338 bits (870), Expect = e-113
Identities = 141/351 (40%), Positives = 212/351 (60%), Gaps = 10/351 (2%)
Query: 54 TLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPI-----NGDFFAYTRHEPVGV 108
T++ GKP S V+VE ++R + A A + G+TIP+ N A+T EP+GV
Sbjct: 68 TIEVGKPIKQS-RVEVERTIRLFKLAAEEAKVLRGETIPVDAYEYNERRIAFTVREPIGV 126
Query: 109 CGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIV 168
G I P+NFP + A KIAPA+A GN++V+KP+ TPLTA+ + ++ EAG+P GV+N+V
Sbjct: 127 VGAITPFNFPANLFAHKIAPAIAVGNSVVVKPSSNTPLTAIELAKILEEAGLPPGVINVV 186
Query: 169 PGYGD-AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADA 227
GYG G +V +P V+ ++FTGST VG L+ +G K+ LELGG P I+ DA
Sbjct: 187 TGYGSEVGDEIVTNPKVNMISFTGSTAVGLLIASKAG-GTGKKVALELGGSDPMIVLKDA 245
Query: 228 DLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQ 287
DL+ AV A G F N GQ C A R V++ +YD+F+ E+ K+ VGDP D + +
Sbjct: 246 DLERAVSIAVRGRFENAGQVCNAVKRILVEEEVYDKFLKLLVEKVKKLKVGDPLDESTDL 305
Query: 288 GPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIF 347
GP I E ++++ + +G +++ GG D+G + PTV N DM + +EE+F
Sbjct: 306 GPLISPEAVERMENLVNDAVEKGGKILYGG--KRDEGSFFPPTVLENDTPDMIVMKEEVF 363
Query: 348 GPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTLI 398
GPV + + EE +E AN+++YGL A+VFT D+++ V + L AG ++
Sbjct: 364 GPVLPIAKVKDDEEAVEIANSTEYGLQASVFTNDINRALKVARELEAGGVV 414
>gnl|CDD|215157 PLN02278, PLN02278, succinic semialdehyde dehydrogenase.
Length = 498
Score = 334 bits (858), Expect = e-110
Identities = 155/349 (44%), Positives = 212/349 (60%), Gaps = 10/349 (2%)
Query: 52 LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTR----HEPVG 107
L TL+ GKP + +V Y L Y+A A +++G IP F R +PVG
Sbjct: 107 LMTLEQGKPLKEAIG-EVAYGASFLEYFAEEAKRVYGDIIPSP---FPDRRLLVLKQPVG 162
Query: 108 VCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNI 167
V G I PWNFPL M+ K+ PALA G T+V+KP+E TPLTALA L +AGIP GV+N+
Sbjct: 163 VVGAITPWNFPLAMITRKVGPALAAGCTVVVKPSELTPLTALAAAELALQAGIPPGVLNV 222
Query: 168 VPGYGDA-GAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFAD 226
V G G AL+ P V K+ FTGST VGK + G+ A+ +KR +LELGG +P I+F D
Sbjct: 223 VMGDAPEIGDALLASPKVRKITFTGSTAVGKKLMAGAAAT-VKRVSLELGGNAPFIVFDD 281
Query: 227 ADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVE 286
ADLD AV+GA F N GQ C +R VQ+ IYD+F + ++ VGD F+ V
Sbjct: 282 ADLDVAVKGALASKFRNSGQTCVCANRILVQEGIYDKFAEAFSKAVQKLVVGDGFEEGVT 341
Query: 287 QGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEI 346
QGP I++ + K+ ++ S+GA+++ GG R G + +PTV +V +DM I REE+
Sbjct: 342 QGPLINEAAVQKVESHVQDAVSKGAKVLLGGKRHSLGGTFYEPTVLGDVTEDMLIFREEV 401
Query: 347 FGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
FGPV L RF + EE I AN+++ GLAA +FT+DL + V++ L G
Sbjct: 402 FGPVAPLTRFKTEEEAIAIANDTEAGLAAYIFTRDLQRAWRVSEALEYG 450
>gnl|CDD|131352 TIGR02299, HpaE, 5-carboxymethyl-2-hydroxymuconate semialdehyde
dehydrogenase. This model represents the dehydrogenase
responsible for the conversion of
5-carboxymethyl-2-hydroxymuconate semialdehyde to
5-carboxymethyl-2-hydroxymuconate (a tricarboxylic
acid). This is the step in the degradation of
4-hydroxyphenylacetic acid via homoprotocatechuate
following the oxidative opening of the aromatic ring.
Length = 488
Score = 334 bits (857), Expect = e-110
Identities = 155/355 (43%), Positives = 217/355 (61%), Gaps = 12/355 (3%)
Query: 52 LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFA-DKIHGKTIPINGDFFAYTRHEPVGVCG 110
LE LD G+P + V + R++A + + G+T P++ YT PVG G
Sbjct: 83 LECLDCGQPLRQTRQQ-VIRAAENFRFFADKCEEAMDGRTYPVD-THLNYTVRVPVGPVG 140
Query: 111 QIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPG 170
I PWN P ++ WKIAPALA GNT+VLKPAE +PLTA + + EAG+P+GV N+V G
Sbjct: 141 LITPWNAPFMLSTWKIAPALAFGNTVVLKPAEWSPLTAARLAEIAKEAGLPDGVFNLVHG 200
Query: 171 YG-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADL 229
+G +AG ALV HP V V+FTG T G ++ + +GA LKR ++ELGGKSP I+F DADL
Sbjct: 201 FGEEAGKALVAHPDVKAVSFTGETATGSIIMR-NGADTLKRFSMELGGKSPVIVFDDADL 259
Query: 230 DAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGP 289
+ A++ F +F G+ C A SR VQ++I ++FV + ER + VG P D E GP
Sbjct: 260 ERALDAVVFMIFSFNGERCTASSRLLVQESIAEDFVEKLVERVRAIRVGHPLDPETEVGP 319
Query: 290 QIDKEQMDKILEFIESGKSQGAQLVAGGGRAGD-------KGFYVQPTVFANVRDDMKIA 342
I E + K+L ++E+ + +GA ++ GG RA +G YV PTVF + M+IA
Sbjct: 320 LIHPEHLAKVLGYVEAAEKEGATILVGGERAPTFRGEDLGRGNYVLPTVFTGADNHMRIA 379
Query: 343 REEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
+EEIFGPV +I F EE IE+AN++ YGLA V+T D+ + + V L AG +
Sbjct: 380 QEEIFGPVLTVIPFKDEEEAIEKANDTRYGLAGYVWTNDVGRAHRVALALEAGMI 434
>gnl|CDD|143434 cd07116, ALDH_ACDHII-AcoD, Ralstonia eutrophus NAD+-dependent
acetaldehyde dehydrogenase II-like. Included in this CD
is the NAD+-dependent, acetaldehyde dehydrogenase II
(AcDHII, AcoD, EC=1.2.1.3) from Ralstonia (Alcaligenes)
eutrophus H16 involved in the catabolism of acetoin and
ethanol, and similar proteins, such as, the dimeric
dihydrolipoamide dehydrogenase of the acetoin
dehydrogenase enzyme system of Klebsiella pneumonia.
Also included are sequences similar to the
NAD+-dependent chloroacetaldehyde dehydrogenases (AldA
and AldB) of Xanthobacter autotrophicus GJ10 which are
involved in the degradation of 1,2-dichloroethane. These
proteins apparently require RpoN factors for expression.
Length = 479
Score = 332 bits (852), Expect = e-110
Identities = 158/356 (44%), Positives = 213/356 (59%), Gaps = 19/356 (5%)
Query: 53 ETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQI 112
ET DNGKP + + D+ ++ RY+AG G I+ + AY HEP+GV GQI
Sbjct: 84 ETWDNGKPVRETLAADIPLAIDHFRYFAGCIRAQEGSISEIDENTVAYHFHEPLGVVGQI 143
Query: 113 IPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYG 172
IPWNFPLLM WK+APALA GN +VLKPAEQTP + L + LI + +P GVVN+V G+G
Sbjct: 144 IPWNFPLLMATWKLAPALAAGNCVVLKPAEQTPASILVLMELIGDL-LPPGVVNVVNGFG 202
Query: 173 -DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFA------ 225
+AG L + KVAFTG T G+L+ Q + N+ TLELGGKSPNI FA
Sbjct: 203 LEAGKPLASSKRIAKVAFTGETTTGRLIMQ-YASENIIPVTLELGGKSPNIFFADVMDAD 261
Query: 226 DADLDAAVEGAHFGLF-FNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLN 284
DA D A+EG F +F N G+ C SR +Q++IYD F+ R+ ER K G+P D
Sbjct: 262 DAFFDKALEG--FVMFALNQGEVCTCPSRALIQESIYDRFMERALERVKAIKQGNPLDTE 319
Query: 285 VEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRA-----GDKGFYVQPTVFANVRDDM 339
G Q EQ++KIL +I+ GK +GA+++ GG R G+YV T + M
Sbjct: 320 TMIGAQASLEQLEKILSYIDIGKEEGAEVLTGGERNELGGLLGGGYYVPTTFKGG--NKM 377
Query: 340 KIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
+I +EEIFGPV + F EE +E AN++ YGL A V+T+D + + +G++AG
Sbjct: 378 RIFQEEIFGPVLAVTTFKDEEEALEIANDTLYGLGAGVWTRDGNTAYRMGRGIQAG 433
>gnl|CDD|143468 cd07150, ALDH_VaniDH_like, Pseudomonas putida vanillin
dehydrogenase-like. Vanillin dehydrogenase (Vdh,
VaniDH) involved in the metabolism of ferulic acid and
other related sequences are included in this CD. The
E. coli vanillin dehydrogenase (LigV) preferred NAD+ to
NADP+ and exhibited a broad substrate preference,
including vanillin, benzaldehyde, protocatechualdehyde,
m-anisaldehyde, and p-hydroxybenzaldehyde.
Length = 451
Score = 330 bits (849), Expect = e-109
Identities = 139/329 (42%), Positives = 203/329 (61%), Gaps = 6/329 (1%)
Query: 69 VEYSLRTLRYYAGFADKIHGKTIPING-DFFAYTRHEPVGVCGQIIPWNFPLLMLAWKIA 127
++ LR AG ++ G+T+P + + + P+GV I P+N+PL++ K+A
Sbjct: 82 TTFTPELLRAAAGECRRVRGETLPSDSPGTVSMSVRRPLGVVAGITPFNYPLILATKKVA 141
Query: 128 PALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYGDA-GAALVQHPGVDK 186
ALA GNT+VLKP+E+TP+ L + ++ EAG+P+GV N+V G G G LV P V
Sbjct: 142 FALAAGNTVVLKPSEETPVIGLKIAEIMEEAGLPKGVFNVVTGGGAEVGDELVDDPRVRM 201
Query: 187 VAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQ 246
V FTGST VG+ + + +G +LK+ TLELGGK+P I+ ADADLD AV A FG F + GQ
Sbjct: 202 VTFTGSTAVGREIAEKAGR-HLKKITLELGGKNPLIVLADADLDYAVRAAAFGAFMHQGQ 260
Query: 247 CCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESG 306
C + SR V++ +YDEFV + RA + VGDP D + GP I Q+++I +E
Sbjct: 261 ICMSASRIIVEEPVYDEFVKKFVARASKLKVGDPRDPDTVIGPLISPRQVERIKRQVEDA 320
Query: 307 KSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERA 366
++GA+L+ GG G+ + QPTV +V DM+I REE FGPV +I EE +E A
Sbjct: 321 VAKGAKLLTGGKYDGN---FYQPTVLTDVTPDMRIFREETFGPVTSVIPAKDAEEALELA 377
Query: 367 NNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
N+++YGL+AA+ T DL + + + L +G
Sbjct: 378 NDTEYGLSAAILTNDLQRAFKLAERLESG 406
>gnl|CDD|143431 cd07113, ALDH_PADH_NahF, Escherichia coli NAD+-dependent
phenylacetaldehyde dehydrogenase PadA-like.
NAD+-dependent, homodimeric, phenylacetaldehyde
dehydrogenase (PADH, EC=1.2.1.39) PadA of Escherichia
coli involved in the catabolism of 2-phenylethylamine,
and other related sequences, are present in this CD.
Also included is the Pseudomonas fluorescens ST StyD
PADH involved in styrene catabolism, the Sphingomonas
sp. LB126 FldD protein involved in fluorene degradation,
and the Novosphingobium aromaticivorans NahF
salicylaldehyde dehydrogenase involved in the
NAD+-dependent conversion of salicylaldehyde to
salicylate.
Length = 477
Score = 329 bits (844), Expect = e-108
Identities = 149/352 (42%), Positives = 214/352 (60%), Gaps = 7/352 (1%)
Query: 52 LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKT----IP-INGD-FFAYTRHEP 105
LETL +GK ++S + +V S LRY+AG+A KI+G+T IP + G+ + A+TR EP
Sbjct: 83 LETLCSGKSIHLSRAFEVGQSANFLRYFAGWATKINGETLAPSIPSMQGERYTAFTRREP 142
Query: 106 VGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVV 165
VGV I+PWNF +++ WKI ALATG TIV+KP+E TPLT L V L EAGIP+GV+
Sbjct: 143 VGVVAGIVPWNFSVMIAVWKIGAALATGCTIVIKPSEFTPLTLLRVAELAKEAGIPDGVL 202
Query: 166 NIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFA 225
N+V G G GA L+ HP V KV+FTGS GK + + AS+L R TLELGGK+
Sbjct: 203 NVVNGKGAVGAQLISHPDVAKVSFTGSVATGKKIGR-QAASDLTRVTLELGGKNAAAFLK 261
Query: 226 DADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNV 285
DAD+D VEG F + GQ C A R +V + +DE V + + VG P D +V
Sbjct: 262 DADIDWVVEGLLTAGFLHQGQVCAAPERFYVHRSKFDELVTKLKQALSSFQVGSPMDESV 321
Query: 286 EQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREE 345
GP ++ DK+ +++ +++G ++V GG +G++VQPT+ D ++ REE
Sbjct: 322 MFGPLANQPHFDKVCSYLDDARAEGDEIVRGGEALAGEGYFVQPTLVLARSADSRLMREE 381
Query: 346 IFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
FGPV + + EE+I+ N++ +GL A+V+T +L K + AGT+
Sbjct: 382 TFGPVVSFVPYEDEEELIQLINDTPFGLTASVWTNNLSKALRYIPRIEAGTV 433
>gnl|CDD|143405 cd07086, ALDH_F7_AASADH-like, NAD+-dependent alpha-aminoadipic
semialdehyde dehydrogenase and related proteins. ALDH
subfamily which includes the NAD+-dependent,
alpha-aminoadipic semialdehyde dehydrogenase (AASADH,
EC=1.2.1.31), also known as Antiquitin-1, ALDH7A1,
ALDH7B or delta-1-piperideine-6-carboxylate
dehydrogenase (P6CDH), and other similar sequences, such
as the uncharacterized aldehyde dehydrogenase of
Candidatus kuenenia AldH (locus CAJ73105).
Length = 478
Score = 327 bits (841), Expect = e-108
Identities = 131/312 (41%), Positives = 180/312 (57%), Gaps = 8/312 (2%)
Query: 78 YYAGFADKIHGKTIPIN-GDFFAYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTI 136
Y G + ++G TIP + P+GV G I +NFP+ + W A AL GNT+
Sbjct: 105 YAVGLSRMLYGLTIPSERPGHRLMEQWNPLGVVGVITAFNFPVAVPGWNAAIALVCGNTV 164
Query: 137 VLKPAEQTPLTALAVGALIAEA----GIPEGVVNIVPGYGDAGAALVQHPGVDKVAFTGS 192
V KP+E TPLTA+AV ++AE G+P GVVN+V G GD G LV P V V+FTGS
Sbjct: 165 VWKPSETTPLTAIAVTKILAEVLEKNGLPPGVVNLVTGGGDGGELLVHDPRVPLVSFTGS 224
Query: 193 TEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGS 252
TEVG+ V + + A R LELGG + I+ DADLD AV F GQ C
Sbjct: 225 TEVGRRVGE-TVARRFGRVLLELGGNNAIIVMDDADLDLAVRAVLFAAVGTAGQRCTTTR 283
Query: 253 RTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQ 312
R V +++YDEF+ R + K+ +GDP D GP I++ ++K L IE KSQG
Sbjct: 284 RLIVHESVYDEFLERLVKAYKQVRIGDPLDEGTLVGPLINQAAVEKYLNAIEIAKSQGGT 343
Query: 313 LVAGGGRA--GDKGFYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSD 370
++ GG R G+ G YV+PT+ V DD +I +EE F P+ +I+F S+EE I N+
Sbjct: 344 VLTGGKRIDGGEPGNYVEPTIVTGVTDDARIVQEETFAPILYVIKFDSLEEAIAINNDVP 403
Query: 371 YGLAAAVFTKDL 382
GL++++FT+DL
Sbjct: 404 QGLSSSIFTEDL 415
>gnl|CDD|188167 TIGR01780, SSADH, succinate-semialdehyde dehydrogenase. Succinic
semialdehyde dehydrogenase is one of three enzymes
constituting 4-aminobutyrate (GABA) degradation in both
prokaryotes and eukaryotes, catalyzing the
(NAD(P)+)-dependent catabolism reaction of succinic
semialdehyde to succinate for metabolism by the citric
acid cycle. The EC number depends on the cofactor:
1.2.1.24 for NAD only, 1.2.1.79 for NADP only, and
1.2.1.16 if both can be used. In Escherichia coli,
succinic semialdehyde dehydrogenase is located in an
unidirectionally transcribed gene cluster encoding
enzymes for GABA degradation and is suggested to be
cotranscribed with succinic semialdehyde transaminase
from a common promoter upstream of SSADH. Similar gene
arrangements can be found in characterized Ralstonia
eutropha and the genome analysis of Bacillus subtilis.
Prokaryotic succinic semialdehyde dehydrogenases
(1.2.1.16) share high sequence homology to characterized
succinic semialdehyde dehydrogenases from rat and human
(1.2.1.24), exhibiting conservation of proposed cofactor
binding residues, and putative active sites (G-237 &
G-242, C-293 & G-259 respectively of rat SSADH).
Eukaryotic SSADH enzymes exclusively utilize NAD+ as a
cofactor, exhibiting little to no NADP+ activity. While
a NADP+ preference has been detected in prokaryotes in
addition to both NADP+- and NAD+-dependencies as in
E.coli, Pseudomonas, and Klebsiella pneumoniae. The
function of this alternative SSADH currently is unknown,
but has been suggested to play a possible role in
4-hydroxyphenylacetic degradation. Just outside the
scope of this model, are several sequences belonging to
clades scoring between trusted and noise. These
sequences may be actual SSADH enzymes, but lack
sufficiently close characterized homologs to make a
definitive assignment at this time. SSADH enzyme belongs
to the aldehyde dehydrogenase family (pfam00171),
sharing a common evolutionary origin and enzymatic
mechanism with lactaldehyde dehydrogenase. Like in
lactaldehyde dehydrogenase and succinate semialdehyde
dehydrogenase, the mammalian catalytic glutamic acid and
cysteine residues are conserved in all the enzymes of
this family (PS00687, PS00070) [Central intermediary
metabolism, Other].
Length = 448
Score = 322 bits (826), Expect = e-106
Identities = 149/345 (43%), Positives = 213/345 (61%), Gaps = 5/345 (1%)
Query: 54 TLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIP-INGDFFAYTRHEPVGVCGQI 112
TL+NGKP + ++ Y+ L ++A A +++G TIP D +PVGVC I
Sbjct: 66 TLENGKPLKEAKG-EILYAASFLEWFAEEAKRVYGDTIPSPQSDKRLIVIKQPVGVCAAI 124
Query: 113 IPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGY- 171
PWNFP M+ K ALA G T+V+KPAEQTPL+ALA+ L +AGIP+GV+N++ G
Sbjct: 125 TPWNFPAAMITRKAGAALAAGCTVVVKPAEQTPLSALALARLAEQAGIPKGVLNVITGSR 184
Query: 172 -GDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLD 230
+ G L P V K++FTGST VGK++ + S AS +K+ ++ELGG +P I+F DADLD
Sbjct: 185 AKEVGNVLTTSPLVRKISFTGSTNVGKILMKQS-ASTVKKVSMELGGNAPFIVFDDADLD 243
Query: 231 AAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQ 290
AVEGA F N GQ C +R +V D IYDEF + E K+ VG+ D V QGP
Sbjct: 244 QAVEGAMASKFRNAGQTCVCANRLYVHDGIYDEFAKKLAEAVKKLKVGNGLDEGVTQGPL 303
Query: 291 IDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPV 350
I+++ ++K+ + I +GA++V GG R G + +PTV +NV DM +++EE FGP+
Sbjct: 304 INEKAVEKVEKHIADAVEKGAKVVTGGKRHELGGNFFEPTVLSNVTADMLVSKEETFGPL 363
Query: 351 QQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
+ +F EEVI AN+++ GLAA F++DL + V + L G
Sbjct: 364 APVFKFDDEEEVIAIANDTEVGLAAYFFSRDLSRIWRVAEALEYG 408
>gnl|CDD|143442 cd07124, ALDH_PutA-P5CDH-RocA, Delta(1)-pyrroline-5-carboxylate
dehydrogenase, RocA. Delta(1)-pyrroline-5-carboxylate
dehydrogenase (EC=1.5.1.12 ), RocA: a proline catabolic
enzyme of the aldehyde dehydrogenase (ALDH) protein
superfamily. The proline catabolic enzymes, proline
dehydrogenase and Delta(1)-pyrroline-5-carboxylate
dehydrogenase (P5CDH), catalyze the two-step oxidation
of proline to glutamate; P5CDH catalyzes the oxidation
of glutamate semialdehyde, utilizing NAD+ as the
electron acceptor. In some bacteria, the two enzymes are
fused into the bifunctional flavoenzyme, proline
utilization A (PutA). In this CD, monofunctional enzyme
sequences such as seen in the Bacillus subtilis RocA
P5CDH are also present. These enzymes play important
roles in cellular redox control, superoxide generation,
and apoptosis.
Length = 512
Score = 322 bits (827), Expect = e-105
Identities = 146/356 (41%), Positives = 203/356 (57%), Gaps = 10/356 (2%)
Query: 50 SSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVC 109
++ L+ GK + + DV ++ L YYA ++ G + + + P+GV
Sbjct: 112 AAWMVLEVGKNWAEAD-ADVAEAIDFLEYYAREMLRLRGFPVEMVPGEDNRYVYRPLGVG 170
Query: 110 GQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVP 169
I PWNFPL +LA AL TGNT+VLKPAE TP+ A + ++ EAG+P GVVN +P
Sbjct: 171 AVISPWNFPLAILAGMTTAALVTGNTVVLKPAEDTPVIAAKLVEILEEAGLPPGVVNFLP 230
Query: 170 GYGDA-GAALVQHPGVDKVAFTGSTEVG-----KLVQQGSGASNLKRTTLELGGKSPNII 223
G G+ G LV+HP V +AFTGS EVG + + G LKR E+GGK+ I+
Sbjct: 231 GPGEEVGDYLVEHPDVRFIAFTGSREVGLRIYERAAKVQPGQKWLKRVIAEMGGKNAIIV 290
Query: 224 FADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDL 283
DADLD A EG F GQ C A SR V +++YDEF+ R ER K VGDP D
Sbjct: 291 DEDADLDEAAEGIVRSAFGFQGQKCSACSRVIVHESVYDEFLERLVERTKALKVGDPEDP 350
Query: 284 NVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGD--KGFYVQPTVFANVRDDMKI 341
V GP IDK D+I +IE GKS+G +L+ GG +G++VQPT+FA+V D ++
Sbjct: 351 EVYMGPVIDKGARDRIRRYIEIGKSEG-RLLLGGEVLELAAEGYFVQPTIFADVPPDHRL 409
Query: 342 AREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
A+EEIFGPV +I+ +E +E AN+++YGL VF++ + + G L
Sbjct: 410 AQEEIFGPVLAVIKAKDFDEALEIANDTEYGLTGGVFSRSPEHLERARREFEVGNL 465
>gnl|CDD|143417 cd07099, ALDH_DDALDH, Methylomonas sp.
4,4'-diapolycopene-dialdehyde dehydrogenase-like. The
4,4'-diapolycopene-dialdehyde dehydrogenase (DDALDH)
involved in C30 carotenoid synthesis in Methylomonas sp.
strain 16a and other similar sequences are present in
this CD. DDALDH converts 4,4'-diapolycopene-dialdehyde
into 4,4'-diapolycopene-diacid.
Length = 453
Score = 314 bits (807), Expect = e-103
Identities = 132/347 (38%), Positives = 191/347 (55%), Gaps = 7/347 (2%)
Query: 54 TLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHG----KTIPINGDFFAYTRHEPVGVC 109
+ GKP + ++V +L + + A A ++ T + + A + P GV
Sbjct: 65 HAETGKPRADAG-LEVLLALEAIDWAARNAPRVLAPRKVPTGLLMPNKKATVEYRPYGVV 123
Query: 110 GQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVP 169
G I PWN+PLL I PALA GN +VLKP+E TPL + A AG P+GV+ +V
Sbjct: 124 GVISPWNYPLLTPMGDIIPALAAGNAVVLKPSEVTPLVGELLAEAWAAAGPPQGVLQVVT 183
Query: 170 GYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADL 229
G G GAAL+ GVDKVAFTGS G+ V + A L LELGGK P I+ ADADL
Sbjct: 184 GDGATGAALIDA-GVDKVAFTGSVATGRKVMAAA-AERLIPVVLELGGKDPMIVLADADL 241
Query: 230 DAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGP 289
+ A A +G N GQ C + R +V +++YDEFVAR +A+ G + + GP
Sbjct: 242 ERAAAAAVWGAMVNAGQTCISVERVYVHESVYDEFVARLVAKARALRPGADDIGDADIGP 301
Query: 290 QIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGP 349
Q+D + ++ ++GA+ + GG R+ G + +PTV +V DM + REE FGP
Sbjct: 302 MTTARQLDIVRRHVDDAVAKGAKALTGGARSNGGGPFYEPTVLTDVPHDMDVMREETFGP 361
Query: 350 VQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
V ++ + +E I AN+S YGL+A+VF++DL + + + L AG
Sbjct: 362 VLPVMPVADEDEAIALANDSRYGLSASVFSRDLARAEAIARRLEAGA 408
>gnl|CDD|132260 TIGR03216, OH_muco_semi_DH, 2-hydroxymuconic semialdehyde
dehydrogenase. Members of this protein family are
2-hydroxymuconic semialdehyde dehydrogenase. Many
aromatic compounds are catabolized by way of the
catechol, via the meta-cleavage pathway, to pyruvate and
acetyl-CoA. This enzyme performs the second of seven
steps in that pathway for catechol degradation [Energy
metabolism, Other].
Length = 481
Score = 308 bits (792), Expect = e-101
Identities = 141/364 (38%), Positives = 201/364 (55%), Gaps = 16/364 (4%)
Query: 43 NEFVNAQSSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGK----TIPINGDFF 98
++F+ A E D GKP +++ +D+ R +A + P
Sbjct: 77 DDFLAA----EVADTGKPRSLASHLDIPRGAANFRVFADVVKNAPTECFEMATPDGKGAL 132
Query: 99 AYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEA 158
Y +P+GV G I PWN PLL++ WK+ PALA GNT+V+KP+E+TP TA +G ++
Sbjct: 133 NYAVRKPLGVVGVISPWNLPLLLMTWKVGPALACGNTVVVKPSEETPGTATLLGEVMNAV 192
Query: 159 GIPEGVVNIVPGYG--DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELG 216
G+P+GV N+V G+G AG L +HPGVD + FTG T G + + + A +K + ELG
Sbjct: 193 GVPKGVYNVVHGFGPDSAGEFLTRHPGVDAITFTGETRTGSAIMKAA-ADGVKPVSFELG 251
Query: 217 GKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRT 276
GK+ I+FAD D DAAV G F N GQ C R +V+ I+D FVA RA+
Sbjct: 252 GKNAAIVFADCDFDAAVAGILRSAFLNTGQVCLGTERVYVERPIFDRFVAALKARAESLK 311
Query: 277 VGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGG-----RAGDKGFYVQPTV 331
+G P D GP I E DK+L + +GA +V GGG A G +VQPT+
Sbjct: 312 IGVPDDPATNMGPLISAEHRDKVLSYYALAVEEGATVVTGGGVPDFGDALAGGAWVQPTI 371
Query: 332 FANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQG 391
+ + D ++ EEIFGP + F S EEVI AN++ YGLAA+V+T+DL + + V +
Sbjct: 372 WTGLPDSARVVTEEIFGPCCHIAPFDSEEEVIALANDTPYGLAASVWTEDLSRAHRVARQ 431
Query: 392 LRAG 395
+ G
Sbjct: 432 MEVG 435
>gnl|CDD|143438 cd07120, ALDH_PsfA-ACA09737, Pseudomonas putida aldehyde
dehydrogenase PsfA (ACA09737)-like. Included in this CD
is the aldehyde dehydrogenase (PsfA, locus ACA09737) of
Pseudomonas putida involved in furoic acid metabolism.
Transcription of psfA was induced in response to
2-furoic acid, furfuryl alcohol, and furfural.
Length = 455
Score = 305 bits (782), Expect = 1e-99
Identities = 142/350 (40%), Positives = 197/350 (56%), Gaps = 7/350 (2%)
Query: 52 LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQ 111
L L+NGK + ++ ++ LRYYAG A G+ I F+ EP+GV G
Sbjct: 65 LLALENGKILGEA-RFEISGAISELRYYAGLARTEAGRMIEPEPGSFSLVLREPMGVAGI 123
Query: 112 IIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEA-GIPEGVVNIVPG 170
I+PWN P+++L +APALA G T+V+KPA QT A+ ++AE +P GVVN+
Sbjct: 124 IVPWNSPVVLLVRSLAPALAAGCTVVVKPAGQTAQINAAIIRILAEIPSLPAGVVNLFTE 183
Query: 171 YG-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADL 229
G + A LV P VD ++FTGST G+ + + A LKR LELGGK+P I+F DADL
Sbjct: 184 SGSEGAAHLVASPDVDVISFTGSTATGRAIMA-AAAPTLKRLGLELGGKTPCIVFDDADL 242
Query: 230 DAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGP 289
DAA+ L GQ C AGSR VQ +I DE R R VG D + GP
Sbjct: 243 DAALPKLERALTIFAGQFCMAGSRVLVQRSIADEVRDRLAARLAAVKVGPGLDPASDMGP 302
Query: 290 QIDKEQMDKILEFIESGKSQGAQLVAGGGRAGD---KGFYVQPTVFANVRDDMKIAREEI 346
ID+ +D++ +E + GA++V GG + KG +++PT+ D I +EEI
Sbjct: 303 LIDRANVDRVDRMVERAIAAGAEVVLRGGPVTEGLAKGAFLRPTLLEVDDPDADIVQEEI 362
Query: 347 FGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
FGPV L F E + AN++DYGLAA+V+T+DL + V + +RAGT
Sbjct: 363 FGPVLTLETFDDEAEAVALANDTDYGLAASVWTRDLARAMRVARAIRAGT 412
>gnl|CDD|143469 cd07151, ALDH_HBenzADH, NADP+-dependent p-hydroxybenzaldehyde
dehydrogenase-like. NADP+-dependent,
p-hydroxybenzaldehyde dehydrogenase (PchA, HBenzADH)
which catalyzes oxidation of p-hydroxybenzaldehyde to
p-hydroxybenzoic acid and other related sequences are
included in this CD.
Length = 465
Score = 303 bits (779), Expect = 5e-99
Identities = 144/337 (42%), Positives = 209/337 (62%), Gaps = 15/337 (4%)
Query: 66 SVDVEYSLRTLRYYAGFADKIHGKTIP--INGDFFAYTRHEPVGVCGQIIPWNFPLLMLA 123
+++ ++ R A F ++ G+ +P + G R EP+GV G I PWNFPL +
Sbjct: 90 NIEWGAAMAITREAATFPLRMEGRILPSDVPGKENRVYR-EPLGVVGVISPWNFPLHLSM 148
Query: 124 WKIAPALATGNTIVLKPAEQTPLTALAVGALIA----EAGIPEGVVNIVPGYG-DAGAAL 178
+APALA GN +VLKPA TP+T G L+A EAG+P+GV+N+V G G + G A
Sbjct: 149 RSVAPALALGNAVVLKPASDTPITG---GLLLAKIFEEAGLPKGVLNVVVGAGSEIGDAF 205
Query: 179 VQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAAVEGAHF 238
V+HP ++FTGST VG+ + + +G +LK+ LELGG +P ++ DAD+DAAV A F
Sbjct: 206 VEHPVPRLISFTGSTPVGRHIGELAGR-HLKKVALELGGNNPFVVLEDADIDAAVNAAVF 264
Query: 239 GLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDK 298
G F + GQ C A +R V + +YDEFV + ER K GDP D + GP I++ Q+D
Sbjct: 265 GKFLHQGQICMAINRIIVHEDVYDEFVEKFVERVKALPYGDPSDPDTVVGPLINESQVDG 324
Query: 299 ILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSS 358
+L+ IE +GA L+ GG G+ ++PTV ++V +DM+IAREEIFGPV +I+
Sbjct: 325 LLDKIEQAVEEGATLLVGGEAEGN---VLEPTVLSDVTNDMEIAREEIFGPVAPIIKADD 381
Query: 359 IEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
EE +E AN+++YGL+ AVFT DL++ + + AG
Sbjct: 382 EEEALELANDTEYGLSGAVFTSDLERGVQFARRIDAG 418
>gnl|CDD|143418 cd07100, ALDH_SSADH1_GabD1, Mycobacterium tuberculosis
succinate-semialdehyde dehydrogenase 1-like.
Succinate-semialdehyde dehydrogenase 1 (SSADH1, GabD1,
EC=1.2.1.16) catalyzes the NADP(+)-dependent oxidation
of succinate semialdehyde (SSA) to succinate. SSADH
activity in Mycobacterium tuberculosis (Mtb) is encoded
by both gabD1 (Rv0234c) and gabD2 (Rv1731). The Mtb
GabD1 SSADH1 reportedly is an enzyme of the
gamma-aminobutyrate shunt, which forms a functional link
between two TCA half-cycles by converting
alpha-ketoglutarate to succinate.
Length = 429
Score = 302 bits (775), Expect = 7e-99
Identities = 136/343 (39%), Positives = 183/343 (53%), Gaps = 4/343 (1%)
Query: 54 TLDNGKPYNMSYSVDVEYSLRTLRYYAGFADK-IHGKTIPINGDFFAYTRHEPVGVCGQI 112
TL+ GKP + + +VE RYYA A+ + + I + AY R+EP+GV I
Sbjct: 46 TLEMGKPIAEARA-EVEKCAWICRYYAENAEAFLADEPIETDAGK-AYVRYEPLGVVLGI 103
Query: 113 IPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYG 172
+PWNFP + AP L GNT++LK A P ALA+ L EAG PEGV +
Sbjct: 104 MPWNFPFWQVFRFAAPNLMAGNTVLLKHASNVPGCALAIEELFREAGFPEGVFQNLLIDS 163
Query: 173 DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAA 232
D A++ P V V TGS G+ V +G NLK++ LELGG P I+ DADLD A
Sbjct: 164 DQVEAIIADPRVRGVTLTGSERAGRAVAAEAGK-NLKKSVLELGGSDPFIVLDDADLDKA 222
Query: 233 VEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQID 292
V+ A G N GQ C A R V + +YDEF+ + E VGDP D + + GP
Sbjct: 223 VKTAVKGRLQNAGQSCIAAKRFIVHEDVYDEFLEKFVEAMAALKVGDPMDEDTDLGPLAR 282
Query: 293 KEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQQ 352
K+ D++ E +E + GA L+ GG R G + PTV +V M EE+FGPV
Sbjct: 283 KDLRDELHEQVEEAVAAGATLLLGGKRPDGPGAFYPPTVLTDVTPGMPAYDEELFGPVAA 342
Query: 353 LIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
+I+ EE I AN+S +GL +VFT DL++ V + L AG
Sbjct: 343 VIKVKDEEEAIALANDSPFGLGGSVFTTDLERAERVARRLEAG 385
>gnl|CDD|143470 cd07152, ALDH_BenzADH, NAD-dependent benzaldehyde dehydrogenase
II-like. NAD-dependent, benzaldehyde dehydrogenase II
(XylC, BenzADH, EC=1.2.1.28) is involved in the
oxidation of benzyl alcohol to benzoate. In
Acinetobacter calcoaceticus, this process is carried out
by the chromosomally encoded, benzyl alcohol
dehydrogenase (xylB) and benzaldehyde dehydrogenase II
(xylC) enzymes; whereas in Pseudomonas putida they are
encoded by TOL plasmids.
Length = 443
Score = 302 bits (776), Expect = 7e-99
Identities = 128/332 (38%), Positives = 193/332 (58%), Gaps = 5/332 (1%)
Query: 67 VDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQIIPWNFPLLMLAWKI 126
+V ++ L AG + G+ +P + R P+GV G I P+NFPL++ +
Sbjct: 72 FEVGAAIGELHEAAGLPTQPQGEILPSAPGRLSLARRVPLGVVGVISPFNFPLILAMRSV 131
Query: 127 APALATGNTIVLKPAEQTPLTALAVGALI-AEAGIPEGVVNIVPGYGDAGAALVQHPGVD 185
APALA GN +VLKP +TP++ V A + EAG+P GV++++PG DAG ALV+ P V
Sbjct: 132 APALALGNAVVLKPDPRTPVSGGVVIARLFEEAGLPAGVLHVLPGGADAGEALVEDPNVA 191
Query: 186 KVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAAVEGAHFGLFFNMG 245
++FTGST VG+ V + +G +LK+ +LELGGK+ I+ DADLD A +G F + G
Sbjct: 192 MISFTGSTAVGRKVGEAAGR-HLKKVSLELGGKNALIVLDDADLDLAASNGAWGAFLHQG 250
Query: 246 QCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIES 305
Q C A R V +++ D + A+ +AK VGDP V GP I+ Q+D++ ++
Sbjct: 251 QICMAAGRHLVHESVADAYTAKLAAKAKHLPVGDPATGQVALGPLINARQLDRVHAIVDD 310
Query: 306 GKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIER 365
+ GA+L AGG G + +PTV + V+ M EEIFGPV + F S EE +
Sbjct: 311 SVAAGARLEAGGTY---DGLFYRPTVLSGVKPGMPAFDEEIFGPVAPVTVFDSDEEAVAL 367
Query: 366 ANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
AN+++YGL+A + ++D+ + + LR G L
Sbjct: 368 ANDTEYGLSAGIISRDVGRAMALADRLRTGML 399
>gnl|CDD|143413 cd07094, ALDH_F21_LactADH-like, ALDH subfamily: NAD+-dependent,
lactaldehyde dehydrogenase, ALDH family 21 A1, and
related proteins. ALDH subfamily which includes Tortula
ruralis aldehyde dehydrogenase ALDH21A1 (RNP123), and
NAD+-dependent, lactaldehyde dehydrogenase (EC=1.2.1.22)
and like sequences.
Length = 453
Score = 293 bits (753), Expect = 2e-95
Identities = 140/351 (39%), Positives = 208/351 (59%), Gaps = 13/351 (3%)
Query: 54 TLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPI-----NGDFFAYTRHEPVGV 108
+ GKP + V+V+ ++ TLR A A++I G+ IP+ + + A+T EPVGV
Sbjct: 68 ACEGGKPIKDA-RVEVDRAIDTLRLAAEEAERIRGEEIPLDATQGSDNRLAWTIREPVGV 126
Query: 109 CGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIV 168
I P+NFPL ++A K+APA+ATG +VLKPA +TPL+AL + ++ EAG+PEGV+ +V
Sbjct: 127 VLAITPFNFPLNLVAHKLAPAIATGCPVVLKPASKTPLSALELAKILVEAGVPEGVLQVV 186
Query: 169 PGYG-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADA 227
G G A V ++FTGS VG+ ++ +G KR LELGG +P I+ DA
Sbjct: 187 TGEREVLGDAFAADERVAMLSFTGSAAVGEALRANAG---GKRIALELGGNAPVIVDRDA 243
Query: 228 DLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQ 287
DLDAA+E G F++ GQ C + R +V + +YDEF+ K+ VGDP D + +
Sbjct: 244 DLDAAIEALAKGGFYHAGQVCISVQRIYVHEELYDEFIEAFVAAVKKLKVGDPLDEDTDV 303
Query: 288 GPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIF 347
GP I +E +++ ++E GA+L+ GG R G +PTV +V D K++ EE F
Sbjct: 304 GPLISEEAAERVERWVEEAVEAGARLLCGGER---DGALFKPTVLEDVPRDTKLSTEETF 360
Query: 348 GPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTLI 398
GPV +IR+ EE I AN++DYGL A +FT+DL+ + L G ++
Sbjct: 361 GPVVPIIRYDDFEEAIRIANSTDYGLQAGIFTRDLNVAFKAAEKLEVGGVM 411
>gnl|CDD|179543 PRK03137, PRK03137, 1-pyrroline-5-carboxylate dehydrogenase;
Provisional.
Length = 514
Score = 291 bits (748), Expect = 8e-94
Identities = 138/358 (38%), Positives = 189/358 (52%), Gaps = 16/358 (4%)
Query: 50 SSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKI-HGKTI-PINGDFFAYTRHEPVG 107
S+ + GKP+ + D ++ L YYA K+ GK + G+ Y + P+G
Sbjct: 116 SAWLVKEAGKPWAEA-DADTAEAIDFLEYYARQMLKLADGKPVESRPGEHNRY-FYIPLG 173
Query: 108 VCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNI 167
V I PWNFP ++A A+ GNT++LKPA TP+ A ++ EAG+P GVVN
Sbjct: 174 VGVVISPWNFPFAIMAGMTLAAIVAGNTVLLKPASDTPVIAAKFVEVLEEAGLPAGVVNF 233
Query: 168 VPGYGDA-GAALVQHPGVDKVAFTGSTEVG-------KLVQQGSGASNLKRTTLELGGKS 219
VPG G G LV HP + FTGS EVG VQ G LKR E+GGK
Sbjct: 234 VPGSGSEVGDYLVDHPKTRFITFTGSREVGLRIYERAAKVQPGQ--IWLKRVIAEMGGKD 291
Query: 220 PNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGD 279
++ DADLD A E F GQ C A SR V + +YDE + + E K TVG+
Sbjct: 292 AIVVDEDADLDLAAESIVASAFGFSGQKCSACSRAIVHEDVYDEVLEKVVELTKELTVGN 351
Query: 280 PFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDM 339
P D N GP I++ DKI+ +IE GK +G +LV GG KG+++QPT+FA+V
Sbjct: 352 PED-NAYMGPVINQASFDKIMSYIEIGKEEG-RLVLGGEGDDSKGYFIQPTIFADVDPKA 409
Query: 340 KIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
+I +EEIFGPV I+ + +E ANN++YGL AV + + + + G L
Sbjct: 410 RIMQEEIFGPVVAFIKAKDFDHALEIANNTEYGLTGAVISNNREHLEKARREFHVGNL 467
>gnl|CDD|143423 cd07105, ALDH_SaliADH, Salicylaldehyde dehydrogenase, DoxF-like.
Salicylaldehyde dehydrogenase (DoxF, SaliADH,
EC=1.2.1.65) involved in the upper naphthalene catabolic
pathway of Pseudomonas strain C18 and other similar
sequences are present in this CD.
Length = 432
Score = 287 bits (736), Expect = 5e-93
Identities = 137/336 (40%), Positives = 188/336 (55%), Gaps = 16/336 (4%)
Query: 68 DVEYSLRTLRYYAGFADKIHGKTIP-INGDFFAYTRHEPVGVCGQIIPWNFPLLMLAWKI 126
+V+ + LR A +I G +IP A EPVGV I PWN P+++ I
Sbjct: 60 NVDLAAGMLREAASLITQIIGGSIPSDKPGTLAMVVKEPVGVVLGIAPWNAPVILGTRAI 119
Query: 127 APALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIV---PGYGDAGA---ALVQ 180
A LA GNT+VLK +E +P T +G + EAG+P+GV+N+V P DA AL+
Sbjct: 120 AYPLAAGNTVVLKASELSPRTHWLIGRVFHEAGLPKGVLNVVTHSPE--DAPEVVEALIA 177
Query: 181 HPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAAVEGAHFGL 240
HP V KV FTGST VG+++ + + A +LK LELGGK+P I+ DADLDAA A FG
Sbjct: 178 HPAVRKVNFTGSTRVGRIIAE-TAAKHLKPVLLELGGKAPAIVLEDADLDAAANAALFGA 236
Query: 241 FFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKIL 300
F N GQ C + R V ++I DEFV + A++ G V G + D++
Sbjct: 237 FLNSGQICMSTERIIVHESIADEFVEKLKAAAEKLFAGP-----VVLGSLVSAAAADRVK 291
Query: 301 EFIESGKSQGAQLVAGG-GRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSI 359
E ++ S+GA+LV GG G + PT+ NV DM I EE FGPV +IR
Sbjct: 292 ELVDDALSKGAKLVVGGLADESPSGTSMPPTILDNVTPDMDIYSEESFGPVVSIIRVKDE 351
Query: 360 EEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
EE + AN+S+YGL+AAVFT+DL + V + + +G
Sbjct: 352 EEAVRIANDSEYGLSAAVFTRDLARALAVAKRIESG 387
>gnl|CDD|236501 PRK09407, gabD2, succinic semialdehyde dehydrogenase; Reviewed.
Length = 524
Score = 288 bits (740), Expect = 2e-92
Identities = 129/330 (39%), Positives = 184/330 (55%), Gaps = 14/330 (4%)
Query: 75 TLRYYAGFADKI-----HGKTIPINGDFFAYTRHEPVGVCGQIIPWNFPLLMLAWKIAPA 129
T RYYA A K+ +P+ +P GV G I PWN+PL + PA
Sbjct: 121 TARYYARRAPKLLAPRRRAGALPVLTK--TTELRQPKGVVGVISPWNYPLTLAVSDAIPA 178
Query: 130 LATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYG-DAGAALVQHPGVDKVA 188
L GN +VLKP QTPLTALA L+ EAG+P + +V G G G ALV + D +
Sbjct: 179 LLAGNAVVLKPDSQTPLTALAAVELLYEAGLPRDLWQVVTGPGPVVGTALVDN--ADYLM 236
Query: 189 FTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQCC 248
FTGST G+++ + +G L +LELGGK+P I+ DADLD A GA F N GQ C
Sbjct: 237 FTGSTATGRVLAEQAG-RRLIGFSLELGGKNPMIVLDDADLDKAAAGAVRACFSNAGQLC 295
Query: 249 CAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKS 308
+ R +V ++IYDEFV + +G +D + + G I + Q++ + ++ +
Sbjct: 296 ISIERIYVHESIYDEFVRAFVAAVRAMRLGAGYDYSADMGSLISEAQLETVSAHVDDAVA 355
Query: 309 QGAQLVAGGGRAGDKG--FYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERA 366
+GA ++AGG D G FY +PTV V DM++AREE FGPV + + ++E +ERA
Sbjct: 356 KGATVLAGGKARPDLGPLFY-EPTVLTGVTPDMELAREETFGPVVSVYPVADVDEAVERA 414
Query: 367 NNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
N++ YGL A+V+T D + + +RAGT
Sbjct: 415 NDTPYGLNASVWTGDTARGRAIAARIRAGT 444
>gnl|CDD|143465 cd07147, ALDH_F21_RNP123, Aldehyde dehydrogenase family 21A1-like.
Aldehyde dehydrogenase ALDH21A1 (gene name RNP123) was
first described in the moss Tortula ruralis and is
believed to play an important role in the detoxification
of aldehydes generated in response to desiccation- and
salinity-stress, and ALDH21A1 expression represents a
unique stress tolerance mechanism. So far, of plants,
only the bryophyte sequence has been observed, but
similar protein sequences from bacteria and archaea are
also present in this CD.
Length = 452
Score = 284 bits (730), Expect = 6e-92
Identities = 122/350 (34%), Positives = 192/350 (54%), Gaps = 12/350 (3%)
Query: 54 TLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGD-----FFAYTRHEPVGV 108
L+ GKP + +V ++ T R A A +I+G+ +P++ R P+G
Sbjct: 68 VLEAGKPIKDA-RGEVARAIDTFRIAAEEATRIYGEVLPLDISARGEGRQGLVRRFPIGP 126
Query: 109 CGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIV 168
I P+NFPL ++A K+APA+A G VLKPA +TPL+AL +G ++AE G+P+G +++
Sbjct: 127 VSAITPFNFPLNLVAHKVAPAIAAGCPFVLKPASRTPLSALILGEVLAETGLPKGAFSVL 186
Query: 169 PGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADAD 228
P D LV + ++FTGS VG ++ +G K+ LELGG + I+ +DAD
Sbjct: 187 PCSRDDADLLVTDERIKLLSFTGSPAVGWDLKARAGK---KKVVLELGGNAAVIVDSDAD 243
Query: 229 LDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQG 288
LD A + FG F+ GQ C + R V ++YDEF +R R K GDP D + G
Sbjct: 244 LDFAAQRIIFGAFYQAGQSCISVQRVLVHRSVYDEFKSRLVARVKALKTGDPKDDATDVG 303
Query: 289 PQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFG 348
P I + + +++ ++ GA+L+ GG R G ++PT+ +V DM++ EE+FG
Sbjct: 304 PMISESEAERVEGWVNEAVDAGAKLLTGGKR---DGALLEPTILEDVPPDMEVNCEEVFG 360
Query: 349 PVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTLI 398
PV + + +E + N+S +GL A VFT+DL+K L G ++
Sbjct: 361 PVVTVEPYDDFDEALAAVNDSKFGLQAGVFTRDLEKALRAWDELEVGGVV 410
>gnl|CDD|182233 PRK10090, PRK10090, aldehyde dehydrogenase A; Provisional.
Length = 409
Score = 280 bits (719), Expect = 9e-91
Identities = 131/359 (36%), Positives = 201/359 (55%), Gaps = 17/359 (4%)
Query: 58 GKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIP---INGDFFAYTRHEPVGVCGQIIP 114
GK ++ V+V ++ + Y A +A + G+ I + + R +GV I+P
Sbjct: 24 GKIQQLA-EVEVAFTADYIDYMAEWARRYEGEIIQSDRPGENILLFKR--ALGVTTGILP 80
Query: 115 WNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYGD- 173
WNFP ++A K+APAL TGNTIV+KP+E TP A+A ++ E G+P+GV N+V G G+
Sbjct: 81 WNFPFFLIARKMAPALLTGNTIVIKPSEFTPNNAIAFAKIVDEIGLPKGVFNLVLGRGET 140
Query: 174 AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAAV 233
G L +P V V+ TGS G+ + + A N+ + LELGGK+P I+ DADLD AV
Sbjct: 141 VGQELAGNPKVAMVSMTGSVSAGEKIMA-AAAKNITKVCLELGGKAPAIVMDDADLDLAV 199
Query: 234 EGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFD-LNVEQGPQID 292
+ N GQ C R +VQ IYD+FV R GE + G+P + ++ GP I+
Sbjct: 200 KAIVDSRVINSGQVCNCAERVYVQKGIYDQFVNRLGEAMQAVQFGNPAERNDIAMGPLIN 259
Query: 293 KEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQQ 352
++++ + + +GA++ GG KG+Y PT+ +VR +M I EE FGPV
Sbjct: 260 AAALERVEQKVARAVEEGARVALGGKAVEGKGYYYPPTLLLDVRQEMSIMHEETFGPVLP 319
Query: 353 LIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTLIRFSSIEEVIERAN 411
++ F ++EE I AN+SDYGL ++++T++L+ +GL+ G E I R N
Sbjct: 320 VVAFDTLEEAIAMANDSDYGLTSSIYTQNLNVAMKAIKGLKFG--------ETYINREN 370
>gnl|CDD|143420 cd07102, ALDH_EDX86601, Uncharacterized aldehyde dehydrogenase of
Synechococcus sp. PCC 7335 (EDX86601). Uncharacterized
aldehyde dehydrogenase of Synechococcus sp. PCC 7335
(locus EDX86601) and other similar sequences, are
present in this CD.
Length = 452
Score = 280 bits (719), Expect = 3e-90
Identities = 123/324 (37%), Positives = 172/324 (53%), Gaps = 5/324 (1%)
Query: 77 RYYAGFADK-IHGKTIPINGDFFAYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNT 135
RY A++ + +P F Y R EP+GV I PWN+P L + PAL GN
Sbjct: 87 RYMISIAEEALADIRVPEKDGFERYIRREPLGVVLIIAPWNYPYLTAVNAVIPALLAGNA 146
Query: 136 IVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEV 195
++LK + QTPL A AEAG+PEGV ++ + AAL+ P +D V+FTGS
Sbjct: 147 VILKHSPQTPLCGERFAAAFAEAGLPEGVFQVLHLSHETSAALIADPRIDHVSFTGSVAG 206
Query: 196 GKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTF 255
G+ +Q+ + A + LELGGK P + DADLDAA E G FFN GQ CC+ R +
Sbjct: 207 GRAIQR-AAAGRFIKVGLELGGKDPAYVRPDADLDAAAESLVDGAFFNSGQSCCSIERIY 265
Query: 256 VQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVA 315
V ++IYD FV K +GDP D + GP + D + I ++GA+ +
Sbjct: 266 VHESIYDAFVEAFVAVVKGYKLGDPLDPSTTLGPVVSARAADFVRAQIADAIAKGARALI 325
Query: 316 GGGR---AGDKGFYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYG 372
G G Y+ PTV NV M++ REE FGPV +++ S E I N+S+YG
Sbjct: 326 DGALFPEDKAGGAYLAPTVLTNVDHSMRVMREETFGPVVGIMKVKSDAEAIALMNDSEYG 385
Query: 373 LAAAVFTKDLDKTNYVTQGLRAGT 396
L A+V+TKD+ + + + L GT
Sbjct: 386 LTASVWTKDIARAEALGEQLETGT 409
>gnl|CDD|143419 cd07101, ALDH_SSADH2_GabD2, Mycobacterium tuberculosis
succinate-semialdehyde dehydrogenase 2-like.
Succinate-semialdehyde dehydrogenase 2 (SSADH2) and
similar proteins are in this CD. SSADH1 (GabD1,
EC=1.2.1.16) catalyzes the NADP(+)-dependent oxidation
of succinate semialdehyde to succinate. SSADH activity
in Mycobacterium tuberculosis is encoded by both gabD1
(Rv0234c) and gabD2 (Rv1731), however ,the Vmax of GabD1
was shown to be much higher than that of GabD2, and
GabD2 (SSADH2) is likely to serve physiologically as a
dehydrogenase for a different aldehyde(s).
Length = 454
Score = 279 bits (717), Expect = 6e-90
Identities = 131/332 (39%), Positives = 184/332 (55%), Gaps = 18/332 (5%)
Query: 75 TLRYYAGFADKI-----HGKTIPINGDFFAYTR--HEPVGVCGQIIPWNFPLLMLAWKIA 127
RYYA A+++ IP+ T P GV G I PWN+PL +
Sbjct: 85 VARYYARRAERLLKPRRRRGAIPV----LTRTTVNRRPKGVVGVISPWNYPLTLAVSDAI 140
Query: 128 PALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYG-DAGAALVQHPGVDK 186
PAL GN +VLKP QT LTAL L+ EAG+P + +V G G + G A+V + D
Sbjct: 141 PALLAGNAVVLKPDSQTALTALWAVELLIEAGLPRDLWQVVTGPGSEVGGAIVDN--ADY 198
Query: 187 VAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQ 246
V FTGST G++V + +G L +LELGGK+P I+ DADLD A GA F N GQ
Sbjct: 199 VMFTGSTATGRVVAERAG-RRLIGCSLELGGKNPMIVLEDADLDKAAAGAVRACFSNAGQ 257
Query: 247 CCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESG 306
C + R +V +++YDEFV R R + +G D + G I + Q+D++ ++
Sbjct: 258 LCVSIERIYVHESVYDEFVRRFVARTRALRLGAALDYGPDMGSLISQAQLDRVTAHVDDA 317
Query: 307 KSQGAQLVAGGGRAGDKG--FYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIE 364
++GA ++AGG D G FY +PTV V +DM++ EE FGPV + R + +E IE
Sbjct: 318 VAKGATVLAGGRARPDLGPYFY-EPTVLTGVTEDMELFAEETFGPVVSIYRVADDDEAIE 376
Query: 365 RANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
AN++DYGL A+V+T+D + + LRAGT
Sbjct: 377 LANDTDYGLNASVWTRDGARGRRIAARLRAGT 408
>gnl|CDD|143464 cd07146, ALDH_PhpJ, Streptomyces putative phosphonoformaldehyde
dehydrogenase PhpJ-like. Putative phosphonoformaldehyde
dehydrogenase (PhpJ), an aldehyde dehydrogenase homolog
reportedly involved in the biosynthesis of
phosphinothricin tripeptides in Streptomyces
viridochromogenes DSM 40736, and similar sequences are
included in this CD.
Length = 451
Score = 275 bits (706), Expect = 3e-88
Identities = 127/337 (37%), Positives = 186/337 (55%), Gaps = 12/337 (3%)
Query: 67 VDVEYSLRTLRYYAGFADKIHGKTIPIN-----GDFFAYTRHEPVGVCGQIIPWNFPLLM 121
+V + LR+ A A + G++ + +T EP+GV I P+N PL
Sbjct: 77 YEVGRAADVLRFAAAEALRDDGESFSCDLTANGKARKIFTLREPLGVVLAITPFNHPLNQ 136
Query: 122 LAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPG-YGDAGAALVQ 180
+A KIAPA+A N IVLKP+E+TPL+A+ + L+ EAG+P ++++V G G+ G L+
Sbjct: 137 VAHKIAPAIAANNRIVLKPSEKTPLSAIYLADLLYEAGLPPDMLSVVTGEPGEIGDELIT 196
Query: 181 HPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAAVEGAHFGL 240
HP VD V FTG VGK + + + KR LELGG P I+ DADL+ A A G
Sbjct: 197 HPDVDLVTFTGGVAVGKAI---AATAGYKRQLLELGGNDPLIVMDDADLERAATLAVAGS 253
Query: 241 FFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKIL 300
+ N GQ C A R V +++ DEFV E++ VGDP D + G ID+E +I
Sbjct: 254 YANSGQRCTAVKRILVHESVADEFVDLLVEKSAALVVGDPMDPATDMGTVIDEEAAIQIE 313
Query: 301 EFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIE 360
+E +QGA+++ G R +G PTV +V D ++ EE FGPV +IR ++
Sbjct: 314 NRVEEAIAQGARVLLGNQR---QGALYAPTVLDHVPPDAELVTEETFGPVAPVIRVKDLD 370
Query: 361 EVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
E I +N++ YGL++ V T DLD + + L GT+
Sbjct: 371 EAIAISNSTAYGLSSGVCTNDLDTIKRLVERLDVGTV 407
>gnl|CDD|143401 cd07082, ALDH_F11_NP-GAPDH, NADP+-dependent non-phosphorylating
glyceraldehyde 3-phosphate dehydrogenase and ALDH family
11. NADP+-dependent non-phosphorylating glyceraldehyde
3-phosphate dehydrogenase (NP-GAPDH, EC=1.2.1.9)
catalyzes the irreversible oxidation of glyceraldehyde
3-phosphate to 3-phosphoglycerate generating NADPH for
biosynthetic reactions. This CD also includes the
Arabidopsis thaliana osmotic-stress-inducible ALDH
family 11, ALDH11A3 and similar sequences. In
autotrophic eukaryotes, NP-GAPDH generates NADPH for
biosynthetic processes from photosynthetic
glyceraldehyde-3-phosphate exported from the chloroplast
and catalyzes one of the classic glycolytic bypass
reactions unique to plants.
Length = 473
Score = 274 bits (702), Expect = 2e-87
Identities = 138/363 (38%), Positives = 196/363 (53%), Gaps = 22/363 (6%)
Query: 43 NEFVNAQSSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFF---- 98
E N L + GK + +V+ ++ +R ++ G ++P GD+F
Sbjct: 79 EEVAN----LLMWEIGKTLKDALK-EVDRTIDYIRDTIEELKRLDGDSLP--GDWFPGTK 131
Query: 99 ---AYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALI 155
A R EP+GV I P+N+PL + K+ PAL GNT+V KPA Q L + +
Sbjct: 132 GKIAQVRREPLGVVLAIGPFNYPLNLTVSKLIPALIMGNTVVFKPATQGVLLGIPLAEAF 191
Query: 156 AEAGIPEGVVNIVPGYG-DAGAALVQHPGVDKVAFTGSTEVG-KLVQQGSGASNLKRTTL 213
+AG P+GVVN+V G G + G LV H +D ++FTGSTEVG +L +Q +KR L
Sbjct: 192 HDAGFPKGVVNVVTGRGREIGDPLVTHGRIDVISFTGSTEVGNRLKKQHP----MKRLVL 247
Query: 214 ELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAK 273
ELGGK P I+ DADL+ A + G GQ C A R V +++ DE V E
Sbjct: 248 ELGGKDPAIVLPDADLELAAKEIVKGALSYSGQRCTAIKRVLVHESVADELVELLKEEVA 307
Query: 274 RRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFA 333
+ VG P+D V+ P ID + D + I+ ++GA ++ GGGR G + PT+
Sbjct: 308 KLKVGMPWDNGVDITPLIDPKSADFVEGLIDDAVAKGATVLNGGGREGGN--LIYPTLLD 365
Query: 334 NVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLR 393
V DM++A EE FGPV +IR + IEE IE AN S+YGL A++FTKD++K + L
Sbjct: 366 PVTPDMRLAWEEPFGPVLPIIRVNDIEEAIELANKSNYGLQASIFTKDINKARKLADALE 425
Query: 394 AGT 396
GT
Sbjct: 426 VGT 428
>gnl|CDD|143404 cd07085, ALDH_F6_MMSDH, Methylmalonate semialdehyde dehydrogenase
and ALDH family members 6A1 and 6B2. Methylmalonate
semialdehyde dehydrogenase (MMSDH, EC=1.2.1.27)
[acylating] from Bacillus subtilis is involved in valine
metabolism and catalyses the NAD+- and CoA-dependent
oxidation of methylmalonate semialdehyde into
propionyl-CoA. Mitochondrial human MMSDH ALDH6A1 and
Arabidopsis MMSDH ALDH6B2 are also present in this CD.
Length = 478
Score = 270 bits (693), Expect = 6e-86
Identities = 113/297 (38%), Positives = 163/297 (54%), Gaps = 13/297 (4%)
Query: 93 INGDFFAYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVG 152
+ Y+ +P+GV I P+NFP ++ W A+A GNT VLKP+E+ P A+ +
Sbjct: 124 VARGIDTYSYRQPLGVVAGITPFNFPAMIPLWMFPMAIACGNTFVLKPSERVPGAAMRLA 183
Query: 153 ALIAEAGIPEGVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTT 212
L+ EAG+P+GV+N+V G +A AL+ HP + V+F GST VG+ + + A+N KR
Sbjct: 184 ELLQEAGLPDGVLNVVHGGKEAVNALLDHPDIKAVSFVGSTPVGEYIYE-RAAANGKRVQ 242
Query: 213 LELGGKSPNIIFADADLDAAVE---GAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSG 269
G K+ ++ DADL+ GA FG GQ C A S DE++ +
Sbjct: 243 ALGGAKNHAVVMPDADLEQTANALVGAAFG---AAGQRCMALSVAVAVGDEADEWIPKLV 299
Query: 270 ERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGR-----AGDKG 324
ERAK+ VG D + GP I ++I IESG +GA+LV GR + G
Sbjct: 300 ERAKKLKVGAGDDPGADMGPVISPAAKERIEGLIESGVEEGAKLVL-DGRGVKVPGYENG 358
Query: 325 FYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKD 381
+V PT+ NV DMKI +EEIFGPV ++R +++E I N + YG AA+FT+
Sbjct: 359 NFVGPTILDNVTPDMKIYKEEIFGPVLSIVRVDTLDEAIAIINANPYGNGAAIFTRS 415
>gnl|CDD|132417 TIGR03374, ABALDH, 1-pyrroline dehydrogenase. Members of this
protein family are 1-pyrroline dehydrogenase (1.5.1.35),
also called gamma-aminobutyraldehyde dehydrogenase. This
enzyme can follow putrescine transaminase (EC 2.6.1.82)
for a two-step conversion of putrescine to
gamma-aminobutyric acid (GABA). The member from
Escherichia coli is characterized as a homotetramer that
binds one NADH per momomer. This enzyme belongs to the
medium-chain aldehyde dehydrogenases, and is quite
similar in sequence to the betaine aldehyde
dehydrogenase (EC 1.2.1.8) family.
Length = 472
Score = 267 bits (684), Expect = 6e-85
Identities = 140/376 (37%), Positives = 212/376 (56%), Gaps = 15/376 (3%)
Query: 30 TKPKIEHTQLFINNEFV--NAQ--SSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADK 85
T PK L + + NAQ + LE+ + GKP + ++ ++ + R++AG A
Sbjct: 57 TTPKARAECLLKLADVIEENAQVFAELESRNCGKPLHSVFNDEIPAIVDVFRFFAGAARC 116
Query: 86 IHGKTIPINGDFF----AYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPA 141
+ G G++ + R +P+GV I PWN+PL+M AWK+APALA GN +VLKP+
Sbjct: 117 LSGLAA---GEYLEGHTSMIRRDPLGVVASIAPWNYPLMMAAWKLAPALAAGNCVVLKPS 173
Query: 142 EQTPLTALAVGALIAEAGIPEGVVNIVPGYG-DAGAALVQHPGVDKVAFTGSTEVGKLVQ 200
E TPLTAL + L + P GVVNI+ G G G L H V V+ TGS G+ +
Sbjct: 174 EITPLTALKLAELAKDI-FPAGVVNILFGRGKTVGDPLTGHEKVRMVSLTGSIATGEHIL 232
Query: 201 QGSGASNLKRTTLELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTI 260
+ A ++KRT +ELGGK+P I+F DAD+DA VEG F+N GQ C A R + Q I
Sbjct: 233 SHT-APSIKRTHMELGGKAPVIVFDDADIDAVVEGVRTFGFYNAGQDCTAACRIYAQRGI 291
Query: 261 YDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQG-AQLVAGGGR 319
YD V + G G P D + E GP ++++++ +E K+ G +++ GG +
Sbjct: 292 YDTLVEKLGAAVATLKSGAPDDESTELGPLSSLAHLERVMKAVEEAKALGHIKVITGGEK 351
Query: 320 AGDKGFYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFT 379
G+Y PT+ A + D I ++E+FGPV + F E+V+ AN+S YGLA++V+T
Sbjct: 352 RKGNGYYFAPTLLAGAKQDDAIVQKEVFGPVVSITSFDDEEQVVNWANDSQYGLASSVWT 411
Query: 380 KDLDKTNYVTQGLRAG 395
KD+ + + ++ L+ G
Sbjct: 412 KDVGRAHRLSARLQYG 427
>gnl|CDD|183050 PRK11241, gabD, succinate-semialdehyde dehydrogenase I;
Provisional.
Length = 482
Score = 267 bits (683), Expect = 1e-84
Identities = 142/347 (40%), Positives = 205/347 (59%), Gaps = 6/347 (1%)
Query: 52 LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPIN-GDFFAYTRHEPVGVCG 110
L TL+ GKP + ++ Y+ + ++A +I+G TIP + D +P+GV
Sbjct: 93 LMTLEQGKPLAEAKG-EISYAASFIEWFAEEGKRIYGDTIPGHQADKRLIVIKQPIGVTA 151
Query: 111 QIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPG 170
I PWNFP M+ K PALA G T+VLKPA QTP +ALA+ L AGIP GV N+V G
Sbjct: 152 AITPWNFPAAMITRKAGPALAAGCTMVLKPASQTPFSALALAELAIRAGIPAGVFNVVTG 211
Query: 171 Y-GDAGAALVQHPGVDKVAFTGSTEVGK-LVQQGSGASNLKRTTLELGGKSPNIIFADAD 228
G G L +P V K++FTGSTE+G+ L++Q A ++K+ +LELGG +P I+F DAD
Sbjct: 212 SAGAVGGELTSNPLVRKLSFTGSTEIGRQLMEQC--AKDIKKVSLELGGNAPFIVFDDAD 269
Query: 229 LDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQG 288
LD AVEGA F N GQ C +R +VQD +YD F + + + +GD + V G
Sbjct: 270 LDKAVEGALASKFRNAGQTCVCANRLYVQDGVYDRFAEKLQQAVSKLHIGDGLEKGVTIG 329
Query: 289 PQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFG 348
P ID++ + K+ E I +GA++V GG G + QPT+ +V + K+A+EE FG
Sbjct: 330 PLIDEKAVAKVEEHIADALEKGARVVCGGKAHELGGNFFQPTILVDVPANAKVAKEETFG 389
Query: 349 PVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
P+ L RF +VI +AN++++GLAA + +DL + V + L G
Sbjct: 390 PLAPLFRFKDEADVIAQANDTEFGLAAYFYARDLSRVFRVGEALEYG 436
>gnl|CDD|143416 cd07098, ALDH_F15-22, Aldehyde dehydrogenase family 15A1 and
22A1-like. Aldehyde dehydrogenase family members
ALDH15A1 (Saccharomyces cerevisiae YHR039C) and ALDH22A1
(Arabidopsis thaliana, EC=1.2.1.3), and similar
sequences, are in this CD. Significant improvement of
stress tolerance in tobacco plants was observed by
overexpressing the ALDH22A1 gene from maize (Zea mays)
and was accompanied by a reduction of malondialdehyde
derived from cellular lipid peroxidation.
Length = 465
Score = 265 bits (679), Expect = 4e-84
Identities = 111/305 (36%), Positives = 160/305 (52%), Gaps = 9/305 (2%)
Query: 99 AYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEA 158
A +EP+GV G I+ WN+P L I AL GN IV+K +EQ ++ ++I E
Sbjct: 114 ARVEYEPLGVVGAIVSWNYPFHNLLGPIIAALFAGNAIVVKVSEQVAWSSGFFLSIIREC 173
Query: 159 ----GIPEGVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLE 214
G +V +V + AL HP +D + F GS VGK V + A +L LE
Sbjct: 174 LAACGHDPDLVQLVTCLPETAEALTSHPVIDHITFIGSPPVGKKVMA-AAAESLTPVVLE 232
Query: 215 LGGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKR 274
LGGK P I+ DADLD G F + GQ C R V + IYD+ + +R +
Sbjct: 233 LGGKDPAIVLDDADLDQIASIIMRGTFQSSGQNCIGIERVIVHEKIYDKLLEILTDRVQA 292
Query: 275 RTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGD----KGFYVQPT 330
G P D +V+ G I + D++ E + +GA+L+AGG R +G Y PT
Sbjct: 293 LRQGPPLDGDVDVGAMISPARFDRLEELVADAVEKGARLLAGGKRYPHPEYPQGHYFPPT 352
Query: 331 VFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQ 390
+ +V DMKIA+EE+FGPV +++ S EE +E AN+++YGL A+VF KD+ + +
Sbjct: 353 LLVDVTPDMKIAQEEVFGPVMVVMKASDDEEAVEIANSTEYGLGASVFGKDIKRARRIAS 412
Query: 391 GLRAG 395
L G
Sbjct: 413 QLETG 417
>gnl|CDD|200087 TIGR01237, D1pyr5carbox2, delta-1-pyrroline-5-carboxylate
dehydrogenase, group 2, putative. This enzyme is the
second of two in the degradation of proline to
glutamate. This model represents one of several related
branches of delta-1-pyrroline-5-carboxylate
dehydrogenase. Members of this branch may be associated
with proline dehydrogenase (the other enzyme of the
pathway from proline to glutamate) but have not been
demonstrated experimentally. The branches are not as
closely related to each other as some distinct aldehyde
dehydrogenases are to some; separate models were built
to let each model describe a set of equivalogs [Energy
metabolism, Amino acids and amines].
Length = 511
Score = 257 bits (659), Expect = 1e-80
Identities = 137/358 (38%), Positives = 193/358 (53%), Gaps = 15/358 (4%)
Query: 50 SSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKI--HGKTIPINGDFFAYTRHEPVG 107
S+L + GKP+N + +V ++ + YYA ++ G+ Y + P G
Sbjct: 112 SALLVKEVGKPWNEA-DAEVAEAIDFMEYYARQMIELAKGKPVNSREGETNQYV-YTPTG 169
Query: 108 VCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNI 167
V I PWNFP ++ + TGN +VLKPAE P+ A ++ EAG+P+GVV
Sbjct: 170 VTVVISPWNFPFAIMVGMTVAPIVTGNCVVLKPAEAAPVIAAKFVEILEEAGLPKGVVQF 229
Query: 168 VPGYG-DAGAALVQHPGVDKVAFTGSTEVG-------KLVQQGSGASNLKRTTLELGGKS 219
VPG G + G LV HP + FTGS EVG VQ G +LKR E+GGK
Sbjct: 230 VPGSGSEVGDYLVDHPKTSLITFTGSREVGTRIFERAAKVQPGQ--KHLKRVIAEMGGKD 287
Query: 220 PNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGD 279
I+ DAD++ A + A F GQ C AGSR V + +YDE V R E + VG
Sbjct: 288 TVIVDEDADIELAAQSAFTSAFGFAGQKCSAGSRAVVHEKVYDEVVERFVEITESLKVGP 347
Query: 280 PFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDM 339
P +V GP ID++ +KI+E+IE GK++G +LV+GG KG+++ PT+FA+V
Sbjct: 348 PDSADVYVGPVIDQKSFNKIMEYIEIGKAEG-RLVSGGCGDDSKGYFIGPTIFADVDRKA 406
Query: 340 KIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
++A+EEIFGPV IR S +E +E ANN++YGL V + + D N G L
Sbjct: 407 RLAQEEIFGPVVAFIRASDFDEALEIANNTEYGLTGGVISNNRDHINRAKAEFEVGNL 464
>gnl|CDD|212007 TIGR04284, aldehy_Rv0768, aldehyde dehydrogenase, Rv0768 family.
This family describes a branch of the aldehyde
dehydrogenase (NAD) family (see pfam00171) that includes
Rv0768 from Mycobacterium tuberculosis. All members of
this family belong to species predicted to synthesize
mycofactocin, suggesting that this enzyme or another
upstream or downstream in the same pathway might be
mycofactocin-dependent. However, the taxonomic range of
this family is not nearly broad enough to make that
relationship conclusive [Unknown function, Enzymes of
unknown specificity].
Length = 480
Score = 245 bits (626), Expect = 4e-76
Identities = 128/300 (42%), Positives = 168/300 (56%), Gaps = 5/300 (1%)
Query: 102 RHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAE-AGI 160
R E VGV G I PWNFP + K+ PALA GNT+VLKPA TP A +G LIAE
Sbjct: 138 RREAVGVVGAITPWNFPHQINLAKLGPALAAGNTVVLKPAPDTPWCAAVLGELIAEHTDF 197
Query: 161 PEGVVNIVPGYGDA-GAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKS 219
P GVVNIV GA L + P VD V+FTGST G+ V A+ LK+ LELGGKS
Sbjct: 198 PPGVVNIVTSSDHRLGALLAKDPRVDMVSFTGSTATGRAVM-ADAAATLKKVFLELGGKS 256
Query: 220 PNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGD 279
I+ DADL AA A F + + GQ C +R V YDE VA + GD
Sbjct: 257 AFIVLDDADLAAACSMAAFTVCMHAGQGCAITTRLVVPRARYDEAVAAAAATMGSIKPGD 316
Query: 280 PFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDK--GFYVQPTVFANVRD 337
P D GP I Q D++ +++ ++G + GGGR D+ GF+V+PTV A + +
Sbjct: 317 PADPGTVCGPVISARQRDRVQSYLDLAVAEGGRFACGGGRPADRDRGFFVEPTVIAGLDN 376
Query: 338 DMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
+ ++AREEIFGPV +I ++ + AN+S YGL+ VF D ++ V +R GT+
Sbjct: 377 NARVAREEIFGPVLTVIAHDGDDDAVRIANDSPYGLSGTVFGADPERAAAVAARVRTGTV 436
>gnl|CDD|143448 cd07130, ALDH_F7_AASADH, NAD+-dependent alpha-aminoadipic
semialdehyde dehydrogenase, ALDH family members 7A1 and
7B. Alpha-aminoadipic semialdehyde dehydrogenase
(AASADH, EC=1.2.1.31), also known as ALDH7A1,
Antiquitin-1, ALDH7B, or
delta-1-piperideine-6-carboxylate dehydrogenase (P6CDH),
is a NAD+-dependent ALDH. Human ALDH7A1 is involved in
the pipecolic acid pathway of lysine catabolism,
catalyzing the oxidation of alpha-aminoadipic
semialdehyde to alpha-aminoadipate. Arabidopsis
thaliana ALDH7B4 appears to be an
osmotic-stress-inducible ALDH gene encoding a
turgor-responsive or stress-inducible ALDH. The
Streptomyces clavuligerus P6CDH appears to be involved
in cephamycin biosynthesis, catalyzing the second stage
of the two-step conversion of lysine to
alpha-aminoadipic acid. The ALDH7A1 enzyme and others
in this group have been observed as tetramers, yet the
bacterial P6CDH enzyme has been reported as a monomer.
Length = 474
Score = 236 bits (604), Expect = 7e-73
Identities = 112/282 (39%), Positives = 155/282 (54%), Gaps = 6/282 (2%)
Query: 105 PVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEA----GI 160
P+GV G I +NFP+ + W A AL GN +V KP+ TPLTA+AV ++A G+
Sbjct: 132 PLGVVGVITAFNFPVAVWGWNAAIALVCGNVVVWKPSPTTPLTAIAVTKIVARVLEKNGL 191
Query: 161 PEGVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSP 220
P + ++V G D G ALV+ P V V+FTGST VG+ V Q + A+ R+ LELGG +
Sbjct: 192 PGAIASLVCGGADVGEALVKDPRVPLVSFTGSTAVGRQVGQ-AVAARFGRSLLELGGNNA 250
Query: 221 NIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDP 280
I+ DADLD AV F GQ C R V ++IYDE + R + K+ +GDP
Sbjct: 251 IIVMEDADLDLAVRAVLFAAVGTAGQRCTTTRRLIVHESIYDEVLERLKKAYKQVRIGDP 310
Query: 281 FDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMK 340
D GP K +D L IE KSQG ++ GG G YV+PT+ D
Sbjct: 311 LDDGTLVGPLHTKAAVDNYLAAIEEAKSQGGTVLFGGKVIDGPGNYVEPTI-VEGLSDAP 369
Query: 341 IAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDL 382
I +EE F P+ +++F ++EE I N GL++++FT DL
Sbjct: 370 IVKEETFAPILYVLKFDTLEEAIAWNNEVPQGLSSSIFTTDL 411
>gnl|CDD|132294 TIGR03250, PhnAcAld_DH, putative phosphonoacetaldehyde
dehydrogenase. This family of genes are members of the
pfam00171 NAD-dependent aldehyde dehydrogenase family.
These genes are observed in Ralstonia eutropha JMP134,
Sinorhizobium meliloti 1021, Burkholderia mallei ATCC
23344, Burkholderia thailandensis E264, Burkholderia
cenocepacia AU 1054, Burkholderia pseudomallei K96243
and 1710b, Burkholderia xenovorans LB400, Burkholderia
sp. 383 and Polaromonas sp. JS666 in close proximity to
the PhnW gene (TIGR02326) encoding 2-aminoethyl
phosphonate aminotransferase (which generates
phosphonoacetaldehyde) and PhnA (TIGR02335) encoding
phosphonoacetate hydrolase (not to be confused with the
alkylphosphonate utilization operon protein PhnA modeled
by TIGR00686). Additionally, transporters believed to be
specific for 2-aminoethyl phosphonate are often present.
PhnW is, in other organisms, coupled with PhnX
(TIGR01422) for the degradation of phosphonoacetaldehyde
(GenProp0238), but PhnX is apparently absent in each of
the organisms containing this aldehyde reductase. PhnA,
characterized in a strain of Pseudomonas fluorescens
that has not het been genome sequenced, is only rarely
found outside of the PhnW and aldehyde dehydrogenase
context. For instance in Rhodopseudomonas and Bordetella
bronchiseptica, where it is adjacent to transporters
presumably specific for the import of phosphonoacetate.
It seems reasonably certain then, that this enzyme
catalyzes the NAD-dependent oxidation of
phosphonoacetaldehyde to phosphonoacetate, bridging the
metabolic gap between PhnW and PhnA. We propose the name
phosphonoacetaldehyde dehydrogenase and the gene symbol
PhnY for this enzyme.
Length = 472
Score = 227 bits (579), Expect = 3e-69
Identities = 118/313 (37%), Positives = 171/313 (54%), Gaps = 14/313 (4%)
Query: 87 HGKTIPINGDFFAYTRHEPV-GVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTP 145
HGK + +T+ EP+ GV I P+N P+ +A KIAPA+AT N +V+KP+E+TP
Sbjct: 126 HGKARKV------FTQREPLLGVISAITPFNHPMNQVAHKIAPAIATNNRMVVKPSEKTP 179
Query: 146 LTALAVGALIAEAGIPEGVVNIVPGY-GDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSG 204
L+AL + ++ EAG+P ++ +V G + L+ +P VD V FTG +GK + +G
Sbjct: 180 LSALYLADILYEAGLPPQMLQVVTGDPREIADELITNPHVDLVTFTGGVAIGKYIAARAG 239
Query: 205 ASNLKRTTLELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEF 264
+R LELGG P I+ DADLD A + A G + N GQ C A R VQ+++ D F
Sbjct: 240 ---YRRQVLELGGNDPLIVMEDADLDRAADLAVKGSYKNSGQRCTAVKRMLVQESVADRF 296
Query: 265 VARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKG 324
E+ + GDP D +V+ G ID+ + +QGA+L+ G R G
Sbjct: 297 TELLVEKTRAWRYGDPMDPSVDMGTVIDEAAAILFEARVNEAIAQGARLLLGNVRDG--A 354
Query: 325 FYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDK 384
Y PTV V M + REE FGPV +IRF I++ I +N++ YGL++ V T LD
Sbjct: 355 LY-APTVLDRVDPSMTLVREETFGPVSPVIRFCDIDDAIRISNSTAYGLSSGVCTNRLDY 413
Query: 385 TNYVTQGLRAGTL 397
L+ GT+
Sbjct: 414 ITRFIAELQVGTV 426
>gnl|CDD|181826 PRK09406, gabD1, succinic semialdehyde dehydrogenase; Reviewed.
Length = 457
Score = 210 bits (537), Expect = 3e-63
Identities = 119/347 (34%), Positives = 171/347 (49%), Gaps = 5/347 (1%)
Query: 52 LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKI---HGKTIPINGDFFAYTRHEPVGV 108
L TL+ GK S + + RYYA A+ + G AY R++P+GV
Sbjct: 68 LMTLEMGKTLA-SAKAEALKCAKGFRYYAEHAEALLADEPADAAAVGASRAYVRYQPLGV 126
Query: 109 CGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIV 168
++PWNFPL + APAL GN +LK A P TAL + L AG P+G +
Sbjct: 127 VLAVMPWNFPLWQVVRFAAPALMAGNVGLLKHASNVPQTALYLADLFRRAGFPDGCFQTL 186
Query: 169 PGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADAD 228
A A+++ P V TGS G+ V +G +K+T LELGG P I+ AD
Sbjct: 187 LVGSGAVEAILRDPRVAAATLTGSEPAGRAVAAIAG-DEIKKTVLELGGSDPFIVMPSAD 245
Query: 229 LDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQG 288
LD A E A N GQ C A R V +YD F + R VGDP D + + G
Sbjct: 246 LDRAAETAVTARVQNNGQSCIAAKRFIVHADVYDAFAEKFVARMAALRVGDPTDPDTDVG 305
Query: 289 PQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFG 348
P ++ D++ + ++ + GA ++ GG R G++ PTV ++ DM++ EE+FG
Sbjct: 306 PLATEQGRDEVEKQVDDAVAAGATILCGGKRPDGPGWFYPPTVITDITPDMRLYTEEVFG 365
Query: 349 PVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
PV L R + I+E IE AN + +GL + +T+D + L AG
Sbjct: 366 PVASLYRVADIDEAIEIANATTFGLGSNAWTRDEAEQERFIDDLEAG 412
>gnl|CDD|143466 cd07148, ALDH_RL0313, Uncharacterized ALDH ( RL0313) with
similarity to Tortula ruralis aldehyde dehydrogenase
ALDH21A1. Uncharacterized aldehyde dehydrogenase (locus
RL0313) with sequence similarity to the moss Tortula
ruralis aldehyde dehydrogenase ALDH21A1 (RNP123)
believed to play an important role in the detoxification
of aldehydes generated in response to desiccation- and
salinity-stress, and similar sequences are included in
this CD.
Length = 455
Score = 209 bits (534), Expect = 8e-63
Identities = 111/296 (37%), Positives = 161/296 (54%), Gaps = 4/296 (1%)
Query: 99 AYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEA 158
A+T EP+GV I +N PL ++ ++APA+A G +++KPA TPL+ LA L+ EA
Sbjct: 118 AFTTREPIGVVVAISAFNHPLNLIVHQVAPAIAAGCPVIVKPALATPLSCLAFVDLLHEA 177
Query: 159 GIPEGVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGK 218
G+PEG VP LV P V +F GS VG +++ S + R LE GG
Sbjct: 178 GLPEGWCQAVPCENAVAEKLVTDPRVAFFSFIGSARVGWMLR--SKLAPGTRCALEHGGA 235
Query: 219 SPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVG 278
+P I+ ADLDA + G F++ GQ C + R FV I D+F R A++ VG
Sbjct: 236 APVIVDRSADLDAMIPPLVKGGFYHAGQVCVSVQRVFVPAEIADDFAQRLAAAAEKLVVG 295
Query: 279 DPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDD 338
DP D + E GP I ++D++ E++ + GA+L+ GG R D + PTV + D
Sbjct: 296 DPTDPDTEVGPLIRPREVDRVEEWVNEAVAAGARLLCGGKRLSDTTY--APTVLLDPPRD 353
Query: 339 MKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRA 394
K++ +EIFGPV + + ++E I +AN+ AAVFTKDLD + L A
Sbjct: 354 AKVSTQEIFGPVVCVYSYDDLDEAIAQANSLPVAFQAAVFTKDLDVALKAVRRLDA 409
>gnl|CDD|143452 cd07134, ALDH_AlkH-like, Pseudomonas putida Aldehyde dehydrogenase
AlkH-like. Aldehyde dehydrogenase AlkH (locus name
P12693, EC=1.2.1.3) of the alkBFGHJKL operon that allows
Pseudomonas putida to metabolize alkanes and the
aldehyde dehydrogenase AldX of Bacillus subtilis (locus
P46329, EC=1.2.1.3), and similar sequences, are present
in this CD.
Length = 433
Score = 207 bits (530), Expect = 2e-62
Identities = 109/302 (36%), Positives = 156/302 (51%), Gaps = 13/302 (4%)
Query: 99 AYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEA 158
+ R+EP GVC I PWN+P + + A+A GNT +LKP+E TP T+ + +I EA
Sbjct: 94 SKIRYEPKGVCLIISPWNYPFNLAFGPLVSAIAAGNTAILKPSELTPHTSAVIAKIIREA 153
Query: 159 GIPEGVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGK 218
E V + G + AL++ P D + FTGS VGK+V + A +L TLELGGK
Sbjct: 154 -FDEDEVAVFEGDAEVAQALLELP-FDHIFFTGSPAVGKIVMAAA-AKHLASVTLELGGK 210
Query: 219 SPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVG 278
SP I+ ADL A + +G F N GQ C A FV +++ D FV ++
Sbjct: 211 SPTIVDETADLKKAAKKIAWGKFLNAGQTCIAPDYVFVHESVKDAFVEHLKAEIEKFYGK 270
Query: 279 DPFDLNVEQGPQ----IDKEQMDKILEFIESGKSQGAQLVAGGG-RAGDKGFYVQPTVFA 333
D + P ++ D++ ++ ++GA++ GG A + Y+ PTV
Sbjct: 271 DA---ARKASPDLARIVNDRHFDRLKGLLDDAVAKGAKVEFGGQFDAAQR--YIAPTVLT 325
Query: 334 NVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLR 393
NV DMKI +EEIFGPV +I + ++EVIE N LA VF+KD N V
Sbjct: 326 NVTPDMKIMQEEIFGPVLPIITYEDLDEVIEYINAKPKPLALYVFSKDKANVNKVLARTS 385
Query: 394 AG 395
+G
Sbjct: 386 SG 387
>gnl|CDD|143406 cd07087, ALDH_F3-13-14_CALDH-like, ALDH subfamily: Coniferyl
aldehyde dehydrogenase, ALDH families 3, 13, and 14, and
other related proteins. ALDH subfamily which includes
NAD(P)+-dependent, aldehyde dehydrogenase, family 3
member A1 and B1 (ALDH3A1, ALDH3B1, EC=1.2.1.5) and
fatty aldehyde dehydrogenase, family 3 member A2
(ALDH3A2, EC=1.2.1.3), and also plant ALDH family
members ALDH3F1, ALDH3H1, and ALDH3I1, fungal ALDH14
(YMR110C) and the protozoan family 13 member (ALDH13),
as well as coniferyl aldehyde dehydrogenases (CALDH,
EC=1.2.1.68), and other similar sequences, such as the
Pseudomonas putida benzaldehyde dehydrogenase I that is
involved in the metabolism of mandelate.
Length = 426
Score = 203 bits (518), Expect = 1e-60
Identities = 102/305 (33%), Positives = 156/305 (51%), Gaps = 22/305 (7%)
Query: 99 AYTRHEPVGVCGQIIPWNFPLLMLAWKIAP---ALATGNTIVLKPAEQTPLTALAVGALI 155
AY EP+GV I PWN+PL + +AP A+A GNT+VLKP+E P T+ + LI
Sbjct: 94 AYVIPEPLGVVLIIGPWNYPLQLA---LAPLIGAIAAGNTVVLKPSELAPATSALLAKLI 150
Query: 156 AEAGIPEGVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLEL 215
+ V +V G + AL+ P D + FTGS VGK+V + + A +L TLEL
Sbjct: 151 PKY-FDPEAVAVVEGGVEVATALLAEP-FDHIFFTGSPAVGKIVMEAA-AKHLTPVTLEL 207
Query: 216 GGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRR 275
GGKSP I+ DA+L+ A +G F N GQ C A V ++I DE + E K+
Sbjct: 208 GGKSPCIVDKDANLEVAARRIAWGKFLNAGQTCIAPDYVLVHESIKDELI----EELKKA 263
Query: 276 TV---GDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVF 332
G+ + + G I++ D++ ++ GK V GG+ + Y+ PT+
Sbjct: 264 IKEFYGEDPKESPDYGRIINERHFDRLASLLDDGK------VVIGGQVDKEERYIAPTIL 317
Query: 333 ANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGL 392
+V D + +EEIFGP+ ++ + ++E IE N+ LA +F++D V
Sbjct: 318 DDVSPDSPLMQEEIFGPILPILTYDDLDEAIEFINSRPKPLALYLFSEDKAVQERVLAET 377
Query: 393 RAGTL 397
+G +
Sbjct: 378 SSGGV 382
>gnl|CDD|143414 cd07095, ALDH_SGSD_AstD, N-succinylglutamate 5-semialdehyde
dehydrogenase, AstD-like. N-succinylglutamate
5-semialdehyde dehydrogenase or succinylglutamic
semialdehyde dehydrogenase (SGSD, E. coli AstD,
EC=1.2.1.71) involved in L-arginine degradation via the
arginine succinyltransferase (AST) pathway and catalyzes
the NAD+-dependent reduction of succinylglutamate
semialdehyde into succinylglutamate.
Length = 431
Score = 198 bits (506), Expect = 7e-59
Identities = 102/305 (33%), Positives = 143/305 (46%), Gaps = 12/305 (3%)
Query: 99 AYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEA 158
A RH P GV P+NFP + I PAL GNT+V KP+E TP A + L EA
Sbjct: 91 AVLRHRPHGVMAVFGPFNFPGHLPNGHIVPALLAGNTVVFKPSELTPAVAELMVELWEEA 150
Query: 159 GIPEGVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGK 218
G+P GV+N+V G + G AL H G+D + FTGS G L+ + K LE+GG
Sbjct: 151 GLPPGVLNLVQGGRETGEALAAHEGIDGLLFTGSAATGLLLHRQFAGRPGKILALEMGGN 210
Query: 219 SPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFV-QDTIYDEFVARSGERAKRRTV 277
+P +++ AD+DAA F GQ C R V + D F+ R E AKR +
Sbjct: 211 NPLVVWDVADIDAAAYLIVQSAFLTAGQRCTCARRLIVPDGAVGDAFLERLVEAAKRLRI 270
Query: 278 GDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQP-----TVF 332
G P GP I + L + + G + + R ++ P T
Sbjct: 271 GAPDAEPPFMGPLIIAAAAARYLLAQQDLLALGGEPLLAMERLVAGTAFLSPGIIDVTDA 330
Query: 333 ANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGL 392
A+V D EEIFGP+ Q+ R+ +E I AN + +GL+A + + D +
Sbjct: 331 ADVPD------EEIFGPLLQVYRYDDFDEAIALANATRFGLSAGLLSDDEALFERFLARI 384
Query: 393 RAGTL 397
RAG +
Sbjct: 385 RAGIV 389
>gnl|CDD|143402 cd07083, ALDH_P5CDH, ALDH subfamily NAD+-dependent
delta(1)-pyrroline-5-carboxylate dehydrogenase-like.
ALDH subfamily of the NAD+-dependent,
delta(1)-pyrroline-5-carboxylate dehydrogenases (P5CDH,
EC=1.5.1.12). The proline catabolic enzymes, proline
dehydrogenase and P5CDH catalyze the two-step oxidation
of proline to glutamate. P5CDH catalyzes the oxidation
of glutamate semialdehyde, utilizing NAD+ as the
electron acceptor. In some bacteria, the two enzymes are
fused into the bifunctional flavoenzyme, proline
utilization A (PutA). These enzymes play important roles
in cellular redox control, superoxide generation, and
apoptosis. In certain prokaryotes such as Escherichia
coli, PutA is also a transcriptional repressor of the
proline utilization genes. Monofunctional enzyme
sequences such as those seen in the Bacillus RocA P5CDH
are also present in this subfamily as well as the human
ALDH4A1 P5CDH and the Drosophila Aldh17 P5CDH.
Length = 500
Score = 199 bits (507), Expect = 2e-58
Identities = 119/332 (35%), Positives = 172/332 (51%), Gaps = 13/332 (3%)
Query: 77 RYYAGFADKIHGK---TIPINGDFFAYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATG 133
RYYA A ++ +P G+ + + +G I PWNFP+ + I +A G
Sbjct: 124 RYYARAALRLRYPAVEVVPYPGEDNE-SFYVGLGAGVVISPWNFPVAIFTGMIVAPVAVG 182
Query: 134 NTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYGDA-GAALVQHPGVDKVAFTGS 192
NT++ KPAE + V + EAG P GVV +PG G+ GA L +H + + FTGS
Sbjct: 183 NTVIAKPAEDAVVVGYKVFEIFHEAGFPPGVVQFLPGVGEEVGAYLTEHERIRGINFTGS 242
Query: 193 TEVGKLV-----QQGSGASNLKRTTLELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQC 247
E GK + + G + KR +E GGK+ I+ AD + VEG F GQ
Sbjct: 243 LETGKKIYEAAARLAPGQTWFKRLYVETGGKNAIIVDETADFELVVEGVVVSAFGFQGQK 302
Query: 248 CCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGK 307
C A SR + Y+ + R +RA+R +VG P + + GP ID EQ K+L +IE GK
Sbjct: 303 CSAASRLILTQGAYEPVLERLLKRAERLSVGPPEENGTDLGPVIDAEQEAKVLSYIEHGK 362
Query: 308 SQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQQLIRF--SSIEEVIER 365
++G QLV GG R +G++V PTV V +IA+EEIFGPV +IR+ E +E
Sbjct: 363 NEG-QLVLGGKRLEGEGYFVAPTVVEEVPPKARIAQEEIFGPVLSVIRYKDDDFAEALEV 421
Query: 366 ANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
AN++ YGL V+++ + + G L
Sbjct: 422 ANSTPYGLTGGVYSRKREHLEEARREFHVGNL 453
>gnl|CDD|143443 cd07125, ALDH_PutA-P5CDH, Delta(1)-pyrroline-5-carboxylate
dehydrogenase, PutA. The proline catabolic enzymes of
the aldehyde dehydrogenase (ALDH) protein superfamily,
proline dehydrogenase and
Delta(1)-pyrroline-5-carboxylate dehydrogenase (P5CDH,
(EC=1.5.1.12 )), catalyze the two-step oxidation of
proline to glutamate; P5CDH catalyzes the oxidation of
glutamate semialdehyde, utilizing NAD+ as the electron
acceptor. In some bacteria, the two enzymes are fused
into the bifunctional flavoenzyme, proline utilization A
(PutA) These enzymes play important roles in cellular
redox control, superoxide generation, and apoptosis. In
certain prokaryotes such as Escherichia coli, PutA is
also a transcriptional repressor of the proline
utilization genes.
Length = 518
Score = 196 bits (501), Expect = 2e-57
Identities = 116/333 (34%), Positives = 166/333 (49%), Gaps = 23/333 (6%)
Query: 76 LRYYAGFADK------IHGKTIPINGDFFAYTRHEP--VGVCGQIIPWNFPLLMLAWKIA 127
RYYA A + + G T +NG V VC I PWNFPL + +IA
Sbjct: 137 CRYYAAQARELFSDPELPGPTGELNGLEL-----HGRGVFVC--ISPWNFPLAIFTGQIA 189
Query: 128 PALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYG-DAGAALVQHPGVDK 186
ALA GNT++ KPAEQTPL A L+ EAG+P V+ +VPG G + G ALV HP +D
Sbjct: 190 AALAAGNTVIAKPAEQTPLIAARAVELLHEAGVPRDVLQLVPGDGEEIGEALVAHPRIDG 249
Query: 187 VAFTGSTEVGKLVQQGSGASNLKRTTL--ELGGKSPNIIFADADLDAAVEGAHFGLFFNM 244
V FTGSTE KL+ + + L E GGK+ I+ + A + AV+ F +
Sbjct: 250 VIFTGSTETAKLINRALAERDGPILPLIAETGGKNAMIVDSTALPEQAVKDVVQSAFGSA 309
Query: 245 GQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIE 304
GQ C A ++Q+ I + F+ VGDP+DL+ + GP IDK + E
Sbjct: 310 GQRCSALRLLYLQEEIAERFIEMLKGAMASLKVGDPWDLSTDVGPLIDKPAGKLLRAHTE 369
Query: 305 SGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSS--IEEV 362
+ + A L+A G++V P + V E+FGP+ +IRF + ++E
Sbjct: 370 LMRGE-AWLIAPAPLDDGNGYFVAPGIIEIVGIF--DLTTEVFGPILHVIRFKAEDLDEA 426
Query: 363 IERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
IE N + YGL + ++D + Y + + AG
Sbjct: 427 IEDINATGYGLTLGIHSRDEREIEYWRERVEAG 459
>gnl|CDD|143451 cd07133, ALDH_CALDH_CalB, Coniferyl aldehyde dehydrogenase-like.
Coniferyl aldehyde dehydrogenase (CALDH, EC=1.2.1.68) of
Pseudomonas sp. strain HR199 (CalB) which catalyzes the
NAD+-dependent oxidation of coniferyl aldehyde to
ferulic acid, and similar sequences, are present in this
CD.
Length = 434
Score = 191 bits (487), Expect = 5e-56
Identities = 108/304 (35%), Positives = 160/304 (52%), Gaps = 17/304 (5%)
Query: 99 AYTRHEPVGVCGQIIPWNFPLLMLAWKIAP---ALATGNTIVLKPAEQTPLTALAVGALI 155
A ++P+GV G I+PWN+PL LA + P ALA GN +++KP+E TP T+ + L+
Sbjct: 95 AEVEYQPLGVVGIIVPWNYPL-YLA--LGPLIAALAAGNRVMIKPSEFTPRTSALLAELL 151
Query: 156 AEAGIPEGVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLEL 215
AE E V +V G D AA P D + FTGST VG+ V + + A NL TLEL
Sbjct: 152 AEY-FDEDEVAVVTGGADVAAAFSSLP-FDHLLFTGSTAVGRHVMR-AAAENLTPVTLEL 208
Query: 216 GGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRR 275
GGKSP II DADL A E FG N GQ C A V + +EFVA + +
Sbjct: 209 GGKSPAIIAPDADLAKAAERIAFGKLLNAGQTCVAPDYVLVPEDKLEEFVAAAKAAVAKM 268
Query: 276 --TVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLV--AGGGRAGDKGFYVQPTV 331
T+ D N + I++ ++ +E +++GA+++ G + PT+
Sbjct: 269 YPTLAD----NPDYTSIINERHYARLQGLLEDARAKGARVIELNPAGEDFAATRKLPPTL 324
Query: 332 FANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQG 391
NV DDM++ +EEIFGP+ ++ + S++E I+ N LA F +D + + V +
Sbjct: 325 VLNVTDDMRVMQEEIFGPILPILTYDSLDEAIDYINARPRPLALYYFGEDKAEQDRVLRR 384
Query: 392 LRAG 395
+G
Sbjct: 385 THSG 388
>gnl|CDD|130783 TIGR01722, MMSDH, methylmalonic acid semialdehyde dehydrogenase.
Involved in valine catabolism,
methylmalonate-semialdehyde dehydrogenase catalyzes the
irreversible NAD+- and CoA-dependent oxidative
decarboxylation of methylmalonate semialdehyde to
propionyl-CoA. Methylmalonate-semialdehyde dehydrogenase
has been characterized in both prokaryotes and
eukaryotes, functioning as a mammalian tetramer and a
bacterial homodimer. Although similar in monomeric
molecular mass and enzymatic activity, the N-terminal
sequence in P.aeruginosa does not correspond with the
N-terminal sequence predicted for rat liver. Sequence
homology to a variety of prokaryotic and eukaryotic
aldehyde dehydrogenases places MMSDH in the aldehyde
dehydrogenase (NAD+) superfamily (pfam00171), making
MMSDH's CoA requirement unique among known ALDHs.
Methylmalonate semialdehyde dehydrogenase is closely
related to betaine aldehyde dehydrogenase,
2-hydroxymuconic semialdehyde dehydrogenase, and class 1
and 2 aldehyde dehydrogenase. In Bacillus, a highly
homologous protein to methylmalonic acid semialdehyde
dehydrogenase, groups out from the main MMSDH clade with
Listeria and Sulfolobus. This Bacillus protein has been
suggested to be located in an iol operon and/or involved
in myo-inositol catabolism, converting malonic
semialdehyde to acetyl CoA ad CO2. The preceeding
enzymes responsible for valine catabolism are present in
Bacillus, Listeria, and Sulfolobus [Energy metabolism,
Amino acids and amines].
Length = 477
Score = 190 bits (485), Expect = 1e-55
Identities = 110/319 (34%), Positives = 168/319 (52%), Gaps = 7/319 (2%)
Query: 68 DVEYSLRTLRYYAGFADKIHGKTIP-INGDFFAYTRHEPVGVCGQIIPWNFPLLMLAWKI 126
DV L + + G + G+T + Y+ +P+GVC I P+NFP ++ W
Sbjct: 98 DVARGLEVVEHACGVNSLLKGETSTQVATRVDVYSIRQPLGVCAGITPFNFPAMIPLWMF 157
Query: 127 APALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYGDAGAALVQHPGVDK 186
A+A GNT VLKP+E+ P A+ + L +EAG P+GV+N+V G +A L++HP V
Sbjct: 158 PIAIACGNTFVLKPSEKVPSAAVKLAELFSEAGAPDGVLNVVHGDKEAVDRLLEHPDVKA 217
Query: 187 VAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQ 246
V+F GST +G+ + +G+++ KR G K+ ++ DAD DAA + + GQ
Sbjct: 218 VSFVGSTPIGRYIHT-TGSAHGKRVQALGGAKNHMVVMPDADKDAAADALVGAAYGAAGQ 276
Query: 247 CCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESG 306
C A S V DE+V ERA++ +G D E GP I + D++ I G
Sbjct: 277 RCMAIS-AAVLVGAADEWVPEIRERAEKIRIGPGDDPGAEMGPLITPQAKDRVASLIAGG 335
Query: 307 KSQGAQLVAGGGRAGDKGF----YVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIEEV 362
++GA+++ G G+ +V PT+ V MK +EEIFGPV ++ ++EE
Sbjct: 336 AAEGAEVLLDGRGYKVDGYEEGNWVGPTLLERVPPTMKAYQEEIFGPVLCVLEADTLEEA 395
Query: 363 IERANNSDYGLAAAVFTKD 381
I N S YG A+FT+D
Sbjct: 396 IALINASPYGNGTAIFTRD 414
>gnl|CDD|143454 cd07136, ALDH_YwdH-P39616, Bacillus subtilis aldehyde dehydrogenase
ywdH-like. Uncharacterized Bacillus subtilis ywdH
aldehyde dehydrogenase (locus P39616) most closely
related to the ALDHs and fatty ALDHs of families 3 and
14, and similar sequences, are included in this CD.
Length = 449
Score = 189 bits (482), Expect = 4e-55
Identities = 99/300 (33%), Positives = 158/300 (52%), Gaps = 16/300 (5%)
Query: 99 AYTRHEPVGVCGQIIPWNFPLLMLAWKIAP---ALATGNTIVLKPAEQTPLTALAVGALI 155
+Y +EP GV I PWN+P + +AP A+A GNT VLKP+E TP T+ + +I
Sbjct: 94 SYIYYEPYGVVLIIAPWNYPFQLA---LAPLIGAIAAGNTAVLKPSELTPNTSKVIAKII 150
Query: 156 AEAGIPEGVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLEL 215
E E V +V G + L+ D + FTGS VGK+V + + A +L TLEL
Sbjct: 151 EET-FDEEYVAVVEGGVEENQELLDQK-FDYIFFTGSVRVGKIVME-AAAKHLTPVTLEL 207
Query: 216 GGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRR 275
GGKSP I+ DA+L A + +G F N GQ C A V +++ ++F+ E K+
Sbjct: 208 GGKSPCIVDEDANLKLAAKRIVWGKFLNAGQTCVAPDYVLVHESVKEKFIKELKEEIKKF 267
Query: 276 TVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANV 335
DP + + + G I+++ D++ +++GK + GG + Y++PT+ NV
Sbjct: 268 YGEDPLE-SPDYGRIINEKHFDRLAGLLDNGK------IVFGGNTDRETLYIEPTILDNV 320
Query: 336 RDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
D + +EEIFGP+ ++ + +++E IE + LA +F++D V + L G
Sbjct: 321 TWDDPVMQEEIFGPILPVLTYDTLDEAIEIIKSRPKPLALYLFSEDKKVEKKVLENLSFG 380
>gnl|CDD|184426 PRK13968, PRK13968, putative succinate semialdehyde dehydrogenase;
Provisional.
Length = 462
Score = 183 bits (465), Expect = 1e-52
Identities = 99/283 (34%), Positives = 144/283 (50%), Gaps = 1/283 (0%)
Query: 99 AYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEA 158
A + P+G I+PWNFPL + P L GN +LK A A + + +A
Sbjct: 120 AVIEYRPLGTILAIMPWNFPLWQVMRGAVPILLAGNGYLLKHAPNVMGCAQLIAQVFKDA 179
Query: 159 GIPEGVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGK 218
GIP+GV + D + ++ + V TGS G + +GA+ LK+ LELGG
Sbjct: 180 GIPQGVYGWLNADNDGVSQMINDSRIAAVTVTGSVRAGAAIGAQAGAA-LKKCVLELGGS 238
Query: 219 SPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVG 278
P I+ DADL+ AV+ A G + N GQ C A R +++ I F R A +G
Sbjct: 239 DPFIVLNDADLELAVKAAVAGRYQNTGQVCAAAKRFIIEEGIASAFTERFVAAAAALKMG 298
Query: 279 DPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDD 338
DP D GP + D++ +E+ ++GA+L+ GG + G Y PTV ANV +
Sbjct: 299 DPRDEENALGPMARFDLRDELHHQVEATLAEGARLLLGGEKIAGAGNYYAPTVLANVTPE 358
Query: 339 MKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKD 381
M REE+FGPV + E +E AN+S++GL+A +FT D
Sbjct: 359 MTAFREELFGPVAAITVAKDAEHALELANDSEFGLSATIFTTD 401
>gnl|CDD|215110 PLN00412, PLN00412, NADP-dependent glyceraldehyde-3-phosphate
dehydrogenase; Provisional.
Length = 496
Score = 182 bits (463), Expect = 4e-52
Identities = 105/294 (35%), Positives = 157/294 (53%), Gaps = 9/294 (3%)
Query: 105 PVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGV 164
P+GV I P+N+P+ + KIAPAL GN +VLKP Q + AL + AG P+G+
Sbjct: 158 PLGVVLAIPPFNYPVNLAVSKIAPALIAGNAVVLKPPTQGAVAALHMVHCFHLAGFPKGL 217
Query: 165 VNIVPGYG-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNII 223
++ V G G + G L HPGV+ ++FTG + G + + +G L+ +ELGGK I+
Sbjct: 218 ISCVTGKGSEIGDFLTMHPGVNCISFTGG-DTGIAISKKAGMVPLQ---MELGGKDACIV 273
Query: 224 FADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDL 283
DADLD A G F GQ C A V +++ D V + + + TVG P D
Sbjct: 274 LEDADLDLAAANIIKGGFSYSGQRCTAVKVVLVMESVADALVEKVNAKVAKLTVGPPED- 332
Query: 284 NVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAR 343
+ + P + + + I + K +GA R G+ + P + NVR DM+IA
Sbjct: 333 DCDITPVVSESSANFIEGLVMDAKEKGATFCQEWKREGN---LIWPLLLDNVRPDMRIAW 389
Query: 344 EEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
EE FGPV +IR +S+EE I N S++GL VFT+D++K ++ + GT+
Sbjct: 390 EEPFGPVLPVIRINSVEEGIHHCNASNFGLQGCVFTRDINKAILISDAMETGTV 443
>gnl|CDD|143453 cd07135, ALDH_F14-YMR110C, Saccharomyces cerevisiae aldehyde
dehydrogenase family 14 and related proteins. Aldehyde
dehydrogenase family 14 (ALDH14), isolated mainly from
the mitochondrial outer membrane of Saccharomyces
cerevisiae (YMR110C) and most closely related to the
plant and animal ALDHs and fatty ALDHs family 3 members,
and similar fungal sequences, are present in this CD.
Length = 436
Score = 180 bits (459), Expect = 6e-52
Identities = 93/297 (31%), Positives = 153/297 (51%), Gaps = 8/297 (2%)
Query: 99 AYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEA 158
R EP+GV I PWN+P+L+ + A+A G T+VLKP+E TP TA + L+ +
Sbjct: 102 PRIRKEPLGVVLIIGPWNYPVLLALSPLVGAIAAGCTVVLKPSELTPHTAALLAELVPKY 161
Query: 159 GIPEGVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGK 218
+ +V G AL++ DK+ +TGS VG+++ + + A +L TLELGGK
Sbjct: 162 -LDPDAFQVVQGGVPETTALLEQ-KFDKIFYTGSGRVGRIIAE-AAAKHLTPVTLELGGK 218
Query: 219 SPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVG 278
SP I+ +ADL+ A + +G F N GQ C A V ++YDEFV + G
Sbjct: 219 SPVIVTKNADLELAAKRILWGKFGNAGQICVAPDYVLVDPSVYDEFVEELKKVLDEFYPG 278
Query: 279 DPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDD 338
+ ++ +++ +++ K ++V GG F + PT+ ++V D
Sbjct: 279 GANASP-DYTRIVNPRHFNRLKSLLDTTK---GKVVIGGEMDEATRF-IPPTIVSDVSWD 333
Query: 339 MKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
+ EE+FGPV +I+ ++E I+ N+ D LA +FT D + +++ R+G
Sbjct: 334 DSLMSEELFGPVLPIIKVDDLDEAIKVINSRDTPLALYIFTDDKSEIDHILTRTRSG 390
>gnl|CDD|181873 PRK09457, astD, succinylglutamic semialdehyde dehydrogenase;
Reviewed.
Length = 487
Score = 180 bits (459), Expect = 1e-51
Identities = 107/312 (34%), Positives = 149/312 (47%), Gaps = 29/312 (9%)
Query: 99 AYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEA 158
A RH P GV P+NFP + I PAL GNT+V KP+E TP A L +A
Sbjct: 128 AVLRHRPHGVVAVFGPYNFPGHLPNGHIVPALLAGNTVVFKPSELTPWVAELTVKLWQQA 187
Query: 159 GIPEGVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGK 218
G+P GV+N+V G + G AL HP +D + FTGS G L+ + K LE+GG
Sbjct: 188 GLPAGVLNLVQGGRETGKALAAHPDIDGLLFTGSANTGYLLHRQFAGQPEKILALEMGGN 247
Query: 219 SPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIY-DEFVARSGERAKRRTV 277
+P +I AD+DAAV F + GQ C R V D F+AR AKR TV
Sbjct: 248 NPLVIDEVADIDAAVHLIIQSAFISAGQRCTCARRLLVPQGAQGDAFLARLVAVAKRLTV 307
Query: 278 GDPFDLNVE-QGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRA--------GDKGFYVQ 328
G G I ++ ++ +Q AQL+A GG++ G +
Sbjct: 308 GRWDAEPQPFMGAVISEQAAQGLVA------AQ-AQLLALGGKSLLEMTQLQAGTGL-LT 359
Query: 329 P-----TVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLD 383
P T A + D EE FGP+ Q++R+ +E I ANN+ +GL+A + + D +
Sbjct: 360 PGIIDVTGVAELPD------EEYFGPLLQVVRYDDFDEAIRLANNTRFGLSAGLLSDDRE 413
Query: 384 KTNYVTQGLRAG 395
+ +RAG
Sbjct: 414 DYDQFLLEIRAG 425
>gnl|CDD|233325 TIGR01238, D1pyr5carbox3, delta-1-pyrroline-5-carboxylate
dehydrogenase (PutA C-terminal domain). This model
represents one of several related branches of
delta-1-pyrroline-5-carboxylate dehydrogenase. Members
of this branch are the C-terminal domain of the PutA
bifunctional proline dehydrogenase /
delta-1-pyrroline-5-carboxylate dehydrogenase [Energy
metabolism, Amino acids and amines].
Length = 500
Score = 177 bits (450), Expect = 3e-50
Identities = 111/348 (31%), Positives = 162/348 (46%), Gaps = 22/348 (6%)
Query: 58 GKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQIIPWNF 117
GK + + + +V ++ RYYA D E GV I PWNF
Sbjct: 125 GKTIHNAIA-EVREAVDFCRYYAKQVR-----------DVLGEFSVESRGVFVCISPWNF 172
Query: 118 PLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYG-DAGA 176
PL + +I+ ALA GNT++ KPAEQT L A L+ EAG P G + ++PG G D GA
Sbjct: 173 PLAIFTGQISAALAAGNTVIAKPAEQTSLIAYRAVELMQEAGFPAGTIQLLPGRGADVGA 232
Query: 177 ALVQHPGVDKVAFTGSTEVGKLVQQ--GSGASNLKRTTLELGGKSPNIIFADADLDAAVE 234
AL P + VAFTGSTEV +L+ Q E GG++ I+ + A + V
Sbjct: 233 ALTSDPRIAGVAFTGSTEVAQLINQTLAQREDAPVPLIAETGGQNAMIVDSTALPEQVVR 292
Query: 235 GAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKE 294
F + GQ C A VQ+ + D + + VG P L + GP ID E
Sbjct: 293 DVLRSAFDSAGQRCSALRVLCVQEDVADRVLTMIQGAMQELKVGVPHLLTTDVGPVIDAE 352
Query: 295 QMDKILEFIESGKSQG---AQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQ 351
+L IE AQL RA G +V PT+F DD+ EE+FGPV
Sbjct: 353 AKQNLLAHIEHMSQTQKKIAQLTLDDSRACQHGTFVAPTLFE--LDDIAELSEEVFGPVL 410
Query: 352 QLIRFSS--IEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
++R+ + +++++++ N + YGL V ++ ++ + R G
Sbjct: 411 HVVRYKARELDQIVDQINQTGYGLTMGVHSRIETTYRWIEKHARVGNC 458
>gnl|CDD|237018 PRK11905, PRK11905, bifunctional proline
dehydrogenase/pyrroline-5-carboxylate dehydrogenase;
Reviewed.
Length = 1208
Score = 182 bits (465), Expect = 3e-50
Identities = 111/333 (33%), Positives = 168/333 (50%), Gaps = 29/333 (8%)
Query: 76 LRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNT 135
LRYYA A H+P+G I PWNFPL + +IA AL GNT
Sbjct: 658 LRYYAAQAR-----------RLLNGPGHKPLGPVVCISPWNFPLAIFTGQIAAALVAGNT 706
Query: 136 IVLKPAEQTPLTAL-AVGALIAEAGIPEGVVNIVPGYGDA-GAALVQHPGVDKVAFTGST 193
++ KPAEQTPL A AV L+ EAG+P+ + ++PG G GAALV P + V FTGST
Sbjct: 707 VLAKPAEQTPLIAARAV-RLLHEAGVPKDALQLLPGDGRTVGAALVADPRIAGVMFTGST 765
Query: 194 EVGKLVQQGSGASNLKRTTL---ELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCA 250
EV +L+Q+ + A E GG++ I+ + A + V F + GQ C A
Sbjct: 766 EVARLIQR-TLAKRSGPPVPLIAETGGQNAMIVDSSALPEQVVADVIASAFDSAGQRCSA 824
Query: 251 GSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQG 310
+Q+ + D + +GDP+ L+ + GP ID E I IE+ ++ G
Sbjct: 825 LRVLCLQEDVADRVLTMLKGAMDELRIGDPWRLSTDVGPVIDAEAQANIEAHIEAMRAAG 884
Query: 311 AQL-VAGGGRAGDKGFYVQPTVF--ANVRDDMKIAREEIFGPVQQLIRF--SSIEEVIER 365
+ +KG +V PT+ ++ D++ E+FGPV ++RF ++ VI+
Sbjct: 885 RLVHQLPLPAETEKGTFVAPTLIEIDSIS-DLE---REVFGPVLHVVRFKADELDRVIDD 940
Query: 366 ANNSDYGLAAAVFTKDLDKT-NYVTQGLRAGTL 397
N + YGL + ++ +D+T +VT +RAG +
Sbjct: 941 INATGYGLTFGLHSR-IDETIAHVTSRIRAGNI 972
>gnl|CDD|226683 COG4230, COG4230, Delta 1-pyrroline-5-carboxylate dehydrogenase
[Energy production and conversion].
Length = 769
Score = 179 bits (457), Expect = 1e-49
Identities = 118/332 (35%), Positives = 170/332 (51%), Gaps = 27/332 (8%)
Query: 76 LRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNT 135
LRYYAG A D F H P+G I PWNFPL + +IA ALA GN+
Sbjct: 218 LRYYAGQAR-----------DTFGNLTHRPLGPVVCISPWNFPLAIFTGQIAAALAAGNS 266
Query: 136 IVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYGDA-GAALVQHPGVDKVAFTGSTE 194
++ KPAEQTPL A L+ EAG+P GV+ ++PG G+ GAAL V V FTGSTE
Sbjct: 267 VLAKPAEQTPLIAAQAVRLLHEAGVPPGVLQLLPGRGETVGAALTADARVAGVMFTGSTE 326
Query: 195 VGKLVQQGSGASNLKRTTL--ELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGS 252
V +L+Q+ + L E GG++ I+ + A + V F + GQ C A
Sbjct: 327 VARLIQRQLAKRQGRPIPLIAETGGQNAMIVDSSALAEQVVADVLASAFDSAGQRCSALR 386
Query: 253 RTFVQDTIYDEFVAR-SGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQG- 310
+Q+ + D + G A+ R VG+P L + GP ID E I + I++ +S+G
Sbjct: 387 VLCLQEDVADRILTMLKGAMAELR-VGNPDRLTTDVGPVIDAEAKANIEKHIQTMRSKGR 445
Query: 311 --AQLVAGGGRAGDKGFYVQPTVFA-NVRDDMKIAREEIFGPVQQLIRF--SSIEEVIER 365
Q A KG +V PT+ D+++ E+FGPV ++R+ ++EVI++
Sbjct: 446 LVHQAAAPNSLQ--KGTFVAPTLIELENLDELQ---REVFGPVLHVVRYKRDELDEVIDQ 500
Query: 366 ANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
N + YGL V T+ + +VT+ AG L
Sbjct: 501 INATGYGLTLGVHTRIDETIAHVTERAHAGNL 532
>gnl|CDD|132284 TIGR03240, arg_catab_astD, succinylglutamic semialdehyde
dehydrogenase. Members of this protein family are
succinylglutamic semialdehyde dehydrogenase (EC
1.2.1.71), the fourth enzyme in the arginine
succinyltransferase (AST) pathway for arginine
catabolism [Energy metabolism, Amino acids and amines].
Length = 484
Score = 173 bits (441), Expect = 4e-49
Identities = 106/309 (34%), Positives = 150/309 (48%), Gaps = 15/309 (4%)
Query: 95 GDFFAYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGAL 154
D A RH P GV P+NFP + I PAL GNT+V KP+E TP A L
Sbjct: 122 PDGRAVLRHRPHGVVAVFGPYNFPGHLPNGHIVPALLAGNTVVFKPSELTPWVAEETVKL 181
Query: 155 IAEAGIPEGVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLE 214
+AG+P GV+N+V G + G AL HP +D + FTGS+ G L+ + K LE
Sbjct: 182 WEKAGLPAGVLNLVQGARETGVALAAHPDIDGLLFTGSSNTGHLLHRQFAGRPEKILALE 241
Query: 215 LGGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIY-DEFVARSGERAK 273
+GG +P I+ AD+DAAV F + GQ C R V D D F+AR E A+
Sbjct: 242 MGGNNPLIVDEVADIDAAVHLIIQSAFISAGQRCTCARRLLVPDGAQGDAFLARLVEVAE 301
Query: 274 RRTVGD------PFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQ-LVAGGGRAGDKGFY 326
R TVG PF G I + ++L + G + L+A
Sbjct: 302 RLTVGAWDAEPQPF-----MGAVISLQAAQRLLAAQAKLLALGGKSLLAMRQLDPGAAL- 355
Query: 327 VQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTN 386
+ P + +V ++ EE FGP+ Q+IR+ +E I ANN+ +GL+A + + D + +
Sbjct: 356 LTPGII-DVTGVAELPDEEHFGPLLQVIRYDDFDEAIAIANNTRFGLSAGLLSDDRELYD 414
Query: 387 YVTQGLRAG 395
+RAG
Sbjct: 415 RFLLEIRAG 423
>gnl|CDD|237017 PRK11904, PRK11904, bifunctional proline
dehydrogenase/pyrroline-5-carboxylate dehydrogenase;
Reviewed.
Length = 1038
Score = 177 bits (452), Expect = 2e-48
Identities = 109/323 (33%), Positives = 152/323 (47%), Gaps = 44/323 (13%)
Query: 76 LRYYAGFADKIHGKTIPINGDFFAYTRHEPVG-------------VCGQIIPWNFPLLML 122
RYYA A ++ G + G P G VC I PWNFPL +
Sbjct: 653 CRYYAAQARRLFGAPEKLPG---------PTGESNELRLHGRGVFVC--ISPWNFPLAIF 701
Query: 123 AWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYGDA-GAALVQH 181
++A ALA GNT++ KPAEQTPL A L+ EAGIP+ V+ ++PG G GAAL
Sbjct: 702 LGQVAAALAAGNTVIAKPAEQTPLIAAEAVKLLHEAGIPKDVLQLLPGDGATVGAALTAD 761
Query: 182 PGVDKVAFTGSTEVGKLVQQGSGASNLKRTTL--ELGGKSPNIIFADAD------LDAAV 233
P + VAFTGSTE +++ + A + L E GG+ N + D+ +D V
Sbjct: 762 PRIAGVAFTGSTETARIINRTLAARDGPIVPLIAETGGQ--NAMIVDSTALPEQVVDDVV 819
Query: 234 EGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDK 293
A F + GQ C A FVQ+ I D + VGDP L+ + GP ID
Sbjct: 820 TSA----FRSAGQRCSALRVLFVQEDIADRVIEMLKGAMAELKVGDPRLLSTDVGPVIDA 875
Query: 294 EQMDKILEFIESGKSQGAQLVAGG-GRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQQ 352
E + IE K + L + G +V PT F D + E+FGP+
Sbjct: 876 EAKANLDAHIERMKREARLLAQLPLPAGTENGHFVAPTAFE--IDSISQLEREVFGPILH 933
Query: 353 LIRFSS--IEEVIERANNSDYGL 373
+IR+ + +++VI+ N + YGL
Sbjct: 934 VIRYKASDLDKVIDAINATGYGL 956
>gnl|CDD|240392 PTZ00381, PTZ00381, aldehyde dehydrogenase family protein;
Provisional.
Length = 493
Score = 167 bits (425), Expect = 1e-46
Identities = 97/283 (34%), Positives = 143/283 (50%), Gaps = 10/283 (3%)
Query: 100 YTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAG 159
Y EP+GV I WN+PL + +A A+A GNT+VLKP+E +P T+ + L+ +
Sbjct: 104 YIIPEPLGVVLVIGAWNYPLNLTLIPLAGAIAAGNTVVLKPSELSPHTSKLMAKLLTKY- 162
Query: 160 IPEGVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKS 219
+ V ++ G + L++ P D + FTGS VGKLV Q + A NL TLELGGKS
Sbjct: 163 LDPSYVRVIEGGVEVTTELLKEP-FDHIFFTGSPRVGKLVMQAA-AENLTPCTLELGGKS 220
Query: 220 PNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGD 279
P I+ +L A +G F N GQ C A V +I D+F+ E K D
Sbjct: 221 PVIVDKSCNLKVAARRIAWGKFLNAGQTCVAPDYVLVHRSIKDKFIEALKEAIKEFFGED 280
Query: 280 PFDLNVEQGPQI-DKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDD 338
P E +I ++ ++ E I+ + V GG + YV PT+ N D
Sbjct: 281 P--KKSEDYSRIVNEFHTKRLAELIKDHGGK----VVYGGEVDIENKYVAPTIIVNPDLD 334
Query: 339 MKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKD 381
+ +EEIFGP+ ++ + +I+EV+E N+ LA F +D
Sbjct: 335 SPLMQEEIFGPILPILTYENIDEVLEFINSRPKPLALYYFGED 377
>gnl|CDD|177949 PLN02315, PLN02315, aldehyde dehydrogenase family 7 member.
Length = 508
Score = 166 bits (422), Expect = 3e-46
Identities = 94/280 (33%), Positives = 146/280 (52%), Gaps = 6/280 (2%)
Query: 105 PVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEA----GI 160
P+G+ G I +NFP +L W AL GN +V K A TPL +A+ L+AE +
Sbjct: 154 PLGIVGVITAFNFPCAVLGWNACIALVCGNCVVWKGAPTTPLITIAMTKLVAEVLEKNNL 213
Query: 161 PEGVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSP 220
P + G + G A+ + + V+FTGS++VG +VQQ A K LEL G +
Sbjct: 214 PGAIFTSFCGGAEIGEAIAKDTRIPLVSFTGSSKVGLMVQQTVNARFGK-CLLELSGNNA 272
Query: 221 NIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDP 280
I+ DAD+ AV F GQ C R + ++IYD+ + + K+ +GDP
Sbjct: 273 IIVMDDADIQLAVRSVLFAAVGTAGQRCTTCRRLLLHESIYDDVLEQLLTVYKQVKIGDP 332
Query: 281 FDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMK 340
+ GP E + IE KSQG +++ GG +G +VQPT+ + D
Sbjct: 333 LEKGTLLGPLHTPESKKNFEKGIEIIKSQGGKILTGGSAIESEGNFVQPTI-VEISPDAD 391
Query: 341 IAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTK 380
+ +EE+FGPV +++F ++EE IE N+ GL++++FT+
Sbjct: 392 VVKEELFGPVLYVMKFKTLEEAIEINNSVPQGLSSSIFTR 431
>gnl|CDD|236989 PRK11809, putA, trifunctional transcriptional regulator/proline
dehydrogenase/pyrroline-5-carboxylate dehydrogenase;
Reviewed.
Length = 1318
Score = 168 bits (428), Expect = 2e-45
Identities = 118/338 (34%), Positives = 170/338 (50%), Gaps = 34/338 (10%)
Query: 76 LRYYAGFADKIHGKTIPINGDFFAYTRHEPVG--VCGQIIPWNFPLLMLAWKIAPALATG 133
LRYYAG D F H P+G VC I PWNFPL + ++A ALA G
Sbjct: 750 LRYYAGQVR-----------DDFDNDTHRPLGPVVC--ISPWNFPLAIFTGQVAAALAAG 796
Query: 134 NTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYGDA-GAALVQHPGVDKVAFTGS 192
N+++ KPAEQTPL A ++ EAG+P GVV ++PG G+ GAALV V V FTGS
Sbjct: 797 NSVLAKPAEQTPLIAAQAVRILLEAGVPAGVVQLLPGRGETVGAALVADARVRGVMFTGS 856
Query: 193 TEVGKLVQQGSGASNLK---RTT---LELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQ 246
TEV +L+Q+ + A L R E GG++ I+ + A + V F + GQ
Sbjct: 857 TEVARLLQR-NLAGRLDPQGRPIPLIAETGGQNAMIVDSSALTEQVVADVLASAFDSAGQ 915
Query: 247 CCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESG 306
C A +QD + D + +G+P L+ + GP ID E I I++
Sbjct: 916 RCSALRVLCLQDDVADRTLKMLRGAMAECRMGNPDRLSTDIGPVIDAEAKANIERHIQAM 975
Query: 307 KSQG---AQLVAGGGRAGDKGFYVQPTVFA-NVRDDMKIAREEIFGPVQQLIRF--SSIE 360
+++G Q G +V PT+ + D++K E+FGPV ++R+ + ++
Sbjct: 976 RAKGRPVFQAARENSEDWQSGTFVPPTLIELDSFDELK---REVFGPVLHVVRYNRNQLD 1032
Query: 361 EVIERANNSDYGLAAAVFTKDLDKT-NYVTQGLRAGTL 397
E+IE+ N S YGL V T+ +D+T VT G L
Sbjct: 1033 ELIEQINASGYGLTLGVHTR-IDETIAQVTGSAHVGNL 1069
>gnl|CDD|166060 PLN02419, PLN02419, methylmalonate-semialdehyde dehydrogenase
[acylating].
Length = 604
Score = 162 bits (411), Expect = 4e-44
Identities = 110/334 (32%), Positives = 172/334 (51%), Gaps = 14/334 (4%)
Query: 54 TLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIP-INGDFFAYTRHEPVGVCGQI 112
T + GK S+ D+ L + + G A G+ +P ++ Y+ EP+GVC I
Sbjct: 198 TTEQGKTLKDSHG-DIFRGLEVVEHACGMATLQMGEYLPNVSNGVDTYSIREPLGVCAGI 256
Query: 113 IPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYG 172
P+NFP ++ W A+ GNT +LKP+E+ P ++ + L EAG+P+GV+NIV G
Sbjct: 257 CPFNFPAMIPLWMFPVAVTCGNTFILKPSEKDPGASVILAELAMEAGLPDGVLNIVHGTN 316
Query: 173 DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAA 232
D A+ + V+F GS G + + A KR +G K+ ++ DA++DA
Sbjct: 317 DTVNAICDDEDIRAVSFVGSNTAGMHIYARAAAKG-KRIQSNMGAKNHGLVLPDANIDAT 375
Query: 233 VEGAHFGLFFNMGQCCCAGSR-TFVQD--TIYDEFVARSGERAKRRTVGDPFDLNVEQGP 289
+ F GQ C A S FV D + D+ V R+ +A + T G D ++ GP
Sbjct: 376 LNALLAAGFGAAGQRCMALSTVVFVGDAKSWEDKLVERA--KALKVTCGSEPDADL--GP 431
Query: 290 QIDKEQMDKILEFIESGKSQGAQLVAGGGR----AGDKGFYVQPTVFANVRDDMKIAREE 345
I K+ ++I I+SG GA+L+ G +KG ++ PT+ + V DM+ +EE
Sbjct: 432 VISKQAKERICRLIQSGVDDGAKLLLDGRDIVVPGYEKGNFIGPTILSGVTPDMECYKEE 491
Query: 346 IFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFT 379
IFGPV ++ +S +E I N + YG AA+FT
Sbjct: 492 IFGPVLVCMQANSFDEAISIINKNKYGNGAAIFT 525
>gnl|CDD|143450 cd07132, ALDH_F3AB, Aldehyde dehydrogenase family 3 members A1, A2,
and B1 and related proteins. NAD(P)+-dependent,
aldehyde dehydrogenase, family 3 members A1 and B1
(ALDH3A1, ALDH3B1, EC=1.2.1.5) and fatty aldehyde
dehydrogenase, family 3 member A2 (ALDH3A2, EC=1.2.1.3),
and similar sequences are included in this CD. Human
ALDH3A1 is a homodimer with a critical role in cellular
defense against oxidative stress; it catalyzes the
oxidation of various cellular membrane lipid-derived
aldehydes. Corneal crystalline ALDH3A1 protects the
cornea and underlying lens against UV-induced oxidative
stress. Human ALDH3A2, a microsomal homodimer, catalyzes
the oxidation of long-chain aliphatic aldehydes to fatty
acids. Human ALDH3B1 is highly expressed in the kidney
and liver and catalyzes the oxidation of various medium-
and long-chain saturated and unsaturated aliphatic
aldehydes.
Length = 443
Score = 156 bits (397), Expect = 4e-43
Identities = 95/312 (30%), Positives = 151/312 (48%), Gaps = 40/312 (12%)
Query: 99 AYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEA 158
Y EP+GV I WN+PL + + A+A GN +V+KP+E +P TA L+AE
Sbjct: 94 VYIYKEPLGVVLIIGAWNYPLQLTLVPLVGAIAAGNCVVIKPSEVSPATA----KLLAE- 148
Query: 159 GIPEGVVNIVPGYGDAGAALVQHPGV-----------DKVAFTGSTEVGKLVQQGSGASN 207
++P Y D V GV D + +TGST VGK+V Q + A +
Sbjct: 149 --------LIPKYLDKECYPVVLGGVEETTELLKQRFDYIFYTGSTSVGKIVMQAA-AKH 199
Query: 208 LKRTTLELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVAR 267
L TLELGGKSP + D+D A +G F N GQ C A + ++FV
Sbjct: 200 LTPVTLELGGKSPCYVDKSCDIDVAARRIAWGKFINAGQTCIAPDYVLCTPEVQEKFVE- 258
Query: 268 SGERAKRRTVGDPFDLNVEQGPQ----IDKEQMDKILEFIESGKSQGAQLVAGGGRAGDK 323
A ++T+ + + + ++ P I+ ++ + + GK VA GG+ +K
Sbjct: 259 ----ALKKTLKEFYGEDPKESPDYGRIINDRHFQRLKKLLSGGK------VAIGGQTDEK 308
Query: 324 GFYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLD 383
Y+ PTV +V+ + +EEIFGP+ ++ ++++E IE N+ + LA VF+ +
Sbjct: 309 ERYIAPTVLTDVKPSDPVMQEEIFGPILPIVTVNNLDEAIEFINSREKPLALYVFSNNKK 368
Query: 384 KTNYVTQGLRAG 395
N + +G
Sbjct: 369 VINKILSNTSSG 380
>gnl|CDD|233324 TIGR01236, D1pyr5carbox1, delta-1-pyrroline-5-carboxylate
dehydrogenase, group 1. This model represents one of
two related branches of delta-1-pyrroline-5-carboxylate
dehydrogenase. The two branches are not as closely
related to each other as some aldehyde dehydrogenases
are to this branch, and separate models are built for
this reason. The enzyme is the second of two in the
degradation of proline to glutamate [Energy metabolism,
Amino acids and amines].
Length = 532
Score = 158 bits (401), Expect = 5e-43
Identities = 106/332 (31%), Positives = 165/332 (49%), Gaps = 19/332 (5%)
Query: 77 RYYAGFADKIHGKTIPINGD-FFAYTRHEPV-GVCGQIIPWNFPLLMLAWKIAPALATGN 134
R+ A +A +++ + PI+ + T + P+ G I P+NF + APAL GN
Sbjct: 140 RFNAKYARELYEQQ-PISAPGEWNRTEYRPLEGFVYAISPFNFTAIAGNLAGAPAL-MGN 197
Query: 135 TIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYGDA-GAALVQHPGVDKVAFTGST 193
T++ KP++ L+ V ++ EAG+P GV+N VPG G ++ P + + FTGST
Sbjct: 198 TVIWKPSDTAMLSNYLVMRILEEAGLPPGVINFVPGDGPLVSDTVLADPDLAGIHFTGST 257
Query: 194 ----EVGKLVQQGSGA-SNLKRTTLELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQCC 248
+ K V Q N R E GGK +++ AD++ V G G F GQ C
Sbjct: 258 PTFKHLWKQVAQNLDRYHNFPRIVGETGGKDFHVVHPSADVEHVVSGTIRGAFEYQGQKC 317
Query: 249 CAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKS 308
A SR +V +++ F R R VGDP D G ID++ KI+++IE KS
Sbjct: 318 SACSRLYVPHSLWPRFKGRLLAELSRVKVGDPDDFGTFMGAVIDEKSFAKIVKYIEHAKS 377
Query: 309 QGA--QLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQQLI-----RFSSIEE 361
+ ++AGG G++V+PTV + + +EEIFGPV + ++ I +
Sbjct: 378 DPSALTILAGGKYDDSVGYFVEPTVVESKDPQEPLMKEEIFGPVLTVYVYPDDKYKEILD 437
Query: 362 VIERANNSDYGLAAAVFTKDLDKTNYVTQGLR 393
+++ + + YGL AVF KD D + LR
Sbjct: 438 LVD--STTSYGLTGAVFAKDRDAILEADKVLR 467
>gnl|CDD|143441 cd07123, ALDH_F4-17_P5CDH, Delta(1)-pyrroline-5-carboxylate
dehydrogenase, ALDH families 4 and 17.
Delta(1)-pyrroline-5-carboxylate dehydrogenase
(EC=1.5.1.12 ), families 4 and 17: a proline catabolic
enzyme of the aldehyde dehydrogenase (ALDH) protein
superfamily. Delta(1)-pyrroline-5-carboxylate
dehydrogenase (P5CDH), also known as ALDH4A1 in humans,
is a mitochondrial homodimer involved in proline
degradation and catalyzes the NAD + -dependent
conversion of P5C to glutamate. This is a necessary step
in the pathway interconnecting the urea and
tricarboxylic acid cycles. The preferred substrate is
glutamic gamma-semialdehyde, other substrates include
succinic, glutaric and adipic semialdehydes. Also
included in this CD is the Aldh17 Drosophila
melanogaster (Q9VUC0) P5CDH and similar sequences.
Length = 522
Score = 157 bits (399), Expect = 1e-42
Identities = 99/296 (33%), Positives = 149/296 (50%), Gaps = 13/296 (4%)
Query: 112 IIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGY 171
+ P+NF + APAL GN ++ KP++ L+ V ++ EAG+P GV+N VPG
Sbjct: 177 VSPFNFTAIGGNLAGAPAL-MGNVVLWKPSDTAVLSNYLVYKILEEAGLPPGVINFVPGD 235
Query: 172 GDA-GAALVQHPGVDKVAFTGSTEVGKLVQQ--GSGASNLK---RTTLELGGKSPNIIFA 225
G G ++ P + + FTGST K + + G + R E GGK+ +++
Sbjct: 236 GPVVGDTVLASPHLAGLHFTGSTPTFKSLWKQIGENLDRYRTYPRIVGETGGKNFHLVHP 295
Query: 226 DADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNV 285
AD+D+ V G F GQ C A SR +V ++++ E R E K +GDP D +
Sbjct: 296 SADVDSLVTATVRGAFEYQGQKCSAASRAYVPESLWPEVKERLLEELKEIKMGDPDDFSN 355
Query: 286 EQGPQIDKEQMDKILEFIESGKSQ-GAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIARE 344
G ID++ D+I +I+ KS A+++AGG G++V+PTV K+ E
Sbjct: 356 FMGAVIDEKAFDRIKGYIDHAKSDPEAEIIAGGKCDDSVGYFVEPTVIETTDPKHKLMTE 415
Query: 345 EIFGPVQQLIRF--SSIEEVIERANN-SDYGLAAAVFTKDLDKTNYVTQGLR--AG 395
EIFGPV + + S EE +E + S Y L A+F +D T LR AG
Sbjct: 416 EIFGPVLTVYVYPDSDFEETLELVDTTSPYALTGAIFAQDRKAIREATDALRNAAG 471
>gnl|CDD|143455 cd07137, ALDH_F3FHI, Plant aldehyde dehydrogenase family 3 members
F1, H1, and I1 and related proteins. Aldehyde
dehydrogenase family members 3F1, 3H1, and 3I1 (ALDH3F1,
ALDH3H1, and ALDH3I1), and similar plant sequences, are
in this CD. In Arabidopsis thaliana, stress-regulated
expression of ALDH3I1 was observed in leaves and
osmotic stress expression of ALDH3H1 was observed in
root tissue, whereas, ALDH3F1 expression was not stress
responsive. Functional analysis of ALDH3I1 suggest it
may be involved in a detoxification pathway in plants
that limits aldehyde accumulation and oxidative stress.
Length = 432
Score = 129 bits (326), Expect = 3e-33
Identities = 94/309 (30%), Positives = 143/309 (46%), Gaps = 15/309 (4%)
Query: 90 TIPINGDFFAYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTAL 149
T P + + EP+GV I WNFP L+ + A+A GN +VLKP+E P T+
Sbjct: 90 TFPAKAEIVS----EPLGVVLVISAWNFPFLLSLEPVIGAIAAGNAVVLKPSELAPATSA 145
Query: 150 AVGALIAEAGIPEGVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLK 209
+ LI E + + ++ G AL++ DK+ FTGS VG+++ + A +L
Sbjct: 146 LLAKLIPEYLDTK-AIKVIEGGVPETTALLEQKW-DKIFFTGSPRVGRIIM-AAAAKHLT 202
Query: 210 RTTLELGGKSPNIIFADADLDAAVE---GAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVA 266
TLELGGK P I+ + DL AV G +G N GQ C A V+++ +
Sbjct: 203 PVTLELGGKCPVIVDSTVDLKVAVRRIAGGKWGC--NNGQACIAPDYVLVEESFAPTLID 260
Query: 267 RSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFY 326
++ +P + +I + L + S ++V GG R Y
Sbjct: 261 ALKNTLEKFFGENP--KESKDLSRIVNSHHFQRLSRLLDDPSVADKIVHGGERDEKN-LY 317
Query: 327 VQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTN 386
++PT+ + D I EEIFGP+ +I IEE IE N+ LAA VFTK+ +
Sbjct: 318 IEPTILLDPPLDSSIMTEEIFGPLLPIITVKKIEESIEIINSRPKPLAAYVFTKNKELKR 377
Query: 387 YVTQGLRAG 395
+ +G
Sbjct: 378 RIVAETSSG 386
>gnl|CDD|143446 cd07128, ALDH_MaoC-N, N-terminal domain of the monoamine oxidase C
dehydratase. The N-terminal domain of the MaoC
dehydratase, a monoamine oxidase regulatory protein.
Orthologs of MaoC include PaaZ (Escherichia coli) and
PaaN (Pseudomonas putida), which are putative
ring-opening enzymes of the aerobic phenylacetic acid
(PA) catabolic pathway. The C-terminal domain of MaoC
has sequence similarity to enoyl-CoA hydratase. Also
included in this CD is a novel Burkholderia xenovorans
LB400 ALDH of the aerobic benzoate oxidation (box)
pathway. This pathway involves first the synthesis of a
CoA thio-esterified aromatic acid, with subsequent
dihydroxylation and cleavage steps, yielding the CoA
thio-esterified aliphatic aldehyde,
3,4-dehydroadipyl-CoA semialdehyde, which is further
converted into its corresponding CoA acid by the
Burkholderia LB400 ALDH.
Length = 513
Score = 118 bits (298), Expect = 6e-29
Identities = 104/373 (27%), Positives = 148/373 (39%), Gaps = 70/373 (18%)
Query: 67 VDVEYSLRTLRYYAGFADK--------IHGKTIPI--NGDFFAYTRHEPV---GVCGQII 113
+D++ + TL YA + + G P+ +G F +H GV I
Sbjct: 95 IDIDGGIGTLFAYASLGRRELPNAHFLVEGDVEPLSKDGTFVG--QHILTPRRGVAVHIN 152
Query: 114 PWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGI-PEGVVNIVPGYG 172
+NFP+ + K APAL G +++KPA T AV I E+G+ PEG + ++ G
Sbjct: 153 AFNFPVWGMLEKFAPALLAGVPVIVKPATATAYLTEAVVKDIVESGLLPEGALQLI--CG 210
Query: 173 DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNII------FAD 226
G L D VAFTGS L+ PNI+ A+
Sbjct: 211 SVGDLLDHLGEQDVVAFTGSAATAA---------KLRA--------HPNIVARSIRFNAE 253
Query: 227 AD-LDAAVEGA-------HFGLFF-----NM----GQCCCAGSRTFVQDTIYDEFVARSG 269
AD L+AA+ G F LF M GQ C A R FV + D +
Sbjct: 254 ADSLNAAILGPDATPGTPEFDLFVKEVAREMTVKAGQKCTAIRRAFVPEARVDAVIEALK 313
Query: 270 ERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGG-------GRAGD 322
R + VGDP V GP + +EQ + + + + A++V GG G +
Sbjct: 314 ARLAKVVVGDPRLEGVRMGPLVSREQREDVRAAVAT-LLAEAEVVFGGPDRFEVVGADAE 372
Query: 323 KGFYVQPTVFANVRDDMKIAR---EEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFT 379
KG + PT+ D E FGPV L+ + S+ E IE A L A+V T
Sbjct: 373 KGAFFPPTLLL-CDDPDAATAVHDVEAFGPVATLMPYDSLAEAIELAARGRGSLVASVVT 431
Query: 380 KDLDKTNYVTQGL 392
D + G
Sbjct: 432 NDPAFARELVLGA 444
>gnl|CDD|165847 PLN02203, PLN02203, aldehyde dehydrogenase.
Length = 484
Score = 109 bits (273), Expect = 7e-26
Identities = 83/288 (28%), Positives = 131/288 (45%), Gaps = 22/288 (7%)
Query: 104 EPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPE- 162
EP+GV WNFP+ + + A+A GN +VLKP+E P T+ A +A IP+
Sbjct: 107 EPLGVVLIFSSWNFPIGLSLEPLIGAIAAGNAVVLKPSELAPATS----AFLAAN-IPKY 161
Query: 163 ---GVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKS 219
V ++ G G L+QH DK+ FTGS VG+++ + A +L LELGGK
Sbjct: 162 LDSKAVKVIEGGPAVGEQLLQHKW-DKIFFTGSPRVGRII-MTAAAKHLTPVALELGGKC 219
Query: 220 PNI---IFADADLDAAVE---GAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAK 273
P I + + D AV G +G GQ C A V++ + K
Sbjct: 220 PCIVDSLSSSRDTKVAVNRIVGGKWGSC--AGQACIAIDYVLVEERFAPILIELLKSTIK 277
Query: 274 RRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFA 333
+ +P + ++K+ ++ ++ + + + GG +K +++PT+
Sbjct: 278 KFFGENPRESK-SMARILNKKHFQRLSNLLKDPRVAAS--IVHGGSIDEKKLFIEPTILL 334
Query: 334 NVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKD 381
N D I EEIFGP+ +I IE+ I N+ LA FT +
Sbjct: 335 NPPLDSDIMTEEIFGPLLPIITVKKIEDSIAFINSKPKPLAIYAFTNN 382
>gnl|CDD|236931 PRK11563, PRK11563, bifunctional aldehyde dehydrogenase/enoyl-CoA
hydratase; Provisional.
Length = 675
Score = 108 bits (273), Expect = 2e-25
Identities = 105/375 (28%), Positives = 147/375 (39%), Gaps = 74/375 (19%)
Query: 67 VDVEYSLRTLRYYAGFA------DKIH--GKTIPI--NGDFFA---YTRHEPVGVCGQII 113
+D+E + TL YA D + G+ P+ G F T E V V I
Sbjct: 99 IDIEGGIGTLFTYASKGRRELPNDTVLVEGEVEPLSKGGTFAGRHILTPLEGVAV--HIN 156
Query: 114 PWNFPLLMLAWKIAPALATGNTIVLKPAEQT-PLTALAVGALIAEAGI-PEGVVNIVPGY 171
+NFP+ + K+APA G ++KPA T LT V LI E+G+ PEG + ++
Sbjct: 157 AFNFPVWGMLEKLAPAFLAGVPAIVKPATATAYLTEAVV-RLIVESGLLPEGALQLI--C 213
Query: 172 GDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNII-----F-A 225
G AG L G D V FTGS + L+ PN++ F A
Sbjct: 214 GSAGDLLDHLDGQDVVTFTGSAATAQK---------LRAH--------PNVVANSVPFTA 256
Query: 226 DAD--------LDAAVEGAHFGLFFNM---------GQCCCAGSRTFVQDTIYDEFVARS 268
+AD DA F LF GQ C A R V + D +
Sbjct: 257 EADSLNAAILGPDATPGTPEFDLFVKEVVREMTVKAGQKCTAIRRAIVPRALVDAVIEAL 316
Query: 269 GERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGG-------GRAG 321
R + VGDP V G EQ + + E + + A++V GG G
Sbjct: 317 RARLAKVVVGDPRLEGVRMGALASLEQREDVREKVAA-LLAAAEIVFGGPDSFEVVGADA 375
Query: 322 DKGFYVQPTVFANVRDDMKIARE----EIFGPVQQLIRFSSIEEVIERANNSDYGLAAAV 377
+KG + PT+ DD A E FGPV L+ + ++E IE A L A++
Sbjct: 376 EKGAFFPPTLL--YCDDPLEAPAVHDVEAFGPVSTLMPYDDLDEAIELAARGKGSLVASL 433
Query: 378 FTKDLDKTNYVTQGL 392
T D + + G
Sbjct: 434 VTADPEVARELVLGA 448
>gnl|CDD|237016 PRK11903, PRK11903, aldehyde dehydrogenase; Provisional.
Length = 521
Score = 107 bits (268), Expect = 4e-25
Identities = 89/345 (25%), Positives = 144/345 (41%), Gaps = 38/345 (11%)
Query: 67 VDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTR-------HEPV---GVCGQIIPWN 116
VD++ + TL YYA + + +G+ + H V GV I +N
Sbjct: 100 VDIDGGIFTLGYYAKLGAALGDARLLRDGEAVQLGKDPAFQGQHVLVPTRGVALFINAFN 159
Query: 117 FPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGI-PEGVVNIVPGYGDAG 175
FP L K APAL G +++KPA T + + AGI P G +++V G +
Sbjct: 160 FPAWGLWEKAAPALLAGVPVIVKPATATAWLTQRMVKDVVAAGILPAGALSVVCG---SS 216
Query: 176 AALVQH-PGVDKVAFTGSTEVGKLVQ-QGSGASNLKRTTLELGGKSPNIIFADADLDAAV 233
A L+ H D V+FTGS E +++ + R +E + ++ DA +
Sbjct: 217 AGLLDHLQPFDVVSFTGSAETAAVLRSHPAVVQRSVRVNVEADSLNSALLGPDAAPGSEA 276
Query: 234 EGAHFGLFF---------NMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLN 284
F LF GQ C A R FV + +YD R + TVG+P +
Sbjct: 277 ----FDLFVKEVVREMTVKSGQKCTAIRRIFVPEALYDAVAEALAARLAKTTVGNPRNDG 332
Query: 285 VEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAG------DKGFYVQPTVF-ANVRD 337
V GP + + Q+ + + + ++Q A+++ GG V PT+ A+ D
Sbjct: 333 VRMGPLVSRAQLAAVRAGLAALRAQ-AEVLFDGGGFALVDADPAVAACVGPTLLGASDPD 391
Query: 338 DMKIARE-EIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKD 381
+ E+FGPV L+ + + A L A+V++ D
Sbjct: 392 AATAVHDVEVFGPVATLLPYRDAAHALALARRGQGSLVASVYSDD 436
>gnl|CDD|143403 cd07084, ALDH_KGSADH-like, ALDH subfamily: NAD(P)+-dependent
alpha-ketoglutaric semialdehyde dehydrogenases and plant
delta(1)-pyrroline-5-carboxylate dehydrogenase, ALDH
family 12-like. ALDH subfamily which includes the
NAD(P)+-dependent, alpha-ketoglutaric semialdehyde
dehydrogenases (KGSADH, EC 1.2.1.26); plant
delta(1)-pyrroline-5-carboxylate dehydrogenase (P5CDH,
EC=1.5.1.12 ), ALDH family 12; the N-terminal domain of
the MaoC (monoamine oxidase C) dehydratase regulatory
protein; and orthologs of MaoC, PaaZ and PaaN, which are
putative ring-opening enzymes of the aerobic
phenylacetic acid catabolic pathway.
Length = 442
Score = 104 bits (260), Expect = 3e-24
Identities = 84/372 (22%), Positives = 141/372 (37%), Gaps = 39/372 (10%)
Query: 49 QSSLETLDNGKPYNMSYSVDVEYSLRTLRYYA--GFADKIH---GKTIPINGDFFAYTRH 103
++ L GK + + ++ LR A ++ +I G + ++
Sbjct: 41 IAAGAVLVTGKGWMFA--ENICGDQVQLRARAFVIYSYRIPHEPGNHLGQGLKQQSHGYR 98
Query: 104 EPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGI-PE 162
P G I +NFPL + ++A ALA GN +++KP + + L+ AG+ P
Sbjct: 99 WPYGPVLVIGAFNFPLWIPLLQLAGALAMGNPVIVKPHTAVSIVMQIMVRLLHYAGLLPP 158
Query: 163 GVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLK--RTTLELGGKSP 220
V ++ G G AL+ HP V FTGS+ V + + A + K R LEL G +
Sbjct: 159 EDVTLINGDGKTMQALLLHPNPKMVLFTGSSRVAEKL-----ALDAKQARIYLELAGFNW 213
Query: 221 NIIFADAD-LDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDE-FVARSGERAKRRTVG 278
++ DA +D + GQ C A S FV + V + RR +
Sbjct: 214 KVLGPDAQAVDYVAWQCVQDMTACSGQKCTAQSMLFVPENWSKTPLVEKLKALLARRKLE 273
Query: 279 DPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRA-------GDKGFYVQPTV 331
D + Q L I ++ ++ G+ G V +
Sbjct: 274 DLL---------LGPVQTFTTLAMIAHMENLLGSVLLFSGKELKNHSIPSIYGACVASAL 324
Query: 332 FANVRDDMKIAR---EEIFGPVQQLIRFSSIEE--VIERANNSDYGLAAAVFTKDLDKTN 386
F + + +K EEIFGP ++ + + V+E L AA+++ D
Sbjct: 325 FVPIDEILKTYELVTEEIFGPFAIVVEYKKDQLALVLELLERMHGSLTAAIYSNDPIFLQ 384
Query: 387 YVTQGL-RAGTL 397
+ L AG
Sbjct: 385 ELIGNLWVAGRT 396
>gnl|CDD|131331 TIGR02278, PaaN-DH, phenylacetic acid degradation protein paaN.
This enzyme is proposed to act in the ring-opening step
of phenylacetic acid degradation which follows ligation
of the acid with coenzyme A (by PaaF) and hydroxylation
by a multicomponent non-heme iron hydroxylase complex
(PaaGHIJK). Gene symbols have been standardized in. This
enzyme is related to aldehyde dehydrogenases and has
domains which are members of the pfam00171 and pfam01575
families. This family includes paaN genes from
Pseudomonas, Sinorhizobium, Rhodopseudomonas,
Escherichia, Deinococcus and Corynebacterium. Another
homology family (TIGR02288) includes several other
species.
Length = 663
Score = 99.1 bits (247), Expect = 3e-22
Identities = 83/333 (24%), Positives = 127/333 (38%), Gaps = 20/333 (6%)
Query: 67 VDVEYSLRTLRYYAGFADK--------IHGKTIPINGDFFAYTRH--EPV-GVCGQIIPW 115
VD++ + TL Y+ + + P++ D RH P GV QI +
Sbjct: 95 VDIDGGIGTLFTYSSLGRRELPDANLIPEDEFEPLSKDGSFQGRHILTPKGGVAVQINAF 154
Query: 116 NFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGI-PEGVVNIVPGYGDA 174
NFP+ L K APA G + KPA T A A+ + E+G+ PEG + ++ G A
Sbjct: 155 NFPVWGLLEKFAPAFLAGVPTLAKPATPTAYVAEALVRTMVESGLLPEGSLQLI--CGSA 212
Query: 175 GAALVQHPGVDKVAFTGSTEVGKL------VQQGSGASNLKRTTLELGGKSPNIIFADAD 228
G L D VAFTGS V + N + +L + + +
Sbjct: 213 GDLLDHLDHRDVVAFTGSAATADRLRAHPNVLERGIRFNAEADSLNAAILGEDATPDEPE 272
Query: 229 LDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQG 288
D + L GQ C A R V + + + R + +GDP + V+ G
Sbjct: 273 FDLFAQEIVRELTIKAGQKCTAIRRVIVPKALLEAVLKALQARLAKVVLGDPREEGVDMG 332
Query: 289 PQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFG 348
P + EQ + + + + GA++ GG D F+ + A + E FG
Sbjct: 333 PLVSLEQRADVEAAVAALLAAGAEVRLGGPGRLDGAFFPPTLLLAEDPWAGAVHATEAFG 392
Query: 349 PVQQLIRFSSIEEVIERANNSDYGLAAAVFTKD 381
PV + E A L A + T D
Sbjct: 393 PVATFFPYGDRAEAARLAARGGGSLVATLATSD 425
>gnl|CDD|177831 PLN02174, PLN02174, aldehyde dehydrogenase family 3 member H1.
Length = 484
Score = 98.2 bits (244), Expect = 4e-22
Identities = 72/279 (25%), Positives = 136/279 (48%), Gaps = 7/279 (2%)
Query: 104 EPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEG 163
EP+GV I WN+P L+ + A++ GN +VLKP+E P ++ + L+ E +
Sbjct: 111 EPLGVVLVISAWNYPFLLSIDPVIGAISAGNAVVLKPSELAPASSALLAKLL-EQYLDSS 169
Query: 164 VVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNII 223
V +V G AL++ DK+ +TGS+++G+++ + A +L LELGGKSP ++
Sbjct: 170 AVRVVEGAVTETTALLEQKW-DKIFYTGSSKIGRVI-MAAAAKHLTPVVLELGGKSPVVV 227
Query: 224 FADADLDAAVEGAHFGLF-FNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFD 282
+D DL V G + N GQ C + + + + + +P +
Sbjct: 228 DSDTDLKVTVRRIIAGKWGCNNGQACISPDYILTTKEYAPKVIDAMKKELETFYGKNPME 287
Query: 283 LNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIA 342
+ + ++ D++ + ++ + + + + GG + + PT+ +V D I
Sbjct: 288 -SKDMSRIVNSTHFDRLSKLLD--EKEVSDKIVYGGEKDRENLKIAPTILLDVPLDSLIM 344
Query: 343 REEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKD 381
EEIFGP+ ++ +++EE + + LAA +FT +
Sbjct: 345 SEEIFGPLLPILTLNNLEESFDVIRSRPKPLAAYLFTHN 383
>gnl|CDD|143447 cd07129, ALDH_KGSADH, Alpha-Ketoglutaric Semialdehyde
Dehydrogenase. Alpha-Ketoglutaric Semialdehyde (KGSA)
Dehydrogenase (KGSADH, EC 1.2.1.26) catalyzes the
NAD(P)+-dependent conversion of KGSA to
alpha-ketoglutarate. This CD contains such sequences as
those seen in Azospirillum brasilense, KGSADH-II
(D-glucarate/D-galactarate-inducible) and KGSADH-III
(hydroxy-L-proline-inducible). Both show similar high
substrate specificity for KGSA and different coenzyme
specificity; KGSADH-II is NAD+-dependent and KGSADH-III
is NADP+-dependent. Also included in this CD is the
NADP(+)-dependent aldehyde dehydrogenase from Vibrio
harveyi which catalyzes the oxidation of long-chain
aliphatic aldehydes to acids.
Length = 454
Score = 86.4 bits (215), Expect = 3e-18
Identities = 82/289 (28%), Positives = 115/289 (39%), Gaps = 50/289 (17%)
Query: 105 PVGVCGQIIPWNFPLLMLAWKI-----APALATGNTIVLK--PA--EQTPLTALAVGALI 155
PV V G NFPL A+ + A ALA G +V+K PA + L A A+ A +
Sbjct: 108 PVAVFG---ASNFPL---AFSVAGGDTASALAAGCPVVVKAHPAHPGTSELVARAIRAAL 161
Query: 156 AEAGIPEGVVNIVPGYG-DAGAALVQHPGVDKVAFTGSTEVGK-LVQQGSGASNLKRTTL 213
G+P GV +++ G G + G ALV+HP + V FTGS G+ L +
Sbjct: 162 RATGLPAGVFSLLQGGGREVGVALVKHPAIKAVGFTGSRRGGRALFDAAAARPEPIPFYA 221
Query: 214 ELGGKSPNIIFADA---DLDAAVEGAHFGLFFNMGQ-CCCAGSRTFVQDTIYDEFVARSG 269
ELG +P I A +A +G L GQ C G D F+A
Sbjct: 222 ELGSVNPVFILPGALAERGEAIAQGFVGSLTLGAGQFCTNPGLVLVPAGPAGDAFIAALA 281
Query: 270 ERAKRRTVGDPFDLNVEQGPQIDKEQM--DKILEFIESGKSQ-----GAQLVAGGGRAGD 322
E Q M I E G G +++AGG A +
Sbjct: 282 EALA------------AAPAQT----MLTPGIAEAYRQGVEALAAAPGVRVLAGGA-AAE 324
Query: 323 KGFYVQPTVF----ANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERAN 367
G PT+F A D + +EE+FGP ++R+ E++ A
Sbjct: 325 GGNQAAPTLFKVDAAAFLADPAL-QEEVFGPASLVVRYDDAAELLAVAE 372
>gnl|CDD|143396 cd07077, ALDH-like, NAD(P)+-dependent aldehyde dehydrogenase-like
(ALDH-like) family. The aldehyde dehydrogenase-like
(ALDH-like) group of the ALDH superfamily of
NAD(P)+-dependent enzymes which, in general, oxidize a
wide range of endogenous and exogenous aliphatic and
aromatic aldehydes to their corresponding carboxylic
acids and play an important role in detoxification.
This group includes families ALDH18, ALDH19, and ALDH20
and represents such proteins as gamma-glutamyl phosphate
reductase, LuxC-like acyl-CoA reductase, and coenzyme A
acylating aldehyde dehydrogenase. All of these proteins
have a conserved cysteine that aligns with the catalytic
cysteine of the ALDH group.
Length = 397
Score = 68.0 bits (166), Expect = 2e-12
Identities = 60/239 (25%), Positives = 85/239 (35%), Gaps = 28/239 (11%)
Query: 63 MSYSVDVEYSLR-TLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQIIPWNFPLLM 121
M S Y T R I +P NG+ Y R P+GV I+P PL
Sbjct: 59 MGCSESKLYKNIDTERGITASVGHIQDVLLPDNGE--TYVRAFPIGVTMHILPSTNPLSG 116
Query: 122 LAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEA---GIPEGVVNIVPGYGDAGA-A 177
+ +AT N + +P P T A+ L A P+ +V VP D A
Sbjct: 117 IT-SALRGIATRNQCIFRPHPSAPFTNRALALLFQAADAAHGPKILVLYVPHPSDELAEE 175
Query: 178 LVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAAVEGAH 237
L+ HP +D + TG + + S G SP ++ AD + A H
Sbjct: 176 LLSHPKIDLIVATGGRDAVDAAVK---HSPHIPVIGFGAGNSPVVVDETADEERASGSVH 232
Query: 238 FGLFFNMGQCCCAGSRTF-----VQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQI 291
FF+ Q CA + V D +Y+EF + L V Q +
Sbjct: 233 DSKFFD--QNACASEQNLYVVDDVLDPLYEEFKLKLVVEG----------LKVPQETKP 279
>gnl|CDD|143444 cd07126, ALDH_F12_P5CDH, Delta(1)-pyrroline-5-carboxylate
dehydrogenase, ALDH family 12.
Delta(1)-pyrroline-5-carboxylate dehydrogenase (P5CDH,
EC=1.5.1.12), family 12: a proline catabolic enzyme of
the aldehyde dehydrogenase (ALDH) protein superfamily.
P5CDH is a mitochondrial enzyme involved in proline
degradation and catalyzes the NAD + -dependent
conversion of P5C to glutamate. The P5CDH, ALDH12A1
gene, in Arabidopsis, has been identified as an
osmotic-stress-inducible ALDH gene. This CD contains
both Viridiplantae and Alveolata P5CDH sequences.
Length = 489
Score = 54.4 bits (131), Expect = 5e-08
Identities = 75/333 (22%), Positives = 124/333 (37%), Gaps = 67/333 (20%)
Query: 105 PVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGV 164
P G I P+NFPL + A ++ AL GN +LK + + L+ G+P
Sbjct: 142 PYGPVAIITPFNFPLEIPALQLMGALFMGNKPLLKVDSKVSVVMEQFLRLLHLCGMPATD 201
Query: 165 VNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIF 224
V+++ G ++ FTGS++V +R LEL GK +
Sbjct: 202 VDLIHSDGPTMNKILLEANPRMTLFTGSSKVA------------ERLALELHGK---VKL 246
Query: 225 ADADLDAAVEGAHFG------------LFFNMGQCCCAGSRTFVQDTIYDE-FVARSGER 271
DA D + G + GQ C A S F + + +
Sbjct: 247 EDAGFDWKILGPDVSDVDYVAWQCDQDAYACSGQKCSAQSILFAHENWVQAGILDKLKAL 306
Query: 272 AKRRTVGDPFDLNVEQGP------QIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGF 325
A++R + DL + GP + + +DK+L GA+++ GG +
Sbjct: 307 AEQRKLE---DLTI--GPVLTWTTERILDHVDKLLAI------PGAKVLFGGKPLTNHSI 355
Query: 326 -----YVQPT-VF-----ANVRDDMKIAREEIFGPVQQLIRFSSIEE--VIERANNSDYG 372
+PT VF + ++ ++ E+FGP Q + + + V+E
Sbjct: 356 PSIYGAYEPTAVFVPLEEIAIEENFELVTTEVFGPFQVVTEYKDEQLPLVLEALERMHAH 415
Query: 373 LAAAVFTKDLDKTNYVT---------QGLRAGT 396
L AAV + D+ V G+RA T
Sbjct: 416 LTAAVVSNDIRFLQEVLANTVNGTTYAGIRART 448
>gnl|CDD|143400 cd07081, ALDH_F20_ACDH_EutE-like, Coenzyme A acylating aldehyde
dehydrogenase (ACDH), Ethanolamine utilization protein
EutE, and related proteins. Coenzyme A acylating
aldehyde dehydrogenase (ACDH), an NAD+ and CoA-dependent
acetaldehyde dehydrogenase, acetylating (EC=1.2.1.10),
functions as a single enzyme (such as the Ethanolamine
utilization protein, EutE, in Salmonella typhimurium) or
as part of a multifunctional enzyme to convert
acetaldehyde into acetyl-CoA. The E. coli
aldehyde-alcohol dehydrogenase includes the functional
domains, alcohol dehydrogenase (ADH), ACDH, and
pyruvate-formate-lyase deactivase; and the Entamoeba
histolytica aldehyde-alcohol dehydrogenase 2 (ALDH20A1)
includes the functional domains ADH and ACDH, and may be
critical enzymes in the fermentative pathway.
Length = 439
Score = 52.3 bits (125), Expect = 3e-07
Identities = 61/335 (18%), Positives = 116/335 (34%), Gaps = 41/335 (12%)
Query: 79 YAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVL 138
Y + D+ + + + EP+GV I P P + +K +L T N+I+
Sbjct: 69 YNVYKDEKTCGVLTGDENGGTLIIAEPIGVVASITPSTNPTSTVIFKSLISLKTRNSIIF 128
Query: 139 KP----AEQTPLTALAVGALIAEAGIPEGVVNIV--PGYGDAGAALVQHPGVDKVAFTGS 192
P + T A + AG PE ++ + P + L++ PG+ + TG
Sbjct: 129 SPHPRAKKVTQRAATLLLQAAVAAGAPENLIGWIDNPSI-ELAQRLMKFPGIGLLLATGG 187
Query: 193 TEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGS 252
V K + S+ K G +P +I AD+ AV+ F+ G C +
Sbjct: 188 PAVVK-----AAYSSGKPAIGVGAGNTPVVIDETADIKRAVQSIVKSKTFDNGVICASEQ 242
Query: 253 RTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQ 312
V D++YDE + + ++ E++ ++ I
Sbjct: 243 SVIVVDSVYDEVMRLFEGQG---------------AYKLTAEELQQVQPVILKNGDVNRD 287
Query: 313 LVA--GGGRAGDKGFYVQPT-----VFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIER 365
+V A G V + + E PV + R ++ + +
Sbjct: 288 IVGQDAYKIAAAAGLKVPQETRILIGEVTSLAEHEPFAHEKLSPVLAMYRAANFADADAK 347
Query: 366 A----NNSDYGLAAAVFT---KDLDKTNYVTQGLR 393
A G +A+++ K ++ N ++
Sbjct: 348 ALALKLEGGCGHTSAMYSDNIKAIENMNQFANAMK 382
>gnl|CDD|143440 cd07122, ALDH_F20_ACDH, Coenzyme A acylating aldehyde dehydrogenase
(ACDH), ALDH family 20-like. Coenzyme A acylating
aldehyde dehydrogenase (ACDH, EC=1.2.1.10), an NAD+ and
CoA-dependent acetaldehyde dehydrogenase, functions as a
single enzyme (such as the Ethanolamine utilization
protein, EutE, in Salmonella typhimurium) or as part of
a multifunctional enzyme to convert acetaldehyde into
acetyl-CoA . The E. coli aldehyde-alcohol dehydrogenase
includes the functional domains, alcohol dehydrogenase
(ADH), ACDH, and pyruvate-formate-lyase deactivase; and
the Entamoeba histolytica aldehyde-alcohol dehydrogenase
2 (ALDH20A1) includes the functional domains ADH and
ACDH and may be critical enzymes in the fermentative
pathway.
Length = 436
Score = 49.0 bits (118), Expect = 3e-06
Identities = 82/324 (25%), Positives = 121/324 (37%), Gaps = 61/324 (18%)
Query: 104 EPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKP---AEQTPLTALAVGALIA---E 157
EPVGV +IP P +K AL T N I+ P A++ + A +
Sbjct: 94 EPVGVIAALIPSTNPTSTAIFKALIALKTRNAIIFSPHPRAKKC--SIEAAKIMREAAVA 151
Query: 158 AGIPEGVVNIVPGYGDAGA-ALVQHPGVDKVAFTGSTEVGKLVQQ------GSGASNLKR 210
AG PEG++ + L++HP VD + TG + K G G N
Sbjct: 152 AGAPEGLIQWIEEPSIELTQELMKHPDVDLILATGGPGMVKAAYSSGKPAIGVGPGN--- 208
Query: 211 TTLELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGE 270
P I AD+ AV+ F+ G C + V D IYDE A
Sbjct: 209 --------VPAYIDETADIKRAVKDIILSKTFDNGTICASEQSVIVDDEIYDEVRA---- 256
Query: 271 RAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGK------SQGAQLVAGGGRAGDKG 324
KRR G F LN E+ KE+++K L F + G + AQ +A AG +
Sbjct: 257 ELKRR--GAYF-LNEEE-----KEKLEKAL-FDDGGTLNPDIVGKSAQKIA--ELAGIE- 304
Query: 325 FYVQPTV------FANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERA----NNSDYGLA 374
V V + ++RE++ PV R EE +E+A G
Sbjct: 305 --VPEDTKVLVAEETGVGPEEPLSREKLS-PVLAFYRAEDFEEALEKARELLEYGGAGHT 361
Query: 375 AAVFTKDLDKTNYVTQGLRAGTLI 398
A + + D + + ++
Sbjct: 362 AVIHSNDEEVIEEFALRMPVSRIL 385
>gnl|CDD|143439 cd07121, ALDH_EutE, Ethanolamine utilization protein EutE-like.
Coenzyme A acylating aldehyde dehydrogenase (ACDH), an
NAD+ and CoA-dependent acetaldehyde dehydrogenase,
acetylating (EC=1.2.1.10), converts acetaldehyde into
acetyl-CoA. This CD is limited to such monofunctional
enzymes as the Ethanolamine utilization protein, EutE,
in Salmonella typhimurium. Mutations in eutE abolish
the ability to utilize ethanolamine as a carbon source.
Length = 429
Score = 47.6 bits (114), Expect = 7e-06
Identities = 69/312 (22%), Positives = 111/312 (35%), Gaps = 65/312 (20%)
Query: 105 PVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKP---AEQTPLTALAVGAL---IAEA 158
P GV G I P P + LA GN +V P A++ +A AV + IAEA
Sbjct: 97 PFGVIGAITPSTNPTETIINNSISMLAAGNAVVFNPHPGAKKV--SAYAVELINKAIAEA 154
Query: 159 GIPEGVVNIV--PGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQ------GSGASNLKR 210
G P+ +V V P + L+ HP ++ + TG V K G+GA N
Sbjct: 155 GGPDNLVVTVEEPTI-ETTNELMAHPDINLLVVTGGPAVVKAALSSGKKAIGAGAGN--- 210
Query: 211 TTLELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGE 270
P ++ AD++ A G F+ C A D++ D +A
Sbjct: 211 --------PPVVVDETADIEKAARDIVQGASFDNNLPCIAEKEVIAVDSVADYLIAAMQR 262
Query: 271 RAKRRTVGDPFDLNVEQGPQIDKEQMDKILE--------------FIESGKSQGAQLVAG 316
++ EQ +++LE ++ GK L A
Sbjct: 263 ---------------NGAYVLNDEQAEQLLEVVLLTNKGATPNKKWV--GKDASKILKAA 305
Query: 317 GGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGL--A 374
G + D EE P+ ++R + +E IE A ++G
Sbjct: 306 GIEVPAD----IRLIIVETDKDHPFVVEEQMMPILPVVRVKNFDEAIELAVELEHGNRHT 361
Query: 375 AAVFTKDLDKTN 386
A + +K+++
Sbjct: 362 AIIHSKNVENLT 373
>gnl|CDD|131341 TIGR02288, PaaN_2, phenylacetic acid degradation protein paaN.
This enzyme is proposed to act in the ring-opening step
of phenylacetic acid degradation which follows ligation
of the acid with coenzyme A (by PaaF) and hydroxylation
by a multicomponent non-heme iron hydroxylase complex
(PaaGHIJK). Gene symbols have been standardized in. This
enzyme is related to aldehyde dehydrogenases and has a
domain which is a member of the pfam00171 family. This
family includes sequences from Burkholderia, Bordetella,
Streptomyces. Other PaaN enzymes are represented by a
separate model, TIGR02278.
Length = 551
Score = 47.6 bits (113), Expect = 8e-06
Identities = 64/239 (26%), Positives = 90/239 (37%), Gaps = 43/239 (17%)
Query: 69 VEYSLRTLRYYAGFA--DKIHGKTIPINGDFFAYTRHEPVG-------VCGQIIPWN-FP 118
V Y+ R + A +K GK P+ R V C WN +P
Sbjct: 157 VAYAYREMSRIPETAVWEKPQGKNDPLKLK----KRFTIVPRGIALVIGCSTFPTWNTYP 212
Query: 119 LLMLAWKIAPALATGNTIVLKPAEQTPL-TALAVGA---LIAEAGIPEGVVNIV--PGYG 172
L +LATGN +++KP L AL V ++ EAG +V +
Sbjct: 213 GLF------ASLATGNPVLVKPHPGAILPLALTVQVAREVLGEAGFDPNLVTLAAFDPGH 266
Query: 173 DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTL--ELGGKSPNIIFADADLD 230
+A L P V + FTGS G+ ++Q N ++ + E G + II + D
Sbjct: 267 EAAQRLATDPAVRIIDFTGSNAFGQWLEQ-----NARQAQVYTEKAGVNTVIIESTDDYK 321
Query: 231 AAVEGAHFGLFFNMGQCCCAGSRTFV-QDTI--------YDEFVARSGERAKRRTVGDP 280
A + F L GQ C V +D I YDE VA A +GDP
Sbjct: 322 AMLRNLAFSLSLYSGQMCTTTQAILVPRDGIRTDQGRKSYDE-VAADLATAIDGLLGDP 379
>gnl|CDD|143445 cd07127, ALDH_PAD-PaaZ, Phenylacetic acid degradation proteins PaaZ
(Escherichia coli) and PaaN (Pseudomonas putida)-like.
Phenylacetic acid degradation (PAD) proteins PaaZ
(Escherichia coli) and PaaN (Pseudomonas putida) are
putative aromatic ring cleavage enzymes of the aerobic
PA catabolic pathway. PaaZ mutants were defective for
growth with PA as a sole carbon source due to
interruption of the putative ring opening system. This
CD is limited to bacterial monofunctional enzymes.
Length = 549
Score = 47.1 bits (112), Expect = 1e-05
Identities = 67/280 (23%), Positives = 99/280 (35%), Gaps = 33/280 (11%)
Query: 109 CGQIIPWN-FPLLMLAWKIAPALATGNTIVLKPAEQTPL-TALAVG---ALIAEAGIPEG 163
C WN +P L +LATGN +++KP L A+ V ++AEAG
Sbjct: 202 CSTFPTWNGYPGLF------ASLATGNPVIVKPHPAAILPLAITVQVAREVLAEAGFDPN 255
Query: 164 VVNIV--PGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPN 221
+V + L P V + FTGS G ++ + + + E G +
Sbjct: 256 LVTLAADTPEEPIAQTLATRPEVRIIDFTGSNAFGDWLEANARQAQV---YTEKAGVNTV 312
Query: 222 IIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFV-QDTI--------YDEFVARSGERA 272
++ + DL A + F L GQ C +V +D I +DE A A
Sbjct: 313 VVDSTDDLKAMLRNLAFSLSLYSGQMCTTPQNIYVPRDGIQTDDGRKSFDEVAADLA-AA 371
Query: 273 KRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAG---DKGFYVQP 329
+ DP G Q L I + G L+A A P
Sbjct: 372 IDGLLADPARAAALLGAI----QSPDTLARIAEARQLGEVLLASEAVAHPEFPDARVRTP 427
Query: 330 TVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNS 369
+ D EE FGP+ ++ S + IE A S
Sbjct: 428 LLLKLDASDEAAYAEERFGPIAFVVATDSTDHSIELARES 467
>gnl|CDD|237956 PRK15398, PRK15398, aldehyde dehydrogenase EutE; Provisional.
Length = 465
Score = 38.3 bits (90), Expect = 0.006
Identities = 65/292 (22%), Positives = 104/292 (35%), Gaps = 69/292 (23%)
Query: 105 PVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKP---AEQTPLTALAVGAL---IAEA 158
P GV G + P P + LA GN++V P A++ + A+ L I A
Sbjct: 129 PFGVIGAVTPSTNPTETIINNAISMLAAGNSVVFSPHPGAKKV--SLRAIELLNEAIVAA 186
Query: 159 GIPEGVVNIV--PGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQ------GSGASNLKR 210
G PE +V V P + L++HPG+ + TG V K + G+GA N
Sbjct: 187 GGPENLVVTVAEPTI-ETAQRLMKHPGIALLVVTGGPAVVKAAMKSGKKAIGAGAGN--- 242
Query: 211 TTLELGGKSPNIIFADADLDAA----VEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVA 266
P ++ AD++ A V+GA F N+ C A V D++ DE +
Sbjct: 243 --------PPVVVDETADIEKAARDIVKGASFD--NNL--PCIAEKEVIVVDSVADELMR 290
Query: 267 RSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKIL------------EFIESGKSQGAQLV 314
+ + EQ +K+ +++ GK L
Sbjct: 291 LMEK---------------NGAVLLTAEQAEKLQKVVLKNGGTVNKKWV--GKDAAKILE 333
Query: 315 AGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERA 366
A G + + E+ PV ++R ++E I A
Sbjct: 334 AAGINVPKD----TRLLIVETDANHPFVVTELMMPVLPVVRVKDVDEAIALA 381
>gnl|CDD|143399 cd07080, ALDH_Acyl-CoA-Red_LuxC, Acyl-CoA reductase LuxC. Acyl-CoA
reductase, LuxC, (EC=1.2.1.50) is the fatty acid
reductase enzyme responsible for synthesis of the
aldehyde substrate for the luminescent reaction
catalyzed by luciferase. The fatty acid reductase, a
luminescence-specific, multienzyme complex (LuxCDE),
reduces myristic acid to generate the long chain fatty
aldehyde required for the luciferase-catalyzed reaction
resulting in the emission of blue-green light.
Mutational studies of conserved cysteines of LuxC
revealed that the cysteine which aligns with the
catalytic cysteine conserved throughout the ALDH
superfamily is the LuxC acylation site. This CD is
composed of mainly bacterial sequences but also includes
a few archaeal sequences similar to the Methanospirillum
hungateiacyl acyl-CoA reductase RfbN.
Length = 422
Score = 36.9 bits (86), Expect = 0.019
Identities = 22/59 (37%), Positives = 30/59 (50%), Gaps = 1/59 (1%)
Query: 99 AYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAE 157
Y R +P G+ II N PLL + W I L N +LK + PLTA A+ +A+
Sbjct: 106 GYIRAQPRGLVVHIIAGNVPLLPV-WSIVRGLLVKNVNLLKMSSSDPLTATALLRSLAD 163
>gnl|CDD|219180 pfam06798, PrkA, PrkA serine protein kinase C-terminal domain.
This is a family of PrkA bacterial and archaeal serine
kinases approximately 630 residues long. This family
corresponds to the C-terminal domain.
Length = 254
Score = 32.2 bits (74), Expect = 0.45
Identities = 40/180 (22%), Positives = 69/180 (38%), Gaps = 43/180 (23%)
Query: 255 FVQDTIYDEFVARSGERAKRRTVGDPFDL------NVEQGPQIDKEQMDKILEFIESGKS 308
F+ + + V+ S ER +P D+ ++ I +E DK LEF++ +
Sbjct: 41 FIGKALSNALVSDSEERCI-----NPLDVLEELEQGIKDHESIPEEDRDKYLEFLKVVRK 95
Query: 309 ----------QGAQLVA---GGGRAGDKGFYVQPTVFANVRDDMKIARE---EIFGPVQQ 352
Q A + + D Y+ V A + D+ K+ E P ++
Sbjct: 96 EYNERIKKEVQKAYIESYEEAAQNLFDN--YLD-NVEAYINDE-KVKDPLTGEELEPDER 151
Query: 353 LIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTLIRFSSIE---EVIER 409
+R SIEE I + F +++ NY+ + R G + S E E IE+
Sbjct: 152 FLR--SIEEQIGISEE-----RKKDFRREI--VNYIGRLAREGKKFDYDSYERLREAIEK 202
>gnl|CDD|218799 pfam05893, LuxC, Acyl-CoA reductase (LuxC). This family consists
of several bacterial Acyl-CoA reductase (LuxC) proteins.
The channelling of fatty acids into the fatty aldehyde
substrate for the bacterial bioluminescence reaction is
catalyzed by a fatty acid reductase multienzyme complex,
which channels fatty acids through the thioesterase
(LuxD), synthetase (LuxE) and reductase (LuxC)
components.
Length = 400
Score = 32.3 bits (74), Expect = 0.52
Identities = 45/201 (22%), Positives = 75/201 (37%), Gaps = 42/201 (20%)
Query: 100 YTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAG 159
Y + +P G+ ++ N PLL + I + N +LKP+ P TA A+ A A+
Sbjct: 83 YEKAQPRGLVVHVLAGNVPLLPV-MSILQGILVKNVNLLKPSSSDPFTAAALLASFAD-- 139
Query: 160 IPEGVVNIVPGYGDAGA-ALVQHPGVDKVAFTGSTEVGKLVQQGS-------GASNLKRT 211
+ + A + ++V G T++ +L+ Q + G +K
Sbjct: 140 -------VDATHPLARSISVVYWHS-------GDTQLEQLIMQHADVVIAWGGEEAIKWI 185
Query: 212 TLELGGKSPNIIF----------ADADLDAAVEG-AHFGLFFNMGQCCCAGSRT-FV--Q 257
L I F A +D A +G AH F++ Q C+ + FV
Sbjct: 186 RKHLPPGCDWIKFGPKISFAVVDPPAAIDKAADGVAHDICFYD--QQACSSPQVVFVESD 243
Query: 258 DTI-YDEFVARSGERAKRRTV 277
D I EF R ++
Sbjct: 244 DGITLREFAQRLAAALEKYAR 264
>gnl|CDD|237515 PRK13805, PRK13805, bifunctional acetaldehyde-CoA/alcohol
dehydrogenase; Provisional.
Length = 862
Score = 32.1 bits (74), Expect = 0.76
Identities = 52/182 (28%), Positives = 72/182 (39%), Gaps = 36/182 (19%)
Query: 104 EPVGV-CGQIIPWNFPLLMLAWKIAPALATGNTIVLKP---AEQTPLTALAV--GALIAE 157
EPVGV G I P P +K AL T N I+ A+++ + A + A +A
Sbjct: 107 EPVGVIAG-ITPTTNPTSTAIFKALIALKTRNPIIFSFHPRAQKSSIAAAKIVLDAAVA- 164
Query: 158 AGIPEGVVNIVPGYG-DAGAALVQHPGVDKVAFTG---------STEVGK--LVQQGSGA 205
AG P+ ++ + + AL+ HPG+ + TG S+ GK L G GA
Sbjct: 165 AGAPKDIIQWIEEPSVELTNALMNHPGIALILATGGPGMVKAAYSS--GKPAL---GVGA 219
Query: 206 SNLKRTTLELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFV 265
N P I AD+ AV F+ G C + V D IYDE
Sbjct: 220 GN-----------VPAYIDKTADIKRAVNDILLSKTFDNGMICASEQAVIVDDEIYDEVK 268
Query: 266 AR 267
Sbjct: 269 EE 270
>gnl|CDD|237108 PRK12467, PRK12467, peptide synthase; Provisional.
Length = 3956
Score = 32.1 bits (73), Expect = 0.79
Identities = 18/51 (35%), Positives = 28/51 (54%), Gaps = 3/51 (5%)
Query: 124 WKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYGDA 174
++ ALA+G T+ L P + A A AL+A+ G+ V+ IVP + A
Sbjct: 713 TELFGALASGATLHLLPPDCA-RDAEAFAALMADQGV--TVLKIVPSHLQA 760
>gnl|CDD|131570 TIGR02518, EutH_ACDH, acetaldehyde dehydrogenase (acetylating).
Length = 488
Score = 31.4 bits (71), Expect = 1.0
Identities = 18/66 (27%), Positives = 26/66 (39%), Gaps = 4/66 (6%)
Query: 104 EPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAE----AG 159
PVGV +IP P +K ++ N IV P + L+ + AG
Sbjct: 103 VPVGVVAGLIPSTNPTSTAIYKTLISIKARNAIVFSPHPNAKKCIIETVKLMRKAAEEAG 162
Query: 160 IPEGVV 165
PEG +
Sbjct: 163 APEGAI 168
>gnl|CDD|240868 cd12422, RRM2_PTBP1_hnRNPL_like, RNA recognition motif in
polypyrimidine tract-binding protein 1 (PTB or hnRNP
I), heterogeneous nuclear ribonucleoprotein L
(hnRNP-L), and similar proteins. This subfamily
corresponds to the RRM2 of polypyrimidine tract-binding
protein 1 (PTB or hnRNP I), polypyrimidine
tract-binding protein 2 (PTBP2 or nPTB), regulator of
differentiation 1 (Rod1), heterogeneous nuclear
ribonucleoprotein L (hnRNP-L), heterogeneous nuclear
ribonucleoprotein L-like (hnRNP-LL), polypyrimidine
tract-binding protein homolog 3 (PTBPH3),
polypyrimidine tract-binding protein homolog 1 and 2
(PTBPH1 and PTBPH2), and similar proteins, and RRM3 of
PTBPH1 and PTBPH2. PTB is an important negative
regulator of alternative splicing in mammalian cells
and also functions at several other aspects of mRNA
metabolism, including mRNA localization, stabilization,
polyadenylation, and translation. PTBP2 is highly
homologous to PTB and is perhaps specific to the
vertebrates. Unlike PTB, PTBP2 is enriched in the brain
and in some neural cell lines. It binds more stably to
the downstream control sequence (DCS) RNA than PTB does
but is a weaker repressor of splicing in vitro. PTBP2
also greatly enhances the binding of two other
proteins, heterogeneous nuclear ribonucleoprotein
(hnRNP) H and KH-type splicing-regulatory protein
(KSRP), to the DCS RNA. The binding properties of PTBP2
and its reduced inhibitory activity on splicing imply
roles in controlling the assembly of other
splicing-regulatory proteins. Rod1 is a mammalian
polypyrimidine tract binding protein (PTB) homolog of a
regulator of differentiation in the fission yeast
Schizosaccharomyces pombe, where the nrd1 gene encodes
an RNA binding protein negatively regulates the onset
of differentiation. ROD1 is predominantly expressed in
hematopoietic cells or organs. It might play a role
controlling differentiation in mammals. hnRNP-L is a
higher eukaryotic specific subunit of human KMT3a (also
known as HYPB or hSet2) complex required for histone H3
Lys-36 trimethylation activity. It plays both, nuclear
and cytoplasmic, roles in mRNA export of intronless
genes, IRES-mediated translation, mRNA stability, and
splicing. hnRNP-LL protein plays a critical and unique
role in the signal-induced regulation of CD45 and acts
as a global regulator of alternative splicing in
activated T cells. This family also includes
polypyrimidine tract binding protein homolog 3 (PTBPH3)
found in plant. Although its biological roles remain
unclear, PTBPH3 shows significant sequence similarity
to other family members, all of which contain four RNA
recognition motifs (RRM), also known as RBD (RNA
binding domain) or RNP (ribonucleoprotein domain).
Although their biological roles remain unclear, both
PTBPH1 and PTBPH2 show significant sequence similarity
to PTB. However, in contrast to PTB, they have three
RRMs. .
Length = 85
Score = 29.1 bits (66), Expect = 1.2
Identities = 11/55 (20%), Positives = 26/55 (47%), Gaps = 5/55 (9%)
Query: 24 AVPKPITKPKIEHTQLFINNEFVN---AQSSLETLDNGKPYNMSYSVDVEYSLRT 75
AV K + K Q + +F + A+++ + L+ Y+ ++D+++S
Sbjct: 27 AVEKILIFEKNTGVQALV--QFDSVESAENAKKALNGRNIYDGCCTLDIQFSRLK 79
>gnl|CDD|237250 PRK12893, PRK12893, allantoate amidohydrolase; Reviewed.
Length = 412
Score = 31.0 bits (71), Expect = 1.2
Identities = 21/82 (25%), Positives = 27/82 (32%), Gaps = 25/82 (30%)
Query: 234 EGAHFGLFFNMGQCCCAGSRTFV----QDTIYD----------EFVARSGERAKRRTVGD 279
EGA F GS F D E +AR G R R
Sbjct: 130 EGARFAP-------AMLGSGVFTGALPLDDALARRDADGITLGEALARIGYRGTARVGRR 182
Query: 280 PFD----LNVEQGPQIDKEQMD 297
D L++EQGP ++ E +
Sbjct: 183 AVDAYLELHIEQGPVLEAEGLP 204
>gnl|CDD|213296 cd05930, A_NRPS, The adenylation domain of nonribosomal peptide
synthetases (NRPS). The adenylation (A) domain of NRPS
recognizes a specific amino acid or hydroxy acid and
activates it as an (amino) acyl adenylate by hydrolysis
of ATP. The activated acyl moiety then forms a thioester
bond to the enzyme-bound cofactor phosphopantetheine of
a peptidyl carrier protein domain. NRPSs are large
multifunctional enzymes which synthesize many
therapeutically useful peptides in bacteria and fungi
via a template-directed, nucleic acid independent
nonribosomal mechanism. These natural products include
antibiotics, immunosuppressants, plant and animal
toxins, and enzyme inhibitors. NRPS has a distinct
modular structure in which each module is responsible
for the recognition, activation, and in some cases,
modification of a single amino acid residue of the final
peptide product. The modules can be subdivided into
domains that catalyze specific biochemical reactions.
Length = 445
Score = 30.5 bits (70), Expect = 2.0
Identities = 19/49 (38%), Positives = 26/49 (53%), Gaps = 5/49 (10%)
Query: 124 WKIAPALATGNTIVLKPAEQ-TPLTALAVGALIAEAGIPEGVVNIVPGY 171
W+I P L +G T+VL P E ALA L+ E I V+++VP
Sbjct: 150 WEIFPPLLSGATLVLAPPEVLRDPEALA--ELLREHRI--TVLHLVPSL 194
>gnl|CDD|227012 COG4666, COG4666, TRAP-type uncharacterized transport system, fused
permease components [General function prediction only].
Length = 642
Score = 30.0 bits (68), Expect = 3.0
Identities = 13/50 (26%), Positives = 17/50 (34%)
Query: 118 PLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNI 167
L L K +L+ E + V A AGI GVV +
Sbjct: 386 VLRALFRKDTRMGKILARDLLEALESGARNVIPVAIACATAGIIVGVVTL 435
>gnl|CDD|143588 cd07564, nitrilases_CHs, Nitrilases, cyanide hydratase (CH)s, and
similar proteins (class 1 nitrilases). Nitrilases
(nitrile aminohydrolases, EC:3.5.5.1) hydrolyze nitriles
(RCN) to ammonia and the corresponding carboxylic acid.
Most nitrilases prefer aromatic nitriles, some prefer
arylacetonitriles and others aliphatic nitriles. This
group includes the nitrilase cyanide dihydratase (CDH),
which hydrolyzes inorganic cyanide (HCN) to produce
formate. It also includes cyanide hydratase (CH), which
hydrolyzes HCN to formamide. This group includes four
Arabidopsis thaliana nitrilases (Ath)NIT1-4. AthNIT1-3
have a strong substrate preference for
phenylpropionitrile (PPN) and other nitriles which may
originate from the breakdown of glucosinolates. The
product of PPN hydrolysis, phenylacetic acid has auxin
activity. AthNIT1-3 can also convert indoacetonitrile to
indole-3-acetic acid (IAA, auxin), but with a lower
affinity and velocity. From their expression patterns,
it has been speculated that NIT3 may produce IAA during
the early stages of germination, and that NIT3 may
produce IAA during embryo development and maturation.
AthNIT4 has a strong substrate specificity for the
nitrile, beta-cyano-L-alanine (Ala(CN)), an intermediate
of cyanide detoxification. AthNIT4 has both a nitrilase
activity and a nitrile hydratase (NHase) activity, which
generate aspartic acid and asparagine respectively from
Ala(CN). NHase catalyzes the hydration of nitriles to
their corresponding amides. This subgroup belongs to a
larger nitrilase superfamily comprised of belong to a
larger nitrilase superfamily comprised of nitrile- or
amide-hydrolyzing enzymes and amide-condensing enzymes,
which depend on a Glu-Lys-Cys catalytic triad. This
superfamily has been classified in the literature based
on global and structure based sequence analysis into
thirteen different enzyme classes (referred to as 1-13),
this subgroup corresponds to class 1.
Length = 297
Score = 29.0 bits (66), Expect = 5.0
Identities = 8/20 (40%), Positives = 11/20 (55%)
Query: 296 MDKILEFIESGKSQGAQLVA 315
++K IE + GAQLV
Sbjct: 19 VEKACRLIEEAAANGAQLVV 38
>gnl|CDD|184499 PRK14090, PRK14090, phosphoribosylformylglycinamidine synthase II;
Provisional.
Length = 601
Score = 29.1 bits (65), Expect = 5.6
Identities = 18/53 (33%), Positives = 28/53 (52%), Gaps = 11/53 (20%)
Query: 268 SGERAKRRT--VGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGG 318
+GE+A + + VGDPF +K ++ LE +E G +GAQ + GG
Sbjct: 196 TGEKATKLSIQVGDPF---------AEKMLIEAFLEMVEEGLVEGAQDLGAGG 239
>gnl|CDD|179591 PRK03545, PRK03545, putative arabinose transporter; Provisional.
Length = 390
Score = 28.7 bits (65), Expect = 5.8
Identities = 20/65 (30%), Positives = 30/65 (46%), Gaps = 18/65 (27%)
Query: 115 WNFPLLMLA-----------WKIAPALATGNTIVLKPAEQTP--LTALAVGALIAE-AGI 160
WNF +L+++ W I +LA I + PA + L+ LA G +A G+
Sbjct: 95 WNFTVLLISRIGIAFAHAIFWSITASLA----IRVAPAGKKAQALSLLATGTALAMVLGL 150
Query: 161 PEGVV 165
P G V
Sbjct: 151 PLGRV 155
>gnl|CDD|219213 pfam06878, Pkip-1, Pkip-1 protein. This family consists of several
Pkip-1 proteins which seem to be specific to
Nucleopolyhedroviruses. The function of this family is
unknown although it has been found that Pkip-1 is not
essential for virus replication in cell culture or by in
vivo intrahaemocoelic injection.
Length = 163
Score = 28.0 bits (63), Expect = 5.9
Identities = 15/48 (31%), Positives = 24/48 (50%), Gaps = 7/48 (14%)
Query: 323 KGFYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSD 370
K + +P+ N+ ++M I +IFG +QL E + AN SD
Sbjct: 22 KKYLKKPSKDENLAEEMLILAADIFGQEEQL-------EALLSANASD 62
>gnl|CDD|215956 pfam00504, Chloroa_b-bind, Chlorophyll A-B binding protein.
Length = 135
Score = 27.7 bits (62), Expect = 6.4
Identities = 10/30 (33%), Positives = 15/30 (50%)
Query: 150 AVGALIAEAGIPEGVVNIVPGYGDAGAALV 179
+GAL+ E G V P + +AG L+
Sbjct: 24 VLGALVPELLTKLGGVAGEPAWFEAGGPLL 53
>gnl|CDD|220561 pfam10091, Glycoamylase, Putative glucoamylase. The structure of
UniProt:Q5LIB7 has an alpha/alpha toroid fold and is
similar structurally to a number of glucoamylases. Most
of these structural homologues are glucoamylases,
involved in breaking down complex sugars (e.g. starch).
The biologically relevant state is likely to be
monomeric. The putative active site is located at the
centre of the toroid with a well defined large cavity.
Length = 218
Score = 28.0 bits (63), Expect = 8.1
Identities = 7/32 (21%), Positives = 13/32 (40%), Gaps = 2/32 (6%)
Query: 60 PYN--MSYSVDVEYSLRTLRYYAGFADKIHGK 89
PY ++ E +L LR + + G+
Sbjct: 141 PYAALLALPFAPEEALANLRRLYDYGPGLWGR 172
>gnl|CDD|234133 TIGR03168, 1-PFK, hexose kinase, 1-phosphofructokinase family.
This family consists largely of 1-phosphofructokinases,
but also includes tagatose-6-kinases and
6-phosphofructokinases.
Length = 303
Score = 28.3 bits (64), Expect = 8.6
Identities = 16/53 (30%), Positives = 28/53 (52%), Gaps = 3/53 (5%)
Query: 278 GDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGG--RAGDKGFYVQ 328
G+ +LN E GP+I +E+++++LE + + G +V G FY Q
Sbjct: 97 GEETELN-EPGPEISEEELEQLLEKLRELLASGDIVVISGSLPPGVPPDFYAQ 148
>gnl|CDD|219388 pfam07368, DUF1487, Protein of unknown function (DUF1487). This
family consists of several uncharacterized proteins from
Drosophila melanogaster. The function of this family is
unknown.
Length = 215
Score = 28.1 bits (63), Expect = 8.8
Identities = 18/52 (34%), Positives = 25/52 (48%), Gaps = 14/52 (26%)
Query: 222 IIFADADLDAA----VEGAH--FGLFFNMGQCCCAGSRTFVQDTIYDEFVAR 267
+IF D D+++A VE H F A + VQ++I DEFV R
Sbjct: 9 VIFEDGDVNSALHALVESLHNPFAPG--------AVATVLVQESIRDEFVER 52
>gnl|CDD|238570 cd01164, FruK_PfkB_like, 1-phosphofructokinase (FruK), minor
6-phosphofructokinase (pfkB) and related sugar kinases.
FruK plays an important role in the predominant pathway
for fructose utilisation.This group also contains
tagatose-6-phophate kinase, an enzyme of the tagatose
6-phosphate pathway, which responsible for breakdown of
the galactose moiety during lactose metabolism by
bacteria such as L. lactis.
Length = 289
Score = 27.9 bits (63), Expect = 9.4
Identities = 12/42 (28%), Positives = 25/42 (59%), Gaps = 1/42 (2%)
Query: 276 TVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGG 317
G ++N E GP+I +E+++ +LE +++ +G +V G
Sbjct: 96 EDGTETEIN-EPGPEISEEELEALLEKLKALLKKGDIVVLSG 136
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.318 0.135 0.393
Gapped
Lambda K H
0.267 0.0677 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 22,970,428
Number of extensions: 2316664
Number of successful extensions: 2981
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2580
Number of HSP's successfully gapped: 255
Length of query: 442
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 342
Effective length of database: 6,502,202
Effective search space: 2223753084
Effective search space used: 2223753084
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 61 (27.4 bits)