RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy5844
         (442 letters)



>gnl|CDD|143459 cd07141, ALDH_F1AB_F2_RALDH1, NAD+-dependent retinal dehydrogenase
           1, ALDH families 1A, 1B, and 2-like.  NAD+-dependent
           retinal dehydrogenase 1 (RALDH 1, ALDH1, EC=1.2.1.36)
           also known as aldehyde dehydrogenase family 1 member A1
           (ALDH1A1) in humans, is a homotetrameric, cytosolic
           enzyme that catalyzes the oxidation of retinaldehyde to
           retinoic acid. Human ALDH1B1 and ALDH2 are also in this
           cluster; both are mitochrondrial homotetramers which
           play important roles in acetaldehyde oxidation; ALDH1B1
           in response to UV light exposure and ALDH2 during
           ethanol metabolism.
          Length = 481

 Score =  689 bits (1779), Expect = 0.0
 Identities = 258/348 (74%), Positives = 299/348 (85%), Gaps = 1/348 (0%)

Query: 50  SSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVC 109
           +SLETLDNGKP++ SY VD+  +++ LRYYAG+ADKIHGKTIP++GDFF YTRHEPVGVC
Sbjct: 90  ASLETLDNGKPFSKSYLVDLPGAIKVLRYYAGWADKIHGKTIPMDGDFFTYTRHEPVGVC 149

Query: 110 GQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVP 169
           GQIIPWNFPLLM AWK+APALA GNT+VLKPAEQTPLTAL + +LI EAG P GVVN+VP
Sbjct: 150 GQIIPWNFPLLMAAWKLAPALACGNTVVLKPAEQTPLTALYLASLIKEAGFPPGVVNVVP 209

Query: 170 GYGD-AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADAD 228
           GYG  AGAA+  HP +DKVAFTGSTEVGKL+QQ +G SNLKR TLELGGKSPNI+FADAD
Sbjct: 210 GYGPTAGAAISSHPDIDKVAFTGSTEVGKLIQQAAGKSNLKRVTLELGGKSPNIVFADAD 269

Query: 229 LDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQG 288
           LD AVE AH  LFFNMGQCCCAGSRTFVQ++IYDEFV RS ERAK+R VG+PFD   EQG
Sbjct: 270 LDYAVEQAHEALFFNMGQCCCAGSRTFVQESIYDEFVKRSVERAKKRVVGNPFDPKTEQG 329

Query: 289 PQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFG 348
           PQID+EQ  KILE IESGK +GA+L  GG R GDKG+++QPTVF++V DDM+IA+EEIFG
Sbjct: 330 PQIDEEQFKKILELIESGKKEGAKLECGGKRHGDKGYFIQPTVFSDVTDDMRIAKEEIFG 389

Query: 349 PVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
           PVQQ+ +F +I+EVIERANN+ YGLAAAVFTKD+DK    +  LRAGT
Sbjct: 390 PVQQIFKFKTIDEVIERANNTTYGLAAAVFTKDIDKAITFSNALRAGT 437



 Score = 38.5 bits (90), Expect = 0.007
 Identities = 9/20 (45%), Positives = 17/20 (85%)

Query: 31 KPKIEHTQLFINNEFVNAQS 50
           P+I++T++FINNE+ ++ S
Sbjct: 1  NPEIKYTKIFINNEWHDSVS 20


>gnl|CDD|143410 cd07091, ALDH_F1-2_Ald2-like, ALDH subfamily: ALDH families 1and 2,
           including 10-formyltetrahydrofolate dehydrogenase,
           NAD+-dependent retinal dehydrogenase 1 and related
           proteins.  ALDH subfamily which includes the
           NAD+-dependent retinal dehydrogenase 1 (RALDH 1, ALDH1,
           EC=1.2.1.36), also known as aldehyde dehydrogenase
           family 1 member A1 (ALDH1A1), in humans, a
           homotetrameric, cytosolic enzyme that catalyzes the
           oxidation of retinaldehyde to retinoic acid. Human
           ALDH1B1 and ALDH2 are also in this cluster; both are
           mitochrondrial homotetramers which play important roles
           in acetaldehyde oxidation; ALDH1B1 in response to UV
           light exposure and ALDH2 during ethanol metabolism.
           10-formyltetrahydrofolate dehydrogenase (FTHFDH,
           EC=1.5.1.6), also known as aldehyde dehydrogenase family
           1 member L1 (ALDH1L1), in humans, a multi-domain
           homotetramer with an N-terminal formyl transferase
           domain and a C-terminal ALDH domain. FTHFDH catalyzes an
           NADP+-dependent dehydrogenase reaction resulting in the
           conversion of 10-formyltetrahydrofolate to
           tetrahydrofolate and CO2. Also included in this
           subfamily is the Arabidosis aldehyde dehydrogenase
           family 2 members B4 and B7 (EC=1.2.1.3), which are
           mitochondrial, homotetramers that oxidize acetaldehyde
           and glycolaldehyde, as well as, the Arabidosis
           cytosolic, homotetramer ALDH2C4 (EC=1.2.1.3), an enzyme
           involved in the oxidation of sinapalehyde and
           coniferaldehyde. Also included is the AldA aldehyde
           dehydrogenase  of Aspergillus nidulans (locus AN0554),
           the aldehyde dehydrogenase 2 (YMR170c, ALD5, EC=1.2.1.5)
           of Saccharomyces cerevisiae, and other similar
           sequences.
          Length = 476

 Score =  642 bits (1659), Expect = 0.0
 Identities = 230/348 (66%), Positives = 282/348 (81%), Gaps = 1/348 (0%)

Query: 50  SSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVC 109
           ++LE+LDNGKP   S   DV  S++ LRYYAG+ADKI GKTIPI+G+F AYTR EP+GVC
Sbjct: 86  AALESLDNGKPLEESAKGDVALSIKCLRYYAGWADKIQGKTIPIDGNFLAYTRREPIGVC 145

Query: 110 GQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVP 169
           GQIIPWNFPLLMLAWK+APALA GNT+VLKPAEQTPL+AL +  LI EAG P GVVNIVP
Sbjct: 146 GQIIPWNFPLLMLAWKLAPALAAGNTVVLKPAEQTPLSALYLAELIKEAGFPPGVVNIVP 205

Query: 170 GYGD-AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADAD 228
           G+G  AGAA+  H  VDK+AFTGST VG+ + + +  SNLK+ TLELGGKSPNI+F DAD
Sbjct: 206 GFGPTAGAAISSHMDVDKIAFTGSTAVGRTIMEAAAKSNLKKVTLELGGKSPNIVFDDAD 265

Query: 229 LDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQG 288
           LD AVE A FG+FFN GQCCCAGSR FVQ++IYDEFV +   RA++R VGDPFD +  QG
Sbjct: 266 LDKAVEWAAFGIFFNQGQCCCAGSRIFVQESIYDEFVEKFKARAEKRVVGDPFDPDTFQG 325

Query: 289 PQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFG 348
           PQ+ K Q DKIL +IESGK +GA L+ GG R G KG+++QPTVF +V+DDMKIA+EEIFG
Sbjct: 326 PQVSKAQFDKILSYIESGKKEGATLLTGGERHGSKGYFIQPTVFTDVKDDMKIAKEEIFG 385

Query: 349 PVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
           PV  +++F + +EVIERAN+++YGLAA VFTKD++K   V++ L+AGT
Sbjct: 386 PVVTILKFKTEDEVIERANDTEYGLAAGVFTKDINKALRVSRALKAGT 433



 Score = 34.1 bits (79), Expect = 0.13
 Identities = 11/21 (52%), Positives = 14/21 (66%), Gaps = 1/21 (4%)

Query: 36 HTQLFINNEFVNAQSSLETLD 56
           T LFINNEFV++ S  +T  
Sbjct: 3  PTGLFINNEFVDSVSG-KTFP 22


>gnl|CDD|143460 cd07142, ALDH_F2BC, Arabidosis aldehyde dehydrogenase family 2 B4,
           B7, C4-like.  Included in this CD is the Arabidosis
           aldehyde dehydrogenase family 2 members B4 and B7
           (EC=1.2.1.3),  which are mitochondrial homotetramers
           that oxidize acetaldehyde and glycolaldehyde, but not
           L-lactaldehyde. Also in this group, is the Arabidosis
           cytosolic, homotetramer ALDH2C4 (EC=1.2.1.3), an enzyme
           involved in the oxidation of sinapalehyde and
           coniferaldehyde.
          Length = 476

 Score =  560 bits (1446), Expect = 0.0
 Identities = 223/349 (63%), Positives = 276/349 (79%), Gaps = 1/349 (0%)

Query: 50  SSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVC 109
           ++LET DNGKPY  +   +V  + R  RYYAG+ADKIHG T+P +G    YT HEP+GV 
Sbjct: 86  AALETWDNGKPYEQARYAEVPLAARLFRYYAGWADKIHGMTLPADGPHHVYTLHEPIGVV 145

Query: 110 GQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVP 169
           GQIIPWNFPLLM AWK+ PALA GNTIVLKPAEQTPL+AL    L AEAG+P+GV+NIV 
Sbjct: 146 GQIIPWNFPLLMFAWKVGPALACGNTIVLKPAEQTPLSALLAAKLAAEAGLPDGVLNIVT 205

Query: 170 GYGD-AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADAD 228
           G+G  AGAA+  H  VDKVAFTGSTEVGK++ Q +  SNLK  TLELGGKSP I+  DAD
Sbjct: 206 GFGPTAGAAIASHMDVDKVAFTGSTEVGKIIMQLAAKSNLKPVTLELGGKSPFIVCEDAD 265

Query: 229 LDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQG 288
           +D AVE AHF LFFN GQCCCAGSRTFV ++IYDEFV ++  RA +R VGDPF   VEQG
Sbjct: 266 VDKAVELAHFALFFNQGQCCCAGSRTFVHESIYDEFVEKAKARALKRVVGDPFRKGVEQG 325

Query: 289 PQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFG 348
           PQ+DKEQ +KIL +IE GK +GA L+ GG R G KG+Y+QPT+F++V+DDMKIAR+EIFG
Sbjct: 326 PQVDKEQFEKILSYIEHGKEEGATLITGGDRIGSKGYYIQPTIFSDVKDDMKIARDEIFG 385

Query: 349 PVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
           PVQ +++F +++EVI+RANNS YGLAA VF+K++D  N +++ L+AGT+
Sbjct: 386 PVQSILKFKTVDEVIKRANNSKYGLAAGVFSKNIDTANTLSRALKAGTV 434



 Score = 33.2 bits (76), Expect = 0.25
 Identities = 12/25 (48%), Positives = 19/25 (76%), Gaps = 2/25 (8%)

Query: 34 IEHTQLFINNEFVNAQS--SLETLD 56
          ++HT+LFIN +FV+A S  +  T+D
Sbjct: 1  VKHTKLFINGQFVDAASGKTFPTID 25


>gnl|CDD|215259 PLN02466, PLN02466, aldehyde dehydrogenase family 2 member.
          Length = 538

 Score =  528 bits (1362), Expect = 0.0
 Identities = 215/349 (61%), Positives = 263/349 (75%), Gaps = 1/349 (0%)

Query: 50  SSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVC 109
           ++LET DNGKPY  S   ++    R  RYYAG+ADKIHG T+P +G     T HEP+GV 
Sbjct: 140 AALETWDNGKPYEQSAKAELPMFARLFRYYAGWADKIHGLTVPADGPHHVQTLHEPIGVA 199

Query: 110 GQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVP 169
           GQIIPWNFPLLM AWK+ PALA GNTIVLK AEQTPL+AL    L+ EAG+P GV+N+V 
Sbjct: 200 GQIIPWNFPLLMFAWKVGPALACGNTIVLKTAEQTPLSALYAAKLLHEAGLPPGVLNVVS 259

Query: 170 GYG-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADAD 228
           G+G  AGAAL  H  VDK+AFTGST+ GK+V + +  SNLK  TLELGGKSP I+  DAD
Sbjct: 260 GFGPTAGAALASHMDVDKLAFTGSTDTGKIVLELAAKSNLKPVTLELGGKSPFIVCEDAD 319

Query: 229 LDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQG 288
           +D AVE AHF LFFN GQCCCAGSRTFV + +YDEFV ++  RA +R VGDPF   VEQG
Sbjct: 320 VDKAVELAHFALFFNQGQCCCAGSRTFVHERVYDEFVEKAKARALKRVVGDPFKKGVEQG 379

Query: 289 PQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFG 348
           PQID EQ +KIL +I+SG   GA L  GG R G KG+Y+QPTVF+NV+DDM IA++EIFG
Sbjct: 380 PQIDSEQFEKILRYIKSGVESGATLECGGDRFGSKGYYIQPTVFSNVQDDMLIAQDEIFG 439

Query: 349 PVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
           PVQ +++F  ++EVI RANN+ YGLAA VFT++LD  N +++ LR GT+
Sbjct: 440 PVQSILKFKDLDEVIRRANNTRYGLAAGVFTQNLDTANTLSRALRVGTV 488



 Score = 42.9 bits (101), Expect = 2e-04
 Identities = 23/58 (39%), Positives = 34/58 (58%), Gaps = 5/58 (8%)

Query: 4  LRNLALKSSTLGSLKNYS--AAAVPKPITKP-KIEHTQLFINNEFVNAQS--SLETLD 56
          LR+          ++ +S  AAAV +PIT P ++ +TQL IN +FV+A S  +  TLD
Sbjct: 22 LRSRGRNGGRGRGIRRFSTAAAAVEEPITPPVQVSYTQLLINGQFVDAASGKTFPTLD 79


>gnl|CDD|215767 pfam00171, Aldedh, Aldehyde dehydrogenase family.  This family of
           dehydrogenases act on aldehyde substrates. Members use
           NADP as a cofactor. The family includes the following
           members: The prototypical members are the aldehyde
           dehydrogenases EC:1.2.1.3. Succinate-semialdehyde
           dehydrogenase EC:1.2.1.16. Lactaldehyde dehydrogenase
           EC:1.2.1.22. Benzaldehyde dehydrogenase EC:1.2.1.28.
           Methylmalonate-semialdehyde dehydrogenase EC:1.2.1.27.
           Glyceraldehyde-3-phosphate dehydrogenase EC:1.2.1.9.
           Delta-1-pyrroline-5-carboxylate dehydrogenase EC:
           1.5.1.12. Acetaldehyde dehydrogenase EC:1.2.1.10.
           Glutamate-5-semialdehyde dehydrogenase EC:1.2.1.41. This
           family also includes omega crystallin, an eye lens
           protein from squid and octopus that has little aldehyde
           dehydrogenase activity.
          Length = 459

 Score =  523 bits (1350), Expect = 0.0
 Identities = 196/347 (56%), Positives = 250/347 (72%), Gaps = 3/347 (0%)

Query: 52  LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQ 111
           LETL+ GKP   +   +V  ++ TLRYYAG A K+ G+T+P +    AYTR EP+GV G 
Sbjct: 74  LETLETGKPLAEAR-GEVPRAIDTLRYYAGLARKLEGETLPSDPGVLAYTRREPLGVVGA 132

Query: 112 IIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGY 171
           I PWNFPLL+ AWKIAPALA GNT+VLKP+E TPLTAL +  L  EAG+P GV+N+V G 
Sbjct: 133 ITPWNFPLLLAAWKIAPALAAGNTVVLKPSELTPLTALLLAELFEEAGLPPGVLNVVTGS 192

Query: 172 G-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLD 230
           G + G ALV+HP VDKV+FTGSTEVG+ + + + A NLKR TLELGGK+P I+F DADLD
Sbjct: 193 GSEVGDALVEHPDVDKVSFTGSTEVGRRIAK-AAAKNLKRVTLELGGKNPLIVFDDADLD 251

Query: 231 AAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQ 290
           AAVEGA FG F N GQ C AGSR  V ++IYDEFV R  E AK   VGDP D + + GP 
Sbjct: 252 AAVEGAVFGAFGNAGQVCTAGSRLLVHESIYDEFVERLVEAAKSLKVGDPLDPDTDIGPL 311

Query: 291 IDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPV 350
           I K+Q +++L +IE  K +GA+L+ GG    +KG++V+PTV A+V  DM+IA+EEIFGPV
Sbjct: 312 ISKKQRERVLSYIEDAKEEGAKLLCGGEAGLEKGYFVEPTVLADVTPDMRIAQEEIFGPV 371

Query: 351 QQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
             +I F   +E IE AN+++YGLAA VFT DL++   V + L AG +
Sbjct: 372 LSVIPFKDEDEAIELANDTEYGLAAGVFTNDLERALRVARRLEAGMV 418


>gnl|CDD|143461 cd07143, ALDH_AldA_AN0554, Aspergillus nidulans aldehyde
           dehydrogenase, AldA (AN0554)-like.  NAD(P)+-dependent
           aldehyde dehydrogenase (AldA) of Aspergillus nidulans
           (locus AN0554), and other similar sequences, are present
           in this CD.
          Length = 481

 Score =  498 bits (1284), Expect = e-175
 Identities = 201/348 (57%), Positives = 260/348 (74%), Gaps = 1/348 (0%)

Query: 50  SSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVC 109
           +S+E LDNGK +  +  VDV+ S  T RYY G+ADKIHG+ I  +     YTRHEP+GVC
Sbjct: 89  ASIEALDNGKTFGTAKRVDVQASADTFRYYGGWADKIHGQVIETDIKKLTYTRHEPIGVC 148

Query: 110 GQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVP 169
           GQIIPWNFPLLM AWKIAPALA GNTIVLKP+E TPL+AL +  LI EAG P GV+N+V 
Sbjct: 149 GQIIPWNFPLLMCAWKIAPALAAGNTIVLKPSELTPLSALYMTKLIPEAGFPPGVINVVS 208

Query: 170 GYGD-AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADAD 228
           GYG   G A+  H  +DKVAFTGST VG+ V + +  SNLK+ TLELGGKSPNI+F DAD
Sbjct: 209 GYGRTCGNAISSHMDIDKVAFTGSTLVGRKVMEAAAKSNLKKVTLELGGKSPNIVFDDAD 268

Query: 229 LDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQG 288
           L++AV    +G+FFN GQ CCAGSR +VQ+ IYD+FV R  E+AK+  VGDPF  +  QG
Sbjct: 269 LESAVVWTAYGIFFNHGQVCCAGSRIYVQEGIYDKFVKRFKEKAKKLKVGDPFAEDTFQG 328

Query: 289 PQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFG 348
           PQ+ + Q ++I+ +IESGK++GA +  GG R G++G++++PT+F +V +DMKI +EEIFG
Sbjct: 329 PQVSQIQYERIMSYIESGKAEGATVETGGKRHGNEGYFIEPTIFTDVTEDMKIVKEEIFG 388

Query: 349 PVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
           PV  +I+F + EE I+RAN+S YGLAAAVFT +++    V   L+AGT
Sbjct: 389 PVVAVIKFKTEEEAIKRANDSTYGLAAAVFTNNINNAIRVANALKAGT 436


>gnl|CDD|143397 cd07078, ALDH, NAD(P)+ dependent aldehyde dehydrogenase family.
           The aldehyde dehydrogenase family (ALDH) of NAD(P)+
           dependent enzymes, in general, oxidize a wide range of
           endogenous and exogenous aliphatic and aromatic
           aldehydes to their corresponding carboxylic acids and
           play an  important role in detoxification. Besides
           aldehyde detoxification, many ALDH isozymes possess
           multiple additional catalytic and non-catalytic
           functions such as participating in  metabolic pathways,
           or as  binding proteins, or as osmoregulants, to mention
           a few. The enzyme has three domains, a NAD(P)+
           cofactor-binding domain, a catalytic domain, and a
           bridging domain; and the active enzyme  is generally
           either homodimeric or homotetrameric. The catalytic
           mechanism is proposed to involve cofactor binding,
           resulting in a conformational change and activation of
           an invariant catalytic cysteine nucleophile. The
           cysteine and aldehyde substrate form an oxyanion
           thiohemiacetal intermediate resulting in hydride
           transfer to the cofactor and formation of a
           thioacylenzyme intermediate. Hydrolysis of the
           thioacylenzyme and release of the carboxylic acid
           product occurs, and in most cases, the reduced cofactor
           dissociates from the enzyme. The evolutionary
           phylogenetic tree of ALDHs appears to have an initial
           bifurcation between what has been characterized as the
           classical aldehyde dehydrogenases, the ALDH family
           (ALDH) and extended family members or aldehyde
           dehydrogenase-like (ALDH-like) proteins. The ALDH
           proteins are represented by enzymes which share a number
           of highly conserved residues necessary for catalysis and
           cofactor binding and they include such proteins as
           retinal dehydrogenase, 10-formyltetrahydrofolate
           dehydrogenase, non-phosphorylating glyceraldehyde
           3-phosphate dehydrogenase,
           delta(1)-pyrroline-5-carboxylate dehydrogenases,
           alpha-ketoglutaric semialdehyde dehydrogenase,
           alpha-aminoadipic semialdehyde dehydrogenase, coniferyl
           aldehyde dehydrogenase and succinate-semialdehyde
           dehydrogenase.  Included in this larger group are all
           human, Arabidopsis, Tortula, fungal, protozoan, and
           Drosophila ALDHs identified in families ALDH1 through
           ALDH22 with the exception of families ALDH18, ALDH19,
           and ALDH20 which are present in the ALDH-like group.
          Length = 432

 Score =  495 bits (1276), Expect = e-174
 Identities = 194/348 (55%), Positives = 244/348 (70%), Gaps = 5/348 (1%)

Query: 52  LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPI-NGDFFAYTRHEPVGVCG 110
           LETL+ GKP   +   +V  +  T RYYAG A ++HG+ IP  +    A  R EP+GV G
Sbjct: 43  LETLETGKPIEEAL-GEVARAADTFRYYAGLARRLHGEVIPSPDPGELAIVRREPLGVVG 101

Query: 111 QIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPG 170
            I PWNFPLL+ AWK+APALA GNT+VLKP+E TPLTAL +  L+AEAG+P GV+N+V G
Sbjct: 102 AITPWNFPLLLAAWKLAPALAAGNTVVLKPSELTPLTALLLAELLAEAGLPPGVLNVVTG 161

Query: 171 YGD-AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADL 229
            GD  GAAL  HP VDK++FTGST VGK + + + A NLKR TLELGGKSP I+F DADL
Sbjct: 162 DGDEVGAALASHPRVDKISFTGSTAVGKAIMR-AAAENLKRVTLELGGKSPLIVFDDADL 220

Query: 230 DAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGP 289
           DAAV+GA FG F N GQ C A SR  V ++IYDEFV R  ER K   VG+P D + + GP
Sbjct: 221 DAAVKGAVFGAFGNAGQVCTAASRLLVHESIYDEFVERLVERVKALKVGNPLDPDTDMGP 280

Query: 290 QIDKEQMDKILEFIESGKSQGAQLVAGGGR-AGDKGFYVQPTVFANVRDDMKIAREEIFG 348
            I   Q+D++L +IE  K++GA+L+ GG R  G KG++V PTV  +V  DM IA+EEIFG
Sbjct: 281 LISAAQLDRVLAYIEDAKAEGAKLLCGGKRLEGGKGYFVPPTVLTDVDPDMPIAQEEIFG 340

Query: 349 PVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
           PV  +I F   EE IE AN+++YGLAA VFT+DL++   V + L AGT
Sbjct: 341 PVLPVIPFKDEEEAIELANDTEYGLAAGVFTRDLERALRVAERLEAGT 388


>gnl|CDD|143430 cd07112, ALDH_GABALDH-PuuC, Escherichia coli NADP+-dependent
           gamma-glutamyl-gamma-aminobutyraldehyde dehydrogenase
           PuuC-like.  NADP+-dependent,
           gamma-glutamyl-gamma-aminobutyraldehyde dehydrogenase
           (GABALDH) PuuC of  Escherichia coli which catalyzes the
           conversion of putrescine to 4-aminobutanoate and other
           similar sequences are present in this CD.
          Length = 462

 Score =  483 bits (1246), Expect = e-169
 Identities = 191/349 (54%), Positives = 240/349 (68%), Gaps = 4/349 (1%)

Query: 52  LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQ 111
           LETLD GKP + + +VDV  +  T R+YA   DK++G+  P   D  A    EP+GV G 
Sbjct: 71  LETLDMGKPISDALAVDVPSAANTFRWYAEAIDKVYGEVAPTGPDALALITREPLGVVGA 130

Query: 112 IIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGY 171
           ++PWNFPLLM AWKIAPALA GN++VLKPAEQ+PLTAL +  L  EAG+P GV+N+VPG+
Sbjct: 131 VVPWNFPLLMAAWKIAPALAAGNSVVLKPAEQSPLTALRLAELALEAGLPAGVLNVVPGF 190

Query: 172 GD-AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADA-DL 229
           G  AG AL  H  VD +AFTGSTEVG+   + SG SNLKR  LE GGKSPNI+FADA DL
Sbjct: 191 GHTAGEALGLHMDVDALAFTGSTEVGRRFLEYSGQSNLKRVWLECGGKSPNIVFADAPDL 250

Query: 230 DAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGP 289
           DAA E A  G+F+N G+ C AGSR  V ++I DEF+ +    A+    GDP D     G 
Sbjct: 251 DAAAEAAAAGIFWNQGEVCSAGSRLLVHESIKDEFLEKVVAAAREWKPGDPLDPATRMGA 310

Query: 290 QIDKEQMDKILEFIESGKSQGAQLVAGGGR--AGDKGFYVQPTVFANVRDDMKIAREEIF 347
            + +   DK+L +IESGK++GA+LVAGG R      GF+V+PTVF  V  DM+IAREEIF
Sbjct: 311 LVSEAHFDKVLGYIESGKAEGARLVAGGKRVLTETGGFFVEPTVFDGVTPDMRIAREEIF 370

Query: 348 GPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
           GPV  +I F S EE +  AN+S YGLAA+V+T DL + + V + LRAGT
Sbjct: 371 GPVLSVITFDSEEEAVALANDSVYGLAASVWTSDLSRAHRVARRLRAGT 419


>gnl|CDD|143462 cd07144, ALDH_ALD2-YMR170C, Saccharomyces cerevisiae aldehyde
           dehydrogenase 2 (YMR170c)-like.  NAD(P)+-dependent
           Saccharomyces cerevisiae aldehyde dehydrogenase 2
           (YMR170c, ALD5, EC=1.2.1.5) and other similar sequences,
           are present in this CD.
          Length = 484

 Score =  472 bits (1217), Expect = e-165
 Identities = 195/372 (52%), Positives = 260/372 (69%), Gaps = 14/372 (3%)

Query: 51  SLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCG 110
           ++E LD+GKPY+ +   D++  +  +RYYAG+ADKI GKTIP + +  AYT HEP GVCG
Sbjct: 90  AIEALDSGKPYHSNALGDLDEIIAVIRYYAGWADKIQGKTIPTSPNKLAYTLHEPYGVCG 149

Query: 111 QIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPG 170
           QIIPWN+PL M AWK+APALA GNT+V+KPAE TPL+ L    L+ EAG P GVVNI+PG
Sbjct: 150 QIIPWNYPLAMAAWKLAPALAAGNTVVIKPAENTPLSLLYFANLVKEAGFPPGVVNIIPG 209

Query: 171 YGD-AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADL 229
           YG  AG+AL +HP VDK+AFTGST  G+LV + + A NLK  TLE GGKSP ++F DADL
Sbjct: 210 YGAVAGSALAEHPDVDKIAFTGSTATGRLVMKAA-AQNLKAVTLECGGKSPALVFEDADL 268

Query: 230 DAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRT-VGDPFDLNVEQG 288
           D AV+ A  G+ +N GQ C A SR +VQ++IYD+FV +  E  K+   VG PFD +   G
Sbjct: 269 DQAVKWAAAGIMYNSGQNCTATSRIYVQESIYDKFVEKFVEHVKQNYKVGSPFDDDTVVG 328

Query: 289 PQIDKEQMDKILEFIESGKSQGAQLVAGG---GRAGDKGFYVQPTVFANVRDDMKIAREE 345
           PQ+ K Q D++L +IE GK +GA+LV GG        KG+++ PT+F +V  DM+I +EE
Sbjct: 329 PQVSKTQYDRVLSYIEKGKKEGAKLVYGGEKAPEGLGKGYFIPPTIFTDVPQDMRIVKEE 388

Query: 346 IFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTLIRFSSIEE 405
           IFGPV  + +F + EE I++AN++ YGLAAAVFTKD+ + + V + L AG +        
Sbjct: 389 IFGPVVVISKFKTYEEAIKKANDTTYGLAAAVFTKDIRRAHRVARELEAGMVW------- 441

Query: 406 VIERANNSDYGL 417
            I  +N+SD G+
Sbjct: 442 -INSSNDSDVGV 452


>gnl|CDD|143432 cd07114, ALDH_DhaS, Uncharacterized Candidatus pelagibacter
           aldehyde dehydrogenase, DhaS-like.  Uncharacterized
           aldehyde dehydrogenase from Candidatus pelagibacter
           (DhaS) and other related sequences are present in this
           CD.
          Length = 457

 Score =  470 bits (1212), Expect = e-164
 Identities = 175/352 (49%), Positives = 230/352 (65%), Gaps = 10/352 (2%)

Query: 52  LETLDNGKPYN-MSYSVDVEYSLRTLRYYAGFADKIHGKTIPIN-GDFFAYTRHEPVGVC 109
           LET DNGK        V   Y     RYYAG ADKI G  IP++ GD+  +TR EP+GV 
Sbjct: 66  LETRDNGKLIRETRAQV--RYLAEWYRYYAGLADKIEGAVIPVDKGDYLNFTRREPLGVV 123

Query: 110 GQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVP 169
             I PWN PLL+LA K+APALA GNT+VLKP+E TP + L +  L  EAG P GVVN+V 
Sbjct: 124 AAITPWNSPLLLLAKKLAPALAAGNTVVLKPSEHTPASTLELAKLAEEAGFPPGVVNVVT 183

Query: 170 GYG-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADAD 228
           G+G + G ALV+HP V K+AFTG TE G+ + + + A NL   TLELGGKSPNI+F DAD
Sbjct: 184 GFGPETGEALVEHPLVAKIAFTGGTETGRHIARAA-AENLAPVTLELGGKSPNIVFDDAD 242

Query: 229 LDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQG 288
           LDAAV G   G+F   GQ C AGSR  VQ +IYDEFV R   RA+   VGDP D   + G
Sbjct: 243 LDAAVNGVVAGIFAAAGQTCVAGSRLLVQRSIYDEFVERLVARARAIRVGDPLDPETQMG 302

Query: 289 PQIDKEQMDKILEFIESGKSQGAQLVAGGGRAG----DKGFYVQPTVFANVRDDMKIARE 344
           P   + Q++K+  ++   + +GA+++ GG R        G++ +PT+ A+V +DM+IA+E
Sbjct: 303 PLATERQLEKVERYVARAREEGARVLTGGERPSGADLGAGYFFEPTILADVTNDMRIAQE 362

Query: 345 EIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
           E+FGPV  +I F   EE I  AN+S+YGLAA ++T+DL + + V + + AGT
Sbjct: 363 EVFGPVLSVIPFDDEEEAIALANDSEYGLAAGIWTRDLARAHRVARAIEAGT 414


>gnl|CDD|143437 cd07119, ALDH_BADH-GbsA, Bacillus subtilis NAD+-dependent betaine
           aldehyde dehydrogenase-like.  Included in this CD is the
           NAD+-dependent, betaine aldehyde dehydrogenase (BADH,
           GbsA, EC=1.2.1.8) of Bacillus subtilis involved in the
           synthesis of the osmoprotectant glycine betaine from
           choline or glycine betaine aldehyde.
          Length = 482

 Score =  466 bits (1202), Expect = e-162
 Identities = 174/351 (49%), Positives = 234/351 (66%), Gaps = 7/351 (1%)

Query: 52  LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQ 111
           LETL+ GK    S   D++      RYYAG A K  G+   +     + T  EPVGVCG 
Sbjct: 82  LETLNTGKTLRESEI-DIDDVANCFRYYAGLATKETGEVYDVPPHVISRTVREPVGVCGL 140

Query: 112 IIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGY 171
           I PWN+PLL  AWK+APALA GNT+V+KP+E TPLT +A+  LI EAG+P GVVN+V G 
Sbjct: 141 ITPWNYPLLQAAWKLAPALAAGNTVVIKPSEVTPLTTIALFELIEEAGLPAGVVNLVTGS 200

Query: 172 GD-AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLD 230
           G   GA L + P VD V+FTG T  G+ + + + A N+K+  LELGGK+PNI+FADAD +
Sbjct: 201 GATVGAELAESPDVDLVSFTGGTATGRSIMR-AAAGNVKKVALELGGKNPNIVFADADFE 259

Query: 231 AAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQ 290
            AV+ A  G+FFN GQ C AGSR  V+++I+D+FVA   ERAK+  +G+  D + E GP 
Sbjct: 260 TAVDQALNGVFFNAGQVCSAGSRLLVEESIHDKFVAALAERAKKIKLGNGLDADTEMGPL 319

Query: 291 IDKEQMDKILEFIESGKSQGAQLVAGGGRAGD----KGFYVQPTVFANVRDDMKIAREEI 346
           +  E  +K+L +I+ GK +GA+LV GG R       KG++V+PT+F +V   M+I +EEI
Sbjct: 320 VSAEHREKVLSYIQLGKEEGARLVCGGKRPTGDELAKGYFVEPTIFDDVDRTMRIVQEEI 379

Query: 347 FGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
           FGPV  + RF + EE I  AN++ YGLA AV+TKD+ + N V + LRAGT+
Sbjct: 380 FGPVLTVERFDTEEEAIRLANDTPYGLAGAVWTKDIARANRVARRLRAGTV 430


>gnl|CDD|215410 PLN02766, PLN02766, coniferyl-aldehyde dehydrogenase.
          Length = 501

 Score =  465 bits (1198), Expect = e-162
 Identities = 191/349 (54%), Positives = 257/349 (73%), Gaps = 1/349 (0%)

Query: 50  SSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVC 109
           ++L+T+D GK + +  +VD+  +   LRYYAG ADKIHG+T+ ++     YT  EP+GV 
Sbjct: 103 AALDTIDAGKLFALGKAVDIPAAAGLLRYYAGAADKIHGETLKMSRQLQGYTLKEPIGVV 162

Query: 110 GQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVP 169
           G IIPWNFP  M   K+APALA G T+V+KPAEQTPL+AL    L   AG+P+GV+N+V 
Sbjct: 163 GHIIPWNFPSTMFFMKVAPALAAGCTMVVKPAEQTPLSALFYAHLAKLAGVPDGVINVVT 222

Query: 170 GYG-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADAD 228
           G+G  AGAA+  H  VDKV+FTGSTEVG+ + Q +  SNLK+ +LELGGKSP +IF DAD
Sbjct: 223 GFGPTAGAAIASHMDVDKVSFTGSTEVGRKIMQAAATSNLKQVSLELGGKSPLLIFDDAD 282

Query: 229 LDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQG 288
           +D AV+ A  G+F+N G+ C A SR +VQ+ IYDEFV +  E+AK   VGDPFD    QG
Sbjct: 283 VDMAVDLALLGIFYNKGEICVASSRVYVQEGIYDEFVKKLVEKAKDWVVGDPFDPRARQG 342

Query: 289 PQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFG 348
           PQ+DK+Q +KIL +IE GK +GA L+ GG   GDKG+Y++PT+F +V +DMKIA++EIFG
Sbjct: 343 PQVDKQQFEKILSYIEHGKREGATLLTGGKPCGDKGYYIEPTIFTDVTEDMKIAQDEIFG 402

Query: 349 PVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
           PV  L++F ++EE I++ANN+ YGLAA + TKDLD  N V++ +RAGT+
Sbjct: 403 PVMSLMKFKTVEEAIKKANNTKYGLAAGIVTKDLDVANTVSRSIRAGTI 451



 Score = 34.0 bits (78), Expect = 0.16
 Identities = 15/32 (46%), Positives = 21/32 (65%), Gaps = 2/32 (6%)

Query: 27 KPITKPKIEHTQLFINNEFVNAQS--SLETLD 56
            +  P+I+ T+LFIN EFV+A S  + ET D
Sbjct: 11 SGVKVPEIKFTKLFINGEFVDAASGKTFETRD 42


>gnl|CDD|143433 cd07115, ALDH_HMSADH_HapE, Pseudomonas fluorescens 4-hydroxymuconic
           semialdehyde dehydrogenase-like.  4-hydroxymuconic
           semialdehyde dehydrogenase (HapE, EC=1.2.1.61) of
           Pseudomonas fluorescens ACB involved in
           4-hydroxyacetophenone degradation, and putative
           hydroxycaproate semialdehyde dehydrogenase (ChnE) of
           Brachymonas petroleovorans involved in cyclohexane
           metabolism, and other similar sequences, are present in
           this CD.
          Length = 453

 Score =  460 bits (1186), Expect = e-160
 Identities = 191/348 (54%), Positives = 253/348 (72%), Gaps = 2/348 (0%)

Query: 50  SSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVC 109
           + LE+LD GKP   +  +DV  +  T RYYAG+ADKI G+ IP+ G F  YT  EPVGV 
Sbjct: 62  ARLESLDTGKPIRAARRLDVPRAADTFRYYAGWADKIEGEVIPVRGPFLNYTVREPVGVV 121

Query: 110 GQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVP 169
           G I+PWNFPL+  AWK+APALA GNT+VLKPAE TPL+AL +  L+AEAG P GV+N+V 
Sbjct: 122 GAIVPWNFPLMFAAWKVAPALAAGNTVVLKPAELTPLSALRIAELMAEAGFPAGVLNVVT 181

Query: 170 GYGD-AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADAD 228
           G+G+ AGAALV+HP VDK+ FTGST VG+ + QG+ A NLKR +LELGGKS NI+FADAD
Sbjct: 182 GFGEVAGAALVEHPDVDKITFTGSTAVGRKIMQGA-AGNLKRVSLELGGKSANIVFADAD 240

Query: 229 LDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQG 288
           LDAAV  A  G+F+N GQ C AGSR  V ++IYDEF+ R    A+    GDP D   + G
Sbjct: 241 LDAAVRAAATGIFYNQGQMCTAGSRLLVHESIYDEFLERFTSLARSLRPGDPLDPKTQMG 300

Query: 289 PQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFG 348
           P + + Q D++L++++ G+ +GA+L+ GG R G +GF+V+PT+FA V  +M+IA+EEIFG
Sbjct: 301 PLVSQAQFDRVLDYVDVGREEGARLLTGGKRPGARGFFVEPTIFAAVPPEMRIAQEEIFG 360

Query: 349 PVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
           PV  ++RF   EE +  AN ++YGLAA V+T+DL + + V   L+AGT
Sbjct: 361 PVVSVMRFRDEEEALRIANGTEYGLAAGVWTRDLGRAHRVAAALKAGT 408


>gnl|CDD|223944 COG1012, PutA, NAD-dependent aldehyde dehydrogenases [Energy
           production and conversion].
          Length = 472

 Score =  452 bits (1166), Expect = e-157
 Identities = 183/346 (52%), Positives = 240/346 (69%), Gaps = 5/346 (1%)

Query: 52  LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQ 111
           L TL+ GKP + +   ++  +   +RYYA  A ++ G+TIP +    A  R EP+GV G 
Sbjct: 81  LITLETGKPISEAR-GEIARAADFIRYYAEEARRLEGETIPTDKGSKALVRREPLGVVGA 139

Query: 112 IIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGY 171
           I PWNFPL + AWK+APALA GNT+VLKP+EQTPL+ALA+  L AEAG+P GV+N+V G 
Sbjct: 140 ITPWNFPLALAAWKLAPALAAGNTVVLKPSEQTPLSALALAELAAEAGLPAGVLNVVTGG 199

Query: 172 G-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLD 230
           G + G ALV HP VD ++FTGST VG+ +   + A+NLK  TLELGGKSP I+  DADLD
Sbjct: 200 GAEVGDALVAHPDVDAISFTGSTAVGRAIAAAA-AANLKPVTLELGGKSPAIVLEDADLD 258

Query: 231 AAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQ 290
           AAV+ A FG FFN GQ C A SR  V +++YDEFV R   RA    VGDP D + + GP 
Sbjct: 259 AAVDAAVFGAFFNAGQRCTAASRLIVHESVYDEFVERLVARAASLKVGDPLDPSTDLGPL 318

Query: 291 IDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPV 350
           I +EQ+D++  +IE   ++GA+L+AGG R G  G++V+PT+   V  DM+IAREEIFGPV
Sbjct: 319 ISEEQLDRVEGYIEDAVAEGARLLAGGKRPG--GYFVEPTILEGVTPDMRIAREEIFGPV 376

Query: 351 QQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
             +IRF   EE IE AN+++YGLAAA+FT+DL +   V + L AG 
Sbjct: 377 LPVIRFKDEEEAIELANDTEYGLAAAIFTRDLARAFRVARRLEAGM 422


>gnl|CDD|143412 cd07093, ALDH_F8_HMSADH, Human aldehyde dehydrogenase family 8
           member A1-like.  In humans, the  aldehyde dehydrogenase
           family 8 member A1 (ALDH8A1) protein functions to
           convert 9-cis-retinal to 9-cis-retinoic acid and has a
           preference for NAD+. Also included in this CD is the
           2-hydroxymuconic semialdehyde dehydrogenase (HMSADH)
           which catalyzes the conversion of 2-hydroxymuconic
           semialdehyde to 4-oxalocrotonate, a step in the meta
           cleavage pathway of aromatic hydrocarbons in bacteria.
           Such HMSADHs seen here are: XylG of the TOL plasmid pWW0
           of Pseudomonas putida, TomC  of Burkholderia cepacia G4,
           and AphC of Comamonas testosterone.
          Length = 455

 Score =  442 bits (1139), Expect = e-153
 Identities = 173/351 (49%), Positives = 239/351 (68%), Gaps = 6/351 (1%)

Query: 52  LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQ 111
           LE+LD GKP  ++ + D+  +    R++A +  ++ G++ P +G    Y   +PVGV G 
Sbjct: 64  LESLDTGKPITLARTRDIPRAAANFRFFADYILQLDGESYPQDGGALNYVLRQPVGVAGL 123

Query: 112 IIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGY 171
           I PWN PL++L WKIAPALA GNT+VLKP+E TPLTA  +  L  EAG+P GVVN+V G+
Sbjct: 124 ITPWNLPLMLLTWKIAPALAFGNTVVLKPSEWTPLTAWLLAELANEAGLPPGVVNVVHGF 183

Query: 172 G-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLD 230
           G +AGAALV HP VD ++FTG T  G+ + + + A NLK  +LELGGK+PNI+FADADLD
Sbjct: 184 GPEAGAALVAHPDVDLISFTGETATGRTIMR-AAAPNLKPVSLELGGKNPNIVFADADLD 242

Query: 231 AAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQ 290
            AV+ A    F N G+ C AGSR  VQ +IYDEF+ R  ERAK   VGDP D + E GP 
Sbjct: 243 RAVDAAVRSSFSNNGEVCLAGSRILVQRSIYDEFLERFVERAKALKVGDPLDPDTEVGPL 302

Query: 291 IDKEQMDKILEFIESGKSQGAQLVAGGGRAG----DKGFYVQPTVFANVRDDMKIAREEI 346
           I KE ++K+L ++E  +++GA ++ GGGR      + G++V+PTV   + +D ++A+EEI
Sbjct: 303 ISKEHLEKVLGYVELARAEGATILTGGGRPELPDLEGGYFVEPTVITGLDNDSRVAQEEI 362

Query: 347 FGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
           FGPV  +I F   EE IE AN++ YGLAA V+T+DL + + V + L AGT+
Sbjct: 363 FGPVVTVIPFDDEEEAIELANDTPYGLAAYVWTRDLGRAHRVARRLEAGTV 413


>gnl|CDD|143427 cd07109, ALDH_AAS00426, Uncharacterized Saccharopolyspora spinosa
           aldehyde dehydrogenase (AAS00426)-like.  Uncharacterized
           aldehyde dehydrogenase of Saccharopolyspora spinosa
           (AAS00426) and other similar sequences, are present in
           this CD.
          Length = 454

 Score =  434 bits (1118), Expect = e-150
 Identities = 166/350 (47%), Positives = 224/350 (64%), Gaps = 7/350 (2%)

Query: 50  SSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVC 109
           + LE+LD GKP   + + DVE + R   YY G ADK+HG+TIP+   +F YT  EP GV 
Sbjct: 63  ARLESLDTGKPLTQARA-DVEAAARYFEYYGGAADKLHGETIPLGPGYFVYTVREPHGVT 121

Query: 110 GQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVP 169
           G IIPWN+PL +    +APALA GN +V+KPAE  PLTAL +  L  EAG+P G +N+V 
Sbjct: 122 GHIIPWNYPLQITGRSVAPALAAGNAVVVKPAEDAPLTALRLAELAEEAGLPAGALNVVT 181

Query: 170 GYG-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADAD 228
           G G +AGAALV HPGVD ++FTGS E G  V + + A N+   TLELGGKSP I+FADAD
Sbjct: 182 GLGAEAGAALVAHPGVDHISFTGSVETGIAVMR-AAAENVVPVTLELGGKSPQIVFADAD 240

Query: 229 LDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQG 288
           L+AA+      +  N GQ C AGSR  V  +IYDE + R  ER +   VG   + + + G
Sbjct: 241 LEAALPVVVNAIIQNAGQTCSAGSRLLVHRSIYDEVLERLVERFRALRVGPGLE-DPDLG 299

Query: 289 PQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGD---KGFYVQPTVFANVRDDMKIAREE 345
           P I  +Q+D++  F+   +++GA++VAGG  A      G++V PT+  +V  D ++A+EE
Sbjct: 300 PLISAKQLDRVEGFVARARARGARIVAGGRIAEGAPAGGYFVAPTLLDDVPPDSRLAQEE 359

Query: 346 IFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
           IFGPV  ++ F    E I  AN +DYGL A V+T+D D+   V + LRAG
Sbjct: 360 IFGPVLAVMPFDDEAEAIALANGTDYGLVAGVWTRDGDRALRVARRLRAG 409


>gnl|CDD|183918 PRK13252, PRK13252, betaine aldehyde dehydrogenase; Provisional.
          Length = 488

 Score =  435 bits (1120), Expect = e-150
 Identities = 169/352 (48%), Positives = 218/352 (61%), Gaps = 5/352 (1%)

Query: 50  SSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVC 109
           ++LETLD GKP   +  VD+      L YYAG A  + G+ IP+ G  F YTR EP+GVC
Sbjct: 87  AALETLDTGKPIQETSVVDIVTGADVLEYYAGLAPALEGEQIPLRGGSFVYTRREPLGVC 146

Query: 110 GQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVP 169
             I  WN+P+ +  WK APALA GN ++ KP+E TPLTAL +  +  EAG+P+GV N+V 
Sbjct: 147 AGIGAWNYPIQIACWKSAPALAAGNAMIFKPSEVTPLTALKLAEIYTEAGLPDGVFNVVQ 206

Query: 170 GYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADL 229
           G G  GA L +HP + KV+FTG    GK V   + AS LK  T+ELGGKSP I+F DADL
Sbjct: 207 GDGRVGAWLTEHPDIAKVSFTGGVPTGKKVMAAAAAS-LKEVTMELGGKSPLIVFDDADL 265

Query: 230 DAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGP 289
           D A + A    F++ GQ C  G+R FVQ +I   F AR  ER +R  +GDP D     GP
Sbjct: 266 DRAADIAMLANFYSSGQVCTNGTRVFVQKSIKAAFEARLLERVERIRIGDPMDPATNFGP 325

Query: 290 QIDKEQMDKILEFIESGKSQGAQLVAGGGR----AGDKGFYVQPTVFANVRDDMKIAREE 345
            +     DK+L +IE GK++GA+L+ GG R        G +V PTVF +  DDM I REE
Sbjct: 326 LVSFAHRDKVLGYIEKGKAEGARLLCGGERLTEGGFANGAFVAPTVFTDCTDDMTIVREE 385

Query: 346 IFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
           IFGPV  ++ F   +EVI RAN+++YGLAA VFT DL + + V   L AG  
Sbjct: 386 IFGPVMSVLTFDDEDEVIARANDTEYGLAAGVFTADLSRAHRVIHQLEAGIC 437


>gnl|CDD|143409 cd07090, ALDH_F9_TMBADH, NAD+-dependent
           4-trimethylaminobutyraldehyde dehydrogenase, ALDH family
           9A1.  NAD+-dependent, 4-trimethylaminobutyraldehyde
           dehydrogenase (TMABADH, EC=1.2.1.47), also known as
           aldehyde dehydrogenase family 9 member A1 (ALDH9A1) in
           humans, is a cytosolic tetramer which catalyzes the
           oxidation of gamma-aminobutyraldehyde involved in
           4-aminobutyric acid (GABA) biosynthesis  and also
           oxidizes betaine aldehyde
           (gamma-trimethylaminobutyraldehyde) which is involved in
           carnitine biosynthesis.
          Length = 457

 Score =  425 bits (1094), Expect = e-146
 Identities = 174/351 (49%), Positives = 228/351 (64%), Gaps = 7/351 (1%)

Query: 51  SLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCG 110
            LET+DNGKP   +  VD++ S   L YYAG A  + G+ +P+ G  FAYTR EP+GVC 
Sbjct: 63  RLETIDNGKPIEEA-RVDIDSSADCLEYYAGLAPTLSGEHVPLPGGSFAYTRREPLGVCA 121

Query: 111 QIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPG 170
            I  WN+P+ + +WK APALA GN +V KP+  TPLTAL +  ++ EAG+P+GV N+V G
Sbjct: 122 GIGAWNYPIQIASWKSAPALACGNAMVYKPSPFTPLTALLLAEILTEAGLPDGVFNVVQG 181

Query: 171 YGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLD 230
            G+ G  L +HP V KV+FTGS   GK V   + A  +K  TLELGGKSP IIF DADL+
Sbjct: 182 GGETGQLLCEHPDVAKVSFTGSVPTGKKVMS-AAAKGIKHVTLELGGKSPLIIFDDADLE 240

Query: 231 AAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQ 290
            AV GA    F + GQ C  G+R FVQ +I DEF  R  ER K+  +GDP D + + G  
Sbjct: 241 NAVNGAMMANFLSQGQVCSNGTRVFVQRSIKDEFTERLVERTKKIRIGDPLDEDTQMGAL 300

Query: 291 IDKEQMDKILEFIESGKSQGAQLVAGGGRAG-----DKGFYVQPTVFANVRDDMKIAREE 345
           I +E ++K+L +IES K +GA+++ GG R       + GFYV P V  +  DDM I REE
Sbjct: 301 ISEEHLEKVLGYIESAKQEGAKVLCGGERVVPEDGLENGFYVSPCVLTDCTDDMTIVREE 360

Query: 346 IFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
           IFGPV  ++ F + EEVI RAN++ YGLAA VFT+DL + + V   L+AGT
Sbjct: 361 IFGPVMSILPFDTEEEVIRRANDTTYGLAAGVFTRDLQRAHRVIAQLQAGT 411


>gnl|CDD|143421 cd07103, ALDH_F5_SSADH_GabD, Mitochondrial succinate-semialdehyde
           dehydrogenase and ALDH family members 5A1 and 5F1-like. 
           Succinate-semialdehyde dehydrogenase, mitochondrial
           (SSADH, GabD, EC=1.2.1.24) catalyzes the NAD+-dependent
           oxidation of succinate semialdehyde (SSA) to succinate.
           This group includes the human aldehyde dehydrogenase
           family 5 member A1 (ALDH5A1) which is a mitochondrial
           homotetramer that converts SSA to succinate in the last
           step of 4-aminobutyric acid (GABA) catabolism. This CD
           also includes the Arabidopsis SSADH gene product
           ALDH5F1. Mutations in this gene result in the
           accumulation of H2O2, suggesting a role in plant defense
           against the environmental stress of elevated reactive
           oxygen species.
          Length = 451

 Score =  416 bits (1071), Expect = e-143
 Identities = 159/344 (46%), Positives = 219/344 (63%), Gaps = 4/344 (1%)

Query: 54  TLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPI-NGDFFAYTRHEPVGVCGQI 112
           TL+ GKP   +   +V+Y+   L ++A  A +I+G+TIP            +PVGV   I
Sbjct: 66  TLEQGKPLAEAR-GEVDYAASFLEWFAEEARRIYGRTIPSPAPGKRILVIKQPVGVVAAI 124

Query: 113 IPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPG-Y 171
            PWNFP  M+  KIAPALA G T+VLKPAE+TPL+ALA+  L  EAG+P GV+N+V G  
Sbjct: 125 TPWNFPAAMITRKIAPALAAGCTVVLKPAEETPLSALALAELAEEAGLPAGVLNVVTGSP 184

Query: 172 GDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDA 231
            + G AL   P V K++FTGST VGKL+   + A  +KR +LELGG +P I+F DADLD 
Sbjct: 185 AEIGEALCASPRVRKISFTGSTAVGKLLMAQA-ADTVKRVSLELGGNAPFIVFDDADLDK 243

Query: 232 AVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQI 291
           AV+GA    F N GQ C   +R +V ++IYDEFV +  ER K+  VG+  D   + GP I
Sbjct: 244 AVDGAIASKFRNAGQTCVCANRIYVHESIYDEFVEKLVERVKKLKVGNGLDEGTDMGPLI 303

Query: 292 DKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQ 351
           ++  ++K+   +E   ++GA+++ GG R G  G++ +PTV  +V DDM I  EE FGPV 
Sbjct: 304 NERAVEKVEALVEDAVAKGAKVLTGGKRLGLGGYFYEPTVLTDVTDDMLIMNEETFGPVA 363

Query: 352 QLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
            +I F + +EVI RAN++ YGLAA VFT+DL +   V + L AG
Sbjct: 364 PIIPFDTEDEVIARANDTPYGLAAYVFTRDLARAWRVAEALEAG 407


>gnl|CDD|200131 TIGR01804, BADH, glycine betaine aldehyde dehydrogenase.  Under
           osmotic stress, betaine aldehyde dehydrogenase oxidizes
           glycine betaine aldehyde into the osmoprotectant glycine
           betaine, via the second of two oxidation steps from
           exogenously supplied choline or betaine aldehyde. This
           choline-glycine betaine synthesis pathway can be found
           in gram-positive and gram-negative bacteria. In
           Escherichia coli, betaine aldehyde dehydrogenase (betB)
           is osmotically co-induced with choline dehydrogenase
           (betA) in the presence of choline. These dehydrogenases
           are located in a betaine gene cluster with the upstream
           choline transporter (betT) and transcriptional regulator
           (betI). Similar to E.coli, betaine synthesis in
           Staphylococcus xylosus is also influenced by osmotic
           stress and the presence of choline with genes localized
           in a functionally equivalent gene cluster. Organization
           of the betaine gene cluster in Sinorhizobium meliloti
           and Bacillus subtilis differs from that of E.coli by the
           absence of upstream choline transporter and
           transcriptional regulator homologues. Additionally,
           B.subtilis co-expresses a type II alcohol dehydrogenase
           with betaine aldehyde dehydrogenase instead of choline
           dehydrogenase as in E.coli, St.xylosus, and Si.meliloti.
           Betaine aldehyde dehydrogenase is a member of the
           aldehyde dehydrogenase family (pfam00171) [Cellular
           processes, Adaptations to atypical conditions].
          Length = 467

 Score =  416 bits (1070), Expect = e-143
 Identities = 176/353 (49%), Positives = 229/353 (64%), Gaps = 6/353 (1%)

Query: 50  SSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVC 109
           + LETLD GK    +   D++       ++AG A  ++G+ IP+ G  FAYT  EP+GVC
Sbjct: 78  AKLETLDTGKTLQETIVADMDSGADVFEFFAGLAPALNGEIIPLGGPSFAYTIREPLGVC 137

Query: 110 GQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVP 169
             I  WN+PL + +WKIAPALA GN +V KP+E TPLTAL V  ++ EAG+P+GV N+V 
Sbjct: 138 VGIGAWNYPLQIASWKIAPALAAGNAMVFKPSENTPLTALKVAEIMEEAGLPKGVFNVVQ 197

Query: 170 GYG-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADAD 228
           G G + G  LV HP V KV+FTG    GK +   + A +LK  T+ELGGKSP I+F DAD
Sbjct: 198 GDGAEVGPLLVNHPDVAKVSFTGGVPTGKKI-MAAAAGHLKHVTMELGGKSPLIVFDDAD 256

Query: 229 LDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQG 288
           L++AV+GA  G FF+ GQ C  G+R FV   I + F+AR  ER +R  +GDPFD   E G
Sbjct: 257 LESAVDGAMLGNFFSAGQVCSNGTRVFVHKKIKERFLARLVERTERIKLGDPFDEATEMG 316

Query: 289 PQIDKEQMDKILEFIESGKSQGAQLVAGGGR----AGDKGFYVQPTVFANVRDDMKIARE 344
           P I     DK+L +IE GK++GA L  GGGR        GF+V+PTVFA+  DDM I RE
Sbjct: 317 PLISAAHRDKVLSYIEKGKAEGATLATGGGRPENVGLQNGFFVEPTVFADCTDDMTIVRE 376

Query: 345 EIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
           EIFGPV  ++ FS  +EVI RAN+++YGLA  VFT DL + + V   L AGT+
Sbjct: 377 EIFGPVMTVLTFSDEDEVIARANDTEYGLAGGVFTADLGRAHRVADQLEAGTV 429


>gnl|CDD|143471 cd07559, ALDH_ACDHII_AcoD-like, Ralstonia eutrophus NAD+-dependent
           acetaldehyde dehydrogenase II and Staphylococcus aureus
           AldA1 (SACOL0154)-like.  Included in this CD is the
           NAD+-dependent, acetaldehyde dehydrogenase II (AcDHII,
           AcoD, EC=1.2.1.3) from Ralstonia (Alcaligenes) eutrophus
           H16 involved in the catabolism of acetoin and ethanol,
           and similar proteins, such as, the dimeric
           dihydrolipoamide dehydrogenase of the acetoin
           dehydrogenase enzyme system of Klebsiella pneumonia.
           Also included are sequences similar to the
           NAD+-dependent chloroacetaldehyde dehydrogenases (AldA
           and AldB) of Xanthobacter autotrophicus GJ10 which are
           involved in the degradation of 1,2-dichloroethane, as
           well as, the uncharacterized aldehyde dehydrogenase from
           Staphylococcus aureus (AldA1, locus SACOL0154) and other
           similar sequences.
          Length = 480

 Score =  413 bits (1064), Expect = e-142
 Identities = 167/354 (47%), Positives = 227/354 (64%), Gaps = 12/354 (3%)

Query: 52  LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQ 111
            ETLDNGKP   + + D+  ++   RY+AG      G    I+ D  +Y  HEP+GV GQ
Sbjct: 83  AETLDNGKPIRETLAADIPLAIDHFRYFAGVIRAQEGSLSEIDEDTLSYHFHEPLGVVGQ 142

Query: 112 IIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGY 171
           IIPWNFPLLM AWK+APALA GNT+VLKPA QTPL+ L +  LI +  +P+GVVN+V G+
Sbjct: 143 IIPWNFPLLMAAWKLAPALAAGNTVVLKPASQTPLSILVLMELIGDL-LPKGVVNVVTGF 201

Query: 172 G-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADA--- 227
           G +AG  L  HP + K+AFTGST VG+L+ Q   A NL   TLELGGKSPNI F DA   
Sbjct: 202 GSEAGKPLASHPRIAKLAFTGSTTVGRLIMQ-YAAENLIPVTLELGGKSPNIFFDDAMDA 260

Query: 228 --DLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNV 285
             D D   E    G  FN G+ C   SR  VQ++IYDEF+ R+ ER +   VG+P D   
Sbjct: 261 DDDFDDKAEEGQLGFAFNQGEVCTCPSRALVQESIYDEFIERAVERFEAIKVGNPLDPET 320

Query: 286 EQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRA----GDKGFYVQPTVFANVRDDMKI 341
             G Q+ K+Q++KIL +++ GK +GA+++ GG R      DKG++ +PT+     +DM+I
Sbjct: 321 MMGAQVSKDQLEKILSYVDIGKEEGAEVLTGGERLTLGGLDKGYFYEPTLIKGGNNDMRI 380

Query: 342 AREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
            +EEIFGPV  +I F   EE I  AN+++YGL   V+T+D+++   V +G++ G
Sbjct: 381 FQEEIFGPVLAVITFKDEEEAIAIANDTEYGLGGGVWTRDINRALRVARGIQTG 434


>gnl|CDD|143458 cd07140, ALDH_F1L_FTFDH, 10-formyltetrahydrofolate dehydrogenase,
           ALDH family 1L.  10-formyltetrahydrofolate dehydrogenase
           (FTHFDH, EC=1.5.1.6), also known as aldehyde
           dehydrogenase family 1 member L1 (ALDH1L1) in humans, is
           a multi-domain homotetramer with an N-terminal formyl
           transferase domain and a C-terminal ALDH domain. FTHFDH
           catalyzes an NADP+-dependent dehydrogenase reaction
           resulting in the conversion of 10-formyltetrahydrofolate
           to tetrahydrofolate and CO2. The ALDH domain is also
           capable of the oxidation of short chain aldehydes to
           their corresponding acids.
          Length = 486

 Score =  408 bits (1050), Expect = e-139
 Identities = 175/355 (49%), Positives = 240/355 (67%), Gaps = 7/355 (1%)

Query: 50  SSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPIN----GDFFAYTRHEP 105
           +++E+LD+G  Y ++    V  S++T RY+AG+ DKI GKTIPIN          T+ EP
Sbjct: 88  ATIESLDSGAVYTLALKTHVGMSIQTFRYFAGWCDKIQGKTIPINQARPNRNLTLTKREP 147

Query: 106 VGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVV 165
           +GVCG +IPWN+PL+MLAWK+A  LA GNT+VLKPA+ TPLTAL    L  +AG P+GV+
Sbjct: 148 IGVCGIVIPWNYPLMMLAWKMAACLAAGNTVVLKPAQVTPLTALKFAELTVKAGFPKGVI 207

Query: 166 NIVPGYG-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIF 224
           NI+PG G   G  L  HP V K+ FTGST +GK + +    SNLK+ +LELGGKSP IIF
Sbjct: 208 NILPGSGSLVGQRLSDHPDVRKLGFTGSTPIGKHIMKSCAVSNLKKVSLELGGKSPLIIF 267

Query: 225 ADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLN 284
           AD D+D AV      +FFN G+ C A  R FV+++I+DEFV R  E  K+  +GDP D +
Sbjct: 268 ADCDMDKAVRMGMSSVFFNKGENCIAAGRLFVEESIHDEFVRRVVEEVKKMKIGDPLDRS 327

Query: 285 VEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIARE 344
            + GPQ  K  +DK++E+ E G  +GA LV GG +    GF+ +PTVF +V D M IA+E
Sbjct: 328 TDHGPQNHKAHLDKLVEYCERGVKEGATLVYGGKQVDRPGFFFEPTVFTDVEDHMFIAKE 387

Query: 345 EIFGPVQQLIRF--SSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
           E FGP+  + +F    ++ V++RAN+++YGLA+ VFTKD++K  YV+  L AGT+
Sbjct: 388 ESFGPIMIISKFDDGDVDGVLQRANDTEYGLASGVFTKDINKALYVSDKLEAGTV 442


>gnl|CDD|143435 cd07117, ALDH_StaphAldA1, Uncharacterized Staphylococcus aureus
           AldA1 (SACOL0154) aldehyde dehydrogenase-like.
           Uncharacterized aldehyde dehydrogenase from
           Staphylococcus aureus (AldA1, locus SACOL0154) and other
           similar sequences are present in this CD.
          Length = 475

 Score =  406 bits (1044), Expect = e-139
 Identities = 173/349 (49%), Positives = 239/349 (68%), Gaps = 7/349 (2%)

Query: 52  LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQ 111
           +ETLDNGKP   + +VD+  +    RY+AG      G    I+ D  +    EP+GV GQ
Sbjct: 83  VETLDNGKPIRETRAVDIPLAADHFRYFAGVIRAEEGSANMIDEDTLSIVLREPIGVVGQ 142

Query: 112 IIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGY 171
           IIPWNFP LM AWK+APALA GNT+V+KP+  T L+ L +  +I +  +P+GVVNIV G 
Sbjct: 143 IIPWNFPFLMAAWKLAPALAAGNTVVIKPSSTTSLSLLELAKIIQDV-LPKGVVNIVTGK 201

Query: 172 G-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLD 230
           G  +G  L+ HPG+DK+AFTGSTEVG+ V   + A  L   TLELGGKS NIIF DA+ D
Sbjct: 202 GSKSGEYLLNHPGLDKLAFTGSTEVGRDVAI-AAAKKLIPATLELGGKSANIIFDDANWD 260

Query: 231 AAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQ 290
            A+EGA  G+ FN GQ CCAGSR FVQ+ IYDEFVA+  E+ +   VG+P D + + G Q
Sbjct: 261 KALEGAQLGILFNQGQVCCAGSRIFVQEGIYDEFVAKLKEKFENVKVGNPLDPDTQMGAQ 320

Query: 291 IDKEQMDKILEFIESGKSQGAQLVAGGGR----AGDKGFYVQPTVFANVRDDMKIAREEI 346
           ++K+Q+DKIL +++  K +GA+++ GG R      DKGF+++PT+  NV +DM++A+EEI
Sbjct: 321 VNKDQLDKILSYVDIAKEEGAKILTGGHRLTENGLDKGFFIEPTLIVNVTNDMRVAQEEI 380

Query: 347 FGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
           FGPV  +I+F + +EVI+ AN+S+YGL   VFTKD+++   V + +  G
Sbjct: 381 FGPVATVIKFKTEDEVIDMANDSEYGLGGGVFTKDINRALRVARAVETG 429


>gnl|CDD|143436 cd07118, ALDH_SNDH, Gluconobacter oxydans L-sorbosone
           dehydrogenase-like.  Included in this CD is the
           L-sorbosone dehydrogenase (SNDH) from Gluconobacter
           oxydans UV10. In G. oxydans,  D-sorbitol is converted to
           2-keto-L-gulonate (a precursor of L-ascorbic acid) in
           sequential oxidation steps catalyzed by a FAD-dependent,
           L-sorbose dehydrogenase and an NAD(P)+-dependent,
           L-sorbosone dehydrogenase.
          Length = 454

 Score =  396 bits (1020), Expect = e-135
 Identities = 161/348 (46%), Positives = 230/348 (66%), Gaps = 5/348 (1%)

Query: 52  LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTI-PINGDFFAYTRHEPVGVCG 110
           +ETL++GKP + +   ++E +    RY A  A  +HG +   +  D       EP+GV G
Sbjct: 66  IETLESGKPISQARG-EIEGAADLWRYAASLARTLHGDSYNNLGDDMLGLVLREPIGVVG 124

Query: 111 QIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPG 170
            I PWNFP L+L+ K+  ALA G T+V+KP+E T  T L +  L+ EAG+P GVVNIV G
Sbjct: 125 IITPWNFPFLILSQKLPFALAAGCTVVVKPSEFTSGTTLMLAELLIEAGLPAGVVNIVTG 184

Query: 171 YGD-AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADL 229
           YG   G A+ +HP VD V+FTGST VGK +   + A NLK+ +LELGGK+P I+FADADL
Sbjct: 185 YGATVGQAMTEHPDVDMVSFTGSTRVGKAIA-AAAARNLKKVSLELGGKNPQIVFADADL 243

Query: 230 DAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGP 289
           DAA +   FG++FN G+CC +GSR  V ++I D FVA    R+++  VGDP D   + G 
Sbjct: 244 DAAADAVVFGVYFNAGECCNSGSRLLVHESIADAFVAAVVARSRKVRVGDPLDPETKVGA 303

Query: 290 QIDKEQMDKILEFIESGKSQGAQLVAGGGR-AGDKGFYVQPTVFANVRDDMKIAREEIFG 348
            I++ Q+ KI +++++G+++GA L+ GG R A   G + QPT+F +V  DM IAREEIFG
Sbjct: 304 IINEAQLAKITDYVDAGRAEGATLLLGGERLASAAGLFYQPTIFTDVTPDMAIAREEIFG 363

Query: 349 PVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
           PV  ++ F +++E I  AN++ YGL+A V++KD+D    V + +RAGT
Sbjct: 364 PVLSVLTFDTVDEAIALANDTVYGLSAGVWSKDIDTALTVARRIRAGT 411


>gnl|CDD|143428 cd07110, ALDH_F10_BADH, Arabidopsis betaine aldehyde dehydrogenase
           1 and 2, ALDH family 10A8 and 10A9-like.  Present in
           this CD are the Arabidopsis betaine aldehyde
           dehydrogenase (BADH) 1 (chloroplast) and 2
           (mitochondria), also known as, aldehyde dehydrogenase
           family 10 member A8 and aldehyde dehydrogenase family 10
           member A9, respectively, and are putative dehydration-
           and salt-inducible BADHs (EC 1.2.1.8) that catalyze the
           oxidation of betaine aldehyde to the compatible solute
           glycine betaine.
          Length = 456

 Score =  395 bits (1016), Expect = e-135
 Identities = 174/351 (49%), Positives = 232/351 (66%), Gaps = 9/351 (2%)

Query: 52  LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGK---TIPI-NGDFFAYTRHEPVG 107
           LE  DNGKP + + + DV+       YYA  A+++  K    +P+ + DF A  R EPVG
Sbjct: 64  LEARDNGKPLDEA-AWDVDDVAGCFEYYADLAEQLDAKAERAVPLPSEDFKARVRREPVG 122

Query: 108 VCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNI 167
           V G I PWNFPLLM AWK+APALA G T+VLKP+E T LT L +  + AEAG+P GV+N+
Sbjct: 123 VVGLITPWNFPLLMAAWKVAPALAAGCTVVLKPSELTSLTELELAEIAAEAGLPPGVLNV 182

Query: 168 VPGYGD-AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFAD 226
           V G GD AGA L  HPG+DK++FTGST  G  V Q + A ++K  +LELGGKSP I+F D
Sbjct: 183 VTGTGDEAGAPLAAHPGIDKISFTGSTATGSQVMQ-AAAQDIKPVSLELGGKSPIIVFDD 241

Query: 227 ADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVE 286
           ADL+ AVE A FG F+N GQ C A SR  V ++I D F+ R    A+   VGDP +  V 
Sbjct: 242 ADLEKAVEWAMFGCFWNNGQICSATSRLLVHESIADAFLERLATAAEAIRVGDPLEEGVR 301

Query: 287 QGPQIDKEQMDKILEFIESGKSQGAQLVAGGGR--AGDKGFYVQPTVFANVRDDMKIARE 344
            GP + + Q +K+L FI  GK +GA+L+ GG R    +KG+++ PTVFA+V  D +I RE
Sbjct: 302 LGPLVSQAQYEKVLSFIARGKEEGARLLCGGRRPAHLEKGYFIAPTVFADVPTDSRIWRE 361

Query: 345 EIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
           EIFGPV  +  F++ +E I  AN+S+YGLAAAV ++D ++ + V + L AG
Sbjct: 362 EIFGPVLCVRSFATEDEAIALANDSEYGLAAAVISRDAERCDRVAEALEAG 412


>gnl|CDD|237391 PRK13473, PRK13473, gamma-aminobutyraldehyde dehydrogenase;
           Provisional.
          Length = 475

 Score =  388 bits (999), Expect = e-132
 Identities = 149/349 (42%), Positives = 211/349 (60%), Gaps = 5/349 (1%)

Query: 51  SLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGK-TIPINGDFFAYTRHEPVGVC 109
            LE+L+ GKP +++ + ++   +   R++AG A  + GK          +  R +PVGV 
Sbjct: 83  RLESLNCGKPLHLALNDEIPAIVDVFRFFAGAARCLEGKAAGEYLEGHTSMIRRDPVGVV 142

Query: 110 GQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVP 169
             I PWN+PL+M AWK+APALA GNT+VLKP+E TPLTAL +  L A+  +P GV+N+V 
Sbjct: 143 ASIAPWNYPLMMAAWKLAPALAAGNTVVLKPSEITPLTALKLAELAADI-LPPGVLNVVT 201

Query: 170 GYG-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADAD 228
           G G   G ALV HP V  V+ TGS   GK V   + A ++KRT LELGGK+P I+F DAD
Sbjct: 202 GRGATVGDALVGHPKVRMVSLTGSIATGKHVLS-AAADSVKRTHLELGGKAPVIVFDDAD 260

Query: 229 LDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQG 288
           LDA VEG     ++N GQ C A  R + Q  IYD+ VA+         VGDP D + E G
Sbjct: 261 LDAVVEGIRTFGYYNAGQDCTAACRIYAQRGIYDDLVAKLAAAVATLKVGDPDDEDTELG 320

Query: 289 PQIDKEQMDKILEFIESGKSQG-AQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIF 347
           P I     D++  F+E  K+ G  ++V GG     KG+Y +PT+ A  R D +I + E+F
Sbjct: 321 PLISAAHRDRVAGFVERAKALGHIRVVTGGEAPDGKGYYYEPTLLAGARQDDEIVQREVF 380

Query: 348 GPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
           GPV  +  F   ++ +  AN+SDYGLA++V+T+D+ + + V+  L+ G 
Sbjct: 381 GPVVSVTPFDDEDQAVRWANDSDYGLASSVWTRDVGRAHRVSARLQYGC 429


>gnl|CDD|143429 cd07111, ALDH_F16, Aldehyde dehydrogenase family 16A1-like.
           Uncharacterized aldehyde dehydrogenase family 16 member
           A1 (ALDH16A1) and other related sequences are present in
           this CD. The active site cysteine and glutamate residues
           are not conserved in the human ALDH16A1 protein
           sequence.
          Length = 480

 Score =  385 bits (992), Expect = e-131
 Identities = 167/345 (48%), Positives = 223/345 (64%), Gaps = 11/345 (3%)

Query: 52  LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQ 111
           LE+LDNGKP   S   D+    R   ++AG+A  +                 +PVGV GQ
Sbjct: 104 LESLDNGKPIRESRDCDIPLVARHFYHHAGWAQLLD----------TELAGWKPVGVVGQ 153

Query: 112 IIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGY 171
           I+PWNFPLLMLAWKI PALA GNT+VLKPAE TPLTAL    + AEAG+P GV+NIV G 
Sbjct: 154 IVPWNFPLLMLAWKICPALAMGNTVVLKPAEYTPLTALLFAEICAEAGLPPGVLNIVTGN 213

Query: 172 GDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDA 231
           G  G+AL  HPGVDKVAFTGSTEVG+ +++ + A   K+ +LELGGKSP I+F DADLD+
Sbjct: 214 GSFGSALANHPGVDKVAFTGSTEVGRALRR-ATAGTGKKLSLELGGKSPFIVFDDADLDS 272

Query: 232 AVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQI 291
           AVEG    ++FN GQ CCAGSR  VQ+++ +E + +  ER     VGDP D  ++ G  +
Sbjct: 273 AVEGIVDAIWFNQGQVCCAGSRLLVQESVAEELIRKLKERMSHLRVGDPLDKAIDMGAIV 332

Query: 292 DKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQ 351
           D  Q+ +I E +E G+++GA +   G     KG +  PT+F NV    +IA+EEIFGPV 
Sbjct: 333 DPAQLKRIRELVEEGRAEGADVFQPGADLPSKGPFYPPTLFTNVPPASRIAQEEIFGPVL 392

Query: 352 QLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
            ++ F + +E +  ANN+ YGLAA+V++++L     V   L+AG 
Sbjct: 393 VVLTFRTAKEAVALANNTPYGLAASVWSENLSLALEVALSLKAGV 437


>gnl|CDD|143411 cd07092, ALDH_ABALDH-YdcW, Escherichia coli NAD+-dependent
           gamma-aminobutyraldehyde dehydrogenase YdcW-like.
           NAD+-dependent, tetrameric, gamma-aminobutyraldehyde
           dehydrogenase (ABALDH), YdcW of Escherichia coli K12,
           catalyzes the oxidation of gamma-aminobutyraldehyde to
           gamma-aminobutyric acid. ABALDH can also oxidize n-alkyl
           medium-chain aldehydes, but with a lower catalytic
           efficiency.
          Length = 450

 Score =  382 bits (983), Expect = e-130
 Identities = 156/350 (44%), Positives = 213/350 (60%), Gaps = 11/350 (3%)

Query: 52  LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFA----YTRHEPVG 107
           LE+ + GKP ++    ++  ++   R++AG A  + G   P  G++        R EP+G
Sbjct: 64  LESRNTGKPLHLVRDDELPGAVDNFRFFAGAARTLEG---PAAGEYLPGHTSMIRREPIG 120

Query: 108 VCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNI 167
           V  QI PWN+PL+M AWKIAPALA GNT+VLKP+E TPLT L +  L AE  +P GVVN+
Sbjct: 121 VVAQIAPWNYPLMMAAWKIAPALAAGNTVVLKPSETTPLTTLLLAELAAE-VLPPGVVNV 179

Query: 168 VPGYGD-AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFAD 226
           V G G  AG ALV HP V  V+ TGS   GK V + + A  LKR  LELGGK+P I+F D
Sbjct: 180 VCGGGASAGDALVAHPRVRMVSLTGSVRTGKKVAR-AAADTLKRVHLELGGKAPVIVFDD 238

Query: 227 ADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVE 286
           ADLDAAV G     ++N GQ C A  R +V +++YDEFVA   E      VGDP D + E
Sbjct: 239 ADLDAAVAGIATAGYYNAGQDCTAACRVYVHESVYDEFVAALVEAVSAIRVGDPDDEDTE 298

Query: 287 QGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEI 346
            GP     Q +++  F+E      A+++ GG RA   G++ +PTV A V  D +I +EEI
Sbjct: 299 MGPLNSAAQRERVAGFVE-RAPAHARVLTGGRRAEGPGYFYEPTVVAGVAQDDEIVQEEI 357

Query: 347 FGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
           FGPV  +  F   +E IE AN+ +YGLA++V+T+D+ +   ++  L  GT
Sbjct: 358 FGPVVTVQPFDDEDEAIELANDVEYGLASSVWTRDVGRAMRLSARLDFGT 407


>gnl|CDD|143408 cd07089, ALDH_CddD-AldA-like, Rhodococcus ruber 6-oxolauric acid
           dehydrogenase-like and related proteins.  The
           6-oxolauric acid dehydrogenase (CddD) from Rhodococcus
           ruber SC1 which converts 6-oxolauric acid to
           dodecanedioic acid; and the aldehyde dehydrogenase
           (locus SSP0762) from Staphylococcus saprophyticus subsp.
           saprophyticus ATCC 15305 and also, the Mycobacterium
           tuberculosis H37Rv ALDH AldA (locus Rv0768) sequence;
           and other similar sequences, are included in this CD.
          Length = 459

 Score =  379 bits (976), Expect = e-129
 Identities = 151/329 (45%), Positives = 197/329 (59%), Gaps = 9/329 (2%)

Query: 76  LRYYAGFADKIHGK-TIPINGDFFAYT----RHEPVGVCGQIIPWNFPLLMLAWKIAPAL 130
           LRY+A  AD    +  +P+            R EPVGV   I PWNFP  +   K+APAL
Sbjct: 89  LRYFADLADSFPWEFDLPVPALRGGPGRRVVRREPVGVVAAITPWNFPFFLNLAKLAPAL 148

Query: 131 ATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYGDA-GAALVQHPGVDKVAF 189
           A GNT+VLKPA  TPL+AL +G +IAE  +P GVVN+V G  +A G AL   P VD V+F
Sbjct: 149 AAGNTVVLKPAPDTPLSALLLGEIIAETDLPAGVVNVVTGSDNAVGEALTTDPRVDMVSF 208

Query: 190 TGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCC 249
           TGST VG+ +     A+ LKR  LELGGKS NI+  DADL AA   A      N GQ C 
Sbjct: 209 TGSTAVGRRIMA-QAAATLKRVLLELGGKSANIVLDDADLAAAAPAAVGVCMHNAGQGCA 267

Query: 250 AGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQ 309
             +R  V  + YDE V       +   VGDP D     GP I   Q D++  +I  G+ +
Sbjct: 268 LTTRLLVPRSRYDEVVEALAAAFEALPVGDPADPGTVMGPLISAAQRDRVEGYIARGRDE 327

Query: 310 GAQLVAGGGRA--GDKGFYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERAN 367
           GA+LV GGGR    DKGFYV+PT+FA+V +DM+IA+EEIFGPV  +I +   +E +  AN
Sbjct: 328 GARLVTGGGRPAGLDKGFYVEPTLFADVDNDMRIAQEEIFGPVLVVIPYDDDDEAVRIAN 387

Query: 368 NSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
           +SDYGL+  V++ D+D+   V + +R G+
Sbjct: 388 DSDYGLSGGVWSADVDRAYRVARRIRTGS 416


>gnl|CDD|143457 cd07139, ALDH_AldA-Rv0768, Mycobacterium tuberculosis aldehyde
           dehydrogenase  AldA-like.  The Mycobacterium
           tuberculosis NAD+-dependent, aldehyde dehydrogenase  PDB
           structure,  3B4W, and the Mycobacterium tuberculosis
           H37Rv aldehyde dehydrogenase  AldA (locus Rv0768)
           sequence, as well as the Rhodococcus rhodochrous ALDH
           involved in haloalkane catabolism, and other similar
           sequences, are included in this CD.
          Length = 471

 Score =  379 bits (975), Expect = e-128
 Identities = 154/348 (44%), Positives = 205/348 (58%), Gaps = 8/348 (2%)

Query: 54  TLDNGKPYNMSYSVDVEYSLRTLRYYAGFADK---IHGKTIPINGDFFAYTRHEPVGVCG 110
           T +NG P + S           LRYYA  A        +     G      R EPVGV  
Sbjct: 85  TAENGMPISWSRRAQGPGPAALLRYYAALARDFPFEERRPGSGGGH--VLVRREPVGVVA 142

Query: 111 QIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPG 170
            I+PWN PL + A KIAPALA G T+VLKP+ +TPL A  +     EAG+P GVVN+VP 
Sbjct: 143 AIVPWNAPLFLAALKIAPALAAGCTVVLKPSPETPLDAYLLAEAAEEAGLPPGVVNVVPA 202

Query: 171 YGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLD 230
             + G  LV+HPGVDKV+FTGST  G+ +    G   L R TLELGGKS  I+  DADLD
Sbjct: 203 DREVGEYLVRHPGVDKVSFTGSTAAGRRIAAVCG-ERLARVTLELGGKSAAIVLDDADLD 261

Query: 231 AAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQ 290
           AAV G       N GQ C A +R  V  + YDE V           VGDP D   + GP 
Sbjct: 262 AAVPGLVPASLMNNGQVCVALTRILVPRSRYDEVVEALAAAVAALKVGDPLDPATQIGPL 321

Query: 291 IDKEQMDKILEFIESGKSQGAQLVAGGGR-AG-DKGFYVQPTVFANVRDDMKIAREEIFG 348
               Q +++  +I  G+++GA+LV GGGR AG D+G++V+PT+FA+V +DM+IA+EEIFG
Sbjct: 322 ASARQRERVEGYIAKGRAEGARLVTGGGRPAGLDRGWFVEPTLFADVDNDMRIAQEEIFG 381

Query: 349 PVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
           PV  +I +   ++ +  AN+SDYGL+ +V+T D+++   V + +R GT
Sbjct: 382 PVLSVIPYDDEDDAVRIANDSDYGLSGSVWTADVERGLAVARRIRTGT 429


>gnl|CDD|143456 cd07138, ALDH_CddD_SSP0762, Rhodococcus ruber 6-oxolauric acid
           dehydrogenase-like.  The 6-oxolauric acid dehydrogenase
           (CddD) from Rhodococcus ruber SC1 which converts
           6-oxolauric acid to dodecanedioic acid, and the aldehyde
           dehydrogenase (locus SSP0762) from Staphylococcus
           saprophyticus subsp. saprophyticus ATCC 15305 and other
           similar sequences, are included in this CD.
          Length = 466

 Score =  378 bits (974), Expect = e-128
 Identities = 150/353 (42%), Positives = 202/353 (57%), Gaps = 21/353 (5%)

Query: 54  TLDNGKPYNMSYSVDVEYSLRTLRYYAG------FADKIHGKTIPINGDFFAYTRHEPVG 107
           TL+ G P  ++ +  V   +  LR  A       F ++     +            EP+G
Sbjct: 83  TLEMGAPITLARAAQVGLGIGHLRAAADALKDFEFEERRGNSLV----------VREPIG 132

Query: 108 VCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNI 167
           VCG I PWN+PL  +  K+APALA G T+VLKP+E  PL+A+ +  ++ EAG+P GV N+
Sbjct: 133 VCGLITPWNWPLNQIVLKVAPALAAGCTVVLKPSEVAPLSAIILAEILDEAGLPAGVFNL 192

Query: 168 VPGYG-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFAD 226
           V G G   G AL  HP VD V+FTGST  GK V + + A  +KR  LELGGKS NII  D
Sbjct: 193 VNGDGPVVGEALSAHPDVDMVSFTGSTRAGKRVAE-AAADTVKRVALELGGKSANIILDD 251

Query: 227 ADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVE 286
           ADL+ AV       F N GQ C A +R  V  + Y E    +   A+   VGDP D    
Sbjct: 252 ADLEKAVPRGVAACFANSGQSCNAPTRMLVPRSRYAEAEEIAAAAAEAYVVGDPRDPATT 311

Query: 287 QGPQIDKEQMDKILEFIESGKSQGAQLVAGG-GR-AG-DKGFYVQPTVFANVRDDMKIAR 343
            GP     Q D++  +I+ G  +GA+LVAGG GR  G ++G++V+PTVFA+V  DM IAR
Sbjct: 312 LGPLASAAQFDRVQGYIQKGIEEGARLVAGGPGRPEGLERGYFVKPTVFADVTPDMTIAR 371

Query: 344 EEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
           EEIFGPV  +I +   +E I  AN++ YGLA  V++ D ++   V + LRAG 
Sbjct: 372 EEIFGPVLSIIPYDDEDEAIAIANDTPYGLAGYVWSADPERARAVARRLRAGQ 424


>gnl|CDD|143415 cd07097, ALDH_KGSADH-YcbD, Bacillus subtilis NADP+-dependent
           alpha-ketoglutaric semialdehyde dehydrogenase ycbD-like.
            Kinetic studies of the Bacillus subtilis ALDH-like ycbD
           protein, which is involved in d-glucarate/d-galactarate
           utilization, reveal that it is a NADP+-dependent,
           alpha-ketoglutaric semialdehyde dehydrogenase (KGSADH).
           KGSADHs (EC 1.2.1.26) catalyze the NAD(P)+-dependent
           conversion of KGSA to alpha-ketoglutarate.
           Interestingly, the NADP+-dependent, tetrameric,
           2,5-dioxopentanoate dehydrogenase (EC=1.2.1.26), an
           enzyme involved in the catabolic pathway for D-arabinose
           in Sulfolobus solfataricus, also clusters in this group.
           This CD shows a distant phylogenetic relationship to the
           Azospirillum brasilense KGSADH-II (-III) group.
          Length = 473

 Score =  378 bits (972), Expect = e-128
 Identities = 157/347 (45%), Positives = 217/347 (62%), Gaps = 6/347 (1%)

Query: 54  TLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIP-INGDFFAYTRHEPVGVCGQI 112
           T + GK    +   +V  + +  RYYAG A ++ G+T+P         T  EP+GV G I
Sbjct: 84  TREEGKTLPEARG-EVTRAGQIFRYYAGEALRLSGETLPSTRPGVEVETTREPLGVVGLI 142

Query: 113 IPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYG 172
            PWNFP+ + AWKIAPALA GNT+V KPAE TP +A A+  ++ EAG+P GV N+V G G
Sbjct: 143 TPWNFPIAIPAWKIAPALAYGNTVVFKPAELTPASAWALVEILEEAGLPAGVFNLVMGSG 202

Query: 173 -DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDA 231
            + G ALV+HP VD V+FTGST VG+ +   + A+   R  LE+GGK+P ++  DADLD 
Sbjct: 203 SEVGQALVEHPDVDAVSFTGSTAVGRRIAA-AAAARGARVQLEMGGKNPLVVLDDADLDL 261

Query: 232 AVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQI 291
           AVE A  G FF+ GQ C A SR  V + I+D FV    ER K   VGD  D  V+ GP +
Sbjct: 262 AVECAVQGAFFSTGQRCTASSRLIVTEGIHDRFVEALVERTKALKVGDALDEGVDIGPVV 321

Query: 292 DKEQMDKILEFIESGKSQGAQLVAGGGR--AGDKGFYVQPTVFANVRDDMKIAREEIFGP 349
            + Q++K L +IE  +S+GA+LV GG R    D+G+Y+ P +FA V +DM+IAREEIFGP
Sbjct: 322 SERQLEKDLRYIEIARSEGAKLVYGGERLKRPDEGYYLAPALFAGVTNDMRIAREEIFGP 381

Query: 350 VQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
           V  +IR    +E +  AN++++GL+A + T  L    +  + + AG 
Sbjct: 382 VAAVIRVRDYDEALAIANDTEFGLSAGIVTTSLKHATHFKRRVEAGV 428


>gnl|CDD|143395 cd06534, ALDH-SF, NAD(P)+-dependent aldehyde dehydrogenase
           superfamily.  The aldehyde dehydrogenase superfamily
           (ALDH-SF) of  NAD(P)+-dependent enzymes, in general,
           oxidize a wide range of  endogenous and exogenous
           aliphatic and aromatic aldehydes to their corresponding
           carboxylic acids and play an  important role in
           detoxification. Besides aldehyde detoxification, many
           ALDH isozymes possess multiple additional catalytic and
           non-catalytic functions such as participating in
           metabolic pathways, or as binding proteins, or
           osmoregulants, to mention a few. The enzyme has three
           domains, a NAD(P)+ cofactor-binding domain, a catalytic
           domain, and a bridging domain; and the active enzyme is
           generally either homodimeric or homotetrameric. The
           catalytic mechanism is proposed to involve cofactor
           binding, resulting in a conformational change and
           activation of an invariant catalytic cysteine
           nucleophile. The cysteine and aldehyde substrate form an
           oxyanion thiohemiacetal intermediate resulting in
           hydride transfer to the cofactor and formation of a
           thioacylenzyme intermediate. Hydrolysis of the
           thioacylenzyme and release of the carboxylic acid
           product occurs, and in most cases, the reduced cofactor
           dissociates from the enzyme. The evolutionary
           phylogenetic tree of ALDHs appears to have an initial
           bifurcation between what has been characterized as the
           classical aldehyde dehydrogenases, the ALDH family
           (ALDH) and extended family members or aldehyde
           dehydrogenase-like (ALDH-L) proteins. The ALDH proteins
           are represented by enzymes which share a number of
           highly conserved residues necessary for catalysis and
           cofactor binding and they include such proteins as
           retinal dehydrogenase, 10-formyltetrahydrofolate
           dehydrogenase, non-phosphorylating glyceraldehyde
           3-phosphate dehydrogenase,
           delta(1)-pyrroline-5-carboxylate dehydrogenases,
           alpha-ketoglutaric semialdehyde dehydrogenase,
           alpha-aminoadipic semialdehyde dehydrogenase, coniferyl
           aldehyde dehydrogenase and succinate-semialdehyde
           dehydrogenase.  Included in this larger group are all
           human, Arabidopsis, Tortula, fungal, protozoan, and
           Drosophila ALDHs identified in families ALDH1 through
           ALDH22 with the exception of families ALDH18, ALDH19,
           and ALDH20 which are present in the ALDH-like group. The
           ALDH-like group is represented by such proteins as
           gamma-glutamyl phosphate reductase, LuxC-like acyl-CoA
           reductase, and coenzyme A acylating aldehyde
           dehydrogenase. All of these proteins have a conserved
           cysteine that aligns with the catalytic cysteine of the
           ALDH group.
          Length = 367

 Score =  366 bits (942), Expect = e-125
 Identities = 165/348 (47%), Positives = 205/348 (58%), Gaps = 64/348 (18%)

Query: 51  SLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIP-INGDFFAYTRHEPVGVC 109
           +LETL+ GKP   +   +V  ++ T RY AG ADK+ G  +P  +    AY R EP+GV 
Sbjct: 38  ALETLETGKPIEEALG-EVARAIDTFRYAAGLADKLGGPELPSPDPGGEAYVRREPLGVV 96

Query: 110 GQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVP 169
           G I PWNFPLL+ AWK+APALA GNT+VLKP+E TPLTALA+  L+ EAG+P GVVN+VP
Sbjct: 97  GVITPWNFPLLLAAWKLAPALAAGNTVVLKPSELTPLTALALAELLQEAGLPPGVVNVVP 156

Query: 170 GYGD-AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADAD 228
           G GD  GAAL+ HP VDK++FTGST VGK + + + A NLK  TLELGGKSP I+  DAD
Sbjct: 157 GGGDEVGAALLSHPRVDKISFTGSTAVGKAIMK-AAAENLKPVTLELGGKSPVIVDEDAD 215

Query: 229 LDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQG 288
           LDAAVEGA FG FFN GQ C A SR                                   
Sbjct: 216 LDAAVEGAVFGAFFNAGQICTAASRLL--------------------------------- 242

Query: 289 PQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFG 348
             + +   D+ +E          +LV               TV  +V  DM IA+EEIFG
Sbjct: 243 --VHESIYDEFVE----------KLV---------------TVLVDVDPDMPIAQEEIFG 275

Query: 349 PVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
           PV  +IRF   EE I  AN+++YGL A VFT+DL++   V + LRAGT
Sbjct: 276 PVLPVIRFKDEEEAIALANDTEYGLTAGVFTRDLNRALRVAERLRAGT 323


>gnl|CDD|143424 cd07106, ALDH_AldA-AAD23400, Streptomyces aureofaciens putative
           aldehyde dehydrogenase AldA (AAD23400)-like.  Putative
           aldehyde dehydrogenase, AldA, from Streptomyces
           aureofaciens (locus AAD23400) and other similar
           sequences are present in this CD.
          Length = 446

 Score =  367 bits (944), Expect = e-124
 Identities = 145/343 (42%), Positives = 200/343 (58%), Gaps = 5/343 (1%)

Query: 54  TLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQII 113
           TL+ GKP   +   +V  ++  LRY A        + I  +       R +P+GV   I+
Sbjct: 66  TLEQGKPLAEAQ-FEVGGAVAWLRYTASLDLPD--EVIEDDDTRRVELRRKPLGVVAAIV 122

Query: 114 PWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYGD 173
           PWNFPLL+ AWKIAPAL  GNT+VLKP+  TPL  L +G L  E  +P GV+N+V G  +
Sbjct: 123 PWNFPLLLAAWKIAPALLAGNTVVLKPSPFTPLCTLKLGELAQEV-LPPGVLNVVSGGDE 181

Query: 174 AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAAV 233
            G AL  HP + K++FTGST  GK V   S A  LKR TLELGG    I+  D D+DA  
Sbjct: 182 LGPALTSHPDIRKISFTGSTATGKKVMA-SAAKTLKRVTLELGGNDAAIVLPDVDIDAVA 240

Query: 234 EGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDK 293
               +G F N GQ C A  R +V ++IYDEF       AK   VGD  D     GP  +K
Sbjct: 241 PKLFWGAFINSGQVCAAIKRLYVHESIYDEFCEALVALAKAAVVGDGLDPGTTLGPVQNK 300

Query: 294 EQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQQL 353
            Q DK+ E +E  K++GA+++AGG      G+++ PT+  +  +  +I  EE FGPV  +
Sbjct: 301 MQYDKVKELVEDAKAKGAKVLAGGEPLDGPGYFIPPTIVDDPPEGSRIVDEEQFGPVLPV 360

Query: 354 IRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
           +++S  +EVI RAN+S+YGL A+V++ DL++   V + L AGT
Sbjct: 361 LKYSDEDEVIARANDSEYGLGASVWSSDLERAEAVARRLEAGT 403


>gnl|CDD|215260 PLN02467, PLN02467, betaine aldehyde dehydrogenase.
          Length = 503

 Score =  362 bits (930), Expect = e-121
 Identities = 157/353 (44%), Positives = 218/353 (61%), Gaps = 9/353 (2%)

Query: 50  SSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGK-TIPIN---GDFFAYTRHEP 105
           + LETLD GKP + + + D++       YYA  A+ +  K   P++     F  Y   EP
Sbjct: 93  AKLETLDCGKPLDEA-AWDMDDVAGCFEYYADLAEALDAKQKAPVSLPMETFKGYVLKEP 151

Query: 106 VGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVV 165
           +GV G I PWN+PLLM  WK+APALA G T VLKP+E   +T L +  +  E G+P GV+
Sbjct: 152 LGVVGLITPWNYPLLMATWKVAPALAAGCTAVLKPSELASVTCLELADICREVGLPPGVL 211

Query: 166 NIVPGYG-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIF 224
           N+V G G +AGA L  HPGVDK+AFTGST  G+ +   + A  +K  +LELGGKSP I+F
Sbjct: 212 NVVTGLGTEAGAPLASHPGVDKIAFTGSTATGRKIMT-AAAQMVKPVSLELGGKSPIIVF 270

Query: 225 ADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLN 284
            D DLD AVE A FG F+  GQ C A SR  V + I  EF+ +  + AK   + DP +  
Sbjct: 271 DDVDLDKAVEWAMFGCFWTNGQICSATSRLLVHERIASEFLEKLVKWAKNIKISDPLEEG 330

Query: 285 VEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGD--KGFYVQPTVFANVRDDMKIA 342
              GP + + Q +K+L+FI + KS+GA ++ GG R     KGF+++PT+  +V   M+I 
Sbjct: 331 CRLGPVVSEGQYEKVLKFISTAKSEGATILCGGKRPEHLKKGFFIEPTIITDVTTSMQIW 390

Query: 343 REEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
           REE+FGPV  +  FS+ +E IE AN+S YGLA AV + DL++   V++  +AG
Sbjct: 391 REEVFGPVLCVKTFSTEDEAIELANDSHYGLAGAVISNDLERCERVSEAFQAG 443


>gnl|CDD|143425 cd07107, ALDH_PhdK-like, Nocardioides 2-carboxybenzaldehyde
           dehydrogenase, PhdK-like.  Nocardioides sp. strain
           KP72-carboxybenzaldehyde dehydrogenase (PhdK), an enzyme
           involved in phenanthrene degradation, and other similar
           sequences, are present in this CD.
          Length = 456

 Score =  360 bits (925), Expect = e-121
 Identities = 155/350 (44%), Positives = 219/350 (62%), Gaps = 9/350 (2%)

Query: 52  LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQ 111
           ++ LD G P +     DV  +   L Y+AG   ++ G+TIP+ G    YT  EP GV  +
Sbjct: 64  IDALDCGNPVSAMLG-DVMVAAALLDYFAGLVTELKGETIPVGGRNLHYTLREPYGVVAR 122

Query: 112 IIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGY 171
           I+ +N PL+  A KIA  LA GNT+V+KP EQ PL+AL +  L  E  +P GV NI+PG 
Sbjct: 123 IVAFNHPLMFAAAKIAAPLAAGNTVVVKPPEQAPLSALRLAELAREV-LPPGVFNILPGD 181

Query: 172 GD-AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLD 230
           G  AGAALV+HP V ++A  GS   G+ + + + A  +K  TLELGGK+  I+F DAD +
Sbjct: 182 GATAGAALVRHPDVKRIALIGSVPTGRAIMR-AAAEGIKHVTLELGGKNALIVFPDADPE 240

Query: 231 AAVEGAHFGLFFN-MGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGP 289
           AA + A  G+ F   GQ C + SR FV ++IYDE +AR  ER     VGDP D     GP
Sbjct: 241 AAADAAVAGMNFTWCGQSCGSTSRLFVHESIYDEVLARVVERVAAIKVGDPTDPATTMGP 300

Query: 290 QIDKEQMDKILEFIESGKSQGAQLVAGGGR----AGDKGFYVQPTVFANVRDDMKIAREE 345
            + ++Q D+++ +I+S K +GA+LV GGGR    A + GFYV+PTVFA+V   M+IAREE
Sbjct: 301 LVSRQQYDRVMHYIDSAKREGARLVTGGGRPEGPALEGGFYVEPTVFADVTPGMRIAREE 360

Query: 346 IFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
           IFGPV  ++R+    E++ +AN  +YGL AA++T D+ + +   + + AG
Sbjct: 361 IFGPVLSVLRWRDEAEMVAQANGVEYGLTAAIWTNDISQAHRTARRVEAG 410


>gnl|CDD|143422 cd07104, ALDH_BenzADH-like, ALDH subfamily: NAD(P)+-dependent
           benzaldehyde dehydrogenase II, vanillin dehydrogenase,
           p-hydroxybenzaldehyde dehydrogenase and related
           proteins.  ALDH subfamily which includes the
           NAD(P)+-dependent, benzaldehyde dehydrogenase II (XylC,
           BenzADH, EC=1.2.1.28)  involved in the oxidation of
           benzyl alcohol to benzoate; p-hydroxybenzaldehyde
           dehydrogenase (PchA, HBenzADH) which catalyzes the
           oxidation of p-hydroxybenzaldehyde to p-hydroxybenzoic
           acid; vanillin dehydrogenase (Vdh, VaniDH) involved in
           the metabolism of ferulic acid as seen in Pseudomonas
           putida KT2440; and other related sequences.
          Length = 431

 Score =  355 bits (913), Expect = e-119
 Identities = 150/335 (44%), Positives = 210/335 (62%), Gaps = 17/335 (5%)

Query: 69  VEYSLRTLRYYAGFADKIHGKTIP--INGDFFAYTRHEPVGVCGQIIPWNFPLLMLAWK- 125
           V  ++  LR  AG   +  G+ +P  + G   +  R  P+GV G I P+NFPL+ LA + 
Sbjct: 61  VGAAIAILREAAGLPRRPEGEILPSDVPGKE-SMVRRVPLGVVGVISPFNFPLI-LAMRS 118

Query: 126 IAPALATGNTIVLKPAEQTPLTALAVGALIA----EAGIPEGVVNIVPGYGD-AGAALVQ 180
           +APALA GN +VLKP  +TP+T    G LIA    EAG+P+GV+N+VPG G   G ALV+
Sbjct: 119 VAPALALGNAVVLKPDSRTPVTG---GLLIAEIFEEAGLPKGVLNVVPGGGSEIGDALVE 175

Query: 181 HPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAAVEGAHFGL 240
           HP V  ++FTGST VG+ + + +G  +LK+  LELGG +P I+  DADLD AV  A FG 
Sbjct: 176 HPRVRMISFTGSTAVGRHIGELAGR-HLKKVALELGGNNPLIVLDDADLDLAVSAAAFGA 234

Query: 241 FFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKIL 300
           F + GQ C A  R  V +++YDEFV +   +AK   VGDP D +   GP I++ Q+D++ 
Sbjct: 235 FLHQGQICMAAGRILVHESVYDEFVEKLVAKAKALPVGDPRDPDTVIGPLINERQVDRVH 294

Query: 301 EFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIE 360
             +E   + GA+L+ GG     +G + QPTV ++V  DM I REEIFGPV  +I F   E
Sbjct: 295 AIVEDAVAAGARLLTGGTY---EGLFYQPTVLSDVTPDMPIFREEIFGPVAPVIPFDDDE 351

Query: 361 EVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
           E +E AN+++YGL+AAVFT+DL++     + L  G
Sbjct: 352 EAVELANDTEYGLSAAVFTRDLERAMAFAERLETG 386


>gnl|CDD|143407 cd07088, ALDH_LactADH-AldA, Escherichia coli lactaldehyde
           dehydrogenase AldA-like.  Lactaldehyde dehydrogenase
           from Escherichia coli (AldA, LactADH, EC=1.2.1.22), an
           NAD(+)-dependent enzyme involved in the metabolism of
           L-fucose and L-rhamnose, and other similar sequences are
           present in this CD.
          Length = 468

 Score =  355 bits (914), Expect = e-119
 Identities = 150/345 (43%), Positives = 218/345 (63%), Gaps = 5/345 (1%)

Query: 54  TLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPI-NGDFFAYTRHEPVGVCGQI 112
             + GK  +++  V+VE++   + Y A +A +I G+ IP    +   +    P+GV   I
Sbjct: 82  VEEQGKTLSLAR-VEVEFTADYIDYMAEWARRIEGEIIPSDRPNENIFIFKVPIGVVAGI 140

Query: 113 IPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYG 172
           +PWNFP  ++A K+APAL TGNTIV+KP+E+TPL AL    L+ EAG+P GV+NIV G G
Sbjct: 141 LPWNFPFFLIARKLAPALVTGNTIVIKPSEETPLNALEFAELVDEAGLPAGVLNIVTGRG 200

Query: 173 -DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDA 231
              G ALV HP V  ++ TGSTE G+ + + + A N+ + +LELGGK+P I+  DADLD 
Sbjct: 201 SVVGDALVAHPKVGMISLTGSTEAGQKIMEAA-AENITKVSLELGGKAPAIVMKDADLDL 259

Query: 232 AVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQI 291
           AV+        N GQ C    R +V + IYDEF+ +  E+ K   VGDPFD   + GP +
Sbjct: 260 AVKAIVDSRIINCGQVCTCAERVYVHEDIYDEFMEKLVEKMKAVKVGDPFDAATDMGPLV 319

Query: 292 DKEQMDKILEFIESGKSQGAQLVAGGGRA-GDKGFYVQPTVFANVRDDMKIAREEIFGPV 350
           ++  +DK+ E +E     GA L+ GG R  G+KG++ +PTV  NVR DM+I +EEIFGPV
Sbjct: 320 NEAALDKVEEMVERAVEAGATLLTGGKRPEGEKGYFYEPTVLTNVRQDMEIVQEEIFGPV 379

Query: 351 QQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
             +++FSS++E IE AN+S+YGL + ++T++L+     T  L  G
Sbjct: 380 LPVVKFSSLDEAIELANDSEYGLTSYIYTENLNTAMRATNELEFG 424


>gnl|CDD|143449 cd07131, ALDH_AldH-CAJ73105, Uncharacterized Candidatus kuenenia
           aldehyde dehydrogenase AldH (CAJ73105)-like.
           Uncharacterized aldehyde dehydrogenase of Candidatus
           kuenenia AldH (locus CAJ73105) and similar sequences
           with similarity to alpha-aminoadipic semialdehyde
           dehydrogenase (AASADH, human ALDH7A1, EC=1.2.1.31),
           Arabidopsis ALDH7B4, and Streptomyces clavuligerus
           delta-1-piperideine-6-carboxylate dehydrogenase (P6CDH)
           are included in this CD.
          Length = 478

 Score =  353 bits (907), Expect = e-118
 Identities = 168/352 (47%), Positives = 231/352 (65%), Gaps = 8/352 (2%)

Query: 50  SSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPIN-GDFFAYTRHEPVGV 108
           + L T + GKP       DV+ ++   +Y AG   ++ G+T+P    +  A TR +P+GV
Sbjct: 80  ARLVTREMGKPLAEGRG-DVQEAIDMAQYAAGEGRRLFGETVPSELPNKDAMTRRQPIGV 138

Query: 109 CGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIV 168
              I PWNFP+ + +WKI PAL  GNT+V KPAE TP  AL +  L AEAG+P GVVN+V
Sbjct: 139 VALITPWNFPVAIPSWKIFPALVCGNTVVFKPAEDTPACALKLVELFAEAGLPPGVVNVV 198

Query: 169 PGYGD-AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADA 227
            G G+  G ALV+HP VD V+FTGSTEVG+ + +     N KR  LE+GGK+P I+  DA
Sbjct: 199 HGRGEEVGEALVEHPDVDVVSFTGSTEVGERIGETCARPN-KRVALEMGGKNPIIVMDDA 257

Query: 228 DLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQ 287
           DLD A+EGA +  F   GQ C A SR  V +++YDEF+ R  ERAKR  VGD  D   + 
Sbjct: 258 DLDLALEGALWSAFGTTGQRCTATSRLIVHESVYDEFLKRFVERAKRLRVGDGLDEETDM 317

Query: 288 GPQIDKEQMDKILEFIESGKSQGAQLVAGGGRA----GDKGFYVQPTVFANVRDDMKIAR 343
           GP I++ Q++K+L + E GK +GA L+ GG R      +KG++V+PTVF +V  DM+IA+
Sbjct: 318 GPLINEAQLEKVLNYNEIGKEEGATLLLGGERLTGGGYEKGYFVEPTVFTDVTPDMRIAQ 377

Query: 344 EEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
           EEIFGPV  LI  SS+EE IE AN+++YGL++A++T+D++K     + L AG
Sbjct: 378 EEIFGPVVALIEVSSLEEAIEIANDTEYGLSSAIYTEDVNKAFRARRDLEAG 429


>gnl|CDD|143426 cd07108, ALDH_MGR_2402, Magnetospirillum NAD(P)+-dependent aldehyde
           dehydrogenase MSR-1-like.  NAD(P)+-dependent aldehyde
           dehydrogenase of Magnetospirillum gryphiswaldense MSR-1
           (MGR_2402) , and other similar sequences, are present in
           this CD.
          Length = 457

 Score =  351 bits (903), Expect = e-118
 Identities = 157/351 (44%), Positives = 213/351 (60%), Gaps = 9/351 (2%)

Query: 52  LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQ 111
           L  L+ G         +        RY+ G A ++ G+T+P   D   YT  EP+GV G 
Sbjct: 64  LLALETGNALRTQARPEAAVLADLFRYFGGLAGELKGETLPFGPDVLTYTVREPLGVVGA 123

Query: 112 IIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGY 171
           I+PWN PL++ A KIAPAL  GNT+VLK AE  PL  L +  ++A+  +P GV+N++ GY
Sbjct: 124 ILPWNAPLMLAALKIAPALVAGNTVVLKAAEDAPLAVLLLAEILAQV-LPAGVLNVITGY 182

Query: 172 G-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLD 230
           G + GAALV HP VDKV FTGSTEVGK++ + + A  L   +LELGGKSP I+F DADLD
Sbjct: 183 GEECGAALVDHPDVDKVTFTGSTEVGKIIYR-AAADRLIPVSLELGGKSPMIVFPDADLD 241

Query: 231 AAVEGAHFGL-FFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGP 289
            AV+GA  G+ F   GQ C AGSR FV + IYD F+ +   +  +  +GDP D   + G 
Sbjct: 242 DAVDGAIAGMRFTRQGQSCTAGSRLFVHEDIYDAFLEKLVAKLSKLKIGDPLDEATDIGA 301

Query: 290 QIDKEQMDKILEFIESGKS-QGAQLVAGG----GRAGDKGFYVQPTVFANVRDDMKIARE 344
            I ++Q  K+  +I+ G S  GA ++ GG          GF+VQPT+F+ V ++ ++ARE
Sbjct: 302 IISEKQFAKVCGYIDLGLSTSGATVLRGGPLPGEGPLADGFFVQPTIFSGVDNEWRLARE 361

Query: 345 EIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
           EIFGPV   I +   +EVI  AN+S YGLAA V+T+DL +       L AG
Sbjct: 362 EIFGPVLCAIPWKDEDEVIAMANDSHYGLAAYVWTRDLGRALRAAHALEAG 412


>gnl|CDD|182108 PRK09847, PRK09847, gamma-glutamyl-gamma-aminobutyraldehyde
           dehydrogenase; Provisional.
          Length = 494

 Score =  350 bits (899), Expect = e-116
 Identities = 151/348 (43%), Positives = 222/348 (63%), Gaps = 4/348 (1%)

Query: 52  LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQ 111
           LETLD GKP   S   D+  + R +R+YA   DK++G+    +    A    EPVGV   
Sbjct: 104 LETLDTGKPIRHSLRDDIPGAARAIRWYAEAIDKVYGEVATTSSHELAMIVREPVGVIAA 163

Query: 112 IIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGY 171
           I+PWNFPLL+  WK+ PALA GN+++LKP+E++PL+A+ +  L  EAG+P+GV+N+V G+
Sbjct: 164 IVPWNFPLLLTCWKLGPALAAGNSVILKPSEKSPLSAIRLAGLAKEAGLPDGVLNVVTGF 223

Query: 172 G-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADA-DL 229
           G +AG AL +H  +D +AFTGST  GK + + +G SN+KR  LE GGKS NI+FAD  DL
Sbjct: 224 GHEAGQALSRHNDIDAIAFTGSTRTGKQLLKDAGDSNMKRVWLEAGGKSANIVFADCPDL 283

Query: 230 DAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGP 289
             A      G+F+N GQ C AG+R  ++++I DEF+A   ++A+    G P D     G 
Sbjct: 284 QQAASATAAGIFYNQGQVCIAGTRLLLEESIADEFLALLKQQAQNWQPGHPLDPATTMGT 343

Query: 290 QIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGP 349
            ID    D +  FI  G+S+G  L+   GR       + PT+F +V  +  ++REEIFGP
Sbjct: 344 LIDCAHADSVHSFIREGESKGQLLL--DGRNAGLAAAIGPTIFVDVDPNASLSREEIFGP 401

Query: 350 VQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
           V  + RF+S E+ ++ AN+S YGL AAV+T+DL + + +++ L+AG++
Sbjct: 402 VLVVTRFTSEEQALQLANDSQYGLGAAVWTRDLSRAHRMSRRLKAGSV 449


>gnl|CDD|143467 cd07149, ALDH_y4uC, Uncharacterized ALDH (y4uC) with similarity to
           Tortula ruralis aldehyde dehydrogenase ALDH21A1.
           Uncharacterized aldehyde dehydrogenase (ORF name y4uC)
           with sequence similarity to the moss Tortula ruralis
           aldehyde dehydrogenase ALDH21A1 (RNP123) believed to
           play an important role in the detoxification of
           aldehydes generated in response to desiccation- and
           salinity-stress, and similar sequences are included in
           this CD.
          Length = 453

 Score =  344 bits (884), Expect = e-115
 Identities = 146/347 (42%), Positives = 207/347 (59%), Gaps = 13/347 (3%)

Query: 58  GKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPING-----DFFAYTRHEPVGVCGQI 112
           GKP   +   +V+ ++ TLR  A  A ++ G+TIP +          +T  EP+GV   I
Sbjct: 72  GKPIKDAR-KEVDRAIETLRLSAEEAKRLAGETIPFDASPGGEGRIGFTIREPIGVVAAI 130

Query: 113 IPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYG 172
            P+NFPL ++A K+ PA+A GN +VLKPA QTPL+AL +  L+ EAG+P+G +N+V G G
Sbjct: 131 TPFNFPLNLVAHKVGPAIAAGNAVVLKPASQTPLSALKLAELLLEAGLPKGALNVVTGSG 190

Query: 173 DA-GAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDA 231
           +  G ALV  P V  ++FTGS  VG+ + + +G   LK+ TLELG  +  I+ ADADL+ 
Sbjct: 191 ETVGDALVTDPRVRMISFTGSPAVGEAIARKAG---LKKVTLELGSNAAVIVDADADLEK 247

Query: 232 AVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQI 291
           AVE    G F N GQ C +  R FV + IYDEF+ R     K+  VGDP D + + GP I
Sbjct: 248 AVERCVSGAFANAGQVCISVQRIFVHEDIYDEFLERFVAATKKLVVGDPLDEDTDVGPMI 307

Query: 292 DKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQ 351
            + + ++I E++E     GA+L+ GG R G     ++PTV  +V  DMK+  EE+F PV 
Sbjct: 308 SEAEAERIEEWVEEAVEGGARLLTGGKRDGA---ILEPTVLTDVPPDMKVVCEEVFAPVV 364

Query: 352 QLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTLI 398
            L  F +++E I  AN+S YGL A VFT DL K     + L  G ++
Sbjct: 365 SLNPFDTLDEAIAMANDSPYGLQAGVFTNDLQKALKAARELEVGGVM 411


>gnl|CDD|143463 cd07145, ALDH_LactADH_F420-Bios, Methanocaldococcus jannaschii
           NAD+-dependent lactaldehyde dehydrogenase-like.
           NAD+-dependent, lactaldehyde dehydrogenase (EC=1.2.1.22)
           involved the biosynthesis of coenzyme F(420) in
           Methanocaldococcus jannaschii through the oxidation of
           lactaldehyde to lactate and generation of NAPH, and
           similar sequences are included in this CD.
          Length = 456

 Score =  338 bits (870), Expect = e-113
 Identities = 141/351 (40%), Positives = 212/351 (60%), Gaps = 10/351 (2%)

Query: 54  TLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPI-----NGDFFAYTRHEPVGV 108
           T++ GKP   S  V+VE ++R  +  A  A  + G+TIP+     N    A+T  EP+GV
Sbjct: 68  TIEVGKPIKQS-RVEVERTIRLFKLAAEEAKVLRGETIPVDAYEYNERRIAFTVREPIGV 126

Query: 109 CGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIV 168
            G I P+NFP  + A KIAPA+A GN++V+KP+  TPLTA+ +  ++ EAG+P GV+N+V
Sbjct: 127 VGAITPFNFPANLFAHKIAPAIAVGNSVVVKPSSNTPLTAIELAKILEEAGLPPGVINVV 186

Query: 169 PGYGD-AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADA 227
            GYG   G  +V +P V+ ++FTGST VG L+   +G    K+  LELGG  P I+  DA
Sbjct: 187 TGYGSEVGDEIVTNPKVNMISFTGSTAVGLLIASKAG-GTGKKVALELGGSDPMIVLKDA 245

Query: 228 DLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQ 287
           DL+ AV  A  G F N GQ C A  R  V++ +YD+F+    E+ K+  VGDP D + + 
Sbjct: 246 DLERAVSIAVRGRFENAGQVCNAVKRILVEEEVYDKFLKLLVEKVKKLKVGDPLDESTDL 305

Query: 288 GPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIF 347
           GP I  E ++++   +     +G +++ GG    D+G +  PTV  N   DM + +EE+F
Sbjct: 306 GPLISPEAVERMENLVNDAVEKGGKILYGG--KRDEGSFFPPTVLENDTPDMIVMKEEVF 363

Query: 348 GPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTLI 398
           GPV  + +    EE +E AN+++YGL A+VFT D+++   V + L AG ++
Sbjct: 364 GPVLPIAKVKDDEEAVEIANSTEYGLQASVFTNDINRALKVARELEAGGVV 414


>gnl|CDD|215157 PLN02278, PLN02278, succinic semialdehyde dehydrogenase.
          Length = 498

 Score =  334 bits (858), Expect = e-110
 Identities = 155/349 (44%), Positives = 212/349 (60%), Gaps = 10/349 (2%)

Query: 52  LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTR----HEPVG 107
           L TL+ GKP   +   +V Y    L Y+A  A +++G  IP     F   R     +PVG
Sbjct: 107 LMTLEQGKPLKEAIG-EVAYGASFLEYFAEEAKRVYGDIIPSP---FPDRRLLVLKQPVG 162

Query: 108 VCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNI 167
           V G I PWNFPL M+  K+ PALA G T+V+KP+E TPLTALA   L  +AGIP GV+N+
Sbjct: 163 VVGAITPWNFPLAMITRKVGPALAAGCTVVVKPSELTPLTALAAAELALQAGIPPGVLNV 222

Query: 168 VPGYGDA-GAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFAD 226
           V G     G AL+  P V K+ FTGST VGK +  G+ A+ +KR +LELGG +P I+F D
Sbjct: 223 VMGDAPEIGDALLASPKVRKITFTGSTAVGKKLMAGAAAT-VKRVSLELGGNAPFIVFDD 281

Query: 227 ADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVE 286
           ADLD AV+GA    F N GQ C   +R  VQ+ IYD+F     +  ++  VGD F+  V 
Sbjct: 282 ADLDVAVKGALASKFRNSGQTCVCANRILVQEGIYDKFAEAFSKAVQKLVVGDGFEEGVT 341

Query: 287 QGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEI 346
           QGP I++  + K+   ++   S+GA+++ GG R    G + +PTV  +V +DM I REE+
Sbjct: 342 QGPLINEAAVQKVESHVQDAVSKGAKVLLGGKRHSLGGTFYEPTVLGDVTEDMLIFREEV 401

Query: 347 FGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
           FGPV  L RF + EE I  AN+++ GLAA +FT+DL +   V++ L  G
Sbjct: 402 FGPVAPLTRFKTEEEAIAIANDTEAGLAAYIFTRDLQRAWRVSEALEYG 450


>gnl|CDD|131352 TIGR02299, HpaE, 5-carboxymethyl-2-hydroxymuconate semialdehyde
           dehydrogenase.  This model represents the dehydrogenase
           responsible for the conversion of
           5-carboxymethyl-2-hydroxymuconate semialdehyde to
           5-carboxymethyl-2-hydroxymuconate (a tricarboxylic
           acid). This is the step in the degradation of
           4-hydroxyphenylacetic acid via homoprotocatechuate
           following the oxidative opening of the aromatic ring.
          Length = 488

 Score =  334 bits (857), Expect = e-110
 Identities = 155/355 (43%), Positives = 217/355 (61%), Gaps = 12/355 (3%)

Query: 52  LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFA-DKIHGKTIPINGDFFAYTRHEPVGVCG 110
           LE LD G+P   +    V  +    R++A    + + G+T P++     YT   PVG  G
Sbjct: 83  LECLDCGQPLRQTRQQ-VIRAAENFRFFADKCEEAMDGRTYPVD-THLNYTVRVPVGPVG 140

Query: 111 QIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPG 170
            I PWN P ++  WKIAPALA GNT+VLKPAE +PLTA  +  +  EAG+P+GV N+V G
Sbjct: 141 LITPWNAPFMLSTWKIAPALAFGNTVVLKPAEWSPLTAARLAEIAKEAGLPDGVFNLVHG 200

Query: 171 YG-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADL 229
           +G +AG ALV HP V  V+FTG T  G ++ + +GA  LKR ++ELGGKSP I+F DADL
Sbjct: 201 FGEEAGKALVAHPDVKAVSFTGETATGSIIMR-NGADTLKRFSMELGGKSPVIVFDDADL 259

Query: 230 DAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGP 289
           + A++   F +F   G+ C A SR  VQ++I ++FV +  ER +   VG P D   E GP
Sbjct: 260 ERALDAVVFMIFSFNGERCTASSRLLVQESIAEDFVEKLVERVRAIRVGHPLDPETEVGP 319

Query: 290 QIDKEQMDKILEFIESGKSQGAQLVAGGGRAGD-------KGFYVQPTVFANVRDDMKIA 342
            I  E + K+L ++E+ + +GA ++ GG RA         +G YV PTVF    + M+IA
Sbjct: 320 LIHPEHLAKVLGYVEAAEKEGATILVGGERAPTFRGEDLGRGNYVLPTVFTGADNHMRIA 379

Query: 343 REEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
           +EEIFGPV  +I F   EE IE+AN++ YGLA  V+T D+ + + V   L AG +
Sbjct: 380 QEEIFGPVLTVIPFKDEEEAIEKANDTRYGLAGYVWTNDVGRAHRVALALEAGMI 434


>gnl|CDD|143434 cd07116, ALDH_ACDHII-AcoD, Ralstonia eutrophus NAD+-dependent
           acetaldehyde dehydrogenase II-like.  Included in this CD
           is the NAD+-dependent, acetaldehyde dehydrogenase II
           (AcDHII, AcoD, EC=1.2.1.3) from Ralstonia (Alcaligenes)
           eutrophus H16 involved in the catabolism of acetoin and
           ethanol, and similar proteins, such as, the dimeric
           dihydrolipoamide dehydrogenase of the acetoin
           dehydrogenase enzyme system of Klebsiella pneumonia.
           Also included are sequences similar to the
           NAD+-dependent chloroacetaldehyde dehydrogenases (AldA
           and AldB) of Xanthobacter autotrophicus GJ10 which are
           involved in the degradation of 1,2-dichloroethane. These
           proteins apparently require RpoN factors for expression.
          Length = 479

 Score =  332 bits (852), Expect = e-110
 Identities = 158/356 (44%), Positives = 213/356 (59%), Gaps = 19/356 (5%)

Query: 53  ETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQI 112
           ET DNGKP   + + D+  ++   RY+AG      G    I+ +  AY  HEP+GV GQI
Sbjct: 84  ETWDNGKPVRETLAADIPLAIDHFRYFAGCIRAQEGSISEIDENTVAYHFHEPLGVVGQI 143

Query: 113 IPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYG 172
           IPWNFPLLM  WK+APALA GN +VLKPAEQTP + L +  LI +  +P GVVN+V G+G
Sbjct: 144 IPWNFPLLMATWKLAPALAAGNCVVLKPAEQTPASILVLMELIGDL-LPPGVVNVVNGFG 202

Query: 173 -DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFA------ 225
            +AG  L     + KVAFTG T  G+L+ Q   + N+   TLELGGKSPNI FA      
Sbjct: 203 LEAGKPLASSKRIAKVAFTGETTTGRLIMQ-YASENIIPVTLELGGKSPNIFFADVMDAD 261

Query: 226 DADLDAAVEGAHFGLF-FNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLN 284
           DA  D A+EG  F +F  N G+ C   SR  +Q++IYD F+ R+ ER K    G+P D  
Sbjct: 262 DAFFDKALEG--FVMFALNQGEVCTCPSRALIQESIYDRFMERALERVKAIKQGNPLDTE 319

Query: 285 VEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRA-----GDKGFYVQPTVFANVRDDM 339
              G Q   EQ++KIL +I+ GK +GA+++ GG R         G+YV  T      + M
Sbjct: 320 TMIGAQASLEQLEKILSYIDIGKEEGAEVLTGGERNELGGLLGGGYYVPTTFKGG--NKM 377

Query: 340 KIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
           +I +EEIFGPV  +  F   EE +E AN++ YGL A V+T+D +    + +G++AG
Sbjct: 378 RIFQEEIFGPVLAVTTFKDEEEALEIANDTLYGLGAGVWTRDGNTAYRMGRGIQAG 433


>gnl|CDD|143468 cd07150, ALDH_VaniDH_like, Pseudomonas putida vanillin
           dehydrogenase-like.  Vanillin dehydrogenase (Vdh,
           VaniDH) involved in the metabolism of ferulic acid and
           other related  sequences are included in this CD.  The
           E. coli vanillin dehydrogenase (LigV) preferred NAD+ to
           NADP+  and exhibited a broad substrate preference,
           including vanillin,  benzaldehyde, protocatechualdehyde,
           m-anisaldehyde, and p-hydroxybenzaldehyde.
          Length = 451

 Score =  330 bits (849), Expect = e-109
 Identities = 139/329 (42%), Positives = 203/329 (61%), Gaps = 6/329 (1%)

Query: 69  VEYSLRTLRYYAGFADKIHGKTIPING-DFFAYTRHEPVGVCGQIIPWNFPLLMLAWKIA 127
             ++   LR  AG   ++ G+T+P +     + +   P+GV   I P+N+PL++   K+A
Sbjct: 82  TTFTPELLRAAAGECRRVRGETLPSDSPGTVSMSVRRPLGVVAGITPFNYPLILATKKVA 141

Query: 128 PALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYGDA-GAALVQHPGVDK 186
            ALA GNT+VLKP+E+TP+  L +  ++ EAG+P+GV N+V G G   G  LV  P V  
Sbjct: 142 FALAAGNTVVLKPSEETPVIGLKIAEIMEEAGLPKGVFNVVTGGGAEVGDELVDDPRVRM 201

Query: 187 VAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQ 246
           V FTGST VG+ + + +G  +LK+ TLELGGK+P I+ ADADLD AV  A FG F + GQ
Sbjct: 202 VTFTGSTAVGREIAEKAGR-HLKKITLELGGKNPLIVLADADLDYAVRAAAFGAFMHQGQ 260

Query: 247 CCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESG 306
            C + SR  V++ +YDEFV +   RA +  VGDP D +   GP I   Q+++I   +E  
Sbjct: 261 ICMSASRIIVEEPVYDEFVKKFVARASKLKVGDPRDPDTVIGPLISPRQVERIKRQVEDA 320

Query: 307 KSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERA 366
            ++GA+L+ GG   G+   + QPTV  +V  DM+I REE FGPV  +I     EE +E A
Sbjct: 321 VAKGAKLLTGGKYDGN---FYQPTVLTDVTPDMRIFREETFGPVTSVIPAKDAEEALELA 377

Query: 367 NNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
           N+++YGL+AA+ T DL +   + + L +G
Sbjct: 378 NDTEYGLSAAILTNDLQRAFKLAERLESG 406


>gnl|CDD|143431 cd07113, ALDH_PADH_NahF, Escherichia coli NAD+-dependent
           phenylacetaldehyde dehydrogenase PadA-like.
           NAD+-dependent, homodimeric, phenylacetaldehyde
           dehydrogenase (PADH, EC=1.2.1.39) PadA of Escherichia
           coli involved in the catabolism of 2-phenylethylamine,
           and other related sequences, are present in this CD.
           Also included is the Pseudomonas fluorescens ST StyD
           PADH involved in styrene catabolism, the Sphingomonas
           sp. LB126 FldD protein involved in fluorene degradation,
           and the Novosphingobium aromaticivorans NahF
           salicylaldehyde dehydrogenase involved in the
           NAD+-dependent conversion of salicylaldehyde to
           salicylate.
          Length = 477

 Score =  329 bits (844), Expect = e-108
 Identities = 149/352 (42%), Positives = 214/352 (60%), Gaps = 7/352 (1%)

Query: 52  LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKT----IP-INGD-FFAYTRHEP 105
           LETL +GK  ++S + +V  S   LRY+AG+A KI+G+T    IP + G+ + A+TR EP
Sbjct: 83  LETLCSGKSIHLSRAFEVGQSANFLRYFAGWATKINGETLAPSIPSMQGERYTAFTRREP 142

Query: 106 VGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVV 165
           VGV   I+PWNF +++  WKI  ALATG TIV+KP+E TPLT L V  L  EAGIP+GV+
Sbjct: 143 VGVVAGIVPWNFSVMIAVWKIGAALATGCTIVIKPSEFTPLTLLRVAELAKEAGIPDGVL 202

Query: 166 NIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFA 225
           N+V G G  GA L+ HP V KV+FTGS   GK + +   AS+L R TLELGGK+      
Sbjct: 203 NVVNGKGAVGAQLISHPDVAKVSFTGSVATGKKIGR-QAASDLTRVTLELGGKNAAAFLK 261

Query: 226 DADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNV 285
           DAD+D  VEG     F + GQ C A  R +V  + +DE V +  +      VG P D +V
Sbjct: 262 DADIDWVVEGLLTAGFLHQGQVCAAPERFYVHRSKFDELVTKLKQALSSFQVGSPMDESV 321

Query: 286 EQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREE 345
             GP  ++   DK+  +++  +++G ++V GG     +G++VQPT+      D ++ REE
Sbjct: 322 MFGPLANQPHFDKVCSYLDDARAEGDEIVRGGEALAGEGYFVQPTLVLARSADSRLMREE 381

Query: 346 IFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
            FGPV   + +   EE+I+  N++ +GL A+V+T +L K       + AGT+
Sbjct: 382 TFGPVVSFVPYEDEEELIQLINDTPFGLTASVWTNNLSKALRYIPRIEAGTV 433


>gnl|CDD|143405 cd07086, ALDH_F7_AASADH-like, NAD+-dependent alpha-aminoadipic
           semialdehyde dehydrogenase and related proteins.  ALDH
           subfamily which includes the NAD+-dependent,
           alpha-aminoadipic semialdehyde dehydrogenase (AASADH,
           EC=1.2.1.31), also known as Antiquitin-1, ALDH7A1,
           ALDH7B or delta-1-piperideine-6-carboxylate
           dehydrogenase (P6CDH), and other similar sequences, such
           as the uncharacterized aldehyde dehydrogenase of
           Candidatus kuenenia AldH (locus CAJ73105).
          Length = 478

 Score =  327 bits (841), Expect = e-108
 Identities = 131/312 (41%), Positives = 180/312 (57%), Gaps = 8/312 (2%)

Query: 78  YYAGFADKIHGKTIPIN-GDFFAYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTI 136
           Y  G +  ++G TIP          +  P+GV G I  +NFP+ +  W  A AL  GNT+
Sbjct: 105 YAVGLSRMLYGLTIPSERPGHRLMEQWNPLGVVGVITAFNFPVAVPGWNAAIALVCGNTV 164

Query: 137 VLKPAEQTPLTALAVGALIAEA----GIPEGVVNIVPGYGDAGAALVQHPGVDKVAFTGS 192
           V KP+E TPLTA+AV  ++AE     G+P GVVN+V G GD G  LV  P V  V+FTGS
Sbjct: 165 VWKPSETTPLTAIAVTKILAEVLEKNGLPPGVVNLVTGGGDGGELLVHDPRVPLVSFTGS 224

Query: 193 TEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGS 252
           TEVG+ V + + A    R  LELGG +  I+  DADLD AV    F      GQ C    
Sbjct: 225 TEVGRRVGE-TVARRFGRVLLELGGNNAIIVMDDADLDLAVRAVLFAAVGTAGQRCTTTR 283

Query: 253 RTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQ 312
           R  V +++YDEF+ R  +  K+  +GDP D     GP I++  ++K L  IE  KSQG  
Sbjct: 284 RLIVHESVYDEFLERLVKAYKQVRIGDPLDEGTLVGPLINQAAVEKYLNAIEIAKSQGGT 343

Query: 313 LVAGGGRA--GDKGFYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSD 370
           ++ GG R   G+ G YV+PT+   V DD +I +EE F P+  +I+F S+EE I   N+  
Sbjct: 344 VLTGGKRIDGGEPGNYVEPTIVTGVTDDARIVQEETFAPILYVIKFDSLEEAIAINNDVP 403

Query: 371 YGLAAAVFTKDL 382
            GL++++FT+DL
Sbjct: 404 QGLSSSIFTEDL 415


>gnl|CDD|188167 TIGR01780, SSADH, succinate-semialdehyde dehydrogenase.  Succinic
           semialdehyde dehydrogenase is one of three enzymes
           constituting 4-aminobutyrate (GABA) degradation in both
           prokaryotes and eukaryotes, catalyzing the
           (NAD(P)+)-dependent catabolism reaction of succinic
           semialdehyde to succinate for metabolism by the citric
           acid cycle. The EC number depends on the cofactor:
           1.2.1.24 for NAD only, 1.2.1.79 for NADP only, and
           1.2.1.16 if both can be used. In Escherichia coli,
           succinic semialdehyde dehydrogenase is located in an
           unidirectionally transcribed gene cluster encoding
           enzymes for GABA degradation and is suggested to be
           cotranscribed with succinic semialdehyde transaminase
           from a common promoter upstream of SSADH. Similar gene
           arrangements can be found in characterized Ralstonia
           eutropha and the genome analysis of Bacillus subtilis.
           Prokaryotic succinic semialdehyde dehydrogenases
           (1.2.1.16) share high sequence homology to characterized
           succinic semialdehyde dehydrogenases from rat and human
           (1.2.1.24), exhibiting conservation of proposed cofactor
           binding residues, and putative active sites (G-237 &
           G-242, C-293 & G-259 respectively of rat SSADH).
           Eukaryotic SSADH enzymes exclusively utilize NAD+ as a
           cofactor, exhibiting little to no NADP+ activity. While
           a NADP+ preference has been detected in prokaryotes in
           addition to both NADP+- and NAD+-dependencies as in
           E.coli, Pseudomonas, and Klebsiella pneumoniae. The
           function of this alternative SSADH currently is unknown,
           but has been suggested to play a possible role in
           4-hydroxyphenylacetic degradation. Just outside the
           scope of this model, are several sequences belonging to
           clades scoring between trusted and noise. These
           sequences may be actual SSADH enzymes, but lack
           sufficiently close characterized homologs to make a
           definitive assignment at this time. SSADH enzyme belongs
           to the aldehyde dehydrogenase family (pfam00171),
           sharing a common evolutionary origin and enzymatic
           mechanism with lactaldehyde dehydrogenase. Like in
           lactaldehyde dehydrogenase and succinate semialdehyde
           dehydrogenase, the mammalian catalytic glutamic acid and
           cysteine residues are conserved in all the enzymes of
           this family (PS00687, PS00070) [Central intermediary
           metabolism, Other].
          Length = 448

 Score =  322 bits (826), Expect = e-106
 Identities = 149/345 (43%), Positives = 213/345 (61%), Gaps = 5/345 (1%)

Query: 54  TLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIP-INGDFFAYTRHEPVGVCGQI 112
           TL+NGKP   +   ++ Y+   L ++A  A +++G TIP    D       +PVGVC  I
Sbjct: 66  TLENGKPLKEAKG-EILYAASFLEWFAEEAKRVYGDTIPSPQSDKRLIVIKQPVGVCAAI 124

Query: 113 IPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGY- 171
            PWNFP  M+  K   ALA G T+V+KPAEQTPL+ALA+  L  +AGIP+GV+N++ G  
Sbjct: 125 TPWNFPAAMITRKAGAALAAGCTVVVKPAEQTPLSALALARLAEQAGIPKGVLNVITGSR 184

Query: 172 -GDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLD 230
             + G  L   P V K++FTGST VGK++ + S AS +K+ ++ELGG +P I+F DADLD
Sbjct: 185 AKEVGNVLTTSPLVRKISFTGSTNVGKILMKQS-ASTVKKVSMELGGNAPFIVFDDADLD 243

Query: 231 AAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQ 290
            AVEGA    F N GQ C   +R +V D IYDEF  +  E  K+  VG+  D  V QGP 
Sbjct: 244 QAVEGAMASKFRNAGQTCVCANRLYVHDGIYDEFAKKLAEAVKKLKVGNGLDEGVTQGPL 303

Query: 291 IDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPV 350
           I+++ ++K+ + I     +GA++V GG R    G + +PTV +NV  DM +++EE FGP+
Sbjct: 304 INEKAVEKVEKHIADAVEKGAKVVTGGKRHELGGNFFEPTVLSNVTADMLVSKEETFGPL 363

Query: 351 QQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
             + +F   EEVI  AN+++ GLAA  F++DL +   V + L  G
Sbjct: 364 APVFKFDDEEEVIAIANDTEVGLAAYFFSRDLSRIWRVAEALEYG 408


>gnl|CDD|143442 cd07124, ALDH_PutA-P5CDH-RocA, Delta(1)-pyrroline-5-carboxylate
           dehydrogenase, RocA.  Delta(1)-pyrroline-5-carboxylate
           dehydrogenase (EC=1.5.1.12 ), RocA: a proline catabolic
           enzyme of the aldehyde dehydrogenase (ALDH) protein
           superfamily. The proline catabolic enzymes, proline
           dehydrogenase and Delta(1)-pyrroline-5-carboxylate
           dehydrogenase (P5CDH), catalyze the two-step oxidation
           of proline to glutamate; P5CDH catalyzes the oxidation
           of glutamate semialdehyde, utilizing NAD+ as the
           electron acceptor. In some bacteria, the two enzymes are
           fused into the bifunctional flavoenzyme, proline
           utilization A (PutA). In this CD, monofunctional enzyme
           sequences such as seen in the Bacillus subtilis RocA
           P5CDH are also present. These enzymes play important
           roles in cellular redox control, superoxide generation,
           and apoptosis.
          Length = 512

 Score =  322 bits (827), Expect = e-105
 Identities = 146/356 (41%), Positives = 203/356 (57%), Gaps = 10/356 (2%)

Query: 50  SSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVC 109
           ++   L+ GK +  +   DV  ++  L YYA    ++ G  + +         + P+GV 
Sbjct: 112 AAWMVLEVGKNWAEAD-ADVAEAIDFLEYYAREMLRLRGFPVEMVPGEDNRYVYRPLGVG 170

Query: 110 GQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVP 169
             I PWNFPL +LA     AL TGNT+VLKPAE TP+ A  +  ++ EAG+P GVVN +P
Sbjct: 171 AVISPWNFPLAILAGMTTAALVTGNTVVLKPAEDTPVIAAKLVEILEEAGLPPGVVNFLP 230

Query: 170 GYGDA-GAALVQHPGVDKVAFTGSTEVG-----KLVQQGSGASNLKRTTLELGGKSPNII 223
           G G+  G  LV+HP V  +AFTGS EVG     +  +   G   LKR   E+GGK+  I+
Sbjct: 231 GPGEEVGDYLVEHPDVRFIAFTGSREVGLRIYERAAKVQPGQKWLKRVIAEMGGKNAIIV 290

Query: 224 FADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDL 283
             DADLD A EG     F   GQ C A SR  V +++YDEF+ R  ER K   VGDP D 
Sbjct: 291 DEDADLDEAAEGIVRSAFGFQGQKCSACSRVIVHESVYDEFLERLVERTKALKVGDPEDP 350

Query: 284 NVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGD--KGFYVQPTVFANVRDDMKI 341
            V  GP IDK   D+I  +IE GKS+G +L+ GG       +G++VQPT+FA+V  D ++
Sbjct: 351 EVYMGPVIDKGARDRIRRYIEIGKSEG-RLLLGGEVLELAAEGYFVQPTIFADVPPDHRL 409

Query: 342 AREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
           A+EEIFGPV  +I+    +E +E AN+++YGL   VF++  +      +    G L
Sbjct: 410 AQEEIFGPVLAVIKAKDFDEALEIANDTEYGLTGGVFSRSPEHLERARREFEVGNL 465


>gnl|CDD|143417 cd07099, ALDH_DDALDH, Methylomonas sp.
           4,4'-diapolycopene-dialdehyde dehydrogenase-like.  The
           4,4'-diapolycopene-dialdehyde dehydrogenase (DDALDH)
           involved in C30 carotenoid synthesis in Methylomonas sp.
           strain 16a and other similar sequences are present in
           this CD. DDALDH converts 4,4'-diapolycopene-dialdehyde
           into 4,4'-diapolycopene-diacid.
          Length = 453

 Score =  314 bits (807), Expect = e-103
 Identities = 132/347 (38%), Positives = 191/347 (55%), Gaps = 7/347 (2%)

Query: 54  TLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHG----KTIPINGDFFAYTRHEPVGVC 109
             + GKP   +  ++V  +L  + + A  A ++       T  +  +  A   + P GV 
Sbjct: 65  HAETGKPRADAG-LEVLLALEAIDWAARNAPRVLAPRKVPTGLLMPNKKATVEYRPYGVV 123

Query: 110 GQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVP 169
           G I PWN+PLL     I PALA GN +VLKP+E TPL    +    A AG P+GV+ +V 
Sbjct: 124 GVISPWNYPLLTPMGDIIPALAAGNAVVLKPSEVTPLVGELLAEAWAAAGPPQGVLQVVT 183

Query: 170 GYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADL 229
           G G  GAAL+   GVDKVAFTGS   G+ V   + A  L    LELGGK P I+ ADADL
Sbjct: 184 GDGATGAALIDA-GVDKVAFTGSVATGRKVMAAA-AERLIPVVLELGGKDPMIVLADADL 241

Query: 230 DAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGP 289
           + A   A +G   N GQ C +  R +V +++YDEFVAR   +A+    G     + + GP
Sbjct: 242 ERAAAAAVWGAMVNAGQTCISVERVYVHESVYDEFVARLVAKARALRPGADDIGDADIGP 301

Query: 290 QIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGP 349
                Q+D +   ++   ++GA+ + GG R+   G + +PTV  +V  DM + REE FGP
Sbjct: 302 MTTARQLDIVRRHVDDAVAKGAKALTGGARSNGGGPFYEPTVLTDVPHDMDVMREETFGP 361

Query: 350 VQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
           V  ++  +  +E I  AN+S YGL+A+VF++DL +   + + L AG 
Sbjct: 362 VLPVMPVADEDEAIALANDSRYGLSASVFSRDLARAEAIARRLEAGA 408


>gnl|CDD|132260 TIGR03216, OH_muco_semi_DH, 2-hydroxymuconic semialdehyde
           dehydrogenase.  Members of this protein family are
           2-hydroxymuconic semialdehyde dehydrogenase. Many
           aromatic compounds are catabolized by way of the
           catechol, via the meta-cleavage pathway, to pyruvate and
           acetyl-CoA. This enzyme performs the second of seven
           steps in that pathway for catechol degradation [Energy
           metabolism, Other].
          Length = 481

 Score =  308 bits (792), Expect = e-101
 Identities = 141/364 (38%), Positives = 201/364 (55%), Gaps = 16/364 (4%)

Query: 43  NEFVNAQSSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGK----TIPINGDFF 98
           ++F+ A    E  D GKP +++  +D+       R +A        +      P      
Sbjct: 77  DDFLAA----EVADTGKPRSLASHLDIPRGAANFRVFADVVKNAPTECFEMATPDGKGAL 132

Query: 99  AYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEA 158
            Y   +P+GV G I PWN PLL++ WK+ PALA GNT+V+KP+E+TP TA  +G ++   
Sbjct: 133 NYAVRKPLGVVGVISPWNLPLLLMTWKVGPALACGNTVVVKPSEETPGTATLLGEVMNAV 192

Query: 159 GIPEGVVNIVPGYG--DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELG 216
           G+P+GV N+V G+G   AG  L +HPGVD + FTG T  G  + + + A  +K  + ELG
Sbjct: 193 GVPKGVYNVVHGFGPDSAGEFLTRHPGVDAITFTGETRTGSAIMKAA-ADGVKPVSFELG 251

Query: 217 GKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRT 276
           GK+  I+FAD D DAAV G     F N GQ C    R +V+  I+D FVA    RA+   
Sbjct: 252 GKNAAIVFADCDFDAAVAGILRSAFLNTGQVCLGTERVYVERPIFDRFVAALKARAESLK 311

Query: 277 VGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGG-----RAGDKGFYVQPTV 331
           +G P D     GP I  E  DK+L +      +GA +V GGG      A   G +VQPT+
Sbjct: 312 IGVPDDPATNMGPLISAEHRDKVLSYYALAVEEGATVVTGGGVPDFGDALAGGAWVQPTI 371

Query: 332 FANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQG 391
           +  + D  ++  EEIFGP   +  F S EEVI  AN++ YGLAA+V+T+DL + + V + 
Sbjct: 372 WTGLPDSARVVTEEIFGPCCHIAPFDSEEEVIALANDTPYGLAASVWTEDLSRAHRVARQ 431

Query: 392 LRAG 395
           +  G
Sbjct: 432 MEVG 435


>gnl|CDD|143438 cd07120, ALDH_PsfA-ACA09737, Pseudomonas putida aldehyde
           dehydrogenase PsfA (ACA09737)-like.  Included in this CD
           is the aldehyde dehydrogenase (PsfA, locus ACA09737) of
           Pseudomonas putida involved in furoic acid metabolism.
           Transcription of psfA was induced in response to
           2-furoic acid, furfuryl alcohol, and furfural.
          Length = 455

 Score =  305 bits (782), Expect = 1e-99
 Identities = 142/350 (40%), Positives = 197/350 (56%), Gaps = 7/350 (2%)

Query: 52  LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQ 111
           L  L+NGK    +   ++  ++  LRYYAG A    G+ I      F+    EP+GV G 
Sbjct: 65  LLALENGKILGEA-RFEISGAISELRYYAGLARTEAGRMIEPEPGSFSLVLREPMGVAGI 123

Query: 112 IIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEA-GIPEGVVNIVPG 170
           I+PWN P+++L   +APALA G T+V+KPA QT     A+  ++AE   +P GVVN+   
Sbjct: 124 IVPWNSPVVLLVRSLAPALAAGCTVVVKPAGQTAQINAAIIRILAEIPSLPAGVVNLFTE 183

Query: 171 YG-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADL 229
            G +  A LV  P VD ++FTGST  G+ +   + A  LKR  LELGGK+P I+F DADL
Sbjct: 184 SGSEGAAHLVASPDVDVISFTGSTATGRAIMA-AAAPTLKRLGLELGGKTPCIVFDDADL 242

Query: 230 DAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGP 289
           DAA+      L    GQ C AGSR  VQ +I DE   R   R     VG   D   + GP
Sbjct: 243 DAALPKLERALTIFAGQFCMAGSRVLVQRSIADEVRDRLAARLAAVKVGPGLDPASDMGP 302

Query: 290 QIDKEQMDKILEFIESGKSQGAQLVAGGGRAGD---KGFYVQPTVFANVRDDMKIAREEI 346
            ID+  +D++   +E   + GA++V  GG   +   KG +++PT+      D  I +EEI
Sbjct: 303 LIDRANVDRVDRMVERAIAAGAEVVLRGGPVTEGLAKGAFLRPTLLEVDDPDADIVQEEI 362

Query: 347 FGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
           FGPV  L  F    E +  AN++DYGLAA+V+T+DL +   V + +RAGT
Sbjct: 363 FGPVLTLETFDDEAEAVALANDTDYGLAASVWTRDLARAMRVARAIRAGT 412


>gnl|CDD|143469 cd07151, ALDH_HBenzADH, NADP+-dependent p-hydroxybenzaldehyde
           dehydrogenase-like.  NADP+-dependent,
           p-hydroxybenzaldehyde dehydrogenase (PchA, HBenzADH)
           which catalyzes oxidation of p-hydroxybenzaldehyde to
           p-hydroxybenzoic acid and other related sequences are
           included in this CD.
          Length = 465

 Score =  303 bits (779), Expect = 5e-99
 Identities = 144/337 (42%), Positives = 209/337 (62%), Gaps = 15/337 (4%)

Query: 66  SVDVEYSLRTLRYYAGFADKIHGKTIP--INGDFFAYTRHEPVGVCGQIIPWNFPLLMLA 123
           +++   ++   R  A F  ++ G+ +P  + G      R EP+GV G I PWNFPL +  
Sbjct: 90  NIEWGAAMAITREAATFPLRMEGRILPSDVPGKENRVYR-EPLGVVGVISPWNFPLHLSM 148

Query: 124 WKIAPALATGNTIVLKPAEQTPLTALAVGALIA----EAGIPEGVVNIVPGYG-DAGAAL 178
             +APALA GN +VLKPA  TP+T    G L+A    EAG+P+GV+N+V G G + G A 
Sbjct: 149 RSVAPALALGNAVVLKPASDTPITG---GLLLAKIFEEAGLPKGVLNVVVGAGSEIGDAF 205

Query: 179 VQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAAVEGAHF 238
           V+HP    ++FTGST VG+ + + +G  +LK+  LELGG +P ++  DAD+DAAV  A F
Sbjct: 206 VEHPVPRLISFTGSTPVGRHIGELAGR-HLKKVALELGGNNPFVVLEDADIDAAVNAAVF 264

Query: 239 GLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDK 298
           G F + GQ C A +R  V + +YDEFV +  ER K    GDP D +   GP I++ Q+D 
Sbjct: 265 GKFLHQGQICMAINRIIVHEDVYDEFVEKFVERVKALPYGDPSDPDTVVGPLINESQVDG 324

Query: 299 ILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSS 358
           +L+ IE    +GA L+ GG   G+    ++PTV ++V +DM+IAREEIFGPV  +I+   
Sbjct: 325 LLDKIEQAVEEGATLLVGGEAEGN---VLEPTVLSDVTNDMEIAREEIFGPVAPIIKADD 381

Query: 359 IEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
            EE +E AN+++YGL+ AVFT DL++     + + AG
Sbjct: 382 EEEALELANDTEYGLSGAVFTSDLERGVQFARRIDAG 418


>gnl|CDD|143418 cd07100, ALDH_SSADH1_GabD1, Mycobacterium tuberculosis
           succinate-semialdehyde dehydrogenase 1-like.
           Succinate-semialdehyde dehydrogenase 1 (SSADH1, GabD1,
           EC=1.2.1.16) catalyzes the NADP(+)-dependent oxidation
           of succinate semialdehyde (SSA)  to succinate.  SSADH
           activity in Mycobacterium tuberculosis (Mtb) is encoded
           by both gabD1 (Rv0234c) and gabD2 (Rv1731).  The Mtb
           GabD1 SSADH1 reportedly is an enzyme of the
           gamma-aminobutyrate shunt, which forms a functional link
           between two TCA half-cycles by converting
           alpha-ketoglutarate to succinate.
          Length = 429

 Score =  302 bits (775), Expect = 7e-99
 Identities = 136/343 (39%), Positives = 183/343 (53%), Gaps = 4/343 (1%)

Query: 54  TLDNGKPYNMSYSVDVEYSLRTLRYYAGFADK-IHGKTIPINGDFFAYTRHEPVGVCGQI 112
           TL+ GKP   + + +VE      RYYA  A+  +  + I  +    AY R+EP+GV   I
Sbjct: 46  TLEMGKPIAEARA-EVEKCAWICRYYAENAEAFLADEPIETDAGK-AYVRYEPLGVVLGI 103

Query: 113 IPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYG 172
           +PWNFP   +    AP L  GNT++LK A   P  ALA+  L  EAG PEGV   +    
Sbjct: 104 MPWNFPFWQVFRFAAPNLMAGNTVLLKHASNVPGCALAIEELFREAGFPEGVFQNLLIDS 163

Query: 173 DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAA 232
           D   A++  P V  V  TGS   G+ V   +G  NLK++ LELGG  P I+  DADLD A
Sbjct: 164 DQVEAIIADPRVRGVTLTGSERAGRAVAAEAGK-NLKKSVLELGGSDPFIVLDDADLDKA 222

Query: 233 VEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQID 292
           V+ A  G   N GQ C A  R  V + +YDEF+ +  E      VGDP D + + GP   
Sbjct: 223 VKTAVKGRLQNAGQSCIAAKRFIVHEDVYDEFLEKFVEAMAALKVGDPMDEDTDLGPLAR 282

Query: 293 KEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQQ 352
           K+  D++ E +E   + GA L+ GG R    G +  PTV  +V   M    EE+FGPV  
Sbjct: 283 KDLRDELHEQVEEAVAAGATLLLGGKRPDGPGAFYPPTVLTDVTPGMPAYDEELFGPVAA 342

Query: 353 LIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
           +I+    EE I  AN+S +GL  +VFT DL++   V + L AG
Sbjct: 343 VIKVKDEEEAIALANDSPFGLGGSVFTTDLERAERVARRLEAG 385


>gnl|CDD|143470 cd07152, ALDH_BenzADH, NAD-dependent benzaldehyde dehydrogenase
           II-like.  NAD-dependent, benzaldehyde dehydrogenase II
           (XylC, BenzADH, EC=1.2.1.28) is involved in the
           oxidation of benzyl alcohol to benzoate. In
           Acinetobacter calcoaceticus, this process is carried out
           by the chromosomally encoded, benzyl alcohol
           dehydrogenase (xylB) and benzaldehyde dehydrogenase II
           (xylC) enzymes; whereas in Pseudomonas putida they are
           encoded by TOL plasmids.
          Length = 443

 Score =  302 bits (776), Expect = 7e-99
 Identities = 128/332 (38%), Positives = 193/332 (58%), Gaps = 5/332 (1%)

Query: 67  VDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQIIPWNFPLLMLAWKI 126
            +V  ++  L   AG   +  G+ +P      +  R  P+GV G I P+NFPL++    +
Sbjct: 72  FEVGAAIGELHEAAGLPTQPQGEILPSAPGRLSLARRVPLGVVGVISPFNFPLILAMRSV 131

Query: 127 APALATGNTIVLKPAEQTPLTALAVGALI-AEAGIPEGVVNIVPGYGDAGAALVQHPGVD 185
           APALA GN +VLKP  +TP++   V A +  EAG+P GV++++PG  DAG ALV+ P V 
Sbjct: 132 APALALGNAVVLKPDPRTPVSGGVVIARLFEEAGLPAGVLHVLPGGADAGEALVEDPNVA 191

Query: 186 KVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAAVEGAHFGLFFNMG 245
            ++FTGST VG+ V + +G  +LK+ +LELGGK+  I+  DADLD A     +G F + G
Sbjct: 192 MISFTGSTAVGRKVGEAAGR-HLKKVSLELGGKNALIVLDDADLDLAASNGAWGAFLHQG 250

Query: 246 QCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIES 305
           Q C A  R  V +++ D + A+   +AK   VGDP    V  GP I+  Q+D++   ++ 
Sbjct: 251 QICMAAGRHLVHESVADAYTAKLAAKAKHLPVGDPATGQVALGPLINARQLDRVHAIVDD 310

Query: 306 GKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIER 365
             + GA+L AGG      G + +PTV + V+  M    EEIFGPV  +  F S EE +  
Sbjct: 311 SVAAGARLEAGGTY---DGLFYRPTVLSGVKPGMPAFDEEIFGPVAPVTVFDSDEEAVAL 367

Query: 366 ANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
           AN+++YGL+A + ++D+ +   +   LR G L
Sbjct: 368 ANDTEYGLSAGIISRDVGRAMALADRLRTGML 399


>gnl|CDD|143413 cd07094, ALDH_F21_LactADH-like, ALDH subfamily: NAD+-dependent,
           lactaldehyde dehydrogenase, ALDH family 21 A1, and
           related proteins.  ALDH subfamily which includes Tortula
           ruralis aldehyde dehydrogenase ALDH21A1 (RNP123), and
           NAD+-dependent, lactaldehyde dehydrogenase (EC=1.2.1.22)
           and like sequences.
          Length = 453

 Score =  293 bits (753), Expect = 2e-95
 Identities = 140/351 (39%), Positives = 208/351 (59%), Gaps = 13/351 (3%)

Query: 54  TLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPI-----NGDFFAYTRHEPVGV 108
             + GKP   +  V+V+ ++ TLR  A  A++I G+ IP+     + +  A+T  EPVGV
Sbjct: 68  ACEGGKPIKDA-RVEVDRAIDTLRLAAEEAERIRGEEIPLDATQGSDNRLAWTIREPVGV 126

Query: 109 CGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIV 168
              I P+NFPL ++A K+APA+ATG  +VLKPA +TPL+AL +  ++ EAG+PEGV+ +V
Sbjct: 127 VLAITPFNFPLNLVAHKLAPAIATGCPVVLKPASKTPLSALELAKILVEAGVPEGVLQVV 186

Query: 169 PGYG-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADA 227
            G     G A      V  ++FTGS  VG+ ++  +G    KR  LELGG +P I+  DA
Sbjct: 187 TGEREVLGDAFAADERVAMLSFTGSAAVGEALRANAG---GKRIALELGGNAPVIVDRDA 243

Query: 228 DLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQ 287
           DLDAA+E    G F++ GQ C +  R +V + +YDEF+       K+  VGDP D + + 
Sbjct: 244 DLDAAIEALAKGGFYHAGQVCISVQRIYVHEELYDEFIEAFVAAVKKLKVGDPLDEDTDV 303

Query: 288 GPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIF 347
           GP I +E  +++  ++E     GA+L+ GG R    G   +PTV  +V  D K++ EE F
Sbjct: 304 GPLISEEAAERVERWVEEAVEAGARLLCGGER---DGALFKPTVLEDVPRDTKLSTEETF 360

Query: 348 GPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTLI 398
           GPV  +IR+   EE I  AN++DYGL A +FT+DL+      + L  G ++
Sbjct: 361 GPVVPIIRYDDFEEAIRIANSTDYGLQAGIFTRDLNVAFKAAEKLEVGGVM 411


>gnl|CDD|179543 PRK03137, PRK03137, 1-pyrroline-5-carboxylate dehydrogenase;
           Provisional.
          Length = 514

 Score =  291 bits (748), Expect = 8e-94
 Identities = 138/358 (38%), Positives = 189/358 (52%), Gaps = 16/358 (4%)

Query: 50  SSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKI-HGKTI-PINGDFFAYTRHEPVG 107
           S+    + GKP+  +   D   ++  L YYA    K+  GK +    G+   Y  + P+G
Sbjct: 116 SAWLVKEAGKPWAEA-DADTAEAIDFLEYYARQMLKLADGKPVESRPGEHNRY-FYIPLG 173

Query: 108 VCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNI 167
           V   I PWNFP  ++A     A+  GNT++LKPA  TP+ A     ++ EAG+P GVVN 
Sbjct: 174 VGVVISPWNFPFAIMAGMTLAAIVAGNTVLLKPASDTPVIAAKFVEVLEEAGLPAGVVNF 233

Query: 168 VPGYGDA-GAALVQHPGVDKVAFTGSTEVG-------KLVQQGSGASNLKRTTLELGGKS 219
           VPG G   G  LV HP    + FTGS EVG         VQ G     LKR   E+GGK 
Sbjct: 234 VPGSGSEVGDYLVDHPKTRFITFTGSREVGLRIYERAAKVQPGQ--IWLKRVIAEMGGKD 291

Query: 220 PNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGD 279
             ++  DADLD A E      F   GQ C A SR  V + +YDE + +  E  K  TVG+
Sbjct: 292 AIVVDEDADLDLAAESIVASAFGFSGQKCSACSRAIVHEDVYDEVLEKVVELTKELTVGN 351

Query: 280 PFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDM 339
           P D N   GP I++   DKI+ +IE GK +G +LV GG     KG+++QPT+FA+V    
Sbjct: 352 PED-NAYMGPVINQASFDKIMSYIEIGKEEG-RLVLGGEGDDSKGYFIQPTIFADVDPKA 409

Query: 340 KIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
           +I +EEIFGPV   I+    +  +E ANN++YGL  AV + + +      +    G L
Sbjct: 410 RIMQEEIFGPVVAFIKAKDFDHALEIANNTEYGLTGAVISNNREHLEKARREFHVGNL 467


>gnl|CDD|143423 cd07105, ALDH_SaliADH, Salicylaldehyde dehydrogenase, DoxF-like.
           Salicylaldehyde dehydrogenase (DoxF, SaliADH,
           EC=1.2.1.65) involved in the upper naphthalene catabolic
           pathway of Pseudomonas strain C18 and other similar
           sequences are present in this CD.
          Length = 432

 Score =  287 bits (736), Expect = 5e-93
 Identities = 137/336 (40%), Positives = 188/336 (55%), Gaps = 16/336 (4%)

Query: 68  DVEYSLRTLRYYAGFADKIHGKTIP-INGDFFAYTRHEPVGVCGQIIPWNFPLLMLAWKI 126
           +V+ +   LR  A    +I G +IP       A    EPVGV   I PWN P+++    I
Sbjct: 60  NVDLAAGMLREAASLITQIIGGSIPSDKPGTLAMVVKEPVGVVLGIAPWNAPVILGTRAI 119

Query: 127 APALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIV---PGYGDAGA---ALVQ 180
           A  LA GNT+VLK +E +P T   +G +  EAG+P+GV+N+V   P   DA     AL+ 
Sbjct: 120 AYPLAAGNTVVLKASELSPRTHWLIGRVFHEAGLPKGVLNVVTHSPE--DAPEVVEALIA 177

Query: 181 HPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAAVEGAHFGL 240
           HP V KV FTGST VG+++ + + A +LK   LELGGK+P I+  DADLDAA   A FG 
Sbjct: 178 HPAVRKVNFTGSTRVGRIIAE-TAAKHLKPVLLELGGKAPAIVLEDADLDAAANAALFGA 236

Query: 241 FFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKIL 300
           F N GQ C +  R  V ++I DEFV +    A++   G      V  G  +     D++ 
Sbjct: 237 FLNSGQICMSTERIIVHESIADEFVEKLKAAAEKLFAGP-----VVLGSLVSAAAADRVK 291

Query: 301 EFIESGKSQGAQLVAGG-GRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSI 359
           E ++   S+GA+LV GG       G  + PT+  NV  DM I  EE FGPV  +IR    
Sbjct: 292 ELVDDALSKGAKLVVGGLADESPSGTSMPPTILDNVTPDMDIYSEESFGPVVSIIRVKDE 351

Query: 360 EEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
           EE +  AN+S+YGL+AAVFT+DL +   V + + +G
Sbjct: 352 EEAVRIANDSEYGLSAAVFTRDLARALAVAKRIESG 387


>gnl|CDD|236501 PRK09407, gabD2, succinic semialdehyde dehydrogenase; Reviewed.
          Length = 524

 Score =  288 bits (740), Expect = 2e-92
 Identities = 129/330 (39%), Positives = 184/330 (55%), Gaps = 14/330 (4%)

Query: 75  TLRYYAGFADKI-----HGKTIPINGDFFAYTRHEPVGVCGQIIPWNFPLLMLAWKIAPA 129
           T RYYA  A K+         +P+          +P GV G I PWN+PL +      PA
Sbjct: 121 TARYYARRAPKLLAPRRRAGALPVLTK--TTELRQPKGVVGVISPWNYPLTLAVSDAIPA 178

Query: 130 LATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYG-DAGAALVQHPGVDKVA 188
           L  GN +VLKP  QTPLTALA   L+ EAG+P  +  +V G G   G ALV +   D + 
Sbjct: 179 LLAGNAVVLKPDSQTPLTALAAVELLYEAGLPRDLWQVVTGPGPVVGTALVDN--ADYLM 236

Query: 189 FTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQCC 248
           FTGST  G+++ + +G   L   +LELGGK+P I+  DADLD A  GA    F N GQ C
Sbjct: 237 FTGSTATGRVLAEQAG-RRLIGFSLELGGKNPMIVLDDADLDKAAAGAVRACFSNAGQLC 295

Query: 249 CAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKS 308
            +  R +V ++IYDEFV       +   +G  +D + + G  I + Q++ +   ++   +
Sbjct: 296 ISIERIYVHESIYDEFVRAFVAAVRAMRLGAGYDYSADMGSLISEAQLETVSAHVDDAVA 355

Query: 309 QGAQLVAGGGRAGDKG--FYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERA 366
           +GA ++AGG    D G  FY +PTV   V  DM++AREE FGPV  +   + ++E +ERA
Sbjct: 356 KGATVLAGGKARPDLGPLFY-EPTVLTGVTPDMELAREETFGPVVSVYPVADVDEAVERA 414

Query: 367 NNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
           N++ YGL A+V+T D  +   +   +RAGT
Sbjct: 415 NDTPYGLNASVWTGDTARGRAIAARIRAGT 444


>gnl|CDD|143465 cd07147, ALDH_F21_RNP123, Aldehyde dehydrogenase family 21A1-like. 
           Aldehyde dehydrogenase ALDH21A1 (gene name RNP123) was
           first described in the moss Tortula ruralis and is
           believed to play an important role in the detoxification
           of aldehydes generated in response to desiccation- and
           salinity-stress, and ALDH21A1 expression represents a
           unique stress tolerance mechanism. So far, of plants,
           only the bryophyte sequence has been observed, but
           similar protein sequences from bacteria and archaea are
           also present in this CD.
          Length = 452

 Score =  284 bits (730), Expect = 6e-92
 Identities = 122/350 (34%), Positives = 192/350 (54%), Gaps = 12/350 (3%)

Query: 54  TLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGD-----FFAYTRHEPVGV 108
            L+ GKP   +   +V  ++ T R  A  A +I+G+ +P++            R  P+G 
Sbjct: 68  VLEAGKPIKDA-RGEVARAIDTFRIAAEEATRIYGEVLPLDISARGEGRQGLVRRFPIGP 126

Query: 109 CGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIV 168
              I P+NFPL ++A K+APA+A G   VLKPA +TPL+AL +G ++AE G+P+G  +++
Sbjct: 127 VSAITPFNFPLNLVAHKVAPAIAAGCPFVLKPASRTPLSALILGEVLAETGLPKGAFSVL 186

Query: 169 PGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADAD 228
           P   D    LV    +  ++FTGS  VG  ++  +G    K+  LELGG +  I+ +DAD
Sbjct: 187 PCSRDDADLLVTDERIKLLSFTGSPAVGWDLKARAGK---KKVVLELGGNAAVIVDSDAD 243

Query: 229 LDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQG 288
           LD A +   FG F+  GQ C +  R  V  ++YDEF +R   R K    GDP D   + G
Sbjct: 244 LDFAAQRIIFGAFYQAGQSCISVQRVLVHRSVYDEFKSRLVARVKALKTGDPKDDATDVG 303

Query: 289 PQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFG 348
           P I + + +++  ++      GA+L+ GG R    G  ++PT+  +V  DM++  EE+FG
Sbjct: 304 PMISESEAERVEGWVNEAVDAGAKLLTGGKR---DGALLEPTILEDVPPDMEVNCEEVFG 360

Query: 349 PVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTLI 398
           PV  +  +   +E +   N+S +GL A VFT+DL+K       L  G ++
Sbjct: 361 PVVTVEPYDDFDEALAAVNDSKFGLQAGVFTRDLEKALRAWDELEVGGVV 410


>gnl|CDD|182233 PRK10090, PRK10090, aldehyde dehydrogenase A; Provisional.
          Length = 409

 Score =  280 bits (719), Expect = 9e-91
 Identities = 131/359 (36%), Positives = 201/359 (55%), Gaps = 17/359 (4%)

Query: 58  GKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIP---INGDFFAYTRHEPVGVCGQIIP 114
           GK   ++  V+V ++   + Y A +A +  G+ I       +   + R   +GV   I+P
Sbjct: 24  GKIQQLA-EVEVAFTADYIDYMAEWARRYEGEIIQSDRPGENILLFKR--ALGVTTGILP 80

Query: 115 WNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYGD- 173
           WNFP  ++A K+APAL TGNTIV+KP+E TP  A+A   ++ E G+P+GV N+V G G+ 
Sbjct: 81  WNFPFFLIARKMAPALLTGNTIVIKPSEFTPNNAIAFAKIVDEIGLPKGVFNLVLGRGET 140

Query: 174 AGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAAV 233
            G  L  +P V  V+ TGS   G+ +   + A N+ +  LELGGK+P I+  DADLD AV
Sbjct: 141 VGQELAGNPKVAMVSMTGSVSAGEKIMA-AAAKNITKVCLELGGKAPAIVMDDADLDLAV 199

Query: 234 EGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFD-LNVEQGPQID 292
           +        N GQ C    R +VQ  IYD+FV R GE  +    G+P +  ++  GP I+
Sbjct: 200 KAIVDSRVINSGQVCNCAERVYVQKGIYDQFVNRLGEAMQAVQFGNPAERNDIAMGPLIN 259

Query: 293 KEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQQ 352
              ++++ + +     +GA++  GG     KG+Y  PT+  +VR +M I  EE FGPV  
Sbjct: 260 AAALERVEQKVARAVEEGARVALGGKAVEGKGYYYPPTLLLDVRQEMSIMHEETFGPVLP 319

Query: 353 LIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTLIRFSSIEEVIERAN 411
           ++ F ++EE I  AN+SDYGL ++++T++L+      +GL+ G        E  I R N
Sbjct: 320 VVAFDTLEEAIAMANDSDYGLTSSIYTQNLNVAMKAIKGLKFG--------ETYINREN 370


>gnl|CDD|143420 cd07102, ALDH_EDX86601, Uncharacterized aldehyde dehydrogenase of
           Synechococcus sp. PCC 7335 (EDX86601).  Uncharacterized
           aldehyde dehydrogenase of Synechococcus sp. PCC 7335
           (locus EDX86601) and other similar sequences, are
           present in this CD.
          Length = 452

 Score =  280 bits (719), Expect = 3e-90
 Identities = 123/324 (37%), Positives = 172/324 (53%), Gaps = 5/324 (1%)

Query: 77  RYYAGFADK-IHGKTIPINGDFFAYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNT 135
           RY    A++ +    +P    F  Y R EP+GV   I PWN+P L     + PAL  GN 
Sbjct: 87  RYMISIAEEALADIRVPEKDGFERYIRREPLGVVLIIAPWNYPYLTAVNAVIPALLAGNA 146

Query: 136 IVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEV 195
           ++LK + QTPL      A  AEAG+PEGV  ++    +  AAL+  P +D V+FTGS   
Sbjct: 147 VILKHSPQTPLCGERFAAAFAEAGLPEGVFQVLHLSHETSAALIADPRIDHVSFTGSVAG 206

Query: 196 GKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTF 255
           G+ +Q+ + A    +  LELGGK P  +  DADLDAA E    G FFN GQ CC+  R +
Sbjct: 207 GRAIQR-AAAGRFIKVGLELGGKDPAYVRPDADLDAAAESLVDGAFFNSGQSCCSIERIY 265

Query: 256 VQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVA 315
           V ++IYD FV       K   +GDP D +   GP +     D +   I    ++GA+ + 
Sbjct: 266 VHESIYDAFVEAFVAVVKGYKLGDPLDPSTTLGPVVSARAADFVRAQIADAIAKGARALI 325

Query: 316 GGGR---AGDKGFYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYG 372
            G         G Y+ PTV  NV   M++ REE FGPV  +++  S  E I   N+S+YG
Sbjct: 326 DGALFPEDKAGGAYLAPTVLTNVDHSMRVMREETFGPVVGIMKVKSDAEAIALMNDSEYG 385

Query: 373 LAAAVFTKDLDKTNYVTQGLRAGT 396
           L A+V+TKD+ +   + + L  GT
Sbjct: 386 LTASVWTKDIARAEALGEQLETGT 409


>gnl|CDD|143419 cd07101, ALDH_SSADH2_GabD2, Mycobacterium tuberculosis
           succinate-semialdehyde dehydrogenase 2-like.
           Succinate-semialdehyde dehydrogenase 2 (SSADH2) and
           similar proteins are in this CD. SSADH1 (GabD1,
           EC=1.2.1.16) catalyzes the NADP(+)-dependent oxidation
           of succinate semialdehyde to succinate.  SSADH activity
           in Mycobacterium tuberculosis is encoded by both gabD1
           (Rv0234c) and gabD2 (Rv1731), however ,the Vmax of GabD1
           was shown to be much higher than that of GabD2, and
           GabD2 (SSADH2) is likely to serve physiologically as a
           dehydrogenase for a different aldehyde(s).
          Length = 454

 Score =  279 bits (717), Expect = 6e-90
 Identities = 131/332 (39%), Positives = 184/332 (55%), Gaps = 18/332 (5%)

Query: 75  TLRYYAGFADKI-----HGKTIPINGDFFAYTR--HEPVGVCGQIIPWNFPLLMLAWKIA 127
             RYYA  A+++         IP+       T     P GV G I PWN+PL +      
Sbjct: 85  VARYYARRAERLLKPRRRRGAIPV----LTRTTVNRRPKGVVGVISPWNYPLTLAVSDAI 140

Query: 128 PALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYG-DAGAALVQHPGVDK 186
           PAL  GN +VLKP  QT LTAL    L+ EAG+P  +  +V G G + G A+V +   D 
Sbjct: 141 PALLAGNAVVLKPDSQTALTALWAVELLIEAGLPRDLWQVVTGPGSEVGGAIVDN--ADY 198

Query: 187 VAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQ 246
           V FTGST  G++V + +G   L   +LELGGK+P I+  DADLD A  GA    F N GQ
Sbjct: 199 VMFTGSTATGRVVAERAG-RRLIGCSLELGGKNPMIVLEDADLDKAAAGAVRACFSNAGQ 257

Query: 247 CCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESG 306
            C +  R +V +++YDEFV R   R +   +G   D   + G  I + Q+D++   ++  
Sbjct: 258 LCVSIERIYVHESVYDEFVRRFVARTRALRLGAALDYGPDMGSLISQAQLDRVTAHVDDA 317

Query: 307 KSQGAQLVAGGGRAGDKG--FYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIE 364
            ++GA ++AGG    D G  FY +PTV   V +DM++  EE FGPV  + R +  +E IE
Sbjct: 318 VAKGATVLAGGRARPDLGPYFY-EPTVLTGVTEDMELFAEETFGPVVSIYRVADDDEAIE 376

Query: 365 RANNSDYGLAAAVFTKDLDKTNYVTQGLRAGT 396
            AN++DYGL A+V+T+D  +   +   LRAGT
Sbjct: 377 LANDTDYGLNASVWTRDGARGRRIAARLRAGT 408


>gnl|CDD|143464 cd07146, ALDH_PhpJ, Streptomyces putative phosphonoformaldehyde
           dehydrogenase PhpJ-like.  Putative phosphonoformaldehyde
           dehydrogenase (PhpJ), an aldehyde dehydrogenase homolog
           reportedly involved in the biosynthesis of
           phosphinothricin tripeptides in Streptomyces
           viridochromogenes DSM 40736, and similar sequences are
           included in this CD.
          Length = 451

 Score =  275 bits (706), Expect = 3e-88
 Identities = 127/337 (37%), Positives = 186/337 (55%), Gaps = 12/337 (3%)

Query: 67  VDVEYSLRTLRYYAGFADKIHGKTIPIN-----GDFFAYTRHEPVGVCGQIIPWNFPLLM 121
            +V  +   LR+ A  A +  G++   +          +T  EP+GV   I P+N PL  
Sbjct: 77  YEVGRAADVLRFAAAEALRDDGESFSCDLTANGKARKIFTLREPLGVVLAITPFNHPLNQ 136

Query: 122 LAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPG-YGDAGAALVQ 180
           +A KIAPA+A  N IVLKP+E+TPL+A+ +  L+ EAG+P  ++++V G  G+ G  L+ 
Sbjct: 137 VAHKIAPAIAANNRIVLKPSEKTPLSAIYLADLLYEAGLPPDMLSVVTGEPGEIGDELIT 196

Query: 181 HPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAAVEGAHFGL 240
           HP VD V FTG   VGK +   +  +  KR  LELGG  P I+  DADL+ A   A  G 
Sbjct: 197 HPDVDLVTFTGGVAVGKAI---AATAGYKRQLLELGGNDPLIVMDDADLERAATLAVAGS 253

Query: 241 FFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKIL 300
           + N GQ C A  R  V +++ DEFV    E++    VGDP D   + G  ID+E   +I 
Sbjct: 254 YANSGQRCTAVKRILVHESVADEFVDLLVEKSAALVVGDPMDPATDMGTVIDEEAAIQIE 313

Query: 301 EFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIE 360
             +E   +QGA+++ G  R   +G    PTV  +V  D ++  EE FGPV  +IR   ++
Sbjct: 314 NRVEEAIAQGARVLLGNQR---QGALYAPTVLDHVPPDAELVTEETFGPVAPVIRVKDLD 370

Query: 361 EVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
           E I  +N++ YGL++ V T DLD    + + L  GT+
Sbjct: 371 EAIAISNSTAYGLSSGVCTNDLDTIKRLVERLDVGTV 407


>gnl|CDD|143401 cd07082, ALDH_F11_NP-GAPDH, NADP+-dependent non-phosphorylating
           glyceraldehyde 3-phosphate dehydrogenase and ALDH family
           11.  NADP+-dependent non-phosphorylating glyceraldehyde
           3-phosphate dehydrogenase (NP-GAPDH, EC=1.2.1.9)
           catalyzes the irreversible oxidation of glyceraldehyde
           3-phosphate to 3-phosphoglycerate generating NADPH for
           biosynthetic reactions.  This CD also includes the
           Arabidopsis thaliana osmotic-stress-inducible ALDH
           family 11, ALDH11A3  and similar sequences. In
           autotrophic eukaryotes, NP-GAPDH generates NADPH for
           biosynthetic processes from photosynthetic
           glyceraldehyde-3-phosphate exported from the chloroplast
           and catalyzes one of the classic glycolytic bypass
           reactions unique to plants.
          Length = 473

 Score =  274 bits (702), Expect = 2e-87
 Identities = 138/363 (38%), Positives = 196/363 (53%), Gaps = 22/363 (6%)

Query: 43  NEFVNAQSSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFF---- 98
            E  N    L   + GK    +   +V+ ++  +R       ++ G ++P  GD+F    
Sbjct: 79  EEVAN----LLMWEIGKTLKDALK-EVDRTIDYIRDTIEELKRLDGDSLP--GDWFPGTK 131

Query: 99  ---AYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALI 155
              A  R EP+GV   I P+N+PL +   K+ PAL  GNT+V KPA Q  L  + +    
Sbjct: 132 GKIAQVRREPLGVVLAIGPFNYPLNLTVSKLIPALIMGNTVVFKPATQGVLLGIPLAEAF 191

Query: 156 AEAGIPEGVVNIVPGYG-DAGAALVQHPGVDKVAFTGSTEVG-KLVQQGSGASNLKRTTL 213
            +AG P+GVVN+V G G + G  LV H  +D ++FTGSTEVG +L +Q      +KR  L
Sbjct: 192 HDAGFPKGVVNVVTGRGREIGDPLVTHGRIDVISFTGSTEVGNRLKKQHP----MKRLVL 247

Query: 214 ELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAK 273
           ELGGK P I+  DADL+ A +    G     GQ C A  R  V +++ DE V    E   
Sbjct: 248 ELGGKDPAIVLPDADLELAAKEIVKGALSYSGQRCTAIKRVLVHESVADELVELLKEEVA 307

Query: 274 RRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFA 333
           +  VG P+D  V+  P ID +  D +   I+   ++GA ++ GGGR G     + PT+  
Sbjct: 308 KLKVGMPWDNGVDITPLIDPKSADFVEGLIDDAVAKGATVLNGGGREGGN--LIYPTLLD 365

Query: 334 NVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLR 393
            V  DM++A EE FGPV  +IR + IEE IE AN S+YGL A++FTKD++K   +   L 
Sbjct: 366 PVTPDMRLAWEEPFGPVLPIIRVNDIEEAIELANKSNYGLQASIFTKDINKARKLADALE 425

Query: 394 AGT 396
            GT
Sbjct: 426 VGT 428


>gnl|CDD|143404 cd07085, ALDH_F6_MMSDH, Methylmalonate semialdehyde dehydrogenase
           and ALDH family members 6A1 and 6B2.  Methylmalonate
           semialdehyde dehydrogenase (MMSDH, EC=1.2.1.27)
           [acylating] from Bacillus subtilis is involved in valine
           metabolism and catalyses the NAD+- and CoA-dependent
           oxidation of methylmalonate semialdehyde into
           propionyl-CoA. Mitochondrial human MMSDH ALDH6A1 and
           Arabidopsis MMSDH ALDH6B2 are also present in this CD.
          Length = 478

 Score =  270 bits (693), Expect = 6e-86
 Identities = 113/297 (38%), Positives = 163/297 (54%), Gaps = 13/297 (4%)

Query: 93  INGDFFAYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVG 152
           +      Y+  +P+GV   I P+NFP ++  W    A+A GNT VLKP+E+ P  A+ + 
Sbjct: 124 VARGIDTYSYRQPLGVVAGITPFNFPAMIPLWMFPMAIACGNTFVLKPSERVPGAAMRLA 183

Query: 153 ALIAEAGIPEGVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTT 212
            L+ EAG+P+GV+N+V G  +A  AL+ HP +  V+F GST VG+ + +   A+N KR  
Sbjct: 184 ELLQEAGLPDGVLNVVHGGKEAVNALLDHPDIKAVSFVGSTPVGEYIYE-RAAANGKRVQ 242

Query: 213 LELGGKSPNIIFADADLDAAVE---GAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSG 269
              G K+  ++  DADL+       GA FG     GQ C A S         DE++ +  
Sbjct: 243 ALGGAKNHAVVMPDADLEQTANALVGAAFG---AAGQRCMALSVAVAVGDEADEWIPKLV 299

Query: 270 ERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGR-----AGDKG 324
           ERAK+  VG   D   + GP I     ++I   IESG  +GA+LV   GR       + G
Sbjct: 300 ERAKKLKVGAGDDPGADMGPVISPAAKERIEGLIESGVEEGAKLVL-DGRGVKVPGYENG 358

Query: 325 FYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKD 381
            +V PT+  NV  DMKI +EEIFGPV  ++R  +++E I   N + YG  AA+FT+ 
Sbjct: 359 NFVGPTILDNVTPDMKIYKEEIFGPVLSIVRVDTLDEAIAIINANPYGNGAAIFTRS 415


>gnl|CDD|132417 TIGR03374, ABALDH, 1-pyrroline dehydrogenase.  Members of this
           protein family are 1-pyrroline dehydrogenase (1.5.1.35),
           also called gamma-aminobutyraldehyde dehydrogenase. This
           enzyme can follow putrescine transaminase (EC 2.6.1.82)
           for a two-step conversion of putrescine to
           gamma-aminobutyric acid (GABA). The member from
           Escherichia coli is characterized as a homotetramer that
           binds one NADH per momomer. This enzyme belongs to the
           medium-chain aldehyde dehydrogenases, and is quite
           similar in sequence to the betaine aldehyde
           dehydrogenase (EC 1.2.1.8) family.
          Length = 472

 Score =  267 bits (684), Expect = 6e-85
 Identities = 140/376 (37%), Positives = 212/376 (56%), Gaps = 15/376 (3%)

Query: 30  TKPKIEHTQLFINNEFV--NAQ--SSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADK 85
           T PK     L    + +  NAQ  + LE+ + GKP +  ++ ++   +   R++AG A  
Sbjct: 57  TTPKARAECLLKLADVIEENAQVFAELESRNCGKPLHSVFNDEIPAIVDVFRFFAGAARC 116

Query: 86  IHGKTIPINGDFF----AYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPA 141
           + G      G++     +  R +P+GV   I PWN+PL+M AWK+APALA GN +VLKP+
Sbjct: 117 LSGLAA---GEYLEGHTSMIRRDPLGVVASIAPWNYPLMMAAWKLAPALAAGNCVVLKPS 173

Query: 142 EQTPLTALAVGALIAEAGIPEGVVNIVPGYG-DAGAALVQHPGVDKVAFTGSTEVGKLVQ 200
           E TPLTAL +  L  +   P GVVNI+ G G   G  L  H  V  V+ TGS   G+ + 
Sbjct: 174 EITPLTALKLAELAKDI-FPAGVVNILFGRGKTVGDPLTGHEKVRMVSLTGSIATGEHIL 232

Query: 201 QGSGASNLKRTTLELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTI 260
             + A ++KRT +ELGGK+P I+F DAD+DA VEG     F+N GQ C A  R + Q  I
Sbjct: 233 SHT-APSIKRTHMELGGKAPVIVFDDADIDAVVEGVRTFGFYNAGQDCTAACRIYAQRGI 291

Query: 261 YDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQG-AQLVAGGGR 319
           YD  V + G        G P D + E GP      ++++++ +E  K+ G  +++ GG +
Sbjct: 292 YDTLVEKLGAAVATLKSGAPDDESTELGPLSSLAHLERVMKAVEEAKALGHIKVITGGEK 351

Query: 320 AGDKGFYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFT 379
               G+Y  PT+ A  + D  I ++E+FGPV  +  F   E+V+  AN+S YGLA++V+T
Sbjct: 352 RKGNGYYFAPTLLAGAKQDDAIVQKEVFGPVVSITSFDDEEQVVNWANDSQYGLASSVWT 411

Query: 380 KDLDKTNYVTQGLRAG 395
           KD+ + + ++  L+ G
Sbjct: 412 KDVGRAHRLSARLQYG 427


>gnl|CDD|183050 PRK11241, gabD, succinate-semialdehyde dehydrogenase I;
           Provisional.
          Length = 482

 Score =  267 bits (683), Expect = 1e-84
 Identities = 142/347 (40%), Positives = 205/347 (59%), Gaps = 6/347 (1%)

Query: 52  LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPIN-GDFFAYTRHEPVGVCG 110
           L TL+ GKP   +   ++ Y+   + ++A    +I+G TIP +  D       +P+GV  
Sbjct: 93  LMTLEQGKPLAEAKG-EISYAASFIEWFAEEGKRIYGDTIPGHQADKRLIVIKQPIGVTA 151

Query: 111 QIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPG 170
            I PWNFP  M+  K  PALA G T+VLKPA QTP +ALA+  L   AGIP GV N+V G
Sbjct: 152 AITPWNFPAAMITRKAGPALAAGCTMVLKPASQTPFSALALAELAIRAGIPAGVFNVVTG 211

Query: 171 Y-GDAGAALVQHPGVDKVAFTGSTEVGK-LVQQGSGASNLKRTTLELGGKSPNIIFADAD 228
             G  G  L  +P V K++FTGSTE+G+ L++Q   A ++K+ +LELGG +P I+F DAD
Sbjct: 212 SAGAVGGELTSNPLVRKLSFTGSTEIGRQLMEQC--AKDIKKVSLELGGNAPFIVFDDAD 269

Query: 229 LDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQG 288
           LD AVEGA    F N GQ C   +R +VQD +YD F  +  +   +  +GD  +  V  G
Sbjct: 270 LDKAVEGALASKFRNAGQTCVCANRLYVQDGVYDRFAEKLQQAVSKLHIGDGLEKGVTIG 329

Query: 289 PQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFG 348
           P ID++ + K+ E I     +GA++V GG      G + QPT+  +V  + K+A+EE FG
Sbjct: 330 PLIDEKAVAKVEEHIADALEKGARVVCGGKAHELGGNFFQPTILVDVPANAKVAKEETFG 389

Query: 349 PVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
           P+  L RF    +VI +AN++++GLAA  + +DL +   V + L  G
Sbjct: 390 PLAPLFRFKDEADVIAQANDTEFGLAAYFYARDLSRVFRVGEALEYG 436


>gnl|CDD|143416 cd07098, ALDH_F15-22, Aldehyde dehydrogenase family 15A1 and
           22A1-like.  Aldehyde dehydrogenase family members
           ALDH15A1 (Saccharomyces cerevisiae YHR039C) and ALDH22A1
           (Arabidopsis thaliana, EC=1.2.1.3), and similar
           sequences, are in this CD. Significant improvement of
           stress tolerance in tobacco plants was observed by
           overexpressing the ALDH22A1 gene from maize (Zea mays)
           and was accompanied by a reduction of malondialdehyde
           derived from cellular lipid peroxidation.
          Length = 465

 Score =  265 bits (679), Expect = 4e-84
 Identities = 111/305 (36%), Positives = 160/305 (52%), Gaps = 9/305 (2%)

Query: 99  AYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEA 158
           A   +EP+GV G I+ WN+P   L   I  AL  GN IV+K +EQ   ++    ++I E 
Sbjct: 114 ARVEYEPLGVVGAIVSWNYPFHNLLGPIIAALFAGNAIVVKVSEQVAWSSGFFLSIIREC 173

Query: 159 ----GIPEGVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLE 214
               G    +V +V    +   AL  HP +D + F GS  VGK V   + A +L    LE
Sbjct: 174 LAACGHDPDLVQLVTCLPETAEALTSHPVIDHITFIGSPPVGKKVMA-AAAESLTPVVLE 232

Query: 215 LGGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKR 274
           LGGK P I+  DADLD        G F + GQ C    R  V + IYD+ +    +R + 
Sbjct: 233 LGGKDPAIVLDDADLDQIASIIMRGTFQSSGQNCIGIERVIVHEKIYDKLLEILTDRVQA 292

Query: 275 RTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGD----KGFYVQPT 330
              G P D +V+ G  I   + D++ E +     +GA+L+AGG R       +G Y  PT
Sbjct: 293 LRQGPPLDGDVDVGAMISPARFDRLEELVADAVEKGARLLAGGKRYPHPEYPQGHYFPPT 352

Query: 331 VFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQ 390
           +  +V  DMKIA+EE+FGPV  +++ S  EE +E AN+++YGL A+VF KD+ +   +  
Sbjct: 353 LLVDVTPDMKIAQEEVFGPVMVVMKASDDEEAVEIANSTEYGLGASVFGKDIKRARRIAS 412

Query: 391 GLRAG 395
            L  G
Sbjct: 413 QLETG 417


>gnl|CDD|200087 TIGR01237, D1pyr5carbox2, delta-1-pyrroline-5-carboxylate
           dehydrogenase, group 2, putative.  This enzyme is the
           second of two in the degradation of proline to
           glutamate. This model represents one of several related
           branches of delta-1-pyrroline-5-carboxylate
           dehydrogenase. Members of this branch may be associated
           with proline dehydrogenase (the other enzyme of the
           pathway from proline to glutamate) but have not been
           demonstrated experimentally. The branches are not as
           closely related to each other as some distinct aldehyde
           dehydrogenases are to some; separate models were built
           to let each model describe a set of equivalogs [Energy
           metabolism, Amino acids and amines].
          Length = 511

 Score =  257 bits (659), Expect = 1e-80
 Identities = 137/358 (38%), Positives = 193/358 (53%), Gaps = 15/358 (4%)

Query: 50  SSLETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKI--HGKTIPINGDFFAYTRHEPVG 107
           S+L   + GKP+N +   +V  ++  + YYA    ++          G+   Y  + P G
Sbjct: 112 SALLVKEVGKPWNEA-DAEVAEAIDFMEYYARQMIELAKGKPVNSREGETNQYV-YTPTG 169

Query: 108 VCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNI 167
           V   I PWNFP  ++       + TGN +VLKPAE  P+ A     ++ EAG+P+GVV  
Sbjct: 170 VTVVISPWNFPFAIMVGMTVAPIVTGNCVVLKPAEAAPVIAAKFVEILEEAGLPKGVVQF 229

Query: 168 VPGYG-DAGAALVQHPGVDKVAFTGSTEVG-------KLVQQGSGASNLKRTTLELGGKS 219
           VPG G + G  LV HP    + FTGS EVG         VQ G    +LKR   E+GGK 
Sbjct: 230 VPGSGSEVGDYLVDHPKTSLITFTGSREVGTRIFERAAKVQPGQ--KHLKRVIAEMGGKD 287

Query: 220 PNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGD 279
             I+  DAD++ A + A    F   GQ C AGSR  V + +YDE V R  E  +   VG 
Sbjct: 288 TVIVDEDADIELAAQSAFTSAFGFAGQKCSAGSRAVVHEKVYDEVVERFVEITESLKVGP 347

Query: 280 PFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDM 339
           P   +V  GP ID++  +KI+E+IE GK++G +LV+GG     KG+++ PT+FA+V    
Sbjct: 348 PDSADVYVGPVIDQKSFNKIMEYIEIGKAEG-RLVSGGCGDDSKGYFIGPTIFADVDRKA 406

Query: 340 KIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
           ++A+EEIFGPV   IR S  +E +E ANN++YGL   V + + D  N        G L
Sbjct: 407 RLAQEEIFGPVVAFIRASDFDEALEIANNTEYGLTGGVISNNRDHINRAKAEFEVGNL 464


>gnl|CDD|212007 TIGR04284, aldehy_Rv0768, aldehyde dehydrogenase, Rv0768 family.
           This family describes a branch of the aldehyde
           dehydrogenase (NAD) family (see pfam00171) that includes
           Rv0768 from Mycobacterium tuberculosis. All members of
           this family belong to species predicted to synthesize
           mycofactocin, suggesting that this enzyme or another
           upstream or downstream in the same pathway might be
           mycofactocin-dependent. However, the taxonomic range of
           this family is not nearly broad enough to make that
           relationship conclusive [Unknown function, Enzymes of
           unknown specificity].
          Length = 480

 Score =  245 bits (626), Expect = 4e-76
 Identities = 128/300 (42%), Positives = 168/300 (56%), Gaps = 5/300 (1%)

Query: 102 RHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAE-AGI 160
           R E VGV G I PWNFP  +   K+ PALA GNT+VLKPA  TP  A  +G LIAE    
Sbjct: 138 RREAVGVVGAITPWNFPHQINLAKLGPALAAGNTVVLKPAPDTPWCAAVLGELIAEHTDF 197

Query: 161 PEGVVNIVPGYGDA-GAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKS 219
           P GVVNIV       GA L + P VD V+FTGST  G+ V     A+ LK+  LELGGKS
Sbjct: 198 PPGVVNIVTSSDHRLGALLAKDPRVDMVSFTGSTATGRAVM-ADAAATLKKVFLELGGKS 256

Query: 220 PNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGD 279
             I+  DADL AA   A F +  + GQ C   +R  V    YDE VA +         GD
Sbjct: 257 AFIVLDDADLAAACSMAAFTVCMHAGQGCAITTRLVVPRARYDEAVAAAAATMGSIKPGD 316

Query: 280 PFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDK--GFYVQPTVFANVRD 337
           P D     GP I   Q D++  +++   ++G +   GGGR  D+  GF+V+PTV A + +
Sbjct: 317 PADPGTVCGPVISARQRDRVQSYLDLAVAEGGRFACGGGRPADRDRGFFVEPTVIAGLDN 376

Query: 338 DMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
           + ++AREEIFGPV  +I     ++ +  AN+S YGL+  VF  D ++   V   +R GT+
Sbjct: 377 NARVAREEIFGPVLTVIAHDGDDDAVRIANDSPYGLSGTVFGADPERAAAVAARVRTGTV 436


>gnl|CDD|143448 cd07130, ALDH_F7_AASADH, NAD+-dependent alpha-aminoadipic
           semialdehyde dehydrogenase, ALDH family members 7A1 and
           7B.  Alpha-aminoadipic semialdehyde dehydrogenase
           (AASADH, EC=1.2.1.31), also known as ALDH7A1,
           Antiquitin-1, ALDH7B, or
           delta-1-piperideine-6-carboxylate dehydrogenase (P6CDH),
           is a NAD+-dependent ALDH. Human ALDH7A1 is involved in
           the pipecolic acid pathway of lysine catabolism,
           catalyzing the oxidation of alpha-aminoadipic
           semialdehyde to alpha-aminoadipate.  Arabidopsis
           thaliana ALDH7B4 appears to be an
           osmotic-stress-inducible ALDH gene encoding a
           turgor-responsive or stress-inducible ALDH. The
           Streptomyces clavuligerus P6CDH appears to be involved
           in cephamycin biosynthesis, catalyzing the second stage
           of the two-step conversion of lysine to
           alpha-aminoadipic acid.  The ALDH7A1 enzyme and others
           in this group have been observed as tetramers, yet the
           bacterial P6CDH enzyme has been reported as a monomer.
          Length = 474

 Score =  236 bits (604), Expect = 7e-73
 Identities = 112/282 (39%), Positives = 155/282 (54%), Gaps = 6/282 (2%)

Query: 105 PVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEA----GI 160
           P+GV G I  +NFP+ +  W  A AL  GN +V KP+  TPLTA+AV  ++A      G+
Sbjct: 132 PLGVVGVITAFNFPVAVWGWNAAIALVCGNVVVWKPSPTTPLTAIAVTKIVARVLEKNGL 191

Query: 161 PEGVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSP 220
           P  + ++V G  D G ALV+ P V  V+FTGST VG+ V Q + A+   R+ LELGG + 
Sbjct: 192 PGAIASLVCGGADVGEALVKDPRVPLVSFTGSTAVGRQVGQ-AVAARFGRSLLELGGNNA 250

Query: 221 NIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDP 280
            I+  DADLD AV    F      GQ C    R  V ++IYDE + R  +  K+  +GDP
Sbjct: 251 IIVMEDADLDLAVRAVLFAAVGTAGQRCTTTRRLIVHESIYDEVLERLKKAYKQVRIGDP 310

Query: 281 FDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMK 340
            D     GP   K  +D  L  IE  KSQG  ++ GG      G YV+PT+      D  
Sbjct: 311 LDDGTLVGPLHTKAAVDNYLAAIEEAKSQGGTVLFGGKVIDGPGNYVEPTI-VEGLSDAP 369

Query: 341 IAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDL 382
           I +EE F P+  +++F ++EE I   N    GL++++FT DL
Sbjct: 370 IVKEETFAPILYVLKFDTLEEAIAWNNEVPQGLSSSIFTTDL 411


>gnl|CDD|132294 TIGR03250, PhnAcAld_DH, putative phosphonoacetaldehyde
           dehydrogenase.  This family of genes are members of the
           pfam00171 NAD-dependent aldehyde dehydrogenase family.
           These genes are observed in Ralstonia eutropha JMP134,
           Sinorhizobium meliloti 1021, Burkholderia mallei ATCC
           23344, Burkholderia thailandensis E264, Burkholderia
           cenocepacia AU 1054, Burkholderia pseudomallei K96243
           and 1710b, Burkholderia xenovorans LB400, Burkholderia
           sp. 383 and Polaromonas sp. JS666 in close proximity to
           the PhnW gene (TIGR02326) encoding 2-aminoethyl
           phosphonate aminotransferase (which generates
           phosphonoacetaldehyde) and PhnA (TIGR02335) encoding
           phosphonoacetate hydrolase (not to be confused with the
           alkylphosphonate utilization operon protein PhnA modeled
           by TIGR00686). Additionally, transporters believed to be
           specific for 2-aminoethyl phosphonate are often present.
           PhnW is, in other organisms, coupled with PhnX
           (TIGR01422) for the degradation of phosphonoacetaldehyde
           (GenProp0238), but PhnX is apparently absent in each of
           the organisms containing this aldehyde reductase. PhnA,
           characterized in a strain of Pseudomonas fluorescens
           that has not het been genome sequenced, is only rarely
           found outside of the PhnW and aldehyde dehydrogenase
           context. For instance in Rhodopseudomonas and Bordetella
           bronchiseptica, where it is adjacent to transporters
           presumably specific for the import of phosphonoacetate.
           It seems reasonably certain then, that this enzyme
           catalyzes the NAD-dependent oxidation of
           phosphonoacetaldehyde to phosphonoacetate, bridging the
           metabolic gap between PhnW and PhnA. We propose the name
           phosphonoacetaldehyde dehydrogenase and the gene symbol
           PhnY for this enzyme.
          Length = 472

 Score =  227 bits (579), Expect = 3e-69
 Identities = 118/313 (37%), Positives = 171/313 (54%), Gaps = 14/313 (4%)

Query: 87  HGKTIPINGDFFAYTRHEPV-GVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTP 145
           HGK   +      +T+ EP+ GV   I P+N P+  +A KIAPA+AT N +V+KP+E+TP
Sbjct: 126 HGKARKV------FTQREPLLGVISAITPFNHPMNQVAHKIAPAIATNNRMVVKPSEKTP 179

Query: 146 LTALAVGALIAEAGIPEGVVNIVPGY-GDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSG 204
           L+AL +  ++ EAG+P  ++ +V G   +    L+ +P VD V FTG   +GK +   +G
Sbjct: 180 LSALYLADILYEAGLPPQMLQVVTGDPREIADELITNPHVDLVTFTGGVAIGKYIAARAG 239

Query: 205 ASNLKRTTLELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEF 264
               +R  LELGG  P I+  DADLD A + A  G + N GQ C A  R  VQ+++ D F
Sbjct: 240 ---YRRQVLELGGNDPLIVMEDADLDRAADLAVKGSYKNSGQRCTAVKRMLVQESVADRF 296

Query: 265 VARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKG 324
                E+ +    GDP D +V+ G  ID+         +    +QGA+L+ G  R G   
Sbjct: 297 TELLVEKTRAWRYGDPMDPSVDMGTVIDEAAAILFEARVNEAIAQGARLLLGNVRDG--A 354

Query: 325 FYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDK 384
            Y  PTV   V   M + REE FGPV  +IRF  I++ I  +N++ YGL++ V T  LD 
Sbjct: 355 LY-APTVLDRVDPSMTLVREETFGPVSPVIRFCDIDDAIRISNSTAYGLSSGVCTNRLDY 413

Query: 385 TNYVTQGLRAGTL 397
                  L+ GT+
Sbjct: 414 ITRFIAELQVGTV 426


>gnl|CDD|181826 PRK09406, gabD1, succinic semialdehyde dehydrogenase; Reviewed.
          Length = 457

 Score =  210 bits (537), Expect = 3e-63
 Identities = 119/347 (34%), Positives = 171/347 (49%), Gaps = 5/347 (1%)

Query: 52  LETLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKI---HGKTIPINGDFFAYTRHEPVGV 108
           L TL+ GK    S   +     +  RYYA  A+ +           G   AY R++P+GV
Sbjct: 68  LMTLEMGKTLA-SAKAEALKCAKGFRYYAEHAEALLADEPADAAAVGASRAYVRYQPLGV 126

Query: 109 CGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIV 168
              ++PWNFPL  +    APAL  GN  +LK A   P TAL +  L   AG P+G    +
Sbjct: 127 VLAVMPWNFPLWQVVRFAAPALMAGNVGLLKHASNVPQTALYLADLFRRAGFPDGCFQTL 186

Query: 169 PGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADAD 228
                A  A+++ P V     TGS   G+ V   +G   +K+T LELGG  P I+   AD
Sbjct: 187 LVGSGAVEAILRDPRVAAATLTGSEPAGRAVAAIAG-DEIKKTVLELGGSDPFIVMPSAD 245

Query: 229 LDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQG 288
           LD A E A      N GQ C A  R  V   +YD F  +   R     VGDP D + + G
Sbjct: 246 LDRAAETAVTARVQNNGQSCIAAKRFIVHADVYDAFAEKFVARMAALRVGDPTDPDTDVG 305

Query: 289 PQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFG 348
           P   ++  D++ + ++   + GA ++ GG R    G++  PTV  ++  DM++  EE+FG
Sbjct: 306 PLATEQGRDEVEKQVDDAVAAGATILCGGKRPDGPGWFYPPTVITDITPDMRLYTEEVFG 365

Query: 349 PVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
           PV  L R + I+E IE AN + +GL +  +T+D  +       L AG
Sbjct: 366 PVASLYRVADIDEAIEIANATTFGLGSNAWTRDEAEQERFIDDLEAG 412


>gnl|CDD|143466 cd07148, ALDH_RL0313, Uncharacterized ALDH ( RL0313) with
           similarity to Tortula ruralis aldehyde dehydrogenase
           ALDH21A1.  Uncharacterized aldehyde dehydrogenase (locus
           RL0313) with sequence similarity to the moss Tortula
           ruralis aldehyde dehydrogenase ALDH21A1 (RNP123)
           believed to play an important role in the detoxification
           of aldehydes generated in response to desiccation- and
           salinity-stress, and similar sequences are included in
           this CD.
          Length = 455

 Score =  209 bits (534), Expect = 8e-63
 Identities = 111/296 (37%), Positives = 161/296 (54%), Gaps = 4/296 (1%)

Query: 99  AYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEA 158
           A+T  EP+GV   I  +N PL ++  ++APA+A G  +++KPA  TPL+ LA   L+ EA
Sbjct: 118 AFTTREPIGVVVAISAFNHPLNLIVHQVAPAIAAGCPVIVKPALATPLSCLAFVDLLHEA 177

Query: 159 GIPEGVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGK 218
           G+PEG    VP        LV  P V   +F GS  VG +++  S  +   R  LE GG 
Sbjct: 178 GLPEGWCQAVPCENAVAEKLVTDPRVAFFSFIGSARVGWMLR--SKLAPGTRCALEHGGA 235

Query: 219 SPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVG 278
           +P I+   ADLDA +     G F++ GQ C +  R FV   I D+F  R    A++  VG
Sbjct: 236 APVIVDRSADLDAMIPPLVKGGFYHAGQVCVSVQRVFVPAEIADDFAQRLAAAAEKLVVG 295

Query: 279 DPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDD 338
           DP D + E GP I   ++D++ E++    + GA+L+ GG R  D  +   PTV  +   D
Sbjct: 296 DPTDPDTEVGPLIRPREVDRVEEWVNEAVAAGARLLCGGKRLSDTTY--APTVLLDPPRD 353

Query: 339 MKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRA 394
            K++ +EIFGPV  +  +  ++E I +AN+      AAVFTKDLD      + L A
Sbjct: 354 AKVSTQEIFGPVVCVYSYDDLDEAIAQANSLPVAFQAAVFTKDLDVALKAVRRLDA 409


>gnl|CDD|143452 cd07134, ALDH_AlkH-like, Pseudomonas putida Aldehyde dehydrogenase
           AlkH-like.  Aldehyde dehydrogenase AlkH (locus name
           P12693, EC=1.2.1.3) of the alkBFGHJKL operon that allows
           Pseudomonas putida to metabolize alkanes and the
           aldehyde dehydrogenase AldX of Bacillus subtilis (locus
           P46329, EC=1.2.1.3), and similar sequences, are present
           in this CD.
          Length = 433

 Score =  207 bits (530), Expect = 2e-62
 Identities = 109/302 (36%), Positives = 156/302 (51%), Gaps = 13/302 (4%)

Query: 99  AYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEA 158
           +  R+EP GVC  I PWN+P  +    +  A+A GNT +LKP+E TP T+  +  +I EA
Sbjct: 94  SKIRYEPKGVCLIISPWNYPFNLAFGPLVSAIAAGNTAILKPSELTPHTSAVIAKIIREA 153

Query: 159 GIPEGVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGK 218
              E  V +  G  +   AL++ P  D + FTGS  VGK+V   + A +L   TLELGGK
Sbjct: 154 -FDEDEVAVFEGDAEVAQALLELP-FDHIFFTGSPAVGKIVMAAA-AKHLASVTLELGGK 210

Query: 219 SPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVG 278
           SP I+   ADL  A +   +G F N GQ C A    FV +++ D FV       ++    
Sbjct: 211 SPTIVDETADLKKAAKKIAWGKFLNAGQTCIAPDYVFVHESVKDAFVEHLKAEIEKFYGK 270

Query: 279 DPFDLNVEQGPQ----IDKEQMDKILEFIESGKSQGAQLVAGGG-RAGDKGFYVQPTVFA 333
           D      +  P     ++    D++   ++   ++GA++  GG   A  +  Y+ PTV  
Sbjct: 271 DA---ARKASPDLARIVNDRHFDRLKGLLDDAVAKGAKVEFGGQFDAAQR--YIAPTVLT 325

Query: 334 NVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLR 393
           NV  DMKI +EEIFGPV  +I +  ++EVIE  N     LA  VF+KD    N V     
Sbjct: 326 NVTPDMKIMQEEIFGPVLPIITYEDLDEVIEYINAKPKPLALYVFSKDKANVNKVLARTS 385

Query: 394 AG 395
           +G
Sbjct: 386 SG 387


>gnl|CDD|143406 cd07087, ALDH_F3-13-14_CALDH-like, ALDH subfamily: Coniferyl
           aldehyde dehydrogenase, ALDH families 3, 13, and 14, and
           other related proteins.  ALDH subfamily which includes
           NAD(P)+-dependent, aldehyde dehydrogenase, family 3
           member A1 and B1  (ALDH3A1, ALDH3B1,  EC=1.2.1.5) and
           fatty aldehyde dehydrogenase, family 3 member A2
           (ALDH3A2, EC=1.2.1.3), and also plant ALDH family
           members ALDH3F1, ALDH3H1, and ALDH3I1, fungal ALDH14
           (YMR110C) and the protozoan family 13 member (ALDH13),
           as well as coniferyl aldehyde dehydrogenases (CALDH,
           EC=1.2.1.68), and other similar  sequences, such as the
           Pseudomonas putida benzaldehyde dehydrogenase I that is
           involved in the metabolism of mandelate.
          Length = 426

 Score =  203 bits (518), Expect = 1e-60
 Identities = 102/305 (33%), Positives = 156/305 (51%), Gaps = 22/305 (7%)

Query: 99  AYTRHEPVGVCGQIIPWNFPLLMLAWKIAP---ALATGNTIVLKPAEQTPLTALAVGALI 155
           AY   EP+GV   I PWN+PL +    +AP   A+A GNT+VLKP+E  P T+  +  LI
Sbjct: 94  AYVIPEPLGVVLIIGPWNYPLQLA---LAPLIGAIAAGNTVVLKPSELAPATSALLAKLI 150

Query: 156 AEAGIPEGVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLEL 215
            +       V +V G  +   AL+  P  D + FTGS  VGK+V + + A +L   TLEL
Sbjct: 151 PKY-FDPEAVAVVEGGVEVATALLAEP-FDHIFFTGSPAVGKIVMEAA-AKHLTPVTLEL 207

Query: 216 GGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRR 275
           GGKSP I+  DA+L+ A     +G F N GQ C A     V ++I DE +    E  K+ 
Sbjct: 208 GGKSPCIVDKDANLEVAARRIAWGKFLNAGQTCIAPDYVLVHESIKDELI----EELKKA 263

Query: 276 TV---GDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVF 332
                G+    + + G  I++   D++   ++ GK      V  GG+   +  Y+ PT+ 
Sbjct: 264 IKEFYGEDPKESPDYGRIINERHFDRLASLLDDGK------VVIGGQVDKEERYIAPTIL 317

Query: 333 ANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGL 392
            +V  D  + +EEIFGP+  ++ +  ++E IE  N+    LA  +F++D      V    
Sbjct: 318 DDVSPDSPLMQEEIFGPILPILTYDDLDEAIEFINSRPKPLALYLFSEDKAVQERVLAET 377

Query: 393 RAGTL 397
            +G +
Sbjct: 378 SSGGV 382


>gnl|CDD|143414 cd07095, ALDH_SGSD_AstD, N-succinylglutamate 5-semialdehyde
           dehydrogenase, AstD-like.  N-succinylglutamate
           5-semialdehyde dehydrogenase or succinylglutamic
           semialdehyde dehydrogenase (SGSD, E. coli AstD,
           EC=1.2.1.71) involved in L-arginine degradation via the
           arginine succinyltransferase (AST) pathway and catalyzes
           the NAD+-dependent reduction of succinylglutamate
           semialdehyde into succinylglutamate.
          Length = 431

 Score =  198 bits (506), Expect = 7e-59
 Identities = 102/305 (33%), Positives = 143/305 (46%), Gaps = 12/305 (3%)

Query: 99  AYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEA 158
           A  RH P GV     P+NFP  +    I PAL  GNT+V KP+E TP  A  +  L  EA
Sbjct: 91  AVLRHRPHGVMAVFGPFNFPGHLPNGHIVPALLAGNTVVFKPSELTPAVAELMVELWEEA 150

Query: 159 GIPEGVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGK 218
           G+P GV+N+V G  + G AL  H G+D + FTGS   G L+ +       K   LE+GG 
Sbjct: 151 GLPPGVLNLVQGGRETGEALAAHEGIDGLLFTGSAATGLLLHRQFAGRPGKILALEMGGN 210

Query: 219 SPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFV-QDTIYDEFVARSGERAKRRTV 277
           +P +++  AD+DAA        F   GQ C    R  V    + D F+ R  E AKR  +
Sbjct: 211 NPLVVWDVADIDAAAYLIVQSAFLTAGQRCTCARRLIVPDGAVGDAFLERLVEAAKRLRI 270

Query: 278 GDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQP-----TVF 332
           G P       GP I      + L   +   + G + +    R      ++ P     T  
Sbjct: 271 GAPDAEPPFMGPLIIAAAAARYLLAQQDLLALGGEPLLAMERLVAGTAFLSPGIIDVTDA 330

Query: 333 ANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGL 392
           A+V D      EEIFGP+ Q+ R+   +E I  AN + +GL+A + + D          +
Sbjct: 331 ADVPD------EEIFGPLLQVYRYDDFDEAIALANATRFGLSAGLLSDDEALFERFLARI 384

Query: 393 RAGTL 397
           RAG +
Sbjct: 385 RAGIV 389


>gnl|CDD|143402 cd07083, ALDH_P5CDH, ALDH subfamily NAD+-dependent
           delta(1)-pyrroline-5-carboxylate dehydrogenase-like.
           ALDH subfamily of the NAD+-dependent,
           delta(1)-pyrroline-5-carboxylate dehydrogenases (P5CDH,
           EC=1.5.1.12). The proline catabolic enzymes, proline
           dehydrogenase and P5CDH catalyze the two-step oxidation
           of proline to glutamate.  P5CDH catalyzes the oxidation
           of glutamate semialdehyde, utilizing NAD+ as the
           electron acceptor. In some bacteria, the two enzymes are
           fused into the bifunctional flavoenzyme, proline
           utilization A (PutA). These enzymes play important roles
           in cellular redox control, superoxide generation, and
           apoptosis. In certain prokaryotes such as Escherichia
           coli, PutA is also a transcriptional repressor of the
           proline utilization genes. Monofunctional enzyme
           sequences such as those seen in the Bacillus RocA P5CDH
           are also present in this subfamily as well as the human
           ALDH4A1 P5CDH and the Drosophila Aldh17 P5CDH.
          Length = 500

 Score =  199 bits (507), Expect = 2e-58
 Identities = 119/332 (35%), Positives = 172/332 (51%), Gaps = 13/332 (3%)

Query: 77  RYYAGFADKIHGK---TIPINGDFFAYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATG 133
           RYYA  A ++       +P  G+    + +  +G    I PWNFP+ +    I   +A G
Sbjct: 124 RYYARAALRLRYPAVEVVPYPGEDNE-SFYVGLGAGVVISPWNFPVAIFTGMIVAPVAVG 182

Query: 134 NTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYGDA-GAALVQHPGVDKVAFTGS 192
           NT++ KPAE   +    V  +  EAG P GVV  +PG G+  GA L +H  +  + FTGS
Sbjct: 183 NTVIAKPAEDAVVVGYKVFEIFHEAGFPPGVVQFLPGVGEEVGAYLTEHERIRGINFTGS 242

Query: 193 TEVGKLV-----QQGSGASNLKRTTLELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQC 247
            E GK +     +   G +  KR  +E GGK+  I+   AD +  VEG     F   GQ 
Sbjct: 243 LETGKKIYEAAARLAPGQTWFKRLYVETGGKNAIIVDETADFELVVEGVVVSAFGFQGQK 302

Query: 248 CCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGK 307
           C A SR  +    Y+  + R  +RA+R +VG P +   + GP ID EQ  K+L +IE GK
Sbjct: 303 CSAASRLILTQGAYEPVLERLLKRAERLSVGPPEENGTDLGPVIDAEQEAKVLSYIEHGK 362

Query: 308 SQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQQLIRF--SSIEEVIER 365
           ++G QLV GG R   +G++V PTV   V    +IA+EEIFGPV  +IR+      E +E 
Sbjct: 363 NEG-QLVLGGKRLEGEGYFVAPTVVEEVPPKARIAQEEIFGPVLSVIRYKDDDFAEALEV 421

Query: 366 ANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
           AN++ YGL   V+++  +      +    G L
Sbjct: 422 ANSTPYGLTGGVYSRKREHLEEARREFHVGNL 453


>gnl|CDD|143443 cd07125, ALDH_PutA-P5CDH, Delta(1)-pyrroline-5-carboxylate
           dehydrogenase, PutA.  The proline catabolic enzymes of
           the aldehyde dehydrogenase (ALDH) protein superfamily,
           proline dehydrogenase and
           Delta(1)-pyrroline-5-carboxylate dehydrogenase (P5CDH,
           (EC=1.5.1.12 )), catalyze the two-step oxidation of
           proline to glutamate; P5CDH catalyzes the oxidation of
           glutamate semialdehyde, utilizing NAD+ as the electron
           acceptor. In some bacteria, the two enzymes are fused
           into the bifunctional flavoenzyme, proline utilization A
           (PutA) These enzymes play important roles in cellular
           redox control, superoxide generation, and apoptosis. In
           certain prokaryotes such as Escherichia coli, PutA is
           also a transcriptional repressor of the proline
           utilization genes.
          Length = 518

 Score =  196 bits (501), Expect = 2e-57
 Identities = 116/333 (34%), Positives = 166/333 (49%), Gaps = 23/333 (6%)

Query: 76  LRYYAGFADK------IHGKTIPINGDFFAYTRHEP--VGVCGQIIPWNFPLLMLAWKIA 127
            RYYA  A +      + G T  +NG            V VC  I PWNFPL +   +IA
Sbjct: 137 CRYYAAQARELFSDPELPGPTGELNGLEL-----HGRGVFVC--ISPWNFPLAIFTGQIA 189

Query: 128 PALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYG-DAGAALVQHPGVDK 186
            ALA GNT++ KPAEQTPL A     L+ EAG+P  V+ +VPG G + G ALV HP +D 
Sbjct: 190 AALAAGNTVIAKPAEQTPLIAARAVELLHEAGVPRDVLQLVPGDGEEIGEALVAHPRIDG 249

Query: 187 VAFTGSTEVGKLVQQGSGASNLKRTTL--ELGGKSPNIIFADADLDAAVEGAHFGLFFNM 244
           V FTGSTE  KL+ +     +     L  E GGK+  I+ + A  + AV+      F + 
Sbjct: 250 VIFTGSTETAKLINRALAERDGPILPLIAETGGKNAMIVDSTALPEQAVKDVVQSAFGSA 309

Query: 245 GQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIE 304
           GQ C A    ++Q+ I + F+           VGDP+DL+ + GP IDK     +    E
Sbjct: 310 GQRCSALRLLYLQEEIAERFIEMLKGAMASLKVGDPWDLSTDVGPLIDKPAGKLLRAHTE 369

Query: 305 SGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSS--IEEV 362
             + + A L+A        G++V P +   V         E+FGP+  +IRF +  ++E 
Sbjct: 370 LMRGE-AWLIAPAPLDDGNGYFVAPGIIEIVGIF--DLTTEVFGPILHVIRFKAEDLDEA 426

Query: 363 IERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
           IE  N + YGL   + ++D  +  Y  + + AG
Sbjct: 427 IEDINATGYGLTLGIHSRDEREIEYWRERVEAG 459


>gnl|CDD|143451 cd07133, ALDH_CALDH_CalB, Coniferyl aldehyde dehydrogenase-like.
           Coniferyl aldehyde dehydrogenase (CALDH, EC=1.2.1.68) of
           Pseudomonas sp. strain HR199 (CalB) which catalyzes the
           NAD+-dependent oxidation of coniferyl aldehyde to
           ferulic acid, and similar sequences, are present in this
           CD.
          Length = 434

 Score =  191 bits (487), Expect = 5e-56
 Identities = 108/304 (35%), Positives = 160/304 (52%), Gaps = 17/304 (5%)

Query: 99  AYTRHEPVGVCGQIIPWNFPLLMLAWKIAP---ALATGNTIVLKPAEQTPLTALAVGALI 155
           A   ++P+GV G I+PWN+PL  LA  + P   ALA GN +++KP+E TP T+  +  L+
Sbjct: 95  AEVEYQPLGVVGIIVPWNYPL-YLA--LGPLIAALAAGNRVMIKPSEFTPRTSALLAELL 151

Query: 156 AEAGIPEGVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLEL 215
           AE    E  V +V G  D  AA    P  D + FTGST VG+ V + + A NL   TLEL
Sbjct: 152 AEY-FDEDEVAVVTGGADVAAAFSSLP-FDHLLFTGSTAVGRHVMR-AAAENLTPVTLEL 208

Query: 216 GGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRR 275
           GGKSP II  DADL  A E   FG   N GQ C A     V +   +EFVA +     + 
Sbjct: 209 GGKSPAIIAPDADLAKAAERIAFGKLLNAGQTCVAPDYVLVPEDKLEEFVAAAKAAVAKM 268

Query: 276 --TVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLV--AGGGRAGDKGFYVQPTV 331
             T+ D    N +    I++    ++   +E  +++GA+++     G        + PT+
Sbjct: 269 YPTLAD----NPDYTSIINERHYARLQGLLEDARAKGARVIELNPAGEDFAATRKLPPTL 324

Query: 332 FANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQG 391
             NV DDM++ +EEIFGP+  ++ + S++E I+  N     LA   F +D  + + V + 
Sbjct: 325 VLNVTDDMRVMQEEIFGPILPILTYDSLDEAIDYINARPRPLALYYFGEDKAEQDRVLRR 384

Query: 392 LRAG 395
             +G
Sbjct: 385 THSG 388


>gnl|CDD|130783 TIGR01722, MMSDH, methylmalonic acid semialdehyde dehydrogenase.
           Involved in valine catabolism,
           methylmalonate-semialdehyde dehydrogenase catalyzes the
           irreversible NAD+- and CoA-dependent oxidative
           decarboxylation of methylmalonate semialdehyde to
           propionyl-CoA. Methylmalonate-semialdehyde dehydrogenase
           has been characterized in both prokaryotes and
           eukaryotes, functioning as a mammalian tetramer and a
           bacterial homodimer. Although similar in monomeric
           molecular mass and enzymatic activity, the N-terminal
           sequence in P.aeruginosa does not correspond with the
           N-terminal sequence predicted for rat liver. Sequence
           homology to a variety of prokaryotic and eukaryotic
           aldehyde dehydrogenases places MMSDH in the aldehyde
           dehydrogenase (NAD+) superfamily (pfam00171), making
           MMSDH's CoA requirement unique among known ALDHs.
           Methylmalonate semialdehyde dehydrogenase is closely
           related to betaine aldehyde dehydrogenase,
           2-hydroxymuconic semialdehyde dehydrogenase, and class 1
           and 2 aldehyde dehydrogenase. In Bacillus, a highly
           homologous protein to methylmalonic acid semialdehyde
           dehydrogenase, groups out from the main MMSDH clade with
           Listeria and Sulfolobus. This Bacillus protein has been
           suggested to be located in an iol operon and/or involved
           in myo-inositol catabolism, converting malonic
           semialdehyde to acetyl CoA ad CO2. The preceeding
           enzymes responsible for valine catabolism are present in
           Bacillus, Listeria, and Sulfolobus [Energy metabolism,
           Amino acids and amines].
          Length = 477

 Score =  190 bits (485), Expect = 1e-55
 Identities = 110/319 (34%), Positives = 168/319 (52%), Gaps = 7/319 (2%)

Query: 68  DVEYSLRTLRYYAGFADKIHGKTIP-INGDFFAYTRHEPVGVCGQIIPWNFPLLMLAWKI 126
           DV   L  + +  G    + G+T   +      Y+  +P+GVC  I P+NFP ++  W  
Sbjct: 98  DVARGLEVVEHACGVNSLLKGETSTQVATRVDVYSIRQPLGVCAGITPFNFPAMIPLWMF 157

Query: 127 APALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYGDAGAALVQHPGVDK 186
             A+A GNT VLKP+E+ P  A+ +  L +EAG P+GV+N+V G  +A   L++HP V  
Sbjct: 158 PIAIACGNTFVLKPSEKVPSAAVKLAELFSEAGAPDGVLNVVHGDKEAVDRLLEHPDVKA 217

Query: 187 VAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQ 246
           V+F GST +G+ +   +G+++ KR     G K+  ++  DAD DAA +      +   GQ
Sbjct: 218 VSFVGSTPIGRYIHT-TGSAHGKRVQALGGAKNHMVVMPDADKDAAADALVGAAYGAAGQ 276

Query: 247 CCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESG 306
            C A S   V     DE+V    ERA++  +G   D   E GP I  +  D++   I  G
Sbjct: 277 RCMAIS-AAVLVGAADEWVPEIRERAEKIRIGPGDDPGAEMGPLITPQAKDRVASLIAGG 335

Query: 307 KSQGAQLVAGGGRAGDKGF----YVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIEEV 362
            ++GA+++  G      G+    +V PT+   V   MK  +EEIFGPV  ++   ++EE 
Sbjct: 336 AAEGAEVLLDGRGYKVDGYEEGNWVGPTLLERVPPTMKAYQEEIFGPVLCVLEADTLEEA 395

Query: 363 IERANNSDYGLAAAVFTKD 381
           I   N S YG   A+FT+D
Sbjct: 396 IALINASPYGNGTAIFTRD 414


>gnl|CDD|143454 cd07136, ALDH_YwdH-P39616, Bacillus subtilis aldehyde dehydrogenase
           ywdH-like.  Uncharacterized Bacillus subtilis ywdH
           aldehyde dehydrogenase (locus P39616)  most closely
           related to the ALDHs and fatty ALDHs of families 3 and
           14, and similar sequences, are included in this CD.
          Length = 449

 Score =  189 bits (482), Expect = 4e-55
 Identities = 99/300 (33%), Positives = 158/300 (52%), Gaps = 16/300 (5%)

Query: 99  AYTRHEPVGVCGQIIPWNFPLLMLAWKIAP---ALATGNTIVLKPAEQTPLTALAVGALI 155
           +Y  +EP GV   I PWN+P  +    +AP   A+A GNT VLKP+E TP T+  +  +I
Sbjct: 94  SYIYYEPYGVVLIIAPWNYPFQLA---LAPLIGAIAAGNTAVLKPSELTPNTSKVIAKII 150

Query: 156 AEAGIPEGVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLEL 215
            E    E  V +V G  +    L+     D + FTGS  VGK+V + + A +L   TLEL
Sbjct: 151 EET-FDEEYVAVVEGGVEENQELLDQK-FDYIFFTGSVRVGKIVME-AAAKHLTPVTLEL 207

Query: 216 GGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRR 275
           GGKSP I+  DA+L  A +   +G F N GQ C A     V +++ ++F+    E  K+ 
Sbjct: 208 GGKSPCIVDEDANLKLAAKRIVWGKFLNAGQTCVAPDYVLVHESVKEKFIKELKEEIKKF 267

Query: 276 TVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANV 335
              DP + + + G  I+++  D++   +++GK      +  GG    +  Y++PT+  NV
Sbjct: 268 YGEDPLE-SPDYGRIINEKHFDRLAGLLDNGK------IVFGGNTDRETLYIEPTILDNV 320

Query: 336 RDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
             D  + +EEIFGP+  ++ + +++E IE   +    LA  +F++D      V + L  G
Sbjct: 321 TWDDPVMQEEIFGPILPVLTYDTLDEAIEIIKSRPKPLALYLFSEDKKVEKKVLENLSFG 380


>gnl|CDD|184426 PRK13968, PRK13968, putative succinate semialdehyde dehydrogenase;
           Provisional.
          Length = 462

 Score =  183 bits (465), Expect = 1e-52
 Identities = 99/283 (34%), Positives = 144/283 (50%), Gaps = 1/283 (0%)

Query: 99  AYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEA 158
           A   + P+G    I+PWNFPL  +     P L  GN  +LK A      A  +  +  +A
Sbjct: 120 AVIEYRPLGTILAIMPWNFPLWQVMRGAVPILLAGNGYLLKHAPNVMGCAQLIAQVFKDA 179

Query: 159 GIPEGVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGK 218
           GIP+GV   +    D  + ++    +  V  TGS   G  +   +GA+ LK+  LELGG 
Sbjct: 180 GIPQGVYGWLNADNDGVSQMINDSRIAAVTVTGSVRAGAAIGAQAGAA-LKKCVLELGGS 238

Query: 219 SPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVG 278
            P I+  DADL+ AV+ A  G + N GQ C A  R  +++ I   F  R    A    +G
Sbjct: 239 DPFIVLNDADLELAVKAAVAGRYQNTGQVCAAAKRFIIEEGIASAFTERFVAAAAALKMG 298

Query: 279 DPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDD 338
           DP D     GP    +  D++   +E+  ++GA+L+ GG +    G Y  PTV ANV  +
Sbjct: 299 DPRDEENALGPMARFDLRDELHHQVEATLAEGARLLLGGEKIAGAGNYYAPTVLANVTPE 358

Query: 339 MKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKD 381
           M   REE+FGPV  +      E  +E AN+S++GL+A +FT D
Sbjct: 359 MTAFREELFGPVAAITVAKDAEHALELANDSEFGLSATIFTTD 401


>gnl|CDD|215110 PLN00412, PLN00412, NADP-dependent glyceraldehyde-3-phosphate
           dehydrogenase; Provisional.
          Length = 496

 Score =  182 bits (463), Expect = 4e-52
 Identities = 105/294 (35%), Positives = 157/294 (53%), Gaps = 9/294 (3%)

Query: 105 PVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGV 164
           P+GV   I P+N+P+ +   KIAPAL  GN +VLKP  Q  + AL +      AG P+G+
Sbjct: 158 PLGVVLAIPPFNYPVNLAVSKIAPALIAGNAVVLKPPTQGAVAALHMVHCFHLAGFPKGL 217

Query: 165 VNIVPGYG-DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNII 223
           ++ V G G + G  L  HPGV+ ++FTG  + G  + + +G   L+   +ELGGK   I+
Sbjct: 218 ISCVTGKGSEIGDFLTMHPGVNCISFTGG-DTGIAISKKAGMVPLQ---MELGGKDACIV 273

Query: 224 FADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDL 283
             DADLD A      G F   GQ C A     V +++ D  V +   +  + TVG P D 
Sbjct: 274 LEDADLDLAAANIIKGGFSYSGQRCTAVKVVLVMESVADALVEKVNAKVAKLTVGPPED- 332

Query: 284 NVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAR 343
           + +  P + +   + I   +   K +GA       R G+    + P +  NVR DM+IA 
Sbjct: 333 DCDITPVVSESSANFIEGLVMDAKEKGATFCQEWKREGN---LIWPLLLDNVRPDMRIAW 389

Query: 344 EEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
           EE FGPV  +IR +S+EE I   N S++GL   VFT+D++K   ++  +  GT+
Sbjct: 390 EEPFGPVLPVIRINSVEEGIHHCNASNFGLQGCVFTRDINKAILISDAMETGTV 443


>gnl|CDD|143453 cd07135, ALDH_F14-YMR110C, Saccharomyces cerevisiae aldehyde
           dehydrogenase family 14 and related proteins.  Aldehyde
           dehydrogenase family 14 (ALDH14), isolated mainly from
           the mitochondrial outer membrane of Saccharomyces
           cerevisiae (YMR110C) and most closely related to the
           plant and animal ALDHs and fatty ALDHs family 3 members,
           and similar fungal sequences, are present in this CD.
          Length = 436

 Score =  180 bits (459), Expect = 6e-52
 Identities = 93/297 (31%), Positives = 153/297 (51%), Gaps = 8/297 (2%)

Query: 99  AYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEA 158
              R EP+GV   I PWN+P+L+    +  A+A G T+VLKP+E TP TA  +  L+ + 
Sbjct: 102 PRIRKEPLGVVLIIGPWNYPVLLALSPLVGAIAAGCTVVLKPSELTPHTAALLAELVPKY 161

Query: 159 GIPEGVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGK 218
            +      +V G      AL++    DK+ +TGS  VG+++ + + A +L   TLELGGK
Sbjct: 162 -LDPDAFQVVQGGVPETTALLEQ-KFDKIFYTGSGRVGRIIAE-AAAKHLTPVTLELGGK 218

Query: 219 SPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVG 278
           SP I+  +ADL+ A +   +G F N GQ C A     V  ++YDEFV    +       G
Sbjct: 219 SPVIVTKNADLELAAKRILWGKFGNAGQICVAPDYVLVDPSVYDEFVEELKKVLDEFYPG 278

Query: 279 DPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDD 338
                  +    ++    +++   +++ K    ++V GG       F + PT+ ++V  D
Sbjct: 279 GANASP-DYTRIVNPRHFNRLKSLLDTTK---GKVVIGGEMDEATRF-IPPTIVSDVSWD 333

Query: 339 MKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAG 395
             +  EE+FGPV  +I+   ++E I+  N+ D  LA  +FT D  + +++    R+G
Sbjct: 334 DSLMSEELFGPVLPIIKVDDLDEAIKVINSRDTPLALYIFTDDKSEIDHILTRTRSG 390


>gnl|CDD|181873 PRK09457, astD, succinylglutamic semialdehyde dehydrogenase;
           Reviewed.
          Length = 487

 Score =  180 bits (459), Expect = 1e-51
 Identities = 107/312 (34%), Positives = 149/312 (47%), Gaps = 29/312 (9%)

Query: 99  AYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEA 158
           A  RH P GV     P+NFP  +    I PAL  GNT+V KP+E TP  A     L  +A
Sbjct: 128 AVLRHRPHGVVAVFGPYNFPGHLPNGHIVPALLAGNTVVFKPSELTPWVAELTVKLWQQA 187

Query: 159 GIPEGVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGK 218
           G+P GV+N+V G  + G AL  HP +D + FTGS   G L+ +       K   LE+GG 
Sbjct: 188 GLPAGVLNLVQGGRETGKALAAHPDIDGLLFTGSANTGYLLHRQFAGQPEKILALEMGGN 247

Query: 219 SPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIY-DEFVARSGERAKRRTV 277
           +P +I   AD+DAAV       F + GQ C    R  V      D F+AR    AKR TV
Sbjct: 248 NPLVIDEVADIDAAVHLIIQSAFISAGQRCTCARRLLVPQGAQGDAFLARLVAVAKRLTV 307

Query: 278 GDPFDLNVE-QGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRA--------GDKGFYVQ 328
           G          G  I ++    ++       +Q AQL+A GG++           G  + 
Sbjct: 308 GRWDAEPQPFMGAVISEQAAQGLVA------AQ-AQLLALGGKSLLEMTQLQAGTGL-LT 359

Query: 329 P-----TVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLD 383
           P     T  A + D      EE FGP+ Q++R+   +E I  ANN+ +GL+A + + D +
Sbjct: 360 PGIIDVTGVAELPD------EEYFGPLLQVVRYDDFDEAIRLANNTRFGLSAGLLSDDRE 413

Query: 384 KTNYVTQGLRAG 395
             +     +RAG
Sbjct: 414 DYDQFLLEIRAG 425


>gnl|CDD|233325 TIGR01238, D1pyr5carbox3, delta-1-pyrroline-5-carboxylate
           dehydrogenase (PutA C-terminal domain).  This model
           represents one of several related branches of
           delta-1-pyrroline-5-carboxylate dehydrogenase. Members
           of this branch are the C-terminal domain of the PutA
           bifunctional proline dehydrogenase /
           delta-1-pyrroline-5-carboxylate dehydrogenase [Energy
           metabolism, Amino acids and amines].
          Length = 500

 Score =  177 bits (450), Expect = 3e-50
 Identities = 111/348 (31%), Positives = 162/348 (46%), Gaps = 22/348 (6%)

Query: 58  GKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQIIPWNF 117
           GK  + + + +V  ++   RYYA               D       E  GV   I PWNF
Sbjct: 125 GKTIHNAIA-EVREAVDFCRYYAKQVR-----------DVLGEFSVESRGVFVCISPWNF 172

Query: 118 PLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYG-DAGA 176
           PL +   +I+ ALA GNT++ KPAEQT L A     L+ EAG P G + ++PG G D GA
Sbjct: 173 PLAIFTGQISAALAAGNTVIAKPAEQTSLIAYRAVELMQEAGFPAGTIQLLPGRGADVGA 232

Query: 177 ALVQHPGVDKVAFTGSTEVGKLVQQ--GSGASNLKRTTLELGGKSPNIIFADADLDAAVE 234
           AL   P +  VAFTGSTEV +L+ Q              E GG++  I+ + A  +  V 
Sbjct: 233 ALTSDPRIAGVAFTGSTEVAQLINQTLAQREDAPVPLIAETGGQNAMIVDSTALPEQVVR 292

Query: 235 GAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKE 294
                 F + GQ C A     VQ+ + D  +       +   VG P  L  + GP ID E
Sbjct: 293 DVLRSAFDSAGQRCSALRVLCVQEDVADRVLTMIQGAMQELKVGVPHLLTTDVGPVIDAE 352

Query: 295 QMDKILEFIESGKSQG---AQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQ 351
               +L  IE         AQL     RA   G +V PT+F    DD+    EE+FGPV 
Sbjct: 353 AKQNLLAHIEHMSQTQKKIAQLTLDDSRACQHGTFVAPTLFE--LDDIAELSEEVFGPVL 410

Query: 352 QLIRFSS--IEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
            ++R+ +  +++++++ N + YGL   V ++      ++ +  R G  
Sbjct: 411 HVVRYKARELDQIVDQINQTGYGLTMGVHSRIETTYRWIEKHARVGNC 458


>gnl|CDD|237018 PRK11905, PRK11905, bifunctional proline
           dehydrogenase/pyrroline-5-carboxylate dehydrogenase;
           Reviewed.
          Length = 1208

 Score =  182 bits (465), Expect = 3e-50
 Identities = 111/333 (33%), Positives = 168/333 (50%), Gaps = 29/333 (8%)

Query: 76  LRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNT 135
           LRYYA  A                   H+P+G    I PWNFPL +   +IA AL  GNT
Sbjct: 658 LRYYAAQAR-----------RLLNGPGHKPLGPVVCISPWNFPLAIFTGQIAAALVAGNT 706

Query: 136 IVLKPAEQTPLTAL-AVGALIAEAGIPEGVVNIVPGYGDA-GAALVQHPGVDKVAFTGST 193
           ++ KPAEQTPL A  AV  L+ EAG+P+  + ++PG G   GAALV  P +  V FTGST
Sbjct: 707 VLAKPAEQTPLIAARAV-RLLHEAGVPKDALQLLPGDGRTVGAALVADPRIAGVMFTGST 765

Query: 194 EVGKLVQQGSGASNLKRTTL---ELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCA 250
           EV +L+Q+ + A           E GG++  I+ + A  +  V       F + GQ C A
Sbjct: 766 EVARLIQR-TLAKRSGPPVPLIAETGGQNAMIVDSSALPEQVVADVIASAFDSAGQRCSA 824

Query: 251 GSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQG 310
                +Q+ + D  +           +GDP+ L+ + GP ID E    I   IE+ ++ G
Sbjct: 825 LRVLCLQEDVADRVLTMLKGAMDELRIGDPWRLSTDVGPVIDAEAQANIEAHIEAMRAAG 884

Query: 311 AQL-VAGGGRAGDKGFYVQPTVF--ANVRDDMKIAREEIFGPVQQLIRF--SSIEEVIER 365
             +         +KG +V PT+    ++  D++    E+FGPV  ++RF    ++ VI+ 
Sbjct: 885 RLVHQLPLPAETEKGTFVAPTLIEIDSIS-DLE---REVFGPVLHVVRFKADELDRVIDD 940

Query: 366 ANNSDYGLAAAVFTKDLDKT-NYVTQGLRAGTL 397
            N + YGL   + ++ +D+T  +VT  +RAG +
Sbjct: 941 INATGYGLTFGLHSR-IDETIAHVTSRIRAGNI 972


>gnl|CDD|226683 COG4230, COG4230, Delta 1-pyrroline-5-carboxylate dehydrogenase
           [Energy production and conversion].
          Length = 769

 Score =  179 bits (457), Expect = 1e-49
 Identities = 118/332 (35%), Positives = 170/332 (51%), Gaps = 27/332 (8%)

Query: 76  LRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNT 135
           LRYYAG A            D F    H P+G    I PWNFPL +   +IA ALA GN+
Sbjct: 218 LRYYAGQAR-----------DTFGNLTHRPLGPVVCISPWNFPLAIFTGQIAAALAAGNS 266

Query: 136 IVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYGDA-GAALVQHPGVDKVAFTGSTE 194
           ++ KPAEQTPL A     L+ EAG+P GV+ ++PG G+  GAAL     V  V FTGSTE
Sbjct: 267 VLAKPAEQTPLIAAQAVRLLHEAGVPPGVLQLLPGRGETVGAALTADARVAGVMFTGSTE 326

Query: 195 VGKLVQQGSGASNLKRTTL--ELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGS 252
           V +L+Q+       +   L  E GG++  I+ + A  +  V       F + GQ C A  
Sbjct: 327 VARLIQRQLAKRQGRPIPLIAETGGQNAMIVDSSALAEQVVADVLASAFDSAGQRCSALR 386

Query: 253 RTFVQDTIYDEFVAR-SGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQG- 310
              +Q+ + D  +    G  A+ R VG+P  L  + GP ID E    I + I++ +S+G 
Sbjct: 387 VLCLQEDVADRILTMLKGAMAELR-VGNPDRLTTDVGPVIDAEAKANIEKHIQTMRSKGR 445

Query: 311 --AQLVAGGGRAGDKGFYVQPTVFA-NVRDDMKIAREEIFGPVQQLIRF--SSIEEVIER 365
              Q  A       KG +V PT+      D+++    E+FGPV  ++R+    ++EVI++
Sbjct: 446 LVHQAAAPNSLQ--KGTFVAPTLIELENLDELQ---REVFGPVLHVVRYKRDELDEVIDQ 500

Query: 366 ANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTL 397
            N + YGL   V T+  +   +VT+   AG L
Sbjct: 501 INATGYGLTLGVHTRIDETIAHVTERAHAGNL 532


>gnl|CDD|132284 TIGR03240, arg_catab_astD, succinylglutamic semialdehyde
           dehydrogenase.  Members of this protein family are
           succinylglutamic semialdehyde dehydrogenase (EC
           1.2.1.71), the fourth enzyme in the arginine
           succinyltransferase (AST) pathway for arginine
           catabolism [Energy metabolism, Amino acids and amines].
          Length = 484

 Score =  173 bits (441), Expect = 4e-49
 Identities = 106/309 (34%), Positives = 150/309 (48%), Gaps = 15/309 (4%)

Query: 95  GDFFAYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGAL 154
            D  A  RH P GV     P+NFP  +    I PAL  GNT+V KP+E TP  A     L
Sbjct: 122 PDGRAVLRHRPHGVVAVFGPYNFPGHLPNGHIVPALLAGNTVVFKPSELTPWVAEETVKL 181

Query: 155 IAEAGIPEGVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLE 214
             +AG+P GV+N+V G  + G AL  HP +D + FTGS+  G L+ +       K   LE
Sbjct: 182 WEKAGLPAGVLNLVQGARETGVALAAHPDIDGLLFTGSSNTGHLLHRQFAGRPEKILALE 241

Query: 215 LGGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIY-DEFVARSGERAK 273
           +GG +P I+   AD+DAAV       F + GQ C    R  V D    D F+AR  E A+
Sbjct: 242 MGGNNPLIVDEVADIDAAVHLIIQSAFISAGQRCTCARRLLVPDGAQGDAFLARLVEVAE 301

Query: 274 RRTVGD------PFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQ-LVAGGGRAGDKGFY 326
           R TVG       PF      G  I  +   ++L       + G + L+A           
Sbjct: 302 RLTVGAWDAEPQPF-----MGAVISLQAAQRLLAAQAKLLALGGKSLLAMRQLDPGAAL- 355

Query: 327 VQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTN 386
           + P +  +V    ++  EE FGP+ Q+IR+   +E I  ANN+ +GL+A + + D +  +
Sbjct: 356 LTPGII-DVTGVAELPDEEHFGPLLQVIRYDDFDEAIAIANNTRFGLSAGLLSDDRELYD 414

Query: 387 YVTQGLRAG 395
                +RAG
Sbjct: 415 RFLLEIRAG 423


>gnl|CDD|237017 PRK11904, PRK11904, bifunctional proline
           dehydrogenase/pyrroline-5-carboxylate dehydrogenase;
           Reviewed.
          Length = 1038

 Score =  177 bits (452), Expect = 2e-48
 Identities = 109/323 (33%), Positives = 152/323 (47%), Gaps = 44/323 (13%)

Query: 76  LRYYAGFADKIHGKTIPINGDFFAYTRHEPVG-------------VCGQIIPWNFPLLML 122
            RYYA  A ++ G    + G         P G             VC  I PWNFPL + 
Sbjct: 653 CRYYAAQARRLFGAPEKLPG---------PTGESNELRLHGRGVFVC--ISPWNFPLAIF 701

Query: 123 AWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYGDA-GAALVQH 181
             ++A ALA GNT++ KPAEQTPL A     L+ EAGIP+ V+ ++PG G   GAAL   
Sbjct: 702 LGQVAAALAAGNTVIAKPAEQTPLIAAEAVKLLHEAGIPKDVLQLLPGDGATVGAALTAD 761

Query: 182 PGVDKVAFTGSTEVGKLVQQGSGASNLKRTTL--ELGGKSPNIIFADAD------LDAAV 233
           P +  VAFTGSTE  +++ +   A +     L  E GG+  N +  D+       +D  V
Sbjct: 762 PRIAGVAFTGSTETARIINRTLAARDGPIVPLIAETGGQ--NAMIVDSTALPEQVVDDVV 819

Query: 234 EGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDK 293
             A    F + GQ C A    FVQ+ I D  +           VGDP  L+ + GP ID 
Sbjct: 820 TSA----FRSAGQRCSALRVLFVQEDIADRVIEMLKGAMAELKVGDPRLLSTDVGPVIDA 875

Query: 294 EQMDKILEFIESGKSQGAQLVAGG-GRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQQ 352
           E    +   IE  K +   L         + G +V PT F    D +     E+FGP+  
Sbjct: 876 EAKANLDAHIERMKREARLLAQLPLPAGTENGHFVAPTAFE--IDSISQLEREVFGPILH 933

Query: 353 LIRFSS--IEEVIERANNSDYGL 373
           +IR+ +  +++VI+  N + YGL
Sbjct: 934 VIRYKASDLDKVIDAINATGYGL 956


>gnl|CDD|240392 PTZ00381, PTZ00381, aldehyde dehydrogenase family protein;
           Provisional.
          Length = 493

 Score =  167 bits (425), Expect = 1e-46
 Identities = 97/283 (34%), Positives = 143/283 (50%), Gaps = 10/283 (3%)

Query: 100 YTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAG 159
           Y   EP+GV   I  WN+PL +    +A A+A GNT+VLKP+E +P T+  +  L+ +  
Sbjct: 104 YIIPEPLGVVLVIGAWNYPLNLTLIPLAGAIAAGNTVVLKPSELSPHTSKLMAKLLTKY- 162

Query: 160 IPEGVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKS 219
           +    V ++ G  +    L++ P  D + FTGS  VGKLV Q + A NL   TLELGGKS
Sbjct: 163 LDPSYVRVIEGGVEVTTELLKEP-FDHIFFTGSPRVGKLVMQAA-AENLTPCTLELGGKS 220

Query: 220 PNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGD 279
           P I+    +L  A     +G F N GQ C A     V  +I D+F+    E  K     D
Sbjct: 221 PVIVDKSCNLKVAARRIAWGKFLNAGQTCVAPDYVLVHRSIKDKFIEALKEAIKEFFGED 280

Query: 280 PFDLNVEQGPQI-DKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDD 338
           P     E   +I ++    ++ E I+    +    V  GG    +  YV PT+  N   D
Sbjct: 281 P--KKSEDYSRIVNEFHTKRLAELIKDHGGK----VVYGGEVDIENKYVAPTIIVNPDLD 334

Query: 339 MKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKD 381
             + +EEIFGP+  ++ + +I+EV+E  N+    LA   F +D
Sbjct: 335 SPLMQEEIFGPILPILTYENIDEVLEFINSRPKPLALYYFGED 377


>gnl|CDD|177949 PLN02315, PLN02315, aldehyde dehydrogenase family 7 member.
          Length = 508

 Score =  166 bits (422), Expect = 3e-46
 Identities = 94/280 (33%), Positives = 146/280 (52%), Gaps = 6/280 (2%)

Query: 105 PVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEA----GI 160
           P+G+ G I  +NFP  +L W    AL  GN +V K A  TPL  +A+  L+AE      +
Sbjct: 154 PLGIVGVITAFNFPCAVLGWNACIALVCGNCVVWKGAPTTPLITIAMTKLVAEVLEKNNL 213

Query: 161 PEGVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSP 220
           P  +     G  + G A+ +   +  V+FTGS++VG +VQQ   A   K   LEL G + 
Sbjct: 214 PGAIFTSFCGGAEIGEAIAKDTRIPLVSFTGSSKVGLMVQQTVNARFGK-CLLELSGNNA 272

Query: 221 NIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDP 280
            I+  DAD+  AV    F      GQ C    R  + ++IYD+ + +     K+  +GDP
Sbjct: 273 IIVMDDADIQLAVRSVLFAAVGTAGQRCTTCRRLLLHESIYDDVLEQLLTVYKQVKIGDP 332

Query: 281 FDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMK 340
            +     GP    E      + IE  KSQG +++ GG     +G +VQPT+   +  D  
Sbjct: 333 LEKGTLLGPLHTPESKKNFEKGIEIIKSQGGKILTGGSAIESEGNFVQPTI-VEISPDAD 391

Query: 341 IAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTK 380
           + +EE+FGPV  +++F ++EE IE  N+   GL++++FT+
Sbjct: 392 VVKEELFGPVLYVMKFKTLEEAIEINNSVPQGLSSSIFTR 431


>gnl|CDD|236989 PRK11809, putA, trifunctional transcriptional regulator/proline
            dehydrogenase/pyrroline-5-carboxylate dehydrogenase;
            Reviewed.
          Length = 1318

 Score =  168 bits (428), Expect = 2e-45
 Identities = 118/338 (34%), Positives = 170/338 (50%), Gaps = 34/338 (10%)

Query: 76   LRYYAGFADKIHGKTIPINGDFFAYTRHEPVG--VCGQIIPWNFPLLMLAWKIAPALATG 133
            LRYYAG              D F    H P+G  VC  I PWNFPL +   ++A ALA G
Sbjct: 750  LRYYAGQVR-----------DDFDNDTHRPLGPVVC--ISPWNFPLAIFTGQVAAALAAG 796

Query: 134  NTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYGDA-GAALVQHPGVDKVAFTGS 192
            N+++ KPAEQTPL A     ++ EAG+P GVV ++PG G+  GAALV    V  V FTGS
Sbjct: 797  NSVLAKPAEQTPLIAAQAVRILLEAGVPAGVVQLLPGRGETVGAALVADARVRGVMFTGS 856

Query: 193  TEVGKLVQQGSGASNLK---RTT---LELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQ 246
            TEV +L+Q+ + A  L    R      E GG++  I+ + A  +  V       F + GQ
Sbjct: 857  TEVARLLQR-NLAGRLDPQGRPIPLIAETGGQNAMIVDSSALTEQVVADVLASAFDSAGQ 915

Query: 247  CCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESG 306
             C A     +QD + D  +           +G+P  L+ + GP ID E    I   I++ 
Sbjct: 916  RCSALRVLCLQDDVADRTLKMLRGAMAECRMGNPDRLSTDIGPVIDAEAKANIERHIQAM 975

Query: 307  KSQG---AQLVAGGGRAGDKGFYVQPTVFA-NVRDDMKIAREEIFGPVQQLIRF--SSIE 360
            +++G    Q           G +V PT+   +  D++K    E+FGPV  ++R+  + ++
Sbjct: 976  RAKGRPVFQAARENSEDWQSGTFVPPTLIELDSFDELK---REVFGPVLHVVRYNRNQLD 1032

Query: 361  EVIERANNSDYGLAAAVFTKDLDKT-NYVTQGLRAGTL 397
            E+IE+ N S YGL   V T+ +D+T   VT     G L
Sbjct: 1033 ELIEQINASGYGLTLGVHTR-IDETIAQVTGSAHVGNL 1069


>gnl|CDD|166060 PLN02419, PLN02419, methylmalonate-semialdehyde dehydrogenase
           [acylating].
          Length = 604

 Score =  162 bits (411), Expect = 4e-44
 Identities = 110/334 (32%), Positives = 172/334 (51%), Gaps = 14/334 (4%)

Query: 54  TLDNGKPYNMSYSVDVEYSLRTLRYYAGFADKIHGKTIP-INGDFFAYTRHEPVGVCGQI 112
           T + GK    S+  D+   L  + +  G A    G+ +P ++     Y+  EP+GVC  I
Sbjct: 198 TTEQGKTLKDSHG-DIFRGLEVVEHACGMATLQMGEYLPNVSNGVDTYSIREPLGVCAGI 256

Query: 113 IPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYG 172
            P+NFP ++  W    A+  GNT +LKP+E+ P  ++ +  L  EAG+P+GV+NIV G  
Sbjct: 257 CPFNFPAMIPLWMFPVAVTCGNTFILKPSEKDPGASVILAELAMEAGLPDGVLNIVHGTN 316

Query: 173 DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAA 232
           D   A+     +  V+F GS   G  +   + A   KR    +G K+  ++  DA++DA 
Sbjct: 317 DTVNAICDDEDIRAVSFVGSNTAGMHIYARAAAKG-KRIQSNMGAKNHGLVLPDANIDAT 375

Query: 233 VEGAHFGLFFNMGQCCCAGSR-TFVQD--TIYDEFVARSGERAKRRTVGDPFDLNVEQGP 289
           +       F   GQ C A S   FV D  +  D+ V R+  +A + T G   D ++  GP
Sbjct: 376 LNALLAAGFGAAGQRCMALSTVVFVGDAKSWEDKLVERA--KALKVTCGSEPDADL--GP 431

Query: 290 QIDKEQMDKILEFIESGKSQGAQLVAGGGR----AGDKGFYVQPTVFANVRDDMKIAREE 345
            I K+  ++I   I+SG   GA+L+  G        +KG ++ PT+ + V  DM+  +EE
Sbjct: 432 VISKQAKERICRLIQSGVDDGAKLLLDGRDIVVPGYEKGNFIGPTILSGVTPDMECYKEE 491

Query: 346 IFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFT 379
           IFGPV   ++ +S +E I   N + YG  AA+FT
Sbjct: 492 IFGPVLVCMQANSFDEAISIINKNKYGNGAAIFT 525


>gnl|CDD|143450 cd07132, ALDH_F3AB, Aldehyde dehydrogenase family 3 members A1, A2,
           and B1 and related proteins.  NAD(P)+-dependent,
           aldehyde dehydrogenase, family 3 members A1 and B1
           (ALDH3A1, ALDH3B1,  EC=1.2.1.5) and fatty aldehyde
           dehydrogenase, family 3 member A2 (ALDH3A2, EC=1.2.1.3),
           and similar sequences are included in this CD. Human
           ALDH3A1 is a homodimer with a critical role in cellular
           defense against oxidative stress; it catalyzes the
           oxidation of various cellular membrane lipid-derived
           aldehydes. Corneal crystalline ALDH3A1 protects the
           cornea and underlying lens against UV-induced oxidative
           stress. Human ALDH3A2, a microsomal homodimer, catalyzes
           the oxidation of long-chain aliphatic aldehydes to fatty
           acids. Human ALDH3B1 is highly expressed in the kidney
           and liver and catalyzes the oxidation of various medium-
           and long-chain saturated and unsaturated aliphatic
           aldehydes.
          Length = 443

 Score =  156 bits (397), Expect = 4e-43
 Identities = 95/312 (30%), Positives = 151/312 (48%), Gaps = 40/312 (12%)

Query: 99  AYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEA 158
            Y   EP+GV   I  WN+PL +    +  A+A GN +V+KP+E +P TA     L+AE 
Sbjct: 94  VYIYKEPLGVVLIIGAWNYPLQLTLVPLVGAIAAGNCVVIKPSEVSPATA----KLLAE- 148

Query: 159 GIPEGVVNIVPGYGDAGAALVQHPGV-----------DKVAFTGSTEVGKLVQQGSGASN 207
                   ++P Y D     V   GV           D + +TGST VGK+V Q + A +
Sbjct: 149 --------LIPKYLDKECYPVVLGGVEETTELLKQRFDYIFYTGSTSVGKIVMQAA-AKH 199

Query: 208 LKRTTLELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVAR 267
           L   TLELGGKSP  +    D+D A     +G F N GQ C A         + ++FV  
Sbjct: 200 LTPVTLELGGKSPCYVDKSCDIDVAARRIAWGKFINAGQTCIAPDYVLCTPEVQEKFVE- 258

Query: 268 SGERAKRRTVGDPFDLNVEQGPQ----IDKEQMDKILEFIESGKSQGAQLVAGGGRAGDK 323
               A ++T+ + +  + ++ P     I+     ++ + +  GK      VA GG+  +K
Sbjct: 259 ----ALKKTLKEFYGEDPKESPDYGRIINDRHFQRLKKLLSGGK------VAIGGQTDEK 308

Query: 324 GFYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLD 383
             Y+ PTV  +V+    + +EEIFGP+  ++  ++++E IE  N+ +  LA  VF+ +  
Sbjct: 309 ERYIAPTVLTDVKPSDPVMQEEIFGPILPIVTVNNLDEAIEFINSREKPLALYVFSNNKK 368

Query: 384 KTNYVTQGLRAG 395
             N +     +G
Sbjct: 369 VINKILSNTSSG 380


>gnl|CDD|233324 TIGR01236, D1pyr5carbox1, delta-1-pyrroline-5-carboxylate
           dehydrogenase, group 1.  This model represents one of
           two related branches of delta-1-pyrroline-5-carboxylate
           dehydrogenase. The two branches are not as closely
           related to each other as some aldehyde dehydrogenases
           are to this branch, and separate models are built for
           this reason. The enzyme is the second of two in the
           degradation of proline to glutamate [Energy metabolism,
           Amino acids and amines].
          Length = 532

 Score =  158 bits (401), Expect = 5e-43
 Identities = 106/332 (31%), Positives = 165/332 (49%), Gaps = 19/332 (5%)

Query: 77  RYYAGFADKIHGKTIPINGD-FFAYTRHEPV-GVCGQIIPWNFPLLMLAWKIAPALATGN 134
           R+ A +A +++ +  PI+    +  T + P+ G    I P+NF  +      APAL  GN
Sbjct: 140 RFNAKYARELYEQQ-PISAPGEWNRTEYRPLEGFVYAISPFNFTAIAGNLAGAPAL-MGN 197

Query: 135 TIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYGDA-GAALVQHPGVDKVAFTGST 193
           T++ KP++   L+   V  ++ EAG+P GV+N VPG G      ++  P +  + FTGST
Sbjct: 198 TVIWKPSDTAMLSNYLVMRILEEAGLPPGVINFVPGDGPLVSDTVLADPDLAGIHFTGST 257

Query: 194 ----EVGKLVQQGSGA-SNLKRTTLELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQCC 248
                + K V Q      N  R   E GGK  +++   AD++  V G   G F   GQ C
Sbjct: 258 PTFKHLWKQVAQNLDRYHNFPRIVGETGGKDFHVVHPSADVEHVVSGTIRGAFEYQGQKC 317

Query: 249 CAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKS 308
            A SR +V  +++  F  R      R  VGDP D     G  ID++   KI+++IE  KS
Sbjct: 318 SACSRLYVPHSLWPRFKGRLLAELSRVKVGDPDDFGTFMGAVIDEKSFAKIVKYIEHAKS 377

Query: 309 QGA--QLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQQLI-----RFSSIEE 361
             +   ++AGG      G++V+PTV  +      + +EEIFGPV  +      ++  I +
Sbjct: 378 DPSALTILAGGKYDDSVGYFVEPTVVESKDPQEPLMKEEIFGPVLTVYVYPDDKYKEILD 437

Query: 362 VIERANNSDYGLAAAVFTKDLDKTNYVTQGLR 393
           +++  + + YGL  AVF KD D      + LR
Sbjct: 438 LVD--STTSYGLTGAVFAKDRDAILEADKVLR 467


>gnl|CDD|143441 cd07123, ALDH_F4-17_P5CDH, Delta(1)-pyrroline-5-carboxylate
           dehydrogenase, ALDH families 4 and 17.
           Delta(1)-pyrroline-5-carboxylate dehydrogenase
           (EC=1.5.1.12 ), families 4 and 17: a proline catabolic
           enzyme of the aldehyde dehydrogenase (ALDH) protein
           superfamily.  Delta(1)-pyrroline-5-carboxylate
           dehydrogenase (P5CDH), also known as ALDH4A1 in humans, 
           is a mitochondrial  homodimer involved in proline
           degradation and catalyzes the NAD + -dependent
           conversion of P5C to glutamate. This is a necessary step
           in the pathway interconnecting the urea and
           tricarboxylic acid cycles. The preferred substrate is
           glutamic gamma-semialdehyde, other substrates include
           succinic, glutaric and adipic semialdehydes. Also
           included in this CD is the Aldh17 Drosophila
           melanogaster (Q9VUC0) P5CDH and similar sequences.
          Length = 522

 Score =  157 bits (399), Expect = 1e-42
 Identities = 99/296 (33%), Positives = 149/296 (50%), Gaps = 13/296 (4%)

Query: 112 IIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGY 171
           + P+NF  +      APAL  GN ++ KP++   L+   V  ++ EAG+P GV+N VPG 
Sbjct: 177 VSPFNFTAIGGNLAGAPAL-MGNVVLWKPSDTAVLSNYLVYKILEEAGLPPGVINFVPGD 235

Query: 172 GDA-GAALVQHPGVDKVAFTGSTEVGKLVQQ--GSGASNLK---RTTLELGGKSPNIIFA 225
           G   G  ++  P +  + FTGST   K + +  G      +   R   E GGK+ +++  
Sbjct: 236 GPVVGDTVLASPHLAGLHFTGSTPTFKSLWKQIGENLDRYRTYPRIVGETGGKNFHLVHP 295

Query: 226 DADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNV 285
            AD+D+ V     G F   GQ C A SR +V ++++ E   R  E  K   +GDP D + 
Sbjct: 296 SADVDSLVTATVRGAFEYQGQKCSAASRAYVPESLWPEVKERLLEELKEIKMGDPDDFSN 355

Query: 286 EQGPQIDKEQMDKILEFIESGKSQ-GAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIARE 344
             G  ID++  D+I  +I+  KS   A+++AGG      G++V+PTV        K+  E
Sbjct: 356 FMGAVIDEKAFDRIKGYIDHAKSDPEAEIIAGGKCDDSVGYFVEPTVIETTDPKHKLMTE 415

Query: 345 EIFGPVQQLIRF--SSIEEVIERANN-SDYGLAAAVFTKDLDKTNYVTQGLR--AG 395
           EIFGPV  +  +  S  EE +E  +  S Y L  A+F +D       T  LR  AG
Sbjct: 416 EIFGPVLTVYVYPDSDFEETLELVDTTSPYALTGAIFAQDRKAIREATDALRNAAG 471


>gnl|CDD|143455 cd07137, ALDH_F3FHI, Plant aldehyde dehydrogenase family 3 members
           F1, H1, and I1 and related proteins.  Aldehyde
           dehydrogenase family members 3F1, 3H1, and 3I1 (ALDH3F1,
           ALDH3H1, and ALDH3I1), and similar plant sequences, are
           in this CD.  In Arabidopsis thaliana, stress-regulated
           expression of ALDH3I1  was observed in  leaves and
           osmotic stress expression of  ALDH3H1 was observed in
           root tissue, whereas, ALDH3F1 expression was not stress
           responsive. Functional analysis of ALDH3I1 suggest it
           may be involved in a detoxification pathway in plants
           that limits aldehyde accumulation and oxidative stress.
          Length = 432

 Score =  129 bits (326), Expect = 3e-33
 Identities = 94/309 (30%), Positives = 143/309 (46%), Gaps = 15/309 (4%)

Query: 90  TIPINGDFFAYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTAL 149
           T P   +  +    EP+GV   I  WNFP L+    +  A+A GN +VLKP+E  P T+ 
Sbjct: 90  TFPAKAEIVS----EPLGVVLVISAWNFPFLLSLEPVIGAIAAGNAVVLKPSELAPATSA 145

Query: 150 AVGALIAEAGIPEGVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLK 209
            +  LI E    +  + ++ G      AL++    DK+ FTGS  VG+++   + A +L 
Sbjct: 146 LLAKLIPEYLDTK-AIKVIEGGVPETTALLEQKW-DKIFFTGSPRVGRIIM-AAAAKHLT 202

Query: 210 RTTLELGGKSPNIIFADADLDAAVE---GAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVA 266
             TLELGGK P I+ +  DL  AV    G  +G   N GQ C A     V+++     + 
Sbjct: 203 PVTLELGGKCPVIVDSTVDLKVAVRRIAGGKWGC--NNGQACIAPDYVLVEESFAPTLID 260

Query: 267 RSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFY 326
                 ++    +P     +   +I      + L  +    S   ++V GG R      Y
Sbjct: 261 ALKNTLEKFFGENP--KESKDLSRIVNSHHFQRLSRLLDDPSVADKIVHGGERDEKN-LY 317

Query: 327 VQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTN 386
           ++PT+  +   D  I  EEIFGP+  +I    IEE IE  N+    LAA VFTK+ +   
Sbjct: 318 IEPTILLDPPLDSSIMTEEIFGPLLPIITVKKIEESIEIINSRPKPLAAYVFTKNKELKR 377

Query: 387 YVTQGLRAG 395
            +     +G
Sbjct: 378 RIVAETSSG 386


>gnl|CDD|143446 cd07128, ALDH_MaoC-N, N-terminal domain of the monoamine oxidase C
           dehydratase.  The N-terminal domain of the MaoC
           dehydratase, a monoamine oxidase regulatory protein.
           Orthologs of MaoC include PaaZ (Escherichia coli) and
           PaaN (Pseudomonas putida), which are putative
           ring-opening enzymes of the aerobic phenylacetic acid
           (PA) catabolic pathway. The C-terminal domain of MaoC
           has sequence similarity to enoyl-CoA hydratase. Also
           included in this CD is a novel Burkholderia xenovorans
           LB400 ALDH of the aerobic benzoate oxidation (box)
           pathway. This pathway involves first the synthesis of a
           CoA thio-esterified aromatic acid, with subsequent
           dihydroxylation and cleavage steps, yielding the CoA
           thio-esterified aliphatic aldehyde,
           3,4-dehydroadipyl-CoA semialdehyde, which is further
           converted into its corresponding CoA acid by the
           Burkholderia LB400 ALDH.
          Length = 513

 Score =  118 bits (298), Expect = 6e-29
 Identities = 104/373 (27%), Positives = 148/373 (39%), Gaps = 70/373 (18%)

Query: 67  VDVEYSLRTLRYYAGFADK--------IHGKTIPI--NGDFFAYTRHEPV---GVCGQII 113
           +D++  + TL  YA    +        + G   P+  +G F    +H      GV   I 
Sbjct: 95  IDIDGGIGTLFAYASLGRRELPNAHFLVEGDVEPLSKDGTFVG--QHILTPRRGVAVHIN 152

Query: 114 PWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGI-PEGVVNIVPGYG 172
            +NFP+  +  K APAL  G  +++KPA  T     AV   I E+G+ PEG + ++   G
Sbjct: 153 AFNFPVWGMLEKFAPALLAGVPVIVKPATATAYLTEAVVKDIVESGLLPEGALQLI--CG 210

Query: 173 DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNII------FAD 226
             G  L      D VAFTGS               L+          PNI+       A+
Sbjct: 211 SVGDLLDHLGEQDVVAFTGSAATAA---------KLRA--------HPNIVARSIRFNAE 253

Query: 227 AD-LDAAVEGA-------HFGLFF-----NM----GQCCCAGSRTFVQDTIYDEFVARSG 269
           AD L+AA+ G         F LF       M    GQ C A  R FV +   D  +    
Sbjct: 254 ADSLNAAILGPDATPGTPEFDLFVKEVAREMTVKAGQKCTAIRRAFVPEARVDAVIEALK 313

Query: 270 ERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGG-------GRAGD 322
            R  +  VGDP    V  GP + +EQ + +   + +     A++V GG       G   +
Sbjct: 314 ARLAKVVVGDPRLEGVRMGPLVSREQREDVRAAVAT-LLAEAEVVFGGPDRFEVVGADAE 372

Query: 323 KGFYVQPTVFANVRDDMKIAR---EEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFT 379
           KG +  PT+     D          E FGPV  L+ + S+ E IE A      L A+V T
Sbjct: 373 KGAFFPPTLLL-CDDPDAATAVHDVEAFGPVATLMPYDSLAEAIELAARGRGSLVASVVT 431

Query: 380 KDLDKTNYVTQGL 392
            D      +  G 
Sbjct: 432 NDPAFARELVLGA 444


>gnl|CDD|165847 PLN02203, PLN02203, aldehyde dehydrogenase.
          Length = 484

 Score =  109 bits (273), Expect = 7e-26
 Identities = 83/288 (28%), Positives = 131/288 (45%), Gaps = 22/288 (7%)

Query: 104 EPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPE- 162
           EP+GV      WNFP+ +    +  A+A GN +VLKP+E  P T+    A +A   IP+ 
Sbjct: 107 EPLGVVLIFSSWNFPIGLSLEPLIGAIAAGNAVVLKPSELAPATS----AFLAAN-IPKY 161

Query: 163 ---GVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKS 219
                V ++ G    G  L+QH   DK+ FTGS  VG+++   + A +L    LELGGK 
Sbjct: 162 LDSKAVKVIEGGPAVGEQLLQHKW-DKIFFTGSPRVGRII-MTAAAKHLTPVALELGGKC 219

Query: 220 PNI---IFADADLDAAVE---GAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAK 273
           P I   + +  D   AV    G  +G     GQ C A     V++      +       K
Sbjct: 220 PCIVDSLSSSRDTKVAVNRIVGGKWGSC--AGQACIAIDYVLVEERFAPILIELLKSTIK 277

Query: 274 RRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFA 333
           +    +P +        ++K+   ++   ++  +   +  +  GG   +K  +++PT+  
Sbjct: 278 KFFGENPRESK-SMARILNKKHFQRLSNLLKDPRVAAS--IVHGGSIDEKKLFIEPTILL 334

Query: 334 NVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKD 381
           N   D  I  EEIFGP+  +I    IE+ I   N+    LA   FT +
Sbjct: 335 NPPLDSDIMTEEIFGPLLPIITVKKIEDSIAFINSKPKPLAIYAFTNN 382


>gnl|CDD|236931 PRK11563, PRK11563, bifunctional aldehyde dehydrogenase/enoyl-CoA
           hydratase; Provisional.
          Length = 675

 Score =  108 bits (273), Expect = 2e-25
 Identities = 105/375 (28%), Positives = 147/375 (39%), Gaps = 74/375 (19%)

Query: 67  VDVEYSLRTLRYYAGFA------DKIH--GKTIPI--NGDFFA---YTRHEPVGVCGQII 113
           +D+E  + TL  YA         D +   G+  P+   G F      T  E V V   I 
Sbjct: 99  IDIEGGIGTLFTYASKGRRELPNDTVLVEGEVEPLSKGGTFAGRHILTPLEGVAV--HIN 156

Query: 114 PWNFPLLMLAWKIAPALATGNTIVLKPAEQT-PLTALAVGALIAEAGI-PEGVVNIVPGY 171
            +NFP+  +  K+APA   G   ++KPA  T  LT   V  LI E+G+ PEG + ++   
Sbjct: 157 AFNFPVWGMLEKLAPAFLAGVPAIVKPATATAYLTEAVV-RLIVESGLLPEGALQLI--C 213

Query: 172 GDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNII-----F-A 225
           G AG  L    G D V FTGS    +          L+          PN++     F A
Sbjct: 214 GSAGDLLDHLDGQDVVTFTGSAATAQK---------LRAH--------PNVVANSVPFTA 256

Query: 226 DAD--------LDAAVEGAHFGLFFNM---------GQCCCAGSRTFVQDTIYDEFVARS 268
           +AD         DA      F LF            GQ C A  R  V   + D  +   
Sbjct: 257 EADSLNAAILGPDATPGTPEFDLFVKEVVREMTVKAGQKCTAIRRAIVPRALVDAVIEAL 316

Query: 269 GERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGG-------GRAG 321
             R  +  VGDP    V  G     EQ + + E + +     A++V GG       G   
Sbjct: 317 RARLAKVVVGDPRLEGVRMGALASLEQREDVREKVAA-LLAAAEIVFGGPDSFEVVGADA 375

Query: 322 DKGFYVQPTVFANVRDDMKIARE----EIFGPVQQLIRFSSIEEVIERANNSDYGLAAAV 377
           +KG +  PT+     DD   A      E FGPV  L+ +  ++E IE A      L A++
Sbjct: 376 EKGAFFPPTLL--YCDDPLEAPAVHDVEAFGPVSTLMPYDDLDEAIELAARGKGSLVASL 433

Query: 378 FTKDLDKTNYVTQGL 392
            T D +    +  G 
Sbjct: 434 VTADPEVARELVLGA 448


>gnl|CDD|237016 PRK11903, PRK11903, aldehyde dehydrogenase; Provisional.
          Length = 521

 Score =  107 bits (268), Expect = 4e-25
 Identities = 89/345 (25%), Positives = 144/345 (41%), Gaps = 38/345 (11%)

Query: 67  VDVEYSLRTLRYYAGFADKIHGKTIPINGDFFAYTR-------HEPV---GVCGQIIPWN 116
           VD++  + TL YYA     +    +  +G+     +       H  V   GV   I  +N
Sbjct: 100 VDIDGGIFTLGYYAKLGAALGDARLLRDGEAVQLGKDPAFQGQHVLVPTRGVALFINAFN 159

Query: 117 FPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGI-PEGVVNIVPGYGDAG 175
           FP   L  K APAL  G  +++KPA  T      +   +  AGI P G +++V G   + 
Sbjct: 160 FPAWGLWEKAAPALLAGVPVIVKPATATAWLTQRMVKDVVAAGILPAGALSVVCG---SS 216

Query: 176 AALVQH-PGVDKVAFTGSTEVGKLVQ-QGSGASNLKRTTLELGGKSPNIIFADADLDAAV 233
           A L+ H    D V+FTGS E   +++   +      R  +E    +  ++  DA   +  
Sbjct: 217 AGLLDHLQPFDVVSFTGSAETAAVLRSHPAVVQRSVRVNVEADSLNSALLGPDAAPGSEA 276

Query: 234 EGAHFGLFF---------NMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLN 284
               F LF            GQ C A  R FV + +YD        R  + TVG+P +  
Sbjct: 277 ----FDLFVKEVVREMTVKSGQKCTAIRRIFVPEALYDAVAEALAARLAKTTVGNPRNDG 332

Query: 285 VEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAG------DKGFYVQPTVF-ANVRD 337
           V  GP + + Q+  +   + + ++Q A+++  GG              V PT+  A+  D
Sbjct: 333 VRMGPLVSRAQLAAVRAGLAALRAQ-AEVLFDGGGFALVDADPAVAACVGPTLLGASDPD 391

Query: 338 DMKIARE-EIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKD 381
                 + E+FGPV  L+ +      +  A      L A+V++ D
Sbjct: 392 AATAVHDVEVFGPVATLLPYRDAAHALALARRGQGSLVASVYSDD 436


>gnl|CDD|143403 cd07084, ALDH_KGSADH-like, ALDH subfamily: NAD(P)+-dependent
           alpha-ketoglutaric semialdehyde dehydrogenases and plant
           delta(1)-pyrroline-5-carboxylate dehydrogenase, ALDH
           family 12-like.  ALDH subfamily which includes the
           NAD(P)+-dependent, alpha-ketoglutaric semialdehyde
           dehydrogenases (KGSADH, EC 1.2.1.26); plant
           delta(1)-pyrroline-5-carboxylate dehydrogenase (P5CDH,
           EC=1.5.1.12 ), ALDH family 12; the N-terminal domain of
           the MaoC (monoamine oxidase C) dehydratase regulatory
           protein; and orthologs of MaoC, PaaZ and PaaN, which are
           putative ring-opening enzymes of the aerobic
           phenylacetic acid catabolic pathway.
          Length = 442

 Score =  104 bits (260), Expect = 3e-24
 Identities = 84/372 (22%), Positives = 141/372 (37%), Gaps = 39/372 (10%)

Query: 49  QSSLETLDNGKPYNMSYSVDVEYSLRTLRYYA--GFADKIH---GKTIPINGDFFAYTRH 103
            ++   L  GK +  +   ++      LR  A   ++ +I    G  +       ++   
Sbjct: 41  IAAGAVLVTGKGWMFA--ENICGDQVQLRARAFVIYSYRIPHEPGNHLGQGLKQQSHGYR 98

Query: 104 EPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGI-PE 162
            P G    I  +NFPL +   ++A ALA GN +++KP     +    +  L+  AG+ P 
Sbjct: 99  WPYGPVLVIGAFNFPLWIPLLQLAGALAMGNPVIVKPHTAVSIVMQIMVRLLHYAGLLPP 158

Query: 163 GVVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLK--RTTLELGGKSP 220
             V ++ G G    AL+ HP    V FTGS+ V + +     A + K  R  LEL G + 
Sbjct: 159 EDVTLINGDGKTMQALLLHPNPKMVLFTGSSRVAEKL-----ALDAKQARIYLELAGFNW 213

Query: 221 NIIFADAD-LDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDE-FVARSGERAKRRTVG 278
            ++  DA  +D         +    GQ C A S  FV +       V +      RR + 
Sbjct: 214 KVLGPDAQAVDYVAWQCVQDMTACSGQKCTAQSMLFVPENWSKTPLVEKLKALLARRKLE 273

Query: 279 DPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRA-------GDKGFYVQPTV 331
           D           +   Q    L  I   ++    ++   G+           G  V   +
Sbjct: 274 DLL---------LGPVQTFTTLAMIAHMENLLGSVLLFSGKELKNHSIPSIYGACVASAL 324

Query: 332 FANVRDDMKIAR---EEIFGPVQQLIRFSSIEE--VIERANNSDYGLAAAVFTKDLDKTN 386
           F  + + +K      EEIFGP   ++ +   +   V+E        L AA+++ D     
Sbjct: 325 FVPIDEILKTYELVTEEIFGPFAIVVEYKKDQLALVLELLERMHGSLTAAIYSNDPIFLQ 384

Query: 387 YVTQGL-RAGTL 397
            +   L  AG  
Sbjct: 385 ELIGNLWVAGRT 396


>gnl|CDD|131331 TIGR02278, PaaN-DH, phenylacetic acid degradation protein paaN.
           This enzyme is proposed to act in the ring-opening step
           of phenylacetic acid degradation which follows ligation
           of the acid with coenzyme A (by PaaF) and hydroxylation
           by a multicomponent non-heme iron hydroxylase complex
           (PaaGHIJK). Gene symbols have been standardized in. This
           enzyme is related to aldehyde dehydrogenases and has
           domains which are members of the pfam00171 and pfam01575
           families. This family includes paaN genes from
           Pseudomonas, Sinorhizobium, Rhodopseudomonas,
           Escherichia, Deinococcus and Corynebacterium. Another
           homology family (TIGR02288) includes several other
           species.
          Length = 663

 Score = 99.1 bits (247), Expect = 3e-22
 Identities = 83/333 (24%), Positives = 127/333 (38%), Gaps = 20/333 (6%)

Query: 67  VDVEYSLRTLRYYAGFADK--------IHGKTIPINGDFFAYTRH--EPV-GVCGQIIPW 115
           VD++  + TL  Y+    +           +  P++ D     RH   P  GV  QI  +
Sbjct: 95  VDIDGGIGTLFTYSSLGRRELPDANLIPEDEFEPLSKDGSFQGRHILTPKGGVAVQINAF 154

Query: 116 NFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGI-PEGVVNIVPGYGDA 174
           NFP+  L  K APA   G   + KPA  T   A A+   + E+G+ PEG + ++   G A
Sbjct: 155 NFPVWGLLEKFAPAFLAGVPTLAKPATPTAYVAEALVRTMVESGLLPEGSLQLI--CGSA 212

Query: 175 GAALVQHPGVDKVAFTGSTEVGKL------VQQGSGASNLKRTTLELGGKSPNIIFADAD 228
           G  L      D VAFTGS            V +     N +  +L       +    + +
Sbjct: 213 GDLLDHLDHRDVVAFTGSAATADRLRAHPNVLERGIRFNAEADSLNAAILGEDATPDEPE 272

Query: 229 LDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQG 288
            D   +     L    GQ C A  R  V   + +  +     R  +  +GDP +  V+ G
Sbjct: 273 FDLFAQEIVRELTIKAGQKCTAIRRVIVPKALLEAVLKALQARLAKVVLGDPREEGVDMG 332

Query: 289 PQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFG 348
           P +  EQ   +   + +  + GA++  GG    D  F+    + A       +   E FG
Sbjct: 333 PLVSLEQRADVEAAVAALLAAGAEVRLGGPGRLDGAFFPPTLLLAEDPWAGAVHATEAFG 392

Query: 349 PVQQLIRFSSIEEVIERANNSDYGLAAAVFTKD 381
           PV     +    E    A      L A + T D
Sbjct: 393 PVATFFPYGDRAEAARLAARGGGSLVATLATSD 425


>gnl|CDD|177831 PLN02174, PLN02174, aldehyde dehydrogenase family 3 member H1.
          Length = 484

 Score = 98.2 bits (244), Expect = 4e-22
 Identities = 72/279 (25%), Positives = 136/279 (48%), Gaps = 7/279 (2%)

Query: 104 EPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEG 163
           EP+GV   I  WN+P L+    +  A++ GN +VLKP+E  P ++  +  L+ E  +   
Sbjct: 111 EPLGVVLVISAWNYPFLLSIDPVIGAISAGNAVVLKPSELAPASSALLAKLL-EQYLDSS 169

Query: 164 VVNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNII 223
            V +V G      AL++    DK+ +TGS+++G+++   + A +L    LELGGKSP ++
Sbjct: 170 AVRVVEGAVTETTALLEQKW-DKIFYTGSSKIGRVI-MAAAAKHLTPVVLELGGKSPVVV 227

Query: 224 FADADLDAAVEGAHFGLF-FNMGQCCCAGSRTFVQDTIYDEFVARSGERAKRRTVGDPFD 282
            +D DL   V     G +  N GQ C +            + +    +  +     +P +
Sbjct: 228 DSDTDLKVTVRRIIAGKWGCNNGQACISPDYILTTKEYAPKVIDAMKKELETFYGKNPME 287

Query: 283 LNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGFYVQPTVFANVRDDMKIA 342
            + +    ++    D++ + ++  + + +  +  GG    +   + PT+  +V  D  I 
Sbjct: 288 -SKDMSRIVNSTHFDRLSKLLD--EKEVSDKIVYGGEKDRENLKIAPTILLDVPLDSLIM 344

Query: 343 REEIFGPVQQLIRFSSIEEVIERANNSDYGLAAAVFTKD 381
            EEIFGP+  ++  +++EE  +   +    LAA +FT +
Sbjct: 345 SEEIFGPLLPILTLNNLEESFDVIRSRPKPLAAYLFTHN 383


>gnl|CDD|143447 cd07129, ALDH_KGSADH, Alpha-Ketoglutaric Semialdehyde
           Dehydrogenase.  Alpha-Ketoglutaric Semialdehyde (KGSA)
           Dehydrogenase (KGSADH, EC 1.2.1.26) catalyzes the
           NAD(P)+-dependent conversion of KGSA to
           alpha-ketoglutarate. This CD contains such sequences as
           those seen in Azospirillum brasilense, KGSADH-II
           (D-glucarate/D-galactarate-inducible) and KGSADH-III
           (hydroxy-L-proline-inducible). Both show similar high
           substrate specificity for KGSA and different coenzyme
           specificity; KGSADH-II is NAD+-dependent and KGSADH-III
           is NADP+-dependent. Also included in this CD is the
           NADP(+)-dependent aldehyde dehydrogenase from Vibrio
           harveyi which catalyzes the oxidation of long-chain
           aliphatic aldehydes to acids.
          Length = 454

 Score = 86.4 bits (215), Expect = 3e-18
 Identities = 82/289 (28%), Positives = 115/289 (39%), Gaps = 50/289 (17%)

Query: 105 PVGVCGQIIPWNFPLLMLAWKI-----APALATGNTIVLK--PA--EQTPLTALAVGALI 155
           PV V G     NFPL   A+ +     A ALA G  +V+K  PA    + L A A+ A +
Sbjct: 108 PVAVFG---ASNFPL---AFSVAGGDTASALAAGCPVVVKAHPAHPGTSELVARAIRAAL 161

Query: 156 AEAGIPEGVVNIVPGYG-DAGAALVQHPGVDKVAFTGSTEVGK-LVQQGSGASNLKRTTL 213
              G+P GV +++ G G + G ALV+HP +  V FTGS   G+ L    +          
Sbjct: 162 RATGLPAGVFSLLQGGGREVGVALVKHPAIKAVGFTGSRRGGRALFDAAAARPEPIPFYA 221

Query: 214 ELGGKSPNIIFADA---DLDAAVEGAHFGLFFNMGQ-CCCAGSRTFVQDTIYDEFVARSG 269
           ELG  +P  I   A     +A  +G    L    GQ C   G          D F+A   
Sbjct: 222 ELGSVNPVFILPGALAERGEAIAQGFVGSLTLGAGQFCTNPGLVLVPAGPAGDAFIAALA 281

Query: 270 ERAKRRTVGDPFDLNVEQGPQIDKEQM--DKILEFIESGKSQ-----GAQLVAGGGRAGD 322
           E                   Q     M    I E    G        G +++AGG  A +
Sbjct: 282 EALA------------AAPAQT----MLTPGIAEAYRQGVEALAAAPGVRVLAGGA-AAE 324

Query: 323 KGFYVQPTVF----ANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERAN 367
            G    PT+F    A    D  + +EE+FGP   ++R+    E++  A 
Sbjct: 325 GGNQAAPTLFKVDAAAFLADPAL-QEEVFGPASLVVRYDDAAELLAVAE 372


>gnl|CDD|143396 cd07077, ALDH-like, NAD(P)+-dependent aldehyde dehydrogenase-like
           (ALDH-like) family.  The aldehyde dehydrogenase-like
           (ALDH-like) group of the ALDH superfamily of
           NAD(P)+-dependent enzymes which, in general, oxidize a
           wide range of  endogenous and exogenous aliphatic and
           aromatic aldehydes to their corresponding carboxylic
           acids and play an  important role in detoxification.
           This group includes families ALDH18, ALDH19, and ALDH20
           and represents such proteins as gamma-glutamyl phosphate
           reductase, LuxC-like acyl-CoA reductase, and coenzyme A
           acylating aldehyde dehydrogenase.  All of these proteins
           have a conserved cysteine that aligns with the catalytic
           cysteine of the ALDH group.
          Length = 397

 Score = 68.0 bits (166), Expect = 2e-12
 Identities = 60/239 (25%), Positives = 85/239 (35%), Gaps = 28/239 (11%)

Query: 63  MSYSVDVEYSLR-TLRYYAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQIIPWNFPLLM 121
           M  S    Y    T R        I    +P NG+   Y R  P+GV   I+P   PL  
Sbjct: 59  MGCSESKLYKNIDTERGITASVGHIQDVLLPDNGE--TYVRAFPIGVTMHILPSTNPLSG 116

Query: 122 LAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEA---GIPEGVVNIVPGYGDAGA-A 177
           +       +AT N  + +P    P T  A+  L   A     P+ +V  VP   D  A  
Sbjct: 117 IT-SALRGIATRNQCIFRPHPSAPFTNRALALLFQAADAAHGPKILVLYVPHPSDELAEE 175

Query: 178 LVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAAVEGAH 237
           L+ HP +D +  TG  +      +    S          G SP ++   AD + A    H
Sbjct: 176 LLSHPKIDLIVATGGRDAVDAAVK---HSPHIPVIGFGAGNSPVVVDETADEERASGSVH 232

Query: 238 FGLFFNMGQCCCAGSRTF-----VQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQI 291
              FF+  Q  CA  +       V D +Y+EF  +               L V Q  + 
Sbjct: 233 DSKFFD--QNACASEQNLYVVDDVLDPLYEEFKLKLVVEG----------LKVPQETKP 279


>gnl|CDD|143444 cd07126, ALDH_F12_P5CDH, Delta(1)-pyrroline-5-carboxylate
           dehydrogenase, ALDH family 12.
           Delta(1)-pyrroline-5-carboxylate dehydrogenase (P5CDH,
           EC=1.5.1.12), family 12: a proline catabolic enzyme of
           the aldehyde dehydrogenase (ALDH) protein superfamily.
           P5CDH is a mitochondrial enzyme involved in proline
           degradation and catalyzes the NAD + -dependent
           conversion of P5C to glutamate.  The P5CDH, ALDH12A1
           gene, in Arabidopsis, has been identified as an
           osmotic-stress-inducible ALDH gene. This CD contains
           both Viridiplantae and Alveolata P5CDH sequences.
          Length = 489

 Score = 54.4 bits (131), Expect = 5e-08
 Identities = 75/333 (22%), Positives = 124/333 (37%), Gaps = 67/333 (20%)

Query: 105 PVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGV 164
           P G    I P+NFPL + A ++  AL  GN  +LK   +  +       L+   G+P   
Sbjct: 142 PYGPVAIITPFNFPLEIPALQLMGALFMGNKPLLKVDSKVSVVMEQFLRLLHLCGMPATD 201

Query: 165 VNIVPGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPNIIF 224
           V+++   G     ++         FTGS++V             +R  LEL GK   +  
Sbjct: 202 VDLIHSDGPTMNKILLEANPRMTLFTGSSKVA------------ERLALELHGK---VKL 246

Query: 225 ADADLDAAVEGAHFG------------LFFNMGQCCCAGSRTFVQDTIYDE-FVARSGER 271
            DA  D  + G                 +   GQ C A S  F  +       + +    
Sbjct: 247 EDAGFDWKILGPDVSDVDYVAWQCDQDAYACSGQKCSAQSILFAHENWVQAGILDKLKAL 306

Query: 272 AKRRTVGDPFDLNVEQGP------QIDKEQMDKILEFIESGKSQGAQLVAGGGRAGDKGF 325
           A++R +    DL +  GP      +   + +DK+L         GA+++ GG    +   
Sbjct: 307 AEQRKLE---DLTI--GPVLTWTTERILDHVDKLLAI------PGAKVLFGGKPLTNHSI 355

Query: 326 -----YVQPT-VF-----ANVRDDMKIAREEIFGPVQQLIRFSSIEE--VIERANNSDYG 372
                  +PT VF       + ++ ++   E+FGP Q +  +   +   V+E        
Sbjct: 356 PSIYGAYEPTAVFVPLEEIAIEENFELVTTEVFGPFQVVTEYKDEQLPLVLEALERMHAH 415

Query: 373 LAAAVFTKDLDKTNYVT---------QGLRAGT 396
           L AAV + D+     V           G+RA T
Sbjct: 416 LTAAVVSNDIRFLQEVLANTVNGTTYAGIRART 448


>gnl|CDD|143400 cd07081, ALDH_F20_ACDH_EutE-like, Coenzyme A acylating aldehyde
           dehydrogenase (ACDH), Ethanolamine utilization protein
           EutE, and related proteins.  Coenzyme A acylating
           aldehyde dehydrogenase (ACDH), an NAD+ and CoA-dependent
           acetaldehyde dehydrogenase, acetylating (EC=1.2.1.10),
           functions as a single enzyme (such as the Ethanolamine
           utilization protein, EutE, in Salmonella typhimurium) or
           as part of a multifunctional enzyme to convert
           acetaldehyde into acetyl-CoA. The E. coli
           aldehyde-alcohol dehydrogenase includes the functional
           domains, alcohol dehydrogenase (ADH), ACDH, and
           pyruvate-formate-lyase deactivase; and the Entamoeba
           histolytica aldehyde-alcohol dehydrogenase 2 (ALDH20A1)
           includes the functional domains ADH and ACDH, and may be
           critical enzymes in the fermentative pathway.
          Length = 439

 Score = 52.3 bits (125), Expect = 3e-07
 Identities = 61/335 (18%), Positives = 116/335 (34%), Gaps = 41/335 (12%)

Query: 79  YAGFADKIHGKTIPINGDFFAYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVL 138
           Y  + D+     +  + +       EP+GV   I P   P   + +K   +L T N+I+ 
Sbjct: 69  YNVYKDEKTCGVLTGDENGGTLIIAEPIGVVASITPSTNPTSTVIFKSLISLKTRNSIIF 128

Query: 139 KP----AEQTPLTALAVGALIAEAGIPEGVVNIV--PGYGDAGAALVQHPGVDKVAFTGS 192
            P     + T   A  +      AG PE ++  +  P   +    L++ PG+  +  TG 
Sbjct: 129 SPHPRAKKVTQRAATLLLQAAVAAGAPENLIGWIDNPSI-ELAQRLMKFPGIGLLLATGG 187

Query: 193 TEVGKLVQQGSGASNLKRTTLELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGS 252
             V K     +  S+ K       G +P +I   AD+  AV+       F+ G  C +  
Sbjct: 188 PAVVK-----AAYSSGKPAIGVGAGNTPVVIDETADIKRAVQSIVKSKTFDNGVICASEQ 242

Query: 253 RTFVQDTIYDEFVARSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQ 312
              V D++YDE +     +                  ++  E++ ++   I         
Sbjct: 243 SVIVVDSVYDEVMRLFEGQG---------------AYKLTAEELQQVQPVILKNGDVNRD 287

Query: 313 LVA--GGGRAGDKGFYVQPT-----VFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIER 365
           +V       A   G  V               + +    E   PV  + R ++  +   +
Sbjct: 288 IVGQDAYKIAAAAGLKVPQETRILIGEVTSLAEHEPFAHEKLSPVLAMYRAANFADADAK 347

Query: 366 A----NNSDYGLAAAVFT---KDLDKTNYVTQGLR 393
           A         G  +A+++   K ++  N     ++
Sbjct: 348 ALALKLEGGCGHTSAMYSDNIKAIENMNQFANAMK 382


>gnl|CDD|143440 cd07122, ALDH_F20_ACDH, Coenzyme A acylating aldehyde dehydrogenase
           (ACDH), ALDH family 20-like.  Coenzyme A acylating
           aldehyde dehydrogenase (ACDH, EC=1.2.1.10), an NAD+ and
           CoA-dependent acetaldehyde dehydrogenase, functions as a
           single enzyme (such as the Ethanolamine utilization
           protein, EutE, in Salmonella typhimurium) or as part of
           a multifunctional enzyme to convert acetaldehyde into
           acetyl-CoA . The E. coli aldehyde-alcohol dehydrogenase
           includes the functional domains, alcohol dehydrogenase
           (ADH), ACDH, and pyruvate-formate-lyase deactivase; and
           the Entamoeba histolytica aldehyde-alcohol dehydrogenase
           2 (ALDH20A1) includes the functional domains ADH and
           ACDH and may be critical enzymes in the fermentative
           pathway.
          Length = 436

 Score = 49.0 bits (118), Expect = 3e-06
 Identities = 82/324 (25%), Positives = 121/324 (37%), Gaps = 61/324 (18%)

Query: 104 EPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKP---AEQTPLTALAVGALIA---E 157
           EPVGV   +IP   P     +K   AL T N I+  P   A++   +  A   +      
Sbjct: 94  EPVGVIAALIPSTNPTSTAIFKALIALKTRNAIIFSPHPRAKKC--SIEAAKIMREAAVA 151

Query: 158 AGIPEGVVNIVPGYGDAGA-ALVQHPGVDKVAFTGSTEVGKLVQQ------GSGASNLKR 210
           AG PEG++  +          L++HP VD +  TG   + K          G G  N   
Sbjct: 152 AGAPEGLIQWIEEPSIELTQELMKHPDVDLILATGGPGMVKAAYSSGKPAIGVGPGN--- 208

Query: 211 TTLELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGE 270
                    P  I   AD+  AV+       F+ G  C +     V D IYDE  A    
Sbjct: 209 --------VPAYIDETADIKRAVKDIILSKTFDNGTICASEQSVIVDDEIYDEVRA---- 256

Query: 271 RAKRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGK------SQGAQLVAGGGRAGDKG 324
             KRR  G  F LN E+     KE+++K L F + G        + AQ +A    AG + 
Sbjct: 257 ELKRR--GAYF-LNEEE-----KEKLEKAL-FDDGGTLNPDIVGKSAQKIA--ELAGIE- 304

Query: 325 FYVQPTV------FANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERA----NNSDYGLA 374
             V             V  +  ++RE++  PV    R    EE +E+A         G  
Sbjct: 305 --VPEDTKVLVAEETGVGPEEPLSREKLS-PVLAFYRAEDFEEALEKARELLEYGGAGHT 361

Query: 375 AAVFTKDLDKTNYVTQGLRAGTLI 398
           A + + D +        +    ++
Sbjct: 362 AVIHSNDEEVIEEFALRMPVSRIL 385


>gnl|CDD|143439 cd07121, ALDH_EutE, Ethanolamine utilization protein EutE-like.
           Coenzyme A acylating aldehyde dehydrogenase (ACDH), an
           NAD+ and CoA-dependent acetaldehyde dehydrogenase,
           acetylating (EC=1.2.1.10), converts acetaldehyde into
           acetyl-CoA.  This CD is limited to such monofunctional
           enzymes as the Ethanolamine utilization protein, EutE,
           in Salmonella typhimurium.  Mutations in eutE abolish
           the ability to utilize ethanolamine as a carbon source.
          Length = 429

 Score = 47.6 bits (114), Expect = 7e-06
 Identities = 69/312 (22%), Positives = 111/312 (35%), Gaps = 65/312 (20%)

Query: 105 PVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKP---AEQTPLTALAVGAL---IAEA 158
           P GV G I P   P   +       LA GN +V  P   A++   +A AV  +   IAEA
Sbjct: 97  PFGVIGAITPSTNPTETIINNSISMLAAGNAVVFNPHPGAKKV--SAYAVELINKAIAEA 154

Query: 159 GIPEGVVNIV--PGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQ------GSGASNLKR 210
           G P+ +V  V  P   +    L+ HP ++ +  TG   V K          G+GA N   
Sbjct: 155 GGPDNLVVTVEEPTI-ETTNELMAHPDINLLVVTGGPAVVKAALSSGKKAIGAGAGN--- 210

Query: 211 TTLELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVARSGE 270
                    P ++   AD++ A      G  F+    C A       D++ D  +A    
Sbjct: 211 --------PPVVVDETADIEKAARDIVQGASFDNNLPCIAEKEVIAVDSVADYLIAAMQR 262

Query: 271 RAKRRTVGDPFDLNVEQGPQIDKEQMDKILE--------------FIESGKSQGAQLVAG 316
                               ++ EQ +++LE              ++  GK     L A 
Sbjct: 263 ---------------NGAYVLNDEQAEQLLEVVLLTNKGATPNKKWV--GKDASKILKAA 305

Query: 317 GGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSDYGL--A 374
           G             +      D     EE   P+  ++R  + +E IE A   ++G    
Sbjct: 306 GIEVPAD----IRLIIVETDKDHPFVVEEQMMPILPVVRVKNFDEAIELAVELEHGNRHT 361

Query: 375 AAVFTKDLDKTN 386
           A + +K+++   
Sbjct: 362 AIIHSKNVENLT 373


>gnl|CDD|131341 TIGR02288, PaaN_2, phenylacetic acid degradation protein paaN.
           This enzyme is proposed to act in the ring-opening step
           of phenylacetic acid degradation which follows ligation
           of the acid with coenzyme A (by PaaF) and hydroxylation
           by a multicomponent non-heme iron hydroxylase complex
           (PaaGHIJK). Gene symbols have been standardized in. This
           enzyme is related to aldehyde dehydrogenases and has a
           domain which is a member of the pfam00171 family. This
           family includes sequences from Burkholderia, Bordetella,
           Streptomyces. Other PaaN enzymes are represented by a
           separate model, TIGR02278.
          Length = 551

 Score = 47.6 bits (113), Expect = 8e-06
 Identities = 64/239 (26%), Positives = 90/239 (37%), Gaps = 43/239 (17%)

Query: 69  VEYSLRTLRYYAGFA--DKIHGKTIPINGDFFAYTRHEPVG-------VCGQIIPWN-FP 118
           V Y+ R +      A  +K  GK  P+        R   V         C     WN +P
Sbjct: 157 VAYAYREMSRIPETAVWEKPQGKNDPLKLK----KRFTIVPRGIALVIGCSTFPTWNTYP 212

Query: 119 LLMLAWKIAPALATGNTIVLKPAEQTPL-TALAVGA---LIAEAGIPEGVVNIV--PGYG 172
            L        +LATGN +++KP     L  AL V     ++ EAG    +V +       
Sbjct: 213 GLF------ASLATGNPVLVKPHPGAILPLALTVQVAREVLGEAGFDPNLVTLAAFDPGH 266

Query: 173 DAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTL--ELGGKSPNIIFADADLD 230
           +A   L   P V  + FTGS   G+ ++Q     N ++  +  E  G +  II +  D  
Sbjct: 267 EAAQRLATDPAVRIIDFTGSNAFGQWLEQ-----NARQAQVYTEKAGVNTVIIESTDDYK 321

Query: 231 AAVEGAHFGLFFNMGQCCCAGSRTFV-QDTI--------YDEFVARSGERAKRRTVGDP 280
           A +    F L    GQ C       V +D I        YDE VA     A    +GDP
Sbjct: 322 AMLRNLAFSLSLYSGQMCTTTQAILVPRDGIRTDQGRKSYDE-VAADLATAIDGLLGDP 379


>gnl|CDD|143445 cd07127, ALDH_PAD-PaaZ, Phenylacetic acid degradation proteins PaaZ
           (Escherichia coli) and PaaN (Pseudomonas putida)-like.
           Phenylacetic acid degradation (PAD) proteins PaaZ
           (Escherichia coli) and PaaN (Pseudomonas putida) are
           putative aromatic ring cleavage enzymes of the aerobic
           PA catabolic pathway. PaaZ mutants were defective for
           growth with PA as a sole carbon source due to
           interruption of the putative ring opening system.  This
           CD is limited to bacterial monofunctional enzymes.
          Length = 549

 Score = 47.1 bits (112), Expect = 1e-05
 Identities = 67/280 (23%), Positives = 99/280 (35%), Gaps = 33/280 (11%)

Query: 109 CGQIIPWN-FPLLMLAWKIAPALATGNTIVLKPAEQTPL-TALAVG---ALIAEAGIPEG 163
           C     WN +P L        +LATGN +++KP     L  A+ V     ++AEAG    
Sbjct: 202 CSTFPTWNGYPGLF------ASLATGNPVIVKPHPAAILPLAITVQVAREVLAEAGFDPN 255

Query: 164 VVNIV--PGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQGSGASNLKRTTLELGGKSPN 221
           +V +            L   P V  + FTGS   G  ++  +  + +     E  G +  
Sbjct: 256 LVTLAADTPEEPIAQTLATRPEVRIIDFTGSNAFGDWLEANARQAQV---YTEKAGVNTV 312

Query: 222 IIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFV-QDTI--------YDEFVARSGERA 272
           ++ +  DL A +    F L    GQ C      +V +D I        +DE  A     A
Sbjct: 313 VVDSTDDLKAMLRNLAFSLSLYSGQMCTTPQNIYVPRDGIQTDDGRKSFDEVAADLA-AA 371

Query: 273 KRRTVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGGRAG---DKGFYVQP 329
               + DP       G      Q    L  I   +  G  L+A    A           P
Sbjct: 372 IDGLLADPARAAALLGAI----QSPDTLARIAEARQLGEVLLASEAVAHPEFPDARVRTP 427

Query: 330 TVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNS 369
            +      D     EE FGP+  ++   S +  IE A  S
Sbjct: 428 LLLKLDASDEAAYAEERFGPIAFVVATDSTDHSIELARES 467


>gnl|CDD|237956 PRK15398, PRK15398, aldehyde dehydrogenase EutE; Provisional.
          Length = 465

 Score = 38.3 bits (90), Expect = 0.006
 Identities = 65/292 (22%), Positives = 104/292 (35%), Gaps = 69/292 (23%)

Query: 105 PVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKP---AEQTPLTALAVGAL---IAEA 158
           P GV G + P   P   +       LA GN++V  P   A++   +  A+  L   I  A
Sbjct: 129 PFGVIGAVTPSTNPTETIINNAISMLAAGNSVVFSPHPGAKKV--SLRAIELLNEAIVAA 186

Query: 159 GIPEGVVNIV--PGYGDAGAALVQHPGVDKVAFTGSTEVGKLVQQ------GSGASNLKR 210
           G PE +V  V  P   +    L++HPG+  +  TG   V K   +      G+GA N   
Sbjct: 187 GGPENLVVTVAEPTI-ETAQRLMKHPGIALLVVTGGPAVVKAAMKSGKKAIGAGAGN--- 242

Query: 211 TTLELGGKSPNIIFADADLDAA----VEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFVA 266
                    P ++   AD++ A    V+GA F    N+   C A     V D++ DE + 
Sbjct: 243 --------PPVVVDETADIEKAARDIVKGASFD--NNL--PCIAEKEVIVVDSVADELMR 290

Query: 267 RSGERAKRRTVGDPFDLNVEQGPQIDKEQMDKIL------------EFIESGKSQGAQLV 314
              +                    +  EQ +K+             +++  GK     L 
Sbjct: 291 LMEK---------------NGAVLLTAEQAEKLQKVVLKNGGTVNKKWV--GKDAAKILE 333

Query: 315 AGGGRAGDKGFYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERA 366
           A G             +      +      E+  PV  ++R   ++E I  A
Sbjct: 334 AAGINVPKD----TRLLIVETDANHPFVVTELMMPVLPVVRVKDVDEAIALA 381


>gnl|CDD|143399 cd07080, ALDH_Acyl-CoA-Red_LuxC, Acyl-CoA reductase LuxC.  Acyl-CoA
           reductase, LuxC, (EC=1.2.1.50) is the fatty acid
           reductase enzyme responsible for synthesis of the
           aldehyde  substrate for the luminescent reaction
           catalyzed by luciferase. The fatty acid reductase, a
           luminescence-specific, multienzyme complex (LuxCDE),
           reduces myristic acid to generate the long chain fatty
           aldehyde required for the luciferase-catalyzed reaction
           resulting in the emission of blue-green light.
           Mutational studies of conserved cysteines of LuxC
           revealed that the cysteine which aligns with the
           catalytic cysteine conserved throughout the ALDH
           superfamily is the LuxC acylation site. This CD is
           composed of mainly bacterial sequences but also includes
           a few archaeal sequences similar to the Methanospirillum
           hungateiacyl acyl-CoA reductase RfbN.
          Length = 422

 Score = 36.9 bits (86), Expect = 0.019
 Identities = 22/59 (37%), Positives = 30/59 (50%), Gaps = 1/59 (1%)

Query: 99  AYTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAE 157
            Y R +P G+   II  N PLL + W I   L   N  +LK +   PLTA A+   +A+
Sbjct: 106 GYIRAQPRGLVVHIIAGNVPLLPV-WSIVRGLLVKNVNLLKMSSSDPLTATALLRSLAD 163


>gnl|CDD|219180 pfam06798, PrkA, PrkA serine protein kinase C-terminal domain.
           This is a family of PrkA bacterial and archaeal serine
           kinases approximately 630 residues long. This family
           corresponds to the C-terminal domain.
          Length = 254

 Score = 32.2 bits (74), Expect = 0.45
 Identities = 40/180 (22%), Positives = 69/180 (38%), Gaps = 43/180 (23%)

Query: 255 FVQDTIYDEFVARSGERAKRRTVGDPFDL------NVEQGPQIDKEQMDKILEFIESGKS 308
           F+   + +  V+ S ER       +P D+       ++    I +E  DK LEF++  + 
Sbjct: 41  FIGKALSNALVSDSEERCI-----NPLDVLEELEQGIKDHESIPEEDRDKYLEFLKVVRK 95

Query: 309 ----------QGAQLVA---GGGRAGDKGFYVQPTVFANVRDDMKIARE---EIFGPVQQ 352
                     Q A + +         D   Y+   V A + D+ K+      E   P ++
Sbjct: 96  EYNERIKKEVQKAYIESYEEAAQNLFDN--YLD-NVEAYINDE-KVKDPLTGEELEPDER 151

Query: 353 LIRFSSIEEVIERANNSDYGLAAAVFTKDLDKTNYVTQGLRAGTLIRFSSIE---EVIER 409
            +R  SIEE I  +           F +++   NY+ +  R G    + S E   E IE+
Sbjct: 152 FLR--SIEEQIGISEE-----RKKDFRREI--VNYIGRLAREGKKFDYDSYERLREAIEK 202


>gnl|CDD|218799 pfam05893, LuxC, Acyl-CoA reductase (LuxC).  This family consists
           of several bacterial Acyl-CoA reductase (LuxC) proteins.
           The channelling of fatty acids into the fatty aldehyde
           substrate for the bacterial bioluminescence reaction is
           catalyzed by a fatty acid reductase multienzyme complex,
           which channels fatty acids through the thioesterase
           (LuxD), synthetase (LuxE) and reductase (LuxC)
           components.
          Length = 400

 Score = 32.3 bits (74), Expect = 0.52
 Identities = 45/201 (22%), Positives = 75/201 (37%), Gaps = 42/201 (20%)

Query: 100 YTRHEPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAG 159
           Y + +P G+   ++  N PLL +   I   +   N  +LKP+   P TA A+ A  A+  
Sbjct: 83  YEKAQPRGLVVHVLAGNVPLLPV-MSILQGILVKNVNLLKPSSSDPFTAAALLASFAD-- 139

Query: 160 IPEGVVNIVPGYGDAGA-ALVQHPGVDKVAFTGSTEVGKLVQQGS-------GASNLKRT 211
                  +   +  A + ++V           G T++ +L+ Q +       G   +K  
Sbjct: 140 -------VDATHPLARSISVVYWHS-------GDTQLEQLIMQHADVVIAWGGEEAIKWI 185

Query: 212 TLELGGKSPNIIF----------ADADLDAAVEG-AHFGLFFNMGQCCCAGSRT-FV--Q 257
              L      I F            A +D A +G AH   F++  Q  C+  +  FV   
Sbjct: 186 RKHLPPGCDWIKFGPKISFAVVDPPAAIDKAADGVAHDICFYD--QQACSSPQVVFVESD 243

Query: 258 DTI-YDEFVARSGERAKRRTV 277
           D I   EF  R     ++   
Sbjct: 244 DGITLREFAQRLAAALEKYAR 264


>gnl|CDD|237515 PRK13805, PRK13805, bifunctional acetaldehyde-CoA/alcohol
           dehydrogenase; Provisional.
          Length = 862

 Score = 32.1 bits (74), Expect = 0.76
 Identities = 52/182 (28%), Positives = 72/182 (39%), Gaps = 36/182 (19%)

Query: 104 EPVGV-CGQIIPWNFPLLMLAWKIAPALATGNTIVLKP---AEQTPLTALAV--GALIAE 157
           EPVGV  G I P   P     +K   AL T N I+      A+++ + A  +   A +A 
Sbjct: 107 EPVGVIAG-ITPTTNPTSTAIFKALIALKTRNPIIFSFHPRAQKSSIAAAKIVLDAAVA- 164

Query: 158 AGIPEGVVNIVPGYG-DAGAALVQHPGVDKVAFTG---------STEVGK--LVQQGSGA 205
           AG P+ ++  +     +   AL+ HPG+  +  TG         S+  GK  L   G GA
Sbjct: 165 AGAPKDIIQWIEEPSVELTNALMNHPGIALILATGGPGMVKAAYSS--GKPAL---GVGA 219

Query: 206 SNLKRTTLELGGKSPNIIFADADLDAAVEGAHFGLFFNMGQCCCAGSRTFVQDTIYDEFV 265
            N            P  I   AD+  AV        F+ G  C +     V D IYDE  
Sbjct: 220 GN-----------VPAYIDKTADIKRAVNDILLSKTFDNGMICASEQAVIVDDEIYDEVK 268

Query: 266 AR 267
             
Sbjct: 269 EE 270


>gnl|CDD|237108 PRK12467, PRK12467, peptide synthase; Provisional.
          Length = 3956

 Score = 32.1 bits (73), Expect = 0.79
 Identities = 18/51 (35%), Positives = 28/51 (54%), Gaps = 3/51 (5%)

Query: 124 WKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNIVPGYGDA 174
            ++  ALA+G T+ L P +     A A  AL+A+ G+   V+ IVP +  A
Sbjct: 713 TELFGALASGATLHLLPPDCA-RDAEAFAALMADQGV--TVLKIVPSHLQA 760


>gnl|CDD|131570 TIGR02518, EutH_ACDH, acetaldehyde dehydrogenase (acetylating). 
          Length = 488

 Score = 31.4 bits (71), Expect = 1.0
 Identities = 18/66 (27%), Positives = 26/66 (39%), Gaps = 4/66 (6%)

Query: 104 EPVGVCGQIIPWNFPLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAE----AG 159
            PVGV   +IP   P     +K   ++   N IV  P        +    L+ +    AG
Sbjct: 103 VPVGVVAGLIPSTNPTSTAIYKTLISIKARNAIVFSPHPNAKKCIIETVKLMRKAAEEAG 162

Query: 160 IPEGVV 165
            PEG +
Sbjct: 163 APEGAI 168


>gnl|CDD|240868 cd12422, RRM2_PTBP1_hnRNPL_like, RNA recognition motif in
          polypyrimidine tract-binding protein 1 (PTB or hnRNP
          I), heterogeneous nuclear ribonucleoprotein L
          (hnRNP-L), and similar proteins.  This subfamily
          corresponds to the RRM2 of polypyrimidine tract-binding
          protein 1 (PTB or hnRNP I), polypyrimidine
          tract-binding protein 2 (PTBP2 or nPTB), regulator of
          differentiation 1 (Rod1), heterogeneous nuclear
          ribonucleoprotein L (hnRNP-L), heterogeneous nuclear
          ribonucleoprotein L-like (hnRNP-LL), polypyrimidine
          tract-binding protein homolog 3 (PTBPH3),
          polypyrimidine tract-binding protein homolog 1 and 2
          (PTBPH1 and PTBPH2), and similar proteins, and RRM3 of
          PTBPH1 and PTBPH2. PTB is an important negative
          regulator of alternative splicing in mammalian cells
          and also functions at several other aspects of mRNA
          metabolism, including mRNA localization, stabilization,
          polyadenylation, and translation. PTBP2 is highly
          homologous to PTB and is perhaps specific to the
          vertebrates. Unlike PTB, PTBP2 is enriched in the brain
          and in some neural cell lines. It binds more stably to
          the downstream control sequence (DCS) RNA than PTB does
          but is a weaker repressor of splicing in vitro. PTBP2
          also greatly enhances the binding of two other
          proteins, heterogeneous nuclear ribonucleoprotein
          (hnRNP) H and KH-type splicing-regulatory protein
          (KSRP), to the DCS RNA. The binding properties of PTBP2
          and its reduced inhibitory activity on splicing imply
          roles in controlling the assembly of other
          splicing-regulatory proteins. Rod1 is a mammalian
          polypyrimidine tract binding protein (PTB) homolog of a
          regulator of differentiation in the fission yeast
          Schizosaccharomyces pombe, where the nrd1 gene encodes
          an RNA binding protein negatively regulates the onset
          of differentiation. ROD1 is predominantly expressed in
          hematopoietic cells or organs. It might play a role
          controlling differentiation in mammals. hnRNP-L is a
          higher eukaryotic specific subunit of human KMT3a (also
          known as HYPB or hSet2) complex required for histone H3
          Lys-36 trimethylation activity. It plays both, nuclear
          and cytoplasmic, roles in mRNA export of intronless
          genes, IRES-mediated translation, mRNA stability, and
          splicing. hnRNP-LL protein plays a critical and unique
          role in the signal-induced regulation of CD45 and acts
          as a global regulator of alternative splicing in
          activated T cells. This family also includes
          polypyrimidine tract binding protein homolog 3 (PTBPH3)
          found in plant. Although its biological roles remain
          unclear, PTBPH3 shows significant sequence similarity
          to other family members, all of which contain four RNA
          recognition motifs (RRM), also known as RBD (RNA
          binding domain) or RNP (ribonucleoprotein domain).
          Although their biological roles remain unclear, both
          PTBPH1 and PTBPH2 show significant sequence similarity
          to PTB. However, in contrast to PTB, they have three
          RRMs. .
          Length = 85

 Score = 29.1 bits (66), Expect = 1.2
 Identities = 11/55 (20%), Positives = 26/55 (47%), Gaps = 5/55 (9%)

Query: 24 AVPKPITKPKIEHTQLFINNEFVN---AQSSLETLDNGKPYNMSYSVDVEYSLRT 75
          AV K +   K    Q  +  +F +   A+++ + L+    Y+   ++D+++S   
Sbjct: 27 AVEKILIFEKNTGVQALV--QFDSVESAENAKKALNGRNIYDGCCTLDIQFSRLK 79


>gnl|CDD|237250 PRK12893, PRK12893, allantoate amidohydrolase; Reviewed.
          Length = 412

 Score = 31.0 bits (71), Expect = 1.2
 Identities = 21/82 (25%), Positives = 27/82 (32%), Gaps = 25/82 (30%)

Query: 234 EGAHFGLFFNMGQCCCAGSRTFV----QDTIYD----------EFVARSGERAKRRTVGD 279
           EGA F            GS  F      D              E +AR G R   R    
Sbjct: 130 EGARFAP-------AMLGSGVFTGALPLDDALARRDADGITLGEALARIGYRGTARVGRR 182

Query: 280 PFD----LNVEQGPQIDKEQMD 297
             D    L++EQGP ++ E + 
Sbjct: 183 AVDAYLELHIEQGPVLEAEGLP 204


>gnl|CDD|213296 cd05930, A_NRPS, The adenylation domain of nonribosomal peptide
           synthetases (NRPS).  The adenylation (A) domain of NRPS
           recognizes a specific amino acid or hydroxy acid and
           activates it as an (amino) acyl adenylate by hydrolysis
           of ATP. The activated acyl moiety then forms a thioester
           bond to the enzyme-bound cofactor phosphopantetheine of
           a peptidyl carrier protein domain. NRPSs are large
           multifunctional enzymes which synthesize many
           therapeutically useful peptides in bacteria and fungi
           via a template-directed, nucleic acid independent
           nonribosomal mechanism. These natural products include
           antibiotics, immunosuppressants, plant and animal
           toxins, and enzyme inhibitors. NRPS has a distinct
           modular structure in which each module is responsible
           for the recognition, activation, and in some cases,
           modification of a single amino acid residue of the final
           peptide product. The modules can be subdivided into
           domains that catalyze specific biochemical reactions.
          Length = 445

 Score = 30.5 bits (70), Expect = 2.0
 Identities = 19/49 (38%), Positives = 26/49 (53%), Gaps = 5/49 (10%)

Query: 124 WKIAPALATGNTIVLKPAEQ-TPLTALAVGALIAEAGIPEGVVNIVPGY 171
           W+I P L +G T+VL P E      ALA   L+ E  I   V+++VP  
Sbjct: 150 WEIFPPLLSGATLVLAPPEVLRDPEALA--ELLREHRI--TVLHLVPSL 194


>gnl|CDD|227012 COG4666, COG4666, TRAP-type uncharacterized transport system, fused
           permease components [General function prediction only].
          Length = 642

 Score = 30.0 bits (68), Expect = 3.0
 Identities = 13/50 (26%), Positives = 17/50 (34%)

Query: 118 PLLMLAWKIAPALATGNTIVLKPAEQTPLTALAVGALIAEAGIPEGVVNI 167
            L  L  K           +L+  E      + V    A AGI  GVV +
Sbjct: 386 VLRALFRKDTRMGKILARDLLEALESGARNVIPVAIACATAGIIVGVVTL 435


>gnl|CDD|143588 cd07564, nitrilases_CHs, Nitrilases, cyanide hydratase (CH)s, and
           similar proteins (class 1 nitrilases).  Nitrilases
           (nitrile aminohydrolases, EC:3.5.5.1) hydrolyze nitriles
           (RCN) to ammonia and the corresponding carboxylic acid.
           Most nitrilases prefer aromatic nitriles, some prefer
           arylacetonitriles and others aliphatic nitriles. This
           group includes the nitrilase cyanide dihydratase (CDH),
           which hydrolyzes inorganic cyanide (HCN) to produce
           formate. It also includes cyanide hydratase (CH), which
           hydrolyzes HCN to formamide. This group includes four
           Arabidopsis thaliana nitrilases (Ath)NIT1-4. AthNIT1-3
           have a strong substrate preference for
           phenylpropionitrile (PPN) and other nitriles which may
           originate from the breakdown of glucosinolates. The
           product of PPN hydrolysis, phenylacetic acid has auxin
           activity. AthNIT1-3 can also convert indoacetonitrile to
           indole-3-acetic acid (IAA, auxin), but with a lower
           affinity and velocity. From their expression patterns,
           it has been speculated that NIT3 may produce IAA during
           the early stages of germination, and that NIT3 may
           produce IAA during embryo development and maturation.
           AthNIT4 has a strong substrate specificity for the
           nitrile, beta-cyano-L-alanine (Ala(CN)), an intermediate
           of cyanide detoxification. AthNIT4 has both a nitrilase
           activity and a nitrile hydratase (NHase) activity, which
           generate aspartic acid and asparagine respectively from
           Ala(CN). NHase catalyzes the hydration of nitriles to
           their corresponding amides. This subgroup belongs to a
           larger nitrilase superfamily comprised of belong to a
           larger nitrilase superfamily comprised of nitrile- or
           amide-hydrolyzing enzymes and amide-condensing enzymes,
           which depend on a Glu-Lys-Cys catalytic triad. This
           superfamily has been classified in the literature based
           on global and structure based sequence analysis into
           thirteen different enzyme classes (referred to as 1-13),
           this subgroup corresponds to class 1.
          Length = 297

 Score = 29.0 bits (66), Expect = 5.0
 Identities = 8/20 (40%), Positives = 11/20 (55%)

Query: 296 MDKILEFIESGKSQGAQLVA 315
           ++K    IE   + GAQLV 
Sbjct: 19  VEKACRLIEEAAANGAQLVV 38


>gnl|CDD|184499 PRK14090, PRK14090, phosphoribosylformylglycinamidine synthase II;
           Provisional.
          Length = 601

 Score = 29.1 bits (65), Expect = 5.6
 Identities = 18/53 (33%), Positives = 28/53 (52%), Gaps = 11/53 (20%)

Query: 268 SGERAKRRT--VGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGG 318
           +GE+A + +  VGDPF          +K  ++  LE +E G  +GAQ +  GG
Sbjct: 196 TGEKATKLSIQVGDPF---------AEKMLIEAFLEMVEEGLVEGAQDLGAGG 239


>gnl|CDD|179591 PRK03545, PRK03545, putative arabinose transporter; Provisional.
          Length = 390

 Score = 28.7 bits (65), Expect = 5.8
 Identities = 20/65 (30%), Positives = 30/65 (46%), Gaps = 18/65 (27%)

Query: 115 WNFPLLMLA-----------WKIAPALATGNTIVLKPAEQTP--LTALAVGALIAE-AGI 160
           WNF +L+++           W I  +LA    I + PA +    L+ LA G  +A   G+
Sbjct: 95  WNFTVLLISRIGIAFAHAIFWSITASLA----IRVAPAGKKAQALSLLATGTALAMVLGL 150

Query: 161 PEGVV 165
           P G V
Sbjct: 151 PLGRV 155


>gnl|CDD|219213 pfam06878, Pkip-1, Pkip-1 protein.  This family consists of several
           Pkip-1 proteins which seem to be specific to
           Nucleopolyhedroviruses. The function of this family is
           unknown although it has been found that Pkip-1 is not
           essential for virus replication in cell culture or by in
           vivo intrahaemocoelic injection.
          Length = 163

 Score = 28.0 bits (63), Expect = 5.9
 Identities = 15/48 (31%), Positives = 24/48 (50%), Gaps = 7/48 (14%)

Query: 323 KGFYVQPTVFANVRDDMKIAREEIFGPVQQLIRFSSIEEVIERANNSD 370
           K +  +P+   N+ ++M I   +IFG  +QL       E +  AN SD
Sbjct: 22  KKYLKKPSKDENLAEEMLILAADIFGQEEQL-------EALLSANASD 62


>gnl|CDD|215956 pfam00504, Chloroa_b-bind, Chlorophyll A-B binding protein. 
          Length = 135

 Score = 27.7 bits (62), Expect = 6.4
 Identities = 10/30 (33%), Positives = 15/30 (50%)

Query: 150 AVGALIAEAGIPEGVVNIVPGYGDAGAALV 179
            +GAL+ E     G V   P + +AG  L+
Sbjct: 24  VLGALVPELLTKLGGVAGEPAWFEAGGPLL 53


>gnl|CDD|220561 pfam10091, Glycoamylase, Putative glucoamylase.  The structure of
           UniProt:Q5LIB7 has an alpha/alpha toroid fold and is
           similar structurally to a number of glucoamylases. Most
           of these structural homologues are glucoamylases,
           involved in breaking down complex sugars (e.g. starch).
           The biologically relevant state is likely to be
           monomeric. The putative active site is located at the
           centre of the toroid with a well defined large cavity.
          Length = 218

 Score = 28.0 bits (63), Expect = 8.1
 Identities = 7/32 (21%), Positives = 13/32 (40%), Gaps = 2/32 (6%)

Query: 60  PYN--MSYSVDVEYSLRTLRYYAGFADKIHGK 89
           PY   ++     E +L  LR    +   + G+
Sbjct: 141 PYAALLALPFAPEEALANLRRLYDYGPGLWGR 172


>gnl|CDD|234133 TIGR03168, 1-PFK, hexose kinase, 1-phosphofructokinase family.
           This family consists largely of 1-phosphofructokinases,
           but also includes tagatose-6-kinases and
           6-phosphofructokinases.
          Length = 303

 Score = 28.3 bits (64), Expect = 8.6
 Identities = 16/53 (30%), Positives = 28/53 (52%), Gaps = 3/53 (5%)

Query: 278 GDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGGG--RAGDKGFYVQ 328
           G+  +LN E GP+I +E+++++LE +    + G  +V  G         FY Q
Sbjct: 97  GEETELN-EPGPEISEEELEQLLEKLRELLASGDIVVISGSLPPGVPPDFYAQ 148


>gnl|CDD|219388 pfam07368, DUF1487, Protein of unknown function (DUF1487).  This
           family consists of several uncharacterized proteins from
           Drosophila melanogaster. The function of this family is
           unknown.
          Length = 215

 Score = 28.1 bits (63), Expect = 8.8
 Identities = 18/52 (34%), Positives = 25/52 (48%), Gaps = 14/52 (26%)

Query: 222 IIFADADLDAA----VEGAH--FGLFFNMGQCCCAGSRTFVQDTIYDEFVAR 267
           +IF D D+++A    VE  H  F           A +   VQ++I DEFV R
Sbjct: 9   VIFEDGDVNSALHALVESLHNPFAPG--------AVATVLVQESIRDEFVER 52


>gnl|CDD|238570 cd01164, FruK_PfkB_like, 1-phosphofructokinase (FruK), minor
           6-phosphofructokinase (pfkB) and related sugar kinases.
           FruK plays an important role in the predominant pathway
           for fructose utilisation.This group also contains
           tagatose-6-phophate kinase, an enzyme of the tagatose
           6-phosphate pathway, which responsible for breakdown of
           the galactose moiety during lactose metabolism by
           bacteria such as L. lactis.
          Length = 289

 Score = 27.9 bits (63), Expect = 9.4
 Identities = 12/42 (28%), Positives = 25/42 (59%), Gaps = 1/42 (2%)

Query: 276 TVGDPFDLNVEQGPQIDKEQMDKILEFIESGKSQGAQLVAGG 317
             G   ++N E GP+I +E+++ +LE +++   +G  +V  G
Sbjct: 96  EDGTETEIN-EPGPEISEEELEALLEKLKALLKKGDIVVLSG 136


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.318    0.135    0.393 

Gapped
Lambda     K      H
   0.267   0.0677    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 22,970,428
Number of extensions: 2316664
Number of successful extensions: 2981
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2580
Number of HSP's successfully gapped: 255
Length of query: 442
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 342
Effective length of database: 6,502,202
Effective search space: 2223753084
Effective search space used: 2223753084
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 61 (27.4 bits)