RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy10608
         (445 letters)



>gnl|CDD|143404 cd07085, ALDH_F6_MMSDH, Methylmalonate semialdehyde dehydrogenase
           and ALDH family members 6A1 and 6B2.  Methylmalonate
           semialdehyde dehydrogenase (MMSDH, EC=1.2.1.27)
           [acylating] from Bacillus subtilis is involved in valine
           metabolism and catalyses the NAD+- and CoA-dependent
           oxidation of methylmalonate semialdehyde into
           propionyl-CoA. Mitochondrial human MMSDH ALDH6A1 and
           Arabidopsis MMSDH ALDH6B2 are also present in this CD.
          Length = 478

 Score =  639 bits (1651), Expect = 0.0
 Identities = 220/352 (62%), Positives = 278/352 (78%), Gaps = 1/352 (0%)

Query: 30  TKLYIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSII 89
            KL+I+G++VESK T+W+D++NPAT EV+ RVP AT  E+++AVA+AK A+  WS T ++
Sbjct: 1   LKLFINGEWVESKTTEWLDVYNPATGEVIARVPLATAEEVDAAVAAAKAAFPAWSATPVL 60

Query: 90  TRQQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGET 149
            RQQV+F+ + +++ N+ +LA  IT E GKTLADA GDVLRGL+VVE  CSIP +++GE 
Sbjct: 61  KRQQVMFKFRQLLEENLDELARLITLEHGKTLADARGDVLRGLEVVEFACSIPHLLKGEY 120

Query: 150 LAGVAKDMDIHSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACM 209
           L  VA+ +D +SYR PLGV AGITPFNFPAMIPLWMFP+AIACGNT V+KPSER PGA M
Sbjct: 121 LENVARGIDTYSYRQPLGVVAGITPFNFPAMIPLWMFPMAIACGNTFVLKPSERVPGAAM 180

Query: 210 MLVDMLTQAGCPPGVVNVIHGAHDAVNFICDHPDIRAISFVGSDQAGKYIYERGAKNGKR 269
            L ++L +AG P GV+NV+HG  +AVN + DHPDI+A+SFVGS   G+YIYER A NGKR
Sbjct: 181 RLAELLQEAGLPDGVLNVVHGGKEAVNALLDHPDIKAVSFVGSTPVGEYIYERAAANGKR 240

Query: 270 VQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV-EWLPELKK 328
           VQ+  GAKNH V+M DA+   T N LVGAAFGAAGQRCMALS  + VG+   EW+P+L +
Sbjct: 241 VQALGGAKNHAVVMPDADLEQTANALVGAAFGAAGQRCMALSVAVAVGDEADEWIPKLVE 300

Query: 329 RAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGRNLKV 380
           RA+ LKV AG+ PG D+GPVISP AKERI  LI+SGV+EGA L+LDGR +KV
Sbjct: 301 RAKKLKVGAGDDPGADMGPVISPAAKERIEGLIESGVEEGAKLVLDGRGVKV 352



 Score = 99.1 bits (248), Expect = 2e-22
 Identities = 34/48 (70%), Positives = 38/48 (79%)

Query: 380 VGINVPIPVPLSMFSFTGSRGSFLGENHFYGKQGFYFYTETKTVTQLW 427
           VGINVPIPVPL+ FSF G +GSF G+ HFYGK G  FYT+TKTVT  W
Sbjct: 431 VGINVPIPVPLAFFSFGGWKGSFFGDLHFYGKDGVRFYTQTKTVTSRW 478


>gnl|CDD|130783 TIGR01722, MMSDH, methylmalonic acid semialdehyde dehydrogenase.
           Involved in valine catabolism,
           methylmalonate-semialdehyde dehydrogenase catalyzes the
           irreversible NAD+- and CoA-dependent oxidative
           decarboxylation of methylmalonate semialdehyde to
           propionyl-CoA. Methylmalonate-semialdehyde dehydrogenase
           has been characterized in both prokaryotes and
           eukaryotes, functioning as a mammalian tetramer and a
           bacterial homodimer. Although similar in monomeric
           molecular mass and enzymatic activity, the N-terminal
           sequence in P.aeruginosa does not correspond with the
           N-terminal sequence predicted for rat liver. Sequence
           homology to a variety of prokaryotic and eukaryotic
           aldehyde dehydrogenases places MMSDH in the aldehyde
           dehydrogenase (NAD+) superfamily (pfam00171), making
           MMSDH's CoA requirement unique among known ALDHs.
           Methylmalonate semialdehyde dehydrogenase is closely
           related to betaine aldehyde dehydrogenase,
           2-hydroxymuconic semialdehyde dehydrogenase, and class 1
           and 2 aldehyde dehydrogenase. In Bacillus, a highly
           homologous protein to methylmalonic acid semialdehyde
           dehydrogenase, groups out from the main MMSDH clade with
           Listeria and Sulfolobus. This Bacillus protein has been
           suggested to be located in an iol operon and/or involved
           in myo-inositol catabolism, converting malonic
           semialdehyde to acetyl CoA ad CO2. The preceeding
           enzymes responsible for valine catabolism are present in
           Bacillus, Listeria, and Sulfolobus [Energy metabolism,
           Amino acids and amines].
          Length = 477

 Score =  522 bits (1347), Expect = 0.0
 Identities = 185/350 (52%), Positives = 258/350 (73%)

Query: 31  KLYIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIIT 90
             +I GKF E  +  +I + NPATNEV T+V  A+  E+++AVASA++ + TW +TS+  
Sbjct: 2   NHWIGGKFAEGASGTYIPVTNPATNEVTTKVAFASVDEVDAAVASARETFLTWGQTSLAQ 61

Query: 91  RQQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGETL 150
           R  VL R Q ++K +  ++AE IT E GKT +DA GDV RGL+VVE  C + ++++GET 
Sbjct: 62  RTSVLLRYQALLKEHRDEIAELITAEHGKTHSDALGDVARGLEVVEHACGVNSLLKGETS 121

Query: 151 AGVAKDMDIHSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMM 210
             VA  +D++S R PLGV AGITPFNFPAMIPLWMFP+AIACGNT V+KPSE+ P A + 
Sbjct: 122 TQVATRVDVYSIRQPLGVCAGITPFNFPAMIPLWMFPIAIACGNTFVLKPSEKVPSAAVK 181

Query: 211 LVDMLTQAGCPPGVVNVIHGAHDAVNFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRV 270
           L ++ ++AG P GV+NV+HG  +AV+ + +HPD++A+SFVGS   G+YI+  G+ +GKRV
Sbjct: 182 LAELFSEAGAPDGVLNVVHGDKEAVDRLLEHPDVKAVSFVGSTPIGRYIHTTGSAHGKRV 241

Query: 271 QSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAVEWLPELKKRA 330
           Q+  GAKNH V+M DA++    + LVGAA+GAAGQRCMA+S  + VG A EW+PE+++RA
Sbjct: 242 QALGGAKNHMVVMPDADKDAAADALVGAAYGAAGQRCMAISAAVLVGAADEWVPEIRERA 301

Query: 331 EALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGRNLKV 380
           E +++  G+ PG ++GP+I+PQAK+R+ SLI  G  EGA +LLDGR  KV
Sbjct: 302 EKIRIGPGDDPGAEMGPLITPQAKDRVASLIAGGAAEGAEVLLDGRGYKV 351



 Score = 88.8 bits (220), Expect = 6e-19
 Identities = 32/49 (65%), Positives = 38/49 (77%)

Query: 379 KVGINVPIPVPLSMFSFTGSRGSFLGENHFYGKQGFYFYTETKTVTQLW 427
           +VG+NVPIPVPL  FSFTG + SF G++H YGKQG +FYT  KTVT  W
Sbjct: 429 QVGVNVPIPVPLPYFSFTGWKDSFFGDHHIYGKQGTHFYTRGKTVTTRW 477


>gnl|CDD|166060 PLN02419, PLN02419, methylmalonate-semialdehyde dehydrogenase
           [acylating].
          Length = 604

 Score =  435 bits (1120), Expect = e-148
 Identities = 207/347 (59%), Positives = 265/347 (76%)

Query: 34  IDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQQ 93
           I G FVES+++ +ID+ NPAT EVV++VP  T  E ++AV++AK+A+  W  T I TRQ+
Sbjct: 118 IGGSFVESQSSSFIDVINPATQEVVSKVPLTTNEEFKAAVSAAKQAFPLWRNTPITTRQR 177

Query: 94  VLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGETLAGV 153
           V+ + Q +I+ NM +LA NIT EQGKTL D+ GD+ RGL+VVE  C + T+  GE L  V
Sbjct: 178 VMLKFQELIRKNMDKLAMNITTEQGKTLKDSHGDIFRGLEVVEHACGMATLQMGEYLPNV 237

Query: 154 AKDMDIHSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVD 213
           +  +D +S R PLGV AGI PFNFPAMIPLWMFPVA+ CGNT ++KPSE+DPGA ++L +
Sbjct: 238 SNGVDTYSIREPLGVCAGICPFNFPAMIPLWMFPVAVTCGNTFILKPSEKDPGASVILAE 297

Query: 214 MLTQAGCPPGVVNVIHGAHDAVNFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSN 273
           +  +AG P GV+N++HG +D VN ICD  DIRA+SFVGS+ AG +IY R A  GKR+QSN
Sbjct: 298 LAMEAGLPDGVLNIVHGTNDTVNAICDDEDIRAVSFVGSNTAGMHIYARAAAKGKRIQSN 357

Query: 274 MGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAVEWLPELKKRAEAL 333
           MGAKNHG+++ DAN   TLN L+ A FGAAGQRCMALSTV+FVG+A  W  +L +RA+AL
Sbjct: 358 MGAKNHGLVLPDANIDATLNALLAAGFGAAGQRCMALSTVVFVGDAKSWEDKLVERAKAL 417

Query: 334 KVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGRNLKV 380
           KV+ G+ P  DLGPVIS QAKERI  LIQSGVD+GA LLLDGR++ V
Sbjct: 418 KVTCGSEPDADLGPVISKQAKERICRLIQSGVDDGAKLLLDGRDIVV 464



 Score = 73.6 bits (180), Expect = 5e-14
 Identities = 32/64 (50%), Positives = 46/64 (71%), Gaps = 3/64 (4%)

Query: 379 KVGINVPIPVPLSMFSFTGSRGSFLGENHFYGKQGFYFYTETKTVTQLWRESDVTHSKAA 438
           ++GINVPIPVPL  FSFTG++ SF G+ +FYGK G  F+T+ K VTQ  ++    HS  +
Sbjct: 542 QIGINVPIPVPLPFFSFTGNKASFAGDLNFYGKAGVDFFTQIKLVTQKQKD---IHSPFS 598

Query: 439 VSMP 442
           +++P
Sbjct: 599 LAIP 602


>gnl|CDD|215767 pfam00171, Aldedh, Aldehyde dehydrogenase family.  This family of
           dehydrogenases act on aldehyde substrates. Members use
           NADP as a cofactor. The family includes the following
           members: The prototypical members are the aldehyde
           dehydrogenases EC:1.2.1.3. Succinate-semialdehyde
           dehydrogenase EC:1.2.1.16. Lactaldehyde dehydrogenase
           EC:1.2.1.22. Benzaldehyde dehydrogenase EC:1.2.1.28.
           Methylmalonate-semialdehyde dehydrogenase EC:1.2.1.27.
           Glyceraldehyde-3-phosphate dehydrogenase EC:1.2.1.9.
           Delta-1-pyrroline-5-carboxylate dehydrogenase EC:
           1.5.1.12. Acetaldehyde dehydrogenase EC:1.2.1.10.
           Glutamate-5-semialdehyde dehydrogenase EC:1.2.1.41. This
           family also includes omega crystallin, an eye lens
           protein from squid and octopus that has little aldehyde
           dehydrogenase activity.
          Length = 459

 Score =  342 bits (879), Expect = e-114
 Identities = 129/342 (37%), Positives = 204/342 (59%), Gaps = 6/342 (1%)

Query: 38  FVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQQVLFR 97
           +V+S +++ I++ NPAT EV+  VP AT  ++++AVA+A+ A++ W+KT    R  +L +
Sbjct: 1   WVDS-SSETIEVINPATGEVIATVPAATAEDVDAAVAAARAAFKAWAKTPPSERAAILLK 59

Query: 98  LQNIIKANMKQLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGETLAGVAKDM 157
             ++++    +LAE  T E GK LA+A G+V R +  +     +   ++GETL      +
Sbjct: 60  AADLLEERRDELAELETLETGKPLAEARGEVPRAIDTLRYYAGLARKLEGETLP-SDPGV 118

Query: 158 DIHSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQ 217
             ++ R PLGV   ITP+NFP ++  W    A+A GNT V+KPSE  P   ++L ++  +
Sbjct: 119 LAYTRREPLGVVGAITPWNFPLLLAAWKIAPALAAGNTVVLKPSELTPLTALLLAELFEE 178

Query: 218 AGCPPGVVNVIHG-AHDAVNFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGA 276
           AG PPGV+NV+ G   +  + + +HPD+  +SF GS + G+ I +  AKN KRV   +G 
Sbjct: 179 AGLPPGVLNVVTGSGSEVGDALVEHPDVDKVSFTGSTEVGRRIAKAAAKNLKRVTLELGG 238

Query: 277 KNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELKKRAEALK 334
           KN  ++  DA+    +   V  AFG AGQ C A S  + V E++  E++  L + A++LK
Sbjct: 239 KNPLIVFDDADLDAAVEGAVFGAFGNAGQVCTAGS-RLLVHESIYDEFVERLVEAAKSLK 297

Query: 335 VSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGR 376
           V     P TD+GP+IS + +ER+LS I+   +EGA LL  G 
Sbjct: 298 VGDPLDPDTDIGPLISKKQRERVLSYIEDAKEEGAKLLCGGE 339



 Score = 41.7 bits (99), Expect = 5e-04
 Identities = 17/44 (38%), Positives = 19/44 (43%), Gaps = 2/44 (4%)

Query: 380 VGINVPIPVPLSMFSFTGSRGSFLGENHFYGKQGFYFYTETKTV 423
           V IN           F G + S  G     GK+G   YTETKTV
Sbjct: 418 VWINDYTTGDPEALPFGGFKQSGFG--REGGKEGLEEYTETKTV 459


>gnl|CDD|223944 COG1012, PutA, NAD-dependent aldehyde dehydrogenases [Energy
           production and conversion].
          Length = 472

 Score =  329 bits (846), Expect = e-109
 Identities = 130/349 (37%), Positives = 196/349 (56%), Gaps = 5/349 (1%)

Query: 30  TKLYIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSII 89
            KL IDG++V+      I++ NPAT EV+  VP AT  ++++AVA+A+ A+E WS+ S  
Sbjct: 1   YKLLIDGEWVD--GASTIEVINPATGEVIATVPAATAEDVDAAVAAARAAFEAWSRLSAE 58

Query: 90  TRQQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGET 149
            R  +L R+ ++++A  ++LA  IT E GK +++A G++ R    +         ++GET
Sbjct: 59  ERAAILRRIADLLEARAEELAALITLETGKPISEARGEIARAADFIRYYAEEARRLEGET 118

Query: 150 LAGVAKDMDIHSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACM 209
           +    K       R PLGV   ITP+NFP  +  W    A+A GNT V+KPSE+ P + +
Sbjct: 119 IP-TDKGSKALVRREPLGVVGAITPWNFPLALAAWKLAPALAAGNTVVLKPSEQTPLSAL 177

Query: 210 MLVDMLTQAGCPPGVVNVIHGA-HDAVNFICDHPDIRAISFVGSDQAGKYIYERGAKNGK 268
            L ++  +AG P GV+NV+ G   +  + +  HPD+ AISF GS   G+ I    A N K
Sbjct: 178 ALAELAAEAGLPAGVLNVVTGGGAEVGDALVAHPDVDAISFTGSTAVGRAIAAAAAANLK 237

Query: 269 RVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGE-AVEWLPELK 327
            V   +G K+  +++ DA+    ++  V  AF  AGQRC A S +I       E++  L 
Sbjct: 238 PVTLELGGKSPAIVLEDADLDAAVDAAVFGAFFNAGQRCTAASRLIVHESVYDEFVERLV 297

Query: 328 KRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGR 376
            RA +LKV     P TDLGP+IS +  +R+   I+  V EGA LL  G+
Sbjct: 298 ARAASLKVGDPLDPSTDLGPLISEEQLDRVEGYIEDAVAEGARLLAGGK 346



 Score = 40.4 bits (95), Expect = 0.002
 Identities = 20/65 (30%), Positives = 25/65 (38%), Gaps = 6/65 (9%)

Query: 369 ATLLLDGRNLKVG---INVPIP-VPLSMFSFTGSRGSFLGENHFYGKQGFYFYTETKTVT 424
           A      R L+ G   IN       ++   F G + S LG     GK G   +TE KTVT
Sbjct: 409 ARAFRVARRLEAGMVGINDYTGGADIAYLPFGGVKQSGLG--REGGKYGLEEFTEVKTVT 466

Query: 425 QLWRE 429
                
Sbjct: 467 IKLGP 471


>gnl|CDD|143449 cd07131, ALDH_AldH-CAJ73105, Uncharacterized Candidatus kuenenia
           aldehyde dehydrogenase AldH (CAJ73105)-like.
           Uncharacterized aldehyde dehydrogenase of Candidatus
           kuenenia AldH (locus CAJ73105) and similar sequences
           with similarity to alpha-aminoadipic semialdehyde
           dehydrogenase (AASADH, human ALDH7A1, EC=1.2.1.31),
           Arabidopsis ALDH7B4, and Streptomyces clavuligerus
           delta-1-piperideine-6-carboxylate dehydrogenase (P6CDH)
           are included in this CD.
          Length = 478

 Score =  301 bits (773), Expect = 6e-98
 Identities = 138/353 (39%), Positives = 203/353 (57%), Gaps = 5/353 (1%)

Query: 33  YIDGKFVESKATDWIDLHNPA-TNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITR 91
           YI G++V+S + +  D  NPA   EVV   P +T +++++AV +A++A+  W K     R
Sbjct: 2   YIGGEWVDSASGETFDSRNPADLEEVVGTFPLSTASDVDAAVEAAREAFPEWRKVPAPRR 61

Query: 92  QQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGETLA 151
            + LFR   ++K   ++LA  +T E GK LA+  GDV   + + +        + GET+ 
Sbjct: 62  AEYLFRAAELLKKRKEELARLVTREMGKPLAEGRGDVQEAIDMAQYAAGEGRRLFGETVP 121

Query: 152 GVAKDMDIHSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMML 211
               + D  + R P+GV A ITP+NFP  IP W    A+ CGNT V KP+E  P   + L
Sbjct: 122 SELPNKDAMTRRQPIGVVALITPWNFPVAIPSWKIFPALVCGNTVVFKPAEDTPACALKL 181

Query: 212 VDMLTQAGCPPGVVNVIHGAHDAV-NFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRV 270
           V++  +AG PPGVVNV+HG  + V   + +HPD+  +SF GS + G+ I E  A+  KRV
Sbjct: 182 VELFAEAGLPPGVVNVVHGRGEEVGEALVEHPDVDVVSFTGSTEVGERIGETCARPNKRV 241

Query: 271 QSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELKK 328
              MG KN  ++M DA+    L   + +AFG  GQRC A S +I V E+V  E+L    +
Sbjct: 242 ALEMGGKNPIIVMDDADLDLALEGALWSAFGTTGQRCTATSRLI-VHESVYDEFLKRFVE 300

Query: 329 RAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGRNLKVG 381
           RA+ L+V  G    TD+GP+I+    E++L+  + G +EGATLLL G  L  G
Sbjct: 301 RAKRLRVGDGLDEETDMGPLINEAQLEKVLNYNEIGKEEGATLLLGGERLTGG 353


>gnl|CDD|143415 cd07097, ALDH_KGSADH-YcbD, Bacillus subtilis NADP+-dependent
           alpha-ketoglutaric semialdehyde dehydrogenase ycbD-like.
            Kinetic studies of the Bacillus subtilis ALDH-like ycbD
           protein, which is involved in d-glucarate/d-galactarate
           utilization, reveal that it is a NADP+-dependent,
           alpha-ketoglutaric semialdehyde dehydrogenase (KGSADH).
           KGSADHs (EC 1.2.1.26) catalyze the NAD(P)+-dependent
           conversion of KGSA to alpha-ketoglutarate.
           Interestingly, the NADP+-dependent, tetrameric,
           2,5-dioxopentanoate dehydrogenase (EC=1.2.1.26), an
           enzyme involved in the catabolic pathway for D-arabinose
           in Sulfolobus solfataricus, also clusters in this group.
           This CD shows a distant phylogenetic relationship to the
           Azospirillum brasilense KGSADH-II (-III) group.
          Length = 473

 Score =  286 bits (734), Expect = 4e-92
 Identities = 129/351 (36%), Positives = 195/351 (55%), Gaps = 7/351 (1%)

Query: 33  YIDGKFVESKATDWIDLHNPA-TNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITR 91
           YIDG++V     +  +  NP+ T++VV +  +A+  + ++A+A+A  A+  W +TS   R
Sbjct: 4   YIDGEWVAGGDGE--ENRNPSDTSDVVGKYARASAEDADAAIAAAAAAFPAWRRTSPEAR 61

Query: 92  QQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGETLA 151
             +L +  + ++A  ++LA  +T E+GKTL +A G+V R  Q+          + GETL 
Sbjct: 62  ADILDKAGDELEARKEELARLLTREEGKTLPEARGEVTRAGQIFRYYAGEALRLSGETLP 121

Query: 152 GVAKDMDIHSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMML 211
                +++ + R PLGV   ITP+NFP  IP W    A+A GNT V KP+E  P +   L
Sbjct: 122 STRPGVEVETTREPLGVVGLITPWNFPIAIPAWKIAPALAYGNTVVFKPAELTPASAWAL 181

Query: 212 VDMLTQAGCPPGVVNVIHGAHDAV-NFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRV 270
           V++L +AG P GV N++ G+   V   + +HPD+ A+SF GS   G+ I    A  G RV
Sbjct: 182 VEILEEAGLPAGVFNLVMGSGSEVGQALVEHPDVDAVSFTGSTAVGRRIAAAAAARGARV 241

Query: 271 QSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELKK 328
           Q  MG KN  V++ DA+    +   V  AF + GQRC A S +I V E +   ++  L +
Sbjct: 242 QLEMGGKNPLVVLDDADLDLAVECAVQGAFFSTGQRCTASSRLI-VTEGIHDRFVEALVE 300

Query: 329 RAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGRNLK 379
           R +ALKV      G D+GPV+S +  E+ L  I+    EGA L+  G  LK
Sbjct: 301 RTKALKVGDALDEGVDIGPVVSERQLEKDLRYIEIARSEGAKLVYGGERLK 351


>gnl|CDD|143397 cd07078, ALDH, NAD(P)+ dependent aldehyde dehydrogenase family.
           The aldehyde dehydrogenase family (ALDH) of NAD(P)+
           dependent enzymes, in general, oxidize a wide range of
           endogenous and exogenous aliphatic and aromatic
           aldehydes to their corresponding carboxylic acids and
           play an  important role in detoxification. Besides
           aldehyde detoxification, many ALDH isozymes possess
           multiple additional catalytic and non-catalytic
           functions such as participating in  metabolic pathways,
           or as  binding proteins, or as osmoregulants, to mention
           a few. The enzyme has three domains, a NAD(P)+
           cofactor-binding domain, a catalytic domain, and a
           bridging domain; and the active enzyme  is generally
           either homodimeric or homotetrameric. The catalytic
           mechanism is proposed to involve cofactor binding,
           resulting in a conformational change and activation of
           an invariant catalytic cysteine nucleophile. The
           cysteine and aldehyde substrate form an oxyanion
           thiohemiacetal intermediate resulting in hydride
           transfer to the cofactor and formation of a
           thioacylenzyme intermediate. Hydrolysis of the
           thioacylenzyme and release of the carboxylic acid
           product occurs, and in most cases, the reduced cofactor
           dissociates from the enzyme. The evolutionary
           phylogenetic tree of ALDHs appears to have an initial
           bifurcation between what has been characterized as the
           classical aldehyde dehydrogenases, the ALDH family
           (ALDH) and extended family members or aldehyde
           dehydrogenase-like (ALDH-like) proteins. The ALDH
           proteins are represented by enzymes which share a number
           of highly conserved residues necessary for catalysis and
           cofactor binding and they include such proteins as
           retinal dehydrogenase, 10-formyltetrahydrofolate
           dehydrogenase, non-phosphorylating glyceraldehyde
           3-phosphate dehydrogenase,
           delta(1)-pyrroline-5-carboxylate dehydrogenases,
           alpha-ketoglutaric semialdehyde dehydrogenase,
           alpha-aminoadipic semialdehyde dehydrogenase, coniferyl
           aldehyde dehydrogenase and succinate-semialdehyde
           dehydrogenase.  Included in this larger group are all
           human, Arabidopsis, Tortula, fungal, protozoan, and
           Drosophila ALDHs identified in families ALDH1 through
           ALDH22 with the exception of families ALDH18, ALDH19,
           and ALDH20 which are present in the ALDH-like group.
          Length = 432

 Score =  276 bits (708), Expect = 9e-89
 Identities = 109/310 (35%), Positives = 166/310 (53%), Gaps = 4/310 (1%)

Query: 70  ESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVL 129
           ++AVA+A+ A++ W+      R  +L +L ++++   ++LA   T E GK + +A G+V 
Sbjct: 1   DAAVAAARAAFKAWAALPPAERAAILRKLADLLEERREELAALETLETGKPIEEALGEVA 60

Query: 130 RGLQVVEQCCSIPTIMQGETLAGVAKDMDIHSYRVPLGVTAGITPFNFPAMIPLWMFPVA 189
           R          +   + GE +            R PLGV   ITP+NFP ++  W    A
Sbjct: 61  RAADTFRYYAGLARRLHGEVIPSPDPGELAIVRREPLGVVGAITPWNFPLLLAAWKLAPA 120

Query: 190 IACGNTHVIKPSERDPGACMMLVDMLTQAGCPPGVVNVIHG-AHDAVNFICDHPDIRAIS 248
           +A GNT V+KPSE  P   ++L ++L +AG PPGV+NV+ G   +    +  HP +  IS
Sbjct: 121 LAAGNTVVLKPSELTPLTALLLAELLAEAGLPPGVLNVVTGDGDEVGAALASHPRVDKIS 180

Query: 249 FVGSDQAGKYIYERGAKNGKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCM 308
           F GS   GK I    A+N KRV   +G K+  ++  DA+    +   V  AFG AGQ C 
Sbjct: 181 FTGSTAVGKAIMRAAAENLKRVTLELGGKSPLIVFDDADLDAAVKGAVFGAFGNAGQVCT 240

Query: 309 ALSTVIFVGEAV--EWLPELKKRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVD 366
           A S  + V E++  E++  L +R +ALKV     P TD+GP+IS    +R+L+ I+    
Sbjct: 241 AAS-RLLVHESIYDEFVERLVERVKALKVGNPLDPDTDMGPLISAAQLDRVLAYIEDAKA 299

Query: 367 EGATLLLDGR 376
           EGA LL  G+
Sbjct: 300 EGAKLLCGGK 309


>gnl|CDD|143405 cd07086, ALDH_F7_AASADH-like, NAD+-dependent alpha-aminoadipic
           semialdehyde dehydrogenase and related proteins.  ALDH
           subfamily which includes the NAD+-dependent,
           alpha-aminoadipic semialdehyde dehydrogenase (AASADH,
           EC=1.2.1.31), also known as Antiquitin-1, ALDH7A1,
           ALDH7B or delta-1-piperideine-6-carboxylate
           dehydrogenase (P6CDH), and other similar sequences, such
           as the uncharacterized aldehyde dehydrogenase of
           Candidatus kuenenia AldH (locus CAJ73105).
          Length = 478

 Score =  256 bits (657), Expect = 1e-80
 Identities = 109/350 (31%), Positives = 180/350 (51%), Gaps = 8/350 (2%)

Query: 33  YIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQ 92
            I G++V S         NPA  E + RV  A+P ++E+AVA+A++A++ W K     R 
Sbjct: 2   VIGGEWVGSGGET-FTSRNPANGEPIARVFPASPEDVEAAVAAAREAFKEWRKVPAPRRG 60

Query: 93  QVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGETLAG 152
           +++ ++   ++   + L   ++ E GK L +  G+V   + + +    +  ++ G T+  
Sbjct: 61  EIVRQIGEALRKKKEALGRLVSLEMGKILPEGLGEVQEMIDICDYAVGLSRMLYGLTIPS 120

Query: 153 VAKDMDIHSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPG---ACM 209
                 +     PLGV   IT FNFP  +P W   +A+ CGNT V KPSE  P    A  
Sbjct: 121 ERPGHRLMEQWNPLGVVGVITAFNFPVAVPGWNAAIALVCGNTVVWKPSETTPLTAIAVT 180

Query: 210 -MLVDMLTQAGCPPGVVNVIHGAHDAVNFICDHPDIRAISFVGSDQAGKYIYERGAKNGK 268
            +L ++L + G PPGVVN++ G  D    +   P +  +SF GS + G+ + E  A+   
Sbjct: 181 KILAEVLEKNGLPPGVVNLVTGGGDGGELLVHDPRVPLVSFTGSTEVGRRVGETVARRFG 240

Query: 269 RVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPEL 326
           RV   +G  N  ++M DA+    +  ++ AA G AGQRC     +I V E+V  E+L  L
Sbjct: 241 RVLLELGGNNAIIVMDDADLDLAVRAVLFAAVGTAGQRCTTTRRLI-VHESVYDEFLERL 299

Query: 327 KKRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGR 376
            K  + +++      GT +GP+I+  A E+ L+ I+    +G T+L  G+
Sbjct: 300 VKAYKQVRIGDPLDEGTLVGPLINQAAVEKYLNAIEIAKSQGGTVLTGGK 349


>gnl|CDD|143407 cd07088, ALDH_LactADH-AldA, Escherichia coli lactaldehyde
           dehydrogenase AldA-like.  Lactaldehyde dehydrogenase
           from Escherichia coli (AldA, LactADH, EC=1.2.1.22), an
           NAD(+)-dependent enzyme involved in the metabolism of
           L-fucose and L-rhamnose, and other similar sequences are
           present in this CD.
          Length = 468

 Score =  251 bits (644), Expect = 7e-79
 Identities = 118/353 (33%), Positives = 197/353 (55%), Gaps = 6/353 (1%)

Query: 33  YIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQ 92
           YI+G+FV S + + ID+ NPAT EVV  VP AT  + + AV +A+ A + W +   I R 
Sbjct: 1   YINGEFVPSSSGETIDVLNPATGEVVATVPAATAEDADRAVDAAEAAQKAWERLPAIERA 60

Query: 93  QVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGETLAG 152
             L +L ++I+ N  +LA+ I EEQGKTL+ A  +V      ++        ++GE +  
Sbjct: 61  AYLRKLADLIRENADELAKLIVEEQGKTLSLARVEVEFTADYIDYMAEWARRIEGEIIPS 120

Query: 153 VAKDMDIHSYRVPLGVTAGITPFNFP-AMIPLWMFPVAIACGNTHVIKPSERDPGACMML 211
              + +I  ++VP+GV AGI P+NFP  +I   + P A+  GNT VIKPSE  P   +  
Sbjct: 121 DRPNENIFIFKVPIGVVAGILPWNFPFFLIARKLAP-ALVTGNTIVIKPSEETPLNALEF 179

Query: 212 VDMLTQAGCPPGVVNVIHGAHDAV-NFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRV 270
            +++ +AG P GV+N++ G    V + +  HP +  IS  GS +AG+ I E  A+N  +V
Sbjct: 180 AELVDEAGLPAGVLNIVTGRGSVVGDALVAHPKVGMISLTGSTEAGQKIMEAAAENITKV 239

Query: 271 QSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELKK 328
              +G K   ++M DA+    +  +V +     GQ C   +  ++V E +  E++ +L +
Sbjct: 240 SLELGGKAPAIVMKDADLDLAVKAIVDSRIINCGQVCTC-AERVYVHEDIYDEFMEKLVE 298

Query: 329 RAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGRNLKVG 381
           + +A+KV       TD+GP+++  A +++  +++  V+ GATLL  G+  +  
Sbjct: 299 KMKAVKVGDPFDAATDMGPLVNEAALDKVEEMVERAVEAGATLLTGGKRPEGE 351


>gnl|CDD|143421 cd07103, ALDH_F5_SSADH_GabD, Mitochondrial succinate-semialdehyde
           dehydrogenase and ALDH family members 5A1 and 5F1-like. 
           Succinate-semialdehyde dehydrogenase, mitochondrial
           (SSADH, GabD, EC=1.2.1.24) catalyzes the NAD+-dependent
           oxidation of succinate semialdehyde (SSA) to succinate.
           This group includes the human aldehyde dehydrogenase
           family 5 member A1 (ALDH5A1) which is a mitochondrial
           homotetramer that converts SSA to succinate in the last
           step of 4-aminobutyric acid (GABA) catabolism. This CD
           also includes the Arabidopsis SSADH gene product
           ALDH5F1. Mutations in this gene result in the
           accumulation of H2O2, suggesting a role in plant defense
           against the environmental stress of elevated reactive
           oxygen species.
          Length = 451

 Score =  248 bits (637), Expect = 7e-78
 Identities = 117/336 (34%), Positives = 188/336 (55%), Gaps = 18/336 (5%)

Query: 51  NPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANMKQLA 110
           NPAT EV+  VP A  A+ ++A+ +A  A++TW KT+   R  +L R  ++I+   + LA
Sbjct: 3   NPATGEVIGEVPDAGAADADAAIDAAAAAFKTWRKTTARERAAILRRWADLIRERAEDLA 62

Query: 111 ENITEEQGKTLADAEGDVLRGLQVV----EQCCSIPTIMQGETLAGVAKDMDIHSYRVPL 166
             +T EQGK LA+A G+V      +    E+   I     G T+   A    I   + P+
Sbjct: 63  RLLTLEQGKPLAEARGEVDYAASFLEWFAEEARRI----YGRTIPSPAPGKRILVIKQPV 118

Query: 167 GVTAGITPFNFP-AMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAGCPPGVV 225
           GV A ITP+NFP AMI   + P A+A G T V+KP+E  P + + L ++  +AG P GV+
Sbjct: 119 GVVAAITPWNFPAAMITRKIAP-ALAAGCTVVLKPAEETPLSALALAELAEEAGLPAGVL 177

Query: 226 NVIHGAHDAV-NFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNHG--VI 282
           NV+ G+   +   +C  P +R ISF GS   GK +  + A   KRV   +G   +   ++
Sbjct: 178 NVVTGSPAEIGEALCASPRVRKISFTGSTAVGKLLMAQAADTVKRVSLELGG--NAPFIV 235

Query: 283 MSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELKKRAEALKVSAGNV 340
             DA+    ++  + + F  AGQ C+  +  I+V E++  E++ +L +R + LKV  G  
Sbjct: 236 FDDADLDKAVDGAIASKFRNAGQTCVC-ANRIYVHESIYDEFVEKLVERVKKLKVGNGLD 294

Query: 341 PGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGR 376
            GTD+GP+I+ +A E++ +L++  V +GA +L  G+
Sbjct: 295 EGTDMGPLINERAVEKVEALVEDAVAKGAKVLTGGK 330


>gnl|CDD|143412 cd07093, ALDH_F8_HMSADH, Human aldehyde dehydrogenase family 8
           member A1-like.  In humans, the  aldehyde dehydrogenase
           family 8 member A1 (ALDH8A1) protein functions to
           convert 9-cis-retinal to 9-cis-retinoic acid and has a
           preference for NAD+. Also included in this CD is the
           2-hydroxymuconic semialdehyde dehydrogenase (HMSADH)
           which catalyzes the conversion of 2-hydroxymuconic
           semialdehyde to 4-oxalocrotonate, a step in the meta
           cleavage pathway of aromatic hydrocarbons in bacteria.
           Such HMSADHs seen here are: XylG of the TOL plasmid pWW0
           of Pseudomonas putida, TomC  of Burkholderia cepacia G4,
           and AphC of Comamonas testosterone.
          Length = 455

 Score =  244 bits (624), Expect = 6e-76
 Identities = 116/333 (34%), Positives = 182/333 (54%), Gaps = 10/333 (3%)

Query: 50  HNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANMKQL 109
            NPAT EV+ +VP+   AE+++AVA+AK+A+  WS+ S   R ++L ++ ++I+A   +L
Sbjct: 2   FNPATGEVLAKVPEGGAAEVDAAVAAAKEAFPGWSRMSPAERARILHKVADLIEARADEL 61

Query: 110 AENITEEQGKTLADAE-GDVLRGLQVVEQCCSIPTIMQGETLAGVAKDMDIHSY--RVPL 166
           A   + + GK +  A   D+ R              + GE+     +D    +Y  R P+
Sbjct: 62  ALLESLDTGKPITLARTRDIPRAAANFRFFADYILQLDGESYP---QDGGALNYVLRQPV 118

Query: 167 GVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAGCPPGVVN 226
           GV   ITP+N P M+  W    A+A GNT V+KPSE  P    +L ++  +AG PPGVVN
Sbjct: 119 GVAGLITPWNLPLMLLTWKIAPALAFGNTVVLKPSEWTPLTAWLLAELANEAGLPPGVVN 178

Query: 227 VIHGA-HDAVNFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNHGVIMSD 285
           V+HG   +A   +  HPD+  ISF G    G+ I    A N K V   +G KN  ++ +D
Sbjct: 179 VVHGFGPEAGAALVAHPDVDLISFTGETATGRTIMRAAAPNLKPVSLELGGKNPNIVFAD 238

Query: 286 ANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELKKRAEALKVSAGNVPGT 343
           A+    ++  V ++F   G+ C+A  + I V  ++  E+L    +RA+ALKV     P T
Sbjct: 239 ADLDRAVDAAVRSSFSNNGEVCLA-GSRILVQRSIYDEFLERFVERAKALKVGDPLDPDT 297

Query: 344 DLGPVISPQAKERILSLIQSGVDEGATLLLDGR 376
           ++GP+IS +  E++L  ++    EGAT+L  G 
Sbjct: 298 EVGPLISKEHLEKVLGYVELARAEGATILTGGG 330


>gnl|CDD|143424 cd07106, ALDH_AldA-AAD23400, Streptomyces aureofaciens putative
           aldehyde dehydrogenase AldA (AAD23400)-like.  Putative
           aldehyde dehydrogenase, AldA, from Streptomyces
           aureofaciens (locus AAD23400) and other similar
           sequences are present in this CD.
          Length = 446

 Score =  241 bits (617), Expect = 5e-75
 Identities = 113/335 (33%), Positives = 172/335 (51%), Gaps = 15/335 (4%)

Query: 50  HNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANMKQL 109
            NPAT EV    P A+ A+++ AVA+AK A+  WS T +  R+  L  + + I+AN ++L
Sbjct: 2   INPATGEVFASAPVASEAQLDQAVAAAKAAFPGWSATPLEERRAALLAIADAIEANAEEL 61

Query: 110 AENITEEQGKTLADAEGDVLRGLQVVEQCCSIP----TIMQGETLAGVAKDMDIHSYRVP 165
           A  +T EQGK LA+A+ +V   +  +    S+      I   +T         +   R P
Sbjct: 62  ARLLTLEQGKPLAEAQFEVGGAVAWLRYTASLDLPDEVIEDDDT-------RRVELRRKP 114

Query: 166 LGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAGCPPGVV 225
           LGV A I P+NFP ++  W    A+  GNT V+KPS   P   + L ++  +   PPGV+
Sbjct: 115 LGVVAAIVPWNFPLLLAAWKIAPALLAGNTVVLKPSPFTPLCTLKLGELAQEV-LPPGVL 173

Query: 226 NVIHGAHDAVNFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNHGVIMSD 285
           NV+ G  +    +  HPDIR ISF GS   GK +    AK  KRV   +G  +  +++ D
Sbjct: 174 NVVSGGDELGPALTSHPDIRKISFTGSTATGKKVMASAAKTLKRVTLELGGNDAAIVLPD 233

Query: 286 ANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELKKRAEALKVSAGNVPGT 343
            +      +L   AF  +GQ C A+  + +V E++  E+   L   A+A  V  G  PGT
Sbjct: 234 VDIDAVAPKLFWGAFINSGQVCAAIKRL-YVHESIYDEFCEALVALAKAAVVGDGLDPGT 292

Query: 344 DLGPVISPQAKERILSLIQSGVDEGATLLLDGRNL 378
            LGPV +    +++  L++    +GA +L  G  L
Sbjct: 293 TLGPVQNKMQYDKVKELVEDAKAKGAKVLAGGEPL 327


>gnl|CDD|143456 cd07138, ALDH_CddD_SSP0762, Rhodococcus ruber 6-oxolauric acid
           dehydrogenase-like.  The 6-oxolauric acid dehydrogenase
           (CddD) from Rhodococcus ruber SC1 which converts
           6-oxolauric acid to dodecanedioic acid, and the aldehyde
           dehydrogenase (locus SSP0762) from Staphylococcus
           saprophyticus subsp. saprophyticus ATCC 15305 and other
           similar sequences, are included in this CD.
          Length = 466

 Score =  234 bits (600), Expect = 3e-72
 Identities = 128/358 (35%), Positives = 182/358 (50%), Gaps = 30/358 (8%)

Query: 32  LYIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITR 91
            YIDG +V    T+ ID+ NPAT EV+  VP  T A+++ AVA+A++A+  WS TS+  R
Sbjct: 1   FYIDGAWVAPAGTETIDVINPATEEVIGTVPLGTAADVDRAVAAARRAFPAWSATSVEER 60

Query: 92  QQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGETLA 151
             +L R+    +A   +LA+ IT E G     A   + R  QV         I      A
Sbjct: 61  AALLERIAEAYEARADELAQAITLEMG-----APITLARAAQV--GL----GIGHLRAAA 109

Query: 152 GVAKDMDIHSY-------RVPLGVTAGITPFNFPA-MIPLWMFPVAIACGNTHVIKPSER 203
              KD +           R P+GV   ITP+N+P   I L + P A+A G T V+KPSE 
Sbjct: 110 DALKDFEFEERRGNSLVVREPIGVCGLITPWNWPLNQIVLKVAP-ALAAGCTVVLKPSEV 168

Query: 204 DPGACMMLVDMLTQAGCPPGVVNVIHGAHDAV-NFICDHPDIRAISFVGSDQAGKYIYER 262
            P + ++L ++L +AG P GV N+++G    V   +  HPD+  +SF GS +AGK + E 
Sbjct: 169 APLSAIILAEILDEAGLPAGVFNLVNGDGPVVGEALSAHPDVDMVSFTGSTRAGKRVAEA 228

Query: 263 GAKNGKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFV-----G 317
            A   KRV   +G K+  +I+ DA+    + + V A F  +GQ C A  T + V      
Sbjct: 229 AADTVKRVALELGGKSANIILDDADLEKAVPRGVAACFANSGQSCNAP-TRMLVPRSRYA 287

Query: 318 EAVEWLPELKKRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDG 375
           EA E        AEA  V     P T LGP+ S    +R+   IQ G++EGA L+  G
Sbjct: 288 EAEE---IAAAAAEAYVVGDPRDPATTLGPLASAAQFDRVQGYIQKGIEEGARLVAGG 342


>gnl|CDD|143437 cd07119, ALDH_BADH-GbsA, Bacillus subtilis NAD+-dependent betaine
           aldehyde dehydrogenase-like.  Included in this CD is the
           NAD+-dependent, betaine aldehyde dehydrogenase (BADH,
           GbsA, EC=1.2.1.8) of Bacillus subtilis involved in the
           synthesis of the osmoprotectant glycine betaine from
           choline or glycine betaine aldehyde.
          Length = 482

 Score =  234 bits (599), Expect = 5e-72
 Identities = 116/349 (33%), Positives = 194/349 (55%), Gaps = 7/349 (2%)

Query: 33  YIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYET--WSKTSIIT 90
           YIDG++VE+ +    D+ NPA  EV+  VP+ T  + + A+A+A++A+++  W       
Sbjct: 1   YIDGEWVEAASGKTRDIINPANGEVIATVPEGTAEDAKRAIAAARRAFDSGEWPHLPAQE 60

Query: 91  RQQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGETL 150
           R  +LFR+ + I+ + ++LA   T   GKTL ++E D+            + T   GE  
Sbjct: 61  RAALLFRIADKIREDAEELARLETLNTGKTLRESEIDIDDVANCFRYYAGLATKETGEVY 120

Query: 151 AGVAKDMDIHSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMM 210
             V   +   + R P+GV   ITP+N+P +   W    A+A GNT VIKPSE  P   + 
Sbjct: 121 D-VPPHVISRTVREPVGVCGLITPWNYPLLQAAWKLAPALAAGNTVVIKPSEVTPLTTIA 179

Query: 211 LVDMLTQAGCPPGVVNVIHGAHDAV-NFICDHPDIRAISFVGSDQAGKYIYERGAKNGKR 269
           L +++ +AG P GVVN++ G+   V   + + PD+  +SF G    G+ I    A N K+
Sbjct: 180 LFELIEEAGLPAGVVNLVTGSGATVGAELAESPDVDLVSFTGGTATGRSIMRAAAGNVKK 239

Query: 270 VQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELK 327
           V   +G KN  ++ +DA+    ++Q +   F  AGQ C A S ++ V E++  +++  L 
Sbjct: 240 VALELGGKNPNIVFADADFETAVDQALNGVFFNAGQVCSAGSRLL-VEESIHDKFVAALA 298

Query: 328 KRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGR 376
           +RA+ +K+  G    T++GP++S + +E++LS IQ G +EGA L+  G+
Sbjct: 299 ERAKKIKLGNGLDADTEMGPLVSAEHREKVLSYIQLGKEEGARLVCGGK 347


>gnl|CDD|143410 cd07091, ALDH_F1-2_Ald2-like, ALDH subfamily: ALDH families 1and 2,
           including 10-formyltetrahydrofolate dehydrogenase,
           NAD+-dependent retinal dehydrogenase 1 and related
           proteins.  ALDH subfamily which includes the
           NAD+-dependent retinal dehydrogenase 1 (RALDH 1, ALDH1,
           EC=1.2.1.36), also known as aldehyde dehydrogenase
           family 1 member A1 (ALDH1A1), in humans, a
           homotetrameric, cytosolic enzyme that catalyzes the
           oxidation of retinaldehyde to retinoic acid. Human
           ALDH1B1 and ALDH2 are also in this cluster; both are
           mitochrondrial homotetramers which play important roles
           in acetaldehyde oxidation; ALDH1B1 in response to UV
           light exposure and ALDH2 during ethanol metabolism.
           10-formyltetrahydrofolate dehydrogenase (FTHFDH,
           EC=1.5.1.6), also known as aldehyde dehydrogenase family
           1 member L1 (ALDH1L1), in humans, a multi-domain
           homotetramer with an N-terminal formyl transferase
           domain and a C-terminal ALDH domain. FTHFDH catalyzes an
           NADP+-dependent dehydrogenase reaction resulting in the
           conversion of 10-formyltetrahydrofolate to
           tetrahydrofolate and CO2. Also included in this
           subfamily is the Arabidosis aldehyde dehydrogenase
           family 2 members B4 and B7 (EC=1.2.1.3), which are
           mitochondrial, homotetramers that oxidize acetaldehyde
           and glycolaldehyde, as well as, the Arabidosis
           cytosolic, homotetramer ALDH2C4 (EC=1.2.1.3), an enzyme
           involved in the oxidation of sinapalehyde and
           coniferaldehyde. Also included is the AldA aldehyde
           dehydrogenase  of Aspergillus nidulans (locus AN0554),
           the aldehyde dehydrogenase 2 (YMR170c, ALD5, EC=1.2.1.5)
           of Saccharomyces cerevisiae, and other similar
           sequences.
          Length = 476

 Score =  230 bits (589), Expect = 1e-70
 Identities = 123/355 (34%), Positives = 192/355 (54%), Gaps = 13/355 (3%)

Query: 30  TKLYIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYETWS--KTS 87
           T L+I+ +FV+S +       NPAT EV+ +V +A   ++++AV +A+ A+ET    K  
Sbjct: 4   TGLFINNEFVDSVSGKTFPTINPATEEVICQVAEADEEDVDAAVKAARAAFETGWWRKMD 63

Query: 88  IITRQQVLFRLQNIIKANMKQLAENITEEQGKTL-ADAEGDVLRGLQVVEQCCSIPTIMQ 146
              R ++L +L ++I+ +  +LA   + + GK L   A+GDV   ++ +         +Q
Sbjct: 64  PRERGRLLNKLADLIERDRDELAALESLDNGKPLEESAKGDVALSIKCLRYYAGWADKIQ 123

Query: 147 GETLAGVAKDMDIHSY--RVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERD 204
           G+T   +  D +  +Y  R P+GV   I P+NFP ++  W    A+A GNT V+KP+E+ 
Sbjct: 124 GKT---IPIDGNFLAYTRREPIGVCGQIIPWNFPLLMLAWKLAPALAAGNTVVLKPAEQT 180

Query: 205 PGACMMLVDMLTQAGCPPGVVNVIHG-AHDAVNFICDHPDIRAISFVGSDQAGKYIYERG 263
           P + + L +++ +AG PPGVVN++ G    A   I  H D+  I+F GS   G+ I E  
Sbjct: 181 PLSALYLAELIKEAGFPPGVVNIVPGFGPTAGAAISSHMDVDKIAFTGSTAVGRTIMEAA 240

Query: 264 AK-NGKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV-- 320
           AK N K+V   +G K+  ++  DA+    +       F   GQ C A S  IFV E++  
Sbjct: 241 AKSNLKKVTLELGGKSPNIVFDDADLDKAVEWAAFGIFFNQGQCCCAGSR-IFVQESIYD 299

Query: 321 EWLPELKKRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDG 375
           E++ + K RAE   V     P T  GP +S    ++ILS I+SG  EGATLL  G
Sbjct: 300 EFVEKFKARAEKRVVGDPFDPDTFQGPQVSKAQFDKILSYIESGKKEGATLLTGG 354


>gnl|CDD|143442 cd07124, ALDH_PutA-P5CDH-RocA, Delta(1)-pyrroline-5-carboxylate
           dehydrogenase, RocA.  Delta(1)-pyrroline-5-carboxylate
           dehydrogenase (EC=1.5.1.12 ), RocA: a proline catabolic
           enzyme of the aldehyde dehydrogenase (ALDH) protein
           superfamily. The proline catabolic enzymes, proline
           dehydrogenase and Delta(1)-pyrroline-5-carboxylate
           dehydrogenase (P5CDH), catalyze the two-step oxidation
           of proline to glutamate; P5CDH catalyzes the oxidation
           of glutamate semialdehyde, utilizing NAD+ as the
           electron acceptor. In some bacteria, the two enzymes are
           fused into the bifunctional flavoenzyme, proline
           utilization A (PutA). In this CD, monofunctional enzyme
           sequences such as seen in the Bacillus subtilis RocA
           P5CDH are also present. These enzymes play important
           roles in cellular redox control, superoxide generation,
           and apoptosis.
          Length = 512

 Score =  231 bits (591), Expect = 2e-70
 Identities = 137/355 (38%), Positives = 196/355 (55%), Gaps = 15/355 (4%)

Query: 32  LYIDGKFVESKATDWIDLHNPA-TNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIIT 90
           L I GK V ++    I+  NPA  +EV+  V KAT  E E+AV +A+ A+ TW +T    
Sbjct: 35  LVIGGKEVRTEEK--IESRNPADPSEVLGTVQKATKEEAEAAVQAARAAFPTWRRTPPEE 92

Query: 91  RQQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGETL 150
           R ++L R   +++    +LA  +  E GK  A+A+ DV   +  +E        ++G  +
Sbjct: 93  RARLLLRAAALLRRRRFELAAWMVLEVGKNWAEADADVAEAIDFLEYYAREMLRLRGFPV 152

Query: 151 AGVAKDMDIHSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMM 210
             V  + + + YR PLGV A I+P+NFP  I   M   A+  GNT V+KP+E  P     
Sbjct: 153 EMVPGEDNRYVYR-PLGVGAVISPWNFPLAILAGMTTAALVTGNTVVLKPAEDTPVIAAK 211

Query: 211 LVDMLTQAGCPPGVVNVIHGAHDAV-NFICDHPDIRAISFVGSDQAGKYIYER------G 263
           LV++L +AG PPGVVN + G  + V +++ +HPD+R I+F GS + G  IYER      G
Sbjct: 212 LVEILEEAGLPPGVVNFLPGPGEEVGDYLVEHPDVRFIAFTGSREVGLRIYERAAKVQPG 271

Query: 264 AKNGKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--E 321
            K  KRV + MG KN  ++  DA+       +V +AFG  GQ+C A S VI V E+V  E
Sbjct: 272 QKWLKRVIAEMGGKNAIIVDEDADLDEAAEGIVRSAFGFQGQKCSACSRVI-VHESVYDE 330

Query: 322 WLPELKKRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGR 376
           +L  L +R +ALKV     P   +GPVI   A++RI   I+ G  EG  LLL G 
Sbjct: 331 FLERLVERTKALKVGDPEDPEVYMGPVIDKGARDRIRRYIEIGKSEG-RLLLGGE 384


>gnl|CDD|143432 cd07114, ALDH_DhaS, Uncharacterized Candidatus pelagibacter
           aldehyde dehydrogenase, DhaS-like.  Uncharacterized
           aldehyde dehydrogenase from Candidatus pelagibacter
           (DhaS) and other related sequences are present in this
           CD.
          Length = 457

 Score =  229 bits (586), Expect = 3e-70
 Identities = 108/332 (32%), Positives = 176/332 (53%), Gaps = 6/332 (1%)

Query: 50  HNPATNEVVTRVPKATPAEMESAVASAKKAYET--WSKTSIITRQQVLFRLQNIIKANMK 107
            NPAT E   RVP+A+ A+++ AVA+A+ A+E   W K +   R ++L RL ++I+AN +
Sbjct: 2   INPATGEPWARVPEASAADVDRAVAAARAAFEGGAWRKLTPTERGKLLRRLADLIEANAE 61

Query: 108 QLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGETLAGVAKDMDIHSYRVPLG 167
           +LAE  T + GK + +    V    +       +   ++G  +     D    + R PLG
Sbjct: 62  ELAELETRDNGKLIRETRAQVRYLAEWYRYYAGLADKIEGAVIPVDKGDYLNFTRREPLG 121

Query: 168 VTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAGCPPGVVNV 227
           V A ITP+N P ++       A+A GNT V+KPSE  P + + L  +  +AG PPGVVNV
Sbjct: 122 VVAAITPWNSPLLLLAKKLAPALAAGNTVVLKPSEHTPASTLELAKLAEEAGFPPGVVNV 181

Query: 228 IHG-AHDAVNFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNHGVIMSDA 286
           + G   +    + +HP +  I+F G  + G++I    A+N   V   +G K+  ++  DA
Sbjct: 182 VTGFGPETGEALVEHPLVAKIAFTGGTETGRHIARAAAENLAPVTLELGGKSPNIVFDDA 241

Query: 287 NRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELKKRAEALKVSAGNVPGTD 344
           +    +N +V   F AAGQ C+A S  + V  ++  E++  L  RA A++V     P T 
Sbjct: 242 DLDAAVNGVVAGIFAAAGQTCVAGSR-LLVQRSIYDEFVERLVARARAIRVGDPLDPETQ 300

Query: 345 LGPVISPQAKERILSLIQSGVDEGATLLLDGR 376
           +GP+ + +  E++   +    +EGA +L  G 
Sbjct: 301 MGPLATERQLEKVERYVARAREEGARVLTGGE 332


>gnl|CDD|143395 cd06534, ALDH-SF, NAD(P)+-dependent aldehyde dehydrogenase
           superfamily.  The aldehyde dehydrogenase superfamily
           (ALDH-SF) of  NAD(P)+-dependent enzymes, in general,
           oxidize a wide range of  endogenous and exogenous
           aliphatic and aromatic aldehydes to their corresponding
           carboxylic acids and play an  important role in
           detoxification. Besides aldehyde detoxification, many
           ALDH isozymes possess multiple additional catalytic and
           non-catalytic functions such as participating in
           metabolic pathways, or as binding proteins, or
           osmoregulants, to mention a few. The enzyme has three
           domains, a NAD(P)+ cofactor-binding domain, a catalytic
           domain, and a bridging domain; and the active enzyme is
           generally either homodimeric or homotetrameric. The
           catalytic mechanism is proposed to involve cofactor
           binding, resulting in a conformational change and
           activation of an invariant catalytic cysteine
           nucleophile. The cysteine and aldehyde substrate form an
           oxyanion thiohemiacetal intermediate resulting in
           hydride transfer to the cofactor and formation of a
           thioacylenzyme intermediate. Hydrolysis of the
           thioacylenzyme and release of the carboxylic acid
           product occurs, and in most cases, the reduced cofactor
           dissociates from the enzyme. The evolutionary
           phylogenetic tree of ALDHs appears to have an initial
           bifurcation between what has been characterized as the
           classical aldehyde dehydrogenases, the ALDH family
           (ALDH) and extended family members or aldehyde
           dehydrogenase-like (ALDH-L) proteins. The ALDH proteins
           are represented by enzymes which share a number of
           highly conserved residues necessary for catalysis and
           cofactor binding and they include such proteins as
           retinal dehydrogenase, 10-formyltetrahydrofolate
           dehydrogenase, non-phosphorylating glyceraldehyde
           3-phosphate dehydrogenase,
           delta(1)-pyrroline-5-carboxylate dehydrogenases,
           alpha-ketoglutaric semialdehyde dehydrogenase,
           alpha-aminoadipic semialdehyde dehydrogenase, coniferyl
           aldehyde dehydrogenase and succinate-semialdehyde
           dehydrogenase.  Included in this larger group are all
           human, Arabidopsis, Tortula, fungal, protozoan, and
           Drosophila ALDHs identified in families ALDH1 through
           ALDH22 with the exception of families ALDH18, ALDH19,
           and ALDH20 which are present in the ALDH-like group. The
           ALDH-like group is represented by such proteins as
           gamma-glutamyl phosphate reductase, LuxC-like acyl-CoA
           reductase, and coenzyme A acylating aldehyde
           dehydrogenase. All of these proteins have a conserved
           cysteine that aligns with the catalytic cysteine of the
           ALDH group.
          Length = 367

 Score =  222 bits (567), Expect = 1e-68
 Identities = 112/376 (29%), Positives = 169/376 (44%), Gaps = 35/376 (9%)

Query: 74  ASAKKAYETWSKTSIITRQQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVLRGLQ 133
           A+A+ A++ W+      R  +L ++ ++++   ++LA   T E GK + +A G+V R + 
Sbjct: 1   AAARAAFKAWAALPPAERAAILRKIADLLEERREELAALETLETGKPIEEALGEVARAID 60

Query: 134 VVEQCCSIPTIMQGETLAGVAKDMDIHSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACG 193
                  +   + G  L       + +  R PLGV   ITP+NFP ++  W    A+A G
Sbjct: 61  TFRYAAGLADKLGGPELPSPDPGGEAYVRREPLGVVGVITPWNFPLLLAAWKLAPALAAG 120

Query: 194 NTHVIKPSERDPGACMMLVDMLTQAGCPPGVVNVIHGAHDAV-NFICDHPDIRAISFVGS 252
           NT V+KPSE  P   + L ++L +AG PPGVVNV+ G  D V   +  HP +  ISF GS
Sbjct: 121 NTVVLKPSELTPLTALALAELLQEAGLPPGVVNVVPGGGDEVGAALLSHPRVDKISFTGS 180

Query: 253 DQAGKYIYERGAKNGKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALST 312
              GK I +  A+N K V   +G K+  ++  DA+    +   V  AF  AGQ C A S 
Sbjct: 181 TAVGKAIMKAAAENLKPVTLELGGKSPVIVDEDADLDAAVEGAVFGAFFNAGQICTAAS- 239

Query: 313 VIFVGEAVEWLPELKKRAEALKVSAGNVPGTDL--------GPVIS-------PQAKERI 357
            + V E++    E  ++   + V   +V   D+        GPV+         +A    
Sbjct: 240 RLLVHESI--YDEFVEKLVTVLV---DVD-PDMPIAQEEIFGPVLPVIRFKDEEEAIALA 293

Query: 358 LSLI---QSGV---DEGATLLLDGRNLKVG---INVPIPVPLSMFSFTGSRGSFLGENHF 408
                   +GV   D    L +    L+ G   IN           F G + S +G    
Sbjct: 294 NDTEYGLTAGVFTRDLNRALRV-AERLRAGTVYINDSSIGVGPEAPFGGVKNSGIGREG- 351

Query: 409 YGKQGFYFYTETKTVT 424
            G  G   YT TKTV 
Sbjct: 352 -GPYGLEEYTRTKTVV 366


>gnl|CDD|143467 cd07149, ALDH_y4uC, Uncharacterized ALDH (y4uC) with similarity to
           Tortula ruralis aldehyde dehydrogenase ALDH21A1.
           Uncharacterized aldehyde dehydrogenase (ORF name y4uC)
           with sequence similarity to the moss Tortula ruralis
           aldehyde dehydrogenase ALDH21A1 (RNP123) believed to
           play an important role in the detoxification of
           aldehydes generated in response to desiccation- and
           salinity-stress, and similar sequences are included in
           this CD.
          Length = 453

 Score =  217 bits (556), Expect = 6e-66
 Identities = 104/338 (30%), Positives = 171/338 (50%), Gaps = 12/338 (3%)

Query: 47  IDLHNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANM 106
           I++ +P   EV+ RVP A+  ++E A+A+AK+  +         R ++L R   +++   
Sbjct: 1   IEVISPYDGEVIGRVPVASEEDVEKAIAAAKEGAKEMKSLPAYERAEILERAAQLLEERR 60

Query: 107 KQLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGETL-----AGVAKDMDIHS 161
           ++ A  I  E GK + DA  +V R ++ +         + GET+      G    +   +
Sbjct: 61  EEFARTIALEAGKPIKDARKEVDRAIETLRLSAEEAKRLAGETIPFDASPGGEGRIG-FT 119

Query: 162 YRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAGCP 221
            R P+GV A ITPFNFP  +       AIA GN  V+KP+ + P + + L ++L +AG P
Sbjct: 120 IREPIGVVAAITPFNFPLNLVAHKVGPAIAAGNAVVLKPASQTPLSALKLAELLLEAGLP 179

Query: 222 PGVVNVIHGAHDAV-NFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNHG 280
            G +NV+ G+ + V + +   P +R ISF GS   G+ I  +     K+V   +G+    
Sbjct: 180 KGALNVVTGSGETVGDALVTDPRVRMISFTGSPAVGEAIARKAGL--KKVTLELGSNAAV 237

Query: 281 VIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELKKRAEALKVSAG 338
           ++ +DA+    + + V  AF  AGQ C+++   IFV E +  E+L       + L V   
Sbjct: 238 IVDADADLEKAVERCVSGAFANAGQVCISVQR-IFVHEDIYDEFLERFVAATKKLVVGDP 296

Query: 339 NVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGR 376
               TD+GP+IS    ERI   ++  V+ GA LL  G+
Sbjct: 297 LDEDTDVGPMISEAEAERIEEWVEEAVEGGARLLTGGK 334


>gnl|CDD|143468 cd07150, ALDH_VaniDH_like, Pseudomonas putida vanillin
           dehydrogenase-like.  Vanillin dehydrogenase (Vdh,
           VaniDH) involved in the metabolism of ferulic acid and
           other related  sequences are included in this CD.  The
           E. coli vanillin dehydrogenase (LigV) preferred NAD+ to
           NADP+  and exhibited a broad substrate preference,
           including vanillin,  benzaldehyde, protocatechualdehyde,
           m-anisaldehyde, and p-hydroxybenzaldehyde.
          Length = 451

 Score =  213 bits (544), Expect = 3e-64
 Identities = 113/336 (33%), Positives = 178/336 (52%), Gaps = 12/336 (3%)

Query: 48  DLHNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANMK 107
           D  NPA   V  RV   +  + E A+A+A  A+  W+ T+   R+++L +   I++    
Sbjct: 2   DDLNPADGSVYARVAVGSRQDAERAIAAAYDAFPAWAATTPSERERILLKAAEIMERRAD 61

Query: 108 QLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGETLAGVAKDMDIHSYRVPLG 167
            L + + +E G T   A  +     +++         ++GETL   +      S R PLG
Sbjct: 62  DLIDLLIDEGGSTYGKAWFETTFTPELLRAAAGECRRVRGETLPSDSPGTVSMSVRRPLG 121

Query: 168 VTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAGCPPGVVNV 227
           V AGITPFN+P ++       A+A GNT V+KPSE  P   + + +++ +AG P GV NV
Sbjct: 122 VVAGITPFNYPLILATKKVAFALAAGNTVVLKPSEETPVIGLKIAEIMEEAGLPKGVFNV 181

Query: 228 IHGAHDAV-NFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNHGVIMSDA 286
           + G    V + + D P +R ++F GS   G+ I E+  ++ K++   +G KN  ++++DA
Sbjct: 182 VTGGGAEVGDELVDDPRVRMVTFTGSTAVGREIAEKAGRHLKKITLELGGKNPLIVLADA 241

Query: 287 NRSNTLNQLVG-AAFGA---AGQRCMALSTVIFVGEAV--EWLPELKKRAEALKVSAGNV 340
           +    L+  V  AAFGA    GQ CM+ S +I V E V  E++ +   RA  LKV     
Sbjct: 242 D----LDYAVRAAAFGAFMHQGQICMSASRII-VEEPVYDEFVKKFVARASKLKVGDPRD 296

Query: 341 PGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGR 376
           P T +GP+ISP+  ERI   ++  V +GA LL  G+
Sbjct: 297 PDTVIGPLISPRQVERIKRQVEDAVAKGAKLLTGGK 332


>gnl|CDD|143457 cd07139, ALDH_AldA-Rv0768, Mycobacterium tuberculosis aldehyde
           dehydrogenase  AldA-like.  The Mycobacterium
           tuberculosis NAD+-dependent, aldehyde dehydrogenase  PDB
           structure,  3B4W, and the Mycobacterium tuberculosis
           H37Rv aldehyde dehydrogenase  AldA (locus Rv0768)
           sequence, as well as the Rhodococcus rhodochrous ALDH
           involved in haloalkane catabolism, and other similar
           sequences, are included in this CD.
          Length = 471

 Score =  212 bits (542), Expect = 1e-63
 Identities = 115/364 (31%), Positives = 177/364 (48%), Gaps = 36/364 (9%)

Query: 32  LYIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYE--TWSKTSII 89
           L+I G++V    ++ ID+ +PAT EVV RVP+ATPA++++AVA+A++A++   W + S  
Sbjct: 1   LFIGGRWVAPSGSETIDVVSPATEEVVGRVPEATPADVDAAVAAARRAFDNGPWPRLSPA 60

Query: 90  TRQQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGET 149
            R  VL RL + ++A   +LA   T E G  ++       R  Q        P       
Sbjct: 61  ERAAVLRRLADALEARADELARLWTAENGMPIS-----WSRRAQG-----PGPA-ALLRY 109

Query: 150 LAGVAKDM------------DIHSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHV 197
            A +A+D              +   R P+GV A I P+N P  +       A+A G T V
Sbjct: 110 YAALARDFPFEERRPGSGGGHVLVRREPVGVVAAIVPWNAPLFLAALKIAPALAAGCTVV 169

Query: 198 IKPSERDPGACMMLVDMLTQAGCPPGVVNVIHGAHDAVNFICDHPDIRAISFVGSDQAGK 257
           +KPS   P    +L +   +AG PPGVVNV+    +   ++  HP +  +SF GS  AG+
Sbjct: 170 LKPSPETPLDAYLLAEAAEEAGLPPGVVNVVPADREVGEYLVRHPGVDKVSFTGSTAAGR 229

Query: 258 YIYERGAKNGKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVG 317
            I     +   RV   +G K+  +++ DA+    +  LV A+    GQ C+AL+ ++   
Sbjct: 230 RIAAVCGERLARVTLELGGKSAAIVLDDADLDAAVPGLVPASLMNNGQVCVALTRIL--- 286

Query: 318 EAV------EWLPELKKRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATL 371
             V      E +  L     ALKV     P T +GP+ S + +ER+   I  G  EGA L
Sbjct: 287 --VPRSRYDEVVEALAAAVAALKVGDPLDPATQIGPLASARQRERVEGYIAKGRAEGARL 344

Query: 372 LLDG 375
           +  G
Sbjct: 345 VTGG 348


>gnl|CDD|179543 PRK03137, PRK03137, 1-pyrroline-5-carboxylate dehydrogenase;
           Provisional.
          Length = 514

 Score =  213 bits (544), Expect = 1e-63
 Identities = 121/356 (33%), Positives = 180/356 (50%), Gaps = 32/356 (8%)

Query: 32  LYIDGKFVESKATDWIDLHNPA-TNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIIT 90
           L I G+ + ++    I   NPA  +EVV RV KAT    E A+ +A +A+ETW K S   
Sbjct: 39  LIIGGERITTEDK--IVSINPANKSEVVGRVSKATKELAEKAMQAALEAFETWKKWSPED 96

Query: 91  RQQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVLRGL--------QVVEQCCSIP 142
           R ++L R   II+    + +  + +E GK  A+A+ D    +        Q+++     P
Sbjct: 97  RARILLRAAAIIRRRKHEFSAWLVKEAGKPWAEADADTAEAIDFLEYYARQMLKLADGKP 156

Query: 143 TI-MQGETLAGVAKDMDIHSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPS 201
                GE             + +PLGV   I+P+NFP  I   M   AI  GNT ++KP+
Sbjct: 157 VESRPGE---------HNRYFYIPLGVGVVISPWNFPFAIMAGMTLAAIVAGNTVLLKPA 207

Query: 202 ERDPGACMMLVDMLTQAGCPPGVVNVIHGAHDAV-NFICDHPDIRAISFVGSDQAGKYIY 260
              P      V++L +AG P GVVN + G+   V +++ DHP  R I+F GS + G  IY
Sbjct: 208 SDTPVIAAKFVEVLEEAGLPAGVVNFVPGSGSEVGDYLVDHPKTRFITFTGSREVGLRIY 267

Query: 261 ERGAK--NG----KRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVI 314
           ER AK   G    KRV + MG K+  V+  DA+       +V +AFG +GQ+C A S  I
Sbjct: 268 ERAAKVQPGQIWLKRVIAEMGGKDAIVVDEDADLDLAAESIVASAFGFSGQKCSACSRAI 327

Query: 315 FVGEAV--EWLPELKKRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEG 368
            V E V  E L ++ +  + L V         +GPVI+  + ++I+S I+ G +EG
Sbjct: 328 -VHEDVYDEVLEKVVELTKELTVGNPEDN-AYMGPVINQASFDKIMSYIEIGKEEG 381


>gnl|CDD|215157 PLN02278, PLN02278, succinic semialdehyde dehydrogenase.
          Length = 498

 Score =  211 bits (538), Expect = 7e-63
 Identities = 116/363 (31%), Positives = 183/363 (50%), Gaps = 6/363 (1%)

Query: 25  RHIATTKLYIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYETWS 84
             +  T+  I GK+ ++       ++NPAT EV+  VP    AE   A+ASA  A+ +WS
Sbjct: 20  AGLLRTQGLIGGKWTDAYDGKTFPVYNPATGEVIANVPCMGRAETNDAIASAHDAFPSWS 79

Query: 85  KTSIITRQQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTI 144
           K +   R ++L R  ++I AN + LA+ +T EQGK L +A G+V  G   +E        
Sbjct: 80  KLTASERSKILRRWYDLIIANKEDLAQLMTLEQGKPLKEAIGEVAYGASFLEYFAEEAKR 139

Query: 145 MQGETLAGVAKDMDIHSYRVPLGVTAGITPFNFP-AMIPLWMFPVAIACGNTHVIKPSER 203
           + G+ +     D  +   + P+GV   ITP+NFP AMI   + P A+A G T V+KPSE 
Sbjct: 140 VYGDIIPSPFPDRRLLVLKQPVGVVGAITPWNFPLAMITRKVGP-ALAAGCTVVVKPSEL 198

Query: 204 DPGACMMLVDMLTQAGCPPGVVNVIHGAHDAV-NFICDHPDIRAISFVGSDQAGKYIYER 262
            P   +   ++  QAG PPGV+NV+ G    + + +   P +R I+F GS   GK +   
Sbjct: 199 TPLTALAAAELALQAGIPPGVLNVVMGDAPEIGDALLASPKVRKITFTGSTAVGKKLMAG 258

Query: 263 GAKNGKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV-- 320
            A   KRV   +G     ++  DA+    +   + + F  +GQ C+  +  I V E +  
Sbjct: 259 AAATVKRVSLELGGNAPFIVFDDADLDVAVKGALASKFRNSGQTCVC-ANRILVQEGIYD 317

Query: 321 EWLPELKKRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGRNLKV 380
           ++     K  + L V  G   G   GP+I+  A +++ S +Q  V +GA +LL G+   +
Sbjct: 318 KFAEAFSKAVQKLVVGDGFEEGVTQGPLINEAAVQKVESHVQDAVSKGAKVLLGGKRHSL 377

Query: 381 GIN 383
           G  
Sbjct: 378 GGT 380


>gnl|CDD|143422 cd07104, ALDH_BenzADH-like, ALDH subfamily: NAD(P)+-dependent
           benzaldehyde dehydrogenase II, vanillin dehydrogenase,
           p-hydroxybenzaldehyde dehydrogenase and related
           proteins.  ALDH subfamily which includes the
           NAD(P)+-dependent, benzaldehyde dehydrogenase II (XylC,
           BenzADH, EC=1.2.1.28)  involved in the oxidation of
           benzyl alcohol to benzoate; p-hydroxybenzaldehyde
           dehydrogenase (PchA, HBenzADH) which catalyzes the
           oxidation of p-hydroxybenzaldehyde to p-hydroxybenzoic
           acid; vanillin dehydrogenase (Vdh, VaniDH) involved in
           the metabolism of ferulic acid as seen in Pseudomonas
           putida KT2440; and other related sequences.
          Length = 431

 Score =  208 bits (533), Expect = 9e-63
 Identities = 99/318 (31%), Positives = 169/318 (53%), Gaps = 15/318 (4%)

Query: 68  EMESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGD 127
           +++ A A+A  A + W+ T    R  +L +   I++    ++A+ +  E G T   A  +
Sbjct: 1   DVDRAYAAAAAAQKAWAATPPQERAAILRKAAEILEERRDEIADWLIRESGSTRPKAAFE 60

Query: 128 VLRGLQVVEQCCSIPTIMQGETLAGVAKDMDIHSYRVPLGVTAGITPFNFPAMIPLWMFP 187
           V   + ++ +   +P   +GE L       +    RVPLGV   I+PFNFP ++ +    
Sbjct: 61  VGAAIAILREAAGLPRRPEGEILPSDVPGKESMVRRVPLGVVGVISPFNFPLILAMRSVA 120

Query: 188 VAIACGNTHVIKPSERDP--GACMMLVDMLTQAGCPPGVVNVIHGAHDAV-NFICDHPDI 244
            A+A GN  V+KP  R P  G  +++ ++  +AG P GV+NV+ G    + + + +HP +
Sbjct: 121 PALALGNAVVLKPDSRTPVTGG-LLIAEIFEEAGLPKGVLNVVPGGGSEIGDALVEHPRV 179

Query: 245 RAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNHGVIMSDANRSNTLNQLVG-AAFGA- 302
           R ISF GS   G++I E   ++ K+V   +G  N  +++ DA+    L+  V  AAFGA 
Sbjct: 180 RMISFTGSTAVGRHIGELAGRHLKKVALELGGNNPLIVLDDAD----LDLAVSAAAFGAF 235

Query: 303 --AGQRCMALSTVIFVGEAV--EWLPELKKRAEALKVSAGNVPGTDLGPVISPQAKERIL 358
              GQ CMA   ++ V E+V  E++ +L  +A+AL V     P T +GP+I+ +  +R+ 
Sbjct: 236 LHQGQICMAAGRIL-VHESVYDEFVEKLVAKAKALPVGDPRDPDTVIGPLINERQVDRVH 294

Query: 359 SLIQSGVDEGATLLLDGR 376
           ++++  V  GA LL  G 
Sbjct: 295 AIVEDAVAAGARLLTGGT 312


>gnl|CDD|143463 cd07145, ALDH_LactADH_F420-Bios, Methanocaldococcus jannaschii
           NAD+-dependent lactaldehyde dehydrogenase-like.
           NAD+-dependent, lactaldehyde dehydrogenase (EC=1.2.1.22)
           involved the biosynthesis of coenzyme F(420) in
           Methanocaldococcus jannaschii through the oxidation of
           lactaldehyde to lactate and generation of NAPH, and
           similar sequences are included in this CD.
          Length = 456

 Score =  209 bits (534), Expect = 9e-63
 Identities = 108/337 (32%), Positives = 178/337 (52%), Gaps = 8/337 (2%)

Query: 47  IDLHNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANM 106
           I++ NPA  EV+  VP  +  E+  A+  A+KA +  S      R ++L ++  +I+   
Sbjct: 1   IEVRNPANGEVIDTVPSLSREEVREAIEVAEKAKDVMSNLPAYKRYKILMKVAELIERRK 60

Query: 107 KQLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGETL----AGVAKDMDIHSY 162
           ++LA+ +T E GK +  +  +V R +++ +       +++GET+        +     + 
Sbjct: 61  EELAKLLTIEVGKPIKQSRVEVERTIRLFKLAAEEAKVLRGETIPVDAYEYNERRIAFTV 120

Query: 163 RVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAGCPP 222
           R P+GV   ITPFNFPA +       AIA GN+ V+KPS   P   + L  +L +AG PP
Sbjct: 121 REPIGVVGAITPFNFPANLFAHKIAPAIAVGNSVVVKPSSNTPLTAIELAKILEEAGLPP 180

Query: 223 GVVNVIHGAHDAV-NFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNHGV 281
           GV+NV+ G    V + I  +P +  ISF GS   G  I  +    GK+V   +G  +  +
Sbjct: 181 GVINVVTGYGSEVGDEIVTNPKVNMISFTGSTAVGLLIASKAGGTGKKVALELGGSDPMI 240

Query: 282 IMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELKKRAEALKVSAGN 339
           ++ DA+    ++  V   F  AGQ C A+  ++ V E V  ++L  L ++ + LKV    
Sbjct: 241 VLKDADLERAVSIAVRGRFENAGQVCNAVKRIL-VEEEVYDKFLKLLVEKVKKLKVGDPL 299

Query: 340 VPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGR 376
              TDLGP+ISP+A ER+ +L+   V++G  +L  G+
Sbjct: 300 DESTDLGPLISPEAVERMENLVNDAVEKGGKILYGGK 336


>gnl|CDD|131352 TIGR02299, HpaE, 5-carboxymethyl-2-hydroxymuconate semialdehyde
           dehydrogenase.  This model represents the dehydrogenase
           responsible for the conversion of
           5-carboxymethyl-2-hydroxymuconate semialdehyde to
           5-carboxymethyl-2-hydroxymuconate (a tricarboxylic
           acid). This is the step in the degradation of
           4-hydroxyphenylacetic acid via homoprotocatechuate
           following the oxidative opening of the aromatic ring.
          Length = 488

 Score =  207 bits (528), Expect = 1e-61
 Identities = 108/359 (30%), Positives = 189/359 (52%), Gaps = 19/359 (5%)

Query: 31  KLYIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIIT 90
             +IDG+FV S++ +  +  +PATNEV+  V +   A+++ A  +AK+A++ W++     
Sbjct: 2   GHFIDGEFVPSESGETFETLSPATNEVLGSVARGGAADVDRAAKAAKEAFKRWAELKAAE 61

Query: 91  RQQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVLRGLQ----VVEQCCSIPTIMQ 146
           R++ L ++ ++I+ +  ++A     + G+ L      V+R  +      ++C      M 
Sbjct: 62  RKRYLHKIADLIEQHADEIAVLECLDCGQPLRQTRQQVIRAAENFRFFADKC---EEAMD 118

Query: 147 GETLAGVAKDMDIHSY---RVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSER 203
           G T       +D H     RVP+G    ITP+N P M+  W    A+A GNT V+KP+E 
Sbjct: 119 GRTYP-----VDTHLNYTVRVPVGPVGLITPWNAPFMLSTWKIAPALAFGNTVVLKPAEW 173

Query: 204 DPGACMMLVDMLTQAGCPPGVVNVIHG-AHDAVNFICDHPDIRAISFVGSDQAGKYIYER 262
            P     L ++  +AG P GV N++HG   +A   +  HPD++A+SF G    G  I   
Sbjct: 174 SPLTAARLAEIAKEAGLPDGVFNLVHGFGEEAGKALVAHPDVKAVSFTGETATGSIIMRN 233

Query: 263 GAKNGKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV-- 320
           GA   KR    +G K+  ++  DA+    L+ +V   F   G+RC A S+ + V E++  
Sbjct: 234 GADTLKRFSMELGGKSPVIVFDDADLERALDAVVFMIFSFNGERCTA-SSRLLVQESIAE 292

Query: 321 EWLPELKKRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGRNLK 379
           +++ +L +R  A++V     P T++GP+I P+   ++L  +++   EGAT+L+ G    
Sbjct: 293 DFVEKLVERVRAIRVGHPLDPETEVGPLIHPEHLAKVLGYVEAAEKEGATILVGGERAP 351


>gnl|CDD|143431 cd07113, ALDH_PADH_NahF, Escherichia coli NAD+-dependent
           phenylacetaldehyde dehydrogenase PadA-like.
           NAD+-dependent, homodimeric, phenylacetaldehyde
           dehydrogenase (PADH, EC=1.2.1.39) PadA of Escherichia
           coli involved in the catabolism of 2-phenylethylamine,
           and other related sequences, are present in this CD.
           Also included is the Pseudomonas fluorescens ST StyD
           PADH involved in styrene catabolism, the Sphingomonas
           sp. LB126 FldD protein involved in fluorene degradation,
           and the Novosphingobium aromaticivorans NahF
           salicylaldehyde dehydrogenase involved in the
           NAD+-dependent conversion of salicylaldehyde to
           salicylate.
          Length = 477

 Score =  205 bits (523), Expect = 5e-61
 Identities = 112/363 (30%), Positives = 182/363 (50%), Gaps = 24/363 (6%)

Query: 32  LYIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYET-WSKTSIIT 90
            +IDG+ V  ++   +D+ NPAT +V+  V  AT A++++AVASA +A+ + W+KT+   
Sbjct: 2   HFIDGRPVAGQSEKRLDITNPATEQVIASVASATEADVDAAVASAWRAFVSAWAKTTPAE 61

Query: 91  RQQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVLRGLQVVEQCCSI--------P 142
           R ++L RL ++I+ + ++LA+  T   GK++      + R  +V     +          
Sbjct: 62  RGRILLRLADLIEQHGEELAQLETLCSGKSIH-----LSRAFEV--GQSANFLRYFAGWA 114

Query: 143 TIMQGETLAGVAKDMDIHSY-----RVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHV 197
           T + GETLA     M    Y     R P+GV AGI P+NF  MI +W    A+A G T V
Sbjct: 115 TKINGETLAPSIPSMQGERYTAFTRREPVGVVAGIVPWNFSVMIAVWKIGAALATGCTIV 174

Query: 198 IKPSERDPGACMMLVDMLTQAGCPPGVVNVIHGAHDAVNFICDHPDIRAISFVGSDQAGK 257
           IKPSE  P   + + ++  +AG P GV+NV++G       +  HPD+  +SF GS   GK
Sbjct: 175 IKPSEFTPLTLLRVAELAKEAGIPDGVLNVVNGKGAVGAQLISHPDVAKVSFTGSVATGK 234

Query: 258 YIYERGAKNGKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVG 317
            I  + A +  RV   +G KN    + DA+    +  L+ A F   GQ C A     +V 
Sbjct: 235 KIGRQAASDLTRVTLELGGKNAAAFLKDADIDWVVEGLLTAGFLHQGQVC-AAPERFYVH 293

Query: 318 EAV--EWLPELKKRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDG 375
            +   E + +LK+   + +V +        GP+ +    +++ S +     EG  ++  G
Sbjct: 294 RSKFDELVTKLKQALSSFQVGSPMDESVMFGPLANQPHFDKVCSYLDDARAEGDEIVRGG 353

Query: 376 RNL 378
             L
Sbjct: 354 EAL 356


>gnl|CDD|143401 cd07082, ALDH_F11_NP-GAPDH, NADP+-dependent non-phosphorylating
           glyceraldehyde 3-phosphate dehydrogenase and ALDH family
           11.  NADP+-dependent non-phosphorylating glyceraldehyde
           3-phosphate dehydrogenase (NP-GAPDH, EC=1.2.1.9)
           catalyzes the irreversible oxidation of glyceraldehyde
           3-phosphate to 3-phosphoglycerate generating NADPH for
           biosynthetic reactions.  This CD also includes the
           Arabidopsis thaliana osmotic-stress-inducible ALDH
           family 11, ALDH11A3  and similar sequences. In
           autotrophic eukaryotes, NP-GAPDH generates NADPH for
           biosynthetic processes from photosynthetic
           glyceraldehyde-3-phosphate exported from the chloroplast
           and catalyzes one of the classic glycolytic bypass
           reactions unique to plants.
          Length = 473

 Score =  205 bits (523), Expect = 5e-61
 Identities = 107/367 (29%), Positives = 169/367 (46%), Gaps = 38/367 (10%)

Query: 31  KLYIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKT-SII 89
           K  I+G++ ES     I++++P   EV+  VP  +  E+  A  +A  A   W  T  + 
Sbjct: 3   KYLINGEWKESSG-KTIEVYSPIDGEVIGSVPALSALEILEAAETAYDAGRGWWPTMPLE 61

Query: 90  TRQQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGET 149
            R   L +  +++K N +++A  +  E GKTL DA  +V R +  +       TI + + 
Sbjct: 62  ERIDCLHKFADLLKENKEEVANLLMWEIGKTLKDALKEVDRTIDYIRD-----TIEELKR 116

Query: 150 LAGVAKDMDIHS---------YRVPLGVTAGITPFNFP------AMIPLWMFPVAIACGN 194
           L G +   D             R PLGV   I PFN+P       +IP      A+  GN
Sbjct: 117 LDGDSLPGDWFPGTKGKIAQVRREPLGVVLAIGPFNYPLNLTVSKLIP------ALIMGN 170

Query: 195 THVIKPSERDPGACMMLVDMLTQAGCPPGVVNVIHG-AHDAVNFICDHPDIRAISFVGSD 253
           T V KP+ +     + L +    AG P GVVNV+ G   +  + +  H  I  ISF GS 
Sbjct: 171 TVVFKPATQGVLLGIPLAEAFHDAGFPKGVVNVVTGRGREIGDPLVTHGRIDVISFTGST 230

Query: 254 QAGKYIYERGAKNGKRVQSNM--GAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALS 311
           + G  +     K     +  +  G K+  +++ DA+      ++V  A   +GQRC A+ 
Sbjct: 231 EVGNRL----KKQHPMKRLVLELGGKDPAIVLPDADLELAAKEIVKGALSYSGQRCTAIK 286

Query: 312 TVIFVGEAV--EWLPELKKRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGA 369
             + V E+V  E +  LK+    LKV      G D+ P+I P++ + +  LI   V +GA
Sbjct: 287 -RVLVHESVADELVELLKEEVAKLKVGMPWDNGVDITPLIDPKSADFVEGLIDDAVAKGA 345

Query: 370 TLLLDGR 376
           T+L  G 
Sbjct: 346 TVLNGGG 352


>gnl|CDD|143409 cd07090, ALDH_F9_TMBADH, NAD+-dependent
           4-trimethylaminobutyraldehyde dehydrogenase, ALDH family
           9A1.  NAD+-dependent, 4-trimethylaminobutyraldehyde
           dehydrogenase (TMABADH, EC=1.2.1.47), also known as
           aldehyde dehydrogenase family 9 member A1 (ALDH9A1) in
           humans, is a cytosolic tetramer which catalyzes the
           oxidation of gamma-aminobutyraldehyde involved in
           4-aminobutyric acid (GABA) biosynthesis  and also
           oxidizes betaine aldehyde
           (gamma-trimethylaminobutyraldehyde) which is involved in
           carnitine biosynthesis.
          Length = 457

 Score =  203 bits (519), Expect = 2e-60
 Identities = 107/336 (31%), Positives = 171/336 (50%), Gaps = 12/336 (3%)

Query: 51  NPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANMKQLA 110
            PAT EV+  V  A   +++ AV SAK A + WS TS + R ++L +  ++++    ++A
Sbjct: 3   EPATGEVLATVHCAGAEDVDLAVKSAKAAQKEWSATSGMERGRILRKAADLLRERNDEIA 62

Query: 111 ENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGETLAGVAKDMDIHSY----RVPL 166
              T + GK + +A  D+      +E    +   + GE +      +   S+    R PL
Sbjct: 63  RLETIDNGKPIEEARVDIDSSADCLEYYAGLAPTLSGEHV-----PLPGGSFAYTRREPL 117

Query: 167 GVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAGCPPGVVN 226
           GV AGI  +N+P  I  W    A+ACGN  V KPS   P   ++L ++LT+AG P GV N
Sbjct: 118 GVCAGIGAWNYPIQIASWKSAPALACGNAMVYKPSPFTPLTALLLAEILTEAGLPDGVFN 177

Query: 227 VIHGAHDAVNFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNHGVIMSDA 286
           V+ G  +    +C+HPD+  +SF GS   GK +    AK  K V   +G K+  +I  DA
Sbjct: 178 VVQGGGETGQLLCEHPDVAKVSFTGSVPTGKKVMSAAAKGIKHVTLELGGKSPLIIFDDA 237

Query: 287 NRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELKKRAEALKVSAGNVPGTD 344
           +  N +N  + A F + GQ C    T +FV  ++  E+   L +R + +++       T 
Sbjct: 238 DLENAVNGAMMANFLSQGQVCSN-GTRVFVQRSIKDEFTERLVERTKKIRIGDPLDEDTQ 296

Query: 345 LGPVISPQAKERILSLIQSGVDEGATLLLDGRNLKV 380
           +G +IS +  E++L  I+S   EGA +L  G  +  
Sbjct: 297 MGALISEEHLEKVLGYIESAKQEGAKVLCGGERVVP 332


>gnl|CDD|143433 cd07115, ALDH_HMSADH_HapE, Pseudomonas fluorescens 4-hydroxymuconic
           semialdehyde dehydrogenase-like.  4-hydroxymuconic
           semialdehyde dehydrogenase (HapE, EC=1.2.1.61) of
           Pseudomonas fluorescens ACB involved in
           4-hydroxyacetophenone degradation, and putative
           hydroxycaproate semialdehyde dehydrogenase (ChnE) of
           Brachymonas petroleovorans involved in cyclohexane
           metabolism, and other similar sequences, are present in
           this CD.
          Length = 453

 Score =  203 bits (518), Expect = 2e-60
 Identities = 108/330 (32%), Positives = 174/330 (52%), Gaps = 6/330 (1%)

Query: 51  NPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANMKQLA 110
           NPAT E++ RV +A+  ++++AVA+A+ A+E WS      R ++L+RL  +I AN  +LA
Sbjct: 3   NPATGELIARVAQASAEDVDAAVAAARAAFEAWSAMDPAERGRILWRLAELILANADELA 62

Query: 111 ENITEEQGKTLADAEG-DVLRGLQVVEQCCSIPTIMQGETLAGVAKDMDIHSYRVPLGVT 169
              + + GK +  A   DV R              ++GE +  V      ++ R P+GV 
Sbjct: 63  RLESLDTGKPIRAARRLDVPRAADTFRYYAGWADKIEGEVIP-VRGPFLNYTVREPVGVV 121

Query: 170 AGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAGCPPGVVNVIH 229
             I P+NFP M   W    A+A GNT V+KP+E  P + + + +++ +AG P GV+NV+ 
Sbjct: 122 GAIVPWNFPLMFAAWKVAPALAAGNTVVLKPAELTPLSALRIAELMAEAGFPAGVLNVVT 181

Query: 230 G-AHDAVNFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNHGVIMSDANR 288
           G    A   + +HPD+  I+F GS   G+ I +  A N KRV   +G K+  ++ +DA+ 
Sbjct: 182 GFGEVAGAALVEHPDVDKITFTGSTAVGRKIMQGAAGNLKRVSLELGGKSANIVFADADL 241

Query: 289 SNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELKKRAEALKVSAGNVPGTDLG 346
              +       F   GQ C A S ++ V E++  E+L      A +L+      P T +G
Sbjct: 242 DAAVRAAATGIFYNQGQMCTAGSRLL-VHESIYDEFLERFTSLARSLRPGDPLDPKTQMG 300

Query: 347 PVISPQAKERILSLIQSGVDEGATLLLDGR 376
           P++S    +R+L  +  G +EGA LL  G+
Sbjct: 301 PLVSQAQFDRVLDYVDVGREEGARLLTGGK 330


>gnl|CDD|183918 PRK13252, PRK13252, betaine aldehyde dehydrogenase; Provisional.
          Length = 488

 Score =  202 bits (516), Expect = 1e-59
 Identities = 116/361 (32%), Positives = 191/361 (52%), Gaps = 13/361 (3%)

Query: 28  ATTKLYIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTS 87
               LYIDG +VE+ + +  ++ NPAT EV+  V  ATPA++E+AVASAK+  + W+  +
Sbjct: 5   PLQSLYIDGAYVEATSGETFEVINPATGEVLATVQAATPADVEAAVASAKQGQKIWAAMT 64

Query: 88  IITRQQVLFRLQNIIKANMKQLAENITEEQGKTLADAE-GDVLRGLQVVEQCCSIPTIMQ 146
            + R ++L R  +I++    +LA   T + GK + +    D++ G  V+E    +   ++
Sbjct: 65  AMERSRILRRAVDILRERNDELAALETLDTGKPIQETSVVDIVTGADVLEYYAGLAPALE 124

Query: 147 GETLAGVAKDMDIHSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPG 206
           GE +        +++ R PLGV AGI  +N+P  I  W    A+A GN  + KPSE  P 
Sbjct: 125 GEQIPLRGGSF-VYTRREPLGVCAGIGAWNYPIQIACWKSAPALAAGNAMIFKPSEVTPL 183

Query: 207 ACMMLVDMLTQAGCPPGVVNVIHGAHDAVNFICDHPDIRAISFVGSDQAGKYIYERGAKN 266
             + L ++ T+AG P GV NV+ G      ++ +HPDI  +SF G    GK +    A +
Sbjct: 184 TALKLAEIYTEAGLPDGVFNVVQGDGRVGAWLTEHPDIAKVSFTGGVPTGKKVMAAAAAS 243

Query: 267 GKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAA----FGAAGQRCMALSTVIFVGEAV-- 320
            K V   +G K+  ++  DA+    L++    A    F ++GQ C    T +FV +++  
Sbjct: 244 LKEVTMELGGKSPLIVFDDAD----LDRAADIAMLANFYSSGQVCTN-GTRVFVQKSIKA 298

Query: 321 EWLPELKKRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGRNLKV 380
            +   L +R E +++     P T+ GP++S   ++++L  I+ G  EGA LL  G  L  
Sbjct: 299 AFEARLLERVERIRIGDPMDPATNFGPLVSFAHRDKVLGYIEKGKAEGARLLCGGERLTE 358

Query: 381 G 381
           G
Sbjct: 359 G 359


>gnl|CDD|200131 TIGR01804, BADH, glycine betaine aldehyde dehydrogenase.  Under
           osmotic stress, betaine aldehyde dehydrogenase oxidizes
           glycine betaine aldehyde into the osmoprotectant glycine
           betaine, via the second of two oxidation steps from
           exogenously supplied choline or betaine aldehyde. This
           choline-glycine betaine synthesis pathway can be found
           in gram-positive and gram-negative bacteria. In
           Escherichia coli, betaine aldehyde dehydrogenase (betB)
           is osmotically co-induced with choline dehydrogenase
           (betA) in the presence of choline. These dehydrogenases
           are located in a betaine gene cluster with the upstream
           choline transporter (betT) and transcriptional regulator
           (betI). Similar to E.coli, betaine synthesis in
           Staphylococcus xylosus is also influenced by osmotic
           stress and the presence of choline with genes localized
           in a functionally equivalent gene cluster. Organization
           of the betaine gene cluster in Sinorhizobium meliloti
           and Bacillus subtilis differs from that of E.coli by the
           absence of upstream choline transporter and
           transcriptional regulator homologues. Additionally,
           B.subtilis co-expresses a type II alcohol dehydrogenase
           with betaine aldehyde dehydrogenase instead of choline
           dehydrogenase as in E.coli, St.xylosus, and Si.meliloti.
           Betaine aldehyde dehydrogenase is a member of the
           aldehyde dehydrogenase family (pfam00171) [Cellular
           processes, Adaptations to atypical conditions].
          Length = 467

 Score =  200 bits (511), Expect = 3e-59
 Identities = 113/351 (32%), Positives = 185/351 (52%), Gaps = 6/351 (1%)

Query: 33  YIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQ 92
           +IDG++VE  A    ++ NPA  EV+  V  ATP ++E A+A+A++A   W+  S + R 
Sbjct: 1   FIDGEYVEDSAGTTREIINPANGEVIATVHAATPEDVERAIAAARRAQGEWAAMSPMERG 60

Query: 93  QVLFRLQNIIKANMKQLAENITEEQGKTLADAE-GDVLRGLQVVEQCCSIPTIMQGETLA 151
           ++L R  ++I+   ++LA+  T + GKTL +    D+  G  V E    +   + GE + 
Sbjct: 61  RILRRAADLIRERNEELAKLETLDTGKTLQETIVADMDSGADVFEFFAGLAPALNGEIIP 120

Query: 152 GVAKDMDIHSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMML 211
                   ++ R PLGV  GI  +N+P  I  W    A+A GN  V KPSE  P   + +
Sbjct: 121 LGGPS-FAYTIREPLGVCVGIGAWNYPLQIASWKIAPALAAGNAMVFKPSENTPLTALKV 179

Query: 212 VDMLTQAGCPPGVVNVIHGAHDAV-NFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRV 270
            +++ +AG P GV NV+ G    V   + +HPD+  +SF G    GK I    A + K V
Sbjct: 180 AEIMEEAGLPKGVFNVVQGDGAEVGPLLVNHPDVAKVSFTGGVPTGKKIMAAAAGHLKHV 239

Query: 271 QSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAVE--WLPELKK 328
              +G K+  ++  DA+  + ++  +   F +AGQ C    T +FV + ++  +L  L +
Sbjct: 240 TMELGGKSPLIVFDDADLESAVDGAMLGNFFSAGQVCSN-GTRVFVHKKIKERFLARLVE 298

Query: 329 RAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGRNLK 379
           R E +K+       T++GP+IS   ++++LS I+ G  EGATL   G   +
Sbjct: 299 RTERIKLGDPFDEATEMGPLISAAHRDKVLSYIEKGKAEGATLATGGGRPE 349


>gnl|CDD|188167 TIGR01780, SSADH, succinate-semialdehyde dehydrogenase.  Succinic
           semialdehyde dehydrogenase is one of three enzymes
           constituting 4-aminobutyrate (GABA) degradation in both
           prokaryotes and eukaryotes, catalyzing the
           (NAD(P)+)-dependent catabolism reaction of succinic
           semialdehyde to succinate for metabolism by the citric
           acid cycle. The EC number depends on the cofactor:
           1.2.1.24 for NAD only, 1.2.1.79 for NADP only, and
           1.2.1.16 if both can be used. In Escherichia coli,
           succinic semialdehyde dehydrogenase is located in an
           unidirectionally transcribed gene cluster encoding
           enzymes for GABA degradation and is suggested to be
           cotranscribed with succinic semialdehyde transaminase
           from a common promoter upstream of SSADH. Similar gene
           arrangements can be found in characterized Ralstonia
           eutropha and the genome analysis of Bacillus subtilis.
           Prokaryotic succinic semialdehyde dehydrogenases
           (1.2.1.16) share high sequence homology to characterized
           succinic semialdehyde dehydrogenases from rat and human
           (1.2.1.24), exhibiting conservation of proposed cofactor
           binding residues, and putative active sites (G-237 &
           G-242, C-293 & G-259 respectively of rat SSADH).
           Eukaryotic SSADH enzymes exclusively utilize NAD+ as a
           cofactor, exhibiting little to no NADP+ activity. While
           a NADP+ preference has been detected in prokaryotes in
           addition to both NADP+- and NAD+-dependencies as in
           E.coli, Pseudomonas, and Klebsiella pneumoniae. The
           function of this alternative SSADH currently is unknown,
           but has been suggested to play a possible role in
           4-hydroxyphenylacetic degradation. Just outside the
           scope of this model, are several sequences belonging to
           clades scoring between trusted and noise. These
           sequences may be actual SSADH enzymes, but lack
           sufficiently close characterized homologs to make a
           definitive assignment at this time. SSADH enzyme belongs
           to the aldehyde dehydrogenase family (pfam00171),
           sharing a common evolutionary origin and enzymatic
           mechanism with lactaldehyde dehydrogenase. Like in
           lactaldehyde dehydrogenase and succinate semialdehyde
           dehydrogenase, the mammalian catalytic glutamic acid and
           cysteine residues are conserved in all the enzymes of
           this family (PS00687, PS00070) [Central intermediary
           metabolism, Other].
          Length = 448

 Score =  197 bits (503), Expect = 3e-58
 Identities = 112/347 (32%), Positives = 180/347 (51%), Gaps = 9/347 (2%)

Query: 51  NPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANMKQLA 110
           NPAT E++  VP     E E+A+ +A +A++TW  T+   R  +L +  N++  N   LA
Sbjct: 3   NPATGEIIGSVPDQGVDETEAAIRAAYEAFKTWRATTAKERSSLLRKWYNLMMENKDDLA 62

Query: 111 ENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGETLAGVAKDMDIHSYRVPLGVTA 170
             IT E GK L +A+G++L     +E        + G+T+     D  +   + P+GV A
Sbjct: 63  RLITLENGKPLKEAKGEILYAASFLEWFAEEAKRVYGDTIPSPQSDKRLIVIKQPVGVCA 122

Query: 171 GITPFNFP-AMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAGCPPGVVNVIH 229
            ITP+NFP AMI       A+A G T V+KP+E+ P + + L  +  QAG P GV+NVI 
Sbjct: 123 AITPWNFPAAMITRKAGA-ALAAGCTVVVKPAEQTPLSALALARLAEQAGIPKGVLNVIT 181

Query: 230 GAHDAV---NFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNHGVIMSDA 286
           G   A    N +   P +R ISF GS   GK + ++ A   K+V   +G     ++  DA
Sbjct: 182 G-SRAKEVGNVLTTSPLVRKISFTGSTNVGKILMKQSASTVKKVSMELGGNAPFIVFDDA 240

Query: 287 NRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELKKRAEALKVSAGNVPGTD 344
           +    +   + + F  AGQ C+  +  ++V + +  E+  +L +  + LKV  G   G  
Sbjct: 241 DLDQAVEGAMASKFRNAGQTCVC-ANRLYVHDGIYDEFAKKLAEAVKKLKVGNGLDEGVT 299

Query: 345 LGPVISPQAKERILSLIQSGVDEGATLLLDGRNLKVGINVPIPVPLS 391
            GP+I+ +A E++   I   V++GA ++  G+  ++G N   P  LS
Sbjct: 300 QGPLINEKAVEKVEKHIADAVEKGAKVVTGGKRHELGGNFFEPTVLS 346


>gnl|CDD|143428 cd07110, ALDH_F10_BADH, Arabidopsis betaine aldehyde dehydrogenase
           1 and 2, ALDH family 10A8 and 10A9-like.  Present in
           this CD are the Arabidopsis betaine aldehyde
           dehydrogenase (BADH) 1 (chloroplast) and 2
           (mitochondria), also known as, aldehyde dehydrogenase
           family 10 member A8 and aldehyde dehydrogenase family 10
           member A9, respectively, and are putative dehydration-
           and salt-inducible BADHs (EC 1.2.1.8) that catalyze the
           oxidation of betaine aldehyde to the compatible solute
           glycine betaine.
          Length = 456

 Score =  197 bits (502), Expect = 4e-58
 Identities = 104/332 (31%), Positives = 164/332 (49%), Gaps = 7/332 (2%)

Query: 51  NPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANMKQLA 110
           NPAT   +  +P AT  ++++AV +A++A+  W KT+   R + L  +   ++   ++LA
Sbjct: 3   NPATEATIGEIPAATAEDVDAAVRAARRAFPRWKKTTGAERAKYLRAIAEGVRERREELA 62

Query: 111 ENITEEQGKTLADAE---GDVLRGLQVVEQCCSIPTIMQGETLAGVAKDMDIHSYRVPLG 167
           E    + GK L +A     DV    +                +   ++D      R P+G
Sbjct: 63  ELEARDNGKPLDEAAWDVDDVAGCFEYYADLAEQLDAKAERAVPLPSEDFKARVRREPVG 122

Query: 168 VTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAGCPPGVVNV 227
           V   ITP+NFP ++  W    A+A G T V+KPSE      + L ++  +AG PPGV+NV
Sbjct: 123 VVGLITPWNFPLLMAAWKVAPALAAGCTVVLKPSELTSLTELELAEIAAEAGLPPGVLNV 182

Query: 228 IHG-AHDAVNFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNHGVIMSDA 286
           + G   +A   +  HP I  ISF GS   G  + +  A++ K V   +G K+  ++  DA
Sbjct: 183 VTGTGDEAGAPLAAHPGIDKISFTGSTATGSQVMQAAAQDIKPVSLELGGKSPIIVFDDA 242

Query: 287 NRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELKKRAEALKVSAGNVPGTD 344
           +    +   +   F   GQ C A S ++ V E++   +L  L   AEA++V      G  
Sbjct: 243 DLEKAVEWAMFGCFWNNGQICSATSRLL-VHESIADAFLERLATAAEAIRVGDPLEEGVR 301

Query: 345 LGPVISPQAKERILSLIQSGVDEGATLLLDGR 376
           LGP++S    E++LS I  G +EGA LL  GR
Sbjct: 302 LGPLVSQAQYEKVLSFIARGKEEGARLLCGGR 333


>gnl|CDD|143427 cd07109, ALDH_AAS00426, Uncharacterized Saccharopolyspora spinosa
           aldehyde dehydrogenase (AAS00426)-like.  Uncharacterized
           aldehyde dehydrogenase of Saccharopolyspora spinosa
           (AAS00426) and other similar sequences, are present in
           this CD.
          Length = 454

 Score =  197 bits (502), Expect = 5e-58
 Identities = 101/332 (30%), Positives = 169/332 (50%), Gaps = 7/332 (2%)

Query: 49  LHNPATNEVVTRVPKATPAEMESAVASAKKAYET-WSKTSIITRQQVLFRLQNIIKANMK 107
           + +P+T EV  R+ +   A+++ AV +A++A+E+ W + S   R ++L R+  +I+ +  
Sbjct: 1   VFDPSTGEVFARIARGGAADVDRAVQAARRAFESGWLRLSPAERGRLLLRIARLIREHAD 60

Query: 108 QLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGETLAGVAKDMDIHSYRVPLG 167
           +LA   + + GK L  A  DV    +  E        + GET+  +     +++ R P G
Sbjct: 61  ELARLESLDTGKPLTQARADVEAAARYFEYYGGAADKLHGETIP-LGPGYFVYTVREPHG 119

Query: 168 VTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAGCPPGVVNV 227
           VT  I P+N+P  I       A+A GN  V+KP+E  P   + L ++  +AG P G +NV
Sbjct: 120 VTGHIIPWNYPLQITGRSVAPALAAGNAVVVKPAEDAPLTALRLAELAEEAGLPAGALNV 179

Query: 228 IHG-AHDAVNFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNHGVIMSDA 286
           + G   +A   +  HP +  ISF GS + G  +    A+N   V   +G K+  ++ +DA
Sbjct: 180 VTGLGAEAGAALVAHPGVDHISFTGSVETGIAVMRAAAENVVPVTLELGGKSPQIVFADA 239

Query: 287 NRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELKKRAEALKVSAGNVPGTD 344
           +    L  +V A    AGQ C A S ++ V  ++  E L  L +R  AL+V  G +   D
Sbjct: 240 DLEAALPVVVNAIIQNAGQTCSAGSRLL-VHRSIYDEVLERLVERFRALRVGPG-LEDPD 297

Query: 345 LGPVISPQAKERILSLIQSGVDEGATLLLDGR 376
           LGP+IS +  +R+   +      GA ++  GR
Sbjct: 298 LGPLISAKQLDRVEGFVARARARGARIVAGGR 329


>gnl|CDD|143462 cd07144, ALDH_ALD2-YMR170C, Saccharomyces cerevisiae aldehyde
           dehydrogenase 2 (YMR170c)-like.  NAD(P)+-dependent
           Saccharomyces cerevisiae aldehyde dehydrogenase 2
           (YMR170c, ALD5, EC=1.2.1.5) and other similar sequences,
           are present in this CD.
          Length = 484

 Score =  194 bits (495), Expect = 8e-57
 Identities = 110/355 (30%), Positives = 186/355 (52%), Gaps = 14/355 (3%)

Query: 30  TKLYIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYET-WSKTSI 88
           T L+I+ +FV+S   + I   NP+T EV+  V  A   +++ AV +A+KA+E+ WSK + 
Sbjct: 8   TGLFINNEFVKSSDGETIKTVNPSTGEVIASVYAAGEEDVDKAVKAARKAFESWWSKVTG 67

Query: 89  ITRQQVLFRLQNIIKANMKQLAENITEEQGKTL-ADAEGDVLRGLQVVEQCCSIPTIMQG 147
             R ++L +L ++++ N   LA     + GK   ++A GD+   + V+         +QG
Sbjct: 68  EERGELLDKLADLVEKNRDLLAAIEALDSGKPYHSNALGDLDEIIAVIRYYAGWADKIQG 127

Query: 148 ETLAGVAKDMDIHSY--RVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDP 205
           +T+          +Y    P GV   I P+N+P  +  W    A+A GNT VIKP+E  P
Sbjct: 128 KTIPTSPNK---LAYTLHEPYGVCGQIIPWNYPLAMAAWKLAPALAAGNTVVIKPAENTP 184

Query: 206 GACMMLVDMLTQAGCPPGVVNVIHGAHDAV--NFICDHPDIRAISFVGSDQAGKYIYERG 263
            + +   +++ +AG PPGVVN+I G + AV  + + +HPD+  I+F GS   G+ + +  
Sbjct: 185 LSLLYFANLVKEAGFPPGVVNIIPG-YGAVAGSALAEHPDVDKIAFTGSTATGRLVMKAA 243

Query: 264 AKNGKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--E 321
           A+N K V    G K+  ++  DA+    +          +GQ C A S  I+V E++  +
Sbjct: 244 AQNLKAVTLECGGKSPALVFEDADLDQAVKWAAAGIMYNSGQNCTATSR-IYVQESIYDK 302

Query: 322 WLPELKKRA-EALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDG 375
           ++ +  +   +  KV +     T +GP +S    +R+LS I+ G  EGA L+  G
Sbjct: 303 FVEKFVEHVKQNYKVGSPFDDDTVVGPQVSKTQYDRVLSYIEKGKKEGAKLVYGG 357


>gnl|CDD|132260 TIGR03216, OH_muco_semi_DH, 2-hydroxymuconic semialdehyde
           dehydrogenase.  Members of this protein family are
           2-hydroxymuconic semialdehyde dehydrogenase. Many
           aromatic compounds are catabolized by way of the
           catechol, via the meta-cleavage pathway, to pyruvate and
           acetyl-CoA. This enzyme performs the second of seven
           steps in that pathway for catechol degradation [Energy
           metabolism, Other].
          Length = 481

 Score =  194 bits (494), Expect = 1e-56
 Identities = 110/352 (31%), Positives = 178/352 (50%), Gaps = 12/352 (3%)

Query: 33  YIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYE-TWSKTSIITR 91
           +I+G FVES  T      NP    V+ RV +A  AE+++AVA+A+ A +  W K ++  R
Sbjct: 4   FINGAFVESGKT--FANINPVDGRVIARVHEAGAAEVDAAVAAARAALKGPWGKMTVAER 61

Query: 92  QQVLFRLQNIIKANMKQLAENITEEQGKTLADAEG-DVLRGLQVVEQCCSIPTIMQGETL 150
             +L+ + + I+            + GK  + A   D+ RG         +      E  
Sbjct: 62  ADLLYAVADEIERRFDDFLAAEVADTGKPRSLASHLDIPRGAANFRVFADVVKNAPTECF 121

Query: 151 AGVAKD-MDIHSY--RVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGA 207
                D     +Y  R PLGV   I+P+N P ++  W    A+ACGNT V+KPSE  PG 
Sbjct: 122 EMATPDGKGALNYAVRKPLGVVGVISPWNLPLLLMTWKVGPALACGNTVVVKPSEETPGT 181

Query: 208 CMMLVDMLTQAGCPPGVVNVIH--GAHDAVNFICDHPDIRAISFVGSDQAGKYIYERGAK 265
             +L +++   G P GV NV+H  G   A  F+  HP + AI+F G  + G  I +  A 
Sbjct: 182 ATLLGEVMNAVGVPKGVYNVVHGFGPDSAGEFLTRHPGVDAITFTGETRTGSAIMKAAAD 241

Query: 266 NGKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWL 323
             K V   +G KN  ++ +D +    +  ++ +AF   GQ C+  +  ++V   +   ++
Sbjct: 242 GVKPVSFELGGKNAAIVFADCDFDAAVAGILRSAFLNTGQVCLG-TERVYVERPIFDRFV 300

Query: 324 PELKKRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDG 375
             LK RAE+LK+   + P T++GP+IS + ++++LS     V+EGAT++  G
Sbjct: 301 AALKARAESLKIGVPDDPATNMGPLISAEHRDKVLSYYALAVEEGATVVTGG 352


>gnl|CDD|143436 cd07118, ALDH_SNDH, Gluconobacter oxydans L-sorbosone
           dehydrogenase-like.  Included in this CD is the
           L-sorbosone dehydrogenase (SNDH) from Gluconobacter
           oxydans UV10. In G. oxydans,  D-sorbitol is converted to
           2-keto-L-gulonate (a precursor of L-ascorbic acid) in
           sequential oxidation steps catalyzed by a FAD-dependent,
           L-sorbose dehydrogenase and an NAD(P)+-dependent,
           L-sorbosone dehydrogenase.
          Length = 454

 Score =  192 bits (490), Expect = 2e-56
 Identities = 108/334 (32%), Positives = 177/334 (52%), Gaps = 12/334 (3%)

Query: 50  HNPATNEVVTRVPKATPAEMESAVASAKKAYET--WSKTSIITRQQVLFRLQNIIKANMK 107
            +PA   VV R  + T  ++++AVA+A+KA++   W + S   R  VL ++ ++I+A  +
Sbjct: 2   RSPAHGVVVARYAEGTVEDVDAAVAAARKAFDKGPWPRMSGAERAAVLLKVADLIRARRE 61

Query: 108 QLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGETLAGVAKDMDIHSYRVPLG 167
           +LA   T E GK ++ A G++     +     S+   + G++   +  DM     R P+G
Sbjct: 62  RLALIETLESGKPISQARGEIEGAADLWRYAASLARTLHGDSYNNLGDDMLGLVLREPIG 121

Query: 168 VTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAGCPPGVVNV 227
           V   ITP+NFP +I     P A+A G T V+KPSE   G  +ML ++L +AG P GVVN+
Sbjct: 122 VVGIITPWNFPFLILSQKLPFALAAGCTVVVKPSEFTSGTTLMLAELLIEAGLPAGVVNI 181

Query: 228 IHGAHDAV-NFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNHGVIMSDA 286
           + G    V   + +HPD+  +SF GS + GK I    A+N K+V   +G KN  ++ +DA
Sbjct: 182 VTGYGATVGQAMTEHPDVDMVSFTGSTRVGKAIAAAAARNLKKVSLELGGKNPQIVFADA 241

Query: 287 NRSNTLNQLVGAAFGA---AGQRCMALSTVIFVGEAV--EWLPELKKRAEALKVSAGNVP 341
           +     +      FG    AG+ C + S  + V E++   ++  +  R+  ++V     P
Sbjct: 242 DLDAAAD---AVVFGVYFNAGECCNSGSR-LLVHESIADAFVAAVVARSRKVRVGDPLDP 297

Query: 342 GTDLGPVISPQAKERILSLIQSGVDEGATLLLDG 375
            T +G +I+     +I   + +G  EGATLLL G
Sbjct: 298 ETKVGAIINEAQLAKITDYVDAGRAEGATLLLGG 331


>gnl|CDD|143469 cd07151, ALDH_HBenzADH, NADP+-dependent p-hydroxybenzaldehyde
           dehydrogenase-like.  NADP+-dependent,
           p-hydroxybenzaldehyde dehydrogenase (PchA, HBenzADH)
           which catalyzes oxidation of p-hydroxybenzaldehyde to
           p-hydroxybenzoic acid and other related sequences are
           included in this CD.
          Length = 465

 Score =  191 bits (487), Expect = 9e-56
 Identities = 105/345 (30%), Positives = 177/345 (51%), Gaps = 7/345 (2%)

Query: 36  GKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQQVL 95
           G++ +  +   ID+ NP T E +  +P A+  +++ A  +A  A + W+ T    R ++L
Sbjct: 1   GEWRDGTSERTIDVLNPYTGETLAEIPAASKEDVDEAYRAAAAAQKEWAATLPQERAEIL 60

Query: 96  FRLQNIIKANMKQLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGETLAGVAK 155
            +   I++    ++ E +  E G T   A  +    + +  +  + P  M+G  L     
Sbjct: 61  EKAAQILEERRDEIVEWLIRESGSTRIKANIEWGAAMAITREAATFPLRMEGRILPSDVP 120

Query: 156 DMDIHSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDP--GACMMLVD 213
             +   YR PLGV   I+P+NFP  + +     A+A GN  V+KP+   P  G  ++L  
Sbjct: 121 GKENRVYREPLGVVGVISPWNFPLHLSMRSVAPALALGNAVVLKPASDTPITGG-LLLAK 179

Query: 214 MLTQAGCPPGVVNVIHGA-HDAVNFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQS 272
           +  +AG P GV+NV+ GA  +  +   +HP  R ISF GS   G++I E   ++ K+V  
Sbjct: 180 IFEEAGLPKGVLNVVVGAGSEIGDAFVEHPVPRLISFTGSTPVGRHIGELAGRHLKKVAL 239

Query: 273 NMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELKKRA 330
            +G  N  V++ DA+    +N  V   F   GQ CMA++ +I V E V  E++ +  +R 
Sbjct: 240 ELGGNNPFVVLEDADIDAAVNAAVFGKFLHQGQICMAINRII-VHEDVYDEFVEKFVERV 298

Query: 331 EALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDG 375
           +AL     + P T +GP+I+    + +L  I+  V+EGATLL+ G
Sbjct: 299 KALPYGDPSDPDTVVGPLINESQVDGLLDKIEQAVEEGATLLVGG 343


>gnl|CDD|143411 cd07092, ALDH_ABALDH-YdcW, Escherichia coli NAD+-dependent
           gamma-aminobutyraldehyde dehydrogenase YdcW-like.
           NAD+-dependent, tetrameric, gamma-aminobutyraldehyde
           dehydrogenase (ABALDH), YdcW of Escherichia coli K12,
           catalyzes the oxidation of gamma-aminobutyraldehyde to
           gamma-aminobutyric acid. ABALDH can also oxidize n-alkyl
           medium-chain aldehydes, but with a lower catalytic
           efficiency.
          Length = 450

 Score =  190 bits (485), Expect = 1e-55
 Identities = 106/341 (31%), Positives = 167/341 (48%), Gaps = 23/341 (6%)

Query: 51  NPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANMKQLA 110
           +PAT E +  VP A+ A++++AVA+A  A+ +W +T+   R + L +L + I+ N ++LA
Sbjct: 3   DPATGEEIATVPDASAADVDAAVAAAHAAFPSWRRTTPAERSKALLKLADAIEENAEELA 62

Query: 111 ENITEEQGKTLADAEGDVLRGLQVVEQ-------CCSIPTIMQGETLAGVAKDMDIHSY- 162
              +   GK L     D L G   V+          ++     GE L G        S  
Sbjct: 63  ALESRNTGKPLHLVRDDELPG--AVDNFRFFAGAARTLEGPAAGEYLPGHT------SMI 114

Query: 163 -RVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAGCP 221
            R P+GV A I P+N+P M+  W    A+A GNT V+KPSE  P   ++L + L     P
Sbjct: 115 RREPIGVVAQIAPWNYPLMMAAWKIAPALAAGNTVVLKPSETTPLTTLLLAE-LAAEVLP 173

Query: 222 PGVVNVIHGAHDAV-NFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNHG 280
           PGVVNV+ G   +  + +  HP +R +S  GS + GK +    A   KRV   +G K   
Sbjct: 174 PGVVNVVCGGGASAGDALVAHPRVRMVSLTGSVRTGKKVARAAADTLKRVHLELGGKAPV 233

Query: 281 VIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELKKRAEALKVSAG 338
           ++  DA+    +  +  A +  AGQ C A   V  V E+V  E++  L +   A++V   
Sbjct: 234 IVFDDADLDAAVAGIATAGYYNAGQDCTAACRVY-VHESVYDEFVAALVEAVSAIRVGDP 292

Query: 339 NVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGRNLK 379
           +   T++GP+ S   +ER+   +       A +L  GR  +
Sbjct: 293 DDEDTEMGPLNSAAQRERVAGFV-ERAPAHARVLTGGRRAE 332


>gnl|CDD|200087 TIGR01237, D1pyr5carbox2, delta-1-pyrroline-5-carboxylate
           dehydrogenase, group 2, putative.  This enzyme is the
           second of two in the degradation of proline to
           glutamate. This model represents one of several related
           branches of delta-1-pyrroline-5-carboxylate
           dehydrogenase. Members of this branch may be associated
           with proline dehydrogenase (the other enzyme of the
           pathway from proline to glutamate) but have not been
           demonstrated experimentally. The branches are not as
           closely related to each other as some distinct aldehyde
           dehydrogenases are to some; separate models were built
           to let each model describe a set of equivalogs [Energy
           metabolism, Amino acids and amines].
          Length = 511

 Score =  191 bits (486), Expect = 2e-55
 Identities = 112/356 (31%), Positives = 170/356 (47%), Gaps = 23/356 (6%)

Query: 32  LYIDGKFVESKATDWIDLHNPA-TNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIIT 90
           L I+G+ VE++    I   NP   +EVV  V KA+    E A+ +A KA+E W KT    
Sbjct: 35  LVINGERVETEN--KIVSINPCDKSEVVGTVSKASQEHAEHALQAAAKAFEAWKKTDPEE 92

Query: 91  RQQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGETL 150
           R  +LF+   I++    + +  + +E GK   +A+ +V   +  +E         Q   L
Sbjct: 93  RAAILFKAAAIVRRRRHEFSALLVKEVGKPWNEADAEVAEAIDFMEYYAR-----QMIEL 147

Query: 151 AGVAKDMDIHSYR-----VPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDP 205
           A                  P GVT  I+P+NFP  I + M    I  GN  V+KP+E  P
Sbjct: 148 AKGKPVNSREGETNQYVYTPTGVTVVISPWNFPFAIMVGMTVAPIVTGNCVVLKPAEAAP 207

Query: 206 GACMMLVDMLTQAGCPPGVVNVIHGAHDAV-NFICDHPDIRAISFVGSDQAGKYIYERGA 264
                 V++L +AG P GVV  + G+   V +++ DHP    I+F GS + G  I+ER A
Sbjct: 208 VIAAKFVEILEEAGLPKGVVQFVPGSGSEVGDYLVDHPKTSLITFTGSREVGTRIFERAA 267

Query: 265 K------NGKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGE 318
           K      + KRV + MG K+  ++  DA+          +AFG AGQ+C A S  + V E
Sbjct: 268 KVQPGQKHLKRVIAEMGGKDTVIVDEDADIELAAQSAFTSAFGFAGQKCSAGSRAV-VHE 326

Query: 319 AV--EWLPELKKRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLL 372
            V  E +    +  E+LKV   +     +GPVI  ++  +I+  I+ G  EG  + 
Sbjct: 327 KVYDEVVERFVEITESLKVGPPDSADVYVGPVIDQKSFNKIMEYIEIGKAEGRLVS 382


>gnl|CDD|143408 cd07089, ALDH_CddD-AldA-like, Rhodococcus ruber 6-oxolauric acid
           dehydrogenase-like and related proteins.  The
           6-oxolauric acid dehydrogenase (CddD) from Rhodococcus
           ruber SC1 which converts 6-oxolauric acid to
           dodecanedioic acid; and the aldehyde dehydrogenase
           (locus SSP0762) from Staphylococcus saprophyticus subsp.
           saprophyticus ATCC 15305 and also, the Mycobacterium
           tuberculosis H37Rv ALDH AldA (locus Rv0768) sequence;
           and other similar sequences, are included in this CD.
          Length = 459

 Score =  189 bits (482), Expect = 5e-55
 Identities = 106/342 (30%), Positives = 167/342 (48%), Gaps = 24/342 (7%)

Query: 50  HNPATNEVVTRVPKATPAEMESAVASAKKAYETWS-KTSIITRQQVLFRLQNIIKANMKQ 108
            NPAT EV+   P A  A++++A+A+A++A++T    T    R + L +L   ++A  ++
Sbjct: 2   INPATEEVIGTAPDAGAADVDAAIAAARRAFDTGDWSTDAEERARCLRQLHEALEARKEE 61

Query: 109 LAENITEEQGKTLADAE--------GDVLRGLQVVEQCCSIPTIMQGETLAGVAKDMDIH 160
           L   +  E G  +  A         G +     + +        +    L G      + 
Sbjct: 62  LRALLVAEVGAPVMTARAMQVDGPIGHLRYFADLADSF-PWEFDLPVPALRGGPGRRVVR 120

Query: 161 SYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAGC 220
             R P+GV A ITP+NFP  + L     A+A GNT V+KP+   P + ++L +++ +   
Sbjct: 121 --REPVGVVAAITPWNFPFFLNLAKLAPALAAGNTVVLKPAPDTPLSALLLGEIIAETDL 178

Query: 221 PPGVVNVIHGAHDAV-NFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNH 279
           P GVVNV+ G+ +AV   +   P +  +SF GS   G+ I  + A   KRV   +G K+ 
Sbjct: 179 PAGVVNVVTGSDNAVGEALTTDPRVDMVSFTGSTAVGRRIMAQAAATLKRVLLELGGKSA 238

Query: 280 GVIMSDANRSNTLNQLVGAAFG----AAGQRCMALSTVIFVGEAV--EWLPELKKRAEAL 333
            +++ DA+    L     AA G     AGQ C AL+T + V  +   E +  L    EAL
Sbjct: 239 NIVLDDAD----LAAAAPAAVGVCMHNAGQGC-ALTTRLLVPRSRYDEVVEALAAAFEAL 293

Query: 334 KVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDG 375
            V     PGT +GP+IS   ++R+   I  G DEGA L+  G
Sbjct: 294 PVGDPADPGTVMGPLISAAQRDRVEGYIARGRDEGARLVTGG 335


>gnl|CDD|143413 cd07094, ALDH_F21_LactADH-like, ALDH subfamily: NAD+-dependent,
           lactaldehyde dehydrogenase, ALDH family 21 A1, and
           related proteins.  ALDH subfamily which includes Tortula
           ruralis aldehyde dehydrogenase ALDH21A1 (RNP123), and
           NAD+-dependent, lactaldehyde dehydrogenase (EC=1.2.1.22)
           and like sequences.
          Length = 453

 Score =  188 bits (480), Expect = 7e-55
 Identities = 97/337 (28%), Positives = 158/337 (46%), Gaps = 10/337 (2%)

Query: 47  IDLHNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANM 106
           +D+HNP   EV+ +VP    A+ E A+A+A+   E         R  +L R  +++K   
Sbjct: 1   LDVHNPYDGEVIGKVPADDRADAEEALATARAGAENRRALPPHERMAILERAADLLKKRA 60

Query: 107 KQLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGETLAGVA----KDMDIHSY 162
           ++ A+ I  E GK + DA  +V R +  +         ++GE +   A     +    + 
Sbjct: 61  EEFAKIIACEGGKPIKDARVEVDRAIDTLRLAAEEAERIRGEEIPLDATQGSDNRLAWTI 120

Query: 163 RVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAGCPP 222
           R P+GV   ITPFNFP  +       AIA G   V+KP+ + P + + L  +L +AG P 
Sbjct: 121 REPVGVVLAITPFNFPLNLVAHKLAPAIATGCPVVLKPASKTPLSALELAKILVEAGVPE 180

Query: 223 GVVNVIHGA-HDAVNFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNHGV 281
           GV+ V+ G      +       +  +SF GS   G+ +       GKR+   +G     +
Sbjct: 181 GVLQVVTGEREVLGDAFAADERVAMLSFTGSAAVGEALRANAG--GKRIALELGGNAPVI 238

Query: 282 IMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELKKRAEALKVSAGN 339
           +  DA+    +  L    F  AGQ C+++   I+V E +  E++       + LKV    
Sbjct: 239 VDRDADLDAAIEALAKGGFYHAGQVCISVQR-IYVHEELYDEFIEAFVAAVKKLKVGDPL 297

Query: 340 VPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGR 376
              TD+GP+IS +A ER+   ++  V+ GA LL  G 
Sbjct: 298 DEDTDVGPLISEEAAERVERWVEEAVEAGARLLCGGE 334


>gnl|CDD|143465 cd07147, ALDH_F21_RNP123, Aldehyde dehydrogenase family 21A1-like. 
           Aldehyde dehydrogenase ALDH21A1 (gene name RNP123) was
           first described in the moss Tortula ruralis and is
           believed to play an important role in the detoxification
           of aldehydes generated in response to desiccation- and
           salinity-stress, and ALDH21A1 expression represents a
           unique stress tolerance mechanism. So far, of plants,
           only the bryophyte sequence has been observed, but
           similar protein sequences from bacteria and archaea are
           also present in this CD.
          Length = 452

 Score =  188 bits (480), Expect = 8e-55
 Identities = 110/345 (31%), Positives = 168/345 (48%), Gaps = 27/345 (7%)

Query: 47  IDLHNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANM 106
           +++ NP T EVV RV  A P ++E A+A+A KA+          R  +L      ++   
Sbjct: 1   LEVTNPYTGEVVARVALAGPDDIEEAIAAAVKAFRPMRALPAHRRAAILLHCVARLEERF 60

Query: 107 KQLAENITEEQGKTLADAEGDVLRGLQVVEQCCSI-PTIMQGETLAGVAKDMDIHS---- 161
           ++LAE I  E GK + DA G+V R      +  +   T + GE L      +DI +    
Sbjct: 61  EELAETIVLEAGKPIKDARGEVAR-AIDTFRIAAEEATRIYGEVL-----PLDISARGEG 114

Query: 162 -----YRVPLGVTAGITPFNFPAMIPLWMFPV--AIACGNTHVIKPSERDPGACMMLVDM 214
                 R P+G  + ITPFNFP  + L    V  AIA G   V+KP+ R P + ++L ++
Sbjct: 115 RQGLVRRFPIGPVSAITPFNFP--LNLVAHKVAPAIAAGCPFVLKPASRTPLSALILGEV 172

Query: 215 LTQAGCPPGVVNVIHGAHDAVNFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNM 274
           L + G P G  +V+  + D  + +     I+ +SF GS   G  +  R  K  K+V   +
Sbjct: 173 LAETGLPKGAFSVLPCSRDDADLLVTDERIKLLSFTGSPAVGWDLKARAGK--KKVVLEL 230

Query: 275 GAKNHGVIM-SDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELKKRAE 331
           G  N  VI+ SDA+      +++  AF  AGQ C+++   + V  +V  E+   L  R +
Sbjct: 231 GG-NAAVIVDSDADLDFAAQRIIFGAFYQAGQSCISVQR-VLVHRSVYDEFKSRLVARVK 288

Query: 332 ALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGR 376
           ALK        TD+GP+IS    ER+   +   VD GA LL  G+
Sbjct: 289 ALKTGDPKDDATDVGPMISESEAERVEGWVNEAVDAGAKLLTGGK 333


>gnl|CDD|143459 cd07141, ALDH_F1AB_F2_RALDH1, NAD+-dependent retinal dehydrogenase
           1, ALDH families 1A, 1B, and 2-like.  NAD+-dependent
           retinal dehydrogenase 1 (RALDH 1, ALDH1, EC=1.2.1.36)
           also known as aldehyde dehydrogenase family 1 member A1
           (ALDH1A1) in humans, is a homotetrameric, cytosolic
           enzyme that catalyzes the oxidation of retinaldehyde to
           retinoic acid. Human ALDH1B1 and ALDH2 are also in this
           cluster; both are mitochrondrial homotetramers which
           play important roles in acetaldehyde oxidation; ALDH1B1
           in response to UV light exposure and ALDH2 during
           ethanol metabolism.
          Length = 481

 Score =  189 bits (481), Expect = 8e-55
 Identities = 116/359 (32%), Positives = 185/359 (51%), Gaps = 14/359 (3%)

Query: 27  IATTKLYIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYE---TW 83
           I  TK++I+ ++ +S +       NPAT E +  V +   A+++ AV +A+ A++    W
Sbjct: 4   IKYTKIFINNEWHDSVSGKTFPTINPATGEKICEVQEGDKADVDKAVKAARAAFKLGSPW 63

Query: 84  SKTSIITRQQVLFRLQNIIKANMKQLAENITEEQGKTLADA-EGDVLRGLQVVEQCCSIP 142
                  R ++L +L ++I+ +   LA   T + GK  + +   D+   ++V+       
Sbjct: 64  RTMDASERGRLLNKLADLIERDRAYLASLETLDNGKPFSKSYLVDLPGAIKVLRYYAGWA 123

Query: 143 TIMQGETLAGVAKDMDIHSY--RVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKP 200
             + G+T   +  D D  +Y    P+GV   I P+NFP ++  W    A+ACGNT V+KP
Sbjct: 124 DKIHGKT---IPMDGDFFTYTRHEPVGVCGQIIPWNFPLLMAAWKLAPALACGNTVVLKP 180

Query: 201 SERDPGACMMLVDMLTQAGCPPGVVNVIHG-AHDAVNFICDHPDIRAISFVGSDQAGKYI 259
           +E+ P   + L  ++ +AG PPGVVNV+ G    A   I  HPDI  ++F GS + GK I
Sbjct: 181 AEQTPLTALYLASLIKEAGFPPGVVNVVPGYGPTAGAAISSHPDIDKVAFTGSTEVGKLI 240

Query: 260 YERGAK-NGKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGE 318
            +   K N KRV   +G K+  ++ +DA+    + Q   A F   GQ C A S   FV E
Sbjct: 241 QQAAGKSNLKRVTLELGGKSPNIVFADADLDYAVEQAHEALFFNMGQCCCAGSRT-FVQE 299

Query: 319 AV--EWLPELKKRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDG 375
           ++  E++    +RA+   V     P T+ GP I  +  ++IL LI+SG  EGA L   G
Sbjct: 300 SIYDEFVKRSVERAKKRVVGNPFDPKTEQGPQIDEEQFKKILELIESGKKEGAKLECGG 358


>gnl|CDD|237391 PRK13473, PRK13473, gamma-aminobutyraldehyde dehydrogenase;
           Provisional.
          Length = 475

 Score =  187 bits (477), Expect = 3e-54
 Identities = 106/348 (30%), Positives = 169/348 (48%), Gaps = 25/348 (7%)

Query: 28  ATTKLYIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTS 87
             TKL I+G+ V  +      ++NPAT EV+  + +A+ A++++AVA+A  A+  WS+T+
Sbjct: 1   MQTKLLINGELVAGEGEKQ-PVYNPATGEVLAEIAEASAAQVDAAVAAADAAFPEWSQTT 59

Query: 88  IITRQQVLFRLQNIIKANMKQLAENITEEQGK----TLAD---AEGDVLRGLQVVEQCCS 140
              R + L +L + I+ N  + A   +   GK     L D   A  DV R      +C  
Sbjct: 60  PKERAEALLKLADAIEENADEFARLESLNCGKPLHLALNDEIPAIVDVFRFFAGAARCLE 119

Query: 141 IPTIMQGETLAGVAKDMDIH-SY--RVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHV 197
                 GE L G       H S   R P+GV A I P+N+P M+  W    A+A GNT V
Sbjct: 120 GKA--AGEYLEG-------HTSMIRRDPVGVVASIAPWNYPLMMAAWKLAPALAAGNTVV 170

Query: 198 IKPSERDPGACMMLVDMLTQAGCPPGVVNVIHGAHDAV-NFICDHPDIRAISFVGSDQAG 256
           +KPSE  P   + L ++      PPGV+NV+ G    V + +  HP +R +S  GS   G
Sbjct: 171 LKPSEITPLTALKLAELAADI-LPPGVLNVVTGRGATVGDALVGHPKVRMVSLTGSIATG 229

Query: 257 KYIYERGAKNGKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFV 316
           K++    A + KR    +G K   ++  DA+    +  +    +  AGQ C A +  I+ 
Sbjct: 230 KHVLSAAADSVKRTHLELGGKAPVIVFDDADLDAVVEGIRTFGYYNAGQDCTA-ACRIYA 288

Query: 317 GEAV--EWLPELKKRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQ 362
              +  + + +L      LKV   +   T+LGP+IS   ++R+   ++
Sbjct: 289 QRGIYDDLVAKLAAAVATLKVGDPDDEDTELGPLISAAHRDRVAGFVE 336


>gnl|CDD|143461 cd07143, ALDH_AldA_AN0554, Aspergillus nidulans aldehyde
           dehydrogenase, AldA (AN0554)-like.  NAD(P)+-dependent
           aldehyde dehydrogenase (AldA) of Aspergillus nidulans
           (locus AN0554), and other similar sequences, are present
           in this CD.
          Length = 481

 Score =  185 bits (471), Expect = 2e-53
 Identities = 113/360 (31%), Positives = 187/360 (51%), Gaps = 9/360 (2%)

Query: 24  IRHIATTKLYIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYET- 82
            ++   T L+I+G+FV+S     + ++NP+T +++T++ +AT A+++ AV  A  A+ET 
Sbjct: 1   GKYEQPTGLFINGEFVDSVHGGTVKVYNPSTGKLITKIAEATEADVDIAVEVAHAAFETD 60

Query: 83  WS-KTSIITRQQVLFRLQNIIKANMKQLAENITEEQGKT-LADAEGDVLRGLQVVEQCCS 140
           W  K S   R + L +L ++++ N+  LA     + GKT       DV            
Sbjct: 61  WGLKVSGSKRGRCLSKLADLMERNLDYLASIEALDNGKTFGTAKRVDVQASADTFRYYGG 120

Query: 141 IPTIMQGETLAGVAKDMDIHSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKP 200
               + G+ +    K +  ++   P+GV   I P+NFP ++  W    A+A GNT V+KP
Sbjct: 121 WADKIHGQVIETDIKKL-TYTRHEPIGVCGQIIPWNFPLLMCAWKIAPALAAGNTIVLKP 179

Query: 201 SERDPGACMMLVDMLTQAGCPPGVVNVIHG-AHDAVNFICDHPDIRAISFVGSDQAGKYI 259
           SE  P + + +  ++ +AG PPGV+NV+ G      N I  H DI  ++F GS   G+ +
Sbjct: 180 SELTPLSALYMTKLIPEAGFPPGVINVVSGYGRTCGNAISSHMDIDKVAFTGSTLVGRKV 239

Query: 260 YERGAK-NGKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGE 318
            E  AK N K+V   +G K+  ++  DA+  + +       F   GQ C A  + I+V E
Sbjct: 240 MEAAAKSNLKKVTLELGGKSPNIVFDDADLESAVVWTAYGIFFNHGQVCCA-GSRIYVQE 298

Query: 319 AV--EWLPELKKRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGR 376
            +  +++   K++A+ LKV       T  GP +S    ERI+S I+SG  EGAT+   G+
Sbjct: 299 GIYDKFVKRFKEKAKKLKVGDPFAEDTFQGPQVSQIQYERIMSYIESGKAEGATVETGGK 358


>gnl|CDD|143430 cd07112, ALDH_GABALDH-PuuC, Escherichia coli NADP+-dependent
           gamma-glutamyl-gamma-aminobutyraldehyde dehydrogenase
           PuuC-like.  NADP+-dependent,
           gamma-glutamyl-gamma-aminobutyraldehyde dehydrogenase
           (GABALDH) PuuC of  Escherichia coli which catalyzes the
           conversion of putrescine to 4-aminobutanoate and other
           similar sequences are present in this CD.
          Length = 462

 Score =  182 bits (465), Expect = 1e-52
 Identities = 108/346 (31%), Positives = 173/346 (50%), Gaps = 32/346 (9%)

Query: 50  HNPATNEVVTRVPKATPAEMESAVASAKKAYE--TWSKTSIITRQQVLFRLQNIIKANMK 107
            NPAT  V+  V     A+++ AVA+A++A+E   WS+ S   R+ VL RL ++I+A+  
Sbjct: 7   INPATGRVLAEVAACDAADVDRAVAAARRAFESGVWSRLSPAERKAVLLRLADLIEAHRD 66

Query: 108 QLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTI---------MQGETLAGVAKDMD 158
           +LA   T + GK ++DA    +          +  T          + GE  A    D  
Sbjct: 67  ELALLETLDMGKPISDALAVDVPS--------AANTFRWYAEAIDKVYGEV-APTGPDAL 117

Query: 159 IHSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQA 218
               R PLGV   + P+NFP ++  W    A+A GN+ V+KP+E+ P   + L ++  +A
Sbjct: 118 ALITREPLGVVGAVVPWNFPLLMAAWKIAPALAAGNSVVLKPAEQSPLTALRLAELALEA 177

Query: 219 GCPPGVVNVIHGA-HDAVNFICDHPDIRAISFVGSDQAGKYIYERGAK-NGKRVQSNMGA 276
           G P GV+NV+ G  H A   +  H D+ A++F GS + G+   E   + N KRV    G 
Sbjct: 178 GLPAGVLNVVPGFGHTAGEALGLHMDVDALAFTGSTEVGRRFLEYSGQSNLKRVWLECGG 237

Query: 277 KNHGVIMSDANRSNTLNQLV-GAAFGA---AGQRCMALSTVIFVGEAV--EWLPELKKRA 330
           K+  ++ +DA     L+     AA G     G+ C A S ++ V E++  E+L ++   A
Sbjct: 238 KSPNIVFADA---PDLDAAAEAAAAGIFWNQGEVCSAGSRLL-VHESIKDEFLEKVVAAA 293

Query: 331 EALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGR 376
              K      P T +G ++S    +++L  I+SG  EGA L+  G+
Sbjct: 294 REWKPGDPLDPATRMGALVSEAHFDKVLGYIESGKAEGARLVAGGK 339


>gnl|CDD|143470 cd07152, ALDH_BenzADH, NAD-dependent benzaldehyde dehydrogenase
           II-like.  NAD-dependent, benzaldehyde dehydrogenase II
           (XylC, BenzADH, EC=1.2.1.28) is involved in the
           oxidation of benzyl alcohol to benzoate. In
           Acinetobacter calcoaceticus, this process is carried out
           by the chromosomally encoded, benzyl alcohol
           dehydrogenase (xylB) and benzaldehyde dehydrogenase II
           (xylC) enzymes; whereas in Pseudomonas putida they are
           encoded by TOL plasmids.
          Length = 443

 Score =  180 bits (459), Expect = 6e-52
 Identities = 96/325 (29%), Positives = 160/325 (49%), Gaps = 7/325 (2%)

Query: 56  EVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANMKQLAENITE 115
            V+  V  A  A+++ A A A  A   W+ T    R  VL R  ++++ +  ++A+ I  
Sbjct: 2   AVLGEVGVADAADVDRAAARAAAAQRAWAATPPRERAAVLRRAADLLEEHADEIADWIVR 61

Query: 116 EQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGETLAGVAKDMDIHSYRVPLGVTAGITPF 175
           E G     A  +V   +  + +   +PT  QGE L      + + + RVPLGV   I+PF
Sbjct: 62  ESGSIRPKAGFEVGAAIGELHEAAGLPTQPQGEILPSAPGRLSL-ARRVPLGVVGVISPF 120

Query: 176 NFPAMIPLWMFPVAIACGNTHVIKPSERDP--GACMMLVDMLTQAGCPPGVVNVIHGAHD 233
           NFP ++ +     A+A GN  V+KP  R P  G  +++  +  +AG P GV++V+ G  D
Sbjct: 121 NFPLILAMRSVAPALALGNAVVLKPDPRTPVSGG-VVIARLFEEAGLPAGVLHVLPGGAD 179

Query: 234 AVNFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNHGVIMSDANRSNTLN 293
           A   + + P++  ISF GS   G+ + E   ++ K+V   +G KN  +++ DA+     +
Sbjct: 180 AGEALVEDPNVAMISFTGSTAVGRKVGEAAGRHLKKVSLELGGKNALIVLDDADLDLAAS 239

Query: 294 QLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELKKRAEALKVSAGNVPGTDLGPVISP 351
                AF   GQ CMA +    V E+V   +  +L  +A+ L V         LGP+I+ 
Sbjct: 240 NGAWGAFLHQGQICMA-AGRHLVHESVADAYTAKLAAKAKHLPVGDPATGQVALGPLINA 298

Query: 352 QAKERILSLIQSGVDEGATLLLDGR 376
           +  +R+ +++   V  GA L   G 
Sbjct: 299 RQLDRVHAIVDDSVAAGARLEAGGT 323


>gnl|CDD|143435 cd07117, ALDH_StaphAldA1, Uncharacterized Staphylococcus aureus
           AldA1 (SACOL0154) aldehyde dehydrogenase-like.
           Uncharacterized aldehyde dehydrogenase from
           Staphylococcus aureus (AldA1, locus SACOL0154) and other
           similar sequences are present in this CD.
          Length = 475

 Score =  180 bits (458), Expect = 2e-51
 Identities = 106/356 (29%), Positives = 178/356 (50%), Gaps = 15/356 (4%)

Query: 31  KLYIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIIT 90
            L+I+G++V+  + + ID +NPA  E ++ +  AT A+++ AV +A++A++TW KT++  
Sbjct: 2   GLFINGEWVKGSSGETIDSYNPANGETLSEITDATDADVDRAVKAAQEAFKTWRKTTVAE 61

Query: 91  RQQVLFRLQNIIKANMKQLAENITEEQGKTLADAEG-DVLRGLQVVEQCCSIPTIMQGET 149
           R  +L ++ +II  N + LA   T + GK + +    D+            +  I   E 
Sbjct: 62  RANILNKIADIIDENKELLAMVETLDNGKPIRETRAVDIPLAADHFRYFAGV--IRAEE- 118

Query: 150 LAGVAKDMDIHSY----RVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDP 205
             G A  +D  +     R P+GV   I P+NFP ++  W    A+A GNT VIKPS    
Sbjct: 119 --GSANMIDEDTLSIVLREPIGVVGQIIPWNFPFLMAAWKLAPALAAGNTVVIKPSSTTS 176

Query: 206 GACMMLVDMLTQAGCPPGVVNVIHG-AHDAVNFICDHPDIRAISFVGSDQAGKYIYERGA 264
            + + L  ++     P GVVN++ G    +  ++ +HP +  ++F GS + G+ +    A
Sbjct: 177 LSLLELAKIIQDV-LPKGVVNIVTGKGSKSGEYLLNHPGLDKLAFTGSTEVGRDVAIAAA 235

Query: 265 KNGKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EW 322
           K        +G K+  +I  DAN    L           GQ C A S  IFV E +  E+
Sbjct: 236 KKLIPATLELGGKSANIIFDDANWDKALEGAQLGILFNQGQVCCAGSR-IFVQEGIYDEF 294

Query: 323 LPELKKRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGRNL 378
           + +LK++ E +KV     P T +G  ++    ++ILS +    +EGA +L  G  L
Sbjct: 295 VAKLKEKFENVKVGNPLDPDTQMGAQVNKDQLDKILSYVDIAKEEGAKILTGGHRL 350


>gnl|CDD|143471 cd07559, ALDH_ACDHII_AcoD-like, Ralstonia eutrophus NAD+-dependent
           acetaldehyde dehydrogenase II and Staphylococcus aureus
           AldA1 (SACOL0154)-like.  Included in this CD is the
           NAD+-dependent, acetaldehyde dehydrogenase II (AcDHII,
           AcoD, EC=1.2.1.3) from Ralstonia (Alcaligenes) eutrophus
           H16 involved in the catabolism of acetoin and ethanol,
           and similar proteins, such as, the dimeric
           dihydrolipoamide dehydrogenase of the acetoin
           dehydrogenase enzyme system of Klebsiella pneumonia.
           Also included are sequences similar to the
           NAD+-dependent chloroacetaldehyde dehydrogenases (AldA
           and AldB) of Xanthobacter autotrophicus GJ10 which are
           involved in the degradation of 1,2-dichloroethane, as
           well as, the uncharacterized aldehyde dehydrogenase from
           Staphylococcus aureus (AldA1, locus SACOL0154) and other
           similar sequences.
          Length = 480

 Score =  178 bits (453), Expect = 9e-51
 Identities = 107/366 (29%), Positives = 179/366 (48%), Gaps = 34/366 (9%)

Query: 31  KLYIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIIT 90
             +I+G++V     ++ D +NP   +V+  +P++T  +++ AV +A +A++TW KTS+  
Sbjct: 2   DNFINGEWVAPSKGEYFDNYNPVNGKVLCEIPRSTAEDVDLAVDAAHEAFKTWGKTSVAE 61

Query: 91  RQQVLFRLQNIIKANMKQLAENITEEQGK-----TLAD--AEGDVLRGLQVVEQCCSIPT 143
           R  +L ++ + I+ N++ LA   T + GK       AD     D  R    V        
Sbjct: 62  RANILNKIADRIEENLELLAVAETLDNGKPIRETLAADIPLAIDHFRYFAGV-------- 113

Query: 144 IMQGETLAGVAKDMD-----IHSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVI 198
           I   E   G   ++D      H +  PLGV   I P+NFP ++  W    A+A GNT V+
Sbjct: 114 IRAQE---GSLSEIDEDTLSYH-FHEPLGVVGQIIPWNFPLLMAAWKLAPALAAGNTVVL 169

Query: 199 KPSERDPGACMMLVDMLTQAGCPPGVVNVIHGA-HDAVNFICDHPDIRAISFVGSDQAGK 257
           KP+ + P + ++L++++     P GVVNV+ G   +A   +  HP I  ++F GS   G+
Sbjct: 170 KPASQTPLSILVLMELIGDL-LPKGVVNVVTGFGSEAGKPLASHPRIAKLAFTGSTTVGR 228

Query: 258 YIYERGAKNGKRVQSNMGAKNHGVIMSDA--NRSNTLNQLVGAAFGAA---GQRCMALST 312
            I +  A+N   V   +G K+  +   DA     +  ++      G A   G+ C   S 
Sbjct: 229 LIMQYAAENLIPVTLELGGKSPNIFFDDAMDADDDFDDKAEEGQLGFAFNQGEVCTCPSR 288

Query: 313 VIFVGEAV--EWLPELKKRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGAT 370
              V E++  E++    +R EA+KV     P T +G  +S    E+ILS +  G +EGA 
Sbjct: 289 -ALVQESIYDEFIERAVERFEAIKVGNPLDPETMMGAQVSKDQLEKILSYVDIGKEEGAE 347

Query: 371 LLLDGR 376
           +L  G 
Sbjct: 348 VLTGGE 353


>gnl|CDD|215260 PLN02467, PLN02467, betaine aldehyde dehydrogenase.
          Length = 503

 Score =  178 bits (453), Expect = 2e-50
 Identities = 116/385 (30%), Positives = 187/385 (48%), Gaps = 19/385 (4%)

Query: 26  HIATTKLYIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYET--- 82
            +   +L+I G++ E      I + NPAT E +  +P AT  ++++AV +A+KA++    
Sbjct: 4   PVPRRQLFIGGEWREPVLGKRIPVVNPATEETIGDIPAATAEDVDAAVEAARKAFKRNKG 63

Query: 83  --WSKTSIITRQQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVLRGLQVVEQCCS 140
             W++T+   R + L  +   I     +LA+  T + GK L +A  D+       E    
Sbjct: 64  KDWARTTGAVRAKYLRAIAAKITERKSELAKLETLDCGKPLDEAAWDMDDVAGCFEYYAD 123

Query: 141 IPTIMQGETLAGVAKDMD---IHSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHV 197
           +   +  +  A V+  M+    +  + PLGV   ITP+N+P ++  W    A+A G T V
Sbjct: 124 LAEALDAKQKAPVSLPMETFKGYVLKEPLGVVGLITPWNYPLLMATWKVAPALAAGCTAV 183

Query: 198 IKPSERDPGACMMLVDMLTQAGCPPGVVNVIHGA-HDAVNFICDHPDIRAISFVGSDQAG 256
           +KPSE     C+ L D+  + G PPGV+NV+ G   +A   +  HP +  I+F GS   G
Sbjct: 184 LKPSELASVTCLELADICREVGLPPGVLNVVTGLGTEAGAPLASHPGVDKIAFTGSTATG 243

Query: 257 KYIYERGAKNGKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFV 316
           + I    A+  K V   +G K+  ++  D +    +   +   F   GQ C A S +  V
Sbjct: 244 RKIMTAAAQMVKPVSLELGGKSPIIVFDDVDLDKAVEWAMFGCFWTNGQICSATSRL-LV 302

Query: 317 GEAV--EWLPELKKRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLD 374
            E +  E+L +L K A+ +K+S     G  LGPV+S    E++L  I +   EGAT+L  
Sbjct: 303 HERIASEFLEKLVKWAKNIKISDPLEEGCRLGPVVSEGQYEKVLKFISTAKSEGATILCG 362

Query: 375 G---RNLKVGI----NVPIPVPLSM 392
           G    +LK G      +   V  SM
Sbjct: 363 GKRPEHLKKGFFIEPTIITDVTTSM 387


>gnl|CDD|143423 cd07105, ALDH_SaliADH, Salicylaldehyde dehydrogenase, DoxF-like.
           Salicylaldehyde dehydrogenase (DoxF, SaliADH,
           EC=1.2.1.65) involved in the upper naphthalene catabolic
           pathway of Pseudomonas strain C18 and other similar
           sequences are present in this CD.
          Length = 432

 Score =  175 bits (445), Expect = 6e-50
 Identities = 93/325 (28%), Positives = 158/325 (48%), Gaps = 14/325 (4%)

Query: 70  ESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVL 129
           + AV +A  A+  WSKT    R+ +L +  +++++   +  E + EE G T A A  +V 
Sbjct: 3   DQAVEAAAAAFPAWSKTPPSERRDILLKAADLLESRRDEFIEAMMEETGATAAWAGFNVD 62

Query: 130 RGLQVVEQCCSIPTIMQGETLAGVAKDMDIHSYRVPLGVTAGITPFNFPAMIPLWMFPVA 189
               ++ +  S+ T + G ++            + P+GV  GI P+N P ++        
Sbjct: 63  LAAGMLREAASLITQIIGGSIPSDKPGTLAMVVKEPVGVVLGIAPWNAPVILGTRAIAYP 122

Query: 190 IACGNTHVIKPSERDPGACMMLVDMLTQAGCPPGVVNVIH----GAHDAVNFICDHPDIR 245
           +A GNT V+K SE  P    ++  +  +AG P GV+NV+      A + V  +  HP +R
Sbjct: 123 LAAGNTVVLKASELSPRTHWLIGRVFHEAGLPKGVLNVVTHSPEDAPEVVEALIAHPAVR 182

Query: 246 AISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQ 305
            ++F GS + G+ I E  AK+ K V   +G K   +++ DA+     N  +  AF  +GQ
Sbjct: 183 KVNFTGSTRVGRIIAETAAKHLKPVLLELGGKAPAIVLEDADLDAAANAALFGAFLNSGQ 242

Query: 306 RCMALST-VIFVGEAV--EWLPELKKRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQ 362
            CM  ST  I V E++  E++ +LK  AE L           LG ++S  A +R+  L+ 
Sbjct: 243 ICM--STERIIVHESIADEFVEKLKAAAEKLFAGP-----VVLGSLVSAAAADRVKELVD 295

Query: 363 SGVDEGATLLLDGRNLKVGINVPIP 387
             + +GA L++ G   +      +P
Sbjct: 296 DALSKGAKLVVGGLADESPSGTSMP 320


>gnl|CDD|183050 PRK11241, gabD, succinate-semialdehyde dehydrogenase I;
           Provisional.
          Length = 482

 Score =  175 bits (444), Expect = 1e-49
 Identities = 108/371 (29%), Positives = 192/371 (51%), Gaps = 14/371 (3%)

Query: 17  MKTFRQFIRHIATTKLYIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASA 76
              FRQ        +  I+G+++++   + ID+ NPA  + +  VPK    E  +A+ +A
Sbjct: 6   STLFRQ--------QALINGEWLDANNGEVIDVTNPANGDKLGSVPKMGADETRAAIDAA 57

Query: 77  KKAYETWSKTSIITRQQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVLRGLQVVE 136
            +A   W   +   R  +L R  N++  +   LA  +T EQGK LA+A+G++      +E
Sbjct: 58  NRALPAWRALTAKERANILRRWFNLMMEHQDDLARLMTLEQGKPLAEAKGEISYAASFIE 117

Query: 137 QCCSIPTIMQGETLAGVAKDMDIHSYRVPLGVTAGITPFNFP-AMIPLWMFPVAIACGNT 195
                   + G+T+ G   D  +   + P+GVTA ITP+NFP AMI     P A+A G T
Sbjct: 118 WFAEEGKRIYGDTIPGHQADKRLIVIKQPIGVTAAITPWNFPAAMITRKAGP-ALAAGCT 176

Query: 196 HVIKPSERDPGACMMLVDMLTQAGCPPGVVNVIHGAHDAV-NFICDHPDIRAISFVGSDQ 254
            V+KP+ + P + + L ++  +AG P GV NV+ G+  AV   +  +P +R +SF GS +
Sbjct: 177 MVLKPASQTPFSALALAELAIRAGIPAGVFNVVTGSAGAVGGELTSNPLVRKLSFTGSTE 236

Query: 255 AGKYIYERGAKNGKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVI 314
            G+ + E+ AK+ K+V   +G     ++  DA+    +   + + F  AGQ C+  +  +
Sbjct: 237 IGRQLMEQCAKDIKKVSLELGGNAPFIVFDDADLDKAVEGALASKFRNAGQTCVC-ANRL 295

Query: 315 FVGEAV--EWLPELKKRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLL 372
           +V + V   +  +L++    L +  G   G  +GP+I  +A  ++   I   +++GA ++
Sbjct: 296 YVQDGVYDRFAEKLQQAVSKLHIGDGLEKGVTIGPLIDEKAVAKVEEHIADALEKGARVV 355

Query: 373 LDGRNLKVGIN 383
             G+  ++G N
Sbjct: 356 CGGKAHELGGN 366


>gnl|CDD|143429 cd07111, ALDH_F16, Aldehyde dehydrogenase family 16A1-like.
           Uncharacterized aldehyde dehydrogenase family 16 member
           A1 (ALDH16A1) and other related sequences are present in
           this CD. The active site cysteine and glutamate residues
           are not conserved in the human ALDH16A1 protein
           sequence.
          Length = 480

 Score =  172 bits (437), Expect = 2e-48
 Identities = 100/343 (29%), Positives = 181/343 (52%), Gaps = 19/343 (5%)

Query: 32  LYIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITR 91
            +I+GK+V+ +        NPAT EV+  V +A   ++++AVA+A+ A+E+WS      R
Sbjct: 24  HFINGKWVKPENRKSFPTINPATGEVLASVLQAEEEDVDAAVAAARTAFESWSALPGHVR 83

Query: 92  QQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQG-ETL 150
            + L+R+   I+ + +  A   + + GK + ++              C IP + +     
Sbjct: 84  ARHLYRIARHIQKHQRLFAVLESLDNGKPIRESRD------------CDIPLVARHFYHH 131

Query: 151 AGVAK--DMDIHSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGAC 208
           AG A+  D ++  ++ P+GV   I P+NFP ++  W    A+A GNT V+KP+E  P   
Sbjct: 132 AGWAQLLDTELAGWK-PVGVVGQIVPWNFPLLMLAWKICPALAMGNTVVLKPAEYTPLTA 190

Query: 209 MMLVDMLTQAGCPPGVVNVIHGAHDAVNFICDHPDIRAISFVGSDQAGKYIYERGAKNGK 268
           ++  ++  +AG PPGV+N++ G     + + +HP +  ++F GS + G+ +    A  GK
Sbjct: 191 LLFAEICAEAGLPPGVLNIVTGNGSFGSALANHPGVDKVAFTGSTEVGRALRRATAGTGK 250

Query: 269 RVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPEL 326
           ++   +G K+  ++  DA+  + +  +V A +   GQ C A S ++ V E+V  E + +L
Sbjct: 251 KLSLELGGKSPFIVFDDADLDSAVEGIVDAIWFNQGQVCCAGSRLL-VQESVAEELIRKL 309

Query: 327 KKRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGA 369
           K+R   L+V        D+G ++ P   +RI  L++ G  EGA
Sbjct: 310 KERMSHLRVGDPLDKAIDMGAIVDPAQLKRIRELVEEGRAEGA 352


>gnl|CDD|143420 cd07102, ALDH_EDX86601, Uncharacterized aldehyde dehydrogenase of
           Synechococcus sp. PCC 7335 (EDX86601).  Uncharacterized
           aldehyde dehydrogenase of Synechococcus sp. PCC 7335
           (locus EDX86601) and other similar sequences, are
           present in this CD.
          Length = 452

 Score =  169 bits (431), Expect = 7e-48
 Identities = 89/332 (26%), Positives = 151/332 (45%), Gaps = 11/332 (3%)

Query: 50  HNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANMKQL 109
            +P    V+   P A+   + +A+  A+ A + W    +  R+ ++ R   ++ AN  ++
Sbjct: 1   ISPIDGSVIAERPLASLEAVRAALERARAAQKGWRAVPLEERKAIVTRAVELLAANTDEI 60

Query: 110 AENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGETLAG--VAKDMDIHSY--RVP 165
           AE +T + G+ +A A G++   L+      SI      E LA   V +      Y  R P
Sbjct: 61  AEELTWQMGRPIAQAGGEIRGMLERARYMISIAE----EALADIRVPEKDGFERYIRREP 116

Query: 166 LGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAGCPPGVV 225
           LGV   I P+N+P +  +     A+  GN  ++K S + P           +AG P GV 
Sbjct: 117 LGVVLIIAPWNYPYLTAVNAVIPALLAGNAVILKHSPQTPLCGERFAAAFAEAGLPEGVF 176

Query: 226 NVIHGAHDAVNFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNHGVIMSD 285
            V+H +H+    +   P I  +SF GS   G+ I    A    +V   +G K+   +  D
Sbjct: 177 QVLHLSHETSAALIADPRIDHVSFTGSVAGGRAIQRAAAGRFIKVGLELGGKDPAYVRPD 236

Query: 286 ANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELKKRAEALKVSAGNVPGT 343
           A+       LV  AF  +GQ C ++   I+V E++   ++       +  K+     P T
Sbjct: 237 ADLDAAAESLVDGAFFNSGQSCCSIER-IYVHESIYDAFVEAFVAVVKGYKLGDPLDPST 295

Query: 344 DLGPVISPQAKERILSLIQSGVDEGATLLLDG 375
            LGPV+S +A + + + I   + +GA  L+DG
Sbjct: 296 TLGPVVSARAADFVRAQIADAIAKGARALIDG 327


>gnl|CDD|143426 cd07108, ALDH_MGR_2402, Magnetospirillum NAD(P)+-dependent aldehyde
           dehydrogenase MSR-1-like.  NAD(P)+-dependent aldehyde
           dehydrogenase of Magnetospirillum gryphiswaldense MSR-1
           (MGR_2402) , and other similar sequences, are present in
           this CD.
          Length = 457

 Score =  169 bits (429), Expect = 2e-47
 Identities = 103/331 (31%), Positives = 168/331 (50%), Gaps = 9/331 (2%)

Query: 51  NPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANMKQLA 110
           NPAT +V+  VP++  A+++ AVA+AK A+  W+ T    R ++L R+ + ++A  ++LA
Sbjct: 3   NPATGQVIGEVPRSRAADVDRAVAAAKAAFPEWAATPARERGKLLARIADALEARSEELA 62

Query: 111 ENITEEQGKTL-ADAEGDVLRGLQVVEQCCSIPTIMQGETLAGVAKDMDIHSYRVPLGVT 169
             +  E G  L   A  +      +      +   ++GETL     D+  ++ R PLGV 
Sbjct: 63  RLLALETGNALRTQARPEAAVLADLFRYFGGLAGELKGETLPF-GPDVLTYTVREPLGVV 121

Query: 170 AGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAGCPPGVVNVIH 229
             I P+N P M+       A+  GNT V+K +E  P A ++L ++L Q   P GV+NVI 
Sbjct: 122 GAILPWNAPLMLAALKIAPALVAGNTVVLKAAEDAPLAVLLLAEILAQV-LPAGVLNVIT 180

Query: 230 G-AHDAVNFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNHGVIMSDANR 288
           G   +    + DHPD+  ++F GS + GK IY   A     V   +G K+  ++  DA+ 
Sbjct: 181 GYGEECGAALVDHPDVDKVTFTGSTEVGKIIYRAAADRLIPVSLELGGKSPMIVFPDADL 240

Query: 289 SNTLNQLV-GAAFGAAGQRCMALSTVIFVGEAV--EWLPELKKRAEALKVSAGNVPGTDL 345
            + ++  + G  F   GQ C A S  +FV E +   +L +L  +   LK+       TD+
Sbjct: 241 DDAVDGAIAGMRFTRQGQSCTAGSR-LFVHEDIYDAFLEKLVAKLSKLKIGDPLDEATDI 299

Query: 346 GPVISPQAKERILSLIQSGVDE-GATLLLDG 375
           G +IS +   ++   I  G+   GAT+L  G
Sbjct: 300 GAIISEKQFAKVCGYIDLGLSTSGATVLRGG 330


>gnl|CDD|143414 cd07095, ALDH_SGSD_AstD, N-succinylglutamate 5-semialdehyde
           dehydrogenase, AstD-like.  N-succinylglutamate
           5-semialdehyde dehydrogenase or succinylglutamic
           semialdehyde dehydrogenase (SGSD, E. coli AstD,
           EC=1.2.1.71) involved in L-arginine degradation via the
           arginine succinyltransferase (AST) pathway and catalyzes
           the NAD+-dependent reduction of succinylglutamate
           semialdehyde into succinylglutamate.
          Length = 431

 Score =  167 bits (426), Expect = 2e-47
 Identities = 96/317 (30%), Positives = 147/317 (46%), Gaps = 10/317 (3%)

Query: 68  EMESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGD 127
           ++++AVA+A+ A+  W+  S+  R  +L R   ++KAN ++LA  I+ E GK L +A+ +
Sbjct: 1   QVDAAVAAARAAFPGWAALSLEERAAILRRFAELLKANKEELARLISRETGKPLWEAQTE 60

Query: 128 VLRGLQVVEQCCSIPTIMQGETLAGVAKDMDIHSYRV---PLGVTAGITPFNFPAMIPLW 184
           V      +     I      E     A  M      +   P GV A   PFNFP  +P  
Sbjct: 61  V----AAMAGKIDISIKAYHERTGERATPMAQGRAVLRHRPHGVMAVFGPFNFPGHLPNG 116

Query: 185 MFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAGCPPGVVNVIHGAHDAVNFICDHPDI 244
               A+  GNT V KPSE  P    ++V++  +AG PPGV+N++ G  +    +  H  I
Sbjct: 117 HIVPALLAGNTVVFKPSELTPAVAELMVELWEEAGLPPGVLNLVQGGRETGEALAAHEGI 176

Query: 245 RAISFVGSDQAGKYIYERGAKN-GKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAA 303
             + F GS   G  ++ + A   GK +   MG  N  V+   A+       +V +AF  A
Sbjct: 177 DGLLFTGSAATGLLLHRQFAGRPGKILALEMGGNNPLVVWDVADIDAAAYLIVQSAFLTA 236

Query: 304 GQRCMALSTVIFVGEAV--EWLPELKKRAEALKVSAGNVPGTDLGPVISPQAKERILSLI 361
           GQRC     +I    AV   +L  L + A+ L++ A +     +GP+I   A  R L   
Sbjct: 237 GQRCTCARRLIVPDGAVGDAFLERLVEAAKRLRIGAPDAEPPFMGPLIIAAAAARYLLAQ 296

Query: 362 QSGVDEGATLLLDGRNL 378
           Q  +  G   LL    L
Sbjct: 297 QDLLALGGEPLLAMERL 313


>gnl|CDD|143425 cd07107, ALDH_PhdK-like, Nocardioides 2-carboxybenzaldehyde
           dehydrogenase, PhdK-like.  Nocardioides sp. strain
           KP72-carboxybenzaldehyde dehydrogenase (PhdK), an enzyme
           involved in phenanthrene degradation, and other similar
           sequences, are present in this CD.
          Length = 456

 Score =  167 bits (425), Expect = 6e-47
 Identities = 103/330 (31%), Positives = 172/330 (52%), Gaps = 7/330 (2%)

Query: 51  NPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANMKQLA 110
           NPAT +V+ RVP A+ A+++ AVA+A+ A+  W  T+ + R ++L  L   ++ + ++LA
Sbjct: 3   NPATGQVLARVPAASAADVDRAVAAARAAFPEWRATTPLERARMLRELATRLREHAEELA 62

Query: 111 ENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGETLAGVAKDMDIHSYRVPLGVTA 170
                + G  ++   GDV+    +++    + T ++GET+    +++  ++ R P GV A
Sbjct: 63  LIDALDCGNPVSAMLGDVMVAAALLDYFAGLVTELKGETIPVGGRNLH-YTLREPYGVVA 121

Query: 171 GITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAGCPPGVVNVIHG 230
            I  FN P M         +A GNT V+KP E+ P + + L + L +   PPGV N++ G
Sbjct: 122 RIVAFNHPLMFAAAKIAAPLAAGNTVVVKPPEQAPLSALRLAE-LAREVLPPGVFNILPG 180

Query: 231 AHDAV-NFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNHGVIMSDANRS 289
                   +  HPD++ I+ +GS   G+ I    A+  K V   +G KN  ++  DA+  
Sbjct: 181 DGATAGAALVRHPDVKRIALIGSVPTGRAIMRAAAEGIKHVTLELGGKNALIVFPDADPE 240

Query: 290 NTLNQLV-GAAFGAAGQRCMALSTVIFVGEAV--EWLPELKKRAEALKVSAGNVPGTDLG 346
              +  V G  F   GQ C + S + FV E++  E L  + +R  A+KV     P T +G
Sbjct: 241 AAADAAVAGMNFTWCGQSCGSTSRL-FVHESIYDEVLARVVERVAAIKVGDPTDPATTMG 299

Query: 347 PVISPQAKERILSLIQSGVDEGATLLLDGR 376
           P++S Q  +R++  I S   EGA L+  G 
Sbjct: 300 PLVSRQQYDRVMHYIDSAKREGARLVTGGG 329


>gnl|CDD|143458 cd07140, ALDH_F1L_FTFDH, 10-formyltetrahydrofolate dehydrogenase,
           ALDH family 1L.  10-formyltetrahydrofolate dehydrogenase
           (FTHFDH, EC=1.5.1.6), also known as aldehyde
           dehydrogenase family 1 member L1 (ALDH1L1) in humans, is
           a multi-domain homotetramer with an N-terminal formyl
           transferase domain and a C-terminal ALDH domain. FTHFDH
           catalyzes an NADP+-dependent dehydrogenase reaction
           resulting in the conversion of 10-formyltetrahydrofolate
           to tetrahydrofolate and CO2. The ALDH domain is also
           capable of the oxidation of short chain aldehydes to
           their corresponding acids.
          Length = 486

 Score =  168 bits (426), Expect = 6e-47
 Identities = 100/361 (27%), Positives = 191/361 (52%), Gaps = 15/361 (4%)

Query: 30  TKLYIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYET--WSKTS 87
            +L+I+G+FV+++     +  NP    V+ +V  AT  +++ AVA+AK+A+E   W K +
Sbjct: 6   HQLFINGEFVDAEGGKTYNTINPTDGSVICKVSLATVEDVDRAVAAAKEAFENGEWGKMN 65

Query: 88  IITRQQVLFRLQNIIKANMKQLA--ENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIM 145
              R ++++RL ++++ + ++LA  E++      TLA  +  V   +Q           +
Sbjct: 66  ARDRGRLMYRLADLMEEHQEELATIESLDSGAVYTLA-LKTHVGMSIQTFRYFAGWCDKI 124

Query: 146 QGETL----AGVAKDMDIHSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPS 201
           QG+T+    A   +++ + + R P+GV   + P+N+P M+  W     +A GNT V+KP+
Sbjct: 125 QGKTIPINQARPNRNLTL-TKREPIGVCGIVIPWNYPLMMLAWKMAACLAAGNTVVLKPA 183

Query: 202 ERDPGACMMLVDMLTQAGCPPGVVNVIHGAHDAV-NFICDHPDIRAISFVGSDQAGKYIY 260
           +  P   +   ++  +AG P GV+N++ G+   V   + DHPD+R + F GS   GK+I 
Sbjct: 184 QVTPLTALKFAELTVKAGFPKGVINILPGSGSLVGQRLSDHPDVRKLGFTGSTPIGKHIM 243

Query: 261 ERGAK-NGKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEA 319
           +  A  N K+V   +G K+  +I +D +    +   + + F   G+ C+A    +FV E+
Sbjct: 244 KSCAVSNLKKVSLELGGKSPLIIFADCDMDKAVRMGMSSVFFNKGENCIAAGR-LFVEES 302

Query: 320 V--EWLPELKKRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGRN 377
           +  E++  + +  + +K+       TD GP       ++++   + GV EGATL+  G+ 
Sbjct: 303 IHDEFVRRVVEEVKKMKIGDPLDRSTDHGPQNHKAHLDKLVEYCERGVKEGATLVYGGKQ 362

Query: 378 L 378
           +
Sbjct: 363 V 363


>gnl|CDD|143460 cd07142, ALDH_F2BC, Arabidosis aldehyde dehydrogenase family 2 B4,
           B7, C4-like.  Included in this CD is the Arabidosis
           aldehyde dehydrogenase family 2 members B4 and B7
           (EC=1.2.1.3),  which are mitochondrial homotetramers
           that oxidize acetaldehyde and glycolaldehyde, but not
           L-lactaldehyde. Also in this group, is the Arabidosis
           cytosolic, homotetramer ALDH2C4 (EC=1.2.1.3), an enzyme
           involved in the oxidation of sinapalehyde and
           coniferaldehyde.
          Length = 476

 Score =  167 bits (425), Expect = 8e-47
 Identities = 108/358 (30%), Positives = 180/358 (50%), Gaps = 13/358 (3%)

Query: 30  TKLYIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYE--TWSKTS 87
           TKL+I+G+FV++ +       +P   EV+  V +    +++ AV +A+KA++   W + +
Sbjct: 4   TKLFINGQFVDAASGKTFPTIDPRNGEVIAHVAEGDAEDVDRAVKAARKAFDEGPWPRMT 63

Query: 88  IITRQQVLFRLQNIIKANMKQLAENITEEQGKTLADAE-GDVLRGLQVVEQCCSIPTIMQ 146
              R ++L R  ++++ +  +LA   T + GK    A   +V    ++          + 
Sbjct: 64  GYERSRILLRFADLLEKHADELAALETWDNGKPYEQARYAEVPLAARLFRYYAGWADKIH 123

Query: 147 GETLAGVAKDMDIHSYRV--PLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERD 204
           G TL     D   H Y +  P+GV   I P+NFP ++  W    A+ACGNT V+KP+E+ 
Sbjct: 124 GMTLPA---DGPHHVYTLHEPIGVVGQIIPWNFPLLMFAWKVGPALACGNTIVLKPAEQT 180

Query: 205 PGACMMLVDMLTQAGCPPGVVNVIHGAHDAVNF-ICDHPDIRAISFVGSDQAGKYIYERG 263
           P + ++   +  +AG P GV+N++ G        I  H D+  ++F GS + GK I +  
Sbjct: 181 PLSALLAAKLAAEAGLPDGVLNIVTGFGPTAGAAIASHMDVDKVAFTGSTEVGKIIMQLA 240

Query: 264 AK-NGKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV-- 320
           AK N K V   +G K+  ++  DA+    +     A F   GQ C A S   FV E++  
Sbjct: 241 AKSNLKPVTLELGGKSPFIVCEDADVDKAVELAHFALFFNQGQCCCAGSRT-FVHESIYD 299

Query: 321 EWLPELKKRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGRNL 378
           E++ + K RA    V      G + GP +  +  E+ILS I+ G +EGATL+  G  +
Sbjct: 300 EFVEKAKARALKRVVGDPFRKGVEQGPQVDKEQFEKILSYIEHGKEEGATLITGGDRI 357


>gnl|CDD|143438 cd07120, ALDH_PsfA-ACA09737, Pseudomonas putida aldehyde
           dehydrogenase PsfA (ACA09737)-like.  Included in this CD
           is the aldehyde dehydrogenase (PsfA, locus ACA09737) of
           Pseudomonas putida involved in furoic acid metabolism.
           Transcription of psfA was induced in response to
           2-furoic acid, furfuryl alcohol, and furfural.
          Length = 455

 Score =  164 bits (417), Expect = 9e-46
 Identities = 97/342 (28%), Positives = 161/342 (47%), Gaps = 31/342 (9%)

Query: 51  NPATNEVVTRVPKATPAEMESAVASAKKAYE--TWSKTSIITRQQVLFRLQNIIKANMKQ 108
           +PAT EV+        AE E+A+A+A++A++   W+    + R +VL  L +  +AN ++
Sbjct: 3   DPATGEVIGTYADGGVAEAEAAIAAARRAFDETDWAHDPRL-RARVLLELADAFEANAER 61

Query: 109 LAENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGETLAGVAKDMDIHS------- 161
           LA  +  E GK L +A  ++  G        +I  +      AG+A+             
Sbjct: 62  LARLLALENGKILGEARFEI-SG--------AISEL---RYYAGLARTEAGRMIEPEPGS 109

Query: 162 ----YRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQ 217
                R P+GV   I P+N P ++ +     A+A G T V+KP+ +       ++ +L +
Sbjct: 110 FSLVLREPMGVAGIIVPWNSPVVLLVRSLAPALAAGCTVVVKPAGQTAQINAAIIRILAE 169

Query: 218 A-GCPPGVVNVIHGA-HDAVNFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMG 275
               P GVVN+   +  +    +   PD+  ISF GS   G+ I    A   KR+   +G
Sbjct: 170 IPSLPAGVVNLFTESGSEGAAHLVASPDVDVISFTGSTATGRAIMAAAAPTLKRLGLELG 229

Query: 276 AKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELKKRAEAL 333
            K   ++  DA+    L +L  A    AGQ CMA S V+ V  ++  E    L  R  A+
Sbjct: 230 GKTPCIVFDDADLDAALPKLERALTIFAGQFCMAGSRVL-VQRSIADEVRDRLAARLAAV 288

Query: 334 KVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDG 375
           KV  G  P +D+GP+I     +R+  +++  +  GA ++L G
Sbjct: 289 KVGPGLDPASDMGPLIDRANVDRVDRMVERAIAAGAEVVLRG 330


>gnl|CDD|143464 cd07146, ALDH_PhpJ, Streptomyces putative phosphonoformaldehyde
           dehydrogenase PhpJ-like.  Putative phosphonoformaldehyde
           dehydrogenase (PhpJ), an aldehyde dehydrogenase homolog
           reportedly involved in the biosynthesis of
           phosphinothricin tripeptides in Streptomyces
           viridochromogenes DSM 40736, and similar sequences are
           included in this CD.
          Length = 451

 Score =  162 bits (411), Expect = 5e-45
 Identities = 105/340 (30%), Positives = 164/340 (48%), Gaps = 19/340 (5%)

Query: 47  IDLHNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQQ---VLFRLQNIIK 103
           +++ NP T EVV  VP  T   +  A+A A       S  S +TR Q   +L +   +++
Sbjct: 1   LEVRNPYTGEVVGTVPAGTEEALREALALAA------SYRSTLTRYQRSAILNKAAALLE 54

Query: 104 ANMKQLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGET----LAGVAKDMDI 159
           A  ++ A  IT E G  L D   +V R   V+    +      GE+    L    K   I
Sbjct: 55  ARREEFARLITLESGLCLKDTRYEVGRAADVLRFAAAEALRDDGESFSCDLTANGKARKI 114

Query: 160 HSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAG 219
            + R PLGV   ITPFN P          AIA  N  V+KPSE+ P + + L D+L +AG
Sbjct: 115 FTLREPLGVVLAITPFNHPLNQVAHKIAPAIAANNRIVLKPSEKTPLSAIYLADLLYEAG 174

Query: 220 CPPGVVNVIHGA-HDAVNFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKN 278
            PP +++V+ G   +  + +  HPD+  ++F G    GK I        KR    +G  +
Sbjct: 175 LPPDMLSVVTGEPGEIGDELITHPDVDLVTFTGGVAVGKAIAATAG--YKRQLLELGGND 232

Query: 279 HGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELKKRAEALKVS 336
             ++M DA+        V  ++  +GQRC A+   I V E+V  E++  L +++ AL V 
Sbjct: 233 PLIVMDDADLERAATLAVAGSYANSGQRCTAVKR-ILVHESVADEFVDLLVEKSAALVVG 291

Query: 337 AGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGR 376
               P TD+G VI  +A  +I + ++  + +GA +LL  +
Sbjct: 292 DPMDPATDMGTVIDEEAAIQIENRVEEAIAQGARVLLGNQ 331


>gnl|CDD|215410 PLN02766, PLN02766, coniferyl-aldehyde dehydrogenase.
          Length = 501

 Score =  162 bits (412), Expect = 9e-45
 Identities = 105/357 (29%), Positives = 189/357 (52%), Gaps = 9/357 (2%)

Query: 27  IATTKLYIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYE--TWS 84
           I  TKL+I+G+FV++ +    +  +P T EV+ R+ +    +++ AV +A++A++   W 
Sbjct: 18  IKFTKLFINGEFVDAASGKTFETRDPRTGEVIARIAEGDKEDVDLAVKAAREAFDHGPWP 77

Query: 85  KTSIITRQQVLFRLQNIIKANMKQLAENITEEQGKTLADAEG-DVLRGLQVVEQCCSIPT 143
           + S   R +++ +  ++I+ ++++LA   T + GK  A  +  D+     ++        
Sbjct: 78  RMSGFERGRIMMKFADLIEEHIEELAALDTIDAGKLFALGKAVDIPAAAGLLRYYAGAAD 137

Query: 144 IMQGETLAGVAKDMDIHSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSER 203
            + GETL  +++ +  ++ + P+GV   I P+NFP+ +       A+A G T V+KP+E+
Sbjct: 138 KIHGETLK-MSRQLQGYTLKEPIGVVGHIIPWNFPSTMFFMKVAPALAAGCTMVVKPAEQ 196

Query: 204 DPGACMMLVDMLTQAGCPPGVVNVIHG-AHDAVNFICDHPDIRAISFVGSDQAGKYIYER 262
            P + +    +   AG P GV+NV+ G    A   I  H D+  +SF GS + G+ I + 
Sbjct: 197 TPLSALFYAHLAKLAGVPDGVINVVTGFGPTAGAAIASHMDVDKVSFTGSTEVGRKIMQA 256

Query: 263 GAK-NGKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV- 320
            A  N K+V   +G K+  +I  DA+    ++  +   F   G+ C+A S V +V E + 
Sbjct: 257 AATSNLKQVSLELGGKSPLLIFDDADVDMAVDLALLGIFYNKGEICVASSRV-YVQEGIY 315

Query: 321 -EWLPELKKRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGR 376
            E++ +L ++A+   V     P    GP +  Q  E+ILS I+ G  EGATLL  G+
Sbjct: 316 DEFVKKLVEKAKDWVVGDPFDPRARQGPQVDKQQFEKILSYIEHGKREGATLLTGGK 372


>gnl|CDD|143418 cd07100, ALDH_SSADH1_GabD1, Mycobacterium tuberculosis
           succinate-semialdehyde dehydrogenase 1-like.
           Succinate-semialdehyde dehydrogenase 1 (SSADH1, GabD1,
           EC=1.2.1.16) catalyzes the NADP(+)-dependent oxidation
           of succinate semialdehyde (SSA)  to succinate.  SSADH
           activity in Mycobacterium tuberculosis (Mtb) is encoded
           by both gabD1 (Rv0234c) and gabD2 (Rv1731).  The Mtb
           GabD1 SSADH1 reportedly is an enzyme of the
           gamma-aminobutyrate shunt, which forms a functional link
           between two TCA half-cycles by converting
           alpha-ketoglutarate to succinate.
          Length = 429

 Score =  161 bits (409), Expect = 9e-45
 Identities = 94/321 (29%), Positives = 149/321 (46%), Gaps = 28/321 (8%)

Query: 69  MESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDV 128
           +E+A+  A  A+  W KTS   R  +L +L ++++    +LA  IT E GK +A+A  +V
Sbjct: 1   IEAALDRAHAAFLAWRKTSFAERAALLRKLADLLRERKDELARLITLEMGKPIAEARAEV 60

Query: 129 LRGLQVVEQCCSI--------PTIMQGETLAGVAKDMDIHSYRVPLGVTAGITPFNFPAM 180
                  E+C  I           +  E +   A    +  Y  PLGV  GI P+NFP  
Sbjct: 61  -------EKCAWICRYYAENAEAFLADEPIETDAGKAYV-RYE-PLGVVLGIMPWNFP-- 109

Query: 181 IPLW-MFPVAIAC---GNTHVIKPSERDPGACMMLVDMLTQAGCPPGVVNVIHGAHDAVN 236
              W +F  A      GNT ++K +   PG  + + ++  +AG P GV   +    D V 
Sbjct: 110 --FWQVFRFAAPNLMAGNTVLLKHASNVPGCALAIEELFREAGFPEGVFQNLLIDSDQVE 167

Query: 237 FICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNHGVIMSDANRSNTLNQLV 296
            I   P +R ++  GS++AG+ +     KN K+    +G  +  +++ DA+    +   V
Sbjct: 168 AIIADPRVRGVTLTGSERAGRAVAAEAGKNLKKSVLELGGSDPFIVLDDADLDKAVKTAV 227

Query: 297 GAAFGAAGQRCMALSTVIFVGEAV--EWLPELKKRAEALKVSAGNVPGTDLGPVISPQAK 354
                 AGQ C+A +    V E V  E+L +  +   ALKV       TDLGP+     +
Sbjct: 228 KGRLQNAGQSCIA-AKRFIVHEDVYDEFLEKFVEAMAALKVGDPMDEDTDLGPLARKDLR 286

Query: 355 ERILSLIQSGVDEGATLLLDG 375
           + +   ++  V  GATLLL G
Sbjct: 287 DELHEQVEEAVAAGATLLLGG 307


>gnl|CDD|143417 cd07099, ALDH_DDALDH, Methylomonas sp.
           4,4'-diapolycopene-dialdehyde dehydrogenase-like.  The
           4,4'-diapolycopene-dialdehyde dehydrogenase (DDALDH)
           involved in C30 carotenoid synthesis in Methylomonas sp.
           strain 16a and other similar sequences are present in
           this CD. DDALDH converts 4,4'-diapolycopene-dialdehyde
           into 4,4'-diapolycopene-diacid.
          Length = 453

 Score =  160 bits (407), Expect = 2e-44
 Identities = 95/337 (28%), Positives = 160/337 (47%), Gaps = 7/337 (2%)

Query: 50  HNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANMKQL 109
            NPAT EV+  VP   PAE+ +AVA A+ A   W+   +  R Q L R +  +  +  +L
Sbjct: 1   RNPATGEVLGEVPVTDPAEVAAAVARARAAQRAWAALGVEGRAQRLLRWKRALADHADEL 60

Query: 110 AENITEEQGKTLADAEGDVLRGLQVVEQCCS-IPTIMQGETLAG--VAKDMDIHSYRVPL 166
           AE +  E GK  ADA  +VL  L+ ++      P ++    +    +  +        P 
Sbjct: 61  AELLHAETGKPRADAGLEVLLALEAIDWAARNAPRVLAPRKVPTGLLMPNKKATVEYRPY 120

Query: 167 GVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAGCPPGVVN 226
           GV   I+P+N+P + P+     A+A GN  V+KPSE  P    +L +    AG P GV+ 
Sbjct: 121 GVVGVISPWNYPLLTPMGDIIPALAAGNAVVLKPSEVTPLVGELLAEAWAAAGPPQGVLQ 180

Query: 227 VIHGAHDAVNFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNHGVIMSDA 286
           V+ G       + D   +  ++F GS   G+ +    A+    V   +G K+  ++++DA
Sbjct: 181 VVTGDGATGAALIDAG-VDKVAFTGSVATGRKVMAAAAERLIPVVLELGGKDPMIVLADA 239

Query: 287 NRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELKKRAEALKVSAGNVPGTD 344
           +        V  A   AGQ C+++   ++V E+V  E++  L  +A AL+  A ++   D
Sbjct: 240 DLERAAAAAVWGAMVNAGQTCISVE-RVYVHESVYDEFVARLVAKARALRPGADDIGDAD 298

Query: 345 LGPVISPQAKERILSLIQSGVDEGATLLLDGRNLKVG 381
           +GP+ + +  + +   +   V +GA  L  G     G
Sbjct: 299 IGPMTTARQLDIVRRHVDDAVAKGAKALTGGARSNGG 335


>gnl|CDD|143402 cd07083, ALDH_P5CDH, ALDH subfamily NAD+-dependent
           delta(1)-pyrroline-5-carboxylate dehydrogenase-like.
           ALDH subfamily of the NAD+-dependent,
           delta(1)-pyrroline-5-carboxylate dehydrogenases (P5CDH,
           EC=1.5.1.12). The proline catabolic enzymes, proline
           dehydrogenase and P5CDH catalyze the two-step oxidation
           of proline to glutamate.  P5CDH catalyzes the oxidation
           of glutamate semialdehyde, utilizing NAD+ as the
           electron acceptor. In some bacteria, the two enzymes are
           fused into the bifunctional flavoenzyme, proline
           utilization A (PutA). These enzymes play important roles
           in cellular redox control, superoxide generation, and
           apoptosis. In certain prokaryotes such as Escherichia
           coli, PutA is also a transcriptional repressor of the
           proline utilization genes. Monofunctional enzyme
           sequences such as those seen in the Bacillus RocA P5CDH
           are also present in this subfamily as well as the human
           ALDH4A1 P5CDH and the Drosophila Aldh17 P5CDH.
          Length = 500

 Score =  159 bits (404), Expect = 1e-43
 Identities = 114/371 (30%), Positives = 183/371 (49%), Gaps = 17/371 (4%)

Query: 17  MKTFR-QFIRHIATTKLYIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVAS 75
           ++  + +F R      L I G++V++K      +   A +EVV    KA  AE E+A+ +
Sbjct: 8   LRRVKEEFGRAYP---LVIGGEWVDTKERMVS-VSPFAPSEVVGTTAKADKAEAEAALEA 63

Query: 76  AKKAYETWSKTSIITRQQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVLRGLQVV 135
           A  A++TW       R ++L +  ++++   ++L   +T E GK   +A  DV   +  +
Sbjct: 64  AWAAFKTWKDWPQEDRARLLLKAADLLRRRRRELIATLTYEVGKNWVEAIDDVAEAIDFI 123

Query: 136 E--QCCSIPTIMQGETLAGVAKDMDIHSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACG 193
                 ++        +     + +  S+ V LG    I+P+NFP  I   M    +A G
Sbjct: 124 RYYARAALRLRYPAVEVVPYPGEDN-ESFYVGLGAGVVISPWNFPVAIFTGMIVAPVAVG 182

Query: 194 NTHVIKPSERDPGACMMLVDMLTQAGCPPGVVNVIHGAHDAV-NFICDHPDIRAISFVGS 252
           NT + KP+E        + ++  +AG PPGVV  + G  + V  ++ +H  IR I+F GS
Sbjct: 183 NTVIAKPAEDAVVVGYKVFEIFHEAGFPPGVVQFLPGVGEEVGAYLTEHERIRGINFTGS 242

Query: 253 DQAGKYIYERGAKNG------KRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQR 306
            + GK IYE  A+        KR+    G KN  ++   A+    +  +V +AFG  GQ+
Sbjct: 243 LETGKKIYEAAARLAPGQTWFKRLYVETGGKNAIIVDETADFELVVEGVVVSAFGFQGQK 302

Query: 307 CMALSTVIFVGEAVEWLPE-LKKRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGV 365
           C A S +I    A E + E L KRAE L V      GTDLGPVI  + + ++LS I+ G 
Sbjct: 303 CSAASRLILTQGAYEPVLERLLKRAERLSVGPPEENGTDLGPVIDAEQEAKVLSYIEHGK 362

Query: 366 DEGATLLLDGR 376
           +EG  L+L G+
Sbjct: 363 NEG-QLVLGGK 372


>gnl|CDD|182108 PRK09847, PRK09847, gamma-glutamyl-gamma-aminobutyraldehyde
           dehydrogenase; Provisional.
          Length = 494

 Score =  157 bits (399), Expect = 5e-43
 Identities = 105/359 (29%), Positives = 185/359 (51%), Gaps = 17/359 (4%)

Query: 30  TKLYIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYET--WSKTS 87
            +L+I+G++  +   +  +  +P T   + ++ +    +++ AV++A+  +E   WS +S
Sbjct: 20  NRLFINGEYTAAAENETFETVDPVTQAPLAKIARGKSVDIDRAVSAARGVFERGDWSLSS 79

Query: 88  IITRQQVLFRLQNIIKANMKQLAENITEEQGKTLADA-EGDVLRGLQVVEQCCSIPTIMQ 146
              R+ VL +L ++++A+ ++LA   T + GK +  +   D+    + +         + 
Sbjct: 80  PAKRKAVLNKLADLMEAHAEELALLETLDTGKPIRHSLRDDIPGAARAIRWYAEAIDKVY 139

Query: 147 GETLAGVAKDMDIHSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPG 206
           GE +A  +        R P+GV A I P+NFP ++  W    A+A GN+ ++KPSE+ P 
Sbjct: 140 GE-VATTSSHELAMIVREPVGVIAAIVPWNFPLLLTCWKLGPALAAGNSVILKPSEKSPL 198

Query: 207 ACMMLVDMLTQAGCPPGVVNVIHG-AHDAVNFICDHPDIRAISFVGSDQAGKYIY-ERGA 264
           + + L  +  +AG P GV+NV+ G  H+A   +  H DI AI+F GS + GK +  + G 
Sbjct: 199 SAIRLAGLAKEAGLPDGVLNVVTGFGHEAGQALSRHNDIDAIAFTGSTRTGKQLLKDAGD 258

Query: 265 KNGKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAA----FGAAGQRCMALSTVIFVGEAV 320
            N KRV    G K+  ++ +D      L Q   A     F   GQ C+A  T + + E++
Sbjct: 259 SNMKRVWLEAGGKSANIVFADC---PDLQQAASATAAGIFYNQGQVCIA-GTRLLLEESI 314

Query: 321 --EWLPELKKRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGRN 377
             E+L  LK++A+  +      P T +G +I     + + S I+ G  +G  LLLDGRN
Sbjct: 315 ADEFLALLKQQAQNWQPGHPLDPATTMGTLIDCAHADSVHSFIREGESKG-QLLLDGRN 372


>gnl|CDD|215259 PLN02466, PLN02466, aldehyde dehydrogenase family 2 member.
          Length = 538

 Score =  154 bits (390), Expect = 2e-41
 Identities = 116/372 (31%), Positives = 179/372 (48%), Gaps = 41/372 (11%)

Query: 27  IATTKLYIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYET--WS 84
           ++ T+L I+G+FV++ +       +P T EV+  V +    ++  AVA+A+KA++   W 
Sbjct: 55  VSYTQLLINGQFVDAASGKTFPTLDPRTGEVIAHVAEGDAEDVNRAVAAARKAFDEGPWP 114

Query: 85  KTSIITRQQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIP-T 143
           K +   R ++L R  ++++ +  +LA   T + GK    +                +P  
Sbjct: 115 KMTAYERSRILLRFADLLEKHNDELAALETWDNGKPYEQSAK------------AELPMF 162

Query: 144 IMQGETLAGVAKDMDIHSYRV-------------PLGVTAGITPFNFPAMIPLWMFPVAI 190
                  AG A    IH   V             P+GV   I P+NFP ++  W    A+
Sbjct: 163 ARLFRYYAGWAD--KIHGLTVPADGPHHVQTLHEPIGVAGQIIPWNFPLLMFAWKVGPAL 220

Query: 191 ACGNTHVIKPSERDPGACMMLVDMLTQAGCPPGVVNVIHG-AHDAVNFICDHPDIRAISF 249
           ACGNT V+K +E+ P + +    +L +AG PPGV+NV+ G    A   +  H D+  ++F
Sbjct: 221 ACGNTIVLKTAEQTPLSALYAAKLLHEAGLPPGVLNVVSGFGPTAGAALASHMDVDKLAF 280

Query: 250 VGSDQAGKYIYERGAK-NGKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCM 308
            GS   GK + E  AK N K V   +G K+  ++  DA+    +     A F   GQ C 
Sbjct: 281 TGSTDTGKIVLELAAKSNLKPVTLELGGKSPFIVCEDADVDKAVELAHFALFFNQGQCCC 340

Query: 309 ALSTVIFVGEAV--EWLPELKKRAEALKVSAGNVP---GTDLGPVISPQAKERILSLIQS 363
           A S   FV E V  E++   K +A ALK   G+ P   G + GP I  +  E+IL  I+S
Sbjct: 341 AGSRT-FVHERVYDEFVE--KAKARALKRVVGD-PFKKGVEQGPQIDSEQFEKILRYIKS 396

Query: 364 GVDEGATLLLDG 375
           GV+ GATL   G
Sbjct: 397 GVESGATLECGG 408


>gnl|CDD|143448 cd07130, ALDH_F7_AASADH, NAD+-dependent alpha-aminoadipic
           semialdehyde dehydrogenase, ALDH family members 7A1 and
           7B.  Alpha-aminoadipic semialdehyde dehydrogenase
           (AASADH, EC=1.2.1.31), also known as ALDH7A1,
           Antiquitin-1, ALDH7B, or
           delta-1-piperideine-6-carboxylate dehydrogenase (P6CDH),
           is a NAD+-dependent ALDH. Human ALDH7A1 is involved in
           the pipecolic acid pathway of lysine catabolism,
           catalyzing the oxidation of alpha-aminoadipic
           semialdehyde to alpha-aminoadipate.  Arabidopsis
           thaliana ALDH7B4 appears to be an
           osmotic-stress-inducible ALDH gene encoding a
           turgor-responsive or stress-inducible ALDH. The
           Streptomyces clavuligerus P6CDH appears to be involved
           in cephamycin biosynthesis, catalyzing the second stage
           of the two-step conversion of lysine to
           alpha-aminoadipic acid.  The ALDH7A1 enzyme and others
           in this group have been observed as tetramers, yet the
           bacterial P6CDH enzyme has been reported as a monomer.
          Length = 474

 Score =  151 bits (385), Expect = 4e-41
 Identities = 95/359 (26%), Positives = 166/359 (46%), Gaps = 27/359 (7%)

Query: 33  YIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQ 92
             DG++        +   +PA  E + RV +ATP + ES + +A++A++ W       R 
Sbjct: 2   VYDGEWGGGGGV--VTSISPANGEPIARVRQATPEDYESTIKAAQEAFKEWRDVPAPKRG 59

Query: 93  QVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGETLAG 152
           +++ ++ + ++   + L + ++ E GK L +  G+V   + + +    +   + G T   
Sbjct: 60  EIVRQIGDALRKKKEALGKLVSLEMGKILPEGLGEVQEMIDICDFAVGLSRQLYGLT--- 116

Query: 153 VAKDMDIHSYR---------VPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSER 203
                 I S R          PLGV   IT FNFP  +  W   +A+ CGN  V KPS  
Sbjct: 117 ------IPSERPGHRMMEQWNPLGVVGVITAFNFPVAVWGWNAAIALVCGNVVVWKPSPT 170

Query: 204 DP---GACMMLV-DMLTQAGCPPGVVNVIHGAHDAVNFICDHPDIRAISFVGSDQAGKYI 259
            P    A   +V  +L + G P  + +++ G  D    +   P +  +SF GS   G+ +
Sbjct: 171 TPLTAIAVTKIVARVLEKNGLPGAIASLVCGGADVGEALVKDPRVPLVSFTGSTAVGRQV 230

Query: 260 YERGAKNGKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEA 319
            +  A    R    +G  N  ++M DA+    +  ++ AA G AGQRC     +I V E+
Sbjct: 231 GQAVAARFGRSLLELGGNNAIIVMEDADLDLAVRAVLFAAVGTAGQRCTTTRRLI-VHES 289

Query: 320 V--EWLPELKKRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGR 376
           +  E L  LKK  + +++      GT +GP+ +  A +  L+ I+    +G T+L  G+
Sbjct: 290 IYDEVLERLKKAYKQVRIGDPLDDGTLVGPLHTKAAVDNYLAAIEEAKSQGGTVLFGGK 348


>gnl|CDD|181873 PRK09457, astD, succinylglutamic semialdehyde dehydrogenase;
           Reviewed.
          Length = 487

 Score =  149 bits (379), Expect = 3e-40
 Identities = 105/376 (27%), Positives = 170/376 (45%), Gaps = 50/376 (13%)

Query: 32  LYIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITR 91
           L+I+G ++  +     +  NP + EV+ +   AT A++++AV +A+ A+  W++ S   R
Sbjct: 3   LWINGDWIAGQGEA-FESRNPVSGEVLWQGNDATAAQVDAAVRAARAAFPAWARLSFEER 61

Query: 92  QQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGETLA 151
           Q ++ R   +++ N ++LAE I  E GK L +A  +V              T M  +   
Sbjct: 62  QAIVERFAALLEENKEELAEVIARETGKPLWEAATEV--------------TAMINKI-- 105

Query: 152 GVAKDMDIHSYRV-------------------PLGVTAGITPFNFPAMIPLWMFPVAIAC 192
                + I +Y                     P GV A   P+NFP  +P      A+  
Sbjct: 106 ----AISIQAYHERTGEKRSEMADGAAVLRHRPHGVVAVFGPYNFPGHLPNGHIVPALLA 161

Query: 193 GNTHVIKPSERDPGACMMLVDMLTQAGCPPGVVNVIHGAHDAVNFICDHPDIRAISFVGS 252
           GNT V KPSE  P    + V +  QAG P GV+N++ G  +    +  HPDI  + F GS
Sbjct: 162 GNTVVFKPSELTPWVAELTVKLWQQAGLPAGVLNLVQGGRETGKALAAHPDIDGLLFTGS 221

Query: 253 DQAGKYIYERGAKN-GKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALS 311
              G  ++ + A    K +   MG  N  VI   A+    ++ ++ +AF +AGQRC   +
Sbjct: 222 ANTGYLLHRQFAGQPEKILALEMGGNNPLVIDEVADIDAAVHLIIQSAFISAGQRCTC-A 280

Query: 312 TVIFV-----GEAVEWLPELKKRAEALKVSAGNV-PGTDLGPVISPQAKERILSLIQSGV 365
             + V     G+A   L  L   A+ L V   +  P   +G VIS QA + +++     +
Sbjct: 281 RRLLVPQGAQGDAF--LARLVAVAKRLTVGRWDAEPQPFMGAVISEQAAQGLVAAQAQLL 338

Query: 366 DEGATLLLDGRNLKVG 381
             G   LL+   L+ G
Sbjct: 339 ALGGKSLLEMTQLQAG 354


>gnl|CDD|132417 TIGR03374, ABALDH, 1-pyrroline dehydrogenase.  Members of this
           protein family are 1-pyrroline dehydrogenase (1.5.1.35),
           also called gamma-aminobutyraldehyde dehydrogenase. This
           enzyme can follow putrescine transaminase (EC 2.6.1.82)
           for a two-step conversion of putrescine to
           gamma-aminobutyric acid (GABA). The member from
           Escherichia coli is characterized as a homotetramer that
           binds one NADH per momomer. This enzyme belongs to the
           medium-chain aldehyde dehydrogenases, and is quite
           similar in sequence to the betaine aldehyde
           dehydrogenase (EC 1.2.1.8) family.
          Length = 472

 Score =  146 bits (370), Expect = 3e-39
 Identities = 99/342 (28%), Positives = 162/342 (47%), Gaps = 19/342 (5%)

Query: 31  KLYIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIIT 90
           KL I+G+ V  +  +   ++NPAT EV+  + +A+  ++++AV +A  A+  W +T+   
Sbjct: 3   KLLINGELVSGEG-EKQPVYNPATGEVILEIAEASAEQVDAAVRAADAAFAEWGQTTPKA 61

Query: 91  RQQVLFRLQNIIKANMKQLAENITEEQGKTL-------ADAEGDVLRGLQVVEQCCSIPT 143
           R + L +L ++I+ N +  AE  +   GK L         A  DV R      +C S   
Sbjct: 62  RAECLLKLADVIEENAQVFAELESRNCGKPLHSVFNDEIPAIVDVFRFFAGAARCLS--G 119

Query: 144 IMQGETLAGVAKDMDIHSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSER 203
           +  GE L G    +     R PLGV A I P+N+P M+  W    A+A GN  V+KPSE 
Sbjct: 120 LAAGEYLEGHTSMIR----RDPLGVVASIAPWNYPLMMAAWKLAPALAAGNCVVLKPSEI 175

Query: 204 DPGACMMLVDMLTQAGCPPGVVNVIHG-AHDAVNFICDHPDIRAISFVGSDQAGKYIYER 262
            P   + L ++      P GVVN++ G      + +  H  +R +S  GS   G++I   
Sbjct: 176 TPLTALKLAELAKDI-FPAGVVNILFGRGKTVGDPLTGHEKVRMVSLTGSIATGEHILSH 234

Query: 263 GAKNGKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV-- 320
            A + KR    +G K   ++  DA+    +  +    F  AGQ C A +  I+    +  
Sbjct: 235 TAPSIKRTHMELGGKAPVIVFDDADIDAVVEGVRTFGFYNAGQDCTA-ACRIYAQRGIYD 293

Query: 321 EWLPELKKRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQ 362
             + +L      LK  A +   T+LGP+ S    ER++  ++
Sbjct: 294 TLVEKLGAAVATLKSGAPDDESTELGPLSSLAHLERVMKAVE 335


>gnl|CDD|143443 cd07125, ALDH_PutA-P5CDH, Delta(1)-pyrroline-5-carboxylate
           dehydrogenase, PutA.  The proline catabolic enzymes of
           the aldehyde dehydrogenase (ALDH) protein superfamily,
           proline dehydrogenase and
           Delta(1)-pyrroline-5-carboxylate dehydrogenase (P5CDH,
           (EC=1.5.1.12 )), catalyze the two-step oxidation of
           proline to glutamate; P5CDH catalyzes the oxidation of
           glutamate semialdehyde, utilizing NAD+ as the electron
           acceptor. In some bacteria, the two enzymes are fused
           into the bifunctional flavoenzyme, proline utilization A
           (PutA) These enzymes play important roles in cellular
           redox control, superoxide generation, and apoptosis. In
           certain prokaryotes such as Escherichia coli, PutA is
           also a transcriptional repressor of the proline
           utilization genes.
          Length = 518

 Score =  145 bits (368), Expect = 2e-38
 Identities = 98/351 (27%), Positives = 162/351 (46%), Gaps = 28/351 (7%)

Query: 34  IDGKFVESKATDWIDLHNPATNEVVT-RVPKATPAEMESAVASAKKAYETWSKTSIITRQ 92
           I+G+  E+   +   + +PA +E     V  A   ++++A+A A  A+  WS T +  R 
Sbjct: 37  INGEETET--GEGAPVIDPADHERTIGEVSLADAEDVDAALAIAAAAFAGWSATPVEERA 94

Query: 93  QVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDV------LR--GLQVVEQCCSIPTI 144
           ++L +  ++++AN  +L      E GKTLADA+ +V       R    Q  E        
Sbjct: 95  EILEKAADLLEANRGELIALAAAEAGKTLADADAEVREAIDFCRYYAAQAREL------- 147

Query: 145 MQGETLAGVAKDMDIHSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERD 204
                L G   +++        GV   I+P+NFP  I       A+A GNT + KP+E+ 
Sbjct: 148 FSDPELPGPTGELNGLELH-GRGVFVCISPWNFPLAIFTGQIAAALAAGNTVIAKPAEQT 206

Query: 205 PGACMMLVDMLTQAGCPPGVVNVIHGAHDAV-NFICDHPDIRAISFVGSDQAGKYIYE-R 262
           P      V++L +AG P  V+ ++ G  + +   +  HP I  + F GS +  K I    
Sbjct: 207 PLIAARAVELLHEAGVPRDVLQLVPGDGEEIGEALVAHPRIDGVIFTGSTETAKLINRAL 266

Query: 263 GAKNGKRVQSN--MGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV 320
             ++G  +      G KN  ++ S A     +  +V +AFG+AGQRC AL  +    E  
Sbjct: 267 AERDGPILPLIAETGGKNAMIVDSTALPEQAVKDVVQSAFGSAGQRCSALRLLYLQEEIA 326

Query: 321 E-WLPELKKRAEALKVSAGNV--PGTDLGPVISPQAKERILSLIQSGVDEG 368
           E ++  LK    +LKV  G+     TD+GP+I   A + + +  +    E 
Sbjct: 327 ERFIEMLKGAMASLKV--GDPWDLSTDVGPLIDKPAGKLLRAHTELMRGEA 375


>gnl|CDD|143416 cd07098, ALDH_F15-22, Aldehyde dehydrogenase family 15A1 and
           22A1-like.  Aldehyde dehydrogenase family members
           ALDH15A1 (Saccharomyces cerevisiae YHR039C) and ALDH22A1
           (Arabidopsis thaliana, EC=1.2.1.3), and similar
           sequences, are in this CD. Significant improvement of
           stress tolerance in tobacco plants was observed by
           overexpressing the ALDH22A1 gene from maize (Zea mays)
           and was accompanied by a reduction of malondialdehyde
           derived from cellular lipid peroxidation.
          Length = 465

 Score =  143 bits (362), Expect = 7e-38
 Identities = 94/347 (27%), Positives = 164/347 (47%), Gaps = 33/347 (9%)

Query: 50  HNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANMKQL 109
           ++PAT + +  VP  TP +++ A+A+A+ A   W+KTS   R++VL  L   I  N +++
Sbjct: 1   YDPATGQHLGSVPADTPEDVDEAIAAARAAQREWAKTSFAERRKVLRSLLKYILENQEEI 60

Query: 110 AENITEEQGKTLADAE-GDVLRGLQVVEQCCSIPTIMQGE-TLAGVAKD----MDIHSYR 163
                 + GKT+ DA  G++L   + +       T+  GE  L   ++     M     R
Sbjct: 61  CRVACRDTGKTMVDASLGEILVTCEKIRW-----TLKHGEKALRPESRPGGLLMFYKRAR 115

Query: 164 V---PLGVTAGITPFNFP------AMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDM 214
           V   PLGV   I  +N+P       +I       A+  GN  V+K SE+   +    + +
Sbjct: 116 VEYEPLGVVGAIVSWNYPFHNLLGPIIA------ALFAGNAIVVKVSEQVAWSSGFFLSI 169

Query: 215 LTQA----GCPPGVVNVIHGAHDAVNFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRV 270
           + +     G  P +V ++    +    +  HP I  I+F+GS   GK +    A++   V
Sbjct: 170 IRECLAACGHDPDLVQLVTCLPETAEALTSHPVIDHITFIGSPPVGKKVMAAAAESLTPV 229

Query: 271 QSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELKK 328
              +G K+  +++ DA+     + ++   F ++GQ C+ +  VI V E +  + L  L  
Sbjct: 230 VLELGGKDPAIVLDDADLDQIASIIMRGTFQSSGQNCIGIERVI-VHEKIYDKLLEILTD 288

Query: 329 RAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDG 375
           R +AL+         D+G +ISP   +R+  L+   V++GA LL  G
Sbjct: 289 RVQALRQGPPLDGDVDVGAMISPARFDRLEELVADAVEKGARLLAGG 335


>gnl|CDD|132284 TIGR03240, arg_catab_astD, succinylglutamic semialdehyde
           dehydrogenase.  Members of this protein family are
           succinylglutamic semialdehyde dehydrogenase (EC
           1.2.1.71), the fourth enzyme in the arginine
           succinyltransferase (AST) pathway for arginine
           catabolism [Energy metabolism, Amino acids and amines].
          Length = 484

 Score =  142 bits (360), Expect = 2e-37
 Identities = 110/359 (30%), Positives = 175/359 (48%), Gaps = 16/359 (4%)

Query: 32  LYIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITR 91
           L+IDGK+   +   +    NPAT EV+ +   A+ A++E+AVA+A+ A+  W++ S+  R
Sbjct: 1   LFIDGKWRAGQGESFSST-NPATQEVLWQGAAASAAQVEAAVAAARAAFPAWARLSLEER 59

Query: 92  QQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQ--GET 149
             V+ R   +++   + LA  I  E GK L +   +V     + +   SI    +  GE+
Sbjct: 60  IAVVQRFAALLEERKEALARVIARETGKPLWETRTEV--ASMIGKVAISIKAYHERTGES 117

Query: 150 LAGVAKDMDIHSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACM 209
              +     +  +R P GV A   P+NFP  +P      A+  GNT V KPSE  P    
Sbjct: 118 ENPMPDGRAVLRHR-PHGVVAVFGPYNFPGHLPNGHIVPALLAGNTVVFKPSELTPWVAE 176

Query: 210 MLVDMLTQAGCPPGVVNVIHGAHDAVNFICDHPDIRAISFVGSDQAGKYIYER-GAKNGK 268
             V +  +AG P GV+N++ GA +    +  HPDI  + F GS   G  ++ +   +  K
Sbjct: 177 ETVKLWEKAGLPAGVLNLVQGARETGVALAAHPDIDGLLFTGSSNTGHLLHRQFAGRPEK 236

Query: 269 RVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFV-----GEAVEWL 323
            +   MG  N  ++   A+    ++ ++ +AF +AGQRC   +  + V     G+A   L
Sbjct: 237 ILALEMGGNNPLIVDEVADIDAAVHLIIQSAFISAGQRC-TCARRLLVPDGAQGDAF--L 293

Query: 324 PELKKRAEALKVSAGNV-PGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGRNLKVG 381
             L + AE L V A +  P   +G VIS QA +R+L+     +  G   LL  R L  G
Sbjct: 294 ARLVEVAERLTVGAWDAEPQPFMGAVISLQAAQRLLAAQAKLLALGGKSLLAMRQLDPG 352


>gnl|CDD|212007 TIGR04284, aldehy_Rv0768, aldehyde dehydrogenase, Rv0768 family.
           This family describes a branch of the aldehyde
           dehydrogenase (NAD) family (see pfam00171) that includes
           Rv0768 from Mycobacterium tuberculosis. All members of
           this family belong to species predicted to synthesize
           mycofactocin, suggesting that this enzyme or another
           upstream or downstream in the same pathway might be
           mycofactocin-dependent. However, the taxonomic range of
           this family is not nearly broad enough to make that
           relationship conclusive [Unknown function, Enzymes of
           unknown specificity].
          Length = 480

 Score =  142 bits (359), Expect = 2e-37
 Identities = 108/368 (29%), Positives = 175/368 (47%), Gaps = 36/368 (9%)

Query: 30  TKLYIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYET--WSKTS 87
           ++L IDGK V   A  +    NPAT EV+     AT A+M++A+A+A++A++   WS   
Sbjct: 1   SRLLIDGKLVAGSAGTF-PTVNPATEEVLGVAADATAADMDAAIAAARRAFDETDWS-RD 58

Query: 88  IITRQQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVLRG-LQVVEQCCSIPTIMQ 146
              R + L +L++ ++A++++L E    E G       G  L G +  +     +     
Sbjct: 59  TALRVRCLRQLRDALRAHVEELRELTIAEVGAPRMLTAGAQLEGPVDDLGFAADLAESYA 118

Query: 147 GETLAGVAKDMDIHSYRV----PLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSE 202
             T  GVA  M I + R      +GV   ITP+NFP  I L     A+A GNT V+KP+ 
Sbjct: 119 WTTDLGVASPMGIPTRRTLRREAVGVVGAITPWNFPHQINLAKLGPALAAGNTVVLKPAP 178

Query: 203 RDPGACMMLVDMLTQ-AGCPPGVVNVIHGAHDAVN-FICDHPDIRAISFVGSDQAGKYIY 260
             P    +L +++ +    PPGVVN++  +   +   +   P +  +SF GS   G+ + 
Sbjct: 179 DTPWCAAVLGELIAEHTDFPPGVVNIVTSSDHRLGALLAKDPRVDMVSFTGSTATGRAVM 238

Query: 261 ERGAKNGKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGA---AGQRCMALSTVIFVG 317
              A   K+V   +G K+  +++ DA+ +   +    AAF     AGQ C A++T + V 
Sbjct: 239 ADAAATLKKVFLELGGKSAFIVLDDADLAAACSM---AAFTVCMHAGQGC-AITTRLVVP 294

Query: 318 EAVEWLPELKKRAEALKVSAGNV----------PGTDLGPVISPQAKERILSLIQSGVDE 367
            A        +  EA+  +A  +          PGT  GPVIS + ++R+ S +   V E
Sbjct: 295 RA--------RYDEAVAAAAATMGSIKPGDPADPGTVCGPVISARQRDRVQSYLDLAVAE 346

Query: 368 GATLLLDG 375
           G      G
Sbjct: 347 GGRFACGG 354


>gnl|CDD|143419 cd07101, ALDH_SSADH2_GabD2, Mycobacterium tuberculosis
           succinate-semialdehyde dehydrogenase 2-like.
           Succinate-semialdehyde dehydrogenase 2 (SSADH2) and
           similar proteins are in this CD. SSADH1 (GabD1,
           EC=1.2.1.16) catalyzes the NADP(+)-dependent oxidation
           of succinate semialdehyde to succinate.  SSADH activity
           in Mycobacterium tuberculosis is encoded by both gabD1
           (Rv0234c) and gabD2 (Rv1731), however ,the Vmax of GabD1
           was shown to be much higher than that of GabD2, and
           GabD2 (SSADH2) is likely to serve physiologically as a
           dehydrogenase for a different aldehyde(s).
          Length = 454

 Score =  140 bits (354), Expect = 6e-37
 Identities = 93/344 (27%), Positives = 154/344 (44%), Gaps = 32/344 (9%)

Query: 50  HNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANMKQL 109
             P T E +  +P++TPA++E+A A A+ A   W+      R  V  R  +++     +L
Sbjct: 1   EAPFTGEPLGELPQSTPADVEAAFARARAAQRAWAARPFAERAAVFLRFHDLVLERRDEL 60

Query: 110 AENITEEQGKTLADAEGDVL--------------RGLQVVEQCCSIPTIMQGETLAGVAK 155
            + I  E GK    A  +VL              R L+   +  +IP + +         
Sbjct: 61  LDLIQLETGKARRHAFEEVLDVAIVARYYARRAERLLKPRRRRGAIPVLTRTTV------ 114

Query: 156 DMDIHSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDML 215
                  R P GV   I+P+N+P  + +     A+  GN  V+KP  +     +  V++L
Sbjct: 115 ------NRRPKGVVGVISPWNYPLTLAVSDAIPALLAGNAVVLKPDSQTALTALWAVELL 168

Query: 216 TQAGCPPGVVNVIHGAHDAV-NFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNM 274
            +AG P  +  V+ G    V   I D+ D   + F GS   G+ + ER  +        +
Sbjct: 169 IEAGLPRDLWQVVTGPGSEVGGAIVDNADY--VMFTGSTATGRVVAERAGRRLIGCSLEL 226

Query: 275 GAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELKKRAEA 332
           G KN  +++ DA+        V A F  AGQ C+++   I+V E+V  E++     R  A
Sbjct: 227 GGKNPMIVLEDADLDKAAAGAVRACFSNAGQLCVSIER-IYVHESVYDEFVRRFVARTRA 285

Query: 333 LKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGR 376
           L++ A    G D+G +IS    +R+ + +   V +GAT+L  GR
Sbjct: 286 LRLGAALDYGPDMGSLISQAQLDRVTAHVDDAVAKGATVLAGGR 329


>gnl|CDD|143434 cd07116, ALDH_ACDHII-AcoD, Ralstonia eutrophus NAD+-dependent
           acetaldehyde dehydrogenase II-like.  Included in this CD
           is the NAD+-dependent, acetaldehyde dehydrogenase II
           (AcDHII, AcoD, EC=1.2.1.3) from Ralstonia (Alcaligenes)
           eutrophus H16 involved in the catabolism of acetoin and
           ethanol, and similar proteins, such as, the dimeric
           dihydrolipoamide dehydrogenase of the acetoin
           dehydrogenase enzyme system of Klebsiella pneumonia.
           Also included are sequences similar to the
           NAD+-dependent chloroacetaldehyde dehydrogenases (AldA
           and AldB) of Xanthobacter autotrophicus GJ10 which are
           involved in the degradation of 1,2-dichloroethane. These
           proteins apparently require RpoN factors for expression.
          Length = 479

 Score =  136 bits (343), Expect = 3e-35
 Identities = 104/364 (28%), Positives = 173/364 (47%), Gaps = 34/364 (9%)

Query: 33  YIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQ 92
           +I G++V     ++ D   P T +V   VP++T  ++E A+ +A  A E W KTS+  R 
Sbjct: 4   FIGGEWVAPVKGEYFDNITPVTGKVFCEVPRSTAEDIELALDAAHAAKEAWGKTSVAERA 63

Query: 93  QVLFRLQNIIKANMKQLAENITEEQGKTLADAEG-------DVLRGLQVV--EQCCSIPT 143
            +L ++ + ++AN++ LA   T + GK + +          D  R        Q  SI  
Sbjct: 64  NILNKIADRMEANLEMLAVAETWDNGKPVRETLAADIPLAIDHFRYFAGCIRAQEGSISE 123

Query: 144 IMQGETLAGVAKDMDIHSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSER 203
           I    T+A        H +  PLGV   I P+NFP ++  W    A+A GN  V+KP+E+
Sbjct: 124 I-DENTVA-------YHFHE-PLGVVGQIIPWNFPLLMATWKLAPALAAGNCVVLKPAEQ 174

Query: 204 DPGACMMLVDMLTQAGCPPGVVNVIHGAH-DAVNFICDHPDIRAISFVGSDQAGKYIYER 262
            P + ++L++++     PPGVVNV++G   +A   +     I  ++F G    G+ I + 
Sbjct: 175 TPASILVLMELIGDL-LPPGVVNVVNGFGLEAGKPLASSKRIAKVAFTGETTTGRLIMQY 233

Query: 263 GAKNGKRVQSNMGAKNHGV----IMS--DANRSNTLNQLVGAAFGAAGQRCMALSTVIFV 316
            ++N   V   +G K+  +    +M   DA     L   V  A    G+ C   S  + +
Sbjct: 234 ASENIIPVTLELGGKSPNIFFADVMDADDAFFDKALEGFVMFALN-QGEVCTCPSRAL-I 291

Query: 317 GEAV--EWLPELKKRAEALKVSAGNV--PGTDLGPVISPQAKERILSLIQSGVDEGATLL 372
            E++   ++    +R +A+K   GN     T +G   S +  E+ILS I  G +EGA +L
Sbjct: 292 QESIYDRFMERALERVKAIK--QGNPLDTETMIGAQASLEQLEKILSYIDIGKEEGAEVL 349

Query: 373 LDGR 376
             G 
Sbjct: 350 TGGE 353


>gnl|CDD|182233 PRK10090, PRK10090, aldehyde dehydrogenase A; Provisional.
          Length = 409

 Score =  134 bits (340), Expect = 3e-35
 Identities = 84/290 (28%), Positives = 138/290 (47%), Gaps = 13/290 (4%)

Query: 95  LFRLQNIIKANMKQLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGETLAGVA 154
           L ++   I+    +++  I EE GK    AE +V      ++         +GE +    
Sbjct: 1   LRKIAAGIRERASEISALIVEEGGKIQQLAEVEVAFTADYIDYMAEWARRYEGEIIQSDR 60

Query: 155 KDMDIHSYRVPLGVTAGITPFNFPA-MIPLWMFPVAIACGNTHVIKPSERDPGACMMLVD 213
              +I  ++  LGVT GI P+NFP  +I   M P A+  GNT VIKPSE  P   +    
Sbjct: 61  PGENILLFKRALGVTTGILPWNFPFFLIARKMAP-ALLTGNTIVIKPSEFTPNNAIAFAK 119

Query: 214 MLTQAGCPPGVVNVIHGAHDAV-NFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQS 272
           ++ + G P GV N++ G  + V   +  +P +  +S  GS  AG+ I    AKN  +V  
Sbjct: 120 IVDEIGLPKGVFNLVLGRGETVGQELAGNPKVAMVSMTGSVSAGEKIMAAAAKNITKVCL 179

Query: 273 NMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELKKRA 330
            +G K   ++M DA+    +  +V +    +GQ C     V +V + +  +++  L +  
Sbjct: 180 ELGGKAPAIVMDDADLDLAVKAIVDSRVINSGQVCNCAERV-YVQKGIYDQFVNRLGEAM 238

Query: 331 EALKVSAGNVPGT----DLGPVISPQAKERILSLIQSGVDEGATLLLDGR 376
           +A  V  GN P       +GP+I+  A ER+   +   V+EGA + L G+
Sbjct: 239 QA--VQFGN-PAERNDIAMGPLINAAALERVEQKVARAVEEGARVALGGK 285


>gnl|CDD|215110 PLN00412, PLN00412, NADP-dependent glyceraldehyde-3-phosphate
           dehydrogenase; Provisional.
          Length = 496

 Score =  134 bits (339), Expect = 1e-34
 Identities = 96/353 (27%), Positives = 169/353 (47%), Gaps = 19/353 (5%)

Query: 31  KLYIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIIT 90
           K Y DG++  S +   + + NP+T +   +V   T  E+  A+ SAK A + W+KT +  
Sbjct: 17  KYYADGEWRTSSSGKSVAITNPSTRKTQYKVQACTQEEVNKAMESAKAAQKAWAKTPLWK 76

Query: 91  RQQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVLRGLQVVEQCCS--IPTIMQGE 148
           R ++L +   I+K +   +AE + +E  K   DA  +V+R   ++       +  + +G+
Sbjct: 77  RAELLHKAAAILKEHKAPIAECLVKEIAKPAKDAVTEVVRSGDLISYTAEEGVRILGEGK 136

Query: 149 TLA-----GVAKDMDIHSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSER 203
            L      G  ++    + ++PLGV   I PFN+P  + +     A+  GN  V+KP  +
Sbjct: 137 FLVSDSFPGNERNKYCLTSKIPLGVVLAIPPFNYPVNLAVSKIAPALIAGNAVVLKPPTQ 196

Query: 204 DPGACMMLVDMLTQAGCPPGVVNVIHG-AHDAVNFICDHPDIRAISFVGSDQAGKYIYER 262
              A + +V     AG P G+++ + G   +  +F+  HP +  ISF G D  G  I   
Sbjct: 197 GAVAALHMVHCFHLAGFPKGLISCVTGKGSEIGDFLTMHPGVNCISFTGGD-TGIAI--- 252

Query: 263 GAKNGKRV--QSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV 320
            +K    V  Q  +G K+  +++ DA+       ++   F  +GQRC A+  V+ +    
Sbjct: 253 -SKKAGMVPLQMELGGKDACIVLEDADLDLAAANIIKGGFSYSGQRCTAVKVVLVMESVA 311

Query: 321 EWLPELKKRAEALKVSAGNVP--GTDLGPVISPQAKERILSLIQSGVDEGATL 371
           + L E K  A+  K++ G  P    D+ PV+S  +   I  L+    ++GAT 
Sbjct: 312 DALVE-KVNAKVAKLTVG-PPEDDCDITPVVSESSANFIEGLVMDAKEKGATF 362


>gnl|CDD|132294 TIGR03250, PhnAcAld_DH, putative phosphonoacetaldehyde
           dehydrogenase.  This family of genes are members of the
           pfam00171 NAD-dependent aldehyde dehydrogenase family.
           These genes are observed in Ralstonia eutropha JMP134,
           Sinorhizobium meliloti 1021, Burkholderia mallei ATCC
           23344, Burkholderia thailandensis E264, Burkholderia
           cenocepacia AU 1054, Burkholderia pseudomallei K96243
           and 1710b, Burkholderia xenovorans LB400, Burkholderia
           sp. 383 and Polaromonas sp. JS666 in close proximity to
           the PhnW gene (TIGR02326) encoding 2-aminoethyl
           phosphonate aminotransferase (which generates
           phosphonoacetaldehyde) and PhnA (TIGR02335) encoding
           phosphonoacetate hydrolase (not to be confused with the
           alkylphosphonate utilization operon protein PhnA modeled
           by TIGR00686). Additionally, transporters believed to be
           specific for 2-aminoethyl phosphonate are often present.
           PhnW is, in other organisms, coupled with PhnX
           (TIGR01422) for the degradation of phosphonoacetaldehyde
           (GenProp0238), but PhnX is apparently absent in each of
           the organisms containing this aldehyde reductase. PhnA,
           characterized in a strain of Pseudomonas fluorescens
           that has not het been genome sequenced, is only rarely
           found outside of the PhnW and aldehyde dehydrogenase
           context. For instance in Rhodopseudomonas and Bordetella
           bronchiseptica, where it is adjacent to transporters
           presumably specific for the import of phosphonoacetate.
           It seems reasonably certain then, that this enzyme
           catalyzes the NAD-dependent oxidation of
           phosphonoacetaldehyde to phosphonoacetate, bridging the
           metabolic gap between PhnW and PhnA. We propose the name
           phosphonoacetaldehyde dehydrogenase and the gene symbol
           PhnY for this enzyme.
          Length = 472

 Score =  132 bits (334), Expect = 5e-34
 Identities = 103/364 (28%), Positives = 164/364 (45%), Gaps = 22/364 (6%)

Query: 24  IRHIATTKLYIDGKFVESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYETW 83
           IR  A   L I G+ V       I++  P    VV  VPKA+  ++  A A A  AY   
Sbjct: 1   IRAEA---LRIAGEKVSRDRV--IEVRYPYNGTVVGTVPKASVDDVRRAFAIAA-AYR-- 52

Query: 84  SKTSIITRQQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIPT 143
              +   R  +L R   ++ A  +++++ IT E G +  D+  +V R   V+    +   
Sbjct: 53  PTLTRYERSAILDRAAALLAARKEEISDLITLESGLSKKDSLYEVGRVADVLTFAAAEAL 112

Query: 144 IMQGET----LAGVAKDMDIHSYRVPL-GVTAGITPFNFPAMIPLWMFPVAIACGNTHVI 198
              G+     L    K   + + R PL GV + ITPFN P          AIA  N  V+
Sbjct: 113 RDDGQIFSCDLTPHGKARKVFTQREPLLGVISAITPFNHPMNQVAHKIAPAIATNNRMVV 172

Query: 199 KPSERDPGACMMLVDMLTQAGCPPGVVNVIHG--AHDAVNFICDHPDIRAISFVGSDQAG 256
           KPSE+ P + + L D+L +AG PP ++ V+ G     A   I  +P +  ++F G    G
Sbjct: 173 KPSEKTPLSALYLADILYEAGLPPQMLQVVTGDPREIADELIT-NPHVDLVTFTGGVAIG 231

Query: 257 KYIYERGAKNGKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFV 316
           KYI  R     +R    +G  +  ++M DA+     +  V  ++  +GQRC A+   + V
Sbjct: 232 KYIAARAGY--RRQVLELGGNDPLIVMEDADLDRAADLAVKGSYKNSGQRCTAVKR-MLV 288

Query: 317 GEAV--EWLPELKKRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLL- 373
            E+V   +   L ++  A +      P  D+G VI   A     + +   + +GA LLL 
Sbjct: 289 QESVADRFTELLVEKTRAWRYGDPMDPSVDMGTVIDEAAAILFEARVNEAIAQGARLLLG 348

Query: 374 DGRN 377
           + R+
Sbjct: 349 NVRD 352


>gnl|CDD|236501 PRK09407, gabD2, succinic semialdehyde dehydrogenase; Reviewed.
          Length = 524

 Score =  130 bits (330), Expect = 3e-33
 Identities = 94/353 (26%), Positives = 159/353 (45%), Gaps = 28/353 (7%)

Query: 39  VESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQQVLFRL 98
           V+  A    ++  P T E +  VP +T A++E+A A A+ A   W+ T +  R  VL R 
Sbjct: 26  VDGAAGPTREVTAPFTGEPLATVPVSTAADVEAAFARARAAQRAWAATPVRERAAVLLRF 85

Query: 99  QNIIKANMKQLAENITEEQGKTLADAEGDVLRGLQVVEQCCS-----IPTIMQGETLAGV 153
            +++  N ++L + +  E GK    A  +VL     V           P ++     AG 
Sbjct: 86  HDLVLENREELLDLVQLETGKARRHAFEEVLD----VALTARYYARRAPKLLAPRRRAGA 141

Query: 154 AKDM-DIHSYRVPLGVTAGITPFNFPA------MIPLWMFPVAIACGNTHVIKPSERDPG 206
              +      R P GV   I+P+N+P        IP      A+  GN  V+KP  + P 
Sbjct: 142 LPVLTKTTELRQPKGVVGVISPWNYPLTLAVSDAIP------ALLAGNAVVLKPDSQTPL 195

Query: 207 ACMMLVDMLTQAGCPPGVVNVIHGAHDAV-NFICDHPDIRAISFVGSDQAGKYIYERGAK 265
             +  V++L +AG P  +  V+ G    V   + D+ D   + F GS   G+ + E+  +
Sbjct: 196 TALAAVELLYEAGLPRDLWQVVTGPGPVVGTALVDNADY--LMFTGSTATGRVLAEQAGR 253

Query: 266 NGKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWL 323
                   +G KN  +++ DA+        V A F  AGQ C+++   I+V E++  E++
Sbjct: 254 RLIGFSLELGGKNPMIVLDDADLDKAAAGAVRACFSNAGQLCISIER-IYVHESIYDEFV 312

Query: 324 PELKKRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGR 376
                   A+++ AG     D+G +IS    E + + +   V +GAT+L  G+
Sbjct: 313 RAFVAAVRAMRLGAGYDYSADMGSLISEAQLETVSAHVDDAVAKGATVLAGGK 365


>gnl|CDD|184426 PRK13968, PRK13968, putative succinate semialdehyde dehydrogenase;
           Provisional.
          Length = 462

 Score =  129 bits (325), Expect = 7e-33
 Identities = 91/346 (26%), Positives = 160/346 (46%), Gaps = 17/346 (4%)

Query: 41  SKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQQVLFRLQN 100
           + AT  I + NPAT E ++ +P A   ++E+A+  A   +  W +T+I  R Q L  +  
Sbjct: 4   TPATHAISV-NPATGEQLSVLPWAGADDIENALQLAAAGFRDWRETNIDYRAQKLRDIGK 62

Query: 101 IIKANMKQLAENITEEQGKTLADAEGDVLRGLQVVEQCCSI-PTIMQGE-TLAGVAKDMD 158
            ++A  +++A+ IT E GK +  A  +V +   + +      P +++ E TL  V     
Sbjct: 63  ALRARSEEMAQMITREMGKPINQARAEVAKSANLCDWYAEHGPAMLKAEPTL--VENQQA 120

Query: 159 IHSYRVPLGVTAGITPFNFPAMIPLWMF-----PVAIACGNTHVIKPSERDPGACMMLVD 213
           +  YR PLG    I P+NF    PLW       P+ +A GN +++K +    G   ++  
Sbjct: 121 VIEYR-PLGTILAIMPWNF----PLWQVMRGAVPILLA-GNGYLLKHAPNVMGCAQLIAQ 174

Query: 214 MLTQAGCPPGVVNVIHGAHDAVNFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSN 273
           +   AG P GV   ++  +D V+ + +   I A++  GS +AG  I  +     K+    
Sbjct: 175 VFKDAGIPQGVYGWLNADNDGVSQMINDSRIAAVTVTGSVRAGAAIGAQAGAALKKCVLE 234

Query: 274 MGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIF-VGEAVEWLPELKKRAEA 332
           +G  +  ++++DA+    +   V   +   GQ C A    I   G A  +       A A
Sbjct: 235 LGGSDPFIVLNDADLELAVKAAVAGRYQNTGQVCAAAKRFIIEEGIASAFTERFVAAAAA 294

Query: 333 LKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGRNL 378
           LK+         LGP+     ++ +   +++ + EGA LLL G  +
Sbjct: 295 LKMGDPRDEENALGPMARFDLRDELHHQVEATLAEGARLLLGGEKI 340


>gnl|CDD|226683 COG4230, COG4230, Delta 1-pyrroline-5-carboxylate dehydrogenase
           [Energy production and conversion].
          Length = 769

 Score =  127 bits (320), Expect = 2e-31
 Identities = 100/319 (31%), Positives = 157/319 (49%), Gaps = 20/319 (6%)

Query: 51  NPAT-NEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANMKQL 109
           NPA  +++V  V +AT A++E A+ +A  A   WS T    R  +L R  ++++A M QL
Sbjct: 133 NPADPDDIVGTVTEATEADVEQALEAAVAAAPIWSATPPAERAAILERAADLMEAQMPQL 192

Query: 110 AENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGETLAGVAKDMDIHSYRVPLGVT 169
              +  E GKTL++A  +V    + V+              AG A+D   +    PLG  
Sbjct: 193 MGLLVREAGKTLSNAIAEVR---EAVDFL---------RYYAGQARDTFGNLTHRPLGPV 240

Query: 170 AGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAGCPPGVVNVIH 229
             I+P+NFP  I       A+A GN+ + KP+E+ P      V +L +AG PPGV+ ++ 
Sbjct: 241 VCISPWNFPLAIFTGQIAAALAAGNSVLAKPAEQTPLIAAQAVRLLHEAGVPPGVLQLLP 300

Query: 230 GAHDAVN-FICDHPDIRAISFVGSDQAGKYIYERGAK-NGKRVQ--SNMGAKNHGVIMSD 285
           G  + V   +     +  + F GS +  + I  + AK  G+ +   +  G +N  ++ S 
Sbjct: 301 GRGETVGAALTADARVAGVMFTGSTEVARLIQRQLAKRQGRPIPLIAETGGQNAMIVDSS 360

Query: 286 ANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAVE--WLPELKKRAEALKVSAGNVPGT 343
           A     +  ++ +AF +AGQRC AL  V+ + E V    L  LK     L+V   +   T
Sbjct: 361 ALAEQVVADVLASAFDSAGQRCSALR-VLCLQEDVADRILTMLKGAMAELRVGNPDRLTT 419

Query: 344 DLGPVISPQAKERILSLIQ 362
           D+GPVI  +AK  I   IQ
Sbjct: 420 DVGPVIDAEAKANIEKHIQ 438


>gnl|CDD|233325 TIGR01238, D1pyr5carbox3, delta-1-pyrroline-5-carboxylate
           dehydrogenase (PutA C-terminal domain).  This model
           represents one of several related branches of
           delta-1-pyrroline-5-carboxylate dehydrogenase. Members
           of this branch are the C-terminal domain of the PutA
           bifunctional proline dehydrogenase /
           delta-1-pyrroline-5-carboxylate dehydrogenase [Energy
           metabolism, Amino acids and amines].
          Length = 500

 Score =  125 bits (314), Expect = 3e-31
 Identities = 91/319 (28%), Positives = 157/319 (49%), Gaps = 20/319 (6%)

Query: 51  NPAT-NEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANMKQL 109
           NPA   ++V +V  A  A +++A+ SA++A+ TW+ T    R   L RL ++++ +M +L
Sbjct: 57  NPADRRDIVGQVFHANLAHVQAAIDSAQQAFPTWNATPAKERAAKLDRLADLLELHMPEL 116

Query: 110 AENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGETLAGVAKDMDIHSYRVPLGVT 169
                 E GKT+ +A  +V    + V+ C            A   +D+         GV 
Sbjct: 117 MALCVREAGKTIHNAIAEVR---EAVDFC---------RYYAKQVRDVLGEFSVESRGVF 164

Query: 170 AGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAGCPPGVVNVIH 229
             I+P+NFP  I       A+A GNT + KP+E+        V+++ +AG P G + ++ 
Sbjct: 165 VCISPWNFPLAIFTGQISAALAAGNTVIAKPAEQTSLIAYRAVELMQEAGFPAGTIQLLP 224

Query: 230 GAHDAVN-FICDHPDIRAISFVGSDQAGKYIYERGAKNGK---RVQSNMGAKNHGVIMSD 285
           G    V   +   P I  ++F GS +  + I +  A+       + +  G +N  ++ S 
Sbjct: 225 GRGADVGAALTSDPRIAGVAFTGSTEVAQLINQTLAQREDAPVPLIAETGGQNAMIVDST 284

Query: 286 ANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELKKRAEALKVSAGNVPGT 343
           A     +  ++ +AF +AGQRC AL  V+ V E V    L  ++   + LKV   ++  T
Sbjct: 285 ALPEQVVRDVLRSAFDSAGQRCSALR-VLCVQEDVADRVLTMIQGAMQELKVGVPHLLTT 343

Query: 344 DLGPVISPQAKERILSLIQ 362
           D+GPVI  +AK+ +L+ I+
Sbjct: 344 DVGPVIDAEAKQNLLAHIE 362


>gnl|CDD|237017 PRK11904, PRK11904, bifunctional proline
           dehydrogenase/pyrroline-5-carboxylate dehydrogenase;
           Reviewed.
          Length = 1038

 Score =  119 bits (302), Expect = 5e-29
 Identities = 103/337 (30%), Positives = 162/337 (48%), Gaps = 31/337 (9%)

Query: 45  DWIDLHNPA-TNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQQVLFRLQNIIK 103
           +   + +PA    VV  V  A   ++E A+A+A+ A+  WS+T +  R  +L R  ++++
Sbjct: 562 EARPVVSPADRRRVVGEVAFADAEQVEQALAAARAAFPAWSRTPVEERAAILERAADLLE 621

Query: 104 ANMKQLAENITEEQGKTLADAEGDVLRGLQVVEQC---CSIPTIMQGETLAGVAKDM--- 157
           AN  +L      E GKTL DA  +V R  + V+ C          Q   L G  + +   
Sbjct: 622 ANRAELIALCVREAGKTLQDAIAEV-R--EAVDFCRYYA-----AQARRLFGAPEKLPGP 673

Query: 158 ---DIHSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDM 214
                       GV   I+P+NFP  I L     A+A GNT + KP+E+ P      V +
Sbjct: 674 TGESNELRLHGRGVFVCISPWNFPLAIFLGQVAAALAAGNTVIAKPAEQTPLIAAEAVKL 733

Query: 215 LTQAGCPPGVVNVIHGAHDAV-NFICDHPDIRAISFVGSDQAGKYIYERG--AKNGKRVQ 271
           L +AG P  V+ ++ G    V   +   P I  ++F GS +  + I  R   A++G  V 
Sbjct: 734 LHEAGIPKDVLQLLPGDGATVGAALTADPRIAGVAFTGSTETARII-NRTLAARDGPIVP 792

Query: 272 --SNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELK 327
             +  G +N  ++ S A     ++ +V +AF +AGQRC AL  V+FV E +    +  LK
Sbjct: 793 LIAETGGQNAMIVDSTALPEQVVDDVVTSAFRSAGQRCSALR-VLFVQEDIADRVIEMLK 851

Query: 328 KRAEALKVSAGN--VPGTDLGPVISPQAKERILSLIQ 362
                LKV  G+  +  TD+GPVI  +AK  + + I+
Sbjct: 852 GAMAELKV--GDPRLLSTDVGPVIDAEAKANLDAHIE 886


>gnl|CDD|143466 cd07148, ALDH_RL0313, Uncharacterized ALDH ( RL0313) with
           similarity to Tortula ruralis aldehyde dehydrogenase
           ALDH21A1.  Uncharacterized aldehyde dehydrogenase (locus
           RL0313) with sequence similarity to the moss Tortula
           ruralis aldehyde dehydrogenase ALDH21A1 (RNP123)
           believed to play an important role in the detoxification
           of aldehydes generated in response to desiccation- and
           salinity-stress, and similar sequences are included in
           this CD.
          Length = 455

 Score =  112 bits (282), Expect = 5e-27
 Identities = 95/350 (27%), Positives = 153/350 (43%), Gaps = 31/350 (8%)

Query: 47  IDLHNPATNEVVTRVPKATPAEMESAVASAKKAYETWSK-TSIITRQQVLFRLQNIIKAN 105
           +++ NP   + +  VP    A ++ A+ +A   +   +       R  +L RL ++++  
Sbjct: 1   LEVVNPFDLKPIGEVPTVDWAAIDKALDTAHALFLDRNNWLPAHERIAILERLADLMEER 60

Query: 106 MKQLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGETLAGVAKDMDI------ 159
             +LA  I  E GK L DA+ +V R +  VE         +   L G    M +      
Sbjct: 61  ADELALLIAREGGKPLVDAKVEVTRAIDGVELA-----ADELGQLGGREIPMGLTPASAG 115

Query: 160 ---HSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLT 216
               + R P+GV   I+ FN P  + +     AIA G   ++KP+   P +C+  VD+L 
Sbjct: 116 RIAFTTREPIGVVVAISAFNHPLNLIVHQVAPAIAAGCPVIVKPALATPLSCLAFVDLLH 175

Query: 217 QAGCPPGVVN-VIHGAHDAVNFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMG 275
           +AG P G    V      A   + D P +   SF+GS + G  +  + A  G R      
Sbjct: 176 EAGLPEGWCQAVPCENAVAEKLVTD-PRVAFFSFIGSARVGWMLRSKLAP-GTRC----- 228

Query: 276 AKNHG-----VIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELKK 328
           A  HG     ++   A+    +  LV   F  AGQ C+++  V FV   +  ++   L  
Sbjct: 229 ALEHGGAAPVIVDRSADLDAMIPPLVKGGFYHAGQVCVSVQRV-FVPAEIADDFAQRLAA 287

Query: 329 RAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGRNL 378
            AE L V     P T++GP+I P+  +R+   +   V  GA LL  G+ L
Sbjct: 288 AAEKLVVGDPTDPDTEVGPLIRPREVDRVEEWVNEAVAAGARLLCGGKRL 337


>gnl|CDD|181826 PRK09406, gabD1, succinic semialdehyde dehydrogenase; Reviewed.
          Length = 457

 Score =  111 bits (280), Expect = 8e-27
 Identities = 89/336 (26%), Positives = 145/336 (43%), Gaps = 17/336 (5%)

Query: 51  NPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANMKQLA 110
           NPAT E V      T  E+++A+A A   +  +  T+   R +      ++++A   Q+A
Sbjct: 7   NPATGETVKTFTALTDDEVDAAIARAHARFRDYRTTTFAQRARWANAAADLLEAEADQVA 66

Query: 111 ENITEEQGKTLADAEGDVL---RGLQVVEQCCSIPTIMQGET--LAGVAKDMDIHSYRVP 165
             +T E GKTLA A+ + L   +G +   +      ++  E    A V        Y+ P
Sbjct: 67  ALMTLEMGKTLASAKAEALKCAKGFRYYAE--HAEALLADEPADAAAVGASRAYVRYQ-P 123

Query: 166 LGVTAGITPFNFPAMIPLWM---FPV-AIACGNTHVIKPSERDPGACMMLVDMLTQAGCP 221
           LGV   + P+NFP    LW    F   A+  GN  ++K +   P   + L D+  +AG P
Sbjct: 124 LGVVLAVMPWNFP----LWQVVRFAAPALMAGNVGLLKHASNVPQTALYLADLFRRAGFP 179

Query: 222 PGVVNVIHGAHDAVNFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNHGV 281
            G    +     AV  I   P + A +  GS+ AG+ +        K+    +G  +  +
Sbjct: 180 DGCFQTLLVGSGAVEAILRDPRVAAATLTGSEPAGRAVAAIAGDEIKKTVLELGGSDPFI 239

Query: 282 IMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAVEWLPEL-KKRAEALKVSAGNV 340
           +M  A+        V A     GQ C+A    I   +  +   E    R  AL+V     
Sbjct: 240 VMPSADLDRAAETAVTARVQNNGQSCIAAKRFIVHADVYDAFAEKFVARMAALRVGDPTD 299

Query: 341 PGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGR 376
           P TD+GP+ + Q ++ +   +   V  GAT+L  G+
Sbjct: 300 PDTDVGPLATEQGRDEVEKQVDDAVAAGATILCGGK 335


>gnl|CDD|236989 PRK11809, putA, trifunctional transcriptional regulator/proline
           dehydrogenase/pyrroline-5-carboxylate dehydrogenase;
           Reviewed.
          Length = 1318

 Score =  111 bits (280), Expect = 4e-26
 Identities = 99/338 (29%), Positives = 155/338 (45%), Gaps = 47/338 (13%)

Query: 48  DLHNPA-TNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANM 106
            + NPA   ++V  V +ATPAE+E A+ SA  A   W  T    R  +L R  ++++A M
Sbjct: 662 PVINPADPRDIVGYVREATPAEVEQALESAVNAAPIWFATPPAERAAILERAADLMEAQM 721

Query: 107 KQLAENITEEQGKTLADAEGDV------LRGLQVVEQCCSIPTIMQGETLAG-VAKDMDI 159
           + L   +  E GKT ++A  +V      LR                    AG V  D D 
Sbjct: 722 QTLMGLLVREAGKTFSNAIAEVREAVDFLR------------------YYAGQVRDDFDN 763

Query: 160 HSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAG 219
            ++R PLG    I+P+NFP  I       A+A GN+ + KP+E+ P      V +L +AG
Sbjct: 764 DTHR-PLGPVVCISPWNFPLAIFTGQVAAALAAGNSVLAKPAEQTPLIAAQAVRILLEAG 822

Query: 220 CPPGVVNVIHGAHDAV-NFICDHPDIRAISFVGSDQAGKYIYERGAKN-GKRVQSN---- 273
            P GVV ++ G  + V   +     +R + F GS +  + +     +N   R+       
Sbjct: 823 VPAGVVQLLPGRGETVGAALVADARVRGVMFTGSTEVARLL----QRNLAGRLDPQGRPI 878

Query: 274 -----MGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAVEWLPELKK 328
                 G +N  ++ S A     +  ++ +AF +AGQRC AL  +    +  +   ++ +
Sbjct: 879 PLIAETGGQNAMIVDSSALTEQVVADVLASAFDSAGQRCSALRVLCLQDDVADRTLKMLR 938

Query: 329 RAEALKVSAGNVPG---TDLGPVISPQAKERILSLIQS 363
            A A +   GN P    TD+GPVI  +AK  I   IQ+
Sbjct: 939 GAMA-ECRMGN-PDRLSTDIGPVIDAEAKANIERHIQA 974


>gnl|CDD|237018 PRK11905, PRK11905, bifunctional proline
           dehydrogenase/pyrroline-5-carboxylate dehydrogenase;
           Reviewed.
          Length = 1208

 Score =  107 bits (269), Expect = 1e-24
 Identities = 99/335 (29%), Positives = 154/335 (45%), Gaps = 48/335 (14%)

Query: 49  LHNPA-TNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANMK 107
           + NPA  ++VV  V +A+  ++E A+A+A+ A+  WS T    R  +L R  ++++A+M 
Sbjct: 571 VLNPADHDDVVGTVTEASAEDVERALAAAQAAFPEWSATPAAERAAILERAADLMEAHMP 630

Query: 108 QLAENITEEQGKTLADAEGDV------LRGLQVVEQCCSIPTIMQGETLAGVAKDMDIHS 161
           +L      E GKTLA+A  +V      LR                    A  A+ +    
Sbjct: 631 ELFALAVREAGKTLANAIAEVREAVDFLR------------------YYAAQARRLLNGP 672

Query: 162 YRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAGCP 221
              PLG    I+P+NFP  I       A+  GNT + KP+E+ P      V +L +AG P
Sbjct: 673 GHKPLGPVVCISPWNFPLAIFTGQIAAALVAGNTVLAKPAEQTPLIAARAVRLLHEAGVP 732

Query: 222 PGVVNVIHGAHDAV-NFICDHPDIRAISFVGSDQAGKYIYERGAKNGKR---VQSNMGAK 277
              + ++ G    V   +   P I  + F GS +  + I    AK       + +  G +
Sbjct: 733 KDALQLLPGDGRTVGAALVADPRIAGVMFTGSTEVARLIQRTLAKRSGPPVPLIAETGGQ 792

Query: 278 NHGVIMSDANRSNTL-NQLVGA----AFGAAGQRCMALSTVIFVGE--AVEWLPELKKRA 330
           N  ++ S A     L  Q+V      AF +AGQRC AL  V+ + E  A   L  LK   
Sbjct: 793 NAMIVDSSA-----LPEQVVADVIASAFDSAGQRCSALR-VLCLQEDVADRVLTMLKGAM 846

Query: 331 EALKVSAGNVP---GTDLGPVISPQAKERILSLIQ 362
           + L++  G+ P    TD+GPVI  +A+  I + I+
Sbjct: 847 DELRI--GD-PWRLSTDVGPVIDAEAQANIEAHIE 878


>gnl|CDD|143441 cd07123, ALDH_F4-17_P5CDH, Delta(1)-pyrroline-5-carboxylate
           dehydrogenase, ALDH families 4 and 17.
           Delta(1)-pyrroline-5-carboxylate dehydrogenase
           (EC=1.5.1.12 ), families 4 and 17: a proline catabolic
           enzyme of the aldehyde dehydrogenase (ALDH) protein
           superfamily.  Delta(1)-pyrroline-5-carboxylate
           dehydrogenase (P5CDH), also known as ALDH4A1 in humans, 
           is a mitochondrial  homodimer involved in proline
           degradation and catalyzes the NAD + -dependent
           conversion of P5C to glutamate. This is a necessary step
           in the pathway interconnecting the urea and
           tricarboxylic acid cycles. The preferred substrate is
           glutamic gamma-semialdehyde, other substrates include
           succinic, glutaric and adipic semialdehydes. Also
           included in this CD is the Aldh17 Drosophila
           melanogaster (Q9VUC0) P5CDH and similar sequences.
          Length = 522

 Score =  105 bits (265), Expect = 1e-24
 Identities = 93/374 (24%), Positives = 159/374 (42%), Gaps = 50/374 (13%)

Query: 32  LYIDGKFVESKATDWIDLHNPAT-NEVVTRVPKATPAEMESAVASAKKAYETWSKTSIIT 90
           L I GK V +  T       P     V+     A  A +E A+ +A +A + W++     
Sbjct: 35  LVIGGKEVRTGNT--GKQVMPHDHAHVLATYHYADAALVEKAIEAALEARKEWARMPFED 92

Query: 91  RQQVLFRLQNII--KANMKQLAENITEEQGKTLADAEGDV-------LR-GLQVVEQCCS 140
           R  +  +  +++  K   +  A  +   QGK +  AE D        LR  ++  E+   
Sbjct: 93  RAAIFLKAADLLSGKYRYELNAATMLG-QGKNVWQAEIDAACELIDFLRFNVKYAEE--- 148

Query: 141 IPTIMQGETLAGVAKDMDIHSYRVPL-GVTAGITPFNFPAM------IPLWMFPVAIACG 193
              +   + L+  A   +   YR PL G    ++PFNF A+       P  M       G
Sbjct: 149 ---LYAQQPLSSPAGVWNRLEYR-PLEGFVYAVSPFNFTAIGGNLAGAPALM-------G 197

Query: 194 NTHVIKPSERDPGACMMLVDMLTQAGCPPGVVNVIHGAHDAV-NFICDHPDIRAISFVGS 252
           N  + KPS+    +  ++  +L +AG PPGV+N + G    V + +   P +  + F GS
Sbjct: 198 NVVLWKPSDTAVLSNYLVYKILEEAGLPPGVINFVPGDGPVVGDTVLASPHLAGLHFTGS 257

Query: 253 DQAGKYIYERGA------KNGKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQR 306
               K ++++        +   R+    G KN  ++   A+  + +   V  AF   GQ+
Sbjct: 258 TPTFKSLWKQIGENLDRYRTYPRIVGETGGKNFHLVHPSADVDSLVTATVRGAFEYQGQK 317

Query: 307 CMALSTVIFVGEAVEWLPELKKR--AEALKVSAGNVP--GTDLGPVISPQAKERILSLI- 361
           C A S   +V E++ W PE+K+R   E  ++  G+       +G VI  +A +RI   I 
Sbjct: 318 CSAASRA-YVPESL-W-PEVKERLLEELKEIKMGDPDDFSNFMGAVIDEKAFDRIKGYID 374

Query: 362 QSGVDEGATLLLDG 375
            +  D  A ++  G
Sbjct: 375 HAKSDPEAEIIAGG 388


>gnl|CDD|143453 cd07135, ALDH_F14-YMR110C, Saccharomyces cerevisiae aldehyde
           dehydrogenase family 14 and related proteins.  Aldehyde
           dehydrogenase family 14 (ALDH14), isolated mainly from
           the mitochondrial outer membrane of Saccharomyces
           cerevisiae (YMR110C) and most closely related to the
           plant and animal ALDHs and fatty ALDHs family 3 members,
           and similar fungal sequences, are present in this CD.
          Length = 436

 Score = 99.6 bits (249), Expect = 1e-22
 Identities = 73/334 (21%), Positives = 134/334 (40%), Gaps = 39/334 (11%)

Query: 65  TPAEMESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANMKQLAENITEEQGK----- 119
              E++S  +  +  + +     +  R   L +L   +K N + + E + ++ G+     
Sbjct: 3   PLDEIDSIHSRLRATFRSGKTKDLEYRLWQLKQLYWAVKDNEEAIVEALKKDLGRPPFET 62

Query: 120 TLADAEGDVLRGLQVVEQCCSIPTIMQGETLAGVAKDMDIHSYRV---PLGVTAGITPFN 176
            L +  G     L +++    +    + E +            R+   PLGV   I P+N
Sbjct: 63  LLTEVSGVKNDILHMLKN---LKKWAKDEKVKDGPLAFMFGKPRIRKEPLGVVLIIGPWN 119

Query: 177 FPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAGCPPGVVNVIHGAHDAVN 236
           +P ++ L     AIA G T V+KPSE  P    +L +++ +    P    V+ G      
Sbjct: 120 YPVLLALSPLVGAIAAGCTVVLKPSELTPHTAALLAELVPKY-LDPDAFQVVQGGVPETT 178

Query: 237 FICDHP-DIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNHGVIMSDANRSNTLNQL 295
            + +   D   I + GS + G+ I E  AK+   V   +G K+  ++  +A+      ++
Sbjct: 179 ALLEQKFD--KIFYTGSGRVGRIIAEAAAKHLTPVTLELGGKSPVIVTKNADLELAAKRI 236

Query: 296 VGAAFGAAGQRCMALSTVI--------FVGEAVEWLPELKKRAEALKVSAGNVPGTDLGP 347
           +   FG AGQ C+A   V+        FV E  + L E            G     D   
Sbjct: 237 LWGKFGNAGQICVAPDYVLVDPSVYDEFVEELKKVLDEFYPG--------GANASPDYTR 288

Query: 348 VISPQAKERILSLIQS--------GVDEGATLLL 373
           +++P+   R+ SL+ +        G  + AT  +
Sbjct: 289 IVNPRHFNRLKSLLDTTKGKVVIGGEMDEATRFI 322


>gnl|CDD|177949 PLN02315, PLN02315, aldehyde dehydrogenase family 7 member.
          Length = 508

 Score = 99.1 bits (247), Expect = 2e-22
 Identities = 79/344 (22%), Positives = 155/344 (45%), Gaps = 9/344 (2%)

Query: 51  NPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANMKQLA 110
           NPA N+ +  V +A+  + E  + + ++A + W +     R +++ ++ + ++A +  L 
Sbjct: 40  NPANNQPIAEVVEASLEDYEEGLRACEEAAKIWMQVPAPKRGEIVRQIGDALRAKLDYLG 99

Query: 111 ENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGETLAGVAKDMDIHSYRVPLGVTA 170
             ++ E GK LA+  G+V   + + +    +   + G  +     +  +     PLG+  
Sbjct: 100 RLVSLEMGKILAEGIGEVQEIIDMCDFAVGLSRQLNGSIIPSERPNHMMMEVWNPLGIVG 159

Query: 171 GITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDP----GACMMLVDMLTQAGCPPGVVN 226
            IT FNFP  +  W   +A+ CGN  V K +   P        ++ ++L +   P  +  
Sbjct: 160 VITAFNFPCAVLGWNACIALVCGNCVVWKGAPTTPLITIAMTKLVAEVLEKNNLPGAIFT 219

Query: 227 VIHGAHDAVNFICDHPDIRAISFVGSDQAGKYIYER-GAKNGKRVQSNMGAKNHGVIMSD 285
              G  +    I     I  +SF GS + G  + +   A+ GK +   +   N  ++M D
Sbjct: 220 SFCGGAEIGEAIAKDTRIPLVSFTGSSKVGLMVQQTVNARFGKCLLE-LSGNNAIIVMDD 278

Query: 286 ANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELKKRAEALKVSAGNVPGT 343
           A+    +  ++ AA G AGQRC      + + E++  + L +L    + +K+      GT
Sbjct: 279 ADIQLAVRSVLFAAVGTAGQRCTTCRR-LLLHESIYDDVLEQLLTVYKQVKIGDPLEKGT 337

Query: 344 DLGPVISPQAKERILSLIQSGVDEGATLLLDGRNLKVGINVPIP 387
            LGP+ +P++K+     I+    +G  +L  G  ++   N   P
Sbjct: 338 LLGPLHTPESKKNFEKGIEIIKSQGGKILTGGSAIESEGNFVQP 381


>gnl|CDD|143403 cd07084, ALDH_KGSADH-like, ALDH subfamily: NAD(P)+-dependent
           alpha-ketoglutaric semialdehyde dehydrogenases and plant
           delta(1)-pyrroline-5-carboxylate dehydrogenase, ALDH
           family 12-like.  ALDH subfamily which includes the
           NAD(P)+-dependent, alpha-ketoglutaric semialdehyde
           dehydrogenases (KGSADH, EC 1.2.1.26); plant
           delta(1)-pyrroline-5-carboxylate dehydrogenase (P5CDH,
           EC=1.5.1.12 ), ALDH family 12; the N-terminal domain of
           the MaoC (monoamine oxidase C) dehydratase regulatory
           protein; and orthologs of MaoC, PaaZ and PaaN, which are
           putative ring-opening enzymes of the aerobic
           phenylacetic acid catabolic pathway.
          Length = 442

 Score = 97.7 bits (243), Expect = 4e-22
 Identities = 85/330 (25%), Positives = 126/330 (38%), Gaps = 21/330 (6%)

Query: 70  ESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANMKQLAENITEEQGKT---LADAEG 126
           E A+ +A  + +   + ++  R   L R+   + A    +A       GK      +  G
Sbjct: 2   ERALLAADISTKAARRLALPKRADFLARIIQRLAAKSYDIAAGAVLVTGKGWMFAENICG 61

Query: 127 DV--LRGLQVVEQCCSIPTIMQGETLAGVAKDMDIHSYRVPLGVTAGITPFNFPAMIPLW 184
           D   LR    V     IP    G  L G       H YR P G    I  FNFP  IPL 
Sbjct: 62  DQVQLRARAFVIYSYRIPHEP-GNHL-GQGLKQQSHGYRWPYGPVLVIGAFNFPLWIPLL 119

Query: 185 MFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAG-CPPGVVNVIHGAHDAVNFICDHPD 243
               A+A GN  ++KP         ++V +L  AG  PP  V +I+G    +  +  HP+
Sbjct: 120 QLAGALAMGNPVIVKPHTAVSIVMQIMVRLLHYAGLLPPEDVTLINGDGKTMQALLLHPN 179

Query: 244 IRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNHGVIMSDANRSNTL-NQLVGAAFGA 302
            + + F GS +  + +    AK   R+   +   N  V+  DA   + +  Q V      
Sbjct: 180 PKMVLFTGSSRVAEKLALD-AKQA-RIYLELAGFNWKVLGPDAQAVDYVAWQCVQDMTAC 237

Query: 303 AGQRCMALSTVIFVGEAVEWLPELKKRAEALKVSAGN--VPGTDLGPVISPQAKERILSL 360
           +GQ+C A S ++FV E            E LK       +    LGPV +      I  +
Sbjct: 238 SGQKCTAQS-MLFVPEN----WSKTPLVEKLKALLARRKLEDLLLGPVQTFTTLAMIAHM 292

Query: 361 IQSGVDEGATLLLDGRNLKVGINVPIPVPL 390
                  G+ LL  G+ LK      I    
Sbjct: 293 ---ENLLGSVLLFSGKELKNHSIPSIYGAC 319


>gnl|CDD|240392 PTZ00381, PTZ00381, aldehyde dehydrogenase family protein;
           Provisional.
          Length = 493

 Score = 90.9 bits (226), Expect = 1e-19
 Identities = 78/311 (25%), Positives = 131/311 (42%), Gaps = 22/311 (7%)

Query: 63  KATPAEMESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANMKQLAENITEEQGKTLA 122
              P  +   V   K+++ T     +  R+Q L  L  +++ N ++ +E + ++ G+   
Sbjct: 3   PDNPEIIPPIVKKLKESFLTGKTRPLEFRKQQLRNLLRMLEENKQEFSEAVHKDLGRHPF 62

Query: 123 DAEGDVLRGLQVVEQCCSIPTIMQGETLAGVAKDMDIH-----SY--RVPLGVTAGITPF 175
           + +   +  L V E    +  +   E L     D         SY    PLGV   I  +
Sbjct: 63  ETKMTEV-LLTVAEIEHLLKHL--DEYLKPEKVDTVGVFGPGKSYIIPEPLGVVLVIGAW 119

Query: 176 NFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAGCPPGVVNVIHGAHDAV 235
           N+P  + L     AIA GNT V+KPSE  P    ++  +LT+    P  V VI G  +  
Sbjct: 120 NYPLNLTLIPLAGAIAAGNTVVLKPSELSPHTSKLMAKLLTKY-LDPSYVRVIEGGVEVT 178

Query: 236 NFICDHP-DIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNHGVIMSDANRSNTLNQ 294
             +   P D   I F GS + GK + +  A+N       +G K+  ++    N      +
Sbjct: 179 TELLKEPFDH--IFFTGSPRVGKLVMQAAAENLTPCTLELGGKSPVIVDKSCNLKVAARR 236

Query: 295 LVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELKKRAEALKVSAGNVPGT--DLGPVIS 350
           +    F  AGQ C+A   V  V  ++  +++  LK   EA+K   G  P    D   +++
Sbjct: 237 IAWGKFLNAGQTCVAPDYV-LVHRSIKDKFIEALK---EAIKEFFGEDPKKSEDYSRIVN 292

Query: 351 PQAKERILSLI 361
               +R+  LI
Sbjct: 293 EFHTKRLAELI 303


>gnl|CDD|233324 TIGR01236, D1pyr5carbox1, delta-1-pyrroline-5-carboxylate
           dehydrogenase, group 1.  This model represents one of
           two related branches of delta-1-pyrroline-5-carboxylate
           dehydrogenase. The two branches are not as closely
           related to each other as some aldehyde dehydrogenases
           are to this branch, and separate models are built for
           this reason. The enzyme is the second of two in the
           degradation of proline to glutamate [Energy metabolism,
           Amino acids and amines].
          Length = 532

 Score = 89.5 bits (222), Expect = 4e-19
 Identities = 84/350 (24%), Positives = 145/350 (41%), Gaps = 58/350 (16%)

Query: 47  IDLHNPATNE-VVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKAN 105
           I   +P  ++ V+ +   AT      AV +A  A + WS      R  +  +  +++   
Sbjct: 48  IYQVSPHNHQAVLAKATNATEELANKAVEAALDAKKEWSLLPFYDRAAIFLKAADLLSGP 107

Query: 106 MKQLAENITE-EQGKTLADAEGDVLRGL------------QVVEQCCSIPTIMQGETLAG 152
            +      T   Q KT+  AE D +  L            ++ EQ        Q  +  G
Sbjct: 108 YRAEILAATMLGQSKTVYQAEIDAVAELIDFFRFNAKYARELYEQ--------QPISAPG 159

Query: 153 VAKDMDIHSYRVPL-GVTAGITPFNFPAM------IPLWMFPVAIACGNTHVIKPSERDP 205
                +   YR PL G    I+PFNF A+       P  M       GNT + KPS+   
Sbjct: 160 EWNRTE---YR-PLEGFVYAISPFNFTAIAGNLAGAPALM-------GNTVIWKPSDTAM 208

Query: 206 GACMMLVDMLTQAGCPPGVVNVIHGAHDAV-NFICDHPDIRAISFVGSDQAGKYIYERGA 264
            +  +++ +L +AG PPGV+N + G    V + +   PD+  I F GS    K+++++ A
Sbjct: 209 LSNYLVMRILEEAGLPPGVINFVPGDGPLVSDTVLADPDLAGIHFTGSTPTFKHLWKQVA 268

Query: 265 KN-GK-----RVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGE 318
           +N  +     R+    G K+  V+   A+  + ++  +  AF   GQ+C A S  ++V  
Sbjct: 269 QNLDRYHNFPRIVGETGGKDFHVVHPSADVEHVVSGTIRGAFEYQGQKCSACSR-LYVPH 327

Query: 319 AV--EWLPELKKRAEALKVSAGNVPGTDL----GPVISPQAKERILSLIQ 362
           ++   +   L      +KV        D     G VI  ++  +I+  I+
Sbjct: 328 SLWPRFKGRLLAELSRVKV----GDPDDFGTFMGAVIDEKSFAKIVKYIE 373


>gnl|CDD|143406 cd07087, ALDH_F3-13-14_CALDH-like, ALDH subfamily: Coniferyl
           aldehyde dehydrogenase, ALDH families 3, 13, and 14, and
           other related proteins.  ALDH subfamily which includes
           NAD(P)+-dependent, aldehyde dehydrogenase, family 3
           member A1 and B1  (ALDH3A1, ALDH3B1,  EC=1.2.1.5) and
           fatty aldehyde dehydrogenase, family 3 member A2
           (ALDH3A2, EC=1.2.1.3), and also plant ALDH family
           members ALDH3F1, ALDH3H1, and ALDH3I1, fungal ALDH14
           (YMR110C) and the protozoan family 13 member (ALDH13),
           as well as coniferyl aldehyde dehydrogenases (CALDH,
           EC=1.2.1.68), and other similar  sequences, such as the
           Pseudomonas putida benzaldehyde dehydrogenase I that is
           involved in the metabolism of mandelate.
          Length = 426

 Score = 83.7 bits (208), Expect = 2e-17
 Identities = 72/209 (34%), Positives = 100/209 (47%), Gaps = 20/209 (9%)

Query: 165 PLGVTAGITPFNFP---AMIPLWMFPV-AIACGNTHVIKPSERDPGACMMLVDMLTQAGC 220
           PLGV   I P+N+P   A+ PL    + AIA GNT V+KPSE  P A   L+  L     
Sbjct: 100 PLGVVLIIGPWNYPLQLALAPL----IGAIAAGNTVVLKPSELAP-ATSALLAKLIPKYF 154

Query: 221 PPGVVNVIHGAHDAVNFICDHP-DIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNH 279
            P  V V+ G  +    +   P D   I F GS   GK + E  AK+   V   +G K+ 
Sbjct: 155 DPEAVAVVEGGVEVATALLAEPFDH--IFFTGSPAVGKIVMEAAAKHLTPVTLELGGKSP 212

Query: 280 GVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELKKRAEALKVSA 337
            ++  DAN      ++    F  AGQ C+A   V  V E++  E + ELKK   A+K   
Sbjct: 213 CIVDKDANLEVAARRIAWGKFLNAGQTCIAPDYV-LVHESIKDELIEELKK---AIKEFY 268

Query: 338 GNVPG--TDLGPVISPQAKERILSLIQSG 364
           G  P    D G +I+ +  +R+ SL+  G
Sbjct: 269 GEDPKESPDYGRIINERHFDRLASLLDDG 297


>gnl|CDD|143454 cd07136, ALDH_YwdH-P39616, Bacillus subtilis aldehyde dehydrogenase
           ywdH-like.  Uncharacterized Bacillus subtilis ywdH
           aldehyde dehydrogenase (locus P39616)  most closely
           related to the ALDHs and fatty ALDHs of families 3 and
           14, and similar sequences, are included in this CD.
          Length = 449

 Score = 70.6 bits (174), Expect = 4e-13
 Identities = 63/215 (29%), Positives = 101/215 (46%), Gaps = 26/215 (12%)

Query: 162 YRVPLGVTAGITPFNFP---AMIPLWMFPV-AIACGNTHVIKPSERDPGACMMLVDMLTQ 217
           Y  P GV   I P+N+P   A+ PL    + AIA GNT V+KPSE  P    ++  ++ +
Sbjct: 97  YYEPYGVVLIIAPWNYPFQLALAPL----IGAIAAGNTAVLKPSELTPNTSKVIAKIIEE 152

Query: 218 AGCPPGVVNVIHGAHDAVNFIC----DHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSN 273
                  V V+ G  +    +     D+     I F GS + GK + E  AK+   V   
Sbjct: 153 T-FDEEYVAVVEGGVEENQELLDQKFDY-----IFFTGSVRVGKIVMEAAAKHLTPVTLE 206

Query: 274 MGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELKKRAE 331
           +G K+  ++  DAN      ++V   F  AGQ C+A    + V E+V  +++ ELK+  +
Sbjct: 207 LGGKSPCIVDEDANLKLAAKRIVWGKFLNAGQTCVA-PDYVLVHESVKEKFIKELKEEIK 265

Query: 332 ALKVSAGNVP--GTDLGPVISPQAKERILSLIQSG 364
                 G  P    D G +I+ +  +R+  L+ +G
Sbjct: 266 KFY---GEDPLESPDYGRIINEKHFDRLAGLLDNG 297


>gnl|CDD|143452 cd07134, ALDH_AlkH-like, Pseudomonas putida Aldehyde dehydrogenase
           AlkH-like.  Aldehyde dehydrogenase AlkH (locus name
           P12693, EC=1.2.1.3) of the alkBFGHJKL operon that allows
           Pseudomonas putida to metabolize alkanes and the
           aldehyde dehydrogenase AldX of Bacillus subtilis (locus
           P46329, EC=1.2.1.3), and similar sequences, are present
           in this CD.
          Length = 433

 Score = 69.9 bits (172), Expect = 5e-13
 Identities = 60/217 (27%), Positives = 96/217 (44%), Gaps = 12/217 (5%)

Query: 165 PLGVTAGITPFNFP---AMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAGCP 221
           P GV   I+P+N+P   A  PL     AIA GNT ++KPSE  P    ++  ++ +A   
Sbjct: 100 PKGVCLIISPWNYPFNLAFGPL---VSAIAAGNTAILKPSELTPHTSAVIAKIIREA-FD 155

Query: 222 PGVVNVIHGAHDAVNFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNHGV 281
              V V  G  +    + + P    I F GS   GK +    AK+   V   +G K+  +
Sbjct: 156 EDEVAVFEGDAEVAQALLELP-FDHIFFTGSPAVGKIVMAAAAKHLASVTLELGGKSPTI 214

Query: 282 IMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV--EWLPELKKR-AEALKVSAG 338
           +   A+      ++    F  AGQ C+A   V FV E+V   ++  LK    +     A 
Sbjct: 215 VDETADLKKAAKKIAWGKFLNAGQTCIAPDYV-FVHESVKDAFVEHLKAEIEKFYGKDAA 273

Query: 339 NVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDG 375
                DL  +++ +  +R+  L+   V +GA +   G
Sbjct: 274 RKASPDLARIVNDRHFDRLKGLLDDAVAKGAKVEFGG 310


>gnl|CDD|143455 cd07137, ALDH_F3FHI, Plant aldehyde dehydrogenase family 3 members
           F1, H1, and I1 and related proteins.  Aldehyde
           dehydrogenase family members 3F1, 3H1, and 3I1 (ALDH3F1,
           ALDH3H1, and ALDH3I1), and similar plant sequences, are
           in this CD.  In Arabidopsis thaliana, stress-regulated
           expression of ALDH3I1  was observed in  leaves and
           osmotic stress expression of  ALDH3H1 was observed in
           root tissue, whereas, ALDH3F1 expression was not stress
           responsive. Functional analysis of ALDH3I1 suggest it
           may be involved in a detoxification pathway in plants
           that limits aldehyde accumulation and oxidative stress.
          Length = 432

 Score = 67.1 bits (164), Expect = 5e-12
 Identities = 50/173 (28%), Positives = 76/173 (43%), Gaps = 14/173 (8%)

Query: 165 PLGVTAGITPFNFPAMIPLWMFPV--AIACGNTHVIKPSERDPGACMMLVDMLTQAGCPP 222
           PLGV   I+ +NFP    L + PV  AIA GN  V+KPSE  P    +L  ++ +     
Sbjct: 101 PLGVVLVISAWNFP--FLLSLEPVIGAIAAGNAVVLKPSELAPATSALLAKLIPEY-LDT 157

Query: 223 GVVNVIHGAHDAVNFICDHP-DIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNHGV 281
             + VI G       + +   D   I F GS + G+ I    AK+   V   +G K   +
Sbjct: 158 KAIKVIEGGVPETTALLEQKWD--KIFFTGSPRVGRIIMAAAAKHLTPVTLELGGKCPVI 215

Query: 282 IMSDANRSNTLNQLVGAAFGA-AGQRCMA----LSTVIFVGEAVEWL-PELKK 328
           + S  +    + ++ G  +G   GQ C+A    L    F    ++ L   L+K
Sbjct: 216 VDSTVDLKVAVRRIAGGKWGCNNGQACIAPDYVLVEESFAPTLIDALKNTLEK 268


>gnl|CDD|143447 cd07129, ALDH_KGSADH, Alpha-Ketoglutaric Semialdehyde
           Dehydrogenase.  Alpha-Ketoglutaric Semialdehyde (KGSA)
           Dehydrogenase (KGSADH, EC 1.2.1.26) catalyzes the
           NAD(P)+-dependent conversion of KGSA to
           alpha-ketoglutarate. This CD contains such sequences as
           those seen in Azospirillum brasilense, KGSADH-II
           (D-glucarate/D-galactarate-inducible) and KGSADH-III
           (hydroxy-L-proline-inducible). Both show similar high
           substrate specificity for KGSA and different coenzyme
           specificity; KGSADH-II is NAD+-dependent and KGSADH-III
           is NADP+-dependent. Also included in this CD is the
           NADP(+)-dependent aldehyde dehydrogenase from Vibrio
           harveyi which catalyzes the oxidation of long-chain
           aliphatic aldehydes to acids.
          Length = 454

 Score = 66.0 bits (162), Expect = 1e-11
 Identities = 66/276 (23%), Positives = 105/276 (38%), Gaps = 56/276 (20%)

Query: 70  ESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVL 129
           ++A A+A  A+E++   S   R   L  + + I+A   +L      E G   A  +G++ 
Sbjct: 2   DAAAAAAAAAFESYRALSPARRAAFLEAIADEIEALGDELVARAHAETGLPEARLQGELG 61

Query: 130 RGLQVVEQCCSIPTIMQGETLAGVAKD--------------------MDIHSYRVPLGVT 169
           R            T  Q    A + ++                     D+    VPLG  
Sbjct: 62  R------------TTGQLRLFADLVREGSWLDARIDPADPDRQPLPRPDLRRMLVPLGPV 109

Query: 170 AGITPFNFP-AMIPLWMFPV-------AIACGNTHVIKPSERDPGACMMLVDMLTQA--- 218
           A     NFP A      F V       A+A G   V+K     PG   ++   +  A   
Sbjct: 110 AVFGASNFPLA------FSVAGGDTASALAAGCPVVVKAHPAHPGTSELVARAIRAALRA 163

Query: 219 -GCPPGVVNVIHGAHDAVNF-ICDHPDIRAISFVGSDQAGKYIYERGAK--NGKRVQSNM 274
            G P GV +++ G    V   +  HP I+A+ F GS + G+ +++  A         + +
Sbjct: 164 TGLPAGVFSLLQGGGREVGVALVKHPAIKAVGFTGSRRGGRALFDAAAARPEPIPFYAEL 223

Query: 275 GAKNHGVIMSDA--NRSNTLNQ-LVGAAFGAAGQRC 307
           G+ N   I+  A   R   + Q  VG+    AGQ C
Sbjct: 224 GSVNPVFILPGALAERGEAIAQGFVGSLTLGAGQFC 259


>gnl|CDD|177831 PLN02174, PLN02174, aldehyde dehydrogenase family 3 member H1.
          Length = 484

 Score = 63.1 bits (153), Expect = 1e-10
 Identities = 58/233 (24%), Positives = 106/233 (45%), Gaps = 12/233 (5%)

Query: 165 PLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAGCPPGV 224
           PLGV   I+ +N+P ++ +     AI+ GN  V+KPSE  P +  +L  +L Q       
Sbjct: 112 PLGVVLVISAWNYPFLLSIDPVIGAISAGNAVVLKPSELAPASSALLAKLLEQY-LDSSA 170

Query: 225 VNVIHGAHDAVNFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNHGVIMS 284
           V V+ GA      + +      I + GS + G+ I    AK+   V   +G K+  V+ S
Sbjct: 171 VRVVEGAVTETTALLEQ-KWDKIFYTGSSKIGRVIMAAAAKHLTPVVLELGGKSPVVVDS 229

Query: 285 DANRSNTLNQLVGAAFGA-AGQRCMALSTVIFVGE-AVEWLPELKKRAEALKVSAGNVPG 342
           D +   T+ +++   +G   GQ C++   ++   E A + +  +KK  E        +  
Sbjct: 230 DTDLKVTVRRIIAGKWGCNNGQACISPDYILTTKEYAPKVIDAMKKELETF-YGKNPMES 288

Query: 343 TDLGPVISPQAKERILSLIQSGVDEGATLLL-----DGRNLKVGINVPIPVPL 390
            D+  +++    +R+  L+     E +  ++     D  NLK+   + + VPL
Sbjct: 289 KDMSRIVNSTHFDRLSKLLDE--KEVSDKIVYGGEKDRENLKIAPTILLDVPL 339


>gnl|CDD|143450 cd07132, ALDH_F3AB, Aldehyde dehydrogenase family 3 members A1, A2,
           and B1 and related proteins.  NAD(P)+-dependent,
           aldehyde dehydrogenase, family 3 members A1 and B1
           (ALDH3A1, ALDH3B1,  EC=1.2.1.5) and fatty aldehyde
           dehydrogenase, family 3 member A2 (ALDH3A2, EC=1.2.1.3),
           and similar sequences are included in this CD. Human
           ALDH3A1 is a homodimer with a critical role in cellular
           defense against oxidative stress; it catalyzes the
           oxidation of various cellular membrane lipid-derived
           aldehydes. Corneal crystalline ALDH3A1 protects the
           cornea and underlying lens against UV-induced oxidative
           stress. Human ALDH3A2, a microsomal homodimer, catalyzes
           the oxidation of long-chain aliphatic aldehydes to fatty
           acids. Human ALDH3B1 is highly expressed in the kidney
           and liver and catalyzes the oxidation of various medium-
           and long-chain saturated and unsaturated aliphatic
           aldehydes.
          Length = 443

 Score = 61.5 bits (150), Expect = 3e-10
 Identities = 80/314 (25%), Positives = 140/314 (44%), Gaps = 36/314 (11%)

Query: 70  ESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANMKQLAENITEEQGKTLADA---EG 126
             AV  A++A+ +     +  R Q L  L  +++ N  ++ E + ++  K   +A   E 
Sbjct: 1   AEAVRRAREAFSSGKTRPLEFRIQQLEALLRMLEENEDEIVEALAKDLRKPKFEAVLSEI 60

Query: 127 DVLRGLQVVEQCCSIPTIMQGETLAGVAKDM-----DIHSYRVPLGVTAGITPFNFPAMI 181
            +++  ++     ++P  M+ E    V K++     D++ Y+ PLGV   I  +N+P  +
Sbjct: 61  LLVKN-EIKYAISNLPEWMKPEP---VKKNLATLLDDVYIYKEPLGVVLIIGAWNYP--L 114

Query: 182 PLWMFPV--AIACGNTHVIKPSERDPGACMMLVDMLTQ---AGCPPGVVNVIHGAHDA-- 234
            L + P+  AIA GN  VIKPSE  P    +L +++ +     C P VV    G  +   
Sbjct: 115 QLTLVPLVGAIAAGNCVVIKPSEVSPATAKLLAELIPKYLDKECYP-VVL--GGVEETTE 171

Query: 235 -VNFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNHGVIMSDANRSNTLN 293
            +    D+     I + GS   GK + +  AK+   V   +G K+   +    +      
Sbjct: 172 LLKQRFDY-----IFYTGSTSVGKIVMQAAAKHLTPVTLELGGKSPCYVDKSCDIDVAAR 226

Query: 294 QLVGAAFGAAGQRCMALSTVIFVGEAVE-WLPELKKRAEALKVSAGNVPGT--DLGPVIS 350
           ++    F  AGQ C+A   V+   E  E ++  LKK    LK   G  P    D G +I+
Sbjct: 227 RIAWGKFINAGQTCIAPDYVLCTPEVQEKFVEALKK---TLKEFYGEDPKESPDYGRIIN 283

Query: 351 PQAKERILSLIQSG 364
            +  +R+  L+  G
Sbjct: 284 DRHFQRLKKLLSGG 297


>gnl|CDD|143451 cd07133, ALDH_CALDH_CalB, Coniferyl aldehyde dehydrogenase-like.
           Coniferyl aldehyde dehydrogenase (CALDH, EC=1.2.1.68) of
           Pseudomonas sp. strain HR199 (CalB) which catalyzes the
           NAD+-dependent oxidation of coniferyl aldehyde to
           ferulic acid, and similar sequences, are present in this
           CD.
          Length = 434

 Score = 59.8 bits (146), Expect = 1e-09
 Identities = 57/225 (25%), Positives = 95/225 (42%), Gaps = 37/225 (16%)

Query: 165 PLGVTAGITPFNFP---AMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAGCP 221
           PLGV   I P+N+P   A+ PL     A+A GN  +IKPSE  P    +L ++L +    
Sbjct: 101 PLGVVGIIVPWNYPLYLALGPL---IAALAAGNRVMIKPSEFTPRTSALLAELLAEYF-D 156

Query: 222 PGVVNVIHGAHD------AVNFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMG 275
              V V+ G  D      ++ F  DH     + F GS   G+++    A+N   V   +G
Sbjct: 157 EDEVAVVTGGADVAAAFSSLPF--DH-----LLFTGSTAVGRHVMRAAAENLTPVTLELG 209

Query: 276 AKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVI--------FVGEAVEWLPELK 327
            K+  +I  DA+ +    ++       AGQ C+A   V+        FV  A   + ++ 
Sbjct: 210 GKSPAIIAPDADLAKAAERIAFGKLLNAGQTCVAPDYVLVPEDKLEEFVAAAKAAVAKMY 269

Query: 328 KRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLL 372
                           D   +I+ +   R+  L++    +GA ++
Sbjct: 270 PTLAD---------NPDYTSIINERHYARLQGLLEDARAKGARVI 305


>gnl|CDD|165847 PLN02203, PLN02203, aldehyde dehydrogenase.
          Length = 484

 Score = 59.4 bits (144), Expect = 2e-09
 Identities = 55/171 (32%), Positives = 80/171 (46%), Gaps = 12/171 (7%)

Query: 165 PLGVTAGITPFNFPAMIPLWMFPV--AIACGNTHVIKPSERDPGACMMLVDMLTQAGCPP 222
           PLGV    + +NFP  I L + P+  AIA GN  V+KPSE  P     L   + +     
Sbjct: 108 PLGVVLIFSSWNFP--IGLSLEPLIGAIAAGNAVVLKPSELAPATSAFLAANIPKY-LDS 164

Query: 223 GVVNVIHGAHDAVNFICDHP-DIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNHGV 281
             V VI G       +  H  D   I F GS + G+ I    AK+   V   +G K   +
Sbjct: 165 KAVKVIEGGPAVGEQLLQHKWD--KIFFTGSPRVGRIIMTAAAKHLTPVALELGGKCPCI 222

Query: 282 IMSDANRSNT---LNQLVGAAFGA-AGQRCMALSTVIFVGEAVEWLPELKK 328
           + S ++  +T   +N++VG  +G+ AGQ C+A+  V+        L EL K
Sbjct: 223 VDSLSSSRDTKVAVNRIVGGKWGSCAGQACIAIDYVLVEERFAPILIELLK 273


>gnl|CDD|143444 cd07126, ALDH_F12_P5CDH, Delta(1)-pyrroline-5-carboxylate
           dehydrogenase, ALDH family 12.
           Delta(1)-pyrroline-5-carboxylate dehydrogenase (P5CDH,
           EC=1.5.1.12), family 12: a proline catabolic enzyme of
           the aldehyde dehydrogenase (ALDH) protein superfamily.
           P5CDH is a mitochondrial enzyme involved in proline
           degradation and catalyzes the NAD + -dependent
           conversion of P5C to glutamate.  The P5CDH, ALDH12A1
           gene, in Arabidopsis, has been identified as an
           osmotic-stress-inducible ALDH gene. This CD contains
           both Viridiplantae and Alveolata P5CDH sequences.
          Length = 489

 Score = 58.3 bits (141), Expect = 4e-09
 Identities = 64/237 (27%), Positives = 95/237 (40%), Gaps = 39/237 (16%)

Query: 160 HSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAG 219
             YR P G  A ITPFNFP  IP      A+  GN  ++K   +        + +L   G
Sbjct: 137 SGYRWPYGPVAIITPFNFPLEIPALQLMGALFMGNKPLLKVDSKVSVVMEQFLRLLHLCG 196

Query: 220 CPPGVVNVIHGAHDAVNFICDHPDIRAISFVGS---------DQAGKYIYERGAKNGKRV 270
            P   V++IH     +N I    + R   F GS         +  GK   E    + K +
Sbjct: 197 MPATDVDLIHSDGPTMNKILLEANPRMTLFTGSSKVAERLALELHGKVKLEDAGFDWKIL 256

Query: 271 QSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAVEW-----LPE 325
             ++   ++     D +           A+  +GQ+C A S ++F  E   W     L +
Sbjct: 257 GPDVSDVDYVAWQCDQD-----------AYACSGQKCSAQS-ILFAHE--NWVQAGILDK 302

Query: 326 LKKRAEALKVSAGNVPGTDL--GPVISPQAKERILSLIQSGVD-EGATLLLDGRNLK 379
           LK  AE  K+        DL  GPV++    ERIL  +   +   GA +L  G+ L 
Sbjct: 303 LKALAEQRKLE-------DLTIGPVLT-WTTERILDHVDKLLAIPGAKVLFGGKPLT 351


>gnl|CDD|131331 TIGR02278, PaaN-DH, phenylacetic acid degradation protein paaN.
           This enzyme is proposed to act in the ring-opening step
           of phenylacetic acid degradation which follows ligation
           of the acid with coenzyme A (by PaaF) and hydroxylation
           by a multicomponent non-heme iron hydroxylase complex
           (PaaGHIJK). Gene symbols have been standardized in. This
           enzyme is related to aldehyde dehydrogenases and has
           domains which are members of the pfam00171 and pfam01575
           families. This family includes paaN genes from
           Pseudomonas, Sinorhizobium, Rhodopseudomonas,
           Escherichia, Deinococcus and Corynebacterium. Another
           homology family (TIGR02288) includes several other
           species.
          Length = 663

 Score = 55.6 bits (134), Expect = 3e-08
 Identities = 77/360 (21%), Positives = 140/360 (38%), Gaps = 28/360 (7%)

Query: 40  ESKATDWIDLHNPATNEVVTRVPKATPAEMESAVASAK-KAYETWSKTSIITRQQVLFRL 98
            +   + + + + +T EV+ RV  +   ++ +AVA A+          +   R ++L  L
Sbjct: 10  RTGQGEGVPVRDASTGEVLARV-TSEGLDVAAAVAWAREVGGPALRALTFHERARMLKAL 68

Query: 99  QNIIKANMKQLAENITEEQGKTLADAEGDVLRGLQVVEQCCS-----IPT---IMQGETL 150
              +    + L   +    G T  D   D+  G+  +    S     +P    I + E  
Sbjct: 69  AQYLSERKEALYA-LAATTGATRRDNWVDIDGGIGTLFTYSSLGRRELPDANLIPEDEFE 127

Query: 151 AGVAKDMDI---HSYRVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGA 207
             ++KD      H      GV   I  FNFP    L  F  A   G   + KP+      
Sbjct: 128 P-LSKDGSFQGRHILTPKGGVAVQINAFNFPVWGLLEKFAPAFLAGVPTLAKPATPTAYV 186

Query: 208 CMMLVDMLTQAG-CPPGVVNVIHGAHDAVNFICDHPDIR-AISFVGSDQAGKYIYERGAK 265
              LV  + ++G  P G + +I G+      + DH D R  ++F GS      +      
Sbjct: 187 AEALVRTMVESGLLPEGSLQLICGS---AGDLLDHLDHRDVVAFTGSAATADRLRAHPNV 243

Query: 266 NGKRVQSNMGAK--NHGVIMSDANRSNT-----LNQLVGAAFGAAGQRCMALSTVIFVGE 318
             + ++ N  A   N  ++  DA            ++V      AGQ+C A+  VI    
Sbjct: 244 LERGIRFNAEADSLNAAILGEDATPDEPEFDLFAQEIVRELTIKAGQKCTAIRRVIVPKA 303

Query: 319 AVEWLPE-LKKRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGRN 377
            +E + + L+ R   + +      G D+GP++S + +  + + + + +  GA + L G  
Sbjct: 304 LLEAVLKALQARLAKVVLGDPREEGVDMGPLVSLEQRADVEAAVAALLAAGAEVRLGGPG 363


>gnl|CDD|143396 cd07077, ALDH-like, NAD(P)+-dependent aldehyde dehydrogenase-like
           (ALDH-like) family.  The aldehyde dehydrogenase-like
           (ALDH-like) group of the ALDH superfamily of
           NAD(P)+-dependent enzymes which, in general, oxidize a
           wide range of  endogenous and exogenous aliphatic and
           aromatic aldehydes to their corresponding carboxylic
           acids and play an  important role in detoxification.
           This group includes families ALDH18, ALDH19, and ALDH20
           and represents such proteins as gamma-glutamyl phosphate
           reductase, LuxC-like acyl-CoA reductase, and coenzyme A
           acylating aldehyde dehydrogenase.  All of these proteins
           have a conserved cysteine that aligns with the catalytic
           cysteine of the ALDH group.
          Length = 397

 Score = 54.2 bits (130), Expect = 6e-08
 Identities = 40/188 (21%), Positives = 60/188 (31%), Gaps = 17/188 (9%)

Query: 164 VPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAGCPPG 223
            P+GVT  I P   P           IA  N  + +P    P      + +L QA     
Sbjct: 99  FPIGVTMHILPSTNPLSGITSAL-RGIATRNQCIFRPHPSAP-FTNRALALLFQAADAAH 156

Query: 224 -----VVNVIHGAHDAVNFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKN 278
                V+ V H + +    +  HP I  I   G   A     +        V       +
Sbjct: 157 GPKILVLYVPHPSDELAEELLSHPKIDLIVATGGRDAVDAAVKHS--PHIPVIGFGAGNS 214

Query: 279 HGVIMSDANRSNTLNQLVGAAF--GAAGQRCMALSTVIFVGEAVEWLPE---LKKRAEAL 333
             V+   A+       +  + F    A   C +   +  V + ++ L E   LK   E L
Sbjct: 215 PVVVDETADEERASGSVHDSKFFDQNA---CASEQNLYVVDDVLDPLYEEFKLKLVVEGL 271

Query: 334 KVSAGNVP 341
           KV     P
Sbjct: 272 KVPQETKP 279


>gnl|CDD|143439 cd07121, ALDH_EutE, Ethanolamine utilization protein EutE-like.
           Coenzyme A acylating aldehyde dehydrogenase (ACDH), an
           NAD+ and CoA-dependent acetaldehyde dehydrogenase,
           acetylating (EC=1.2.1.10), converts acetaldehyde into
           acetyl-CoA.  This CD is limited to such monofunctional
           enzymes as the Ethanolamine utilization protein, EutE,
           in Salmonella typhimurium.  Mutations in eutE abolish
           the ability to utilize ethanolamine as a carbon source.
          Length = 429

 Score = 44.1 bits (105), Expect = 9e-05
 Identities = 55/221 (24%), Positives = 89/221 (40%), Gaps = 38/221 (17%)

Query: 67  AEMESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANMKQLAENITEEQGKTLADAEG 126
           A ++ AVA+AK A + + K ++  R++++  ++  + +N ++LAE   EE G  +   E 
Sbjct: 4   ATVDDAVAAAKAAQKQYRKCTLADREKIIEAIREALLSNAEELAEMAVEETG--MGRVED 61

Query: 127 DVLRGLQVVEQCCSIPTIMQGETLAGVAKDMD--IHSY-RVPLGVTAGITPFNFPA---- 179
            + +     E     P     E L   A   D  +      P GV   ITP   P     
Sbjct: 62  KIAKNHLAAE---KTPGT---EDLTTTAWSGDNGLTLVEYAPFGVIGAITPSTNPTETII 115

Query: 180 --MIPLWMFPVAIACGNTHVIKPSERDPGA---CMMLVDMLTQA----GCPPG-VVNVIH 229
              I +      +A GN  V  P    PGA       V+++ +A    G P   VV V  
Sbjct: 116 NNSISM------LAAGNAVVFNPH---PGAKKVSAYAVELINKAIAEAGGPDNLVVTVEE 166

Query: 230 GAHDAVNFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRV 270
              +  N +  HPDI  +   G    G  + +    +GK+ 
Sbjct: 167 PTIETTNELMAHPDINLLVVTG----GPAVVKAALSSGKKA 203


>gnl|CDD|237956 PRK15398, PRK15398, aldehyde dehydrogenase EutE; Provisional.
          Length = 465

 Score = 43.0 bits (102), Expect = 2e-04
 Identities = 53/244 (21%), Positives = 89/244 (36%), Gaps = 84/244 (34%)

Query: 67  AEMESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANMKQLAENITEEQG-------- 118
           A ++ AVA+AK A + + + S+  RQ+++  ++  +  + ++LAE   EE G        
Sbjct: 36  ASVDDAVAAAKVAQQRYQQKSLAMRQRIIDAIREALLPHAEELAELAVEETGMGRVEDKI 95

Query: 119 -KTLADAE--------------GDVLRGLQVVEQCCSIPTIMQGETLAGVAKDMDIHSYR 163
            K +A AE              GD   GL ++E                           
Sbjct: 96  AKNVAAAEKTPGVEDLTTEALTGD--NGLTLIE--------------------------Y 127

Query: 164 VPLGVTAGITPFNFPA---------MIPLWMFPVAIACGNTHVIKPSERDPGA---CMML 211
            P GV   +TP   P          M         +A GN+ V  P    PGA    +  
Sbjct: 128 APFGVIGAVTPSTNPTETIINNAISM---------LAAGNSVVFSPH---PGAKKVSLRA 175

Query: 212 VDMLTQA----GCPPGVVNVIHGAH-DAVNFICDHPDIRAISFVGSDQAGKYIYERGAKN 266
           +++L +A    G P  +V  +     +    +  HP I  +   G    G  + +   K+
Sbjct: 176 IELLNEAIVAAGGPENLVVTVAEPTIETAQRLMKHPGIALLVVTG----GPAVVKAAMKS 231

Query: 267 GKRV 270
           GK+ 
Sbjct: 232 GKKA 235


>gnl|CDD|143445 cd07127, ALDH_PAD-PaaZ, Phenylacetic acid degradation proteins PaaZ
           (Escherichia coli) and PaaN (Pseudomonas putida)-like.
           Phenylacetic acid degradation (PAD) proteins PaaZ
           (Escherichia coli) and PaaN (Pseudomonas putida) are
           putative aromatic ring cleavage enzymes of the aerobic
           PA catabolic pathway. PaaZ mutants were defective for
           growth with PA as a sole carbon source due to
           interruption of the putative ring opening system.  This
           CD is limited to bacterial monofunctional enzymes.
          Length = 549

 Score = 37.8 bits (88), Expect = 0.012
 Identities = 68/292 (23%), Positives = 110/292 (37%), Gaps = 43/292 (14%)

Query: 52  PATNEVVTRVPKATPAEMESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANMKQLAE 111
           P   E+    P+  P   ++ +A+A+ A   W       R  V   +   + A   ++A 
Sbjct: 72  PYGVELGVTYPQCDP---DALLAAARAAMPGWRDAGARARAGVCLEILQRLNARSFEMAH 128

Query: 112 NI--TEEQGKTLADAEG-----DVLRGLQVV----EQCCSIPTIMQGETLAGVAKD--MD 158
            +  T  Q   +A   G     D  RGL+ V     +   IP   + E   G      M+
Sbjct: 129 AVMHTTGQAFMMAFQAGGPHAQD--RGLEAVAYAWREMSRIPPTAEWEKPQGKHDPLAME 186

Query: 159 IHSYRVPLGVTAGITPFNFPAMIPLW-----MFPVAIACGNTHVIKPSERDPGACMMLV- 212
                VP GV   I    FP     W     +F  ++A GN  ++KP    P A + L  
Sbjct: 187 KTFTVVPRGVALVIGCSTFPT----WNGYPGLF-ASLATGNPVIVKPH---PAAILPLAI 238

Query: 213 ------DMLTQAGCPPGVVNVI--HGAHDAVNFICDHPDIRAISFVGSDQAGKYIYERGA 264
                 ++L +AG  P +V +            +   P++R I F GS+  G ++ E  A
Sbjct: 239 TVQVAREVLAEAGFDPNLVTLAADTPEEPIAQTLATRPEVRIIDFTGSNAFGDWL-EANA 297

Query: 265 KNGKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFV 316
           +   +V +     N  V+ S  +    L  L  +    +GQ C      I+V
Sbjct: 298 R-QAQVYTEKAGVNTVVVDSTDDLKAMLRNLAFSLSLYSGQMCTT-PQNIYV 347


>gnl|CDD|131341 TIGR02288, PaaN_2, phenylacetic acid degradation protein paaN.
           This enzyme is proposed to act in the ring-opening step
           of phenylacetic acid degradation which follows ligation
           of the acid with coenzyme A (by PaaF) and hydroxylation
           by a multicomponent non-heme iron hydroxylase complex
           (PaaGHIJK). Gene symbols have been standardized in. This
           enzyme is related to aldehyde dehydrogenases and has a
           domain which is a member of the pfam00171 family. This
           family includes sequences from Burkholderia, Bordetella,
           Streptomyces. Other PaaN enzymes are represented by a
           separate model, TIGR02278.
          Length = 551

 Score = 37.2 bits (86), Expect = 0.016
 Identities = 42/166 (25%), Positives = 70/166 (42%), Gaps = 22/166 (13%)

Query: 157 MDIHSYRVPLGVTAGITPFNFPAMIPLW-MFP---VAIACGNTHVIKPSERDPGACMMLV 212
           +      VP G+   I    FP     W  +P    ++A GN  ++KP    PGA + L 
Sbjct: 185 LKKRFTIVPRGIALVIGCSTFPT----WNTYPGLFASLATGNPVLVKPH---PGAILPLA 237

Query: 213 -------DMLTQAGCPPGVVNVI--HGAHDAVNFICDHPDIRAISFVGSDQAGKYIYERG 263
                  ++L +AG  P +V +      H+A   +   P +R I F GS+  G+++ E+ 
Sbjct: 238 LTVQVAREVLGEAGFDPNLVTLAAFDPGHEAAQRLATDPAVRIIDFTGSNAFGQWL-EQN 296

Query: 264 AKNGKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMA 309
           A+  + V +     N  +I S  +    L  L  +    +GQ C  
Sbjct: 297 ARQAQ-VYTEKAGVNTVIIESTDDYKAMLRNLAFSLSLYSGQMCTT 341


>gnl|CDD|131570 TIGR02518, EutH_ACDH, acetaldehyde dehydrogenase (acetylating). 
          Length = 488

 Score = 36.4 bits (84), Expect = 0.031
 Identities = 38/167 (22%), Positives = 67/167 (40%), Gaps = 23/167 (13%)

Query: 76  AKKAYETWSKTSIITRQQVLFRLQNIIKA---NMKQLAENITEEQG------KTLADAEG 126
            + A     K + +T++Q+   ++ I+ A   N  +LA+   EE G      K + +   
Sbjct: 14  IRSAKVAQKKLANMTQEQIDKIVKAIVDAAYENAVKLAKMANEETGFGKWEDKVIKN--- 70

Query: 127 DVLRGLQVVEQCCSIPTI-MQGETLAGVAKDMDIHSYRVPLGVTAGITPFNFPAMIPLWM 185
            V     V +    + TI +  E      K ++I    VP+GV AG+ P   P    ++ 
Sbjct: 71  -VFAATIVYDSIKDMKTIGILSEDK--EKKVIEI---AVPVGVVAGLIPSTNPTSTAIYK 124

Query: 186 FPVAIACGNTHVIKPSERDPGACMMLVDMLTQ----AGCPPGVVNVI 228
             ++I   N  V  P        +  V ++ +    AG P G +  I
Sbjct: 125 TLISIKARNAIVFSPHPNAKKCIIETVKLMRKAAEEAGAPEGAIGCI 171


>gnl|CDD|143400 cd07081, ALDH_F20_ACDH_EutE-like, Coenzyme A acylating aldehyde
           dehydrogenase (ACDH), Ethanolamine utilization protein
           EutE, and related proteins.  Coenzyme A acylating
           aldehyde dehydrogenase (ACDH), an NAD+ and CoA-dependent
           acetaldehyde dehydrogenase, acetylating (EC=1.2.1.10),
           functions as a single enzyme (such as the Ethanolamine
           utilization protein, EutE, in Salmonella typhimurium) or
           as part of a multifunctional enzyme to convert
           acetaldehyde into acetyl-CoA. The E. coli
           aldehyde-alcohol dehydrogenase includes the functional
           domains, alcohol dehydrogenase (ADH), ACDH, and
           pyruvate-formate-lyase deactivase; and the Entamoeba
           histolytica aldehyde-alcohol dehydrogenase 2 (ALDH20A1)
           includes the functional domains ADH and ACDH, and may be
           critical enzymes in the fermentative pathway.
          Length = 439

 Score = 34.2 bits (78), Expect = 0.14
 Identities = 42/228 (18%), Positives = 79/228 (34%), Gaps = 24/228 (10%)

Query: 69  MESAVASAKKAYETWSKTSIITRQQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDV 128
           ++ AVA+AK A +  S  S      +        +     LA+    E G      E  V
Sbjct: 1   LDDAVAAAKVAQQGLSCKSQEMVDLIFRAAAEAAEDARIDLAKLAVSETGMGRV--EDKV 58

Query: 129 L------RGLQVVEQCCSIPTIMQGETLAGVAKDMDIHSYRVPLGVTAGITPFNFPAMIP 182
           +        +  V +      ++ G+   G     +      P+GV A ITP   P    
Sbjct: 59  IKNHFAAEYIYNVYKDEKTCGVLTGDENGGTLIIAE------PIGVVASITPSTNPTSTV 112

Query: 183 LWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAGCPPG-----VVNVIHGAHDAVNF 237
           ++   +++   N+ +  P  R          +L QA    G     +  + + + +    
Sbjct: 113 IFKSLISLKTRNSIIFSPHPRAKKVTQRAATLLLQAAVAAGAPENLIGWIDNPSIELAQR 172

Query: 238 ICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNHGVIMSD 285
           +   P I  +   G    G  + +    +GK     +GA N  V++ +
Sbjct: 173 LMKFPGIGLLLATG----GPAVVKAAYSSGKPAIG-VGAGNTPVVIDE 215


>gnl|CDD|143440 cd07122, ALDH_F20_ACDH, Coenzyme A acylating aldehyde dehydrogenase
           (ACDH), ALDH family 20-like.  Coenzyme A acylating
           aldehyde dehydrogenase (ACDH, EC=1.2.1.10), an NAD+ and
           CoA-dependent acetaldehyde dehydrogenase, functions as a
           single enzyme (such as the Ethanolamine utilization
           protein, EutE, in Salmonella typhimurium) or as part of
           a multifunctional enzyme to convert acetaldehyde into
           acetyl-CoA . The E. coli aldehyde-alcohol dehydrogenase
           includes the functional domains, alcohol dehydrogenase
           (ADH), ACDH, and pyruvate-formate-lyase deactivase; and
           the Entamoeba histolytica aldehyde-alcohol dehydrogenase
           2 (ALDH20A1) includes the functional domains ADH and
           ACDH and may be critical enzymes in the fermentative
           pathway.
          Length = 436

 Score = 33.6 bits (78), Expect = 0.18
 Identities = 21/91 (23%), Positives = 35/91 (38%), Gaps = 9/91 (9%)

Query: 164 VPLGVTAGITPFNFPAMIPLWMFPVAIA--CGNTHVIKPSERDPGACMMLVDMLTQA--- 218
            P+GV A + P   P      +F   IA    N  +  P  R     +    ++ +A   
Sbjct: 94  EPVGVIAALIPSTNPTSTA--IFKALIALKTRNAIIFSPHPRAKKCSIEAAKIMREAAVA 151

Query: 219 -GCPPGVVNVI-HGAHDAVNFICDHPDIRAI 247
            G P G++  I   + +    +  HPD+  I
Sbjct: 152 AGAPEGLIQWIEEPSIELTQELMKHPDVDLI 182


>gnl|CDD|237016 PRK11903, PRK11903, aldehyde dehydrogenase; Provisional.
          Length = 521

 Score = 33.9 bits (78), Expect = 0.18
 Identities = 63/243 (25%), Positives = 95/243 (39%), Gaps = 51/243 (20%)

Query: 164 VPL-GVTAGITPFNFPAMIPLW-MFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAGC- 220
           VP  GV   I  FNFPA   LW     A+  G   ++KP+         +V  +  AG  
Sbjct: 146 VPTRGVALFINAFNFPAW-GLWEKAAPALLAGVPVIVKPATATAWLTQRMVKDVVAAGIL 204

Query: 221 PPGVVNVIHGAH----DAVNFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGA 276
           P G ++V+ G+     D +    D      +SF GS +    +    A   + V+ N+ A
Sbjct: 205 PAGALSVVCGSSAGLLDHLQ-PFD-----VVSFTGSAETAAVLRSHPAVVQRSVRVNVEA 258

Query: 277 K--NHGVIMSDANRSNTLNQLVGAAFGA------------AGQRCMALSTVIFVGEAVEW 322
              N  ++  DA            AF              +GQ+C A+   IFV EA+  
Sbjct: 259 DSLNSALLGPDAAPG-------SEAFDLFVKEVVREMTVKSGQKCTAIRR-IFVPEAL-- 308

Query: 323 LPELKKRAEAL-----KVSAGNVPGTD---LGPVISPQAKERILSLIQSGVDEGATLLLD 374
                  AEAL     K + GN P  D   +GP++S      + + +   +   A +L D
Sbjct: 309 ---YDAVAEALAARLAKTTVGN-PRNDGVRMGPLVSRAQLAAVRAGLA-ALRAQAEVLFD 363

Query: 375 GRN 377
           G  
Sbjct: 364 GGG 366


>gnl|CDD|237402 PRK13504, PRK13504, sulfite reductase subunit beta; Provisional.
          Length = 569

 Score = 31.7 bits (73), Expect = 0.89
 Identities = 19/53 (35%), Positives = 25/53 (47%), Gaps = 7/53 (13%)

Query: 90  TRQQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIP 142
           TRQ   F+   I+K N+K + + I      TLA A GDV R +     C   P
Sbjct: 111 TRQT--FQFHGILKKNLKPVIQTINSVLLDTLA-ACGDVNRNV----MCTPNP 156


>gnl|CDD|213736 TIGR02773, addB_Gpos, helicase-exonuclease AddAB, AddB subunit.  DNA
            repair is accomplished by several different systems in
            prokaryotes. Recombinational repair of double-stranded
            DNA breaks involves the RecBCD pathway in some lineages,
            and AddAB (also called RexAB) in other. The AddA protein
            is conserved between the firmicutes and the
            alphaproteobacteria, while the partner protein is not.
            Nevertheless, the partner is designated AddB in both
            systems. This model describes the AddB protein as found
            Bacillus subtilis and related species. Although the RexB
            protein of Streptococcus and Lactococcus is considered to
            be orthologous, functionally equivalent, and merely named
            differently, all members of this protein family have a
            P-loop nucleotide binding motif GxxGxGK[ST] at the
            N-terminus, unlike RexB proteins, and a CxxCxxxxxC motif
            at the C-terminus, both of which may be relevant to
            function [DNA metabolism, DNA replication, recombination,
            and repair].
          Length = 1160

 Score = 31.2 bits (71), Expect = 1.2
 Identities = 11/42 (26%), Positives = 21/42 (50%), Gaps = 1/42 (2%)

Query: 74   ASAKKAYETWSKTSIITRQQVLFRLQNIIKANMKQLAENITE 115
            AS KK     S++   T ++    L+  ++   ++  ENIT+
Sbjct: 1062 ASLKKDGSLGSRSKAATEEE-FELLRKHVRRKFQEAGENITD 1102


>gnl|CDD|185342 PRK15445, PRK15445, arsenical pump membrane protein; Provisional.
          Length = 427

 Score = 30.9 bits (71), Expect = 1.4
 Identities = 23/62 (37%), Positives = 29/62 (46%), Gaps = 25/62 (40%)

Query: 339 NVPGTDLGPVISPQAKERILSLIQSGVDEGATLL----LDGRNL--------KVGINVPI 386
           NV G+DLGP I+P     I SL        ATLL    L  + +        KVGI + +
Sbjct: 364 NVIGSDLGPKITP-----IGSL--------ATLLWLHVLAQKGVKISWGYYFKVGIIITL 410

Query: 387 PV 388
           PV
Sbjct: 411 PV 412


>gnl|CDD|119386 cd00163, RNase_A, RNase A family, or Pancreatic RNases family;
           includes vertebrate RNase homologs to the bovine
           pancreatic ribonuclease A (RNase A). Many of these
           enzymes have special biological activities; for example,
           some stimulate the development of vascular endothelial
           cells, dendritic cells, and neurons, are
           cytotoxic/anti-tumoral and/or anti-pathogenic. RNase A
           is involved in endonucleolytic cleavage of
           3'-phosphomononucleotides and 3'-phosphooligonucleotides
           ending in C-P or U-P with 2',3'-cyclic phosphate
           intermediates. The catalytic mechanism is a
           transphosphorylation of P-O 5' bonds on the 3' side of
           pyrimidines and subsequent hydrolysis to generate 3'
           phosphate groups. The RNase A family proteins have a
           conserved catalytic triad (two histidines and one
           lysine); recently some family members lacking the
           catalytic residues have been identified. They also share
           three or four disulfide bonds. The most conserved
           disulfide bonds are close to the N and C termini and
           contribute most significantly to the conformational
           stability. 8 RNase A homologs had initially been
           identified in the human genome, pancreatic RNase (RNase
           1), Eosinophil Derived Neurotoxin (EDN/RNASE 2),
           Eosinophil Cationic Protein (ECP/RNase 3), RNase 4,
           Angiogenin (RNase 5), RNase 6 or k6, the skin derived
           RNase (RNase 7) and RNase 8. These eight human genes are
           all located in a cluster on chromosome 14. Recent
           genomic analysis has extended the family to 13
           sequences. However only the first eight identified human
           RNases, which are refered to as "canonical" RNases,
           contain the catalytic residues required for RNase A
           activity. The new genes corresponding to RNases 9-13 are
           also located in the same chromosome cluster and seem to
           be related to male-reproductive functions. RNases 9-13
           have the characteristic disulfide bridge pattern but are
           unlikely to share RNase activity. The RNase A family
           most likely started off in vertebrates as a host-defense
           protein, and comparative analysis in mammals and birds
           indicates that the family may have originated from a
           RNase 5-like gene. This hypothesis is supported by the
           fact that only RNase 5-like RNases have been reported
           outside the mammalian class. The RNase 5 group would
           therefore be the most ancient form of this family, and
           all other members would have arisen during mammalian
           evolution.
          Length = 119

 Score = 29.6 bits (67), Expect = 1.4
 Identities = 14/50 (28%), Positives = 22/50 (44%), Gaps = 4/50 (8%)

Query: 199 KPSERDPGAC--MMLVDMLTQAG-CPPGVVNVIHGAHDAVNFICDHPDIR 245
             S  DP  C  MM    +T+ G C       IH + + V  +C++P + 
Sbjct: 14  PKSNGDPRYCNSMMRRKKVTENGSCKK-QHTFIHESLEEVQAVCNNPSVF 62


>gnl|CDD|161862 TIGR00407, proA, gamma-glutamyl phosphate reductase.  The related
           model TIGR01092 describes a full-length fusion protein
           delta l-pyrroline-5-carboxylate synthetase that includes
           a gamma-glutamyl phosphate reductase region as described
           by this model. Alternate name: glutamate-5-semialdehyde
           dehydrogenase. The prosite motif begins at residue 332
           of the seed alignment although not all of the members of
           the family exactly obey the motif [Amino acid
           biosynthesis, Glutamate family].
          Length = 398

 Score = 30.5 bits (69), Expect = 1.6
 Identities = 39/174 (22%), Positives = 66/174 (37%), Gaps = 27/174 (15%)

Query: 76  AKKAYETWSKTSIITRQQVLFRLQNIIKANMKQ-LAEN-----------ITEEQGKTLAD 123
           AK+A    ++ S   +   L ++ + ++A     LA N           + +     L  
Sbjct: 1   AKQAANILAQLSTAEKNDALSKIADGLEAQAPAILAANAKDIAVAKENGLADALLDRLLL 60

Query: 124 AEGDVLRGLQVVEQCCSIPTIMQGETLAGVAKD--MDIHSYRVPLGVTAGITPFNFPAMI 181
            EG +      V+    +   + G+ + G   D  + +   RVPLGV   I    + A  
Sbjct: 61  TEGRLKGIADGVKDVIELADPV-GKVIDGRELDSGLTLERVRVPLGVLGVI----YEAR- 114

Query: 182 PLWMFPVAIAC---GNTHVIKPSERDPGACMMLV----DMLTQAGCPPGVVNVI 228
           P     +A  C   GN  +++  +    +   LV    D L Q G P G V +I
Sbjct: 115 PNVTVDIASLCLKTGNAVILRGGKEAVRSNKALVEVIQDALAQTGLPVGAVQLI 168


>gnl|CDD|143399 cd07080, ALDH_Acyl-CoA-Red_LuxC, Acyl-CoA reductase LuxC.  Acyl-CoA
           reductase, LuxC, (EC=1.2.1.50) is the fatty acid
           reductase enzyme responsible for synthesis of the
           aldehyde  substrate for the luminescent reaction
           catalyzed by luciferase. The fatty acid reductase, a
           luminescence-specific, multienzyme complex (LuxCDE),
           reduces myristic acid to generate the long chain fatty
           aldehyde required for the luciferase-catalyzed reaction
           resulting in the emission of blue-green light.
           Mutational studies of conserved cysteines of LuxC
           revealed that the cysteine which aligns with the
           catalytic cysteine conserved throughout the ALDH
           superfamily is the LuxC acylation site. This CD is
           composed of mainly bacterial sequences but also includes
           a few archaeal sequences similar to the Methanospirillum
           hungateiacyl acyl-CoA reductase RfbN.
          Length = 422

 Score = 30.7 bits (70), Expect = 1.6
 Identities = 26/106 (24%), Positives = 40/106 (37%), Gaps = 15/106 (14%)

Query: 163 RVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGA----CMMLVDM---- 214
             P G+   I   N P ++P+W     +   N +++K S  DP         L D+    
Sbjct: 110 AQPRGLVVHIIAGNVP-LLPVWSIVRGLLVKNVNLLKMSSSDPLTATALLRSLADVDPNH 168

Query: 215 -LTQAGCPPGVVNVIHGAHDAVNFICDHPDIRAISFVGSDQAGKYI 259
            LT +     VV    G  +    I    D  A+   G ++A K I
Sbjct: 169 PLTDSIS---VVYWPGGDAELEERILASAD--AVVAWGGEEAVKAI 209


>gnl|CDD|143446 cd07128, ALDH_MaoC-N, N-terminal domain of the monoamine oxidase C
           dehydratase.  The N-terminal domain of the MaoC
           dehydratase, a monoamine oxidase regulatory protein.
           Orthologs of MaoC include PaaZ (Escherichia coli) and
           PaaN (Pseudomonas putida), which are putative
           ring-opening enzymes of the aerobic phenylacetic acid
           (PA) catabolic pathway. The C-terminal domain of MaoC
           has sequence similarity to enoyl-CoA hydratase. Also
           included in this CD is a novel Burkholderia xenovorans
           LB400 ALDH of the aerobic benzoate oxidation (box)
           pathway. This pathway involves first the synthesis of a
           CoA thio-esterified aromatic acid, with subsequent
           dihydroxylation and cleavage steps, yielding the CoA
           thio-esterified aliphatic aldehyde,
           3,4-dehydroadipyl-CoA semialdehyde, which is further
           converted into its corresponding CoA acid by the
           Burkholderia LB400 ALDH.
          Length = 513

 Score = 30.7 bits (70), Expect = 1.6
 Identities = 47/207 (22%), Positives = 84/207 (40%), Gaps = 14/207 (6%)

Query: 164 VPL-GVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGACMMLVDMLTQAG-CP 221
            P  GV   I  FNFP    L  F  A+  G   ++KP+         +V  + ++G  P
Sbjct: 142 TPRRGVAVHINAFNFPVWGMLEKFAPALLAGVPVIVKPATATAYLTEAVVKDIVESGLLP 201

Query: 222 PGVVNVIHGAHDAVNFICDHPDIR-AISFVGSDQAGKYIYERGAKNGKRVQSNMGAK--N 278
            G + +I G+   V  + DH   +  ++F GS      +        + ++ N  A   N
Sbjct: 202 EGALQLICGS---VGDLLDHLGEQDVVAFTGSAATAAKLRAHPNIVARSIRFNAEADSLN 258

Query: 279 HGVIMSDANRSNT-----LNQLVGAAFGAAGQRCMALSTVIFVGEAVEWLPE-LKKRAEA 332
             ++  DA          + ++       AGQ+C A+         V+ + E LK R   
Sbjct: 259 AAILGPDATPGTPEFDLFVKEVAREMTVKAGQKCTAIRRAFVPEARVDAVIEALKARLAK 318

Query: 333 LKVSAGNVPGTDLGPVISPQAKERILS 359
           + V    + G  +GP++S + +E + +
Sbjct: 319 VVVGDPRLEGVRMGPLVSREQREDVRA 345


>gnl|CDD|213358 cd12824, ZntB-like, Salmonella typhimurium Zn2+ transporter
           ZntB-like subfamily.  A bacterial subfamily belonging to
           the Escherichia coli CorA-Salmonella typhimurium
           ZntB_like family (EcCorA_ZntB-like) family of the MIT
           superfamily of essential membrane proteins involved in
           transporting divalent cations (uptake or efflux) across
           membranes. This subfamily includes the Zn2+ transporter
           Salmonella typhimurium ZntB which mediates the efflux of
           Zn2+ (and Cd2+). Structures of the intracellular domain
           of Vibrio parahaemolyticus and Salmonella typhimurium
           ZntB form funnel-shaped homopentamers, the tip of the
           funnel is formed from two C-terminal transmembrane (TM)
           helices from each monomer, and the large opening of the
           funnel from the N-terminal cytoplasmic domains. The GMN
           signature motif of the MIT superfamily occurs just after
           TM1, mutation within this motif is known to abolish Mg2+
           transport through Salmonella typhimurium CorA, and
           Mrs2p. Natural variants such as GVN and GIN, which occur
           in proteins belonging to this subfamily, may be
           associated with the transport of different divalent
           cations, such as zinc and cadmium. The functional
           diversity of MIT transporters may also be due to minor
           structural differences regulating gating, substrate
           selection, and transport.
          Length = 290

 Score = 30.2 bits (68), Expect = 2.2
 Identities = 23/101 (22%), Positives = 42/101 (41%), Gaps = 8/101 (7%)

Query: 84  SKTSIITRQQVLFRLQNIIKANMKQL-------AENITEEQGKTLADAEGDVLRGLQVVE 136
              ++I  Q+ LFR++  I+   + L       AE +     +TL  A  +V R L+ ++
Sbjct: 146 PPYALIALQRRLFRIRRQIRPQQQALRPLITDPAELLVTSDRETLRRASNNVTRYLETLD 205

Query: 137 QCCSIPTIMQGETLAGVAKDMDIHSYRVPLGVTAGITPFNF 177
            C +   ++Q E  A   +     S  + + V     P  F
Sbjct: 206 FCRTRVALLQDEIKAQRDEKTTRSSLLLTV-VATVFLPLTF 245


>gnl|CDD|191617 pfam06830, Root_cap, Root cap.  The cells at the periphery of the
           root cap are continuously sloughed off from the root
           into the mucilage, and are thought to be programmed to
           die.This family represents a conserved region
           approximately 60 residues in length within plant root
           cap proteins, which may be involved in the process.
          Length = 57

 Score = 27.6 bits (62), Expect = 2.2
 Identities = 14/36 (38%), Positives = 16/36 (44%), Gaps = 4/36 (11%)

Query: 413 GFYFYTETKTVT----QLWRESDVTHSKAAVSMPVM 444
           GF FY  +  V     Q +R   V   K   SMPVM
Sbjct: 1   GFKFYDLSDDVHGVLGQTYRPDYVNPVKVGASMPVM 36


>gnl|CDD|149105 pfam07851, TMPIT, TMPIT-like protein.  A number of members of this
           family are annotated as being transmembrane proteins
           induced by tumour necrosis factor alpha, but no
           literature was found to support this.
          Length = 330

 Score = 30.1 bits (68), Expect = 2.4
 Identities = 13/47 (27%), Positives = 25/47 (53%), Gaps = 3/47 (6%)

Query: 84  SKTSIITRQQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVLR 130
             TS I RQ+   RL+ +I  ++K+L +++T E  + +   E  +  
Sbjct: 36  QCTSAIARQR--KRLKQLIV-SLKKLKKSLTPEDSELVEQLEEQIKE 79


>gnl|CDD|218799 pfam05893, LuxC, Acyl-CoA reductase (LuxC).  This family consists
           of several bacterial Acyl-CoA reductase (LuxC) proteins.
           The channelling of fatty acids into the fatty aldehyde
           substrate for the bacterial bioluminescence reaction is
           catalyzed by a fatty acid reductase multienzyme complex,
           which channels fatty acids through the thioesterase
           (LuxD), synthetase (LuxE) and reductase (LuxC)
           components.
          Length = 400

 Score = 30.0 bits (68), Expect = 2.6
 Identities = 12/45 (26%), Positives = 20/45 (44%), Gaps = 1/45 (2%)

Query: 163 RVPLGVTAGITPFNFPAMIPLWMFPVAIACGNTHVIKPSERDPGA 207
             P G+   +   N P ++P+      I   N +++KPS  DP  
Sbjct: 86  AQPRGLVVHVLAGNVP-LLPVMSILQGILVKNVNLLKPSSSDPFT 129


>gnl|CDD|224439 COG1522, Lrp, Transcriptional regulators [Transcription].
          Length = 154

 Score = 28.5 bits (64), Expect = 4.2
 Identities = 14/44 (31%), Positives = 21/44 (47%), Gaps = 1/44 (2%)

Query: 349 ISPQA-KERILSLIQSGVDEGATLLLDGRNLKVGINVPIPVPLS 391
           +SP     RI  L + GV +G T +LD   L + +   + V L 
Sbjct: 34  LSPSTVLRRIKRLEEEGVIKGYTAVLDPEKLGLDLTAFVEVKLE 77


>gnl|CDD|153389 cd07952, ED_3B_like, Uncharacterized class III extradiol
           dioxygenases.  This subfamily is composed of proteins of
           unknown function with similarity to the catalytic B
           subunit of class III extradiol dioxygenases. Class III
           extradiol dioxygenases use a non-heme Fe(II) to cleave
           aromatic rings between a hydroxylated carbon and an
           adjacent non-hydroxylated carbon. They play key roles in
           the degradation of aromatic compounds.
          Length = 256

 Score = 28.8 bits (65), Expect = 4.7
 Identities = 7/43 (16%), Positives = 15/43 (34%), Gaps = 3/43 (6%)

Query: 148 ETLAGVAKDMDIHSYRVPLGVTAG---ITPFNFPAMIPLWMFP 187
             +   A+   I    +    ++G     P ++  +IPL    
Sbjct: 84  NEIYKSARADGIPVLGINFATSSGDNSDFPLDWGELIPLSFLK 126


>gnl|CDD|182040 PRK09707, PRK09707, putative lipoprotein; Provisional.
          Length = 1343

 Score = 29.3 bits (65), Expect = 5.5
 Identities = 22/73 (30%), Positives = 35/73 (47%), Gaps = 4/73 (5%)

Query: 224 VVNVIHGAHDAVNFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNHGVIM 283
           +V V  G+H  VN      D    +F+G+ +A +YIY   A +G+   S  G  + G+I 
Sbjct: 318 LVGVQDGSHGVVNVT----DKGHWNFLGTGEAFRYIYIGDAGDGELNVSREGKVDSGIIT 373

Query: 284 SDANRSNTLNQLV 296
           +    + T N  V
Sbjct: 374 AGMKETGTGNLTV 386


>gnl|CDD|234344 TIGR03754, conj_TOL_TraD, conjugative coupling factor TraD, TOL
           family.  Members of this protein are assigned by
           homology to the TraD family of conjugative coupling
           factor. This particular clade serves as a marker for an
           extended gene region that occurs occasionally on
           plasmids, including the toluene catabolism TOL plasmid.
           More commonly, the gene region is chromosomal, flanked
           by various markers of conjugative transfer and
           insertion.
          Length = 643

 Score = 29.3 bits (66), Expect = 5.7
 Identities = 27/121 (22%), Positives = 45/121 (37%), Gaps = 20/121 (16%)

Query: 56  EVVTRVPKATPAEMESAVASAKKAYETWSKTSIITR-----------QQVLFRLQNIIKA 104
           EV TR+      E  SA   A K +  W   +I+ R           Q +L  + NI   
Sbjct: 264 EVATRITGQLSGEGNSA---AFKEF-AWRFVNIVARALVALGQRPDYQLILRYVTNIDPL 319

Query: 105 NMKQLAENITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQGETLAGVAKDMDIHSYRV 164
            M + A+ I  +          +++  L    +  ++P  MQG     VA +  +   R+
Sbjct: 320 FM-EYADKILPKHDPKAW----ELIEELIARLKDRNVPRNMQGRPPRVVAIEQYLKEKRL 374

Query: 165 P 165
            
Sbjct: 375 Y 375


>gnl|CDD|217546 pfam03417, AAT, Acyl-coenzyme A:6-aminopenicillanic acid
           acyl-transferase. 
          Length = 223

 Score = 28.4 bits (64), Expect = 5.8
 Identities = 16/97 (16%), Positives = 32/97 (32%), Gaps = 17/97 (17%)

Query: 169 TAGITPFNFPAMIPLWMFPV---AIACGNT--HVIKPSERDPGACMMLVDMLTQAGCPPG 223
            +G     +   IP     +    +A G    H+ K  +  P     +  +L +      
Sbjct: 30  ESGFATIGYAGRIPGRTDGINEAGLAMGINFLHLRKLGDGFP--RHFITRLLLECT---- 83

Query: 224 VVNVIHGAHDAVNFICDHPDIRAISFVGSDQAGKYIY 260
             +V     +AV  + + P     +++  D AG    
Sbjct: 84  --SV----EEAVKALKEIPHAAGFNYILLDAAGNIAV 114


>gnl|CDD|218397 pfam05044, Prox1, Homeobox prospero-like protein (PROX1).  The
           homeobox gene Prox1 is expressed in a subpopulation of
           endothelial cells that, after budding from veins, gives
           rise to the mammalian lymphatic system. Prox1 has been
           found to be an early specific marker for the developing
           liver and pancreas in the mammalian foregut endoderm.
           This family contains an atypical homeobox domain.
          Length = 908

 Score = 29.3 bits (65), Expect = 5.9
 Identities = 31/138 (22%), Positives = 44/138 (31%), Gaps = 30/138 (21%)

Query: 261 ERGAKNGKRVQSNMGAKNHGVIMSDANRSNTLNQLVGAAFGAAGQRCMALSTVIFVGEAV 320
           E  A NG    S  G+       +D+N S  +N   G+   +        S V       
Sbjct: 63  EFSAMNGHAPHSPHGSD------TDSNESKAVNPGPGSIRSSDSSAK---SDV------- 106

Query: 321 EWLPELKKRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATLLLDGRNLKV 380
             L +L KRA + + +    PGT     I  Q  +  ++    G D G     D      
Sbjct: 107 --LRKLLKRANSYEDAMMPFPGT----TIISQLLKNNMAKNGGGPDRGEGDRGDREAP-- 158

Query: 381 GINVPIPVPLSMFSFTGS 398
                   P S  S T  
Sbjct: 159 ------SFPASALSNTKP 170


>gnl|CDD|131097 TIGR02042, sir, ferredoxin-sulfite reductase.  Distantly related to
           the iron-sulfur hemoprotein of sulfite reductase (NADPH)
           found in Proteobacteria and Eubacteria, sulfite
           reductase (ferredoxin) is a cyanobacterial and plant
           monomeric enzyme that also catalyzes the reduction of
           sulfite to sulfide [Central intermediary metabolism,
           Sulfur metabolism].
          Length = 577

 Score = 29.0 bits (65), Expect = 6.2
 Identities = 17/41 (41%), Positives = 23/41 (56%), Gaps = 3/41 (7%)

Query: 90  TRQQVLFRLQNIIKANMKQLAENITEEQGKTLADAEGDVLR 130
           TRQ   F+L  I+K N+K +   I +  G TL  A GD+ R
Sbjct: 101 TRQT--FQLHGILKKNLKTVISTIVKNLGSTLG-ACGDLNR 138


>gnl|CDD|184851 PRK14849, PRK14849, putative lipoprotein/autotransporter
           domain-containing protein; Provisional.
          Length = 1806

 Score = 28.9 bits (64), Expect = 6.6
 Identities = 21/70 (30%), Positives = 34/70 (48%), Gaps = 4/70 (5%)

Query: 224 VVNVIHGAHDAVNFICDHPDIRAISFVGSDQAGKYIYERGAKNGKRVQSNMGAKNHGVIM 283
           +V V  G+H  VN      D    +F+G+ +A +YIY   A +G+   S  G  + G+I 
Sbjct: 318 LVGVQDGSHGVVNVT----DKGHWNFLGTGEAFRYIYIGDAGDGELNVSREGKVDSGIIT 373

Query: 284 SDANRSNTLN 293
           +    + T N
Sbjct: 374 AGMKETGTGN 383


>gnl|CDD|185768 cd09245, BRO1_UmRIM23-like, Protein-interacting, Bro1-like domain
           of Ustilago maydis Rim23 (PalC), and related domains.
           This family contains the Bro1-like domain of Ustilago
           maydis Rim23 (also known as PalC), and related proteins.
           It belongs to the BRO1_Alix_like superfamily which
           includes the Bro1-like domains of mammalian Alix
           (apoptosis-linked gene-2 interacting protein X),
           His-Domain type N23 protein tyrosine phosphatase
           (HD-PTP, also known as PTPN23), RhoA-binding proteins
           Rhophilin-1 and -2, Brox, Bro1 and Rim20 (also known as
           PalA) from Saccharomyces cerevisiae, and related
           domains. Alix, HD-PTP, Brox, Bro1, Rim20, and Rim23
           interact with the ESCRT (Endosomal Sorting Complexes
           Required for Transport) system. Rim20 and Rim23
           participate in the response to the external pH via the
           Rim101 pathway. Through its Bro1-like domain, Rim23
           allows the interaction between the endosomal and plasma
           membrane complexes. Bro1-like domains are
           boomerang-shape, and part of the domain is a
           tetratricopeptide repeat (TPR)-like structure.
           Intermediates in the Rim101 pathway may play roles in
           the pathogenesis of fungal corneal infection during
           Candida albicans keratitis. This family lacks the
           V-shaped (V) domain found in many members of the
           BRO1_Alix_like superfamily.
          Length = 413

 Score = 28.5 bits (64), Expect = 7.5
 Identities = 17/56 (30%), Positives = 24/56 (42%)

Query: 316 VGEAVEWLPELKKRAEALKVSAGNVPGTDLGPVISPQAKERILSLIQSGVDEGATL 371
           VGEA+ WL   KK  E LK  +G      L      + ++R +       +E  TL
Sbjct: 312 VGEAIGWLRAAKKELEDLKSPSGVASKAKLKKSWKEKREDRKVEKGAGVEEELRTL 367


>gnl|CDD|178550 PLN02966, PLN02966, cytochrome P450 83A1.
          Length = 502

 Score = 28.6 bits (63), Expect = 7.5
 Identities = 34/146 (23%), Positives = 55/146 (37%), Gaps = 14/146 (9%)

Query: 34  IDGKFVESKATDWIDL------HNPATNEVVTRVPKATPAEMESAVASAKKAYETWSKTS 87
           +D K V+ +    IDL        P  +E      KA   ++  A      A   W  T 
Sbjct: 256 LDPKRVKPETESMIDLLMEIYKEQPFASEFTVDNVKAVILDIVVAGTDTAAAAVVWGMTY 315

Query: 88  IITRQQVLFRLQNIIKANMKQLAEN-ITEEQGKTLADAEGDVLRGLQVVEQCCSIPTIMQ 146
           ++   QVL + Q  ++  MK+     +TE+  K L        R L  V++   I  ++ 
Sbjct: 316 LMKYPQVLKKAQAEVREYMKEKGSTFVTEDDVKNLP-----YFRAL--VKETLRIEPVIP 368

Query: 147 GETLAGVAKDMDIHSYRVPLGVTAGI 172
                   +D  I  Y +P G T  +
Sbjct: 369 LLIPRACIQDTKIAGYDIPAGTTVNV 394


>gnl|CDD|130895 TIGR01836, PHA_synth_III_C, poly(R)-hydroxyalkanoic acid synthase,
           class III, PhaC subunit.  This model represents the PhaC
           subunit of a heterodimeric form of polyhydroxyalkanoic
           acid (PHA) synthase. Excepting the PhaC of Bacillus
           megaterium (which needs PhaR), all members require PhaE
           (TIGR01834) for activity and are designated class III.
           This enzyme builds ester polymers for carbon and energy
           storage that accumulate in inclusions, and both this
           enzyme and the depolymerase associate with the
           inclusions. Class III enzymes polymerize
           short-chain-length hydroxyalkanoates [Fatty acid and
           phospholipid metabolism, Biosynthesis].
          Length = 350

 Score = 28.2 bits (63), Expect = 8.5
 Identities = 16/69 (23%), Positives = 25/69 (36%), Gaps = 10/69 (14%)

Query: 19  TFRQFIRHIATTKLYIDGKF-VESKATDWIDLHN---PATNEVVTR---VPKATPAEMES 71
            FRQF++        I+G+  +  +    +DL N   P  N    R   VP      +  
Sbjct: 254 AFRQFVKDFYQQNGLINGEVEIGGRK---VDLKNIKMPILNIYAERDHLVPPDASKALND 310

Query: 72  AVASAKKAY 80
            V+S     
Sbjct: 311 LVSSEDYTE 319


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.134    0.399 

Gapped
Lambda     K      H
   0.267   0.0669    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 22,277,216
Number of extensions: 2154479
Number of successful extensions: 2697
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2309
Number of HSP's successfully gapped: 177
Length of query: 445
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 345
Effective length of database: 6,502,202
Effective search space: 2243259690
Effective search space used: 2243259690
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 61 (27.4 bits)