RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy3132
         (507 letters)



>gnl|CDD|143421 cd07103, ALDH_F5_SSADH_GabD, Mitochondrial succinate-semialdehyde
           dehydrogenase and ALDH family members 5A1 and 5F1-like. 
           Succinate-semialdehyde dehydrogenase, mitochondrial
           (SSADH, GabD, EC=1.2.1.24) catalyzes the NAD+-dependent
           oxidation of succinate semialdehyde (SSA) to succinate.
           This group includes the human aldehyde dehydrogenase
           family 5 member A1 (ALDH5A1) which is a mitochondrial
           homotetramer that converts SSA to succinate in the last
           step of 4-aminobutyric acid (GABA) catabolism. This CD
           also includes the Arabidopsis SSADH gene product
           ALDH5F1. Mutations in this gene result in the
           accumulation of H2O2, suggesting a role in plant defense
           against the environmental stress of elevated reactive
           oxygen species.
          Length = 451

 Score =  723 bits (1869), Expect = 0.0
 Identities = 249/457 (54%), Positives = 326/457 (71%), Gaps = 6/457 (1%)

Query: 47  VYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNKQA 106
           V NPA  E + +VPD G  D   AI +A  AF+ W +TT +ER  +L+RW   +    + 
Sbjct: 1   VINPATGEVIGEVPDAGAADADAAIDAAAAAFKTWRKTTARERAAILRRWADLIRERAED 60

Query: 107 LAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVPSPVKSKELLMVREPLGV 166
           LA ++T E GKP  E+ GEV YA SF++WF+EEARR YG  +PSP   K +L++++P+GV
Sbjct: 61  LARLLTLEQGKPLAEARGEVDYAASFLEWFAEEARRIYGRTIPSPAPGKRILVIKQPVGV 120

Query: 167 VGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPKGVLNVV 226
           V  ITPWNFP AMITRK+APALAAGCT V+KPAE+TPL+ALALA LA EA +P GVLNVV
Sbjct: 121 VAAITPWNFPAAMITRKIAPALAAGCTVVLKPAEETPLSALALAELAEEAGLPAGVLNVV 180

Query: 227 TSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNAPFIVYS 286
           T S      +G+ LC SP V  ISFTGST VGK+L +  +  +KR+SLELGGNAPFIV+ 
Sbjct: 181 TGS---PAEIGEALCASPRVRKISFTGSTAVGKLLMAQAADTVKRVSLELGGNAPFIVFD 237

Query: 287 NANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQGDTSKND 346
           +A+L KAV GA+A+KFRN GQ CV ANR  + E ++D+FVE L VE  K    G+   ++
Sbjct: 238 DADLDKAVDGAIASKFRNAGQTCVCANRIYVHESIYDEFVEKL-VERVKKLKVGN-GLDE 295

Query: 347 KCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSKFFAPTILTNITPDMLCY 406
             ++GPLIN   VEKV+++V+DA++KGAKV+ GG ++  LG  F+ PT+LT++T DML  
Sbjct: 296 GTDMGPLINERAVEKVEALVEDAVAKGAKVLTGGKRLG-LGGYFYEPTVLTDVTDDMLIM 354

Query: 407 TEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQVGMVGINEG 466
            EE FGPVA  I F TE+E +  AN++  GLA Y FT D+++AWRV + L+ GMVGIN G
Sbjct: 355 NEETFGPVAPIIPFDTEDEVIARANDTPYGLAAYVFTRDLARAWRVAEALEAGMVGINTG 414

Query: 467 IISHTEAAFGGVKESGLGREGSRHGMEEFTQIKYMCF 503
           +IS  EA FGGVKESGLGREG + G+EE+ + KY+  
Sbjct: 415 LISDAEAPFGGVKESGLGREGGKEGLEEYLETKYVSL 451


>gnl|CDD|215157 PLN02278, PLN02278, succinic semialdehyde dehydrogenase.
          Length = 498

 Score =  645 bits (1667), Expect = 0.0
 Identities = 259/479 (54%), Positives = 346/479 (72%), Gaps = 8/479 (1%)

Query: 30  TQAYINGEWVHS--SKKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPK 87
           TQ  I G+W  +   K FPVYNPA  E +A+VP MG  +   AI SA +AF  WS+ T  
Sbjct: 25  TQGLIGGKWTDAYDGKTFPVYNPATGEVIANVPCMGRAETNDAIASAHDAFPSWSKLTAS 84

Query: 88  ERGIVLKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGEL 147
           ER  +L+RW   +  NK+ LA++MT E GKP  E++GEV Y  SF+++F+EEA+R YG++
Sbjct: 85  ERSKILRRWYDLIIANKEDLAQLMTLEQGKPLKEAIGEVAYGASFLEYFAEEAKRVYGDI 144

Query: 148 VPSPVKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTAL 207
           +PSP   + LL++++P+GVVG ITPWNFP+AMITRKV PALAAGCT V+KP+E TPLTAL
Sbjct: 145 IPSPFPDRRLLVLKQPVGVVGAITPWNFPLAMITRKVGPALAAGCTVVVKPSELTPLTAL 204

Query: 208 ALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQ 267
           A A LA +A IP GVLNVV     +A  +G  L  SP V  I+FTGST VGK L +  + 
Sbjct: 205 AAAELALQAGIPPGVLNVVMG---DAPEIGDALLASPKVRKITFTGSTAVGKKLMAGAAA 261

Query: 268 GIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVE 327
            +KR+SLELGGNAPFIV+ +A+L  AV GA+A+KFRN GQ CV ANR L+QE ++D+F E
Sbjct: 262 TVKRVSLELGGNAPFIVFDDADLDVAVKGALASKFRNSGQTCVCANRILVQEGIYDKFAE 321

Query: 328 MLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLG 387
                + +  + GD  + +    GPLIN A V+KV+S VQDA+SKGAKV+LGG + + LG
Sbjct: 322 AFSKAV-QKLVVGDGFE-EGVTQGPLINEAAVQKVESHVQDAVSKGAKVLLGGKRHS-LG 378

Query: 388 SKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDIS 447
             F+ PT+L ++T DML + EEVFGPVA   +FKTEEE++ +AN++  GLA Y FT D+ 
Sbjct: 379 GTFYEPTVLGDVTEDMLIFREEVFGPVAPLTRFKTEEEAIAIANDTEAGLAAYIFTRDLQ 438

Query: 448 QAWRVGKKLQVGMVGINEGIISHTEAAFGGVKESGLGREGSRHGMEEFTQIKYMCFGNL 506
           +AWRV + L+ G+VG+NEG+IS   A FGGVK+SGLGREGS++G++E+ +IKY+C GN+
Sbjct: 439 RAWRVSEALEYGIVGVNEGLISTEVAPFGGVKQSGLGREGSKYGIDEYLEIKYVCLGNM 497


>gnl|CDD|188167 TIGR01780, SSADH, succinate-semialdehyde dehydrogenase.  Succinic
           semialdehyde dehydrogenase is one of three enzymes
           constituting 4-aminobutyrate (GABA) degradation in both
           prokaryotes and eukaryotes, catalyzing the
           (NAD(P)+)-dependent catabolism reaction of succinic
           semialdehyde to succinate for metabolism by the citric
           acid cycle. The EC number depends on the cofactor:
           1.2.1.24 for NAD only, 1.2.1.79 for NADP only, and
           1.2.1.16 if both can be used. In Escherichia coli,
           succinic semialdehyde dehydrogenase is located in an
           unidirectionally transcribed gene cluster encoding
           enzymes for GABA degradation and is suggested to be
           cotranscribed with succinic semialdehyde transaminase
           from a common promoter upstream of SSADH. Similar gene
           arrangements can be found in characterized Ralstonia
           eutropha and the genome analysis of Bacillus subtilis.
           Prokaryotic succinic semialdehyde dehydrogenases
           (1.2.1.16) share high sequence homology to characterized
           succinic semialdehyde dehydrogenases from rat and human
           (1.2.1.24), exhibiting conservation of proposed cofactor
           binding residues, and putative active sites (G-237 &
           G-242, C-293 & G-259 respectively of rat SSADH).
           Eukaryotic SSADH enzymes exclusively utilize NAD+ as a
           cofactor, exhibiting little to no NADP+ activity. While
           a NADP+ preference has been detected in prokaryotes in
           addition to both NADP+- and NAD+-dependencies as in
           E.coli, Pseudomonas, and Klebsiella pneumoniae. The
           function of this alternative SSADH currently is unknown,
           but has been suggested to play a possible role in
           4-hydroxyphenylacetic degradation. Just outside the
           scope of this model, are several sequences belonging to
           clades scoring between trusted and noise. These
           sequences may be actual SSADH enzymes, but lack
           sufficiently close characterized homologs to make a
           definitive assignment at this time. SSADH enzyme belongs
           to the aldehyde dehydrogenase family (pfam00171),
           sharing a common evolutionary origin and enzymatic
           mechanism with lactaldehyde dehydrogenase. Like in
           lactaldehyde dehydrogenase and succinate semialdehyde
           dehydrogenase, the mammalian catalytic glutamic acid and
           cysteine residues are conserved in all the enzymes of
           this family (PS00687, PS00070) [Central intermediary
           metabolism, Other].
          Length = 448

 Score =  593 bits (1530), Expect = 0.0
 Identities = 243/453 (53%), Positives = 319/453 (70%), Gaps = 5/453 (1%)

Query: 47  VYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNKQA 106
           VYNPA  E +  VPD GV++ + AI +A EAF+ W  TT KER  +L++W   +  NK  
Sbjct: 1   VYNPATGEIIGSVPDQGVDETEAAIRAAYEAFKTWRATTAKERSSLLRKWYNLMMENKDD 60

Query: 107 LAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVPSPVKSKELLMVREPLGV 166
           LA ++T E+GKP  E+ GE++YA SF++WF+EEA+R YG+ +PSP   K L+++++P+GV
Sbjct: 61  LARLITLENGKPLKEAKGEILYAASFLEWFAEEAKRVYGDTIPSPQSDKRLIVIKQPVGV 120

Query: 167 VGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPKGVLNVV 226
              ITPWNFP AMITRK   ALAAGCT V+KPAE TPL+ALALA LA +A IPKGVLNV+
Sbjct: 121 CAAITPWNFPAAMITRKAGAALAAGCTVVVKPAEQTPLSALALARLAEQAGIPKGVLNVI 180

Query: 227 TSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNAPFIVYS 286
           T SR     VG +L TSP V  ISFTGST VGKIL    +  +K++S+ELGGNAPFIV+ 
Sbjct: 181 TGSRAKE--VGNVLTTSPLVRKISFTGSTNVGKILMKQSASTVKKVSMELGGNAPFIVFD 238

Query: 287 NANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQGDTSKND 346
           +A+L +AV GAMA+KFRN GQ CV ANR  + + ++D+F + L  E  K    G+     
Sbjct: 239 DADLDQAVEGAMASKFRNAGQTCVCANRLYVHDGIYDEFAKKL-AEAVKKLKVGNGLDE- 296

Query: 347 KCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSKFFAPTILTNITPDMLCY 406
               GPLIN   VEKV+  + DA+ KGAKV+  GGK  +LG  FF PT+L+N+T DML  
Sbjct: 297 GVTQGPLINEKAVEKVEKHIADAVEKGAKVVT-GGKRHELGGNFFEPTVLSNVTADMLVS 355

Query: 407 TEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQVGMVGINEG 466
            EE FGP+A   KF  EEE + +AN++  GLA YFF+ D+S+ WRV + L+ GMVGIN G
Sbjct: 356 KEETFGPLAPVFKFDDEEEVIAIANDTEVGLAAYFFSRDLSRIWRVAEALEYGMVGINTG 415

Query: 467 IISHTEAAFGGVKESGLGREGSRHGMEEFTQIK 499
           +IS+  A FGGVK+SGLGREGS++G+EE+ + K
Sbjct: 416 LISNVVAPFGGVKQSGLGREGSKYGIEEYLETK 448


>gnl|CDD|215767 pfam00171, Aldedh, Aldehyde dehydrogenase family.  This family of
           dehydrogenases act on aldehyde substrates. Members use
           NADP as a cofactor. The family includes the following
           members: The prototypical members are the aldehyde
           dehydrogenases EC:1.2.1.3. Succinate-semialdehyde
           dehydrogenase EC:1.2.1.16. Lactaldehyde dehydrogenase
           EC:1.2.1.22. Benzaldehyde dehydrogenase EC:1.2.1.28.
           Methylmalonate-semialdehyde dehydrogenase EC:1.2.1.27.
           Glyceraldehyde-3-phosphate dehydrogenase EC:1.2.1.9.
           Delta-1-pyrroline-5-carboxylate dehydrogenase EC:
           1.5.1.12. Acetaldehyde dehydrogenase EC:1.2.1.10.
           Glutamate-5-semialdehyde dehydrogenase EC:1.2.1.41. This
           family also includes omega crystallin, an eye lens
           protein from squid and octopus that has little aldehyde
           dehydrogenase activity.
          Length = 459

 Score =  562 bits (1450), Expect = 0.0
 Identities = 213/464 (45%), Positives = 290/464 (62%), Gaps = 9/464 (1%)

Query: 38  WVHS-SKKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGIVLKRW 96
           WV S S+   V NPA  E +A VP    EDV  A+ +A+ AF+ W++T P ER  +L + 
Sbjct: 1   WVDSSSETIEVINPATGEVIATVPAATAEDVDAAVAAARAAFKAWAKTPPSERAAILLKA 60

Query: 97  LKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVPSPVKSKE 156
              LE  +  LAE+ T E+GKP  E+ GEV  A   + +++  AR+  GE +PS      
Sbjct: 61  ADLLEERRDELAELETLETGKPLAEARGEVPRAIDTLRYYAGLARKLEGETLPSD-PGVL 119

Query: 157 LLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEA 216
               REPLGVVG ITPWNFP+ +   K+APALAAG T V+KP+E TPLTAL LA L  EA
Sbjct: 120 AYTRREPLGVVGAITPWNFPLLLAAWKIAPALAAGNTVVLKPSELTPLTALLLAELFEEA 179

Query: 217 QIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLEL 276
            +P GVLNVVT S    + VG  L   P V  +SFTGST+VG+ +    ++ +KR++LEL
Sbjct: 180 GLPPGVLNVVTGS---GSEVGDALVEHPDVDKVSFTGSTEVGRRIAKAAAKNLKRVTLEL 236

Query: 277 GGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVEITKS 336
           GG  P IV+ +A+L  AV GA+   F N GQ C A +R L+ E ++D+FVE L VE  KS
Sbjct: 237 GGKNPLIVFDDADLDAAVEGAVFGAFGNAGQVCTAGSRLLVHESIYDEFVERL-VEAAKS 295

Query: 337 FIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSKFFAPTIL 396
              GD   +   ++GPLI+  Q E+V S ++DA  +GAK++ GG    + G  F  PT+L
Sbjct: 296 LKVGD-PLDPDTDIGPLISKKQRERVLSYIEDAKEEGAKLLCGGEAGLEKGY-FVEPTVL 353

Query: 397 TNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRVGKKL 456
            ++TPDM    EE+FGPV   I FK E+E++ +AN++  GLA   FTND+ +A RV ++L
Sbjct: 354 ADVTPDMRIAQEEIFGPVLSVIPFKDEDEAIELANDTEYGLAAGVFTNDLERALRVARRL 413

Query: 457 QVGMVGINEGIISHTEAA-FGGVKESGLGREGSRHGMEEFTQIK 499
           + GMV IN+      EA  FGG K+SG GREG + G+EE+T+ K
Sbjct: 414 EAGMVWINDYTTGDPEALPFGGFKQSGFGREGGKEGLEEYTETK 457


>gnl|CDD|223944 COG1012, PutA, NAD-dependent aldehyde dehydrogenases [Energy
           production and conversion].
          Length = 472

 Score =  542 bits (1399), Expect = 0.0
 Identities = 218/475 (45%), Positives = 307/475 (64%), Gaps = 11/475 (2%)

Query: 30  TQAYINGEWVHSSKKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKER 89
            +  I+GEWV  +    V NPA  E +A VP    EDV  A+ +A+ AFE WS+ + +ER
Sbjct: 1   YKLLIDGEWVDGASTIEVINPATGEVIATVPAATAEDVDAAVAAARAAFEAWSRLSAEER 60

Query: 90  GIVLKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVP 149
             +L+R    LE   + LA ++T E+GKP +E+ GE+  A  F+ +++EEARR  GE +P
Sbjct: 61  AAILRRIADLLEARAEELAALITLETGKPISEARGEIARAADFIRYYAEEARRLEGETIP 120

Query: 150 SPVKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALAL 209
           +  K  + L+ REPLGVVG ITPWNFP+A+   K+APALAAG T V+KP+E TPL+ALAL
Sbjct: 121 TD-KGSKALVRREPLGVVGAITPWNFPLALAAWKLAPALAAGNTVVLKPSEQTPLSALAL 179

Query: 210 AALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGI 269
           A LA EA +P GVLNVVT        VG  L   P V  ISFTGST VG+ + +  +  +
Sbjct: 180 AELAAEAGLPAGVLNVVTGGGA---EVGDALVAHPDVDAISFTGSTAVGRAIAAAAAANL 236

Query: 270 KRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEML 329
           K ++LELGG +P IV  +A+L  AV  A+   F N GQ C AA+R ++ E V+D+FVE L
Sbjct: 237 KPVTLELGGKSPAIVLEDADLDAAVDAAVFGAFFNAGQRCTAASRLIVHESVYDEFVERL 296

Query: 330 KVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSK 389
            V    S   GD   +   +LGPLI+  Q+++V+  ++DA+++GA+++ GG    + G  
Sbjct: 297 -VARAASLKVGD-PLDPSTDLGPLISEEQLDRVEGYIEDAVAEGARLLAGGK---RPGGY 351

Query: 390 FFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQA 449
           F  PTIL  +TPDM    EE+FGPV   I+FK EEE++ +AN++  GLA   FT D+++A
Sbjct: 352 FVEPTILEGVTPDMRIAREEIFGPVLPVIRFKDEEEAIELANDTEYGLAAAIFTRDLARA 411

Query: 450 WRVGKKLQVGMVGINEGIISHTE--AAFGGVKESGLGREGSRHGMEEFTQIKYMC 502
           +RV ++L+ GMVGIN+           FGGVK+SGLGREG ++G+EEFT++K + 
Sbjct: 412 FRVARRLEAGMVGINDYTGGADIAYLPFGGVKQSGLGREGGKYGLEEFTEVKTVT 466


>gnl|CDD|183050 PRK11241, gabD, succinate-semialdehyde dehydrogenase I;
           Provisional.
          Length = 482

 Score =  513 bits (1323), Expect = e-180
 Identities = 235/476 (49%), Positives = 321/476 (67%), Gaps = 8/476 (1%)

Query: 31  QAYINGEWV--HSSKKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKE 88
           QA INGEW+  ++ +   V NPAN + L  VP MG ++ + AI +A  A   W   T KE
Sbjct: 12  QALINGEWLDANNGEVIDVTNPANGDKLGSVPKMGADETRAAIDAANRALPAWRALTAKE 71

Query: 89  RGIVLKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELV 148
           R  +L+RW   +  ++  LA +MT E GKP  E+ GE+ YA SF++WF+EE +R YG+ +
Sbjct: 72  RANILRRWFNLMMEHQDDLARLMTLEQGKPLAEAKGEISYAASFIEWFAEEGKRIYGDTI 131

Query: 149 PSPVKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALA 208
           P     K L+++++P+GV   ITPWNFP AMITRK  PALAAGCT V+KPA  TP +ALA
Sbjct: 132 PGHQADKRLIVIKQPIGVTAAITPWNFPAAMITRKAGPALAAGCTMVLKPASQTPFSALA 191

Query: 209 LAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQG 268
           LA LA  A IP GV NVVT S   A +VG  L ++P V  +SFTGST++G+ L   C++ 
Sbjct: 192 LAELAIRAGIPAGVFNVVTGS---AGAVGGELTSNPLVRKLSFTGSTEIGRQLMEQCAKD 248

Query: 269 IKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEM 328
           IK++SLELGGNAPFIV+ +A+L KAV GA+A+KFRN GQ CV ANR  +Q+ V+D+F E 
Sbjct: 249 IKKVSLELGGNAPFIVFDDADLDKAVEGALASKFRNAGQTCVCANRLYVQDGVYDRFAEK 308

Query: 329 LKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGS 388
           L+  ++K  I GD        +GPLI+   V KV+  + DA+ KGA+V+  GGK  +LG 
Sbjct: 309 LQQAVSKLHI-GD-GLEKGVTIGPLIDEKAVAKVEEHIADALEKGARVVC-GGKAHELGG 365

Query: 389 KFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQ 448
            FF PTIL ++  +     EE FGP+A   +FK E + +  AN++  GLA YF+  D+S+
Sbjct: 366 NFFQPTILVDVPANAKVAKEETFGPLAPLFRFKDEADVIAQANDTEFGLAAYFYARDLSR 425

Query: 449 AWRVGKKLQVGMVGINEGIISHTEAAFGGVKESGLGREGSRHGMEEFTQIKYMCFG 504
            +RVG+ L+ G+VGIN GIIS+  A FGG+K SGLGREGS++G+E++ +IKYMC G
Sbjct: 426 VFRVGEALEYGIVGINTGIISNEVAPFGGIKASGLGREGSKYGIEDYLEIKYMCIG 481


>gnl|CDD|143397 cd07078, ALDH, NAD(P)+ dependent aldehyde dehydrogenase family.
           The aldehyde dehydrogenase family (ALDH) of NAD(P)+
           dependent enzymes, in general, oxidize a wide range of
           endogenous and exogenous aliphatic and aromatic
           aldehydes to their corresponding carboxylic acids and
           play an  important role in detoxification. Besides
           aldehyde detoxification, many ALDH isozymes possess
           multiple additional catalytic and non-catalytic
           functions such as participating in  metabolic pathways,
           or as  binding proteins, or as osmoregulants, to mention
           a few. The enzyme has three domains, a NAD(P)+
           cofactor-binding domain, a catalytic domain, and a
           bridging domain; and the active enzyme  is generally
           either homodimeric or homotetrameric. The catalytic
           mechanism is proposed to involve cofactor binding,
           resulting in a conformational change and activation of
           an invariant catalytic cysteine nucleophile. The
           cysteine and aldehyde substrate form an oxyanion
           thiohemiacetal intermediate resulting in hydride
           transfer to the cofactor and formation of a
           thioacylenzyme intermediate. Hydrolysis of the
           thioacylenzyme and release of the carboxylic acid
           product occurs, and in most cases, the reduced cofactor
           dissociates from the enzyme. The evolutionary
           phylogenetic tree of ALDHs appears to have an initial
           bifurcation between what has been characterized as the
           classical aldehyde dehydrogenases, the ALDH family
           (ALDH) and extended family members or aldehyde
           dehydrogenase-like (ALDH-like) proteins. The ALDH
           proteins are represented by enzymes which share a number
           of highly conserved residues necessary for catalysis and
           cofactor binding and they include such proteins as
           retinal dehydrogenase, 10-formyltetrahydrofolate
           dehydrogenase, non-phosphorylating glyceraldehyde
           3-phosphate dehydrogenase,
           delta(1)-pyrroline-5-carboxylate dehydrogenases,
           alpha-ketoglutaric semialdehyde dehydrogenase,
           alpha-aminoadipic semialdehyde dehydrogenase, coniferyl
           aldehyde dehydrogenase and succinate-semialdehyde
           dehydrogenase.  Included in this larger group are all
           human, Arabidopsis, Tortula, fungal, protozoan, and
           Drosophila ALDHs identified in families ALDH1 through
           ALDH22 with the exception of families ALDH18, ALDH19,
           and ALDH20 which are present in the ALDH-like group.
          Length = 432

 Score =  503 bits (1297), Expect = e-176
 Identities = 196/433 (45%), Positives = 281/433 (64%), Gaps = 6/433 (1%)

Query: 68  QYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNKQALAEIMTQESGKPPTESLGEVV 127
             A+ +A+ AF+ W+   P ER  +L++    LE  ++ LA + T E+GKP  E+LGEV 
Sbjct: 1   DAAVAAARAAFKAWAALPPAERAAILRKLADLLEERREELAALETLETGKPIEEALGEVA 60

Query: 128 YANSFVDWFSEEARRTYGELVPSPVKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPA 187
            A     +++  ARR +GE++PSP   +  ++ REPLGVVG ITPWNFP+ +   K+APA
Sbjct: 61  RAADTFRYYAGLARRLHGEVIPSPDPGELAIVRREPLGVVGAITPWNFPLLLAAWKLAPA 120

Query: 188 LAAGCTCVIKPAEDTPLTALALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVA 247
           LAAG T V+KP+E TPLTAL LA L  EA +P GVLNVVT      + VG  L + P V 
Sbjct: 121 LAAGNTVVLKPSELTPLTALLLAELLAEAGLPPGVLNVVTGD---GDEVGAALASHPRVD 177

Query: 248 GISFTGSTQVGKILYSHCSQGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQ 307
            ISFTGST VGK +    ++ +KR++LELGG +P IV+ +A+L  AV GA+   F N GQ
Sbjct: 178 KISFTGSTAVGKAIMRAAAENLKRVTLELGGKSPLIVFDDADLDAAVKGAVFGAFGNAGQ 237

Query: 308 ACVAANRFLIQEQVFDQFVEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQ 367
            C AA+R L+ E ++D+FVE L VE  K+   G+       ++GPLI+ AQ+++V + ++
Sbjct: 238 VCTAASRLLVHESIYDEFVERL-VERVKALKVGNPLD-PDTDMGPLISAAQLDRVLAYIE 295

Query: 368 DAISKGAKVILGGGKVTKLGSKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESL 427
           DA ++GAK++ GG ++      F  PT+LT++ PDM    EE+FGPV   I FK EEE++
Sbjct: 296 DAKAEGAKLLCGGKRLEGGKGYFVPPTVLTDVDPDMPIAQEEIFGPVLPVIPFKDEEEAI 355

Query: 428 CVANNSNTGLAGYFFTNDISQAWRVGKKLQVGMVGINEGIIS-HTEAAFGGVKESGLGRE 486
            +AN++  GLA   FT D+ +A RV ++L+ G V IN+  +     A FGGVK+SG+GRE
Sbjct: 356 ELANDTEYGLAAGVFTRDLERALRVAERLEAGTVWINDYSVGAEPSAPFGGVKQSGIGRE 415

Query: 487 GSRHGMEEFTQIK 499
           G  +G+EE+T+ K
Sbjct: 416 GGPYGLEEYTEPK 428


>gnl|CDD|143432 cd07114, ALDH_DhaS, Uncharacterized Candidatus pelagibacter
           aldehyde dehydrogenase, DhaS-like.  Uncharacterized
           aldehyde dehydrogenase from Candidatus pelagibacter
           (DhaS) and other related sequences are present in this
           CD.
          Length = 457

 Score =  454 bits (1171), Expect = e-157
 Identities = 182/458 (39%), Positives = 265/458 (57%), Gaps = 10/458 (2%)

Query: 47  VYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQ--WSQTTPKERGIVLKRWLKELEGNK 104
             NPA  E  A VP+    DV  A+ +A+ AFE   W + TP ERG +L+R    +E N 
Sbjct: 1   SINPATGEPWARVPEASAADVDRAVAAARAAFEGGAWRKLTPTERGKLLRRLADLIEANA 60

Query: 105 QALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVPSPVKSKELLMVREPL 164
           + LAE+ T+++GK   E+  +V Y   +  +++  A +  G ++P           REPL
Sbjct: 61  EELAELETRDNGKLIRETRAQVRYLAEWYRYYAGLADKIEGAVIPVDKGDYLNFTRREPL 120

Query: 165 GVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPKGVLN 224
           GVV  ITPWN P+ ++ +K+APALAAG T V+KP+E TP + L LA LA EA  P GV+N
Sbjct: 121 GVVAAITPWNSPLLLLAKKLAPALAAGNTVVLKPSEHTPASTLELAKLAEEAGFPPGVVN 180

Query: 225 VVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNAPFIV 284
           VVT         G+ L   P VA I+FTG T+ G+ +    ++ +  ++LELGG +P IV
Sbjct: 181 VVTG---FGPETGEALVEHPLVAKIAFTGGTETGRHIARAAAENLAPVTLELGGKSPNIV 237

Query: 285 YSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQGDTSK 344
           + +A+L  AV+G +A  F   GQ CVA +R L+Q  ++D+FVE L V   ++   GD   
Sbjct: 238 FDDADLDAAVNGVVAGIFAAAGQTCVAGSRLLVQRSIYDEFVERL-VARARAIRVGD-PL 295

Query: 345 NDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSK---FFAPTILTNITP 401
           + +  +GPL    Q+EKV+  V  A  +GA+V+ GG + +        FF PTIL ++T 
Sbjct: 296 DPETQMGPLATERQLEKVERYVARAREEGARVLTGGERPSGADLGAGYFFEPTILADVTN 355

Query: 402 DMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQVGMV 461
           DM    EEVFGPV   I F  EEE++ +AN+S  GLA   +T D+++A RV + ++ G V
Sbjct: 356 DMRIAQEEVFGPVLSVIPFDDEEEAIALANDSEYGLAAGIWTRDLARAHRVARAIEAGTV 415

Query: 462 GINEGIISHTEAAFGGVKESGLGREGSRHGMEEFTQIK 499
            +N        + FGG K+SG+GRE     + E+TQ K
Sbjct: 416 WVNTYRALSPSSPFGGFKDSGIGRENGIEAIREYTQTK 453


>gnl|CDD|143407 cd07088, ALDH_LactADH-AldA, Escherichia coli lactaldehyde
           dehydrogenase AldA-like.  Lactaldehyde dehydrogenase
           from Escherichia coli (AldA, LactADH, EC=1.2.1.22), an
           NAD(+)-dependent enzyme involved in the metabolism of
           L-fucose and L-rhamnose, and other similar sequences are
           present in this CD.
          Length = 468

 Score =  452 bits (1166), Expect = e-156
 Identities = 184/472 (38%), Positives = 274/472 (58%), Gaps = 13/472 (2%)

Query: 33  YINGEWVHSS--KKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERG 90
           YINGE+V SS  +   V NPA  E +A VP    ED   A+ +A+ A + W +    ER 
Sbjct: 1   YINGEFVPSSSGETIDVLNPATGEVVATVPAATAEDADRAVDAAEAAQKAWERLPAIERA 60

Query: 91  IVLKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVPS 150
             L++    +  N   LA+++ +E GK  + +  EV +   ++D+ +E ARR  GE++PS
Sbjct: 61  AYLRKLADLIRENADELAKLIVEEQGKTLSLARVEVEFTADYIDYMAEWARRIEGEIIPS 120

Query: 151 PVKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALA 210
              ++ + + + P+GVV  I PWNFP  +I RK+APAL  G T VIKP+E+TPL AL  A
Sbjct: 121 DRPNENIFIFKVPIGVVAGILPWNFPFFLIARKLAPALVTGNTIVIKPSEETPLNALEFA 180

Query: 211 ALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIK 270
            L  EA +P GVLN+VT      + VG  L   P V  IS TGST+ G+ +    ++ I 
Sbjct: 181 ELVDEAGLPAGVLNIVTGR---GSVVGDALVAHPKVGMISLTGSTEAGQKIMEAAAENIT 237

Query: 271 RLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLK 330
           ++SLELGG AP IV  +A+L  AV   + ++  NCGQ C  A R  + E ++D+F+E L 
Sbjct: 238 KVSLELGGKAPAIVMKDADLDLAVKAIVDSRIINCGQVCTCAERVYVHEDIYDEFMEKL- 296

Query: 331 VEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSKF 390
           VE  K+   GD   +   ++GPL+N A ++KV+ +V+ A+  GA ++ GG +       F
Sbjct: 297 VEKMKAVKVGD-PFDAATDMGPLVNEAALDKVEEMVERAVEAGATLLTGGKRPEGEKGYF 355

Query: 391 FAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAW 450
           + PT+LTN+  DM    EE+FGPV   +KF + +E++ +AN+S  GL  Y +T +++ A 
Sbjct: 356 YEPTVLTNVRQDMEIVQEEIFGPVLPVVKFSSLDEAIELANDSEYGLTSYIYTENLNTAM 415

Query: 451 RVGKKLQVGMVGINEGIISHTEAAFG---GVKESGLGREGSRHGMEEFTQIK 499
           R   +L+ G   IN     + EA  G   G K+SGLG    +HG+EE+ Q K
Sbjct: 416 RATNELEFGETYINRE---NFEAMQGFHAGWKKSGLGGADGKHGLEEYLQTK 464


>gnl|CDD|143410 cd07091, ALDH_F1-2_Ald2-like, ALDH subfamily: ALDH families 1and 2,
           including 10-formyltetrahydrofolate dehydrogenase,
           NAD+-dependent retinal dehydrogenase 1 and related
           proteins.  ALDH subfamily which includes the
           NAD+-dependent retinal dehydrogenase 1 (RALDH 1, ALDH1,
           EC=1.2.1.36), also known as aldehyde dehydrogenase
           family 1 member A1 (ALDH1A1), in humans, a
           homotetrameric, cytosolic enzyme that catalyzes the
           oxidation of retinaldehyde to retinoic acid. Human
           ALDH1B1 and ALDH2 are also in this cluster; both are
           mitochrondrial homotetramers which play important roles
           in acetaldehyde oxidation; ALDH1B1 in response to UV
           light exposure and ALDH2 during ethanol metabolism.
           10-formyltetrahydrofolate dehydrogenase (FTHFDH,
           EC=1.5.1.6), also known as aldehyde dehydrogenase family
           1 member L1 (ALDH1L1), in humans, a multi-domain
           homotetramer with an N-terminal formyl transferase
           domain and a C-terminal ALDH domain. FTHFDH catalyzes an
           NADP+-dependent dehydrogenase reaction resulting in the
           conversion of 10-formyltetrahydrofolate to
           tetrahydrofolate and CO2. Also included in this
           subfamily is the Arabidosis aldehyde dehydrogenase
           family 2 members B4 and B7 (EC=1.2.1.3), which are
           mitochondrial, homotetramers that oxidize acetaldehyde
           and glycolaldehyde, as well as, the Arabidosis
           cytosolic, homotetramer ALDH2C4 (EC=1.2.1.3), an enzyme
           involved in the oxidation of sinapalehyde and
           coniferaldehyde. Also included is the AldA aldehyde
           dehydrogenase  of Aspergillus nidulans (locus AN0554),
           the aldehyde dehydrogenase 2 (YMR170c, ALD5, EC=1.2.1.5)
           of Saccharomyces cerevisiae, and other similar
           sequences.
          Length = 476

 Score =  440 bits (1134), Expect = e-151
 Identities = 186/479 (38%), Positives = 275/479 (57%), Gaps = 19/479 (3%)

Query: 30  TQAYINGEWVHS--SKKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWS--QTT 85
           T  +IN E+V S   K FP  NPA +E +  V +   EDV  A+ +A+ AFE     +  
Sbjct: 4   TGLFINNEFVDSVSGKTFPTINPATEEVICQVAEADEEDVDAAVKAARAAFETGWWRKMD 63

Query: 86  PKERGIVLKRWLKELEGNKQALAEIMTQESGKPPTESL-GEVVYANSFVDWFSEEARRTY 144
           P+ERG +L +    +E ++  LA + + ++GKP  ES  G+V  +   + +++  A +  
Sbjct: 64  PRERGRLLNKLADLIERDRDELAALESLDNGKPLEESAKGDVALSIKCLRYYAGWADKIQ 123

Query: 145 GELVPSPVKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPL 204
           G+ +P    +      REP+GV G I PWNFP+ M+  K+APALAAG T V+KPAE TPL
Sbjct: 124 GKTIPID-GNFLAYTRREPIGVCGQIIPWNFPLLMLAWKLAPALAAGNTVVLKPAEQTPL 182

Query: 205 TALALAALAHEAQIPKGVLNVVTSS-RTNANSVGKLLCTSPHVAGISFTGSTQVGKILYS 263
           +AL LA L  EA  P GV+N+V     T     G  + +   V  I+FTGST VG+ +  
Sbjct: 183 SALYLAELIKEAGFPPGVVNIVPGFGPT----AGAAISSHMDVDKIAFTGSTAVGRTIME 238

Query: 264 HCSQ-GIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVF 322
             ++  +K+++LELGG +P IV+ +A+L KAV  A    F N GQ C A +R  +QE ++
Sbjct: 239 AAAKSNLKKVTLELGGKSPNIVFDDADLDKAVEWAAFGIFFNQGQCCCAGSRIFVQESIY 298

Query: 323 DQFVEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGK 382
           D+FVE  K    +  + GD   +     GP ++ AQ +K+ S ++    +GA ++ GG +
Sbjct: 299 DEFVEKFK-ARAEKRVVGD-PFDPDTFQGPQVSKAQFDKILSYIESGKKEGATLLTGGER 356

Query: 383 VTKLGSK--FFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGY 440
               GSK  F  PT+ T++  DM    EE+FGPV   +KFKTE+E +  AN++  GLA  
Sbjct: 357 H---GSKGYFIQPTVFTDVKDDMKIAKEEIFGPVVTILKFKTEDEVIERANDTEYGLAAG 413

Query: 441 FFTNDISQAWRVGKKLQVGMVGINEGIISHTEAAFGGVKESGLGREGSRHGMEEFTQIK 499
            FT DI++A RV + L+ G V +N   +      FGG K+SG GRE    G+EE+TQ+K
Sbjct: 414 VFTKDINKALRVSRALKAGTVWVNTYNVFDAAVPFGGFKQSGFGRELGEEGLEEYTQVK 472


>gnl|CDD|143456 cd07138, ALDH_CddD_SSP0762, Rhodococcus ruber 6-oxolauric acid
           dehydrogenase-like.  The 6-oxolauric acid dehydrogenase
           (CddD) from Rhodococcus ruber SC1 which converts
           6-oxolauric acid to dodecanedioic acid, and the aldehyde
           dehydrogenase (locus SSP0762) from Staphylococcus
           saprophyticus subsp. saprophyticus ATCC 15305 and other
           similar sequences, are included in this CD.
          Length = 466

 Score =  431 bits (1111), Expect = e-148
 Identities = 185/479 (38%), Positives = 270/479 (56%), Gaps = 30/479 (6%)

Query: 33  YINGEWV--HSSKKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERG 90
           YI+G WV    ++   V NPA +E +  VP     DV  A+ +A+ AF  WS T+ +ER 
Sbjct: 2   YIDGAWVAPAGTETIDVINPATEEVIGTVPLGTAADVDRAVAAARRAFPAWSATSVEERA 61

Query: 91  IVLKRWLKELEGNKQALAEIMTQESGKPPTES--------LGEVVYANSFVDWFSEEARR 142
            +L+R  +  E     LA+ +T E G P T +        +G +  A   +  F  E RR
Sbjct: 62  ALLERIAEAYEARADELAQAITLEMGAPITLARAAQVGLGIGHLRAAADALKDFEFEERR 121

Query: 143 TYGELVPSPVKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDT 202
               LV           VREP+GV GLITPWN+P+  I  KVAPALAAGCT V+KP+E  
Sbjct: 122 G-NSLV-----------VREPIGVCGLITPWNWPLNQIVLKVAPALAAGCTVVLKPSEVA 169

Query: 203 PLTALALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILY 262
           PL+A+ LA +  EA +P GV N+V         VG+ L   P V  +SFTGST+ GK + 
Sbjct: 170 PLSAIILAEILDEAGLPAGVFNLVNGD---GPVVGEALSAHPDVDMVSFTGSTRAGKRVA 226

Query: 263 SHCSQGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVF 322
              +  +KR++LELGG +  I+  +A+L+KAV   +AA F N GQ+C A  R L+    +
Sbjct: 227 EAAADTVKRVALELGGKSANIILDDADLEKAVPRGVAACFANSGQSCNAPTRMLVPRSRY 286

Query: 323 DQFVEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGG-G 381
            +  E+      ++++ GD        LGPL + AQ ++V   +Q  I +GA+++ GG G
Sbjct: 287 AEAEEIAA-AAAEAYVVGDPRD-PATTLGPLASAAQFDRVQGYIQKGIEEGARLVAGGPG 344

Query: 382 KVTKLGSKFFA-PTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGY 440
           +   L   +F  PT+  ++TPDM    EE+FGPV   I +  E+E++ +AN++  GLAGY
Sbjct: 345 RPEGLERGYFVKPTVFADVTPDMTIAREEIFGPVLSIIPYDDEDEAIAIANDTPYGLAGY 404

Query: 441 FFTNDISQAWRVGKKLQVGMVGINEGIISHTEAAFGGVKESGLGREGSRHGMEEFTQIK 499
            ++ D  +A  V ++L+ G V IN G   +  A FGG K+SG GRE  R+G+EEF ++K
Sbjct: 405 VWSADPERARAVARRLRAGQVHIN-GAAFNPGAPFGGYKQSGNGREWGRYGLEEFLEVK 462


>gnl|CDD|143422 cd07104, ALDH_BenzADH-like, ALDH subfamily: NAD(P)+-dependent
           benzaldehyde dehydrogenase II, vanillin dehydrogenase,
           p-hydroxybenzaldehyde dehydrogenase and related
           proteins.  ALDH subfamily which includes the
           NAD(P)+-dependent, benzaldehyde dehydrogenase II (XylC,
           BenzADH, EC=1.2.1.28)  involved in the oxidation of
           benzyl alcohol to benzoate; p-hydroxybenzaldehyde
           dehydrogenase (PchA, HBenzADH) which catalyzes the
           oxidation of p-hydroxybenzaldehyde to p-hydroxybenzoic
           acid; vanillin dehydrogenase (Vdh, VaniDH) involved in
           the metabolism of ferulic acid as seen in Pseudomonas
           putida KT2440; and other related sequences.
          Length = 431

 Score =  426 bits (1097), Expect = e-146
 Identities = 181/438 (41%), Positives = 266/438 (60%), Gaps = 11/438 (2%)

Query: 66  DVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNKQALAEIMTQESGKPPTESLGE 125
           DV  A  +A  A + W+ T P+ER  +L++  + LE  +  +A+ + +ESG    ++  E
Sbjct: 1   DVDRAYAAAAAAQKAWAATPPQERAAILRKAAEILEERRDEIADWLIRESGSTRPKAAFE 60

Query: 126 VVYANSFVDWFSEEARRTYGELVPSPVKSKELLMVREPLGVVGLITPWNFPMAMITRKVA 185
           V  A + +   +   RR  GE++PS V  KE ++ R PLGVVG+I+P+NFP+ +  R VA
Sbjct: 61  VGAAIAILREAAGLPRRPEGEILPSDVPGKESMVRRVPLGVVGVISPFNFPLILAMRSVA 120

Query: 186 PALAAGCTCVIKPAEDTPLT-ALALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSP 244
           PALA G   V+KP   TP+T  L +A +  EA +PKGVLNVV       + +G  L   P
Sbjct: 121 PALALGNAVVLKPDSRTPVTGGLLIAEIFEEAGLPKGVLNVVP---GGGSEIGDALVEHP 177

Query: 245 HVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRN 304
            V  ISFTGST VG+ +     + +K+++LELGGN P IV  +A+L  AVS A    F +
Sbjct: 178 RVRMISFTGSTAVGRHIGELAGRHLKKVALELGGNNPLIVLDDADLDLAVSAAAFGAFLH 237

Query: 305 CGQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDS 364
            GQ C+AA R L+ E V+D+FVE L V   K+   GD    D   +GPLIN  QV++V +
Sbjct: 238 QGQICMAAGRILVHESVYDEFVEKL-VAKAKALPVGDPRDPD-TVIGPLINERQVDRVHA 295

Query: 365 IVQDAISKGAKVILGGGKVTKLGSKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEE 424
           IV+DA++ GA++ L GG    L   F+ PT+L+++TPDM  + EE+FGPVA  I F  +E
Sbjct: 296 IVEDAVAAGARL-LTGGTYEGL---FYQPTVLSDVTPDMPIFREEIFGPVAPVIPFDDDE 351

Query: 425 ESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQVGMVGINEGIISH-TEAAFGGVKESGL 483
           E++ +AN++  GL+   FT D+ +A    ++L+ GMV IN+  ++      FGGVK SG 
Sbjct: 352 EAVELANDTEYGLSAAVFTRDLERAMAFAERLETGMVHINDQTVNDEPHVPFGGVKASGG 411

Query: 484 GREGSRHGMEEFTQIKYM 501
           GR G    +EEFT+ +++
Sbjct: 412 GRFGGPASLEEFTEWQWI 429


>gnl|CDD|143412 cd07093, ALDH_F8_HMSADH, Human aldehyde dehydrogenase family 8
           member A1-like.  In humans, the  aldehyde dehydrogenase
           family 8 member A1 (ALDH8A1) protein functions to
           convert 9-cis-retinal to 9-cis-retinoic acid and has a
           preference for NAD+. Also included in this CD is the
           2-hydroxymuconic semialdehyde dehydrogenase (HMSADH)
           which catalyzes the conversion of 2-hydroxymuconic
           semialdehyde to 4-oxalocrotonate, a step in the meta
           cleavage pathway of aromatic hydrocarbons in bacteria.
           Such HMSADHs seen here are: XylG of the TOL plasmid pWW0
           of Pseudomonas putida, TomC  of Burkholderia cepacia G4,
           and AphC of Comamonas testosterone.
          Length = 455

 Score =  422 bits (1088), Expect = e-144
 Identities = 181/461 (39%), Positives = 272/461 (59%), Gaps = 12/461 (2%)

Query: 47  VYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNKQA 106
            +NPA  E LA VP+ G  +V  A+ +AKEAF  WS+ +P ER  +L +    +E     
Sbjct: 1   NFNPATGEVLAKVPEGGAAEVDAAVAAAKEAFPGWSRMSPAERARILHKVADLIEARADE 60

Query: 107 LAEIMTQESGKPPTESLGEVV--YANSFVDWFSEEARRTYGELVPSPVKSKELLMVREPL 164
           LA + + ++GKP T +    +   A +F  +F++   +  GE  P    +    ++R+P+
Sbjct: 61  LALLESLDTGKPITLARTRDIPRAAANF-RFFADYILQLDGESYPQDGGALNY-VLRQPV 118

Query: 165 GVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPKGVLN 224
           GV GLITPWN P+ ++T K+APALA G T V+KP+E TPLTA  LA LA+EA +P GV+N
Sbjct: 119 GVAGLITPWNLPLMLLTWKIAPALAFGNTVVLKPSEWTPLTAWLLAELANEAGLPPGVVN 178

Query: 225 VVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNAPFIV 284
           VV      A   G  L   P V  ISFTG T  G+ +    +  +K +SLELGG  P IV
Sbjct: 179 VVHGFGPEA---GAALVAHPDVDLISFTGETATGRTIMRAAAPNLKPVSLELGGKNPNIV 235

Query: 285 YSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQGDTSK 344
           +++A+L +AV  A+ + F N G+ C+A +R L+Q  ++D+F+E   VE  K+   GD   
Sbjct: 236 FADADLDRAVDAAVRSSFSNNGEVCLAGSRILVQRSIYDEFLERF-VERAKALKVGDPLD 294

Query: 345 NDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSK---FFAPTILTNITP 401
            D   +GPLI+   +EKV   V+ A ++GA ++ GGG+      +   F  PT++T +  
Sbjct: 295 PDT-EVGPLISKEHLEKVLGYVELARAEGATILTGGGRPELPDLEGGYFVEPTVITGLDN 353

Query: 402 DMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQVGMV 461
           D     EE+FGPV   I F  EEE++ +AN++  GLA Y +T D+ +A RV ++L+ G V
Sbjct: 354 DSRVAQEEIFGPVVTVIPFDDEEEAIELANDTPYGLAAYVWTRDLGRAHRVARRLEAGTV 413

Query: 462 GINEGIISHTEAAFGGVKESGLGREGSRHGMEEFTQIKYMC 502
            +N  ++      FGGVK SG+GREG  + +E +T++K +C
Sbjct: 414 WVNCWLVRDLRTPFGGVKASGIGREGGDYSLEFYTELKNVC 454


>gnl|CDD|143408 cd07089, ALDH_CddD-AldA-like, Rhodococcus ruber 6-oxolauric acid
           dehydrogenase-like and related proteins.  The
           6-oxolauric acid dehydrogenase (CddD) from Rhodococcus
           ruber SC1 which converts 6-oxolauric acid to
           dodecanedioic acid; and the aldehyde dehydrogenase
           (locus SSP0762) from Staphylococcus saprophyticus subsp.
           saprophyticus ATCC 15305 and also, the Mycobacterium
           tuberculosis H37Rv ALDH AldA (locus Rv0768) sequence;
           and other similar sequences, are included in this CD.
          Length = 459

 Score =  418 bits (1076), Expect = e-143
 Identities = 174/467 (37%), Positives = 258/467 (55%), Gaps = 26/467 (5%)

Query: 47  VYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWS-QTTPKERGIVLKRWLKELEGNKQ 105
           V NPA +E +   PD G  DV  AI +A+ AF+     T  +ER   L++  + LE  K+
Sbjct: 1   VINPATEEVIGTAPDAGAADVDAAIAAARRAFDTGDWSTDAEERARCLRQLHEALEARKE 60

Query: 106 ALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVPS-------PVKSKEL- 157
            L  ++  E G P       V+ A +          R + +L  S       PV +    
Sbjct: 61  ELRALLVAEVGAP-------VMTARAMQVDGPIGHLRYFADLADSFPWEFDLPVPALRGG 113

Query: 158 ----LMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALA 213
               ++ REP+GVV  ITPWNFP  +   K+APALAAG T V+KPA DTPL+AL L  + 
Sbjct: 114 PGRRVVRREPVGVVAAITPWNFPFFLNLAKLAPALAAGNTVVLKPAPDTPLSALLLGEII 173

Query: 214 HEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLS 273
            E  +P GV+NVVT S    N+VG+ L T P V  +SFTGST VG+ + +  +  +KR+ 
Sbjct: 174 AETDLPAGVVNVVTGS---DNAVGEALTTDPRVDMVSFTGSTAVGRRIMAQAAATLKRVL 230

Query: 274 LELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVEI 333
           LELGG +  IV  +A+L  A   A+     N GQ C    R L+    +D+ VE L    
Sbjct: 231 LELGGKSANIVLDDADLAAAAPAAVGVCMHNAGQGCALTTRLLVPRSRYDEVVEALA-AA 289

Query: 334 TKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSKFF-A 392
            ++   GD +      +GPLI+ AQ ++V+  +     +GA+++ GGG+   L   F+  
Sbjct: 290 FEALPVGDPADPGT-VMGPLISAAQRDRVEGYIARGRDEGARLVTGGGRPAGLDKGFYVE 348

Query: 393 PTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRV 452
           PT+  ++  DM    EE+FGPV V I +  ++E++ +AN+S+ GL+G  ++ D+ +A+RV
Sbjct: 349 PTLFADVDNDMRIAQEEIFGPVLVVIPYDDDDEAVRIANDSDYGLSGGVWSADVDRAYRV 408

Query: 453 GKKLQVGMVGINEGIISHTEAAFGGVKESGLGREGSRHGMEEFTQIK 499
            ++++ G VGIN G     +A FGG K+SGLGRE    G+EEF + K
Sbjct: 409 ARRIRTGSVGINGGGGYGPDAPFGGYKQSGLGRENGIEGLEEFLETK 455


>gnl|CDD|143437 cd07119, ALDH_BADH-GbsA, Bacillus subtilis NAD+-dependent betaine
           aldehyde dehydrogenase-like.  Included in this CD is the
           NAD+-dependent, betaine aldehyde dehydrogenase (BADH,
           GbsA, EC=1.2.1.8) of Bacillus subtilis involved in the
           synthesis of the osmoprotectant glycine betaine from
           choline or glycine betaine aldehyde.
          Length = 482

 Score =  415 bits (1069), Expect = e-141
 Identities = 187/476 (39%), Positives = 273/476 (57%), Gaps = 17/476 (3%)

Query: 33  YINGEWV--HSSKKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQ--WSQTTPKE 88
           YI+GEWV   S K   + NPAN E +A VP+   ED + AI +A+ AF+   W     +E
Sbjct: 1   YIDGEWVEAASGKTRDIINPANGEVIATVPEGTAEDAKRAIAAARRAFDSGEWPHLPAQE 60

Query: 89  RGIVLKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELV 148
           R  +L R   ++  + + LA + T  +GK   ES  ++    +   +++  A +  GE+ 
Sbjct: 61  RAALLFRIADKIREDAEELARLETLNTGKTLRESEIDIDDVANCFRYYAGLATKETGEVY 120

Query: 149 PSP--VKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTA 206
             P  V S+    VREP+GV GLITPWN+P+     K+APALAAG T VIKP+E TPLT 
Sbjct: 121 DVPPHVISR---TVREPVGVCGLITPWNYPLLQAAWKLAPALAAGNTVVIKPSEVTPLTT 177

Query: 207 LALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCS 266
           +AL  L  EA +P GV+N+VT S     +VG  L  SP V  +SFTG T  G+ +    +
Sbjct: 178 IALFELIEEAGLPAGVVNLVTGS---GATVGAELAESPDVDLVSFTGGTATGRSIMRAAA 234

Query: 267 QGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFV 326
             +K+++LELGG  P IV+++A+ + AV  A+   F N GQ C A +R L++E + D+FV
Sbjct: 235 GNVKKVALELGGKNPNIVFADADFETAVDQALNGVFFNAGQVCSAGSRLLVEESIHDKFV 294

Query: 327 EMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKL 386
             L  E  K    G+    D   +GPL++    EKV S +Q    +GA+++ GG + T  
Sbjct: 295 AAL-AERAKKIKLGNGLDADT-EMGPLVSAEHREKVLSYIQLGKEEGARLVCGGKRPTGD 352

Query: 387 GSK---FFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFT 443
                 F  PTI  ++   M    EE+FGPV    +F TEEE++ +AN++  GLAG  +T
Sbjct: 353 ELAKGYFVEPTIFDDVDRTMRIVQEEIFGPVLTVERFDTEEEAIRLANDTPYGLAGAVWT 412

Query: 444 NDISQAWRVGKKLQVGMVGINEGIISHTEAAFGGVKESGLGREGSRHGMEEFTQIK 499
            DI++A RV ++L+ G V IN+      EA +GG K+SG+GRE    G+EE+ + K
Sbjct: 413 KDIARANRVARRLRAGTVWINDYHPYFAEAPWGGYKQSGIGRELGPTGLEEYQETK 468


>gnl|CDD|143430 cd07112, ALDH_GABALDH-PuuC, Escherichia coli NADP+-dependent
           gamma-glutamyl-gamma-aminobutyraldehyde dehydrogenase
           PuuC-like.  NADP+-dependent,
           gamma-glutamyl-gamma-aminobutyraldehyde dehydrogenase
           (GABALDH) PuuC of  Escherichia coli which catalyzes the
           conversion of putrescine to 4-aminobutanoate and other
           similar sequences are present in this CD.
          Length = 462

 Score =  412 bits (1061), Expect = e-140
 Identities = 184/467 (39%), Positives = 276/467 (59%), Gaps = 20/467 (4%)

Query: 43  KKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFE--QWSQTTPKERGIVLKRWLKEL 100
           + F   NPA    LA+V      DV  A+ +A+ AFE   WS+ +P ER  VL R    +
Sbjct: 2   ETFATINPATGRVLAEVAACDAADVDRAVAAARRAFESGVWSRLSPAERKAVLLRLADLI 61

Query: 101 EGNKQALAEIMTQESGKPPTESL-GEVVYANSFVDWFSEEARRTYGELVPSPVKSKEL-L 158
           E ++  LA + T + GKP +++L  +V  A +   W++E   + YGE+ P+      L L
Sbjct: 62  EAHRDELALLETLDMGKPISDALAVDVPSAANTFRWYAEAIDKVYGEVAPTG--PDALAL 119

Query: 159 MVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQI 218
           + REPLGVVG + PWNFP+ M   K+APALAAG + V+KPAE +PLTAL LA LA EA +
Sbjct: 120 ITREPLGVVGAVVPWNFPLLMAAWKIAPALAAGNSVVLKPAEQSPLTALRLAELALEAGL 179

Query: 219 PKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQG-IKRLSLELG 277
           P GVLNVV       ++ G+ L     V  ++FTGST+VG+    +  Q  +KR+ LE G
Sbjct: 180 PAGVLNVVPGF---GHTAGEALGLHMDVDALAFTGSTEVGRRFLEYSGQSNLKRVWLECG 236

Query: 278 GNAPFIVYSNA-NLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVEITKS 336
           G +P IV+++A +L  A   A A  F N G+ C A +R L+ E + D+F+E + V   + 
Sbjct: 237 GKSPNIVFADAPDLDAAAEAAAAGIFWNQGEVCSAGSRLLVHESIKDEFLEKV-VAAARE 295

Query: 337 FIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSKFFA-PTI 395
           +  GD   +    +G L++ A  +KV   ++   ++GA+++ GG +V      FF  PT+
Sbjct: 296 WKPGD-PLDPATRMGALVSEAHFDKVLGYIESGKAEGARLVAGGKRVLTETGGFFVEPTV 354

Query: 396 LTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRVGKK 455
              +TPDM    EE+FGPV   I F +EEE++ +AN+S  GLA   +T+D+S+A RV ++
Sbjct: 355 FDGVTPDMRIAREEIFGPVLSVITFDSEEEAVALANDSVYGLAASVWTSDLSRAHRVARR 414

Query: 456 LQVGMVGIN---EGIISHTEAAFGGVKESGLGREGSRHGMEEFTQIK 499
           L+ G V +N   EG I+     FGG K+SG GR+ S H ++++T++K
Sbjct: 415 LRAGTVWVNCFDEGDIT---TPFGGFKQSGNGRDKSLHALDKYTELK 458


>gnl|CDD|143468 cd07150, ALDH_VaniDH_like, Pseudomonas putida vanillin
           dehydrogenase-like.  Vanillin dehydrogenase (Vdh,
           VaniDH) involved in the metabolism of ferulic acid and
           other related  sequences are included in this CD.  The
           E. coli vanillin dehydrogenase (LigV) preferred NAD+ to
           NADP+  and exhibited a broad substrate preference,
           including vanillin,  benzaldehyde, protocatechualdehyde,
           m-anisaldehyde, and p-hydroxybenzaldehyde.
          Length = 451

 Score =  409 bits (1053), Expect = e-139
 Identities = 181/458 (39%), Positives = 267/458 (58%), Gaps = 12/458 (2%)

Query: 45  FPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNK 104
           F   NPA+    A V     +D + AI +A +AF  W+ TTP ER  +L +  + +E   
Sbjct: 1   FDDLNPADGSVYARVAVGSRQDAERAIAAAYDAFPAWAATTPSERERILLKAAEIMERRA 60

Query: 105 QALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVPSPVKSKELLMVREPL 164
             L +++  E G    ++  E  +    +   + E RR  GE +PS       + VR PL
Sbjct: 61  DDLIDLLIDEGGSTYGKAWFETTFTPELLRAAAGECRRVRGETLPSDSPGTVSMSVRRPL 120

Query: 165 GVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPKGVLN 224
           GVV  ITP+N+P+ + T+KVA ALAAG T V+KP+E+TP+  L +A +  EA +PKGV N
Sbjct: 121 GVVAGITPFNYPLILATKKVAFALAAGNTVVLKPSEETPVIGLKIAEIMEEAGLPKGVFN 180

Query: 225 VVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNAPFIV 284
           VVT        VG  L   P V  ++FTGST VG+ +     + +K+++LELGG  P IV
Sbjct: 181 VVTGGGA---EVGDELVDDPRVRMVTFTGSTAVGREIAEKAGRHLKKITLELGGKNPLIV 237

Query: 285 YSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQGDTSK 344
            ++A+L  AV  A    F + GQ C++A+R +++E V+D+FV+   V        GD   
Sbjct: 238 LADADLDYAVRAAAFGAFMHQGQICMSASRIIVEEPVYDEFVKKF-VARASKLKVGDPR- 295

Query: 345 NDKCN-LGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSKFFAPTILTNITPDM 403
            D    +GPLI+  QVE++   V+DA++KGAK++ GG    K    F+ PT+LT++TPDM
Sbjct: 296 -DPDTVIGPLISPRQVERIKRQVEDAVAKGAKLLTGG----KYDGNFYQPTVLTDVTPDM 350

Query: 404 LCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQVGMVGI 463
             + EE FGPV   I  K  EE+L +AN++  GL+    TND+ +A+++ ++L+ GMV I
Sbjct: 351 RIFREETFGPVTSVIPAKDAEEALELANDTEYGLSAAILTNDLQRAFKLAERLESGMVHI 410

Query: 464 NEG-IISHTEAAFGGVKESGLGREGSRHGMEEFTQIKY 500
           N+  I+      FGGVK SG GREG    MEEFT++K+
Sbjct: 411 NDPTILDEAHVPFGGVKASGFGREGGEWSMEEFTELKW 448


>gnl|CDD|143463 cd07145, ALDH_LactADH_F420-Bios, Methanocaldococcus jannaschii
           NAD+-dependent lactaldehyde dehydrogenase-like.
           NAD+-dependent, lactaldehyde dehydrogenase (EC=1.2.1.22)
           involved the biosynthesis of coenzyme F(420) in
           Methanocaldococcus jannaschii through the oxidation of
           lactaldehyde to lactate and generation of NAPH, and
           similar sequences are included in this CD.
          Length = 456

 Score =  405 bits (1043), Expect = e-138
 Identities = 189/461 (40%), Positives = 280/461 (60%), Gaps = 13/461 (2%)

Query: 45  FPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNK 104
             V NPAN E +  VP +  E+V+ AI  A++A +  S     +R  +L +  + +E  K
Sbjct: 1   IEVRNPANGEVIDTVPSLSREEVREAIEVAEKAKDVMSNLPAYKRYKILMKVAELIERRK 60

Query: 105 QALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVPS---PVKSKELLM-V 160
           + LA+++T E GKP  +S  EV          +EEA+   GE +P        + +   V
Sbjct: 61  EELAKLLTIEVGKPIKQSRVEVERTIRLFKLAAEEAKVLRGETIPVDAYEYNERRIAFTV 120

Query: 161 REPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPK 220
           REP+GVVG ITP+NFP  +   K+APA+A G + V+KP+ +TPLTA+ LA +  EA +P 
Sbjct: 121 REPIGVVGAITPFNFPANLFAHKIAPAIAVGNSVVVKPSSNTPLTAIELAKILEEAGLPP 180

Query: 221 GVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNA 280
           GV+NVVT      + VG  + T+P V  ISFTGST VG ++ S      K+++LELGG+ 
Sbjct: 181 GVINVVTG---YGSEVGDEIVTNPKVNMISFTGSTAVGLLIASKAGGTGKKVALELGGSD 237

Query: 281 PFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQG 340
           P IV  +A+L++AVS A+  +F N GQ C A  R L++E+V+D+F+++L VE  K    G
Sbjct: 238 PMIVLKDADLERAVSIAVRGRFENAGQVCNAVKRILVEEEVYDKFLKLL-VEKVKKLKVG 296

Query: 341 DTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSKFFAPTILTNIT 400
           D   ++  +LGPLI+   VE+++++V DA+ KG K++ GG +       FF PT+L N T
Sbjct: 297 D-PLDESTDLGPLISPEAVERMENLVNDAVEKGGKILYGGKRD---EGSFFPPTVLENDT 352

Query: 401 PDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQVGM 460
           PDM+   EEVFGPV    K K +EE++ +AN++  GL    FTNDI++A +V ++L+ G 
Sbjct: 353 PDMIVMKEEVFGPVLPIAKVKDDEEAVEIANSTEYGLQASVFTNDINRALKVARELEAGG 412

Query: 461 VGINEGIISHTEA-AFGGVKESGLGREGSRHGMEEFTQIKY 500
           V IN+      +   FGG K+SG+GREG R+ M E T+ K 
Sbjct: 413 VVINDSTRFRWDNLPFGGFKKSGIGREGVRYTMLEMTEEKT 453


>gnl|CDD|143418 cd07100, ALDH_SSADH1_GabD1, Mycobacterium tuberculosis
           succinate-semialdehyde dehydrogenase 1-like.
           Succinate-semialdehyde dehydrogenase 1 (SSADH1, GabD1,
           EC=1.2.1.16) catalyzes the NADP(+)-dependent oxidation
           of succinate semialdehyde (SSA)  to succinate.  SSADH
           activity in Mycobacterium tuberculosis (Mtb) is encoded
           by both gabD1 (Rv0234c) and gabD2 (Rv1731).  The Mtb
           GabD1 SSADH1 reportedly is an enzyme of the
           gamma-aminobutyrate shunt, which forms a functional link
           between two TCA half-cycles by converting
           alpha-ketoglutarate to succinate.
          Length = 429

 Score =  402 bits (1037), Expect = e-137
 Identities = 166/436 (38%), Positives = 249/436 (57%), Gaps = 14/436 (3%)

Query: 67  VQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNKQALAEIMTQESGKPPTESLGEV 126
           ++ A+  A  AF  W +T+  ER  +L++    L   K  LA ++T E GKP  E+  EV
Sbjct: 1   IEAALDRAHAAFLAWRKTSFAERAALLRKLADLLRERKDELARLITLEMGKPIAEARAEV 60

Query: 127 VYANSFVDWFSEEARRTYGELVPSPVKS--KELLMVREPLGVVGLITPWNFPMAMITRKV 184
                   +++E A      L   P+++   +  +  EPLGVV  I PWNFP   + R  
Sbjct: 61  EKCAWICRYYAENAEA---FLADEPIETDAGKAYVRYEPLGVVLGIMPWNFPFWQVFRFA 117

Query: 185 APALAAGCTCVIKPAEDTPLTALALAALAHEAQIPKGVL-NVVTSSRTNANSVGKLLCTS 243
           AP L AG T ++K A + P  ALA+  L  EA  P+GV  N++  S      V  ++   
Sbjct: 118 APNLMAGNTVLLKHASNVPGCALAIEELFREAGFPEGVFQNLLIDSDQ----VEAII-AD 172

Query: 244 PHVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFR 303
           P V G++ TGS + G+ + +   + +K+  LELGG+ PFIV  +A+L KAV  A+  + +
Sbjct: 173 PRVRGVTLTGSERAGRAVAAEAGKNLKKSVLELGGSDPFIVLDDADLDKAVKTAVKGRLQ 232

Query: 304 NCGQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVD 363
           N GQ+C+AA RF++ E V+D+F+E     +    + GD   ++  +LGPL      +++ 
Sbjct: 233 NAGQSCIAAKRFIVHEDVYDEFLEKFVEAMAALKV-GDPM-DEDTDLGPLARKDLRDELH 290

Query: 364 SIVQDAISKGAKVILGGGKVTKLGSKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTE 423
             V++A++ GA ++LGG +    G+ F+ PT+LT++TP M  Y EE+FGPVA  IK K E
Sbjct: 291 EQVEEAVAAGATLLLGGKRPDGPGA-FYPPTVLTDVTPGMPAYDEELFGPVAAVIKVKDE 349

Query: 424 EESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQVGMVGINEGIISHTEAAFGGVKESGL 483
           EE++ +AN+S  GL G  FT D+ +A RV ++L+ GMV IN  + S     FGGVK SG 
Sbjct: 350 EEAIALANDSPFGLGGSVFTTDLERAERVARRLEAGMVFINGMVKSDPRLPFGGVKRSGY 409

Query: 484 GREGSRHGMEEFTQIK 499
           GRE  R G+ EF  IK
Sbjct: 410 GRELGRFGIREFVNIK 425


>gnl|CDD|143449 cd07131, ALDH_AldH-CAJ73105, Uncharacterized Candidatus kuenenia
           aldehyde dehydrogenase AldH (CAJ73105)-like.
           Uncharacterized aldehyde dehydrogenase of Candidatus
           kuenenia AldH (locus CAJ73105) and similar sequences
           with similarity to alpha-aminoadipic semialdehyde
           dehydrogenase (AASADH, human ALDH7A1, EC=1.2.1.31),
           Arabidopsis ALDH7B4, and Streptomyces clavuligerus
           delta-1-piperideine-6-carboxylate dehydrogenase (P6CDH)
           are included in this CD.
          Length = 478

 Score =  404 bits (1041), Expect = e-137
 Identities = 189/478 (39%), Positives = 279/478 (58%), Gaps = 15/478 (3%)

Query: 33  YINGEWV--HSSKKFPVYNPAN-DEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKER 89
           YI GEWV   S + F   NPA+ +E +   P     DV  A+ +A+EAF +W +     R
Sbjct: 2   YIGGEWVDSASGETFDSRNPADLEEVVGTFPLSTASDVDAAVEAAREAFPEWRKVPAPRR 61

Query: 90  GIVLKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVP 149
              L R  + L+  K+ LA ++T+E GKP  E  G+V  A     + + E RR +GE VP
Sbjct: 62  AEYLFRAAELLKKRKEELARLVTREMGKPLAEGRGDVQEAIDMAQYAAGEGRRLFGETVP 121

Query: 150 SPVKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALAL 209
           S + +K+ +  R+P+GVV LITPWNFP+A+ + K+ PAL  G T V KPAEDTP  AL L
Sbjct: 122 SELPNKDAMTRRQPIGVVALITPWNFPVAIPSWKIFPALVCGNTVVFKPAEDTPACALKL 181

Query: 210 AALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGI 269
             L  EA +P GV+NVV         VG+ L   P V  +SFTGST+VG+ +   C++  
Sbjct: 182 VELFAEAGLPPGVVNVVHGR---GEEVGEALVEHPDVDVVSFTGSTEVGERIGETCARPN 238

Query: 270 KRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEML 329
           KR++LE+GG  P IV  +A+L  A+ GA+ + F   GQ C A +R ++ E V+D+F++  
Sbjct: 239 KRVALEMGGKNPIIVMDDADLDLALEGALWSAFGTTGQRCTATSRLIVHESVYDEFLKRF 298

Query: 330 KVEITKSFIQGDTSKNDKCNL-GPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGS 388
            VE  K    GD    D+    GPLIN AQ+EKV +  +    +GA ++LGG ++T  G 
Sbjct: 299 -VERAKRLRVGD--GLDEETDMGPLINEAQLEKVLNYNEIGKEEGATLLLGGERLTGGGY 355

Query: 389 K---FFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTND 445
           +   F  PT+ T++TPDM    EE+FGPV   I+  + EE++ +AN++  GL+   +T D
Sbjct: 356 EKGYFVEPTVFTDVTPDMRIAQEEIFGPVVALIEVSSLEEAIEIANDTEYGLSSAIYTED 415

Query: 446 ISQAWRVGKKLQVGMVGINEGII-SHTEAAFGGVKESGLG-REGSRHGMEEFTQIKYM 501
           +++A+R  + L+ G+  +N   I +     FGGVK+SG G RE     ++ FT+ K +
Sbjct: 416 VNKAFRARRDLEAGITYVNAPTIGAEVHLPFGGVKKSGNGHREAGTTALDAFTEWKAV 473


>gnl|CDD|143467 cd07149, ALDH_y4uC, Uncharacterized ALDH (y4uC) with similarity to
           Tortula ruralis aldehyde dehydrogenase ALDH21A1.
           Uncharacterized aldehyde dehydrogenase (ORF name y4uC)
           with sequence similarity to the moss Tortula ruralis
           aldehyde dehydrogenase ALDH21A1 (RNP123) believed to
           play an important role in the detoxification of
           aldehydes generated in response to desiccation- and
           salinity-stress, and similar sequences are included in
           this CD.
          Length = 453

 Score =  401 bits (1034), Expect = e-136
 Identities = 185/464 (39%), Positives = 277/464 (59%), Gaps = 16/464 (3%)

Query: 45  FPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNK 104
             V +P + E +  VP    EDV+ AI +AKE  ++       ER  +L+R  + LE  +
Sbjct: 1   IEVISPYDGEVIGRVPVASEEDVEKAIAAAKEGAKEMKSLPAYERAEILERAAQLLEERR 60

Query: 105 QALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVP---SP-VKSKELLMV 160
           +  A  +  E+GKP  ++  EV  A   +   +EEA+R  GE +P   SP  + +    +
Sbjct: 61  EEFARTIALEAGKPIKDARKEVDRAIETLRLSAEEAKRLAGETIPFDASPGGEGRIGFTI 120

Query: 161 REPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPK 220
           REP+GVV  ITP+NFP+ ++  KV PA+AAG   V+KPA  TPL+AL LA L  EA +PK
Sbjct: 121 REPIGVVAAITPFNFPLNLVAHKVGPAIAAGNAVVLKPASQTPLSALKLAELLLEAGLPK 180

Query: 221 GVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNA 280
           G LNVVT S     +VG  L T P V  ISFTGS  VG+ +      G+K+++LELG NA
Sbjct: 181 GALNVVTGS---GETVGDALVTDPRVRMISFTGSPAVGEAIAR--KAGLKKVTLELGSNA 235

Query: 281 PFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQG 340
             IV ++A+L+KAV   ++  F N GQ C++  R  + E ++D+F+E   V  TK  + G
Sbjct: 236 AVIVDADADLEKAVERCVSGAFANAGQVCISVQRIFVHEDIYDEFLERF-VAATKKLVVG 294

Query: 341 DTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSKFFAPTILTNIT 400
           D   ++  ++GP+I+ A+ E+++  V++A+  GA+++ GG    + G+    PT+LT++ 
Sbjct: 295 DPL-DEDTDVGPMISEAEAERIEEWVEEAVEGGARLLTGG---KRDGA-ILEPTVLTDVP 349

Query: 401 PDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQVGM 460
           PDM    EEVF PV     F T +E++ +AN+S  GL    FTND+ +A +  ++L+VG 
Sbjct: 350 PDMKVVCEEVFAPVVSLNPFDTLDEAIAMANDSPYGLQAGVFTNDLQKALKAARELEVGG 409

Query: 461 VGINEGIISHTEA-AFGGVKESGLGREGSRHGMEEFTQIKYMCF 503
           V IN+      +   +GGVKESG GREG R+ +EE T+IK +CF
Sbjct: 410 VMINDSSTFRVDHMPYGGVKESGTGREGPRYAIEEMTEIKLVCF 453


>gnl|CDD|143415 cd07097, ALDH_KGSADH-YcbD, Bacillus subtilis NADP+-dependent
           alpha-ketoglutaric semialdehyde dehydrogenase ycbD-like.
            Kinetic studies of the Bacillus subtilis ALDH-like ycbD
           protein, which is involved in d-glucarate/d-galactarate
           utilization, reveal that it is a NADP+-dependent,
           alpha-ketoglutaric semialdehyde dehydrogenase (KGSADH).
           KGSADHs (EC 1.2.1.26) catalyze the NAD(P)+-dependent
           conversion of KGSA to alpha-ketoglutarate.
           Interestingly, the NADP+-dependent, tetrameric,
           2,5-dioxopentanoate dehydrogenase (EC=1.2.1.26), an
           enzyme involved in the catabolic pathway for D-arabinose
           in Sulfolobus solfataricus, also clusters in this group.
           This CD shows a distant phylogenetic relationship to the
           Azospirillum brasilense KGSADH-II (-III) group.
          Length = 473

 Score =  400 bits (1031), Expect = e-136
 Identities = 180/474 (37%), Positives = 268/474 (56%), Gaps = 15/474 (3%)

Query: 33  YINGEWVHSSKKFPVYNPAN-DEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGI 91
           YI+GEWV         NP++  + +        ED   AI +A  AF  W +T+P+ R  
Sbjct: 4   YIDGEWVAGGDGEENRNPSDTSDVVGKYARASAEDADAAIAAAAAAFPAWRRTSPEARAD 63

Query: 92  VLKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVPSP 151
           +L +   ELE  K+ LA ++T+E GK   E+ GEV  A     +++ EA R  GE +PS 
Sbjct: 64  ILDKAGDELEARKEELARLLTREEGKTLPEARGEVTRAGQIFRYYAGEALRLSGETLPST 123

Query: 152 VKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAA 211
               E+   REPLGVVGLITPWNFP+A+   K+APALA G T V KPAE TP +A AL  
Sbjct: 124 RPGVEVETTREPLGVVGLITPWNFPIAIPAWKIAPALAYGNTVVFKPAELTPASAWALVE 183

Query: 212 LAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKR 271
           +  EA +P GV N+V  S    + VG+ L   P V  +SFTGST VG+ + +  +    R
Sbjct: 184 ILEEAGLPAGVFNLVMGS---GSEVGQALVEHPDVDAVSFTGSTAVGRRIAAAAAARGAR 240

Query: 272 LSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKV 331
           + LE+GG  P +V  +A+L  AV  A+   F + GQ C A++R ++ E + D+FVE L V
Sbjct: 241 VQLEMGGKNPLVVLDDADLDLAVECAVQGAFFSTGQRCTASSRLIVTEGIHDRFVEAL-V 299

Query: 332 EITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSK-- 389
           E TK+   GD + ++  ++GP+++  Q+EK    ++ A S+GAK++ GG ++ K   +  
Sbjct: 300 ERTKALKVGD-ALDEGVDIGPVVSERQLEKDLRYIEIARSEGAKLVYGGERL-KRPDEGY 357

Query: 390 FFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQA 449
           + AP +   +T DM    EE+FGPVA  I+ +  +E+L +AN++  GL+    T  +  A
Sbjct: 358 YLAPALFAGVTNDMRIAREEIFGPVAAVIRVRDYDEALAIANDTEFGLSAGIVTTSLKHA 417

Query: 450 WRVGKKLQVGMVGINE---GIISHTEAAFGGVKESGLG-REGSRHGMEEFTQIK 499
               ++++ G+V +N    G+  H    FGG K S  G RE     +E +T IK
Sbjct: 418 THFKRRVEAGVVMVNLPTAGVDYH--VPFGGRKGSSYGPREQGEAALEFYTTIK 469


>gnl|CDD|143428 cd07110, ALDH_F10_BADH, Arabidopsis betaine aldehyde dehydrogenase
           1 and 2, ALDH family 10A8 and 10A9-like.  Present in
           this CD are the Arabidopsis betaine aldehyde
           dehydrogenase (BADH) 1 (chloroplast) and 2
           (mitochondria), also known as, aldehyde dehydrogenase
           family 10 member A8 and aldehyde dehydrogenase family 10
           member A9, respectively, and are putative dehydration-
           and salt-inducible BADHs (EC 1.2.1.8) that catalyze the
           oxidation of betaine aldehyde to the compatible solute
           glycine betaine.
          Length = 456

 Score =  397 bits (1022), Expect = e-135
 Identities = 177/458 (38%), Positives = 263/458 (57%), Gaps = 11/458 (2%)

Query: 47  VYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNKQA 106
           V NPA +  + ++P    EDV  A+ +A+ AF +W +TT  ER   L+   + +   ++ 
Sbjct: 1   VINPATEATIGEIPAATAEDVDAAVRAARRAFPRWKKTTGAERAKYLRAIAEGVRERREE 60

Query: 107 LAEIMTQESGKPPTESLGEVVYANSFVDWFSEEA---RRTYGELVPSPVKSKELLMVREP 163
           LAE+  +++GKP  E+  +V       +++++ A          VP P +  +  + REP
Sbjct: 61  LAELEARDNGKPLDEAAWDVDDVAGCFEYYADLAEQLDAKAERAVPLPSEDFKARVRREP 120

Query: 164 LGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPKGVL 223
           +GVVGLITPWNFP+ M   KVAPALAAGCT V+KP+E T LT L LA +A EA +P GVL
Sbjct: 121 VGVVGLITPWNFPLLMAAWKVAPALAAGCTVVLKPSELTSLTELELAEIAAEAGLPPGVL 180

Query: 224 NVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNAPFI 283
           NVVT +    +  G  L   P +  ISFTGST  G  +    +Q IK +SLELGG +P I
Sbjct: 181 NVVTGT---GDEAGAPLAAHPGIDKISFTGSTATGSQVMQAAAQDIKPVSLELGGKSPII 237

Query: 284 VYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQGDTS 343
           V+ +A+L+KAV  AM   F N GQ C A +R L+ E + D F+E L        +     
Sbjct: 238 VFDDADLEKAVEWAMFGCFWNNGQICSATSRLLVHESIADAFLERLATAAEAIRVGDPLE 297

Query: 344 KNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSKFF-APTILTNITPD 402
           +     LGPL++ AQ EKV S +     +GA+++ GG +   L   +F APT+  ++  D
Sbjct: 298 EG--VRLGPLVSQAQYEKVLSFIARGKEEGARLLCGGRRPAHLEKGYFIAPTVFADVPTD 355

Query: 403 MLCYTEEVFGPVAVCIK-FKTEEESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQVGMV 461
              + EE+FGPV +C++ F TE+E++ +AN+S  GLA    + D  +  RV + L+ G+V
Sbjct: 356 SRIWREEIFGPV-LCVRSFATEDEAIALANDSEYGLAAAVISRDAERCDRVAEALEAGIV 414

Query: 462 GINEGIISHTEAAFGGVKESGLGREGSRHGMEEFTQIK 499
            IN       +A +GG K SG+GRE    G++ + ++K
Sbjct: 415 WINCSQPCFPQAPWGGYKRSGIGRELGEWGLDNYLEVK 452


>gnl|CDD|143424 cd07106, ALDH_AldA-AAD23400, Streptomyces aureofaciens putative
           aldehyde dehydrogenase AldA (AAD23400)-like.  Putative
           aldehyde dehydrogenase, AldA, from Streptomyces
           aureofaciens (locus AAD23400) and other similar
           sequences are present in this CD.
          Length = 446

 Score =  388 bits (1000), Expect = e-131
 Identities = 167/453 (36%), Positives = 240/453 (52%), Gaps = 11/453 (2%)

Query: 47  VYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNKQA 106
           V NPA  E  A  P      +  A+ +AK AF  WS T  +ER   L      +E N + 
Sbjct: 1   VINPATGEVFASAPVASEAQLDQAVAAAKAAFPGWSATPLEERRAALLAIADAIEANAEE 60

Query: 107 LAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVPSPVKSKELLMVREPLGV 166
           LA ++T E GKP  E+  EV  A   V W    A     + V     ++ + + R+PLGV
Sbjct: 61  LARLLTLEQGKPLAEAQFEVGGA---VAWLRYTASLDLPDEVIEDDDTRRVELRRKPLGV 117

Query: 167 VGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPKGVLNVV 226
           V  I PWNFP+ +   K+APAL AG T V+KP+  TPL  L L  LA E  +P GVLNVV
Sbjct: 118 VAAIVPWNFPLLLAAWKIAPALLAGNTVVLKPSPFTPLCTLKLGELAQEV-LPPGVLNVV 176

Query: 227 TSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNAPFIVYS 286
           +      + +G  L + P +  ISFTGST  GK + +  ++ +KR++LELGGN   IV  
Sbjct: 177 SGG----DELGPALTSHPDIRKISFTGSTATGKKVMASAAKTLKRVTLELGGNDAAIVLP 232

Query: 287 NANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQGDTSKND 346
           + ++           F N GQ C A  R  + E ++D+F E L V + K+ + GD   + 
Sbjct: 233 DVDIDAVAPKLFWGAFINSGQVCAAIKRLYVHESIYDEFCEAL-VALAKAAVVGDGL-DP 290

Query: 347 KCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSKFFAPTILTNITPDMLCY 406
              LGP+ N  Q +KV  +V+DA +KGAKV+  GG+       F  PTI+ +        
Sbjct: 291 GTTLGPVQNKMQYDKVKELVEDAKAKGAKVLA-GGEPLDGPGYFIPPTIVDDPPEGSRIV 349

Query: 407 TEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQVGMVGINEG 466
            EE FGPV   +K+  E+E +  AN+S  GL    +++D+ +A  V ++L+ G V IN  
Sbjct: 350 DEEQFGPVLPVLKYSDEDEVIARANDSEYGLGASVWSSDLERAEAVARRLEAGTVWINTH 409

Query: 467 IISHTEAAFGGVKESGLGREGSRHGMEEFTQIK 499
                +A FGG K+SG+G E    G++E+TQ +
Sbjct: 410 GALDPDAPFGGHKQSGIGVEFGIEGLKEYTQTQ 442


>gnl|CDD|143462 cd07144, ALDH_ALD2-YMR170C, Saccharomyces cerevisiae aldehyde
           dehydrogenase 2 (YMR170c)-like.  NAD(P)+-dependent
           Saccharomyces cerevisiae aldehyde dehydrogenase 2
           (YMR170c, ALD5, EC=1.2.1.5) and other similar sequences,
           are present in this CD.
          Length = 484

 Score =  389 bits (1001), Expect = e-131
 Identities = 188/487 (38%), Positives = 276/487 (56%), Gaps = 17/487 (3%)

Query: 19  KYLQKLCLSTYTQAYINGEWVHSS--KKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKE 76
            Y Q   L      +IN E+V SS  +     NP+  E +A V   G EDV  A+ +A++
Sbjct: 3   SYDQPTGL------FINNEFVKSSDGETIKTVNPSTGEVIASVYAAGEEDVDKAVKAARK 56

Query: 77  AFEQ-WSQTTPKERGIVLKRWLKELEGNKQALAEIMTQESGKP-PTESLGEVVYANSFVD 134
           AFE  WS+ T +ERG +L +    +E N+  LA I   +SGKP  + +LG++    + + 
Sbjct: 57  AFESWWSKVTGEERGELLDKLADLVEKNRDLLAAIEALDSGKPYHSNALGDLDEIIAVIR 116

Query: 135 WFSEEARRTYGELVPSPVKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTC 194
           +++  A +  G+ +P+    K    + EP GV G I PWN+P+AM   K+APALAAG T 
Sbjct: 117 YYAGWADKIQGKTIPTSPN-KLAYTLHEPYGVCGQIIPWNYPLAMAAWKLAPALAAGNTV 175

Query: 195 VIKPAEDTPLTALALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGS 254
           VIKPAE+TPL+ L  A L  EA  P GV+N++          G  L   P V  I+FTGS
Sbjct: 176 VIKPAENTPLSLLYFANLVKEAGFPPGVVNIIPGYGA---VAGSALAEHPDVDKIAFTGS 232

Query: 255 TQVGKILYSHCSQGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANR 314
           T  G+++    +Q +K ++LE GG +P +V+ +A+L +AV  A A    N GQ C A +R
Sbjct: 233 TATGRLVMKAAAQNLKAVTLECGGKSPALVFEDADLDQAVKWAAAGIMYNSGQNCTATSR 292

Query: 315 FLIQEQVFDQFVEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGA 374
             +QE ++D+FVE     + +++  G    +D   +GP ++  Q ++V S ++    +GA
Sbjct: 293 IYVQESIYDKFVEKFVEHVKQNYKVGSPF-DDDTVVGPQVSKTQYDRVLSYIEKGKKEGA 351

Query: 375 KVILGGGKVTKLGSK--FFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANN 432
           K++ GG K  +   K  F  PTI T++  DM    EE+FGPV V  KFKT EE++  AN+
Sbjct: 352 KLVYGGEKAPEGLGKGYFIPPTIFTDVPQDMRIVKEEIFGPVVVISKFKTYEEAIKKAND 411

Query: 433 SNTGLAGYFFTNDISQAWRVGKKLQVGMVGINEGIISHTEAAFGGVKESGLGREGSRHGM 492
           +  GLA   FT DI +A RV ++L+ GMV IN    S     FGG K SG+GRE   +G+
Sbjct: 412 TTYGLAAAVFTKDIRRAHRVARELEAGMVWINSSNDSDVGVPFGGFKMSGIGRELGEYGL 471

Query: 493 EEFTQIK 499
           E +TQ K
Sbjct: 472 ETYTQTK 478


>gnl|CDD|143427 cd07109, ALDH_AAS00426, Uncharacterized Saccharopolyspora spinosa
           aldehyde dehydrogenase (AAS00426)-like.  Uncharacterized
           aldehyde dehydrogenase of Saccharopolyspora spinosa
           (AAS00426) and other similar sequences, are present in
           this CD.
          Length = 454

 Score =  386 bits (995), Expect = e-130
 Identities = 171/461 (37%), Positives = 261/461 (56%), Gaps = 19/461 (4%)

Query: 47  VYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQ-WSQTTPKERGIVLKRWLKELEGNKQ 105
           V++P+  E  A +   G  DV  A+ +A+ AFE  W + +P ERG +L R  + +  +  
Sbjct: 1   VFDPSTGEVFARIARGGAADVDRAVQAARRAFESGWLRLSPAERGRLLLRIARLIREHAD 60

Query: 106 ALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVPSPVKSKELLMVREPLG 165
            LA + + ++GKP T++  +V  A  + +++   A + +GE +P       +  VREP G
Sbjct: 61  ELARLESLDTGKPLTQARADVEAAARYFEYYGGAADKLHGETIP-LGPGYFVYTVREPHG 119

Query: 166 VVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPKGVLNV 225
           V G I PWN+P+ +  R VAPALAAG   V+KPAED PLTAL LA LA EA +P G LNV
Sbjct: 120 VTGHIIPWNYPLQITGRSVAPALAAGNAVVVKPAEDAPLTALRLAELAEEAGLPAGALNV 179

Query: 226 VTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNAPFIVY 285
           VT     A   G  L   P V  ISFTGS + G  +    ++ +  ++LELGG +P IV+
Sbjct: 180 VTGLGAEA---GAALVAHPGVDHISFTGSVETGIAVMRAAAENVVPVTLELGGKSPQIVF 236

Query: 286 SNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQGDTSKN 345
           ++A+L+ A+   + A  +N GQ C A +R L+   ++D+ +E L VE  ++   G   ++
Sbjct: 237 ADADLEAALPVVVNAIIQNAGQTCSAGSRLLVHRSIYDEVLERL-VERFRALRVGPGLED 295

Query: 346 DKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSK--FFAPTILTNITPDM 403
              +LGPLI+  Q+++V+  V  A ++GA+++ GG       +   F APT+L ++ PD 
Sbjct: 296 P--DLGPLISAKQLDRVEGFVARARARGARIVAGGRIAEGAPAGGYFVAPTLLDDVPPDS 353

Query: 404 LCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQVGMVGI 463
               EE+FGPV   + F  E E++ +AN ++ GL    +T D  +A RV ++L+ G V +
Sbjct: 354 RLAQEEIFGPVLAVMPFDDEAEAIALANGTDYGLVAGVWTRDGDRALRVARRLRAGQVFV 413

Query: 464 NE-----GIISHTEAAFGGVKESGLGREGSRHGMEEFTQIK 499
           N      GI    E  FGGVK+SG GRE     +  +TQ K
Sbjct: 414 NNYGAGGGI----ELPFGGVKKSGHGREKGLEALYNYTQTK 450


>gnl|CDD|143457 cd07139, ALDH_AldA-Rv0768, Mycobacterium tuberculosis aldehyde
           dehydrogenase  AldA-like.  The Mycobacterium
           tuberculosis NAD+-dependent, aldehyde dehydrogenase  PDB
           structure,  3B4W, and the Mycobacterium tuberculosis
           H37Rv aldehyde dehydrogenase  AldA (locus Rv0768)
           sequence, as well as the Rhodococcus rhodochrous ALDH
           involved in haloalkane catabolism, and other similar
           sequences, are included in this CD.
          Length = 471

 Score =  387 bits (997), Expect = e-130
 Identities = 171/474 (36%), Positives = 262/474 (55%), Gaps = 15/474 (3%)

Query: 33  YINGEWV--HSSKKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQ--WSQTTPKE 88
           +I G WV    S+   V +PA +E +  VP+    DV  A+ +A+ AF+   W + +P E
Sbjct: 2   FIGGRWVAPSGSETIDVVSPATEEVVGRVPEATPADVDAAVAAARRAFDNGPWPRLSPAE 61

Query: 89  RGIVLKRWLKELEGNKQALAEIMTQESGKPPTES-LGEVVYANSFVDWFSEEARR-TYGE 146
           R  VL+R    LE     LA + T E+G P + S   +     + + +++  AR   + E
Sbjct: 62  RAAVLRRLADALEARADELARLWTAENGMPISWSRRAQGPGPAALLRYYAALARDFPFEE 121

Query: 147 LVPSPVKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTA 206
             P       L+  REP+GVV  I PWN P+ +   K+APALAAGCT V+KP+ +TPL A
Sbjct: 122 RRPGSGGGHVLV-RREPVGVVAAIVPWNAPLFLAALKIAPALAAGCTVVLKPSPETPLDA 180

Query: 207 LALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCS 266
             LA  A EA +P GV+NVV + R     VG+ L   P V  +SFTGST  G+ + + C 
Sbjct: 181 YLLAEAAEEAGLPPGVVNVVPADR----EVGEYLVRHPGVDKVSFTGSTAAGRRIAAVCG 236

Query: 267 QGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFV 326
           + + R++LELGG +  IV  +A+L  AV G + A   N GQ CVA  R L+    +D+ V
Sbjct: 237 ERLARVTLELGGKSAAIVLDDADLDAAVPGLVPASLMNNGQVCVALTRILVPRSRYDEVV 296

Query: 327 EMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKL 386
           E L      +   GD   +    +GPL +  Q E+V+  +    ++GA+++ GGG+   L
Sbjct: 297 EAL-AAAVAALKVGDPL-DPATQIGPLASARQRERVEGYIAKGRAEGARLVTGGGRPAGL 354

Query: 387 GSKFF-APTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTND 445
              +F  PT+  ++  DM    EE+FGPV   I +  E++++ +AN+S+ GL+G  +T D
Sbjct: 355 DRGWFVEPTLFADVDNDMRIAQEEIFGPVLSVIPYDDEDDAVRIANDSDYGLSGSVWTAD 414

Query: 446 ISQAWRVGKKLQVGMVGINEGIISHTEAAFGGVKESGLGREGSRHGMEEFTQIK 499
           + +   V ++++ G VG+N G      A FGG K+SG+GREG   G++ + + K
Sbjct: 415 VERGLAVARRIRTGTVGVN-GFRLDFGAPFGGFKQSGIGREGGPEGLDAYLETK 467


>gnl|CDD|143436 cd07118, ALDH_SNDH, Gluconobacter oxydans L-sorbosone
           dehydrogenase-like.  Included in this CD is the
           L-sorbosone dehydrogenase (SNDH) from Gluconobacter
           oxydans UV10. In G. oxydans,  D-sorbitol is converted to
           2-keto-L-gulonate (a precursor of L-ascorbic acid) in
           sequential oxidation steps catalyzed by a FAD-dependent,
           L-sorbose dehydrogenase and an NAD(P)+-dependent,
           L-sorbosone dehydrogenase.
          Length = 454

 Score =  383 bits (985), Expect = e-129
 Identities = 167/452 (36%), Positives = 270/452 (59%), Gaps = 7/452 (1%)

Query: 50  PANDEHLADVPDMGVEDVQYAIMSAKEAFEQ--WSQTTPKERGIVLKRWLKELEGNKQAL 107
           PA+   +A   +  VEDV  A+ +A++AF++  W + +  ER  VL +    +   ++ L
Sbjct: 4   PAHGVVVARYAEGTVEDVDAAVAAARKAFDKGPWPRMSGAERAAVLLKVADLIRARRERL 63

Query: 108 AEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVPSPVKSKELLMVREPLGVV 167
           A I T ESGKP +++ GE+  A     + +  AR  +G+   +       L++REP+GVV
Sbjct: 64  ALIETLESGKPISQARGEIEGAADLWRYAASLARTLHGDSYNNLGDDMLGLVLREPIGVV 123

Query: 168 GLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPKGVLNVVT 227
           G+ITPWNFP  ++++K+  ALAAGCT V+KP+E T  T L LA L  EA +P GV+N+VT
Sbjct: 124 GIITPWNFPFLILSQKLPFALAAGCTVVVKPSEFTSGTTLMLAELLIEAGLPAGVVNIVT 183

Query: 228 SSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNAPFIVYSN 287
                  +VG+ +   P V  +SFTGST+VGK + +  ++ +K++SLELGG  P IV+++
Sbjct: 184 GYGA---TVGQAMTEHPDVDMVSFTGSTRVGKAIAAAAARNLKKVSLELGGKNPQIVFAD 240

Query: 288 ANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQGDTSKNDK 347
           A+L  A    +   + N G+ C + +R L+ E + D FV  + V  ++    GD   + +
Sbjct: 241 ADLDAAADAVVFGVYFNAGECCNSGSRLLVHESIADAFVAAV-VARSRKVRVGD-PLDPE 298

Query: 348 CNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSKFFAPTILTNITPDMLCYT 407
             +G +IN AQ+ K+   V    ++GA ++LGG ++      F+ PTI T++TPDM    
Sbjct: 299 TKVGAIINEAQLAKITDYVDAGRAEGATLLLGGERLASAAGLFYQPTIFTDVTPDMAIAR 358

Query: 408 EEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQVGMVGINEGI 467
           EE+FGPV   + F T +E++ +AN++  GL+   ++ DI  A  V ++++ G V +N  +
Sbjct: 359 EEIFGPVLSVLTFDTVDEAIALANDTVYGLSAGVWSKDIDTALTVARRIRAGTVWVNTFL 418

Query: 468 ISHTEAAFGGVKESGLGREGSRHGMEEFTQIK 499
               E  FGG K+SG+GRE  R+G+EE+T++K
Sbjct: 419 DGSPELPFGGFKQSGIGRELGRYGVEEYTELK 450


>gnl|CDD|237391 PRK13473, PRK13473, gamma-aminobutyraldehyde dehydrogenase;
           Provisional.
          Length = 475

 Score =  383 bits (986), Expect = e-129
 Identities = 179/479 (37%), Positives = 254/479 (53%), Gaps = 22/479 (4%)

Query: 30  TQAYINGEWVHSS-KKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKE 88
           T+  INGE V    +K PVYNPA  E LA++ +     V  A+ +A  AF +WSQTTPKE
Sbjct: 3   TKLLINGELVAGEGEKQPVYNPATGEVLAEIAEASAAQVDAAVAAADAAFPEWSQTTPKE 62

Query: 89  RGIVLKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFV-DWFSEEAR----RT 143
           R   L +    +E N    A + +   GKP   +L + + A   V  +F+  AR    + 
Sbjct: 63  RAEALLKLADAIEENADEFARLESLNCGKPLHLALNDEIPAIVDVFRFFAGAARCLEGKA 122

Query: 144 YGELVP---SPVKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAE 200
            GE +    S ++       R+P+GVV  I PWN+P+ M   K+APALAAG T V+KP+E
Sbjct: 123 AGEYLEGHTSMIR-------RDPVGVVASIAPWNYPLMMAAWKLAPALAAGNTVVLKPSE 175

Query: 201 DTPLTALALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKI 260
            TPLTAL LA LA +  +P GVLNVVT       +VG  L   P V  +S TGS   GK 
Sbjct: 176 ITPLTALKLAELAADI-LPPGVLNVVTGR---GATVGDALVGHPKVRMVSLTGSIATGKH 231

Query: 261 LYSHCSQGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQ 320
           + S  +  +KR  LELGG AP IV+ +A+L   V G     + N GQ C AA R   Q  
Sbjct: 232 VLSAAADSVKRTHLELGGKAPVIVFDDADLDAVVEGIRTFGYYNAGQDCTAACRIYAQRG 291

Query: 321 VFDQFVEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGG 380
           ++D  V  L      +   GD    D   LGPLI+ A  ++V   V+ A + G   ++ G
Sbjct: 292 IYDDLVAKL-AAAVATLKVGDPDDEDT-ELGPLISAAHRDRVAGFVERAKALGHIRVVTG 349

Query: 381 GKVTKLGSKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGY 440
           G+       ++ PT+L     D      EVFGPV     F  E++++  AN+S+ GLA  
Sbjct: 350 GEAPDGKGYYYEPTLLAGARQDDEIVQREVFGPVVSVTPFDDEDQAVRWANDSDYGLASS 409

Query: 441 FFTNDISQAWRVGKKLQVGMVGINEGIISHTEAAFGGVKESGLGREGSRHGMEEFTQIK 499
            +T D+ +A RV  +LQ G   +N   +  +E   GG K+SG G++ S +G+E++T ++
Sbjct: 410 VWTRDVGRAHRVSARLQYGCTWVNTHFMLVSEMPHGGQKQSGYGKDMSLYGLEDYTVVR 468


>gnl|CDD|143409 cd07090, ALDH_F9_TMBADH, NAD+-dependent
           4-trimethylaminobutyraldehyde dehydrogenase, ALDH family
           9A1.  NAD+-dependent, 4-trimethylaminobutyraldehyde
           dehydrogenase (TMABADH, EC=1.2.1.47), also known as
           aldehyde dehydrogenase family 9 member A1 (ALDH9A1) in
           humans, is a cytosolic tetramer which catalyzes the
           oxidation of gamma-aminobutyraldehyde involved in
           4-aminobutyric acid (GABA) biosynthesis  and also
           oxidizes betaine aldehyde
           (gamma-trimethylaminobutyraldehyde) which is involved in
           carnitine biosynthesis.
          Length = 457

 Score =  378 bits (974), Expect = e-127
 Identities = 186/460 (40%), Positives = 256/460 (55%), Gaps = 11/460 (2%)

Query: 47  VYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNKQA 106
           V  PA  E LA V   G EDV  A+ SAK A ++WS T+  ERG +L++    L      
Sbjct: 1   VIEPATGEVLATVHCAGAEDVDLAVKSAKAAQKEWSATSGMERGRILRKAADLLRERNDE 60

Query: 107 LAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVPSPVKSKELLMVREPLGV 166
           +A + T ++GKP  E+  ++  +   +++++  A    GE VP P  S      REPLGV
Sbjct: 61  IARLETIDNGKPIEEARVDIDSSADCLEYYAGLAPTLSGEHVPLPGGSFAYTR-REPLGV 119

Query: 167 VGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPKGVLNVV 226
              I  WN+P+ + + K APALA G   V KP+  TPLTAL LA +  EA +P GV NVV
Sbjct: 120 CAGIGAWNYPIQIASWKSAPALACGNAMVYKPSPFTPLTALLLAEILTEAGLPDGVFNVV 179

Query: 227 TSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNAPFIVYS 286
                     G+LLC  P VA +SFTGS   GK + S  ++GIK ++LELGG +P I++ 
Sbjct: 180 QG----GGETGQLLCEHPDVAKVSFTGSVPTGKKVMSAAAKGIKHVTLELGGKSPLIIFD 235

Query: 287 NANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQGDTSKND 346
           +A+L+ AV+GAM A F + GQ C    R  +Q  + D+F E L VE TK    GD    D
Sbjct: 236 DADLENAVNGAMMANFLSQGQVCSNGTRVFVQRSIKDEFTERL-VERTKKIRIGDPLDED 294

Query: 347 KCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKL----GSKFFAPTILTNITPD 402
              +G LI+   +EKV   ++ A  +GAKV+ GG +V          + +P +LT+ T D
Sbjct: 295 T-QMGALISEEHLEKVLGYIESAKQEGAKVLCGGERVVPEDGLENGFYVSPCVLTDCTDD 353

Query: 403 MLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQVGMVG 462
           M    EE+FGPV   + F TEEE +  AN++  GLA   FT D+ +A RV  +LQ G   
Sbjct: 354 MTIVREEIFGPVMSILPFDTEEEVIRRANDTTYGLAAGVFTRDLQRAHRVIAQLQAGTCW 413

Query: 463 INEGIISHTEAAFGGVKESGLGREGSRHGMEEFTQIKYMC 502
           IN   IS  E  FGG K+SG GRE     +E +TQ+K + 
Sbjct: 414 INTYNISPVEVPFGGYKQSGFGRENGTAALEHYTQLKTVY 453


>gnl|CDD|143438 cd07120, ALDH_PsfA-ACA09737, Pseudomonas putida aldehyde
           dehydrogenase PsfA (ACA09737)-like.  Included in this CD
           is the aldehyde dehydrogenase (PsfA, locus ACA09737) of
           Pseudomonas putida involved in furoic acid metabolism.
           Transcription of psfA was induced in response to
           2-furoic acid, furfuryl alcohol, and furfural.
          Length = 455

 Score =  376 bits (968), Expect = e-126
 Identities = 165/461 (35%), Positives = 252/461 (54%), Gaps = 14/461 (3%)

Query: 48  YNPANDEHLADVPDMGVEDVQYAIMSAKEAFE--QWSQTTPKERGIVLKRWLKELEGNKQ 105
            +PA  E +    D GV + + AI +A+ AF+   W+   P+ R  VL       E N +
Sbjct: 2   IDPATGEVIGTYADGGVAEAEAAIAAARRAFDETDWAHD-PRLRARVLLELADAFEANAE 60

Query: 106 ALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELV-PSPVKSKELLMVREPL 164
            LA ++  E+GK   E+  E+  A S + +++  AR   G ++ P P      L++REP+
Sbjct: 61  RLARLLALENGKILGEARFEISGAISELRYYAGLARTEAGRMIEPEPGSFS--LVLREPM 118

Query: 165 GVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEA-QIPKGVL 223
           GV G+I PWN P+ ++ R +APALAAGCT V+KPA  T     A+  +  E   +P GV+
Sbjct: 119 GVAGIIVPWNSPVVLLVRSLAPALAAGCTVVVKPAGQTAQINAAIIRILAEIPSLPAGVV 178

Query: 224 NVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNAPFI 283
           N+ T S +        L  SP V  ISFTGST  G+ + +  +  +KRL LELGG  P I
Sbjct: 179 NLFTESGS---EGAAHLVASPDVDVISFTGSTATGRAIMAAAAPTLKRLGLELGGKTPCI 235

Query: 284 VYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQGDTS 343
           V+ +A+L  A+     A     GQ C+A +R L+Q  + D+  + L      +   G   
Sbjct: 236 VFDDADLDAALPKLERALTIFAGQFCMAGSRVLVQRSIADEVRDRLA-ARLAAVKVGPGL 294

Query: 344 KNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSK--FFAPTILTNITP 401
                ++GPLI+ A V++VD +V+ AI+ GA+V+L GG VT+  +K  F  PT+L    P
Sbjct: 295 DPAS-DMGPLIDRANVDRVDRMVERAIAAGAEVVLRGGPVTEGLAKGAFLRPTLLEVDDP 353

Query: 402 DMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQVGMV 461
           D     EE+FGPV     F  E E++ +AN+++ GLA   +T D+++A RV + ++ G V
Sbjct: 354 DADIVQEEIFGPVLTLETFDDEAEAVALANDTDYGLAASVWTRDLARAMRVARAIRAGTV 413

Query: 462 GINEGIISHTEAAFGGVKESGLGREGSRHGMEEFTQIKYMC 502
            IN+      EA  GG ++SGLGR      +E+F + K++ 
Sbjct: 414 WINDWNKLFAEAEEGGYRQSGLGRLHGVAALEDFIEYKHIY 454


>gnl|CDD|143433 cd07115, ALDH_HMSADH_HapE, Pseudomonas fluorescens 4-hydroxymuconic
           semialdehyde dehydrogenase-like.  4-hydroxymuconic
           semialdehyde dehydrogenase (HapE, EC=1.2.1.61) of
           Pseudomonas fluorescens ACB involved in
           4-hydroxyacetophenone degradation, and putative
           hydroxycaproate semialdehyde dehydrogenase (ChnE) of
           Brachymonas petroleovorans involved in cyclohexane
           metabolism, and other similar sequences, are present in
           this CD.
          Length = 453

 Score =  376 bits (966), Expect = e-126
 Identities = 182/456 (39%), Positives = 258/456 (56%), Gaps = 10/456 (2%)

Query: 49  NPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNKQALA 108
           NPA  E +A V     EDV  A+ +A+ AFE WS   P ERG +L R  + +  N   LA
Sbjct: 3   NPATGELIARVAQASAEDVDAAVAAARAAFEAWSAMDPAERGRILWRLAELILANADELA 62

Query: 109 EIMTQESGKPPTESLG-EVVYANSFVDWFSEEARRTYGELVPSPVKSKEL-LMVREPLGV 166
            + + ++GKP   +   +V  A     +++  A +  GE++P  V+   L   VREP+GV
Sbjct: 63  RLESLDTGKPIRAARRLDVPRAADTFRYYAGWADKIEGEVIP--VRGPFLNYTVREPVGV 120

Query: 167 VGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPKGVLNVV 226
           VG I PWNFP+     KVAPALAAG T V+KPAE TPL+AL +A L  EA  P GVLNVV
Sbjct: 121 VGAIVPWNFPLMFAAWKVAPALAAGNTVVLKPAELTPLSALRIAELMAEAGFPAGVLNVV 180

Query: 227 TSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNAPFIVYS 286
           T         G  L   P V  I+FTGST VG+ +    +  +KR+SLELGG +  IV++
Sbjct: 181 TGF---GEVAGAALVEHPDVDKITFTGSTAVGRKIMQGAAGNLKRVSLELGGKSANIVFA 237

Query: 287 NANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQGDTSKND 346
           +A+L  AV  A    F N GQ C A +R L+ E ++D+F+E     + +S   GD   + 
Sbjct: 238 DADLDAAVRAAATGIFYNQGQMCTAGSRLLVHESIYDEFLERF-TSLARSLRPGDPL-DP 295

Query: 347 KCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSKFFAPTILTNITPDMLCY 406
           K  +GPL++ AQ ++V   V     +GA+++ GG +    G  F  PTI   + P+M   
Sbjct: 296 KTQMGPLVSQAQFDRVLDYVDVGREEGARLLTGGKRPGARGF-FVEPTIFAAVPPEMRIA 354

Query: 407 TEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQVGMVGINEG 466
            EE+FGPV   ++F+ EEE+L +AN +  GLA   +T D+ +A RV   L+ G V IN  
Sbjct: 355 QEEIFGPVVSVMRFRDEEEALRIANGTEYGLAAGVWTRDLGRAHRVAAALKAGTVWINTY 414

Query: 467 IISHTEAAFGGVKESGLGREGSRHGMEEFTQIKYMC 502
                 + FGG K+SG GRE  R  ++E+T++K + 
Sbjct: 415 NRFDPGSPFGGYKQSGFGREMGREALDEYTEVKSVW 450


>gnl|CDD|143417 cd07099, ALDH_DDALDH, Methylomonas sp.
           4,4'-diapolycopene-dialdehyde dehydrogenase-like.  The
           4,4'-diapolycopene-dialdehyde dehydrogenase (DDALDH)
           involved in C30 carotenoid synthesis in Methylomonas sp.
           strain 16a and other similar sequences are present in
           this CD. DDALDH converts 4,4'-diapolycopene-dialdehyde
           into 4,4'-diapolycopene-diacid.
          Length = 453

 Score =  373 bits (959), Expect = e-125
 Identities = 155/458 (33%), Positives = 243/458 (53%), Gaps = 13/458 (2%)

Query: 48  YNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNKQAL 107
            NPA  E L +VP     +V  A+  A+ A   W+    + R   L RW + L  +   L
Sbjct: 1   RNPATGEVLGEVPVTDPAEVAAAVARARAAQRAWAALGVEGRAQRLLRWKRALADHADEL 60

Query: 108 AEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRT--YGELVPSPV-KSKELLMVREPL 164
           AE++  E+GKP  ++  EV+ A   +DW +  A R     ++    +  +K+  +   P 
Sbjct: 61  AELLHAETGKPRADAGLEVLLALEAIDWAARNAPRVLAPRKVPTGLLMPNKKATVEYRPY 120

Query: 165 GVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPKGVLN 224
           GVVG+I+PWN+P+      + PALAAG   V+KP+E TPL    LA     A  P+GVL 
Sbjct: 121 GVVGVISPWNYPLLTPMGDIIPALAAGNAVVLKPSEVTPLVGELLAEAWAAAGPPQGVLQ 180

Query: 225 VVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNAPFIV 284
           VVT         G  L  +  V  ++FTGS   G+ + +  ++ +  + LELGG  P IV
Sbjct: 181 VVTGDGA----TGAALIDAG-VDKVAFTGSVATGRKVMAAAAERLIPVVLELGGKDPMIV 235

Query: 285 YSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQGDTSK 344
            ++A+L++A + A+     N GQ C++  R  + E V+D+FV  L V   ++   G    
Sbjct: 236 LADADLERAAAAAVWGAMVNAGQTCISVERVYVHESVYDEFVARL-VAKARALRPGADDI 294

Query: 345 NDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSKFFAPTILTNITPDML 404
            D  ++GP+    Q++ V   V DA++KGAK  L GG  +  G  F+ PT+LT++  DM 
Sbjct: 295 GD-ADIGPMTTARQLDIVRRHVDDAVAKGAKA-LTGGARSNGGGPFYEPTVLTDVPHDMD 352

Query: 405 CYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQVGMVGIN 464
              EE FGPV   +    E+E++ +AN+S  GL+   F+ D+++A  + ++L+ G V IN
Sbjct: 353 VMREETFGPVLPVMPVADEDEAIALANDSRYGLSASVFSRDLARAEAIARRLEAGAVSIN 412

Query: 465 EGIISH--TEAAFGGVKESGLGREGSRHGMEEFTQIKY 500
           + +++       FGGVK+SG GR     G+ EF + K 
Sbjct: 413 DVLLTAGIPALPFGGVKDSGGGRRHGAEGLREFCRPKA 450


>gnl|CDD|143405 cd07086, ALDH_F7_AASADH-like, NAD+-dependent alpha-aminoadipic
           semialdehyde dehydrogenase and related proteins.  ALDH
           subfamily which includes the NAD+-dependent,
           alpha-aminoadipic semialdehyde dehydrogenase (AASADH,
           EC=1.2.1.31), also known as Antiquitin-1, ALDH7A1,
           ALDH7B or delta-1-piperideine-6-carboxylate
           dehydrogenase (P6CDH), and other similar sequences, such
           as the uncharacterized aldehyde dehydrogenase of
           Candidatus kuenenia AldH (locus CAJ73105).
          Length = 478

 Score =  372 bits (957), Expect = e-124
 Identities = 173/464 (37%), Positives = 247/464 (53%), Gaps = 17/464 (3%)

Query: 33  YINGEWVHS-SKKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGI 91
            I GEWV S  + F   NPAN E +A V     EDV+ A+ +A+EAF++W +     RG 
Sbjct: 2   VIGGEWVGSGGETFTSRNPANGEPIARVFPASPEDVEAAVAAAREAFKEWRKVPAPRRGE 61

Query: 92  VLKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVPSP 151
           ++++  + L   K+AL  +++ E GK   E LGEV       D+    +R  YG  +PS 
Sbjct: 62  IVRQIGEALRKKKEALGRLVSLEMGKILPEGLGEVQEMIDICDYAVGLSRMLYGLTIPSE 121

Query: 152 VKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAA 211
                L+    PLGVVG+IT +NFP+A+     A AL  G T V KP+E TPLTA+A+  
Sbjct: 122 RPGHRLMEQWNPLGVVGVITAFNFPVAVPGWNAAIALVCGNTVVWKPSETTPLTAIAVTK 181

Query: 212 LAHEA----QIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQ 267
           +  E      +P GV+N+VT         G+LL   P V  +SFTGST+VG+ +    ++
Sbjct: 182 ILAEVLEKNGLPPGVVNLVTGGGD----GGELLVHDPRVPLVSFTGSTEVGRRVGETVAR 237

Query: 268 GIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVE 327
              R+ LELGGN   IV  +A+L  AV   + A     GQ C    R ++ E V+D+F+E
Sbjct: 238 RFGRVLLELGGNNAIIVMDDADLDLAVRAVLFAAVGTAGQRCTTTRRLIVHESVYDEFLE 297

Query: 328 MLKVEITKSFIQGDTSKNDKCNL-GPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKL 386
            L V+  K    GD    D+  L GPLIN A VEK  + ++ A S+G  V+ GG ++   
Sbjct: 298 RL-VKAYKQVRIGD--PLDEGTLVGPLINQAAVEKYLNAIEIAKSQGGTVLTGGKRIDGG 354

Query: 387 GSKFF-APTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTND 445
               +  PTI+T +T D     EE F P+   IKF + EE++ + N+   GL+   FT D
Sbjct: 355 EPGNYVEPTIVTGVTDDARIVQEETFAPILYVIKFDSLEEAIAINNDVPQGLSSSIFTED 414

Query: 446 ISQA--WRVGKKLQVGMVGINEGII-SHTEAAFGGVKESGLGRE 486
           + +A  W   K    G+V +N     +    AFGG KE+G GRE
Sbjct: 415 LREAFRWLGPKGSDCGIVNVNIPTSGAEIGGAFGGEKETGGGRE 458


>gnl|CDD|215260 PLN02467, PLN02467, betaine aldehyde dehydrogenase.
          Length = 503

 Score =  372 bits (956), Expect = e-124
 Identities = 189/481 (39%), Positives = 282/481 (58%), Gaps = 18/481 (3%)

Query: 31  QAYINGEWVHSSKK--FPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQ-----WSQ 83
           Q +I GEW         PV NPA +E + D+P    EDV  A+ +A++AF++     W++
Sbjct: 9   QLFIGGEWREPVLGKRIPVVNPATEETIGDIPAATAEDVDAAVEAARKAFKRNKGKDWAR 68

Query: 84  TTPKERGIVLKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVDWFSE--EAR 141
           TT   R   L+    ++   K  LA++ T + GKP  E+  ++       +++++  EA 
Sbjct: 69  TTGAVRAKYLRAIAAKITERKSELAKLETLDCGKPLDEAAWDMDDVAGCFEYYADLAEAL 128

Query: 142 RTYGEL-VPSPVKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAE 200
               +  V  P+++ +  +++EPLGVVGLITPWN+P+ M T KVAPALAAGCT V+KP+E
Sbjct: 129 DAKQKAPVSLPMETFKGYVLKEPLGVVGLITPWNYPLLMATWKVAPALAAGCTAVLKPSE 188

Query: 201 DTPLTALALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKI 260
              +T L LA +  E  +P GVLNVVT   T A   G  L + P V  I+FTGST  G+ 
Sbjct: 189 LASVTCLELADICREVGLPPGVLNVVTGLGTEA---GAPLASHPGVDKIAFTGSTATGRK 245

Query: 261 LYSHCSQGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQ 320
           + +  +Q +K +SLELGG +P IV+ + +L KAV  AM   F   GQ C A +R L+ E+
Sbjct: 246 IMTAAAQMVKPVSLELGGKSPIIVFDDVDLDKAVEWAMFGCFWTNGQICSATSRLLVHER 305

Query: 321 VFDQFVEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGG 380
           +  +F+E L V+  K+    D  +   C LGP+++  Q EKV   +  A S+GA ++ GG
Sbjct: 306 IASEFLEKL-VKWAKNIKISDPLEEG-CRLGPVVSEGQYEKVLKFISTAKSEGATILCGG 363

Query: 381 GKVTKLGSKFF-APTILTNITPDMLCYTEEVFGPVAVCIK-FKTEEESLCVANNSNTGLA 438
            +   L   FF  PTI+T++T  M  + EEVFGPV +C+K F TE+E++ +AN+S+ GLA
Sbjct: 364 KRPEHLKKGFFIEPTIITDVTTSMQIWREEVFGPV-LCVKTFSTEDEAIELANDSHYGLA 422

Query: 439 GYFFTNDISQAWRVGKKLQVGMVGINEGIISHTEAAFGGVKESGLGREGSRHGMEEFTQI 498
           G   +ND+ +  RV +  Q G+V IN       +A +GG+K SG GRE    G+E +  +
Sbjct: 423 GAVISNDLERCERVSEAFQAGIVWINCSQPCFCQAPWGGIKRSGFGRELGEWGLENYLSV 482

Query: 499 K 499
           K
Sbjct: 483 K 483


>gnl|CDD|143413 cd07094, ALDH_F21_LactADH-like, ALDH subfamily: NAD+-dependent,
           lactaldehyde dehydrogenase, ALDH family 21 A1, and
           related proteins.  ALDH subfamily which includes Tortula
           ruralis aldehyde dehydrogenase ALDH21A1 (RNP123), and
           NAD+-dependent, lactaldehyde dehydrogenase (EC=1.2.1.22)
           and like sequences.
          Length = 453

 Score =  369 bits (948), Expect = e-123
 Identities = 171/462 (37%), Positives = 261/462 (56%), Gaps = 16/462 (3%)

Query: 47  VYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNKQA 106
           V+NP + E +  VP     D + A+ +A+   E      P ER  +L+R    L+   + 
Sbjct: 3   VHNPYDGEVIGKVPADDRADAEEALATARAGAENRRALPPHERMAILERAADLLKKRAEE 62

Query: 107 LAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVPSPV----KSKELLMVRE 162
            A+I+  E GKP  ++  EV  A   +   +EEA R  GE +P        ++    +RE
Sbjct: 63  FAKIIACEGGKPIKDARVEVDRAIDTLRLAAEEAERIRGEEIPLDATQGSDNRLAWTIRE 122

Query: 163 PLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPKGV 222
           P+GVV  ITP+NFP+ ++  K+APA+A GC  V+KPA  TPL+AL LA +  EA +P+GV
Sbjct: 123 PVGVVLAITPFNFPLNLVAHKLAPAIATGCPVVLKPASKTPLSALELAKILVEAGVPEGV 182

Query: 223 LNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNAPF 282
           L VVT  R     +G        VA +SFTGS  VG+ L +  + G KR++LELGGNAP 
Sbjct: 183 LQVVTGER---EVLGDAFAADERVAMLSFTGSAAVGEALRA--NAGGKRIALELGGNAPV 237

Query: 283 IVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQGDT 342
           IV  +A+L  A+       F + GQ C++  R  + E+++D+F+E   V   K    GD 
Sbjct: 238 IVDRDADLDAAIEALAKGGFYHAGQVCISVQRIYVHEELYDEFIEAF-VAAVKKLKVGD- 295

Query: 343 SKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSKFFAPTILTNITPD 402
             ++  ++GPLI+    E+V+  V++A+  GA+++ GG    +     F PT+L ++  D
Sbjct: 296 PLDEDTDVGPLISEEAAERVERWVEEAVEAGARLLCGG----ERDGALFKPTVLEDVPRD 351

Query: 403 MLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQVGMVG 462
               TEE FGPV   I++   EE++ +AN+++ GL    FT D++ A++  +KL+VG V 
Sbjct: 352 TKLSTEETFGPVVPIIRYDDFEEAIRIANSTDYGLQAGIFTRDLNVAFKAAEKLEVGGVM 411

Query: 463 INEGIISHTEA-AFGGVKESGLGREGSRHGMEEFTQIKYMCF 503
           +N+     T+   FGGVKESG+GREG  + MEE T+ K +  
Sbjct: 412 VNDSSAFRTDWMPFGGVKESGVGREGVPYAMEEMTEEKTVVI 453


>gnl|CDD|143469 cd07151, ALDH_HBenzADH, NADP+-dependent p-hydroxybenzaldehyde
           dehydrogenase-like.  NADP+-dependent,
           p-hydroxybenzaldehyde dehydrogenase (PchA, HBenzADH)
           which catalyzes oxidation of p-hydroxybenzaldehyde to
           p-hydroxybenzoic acid and other related sequences are
           included in this CD.
          Length = 465

 Score =  368 bits (948), Expect = e-123
 Identities = 185/470 (39%), Positives = 272/470 (57%), Gaps = 15/470 (3%)

Query: 36  GEWV--HSSKKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGIVL 93
           GEW    S +   V NP   E LA++P    EDV  A  +A  A ++W+ T P+ER  +L
Sbjct: 1   GEWRDGTSERTIDVLNPYTGETLAEIPAASKEDVDEAYRAAAAAQKEWAATLPQERAEIL 60

Query: 94  KRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVPSPVK 153
           ++  + LE  +  + E + +ESG    ++  E   A +     +    R  G ++PS V 
Sbjct: 61  EKAAQILEERRDEIVEWLIRESGSTRIKANIEWGAAMAITREAATFPLRMEGRILPSDVP 120

Query: 154 SKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTA-LALAAL 212
            KE  + REPLGVVG+I+PWNFP+ +  R VAPALA G   V+KPA DTP+T  L LA +
Sbjct: 121 GKENRVYREPLGVVGVISPWNFPLHLSMRSVAPALALGNAVVLKPASDTPITGGLLLAKI 180

Query: 213 AHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRL 272
             EA +PKGVLNVV       + +G      P    ISFTGST VG+ +     + +K++
Sbjct: 181 FEEAGLPKGVLNVVV---GAGSEIGDAFVEHPVPRLISFTGSTPVGRHIGELAGRHLKKV 237

Query: 273 SLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVE 332
           +LELGGN PF+V  +A++  AV+ A+  KF + GQ C+A NR ++ E V+D+FVE   VE
Sbjct: 238 ALELGGNNPFVVLEDADIDAAVNAAVFGKFLHQGQICMAINRIIVHEDVYDEFVEKF-VE 296

Query: 333 ITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSKFFA 392
             K+   GD S  D   +GPLIN +QV+ +   ++ A+ +GA +++GG    +       
Sbjct: 297 RVKALPYGDPSDPDTV-VGPLINESQVDGLLDKIEQAVEEGATLLVGG----EAEGNVLE 351

Query: 393 PTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRV 452
           PT+L+++T DM    EE+FGPVA  IK   EEE+L +AN++  GL+G  FT+D+ +  + 
Sbjct: 352 PTVLSDVTNDMEIAREEIFGPVAPIIKADDEEEALELANDTEYGLSGAVFTSDLERGVQF 411

Query: 453 GKKLQVGMVGINEGIISHTE--AAFGGVKESGLGREGSRHGMEEFTQIKY 500
            +++  GM  IN+  + + E    FGG K SGLGR      +EEFT  K+
Sbjct: 412 ARRIDAGMTHINDQPV-NDEPHVPFGGEKNSGLGRFNGEWALEEFTTDKW 460


>gnl|CDD|143423 cd07105, ALDH_SaliADH, Salicylaldehyde dehydrogenase, DoxF-like.
           Salicylaldehyde dehydrogenase (DoxF, SaliADH,
           EC=1.2.1.65) involved in the upper naphthalene catabolic
           pathway of Pseudomonas strain C18 and other similar
           sequences are present in this CD.
          Length = 432

 Score =  366 bits (941), Expect = e-123
 Identities = 169/439 (38%), Positives = 245/439 (55%), Gaps = 16/439 (3%)

Query: 66  DVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNKQALAEIMTQESGKPPTESLGE 125
           D   A+ +A  AF  WS+T P ER  +L +    LE  +    E M +E+G   T +   
Sbjct: 1   DADQAVEAAAAAFPAWSKTPPSERRDILLKAADLLESRRDEFIEAMMEETG--ATAAWAG 58

Query: 126 V---VYANSFVDWFSEEARRTYGELVPSPVKSKELLMVREPLGVVGLITPWNFPMAMITR 182
               + A    +  +    +  G  +PS       ++V+EP+GVV  I PWN P+ + TR
Sbjct: 59  FNVDLAAGMLRE-AASLITQIIGGSIPSDKPGTLAMVVKEPVGVVLGIAPWNAPVILGTR 117

Query: 183 KVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCT 242
            +A  LAAG T V+K +E +P T   +  + HEA +PKGVLNVVT S  +A  V + L  
Sbjct: 118 AIAYPLAAGNTVVLKASELSPRTHWLIGRVFHEAGLPKGVLNVVTHSPEDAPEVVEALIA 177

Query: 243 SPHVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKF 302
            P V  ++FTGST+VG+I+    ++ +K + LELGG AP IV  +A+L  A + A+   F
Sbjct: 178 HPAVRKVNFTGSTRVGRIIAETAAKHLKPVLLELGGKAPAIVLEDADLDAAANAALFGAF 237

Query: 303 RNCGQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKV 362
            N GQ C++  R ++ E + D+FVE LK    K F            LG L++ A  ++V
Sbjct: 238 LNSGQICMSTERIIVHESIADEFVEKLKAAAEKLF-------AGPVVLGSLVSAAAADRV 290

Query: 363 DSIVQDAISKGAKVILGGGKVTKLGSKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKT 422
             +V DA+SKGAK+++GG            PTIL N+TPDM  Y+EE FGPV   I+ K 
Sbjct: 291 KELVDDALSKGAKLVVGGLADESPSGTSMPPTILDNVTPDMDIYSEESFGPVVSIIRVKD 350

Query: 423 EEESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQVGMVGINEGIISHTEAA--FGGVKE 480
           EEE++ +AN+S  GL+   FT D+++A  V K+++ G V IN G+  H E     GGVK 
Sbjct: 351 EEEAVRIANDSEYGLSAAVFTRDLARALAVAKRIESGAVHIN-GMTVHDEPTLPHGGVKS 409

Query: 481 SGLGREGSRHGMEEFTQIK 499
           SG GR   + G++EFT+ K
Sbjct: 410 SGYGRFNGKWGIDEFTETK 428


>gnl|CDD|143411 cd07092, ALDH_ABALDH-YdcW, Escherichia coli NAD+-dependent
           gamma-aminobutyraldehyde dehydrogenase YdcW-like.
           NAD+-dependent, tetrameric, gamma-aminobutyraldehyde
           dehydrogenase (ABALDH), YdcW of Escherichia coli K12,
           catalyzes the oxidation of gamma-aminobutyraldehyde to
           gamma-aminobutyric acid. ABALDH can also oxidize n-alkyl
           medium-chain aldehydes, but with a lower catalytic
           efficiency.
          Length = 450

 Score =  366 bits (942), Expect = e-123
 Identities = 173/457 (37%), Positives = 249/457 (54%), Gaps = 13/457 (2%)

Query: 47  VYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNKQA 106
           V +PA  E +A VPD    DV  A+ +A  AF  W +TTP ER   L +    +E N + 
Sbjct: 1   VVDPATGEEIATVPDASAADVDAAVAAAHAAFPSWRRTTPAERSKALLKLADAIEENAEE 60

Query: 107 LAEIMTQESGKPPTESLGEVVYANSFVD---WFSEEARRTYGELVPSPVKSKELLMVREP 163
           LA + ++ +GKP      + +     VD   +F+  AR   G      +     ++ REP
Sbjct: 61  LAALESRNTGKPLHLVRDDELPGA--VDNFRFFAGAARTLEGPAAGEYLPGHTSMIRREP 118

Query: 164 LGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPKGVL 223
           +GVV  I PWN+P+ M   K+APALAAG T V+KP+E TPLT L LA LA E  +P GV+
Sbjct: 119 IGVVAQIAPWNYPLMMAAWKIAPALAAGNTVVLKPSETTPLTTLLLAELAAEV-LPPGVV 177

Query: 224 NVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNAPFI 283
           NVV        S G  L   P V  +S TGS + GK +    +  +KR+ LELGG AP I
Sbjct: 178 NVVCGGG---ASAGDALVAHPRVRMVSLTGSVRTGKKVARAAADTLKRVHLELGGKAPVI 234

Query: 284 VYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQGDTS 343
           V+ +A+L  AV+G   A + N GQ C AA R  + E V+D+FV  L VE   +   GD  
Sbjct: 235 VFDDADLDAAVAGIATAGYYNAGQDCTAACRVYVHESVYDEFVAAL-VEAVSAIRVGD-P 292

Query: 344 KNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSKFFAPTILTNITPDM 403
            ++   +GPL + AQ E+V   V+ A    A+V+ GG +    G  F+ PT++  +  D 
Sbjct: 293 DDEDTEMGPLNSAAQRERVAGFVERA-PAHARVLTGGRRAEGPGY-FYEPTVVAGVAQDD 350

Query: 404 LCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQVGMVGI 463
               EE+FGPV     F  E+E++ +AN+   GLA   +T D+ +A R+  +L  G V +
Sbjct: 351 EIVQEEIFGPVVTVQPFDDEDEAIELANDVEYGLASSVWTRDVGRAMRLSARLDFGTVWV 410

Query: 464 NEGIISHTEAAFGGVKESGLGREGSRHGMEEFTQIKY 500
           N  I    E   GG K+SG G++ S + +E++T+IK+
Sbjct: 411 NTHIPLAAEMPHGGFKQSGYGKDLSIYALEDYTRIKH 447


>gnl|CDD|143459 cd07141, ALDH_F1AB_F2_RALDH1, NAD+-dependent retinal dehydrogenase
           1, ALDH families 1A, 1B, and 2-like.  NAD+-dependent
           retinal dehydrogenase 1 (RALDH 1, ALDH1, EC=1.2.1.36)
           also known as aldehyde dehydrogenase family 1 member A1
           (ALDH1A1) in humans, is a homotetrameric, cytosolic
           enzyme that catalyzes the oxidation of retinaldehyde to
           retinoic acid. Human ALDH1B1 and ALDH2 are also in this
           cluster; both are mitochrondrial homotetramers which
           play important roles in acetaldehyde oxidation; ALDH1B1
           in response to UV light exposure and ALDH2 during
           ethanol metabolism.
          Length = 481

 Score =  366 bits (942), Expect = e-122
 Identities = 183/479 (38%), Positives = 269/479 (56%), Gaps = 16/479 (3%)

Query: 29  YTQAYINGEWVH--SSKKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQ---WSQ 83
           YT+ +IN EW    S K FP  NPA  E + +V +    DV  A+ +A+ AF+    W  
Sbjct: 6   YTKIFINNEWHDSVSGKTFPTINPATGEKICEVQEGDKADVDKAVKAARAAFKLGSPWRT 65

Query: 84  TTPKERGIVLKRWLKELEGNKQALAEIMTQESGKP-PTESLGEVVYANSFVDWFSEEARR 142
               ERG +L +    +E ++  LA + T ++GKP     L ++  A   + +++  A +
Sbjct: 66  MDASERGRLLNKLADLIERDRAYLASLETLDNGKPFSKSYLVDLPGAIKVLRYYAGWADK 125

Query: 143 TYGELVPSPVKSKELLMVR-EPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAED 201
            +G+ +P  +        R EP+GV G I PWNFP+ M   K+APALA G T V+KPAE 
Sbjct: 126 IHGKTIP--MDGDFFTYTRHEPVGVCGQIIPWNFPLLMAAWKLAPALACGNTVVLKPAEQ 183

Query: 202 TPLTALALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGK-I 260
           TPLTAL LA+L  EA  P GV+NVV      A   G  + + P +  ++FTGST+VGK I
Sbjct: 184 TPLTALYLASLIKEAGFPPGVVNVVPGYGPTA---GAAISSHPDIDKVAFTGSTEVGKLI 240

Query: 261 LYSHCSQGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQ 320
             +     +KR++LELGG +P IV+++A+L  AV  A  A F N GQ C A +R  +QE 
Sbjct: 241 QQAAGKSNLKRVTLELGGKSPNIVFADADLDYAVEQAHEALFFNMGQCCCAGSRTFVQES 300

Query: 321 VFDQFVEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGG 380
           ++D+FV+   VE  K  + G+   + K   GP I+  Q +K+  +++    +GAK+  GG
Sbjct: 301 IYDEFVKRS-VERAKKRVVGNPF-DPKTEQGPQIDEEQFKKILELIESGKKEGAKLECGG 358

Query: 381 GKVTKLGSKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGY 440
            +    G  F  PT+ +++T DM    EE+FGPV    KFKT +E +  ANN+  GLA  
Sbjct: 359 KRHGDKGY-FIQPTVFSDVTDDMRIAKEEIFGPVQQIFKFKTIDEVIERANNTTYGLAAA 417

Query: 441 FFTNDISQAWRVGKKLQVGMVGINEGIISHTEAAFGGVKESGLGREGSRHGMEEFTQIK 499
            FT DI +A      L+ G V +N   +   +A FGG K SG GRE   +G++E+T++K
Sbjct: 418 VFTKDIDKAITFSNALRAGTVWVNCYNVVSPQAPFGGYKMSGNGRELGEYGLQEYTEVK 476


>gnl|CDD|143471 cd07559, ALDH_ACDHII_AcoD-like, Ralstonia eutrophus NAD+-dependent
           acetaldehyde dehydrogenase II and Staphylococcus aureus
           AldA1 (SACOL0154)-like.  Included in this CD is the
           NAD+-dependent, acetaldehyde dehydrogenase II (AcDHII,
           AcoD, EC=1.2.1.3) from Ralstonia (Alcaligenes) eutrophus
           H16 involved in the catabolism of acetoin and ethanol,
           and similar proteins, such as, the dimeric
           dihydrolipoamide dehydrogenase of the acetoin
           dehydrogenase enzyme system of Klebsiella pneumonia.
           Also included are sequences similar to the
           NAD+-dependent chloroacetaldehyde dehydrogenases (AldA
           and AldB) of Xanthobacter autotrophicus GJ10 which are
           involved in the degradation of 1,2-dichloroethane, as
           well as, the uncharacterized aldehyde dehydrogenase from
           Staphylococcus aureus (AldA1, locus SACOL0154) and other
           similar sequences.
          Length = 480

 Score =  364 bits (936), Expect = e-121
 Identities = 176/479 (36%), Positives = 266/479 (55%), Gaps = 20/479 (4%)

Query: 33  YINGEWVHSSKK--FPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERG 90
           +INGEWV  SK   F  YNP N + L ++P    EDV  A+ +A EAF+ W +T+  ER 
Sbjct: 4   FINGEWVAPSKGEYFDNYNPVNGKVLCEIPRSTAEDVDLAVDAAHEAFKTWGKTSVAERA 63

Query: 91  IVLKRWLKELEGNKQALAEIMTQESGKPPTESLG-EVVYANSFVDWFSEEARRTYGELVP 149
            +L +    +E N + LA   T ++GKP  E+L  ++  A     +F+   R   G L  
Sbjct: 64  NILNKIADRIEENLELLAVAETLDNGKPIRETLAADIPLAIDHFRYFAGVIRAQEGSL-- 121

Query: 150 SPVKSKEL-LMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALA 208
           S +    L     EPLGVVG I PWNFP+ M   K+APALAAG T V+KPA  TPL+ L 
Sbjct: 122 SEIDEDTLSYHFHEPLGVVGQIIPWNFPLLMAAWKLAPALAAGNTVVLKPASQTPLSILV 181

Query: 209 LAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQG 268
           L  L  +  +PKGV+NVVT   + A   GK L + P +A ++FTGST VG+++  + ++ 
Sbjct: 182 LMELIGDL-LPKGVVNVVTGFGSEA---GKPLASHPRIAKLAFTGSTTVGRLIMQYAAEN 237

Query: 269 IKRLSLELGGNAPFIVYSNA-----NLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFD 323
           +  ++LELGG +P I + +A     +              N G+ C   +R L+QE ++D
Sbjct: 238 LIPVTLELGGKSPNIFFDDAMDADDDFDDKAEEGQLGFAFNQGEVCTCPSRALVQESIYD 297

Query: 324 QFVEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKV 383
           +F+E   VE  ++   G+   + +  +G  ++  Q+EK+ S V     +GA+V+ GG ++
Sbjct: 298 EFIERA-VERFEAIKVGN-PLDPETMMGAQVSKDQLEKILSYVDIGKEEGAEVLTGGERL 355

Query: 384 TK---LGSKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGY 440
           T        F+ PT++     DM  + EE+FGPV   I FK EEE++ +AN++  GL G 
Sbjct: 356 TLGGLDKGYFYEPTLIKGGNNDMRIFQEEIFGPVLAVITFKDEEEAIAIANDTEYGLGGG 415

Query: 441 FFTNDISQAWRVGKKLQVGMVGINEGIISHTEAAFGGVKESGLGREGSRHGMEEFTQIK 499
            +T DI++A RV + +Q G V +N        A FGG K+SG+GRE  +  ++ + Q K
Sbjct: 416 VWTRDINRALRVARGIQTGRVWVNCYHQYPAHAPFGGYKKSGIGRETHKMMLDHYQQTK 474


>gnl|CDD|200131 TIGR01804, BADH, glycine betaine aldehyde dehydrogenase.  Under
           osmotic stress, betaine aldehyde dehydrogenase oxidizes
           glycine betaine aldehyde into the osmoprotectant glycine
           betaine, via the second of two oxidation steps from
           exogenously supplied choline or betaine aldehyde. This
           choline-glycine betaine synthesis pathway can be found
           in gram-positive and gram-negative bacteria. In
           Escherichia coli, betaine aldehyde dehydrogenase (betB)
           is osmotically co-induced with choline dehydrogenase
           (betA) in the presence of choline. These dehydrogenases
           are located in a betaine gene cluster with the upstream
           choline transporter (betT) and transcriptional regulator
           (betI). Similar to E.coli, betaine synthesis in
           Staphylococcus xylosus is also influenced by osmotic
           stress and the presence of choline with genes localized
           in a functionally equivalent gene cluster. Organization
           of the betaine gene cluster in Sinorhizobium meliloti
           and Bacillus subtilis differs from that of E.coli by the
           absence of upstream choline transporter and
           transcriptional regulator homologues. Additionally,
           B.subtilis co-expresses a type II alcohol dehydrogenase
           with betaine aldehyde dehydrogenase instead of choline
           dehydrogenase as in E.coli, St.xylosus, and Si.meliloti.
           Betaine aldehyde dehydrogenase is a member of the
           aldehyde dehydrogenase family (pfam00171) [Cellular
           processes, Adaptations to atypical conditions].
          Length = 467

 Score =  360 bits (926), Expect = e-120
 Identities = 177/474 (37%), Positives = 269/474 (56%), Gaps = 14/474 (2%)

Query: 33  YINGEWVHSSKK--FPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERG 90
           +I+GE+V  S      + NPAN E +A V     EDV+ AI +A+ A  +W+  +P ERG
Sbjct: 1   FIDGEYVEDSAGTTREIINPANGEVIATVHAATPEDVERAIAAARRAQGEWAAMSPMERG 60

Query: 91  IVLKRWLKELEGNKQALAEIMTQESGKPPTESL--GEVVYANSFVDWFSEEARRTYGELV 148
            +L+R    +    + LA++ T ++GK   E++       A+ F ++F+  A    GE++
Sbjct: 61  RILRRAADLIRERNEELAKLETLDTGKTLQETIVADMDSGADVF-EFFAGLAPALNGEII 119

Query: 149 PSPVKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALA 208
           P    S     +REPLGV   I  WN+P+ + + K+APALAAG   V KP+E+TPLTAL 
Sbjct: 120 PLGGPS-FAYTIREPLGVCVGIGAWNYPLQIASWKIAPALAAGNAMVFKPSENTPLTALK 178

Query: 209 LAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQG 268
           +A +  EA +PKGV NVV     +   VG LL   P VA +SFTG    GK + +  +  
Sbjct: 179 VAEIMEEAGLPKGVFNVVQG---DGAEVGPLLVNHPDVAKVSFTGGVPTGKKIMAAAAGH 235

Query: 269 IKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEM 328
           +K +++ELGG +P IV+ +A+L+ AV GAM   F + GQ C    R  + +++ ++F+  
Sbjct: 236 LKHVTMELGGKSPLIVFDDADLESAVDGAMLGNFFSAGQVCSNGTRVFVHKKIKERFLAR 295

Query: 329 LKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGS 388
           L VE T+    GD        +GPLI+ A  +KV S ++   ++GA +  GGG+   +G 
Sbjct: 296 L-VERTERIKLGDPFDEAT-EMGPLISAAHRDKVLSYIEKGKAEGATLATGGGRPENVGL 353

Query: 389 K---FFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTND 445
           +   F  PT+  + T DM    EE+FGPV   + F  E+E +  AN++  GLAG  FT D
Sbjct: 354 QNGFFVEPTVFADCTDDMTIVREEIFGPVMTVLTFSDEDEVIARANDTEYGLAGGVFTAD 413

Query: 446 ISQAWRVGKKLQVGMVGINEGIISHTEAAFGGVKESGLGREGSRHGMEEFTQIK 499
           + +A RV  +L+ G V IN   +   EA FGG K+SG+GRE  +  +  +T++K
Sbjct: 414 LGRAHRVADQLEAGTVWINTYNLYPAEAPFGGYKQSGIGRENGKAALAHYTEVK 467


>gnl|CDD|143435 cd07117, ALDH_StaphAldA1, Uncharacterized Staphylococcus aureus
           AldA1 (SACOL0154) aldehyde dehydrogenase-like.
           Uncharacterized aldehyde dehydrogenase from
           Staphylococcus aureus (AldA1, locus SACOL0154) and other
           similar sequences are present in this CD.
          Length = 475

 Score =  359 bits (923), Expect = e-119
 Identities = 180/475 (37%), Positives = 276/475 (58%), Gaps = 17/475 (3%)

Query: 33  YINGEWVHSS--KKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERG 90
           +INGEWV  S  +    YNPAN E L+++ D    DV  A+ +A+EAF+ W +TT  ER 
Sbjct: 4   FINGEWVKGSSGETIDSYNPANGETLSEITDATDADVDRAVKAAQEAFKTWRKTTVAERA 63

Query: 91  IVLKRWLKELEGNKQALAEIMTQESGKPPTESLG-EVVYANSFVDWFSEEARRTYGELVP 149
            +L +    ++ NK+ LA + T ++GKP  E+   ++  A     +F+   R   G    
Sbjct: 64  NILNKIADIIDENKELLAMVETLDNGKPIRETRAVDIPLAADHFRYFAGVIRAEEGSAN- 122

Query: 150 SPVKSKELL--MVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTAL 207
             +  ++ L  ++REP+GVVG I PWNFP  M   K+APALAAG T VIKP+  T L+ L
Sbjct: 123 --MIDEDTLSIVLREPIGVVGQIIPWNFPFLMAAWKLAPALAAGNTVVIKPSSTTSLSLL 180

Query: 208 ALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQ 267
            LA +  +  +PKGV+N+VT   + +   G+ L   P +  ++FTGST+VG+ +    ++
Sbjct: 181 ELAKIIQDV-LPKGVVNIVTGKGSKS---GEYLLNHPGLDKLAFTGSTEVGRDVAIAAAK 236

Query: 268 GIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVE 327
            +   +LELGG +  I++ +AN  KA+ GA      N GQ C A +R  +QE ++D+FV 
Sbjct: 237 KLIPATLELGGKSANIIFDDANWDKALEGAQLGILFNQGQVCCAGSRIFVQEGIYDEFVA 296

Query: 328 MLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLG 387
            LK +     ++     +    +G  +N  Q++K+ S V  A  +GAK++ GG ++T+ G
Sbjct: 297 KLKEKFEN--VKVGNPLDPDTQMGAQVNKDQLDKILSYVDIAKEEGAKILTGGHRLTENG 354

Query: 388 ---SKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTN 444
                F  PT++ N+T DM    EE+FGPVA  IKFKTE+E + +AN+S  GL G  FT 
Sbjct: 355 LDKGFFIEPTLIVNVTNDMRVAQEEIFGPVATVIKFKTEDEVIDMANDSEYGLGGGVFTK 414

Query: 445 DISQAWRVGKKLQVGMVGINEGIISHTEAAFGGVKESGLGREGSRHGMEEFTQIK 499
           DI++A RV + ++ G V +N        A FGG K+SG+GRE  +  ++ +TQ+K
Sbjct: 415 DINRALRVARAVETGRVWVNTYNQIPAGAPFGGYKKSGIGRETHKSMLDAYTQMK 469


>gnl|CDD|143465 cd07147, ALDH_F21_RNP123, Aldehyde dehydrogenase family 21A1-like. 
           Aldehyde dehydrogenase ALDH21A1 (gene name RNP123) was
           first described in the moss Tortula ruralis and is
           believed to play an important role in the detoxification
           of aldehydes generated in response to desiccation- and
           salinity-stress, and ALDH21A1 expression represents a
           unique stress tolerance mechanism. So far, of plants,
           only the bryophyte sequence has been observed, but
           similar protein sequences from bacteria and archaea are
           also present in this CD.
          Length = 452

 Score =  355 bits (914), Expect = e-118
 Identities = 175/465 (37%), Positives = 255/465 (54%), Gaps = 19/465 (4%)

Query: 45  FPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNK 104
             V NP   E +A V   G +D++ AI +A +AF          R  +L   +  LE   
Sbjct: 1   LEVTNPYTGEVVARVALAGPDDIEEAIAAAVKAFRPMRALPAHRRAAILLHCVARLEERF 60

Query: 105 QALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVP---SPVKSKELLMVR 161
           + LAE +  E+GKP  ++ GEV  A       +EEA R YGE++P   S        +VR
Sbjct: 61  EELAETIVLEAGKPIKDARGEVARAIDTFRIAAEEATRIYGEVLPLDISARGEGRQGLVR 120

Query: 162 E-PLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPK 220
             P+G V  ITP+NFP+ ++  KVAPA+AAGC  V+KPA  TPL+AL L  +  E  +PK
Sbjct: 121 RFPIGPVSAITPFNFPLNLVAHKVAPAIAAGCPFVLKPASRTPLSALILGEVLAETGLPK 180

Query: 221 GVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNA 280
           G  +V+  SR +A+    LL T   +  +SFTGS  VG  L +    G K++ LELGGNA
Sbjct: 181 GAFSVLPCSRDDAD----LLVTDERIKLLSFTGSPAVGWDLKARA--GKKKVVLELGGNA 234

Query: 281 PFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQG 340
             IV S+A+L  A    +   F   GQ+C++  R L+   V+D+F   L V   K+   G
Sbjct: 235 AVIVDSDADLDFAAQRIIFGAFYQAGQSCISVQRVLVHRSVYDEFKSRL-VARVKALKTG 293

Query: 341 DTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSKFFAPTILTNIT 400
           D  K+D  ++GP+I+ ++ E+V+  V +A+  GAK++ GG    K       PTIL ++ 
Sbjct: 294 D-PKDDATDVGPMISESEAERVEGWVNEAVDAGAKLLTGG----KRDGALLEPTILEDVP 348

Query: 401 PDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQVGM 460
           PDM    EEVFGPV     +   +E+L   N+S  GL    FT D+ +A R   +L+VG 
Sbjct: 349 PDMEVNCEEVFGPVVTVEPYDDFDEALAAVNDSKFGLQAGVFTRDLEKALRAWDELEVGG 408

Query: 461 VGINEGIISH--TEAAFGGVKESGLGREGSRHGMEEFTQIKYMCF 503
           V IN+ + +       +GGVK+SG+GREG R+ +EE T+ + +  
Sbjct: 409 VVIND-VPTFRVDHMPYGGVKDSGIGREGVRYAIEEMTEPRLLVI 452


>gnl|CDD|143460 cd07142, ALDH_F2BC, Arabidosis aldehyde dehydrogenase family 2 B4,
           B7, C4-like.  Included in this CD is the Arabidosis
           aldehyde dehydrogenase family 2 members B4 and B7
           (EC=1.2.1.3),  which are mitochondrial homotetramers
           that oxidize acetaldehyde and glycolaldehyde, but not
           L-lactaldehyde. Also in this group, is the Arabidosis
           cytosolic, homotetramer ALDH2C4 (EC=1.2.1.3), an enzyme
           involved in the oxidation of sinapalehyde and
           coniferaldehyde.
          Length = 476

 Score =  354 bits (909), Expect = e-117
 Identities = 183/481 (38%), Positives = 272/481 (56%), Gaps = 21/481 (4%)

Query: 29  YTQAYINGEWVH--SSKKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQ--WSQT 84
           +T+ +ING++V   S K FP  +P N E +A V +   EDV  A+ +A++AF++  W + 
Sbjct: 3   HTKLFINGQFVDAASGKTFPTIDPRNGEVIAHVAEGDAEDVDRAVKAARKAFDEGPWPRM 62

Query: 85  TPKERGIVLKRWLKELEGNKQALAEIMTQESGKPPTES-LGEVVYANSFVDWFSEEARRT 143
           T  ER  +L R+   LE +   LA + T ++GKP  ++   EV  A     +++  A + 
Sbjct: 63  TGYERSRILLRFADLLEKHADELAALETWDNGKPYEQARYAEVPLAARLFRYYAGWADKI 122

Query: 144 YGELVPS--PVKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAED 201
           +G  +P+  P     L    EP+GVVG I PWNFP+ M   KV PALA G T V+KPAE 
Sbjct: 123 HGMTLPADGPHHVYTL---HEPIGVVGQIIPWNFPLLMFAWKVGPALACGNTIVLKPAEQ 179

Query: 202 TPLTALALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKIL 261
           TPL+AL  A LA EA +P GVLN+VT       + G  + +   V  ++FTGST+VGKI+
Sbjct: 180 TPLSALLAAKLAAEAGLPDGVLNIVTGF---GPTAGAAIASHMDVDKVAFTGSTEVGKII 236

Query: 262 YSHCSQG-IKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQ 320
               ++  +K ++LELGG +PFIV  +A++ KAV  A  A F N GQ C A +R  + E 
Sbjct: 237 MQLAAKSNLKPVTLELGGKSPFIVCEDADVDKAVELAHFALFFNQGQCCCAGSRTFVHES 296

Query: 321 VFDQFVEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGG 380
           ++D+FVE  K    K  + GD  +      GP ++  Q EK+ S ++    +GA +I GG
Sbjct: 297 IYDEFVEKAKARALKRVV-GDPFRK-GVEQGPQVDKEQFEKILSYIEHGKEEGATLITGG 354

Query: 381 GKVTKLGSK--FFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLA 438
            ++   GSK  +  PTI +++  DM    +E+FGPV   +KFKT +E +  ANNS  GLA
Sbjct: 355 DRI---GSKGYYIQPTIFSDVKDDMKIARDEIFGPVQSILKFKTVDEVIKRANNSKYGLA 411

Query: 439 GYFFTNDISQAWRVGKKLQVGMVGINEGIISHTEAAFGGVKESGLGREGSRHGMEEFTQI 498
              F+ +I  A  + + L+ G V +N   +      FGG K SG+GRE   + +  + Q+
Sbjct: 412 AGVFSKNIDTANTLSRALKAGTVWVNCYDVFDASIPFGGYKMSGIGREKGIYALNNYLQV 471

Query: 499 K 499
           K
Sbjct: 472 K 472


>gnl|CDD|143401 cd07082, ALDH_F11_NP-GAPDH, NADP+-dependent non-phosphorylating
           glyceraldehyde 3-phosphate dehydrogenase and ALDH family
           11.  NADP+-dependent non-phosphorylating glyceraldehyde
           3-phosphate dehydrogenase (NP-GAPDH, EC=1.2.1.9)
           catalyzes the irreversible oxidation of glyceraldehyde
           3-phosphate to 3-phosphoglycerate generating NADPH for
           biosynthetic reactions.  This CD also includes the
           Arabidopsis thaliana osmotic-stress-inducible ALDH
           family 11, ALDH11A3  and similar sequences. In
           autotrophic eukaryotes, NP-GAPDH generates NADPH for
           biosynthetic processes from photosynthetic
           glyceraldehyde-3-phosphate exported from the chloroplast
           and catalyzes one of the classic glycolytic bypass
           reactions unique to plants.
          Length = 473

 Score =  348 bits (895), Expect = e-115
 Identities = 173/479 (36%), Positives = 248/479 (51%), Gaps = 23/479 (4%)

Query: 31  QAYINGEWVHSSKKF-PVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTP-KE 88
           +  INGEW  SS K   VY+P + E +  VP +   ++  A  +A +A   W  T P +E
Sbjct: 3   KYLINGEWKESSGKTIEVYSPIDGEVIGSVPALSALEILEAAETAYDAGRGWWPTMPLEE 62

Query: 89  RGIVLKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELV 148
           R   L ++   L+ NK+ +A ++  E GK   ++L EV     ++    EE +R  G+ +
Sbjct: 63  RIDCLHKFADLLKENKEEVANLLMWEIGKTLKDALKEVDRTIDYIRDTIEELKRLDGDSL 122

Query: 149 PSP----VKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPL 204
           P       K K   + REPLGVV  I P+N+P+ +   K+ PAL  G T V KPA    L
Sbjct: 123 PGDWFPGTKGKIAQVRREPLGVVLAIGPFNYPLNLTVSKLIPALIMGNTVVFKPATQGVL 182

Query: 205 TALALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSH 264
             + LA   H+A  PKGV+NVVT        +G  L T   +  ISFTGST+VG  L   
Sbjct: 183 LGIPLAEAFHDAGFPKGVVNVVT---GRGREIGDPLVTHGRIDVISFTGSTEVGNRLKK- 238

Query: 265 CSQGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQ 324
               +KRL LELGG  P IV  +A+L+ A    +       GQ C A  R L+ E V D+
Sbjct: 239 -QHPMKRLVLELGGKDPAIVLPDADLELAAKEIVKGALSYSGQRCTAIKRVLVHESVADE 297

Query: 325 FVEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVT 384
            VE+LK E+ K  + G    ++  ++ PLI+    + V+ ++ DA++KGA V+ GGG+  
Sbjct: 298 LVELLKEEVAKLKV-GM-PWDNGVDITPLIDPKSADFVEGLIDDAVAKGATVLNGGGREG 355

Query: 385 KLGSKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTN 444
                   PT+L  +TPDM    EE FGPV   I+    EE++ +AN SN GL    FT 
Sbjct: 356 G---NLIYPTLLDPVTPDMRLAWEEPFGPVLPIIRVNDIEEAIELANKSNYGLQASIFTK 412

Query: 445 DISQAWRVGKKLQVGMVGINE----GIISHTEAAFGGVKESGLGREGSRHGMEEFTQIK 499
           DI++A ++   L+VG V IN     G        F G K+SG+G +G    +   T+ K
Sbjct: 413 DINKARKLADALEVGTVNINSKCQRGPDHF---PFLGRKDSGIGTQGIGDALRSMTRRK 468


>gnl|CDD|143429 cd07111, ALDH_F16, Aldehyde dehydrogenase family 16A1-like.
           Uncharacterized aldehyde dehydrogenase family 16 member
           A1 (ALDH16A1) and other related sequences are present in
           this CD. The active site cysteine and glutamate residues
           are not conserved in the human ALDH16A1 protein
           sequence.
          Length = 480

 Score =  345 bits (887), Expect = e-114
 Identities = 172/466 (36%), Positives = 258/466 (55%), Gaps = 21/466 (4%)

Query: 33  YINGEWV--HSSKKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERG 90
           +ING+WV   + K FP  NPA  E LA V     EDV  A+ +A+ AFE WS      R 
Sbjct: 25  FINGKWVKPENRKSFPTINPATGEVLASVLQAEEEDVDAAVAAARTAFESWSALPGHVRA 84

Query: 91  IVLKRWLKELEGNKQALAEIMTQESGKPPTESL-GEVVYANSFVDWFSEEARRTYGELVP 149
             L R  + ++ +++  A + + ++GKP  ES   ++          +  A+        
Sbjct: 85  RHLYRIARHIQKHQRLFAVLESLDNGKPIRESRDCDIPLVARHFYHHAGWAQ-------- 136

Query: 150 SPVKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALAL 209
                   L   +P+GVVG I PWNFP+ M+  K+ PALA G T V+KPAE TPLTAL  
Sbjct: 137 ---LLDTELAGWKPVGVVGQIVPWNFPLLMLAWKICPALAMGNTVVLKPAEYTPLTALLF 193

Query: 210 AALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGI 269
           A +  EA +P GVLN+VT       S G  L   P V  ++FTGST+VG+ L    +   
Sbjct: 194 AEICAEAGLPPGVLNIVTG----NGSFGSALANHPGVDKVAFTGSTEVGRALRRATAGTG 249

Query: 270 KRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEML 329
           K+LSLELGG +PFIV+ +A+L  AV G + A + N GQ C A +R L+QE V ++ +  L
Sbjct: 250 KKLSLELGGKSPFIVFDDADLDSAVEGIVDAIWFNQGQVCCAGSRLLVQESVAEELIRKL 309

Query: 330 KVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSK 389
           K  ++   +     K    ++G +++ AQ++++  +V++  ++GA V   G  +   G  
Sbjct: 310 KERMSHLRVGDPLDKA--IDMGAIVDPAQLKRIRELVEEGRAEGADVFQPGADLPSKGP- 366

Query: 390 FFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQA 449
           F+ PT+ TN+ P      EE+FGPV V + F+T +E++ +ANN+  GLA   ++ ++S A
Sbjct: 367 FYPPTLFTNVPPASRIAQEEIFGPVLVVLTFRTAKEAVALANNTPYGLAASVWSENLSLA 426

Query: 450 WRVGKKLQVGMVGINEGIISHTEAAFGGVKESGLGREGSRHGMEEF 495
             V   L+ G+V IN   +    A FGG +ESG GREG + G+ E+
Sbjct: 427 LEVALSLKAGVVWINGHNLFDAAAGFGGYRESGFGREGGKEGLYEY 472


>gnl|CDD|183918 PRK13252, PRK13252, betaine aldehyde dehydrogenase; Provisional.
          Length = 488

 Score =  345 bits (887), Expect = e-114
 Identities = 175/482 (36%), Positives = 253/482 (52%), Gaps = 27/482 (5%)

Query: 31  QAYINGEWVH--SSKKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKE 88
             YI+G +V   S + F V NPA  E LA V      DV+ A+ SAK+  + W+  T  E
Sbjct: 8   SLYIDGAYVEATSGETFEVINPATGEVLATVQAATPADVEAAVASAKQGQKIWAAMTAME 67

Query: 89  RGIVLKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVDWFS-EEARRTYGEL 147
           R  +L+R +  L      LA + T ++GKP  E+        S VD  +  +    Y  L
Sbjct: 68  RSRILRRAVDILRERNDELAALETLDTGKPIQET--------SVVDIVTGADVLEYYAGL 119

Query: 148 VPS------PVKSKELL-MVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAE 200
            P+      P++    +   REPLGV   I  WN+P+ +   K APALAAG   + KP+E
Sbjct: 120 APALEGEQIPLRGGSFVYTRREPLGVCAGIGAWNYPIQIACWKSAPALAAGNAMIFKPSE 179

Query: 201 DTPLTALALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKI 260
            TPLTAL LA +  EA +P GV NVV         VG  L   P +A +SFTG    GK 
Sbjct: 180 VTPLTALKLAEIYTEAGLPDGVFNVVQGDG----RVGAWLTEHPDIAKVSFTGGVPTGKK 235

Query: 261 LYSHCSQGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQ 320
           + +  +  +K +++ELGG +P IV+ +A+L +A   AM A F + GQ C    R  +Q+ 
Sbjct: 236 VMAAAAASLKEVTMELGGKSPLIVFDDADLDRAADIAMLANFYSSGQVCTNGTRVFVQKS 295

Query: 321 VFDQFVEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGG 380
           +   F   L  E  +    GD   +   N GPL++ A  +KV   ++   ++GA+++ GG
Sbjct: 296 IKAAFEARLL-ERVERIRIGDPM-DPATNFGPLVSFAHRDKVLGYIEKGKAEGARLLCGG 353

Query: 381 GKVTKLGSK---FFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGL 437
            ++T+ G     F APT+ T+ T DM    EE+FGPV   + F  E+E +  AN++  GL
Sbjct: 354 ERLTEGGFANGAFVAPTVFTDCTDDMTIVREEIFGPVMSVLTFDDEDEVIARANDTEYGL 413

Query: 438 AGYFFTNDISQAWRVGKKLQVGMVGINEGIISHTEAAFGGVKESGLGREGSRHGMEEFTQ 497
           A   FT D+S+A RV  +L+ G+  IN    S  E   GG K+SG+GRE     +E +TQ
Sbjct: 414 AAGVFTADLSRAHRVIHQLEAGICWINTWGESPAEMPVGGYKQSGIGRENGIATLEHYTQ 473

Query: 498 IK 499
           IK
Sbjct: 474 IK 475


>gnl|CDD|143464 cd07146, ALDH_PhpJ, Streptomyces putative phosphonoformaldehyde
           dehydrogenase PhpJ-like.  Putative phosphonoformaldehyde
           dehydrogenase (PhpJ), an aldehyde dehydrogenase homolog
           reportedly involved in the biosynthesis of
           phosphinothricin tripeptides in Streptomyces
           viridochromogenes DSM 40736, and similar sequences are
           included in this CD.
          Length = 451

 Score =  342 bits (879), Expect = e-113
 Identities = 167/459 (36%), Positives = 260/459 (56%), Gaps = 20/459 (4%)

Query: 47  VYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNKQA 106
           V NP   E +  VP    E ++ A+  A       S  T  +R  +L +    LE  ++ 
Sbjct: 3   VRNPYTGEVVGTVPAGTEEALREALALA---ASYRSTLTRYQRSAILNKAAALLEARREE 59

Query: 107 LAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVPSPV----KSKELLMVRE 162
            A ++T ESG    ++  EV  A   + + + EA R  GE     +    K++++  +RE
Sbjct: 60  FARLITLESGLCLKDTRYEVGRAADVLRFAAAEALRDDGESFSCDLTANGKARKIFTLRE 119

Query: 163 PLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPKGV 222
           PLGVV  ITP+N P+  +  K+APA+AA    V+KP+E TPL+A+ LA L +EA +P  +
Sbjct: 120 PLGVVLAITPFNHPLNQVAHKIAPAIAANNRIVLKPSEKTPLSAIYLADLLYEAGLPPDM 179

Query: 223 LNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNAPF 282
           L+VVT        +G  L T P V  ++FTG   VGK +    + G KR  LELGGN P 
Sbjct: 180 LSVVTG---EPGEIGDELITHPDVDLVTFTGGVAVGKAI--AATAGYKRQLLELGGNDPL 234

Query: 283 IVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQGDT 342
           IV  +A+L++A + A+A  + N GQ C A  R L+ E V D+FV++L VE + + + GD 
Sbjct: 235 IVMDDADLERAATLAVAGSYANSGQRCTAVKRILVHESVADEFVDLL-VEKSAALVVGDP 293

Query: 343 SKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSKFFAPTILTNITPD 402
                 ++G +I+     ++++ V++AI++GA+V+LG  +        +APT+L ++ PD
Sbjct: 294 MDPAT-DMGTVIDEEAAIQIENRVEEAIAQGARVLLGNQRQ----GALYAPTVLDHVPPD 348

Query: 403 MLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQVGMVG 462
               TEE FGPVA  I+ K  +E++ ++N++  GL+    TND+    R+ ++L VG V 
Sbjct: 349 AELVTEETFGPVAPVIRVKDLDEAIAISNSTAYGLSSGVCTNDLDTIKRLVERLDVGTVN 408

Query: 463 INEGIISHTE-AAFGGVKESGLG-REGSRHGMEEFTQIK 499
           +NE     +E + FGGVK+SGLG +EG R  M+E T +K
Sbjct: 409 VNEVPGFRSELSPFGGVKDSGLGGKEGVREAMKEMTNVK 447


>gnl|CDD|143470 cd07152, ALDH_BenzADH, NAD-dependent benzaldehyde dehydrogenase
           II-like.  NAD-dependent, benzaldehyde dehydrogenase II
           (XylC, BenzADH, EC=1.2.1.28) is involved in the
           oxidation of benzyl alcohol to benzoate. In
           Acinetobacter calcoaceticus, this process is carried out
           by the chromosomally encoded, benzyl alcohol
           dehydrogenase (xylB) and benzaldehyde dehydrogenase II
           (xylC) enzymes; whereas in Pseudomonas putida they are
           encoded by TOL plasmids.
          Length = 443

 Score =  338 bits (868), Expect = e-112
 Identities = 160/436 (36%), Positives = 239/436 (54%), Gaps = 14/436 (3%)

Query: 65  EDVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNKQALAEIMTQESGKPPTESLG 124
            DV  A   A  A   W+ T P+ER  VL+R    LE +   +A+ + +ESG    ++  
Sbjct: 13  ADVDRAAARAAAAQRAWAATPPRERAAVLRRAADLLEEHADEIADWIVRESGSIRPKAGF 72

Query: 125 EVVYANSFVDWFSEEARRTYGELVPSPVKSKELLMVREPLGVVGLITPWNFPMAMITRKV 184
           EV  A   +   +    +  GE++PS    +  L  R PLGVVG+I+P+NFP+ +  R V
Sbjct: 73  EVGAAIGELHEAAGLPTQPQGEILPSA-PGRLSLARRVPLGVVGVISPFNFPLILAMRSV 131

Query: 185 APALAAGCTCVIKPAEDTPLTA-LALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTS 243
           APALA G   V+KP   TP++  + +A L  EA +P GVL+V+          G+ L   
Sbjct: 132 APALALGNAVVLKPDPRTPVSGGVVIARLFEEAGLPAGVLHVLPG----GADAGEALVED 187

Query: 244 PHVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFR 303
           P+VA ISFTGST VG+ +     + +K++SLELGG    IV  +A+L  A S      F 
Sbjct: 188 PNVAMISFTGSTAVGRKVGEAAGRHLKKVSLELGGKNALIVLDDADLDLAASNGAWGAFL 247

Query: 304 NCGQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVD 363
           + GQ C+AA R L+ E V D +   L     K    GD +   +  LGPLIN  Q+++V 
Sbjct: 248 HQGQICMAAGRHLVHESVADAYTAKL-AAKAKHLPVGDPAT-GQVALGPLINARQLDRVH 305

Query: 364 SIVQDAISKGAKVILGGGKVTKLGSKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTE 423
           +IV D+++ GA++  GG   T  G  F+ PT+L+ + P M  + EE+FGPVA    F ++
Sbjct: 306 AIVDDSVAAGARLEAGG---TYDG-LFYRPTVLSGVKPGMPAFDEEIFGPVAPVTVFDSD 361

Query: 424 EESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQVGMVGINEGIISH-TEAAFGGVKESG 482
           EE++ +AN++  GL+    + D+ +A  +  +L+ GM+ IN+  ++      FGG+  SG
Sbjct: 362 EEAVALANDTEYGLSAGIISRDVGRAMALADRLRTGMLHINDQTVNDEPHNPFGGMGASG 421

Query: 483 LG-REGSRHGMEEFTQ 497
            G R G     EEFTQ
Sbjct: 422 NGSRFGGPANWEEFTQ 437


>gnl|CDD|143461 cd07143, ALDH_AldA_AN0554, Aspergillus nidulans aldehyde
           dehydrogenase, AldA (AN0554)-like.  NAD(P)+-dependent
           aldehyde dehydrogenase (AldA) of Aspergillus nidulans
           (locus AN0554), and other similar sequences, are present
           in this CD.
          Length = 481

 Score =  339 bits (870), Expect = e-111
 Identities = 179/474 (37%), Positives = 264/474 (55%), Gaps = 15/474 (3%)

Query: 33  YINGEWVHSSKK--FPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFE-QWS-QTTPKE 88
           +INGE+V S       VYNP+  + +  + +    DV  A+  A  AFE  W  + +  +
Sbjct: 10  FINGEFVDSVHGGTVKVYNPSTGKLITKIAEATEADVDIAVEVAHAAFETDWGLKVSGSK 69

Query: 89  RGIVLKRWLKELEGNKQALAEIMTQESGKP-PTESLGEVVYANSFVDWFSEEARRTYGEL 147
           RG  L +    +E N   LA I   ++GK   T    +V  +     ++   A + +G++
Sbjct: 70  RGRCLSKLADLMERNLDYLASIEALDNGKTFGTAKRVDVQASADTFRYYGGWADKIHGQV 129

Query: 148 VPSPVKSKELLMVR-EPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTA 206
           + +    K+L   R EP+GV G I PWNFP+ M   K+APALAAG T V+KP+E TPL+A
Sbjct: 130 IET--DIKKLTYTRHEPIGVCGQIIPWNFPLLMCAWKIAPALAAGNTIVLKPSELTPLSA 187

Query: 207 LALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVG-KILYSHC 265
           L +  L  EA  P GV+NVV+       + G  + +   +  ++FTGST VG K++ +  
Sbjct: 188 LYMTKLIPEAGFPPGVINVVSG---YGRTCGNAISSHMDIDKVAFTGSTLVGRKVMEAAA 244

Query: 266 SQGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQF 325
              +K+++LELGG +P IV+ +A+L+ AV       F N GQ C A +R  +QE ++D+F
Sbjct: 245 KSNLKKVTLELGGKSPNIVFDDADLESAVVWTAYGIFFNHGQVCCAGSRIYVQEGIYDKF 304

Query: 326 VEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTK 385
           V+  K E  K    GD    D    GP ++  Q E++ S ++   ++GA V  GG +   
Sbjct: 305 VKRFK-EKAKKLKVGDPFAEDTFQ-GPQVSQIQYERIMSYIESGKAEGATVETGGKRHGN 362

Query: 386 LGSKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTND 445
            G  F  PTI T++T DM    EE+FGPV   IKFKTEEE++  AN+S  GLA   FTN+
Sbjct: 363 EGY-FIEPTIFTDVTEDMKIVKEEIFGPVVAVIKFKTEEEAIKRANDSTYGLAAAVFTNN 421

Query: 446 ISQAWRVGKKLQVGMVGINEGIISHTEAAFGGVKESGLGREGSRHGMEEFTQIK 499
           I+ A RV   L+ G V +N   + H +  FGG K+SG+GRE   + +E +TQIK
Sbjct: 422 INNAIRVANALKAGTVWVNCYNLLHHQVPFGGYKQSGIGRELGEYALENYTQIK 475


>gnl|CDD|143426 cd07108, ALDH_MGR_2402, Magnetospirillum NAD(P)+-dependent aldehyde
           dehydrogenase MSR-1-like.  NAD(P)+-dependent aldehyde
           dehydrogenase of Magnetospirillum gryphiswaldense MSR-1
           (MGR_2402) , and other similar sequences, are present in
           this CD.
          Length = 457

 Score =  337 bits (865), Expect = e-111
 Identities = 177/460 (38%), Positives = 252/460 (54%), Gaps = 14/460 (3%)

Query: 47  VYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNKQA 106
           V NPA  + + +VP     DV  A+ +AK AF +W+ T  +ERG +L R    LE   + 
Sbjct: 1   VINPATGQVIGEVPRSRAADVDRAVAAAKAAFPEWAATPARERGKLLARIADALEARSEE 60

Query: 107 LAEIMTQESGKP-PTESLGEVVYANSFVDWFSEEARRTYGELVPSPVKSKELLMVREPLG 165
           LA ++  E+G    T++  E         +F   A    GE +P          VREPLG
Sbjct: 61  LARLLALETGNALRTQARPEAAVLADLFRYFGGLAGELKGETLP-FGPDVLTYTVREPLG 119

Query: 166 VVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPKGVLNV 225
           VVG I PWN P+ +   K+APAL AG T V+K AED PL  L LA +  +  +P GVLNV
Sbjct: 120 VVGAILPWNAPLMLAALKIAPALVAGNTVVLKAAEDAPLAVLLLAEILAQV-LPAGVLNV 178

Query: 226 VTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNAPFIVY 285
           +T         G  L   P V  ++FTGST+VGKI+Y   +  +  +SLELGG +P IV+
Sbjct: 179 ITGY---GEECGAALVDHPDVDKVTFTGSTEVGKIIYRAAADRLIPVSLELGGKSPMIVF 235

Query: 286 SNANLQKAVSGAMAA-KFRNCGQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQGDTSK 344
            +A+L  AV GA+A  +F   GQ+C A +R  + E ++D F+E L  +++K  I GD   
Sbjct: 236 PDADLDDAVDGAIAGMRFTRQGQSCTAGSRLFVHEDIYDAFLEKLVAKLSKLKI-GD-PL 293

Query: 345 NDKCNLGPLINLAQVEKVDSIVQDAIS-KGAKVILGGG--KVTKLGSKFF-APTILTNIT 400
           ++  ++G +I+  Q  KV   +   +S  GA V+ GG       L   FF  PTI + + 
Sbjct: 294 DEATDIGAIISEKQFAKVCGYIDLGLSTSGATVLRGGPLPGEGPLADGFFVQPTIFSGVD 353

Query: 401 PDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQVGM 460
            +     EE+FGPV   I +K E+E + +AN+S+ GLA Y +T D+ +A R    L+ G 
Sbjct: 354 NEWRLAREEIFGPVLCAIPWKDEDEVIAMANDSHYGLAAYVWTRDLGRALRAAHALEAGW 413

Query: 461 VGINEGIISHTEAAFGGVKESGLGREGSRHGM-EEFTQIK 499
           V +N+G       ++GG K+SGLGRE S  GM E FTQ K
Sbjct: 414 VQVNQGGGQQPGQSYGGFKQSGLGREASLEGMLEHFTQKK 453


>gnl|CDD|131352 TIGR02299, HpaE, 5-carboxymethyl-2-hydroxymuconate semialdehyde
           dehydrogenase.  This model represents the dehydrogenase
           responsible for the conversion of
           5-carboxymethyl-2-hydroxymuconate semialdehyde to
           5-carboxymethyl-2-hydroxymuconate (a tricarboxylic
           acid). This is the step in the degradation of
           4-hydroxyphenylacetic acid via homoprotocatechuate
           following the oxidative opening of the aromatic ring.
          Length = 488

 Score =  335 bits (861), Expect = e-110
 Identities = 173/479 (36%), Positives = 259/479 (54%), Gaps = 16/479 (3%)

Query: 33  YINGEWVHSSKK--FPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERG 90
           +I+GE+V S     F   +PA +E L  V   G  DV  A  +AKEAF++W++    ER 
Sbjct: 4   FIDGEFVPSESGETFETLSPATNEVLGSVARGGAADVDRAAKAAKEAFKRWAELKAAERK 63

Query: 91  IVLKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEA-RRTYGELVP 149
             L +    +E +   +A +   + G+P  ++  +V+ A     +F+++      G   P
Sbjct: 64  RYLHKIADLIEQHADEIAVLECLDCGQPLRQTRQQVIRAAENFRFFADKCEEAMDGRTYP 123

Query: 150 SPVKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALAL 209
                     VR P+G VGLITPWN P  + T K+APALA G T V+KPAE +PLTA  L
Sbjct: 124 VDTHLN--YTVRVPVGPVGLITPWNAPFMLSTWKIAPALAFGNTVVLKPAEWSPLTAARL 181

Query: 210 AALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGI 269
           A +A EA +P GV N+V      A   GK L   P V  +SFTG T  G I+  + +  +
Sbjct: 182 AEIAKEAGLPDGVFNLVHGFGEEA---GKALVAHPDVKAVSFTGETATGSIIMRNGADTL 238

Query: 270 KRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEML 329
           KR S+ELGG +P IV+ +A+L++A+   +   F   G+ C A++R L+QE + + FVE L
Sbjct: 239 KRFSMELGGKSPVIVFDDADLERALDAVVFMIFSFNGERCTASSRLLVQESIAEDFVEKL 298

Query: 330 KVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSK 389
             E  ++   G     +   +GPLI+   + KV   V+ A  +GA +++GG +      +
Sbjct: 299 V-ERVRAIRVGHPLDPET-EVGPLIHPEHLAKVLGYVEAAEKEGATILVGGERAPTFRGE 356

Query: 390 ------FFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFT 443
                 +  PT+ T     M    EE+FGPV   I FK EEE++  AN++  GLAGY +T
Sbjct: 357 DLGRGNYVLPTVFTGADNHMRIAQEEIFGPVLTVIPFKDEEEAIEKANDTRYGLAGYVWT 416

Query: 444 NDISQAWRVGKKLQVGMVGINEGIISHTEAAFGGVKESGLGREGSRHGMEEFTQIKYMC 502
           ND+ +A RV   L+ GM+ +N   + H    FGGVK SG+GREG  +  + +T+ K + 
Sbjct: 417 NDVGRAHRVALALEAGMIWVNSQNVRHLPTPFGGVKASGIGREGGTYSFDFYTETKNVA 475


>gnl|CDD|182108 PRK09847, PRK09847, gamma-glutamyl-gamma-aminobutyraldehyde
           dehydrogenase; Provisional.
          Length = 494

 Score =  329 bits (846), Expect = e-108
 Identities = 175/493 (35%), Positives = 279/493 (56%), Gaps = 20/493 (4%)

Query: 17  HMKYLQK--LCLSTYTQAYINGEWVHSS--KKFPVYNPANDEHLADVPDMGVEDVQYAIM 72
           H+ Y Q   L L+   + +INGE+  ++  + F   +P     LA +      D+  A+ 
Sbjct: 5   HLAYWQDKALSLAIENRLFINGEYTAAAENETFETVDPVTQAPLAKIARGKSVDIDRAVS 64

Query: 73  SAKEAFEQ--WSQTTPKERGIVLKRWLKELEGNKQALAEIMTQESGKPPTESL-GEVVYA 129
           +A+  FE+  WS ++P +R  VL +    +E + + LA + T ++GKP   SL  ++  A
Sbjct: 65  AARGVFERGDWSLSSPAKRKAVLNKLADLMEAHAEELALLETLDTGKPIRHSLRDDIPGA 124

Query: 130 NSFVDWFSEEARRTYGELVPSPVKSKELLM-VREPLGVVGLITPWNFPMAMITRKVAPAL 188
              + W++E   + YGE+  +   S EL M VREP+GV+  I PWNFP+ +   K+ PAL
Sbjct: 125 ARAIRWYAEAIDKVYGEVATT--SSHELAMIVREPVGVIAAIVPWNFPLLLTCWKLGPAL 182

Query: 189 AAGCTCVIKPAEDTPLTALALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAG 248
           AAG + ++KP+E +PL+A+ LA LA EA +P GVLNVVT     A   G+ L     +  
Sbjct: 183 AAGNSVILKPSEKSPLSAIRLAGLAKEAGLPDGVLNVVTGFGHEA---GQALSRHNDIDA 239

Query: 249 ISFTGSTQVGKILYSHCSQG-IKRLSLELGGNAPFIVYSNA-NLQKAVSGAMAAKFRNCG 306
           I+FTGST+ GK L        +KR+ LE GG +  IV+++  +LQ+A S   A  F N G
Sbjct: 240 IAFTGSTRTGKQLLKDAGDSNMKRVWLEAGGKSANIVFADCPDLQQAASATAAGIFYNQG 299

Query: 307 QACVAANRFLIQEQVFDQFVEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIV 366
           Q C+A  R L++E + D+F+ +LK +  +++  G    +    +G LI+ A  + V S +
Sbjct: 300 QVCIAGTRLLLEESIADEFLALLKQQA-QNWQPGH-PLDPATTMGTLIDCAHADSVHSFI 357

Query: 367 QDAISKGAKVILGGGKVTKLGSKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEES 426
           ++  SKG +++L G       +    PTI  ++ P+     EE+FGPV V  +F +EE++
Sbjct: 358 REGESKG-QLLLDGRNAGLAAA--IGPTIFVDVDPNASLSREEIFGPVLVVTRFTSEEQA 414

Query: 427 LCVANNSNTGLAGYFFTNDISQAWRVGKKLQVGMVGINEGIISHTEAAFGGVKESGLGRE 486
           L +AN+S  GL    +T D+S+A R+ ++L+ G V +N          FGG K+SG GR+
Sbjct: 415 LQLANDSQYGLGAAVWTRDLSRAHRMSRRLKAGSVFVNNYNDGDMTVPFGGYKQSGNGRD 474

Query: 487 GSRHGMEEFTQIK 499
            S H +E+FT++K
Sbjct: 475 KSLHALEKFTELK 487


>gnl|CDD|132260 TIGR03216, OH_muco_semi_DH, 2-hydroxymuconic semialdehyde
           dehydrogenase.  Members of this protein family are
           2-hydroxymuconic semialdehyde dehydrogenase. Many
           aromatic compounds are catabolized by way of the
           catechol, via the meta-cleavage pathway, to pyruvate and
           acetyl-CoA. This enzyme performs the second of seven
           steps in that pathway for catechol degradation [Energy
           metabolism, Other].
          Length = 481

 Score =  325 bits (834), Expect = e-106
 Identities = 170/486 (34%), Positives = 256/486 (52%), Gaps = 23/486 (4%)

Query: 31  QAYINGEWVHSSKKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFE-QWSQTTPKER 89
           + +ING +V S K F   NP +   +A V + G  +V  A+ +A+ A +  W + T  ER
Sbjct: 2   RNFINGAFVESGKTFANINPVDGRVIARVHEAGAAEVDAAVAAARAALKGPWGKMTVAER 61

Query: 90  GIVLKRWLKELEGNKQALAEIMTQESGKPPTESLGEVV-----YANSFVDWFSEEARRTY 144
             +L     E+E            ++GKP   SL   +      AN  V  F++  +   
Sbjct: 62  ADLLYAVADEIERRFDDFLAAEVADTGKP--RSLASHLDIPRGAANFRV--FADVVKNAP 117

Query: 145 GEL--VPSPVKSKEL-LMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAED 201
            E   + +P     L   VR+PLGVVG+I+PWN P+ ++T KV PALA G T V+KP+E+
Sbjct: 118 TECFEMATPDGKGALNYAVRKPLGVVGVISPWNLPLLLMTWKVGPALACGNTVVVKPSEE 177

Query: 202 TPLTALALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKIL 261
           TP TA  L  + +   +PKGV NVV       +S G+ L   P V  I+FTG T+ G  +
Sbjct: 178 TPGTATLLGEVMNAVGVPKGVYNVVHG--FGPDSAGEFLTRHPGVDAITFTGETRTGSAI 235

Query: 262 YSHCSQGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQV 321
               + G+K +S ELGG    IV+++ +   AV+G + + F N GQ C+   R  ++  +
Sbjct: 236 MKAAADGVKPVSFELGGKNAAIVFADCDFDAAVAGILRSAFLNTGQVCLGTERVYVERPI 295

Query: 322 FDQFVEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGG 381
           FD+FV  LK    +S   G    +   N+GPLI+    +KV S    A+ +GA V+ GGG
Sbjct: 296 FDRFVAALK-ARAESLKIGVPD-DPATNMGPLISAEHRDKVLSYYALAVEEGATVVTGGG 353

Query: 382 KVTKLGSK-----FFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTG 436
            V   G       +  PTI T +       TEE+FGP      F +EEE + +AN++  G
Sbjct: 354 -VPDFGDALAGGAWVQPTIWTGLPDSARVVTEEIFGPCCHIAPFDSEEEVIALANDTPYG 412

Query: 437 LAGYFFTNDISQAWRVGKKLQVGMVGINEGIISHTEAAFGGVKESGLGREGSRHGMEEFT 496
           LA   +T D+S+A RV ++++VG+V +N   +      FGG K SG+GREG  H +E +T
Sbjct: 413 LAASVWTEDLSRAHRVARQMEVGIVWVNSWFLRDLRTPFGGSKLSGIGREGGVHSLEFYT 472

Query: 497 QIKYMC 502
           ++  +C
Sbjct: 473 ELTNVC 478


>gnl|CDD|143431 cd07113, ALDH_PADH_NahF, Escherichia coli NAD+-dependent
           phenylacetaldehyde dehydrogenase PadA-like.
           NAD+-dependent, homodimeric, phenylacetaldehyde
           dehydrogenase (PADH, EC=1.2.1.39) PadA of Escherichia
           coli involved in the catabolism of 2-phenylethylamine,
           and other related sequences, are present in this CD.
           Also included is the Pseudomonas fluorescens ST StyD
           PADH involved in styrene catabolism, the Sphingomonas
           sp. LB126 FldD protein involved in fluorene degradation,
           and the Novosphingobium aromaticivorans NahF
           salicylaldehyde dehydrogenase involved in the
           NAD+-dependent conversion of salicylaldehyde to
           salicylate.
          Length = 477

 Score =  324 bits (832), Expect = e-106
 Identities = 173/479 (36%), Positives = 255/479 (53%), Gaps = 16/479 (3%)

Query: 33  YINGEWV--HSSKKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFE-QWSQTTPKER 89
           +I+G  V   S K+  + NPA ++ +A V      DV  A+ SA  AF   W++TTP ER
Sbjct: 3   FIDGRPVAGQSEKRLDITNPATEQVIASVASATEADVDAAVASAWRAFVSAWAKTTPAER 62

Query: 90  GIVLKRWLKELEGNKQALAEIMTQESGKPPTESLG-EVVYANSFVDWFSEEARRTYGE-L 147
           G +L R    +E + + LA++ T  SGK    S   EV  + +F+ +F+  A +  GE L
Sbjct: 63  GRILLRLADLIEQHGEELAQLETLCSGKSIHLSRAFEVGQSANFLRYFAGWATKINGETL 122

Query: 148 VPS-PVKSKE---LLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTP 203
            PS P    E       REP+GVV  I PWNF + +   K+  ALA GCT VIKP+E TP
Sbjct: 123 APSIPSMQGERYTAFTRREPVGVVAGIVPWNFSVMIAVWKIGAALATGCTIVIKPSEFTP 182

Query: 204 LTALALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYS 263
           LT L +A LA EA IP GVLNVV        +VG  L + P VA +SFTGS   GK +  
Sbjct: 183 LTLLRVAELAKEAGIPDGVLNVVNGK----GAVGAQLISHPDVAKVSFTGSVATGKKIGR 238

Query: 264 HCSQGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFD 323
             +  + R++LELGG        +A++   V G + A F + GQ C A  RF +    FD
Sbjct: 239 QAASDLTRVTLELGGKNAAAFLKDADIDWVVEGLLTAGFLHQGQVCAAPERFYVHRSKFD 298

Query: 324 QFVEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKV 383
           + V  LK  +  SF  G    ++    GPL N    +KV S + DA ++G +++ GG  +
Sbjct: 299 ELVTKLKQAL-SSFQVGS-PMDESVMFGPLANQPHFDKVCSYLDDARAEGDEIVRGGEAL 356

Query: 384 TKLGSKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFT 443
              G  F  PT++   + D     EE FGPV   + ++ EEE + + N++  GL    +T
Sbjct: 357 AGEGY-FVQPTLVLARSADSRLMREETFGPVVSFVPYEDEEELIQLINDTPFGLTASVWT 415

Query: 444 NDISQAWRVGKKLQVGMVGINEGIISHTEAAFGGVKESGLGREGSRHGMEEFTQIKYMC 502
           N++S+A R   +++ G V +N          FGG+K+SG+GRE     ++++T++K + 
Sbjct: 416 NNLSKALRYIPRIEAGTVWVNMHTFLDPAVPFGGMKQSGIGREFGSAFIDDYTELKSVM 474


>gnl|CDD|215410 PLN02766, PLN02766, coniferyl-aldehyde dehydrogenase.
          Length = 501

 Score =  324 bits (832), Expect = e-105
 Identities = 178/481 (37%), Positives = 279/481 (58%), Gaps = 21/481 (4%)

Query: 29  YTQAYINGEWVH--SSKKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQ--WSQT 84
           +T+ +INGE+V   S K F   +P   E +A + +   EDV  A+ +A+EAF+   W + 
Sbjct: 20  FTKLFINGEFVDAASGKTFETRDPRTGEVIARIAEGDKEDVDLAVKAAREAFDHGPWPRM 79

Query: 85  TPKERGIVLKRWLKELEGNKQALAEIMTQESGKPPTESLGEVV---YANSFVDWFSEEAR 141
           +  ERG ++ ++   +E + + LA + T ++GK     LG+ V    A   + +++  A 
Sbjct: 80  SGFERGRIMMKFADLIEEHIEELAALDTIDAGKLFA--LGKAVDIPAAAGLLRYYAGAAD 137

Query: 142 RTYGELVPSPVKSKEL--LMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPA 199
           + +GE +     S++L    ++EP+GVVG I PWNFP  M   KVAPALAAGCT V+KPA
Sbjct: 138 KIHGETLKM---SRQLQGYTLKEPIGVVGHIIPWNFPSTMFFMKVAPALAAGCTMVVKPA 194

Query: 200 EDTPLTALALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVG- 258
           E TPL+AL  A LA  A +P GV+NVVT     A   G  + +   V  +SFTGST+VG 
Sbjct: 195 EQTPLSALFYAHLAKLAGVPDGVINVVTGFGPTA---GAAIASHMDVDKVSFTGSTEVGR 251

Query: 259 KILYSHCSQGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQ 318
           KI+ +  +  +K++SLELGG +P +++ +A++  AV  A+   F N G+ CVA++R  +Q
Sbjct: 252 KIMQAAATSNLKQVSLELGGKSPLLIFDDADVDMAVDLALLGIFYNKGEICVASSRVYVQ 311

Query: 319 EQVFDQFVEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVIL 378
           E ++D+FV+ L VE  K ++ GD   + +   GP ++  Q EK+ S ++    +GA ++ 
Sbjct: 312 EGIYDEFVKKL-VEKAKDWVVGDPF-DPRARQGPQVDKQQFEKILSYIEHGKREGATLLT 369

Query: 379 GGGKVTKLGSKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLA 438
           GG      G  +  PTI T++T DM    +E+FGPV   +KFKT EE++  ANN+  GLA
Sbjct: 370 GGKPCGDKGY-YIEPTIFTDVTEDMKIAQDEIFGPVMSLMKFKTVEEAIKKANNTKYGLA 428

Query: 439 GYFFTNDISQAWRVGKKLQVGMVGINEGIISHTEAAFGGVKESGLGREGSRHGMEEFTQI 498
               T D+  A  V + ++ G + +N       +  FGG K SG GR+     ++++ Q+
Sbjct: 429 AGIVTKDLDVANTVSRSIRAGTIWVNCYFAFDPDCPFGGYKMSGFGRDQGMDALDKYLQV 488

Query: 499 K 499
           K
Sbjct: 489 K 489


>gnl|CDD|236501 PRK09407, gabD2, succinic semialdehyde dehydrogenase; Reviewed.
          Length = 524

 Score =  324 bits (833), Expect = e-105
 Identities = 164/468 (35%), Positives = 239/468 (51%), Gaps = 33/468 (7%)

Query: 40  HSSKKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKE 99
            +     V  P   E LA VP     DV+ A   A+ A   W+ T  +ER  VL R+   
Sbjct: 29  AAGPTREVTAPFTGEPLATVPVSTAADVEAAFARARAAQRAWAATPVRERAAVLLRFHDL 88

Query: 100 LEGNKQALAEIMTQESGKPPTESLGEVV--------YANSFVDWFSEEARRTYGELVPSP 151
           +  N++ L +++  E+GK    +  EV+        YA       +   R      +P  
Sbjct: 89  VLENREELLDLVQLETGKARRHAFEEVLDVALTARYYARRAPKLLAPRRRAG---ALPVL 145

Query: 152 VKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAA 211
            K+ EL   R+P GVVG+I+PWN+P+ +      PAL AG   V+KP   TPLTALA   
Sbjct: 146 TKTTEL---RQPKGVVGVISPWNYPLTLAVSDAIPALLAGNAVVLKPDSQTPLTALAAVE 202

Query: 212 LAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKR 271
           L +EA +P+ +  VVT        VG  L    +   + FTGST  G++L        +R
Sbjct: 203 LLYEAGLPRDLWQVVTGPGP---VVGTAL--VDNADYLMFTGSTATGRVLAEQAG---RR 254

Query: 272 L---SLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEM 328
           L   SLELGG  P IV  +A+L KA +GA+ A F N GQ C++  R  + E ++D+FV  
Sbjct: 255 LIGFSLELGGKNPMIVLDDADLDKAAAGAVRACFSNAGQLCISIERIYVHESIYDEFVRA 314

Query: 329 LKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGS 388
             V   ++   G    +   ++G LI+ AQ+E V + V DA++KGA V+ GG     LG 
Sbjct: 315 F-VAAVRAMRLG-AGYDYSADMGSLISEAQLETVSAHVDDAVAKGATVLAGGKARPDLGP 372

Query: 389 KFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQ 448
            F+ PT+LT +TPDM    EE FGPV         +E++  AN++  GL    +T D ++
Sbjct: 373 LFYEPTVLTGVTPDMELAREETFGPVVSVYPVADVDEAVERANDTPYGLNASVWTGDTAR 432

Query: 449 AWRVGKKLQVGMVGINEGIIS---HTEAAFGGVKESGLGREGSRHGME 493
              +  +++ G V +NEG  +     +A  GG+K+SGLGR   RHG E
Sbjct: 433 GRAIAARIRAGTVNVNEGYAAAWGSVDAPMGGMKDSGLGR---RHGAE 477


>gnl|CDD|143458 cd07140, ALDH_F1L_FTFDH, 10-formyltetrahydrofolate dehydrogenase,
           ALDH family 1L.  10-formyltetrahydrofolate dehydrogenase
           (FTHFDH, EC=1.5.1.6), also known as aldehyde
           dehydrogenase family 1 member L1 (ALDH1L1) in humans, is
           a multi-domain homotetramer with an N-terminal formyl
           transferase domain and a C-terminal ALDH domain. FTHFDH
           catalyzes an NADP+-dependent dehydrogenase reaction
           resulting in the conversion of 10-formyltetrahydrofolate
           to tetrahydrofolate and CO2. The ALDH domain is also
           capable of the oxidation of short chain aldehydes to
           their corresponding acids.
          Length = 486

 Score =  321 bits (824), Expect = e-104
 Identities = 168/481 (34%), Positives = 266/481 (55%), Gaps = 17/481 (3%)

Query: 30  TQAYINGEWV--HSSKKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFE--QWSQTT 85
            Q +INGE+V     K +   NP +   +  V    VEDV  A+ +AKEAFE  +W +  
Sbjct: 6   HQLFINGEFVDAEGGKTYNTINPTDGSVICKVSLATVEDVDRAVAAAKEAFENGEWGKMN 65

Query: 86  PKERGIVLKRWLKELEGNKQALAEIMTQESGKPPTESLG-EVVYANSFVDWFSEEARRTY 144
            ++RG ++ R    +E +++ LA I + +SG   T +L   V  +     +F+    +  
Sbjct: 66  ARDRGRLMYRLADLMEEHQEELATIESLDSGAVYTLALKTHVGMSIQTFRYFAGWCDKIQ 125

Query: 145 GELVP-SPVKSKELLMV--REPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAED 201
           G+ +P +  +    L +  REP+GV G++ PWN+P+ M+  K+A  LAAG T V+KPA+ 
Sbjct: 126 GKTIPINQARPNRNLTLTKREPIGVCGIVIPWNYPLMMLAWKMAACLAAGNTVVLKPAQV 185

Query: 202 TPLTALALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGK-I 260
           TPLTAL  A L  +A  PKGV+N++  S +    VG+ L   P V  + FTGST +GK I
Sbjct: 186 TPLTALKFAELTVKAGFPKGVINILPGSGSL---VGQRLSDHPDVRKLGFTGSTPIGKHI 242

Query: 261 LYSHCSQGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQ 320
           + S     +K++SLELGG +P I++++ ++ KAV   M++ F N G+ C+AA R  ++E 
Sbjct: 243 MKSCAVSNLKKVSLELGGKSPLIIFADCDMDKAVRMGMSSVFFNKGENCIAAGRLFVEES 302

Query: 321 VFDQFVEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGG 380
           + D+FV  +  E+ K  I GD       + GP  + A ++K+    +  + +GA ++ GG
Sbjct: 303 IHDEFVRRVVEEVKKMKI-GDPLDRST-DHGPQNHKAHLDKLVEYCERGVKEGATLVYGG 360

Query: 381 GKVTKLGSKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTE--EESLCVANNSNTGLA 438
            +V + G  FF PT+ T++   M    EE FGP+ +  KF     +  L  AN++  GLA
Sbjct: 361 KQVDRPGF-FFEPTVFTDVEDHMFIAKEESFGPIMIISKFDDGDVDGVLQRANDTEYGLA 419

Query: 439 GYFFTNDISQAWRVGKKLQVGMVGINEGIISHTEAAFGGVKESGLGREGSRHGMEEFTQI 498
              FT DI++A  V  KL+ G V +N    +   A FGG K+SG G++     + E+ + 
Sbjct: 420 SGVFTKDINKALYVSDKLEAGTVFVNTYNKTDVAAPFGGFKQSGFGKDLGEEALNEYLKT 479

Query: 499 K 499
           K
Sbjct: 480 K 480


>gnl|CDD|215259 PLN02466, PLN02466, aldehyde dehydrogenase family 2 member.
          Length = 538

 Score =  322 bits (826), Expect = e-104
 Identities = 182/479 (37%), Positives = 264/479 (55%), Gaps = 17/479 (3%)

Query: 29  YTQAYINGEWVH--SSKKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQ--WSQT 84
           YTQ  ING++V   S K FP  +P   E +A V +   EDV  A+ +A++AF++  W + 
Sbjct: 57  YTQLLINGQFVDAASGKTFPTLDPRTGEVIAHVAEGDAEDVNRAVAAARKAFDEGPWPKM 116

Query: 85  TPKERGIVLKRWLKELEGNKQALAEIMTQESGKPPTESLG-EVVYANSFVDWFSEEARRT 143
           T  ER  +L R+   LE +   LA + T ++GKP  +S   E+        +++  A + 
Sbjct: 117 TAYERSRILLRFADLLEKHNDELAALETWDNGKPYEQSAKAELPMFARLFRYYAGWADKI 176

Query: 144 YGELVPSPVKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTP 203
           +G  VP+      +  + EP+GV G I PWNFP+ M   KV PALA G T V+K AE TP
Sbjct: 177 HGLTVPAD-GPHHVQTLHEPIGVAGQIIPWNFPLLMFAWKVGPALACGNTIVLKTAEQTP 235

Query: 204 LTALALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYS 263
           L+AL  A L HEA +P GVLNVV+     A   G  L +   V  ++FTGST  GKI+  
Sbjct: 236 LSALYAAKLLHEAGLPPGVLNVVSGFGPTA---GAALASHMDVDKLAFTGSTDTGKIVLE 292

Query: 264 HCSQG-IKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVF 322
             ++  +K ++LELGG +PFIV  +A++ KAV  A  A F N GQ C A +R  + E+V+
Sbjct: 293 LAAKSNLKPVTLELGGKSPFIVCEDADVDKAVELAHFALFFNQGQCCCAGSRTFVHERVY 352

Query: 323 DQFVEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGK 382
           D+FVE  K    K  + GD  K      GP I+  Q EK+   ++  +  GA +  GG  
Sbjct: 353 DEFVEKAKARALKRVV-GDPFKKG-VEQGPQIDSEQFEKILRYIKSGVESGATLECGGD- 409

Query: 383 VTKLGSK--FFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGY 440
             + GSK  +  PT+ +N+  DML   +E+FGPV   +KFK  +E +  ANN+  GLA  
Sbjct: 410 --RFGSKGYYIQPTVFSNVQDDMLIAQDEIFGPVQSILKFKDLDEVIRRANNTRYGLAAG 467

Query: 441 FFTNDISQAWRVGKKLQVGMVGINEGIISHTEAAFGGVKESGLGREGSRHGMEEFTQIK 499
            FT ++  A  + + L+VG V +N   +      FGG K SG+GRE   + +  + Q+K
Sbjct: 468 VFTQNLDTANTLSRALRVGTVWVNCFDVFDAAIPFGGYKMSGIGREKGIYSLNNYLQVK 526


>gnl|CDD|143419 cd07101, ALDH_SSADH2_GabD2, Mycobacterium tuberculosis
           succinate-semialdehyde dehydrogenase 2-like.
           Succinate-semialdehyde dehydrogenase 2 (SSADH2) and
           similar proteins are in this CD. SSADH1 (GabD1,
           EC=1.2.1.16) catalyzes the NADP(+)-dependent oxidation
           of succinate semialdehyde to succinate.  SSADH activity
           in Mycobacterium tuberculosis is encoded by both gabD1
           (Rv0234c) and gabD2 (Rv1731), however ,the Vmax of GabD1
           was shown to be much higher than that of GabD2, and
           GabD2 (SSADH2) is likely to serve physiologically as a
           dehydrogenase for a different aldehyde(s).
          Length = 454

 Score =  317 bits (814), Expect = e-103
 Identities = 155/458 (33%), Positives = 235/458 (51%), Gaps = 33/458 (7%)

Query: 50  PANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNKQALAE 109
           P   E L ++P     DV+ A   A+ A   W+     ER  V  R+   +   +  L +
Sbjct: 3   PFTGEPLGELPQSTPADVEAAFARARAAQRAWAARPFAERAAVFLRFHDLVLERRDELLD 62

Query: 110 IMTQESGKPPTESLGEVV--------YANSFVDWFSEEARRTYGELVPSPVKSKELLMVR 161
           ++  E+GK    +  EV+        YA           RR     +P   ++      R
Sbjct: 63  LIQLETGKARRHAFEEVLDVAIVARYYARRAERLLKPRRRRG---AIPVLTRTTVN---R 116

Query: 162 EPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPKG 221
            P GVVG+I+PWN+P+ +      PAL AG   V+KP   T LTAL    L  EA +P+ 
Sbjct: 117 RPKGVVGVISPWNYPLTLAVSDAIPALLAGNAVVLKPDSQTALTALWAVELLIEAGLPRD 176

Query: 222 VLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRL---SLELGG 278
           +  VVT      + VG  +    +   + FTGST  G+++        +RL   SLELGG
Sbjct: 177 LWQVVTGP---GSEVGGAIVD--NADYVMFTGSTATGRVVAERAG---RRLIGCSLELGG 228

Query: 279 NAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVEITKSFI 338
             P IV  +A+L KA +GA+ A F N GQ CV+  R  + E V+D+FV    V  T++  
Sbjct: 229 KNPMIVLEDADLDKAAAGAVRACFSNAGQLCVSIERIYVHESVYDEFVRRF-VARTRALR 287

Query: 339 QGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSKFFAPTILTN 398
            G  + +   ++G LI+ AQ+++V + V DA++KGA V+ GG     LG  F+ PT+LT 
Sbjct: 288 LG-AALDYGPDMGSLISQAQLDRVTAHVDDAVAKGATVLAGGRARPDLGPYFYEPTVLTG 346

Query: 399 ITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQV 458
           +T DM  + EE FGPV    +   ++E++ +AN+++ GL    +T D ++  R+  +L+ 
Sbjct: 347 VTEDMELFAEETFGPVVSIYRVADDDEAIELANDTDYGLNASVWTRDGARGRRIAARLRA 406

Query: 459 GMVGINEGIIS---HTEAAFGGVKESGLGREGSRHGME 493
           G V +NEG  +     +A  GG+K+SGLGR   RHG E
Sbjct: 407 GTVNVNEGYAAAWASIDAPMGGMKDSGLGR---RHGAE 441


>gnl|CDD|143425 cd07107, ALDH_PhdK-like, Nocardioides 2-carboxybenzaldehyde
           dehydrogenase, PhdK-like.  Nocardioides sp. strain
           KP72-carboxybenzaldehyde dehydrogenase (PhdK), an enzyme
           involved in phenanthrene degradation, and other similar
           sequences, are present in this CD.
          Length = 456

 Score =  317 bits (813), Expect = e-103
 Identities = 161/461 (34%), Positives = 243/461 (52%), Gaps = 19/461 (4%)

Query: 47  VYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNKQA 106
           V NPA  + LA VP     DV  A+ +A+ AF +W  TTP ER  +L+     L  + + 
Sbjct: 1   VINPATGQVLARVPAASAADVDRAVAAARAAFPEWRATTPLERARMLRELATRLREHAEE 60

Query: 107 LAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVPSPVKSKELLM-VREPLG 165
           LA I   + G P +  LG+V+ A + +D+F+       GE +P   ++  L   +REP G
Sbjct: 61  LALIDALDCGNPVSAMLGDVMVAAALLDYFAGLVTELKGETIPVGGRN--LHYTLREPYG 118

Query: 166 VVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPKGVLNV 225
           VV  I  +N P+     K+A  LAAG T V+KP E  PL+AL LA LA E  +P GV N+
Sbjct: 119 VVARIVAFNHPLMFAAAKIAAPLAAGNTVVVKPPEQAPLSALRLAELAREV-LPPGVFNI 177

Query: 226 VTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNAPFIVY 285
           +      A   G  L   P V  I+  GS   G+ +    ++GIK ++LELGG    IV+
Sbjct: 178 LPGDGATA---GAALVRHPDVKRIALIGSVPTGRAIMRAAAEGIKHVTLELGGKNALIVF 234

Query: 286 SNANLQKAVSGAMAA-KFRNCGQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQGDTSK 344
            +A+ + A   A+A   F  CGQ+C + +R  + E ++D+ +  + VE   +   GD + 
Sbjct: 235 PDADPEAAADAAVAGMNFTWCGQSCGSTSRLFVHESIYDEVLARV-VERVAAIKVGDPT- 292

Query: 345 NDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKV-TKLGSK--FFAPTILTNITP 401
           +    +GPL++  Q ++V   +  A  +GA+++ GGG+          +  PT+  ++TP
Sbjct: 293 DPATTMGPLVSRQQYDRVMHYIDSAKREGARLVTGGGRPEGPALEGGFYVEPTVFADVTP 352

Query: 402 DMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQVGMV 461
            M    EE+FGPV   ++++ E E +  AN    GL    +TNDISQA R  ++++ G V
Sbjct: 353 GMRIAREEIFGPVLSVLRWRDEAEMVAQANGVEYGLTAAIWTNDISQAHRTARRVEAGYV 412

Query: 462 GINEGIISHTEAAFGGVKESGLGREGSRHGMEE---FTQIK 499
            IN        A FGGVK SG+GRE     +EE   +TQ K
Sbjct: 413 WINGSSRHFLGAPFGGVKNSGIGRE---ECLEELLSYTQEK 450


>gnl|CDD|143404 cd07085, ALDH_F6_MMSDH, Methylmalonate semialdehyde dehydrogenase
           and ALDH family members 6A1 and 6B2.  Methylmalonate
           semialdehyde dehydrogenase (MMSDH, EC=1.2.1.27)
           [acylating] from Bacillus subtilis is involved in valine
           metabolism and catalyses the NAD+- and CoA-dependent
           oxidation of methylmalonate semialdehyde into
           propionyl-CoA. Mitochondrial human MMSDH ALDH6A1 and
           Arabidopsis MMSDH ALDH6B2 are also present in this CD.
          Length = 478

 Score =  316 bits (812), Expect = e-103
 Identities = 161/481 (33%), Positives = 247/481 (51%), Gaps = 20/481 (4%)

Query: 30  TQAYINGEWVHSS--KKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPK 87
            + +INGEWV S   +   VYNPA  E +A VP    E+V  A+ +AK AF  WS T   
Sbjct: 1   LKLFINGEWVESKTTEWLDVYNPATGEVIARVPLATAEEVDAAVAAAKAAFPAWSATPVL 60

Query: 88  ERGIVLKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGEL 147
           +R  V+ ++ + LE N   LA ++T E GK   ++ G+V+     V++         GE 
Sbjct: 61  KRQQVMFKFRQLLEENLDELARLITLEHGKTLADARGDVLRGLEVVEFACSIPHLLKGEY 120

Query: 148 VPSPVKSKELLMVREPLGVVGLITPWNFPMAMITR-KVAPALAAGCTCVIKPAEDTPLTA 206
           + +  +  +    R+PLGVV  ITP+NFP AMI       A+A G T V+KP+E  P  A
Sbjct: 121 LENVARGIDTYSYRQPLGVVAGITPFNFP-AMIPLWMFPMAIACGNTFVLKPSERVPGAA 179

Query: 207 LALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCS 266
           + LA L  EA +P GVLNVV   +   N+    L   P +  +SF GST VG+ +Y   +
Sbjct: 180 MRLAELLQEAGLPDGVLNVVHGGKEAVNA----LLDHPDIKAVSFVGSTPVGEYIYERAA 235

Query: 267 QGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFV 326
              KR+    G     +V  +A+L++  +  + A F   GQ C+A +  +      D+++
Sbjct: 236 ANGKRVQALGGAKNHAVVMPDADLEQTANALVGAAFGAAGQRCMALSVAVAVGDEADEWI 295

Query: 327 EMLKVEITKSFI-QGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTK 385
             L     K  +  GD    D   +GP+I+ A  E+++ +++  + +GAK++L G  V  
Sbjct: 296 PKLVERAKKLKVGAGDDPGAD---MGPVISPAAKERIEGLIESGVEEGAKLVLDGRGVKV 352

Query: 386 LGSK---FFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFF 442
            G +   F  PTIL N+TPDM  Y EE+FGPV   ++  T +E++ + N +  G     F
Sbjct: 353 PGYENGNFVGPTILDNVTPDMKIYKEEIFGPVLSIVRVDTLDEAIAIINANPYGNGAAIF 412

Query: 443 TNDISQAWRVGKKLQVGMVGINEGIISHTEA-AFGGVKESGLGREGSRHGMEE---FTQI 498
           T   + A +  +++  GMVGIN  I       +FGG K S  G +   +G +    +TQ 
Sbjct: 413 TRSGAAARKFQREVDAGMVGINVPIPVPLAFFSFGGWKGSFFG-DLHFYGKDGVRFYTQT 471

Query: 499 K 499
           K
Sbjct: 472 K 472


>gnl|CDD|143395 cd06534, ALDH-SF, NAD(P)+-dependent aldehyde dehydrogenase
           superfamily.  The aldehyde dehydrogenase superfamily
           (ALDH-SF) of  NAD(P)+-dependent enzymes, in general,
           oxidize a wide range of  endogenous and exogenous
           aliphatic and aromatic aldehydes to their corresponding
           carboxylic acids and play an  important role in
           detoxification. Besides aldehyde detoxification, many
           ALDH isozymes possess multiple additional catalytic and
           non-catalytic functions such as participating in
           metabolic pathways, or as binding proteins, or
           osmoregulants, to mention a few. The enzyme has three
           domains, a NAD(P)+ cofactor-binding domain, a catalytic
           domain, and a bridging domain; and the active enzyme is
           generally either homodimeric or homotetrameric. The
           catalytic mechanism is proposed to involve cofactor
           binding, resulting in a conformational change and
           activation of an invariant catalytic cysteine
           nucleophile. The cysteine and aldehyde substrate form an
           oxyanion thiohemiacetal intermediate resulting in
           hydride transfer to the cofactor and formation of a
           thioacylenzyme intermediate. Hydrolysis of the
           thioacylenzyme and release of the carboxylic acid
           product occurs, and in most cases, the reduced cofactor
           dissociates from the enzyme. The evolutionary
           phylogenetic tree of ALDHs appears to have an initial
           bifurcation between what has been characterized as the
           classical aldehyde dehydrogenases, the ALDH family
           (ALDH) and extended family members or aldehyde
           dehydrogenase-like (ALDH-L) proteins. The ALDH proteins
           are represented by enzymes which share a number of
           highly conserved residues necessary for catalysis and
           cofactor binding and they include such proteins as
           retinal dehydrogenase, 10-formyltetrahydrofolate
           dehydrogenase, non-phosphorylating glyceraldehyde
           3-phosphate dehydrogenase,
           delta(1)-pyrroline-5-carboxylate dehydrogenases,
           alpha-ketoglutaric semialdehyde dehydrogenase,
           alpha-aminoadipic semialdehyde dehydrogenase, coniferyl
           aldehyde dehydrogenase and succinate-semialdehyde
           dehydrogenase.  Included in this larger group are all
           human, Arabidopsis, Tortula, fungal, protozoan, and
           Drosophila ALDHs identified in families ALDH1 through
           ALDH22 with the exception of families ALDH18, ALDH19,
           and ALDH20 which are present in the ALDH-like group. The
           ALDH-like group is represented by such proteins as
           gamma-glutamyl phosphate reductase, LuxC-like acyl-CoA
           reductase, and coenzyme A acylating aldehyde
           dehydrogenase. All of these proteins have a conserved
           cysteine that aligns with the catalytic cysteine of the
           ALDH group.
          Length = 367

 Score =  305 bits (784), Expect = e-100
 Identities = 120/258 (46%), Positives = 162/258 (62%), Gaps = 3/258 (1%)

Query: 73  SAKEAFEQWSQTTPKERGIVLKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSF 132
           +A+ AF+ W+   P ER  +L++    LE  ++ LA + T E+GKP  E+LGEV  A   
Sbjct: 2   AARAAFKAWAALPPAERAAILRKIADLLEERREELAALETLETGKPIEEALGEVARAIDT 61

Query: 133 VDWFSEEARRTYGELVPSPVKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGC 192
             + +  A +  G  +PSP    E  + REPLGVVG+ITPWNFP+ +   K+APALAAG 
Sbjct: 62  FRYAAGLADKLGGPELPSPDPGGEAYVRREPLGVVGVITPWNFPLLLAAWKLAPALAAGN 121

Query: 193 TCVIKPAEDTPLTALALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFT 252
           T V+KP+E TPLTALALA L  EA +P GV+NVV       + VG  L + P V  ISFT
Sbjct: 122 TVVLKPSELTPLTALALAELLQEAGLPPGVVNVVPGG---GDEVGAALLSHPRVDKISFT 178

Query: 253 GSTQVGKILYSHCSQGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAA 312
           GST VGK +    ++ +K ++LELGG +P IV  +A+L  AV GA+   F N GQ C AA
Sbjct: 179 GSTAVGKAIMKAAAENLKPVTLELGGKSPVIVDEDADLDAAVEGAVFGAFFNAGQICTAA 238

Query: 313 NRFLIQEQVFDQFVEMLK 330
           +R L+ E ++D+FVE L 
Sbjct: 239 SRLLVHESIYDEFVEKLV 256



 Score =  132 bits (335), Expect = 2e-34
 Identities = 51/107 (47%), Positives = 73/107 (68%), Gaps = 1/107 (0%)

Query: 394 TILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRVG 453
           T+L ++ PDM    EE+FGPV   I+FK EEE++ +AN++  GL    FT D+++A RV 
Sbjct: 257 TVLVDVDPDMPIAQEEIFGPVLPVIRFKDEEEAIALANDTEYGLTAGVFTRDLNRALRVA 316

Query: 454 KKLQVGMVGINEG-IISHTEAAFGGVKESGLGREGSRHGMEEFTQIK 499
           ++L+ G V IN+  I    EA FGGVK SG+GREG  +G+EE+T+ K
Sbjct: 317 ERLRAGTVYINDSSIGVGPEAPFGGVKNSGIGREGGPYGLEEYTRTK 363


>gnl|CDD|143420 cd07102, ALDH_EDX86601, Uncharacterized aldehyde dehydrogenase of
           Synechococcus sp. PCC 7335 (EDX86601).  Uncharacterized
           aldehyde dehydrogenase of Synechococcus sp. PCC 7335
           (locus EDX86601) and other similar sequences, are
           present in this CD.
          Length = 452

 Score =  303 bits (779), Expect = 3e-98
 Identities = 141/454 (31%), Positives = 225/454 (49%), Gaps = 8/454 (1%)

Query: 48  YNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNKQAL 107
            +P +   +A+ P   +E V+ A+  A+ A + W     +ER  ++ R ++ L  N   +
Sbjct: 1   ISPIDGSVIAERPLASLEAVRAALERARAAQKGWRAVPLEERKAIVTRAVELLAANTDEI 60

Query: 108 AEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVPSPVKSKELLMVREPLGVV 167
           AE +T + G+P  ++ GE+        +    A     ++        E  + REPLGVV
Sbjct: 61  AEELTWQMGRPIAQAGGEIRGMLERARYMISIAEEALADIRVPEKDGFERYIRREPLGVV 120

Query: 168 GLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPKGVLNVVT 227
            +I PWN+P       V PAL AG   ++K +  TPL     AA   EA +P+GV  V+ 
Sbjct: 121 LIIAPWNYPYLTAVNAVIPALLAGNAVILKHSPQTPLCGERFAAAFAEAGLPEGVFQVLH 180

Query: 228 SSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNAPFIVYSN 287
            S    +     L   P +  +SFTGS   G+ +    +    ++ LELGG  P  V  +
Sbjct: 181 LS----HETSAALIADPRIDHVSFTGSVAGGRAIQRAAAGRFIKVGLELGGKDPAYVRPD 236

Query: 288 ANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQGDTSKNDK 347
           A+L  A    +   F N GQ+C +  R  + E ++D FVE   V + K +  GD      
Sbjct: 237 ADLDAAAESLVDGAFFNSGQSCCSIERIYVHESIYDAFVEAF-VAVVKGYKLGDPLDPST 295

Query: 348 CNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGK--VTKLGSKFFAPTILTNITPDMLC 405
             LGP+++    + V + + DAI+KGA+ ++ G      K G  + APT+LTN+   M  
Sbjct: 296 -TLGPVVSARAADFVRAQIADAIAKGARALIDGALFPEDKAGGAYLAPTVLTNVDHSMRV 354

Query: 406 YTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQVGMVGINE 465
             EE FGPV   +K K++ E++ + N+S  GL    +T DI++A  +G++L+ G V +N 
Sbjct: 355 MREETFGPVVGIMKVKSDAEAIALMNDSEYGLTASVWTKDIARAEALGEQLETGTVFMNR 414

Query: 466 GIISHTEAAFGGVKESGLGREGSRHGMEEFTQIK 499
                   A+ GVK+SG G   SR G ++ T+ K
Sbjct: 415 CDYLDPALAWTGVKDSGRGVTLSRLGYDQLTRPK 448


>gnl|CDD|143442 cd07124, ALDH_PutA-P5CDH-RocA, Delta(1)-pyrroline-5-carboxylate
           dehydrogenase, RocA.  Delta(1)-pyrroline-5-carboxylate
           dehydrogenase (EC=1.5.1.12 ), RocA: a proline catabolic
           enzyme of the aldehyde dehydrogenase (ALDH) protein
           superfamily. The proline catabolic enzymes, proline
           dehydrogenase and Delta(1)-pyrroline-5-carboxylate
           dehydrogenase (P5CDH), catalyze the two-step oxidation
           of proline to glutamate; P5CDH catalyzes the oxidation
           of glutamate semialdehyde, utilizing NAD+ as the
           electron acceptor. In some bacteria, the two enzymes are
           fused into the bifunctional flavoenzyme, proline
           utilization A (PutA). In this CD, monofunctional enzyme
           sequences such as seen in the Bacillus subtilis RocA
           P5CDH are also present. These enzymes play important
           roles in cellular redox control, superoxide generation,
           and apoptosis.
          Length = 512

 Score =  305 bits (783), Expect = 4e-98
 Identities = 157/466 (33%), Positives = 239/466 (51%), Gaps = 25/466 (5%)

Query: 33  YINGEWVHSSKKFPVYNPAN-DEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGI 91
            I G+ V + +K    NPA+  E L  V     E+ + A+ +A+ AF  W +T P+ER  
Sbjct: 36  VIGGKEVRTEEKIESRNPADPSEVLGTVQKATKEEAEAAVQAARAAFPTWRRTPPEERAR 95

Query: 92  VLKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVPSP 151
           +L R    L   +  LA  M  E GK   E+  +V  A  F+++++ E  R         
Sbjct: 96  LLLRAAALLRRRRFELAAWMVLEVGKNWAEADADVAEAIDFLEYYAREMLR-LRGFPVEM 154

Query: 152 VKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAA 211
           V  ++   V  PLGV  +I+PWNFP+A++      AL  G T V+KPAEDTP+ A  L  
Sbjct: 155 VPGEDNRYVYRPLGVGAVISPWNFPLAILAGMTTAALVTGNTVVLKPAEDTPVIAAKLVE 214

Query: 212 LAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQG--- 268
           +  EA +P GV+N +         VG  L   P V  I+FTGS +VG  +Y   ++    
Sbjct: 215 ILEEAGLPPGVVNFLPGP---GEEVGDYLVEHPDVRFIAFTGSREVGLRIYERAAKVQPG 271

Query: 269 ---IKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQF 325
              +KR+  E+GG    IV  +A+L +A  G + + F   GQ C A +R ++ E V+D+F
Sbjct: 272 QKWLKRVIAEMGGKNAIIVDEDADLDEAAEGIVRSAFGFQGQKCSACSRVIVHESVYDEF 331

Query: 326 VEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTK 385
           +E L VE TK+   GD    +   +GP+I+    +++   ++   S+G +++LGG  V +
Sbjct: 332 LERL-VERTKALKVGDPEDPE-VYMGPVIDKGARDRIRRYIEIGKSEG-RLLLGGE-VLE 387

Query: 386 LGSK--FFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFT 443
           L ++  F  PTI  ++ PD     EE+FGPV   IK K  +E+L +AN++  GL G  F+
Sbjct: 388 LAAEGYFVQPTIFADVPPDHRLAQEEIFGPVLAVIKAKDFDEALEIANDTEYGLTGGVFS 447

Query: 444 NDISQAWRVGKKLQVGMVGINEGIISHTEA-----AFGGVKESGLG 484
                  R  ++ +VG +  N  I   T A      FGG K SG G
Sbjct: 448 RSPEHLERARREFEVGNLYANRKI---TGALVGRQPFGGFKMSGTG 490


>gnl|CDD|132417 TIGR03374, ABALDH, 1-pyrroline dehydrogenase.  Members of this
           protein family are 1-pyrroline dehydrogenase (1.5.1.35),
           also called gamma-aminobutyraldehyde dehydrogenase. This
           enzyme can follow putrescine transaminase (EC 2.6.1.82)
           for a two-step conversion of putrescine to
           gamma-aminobutyric acid (GABA). The member from
           Escherichia coli is characterized as a homotetramer that
           binds one NADH per momomer. This enzyme belongs to the
           medium-chain aldehyde dehydrogenases, and is quite
           similar in sequence to the betaine aldehyde
           dehydrogenase (EC 1.2.1.8) family.
          Length = 472

 Score =  299 bits (766), Expect = 3e-96
 Identities = 172/475 (36%), Positives = 255/475 (53%), Gaps = 12/475 (2%)

Query: 31  QAYINGEWVHSS-KKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKER 89
           +  INGE V    +K PVYNPA  E + ++ +   E V  A+ +A  AF +W QTTPK R
Sbjct: 3   KLLINGELVSGEGEKQPVYNPATGEVILEIAEASAEQVDAAVRAADAAFAEWGQTTPKAR 62

Query: 90  GIVLKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVD---WFSEEARRTYGE 146
              L +    +E N Q  AE+ ++  GKP      + + A   VD   +F+  AR   G 
Sbjct: 63  AECLLKLADVIEENAQVFAELESRNCGKPLHSVFNDEIPA--IVDVFRFFAGAARCLSGL 120

Query: 147 LVPSPVKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTA 206
                ++    ++ R+PLGVV  I PWN+P+ M   K+APALAAG   V+KP+E TPLTA
Sbjct: 121 AAGEYLEGHTSMIRRDPLGVVASIAPWNYPLMMAAWKLAPALAAGNCVVLKPSEITPLTA 180

Query: 207 LALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCS 266
           L LA LA +   P GV+N++        +VG  L     V  +S TGS   G+ + SH +
Sbjct: 181 LKLAELAKDI-FPAGVVNILFG---RGKTVGDPLTGHEKVRMVSLTGSIATGEHILSHTA 236

Query: 267 QGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFV 326
             IKR  +ELGG AP IV+ +A++   V G     F N GQ C AA R   Q  ++D  V
Sbjct: 237 PSIKRTHMELGGKAPVIVFDDADIDAVVEGVRTFGFYNAGQDCTAACRIYAQRGIYDTLV 296

Query: 327 EMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKL 386
           E L   +  + ++     ++   LGPL +LA +E+V   V++A + G   ++ GG+  K 
Sbjct: 297 EKLGAAV--ATLKSGAPDDESTELGPLSSLAHLERVMKAVEEAKALGHIKVITGGEKRKG 354

Query: 387 GSKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDI 446
              +FAPT+L     D     +EVFGPV     F  EE+ +  AN+S  GLA   +T D+
Sbjct: 355 NGYYFAPTLLAGAKQDDAIVQKEVFGPVVSITSFDDEEQVVNWANDSQYGLASSVWTKDV 414

Query: 447 SQAWRVGKKLQVGMVGINEGIISHTEAAFGGVKESGLGREGSRHGMEEFTQIKYM 501
            +A R+  +LQ G   +N   +  +E   GG K SG G++ S +G+E++T ++++
Sbjct: 415 GRAHRLSARLQYGCTWVNTHFMLVSEMPHGGQKLSGYGKDMSLYGLEDYTVVRHI 469


>gnl|CDD|182233 PRK10090, PRK10090, aldehyde dehydrogenase A; Provisional.
          Length = 409

 Score =  295 bits (758), Expect = 9e-96
 Identities = 143/411 (34%), Positives = 230/411 (55%), Gaps = 7/411 (1%)

Query: 93  LKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVPSPV 152
           L++    +      ++ ++ +E GK    +  EV +   ++D+ +E ARR  GE++ S  
Sbjct: 1   LRKIAAGIRERASEISALIVEEGGKIQQLAEVEVAFTADYIDYMAEWARRYEGEIIQSDR 60

Query: 153 KSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAAL 212
             + +L+ +  LGV   I PWNFP  +I RK+APAL  G T VIKP+E TP  A+A A +
Sbjct: 61  PGENILLFKRALGVTTGILPWNFPFFLIARKMAPALLTGNTIVIKPSEFTPNNAIAFAKI 120

Query: 213 AHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRL 272
             E  +PKGV N+V        +VG+ L  +P VA +S TGS   G+ + +  ++ I ++
Sbjct: 121 VDEIGLPKGVFNLVLGR---GETVGQELAGNPKVAMVSMTGSVSAGEKIMAAAAKNITKV 177

Query: 273 SLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVE 332
            LELGG AP IV  +A+L  AV   + ++  N GQ C  A R  +Q+ ++DQFV  L  E
Sbjct: 178 CLELGGKAPAIVMDDADLDLAVKAIVDSRVINSGQVCNCAERVYVQKGIYDQFVNRL-GE 236

Query: 333 ITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSKFFA 392
             ++   G+ ++ +   +GPLIN A +E+V+  V  A+ +GA+V LGG  V   G  ++ 
Sbjct: 237 AMQAVQFGNPAERNDIAMGPLINAAALERVEQKVARAVEEGARVALGGKAVEGKGY-YYP 295

Query: 393 PTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRV 452
           PT+L ++  +M    EE FGPV   + F T EE++ +AN+S+ GL    +T +++ A + 
Sbjct: 296 PTLLLDVRQEMSIMHEETFGPVLPVVAFDTLEEAIAMANDSDYGLTSSIYTQNLNVAMKA 355

Query: 453 GKKLQVGMVGINEGIISHTEAAFGGVKESGLGREGSRHGMEEF--TQIKYM 501
            K L+ G   IN       +    G ++SG+G    +HG+ E+  TQ+ Y+
Sbjct: 356 IKGLKFGETYINRENFEAMQGFHAGWRKSGIGGADGKHGLHEYLQTQVVYL 406


>gnl|CDD|212007 TIGR04284, aldehy_Rv0768, aldehyde dehydrogenase, Rv0768 family.
           This family describes a branch of the aldehyde
           dehydrogenase (NAD) family (see pfam00171) that includes
           Rv0768 from Mycobacterium tuberculosis. All members of
           this family belong to species predicted to synthesize
           mycofactocin, suggesting that this enzyme or another
           upstream or downstream in the same pathway might be
           mycofactocin-dependent. However, the taxonomic range of
           this family is not nearly broad enough to make that
           relationship conclusive [Unknown function, Enzymes of
           unknown specificity].
          Length = 480

 Score =  290 bits (745), Expect = 6e-93
 Identities = 162/487 (33%), Positives = 241/487 (49%), Gaps = 30/487 (6%)

Query: 30  TQAYINGEWVH-SSKKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQ--WSQTTP 86
           ++  I+G+ V  S+  FP  NPA +E L    D    D+  AI +A+ AF++  WS+ T 
Sbjct: 1   SRLLIDGKLVAGSAGTFPTVNPATEEVLGVAADATAADMDAAIAAARRAFDETDWSRDT- 59

Query: 87  KERGIVLKRWLKELEGNKQALAEIMTQESGKP-----------PTESLGEVVYANSFVDW 135
             R   L++    L  + + L E+   E G P           P + LG          W
Sbjct: 60  ALRVRCLRQLRDALRAHVEELRELTIAEVGAPRMLTAGAQLEGPVDDLGFAADLAESYAW 119

Query: 136 FSEEARRTYGELVPSPVKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCV 195
                    G   P  + ++  L  RE +GVVG ITPWNFP  +   K+ PALAAG T V
Sbjct: 120 -----TTDLGVASPMGIPTRRTL-RREAVGVVGAITPWNFPHQINLAKLGPALAAGNTVV 173

Query: 196 IKPAEDTPLTALALAAL-AHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGS 254
           +KPA DTP  A  L  L A     P GV+N+VTSS    + +G LL   P V  +SFTGS
Sbjct: 174 LKPAPDTPWCAAVLGELIAEHTDFPPGVVNIVTSSD---HRLGALLAKDPRVDMVSFTGS 230

Query: 255 TQVGKILYSHCSQGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANR 314
           T  G+ + +  +  +K++ LELGG + FIV  +A+L  A S A      + GQ C    R
Sbjct: 231 TATGRAVMADAAATLKKVFLELGGKSAFIVLDDADLAAACSMAAFTVCMHAGQGCAITTR 290

Query: 315 FLIQEQVFDQFVEMLKVEITKSFIQGD-TSKNDKCNLGPLINLAQVEKVDSIVQDAISKG 373
            ++    +D+ V         S   GD       C  GP+I+  Q ++V S +  A+++G
Sbjct: 291 LVVPRARYDEAVAAA-AATMGSIKPGDPADPGTVC--GPVISARQRDRVQSYLDLAVAEG 347

Query: 374 AKVILGGGKVTKLGSKFFA-PTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANN 432
            +   GGG+       FF  PT++  +  +     EE+FGPV   I    +++++ +AN+
Sbjct: 348 GRFACGGGRPADRDRGFFVEPTVIAGLDNNARVAREEIFGPVLTVIAHDGDDDAVRIAND 407

Query: 433 SNTGLAGYFFTNDISQAWRVGKKLQVGMVGINEGIISHTEAAFGGVKESGLGREGSRHGM 492
           S  GL+G  F  D  +A  V  +++ G V +N G+    +A FGG K+SG+GRE    G 
Sbjct: 408 SPYGLSGTVFGADPERAAAVAARVRTGTVNVNGGVWYSADAPFGGYKQSGIGREMGVAGF 467

Query: 493 EEFTQIK 499
           EE+ + K
Sbjct: 468 EEYLETK 474


>gnl|CDD|143434 cd07116, ALDH_ACDHII-AcoD, Ralstonia eutrophus NAD+-dependent
           acetaldehyde dehydrogenase II-like.  Included in this CD
           is the NAD+-dependent, acetaldehyde dehydrogenase II
           (AcDHII, AcoD, EC=1.2.1.3) from Ralstonia (Alcaligenes)
           eutrophus H16 involved in the catabolism of acetoin and
           ethanol, and similar proteins, such as, the dimeric
           dihydrolipoamide dehydrogenase of the acetoin
           dehydrogenase enzyme system of Klebsiella pneumonia.
           Also included are sequences similar to the
           NAD+-dependent chloroacetaldehyde dehydrogenases (AldA
           and AldB) of Xanthobacter autotrophicus GJ10 which are
           involved in the degradation of 1,2-dichloroethane. These
           proteins apparently require RpoN factors for expression.
          Length = 479

 Score =  288 bits (739), Expect = 4e-92
 Identities = 170/485 (35%), Positives = 256/485 (52%), Gaps = 29/485 (5%)

Query: 33  YINGEWVHSSKK--FPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERG 90
           +I GEWV   K   F    P   +   +VP    ED++ A+ +A  A E W +T+  ER 
Sbjct: 4   FIGGEWVAPVKGEYFDNITPVTGKVFCEVPRSTAEDIELALDAAHAAKEAWGKTSVAERA 63

Query: 91  IVLKRWLKELEGNKQALAEIMTQESGKPPTESLG-EVVYANSFVDWFSEEARRTYGELVP 149
            +L +    +E N + LA   T ++GKP  E+L  ++  A     +F+   R   G +  
Sbjct: 64  NILNKIADRMEANLEMLAVAETWDNGKPVRETLAADIPLAIDHFRYFAGCIRAQEGSI-- 121

Query: 150 SPVKSKEL-LMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALA 208
           S +    +     EPLGVVG I PWNFP+ M T K+APALAAG   V+KPAE TP + L 
Sbjct: 122 SEIDENTVAYHFHEPLGVVGQIIPWNFPLLMATWKLAPALAAGNCVVLKPAEQTPASILV 181

Query: 209 LAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQG 268
           L  L  +  +P GV+NVV      A   GK L +S  +A ++FTG T  G+++  + S+ 
Sbjct: 182 LMELIGDL-LPPGVVNVVNGFGLEA---GKPLASSKRIAKVAFTGETTTGRLIMQYASEN 237

Query: 269 IKRLSLELGGNAPFIVYS------NANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVF 322
           I  ++LELGG +P I ++      +A   KA+ G +     N G+ C   +R LIQE ++
Sbjct: 238 IIPVTLELGGKSPNIFFADVMDADDAFFDKALEGFVMFAL-NQGEVCTCPSRALIQESIY 296

Query: 323 DQFVEMLKVEITKSFIQG---DTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILG 379
           D+F+E   +E  K+  QG   DT       +G   +L Q+EK+ S +     +GA+V+ G
Sbjct: 297 DRFME-RALERVKAIKQGNPLDTE----TMIGAQASLEQLEKILSYIDIGKEEGAEVLTG 351

Query: 380 GGKVT---KLGSKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTG 436
           G +      LG  ++ PT        M  + EE+FGPV     FK EEE+L +AN++  G
Sbjct: 352 GERNELGGLLGGGYYVPTTFKG-GNKMRIFQEEIFGPVLAVTTFKDEEEALEIANDTLYG 410

Query: 437 LAGYFFTNDISQAWRVGKKLQVGMVGINEGIISHTEAAFGGVKESGLGREGSRHGMEEFT 496
           L    +T D + A+R+G+ +Q G V  N   +    AAFGG K+SG+GRE  +  ++ + 
Sbjct: 411 LGAGVWTRDGNTAYRMGRGIQAGRVWTNCYHLYPAHAAFGGYKQSGIGRENHKMMLDHYQ 470

Query: 497 QIKYM 501
           Q K +
Sbjct: 471 QTKNL 475


>gnl|CDD|181826 PRK09406, gabD1, succinic semialdehyde dehydrogenase; Reviewed.
          Length = 457

 Score =  286 bits (734), Expect = 1e-91
 Identities = 158/456 (34%), Positives = 231/456 (50%), Gaps = 15/456 (3%)

Query: 49  NPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNKQALA 108
           NPA  E +     +  ++V  AI  A   F  +  TT  +R          LE     +A
Sbjct: 7   NPATGETVKTFTALTDDEVDAAIARAHARFRDYRTTTFAQRARWANAAADLLEAEADQVA 66

Query: 109 EIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVPSP-----VKSKELLMVREP 163
            +MT E GK    +  E +       +++E A      L   P     V +    +  +P
Sbjct: 67  ALMTLEMGKTLASAKAEALKCAKGFRYYAEHAEAL---LADEPADAAAVGASRAYVRYQP 123

Query: 164 LGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPKGVL 223
           LGVV  + PWNFP+  + R  APAL AG   ++K A + P TAL LA L   A  P G  
Sbjct: 124 LGVVLAVMPWNFPLWQVVRFAAPALMAGNVGLLKHASNVPQTALYLADLFRRAGFPDGCF 183

Query: 224 NVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNAPFI 283
             +      + +V  +L   P VA  + TGS   G+ + +     IK+  LELGG+ PFI
Sbjct: 184 QTLLVG---SGAVEAIL-RDPRVAAATLTGSEPAGRAVAAIAGDEIKKTVLELGGSDPFI 239

Query: 284 VYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQGDTS 343
           V  +A+L +A   A+ A+ +N GQ+C+AA RF++   V+D F E   V    +   GD +
Sbjct: 240 VMPSADLDRAAETAVTARVQNNGQSCIAAKRFIVHADVYDAFAEKF-VARMAALRVGDPT 298

Query: 344 KNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSKFFAPTILTNITPDM 403
            +   ++GPL      ++V+  V DA++ GA ++ GG +    G  F+ PT++T+ITPDM
Sbjct: 299 -DPDTDVGPLATEQGRDEVEKQVDDAVAAGATILCGGKRPDGPGW-FYPPTVITDITPDM 356

Query: 404 LCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQVGMVGI 463
             YTEEVFGPVA   +    +E++ +AN +  GL    +T D ++  R    L+ G V I
Sbjct: 357 RLYTEEVFGPVASLYRVADIDEAIEIANATTFGLGSNAWTRDEAEQERFIDDLEAGQVFI 416

Query: 464 NEGIISHTEAAFGGVKESGLGREGSRHGMEEFTQIK 499
           N   +S+ E  FGGVK SG GRE S HG+ EF  IK
Sbjct: 417 NGMTVSYPELPFGGVKRSGYGRELSAHGIREFCNIK 452


>gnl|CDD|143416 cd07098, ALDH_F15-22, Aldehyde dehydrogenase family 15A1 and
           22A1-like.  Aldehyde dehydrogenase family members
           ALDH15A1 (Saccharomyces cerevisiae YHR039C) and ALDH22A1
           (Arabidopsis thaliana, EC=1.2.1.3), and similar
           sequences, are in this CD. Significant improvement of
           stress tolerance in tobacco plants was observed by
           overexpressing the ALDH22A1 gene from maize (Zea mays)
           and was accompanied by a reduction of malondialdehyde
           derived from cellular lipid peroxidation.
          Length = 465

 Score =  282 bits (724), Expect = 7e-90
 Identities = 148/474 (31%), Positives = 241/474 (50%), Gaps = 29/474 (6%)

Query: 48  YNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNKQAL 107
           Y+PA  +HL  VP    EDV  AI +A+ A  +W++T+  ER  VL+  LK +  N++ +
Sbjct: 1   YDPATGQHLGSVPADTPEDVDEAIAAARAAQREWAKTSFAERRKVLRSLLKYILENQEEI 60

Query: 108 AEIMTQESGKPPTE-SLGEVVYANSFVDWFSEEARRTYGELVPSPVKSKELLMVR----- 161
             +  +++GK   + SLGE++     + W  +   +    L P       L+  +     
Sbjct: 61  CRVACRDTGKTMVDASLGEILVTCEKIRWTLKHGEKA---LRPESRPGGLLMFYKRARVE 117

Query: 162 -EPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDT-----PLTALALAALAHE 215
            EPLGVVG I  WN+P   +   +  AL AG   V+K +E          ++    LA  
Sbjct: 118 YEPLGVVGAIVSWNYPFHNLLGPIIAALFAGNAIVVKVSEQVAWSSGFFLSIIRECLAAC 177

Query: 216 AQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLE 275
              P  V  V     T      + L + P +  I+F GS  VGK + +  ++ +  + LE
Sbjct: 178 GHDPDLVQLVTCLPET-----AEALTSHPVIDHITFIGSPPVGKKVMAAAAESLTPVVLE 232

Query: 276 LGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVEITK 335
           LGG  P IV  +A+L +  S  M   F++ GQ C+   R ++ E+++D+ +E+L  +  +
Sbjct: 233 LGGKDPAIVLDDADLDQIASIIMRGTFQSSGQNCIGIERVIVHEKIYDKLLEILT-DRVQ 291

Query: 336 SFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSK----FF 391
           +  QG     D  ++G +I+ A+ ++++ +V DA+ KGA+ +L GGK           +F
Sbjct: 292 ALRQGPPLDGD-VDVGAMISPARFDRLEELVADAVEKGAR-LLAGGKRYPHPEYPQGHYF 349

Query: 392 APTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWR 451
            PT+L ++TPDM    EEVFGPV V +K   +EE++ +AN++  GL    F  DI +A R
Sbjct: 350 PPTLLVDVTPDMKIAQEEVFGPVMVVMKASDDEEAVEIANSTEYGLGASVFGKDIKRARR 409

Query: 452 VGKKLQVGMVGINEGIISHTEAA--FGGVKESGLGREGSRHGMEEFTQIKYMCF 503
           +  +L+ GMV IN+  +++      FGGVK SG GR     G+      K +  
Sbjct: 410 IASQLETGMVAINDFGVNYYVQQLPFGGVKGSGFGRFAGEEGLRGLCNPKSVTE 463


>gnl|CDD|184426 PRK13968, PRK13968, putative succinate semialdehyde dehydrogenase;
           Provisional.
          Length = 462

 Score =  281 bits (721), Expect = 1e-89
 Identities = 153/452 (33%), Positives = 252/452 (55%), Gaps = 10/452 (2%)

Query: 49  NPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNKQALA 108
           NPA  E L+ +P  G +D++ A+  A   F  W +T    R   L+   K L    + +A
Sbjct: 13  NPATGEQLSVLPWAGADDIENALQLAAAGFRDWRETNIDYRAQKLRDIGKALRARSEEMA 72

Query: 109 EIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVPSPVKSKELLMVREPLGVVG 168
           +++T+E GKP  ++  EV  + +  DW++E       +  P+ V++++ ++   PLG + 
Sbjct: 73  QMITREMGKPINQARAEVAKSANLCDWYAEHGPAML-KAEPTLVENQQAVIEYRPLGTIL 131

Query: 169 LITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPKGVLNVVTS 228
            I PWNFP+  + R   P L AG   ++K A +    A  +A +  +A IP+GV   +  
Sbjct: 132 AIMPWNFPLWQVMRGAVPILLAGNGYLLKHAPNVMGCAQLIAQVFKDAGIPQGVYGWL-- 189

Query: 229 SRTNANSVG-KLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNAPFIVYSN 287
              NA++ G   +     +A ++ TGS + G  + +     +K+  LELGG+ PFIV ++
Sbjct: 190 ---NADNDGVSQMINDSRIAAVTVTGSVRAGAAIGAQAGAALKKCVLELGGSDPFIVLND 246

Query: 288 ANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQGDTSKNDK 347
           A+L+ AV  A+A +++N GQ C AA RF+I+E +   F E   V    +   GD  ++++
Sbjct: 247 ADLELAVKAAVAGRYQNTGQVCAAAKRFIIEEGIASAFTERF-VAAAAALKMGD-PRDEE 304

Query: 348 CNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSKFFAPTILTNITPDMLCYT 407
             LGP+      +++   V+  +++GA+++LGG K+   G+ ++APT+L N+TP+M  + 
Sbjct: 305 NALGPMARFDLRDELHHQVEATLAEGARLLLGGEKIAGAGN-YYAPTVLANVTPEMTAFR 363

Query: 408 EEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQVGMVGINEGI 467
           EE+FGPVA     K  E +L +AN+S  GL+   FT D +QA ++  +L+ G V IN   
Sbjct: 364 EELFGPVAAITVAKDAEHALELANDSEFGLSATIFTTDETQARQMAARLECGGVFINGYC 423

Query: 468 ISHTEAAFGGVKESGLGREGSRHGMEEFTQIK 499
            S    AFGGVK+SG GRE S  G+ EF  I+
Sbjct: 424 ASDARVAFGGVKKSGFGRELSHFGLHEFCNIQ 455


>gnl|CDD|132294 TIGR03250, PhnAcAld_DH, putative phosphonoacetaldehyde
           dehydrogenase.  This family of genes are members of the
           pfam00171 NAD-dependent aldehyde dehydrogenase family.
           These genes are observed in Ralstonia eutropha JMP134,
           Sinorhizobium meliloti 1021, Burkholderia mallei ATCC
           23344, Burkholderia thailandensis E264, Burkholderia
           cenocepacia AU 1054, Burkholderia pseudomallei K96243
           and 1710b, Burkholderia xenovorans LB400, Burkholderia
           sp. 383 and Polaromonas sp. JS666 in close proximity to
           the PhnW gene (TIGR02326) encoding 2-aminoethyl
           phosphonate aminotransferase (which generates
           phosphonoacetaldehyde) and PhnA (TIGR02335) encoding
           phosphonoacetate hydrolase (not to be confused with the
           alkylphosphonate utilization operon protein PhnA modeled
           by TIGR00686). Additionally, transporters believed to be
           specific for 2-aminoethyl phosphonate are often present.
           PhnW is, in other organisms, coupled with PhnX
           (TIGR01422) for the degradation of phosphonoacetaldehyde
           (GenProp0238), but PhnX is apparently absent in each of
           the organisms containing this aldehyde reductase. PhnA,
           characterized in a strain of Pseudomonas fluorescens
           that has not het been genome sequenced, is only rarely
           found outside of the PhnW and aldehyde dehydrogenase
           context. For instance in Rhodopseudomonas and Bordetella
           bronchiseptica, where it is adjacent to transporters
           presumably specific for the import of phosphonoacetate.
           It seems reasonably certain then, that this enzyme
           catalyzes the NAD-dependent oxidation of
           phosphonoacetaldehyde to phosphonoacetate, bridging the
           metabolic gap between PhnW and PhnA. We propose the name
           phosphonoacetaldehyde dehydrogenase and the gene symbol
           PhnY for this enzyme.
          Length = 472

 Score =  274 bits (703), Expect = 7e-87
 Identities = 168/474 (35%), Positives = 259/474 (54%), Gaps = 23/474 (4%)

Query: 34  INGEWVHSSKKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGIVL 93
           I GE V   +   V  P N   +  VP   V+DV+ A   A  A+      T  ER  +L
Sbjct: 8   IAGEKVSRDRVIEVRYPYNGTVVGTVPKASVDDVRRAFAIAA-AYR--PTLTRYERSAIL 64

Query: 94  KRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGE-----LV 148
            R    L   K+ +++++T ESG    +SL EV      + + + EA R  G+     L 
Sbjct: 65  DRAAALLAARKEEISDLITLESGLSKKDSLYEVGRVADVLTFAAAEALRDDGQIFSCDLT 124

Query: 149 PSPVKSKELLMVREPL-GVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTAL 207
           P   K++++   REPL GV+  ITP+N PM  +  K+APA+A     V+KP+E TPL+AL
Sbjct: 125 PHG-KARKVFTQREPLLGVISAITPFNHPMNQVAHKIAPAIATNNRMVVKPSEKTPLSAL 183

Query: 208 ALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQ 267
            LA + +EA +P  +L VVT    +   +   L T+PHV  ++FTG   +GK + +    
Sbjct: 184 YLADILYEAGLPPQMLQVVTG---DPREIADELITNPHVDLVTFTGGVAIGKYIAARA-- 238

Query: 268 GIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVE 327
           G +R  LELGGN P IV  +A+L +A   A+   ++N GQ C A  R L+QE V D+F E
Sbjct: 239 GYRRQVLELGGNDPLIVMEDADLDRAADLAVKGSYKNSGQRCTAVKRMLVQESVADRFTE 298

Query: 328 MLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLG 387
           +L VE T+++  GD   +   ++G +I+ A     ++ V +AI++GA+++LG  +   L 
Sbjct: 299 LL-VEKTRAWRYGDPM-DPSVDMGTVIDEAAAILFEARVNEAIAQGARLLLGNVRDGAL- 355

Query: 388 SKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDIS 447
              +APT+L  + P M    EE FGPV+  I+F   ++++ ++N++  GL+    TN + 
Sbjct: 356 ---YAPTVLDRVDPSMTLVREETFGPVSPVIRFCDIDDAIRISNSTAYGLSSGVCTNRLD 412

Query: 448 QAWRVGKKLQVGMVGINEGIISHTEAA-FGGVKESGLG-REGSRHGMEEFTQIK 499
              R   +LQVG V + E      E   FGG+K+SGLG +EG +  M+ FT +K
Sbjct: 413 YITRFIAELQVGTVNVWEVPGYRLELTPFGGIKDSGLGYKEGVQEAMKSFTNLK 466


>gnl|CDD|179543 PRK03137, PRK03137, 1-pyrroline-5-carboxylate dehydrogenase;
           Provisional.
          Length = 514

 Score =  269 bits (689), Expect = 4e-84
 Identities = 149/469 (31%), Positives = 232/469 (49%), Gaps = 37/469 (7%)

Query: 33  YINGEWVHSSKKFPVYNPAN-DEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGI 91
            I GE + +  K    NPAN  E +  V     E  + A+ +A EAFE W + +P++R  
Sbjct: 40  IIGGERITTEDKIVSINPANKSEVVGRVSKATKELAEKAMQAALEAFETWKKWSPEDRAR 99

Query: 92  VLKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVPSP 151
           +L R    +   K   +  + +E+GKP  E+  +   A  F+++++ +  +   +    P
Sbjct: 100 ILLRAAAIIRRRKHEFSAWLVKEAGKPWAEADADTAEAIDFLEYYARQMLK-LAD--GKP 156

Query: 152 VKSKE---LLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALA 208
           V+S+          PLGV  +I+PWNFP A++      A+ AG T ++KPA DTP+ A  
Sbjct: 157 VESRPGEHNRYFYIPLGVGVVISPWNFPFAIMAGMTLAAIVAGNTVLLKPASDTPVIAAK 216

Query: 209 LAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCS-- 266
              +  EA +P GV+N V  S    + VG  L   P    I+FTGS +VG  +Y   +  
Sbjct: 217 FVEVLEEAGLPAGVVNFVPGS---GSEVGDYLVDHPKTRFITFTGSREVGLRIYERAAKV 273

Query: 267 ----QGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVF 322
                 +KR+  E+GG    +V  +A+L  A    +A+ F   GQ C A +R ++ E V+
Sbjct: 274 QPGQIWLKRVIAEMGGKDAIVVDEDADLDLAAESIVASAFGFSGQKCSACSRAIVHEDVY 333

Query: 323 DQFVEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKG-AKVILGGG 381
           D+ +E + VE+TK    G+   N    +GP+IN A  +K+ S ++  I K   +++LGG 
Sbjct: 334 DEVLEKV-VELTKELTVGNPEDNA--YMGPVINQASFDKIMSYIE--IGKEEGRLVLGGE 388

Query: 382 KVTKLGSKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYF 441
                G  F  PTI  ++ P      EE+FGPV   IK K  + +L +ANN+  GL G  
Sbjct: 389 GDDSKGY-FIQPTIFADVDPKARIMQEEIFGPVVAFIKAKDFDHALEIANNTEYGLTGAV 447

Query: 442 FTND---ISQAWRVGKKLQVGMVGINEGIISHTEA-----AFGGVKESG 482
            +N+   + +A R   +  VG +  N G    T A      FGG   SG
Sbjct: 448 ISNNREHLEKARR---EFHVGNLYFNRGC---TGAIVGYHPFGGFNMSG 490


>gnl|CDD|143448 cd07130, ALDH_F7_AASADH, NAD+-dependent alpha-aminoadipic
           semialdehyde dehydrogenase, ALDH family members 7A1 and
           7B.  Alpha-aminoadipic semialdehyde dehydrogenase
           (AASADH, EC=1.2.1.31), also known as ALDH7A1,
           Antiquitin-1, ALDH7B, or
           delta-1-piperideine-6-carboxylate dehydrogenase (P6CDH),
           is a NAD+-dependent ALDH. Human ALDH7A1 is involved in
           the pipecolic acid pathway of lysine catabolism,
           catalyzing the oxidation of alpha-aminoadipic
           semialdehyde to alpha-aminoadipate.  Arabidopsis
           thaliana ALDH7B4 appears to be an
           osmotic-stress-inducible ALDH gene encoding a
           turgor-responsive or stress-inducible ALDH. The
           Streptomyces clavuligerus P6CDH appears to be involved
           in cephamycin biosynthesis, catalyzing the second stage
           of the two-step conversion of lysine to
           alpha-aminoadipic acid.  The ALDH7A1 enzyme and others
           in this group have been observed as tetramers, yet the
           bacterial P6CDH enzyme has been reported as a monomer.
          Length = 474

 Score =  261 bits (669), Expect = 1e-81
 Identities = 159/470 (33%), Positives = 230/470 (48%), Gaps = 28/470 (5%)

Query: 33  YINGEWVHSSKKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGIV 92
             +GEW          +PAN E +A V     ED +  I +A+EAF++W      +RG +
Sbjct: 2   VYDGEWGGGGGVVTSISPANGEPIARVRQATPEDYESTIKAAQEAFKEWRDVPAPKRGEI 61

Query: 93  LKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEA----RRTYGELV 148
           +++    L   K+AL ++++ E GK   E LGEV      +D   + A    R+ YG  +
Sbjct: 62  VRQIGDALRKKKEALGKLVSLEMGKILPEGLGEV---QEMID-ICDFAVGLSRQLYGLTI 117

Query: 149 PSPVKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALA 208
           PS      ++    PLGVVG+IT +NFP+A+     A AL  G   V KP+  TPLTA+A
Sbjct: 118 PSERPGHRMMEQWNPLGVVGVITAFNFPVAVWGWNAAIALVCGNVVVWKPSPTTPLTAIA 177

Query: 209 ----LAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSH 264
               +A +  +  +P  + ++V         VG+ L   P V  +SFTGST VG+ +   
Sbjct: 178 VTKIVARVLEKNGLPGAIASLVCGGAD----VGEALVKDPRVPLVSFTGSTAVGRQVGQA 233

Query: 265 CSQGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQ 324
            +    R  LELGGN   IV  +A+L  AV   + A     GQ C    R ++ E ++D+
Sbjct: 234 VAARFGRSLLELGGNNAIIVMEDADLDLAVRAVLFAAVGTAGQRCTTTRRLIVHESIYDE 293

Query: 325 FVEMLKVEITKSFIQGDTSKNDKCNL-GPLINLAQVEKVDSIVQDAISKGAKVILGGGKV 383
            +E LK +  K    GD   +    L GPL   A V+   + +++A S+G   +L GGKV
Sbjct: 294 VLERLK-KAYKQVRIGDPLDDG--TLVGPLHTKAAVDNYLAAIEEAKSQGGT-VLFGGKV 349

Query: 384 TKLGSKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFT 443
                 +  PTI+     D     EE F P+   +KF T EE++   N    GL+   FT
Sbjct: 350 IDGPGNYVEPTIVE-GLSDAPIVKEETFAPILYVLKFDTLEEAIAWNNEVPQGLSSSIFT 408

Query: 444 NDISQA--WRVGKKLQVGMVGINEGIISHTE--AAFGGVKESGLGRE-GS 488
            D+  A  W   K    G+V +N G  S  E   AFGG KE+G GRE GS
Sbjct: 409 TDLRNAFRWLGPKGSDCGIVNVNIG-TSGAEIGGAFGGEKETGGGRESGS 457


>gnl|CDD|130783 TIGR01722, MMSDH, methylmalonic acid semialdehyde dehydrogenase.
           Involved in valine catabolism,
           methylmalonate-semialdehyde dehydrogenase catalyzes the
           irreversible NAD+- and CoA-dependent oxidative
           decarboxylation of methylmalonate semialdehyde to
           propionyl-CoA. Methylmalonate-semialdehyde dehydrogenase
           has been characterized in both prokaryotes and
           eukaryotes, functioning as a mammalian tetramer and a
           bacterial homodimer. Although similar in monomeric
           molecular mass and enzymatic activity, the N-terminal
           sequence in P.aeruginosa does not correspond with the
           N-terminal sequence predicted for rat liver. Sequence
           homology to a variety of prokaryotic and eukaryotic
           aldehyde dehydrogenases places MMSDH in the aldehyde
           dehydrogenase (NAD+) superfamily (pfam00171), making
           MMSDH's CoA requirement unique among known ALDHs.
           Methylmalonate semialdehyde dehydrogenase is closely
           related to betaine aldehyde dehydrogenase,
           2-hydroxymuconic semialdehyde dehydrogenase, and class 1
           and 2 aldehyde dehydrogenase. In Bacillus, a highly
           homologous protein to methylmalonic acid semialdehyde
           dehydrogenase, groups out from the main MMSDH clade with
           Listeria and Sulfolobus. This Bacillus protein has been
           suggested to be located in an iol operon and/or involved
           in myo-inositol catabolism, converting malonic
           semialdehyde to acetyl CoA ad CO2. The preceeding
           enzymes responsible for valine catabolism are present in
           Bacillus, Listeria, and Sulfolobus [Energy metabolism,
           Amino acids and amines].
          Length = 477

 Score =  243 bits (621), Expect = 1e-74
 Identities = 150/477 (31%), Positives = 234/477 (49%), Gaps = 19/477 (3%)

Query: 33  YINGEWVH--SSKKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERG 90
           +I G++    S    PV NPA +E    V    V++V  A+ SA+E F  W QT+  +R 
Sbjct: 4   WIGGKFAEGASGTYIPVTNPATNEVTTKVAFASVDEVDAAVASARETFLTWGQTSLAQRT 63

Query: 91  IVLKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVPS 150
            VL R+   L+ ++  +AE++T E GK  +++LG+V      V+          GE    
Sbjct: 64  SVLLRYQALLKEHRDEIAELITAEHGKTHSDALGDVARGLEVVEHACGVNSLLKGETSTQ 123

Query: 151 PVKSKELLMVREPLGVVGLITPWNFPMAMITRKVAP-ALAAGCTCVIKPAEDTPLTALAL 209
                ++  +R+PLGV   ITP+NFP AMI   + P A+A G T V+KP+E  P  A+ L
Sbjct: 124 VATRVDVYSIRQPLGVCAGITPFNFP-AMIPLWMFPIAIACGNTFVLKPSEKVPSAAVKL 182

Query: 210 AALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGI 269
           A L  EA  P GVLNVV   +   +     L   P V  +SF GST +G+ +++  S   
Sbjct: 183 AELFSEAGAPDGVLNVVHGDKEAVDR----LLEHPDVKAVSFVGSTPIGRYIHTTGSAHG 238

Query: 270 KRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEML 329
           KR+    G     +V  +A+   A    + A +   GQ C+A +  ++     D++V  +
Sbjct: 239 KRVQALGGAKNHMVVMPDADKDAAADALVGAAYGAAGQRCMAISAAVLVGAA-DEWVPEI 297

Query: 330 KVEITKSFIQ-GDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLG- 387
           +    K  I  GD    +   +GPLI     ++V S++    ++GA+V+L G      G 
Sbjct: 298 RERAEKIRIGPGDDPGAE---MGPLITPQAKDRVASLIAGGAAEGAEVLLDGRGYKVDGY 354

Query: 388 --SKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTND 445
               +  PT+L  + P M  Y EE+FGPV   ++  T EE++ + N S  G     FT D
Sbjct: 355 EEGNWVGPTLLERVPPTMKAYQEEIFGPVLCVLEADTLEEAIALINASPYGNGTAIFTRD 414

Query: 446 ISQAWRVGKKLQVGMVGINEGI-ISHTEAAFGGVKESGLGREG--SRHGMEEFTQIK 499
            + A R   +++VG VG+N  I +     +F G K+S  G      + G   +T+ K
Sbjct: 415 GAAARRFQHEIEVGQVGVNVPIPVPLPYFSFTGWKDSFFGDHHIYGKQGTHFYTRGK 471


>gnl|CDD|143466 cd07148, ALDH_RL0313, Uncharacterized ALDH ( RL0313) with
           similarity to Tortula ruralis aldehyde dehydrogenase
           ALDH21A1.  Uncharacterized aldehyde dehydrogenase (locus
           RL0313) with sequence similarity to the moss Tortula
           ruralis aldehyde dehydrogenase ALDH21A1 (RNP123)
           believed to play an important role in the detoxification
           of aldehydes generated in response to desiccation- and
           salinity-stress, and similar sequences are included in
           this CD.
          Length = 455

 Score =  241 bits (616), Expect = 4e-74
 Identities = 146/468 (31%), Positives = 234/468 (50%), Gaps = 26/468 (5%)

Query: 47  VYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPK-ERGIVLKRWLKELEGNKQ 105
           V NP + + + +VP +    +  A+ +A   F   +   P  ER  +L+R    +E    
Sbjct: 3   VVNPFDLKPIGEVPTVDWAAIDKALDTAHALFLDRNNWLPAHERIAILERLADLMEERAD 62

Query: 106 ALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVP---SPVKSKEL-LMVR 161
            LA ++ +E GKP  ++  EV  A   V+  ++E  +  G  +P   +P  +  +    R
Sbjct: 63  ELALLIAREGGKPLVDAKVEVTRAIDGVELAADELGQLGGREIPMGLTPASAGRIAFTTR 122

Query: 162 EPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPKG 221
           EP+GVV  I+ +N P+ +I  +VAPA+AAGC  ++KPA  TPL+ LA   L HEA +P+G
Sbjct: 123 EPIGVVVAISAFNHPLNLIVHQVAPAIAAGCPVIVKPALATPLSCLAFVDLLHEAGLPEG 182

Query: 222 VLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNAP 281
               V       N+V + L T P VA  SF GS +VG +L S  + G  R +LE GG AP
Sbjct: 183 WCQAVPC----ENAVAEKLVTDPRVAFFSFIGSARVGWMLRSKLAPG-TRCALEHGGAAP 237

Query: 282 FIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQGD 341
            IV  +A+L   +   +   F + GQ CV+  R  +  ++ D F + L     +  + GD
Sbjct: 238 VIVDRSADLDAMIPPLVKGGFYHAGQVCVSVQRVFVPAEIADDFAQRL-AAAAEKLVVGD 296

Query: 342 -TSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSKFFAPTILTNIT 400
            T  + +   GPLI   +V++V+  V +A++ GA+++ GG    +L    +APT+L +  
Sbjct: 297 PTDPDTEV--GPLIRPREVDRVEEWVNEAVAAGARLLCGG---KRLSDTTYAPTVLLDPP 351

Query: 401 PDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQVGM 460
            D    T+E+FGPV     +   +E++  AN+         FT D+  A +  ++L    
Sbjct: 352 RDAKVSTQEIFGPVVCVYSYDDLDEAIAQANSLPVAFQAAVFTKDLDVALKAVRRLDATA 411

Query: 461 VGINEGIISHTE-----AAFGGVKESGLGREGSRHGMEEFTQIKYMCF 503
           V +N+    HT        F G ++SG G  G  + M + TQ K    
Sbjct: 412 VMVND----HTAFRVDWMPFAGRRQSGYGTGGIPYTMHDMTQEKMAVI 455


>gnl|CDD|200087 TIGR01237, D1pyr5carbox2, delta-1-pyrroline-5-carboxylate
           dehydrogenase, group 2, putative.  This enzyme is the
           second of two in the degradation of proline to
           glutamate. This model represents one of several related
           branches of delta-1-pyrroline-5-carboxylate
           dehydrogenase. Members of this branch may be associated
           with proline dehydrogenase (the other enzyme of the
           pathway from proline to glutamate) but have not been
           demonstrated experimentally. The branches are not as
           closely related to each other as some distinct aldehyde
           dehydrogenases are to some; separate models were built
           to let each model describe a set of equivalogs [Energy
           metabolism, Amino acids and amines].
          Length = 511

 Score =  240 bits (615), Expect = 2e-73
 Identities = 136/461 (29%), Positives = 221/461 (47%), Gaps = 16/461 (3%)

Query: 33  YINGEWVHSSKKFPVYNPAN-DEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGI 91
            INGE V +  K    NP +  E +  V     E  ++A+ +A +AFE W +T P+ER  
Sbjct: 36  VINGERVETENKIVSINPCDKSEVVGTVSKASQEHAEHALQAAAKAFEAWKKTDPEERAA 95

Query: 92  VLKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVPSP 151
           +L +    +   +   + ++ +E GKP  E+  EV  A  F+++++ +          + 
Sbjct: 96  ILFKAAAIVRRRRHEFSALLVKEVGKPWNEADAEVAEAIDFMEYYARQMIELAKGKPVNS 155

Query: 152 VKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAA 211
            + +    V  P GV  +I+PWNFP A++       +  G   V+KPAE  P+ A     
Sbjct: 156 REGETNQYVYTPTGVTVVISPWNFPFAIMVGMTVAPIVTGNCVVLKPAEAAPVIAAKFVE 215

Query: 212 LAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCS----- 266
           +  EA +PKGV+  V  S    + VG  L   P  + I+FTGS +VG  ++   +     
Sbjct: 216 ILEEAGLPKGVVQFVPGS---GSEVGDYLVDHPKTSLITFTGSREVGTRIFERAAKVQPG 272

Query: 267 -QGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQF 325
            + +KR+  E+GG    IV  +A+++ A   A  + F   GQ C A +R ++ E+V+D+ 
Sbjct: 273 QKHLKRVIAEMGGKDTVIVDEDADIELAAQSAFTSAFGFAGQKCSAGSRAVVHEKVYDEV 332

Query: 326 VEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTK 385
           VE   VEIT+S   G     D   +GP+I+     K+   ++   ++G +++ GG     
Sbjct: 333 VERF-VEITESLKVGPPDSAD-VYVGPVIDQKSFNKIMEYIEIGKAEG-RLVSGGCGDDS 389

Query: 386 LGSKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTND 445
            G  F  PTI  ++        EE+FGPV   I+    +E+L +ANN+  GL G   +N+
Sbjct: 390 KGY-FIGPTIFADVDRKARLAQEEIFGPVVAFIRASDFDEALEIANNTEYGLTGGVISNN 448

Query: 446 ISQAWRVGKKLQVGMVGINEGIISHTEA--AFGGVKESGLG 484
                R   + +VG +  N  I         FGG K SG  
Sbjct: 449 RDHINRAKAEFEVGNLYFNRNITGAIVGYQPFGGFKMSGTD 489


>gnl|CDD|143406 cd07087, ALDH_F3-13-14_CALDH-like, ALDH subfamily: Coniferyl
           aldehyde dehydrogenase, ALDH families 3, 13, and 14, and
           other related proteins.  ALDH subfamily which includes
           NAD(P)+-dependent, aldehyde dehydrogenase, family 3
           member A1 and B1  (ALDH3A1, ALDH3B1,  EC=1.2.1.5) and
           fatty aldehyde dehydrogenase, family 3 member A2
           (ALDH3A2, EC=1.2.1.3), and also plant ALDH family
           members ALDH3F1, ALDH3H1, and ALDH3I1, fungal ALDH14
           (YMR110C) and the protozoan family 13 member (ALDH13),
           as well as coniferyl aldehyde dehydrogenases (CALDH,
           EC=1.2.1.68), and other similar  sequences, such as the
           Pseudomonas putida benzaldehyde dehydrogenase I that is
           involved in the metabolism of mandelate.
          Length = 426

 Score =  230 bits (590), Expect = 2e-70
 Identities = 132/420 (31%), Positives = 209/420 (49%), Gaps = 36/420 (8%)

Query: 93  LKRWLKELEGNKQALAEIMTQESGKPPTES--------LGEVVYA-NSFVDWFSEEARRT 143
           LK   + L  N++ +A  +  + GKPP E+        LGE+ +A      W   + RR 
Sbjct: 26  LKALKRMLTENEEEIAAALYADLGKPPAEAYLTEIAVVLGEIDHALKHLKKWM--KPRRV 83

Query: 144 YGELVPSPVKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTP 203
              L+  P K+    ++ EPLGVV +I PWN+P+ +    +  A+AAG T V+KP+E  P
Sbjct: 84  SVPLLLQPAKAY---VIPEPLGVVLIIGPWNYPLQLALAPLIGAIAAGNTVVLKPSELAP 140

Query: 204 LTALALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSP--HVAGISFTGSTQVGKIL 261
            T+  LA L  +       + VV         V   L   P  H   I FTGS  VGKI+
Sbjct: 141 ATSALLAKLIPKY-FDPEAVAVVEG----GVEVATALLAEPFDH---IFFTGSPAVGKIV 192

Query: 262 YSHCSQGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQV 321
               ++ +  ++LELGG +P IV  +ANL+ A       KF N GQ C+A +  L+ E +
Sbjct: 193 MEAAAKHLTPVTLELGGKSPCIVDKDANLEVAARRIAWGKFLNAGQTCIAPDYVLVHESI 252

Query: 322 FDQFVEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGG 381
            D+ +E LK  I K F   D  ++   + G +IN    +++ S++ D      KV++GG 
Sbjct: 253 KDELIEELKKAI-KEFYGEDPKESP--DYGRIINERHFDRLASLLDD-----GKVVIGGQ 304

Query: 382 KVTKLGSKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYF 441
                  ++ APTIL +++PD     EE+FGP+   + +   +E++   N+    LA Y 
Sbjct: 305 --VDKEERYIAPTILDDVSPDSPLMQEEIFGPILPILTYDDLDEAIEFINSRPKPLALYL 362

Query: 442 FTNDISQAWRVGKKLQVGMVGINEGII--SHTEAAFGGVKESGLGREGSRHGMEEFTQIK 499
           F+ D +   RV  +   G V +N+ ++  +     FGGV  SG+G    + G + F+ +K
Sbjct: 363 FSEDKAVQERVLAETSSGGVCVNDVLLHAAIPNLPFGGVGNSGMGAYHGKAGFDTFSHLK 422


>gnl|CDD|143452 cd07134, ALDH_AlkH-like, Pseudomonas putida Aldehyde dehydrogenase
           AlkH-like.  Aldehyde dehydrogenase AlkH (locus name
           P12693, EC=1.2.1.3) of the alkBFGHJKL operon that allows
           Pseudomonas putida to metabolize alkanes and the
           aldehyde dehydrogenase AldX of Bacillus subtilis (locus
           P46329, EC=1.2.1.3), and similar sequences, are present
           in this CD.
          Length = 433

 Score =  226 bits (578), Expect = 1e-68
 Identities = 136/442 (30%), Positives = 210/442 (47%), Gaps = 43/442 (9%)

Query: 75  KEAFEQWSQTTPKERGIVLKRWLKELEGNKQALAEIMTQESGKPPTE-SLGEV--VYANS 131
           +        +T  ER   LKR  K +   ++ +   +  +  KP  E  L E+  V +  
Sbjct: 8   QAHALALRASTAAERIAKLKRLKKAILARREEIIAALAADFRKPAAEVDLTEILPVLS-- 65

Query: 132 FVDWFSEEARRTYGEL--------VPSPV-----KSKELLMVREPLGVVGLITPWNFPMA 178
                  E       L        V +P+     KSK   +  EP GV  +I+PWN+P  
Sbjct: 66  -------EINHAIKHLKKWMKPKRVRTPLLLFGTKSK---IRYEPKGVCLIISPWNYPFN 115

Query: 179 MITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPKGVLNVVTSSRTNANSVGK 238
           +    +  A+AAG T ++KP+E TP T+  +A +  EA     V     +       V +
Sbjct: 116 LAFGPLVSAIAAGNTAILKPSELTPHTSAVIAKIIREAFDEDEV-----AVFEGDAEVAQ 170

Query: 239 LLCTSP--HVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNAPFIVYSNANLQKAVSG 296
            L   P  H   I FTGS  VGKI+ +  ++ +  ++LELGG +P IV   A+L+KA   
Sbjct: 171 ALLELPFDH---IFFTGSPAVGKIVMAAAAKHLASVTLELGGKSPTIVDETADLKKAAKK 227

Query: 297 AMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQGDTSKNDKCNLGPLINL 356
               KF N GQ C+A +   + E V D FVE LK EI K + +    K    +L  ++N 
Sbjct: 228 IAWGKFLNAGQTCIAPDYVFVHESVKDAFVEHLKAEIEKFYGKDAARK-ASPDLARIVND 286

Query: 357 AQVEKVDSIVQDAISKGAKVILGGGKVTKLGSKFFAPTILTNITPDMLCYTEEVFGPVAV 416
              +++  ++ DA++KGAKV  GG        ++ APT+LTN+TPDM    EE+FGPV  
Sbjct: 287 RHFDRLKGLLDDAVAKGAKVEFGG--QFDAAQRYIAPTVLTNVTPDMKIMQEEIFGPVLP 344

Query: 417 CIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQVGMVGINEGII--SHTEAA 474
            I ++  +E +   N     LA Y F+ D +   +V  +   G V +N+ ++   +    
Sbjct: 345 IITYEDLDEVIEYINAKPKPLALYVFSKDKANVNKVLARTSSGGVVVNDVVLHFLNPNLP 404

Query: 475 FGGVKESGLGREGSRHGMEEFT 496
           FGGV  SG+G     +G + F+
Sbjct: 405 FGGVNNSGIGSYHGVYGFKAFS 426


>gnl|CDD|143414 cd07095, ALDH_SGSD_AstD, N-succinylglutamate 5-semialdehyde
           dehydrogenase, AstD-like.  N-succinylglutamate
           5-semialdehyde dehydrogenase or succinylglutamic
           semialdehyde dehydrogenase (SGSD, E. coli AstD,
           EC=1.2.1.71) involved in L-arginine degradation via the
           arginine succinyltransferase (AST) pathway and catalyzes
           the NAD+-dependent reduction of succinylglutamate
           semialdehyde into succinylglutamate.
          Length = 431

 Score =  221 bits (566), Expect = 5e-67
 Identities = 131/433 (30%), Positives = 212/433 (48%), Gaps = 24/433 (5%)

Query: 66  DVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNKQALAEIMTQESGKPPTESLGE 125
            V  A+ +A+ AF  W+  + +ER  +L+R+ + L+ NK+ LA ++++E+GKP  E+  E
Sbjct: 1   QVDAAVAAARAAFPGWAALSLEERAAILRRFAELLKANKEELARLISRETGKPLWEAQTE 60

Query: 126 VVYANSFVDWFSEEARRTYGELVPSPVKSKELLMVREPLGVVGLITPWNFPMAMITRKVA 185
           V      +D  S +A         +P+     ++   P GV+ +  P+NFP  +    + 
Sbjct: 61  VAAMAGKID-ISIKAYHERTGERATPMAQGRAVLRHRPHGVMAVFGPFNFPGHLPNGHIV 119

Query: 186 PALAAGCTCVIKPAEDTPLTALALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPH 245
           PAL AG T V KP+E TP  A  +  L  EA +P GVLN+V   R      G+ L     
Sbjct: 120 PALLAGNTVVFKPSELTPAVAELMVELWEEAGLPPGVLNLVQGGRE----TGEALAAHEG 175

Query: 246 VAGISFTGSTQVGKILYSH-CSQGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRN 304
           + G+ FTGS   G +L+     +  K L+LE+GGN P +V+  A++  A    + + F  
Sbjct: 176 IDGLLFTGSAATGLLLHRQFAGRPGKILALEMGGNNPLVVWDVADIDAAAYLIVQSAFLT 235

Query: 305 CGQACVAANR-FLIQEQVFDQFVEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVD 363
            GQ C  A R  +    V D F+E L VE  K    G         +GPLI  A   +  
Sbjct: 236 AGQRCTCARRLIVPDGAVGDAFLERL-VEAAKRLRIGAPDAEPP-FMGPLIIAAAAARYL 293

Query: 364 SIVQDAISKGAKVILGGGKVTKL--GSKFFAPTIL--TNIT--PDMLCYTEEVFGPVAVC 417
              QD ++ G + +L    + +L  G+ F +P I+  T+    PD     EE+FGP+   
Sbjct: 294 LAQQDLLALGGEPLL---AMERLVAGTAFLSPGIIDVTDAADVPD-----EEIFGPLLQV 345

Query: 418 IKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQVGMVGINEGII-SHTEAAFG 476
            ++   +E++ +AN +  GL+    ++D +   R   +++ G+V  N     + + A FG
Sbjct: 346 YRYDDFDEAIALANATRFGLSAGLLSDDEALFERFLARIRAGIVNWNRPTTGASSTAPFG 405

Query: 477 GVKESGLGREGSR 489
           GV  SG  R  + 
Sbjct: 406 GVGLSGNHRPSAY 418


>gnl|CDD|215110 PLN00412, PLN00412, NADP-dependent glyceraldehyde-3-phosphate
           dehydrogenase; Provisional.
          Length = 496

 Score =  219 bits (560), Expect = 2e-65
 Identities = 151/478 (31%), Positives = 233/478 (48%), Gaps = 25/478 (5%)

Query: 33  YINGEWVHSS--KKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERG 90
           Y +GEW  SS  K   + NP+  +    V     E+V  A+ SAK A + W++T   +R 
Sbjct: 19  YADGEWRTSSSGKSVAITNPSTRKTQYKVQACTQEEVNKAMESAKAAQKAWAKTPLWKRA 78

Query: 91  IVLKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGE---L 147
            +L +    L+ +K  +AE + +E  KP  +++ EVV +   + + +EE  R  GE   L
Sbjct: 79  ELLHKAAAILKEHKAPIAECLVKEIAKPAKDAVTEVVRSGDLISYTAEEGVRILGEGKFL 138

Query: 148 V----PSPVKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTP 203
           V    P   ++K  L  + PLGVV  I P+N+P+ +   K+APAL AG   V+KP     
Sbjct: 139 VSDSFPGNERNKYCLTSKIPLGVVLAIPPFNYPVNLAVSKIAPALIAGNAVVLKPPTQGA 198

Query: 204 LTALALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYS 263
           + AL +    H A  PKG+++ VT      + +G  L   P V  ISFTG    G    +
Sbjct: 199 VAALHMVHCFHLAGFPKGLISCVTG---KGSEIGDFLTMHPGVNCISFTG----GDTGIA 251

Query: 264 HCSQ-GIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVF 322
              + G+  L +ELGG    IV  +A+L  A +  +   F   GQ C A    L+ E V 
Sbjct: 252 ISKKAGMVPLQMELGGKDACIVLEDADLDLAAANIIKGGFSYSGQRCTAVKVVLVMESVA 311

Query: 323 DQFVEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGK 382
           D  VE +  ++ K  +       D C++ P+++ +    ++ +V DA  KGA       +
Sbjct: 312 DALVEKVNAKVAKLTV---GPPEDDCDITPVVSESSANFIEGLVMDAKEKGATFC----Q 364

Query: 383 VTKLGSKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFF 442
             K       P +L N+ PDM    EE FGPV   I+  + EE +   N SN GL G  F
Sbjct: 365 EWKREGNLIWPLLLDNVRPDMRIAWEEPFGPVLPVIRINSVEEGIHHCNASNFGLQGCVF 424

Query: 443 TNDISQAWRVGKKLQVGMVGINEGIISHTEA-AFGGVKESGLGREGSRHGMEEFTQIK 499
           T DI++A  +   ++ G V IN       +   F G+K+SG+G +G  + +   T++K
Sbjct: 425 TRDINKAILISDAMETGTVQINSAPARGPDHFPFQGLKDSGIGSQGITNSINMMTKVK 482


>gnl|CDD|143443 cd07125, ALDH_PutA-P5CDH, Delta(1)-pyrroline-5-carboxylate
           dehydrogenase, PutA.  The proline catabolic enzymes of
           the aldehyde dehydrogenase (ALDH) protein superfamily,
           proline dehydrogenase and
           Delta(1)-pyrroline-5-carboxylate dehydrogenase (P5CDH,
           (EC=1.5.1.12 )), catalyze the two-step oxidation of
           proline to glutamate; P5CDH catalyzes the oxidation of
           glutamate semialdehyde, utilizing NAD+ as the electron
           acceptor. In some bacteria, the two enzymes are fused
           into the bifunctional flavoenzyme, proline utilization A
           (PutA) These enzymes play important roles in cellular
           redox control, superoxide generation, and apoptosis. In
           certain prokaryotes such as Escherichia coli, PutA is
           also a transcriptional repressor of the proline
           utilization genes.
          Length = 518

 Score =  218 bits (558), Expect = 6e-65
 Identities = 147/470 (31%), Positives = 226/470 (48%), Gaps = 39/470 (8%)

Query: 34  INGEWVHSSKKFPVYNPANDE-HLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGIV 92
           INGE   + +  PV +PA+ E  + +V     EDV  A+  A  AF  WS T  +ER  +
Sbjct: 37  INGEETETGEGAPVIDPADHERTIGEVSLADAEDVDAALAIAAAAFAGWSATPVEERAEI 96

Query: 93  LKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGE-LVPSP 151
           L++    LE N+  L  +   E+GK   ++  EV  A  F  +++ +AR  + +  +P P
Sbjct: 97  LEKAADLLEANRGELIALAAAEAGKTLADADAEVREAIDFCRYYAAQARELFSDPELPGP 156

Query: 152 V-KSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALA 210
             +   L +     GV   I+PWNFP+A+ T ++A ALAAG T + KPAE TPL A    
Sbjct: 157 TGELNGLEL--HGRGVFVCISPWNFPLAIFTGQIAAALAAGNTVIAKPAEQTPLIAARAV 214

Query: 211 ALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQ--- 267
            L HEA +P+ VL +V         +G+ L   P + G+ FTGST+  K++    ++   
Sbjct: 215 ELLHEAGVPRDVLQLVPGDG---EEIGEALVAHPRIDGVIFTGSTETAKLINRALAERDG 271

Query: 268 GIKRLSLELGG-NAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFV 326
            I  L  E GG NA  IV S A  ++AV   + + F + GQ C A     +QE++ ++F+
Sbjct: 272 PILPLIAETGGKNA-MIVDSTALPEQAVKDVVQSAFGSAGQRCSALRLLYLQEEIAERFI 330

Query: 327 EMLKVEITKSFIQGDTS--KNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVT 384
           EMLK     S   GD      D   +GPLI+    + + +  +  + +G   ++    + 
Sbjct: 331 EMLK-GAMASLKVGDPWDLSTD---VGPLIDKPAGKLLRAHTE--LMRGEAWLIAPAPLD 384

Query: 385 KLGSKFFAPTI--LTNITPDMLCYTEEVFGPVAVCIKFKTE--EESLCVANNSNTGLAGY 440
                F AP I  +  I       T EVFGP+   I+FK E  +E++   N +  GL   
Sbjct: 385 DGNGYFVAPGIIEIVGIF----DLTTEVFGPILHVIRFKAEDLDEAIEDINATGYGLTLG 440

Query: 441 FFTNDISQAWRVGKKLQVGMVGINEGIISHTEAA------FGGVKESGLG 484
             + D  +     ++++ G + IN  I      A      FGG   SG G
Sbjct: 441 IHSRDEREIEYWRERVEAGNLYINRNIT----GAIVGRQPFGGWGLSGTG 486


>gnl|CDD|143453 cd07135, ALDH_F14-YMR110C, Saccharomyces cerevisiae aldehyde
           dehydrogenase family 14 and related proteins.  Aldehyde
           dehydrogenase family 14 (ALDH14), isolated mainly from
           the mitochondrial outer membrane of Saccharomyces
           cerevisiae (YMR110C) and most closely related to the
           plant and animal ALDHs and fatty ALDHs family 3 members,
           and similar fungal sequences, are present in this CD.
          Length = 436

 Score =  216 bits (552), Expect = 7e-65
 Identities = 132/419 (31%), Positives = 219/419 (52%), Gaps = 35/419 (8%)

Query: 94  KRWLKEL----EGNKQALAEIMTQESGKPPTESL--------GEVVYANSFVDWFSEEAR 141
              LK+L    + N++A+ E + ++ G+PP E+L         ++++    +  ++++ +
Sbjct: 30  LWQLKQLYWAVKDNEEAIVEALKKDLGRPPFETLLTEVSGVKNDILHMLKNLKKWAKDEK 89

Query: 142 RTYGELVPSPVKSKELLMVR-EPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAE 200
              G L     K +    +R EPLGVV +I PWN+P+ +    +  A+AAGCT V+KP+E
Sbjct: 90  VKDGPLAFMFGKPR----IRKEPLGVVLIIGPWNYPVLLALSPLVGAIAAGCTVVLKPSE 145

Query: 201 DTPLTALALAALAHEAQIPKGVLNVVTSSRTNANSVGKLL-CTSPHVAGISFTGSTQVGK 259
            TP TA  LA L  +  +      VV            LL         I +TGS +VG+
Sbjct: 146 LTPHTAALLAELVPKY-LDPDAFQVVQGG---VPETTALLEQKFDK---IFYTGSGRVGR 198

Query: 260 ILYSHCSQGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQE 319
           I+    ++ +  ++LELGG +P IV  NA+L+ A    +  KF N GQ CVA +  L+  
Sbjct: 199 IIAEAAAKHLTPVTLELGGKSPVIVTKNADLELAAKRILWGKFGNAGQICVAPDYVLVDP 258

Query: 320 QVFDQFVEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILG 379
            V+D+FVE LK  + + +  G  +  D      ++N     ++ S++    +KG KV++G
Sbjct: 259 SVYDEFVEELKKVLDEFYPGGANASPD---YTRIVNPRHFNRLKSLLDT--TKG-KVVIG 312

Query: 380 GGKVTKLGSKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAG 439
           G       ++F  PTI+++++ D    +EE+FGPV   IK    +E++ V N+ +T LA 
Sbjct: 313 G--EMDEATRFIPPTIVSDVSWDDSLMSEELFGPVLPIIKVDDLDEAIKVINSRDTPLAL 370

Query: 440 YFFTNDISQAWRVGKKLQVGMVGINEGII--SHTEAAFGGVKESGLGREGSRHGMEEFT 496
           Y FT+D S+   +  + + G V IN+ +I      A FGGV +SG G    ++G + FT
Sbjct: 371 YIFTDDKSEIDHILTRTRSGGVVINDTLIHVGVDNAPFGGVGDSGYGAYHGKYGFDTFT 429


>gnl|CDD|143402 cd07083, ALDH_P5CDH, ALDH subfamily NAD+-dependent
           delta(1)-pyrroline-5-carboxylate dehydrogenase-like.
           ALDH subfamily of the NAD+-dependent,
           delta(1)-pyrroline-5-carboxylate dehydrogenases (P5CDH,
           EC=1.5.1.12). The proline catabolic enzymes, proline
           dehydrogenase and P5CDH catalyze the two-step oxidation
           of proline to glutamate.  P5CDH catalyzes the oxidation
           of glutamate semialdehyde, utilizing NAD+ as the
           electron acceptor. In some bacteria, the two enzymes are
           fused into the bifunctional flavoenzyme, proline
           utilization A (PutA). These enzymes play important roles
           in cellular redox control, superoxide generation, and
           apoptosis. In certain prokaryotes such as Escherichia
           coli, PutA is also a transcriptional repressor of the
           proline utilization genes. Monofunctional enzyme
           sequences such as those seen in the Bacillus RocA P5CDH
           are also present in this subfamily as well as the human
           ALDH4A1 P5CDH and the Drosophila Aldh17 P5CDH.
          Length = 500

 Score =  217 bits (555), Expect = 8e-65
 Identities = 137/482 (28%), Positives = 226/482 (46%), Gaps = 24/482 (4%)

Query: 33  YINGEWVHSSKKFPVYNPANDEH-LADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGI 91
            I GEWV + ++    +P      +         + + A+ +A  AF+ W     ++R  
Sbjct: 22  VIGGEWVDTKERMVSVSPFAPSEVVGTTAKADKAEAEAALEAAWAAFKTWKDWPQEDRAR 81

Query: 92  VLKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYG---ELV 148
           +L +    L   ++ L   +T E GK   E++ +V  A  F+ +++  A R      E+V
Sbjct: 82  LLLKAADLLRRRRRELIATLTYEVGKNWVEAIDDVAEAIDFIRYYARAALRLRYPAVEVV 141

Query: 149 PSPVKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALA 208
           P P +  E   V   LG   +I+PWNFP+A+ T  +   +A G T + KPAED  +    
Sbjct: 142 PYPGEDNESFYV--GLGAGVVISPWNFPVAIFTGMIVAPVAVGNTVIAKPAEDAVVVGYK 199

Query: 209 LAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQ- 267
           +  + HEA  P GV+  +         VG  L     + GI+FTGS + GK +Y   ++ 
Sbjct: 200 VFEIFHEAGFPPGVVQFLPGV---GEEVGAYLTEHERIRGINFTGSLETGKKIYEAAARL 256

Query: 268 -----GIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVF 322
                  KRL +E GG    IV   A+ +  V G + + F   GQ C AA+R ++ +  +
Sbjct: 257 APGQTWFKRLYVETGGKNAIIVDETADFELVVEGVVVSAFGFQGQKCSAASRLILTQGAY 316

Query: 323 DQFVEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGK 382
           +  +E L     +  + G   +N   +LGP+I+  Q  KV S ++   ++G   ++ GGK
Sbjct: 317 EPVLERLLKRAERLSV-GPPEEN-GTDLGPVIDAEQEAKVLSYIEHGKNEGQ--LVLGGK 372

Query: 383 VTKLGSKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEE--ESLCVANNSNTGLAGY 440
             +    F APT++  + P      EE+FGPV   I++K ++  E+L VAN++  GL G 
Sbjct: 373 RLEGEGYFVAPTVVEEVPPKARIAQEEIFGPVLSVIRYKDDDFAEALEVANSTPYGLTGG 432

Query: 441 FFTNDISQAWRVGKKLQVGMVGINEGIISHT--EAAFGGVKESGLG-REGSRHGMEEFTQ 497
            ++          ++  VG + IN  I         FGG K SG   + G  H +  F +
Sbjct: 433 VYSRKREHLEEARREFHVGNLYINRKITGALVGVQPFGGFKLSGTNAKTGGPHYLRRFLE 492

Query: 498 IK 499
           +K
Sbjct: 493 MK 494


>gnl|CDD|166060 PLN02419, PLN02419, methylmalonate-semialdehyde dehydrogenase
           [acylating].
          Length = 604

 Score =  210 bits (535), Expect = 5e-61
 Identities = 148/478 (30%), Positives = 236/478 (49%), Gaps = 23/478 (4%)

Query: 34  INGEWV--HSSKKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGI 91
           I G +V   SS    V NPA  E ++ VP    E+ + A+ +AK+AF  W  T    R  
Sbjct: 118 IGGSFVESQSSSFIDVINPATQEVVSKVPLTTNEEFKAAVSAAKQAFPLWRNTPITTRQR 177

Query: 92  VLKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVPSP 151
           V+ ++ + +  N   LA  +T E GK   +S G++      V+     A    GE +P+ 
Sbjct: 178 VMLKFQELIRKNMDKLAMNITTEQGKTLKDSHGDIFRGLEVVEHACGMATLQMGEYLPNV 237

Query: 152 VKSKELLMVREPLGVVGLITPWNFPMAMITRKVAP-ALAAGCTCVIKPAEDTPLTALALA 210
               +   +REPLGV   I P+NFP AMI   + P A+  G T ++KP+E  P  ++ LA
Sbjct: 238 SNGVDTYSIREPLGVCAGICPFNFP-AMIPLWMFPVAVTCGNTFILKPSEKDPGASVILA 296

Query: 211 ALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIK 270
            LA EA +P GVLN+V  +    N++    C    +  +SF GS   G  +Y+  +   K
Sbjct: 297 ELAMEAGLPDGVLNIVHGTNDTVNAI----CDDEDIRAVSFVGSNTAGMHIYARAAAKGK 352

Query: 271 RLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLI---QEQVFDQFVE 327
           R+   +G     +V  +AN+   ++  +AA F   GQ C+A +  +     +   D+ VE
Sbjct: 353 RIQSNMGAKNHGLVLPDANIDATLNALLAAGFGAAGQRCMALSTVVFVGDAKSWEDKLVE 412

Query: 328 MLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLG 387
             K       ++         +LGP+I+    E++  ++Q  +  GAK++L G  +   G
Sbjct: 413 RAKA------LKVTCGSEPDADLGPVISKQAKERICRLIQSGVDDGAKLLLDGRDIVVPG 466

Query: 388 ---SKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTN 444
                F  PTIL+ +TPDM CY EE+FGPV VC++  + +E++ + N +  G     FT+
Sbjct: 467 YEKGNFIGPTILSGVTPDMECYKEEIFGPVLVCMQANSFDEAISIINKNKYGNGAAIFTS 526

Query: 445 DISQAWRVGKKLQVGMVGINEGI-ISHTEAAFGGVKESGLGREG--SRHGMEEFTQIK 499
             + A +    ++ G +GIN  I +     +F G K S  G      + G++ FTQIK
Sbjct: 527 SGAAARKFQMDIEAGQIGINVPIPVPLPFFSFTGNKASFAGDLNFYGKAGVDFFTQIK 584


>gnl|CDD|132284 TIGR03240, arg_catab_astD, succinylglutamic semialdehyde
           dehydrogenase.  Members of this protein family are
           succinylglutamic semialdehyde dehydrogenase (EC
           1.2.1.71), the fourth enzyme in the arginine
           succinyltransferase (AST) pathway for arginine
           catabolism [Energy metabolism, Amino acids and amines].
          Length = 484

 Score =  203 bits (517), Expect = 2e-59
 Identities = 138/462 (29%), Positives = 225/462 (48%), Gaps = 26/462 (5%)

Query: 32  AYINGEWV----HSSKKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPK 87
            +I+G+W      S       NPA  E L          V+ A+ +A+ AF  W++ + +
Sbjct: 1   LFIDGKWRAGQGESFSST---NPATQEVLWQGAAASAAQVEAAVAAARAAFPAWARLSLE 57

Query: 88  ERGIVLKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGEL 147
           ER  V++R+   LE  K+ALA ++ +E+GKP  E+  EV      V   S +A       
Sbjct: 58  ERIAVVQRFAALLEERKEALARVIARETGKPLWETRTEVASMIGKVA-ISIKAYHERTGE 116

Query: 148 VPSPVKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTAL 207
             +P+     ++   P GVV +  P+NFP  +    + PAL AG T V KP+E TP  A 
Sbjct: 117 SENPMPDGRAVLRHRPHGVVAVFGPYNFPGHLPNGHIVPALLAGNTVVFKPSELTPWVAE 176

Query: 208 ALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCS- 266
               L  +A +P GVLN+V  +R      G  L   P + G+ FTGS+  G +L+   + 
Sbjct: 177 ETVKLWEKAGLPAGVLNLVQGAR----ETGVALAAHPDIDGLLFTGSSNTGHLLHRQFAG 232

Query: 267 QGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVF-DQF 325
           +  K L+LE+GGN P IV   A++  AV   + + F + GQ C  A R L+ +    D F
Sbjct: 233 RPEKILALEMGGNNPLIVDEVADIDAAVHLIIQSAFISAGQRCTCARRLLVPDGAQGDAF 292

Query: 326 VEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTK 385
           +  L VE+ +    G      +  +G +I+L   +++ +     ++ G K +L   +   
Sbjct: 293 LARL-VEVAERLTVGAWDAEPQPFMGAVISLQAAQRLLAAQAKLLALGGKSLL-AMRQLD 350

Query: 386 LGSKFFAPTIL--TNI--TPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYF 441
            G+    P I+  T +   PD     EE FGP+   I++   +E++ +ANN+  GL+   
Sbjct: 351 PGAALLTPGIIDVTGVAELPD-----EEHFGPLLQVIRYDDFDEAIAIANNTRFGLSAGL 405

Query: 442 FTNDISQAWRVGKKLQVGMVGINEGIISHTEAA-FGGVKESG 482
            ++D     R   +++ G+V  N+ +   + AA FGG+  SG
Sbjct: 406 LSDDRELYDRFLLEIRAGIVNWNKPLTGASSAAPFGGIGASG 447


>gnl|CDD|240392 PTZ00381, PTZ00381, aldehyde dehydrogenase family protein;
           Provisional.
          Length = 493

 Score =  200 bits (511), Expect = 3e-58
 Identities = 123/423 (29%), Positives = 203/423 (47%), Gaps = 28/423 (6%)

Query: 87  KERGIVLKRWLKELEGNKQALAEIMTQESGKPPTES--------LGEVVYANSFVDWFSE 138
           + R   L+  L+ LE NKQ  +E + ++ G+ P E+        + E+ +    +D + +
Sbjct: 29  EFRKQQLRNLLRMLEENKQEFSEAVHKDLGRHPFETKMTEVLLTVAEIEHLLKHLDEYLK 88

Query: 139 EARRTYGELVPSPVKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKP 198
              +     V  P KS    ++ EPLGVV +I  WN+P+ +    +A A+AAG T V+KP
Sbjct: 89  P-EKVDTVGVFGPGKSY---IIPEPLGVVLVIGAWNYPLNLTLIPLAGAIAAGNTVVLKP 144

Query: 199 AEDTPLTALALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVG 258
           +E +P T+  +A L  +  +    + V+                      I FTGS +VG
Sbjct: 145 SELSPHTSKLMAKLLTK-YLDPSYVRVIEGGVEVTTE-----LLKEPFDHIFFTGSPRVG 198

Query: 259 KILYSHCSQGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQ 318
           K++    ++ +   +LELGG +P IV  + NL+ A       KF N GQ CVA +  L+ 
Sbjct: 199 KLVMQAAAENLTPCTLELGGKSPVIVDKSCNLKVAARRIAWGKFLNAGQTCVAPDYVLVH 258

Query: 319 EQVFDQFVEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVIL 378
             + D+F+E LK  I + F  G+  K  +     ++N    +++  +++D    G KV+ 
Sbjct: 259 RSIKDKFIEALKEAIKEFF--GEDPKKSED-YSRIVNEFHTKRLAELIKD---HGGKVVY 312

Query: 379 GGGKVTKLGSKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLA 438
            GG+V    +K+ APTI+ N   D     EE+FGP+   + ++  +E L   N+    LA
Sbjct: 313 -GGEVDI-ENKYVAPTIIVNPDLDSPLMQEEIFGPILPILTYENIDEVLEFINSRPKPLA 370

Query: 439 GYFFTNDISQAWRVGKKLQVGMVGINEGIISHTEAA--FGGVKESGLGREGSRHGMEEFT 496
            Y+F  D      V +    G V IN+ +         FGGV  SG+G    ++G + F+
Sbjct: 371 LYYFGEDKRHKELVLENTSSGAVVINDCVFHLLNPNLPFGGVGNSGMGAYHGKYGFDTFS 430

Query: 497 QIK 499
             K
Sbjct: 431 HPK 433


>gnl|CDD|177949 PLN02315, PLN02315, aldehyde dehydrogenase family 7 member.
          Length = 508

 Score =  200 bits (509), Expect = 5e-58
 Identities = 142/475 (29%), Positives = 228/475 (48%), Gaps = 14/475 (2%)

Query: 19  KYLQKLCLSTYTQ-AYINGEWVHSSKKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEA 77
           ++L ++ LS+     Y+ GEW  +       NPAN++ +A+V +  +ED +  + + +EA
Sbjct: 9   EFLSEIGLSSRNLGCYVGGEWRANGPLVSSVNPANNQPIAEVVEASLEDYEEGLRACEEA 68

Query: 78  FEQWSQTTPKERGIVLKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVDWFS 137
            + W Q    +RG ++++    L      L  +++ E GK   E +GEV       D+  
Sbjct: 69  AKIWMQVPAPKRGEIVRQIGDALRAKLDYLGRLVSLEMGKILAEGIGEVQEIIDMCDFAV 128

Query: 138 EEARRTYGELVPSPVKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIK 197
             +R+  G ++PS   +  ++ V  PLG+VG+IT +NFP A++      AL  G   V K
Sbjct: 129 GLSRQLNGSIIPSERPNHMMMEVWNPLGIVGVITAFNFPCAVLGWNACIALVCGNCVVWK 188

Query: 198 PAEDTPLTALALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQV 257
            A  TPL  +A+  L  E      +   + +S      +G+ +     +  +SFTGS++V
Sbjct: 189 GAPTTPLITIAMTKLVAEVLEKNNLPGAIFTSFCGGAEIGEAIAKDTRIPLVSFTGSSKV 248

Query: 258 GKILYSHCSQGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLI 317
           G ++    +    +  LEL GN   IV  +A++Q AV   + A     GQ C    R L+
Sbjct: 249 GLMVQQTVNARFGKCLLELSGNNAIIVMDDADIQLAVRSVLFAAVGTAGQRCTTCRRLLL 308

Query: 318 QEQVFDQFVEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVI 377
            E ++D  +E L + + K    GD  +     LGPL      +  +  ++   S+G K I
Sbjct: 309 HESIYDDVLEQL-LTVYKQVKIGDPLEKGTL-LGPLHTPESKKNFEKGIEIIKSQGGK-I 365

Query: 378 LGGGKVTKLGSKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGL 437
           L GG   +    F  PTI+  I+PD     EE+FGPV   +KFKT EE++ + N+   GL
Sbjct: 366 LTGGSAIESEGNFVQPTIV-EISPDADVVKEELFGPVLYVMKFKTLEEAIEINNSVPQGL 424

Query: 438 AGYFFTN--DISQAWRVGKKLQVGMVGINEGIISHTEA----AFGGVKESGLGRE 486
           +   FT   +    W        G+V +N   I    A    AFGG K +G GRE
Sbjct: 425 SSSIFTRNPETIFKWIGPLGSDCGIVNVN---IPTNGAEIGGAFGGEKATGGGRE 476


>gnl|CDD|143451 cd07133, ALDH_CALDH_CalB, Coniferyl aldehyde dehydrogenase-like.
           Coniferyl aldehyde dehydrogenase (CALDH, EC=1.2.1.68) of
           Pseudomonas sp. strain HR199 (CalB) which catalyzes the
           NAD+-dependent oxidation of coniferyl aldehyde to
           ferulic acid, and similar sequences, are present in this
           CD.
          Length = 434

 Score =  198 bits (505), Expect = 6e-58
 Identities = 127/430 (29%), Positives = 209/430 (48%), Gaps = 32/430 (7%)

Query: 85  TPKERGIVLKRWLKELEGNKQALAEIMTQE-SGKPPTES-LGEVV-------YANSFVD- 134
           + +ER   L R    L  N+ ALAE ++ +   +   E+ L E++       +A   +  
Sbjct: 18  SLEERRDRLDRLKALLLDNQDALAEAISADFGHRSRHETLLAEILPSIAGIKHARKHLKK 77

Query: 135 WFSEEARRTYGELVPSPVKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTC 194
           W     R     L+  P K++   +  +PLGVVG+I PWN+P+ +    +  ALAAG   
Sbjct: 78  WMKPSRRHV--GLLFLPAKAE---VEYQPLGVVGIIVPWNYPLYLALGPLIAALAAGNRV 132

Query: 195 VIKPAEDTPLTALALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSP--HVAGISFT 252
           +IKP+E TP T+  LA L  E    +  + VVT        V     + P  H   + FT
Sbjct: 133 MIKPSEFTPRTSALLAELLAEY-FDEDEVAVVTGGAD----VAAAFSSLPFDH---LLFT 184

Query: 253 GSTQVGKILYSHCSQGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAA 312
           GST VG+ +    ++ +  ++LELGG +P I+  +A+L KA       K  N GQ CVA 
Sbjct: 185 GSTAVGRHVMRAAAENLTPVTLELGGKSPAIIAPDADLAKAAERIAFGKLLNAGQTCVAP 244

Query: 313 NRFLIQEQVFDQFVEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISK 372
           +  L+ E   ++FV   K  + K +     +     +   +IN     ++  +++DA +K
Sbjct: 245 DYVLVPEDKLEEFVAAAKAAVAKMYPTLADNP----DYTSIINERHYARLQGLLEDARAK 300

Query: 373 GAKVI-LGGGKVTKLGSKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVAN 431
           GA+VI L         ++   PT++ N+T DM    EE+FGP+   + + + +E++   N
Sbjct: 301 GARVIELNPAGEDFAATRKLPPTLVLNVTDDMRVMQEEIFGPILPILTYDSLDEAIDYIN 360

Query: 432 NSNTGLAGYFFTNDISQAWRVGKKLQVGMVGINEGI--ISHTEAAFGGVKESGLGREGSR 489
                LA Y+F  D ++  RV ++   G V IN+ +  ++  +  FGGV  SG+G    +
Sbjct: 361 ARPRPLALYYFGEDKAEQDRVLRRTHSGGVTINDTLLHVAQDDLPFGGVGASGMGAYHGK 420

Query: 490 HGMEEFTQIK 499
            G   F+  K
Sbjct: 421 EGFLTFSHAK 430


>gnl|CDD|181873 PRK09457, astD, succinylglutamic semialdehyde dehydrogenase;
           Reviewed.
          Length = 487

 Score =  197 bits (502), Expect = 4e-57
 Identities = 144/467 (30%), Positives = 233/467 (49%), Gaps = 38/467 (8%)

Query: 33  YINGEWVH-SSKKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGI 91
           +ING+W+    + F   NP + E L    D     V  A+ +A+ AF  W++ + +ER  
Sbjct: 4   WINGDWIAGQGEAFESRNPVSGEVLWQGNDATAAQVDAAVRAARAAFPAWARLSFEERQA 63

Query: 92  VLKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEA--RRTYGELVP 149
           +++R+   LE NK+ LAE++ +E+GKP  E+  EV    + +   S +A   RT GE   
Sbjct: 64  IVERFAALLEENKEELAEVIARETGKPLWEAATEVTAMINKIA-ISIQAYHERT-GE--- 118

Query: 150 SPVKSKELL----MVR-EPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPL 204
              K  E+     ++R  P GVV +  P+NFP  +    + PAL AG T V KP+E TP 
Sbjct: 119 ---KRSEMADGAAVLRHRPHGVVAVFGPYNFPGHLPNGHIVPALLAGNTVVFKPSELTPW 175

Query: 205 TALALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSH 264
            A     L  +A +P GVLN+V   R      GK L   P + G+ FTGS   G +L+  
Sbjct: 176 VAELTVKLWQQAGLPAGVLNLVQGGRET----GKALAAHPDIDGLLFTGSANTGYLLHRQ 231

Query: 265 -CSQGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVF- 322
              Q  K L+LE+GGN P ++   A++  AV   + + F + GQ C  A R L+ +    
Sbjct: 232 FAGQPEKILALEMGGNNPLVIDEVADIDAAVHLIIQSAFISAGQRCTCARRLLVPQGAQG 291

Query: 323 DQFVEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGK 382
           D F+  L V + K    G      +  +G +I+    + + +     ++ G K +L   +
Sbjct: 292 DAFLARL-VAVAKRLTVGRWDAEPQPFMGAVISEQAAQGLVAAQAQLLALGGKSLL---E 347

Query: 383 VTKL--GSKFFAPTIL--TNIT--PDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTG 436
           +T+L  G+    P I+  T +   PD     EE FGP+   +++   +E++ +ANN+  G
Sbjct: 348 MTQLQAGTGLLTPGIIDVTGVAELPD-----EEYFGPLLQVVRYDDFDEAIRLANNTRFG 402

Query: 437 LAGYFFTNDISQAWRVGKKLQVGMVGINEGIISHTEAA-FGGVKESG 482
           L+    ++D     +   +++ G+V  N+ +   + AA FGGV  SG
Sbjct: 403 LSAGLLSDDREDYDQFLLEIRAGIVNWNKPLTGASSAAPFGGVGASG 449


>gnl|CDD|143454 cd07136, ALDH_YwdH-P39616, Bacillus subtilis aldehyde dehydrogenase
           ywdH-like.  Uncharacterized Bacillus subtilis ywdH
           aldehyde dehydrogenase (locus P39616)  most closely
           related to the ALDHs and fatty ALDHs of families 3 and
           14, and similar sequences, are included in this CD.
          Length = 449

 Score =  195 bits (498), Expect = 7e-57
 Identities = 128/419 (30%), Positives = 204/419 (48%), Gaps = 45/419 (10%)

Query: 93  LKRWLKELEGNKQALAEIMTQESGKPPTES--------LGEVVYANSFVD-WFSEEARRT 143
           LK+  + ++  +  + E + ++ GK   E+        L E+ YA   +  W   +  +T
Sbjct: 26  LKKLKQAIKKYENEILEALKKDLGKSEFEAYMTEIGFVLSEINYAIKHLKKWMKPKRVKT 85

Query: 144 YGELVPSPVKSKELLMVREPLGVVGLITPWNFPMAMITRKVAP---ALAAGCTCVIKPAE 200
                PS  KS    +  EP GVV +I PWN+P  +    +AP   A+AAG T V+KP+E
Sbjct: 86  PLLNFPS--KSY---IYYEPYGVVLIIAPWNYPFQLA---LAPLIGAIAAGNTAVLKPSE 137

Query: 201 DTPLTALALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKI 260
            TP T+  +A +  E    +  + VV           +LL        I FTGS +VGKI
Sbjct: 138 LTPNTSKVIAKIIEET-FDEEYVAVVEGGVEENQ---ELL--DQKFDYIFFTGSVRVGKI 191

Query: 261 LYSHCSQGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQ 320
           +    ++ +  ++LELGG +P IV  +ANL+ A    +  KF N GQ CVA +  L+ E 
Sbjct: 192 VMEAAAKHLTPVTLELGGKSPCIVDEDANLKLAAKRIVWGKFLNAGQTCVAPDYVLVHES 251

Query: 321 VFDQFVEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGG 380
           V ++F++ LK EI K +  G+       + G +IN    +++  ++ +      K++ GG
Sbjct: 252 VKEKFIKELKEEIKKFY--GEDPLESP-DYGRIINEKHFDRLAGLLDN-----GKIVFGG 303

Query: 381 GKVTKLGSKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGY 440
              T   + +  PTIL N+T D     EE+FGP+   + + T +E++ +  +    LA Y
Sbjct: 304 N--TDRETLYIEPTILDNVTWDDPVMQEEIFGPILPVLTYDTLDEAIEIIKSRPKPLALY 361

Query: 441 FFTNDISQAWRVGKKLQVGMVGINEGII----SHTEAAFGGVKESGLGREGSRHGMEEF 495
            F+ D     +V + L  G   IN+ I+     +    FGGV  SG+   GS HG   F
Sbjct: 362 LFSEDKKVEKKVLENLSFGGGCINDTIMHLANPYL--PFGGVGNSGM---GSYHGKYSF 415


>gnl|CDD|233325 TIGR01238, D1pyr5carbox3, delta-1-pyrroline-5-carboxylate
           dehydrogenase (PutA C-terminal domain).  This model
           represents one of several related branches of
           delta-1-pyrroline-5-carboxylate dehydrogenase. Members
           of this branch are the C-terminal domain of the PutA
           bifunctional proline dehydrogenase /
           delta-1-pyrroline-5-carboxylate dehydrogenase [Energy
           metabolism, Amino acids and amines].
          Length = 500

 Score =  188 bits (480), Expect = 6e-54
 Identities = 134/479 (27%), Positives = 214/479 (44%), Gaps = 32/479 (6%)

Query: 34  INGEWVHSSKKFPVYNPAN-DEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGIV 92
           I   +    +  PV NPA+  + +  V    +  VQ AI SA++AF  W+ T  KER   
Sbjct: 42  IGHSYKADGEAQPVTNPADRRDIVGQVFHANLAHVQAAIDSAQQAFPTWNATPAKERAAK 101

Query: 93  LKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVPSPV 152
           L R    LE +   L  +  +E+GK    ++ EV  A  F  +++++ R   GE      
Sbjct: 102 LDRLADLLELHMPELMALCVREAGKTIHNAIAEVREAVDFCRYYAKQVRDVLGE------ 155

Query: 153 KSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAAL 212
                    E  GV   I+PWNFP+A+ T +++ ALAAG T + KPAE T L A     L
Sbjct: 156 ------FSVESRGVFVCISPWNFPLAIFTGQISAALAAGNTVIAKPAEQTSLIAYRAVEL 209

Query: 213 AHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIK-- 270
             EA  P G + ++         VG  L + P +AG++FTGST+V +++    +Q     
Sbjct: 210 MQEAGFPAGTIQLLPGR---GADVGAALTSDPRIAGVAFTGSTEVAQLINQTLAQREDAP 266

Query: 271 -RLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEML 329
             L  E GG    IV S A  ++ V   + + F + GQ C A     +QE V D+ + M+
Sbjct: 267 VPLIAETGGQNAMIVDSTALPEQVVRDVLRSAFDSAGQRCSALRVLCVQEDVADRVLTMI 326

Query: 330 KVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKV--ILGGGKVTKLG 387
           +  + +  ++         ++GP+I+    + + + ++       K+  +          
Sbjct: 327 QGAMQE--LKVGVPHLLTTDVGPVIDAEAKQNLLAHIEHMSQTQKKIAQLTLDDSRACQH 384

Query: 388 SKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLA---GYFFTN 444
             F APT+        L  +EEVFGPV   +++K  E    V   + TG     G     
Sbjct: 385 GTFVAPTLFELDDIAEL--SEEVFGPVLHVVRYKARELDQIVDQINQTGYGLTMGVHSRI 442

Query: 445 DISQAWRVGKKLQVGMVGINEGIISHTEAA--FGGVKESGLG-REGSRHGMEEFTQIKY 500
           + +  W + K  +VG   +N   +        FGG   SG G + G  H +   TQ++Y
Sbjct: 443 ETTYRW-IEKHARVGNCYVNRNQVGAVVGVQPFGGQGLSGTGPKAGGPHYLYRLTQVQY 500


>gnl|CDD|237018 PRK11905, PRK11905, bifunctional proline
           dehydrogenase/pyrroline-5-carboxylate dehydrogenase;
           Reviewed.
          Length = 1208

 Score =  193 bits (493), Expect = 3e-53
 Identities = 148/464 (31%), Positives = 229/464 (49%), Gaps = 61/464 (13%)

Query: 46  PVYNPAN-DEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNK 104
           PV NPA+ D+ +  V +   EDV+ A+ +A+ AF +WS T   ER  +L+R    +E + 
Sbjct: 570 PVLNPADHDDVVGTVTEASAEDVERALAAAQAAFPEWSATPAAERAAILERAADLMEAHM 629

Query: 105 QALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVPSPVKSKELLMVREPL 164
             L  +  +E+GK    ++ EV  A  F+ +++ +ARR    L              +PL
Sbjct: 630 PELFALAVREAGKTLANAIAEVREAVDFLRYYAAQARRL---LNGPG---------HKPL 677

Query: 165 GVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPKGVLN 224
           G V  I+PWNFP+A+ T ++A AL AG T + KPAE TPL A     L HEA +PK  L 
Sbjct: 678 GPVVCISPWNFPLAIFTGQIAAALVAGNTVLAKPAEQTPLIAARAVRLLHEAGVPKDALQ 737

Query: 225 VVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKI----LYSHCSQGIKRLSLELGG-N 279
           ++        +VG  L   P +AG+ FTGST+V ++    L       +  L  E GG N
Sbjct: 738 LLPGD---GRTVGAALVADPRIAGVMFTGSTEVARLIQRTLAKRSGPPVP-LIAETGGQN 793

Query: 280 APFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFL-IQEQVFDQFVEMLK---VEITK 335
           A  IV S+A  ++ V+  +A+ F + GQ C +A R L +QE V D+ + MLK    E+  
Sbjct: 794 A-MIVDSSALPEQVVADVIASAFDSAGQRC-SALRVLCLQEDVADRVLTMLKGAMDELR- 850

Query: 336 SFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVI-LGGGKVTKLGSKFFAPT 394
               GD  +    ++GP+I+      +++ ++   + G  V  L     T+ G+ F APT
Sbjct: 851 ---IGDPWRLS-TDVGPVIDAEAQANIEAHIEAMRAAGRLVHQLPLPAETEKGT-FVAPT 905

Query: 395 I--LTNITPDMLCYTEEVFGPVAVCIKFKTEE-ESLCVANNSNTGLAGYFFTNDI----- 446
           +  + +I  D+     EVFGPV   ++FK +E + +    N+     GY  T  +     
Sbjct: 906 LIEIDSI-SDL---EREVFGPVLHVVRFKADELDRVIDDINA----TGYGLTFGLHSRID 957

Query: 447 SQAWRVGKKLQVGMVGINEGIISHTEAA------FGGVKESGLG 484
                V  +++ G + +N  II     A      FGG   SG G
Sbjct: 958 ETIAHVTSRIRAGNIYVNRNII----GAVVGVQPFGGEGLSGTG 997


>gnl|CDD|226683 COG4230, COG4230, Delta 1-pyrroline-5-carboxylate dehydrogenase
           [Energy production and conversion].
          Length = 769

 Score =  186 bits (475), Expect = 1e-51
 Identities = 143/470 (30%), Positives = 224/470 (47%), Gaps = 49/470 (10%)

Query: 34  INGEWVHSSKKFPVYNPANDEHL-ADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGIV 92
           +NG  V   +  PV NPA+ + +   V +    DV+ A+ +A  A   WS T P ER  +
Sbjct: 118 VNGAPVAGGEPRPVINPADPDDIVGTVTEATEADVEQALEAAVAAAPIWSATPPAERAAI 177

Query: 93  LKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVPSPV 152
           L+R    +E     L  ++ +E+GK  + ++ EV  A  F+ +++ +AR T+G L     
Sbjct: 178 LERAADLMEAQMPQLMGLLVREAGKTLSNAIAEVREAVDFLRYYAGQARDTFGNLT---- 233

Query: 153 KSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAAL 212
                     PLG V  I+PWNFP+A+ T ++A ALAAG + + KPAE TPL A     L
Sbjct: 234 --------HRPLGPVVCISPWNFPLAIFTGQIAAALAAGNSVLAKPAEQTPLIAAQAVRL 285

Query: 213 AHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCS--QG-I 269
            HEA +P GVL ++        +VG  L     VAG+ FTGST+V +++    +  QG  
Sbjct: 286 LHEAGVPPGVLQLLPG---RGETVGAALTADARVAGVMFTGSTEVARLIQRQLAKRQGRP 342

Query: 270 KRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEML 329
             L  E GG    IV S+A  ++ V+  +A+ F + GQ C A     +QE V D+ + ML
Sbjct: 343 IPLIAETGGQNAMIVDSSALAEQVVADVLASAFDSAGQRCSALRVLCLQEDVADRILTML 402

Query: 330 KVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLG-- 387
           K  + +  + G+  +    ++GP+I+    E   +I +   +  +K    G  V +    
Sbjct: 403 KGAMAELRV-GNPDRL-TTDVGPVID---AEAKANIEKHIQTMRSK----GRLVHQAAAP 453

Query: 388 -----SKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEE-ESLCVANNSNTGLAGYF 441
                  F APT++     D L    EVFGPV   +++K +E + +    N+     GY 
Sbjct: 454 NSLQKGTFVAPTLIELENLDEL--QREVFGPVLHVVRYKRDELDEVIDQINAT----GYG 507

Query: 442 FTNDI-----SQAWRVGKKLQVGMVGINEGIISHTEA--AFGGVKESGLG 484
            T  +          V ++   G + +N  I+        FGG   SG G
Sbjct: 508 LTLGVHTRIDETIAHVTERAHAGNLYVNRNIVGAVVGVQPFGGEGLSGTG 557


>gnl|CDD|237017 PRK11904, PRK11904, bifunctional proline
           dehydrogenase/pyrroline-5-carboxylate dehydrogenase;
           Reviewed.
          Length = 1038

 Score =  181 bits (463), Expect = 3e-49
 Identities = 149/457 (32%), Positives = 226/457 (49%), Gaps = 68/457 (14%)

Query: 46  PVYNPANDEH-LADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNK 104
           PV +PA+    + +V     E V+ A+ +A+ AF  WS+T  +ER  +L+R    LE N+
Sbjct: 565 PVVSPADRRRVVGEVAFADAEQVEQALAAARAAFPAWSRTPVEERAAILERAADLLEANR 624

Query: 105 QALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYG--ELVPSPV-KSKELLMVR 161
             L  +  +E+GK   +++ EV  A  F  +++ +ARR +G  E +P P  +S EL    
Sbjct: 625 AELIALCVREAGKTLQDAIAEVREAVDFCRYYAAQARRLFGAPEKLPGPTGESNEL--RL 682

Query: 162 EPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPKG 221
              GV   I+PWNFP+A+   +VA ALAAG T + KPAE TPL A     L HEA IPK 
Sbjct: 683 HGRGVFVCISPWNFPLAIFLGQVAAALAAGNTVIAKPAEQTPLIAAEAVKLLHEAGIPKD 742

Query: 222 VLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKR---------- 271
           VL ++        +VG  L   P +AG++FTGST+  +I        I R          
Sbjct: 743 VLQLLPGD---GATVGAALTADPRIAGVAFTGSTETARI--------INRTLAARDGPIV 791

Query: 272 -LSLELGG-NAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEML 329
            L  E GG NA  IV S A  ++ V   + + FR+ GQ C A     +QE + D+ +EML
Sbjct: 792 PLIAETGGQNA-MIVDSTALPEQVVDDVVTSAFRSAGQRCSALRVLFVQEDIADRVIEML 850

Query: 330 K---VEITKSFIQGDTS--KNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVI----LGG 380
           K    E+      GD      D   +GP+I+      +D+ + + + + A+++    L  
Sbjct: 851 KGAMAELK----VGDPRLLSTD---VGPVIDAEAKANLDAHI-ERMKREARLLAQLPLPA 902

Query: 381 GKVTKLGSKFFAPTI--LTNITPDMLCYTEEVFGPVAVCIKFKTEE-ESLCVANNSNTGL 437
           G  T+ G  F APT   + +I    L    EVFGP+   I++K  + + +  A N+    
Sbjct: 903 G--TENGH-FVAPTAFEIDSI--SQL--EREVFGPILHVIRYKASDLDKVIDAINA---- 951

Query: 438 AGYFFT------NDISQAWRVGKKLQVGMVGINEGII 468
            GY  T       +   A R+  +++VG V +N   I
Sbjct: 952 TGYGLTLGIHSRIE-ETADRIADRVRVGNVYVNRNQI 987


>gnl|CDD|143450 cd07132, ALDH_F3AB, Aldehyde dehydrogenase family 3 members A1, A2,
           and B1 and related proteins.  NAD(P)+-dependent,
           aldehyde dehydrogenase, family 3 members A1 and B1
           (ALDH3A1, ALDH3B1,  EC=1.2.1.5) and fatty aldehyde
           dehydrogenase, family 3 member A2 (ALDH3A2, EC=1.2.1.3),
           and similar sequences are included in this CD. Human
           ALDH3A1 is a homodimer with a critical role in cellular
           defense against oxidative stress; it catalyzes the
           oxidation of various cellular membrane lipid-derived
           aldehydes. Corneal crystalline ALDH3A1 protects the
           cornea and underlying lens against UV-induced oxidative
           stress. Human ALDH3A2, a microsomal homodimer, catalyzes
           the oxidation of long-chain aliphatic aldehydes to fatty
           acids. Human ALDH3B1 is highly expressed in the kidney
           and liver and catalyzes the oxidation of various medium-
           and long-chain saturated and unsaturated aliphatic
           aldehydes.
          Length = 443

 Score =  171 bits (436), Expect = 6e-48
 Identities = 133/449 (29%), Positives = 216/449 (48%), Gaps = 48/449 (10%)

Query: 70  AIMSAKEAFEQWSQTTPKE-RGIVLKRWLKELEGNKQALAEIMTQESGKPPTES-LGEVV 127
           A+  A+EAF    +T P E R   L+  L+ LE N+  + E + ++  KP  E+ L E++
Sbjct: 3   AVRRAREAFSS-GKTRPLEFRIQQLEALLRMLEENEDEIVEALAKDLRKPKFEAVLSEIL 61

Query: 128 YANSFVDWFSEEARRTYGEL----VPSPVK------SKELLMVREPLGVVGLITPWNFPM 177
                      E +     L     P PVK        ++ + +EPLGVV +I  WN+P+
Sbjct: 62  LVK-------NEIKYAISNLPEWMKPEPVKKNLATLLDDVYIYKEPLGVVLIIGAWNYPL 114

Query: 178 AMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPKGVLN----VVTSSRTNA 233
            +    +  A+AAG   VIKP+E +P TA  LA L     IPK +      VV       
Sbjct: 115 QLTLVPLVGAIAAGNCVVIKPSEVSPATAKLLAEL-----IPKYLDKECYPVVLGG---V 166

Query: 234 NSVGKLL-CTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNAPFIVYSNANLQK 292
               +LL     +   I +TGST VGKI+    ++ +  ++LELGG +P  V  + ++  
Sbjct: 167 EETTELLKQRFDY---IFYTGSTSVGKIVMQAAAKHLTPVTLELGGKSPCYVDKSCDIDV 223

Query: 293 AVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQGDTSKNDKCNLGP 352
           A       KF N GQ C+A +  L   +V ++FVE LK +  K F   D  ++   + G 
Sbjct: 224 AARRIAWGKFINAGQTCIAPDYVLCTPEVQEKFVEALK-KTLKEFYGEDPKESP--DYGR 280

Query: 353 LINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSKFFAPTILTNITPDMLCYTEEVFG 412
           +IN    ++    ++  +S G KV +GG   T    ++ APT+LT++ P      EE+FG
Sbjct: 281 IINDRHFQR----LKKLLSGG-KVAIGGQ--TDEKERYIAPTVLTDVKPSDPVMQEEIFG 333

Query: 413 PVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQVGMVGINEGIISHTE 472
           P+   +     +E++   N+    LA Y F+N+     ++      G V +N+ I+ +T 
Sbjct: 334 PILPIVTVNNLDEAIEFINSREKPLALYVFSNNKKVINKILSNTSSGGVCVNDTIMHYTL 393

Query: 473 AA--FGGVKESGLGREGSRHGMEEFTQIK 499
            +  FGGV  SG+G    ++  + F+  +
Sbjct: 394 DSLPFGGVGNSGMGAYHGKYSFDTFSHKR 422


>gnl|CDD|143455 cd07137, ALDH_F3FHI, Plant aldehyde dehydrogenase family 3 members
           F1, H1, and I1 and related proteins.  Aldehyde
           dehydrogenase family members 3F1, 3H1, and 3I1 (ALDH3F1,
           ALDH3H1, and ALDH3I1), and similar plant sequences, are
           in this CD.  In Arabidopsis thaliana, stress-regulated
           expression of ALDH3I1  was observed in  leaves and
           osmotic stress expression of  ALDH3H1 was observed in
           root tissue, whereas, ALDH3F1 expression was not stress
           responsive. Functional analysis of ALDH3I1 suggest it
           may be involved in a detoxification pathway in plants
           that limits aldehyde accumulation and oxidative stress.
          Length = 432

 Score =  169 bits (430), Expect = 2e-47
 Identities = 131/448 (29%), Positives = 209/448 (46%), Gaps = 36/448 (8%)

Query: 64  VEDVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNKQALAEIMTQESGKPPTES- 122
           V +++    S +    +W ++        LK  L+ ++ N+  +   + Q+ GKP  ES 
Sbjct: 5   VRELRETFRSGRTRSAEWRKSQ-------LKGLLRLVDENEDDIFAALRQDLGKPSAESF 57

Query: 123 LGEV--------VYANSFVDWFSEEARRTYGELVPSPVKSKELLMVREPLGVVGLITPWN 174
             EV        +       W + E  +T   L   P K++   +V EPLGVV +I+ WN
Sbjct: 58  RDEVSVLVSSCKLAIKELKKWMAPEKVKT--PLTTFPAKAE---IVSEPLGVVLVISAWN 112

Query: 175 FPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPKGVLNVVTSSRTNAN 234
           FP  +    V  A+AAG   V+KP+E  P T+  LA L  E    K +  V+        
Sbjct: 113 FPFLLSLEPVIGAIAAGNAVVLKPSELAPATSALLAKLIPEYLDTKAI-KVIEGGVPETT 171

Query: 235 SVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNAPFIVYSNANLQKAV 294
           +   LL        I FTGS +VG+I+ +  ++ +  ++LELGG  P IV S  +L+ AV
Sbjct: 172 A---LL--EQKWDKIFFTGSPRVGRIIMAAAAKHLTPVTLELGGKCPVIVDSTVDLKVAV 226

Query: 295 SGAMAAKFRNC-GQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQGDTSKNDKCNLGPL 353
                 K+    GQAC+A +  L++E      ++ LK  + K F  G+  K  K  L  +
Sbjct: 227 RRIAGGKWGCNNGQACIAPDYVLVEESFAPTLIDALKNTLEKFF--GENPKESKD-LSRI 283

Query: 354 INLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSKFFAPTILTNITPDMLCYTEEVFGP 413
           +N    +++  ++ D  S   K++ GG +  K  + +  PTIL +   D    TEE+FGP
Sbjct: 284 VNSHHFQRLSRLLDDP-SVADKIVHGGERDEK--NLYIEPTILLDPPLDSSIMTEEIFGP 340

Query: 414 VAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQVGMVGINEGIISHT-- 471
           +   I  K  EES+ + N+    LA Y FT +     R+  +   G V  N+ ++ +   
Sbjct: 341 LLPIITVKKIEESIEIINSRPKPLAAYVFTKNKELKRRIVAETSSGGVTFNDTVVQYAID 400

Query: 472 EAAFGGVKESGLGREGSRHGMEEFTQIK 499
              FGGV ESG G    +   + F+  K
Sbjct: 401 TLPFGGVGESGFGAYHGKFSFDAFSHKK 428


>gnl|CDD|236989 PRK11809, putA, trifunctional transcriptional regulator/proline
            dehydrogenase/pyrroline-5-carboxylate dehydrogenase;
            Reviewed.
          Length = 1318

 Score =  171 bits (435), Expect = 1e-45
 Identities = 116/407 (28%), Positives = 187/407 (45%), Gaps = 66/407 (16%)

Query: 46   PVYNPANDEHLAD----VPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELE 101
            PV NPA+     D    V +    +V+ A+ SA  A   W  T P ER  +L+R    +E
Sbjct: 662  PVINPAD---PRDIVGYVREATPAEVEQALESAVNAAPIWFATPPAERAAILERAADLME 718

Query: 102  GNKQALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVPSPVKSKELLMVR 161
               Q L  ++ +E+GK  + ++ EV  A  F+ +++ + R  +           +     
Sbjct: 719  AQMQTLMGLLVREAGKTFSNAIAEVREAVDFLRYYAGQVRDDF---------DND---TH 766

Query: 162  EPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPKG 221
             PLG V  I+PWNFP+A+ T +VA ALAAG + + KPAE TPL A     +  EA +P G
Sbjct: 767  RPLGPVVCISPWNFPLAIFTGQVAAALAAGNSVLAKPAEQTPLIAAQAVRILLEAGVPAG 826

Query: 222  VLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLS-------- 273
            V+ ++        +VG  L     V G+ FTGST+V ++L  + +    RL         
Sbjct: 827  VVQLLPGR---GETVGAALVADARVRGVMFTGSTEVARLLQRNLA---GRLDPQGRPIPL 880

Query: 274  -LELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVE 332
              E GG    IV S+A  ++ V+  +A+ F + GQ C A     +Q+ V D+ ++ML+  
Sbjct: 881  IAETGGQNAMIVDSSALTEQVVADVLASAFDSAGQRCSALRVLCLQDDVADRTLKMLRGA 940

Query: 333  ITKSFIQGDTSKNDKCNL----------GPLINLAQVEKVDSIVQDAISKGAKV---ILG 379
            +             +C +          GP+I+      ++  +Q   +KG  V      
Sbjct: 941  M------------AECRMGNPDRLSTDIGPVIDAEAKANIERHIQAMRAKGRPVFQAARE 988

Query: 380  GGKVTKLGSKFFAPTI--LTNITPDMLCYTEEVFGPVAVCIKFKTEE 424
              +  + G+ F  PT+  L +   D L    EVFGPV   +++   +
Sbjct: 989  NSEDWQSGT-FVPPTLIELDSF--DEL--KREVFGPVLHVVRYNRNQ 1030


>gnl|CDD|165847 PLN02203, PLN02203, aldehyde dehydrogenase.
          Length = 484

 Score =  159 bits (403), Expect = 4e-43
 Identities = 134/449 (29%), Positives = 222/449 (49%), Gaps = 50/449 (11%)

Query: 75  KEAFEQWSQTTPKERGIVLKRWLKELEGNKQALAEIMTQESGKPPTES----LG----EV 126
           +E +E     + + R   LK  L+ L+ N++A+ + + Q+ GK   E+    +G      
Sbjct: 16  RETYESGRTRSLEWRKSQLKGLLRLLKDNEEAIFKALHQDLGKHRVEAYRDEVGVLTKSA 75

Query: 127 VYANSFVD-WFSEEARRTYGELVPSPVKSKELLMVREPLGVVGLITPWNFPMAMITRKVA 185
             A S +  W + +  +    LV  P  ++   +V EPLGVV + + WNFP+ +    + 
Sbjct: 76  NLALSNLKKWMAPK--KAKLPLVAFPATAE---VVPEPLGVVLIFSSWNFPIGLSLEPLI 130

Query: 186 PALAAGCTCVIKPAEDTPLTALALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPH 245
            A+AAG   V+KP+E  P T+  LA     A IPK +         ++ +V K++   P 
Sbjct: 131 GAIAAGNAVVLKPSELAPATSAFLA-----ANIPKYL---------DSKAV-KVIEGGPA 175

Query: 246 VA---------GISFTGSTQVGKILYSHCSQGIKRLSLELGGNAPFIV---YSNANLQKA 293
           V           I FTGS +VG+I+ +  ++ +  ++LELGG  P IV    S+ + + A
Sbjct: 176 VGEQLLQHKWDKIFFTGSPRVGRIIMTAAAKHLTPVALELGGKCPCIVDSLSSSRDTKVA 235

Query: 294 VSGAMAAKFRNC-GQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQGDTSKNDKCNLGP 352
           V+  +  K+ +C GQAC+A +  L++E+     +E+LK  I K F  G+  +  K  +  
Sbjct: 236 VNRIVGGKWGSCAGQACIAIDYVLVEERFAPILIELLKSTIKKFF--GENPRESKS-MAR 292

Query: 353 LINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSKFFAPTILTNITPDMLCYTEEVFG 412
           ++N    +++ ++++D     A ++ GG    K    F  PTIL N   D    TEE+FG
Sbjct: 293 ILNKKHFQRLSNLLKDP-RVAASIVHGGSIDEK--KLFIEPTILLNPPLDSDIMTEEIFG 349

Query: 413 PVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQVGMVGINEGIISHTE 472
           P+   I  K  E+S+   N+    LA Y FTN+     R+  +   G V  N+ II +  
Sbjct: 350 PLLPIITVKKIEDSIAFINSKPKPLAIYAFTNNEKLKRRILSETSSGSVTFNDAIIQYAC 409

Query: 473 AA--FGGVKESGLGREGSRHGMEEFTQIK 499
            +  FGGV ESG GR   ++  + F+  K
Sbjct: 410 DSLPFGGVGESGFGRYHGKYSFDTFSHEK 438


>gnl|CDD|233324 TIGR01236, D1pyr5carbox1, delta-1-pyrroline-5-carboxylate
           dehydrogenase, group 1.  This model represents one of
           two related branches of delta-1-pyrroline-5-carboxylate
           dehydrogenase. The two branches are not as closely
           related to each other as some aldehyde dehydrogenases
           are to this branch, and separate models are built for
           this reason. The enzyme is the second of two in the
           degradation of proline to glutamate [Energy metabolism,
           Amino acids and amines].
          Length = 532

 Score =  137 bits (348), Expect = 3e-35
 Identities = 124/488 (25%), Positives = 205/488 (42%), Gaps = 31/488 (6%)

Query: 34  INGEWVHSS-KKFPVYNPANDEH-LADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGI 91
           I GE V +S ++    +P N +  LA   +   E    A+ +A +A ++WS     +R  
Sbjct: 36  IGGEEVWTSGERIYQVSPHNHQAVLAKATNATEELANKAVEAALDAKKEWSLLPFYDRAA 95

Query: 92  VLKRWLKELEG--NKQALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARRTYGELVP 149
           +  +    L G    + LA  M  +S       +  V     F  + ++ AR  Y +  P
Sbjct: 96  IFLKAADLLSGPYRAEILAATMLGQSKTVYQAEIDAVAELIDFFRFNAKYARELYEQQ-P 154

Query: 150 SPVKSKELLMVREPL-GVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALA 208
                +       PL G V  I+P+NF         APAL  G T + KP++   L+   
Sbjct: 155 ISAPGEWNRTEYRPLEGFVYAISPFNFTAIAGNLAGAPALM-GNTVIWKPSDTAMLSNYL 213

Query: 209 LAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQG 268
           +  +  EA +P GV+N V     +   V   +   P +AGI FTGST   K L+   +Q 
Sbjct: 214 VMRILEEAGLPPGVINFVPG---DGPLVSDTVLADPDLAGIHFTGSTPTFKHLWKQVAQN 270

Query: 269 IK------RLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVF 322
           +       R+  E GG    +V+ +A+++  VSG +   F   GQ C A +R  +   ++
Sbjct: 271 LDRYHNFPRIVGETGGKDFHVVHPSADVEHVVSGTIRGAFEYQGQKCSACSRLYVPHSLW 330

Query: 323 DQFVEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKV-ILGGG 381
            +F   L          GD   +    +G +I+     K+   ++ A S  + + IL GG
Sbjct: 331 PRFKGRLL-AELSRVKVGDPD-DFGTFMGAVIDEKSFAKIVKYIEHAKSDPSALTILAGG 388

Query: 382 KVTKLGSKFFAPTILTNITPDMLCYTEEVFGPV-AVCIKFKTEEESLC--VANNSNTGLA 438
           K       F  PT++ +  P      EE+FGPV  V +    + + +   V + ++ GL 
Sbjct: 389 KYDDSVGYFVEPTVVESKDPQEPLMKEEIFGPVLTVYVYPDDKYKEILDLVDSTTSYGLT 448

Query: 439 GYFFTNDISQAWRVGKKLQ--VGMVGINE----GIISHTEAAFGGVKESGLG-REGSRHG 491
           G  F  D        K L+   G   IN+     ++   +  FGG + SG   + G  H 
Sbjct: 449 GAVFAKDRDAILEADKVLRFAAGNFYINDKPTGAVVG--QQPFGGARASGTNDKAGGPHN 506

Query: 492 MEEFTQIK 499
           +  +T  +
Sbjct: 507 LLRWTSPR 514


>gnl|CDD|177831 PLN02174, PLN02174, aldehyde dehydrogenase family 3 member H1.
          Length = 484

 Score =  135 bits (340), Expect = 2e-34
 Identities = 100/377 (26%), Positives = 185/377 (49%), Gaps = 22/377 (5%)

Query: 134 DWFSEEARRTYGELVPSPVKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCT 193
           +W + E  +T     P+  +     +V EPLGVV +I+ WN+P  +    V  A++AG  
Sbjct: 88  NWMAPEKAKTSLTTFPASAE-----IVSEPLGVVLVISAWNYPFLLSIDPVIGAISAGNA 142

Query: 194 CVIKPAEDTPLTALALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTG 253
            V+KP+E  P ++  LA L  E  +    + VV  + T   ++            I +TG
Sbjct: 143 VVLKPSELAPASSALLAKLL-EQYLDSSAVRVVEGAVTETTAL-----LEQKWDKIFYTG 196

Query: 254 STQVGKILYSHCSQGIKRLSLELGGNAPFIVYSNANLQKAVSGAMAAKFR-NCGQACVAA 312
           S+++G+++ +  ++ +  + LELGG +P +V S+ +L+  V   +A K+  N GQAC++ 
Sbjct: 197 SSKIGRVIMAAAAKHLTPVVLELGGKSPVVVDSDTDLKVTVRRIIAGKWGCNNGQACISP 256

Query: 313 NRFLIQEQVFDQFVEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEKVDSIVQDAISK 372
           +  L  ++   + ++ +K E+   + +      D   +  ++N    +++  ++ D    
Sbjct: 257 DYILTTKEYAPKVIDAMKKELETFYGKNPMESKD---MSRIVNSTHFDRLSKLL-DEKEV 312

Query: 373 GAKVILGGGKVTKLGSKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANN 432
             K++ GG K  +  +   APTIL ++  D L  +EE+FGP+   +     EES  V  +
Sbjct: 313 SDKIVYGGEKDRE--NLKIAPTILLDVPLDSLIMSEEIFGPLLPILTLNNLEESFDVIRS 370

Query: 433 SNTGLAGYFFTNDISQAWRVGKKLQVGMVGINEGIIS---HTEAAFGGVKESGLGREGSR 489
               LA Y FT++     R    +  G + +N+  +    HT   FGGV ESG+G    +
Sbjct: 371 RPKPLAAYLFTHNKKLKERFAATVSAGGIVVNDIAVHLALHT-LPFGGVGESGMGAYHGK 429

Query: 490 HGMEEFTQIKYMCFGNL 506
              + F+  K + + +L
Sbjct: 430 FSFDAFSHKKAVLYRSL 446


>gnl|CDD|143446 cd07128, ALDH_MaoC-N, N-terminal domain of the monoamine oxidase C
           dehydratase.  The N-terminal domain of the MaoC
           dehydratase, a monoamine oxidase regulatory protein.
           Orthologs of MaoC include PaaZ (Escherichia coli) and
           PaaN (Pseudomonas putida), which are putative
           ring-opening enzymes of the aerobic phenylacetic acid
           (PA) catabolic pathway. The C-terminal domain of MaoC
           has sequence similarity to enoyl-CoA hydratase. Also
           included in this CD is a novel Burkholderia xenovorans
           LB400 ALDH of the aerobic benzoate oxidation (box)
           pathway. This pathway involves first the synthesis of a
           CoA thio-esterified aromatic acid, with subsequent
           dihydroxylation and cleavage steps, yielding the CoA
           thio-esterified aliphatic aldehyde,
           3,4-dehydroadipyl-CoA semialdehyde, which is further
           converted into its corresponding CoA acid by the
           Burkholderia LB400 ALDH.
          Length = 513

 Score =  116 bits (293), Expect = 5e-28
 Identities = 119/474 (25%), Positives = 180/474 (37%), Gaps = 101/474 (21%)

Query: 31  QAYINGEWVHSSKK-FPVYNPANDEHLADVPDMGV---EDVQYAIMSAKEAFEQWSQTTP 86
           Q+Y+ G+W   +     +++    E +A V   G+     V YA      A       T 
Sbjct: 2   QSYVAGQWHAGTGDGRTLHDAVTGEVVARVSSEGLDFAAAVAYAREKGGPALRAL---TF 58

Query: 87  KERGIVLKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVD---------WFS 137
            ER  +LK   K L   K+ L  +                   +S++D          ++
Sbjct: 59  HERAAMLKALAKYLMERKEDLYALSAATGA----------TRRDSWIDIDGGIGTLFAYA 108

Query: 138 EEARRTYGELVPSPVK--------SKE-------LLMVREPLGVVGLITPWNFPMAMITR 182
              RR   EL  +           SK+       +L  R   GV   I  +NFP+  +  
Sbjct: 109 SLGRR---ELPNAHFLVEGDVEPLSKDGTFVGQHILTPRR--GVAVHINAFNFPVWGMLE 163

Query: 183 KVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQI-PKGVLNVVTSSRTNANSVGKLLC 241
           K APAL AG   ++KPA  T     A+     E+ + P+G L ++        SVG LL 
Sbjct: 164 KFAPALLAGVPVIVKPATATAYLTEAVVKDIVESGLLPEGALQLI------CGSVGDLL- 216

Query: 242 TSPHVAG---ISFTGSTQVGKILYSH---CSQGIKRLSLE--------LGGNAP------ 281
              H+     ++FTGS      L +H    ++ I R + E        LG +A       
Sbjct: 217 --DHLGEQDVVAFTGSAATAAKLRAHPNIVARSI-RFNAEADSLNAAILGPDATPGTPEF 273

Query: 282 --FIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQ 339
             F+        K V+  M  K    GQ C A  R  + E   D  +E LK  + K  + 
Sbjct: 274 DLFV--------KEVAREMTVK---AGQKCTAIRRAFVPEARVDAVIEALKARLAKVVV- 321

Query: 340 GDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGK------VTKLGSKFFAP 393
           GD    +   +GPL++  Q E V + V   +   A+V+ GG               FF P
Sbjct: 322 GDPR-LEGVRMGPLVSREQREDVRAAV-ATLLAEAEVVFGGPDRFEVVGADAEKGAFFPP 379

Query: 394 TILTNITPDM--LCYTEEVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTND 445
           T+L    PD     +  E FGPVA  + + +  E++ +A      L     TND
Sbjct: 380 TLLLCDDPDAATAVHDVEAFGPVATLMPYDSLAEAIELAARGRGSLVASVVTND 433


>gnl|CDD|143441 cd07123, ALDH_F4-17_P5CDH, Delta(1)-pyrroline-5-carboxylate
           dehydrogenase, ALDH families 4 and 17.
           Delta(1)-pyrroline-5-carboxylate dehydrogenase
           (EC=1.5.1.12 ), families 4 and 17: a proline catabolic
           enzyme of the aldehyde dehydrogenase (ALDH) protein
           superfamily.  Delta(1)-pyrroline-5-carboxylate
           dehydrogenase (P5CDH), also known as ALDH4A1 in humans, 
           is a mitochondrial  homodimer involved in proline
           degradation and catalyzes the NAD + -dependent
           conversion of P5C to glutamate. This is a necessary step
           in the pathway interconnecting the urea and
           tricarboxylic acid cycles. The preferred substrate is
           glutamic gamma-semialdehyde, other substrates include
           succinic, glutaric and adipic semialdehydes. Also
           included in this CD is the Aldh17 Drosophila
           melanogaster (Q9VUC0) P5CDH and similar sequences.
          Length = 522

 Score =  116 bits (293), Expect = 6e-28
 Identities = 125/478 (26%), Positives = 200/478 (41%), Gaps = 43/478 (8%)

Query: 32  AYINGEWVHSSKKFPVYNPANDEH-LADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERG 90
             I G+ V +        P +  H LA         V+ AI +A EA ++W++   ++R 
Sbjct: 35  LVIGGKEVRTGNTGKQVMPHDHAHVLATYHYADAALVEKAIEAALEARKEWARMPFEDRA 94

Query: 91  IVLKRWLKELEGNKQA--LAEIMTQESGKPPTESLGEVVYANSFVDWF---SEEARRTYG 145
            +  +    L G  +    A  M  + GK   ++  E+  A   +D+     + A   Y 
Sbjct: 95  AIFLKAADLLSGKYRYELNAATMLGQ-GKNVWQA--EIDAACELIDFLRFNVKYAEELYA 151

Query: 146 ELVPSPVKSKELLMVREPL-GVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPL 204
           +   S        +   PL G V  ++P+NF         APAL  G   + KP++   L
Sbjct: 152 QQPLSSPAGVWNRLEYRPLEGFVYAVSPFNFTAIGGNLAGAPALM-GNVVLWKPSDTAVL 210

Query: 205 TALALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSH 264
           +   +  +  EA +P GV+N V         VG  +  SPH+AG+ FTGST   K L+  
Sbjct: 211 SNYLVYKILEEAGLPPGVINFVPGD---GPVVGDTVLASPHLAGLHFTGSTPTFKSLWKQ 267

Query: 265 CSQGIK------RLSLELGGNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQ 318
             + +       R+  E GG    +V+ +A++   V+  +   F   GQ C AA+R  + 
Sbjct: 268 IGENLDRYRTYPRIVGETGGKNFHLVHPSADVDSLVTATVRGAFEYQGQKCSAASRAYVP 327

Query: 319 EQVFDQFVEMLKVEITKSFIQGDTSKNDKCN-LGPLINLAQVEKVDSIVQDAISKGAKVI 377
           E ++ +  E L +E  K    GD    D  N +G +I+    +++   +  A S     I
Sbjct: 328 ESLWPEVKERL-LEELKEIKMGDPD--DFSNFMGAVIDEKAFDRIKGYIDHAKSDPEAEI 384

Query: 378 LGGGKVTKLGSKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTE--EESLCVANNSNT 435
           + GGK       F  PT++    P     TEE+FGPV     +     EE+L + + ++ 
Sbjct: 385 IAGGKCDDSVGYFVEPTVIETTDPKHKLMTEEIFGPVLTVYVYPDSDFEETLELVDTTSP 444

Query: 436 -GLAGYFFTND---ISQAWRVGKKLQ--VGMVGINEGIISHTEAA-----FGGVKESG 482
             L G  F  D   I +A      L+   G   IN+     T A      FGG + SG
Sbjct: 445 YALTGAIFAQDRKAIREA---TDALRNAAGNFYIND---KPTGAVVGQQPFGGARASG 496


>gnl|CDD|143403 cd07084, ALDH_KGSADH-like, ALDH subfamily: NAD(P)+-dependent
           alpha-ketoglutaric semialdehyde dehydrogenases and plant
           delta(1)-pyrroline-5-carboxylate dehydrogenase, ALDH
           family 12-like.  ALDH subfamily which includes the
           NAD(P)+-dependent, alpha-ketoglutaric semialdehyde
           dehydrogenases (KGSADH, EC 1.2.1.26); plant
           delta(1)-pyrroline-5-carboxylate dehydrogenase (P5CDH,
           EC=1.5.1.12 ), ALDH family 12; the N-terminal domain of
           the MaoC (monoamine oxidase C) dehydratase regulatory
           protein; and orthologs of MaoC, PaaZ and PaaN, which are
           putative ring-opening enzymes of the aerobic
           phenylacetic acid catabolic pathway.
          Length = 442

 Score =  108 bits (270), Expect = 2e-25
 Identities = 76/389 (19%), Positives = 140/389 (35%), Gaps = 52/389 (13%)

Query: 67  VQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNKQALAEIMTQESGKPPTESLGE- 125
            + A+++A  + +   +    +R   L R ++ L      +A      +GK         
Sbjct: 1   PERALLAADISTKAARRLALPKRADFLARIIQRLAAKSYDIAAGAVLVTGKG-WMFAENI 59

Query: 126 ------------VVYANSFVDWFSEEARRTYGELVPSPVKSKELLMVREPLGVVGLITPW 173
                       V+Y+                         ++    R P G V +I  +
Sbjct: 60  CGDQVQLRARAFVIYSYRIPHEPGNH---------LGQGLKQQSHGYRWPYGPVLVIGAF 110

Query: 174 NFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQI-PKGVLNVVTSSRTN 232
           NFP+ +   ++A ALA G   ++KP     +    +  L H A + P   + ++      
Sbjct: 111 NFPLWIPLLQLAGALAMGNPVIVKPHTAVSIVMQIMVRLLHYAGLLPPEDVTLINGDGKT 170

Query: 233 ANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNAPFIVYSNANLQK 292
             +    L   P+   + FTGS++V + L     Q   R+ LEL G    ++  +A    
Sbjct: 171 MQA----LLLHPNPKMVLFTGSSRVAEKLALDAKQA--RIYLELAGFNWKVLGPDAQAVD 224

Query: 293 AVSGAMAAK-FRNCGQACVAANRFLIQEQVFDQ-FVEMLKVEITKSFIQGDTSKNDKCNL 350
            V+           GQ C A +   + E       VE LK  + +  ++          L
Sbjct: 225 YVAWQCVQDMTACSGQKCTAQSMLFVPENWSKTPLVEKLKALLARRKLEDLL-------L 277

Query: 351 GPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSKF------FAPTILTNITPDM- 403
           GP+    Q     +++    +    V+L  GK  K  S         A  +   I   + 
Sbjct: 278 GPV----QTFTTLAMIAHMENLLGSVLLFSGKELKNHSIPSIYGACVASALFVPIDEILK 333

Query: 404 --LCYTEEVFGPVAVCIKFKTEEESLCVA 430
                TEE+FGP A+ +++K ++ +L + 
Sbjct: 334 TYELVTEEIFGPFAIVVEYKKDQLALVLE 362


>gnl|CDD|131331 TIGR02278, PaaN-DH, phenylacetic acid degradation protein paaN.
           This enzyme is proposed to act in the ring-opening step
           of phenylacetic acid degradation which follows ligation
           of the acid with coenzyme A (by PaaF) and hydroxylation
           by a multicomponent non-heme iron hydroxylase complex
           (PaaGHIJK). Gene symbols have been standardized in. This
           enzyme is related to aldehyde dehydrogenases and has
           domains which are members of the pfam00171 and pfam01575
           families. This family includes paaN genes from
           Pseudomonas, Sinorhizobium, Rhodopseudomonas,
           Escherichia, Deinococcus and Corynebacterium. Another
           homology family (TIGR02288) includes several other
           species.
          Length = 663

 Score =  105 bits (263), Expect = 6e-24
 Identities = 114/445 (25%), Positives = 180/445 (40%), Gaps = 51/445 (11%)

Query: 31  QAYINGEWVH-SSKKFPVYNPANDEHLADVPDMGV---EDVQYAIMSAKEAFEQWSQTTP 86
           Q+Y++GEW     +  PV + +  E LA V   G+     V +A      A       T 
Sbjct: 2   QSYLSGEWRTGQGEGVPVRDASTGEVLARVTSEGLDVAAAVAWAREVGGPALRA---LTF 58

Query: 87  KERGIVLKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARR---- 142
            ER  +LK   + L   K+AL  +    +G    ++  ++      +  +S   RR    
Sbjct: 59  HERARMLKALAQYLSERKEALYALAAT-TGATRRDNWVDIDGGIGTLFTYSSLGRRELPD 117

Query: 143 ----TYGELVPSPVKSKELLMVREPL-----GVVGLITPWNFPMAMITRKVAPALAAGCT 193
                  E  P    SK+       +     GV   I  +NFP+  +  K APA  AG  
Sbjct: 118 ANLIPEDEFEPL---SKDGSFQGRHILTPKGGVAVQINAFNFPVWGLLEKFAPAFLAGVP 174

Query: 194 CVIKPAEDTPLTALALAALAHEAQI-PKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFT 252
            + KPA  T   A AL     E+ + P+G L ++        S G LL    H   ++FT
Sbjct: 175 TLAKPATPTAYVAEALVRTMVESGLLPEGSLQLICG------SAGDLLDHLDHRDVVAFT 228

Query: 253 GSTQVGKILYSH---CSQGIKRLSLE--------LGGNAPFIVYSNANLQKAVSGAMAAK 301
           GS      L +H     +GI R + E        LG +A           + +   +  K
Sbjct: 229 GSAATADRLRAHPNVLERGI-RFNAEADSLNAAILGEDATPDEPEFDLFAQEIVRELTIK 287

Query: 302 FRNCGQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQGDTSKNDKCNLGPLINLAQVEK 361
               GQ C A  R ++ + + +  ++ L+  + K  + GD  + +  ++GPL++L Q   
Sbjct: 288 ---AGQKCTAIRRVIVPKALLEAVLKALQARLAK-VVLGD-PREEGVDMGPLVSLEQRAD 342

Query: 362 VDSIVQDAISKGAKVILGGGKVTKLGSKFFAPTILTNITPDM-LCYTEEVFGPVAVCIKF 420
           V++ V   ++ GA+V LGG     L   FF PT+L    P     +  E FGPVA    +
Sbjct: 343 VEAAVAALLAAGAEVRLGGPGR--LDGAFFPPTLLLAEDPWAGAVHATEAFGPVATFFPY 400

Query: 421 KTEEESLCVANNSNTGLAGYFFTND 445
               E+  +A      L     T+D
Sbjct: 401 GDRAEAARLAARGGGSLVATLATSD 425


>gnl|CDD|143447 cd07129, ALDH_KGSADH, Alpha-Ketoglutaric Semialdehyde
           Dehydrogenase.  Alpha-Ketoglutaric Semialdehyde (KGSA)
           Dehydrogenase (KGSADH, EC 1.2.1.26) catalyzes the
           NAD(P)+-dependent conversion of KGSA to
           alpha-ketoglutarate. This CD contains such sequences as
           those seen in Azospirillum brasilense, KGSADH-II
           (D-glucarate/D-galactarate-inducible) and KGSADH-III
           (hydroxy-L-proline-inducible). Both show similar high
           substrate specificity for KGSA and different coenzyme
           specificity; KGSADH-II is NAD+-dependent and KGSADH-III
           is NADP+-dependent. Also included in this CD is the
           NADP(+)-dependent aldehyde dehydrogenase from Vibrio
           harveyi which catalyzes the oxidation of long-chain
           aliphatic aldehydes to acids.
          Length = 454

 Score =  103 bits (258), Expect = 1e-23
 Identities = 95/397 (23%), Positives = 143/397 (36%), Gaps = 71/397 (17%)

Query: 74  AKEAFEQWSQTTPKERGIVLKRWLKELEGNKQALAEIMTQESGKPPTESLGE-------- 125
           A  AFE +   +P  R   L+    E+E     L      E+G P     GE        
Sbjct: 8   AAAAFESYRALSPARRAAFLEAIADEIEALGDELVARAHAETGLPEARLQGELGRTTGQL 67

Query: 126 -----VVYANSFVDWFSEEARRTYGELVPSPVKSKELLMVREPLGVVGLITPWNFPMAMI 180
                +V   S++D   + A     + +P P    +L  +  PLG V +    NFP+A  
Sbjct: 68  RLFADLVREGSWLDARIDPADPD-RQPLPRP----DLRRMLVPLGPVAVFGASNFPLAFS 122

Query: 181 TR--KVAPALAAGCTCVIKPAEDTPLTALALAALAHEA----QIPKGVLNVVTSSRTNAN 234
                 A ALAAGC  V+K     P T+  +A     A     +P GV +++        
Sbjct: 123 VAGGDTASALAAGCPVVVKAHPAHPGTSELVARAIRAALRATGLPAGVFSLLQGGG---R 179

Query: 235 SVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLS-----LELGGNAPFIVYSNAN 289
            VG  L   P +  + FTGS + G+ L+   +    R        ELG   P  +   A 
Sbjct: 180 EVGVALVKHPAIKAVGFTGSRRGGRALFDAAA---ARPEPIPFYAELGSVNPVFILPGA- 235

Query: 290 LQKAVSGAMAAKF-----RNCGQAC-----VAANRFLIQEQVFDQFVEMLKVEITKSFIQ 339
                  A+A  F        GQ C     V     +      D F+  L   +  +   
Sbjct: 236 -LAERGEAIAQGFVGSLTLGAGQFCTNPGLV----LVPAGPAGDAFIAALAEALAAA--- 287

Query: 340 GDTSKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGGGKVTKLGSKFFAPTIL--- 396
                  +  L P I  A  + V+++       G +V+ GG           APT+    
Sbjct: 288 -----PAQTMLTPGIAEAYRQGVEALAAAP---GVRVLAGGAAAEGGNQ--AAPTLFKVD 337

Query: 397 --TNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVAN 431
               +    L   EEVFGP ++ +++    E L VA 
Sbjct: 338 AAAFLADPAL--QEEVFGPASLVVRYDDAAELLAVAE 372


>gnl|CDD|237016 PRK11903, PRK11903, aldehyde dehydrogenase; Provisional.
          Length = 521

 Score =  100 bits (251), Expect = 2e-22
 Identities = 120/457 (26%), Positives = 185/457 (40%), Gaps = 68/457 (14%)

Query: 31  QAYINGEWVH-SSKKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKE----AFEQWSQTT 85
             Y+ G W   S    P+++P   E L  V   G+ D+  A   A+E    A       T
Sbjct: 6   ANYVAGRWQAGSGAGTPLFDPVTGEELVRVSATGL-DLAAAFAFAREQGGAALRA---LT 61

Query: 86  PKERGIVLKRWLKELEGNKQALAEIMTQESGKPPTES-------LGEVVYANSFVDWFSE 138
             +R  +L   +K L+ N+ A  +I T  SG    +S       +  + Y         +
Sbjct: 62  YAQRAALLAAIVKVLQANRDAYYDIATANSGTTRNDSAVDIDGGIFTLGYYAKLGAALGD 121

Query: 139 EARRTYGE---LVPSPV-KSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTC 194
                 GE   L   P  + + +L+     GV   I  +NFP   +  K APAL AG   
Sbjct: 122 ARLLRDGEAVQLGKDPAFQGQHVLVPTR--GVALFINAFNFPAWGLWEKAAPALLAGVPV 179

Query: 195 VIKPAEDTPLTALALAALAHEAQI-PKGVLNVVTSSRTNANSVGKLLCTSPHVAG---IS 250
           ++KPA  T      +      A I P G L+VV        S   LL    H+     +S
Sbjct: 180 IVKPATATAWLTQRMVKDVVAAGILPAGALSVV------CGSSAGLL---DHLQPFDVVS 230

Query: 251 FTGSTQVGKILYSHCS--QGIKRLSLE--------LG-----GNAPFIVYSNANLQKAVS 295
           FTGS +   +L SH +  Q   R+++E        LG     G+  F         K V 
Sbjct: 231 FTGSAETAAVLRSHPAVVQRSVRVNVEADSLNSALLGPDAAPGSEAF-----DLFVKEVV 285

Query: 296 GAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQGDTSKNDKCNLGPLIN 355
             M  K    GQ C A  R  + E ++D   E L   + K+ + G+  +ND   +GPL++
Sbjct: 286 REMTVK---SGQKCTAIRRIFVPEALYDAVAEALAARLAKTTV-GN-PRNDGVRMGPLVS 340

Query: 356 LAQVEKVDSIVQDAISKGAKVILGGGKVTKLG-----SKFFAPTILTNITPD--MLCYTE 408
            AQ+  V + +  A+   A+V+  GG    +      +    PT+L    PD     +  
Sbjct: 341 RAQLAAVRAGLA-ALRAQAEVLFDGGGFALVDADPAVAACVGPTLLGASDPDAATAVHDV 399

Query: 409 EVFGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTND 445
           EVFGPVA  + ++    +L +A      L    +++D
Sbjct: 400 EVFGPVATLLPYRDAAHALALARRGQGSLVASVYSDD 436


>gnl|CDD|236931 PRK11563, PRK11563, bifunctional aldehyde dehydrogenase/enoyl-CoA
           hydratase; Provisional.
          Length = 675

 Score = 93.8 bits (234), Expect = 3e-20
 Identities = 118/475 (24%), Positives = 185/475 (38%), Gaps = 103/475 (21%)

Query: 31  QAYINGEWVHSSKK-FPVYNPANDEHLADVPDMGV---EDVQYAIMSAKEAFEQWSQTTP 86
           Q+Y+ G W     +  P+++    E +A V   G+     + YA      A       T 
Sbjct: 6   QSYVAGRWQTGQGEGRPLHDAVTGEPVARVSSEGLDFAAALAYAREVGGPALRAL---TF 62

Query: 87  KERGIVLKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVD---------WFS 137
            ER  +LK   K L   K+ L  +  Q +G   T         +S++D          ++
Sbjct: 63  HERAAMLKALAKYLLERKEELYALSAQ-TGA--TR-------RDSWIDIEGGIGTLFTYA 112

Query: 138 EEARRTYGELVPSPVK--------SKE-------LLMVREPLGVVGLITPWNFPMAMITR 182
            + RR   EL    V         SK        +L   E  GV   I  +NFP+  +  
Sbjct: 113 SKGRR---ELPNDTVLVEGEVEPLSKGGTFAGRHILTPLE--GVAVHINAFNFPVWGMLE 167

Query: 183 KVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQI-PKGVLNVVTSSRTNANSVGKLLC 241
           K+APA  AG   ++KPA  T     A+  L  E+ + P+G L ++  S       G LL 
Sbjct: 168 KLAPAFLAGVPAIVKPATATAYLTEAVVRLIVESGLLPEGALQLICGS------AGDLL- 220

Query: 242 TSPHVAG---ISFTGSTQVGKILYSH---CSQGIKRLSLE--------LG-----GNAPF 282
              H+ G   ++FTGS    + L +H    +  +   + E        LG     G   F
Sbjct: 221 --DHLDGQDVVTFTGSAATAQKLRAHPNVVANSV-PFTAEADSLNAAILGPDATPGTPEF 277

Query: 283 IVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVEITKSFIQGDT 342
            ++      K V   M  K    GQ C A  R ++   + D  +E L+  + K  + GD 
Sbjct: 278 DLFV-----KEVVREMTVK---AGQKCTAIRRAIVPRALVDAVIEALRARLAKVVV-GD- 327

Query: 343 SKNDKCNLGPLINLAQVEKVDSIVQDAISKGAKVILGG-------GKVTKLGSKFFAPTI 395
            + +   +G L +L Q E V   V  A+   A+++ GG       G   + G+ FF PT+
Sbjct: 328 PRLEGVRMGALASLEQREDVREKV-AALLAAAEIVFGGPDSFEVVGADAEKGA-FFPPTL 385

Query: 396 LTNITPDMLCYTEEV-----FGPVAVCIKFKTEEESLCVANNSNTGLAGYFFTND 445
           L    P        V     FGPV+  + +   +E++ +A      L     T D
Sbjct: 386 LYCDDPL---EAPAVHDVEAFGPVSTLMPYDDLDEAIELAARGKGSLVASLVTAD 437


>gnl|CDD|143444 cd07126, ALDH_F12_P5CDH, Delta(1)-pyrroline-5-carboxylate
           dehydrogenase, ALDH family 12.
           Delta(1)-pyrroline-5-carboxylate dehydrogenase (P5CDH,
           EC=1.5.1.12), family 12: a proline catabolic enzyme of
           the aldehyde dehydrogenase (ALDH) protein superfamily.
           P5CDH is a mitochondrial enzyme involved in proline
           degradation and catalyzes the NAD + -dependent
           conversion of P5C to glutamate.  The P5CDH, ALDH12A1
           gene, in Arabidopsis, has been identified as an
           osmotic-stress-inducible ALDH gene. This CD contains
           both Viridiplantae and Alveolata P5CDH sequences.
          Length = 489

 Score = 64.8 bits (158), Expect = 4e-11
 Identities = 98/435 (22%), Positives = 169/435 (38%), Gaps = 72/435 (16%)

Query: 33  YINGEWVHSSKKFPVYNPANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPK--ER- 89
            + G+W  +S    + +P N +    VPD   +++   + S ++  +       K  ER 
Sbjct: 2   LVAGKWKGASNYTTLLDPLNGDKFISVPDTDEDEINEFVDSLRQCPKSGLHNPLKNPERY 61

Query: 90  ---GIVLKRWLKELEGNKQA--LAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARR-- 142
              G V  R   EL   +     A ++ + + K   ++LGEVV    F++ F+ +  R  
Sbjct: 62  LLYGDVSHRVAHELRKPEVEDFFARLIQRVAPKSDAQALGEVVVTRKFLENFAGDQVRFL 121

Query: 143 TYGELVPSPVKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDT 202
                VP   + ++    R P G V +ITP+NFP+ +   ++  AL  G   ++K     
Sbjct: 122 ARSFNVPGDHQGQQSSGYRWPYGPVAIITPFNFPLEIPALQLMGALFMGNKPLLKVDSKV 181

Query: 203 PLTALALAALAHEAQIPKGVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILY 262
            +       L H   +P   ++++ S     N +  LL  +P +    FTGS++V     
Sbjct: 182 SVVMEQFLRLLHLCGMPATDVDLIHSDGPTMNKI--LLEANPRM--TLFTGSSKVA---- 233

Query: 263 SHCSQGIKRLSLELGGNAPFIVYSNANLQKAVSG----------------AMAAKFRNCG 306
                  +RL+LEL G    +   +A     + G                A A      G
Sbjct: 234 -------ERLALELHGK---VKLEDAGFDWKILGPDVSDVDYVAWQCDQDAYACS----G 279

Query: 307 QACVAANRFLIQEQVFDQ-FVEMLKVEITKSFIQGDTSKNDKCNLGPLI---NLAQVEKV 362
           Q C A +     E       ++ LK    +  ++  T       +GP++       ++ V
Sbjct: 280 QKCSAQSILFAHENWVQAGILDKLKALAEQRKLEDLT-------IGPVLTWTTERILDHV 332

Query: 363 DSIVQDAISKGAKVILGGGKVTK----------LGSKFFAPTILTNITPDMLCYTEEVFG 412
           D ++      GAKV+ GG  +T             +  F P     I  +    T EVFG
Sbjct: 333 DKLLA---IPGAKVLFGGKPLTNHSIPSIYGAYEPTAVFVPLEEIAIEENFELVTTEVFG 389

Query: 413 PVAVCIKFKTEEESL 427
           P  V  ++K E+  L
Sbjct: 390 PFQVVTEYKDEQLPL 404


>gnl|CDD|143396 cd07077, ALDH-like, NAD(P)+-dependent aldehyde dehydrogenase-like
           (ALDH-like) family.  The aldehyde dehydrogenase-like
           (ALDH-like) group of the ALDH superfamily of
           NAD(P)+-dependent enzymes which, in general, oxidize a
           wide range of  endogenous and exogenous aliphatic and
           aromatic aldehydes to their corresponding carboxylic
           acids and play an  important role in detoxification.
           This group includes families ALDH18, ALDH19, and ALDH20
           and represents such proteins as gamma-glutamyl phosphate
           reductase, LuxC-like acyl-CoA reductase, and coenzyme A
           acylating aldehyde dehydrogenase.  All of these proteins
           have a conserved cysteine that aligns with the catalytic
           cysteine of the ALDH group.
          Length = 397

 Score = 61.1 bits (148), Expect = 4e-10
 Identities = 72/339 (21%), Positives = 122/339 (35%), Gaps = 51/339 (15%)

Query: 64  VEDVQYAIMSAKEAFE----QWSQTTPKERGIVLKR----WLKELEGNKQALAEIMTQES 115
            E     I +   A      + +     ERG  ++     W+  +  ++  L + +  E 
Sbjct: 15  DEQRDLIINAIANALYDTRQRLASEAVSERGAYIRSLIANWIAMMGCSESKLYKNIDTER 74

Query: 116 GKPPTESLGEVVYANSFVDWFSEEARRTYGELVPSPVKSKELLMVREPLGVVGLITPWNF 175
           G   +                          L+P    + E  +   P+GV   I P   
Sbjct: 75  GITASVG-------------------HIQDVLLPD---NGETYVRAFPIGVTMHILPSTN 112

Query: 176 PMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPKGVLNVVTSSRTNANS 235
           P++ IT      +A    C+ +P    P T  ALA L   A    G   +V      ++ 
Sbjct: 113 PLSGIT-SALRGIATRNQCIFRPHPSAPFTNRALALLFQAADAAHGPKILVLYVPHPSDE 171

Query: 236 VGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNAPFIVYSNANLQKAVS 295
           + + L + P +  I  TG          H S  I  +     GN+P +V   A+ ++A  
Sbjct: 172 LAEELLSHPKIDLIVATGGRDAVDAAVKH-SPHIPVIGFG-AGNSPVVVDETADEERASG 229

Query: 296 GAMAAKFRNCGQACVAANRFLIQEQV----FDQF-----VEMLKV-EITKSFIQGDTSKN 345
               +KF +   AC +     + + V    +++F     VE LKV + TK   +  T   
Sbjct: 230 SVHDSKFFD-QNACASEQNLYVVDDVLDPLYEEFKLKLVVEGLKVPQETKPLSKETTPSF 288

Query: 346 DKC---NLGPLINLAQVEKVDSIVQDAISKGAKVILGGG 381
           D     ++ PL    +V  V S V++A      +I  GG
Sbjct: 289 DDEALESMTPLECQFRVLDVISAVENAW----MIIESGG 323


>gnl|CDD|131341 TIGR02288, PaaN_2, phenylacetic acid degradation protein paaN.
           This enzyme is proposed to act in the ring-opening step
           of phenylacetic acid degradation which follows ligation
           of the acid with coenzyme A (by PaaF) and hydroxylation
           by a multicomponent non-heme iron hydroxylase complex
           (PaaGHIJK). Gene symbols have been standardized in. This
           enzyme is related to aldehyde dehydrogenases and has a
           domain which is a member of the pfam00171 family. This
           family includes sequences from Burkholderia, Bordetella,
           Streptomyces. Other PaaN enzymes are represented by a
           separate model, TIGR02278.
          Length = 551

 Score = 56.0 bits (135), Expect = 3e-08
 Identities = 102/473 (21%), Positives = 172/473 (36%), Gaps = 76/473 (16%)

Query: 61  DMGVE----DVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNKQALAE------- 109
           ++GV     D    + +A  A   W     + R  V    L+ L      +A        
Sbjct: 76  ELGVTYPQCDGDALLDAAHAALPGWRDAGARARAGVCLEILQRLNARSFEIAHAVMHTTG 135

Query: 110 ---IMTQESGKPPTESLG--EVVYANSFVDWFSEEARRTYGELVPSPVKSKELLMVREPL 164
              +M  ++G P  +  G   V YA   +    E A     +    P+K K+   +  P 
Sbjct: 136 QAFMMAFQAGGPHAQDRGLEAVAYAYREMSRIPETAVWEKPQGKNDPLKLKKRFTI-VPR 194

Query: 165 GVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPKGVL- 223
           G+  +I    FP       +  +LA G   ++KP         A+  LA   Q+ + VL 
Sbjct: 195 GIALVIGCSTFPTWNTYPGLFASLATGNPVLVKPHPG------AILPLALTVQVAREVLG 248

Query: 224 ------NVVT-SSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLEL 276
                 N+VT ++    +   + L T P V  I FTGS   G+ L  +  Q   ++  E 
Sbjct: 249 EAGFDPNLVTLAAFDPGHEAAQRLATDPAVRIIDFTGSNAFGQWLEQNARQA--QVYTEK 306

Query: 277 GGNAPFIVYSNANLQKAVSGAMAAKFRNC---GQACVAANRFLI--------QEQV-FDQ 324
            G    I+ S  +  KA+   +A  F      GQ C      L+        Q +  +D+
Sbjct: 307 AGVNTVIIESTDDY-KAMLRNLA--FSLSLYSGQMCTTTQAILVPRDGIRTDQGRKSYDE 363

Query: 325 FVEMLKVEITKSFIQGDTSKNDKCN--LGPLINLAQVEKVDSIVQDAISKGAKVILGGGK 382
               L   I    + GD     +    LG + +   + +    + +A + G +V+L   K
Sbjct: 364 VAADLATAIDG--LLGD---PARATAVLGAIQSPDTLAR----IAEARALG-EVLLASTK 413

Query: 383 VTKL---GSKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNS---NTG 436
           +      G++   P +L     D   Y +E FGP+A  +       ++ +A  S      
Sbjct: 414 IEHPEFPGARVRTPLLLKCDAADEAAYMQERFGPIAFVVAVDDGAHAVELARRSVREKGA 473

Query: 437 LAGYFFTND-------ISQAWRVGKKLQVGMVGINEGIISHTEAAFGGVKESG 482
           +    +T D          AW     L + + G   G+  +  AAF     +G
Sbjct: 474 MTVGAYTTDPEVVDAVQEAAWDAAVALSLNLTG---GVFVNQSAAFSDFHGTG 523


>gnl|CDD|143445 cd07127, ALDH_PAD-PaaZ, Phenylacetic acid degradation proteins PaaZ
           (Escherichia coli) and PaaN (Pseudomonas putida)-like.
           Phenylacetic acid degradation (PAD) proteins PaaZ
           (Escherichia coli) and PaaN (Pseudomonas putida) are
           putative aromatic ring cleavage enzymes of the aerobic
           PA catabolic pathway. PaaZ mutants were defective for
           growth with PA as a sole carbon source due to
           interruption of the putative ring opening system.  This
           CD is limited to bacterial monofunctional enzymes.
          Length = 549

 Score = 52.5 bits (126), Expect = 3e-07
 Identities = 102/470 (21%), Positives = 168/470 (35%), Gaps = 84/470 (17%)

Query: 61  DMGVE----DVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNKQALAE------- 109
           ++GV     D    + +A+ A   W     + R  V    L+ L      +A        
Sbjct: 76  ELGVTYPQCDPDALLAAARAAMPGWRDAGARARAGVCLEILQRLNARSFEMAHAVMHTTG 135

Query: 110 ---IMTQESGKPPTESLG--EVVYANSFVDWFSEEARRTYGELVPSPVK-SKELLMVREP 163
              +M  ++G P  +  G   V YA   +      A     +    P+   K   +V  P
Sbjct: 136 QAFMMAFQAGGPHAQDRGLEAVAYAWREMSRIPPTAEWEKPQGKHDPLAMEKTFTVV--P 193

Query: 164 LGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPKGVL 223
            GV  +I    FP       +  +LA G   ++KP    P   L LA      Q+ + VL
Sbjct: 194 RGVALVIGCSTFPTWNGYPGLFASLATGNPVIVKPH---PAAILPLAI---TVQVAREVL 247

Query: 224 -------NVVT-SSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLE 275
                  N+VT ++ T    + + L T P V  I FTGS   G  L ++  Q   ++  E
Sbjct: 248 AEAGFDPNLVTLAADTPEEPIAQTLATRPEVRIIDFTGSNAFGDWLEANARQ--AQVYTE 305

Query: 276 LGGNAPFIVYSNANLQKAVSGAMAAKFRNC---GQACVAANRFLI--------QEQV-FD 323
             G    +V S  +L KA+   +A  F      GQ C       +          +  FD
Sbjct: 306 KAGVNTVVVDSTDDL-KAMLRNLA--FSLSLYSGQMCTTPQNIYVPRDGIQTDDGRKSFD 362

Query: 324 QFVEMLKVEITKSFIQGDTSKNDKCN--LGPLIN---LAQVEKVDSIVQDAISKGAKVIL 378
           +    L   I    +  D     +    LG + +   LA++ +          +  +V+L
Sbjct: 363 EVAADLAAAIDG--LLAD---PARAAALLGAIQSPDTLARIAEA--------RQLGEVLL 409

Query: 379 GGGKVTK---LGSKFFAPTILTNITPDMLCYTEEVFGPVAVCIKFKTEEESLCVANNS-- 433
               V       ++   P +L     D   Y EE FGP+A  +   + + S+ +A  S  
Sbjct: 410 ASEAVAHPEFPDARVRTPLLLKLDASDEAAYAEERFGPIAFVVATDSTDHSIELARESVR 469

Query: 434 -NTGLAGYFFTND-------ISQAWRVGKKLQVGMVGINEGIISHTEAAF 475
            +  +    ++ D          A   G  L + + G   G+  +  AAF
Sbjct: 470 EHGAMTVGVYSTDPEVVERVQEAALDAGVALSINLTG---GVFVNQSAAF 516


>gnl|CDD|131570 TIGR02518, EutH_ACDH, acetaldehyde dehydrogenase (acetylating). 
          Length = 488

 Score = 51.4 bits (123), Expect = 8e-07
 Identities = 66/275 (24%), Positives = 115/275 (41%), Gaps = 21/275 (7%)

Query: 64  VEDVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNKQALAEIMTQESGKPPTESL 123
           ++ V+  I SAK A ++ +  T ++   ++K  +     N   LA++  +E+G    E  
Sbjct: 7   IQQVRNLIRSAKVAQKKLANMTQEQIDKIVKAIVDAAYENAVKLAKMANEETGFGKWE-- 64

Query: 124 GEVVYANSFVDWFSEEA---RRTYGELVPSPVKSKELLMVREPLGVVGLITPWNFPMAMI 180
            + V  N F      ++    +T G L  S  K K+++ +  P+GVV  + P   P +  
Sbjct: 65  -DKVIKNVFAATIVYDSIKDMKTIGIL--SEDKEKKVIEIAVPVGVVAGLIPSTNPTSTA 121

Query: 181 TRKVAPALAAGCTCVIKP----AEDTPLTALALAALAHEAQIPKGVLNVVTSSRTNANSV 236
             K   ++ A    V  P     +    T   +   A EA  P+G +  +T       + 
Sbjct: 122 IYKTLISIKARNAIVFSPHPNAKKCIIETVKLMRKAAEEAGAPEGAIGCITVPTIEGTNE 181

Query: 237 GKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLELG-GNAPFIVYSNANLQKAVS 295
              L  +   + I  TG   + K  YS  +  I      +G GN P  +   AN++KAV 
Sbjct: 182 ---LMKNKDTSLILATGGEAMVKAAYSSGTPAIG-----VGPGNGPAYIERTANVKKAVR 233

Query: 296 GAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLK 330
             + +K  + G  C +    +++E   D  VE LK
Sbjct: 234 DIIDSKTFDNGTICASEQSIIVEECNKDAVVEELK 268


>gnl|CDD|237956 PRK15398, PRK15398, aldehyde dehydrogenase EutE; Provisional.
          Length = 465

 Score = 46.4 bits (111), Expect = 3e-05
 Identities = 85/352 (24%), Positives = 136/352 (38%), Gaps = 60/352 (17%)

Query: 49  NPANDEHLADVPDMGV-EDVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNKQAL 107
           +       A V +MGV   V  A+ +AK A +++ Q +   R  ++    + L  + + L
Sbjct: 19  SSQTVSPPAAVGEMGVFASVDDAVAAAKVAQQRYQQKSLAMRQRIIDAIREALLPHAEEL 78

Query: 108 AEIMTQESGKPPTESLGEV---VYANSFVDWFSEEARRTYG--ELVPSPVKSKELLMVRE 162
           AE+  +E+G      +G V   +  N         A +T G  +L    +     L + E
Sbjct: 79  AELAVEETG------MGRVEDKIAKNVAA------AEKTPGVEDLTTEALTGDNGLTLIE 126

Query: 163 --PLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAEDTPLTALALAALAHEAQIPK 220
             P GV+G +TP   P   I       LAAG + V  P       +L    L +EA I  
Sbjct: 127 YAPFGVIGAVTPSTNPTETIINNAISMLAAGNSVVFSPHPGAKKVSLRAIELLNEA-IVA 185

Query: 221 --GVLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLELGG 278
             G  N+V +         + L   P +A +  TG   V K        G K+      G
Sbjct: 186 AGGPENLVVTVAEPTIETAQRLMKHPGIALLVVTGGPAVVKAAM---KSG-KKAIGAGAG 241

Query: 279 NAPFIVYSNANLQKA----VSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLKVEIT 334
           N P +V   A+++KA    V G   A F N    C+A    ++ + V D+ + +++    
Sbjct: 242 NPPVVVDETADIEKAARDIVKG---ASFDN-NLPCIAEKEVIVVDSVADELMRLME---- 293

Query: 335 KSFIQGDTSKNDKCNLGPLINLAQVEKVDSIV--------QDAISKGAKVIL 378
                    KN       L+   Q EK+  +V        +  + K A  IL
Sbjct: 294 ---------KNGAV----LLTAEQAEKLQKVVLKNGGTVNKKWVGKDAAKIL 332


>gnl|CDD|143439 cd07121, ALDH_EutE, Ethanolamine utilization protein EutE-like.
           Coenzyme A acylating aldehyde dehydrogenase (ACDH), an
           NAD+ and CoA-dependent acetaldehyde dehydrogenase,
           acetylating (EC=1.2.1.10), converts acetaldehyde into
           acetyl-CoA.  This CD is limited to such monofunctional
           enzymes as the Ethanolamine utilization protein, EutE,
           in Salmonella typhimurium.  Mutations in eutE abolish
           the ability to utilize ethanolamine as a carbon source.
          Length = 429

 Score = 41.8 bits (99), Expect = 6e-04
 Identities = 71/283 (25%), Positives = 111/283 (39%), Gaps = 37/283 (13%)

Query: 63  GV-EDVQYAIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNKQALAEIMTQESGKPPTE 121
           GV   V  A+ +AK A +Q+ + T  +R  +++   + L  N + LAE+  +E+G     
Sbjct: 1   GVFATVDDAVAAAKAAQKQYRKCTLADREKIIEAIREALLSNAEELAEMAVEETG----- 55

Query: 122 SLGEV---VYANSFVDWFSEEARRTYGE--LVPSPVKSKE---LLMVREPLGVVGLITPW 173
            +G V   +  N         A +T G   L  +   S +    L+   P GV+G ITP 
Sbjct: 56  -MGRVEDKIAKNHLA------AEKTPGTEDLTTT-AWSGDNGLTLVEYAPFGVIGAITPS 107

Query: 174 NFPMAMITRKVAPALAAGCTCVIKP---AEDTPLTALAL--AALAHEAQIPKGVLNVVTS 228
             P   I       LAAG   V  P   A+     A+ L   A+A        V+ V   
Sbjct: 108 TNPTETIINNSISMLAAGNAVVFNPHPGAKKVSAYAVELINKAIAEAGGPDNLVVTVEEP 167

Query: 229 SRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNAPFIVYSNA 288
           +    N     L   P +  +  TG   V K   S      K+      GN P +V   A
Sbjct: 168 TIETTNE----LMAHPDINLLVVTGGPAVVKAALSSG----KKAIGAGAGNPPVVVDETA 219

Query: 289 NLQKAVSGAMA-AKFRNCGQACVAANRFLIQEQVFDQFVEMLK 330
           +++KA    +  A F N    C+A    +  + V D  +  ++
Sbjct: 220 DIEKAARDIVQGASFDN-NLPCIAEKEVIAVDSVADYLIAAMQ 261


>gnl|CDD|143440 cd07122, ALDH_F20_ACDH, Coenzyme A acylating aldehyde dehydrogenase
           (ACDH), ALDH family 20-like.  Coenzyme A acylating
           aldehyde dehydrogenase (ACDH, EC=1.2.1.10), an NAD+ and
           CoA-dependent acetaldehyde dehydrogenase, functions as a
           single enzyme (such as the Ethanolamine utilization
           protein, EutE, in Salmonella typhimurium) or as part of
           a multifunctional enzyme to convert acetaldehyde into
           acetyl-CoA . The E. coli aldehyde-alcohol dehydrogenase
           includes the functional domains, alcohol dehydrogenase
           (ADH), ACDH, and pyruvate-formate-lyase deactivase; and
           the Entamoeba histolytica aldehyde-alcohol dehydrogenase
           2 (ALDH20A1) includes the functional domains ADH and
           ACDH and may be critical enzymes in the fermentative
           pathway.
          Length = 436

 Score = 40.9 bits (97), Expect = 0.001
 Identities = 39/177 (22%), Positives = 71/177 (40%), Gaps = 19/177 (10%)

Query: 162 EPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKP----AEDTPLTALALAALAHEAQ 217
           EP+GV+  + P   P +    K   AL      +  P     + +   A  +   A  A 
Sbjct: 94  EPVGVIAALIPSTNPTSTAIFKALIALKTRNAIIFSPHPRAKKCSIEAAKIMREAAVAAG 153

Query: 218 IPKG---VLNVVTSSRTNANSVGKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSL 274
            P+G    +   +   T        L   P V  I  TG   + K  YS    G  + ++
Sbjct: 154 APEGLIQWIEEPSIELTQE------LMKHPDVDLILATGGPGMVKAAYS---SG--KPAI 202

Query: 275 ELG-GNAPFIVYSNANLQKAVSGAMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLK 330
            +G GN P  +   A++++AV   + +K  + G  C +    ++ ++++D+    LK
Sbjct: 203 GVGPGNVPAYIDETADIKRAVKDIILSKTFDNGTICASEQSVIVDDEIYDEVRAELK 259


>gnl|CDD|143400 cd07081, ALDH_F20_ACDH_EutE-like, Coenzyme A acylating aldehyde
           dehydrogenase (ACDH), Ethanolamine utilization protein
           EutE, and related proteins.  Coenzyme A acylating
           aldehyde dehydrogenase (ACDH), an NAD+ and CoA-dependent
           acetaldehyde dehydrogenase, acetylating (EC=1.2.1.10),
           functions as a single enzyme (such as the Ethanolamine
           utilization protein, EutE, in Salmonella typhimurium) or
           as part of a multifunctional enzyme to convert
           acetaldehyde into acetyl-CoA. The E. coli
           aldehyde-alcohol dehydrogenase includes the functional
           domains, alcohol dehydrogenase (ADH), ACDH, and
           pyruvate-formate-lyase deactivase; and the Entamoeba
           histolytica aldehyde-alcohol dehydrogenase 2 (ALDH20A1)
           includes the functional domains ADH and ACDH, and may be
           critical enzymes in the fermentative pathway.
          Length = 439

 Score = 40.3 bits (94), Expect = 0.002
 Identities = 59/274 (21%), Positives = 111/274 (40%), Gaps = 31/274 (11%)

Query: 70  AIMSAKEAFEQWSQTTPKERGIVLKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYA 129
           A+ +AK A +  S  + +   ++ +   +  E  +  LA++   E+G    E   + V  
Sbjct: 4   AVAAAKVAQQGLSCKSQEMVDLIFRAAAEAAEDARIDLAKLAVSETGMGRVE---DKVIK 60

Query: 130 NSFVDWFSEEARRTYG----ELVPSPVKSKE---LLMVREPLGVVGLITPWNFPMAMITR 182
           N F       A   Y     E     +   E    L++ EP+GVV  ITP   P + +  
Sbjct: 61  NHF------AAEYIYNVYKDEKTCGVLTGDENGGTLIIAEPIGVVASITPSTNPTSTVIF 114

Query: 183 KVAPALAAGCTCVIKP---AEDTPLTALAL---AALAHEAQIPKGVLNVVTSSRTNANSV 236
           K   +L    + +  P   A+     A  L   AA+A  A  P+ ++  + +       +
Sbjct: 115 KSLISLKTRNSIIFSPHPRAKKVTQRAATLLLQAAVA--AGAPENLIGWIDNPSI---EL 169

Query: 237 GKLLCTSPHVAGISFTGSTQVGKILYSHCSQGIKRLSLELGGNAPFIVYSNANLQKAVSG 296
            + L   P +  +  TG   V K  YS      K       GN P ++   A++++AV  
Sbjct: 170 AQRLMKFPGIGLLLATGGPAVVKAAYSSG----KPAIGVGAGNTPVVIDETADIKRAVQS 225

Query: 297 AMAAKFRNCGQACVAANRFLIQEQVFDQFVEMLK 330
            + +K  + G  C +    ++ + V+D+ + + +
Sbjct: 226 IVKSKTFDNGVICASEQSVIVVDSVYDEVMRLFE 259


>gnl|CDD|215908 pfam00413, Peptidase_M10, Matrixin.  The members of this family
          are enzymes that cleave peptides. These proteases
          require zinc for catalysis.
          Length = 159

 Score = 33.4 bits (77), Expect = 0.13
 Identities = 11/27 (40%), Positives = 18/27 (66%), Gaps = 3/27 (11%)

Query: 60 PDMGVEDVQYAIMSAKEAFEQWSQTTP 86
          PD+  ++V+ AI   + AF+ WS+ TP
Sbjct: 16 PDLSRDEVRRAI---RRAFKVWSEVTP 39


>gnl|CDD|219366 pfam07295, DUF1451, Protein of unknown function (DUF1451).  This
           family consists of several hypothetical bacterial
           proteins of around 160 residues in length. Members of
           this family contain four highly conserved cysteine
           resides toward the C-terminal region of the protein. The
           function of this family is unknown.
          Length = 148

 Score = 32.2 bits (74), Expect = 0.29
 Identities = 22/93 (23%), Positives = 40/93 (43%), Gaps = 11/93 (11%)

Query: 54  EHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPKERGIV---LKRWLKELEGNKQALAEI 110
           + L++      ++++ AI  AKE  +   + T +E  ++   LKR L+E   + +     
Sbjct: 3   DRLSERLKHTEKELKEAIEQAKEYLQAAEELTREELALIGAYLKRDLEEFLRSYE----- 57

Query: 111 MTQESGKPPTESLGEVVYANSFVDWFSEEARRT 143
              E+G+    S    +   S  DW S    RT
Sbjct: 58  ---ETGEADWLSPDLQLIEESLWDWLSSITDRT 87


>gnl|CDD|215751 pfam00150, Cellulase, Cellulase (glycosyl hydrolase family 5). 
          Length = 269

 Score = 32.3 bits (74), Expect = 0.43
 Identities = 11/30 (36%), Positives = 16/30 (53%)

Query: 349 NLGPLINLAQVEKVDSIVQDAISKGAKVIL 378
           N   LI+   + +VD +V  AI  G  VI+
Sbjct: 52  NPDYLIDENWLNRVDEVVDYAIDNGMYVII 81


>gnl|CDD|238317 cd00564, TMP_TenI, Thiamine monophosphate synthase (TMP
           synthase)/TenI. TMP synthase catalyzes an important step
           in the thiamine biosynthesis pathway, the substitution
           of the pyrophosphate of 2-methyl-4-amino-5-
           hydroxymethylpyrimidine pyrophosphate by 4-methyl-5-
           (beta-hydroxyethyl) thiazole phosphate to yield thiamine
           phosphate. TenI is a enzymatically inactive regulatory
           protein involved in the regulation of several
           extracellular enzymes. This superfamily also contains
           other enzymatically inactive proteins with unknown
           functions.
          Length = 196

 Score = 32.1 bits (74), Expect = 0.48
 Identities = 16/56 (28%), Positives = 19/56 (33%), Gaps = 5/56 (8%)

Query: 169 LITPWNFPMAMITRKVAPALAAGCTCVI---KPAEDTPLTALA--LAALAHEAQIP 219
           LIT        +   V  AL  G T V    K      L  LA  L  L  +  +P
Sbjct: 3   LITDRRLDGEDLLEVVEAALKGGVTLVQLREKDLSARELLELARALRELCRKYGVP 58


>gnl|CDD|234590 PRK00043, thiE, thiamine-phosphate pyrophosphorylase; Reviewed.
          Length = 212

 Score = 32.1 bits (74), Expect = 0.48
 Identities = 15/57 (26%), Positives = 19/57 (33%), Gaps = 6/57 (10%)

Query: 169 LITP-WNFPMAMITRKVAPALAAGCTCV---IKPAEDTPLTALA--LAALAHEAQIP 219
           LIT   +     +   V  AL  G T V    K  +      LA  L  L     +P
Sbjct: 11  LITDSRDDSGRDLLEVVEAALEGGVTLVQLREKGLDTRERLELARALKELCRRYGVP 67


>gnl|CDD|199855 cd06229, M14_Endopeptidase_I, Peptidase M14 carboxypeptidase
           family-like domain of Endopeptidase I.  Peptidase
           M14-like domain of Gamma-D-glutamyl-L-diamino acid
           endopeptidase 1 (also known as
           Gamma-D-glutamyl-meso-diaminopimelate peptidase I, and
           Endopeptidase I (ENP1); EC 3.4.19.11). ENP1 is a member
           of the M14 family of metallocarboxypeptidases (MCPs).
           However it has an exceptional type of activity of
           hydrolyzing the gamma-D-Glu-(L)meso-diaminopimelic acid
           (gamma-D-Glu-Dap) bond of
           L-Ala-gamma-D-Glu-(L)meso-diaminopimelic acid and
           L-Ala-gamma-D-Glu-(L)meso-diaminopimelic acid(L)-D-Ala
           peptides. ENP1has a different substrate specificity and
           cellular role than MpaA (MpaA does not belong to this
           group). ENP1 hydrolyzes the gamma-D-Glu-Dap bond of
           MurNAc-tripeptide and MurNAc-tetrapeptide, as well as
           the amide bond of free tripeptide and tetrapeptide .
           ENP1 is active on spore cortex peptidoglycan, and is
           produced at stage IV of sporulation in forespore and
           spore integuments.
          Length = 256

 Score = 30.7 bits (70), Expect = 1.5
 Identities = 12/38 (31%), Positives = 21/38 (55%), Gaps = 1/38 (2%)

Query: 105 QALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEARR 142
           Q +AE++ + SG    +S  +  Y   + DWF ++ RR
Sbjct: 190 QKMAELLAELSGYELVDSEPKASYG-GYKDWFIQKFRR 226


>gnl|CDD|143399 cd07080, ALDH_Acyl-CoA-Red_LuxC, Acyl-CoA reductase LuxC.  Acyl-CoA
           reductase, LuxC, (EC=1.2.1.50) is the fatty acid
           reductase enzyme responsible for synthesis of the
           aldehyde  substrate for the luminescent reaction
           catalyzed by luciferase. The fatty acid reductase, a
           luminescence-specific, multienzyme complex (LuxCDE),
           reduces myristic acid to generate the long chain fatty
           aldehyde required for the luciferase-catalyzed reaction
           resulting in the emission of blue-green light.
           Mutational studies of conserved cysteines of LuxC
           revealed that the cysteine which aligns with the
           catalytic cysteine conserved throughout the ALDH
           superfamily is the LuxC acylation site. This CD is
           composed of mainly bacterial sequences but also includes
           a few archaeal sequences similar to the Methanospirillum
           hungateiacyl acyl-CoA reductase RfbN.
          Length = 422

 Score = 30.7 bits (70), Expect = 1.8
 Identities = 26/130 (20%), Positives = 41/130 (31%), Gaps = 18/130 (13%)

Query: 92  VLKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYANSFVDWFSEEA-----RRTYG- 145
             KR L      +Q    ++   +G         +      +  F  E       R  G 
Sbjct: 36  AGKRLLDPDYPLRQQAERLLPTVTGYSEEMLREGLKRL---MALFRRENLERILERELGS 92

Query: 146 -----ELVPSPVKSKELLMVREPLGVVGLITPWNFPMAMITRKVAPALAAGCTCVIKPAE 200
                E VP         +  +P G+V  I   N P+  +   +   L      ++K + 
Sbjct: 93  PGILDEWVPPGRGG---YIRAQPRGLVVHIIAGNVPLLPVW-SIVRGLLVKNVNLLKMSS 148

Query: 201 DTPLTALALA 210
             PLTA AL 
Sbjct: 149 SDPLTATALL 158


>gnl|CDD|217120 pfam02581, TMP-TENI, Thiamine monophosphate synthase/TENI.
           Thiamine monophosphate synthase (TMP) (EC:2.5.1.3)
           catalyzes the substitution of the pyrophosphate of
           2-methyl-4-amino-5- hydroxymethylpyrimidine
           pyrophosphate by 4-methyl-5- (beta-hydroxyethyl)thiazole
           phosphate to yield thiamine phosphate. This Pfam family
           also includes the regulatory protein TENI.
          Length = 180

 Score = 30.2 bits (69), Expect = 1.8
 Identities = 17/56 (30%), Positives = 22/56 (39%), Gaps = 5/56 (8%)

Query: 169 LITPWNFPMAMITRKVAPALAAGCTCVI---KPAEDTPLTALA--LAALAHEAQIP 219
           L+T        +   V  AL  G T V    K  +D  L  LA  L AL  +  +P
Sbjct: 3   LVTDPGLDGEDLLEVVEEALKGGVTIVQLREKDLDDRELLELAERLRALCRKYGVP 58


>gnl|CDD|113011 pfam04221, RelB, RelB antitoxin.  RelE and RelB form a
           toxin-antitoxin system. RelE represses translation,
           probably through binding ribosomes. RelB stably binds
           RelE, presumably deactivating it.
          Length = 83

 Score = 28.2 bits (63), Expect = 2.8
 Identities = 11/44 (25%), Positives = 17/44 (38%), Gaps = 2/44 (4%)

Query: 87  KERGIVLKRWLKELEGNKQALAEIMTQESGKPPTESLGEVVYAN 130
              GI     L+  E N++ L  IM  E+G    ++       N
Sbjct: 40  LTGGIPFD--LRLPEPNQRTLVAIMELEAGNDEHKAKSVEKTLN 81


>gnl|CDD|239805 cd04278, ZnMc_MMP, Zinc-dependent metalloprotease, matrix
          metalloproteinase (MMP) sub-family. MMPs are
          responsible for a great deal of pericellular
          proteolysis of extracellular matrix and cell surface
          molecules, playing crucial roles in morphogenesis, cell
          fate specification, cell migration, tissue repair,
          tumorigenesis, gain or loss of tissue-specific
          functions, and apoptosis. In many instances, they are
          anchored to cell membranes via trans-membrane domains,
          and their activity is controlled via TIMPs (tissue
          inhibitors of metalloproteinases).
          Length = 157

 Score = 29.1 bits (66), Expect = 3.0
 Identities = 12/27 (44%), Positives = 15/27 (55%), Gaps = 3/27 (11%)

Query: 60 PDMGVEDVQYAIMSAKEAFEQWSQTTP 86
          PD+  +DV+ AI     AF  WS  TP
Sbjct: 16 PDLPRDDVRRAI---ARAFRVWSDVTP 39


>gnl|CDD|133037 cd04194, GT8_A4GalT_like, A4GalT_like proteins catalyze the
           addition of galactose or glucose residues to the
           lipooligosaccharide (LOS) or lipopolysaccharide (LPS) of
           the bacterial cell surface.  The members of this family
           of glycosyltransferases catalyze the addition of
           galactose or glucose residues to the lipooligosaccharide
           (LOS) or lipopolysaccharide (LPS) of the bacterial cell
           surface. The enzymes exhibit broad substrate
           specificities. The known functions found in this family
           include: Alpha-1,4-galactosyltransferase,
           LOS-alpha-1,3-D-galactosyltransferase,
           UDP-glucose:(galactosyl) LPS
           alpha1,2-glucosyltransferase, UDP-galactose: (glucosyl)
           LPS alpha1,2-galactosyltransferase, and
           UDP-glucose:(glucosyl) LPS alpha1,2-glucosyltransferase.
           Alpha-1,4-galactosyltransferase from N. meningitidis
           adds an alpha-galactose from UDP-Gal (the donor) to a
           terminal lactose (the acceptor) of the LOS structure of
           outer membrane. LOSs are virulence factors that enable
           the organism to evade the immune system of host cells.
           In E. coli, the three alpha-1,2-glycosyltransferases,
           that are involved in the synthesis of the outer core
           region of the LPS, are all members of this family. The
           three enzymes share 40 % of sequence identity, but have
           different sugar donor or acceptor specificities,
           representing the structural diversity of LPS.
          Length = 248

 Score = 29.5 bits (67), Expect = 3.7
 Identities = 16/45 (35%), Positives = 22/45 (48%), Gaps = 9/45 (20%)

Query: 413 PVAVCIKFKTEEESLCVANNSNTGLAGYFFTNDISQAWRVGKKLQ 457
            +AV IK      S+  ANNS      Y   +DIS+  +  KKL+
Sbjct: 14  YLAVTIK------SIL-ANNSKRDYDFYILNDDISEENK--KKLK 49


>gnl|CDD|227918 COG5631, COG5631, Predicted transcription regulator, contains HTH
          domain (MarR family) [Transcription].
          Length = 199

 Score = 29.2 bits (65), Expect = 3.7
 Identities = 13/27 (48%), Positives = 17/27 (62%)

Query: 55 HLADVPDMGVEDVQYAIMSAKEAFEQW 81
          HLA      V +V+YA+M A EAF +W
Sbjct: 42 HLARTLSPDVSEVEYALMRAYEAFGRW 68


>gnl|CDD|237687 PRK14357, glmU, bifunctional N-acetylglucosamine-1-phosphate
           uridyltransferase/glucosamine-1-phosphate
           acetyltransferase; Provisional.
          Length = 448

 Score = 29.3 bits (66), Expect = 5.9
 Identities = 19/67 (28%), Positives = 30/67 (44%), Gaps = 15/67 (22%)

Query: 336 SFIQGDTSKNDKCNLGPL-------------INLAQVEKVDSIVQDAISKGAKVILGGGK 382
           +FI+G T   + C +GP+             I  ++ EK  S+++D +S G    L  G 
Sbjct: 268 TFIEGKTRIGEDCEIGPMTRIVDCEIGNNVKIIRSECEK--SVIEDDVSVGPFSRLREGT 325

Query: 383 VTKLGSK 389
           V K   K
Sbjct: 326 VLKKSVK 332


>gnl|CDD|220933 pfam11001, DUF2841, Protein of unknown function (DUF2841).  This
           family of proteins with unknown function are all present
           in yeast.
          Length = 126

 Score = 28.0 bits (63), Expect = 6.0
 Identities = 14/44 (31%), Positives = 20/44 (45%), Gaps = 3/44 (6%)

Query: 76  EAFEQWSQTTPKERGIVLKRWLKELEGNKQALAEIMTQESGKPP 119
           +AF+   Q   K    + K W+K +E  KQA       +  KPP
Sbjct: 15  QAFKSLQQLNCKV---IAKAWIKLIEPKKQAKYPYNGGKKTKPP 55


>gnl|CDD|225548 COG3002, COG3002, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 880

 Score = 29.1 bits (65), Expect = 6.5
 Identities = 10/37 (27%), Positives = 12/37 (32%)

Query: 51  ANDEHLADVPDMGVEDVQYAIMSAKEAFEQWSQTTPK 87
           AN E L  +P             A      WSQ  P+
Sbjct: 668 ANQERLPSLPLNHNHQAPPPKDEALRFAHDWSQVRPE 704


>gnl|CDD|225268 COG2404, COG2404, Predicted phosphohydrolase (DHH superfamily)
           [General function prediction only].
          Length = 339

 Score = 28.2 bits (63), Expect = 9.8
 Identities = 16/78 (20%), Positives = 33/78 (42%), Gaps = 7/78 (8%)

Query: 306 GQACVAANRFLIQEQVFDQ-FVEMLKVEITKSFIQGDTSKNDKCNLGPL----INLAQVE 360
           G AC A  +    + V++  F   +   I    I     ++   +   +    +NL + E
Sbjct: 12  GYACAAVVKRFFGKNVYNANFGREVSARIN--SILESAEESGIGDAILISDLDVNLDRFE 69

Query: 361 KVDSIVQDAISKGAKVIL 378
           ++   +++A +KG KV  
Sbjct: 70  ELVEKLKEATNKGTKVKW 87


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.317    0.133    0.391 

Gapped
Lambda     K      H
   0.267   0.0731    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 25,267,983
Number of extensions: 2445734
Number of successful extensions: 2987
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2388
Number of HSP's successfully gapped: 160
Length of query: 507
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 406
Effective length of database: 6,457,848
Effective search space: 2621886288
Effective search space used: 2621886288
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 61 (27.3 bits)