RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy3652
         (394 letters)



>gnl|CDD|143448 cd07130, ALDH_F7_AASADH, NAD+-dependent alpha-aminoadipic
           semialdehyde dehydrogenase, ALDH family members 7A1 and
           7B.  Alpha-aminoadipic semialdehyde dehydrogenase
           (AASADH, EC=1.2.1.31), also known as ALDH7A1,
           Antiquitin-1, ALDH7B, or
           delta-1-piperideine-6-carboxylate dehydrogenase (P6CDH),
           is a NAD+-dependent ALDH. Human ALDH7A1 is involved in
           the pipecolic acid pathway of lysine catabolism,
           catalyzing the oxidation of alpha-aminoadipic
           semialdehyde to alpha-aminoadipate.  Arabidopsis
           thaliana ALDH7B4 appears to be an
           osmotic-stress-inducible ALDH gene encoding a
           turgor-responsive or stress-inducible ALDH. The
           Streptomyces clavuligerus P6CDH appears to be involved
           in cephamycin biosynthesis, catalyzing the second stage
           of the two-step conversion of lysine to
           alpha-aminoadipic acid.  The ALDH7A1 enzyme and others
           in this group have been observed as tetramers, yet the
           bacterial P6CDH enzyme has been reported as a monomer.
          Length = 474

 Score =  540 bits (1393), Expect = 0.0
 Identities = 213/475 (44%), Positives = 263/475 (55%), Gaps = 131/475 (27%)

Query: 38  GVYDGTSWKANGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRG 97
           GVYDG  W   G ++ SI+P+ GE IA V+Q   +DY   I++++ A+  W  +PAP+RG
Sbjct: 1   GVYDGE-WGGGGGVVTSISPANGEPIARVRQATPEDYESTIKAAQEAFKEWRDVPAPKRG 59

Query: 98  EIVRQIGDALRHKLVPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSILPS 157
           EIVRQIGDALR K   LG+LVSLEMGKIL EG+GEVQE IDICD+AVGLSR   G  +PS
Sbjct: 60  EIVRQIGDALRKKKEALGKLVSLEMGKILPEGLGEVQEMIDICDFAVGLSRQLYGLTIPS 119

Query: 158 ERPGHVLLENWNPLGVVGIISAFNFPVAVYGWNAAIALV----------------GVAVQ 201
           ERPGH ++E WNPLGVVG+I+AFNFPVAV+GWNAAIALV                 +AV 
Sbjct: 120 ERPGHRMMEQWNPLGVVGVITAFNFPVAVWGWNAAIALVCGNVVVWKPSPTTPLTAIAVT 179

Query: 202 SRFGKLLLELGGNNAI------------IVNEDADLNL---------------------- 227
               ++L + G   AI             + +D  + L                      
Sbjct: 180 KIVARVLEKNGLPGAIASLVCGGADVGEALVKDPRVPLVSFTGSTAVGRQVGQAVAARFG 239

Query: 228 ----------------------AQQCLVFACCGTAGQRCTTTRRLFLHKKKIDR------ 259
                                 A + ++FA  GTAGQRCTTTRRL +H+   D       
Sbjct: 240 RSLLELGGNNAIIVMEDADLDLAVRAVLFAAVGTAGQRCTTTRRLIVHESIYDEVLERLK 299

Query: 260 -------------PGYFVEP----------------------TIVTG------------- 271
                         G  V P                      T++ G             
Sbjct: 300 KAYKQVRIGDPLDDGTLVGPLHTKAAVDNYLAAIEEAKSQGGTVLFGGKVIDGPGNYVEP 359

Query: 272 ----LPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVTNLFKWLG 327
                  ++ +V  ETFAPI+YV  FD+L+EAI WNNEV QGLSSSIFT D+ N F+WLG
Sbjct: 360 TIVEGLSDAPIVKEETFAPILYVLKFDTLEEAIAWNNEVPQGLSSSIFTTDLRNAFRWLG 419

Query: 328 PQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRESGSDSWKQYCRRSTVTINHG 382
           P+GSDCG++NVNI T+GAEIGGAFGGEK TGGGRESGSD+WKQY RRST TIN+ 
Sbjct: 420 PKGSDCGIVNVNIGTSGAEIGGAFGGEKETGGGRESGSDAWKQYMRRSTCTINYS 474


>gnl|CDD|143405 cd07086, ALDH_F7_AASADH-like, NAD+-dependent alpha-aminoadipic
           semialdehyde dehydrogenase and related proteins.  ALDH
           subfamily which includes the NAD+-dependent,
           alpha-aminoadipic semialdehyde dehydrogenase (AASADH,
           EC=1.2.1.31), also known as Antiquitin-1, ALDH7A1,
           ALDH7B or delta-1-piperideine-6-carboxylate
           dehydrogenase (P6CDH), and other similar sequences, such
           as the uncharacterized aldehyde dehydrogenase of
           Candidatus kuenenia AldH (locus CAJ73105).
          Length = 478

 Score =  505 bits (1303), Expect = e-178
 Identities = 210/478 (43%), Positives = 257/478 (53%), Gaps = 133/478 (27%)

Query: 38  GVYDGTSWKANGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRG 97
           GV  G    + GE   S NP+ GE IA V   + +D    + ++R A+  W  +PAPRRG
Sbjct: 1   GVIGGEWVGSGGETFTSRNPANGEPIARVFPASPEDVEAAVAAAREAFKEWRKVPAPRRG 60

Query: 98  EIVRQIGDALRHKLVPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSILPS 157
           EIVRQIG+ALR K   LG+LVSLEMGKIL EG+GEVQE IDICDYAVGLSR   G  +PS
Sbjct: 61  EIVRQIGEALRKKKEALGRLVSLEMGKILPEGLGEVQEMIDICDYAVGLSRMLYGLTIPS 120

Query: 158 ERPGHVLLENWNPLGVVGIISAFNFPVAVYGWNAAIALV----------------GVAVQ 201
           ERPGH L+E WNPLGVVG+I+AFNFPVAV GWNAAIALV                 +AV 
Sbjct: 121 ERPGHRLMEQWNPLGVVGVITAFNFPVAVPGWNAAIALVCGNTVVWKPSETTPLTAIAVT 180

Query: 202 SRFGKLLLELGGNNAII------------VNEDADLNL---------------------- 227
               ++L + G    ++            +  D  + L                      
Sbjct: 181 KILAEVLEKNGLPPGVVNLVTGGGDGGELLVHDPRVPLVSFTGSTEVGRRVGETVARRFG 240

Query: 228 ----------------------AQQCLVFACCGTAGQRCTTTRRLFLHKKKIDR------ 259
                                 A + ++FA  GTAGQRCTTTRRL +H+   D       
Sbjct: 241 RVLLELGGNNAIIVMDDADLDLAVRAVLFAAVGTAGQRCTTTRRLIVHESVYDEFLERLV 300

Query: 260 -------------PGYFVEP----------------------TIVTG------------- 271
                         G  V P                      T++TG             
Sbjct: 301 KAYKQVRIGDPLDEGTLVGPLINQAAVEKYLNAIEIAKSQGGTVLTGGKRIDGGEPGNYV 360

Query: 272 -------LPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVTNLFK 324
                  +  ++R+V  ETFAPI+YV  FDSL+EAI  NN+V QGLSSSIFT+D+   F+
Sbjct: 361 EPTIVTGVTDDARIVQEETFAPILYVIKFDSLEEAIAINNDVPQGLSSSIFTEDLREAFR 420

Query: 325 WLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRESGSDSWKQYCRRSTVTINHG 382
           WLGP+GSDCG++NVNIPT+GAEIGGAFGGEK TGGGRESGSD+WKQY RRST TIN+ 
Sbjct: 421 WLGPKGSDCGIVNVNIPTSGAEIGGAFGGEKETGGGRESGSDAWKQYMRRSTCTINYS 478


>gnl|CDD|177949 PLN02315, PLN02315, aldehyde dehydrogenase family 7 member.
          Length = 508

 Score =  266 bits (681), Expect = 1e-84
 Identities = 128/257 (49%), Positives = 154/257 (59%), Gaps = 58/257 (22%)

Query: 195 LVGVAVQSRFGKLLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLHK 254
           +V   V +RFGK LLEL GNNAIIV +DAD+ LA + ++FA  GTAGQRCTT RRL LH+
Sbjct: 251 MVQQTVNARFGKCLLELSGNNAIIVMDDADIQLAVRSVLFAAVGTAGQRCTTCRRLLLHE 310

Query: 255 K----------------KIDRP---GYFVEP----------------------TIVTG-- 271
                            KI  P   G  + P                       I+TG  
Sbjct: 311 SIYDDVLEQLLTVYKQVKIGDPLEKGTLLGPLHTPESKKNFEKGIEIIKSQGGKILTGGS 370

Query: 272 ---------------LPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFT 316
                          +  ++ VV  E F P++YV  F +L+EAI  NN V QGLSSSIFT
Sbjct: 371 AIESEGNFVQPTIVEISPDADVVKEELFGPVLYVMKFKTLEEAIEINNSVPQGLSSSIFT 430

Query: 317 KDVTNLFKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRESGSDSWKQYCRRST 376
           ++   +FKW+GP GSDCG++NVNIPTNGAEIGGAFGGEK TGGGRE+GSDSWKQY RRST
Sbjct: 431 RNPETIFKWIGPLGSDCGIVNVNIPTNGAEIGGAFGGEKATGGGREAGSDSWKQYMRRST 490

Query: 377 VTINHGKEITLAQGIKF 393
            TIN+G E+ LAQGI F
Sbjct: 491 CTINYGNELPLAQGINF 507



 Score =  255 bits (652), Expect = 2e-80
 Identities = 108/175 (61%), Positives = 135/175 (77%), Gaps = 1/175 (0%)

Query: 22  KYPFLKELGLSGSVNPGVYDGTSWKANGEIIPSINPSTGETIASVQQGNVQDYHRCIESS 81
           +Y FL E+GLS S N G Y G  W+ANG ++ S+NP+  + IA V + +++DY   + + 
Sbjct: 7   EYEFLSEIGLS-SRNLGCYVGGEWRANGPLVSSVNPANNQPIAEVVEASLEDYEEGLRAC 65

Query: 82  RAAYAAWAALPAPRRGEIVRQIGDALRHKLVPLGQLVSLEMGKILAEGIGEVQEFIDICD 141
             A   W  +PAP+RGEIVRQIGDALR KL  LG+LVSLEMGKILAEGIGEVQE ID+CD
Sbjct: 66  EEAAKIWMQVPAPKRGEIVRQIGDALRAKLDYLGRLVSLEMGKILAEGIGEVQEIIDMCD 125

Query: 142 YAVGLSRTYSGSILPSERPGHVLLENWNPLGVVGIISAFNFPVAVYGWNAAIALV 196
           +AVGLSR  +GSI+PSERP H+++E WNPLG+VG+I+AFNFP AV GWNA IALV
Sbjct: 126 FAVGLSRQLNGSIIPSERPNHMMMEVWNPLGIVGVITAFNFPCAVLGWNACIALV 180


>gnl|CDD|143395 cd06534, ALDH-SF, NAD(P)+-dependent aldehyde dehydrogenase
           superfamily.  The aldehyde dehydrogenase superfamily
           (ALDH-SF) of  NAD(P)+-dependent enzymes, in general,
           oxidize a wide range of  endogenous and exogenous
           aliphatic and aromatic aldehydes to their corresponding
           carboxylic acids and play an  important role in
           detoxification. Besides aldehyde detoxification, many
           ALDH isozymes possess multiple additional catalytic and
           non-catalytic functions such as participating in
           metabolic pathways, or as binding proteins, or
           osmoregulants, to mention a few. The enzyme has three
           domains, a NAD(P)+ cofactor-binding domain, a catalytic
           domain, and a bridging domain; and the active enzyme is
           generally either homodimeric or homotetrameric. The
           catalytic mechanism is proposed to involve cofactor
           binding, resulting in a conformational change and
           activation of an invariant catalytic cysteine
           nucleophile. The cysteine and aldehyde substrate form an
           oxyanion thiohemiacetal intermediate resulting in
           hydride transfer to the cofactor and formation of a
           thioacylenzyme intermediate. Hydrolysis of the
           thioacylenzyme and release of the carboxylic acid
           product occurs, and in most cases, the reduced cofactor
           dissociates from the enzyme. The evolutionary
           phylogenetic tree of ALDHs appears to have an initial
           bifurcation between what has been characterized as the
           classical aldehyde dehydrogenases, the ALDH family
           (ALDH) and extended family members or aldehyde
           dehydrogenase-like (ALDH-L) proteins. The ALDH proteins
           are represented by enzymes which share a number of
           highly conserved residues necessary for catalysis and
           cofactor binding and they include such proteins as
           retinal dehydrogenase, 10-formyltetrahydrofolate
           dehydrogenase, non-phosphorylating glyceraldehyde
           3-phosphate dehydrogenase,
           delta(1)-pyrroline-5-carboxylate dehydrogenases,
           alpha-ketoglutaric semialdehyde dehydrogenase,
           alpha-aminoadipic semialdehyde dehydrogenase, coniferyl
           aldehyde dehydrogenase and succinate-semialdehyde
           dehydrogenase.  Included in this larger group are all
           human, Arabidopsis, Tortula, fungal, protozoan, and
           Drosophila ALDHs identified in families ALDH1 through
           ALDH22 with the exception of families ALDH18, ALDH19,
           and ALDH20 which are present in the ALDH-like group. The
           ALDH-like group is represented by such proteins as
           gamma-glutamyl phosphate reductase, LuxC-like acyl-CoA
           reductase, and coenzyme A acylating aldehyde
           dehydrogenase. All of these proteins have a conserved
           cysteine that aligns with the catalytic cysteine of the
           ALDH group.
          Length = 367

 Score =  176 bits (448), Expect = 1e-51
 Identities = 62/194 (31%), Positives = 94/194 (48%), Gaps = 7/194 (3%)

Query: 188 GWNAAIALVGVAVQSRFGKLLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTT 247
           G  A    +  A       + LELGG + +IV+EDADL+ A +  VF     AGQ CT  
Sbjct: 179 GSTAVGKAIMKAAAENLKPVTLELGGKSPVIVDEDADLDAAVEGAVFGAFFNAGQICTAA 238

Query: 248 RRLFLHKKKIDRPGYFVEP--TIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNE 305
            RL +H+   D    FVE   T++  +  +  +   E F P++ V  F   +EAI   N+
Sbjct: 239 SRLLVHESIYDE---FVEKLVTVLVDVDPDMPIAQEEIFGPVLPVIRFKDEEEAIALAND 295

Query: 306 VKQGLSSSIFTKDVTNLFKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRESGS 365
            + GL++ +FT+D+    +    +    G + +N  + G      FGG K +G GRE G 
Sbjct: 296 TEYGLTAGVFTRDLNRALRVA--ERLRAGTVYINDSSIGVGPEAPFGGVKNSGIGREGGP 353

Query: 366 DSWKQYCRRSTVTI 379
              ++Y R  TV I
Sbjct: 354 YGLEEYTRTKTVVI 367



 Score =  116 bits (293), Expect = 2e-29
 Identities = 53/123 (43%), Positives = 68/123 (55%), Gaps = 1/123 (0%)

Query: 79  ESSRAAYAAWAALPAPRRGEIVRQIGDALRHKLVPLGQLVSLEMGKILAEGIGEVQEFID 138
            ++RAA+ AWAALP   R  I+R+I D L  +   L  L +LE GK + E +GEV   ID
Sbjct: 1   AAARAAFKAWAALPPAERAAILRKIADLLEERREELAALETLETGKPIEEALGEVARAID 60

Query: 139 ICDYAVGLSRTYSGSILPSERPGHVLLENWNPLGVVGIISAFNFPVAVYGWNAAIALV-G 197
              YA GL+    G  LPS  PG        PLGVVG+I+ +NFP+ +  W  A AL  G
Sbjct: 61  TFRYAAGLADKLGGPELPSPDPGGEAYVRREPLGVVGVITPWNFPLLLAAWKLAPALAAG 120

Query: 198 VAV 200
             V
Sbjct: 121 NTV 123


>gnl|CDD|215767 pfam00171, Aldedh, Aldehyde dehydrogenase family.  This family of
           dehydrogenases act on aldehyde substrates. Members use
           NADP as a cofactor. The family includes the following
           members: The prototypical members are the aldehyde
           dehydrogenases EC:1.2.1.3. Succinate-semialdehyde
           dehydrogenase EC:1.2.1.16. Lactaldehyde dehydrogenase
           EC:1.2.1.22. Benzaldehyde dehydrogenase EC:1.2.1.28.
           Methylmalonate-semialdehyde dehydrogenase EC:1.2.1.27.
           Glyceraldehyde-3-phosphate dehydrogenase EC:1.2.1.9.
           Delta-1-pyrroline-5-carboxylate dehydrogenase EC:
           1.5.1.12. Acetaldehyde dehydrogenase EC:1.2.1.10.
           Glutamate-5-semialdehyde dehydrogenase EC:1.2.1.41. This
           family also includes omega crystallin, an eye lens
           protein from squid and octopus that has little aldehyde
           dehydrogenase activity.
          Length = 459

 Score =  153 bits (388), Expect = 5e-42
 Identities = 61/151 (40%), Positives = 81/151 (53%), Gaps = 1/151 (0%)

Query: 46  KANGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGD 105
            ++ E I  INP+TGE IA+V     +D    + ++RAA+ AWA  P   R  I+ +  D
Sbjct: 3   DSSSETIEVINPATGEVIATVPAATAEDVDAAVAAARAAFKAWAKTPPSERAAILLKAAD 62

Query: 106 ALRHKLVPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSILPSERPGHVLL 165
            L  +   L +L +LE GK LAE  GEV   ID   Y  GL+R   G  LPS  PG +  
Sbjct: 63  LLEERRDELAELETLETGKPLAEARGEVPRAIDTLRYYAGLARKLEGETLPS-DPGVLAY 121

Query: 166 ENWNPLGVVGIISAFNFPVAVYGWNAAIALV 196
               PLGVVG I+ +NFP+ +  W  A AL 
Sbjct: 122 TRREPLGVVGAITPWNFPLLLAAWKIAPALA 152



 Score =  152 bits (386), Expect = 9e-42
 Identities = 62/242 (25%), Positives = 96/242 (39%), Gaps = 61/242 (25%)

Query: 195 LVGVAVQSRFGKLLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLHK 254
            +  A      ++ LELGG N +IV +DADL+ A +  VF   G AGQ CT   RL +H+
Sbjct: 220 RIAKAAAKNLKRVTLELGGKNPLIVFDDADLDAAVEGAVFGAFGNAGQVCTAGSRLLVHE 279

Query: 255 --------------KKI-----DRPGYFVEP----------------------TIVTG-- 271
                         K +       P   + P                       ++ G  
Sbjct: 280 SIYDEFVERLVEAAKSLKVGDPLDPDTDIGPLISKKQRERVLSYIEDAKEEGAKLLCGGE 339

Query: 272 ----------------LPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIF 315
                           +  + R+   E F P++ V PF   DEAI   N+ + GL++ +F
Sbjct: 340 AGLEKGYFVEPTVLADVTPDMRIAQEEIFGPVLSVIPFKDEDEAIELANDTEYGLAAGVF 399

Query: 316 TKDVTNLFKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRESGSDSWKQYCRRS 375
           T D+    +    +  + G++ +N  T G      FGG K +G GRE G +  ++Y    
Sbjct: 400 TNDLERALRVA--RRLEAGMVWINDYTTGDPEALPFGGFKQSGFGREGGKEGLEEYTETK 457

Query: 376 TV 377
           TV
Sbjct: 458 TV 459


>gnl|CDD|223944 COG1012, PutA, NAD-dependent aldehyde dehydrogenases [Energy
           production and conversion].
          Length = 472

 Score =  139 bits (352), Expect = 6e-37
 Identities = 78/289 (26%), Positives = 109/289 (37%), Gaps = 71/289 (24%)

Query: 45  WKANGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIG 104
           W      I  INP+TGE IA+V     +D    + ++RAA+ AW+ L A  R  I+R+I 
Sbjct: 9   WVDGASTIEVINPATGEVIATVPAATAEDVDAAVAAARAAFEAWSRLSAEERAAILRRIA 68

Query: 105 DALRHKLVPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSILPSERPGHVL 164
           D L  +   L  L++LE GK ++E  GE+    D   Y    +R   G  +P+++    L
Sbjct: 69  DLLEARAEELAALITLETGKPISEARGEIARAADFIRYYAEEARRLEGETIPTDKGSKAL 128

Query: 165 LE-----------NWN----------------------------PLGVVGIISAFN---F 182
           +             WN                            PL  + +         
Sbjct: 129 VRREPLGVVGAITPWNFPLALAAWKLAPALAAGNTVVLKPSEQTPLSALALAELAAEAGL 188

Query: 183 PVAVYGW-----------------NAAIALVG---------VAVQSRFGKLLLELGGNNA 216
           P  V                      AI+  G          A  +    + LELGG + 
Sbjct: 189 PAGVLNVVTGGGAEVGDALVAHPDVDAISFTGSTAVGRAIAAAAAANLKPVTLELGGKSP 248

Query: 217 IIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLHKKKIDRPGYFVE 265
            IV EDADL+ A    VF     AGQRCT   RL +H+   D    FVE
Sbjct: 249 AIVLEDADLDAAVDAAVFGAFFNAGQRCTAASRLIVHESVYDE---FVE 294



 Score =  122 bits (307), Expect = 1e-30
 Identities = 46/124 (37%), Positives = 71/124 (57%), Gaps = 3/124 (2%)

Query: 260 PGYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDV 319
            GYFVEPTI+ G+  + R+   E F P++ V  F   +EAI   N+ + GL+++IFT+D+
Sbjct: 349 GGYFVEPTILEGVTPDMRIAREEIFGPVLPVIRFKDEEEAIELANDTEYGLAAAIFTRDL 408

Query: 320 TNLFKWLGPQGSDCGLINVNIPTNGAEIGGA-FGGEKYTGGGRESGSDSWKQYCRRSTVT 378
              F+    +  + G++ +N  T GA+I    FGG K +G GRE G    +++    TVT
Sbjct: 409 ARAFRVA--RRLEAGMVGINDYTGGADIAYLPFGGVKQSGLGREGGKYGLEEFTEVKTVT 466

Query: 379 INHG 382
           I  G
Sbjct: 467 IKLG 470


>gnl|CDD|143407 cd07088, ALDH_LactADH-AldA, Escherichia coli lactaldehyde
           dehydrogenase AldA-like.  Lactaldehyde dehydrogenase
           from Escherichia coli (AldA, LactADH, EC=1.2.1.22), an
           NAD(+)-dependent enzyme involved in the metabolism of
           L-fucose and L-rhamnose, and other similar sequences are
           present in this CD.
          Length = 468

 Score =  136 bits (345), Expect = 6e-36
 Identities = 53/150 (35%), Positives = 79/150 (52%)

Query: 47  ANGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDA 106
           ++GE I  +NP+TGE +A+V     +D  R ++++ AA  AW  LPA  R   +R++ D 
Sbjct: 10  SSGETIDVLNPATGEVVATVPAATAEDADRAVDAAEAAQKAWERLPAIERAAYLRKLADL 69

Query: 107 LRHKLVPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSILPSERPGHVLLE 166
           +R     L +L+  E GK L+    EV+   D  DY    +R   G I+PS+RP   +  
Sbjct: 70  IRENADELAKLIVEEQGKTLSLARVEVEFTADYIDYMAEWARRIEGEIIPSDRPNENIFI 129

Query: 167 NWNPLGVVGIISAFNFPVAVYGWNAAIALV 196
              P+GVV  I  +NFP  +     A ALV
Sbjct: 130 FKVPIGVVAGILPWNFPFFLIARKLAPALV 159



 Score = 76.9 bits (190), Expect = 3e-15
 Identities = 25/66 (37%), Positives = 40/66 (60%)

Query: 259 RPGYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKD 318
             GYF EPT++T +  +  +V  E F P++ V  F SLDEAI   N+ + GL+S I+T++
Sbjct: 351 EKGYFYEPTVLTNVRQDMEIVQEEIFGPVLPVVKFSSLDEAIELANDSEYGLTSYIYTEN 410

Query: 319 VTNLFK 324
           +    +
Sbjct: 411 LNTAMR 416



 Score = 52.7 bits (127), Expect = 1e-07
 Identities = 24/60 (40%), Positives = 35/60 (58%), Gaps = 3/60 (5%)

Query: 206 KLLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLHKKKIDRPGYFVE 265
           K+ LELGG    IV +DADL+LA + +V +     GQ CT   R+++H+   D    F+E
Sbjct: 238 KVSLELGGKAPAIVMKDADLDLAVKAIVDSRIINCGQVCTCAERVYVHEDIYDE---FME 294


>gnl|CDD|143449 cd07131, ALDH_AldH-CAJ73105, Uncharacterized Candidatus kuenenia
           aldehyde dehydrogenase AldH (CAJ73105)-like.
           Uncharacterized aldehyde dehydrogenase of Candidatus
           kuenenia AldH (locus CAJ73105) and similar sequences
           with similarity to alpha-aminoadipic semialdehyde
           dehydrogenase (AASADH, human ALDH7A1, EC=1.2.1.31),
           Arabidopsis ALDH7B4, and Streptomyces clavuligerus
           delta-1-piperideine-6-carboxylate dehydrogenase (P6CDH)
           are included in this CD.
          Length = 478

 Score =  136 bits (344), Expect = 8e-36
 Identities = 59/151 (39%), Positives = 83/151 (54%), Gaps = 1/151 (0%)

Query: 47  ANGEIIPSINPSTGETIASVQQ-GNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGD 105
           A+GE   S NP+  E +          D    +E++R A+  W  +PAPRR E + +  +
Sbjct: 11  ASGETFDSRNPADLEEVVGTFPLSTASDVDAAVEAAREAFPEWRKVPAPRRAEYLFRAAE 70

Query: 106 ALRHKLVPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSILPSERPGHVLL 165
            L+ +   L +LV+ EMGK LAEG G+VQE ID+  YA G  R   G  +PSE P    +
Sbjct: 71  LLKKRKEELARLVTREMGKPLAEGRGDVQEAIDMAQYAAGEGRRLFGETVPSELPNKDAM 130

Query: 166 ENWNPLGVVGIISAFNFPVAVYGWNAAIALV 196
               P+GVV +I+ +NFPVA+  W    ALV
Sbjct: 131 TRRQPIGVVALITPWNFPVAIPSWKIFPALV 161



 Score =  110 bits (277), Expect = 1e-26
 Identities = 47/126 (37%), Positives = 70/126 (55%), Gaps = 7/126 (5%)

Query: 260 PGYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDV 319
            GYFVEPT+ T +  + R+   E F P+V +    SL+EAI   N+ + GLSS+I+T+DV
Sbjct: 357 KGYFVEPTVFTDVTPDMRIAQEEIFGPVVALIEVSSLEEAIEIANDTEYGLSSAIYTEDV 416

Query: 320 TNLFKWLGPQGSDC--GLINVNIPTNGAEIGGAFGGEKYTGGG-RESGSDSWKQYCRRST 376
              F+       D   G+  VN PT GAE+   FGG K +G G RE+G+ +   +     
Sbjct: 417 NKAFRARR----DLEAGITYVNAPTIGAEVHLPFGGVKKSGNGHREAGTTALDAFTEWKA 472

Query: 377 VTINHG 382
           V +++ 
Sbjct: 473 VYVDYS 478



 Score = 65.1 bits (159), Expect = 2e-11
 Identities = 26/51 (50%), Positives = 35/51 (68%)

Query: 209 LELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLHKKKIDR 259
           LE+GG N IIV +DADL+LA +  +++  GT GQRCT T RL +H+   D 
Sbjct: 243 LEMGGKNPIIVMDDADLDLALEGALWSAFGTTGQRCTATSRLIVHESVYDE 293


>gnl|CDD|143415 cd07097, ALDH_KGSADH-YcbD, Bacillus subtilis NADP+-dependent
           alpha-ketoglutaric semialdehyde dehydrogenase ycbD-like.
            Kinetic studies of the Bacillus subtilis ALDH-like ycbD
           protein, which is involved in d-glucarate/d-galactarate
           utilization, reveal that it is a NADP+-dependent,
           alpha-ketoglutaric semialdehyde dehydrogenase (KGSADH).
           KGSADHs (EC 1.2.1.26) catalyze the NAD(P)+-dependent
           conversion of KGSA to alpha-ketoglutarate.
           Interestingly, the NADP+-dependent, tetrameric,
           2,5-dioxopentanoate dehydrogenase (EC=1.2.1.26), an
           enzyme involved in the catabolic pathway for D-arabinose
           in Sulfolobus solfataricus, also clusters in this group.
           This CD shows a distant phylogenetic relationship to the
           Azospirillum brasilense KGSADH-II (-III) group.
          Length = 473

 Score =  131 bits (333), Expect = 3e-34
 Identities = 56/161 (34%), Positives = 80/161 (49%), Gaps = 2/161 (1%)

Query: 37  PGVYDGTSWKANGEIIPSINPS-TGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPR 95
               DG  W A G+   + NPS T + +    + + +D    I ++ AA+ AW       
Sbjct: 2   RNYIDG-EWVAGGDGEENRNPSDTSDVVGKYARASAEDADAAIAAAAAAFPAWRRTSPEA 60

Query: 96  RGEIVRQIGDALRHKLVPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSIL 155
           R +I+ + GD L  +   L +L++ E GK L E  GEV     I  Y  G +   SG  L
Sbjct: 61  RADILDKAGDELEARKEELARLLTREEGKTLPEARGEVTRAGQIFRYYAGEALRLSGETL 120

Query: 156 PSERPGHVLLENWNPLGVVGIISAFNFPVAVYGWNAAIALV 196
           PS RPG  +     PLGVVG+I+ +NFP+A+  W  A AL 
Sbjct: 121 PSTRPGVEVETTREPLGVVGLITPWNFPIAIPAWKIAPALA 161



 Score = 84.2 bits (209), Expect = 9e-18
 Identities = 38/120 (31%), Positives = 61/120 (50%), Gaps = 15/120 (12%)

Query: 255 KKIDRP--GYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSS 312
           +++ RP  GY++ P +  G+ ++ R+   E F P+  V      DEA+   N+ + GLS+
Sbjct: 348 ERLKRPDEGYYLAPALFAGVTNDMRIAREEIFGPVAAVIRVRDYDEALAIANDTEFGLSA 407

Query: 313 SIFTKDVTNL-----FKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTG-GGRESGSD 366
            I T   T+L     FK    +  + G++ VN+PT G +    FGG K +  G RE G  
Sbjct: 408 GIVT---TSLKHATHFK----RRVEAGVVMVNLPTAGVDYHVPFGGRKGSSYGPREQGEA 460



 Score = 65.7 bits (161), Expect = 1e-11
 Identities = 28/70 (40%), Positives = 42/70 (60%), Gaps = 3/70 (4%)

Query: 196 VGVAVQSRFGKLLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLHKK 255
           +  A  +R  ++ LE+GG N ++V +DADL+LA +C V     + GQRCT + RL + + 
Sbjct: 230 IAAAAAARGARVQLEMGGKNPLVVLDDADLDLAVECAVQGAFFSTGQRCTASSRLIVTEG 289

Query: 256 KIDRPGYFVE 265
             DR   FVE
Sbjct: 290 IHDR---FVE 296


>gnl|CDD|143397 cd07078, ALDH, NAD(P)+ dependent aldehyde dehydrogenase family.
           The aldehyde dehydrogenase family (ALDH) of NAD(P)+
           dependent enzymes, in general, oxidize a wide range of
           endogenous and exogenous aliphatic and aromatic
           aldehydes to their corresponding carboxylic acids and
           play an  important role in detoxification. Besides
           aldehyde detoxification, many ALDH isozymes possess
           multiple additional catalytic and non-catalytic
           functions such as participating in  metabolic pathways,
           or as  binding proteins, or as osmoregulants, to mention
           a few. The enzyme has three domains, a NAD(P)+
           cofactor-binding domain, a catalytic domain, and a
           bridging domain; and the active enzyme  is generally
           either homodimeric or homotetrameric. The catalytic
           mechanism is proposed to involve cofactor binding,
           resulting in a conformational change and activation of
           an invariant catalytic cysteine nucleophile. The
           cysteine and aldehyde substrate form an oxyanion
           thiohemiacetal intermediate resulting in hydride
           transfer to the cofactor and formation of a
           thioacylenzyme intermediate. Hydrolysis of the
           thioacylenzyme and release of the carboxylic acid
           product occurs, and in most cases, the reduced cofactor
           dissociates from the enzyme. The evolutionary
           phylogenetic tree of ALDHs appears to have an initial
           bifurcation between what has been characterized as the
           classical aldehyde dehydrogenases, the ALDH family
           (ALDH) and extended family members or aldehyde
           dehydrogenase-like (ALDH-like) proteins. The ALDH
           proteins are represented by enzymes which share a number
           of highly conserved residues necessary for catalysis and
           cofactor binding and they include such proteins as
           retinal dehydrogenase, 10-formyltetrahydrofolate
           dehydrogenase, non-phosphorylating glyceraldehyde
           3-phosphate dehydrogenase,
           delta(1)-pyrroline-5-carboxylate dehydrogenases,
           alpha-ketoglutaric semialdehyde dehydrogenase,
           alpha-aminoadipic semialdehyde dehydrogenase, coniferyl
           aldehyde dehydrogenase and succinate-semialdehyde
           dehydrogenase.  Included in this larger group are all
           human, Arabidopsis, Tortula, fungal, protozoan, and
           Drosophila ALDHs identified in families ALDH1 through
           ALDH22 with the exception of families ALDH18, ALDH19,
           and ALDH20 which are present in the ALDH-like group.
          Length = 432

 Score =  130 bits (329), Expect = 6e-34
 Identities = 41/120 (34%), Positives = 64/120 (53%), Gaps = 2/120 (1%)

Query: 260 PGYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDV 319
            GYFV PT++T +  +  +   E F P++ V PF   +EAI   N+ + GL++ +FT+D+
Sbjct: 315 KGYFVPPTVLTDVDPDMPIAQEEIFGPVLPVIPFKDEEEAIELANDTEYGLAAGVFTRDL 374

Query: 320 TNLFKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRESGSDSWKQYCRRSTVTI 379
               +       + G + +N  + GAE    FGG K +G GRE G    ++Y    TVTI
Sbjct: 375 ERALRVAE--RLEAGTVWINDYSVGAEPSAPFGGVKQSGIGREGGPYGLEEYTEPKTVTI 432



 Score =  126 bits (318), Expect = 2e-32
 Identities = 69/246 (28%), Positives = 98/246 (39%), Gaps = 69/246 (28%)

Query: 78  IESSRAAYAAWAALPAPRRGEIVRQIGDALRHKLVPLGQLVSLEMGKILAEGIGEVQEFI 137
           + ++RAA+ AWAALP   R  I+R++ D L  +   L  L +LE GK + E +GEV    
Sbjct: 4   VAAARAAFKAWAALPPAERAAILRKLADLLEERREELAALETLETGKPIEEALGEVARAA 63

Query: 138 DICDYAVGLSRTYSGSILPSERPGHVLLENWNPLGVVGIISAFNFPVAVYGWNAAIAL-V 196
           D   Y  GL+R   G ++PS  PG + +    PLGVVG I+ +NFP+ +  W  A AL  
Sbjct: 64  DTFRYYAGLARRLHGEVIPSPDPGELAIVRREPLGVVGAITPWNFPLLLAAWKLAPALAA 123

Query: 197 GVAV------QS-----RFGKLLLELG----------GN-----NAIIVNEDADL----- 225
           G  V       +        +LL E G          G+      A+  +   D      
Sbjct: 124 GNTVVLKPSELTPLTALLLAELLAEAGLPPGVLNVVTGDGDEVGAALASHPRVDKISFTG 183

Query: 226 ----------NLAQQ-------------CLVF--------------ACCGTAGQRCTTTR 248
                       A+               +VF                 G AGQ CT   
Sbjct: 184 STAVGKAIMRAAAENLKRVTLELGGKSPLIVFDDADLDAAVKGAVFGAFGNAGQVCTAAS 243

Query: 249 RLFLHK 254
           RL +H+
Sbjct: 244 RLLVHE 249


>gnl|CDD|143417 cd07099, ALDH_DDALDH, Methylomonas sp.
           4,4'-diapolycopene-dialdehyde dehydrogenase-like.  The
           4,4'-diapolycopene-dialdehyde dehydrogenase (DDALDH)
           involved in C30 carotenoid synthesis in Methylomonas sp.
           strain 16a and other similar sequences are present in
           this CD. DDALDH converts 4,4'-diapolycopene-dialdehyde
           into 4,4'-diapolycopene-diacid.
          Length = 453

 Score =  110 bits (276), Expect = 1e-26
 Identities = 62/251 (24%), Positives = 99/251 (39%), Gaps = 76/251 (30%)

Query: 196 VGVAVQSRFGKLLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRL----- 250
           V  A   R   ++LELGG + +IV  DADL  A    V+     AGQ C +  R+     
Sbjct: 212 VMAAAAERLIPVVLELGGKDPMIVLADADLERAAAAAVWGAMVNAGQTCISVERVYVHES 271

Query: 251 ----FLHK--------------------------------------------------KK 256
               F+ +                                                   +
Sbjct: 272 VYDEFVARLVAKARALRPGADDIGDADIGPMTTARQLDIVRRHVDDAVAKGAKALTGGAR 331

Query: 257 IDRPGYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFT 316
            +  G F EPT++T +PH+  V+  ETF P++ V P    DEAI   N+ + GLS+S+F+
Sbjct: 332 SNGGGPFYEPTVLTDVPHDMDVMREETFGPVLPVMPVADEDEAIALANDSRYGLSASVFS 391

Query: 317 KDVTN-------LFKWLGPQGSDCGLINVN-IPTNGAEIGGAFGGEKYTGGGRESGSDSW 368
           +D+         L         + G +++N +          FGG K +GGGR  G++  
Sbjct: 392 RDLARAEAIARRL---------EAGAVSINDVLLTAGIPALPFGGVKDSGGGRRHGAEGL 442

Query: 369 KQYCRRSTVTI 379
           +++CR   +  
Sbjct: 443 REFCRPKAIAR 453



 Score = 76.1 bits (188), Expect = 4e-15
 Identities = 46/154 (29%), Positives = 72/154 (46%), Gaps = 12/154 (7%)

Query: 55  INPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDALRHKLVPL 114
            NP+TGE +  V   +  +    +  +RAA  AWAAL    R + + +   AL      L
Sbjct: 1   RNPATGEVLGEVPVTDPAEVAAAVARARAAQRAWAALGVEGRAQRLLRWKRALADHADEL 60

Query: 115 GQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSIL-PSERPGHVLLEN------ 167
            +L+  E GK  A+   EV   ++  D+A       +  +L P + P  +L+ N      
Sbjct: 61  AELLHAETGKPRADAGLEVLLALEAIDWAARN----APRVLAPRKVPTGLLMPNKKATVE 116

Query: 168 WNPLGVVGIISAFNFPVAVYGWNAAIALV-GVAV 200
           + P GVVG+IS +N+P+     +   AL  G AV
Sbjct: 117 YRPYGVVGVISPWNYPLLTPMGDIIPALAAGNAV 150


>gnl|CDD|143421 cd07103, ALDH_F5_SSADH_GabD, Mitochondrial succinate-semialdehyde
           dehydrogenase and ALDH family members 5A1 and 5F1-like. 
           Succinate-semialdehyde dehydrogenase, mitochondrial
           (SSADH, GabD, EC=1.2.1.24) catalyzes the NAD+-dependent
           oxidation of succinate semialdehyde (SSA) to succinate.
           This group includes the human aldehyde dehydrogenase
           family 5 member A1 (ALDH5A1) which is a mitochondrial
           homotetramer that converts SSA to succinate in the last
           step of 4-aminobutyric acid (GABA) catabolism. This CD
           also includes the Arabidopsis SSADH gene product
           ALDH5F1. Mutations in this gene result in the
           accumulation of H2O2, suggesting a role in plant defense
           against the environmental stress of elevated reactive
           oxygen species.
          Length = 451

 Score =  105 bits (265), Expect = 5e-25
 Identities = 51/139 (36%), Positives = 68/139 (48%), Gaps = 16/139 (11%)

Query: 55  INPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDALRHKLVPL 114
           INP+TGE I  V      D    I+++ AA+  W    A  R  I+R+  D +R +   L
Sbjct: 2   INPATGEVIGEVPDAGAADADAAIDAAAAAFKTWRKTTARERAAILRRWADLIRERAEDL 61

Query: 115 GQLVSLEMGKILAEGIGEVQEFIDICDYAVGL--------SRTYSGSILPSERPGHVLLE 166
            +L++LE GK LAE  GEV       DYA            R Y G  +PS  PG  +L 
Sbjct: 62  ARLLTLEQGKPLAEARGEV-------DYAASFLEWFAEEARRIY-GRTIPSPAPGKRILV 113

Query: 167 NWNPLGVVGIISAFNFPVA 185
              P+GVV  I+ +NFP A
Sbjct: 114 IKQPVGVVAAITPWNFPAA 132



 Score = 91.3 bits (228), Expect = 4e-20
 Identities = 37/114 (32%), Positives = 64/114 (56%), Gaps = 9/114 (7%)

Query: 255 KKIDRPGYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSI 314
           K++   GYF EPT++T +  +  +++ ETF P+  + PFD+ DE I   N+   GL++ +
Sbjct: 330 KRLGLGGYFYEPTVLTDVTDDMLIMNEETFGPVAPIIPFDTEDEVIARANDTPYGLAAYV 389

Query: 315 FTKDVTNLFKWLGPQGSDCGLINVNIPTNGAEIGGA---FGGEKYTGGGRESGS 365
           FT+D+   ++ +  +  + G++ +N       I  A   FGG K +G GRE G 
Sbjct: 390 FTRDLARAWR-VA-EALEAGMVGINTGL----ISDAEAPFGGVKESGLGREGGK 437



 Score = 49.4 bits (119), Expect = 2e-06
 Identities = 23/60 (38%), Positives = 33/60 (55%), Gaps = 3/60 (5%)

Query: 206 KLLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLHKKKIDRPGYFVE 265
           ++ LELGGN   IV +DADL+ A    + +    AGQ C    R+++H+   D    FVE
Sbjct: 222 RVSLELGGNAPFIVFDDADLDKAVDGAIASKFRNAGQTCVCANRIYVHESIYDE---FVE 278


>gnl|CDD|143464 cd07146, ALDH_PhpJ, Streptomyces putative phosphonoformaldehyde
           dehydrogenase PhpJ-like.  Putative phosphonoformaldehyde
           dehydrogenase (PhpJ), an aldehyde dehydrogenase homolog
           reportedly involved in the biosynthesis of
           phosphinothricin tripeptides in Streptomyces
           viridochromogenes DSM 40736, and similar sequences are
           included in this CD.
          Length = 451

 Score =  104 bits (261), Expect = 1e-24
 Identities = 67/243 (27%), Positives = 99/243 (40%), Gaps = 67/243 (27%)

Query: 198 VAVQSRFGKLLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRR-------- 249
           +A  + + + LLELGGN+ +IV +DADL  A    V      +GQRCT  +R        
Sbjct: 215 IAATAGYKRQLLELGGNDPLIVMDDADLERAATLAVAGSYANSGQRCTAVKRILVHESVA 274

Query: 250 -----LFLHKKK-----------------IDRP--------------------------G 261
                L + K                   ID                            G
Sbjct: 275 DEFVDLLVEKSAALVVGDPMDPATDMGTVIDEEAAIQIENRVEEAIAQGARVLLGNQRQG 334

Query: 262 YFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKD--- 318
               PT++  +P ++ +V  ETF P+  V     LDEAI  +N    GLSS + T D   
Sbjct: 335 ALYAPTVLDHVPPDAELVTEETFGPVAPVIRVKDLDEAIAISNSTAYGLSSGVCTNDLDT 394

Query: 319 VTNLFKWLGPQGSDCGLINVN-IPTNGAEIGGAFGGEKYTG-GGRESGSDSWKQYCRRST 376
           +  L + L     D G +NVN +P   +E+   FGG K +G GG+E   ++ K+     T
Sbjct: 395 IKRLVERL-----DVGTVNVNEVPGFRSEL-SPFGGVKDSGLGGKEGVREAMKEMTNVKT 448

Query: 377 VTI 379
            ++
Sbjct: 449 YSL 451


>gnl|CDD|143422 cd07104, ALDH_BenzADH-like, ALDH subfamily: NAD(P)+-dependent
           benzaldehyde dehydrogenase II, vanillin dehydrogenase,
           p-hydroxybenzaldehyde dehydrogenase and related
           proteins.  ALDH subfamily which includes the
           NAD(P)+-dependent, benzaldehyde dehydrogenase II (XylC,
           BenzADH, EC=1.2.1.28)  involved in the oxidation of
           benzyl alcohol to benzoate; p-hydroxybenzaldehyde
           dehydrogenase (PchA, HBenzADH) which catalyzes the
           oxidation of p-hydroxybenzaldehyde to p-hydroxybenzoic
           acid; vanillin dehydrogenase (Vdh, VaniDH) involved in
           the metabolism of ferulic acid as seen in Pseudomonas
           putida KT2440; and other related sequences.
          Length = 431

 Score =  103 bits (259), Expect = 2e-24
 Identities = 61/236 (25%), Positives = 95/236 (40%), Gaps = 62/236 (26%)

Query: 196 VGVAVQSRFGKLL----LELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLF 251
           VG  +    G+ L    LELGGNN +IV +DADL+LA     F      GQ C    R+ 
Sbjct: 190 VGRHIGELAGRHLKKVALELGGNNPLIVLDDADLDLAVSAAAFGAFLHQGQICMAAGRIL 249

Query: 252 LHK-----------------------------------KKIDRPGYFVE------PTIVT 270
           +H+                                   +++DR    VE        ++T
Sbjct: 250 VHESVYDEFVEKLVAKAKALPVGDPRDPDTVIGPLINERQVDRVHAIVEDAVAAGARLLT 309

Query: 271 GLPHNSRVV---------------HRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIF 315
           G  +                      E F P+  V PFD  +EA+   N+ + GLS+++F
Sbjct: 310 GGTYEGLFYQPTVLSDVTPDMPIFREEIFGPVAPVIPFDDDEEAVELANDTEYGLSAAVF 369

Query: 316 TKDVTNLFKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRESGSDSWKQY 371
           T+D+         +  + G++++N  T   E    FGG K +GGGR  G  S +++
Sbjct: 370 TRDLERAMA-FAER-LETGMVHINDQTVNDEPHVPFGGVKASGGGRFGGPASLEEF 423



 Score = 77.2 bits (191), Expect = 2e-15
 Identities = 42/112 (37%), Positives = 55/112 (49%)

Query: 73  DYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDALRHKLVPLGQLVSLEMGKILAEGIGE 132
           D  R   ++ AA  AWAA P   R  I+R+  + L  +   +   +  E G    +   E
Sbjct: 1   DVDRAYAAAAAAQKAWAATPPQERAAILRKAAEILEERRDEIADWLIRESGSTRPKAAFE 60

Query: 133 VQEFIDICDYAVGLSRTYSGSILPSERPGHVLLENWNPLGVVGIISAFNFPV 184
           V   I I   A GL R   G ILPS+ PG   +    PLGVVG+IS FNFP+
Sbjct: 61  VGAAIAILREAAGLPRRPEGEILPSDVPGKESMVRRVPLGVVGVISPFNFPL 112


>gnl|CDD|143404 cd07085, ALDH_F6_MMSDH, Methylmalonate semialdehyde dehydrogenase
           and ALDH family members 6A1 and 6B2.  Methylmalonate
           semialdehyde dehydrogenase (MMSDH, EC=1.2.1.27)
           [acylating] from Bacillus subtilis is involved in valine
           metabolism and catalyses the NAD+- and CoA-dependent
           oxidation of methylmalonate semialdehyde into
           propionyl-CoA. Mitochondrial human MMSDH ALDH6A1 and
           Arabidopsis MMSDH ALDH6B2 are also present in this CD.
          Length = 478

 Score =  103 bits (260), Expect = 3e-24
 Identities = 67/267 (25%), Positives = 103/267 (38%), Gaps = 72/267 (26%)

Query: 48  NGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDAL 107
             E +   NP+TGE IA V     ++    + +++AA+ AW+A P  +R +++ +    L
Sbjct: 14  TTEWLDVYNPATGEVIARVPLATAEEVDAAVAAAKAAFPAWSATPVLKRQQVMFKFRQLL 73

Query: 108 RHKLVPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSILPSERPGHVLLEN 167
              L  L +L++LE GK LA+  G+V   +++ ++A  +     G  L +   G      
Sbjct: 74  EENLDELARLITLEHGKTLADARGDVLRGLEVVEFACSIPHLLKGEYLENVARGIDTYSY 133

Query: 168 WNPLGVVGIISAFNFPVAVYGWNAAIALV--------------GVAVQSRFGKLLLELG- 212
             PLGVV  I+ FNFP  +  W   +A+               G A+  R  +LL E G 
Sbjct: 134 RQPLGVVAGITPFNFPAMIPLWMFPMAIACGNTFVLKPSERVPGAAM--RLAELLQEAGL 191

Query: 213 ---------GN----NAIIVNED------------------------------------- 222
                    G     NA++ + D                                     
Sbjct: 192 PDGVLNVVHGGKEAVNALLDHPDIKAVSFVGSTPVGEYIYERAAANGKRVQALGGAKNHA 251

Query: 223 -----ADLNLAQQCLVFACCGTAGQRC 244
                ADL      LV A  G AGQRC
Sbjct: 252 VVMPDADLEQTANALVGAAFGAAGQRC 278



 Score = 51.4 bits (124), Expect = 4e-07
 Identities = 25/81 (30%), Positives = 43/81 (53%), Gaps = 2/81 (2%)

Query: 261 GYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVT 320
           G FV PTI+  +  + ++   E F P++ +   D+LDEAI   N    G  ++IFT+   
Sbjct: 358 GNFVGPTILDNVTPDMKIYKEEIFGPVLSIVRVDTLDEAIAIINANPYGNGAAIFTRSGA 417

Query: 321 NLFKWLGPQGSDCGLINVNIP 341
              K+   +  D G++ +N+P
Sbjct: 418 AARKFQ--REVDAGMVGINVP 436


>gnl|CDD|143413 cd07094, ALDH_F21_LactADH-like, ALDH subfamily: NAD+-dependent,
           lactaldehyde dehydrogenase, ALDH family 21 A1, and
           related proteins.  ALDH subfamily which includes Tortula
           ruralis aldehyde dehydrogenase ALDH21A1 (RNP123), and
           NAD+-dependent, lactaldehyde dehydrogenase (EC=1.2.1.22)
           and like sequences.
          Length = 453

 Score =  101 bits (254), Expect = 1e-23
 Identities = 97/455 (21%), Positives = 154/455 (33%), Gaps = 135/455 (29%)

Query: 55  INPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDALRHKLVPL 114
            NP  GE I  V   +  D    + ++RA      ALP   R  I+ +  D L+ +    
Sbjct: 4   HNPYDGEVIGKVPADDRADAEEALATARAGAENRRALPPHERMAILERAADLLKKRAEEF 63

Query: 115 GQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSILPSE-------RPGHVLLEN 167
            ++++ E GK + +   EV   ID    A   +    G  +P +       R    + E 
Sbjct: 64  AKIIACEGGKPIKDARVEVDRAIDTLRLAAEEAERIRGEEIPLDATQGSDNRLAWTIRE- 122

Query: 168 WNPLGVVGIISAFNFPVAVYGWNAAIAL-VGVAV-----------QSRFGKLLLELG--- 212
             P+GVV  I+ FNFP+ +     A A+  G  V                K+L+E G   
Sbjct: 123 --PVGVVLAITPFNFPLNLVAHKLAPAIATGCPVVLKPASKTPLSALELAKILVEAGVPE 180

Query: 213 ------------GNNAIIVNED-------------------------------------- 222
                         +A   +E                                       
Sbjct: 181 GVLQVVTGEREVLGDAFAADERVAMLSFTGSAAVGEALRANAGGKRIALELGGNAPVIVD 240

Query: 223 --ADLNLAQQCLVFACCGTAGQRCTTTRRLFLH----------------KKKIDRP---G 261
             ADL+ A + L       AGQ C + +R+++H                K K+  P    
Sbjct: 241 RDADLDAAIEALAKGGFYHAGQVCISVQRIYVHEELYDEFIEAFVAAVKKLKVGDPLDED 300

Query: 262 YFVEP----------------------TIVTG---------------LPHNSRVVHRETF 284
             V P                       ++ G               +P ++++   ETF
Sbjct: 301 TDVGPLISEEAAERVERWVEEAVEAGARLLCGGERDGALFKPTVLEDVPRDTKLSTEETF 360

Query: 285 APIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVTNLFKWLGPQGSDCGLINVNIPTNG 344
            P+V +  +D  +EAI   N    GL + IFT+D+   FK    +  + G + VN  +  
Sbjct: 361 GPVVPIIRYDDFEEAIRIANSTDYGLQAGIFTRDLNVAFK--AAEKLEVGGVMVNDSSAF 418

Query: 345 AEIGGAFGGEKYTGGGRESGSDSWKQYCRRSTVTI 379
                 FGG K +G GRE    + ++     TV I
Sbjct: 419 RTDWMPFGGVKESGVGREGVPYAMEEMTEEKTVVI 453


>gnl|CDD|143467 cd07149, ALDH_y4uC, Uncharacterized ALDH (y4uC) with similarity to
           Tortula ruralis aldehyde dehydrogenase ALDH21A1.
           Uncharacterized aldehyde dehydrogenase (ORF name y4uC)
           with sequence similarity to the moss Tortula ruralis
           aldehyde dehydrogenase ALDH21A1 (RNP123) believed to
           play an important role in the detoxification of
           aldehydes generated in response to desiccation- and
           salinity-stress, and similar sequences are included in
           this CD.
          Length = 453

 Score =  100 bits (252), Expect = 2e-23
 Identities = 64/226 (28%), Positives = 94/226 (41%), Gaps = 62/226 (27%)

Query: 196 VGVAVQSRFG--KLLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLH 253
           VG A+  + G  K+ LELG N A+IV+ DADL  A +  V      AGQ C + +R+F+H
Sbjct: 214 VGEAIARKAGLKKVTLELGSNAAVIVDADADLEKAVERCVSGAFANAGQVCISVQRIFVH 273

Query: 254 KKKIDR-------------------PGYFVEP----------------------TIVTG- 271
           +   D                        V P                       ++TG 
Sbjct: 274 EDIYDEFLERFVAATKKLVVGDPLDEDTDVGPMISEAEAERIEEWVEEAVEGGARLLTGG 333

Query: 272 --------------LPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTK 317
                         +P + +VV  E FAP+V + PFD+LDEAI   N+   GL + +FT 
Sbjct: 334 KRDGAILEPTVLTDVPPDMKVVCEEVFAPVVSLNPFDTLDEAIAMANDSPYGLQAGVFTN 393

Query: 318 DVTNLFKWLGPQGSDCGLINVN-IPTNGAEIGGAFGGEKYTGGGRE 362
           D+    K    +  + G + +N   T   +    +GG K +G GRE
Sbjct: 394 DLQKALKAA--RELEVGGVMINDSSTFRVD-HMPYGGVKESGTGRE 436



 Score = 78.4 bits (194), Expect = 7e-16
 Identities = 37/140 (26%), Positives = 64/140 (45%), Gaps = 10/140 (7%)

Query: 52  IPSINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDALRHKL 111
           I  I+P  GE I  V   + +D  + I +++       +LPA  R EI+ +    L  + 
Sbjct: 1   IEVISPYDGEVIGRVPVASEEDVEKAIAAAKEGAKEMKSLPAYERAEILERAAQLLEERR 60

Query: 112 VPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSILPSE-------RPGHVL 164
               + ++LE GK + +   EV   I+    +   ++  +G  +P +       R G  +
Sbjct: 61  EEFARTIALEAGKPIKDARKEVDRAIETLRLSAEEAKRLAGETIPFDASPGGEGRIGFTI 120

Query: 165 LENWNPLGVVGIISAFNFPV 184
            E   P+GVV  I+ FNFP+
Sbjct: 121 RE---PIGVVAAITPFNFPL 137


>gnl|CDD|200131 TIGR01804, BADH, glycine betaine aldehyde dehydrogenase.  Under
           osmotic stress, betaine aldehyde dehydrogenase oxidizes
           glycine betaine aldehyde into the osmoprotectant glycine
           betaine, via the second of two oxidation steps from
           exogenously supplied choline or betaine aldehyde. This
           choline-glycine betaine synthesis pathway can be found
           in gram-positive and gram-negative bacteria. In
           Escherichia coli, betaine aldehyde dehydrogenase (betB)
           is osmotically co-induced with choline dehydrogenase
           (betA) in the presence of choline. These dehydrogenases
           are located in a betaine gene cluster with the upstream
           choline transporter (betT) and transcriptional regulator
           (betI). Similar to E.coli, betaine synthesis in
           Staphylococcus xylosus is also influenced by osmotic
           stress and the presence of choline with genes localized
           in a functionally equivalent gene cluster. Organization
           of the betaine gene cluster in Sinorhizobium meliloti
           and Bacillus subtilis differs from that of E.coli by the
           absence of upstream choline transporter and
           transcriptional regulator homologues. Additionally,
           B.subtilis co-expresses a type II alcohol dehydrogenase
           with betaine aldehyde dehydrogenase instead of choline
           dehydrogenase as in E.coli, St.xylosus, and Si.meliloti.
           Betaine aldehyde dehydrogenase is a member of the
           aldehyde dehydrogenase family (pfam00171) [Cellular
           processes, Adaptations to atypical conditions].
          Length = 467

 Score = 99.1 bits (247), Expect = 9e-23
 Identities = 50/151 (33%), Positives = 76/151 (50%), Gaps = 2/151 (1%)

Query: 47  ANGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDA 106
           + G     INP+ GE IA+V     +D  R I ++R A   WAA+    RG I+R+  D 
Sbjct: 10  SAGTTREIINPANGEVIATVHAATPEDVERAIAAARRAQGEWAAMSPMERGRILRRAADL 69

Query: 107 LRHKLVPLGQLVSLEMGKILAEGI-GEVQEFIDICDYAVGLSRTYSGSILPSERPGHVLL 165
           +R +   L +L +L+ GK L E I  ++    D+ ++  GL+   +G I+P   P     
Sbjct: 70  IRERNEELAKLETLDTGKTLQETIVADMDSGADVFEFFAGLAPALNGEIIPLGGPSFAYT 129

Query: 166 ENWNPLGVVGIISAFNFPVAVYGWNAAIALV 196
               PLGV   I A+N+P+ +  W  A AL 
Sbjct: 130 IR-EPLGVCVGIGAWNYPLQIASWKIAPALA 159



 Score = 65.6 bits (160), Expect = 1e-11
 Identities = 57/229 (24%), Positives = 84/229 (36%), Gaps = 66/229 (28%)

Query: 206 KLLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTT------------------- 246
            + +ELGG + +IV +DADL  A    +     +AGQ C+                    
Sbjct: 238 HVTMELGGKSPLIVFDDADLESAVDGAMLGNFFSAGQVCSNGTRVFVHKKIKERFLARLV 297

Query: 247 --TRRLFL-----------------HKKKI------------------DRP-------GY 262
             T R+ L                 H+ K+                   RP       G+
Sbjct: 298 ERTERIKLGDPFDEATEMGPLISAAHRDKVLSYIEKGKAEGATLATGGGRPENVGLQNGF 357

Query: 263 FVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVTNL 322
           FVEPT+      +  +V  E F P++ V  F   DE I   N+ + GL+  +FT D+   
Sbjct: 358 FVEPTVFADCTDDMTIVREEIFGPVMTVLTFSDEDEVIARANDTEYGLAGGVFTADLGRA 417

Query: 323 FKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRESGSDSWKQY 371
            +      +    IN       AE    FGG K +G GRE+G  +   Y
Sbjct: 418 HRVADQLEAGTVWINTYNLYP-AEA--PFGGYKQSGIGRENGKAALAHY 463


>gnl|CDD|143426 cd07108, ALDH_MGR_2402, Magnetospirillum NAD(P)+-dependent aldehyde
           dehydrogenase MSR-1-like.  NAD(P)+-dependent aldehyde
           dehydrogenase of Magnetospirillum gryphiswaldense MSR-1
           (MGR_2402) , and other similar sequences, are present in
           this CD.
          Length = 457

 Score = 97.4 bits (243), Expect = 4e-22
 Identities = 50/143 (34%), Positives = 73/143 (51%), Gaps = 2/143 (1%)

Query: 55  INPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDALRHKLVPL 114
           INP+TG+ I  V +    D  R + +++AA+  WAA PA  RG+++ +I DAL  +   L
Sbjct: 2   INPATGQVIGEVPRSRAADVDRAVAAAKAAFPEWAATPARERGKLLARIADALEARSEEL 61

Query: 115 GQLVSLEMGKIL-AEGIGEVQEFIDICDYAVGLSRTYSGSILPSERPGHVLLENWNPLGV 173
            +L++LE G  L  +   E     D+  Y  GL+    G  LP   P  +      PLGV
Sbjct: 62  ARLLALETGNALRTQARPEAAVLADLFRYFGGLAGELKGETLPF-GPDVLTYTVREPLGV 120

Query: 174 VGIISAFNFPVAVYGWNAAIALV 196
           VG I  +N P+ +     A ALV
Sbjct: 121 VGAILPWNAPLMLAALKIAPALV 143



 Score = 74.3 bits (183), Expect = 1e-14
 Identities = 35/121 (28%), Positives = 64/121 (52%), Gaps = 4/121 (3%)

Query: 260 PGYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDV 319
            G+FV+PTI +G+ +  R+   E F P++   P+   DE I   N+   GL++ ++T+D+
Sbjct: 340 DGFFVQPTIFSGVDNEWRLAREEIFGPVLCAIPWKDEDEVIAMANDSHYGLAAYVWTRDL 399

Query: 320 TNLFKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRESGSDSW-KQYCRRSTVT 378
               +       + G + VN    G + G ++GG K +G GRE+  +   + + ++ TV 
Sbjct: 400 GRALR--AAHALEAGWVQVN-QGGGQQPGQSYGGFKQSGLGREASLEGMLEHFTQKKTVN 456

Query: 379 I 379
           I
Sbjct: 457 I 457



 Score = 40.8 bits (96), Expect = 9e-04
 Identities = 22/51 (43%), Positives = 28/51 (54%), Gaps = 1/51 (1%)

Query: 209 LELGGNNAIIVNEDADLNLAQQCLVFACCGT-AGQRCTTTRRLFLHKKKID 258
           LELGG + +IV  DADL+ A    +     T  GQ CT   RLF+H+   D
Sbjct: 224 LELGGKSPMIVFPDADLDDAVDGAIAGMRFTRQGQSCTAGSRLFVHEDIYD 274


>gnl|CDD|143442 cd07124, ALDH_PutA-P5CDH-RocA, Delta(1)-pyrroline-5-carboxylate
           dehydrogenase, RocA.  Delta(1)-pyrroline-5-carboxylate
           dehydrogenase (EC=1.5.1.12 ), RocA: a proline catabolic
           enzyme of the aldehyde dehydrogenase (ALDH) protein
           superfamily. The proline catabolic enzymes, proline
           dehydrogenase and Delta(1)-pyrroline-5-carboxylate
           dehydrogenase (P5CDH), catalyze the two-step oxidation
           of proline to glutamate; P5CDH catalyzes the oxidation
           of glutamate semialdehyde, utilizing NAD+ as the
           electron acceptor. In some bacteria, the two enzymes are
           fused into the bifunctional flavoenzyme, proline
           utilization A (PutA). In this CD, monofunctional enzyme
           sequences such as seen in the Bacillus subtilis RocA
           P5CDH are also present. These enzymes play important
           roles in cellular redox control, superoxide generation,
           and apoptosis.
          Length = 512

 Score = 96.9 bits (242), Expect = 7e-22
 Identities = 42/149 (28%), Positives = 67/149 (44%), Gaps = 2/149 (1%)

Query: 49  GEIIPSINPS-TGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDAL 107
            E I S NP+   E + +VQ+   ++    ++++RAA+  W   P   R  ++ +    L
Sbjct: 45  EEKIESRNPADPSEVLGTVQKATKEEAEAAVQAARAAFPTWRRTPPEERARLLLRAAALL 104

Query: 108 RHKLVPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSILPSERPGHVLLEN 167
           R +   L   + LE+GK  AE   +V E ID  +Y         G  +            
Sbjct: 105 RRRRFELAAWMVLEVGKNWAEADADVAEAIDFLEYYAREMLRLRGFPVEMVPGEDNRYV- 163

Query: 168 WNPLGVVGIISAFNFPVAVYGWNAAIALV 196
           + PLGV  +IS +NFP+A+       ALV
Sbjct: 164 YRPLGVGAVISPWNFPLAILAGMTTAALV 192



 Score = 81.5 bits (202), Expect = 9e-17
 Identities = 58/244 (23%), Positives = 96/244 (39%), Gaps = 68/244 (27%)

Query: 201 QSRFGKLLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLHKKKIDR- 259
           Q    +++ E+GG NAIIV+EDADL+ A + +V +  G  GQ+C+   R+ +H+   D  
Sbjct: 272 QKWLKRVIAEMGGKNAIIVDEDADLDEAAEGIVRSAFGFQGQKCSACSRVIVHESVYDEF 331

Query: 260 ------------------PGYFVEPTI--------------------------------- 268
                             P  ++ P I                                 
Sbjct: 332 LERLVERTKALKVGDPEDPEVYMGPVIDKGARDRIRRYIEIGKSEGRLLLGGEVLELAAE 391

Query: 269 --------VTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVT 320
                      +P + R+   E F P++ V      DEA+   N+ + GL+  +F++   
Sbjct: 392 GYFVQPTIFADVPPDHRLAQEEIFGPVLAVIKAKDFDEALEIANDTEYGLTGGVFSRSPE 451

Query: 321 NL--FKWLGPQGSDCGLINVNIPTNGAEIG-GAFGGEKYTG-GGRESGSDSWKQYCRRST 376
           +L   +    +  + G +  N    GA +G   FGG K +G G +  G D   Q+ +  T
Sbjct: 452 HLERAR----REFEVGNLYANRKITGALVGRQPFGGFKMSGTGSKAGGPDYLLQFMQPKT 507

Query: 377 VTIN 380
           VT N
Sbjct: 508 VTEN 511


>gnl|CDD|143432 cd07114, ALDH_DhaS, Uncharacterized Candidatus pelagibacter
           aldehyde dehydrogenase, DhaS-like.  Uncharacterized
           aldehyde dehydrogenase from Candidatus pelagibacter
           (DhaS) and other related sequences are present in this
           CD.
          Length = 457

 Score = 95.3 bits (238), Expect = 2e-21
 Identities = 43/144 (29%), Positives = 76/144 (52%), Gaps = 2/144 (1%)

Query: 54  SINPSTGETIASVQQGNVQDYHRCIESSRAAY--AAWAALPAPRRGEIVRQIGDALRHKL 111
           SINP+TGE  A V + +  D  R + ++RAA+   AW  L    RG+++R++ D +    
Sbjct: 1   SINPATGEPWARVPEASAADVDRAVAAARAAFEGGAWRKLTPTERGKLLRRLADLIEANA 60

Query: 112 VPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSILPSERPGHVLLENWNPL 171
             L +L + + GK++ E   +V+   +   Y  GL+    G+++P ++  ++      PL
Sbjct: 61  EELAELETRDNGKLIRETRAQVRYLAEWYRYYAGLADKIEGAVIPVDKGDYLNFTRREPL 120

Query: 172 GVVGIISAFNFPVAVYGWNAAIAL 195
           GVV  I+ +N P+ +     A AL
Sbjct: 121 GVVAAITPWNSPLLLLAKKLAPAL 144



 Score = 81.8 bits (203), Expect = 5e-17
 Identities = 41/120 (34%), Positives = 68/120 (56%), Gaps = 5/120 (4%)

Query: 261 GYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVT 320
           GYF EPTI+  + ++ R+   E F P++ V PFD  +EAI   N+ + GL++ I+T+D+ 
Sbjct: 342 GYFFEPTILADVTNDMRIAQEEVFGPVLSVIPFDDEEEAIALANDSEYGLAAGIWTRDLA 401

Query: 321 NLFKWLGPQGSDCGLINVNIPTNGA-EIGGAFGGEKYTGGGRESGSDSWKQYCRRSTVTI 379
              +    +  + G + VN  T  A      FGG K +G GRE+G ++ ++Y +  +V I
Sbjct: 402 RAHRV--ARAIEAGTVWVN--TYRALSPSSPFGGFKDSGIGRENGIEAIREYTQTKSVWI 457



 Score = 52.2 bits (126), Expect = 3e-07
 Identities = 29/101 (28%), Positives = 39/101 (38%), Gaps = 9/101 (8%)

Query: 184 VAVYGWNAAIALVGVAVQSRFGKLLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQR 243
           +A  G       +  A       + LELGG +  IV +DADL+ A   +V      AGQ 
Sbjct: 202 IAFTGGTETGRHIARAAAENLAPVTLELGGKSPNIVFDDADLDAAVNGVVAGIFAAAGQT 261

Query: 244 CTTTRRLFLHKKKIDRPGYFVE------PTIVTGLPHNSRV 278
           C    RL + +   D    FVE        I  G P +   
Sbjct: 262 CVAGSRLLVQRSIYDE---FVERLVARARAIRVGDPLDPET 299


>gnl|CDD|143418 cd07100, ALDH_SSADH1_GabD1, Mycobacterium tuberculosis
           succinate-semialdehyde dehydrogenase 1-like.
           Succinate-semialdehyde dehydrogenase 1 (SSADH1, GabD1,
           EC=1.2.1.16) catalyzes the NADP(+)-dependent oxidation
           of succinate semialdehyde (SSA)  to succinate.  SSADH
           activity in Mycobacterium tuberculosis (Mtb) is encoded
           by both gabD1 (Rv0234c) and gabD2 (Rv1731).  The Mtb
           GabD1 SSADH1 reportedly is an enzyme of the
           gamma-aminobutyrate shunt, which forms a functional link
           between two TCA half-cycles by converting
           alpha-ketoglutarate to succinate.
          Length = 429

 Score = 94.4 bits (236), Expect = 3e-21
 Identities = 59/236 (25%), Positives = 83/236 (35%), Gaps = 80/236 (33%)

Query: 197 GVAVQSRFGKLL----LELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRL-- 250
           G AV +  GK L    LELGG++  IV +DADL+ A +  V      AGQ C   +R   
Sbjct: 187 GRAVAAEAGKNLKKSVLELGGSDPFIVLDDADLDKAVKTAVKGRLQNAGQSCIAAKRFIV 246

Query: 251 -------FLHK------------------------------------------------- 254
                  FL K                                                 
Sbjct: 247 HEDVYDEFLEKFVEAMAALKVGDPMDEDTDLGPLARKDLRDELHEQVEEAVAAGATLLLG 306

Query: 255 -KKIDRPGYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSS 313
            K+ D PG F  PT++T +         E F P+  V      +EAI   N+   GL  S
Sbjct: 307 GKRPDGPGAFYPPTVLTDVTPGMPAYDEELFGPVAAVIKVKDEEEAIALANDSPFGLGGS 366

Query: 314 IFTKDVTNLFKWLGPQGS------DCGLINVNIPT-NGAEIGGAFGGEKYTGGGRE 362
           +FT D+         +        + G++ +N    +   +   FGG K +G GRE
Sbjct: 367 VFTTDLE--------RAERVARRLEAGMVFINGMVKSDPRL--PFGGVKRSGYGRE 412



 Score = 69.0 bits (170), Expect = 8e-13
 Identities = 36/111 (32%), Positives = 54/111 (48%), Gaps = 11/111 (9%)

Query: 78  IESSRAAYAAWAALPAPRRGEIVRQIGDALRHKLVPLGQLVSLEMGKILAEGIGEVQEFI 137
           ++ + AA+ AW       R  ++R++ D LR +   L +L++LEMGK +AE   EV++  
Sbjct: 5   LDRAHAAFLAWRKTSFAERAALLRKLADLLRERKDELARLITLEMGKPIAEARAEVEKCA 64

Query: 138 DICDY-----AVGLSRTYSGSILPSERPGHVLLENWNPLGVVGIISAFNFP 183
            IC Y        L+      I       +V  E   PLGVV  I  +NFP
Sbjct: 65  WICRYYAENAEAFLADEP---IETDAGKAYVRYE---PLGVVLGIMPWNFP 109


>gnl|CDD|143468 cd07150, ALDH_VaniDH_like, Pseudomonas putida vanillin
           dehydrogenase-like.  Vanillin dehydrogenase (Vdh,
           VaniDH) involved in the metabolism of ferulic acid and
           other related  sequences are included in this CD.  The
           E. coli vanillin dehydrogenase (LigV) preferred NAD+ to
           NADP+  and exhibited a broad substrate preference,
           including vanillin,  benzaldehyde, protocatechualdehyde,
           m-anisaldehyde, and p-hydroxybenzaldehyde.
          Length = 451

 Score = 94.7 bits (236), Expect = 3e-21
 Identities = 60/232 (25%), Positives = 90/232 (38%), Gaps = 62/232 (26%)

Query: 196 VGVAVQSRFGKLL----LELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLF 251
           VG  +  + G+ L    LELGG N +IV  DADL+ A +   F      GQ C +  R+ 
Sbjct: 210 VGREIAEKAGRHLKKITLELGGKNPLIVLADADLDYAVRAAAFGAFMHQGQICMSASRII 269

Query: 252 LHKKKIDR-------------------PGYFVEPTI------------------------ 268
           + +   D                    P   + P I                        
Sbjct: 270 VEEPVYDEFVKKFVARASKLKVGDPRDPDTVIGPLISPRQVERIKRQVEDAVAKGAKLLT 329

Query: 269 -------------VTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIF 315
                        +T +  + R+   ETF P+  V P    +EA+   N+ + GLS++I 
Sbjct: 330 GGKYDGNFYQPTVLTDVTPDMRIFREETFGPVTSVIPAKDAEEALELANDTEYGLSAAIL 389

Query: 316 TKDVTNLFKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRESGSDS 367
           T D+   FK    +  + G++++N PT   E    FGG K +G GRE G  S
Sbjct: 390 TNDLQRAFKL--AERLESGMVHINDPTILDEAHVPFGGVKASGFGREGGEWS 439



 Score = 87.4 bits (217), Expect = 8e-19
 Identities = 41/132 (31%), Positives = 61/132 (46%)

Query: 53  PSINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDALRHKLV 112
             +NP+ G   A V  G+ QD  R I ++  A+ AWAA     R  I+ +  + +  +  
Sbjct: 2   DDLNPADGSVYARVAVGSRQDAERAIAAAYDAFPAWAATTPSERERILLKAAEIMERRAD 61

Query: 113 PLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSILPSERPGHVLLENWNPLG 172
            L  L+  E G    +   E     ++   A G  R   G  LPS+ PG V +    PLG
Sbjct: 62  DLIDLLIDEGGSTYGKAWFETTFTPELLRAAAGECRRVRGETLPSDSPGTVSMSVRRPLG 121

Query: 173 VVGIISAFNFPV 184
           VV  I+ FN+P+
Sbjct: 122 VVAGITPFNYPL 133


>gnl|CDD|143424 cd07106, ALDH_AldA-AAD23400, Streptomyces aureofaciens putative
           aldehyde dehydrogenase AldA (AAD23400)-like.  Putative
           aldehyde dehydrogenase, AldA, from Streptomyces
           aureofaciens (locus AAD23400) and other similar
           sequences are present in this CD.
          Length = 446

 Score = 93.7 bits (234), Expect = 5e-21
 Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 7/124 (5%)

Query: 258 DRPGYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTK 317
           D PGYF+ PTIV   P  SR+V  E F P++ V  +   DE I   N+ + GL +S+++ 
Sbjct: 328 DGPGYFIPPTIVDDPPEGSRIVDEEQFGPVLPVLKYSDEDEVIARANDSEYGLGASVWSS 387

Query: 318 DVTNLFKWLGPQGSDCGLINVNIPTNGAEIGGA--FGGEKYTGGGRESGSDSWKQYCRRS 375
           D+      +  +  + G + +N       +     FGG K +G G E G +  K+Y +  
Sbjct: 388 DLERAEA-VARR-LEAGTVWIN---THGALDPDAPFGGHKQSGIGVEFGIEGLKEYTQTQ 442

Query: 376 TVTI 379
            + I
Sbjct: 443 VINI 446



 Score = 93.4 bits (233), Expect = 8e-21
 Identities = 47/146 (32%), Positives = 67/146 (45%), Gaps = 11/146 (7%)

Query: 55  INPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDALRHKLVPL 114
           INP+TGE  AS    +     + + +++AA+  W+A P   R   +  I DA+      L
Sbjct: 2   INPATGEVFASAPVASEAQLDQAVAAAKAAFPGWSATPLEERRAALLAIADAIEANAEEL 61

Query: 115 GQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSILPSER----PGHVLLENWNP 170
            +L++LE GK LAE   EV         AV   R  +   LP E         +     P
Sbjct: 62  ARLLTLEQGKPLAEAQFEV-------GGAVAWLRYTASLDLPDEVIEDDDTRRVELRRKP 114

Query: 171 LGVVGIISAFNFPVAVYGWNAAIALV 196
           LGVV  I  +NFP+ +  W  A AL+
Sbjct: 115 LGVVAAIVPWNFPLLLAAWKIAPALL 140


>gnl|CDD|215157 PLN02278, PLN02278, succinic semialdehyde dehydrogenase.
          Length = 498

 Score = 94.0 bits (234), Expect = 6e-21
 Identities = 58/159 (36%), Positives = 83/159 (52%), Gaps = 8/159 (5%)

Query: 42  GTSWKA--NGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEI 99
           G  W    +G+  P  NP+TGE IA+V      + +  I S+  A+ +W+ L A  R +I
Sbjct: 30  GGKWTDAYDGKTFPVYNPATGEVIANVPCMGRAETNDAIASAHDAFPSWSKLTASERSKI 89

Query: 100 VRQIGDALRHKLVPLGQLVSLEMGKILAEGIGEV---QEFIDICDYAVGLSRTYSGSILP 156
           +R+  D +      L QL++LE GK L E IGEV     F++   +A    R Y G I+P
Sbjct: 90  LRRWYDLIIANKEDLAQLMTLEQGKPLKEAIGEVAYGASFLEY--FAEEAKRVY-GDIIP 146

Query: 157 SERPGHVLLENWNPLGVVGIISAFNFPVAVYGWNAAIAL 195
           S  P   LL    P+GVVG I+ +NFP+A+       AL
Sbjct: 147 SPFPDRRLLVLKQPVGVVGAITPWNFPLAMITRKVGPAL 185



 Score = 69.7 bits (171), Expect = 6e-13
 Identities = 53/228 (23%), Positives = 84/228 (36%), Gaps = 68/228 (29%)

Query: 206 KLLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLHKKKID------- 258
           ++ LELGGN   IV +DADL++A +  + +    +GQ C    R+ + +   D       
Sbjct: 265 RVSLELGGNAPFIVFDDADLDVAVKGALASKFRNSGQTCVCANRILVQEGIYDKFAEAFS 324

Query: 259 ------------RPGYFVEPTI-----------------------VTGLPHN-------- 275
                         G    P I                       + G  H+        
Sbjct: 325 KAVQKLVVGDGFEEGVTQGPLINEAAVQKVESHVQDAVSKGAKVLLGGKRHSLGGTFYEP 384

Query: 276 ---------SRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVTNLFKWL 326
                      +   E F P+  +  F + +EAI   N+ + GL++ IFT+D+     W 
Sbjct: 385 TVLGDVTEDMLIFREEVFGPVAPLTRFKTEEEAIAIANDTEAGLAAYIFTRDLQRA--WR 442

Query: 327 GPQGSDCGLINVN---IPTNGAEIGGAFGGEKYTGGGRESGSDSWKQY 371
             +  + G++ VN   I T  A     FGG K +G GRE       +Y
Sbjct: 443 VSEALEYGIVGVNEGLISTEVA----PFGGVKQSGLGREGSKYGIDEY 486


>gnl|CDD|143433 cd07115, ALDH_HMSADH_HapE, Pseudomonas fluorescens 4-hydroxymuconic
           semialdehyde dehydrogenase-like.  4-hydroxymuconic
           semialdehyde dehydrogenase (HapE, EC=1.2.1.61) of
           Pseudomonas fluorescens ACB involved in
           4-hydroxyacetophenone degradation, and putative
           hydroxycaproate semialdehyde dehydrogenase (ChnE) of
           Brachymonas petroleovorans involved in cyclohexane
           metabolism, and other similar sequences, are present in
           this CD.
          Length = 453

 Score = 91.7 bits (228), Expect = 3e-20
 Identities = 52/174 (29%), Positives = 82/174 (47%), Gaps = 18/174 (10%)

Query: 54  SINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDALRHKLVP 113
           ++NP+TGE IA V Q + +D    + ++RAA+ AW+A+    RG I+ ++ + +      
Sbjct: 1   TLNPATGELIARVAQASAEDVDAAVAAARAAFEAWSAMDPAERGRILWRLAELILANADE 60

Query: 114 LGQLVSLEMGKILAEG-IGEVQEFIDICDYAVGLSRTYSGSILPSERP--GHVLLENWNP 170
           L +L SL+ GK +      +V    D   Y  G +    G ++P   P   + + E   P
Sbjct: 61  LARLESLDTGKPIRAARRLDVPRAADTFRYYAGWADKIEGEVIPVRGPFLNYTVRE---P 117

Query: 171 LGVVGIISAFNFPVAVYGWNAAIAL-VGVAV-----------QSRFGKLLLELG 212
           +GVVG I  +NFP+    W  A AL  G  V             R  +L+ E G
Sbjct: 118 VGVVGAIVPWNFPLMFAAWKVAPALAAGNTVVLKPAELTPLSALRIAELMAEAG 171



 Score = 77.1 bits (190), Expect = 2e-15
 Identities = 39/126 (30%), Positives = 63/126 (50%), Gaps = 3/126 (2%)

Query: 255 KKIDRPGYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSI 314
           K+    G+FVEPTI   +P   R+   E F P+V V  F   +EA+   N  + GL++ +
Sbjct: 330 KRPGARGFFVEPTIFAAVPPEMRIAQEEIFGPVVSVMRFRDEEEALRIANGTEYGLAAGV 389

Query: 315 FTKDVTNLFKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRESGSDSWKQYCRR 374
           +T+D+    +         G + +N   N  + G  FGG K +G GRE G ++  +Y   
Sbjct: 390 WTRDLGRAHRV--AAALKAGTVWINT-YNRFDPGSPFGGYKQSGFGREMGREALDEYTEV 446

Query: 375 STVTIN 380
            +V +N
Sbjct: 447 KSVWVN 452


>gnl|CDD|143465 cd07147, ALDH_F21_RNP123, Aldehyde dehydrogenase family 21A1-like. 
           Aldehyde dehydrogenase ALDH21A1 (gene name RNP123) was
           first described in the moss Tortula ruralis and is
           believed to play an important role in the detoxification
           of aldehydes generated in response to desiccation- and
           salinity-stress, and ALDH21A1 expression represents a
           unique stress tolerance mechanism. So far, of plants,
           only the bryophyte sequence has been observed, but
           similar protein sequences from bacteria and archaea are
           also present in this CD.
          Length = 452

 Score = 90.8 bits (226), Expect = 6e-20
 Identities = 62/226 (27%), Positives = 96/226 (42%), Gaps = 62/226 (27%)

Query: 196 VGVAVQSRFGK--LLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLH 253
           VG  +++R GK  ++LELGGN A+IV+ DADL+ A Q ++F     AGQ C + +R+ +H
Sbjct: 213 VGWDLKARAGKKKVVLELGGNAAVIVDSDADLDFAAQRIIFGAFYQAGQSCISVQRVLVH 272

Query: 254 KKKIDR-------------------PGYFVEP---------------------------- 266
           +   D                        V P                            
Sbjct: 273 RSVYDEFKSRLVARVKALKTGDPKDDATDVGPMISESEAERVEGWVNEAVDAGAKLLTGG 332

Query: 267 ---------TIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTK 317
                    TI+  +P +  V   E F P+V V P+D  DEA+   N+ K GL + +FT+
Sbjct: 333 KRDGALLEPTILEDVPPDMEVNCEEVFGPVVTVEPYDDFDEALAAVNDSKFGLQAGVFTR 392

Query: 318 DVTNLFKWLGPQGSDCGLINVN-IPTNGAEIGGAFGGEKYTGGGRE 362
           D+    +       + G + +N +PT   +    +GG K +G GRE
Sbjct: 393 DLEKALRAWD--ELEVGGVVINDVPTFRVD-HMPYGGVKDSGIGRE 435



 Score = 76.1 bits (188), Expect = 5e-15
 Identities = 44/136 (32%), Positives = 59/136 (43%), Gaps = 4/136 (2%)

Query: 52  IPSINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDALRHKL 111
           +   NP TGE +A V      D    I ++  A+    ALPA RR  I+      L  + 
Sbjct: 1   LEVTNPYTGEVVARVALAGPDDIEEAIAAAVKAFRPMRALPAHRRAAILLHCVARLEERF 60

Query: 112 VPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSILP---SER-PGHVLLEN 167
             L + + LE GK + +  GEV   ID    A   +    G +LP   S R  G   L  
Sbjct: 61  EELAETIVLEAGKPIKDARGEVARAIDTFRIAAEEATRIYGEVLPLDISARGEGRQGLVR 120

Query: 168 WNPLGVVGIISAFNFP 183
             P+G V  I+ FNFP
Sbjct: 121 RFPIGPVSAITPFNFP 136


>gnl|CDD|143414 cd07095, ALDH_SGSD_AstD, N-succinylglutamate 5-semialdehyde
           dehydrogenase, AstD-like.  N-succinylglutamate
           5-semialdehyde dehydrogenase or succinylglutamic
           semialdehyde dehydrogenase (SGSD, E. coli AstD,
           EC=1.2.1.71) involved in L-arginine degradation via the
           arginine succinyltransferase (AST) pathway and catalyzes
           the NAD+-dependent reduction of succinylglutamate
           semialdehyde into succinylglutamate.
          Length = 431

 Score = 90.4 bits (225), Expect = 7e-20
 Identities = 56/222 (25%), Positives = 82/222 (36%), Gaps = 62/222 (27%)

Query: 203 RFGKLL-LELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLHK------- 254
           R GK+L LE+GGNN ++V + AD++ A   +V +   TAGQRCT  RRL +         
Sbjct: 198 RPGKILALEMGGNNPLVVWDVADIDAAAYLIVQSAFLTAGQRCTCARRLIVPDGAVGDAF 257

Query: 255 --------KKIDRPGYFVEPTIVTGLPHNSRVVHRETFA--------------------- 285
                   K++       EP  +  L   +                              
Sbjct: 258 LERLVEAAKRLRIGAPDAEPPFMGPLIIAAAAARYLLAQQDLLALGGEPLLAMERLVAGT 317

Query: 286 -----------------------PIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVTNL 322
                                  P++ V+ +D  DEAI   N  + GLS+ + + D   L
Sbjct: 318 AFLSPGIIDVTDAADVPDEEIFGPLLQVYRYDDFDEAIALANATRFGLSAGLLSDD-EAL 376

Query: 323 FKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRESG 364
           F+         G++N N PT GA     FGG   +G  R S 
Sbjct: 377 FERFL-ARIRAGIVNWNRPTTGASSTAPFGGVGLSGNHRPSA 417



 Score = 58.8 bits (143), Expect = 2e-09
 Identities = 30/106 (28%), Positives = 47/106 (44%), Gaps = 1/106 (0%)

Query: 78  IESSRAAYAAWAALPAPRRGEIVRQIGDALRHKLVPLGQLVSLEMGKILAEGIGEVQEFI 137
           + ++RAA+  WAAL    R  I+R+  + L+     L +L+S E GK L E   EV    
Sbjct: 6   VAAARAAFPGWAALSLEERAAILRRFAELLKANKEELARLISRETGKPLWEAQTEVAAMA 65

Query: 138 DICDYAVGLSRTYSGSILPSERPGHVLLENWNPLGVVGIISAFNFP 183
              D ++      +G    +       +    P GV+ +   FNFP
Sbjct: 66  GKIDISIKAYHERTGE-RATPMAQGRAVLRHRPHGVMAVFGPFNFP 110


>gnl|CDD|143437 cd07119, ALDH_BADH-GbsA, Bacillus subtilis NAD+-dependent betaine
           aldehyde dehydrogenase-like.  Included in this CD is the
           NAD+-dependent, betaine aldehyde dehydrogenase (BADH,
           GbsA, EC=1.2.1.8) of Bacillus subtilis involved in the
           synthesis of the osmoprotectant glycine betaine from
           choline or glycine betaine aldehyde.
          Length = 482

 Score = 88.5 bits (220), Expect = 4e-19
 Identities = 50/156 (32%), Positives = 80/156 (51%), Gaps = 11/156 (7%)

Query: 47  ANGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAYAA--WAALPAPRRGEIVRQIG 104
           A+G+    INP+ GE IA+V +G  +D  R I ++R A+ +  W  LPA  R  ++ +I 
Sbjct: 10  ASGKTRDIINPANGEVIATVPEGTAEDAKRAIAAARRAFDSGEWPHLPAQERAALLFRIA 69

Query: 105 DALRHKLVPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSILPSERPGHV- 163
           D +R     L +L +L  GK L E   ++ +  +   Y  GL+   +G +     P HV 
Sbjct: 70  DKIREDAEELARLETLNTGKTLRESEIDIDDVANCFRYYAGLATKETGEVYDV--PPHVI 127

Query: 164 ---LLENWNPLGVVGIISAFNFPVAVYGWNAAIALV 196
              + E   P+GV G+I+ +N+P+    W  A AL 
Sbjct: 128 SRTVRE---PVGVCGLITPWNYPLLQAAWKLAPALA 160



 Score = 62.7 bits (153), Expect = 1e-10
 Identities = 40/131 (30%), Positives = 59/131 (45%), Gaps = 25/131 (19%)

Query: 261 GYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVT 320
           GYFVEPTI   +    R+V  E F P++ V  FD+ +EAI   N+   GL+ +++TKD+ 
Sbjct: 357 GYFVEPTIFDDVDRTMRIVQEEIFGPVLTVERFDTEEEAIRLANDTPYGLAGAVWTKDIA 416

Query: 321 NLFK----------WLGPQGSDCGLIN-VNIPTNGAEIGGAFGGEKYTGGGRESGSDSWK 369
              +          W          IN  +     A     +GG K +G GRE G    +
Sbjct: 417 RANRVARRLRAGTVW----------INDYHPYFAEAP----WGGYKQSGIGRELGPTGLE 462

Query: 370 QYCRRSTVTIN 380
           +Y     + IN
Sbjct: 463 EYQETKHININ 473



 Score = 36.5 bits (85), Expect = 0.021
 Identities = 21/60 (35%), Positives = 26/60 (43%), Gaps = 4/60 (6%)

Query: 204 FGKLLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLF----LHKKKIDR 259
             K+ LELGG N  IV  DAD   A    +      AGQ C+   RL     +H K +  
Sbjct: 237 VKKVALELGGKNPNIVFADADFETAVDQALNGVFFNAGQVCSAGSRLLVEESIHDKFVAA 296


>gnl|CDD|143469 cd07151, ALDH_HBenzADH, NADP+-dependent p-hydroxybenzaldehyde
           dehydrogenase-like.  NADP+-dependent,
           p-hydroxybenzaldehyde dehydrogenase (PchA, HBenzADH)
           which catalyzes oxidation of p-hydroxybenzaldehyde to
           p-hydroxybenzoic acid and other related sequences are
           included in this CD.
          Length = 465

 Score = 88.1 bits (219), Expect = 5e-19
 Identities = 55/218 (25%), Positives = 88/218 (40%), Gaps = 60/218 (27%)

Query: 204 FGKLLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLH---------- 253
             K+ LELGGNN  +V EDAD++ A    VF      GQ C    R+ +H          
Sbjct: 234 LKKVALELGGNNPFVVLEDADIDAAVNAAVFGKFLHQGQICMAINRIIVHEDVYDEFVEK 293

Query: 254 --------------------------------KKKIDRP--------------GYFVEPT 267
                                             KI++               G  +EPT
Sbjct: 294 FVERVKALPYGDPSDPDTVVGPLINESQVDGLLDKIEQAVEEGATLLVGGEAEGNVLEPT 353

Query: 268 IVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVTNLFKWLG 327
           +++ + ++  +   E F P+  +   D  +EA+   N+ + GLS ++FT D+    ++  
Sbjct: 354 VLSDVTNDMEIAREEIFGPVAPIIKADDEEEALELANDTEYGLSGAVFTSDLERGVQF-- 411

Query: 328 PQGSDCGLINVN-IPTNGAEIGGAFGGEKYTGGGRESG 364
            +  D G+ ++N  P N  E    FGGEK +G GR +G
Sbjct: 412 ARRIDAGMTHINDQPVND-EPHVPFGGEKNSGLGRFNG 448



 Score = 78.1 bits (193), Expect = 9e-16
 Identities = 47/146 (32%), Positives = 65/146 (44%), Gaps = 10/146 (6%)

Query: 41  DGTSWKANGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIV 100
           DGTS       I  +NP TGET+A +   + +D      ++ AA   WAA     R EI+
Sbjct: 5   DGTS----ERTIDVLNPYTGETLAEIPAASKEDVDEAYRAAAAAQKEWAATLPQERAEIL 60

Query: 101 RQIGDALRHKLVPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSILPSERP 160
            +    L  +   + + +  E G    +   E    + I   A        G ILPS+ P
Sbjct: 61  EKAAQILEERRDEIVEWLIRESGSTRIKANIEWGAAMAITREAATFPLRMEGRILPSDVP 120

Query: 161 G---HVLLENWNPLGVVGIISAFNFP 183
           G    V  E   PLGVVG+IS +NFP
Sbjct: 121 GKENRVYRE---PLGVVGVISPWNFP 143


>gnl|CDD|143459 cd07141, ALDH_F1AB_F2_RALDH1, NAD+-dependent retinal dehydrogenase
           1, ALDH families 1A, 1B, and 2-like.  NAD+-dependent
           retinal dehydrogenase 1 (RALDH 1, ALDH1, EC=1.2.1.36)
           also known as aldehyde dehydrogenase family 1 member A1
           (ALDH1A1) in humans, is a homotetrameric, cytosolic
           enzyme that catalyzes the oxidation of retinaldehyde to
           retinoic acid. Human ALDH1B1 and ALDH2 are also in this
           cluster; both are mitochrondrial homotetramers which
           play important roles in acetaldehyde oxidation; ALDH1B1
           in response to UV light exposure and ALDH2 during
           ethanol metabolism.
          Length = 481

 Score = 87.8 bits (218), Expect = 7e-19
 Identities = 104/471 (22%), Positives = 166/471 (35%), Gaps = 151/471 (32%)

Query: 49  GEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAY---AAWAALPAPRRGEIVRQIGD 105
           G+  P+INP+TGE I  VQ+G+  D  + ++++RAA+   + W  + A  RG ++ ++ D
Sbjct: 21  GKTFPTINPATGEKICEVQEGDKADVDKAVKAARAAFKLGSPWRTMDASERGRLLNKLAD 80

Query: 106 ALRHKLVPLGQLVSLEMGKILAEG-IGEVQEFIDICDYAVGLSRTYSGSILPSERPGHVL 164
            +      L  L +L+ GK  ++  + ++   I +  Y  G +    G  +P +      
Sbjct: 81  LIERDRAYLASLETLDNGKPFSKSYLVDLPGAIKVLRYYAGWADKIHGKTIPMDGDFFTY 140

Query: 165 LEN-----------WN-------------------------------------------- 169
             +           WN                                            
Sbjct: 141 TRHEPVGVCGQIIPWNFPLLMAAWKLAPALACGNTVVLKPAEQTPLTALYLASLIKEAGF 200

Query: 170 PLGVVGIISAFNFPVAVYGWNAAIA--------------LVGVAVQSRFGK-----LLLE 210
           P GVV ++  +  P A     AAI+               VG  +Q   GK     + LE
Sbjct: 201 PPGVVNVVPGYG-PTA----GAAISSHPDIDKVAFTGSTEVGKLIQQAAGKSNLKRVTLE 255

Query: 211 LGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLHKKKIDRPGYFVEPTI-- 268
           LGG +  IV  DADL+ A +    A     GQ C    R F+ +   D    FV+ ++  
Sbjct: 256 LGGKSPNIVFADADLDYAVEQAHEALFFNMGQCCCAGSRTFVQESIYDE---FVKRSVER 312

Query: 269 ----VTGLPHNSRV-----VHRETFAPIV-------------------------YVFP-- 292
               V G P + +      +  E F  I+                         ++ P  
Sbjct: 313 AKKRVVGNPFDPKTEQGPQIDEEQFKKILELIESGKKEGAKLECGGKRHGDKGYFIQPTV 372

Query: 293 ------------------------FDSLDEAITWNNEVKQGLSSSIFTKDVTNLFKWLGP 328
                                   F ++DE I   N    GL++++FTKD+     +   
Sbjct: 373 FSDVTDDMRIAKEEIFGPVQQIFKFKTIDEVIERANNTTYGLAAAVFTKDIDKAITF--S 430

Query: 329 QGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRESGSDSWKQYCRRSTVTI 379
                G + VN   N       FGG K +G GRE G    ++Y    TVTI
Sbjct: 431 NALRAGTVWVNCY-NVVSPQAPFGGYKMSGNGRELGEYGLQEYTEVKTVTI 480


>gnl|CDD|143412 cd07093, ALDH_F8_HMSADH, Human aldehyde dehydrogenase family 8
           member A1-like.  In humans, the  aldehyde dehydrogenase
           family 8 member A1 (ALDH8A1) protein functions to
           convert 9-cis-retinal to 9-cis-retinoic acid and has a
           preference for NAD+. Also included in this CD is the
           2-hydroxymuconic semialdehyde dehydrogenase (HMSADH)
           which catalyzes the conversion of 2-hydroxymuconic
           semialdehyde to 4-oxalocrotonate, a step in the meta
           cleavage pathway of aromatic hydrocarbons in bacteria.
           Such HMSADHs seen here are: XylG of the TOL plasmid pWW0
           of Pseudomonas putida, TomC  of Burkholderia cepacia G4,
           and AphC of Comamonas testosterone.
          Length = 455

 Score = 87.6 bits (218), Expect = 7e-19
 Identities = 45/119 (37%), Positives = 66/119 (55%), Gaps = 3/119 (2%)

Query: 261 GYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVT 320
           GYFVEPT++TGL ++SRV   E F P+V V PFD  +EAI   N+   GL++ ++T+D+ 
Sbjct: 340 GYFVEPTVITGLDNDSRVAQEEIFGPVVTVIPFDDEEEAIELANDTPYGLAAYVWTRDLG 399

Query: 321 NLFKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRESGSDSWKQYCRRSTVTI 379
              +    +  + G + VN      ++   FGG K +G GRE G  S + Y     V I
Sbjct: 400 RAHRV--ARRLEAGTVWVNCW-LVRDLRTPFGGVKASGIGREGGDYSLEFYTELKNVCI 455



 Score = 77.2 bits (191), Expect = 2e-15
 Identities = 36/147 (24%), Positives = 64/147 (43%), Gaps = 10/147 (6%)

Query: 54  SINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDALRHKLVP 113
           + NP+TGE +A V +G   +    + +++ A+  W+ +    R  I+ ++ D +  +   
Sbjct: 1   NFNPATGEVLAKVPEGGAAEVDAAVAAAKEAFPGWSRMSPAERARILHKVADLIEARADE 60

Query: 114 LGQLVSLEMGKILAEGIGEV-----QEFIDICDYAVGLSRTYSGSILPSERPGHVLLENW 168
           L  L SL+ GK +              F    DY +       G   P +  G +     
Sbjct: 61  LALLESLDTGKPITLARTRDIPRAAANFRFFADYIL----QLDGESYPQD-GGALNYVLR 115

Query: 169 NPLGVVGIISAFNFPVAVYGWNAAIAL 195
            P+GV G+I+ +N P+ +  W  A AL
Sbjct: 116 QPVGVAGLITPWNLPLMLLTWKIAPAL 142



 Score = 52.6 bits (127), Expect = 2e-07
 Identities = 20/60 (33%), Positives = 28/60 (46%), Gaps = 3/60 (5%)

Query: 206 KLLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLHKKKIDRPGYFVE 265
            + LELGG N  IV  DADL+ A    V +     G+ C    R+ + +   D    F+E
Sbjct: 222 PVSLELGGKNPNIVFADADLDRAVDAAVRSSFSNNGEVCLAGSRILVQRSIYDE---FLE 278


>gnl|CDD|143425 cd07107, ALDH_PhdK-like, Nocardioides 2-carboxybenzaldehyde
           dehydrogenase, PhdK-like.  Nocardioides sp. strain
           KP72-carboxybenzaldehyde dehydrogenase (PhdK), an enzyme
           involved in phenanthrene degradation, and other similar
           sequences, are present in this CD.
          Length = 456

 Score = 87.4 bits (217), Expect = 9e-19
 Identities = 46/170 (27%), Positives = 73/170 (42%), Gaps = 13/170 (7%)

Query: 54  SINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDALRHKLVP 113
            INP+TG+ +A V   +  D  R + ++RAA+  W A     R  ++R++   LR     
Sbjct: 1   VINPATGQVLARVPAASAADVDRAVAAARAAFPEWRATTPLERARMLRELATRLREHAEE 60

Query: 114 LGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSILPSERPGHVLLENWNPLGV 173
           L  + +L+ G  ++  +G+V     + DY  GL     G  +P     ++      P GV
Sbjct: 61  LALIDALDCGNPVSAMLGDVMVAAALLDYFAGLVTELKGETIPVG-GRNLHYTLREPYGV 119

Query: 174 VGIISAFNFPVAVYGWNAAIALV-GVAV------QS-----RFGKLLLEL 211
           V  I AFN P+       A  L  G  V      Q+     R  +L  E+
Sbjct: 120 VARIVAFNHPLMFAAAKIAAPLAAGNTVVVKPPEQAPLSALRLAELAREV 169



 Score = 67.8 bits (166), Expect = 2e-12
 Identities = 31/120 (25%), Positives = 58/120 (48%), Gaps = 3/120 (2%)

Query: 261 GYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVT 320
           G++VEPT+   +    R+   E F P++ V  +    E +   N V+ GL+++I+T D++
Sbjct: 339 GFYVEPTVFADVTPGMRIAREEIFGPVLSVLRWRDEAEMVAQANGVEYGLTAAIWTNDIS 398

Query: 321 NLFKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRESGSDSWKQYCRRSTVTIN 380
                   +  + G + +N  ++   +G  FGG K +G GRE   +    Y +   V + 
Sbjct: 399 QAH--RTARRVEAGYVWIN-GSSRHFLGAPFGGVKNSGIGREECLEELLSYTQEKNVNVR 455


>gnl|CDD|143411 cd07092, ALDH_ABALDH-YdcW, Escherichia coli NAD+-dependent
           gamma-aminobutyraldehyde dehydrogenase YdcW-like.
           NAD+-dependent, tetrameric, gamma-aminobutyraldehyde
           dehydrogenase (ABALDH), YdcW of Escherichia coli K12,
           catalyzes the oxidation of gamma-aminobutyraldehyde to
           gamma-aminobutyric acid. ABALDH can also oxidize n-alkyl
           medium-chain aldehydes, but with a lower catalytic
           efficiency.
          Length = 450

 Score = 87.0 bits (216), Expect = 1e-18
 Identities = 52/169 (30%), Positives = 71/169 (42%), Gaps = 58/169 (34%)

Query: 209 LELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLHKKKID---------- 258
           LELGG   +IV +DADL+ A   +  A    AGQ CT   R+++H+   D          
Sbjct: 225 LELGGKAPVIVFDDADLDAAVAGIATAGYYNAGQDCTAACRVYVHESVYDEFVAALVEAV 284

Query: 259 ---RPGY----------------------FVE-----------------------PTIVT 270
              R G                       FVE                       PT+V 
Sbjct: 285 SAIRVGDPDDEDTEMGPLNSAAQRERVAGFVERAPAHARVLTGGRRAEGPGYFYEPTVVA 344

Query: 271 GLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDV 319
           G+  +  +V  E F P+V V PFD  DEAI   N+V+ GL+SS++T+DV
Sbjct: 345 GVAQDDEIVQEEIFGPVVTVQPFDDEDEAIELANDVEYGLASSVWTRDV 393



 Score = 62.7 bits (153), Expect = 9e-11
 Identities = 40/142 (28%), Positives = 66/142 (46%), Gaps = 1/142 (0%)

Query: 55  INPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDALRHKLVPL 114
           ++P+TGE IA+V   +  D    + ++ AA+ +W       R + + ++ DA+      L
Sbjct: 2   VDPATGEEIATVPDASAADVDAAVAAAHAAFPSWRRTTPAERSKALLKLADAIEENAEEL 61

Query: 115 GQLVSLEMGKILAEGI-GEVQEFIDICDYAVGLSRTYSGSILPSERPGHVLLENWNPLGV 173
             L S   GK L      E+   +D   +  G +RT  G       PGH  +    P+GV
Sbjct: 62  AALESRNTGKPLHLVRDDELPGAVDNFRFFAGAARTLEGPAAGEYLPGHTSMIRREPIGV 121

Query: 174 VGIISAFNFPVAVYGWNAAIAL 195
           V  I+ +N+P+ +  W  A AL
Sbjct: 122 VAQIAPWNYPLMMAAWKIAPAL 143


>gnl|CDD|183918 PRK13252, PRK13252, betaine aldehyde dehydrogenase; Provisional.
          Length = 488

 Score = 86.1 bits (214), Expect = 2e-18
 Identities = 49/150 (32%), Positives = 75/150 (50%), Gaps = 2/150 (1%)

Query: 47  ANGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDA 106
            +GE    INP+TGE +A+VQ     D    + S++     WAA+ A  R  I+R+  D 
Sbjct: 19  TSGETFEVINPATGEVLATVQAATPADVEAAVASAKQGQKIWAAMTAMERSRILRRAVDI 78

Query: 107 LRHKLVPLGQLVSLEMGKILAEGI-GEVQEFIDICDYAVGLSRTYSGSILPSERPGHVLL 165
           LR +   L  L +L+ GK + E    ++    D+ +Y  GL+    G  +P  R G  + 
Sbjct: 79  LRERNDELAALETLDTGKPIQETSVVDIVTGADVLEYYAGLAPALEGEQIPL-RGGSFVY 137

Query: 166 ENWNPLGVVGIISAFNFPVAVYGWNAAIAL 195
               PLGV   I A+N+P+ +  W +A AL
Sbjct: 138 TRREPLGVCAGIGAWNYPIQIACWKSAPAL 167



 Score = 57.2 bits (139), Expect = 6e-09
 Identities = 40/127 (31%), Positives = 63/127 (49%), Gaps = 11/127 (8%)

Query: 261 GYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVT 320
           G FV PT+ T    +  +V  E F P++ V  FD  DE I   N+ + GL++ +FT D++
Sbjct: 364 GAFVAPTVFTDCTDDMTIVREEIFGPVMSVLTFDDEDEVIARANDTEYGLAAGVFTADLS 423

Query: 321 NLFKWLGP-QGSDCGLINVNIPTNG---AEIGGAFGGEKYTGGGRESGSDSWKQYCRRST 376
              + +   +   C  IN    T G   AE+    GG K +G GRE+G  + + Y +  +
Sbjct: 424 RAHRVIHQLEAGIC-WIN----TWGESPAEM--PVGGYKQSGIGRENGIATLEHYTQIKS 476

Query: 377 VTINHGK 383
           V +  G 
Sbjct: 477 VQVEMGP 483



 Score = 39.5 bits (93), Expect = 0.002
 Identities = 19/47 (40%), Positives = 29/47 (61%)

Query: 209 LELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLHKK 255
           +ELGG + +IV +DADL+ A    + A   ++GQ CT   R+F+ K 
Sbjct: 249 MELGGKSPLIVFDDADLDRAADIAMLANFYSSGQVCTNGTRVFVQKS 295


>gnl|CDD|143443 cd07125, ALDH_PutA-P5CDH, Delta(1)-pyrroline-5-carboxylate
           dehydrogenase, PutA.  The proline catabolic enzymes of
           the aldehyde dehydrogenase (ALDH) protein superfamily,
           proline dehydrogenase and
           Delta(1)-pyrroline-5-carboxylate dehydrogenase (P5CDH,
           (EC=1.5.1.12 )), catalyze the two-step oxidation of
           proline to glutamate; P5CDH catalyzes the oxidation of
           glutamate semialdehyde, utilizing NAD+ as the electron
           acceptor. In some bacteria, the two enzymes are fused
           into the bifunctional flavoenzyme, proline utilization A
           (PutA) These enzymes play important roles in cellular
           redox control, superoxide generation, and apoptosis. In
           certain prokaryotes such as Escherichia coli, PutA is
           also a transcriptional repressor of the proline
           utilization genes.
          Length = 518

 Score = 85.3 bits (212), Expect = 5e-18
 Identities = 51/152 (33%), Positives = 71/152 (46%), Gaps = 3/152 (1%)

Query: 47  ANGEIIPSINPSTGE-TIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGD 105
             GE  P I+P+  E TI  V   + +D    +  + AA+A W+A P   R EI+ +  D
Sbjct: 43  ETGEGAPVIDPADHERTIGEVSLADAEDVDAALAIAAAAFAGWSATPVEERAEILEKAAD 102

Query: 106 ALRHKLVPLGQLVSLEMGKILAEGIGEVQEFIDICD-YAVGLSRTYSGSILPSERPGHVL 164
            L      L  L + E GK LA+   EV+E ID C  YA      +S   LP    G + 
Sbjct: 103 LLEANRGELIALAAAEAGKTLADADAEVREAIDFCRYYAAQARELFSDPELPGPT-GELN 161

Query: 165 LENWNPLGVVGIISAFNFPVAVYGWNAAIALV 196
               +  GV   IS +NFP+A++    A AL 
Sbjct: 162 GLELHGRGVFVCISPWNFPLAIFTGQIAAALA 193



 Score = 49.1 bits (118), Expect = 3e-06
 Identities = 38/136 (27%), Positives = 63/136 (46%), Gaps = 9/136 (6%)

Query: 251 FLHKKKIDRP-GYFVEPTIVTGLPHNSRVVHRETFAPIVYV--FPFDSLDEAITWNNEVK 307
            +    +D   GYFV P I+         +  E F PI++V  F  + LDEAI   N   
Sbjct: 377 LIAPAPLDDGNGYFVAPGIIEI--VGIFDLTTEVFGPILHVIRFKAEDLDEAIEDINATG 434

Query: 308 QGLSSSIFTKDVTNLFKWLGPQGSDCGLINVNIPTNGAEIG-GAFGGEKYTGGGRESGSD 366
            GL+  I ++D   +  W   +  + G + +N    GA +G   FGG   +G G ++G  
Sbjct: 435 YGLTLGIHSRDEREIEYWR--ERVEAGNLYINRNITGAIVGRQPFGGWGLSGTGPKAGGP 492

Query: 367 SW-KQYCRRSTVTINH 381
           ++  ++    TV++N 
Sbjct: 493 NYLLRFGNEKTVSLNT 508


>gnl|CDD|184426 PRK13968, PRK13968, putative succinate semialdehyde dehydrogenase;
           Provisional.
          Length = 462

 Score = 84.5 bits (209), Expect = 7e-18
 Identities = 70/274 (25%), Positives = 106/274 (38%), Gaps = 83/274 (30%)

Query: 54  SINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDALRHKLVP 113
           S+NP+TGE ++ +      D    ++ + A +  W       R + +R IG ALR +   
Sbjct: 11  SVNPATGEQLSVLPWAGADDIENALQLAAAGFRDWRETNIDYRAQKLRDIGKALRARSEE 70

Query: 114 LGQLVSLEMGKILAEGIGEVQEFIDICD-YAVGLSRTYSGSILPSERPGHVLLEN----- 167
           + Q+++ EMGK + +   EV +  ++CD YA      +  ++L +E     L+EN     
Sbjct: 71  MAQMITREMGKPINQARAEVAKSANLCDWYA-----EHGPAMLKAEP---TLVENQQAVI 122

Query: 168 -WNPLGVVGIISAFNFPV------------------------------------------ 184
            + PLG +  I  +NFP+                                          
Sbjct: 123 EYRPLGTILAIMPWNFPLWQVMRGAVPILLAGNGYLLKHAPNVMGCAQLIAQVFKDAGIP 182

Query: 185 -AVYGW-NAA------------IALVGVAVQSRFG------------KLLLELGGNNAII 218
             VYGW NA             IA V V    R G            K +LELGG++  I
Sbjct: 183 QGVYGWLNADNDGVSQMINDSRIAAVTVTGSVRAGAAIGAQAGAALKKCVLELGGSDPFI 242

Query: 219 VNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFL 252
           V  DADL LA +  V       GQ C   +R  +
Sbjct: 243 VLNDADLELAVKAAVAGRYQNTGQVCAAAKRFII 276



 Score = 42.5 bits (100), Expect = 3e-04
 Identities = 37/130 (28%), Positives = 59/130 (45%), Gaps = 5/130 (3%)

Query: 249 RLFLHKKKIDRPGYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQ 308
           RL L  +KI   G +  PT++  +         E F P+  +      + A+   N+ + 
Sbjct: 332 RLLLGGEKIAGAGNYYAPTVLANVTPEMTAFREELFGPVAAITVAKDAEHALELANDSEF 391

Query: 309 GLSSSIFTKDVTNLFKWLGPQGSDCGLINVN-IPTNGAEIGGAFGGEKYTGGGRESGSDS 367
           GLS++IFT D T   +       +CG + +N    + A +  AFGG K +G GRE     
Sbjct: 392 GLSATIFTTDETQARQM--AARLECGGVFINGYCASDARV--AFGGVKKSGFGRELSHFG 447

Query: 368 WKQYCRRSTV 377
             ++C   TV
Sbjct: 448 LHEFCNIQTV 457


>gnl|CDD|143460 cd07142, ALDH_F2BC, Arabidosis aldehyde dehydrogenase family 2 B4,
           B7, C4-like.  Included in this CD is the Arabidosis
           aldehyde dehydrogenase family 2 members B4 and B7
           (EC=1.2.1.3),  which are mitochondrial homotetramers
           that oxidize acetaldehyde and glycolaldehyde, but not
           L-lactaldehyde. Also in this group, is the Arabidosis
           cytosolic, homotetramer ALDH2C4 (EC=1.2.1.3), an enzyme
           involved in the oxidation of sinapalehyde and
           coniferaldehyde.
          Length = 476

 Score = 84.5 bits (209), Expect = 8e-18
 Identities = 54/183 (29%), Positives = 83/183 (45%), Gaps = 20/183 (10%)

Query: 47  ANGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAY--AAWAALPAPRRGEIVRQIG 104
           A+G+  P+I+P  GE IA V +G+ +D  R ++++R A+    W  +    R  I+ +  
Sbjct: 16  ASGKTFPTIDPRNGEVIAHVAEGDAEDVDRAVKAARKAFDEGPWPRMTGYERSRILLRFA 75

Query: 105 DALRHKLVPLGQLVSLEMGKILAEG-IGEVQEFIDICDYAVGLSRTYSGSILPSERPGHV 163
           D L      L  L + + GK   +    EV     +  Y  G +    G  LP++ P HV
Sbjct: 76  DLLEKHADELAALETWDNGKPYEQARYAEVPLAARLFRYYAGWADKIHGMTLPADGPHHV 135

Query: 164 --LLENWNPLGVVGIISAFNFPVAVYGWNAAIAL-------VGVAVQSRF-----GKLLL 209
             L E   P+GVVG I  +NFP+ ++ W    AL       +  A Q+        KL  
Sbjct: 136 YTLHE---PIGVVGQIIPWNFPLLMFAWKVGPALACGNTIVLKPAEQTPLSALLAAKLAA 192

Query: 210 ELG 212
           E G
Sbjct: 193 EAG 195



 Score = 63.7 bits (155), Expect = 5e-11
 Identities = 33/117 (28%), Positives = 59/117 (50%), Gaps = 3/117 (2%)

Query: 255 KKIDRPGYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSI 314
            +I   GY+++PTI + +  + ++   E F P+  +  F ++DE I   N  K GL++ +
Sbjct: 355 DRIGSKGYYIQPTIFSDVKDDMKIARDEIFGPVQSILKFKTVDEVIKRANNSKYGLAAGV 414

Query: 315 FTKDVTNLFKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRESGSDSWKQY 371
           F+K++ +    L  +    G + VN   +  +    FGG K +G GRE G  +   Y
Sbjct: 415 FSKNI-DTANTLS-RALKAGTVWVNC-YDVFDASIPFGGYKMSGIGREKGIYALNNY 468


>gnl|CDD|215259 PLN02466, PLN02466, aldehyde dehydrogenase family 2 member.
          Length = 538

 Score = 84.5 bits (209), Expect = 1e-17
 Identities = 107/469 (22%), Positives = 168/469 (35%), Gaps = 160/469 (34%)

Query: 47  ANGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAY--AAWAALPAPRRGEIVRQIG 104
           A+G+  P+++P TGE IA V +G+ +D +R + ++R A+    W  + A  R  I+ +  
Sbjct: 70  ASGKTFPTLDPRTGEVIAHVAEGDAEDVNRAVAAARKAFDEGPWPKMTAYERSRILLRFA 129

Query: 105 DALRHKLVPLGQLVSLEMGKILAEGIG-EVQEFIDICDYAVGLSRT-------------- 149
           D L      L  L + + GK   +    E+  F  +  Y  G +                
Sbjct: 130 DLLEKHNDELAALETWDNGKPYEQSAKAELPMFARLFRYYAGWADKIHGLTVPADGPHHV 189

Query: 150 --------YSGSILPSERP---------------------------------GHVLLENW 168
                    +G I+P   P                                   +L E  
Sbjct: 190 QTLHEPIGVAGQIIPWNFPLLMFAWKVGPALACGNTIVLKTAEQTPLSALYAAKLLHEAG 249

Query: 169 NPLGVVGIISAFNFPVAVYGWNAAIA-------------------LVGVAVQSRFGKLLL 209
            P GV+ ++S F  P A     AA+A                   ++ +A +S    + L
Sbjct: 250 LPPGVLNVVSGFG-PTA----GAALASHMDVDKLAFTGSTDTGKIVLELAAKSNLKPVTL 304

Query: 210 ELGGNNAIIVNEDADLN----LAQQCLVF----ACCGTAGQRCTTTRRL---FLHKKK-- 256
           ELGG +  IV EDAD++    LA   L F     CC  AG R     R+   F+ K K  
Sbjct: 305 ELGGKSPFIVCEDADVDKAVELAHFALFFNQGQCCC--AGSRTFVHERVYDEFVEKAKAR 362

Query: 257 -------------------IDR-----------------------------PGYFVEPTI 268
                              ID                               GY+++PT+
Sbjct: 363 ALKRVVGDPFKKGVEQGPQIDSEQFEKILRYIKSGVESGATLECGGDRFGSKGYYIQPTV 422

Query: 269 VTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDV---TNLFKW 325
            + +  +  +   E F P+  +  F  LDE I   N  + GL++ +FT+++     L + 
Sbjct: 423 FSNVQDDMLIAQDEIFGPVQSILKFKDLDEVIRRANNTRYGLAAGVFTQNLDTANTLSRA 482

Query: 326 L--GPQGSDC-GLINVNIPTNGAEIGGAFGGEKYTGGGRESGSDSWKQY 371
           L  G    +C  + +  IP         FGG K +G GRE G  S   Y
Sbjct: 483 LRVGTVWVNCFDVFDAAIP---------FGGYKMSGIGREKGIYSLNNY 522


>gnl|CDD|143428 cd07110, ALDH_F10_BADH, Arabidopsis betaine aldehyde dehydrogenase
           1 and 2, ALDH family 10A8 and 10A9-like.  Present in
           this CD are the Arabidopsis betaine aldehyde
           dehydrogenase (BADH) 1 (chloroplast) and 2
           (mitochondria), also known as, aldehyde dehydrogenase
           family 10 member A8 and aldehyde dehydrogenase family 10
           member A9, respectively, and are putative dehydration-
           and salt-inducible BADHs (EC 1.2.1.8) that catalyze the
           oxidation of betaine aldehyde to the compatible solute
           glycine betaine.
          Length = 456

 Score = 83.6 bits (207), Expect = 1e-17
 Identities = 39/147 (26%), Positives = 66/147 (44%), Gaps = 9/147 (6%)

Query: 55  INPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDALRHKLVPL 114
           INP+T  TI  +     +D    + ++R A+  W       R + +R I + +R +   L
Sbjct: 2   INPATEATIGEIPAATAEDVDAAVRAARRAFPRWKKTTGAERAKYLRAIAEGVRERREEL 61

Query: 115 GQLVSLEMGKILAEGIGEVQEFIDICDYAVGLS---RTYSGSILPSERPG---HVLLENW 168
            +L + + GK L E   +V +     +Y   L+      +   +P         V  E  
Sbjct: 62  AELEARDNGKPLDEAAWDVDDVAGCFEYYADLAEQLDAKAERAVPLPSEDFKARVRRE-- 119

Query: 169 NPLGVVGIISAFNFPVAVYGWNAAIAL 195
            P+GVVG+I+ +NFP+ +  W  A AL
Sbjct: 120 -PVGVVGLITPWNFPLLMAAWKVAPAL 145



 Score = 57.4 bits (139), Expect = 5e-09
 Identities = 32/112 (28%), Positives = 54/112 (48%), Gaps = 15/112 (13%)

Query: 259 RPGYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKD 318
             GYF+ PT+   +P +SR+   E F P++ V  F + DEAI   N+ + GL++++ ++D
Sbjct: 339 EKGYFIAPTVFADVPTDSRIWREEIFGPVLCVRSFATEDEAIALANDSEYGLAAAVISRD 398

Query: 319 ------VTNLFKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRESG 364
                 V            + G++ +N           +GG K +G GRE G
Sbjct: 399 AERCDRVAEAL--------EAGIVWINCS-QPCFPQAPWGGYKRSGIGRELG 441



 Score = 52.7 bits (127), Expect = 1e-07
 Identities = 22/45 (48%), Positives = 28/45 (62%)

Query: 209 LELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLH 253
           LELGG + IIV +DADL  A +  +F C    GQ C+ T RL +H
Sbjct: 228 LELGGKSPIIVFDDADLEKAVEWAMFGCFWNNGQICSATSRLLVH 272


>gnl|CDD|143470 cd07152, ALDH_BenzADH, NAD-dependent benzaldehyde dehydrogenase
           II-like.  NAD-dependent, benzaldehyde dehydrogenase II
           (XylC, BenzADH, EC=1.2.1.28) is involved in the
           oxidation of benzyl alcohol to benzoate. In
           Acinetobacter calcoaceticus, this process is carried out
           by the chromosomally encoded, benzyl alcohol
           dehydrogenase (xylB) and benzaldehyde dehydrogenase II
           (xylC) enzymes; whereas in Pseudomonas putida they are
           encoded by TOL plasmids.
          Length = 443

 Score = 83.5 bits (207), Expect = 2e-17
 Identities = 44/125 (35%), Positives = 56/125 (44%), Gaps = 1/125 (0%)

Query: 60  GETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDALRHKLVPLGQLVS 119
           G  +  V   +  D  R    + AA  AWAA P   R  ++R+  D L      +   + 
Sbjct: 1   GAVLGEVGVADAADVDRAAARAAAAQRAWAATPPRERAAVLRRAADLLEEHADEIADWIV 60

Query: 120 LEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSILPSERPGHVLLENWNPLGVVGIISA 179
            E G I  +   EV   I     A GL     G ILPS  PG + L    PLGVVG+IS 
Sbjct: 61  RESGSIRPKAGFEVGAAIGELHEAAGLPTQPQGEILPSA-PGRLSLARRVPLGVVGVISP 119

Query: 180 FNFPV 184
           FNFP+
Sbjct: 120 FNFPL 124



 Score = 69.2 bits (170), Expect = 8e-13
 Identities = 36/120 (30%), Positives = 60/120 (50%), Gaps = 3/120 (2%)

Query: 261 GYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVT 320
           G F  PT+++G+         E F P+  V  FDS +EA+   N+ + GLS+ I ++DV 
Sbjct: 326 GLFYRPTVLSGVKPGMPAFDEEIFGPVAPVTVFDSDEEAVALANDTEYGLSAGIISRDVG 385

Query: 321 NLFKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTG-GGRESGSDSWKQYCRRSTVTI 379
                L  +    G++++N  T   E    FGG   +G G R  G  +W+++ +   VT+
Sbjct: 386 RAMA-LADR-LRTGMLHINDQTVNDEPHNPFGGMGASGNGSRFGGPANWEEFTQWQWVTV 443



 Score = 53.1 bits (128), Expect = 1e-07
 Identities = 26/72 (36%), Positives = 35/72 (48%)

Query: 184 VAVYGWNAAIALVGVAVQSRFGKLLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQR 243
           ++  G  A    VG A      K+ LELGG NA+IV +DADL+LA     +      GQ 
Sbjct: 193 ISFTGSTAVGRKVGEAAGRHLKKVSLELGGKNALIVLDDADLDLAASNGAWGAFLHQGQI 252

Query: 244 CTTTRRLFLHKK 255
           C    R  +H+ 
Sbjct: 253 CMAAGRHLVHES 264


>gnl|CDD|143427 cd07109, ALDH_AAS00426, Uncharacterized Saccharopolyspora spinosa
           aldehyde dehydrogenase (AAS00426)-like.  Uncharacterized
           aldehyde dehydrogenase of Saccharopolyspora spinosa
           (AAS00426) and other similar sequences, are present in
           this CD.
          Length = 454

 Score = 82.7 bits (205), Expect = 3e-17
 Identities = 41/119 (34%), Positives = 60/119 (50%), Gaps = 2/119 (1%)

Query: 261 GYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVT 320
           GYFV PT++  +P +SR+   E F P++ V PFD   EAI   N    GL + ++T+D  
Sbjct: 338 GYFVAPTLLDDVPPDSRLAQEEIFGPVLAVMPFDDEAEAIALANGTDYGLVAGVWTRDGD 397

Query: 321 NLFKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRESGSDSWKQYCRRSTVTI 379
                +  +    G + VN    G  I   FGG K +G GRE G ++   Y +  TV +
Sbjct: 398 RAL-RVA-RRLRAGQVFVNNYGAGGGIELPFGGVKKSGHGREKGLEALYNYTQTKTVAV 454



 Score = 78.8 bits (195), Expect = 5e-16
 Identities = 65/272 (23%), Positives = 97/272 (35%), Gaps = 73/272 (26%)

Query: 54  SINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWA-ALPAPRRGEIVRQIGDALRHKLV 112
             +PSTGE  A + +G   D  R ++++R A+ +    L    RG ++ +I   +R    
Sbjct: 1   VFDPSTGEVFARIARGGAADVDRAVQAARRAFESGWLRLSPAERGRLLLRIARLIREHAD 60

Query: 113 PLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSILPSE---------RP--- 160
            L +L SL+ GK L +   +V+      +Y  G +    G  +P            P   
Sbjct: 61  ELARLESLDTGKPLTQARADVEAAARYFEYYGGAADKLHGETIPLGPGYFVYTVREPHGV 120

Query: 161 -GHVLLENWN-PLGVVG------------------------------IISAFNFPVAV-- 186
            GH++   WN PL + G                              +      P     
Sbjct: 121 TGHII--PWNYPLQITGRSVAPALAAGNAVVVKPAEDAPLTALRLAELAEEAGLPAGALN 178

Query: 187 ----YGWNAAIAL----------------VGVAVQ----SRFGKLLLELGGNNAIIVNED 222
                G  A  AL                 G+AV          + LELGG +  IV  D
Sbjct: 179 VVTGLGAEAGAALVAHPGVDHISFTGSVETGIAVMRAAAENVVPVTLELGGKSPQIVFAD 238

Query: 223 ADLNLAQQCLVFACCGTAGQRCTTTRRLFLHK 254
           ADL  A   +V A    AGQ C+   RL +H+
Sbjct: 239 ADLEAALPVVVNAIIQNAGQTCSAGSRLLVHR 270


>gnl|CDD|237391 PRK13473, PRK13473, gamma-aminobutyraldehyde dehydrogenase;
           Provisional.
          Length = 475

 Score = 82.6 bits (205), Expect = 3e-17
 Identities = 28/65 (43%), Positives = 42/65 (64%)

Query: 255 KKIDRPGYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSI 314
           +  D  GY+ EPT++ G   +  +V RE F P+V V PFD  D+A+ W N+   GL+SS+
Sbjct: 351 EAPDGKGYYYEPTLLAGARQDDEIVQREVFGPVVSVTPFDDEDQAVRWANDSDYGLASSV 410

Query: 315 FTKDV 319
           +T+DV
Sbjct: 411 WTRDV 415



 Score = 62.2 bits (152), Expect = 1e-10
 Identities = 39/148 (26%), Positives = 68/148 (45%), Gaps = 1/148 (0%)

Query: 49  GEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDALR 108
           GE  P  NP+TGE +A + + +       + ++ AA+  W+      R E + ++ DA+ 
Sbjct: 16  GEKQPVYNPATGEVLAEIAEASAAQVDAAVAAADAAFPEWSQTTPKERAEALLKLADAIE 75

Query: 109 HKLVPLGQLVSLEMGK-ILAEGIGEVQEFIDICDYAVGLSRTYSGSILPSERPGHVLLEN 167
                  +L SL  GK +      E+   +D+  +  G +R   G        GH  +  
Sbjct: 76  ENADEFARLESLNCGKPLHLALNDEIPAIVDVFRFFAGAARCLEGKAAGEYLEGHTSMIR 135

Query: 168 WNPLGVVGIISAFNFPVAVYGWNAAIAL 195
            +P+GVV  I+ +N+P+ +  W  A AL
Sbjct: 136 RDPVGVVASIAPWNYPLMMAAWKLAPAL 163


>gnl|CDD|237018 PRK11905, PRK11905, bifunctional proline
           dehydrogenase/pyrroline-5-carboxylate dehydrogenase;
           Reviewed.
          Length = 1208

 Score = 82.2 bits (204), Expect = 7e-17
 Identities = 48/158 (30%), Positives = 74/158 (46%), Gaps = 16/158 (10%)

Query: 46  KANGEIIPSINPS-TGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIG 104
             +G   P +NP+   + + +V + + +D  R + +++AA+  W+A PA  R  I+ +  
Sbjct: 563 DVDGGTRPVLNPADHDDVVGTVTEASAEDVERALAAAQAAFPEWSATPAAERAAILERAA 622

Query: 105 DALRHKLVPLGQLVSLEMGKILAEGIGEVQEFIDICD-YAVGLSRTYSGSILPSERPGHV 163
           D +   +  L  L   E GK LA  I EV+E +D    YA    R  +G           
Sbjct: 623 DLMEAHMPELFALAVREAGKTLANAIAEVREAVDFLRYYAAQARRLLNGPGHK------- 675

Query: 164 LLENWNPLGVVGIISAFNFPVAVYGWNAAIALV-GVAV 200
                 PLG V  IS +NFP+A++    A ALV G  V
Sbjct: 676 ------PLGPVVCISPWNFPLAIFTGQIAAALVAGNTV 707



 Score = 29.1 bits (66), Expect = 4.9
 Identities = 23/60 (38%), Positives = 31/60 (51%), Gaps = 9/60 (15%)

Query: 199 AVQSRFGK---LLLELGGNNAIIVNEDADLNLAQQCL--VFACC-GTAGQRCTTTRRLFL 252
            +  R G    L+ E GG NA+IV+  A   L +Q +  V A    +AGQRC+  R L L
Sbjct: 774 TLAKRSGPPVPLIAETGGQNAMIVDSSA---LPEQVVADVIASAFDSAGQRCSALRVLCL 830


>gnl|CDD|143430 cd07112, ALDH_GABALDH-PuuC, Escherichia coli NADP+-dependent
           gamma-glutamyl-gamma-aminobutyraldehyde dehydrogenase
           PuuC-like.  NADP+-dependent,
           gamma-glutamyl-gamma-aminobutyraldehyde dehydrogenase
           (GABALDH) PuuC of  Escherichia coli which catalyzes the
           conversion of putrescine to 4-aminobutanoate and other
           similar sequences are present in this CD.
          Length = 462

 Score = 81.5 bits (202), Expect = 8e-17
 Identities = 37/123 (30%), Positives = 63/123 (51%), Gaps = 3/123 (2%)

Query: 257 IDRPGYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFT 316
            +  G+FVEPT+  G+  + R+   E F P++ V  FDS +EA+   N+   GL++S++T
Sbjct: 343 TETGGFFVEPTVFDGVTPDMRIAREEIFGPVLSVITFDSEEEAVALANDSVYGLAASVWT 402

Query: 317 KDVTNLFKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRESGSDSWKQYCRRST 376
            D++   +    +    G + VN   +  +I   FGG K +G GR+    +  +Y    T
Sbjct: 403 SDLSRAHR--VARRLRAGTVWVNC-FDEGDITTPFGGFKQSGNGRDKSLHALDKYTELKT 459

Query: 377 VTI 379
             I
Sbjct: 460 TWI 462



 Score = 68.0 bits (167), Expect = 2e-12
 Identities = 44/151 (29%), Positives = 75/151 (49%), Gaps = 6/151 (3%)

Query: 49  GEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAY--AAWAALPAPRRGEIVRQIGDA 106
           GE   +INP+TG  +A V   +  D  R + ++R A+    W+ L    R  ++ ++ D 
Sbjct: 1   GETFATINPATGRVLAEVAACDAADVDRAVAAARRAFESGVWSRLSPAERKAVLLRLADL 60

Query: 107 LRHKLVPLGQLVSLEMGKILAEGI-GEVQEFIDICD-YAVGLSRTYSGSILPSERPGHVL 164
           +      L  L +L+MGK +++ +  +V    +    YA  + + Y G + P+  P  + 
Sbjct: 61  IEAHRDELALLETLDMGKPISDALAVDVPSAANTFRWYAEAIDKVY-GEVAPTG-PDALA 118

Query: 165 LENWNPLGVVGIISAFNFPVAVYGWNAAIAL 195
           L    PLGVVG +  +NFP+ +  W  A AL
Sbjct: 119 LITREPLGVVGAVVPWNFPLLMAAWKIAPAL 149



 Score = 39.9 bits (94), Expect = 0.002
 Identities = 21/68 (30%), Positives = 32/68 (47%), Gaps = 4/68 (5%)

Query: 199 AVQSRFGKLLLELGGNNAIIVNEDA-DLNLAQQCLVFACCGTAGQRCTTTRRLFLHKKKI 257
           + QS   ++ LE GG +  IV  DA DL+ A +          G+ C+   RL +H+   
Sbjct: 223 SGQSNLKRVWLECGGKSPNIVFADAPDLDAAAEAAAAGIFWNQGEVCSAGSRLLVHESIK 282

Query: 258 DRPGYFVE 265
           D    F+E
Sbjct: 283 DE---FLE 287


>gnl|CDD|143438 cd07120, ALDH_PsfA-ACA09737, Pseudomonas putida aldehyde
           dehydrogenase PsfA (ACA09737)-like.  Included in this CD
           is the aldehyde dehydrogenase (PsfA, locus ACA09737) of
           Pseudomonas putida involved in furoic acid metabolism.
           Transcription of psfA was induced in response to
           2-furoic acid, furfuryl alcohol, and furfural.
          Length = 455

 Score = 80.5 bits (199), Expect = 2e-16
 Identities = 48/134 (35%), Positives = 69/134 (51%), Gaps = 4/134 (2%)

Query: 54  SINPSTGETIASVQQGNVQDYHRCIESSRAAY--AAWAALPAPRRGEIVRQIGDALRHKL 111
           SI+P+TGE I +   G V +    I ++R A+    WA  P   R  ++ ++ DA     
Sbjct: 1   SIDPATGEVIGTYADGGVAEAEAAIAAARRAFDETDWAHDPR-LRARVLLELADAFEANA 59

Query: 112 VPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSILPSERPGHVLLENWNPL 171
             L +L++LE GKIL E   E+   I    Y  GL+RT +G ++  E PG   L    P+
Sbjct: 60  ERLARLLALENGKILGEARFEISGAISELRYYAGLARTEAGRMIEPE-PGSFSLVLREPM 118

Query: 172 GVVGIISAFNFPVA 185
           GV GII  +N PV 
Sbjct: 119 GVAGIIVPWNSPVV 132



 Score = 60.0 bits (146), Expect = 8e-10
 Identities = 52/250 (20%), Positives = 82/250 (32%), Gaps = 70/250 (28%)

Query: 192 AIALVGVAVQSRFGKLLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLF 251
           AI         R G   LELGG    IV +DADL+ A   L  A    AGQ C    R+ 
Sbjct: 212 AIMAAAAPTLKRLG---LELGGKTPCIVFDDADLDAALPKLERALTIFAGQFCMAGSRVL 268

Query: 252 LHKKKIDR---------------PGY----------------FVEPTIVTGLPHNSRVVH 280
           + +   D                PG                  V+  +   +   + VV 
Sbjct: 269 VQRSIADEVRDRLAARLAAVKVGPGLDPASDMGPLIDRANVDRVDRMVERAIAAGAEVVL 328

Query: 281 R-------------------------------ETFAPIVYVFPFDSLDEAITWNNEVKQG 309
           R                               E F P++ +  FD   EA+   N+   G
Sbjct: 329 RGGPVTEGLAKGAFLRPTLLEVDDPDADIVQEEIFGPVLTLETFDDEAEAVALANDTDYG 388

Query: 310 LSSSIFTKDVTNLFKWLGPQGSDCGLINVNIPTNGAEIGGA-FGGEKYTGGGRESGSDSW 368
           L++S++T+D+    +    +    G + +N          A  GG + +G GR  G  + 
Sbjct: 389 LAASVWTRDLARAMRV--ARAIRAGTVWIN--DWNKLFAEAEEGGYRQSGLGRLHGVAAL 444

Query: 369 KQYCRRSTVT 378
           + +     + 
Sbjct: 445 EDFIEYKHIY 454


>gnl|CDD|132260 TIGR03216, OH_muco_semi_DH, 2-hydroxymuconic semialdehyde
           dehydrogenase.  Members of this protein family are
           2-hydroxymuconic semialdehyde dehydrogenase. Many
           aromatic compounds are catabolized by way of the
           catechol, via the meta-cleavage pathway, to pyruvate and
           acetyl-CoA. This enzyme performs the second of seven
           steps in that pathway for catechol degradation [Energy
           metabolism, Other].
          Length = 481

 Score = 80.2 bits (198), Expect = 2e-16
 Identities = 44/126 (34%), Positives = 68/126 (53%), Gaps = 15/126 (11%)

Query: 261 GYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVT 320
           G +V+PTI TGLP ++RVV  E F P  ++ PFDS +E I   N+   GL++S++T+D++
Sbjct: 364 GAWVQPTIWTGLPDSARVVTEEIFGPCCHIAPFDSEEEVIALANDTPYGLAASVWTEDLS 423

Query: 321 NLFKWLGPQGSDCGLINVN------IPTNGAEIGGAFGGEKYTGGGRESGSDSWKQYCRR 374
              +    +  + G++ VN      + T        FGG K +G GRE G  S + Y   
Sbjct: 424 RAHRV--ARQMEVGIVWVNSWFLRDLRT-------PFGGSKLSGIGREGGVHSLEFYTEL 474

Query: 375 STVTIN 380
           + V I 
Sbjct: 475 TNVCIK 480



 Score = 45.9 bits (109), Expect = 2e-05
 Identities = 41/166 (24%), Positives = 61/166 (36%), Gaps = 25/166 (15%)

Query: 48  NGEIIPS------INPSTGETIASVQQGNVQDYHRCIESSRAAYAA-WAALPAPRRG--- 97
           NG  + S      INP  G  IA V +    +    + ++RAA    W  +    R    
Sbjct: 6   NGAFVESGKTFANINPVDGRVIARVHEAGAAEVDAAVAAARAALKGPWGKMTVAERADLL 65

Query: 98  -----EIVRQIGDALRHKLVPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSG 152
                EI R+  D L  ++   G+  SL     +  G    + F D+        +    
Sbjct: 66  YAVADEIERRFDDFLAAEVADTGKPRSLASHLDIPRGAANFRVFADVV-------KNAPT 118

Query: 153 SILPSERPGHVLLENW---NPLGVVGIISAFNFPVAVYGWNAAIAL 195
                  P      N+    PLGVVG+IS +N P+ +  W    AL
Sbjct: 119 ECFEMATPDGKGALNYAVRKPLGVVGVISPWNLPLLLMTWKVGPAL 164


>gnl|CDD|143408 cd07089, ALDH_CddD-AldA-like, Rhodococcus ruber 6-oxolauric acid
           dehydrogenase-like and related proteins.  The
           6-oxolauric acid dehydrogenase (CddD) from Rhodococcus
           ruber SC1 which converts 6-oxolauric acid to
           dodecanedioic acid; and the aldehyde dehydrogenase
           (locus SSP0762) from Staphylococcus saprophyticus subsp.
           saprophyticus ATCC 15305 and also, the Mycobacterium
           tuberculosis H37Rv ALDH AldA (locus Rv0768) sequence;
           and other similar sequences, are included in this CD.
          Length = 459

 Score = 80.0 bits (198), Expect = 2e-16
 Identities = 60/241 (24%), Positives = 95/241 (39%), Gaps = 68/241 (28%)

Query: 196 VGVAVQSRFG----KLLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLF 251
           VG  + ++      ++LLELGG +A IV +DADL  A    V  C   AGQ C  T RL 
Sbjct: 214 VGRRIMAQAAATLKRVLLELGGKSANIVLDDADLAAAAPAAVGVCMHNAGQGCALTTRLL 273

Query: 252 LHKKKIDR-------------------PGYFVEP----------------------TIVT 270
           + + + D                    PG  + P                       +VT
Sbjct: 274 VPRSRYDEVVEALAAAFEALPVGDPADPGTVMGPLISAAQRDRVEGYIARGRDEGARLVT 333

Query: 271 G--------------------LPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGL 310
           G                    + ++ R+   E F P++ V P+D  DEA+   N+   GL
Sbjct: 334 GGGRPAGLDKGFYVEPTLFADVDNDMRIAQEEIFGPVLVVIPYDDDDEAVRIANDSDYGL 393

Query: 311 SSSIFTKDVTNLFKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRESGSDSWKQ 370
           S  +++ DV   ++    +    G + +N    G      FGG K +G GRE+G +  ++
Sbjct: 394 SGGVWSADVDRAYRV--ARRIRTGSVGIN-GGGGYGPDAPFGGYKQSGLGRENGIEGLEE 450

Query: 371 Y 371
           +
Sbjct: 451 F 451



 Score = 66.9 bits (164), Expect = 5e-12
 Identities = 37/147 (25%), Positives = 58/147 (39%), Gaps = 6/147 (4%)

Query: 55  INPSTGETIASVQQGNVQDYHRCIESSRAAYAAWA-ALPAPRRGEIVRQIGDALRHKLVP 113
           INP+T E I +       D    I ++R A+     +  A  R   +RQ+ +AL  +   
Sbjct: 2   INPATEEVIGTAPDAGAADVDAAIAAARRAFDTGDWSTDAEERARCLRQLHEALEARKEE 61

Query: 114 LGQLVSLEMGKILA-EGIGEVQEFIDICDYAVGLSRTYSGSI-LPSERPGHVLLENW--- 168
           L  L+  E+G  +      +V   I    Y   L+ ++     LP               
Sbjct: 62  LRALLVAEVGAPVMTARAMQVDGPIGHLRYFADLADSFPWEFDLPVPALRGGPGRRVVRR 121

Query: 169 NPLGVVGIISAFNFPVAVYGWNAAIAL 195
            P+GVV  I+ +NFP  +     A AL
Sbjct: 122 EPVGVVAAITPWNFPFFLNLAKLAPAL 148


>gnl|CDD|183050 PRK11241, gabD, succinate-semialdehyde dehydrogenase I;
           Provisional.
          Length = 482

 Score = 80.0 bits (197), Expect = 3e-16
 Identities = 49/151 (32%), Positives = 75/151 (49%), Gaps = 6/151 (3%)

Query: 48  NGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDAL 107
           NGE+I   NP+ G+ + SV +    +    I+++  A  AW AL A  R  I+R+  + +
Sbjct: 24  NGEVIDVTNPANGDKLGSVPKMGADETRAAIDAANRALPAWRALTAKERANILRRWFNLM 83

Query: 108 RHKLVPLGQLVSLEMGKILAEGIGEVQ---EFIDICDYAVGLSRTYSGSILPSERPGHVL 164
                 L +L++LE GK LAE  GE+     FI+   +A    R Y G  +P  +    L
Sbjct: 84  MEHQDDLARLMTLEQGKPLAEAKGEISYAASFIEW--FAEEGKRIY-GDTIPGHQADKRL 140

Query: 165 LENWNPLGVVGIISAFNFPVAVYGWNAAIAL 195
           +    P+GV   I+ +NFP A+    A  AL
Sbjct: 141 IVIKQPIGVTAAITPWNFPAAMITRKAGPAL 171



 Score = 62.6 bits (152), Expect = 1e-10
 Identities = 31/102 (30%), Positives = 56/102 (54%), Gaps = 3/102 (2%)

Query: 261 GYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVT 320
           G F +PTI+  +P N++V   ETF P+  +F F    + I   N+ + GL++  + +D++
Sbjct: 365 GNFFQPTILVDVPANAKVAKEETFGPLAPLFRFKDEADVIAQANDTEFGLAAYFYARDLS 424

Query: 321 NLFKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRE 362
            +F+    +  + G++ +N      E+   FGG K +G GRE
Sbjct: 425 RVFRV--GEALEYGIVGINTGIISNEV-APFGGIKASGLGRE 463



 Score = 39.1 bits (91), Expect = 0.003
 Identities = 22/54 (40%), Positives = 30/54 (55%)

Query: 206 KLLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLHKKKIDR 259
           K+ LELGGN   IV +DADL+ A +  + +    AGQ C    RL++     DR
Sbjct: 251 KVSLELGGNAPFIVFDDADLDKAVEGALASKFRNAGQTCVCANRLYVQDGVYDR 304


>gnl|CDD|143409 cd07090, ALDH_F9_TMBADH, NAD+-dependent
           4-trimethylaminobutyraldehyde dehydrogenase, ALDH family
           9A1.  NAD+-dependent, 4-trimethylaminobutyraldehyde
           dehydrogenase (TMABADH, EC=1.2.1.47), also known as
           aldehyde dehydrogenase family 9 member A1 (ALDH9A1) in
           humans, is a cytosolic tetramer which catalyzes the
           oxidation of gamma-aminobutyraldehyde involved in
           4-aminobutyric acid (GABA) biosynthesis  and also
           oxidizes betaine aldehyde
           (gamma-trimethylaminobutyraldehyde) which is involved in
           carnitine biosynthesis.
          Length = 457

 Score = 79.3 bits (196), Expect = 4e-16
 Identities = 46/148 (31%), Positives = 74/148 (50%), Gaps = 15/148 (10%)

Query: 55  INPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDALRHKLVPL 114
           I P+TGE +A+V     +D    ++S++AA   W+A     RG I+R+  D LR +   +
Sbjct: 2   IEPATGEVLATVHCAGAEDVDLAVKSAKAAQKEWSATSGMERGRILRKAADLLRERNDEI 61

Query: 115 GQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSILPSERPGHVLLENWN----- 169
            +L +++ GK + E   ++    D  +Y  GL+ T SG         HV L   +     
Sbjct: 62  ARLETIDNGKPIEEARVDIDSSADCLEYYAGLAPTLSGE--------HVPLPGGSFAYTR 113

Query: 170 --PLGVVGIISAFNFPVAVYGWNAAIAL 195
             PLGV   I A+N+P+ +  W +A AL
Sbjct: 114 REPLGVCAGIGAWNYPIQIASWKSAPAL 141



 Score = 71.6 bits (176), Expect = 2e-13
 Identities = 37/123 (30%), Positives = 66/123 (53%), Gaps = 5/123 (4%)

Query: 261 GYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVT 320
           G++V P ++T    +  +V  E F P++ + PFD+ +E I   N+   GL++ +FT+D+ 
Sbjct: 339 GFYVSPCVLTDCTDDMTIVREEIFGPVMSILPFDTEEEVIRRANDTTYGLAAGVFTRDLQ 398

Query: 321 NLFKWLGP-QGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRESGSDSWKQYCRRSTVTI 379
              + +   Q   C +   NI  +  E+   FGG K +G GRE+G+ + + Y +  TV +
Sbjct: 399 RAHRVIAQLQAGTCWINTYNI--SPVEV--PFGGYKQSGFGRENGTAALEHYTQLKTVYV 454

Query: 380 NHG 382
             G
Sbjct: 455 EMG 457



 Score = 43.8 bits (104), Expect = 9e-05
 Identities = 20/57 (35%), Positives = 30/57 (52%), Gaps = 3/57 (5%)

Query: 209 LELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLHKKKIDRPGYFVE 265
           LELGG + +I+ +DADL  A    + A   + GQ C+   R+F+ +   D    F E
Sbjct: 223 LELGGKSPLIIFDDADLENAVNGAMMANFLSQGQVCSNGTRVFVQRSIKDE---FTE 276


>gnl|CDD|188167 TIGR01780, SSADH, succinate-semialdehyde dehydrogenase.  Succinic
           semialdehyde dehydrogenase is one of three enzymes
           constituting 4-aminobutyrate (GABA) degradation in both
           prokaryotes and eukaryotes, catalyzing the
           (NAD(P)+)-dependent catabolism reaction of succinic
           semialdehyde to succinate for metabolism by the citric
           acid cycle. The EC number depends on the cofactor:
           1.2.1.24 for NAD only, 1.2.1.79 for NADP only, and
           1.2.1.16 if both can be used. In Escherichia coli,
           succinic semialdehyde dehydrogenase is located in an
           unidirectionally transcribed gene cluster encoding
           enzymes for GABA degradation and is suggested to be
           cotranscribed with succinic semialdehyde transaminase
           from a common promoter upstream of SSADH. Similar gene
           arrangements can be found in characterized Ralstonia
           eutropha and the genome analysis of Bacillus subtilis.
           Prokaryotic succinic semialdehyde dehydrogenases
           (1.2.1.16) share high sequence homology to characterized
           succinic semialdehyde dehydrogenases from rat and human
           (1.2.1.24), exhibiting conservation of proposed cofactor
           binding residues, and putative active sites (G-237 &
           G-242, C-293 & G-259 respectively of rat SSADH).
           Eukaryotic SSADH enzymes exclusively utilize NAD+ as a
           cofactor, exhibiting little to no NADP+ activity. While
           a NADP+ preference has been detected in prokaryotes in
           addition to both NADP+- and NAD+-dependencies as in
           E.coli, Pseudomonas, and Klebsiella pneumoniae. The
           function of this alternative SSADH currently is unknown,
           but has been suggested to play a possible role in
           4-hydroxyphenylacetic degradation. Just outside the
           scope of this model, are several sequences belonging to
           clades scoring between trusted and noise. These
           sequences may be actual SSADH enzymes, but lack
           sufficiently close characterized homologs to make a
           definitive assignment at this time. SSADH enzyme belongs
           to the aldehyde dehydrogenase family (pfam00171),
           sharing a common evolutionary origin and enzymatic
           mechanism with lactaldehyde dehydrogenase. Like in
           lactaldehyde dehydrogenase and succinate semialdehyde
           dehydrogenase, the mammalian catalytic glutamic acid and
           cysteine residues are conserved in all the enzymes of
           this family (PS00687, PS00070) [Central intermediary
           metabolism, Other].
          Length = 448

 Score = 79.0 bits (195), Expect = 4e-16
 Identities = 34/117 (29%), Positives = 61/117 (52%), Gaps = 3/117 (2%)

Query: 255 KKIDRPGYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSI 314
           K+ +  G F EPT+++ +  +  V   ETF P+  VF FD  +E I   N+ + GL++  
Sbjct: 331 KRHELGGNFFEPTVLSNVTADMLVSKEETFGPLAPVFKFDDEEEVIAIANDTEVGLAAYF 390

Query: 315 FTKDVTNLFKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRESGSDSWKQY 371
           F++D++ +  W   +  + G++ +N       +   FGG K +G GRE      ++Y
Sbjct: 391 FSRDLSRI--WRVAEALEYGMVGINTGLISNVV-APFGGVKQSGLGREGSKYGIEEY 444



 Score = 70.5 bits (173), Expect = 2e-13
 Identities = 45/143 (31%), Positives = 69/143 (48%), Gaps = 6/143 (4%)

Query: 56  NPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDALRHKLVPLG 115
           NP+TGE I SV    V +    I ++  A+  W A  A  R  ++R+  + +      L 
Sbjct: 3   NPATGEIIGSVPDQGVDETEAAIRAAYEAFKTWRATTAKERSSLLRKWYNLMMENKDDLA 62

Query: 116 QLVSLEMGKILAEGIGEVQ---EFIDICDYAVGLSRTYSGSILPSERPGHVLLENWNPLG 172
           +L++LE GK L E  GE+     F++   +A    R Y G  +PS +    L+    P+G
Sbjct: 63  RLITLENGKPLKEAKGEILYAASFLEW--FAEEAKRVY-GDTIPSPQSDKRLIVIKQPVG 119

Query: 173 VVGIISAFNFPVAVYGWNAAIAL 195
           V   I+ +NFP A+    A  AL
Sbjct: 120 VCAAITPWNFPAAMITRKAGAAL 142


>gnl|CDD|130783 TIGR01722, MMSDH, methylmalonic acid semialdehyde dehydrogenase.
           Involved in valine catabolism,
           methylmalonate-semialdehyde dehydrogenase catalyzes the
           irreversible NAD+- and CoA-dependent oxidative
           decarboxylation of methylmalonate semialdehyde to
           propionyl-CoA. Methylmalonate-semialdehyde dehydrogenase
           has been characterized in both prokaryotes and
           eukaryotes, functioning as a mammalian tetramer and a
           bacterial homodimer. Although similar in monomeric
           molecular mass and enzymatic activity, the N-terminal
           sequence in P.aeruginosa does not correspond with the
           N-terminal sequence predicted for rat liver. Sequence
           homology to a variety of prokaryotic and eukaryotic
           aldehyde dehydrogenases places MMSDH in the aldehyde
           dehydrogenase (NAD+) superfamily (pfam00171), making
           MMSDH's CoA requirement unique among known ALDHs.
           Methylmalonate semialdehyde dehydrogenase is closely
           related to betaine aldehyde dehydrogenase,
           2-hydroxymuconic semialdehyde dehydrogenase, and class 1
           and 2 aldehyde dehydrogenase. In Bacillus, a highly
           homologous protein to methylmalonic acid semialdehyde
           dehydrogenase, groups out from the main MMSDH clade with
           Listeria and Sulfolobus. This Bacillus protein has been
           suggested to be located in an iol operon and/or involved
           in myo-inositol catabolism, converting malonic
           semialdehyde to acetyl CoA ad CO2. The preceeding
           enzymes responsible for valine catabolism are present in
           Bacillus, Listeria, and Sulfolobus [Energy metabolism,
           Amino acids and amines].
          Length = 477

 Score = 79.2 bits (195), Expect = 5e-16
 Identities = 36/150 (24%), Positives = 67/150 (44%)

Query: 47  ANGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDA 106
           A+G  IP  NP+T E    V   +V +    + S+R  +  W      +R  ++ +    
Sbjct: 13  ASGTYIPVTNPATNEVTTKVAFASVDEVDAAVASARETFLTWGQTSLAQRTSVLLRYQAL 72

Query: 107 LRHKLVPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSILPSERPGHVLLE 166
           L+     + +L++ E GK  ++ +G+V   +++ ++A G++    G           +  
Sbjct: 73  LKEHRDEIAELITAEHGKTHSDALGDVARGLEVVEHACGVNSLLKGETSTQVATRVDVYS 132

Query: 167 NWNPLGVVGIISAFNFPVAVYGWNAAIALV 196
              PLGV   I+ FNFP  +  W   IA+ 
Sbjct: 133 IRQPLGVCAGITPFNFPAMIPLWMFPIAIA 162



 Score = 50.3 bits (120), Expect = 1e-06
 Identities = 59/242 (24%), Positives = 84/242 (34%), Gaps = 67/242 (27%)

Query: 202 SRFGKLLLELGG-NNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFL-------- 252
           S  GK +  LGG  N ++V  DAD + A   LV A  G AGQRC       L        
Sbjct: 235 SAHGKRVQALGGAKNHMVVMPDADKDAAADALVGAAYGAAGQRCMAISAAVLVGAADEWV 294

Query: 253 -----------------------------HKKKI----------------DRPGYFVE-- 265
                                         K ++                D  GY V+  
Sbjct: 295 PEIRERAEKIRIGPGDDPGAEMGPLITPQAKDRVASLIAGGAAEGAEVLLDGRGYKVDGY 354

Query: 266 -------PTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKD 318
                  PT++  +P   +    E F P++ V   D+L+EAI   N    G  ++IFT+D
Sbjct: 355 EEGNWVGPTLLERVPPTMKAYQEEIFGPVLCVLEADTLEEAIALINASPYGNGTAIFTRD 414

Query: 319 VTNLFKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEK--YTGGGRESGSDSWKQYCRRST 376
                ++      + G + VN+P        +F G K  + G     G      Y R  T
Sbjct: 415 GAAARRF--QHEIEVGQVGVNVPIPVPLPYFSFTGWKDSFFGDHHIYGKQGTHFYTRGKT 472

Query: 377 VT 378
           VT
Sbjct: 473 VT 474


>gnl|CDD|143431 cd07113, ALDH_PADH_NahF, Escherichia coli NAD+-dependent
           phenylacetaldehyde dehydrogenase PadA-like.
           NAD+-dependent, homodimeric, phenylacetaldehyde
           dehydrogenase (PADH, EC=1.2.1.39) PadA of Escherichia
           coli involved in the catabolism of 2-phenylethylamine,
           and other related sequences, are present in this CD.
           Also included is the Pseudomonas fluorescens ST StyD
           PADH involved in styrene catabolism, the Sphingomonas
           sp. LB126 FldD protein involved in fluorene degradation,
           and the Novosphingobium aromaticivorans NahF
           salicylaldehyde dehydrogenase involved in the
           NAD+-dependent conversion of salicylaldehyde to
           salicylate.
          Length = 477

 Score = 78.6 bits (194), Expect = 7e-16
 Identities = 39/125 (31%), Positives = 67/125 (53%), Gaps = 3/125 (2%)

Query: 255 KKIDRPGYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSI 314
           + +   GYFV+PT+V     +SR++  ETF P+V   P++  +E I   N+   GL++S+
Sbjct: 354 EALAGEGYFVQPTLVLARSADSRLMREETFGPVVSFVPYEDEEELIQLINDTPFGLTASV 413

Query: 315 FTKDVTNLFKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRESGSDSWKQYCRR 374
           +T +++   +++     + G + VN+ T   +    FGG K +G GRE GS     Y   
Sbjct: 414 WTNNLSKALRYI--PRIEAGTVWVNMHTF-LDPAVPFGGMKQSGIGREFGSAFIDDYTEL 470

Query: 375 STVTI 379
            +V I
Sbjct: 471 KSVMI 475



 Score = 71.7 bits (176), Expect = 1e-13
 Identities = 46/165 (27%), Positives = 66/165 (40%), Gaps = 10/165 (6%)

Query: 41  DGTSWKANGEIIPSI-NPSTGETIASVQQGNVQDYHRCIESSRAAY-AAWAALPAPRRGE 98
           DG       E    I NP+T + IASV      D    + S+  A+ +AWA      RG 
Sbjct: 5   DGRPVAGQSEKRLDITNPATEQVIASVASATEADVDAAVASAWRAFVSAWAKTTPAERGR 64

Query: 99  IVRQIGDALRHKLVPLGQLVSLEMGKI--LAEGIGEVQEFIDICDYAVGLSRTYSGSILP 156
           I+ ++ D +      L QL +L  GK   L+    EV +  +   Y  G +   +G  L 
Sbjct: 65  ILLRLADLIEQHGEELAQLETLCSGKSIHLSRAF-EVGQSANFLRYFAGWATKINGETLA 123

Query: 157 SERPG-----HVLLENWNPLGVVGIISAFNFPVAVYGWNAAIALV 196
              P      +       P+GVV  I  +NF V +  W    AL 
Sbjct: 124 PSIPSMQGERYTAFTRREPVGVVAGIVPWNFSVMIAVWKIGAALA 168


>gnl|CDD|237017 PRK11904, PRK11904, bifunctional proline
           dehydrogenase/pyrroline-5-carboxylate dehydrogenase;
           Reviewed.
          Length = 1038

 Score = 78.7 bits (195), Expect = 1e-15
 Identities = 51/196 (26%), Positives = 80/196 (40%), Gaps = 21/196 (10%)

Query: 16  YSDSVSKYPFLKELGLSGSVNPGVYDGTSWKA------NGEIIPSINPS-TGETIASVQQ 68
            +D     P    +          +    W+A       GE  P ++P+     +  V  
Sbjct: 530 LNDRSELEPLAAAIAA--------FLEKQWQAGPIINGEGEARPVVSPADRRRVVGEVAF 581

Query: 69  GNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDALRHKLVPLGQLVSLEMGKILAE 128
            + +   + + ++RAA+ AW+  P   R  I+ +  D L      L  L   E GK L +
Sbjct: 582 ADAEQVEQALAAARAAFPAWSRTPVEERAAILERAADLLEANRAELIALCVREAGKTLQD 641

Query: 129 GIGEVQEFIDICDYAVGLSRTYSGSILPSERPGHVLLEN---WNPLGVVGIISAFNFPVA 185
            I EV+E +D C Y    +R   G+  P + PG     N    +  GV   IS +NFP+A
Sbjct: 642 AIAEVREAVDFCRYYAAQARRLFGA--PEKLPGPTGESNELRLHGRGVFVCISPWNFPLA 699

Query: 186 VYGWNAAIALV-GVAV 200
           ++    A AL  G  V
Sbjct: 700 IFLGQVAAALAAGNTV 715



 Score = 29.4 bits (67), Expect = 4.1
 Identities = 19/50 (38%), Positives = 26/50 (52%), Gaps = 12/50 (24%)

Query: 210 ELGGNNAIIVNEDADLNLAQQCLV------FACCGTAGQRCTTTRRLFLH 253
           E GG NA+IV+  A   L +Q +       F    +AGQRC+  R LF+ 
Sbjct: 796 ETGGQNAMIVDSTA---LPEQVVDDVVTSAF---RSAGQRCSALRVLFVQ 839


>gnl|CDD|143463 cd07145, ALDH_LactADH_F420-Bios, Methanocaldococcus jannaschii
           NAD+-dependent lactaldehyde dehydrogenase-like.
           NAD+-dependent, lactaldehyde dehydrogenase (EC=1.2.1.22)
           involved the biosynthesis of coenzyme F(420) in
           Methanocaldococcus jannaschii through the oxidation of
           lactaldehyde to lactate and generation of NAPH, and
           similar sequences are included in this CD.
          Length = 456

 Score = 77.4 bits (191), Expect = 1e-15
 Identities = 59/281 (20%), Positives = 104/281 (37%), Gaps = 73/281 (25%)

Query: 52  IPSINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDALRHKL 111
           I   NP+ GE I +V   + ++    IE +  A    + LPA +R +I+ ++ + +  + 
Sbjct: 1   IEVRNPANGEVIDTVPSLSREEVREAIEVAEKAKDVMSNLPAYKRYKILMKVAELIERRK 60

Query: 112 VPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSILPSE----RPGHVLLEN 167
             L +L+++E+GK + +   EV+  I +   A   ++   G  +P +        +    
Sbjct: 61  EELAKLLTIEVGKPIKQSRVEVERTIRLFKLAAEEAKVLRGETIPVDAYEYNERRIAFTV 120

Query: 168 WNPLGVVGIISAFNFPVAVYGWNAAIAL-VGVAV-----------QSRFGKLLLELG--- 212
             P+GVVG I+ FNFP  ++    A A+ VG +V                K+L E G   
Sbjct: 121 REPIGVVGAITPFNFPANLFAHKIAPAIAVGNSVVVKPSSNTPLTAIELAKILEEAGLPP 180

Query: 213 -------GNNAIIVNE-------------------------------------------- 221
                  G  + + +E                                            
Sbjct: 181 GVINVVTGYGSEVGDEIVTNPKVNMISFTGSTAVGLLIASKAGGTGKKVALELGGSDPMI 240

Query: 222 ---DADLNLAQQCLVFACCGTAGQRCTTTRRLFLHKKKIDR 259
              DADL  A    V      AGQ C   +R+ + ++  D+
Sbjct: 241 VLKDADLERAVSIAVRGRFENAGQVCNAVKRILVEEEVYDK 281



 Score = 68.9 bits (169), Expect = 9e-13
 Identities = 28/113 (24%), Positives = 44/113 (38%), Gaps = 20/113 (17%)

Query: 260 PGYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDV 319
            G F  PT++     +  V+  E F P++ +      +EA+   N  + GL +S+FT D+
Sbjct: 339 EGSFFPPTVLENDTPDMIVMKEEVFGPVLPIAKVKDDEEAVEIANSTEYGLQASVFTNDI 398

Query: 320 TNLFKWLGPQGSDCGLINVNIPTNGAEIGG---------AFGGEKYTGGGRES 363
               K           +   +   G  I            FGG K +G GRE 
Sbjct: 399 NRALK-----------VARELEAGGVVINDSTRFRWDNLPFGGFKKSGIGREG 440


>gnl|CDD|226683 COG4230, COG4230, Delta 1-pyrroline-5-carboxylate dehydrogenase
           [Energy production and conversion].
          Length = 769

 Score = 77.8 bits (192), Expect = 2e-15
 Identities = 47/151 (31%), Positives = 72/151 (47%), Gaps = 13/151 (8%)

Query: 47  ANGEIIPSINPST-GETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGD 105
           A GE  P INP+   + + +V +    D  + +E++ AA   W+A P   R  I+ +  D
Sbjct: 124 AGGEPRPVINPADPDDIVGTVTEATEADVEQALEAAVAAAPIWSATPPAERAAILERAAD 183

Query: 106 ALRHKLVPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSILPSERPGHVLL 165
            +  ++  L  L+  E GK L+  I EV+E +D   Y  G +R   G+        H   
Sbjct: 184 LMEAQMPQLMGLLVREAGKTLSNAIAEVREAVDFLRYYAGQARDTFGN------LTH--- 234

Query: 166 ENWNPLGVVGIISAFNFPVAVYGWNAAIALV 196
               PLG V  IS +NFP+A++    A AL 
Sbjct: 235 ---RPLGPVVCISPWNFPLAIFTGQIAAALA 262



 Score = 35.8 bits (83), Expect = 0.037
 Identities = 22/56 (39%), Positives = 32/56 (57%), Gaps = 6/56 (10%)

Query: 207 LLLELGGNNAIIVNEDADLNLAQQCL---VFACCGTAGQRCTTTRRLFLHKKKIDR 259
           L+ E GG NA+IV+  A   LA+Q +   + +   +AGQRC+  R L L +   DR
Sbjct: 345 LIAETGGQNAMIVDSSA---LAEQVVADVLASAFDSAGQRCSALRVLCLQEDVADR 397


>gnl|CDD|143462 cd07144, ALDH_ALD2-YMR170C, Saccharomyces cerevisiae aldehyde
           dehydrogenase 2 (YMR170c)-like.  NAD(P)+-dependent
           Saccharomyces cerevisiae aldehyde dehydrogenase 2
           (YMR170c, ALD5, EC=1.2.1.5) and other similar sequences,
           are present in this CD.
          Length = 484

 Score = 76.7 bits (189), Expect = 3e-15
 Identities = 42/120 (35%), Positives = 69/120 (57%), Gaps = 3/120 (2%)

Query: 261 GYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVT 320
           GYF+ PTI T +P + R+V  E F P+V +  F + +EAI   N+   GL++++FTKD+ 
Sbjct: 367 GYFIPPTIFTDVPQDMRIVKEEIFGPVVVISKFKTYEEAIKKANDTTYGLAAAVFTKDIR 426

Query: 321 NLFKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRESGSDSWKQYCRRSTVTIN 380
              +    +  + G++ +N  +N +++G  FGG K +G GRE G    + Y +   V IN
Sbjct: 427 RAHRVA--RELEAGMVWIN-SSNDSDVGVPFGGFKMSGIGRELGEYGLETYTQTKAVHIN 483



 Score = 74.8 bits (184), Expect = 1e-14
 Identities = 46/153 (30%), Positives = 81/153 (52%), Gaps = 7/153 (4%)

Query: 47  ANGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAY-AAWAALPAPRRGEIVRQIGD 105
           ++GE I ++NPSTGE IASV     +D  + ++++R A+ + W+ +    RGE++ ++ D
Sbjct: 20  SDGETIKTVNPSTGEVIASVYAAGEEDVDKAVKAARKAFESWWSKVTGEERGELLDKLAD 79

Query: 106 ALRHKLVPLGQLVSLEMGKIL-AEGIGEVQEFIDICDYAVGLSRTYSGSILP--SERPGH 162
            +      L  + +L+ GK   +  +G++ E I +  Y  G +    G  +P    +  +
Sbjct: 80  LVEKNRDLLAAIEALDSGKPYHSNALGDLDEIIAVIRYYAGWADKIQGKTIPTSPNKLAY 139

Query: 163 VLLENWNPLGVVGIISAFNFPVAVYGWNAAIAL 195
            L E   P GV G I  +N+P+A+  W  A AL
Sbjct: 140 TLHE---PYGVCGQIIPWNYPLAMAAWKLAPAL 169



 Score = 42.0 bits (99), Expect = 4e-04
 Identities = 36/112 (32%), Positives = 48/112 (42%), Gaps = 22/112 (19%)

Query: 170 PLGVVGIISAFNFPVAVYGWNAA-------IAL-----VGVAVQSRFGKLL----LELGG 213
           P GVV II  +    AV G   A       IA       G  V     + L    LE GG
Sbjct: 200 PPGVVNIIPGYG---AVAGSALAEHPDVDKIAFTGSTATGRLVMKAAAQNLKAVTLECGG 256

Query: 214 NNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLHKKKIDRPGYFVE 265
            +  +V EDADL+ A +         +GQ CT T R+++ +   D+   FVE
Sbjct: 257 KSPALVFEDADLDQAVKWAAAGIMYNSGQNCTATSRIYVQESIYDK---FVE 305


>gnl|CDD|143410 cd07091, ALDH_F1-2_Ald2-like, ALDH subfamily: ALDH families 1and 2,
           including 10-formyltetrahydrofolate dehydrogenase,
           NAD+-dependent retinal dehydrogenase 1 and related
           proteins.  ALDH subfamily which includes the
           NAD+-dependent retinal dehydrogenase 1 (RALDH 1, ALDH1,
           EC=1.2.1.36), also known as aldehyde dehydrogenase
           family 1 member A1 (ALDH1A1), in humans, a
           homotetrameric, cytosolic enzyme that catalyzes the
           oxidation of retinaldehyde to retinoic acid. Human
           ALDH1B1 and ALDH2 are also in this cluster; both are
           mitochrondrial homotetramers which play important roles
           in acetaldehyde oxidation; ALDH1B1 in response to UV
           light exposure and ALDH2 during ethanol metabolism.
           10-formyltetrahydrofolate dehydrogenase (FTHFDH,
           EC=1.5.1.6), also known as aldehyde dehydrogenase family
           1 member L1 (ALDH1L1), in humans, a multi-domain
           homotetramer with an N-terminal formyl transferase
           domain and a C-terminal ALDH domain. FTHFDH catalyzes an
           NADP+-dependent dehydrogenase reaction resulting in the
           conversion of 10-formyltetrahydrofolate to
           tetrahydrofolate and CO2. Also included in this
           subfamily is the Arabidosis aldehyde dehydrogenase
           family 2 members B4 and B7 (EC=1.2.1.3), which are
           mitochondrial, homotetramers that oxidize acetaldehyde
           and glycolaldehyde, as well as, the Arabidosis
           cytosolic, homotetramer ALDH2C4 (EC=1.2.1.3), an enzyme
           involved in the oxidation of sinapalehyde and
           coniferaldehyde. Also included is the AldA aldehyde
           dehydrogenase  of Aspergillus nidulans (locus AN0554),
           the aldehyde dehydrogenase 2 (YMR170c, ALD5, EC=1.2.1.5)
           of Saccharomyces cerevisiae, and other similar
           sequences.
          Length = 476

 Score = 76.5 bits (189), Expect = 4e-15
 Identities = 45/155 (29%), Positives = 72/155 (46%), Gaps = 8/155 (5%)

Query: 47  ANGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAY--AAWAALPAPRRGEIVRQIG 104
            +G+  P+INP+T E I  V + + +D    ++++RAA+    W  +    RG ++ ++ 
Sbjct: 16  VSGKTFPTINPATEEVICQVAEADEEDVDAAVKAARAAFETGWWRKMDPRERGRLLNKLA 75

Query: 105 DALRHKLVPLGQLVSLEMGKILAEGI-GEVQEFIDICDYAVGLSRTYSGSILPSERPGHV 163
           D +      L  L SL+ GK L E   G+V   I    Y  G +    G  +P +     
Sbjct: 76  DLIERDRDELAALESLDNGKPLEESAKGDVALSIKCLRYYAGWADKIQGKTIPIDGNFLA 135

Query: 164 L--LENWNPLGVVGIISAFNFPVAVYGWNAAIALV 196
               E   P+GV G I  +NFP+ +  W  A AL 
Sbjct: 136 YTRRE---PIGVCGQIIPWNFPLLMLAWKLAPALA 167



 Score = 73.8 bits (182), Expect = 3e-14
 Identities = 37/141 (26%), Positives = 65/141 (46%), Gaps = 23/141 (16%)

Query: 249 RLFLHKKKIDRPGYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQ 308
            L    ++    GYF++PT+ T +  + ++   E F P+V +  F + DE I   N+ + 
Sbjct: 349 TLLTGGERHGSKGYFIQPTVFTDVKDDMKIAKEEIFGPVVTILKFKTEDEVIERANDTEY 408

Query: 309 GLSSSIFTKDVTNLFK----------WLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTG 358
           GL++ +FTKD+    +          W+        + +  +P         FGG K +G
Sbjct: 409 GLAAGVFTKDINKALRVSRALKAGTVWVNTY----NVFDAAVP---------FGGFKQSG 455

Query: 359 GGRESGSDSWKQYCRRSTVTI 379
            GRE G +  ++Y +   VTI
Sbjct: 456 FGRELGEEGLEEYTQVKAVTI 476


>gnl|CDD|200087 TIGR01237, D1pyr5carbox2, delta-1-pyrroline-5-carboxylate
           dehydrogenase, group 2, putative.  This enzyme is the
           second of two in the degradation of proline to
           glutamate. This model represents one of several related
           branches of delta-1-pyrroline-5-carboxylate
           dehydrogenase. Members of this branch may be associated
           with proline dehydrogenase (the other enzyme of the
           pathway from proline to glutamate) but have not been
           demonstrated experimentally. The branches are not as
           closely related to each other as some distinct aldehyde
           dehydrogenases are to some; separate models were built
           to let each model describe a set of equivalogs [Energy
           metabolism, Amino acids and amines].
          Length = 511

 Score = 76.4 bits (188), Expect = 4e-15
 Identities = 40/165 (24%), Positives = 68/165 (41%), Gaps = 2/165 (1%)

Query: 33  GSVNPGVYDGTSWKANGEIIPSINPS-TGETIASVQQGNVQDYHRCIESSRAAYAAWAAL 91
           G   P V +G   +   +I+ SINP    E + +V + + +     ++++  A+ AW   
Sbjct: 30  GKTYPLVINGERVETENKIV-SINPCDKSEVVGTVSKASQEHAEHALQAAAKAFEAWKKT 88

Query: 92  PAPRRGEIVRQIGDALRHKLVPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYS 151
               R  I+ +    +R +      L+  E+GK   E   EV E ID  +Y        +
Sbjct: 89  DPEERAAILFKAAAIVRRRRHEFSALLVKEVGKPWNEADAEVAEAIDFMEYYARQMIELA 148

Query: 152 GSILPSERPGHVLLENWNPLGVVGIISAFNFPVAVYGWNAAIALV 196
                + R G      + P GV  +IS +NFP A+        +V
Sbjct: 149 KGKPVNSREGETNQYVYTPTGVTVVISPWNFPFAIMVGMTVAPIV 193



 Score = 59.9 bits (145), Expect = 1e-09
 Identities = 51/238 (21%), Positives = 85/238 (35%), Gaps = 62/238 (26%)

Query: 201 QSRFGKLLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLHKK----- 255
           Q    +++ E+GG + +IV+EDAD+ LA Q    +  G AGQ+C+   R  +H+K     
Sbjct: 273 QKHLKRVIAEMGGKDTVIVDEDADIELAAQSAFTSAFGFAGQKCSAGSRAVVHEKVYDEV 332

Query: 256 --------------KIDRPGYFVEPTI--------------------------------- 268
                           D    +V P I                                 
Sbjct: 333 VERFVEITESLKVGPPDSADVYVGPVIDQKSFNKIMEYIEIGKAEGRLVSGGCGDDSKGY 392

Query: 269 ------VTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVTNL 322
                    +   +R+   E F P+V        DEA+   N  + GL+  + + +  ++
Sbjct: 393 FIGPTIFADVDRKARLAQEEIFGPVVAFIRASDFDEALEIANNTEYGLTGGVISNNRDHI 452

Query: 323 FKWLGPQGSDCGLINVNIPTNGAEIG-GAFGGEKYTG-GGRESGSDSWKQYCRRSTVT 378
            +       + G +  N    GA +G   FGG K +G   +  G D    + +  TVT
Sbjct: 453 NR--AKAEFEVGNLYFNRNITGAIVGYQPFGGFKMSGTDSKAGGPDYLALFMQAKTVT 508


>gnl|CDD|131352 TIGR02299, HpaE, 5-carboxymethyl-2-hydroxymuconate semialdehyde
           dehydrogenase.  This model represents the dehydrogenase
           responsible for the conversion of
           5-carboxymethyl-2-hydroxymuconate semialdehyde to
           5-carboxymethyl-2-hydroxymuconate (a tricarboxylic
           acid). This is the step in the degradation of
           4-hydroxyphenylacetic acid via homoprotocatechuate
           following the oxidative opening of the aromatic ring.
          Length = 488

 Score = 76.0 bits (187), Expect = 5e-15
 Identities = 38/154 (24%), Positives = 65/154 (42%), Gaps = 15/154 (9%)

Query: 49  GEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDALR 108
           GE   +++P+T E + SV +G   D  R  ++++ A+  WA L A  R   + +I D + 
Sbjct: 15  GETFETLSPATNEVLGSVARGGAADVDRAAKAAKEAFKRWAELKAAERKRYLHKIADLIE 74

Query: 109 HKLVPLGQLVSLE-------MGKILAEGIGEVQEFIDICDYAVGLSRTYSGSILPSERPG 161
                +  L  L+         + +       + F D C+ A+       G   P     
Sbjct: 75  QHADEIAVLECLDCGQPLRQTRQQVIRAAENFRFFADKCEEAM------DGRTYPV--DT 126

Query: 162 HVLLENWNPLGVVGIISAFNFPVAVYGWNAAIAL 195
           H+      P+G VG+I+ +N P  +  W  A AL
Sbjct: 127 HLNYTVRVPVGPVGLITPWNAPFMLSTWKIAPAL 160



 Score = 73.7 bits (181), Expect = 3e-14
 Identities = 40/124 (32%), Positives = 61/124 (49%), Gaps = 3/124 (2%)

Query: 259 RPGYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKD 318
             G +V PT+ TG  ++ R+   E F P++ V PF   +EAI   N+ + GL+  ++T D
Sbjct: 359 GRGNYVLPTVFTGADNHMRIAQEEIFGPVLTVIPFKDEEEAIEKANDTRYGLAGYVWTND 418

Query: 319 VTNLFKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRESGSDSWKQYCRRSTVT 378
           V    +       + G+I VN   N   +   FGG K +G GRE G+ S+  Y     V 
Sbjct: 419 VGRAHRV--ALALEAGMIWVN-SQNVRHLPTPFGGVKASGIGREGGTYSFDFYTETKNVA 475

Query: 379 INHG 382
           +  G
Sbjct: 476 LALG 479


>gnl|CDD|143419 cd07101, ALDH_SSADH2_GabD2, Mycobacterium tuberculosis
           succinate-semialdehyde dehydrogenase 2-like.
           Succinate-semialdehyde dehydrogenase 2 (SSADH2) and
           similar proteins are in this CD. SSADH1 (GabD1,
           EC=1.2.1.16) catalyzes the NADP(+)-dependent oxidation
           of succinate semialdehyde to succinate.  SSADH activity
           in Mycobacterium tuberculosis is encoded by both gabD1
           (Rv0234c) and gabD2 (Rv1731), however ,the Vmax of GabD1
           was shown to be much higher than that of GabD2, and
           GabD2 (SSADH2) is likely to serve physiologically as a
           dehydrogenase for a different aldehyde(s).
          Length = 454

 Score = 75.0 bits (185), Expect = 1e-14
 Identities = 45/156 (28%), Positives = 67/156 (42%), Gaps = 17/156 (10%)

Query: 55  INPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDALRHKLVPL 114
             P TGE +  + Q    D       +RAA  AWAA P   R  +  +  D +  +   L
Sbjct: 1   EAPFTGEPLGELPQSTPADVEAAFARARAAQRAWAARPFAERAAVFLRFHDLVLERRDEL 60

Query: 115 GQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYS---GSILPSERPGHVL------L 165
             L+ LE GK           F ++ D A+  +R Y+     +L   R    +       
Sbjct: 61  LDLIQLETGKARR------HAFEEVLDVAIV-ARYYARRAERLLKPRRRRGAIPVLTRTT 113

Query: 166 ENWNPLGVVGIISAFNFPVAVYGWNAAIALV-GVAV 200
            N  P GVVG+IS +N+P+ +   +A  AL+ G AV
Sbjct: 114 VNRRPKGVVGVISPWNYPLTLAVSDAIPALLAGNAV 149



 Score = 68.5 bits (168), Expect = 1e-12
 Identities = 37/120 (30%), Positives = 60/120 (50%), Gaps = 6/120 (5%)

Query: 262 YFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVTN 321
           YF EPT++TG+  +  +   ETF P+V ++     DEAI   N+   GL++S++T+D   
Sbjct: 337 YFYEPTVLTGVTEDMELFAEETFGPVVSIYRVADDDEAIELANDTDYGLNASVWTRDGAR 396

Query: 322 LFKWLGPQGSDCGLINVN---IPTNGAEIGGAFGGEKYTGGGRESGSDSWKQYCRRSTVT 378
             + +  +    G +NVN        + I    GG K +G GR  G++   +Y    TV 
Sbjct: 397 GRR-IAAR-LRAGTVNVNEGYAAAWAS-IDAPMGGMKDSGLGRRHGAEGLLKYTETQTVA 453



 Score = 56.9 bits (138), Expect = 8e-09
 Identities = 26/57 (45%), Positives = 33/57 (57%), Gaps = 3/57 (5%)

Query: 209 LELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLHKKKIDRPGYFVE 265
           LELGG N +IV EDADL+ A    V AC   AGQ C +  R+++H+   D    FV 
Sbjct: 224 LELGGKNPMIVLEDADLDKAAAGAVRACFSNAGQLCVSIERIYVHESVYDE---FVR 277


>gnl|CDD|143401 cd07082, ALDH_F11_NP-GAPDH, NADP+-dependent non-phosphorylating
           glyceraldehyde 3-phosphate dehydrogenase and ALDH family
           11.  NADP+-dependent non-phosphorylating glyceraldehyde
           3-phosphate dehydrogenase (NP-GAPDH, EC=1.2.1.9)
           catalyzes the irreversible oxidation of glyceraldehyde
           3-phosphate to 3-phosphoglycerate generating NADPH for
           biosynthetic reactions.  This CD also includes the
           Arabidopsis thaliana osmotic-stress-inducible ALDH
           family 11, ALDH11A3  and similar sequences. In
           autotrophic eukaryotes, NP-GAPDH generates NADPH for
           biosynthetic processes from photosynthetic
           glyceraldehyde-3-phosphate exported from the chloroplast
           and catalyzes one of the classic glycolytic bypass
           reactions unique to plants.
          Length = 473

 Score = 74.9 bits (185), Expect = 1e-14
 Identities = 60/244 (24%), Positives = 88/244 (36%), Gaps = 85/244 (34%)

Query: 207 LLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLHKKKIDRPGYFVE- 265
           L+LELGG +  IV  DADL LA + +V      +GQRCT  +R+ +H+   D     VE 
Sbjct: 245 LVLELGGKDPAIVLPDADLELAAKEIVKGALSYSGQRCTAIKRVLVHESVADE---LVEL 301

Query: 266 ----------------PTIVTGLP-----------------HNSRVVH------------ 280
                              +T L                    + V++            
Sbjct: 302 LKEEVAKLKVGMPWDNGVDITPLIDPKSADFVEGLIDDAVAKGATVLNGGGREGGNLIYP 361

Query: 281 --------------RETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVTNLFKWL 326
                          E F P++ +   + ++EAI   N+   GL +SIFTKD+    K  
Sbjct: 362 TLLDPVTPDMRLAWEEPFGPVLPIIRVNDIEEAIELANKSNYGLQASIFTKDINKARK-- 419

Query: 327 GPQGSDCGLIN------VNIPTNGAEIGG----AFGGEKYTGGGRESGSDSWKQYCRRST 376
                   L +      VNI  N     G     F G K +G G +   D+ +   RR  
Sbjct: 420 --------LADALEVGTVNI--NSKCQRGPDHFPFLGRKDSGIGTQGIGDALRSMTRRKG 469

Query: 377 VTIN 380
           + IN
Sbjct: 470 IVIN 473



 Score = 60.7 bits (148), Expect = 4e-10
 Identities = 37/155 (23%), Positives = 63/155 (40%), Gaps = 5/155 (3%)

Query: 35  VNPGVYDGTSWKANGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAA-LPA 93
               + +G   +++G+ I   +P  GE I SV   +  +     E++  A   W   +P 
Sbjct: 1   QFKYLINGEWKESSGKTIEVYSPIDGEVIGSVPALSALEILEAAETAYDAGRGWWPTMPL 60

Query: 94  PRRGEIVRQIGDALRHKLVPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGS 153
             R + + +  D L+     +  L+  E+GK L + + EV   ID     +   +   G 
Sbjct: 61  EERIDCLHKFADLLKENKEEVANLLMWEIGKTLKDALKEVDRTIDYIRDTIEELKRLDGD 120

Query: 154 ILPSERPGHVL-LENW---NPLGVVGIISAFNFPV 184
            LP +               PLGVV  I  FN+P+
Sbjct: 121 SLPGDWFPGTKGKIAQVRREPLGVVLAIGPFNYPL 155


>gnl|CDD|132417 TIGR03374, ABALDH, 1-pyrroline dehydrogenase.  Members of this
           protein family are 1-pyrroline dehydrogenase (1.5.1.35),
           also called gamma-aminobutyraldehyde dehydrogenase. This
           enzyme can follow putrescine transaminase (EC 2.6.1.82)
           for a two-step conversion of putrescine to
           gamma-aminobutyric acid (GABA). The member from
           Escherichia coli is characterized as a homotetramer that
           binds one NADH per momomer. This enzyme belongs to the
           medium-chain aldehyde dehydrogenases, and is quite
           similar in sequence to the betaine aldehyde
           dehydrogenase (EC 1.2.1.8) family.
          Length = 472

 Score = 74.7 bits (183), Expect = 2e-14
 Identities = 41/149 (27%), Positives = 67/149 (44%), Gaps = 1/149 (0%)

Query: 48  NGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDAL 107
            GE  P  NP+TGE I  + + + +     + ++ AA+A W       R E + ++ D +
Sbjct: 14  EGEKQPVYNPATGEVILEIAEASAEQVDAAVRAADAAFAEWGQTTPKARAECLLKLADVI 73

Query: 108 RHKLVPLGQLVSLEMGKILAEGIG-EVQEFIDICDYAVGLSRTYSGSILPSERPGHVLLE 166
                   +L S   GK L      E+   +D+  +  G +R  SG        GH  + 
Sbjct: 74  EENAQVFAELESRNCGKPLHSVFNDEIPAIVDVFRFFAGAARCLSGLAAGEYLEGHTSMI 133

Query: 167 NWNPLGVVGIISAFNFPVAVYGWNAAIAL 195
             +PLGVV  I+ +N+P+ +  W  A AL
Sbjct: 134 RRDPLGVVASIAPWNYPLMMAAWKLAPAL 162



 Score = 61.6 bits (149), Expect = 2e-10
 Identities = 31/109 (28%), Positives = 54/109 (49%), Gaps = 12/109 (11%)

Query: 261 GYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVT 320
           GY+  PT++ G   +  +V +E F P+V +  FD  ++ + W N+ + GL+SS++TKDV 
Sbjct: 356 GYYFAPTLLAGAKQDDAIVQKEVFGPVVSITSFDDEEQVVNWANDSQYGLASSVWTKDVG 415

Query: 321 NLFKWLGPQGSDCGLINVN------IPTNGAEIGG------AFGGEKYT 357
              +        C  +N +      +P  G ++ G       +G E YT
Sbjct: 416 RAHRLSARLQYGCTWVNTHFMLVSEMPHGGQKLSGYGKDMSLYGLEDYT 464


>gnl|CDD|143436 cd07118, ALDH_SNDH, Gluconobacter oxydans L-sorbosone
           dehydrogenase-like.  Included in this CD is the
           L-sorbosone dehydrogenase (SNDH) from Gluconobacter
           oxydans UV10. In G. oxydans,  D-sorbitol is converted to
           2-keto-L-gulonate (a precursor of L-ascorbic acid) in
           sequential oxidation steps catalyzed by a FAD-dependent,
           L-sorbose dehydrogenase and an NAD(P)+-dependent,
           L-sorbosone dehydrogenase.
          Length = 454

 Score = 73.9 bits (182), Expect = 3e-14
 Identities = 41/134 (30%), Positives = 71/134 (52%), Gaps = 8/134 (5%)

Query: 55  INPSTGETIASVQQGNVQDYHRCIESSRAAYAA--WAALPAPRRGEIVRQIGDALRHKLV 112
            +P+ G  +A   +G V+D    + ++R A+    W  +    R  ++ ++ D +R +  
Sbjct: 2   RSPAHGVVVARYAEGTVEDVDAAVAAARKAFDKGPWPRMSGAERAAVLLKVADLIRARRE 61

Query: 113 PLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSG---SILPSERPGHVLLENWN 169
            L  + +LE GK +++  GE++   D+  YA  L+RT  G   + L  +  G VL E   
Sbjct: 62  RLALIETLESGKPISQARGEIEGAADLWRYAASLARTLHGDSYNNLGDDMLGLVLRE--- 118

Query: 170 PLGVVGIISAFNFP 183
           P+GVVGII+ +NFP
Sbjct: 119 PIGVVGIITPWNFP 132



 Score = 71.2 bits (175), Expect = 2e-13
 Identities = 38/122 (31%), Positives = 59/122 (48%), Gaps = 5/122 (4%)

Query: 259 RPGYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKD 318
             G F +PTI T +  +  +   E F P++ V  FD++DEAI   N+   GLS+ +++KD
Sbjct: 337 AAGLFYQPTIFTDVTPDMAIAREEIFGPVLSVLTFDTVDEAIALANDTVYGLSAGVWSKD 396

Query: 319 VTNLFKWLGPQGSDCGLINVN-IPTNGAEIGGAFGGEKYTGGGRESGSDSWKQYCRRSTV 377
           +         +    G + VN       E+   FGG K +G GRE G    ++Y    TV
Sbjct: 397 IDTAL--TVARRIRAGTVWVNTFLDGSPEL--PFGGFKQSGIGRELGRYGVEEYTELKTV 452

Query: 378 TI 379
            +
Sbjct: 453 HL 454



 Score = 46.6 bits (111), Expect = 1e-05
 Identities = 22/49 (44%), Positives = 29/49 (59%)

Query: 206 KLLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLHK 254
           K+ LELGG N  IV  DADL+ A   +VF     AG+ C +  RL +H+
Sbjct: 224 KVSLELGGKNPQIVFADADLDAAADAVVFGVYFNAGECCNSGSRLLVHE 272


>gnl|CDD|143458 cd07140, ALDH_F1L_FTFDH, 10-formyltetrahydrofolate dehydrogenase,
           ALDH family 1L.  10-formyltetrahydrofolate dehydrogenase
           (FTHFDH, EC=1.5.1.6), also known as aldehyde
           dehydrogenase family 1 member L1 (ALDH1L1) in humans, is
           a multi-domain homotetramer with an N-terminal formyl
           transferase domain and a C-terminal ALDH domain. FTHFDH
           catalyzes an NADP+-dependent dehydrogenase reaction
           resulting in the conversion of 10-formyltetrahydrofolate
           to tetrahydrofolate and CO2. The ALDH domain is also
           capable of the oxidation of short chain aldehydes to
           their corresponding acids.
          Length = 486

 Score = 73.7 bits (181), Expect = 3e-14
 Identities = 57/244 (23%), Positives = 96/244 (39%), Gaps = 64/244 (26%)

Query: 199 AVQSRFGKLLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFL------ 252
              S   K+ LELGG + +I+  D D++ A +  + +     G+ C    RLF+      
Sbjct: 246 CAVSNLKKVSLELGGKSPLIIFADCDMDKAVRMGMSSVFFNKGENCIAAGRLFVEESIHD 305

Query: 253 -----------------------------HKKKID------------------------R 259
                                        HK  +D                        R
Sbjct: 306 EFVRRVVEEVKKMKIGDPLDRSTDHGPQNHKAHLDKLVEYCERGVKEGATLVYGGKQVDR 365

Query: 260 PGYFVEPTIVTGLPHNSRVVHRETFAPIVYV--FPFDSLDEAITWNNEVKQGLSSSIFTK 317
           PG+F EPT+ T +  +  +   E+F PI+ +  F    +D  +   N+ + GL+S +FTK
Sbjct: 366 PGFFFEPTVFTDVEDHMFIAKEESFGPIMIISKFDDGDVDGVLQRANDTEYGLASGVFTK 425

Query: 318 DVTNLFKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRESGSDSWKQYCRRSTV 377
           D+            + G + VN   N  ++   FGG K +G G++ G ++  +Y +  TV
Sbjct: 426 DINKALYV--SDKLEAGTVFVNT-YNKTDVAAPFGGFKQSGFGKDLGEEALNEYLKTKTV 482

Query: 378 TINH 381
           TI +
Sbjct: 483 TIEY 486



 Score = 57.9 bits (140), Expect = 4e-09
 Identities = 40/155 (25%), Positives = 66/155 (42%), Gaps = 6/155 (3%)

Query: 47  ANGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAY--AAWAALPAPRRGEIVRQIG 104
             G+   +INP+ G  I  V    V+D  R + +++ A+    W  + A  RG ++ ++ 
Sbjct: 18  EGGKTYNTINPTDGSVICKVSLATVEDVDRAVAAAKEAFENGEWGKMNARDRGRLMYRLA 77

Query: 105 DALRHKLVPLGQLVSLEMGKILAEGIG-EVQEFIDICDYAVGLSRTYSGSILP--SERPG 161
           D +      L  + SL+ G +    +   V   I    Y  G      G  +P    RP 
Sbjct: 78  DLMEEHQEELATIESLDSGAVYTLALKTHVGMSIQTFRYFAGWCDKIQGKTIPINQARPN 137

Query: 162 HVL-LENWNPLGVVGIISAFNFPVAVYGWNAAIAL 195
             L L    P+GV GI+  +N+P+ +  W  A  L
Sbjct: 138 RNLTLTKREPIGVCGIVIPWNYPLMMLAWKMAACL 172


>gnl|CDD|143457 cd07139, ALDH_AldA-Rv0768, Mycobacterium tuberculosis aldehyde
           dehydrogenase  AldA-like.  The Mycobacterium
           tuberculosis NAD+-dependent, aldehyde dehydrogenase  PDB
           structure,  3B4W, and the Mycobacterium tuberculosis
           H37Rv aldehyde dehydrogenase  AldA (locus Rv0768)
           sequence, as well as the Rhodococcus rhodochrous ALDH
           involved in haloalkane catabolism, and other similar
           sequences, are included in this CD.
          Length = 471

 Score = 73.4 bits (181), Expect = 4e-14
 Identities = 42/168 (25%), Positives = 67/168 (39%), Gaps = 17/168 (10%)

Query: 40  YDGTSWKA--NGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAYAA--WAALPAPR 95
           + G  W A    E I  ++P+T E +  V +    D    + ++R A+    W  L    
Sbjct: 2   FIGGRWVAPSGSETIDVVSPATEEVVGRVPEATPADVDAAVAAARRAFDNGPWPRLSPAE 61

Query: 96  RGEIVRQIGDALRHKLVPLGQLVSLEMGK-ILAEGIGEVQEFIDICDYAVGLSRTYSGSI 154
           R  ++R++ DAL  +   L +L + E G  I      +      +  Y   L+R +    
Sbjct: 62  RAAVLRRLADALEARADELARLWTAENGMPISWSRRAQGPGPAALLRYYAALARDFP--- 118

Query: 155 LPSERPG------HVLLENWNPLGVVGIISAFNFPVAVYGWNAAIALV 196
               RPG       V  E   P+GVV  I  +N P+ +     A AL 
Sbjct: 119 FEERRPGSGGGHVLVRRE---PVGVVAAIVPWNAPLFLAALKIAPALA 163



 Score = 71.1 bits (175), Expect = 2e-13
 Identities = 37/110 (33%), Positives = 55/110 (50%), Gaps = 16/110 (14%)

Query: 261 GYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVT 320
           G+FVEPT+   + ++ R+   E F P++ V P+D  D+A+   N+   GLS S++T DV 
Sbjct: 357 GWFVEPTLFADVDNDMRIAQEEIFGPVLSVIPYDDEDDAVRIANDSDYGLSGSVWTADV- 415

Query: 321 NLFKWLGPQGSD------CGLINVNIPTNGAEIGGAFGGEKYTGGGRESG 364
                   +G         G + VN      + G  FGG K +G GRE G
Sbjct: 416 -------ERGLAVARRIRTGTVGVNGFR--LDFGAPFGGFKQSGIGREGG 456



 Score = 45.3 bits (108), Expect = 4e-05
 Identities = 25/67 (37%), Positives = 33/67 (49%)

Query: 184 VAVYGWNAAIALVGVAVQSRFGKLLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQR 243
           V+  G  AA   +      R  ++ LELGG +A IV +DADL+ A   LV A     GQ 
Sbjct: 219 VSFTGSTAAGRRIAAVCGERLARVTLELGGKSAAIVLDDADLDAAVPGLVPASLMNNGQV 278

Query: 244 CTTTRRL 250
           C    R+
Sbjct: 279 CVALTRI 285


>gnl|CDD|236501 PRK09407, gabD2, succinic semialdehyde dehydrogenase; Reviewed.
          Length = 524

 Score = 73.4 bits (181), Expect = 4e-14
 Identities = 44/143 (30%), Positives = 61/143 (42%), Gaps = 16/143 (11%)

Query: 52  IPSINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDALRHKL 111
                P TGE +A+V      D       +RAA  AWAA P   R  ++ +  D +    
Sbjct: 34  REVTAPFTGEPLATVPVSTAADVEAAFARARAAQRAWAATPVRERAAVLLRFHDLVLENR 93

Query: 112 VPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTY---SGSIL-PSERPGHVLL-- 165
             L  LV LE GK       EV       D A+  +R Y   +  +L P  R G + +  
Sbjct: 94  EELLDLVQLETGKARRHAFEEV------LDVALT-ARYYARRAPKLLAPRRRAGALPVLT 146

Query: 166 ---ENWNPLGVVGIISAFNFPVA 185
              E   P GVVG+IS +N+P+ 
Sbjct: 147 KTTELRQPKGVVGVISPWNYPLT 169



 Score = 68.4 bits (168), Expect = 2e-12
 Identities = 36/120 (30%), Positives = 60/120 (50%), Gaps = 6/120 (5%)

Query: 262 YFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVTN 321
            F EPT++TG+  +  +   ETF P+V V+P   +DEA+   N+   GL++S++T D   
Sbjct: 373 LFYEPTVLTGVTPDMELAREETFGPVVSVYPVADVDEAVERANDTPYGLNASVWTGDTAR 432

Query: 322 LFKWLGPQGSDCGLINVN---IPTNGAEIGGAFGGEKYTGGGRESGSDSWKQYCRRSTVT 378
               +  +    G +NVN       G+ +    GG K +G GR  G++   +Y    T+ 
Sbjct: 433 GRA-IAAR-IRAGTVNVNEGYAAAWGS-VDAPMGGMKDSGLGRRHGAEGLLKYTESQTIA 489



 Score = 57.6 bits (140), Expect = 4e-09
 Identities = 25/57 (43%), Positives = 33/57 (57%), Gaps = 3/57 (5%)

Query: 209 LELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLHKKKIDRPGYFVE 265
           LELGG N +IV +DADL+ A    V AC   AGQ C +  R+++H+   D    FV 
Sbjct: 260 LELGGKNPMIVLDDADLDKAAAGAVRACFSNAGQLCISIERIYVHESIYDE---FVR 313


>gnl|CDD|143456 cd07138, ALDH_CddD_SSP0762, Rhodococcus ruber 6-oxolauric acid
           dehydrogenase-like.  The 6-oxolauric acid dehydrogenase
           (CddD) from Rhodococcus ruber SC1 which converts
           6-oxolauric acid to dodecanedioic acid, and the aldehyde
           dehydrogenase (locus SSP0762) from Staphylococcus
           saprophyticus subsp. saprophyticus ATCC 15305 and other
           similar sequences, are included in this CD.
          Length = 466

 Score = 72.9 bits (180), Expect = 6e-14
 Identities = 43/140 (30%), Positives = 68/140 (48%), Gaps = 8/140 (5%)

Query: 47  ANGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDA 106
           A  E I  INP+T E I +V  G   D  R + ++R A+ AW+A     R  ++ +I +A
Sbjct: 11  AGTETIDVINPATEEVIGTVPLGTAADVDRAVAAARRAFPAWSATSVEERAALLERIAEA 70

Query: 107 LRHKLVPLGQLVSLEMG--KILAEGIGEVQEFIDICDYAVGLSRTYSGSILPSERPGHVL 164
              +   L Q ++LEMG    LA    +V   I     A    + +       ER G+ L
Sbjct: 71  YEARADELAQAITLEMGAPITLARAA-QVGLGIGHLRAAADALKDFEF----EERRGNSL 125

Query: 165 LENWNPLGVVGIISAFNFPV 184
           +    P+GV G+I+ +N+P+
Sbjct: 126 VV-REPIGVCGLITPWNWPL 144



 Score = 55.6 bits (135), Expect = 2e-08
 Identities = 32/110 (29%), Positives = 51/110 (46%), Gaps = 16/110 (14%)

Query: 261 GYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKD-- 318
           GYFV+PT+   +  +  +   E F P++ + P+D  DEAI   N+   GL+  +++ D  
Sbjct: 352 GYFVKPTVFADVTPDMTIAREEIFGPVLSIIPYDDEDEAIAIANDTPYGLAGYVWSADPE 411

Query: 319 ----VTNLFKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRESG 364
               V    +         G +++N        G  FGG K +G GRE G
Sbjct: 412 RARAVARRLR--------AGQVHIN--GAAFNPGAPFGGYKQSGNGREWG 451



 Score = 35.2 bits (82), Expect = 0.049
 Identities = 19/51 (37%), Positives = 27/51 (52%)

Query: 209 LELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLHKKKIDR 259
           LELGG +A I+ +DADL  A    V AC   +GQ C    R+ + + +   
Sbjct: 238 LELGGKSANIILDDADLEKAVPRGVAACFANSGQSCNAPTRMLVPRSRYAE 288


>gnl|CDD|143461 cd07143, ALDH_AldA_AN0554, Aspergillus nidulans aldehyde
           dehydrogenase, AldA (AN0554)-like.  NAD(P)+-dependent
           aldehyde dehydrogenase (AldA) of Aspergillus nidulans
           (locus AN0554), and other similar sequences, are present
           in this CD.
          Length = 481

 Score = 72.2 bits (177), Expect = 8e-14
 Identities = 41/120 (34%), Positives = 62/120 (51%), Gaps = 3/120 (2%)

Query: 261 GYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVT 320
           GYF+EPTI T +  + ++V  E F P+V V  F + +EAI   N+   GL++++FT ++ 
Sbjct: 364 GYFIEPTIFTDVTEDMKIVKEEIFGPVVAVIKFKTEEEAIKRANDSTYGLAAAVFTNNIN 423

Query: 321 NLFKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRESGSDSWKQYCRRSTVTIN 380
           N  +         G + VN   N       FGG K +G GRE G  + + Y +   V IN
Sbjct: 424 NAIR--VANALKAGTVWVNC-YNLLHHQVPFGGYKQSGIGRELGEYALENYTQIKAVHIN 480



 Score = 56.8 bits (137), Expect = 7e-09
 Identities = 40/169 (23%), Positives = 70/169 (41%), Gaps = 10/169 (5%)

Query: 34  SVNPGVYDGTSW--KANGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAAL 91
               G++    +    +G  +   NPSTG+ I  + +    D    +E + AA+     L
Sbjct: 4   EQPTGLFINGEFVDSVHGGTVKVYNPSTGKLITKIAEATEADVDIAVEVAHAAFETDWGL 63

Query: 92  --PAPRRGEIVRQIGDALRHKLVPLGQLVSLEMGK-ILAEGIGEVQEFIDICDYAVGLSR 148
                +RG  + ++ D +   L  L  + +L+ GK        +VQ   D   Y  G + 
Sbjct: 64  KVSGSKRGRCLSKLADLMERNLDYLASIEALDNGKTFGTAKRVDVQASADTFRYYGGWAD 123

Query: 149 TYSGSILP--SERPGHVLLENWNPLGVVGIISAFNFPVAVYGWNAAIAL 195
              G ++    ++  +   E   P+GV G I  +NFP+ +  W  A AL
Sbjct: 124 KIHGQVIETDIKKLTYTRHE---PIGVCGQIIPWNFPLLMCAWKIAPAL 169


>gnl|CDD|215260 PLN02467, PLN02467, betaine aldehyde dehydrogenase.
          Length = 503

 Score = 72.1 bits (177), Expect = 9e-14
 Identities = 63/245 (25%), Positives = 93/245 (37%), Gaps = 92/245 (37%)

Query: 209 LELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRL---------FLHK----- 254
           LELGG + IIV +D DL+ A +  +F C  T GQ C+ T RL         FL K     
Sbjct: 259 LELGGKSPIIVFDDVDLDKAVEWAMFGCFWTNGQICSATSRLLVHERIASEFLEKLVKWA 318

Query: 255 ------------------------KKI------------------DRP-----GYFVEPT 267
                                   +K+                   RP     G+F+EPT
Sbjct: 319 KNIKISDPLEEGCRLGPVVSEGQYEKVLKFISTAKSEGATILCGGKRPEHLKKGFFIEPT 378

Query: 268 IVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKD------VTN 321
           I+T +  + ++   E F P++ V  F + DEAI   N+   GL+ ++ + D      V+ 
Sbjct: 379 IITDVTTSMQIWREEVFGPVLCVKTFSTEDEAIELANDSHYGLAGAVISNDLERCERVSE 438

Query: 322 LFKWLGPQGSDCGLINVN--------IPTNGAEIGGAFGGEKYTGGGRESGSDSWKQYCR 373
            F          G++ +N         P         +GG K +G GRE G    + Y  
Sbjct: 439 AF--------QAGIVWINCSQPCFCQAP---------WGGIKRSGFGRELGEWGLENYLS 481

Query: 374 RSTVT 378
              VT
Sbjct: 482 VKQVT 486



 Score = 70.1 bits (172), Expect = 4e-13
 Identities = 48/171 (28%), Positives = 81/171 (47%), Gaps = 16/171 (9%)

Query: 38  GVYDGTSWKA--NGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAYAA-----WAA 90
            ++ G  W+    G+ IP +NP+T ETI  +     +D    +E++R A+       WA 
Sbjct: 9   QLFIGGEWREPVLGKRIPVVNPATEETIGDIPAATAEDVDAAVEAARKAFKRNKGKDWAR 68

Query: 91  LPAPRRGEIVRQIGDALRHKLVPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGL---- 146
                R + +R I   +  +   L +L +L+ GK L E   ++ +     +Y   L    
Sbjct: 69  TTGAVRAKYLRAIAAKITERKSELAKLETLDCGKPLDEAAWDMDDVAGCFEYYADLAEAL 128

Query: 147 -SRTYSGSILPSER-PGHVLLENWNPLGVVGIISAFNFPVAVYGWNAAIAL 195
            ++  +   LP E   G+VL     PLGVVG+I+ +N+P+ +  W  A AL
Sbjct: 129 DAKQKAPVSLPMETFKGYVL---KEPLGVVGLITPWNYPLLMATWKVAPAL 176


>gnl|CDD|143402 cd07083, ALDH_P5CDH, ALDH subfamily NAD+-dependent
           delta(1)-pyrroline-5-carboxylate dehydrogenase-like.
           ALDH subfamily of the NAD+-dependent,
           delta(1)-pyrroline-5-carboxylate dehydrogenases (P5CDH,
           EC=1.5.1.12). The proline catabolic enzymes, proline
           dehydrogenase and P5CDH catalyze the two-step oxidation
           of proline to glutamate.  P5CDH catalyzes the oxidation
           of glutamate semialdehyde, utilizing NAD+ as the
           electron acceptor. In some bacteria, the two enzymes are
           fused into the bifunctional flavoenzyme, proline
           utilization A (PutA). These enzymes play important roles
           in cellular redox control, superoxide generation, and
           apoptosis. In certain prokaryotes such as Escherichia
           coli, PutA is also a transcriptional repressor of the
           proline utilization genes. Monofunctional enzyme
           sequences such as those seen in the Bacillus RocA P5CDH
           are also present in this subfamily as well as the human
           ALDH4A1 P5CDH and the Drosophila Aldh17 P5CDH.
          Length = 500

 Score = 71.8 bits (176), Expect = 1e-13
 Identities = 43/164 (26%), Positives = 70/164 (42%), Gaps = 3/164 (1%)

Query: 40  YDGTSWKANGEIIPSINPS-TGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGE 98
             G  W    E + S++P    E + +  + +  +    +E++ AA+  W   P   R  
Sbjct: 22  VIGGEWVDTKERMVSVSPFAPSEVVGTTAKADKAEAEAALEAAWAAFKTWKDWPQEDRAR 81

Query: 99  IVRQIGDALRHKLVPLGQLVSLEMGKILAEGIGEVQEFIDICDY-AVGLSRTYSGSILPS 157
           ++ +  D LR +   L   ++ E+GK   E I +V E ID   Y A    R    ++   
Sbjct: 82  LLLKAADLLRRRRRELIATLTYEVGKNWVEAIDDVAEAIDFIRYYARAALRLRYPAVEVV 141

Query: 158 ERPGHVLLENWNPLGVVGIISAFNFPVAVYGWNA-AIALVGVAV 200
             PG      +  LG   +IS +NFPVA++     A   VG  V
Sbjct: 142 PYPGEDNESFYVGLGAGVVISPWNFPVAIFTGMIVAPVAVGNTV 185



 Score = 48.7 bits (116), Expect = 3e-06
 Identities = 34/119 (28%), Positives = 59/119 (49%), Gaps = 5/119 (4%)

Query: 249 RLFLHKKKIDRPGYFVEPTIVTGLPHNSRVVHRETFAPI--VYVFPFDSLDEAITWNNEV 306
           +L L  K+++  GYFV PT+V  +P  +R+   E F P+  V  +  D   EA+   N  
Sbjct: 366 QLVLGGKRLEGEGYFVAPTVVEEVPPKARIAQEEIFGPVLSVIRYKDDDFAEALEVANST 425

Query: 307 KQGLSSSIFTKDVTNLFKWLGPQGSDCGLINVNIPTNGAEIG-GAFGGEKYTGGGRESG 364
             GL+  ++++   +L +    +    G + +N    GA +G   FGG K +G   ++G
Sbjct: 426 PYGLTGGVYSRKREHL-EEARRE-FHVGNLYINRKITGALVGVQPFGGFKLSGTNAKTG 482



 Score = 45.3 bits (107), Expect = 4e-05
 Identities = 23/59 (38%), Positives = 34/59 (57%)

Query: 201 QSRFGKLLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLHKKKIDR 259
           Q+ F +L +E GG NAIIV+E AD  L  + +V +  G  GQ+C+   RL L +   + 
Sbjct: 260 QTWFKRLYVETGGKNAIIVDETADFELVVEGVVVSAFGFQGQKCSAASRLILTQGAYEP 318


>gnl|CDD|215410 PLN02766, PLN02766, coniferyl-aldehyde dehydrogenase.
          Length = 501

 Score = 70.6 bits (173), Expect = 3e-13
 Identities = 101/477 (21%), Positives = 177/477 (37%), Gaps = 161/477 (33%)

Query: 47  ANGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAY--AAWAALPAPRRGEIVRQIG 104
           A+G+   + +P TGE IA + +G+ +D    ++++R A+    W  +    RG I+ +  
Sbjct: 33  ASGKTFETRDPRTGEVIARIAEGDKEDVDLAVKAAREAFDHGPWPRMSGFERGRIMMKFA 92

Query: 105 DALRHKLVPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYS-------GSILPS 157
           D +   +  L  L +++ GK+ A G     + +DI   A GL R Y+       G  L  
Sbjct: 93  DLIEEHIEELAALDTIDAGKLFALG-----KAVDIPA-AAGLLRYYAGAADKIHGETLKM 146

Query: 158 ERP--GHVLLEN---------WN------------------------------------- 169
            R   G+ L E          WN                                     
Sbjct: 147 SRQLQGYTLKEPIGVVGHIIPWNFPSTMFFMKVAPALAAGCTMVVKPAEQTPLSALFYAH 206

Query: 170 -------PLGVVGIISAFNFPVAVYGWNAAIA-------------------LVGVAVQSR 203
                  P GV+ +++ F  P A     AAIA                   ++  A  S 
Sbjct: 207 LAKLAGVPDGVINVVTGFG-PTA----GAAIASHMDVDKVSFTGSTEVGRKIMQAAATSN 261

Query: 204 FGKLLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLH---------- 253
             ++ LELGG + +++ +DAD+++A    +       G+ C  + R+++           
Sbjct: 262 LKQVSLELGGKSPLLIFDDADVDMAVDLALLGIFYNKGEICVASSRVYVQEGIYDEFVKK 321

Query: 254 -------------------------KKKIDRPGYFVE------PTIVTG----------- 271
                                    K++ ++   ++E       T++TG           
Sbjct: 322 LVEKAKDWVVGDPFDPRARQGPQVDKQQFEKILSYIEHGKREGATLLTGGKPCGDKGYYI 381

Query: 272 -------LPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTK--DVTNL 322
                  +  + ++   E F P++ +  F +++EAI   N  K GL++ I TK  DV N 
Sbjct: 382 EPTIFTDVTEDMKIAQDEIFGPVMSLMKFKTVEEAIKKANNTKYGLAAGIVTKDLDVANT 441

Query: 323 FKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRESGSDSWKQYCR-RSTVT 378
                 +    G I VN      +    FGG K +G GR+ G D+  +Y + +S VT
Sbjct: 442 VS----RSIRAGTIWVNC-YFAFDPDCPFGGYKMSGFGRDQGMDALDKYLQVKSVVT 493


>gnl|CDD|233325 TIGR01238, D1pyr5carbox3, delta-1-pyrroline-5-carboxylate
           dehydrogenase (PutA C-terminal domain).  This model
           represents one of several related branches of
           delta-1-pyrroline-5-carboxylate dehydrogenase. Members
           of this branch are the C-terminal domain of the PutA
           bifunctional proline dehydrogenase /
           delta-1-pyrroline-5-carboxylate dehydrogenase [Energy
           metabolism, Amino acids and amines].
          Length = 500

 Score = 70.3 bits (172), Expect = 4e-13
 Identities = 46/156 (29%), Positives = 67/156 (42%), Gaps = 13/156 (8%)

Query: 42  GTSWKANGEIIPSINPSTGETI-ASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIV 100
           G S+KA+GE  P  NP+    I   V   N+      I+S++ A+  W A PA  R   +
Sbjct: 43  GHSYKADGEAQPVTNPADRRDIVGQVFHANLAHVQAAIDSAQQAFPTWNATPAKERAAKL 102

Query: 101 RQIGDALRHKLVPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSILPSERP 160
            ++ D L   +  L  L   E GK +   I EV+E +D C Y     R     +L     
Sbjct: 103 DRLADLLELHMPELMALCVREAGKTIHNAIAEVREAVDFCRYYAKQVR----DVLGEFSV 158

Query: 161 GHVLLENWNPLGVVGIISAFNFPVAVYGWNAAIALV 196
                      GV   IS +NFP+A++    + AL 
Sbjct: 159 --------ESRGVFVCISPWNFPLAIFTGQISAALA 186


>gnl|CDD|143429 cd07111, ALDH_F16, Aldehyde dehydrogenase family 16A1-like.
           Uncharacterized aldehyde dehydrogenase family 16 member
           A1 (ALDH16A1) and other related sequences are present in
           this CD. The active site cysteine and glutamate residues
           are not conserved in the human ALDH16A1 protein
           sequence.
          Length = 480

 Score = 70.1 bits (172), Expect = 4e-13
 Identities = 47/151 (31%), Positives = 73/151 (48%), Gaps = 14/151 (9%)

Query: 47  ANGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDA 106
            N +  P+INP+TGE +ASV Q   +D    + ++R A+ +W+ALP   R   + +I   
Sbjct: 34  ENRKSFPTINPATGEVLASVLQAEEEDVDAAVAAARTAFESWSALPGHVRARHLYRIARH 93

Query: 107 LRHKLVPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSG--SILPSERPGHVL 164
           ++        L SL+ GK     I E ++  DI          ++G   +L +E  G   
Sbjct: 94  IQKHQRLFAVLESLDNGK----PIRESRD-CDI-PLVARHFYHHAGWAQLLDTELAG--- 144

Query: 165 LENWNPLGVVGIISAFNFPVAVYGWNAAIAL 195
              W P+GVVG I  +NFP+ +  W    AL
Sbjct: 145 ---WKPVGVVGQIVPWNFPLLMLAWKICPAL 172



 Score = 65.9 bits (161), Expect = 1e-11
 Identities = 33/115 (28%), Positives = 59/115 (51%), Gaps = 3/115 (2%)

Query: 261 GYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVT 320
           G F  PT+ T +P  SR+   E F P++ V  F +  EA+   N    GL++S+++++++
Sbjct: 365 GPFYPPTLFTNVPPASRIAQEEIFGPVLVVLTFRTAKEAVALANNTPYGLAASVWSENLS 424

Query: 321 NLFKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRESGSDSWKQYCRRS 375
              + +       G++ +N   N  +    FGG + +G GRE G +   +Y R S
Sbjct: 425 LALE-VALS-LKAGVVWIN-GHNLFDAAAGFGGYRESGFGREGGKEGLYEYLRPS 476



 Score = 38.1 bits (89), Expect = 0.006
 Identities = 21/45 (46%), Positives = 26/45 (57%)

Query: 206 KLLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRL 250
           KL LELGG +  IV +DADL+ A + +V A     GQ C    RL
Sbjct: 251 KLSLELGGKSPFIVFDDADLDSAVEGIVDAIWFNQGQVCCAGSRL 295


>gnl|CDD|132294 TIGR03250, PhnAcAld_DH, putative phosphonoacetaldehyde
           dehydrogenase.  This family of genes are members of the
           pfam00171 NAD-dependent aldehyde dehydrogenase family.
           These genes are observed in Ralstonia eutropha JMP134,
           Sinorhizobium meliloti 1021, Burkholderia mallei ATCC
           23344, Burkholderia thailandensis E264, Burkholderia
           cenocepacia AU 1054, Burkholderia pseudomallei K96243
           and 1710b, Burkholderia xenovorans LB400, Burkholderia
           sp. 383 and Polaromonas sp. JS666 in close proximity to
           the PhnW gene (TIGR02326) encoding 2-aminoethyl
           phosphonate aminotransferase (which generates
           phosphonoacetaldehyde) and PhnA (TIGR02335) encoding
           phosphonoacetate hydrolase (not to be confused with the
           alkylphosphonate utilization operon protein PhnA modeled
           by TIGR00686). Additionally, transporters believed to be
           specific for 2-aminoethyl phosphonate are often present.
           PhnW is, in other organisms, coupled with PhnX
           (TIGR01422) for the degradation of phosphonoacetaldehyde
           (GenProp0238), but PhnX is apparently absent in each of
           the organisms containing this aldehyde reductase. PhnA,
           characterized in a strain of Pseudomonas fluorescens
           that has not het been genome sequenced, is only rarely
           found outside of the PhnW and aldehyde dehydrogenase
           context. For instance in Rhodopseudomonas and Bordetella
           bronchiseptica, where it is adjacent to transporters
           presumably specific for the import of phosphonoacetate.
           It seems reasonably certain then, that this enzyme
           catalyzes the NAD-dependent oxidation of
           phosphonoacetaldehyde to phosphonoacetate, bridging the
           metabolic gap between PhnW and PhnA. We propose the name
           phosphonoacetaldehyde dehydrogenase and the gene symbol
           PhnY for this enzyme.
          Length = 472

 Score = 69.5 bits (170), Expect = 8e-13
 Identities = 66/282 (23%), Positives = 100/282 (35%), Gaps = 78/282 (27%)

Query: 56  NPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDALRHKLVPLG 115
            P  G  + +V + +V D  R       A A    L    R  I+ +    L  +   + 
Sbjct: 23  YPYNGTVVGTVPKASVDDVRRAF---AIAAAYRPTLTRYERSAILDRAAALLAARKEEIS 79

Query: 116 QLVSLEMGKILAEGIGEVQEFIDICDYAVGLS-----RTYSGSILPSERPGHVLLENWNP 170
            L++LE G    + + EV    D+  +A   +     + +S  + P  +   V  +    
Sbjct: 80  DLITLESGLSKKDSLYEVGRVADVLTFAAAEALRDDGQIFSCDLTPHGKARKVFTQREPL 139

Query: 171 LGVVGIISAFNFP---VA------------------------------------------ 185
           LGV+  I+ FN P   VA                                          
Sbjct: 140 LGVISAITPFNHPMNQVAHKIAPAIATNNRMVVKPSEKTPLSALYLADILYEAGLPPQML 199

Query: 186 --VYGWNAAIA----------LV----GVAVQSR------FGKLLLELGGNNAIIVNEDA 223
             V G    IA          LV    GVA+         + + +LELGGN+ +IV EDA
Sbjct: 200 QVVTGDPREIADELITNPHVDLVTFTGGVAIGKYIAARAGYRRQVLELGGNDPLIVMEDA 259

Query: 224 DLNLAQQCLVFACCGTAGQRCTTTRRLFLHKKKIDRPGYFVE 265
           DL+ A    V      +GQRCT  +R+ + +   DR   F E
Sbjct: 260 DLDRAADLAVKGSYKNSGQRCTAVKRMLVQESVADR---FTE 298



 Score = 61.8 bits (150), Expect = 2e-10
 Identities = 37/133 (27%), Positives = 62/133 (46%), Gaps = 8/133 (6%)

Query: 249 RLFLHKKKIDRPGYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQ 308
           RL L      R G    PT++  +  +  +V  ETF P+  V  F  +D+AI  +N    
Sbjct: 344 RLLLGNV---RDGALYAPTVLDRVDPSMTLVREETFGPVSPVIRFCDIDDAIRISNSTAY 400

Query: 309 GLSSSIFTKDVTNLFKWLGPQGSDCGLINV-NIPTNGAEIGGAFGGEKYTGGGRESG-SD 366
           GLSS + T  +  + +++       G +NV  +P    E+   FGG K +G G + G  +
Sbjct: 401 GLSSGVCTNRLDYITRFI--AELQVGTVNVWEVPGYRLEL-TPFGGIKDSGLGYKEGVQE 457

Query: 367 SWKQYCRRSTVTI 379
           + K +    T ++
Sbjct: 458 AMKSFTNLKTYSL 470


>gnl|CDD|182233 PRK10090, PRK10090, aldehyde dehydrogenase A; Provisional.
          Length = 409

 Score = 69.0 bits (169), Expect = 9e-13
 Identities = 25/70 (35%), Positives = 43/70 (61%)

Query: 255 KKIDRPGYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSI 314
           K ++  GY+  PT++  +     ++H ETF P++ V  FD+L+EAI   N+   GL+SSI
Sbjct: 285 KAVEGKGYYYPPTLLLDVRQEMSIMHEETFGPVLPVVAFDTLEEAIAMANDSDYGLTSSI 344

Query: 315 FTKDVTNLFK 324
           +T+++    K
Sbjct: 345 YTQNLNVAMK 354



 Score = 56.3 bits (136), Expect = 1e-08
 Identities = 32/96 (33%), Positives = 44/96 (45%)

Query: 101 RQIGDALRHKLVPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSILPSERP 160
           R+I   +R +   +  L+  E GKI      EV    D  DY    +R Y G I+ S+RP
Sbjct: 2   RKIAAGIRERASEISALIVEEGGKIQQLAEVEVAFTADYIDYMAEWARRYEGEIIQSDRP 61

Query: 161 GHVLLENWNPLGVVGIISAFNFPVAVYGWNAAIALV 196
           G  +L     LGV   I  +NFP  +     A AL+
Sbjct: 62  GENILLFKRALGVTTGILPWNFPFFLIARKMAPALL 97



 Score = 44.7 bits (106), Expect = 5e-05
 Identities = 23/60 (38%), Positives = 34/60 (56%), Gaps = 3/60 (5%)

Query: 206 KLLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLHKKKIDRPGYFVE 265
           K+ LELGG    IV +DADL+LA + +V +    +GQ C    R+++ K   D+   FV 
Sbjct: 176 KVCLELGGKAPAIVMDDADLDLAVKAIVDSRVINSGQVCNCAERVYVQKGIYDQ---FVN 232


>gnl|CDD|179543 PRK03137, PRK03137, 1-pyrroline-5-carboxylate dehydrogenase;
           Provisional.
          Length = 514

 Score = 69.2 bits (170), Expect = 9e-13
 Identities = 36/148 (24%), Positives = 60/148 (40%), Gaps = 1/148 (0%)

Query: 50  EIIPSINPS-TGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDALR 108
           + I SINP+   E +  V +   +   + ++++  A+  W       R  I+ +    +R
Sbjct: 50  DKIVSINPANKSEVVGRVSKATKELAEKAMQAALEAFETWKKWSPEDRARILLRAAAIIR 109

Query: 109 HKLVPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSILPSERPGHVLLENW 168
            +       +  E GK  AE   +  E ID  +Y        +       RPG      +
Sbjct: 110 RRKHEFSAWLVKEAGKPWAEADADTAEAIDFLEYYARQMLKLADGKPVESRPGEHNRYFY 169

Query: 169 NPLGVVGIISAFNFPVAVYGWNAAIALV 196
            PLGV  +IS +NFP A+       A+V
Sbjct: 170 IPLGVGVVISPWNFPFAIMAGMTLAAIV 197



 Score = 51.1 bits (123), Expect = 5e-07
 Identities = 21/74 (28%), Positives = 37/74 (50%)

Query: 249 RLFLHKKKIDRPGYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQ 308
           RL L  +  D  GYF++PTI   +   +R++  E F P+V        D A+   N  + 
Sbjct: 382 RLVLGGEGDDSKGYFIQPTIFADVDPKARIMQEEIFGPVVAFIKAKDFDHALEIANNTEY 441

Query: 309 GLSSSIFTKDVTNL 322
           GL+ ++ + +  +L
Sbjct: 442 GLTGAVISNNREHL 455



 Score = 49.9 bits (120), Expect = 1e-06
 Identities = 21/54 (38%), Positives = 37/54 (68%)

Query: 206 KLLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLHKKKIDR 259
           +++ E+GG +AI+V+EDADL+LA + +V +  G +GQ+C+   R  +H+   D 
Sbjct: 282 RVIAEMGGKDAIVVDEDADLDLAAESIVASAFGFSGQKCSACSRAIVHEDVYDE 335


>gnl|CDD|212007 TIGR04284, aldehy_Rv0768, aldehyde dehydrogenase, Rv0768 family.
           This family describes a branch of the aldehyde
           dehydrogenase (NAD) family (see pfam00171) that includes
           Rv0768 from Mycobacterium tuberculosis. All members of
           this family belong to species predicted to synthesize
           mycofactocin, suggesting that this enzyme or another
           upstream or downstream in the same pathway might be
           mycofactocin-dependent. However, the taxonomic range of
           this family is not nearly broad enough to make that
           relationship conclusive [Unknown function, Enzymes of
           unknown specificity].
          Length = 480

 Score = 68.3 bits (167), Expect = 2e-12
 Identities = 42/122 (34%), Positives = 59/122 (48%), Gaps = 18/122 (14%)

Query: 258 DRP-GYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFT 316
           DR  G+FVEPT++ GL +N+RV   E F P++ V   D  D+A+   N+   GLS ++F 
Sbjct: 359 DRDRGFFVEPTVIAGLDNNARVAREEIFGPVLTVIAHDGDDDAVRIANDSPYGLSGTVFG 418

Query: 317 KDVTNLFKWLGPQGSDCGLINVNIPTNGAEIGGA-------FGGEKYTGGGRESGSDSWK 369
            D               G +NVN        GG        FGG K +G GRE G   ++
Sbjct: 419 ADPERAAAVA--ARVRTGTVNVN--------GGVWYSADAPFGGYKQSGIGREMGVAGFE 468

Query: 370 QY 371
           +Y
Sbjct: 469 EY 470



 Score = 48.6 bits (116), Expect = 4e-06
 Identities = 42/152 (27%), Positives = 69/152 (45%), Gaps = 12/152 (7%)

Query: 41  DGTSWKANGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAY--AAWAALPAPRRGE 98
           DG     +    P++NP+T E +         D    I ++R A+    W+   A  R  
Sbjct: 6   DGKLVAGSAGTFPTVNPATEEVLGVAADATAADMDAAIAAARRAFDETDWSRDTA-LRVR 64

Query: 99  IVRQIGDALRHKLVPLGQLVSLEMG--KILAEGIGEVQEFIDICDYAVGLSRTYS----- 151
            +RQ+ DALR  +  L +L   E+G  ++L  G  +++  +D   +A  L+ +Y+     
Sbjct: 65  CLRQLRDALRAHVEELRELTIAEVGAPRMLTAG-AQLEGPVDDLGFAADLAESYAWTTDL 123

Query: 152 GSILPSERPGHVLLENWNPLGVVGIISAFNFP 183
           G   P   P    L     +GVVG I+ +NFP
Sbjct: 124 GVASPMGIPTRRTLRR-EAVGVVGAITPWNFP 154


>gnl|CDD|215110 PLN00412, PLN00412, NADP-dependent glyceraldehyde-3-phosphate
           dehydrogenase; Provisional.
          Length = 496

 Score = 67.9 bits (166), Expect = 2e-12
 Identities = 51/179 (28%), Positives = 87/179 (48%), Gaps = 19/179 (10%)

Query: 35  VNPGVY----DGTSWK--ANGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAYAAW 88
           ++  VY    DG  W+  ++G+ +   NPST +T   VQ    ++ ++ +ES++AA  AW
Sbjct: 11  LDGDVYKYYADGE-WRTSSSGKSVAITNPSTRKTQYKVQACTQEEVNKAMESAKAAQKAW 69

Query: 89  AALPAPRRGEIVRQIGDALRHKLVPLGQLVSLEMGKILAEGIGEVQEFIDICDYAV--GL 146
           A  P  +R E++ +    L+    P+ + +  E+ K   + + EV    D+  Y    G+
Sbjct: 70  AKTPLWKRAELLHKAAAILKEHKAPIAECLVKEIAKPAKDAVTEVVRSGDLISYTAEEGV 129

Query: 147 SRTYSGSILPSER-PGH----VLLENWNPLGVVGIISAFNFPVAVYGWNAAIALVGVAV 200
                G  L S+  PG+      L +  PLGVV  I  FN+PV     N A++ +  A+
Sbjct: 130 RILGEGKFLVSDSFPGNERNKYCLTSKIPLGVVLAIPPFNYPV-----NLAVSKIAPAL 183



 Score = 43.2 bits (102), Expect = 2e-04
 Identities = 23/51 (45%), Positives = 32/51 (62%), Gaps = 2/51 (3%)

Query: 197 GVAVQSRFG--KLLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCT 245
           G+A+  + G   L +ELGG +A IV EDADL+LA   ++      +GQRCT
Sbjct: 249 GIAISKKAGMVPLQMELGGKDACIVLEDADLDLAAANIIKGGFSYSGQRCT 299



 Score = 39.7 bits (93), Expect = 0.002
 Identities = 17/61 (27%), Positives = 31/61 (50%)

Query: 259 RPGYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKD 318
           R G  + P ++  +  + R+   E F P++ V   +S++E I   N    GL   +FT+D
Sbjct: 368 REGNLIWPLLLDNVRPDMRIAWEEPFGPVLPVIRINSVEEGIHHCNASNFGLQGCVFTRD 427

Query: 319 V 319
           +
Sbjct: 428 I 428


>gnl|CDD|143471 cd07559, ALDH_ACDHII_AcoD-like, Ralstonia eutrophus NAD+-dependent
           acetaldehyde dehydrogenase II and Staphylococcus aureus
           AldA1 (SACOL0154)-like.  Included in this CD is the
           NAD+-dependent, acetaldehyde dehydrogenase II (AcDHII,
           AcoD, EC=1.2.1.3) from Ralstonia (Alcaligenes) eutrophus
           H16 involved in the catabolism of acetoin and ethanol,
           and similar proteins, such as, the dimeric
           dihydrolipoamide dehydrogenase of the acetoin
           dehydrogenase enzyme system of Klebsiella pneumonia.
           Also included are sequences similar to the
           NAD+-dependent chloroacetaldehyde dehydrogenases (AldA
           and AldB) of Xanthobacter autotrophicus GJ10 which are
           involved in the degradation of 1,2-dichloroethane, as
           well as, the uncharacterized aldehyde dehydrogenase from
           Staphylococcus aureus (AldA1, locus SACOL0154) and other
           similar sequences.
          Length = 480

 Score = 67.8 bits (166), Expect = 3e-12
 Identities = 42/163 (25%), Positives = 71/163 (43%), Gaps = 8/163 (4%)

Query: 38  GVYDGTSWKA--NGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPR 95
             +    W A   GE   + NP  G+ +  + +   +D    ++++  A+  W       
Sbjct: 2   DNFINGEWVAPSKGEYFDNYNPVNGKVLCEIPRSTAEDVDLAVDAAHEAFKTWGKTSVAE 61

Query: 96  RGEIVRQIGDALRHKLVPLGQLVSLEMGKILAEGIG-EVQEFIDICDYAVGLSRTYSGSI 154
           R  I+ +I D +   L  L    +L+ GK + E +  ++   ID   Y  G+ R   GS+
Sbjct: 62  RANILNKIADRIEENLELLAVAETLDNGKPIRETLAADIPLAIDHFRYFAGVIRAQEGSL 121

Query: 155 --LPSERPGHVLLENWNPLGVVGIISAFNFPVAVYGWNAAIAL 195
             +  +   +   E   PLGVVG I  +NFP+ +  W  A AL
Sbjct: 122 SEIDEDTLSYHFHE---PLGVVGQIIPWNFPLLMAAWKLAPAL 161



 Score = 66.2 bits (162), Expect = 7e-12
 Identities = 34/106 (32%), Positives = 52/106 (49%), Gaps = 11/106 (10%)

Query: 261 GYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVT 320
           GYF EPT++ G  ++ R+   E F P++ V  F   +EAI   N+ + GL   ++T+D+ 
Sbjct: 363 GYFYEPTLIKGGNNDMRIFQEEIFGPVLAVITFKDEEEAIAIANDTEYGLGGGVWTRDIN 422

Query: 321 NLFKWLGPQGSDCGLINVN----IPTNGAEIGGAFGGEKYTGGGRE 362
              +    +G   G + VN     P         FGG K +G GRE
Sbjct: 423 RALRV--ARGIQTGRVWVNCYHQYPA-----HAPFGGYKKSGIGRE 461


>gnl|CDD|132284 TIGR03240, arg_catab_astD, succinylglutamic semialdehyde
           dehydrogenase.  Members of this protein family are
           succinylglutamic semialdehyde dehydrogenase (EC
           1.2.1.71), the fourth enzyme in the arginine
           succinyltransferase (AST) pathway for arginine
           catabolism [Energy metabolism, Amino acids and amines].
          Length = 484

 Score = 67.8 bits (166), Expect = 3e-12
 Identities = 41/143 (28%), Positives = 64/143 (44%), Gaps = 8/143 (5%)

Query: 45  WKA-NGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQI 103
           W+A  GE   S NP+T E +      +       + ++RAA+ AWA L    R  +V++ 
Sbjct: 7   WRAGQGESFSSTNPATQEVLWQGAAASAAQVEAAVAAARAAFPAWARLSLEERIAVVQRF 66

Query: 104 GDALRHKLVPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSILPSERP--- 160
              L  +   L ++++ E GK L E   EV   I     ++      +G    SE P   
Sbjct: 67  AALLEERKEALARVIARETGKPLWETRTEVASMIGKVAISIKAYHERTGE---SENPMPD 123

Query: 161 GHVLLENWNPLGVVGIISAFNFP 183
           G  +L +  P GVV +   +NFP
Sbjct: 124 GRAVLRH-RPHGVVAVFGPYNFP 145



 Score = 52.4 bits (126), Expect = 2e-07
 Identities = 22/46 (47%), Positives = 32/46 (69%)

Query: 207 LLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFL 252
           L LE+GGNN +IV+E AD++ A   ++ +   +AGQRCT  RRL +
Sbjct: 238 LALEMGGNNPLIVDEVADIDAAVHLIIQSAFISAGQRCTCARRLLV 283



 Score = 42.8 bits (101), Expect = 2e-04
 Identities = 32/103 (31%), Positives = 48/103 (46%), Gaps = 7/103 (6%)

Query: 263 FVEPTI--VTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVT 320
            + P I  VTG+   + +   E F P++ V  +D  DEAI   N  + GLS+ + + D  
Sbjct: 355 LLTPGIIDVTGV---AELPDEEHFGPLLQVIRYDDFDEAIAIANNTRFGLSAGLLSDDRE 411

Query: 321 NLFKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRES 363
              ++L       G++N N P  GA     FGG   +G  R S
Sbjct: 412 LYDRFL--LEIRAGIVNWNKPLTGASSAAPFGGIGASGNHRPS 452


>gnl|CDD|181826 PRK09406, gabD1, succinic semialdehyde dehydrogenase; Reviewed.
          Length = 457

 Score = 67.5 bits (165), Expect = 3e-12
 Identities = 37/126 (29%), Positives = 65/126 (51%), Gaps = 5/126 (3%)

Query: 255 KKIDRPGYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSI 314
           K+ D PG+F  PT++T +  + R+   E F P+  ++    +DEAI   N    GL S+ 
Sbjct: 335 KRPDGPGWFYPPTVITDITPDMRLYTEEVFGPVASLYRVADIDEAIEIANATTFGLGSNA 394

Query: 315 FTKDVTNLFKWLGPQGSDCGLINVN-IPTNGAEIGGAFGGEKYTGGGRESGSDSWKQYCR 373
           +T+D     +++     + G + +N +  +  E+   FGG K +G GRE  +   +++C 
Sbjct: 395 WTRDEAEQERFI--DDLEAGQVFINGMTVSYPELP--FGGVKRSGYGRELSAHGIREFCN 450

Query: 374 RSTVTI 379
             TV I
Sbjct: 451 IKTVWI 456



 Score = 44.3 bits (105), Expect = 7e-05
 Identities = 37/149 (24%), Positives = 54/149 (36%), Gaps = 32/149 (21%)

Query: 52  IPSINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDALRHKL 111
           I +INP+TGET+ +       +    I  + A +  +      +R        D L  + 
Sbjct: 3   IATINPATGETVKTFTALTDDEVDAAIARAHARFRDYRTTTFAQRARWANAAADLLEAEA 62

Query: 112 VPLGQLVSLEMGKILAEGIGEVQEFIDICDY-----------------AVGLSRTYSGSI 154
             +  L++LEMGK LA    E  +      Y                 AVG SR Y    
Sbjct: 63  DQVAALMTLEMGKTLASAKAEALKCAKGFRYYAEHAEALLADEPADAAAVGASRAYV--- 119

Query: 155 LPSERPGHVLLENWNPLGVVGIISAFNFP 183
                        + PLGVV  +  +NFP
Sbjct: 120 ------------RYQPLGVVLAVMPWNFP 136



 Score = 39.0 bits (91), Expect = 0.003
 Identities = 19/48 (39%), Positives = 26/48 (54%)

Query: 206 KLLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLH 253
           K +LELGG++  IV   ADL+ A +  V A     GQ C   +R  +H
Sbjct: 227 KTVLELGGSDPFIVMPSADLDRAAETAVTARVQNNGQSCIAAKRFIVH 274


>gnl|CDD|143435 cd07117, ALDH_StaphAldA1, Uncharacterized Staphylococcus aureus
           AldA1 (SACOL0154) aldehyde dehydrogenase-like.
           Uncharacterized aldehyde dehydrogenase from
           Staphylococcus aureus (AldA1, locus SACOL0154) and other
           similar sequences are present in this CD.
          Length = 475

 Score = 67.5 bits (165), Expect = 3e-12
 Identities = 35/103 (33%), Positives = 54/103 (52%), Gaps = 3/103 (2%)

Query: 261 GYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVT 320
           G+F+EPT++  + ++ RV   E F P+  V  F + DE I   N+ + GL   +FTKD+ 
Sbjct: 358 GFFIEPTLIVNVTNDMRVAQEEIFGPVATVIKFKTEDEVIDMANDSEYGLGGGVFTKDIN 417

Query: 321 NLFKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRES 363
              +    +  + G + VN   N    G  FGG K +G GRE+
Sbjct: 418 RALRV--ARAVETGRVWVNT-YNQIPAGAPFGGYKKSGIGRET 457



 Score = 65.9 bits (161), Expect = 8e-12
 Identities = 43/155 (27%), Positives = 72/155 (46%), Gaps = 12/155 (7%)

Query: 47  ANGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDA 106
           ++GE I S NP+ GET++ +      D  R +++++ A+  W       R  I+ +I D 
Sbjct: 13  SSGETIDSYNPANGETLSEITDATDADVDRAVKAAQEAFKTWRKTTVAERANILNKIADI 72

Query: 107 LRHKLVPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSILPSERPGHVLLE 166
           +      L  + +L+ GK + E        +DI   A    R ++G I   E   +++ E
Sbjct: 73  IDENKELLAMVETLDNGKPIRETRA-----VDI-PLAADHFRYFAGVIRAEEGSANMIDE 126

Query: 167 NW------NPLGVVGIISAFNFPVAVYGWNAAIAL 195
           +        P+GVVG I  +NFP  +  W  A AL
Sbjct: 127 DTLSIVLREPIGVVGQIIPWNFPFLMAAWKLAPAL 161


>gnl|CDD|181873 PRK09457, astD, succinylglutamic semialdehyde dehydrogenase;
           Reviewed.
          Length = 487

 Score = 67.3 bits (165), Expect = 3e-12
 Identities = 41/146 (28%), Positives = 58/146 (39%), Gaps = 14/146 (9%)

Query: 45  WKA-NGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQI 103
           W A  GE   S NP +GE +              + ++RAA+ AWA L    R  IV + 
Sbjct: 9   WIAGQGEAFESRNPVSGEVLWQGNDATAAQVDAAVRAARAAFPAWARLSFEERQAIVERF 68

Query: 104 GDALRHKLVPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLS------RTYSGSILPS 157
              L      L ++++ E GK L E   EV   I+     + +S      RT       S
Sbjct: 69  AALLEENKEELAEVIARETGKPLWEAATEVTAMIN----KIAISIQAYHERTGEKR---S 121

Query: 158 ERPGHVLLENWNPLGVVGIISAFNFP 183
           E      +    P GVV +   +NFP
Sbjct: 122 EMADGAAVLRHRPHGVVAVFGPYNFP 147



 Score = 49.2 bits (118), Expect = 2e-06
 Identities = 21/49 (42%), Positives = 34/49 (69%), Gaps = 1/49 (2%)

Query: 205 GKLL-LELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFL 252
            K+L LE+GGNN ++++E AD++ A   ++ +   +AGQRCT  RRL +
Sbjct: 237 EKILALEMGGNNPLVIDEVADIDAAVHLIIQSAFISAGQRCTCARRLLV 285



 Score = 39.9 bits (94), Expect = 0.002
 Identities = 32/103 (31%), Positives = 49/103 (47%), Gaps = 7/103 (6%)

Query: 263 FVEPTI--VTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVT 320
            + P I  VTG+   + +   E F P++ V  +D  DEAI   N  + GLS+ + + D  
Sbjct: 357 LLTPGIIDVTGV---AELPDEEYFGPLLQVVRYDDFDEAIRLANNTRFGLSAGLLSDDRE 413

Query: 321 NLFKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRES 363
           +  ++L       G++N N P  GA     FGG   +G  R S
Sbjct: 414 DYDQFL--LEIRAGIVNWNKPLTGASSAAPFGGVGASGNHRPS 454


>gnl|CDD|143423 cd07105, ALDH_SaliADH, Salicylaldehyde dehydrogenase, DoxF-like.
           Salicylaldehyde dehydrogenase (DoxF, SaliADH,
           EC=1.2.1.65) involved in the upper naphthalene catabolic
           pathway of Pseudomonas strain C18 and other similar
           sequences are present in this CD.
          Length = 432

 Score = 66.8 bits (164), Expect = 4e-12
 Identities = 30/119 (25%), Positives = 58/119 (48%), Gaps = 2/119 (1%)

Query: 261 GYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVT 320
           G  + PTI+  +  +  +   E+F P+V +      +EA+   N+ + GLS+++FT+D+ 
Sbjct: 316 GTSMPPTILDNVTPDMDIYSEESFGPVVSIIRVKDEEEAVRIANDSEYGLSAAVFTRDLA 375

Query: 321 NLFKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRESGSDSWKQYCRRSTVTI 379
                +  +  + G +++N  T   E     GG K +G GR +G     ++     +TI
Sbjct: 376 RALA-VAKR-IESGAVHINGMTVHDEPTLPHGGVKSSGYGRFNGKWGIDEFTETKWITI 432



 Score = 51.8 bits (125), Expect = 3e-07
 Identities = 25/58 (43%), Positives = 32/58 (55%), Gaps = 3/58 (5%)

Query: 208 LLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLHKKKIDRPGYFVE 265
           LLELGG    IV EDADL+ A    +F     +GQ C +T R+ +H+   D    FVE
Sbjct: 208 LLELGGKAPAIVLEDADLDAAANAALFGAFLNSGQICMSTERIIVHESIADE---FVE 262



 Score = 44.9 bits (107), Expect = 4e-05
 Identities = 29/117 (24%), Positives = 50/117 (42%), Gaps = 10/117 (8%)

Query: 73  DYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDALRHKLVPLGQLVSLEMGKILAEGIGE 132
           D  + +E++ AA+ AW+  P   R +I+ +  D L  +     + +  E G         
Sbjct: 1   DADQAVEAAAAAFPAWSKTPPSERRDILLKAADLLESRRDEFIEAMMEETG-----ATAA 55

Query: 133 VQEF-----IDICDYAVGLSRTYSGSILPSERPGHVLLENWNPLGVVGIISAFNFPV 184
              F       +   A  L     G  +PS++PG + +    P+GVV  I+ +N PV
Sbjct: 56  WAGFNVDLAAGMLREAASLITQIIGGSIPSDKPGTLAMVVKEPVGVVLGIAPWNAPV 112


>gnl|CDD|143420 cd07102, ALDH_EDX86601, Uncharacterized aldehyde dehydrogenase of
           Synechococcus sp. PCC 7335 (EDX86601).  Uncharacterized
           aldehyde dehydrogenase of Synechococcus sp. PCC 7335
           (locus EDX86601) and other similar sequences, are
           present in this CD.
          Length = 452

 Score = 64.6 bits (158), Expect = 2e-11
 Identities = 43/150 (28%), Positives = 71/150 (47%), Gaps = 7/150 (4%)

Query: 55  INPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDALRHKLVPL 114
           I+P  G  IA     +++     +E +RAA   W A+P   R  IV +  + L      +
Sbjct: 1   ISPIDGSVIAERPLASLEAVRAALERARAAQKGWRAVPLEERKAIVTRAVELLAANTDEI 60

Query: 115 GQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSILPSERPGHVLLENWNPLGVV 174
            + ++ +MG+ +A+  GE++  ++   Y + ++      I   E+ G        PLGVV
Sbjct: 61  AEELTWQMGRPIAQAGGEIRGMLERARYMISIAEEALADIRVPEKDGFERYIRREPLGVV 120

Query: 175 GIISAFNFP--VAVYGWNAAIA--LVGVAV 200
            II+ +N+P   AV   NA I   L G AV
Sbjct: 121 LIIAPWNYPYLTAV---NAVIPALLAGNAV 147



 Score = 54.6 bits (132), Expect = 4e-08
 Identities = 23/59 (38%), Positives = 38/59 (64%)

Query: 260 PGYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKD 318
            G ++ PT++T + H+ RV+  ETF P+V +    S  EAI   N+ + GL++S++TKD
Sbjct: 336 GGAYLAPTVLTNVDHSMRVMREETFGPVVGIMKVKSDAEAIALMNDSEYGLTASVWTKD 394



 Score = 44.2 bits (105), Expect = 8e-05
 Identities = 28/73 (38%), Positives = 39/73 (53%), Gaps = 7/73 (9%)

Query: 197 GVAVQ----SRFGKLLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFL 252
           G A+Q     RF K+ LELGG +   V  DADL+ A + LV      +GQ C +  R+++
Sbjct: 207 GRAIQRAAAGRFIKVGLELGGKDPAYVRPDADLDAAAESLVDGAFFNSGQSCCSIERIYV 266

Query: 253 HKKKIDRPGYFVE 265
           H+   D    FVE
Sbjct: 267 HESIYDA---FVE 276


>gnl|CDD|166060 PLN02419, PLN02419, methylmalonate-semialdehyde dehydrogenase
           [acylating].
          Length = 604

 Score = 64.8 bits (157), Expect = 2e-11
 Identities = 35/148 (23%), Positives = 69/148 (46%)

Query: 48  NGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDAL 107
           +   I  INP+T E ++ V     +++   + +++ A+  W   P   R  ++ +  + +
Sbjct: 127 SSSFIDVINPATQEVVSKVPLTTNEEFKAAVSAAKQAFPLWRNTPITTRQRVMLKFQELI 186

Query: 108 RHKLVPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSILPSERPGHVLLEN 167
           R  +  L   ++ E GK L +  G++   +++ ++A G++    G  LP+   G      
Sbjct: 187 RKNMDKLAMNITTEQGKTLKDSHGDIFRGLEVVEHACGMATLQMGEYLPNVSNGVDTYSI 246

Query: 168 WNPLGVVGIISAFNFPVAVYGWNAAIAL 195
             PLGV   I  FNFP  +  W   +A+
Sbjct: 247 REPLGVCAGICPFNFPAMIPLWMFPVAV 274



 Score = 47.0 bits (111), Expect = 1e-05
 Identities = 30/97 (30%), Positives = 48/97 (49%), Gaps = 6/97 (6%)

Query: 249 RLFLHKKKIDRPGY----FVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNN 304
           +L L  + I  PGY    F+ PTI++G+  +      E F P++     +S DEAI+  N
Sbjct: 454 KLLLDGRDIVVPGYEKGNFIGPTILSGVTPDMECYKEEIFGPVLVCMQANSFDEAISIIN 513

Query: 305 EVKQGLSSSIFTKDVTNLFKWLGPQGSDCGLINVNIP 341
           + K G  ++IFT       K+      + G I +N+P
Sbjct: 514 KNKYGNGAAIFTSSGAAARKF--QMDIEAGQIGINVP 548


>gnl|CDD|143416 cd07098, ALDH_F15-22, Aldehyde dehydrogenase family 15A1 and
           22A1-like.  Aldehyde dehydrogenase family members
           ALDH15A1 (Saccharomyces cerevisiae YHR039C) and ALDH22A1
           (Arabidopsis thaliana, EC=1.2.1.3), and similar
           sequences, are in this CD. Significant improvement of
           stress tolerance in tobacco plants was observed by
           overexpressing the ALDH22A1 gene from maize (Zea mays)
           and was accompanied by a reduction of malondialdehyde
           derived from cellular lipid peroxidation.
          Length = 465

 Score = 64.6 bits (158), Expect = 3e-11
 Identities = 31/125 (24%), Positives = 58/125 (46%), Gaps = 15/125 (12%)

Query: 261 GYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVT 320
           G++  PT++  +  + ++   E F P++ V      +EA+   N  + GL +S+F KD+ 
Sbjct: 346 GHYFPPTLLVDVTPDMKIAQEEVFGPVMVVMKASDDEEAVEIANSTEYGLGASVFGKDIK 405

Query: 321 NLFKWLGPQGSDCGLINVNIPTNGAEIGGA-------FGGEKYTGGGRESGSDSWKQYCR 373
              + +  Q    G++ +N      + G         FGG K +G GR +G +  +  C 
Sbjct: 406 RA-RRIASQLE-TGMVAIN------DFGVNYYVQQLPFGGVKGSGFGRFAGEEGLRGLCN 457

Query: 374 RSTVT 378
             +VT
Sbjct: 458 PKSVT 462



 Score = 41.9 bits (99), Expect = 5e-04
 Identities = 18/53 (33%), Positives = 31/53 (58%)

Query: 207 LLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLHKKKIDR 259
           ++LELGG +  IV +DADL+     ++     ++GQ C    R+ +H+K  D+
Sbjct: 229 VVLELGGKDPAIVLDDADLDQIASIIMRGTFQSSGQNCIGIERVIVHEKIYDK 281



 Score = 39.2 bits (92), Expect = 0.003
 Identities = 34/136 (25%), Positives = 66/136 (48%), Gaps = 12/136 (8%)

Query: 56  NPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGD-ALRHKLVPL 114
           +P+TG+ + SV     +D    I ++RAA   WA      R +++R +    L ++   +
Sbjct: 2   DPATGQHLGSVPADTPEDVDEAIAAARAAQREWAKTSFAERRKVLRSLLKYILENQEE-I 60

Query: 115 GQLVSLEMGKILAEG-IGEVQEFIDICDYAVGLSRTYSGSILPSERPGHVLLE------N 167
            ++   + GK + +  +GE+   +  C+      +    ++ P  RPG +L+        
Sbjct: 61  CRVACRDTGKTMVDASLGEI---LVTCEKIRWTLKHGEKALRPESRPGGLLMFYKRARVE 117

Query: 168 WNPLGVVGIISAFNFP 183
           + PLGVVG I ++N+P
Sbjct: 118 YEPLGVVGAIVSWNYP 133


>gnl|CDD|236989 PRK11809, putA, trifunctional transcriptional regulator/proline
           dehydrogenase/pyrroline-5-carboxylate dehydrogenase;
           Reviewed.
          Length = 1318

 Score = 62.3 bits (152), Expect = 2e-10
 Identities = 44/150 (29%), Positives = 69/150 (46%), Gaps = 13/150 (8%)

Query: 47  ANGEIIPSINPS-TGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGD 105
           A GE+ P INP+   + +  V++    +  + +ES+  A   W A P   R  I+ +  D
Sbjct: 656 AAGEMSPVINPADPRDIVGYVREATPAEVEQALESAVNAAPIWFATPPAERAAILERAAD 715

Query: 106 ALRHKLVPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSILPSERPGHVLL 165
            +  ++  L  L+  E GK  +  I EV+E       AV   R Y+G +           
Sbjct: 716 LMEAQMQTLMGLLVREAGKTFSNAIAEVRE-------AVDFLRYYAGQVRDDFDN----- 763

Query: 166 ENWNPLGVVGIISAFNFPVAVYGWNAAIAL 195
           +   PLG V  IS +NFP+A++    A AL
Sbjct: 764 DTHRPLGPVVCISPWNFPLAIFTGQVAAAL 793



 Score = 31.5 bits (72), Expect = 0.92
 Identities = 21/56 (37%), Positives = 30/56 (53%), Gaps = 6/56 (10%)

Query: 207 LLLELGGNNAIIVNEDADLNLAQQC---LVFACCGTAGQRCTTTRRLFLHKKKIDR 259
           L+ E GG NA+IV+  A   L +Q    ++ +   +AGQRC+  R L L     DR
Sbjct: 880 LIAETGGQNAMIVDSSA---LTEQVVADVLASAFDSAGQRCSALRVLCLQDDVADR 932


>gnl|CDD|143453 cd07135, ALDH_F14-YMR110C, Saccharomyces cerevisiae aldehyde
           dehydrogenase family 14 and related proteins.  Aldehyde
           dehydrogenase family 14 (ALDH14), isolated mainly from
           the mitochondrial outer membrane of Saccharomyces
           cerevisiae (YMR110C) and most closely related to the
           plant and animal ALDHs and fatty ALDHs family 3 members,
           and similar fungal sequences, are present in this CD.
          Length = 436

 Score = 61.1 bits (149), Expect = 3e-10
 Identities = 53/269 (19%), Positives = 79/269 (29%), Gaps = 120/269 (44%)

Query: 170 PLGVVGIISAFNFPV-------------------------------------------AV 186
           PLGVV II  +N+PV                                           A 
Sbjct: 108 PLGVVLIIGPWNYPVLLALSPLVGAIAAGCTVVLKPSELTPHTAALLAELVPKYLDPDAF 167

Query: 187 YGWNAAIALVGVAVQSRFGK-----------------------LLLELGGNNAIIVNEDA 223
                 +      ++ +F K                       + LELGG + +IV ++A
Sbjct: 168 QVVQGGVPETTALLEQKFDKIFYTGSGRVGRIIAEAAAKHLTPVTLELGGKSPVIVTKNA 227

Query: 224 DLNLAQQCLVFACCGTAGQ--------------------RCTTTRRLF------------ 251
           DL LA + +++   G AGQ                            F            
Sbjct: 228 DLELAAKRILWGKFGNAGQICVAPDYVLVDPSVYDEFVEELKKVLDEFYPGGANASPDYT 287

Query: 252 ----------LHKK------KI------DRPGYFVEPTIVTGLPHNSRVVHRETFAPIVY 289
                     L         K+      D    F+ PTIV+ +  +  ++  E F P++ 
Sbjct: 288 RIVNPRHFNRLKSLLDTTKGKVVIGGEMDEATRFIPPTIVSDVSWDDSLMSEELFGPVLP 347

Query: 290 VFPFDSLDEAITWNNEVKQGLSSSIFTKD 318
           +   D LDEAI   N     L+  IFT D
Sbjct: 348 IIKVDDLDEAIKVINSRDTPLALYIFTDD 376


>gnl|CDD|143434 cd07116, ALDH_ACDHII-AcoD, Ralstonia eutrophus NAD+-dependent
           acetaldehyde dehydrogenase II-like.  Included in this CD
           is the NAD+-dependent, acetaldehyde dehydrogenase II
           (AcDHII, AcoD, EC=1.2.1.3) from Ralstonia (Alcaligenes)
           eutrophus H16 involved in the catabolism of acetoin and
           ethanol, and similar proteins, such as, the dimeric
           dihydrolipoamide dehydrogenase of the acetoin
           dehydrogenase enzyme system of Klebsiella pneumonia.
           Also included are sequences similar to the
           NAD+-dependent chloroacetaldehyde dehydrogenases (AldA
           and AldB) of Xanthobacter autotrophicus GJ10 which are
           involved in the degradation of 1,2-dichloroethane. These
           proteins apparently require RpoN factors for expression.
          Length = 479

 Score = 60.5 bits (147), Expect = 5e-10
 Identities = 48/160 (30%), Positives = 74/160 (46%), Gaps = 6/160 (3%)

Query: 40  YDGTSWKA--NGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRG 97
           + G  W A   GE   +I P TG+    V +   +D    ++++ AA  AW       R 
Sbjct: 4   FIGGEWVAPVKGEYFDNITPVTGKVFCEVPRSTAEDIELALDAAHAAKEAWGKTSVAERA 63

Query: 98  EIVRQIGDALRHKLVPLGQLVSLEMGKILAEGIG-EVQEFIDICDYAVGLSRTYSGSILP 156
            I+ +I D +   L  L    + + GK + E +  ++   ID   Y  G  R   GSI  
Sbjct: 64  NILNKIADRMEANLEMLAVAETWDNGKPVRETLAADIPLAIDHFRYFAGCIRAQEGSI-- 121

Query: 157 SERPGHVLLENWN-PLGVVGIISAFNFPVAVYGWNAAIAL 195
           SE   + +  +++ PLGVVG I  +NFP+ +  W  A AL
Sbjct: 122 SEIDENTVAYHFHEPLGVVGQIIPWNFPLLMATWKLAPAL 161



 Score = 55.2 bits (133), Expect = 2e-08
 Identities = 31/107 (28%), Positives = 49/107 (45%), Gaps = 13/107 (12%)

Query: 261 GYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVT 320
           GY+V  T   G  +  R+   E F P++ V  F   +EA+   N+   GL + ++T+D  
Sbjct: 364 GYYVPTTFKGG--NKMRIFQEEIFGPVLAVTTFKDEEEALEIANDTLYGLGAGVWTRDGN 421

Query: 321 NLFKWLGPQGSDCGLINVN----IPTNGAEIGGAFGGEKYTGGGRES 363
             ++    +G   G +  N     P        AFGG K +G GRE+
Sbjct: 422 TAYRM--GRGIQAGRVWTNCYHLYPA-----HAAFGGYKQSGIGREN 461


>gnl|CDD|143454 cd07136, ALDH_YwdH-P39616, Bacillus subtilis aldehyde dehydrogenase
           ywdH-like.  Uncharacterized Bacillus subtilis ywdH
           aldehyde dehydrogenase (locus P39616)  most closely
           related to the ALDHs and fatty ALDHs of families 3 and
           14, and similar sequences, are included in this CD.
          Length = 449

 Score = 55.6 bits (135), Expect = 2e-08
 Identities = 44/162 (27%), Positives = 64/162 (39%), Gaps = 52/162 (32%)

Query: 209 LELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRC----------------------TT 246
           LELGG +  IV+EDA+L LA + +V+     AGQ C                        
Sbjct: 205 LELGGKSPCIVDEDANLKLAAKRIVWGKFLNAGQTCVAPDYVLVHESVKEKFIKELKEEI 264

Query: 247 T----------------------RRL--FLHKKKI------DRPGYFVEPTIVTGLPHNS 276
                                   RL   L   KI      DR   ++EPTI+  +  + 
Sbjct: 265 KKFYGEDPLESPDYGRIINEKHFDRLAGLLDNGKIVFGGNTDRETLYIEPTILDNVTWDD 324

Query: 277 RVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKD 318
            V+  E F PI+ V  +D+LDEAI       + L+  +F++D
Sbjct: 325 PVMQEEIFGPILPVLTYDTLDEAIEIIKSRPKPLALYLFSED 366


>gnl|CDD|240392 PTZ00381, PTZ00381, aldehyde dehydrogenase family protein;
           Provisional.
          Length = 493

 Score = 50.8 bits (122), Expect = 6e-07
 Identities = 48/235 (20%), Positives = 81/235 (34%), Gaps = 71/235 (30%)

Query: 209 LELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQ-------------------------- 242
           LELGG + +IV++  +L +A + + +     AGQ                          
Sbjct: 214 LELGGKSPVIVDKSCNLKVAARRIAWGKFLNAGQTCVAPDYVLVHRSIKDKFIEALKEAI 273

Query: 243 ------------------RCTTTRRL----FLHKKKI------DRPGYFVEPTIVTGLPH 274
                                 T+RL      H  K+      D    +V PTI+     
Sbjct: 274 KEFFGEDPKKSEDYSRIVNEFHTKRLAELIKDHGGKVVYGGEVDIENKYVAPTIIVNPDL 333

Query: 275 NSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVTNLFKWLGPQGS--- 331
           +S ++  E F PI+ +  ++++DE + + N   + L+   F +D  +    L    S   
Sbjct: 334 DSPLMQEEIFGPILPILTYENIDEVLEFINSRPKPLALYYFGEDKRHKELVLENTSSGAV 393

Query: 332 ---DC--GLINVNIPTNGAEIGGAFGGEKYTGGGRESGSDSWKQYCRRSTVTINH 381
              DC   L+N N+P         FGG   +G G   G   +  +     V    
Sbjct: 394 VINDCVFHLLNPNLP---------FGGVGNSGMGAYHGKYGFDTFSHPKPVLNKS 439



 Score = 31.9 bits (73), Expect = 0.67
 Identities = 14/56 (25%), Positives = 28/56 (50%), Gaps = 6/56 (10%)

Query: 130 IGEVQEFI-DICDYAVGLSRTYSGSILPSERPGHVLLENWNPLGVVGIISAFNFPV 184
           + E++  +  + +Y         G   P +   +++ E   PLGVV +I A+N+P+
Sbjct: 73  VAEIEHLLKHLDEYLKPEKVDTVGVFGPGK--SYIIPE---PLGVVLVIGAWNYPL 123


>gnl|CDD|143452 cd07134, ALDH_AlkH-like, Pseudomonas putida Aldehyde dehydrogenase
           AlkH-like.  Aldehyde dehydrogenase AlkH (locus name
           P12693, EC=1.2.1.3) of the alkBFGHJKL operon that allows
           Pseudomonas putida to metabolize alkanes and the
           aldehyde dehydrogenase AldX of Bacillus subtilis (locus
           P46329, EC=1.2.1.3), and similar sequences, are present
           in this CD.
          Length = 433

 Score = 49.5 bits (119), Expect = 2e-06
 Identities = 16/63 (25%), Positives = 36/63 (57%)

Query: 256 KIDRPGYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIF 315
           + D    ++ PT++T +  + +++  E F P++ +  ++ LDE I + N   + L+  +F
Sbjct: 311 QFDAAQRYIAPTVLTNVTPDMKIMQEEIFGPVLPIITYEDLDEVIEYINAKPKPLALYVF 370

Query: 316 TKD 318
           +KD
Sbjct: 371 SKD 373



 Score = 38.4 bits (90), Expect = 0.005
 Identities = 19/51 (37%), Positives = 27/51 (52%)

Query: 209 LELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLHKKKIDR 259
           LELGG +  IV+E ADL  A + + +     AGQ C     +F+H+   D 
Sbjct: 205 LELGGKSPTIVDETADLKKAAKKIAWGKFLNAGQTCIAPDYVFVHESVKDA 255


>gnl|CDD|182108 PRK09847, PRK09847, gamma-glutamyl-gamma-aminobutyraldehyde
           dehydrogenase; Provisional.
          Length = 494

 Score = 49.5 bits (118), Expect = 2e-06
 Identities = 33/159 (20%), Positives = 66/159 (41%), Gaps = 16/159 (10%)

Query: 47  ANGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAY--AAWAALPAPRRGEIVRQIG 104
           A  E   +++P T   +A + +G   D  R + ++R  +    W+     +R  ++ ++ 
Sbjct: 32  AENETFETVDPVTQAPLAKIARGKSVDIDRAVSAARGVFERGDWSLSSPAKRKAVLNKLA 91

Query: 105 DALRHKLVPLGQLVSLEMGKILAEGIGEVQEFIDICD-------YAVGLSRTYSGSILPS 157
           D +      L  L +L+ GK +   + +     DI         YA  + + Y G +  +
Sbjct: 92  DLMEAHAEELALLETLDTGKPIRHSLRD-----DIPGAARAIRWYAEAIDKVY-GEVATT 145

Query: 158 ERPGHVLLENWNPLGVVGIISAFNFPVAVYGWNAAIALV 196
                 ++    P+GV+  I  +NFP+ +  W    AL 
Sbjct: 146 SSHELAMIVR-EPVGVIAAIVPWNFPLLLTCWKLGPALA 183



 Score = 49.1 bits (117), Expect = 2e-06
 Identities = 31/130 (23%), Positives = 62/130 (47%), Gaps = 4/130 (3%)

Query: 250 LFLHKKKIDRPGYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQG 309
           L L  +        + PTI   +  N+ +   E F P++ V  F S ++A+   N+ + G
Sbjct: 366 LLLDGRNAGLAAA-IGPTIFVDVDPNASLSREEIFGPVLVVTRFTSEEQALQLANDSQYG 424

Query: 310 LSSSIFTKDVTNLFKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRESGSDSWK 369
           L ++++T+D++   +    +    G + VN   N  ++   FGG K +G GR+    + +
Sbjct: 425 LGAAVWTRDLSRAHRM--SRRLKAGSVFVN-NYNDGDMTVPFGGYKQSGNGRDKSLHALE 481

Query: 370 QYCRRSTVTI 379
           ++    T+ I
Sbjct: 482 KFTELKTIWI 491


>gnl|CDD|143406 cd07087, ALDH_F3-13-14_CALDH-like, ALDH subfamily: Coniferyl
           aldehyde dehydrogenase, ALDH families 3, 13, and 14, and
           other related proteins.  ALDH subfamily which includes
           NAD(P)+-dependent, aldehyde dehydrogenase, family 3
           member A1 and B1  (ALDH3A1, ALDH3B1,  EC=1.2.1.5) and
           fatty aldehyde dehydrogenase, family 3 member A2
           (ALDH3A2, EC=1.2.1.3), and also plant ALDH family
           members ALDH3F1, ALDH3H1, and ALDH3I1, fungal ALDH14
           (YMR110C) and the protozoan family 13 member (ALDH13),
           as well as coniferyl aldehyde dehydrogenases (CALDH,
           EC=1.2.1.68), and other similar  sequences, such as the
           Pseudomonas putida benzaldehyde dehydrogenase I that is
           involved in the metabolism of mandelate.
          Length = 426

 Score = 49.4 bits (119), Expect = 2e-06
 Identities = 19/67 (28%), Positives = 39/67 (58%)

Query: 252 LHKKKIDRPGYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLS 311
           +   ++D+   ++ PTI+  +  +S ++  E F PI+ +  +D LDEAI + N   + L+
Sbjct: 300 VIGGQVDKEERYIAPTILDDVSPDSPLMQEEIFGPILPILTYDDLDEAIEFINSRPKPLA 359

Query: 312 SSIFTKD 318
             +F++D
Sbjct: 360 LYLFSED 366



 Score = 39.4 bits (93), Expect = 0.003
 Identities = 15/36 (41%), Positives = 23/36 (63%)

Query: 209 LELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRC 244
           LELGG +  IV++DA+L +A + + +     AGQ C
Sbjct: 205 LELGGKSPCIVDKDANLEVAARRIAWGKFLNAGQTC 240



 Score = 33.7 bits (78), Expect = 0.17
 Identities = 9/15 (60%), Positives = 12/15 (80%)

Query: 170 PLGVVGIISAFNFPV 184
           PLGVV II  +N+P+
Sbjct: 100 PLGVVLIIGPWNYPL 114


>gnl|CDD|143466 cd07148, ALDH_RL0313, Uncharacterized ALDH ( RL0313) with
           similarity to Tortula ruralis aldehyde dehydrogenase
           ALDH21A1.  Uncharacterized aldehyde dehydrogenase (locus
           RL0313) with sequence similarity to the moss Tortula
           ruralis aldehyde dehydrogenase ALDH21A1 (RNP123)
           believed to play an important role in the detoxification
           of aldehydes generated in response to desiccation- and
           salinity-stress, and similar sequences are included in
           this CD.
          Length = 455

 Score = 48.2 bits (115), Expect = 4e-06
 Identities = 39/168 (23%), Positives = 64/168 (38%), Gaps = 57/168 (33%)

Query: 209 LELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLHK-------------- 254
           LE GG   +IV+  ADL+     LV      AGQ C + +R+F+                
Sbjct: 230 LEHGGAAPVIVDRSADLDAMIPPLVKGGFYHAGQVCVSVQRVFVPAEIADDFAQRLAAAA 289

Query: 255 ---------------------KKIDRPGYFVEP------TIVTG---------------- 271
                                +++DR   +V         ++ G                
Sbjct: 290 EKLVVGDPTDPDTEVGPLIRPREVDRVEEWVNEAVAAGARLLCGGKRLSDTTYAPTVLLD 349

Query: 272 LPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKDV 319
            P +++V  +E F P+V V+ +D LDEAI   N +     +++FTKD+
Sbjct: 350 PPRDAKVSTQEIFGPVVCVYSYDDLDEAIAQANSLPVAFQAAVFTKDL 397



 Score = 45.9 bits (109), Expect = 2e-05
 Identities = 40/156 (25%), Positives = 64/156 (41%), Gaps = 20/156 (12%)

Query: 55  INPSTGETIASVQQGNVQDYHRCIESSRAAY---AAWAALPAPRRGEIVRQIGDALRHKL 111
           +NP   + I  V   +     + ++++ A +     W  LPA  R  I+ ++ D +  + 
Sbjct: 4   VNPFDLKPIGEVPTVDWAAIDKALDTAHALFLDRNNW--LPAHERIAILERLADLMEERA 61

Query: 112 VPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSILP-------SERPGHVL 164
             L  L++ E GK L +   EV   ID  + A        G  +P       + R     
Sbjct: 62  DELALLIAREGGKPLVDAKVEVTRAIDGVELAADELGQLGGREIPMGLTPASAGRIAFTT 121

Query: 165 LENWNPLGVVGIISAFNFPVAVYGWNAAIALVGVAV 200
            E   P+GVV  ISAFN P+     N  +  V  A+
Sbjct: 122 RE---PIGVVVAISAFNHPL-----NLIVHQVAPAI 149


>gnl|CDD|143446 cd07128, ALDH_MaoC-N, N-terminal domain of the monoamine oxidase C
           dehydratase.  The N-terminal domain of the MaoC
           dehydratase, a monoamine oxidase regulatory protein.
           Orthologs of MaoC include PaaZ (Escherichia coli) and
           PaaN (Pseudomonas putida), which are putative
           ring-opening enzymes of the aerobic phenylacetic acid
           (PA) catabolic pathway. The C-terminal domain of MaoC
           has sequence similarity to enoyl-CoA hydratase. Also
           included in this CD is a novel Burkholderia xenovorans
           LB400 ALDH of the aerobic benzoate oxidation (box)
           pathway. This pathway involves first the synthesis of a
           CoA thio-esterified aromatic acid, with subsequent
           dihydroxylation and cleavage steps, yielding the CoA
           thio-esterified aliphatic aldehyde,
           3,4-dehydroadipyl-CoA semialdehyde, which is further
           converted into its corresponding CoA acid by the
           Burkholderia LB400 ALDH.
          Length = 513

 Score = 47.6 bits (114), Expect = 8e-06
 Identities = 21/64 (32%), Positives = 30/64 (46%), Gaps = 2/64 (3%)

Query: 261 GYFVEPTIVTGL-PHNSRVVHR-ETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKD 318
           G F  PT++    P  +  VH  E F P+  + P+DSL EAI      +  L +S+ T D
Sbjct: 374 GAFFPPTLLLCDDPDAATAVHDVEAFGPVATLMPYDSLAEAIELAARGRGSLVASVVTND 433

Query: 319 VTNL 322
               
Sbjct: 434 PAFA 437



 Score = 33.8 bits (78), Expect = 0.17
 Identities = 43/173 (24%), Positives = 61/173 (35%), Gaps = 46/173 (26%)

Query: 40  YDGTSWKAN-GEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPR--- 95
           Y    W A  G+     +  TGE +A V    + D+   +     AYA     PA R   
Sbjct: 4   YVAGQWHAGTGDGRTLHDAVTGEVVARVSSEGL-DFAAAV-----AYAREKGGPALRALT 57

Query: 96  ---RGEIVRQIGDAL-RHKLVPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYS 151
              R  +++ +   L   K      L +L      A G      +IDI D  +G    Y+
Sbjct: 58  FHERAAMLKALAKYLMERK----EDLYALS----AATGATRRDSWIDI-DGGIGTLFAYA 108

Query: 152 G---SILPSERPGHVLLEN----------------WNPLGVVGI-ISAFNFPV 184
                 LP+    H L+E                   P   V + I+AFNFPV
Sbjct: 109 SLGRRELPNA---HFLVEGDVEPLSKDGTFVGQHILTPRRGVAVHINAFNFPV 158


>gnl|CDD|143450 cd07132, ALDH_F3AB, Aldehyde dehydrogenase family 3 members A1, A2,
           and B1 and related proteins.  NAD(P)+-dependent,
           aldehyde dehydrogenase, family 3 members A1 and B1
           (ALDH3A1, ALDH3B1,  EC=1.2.1.5) and fatty aldehyde
           dehydrogenase, family 3 member A2 (ALDH3A2, EC=1.2.1.3),
           and similar sequences are included in this CD. Human
           ALDH3A1 is a homodimer with a critical role in cellular
           defense against oxidative stress; it catalyzes the
           oxidation of various cellular membrane lipid-derived
           aldehydes. Corneal crystalline ALDH3A1 protects the
           cornea and underlying lens against UV-induced oxidative
           stress. Human ALDH3A2, a microsomal homodimer, catalyzes
           the oxidation of long-chain aliphatic aldehydes to fatty
           acids. Human ALDH3B1 is highly expressed in the kidney
           and liver and catalyzes the oxidation of various medium-
           and long-chain saturated and unsaturated aliphatic
           aldehydes.
          Length = 443

 Score = 45.7 bits (109), Expect = 3e-05
 Identities = 19/63 (30%), Positives = 36/63 (57%), Gaps = 2/63 (3%)

Query: 263 FVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFT--KDVT 320
           ++ PT++T +  +  V+  E F PI+ +   ++LDEAI + N  ++ L+  +F+  K V 
Sbjct: 311 YIAPTVLTDVKPSDPVMQEEIFGPILPIVTVNNLDEAIEFINSREKPLALYVFSNNKKVI 370

Query: 321 NLF 323
           N  
Sbjct: 371 NKI 373



 Score = 31.0 bits (71), Expect = 1.2
 Identities = 10/20 (50%), Positives = 14/20 (70%)

Query: 167 NWNPLGVVGIISAFNFPVAV 186
              PLGVV II A+N+P+ +
Sbjct: 97  YKEPLGVVLIIGAWNYPLQL 116


>gnl|CDD|143451 cd07133, ALDH_CALDH_CalB, Coniferyl aldehyde dehydrogenase-like.
           Coniferyl aldehyde dehydrogenase (CALDH, EC=1.2.1.68) of
           Pseudomonas sp. strain HR199 (CalB) which catalyzes the
           NAD+-dependent oxidation of coniferyl aldehyde to
           ferulic acid, and similar sequences, are present in this
           CD.
          Length = 434

 Score = 44.0 bits (105), Expect = 1e-04
 Identities = 16/35 (45%), Positives = 23/35 (65%)

Query: 266 PTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAI 300
           PT+V  +  + RV+  E F PI+ +  +DSLDEAI
Sbjct: 322 PTLVLNVTDDMRVMQEEIFGPILPILTYDSLDEAI 356



 Score = 40.9 bits (97), Expect = 8e-04
 Identities = 19/57 (33%), Positives = 28/57 (49%), Gaps = 3/57 (5%)

Query: 209 LELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLHKKKIDRPGYFVE 265
           LELGG +  I+  DADL  A + + F     AGQ C     + + + K++    FV 
Sbjct: 206 LELGGKSPAIIAPDADLAKAAERIAFGKLLNAGQTCVAPDYVLVPEDKLEE---FVA 259



 Score = 37.5 bits (88), Expect = 0.011
 Identities = 10/15 (66%), Positives = 13/15 (86%)

Query: 170 PLGVVGIISAFNFPV 184
           PLGVVGII  +N+P+
Sbjct: 101 PLGVVGIIVPWNYPL 115


>gnl|CDD|233324 TIGR01236, D1pyr5carbox1, delta-1-pyrroline-5-carboxylate
           dehydrogenase, group 1.  This model represents one of
           two related branches of delta-1-pyrroline-5-carboxylate
           dehydrogenase. The two branches are not as closely
           related to each other as some aldehyde dehydrogenases
           are to this branch, and separate models are built for
           this reason. The enzyme is the second of two in the
           degradation of proline to glutamate [Energy metabolism,
           Amino acids and amines].
          Length = 532

 Score = 43.6 bits (103), Expect = 1e-04
 Identities = 34/113 (30%), Positives = 52/113 (46%), Gaps = 10/113 (8%)

Query: 258 DRPGYFVEPTIVTGLPHNSRVVHRETFAPI--VYVFPFDSLDEAITW--NNEVKQGLSSS 313
           D  GYFVEPT+V        ++  E F P+  VYV+P D   E I    ++    GL+ +
Sbjct: 392 DSVGYFVEPTVVESKDPQEPLMKEEIFGPVLTVYVYPDDKYKE-ILDLVDSTTSYGLTGA 450

Query: 314 IFTKDVTNLFKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRESGSD 366
           +F KD   + +         G   +N    GA +     G++  GG R SG++
Sbjct: 451 VFAKDRDAILEADKVLRFAAGNFYINDKPTGAVV-----GQQPFGGARASGTN 498



 Score = 41.7 bits (98), Expect = 5e-04
 Identities = 47/175 (26%), Positives = 66/175 (37%), Gaps = 19/175 (10%)

Query: 37  PGVYDGTSWKANGEIIPSINPST-GETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPR 95
           P V  G     +GE I  ++P      +A       +  ++ +E++  A   W+ LP   
Sbjct: 33  PLVIGGEEVWTSGERIYQVSPHNHQAVLAKATNATEELANKAVEAALDAKKEWSLLPFYD 92

Query: 96  RGEIVRQIGDAL----RHKLVP---LGQLVSLEMGK--ILAEGIGEVQEFIDICDYAVGL 146
           R  I  +  D L    R +++    LGQ       K    AE I  V E ID   +    
Sbjct: 93  RAAIFLKAADLLSGPYRAEILAATMLGQ------SKTVYQAE-IDAVAELIDFFRFNAKY 145

Query: 147 SRTYSGSILPSERPGHVLLENWNPL-GVVGIISAFNFPVAVYGWNAAIALVGVAV 200
           +R       P   PG      + PL G V  IS FNF         A AL+G  V
Sbjct: 146 ARELYEQ-QPISAPGEWNRTEYRPLEGFVYAISPFNFTAIAGNLAGAPALMGNTV 199



 Score = 28.2 bits (63), Expect = 9.3
 Identities = 12/53 (22%), Positives = 24/53 (45%)

Query: 204 FGKLLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLHKKK 256
           F +++ E GG +  +V+  AD+       +       GQ+C+   RL++    
Sbjct: 277 FPRIVGETGGKDFHVVHPSADVEHVVSGTIRGAFEYQGQKCSACSRLYVPHSL 329


>gnl|CDD|143441 cd07123, ALDH_F4-17_P5CDH, Delta(1)-pyrroline-5-carboxylate
           dehydrogenase, ALDH families 4 and 17.
           Delta(1)-pyrroline-5-carboxylate dehydrogenase
           (EC=1.5.1.12 ), families 4 and 17: a proline catabolic
           enzyme of the aldehyde dehydrogenase (ALDH) protein
           superfamily.  Delta(1)-pyrroline-5-carboxylate
           dehydrogenase (P5CDH), also known as ALDH4A1 in humans, 
           is a mitochondrial  homodimer involved in proline
           degradation and catalyzes the NAD + -dependent
           conversion of P5C to glutamate. This is a necessary step
           in the pathway interconnecting the urea and
           tricarboxylic acid cycles. The preferred substrate is
           glutamic gamma-semialdehyde, other substrates include
           succinic, glutaric and adipic semialdehydes. Also
           included in this CD is the Aldh17 Drosophila
           melanogaster (Q9VUC0) P5CDH and similar sequences.
          Length = 522

 Score = 42.2 bits (100), Expect = 4e-04
 Identities = 20/64 (31%), Positives = 33/64 (51%), Gaps = 3/64 (4%)

Query: 258 DRPGYFVEPTIVTGLPHNSRVVHRETFAPI--VYVFPFDSLDEAI-TWNNEVKQGLSSSI 314
           D  GYFVEPT++       +++  E F P+  VYV+P    +E +   +      L+ +I
Sbjct: 392 DSVGYFVEPTVIETTDPKHKLMTEEIFGPVLTVYVYPDSDFEETLELVDTTSPYALTGAI 451

Query: 315 FTKD 318
           F +D
Sbjct: 452 FAQD 455



 Score = 28.3 bits (64), Expect = 8.3
 Identities = 37/138 (26%), Positives = 54/138 (39%), Gaps = 27/138 (19%)

Query: 60  GETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVRQIGDAL----RHKLVP-- 113
              +A+    +     + IE++  A   WA +P   R  I  +  D L    R++L    
Sbjct: 57  AHVLATYHYADAALVEKAIEAALEARKEWARMPFEDRAAIFLKAADLLSGKYRYELNAAT 116

Query: 114 -LGQLVSLEMGK--ILAEGIGEVQEFIDICDYAVGLSRTYSGSILPSERPGHVLLENWN- 169
            LGQ      GK    AE I    E ID   + V     Y+  +  +++P       WN 
Sbjct: 117 MLGQ------GKNVWQAE-IDAACELIDFLRFNV----KYAEELY-AQQPLSSPAGVWNR 164

Query: 170 ----PL-GVVGIISAFNF 182
               PL G V  +S FNF
Sbjct: 165 LEYRPLEGFVYAVSPFNF 182


>gnl|CDD|143403 cd07084, ALDH_KGSADH-like, ALDH subfamily: NAD(P)+-dependent
           alpha-ketoglutaric semialdehyde dehydrogenases and plant
           delta(1)-pyrroline-5-carboxylate dehydrogenase, ALDH
           family 12-like.  ALDH subfamily which includes the
           NAD(P)+-dependent, alpha-ketoglutaric semialdehyde
           dehydrogenases (KGSADH, EC 1.2.1.26); plant
           delta(1)-pyrroline-5-carboxylate dehydrogenase (P5CDH,
           EC=1.5.1.12 ), ALDH family 12; the N-terminal domain of
           the MaoC (monoamine oxidase C) dehydratase regulatory
           protein; and orthologs of MaoC, PaaZ and PaaN, which are
           putative ring-opening enzymes of the aerobic
           phenylacetic acid catabolic pathway.
          Length = 442

 Score = 38.8 bits (90), Expect = 0.004
 Identities = 18/62 (29%), Positives = 28/62 (45%), Gaps = 3/62 (4%)

Query: 203 RFGKLLLELGGNNAIIVNEDADL--NLAQQCLVFACCGTAGQRCTTTRRLFLHKKKIDRP 260
           +  ++ LEL G N  ++  DA     +A QC V      +GQ+CT    LF+ +     P
Sbjct: 200 KQARIYLELAGFNWKVLGPDAQAVDYVAWQC-VQDMTACSGQKCTAQSMLFVPENWSKTP 258

Query: 261 GY 262
             
Sbjct: 259 LV 260



 Score = 32.6 bits (74), Expect = 0.34
 Identities = 32/125 (25%), Positives = 46/125 (36%), Gaps = 7/125 (5%)

Query: 76  RCIESSRAAYAAWAALPAPRRGEIVRQIGDALRHKLVPLGQLVSLEMGK--ILAEGIGEV 133
           R + ++  +  A   L  P+R + + +I   L  K   +     L  GK  + AE I   
Sbjct: 3   RALLAADISTKAARRLALPKRADFLARIIQRLAAKSYDIAAGAVLVTGKGWMFAENICGD 62

Query: 134 QEFIDICDYAVGLSRTY---SGSILPSERPGHVLLENWNPLGVVGIISAFNFPVAVYGWN 190
           Q         V  S       G+ L            W P G V +I AFNFP+ +    
Sbjct: 63  Q-VQLRARAFVIYSYRIPHEPGNHLGQGLKQQSHGYRW-PYGPVLVIGAFNFPLWIPLLQ 120

Query: 191 AAIAL 195
            A AL
Sbjct: 121 LAGAL 125


>gnl|CDD|236931 PRK11563, PRK11563, bifunctional aldehyde dehydrogenase/enoyl-CoA
           hydratase; Provisional.
          Length = 675

 Score = 37.9 bits (89), Expect = 0.009
 Identities = 22/60 (36%), Positives = 30/60 (50%), Gaps = 2/60 (3%)

Query: 261 GYFVEPTIVT-GLPHNSRVVHR-ETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKD 318
           G F  PT++    P  +  VH  E F P+  + P+D LDEAI      K  L +S+ T D
Sbjct: 378 GAFFPPTLLYCDDPLEAPAVHDVEAFGPVSTLMPYDDLDEAIELAARGKGSLVASLVTAD 437


>gnl|CDD|143455 cd07137, ALDH_F3FHI, Plant aldehyde dehydrogenase family 3 members
           F1, H1, and I1 and related proteins.  Aldehyde
           dehydrogenase family members 3F1, 3H1, and 3I1 (ALDH3F1,
           ALDH3H1, and ALDH3I1), and similar plant sequences, are
           in this CD.  In Arabidopsis thaliana, stress-regulated
           expression of ALDH3I1  was observed in  leaves and
           osmotic stress expression of  ALDH3H1 was observed in
           root tissue, whereas, ALDH3F1 expression was not stress
           responsive. Functional analysis of ALDH3I1 suggest it
           may be involved in a detoxification pathway in plants
           that limits aldehyde accumulation and oxidative stress.
          Length = 432

 Score = 37.4 bits (87), Expect = 0.011
 Identities = 18/67 (26%), Positives = 37/67 (55%)

Query: 252 LHKKKIDRPGYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLS 311
           +H  + D    ++EPTI+   P +S ++  E F P++ +     ++E+I   N   + L+
Sbjct: 306 VHGGERDEKNLYIEPTILLDPPLDSSIMTEEIFGPLLPIITVKKIEESIEIINSRPKPLA 365

Query: 312 SSIFTKD 318
           + +FTK+
Sbjct: 366 AYVFTKN 372



 Score = 29.3 bits (66), Expect = 3.7
 Identities = 11/14 (78%), Positives = 13/14 (92%)

Query: 170 PLGVVGIISAFNFP 183
           PLGVV +ISA+NFP
Sbjct: 101 PLGVVLVISAWNFP 114



 Score = 28.9 bits (65), Expect = 4.5
 Identities = 14/39 (35%), Positives = 20/39 (51%), Gaps = 5/39 (12%)

Query: 209 LELGGNNAIIVNEDADLNLAQQCLV---FACCGTAGQRC 244
           LELGG   +IV+   DL +A + +    + C    GQ C
Sbjct: 206 LELGGKCPVIVDSTVDLKVAVRRIAGGKWGCN--NGQAC 242


>gnl|CDD|165847 PLN02203, PLN02203, aldehyde dehydrogenase.
          Length = 484

 Score = 35.9 bits (83), Expect = 0.035
 Identities = 37/146 (25%), Positives = 61/146 (41%), Gaps = 32/146 (21%)

Query: 191 AAIALVGVAVQSRFGKLLLEL---------GGNN------AIIVNEDADL---NLAQQCL 232
           A IA+  V V+ RF  +L+EL         G N       A I+N+       NL +   
Sbjct: 251 ACIAIDYVLVEERFAPILIELLKSTIKKFFGENPRESKSMARILNKKHFQRLSNLLKDPR 310

Query: 233 VFACCGTAGQRCTTTRRLFLHKKKIDRPGYFVEPTIVTGLPHNSRVVHRETFAPIVYVFP 292
           V A                +H   ID    F+EPTI+   P +S ++  E F P++ +  
Sbjct: 311 VAAS--------------IVHGGSIDEKKLFIEPTILLNPPLDSDIMTEEIFGPLLPIIT 356

Query: 293 FDSLDEAITWNNEVKQGLSSSIFTKD 318
              ++++I + N   + L+   FT +
Sbjct: 357 VKKIEDSIAFINSKPKPLAIYAFTNN 382


>gnl|CDD|143439 cd07121, ALDH_EutE, Ethanolamine utilization protein EutE-like.
           Coenzyme A acylating aldehyde dehydrogenase (ACDH), an
           NAD+ and CoA-dependent acetaldehyde dehydrogenase,
           acetylating (EC=1.2.1.10), converts acetaldehyde into
           acetyl-CoA.  This CD is limited to such monofunctional
           enzymes as the Ethanolamine utilization protein, EutE,
           in Salmonella typhimurium.  Mutations in eutE abolish
           the ability to utilize ethanolamine as a carbon source.
          Length = 429

 Score = 35.7 bits (83), Expect = 0.040
 Identities = 39/173 (22%), Positives = 67/173 (38%), Gaps = 30/173 (17%)

Query: 198 VAVQSRFGKLLLELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLHKKKI 257
           +AV S    L+  +  N A ++N++    L +  L             T +    +KK +
Sbjct: 247 IAVDSVADYLIAAMQRNGAYVLNDEQAEQLLEVVL------------LTNKGATPNKKWV 294

Query: 258 DR-PGYFVE----------PTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEV 306
            +     ++            I+     +   V  E   PI+ V    + DEAI    E+
Sbjct: 295 GKDASKILKAAGIEVPADIRLIIVETDKDHPFVVEEQMMPILPVVRVKNFDEAIELAVEL 354

Query: 307 KQGL--SSSIFTKDVTNLFKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYT 357
           + G   ++ I +K+V NL K    +     +   N P+  A +G   GGE YT
Sbjct: 355 EHGNRHTAIIHSKNVENLTKM--ARAMQTTIFVKNGPS-YAGLG--VGGEGYT 402


>gnl|CDD|237016 PRK11903, PRK11903, aldehyde dehydrogenase; Provisional.
          Length = 521

 Score = 34.7 bits (80), Expect = 0.076
 Identities = 14/60 (23%), Positives = 25/60 (41%), Gaps = 2/60 (3%)

Query: 261 GYFVEPTI-VTGLPHNSRVVH-RETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKD 318
              V PT+     P  +  VH  E F P+  + P+     A+      +  L +S+++ D
Sbjct: 377 AACVGPTLLGASDPDAATAVHDVEVFGPVATLLPYRDAAHALALARRGQGSLVASVYSDD 436


>gnl|CDD|177831 PLN02174, PLN02174, aldehyde dehydrogenase family 3 member H1.
          Length = 484

 Score = 33.1 bits (75), Expect = 0.28
 Identities = 28/120 (23%), Positives = 55/120 (45%), Gaps = 1/120 (0%)

Query: 258 DRPGYFVEPTIVTGLPHNSRVVHRETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTK 317
           DR    + PTI+  +P +S ++  E F P++ +   ++L+E+        + L++ +FT 
Sbjct: 323 DRENLKIAPTILLDVPLDSLIMSEEIFGPLLPILTLNNLEESFDVIRSRPKPLAAYLFTH 382

Query: 318 DVTNLFKWLGPQGSDCGLINVNIPTNGAEIGGAFGGEKYTGGGRESGSDSWKQYCRRSTV 377
           +   L +      S  G++  +I  + A     FGG   +G G   G  S+  +  +  V
Sbjct: 383 N-KKLKERFAATVSAGGIVVNDIAVHLALHTLPFGGVGESGMGAYHGKFSFDAFSHKKAV 441



 Score = 29.2 bits (65), Expect = 4.0
 Identities = 14/53 (26%), Positives = 27/53 (50%), Gaps = 1/53 (1%)

Query: 195 LVGVAVQSRFGKLLLELGGNNAIIVNEDADLNLAQQCLVFACCG-TAGQRCTT 246
           ++  A       ++LELGG + ++V+ D DL +  + ++    G   GQ C +
Sbjct: 203 VIMAAAAKHLTPVVLELGGKSPVVVDSDTDLKVTVRRIIAGKWGCNNGQACIS 255


>gnl|CDD|143445 cd07127, ALDH_PAD-PaaZ, Phenylacetic acid degradation proteins PaaZ
           (Escherichia coli) and PaaN (Pseudomonas putida)-like.
           Phenylacetic acid degradation (PAD) proteins PaaZ
           (Escherichia coli) and PaaN (Pseudomonas putida) are
           putative aromatic ring cleavage enzymes of the aerobic
           PA catabolic pathway. PaaZ mutants were defective for
           growth with PA as a sole carbon source due to
           interruption of the putative ring opening system.  This
           CD is limited to bacterial monofunctional enzymes.
          Length = 549

 Score = 32.5 bits (74), Expect = 0.41
 Identities = 14/49 (28%), Positives = 25/49 (51%)

Query: 210 ELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLHKKKID 258
           E  G N ++V+   DL    + L F+    +GQ CTT + +++ +  I 
Sbjct: 305 EKAGVNTVVVDSTDDLKAMLRNLAFSLSLYSGQMCTTPQNIYVPRDGIQ 353


>gnl|CDD|131341 TIGR02288, PaaN_2, phenylacetic acid degradation protein paaN.
           This enzyme is proposed to act in the ring-opening step
           of phenylacetic acid degradation which follows ligation
           of the acid with coenzyme A (by PaaF) and hydroxylation
           by a multicomponent non-heme iron hydroxylase complex
           (PaaGHIJK). Gene symbols have been standardized in. This
           enzyme is related to aldehyde dehydrogenases and has a
           domain which is a member of the pfam00171 family. This
           family includes sequences from Burkholderia, Bordetella,
           Streptomyces. Other PaaN enzymes are represented by a
           separate model, TIGR02278.
          Length = 551

 Score = 32.5 bits (74), Expect = 0.42
 Identities = 14/49 (28%), Positives = 23/49 (46%)

Query: 210 ELGGNNAIIVNEDADLNLAQQCLVFACCGTAGQRCTTTRRLFLHKKKID 258
           E  G N +I+    D     + L F+    +GQ CTTT+ + + +  I 
Sbjct: 305 EKAGVNTVIIESTDDYKAMLRNLAFSLSLYSGQMCTTTQAILVPRDGIR 353


>gnl|CDD|131331 TIGR02278, PaaN-DH, phenylacetic acid degradation protein paaN.
           This enzyme is proposed to act in the ring-opening step
           of phenylacetic acid degradation which follows ligation
           of the acid with coenzyme A (by PaaF) and hydroxylation
           by a multicomponent non-heme iron hydroxylase complex
           (PaaGHIJK). Gene symbols have been standardized in. This
           enzyme is related to aldehyde dehydrogenases and has
           domains which are members of the pfam00171 and pfam01575
           families. This family includes paaN genes from
           Pseudomonas, Sinorhizobium, Rhodopseudomonas,
           Escherichia, Deinococcus and Corynebacterium. Another
           homology family (TIGR02288) includes several other
           species.
          Length = 663

 Score = 31.3 bits (71), Expect = 0.94
 Identities = 16/59 (27%), Positives = 24/59 (40%), Gaps = 1/59 (1%)

Query: 261 GYFVEPTIVTGLPHNSRVVHR-ETFAPIVYVFPFDSLDEAITWNNEVKQGLSSSIFTKD 318
           G F  PT++      +  VH  E F P+   FP+    EA          L +++ T D
Sbjct: 367 GAFFPPTLLLAEDPWAGAVHATEAFGPVATFFPYGDRAEAARLAARGGGSLVATLATSD 425



 Score = 29.0 bits (65), Expect = 4.6
 Identities = 17/49 (34%), Positives = 26/49 (53%), Gaps = 5/49 (10%)

Query: 215 NAIIVNEDA-----DLNLAQQCLVFACCGTAGQRCTTTRRLFLHKKKID 258
           NA I+ EDA     + +L  Q +V      AGQ+CT  RR+ + K  ++
Sbjct: 258 NAAILGEDATPDEPEFDLFAQEIVRELTIKAGQKCTAIRRVIVPKALLE 306


>gnl|CDD|223093 COG0014, ProA, Gamma-glutamyl phosphate reductase [Amino acid
           transport and metabolism].
          Length = 417

 Score = 30.6 bits (70), Expect = 1.3
 Identities = 46/166 (27%), Positives = 62/166 (37%), Gaps = 36/166 (21%)

Query: 213 GNNAIIVNEDADLNLAQQCLVFACCGTAGQR---CTTTRRLFLHKKKIDRP--------- 260
           GN  I V+E ADL+ A + +V A      QR   C     L +H+               
Sbjct: 226 GNCHIYVDESADLDKALKIIVNAKT----QRPSVCNAAETLLVHRAIAKSFLPKLANALQ 281

Query: 261 GYFVE----PTIVTGLPHNSRVVHRETF-----APIVYVFPFDSLDEAITWNNEVKQGLS 311
              VE       +  LP   +    E +       I+ V   DSLDEAI   N    G S
Sbjct: 282 EAGVELRGDAEALALLPDAVKPATEEDWDTEYLDLILAVKVVDSLDEAIAHINTYGSGHS 341

Query: 312 SSIFTKDVTNLFKWLGPQGSDCGLINVNIPTN---------GAEIG 348
            +I T+D  N  +++     D   + VN  T          GAEIG
Sbjct: 342 DAIITEDYANAERFV--NEVDSAAVYVNASTRFTDGGQFGLGAEIG 385


>gnl|CDD|214338 CHL00025, ndhF, NADH dehydrogenase subunit 5.
          Length = 741

 Score = 30.2 bits (69), Expect = 2.5
 Identities = 14/24 (58%), Positives = 14/24 (58%), Gaps = 2/24 (8%)

Query: 342 TNGAEIGGAFGGE--KYTGGGRES 363
           TNG  I   F GE  KY GGGR S
Sbjct: 694 TNGVGITSFFVGEGIKYVGGGRIS 717


>gnl|CDD|143447 cd07129, ALDH_KGSADH, Alpha-Ketoglutaric Semialdehyde
           Dehydrogenase.  Alpha-Ketoglutaric Semialdehyde (KGSA)
           Dehydrogenase (KGSADH, EC 1.2.1.26) catalyzes the
           NAD(P)+-dependent conversion of KGSA to
           alpha-ketoglutarate. This CD contains such sequences as
           those seen in Azospirillum brasilense, KGSADH-II
           (D-glucarate/D-galactarate-inducible) and KGSADH-III
           (hydroxy-L-proline-inducible). Both show similar high
           substrate specificity for KGSA and different coenzyme
           specificity; KGSADH-II is NAD+-dependent and KGSADH-III
           is NADP+-dependent. Also included in this CD is the
           NADP(+)-dependent aldehyde dehydrogenase from Vibrio
           harveyi which catalyzes the oxidation of long-chain
           aliphatic aldehydes to acids.
          Length = 454

 Score = 29.4 bits (67), Expect = 4.0
 Identities = 32/137 (23%), Positives = 51/137 (37%), Gaps = 24/137 (17%)

Query: 76  RCIESSRAAYAAWAALPAPRRGEIVRQIGDALRHKLVPLGQLVSLEMGKILAEG------ 129
               ++ AA+ ++ AL   RR   +  I D +      L      E G  L E       
Sbjct: 3   AAAAAAAAAFESYRALSPARRAAFLEAIADEIEALGDELVARAHAETG--LPEARLQGEL 60

Query: 130 ---IGEVQEFIDICDYAVGLSRTYSGSILPSERPGHVLLENWN------PLGVVGIISAF 180
               G+++ F D+         ++  + +    P    L   +      PLG V +  A 
Sbjct: 61  GRTTGQLRLFADLVR-----EGSWLDARIDPADPDRQPLPRPDLRRMLVPLGPVAVFGAS 115

Query: 181 NFPVA--VYGWNAAIAL 195
           NFP+A  V G + A AL
Sbjct: 116 NFPLAFSVAGGDTASAL 132


>gnl|CDD|132150 TIGR03106, trio_M42_hydro, hydrolase, peptidase M42 family.  This
           model describes a subfamily of MEROPS peptidase family
           M42, a glutamyl aminopeptidase family that also includes
           the cellulase CelM from Clostridium thermocellum and
           deblocking aminopeptidases that can remove acylated
           amino acids. Members of this family occur in a three
           gene cassette with an amidotransferase (TIGR03104)in the
           asparagine synthase (glutamine-hydrolyzing) family, and
           a probable acetyltransferase (TIGR03103) in the GNAT
           family.
          Length = 343

 Score = 28.9 bits (65), Expect = 4.3
 Identities = 22/67 (32%), Positives = 31/67 (46%), Gaps = 14/67 (20%)

Query: 97  GEIVRQIGDALRHKLVPLGQLVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTYSGSILP 156
           G +VR++ D  R +LVP+G   +       AEG   V  F D        S  + G+ILP
Sbjct: 71  GAMVRELKDNGRLELVPIGHWSAR-----FAEG-ARVTIFTD--------SGEFRGTILP 116

Query: 157 SERPGHV 163
            +  GH 
Sbjct: 117 LKASGHA 123


>gnl|CDD|221215 pfam11771, DUF3314, Protein of unknown function (DUF3314).  This
           small family contains human, mouse and fish members but
           the function is not known.
          Length = 162

 Score = 28.4 bits (63), Expect = 5.0
 Identities = 21/98 (21%), Positives = 27/98 (27%), Gaps = 8/98 (8%)

Query: 4   SVRHLQCLRFRSYSDSVSKYPFLKELGLSGSVNPGVYDGTSWKANGEIIPSINPSTGETI 63
           SV H    +F      VS + +        S   G+Y    W              GE  
Sbjct: 17  SVSHFAIGQFDIRGHEVSIFRYCAPTPYLASGFTGLYKCMRWNVEAPG---EAAGAGEAS 73

Query: 64  ASVQQGNVQDYHRCIESSRAAYAAWAALPAPRRGEIVR 101
              +      Y  C E +R        L A   GE  R
Sbjct: 74  DGTEY-----YFLCYEDTREPAKEAGPLGADSCGEAQR 106


>gnl|CDD|235751 PRK06241, PRK06241, phosphoenolpyruvate synthase; Validated.
          Length = 871

 Score = 29.1 bits (66), Expect = 5.3
 Identities = 13/44 (29%), Positives = 18/44 (40%), Gaps = 5/44 (11%)

Query: 61  ETIASVQQGNVQDYHRCIESSRAAYAAWAALPAPR----RGEIV 100
           E +  V + N  DY   I   +  Y  +  L  PR     GEI+
Sbjct: 712 EELREVVRTNKLDY-ELIAKRKEEYELYEKLTPPRVMTSDGEII 754


>gnl|CDD|115934 pfam07310, PAS_5, PAS domain.  This family contains a number of
          hypothetical bacterial proteins of unknown function
          approximately 200 residues long. This region is is
          distantly similar to other PAS domains.
          Length = 137

 Score = 27.7 bits (62), Expect = 6.6
 Identities = 11/24 (45%), Positives = 14/24 (58%), Gaps = 4/24 (16%)

Query: 80 SSRAAYAAWAAL----PAPRRGEI 99
          S+RA +A W  L     APRR +I
Sbjct: 5  STRALFAYWNELRGGRAAPRRSDI 28


>gnl|CDD|237395 PRK13482, PRK13482, DNA integrity scanning protein DisA;
           Provisional.
          Length = 352

 Score = 28.2 bits (64), Expect = 6.9
 Identities = 19/59 (32%), Positives = 34/59 (57%), Gaps = 5/59 (8%)

Query: 95  RRGEIVRQIGDALRHKLVPLGQ---LVSLEMGKILAEGIGEVQEFIDICDYAVGLSRTY 150
           +R E+VR+I + +   +V LG    L+SL++ ++LA G+ E +E + I DY+       
Sbjct: 187 QRLEMVRRISEEIEGYVVELGTEGRLISLQLEELLA-GVEEERELL-IRDYSQEKDEDP 243


>gnl|CDD|216115 pfam00780, CNH, CNH domain.  Domain found in NIK1-like kinase,
           mouse citron and yeast ROM1, ROM2. Unpublished
           observations.
          Length = 266

 Score = 28.0 bits (63), Expect = 9.1
 Identities = 12/46 (26%), Positives = 17/46 (36%), Gaps = 4/46 (8%)

Query: 288 VYVFPFDSLDEAITWNNEVKQGLSSSIFTKDVTNLFKWLGPQGSDC 333
           +YV+P  +LD     +    Q L     TK  +  F   G  G   
Sbjct: 57  LYVYPLSALDSKELSSKSAPQKLPE---TKGCS-FFARAGYCGGRF 98


>gnl|CDD|143444 cd07126, ALDH_F12_P5CDH, Delta(1)-pyrroline-5-carboxylate
           dehydrogenase, ALDH family 12.
           Delta(1)-pyrroline-5-carboxylate dehydrogenase (P5CDH,
           EC=1.5.1.12), family 12: a proline catabolic enzyme of
           the aldehyde dehydrogenase (ALDH) protein superfamily.
           P5CDH is a mitochondrial enzyme involved in proline
           degradation and catalyzes the NAD + -dependent
           conversion of P5C to glutamate.  The P5CDH, ALDH12A1
           gene, in Arabidopsis, has been identified as an
           osmotic-stress-inducible ALDH gene. This CD contains
           both Viridiplantae and Alveolata P5CDH sequences.
          Length = 489

 Score = 28.2 bits (63), Expect = 9.6
 Identities = 37/160 (23%), Positives = 65/160 (40%), Gaps = 26/160 (16%)

Query: 45  WKANGEIIPSINPSTGETIASVQQGNVQDYHRCIESSRAA--YAAWAALPAPRR----GE 98
           WK        ++P  G+   SV   +  + +  ++S R          L  P R    G+
Sbjct: 7   WKGASNYTTLLDPLNGDKFISVPDTDEDEINEFVDSLRQCPKSGLHNPLKNPERYLLYGD 66

Query: 99  IVRQIGDALRHKLVP--LGQLVSLEMGKILAEGIGEV---QEFI-----DICDYAVGLSR 148
           +  ++   LR   V     +L+     K  A+ +GEV   ++F+     D   +   L+R
Sbjct: 67  VSHRVAHELRKPEVEDFFARLIQRVAPKSDAQALGEVVVTRKFLENFAGDQVRF---LAR 123

Query: 149 TY--SGSILPSERPGHVLLENWNPLGVVGIISAFNFPVAV 186
           ++   G     +  G+     W P G V II+ FNFP+ +
Sbjct: 124 SFNVPGDHQGQQSSGY----RW-PYGPVAIITPFNFPLEI 158


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.320    0.138    0.423 

Gapped
Lambda     K      H
   0.267   0.0575    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 20,539,263
Number of extensions: 2024210
Number of successful extensions: 2230
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2039
Number of HSP's successfully gapped: 327
Length of query: 394
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 295
Effective length of database: 6,546,556
Effective search space: 1931234020
Effective search space used: 1931234020
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 60 (27.2 bits)