RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy15673
         (484 letters)



>gnl|CDD|143450 cd07132, ALDH_F3AB, Aldehyde dehydrogenase family 3 members A1, A2,
           and B1 and related proteins.  NAD(P)+-dependent,
           aldehyde dehydrogenase, family 3 members A1 and B1
           (ALDH3A1, ALDH3B1,  EC=1.2.1.5) and fatty aldehyde
           dehydrogenase, family 3 member A2 (ALDH3A2, EC=1.2.1.3),
           and similar sequences are included in this CD. Human
           ALDH3A1 is a homodimer with a critical role in cellular
           defense against oxidative stress; it catalyzes the
           oxidation of various cellular membrane lipid-derived
           aldehydes. Corneal crystalline ALDH3A1 protects the
           cornea and underlying lens against UV-induced oxidative
           stress. Human ALDH3A2, a microsomal homodimer, catalyzes
           the oxidation of long-chain aliphatic aldehydes to fatty
           acids. Human ALDH3B1 is highly expressed in the kidney
           and liver and catalyzes the oxidation of various medium-
           and long-chain saturated and unsaturated aliphatic
           aldehydes.
          Length = 443

 Score =  489 bits (1262), Expect = e-171
 Identities = 166/269 (61%), Positives = 212/269 (78%)

Query: 6   DLVQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIE 65
           + V+ AR+ F SGK++P EFR +QL+ L+R+ EEN+ ++  ALA DLRK K EAVL EI 
Sbjct: 2   EAVRRAREAFSSGKTRPLEFRIQQLEALLRMLEENEDEIVEALAKDLRKPKFEAVLSEIL 61

Query: 66  FLANDVRNTLNHLKQWMTPEKPGKDIANMLDGVYIYPDPYGVCLIIGAWNYPLQLSLLPA 125
            + N+++  +++L +WM PE   K++A +LD VYIY +P GV LIIGAWNYPLQL+L+P 
Sbjct: 62  LVKNEIKYAISNLPEWMKPEPVKKNLATLLDDVYIYKEPLGVVLIIGAWNYPLQLTLVPL 121

Query: 126 AGAIAAGNVVILKPSEVAPASAKIMAELLPKYLDNDTFQVVLGGVEETTELLKHRFDYIF 185
            GAIAAGN V++KPSEV+PA+AK++AEL+PKYLD + + VVLGGVEETTELLK RFDYIF
Sbjct: 122 VGAIAAGNCVVIKPSEVSPATAKLLAELIPKYLDKECYPVVLGGVEETTELLKQRFDYIF 181

Query: 186 YTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKCINAGQTCI 245
           YTGSTSVGKIV QAA +HLTPVTLELGGKSP Y+D S +I++A RR  WGK INAGQTCI
Sbjct: 182 YTGSTSVGKIVMQAAAKHLTPVTLELGGKSPCYVDKSCDIDVAARRIAWGKFINAGQTCI 241

Query: 246 APDYILCSRQVQAQILNQAKAVLDSWYTE 274
           APDY+LC+ +VQ + +   K  L  +Y E
Sbjct: 242 APDYVLCTPEVQEKFVEALKKTLKEFYGE 270



 Score =  156 bits (396), Expect = 2e-42
 Identities = 52/104 (50%), Positives = 70/104 (67%), Gaps = 3/104 (2%)

Query: 315 PLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQILNQAKAVLDSWYTEQ 374
           P Y+D S +I++A RR  WGK INAGQTCIAPDY+LC+ +VQ + +   K  L  +Y E 
Sbjct: 212 PCYVDKSCDIDVAARRIAWGKFINAGQTCIAPDYVLCTPEVQEKFVEALKKTLKEFYGED 271

Query: 375 VQGSKHYCRIVSDKHFQRLKSLVHSSGTIALGGDMDASDRPLYI 418
            + S  Y RI++D+HFQRLK L+ S G +A+GG  D  +R  YI
Sbjct: 272 PKESPDYGRIINDRHFQRLKKLL-SGGKVAIGGQTDEKER--YI 312



 Score = 56.1 bits (136), Expect = 2e-08
 Identities = 20/34 (58%), Positives = 27/34 (79%), Gaps = 2/34 (5%)

Query: 284 AYHGKYSFNTFTHRKSCLVKDYNPVLEALSAPLY 317
           AYHGKYSF+TF+H++SCLVK  N  +E L++  Y
Sbjct: 408 AYHGKYSFDTFSHKRSCLVKSLN--MEKLNSLRY 439



 Score = 54.5 bits (132), Expect = 6e-08
 Identities = 20/40 (50%), Positives = 25/40 (62%), Gaps = 5/40 (12%)

Query: 402 TIALGGDMDASDRPLYIDSSVNIELAVRRFLWGKCINAGQ 441
           T+ LGG       P Y+D S +I++A RR  WGK INAGQ
Sbjct: 204 TLELGGK-----SPCYVDKSCDIDVAARRIAWGKFINAGQ 238



 Score = 52.2 bits (126), Expect = 3e-07
 Identities = 18/30 (60%), Positives = 24/30 (80%), Gaps = 2/30 (6%)

Query: 453 YHGKYSFVTFTHRKSCLVKDYNPVLEALSA 482
           YHGKYSF TF+H++SCLVK  N  +E L++
Sbjct: 409 YHGKYSFDTFSHKRSCLVKSLN--MEKLNS 436


>gnl|CDD|143406 cd07087, ALDH_F3-13-14_CALDH-like, ALDH subfamily: Coniferyl
           aldehyde dehydrogenase, ALDH families 3, 13, and 14, and
           other related proteins.  ALDH subfamily which includes
           NAD(P)+-dependent, aldehyde dehydrogenase, family 3
           member A1 and B1  (ALDH3A1, ALDH3B1,  EC=1.2.1.5) and
           fatty aldehyde dehydrogenase, family 3 member A2
           (ALDH3A2, EC=1.2.1.3), and also plant ALDH family
           members ALDH3F1, ALDH3H1, and ALDH3I1, fungal ALDH14
           (YMR110C) and the protozoan family 13 member (ALDH13),
           as well as coniferyl aldehyde dehydrogenases (CALDH,
           EC=1.2.1.68), and other similar  sequences, such as the
           Pseudomonas putida benzaldehyde dehydrogenase I that is
           involved in the metabolism of mandelate.
          Length = 426

 Score =  411 bits (1058), Expect = e-141
 Identities = 135/270 (50%), Positives = 185/270 (68%)

Query: 6   DLVQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIE 65
           +LV   R+TF +GK++  E+R+ QL+ L R+  EN++++A AL ADL K   EA L EI 
Sbjct: 2   ELVARLRETFLTGKTRSLEWRKAQLKALKRMLTENEEEIAAALYADLGKPPAEAYLTEIA 61

Query: 66  FLANDVRNTLNHLKQWMTPEKPGKDIANMLDGVYIYPDPYGVCLIIGAWNYPLQLSLLPA 125
            +  ++ + L HLK+WM P +    +       Y+ P+P GV LIIG WNYPLQL+L P 
Sbjct: 62  VVLGEIDHALKHLKKWMKPRRVSVPLLLQPAKAYVIPEPLGVVLIIGPWNYPLQLALAPL 121

Query: 126 AGAIAAGNVVILKPSEVAPASAKIMAELLPKYLDNDTFQVVLGGVEETTELLKHRFDYIF 185
            GAIAAGN V+LKPSE+APA++ ++A+L+PKY D +   VV GGVE  T LL   FD+IF
Sbjct: 122 IGAIAAGNTVVLKPSELAPATSALLAKLIPKYFDPEAVAVVEGGVEVATALLAEPFDHIF 181

Query: 186 YTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKCINAGQTCI 245
           +TGS +VGKIV +AA +HLTPVTLELGGKSP  +D   N+E+A RR  WGK +NAGQTCI
Sbjct: 182 FTGSPAVGKIVMEAAAKHLTPVTLELGGKSPCIVDKDANLEVAARRIAWGKFLNAGQTCI 241

Query: 246 APDYILCSRQVQAQILNQAKAVLDSWYTEQ 275
           APDY+L    ++ +++ + K  +  +Y E 
Sbjct: 242 APDYVLVHESIKDELIEELKKAIKEFYGED 271



 Score =  126 bits (318), Expect = 1e-31
 Identities = 42/104 (40%), Positives = 65/104 (62%), Gaps = 3/104 (2%)

Query: 315 PLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQILNQAKAVLDSWYTEQ 374
           P  +D   N+E+A RR  WGK +NAGQTCIAPDY+L    ++ +++ + K  +  +Y E 
Sbjct: 212 PCIVDKDANLEVAARRIAWGKFLNAGQTCIAPDYVLVHESIKDELIEELKKAIKEFYGED 271

Query: 375 VQGSKHYCRIVSDKHFQRLKSLVHSSGTIALGGDMDASDRPLYI 418
            + S  Y RI++++HF RL SL+   G + +GG +D  +R  YI
Sbjct: 272 PKESPDYGRIINERHFDRLASLL-DDGKVVIGGQVDKEER--YI 312



 Score = 47.5 bits (114), Expect = 9e-06
 Identities = 14/27 (51%), Positives = 18/27 (66%)

Query: 415 PLYIDSSVNIELAVRRFLWGKCINAGQ 441
           P  +D   N+E+A RR  WGK +NAGQ
Sbjct: 212 PCIVDKDANLEVAARRIAWGKFLNAGQ 238



 Score = 42.1 bits (100), Expect = 4e-04
 Identities = 12/19 (63%), Positives = 14/19 (73%)

Query: 284 AYHGKYSFNTFTHRKSCLV 302
           AYHGK  F+TF+H KS L 
Sbjct: 408 AYHGKAGFDTFSHLKSVLK 426



 Score = 38.7 bits (91), Expect = 0.006
 Identities = 11/18 (61%), Positives = 12/18 (66%)

Query: 453 YHGKYSFVTFTHRKSCLV 470
           YHGK  F TF+H KS L 
Sbjct: 409 YHGKAGFDTFSHLKSVLK 426


>gnl|CDD|143453 cd07135, ALDH_F14-YMR110C, Saccharomyces cerevisiae aldehyde
           dehydrogenase family 14 and related proteins.  Aldehyde
           dehydrogenase family 14 (ALDH14), isolated mainly from
           the mitochondrial outer membrane of Saccharomyces
           cerevisiae (YMR110C) and most closely related to the
           plant and animal ALDHs and fatty ALDHs family 3 members,
           and similar fungal sequences, are present in this CD.
          Length = 436

 Score =  363 bits (934), Expect = e-122
 Identities = 132/267 (49%), Positives = 177/267 (66%), Gaps = 1/267 (0%)

Query: 7   LVQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEF 66
           +    R TF SGK+K  E+R  QL+QL    ++N++ +  AL  DL +   E +L E+  
Sbjct: 10  IHSRLRATFRSGKTKDLEYRLWQLKQLYWAVKDNEEAIVEALKKDLGRPPFETLLTEVSG 69

Query: 67  LANDVRNTLNHLKQWMTPEKP-GKDIANMLDGVYIYPDPYGVCLIIGAWNYPLQLSLLPA 125
           + ND+ + L +LK+W   EK     +A M     I  +P GV LIIG WNYP+ L+L P 
Sbjct: 70  VKNDILHMLKNLKKWAKDEKVKDGPLAFMFGKPRIRKEPLGVVLIIGPWNYPVLLALSPL 129

Query: 126 AGAIAAGNVVILKPSEVAPASAKIMAELLPKYLDNDTFQVVLGGVEETTELLKHRFDYIF 185
            GAIAAG  V+LKPSE+ P +A ++AEL+PKYLD D FQVV GGV ETT LL+ +FD IF
Sbjct: 130 VGAIAAGCTVVLKPSELTPHTAALLAELVPKYLDPDAFQVVQGGVPETTALLEQKFDKIF 189

Query: 186 YTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKCINAGQTCI 245
           YTGS  VG+I+ +AA +HLTPVTLELGGKSP+ +  + ++ELA +R LWGK  NAGQ C+
Sbjct: 190 YTGSGRVGRIIAEAAAKHLTPVTLELGGKSPVIVTKNADLELAAKRILWGKFGNAGQICV 249

Query: 246 APDYILCSRQVQAQILNQAKAVLDSWY 272
           APDY+L    V  + + + K VLD +Y
Sbjct: 250 APDYVLVDPSVYDEFVEELKKVLDEFY 276



 Score =  114 bits (288), Expect = 1e-27
 Identities = 44/110 (40%), Positives = 68/110 (61%), Gaps = 3/110 (2%)

Query: 314 APLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQILNQAKAVLDSWYTE 373
           +P+ +  + ++ELA +R LWGK  NAGQ C+APDY+L    V  + + + K VLD +Y  
Sbjct: 219 SPVIVTKNADLELAAKRILWGKFGNAGQICVAPDYVLVDPSVYDEFVEELKKVLDEFYPG 278

Query: 374 QVQGSKHYCRIVSDKHFQRLKSLV-HSSGTIALGGDMDASDRPLYIDSSV 422
               S  Y RIV+ +HF RLKSL+  + G + +GG+MD + R  +I  ++
Sbjct: 279 GANASPDYTRIVNPRHFNRLKSLLDTTKGKVVIGGEMDEATR--FIPPTI 326



 Score = 45.7 bits (109), Expect = 3e-05
 Identities = 13/27 (48%), Positives = 19/27 (70%)

Query: 415 PLYIDSSVNIELAVRRFLWGKCINAGQ 441
           P+ +  + ++ELA +R LWGK  NAGQ
Sbjct: 220 PVIVTKNADLELAAKRILWGKFGNAGQ 246



 Score = 39.1 bits (92), Expect = 0.004
 Identities = 11/16 (68%), Positives = 14/16 (87%)

Query: 284 AYHGKYSFNTFTHRKS 299
           AYHGKY F+TFTH ++
Sbjct: 418 AYHGKYGFDTFTHERT 433


>gnl|CDD|240392 PTZ00381, PTZ00381, aldehyde dehydrogenase family protein;
           Provisional.
          Length = 493

 Score =  365 bits (938), Expect = e-122
 Identities = 131/269 (48%), Positives = 189/269 (70%)

Query: 6   DLVQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIE 65
            +V+  +++F +GK++P EFR++QL+ L+R+ EEN+Q+ + A+  DL +H  E  + E+ 
Sbjct: 11  PIVKKLKESFLTGKTRPLEFRKQQLRNLLRMLEENKQEFSEAVHKDLGRHPFETKMTEVL 70

Query: 66  FLANDVRNTLNHLKQWMTPEKPGKDIANMLDGVYIYPDPYGVCLIIGAWNYPLQLSLLPA 125
               ++ + L HL +++ PEK            YI P+P GV L+IGAWNYPL L+L+P 
Sbjct: 71  LTVAEIEHLLKHLDEYLKPEKVDTVGVFGPGKSYIIPEPLGVVLVIGAWNYPLNLTLIPL 130

Query: 126 AGAIAAGNVVILKPSEVAPASAKIMAELLPKYLDNDTFQVVLGGVEETTELLKHRFDYIF 185
           AGAIAAGN V+LKPSE++P ++K+MA+LL KYLD    +V+ GGVE TTELLK  FD+IF
Sbjct: 131 AGAIAAGNTVVLKPSELSPHTSKLMAKLLTKYLDPSYVRVIEGGVEVTTELLKEPFDHIF 190

Query: 186 YTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKCINAGQTCI 245
           +TGS  VGK+V QAA E+LTP TLELGGKSP+ +D S N+++A RR  WGK +NAGQTC+
Sbjct: 191 FTGSPRVGKLVMQAAAENLTPCTLELGGKSPVIVDKSCNLKVAARRIAWGKFLNAGQTCV 250

Query: 246 APDYILCSRQVQAQILNQAKAVLDSWYTE 274
           APDY+L  R ++ + +   K  +  ++ E
Sbjct: 251 APDYVLVHRSIKDKFIEALKEAIKEFFGE 279



 Score =  114 bits (287), Expect = 2e-27
 Identities = 37/101 (36%), Positives = 63/101 (62%), Gaps = 1/101 (0%)

Query: 315 PLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQILNQAKAVLDSWYTEQ 374
           P+ +D S N+++A RR  WGK +NAGQTC+APDY+L  R ++ + +   K  +  ++ E 
Sbjct: 221 PVIVDKSCNLKVAARRIAWGKFLNAGQTCVAPDYVLVHRSIKDKFIEALKEAIKEFFGED 280

Query: 375 VQGSKHYCRIVSDKHFQRLKSLVHSS-GTIALGGDMDASDR 414
            + S+ Y RIV++ H +RL  L+    G +  GG++D  ++
Sbjct: 281 PKKSEDYSRIVNEFHTKRLAELIKDHGGKVVYGGEVDIENK 321



 Score = 49.3 bits (118), Expect = 3e-06
 Identities = 18/40 (45%), Positives = 25/40 (62%), Gaps = 5/40 (12%)

Query: 402 TIALGGDMDASDRPLYIDSSVNIELAVRRFLWGKCINAGQ 441
           T+ LGG       P+ +D S N+++A RR  WGK +NAGQ
Sbjct: 213 TLELGGK-----SPVIVDKSCNLKVAARRIAWGKFLNAGQ 247



 Score = 36.9 bits (86), Expect = 0.024
 Identities = 13/20 (65%), Positives = 15/20 (75%)

Query: 284 AYHGKYSFNTFTHRKSCLVK 303
           AYHGKY F+TF+H K  L K
Sbjct: 419 AYHGKYGFDTFSHPKPVLNK 438



 Score = 32.7 bits (75), Expect = 0.43
 Identities = 12/19 (63%), Positives = 13/19 (68%)

Query: 453 YHGKYSFVTFTHRKSCLVK 471
           YHGKY F TF+H K  L K
Sbjct: 420 YHGKYGFDTFSHPKPVLNK 438


>gnl|CDD|143454 cd07136, ALDH_YwdH-P39616, Bacillus subtilis aldehyde dehydrogenase
           ywdH-like.  Uncharacterized Bacillus subtilis ywdH
           aldehyde dehydrogenase (locus P39616)  most closely
           related to the ALDHs and fatty ALDHs of families 3 and
           14, and similar sequences, are included in this CD.
          Length = 449

 Score =  359 bits (924), Expect = e-120
 Identities = 125/270 (46%), Positives = 178/270 (65%)

Query: 6   DLVQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIE 65
            LV+  R  F +G +K  EFR  QL++L +  ++ + ++  AL  DL K + EA + EI 
Sbjct: 2   SLVEKQRAFFKTGATKDVEFRIEQLKKLKQAIKKYENEILEALKKDLGKSEFEAYMTEIG 61

Query: 66  FLANDVRNTLNHLKQWMTPEKPGKDIANMLDGVYIYPDPYGVCLIIGAWNYPLQLSLLPA 125
           F+ +++   + HLK+WM P++    + N     YIY +PYGV LII  WNYP QL+L P 
Sbjct: 62  FVLSEINYAIKHLKKWMKPKRVKTPLLNFPSKSYIYYEPYGVVLIIAPWNYPFQLALAPL 121

Query: 126 AGAIAAGNVVILKPSEVAPASAKIMAELLPKYLDNDTFQVVLGGVEETTELLKHRFDYIF 185
            GAIAAGN  +LKPSE+ P ++K++A+++ +  D +   VV GGVEE  ELL  +FDYIF
Sbjct: 122 IGAIAAGNTAVLKPSELTPNTSKVIAKIIEETFDEEYVAVVEGGVEENQELLDQKFDYIF 181

Query: 186 YTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKCINAGQTCI 245
           +TGS  VGKIV +AA +HLTPVTLELGGKSP  +D   N++LA +R +WGK +NAGQTC+
Sbjct: 182 FTGSVRVGKIVMEAAAKHLTPVTLELGGKSPCIVDEDANLKLAAKRIVWGKFLNAGQTCV 241

Query: 246 APDYILCSRQVQAQILNQAKAVLDSWYTEQ 275
           APDY+L    V+ + + + K  +  +Y E 
Sbjct: 242 APDYVLVHESVKEKFIKELKEEIKKFYGED 271



 Score =  119 bits (301), Expect = 2e-29
 Identities = 41/104 (39%), Positives = 61/104 (58%), Gaps = 3/104 (2%)

Query: 315 PLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQILNQAKAVLDSWYTEQ 374
           P  +D   N++LA +R +WGK +NAGQTC+APDY+L    V+ + + + K  +  +Y E 
Sbjct: 212 PCIVDEDANLKLAAKRIVWGKFLNAGQTCVAPDYVLVHESVKEKFIKELKEEIKKFYGED 271

Query: 375 VQGSKHYCRIVSDKHFQRLKSLVHSSGTIALGGDMDASDRPLYI 418
              S  Y RI+++KHF RL  L+  +G I  GG+ D      YI
Sbjct: 272 PLESPDYGRIINEKHFDRLAGLLD-NGKIVFGGNTDRETL--YI 312



 Score = 44.4 bits (106), Expect = 1e-04
 Identities = 17/40 (42%), Positives = 24/40 (60%), Gaps = 5/40 (12%)

Query: 402 TIALGGDMDASDRPLYIDSSVNIELAVRRFLWGKCINAGQ 441
           T+ LGG       P  +D   N++LA +R +WGK +NAGQ
Sbjct: 204 TLELGGK-----SPCIVDEDANLKLAAKRIVWGKFLNAGQ 238



 Score = 42.9 bits (102), Expect = 3e-04
 Identities = 14/20 (70%), Positives = 18/20 (90%)

Query: 284 AYHGKYSFNTFTHRKSCLVK 303
           +YHGKYSF+TF+H+KS L K
Sbjct: 408 SYHGKYSFDTFSHKKSILKK 427



 Score = 40.2 bits (95), Expect = 0.002
 Identities = 14/19 (73%), Positives = 16/19 (84%)

Query: 453 YHGKYSFVTFTHRKSCLVK 471
           YHGKYSF TF+H+KS L K
Sbjct: 409 YHGKYSFDTFSHKKSILKK 427


>gnl|CDD|143455 cd07137, ALDH_F3FHI, Plant aldehyde dehydrogenase family 3 members
           F1, H1, and I1 and related proteins.  Aldehyde
           dehydrogenase family members 3F1, 3H1, and 3I1 (ALDH3F1,
           ALDH3H1, and ALDH3I1), and similar plant sequences, are
           in this CD.  In Arabidopsis thaliana, stress-regulated
           expression of ALDH3I1  was observed in  leaves and
           osmotic stress expression of  ALDH3H1 was observed in
           root tissue, whereas, ALDH3F1 expression was not stress
           responsive. Functional analysis of ALDH3I1 suggest it
           may be involved in a detoxification pathway in plants
           that limits aldehyde accumulation and oxidative stress.
          Length = 432

 Score =  302 bits (775), Expect = 2e-98
 Identities = 124/270 (45%), Positives = 178/270 (65%), Gaps = 3/270 (1%)

Query: 7   LVQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEF 66
           LV+  R+TF SG+++  E+R+ QL+ L+RL +EN+ D+  AL  DL K   E+   E+  
Sbjct: 4   LVRELRETFRSGRTRSAEWRKSQLKGLLRLVDENEDDIFAALRQDLGKPSAESFRDEVSV 63

Query: 67  LANDVRNTLNHLKQWMTPEKPGKDIANMLDGVYIYPDPYGVCLIIGAWNYPLQLSLLPAA 126
           L +  +  +  LK+WM PEK    +        I  +P GV L+I AWN+P  LSL P  
Sbjct: 64  LVSSCKLAIKELKKWMAPEKVKTPLTTFPAKAEIVSEPLGVVLVISAWNFPFLLSLEPVI 123

Query: 127 GAIAAGNVVILKPSEVAPASAKIMAELLPKYLDNDTFQVVLGGVEETTELLKHRFDYIFY 186
           GAIAAGN V+LKPSE+APA++ ++A+L+P+YLD    +V+ GGV ETT LL+ ++D IF+
Sbjct: 124 GAIAAGNAVVLKPSELAPATSALLAKLIPEYLDTKAIKVIEGGVPETTALLEQKWDKIFF 183

Query: 187 TGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGK--CINAGQTC 244
           TGS  VG+I+  AA +HLTPVTLELGGK P+ +DS+V++++AVRR   GK  C N GQ C
Sbjct: 184 TGSPRVGRIIMAAAAKHLTPVTLELGGKCPVIVDSTVDLKVAVRRIAGGKWGC-NNGQAC 242

Query: 245 IAPDYILCSRQVQAQILNQAKAVLDSWYTE 274
           IAPDY+L        +++  K  L+ ++ E
Sbjct: 243 IAPDYVLVEESFAPTLIDALKNTLEKFFGE 272



 Score = 83.2 bits (206), Expect = 3e-17
 Identities = 46/126 (36%), Positives = 67/126 (53%), Gaps = 18/126 (14%)

Query: 315 PLYIDSSVNIELAVRRFLWGK--CINAGQTCIAPDYILCSRQVQAQILNQAKAVLDSWYT 372
           P+ +DS+V++++AVRR   GK  C N GQ CIAPDY+L        +++  K  L+ ++ 
Sbjct: 213 PVIVDSTVDLKVAVRRIAGGKWGC-NNGQACIAPDYVLVEESFAPTLIDALKNTLEKFFG 271

Query: 373 EQVQGSKHYCRIVSDKHFQRLKSLVHSSGT---IALGGDMDASDRPLYI----------D 419
           E  + SK   RIV+  HFQRL  L+        I  GG+ D  ++ LYI          D
Sbjct: 272 ENPKESKDLSRIVNSHHFQRLSRLLDDPSVADKIVHGGERD--EKNLYIEPTILLDPPLD 329

Query: 420 SSVNIE 425
           SS+  E
Sbjct: 330 SSIMTE 335



 Score = 38.5 bits (90), Expect = 0.006
 Identities = 11/19 (57%), Positives = 16/19 (84%)

Query: 284 AYHGKYSFNTFTHRKSCLV 302
           AYHGK+SF+ F+H+K+ L 
Sbjct: 414 AYHGKFSFDAFSHKKAVLY 432



 Score = 35.8 bits (83), Expect = 0.048
 Identities = 10/18 (55%), Positives = 14/18 (77%)

Query: 453 YHGKYSFVTFTHRKSCLV 470
           YHGK+SF  F+H+K+ L 
Sbjct: 415 YHGKFSFDAFSHKKAVLY 432



 Score = 34.3 bits (79), Expect = 0.13
 Identities = 18/42 (42%), Positives = 26/42 (61%), Gaps = 8/42 (19%)

Query: 402 TIALGGDMDASDRPLYIDSSVNIELAVRRFLWGK--CINAGQ 441
           T+ LGG       P+ +DS+V++++AVRR   GK  C N GQ
Sbjct: 205 TLELGGKC-----PVIVDSTVDLKVAVRRIAGGKWGC-NNGQ 240


>gnl|CDD|143452 cd07134, ALDH_AlkH-like, Pseudomonas putida Aldehyde dehydrogenase
           AlkH-like.  Aldehyde dehydrogenase AlkH (locus name
           P12693, EC=1.2.1.3) of the alkBFGHJKL operon that allows
           Pseudomonas putida to metabolize alkanes and the
           aldehyde dehydrogenase AldX of Bacillus subtilis (locus
           P46329, EC=1.2.1.3), and similar sequences, are present
           in this CD.
          Length = 433

 Score =  277 bits (712), Expect = 8e-89
 Identities = 101/247 (40%), Positives = 149/247 (60%)

Query: 26  RRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEFLANDVRNTLNHLKQWMTPE 85
           R  +L++L +     ++++  ALAAD RK   E  L EI  + +++ + + HLK+WM P+
Sbjct: 22  RIAKLKRLKKAILARREEIIAALAADFRKPAAEVDLTEILPVLSEINHAIKHLKKWMKPK 81

Query: 86  KPGKDIANMLDGVYIYPDPYGVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILKPSEVAPA 145
           +    +        I  +P GVCLII  WNYP  L+  P   AIAAGN  ILKPSE+ P 
Sbjct: 82  RVRTPLLLFGTKSKIRYEPKGVCLIISPWNYPFNLAFGPLVSAIAAGNTAILKPSELTPH 141

Query: 146 SAKIMAELLPKYLDNDTFQVVLGGVEETTELLKHRFDYIFYTGSTSVGKIVRQAANEHLT 205
           ++ ++A+++ +  D D   V  G  E    LL+  FD+IF+TGS +VGKIV  AA +HL 
Sbjct: 142 TSAVIAKIIREAFDEDEVAVFEGDAEVAQALLELPFDHIFFTGSPAVGKIVMAAAAKHLA 201

Query: 206 PVTLELGGKSPLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQILNQAK 265
            VTLELGGKSP  +D + +++ A ++  WGK +NAGQTCIAPDY+     V+   +   K
Sbjct: 202 SVTLELGGKSPTIVDETADLKKAAKKIAWGKFLNAGQTCIAPDYVFVHESVKDAFVEHLK 261

Query: 266 AVLDSWY 272
           A ++ +Y
Sbjct: 262 AEIEKFY 268



 Score =  110 bits (277), Expect = 2e-26
 Identities = 39/110 (35%), Positives = 60/110 (54%), Gaps = 8/110 (7%)

Query: 315 PLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQILNQAKAVLDSWY--T 372
           P  +D + +++ A ++  WGK +NAGQTCIAPDY+     V+   +   KA ++ +Y   
Sbjct: 212 PTIVDETADLKKAAKKIAWGKFLNAGQTCIAPDYVFVHESVKDAFVEHLKAEIEKFYGKD 271

Query: 373 EQVQGSKHYCRIVSDKHFQRLKSLVHSS----GTIALGGDMDASDRPLYI 418
              + S    RIV+D+HF RLK L+  +      +  GG  DA+ R  YI
Sbjct: 272 AARKASPDLARIVNDRHFDRLKGLLDDAVAKGAKVEFGGQFDAAQR--YI 319



 Score = 42.6 bits (101), Expect = 3e-04
 Identities = 15/43 (34%), Positives = 25/43 (58%), Gaps = 5/43 (11%)

Query: 399 SSGTIALGGDMDASDRPLYIDSSVNIELAVRRFLWGKCINAGQ 441
           +S T+ LGG       P  +D + +++ A ++  WGK +NAGQ
Sbjct: 201 ASVTLELGGK-----SPTIVDETADLKKAAKKIAWGKFLNAGQ 238



 Score = 29.1 bits (66), Expect = 5.0
 Identities = 8/19 (42%), Positives = 12/19 (63%)

Query: 284 AYHGKYSFNTFTHRKSCLV 302
           +YHG Y F  F+H ++ L 
Sbjct: 415 SYHGVYGFKAFSHERAVLR 433


>gnl|CDD|143451 cd07133, ALDH_CALDH_CalB, Coniferyl aldehyde dehydrogenase-like.
           Coniferyl aldehyde dehydrogenase (CALDH, EC=1.2.1.68) of
           Pseudomonas sp. strain HR199 (CalB) which catalyzes the
           NAD+-dependent oxidation of coniferyl aldehyde to
           ferulic acid, and similar sequences, are present in this
           CD.
          Length = 434

 Score =  272 bits (698), Expect = 1e-86
 Identities = 105/259 (40%), Positives = 145/259 (55%), Gaps = 17/259 (6%)

Query: 23  YEFRRRQLQQLVRLYEENQQDLANALAADL-RKHKQEAVLFEIEFLANDVRNTLNHLKQW 81
            E RR +L +L  L  +NQ  LA A++AD   + + E +L EI      +++   HLK+W
Sbjct: 19  LEERRDRLDRLKALLLDNQDALAEAISADFGHRSRHETLLAEILPSIAGIKHARKHLKKW 78

Query: 82  MTPEK--------PGKDIANMLDGVYIYPDPYGVCLIIGAWNYPLQLSLLPAAGAIAAGN 133
           M P +        P K          +   P GV  II  WNYPL L+L P   A+AAGN
Sbjct: 79  MKPSRRHVGLLFLPAK--------AEVEYQPLGVVGIIVPWNYPLYLALGPLIAALAAGN 130

Query: 134 VVILKPSEVAPASAKIMAELLPKYLDNDTFQVVLGGVEETTELLKHRFDYIFYTGSTSVG 193
            V++KPSE  P ++ ++AELL +Y D D   VV GG +         FD++ +TGST+VG
Sbjct: 131 RVMIKPSEFTPRTSALLAELLAEYFDEDEVAVVTGGADVAAAFSSLPFDHLLFTGSTAVG 190

Query: 194 KIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCS 253
           + V +AA E+LTPVTLELGGKSP  I    ++  A  R  +GK +NAGQTC+APDY+L  
Sbjct: 191 RHVMRAAAENLTPVTLELGGKSPAIIAPDADLAKAAERIAFGKLLNAGQTCVAPDYVLVP 250

Query: 254 RQVQAQILNQAKAVLDSWY 272
                + +  AKA +   Y
Sbjct: 251 EDKLEEFVAAAKAAVAKMY 269



 Score = 91.0 bits (227), Expect = 9e-20
 Identities = 27/83 (32%), Positives = 45/83 (54%), Gaps = 1/83 (1%)

Query: 315 PLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQILNQAKAVLDSWYTEQ 374
           P  I    ++  A  R  +GK +NAGQTC+APDY+L       + +  AKA +   Y   
Sbjct: 213 PAIIAPDADLAKAAERIAFGKLLNAGQTCVAPDYVLVPEDKLEEFVAAAKAAVAKMYP-T 271

Query: 375 VQGSKHYCRIVSDKHFQRLKSLV 397
           +  +  Y  I++++H+ RL+ L+
Sbjct: 272 LADNPDYTSIINERHYARLQGLL 294



 Score = 39.4 bits (93), Expect = 0.003
 Identities = 15/40 (37%), Positives = 20/40 (50%), Gaps = 5/40 (12%)

Query: 402 TIALGGDMDASDRPLYIDSSVNIELAVRRFLWGKCINAGQ 441
           T+ LGG    S  P  I    ++  A  R  +GK +NAGQ
Sbjct: 205 TLELGGK---S--PAIIAPDADLAKAAERIAFGKLLNAGQ 239



 Score = 33.2 bits (77), Expect = 0.30
 Identities = 10/16 (62%), Positives = 11/16 (68%)

Query: 284 AYHGKYSFNTFTHRKS 299
           AYHGK  F TF+H K 
Sbjct: 416 AYHGKEGFLTFSHAKP 431



 Score = 30.9 bits (71), Expect = 1.4
 Identities = 9/15 (60%), Positives = 11/15 (73%)

Query: 453 YHGKYSFVTFTHRKS 467
           YHGK  F+TF+H K 
Sbjct: 417 YHGKEGFLTFSHAKP 431


>gnl|CDD|165847 PLN02203, PLN02203, aldehyde dehydrogenase.
          Length = 484

 Score =  271 bits (696), Expect = 6e-86
 Identities = 114/280 (40%), Positives = 171/280 (61%), Gaps = 8/280 (2%)

Query: 1   MVNFGDLVQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAV 60
                  V   R+T++SG+++  E+R+ QL+ L+RL ++N++ +  AL  DL KH+ EA 
Sbjct: 5   GETLEGSVAELRETYESGRTRSLEWRKSQLKGLLRLLKDNEEAIFKALHQDLGKHRVEAY 64

Query: 61  LFEIEFLANDVRNTLNHLKQWMTPEKPGKDIANMLDGVYIYPDPYGVCLIIGAWNYPLQL 120
             E+  L       L++LK+WM P+K    +        + P+P GV LI  +WN+P+ L
Sbjct: 65  RDEVGVLTKSANLALSNLKKWMAPKKAKLPLVAFPATAEVVPEPLGVVLIFSSWNFPIGL 124

Query: 121 SLLPAAGAIAAGNVVILKPSEVAPASAKIMAELLPKYLDNDTFQVVLGGVEETTELLKHR 180
           SL P  GAIAAGN V+LKPSE+APA++  +A  +PKYLD+   +V+ GG     +LL+H+
Sbjct: 125 SLEPLIGAIAAGNAVVLKPSELAPATSAFLAANIPKYLDSKAVKVIEGGPAVGEQLLQHK 184

Query: 181 FDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYID---SSVNIELAVRRFL---W 234
           +D IF+TGS  VG+I+  AA +HLTPV LELGGK P  +D   SS + ++AV R +   W
Sbjct: 185 WDKIFFTGSPRVGRIIMTAAAKHLTPVALELGGKCPCIVDSLSSSRDTKVAVNRIVGGKW 244

Query: 235 GKCINAGQTCIAPDYILCSRQVQAQILNQAKAVLDSWYTE 274
           G C  AGQ CIA DY+L   +    ++   K+ +  ++ E
Sbjct: 245 GSC--AGQACIAIDYVLVEERFAPILIELLKSTIKKFFGE 282



 Score = 73.2 bits (180), Expect = 6e-14
 Identities = 38/113 (33%), Positives = 61/113 (53%), Gaps = 14/113 (12%)

Query: 318 IDSSVNIELAVRRFL---WGKCINAGQTCIAPDYILCSRQVQAQILNQAKAVLDSWYTEQ 374
           + SS + ++AV R +   WG C  AGQ CIA DY+L   +    ++   K+ +  ++ E 
Sbjct: 226 LSSSRDTKVAVNRIVGGKWGSC--AGQACIAIDYVLVEERFAPILIELLKSTIKKFFGEN 283

Query: 375 VQGSKHYCRIVSDKHFQRLKSL-----VHSSGTIALGGDMDASDRPLYIDSSV 422
            + SK   RI++ KHFQRL +L     V +S  I  GG +D     L+I+ ++
Sbjct: 284 PRESKSMARILNKKHFQRLSNLLKDPRVAAS--IVHGGSIDEKK--LFIEPTI 332



 Score = 38.9 bits (91), Expect = 0.005
 Identities = 14/31 (45%), Positives = 18/31 (58%)

Query: 285 YHGKYSFNTFTHRKSCLVKDYNPVLEALSAP 315
           YHGKYSF+TF+H K+ L +      E    P
Sbjct: 425 YHGKYSFDTFSHEKAVLRRSLLTEFEFRYPP 455



 Score = 37.4 bits (87), Expect = 0.017
 Identities = 13/28 (46%), Positives = 16/28 (57%)

Query: 452 EYHGKYSFVTFTHRKSCLVKDYNPVLEA 479
            YHGKYSF TF+H K+ L +      E 
Sbjct: 424 RYHGKYSFDTFSHEKAVLRRSLLTEFEF 451


>gnl|CDD|143395 cd06534, ALDH-SF, NAD(P)+-dependent aldehyde dehydrogenase
           superfamily.  The aldehyde dehydrogenase superfamily
           (ALDH-SF) of  NAD(P)+-dependent enzymes, in general,
           oxidize a wide range of  endogenous and exogenous
           aliphatic and aromatic aldehydes to their corresponding
           carboxylic acids and play an  important role in
           detoxification. Besides aldehyde detoxification, many
           ALDH isozymes possess multiple additional catalytic and
           non-catalytic functions such as participating in
           metabolic pathways, or as binding proteins, or
           osmoregulants, to mention a few. The enzyme has three
           domains, a NAD(P)+ cofactor-binding domain, a catalytic
           domain, and a bridging domain; and the active enzyme is
           generally either homodimeric or homotetrameric. The
           catalytic mechanism is proposed to involve cofactor
           binding, resulting in a conformational change and
           activation of an invariant catalytic cysteine
           nucleophile. The cysteine and aldehyde substrate form an
           oxyanion thiohemiacetal intermediate resulting in
           hydride transfer to the cofactor and formation of a
           thioacylenzyme intermediate. Hydrolysis of the
           thioacylenzyme and release of the carboxylic acid
           product occurs, and in most cases, the reduced cofactor
           dissociates from the enzyme. The evolutionary
           phylogenetic tree of ALDHs appears to have an initial
           bifurcation between what has been characterized as the
           classical aldehyde dehydrogenases, the ALDH family
           (ALDH) and extended family members or aldehyde
           dehydrogenase-like (ALDH-L) proteins. The ALDH proteins
           are represented by enzymes which share a number of
           highly conserved residues necessary for catalysis and
           cofactor binding and they include such proteins as
           retinal dehydrogenase, 10-formyltetrahydrofolate
           dehydrogenase, non-phosphorylating glyceraldehyde
           3-phosphate dehydrogenase,
           delta(1)-pyrroline-5-carboxylate dehydrogenases,
           alpha-ketoglutaric semialdehyde dehydrogenase,
           alpha-aminoadipic semialdehyde dehydrogenase, coniferyl
           aldehyde dehydrogenase and succinate-semialdehyde
           dehydrogenase.  Included in this larger group are all
           human, Arabidopsis, Tortula, fungal, protozoan, and
           Drosophila ALDHs identified in families ALDH1 through
           ALDH22 with the exception of families ALDH18, ALDH19,
           and ALDH20 which are present in the ALDH-like group. The
           ALDH-like group is represented by such proteins as
           gamma-glutamyl phosphate reductase, LuxC-like acyl-CoA
           reductase, and coenzyme A acylating aldehyde
           dehydrogenase. All of these proteins have a conserved
           cysteine that aligns with the catalytic cysteine of the
           ALDH group.
          Length = 367

 Score =  237 bits (607), Expect = 4e-74
 Identities = 95/263 (36%), Positives = 136/263 (51%), Gaps = 9/263 (3%)

Query: 11  ARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEFLAND 70
           AR  F +  + P   R   L+++  L EE +++LA     +  K   E  L E+    + 
Sbjct: 3   ARAAFKAWAALPPAERAAILRKIADLLEERREELAALETLETGK-PIEEALGEVARAIDT 61

Query: 71  VRNTLNHLKQWMTPEKPGKDIANMLDGV-YIYPDPYGVCLIIGAWNYPLQLSLLPAAGAI 129
            R       +   PE P    +    G  Y+  +P GV  +I  WN+PL L+    A A+
Sbjct: 62  FRYAAGLADKLGGPELP----SPDPGGEAYVRREPLGVVGVITPWNFPLLLAAWKLAPAL 117

Query: 130 AAGNVVILKPSEVAPASAKIMAELLPKY-LDNDTFQVVLG-GVEETTELLKH-RFDYIFY 186
           AAGN V+LKPSE+ P +A  +AELL +  L      VV G G E    LL H R D I +
Sbjct: 118 AAGNTVVLKPSELTPLTALALAELLQEAGLPPGVVNVVPGGGDEVGAALLSHPRVDKISF 177

Query: 187 TGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKCINAGQTCIA 246
           TGST+VGK + +AA E+L PVTLELGGKSP+ +D   +++ AV   ++G   NAGQ C A
Sbjct: 178 TGSTAVGKAIMKAAAENLKPVTLELGGKSPVIVDEDADLDAAVEGAVFGAFFNAGQICTA 237

Query: 247 PDYILCSRQVQAQILNQAKAVLD 269
              +L    +  + + +   VL 
Sbjct: 238 ASRLLVHESIYDEFVEKLVTVLV 260



 Score = 57.6 bits (140), Expect = 5e-09
 Identities = 14/55 (25%), Positives = 27/55 (49%)

Query: 314 APLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQILNQAKAVLD 368
           +P+ +D   +++ AV   ++G   NAGQ C A   +L    +  + + +   VL 
Sbjct: 206 SPVIVDEDADLDAAVEGAVFGAFFNAGQICTAASRLLVHESIYDEFVEKLVTVLV 260



 Score = 36.8 bits (86), Expect = 0.021
 Identities = 9/27 (33%), Positives = 16/27 (59%)

Query: 415 PLYIDSSVNIELAVRRFLWGKCINAGQ 441
           P+ +D   +++ AV   ++G   NAGQ
Sbjct: 207 PVIVDEDADLDAAVEGAVFGAFFNAGQ 233


>gnl|CDD|143397 cd07078, ALDH, NAD(P)+ dependent aldehyde dehydrogenase family.
           The aldehyde dehydrogenase family (ALDH) of NAD(P)+
           dependent enzymes, in general, oxidize a wide range of
           endogenous and exogenous aliphatic and aromatic
           aldehydes to their corresponding carboxylic acids and
           play an  important role in detoxification. Besides
           aldehyde detoxification, many ALDH isozymes possess
           multiple additional catalytic and non-catalytic
           functions such as participating in  metabolic pathways,
           or as  binding proteins, or as osmoregulants, to mention
           a few. The enzyme has three domains, a NAD(P)+
           cofactor-binding domain, a catalytic domain, and a
           bridging domain; and the active enzyme  is generally
           either homodimeric or homotetrameric. The catalytic
           mechanism is proposed to involve cofactor binding,
           resulting in a conformational change and activation of
           an invariant catalytic cysteine nucleophile. The
           cysteine and aldehyde substrate form an oxyanion
           thiohemiacetal intermediate resulting in hydride
           transfer to the cofactor and formation of a
           thioacylenzyme intermediate. Hydrolysis of the
           thioacylenzyme and release of the carboxylic acid
           product occurs, and in most cases, the reduced cofactor
           dissociates from the enzyme. The evolutionary
           phylogenetic tree of ALDHs appears to have an initial
           bifurcation between what has been characterized as the
           classical aldehyde dehydrogenases, the ALDH family
           (ALDH) and extended family members or aldehyde
           dehydrogenase-like (ALDH-like) proteins. The ALDH
           proteins are represented by enzymes which share a number
           of highly conserved residues necessary for catalysis and
           cofactor binding and they include such proteins as
           retinal dehydrogenase, 10-formyltetrahydrofolate
           dehydrogenase, non-phosphorylating glyceraldehyde
           3-phosphate dehydrogenase,
           delta(1)-pyrroline-5-carboxylate dehydrogenases,
           alpha-ketoglutaric semialdehyde dehydrogenase,
           alpha-aminoadipic semialdehyde dehydrogenase, coniferyl
           aldehyde dehydrogenase and succinate-semialdehyde
           dehydrogenase.  Included in this larger group are all
           human, Arabidopsis, Tortula, fungal, protozoan, and
           Drosophila ALDHs identified in families ALDH1 through
           ALDH22 with the exception of families ALDH18, ALDH19,
           and ALDH20 which are present in the ALDH-like group.
          Length = 432

 Score =  238 bits (610), Expect = 1e-73
 Identities = 90/269 (33%), Positives = 136/269 (50%), Gaps = 9/269 (3%)

Query: 8   VQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEFL 67
           V  AR  F +  + P   R   L++L  L EE +++LA     +  K  +EA L E+   
Sbjct: 4   VAAARAAFKAWAALPPAERAAILRKLADLLEERREELAALETLETGKPIEEA-LGEVARA 62

Query: 68  ANDVRNTLNHLKQWMTPEKPGKDIANMLDGV-YIYPDPYGVCLIIGAWNYPLQLSLLPAA 126
           A+  R      ++      P    +     +  +  +P GV   I  WN+PL L+    A
Sbjct: 63  ADTFRYYAGLARRLHGEVIP----SPDPGELAIVRREPLGVVGAITPWNFPLLLAAWKLA 118

Query: 127 GAIAAGNVVILKPSEVAPASAKIMAELLPKY-LDNDTFQVVLGGVEET-TELLKH-RFDY 183
            A+AAGN V+LKPSE+ P +A ++AELL +  L      VV G  +E    L  H R D 
Sbjct: 119 PALAAGNTVVLKPSELTPLTALLLAELLAEAGLPPGVLNVVTGDGDEVGAALASHPRVDK 178

Query: 184 IFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKCINAGQT 243
           I +TGST+VGK + +AA E+L  VTLELGGKSPL +    +++ AV+  ++G   NAGQ 
Sbjct: 179 ISFTGSTAVGKAIMRAAAENLKRVTLELGGKSPLIVFDDADLDAAVKGAVFGAFGNAGQV 238

Query: 244 CIAPDYILCSRQVQAQILNQAKAVLDSWY 272
           C A   +L    +  + + +    + +  
Sbjct: 239 CTAASRLLVHESIYDEFVERLVERVKALK 267



 Score = 66.8 bits (164), Expect = 8e-12
 Identities = 16/114 (14%), Positives = 40/114 (35%), Gaps = 13/114 (11%)

Query: 314 APLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQILNQAKAVLDSWY-- 371
           +PL +    +++ AV+  ++G   NAGQ C A   +L    +  + + +    + +    
Sbjct: 210 SPLIVFDDADLDAAVKGAVFGAFGNAGQVCTAASRLLVHESIYDEFVERLVERVKALKVG 269

Query: 372 ---TEQVQGSKHYCRIVSDKHFQRLKSLV----HSSGTIALGGDMDASDRPLYI 418
                          ++S     R+ + +         +  GG      +  ++
Sbjct: 270 NPLDPDTD----MGPLISAAQLDRVLAYIEDAKAEGAKLLCGGKRLEGGKGYFV 319



 Score = 35.6 bits (83), Expect = 0.056
 Identities = 9/27 (33%), Positives = 16/27 (59%)

Query: 415 PLYIDSSVNIELAVRRFLWGKCINAGQ 441
           PL +    +++ AV+  ++G   NAGQ
Sbjct: 211 PLIVFDDADLDAAVKGAVFGAFGNAGQ 237


>gnl|CDD|177831 PLN02174, PLN02174, aldehyde dehydrogenase family 3 member H1.
          Length = 484

 Score =  239 bits (612), Expect = 2e-73
 Identities = 118/273 (43%), Positives = 181/273 (66%), Gaps = 3/273 (1%)

Query: 7   LVQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEF 66
           LV   R +FD G ++ YE+R  QL++L+ + + ++ ++  AL  DL K + E+ ++E+  
Sbjct: 15  LVTELRRSFDDGVTRGYEWRVTQLKKLMIICDNHEPEIVAALRDDLGKPELESSVYEVSL 74

Query: 67  LANDVRNTLNHLKQWMTPEKPGKDIANMLDGVYIYPDPYGVCLIIGAWNYPLQLSLLPAA 126
           L N ++  L  LK WM PEK    +        I  +P GV L+I AWNYP  LS+ P  
Sbjct: 75  LRNSIKLALKQLKNWMAPEKAKTSLTTFPASAEIVSEPLGVVLVISAWNYPFLLSIDPVI 134

Query: 127 GAIAAGNVVILKPSEVAPASAKIMAELLPKYLDNDTFQVVLGGVEETTELLKHRFDYIFY 186
           GAI+AGN V+LKPSE+APAS+ ++A+LL +YLD+   +VV G V ETT LL+ ++D IFY
Sbjct: 135 GAISAGNAVVLKPSELAPASSALLAKLLEQYLDSSAVRVVEGAVTETTALLEQKWDKIFY 194

Query: 187 TGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGK--CINAGQTC 244
           TGS+ +G+++  AA +HLTPV LELGGKSP+ +DS  ++++ VRR + GK  C N GQ C
Sbjct: 195 TGSSKIGRVIMAAAAKHLTPVVLELGGKSPVVVDSDTDLKVTVRRIIAGKWGC-NNGQAC 253

Query: 245 IAPDYILCSRQVQAQILNQAKAVLDSWYTEQEI 277
           I+PDYIL +++   ++++  K  L+++Y +  +
Sbjct: 254 ISPDYILTTKEYAPKVIDAMKKELETFYGKNPM 286



 Score = 70.8 bits (173), Expect = 4e-13
 Identities = 41/115 (35%), Positives = 65/115 (56%), Gaps = 8/115 (6%)

Query: 303 KDYNPVLEAL--SAPLYIDSSVNIELAVRRFLWGK--CINAGQTCIAPDYILCSRQVQAQ 358
           K   PV+  L   +P+ +DS  ++++ VRR + GK  C N GQ CI+PDYIL +++   +
Sbjct: 210 KHLTPVVLELGGKSPVVVDSDTDLKVTVRRIIAGKWGC-NNGQACISPDYILTTKEYAPK 268

Query: 359 ILNQAKAVLDSWYTEQVQGSKHYCRIVSDKHFQRLKSLVHS---SGTIALGGDMD 410
           +++  K  L+++Y +    SK   RIV+  HF RL  L+     S  I  GG+ D
Sbjct: 269 VIDAMKKELETFYGKNPMESKDMSRIVNSTHFDRLSKLLDEKEVSDKIVYGGEKD 323



 Score = 33.5 bits (76), Expect = 0.25
 Identities = 11/20 (55%), Positives = 17/20 (85%)

Query: 284 AYHGKYSFNTFTHRKSCLVK 303
           AYHGK+SF+ F+H+K+ L +
Sbjct: 425 AYHGKFSFDAFSHKKAVLYR 444



 Score = 30.0 bits (67), Expect = 3.2
 Identities = 10/19 (52%), Positives = 15/19 (78%)

Query: 453 YHGKYSFVTFTHRKSCLVK 471
           YHGK+SF  F+H+K+ L +
Sbjct: 426 YHGKFSFDAFSHKKAVLYR 444


>gnl|CDD|223944 COG1012, PutA, NAD-dependent aldehyde dehydrogenases [Energy
           production and conversion].
          Length = 472

 Score =  221 bits (564), Expect = 2e-66
 Identities = 88/267 (32%), Positives = 128/267 (47%), Gaps = 8/267 (2%)

Query: 8   VQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEFL 67
           V  AR  F++      E R   L+++  L E   ++LA  +  +  K   EA   EI   
Sbjct: 42  VAAARAAFEAWSRLSAEERAAILRRIADLLEARAEELAALITLETGKPISEA-RGEIARA 100

Query: 68  ANDVRNTLNHLKQWMTPEKPGKDIANMLDGVYIYPDPYGVCLIIGAWNYPLQLSLLPAAG 127
           A+ +R      ++      P    +       +  +P GV   I  WN+PL L+    A 
Sbjct: 101 ADFIRYYAEEARRLEGETIPTDKGSKA----LVRREPLGVVGAITPWNFPLALAAWKLAP 156

Query: 128 AIAAGNVVILKPSEVAPASAKIMAELLPKY-LDNDTFQVVLGGVEETTE-LLKH-RFDYI 184
           A+AAGN V+LKPSE  P SA  +AEL  +  L      VV GG  E  + L+ H   D I
Sbjct: 157 ALAAGNTVVLKPSEQTPLSALALAELAAEAGLPAGVLNVVTGGGAEVGDALVAHPDVDAI 216

Query: 185 FYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKCINAGQTC 244
            +TGST+VG+ +  AA  +L PVTLELGGKSP  +    +++ AV   ++G   NAGQ C
Sbjct: 217 SFTGSTAVGRAIAAAAAANLKPVTLELGGKSPAIVLEDADLDAAVDAAVFGAFFNAGQRC 276

Query: 245 IAPDYILCSRQVQAQILNQAKAVLDSW 271
            A   ++    V  + + +  A   S 
Sbjct: 277 TAASRLIVHESVYDEFVERLVARAASL 303



 Score = 51.1 bits (123), Expect = 8e-07
 Identities = 17/89 (19%), Positives = 36/89 (40%), Gaps = 9/89 (10%)

Query: 314 APLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQILNQAKAVLDSWYTE 373
           +P  +    +++ AV   ++G   NAGQ C A   ++    V  + + +  A   S    
Sbjct: 247 SPAIVLEDADLDAAVDAAVFGAFFNAGQRCTAASRLIVHESVYDEFVERLVARAASL--- 303

Query: 374 QVQG-----SKHYCRIVSDKHFQRLKSLV 397
              G     S     ++S++   R++  +
Sbjct: 304 -KVGDPLDPSTDLGPLISEEQLDRVEGYI 331



 Score = 32.7 bits (75), Expect = 0.52
 Identities = 8/27 (29%), Positives = 14/27 (51%)

Query: 415 PLYIDSSVNIELAVRRFLWGKCINAGQ 441
           P  +    +++ AV   ++G   NAGQ
Sbjct: 248 PAIVLEDADLDAAVDAAVFGAFFNAGQ 274


>gnl|CDD|215767 pfam00171, Aldedh, Aldehyde dehydrogenase family.  This family of
           dehydrogenases act on aldehyde substrates. Members use
           NADP as a cofactor. The family includes the following
           members: The prototypical members are the aldehyde
           dehydrogenases EC:1.2.1.3. Succinate-semialdehyde
           dehydrogenase EC:1.2.1.16. Lactaldehyde dehydrogenase
           EC:1.2.1.22. Benzaldehyde dehydrogenase EC:1.2.1.28.
           Methylmalonate-semialdehyde dehydrogenase EC:1.2.1.27.
           Glyceraldehyde-3-phosphate dehydrogenase EC:1.2.1.9.
           Delta-1-pyrroline-5-carboxylate dehydrogenase EC:
           1.5.1.12. Acetaldehyde dehydrogenase EC:1.2.1.10.
           Glutamate-5-semialdehyde dehydrogenase EC:1.2.1.41. This
           family also includes omega crystallin, an eye lens
           protein from squid and octopus that has little aldehyde
           dehydrogenase activity.
          Length = 459

 Score =  194 bits (495), Expect = 2e-56
 Identities = 86/274 (31%), Positives = 128/274 (46%), Gaps = 22/274 (8%)

Query: 8   VQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEFL 67
           V  AR  F +    P   R   L +   L EE + +LA     +  K   EA        
Sbjct: 35  VAAARAAFKAWAKTPPSERAAILLKAADLLEERRDELAELETLETGKPLAEA-------- 86

Query: 68  ANDVRNTLNHLKQWMT-PEKPGKDIANMLDGV--YIYPDPYGVCLIIGAWNYPLQLSLLP 124
             +V   ++ L+ +     K   +      GV  Y   +P GV   I  WN+PL L+   
Sbjct: 87  RGEVPRAIDTLRYYAGLARKLEGETLPSDPGVLAYTRREPLGVVGAITPWNFPLLLAAWK 146

Query: 125 AAGAIAAGNVVILKPSEVAPASAKIMAEL-----LPKYLDNDTFQVVLGGVEETTE-LLK 178
            A A+AAGN V+LKPSE+ P +A ++AEL     LP         VV G   E  + L++
Sbjct: 147 IAPALAAGNTVVLKPSELTPLTALLLAELFEEAGLPP----GVLNVVTGSGSEVGDALVE 202

Query: 179 H-RFDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKC 237
           H   D + +TGST VG+ + +AA ++L  VTLELGGK+PL +    +++ AV   ++G  
Sbjct: 203 HPDVDKVSFTGSTEVGRRIAKAAAKNLKRVTLELGGKNPLIVFDDADLDAAVEGAVFGAF 262

Query: 238 INAGQTCIAPDYILCSRQVQAQILNQAKAVLDSW 271
            NAGQ C A   +L    +  + + +      S 
Sbjct: 263 GNAGQVCTAGSRLLVHESIYDEFVERLVEAAKSL 296



 Score = 48.7 bits (117), Expect = 4e-06
 Identities = 20/104 (19%), Positives = 38/104 (36%), Gaps = 15/104 (14%)

Query: 314 APLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQILNQAKAVLDSW--- 370
            PL +    +++ AV   ++G   NAGQ C A   +L    +  + + +      S    
Sbjct: 240 NPLIVFDDADLDAAVEGAVFGAFGNAGQVCTAGSRLLVHESIYDEFVERLVEAAKSLKVG 299

Query: 371 --YTEQVQ-GSKHYCRIVSDKHFQRLKSLVHSS----GTIALGG 407
                    G      ++S K  +R+ S +  +      +  GG
Sbjct: 300 DPLDPDTDIGP-----LISKKQRERVLSYIEDAKEEGAKLLCGG 338



 Score = 29.4 bits (67), Expect = 4.8
 Identities = 9/27 (33%), Positives = 15/27 (55%)

Query: 415 PLYIDSSVNIELAVRRFLWGKCINAGQ 441
           PL +    +++ AV   ++G   NAGQ
Sbjct: 241 PLIVFDDADLDAAVEGAVFGAFGNAGQ 267


>gnl|CDD|143417 cd07099, ALDH_DDALDH, Methylomonas sp.
           4,4'-diapolycopene-dialdehyde dehydrogenase-like.  The
           4,4'-diapolycopene-dialdehyde dehydrogenase (DDALDH)
           involved in C30 carotenoid synthesis in Methylomonas sp.
           strain 16a and other similar sequences are present in
           this CD. DDALDH converts 4,4'-diapolycopene-dialdehyde
           into 4,4'-diapolycopene-diacid.
          Length = 453

 Score =  165 bits (419), Expect = 9e-46
 Identities = 79/252 (31%), Positives = 113/252 (44%), Gaps = 22/252 (8%)

Query: 6   DLVQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIE 65
             V  AR    +  +   E R ++L +  R   ++  +LA  L A+  K + +A L    
Sbjct: 22  AAVARARAAQRAWAALGVEGRAQRLLRWKRALADHADELAELLHAETGKPRADAGL---- 77

Query: 66  FLANDVRNTLNHLKQW-------MTPEKPGKDIANMLDGVYIYPDPYGVCLIIGAWNYPL 118
               +V   L  +          + P K    +        +   PYGV  +I  WNYPL
Sbjct: 78  ----EVLLALEAIDWAARNAPRVLAPRKVPTGLLMPNKKATVEYRPYGVVGVISPWNYPL 133

Query: 119 QLSLLPAAGAIAAGNVVILKPSEVAPAS----AKIMAELLPKYLDNDTFQVVLGGVEETT 174
              +     A+AAGN V+LKPSEV P      A+  A   P        QVV G      
Sbjct: 134 LTPMGDIIPALAAGNAVVLKPSEVTPLVGELLAEAWAAAGP---PQGVLQVVTGDGATGA 190

Query: 175 ELLKHRFDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLW 234
            L+    D + +TGS + G+ V  AA E L PV LELGGK P+ + +  ++E A    +W
Sbjct: 191 ALIDAGVDKVAFTGSVATGRKVMAAAAERLIPVVLELGGKDPMIVLADADLERAAAAAVW 250

Query: 235 GKCINAGQTCIA 246
           G  +NAGQTCI+
Sbjct: 251 GAMVNAGQTCIS 262



 Score = 35.7 bits (83), Expect = 0.059
 Identities = 12/31 (38%), Positives = 20/31 (64%)

Query: 315 PLYIDSSVNIELAVRRFLWGKCINAGQTCIA 345
           P+ + +  ++E A    +WG  +NAGQTCI+
Sbjct: 232 PMIVLADADLERAAAAAVWGAMVNAGQTCIS 262


>gnl|CDD|143456 cd07138, ALDH_CddD_SSP0762, Rhodococcus ruber 6-oxolauric acid
           dehydrogenase-like.  The 6-oxolauric acid dehydrogenase
           (CddD) from Rhodococcus ruber SC1 which converts
           6-oxolauric acid to dodecanedioic acid, and the aldehyde
           dehydrogenase (locus SSP0762) from Staphylococcus
           saprophyticus subsp. saprophyticus ATCC 15305 and other
           similar sequences, are included in this CD.
          Length = 466

 Score =  148 bits (375), Expect = 2e-39
 Identities = 90/272 (33%), Positives = 131/272 (48%), Gaps = 21/272 (7%)

Query: 8   VQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEFL 67
           V  AR  F +  +   E R   L+++   YE    +LA A+  ++      A   ++   
Sbjct: 42  VAAARRAFPAWSATSVEERAALLERIAEAYEARADELAQAITLEMGAPITLARAAQVGLG 101

Query: 68  ANDVRNTLNHLKQWMTPEKPGKDIANMLDGVYIYPDPYGVCLIIGAWNYPLQLSLLPAAG 127
              +R   + LK +   E+ G  +        +  +P GVC +I  WN+PL   +L  A 
Sbjct: 102 IGHLRAAADALKDFEFEERRGNSL--------VVREPIGVCGLITPWNWPLNQIVLKVAP 153

Query: 128 AIAAGNVVILKPSEVAPASAKIMAEL-----LPKYLDNDTFQVVLG-GVEETTELLKH-R 180
           A+AAG  V+LKPSEVAP SA I+AE+     LP       F +V G G      L  H  
Sbjct: 154 ALAAGCTVVLKPSEVAPLSAIILAEILDEAGLPA----GVFNLVNGDGPVVGEALSAHPD 209

Query: 181 FDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKC-IN 239
            D + +TGST  GK V +AA + +  V LELGGKS   I    ++E AV R +   C  N
Sbjct: 210 VDMVSFTGSTRAGKRVAEAAADTVKRVALELGGKSANIILDDADLEKAVPRGV-AACFAN 268

Query: 240 AGQTCIAPDYILCSRQVQAQILNQAKAVLDSW 271
           +GQ+C AP  +L  R   A+    A A  +++
Sbjct: 269 SGQSCNAPTRMLVPRSRYAEAEEIAAAAAEAY 300



 Score = 36.3 bits (85), Expect = 0.031
 Identities = 21/82 (25%), Positives = 35/82 (42%), Gaps = 11/82 (13%)

Query: 324 IELAVRRFLWGKC-INAGQTCIAPDYILCSRQVQAQILNQAKAVLDSWYTEQVQG----- 377
           +E AV R +   C  N+GQ+C AP  +L  R   A+    A A  +++    V G     
Sbjct: 254 LEKAVPRGV-AACFANSGQSCNAPTRMLVPRSRYAEAEEIAAAAAEAY----VVGDPRDP 308

Query: 378 SKHYCRIVSDKHFQRLKSLVHS 399
           +     + S   F R++  +  
Sbjct: 309 ATTLGPLASAAQFDRVQGYIQK 330


>gnl|CDD|143427 cd07109, ALDH_AAS00426, Uncharacterized Saccharopolyspora spinosa
           aldehyde dehydrogenase (AAS00426)-like.  Uncharacterized
           aldehyde dehydrogenase of Saccharopolyspora spinosa
           (AAS00426) and other similar sequences, are present in
           this CD.
          Length = 454

 Score =  142 bits (361), Expect = 1e-37
 Identities = 93/267 (34%), Positives = 135/267 (50%), Gaps = 17/267 (6%)

Query: 8   VQNARDTFDSG--KSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIE 65
           VQ AR  F+SG  +  P E R R L ++ RL  E+  +LA   + D  K   +A      
Sbjct: 25  VQAARRAFESGWLRLSPAE-RGRLLLRIARLIREHADELARLESLDTGKPLTQA------ 77

Query: 66  FLANDVRNTLNHLKQWM-TPEKPGKDIANMLDG--VYIYPDPYGVCLIIGAWNYPLQLSL 122
               DV     + + +    +K   +   +  G  VY   +P+GV   I  WNYPLQ++ 
Sbjct: 78  --RADVEAAARYFEYYGGAADKLHGETIPLGPGYFVYTVREPHGVTGHIIPWNYPLQITG 135

Query: 123 LPAAGAIAAGNVVILKPSEVAPASAKIMAELLPKY-LDNDTFQVVLG-GVEETTELLKHR 180
              A A+AAGN V++KP+E AP +A  +AEL  +  L      VV G G E    L+ H 
Sbjct: 136 RSVAPALAAGNAVVVKPAEDAPLTALRLAELAEEAGLPAGALNVVTGLGAEAGAALVAHP 195

Query: 181 -FDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKCIN 239
             D+I +TGS   G  V +AA E++ PVTLELGGKSP  + +  ++E A+   +     N
Sbjct: 196 GVDHISFTGSVETGIAVMRAAAENVVPVTLELGGKSPQIVFADADLEAALPVVVNAIIQN 255

Query: 240 AGQTCIAPDYILCSRQVQAQILNQAKA 266
           AGQTC A   +L  R +  ++L +   
Sbjct: 256 AGQTCSAGSRLLVHRSIYDEVLERLVE 282


>gnl|CDD|143408 cd07089, ALDH_CddD-AldA-like, Rhodococcus ruber 6-oxolauric acid
           dehydrogenase-like and related proteins.  The
           6-oxolauric acid dehydrogenase (CddD) from Rhodococcus
           ruber SC1 which converts 6-oxolauric acid to
           dodecanedioic acid; and the aldehyde dehydrogenase
           (locus SSP0762) from Staphylococcus saprophyticus subsp.
           saprophyticus ATCC 15305 and also, the Mycobacterium
           tuberculosis H37Rv ALDH AldA (locus Rv0768) sequence;
           and other similar sequences, are included in this CD.
          Length = 459

 Score =  142 bits (361), Expect = 2e-37
 Identities = 79/271 (29%), Positives = 118/271 (43%), Gaps = 15/271 (5%)

Query: 11  ARDTFDSGK-SKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEFLAN 69
           AR  FD+G  S   E R R L+QL    E  +++L   L A++      A   +++    
Sbjct: 28  ARRAFDTGDWSTDAEERARCLRQLHEALEARKEELRALLVAEVGAPVMTARAMQVDGPIG 87

Query: 70  DVRNTLNHLKQWMTPEK-PGKDIANMLDGVYIYPDPYGVCLIIGAWNYPLQLSLLPAAGA 128
            +R   +    +      P   +        +  +P GV   I  WN+P  L+L   A A
Sbjct: 88  HLRYFADLADSFPWEFDLPVPALRGGPGRRVVRREPVGVVAAITPWNFPFFLNLAKLAPA 147

Query: 129 IAAGNVVILKPSEVAPASAKIMAEL-----LPKYLDNDTFQVVLGGVEETTELL--KHRF 181
           +AAGN V+LKP+   P SA ++ E+     LP         VV G      E L    R 
Sbjct: 148 LAAGNTVVLKPAPDTPLSALLLGEIIAETDLPA----GVVNVVTGSDNAVGEALTTDPRV 203

Query: 182 DYIFYTGSTSVGK-IVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKCINA 240
           D + +TGST+VG+ I+ QAA   L  V LELGGKS   +    ++  A    +     NA
Sbjct: 204 DMVSFTGSTAVGRRIMAQAA-ATLKRVLLELGGKSANIVLDDADLAAAAPAAVGVCMHNA 262

Query: 241 GQTCIAPDYILCSRQVQAQILNQAKAVLDSW 271
           GQ C     +L  R    +++    A  ++ 
Sbjct: 263 GQGCALTTRLLVPRSRYDEVVEALAAAFEAL 293



 Score = 31.4 bits (72), Expect = 1.1
 Identities = 12/58 (20%), Positives = 23/58 (39%), Gaps = 1/58 (1%)

Query: 313 SAPLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQILNQAKAVLDSW 370
           SA + +D   ++  A    +     NAGQ C     +L  R    +++    A  ++ 
Sbjct: 237 SANIVLDD-ADLAAAAPAAVGVCMHNAGQGCALTTRLLVPRSRYDEVVEALAAAFEAL 293


>gnl|CDD|143422 cd07104, ALDH_BenzADH-like, ALDH subfamily: NAD(P)+-dependent
           benzaldehyde dehydrogenase II, vanillin dehydrogenase,
           p-hydroxybenzaldehyde dehydrogenase and related
           proteins.  ALDH subfamily which includes the
           NAD(P)+-dependent, benzaldehyde dehydrogenase II (XylC,
           BenzADH, EC=1.2.1.28)  involved in the oxidation of
           benzyl alcohol to benzoate; p-hydroxybenzaldehyde
           dehydrogenase (PchA, HBenzADH) which catalyzes the
           oxidation of p-hydroxybenzaldehyde to p-hydroxybenzoic
           acid; vanillin dehydrogenase (Vdh, VaniDH) involved in
           the metabolism of ferulic acid as seen in Pseudomonas
           putida KT2440; and other related sequences.
          Length = 431

 Score =  140 bits (355), Expect = 6e-37
 Identities = 75/259 (28%), Positives = 123/259 (47%), Gaps = 20/259 (7%)

Query: 8   VQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEFL 67
              A     +  + P + R   L++   + EE + ++A+ L  +    + +A  FE+   
Sbjct: 6   YAAAAAAQKAWAATPPQERAAILRKAAEILEERRDEIADWLIRESGSTRPKA-AFEVGAA 64

Query: 68  ANDVRNTLNHLKQWMTPEKPGKDIANMLDGV--YIYPDPYGVCLIIGAWNYPLQLSLLPA 125
              +R         +     G+ + + + G    +   P GV  +I  +N+PL L++   
Sbjct: 65  IAILREAAG-----LPRRPEGEILPSDVPGKESMVRRVPLGVVGVISPFNFPLILAMRSV 119

Query: 126 AGAIAAGNVVILKPSEVAPAS-----AKIMAEL-LPKYLDNDTFQVVLGGVEETTE-LLK 178
           A A+A GN V+LKP    P +     A+I  E  LPK +      VV GG  E  + L++
Sbjct: 120 APALALGNAVVLKPDSRTPVTGGLLIAEIFEEAGLPKGV----LNVVPGGGSEIGDALVE 175

Query: 179 H-RFDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKC 237
           H R   I +TGST+VG+ + + A  HL  V LELGG +PL +    +++LAV    +G  
Sbjct: 176 HPRVRMISFTGSTAVGRHIGELAGRHLKKVALELGGNNPLIVLDDADLDLAVSAAAFGAF 235

Query: 238 INAGQTCIAPDYILCSRQV 256
           ++ GQ C+A   IL    V
Sbjct: 236 LHQGQICMAAGRILVHESV 254


>gnl|CDD|143432 cd07114, ALDH_DhaS, Uncharacterized Candidatus pelagibacter
           aldehyde dehydrogenase, DhaS-like.  Uncharacterized
           aldehyde dehydrogenase from Candidatus pelagibacter
           (DhaS) and other related sequences are present in this
           CD.
          Length = 457

 Score =  133 bits (337), Expect = 3e-34
 Identities = 89/269 (33%), Positives = 122/269 (45%), Gaps = 59/269 (21%)

Query: 8   VQNARDTFDSG---KSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEI 64
           V  AR  F+ G   K  P E R + L++L  L E N ++LA                 E+
Sbjct: 25  VAAARAAFEGGAWRKLTPTE-RGKLLRRLADLIEANAEELA-----------------EL 66

Query: 65  EFLAN-----DVRNTLNHLKQW--------------MTP-EKPGKDIANMLDGVYIYPDP 104
           E   N     + R  + +L +W              + P +K      + L   +   +P
Sbjct: 67  ETRDNGKLIRETRAQVRYLAEWYRYYAGLADKIEGAVIPVDKG-----DYL--NFTRREP 119

Query: 105 YGVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAEL-----LPKYLD 159
            GV   I  WN PL L     A A+AAGN V+LKPSE  PAS   +A+L      P  + 
Sbjct: 120 LGVVAAITPWNSPLLLLAKKLAPALAAGNTVVLKPSEHTPASTLELAKLAEEAGFPPGV- 178

Query: 160 NDTFQVVLGGVEETTELL-KH-RFDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPL 217
                VV G   ET E L +H     I +TG T  G+ + +AA E+L PVTLELGGKSP 
Sbjct: 179 ---VNVVTGFGPETGEALVEHPLVAKIAFTGGTETGRHIARAAAENLAPVTLELGGKSPN 235

Query: 218 YIDSSVNIELAVRRFLWGKCINAGQTCIA 246
            +    +++ AV   + G    AGQTC+A
Sbjct: 236 IVFDDADLDAAVNGVVAGIFAAAGQTCVA 264



 Score = 28.3 bits (64), Expect = 9.8
 Identities = 17/99 (17%), Positives = 38/99 (38%), Gaps = 21/99 (21%)

Query: 322 VNIELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQILNQAKAVLDSWYTEQVQGSK-- 379
            +++ AV   + G    AGQTC+A   +L    VQ  I ++    L      + +  +  
Sbjct: 241 ADLDAAVNGVVAGIFAAAGQTCVAGSRLL----VQRSIYDEFVERL----VARARAIRVG 292

Query: 380 -------HYCRIVSDKHFQRLKSLVHSS----GTIALGG 407
                      + +++  ++++  V  +      +  GG
Sbjct: 293 DPLDPETQMGPLATERQLEKVERYVARAREEGARVLTGG 331


>gnl|CDD|143468 cd07150, ALDH_VaniDH_like, Pseudomonas putida vanillin
           dehydrogenase-like.  Vanillin dehydrogenase (Vdh,
           VaniDH) involved in the metabolism of ferulic acid and
           other related  sequences are included in this CD.  The
           E. coli vanillin dehydrogenase (LigV) preferred NAD+ to
           NADP+  and exhibited a broad substrate preference,
           including vanillin,  benzaldehyde, protocatechualdehyde,
           m-anisaldehyde, and p-hydroxybenzaldehyde.
          Length = 451

 Score =  132 bits (334), Expect = 7e-34
 Identities = 80/260 (30%), Positives = 121/260 (46%), Gaps = 23/260 (8%)

Query: 8   VQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEFL 67
           +  A D F +  +     R R L +   + E    DL + L  +      +A  FE  F 
Sbjct: 27  IAAAYDAFPAWAATTPSERERILLKAAEIMERRADDLIDLLIDEGGSTYGKAW-FETTFT 85

Query: 68  ANDVRNTLN---HLK-QWMTPEKPGKDIANMLDGVYIYPDPYGVCLIIGAWNYPLQLSLL 123
              +R        ++ + +  + PG               P GV   I  +NYPL L+  
Sbjct: 86  PELLRAAAGECRRVRGETLPSDSPGTV-------SMSVRRPLGVVAGITPFNYPLILATK 138

Query: 124 PAAGAIAAGNVVILKPSEVAPASAKIMAEL-----LPKYLDNDTFQVVLGGVEET-TELL 177
             A A+AAGN V+LKPSE  P     +AE+     LPK +    F VV GG  E   EL+
Sbjct: 139 KVAFALAAGNTVVLKPSEETPVIGLKIAEIMEEAGLPKGV----FNVVTGGGAEVGDELV 194

Query: 178 KH-RFDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGK 236
              R   + +TGST+VG+ + + A  HL  +TLELGGK+PL + +  +++ AVR   +G 
Sbjct: 195 DDPRVRMVTFTGSTAVGREIAEKAGRHLKKITLELGGKNPLIVLADADLDYAVRAAAFGA 254

Query: 237 CINAGQTCIAPDYILCSRQV 256
            ++ GQ C++   I+    V
Sbjct: 255 FMHQGQICMSASRIIVEEPV 274


>gnl|CDD|143416 cd07098, ALDH_F15-22, Aldehyde dehydrogenase family 15A1 and
           22A1-like.  Aldehyde dehydrogenase family members
           ALDH15A1 (Saccharomyces cerevisiae YHR039C) and ALDH22A1
           (Arabidopsis thaliana, EC=1.2.1.3), and similar
           sequences, are in this CD. Significant improvement of
           stress tolerance in tobacco plants was observed by
           overexpressing the ALDH22A1 gene from maize (Zea mays)
           and was accompanied by a reduction of malondialdehyde
           derived from cellular lipid peroxidation.
          Length = 465

 Score =  132 bits (333), Expect = 1e-33
 Identities = 72/244 (29%), Positives = 113/244 (46%), Gaps = 6/244 (2%)

Query: 26  RRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEFLANDVRNTLNHLKQWMTPE 85
           RR+ L+ L++   ENQ+++      D  K   +A L EI      +R TL H ++ + PE
Sbjct: 42  RRKVLRSLLKYILENQEEICRVACRDTGKTMVDASLGEILVTCEKIRWTLKHGEKALRPE 101

Query: 86  KPGKDIANMLDGVYIYPDPYGVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILKPSEVAPA 145
                +        +  +P GV   I +WNYP    L P   A+ AGN +++K SE    
Sbjct: 102 SRPGGLLMFYKRARVEYEPLGVVGAIVSWNYPFHNLLGPIIAALFAGNAIVVKVSEQVAW 161

Query: 146 SAKIMAELLPKYLDN-----DTFQVVLGGVEETTELLKH-RFDYIFYTGSTSVGKIVRQA 199
           S+     ++ + L       D  Q+V    E    L  H   D+I + GS  VGK V  A
Sbjct: 162 SSGFFLSIIRECLAACGHDPDLVQLVTCLPETAEALTSHPVIDHITFIGSPPVGKKVMAA 221

Query: 200 ANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQ 259
           A E LTPV LELGGK P  +    +++      + G   ++GQ CI  + ++   ++  +
Sbjct: 222 AAESLTPVVLELGGKDPAIVLDDADLDQIASIIMRGTFQSSGQNCIGIERVIVHEKIYDK 281

Query: 260 ILNQ 263
           +L  
Sbjct: 282 LLEI 285


>gnl|CDD|143424 cd07106, ALDH_AldA-AAD23400, Streptomyces aureofaciens putative
           aldehyde dehydrogenase AldA (AAD23400)-like.  Putative
           aldehyde dehydrogenase, AldA, from Streptomyces
           aureofaciens (locus AAD23400) and other similar
           sequences are present in this CD.
          Length = 446

 Score =  131 bits (332), Expect = 1e-33
 Identities = 79/241 (32%), Positives = 108/241 (44%), Gaps = 10/241 (4%)

Query: 8   VQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEFL 67
           V  A+  F    + P E RR  L  +    E N ++LA  L  +  K   EA  FE+   
Sbjct: 25  VAAAKAAFPGWSATPLEERRAALLAIADAIEANAEELARLLTLEQGKPLAEAQ-FEVGG- 82

Query: 68  ANDVRNTLNHLKQWMTPEKPGKDIANMLDG-VYIYPDPYGVCLIIGAWNYPLQLSLLPAA 126
                  +  L+   + + P + I +     V +   P GV   I  WN+PL L+    A
Sbjct: 83  ------AVAWLRYTASLDLPDEVIEDDDTRRVELRRKPLGVVAAIVPWNFPLLLAAWKIA 136

Query: 127 GAIAAGNVVILKPSEVAPASAKIMAELLPKYLDNDTFQVVLGGVEETTELLKH-RFDYIF 185
            A+ AGN V+LKPS   P     + EL  + L      VV GG E    L  H     I 
Sbjct: 137 PALLAGNTVVLKPSPFTPLCTLKLGELAQEVLPPGVLNVVSGGDELGPALTSHPDIRKIS 196

Query: 186 YTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKCINAGQTCI 245
           +TGST+ GK V  +A + L  VTLELGG     +   V+I+    +  WG  IN+GQ C 
Sbjct: 197 FTGSTATGKKVMASAAKTLKRVTLELGGNDAAIVLPDVDIDAVAPKLFWGAFINSGQVCA 256

Query: 246 A 246
           A
Sbjct: 257 A 257



 Score = 31.7 bits (73), Expect = 0.99
 Identities = 10/24 (41%), Positives = 14/24 (58%)

Query: 322 VNIELAVRRFLWGKCINAGQTCIA 345
           V+I+    +  WG  IN+GQ C A
Sbjct: 234 VDIDAVAPKLFWGAFINSGQVCAA 257


>gnl|CDD|143412 cd07093, ALDH_F8_HMSADH, Human aldehyde dehydrogenase family 8
           member A1-like.  In humans, the  aldehyde dehydrogenase
           family 8 member A1 (ALDH8A1) protein functions to
           convert 9-cis-retinal to 9-cis-retinoic acid and has a
           preference for NAD+. Also included in this CD is the
           2-hydroxymuconic semialdehyde dehydrogenase (HMSADH)
           which catalyzes the conversion of 2-hydroxymuconic
           semialdehyde to 4-oxalocrotonate, a step in the meta
           cleavage pathway of aromatic hydrocarbons in bacteria.
           Such HMSADHs seen here are: XylG of the TOL plasmid pWW0
           of Pseudomonas putida, TomC  of Burkholderia cepacia G4,
           and AphC of Comamonas testosterone.
          Length = 455

 Score =  131 bits (332), Expect = 2e-33
 Identities = 77/248 (31%), Positives = 112/248 (45%), Gaps = 19/248 (7%)

Query: 8   VQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEFL 67
           V  A++ F          R R L ++  L E    +LA   + D  K    A   +I   
Sbjct: 25  VAAAKEAFPGWSRMSPAERARILHKVADLIEARADELALLESLDTGKPITLARTRDIPRA 84

Query: 68  ANDVRNTLNHLKQWMTPEKPGKDIANMLDGV--YIYPDPYGVCLIIGAWNYPLQLSLLPA 125
           A + R   +++ Q       G+       G   Y+   P GV  +I  WN PL L     
Sbjct: 85  AANFRFFADYILQ-----LDGESY-PQDGGALNYVLRQPVGVAGLITPWNLPLMLLTWKI 138

Query: 126 AGAIAAGNVVILKPSEVAPASAKIMAEL-----LPKYLDNDTFQVVLGGVEETTELL-KH 179
           A A+A GN V+LKPSE  P +A ++AEL     LP         VV G   E    L  H
Sbjct: 139 APALAFGNTVVLKPSEWTPLTAWLLAELANEAGLPP----GVVNVVHGFGPEAGAALVAH 194

Query: 180 -RFDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKCI 238
              D I +TG T+ G+ + +AA  +L PV+LELGGK+P  + +  +++ AV   +     
Sbjct: 195 PDVDLISFTGETATGRTIMRAAAPNLKPVSLELGGKNPNIVFADADLDRAVDAAVRSSFS 254

Query: 239 NAGQTCIA 246
           N G+ C+A
Sbjct: 255 NNGEVCLA 262


>gnl|CDD|143409 cd07090, ALDH_F9_TMBADH, NAD+-dependent
           4-trimethylaminobutyraldehyde dehydrogenase, ALDH family
           9A1.  NAD+-dependent, 4-trimethylaminobutyraldehyde
           dehydrogenase (TMABADH, EC=1.2.1.47), also known as
           aldehyde dehydrogenase family 9 member A1 (ALDH9A1) in
           humans, is a cytosolic tetramer which catalyzes the
           oxidation of gamma-aminobutyraldehyde involved in
           4-aminobutyric acid (GABA) biosynthesis  and also
           oxidizes betaine aldehyde
           (gamma-trimethylaminobutyraldehyde) which is involved in
           carnitine biosynthesis.
          Length = 457

 Score =  128 bits (324), Expect = 2e-32
 Identities = 82/252 (32%), Positives = 119/252 (47%), Gaps = 33/252 (13%)

Query: 8   VQNARDTFDS-GKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEA------V 60
           V++A+        +   E R R L++   L  E   ++A     D  K  +EA       
Sbjct: 25  VKSAKAAQKEWSATSGME-RGRILRKAADLLRERNDEIARLETIDNGKPIEEARVDIDSS 83

Query: 61  LFEIEFLANDVRNTLN--HLKQWMTPEKPGKDIANMLDGVYIYPDPYGVCLIIGAWNYPL 118
              +E+ A  +  TL+  H+        PG   A      Y   +P GVC  IGAWNYP+
Sbjct: 84  ADCLEYYAG-LAPTLSGEHV------PLPGGSFA------YTRREPLGVCAGIGAWNYPI 130

Query: 119 QLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAEL-----LPKYLDNDTFQVVLGGVEET 173
           Q++   +A A+A GN ++ KPS   P +A ++AE+     LP  +    F VV GG E  
Sbjct: 131 QIASWKSAPALACGNAMVYKPSPFTPLTALLLAEILTEAGLPDGV----FNVVQGGGETG 186

Query: 174 TELLKH-RFDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRF 232
             L +H     + +TGS   GK V  AA + +  VTLELGGKSPL I    ++E AV   
Sbjct: 187 QLLCEHPDVAKVSFTGSVPTGKKVMSAAAKGIKHVTLELGGKSPLIIFDDADLENAVNGA 246

Query: 233 LWGKCINAGQTC 244
           +    ++ GQ C
Sbjct: 247 MMANFLSQGQVC 258


>gnl|CDD|143437 cd07119, ALDH_BADH-GbsA, Bacillus subtilis NAD+-dependent betaine
           aldehyde dehydrogenase-like.  Included in this CD is the
           NAD+-dependent, betaine aldehyde dehydrogenase (BADH,
           GbsA, EC=1.2.1.8) of Bacillus subtilis involved in the
           synthesis of the osmoprotectant glycine betaine from
           choline or glycine betaine aldehyde.
          Length = 482

 Score =  128 bits (325), Expect = 2e-32
 Identities = 78/255 (30%), Positives = 117/255 (45%), Gaps = 22/255 (8%)

Query: 8   VQNARDTFDSG--KSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIE 65
           +  AR  FDSG     P + R   L ++     E+ ++LA     +  K      L E E
Sbjct: 41  IAAARRAFDSGEWPHLPAQERAALLFRIADKIREDAEELARLETLNTGK-----TLRESE 95

Query: 66  FLANDVRNTLNHLKQWMTPEKPGKDIANMLDGV--YIYPDPYGVCLIIGAWNYPLQLSLL 123
              +DV N   +     T  K   ++ ++   V      +P GVC +I  WNYPL  +  
Sbjct: 96  IDIDDVANCFRYYAGLAT--KETGEVYDVPPHVISRTVREPVGVCGLITPWNYPLLQAAW 153

Query: 124 PAAGAIAAGNVVILKPSEVAPASA----KIMAEL-LPKYLDNDTFQVVLGGVEETTELL- 177
             A A+AAGN V++KPSEV P +     +++ E  LP         +V G        L 
Sbjct: 154 KLAPALAAGNTVVIKPSEVTPLTTIALFELIEEAGLPA----GVVNLVTGSGATVGAELA 209

Query: 178 -KHRFDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGK 236
                D + +TG T+ G+ + +AA  ++  V LELGGK+P  + +  + E AV + L G 
Sbjct: 210 ESPDVDLVSFTGGTATGRSIMRAAAGNVKKVALELGGKNPNIVFADADFETAVDQALNGV 269

Query: 237 CINAGQTCIAPDYIL 251
             NAGQ C A   +L
Sbjct: 270 FFNAGQVCSAGSRLL 284


>gnl|CDD|143418 cd07100, ALDH_SSADH1_GabD1, Mycobacterium tuberculosis
           succinate-semialdehyde dehydrogenase 1-like.
           Succinate-semialdehyde dehydrogenase 1 (SSADH1, GabD1,
           EC=1.2.1.16) catalyzes the NADP(+)-dependent oxidation
           of succinate semialdehyde (SSA)  to succinate.  SSADH
           activity in Mycobacterium tuberculosis (Mtb) is encoded
           by both gabD1 (Rv0234c) and gabD2 (Rv1731).  The Mtb
           GabD1 SSADH1 reportedly is an enzyme of the
           gamma-aminobutyrate shunt, which forms a functional link
           between two TCA half-cycles by converting
           alpha-ketoglutarate to succinate.
          Length = 429

 Score =  127 bits (323), Expect = 2e-32
 Identities = 66/242 (27%), Positives = 107/242 (44%), Gaps = 7/242 (2%)

Query: 8   VQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEFL 67
           +  A   F + +   +  R   L++L  L  E + +LA  +  ++ K   EA   E+E  
Sbjct: 5   LDRAHAAFLAWRKTSFAERAALLRKLADLLRERKDELARLITLEMGKPIAEARA-EVEKC 63

Query: 68  ANDVRNTLNHLKQWMTPEKPGKDIANMLDGVYIYPDPYGVCLIIGAWNYPLQLSLLPAAG 127
           A   R    + + ++  E     I       Y+  +P GV L I  WN+P       AA 
Sbjct: 64  AWICRYYAENAEAFLADEP----IETDAGKAYVRYEPLGVVLGIMPWNFPFWQVFRFAAP 119

Query: 128 AIAAGNVVILKPSEVAPASAKIMAELLPKY-LDNDTFQVVLGGVEETTELLKH-RFDYIF 185
            + AGN V+LK +   P  A  + EL  +       FQ +L   ++   ++   R   + 
Sbjct: 120 NLMAGNTVLLKHASNVPGCALAIEELFREAGFPEGVFQNLLIDSDQVEAIIADPRVRGVT 179

Query: 186 YTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKCINAGQTCI 245
            TGS   G+ V   A ++L    LELGG  P  +    +++ AV+  + G+  NAGQ+CI
Sbjct: 180 LTGSERAGRAVAAEAGKNLKKSVLELGGSDPFIVLDDADLDKAVKTAVKGRLQNAGQSCI 239

Query: 246 AP 247
           A 
Sbjct: 240 AA 241



 Score = 36.3 bits (85), Expect = 0.037
 Identities = 10/24 (41%), Positives = 17/24 (70%)

Query: 323 NIELAVRRFLWGKCINAGQTCIAP 346
           +++ AV+  + G+  NAGQ+CIA 
Sbjct: 218 DLDKAVKTAVKGRLQNAGQSCIAA 241


>gnl|CDD|143421 cd07103, ALDH_F5_SSADH_GabD, Mitochondrial succinate-semialdehyde
           dehydrogenase and ALDH family members 5A1 and 5F1-like. 
           Succinate-semialdehyde dehydrogenase, mitochondrial
           (SSADH, GabD, EC=1.2.1.24) catalyzes the NAD+-dependent
           oxidation of succinate semialdehyde (SSA) to succinate.
           This group includes the human aldehyde dehydrogenase
           family 5 member A1 (ALDH5A1) which is a mitochondrial
           homotetramer that converts SSA to succinate in the last
           step of 4-aminobutyric acid (GABA) catabolism. This CD
           also includes the Arabidopsis SSADH gene product
           ALDH5F1. Mutations in this gene result in the
           accumulation of H2O2, suggesting a role in plant defense
           against the environmental stress of elevated reactive
           oxygen species.
          Length = 451

 Score =  127 bits (321), Expect = 5e-32
 Identities = 82/272 (30%), Positives = 116/272 (42%), Gaps = 43/272 (15%)

Query: 8   VQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEFL 67
           +  A   F + +      R   L++   L  E  +DLA  L  +  K   EA   E+++ 
Sbjct: 25  IDAAAAAFKTWRKTTARERAAILRRWADLIRERAEDLARLLTLEQGKPLAEARG-EVDYA 83

Query: 68  ANDVRNTLNHLKQWMTPEKPGKDIANMLDGVYIYPD-----------PYGVCLIIGAWNY 116
           A+ +         W   E      A  + G  I              P GV   I  WN+
Sbjct: 84  ASFLE--------WFAEE------ARRIYGRTIPSPAPGKRILVIKQPVGVVAAITPWNF 129

Query: 117 PLQLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAEL-----LPKYLDNDTFQVVLGGVE 171
           P  +     A A+AAG  V+LKP+E  P SA  +AEL     LP         VV G   
Sbjct: 130 PAAMITRKIAPALAAGCTVVLKPAEETPLSALALAELAEEAGLPA----GVLNVVTGSPA 185

Query: 172 ETTE-LLKH-RFDYIFYTGSTSVGKIV-RQAANEHLTPVTLELGGKSPLYIDSSVNIELA 228
           E  E L    R   I +TGST+VGK++  QAA + +  V+LELGG +P  +    +++ A
Sbjct: 186 EIGEALCASPRVRKISFTGSTAVGKLLMAQAA-DTVKRVSLELGGNAPFIVFDDADLDKA 244

Query: 229 VRRFLWGKCINAGQTCIAPDYILCSRQVQAQI 260
           V   +  K  NAGQTC+  + I     V   I
Sbjct: 245 VDGAIASKFRNAGQTCVCANRIY----VHESI 272



 Score = 37.4 bits (88), Expect = 0.016
 Identities = 14/46 (30%), Positives = 21/46 (45%), Gaps = 4/46 (8%)

Query: 314 APLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQI 359
           AP  +    +++ AV   +  K  NAGQTC+  + I     V   I
Sbjct: 231 APFIVFDDADLDKAVDGAIASKFRNAGQTCVCANRIY----VHESI 272


>gnl|CDD|143467 cd07149, ALDH_y4uC, Uncharacterized ALDH (y4uC) with similarity to
           Tortula ruralis aldehyde dehydrogenase ALDH21A1.
           Uncharacterized aldehyde dehydrogenase (ORF name y4uC)
           with sequence similarity to the moss Tortula ruralis
           aldehyde dehydrogenase ALDH21A1 (RNP123) believed to
           play an important role in the detoxification of
           aldehydes generated in response to desiccation- and
           salinity-stress, and similar sequences are included in
           this CD.
          Length = 453

 Score =  126 bits (319), Expect = 8e-32
 Identities = 82/271 (30%), Positives = 118/271 (43%), Gaps = 41/271 (15%)

Query: 11  ARDTFDSGKSKP-YEFRRRQLQQLVRLYEENQQDLANALA-------ADLRKHKQEAVLF 62
           A++     KS P YE R   L++  +L EE +++ A  +A        D RK    A+  
Sbjct: 30  AKEGAKEMKSLPAYE-RAEILERAAQLLEERREEFARTIALEAGKPIKDARKEVDRAIE- 87

Query: 63  EIEFLANDVRNTLNHLKQWMTPEKPGKDIANMLDGV--------YIYPDPYGVCLIIGAW 114
            +   A + +     L     P           D          +   +P GV   I  +
Sbjct: 88  TLRLSAEEAK----RLAGETIP----------FDASPGGEGRIGFTIREPIGVVAAITPF 133

Query: 115 NYPLQLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAELLPK-YLDNDTFQVVLGGVEET 173
           N+PL L       AIAAGN V+LKP+   P SA  +AELL +  L      VV G  E  
Sbjct: 134 NFPLNLVAHKVGPAIAAGNAVVLKPASQTPLSALKLAELLLEAGLPKGALNVVTGSGETV 193

Query: 174 -TELLKH-RFDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRR 231
              L+   R   I +TGS +VG+ + + A   L  VTLELG  + + +D+  ++E AV R
Sbjct: 194 GDALVTDPRVRMISFTGSPAVGEAIARKAG--LKKVTLELGSNAAVIVDADADLEKAVER 251

Query: 232 FLWGKCINAGQTCIAPDYILCSRQVQAQILN 262
            + G   NAGQ CI+   I     V   I +
Sbjct: 252 CVSGAFANAGQVCISVQRIF----VHEDIYD 278



 Score = 37.2 bits (87), Expect = 0.015
 Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 4/48 (8%)

Query: 314 APLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQILN 361
           A + +D+  ++E AV R + G   NAGQ CI+   I     V   I +
Sbjct: 235 AAVIVDADADLEKAVERCVSGAFANAGQVCISVQRIF----VHEDIYD 278



 Score = 29.1 bits (66), Expect = 5.9
 Identities = 11/30 (36%), Positives = 18/30 (60%)

Query: 412 SDRPLYIDSSVNIELAVRRFLWGKCINAGQ 441
           S+  + +D+  ++E AV R + G   NAGQ
Sbjct: 233 SNAAVIVDADADLEKAVERCVSGAFANAGQ 262


>gnl|CDD|143410 cd07091, ALDH_F1-2_Ald2-like, ALDH subfamily: ALDH families 1and 2,
           including 10-formyltetrahydrofolate dehydrogenase,
           NAD+-dependent retinal dehydrogenase 1 and related
           proteins.  ALDH subfamily which includes the
           NAD+-dependent retinal dehydrogenase 1 (RALDH 1, ALDH1,
           EC=1.2.1.36), also known as aldehyde dehydrogenase
           family 1 member A1 (ALDH1A1), in humans, a
           homotetrameric, cytosolic enzyme that catalyzes the
           oxidation of retinaldehyde to retinoic acid. Human
           ALDH1B1 and ALDH2 are also in this cluster; both are
           mitochrondrial homotetramers which play important roles
           in acetaldehyde oxidation; ALDH1B1 in response to UV
           light exposure and ALDH2 during ethanol metabolism.
           10-formyltetrahydrofolate dehydrogenase (FTHFDH,
           EC=1.5.1.6), also known as aldehyde dehydrogenase family
           1 member L1 (ALDH1L1), in humans, a multi-domain
           homotetramer with an N-terminal formyl transferase
           domain and a C-terminal ALDH domain. FTHFDH catalyzes an
           NADP+-dependent dehydrogenase reaction resulting in the
           conversion of 10-formyltetrahydrofolate to
           tetrahydrofolate and CO2. Also included in this
           subfamily is the Arabidosis aldehyde dehydrogenase
           family 2 members B4 and B7 (EC=1.2.1.3), which are
           mitochondrial, homotetramers that oxidize acetaldehyde
           and glycolaldehyde, as well as, the Arabidosis
           cytosolic, homotetramer ALDH2C4 (EC=1.2.1.3), an enzyme
           involved in the oxidation of sinapalehyde and
           coniferaldehyde. Also included is the AldA aldehyde
           dehydrogenase  of Aspergillus nidulans (locus AN0554),
           the aldehyde dehydrogenase 2 (YMR170c, ALD5, EC=1.2.1.5)
           of Saccharomyces cerevisiae, and other similar
           sequences.
          Length = 476

 Score =  126 bits (320), Expect = 8e-32
 Identities = 85/257 (33%), Positives = 121/257 (47%), Gaps = 34/257 (13%)

Query: 8   VQNARDTFDSGKSK---PYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEI 64
           V+ AR  F++G  +   P E R R L +L  L E ++ +LA   + D  K  +E+   ++
Sbjct: 47  VKAARAAFETGWWRKMDPRE-RGRLLNKLADLIERDRDELAALESLDNGKPLEESAKGDV 105

Query: 65  EFLANDVRNTLNHLKQW------MTPEKPGKDIANMLDGVYIYPDPYGVCLIIGAWNYPL 118
                     L +   W       T    G  +A      Y   +P GVC  I  WN+PL
Sbjct: 106 ----ALSIKCLRYYAGWADKIQGKTIPIDGNFLA------YTRREPIGVCGQIIPWNFPL 155

Query: 119 QLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAEL-----LPKYLDNDTFQVVLGGVEET 173
            +     A A+AAGN V+LKP+E  P SA  +AEL      P  + N     ++ G   T
Sbjct: 156 LMLAWKLAPALAAGNTVVLKPAEQTPLSALYLAELIKEAGFPPGVVN-----IVPGFGPT 210

Query: 174 T--ELLKH-RFDYIFYTGSTSVGK-IVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAV 229
               +  H   D I +TGST+VG+ I+  AA  +L  VTLELGGKSP  +    +++ AV
Sbjct: 211 AGAAISSHMDVDKIAFTGSTAVGRTIMEAAAKSNLKKVTLELGGKSPNIVFDDADLDKAV 270

Query: 230 RRFLWGKCINAGQTCIA 246
               +G   N GQ C A
Sbjct: 271 EWAAFGIFFNQGQCCCA 287


>gnl|CDD|200131 TIGR01804, BADH, glycine betaine aldehyde dehydrogenase.  Under
           osmotic stress, betaine aldehyde dehydrogenase oxidizes
           glycine betaine aldehyde into the osmoprotectant glycine
           betaine, via the second of two oxidation steps from
           exogenously supplied choline or betaine aldehyde. This
           choline-glycine betaine synthesis pathway can be found
           in gram-positive and gram-negative bacteria. In
           Escherichia coli, betaine aldehyde dehydrogenase (betB)
           is osmotically co-induced with choline dehydrogenase
           (betA) in the presence of choline. These dehydrogenases
           are located in a betaine gene cluster with the upstream
           choline transporter (betT) and transcriptional regulator
           (betI). Similar to E.coli, betaine synthesis in
           Staphylococcus xylosus is also influenced by osmotic
           stress and the presence of choline with genes localized
           in a functionally equivalent gene cluster. Organization
           of the betaine gene cluster in Sinorhizobium meliloti
           and Bacillus subtilis differs from that of E.coli by the
           absence of upstream choline transporter and
           transcriptional regulator homologues. Additionally,
           B.subtilis co-expresses a type II alcohol dehydrogenase
           with betaine aldehyde dehydrogenase instead of choline
           dehydrogenase as in E.coli, St.xylosus, and Si.meliloti.
           Betaine aldehyde dehydrogenase is a member of the
           aldehyde dehydrogenase family (pfam00171) [Cellular
           processes, Adaptations to atypical conditions].
          Length = 467

 Score =  126 bits (319), Expect = 8e-32
 Identities = 66/178 (37%), Positives = 93/178 (52%), Gaps = 11/178 (6%)

Query: 96  DGVYIYPDPYGVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAEL-- 153
              Y   +P GVC+ IGAWNYPLQ++    A A+AAGN ++ KPSE  P +A  +AE+  
Sbjct: 125 SFAYTIREPLGVCVGIGAWNYPLQIASWKIAPALAAGNAMVFKPSENTPLTALKVAEIME 184

Query: 154 ---LPKYLDNDTFQVVLGGVEETTELL-KHR-FDYIFYTGSTSVGKIVRQAANEHLTPVT 208
              LPK +    F VV G   E   LL  H     + +TG    GK +  AA  HL  VT
Sbjct: 185 EAGLPKGV----FNVVQGDGAEVGPLLVNHPDVAKVSFTGGVPTGKKIMAAAAGHLKHVT 240

Query: 209 LELGGKSPLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQILNQAKA 266
           +ELGGKSPL +    ++E AV   + G   +AGQ C     +   ++++ + L +   
Sbjct: 241 MELGGKSPLIVFDDADLESAVDGAMLGNFFSAGQVCSNGTRVFVHKKIKERFLARLVE 298


>gnl|CDD|143420 cd07102, ALDH_EDX86601, Uncharacterized aldehyde dehydrogenase of
           Synechococcus sp. PCC 7335 (EDX86601).  Uncharacterized
           aldehyde dehydrogenase of Synechococcus sp. PCC 7335
           (locus EDX86601) and other similar sequences, are
           present in this CD.
          Length = 452

 Score =  126 bits (318), Expect = 1e-31
 Identities = 70/250 (28%), Positives = 111/250 (44%), Gaps = 24/250 (9%)

Query: 8   VQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAAD----LRKHKQE--AVL 61
           ++ AR      ++ P E R+  + + V L   N  ++A  L       + +   E   +L
Sbjct: 24  LERARAAQKGWRAVPLEERKAIVTRAVELLAANTDEIAEELTWQMGRPIAQAGGEIRGML 83

Query: 62  FEIEFL---ANDVRNTLNHLKQWMTPEKPGKDIANMLDGVYIYPDPYGVCLIIGAWNYPL 118
               ++   A +       L     PEK G +        YI  +P GV LII  WNYP 
Sbjct: 84  ERARYMISIAEEA------LADIRVPEKDGFE-------RYIRREPLGVVLIIAPWNYPY 130

Query: 119 QLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAELLPKY-LDNDTFQVVLGGVEETTELL 177
             ++     A+ AGN VILK S   P   +  A    +  L    FQV+    E +  L+
Sbjct: 131 LTAVNAVIPALLAGNAVILKHSPQTPLCGERFAAAFAEAGLPEGVFQVLHLSHETSAALI 190

Query: 178 K-HRFDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGK 236
              R D++ +TGS + G+ +++AA      V LELGGK P Y+    +++ A    + G 
Sbjct: 191 ADPRIDHVSFTGSVAGGRAIQRAAAGRFIKVGLELGGKDPAYVRPDADLDAAAESLVDGA 250

Query: 237 CINAGQTCIA 246
             N+GQ+C +
Sbjct: 251 FFNSGQSCCS 260


>gnl|CDD|143426 cd07108, ALDH_MGR_2402, Magnetospirillum NAD(P)+-dependent aldehyde
           dehydrogenase MSR-1-like.  NAD(P)+-dependent aldehyde
           dehydrogenase of Magnetospirillum gryphiswaldense MSR-1
           (MGR_2402) , and other similar sequences, are present in
           this CD.
          Length = 457

 Score =  124 bits (312), Expect = 7e-31
 Identities = 79/245 (32%), Positives = 120/245 (48%), Gaps = 13/245 (5%)

Query: 8   VQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEFL 67
           V  A+  F    + P   R + L ++    E   ++LA  LA +     +     E   L
Sbjct: 25  VAAAKAAFPEWAATPARERGKLLARIADALEARSEELARLLALETGNALRTQARPEAAVL 84

Query: 68  ANDVRNTLNHLKQWMTPEKPGKDI---ANMLDGVYIYPDPYGVCLIIGAWNYPLQLSLLP 124
           A+     L      +  E  G+ +    ++L   Y   +P GV   I  WN PL L+ L 
Sbjct: 85  AD-----LFRYFGGLAGELKGETLPFGPDVLT--YTVREPLGVVGAILPWNAPLMLAALK 137

Query: 125 AAGAIAAGNVVILKPSEVAPASAKIMAELLPKYLDNDTFQVVLGGVEETTE-LLKHR-FD 182
            A A+ AGN V+LK +E AP +  ++AE+L + L      V+ G  EE    L+ H   D
Sbjct: 138 IAPALVAGNTVVLKAAEDAPLAVLLLAEILAQVLPAGVLNVITGYGEECGAALVDHPDVD 197

Query: 183 YIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWG-KCINAG 241
            + +TGST VGKI+ +AA + L PV+LELGGKSP+ +    +++ AV   + G +    G
Sbjct: 198 KVTFTGSTEVGKIIYRAAADRLIPVSLELGGKSPMIVFPDADLDDAVDGAIAGMRFTRQG 257

Query: 242 QTCIA 246
           Q+C A
Sbjct: 258 QSCTA 262


>gnl|CDD|143471 cd07559, ALDH_ACDHII_AcoD-like, Ralstonia eutrophus NAD+-dependent
           acetaldehyde dehydrogenase II and Staphylococcus aureus
           AldA1 (SACOL0154)-like.  Included in this CD is the
           NAD+-dependent, acetaldehyde dehydrogenase II (AcDHII,
           AcoD, EC=1.2.1.3) from Ralstonia (Alcaligenes) eutrophus
           H16 involved in the catabolism of acetoin and ethanol,
           and similar proteins, such as, the dimeric
           dihydrolipoamide dehydrogenase of the acetoin
           dehydrogenase enzyme system of Klebsiella pneumonia.
           Also included are sequences similar to the
           NAD+-dependent chloroacetaldehyde dehydrogenases (AldA
           and AldB) of Xanthobacter autotrophicus GJ10 which are
           involved in the degradation of 1,2-dichloroethane, as
           well as, the uncharacterized aldehyde dehydrogenase from
           Staphylococcus aureus (AldA1, locus SACOL0154) and other
           similar sequences.
          Length = 480

 Score =  123 bits (310), Expect = 2e-30
 Identities = 86/271 (31%), Positives = 127/271 (46%), Gaps = 31/271 (11%)

Query: 8   VQNARDTFDS-GKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEF 66
           V  A + F + GK+   E R   L ++    EEN + LA A   D  K  +E        
Sbjct: 44  VDAAHEAFKTWGKTSVAE-RANILNKIADRIEENLELLAVAETLDNGKPIRET------- 95

Query: 67  LANDVRNTLNHLKQWMTPEKPGKDIANMLDG---VYIYPDPYGVCLIIGAWNYPLQLSLL 123
           LA D+   ++H + +    +  +   + +D     Y + +P GV   I  WN+PL ++  
Sbjct: 96  LAADIPLAIDHFRYFAGVIRAQEGSLSEIDEDTLSYHFHEPLGVVGQIIPWNFPLLMAAW 155

Query: 124 PAAGAIAAGNVVILKPSEVAPAS----AKIMAELLPKYLDNDTFQVVLG-GVEETTELLK 178
             A A+AAGN V+LKP+   P S     +++ +LLPK + N    VV G G E    L  
Sbjct: 156 KLAPALAAGNTVVLKPASQTPLSILVLMELIGDLLPKGVVN----VVTGFGSEAGKPLAS 211

Query: 179 H-RFDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSV-----NIELAVRRF 232
           H R   + +TGST+VG+++ Q A E+L PVTLELGGKSP            + +      
Sbjct: 212 HPRIAKLAFTGSTTVGRLIMQYAAENLIPVTLELGGKSPNIFFDDAMDADDDFDDKAEEG 271

Query: 233 LWGKCINAGQTCIAPDYILCSRQVQAQILNQ 263
             G   N G+ C  P   L    VQ  I ++
Sbjct: 272 QLGFAFNQGEVCTCPSRAL----VQESIYDE 298


>gnl|CDD|143411 cd07092, ALDH_ABALDH-YdcW, Escherichia coli NAD+-dependent
           gamma-aminobutyraldehyde dehydrogenase YdcW-like.
           NAD+-dependent, tetrameric, gamma-aminobutyraldehyde
           dehydrogenase (ABALDH), YdcW of Escherichia coli K12,
           catalyzes the oxidation of gamma-aminobutyraldehyde to
           gamma-aminobutyric acid. ABALDH can also oxidize n-alkyl
           medium-chain aldehydes, but with a lower catalytic
           efficiency.
          Length = 450

 Score =  122 bits (309), Expect = 2e-30
 Identities = 81/266 (30%), Positives = 112/266 (42%), Gaps = 25/266 (9%)

Query: 8   VQNARDTFDSGK-SKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEI-- 64
           V  A   F S + + P E R + L +L    EEN ++LA   + +  K        E+  
Sbjct: 25  VAAAHAAFPSWRRTTPAE-RSKALLKLADAIEENAEELAALESRNTGKPLHLVRDDELPG 83

Query: 65  -----EFLANDVRNTLNHLKQWMTPEKPGKDIANMLDGV--YIYPDPYGVCLIIGAWNYP 117
                 F A   R               G      L G    I  +P GV   I  WNYP
Sbjct: 84  AVDNFRFFAGAARTL------------EGPAAGEYLPGHTSMIRREPIGVVAQIAPWNYP 131

Query: 118 LQLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAELLPKYLDNDTFQVVLGGVEETTELL 177
           L ++    A A+AAGN V+LKPSE  P +  ++AEL  + L      VV GG     + L
Sbjct: 132 LMMAAWKIAPALAAGNTVVLKPSETTPLTTLLLAELAAEVLPPGVVNVVCGGGASAGDAL 191

Query: 178 --KHRFDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWG 235
               R   +  TGS   GK V +AA + L  V LELGGK+P+ +    +++ AV      
Sbjct: 192 VAHPRVRMVSLTGSVRTGKKVARAAADTLKRVHLELGGKAPVIVFDDADLDAAVAGIATA 251

Query: 236 KCINAGQTCIAPDYILCSRQVQAQIL 261
              NAGQ C A   +     V  + +
Sbjct: 252 GYYNAGQDCTAACRVYVHESVYDEFV 277


>gnl|CDD|143428 cd07110, ALDH_F10_BADH, Arabidopsis betaine aldehyde dehydrogenase
           1 and 2, ALDH family 10A8 and 10A9-like.  Present in
           this CD are the Arabidopsis betaine aldehyde
           dehydrogenase (BADH) 1 (chloroplast) and 2
           (mitochondria), also known as, aldehyde dehydrogenase
           family 10 member A8 and aldehyde dehydrogenase family 10
           member A9, respectively, and are putative dehydration-
           and salt-inducible BADHs (EC 1.2.1.8) that catalyze the
           oxidation of betaine aldehyde to the compatible solute
           glycine betaine.
          Length = 456

 Score =  122 bits (308), Expect = 2e-30
 Identities = 84/279 (30%), Positives = 121/279 (43%), Gaps = 30/279 (10%)

Query: 8   VQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEA------VL 61
           V+ AR  F   K      R + L+ +     E +++LA   A D  K   EA      V 
Sbjct: 25  VRAARRAFPRWKKTTGAERAKYLRAIAEGVRERREELAELEARDNGKPLDEAAWDVDDVA 84

Query: 62  FEIEF---LANDVRNTLNHLKQWMTPEKPGKDIANMLDGVYIYPDPYGVCLIIGAWNYPL 118
              E+   LA  +               P +D         +  +P GV  +I  WN+PL
Sbjct: 85  GCFEYYADLAEQLDAKAE-----RAVPLPSEDFK-----ARVRREPVGVVGLITPWNFPL 134

Query: 119 QLSLLPAAGAIAAGNVVILKPSEVAP----ASAKIMAEL-LPKYLDNDTFQVVLGGVEET 173
            ++    A A+AAG  V+LKPSE+        A+I AE  LP         VV G  +E 
Sbjct: 135 LMAAWKVAPALAAGCTVVLKPSELTSLTELELAEIAAEAGLPP----GVLNVVTGTGDEA 190

Query: 174 -TELLKHR-FDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRR 231
              L  H   D I +TGST+ G  V QAA + + PV+LELGGKSP+ +    ++E AV  
Sbjct: 191 GAPLAAHPGIDKISFTGSTATGSQVMQAAAQDIKPVSLELGGKSPIIVFDDADLEKAVEW 250

Query: 232 FLWGKCINAGQTCIAPDYILCSRQVQAQILNQAKAVLDS 270
            ++G   N GQ C A   +L    +    L +     ++
Sbjct: 251 AMFGCFWNNGQICSATSRLLVHESIADAFLERLATAAEA 289


>gnl|CDD|143423 cd07105, ALDH_SaliADH, Salicylaldehyde dehydrogenase, DoxF-like.
           Salicylaldehyde dehydrogenase (DoxF, SaliADH,
           EC=1.2.1.65) involved in the upper naphthalene catabolic
           pathway of Pseudomonas strain C18 and other similar
           sequences are present in this CD.
          Length = 432

 Score =  121 bits (305), Expect = 5e-30
 Identities = 76/272 (27%), Positives = 123/272 (45%), Gaps = 39/272 (14%)

Query: 19  KSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEA------VLFEIEFLANDVR 72
           K+ P E RR  L +   L E  + +   A         +E         F ++  A  +R
Sbjct: 18  KTPPSE-RRDILLKAADLLESRRDEFIEA-------MMEETGATAAWAGFNVDLAAGMLR 69

Query: 73  NTLNHLKQWMTPEKPGKDIANMLDGVY--IYPDPYGVCLIIGAWNYPLQLSLLPAAGAIA 130
              +     +  +  G  I +   G    +  +P GV L I  WN P+ L     A  +A
Sbjct: 70  EAAS-----LITQIIGGSIPSDKPGTLAMVVKEPVGVVLGIAPWNAPVILGTRAIAYPLA 124

Query: 131 AGNVVILKPSEVAPASAKIMAEL-----LPKYLDNDTFQVVLGGVE---ETTE-LLKH-- 179
           AGN V+LK SE++P +  ++  +     LPK        VV    E   E  E L+ H  
Sbjct: 125 AGNTVVLKASELSPRTHWLIGRVFHEAGLPK----GVLNVVTHSPEDAPEVVEALIAHPA 180

Query: 180 -RFDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKCI 238
            R   + +TGST VG+I+ + A +HL PV LELGGK+P  +    +++ A    L+G  +
Sbjct: 181 VRK--VNFTGSTRVGRIIAETAAKHLKPVLLELGGKAPAIVLEDADLDAAANAALFGAFL 238

Query: 239 NAGQTCIAPDYILCSRQVQAQILNQAKAVLDS 270
           N+GQ C++ + I+    +  + + + KA  + 
Sbjct: 239 NSGQICMSTERIIVHESIADEFVEKLKAAAEK 270



 Score = 32.2 bits (74), Expect = 0.60
 Identities = 23/94 (24%), Positives = 41/94 (43%), Gaps = 8/94 (8%)

Query: 307 PVLEALS--APLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQILNQAK 364
           PVL  L   AP  +    +++ A    L+G  +N+GQ C++ + I+    +  + + + K
Sbjct: 206 PVLLELGGKAPAIVLEDADLDAAANAALFGAFLNSGQICMSTERIIVHESIADEFVEKLK 265

Query: 365 AVLDSWYTEQVQGSKHYCRI-VSDKHFQRLKSLV 397
           A       E++          VS     R+K LV
Sbjct: 266 AA-----AEKLFAGPVVLGSLVSAAAADRVKELV 294


>gnl|CDD|143407 cd07088, ALDH_LactADH-AldA, Escherichia coli lactaldehyde
           dehydrogenase AldA-like.  Lactaldehyde dehydrogenase
           from Escherichia coli (AldA, LactADH, EC=1.2.1.22), an
           NAD(+)-dependent enzyme involved in the metabolism of
           L-fucose and L-rhamnose, and other similar sequences are
           present in this CD.
          Length = 468

 Score =  121 bits (306), Expect = 5e-30
 Identities = 75/274 (27%), Positives = 126/274 (45%), Gaps = 31/274 (11%)

Query: 8   VQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEFL 67
           V  A     + +  P   R   L++L  L  EN  +LA  +  +  K    A + E+EF 
Sbjct: 41  VDAAEAAQKAWERLPAIERAAYLRKLADLIRENADELAKLIVEEQGKTLSLARV-EVEFT 99

Query: 68  ANDVRNTLNHLKQW--------MTPEKPGKDIANMLDGVYIYPDPYGVCLIIGAWNYPLQ 119
           A+ +     ++ +W        +  ++P ++I       +I+  P GV   I  WN+P  
Sbjct: 100 ADYID----YMAEWARRIEGEIIPSDRPNENI-------FIFKVPIGVVAGILPWNFPFF 148

Query: 120 LSLLPAAGAIAAGNVVILKPSEVAPASAKIMAEL-----LPKYLDNDTFQVVLG-GVEET 173
           L     A A+  GN +++KPSE  P +A   AEL     LP  +      +V G G    
Sbjct: 149 LIARKLAPALVTGNTIVIKPSEETPLNALEFAELVDEAGLPAGV----LNIVTGRGSVVG 204

Query: 174 TELLKH-RFDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRF 232
             L+ H +   I  TGST  G+ + +AA E++T V+LELGGK+P  +    +++LAV+  
Sbjct: 205 DALVAHPKVGMISLTGSTEAGQKIMEAAAENITKVSLELGGKAPAIVMKDADLDLAVKAI 264

Query: 233 LWGKCINAGQTCIAPDYILCSRQVQAQILNQAKA 266
           +  + IN GQ C   + +     +  + + +   
Sbjct: 265 VDSRIINCGQVCTCAERVYVHEDIYDEFMEKLVE 298



 Score = 33.0 bits (76), Expect = 0.41
 Identities = 10/52 (19%), Positives = 23/52 (44%)

Query: 314 APLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQILNQAKA 365
           AP  +    +++LAV+  +  + IN GQ C   + +     +  + + +   
Sbjct: 247 APAIVMKDADLDLAVKAIVDSRIINCGQVCTCAERVYVHEDIYDEFMEKLVE 298


>gnl|CDD|143449 cd07131, ALDH_AldH-CAJ73105, Uncharacterized Candidatus kuenenia
           aldehyde dehydrogenase AldH (CAJ73105)-like.
           Uncharacterized aldehyde dehydrogenase of Candidatus
           kuenenia AldH (locus CAJ73105) and similar sequences
           with similarity to alpha-aminoadipic semialdehyde
           dehydrogenase (AASADH, human ALDH7A1, EC=1.2.1.31),
           Arabidopsis ALDH7B4, and Streptomyces clavuligerus
           delta-1-piperideine-6-carboxylate dehydrogenase (P6CDH)
           are included in this CD.
          Length = 478

 Score =  121 bits (306), Expect = 6e-30
 Identities = 74/260 (28%), Positives = 115/260 (44%), Gaps = 23/260 (8%)

Query: 8   VQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALA-------ADLRKHKQEAV 60
           V+ AR+ F   +  P   R   L +   L ++ +++LA  +        A+ R   QEA+
Sbjct: 43  VEAAREAFPEWRKVPAPRRAEYLFRAAELLKKRKEELARLVTREMGKPLAEGRGDVQEAI 102

Query: 61  LFEIEFLANDVRNTLNHLKQWMTP-EKPGKDIANMLDGVYIYPDPYGVCLIIGAWNYPLQ 119
               ++ A + R     L     P E P KD             P GV  +I  WN+P+ 
Sbjct: 103 DM-AQYAAGEGR----RLFGETVPSELPNKD-------AMTRRQPIGVVALITPWNFPVA 150

Query: 120 LSLLPAAGAIAAGNVVILKPSEVAPASAKIMAELLPKY-LDNDTFQVVLGGVEETTE-LL 177
           +       A+  GN V+ KP+E  PA A  + EL  +  L      VV G  EE  E L+
Sbjct: 151 IPSWKIFPALVCGNTVVFKPAEDTPACALKLVELFAEAGLPPGVVNVVHGRGEEVGEALV 210

Query: 178 KH-RFDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGK 236
           +H   D + +TGST VG+ + +        V LE+GGK+P+ +    +++LA+   LW  
Sbjct: 211 EHPDVDVVSFTGSTEVGERIGETCARPNKRVALEMGGKNPIIVMDDADLDLALEGALWSA 270

Query: 237 CINAGQTCIAPDYILCSRQV 256
               GQ C A   ++    V
Sbjct: 271 FGTTGQRCTATSRLIVHESV 290


>gnl|CDD|143462 cd07144, ALDH_ALD2-YMR170C, Saccharomyces cerevisiae aldehyde
           dehydrogenase 2 (YMR170c)-like.  NAD(P)+-dependent
           Saccharomyces cerevisiae aldehyde dehydrogenase 2
           (YMR170c, ALD5, EC=1.2.1.5) and other similar sequences,
           are present in this CD.
          Length = 484

 Score =  121 bits (305), Expect = 8e-30
 Identities = 86/261 (32%), Positives = 129/261 (49%), Gaps = 14/261 (5%)

Query: 8   VQNARDTFDSGKSK-PYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEF 66
           V+ AR  F+S  SK   E R   L +L  L E+N+  LA   A D  K      L +++ 
Sbjct: 51  VKAARKAFESWWSKVTGEERGELLDKLADLVEKNRDLLAAIEALDSGKPYHSNALGDLD- 109

Query: 67  LANDVRNTLNHLKQWMTPEKPGKDIANMLDGV-YIYPDPYGVCLIIGAWNYPLQLSLLPA 125
              ++   + +   W    + GK I    + + Y   +PYGVC  I  WNYPL ++    
Sbjct: 110 ---EIIAVIRYYAGWADKIQ-GKTIPTSPNKLAYTLHEPYGVCGQIIPWNYPLAMAAWKL 165

Query: 126 AGAIAAGNVVILKPSEVAPASAKIMAELLPKY-LDNDTFQVVLG-GVEETTELLKH-RFD 182
           A A+AAGN V++KP+E  P S    A L+ +         ++ G G    + L +H   D
Sbjct: 166 APALAAGNTVVIKPAENTPLSLLYFANLVKEAGFPPGVVNIIPGYGAVAGSALAEHPDVD 225

Query: 183 YIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKCINAGQ 242
            I +TGST+ G++V +AA ++L  VTLE GGKSP  +    +++ AV+    G   N+GQ
Sbjct: 226 KIAFTGSTATGRLVMKAAAQNLKAVTLECGGKSPALVFEDADLDQAVKWAAAGIMYNSGQ 285

Query: 243 TCIAPDYILCSRQVQAQILNQ 263
            C A   I     VQ  I ++
Sbjct: 286 NCTATSRIY----VQESIYDK 302


>gnl|CDD|143413 cd07094, ALDH_F21_LactADH-like, ALDH subfamily: NAD+-dependent,
           lactaldehyde dehydrogenase, ALDH family 21 A1, and
           related proteins.  ALDH subfamily which includes Tortula
           ruralis aldehyde dehydrogenase ALDH21A1 (RNP123), and
           NAD+-dependent, lactaldehyde dehydrogenase (EC=1.2.1.22)
           and like sequences.
          Length = 453

 Score =  120 bits (302), Expect = 2e-29
 Identities = 66/250 (26%), Positives = 112/250 (44%), Gaps = 9/250 (3%)

Query: 7   LVQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEF 66
            +  AR   ++ ++ P   R   L++   L ++  ++ A  +A +  K  ++A   E++ 
Sbjct: 26  ALATARAGAENRRALPPHERMAILERAADLLKKRAEEFAKIIACEGGKPIKDA-RVEVDR 84

Query: 67  LANDVRNTLNHLKQWMTPEKPGKDIANMLDGVYIY--PDPYGVCLIIGAWNYPLQLSLLP 124
             + +R      ++    E P  D     D    +   +P GV L I  +N+PL L    
Sbjct: 85  AIDTLRLAAEEAERIRGEEIPL-DATQGSDNRLAWTIREPVGVVLAITPFNFPLNLVAHK 143

Query: 125 AAGAIAAGNVVILKPSEVAPASAKIMAELL-PKYLDNDTFQVVLG-GVEETTELLKH-RF 181
            A AIA G  V+LKP+   P SA  +A++L    +     QVV G             R 
Sbjct: 144 LAPAIATGCPVVLKPASKTPLSALELAKILVEAGVPEGVLQVVTGEREVLGDAFAADERV 203

Query: 182 DYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKCINAG 241
             + +TGS +VG+ +R  A      + LELGG +P+ +D   +++ A+     G   +AG
Sbjct: 204 AMLSFTGSAAVGEALRANAG--GKRIALELGGNAPVIVDRDADLDAAIEALAKGGFYHAG 261

Query: 242 QTCIAPDYIL 251
           Q CI+   I 
Sbjct: 262 QVCISVQRIY 271



 Score = 33.2 bits (76), Expect = 0.30
 Identities = 11/37 (29%), Positives = 19/37 (51%)

Query: 314 APLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYIL 350
           AP+ +D   +++ A+     G   +AGQ CI+   I 
Sbjct: 235 APVIVDRDADLDAAIEALAKGGFYHAGQVCISVQRIY 271


>gnl|CDD|143419 cd07101, ALDH_SSADH2_GabD2, Mycobacterium tuberculosis
           succinate-semialdehyde dehydrogenase 2-like.
           Succinate-semialdehyde dehydrogenase 2 (SSADH2) and
           similar proteins are in this CD. SSADH1 (GabD1,
           EC=1.2.1.16) catalyzes the NADP(+)-dependent oxidation
           of succinate semialdehyde to succinate.  SSADH activity
           in Mycobacterium tuberculosis is encoded by both gabD1
           (Rv0234c) and gabD2 (Rv1731), however ,the Vmax of GabD1
           was shown to be much higher than that of GabD2, and
           GabD2 (SSADH2) is likely to serve physiologically as a
           dehydrogenase for a different aldehyde(s).
          Length = 454

 Score =  120 bits (302), Expect = 2e-29
 Identities = 67/228 (29%), Positives = 102/228 (44%), Gaps = 9/228 (3%)

Query: 22  PYEFRRRQLQQLVRLYEENQQ---DLANALAADLRKHKQEAVLFEIEFLANDVRNTLNHL 78
           P+  R     +   L  E +    DL        R+H  E    E+  +A   R      
Sbjct: 38  PFAERAAVFLRFHDLVLERRDELLDLIQLETGKARRHAFE----EVLDVAIVARYYARRA 93

Query: 79  KQWMTPEKPGKDIANMLDGVYIYPDPYGVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILK 138
           ++ + P +    I  +         P GV  +I  WNYPL L++  A  A+ AGN V+LK
Sbjct: 94  ERLLKPRRRRGAIPVLTRTTVNR-RPKGVVGVISPWNYPLTLAVSDAIPALLAGNAVVLK 152

Query: 139 PSEVAPASAKIMAELLPKY-LDNDTFQVVLGGVEETTELLKHRFDYIFYTGSTSVGKIVR 197
           P      +A    ELL +  L  D +QVV G   E    +    DY+ +TGST+ G++V 
Sbjct: 153 PDSQTALTALWAVELLIEAGLPRDLWQVVTGPGSEVGGAIVDNADYVMFTGSTATGRVVA 212

Query: 198 QAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKCINAGQTCI 245
           + A   L   +LELGGK+P+ +    +++ A    +     NAGQ C+
Sbjct: 213 ERAGRRLIGCSLELGGKNPMIVLEDADLDKAAAGAVRACFSNAGQLCV 260


>gnl|CDD|143430 cd07112, ALDH_GABALDH-PuuC, Escherichia coli NADP+-dependent
           gamma-glutamyl-gamma-aminobutyraldehyde dehydrogenase
           PuuC-like.  NADP+-dependent,
           gamma-glutamyl-gamma-aminobutyraldehyde dehydrogenase
           (GABALDH) PuuC of  Escherichia coli which catalyzes the
           conversion of putrescine to 4-aminobutanoate and other
           similar sequences are present in this CD.
          Length = 462

 Score =  120 bits (302), Expect = 2e-29
 Identities = 87/292 (29%), Positives = 127/292 (43%), Gaps = 53/292 (18%)

Query: 8   VQNARDTFDSG---KSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEI 64
           V  AR  F+SG   +  P   R+  L +L  L E ++ +LA     D+ K   +A     
Sbjct: 30  VAAARRAFESGVWSRLSP-AERKAVLLRLADLIEAHRDELALLETLDMGKPISDA----- 83

Query: 65  EFLANDVRNTLNHLKQWMTPEKPGKDIANMLDGVY-------------IYPDPYGVCLII 111
             LA DV +  N   +W          A  +D VY             I  +P GV   +
Sbjct: 84  --LAVDVPSAANTF-RWY---------AEAIDKVYGEVAPTGPDALALITREPLGV---V 128

Query: 112 GA---WNYPLQLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAEL-----LPKYLDNDTF 163
           GA   WN+PL ++    A A+AAGN V+LKP+E +P +A  +AEL     LP        
Sbjct: 129 GAVVPWNFPLLMAAWKIAPALAAGNSVVLKPAEQSPLTALRLAELALEAGLPA----GVL 184

Query: 164 QVVLGGVEETTELL-KHR-FDYIFYTGSTSVGK-IVRQAANEHLTPVTLELGGKSP-LYI 219
            VV G      E L  H   D + +TGST VG+  +  +   +L  V LE GGKSP +  
Sbjct: 185 NVVPGFGHTAGEALGLHMDVDALAFTGSTEVGRRFLEYSGQSNLKRVWLECGGKSPNIVF 244

Query: 220 DSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQILNQAKAVLDSW 271
             + +++ A      G   N G+ C A   +L    ++ + L +  A    W
Sbjct: 245 ADAPDLDAAAEAAAAGIFWNQGEVCSAGSRLLVHESIKDEFLEKVVAAAREW 296



 Score = 32.6 bits (75), Expect = 0.46
 Identities = 19/106 (17%), Positives = 37/106 (34%), Gaps = 15/106 (14%)

Query: 313 SAPLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQILNQAKAVLDSWYT 372
           S  +    + +++ A      G   N G+ C A   +L    ++ + L +  A    W  
Sbjct: 239 SPNIVFADAPDLDAAAEAAAAGIFWNQGEVCSAGSRLLVHESIKDEFLEKVVAAAREWKP 298

Query: 373 ------EQVQGSKHYCRIVSDKHFQRLKSLVHSS----GTIALGGD 408
                     G+     +VS+ HF ++   + S       +  GG 
Sbjct: 299 GDPLDPATRMGA-----LVSEAHFDKVLGYIESGKAEGARLVAGGK 339


>gnl|CDD|143435 cd07117, ALDH_StaphAldA1, Uncharacterized Staphylococcus aureus
           AldA1 (SACOL0154) aldehyde dehydrogenase-like.
           Uncharacterized aldehyde dehydrogenase from
           Staphylococcus aureus (AldA1, locus SACOL0154) and other
           similar sequences are present in this CD.
          Length = 475

 Score =  118 bits (297), Expect = 8e-29
 Identities = 83/266 (31%), Positives = 126/266 (47%), Gaps = 26/266 (9%)

Query: 8   VQNARDTFDS-GKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEF 66
           V+ A++ F +  K+   E R   L ++  + +EN++ LA     D  K  +E    +I  
Sbjct: 44  VKAAQEAFKTWRKTTVAE-RANILNKIADIIDENKELLAMVETLDNGKPIRETRAVDIPL 102

Query: 67  LANDVRNTLNHLKQWMTPEKPGKDIANMLDGVY---IYPDPYGVCLIIGAWNYPLQLSLL 123
            A+       H + +    +  +  ANM+D      +  +P GV   I  WN+P  ++  
Sbjct: 103 AAD-------HFRYFAGVIRAEEGSANMIDEDTLSIVLREPIGVVGQIIPWNFPFLMAAW 155

Query: 124 PAAGAIAAGNVVILKPSEVAPAS----AKIMAELLPKYLDNDTFQVVLGGVEETTELLKH 179
             A A+AAGN V++KPS     S    AKI+ ++LPK + N    +V G   ++ E L +
Sbjct: 156 KLAPALAAGNTVVIKPSSTTSLSLLELAKIIQDVLPKGVVN----IVTGKGSKSGEYLLN 211

Query: 180 R--FDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKC 237
               D + +TGST VG+ V  AA + L P TLELGGKS   I    N + A+     G  
Sbjct: 212 HPGLDKLAFTGSTEVGRDVAIAAAKKLIPATLELGGKSANIIFDDANWDKALEGAQLGIL 271

Query: 238 INAGQTCIAPDYILCSRQVQAQILNQ 263
            N GQ C A   I     VQ  I ++
Sbjct: 272 FNQGQVCCAGSRIF----VQEGIYDE 293


>gnl|CDD|183918 PRK13252, PRK13252, betaine aldehyde dehydrogenase; Provisional.
          Length = 488

 Score =  117 bits (296), Expect = 2e-28
 Identities = 60/155 (38%), Positives = 82/155 (52%), Gaps = 10/155 (6%)

Query: 96  DGVYIYPDPYGVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAEL-- 153
             VY   +P GVC  IGAWNYP+Q++   +A A+AAGN +I KPSEV P +A  +AE+  
Sbjct: 134 SFVYTRREPLGVCAGIGAWNYPIQIACWKSAPALAAGNAMIFKPSEVTPLTALKLAEIYT 193

Query: 154 ---LPKYLDNDTFQVVLGGVEETTELLKH-RFDYIFYTGSTSVGKIVRQAANEHLTPVTL 209
              LP  +    F VV G       L +H     + +TG    GK V  AA   L  VT+
Sbjct: 194 EAGLPDGV----FNVVQGDGRVGAWLTEHPDIAKVSFTGGVPTGKKVMAAAAASLKEVTM 249

Query: 210 ELGGKSPLYIDSSVNIELAVRRFLWGKCINAGQTC 244
           ELGGKSPL +    +++ A    +     ++GQ C
Sbjct: 250 ELGGKSPLIVFDDADLDRAADIAMLANFYSSGQVC 284


>gnl|CDD|143433 cd07115, ALDH_HMSADH_HapE, Pseudomonas fluorescens 4-hydroxymuconic
           semialdehyde dehydrogenase-like.  4-hydroxymuconic
           semialdehyde dehydrogenase (HapE, EC=1.2.1.61) of
           Pseudomonas fluorescens ACB involved in
           4-hydroxyacetophenone degradation, and putative
           hydroxycaproate semialdehyde dehydrogenase (ChnE) of
           Brachymonas petroleovorans involved in cyclohexane
           metabolism, and other similar sequences, are present in
           this CD.
          Length = 453

 Score =  116 bits (293), Expect = 2e-28
 Identities = 83/260 (31%), Positives = 123/260 (47%), Gaps = 13/260 (5%)

Query: 8   VQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEFL 67
           V  AR  F++  +     R R L +L  L   N  +LA   + D  K  + A   ++   
Sbjct: 25  VAAARAAFEAWSAMDPAERGRILWRLAELILANADELARLESLDTGKPIRAARRLDVPRA 84

Query: 68  ANDVRNTLNHLKQWMTPEKPGKDI---ANMLDGVYIYPDPYGVCLIIGAWNYPLQLSLLP 124
           A+    T  +   W   +  G+ I      L+  Y   +P GV   I  WN+PL  +   
Sbjct: 85  AD----TFRYYAGWAD-KIEGEVIPVRGPFLN--YTVREPVGVVGAIVPWNFPLMFAAWK 137

Query: 125 AAGAIAAGNVVILKPSEVAPASAKIMAELLPKY-LDNDTFQVVLGGVEETTE-LLKH-RF 181
            A A+AAGN V+LKP+E+ P SA  +AEL+ +         VV G  E     L++H   
Sbjct: 138 VAPALAAGNTVVLKPAELTPLSALRIAELMAEAGFPAGVLNVVTGFGEVAGAALVEHPDV 197

Query: 182 DYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKCINAG 241
           D I +TGST+VG+ + Q A  +L  V+LELGGKS   + +  +++ AVR    G   N G
Sbjct: 198 DKITFTGSTAVGRKIMQGAAGNLKRVSLELGGKSANIVFADADLDAAVRAAATGIFYNQG 257

Query: 242 QTCIAPDYILCSRQVQAQIL 261
           Q C A   +L    +  + L
Sbjct: 258 QMCTAGSRLLVHESIYDEFL 277



 Score = 30.5 bits (69), Expect = 2.1
 Identities = 11/38 (28%), Positives = 17/38 (44%)

Query: 323 NIELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQIL 360
           +++ AVR    G   N GQ C A   +L    +  + L
Sbjct: 240 DLDAAVRAAATGIFYNQGQMCTAGSRLLVHESIYDEFL 277


>gnl|CDD|143457 cd07139, ALDH_AldA-Rv0768, Mycobacterium tuberculosis aldehyde
           dehydrogenase  AldA-like.  The Mycobacterium
           tuberculosis NAD+-dependent, aldehyde dehydrogenase  PDB
           structure,  3B4W, and the Mycobacterium tuberculosis
           H37Rv aldehyde dehydrogenase  AldA (locus Rv0768)
           sequence, as well as the Rhodococcus rhodochrous ALDH
           involved in haloalkane catabolism, and other similar
           sequences, are included in this CD.
          Length = 471

 Score =  115 bits (291), Expect = 5e-28
 Identities = 60/173 (34%), Positives = 85/173 (49%), Gaps = 10/173 (5%)

Query: 104 PYGVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAEL-----LPKYL 158
           P GV   I  WN PL L+ L  A A+AAG  V+LKPS   P  A ++AE      LP  +
Sbjct: 137 PVGVVAAIVPWNAPLFLAALKIAPALAAGCTVVLKPSPETPLDAYLLAEAAEEAGLPPGV 196

Query: 159 DNDTFQVVLGGVEETTELLKH-RFDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPL 217
            N    VV    E    L++H   D + +TGST+ G+ +     E L  VTLELGGKS  
Sbjct: 197 VN----VVPADREVGEYLVRHPGVDKVSFTGSTAAGRRIAAVCGERLARVTLELGGKSAA 252

Query: 218 YIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQILNQAKAVLDS 270
            +    +++ AV   +    +N GQ C+A   IL  R    +++    A + +
Sbjct: 253 IVLDDADLDAAVPGLVPASLMNNGQVCVALTRILVPRSRYDEVVEALAAAVAA 305



 Score = 29.1 bits (66), Expect = 6.3
 Identities = 11/46 (23%), Positives = 21/46 (45%)

Query: 324 IELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQILNQAKAVLDS 369
           ++ AV   +    +N GQ C+A   IL  R    +++    A + +
Sbjct: 260 LDAAVPGLVPASLMNNGQVCVALTRILVPRSRYDEVVEALAAAVAA 305


>gnl|CDD|143469 cd07151, ALDH_HBenzADH, NADP+-dependent p-hydroxybenzaldehyde
           dehydrogenase-like.  NADP+-dependent,
           p-hydroxybenzaldehyde dehydrogenase (PchA, HBenzADH)
           which catalyzes oxidation of p-hydroxybenzaldehyde to
           p-hydroxybenzoic acid and other related sequences are
           included in this CD.
          Length = 465

 Score =  114 bits (287), Expect = 2e-27
 Identities = 72/240 (30%), Positives = 114/240 (47%), Gaps = 21/240 (8%)

Query: 22  PYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEFLANDVRNTLNHLKQW 81
           P E R   L++  ++ EE + ++   L  +    + +A   E        R         
Sbjct: 53  PQE-RAEILEKAAQILEERRDEIVEWLIRESGSTRIKAN-IEWGAAMAITREAAT----- 105

Query: 82  MTPEKPGKDIANMLDGV--YIYPDPYGVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILKP 139
                 G+ + + + G    +Y +P GV  +I  WN+PL LS+   A A+A GN V+LKP
Sbjct: 106 FPLRMEGRILPSDVPGKENRVYREPLGVVGVISPWNFPLHLSMRSVAPALALGNAVVLKP 165

Query: 140 SEVAPAS-----AKIMAEL-LPKYLDNDTFQVVLGGVEET-TELLKHRF-DYIFYTGSTS 191
           +   P +     AKI  E  LPK +      VV+G   E     ++H     I +TGST 
Sbjct: 166 ASDTPITGGLLLAKIFEEAGLPKGV----LNVVVGAGSEIGDAFVEHPVPRLISFTGSTP 221

Query: 192 VGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYIL 251
           VG+ + + A  HL  V LELGG +P  +    +I+ AV   ++GK ++ GQ C+A + I+
Sbjct: 222 VGRHIGELAGRHLKKVALELGGNNPFVVLEDADIDAAVNAAVFGKFLHQGQICMAINRII 281


>gnl|CDD|143414 cd07095, ALDH_SGSD_AstD, N-succinylglutamate 5-semialdehyde
           dehydrogenase, AstD-like.  N-succinylglutamate
           5-semialdehyde dehydrogenase or succinylglutamic
           semialdehyde dehydrogenase (SGSD, E. coli AstD,
           EC=1.2.1.71) involved in L-arginine degradation via the
           arginine succinyltransferase (AST) pathway and catalyzes
           the NAD+-dependent reduction of succinylglutamate
           semialdehyde into succinylglutamate.
          Length = 431

 Score =  113 bits (285), Expect = 2e-27
 Identities = 67/250 (26%), Positives = 110/250 (44%), Gaps = 28/250 (11%)

Query: 8   VQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEFL 67
           V  AR  F    +   E R   L++   L + N+++LA  ++ +  K   EA   E+  +
Sbjct: 6   VAAARAAFPGWAALSLEERAAILRRFAELLKANKEELARLISRETGKPLWEA-QTEVAAM 64

Query: 68  ANDVRNTLNHLKQWMTPE-KPGKDIANMLDGVYIYPDPYGVCLIIGAWNYPLQLS---LL 123
           A  +  ++    +       P      +L        P+GV  + G +N+P  L    ++
Sbjct: 65  AGKIDISIKAYHERTGERATPMAQGRAVLRH-----RPHGVMAVFGPFNFPGHLPNGHIV 119

Query: 124 PAAGAIAAGNVVILKPSEVAPASAKIMAEL-----LPKYLDNDTFQVVLGGVEETTELLK 178
           PA   + AGN V+ KPSE+ PA A++M EL     LP         +V GG E    L  
Sbjct: 120 PA---LLAGNTVVFKPSELTPAVAELMVELWEEAGLPP----GVLNLVQGGRETGEALAA 172

Query: 179 H-RFDYIFYTGSTSVGKIVRQAANEHLTP---VTLELGGKSPLYIDSSVNIELAVRRFLW 234
           H   D + +TGS + G ++ +       P   + LE+GG +PL +    +I+ A    + 
Sbjct: 173 HEGIDGLLFTGSAATGLLLHRQFAGR--PGKILALEMGGNNPLVVWDVADIDAAAYLIVQ 230

Query: 235 GKCINAGQTC 244
              + AGQ C
Sbjct: 231 SAFLTAGQRC 240


>gnl|CDD|237391 PRK13473, PRK13473, gamma-aminobutyraldehyde dehydrogenase;
           Provisional.
          Length = 475

 Score =  113 bits (285), Expect = 4e-27
 Identities = 81/265 (30%), Positives = 110/265 (41%), Gaps = 25/265 (9%)

Query: 8   VQNARDTFDS-GKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEI-- 64
           V  A   F    ++ P E R   L +L    EEN  + A   + +  K    A+  EI  
Sbjct: 45  VAAADAAFPEWSQTTPKE-RAEALLKLADAIEENADEFARLESLNCGKPLHLALNDEIPA 103

Query: 65  -----EFLANDVRNTLNHLKQWMTPEKPGKDIANMLDG--VYIYPDPYGVCLIIGAWNYP 117
                 F A   R               GK     L+G    I  DP GV   I  WNYP
Sbjct: 104 IVDVFRFFAGAARCL------------EGKAAGEYLEGHTSMIRRDPVGVVASIAPWNYP 151

Query: 118 LQLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAELLPKYLDNDTFQVVLGGVEETTE-L 176
           L ++    A A+AAGN V+LKPSE+ P +A  +AEL    L      VV G      + L
Sbjct: 152 LMMAAWKLAPALAAGNTVVLKPSEITPLTALKLAELAADILPPGVLNVVTGRGATVGDAL 211

Query: 177 LKHR-FDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWG 235
           + H     +  TGS + GK V  AA + +    LELGGK+P+ +    +++  V      
Sbjct: 212 VGHPKVRMVSLTGSIATGKHVLSAAADSVKRTHLELGGKAPVIVFDDADLDAVVEGIRTF 271

Query: 236 KCINAGQTCIAPDYILCSRQVQAQI 260
              NAGQ C A   I   R +   +
Sbjct: 272 GYYNAGQDCTAACRIYAQRGIYDDL 296


>gnl|CDD|143405 cd07086, ALDH_F7_AASADH-like, NAD+-dependent alpha-aminoadipic
           semialdehyde dehydrogenase and related proteins.  ALDH
           subfamily which includes the NAD+-dependent,
           alpha-aminoadipic semialdehyde dehydrogenase (AASADH,
           EC=1.2.1.31), also known as Antiquitin-1, ALDH7A1,
           ALDH7B or delta-1-piperideine-6-carboxylate
           dehydrogenase (P6CDH), and other similar sequences, such
           as the uncharacterized aldehyde dehydrogenase of
           Candidatus kuenenia AldH (locus CAJ73105).
          Length = 478

 Score =  113 bits (284), Expect = 5e-27
 Identities = 59/167 (35%), Positives = 85/167 (50%), Gaps = 15/167 (8%)

Query: 85  EKPGKDIANMLDGVYIYPDPYGVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILKPSEVAP 144
           E+PG  +           +P GV  +I A+N+P+ +    AA A+  GN V+ KPSE  P
Sbjct: 121 ERPGHRL-------MEQWNPLGVVGVITAFNFPVAVPGWNAAIALVCGNTVVWKPSETTP 173

Query: 145 ASA----KIMAELLPKY-LDNDTFQVVLGGVEETTELLKH--RFDYIFYTGSTSVGKIVR 197
            +A    KI+AE+L K  L      +V GG +   ELL H  R   + +TGST VG+ V 
Sbjct: 174 LTAIAVTKILAEVLEKNGLPPGVVNLVTGGGD-GGELLVHDPRVPLVSFTGSTEVGRRVG 232

Query: 198 QAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKCINAGQTC 244
           +        V LELGG + + +    +++LAVR  L+     AGQ C
Sbjct: 233 ETVARRFGRVLLELGGNNAIIVMDDADLDLAVRAVLFAAVGTAGQRC 279


>gnl|CDD|143463 cd07145, ALDH_LactADH_F420-Bios, Methanocaldococcus jannaschii
           NAD+-dependent lactaldehyde dehydrogenase-like.
           NAD+-dependent, lactaldehyde dehydrogenase (EC=1.2.1.22)
           involved the biosynthesis of coenzyme F(420) in
           Methanocaldococcus jannaschii through the oxidation of
           lactaldehyde to lactate and generation of NAPH, and
           similar sequences are included in this CD.
          Length = 456

 Score =  112 bits (282), Expect = 7e-27
 Identities = 72/257 (28%), Positives = 115/257 (44%), Gaps = 19/257 (7%)

Query: 11  ARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEFLAND 70
           A    D   + P   R + L ++  L E  +++LA  L  ++ K  +++   E+E     
Sbjct: 30  AEKAKDVMSNLPAYKRYKILMKVAELIERRKEELAKLLTIEVGKPIKQSR-VEVERTIRL 88

Query: 71  VRNTLNHLKQWMTPEKPGKDIANMLDG--------VYIYPDPYGVCLIIGAWNYPLQLSL 122
            +      K        G+ I   +D          +   +P GV   I  +N+P  L  
Sbjct: 89  FKLAAEEAK-----VLRGETIP--VDAYEYNERRIAFTVREPIGVVGAITPFNFPANLFA 141

Query: 123 LPAAGAIAAGNVVILKPSEVAPASAKIMAELLPKY-LDNDTFQVVLGGVEET-TELLKH- 179
              A AIA GN V++KPS   P +A  +A++L +  L      VV G   E   E++ + 
Sbjct: 142 HKIAPAIAVGNSVVVKPSSNTPLTAIELAKILEEAGLPPGVINVVTGYGSEVGDEIVTNP 201

Query: 180 RFDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKCIN 239
           + + I +TGST+VG ++   A      V LELGG  P+ +    ++E AV   + G+  N
Sbjct: 202 KVNMISFTGSTAVGLLIASKAGGTGKKVALELGGSDPMIVLKDADLERAVSIAVRGRFEN 261

Query: 240 AGQTCIAPDYILCSRQV 256
           AGQ C A   IL   +V
Sbjct: 262 AGQVCNAVKRILVEEEV 278



 Score = 31.2 bits (71), Expect = 1.3
 Identities = 15/43 (34%), Positives = 22/43 (51%)

Query: 313 SAPLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQV 355
           S P+ +    ++E AV   + G+  NAGQ C A   IL   +V
Sbjct: 236 SDPMIVLKDADLERAVSIAVRGRFENAGQVCNAVKRILVEEEV 278


>gnl|CDD|143415 cd07097, ALDH_KGSADH-YcbD, Bacillus subtilis NADP+-dependent
           alpha-ketoglutaric semialdehyde dehydrogenase ycbD-like.
            Kinetic studies of the Bacillus subtilis ALDH-like ycbD
           protein, which is involved in d-glucarate/d-galactarate
           utilization, reveal that it is a NADP+-dependent,
           alpha-ketoglutaric semialdehyde dehydrogenase (KGSADH).
           KGSADHs (EC 1.2.1.26) catalyze the NAD(P)+-dependent
           conversion of KGSA to alpha-ketoglutarate.
           Interestingly, the NADP+-dependent, tetrameric,
           2,5-dioxopentanoate dehydrogenase (EC=1.2.1.26), an
           enzyme involved in the catabolic pathway for D-arabinose
           in Sulfolobus solfataricus, also clusters in this group.
           This CD shows a distant phylogenetic relationship to the
           Azospirillum brasilense KGSADH-II (-III) group.
          Length = 473

 Score =  111 bits (279), Expect = 2e-26
 Identities = 58/169 (34%), Positives = 89/169 (52%), Gaps = 13/169 (7%)

Query: 87  PGKDIANMLDG--VYIYPDPYGVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILKPSEVAP 144
            G+ + +   G  V    +P GV  +I  WN+P+ +     A A+A GN V+ KP+E+ P
Sbjct: 116 SGETLPSTRPGVEVETTREPLGVVGLITPWNFPIAIPAWKIAPALAYGNTVVFKPAELTP 175

Query: 145 ASAKIMAEL-----LPKYLDNDTFQVVLG-GVEETTELLKH-RFDYIFYTGSTSVGKIVR 197
           ASA  + E+     LP       F +V+G G E    L++H   D + +TGST+VG+ + 
Sbjct: 176 ASAWALVEILEEAGLPA----GVFNLVMGSGSEVGQALVEHPDVDAVSFTGSTAVGRRIA 231

Query: 198 QAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKCINAGQTCIA 246
            AA      V LE+GGK+PL +    +++LAV   + G   + GQ C A
Sbjct: 232 AAAAARGARVQLEMGGKNPLVVLDDADLDLAVECAVQGAFFSTGQRCTA 280


>gnl|CDD|143460 cd07142, ALDH_F2BC, Arabidosis aldehyde dehydrogenase family 2 B4,
           B7, C4-like.  Included in this CD is the Arabidosis
           aldehyde dehydrogenase family 2 members B4 and B7
           (EC=1.2.1.3),  which are mitochondrial homotetramers
           that oxidize acetaldehyde and glycolaldehyde, but not
           L-lactaldehyde. Also in this group, is the Arabidosis
           cytosolic, homotetramer ALDH2C4 (EC=1.2.1.3), an enzyme
           involved in the oxidation of sinapalehyde and
           coniferaldehyde.
          Length = 476

 Score =  111 bits (278), Expect = 2e-26
 Identities = 88/277 (31%), Positives = 124/277 (44%), Gaps = 24/277 (8%)

Query: 8   VQNARDTFDSG---KSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEI 64
           V+ AR  FD G   +   YE R R L +   L E++  +LA     D  K  ++A   E+
Sbjct: 47  VKAARKAFDEGPWPRMTGYE-RSRILLRFADLLEKHADELAALETWDNGKPYEQARYAEV 105

Query: 65  EFLANDVRNTLNHLKQWMTPEKPGKDIANM---LDG---VYIYPDPYGVCLIIGAWNYPL 118
              A   R    +   W         I  M    DG   VY   +P GV   I  WN+PL
Sbjct: 106 PLAARLFR----YYAGW------ADKIHGMTLPADGPHHVYTLHEPIGVVGQIIPWNFPL 155

Query: 119 QLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAELLPKY-LDNDTFQVVLG-GVEETTEL 176
            +       A+A GN ++LKP+E  P SA + A+L  +  L +    +V G G      +
Sbjct: 156 LMFAWKVGPALACGNTIVLKPAEQTPLSALLAAKLAAEAGLPDGVLNIVTGFGPTAGAAI 215

Query: 177 LKHR-FDYIFYTGSTSVGKIVRQ-AANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLW 234
             H   D + +TGST VGKI+ Q AA  +L PVTLELGGKSP  +    +++ AV    +
Sbjct: 216 ASHMDVDKVAFTGSTEVGKIIMQLAAKSNLKPVTLELGGKSPFIVCEDADVDKAVELAHF 275

Query: 235 GKCINAGQTCIAPDYILCSRQVQAQILNQAKAVLDSW 271
               N GQ C A         +  + + +AKA     
Sbjct: 276 ALFFNQGQCCCAGSRTFVHESIYDEFVEKAKARALKR 312


>gnl|CDD|236501 PRK09407, gabD2, succinic semialdehyde dehydrogenase; Reviewed.
          Length = 524

 Score =  111 bits (279), Expect = 3e-26
 Identities = 57/147 (38%), Positives = 81/147 (55%), Gaps = 7/147 (4%)

Query: 103 DPYGVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAELLPKY---LD 159
            P GV  +I  WNYPL L++  A  A+ AGN V+LKP    P +A    ELL  Y   L 
Sbjct: 153 QPKGVVGVISPWNYPLTLAVSDAIPALLAGNAVVLKPDSQTPLTALAAVELL--YEAGLP 210

Query: 160 NDTFQVVLG-GVEETTELLKHRFDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLY 218
            D +QVV G G    T L+ +  DY+ +TGST+ G+++ + A   L   +LELGGK+P+ 
Sbjct: 211 RDLWQVVTGPGPVVGTALVDN-ADYLMFTGSTATGRVLAEQAGRRLIGFSLELGGKNPMI 269

Query: 219 IDSSVNIELAVRRFLWGKCINAGQTCI 245
           +    +++ A    +     NAGQ CI
Sbjct: 270 VLDDADLDKAAAGAVRACFSNAGQLCI 296


>gnl|CDD|143459 cd07141, ALDH_F1AB_F2_RALDH1, NAD+-dependent retinal dehydrogenase
           1, ALDH families 1A, 1B, and 2-like.  NAD+-dependent
           retinal dehydrogenase 1 (RALDH 1, ALDH1, EC=1.2.1.36)
           also known as aldehyde dehydrogenase family 1 member A1
           (ALDH1A1) in humans, is a homotetrameric, cytosolic
           enzyme that catalyzes the oxidation of retinaldehyde to
           retinoic acid. Human ALDH1B1 and ALDH2 are also in this
           cluster; both are mitochrondrial homotetramers which
           play important roles in acetaldehyde oxidation; ALDH1B1
           in response to UV light exposure and ALDH2 during
           ethanol metabolism.
          Length = 481

 Score =  110 bits (277), Expect = 4e-26
 Identities = 82/257 (31%), Positives = 120/257 (46%), Gaps = 31/257 (12%)

Query: 8   VQNARDTFDSGKSKPYEF-----RRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLF 62
           V+ AR  F  G   P+       R R L +L  L E ++  LA+    D  K   ++ L 
Sbjct: 50  VKAARAAFKLGS--PWRTMDASERGRLLNKLADLIERDRAYLASLETLDNGKPFSKSYLV 107

Query: 63  EIEFLANDVRNTLNHLKQWMTPEKPGKDIANMLDG---VYIYPDPYGVCLIIGAWNYPLQ 119
           ++      +R       +       GK I   +DG    Y   +P GVC  I  WN+PL 
Sbjct: 108 DLPGAIKVLRYYAGWADKIH-----GKTIP--MDGDFFTYTRHEPVGVCGQIIPWNFPLL 160

Query: 120 LSLLPAAGAIAAGNVVILKPSEVAPASAKIMAELL-----PKYLDNDTFQVVLGGVEETT 174
           ++    A A+A GN V+LKP+E  P +A  +A L+     P  + N     V+ G   T 
Sbjct: 161 MAAWKLAPALACGNTVVLKPAEQTPLTALYLASLIKEAGFPPGVVN-----VVPGYGPTA 215

Query: 175 --ELLKH-RFDYIFYTGSTSVGKIVRQAA-NEHLTPVTLELGGKSPLYIDSSVNIELAVR 230
              +  H   D + +TGST VGK+++QAA   +L  VTLELGGKSP  + +  +++ AV 
Sbjct: 216 GAAISSHPDIDKVAFTGSTEVGKLIQQAAGKSNLKRVTLELGGKSPNIVFADADLDYAVE 275

Query: 231 RFLWGKCINAGQTCIAP 247
           +       N GQ C A 
Sbjct: 276 QAHEALFFNMGQCCCAG 292


>gnl|CDD|143470 cd07152, ALDH_BenzADH, NAD-dependent benzaldehyde dehydrogenase
           II-like.  NAD-dependent, benzaldehyde dehydrogenase II
           (XylC, BenzADH, EC=1.2.1.28) is involved in the
           oxidation of benzyl alcohol to benzoate. In
           Acinetobacter calcoaceticus, this process is carried out
           by the chromosomally encoded, benzyl alcohol
           dehydrogenase (xylB) and benzaldehyde dehydrogenase II
           (xylC) enzymes; whereas in Pseudomonas putida they are
           encoded by TOL plasmids.
          Length = 443

 Score =  107 bits (270), Expect = 3e-25
 Identities = 60/172 (34%), Positives = 88/172 (51%), Gaps = 13/172 (7%)

Query: 104 PYGVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILKPSEVAPAS-----AKIMAEL-LPKY 157
           P GV  +I  +N+PL L++   A A+A GN V+LKP    P S     A++  E  LP  
Sbjct: 110 PLGVVGVISPFNFPLILAMRSVAPALALGNAVVLKPDPRTPVSGGVVIARLFEEAGLPAG 169

Query: 158 LDNDTFQVVLGGVEETTELLKH-RFDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSP 216
           +      V+ GG +    L++      I +TGST+VG+ V +AA  HL  V+LELGGK+ 
Sbjct: 170 V----LHVLPGGADAGEALVEDPNVAMISFTGSTAVGRKVGEAAGRHLKKVSLELGGKNA 225

Query: 217 LYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQILNQ--AKA 266
           L +    +++LA     WG  ++ GQ C+A    L    V      +  AKA
Sbjct: 226 LIVLDDADLDLAASNGAWGAFLHQGQICMAAGRHLVHESVADAYTAKLAAKA 277


>gnl|CDD|188167 TIGR01780, SSADH, succinate-semialdehyde dehydrogenase.  Succinic
           semialdehyde dehydrogenase is one of three enzymes
           constituting 4-aminobutyrate (GABA) degradation in both
           prokaryotes and eukaryotes, catalyzing the
           (NAD(P)+)-dependent catabolism reaction of succinic
           semialdehyde to succinate for metabolism by the citric
           acid cycle. The EC number depends on the cofactor:
           1.2.1.24 for NAD only, 1.2.1.79 for NADP only, and
           1.2.1.16 if both can be used. In Escherichia coli,
           succinic semialdehyde dehydrogenase is located in an
           unidirectionally transcribed gene cluster encoding
           enzymes for GABA degradation and is suggested to be
           cotranscribed with succinic semialdehyde transaminase
           from a common promoter upstream of SSADH. Similar gene
           arrangements can be found in characterized Ralstonia
           eutropha and the genome analysis of Bacillus subtilis.
           Prokaryotic succinic semialdehyde dehydrogenases
           (1.2.1.16) share high sequence homology to characterized
           succinic semialdehyde dehydrogenases from rat and human
           (1.2.1.24), exhibiting conservation of proposed cofactor
           binding residues, and putative active sites (G-237 &
           G-242, C-293 & G-259 respectively of rat SSADH).
           Eukaryotic SSADH enzymes exclusively utilize NAD+ as a
           cofactor, exhibiting little to no NADP+ activity. While
           a NADP+ preference has been detected in prokaryotes in
           addition to both NADP+- and NAD+-dependencies as in
           E.coli, Pseudomonas, and Klebsiella pneumoniae. The
           function of this alternative SSADH currently is unknown,
           but has been suggested to play a possible role in
           4-hydroxyphenylacetic degradation. Just outside the
           scope of this model, are several sequences belonging to
           clades scoring between trusted and noise. These
           sequences may be actual SSADH enzymes, but lack
           sufficiently close characterized homologs to make a
           definitive assignment at this time. SSADH enzyme belongs
           to the aldehyde dehydrogenase family (pfam00171),
           sharing a common evolutionary origin and enzymatic
           mechanism with lactaldehyde dehydrogenase. Like in
           lactaldehyde dehydrogenase and succinate semialdehyde
           dehydrogenase, the mammalian catalytic glutamic acid and
           cysteine residues are conserved in all the enzymes of
           this family (PS00687, PS00070) [Central intermediary
           metabolism, Other].
          Length = 448

 Score =  107 bits (269), Expect = 3e-25
 Identities = 75/271 (27%), Positives = 121/271 (44%), Gaps = 44/271 (16%)

Query: 8   VQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEFL 67
           ++ A + F + ++   + R   L++   L  EN+ DLA  +  +  K  +EA   EI + 
Sbjct: 25  IRAAYEAFKTWRATTAKERSSLLRKWYNLMMENKDDLARLITLENGKPLKEAKG-EILYA 83

Query: 68  ANDVRNTLNHLKQWMTPEKPGKDIANMLDGVYIYP-----------DPYGVCLIIGAWNY 116
           A+ +        +W   E      A  + G  I              P GVC  I  WN+
Sbjct: 84  ASFL--------EWFAEE------AKRVYGDTIPSPQSDKRLIVIKQPVGVCAAITPWNF 129

Query: 117 PLQLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAEL-----LPKYLDNDTFQVVLG--- 168
           P  +    A  A+AAG  V++KP+E  P SA  +A L     +PK        V+ G   
Sbjct: 130 PAAMITRKAGAALAAGCTVVVKPAEQTPLSALALARLAEQAGIPK----GVLNVITGSRA 185

Query: 169 ---GVEETTELLKHRFDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNI 225
              G   TT  L  +   I +TGST+VGKI+ + +   +  V++ELGG +P  +    ++
Sbjct: 186 KEVGNVLTTSPLVRK---ISFTGSTNVGKILMKQSASTVKKVSMELGGNAPFIVFDDADL 242

Query: 226 ELAVRRFLWGKCINAGQTCIAPDYILCSRQV 256
           + AV   +  K  NAGQTC+  + +     +
Sbjct: 243 DQAVEGAMASKFRNAGQTCVCANRLYVHDGI 273



 Score = 36.3 bits (84), Expect = 0.035
 Identities = 11/42 (26%), Positives = 20/42 (47%)

Query: 314 APLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQV 355
           AP  +    +++ AV   +  K  NAGQTC+  + +     +
Sbjct: 232 APFIVFDDADLDQAVEGAMASKFRNAGQTCVCANRLYVHDGI 273


>gnl|CDD|215157 PLN02278, PLN02278, succinic semialdehyde dehydrogenase.
          Length = 498

 Score =  107 bits (270), Expect = 4e-25
 Identities = 78/265 (29%), Positives = 123/265 (46%), Gaps = 25/265 (9%)

Query: 6   DLVQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEA---VLF 62
           D + +A D F S        R + L++   L   N++DLA  +  +  K  +EA   V +
Sbjct: 66  DAIASAHDAFPSWSKLTASERSKILRRWYDLIIANKEDLAQLMTLEQGKPLKEAIGEVAY 125

Query: 63  E---IEFLANDVRNTLNHLKQWMTPEKPGKDIANMLDG-VYIYPDPYGVCLIIGAWNYPL 118
               +E+ A + +     +     P       +   D  + +   P GV   I  WN+PL
Sbjct: 126 GASFLEYFAEEAKRVYGDI----IP-------SPFPDRRLLVLKQPVGVVGAITPWNFPL 174

Query: 119 QLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAEL-LPKYLDNDTFQVVLGGVEETTELL 177
            +       A+AAG  V++KPSE+ P +A   AEL L   +      VV+G   E  + L
Sbjct: 175 AMITRKVGPALAAGCTVVVKPSELTPLTALAAAELALQAGIPPGVLNVVMGDAPEIGDAL 234

Query: 178 --KHRFDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWG 235
               +   I +TGST+VGK +   A   +  V+LELGG +P  +    ++++AV+  L  
Sbjct: 235 LASPKVRKITFTGSTAVGKKLMAGAAATVKRVSLELGGNAPFIVFDDADLDVAVKGALAS 294

Query: 236 KCINAGQTCIAPDYILCSRQVQAQI 260
           K  N+GQTC+  + IL    VQ  I
Sbjct: 295 KFRNSGQTCVCANRIL----VQEGI 315



 Score = 37.4 bits (87), Expect = 0.017
 Identities = 16/46 (34%), Positives = 25/46 (54%), Gaps = 4/46 (8%)

Query: 314 APLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQI 359
           AP  +    ++++AV+  L  K  N+GQTC+  + IL    VQ  I
Sbjct: 274 APFIVFDDADLDVAVKGALASKFRNSGQTCVCANRIL----VQEGI 315


>gnl|CDD|215410 PLN02766, PLN02766, coniferyl-aldehyde dehydrogenase.
          Length = 501

 Score =  106 bits (265), Expect = 2e-24
 Identities = 82/275 (29%), Positives = 125/275 (45%), Gaps = 20/275 (7%)

Query: 8   VQNARDTFDSGK---SKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEI 64
           V+ AR+ FD G       +E R R + +   L EE+ ++LA     D  K      LF +
Sbjct: 64  VKAAREAFDHGPWPRMSGFE-RGRIMMKFADLIEEHIEELAALDTIDAGK------LFAL 116

Query: 65  EFLANDVRNTLNHLKQWM-TPEKPGKDIANM---LDGVYIYPDPYGVCLIIGAWNYPLQL 120
                D+      L+ +    +K   +   M   L G Y   +P GV   I  WN+P  +
Sbjct: 117 GKAV-DIPAAAGLLRYYAGAADKIHGETLKMSRQLQG-YTLKEPIGVVGHIIPWNFPSTM 174

Query: 121 SLLPAAGAIAAGNVVILKPSEVAPASAKIMAELLPKY-LDNDTFQVVLG-GVEETTELLK 178
             +  A A+AAG  +++KP+E  P SA   A L     + +    VV G G      +  
Sbjct: 175 FFMKVAPALAAGCTMVVKPAEQTPLSALFYAHLAKLAGVPDGVINVVTGFGPTAGAAIAS 234

Query: 179 HR-FDYIFYTGSTSVG-KIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGK 236
           H   D + +TGST VG KI++ AA  +L  V+LELGGKSPL I    ++++AV   L G 
Sbjct: 235 HMDVDKVSFTGSTEVGRKIMQAAATSNLKQVSLELGGKSPLLIFDDADVDMAVDLALLGI 294

Query: 237 CINAGQTCIAPDYILCSRQVQAQILNQAKAVLDSW 271
             N G+ C+A   +     +  + + +       W
Sbjct: 295 FYNKGEICVASSRVYVQEGIYDEFVKKLVEKAKDW 329


>gnl|CDD|143438 cd07120, ALDH_PsfA-ACA09737, Pseudomonas putida aldehyde
           dehydrogenase PsfA (ACA09737)-like.  Included in this CD
           is the aldehyde dehydrogenase (PsfA, locus ACA09737) of
           Pseudomonas putida involved in furoic acid metabolism.
           Transcription of psfA was induced in response to
           2-furoic acid, furfuryl alcohol, and furfural.
          Length = 455

 Score =  104 bits (262), Expect = 3e-24
 Identities = 80/279 (28%), Positives = 124/279 (44%), Gaps = 40/279 (14%)

Query: 8   VQNARDTFDSGK-SKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEA------V 60
           +  AR  FD    +     R R L +L   +E N + LA  LA +  K   EA       
Sbjct: 25  IAAARRAFDETDWAHDPRLRARVLLELADAFEANAERLARLLALENGKILGEARFEISGA 84

Query: 61  LFEIEFLANDVRNTLNHLKQWMTPEKPGKDIANMLDGVYIYPDPYGVCLIIGAWNYPLQL 120
           + E+ + A   R     + +     +PG   + +L       +P GV  II  WN P+ L
Sbjct: 85  ISELRYYAGLARTEAGRMIEP----EPGS-FSLVLR------EPMGVAGIIVPWNSPVVL 133

Query: 121 SLLPAAGAIAAGNVVILKP----SEVAPASAKIMAEL--LPKYLDNDTFQVVLGGVEETT 174
            +   A A+AAG  V++KP    +++  A  +I+AE+  LP         VV    E  +
Sbjct: 134 LVRSLAPALAAGCTVVVKPAGQTAQINAAIIRILAEIPSLPA-------GVVNLFTESGS 186

Query: 175 ELLKH-----RFDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAV 229
           E   H       D I +TGST+ G+ +  AA   L  + LELGGK+P  +    +++ A+
Sbjct: 187 EGAAHLVASPDVDVISFTGSTATGRAIMAAAAPTLKRLGLELGGKTPCIVFDDADLDAAL 246

Query: 230 RRFLWGKCINAGQTCIAPDYILCSRQVQAQILNQAKAVL 268
            +      I AGQ C+A   +L    VQ  I ++ +  L
Sbjct: 247 PKLERALTIFAGQFCMAGSRVL----VQRSIADEVRDRL 281


>gnl|CDD|143461 cd07143, ALDH_AldA_AN0554, Aspergillus nidulans aldehyde
           dehydrogenase, AldA (AN0554)-like.  NAD(P)+-dependent
           aldehyde dehydrogenase (AldA) of Aspergillus nidulans
           (locus AN0554), and other similar sequences, are present
           in this CD.
          Length = 481

 Score =  103 bits (258), Expect = 9e-24
 Identities = 86/266 (32%), Positives = 120/266 (45%), Gaps = 12/266 (4%)

Query: 8   VQNARDTF--DSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIE 65
           V+ A   F  D G       R R L +L  L E N   LA+  A D  K    A   +  
Sbjct: 50  VEVAHAAFETDWGLKVSGSKRGRCLSKLADLMERNLDYLASIEALDNGKTFGTAKRVD-- 107

Query: 66  FLANDVRNTLNHLKQWMTPEKPGKDI-ANMLDGVYIYPDPYGVCLIIGAWNYPLQLSLLP 124
            +      T  +   W   +  G+ I  ++    Y   +P GVC  I  WN+PL +    
Sbjct: 108 -VQASAD-TFRYYGGW-ADKIHGQVIETDIKKLTYTRHEPIGVCGQIIPWNFPLLMCAWK 164

Query: 125 AAGAIAAGNVVILKPSEVAPASAKIMAELLPKY-LDNDTFQVVLG-GVEETTELLKH-RF 181
            A A+AAGN ++LKPSE+ P SA  M +L+P+         VV G G      +  H   
Sbjct: 165 IAPALAAGNTIVLKPSELTPLSALYMTKLIPEAGFPPGVINVVSGYGRTCGNAISSHMDI 224

Query: 182 DYIFYTGSTSVG-KIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKCINA 240
           D + +TGST VG K++  AA  +L  VTLELGGKSP  +    ++E AV    +G   N 
Sbjct: 225 DKVAFTGSTLVGRKVMEAAAKSNLKKVTLELGGKSPNIVFDDADLESAVVWTAYGIFFNH 284

Query: 241 GQTCIAPDYILCSRQVQAQILNQAKA 266
           GQ C A   I     +  + + + K 
Sbjct: 285 GQVCCAGSRIYVQEGIYDKFVKRFKE 310


>gnl|CDD|143442 cd07124, ALDH_PutA-P5CDH-RocA, Delta(1)-pyrroline-5-carboxylate
           dehydrogenase, RocA.  Delta(1)-pyrroline-5-carboxylate
           dehydrogenase (EC=1.5.1.12 ), RocA: a proline catabolic
           enzyme of the aldehyde dehydrogenase (ALDH) protein
           superfamily. The proline catabolic enzymes, proline
           dehydrogenase and Delta(1)-pyrroline-5-carboxylate
           dehydrogenase (P5CDH), catalyze the two-step oxidation
           of proline to glutamate; P5CDH catalyzes the oxidation
           of glutamate semialdehyde, utilizing NAD+ as the
           electron acceptor. In some bacteria, the two enzymes are
           fused into the bifunctional flavoenzyme, proline
           utilization A (PutA). In this CD, monofunctional enzyme
           sequences such as seen in the Bacillus subtilis RocA
           P5CDH are also present. These enzymes play important
           roles in cellular redox control, superoxide generation,
           and apoptosis.
          Length = 512

 Score =  103 bits (260), Expect = 9e-24
 Identities = 78/263 (29%), Positives = 117/263 (44%), Gaps = 33/263 (12%)

Query: 8   VQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEA---VLFEI 64
           VQ AR  F + +  P E R R L +   L    + +LA  +  ++ K+  EA   V   I
Sbjct: 75  VQAARAAFPTWRRTPPEERARLLLRAAALLRRRRFELAAWMVLEVGKNWAEADADVAEAI 134

Query: 65  EFLANDVRNTLNHLKQWMTPEKPGKDIANMLDGVYIYPDPYGVCLIIGAWNYPLQLSLLP 124
           +FL    R  L  L+ +     PG+D        Y+Y  P GV  +I  WN+PL +    
Sbjct: 135 DFLEYYAREMLR-LRGFPVEMVPGED------NRYVY-RPLGVGAVISPWNFPLAILAGM 186

Query: 125 AAGAIAAGNVVILKPSEVAPASAKIMAEL-----LPKYLDNDTFQVVLGGVEETTE-LLK 178
              A+  GN V+LKP+E  P  A  + E+     LP          + G  EE  + L++
Sbjct: 187 TTAALVTGNTVVLKPAEDTPVIAAKLVEILEEAGLPP----GVVNFLPGPGEEVGDYLVE 242

Query: 179 H-RFDYIFYTGSTSVG-KIVRQAAN-----EHLTPVTLELGGKSPLYIDSSVNIELAVRR 231
           H    +I +TGS  VG +I  +AA      + L  V  E+GGK+ + +D   +++ A   
Sbjct: 243 HPDVRFIAFTGSREVGLRIYERAAKVQPGQKWLKRVIAEMGGKNAIIVDEDADLDEAAEG 302

Query: 232 FLWGKCINAGQTCIAPDYILCSR 254
            +       GQ C A     CSR
Sbjct: 303 IVRSAFGFQGQKCSA-----CSR 320


>gnl|CDD|143458 cd07140, ALDH_F1L_FTFDH, 10-formyltetrahydrofolate dehydrogenase,
           ALDH family 1L.  10-formyltetrahydrofolate dehydrogenase
           (FTHFDH, EC=1.5.1.6), also known as aldehyde
           dehydrogenase family 1 member L1 (ALDH1L1) in humans, is
           a multi-domain homotetramer with an N-terminal formyl
           transferase domain and a C-terminal ALDH domain. FTHFDH
           catalyzes an NADP+-dependent dehydrogenase reaction
           resulting in the conversion of 10-formyltetrahydrofolate
           to tetrahydrofolate and CO2. The ALDH domain is also
           capable of the oxidation of short chain aldehydes to
           their corresponding acids.
          Length = 486

 Score =  103 bits (258), Expect = 1e-23
 Identities = 81/268 (30%), Positives = 125/268 (46%), Gaps = 52/268 (19%)

Query: 8   VQNARDTFDSG---KSKPYEFRRRQLQQLVRLYEENQQDLANALAAD--------LRKHK 56
           V  A++ F++G   K    + R R + +L  L EE+Q++LA   + D        L+ H 
Sbjct: 49  VAAAKEAFENGEWGKMNARD-RGRLMYRLADLMEEHQEELATIESLDSGAVYTLALKTH- 106

Query: 57  QEAVLFEIEFLANDVRNTLNHLKQW----------MTPEKPGKDIANMLDGVYIYPDPYG 106
              V   I+        T  +   W          +   +P +++           +P G
Sbjct: 107 ---VGMSIQ--------TFRYFAGWCDKIQGKTIPINQARPNRNLT------LTKREPIG 149

Query: 107 VCLIIGAWNYPLQLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAEL-----LPKYLDND 161
           VC I+  WNYPL +     A  +AAGN V+LKP++V P +A   AEL      PK + N 
Sbjct: 150 VCGIVIPWNYPLMMLAWKMAACLAAGNTVVLKPAQVTPLTALKFAELTVKAGFPKGVIN- 208

Query: 162 TFQVVLGGVEETTELLKHRFDY--IFYTGSTSVGK-IVRQAANEHLTPVTLELGGKSPLY 218
              ++ G      + L    D   + +TGST +GK I++  A  +L  V+LELGGKSPL 
Sbjct: 209 ---ILPGSGSLVGQRLSDHPDVRKLGFTGSTPIGKHIMKSCAVSNLKKVSLELGGKSPLI 265

Query: 219 IDSSVNIELAVRRFLWGKCINAGQTCIA 246
           I +  +++ AVR  +     N G+ CIA
Sbjct: 266 IFADCDMDKAVRMGMSSVFFNKGENCIA 293


>gnl|CDD|215260 PLN02467, PLN02467, betaine aldehyde dehydrogenase.
          Length = 503

 Score =  102 bits (256), Expect = 2e-23
 Identities = 83/285 (29%), Positives = 123/285 (43%), Gaps = 45/285 (15%)

Query: 8   VQNARDTFDSGKSK-----PYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLF 62
           V+ AR  F   K K         R + L+ +     E + +LA     D  K   EA   
Sbjct: 51  VEAARKAFKRNKGKDWARTTGAVRAKYLRAIAAKITERKSELAKLETLDCGKPLDEA--- 107

Query: 63  EIEFLANDVRNTLNHLKQWMTPEKPGKDIANMLDG--------------VYIYPDPYGVC 108
              +  +DV     +            D+A  LD                Y+  +P GV 
Sbjct: 108 --AWDMDDVAGCFEYYA----------DLAEALDAKQKAPVSLPMETFKGYVLKEPLGVV 155

Query: 109 LIIGAWNYPLQLSLLPAAGAIAAGNVVILKPSEVAPAS----AKIMAEL-LPKYLDNDTF 163
            +I  WNYPL ++    A A+AAG   +LKPSE+A  +    A I  E+ LP        
Sbjct: 156 GLITPWNYPLLMATWKVAPALAAGCTAVLKPSELASVTCLELADICREVGLPP----GVL 211

Query: 164 QVVLG-GVEETTELLKH-RFDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDS 221
            VV G G E    L  H   D I +TGST+ G+ +  AA + + PV+LELGGKSP+ +  
Sbjct: 212 NVVTGLGTEAGAPLASHPGVDKIAFTGSTATGRKIMTAAAQMVKPVSLELGGKSPIIVFD 271

Query: 222 SVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQILNQAKA 266
            V+++ AV   ++G     GQ C A   +L   ++ ++ L +   
Sbjct: 272 DVDLDKAVEWAMFGCFWTNGQICSATSRLLVHERIASEFLEKLVK 316


>gnl|CDD|143464 cd07146, ALDH_PhpJ, Streptomyces putative phosphonoformaldehyde
           dehydrogenase PhpJ-like.  Putative phosphonoformaldehyde
           dehydrogenase (PhpJ), an aldehyde dehydrogenase homolog
           reportedly involved in the biosynthesis of
           phosphinothricin tripeptides in Streptomyces
           viridochromogenes DSM 40736, and similar sequences are
           included in this CD.
          Length = 451

 Score =  102 bits (255), Expect = 2e-23
 Identities = 60/162 (37%), Positives = 80/162 (49%), Gaps = 5/162 (3%)

Query: 98  VYIYPDPYGVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAELLPKY 157
           ++   +P GV L I  +N+PL       A AIAA N ++LKPSE  P SA  +A+LL + 
Sbjct: 114 IFTLREPLGVVLAITPFNHPLNQVAHKIAPAIAANNRIVLKPSEKTPLSAIYLADLLYEA 173

Query: 158 -LDNDTFQVVLGGVEE-TTELLKH-RFDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGK 214
            L  D   VV G   E   EL+ H   D + +TG  +VGK +  AA        LELGG 
Sbjct: 174 GLPPDMLSVVTGEPGEIGDELITHPDVDLVTFTGGVAVGKAI--AATAGYKRQLLELGGN 231

Query: 215 SPLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQV 256
            PL +    ++E A    + G   N+GQ C A   IL    V
Sbjct: 232 DPLIVMDDADLERAATLAVAGSYANSGQRCTAVKRILVHESV 273


>gnl|CDD|143436 cd07118, ALDH_SNDH, Gluconobacter oxydans L-sorbosone
           dehydrogenase-like.  Included in this CD is the
           L-sorbosone dehydrogenase (SNDH) from Gluconobacter
           oxydans UV10. In G. oxydans,  D-sorbitol is converted to
           2-keto-L-gulonate (a precursor of L-ascorbic acid) in
           sequential oxidation steps catalyzed by a FAD-dependent,
           L-sorbose dehydrogenase and an NAD(P)+-dependent,
           L-sorbosone dehydrogenase.
          Length = 454

 Score =  102 bits (255), Expect = 3e-23
 Identities = 57/163 (34%), Positives = 85/163 (52%), Gaps = 15/163 (9%)

Query: 103 DPYGVCLIIGAWNYP-LQLS-LLPAAGAIAAGNVVILKPSEVAPASAKIMAEL-----LP 155
           +P GV  II  WN+P L LS  LP A  +AAG  V++KPSE    +  ++AEL     LP
Sbjct: 118 EPIGVVGIITPWNFPFLILSQKLPFA--LAAGCTVVVKPSEFTSGTTLMLAELLIEAGLP 175

Query: 156 KYLDNDTFQVVLG-GVEETTELLKH-RFDYIFYTGSTSVGKIVRQAANEHLTPVTLELGG 213
                    +V G G      + +H   D + +TGST VGK +  AA  +L  V+LELGG
Sbjct: 176 A----GVVNIVTGYGATVGQAMTEHPDVDMVSFTGSTRVGKAIAAAAARNLKKVSLELGG 231

Query: 214 KSPLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQV 256
           K+P  + +  +++ A    ++G   NAG+ C +   +L    +
Sbjct: 232 KNPQIVFADADLDAAADAVVFGVYFNAGECCNSGSRLLVHESI 274



 Score = 31.9 bits (73), Expect = 0.82
 Identities = 20/110 (18%), Positives = 42/110 (38%), Gaps = 15/110 (13%)

Query: 323 NIELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQILNQAKAVL------DSWYTEQVQ 376
           +++ A    ++G   NAG+ C +   +L    +    +    A        D    E   
Sbjct: 242 DLDAAADAVVFGVYFNAGECCNSGSRLLVHESIADAFVAAVVARSRKVRVGDPLDPETKV 301

Query: 377 GSKHYCRIVSDKHFQRLKSLVHSS----GTIALGGDMDASDRPLYIDSSV 422
           G+     I+++    ++   V +      T+ LGG+  AS   L+   ++
Sbjct: 302 GA-----IINEAQLAKITDYVDAGRAEGATLLLGGERLASAAGLFYQPTI 346


>gnl|CDD|182233 PRK10090, PRK10090, aldehyde dehydrogenase A; Provisional.
          Length = 409

 Score =  100 bits (250), Expect = 6e-23
 Identities = 63/228 (27%), Positives = 115/228 (50%), Gaps = 26/228 (11%)

Query: 28  RQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEFLANDVRNTLNHLKQW------ 81
           R++   +R   E   +++  +  +  K +Q A + E+ F A+     ++++ +W      
Sbjct: 2   RKIAAGIR---ERASEISALIVEEGGKIQQLAEV-EVAFTAD----YIDYMAEWARRYEG 53

Query: 82  --MTPEKPGKDIANMLDGVYIYPDPYGVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILKP 139
             +  ++PG++I        ++    GV   I  WN+P  L     A A+  GN +++KP
Sbjct: 54  EIIQSDRPGENI-------LLFKRALGVTTGILPWNFPFFLIARKMAPALLTGNTIVIKP 106

Query: 140 SEVAPASAKIMAELLPKY-LDNDTFQVVLG-GVEETTELLKH-RFDYIFYTGSTSVGKIV 196
           SE  P +A   A+++ +  L    F +VLG G     EL  + +   +  TGS S G+ +
Sbjct: 107 SEFTPNNAIAFAKIVDEIGLPKGVFNLVLGRGETVGQELAGNPKVAMVSMTGSVSAGEKI 166

Query: 197 RQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKCINAGQTC 244
             AA +++T V LELGGK+P  +    +++LAV+  +  + IN+GQ C
Sbjct: 167 MAAAAKNITKVCLELGGKAPAIVMDDADLDLAVKAIVDSRVINSGQVC 214


>gnl|CDD|143425 cd07107, ALDH_PhdK-like, Nocardioides 2-carboxybenzaldehyde
           dehydrogenase, PhdK-like.  Nocardioides sp. strain
           KP72-carboxybenzaldehyde dehydrogenase (PhdK), an enzyme
           involved in phenanthrene degradation, and other similar
           sequences, are present in this CD.
          Length = 456

 Score =  100 bits (252), Expect = 7e-23
 Identities = 73/248 (29%), Positives = 107/248 (43%), Gaps = 24/248 (9%)

Query: 8   VQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEFL 67
           V  AR  F   ++     R R L++L     E+ ++LA   A D            +  +
Sbjct: 25  VAAARAAFPEWRATTPLERARMLRELATRLREHAEELALIDALDCG--------NPVSAM 76

Query: 68  ANDVRNTLNHLKQW--MTPEKPGKDIANMLDGV-YIYPDPYGVCLIIGAWNYPLQLSLLP 124
             DV      L  +  +  E  G+ I      + Y   +PYGV   I A+N+PL  +   
Sbjct: 77  LGDVMVAAALLDYFAGLVTELKGETIPVGGRNLHYTLREPYGVVARIVAFNHPLMFAAAK 136

Query: 125 AAGAIAAGNVVILKPSEVAPASAKIMAELLPKYLDNDTFQVVLGGVEETTE-LLKHR-FD 182
            A  +AAGN V++KP E AP SA  +AEL  + L    F ++ G        L++H    
Sbjct: 137 IAAPLAAGNTVVVKPPEQAPLSALRLAELAREVLPPGVFNILPGDGATAGAALVRHPDVK 196

Query: 183 YIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVR------RFLWGK 236
            I   GS   G+ + +AA E +  VTLELGGK+ L +    + E A         F W  
Sbjct: 197 RIALIGSVPTGRAIMRAAAEGIKHVTLELGGKNALIVFPDADPEAAADAAVAGMNFTW-- 254

Query: 237 CINAGQTC 244
               GQ+C
Sbjct: 255 ---CGQSC 259


>gnl|CDD|143404 cd07085, ALDH_F6_MMSDH, Methylmalonate semialdehyde dehydrogenase
           and ALDH family members 6A1 and 6B2.  Methylmalonate
           semialdehyde dehydrogenase (MMSDH, EC=1.2.1.27)
           [acylating] from Bacillus subtilis is involved in valine
           metabolism and catalyses the NAD+- and CoA-dependent
           oxidation of methylmalonate semialdehyde into
           propionyl-CoA. Mitochondrial human MMSDH ALDH6A1 and
           Arabidopsis MMSDH ALDH6B2 are also present in this CD.
          Length = 478

 Score = 98.4 bits (246), Expect = 4e-22
 Identities = 74/280 (26%), Positives = 114/280 (40%), Gaps = 40/280 (14%)

Query: 8   VQNARDTFDSGKSKPYEFRRRQ-----LQQLVRLYEENQQDLANALAADLRKHKQEA--- 59
           V  A+  F +  + P    +RQ      +QL+   EEN  +LA  +  +  K   +A   
Sbjct: 44  VAAAKAAFPAWSATP--VLKRQQVMFKFRQLL---EENLDELARLITLEHGKTLADARGD 98

Query: 60  ---VLFEIEFLANDVRNTLNHLKQWMTPEKPGKDIANMLDGVYIYPDPYGVCLIIGAWNY 116
               L  +EF  +     + HL +    E   + I    D  Y Y  P GV   I  +N+
Sbjct: 99  VLRGLEVVEFACS-----IPHLLKGEYLENVARGI----D-TYSYRQPLGVVAGITPFNF 148

Query: 117 PLQLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAEL-----LPKYLDNDTFQVVLGGVE 171
           P  + L     AIA GN  +LKPSE  P +A  +AEL     LP         VV GG E
Sbjct: 149 PAMIPLWMFPMAIACGNTFVLKPSERVPGAAMRLAELLQEAGLPD----GVLNVVHGGKE 204

Query: 172 ETTELLKH-RFDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVR 230
               LL H     + + GST VG+ + + A  +   V    G K+   +    ++E    
Sbjct: 205 AVNALLDHPDIKAVSFVGSTPVGEYIYERAAANGKRVQALGGAKNHAVVMPDADLEQTAN 264

Query: 231 RFLWGKCINAGQTCIAPDYILC----SRQVQAQILNQAKA 266
             +      AGQ C+A    +     + +   +++ +AK 
Sbjct: 265 ALVGAAFGAAGQRCMALSVAVAVGDEADEWIPKLVERAKK 304


>gnl|CDD|143465 cd07147, ALDH_F21_RNP123, Aldehyde dehydrogenase family 21A1-like. 
           Aldehyde dehydrogenase ALDH21A1 (gene name RNP123) was
           first described in the moss Tortula ruralis and is
           believed to play an important role in the detoxification
           of aldehydes generated in response to desiccation- and
           salinity-stress, and ALDH21A1 expression represents a
           unique stress tolerance mechanism. So far, of plants,
           only the bryophyte sequence has been observed, but
           similar protein sequences from bacteria and archaea are
           also present in this CD.
          Length = 452

 Score = 98.1 bits (245), Expect = 4e-22
 Identities = 69/275 (25%), Positives = 114/275 (41%), Gaps = 37/275 (13%)

Query: 9   QNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEA------VLF 62
             A           +  R   L   V   EE  ++LA  +  +  K  ++A       + 
Sbjct: 31  VKAFRPM--RALPAHR-RAAILLHCVARLEERFEELAETIVLEAGKPIKDARGEVARAID 87

Query: 63  EIEFLANDVRNTLNHLKQWMTPEKPGKDIANMLDGV--YIYPDPYGVCLIIGAWNYPLQL 120
                A +           +  E    DI+   +G    +   P G    I  +N+PL L
Sbjct: 88  TFRIAAEEATR--------IYGEVLPLDISARGEGRQGLVRRFPIGPVSAITPFNFPLNL 139

Query: 121 SLLPAAGAIAAGNVVILKPSEVAPASAKIMAELL-----PKYLDNDTFQVVLGGVEETTE 175
                A AIAAG   +LKP+   P SA I+ E+L     PK      F V+    ++   
Sbjct: 140 VAHKVAPAIAAGCPFVLKPASRTPLSALILGEVLAETGLPK----GAFSVLPCSRDDADL 195

Query: 176 LLKH-RFDYIFYTGSTSVG-KIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFL 233
           L+   R   + +TGS +VG  +  +A  +    V LELGG + + +DS  +++ A +R +
Sbjct: 196 LVTDERIKLLSFTGSPAVGWDLKARAGKKK---VVLELGGNAAVIVDSDADLDFAAQRII 252

Query: 234 WGKCINAGQTCIAPDYILCSRQVQAQILNQAKAVL 268
           +G    AGQ+CI+   +L    V   + ++ K+ L
Sbjct: 253 FGAFYQAGQSCISVQRVL----VHRSVYDEFKSRL 283



 Score = 28.8 bits (65), Expect = 6.5
 Identities = 15/54 (27%), Positives = 30/54 (55%), Gaps = 4/54 (7%)

Query: 314 APLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQILNQAKAVL 367
           A + +DS  +++ A +R ++G    AGQ+CI+   +L    V   + ++ K+ L
Sbjct: 234 AAVIVDSDADLDFAAQRIIFGAFYQAGQSCISVQRVL----VHRSVYDEFKSRL 283


>gnl|CDD|131352 TIGR02299, HpaE, 5-carboxymethyl-2-hydroxymuconate semialdehyde
           dehydrogenase.  This model represents the dehydrogenase
           responsible for the conversion of
           5-carboxymethyl-2-hydroxymuconate semialdehyde to
           5-carboxymethyl-2-hydroxymuconate (a tricarboxylic
           acid). This is the step in the degradation of
           4-hydroxyphenylacetic acid via homoprotocatechuate
           following the oxidative opening of the aromatic ring.
          Length = 488

 Score = 98.3 bits (245), Expect = 6e-22
 Identities = 49/151 (32%), Positives = 79/151 (52%), Gaps = 3/151 (1%)

Query: 104 PYGVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAELLPKY-LDNDT 162
           P G   +I  WN P  LS    A A+A GN V+LKP+E +P +A  +AE+  +  L +  
Sbjct: 135 PVGPVGLITPWNAPFMLSTWKIAPALAFGNTVVLKPAEWSPLTAARLAEIAKEAGLPDGV 194

Query: 163 FQVVLGGVEETTE-LLKHR-FDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYID 220
           F +V G  EE  + L+ H     + +TG T+ G I+ +   + L   ++ELGGKSP+ + 
Sbjct: 195 FNLVHGFGEEAGKALVAHPDVKAVSFTGETATGSIIMRNGADTLKRFSMELGGKSPVIVF 254

Query: 221 SSVNIELAVRRFLWGKCINAGQTCIAPDYIL 251
              ++E A+   ++      G+ C A   +L
Sbjct: 255 DDADLERALDAVVFMIFSFNGERCTASSRLL 285


>gnl|CDD|215259 PLN02466, PLN02466, aldehyde dehydrogenase family 2 member.
          Length = 538

 Score = 98.0 bits (244), Expect = 8e-22
 Identities = 86/266 (32%), Positives = 118/266 (44%), Gaps = 12/266 (4%)

Query: 8   VQNARDTFDSG---KSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEI 64
           V  AR  FD G   K   YE R R L +   L E++  +LA     D  K  +++   E+
Sbjct: 101 VAAARKAFDEGPWPKMTAYE-RSRILLRFADLLEKHNDELAALETWDNGKPYEQSAKAEL 159

Query: 65  EFLANDVRNTLNHLKQWMTPEKPGKDIANMLDGVYIYPDPYGVCLIIGAWNYPLQLSLLP 124
              A   R    +   W          A+    V    +P GV   I  WN+PL +    
Sbjct: 160 PMFARLFR----YYAGWADKIHGLTVPADGPHHVQTLHEPIGVAGQIIPWNFPLLMFAWK 215

Query: 125 AAGAIAAGNVVILKPSEVAPASAKIMAELLPKY-LDNDTFQVVLG-GVEETTELLKHR-F 181
              A+A GN ++LK +E  P SA   A+LL +  L      VV G G      L  H   
Sbjct: 216 VGPALACGNTIVLKTAEQTPLSALYAAKLLHEAGLPPGVLNVVSGFGPTAGAALASHMDV 275

Query: 182 DYIFYTGSTSVGKIVRQ-AANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKCINA 240
           D + +TGST  GKIV + AA  +L PVTLELGGKSP  +    +++ AV    +    N 
Sbjct: 276 DKLAFTGSTDTGKIVLELAAKSNLKPVTLELGGKSPFIVCEDADVDKAVELAHFALFFNQ 335

Query: 241 GQTCIAPDYILCSRQVQAQILNQAKA 266
           GQ C A        +V  + + +AKA
Sbjct: 336 GQCCCAGSRTFVHERVYDEFVEKAKA 361


>gnl|CDD|143401 cd07082, ALDH_F11_NP-GAPDH, NADP+-dependent non-phosphorylating
           glyceraldehyde 3-phosphate dehydrogenase and ALDH family
           11.  NADP+-dependent non-phosphorylating glyceraldehyde
           3-phosphate dehydrogenase (NP-GAPDH, EC=1.2.1.9)
           catalyzes the irreversible oxidation of glyceraldehyde
           3-phosphate to 3-phosphoglycerate generating NADPH for
           biosynthetic reactions.  This CD also includes the
           Arabidopsis thaliana osmotic-stress-inducible ALDH
           family 11, ALDH11A3  and similar sequences. In
           autotrophic eukaryotes, NP-GAPDH generates NADPH for
           biosynthetic processes from photosynthetic
           glyceraldehyde-3-phosphate exported from the chloroplast
           and catalyzes one of the classic glycolytic bypass
           reactions unique to plants.
          Length = 473

 Score = 97.6 bits (244), Expect = 8e-22
 Identities = 78/271 (28%), Positives = 126/271 (46%), Gaps = 21/271 (7%)

Query: 14  TFDSGKSK----PYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEFLAN 69
            +D+G+      P E R   L +   L +EN++++AN L  ++ K  ++A L E++   +
Sbjct: 47  AYDAGRGWWPTMPLEERIDCLHKFADLLKENKEEVANLLMWEIGKTLKDA-LKEVDRTID 105

Query: 70  DVRNTLNHLKQWMTPEKPGKDIANMLDGVY--IYPDPYGVCLIIGAWNYPLQLSLLPAAG 127
            +R+T+  LK+      PG D      G    +  +P GV L IG +NYPL L++     
Sbjct: 106 YIRDTIEELKRLDGDSLPG-DWFPGTKGKIAQVRREPLGVVLAIGPFNYPLNLTVSKLIP 164

Query: 128 AIAAGNVVILKPSEVAPASAKIMAEL-----LPKYLDNDTFQVVLGGVEETTE-LLKH-R 180
           A+  GN V+ KP+         +AE       PK        VV G   E  + L+ H R
Sbjct: 165 ALIMGNTVVFKPATQGVLLGIPLAEAFHDAGFPK----GVVNVVTGRGREIGDPLVTHGR 220

Query: 181 FDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKCINA 240
            D I +TGST VG  +++     +  + LELGGK P  +    ++ELA +  + G    +
Sbjct: 221 IDVISFTGSTEVGNRLKKQH--PMKRLVLELGGKDPAIVLPDADLELAAKEIVKGALSYS 278

Query: 241 GQTCIAPDYILCSRQVQAQILNQAKAVLDSW 271
           GQ C A   +L    V  +++   K  +   
Sbjct: 279 GQRCTAIKRVLVHESVADELVELLKEEVAKL 309


>gnl|CDD|143434 cd07116, ALDH_ACDHII-AcoD, Ralstonia eutrophus NAD+-dependent
           acetaldehyde dehydrogenase II-like.  Included in this CD
           is the NAD+-dependent, acetaldehyde dehydrogenase II
           (AcDHII, AcoD, EC=1.2.1.3) from Ralstonia (Alcaligenes)
           eutrophus H16 involved in the catabolism of acetoin and
           ethanol, and similar proteins, such as, the dimeric
           dihydrolipoamide dehydrogenase of the acetoin
           dehydrogenase enzyme system of Klebsiella pneumonia.
           Also included are sequences similar to the
           NAD+-dependent chloroacetaldehyde dehydrogenases (AldA
           and AldB) of Xanthobacter autotrophicus GJ10 which are
           involved in the degradation of 1,2-dichloroethane. These
           proteins apparently require RpoN factors for expression.
          Length = 479

 Score = 97.5 bits (243), Expect = 9e-22
 Identities = 71/216 (32%), Positives = 109/216 (50%), Gaps = 15/216 (6%)

Query: 6   DLVQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIE 65
           D    A++ +  GK+   E R   L ++    E N + LA A   D  K  +E       
Sbjct: 45  DAAHAAKEAW--GKTSVAE-RANILNKIADRMEANLEMLAVAETWDNGKPVRET------ 95

Query: 66  FLANDVRNTLNHLKQWMTPEKPGKDIANMLDG---VYIYPDPYGVCLIIGAWNYPLQLSL 122
            LA D+   ++H + +    +  +   + +D     Y + +P GV   I  WN+PL ++ 
Sbjct: 96  -LAADIPLAIDHFRYFAGCIRAQEGSISEIDENTVAYHFHEPLGVVGQIIPWNFPLLMAT 154

Query: 123 LPAAGAIAAGNVVILKPSEVAPASAKIMAELLPKYLDNDTFQVVLG-GVEETTELLKH-R 180
              A A+AAGN V+LKP+E  PAS  ++ EL+   L      VV G G+E    L    R
Sbjct: 155 WKLAPALAAGNCVVLKPAEQTPASILVLMELIGDLLPPGVVNVVNGFGLEAGKPLASSKR 214

Query: 181 FDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSP 216
              + +TG T+ G+++ Q A+E++ PVTLELGGKSP
Sbjct: 215 IAKVAFTGETTTGRLIMQYASENIIPVTLELGGKSP 250


>gnl|CDD|143431 cd07113, ALDH_PADH_NahF, Escherichia coli NAD+-dependent
           phenylacetaldehyde dehydrogenase PadA-like.
           NAD+-dependent, homodimeric, phenylacetaldehyde
           dehydrogenase (PADH, EC=1.2.1.39) PadA of Escherichia
           coli involved in the catabolism of 2-phenylethylamine,
           and other related sequences, are present in this CD.
           Also included is the Pseudomonas fluorescens ST StyD
           PADH involved in styrene catabolism, the Sphingomonas
           sp. LB126 FldD protein involved in fluorene degradation,
           and the Novosphingobium aromaticivorans NahF
           salicylaldehyde dehydrogenase involved in the
           NAD+-dependent conversion of salicylaldehyde to
           salicylate.
          Length = 477

 Score = 96.7 bits (241), Expect = 2e-21
 Identities = 75/274 (27%), Positives = 124/274 (45%), Gaps = 17/274 (6%)

Query: 8   VQNARDTFDS--GKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIE 65
           V +A   F S   K+ P E R R L +L  L E++ ++LA        K    +  FE+ 
Sbjct: 43  VASAWRAFVSAWAKTTPAE-RGRILLRLADLIEQHGEELAQLETLCSGKSIHLSRAFEVG 101

Query: 66  FLANDVRNTLNHLKQWMTPEKPGKDIANMLDGV-------YIYPDPYGVCLIIGAWNYPL 118
             AN +R    +   W T +  G+ +A  +  +       +   +P GV   I  WN+ +
Sbjct: 102 QSANFLR----YFAGWAT-KINGETLAPSIPSMQGERYTAFTRREPVGVVAGIVPWNFSV 156

Query: 119 QLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAELLPKY-LDNDTFQVVLGGVEETTELL 177
            +++     A+A G  +++KPSE  P +   +AEL  +  + +    VV G      +L+
Sbjct: 157 MIAVWKIGAALATGCTIVIKPSEFTPLTLLRVAELAKEAGIPDGVLNVVNGKGAVGAQLI 216

Query: 178 KH-RFDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGK 236
            H     + +TGS + GK + + A   LT VTLELGGK+        +I+  V   L   
Sbjct: 217 SHPDVAKVSFTGSVATGKKIGRQAASDLTRVTLELGGKNAAAFLKDADIDWVVEGLLTAG 276

Query: 237 CINAGQTCIAPDYILCSRQVQAQILNQAKAVLDS 270
            ++ GQ C AP+     R    +++ + K  L S
Sbjct: 277 FLHQGQVCAAPERFYVHRSKFDELVTKLKQALSS 310


>gnl|CDD|143448 cd07130, ALDH_F7_AASADH, NAD+-dependent alpha-aminoadipic
           semialdehyde dehydrogenase, ALDH family members 7A1 and
           7B.  Alpha-aminoadipic semialdehyde dehydrogenase
           (AASADH, EC=1.2.1.31), also known as ALDH7A1,
           Antiquitin-1, ALDH7B, or
           delta-1-piperideine-6-carboxylate dehydrogenase (P6CDH),
           is a NAD+-dependent ALDH. Human ALDH7A1 is involved in
           the pipecolic acid pathway of lysine catabolism,
           catalyzing the oxidation of alpha-aminoadipic
           semialdehyde to alpha-aminoadipate.  Arabidopsis
           thaliana ALDH7B4 appears to be an
           osmotic-stress-inducible ALDH gene encoding a
           turgor-responsive or stress-inducible ALDH. The
           Streptomyces clavuligerus P6CDH appears to be involved
           in cephamycin biosynthesis, catalyzing the second stage
           of the two-step conversion of lysine to
           alpha-aminoadipic acid.  The ALDH7A1 enzyme and others
           in this group have been observed as tetramers, yet the
           bacterial P6CDH enzyme has been reported as a monomer.
          Length = 474

 Score = 93.4 bits (233), Expect = 2e-20
 Identities = 54/147 (36%), Positives = 77/147 (52%), Gaps = 6/147 (4%)

Query: 104 PYGVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILKPSEVAP----ASAKIMAELLPKY-L 158
           P GV  +I A+N+P+ +    AA A+  GNVV+ KPS   P    A  KI+A +L K  L
Sbjct: 132 PLGVVGVITAFNFPVAVWGWNAAIALVCGNVVVWKPSPTTPLTAIAVTKIVARVLEKNGL 191

Query: 159 DNDTFQVVLGGVEETTELLKH-RFDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPL 217
                 +V GG +    L+K  R   + +TGST+VG+ V QA         LELGG + +
Sbjct: 192 PGAIASLVCGGADVGEALVKDPRVPLVSFTGSTAVGRQVGQAVAARFGRSLLELGGNNAI 251

Query: 218 YIDSSVNIELAVRRFLWGKCINAGQTC 244
            +    +++LAVR  L+     AGQ C
Sbjct: 252 IVMEDADLDLAVRAVLFAAVGTAGQRC 278


>gnl|CDD|200087 TIGR01237, D1pyr5carbox2, delta-1-pyrroline-5-carboxylate
           dehydrogenase, group 2, putative.  This enzyme is the
           second of two in the degradation of proline to
           glutamate. This model represents one of several related
           branches of delta-1-pyrroline-5-carboxylate
           dehydrogenase. Members of this branch may be associated
           with proline dehydrogenase (the other enzyme of the
           pathway from proline to glutamate) but have not been
           demonstrated experimentally. The branches are not as
           closely related to each other as some distinct aldehyde
           dehydrogenases are to some; separate models were built
           to let each model describe a set of equivalogs [Energy
           metabolism, Amino acids and amines].
          Length = 511

 Score = 93.4 bits (232), Expect = 3e-20
 Identities = 76/261 (29%), Positives = 112/261 (42%), Gaps = 24/261 (9%)

Query: 8   VQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEA---VLFEI 64
           +Q A   F++ K    E R   L +   +    + + +  L  ++ K   EA   V   I
Sbjct: 75  LQAAAKAFEAWKKTDPEERAAILFKAAAIVRRRRHEFSALLVKEVGKPWNEADAEVAEAI 134

Query: 65  EFLANDVRNTLNHLKQWMTPEKPGKDIANMLDGVYIYPDPYGVCLIIGAWNYPLQLSLLP 124
           +F+    R  +   K      + G+         Y+Y  P GV ++I  WN+P  + +  
Sbjct: 135 DFMEYYARQMIELAKGKPVNSREGET------NQYVY-TPTGVTVVISPWNFPFAIMVGM 187

Query: 125 AAGAIAAGNVVILKPSEVAPASAKIMAELLPKY-LDNDTFQVVLG-GVEETTELLKH-RF 181
               I  GN V+LKP+E AP  A    E+L +  L     Q V G G E    L+ H + 
Sbjct: 188 TVAPIVTGNCVVLKPAEAAPVIAAKFVEILEEAGLPKGVVQFVPGSGSEVGDYLVDHPKT 247

Query: 182 DYIFYTGSTSVG-KIVRQAA-----NEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWG 235
             I +TGS  VG +I  +AA      +HL  V  E+GGK  + +D   +IELA +     
Sbjct: 248 SLITFTGSREVGTRIFERAAKVQPGQKHLKRVIAEMGGKDTVIVDEDADIELAAQSAFTS 307

Query: 236 KCINAGQTCIAPDYILCSRQV 256
               AGQ C A      SR V
Sbjct: 308 AFGFAGQKCSA-----GSRAV 323


>gnl|CDD|182108 PRK09847, PRK09847, gamma-glutamyl-gamma-aminobutyraldehyde
           dehydrogenase; Provisional.
          Length = 494

 Score = 91.9 bits (228), Expect = 7e-20
 Identities = 88/286 (30%), Positives = 126/286 (44%), Gaps = 41/286 (14%)

Query: 8   VQNARDTFDSG---KSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEI 64
           V  AR  F+ G    S P + R+  L +L  L E + ++LA     D  K  + ++  +I
Sbjct: 63  VSAARGVFERGDWSLSSPAK-RKAVLNKLADLMEAHAEELALLETLDTGKPIRHSLRDDI 121

Query: 65  EFLANDVRNTLNHLKQWMTPEKPGKDIANMLDGVY-------------IYPDPYGVCLII 111
              A  +R        W          A  +D VY             I  +P GV   I
Sbjct: 122 PGAARAIR--------WY---------AEAIDKVYGEVATTSSHELAMIVREPVGVIAAI 164

Query: 112 GAWNYPLQLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAELLPKY-LDNDTFQVVLG-G 169
             WN+PL L+      A+AAGN VILKPSE +P SA  +A L  +  L +    VV G G
Sbjct: 165 VPWNFPLLLTCWKLGPALAAGNSVILKPSEKSPLSAIRLAGLAKEAGLPDGVLNVVTGFG 224

Query: 170 VEETTELLKH-RFDYIFYTGSTSVGK-IVRQAANEHLTPVTLELGGKSP--LYIDSSVNI 225
            E    L +H   D I +TGST  GK +++ A + ++  V LE GGKS   ++ D   ++
Sbjct: 225 HEAGQALSRHNDIDAIAFTGSTRTGKQLLKDAGDSNMKRVWLEAGGKSANIVFADCP-DL 283

Query: 226 ELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQILNQAKAVLDSW 271
           + A      G   N GQ CIA   +L    +  + L   K    +W
Sbjct: 284 QQAASATAAGIFYNQGQVCIAGTRLLLEESIADEFLALLKQQAQNW 329


>gnl|CDD|143429 cd07111, ALDH_F16, Aldehyde dehydrogenase family 16A1-like.
           Uncharacterized aldehyde dehydrogenase family 16 member
           A1 (ALDH16A1) and other related sequences are present in
           this CD. The active site cysteine and glutamate residues
           are not conserved in the human ALDH16A1 protein
           sequence.
          Length = 480

 Score = 90.5 bits (225), Expect = 2e-19
 Identities = 51/151 (33%), Positives = 73/151 (48%), Gaps = 10/151 (6%)

Query: 102 PDPYGVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAEL-----LPK 156
             P GV   I  WN+PL +       A+A GN V+LKP+E  P +A + AE+     LP 
Sbjct: 145 WKPVGVVGQIVPWNFPLLMLAWKICPALAMGNTVVLKPAEYTPLTALLFAEICAEAGLPP 204

Query: 157 YLDNDTFQVVLGGVEETTELLKH-RFDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKS 215
                   +V G     + L  H   D + +TGST VG+ +R+A       ++LELGGKS
Sbjct: 205 ----GVLNIVTGNGSFGSALANHPGVDKVAFTGSTEVGRALRRATAGTGKKLSLELGGKS 260

Query: 216 PLYIDSSVNIELAVRRFLWGKCINAGQTCIA 246
           P  +    +++ AV   +     N GQ C A
Sbjct: 261 PFIVFDDADLDSAVEGIVDAIWFNQGQVCCA 291


>gnl|CDD|132284 TIGR03240, arg_catab_astD, succinylglutamic semialdehyde
           dehydrogenase.  Members of this protein family are
           succinylglutamic semialdehyde dehydrogenase (EC
           1.2.1.71), the fourth enzyme in the arginine
           succinyltransferase (AST) pathway for arginine
           catabolism [Energy metabolism, Amino acids and amines].
          Length = 484

 Score = 90.6 bits (225), Expect = 2e-19
 Identities = 74/259 (28%), Positives = 117/259 (45%), Gaps = 22/259 (8%)

Query: 5   GDLVQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEI 64
              V  AR  F +      E R   +Q+   L EE ++ LA  +A +  K   E    E+
Sbjct: 38  EAAVAAARAAFPAWARLSLEERIAVVQRFAALLEERKEALARVIARETGKPLWET-RTEV 96

Query: 65  EFLANDVRNTLNHLKQWMTPEKPGKDIANMLDGV-YIYPDPYGVCLIIGAWNYPLQLS-- 121
             +   V  ++         E+ G+    M DG   +   P+GV  + G +N+P  L   
Sbjct: 97  ASMIGKVAISIKAYH-----ERTGESENPMPDGRAVLRHRPHGVVAVFGPYNFPGHLPNG 151

Query: 122 -LLPAAGAIAAGNVVILKPSEVAPASAKIMAELLPKY-LDNDTFQVVLGGVEETTELLKH 179
            ++PA   + AGN V+ KPSE+ P  A+   +L  K  L      +V G  E    L  H
Sbjct: 152 HIVPA---LLAGNTVVFKPSELTPWVAEETVKLWEKAGLPAGVLNLVQGARETGVALAAH 208

Query: 180 R-FDYIFYTGSTSVGKIV-RQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKC 237
              D + +TGS++ G ++ RQ A      + LE+GG +PL +D   +I+ AV   +    
Sbjct: 209 PDIDGLLFTGSSNTGHLLHRQFAGRPEKILALEMGGNNPLIVDEVADIDAAVHLIIQSAF 268

Query: 238 INAGQTCIAPDYILCSRQV 256
           I+AGQ C       C+R++
Sbjct: 269 ISAGQRCT------CARRL 281


>gnl|CDD|181873 PRK09457, astD, succinylglutamic semialdehyde dehydrogenase;
           Reviewed.
          Length = 487

 Score = 90.0 bits (224), Expect = 2e-19
 Identities = 77/260 (29%), Positives = 127/260 (48%), Gaps = 30/260 (11%)

Query: 8   VQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEFL 67
           V+ AR  F +     +E R+  +++   L EEN+++LA  +A +  K   EA   E+  +
Sbjct: 43  VRAARAAFPAWARLSFEERQAIVERFAALLEENKEELAEVIARETGKPLWEA-ATEVTAM 101

Query: 68  ANDVRNTLNHLKQWMTPEKPGKDIANMLDGV-YIYPDPYGVCLIIGAWNYPLQLS---LL 123
            N +  ++         E+ G+  + M DG   +   P+GV  + G +N+P  L    ++
Sbjct: 102 INKIAISIQAYH-----ERTGEKRSEMADGAAVLRHRPHGVVAVFGPYNFPGHLPNGHIV 156

Query: 124 PAAGAIAAGNVVILKPSEVAPASAKIMAEL-----LPKYLDNDTFQVVLGGVEETTELLK 178
           PA   + AGN V+ KPSE+ P  A++  +L     LP         +V GG E    L  
Sbjct: 157 PA---LLAGNTVVFKPSELTPWVAELTVKLWQQAGLPA----GVLNLVQGGRETGKALAA 209

Query: 179 HR-FDYIFYTGSTSVGKIV-RQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGK 236
           H   D + +TGS + G ++ RQ A +    + LE+GG +PL ID   +I+ AV   +   
Sbjct: 210 HPDIDGLLFTGSANTGYLLHRQFAGQPEKILALEMGGNNPLVIDEVADIDAAVHLIIQSA 269

Query: 237 CINAGQTCIAPDYILCSRQV 256
            I+AGQ C       C+R++
Sbjct: 270 FISAGQRCT------CARRL 283


>gnl|CDD|143443 cd07125, ALDH_PutA-P5CDH, Delta(1)-pyrroline-5-carboxylate
           dehydrogenase, PutA.  The proline catabolic enzymes of
           the aldehyde dehydrogenase (ALDH) protein superfamily,
           proline dehydrogenase and
           Delta(1)-pyrroline-5-carboxylate dehydrogenase (P5CDH,
           (EC=1.5.1.12 )), catalyze the two-step oxidation of
           proline to glutamate; P5CDH catalyzes the oxidation of
           glutamate semialdehyde, utilizing NAD+ as the electron
           acceptor. In some bacteria, the two enzymes are fused
           into the bifunctional flavoenzyme, proline utilization A
           (PutA) These enzymes play important roles in cellular
           redox control, superoxide generation, and apoptosis. In
           certain prokaryotes such as Escherichia coli, PutA is
           also a transcriptional repressor of the proline
           utilization genes.
          Length = 518

 Score = 88.4 bits (220), Expect = 9e-19
 Identities = 81/258 (31%), Positives = 116/258 (44%), Gaps = 32/258 (12%)

Query: 6   DLVQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQ-------E 58
             +  A   F    + P E R   L++   L E N+ +L    AA+  K          E
Sbjct: 73  AALAIAAAAFAGWSATPVEERAEILEKAADLLEANRGELIALAAAEAGKTLADADAEVRE 132

Query: 59  AVLFEIEFLANDVRNTLNHLKQWMTPEKPGKDIANMLDGVYIYPDPYGVCLIIGAWNYPL 118
           A+ F   + A   R   +       PE PG      L+G+ +     GV + I  WN+PL
Sbjct: 133 AIDF-CRYYAAQARELFSD------PELPGPT--GELNGLEL--HGRGVFVCISPWNFPL 181

Query: 119 QLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAELL-----PKYLDNDTFQVVLGGVEET 173
            +     A A+AAGN VI KP+E  P  A    ELL     P+    D  Q+V G  EE 
Sbjct: 182 AIFTGQIAAALAAGNTVIAKPAEQTPLIAARAVELLHEAGVPR----DVLQLVPGDGEEI 237

Query: 174 TE-LLKH-RFDYIFYTGSTSVGKIVRQAANEH---LTPVTLELGGKSPLYIDSSVNIELA 228
            E L+ H R D + +TGST   K++ +A  E    + P+  E GGK+ + +DS+   E A
Sbjct: 238 GEALVAHPRIDGVIFTGSTETAKLINRALAERDGPILPLIAETGGKNAMIVDSTALPEQA 297

Query: 229 VRRFLWGKCINAGQTCIA 246
           V+  +     +AGQ C A
Sbjct: 298 VKDVVQSAFGSAGQRCSA 315


>gnl|CDD|179543 PRK03137, PRK03137, 1-pyrroline-5-carboxylate dehydrogenase;
           Provisional.
          Length = 514

 Score = 86.1 bits (214), Expect = 5e-18
 Identities = 57/185 (30%), Positives = 80/185 (43%), Gaps = 26/185 (14%)

Query: 98  VYIYPDPYGVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAELLPKY 157
            YI   P GV ++I  WN+P  +       AI AGN V+LKP+   P  A    E+L + 
Sbjct: 168 FYI---PLGVGVVISPWNFPFAIMAGMTLAAIVAGNTVLLKPASDTPVIAAKFVEVLEEA 224

Query: 158 -LDNDTFQVVLGGVEETTELL----KHRFDYIFYTGSTSVG-KIVRQAA-----NEHLTP 206
            L       V G   E  + L    K RF  I +TGS  VG +I  +AA        L  
Sbjct: 225 GLPAGVVNFVPGSGSEVGDYLVDHPKTRF--ITFTGSREVGLRIYERAAKVQPGQIWLKR 282

Query: 207 VTLELGGKSPLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSR-----QVQAQIL 261
           V  E+GGK  + +D   +++LA    +      +GQ C A     CSR      V  ++L
Sbjct: 283 VIAEMGGKDAIVVDEDADLDLAAESIVASAFGFSGQKCSA-----CSRAIVHEDVYDEVL 337

Query: 262 NQAKA 266
            +   
Sbjct: 338 EKVVE 342


>gnl|CDD|132260 TIGR03216, OH_muco_semi_DH, 2-hydroxymuconic semialdehyde
           dehydrogenase.  Members of this protein family are
           2-hydroxymuconic semialdehyde dehydrogenase. Many
           aromatic compounds are catabolized by way of the
           catechol, via the meta-cleavage pathway, to pyruvate and
           acetyl-CoA. This enzyme performs the second of seven
           steps in that pathway for catechol degradation [Energy
           metabolism, Other].
          Length = 481

 Score = 84.4 bits (209), Expect = 2e-17
 Identities = 55/166 (33%), Positives = 81/166 (48%), Gaps = 18/166 (10%)

Query: 99  YIYPDPYGVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAEL----- 153
           Y    P GV  +I  WN PL L       A+A GN V++KPSE  P +A ++ E+     
Sbjct: 134 YAVRKPLGVVGVISPWNLPLLLMTWKVGPALACGNTVVVKPSEETPGTATLLGEVMNAVG 193

Query: 154 LPKYLDNDTFQVVLG-GVEETTELL-KHR-FDYIFYTGSTSVGKIVRQAANEHLTPVTLE 210
           +PK      + VV G G +   E L +H   D I +TG T  G  + +AA + + PV+ E
Sbjct: 194 VPK----GVYNVVHGFGPDSAGEFLTRHPGVDAITFTGETRTGSAIMKAAADGVKPVSFE 249

Query: 211 LGGKSPLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQV 256
           LGGK+   + +  + + AV   L    +N GQ C      L + +V
Sbjct: 250 LGGKNAAIVFADCDFDAAVAGILRSAFLNTGQVC------LGTERV 289


>gnl|CDD|130783 TIGR01722, MMSDH, methylmalonic acid semialdehyde dehydrogenase.
           Involved in valine catabolism,
           methylmalonate-semialdehyde dehydrogenase catalyzes the
           irreversible NAD+- and CoA-dependent oxidative
           decarboxylation of methylmalonate semialdehyde to
           propionyl-CoA. Methylmalonate-semialdehyde dehydrogenase
           has been characterized in both prokaryotes and
           eukaryotes, functioning as a mammalian tetramer and a
           bacterial homodimer. Although similar in monomeric
           molecular mass and enzymatic activity, the N-terminal
           sequence in P.aeruginosa does not correspond with the
           N-terminal sequence predicted for rat liver. Sequence
           homology to a variety of prokaryotic and eukaryotic
           aldehyde dehydrogenases places MMSDH in the aldehyde
           dehydrogenase (NAD+) superfamily (pfam00171), making
           MMSDH's CoA requirement unique among known ALDHs.
           Methylmalonate semialdehyde dehydrogenase is closely
           related to betaine aldehyde dehydrogenase,
           2-hydroxymuconic semialdehyde dehydrogenase, and class 1
           and 2 aldehyde dehydrogenase. In Bacillus, a highly
           homologous protein to methylmalonic acid semialdehyde
           dehydrogenase, groups out from the main MMSDH clade with
           Listeria and Sulfolobus. This Bacillus protein has been
           suggested to be located in an iol operon and/or involved
           in myo-inositol catabolism, converting malonic
           semialdehyde to acetyl CoA ad CO2. The preceeding
           enzymes responsible for valine catabolism are present in
           Bacillus, Listeria, and Sulfolobus [Energy metabolism,
           Amino acids and amines].
          Length = 477

 Score = 84.2 bits (208), Expect = 2e-17
 Identities = 64/244 (26%), Positives = 106/244 (43%), Gaps = 12/244 (4%)

Query: 8   VQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEA---VLFEI 64
           V +AR+TF +        R   L +   L +E++ ++A  + A+  K   +A   V   +
Sbjct: 44  VASARETFLTWGQTSLAQRTSVLLRYQALLKEHRDEIAELITAEHGKTHSDALGDVARGL 103

Query: 65  EFLANDVRNTLNHLKQWMTPEKPGKDIANMLDGVYIYPDPYGVCLIIGAWNYPLQLSLLP 124
           E +  +    +N L +  T  +    +      VY    P GVC  I  +N+P  + L  
Sbjct: 104 EVV--EHACGVNSLLKGETSTQVATRV-----DVYSIRQPLGVCAGITPFNFPAMIPLWM 156

Query: 125 AAGAIAAGNVVILKPSEVAPASAKIMAELLPKY-LDNDTFQVVLGGVEETTELLKH-RFD 182
              AIA GN  +LKPSE  P++A  +AEL  +    +    VV G  E    LL+H    
Sbjct: 157 FPIAIACGNTFVLKPSEKVPSAAVKLAELFSEAGAPDGVLNVVHGDKEAVDRLLEHPDVK 216

Query: 183 YIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKCINAGQ 242
            + + GST +G+ +    + H   V    G K+ + +    + + A    +      AGQ
Sbjct: 217 AVSFVGSTPIGRYIHTTGSAHGKRVQALGGAKNHMVVMPDADKDAAADALVGAAYGAAGQ 276

Query: 243 TCIA 246
            C+A
Sbjct: 277 RCMA 280


>gnl|CDD|183050 PRK11241, gabD, succinate-semialdehyde dehydrogenase I;
           Provisional.
          Length = 482

 Score = 82.6 bits (204), Expect = 6e-17
 Identities = 77/259 (29%), Positives = 119/259 (45%), Gaps = 30/259 (11%)

Query: 26  RRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEFLANDVRNTLNHLKQWMTPE 85
           R   L++   L  E+Q DLA  +  +  K   EA   EI + A+ +        +W   E
Sbjct: 72  RANILRRWFNLMMEHQDDLARLMTLEQGKPLAEAK-GEISYAASFI--------EWFAEE 122

Query: 86  KPGKDI-ANMLDG------VYIYPDPYGVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILK 138
             GK I  + + G      + +   P GV   I  WN+P  +    A  A+AAG  ++LK
Sbjct: 123 --GKRIYGDTIPGHQADKRLIVIKQPIGVTAAITPWNFPAAMITRKAGPALAAGCTMVLK 180

Query: 139 PSEVAPASAKIMAELLPKY-LDNDTFQVVLG-----GVEETTELLKHRFDYIFYTGSTSV 192
           P+   P SA  +AEL  +  +    F VV G     G E T+  L  +  +   TGST +
Sbjct: 181 PASQTPFSALALAELAIRAGIPAGVFNVVTGSAGAVGGELTSNPLVRKLSF---TGSTEI 237

Query: 193 GKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILC 252
           G+ + +   + +  V+LELGG +P  +    +++ AV   L  K  NAGQTC+  + +  
Sbjct: 238 GRQLMEQCAKDIKKVSLELGGNAPFIVFDDADLDKAVEGALASKFRNAGQTCVCANRLYV 297

Query: 253 SRQVQ---AQILNQAKAVL 268
              V    A+ L QA + L
Sbjct: 298 QDGVYDRFAEKLQQAVSKL 316


>gnl|CDD|181826 PRK09406, gabD1, succinic semialdehyde dehydrogenase; Reviewed.
          Length = 457

 Score = 82.1 bits (203), Expect = 1e-16
 Identities = 66/240 (27%), Positives = 98/240 (40%), Gaps = 8/240 (3%)

Query: 11  ARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEFLAND 70
           A   F   ++  +  R R       L E     +A  +  ++ K    A   E    A  
Sbjct: 32  AHARFRDYRTTTFAQRARWANAAADLLEAEADQVAALMTLEMGKTLASAKA-EALKCAKG 90

Query: 71  VRNTLNHLKQWMTPEKPGKDIANMLDGVYIYPDPYGVCLIIGAWNYPLQLSLLPAAGAIA 130
            R    H +  +  E P    A      Y+   P GV L +  WN+PL   +  AA A+ 
Sbjct: 91  FRYYAEHAEALLADE-PADAAAVGASRAYVRYQPLGVVLAVMPWNFPLWQVVRFAAPALM 149

Query: 131 AGNVVILKPSEVAPASAKIMAELLPKY-LDNDTFQVVL---GGVEETTELLKHRFDYIFY 186
           AGNV +LK +   P +A  +A+L  +    +  FQ +L   G VE    L   R      
Sbjct: 150 AGNVGLLKHASNVPQTALYLADLFRRAGFPDGCFQTLLVGSGAVEAI--LRDPRVAAATL 207

Query: 187 TGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKCINAGQTCIA 246
           TGS   G+ V   A + +    LELGG  P  +  S +++ A    +  +  N GQ+CIA
Sbjct: 208 TGSEPAGRAVAAIAGDEIKKTVLELGGSDPFIVMPSADLDRAAETAVTARVQNNGQSCIA 267


>gnl|CDD|132294 TIGR03250, PhnAcAld_DH, putative phosphonoacetaldehyde
           dehydrogenase.  This family of genes are members of the
           pfam00171 NAD-dependent aldehyde dehydrogenase family.
           These genes are observed in Ralstonia eutropha JMP134,
           Sinorhizobium meliloti 1021, Burkholderia mallei ATCC
           23344, Burkholderia thailandensis E264, Burkholderia
           cenocepacia AU 1054, Burkholderia pseudomallei K96243
           and 1710b, Burkholderia xenovorans LB400, Burkholderia
           sp. 383 and Polaromonas sp. JS666 in close proximity to
           the PhnW gene (TIGR02326) encoding 2-aminoethyl
           phosphonate aminotransferase (which generates
           phosphonoacetaldehyde) and PhnA (TIGR02335) encoding
           phosphonoacetate hydrolase (not to be confused with the
           alkylphosphonate utilization operon protein PhnA modeled
           by TIGR00686). Additionally, transporters believed to be
           specific for 2-aminoethyl phosphonate are often present.
           PhnW is, in other organisms, coupled with PhnX
           (TIGR01422) for the degradation of phosphonoacetaldehyde
           (GenProp0238), but PhnX is apparently absent in each of
           the organisms containing this aldehyde reductase. PhnA,
           characterized in a strain of Pseudomonas fluorescens
           that has not het been genome sequenced, is only rarely
           found outside of the PhnW and aldehyde dehydrogenase
           context. For instance in Rhodopseudomonas and Bordetella
           bronchiseptica, where it is adjacent to transporters
           presumably specific for the import of phosphonoacetate.
           It seems reasonably certain then, that this enzyme
           catalyzes the NAD-dependent oxidation of
           phosphonoacetaldehyde to phosphonoacetate, bridging the
           metabolic gap between PhnW and PhnA. We propose the name
           phosphonoacetaldehyde dehydrogenase and the gene symbol
           PhnY for this enzyme.
          Length = 472

 Score = 81.8 bits (202), Expect = 1e-16
 Identities = 52/163 (31%), Positives = 78/163 (47%), Gaps = 6/163 (3%)

Query: 98  VYIYPDPY-GVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAELLPK 156
           V+   +P  GV   I  +N+P+       A AIA  N +++KPSE  P SA  +A++L +
Sbjct: 132 VFTQREPLLGVISAITPFNHPMNQVAHKIAPAIATNNRMVVKPSEKTPLSALYLADILYE 191

Query: 157 Y-LDNDTFQVVLGGVEE-TTELLKHRF-DYIFYTGSTSVGKIVRQAANEHLTPVTLELGG 213
             L     QVV G   E   EL+ +   D + +TG  ++GK +  AA        LELGG
Sbjct: 192 AGLPPQMLQVVTGDPREIADELITNPHVDLVTFTGGVAIGKYI--AARAGYRRQVLELGG 249

Query: 214 KSPLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQV 256
             PL +    +++ A    + G   N+GQ C A   +L    V
Sbjct: 250 NDPLIVMEDADLDRAADLAVKGSYKNSGQRCTAVKRMLVQESV 292


>gnl|CDD|143403 cd07084, ALDH_KGSADH-like, ALDH subfamily: NAD(P)+-dependent
           alpha-ketoglutaric semialdehyde dehydrogenases and plant
           delta(1)-pyrroline-5-carboxylate dehydrogenase, ALDH
           family 12-like.  ALDH subfamily which includes the
           NAD(P)+-dependent, alpha-ketoglutaric semialdehyde
           dehydrogenases (KGSADH, EC 1.2.1.26); plant
           delta(1)-pyrroline-5-carboxylate dehydrogenase (P5CDH,
           EC=1.5.1.12 ), ALDH family 12; the N-terminal domain of
           the MaoC (monoamine oxidase C) dehydratase regulatory
           protein; and orthologs of MaoC, PaaZ and PaaN, which are
           putative ring-opening enzymes of the aerobic
           phenylacetic acid catabolic pathway.
          Length = 442

 Score = 81.5 bits (201), Expect = 1e-16
 Identities = 43/152 (28%), Positives = 69/152 (45%), Gaps = 6/152 (3%)

Query: 104 PYGVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAELLPK--YLDND 161
           PYG  L+IGA+N+PL + LL  AGA+A GN VI+KP        +IM  LL     L  +
Sbjct: 100 PYGPVLVIGAFNFPLWIPLLQLAGALAMGNPVIVKPHTAVSIVMQIMVRLLHYAGLLPPE 159

Query: 162 TFQVVLGGVEETTELLKH-RFDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYID 220
              ++ G  +    LL H     + +TGS+ V + +    +     + LEL G +   + 
Sbjct: 160 DVTLINGDGKTMQALLLHPNPKMVLFTGSSRVAEKLA--LDAKQARIYLELAGFNWKVLG 217

Query: 221 SSVN-IELAVRRFLWGKCINAGQTCIAPDYIL 251
                ++    + +      +GQ C A   + 
Sbjct: 218 PDAQAVDYVAWQCVQDMTACSGQKCTAQSMLF 249


>gnl|CDD|212007 TIGR04284, aldehy_Rv0768, aldehyde dehydrogenase, Rv0768 family.
           This family describes a branch of the aldehyde
           dehydrogenase (NAD) family (see pfam00171) that includes
           Rv0768 from Mycobacterium tuberculosis. All members of
           this family belong to species predicted to synthesize
           mycofactocin, suggesting that this enzyme or another
           upstream or downstream in the same pathway might be
           mycofactocin-dependent. However, the taxonomic range of
           this family is not nearly broad enough to make that
           relationship conclusive [Unknown function, Enzymes of
           unknown specificity].
          Length = 480

 Score = 81.0 bits (200), Expect = 3e-16
 Identities = 53/173 (30%), Positives = 81/173 (46%), Gaps = 4/173 (2%)

Query: 103 DPYGVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAELLPKYLD--N 160
           +  GV   I  WN+P Q++L     A+AAGN V+LKP+   P  A ++ EL+ ++ D   
Sbjct: 140 EAVGVVGAITPWNFPHQINLAKLGPALAAGNTVVLKPAPDTPWCAAVLGELIAEHTDFPP 199

Query: 161 DTFQVVLGGVEETTELLKH--RFDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLY 218
               +V         LL    R D + +TGST+ G+ V   A   L  V LELGGKS   
Sbjct: 200 GVVNIVTSSDHRLGALLAKDPRVDMVSFTGSTATGRAVMADAAATLKKVFLELGGKSAFI 259

Query: 219 IDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQILNQAKAVLDSW 271
           +    ++  A     +  C++AGQ C     ++  R    + +  A A + S 
Sbjct: 260 VLDDADLAAACSMAAFTVCMHAGQGCAITTRLVVPRARYDEAVAAAAATMGSI 312


>gnl|CDD|132417 TIGR03374, ABALDH, 1-pyrroline dehydrogenase.  Members of this
           protein family are 1-pyrroline dehydrogenase (1.5.1.35),
           also called gamma-aminobutyraldehyde dehydrogenase. This
           enzyme can follow putrescine transaminase (EC 2.6.1.82)
           for a two-step conversion of putrescine to
           gamma-aminobutyric acid (GABA). The member from
           Escherichia coli is characterized as a homotetramer that
           binds one NADH per momomer. This enzyme belongs to the
           medium-chain aldehyde dehydrogenases, and is quite
           similar in sequence to the betaine aldehyde
           dehydrogenase (EC 1.2.1.8) family.
          Length = 472

 Score = 80.8 bits (199), Expect = 3e-16
 Identities = 58/182 (31%), Positives = 82/182 (45%), Gaps = 5/182 (2%)

Query: 100 IYPDPYGVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAELLPKYLD 159
           I  DP GV   I  WNYPL ++    A A+AAGN V+LKPSE+ P +A  +AEL      
Sbjct: 133 IRRDPLGVVASIAPWNYPLMMAAWKLAPALAAGNCVVLKPSEITPLTALKLAELAKDIFP 192

Query: 160 NDTFQVVLG-GVEETTELLKH-RFDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPL 217
                ++ G G      L  H +   +  TGS + G+ +       +    +ELGGK+P+
Sbjct: 193 AGVVNILFGRGKTVGDPLTGHEKVRMVSLTGSIATGEHILSHTAPSIKRTHMELGGKAPV 252

Query: 218 YIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQI---LNQAKAVLDSWYTE 274
            +    +I+  V         NAGQ C A   I   R +   +   L  A A L S   +
Sbjct: 253 IVFDDADIDAVVEGVRTFGFYNAGQDCTAACRIYAQRGIYDTLVEKLGAAVATLKSGAPD 312

Query: 275 QE 276
            E
Sbjct: 313 DE 314


>gnl|CDD|143402 cd07083, ALDH_P5CDH, ALDH subfamily NAD+-dependent
           delta(1)-pyrroline-5-carboxylate dehydrogenase-like.
           ALDH subfamily of the NAD+-dependent,
           delta(1)-pyrroline-5-carboxylate dehydrogenases (P5CDH,
           EC=1.5.1.12). The proline catabolic enzymes, proline
           dehydrogenase and P5CDH catalyze the two-step oxidation
           of proline to glutamate.  P5CDH catalyzes the oxidation
           of glutamate semialdehyde, utilizing NAD+ as the
           electron acceptor. In some bacteria, the two enzymes are
           fused into the bifunctional flavoenzyme, proline
           utilization A (PutA). These enzymes play important roles
           in cellular redox control, superoxide generation, and
           apoptosis. In certain prokaryotes such as Escherichia
           coli, PutA is also a transcriptional repressor of the
           proline utilization genes. Monofunctional enzyme
           sequences such as those seen in the Bacillus RocA P5CDH
           are also present in this subfamily as well as the human
           ALDH4A1 P5CDH and the Drosophila Aldh17 P5CDH.
          Length = 500

 Score = 79.9 bits (197), Expect = 5e-16
 Identities = 69/275 (25%), Positives = 104/275 (37%), Gaps = 26/275 (9%)

Query: 8   VQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEA---VLFEI 64
           ++ A   F + K  P E R R L +   L    +++L   L  ++ K+  EA   V   I
Sbjct: 61  LEAAWAAFKTWKDWPQEDRARLLLKAADLLRRRRRELIATLTYEVGKNWVEAIDDVAEAI 120

Query: 65  EFLANDVRNTLNHL-KQWMTPEKPGKDIANMLDGVYIYPDPYGVCLIIGAWNYPLQLSLL 123
           +F+    R  L            PG+D  +   G+       G  ++I  WN+P+ +   
Sbjct: 121 DFIRYYARAALRLRYPAVEVVPYPGEDNESFYVGL-------GAGVVISPWNFPVAIFTG 173

Query: 124 PAAGAIAAGNVVILKPSEVAP-ASAKIMAELLPKYLDNDTFQVVLG-GVEETTELLKH-R 180
                +A GN VI KP+E A     K+              Q + G G E    L +H R
Sbjct: 174 MIVAPVAVGNTVIAKPAEDAVVVGYKVFEIFHEAGFPPGVVQFLPGVGEEVGAYLTEHER 233

Query: 181 FDYIFYTGSTSVGK-IVRQAANEH-----LTPVTLELGGKSPLYIDSSVNIELAVRRFLW 234
              I +TGS   GK I   AA           + +E GGK+ + +D + + EL V   + 
Sbjct: 234 IRGINFTGSLETGKKIYEAAARLAPGQTWFKRLYVETGGKNAIIVDETADFELVVEGVVV 293

Query: 235 GKCINAGQTCIAPDYILCSRQ-VQAQILNQAKAVL 268
                 GQ C A      SR  +           L
Sbjct: 294 SAFGFQGQKCSA-----ASRLILTQGAYEPVLERL 323


>gnl|CDD|177949 PLN02315, PLN02315, aldehyde dehydrogenase family 7 member.
          Length = 508

 Score = 72.9 bits (179), Expect = 8e-14
 Identities = 51/167 (30%), Positives = 78/167 (46%), Gaps = 6/167 (3%)

Query: 103 DPYGVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILKPSEVAP----ASAKIMAELLPKY- 157
           +P G+  +I A+N+P  +    A  A+  GN V+ K +   P    A  K++AE+L K  
Sbjct: 153 NPLGIVGVITAFNFPCAVLGWNACIALVCGNCVVWKGAPTTPLITIAMTKLVAEVLEKNN 212

Query: 158 LDNDTFQVVLGGVEETTELLKH-RFDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSP 216
           L    F    GG E    + K  R   + +TGS+ VG +V+Q  N       LEL G + 
Sbjct: 213 LPGAIFTSFCGGAEIGEAIAKDTRIPLVSFTGSSKVGLMVQQTVNARFGKCLLELSGNNA 272

Query: 217 LYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQILNQ 263
           + +    +I+LAVR  L+     AGQ C     +L    +   +L Q
Sbjct: 273 IIVMDDADIQLAVRSVLFAAVGTAGQRCTTCRRLLLHESIYDDVLEQ 319


>gnl|CDD|226683 COG4230, COG4230, Delta 1-pyrroline-5-carboxylate dehydrogenase
           [Energy production and conversion].
          Length = 769

 Score = 69.7 bits (171), Expect = 1e-12
 Identities = 69/257 (26%), Positives = 103/257 (40%), Gaps = 46/257 (17%)

Query: 8   VQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDL------------ANALAADLRKH 55
           ++ A        + P   R   L++   L E     L            +NA+A ++R  
Sbjct: 156 LEAAVAAAPIWSATPPAERAAILERAADLMEAQMPQLMGLLVREAGKTLSNAIA-EVR-- 212

Query: 56  KQEAVLFEIEFLANDVRNTLNHLKQWMTPEKPGKDIANMLDGVYIYPDPYGVCLIIGAWN 115
             EAV F + + A   R+T  +L                         P G  + I  WN
Sbjct: 213 --EAVDF-LRYYAGQARDTFGNLTH----------------------RPLGPVVCISPWN 247

Query: 116 YPLQLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAELLPKY-LDNDTFQVVLGGVEETT 174
           +PL +     A A+AAGN V+ KP+E  P  A     LL +  +     Q++ G  E   
Sbjct: 248 FPLAIFTGQIAAALAAGNSVLAKPAEQTPLIAAQAVRLLHEAGVPPGVLQLLPGRGETVG 307

Query: 175 ELL--KHRFDYIFYTGSTSVGKIV-RQAANEHLTPVTL--ELGGKSPLYIDSSVNIELAV 229
             L    R   + +TGST V +++ RQ A     P+ L  E GG++ + +DSS   E  V
Sbjct: 308 AALTADARVAGVMFTGSTEVARLIQRQLAKRQGRPIPLIAETGGQNAMIVDSSALAEQVV 367

Query: 230 RRFLWGKCINAGQTCIA 246
              L     +AGQ C A
Sbjct: 368 ADVLASAFDSAGQRCSA 384


>gnl|CDD|233325 TIGR01238, D1pyr5carbox3, delta-1-pyrroline-5-carboxylate
           dehydrogenase (PutA C-terminal domain).  This model
           represents one of several related branches of
           delta-1-pyrroline-5-carboxylate dehydrogenase. Members
           of this branch are the C-terminal domain of the PutA
           bifunctional proline dehydrogenase /
           delta-1-pyrroline-5-carboxylate dehydrogenase [Energy
           metabolism, Amino acids and amines].
          Length = 500

 Score = 68.8 bits (168), Expect = 2e-12
 Identities = 52/158 (32%), Positives = 77/158 (48%), Gaps = 7/158 (4%)

Query: 104 PYGVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAELLPKY-LDNDT 162
             GV + I  WN+PL +     + A+AAGN VI KP+E     A    EL+ +      T
Sbjct: 160 SRGVFVCISPWNFPLAIFTGQISAALAAGNTVIAKPAEQTSLIAYRAVELMQEAGFPAGT 219

Query: 163 FQVVLG-GVEETTELLKH-RFDYIFYTGSTSVGKIVRQA-ANEHLTPVTL--ELGGKSPL 217
            Q++ G G +    L    R   + +TGST V +++ Q  A     PV L  E GG++ +
Sbjct: 220 IQLLPGRGADVGAALTSDPRIAGVAFTGSTEVAQLINQTLAQREDAPVPLIAETGGQNAM 279

Query: 218 YIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQ 255
            +DS+   E  VR  L     +AGQ C A   +LC ++
Sbjct: 280 IVDSTALPEQVVRDVLRSAFDSAGQRCSALR-VLCVQE 316


>gnl|CDD|143466 cd07148, ALDH_RL0313, Uncharacterized ALDH ( RL0313) with
           similarity to Tortula ruralis aldehyde dehydrogenase
           ALDH21A1.  Uncharacterized aldehyde dehydrogenase (locus
           RL0313) with sequence similarity to the moss Tortula
           ruralis aldehyde dehydrogenase ALDH21A1 (RNP123)
           believed to play an important role in the detoxification
           of aldehydes generated in response to desiccation- and
           salinity-stress, and similar sequences are included in
           this CD.
          Length = 455

 Score = 64.7 bits (158), Expect = 4e-11
 Identities = 68/233 (29%), Positives = 105/233 (45%), Gaps = 16/233 (6%)

Query: 22  PYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEFLANDVRNTLNHLKQW 81
            +E R   L++L  L EE   +LA  +A +  K   +A + E+    + V    + L Q 
Sbjct: 43  AHE-RIAILERLADLMEERADELALLIAREGGKPLVDAKV-EVTRAIDGVELAADELGQL 100

Query: 82  MTPEKPGKDIANMLDG--VYIYPDPYGVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILKP 139
              E P   +     G   +   +P GV + I A+N+PL L +   A AIAAG  VI+KP
Sbjct: 101 GGREIP-MGLTPASAGRIAFTTREPIGVVVAISAFNHPLNLIVHQVAPAIAAGCPVIVKP 159

Query: 140 SEVAPASAKIMAELLPKY-LDNDTFQVVLGGVEETTELL--KHRFDYIFYTGSTSVGKIV 196
           +   P S     +LL +  L     Q V        E L    R  +  + GS  VG ++
Sbjct: 160 ALATPLSCLAFVDLLHEAGLPEGWCQAVPCE-NAVAEKLVTDPRVAFFSFIGSARVGWML 218

Query: 197 RQAANEHLTPVT---LELGGKSPLYIDSSVNIELAVRRFLWGKCINAGQTCIA 246
           R      L P T   LE GG +P+ +D S +++  +   + G   +AGQ C++
Sbjct: 219 RS----KLAPGTRCALEHGGAAPVIVDRSADLDAMIPPLVKGGFYHAGQVCVS 267


>gnl|CDD|184426 PRK13968, PRK13968, putative succinate semialdehyde dehydrogenase;
           Provisional.
          Length = 462

 Score = 64.5 bits (157), Expect = 4e-11
 Identities = 43/148 (29%), Positives = 66/148 (44%), Gaps = 8/148 (5%)

Query: 104 PYGVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAELLPKYLDNDTF 163
           P G  L I  WN+PL   +  A   + AGN  +LK +      A+++A++   + D    
Sbjct: 126 PLGTILAIMPWNFPLWQVMRGAVPILLAGNGYLLKHAPNVMGCAQLIAQV---FKDAGIP 182

Query: 164 QVVLGGVEETTELLKH-----RFDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLY 218
           Q V G +    + +       R   +  TGS   G  +   A   L    LELGG  P  
Sbjct: 183 QGVYGWLNADNDGVSQMINDSRIAAVTVTGSVRAGAAIGAQAGAALKKCVLELGGSDPFI 242

Query: 219 IDSSVNIELAVRRFLWGKCINAGQTCIA 246
           + +  ++ELAV+  + G+  N GQ C A
Sbjct: 243 VLNDADLELAVKAAVAGRYQNTGQVCAA 270


>gnl|CDD|166060 PLN02419, PLN02419, methylmalonate-semialdehyde dehydrogenase
           [acylating].
          Length = 604

 Score = 61.7 bits (149), Expect = 5e-10
 Identities = 63/273 (23%), Positives = 117/273 (42%), Gaps = 17/273 (6%)

Query: 4   FGDLVQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFE 63
           F   V  A+  F   ++ P   R+R + +   L  +N   LA  +  +  K  +++    
Sbjct: 153 FKAAVSAAKQAFPLWRNTPITTRQRVMLKFQELIRKNMDKLAMNITTEQGKTLKDS---- 208

Query: 64  IEFLANDVRNTLNHLKQ--WMTPEKPGKDIANMLDGVYIYP--DPYGVCLIIGAWNYPLQ 119
                 D+   L  ++    M   + G+ + N+ +GV  Y   +P GVC  I  +N+P  
Sbjct: 209 ----HGDIFRGLEVVEHACGMATLQMGEYLPNVSNGVDTYSIREPLGVCAGICPFNFPAM 264

Query: 120 LSLLPAAGAIAAGNVVILKPSEVAPASAKIMAELLPKYLDNDTFQVVLGGVEETTELLKH 179
           + L     A+  GN  ILKPSE  P ++ I+AEL  +    D    ++ G  +T   +  
Sbjct: 265 IPLWMFPVAVTCGNTFILKPSEKDPGASVILAELAMEAGLPDGVLNIVHGTNDTVNAICD 324

Query: 180 RFDY--IFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKC 237
             D   + + GS + G  +   A      +   +G K+   +    NI+  +   L    
Sbjct: 325 DEDIRAVSFVGSNTAGMHIYARAAAKGKRIQSNMGAKNHGLVLPDANIDATLNALLAAGF 384

Query: 238 INAGQTCIAPDYILC---SRQVQAQILNQAKAV 267
             AGQ C+A   ++    ++  + +++ +AKA+
Sbjct: 385 GAAGQRCMALSTVVFVGDAKSWEDKLVERAKAL 417


>gnl|CDD|215110 PLN00412, PLN00412, NADP-dependent glyceraldehyde-3-phosphate
           dehydrogenase; Provisional.
          Length = 496

 Score = 59.4 bits (144), Expect = 2e-09
 Identities = 50/170 (29%), Positives = 75/170 (44%), Gaps = 14/170 (8%)

Query: 104 PYGVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAEL-----LPKYL 158
           P GV L I  +NYP+ L++   A A+ AGN V+LKP      +A  M         PK L
Sbjct: 158 PLGVVLAIPPFNYPVNLAVSKIAPALIAGNAVVLKPPTQGAVAALHMVHCFHLAGFPKGL 217

Query: 159 DNDTFQVVLG-GVEETTELLKHR-FDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSP 216
                  V G G E    L  H   + I +TG  +   I ++A    + P+ +ELGGK  
Sbjct: 218 ----ISCVTGKGSEIGDFLTMHPGVNCISFTGGDTGIAISKKAG---MVPLQMELGGKDA 270

Query: 217 LYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQILNQAKA 266
             +    +++LA    + G    +GQ C A   +L    V   ++ +  A
Sbjct: 271 CIVLEDADLDLAAANIIKGGFSYSGQRCTAVKVVLVMESVADALVEKVNA 320


>gnl|CDD|143396 cd07077, ALDH-like, NAD(P)+-dependent aldehyde dehydrogenase-like
           (ALDH-like) family.  The aldehyde dehydrogenase-like
           (ALDH-like) group of the ALDH superfamily of
           NAD(P)+-dependent enzymes which, in general, oxidize a
           wide range of  endogenous and exogenous aliphatic and
           aromatic aldehydes to their corresponding carboxylic
           acids and play an  important role in detoxification.
           This group includes families ALDH18, ALDH19, and ALDH20
           and represents such proteins as gamma-glutamyl phosphate
           reductase, LuxC-like acyl-CoA reductase, and coenzyme A
           acylating aldehyde dehydrogenase.  All of these proteins
           have a conserved cysteine that aligns with the catalytic
           cysteine of the ALDH group.
          Length = 397

 Score = 57.6 bits (139), Expect = 5e-09
 Identities = 38/175 (21%), Positives = 61/175 (34%), Gaps = 10/175 (5%)

Query: 97  GVYIYPDPYGVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAELLPK 156
             Y+   P GV + I     PL   +  A   IA  N  I +P   AP + + +A L   
Sbjct: 93  ETYVRAFPIGVTMHILPSTNPL-SGITSALRGIATRNQCIFRPHPSAPFTNRALALLFQA 151

Query: 157 YLDNDTFQVVLG-----GVEETTELLKH-RFDYIFYTGSTSVGKIVRQAANEHLTPVTLE 210
                  ++++        E   ELL H + D I  TG         + +     PV   
Sbjct: 152 ADAAHGPKILVLYVPHPSDELAEELLSHPKIDLIVATGGRDAVDAAVKHSPH--IPVIGF 209

Query: 211 LGGKSPLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQILNQAK 265
             G SP+ +D + + E A       K  +    C +   +     V   +  + K
Sbjct: 210 GAGNSPVVVDETADEERASGSVHDSKFFD-QNACASEQNLYVVDDVLDPLYEEFK 263


>gnl|CDD|237018 PRK11905, PRK11905, bifunctional proline
           dehydrogenase/pyrroline-5-carboxylate dehydrogenase;
           Reviewed.
          Length = 1208

 Score = 58.3 bits (142), Expect = 6e-09
 Identities = 69/261 (26%), Positives = 107/261 (40%), Gaps = 54/261 (20%)

Query: 8   VQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDL------------ANALAADLRKH 55
           +  A+  F    + P   R   L++   L E +  +L            ANA+A ++R  
Sbjct: 596 LAAAQAAFPEWSATPAAERAAILERAADLMEAHMPELFALAVREAGKTLANAIA-EVR-- 652

Query: 56  KQEAVLFEIEFLANDVRNTLNHLKQWMTPEKPGKDIANMLDGVYIYPDPYGVCLIIGAWN 115
             EAV F + + A   R  LN         KP                  G  + I  WN
Sbjct: 653 --EAVDF-LRYYAAQARRLLN-----GPGHKP-----------------LGPVVCISPWN 687

Query: 116 YPLQLSLLPAAGAIAAGNVVILKPSEVAPASA----KIMAEL-LPKYLDNDTFQVVLGGV 170
           +PL +     A A+ AGN V+ KP+E  P  A    +++ E  +PK    D  Q++ G  
Sbjct: 688 FPLAIFTGQIAAALVAGNTVLAKPAEQTPLIAARAVRLLHEAGVPK----DALQLLPGDG 743

Query: 171 EETTELL--KHRFDYIFYTGSTSVGKIVRQAANEHLT-PVTL--ELGGKSPLYIDSSVNI 225
                 L    R   + +TGST V +++++   +    PV L  E GG++ + +DSS   
Sbjct: 744 RTVGAALVADPRIAGVMFTGSTEVARLIQRTLAKRSGPPVPLIAETGGQNAMIVDSSALP 803

Query: 226 ELAVRRFLWGKCINAGQTCIA 246
           E  V   +     +AGQ C A
Sbjct: 804 EQVVADVIASAFDSAGQRCSA 824



 Score = 29.1 bits (66), Expect = 7.6
 Identities = 15/75 (20%), Positives = 22/75 (29%), Gaps = 11/75 (14%)

Query: 125  AAGAIAAGNVVILKPSEVAPASAKIMAELLPKYLDNDTFQVVLGGVEETTELLKHRFDYI 184
             A A+A GNV ++       A+   +  L+                E         F   
Sbjct: 1096 LAAALATGNVAVVAADSGLAAALADLPGLV------AARIDWTQDWEAD-----DPFAGA 1144

Query: 185  FYTGSTSVGKIVRQA 199
               G     + VRQA
Sbjct: 1145 LLEGDAERARAVRQA 1159


>gnl|CDD|236989 PRK11809, putA, trifunctional transcriptional regulator/proline
           dehydrogenase/pyrroline-5-carboxylate dehydrogenase;
           Reviewed.
          Length = 1318

 Score = 57.7 bits (140), Expect = 1e-08
 Identities = 49/165 (29%), Positives = 71/165 (43%), Gaps = 35/165 (21%)

Query: 104 PYGVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILKPSE----VAPASAKIMAE------- 152
           P G  + I  WN+PL +     A A+AAGN V+ KP+E    +A  + +I+ E       
Sbjct: 768 PLGPVVCISPWNFPLAIFTGQVAAALAAGNSVLAKPAEQTPLIAAQAVRILLEAGVPAGV 827

Query: 153 --LLPKYLDNDTFQVVLGGVEET--TELLKH-RFDYIFYTGSTSVGKIV------RQAAN 201
             LLP             G  ET    L+   R   + +TGST V +++      R    
Sbjct: 828 VQLLP-------------GRGETVGAALVADARVRGVMFTGSTEVARLLQRNLAGRLDPQ 874

Query: 202 EHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKCINAGQTCIA 246
               P+  E GG++ + +DSS   E  V   L     +AGQ C A
Sbjct: 875 GRPIPLIAETGGQNAMIVDSSALTEQVVADVLASAFDSAGQRCSA 919


>gnl|CDD|131570 TIGR02518, EutH_ACDH, acetaldehyde dehydrogenase (acetylating). 
          Length = 488

 Score = 49.9 bits (119), Expect = 2e-06
 Identities = 54/268 (20%), Positives = 94/268 (35%), Gaps = 38/268 (14%)

Query: 8   VQNARDTFDSGKSKPYEFRRRQLQQ-------LVRLYEENQQDLANALAADLRKHKQEAV 60
           +Q  R+   S K    +      +Q       +V    EN   LA     +    K E  
Sbjct: 7   IQQVRNLIRSAKVAQKKLANMTQEQIDKIVKAIVDAAYENAVKLAKMANEETGFGKWEDK 66

Query: 61  LFEIEFLANDVRNTLNHLKQWMTPEKPGKDIANMLDGVYIYPDPYGVCLIIGAWNYPLQL 120
           + +  F A  V +++  +K      +      +    V     P GV   +     P   
Sbjct: 67  VIKNVFAATIVYDSIKDMKTIGILSE------DKEKKVIEIAVPVGVVAGLIPSTNPTST 120

Query: 121 SLLPAAGAIAAGNVVILKPSEVAPA----SAKIMAELLPKY---------LDNDTFQVVL 167
           ++     +I A N ++  P   A      + K+M +   +          +   T     
Sbjct: 121 AIYKTLISIKARNAIVFSPHPNAKKCIIETVKLMRKAAEEAGAPEGAIGCITVPT----- 175

Query: 168 GGVEETTELLKHR-FDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIE 226
             +E T EL+K++    I  TG    G+ + +AA    TP      G  P YI+ + N++
Sbjct: 176 --IEGTNELMKNKDTSLILATG----GEAMVKAAYSSGTPAIGVGPGNGPAYIERTANVK 229

Query: 227 LAVRRFLWGKCINAGQTCIAPDYILCSR 254
            AVR  +  K  + G  C +   I+   
Sbjct: 230 KAVRDIIDSKTFDNGTICASEQSIIVEE 257



 Score = 35.6 bits (82), Expect = 0.052
 Identities = 15/43 (34%), Positives = 25/43 (58%), Gaps = 2/43 (4%)

Query: 398 HSSGTIALGGDMDASDRPLYIDSSVNIELAVRRFLWGKCINAG 440
           +SSGT A+G  +   + P YI+ + N++ AVR  +  K  + G
Sbjct: 204 YSSGTPAIG--VGPGNGPAYIERTANVKKAVRDIIDSKTFDNG 244



 Score = 35.2 bits (81), Expect = 0.084
 Identities = 11/41 (26%), Positives = 20/41 (48%)

Query: 313 SAPLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSR 353
           + P YI+ + N++ AVR  +  K  + G  C +   I+   
Sbjct: 217 NGPAYIERTANVKKAVRDIIDSKTFDNGTICASEQSIIVEE 257


>gnl|CDD|143447 cd07129, ALDH_KGSADH, Alpha-Ketoglutaric Semialdehyde
           Dehydrogenase.  Alpha-Ketoglutaric Semialdehyde (KGSA)
           Dehydrogenase (KGSADH, EC 1.2.1.26) catalyzes the
           NAD(P)+-dependent conversion of KGSA to
           alpha-ketoglutarate. This CD contains such sequences as
           those seen in Azospirillum brasilense, KGSADH-II
           (D-glucarate/D-galactarate-inducible) and KGSADH-III
           (hydroxy-L-proline-inducible). Both show similar high
           substrate specificity for KGSA and different coenzyme
           specificity; KGSADH-II is NAD+-dependent and KGSADH-III
           is NADP+-dependent. Also included in this CD is the
           NADP(+)-dependent aldehyde dehydrogenase from Vibrio
           harveyi which catalyzes the oxidation of long-chain
           aliphatic aldehydes to acids.
          Length = 454

 Score = 49.5 bits (119), Expect = 3e-06
 Identities = 38/130 (29%), Positives = 59/130 (45%), Gaps = 17/130 (13%)

Query: 104 PYGVCLIIGAWNYPLQLSLLPAAG-----AIAAGNVVILKPSEVAPASAKIMAELLPKYL 158
           P G   + GA N+PL  S    AG     A+AAG  V++K     P +++++A  +   L
Sbjct: 105 PLGPVAVFGASNFPLAFS---VAGGDTASALAAGCPVVVKAHPAHPGTSELVARAIRAAL 161

Query: 159 D-----NDTFQVVLGGVEET-TELLKH-RFDYIFYTGSTSVGKIVRQAANEHLT--PVTL 209
                    F ++ GG  E    L+KH     + +TGS   G+ +  AA       P   
Sbjct: 162 RATGLPAGVFSLLQGGGREVGVALVKHPAIKAVGFTGSRRGGRALFDAAAARPEPIPFYA 221

Query: 210 ELGGKSPLYI 219
           ELG  +P++I
Sbjct: 222 ELGSVNPVFI 231


>gnl|CDD|237017 PRK11904, PRK11904, bifunctional proline
           dehydrogenase/pyrroline-5-carboxylate dehydrogenase;
           Reviewed.
          Length = 1038

 Score = 49.4 bits (119), Expect = 4e-06
 Identities = 50/151 (33%), Positives = 74/151 (49%), Gaps = 14/151 (9%)

Query: 106 GVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILKPSE----VAPASAKIMAEL-LPKYLDN 160
           GV + I  WN+PL + L   A A+AAGN VI KP+E    +A  + K++ E  +PK    
Sbjct: 686 GVFVCISPWNFPLAIFLGQVAAALAAGNTVIAKPAEQTPLIAAEAVKLLHEAGIPK---- 741

Query: 161 DTFQVVLG-GVEETTELLKH-RFDYIFYTGSTSVGKIVRQA-ANEHLTPVTL--ELGGKS 215
           D  Q++ G G      L    R   + +TGST   +I+ +  A      V L  E GG++
Sbjct: 742 DVLQLLPGDGATVGAALTADPRIAGVAFTGSTETARIINRTLAARDGPIVPLIAETGGQN 801

Query: 216 PLYIDSSVNIELAVRRFLWGKCINAGQTCIA 246
            + +DS+   E  V   +     +AGQ C A
Sbjct: 802 AMIVDSTALPEQVVDDVVTSAFRSAGQRCSA 832


>gnl|CDD|131331 TIGR02278, PaaN-DH, phenylacetic acid degradation protein paaN.
           This enzyme is proposed to act in the ring-opening step
           of phenylacetic acid degradation which follows ligation
           of the acid with coenzyme A (by PaaF) and hydroxylation
           by a multicomponent non-heme iron hydroxylase complex
           (PaaGHIJK). Gene symbols have been standardized in. This
           enzyme is related to aldehyde dehydrogenases and has
           domains which are members of the pfam00171 and pfam01575
           families. This family includes paaN genes from
           Pseudomonas, Sinorhizobium, Rhodopseudomonas,
           Escherichia, Deinococcus and Corynebacterium. Another
           homology family (TIGR02288) includes several other
           species.
          Length = 663

 Score = 46.7 bits (111), Expect = 2e-05
 Identities = 46/185 (24%), Positives = 75/185 (40%), Gaps = 18/185 (9%)

Query: 97  GVYIYPDPYGVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILKPS----EVAPASAKIMAE 152
           G +I     GV + I A+N+P+   L   A A  AG   + KP+     VA A  + M E
Sbjct: 137 GRHILTPKGGVAVQINAFNFPVWGLLEKFAPAFLAGVPTLAKPATPTAYVAEALVRTMVE 196

Query: 153 --LLPKYLDNDTFQVVLGGVEETTELLKHRFDYIFYTGSTSVGKIVRQAAN--EHLTPVT 208
             LLP+     + Q++ G   +  + L HR D + +TGS +    +R   N  E      
Sbjct: 197 SGLLPE----GSLQLICGSAGDLLDHLDHR-DVVAFTGSAATADRLRAHPNVLERGIRFN 251

Query: 209 LELGGKSPLYIDSSVN-----IELAVRRFLWGKCINAGQTCIAPDYILCSRQVQAQILNQ 263
            E    +   +           +L  +  +    I AGQ C A   ++  + +   +L  
Sbjct: 252 AEADSLNAAILGEDATPDEPEFDLFAQEIVRELTIKAGQKCTAIRRVIVPKALLEAVLKA 311

Query: 264 AKAVL 268
            +A L
Sbjct: 312 LQARL 316


>gnl|CDD|131341 TIGR02288, PaaN_2, phenylacetic acid degradation protein paaN.
           This enzyme is proposed to act in the ring-opening step
           of phenylacetic acid degradation which follows ligation
           of the acid with coenzyme A (by PaaF) and hydroxylation
           by a multicomponent non-heme iron hydroxylase complex
           (PaaGHIJK). Gene symbols have been standardized in. This
           enzyme is related to aldehyde dehydrogenases and has a
           domain which is a member of the pfam00171 family. This
           family includes sequences from Burkholderia, Bordetella,
           Streptomyces. Other PaaN enzymes are represented by a
           separate model, TIGR02278.
          Length = 551

 Score = 45.6 bits (108), Expect = 4e-05
 Identities = 45/186 (24%), Positives = 77/186 (41%), Gaps = 13/186 (6%)

Query: 77  HLKQWMTPEKPGKDIANMLDGVYIYPDPYGVCLIIGAWNYPLQLSLLPAAGAIAAGNVVI 136
               W  P+  GK+    L   +    P G+ L+IG   +P   +      ++A GN V+
Sbjct: 169 ETAVWEKPQ--GKNDPLKLKKRF-TIVPRGIALVIGCSTFPTWNTYPGLFASLATGNPVL 225

Query: 137 LKPSEVA--PA--SAKIMAELLPKY-LDND--TFQVVLGGVEETTELLKH-RFDYIFYTG 188
           +KP   A  P   + ++  E+L +   D +  T      G E    L        I +TG
Sbjct: 226 VKPHPGAILPLALTVQVAREVLGEAGFDPNLVTLAAFDPGHEAAQRLATDPAVRIIDFTG 285

Query: 189 STSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKCINAGQTCIAPD 248
           S + G+ + Q A +    V  E  G + + I+S+ + +  +R   +   + +GQ C    
Sbjct: 286 SNAFGQWLEQNARQAQ--VYTEKAGVNTVIIESTDDYKAMLRNLAFSLSLYSGQMCTTTQ 343

Query: 249 YILCSR 254
            IL  R
Sbjct: 344 AILVPR 349


>gnl|CDD|130164 TIGR01092, P5CS, delta l-pyrroline-5-carboxylate synthetase.  This
           protein contains a glutamate 5-kinase (ProB, EC
           2.7.2.11) region followed by a gamma-glutamyl phosphate
           reductase (ProA, EC 1.2.1.41) region [Amino acid
           biosynthesis, Glutamate family].
          Length = 715

 Score = 41.8 bits (98), Expect = 6e-04
 Identities = 39/141 (27%), Positives = 65/141 (46%), Gaps = 14/141 (9%)

Query: 104 PYGVCLIIGAWNYPLQLSLLP--AAGAIAAGNVVILKPSEVAPASAKIMAELLPKYLDND 161
           P GV LI+    +  +   L   A+ AI +GN ++LK  + A  S  I+ +++ + +   
Sbjct: 397 PIGVLLIV----FESRPDALVQIASLAIRSGNGLLLKGGKEAARSNAILHKVITEAIPIH 452

Query: 162 TFQVVLGGV---EETTELLK--HRFDYIFYTGSTSVGKIVRQAANEHLTPVTLELGGKSP 216
             + ++G V   EE  +LLK     D +   GS    K+V Q       PV     G   
Sbjct: 453 VGKKLIGLVTSREEIPDLLKLDDVIDLVIPRGSN---KLVSQIKKSTKIPVLGHADGICH 509

Query: 217 LYIDSSVNIELAVRRFLWGKC 237
           +Y+D S ++++A R     KC
Sbjct: 510 VYVDKSASVDMAKRIVRDAKC 530


>gnl|CDD|143445 cd07127, ALDH_PAD-PaaZ, Phenylacetic acid degradation proteins PaaZ
           (Escherichia coli) and PaaN (Pseudomonas putida)-like.
           Phenylacetic acid degradation (PAD) proteins PaaZ
           (Escherichia coli) and PaaN (Pseudomonas putida) are
           putative aromatic ring cleavage enzymes of the aerobic
           PA catabolic pathway. PaaZ mutants were defective for
           growth with PA as a sole carbon source due to
           interruption of the putative ring opening system.  This
           CD is limited to bacterial monofunctional enzymes.
          Length = 549

 Score = 41.3 bits (97), Expect = 0.001
 Identities = 46/166 (27%), Positives = 70/166 (42%), Gaps = 24/166 (14%)

Query: 104 PYGVCLIIG-----AWN-YP-LQLSLLPAAGAIAAGNVVILKPSEVA--PA--SAKIMAE 152
           P GV L+IG      WN YP L  SL       A GN VI+KP   A  P   + ++  E
Sbjct: 193 PRGVALVIGCSTFPTWNGYPGLFASL-------ATGNPVIVKPHPAAILPLAITVQVARE 245

Query: 153 LLPKY-LDNDTFQVVLGGVEE--TTELLKH-RFDYIFYTGSTSVGKIVRQAANEHLTPVT 208
           +L +   D +   +     EE     L        I +TGS + G  +   A +    V 
Sbjct: 246 VLAEAGFDPNLVTLAADTPEEPIAQTLATRPEVRIIDFTGSNAFGDWLEANARQAQ--VY 303

Query: 209 LELGGKSPLYIDSSVNIELAVRRFLWGKCINAGQTCIAPDYILCSR 254
            E  G + + +DS+ +++  +R   +   + +GQ C  P  I   R
Sbjct: 304 TEKAGVNTVVVDSTDDLKAMLRNLAFSLSLYSGQMCTTPQNIYVPR 349


>gnl|CDD|233324 TIGR01236, D1pyr5carbox1, delta-1-pyrroline-5-carboxylate
           dehydrogenase, group 1.  This model represents one of
           two related branches of delta-1-pyrroline-5-carboxylate
           dehydrogenase. The two branches are not as closely
           related to each other as some aldehyde dehydrogenases
           are to this branch, and separate models are built for
           this reason. The enzyme is the second of two in the
           degradation of proline to glutamate [Energy metabolism,
           Amino acids and amines].
          Length = 532

 Score = 39.8 bits (93), Expect = 0.003
 Identities = 40/141 (28%), Positives = 57/141 (40%), Gaps = 15/141 (10%)

Query: 123 LPAAGAIAAGNVVILKPSEVAPASAKIMAELLPKY-LDNDTFQVVLG-GVEETTELLKH- 179
           L  A A+  GN VI KPS+ A  S  ++  +L +  L       V G G   +  +L   
Sbjct: 188 LAGAPALM-GNTVIWKPSDTAMLSNYLVMRILEEAGLPPGVINFVPGDGPLVSDTVLADP 246

Query: 180 RFDYIFYTGSTSV-----GKIVRQAANEHLTP-VTLELGGKSPLYIDSSVNIELAVRRFL 233
               I +TGST        ++ +     H  P +  E GGK    +  S ++E  V   +
Sbjct: 247 DLAGIHFTGSTPTFKHLWKQVAQNLDRYHNFPRIVGETGGKDFHVVHPSADVEHVVSGTI 306

Query: 234 WGKCINAGQTCIAPDYILCSR 254
            G     GQ C A     CSR
Sbjct: 307 RGAFEYQGQKCSA-----CSR 322


>gnl|CDD|143400 cd07081, ALDH_F20_ACDH_EutE-like, Coenzyme A acylating aldehyde
           dehydrogenase (ACDH), Ethanolamine utilization protein
           EutE, and related proteins.  Coenzyme A acylating
           aldehyde dehydrogenase (ACDH), an NAD+ and CoA-dependent
           acetaldehyde dehydrogenase, acetylating (EC=1.2.1.10),
           functions as a single enzyme (such as the Ethanolamine
           utilization protein, EutE, in Salmonella typhimurium) or
           as part of a multifunctional enzyme to convert
           acetaldehyde into acetyl-CoA. The E. coli
           aldehyde-alcohol dehydrogenase includes the functional
           domains, alcohol dehydrogenase (ADH), ACDH, and
           pyruvate-formate-lyase deactivase; and the Entamoeba
           histolytica aldehyde-alcohol dehydrogenase 2 (ALDH20A1)
           includes the functional domains ADH and ACDH, and may be
           critical enzymes in the fermentative pathway.
          Length = 439

 Score = 39.2 bits (91), Expect = 0.005
 Identities = 38/163 (23%), Positives = 62/163 (38%), Gaps = 27/163 (16%)

Query: 97  GVYIYPDPYGV-CLIIGAWNYPLQLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAELL- 154
           G  I  +P GV   I  + N P    +  +  ++   N +I  P   A    +  A LL 
Sbjct: 88  GTLIIAEPIGVVASITPSTN-PTSTVIFKSLISLKTRNSIIFSPHPRAKKVTQRAATLLL 146

Query: 155 --------PK----YLDNDTFQVVLGGVEETTELLKH-RFDYIFYTGSTSVGKIVRQAAN 201
                   P+    ++DN +       +E    L+K      +  TG  +V K    AA 
Sbjct: 147 QAAVAAGAPENLIGWIDNPS-------IELAQRLMKFPGIGLLLATGGPAVVK----AAY 195

Query: 202 EHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGKCINAGQTC 244
               P      G +P+ ID + +I+ AV+  +  K  + G  C
Sbjct: 196 SSGKPAIGVGAGNTPVVIDETADIKRAVQSIVKSKTFDNGVIC 238


>gnl|CDD|143440 cd07122, ALDH_F20_ACDH, Coenzyme A acylating aldehyde dehydrogenase
           (ACDH), ALDH family 20-like.  Coenzyme A acylating
           aldehyde dehydrogenase (ACDH, EC=1.2.1.10), an NAD+ and
           CoA-dependent acetaldehyde dehydrogenase, functions as a
           single enzyme (such as the Ethanolamine utilization
           protein, EutE, in Salmonella typhimurium) or as part of
           a multifunctional enzyme to convert acetaldehyde into
           acetyl-CoA . The E. coli aldehyde-alcohol dehydrogenase
           includes the functional domains, alcohol dehydrogenase
           (ADH), ACDH, and pyruvate-formate-lyase deactivase; and
           the Entamoeba histolytica aldehyde-alcohol dehydrogenase
           2 (ALDH20A1) includes the functional domains ADH and
           ACDH and may be critical enzymes in the fermentative
           pathway.
          Length = 436

 Score = 38.2 bits (90), Expect = 0.008
 Identities = 35/119 (29%), Positives = 48/119 (40%), Gaps = 17/119 (14%)

Query: 128 AIAAGNVVILKPSEVAPAS----AKIMAELLPKY-LDNDTFQVV-LGGVEETTELLKHR- 180
           A+   N +I  P   A       AKIM E            Q +    +E T EL+KH  
Sbjct: 119 ALKTRNAIIFSPHPRAKKCSIEAAKIMREAAVAAGAPEGLIQWIEEPSIELTQELMKHPD 178

Query: 181 FDYIFYTGSTSVGKIVRQAANEHLTPVTLELG---GKSPLYIDSSVNIELAVRRFLWGK 236
            D I  TG   +   V+ AA     P    +G   G  P YID + +I+ AV+  +  K
Sbjct: 179 VDLILATGGPGM---VK-AAYSSGKPA---IGVGPGNVPAYIDETADIKRAVKDIILSK 230



 Score = 30.5 bits (70), Expect = 1.9
 Identities = 13/38 (34%), Positives = 21/38 (55%), Gaps = 2/38 (5%)

Query: 398 HSSGTIALGGDMDASDRPLYIDSSVNIELAVRRFLWGK 435
           +SSG  A+G      + P YID + +I+ AV+  +  K
Sbjct: 195 YSSGKPAIGVG--PGNVPAYIDETADIKRAVKDIILSK 230


>gnl|CDD|143441 cd07123, ALDH_F4-17_P5CDH, Delta(1)-pyrroline-5-carboxylate
           dehydrogenase, ALDH families 4 and 17.
           Delta(1)-pyrroline-5-carboxylate dehydrogenase
           (EC=1.5.1.12 ), families 4 and 17: a proline catabolic
           enzyme of the aldehyde dehydrogenase (ALDH) protein
           superfamily.  Delta(1)-pyrroline-5-carboxylate
           dehydrogenase (P5CDH), also known as ALDH4A1 in humans, 
           is a mitochondrial  homodimer involved in proline
           degradation and catalyzes the NAD + -dependent
           conversion of P5C to glutamate. This is a necessary step
           in the pathway interconnecting the urea and
           tricarboxylic acid cycles. The preferred substrate is
           glutamic gamma-semialdehyde, other substrates include
           succinic, glutaric and adipic semialdehydes. Also
           included in this CD is the Aldh17 Drosophila
           melanogaster (Q9VUC0) P5CDH and similar sequences.
          Length = 522

 Score = 37.2 bits (87), Expect = 0.017
 Identities = 43/167 (25%), Positives = 65/167 (38%), Gaps = 32/167 (19%)

Query: 123 LPAAGAIAAGNVVILKPSEVAPASA----KIMAEL-LPKYLDNDTFQVVLGGVEETTE-L 176
           L  A A+  GNVV+ KPS+ A  S     KI+ E  LP          V G      + +
Sbjct: 189 LAGAPALM-GNVVLWKPSDTAVLSNYLVYKILEEAGLPP----GVINFVPGDGPVVGDTV 243

Query: 177 LKHR-FDYIFYTGSTSVGK-IVRQAANEHLTPVTL-----ELGGKSPLYIDSSVNIELAV 229
           L       + +TGST   K + +Q         T      E GGK+   +  S +++  V
Sbjct: 244 LASPHLAGLHFTGSTPTFKSLWKQIGENLDRYRTYPRIVGETGGKNFHLVHPSADVDSLV 303

Query: 230 RRFLWGKCINAGQTCIAPDYILCSR---------QVQAQILNQAKAV 267
              + G     GQ C A      SR         +V+ ++L + K +
Sbjct: 304 TATVRGAFEYQGQKCSA-----ASRAYVPESLWPEVKERLLEELKEI 345


>gnl|CDD|223093 COG0014, ProA, Gamma-glutamyl phosphate reductase [Amino acid
           transport and metabolism].
          Length = 417

 Score = 36.8 bits (86), Expect = 0.020
 Identities = 59/269 (21%), Positives = 96/269 (35%), Gaps = 76/269 (28%)

Query: 6   DLVQNARDTFDSGKSKPYEFRRRQL----QQLVR----LYEENQQDLANALAADLRKHKQ 57
           +L + A+       +   E + R L      L      +   N +DLA A     R++  
Sbjct: 7   ELGKRAKAASRKLATLSTEEKNRALLAMADALEAARAEILAANAKDLAAA-----RENGL 61

Query: 58  EAVLFEIEFLANDVRNTLNHLKQWMTPEKPGKDIANMLDGV----YIYPDPYGVCLIIGA 113
              +             L+ L   +TP +    I  M DG+     + PDP G   +I  
Sbjct: 62  SEAM-------------LDRLA--LTPSR----IEAMADGLRQVAAL-PDPVGE--VIDG 99

Query: 114 WNYPLQLSLL----------------PAAGAIAA------GNVVILKPSEVAPASAKIMA 151
           W  P  L +                 P     AA      GN VIL+    A  S   + 
Sbjct: 100 WTLPNGLQIYRVRVPLGVIGVIYESRPNVTVDAAALCLKSGNAVILRGGSEAIHSNAAIV 159

Query: 152 ELLPKYL-----DNDTFQVVL-GGVEETTELLKHR--FDYIFYTGSTS-VGKIVRQAANE 202
           E++ + L       D  Q++     EE  ELL+     D +   G    + ++V  A   
Sbjct: 160 EVIQEALEKAGLPADAVQLIEDTDREEVLELLRLDGYIDLVIPRGGAGLIRRVVENA--- 216

Query: 203 HLTPVTLELG-GKSPLYIDSSVNIELAVR 230
             T   +E G G   +Y+D S +++ A++
Sbjct: 217 --TVPVIEHGVGNCHIYVDESADLDKALK 243


>gnl|CDD|143444 cd07126, ALDH_F12_P5CDH, Delta(1)-pyrroline-5-carboxylate
           dehydrogenase, ALDH family 12.
           Delta(1)-pyrroline-5-carboxylate dehydrogenase (P5CDH,
           EC=1.5.1.12), family 12: a proline catabolic enzyme of
           the aldehyde dehydrogenase (ALDH) protein superfamily.
           P5CDH is a mitochondrial enzyme involved in proline
           degradation and catalyzes the NAD + -dependent
           conversion of P5C to glutamate.  The P5CDH, ALDH12A1
           gene, in Arabidopsis, has been identified as an
           osmotic-stress-inducible ALDH gene. This CD contains
           both Viridiplantae and Alveolata P5CDH sequences.
          Length = 489

 Score = 35.9 bits (83), Expect = 0.040
 Identities = 15/35 (42%), Positives = 21/35 (60%)

Query: 104 PYGVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILK 138
           PYG   II  +N+PL++  L   GA+  GN  +LK
Sbjct: 142 PYGPVAIITPFNFPLEIPALQLMGALFMGNKPLLK 176


>gnl|CDD|143398 cd07079, ALDH_F18-19_ProA-GPR, Gamma-glutamyl phosphate reductase
           (GPR), aldehyde dehydrogenase families 18 and 19.
           Gamma-glutamyl phosphate reductase (GPR), a L-proline
           biosynthetic pathway (PBP) enzyme that catalyzes the
           NADPH dependent reduction of L-gamma-glutamyl
           5-phosphate into L-glutamate 5-semialdehyde and
           phosphate. The glutamate route of the PBP involves two
           enzymatic steps catalyzed by gamma-glutamyl kinase (GK,
           EC 2.7.2.11) and GPR (EC 1.2.1.41). These enzymes are
           fused into the bifunctional enzyme, ProA or
           delta(1)-pyrroline-5-carboxylate synthetase (P5CS) in
           plants and animals, whereas they are separate enzymes in
           bacteria and yeast. In humans, the P5CS (ALDH18A1), an
           inner mitochondrial membrane enzyme, is essential to the
           de novo synthesis of the amino acids proline and
           arginine. Tomato (Lycopersicon esculentum) has both the
           prokaryotic-like polycistronic operons encoding GK and
           GPR (PRO1, ALDH19) and the full-length, bifunctional
           P5CS (PRO2, ALDH18B1).
          Length = 406

 Score = 33.9 bits (79), Expect = 0.18
 Identities = 61/228 (26%), Positives = 85/228 (37%), Gaps = 66/228 (28%)

Query: 38  EENQQDLANALAADLRKHKQEAVLFEIEFLANDVRNTLNHLKQWMTPEKPGKDIANMLDG 97
           E N +DLA A  A L +   +             R  L       TPE+    I  M +G
Sbjct: 42  EANAKDLAAAREAGLSEALLD-------------RLLL-------TPER----IEAMAEG 77

Query: 98  ---VYIYPDPYGVCLIIGAWNYPLQLSL----------------LP-----AAG-AIAAG 132
              V   PDP G   ++  W  P  L +                 P     AA   + +G
Sbjct: 78  LRQVAALPDPVGE--VLRGWTLPNGLQIEKVRVPLGVIGIIYESRPNVTVDAAALCLKSG 135

Query: 133 NVVILKPSEVAPAS----AKIMAELLPKY-LDNDTFQVVLG-GVEETTELLKHRFDY--- 183
           N VIL+    A  S     +I+ E L +  L  D  Q++     E   ELLK   DY   
Sbjct: 136 NAVILRGGSEALHSNRALVEIIQEALEEAGLPEDAVQLIPDTDREAVQELLKLD-DYIDL 194

Query: 184 -IFYTGSTSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVR 230
            I   G+  +  +V  A      PV     G   +Y+D S ++E+AVR
Sbjct: 195 IIPRGGAGLIRFVVENA----TIPVIKHGDGNCHVYVDESADLEMAVR 238


>gnl|CDD|161862 TIGR00407, proA, gamma-glutamyl phosphate reductase.  The related
           model TIGR01092 describes a full-length fusion protein
           delta l-pyrroline-5-carboxylate synthetase that includes
           a gamma-glutamyl phosphate reductase region as described
           by this model. Alternate name: glutamate-5-semialdehyde
           dehydrogenase. The prosite motif begins at residue 332
           of the seed alignment although not all of the members of
           the family exactly obey the motif [Amino acid
           biosynthesis, Glutamate family].
          Length = 398

 Score = 33.6 bits (77), Expect = 0.23
 Identities = 49/222 (22%), Positives = 86/222 (38%), Gaps = 40/222 (18%)

Query: 31  QQLVRLYEENQQDLANA----LAADLRKHKQEAVLFE--IEFLANDVRNTLNHLKQWMTP 84
            Q   +   N +D+A A    LA  L       +L E  ++ +A+ V+     + +   P
Sbjct: 29  AQAPAILAANAKDIAVAKENGLADALLD---RLLLTEGRLKGIADGVK----DVIELADP 81

Query: 85  EKPGKDIANMLDGVYIYP--DPYGVCLII--GAWNYPLQLSLLPAAGAIAAGNVVILKPS 140
                D   +  G+ +     P GV  +I     N  + ++ L     +  GN VIL+  
Sbjct: 82  VGKVIDGRELDSGLTLERVRVPLGVLGVIYEARPNVTVDIASL----CLKTGNAVILRGG 137

Query: 141 EVAPASAKIMAELLPKYLDNDTFQVVL--GGV--------EETTELLK--HRFDYIFYTG 188
           + A  S K + E++   L     Q  L  G V        E  +ELL      D +   G
Sbjct: 138 KEAVRSNKALVEVIQDALA----QTGLPVGAVQLIETPSRELVSELLDLDEYIDLLIPRG 193

Query: 189 STSVGKIVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVR 230
              + ++++Q       PV     G   +Y+D S ++  A++
Sbjct: 194 GNGLVRLIKQ---TSTIPVLGHGDGICHIYLDESADLIKAIK 232


>gnl|CDD|227674 COG5384, Mpp10, U3 small nucleolar ribonucleoprotein component
           [Translation, ribosomal structure and biogenesis].
          Length = 569

 Score = 33.5 bits (76), Expect = 0.30
 Identities = 23/96 (23%), Positives = 39/96 (40%), Gaps = 4/96 (4%)

Query: 3   NFGDLVQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLF 62
            F D     R+        P EF   + + ++ L EE + DL   +       + +    
Sbjct: 354 TFDDHAYRIREEVTIADEIP-EFELLESKSILSLAEEYEGDLMQIVDESALSEELDKGHN 412

Query: 63  EIEFLANDVRNTLNHLKQWMTPEKPGKDIANMLDGV 98
           EI  L +++R  L+ LK   +  KPG   A  L+ +
Sbjct: 413 EIFVLDSEIRMVLDTLKSLHSVPKPG---AKSLEII 445


>gnl|CDD|179798 PRK04239, PRK04239, hypothetical protein; Provisional.
          Length = 110

 Score = 30.6 bits (70), Expect = 0.50
 Identities = 19/57 (33%), Positives = 30/57 (52%), Gaps = 3/57 (5%)

Query: 24 EFRRRQLQQLVRLYEENQQDLANALAADLRKHKQ-EAVLFEIEFLANDVRNTLNHLK 79
          E RRR+L++L +  +E QQ       A  +   Q +A+L +I  L  + R  LN +K
Sbjct: 5  EIRRRKLEELQKQAQEQQQAQEEQEEAQAQAEAQKQAILRQI--LTPEARERLNRIK 59


>gnl|CDD|153294 cd07610, FCH_F-BAR, The Extended FES-CIP4 Homology (FCH) or F-BAR
           (FCH and Bin/Amphiphysin/Rvs) domain, a dimerization
           module that binds and bends membranes.  F-BAR domains
           are dimerization modules that bind and bend membranes
           and are found in proteins involved in membrane dynamics
           and actin reorganization. F-BAR domain containing
           proteins, also known as Pombe Cdc15 homology (PCH)
           family proteins, include Fes and Fer tyrosine kinases,
           PACSINs/Syndapins, FCHO, PSTPIP, CIP4-like proteins and
           srGAPs. Many members also contain an SH3 domain and play
           roles in endocytosis. F-BAR domains form banana-shaped
           dimers with a positively-charged concave surface that
           binds to negatively-charged lipid membranes. They can
           induce membrane deformation in the form of long tubules.
           These tubules have diameters larger than those observed
           with N-BARs. The F-BAR domains of some members such as
           NOSTRIN and Rgd1 are important for the subcellular
           localization of the protein.
          Length = 191

 Score = 31.5 bits (72), Expect = 0.67
 Identities = 16/75 (21%), Positives = 36/75 (48%), Gaps = 5/75 (6%)

Query: 6   DLVQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIE 65
            L +  ++ +     K  E  R Q+++L     E +++  N + A+     QE     +E
Sbjct: 108 KLKKKLQELWAKLAKKADEEYREQVEKLNPAQSEYEEEKLNKIQAE-----QEREEERLE 162

Query: 66  FLANDVRNTLNHLKQ 80
            L ++++N +N +K+
Sbjct: 163 ILKDNLKNYINAIKE 177


>gnl|CDD|214856 smart00852, MoCF_biosynth, Probable molybdopterin binding domain.
           This domain is found a variety of proteins involved in
           biosynthesis of molybdopterin cofactor. The domain is
           presumed to bind molybdopterin. The structure of this
           domain is known, and it forms an alpha/beta structure.
           In the known structure of Gephyrin this domain mediates
           trimerisation.
          Length = 138

 Score = 30.2 bits (69), Expect = 1.1
 Identities = 18/77 (23%), Positives = 31/77 (40%), Gaps = 8/77 (10%)

Query: 136 ILKPSEVAPASAKIMAELLPKYLDNDTFQVVLGG---VEETTELLKH---RFDYIFYTGS 189
           +L   ++  ++  ++A LL +        VV+GG    E   E L+      D +  TG 
Sbjct: 9   LLSGGQIRDSNGPMLAALLRELGIEVVRVVVVGGPDDPEAIREALREALAEADVVITTGG 68

Query: 190 TSVGK--IVRQAANEHL 204
           T  G   +  +A  E  
Sbjct: 69  TGPGPDDLTPEALAELG 85


>gnl|CDD|143399 cd07080, ALDH_Acyl-CoA-Red_LuxC, Acyl-CoA reductase LuxC.  Acyl-CoA
           reductase, LuxC, (EC=1.2.1.50) is the fatty acid
           reductase enzyme responsible for synthesis of the
           aldehyde  substrate for the luminescent reaction
           catalyzed by luciferase. The fatty acid reductase, a
           luminescence-specific, multienzyme complex (LuxCDE),
           reduces myristic acid to generate the long chain fatty
           aldehyde required for the luciferase-catalyzed reaction
           resulting in the emission of blue-green light.
           Mutational studies of conserved cysteines of LuxC
           revealed that the cysteine which aligns with the
           catalytic cysteine conserved throughout the ALDH
           superfamily is the LuxC acylation site. This CD is
           composed of mainly bacterial sequences but also includes
           a few archaeal sequences similar to the Methanospirillum
           hungateiacyl acyl-CoA reductase RfbN.
          Length = 422

 Score = 31.1 bits (71), Expect = 1.2
 Identities = 29/121 (23%), Positives = 45/121 (37%), Gaps = 15/121 (12%)

Query: 94  MLDG-VYIYPDPY------GVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILKPSEVAP-- 144
           +LD  V      Y      G+ + I A N PL L +      +   NV +LK S   P  
Sbjct: 95  ILDEWVPPGRGGYIRAQPRGLVVHIIAGNVPL-LPVWSIVRGLLVKNVNLLKMSSSDPLT 153

Query: 145 --ASAKIMAELLPKYLDNDTFQVV---LGGVEETTELLKHRFDYIFYTGSTSVGKIVRQA 199
             A  + +A++ P +   D+  VV    G  E    +L      + + G  +V  I    
Sbjct: 154 ATALLRSLADVDPNHPLTDSISVVYWPGGDAELEERILASADAVVAWGGEEAVKAIRSLL 213

Query: 200 A 200
            
Sbjct: 214 P 214


>gnl|CDD|235928 PRK07079, PRK07079, hypothetical protein; Provisional.
          Length = 469

 Score = 30.7 bits (70), Expect = 1.9
 Identities = 20/70 (28%), Positives = 25/70 (35%), Gaps = 10/70 (14%)

Query: 6  DLVQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQ-QDLANALAADLRKHKQEAVL--- 61
            +  A   FDSG      F    L + V    E+Q  D A AL A L      A+    
Sbjct: 5  AAIARAAAYFDSG-----AFFAD-LARRVAYRTESQNPDRAPALRAYLTDEIAPALAALG 58

Query: 62 FEIEFLANDV 71
          F    + N V
Sbjct: 59 FTCRIVDNPV 68


>gnl|CDD|237016 PRK11903, PRK11903, aldehyde dehydrogenase; Provisional.
          Length = 521

 Score = 30.4 bits (69), Expect = 2.4
 Identities = 31/99 (31%), Positives = 45/99 (45%), Gaps = 13/99 (13%)

Query: 106 GVCLIIGAWNYPLQLSLLPAAGAIAAGNVVILKPSEVAPASAKIMAELLPKYLDNDTF-- 163
           GV L I A+N+P       AA A+ AG  VI+KP   A A+A +   ++   +       
Sbjct: 150 GVALFINAFNFPAWGLWEKAAPALLAGVPVIVKP---ATATAWLTQRMVKDVVAAGILPA 206

Query: 164 ---QVVLGGVEETTELLKH--RFDYIFYTGSTSVGKIVR 197
               VV G    +  LL H   FD + +TGS     ++R
Sbjct: 207 GALSVVCGS---SAGLLDHLQPFDVVSFTGSAETAAVLR 242


>gnl|CDD|219178 pfam06792, UPF0261, Uncharacterized protein family (UPF0261). 
          Length = 403

 Score = 29.8 bits (68), Expect = 3.8
 Identities = 19/68 (27%), Positives = 27/68 (39%), Gaps = 17/68 (25%)

Query: 1   MVNFGDLVQNARDTFDSGKSKPYEFRRRQLQQ------LVRLYEENQQDLANALAADLRK 54
           MVNFG     A DT       P  FR R+L +      L+R   E  + +   +A  L +
Sbjct: 282 MVNFG-----AWDTV------PERFRGRKLHEHNPQVTLMRTTPEENRAIGRWIAEKLNR 330

Query: 55  HKQEAVLF 62
            K    + 
Sbjct: 331 AKGPVRVL 338


>gnl|CDD|131168 TIGR02113, coaC_strep, phosphopantothenoylcysteine decarboxylase,
           streptococcal.  In most bacteria, a single bifunctional
           protein catalyses phosphopantothenoylcysteine
           decarboxylase and phosphopantothenate--cysteine ligase
           activities, sequential steps in coenzyme A biosynthesis
           (see TIGR00521). These activities reside in separate
           proteins encoded by tandem genes in some bacterial
           lineages. This model describes proteins from the genera
           Streptococcus and Enterococcus homologous to the
           N-terminal region of TIGR00521, corresponding to
           phosphopantothenoylcysteine decarboxylase activity
           [Biosynthesis of cofactors, prosthetic groups, and
           carriers, Pantothenate and coenzyme A].
          Length = 177

 Score = 29.0 bits (65), Expect = 4.0
 Identities = 15/44 (34%), Positives = 28/44 (63%), Gaps = 8/44 (18%)

Query: 195 IVRQAANEHLTPVTLELGGKSPLYID-------SSVN-IELAVR 230
           ++ QAA + +TP+TL++  K+P+++D         +N IELA +
Sbjct: 33  LMTQAATQFITPLTLQVLSKNPVHLDVMDEHDPKVINHIELAKK 76


>gnl|CDD|191268 pfam05405, Mt_ATP-synt_B, Mitochondrial ATP synthase B chain
           precursor (ATP-synt_B).  The Fo sector of the ATP
           synthase is a membrane bound complex which mediates
           proton transport. It is composed of nine different
           polypeptide subunits (a, b, c, d, e, f, g F6, A6L).
          Length = 163

 Score = 28.4 bits (64), Expect = 5.5
 Identities = 11/48 (22%), Positives = 20/48 (41%), Gaps = 1/48 (2%)

Query: 351 CSRQVQAQILNQAKAVLDSWY-TEQVQGSKHYCRIVSDKHFQRLKSLV 397
             R++QA +  + K+ LD+    E     +    +V+      LK L 
Sbjct: 99  FERELQAALYREIKSKLDTLVRKESSVRQREQDHLVNWVISSVLKELS 146


>gnl|CDD|132132 TIGR03088, stp2, sugar transferase, PEP-CTERM/EpsH1 system
           associated.  Members of this family include a match to
           the pfam00534 Glycosyl transferases group 1 domain.
           Nearly all are found in species that encode the
           PEP-CTERM/exosortase system predicted to act in protein
           sorting in a number of Gram-negative bacteria. In
           particular, these transferases are found proximal to a
           particular variant of exosortase, EpsH1, which appears
           to travel with a conserved group of genes summarized by
           Genome Property GenProp0652. The nature of the sugar
           transferase reaction catalyzed by members of this clade
           is unknown and may conceivably be variable with respect
           to substrate by species, but we hypothesize a conserved
           substrate.
          Length = 374

 Score = 28.9 bits (65), Expect = 5.7
 Identities = 13/46 (28%), Positives = 23/46 (50%), Gaps = 1/46 (2%)

Query: 9   QNARDTFD-SGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLR 53
           ++ RD FD  G +  Y + RR  + L+  Y    +DL + L   ++
Sbjct: 111 EHGRDVFDLDGSNWKYRWLRRLYRPLIHHYVAVSRDLEDWLRGPVK 156


>gnl|CDD|187565 cd05255, SQD1_like_SDR_e, UDP_sulfoquinovose_synthase (Arabidopsis
           thaliana SQD1 and related proteins), extended (e) SDRs. 
           Arabidopsis thaliana UDP-sulfoquinovose-synthase (
           SQD1), an extended SDR,  catalyzes the transfer of
           SO(3)(-) to UDP-glucose in the biosynthesis of plant
           sulfolipids. Members of this subgroup share the
           conserved SDR catalytic residues, and a partial match to
           the characteristic extended-SDR NAD-binding motif.
           Extended SDRs are distinct from classical SDRs. In
           addition to the Rossmann fold (alpha/beta folding
           pattern with a central beta-sheet) core region typical
           of all SDRs, extended SDRs have a less conserved
           C-terminal extension of approximately 100 amino acids.
           Extended SDRs are a diverse collection of proteins, and
           include isomerases, epimerases, oxidoreductases, and
           lyases; they typically have a TGXXGXXG cofactor binding
           motif. SDRs are a functionally diverse family of
           oxidoreductases that have a single domain with a
           structurally conserved Rossmann fold, an
           NAD(P)(H)-binding region, and a structurally diverse
           C-terminal region. Sequence identity between different
           SDR enzymes is typically in the 15-30% range; they
           catalyze a wide range of activities including the
           metabolism of steroids, cofactors, carbohydrates,
           lipids, aromatic compounds, and amino acids, and act in
           redox sensing. Classical SDRs have an TGXXX[AG]XG
           cofactor binding motif and a YXXXK active site motif,
           with the Tyr residue of the active site motif serving as
           a critical catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase numbering). In
           addition to the Tyr and Lys, there is often an upstream
           Ser and/or an Asn, contributing to the active site;
           while substrate binding is in the C-terminal region,
           which determines specificity. The standard reaction
           mechanism is a 4-pro-S hydride transfer and proton relay
           involving the conserved Tyr and Lys, a water molecule
           stabilized by Asn, and nicotinamide. Atypical SDRs
           generally lack the catalytic residues characteristic of
           the SDRs, and their glycine-rich NAD(P)-binding motif is
           often different from the forms normally seen in
           classical or extended SDRs. Complex (multidomain) SDRs
           such as ketoreductase domains of fatty acid synthase
           have a GGXGXXG NAD(P)-binding motif and an altered
           active site motif (YXXXN). Fungal type ketoacyl
           reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
          Length = 382

 Score = 28.9 bits (65), Expect = 6.7
 Identities = 26/73 (35%), Positives = 37/73 (50%), Gaps = 10/73 (13%)

Query: 165 VVLGGVEETT---ELLKHRFDYIFYTGSTSVGKIVRQAANEHLTPVTLE-LGGKSPLYI- 219
           VV G   E T   E L +RFDY    G T + +   QAA  H  P+T+   GG++  +I 
Sbjct: 210 VVYGTKTEETEADERLINRFDYDGVFG-TVLNRFCVQAAIGH--PLTVYGKGGQTRGFIS 266

Query: 220 --DSSVNIELAVR 230
             D+   +ELA+ 
Sbjct: 267 IRDTVQCLELALE 279


>gnl|CDD|152165 pfam11729, Capsid-VNN, nodavirus capsid protein.  The capsid or
           coat protein of this family is expressed in Nodaviridae,
           that are ssRNA positive-strand viruses, with no DNA
           stage. These viruses are the causative agents of viral
           nervous necrosis in marine fish.
          Length = 340

 Score = 28.8 bits (64), Expect = 6.7
 Identities = 14/38 (36%), Positives = 17/38 (44%)

Query: 413 DRPLYIDSSVNIELAVRRFLWGKCINAGQLTRGPGWDR 450
           DRPL ID S+      R   W    + G LTR  G+  
Sbjct: 262 DRPLRIDYSLGTGDVDRAQYWHFSKSGGNLTRPAGYQL 299


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 29.0 bits (65), Expect = 7.0
 Identities = 11/63 (17%), Positives = 28/63 (44%), Gaps = 1/63 (1%)

Query: 18  GKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEAVLFEIEFLANDVRNTLNH 77
              K  + +  +L+  +    E+ +DL  AL  +L++ K+   + E +     +   +  
Sbjct: 178 EVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQEEEE-LEQEIEA 236

Query: 78  LKQ 80
           L++
Sbjct: 237 LEE 239


>gnl|CDD|173685 cd05594, STKc_PKB_alpha, Catalytic domain of the Protein
           Serine/Threonine Kinase, Protein Kinase B alpha.
           Serine/Threonine Kinases (STKs), Protein Kinase B (PKB)
           or Akt subfamily, alpha (or Akt1) isoform, catalytic (c)
           domain. STKs catalyze the transfer of the
           gamma-phosphoryl group from ATP to serine/threonine
           residues on protein substrates. The PKB subfamily is
           part of a larger superfamily that includes the catalytic
           domains of other protein STKs, protein tyrosine kinases,
           RIO kinases, aminoglycoside phosphotransferase, choline
           kinase, and phosphoinositide 3-kinase. There are three
           PKB isoforms from different genes, PKB-alpha (or Akt1),
           PKB-beta (or Akt2), and PKB-gamma (or Akt3). PKB
           contains an N-terminal pleckstrin homology (PH) domain
           and a C-terminal catalytic domain. PKB-alpha is
           predominantly expressed in endothelial cells. It is
           critical for the regulation of angiogenesis and the
           maintenance of vascular integrity. It also plays a role
           in adipocyte differentiation. Mice deficient in
           PKB-alpha exhibit perinatal morbidity, growth
           retardation, reduction in body weight accompanied by
           reduced sizes of multiple organs, and enhanced apoptosis
           in some cell types. PKB-alpha activity has been reported
           to be frequently elevated in breast and prostate
           cancers. In some cancer cells, PKB-alpha may act as a
           suppressor of metastasis.
          Length = 325

 Score = 28.4 bits (63), Expect = 7.7
 Identities = 12/43 (27%), Positives = 25/43 (58%), Gaps = 4/43 (9%)

Query: 139 PSEVAPASAKIMAELLPKYLDNDTFQVVLGGVEETTELLKHRF 181
           P  ++P +  +++ LL K    D  Q + GG ++  E+++H+F
Sbjct: 216 PRTLSPEAKSLLSGLLKK----DPKQRLGGGPDDAKEIMQHKF 254


>gnl|CDD|218242 pfam04751, DUF615, Protein of unknown function (DUF615).  This
          family of bacterial proteins has no known function.
          Length = 153

 Score = 27.9 bits (63), Expect = 7.9
 Identities = 15/42 (35%), Positives = 23/42 (54%), Gaps = 2/42 (4%)

Query: 26 RRRQLQQLVRLY-EENQQDLANALAADLRKHKQEAVLF-EIE 65
          RRRQLQ + +L  EE+ + +  AL     K +QE     ++E
Sbjct: 54 RRRQLQYIGKLMREEDIEAIRAALDKLKNKSEQETARLHKLE 95


>gnl|CDD|237515 PRK13805, PRK13805, bifunctional acetaldehyde-CoA/alcohol
           dehydrogenase; Provisional.
          Length = 862

 Score = 29.0 bits (66), Expect = 7.9
 Identities = 15/31 (48%), Positives = 19/31 (61%), Gaps = 4/31 (12%)

Query: 399 SSGTIALG-GDMDASDRPLYIDSSVNIELAV 428
           SSG  ALG G   A + P YID + +I+ AV
Sbjct: 209 SSGKPALGVG---AGNVPAYIDKTADIKRAV 236


>gnl|CDD|185431 PTZ00077, PTZ00077, asparagine synthetase-like protein;
           Provisional.
          Length = 586

 Score = 28.9 bits (65), Expect = 7.9
 Identities = 15/40 (37%), Positives = 21/40 (52%), Gaps = 3/40 (7%)

Query: 245 IAPDYILCSR---QVQAQILNQAKAVLDSWYTEQEILPRQ 281
           I P Y +C+    Q++  IL +A   L+  Y   EIL RQ
Sbjct: 424 IDPKYKMCNAFEGQMEKYILRKAFEGLEKPYLPDEILWRQ 463


>gnl|CDD|131094 TIGR02039, CysD, sulfate adenylyltransferase, small subunit.
           Metabolic assimilation of sulfur from inorganic sulfate,
           requires sulfate activation by coupling to a nucleoside,
           for the production of high-energy nucleoside
           phosphosulfates. This pathway appears to be similar in
           all prokaryotic organisms. Activation is first achieved
           through sulfation of sulfate with ATP by sulfate
           adenylyltransferase (ATP sulfurylase) to produce
           5'-phosphosulfate (APS), coupled by GTP hydrolysis.
           Subsequently, APS is phosphorylated by an APS kinase to
           produce 3'-phosphoadenosine-5'-phosphosulfate (PAPS). In
           Escherichia coli, ATP sulfurylase is a heterodimer
           composed of two subunits encoded by cysD and cysN, with
           APS kinase encoded by cysC. These genes are located in a
           unidirectionally transcribed gene cluster, and have been
           shown to be required for the synthesis of
           sulfur-containing amino acids. Homologous to this E.coli
           activation pathway are nodPQH gene products found among
           members of the Rhizobiaceae family. These gene products
           have been shown to exhibit ATP sulfurase and APS kinase
           activity, yet are involved in Nod factor sulfation, and
           sulfation of other macromolecules [Central intermediary
           metabolism, Sulfur metabolism].
          Length = 294

 Score = 28.5 bits (64), Expect = 8.0
 Identities = 17/42 (40%), Positives = 21/42 (50%), Gaps = 5/42 (11%)

Query: 195 IVRQAANEHLTPVTLELGGKSPLYIDSSVNIELAVRRFLWGK 236
           I+R+ A E   PV L   GK     DSSV + LA + F  G 
Sbjct: 11  IIREVAAEFERPVMLYSIGK-----DSSVLLHLARKAFYPGP 47


>gnl|CDD|232861 TIGR00181, pepF, oligoendopeptidase F.  This family represents the
           oligoendopeptidase F clade of the family of larger M3 or
           thimet (for thiol-dependent metallopeptidase)
           oligopeptidase family. Lactococcus lactis PepF
           hydrolyzed peptides of 7 and 17 amino acids with fairly
           broad specificity. The homolog of lactococcal PepF in
           group B Streptococcus was named PepB (PMID:8757883),
           with the name difference reflecting a difference in
           species of origin rather activity; substrate profiles
           were quite similar. Differences in substrate specificity
           should be expected in other species. The gene is
           duplicated in Lactococcus lactis on the plasmid that
           bears it. A shortened second copy is found in Bacillus
           subtilis [Protein fate, Degradation of proteins,
           peptides, and glycopeptides].
          Length = 591

 Score = 28.4 bits (64), Expect = 9.2
 Identities = 9/52 (17%), Positives = 23/52 (44%)

Query: 8   VQNARDTFDSGKSKPYEFRRRQLQQLVRLYEENQQDLANALAADLRKHKQEA 59
           + N+       K+K  E R++  +   + Y +++   A     +++K+   A
Sbjct: 190 ITNSTYENFLQKNKDREIRKKAYESFYKAYRKHKNTFAALYYGNVQKNVFLA 241


>gnl|CDD|149916 pfam09002, DUF1887, Domain of unknown function (DUF1887).  This
           domain is found in a set of hypothetical bacterial
           proteins.
          Length = 379

 Score = 28.2 bits (63), Expect = 9.7
 Identities = 21/97 (21%), Positives = 34/97 (35%), Gaps = 12/97 (12%)

Query: 24  EFRRRQLQQLVRLYEENQQDLA------NALAADLRKHKQEAVLFEIEFLANDVRNTLNH 77
           E   R+L +L   +     +L       N LA   RK ++  V  E+        N LN 
Sbjct: 162 EELDRRLLELAERWASRALELGPGLATLNYLATTCRKEQRLDV--ELSEKQQGYLNELNL 219

Query: 78  LKQWMTPEKPGKDIANMLDGVYIYPDPYGVCLIIGAW 114
           L + +        +A    G+  + D      + G W
Sbjct: 220 LLEDLAET----GLATYEGGILTFLDEEARRFLNGEW 252


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.321    0.136    0.414 

Gapped
Lambda     K      H
   0.267   0.0785    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 24,807,643
Number of extensions: 2405335
Number of successful extensions: 2697
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2507
Number of HSP's successfully gapped: 228
Length of query: 484
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 383
Effective length of database: 6,457,848
Effective search space: 2473355784
Effective search space used: 2473355784
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 61 (27.2 bits)