RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy16526
         (422 letters)



>gnl|CDD|187547 cd05236, FAR-N_SDR_e, fatty acyl CoA reductases (FARs), extended
           (e) SDRs.  SDRs are Rossmann-fold NAD(P)H-binding
           proteins, many of which may function as fatty acyl CoA
           reductases (FAR), acting on medium and long chain fatty
           acids, and have been reported to be involved in diverse
           processes such as biosynthesis of insect pheromones,
           plant cuticular wax production, and mammalian wax
           biosynthesis. In Arabidopsis thaliana, proteins with
           this particular architecture have also been identified
           as the MALE STERILITY 2 (MS2) gene product, which is
           implicated in male gametogenesis. Mutations in MS2
           inhibit the synthesis of exine (sporopollenin),
           rendering plants unable to reduce pollen wall fatty
           acids to corresponding alcohols. This N-terminal domain
           shares the catalytic triad (but not the upstream Asn)
           and characteristic NADP-binding motif of the extended
           SDR family. Extended SDRs are distinct from classical
           SDRs. In addition to the Rossmann fold (alpha/beta
           folding pattern with a central beta-sheet) core region
           typical of all SDRs, extended SDRs have a less conserved
           C-terminal extension of approximately 100 amino acids.
           Extended SDRs are a diverse collection of proteins, and
           include isomerases, epimerases, oxidoreductases, and
           lyases; they typically have a TGXXGXXG cofactor binding
           motif. SDRs are a functionally diverse family of
           oxidoreductases that have a single domain with a
           structurally conserved Rossmann fold, an
           NAD(P)(H)-binding region, and a structurally diverse
           C-terminal region. Sequence identity between different
           SDR enzymes is typically in the 15-30% range; they
           catalyze a wide range of activities including the
           metabolism of steroids, cofactors, carbohydrates,
           lipids, aromatic compounds, and amino acids, and act in
           redox sensing. Classical SDRs have an TGXXX[AG]XG
           cofactor binding motif and a YXXXK active site motif,
           with the Tyr residue of the active site motif serving as
           a critical catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase numbering). In
           addition to the Tyr and Lys, there is often an upstream
           Ser and/or an Asn, contributing to the active site;
           while substrate binding is in the C-terminal region,
           which determines specificity. The standard reaction
           mechanism is a 4-pro-S hydride transfer and proton relay
           involving the conserved Tyr and Lys, a water molecule
           stabilized by Asn, and nicotinamide. Atypical SDRs
           generally lack the catalytic residues characteristic of
           the SDRs, and their glycine-rich NAD(P)-binding motif is
           often different from the forms normally seen in
           classical or extended SDRs. Complex (multidomain) SDRs
           such as ketoreductase domains of fatty acid synthase
           have a GGXGXXG NAD(P)-binding motif and an altered
           active site motif (YXXXN). Fungal type ketoacyl
           reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
          Length = 320

 Score =  168 bits (428), Expect = 7e-49
 Identities = 62/134 (46%), Positives = 87/134 (64%)

Query: 289 IDLLFDQLRQERSNELAKIKPILGDITEPELGISQNDQKILKQNVSVVFHSAATVKFDEA 348
            D LFD+ R       +KI PI GD++EP LG+S  D + L + V+++ H AATV FDE 
Sbjct: 51  KDKLFDRGRNLNPLFESKIVPIEGDLSEPNLGLSDEDLQTLIEEVNIIIHCAATVTFDER 110

Query: 349 LKLSVTINMLGTKRLVELCHEMTHLEALIHVSTAYCNCDREEVREIIYSPPYDPQKIIET 408
           L  +++IN+LGT RL+EL      L+A +HVSTAY N DR+ + E +Y PP DP+K+I+ 
Sbjct: 111 LDEALSINVLGTLRLLELAKRCKKLKAFVHVSTAYVNGDRQLIEEKVYPPPADPEKLIDI 170

Query: 409 MEWMDDSLVNTLTP 422
           +E MDD  +   TP
Sbjct: 171 LELMDDLELERATP 184



 Score =  141 bits (357), Expect = 1e-38
 Identities = 49/122 (40%), Positives = 74/122 (60%), Gaps = 5/122 (4%)

Query: 1   MLQNESGNLPVAIVRPSIVISSVNEPVAGWVDNYNGPTGIIAAAGKGFFRTMLCHENKVA 60
           ++  E GNLP+ IVRPSIV +++ EP  GW+DN+NGP G+  A GKG  RTM    N VA
Sbjct: 204 LVLKERGNLPLVIVRPSIVGATLKEPFPGWIDNFNGPDGLFLAYGKGILRTMNADPNAVA 263

Query: 61  DLVPVDIVINLMICAAWKTAVKYKNSNGAQGITVYNCCTGQRNPISWKQFVNYSFESMRQ 120
           D++PVD+V N ++ AA  + V+         + VY+C +   NP +W +      + +++
Sbjct: 264 DIIPVDVVANALLAAAAYSGVRKPRE-----LEVYHCGSSDVNPFTWGEAEELINQYLKK 318

Query: 121 NP 122
           NP
Sbjct: 319 NP 320


>gnl|CDD|176924 cd09071, FAR_C, C-terminal domain of fatty acyl CoA reductases.
           C-terminal domain of fatty acyl CoA reductases, a family
           of SDR-like proteins. SDRs or short-chain
           dehydrogenases/reductases are Rossmann-fold
           NAD(P)H-binding proteins. Many proteins in this FAR_C
           family may function as fatty acyl-CoA reductases (FARs),
           acting on medium and long chain fatty acids, and have
           been reported to be involved in diverse processes such
           as the biosynthesis of insect pheromones, plant
           cuticular wax production, and mammalian wax
           biosynthesis. In Arabidopsis thaliana, proteins with
           this particular architecture have also been identified
           as the MALE STERILITY 2 (MS2) gene product, which is
           implicated in male gametogenesis. Mutations in MS2
           inhibit the synthesis of exine (sporopollenin),
           rendering plants unable to reduce pollen wall fatty
           acids to corresponding alcohols. The function of this
           C-terminal domain is unclear.
          Length = 92

 Score =  131 bits (332), Expect = 1e-37
 Identities = 43/92 (46%), Positives = 60/92 (65%)

Query: 147 LLHRLPAHVLDLFSLLTGKKPFMVRIQNKLDKAAKCLEYFSTQEWRFLDDNVRELNASLS 206
            LH LPA++LDL   L G+KP ++++  K+ K    LEYF+T EWRF +DN R L   LS
Sbjct: 1   FLHLLPAYLLDLLLRLLGRKPRLLKLYRKIHKLLDLLEYFTTNEWRFDNDNTRALWERLS 60

Query: 207 LEDRRVFSFDVTEIDWPKYIANYVLGIRTFIF 238
            EDR +F+FD+  IDW  Y  NY+ G+R ++ 
Sbjct: 61  EEDRELFNFDIRSIDWDDYFENYIPGLRKYLL 92


>gnl|CDD|111859 pfam03015, Sterile, Male sterility protein.  This family represents
           the C-terminal region of the male sterility protein in a
           number of arabidopsis and drosophila. A sequence-related
           jojoba acyl CoA reductase is also included.
          Length = 94

 Score =  108 bits (272), Expect = 5e-29
 Identities = 39/94 (41%), Positives = 59/94 (62%)

Query: 147 LLHRLPAHVLDLFSLLTGKKPFMVRIQNKLDKAAKCLEYFSTQEWRFLDDNVRELNASLS 206
             H LPA+ LDL   L G+KP +V++  K+ K  + L+ FS  EW F + N REL   +S
Sbjct: 1   FYHTLPAYFLDLLLRLYGQKPRLVKLYRKIHKGLEVLQPFSLNEWIFDNKNTRELREKMS 60

Query: 207 LEDRRVFSFDVTEIDWPKYIANYVLGIRTFIFKE 240
            ED+++F+FD+  +DW +Y  N + GIR ++ KE
Sbjct: 61  EEDKKLFNFDMESLDWDEYFRNAIRGIRKYLLKE 94


>gnl|CDD|219687 pfam07993, NAD_binding_4, Male sterility protein.  This family
           represents the C-terminal region of the male sterility
           protein in a number of arabidopsis and drosophila. A
           sequence-related jojoba acyl CoA reductase is also
           included.
          Length = 245

 Score =  111 bits (279), Expect = 3e-28
 Identities = 46/138 (33%), Positives = 66/138 (47%), Gaps = 9/138 (6%)

Query: 252 LYKMLWIHRLSKLLMILLVWRLLMLRSSVARSSWHLFIDLLFDQLRQERSNELAKIKPIL 311
           L K+L      K+    LV      R+    S+       L      +R   L +I P+ 
Sbjct: 13  LEKLLRSTPEVKI--YCLV------RAKDGESALERLRQELLKYGLFDRLKALERIIPVA 64

Query: 312 GDITEPELGISQNDQKILKQNVSVVFHSAATVKFDEALKLSVTINMLGTKRLVELCHEMT 371
           GD++EP LG+S  D + L + V V+ H+AATV F E        N+LGT+ ++ L  +M 
Sbjct: 65  GDLSEPNLGLSDEDFQELAEEVDVIIHNAATVNFVEPYSDLRATNVLGTREVLRLAKQMK 124

Query: 372 HLEALIHVSTAYCNCDRE 389
            L    HVSTAY N +R 
Sbjct: 125 KL-PFHHVSTAYVNGERG 141



 Score = 80.0 bits (198), Expect = 5e-17
 Identities = 27/73 (36%), Positives = 39/73 (53%), Gaps = 4/73 (5%)

Query: 1   MLQNESGNLPVAIVRPSIVISSVNEPVAGWVDN-YNGPTGIIAAAGKGFFRTMLCHENKV 59
           +++  +G LPV I RPSI+     E   GW++    GP G++  AG G    +L   +  
Sbjct: 176 LVREAAGGLPVVIYRPSIITG---ESRTGWINGDDFGPRGLLGGAGLGVLPDILGDPDAR 232

Query: 60  ADLVPVDIVINLM 72
            DLVPVD V N +
Sbjct: 233 LDLVPVDYVANAI 245


>gnl|CDD|215538 PLN02996, PLN02996, fatty acyl-CoA reductase.
          Length = 491

 Score = 84.0 bits (208), Expect = 2e-17
 Identities = 62/261 (23%), Positives = 111/261 (42%), Gaps = 37/261 (14%)

Query: 1   MLQNESGNLPVAIVRPSIVISSVNEPVAGWVDNYNGPTGIIAAAGKGFFRTMLCHENKVA 60
           +L N   NLP+ I+RP+++ S+  EP  GW++       +I   GKG     L   N V 
Sbjct: 246 LLGNFKENLPLVIIRPTMITSTYKEPFPGWIEGLRTIDSVIVGYGKGKLTCFLADPNSVL 305

Query: 61  DLVPVDIVINLMICAAWKTAVKYKNSNGAQGITVYNCCTGQRNPISWKQFVNYSFESMRQ 120
           D++P D+V+N MI A    A       G     +Y+  +  +NP+ +    ++++    +
Sbjct: 306 DVIPADMVVNAMIVAMAAHA------GGQGSEIIYHVGSSLKNPVKFSNLHDFAYRYFSK 359

Query: 121 NPLSHITWYPDGQ---CRSNPISNAMCVF-----LLHRLPAHVLDLFSLLTGKKPFMVRI 172
           NP  +     +G         I + M  F     + + LP   L L +++  K     R 
Sbjct: 360 NPWIN----KEGSPVKVGKGTILSTMASFSLYMTIRYLLPLKALQLVNIILPK-----RY 410

Query: 173 QNK---LDKAAKCL-------EYFSTQEWRFLDDNVRELNASLSLEDRR---VFSFDVTE 219
            +K   L++  K +       + +   +  F D N  +L        +    +F FD   
Sbjct: 411 GDKYTDLNRKIKLVMRLVDLYKPYVFFKGIFDDTNTEKLRIKRKETGKEEADMFDFDPKS 470

Query: 220 IDWPKYIAN-YVLGIRTFIFK 239
           IDW  Y+ N ++ G+  ++ K
Sbjct: 471 IDWEDYMTNVHIPGLVKYVLK 491



 Score = 54.3 bits (131), Expect = 6e-08
 Identities = 38/122 (31%), Positives = 67/122 (54%), Gaps = 13/122 (10%)

Query: 274 LMLRSSVARSSWHLFID-----LLFDQLRQERSNEL-----AKIKPILGDITEPELGISQ 323
           L+LR+S A+S+     D      LF  LR++    L      K+ P+ GDI+  +LG+  
Sbjct: 43  LLLRASDAKSATQRLHDEVIGKDLFKVLREKLGENLNSLISEKVTPVPGDISYDDLGVKD 102

Query: 324 ND--QKILKQNVSVVFHSAATVKFDEALKLSVTINMLGTKRLVELCHEMTHLEALIHVST 381
           ++  +++ K+ + +V + AAT  FDE   +++ IN LG   ++    +   ++ L+HVST
Sbjct: 103 SNLREEMWKE-IDIVVNLAATTNFDERYDVALGINTLGALNVLNFAKKCVKVKMLLHVST 161

Query: 382 AY 383
           AY
Sbjct: 162 AY 163


>gnl|CDD|215279 PLN02503, PLN02503, fatty acyl-CoA reductase 2.
          Length = 605

 Score = 82.6 bits (204), Expect = 7e-17
 Identities = 64/258 (24%), Positives = 111/258 (43%), Gaps = 34/258 (13%)

Query: 1   MLQNESGNLPVAIVRPSIVISSVNEPVAGWVDNYNGPTGIIAAAGKGFFRTMLCHENKVA 60
           ++ +  G++PV I+RPS++ S+  +P  GW++       I+   GKG     L   N V 
Sbjct: 360 VINSMRGDIPVVIIRPSVIESTWKDPFPGWMEGNRMMDPIVLYYGKGQLTGFLADPNGVL 419

Query: 61  DLVPVDIVINLMICA-AWKTAVKYKNSNGAQGITVYNCCTGQRNPISWKQFVNYSFESMR 119
           D+VP D+V+N  + A A               I VY   +   NP+ ++      +E   
Sbjct: 420 DVVPADMVVNATLAAMAK------HGGAAKPEINVYQIASSVVNPLVFQDLARLLYE--- 470

Query: 120 QNPLSHITWYPDGQCRSNPIS-------NAMCVFLLHRLPAHVLD---LFSLLTGKKPFM 169
                H    P    +  PI        ++M  F  H L    L    L  + +  +   
Sbjct: 471 -----HYKSSPYMDSKGRPIHVPPMKLFSSMEDFSSH-LWRDALLRSGLAGMSSSDRKLS 524

Query: 170 VRIQNKLDKAAKCLEYFST--QEW-----RFLDDNVRELNASLSLEDRRVFSFDVTEIDW 222
            +++N   K+ +  +Y ++  + +     RF + N + L   +S E++  F FDV  IDW
Sbjct: 525 QKLENICAKSVEQAKYLASIYEPYTFYGGRFDNSNTQRLMERMSEEEKAEFGFDVGSIDW 584

Query: 223 PKYIAN-YVLGIRTFIFK 239
             YI N ++ G+R  + K
Sbjct: 585 RDYITNVHIPGLRRHVMK 602



 Score = 60.6 bits (147), Expect = 6e-10
 Identities = 31/103 (30%), Positives = 54/103 (52%), Gaps = 5/103 (4%)

Query: 292 LFDQLRQE-----RSNELAKIKPILGDITEPELGISQNDQKILKQNVSVVFHSAATVKFD 346
           LF  L++      +S  L+K+ P++G++ E  LG+  +    + + V V+ +SAA   FD
Sbjct: 174 LFKCLQETHGKSYQSFMLSKLVPVVGNVCESNLGLEPDLADEIAKEVDVIINSAANTTFD 233

Query: 347 EALKLSVTINMLGTKRLVELCHEMTHLEALIHVSTAYCNCDRE 389
           E   +++ IN  G   L+    +   L+  + VSTAY N  R+
Sbjct: 234 ERYDVAIDINTRGPCHLMSFAKKCKKLKLFLQVSTAYVNGQRQ 276


>gnl|CDD|187573 cd05263, MupV_like_SDR_e, Pseudomonas fluorescens MupV-like,
           extended (e) SDRs.  This subgroup of extended SDR family
           domains have the characteristic active site tetrad and a
           well-conserved NAD(P)-binding motif. This subgroup is
           not well characterized, its members are annotated as
           having a variety of putative functions. One
           characterized member is Pseudomonas fluorescens MupV a
           protein  involved in the biosynthesis of Mupirocin, a
           polyketide-derived antibiotic. Extended SDRs are
           distinct from classical SDRs. In addition to the
           Rossmann fold (alpha/beta folding pattern with a central
           beta-sheet) core region typical of all SDRs, extended
           SDRs have a less conserved C-terminal extension of
           approximately 100 amino acids. Extended SDRs are a
           diverse collection of proteins, and include isomerases,
           epimerases, oxidoreductases, and lyases; they typically
           have a TGXXGXXG cofactor binding motif. SDRs are a
           functionally diverse family of oxidoreductases that have
           a single domain with a structurally conserved Rossmann
           fold, an NAD(P)(H)-binding region, and a structurally
           diverse C-terminal region. Sequence identity between
           different SDR enzymes is typically in the 15-30% range;
           they catalyze a wide range of activities including the
           metabolism of steroids, cofactors, carbohydrates,
           lipids, aromatic compounds, and amino acids, and act in
           redox sensing. Classical SDRs have an TGXXX[AG]XG
           cofactor binding motif and a YXXXK active site motif,
           with the Tyr residue of the active site motif serving as
           a critical catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase numbering). In
           addition to the Tyr and Lys, there is often an upstream
           Ser and/or an Asn, contributing to the active site;
           while substrate binding is in the C-terminal region,
           which determines specificity. The standard reaction
           mechanism is a 4-pro-S hydride transfer and proton relay
           involving the conserved Tyr and Lys, a water molecule
           stabilized by Asn, and nicotinamide. Atypical SDRs
           generally lack the catalytic residues characteristic of
           the SDRs, and their glycine-rich NAD(P)-binding motif is
           often different from the forms normally seen in
           classical or extended SDRs. Complex (multidomain) SDRs
           such as ketoreductase domains of fatty acid synthase
           have a GGXGXXG NAD(P)-binding motif and an altered
           active site motif (YXXXN). Fungal type ketoacyl
           reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
          Length = 293

 Score = 69.7 bits (171), Expect = 2e-13
 Identities = 30/105 (28%), Positives = 52/105 (49%), Gaps = 2/105 (1%)

Query: 290 DLLFDQLRQERSNELAKIKPILGDITEPELGISQNDQKILKQNVSVVFHSAATVKFDEAL 349
                +  +E   E  +++ + GD+T+P LG+S    + L   V  V H AA+  F    
Sbjct: 34  LGEAHERIEEAGLEADRVRVLEGDLTQPNLGLSAAASRELAGKVDHVIHCAASYDFQAPN 93

Query: 350 KLSVTINMLGTKRLVELCHEMTHLEALIHVSTAYCNCDREE-VRE 393
           + +   N+ GT+ ++EL      ++   +VSTAY   +RE  +RE
Sbjct: 94  EDAWRTNIDGTEHVLELAAR-LDIQRFHYVSTAYVAGNREGNIRE 137



 Score = 32.3 bits (74), Expect = 0.38
 Identities = 18/102 (17%), Positives = 42/102 (41%), Gaps = 13/102 (12%)

Query: 1   MLQNESGNLPVAIVRPSIVISSVNEPVAGWVDNYNGPTGII-AAAGKGFFRTMLCHENKV 59
           +++  +  +P+ + RPSIV   V +   G ++  +G   ++   A  G +  M  ++   
Sbjct: 160 LVRAAATQIPLTVYRPSIV---VGDSKTGRIEKIDGLYELLNLLAKLGRWLPMPGNKGAR 216

Query: 60  ADLVPVDIVINLMICAAWKTAVKYKNSNGAQGITVYNCCTGQ 101
            +LVPVD V + ++  +                 +++     
Sbjct: 217 LNLVPVDYVADAIVYLS---------KKPEANGQIFHLTDPT 249


>gnl|CDD|187546 cd05235, SDR_e1, extended (e) SDRs, subgroup 1.  This family
           consists of an SDR module of multidomain proteins
           identified as putative polyketide sythases fatty acid
           synthases (FAS), and nonribosomal peptide synthases,
           among others. However, unlike the usual ketoreductase
           modules of FAS and polyketide synthase, these domains
           are related to the extended SDRs, and have canonical
           NAD(P)-binding motifs and an active site tetrad.
           Extended SDRs are distinct from classical SDRs. In
           addition to the Rossmann fold (alpha/beta folding
           pattern with a central beta-sheet) core region typical
           of all SDRs, extended SDRs have a less conserved
           C-terminal extension of approximately 100 amino acids.
           Extended SDRs are a diverse collection of proteins, and
           include isomerases, epimerases, oxidoreductases, and
           lyases; they typically have a TGXXGXXG cofactor binding
           motif. SDRs are a functionally diverse family of
           oxidoreductases that have a single domain with a
           structurally conserved Rossmann fold, an
           NAD(P)(H)-binding region, and a structurally diverse
           C-terminal region. Sequence identity between different
           SDR enzymes is typically in the 15-30% range; they
           catalyze a wide range of activities including the
           metabolism of steroids, cofactors, carbohydrates,
           lipids, aromatic compounds, and amino acids, and act in
           redox sensing. Classical SDRs have an TGXXX[AG]XG
           cofactor binding motif and a YXXXK active site motif,
           with the Tyr residue of the active site motif serving as
           a critical catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase numbering). In
           addition to the Tyr and Lys, there is often an upstream
           Ser and/or an Asn, contributing to the active site;
           while substrate binding is in the C-terminal region,
           which determines specificity. The standard reaction
           mechanism is a 4-pro-S hydride transfer and proton relay
           involving the conserved Tyr and Lys, a water molecule
           stabilized by Asn, and nicotinamide. Atypical SDRs
           generally lack the catalytic residues characteristic of
           the SDRs, and their glycine-rich NAD(P)-binding motif is
           often different from the forms normally seen in
           classical or extended SDRs. Complex (multidomain) SDRs
           such as ketoreductase domains of fatty acid synthase
           have a GGXGXXG NAD(P)-binding motif and an altered
           active site motif (YXXXN). Fungal type ketoacyl
           reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
          Length = 290

 Score = 67.7 bits (166), Expect = 1e-12
 Identities = 39/118 (33%), Positives = 60/118 (50%), Gaps = 14/118 (11%)

Query: 295 QLRQERSNELAKIKPILGDITEPELGISQNDQKILKQNVSVVFHSAATVKF---DEALKL 351
            L      EL++IK ++GD+++P LG+S +D + L + V V+ H+ A V +    E LK 
Sbjct: 52  GLNLWDELELSRIKVVVGDLSKPNLGLSDDDYQELAEEVDVIIHNGANVNWVYPYEELK- 110

Query: 352 SVTINMLGTKRLVELCHEMTHLEALIHVSTA-------YCNCDREEVREIIYSPPYDP 402
               N+LGTK L++L      L+ L  VST        Y   D EE  +++ S    P
Sbjct: 111 --PANVLGTKELLKLA-ATGKLKPLHFVSTLSVFSAEEYNALDDEESDDMLESQNGLP 165


>gnl|CDD|233557 TIGR01746, Thioester-redct, thioester reductase domain.  This model
           includes the terminal domain from the fungal alpha
           aminoadipate reductase enzyme (also known as
           aminoadipate semialdehyde dehydrogenase) which is
           involved in the biosynthesis of lysine , as well as the
           reductase-containing component of the myxochelin
           biosynthetic gene cluster, MxcG. The mechanism of
           reduction involves activation of the substrate by
           adenylation and transfer to a covalently-linked
           pantetheine cofactor as a thioester. This thioester is
           then reduced to give an aldehyde (thus releasing the
           product) and a regenerated pantetheine thiol. (In
           myxochelin biosynthesis this aldehyde is further reduced
           to an alcohol or converted to an amine by an
           aminotransferase.) This is a fundamentally different
           reaction than beta-ketoreductase domains of polyketide
           synthases which act at a carbonyl two carbons removed
           from the thioester and forms an alcohol as a product.
           This domain is invariably found at the C-terminus of the
           proteins which contain it (presumably because it results
           in the release of the product). The majority of hits to
           this model are non-ribosomal peptide synthetases in
           which this domain is similarly located proximal to a
           thiolation domain (pfam00550). In some cases this domain
           is found at the end of a polyketide synthetase enzyme,
           but is unlike ketoreductase domains which are found
           before the thiolase domains. Exceptions to this observed
           relationship with the thiolase domain include three
           proteins which consist of stand-alone reductase domains
           (GP|466833 from M. leprae, GP|435954 from Anabaena and
           OMNI|NTL02SC1199 from Strep. coelicolor) and one protein
           (OMNI|NTL01NS2636 from Nostoc) which contains N-terminal
           homology with a small group of hypothetical proteins but
           no evidence of a thiolation domain next to the putative
           reductase domain. Below the noise cutoff to this model
           are proteins containing more distantly related
           ketoreductase and dehydratase/epimerase domains. It has
           been suggested that a NADP-binding motif can be found in
           the N-terminal portion of this domain that may form a
           Rossman-type fold.
          Length = 367

 Score = 58.6 bits (142), Expect = 2e-09
 Identities = 27/94 (28%), Positives = 47/94 (50%), Gaps = 11/94 (11%)

Query: 296 LRQERSNELA----KIKPILGDITEPELGISQNDQKILKQNVSVVFHSAATVKF---DEA 348
           LR  R         +I+ + GD++EP LG+S  + + L +NV  + H+ A V +      
Sbjct: 48  LRSYRLWHEDLARERIEVVAGDLSEPRLGLSDAEWERLAENVDTIVHNGALVNWVYPYSE 107

Query: 349 LKLSVTINMLGTKRLVELCHEMTHLEALIHVSTA 382
           L+ +   N+LGT+ ++ L       + L +VST 
Sbjct: 108 LRGA---NVLGTREVLRLA-ASGRAKPLHYVSTI 137


>gnl|CDD|225857 COG3320, COG3320, Putative dehydrogenase domain of multifunctional
           non-ribosomal peptide synthetases and related enzymes
           [Secondary metabolites biosynthesis, transport, and
           catabolism].
          Length = 382

 Score = 57.0 bits (138), Expect = 7e-09
 Identities = 26/96 (27%), Positives = 48/96 (50%), Gaps = 7/96 (7%)

Query: 289 IDLLFDQLRQERSNELAKIKPILGDITEPELGISQNDQKILKQNVSVVFHSAATVKF--- 345
           ++  FD  R        +++ + GD+ EP+LG+S+   + L +NV ++ H+AA V     
Sbjct: 44  LEKTFDLYRHWDELSADRVEVVAGDLAEPDLGLSERTWQELAENVDLIIHNAALVNHVFP 103

Query: 346 DEALKLSVTINMLGTKRLVELCHEMTHLEALIHVST 381
              L+     N+LGT  ++ L       + L +VS+
Sbjct: 104 YSELR---GANVLGTAEVLRLA-ATGKPKPLHYVSS 135


>gnl|CDD|212494 cd08946, SDR_e, extended (e) SDRs.  Extended SDRs are distinct from
           classical SDRs. In addition to the Rossmann fold
           (alpha/beta folding pattern with a central beta-sheet)
           core region typical of all SDRs, extended SDRs have a
           less conserved C-terminal extension of approximately 100
           amino acids. Extended SDRs are a diverse collection of
           proteins, and include isomerases, epimerases,
           oxidoreductases, and lyases; they typically have a
           TGXXGXXG cofactor binding motif. SDRs are a functionally
           diverse family of oxidoreductases that have a single
           domain with a structurally conserved Rossmann fold, an
           NAD(P)(H)-binding region, and a structurally diverse
           C-terminal region. Sequence identity between different
           SDR enzymes is typically in the 15-30% range; they
           catalyze a wide range of activities including the
           metabolism of steroids, cofactors, carbohydrates,
           lipids, aromatic compounds, and amino acids, and act in
           redox sensing. Classical SDRs have an TGXXX[AG]XG
           cofactor binding motif and a YXXXK active site motif,
           with the Tyr residue of the active site motif serving as
           a critical catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase numbering). In
           addition to the Tyr and Lys, there is often an upstream
           Ser and/or an Asn, contributing to the active site;
           while substrate binding is in the C-terminal region,
           which determines specificity. The standard reaction
           mechanism is a 4-pro-S hydride transfer and proton relay
           involving the conserved Tyr and Lys, a water molecule
           stabilized by Asn, and nicotinamide. Atypical SDRs
           generally lack the catalytic residues characteristic of
           the SDRs, and their glycine-rich NAD(P)-binding motif is
           often different from the forms normally seen in
           classical or extended SDRs. Complex (multidomain) SDRs
           such as ketoreductase domains of fatty acid synthase
           have a GGXGXXG NAD(P)-binding motif and an altered
           active site motif (YXXXN). Fungal type ketoacyl
           reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
          Length = 200

 Score = 46.5 bits (111), Expect = 6e-06
 Identities = 17/71 (23%), Positives = 30/71 (42%), Gaps = 4/71 (5%)

Query: 332 NVSVVFHSAATVKFDEAL---KLSVTINMLGTKRLVELCHEMTHLEALIHVSTAYCNCDR 388
            + VV H AA V    +          N++GT  L+E   +   ++  ++ S+A      
Sbjct: 30  RLDVVVHLAALVGVPASWDNPDEDFETNVVGTLNLLEAARKAG-VKRFVYASSASVYGSP 88

Query: 389 EEVREIIYSPP 399
           E + E   +PP
Sbjct: 89  EGLPEEEETPP 99


>gnl|CDD|223528 COG0451, WcaG, Nucleoside-diphosphate-sugar epimerases [Cell
           envelope biogenesis, outer membrane / Carbohydrate
           transport and metabolism].
          Length = 314

 Score = 41.1 bits (96), Expect = 7e-04
 Identities = 18/98 (18%), Positives = 39/98 (39%), Gaps = 10/98 (10%)

Query: 291 LLFDQLRQERSNELAKIKPILGDITEPELGISQNDQKILKQNVSVVFHSAATVKFDEAL- 349
              D+LR      L+ ++ ++ D+T+ +L       ++ K     V H AA     ++  
Sbjct: 28  RGLDRLRDGLDPLLSGVEFVVLDLTDRDL-----VDELAKGVPDAVIHLAAQSSVPDSNA 82

Query: 350 ---KLSVTINMLGTKRLVELCHEMTHLEALIHVSTAYC 384
                 + +N+ GT  L+E       ++  +  S+   
Sbjct: 83  SDPAEFLDVNVDGTLNLLEAARAA-GVKRFVFASSVSV 119


>gnl|CDD|235962 PRK07201, PRK07201, short chain dehydrogenase; Provisional.
          Length = 657

 Score = 41.5 bits (98), Expect = 7e-04
 Identities = 29/81 (35%), Positives = 42/81 (51%), Gaps = 14/81 (17%)

Query: 306 KIKPILGDITEPELGISQNDQKILKQNVSVVFHSAA----TVKFDEALKLSVTINMLGTK 361
           ++ P++GD+TEP LG+S+ D   L   +  V H AA    T   +EA       N+ GT+
Sbjct: 52  RVVPLVGDLTEPGLGLSEADIAELGD-IDHVVHLAAIYDLTAD-EEA---QRAANVDGTR 106

Query: 362 RLVELCHEMTHLEA--LIHVS 380
            +VEL      L+A    HVS
Sbjct: 107 NVVELAER---LQAATFHHVS 124


>gnl|CDD|216461 pfam01370, Epimerase, NAD dependent epimerase/dehydratase family.
           This family of proteins utilise NAD as a cofactor. The
           proteins in this family use nucleotide-sugar substrates
           for a variety of chemical reactions.
          Length = 233

 Score = 38.0 bits (89), Expect = 0.005
 Identities = 27/125 (21%), Positives = 50/125 (40%), Gaps = 17/125 (13%)

Query: 297 RQERSNELAKIKPILGDITEPELGISQNDQKILKQN-VSVVFHSAATVKFDEALKLSVT- 354
           R+  S    +I+   GD+T+P+       +++L +     V H AA      + +     
Sbjct: 32  RRSESLNTGRIRFHEGDLTDPD-----ALERLLAEVQPDAVIHLAAQSGVGASFEDPADF 86

Query: 355 --INMLGTKRLVELCHEMTHLEALIHVSTA--YCNCDREEVREI-IYSP--PYDPQKIIE 407
              N+LGT RL+E       ++  +  S++  Y +     + E     P  PY   K+  
Sbjct: 87  IRANVLGTLRLLEAARRA-GVKRFVFASSSEVYGDVADPPITEDTPLGPLSPYAAAKL-- 143

Query: 408 TMEWM 412
             E +
Sbjct: 144 AAERL 148


>gnl|CDD|187673 cd09813, 3b-HSD-NSDHL-like_SDR_e, human NSDHL (NAD(P)H steroid
           dehydrogenase-like protein)-like, extended (e) SDRs.
           This subgroup includes human NSDHL and related proteins.
           These proteins have the characteristic active site
           tetrad of extended SDRs, and also have a close match to
           their  NAD(P)-binding motif.  Human NSDHL is a
           3beta-hydroxysteroid dehydrogenase (3 beta-HSD) which
           functions in the cholesterol biosynthetic pathway.  3
           beta-HSD catalyzes the oxidative conversion of delta 5-3
           beta-hydroxysteroids to the delta 4-3-keto
           configuration; this activity is essential for the
           biosynthesis of all classes of hormonal steroids.
           Mutations in the gene encoding NSDHL cause CHILD
           syndrome (congenital hemidysplasia with ichthyosiform
           nevus and limb defects), an X-linked dominant,
           male-lethal trait.  This subgroup also includes an
           unusual bifunctional [3beta-hydroxysteroid dehydrogenase
           (3b-HSD)/C-4 decarboxylase from Arabidopsis thaliana,
           and Saccharomyces cerevisiae ERG26, a 3b-HSD/C-4
           decarboxylase, involved in the synthesis of ergosterol,
           the major sterol of yeast.  Extended SDRs are distinct
           from classical SDRs. In addition to the Rossmann fold
           (alpha/beta folding pattern with a central beta-sheet)
           core region typical of all SDRs, extended SDRs have a
           less conserved C-terminal extension of approximately 100
           amino acids. Extended SDRs are a diverse collection of
           proteins, and include isomerases, epimerases,
           oxidoreductases, and lyases; they typically have a
           TGXXGXXG cofactor binding motif. SDRs are a functionally
           diverse family of oxidoreductases that have a single
           domain with a structurally conserved Rossmann fold, an
           NAD(P)(H)-binding region, and a structurally diverse
           C-terminal region. Sequence identity between different
           SDR enzymes is typically in the 15-30% range; they
           catalyze a wide range of activities including the
           metabolism of steroids, cofactors, carbohydrates,
           lipids, aromatic compounds, and amino acids, and act in
           redox sensing. Classical SDRs have an TGXXX[AG]XG
           cofactor binding motif and a YXXXK active site motif,
           with the Tyr residue of the active site motif serving as
           a critical catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase numbering). In
           addition to the Tyr and Lys, there is often an upstream
           Ser and/or an Asn, contributing to the active site;
           while substrate binding is in the C-terminal region,
           which determines specificity. The standard reaction
           mechanism is a 4-pro-S hydride transfer and proton relay
           involving the conserved Tyr and Lys, a water molecule
           stabilized by Asn, and nicotinamide. Atypical SDRs
           generally lack the catalytic residues characteristic of
           the SDRs, and their glycine-rich NAD(P)-binding motif is
           often different from the forms normally seen in
           classical or extended SDRs. Complex (multidomain) SDRs
           such as ketoreductase domains of fatty acid sythase have
           a GGXGXXG NAD(P)-binding motif and an altered active
           site motif (YXXXN). Fungal type ketoacyl reductases have
           a TGXXXGX(1-2)G NAD(P)-binding motif.
          Length = 335

 Score = 36.6 bits (85), Expect = 0.018
 Identities = 22/111 (19%), Positives = 55/111 (49%), Gaps = 13/111 (11%)

Query: 275 MLRSSVARSSWHLFIDLLFD--QLRQERSNELAKIKPILGDITEPELGISQNDQKILKQN 332
           ++   + R +  + +   FD     +   +   +++   GD+T+P+      ++   ++ 
Sbjct: 15  LVEQLLRRGNPTVHV---FDIRPTFELDPSSSGRVQFHTGDLTDPQ----DLEKAFNEKG 67

Query: 333 VSVVFHSAATV-KFDEALKLSVTINMLGTKRLVELCHEMTHLEALIHVSTA 382
            +VVFH+A+     ++ L   V  N+ GT+ ++E C +   ++ L++ S+A
Sbjct: 68  PNVVFHTASPDHGSNDDLYYKV--NVQGTRNVIEACRK-CGVKKLVYTSSA 115


>gnl|CDD|234212 TIGR03443, alpha_am_amid, L-aminoadipate-semialdehyde dehydrogenase. 
            Members of this protein family are
            L-aminoadipate-semialdehyde dehydrogenase (EC 1.2.1.31),
            product of the LYS2 gene. It is also called
            alpha-aminoadipate reductase. In fungi, lysine is
            synthesized via aminoadipate. Currently, all members of
            this family are fungal.
          Length = 1389

 Score = 36.2 bits (84), Expect = 0.033
 Identities = 12/39 (30%), Positives = 23/39 (58%)

Query: 305  AKIKPILGDITEPELGISQNDQKILKQNVSVVFHSAATV 343
            ++I+ +LGD+++ + G+S      L   V V+ H+ A V
Sbjct: 1034 SRIEVVLGDLSKEKFGLSDEKWSDLTNEVDVIIHNGALV 1072


>gnl|CDD|224016 COG1091, RfbD, dTDP-4-dehydrorhamnose reductase [Cell envelope
           biogenesis, outer membrane].
          Length = 281

 Score = 35.0 bits (81), Expect = 0.061
 Identities = 22/75 (29%), Positives = 36/75 (48%), Gaps = 11/75 (14%)

Query: 313 DITEPELGISQNDQKILKQNVSVVFHSAATVKFDEA---LKLSVTINMLGTKRLVELCHE 369
           DIT+P+       + I +    VV ++AA    D+A    +L+  +N  G + L     E
Sbjct: 35  DITDPD----AVLEVIRETRPDVVINAAAYTAVDKAESEPELAFAVNATGAENLARAAAE 90

Query: 370 MTHLEA-LIHVSTAY 383
              + A L+H+ST Y
Sbjct: 91  ---VGARLVHISTDY 102


>gnl|CDD|187548 cd05237, UDP_invert_4-6DH_SDR_e, UDP-Glcnac (UDP-linked
           N-acetylglucosamine) inverting 4,6-dehydratase, extended
           (e) SDRs.  UDP-Glcnac inverting 4,6-dehydratase was
           identified in Helicobacter pylori as the hexameric flaA1
           gene product (FlaA1). FlaA1 is hexameric, possesses
           UDP-GlcNAc-inverting 4,6-dehydratase activity,  and
           catalyzes the first step in the creation of a
           pseudaminic acid derivative in protein glycosylation.
           Although this subgroup has the NADP-binding motif
           characteristic of extended SDRs, its members tend to
           have a Met substituted for the active site Tyr found in
           most SDR families. Extended SDRs are distinct from
           classical SDRs. In addition to the Rossmann fold
           (alpha/beta folding pattern with a central beta-sheet)
           core region typical of all SDRs, extended SDRs have a
           less conserved C-terminal extension of approximately 100
           amino acids. Extended SDRs are a diverse collection of
           proteins, and include isomerases, epimerases,
           oxidoreductases, and lyases; they typically have a
           TGXXGXXG cofactor binding motif. SDRs are a functionally
           diverse family of oxidoreductases that have a single
           domain with a structurally conserved Rossmann fold, an
           NAD(P)(H)-binding region, and a structurally diverse
           C-terminal region. Sequence identity between different
           SDR enzymes is typically in the 15-30% range; they
           catalyze a wide range of activities including the
           metabolism of steroids, cofactors, carbohydrates,
           lipids, aromatic compounds, and amino acids, and act in
           redox sensing. Classical SDRs have an TGXXX[AG]XG
           cofactor binding motif and a YXXXK active site motif,
           with the Tyr residue of the active site motif serving as
           a critical catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase numbering). In
           addition to the Tyr and Lys, there is often an upstream
           Ser and/or an Asn, contributing to the active site;
           while substrate binding is in the C-terminal region,
           which determines specificity. The standard reaction
           mechanism is a 4-pro-S hydride transfer and proton relay
           involving the conserved Tyr and Lys, a water molecule
           stabilized by Asn, and nicotinamide. Atypical SDRs
           generally lack the catalytic residues characteristic of
           the SDRs, and their glycine-rich NAD(P)-binding motif is
           often different from the forms normally seen in
           classical or extended SDRs. Complex (multidomain) SDRs
           such as ketoreductase domains of fatty acid synthase
           have a GGXGXXG NAD(P)-binding motif and an altered
           active site motif (YXXXN). Fungal type ketoacyl
           reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
          Length = 287

 Score = 33.7 bits (78), Expect = 0.14
 Identities = 25/86 (29%), Positives = 43/86 (50%), Gaps = 21/86 (24%)

Query: 295 QLRQE-RSNELA-KIKPILGDITEPELGISQNDQKILKQ-NVSVVFHSAATVK------- 344
           +L +E RS     K++ I+GD+ + E       ++  K+    +VFH+AA  K       
Sbjct: 42  ELVRELRSRFPHDKLRFIIGDVRDKER-----LRRAFKERGPDIVFHAAAL-KHVPSMED 95

Query: 345 -FDEALKLSVTINMLGTKRLVELCHE 369
             +EA+K     N+LGTK +++   E
Sbjct: 96  NPEEAIK----TNVLGTKNVIDAAIE 117


>gnl|CDD|187552 cd05241, 3b-HSD-like_SDR_e, 3beta-hydroxysteroid dehydrogenases
           (3b-HSD)-like, extended (e) SDRs.  Extended SDR family
           domains belonging to this subgroup have the
           characteristic active site tetrad and a fairly
           well-conserved NAD(P)-binding motif. 3b-HSD catalyzes
           the NAD-dependent conversion of various steroids, such
           as pregnenolone to progesterone, or androstenediol to
           testosterone. This subgroup includes an unusual
           bifunctional 3b-HSD/C-4 decarboxylase from Arabidopsis
           thaliana, and Saccharomyces cerevisiae ERG26, a
           3b-HSD/C-4 decarboxylase, involved in the synthesis of
           ergosterol, the major sterol of yeast. It also includes
           human 3 beta-HSD/HSD3B1 and C(27) 3beta-HSD/
           [3beta-hydroxy-delta(5)-C(27)-steroid oxidoreductase;
           HSD3B7].  C(27) 3beta-HSD/HSD3B7 is a membrane-bound
           enzyme of the endoplasmic reticulum, that catalyzes the
           isomerization and oxidation of 7alpha-hydroxylated
           sterol intermediates, an early step in bile acid
           biosynthesis. Mutations in the human NSDHL (NAD(P)H
           steroid dehydrogenase-like protein) cause CHILD syndrome
           (congenital hemidysplasia with ichthyosiform nevus and
           limb defects), an X-linked dominant, male-lethal trait.
           Mutations in the human gene encoding C(27) 3beta-HSD
           underlie a rare autosomal recessive form of neonatal
           cholestasis. Extended SDRs are distinct from classical
           SDRs. In addition to the Rossmann fold (alpha/beta
           folding pattern with a central beta-sheet) core region
           typical of all SDRs, extended SDRs have a less conserved
           C-terminal extension of approximately 100 amino acids.
           Extended SDRs are a diverse collection of proteins, and
           include isomerases, epimerases, oxidoreductases, and
           lyases; they typically have a TGXXGXXG cofactor binding
           motif. SDRs are a functionally diverse family of
           oxidoreductases that have a single domain with a
           structurally conserved Rossmann fold, an
           NAD(P)(H)-binding region, and a structurally diverse
           C-terminal region. Sequence identity between different
           SDR enzymes is typically in the 15-30% range; they
           catalyze a wide range of activities including the
           metabolism of steroids, cofactors, carbohydrates,
           lipids, aromatic compounds, and amino acids, and act in
           redox sensing. Classical SDRs have an TGXXX[AG]XG
           cofactor binding motif and a YXXXK active site motif,
           with the Tyr residue of the active site motif serving as
           a critical catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase numbering). In
           addition to the Tyr and Lys, there is often an upstream
           Ser and/or an Asn, contributing to the active site;
           while substrate binding is in the C-terminal region,
           which determines specificity. The standard reaction
           mechanism is a 4-pro-S hydride transfer and proton relay
           involving the conserved Tyr and Lys, a water molecule
           stabilized by Asn, and nicotinamide. Atypical SDRs
           generally lack the catalytic residues characteristic of
           the SDRs, and their glycine-rich NAD(P)-binding motif is
           often different from the forms normally seen in
           classical or extended SDRs. Complex (multidomain) SDRs
           such as ketoreductase domains of fatty acid sythase have
           a GGXGXXG NAD(P)-binding motif and an altered active
           site motif (YXXXN). Fungal type ketoacyl reductases have
           a TGXXXGX(1-2)G NAD(P)-binding motif.
          Length = 331

 Score = 33.6 bits (77), Expect = 0.16
 Identities = 18/76 (23%), Positives = 35/76 (46%), Gaps = 7/76 (9%)

Query: 307 IKPILGDITEPELGISQNDQKILKQNVSVVFHSAATVKFDEALKLSVTINMLGTKRLVEL 366
           I+ + GDIT+    + Q            VFH+AA V       L   +N+ GT+ +++ 
Sbjct: 47  IEFLKGDITDRND-VEQALSGA-----DCVFHTAAIVPLAGPRDLYWEVNVGGTQNVLDA 100

Query: 367 CHEMTHLEALIHVSTA 382
           C     ++  ++ S++
Sbjct: 101 CQR-CGVQKFVYTSSS 115


>gnl|CDD|216283 pfam01073, 3Beta_HSD, 3-beta hydroxysteroid dehydrogenase/isomerase
           family.  The enzyme 3 beta-hydroxysteroid
           dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD)
           catalyzes the oxidation and isomerisation of 5-ene-3
           beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid
           precursors into the corresponding 4-ene-ketosteroids
           necessary for the formation of all classes of steroid
           hormones.
          Length = 280

 Score = 33.5 bits (77), Expect = 0.17
 Identities = 25/100 (25%), Positives = 45/100 (45%), Gaps = 17/100 (17%)

Query: 289 IDLLFDQLRQERSNELAKIKPILGDITEPELGISQNDQKILKQNVSVVFHSAATV----- 343
            DL F     E  ++L  I  I GD+T+ +      D +   Q   VV H+AA +     
Sbjct: 29  FDLRFSPELLEDFSKLQVITYIEGDVTDKQ------DLRRALQGSDVVIHTAAIIDVFGK 82

Query: 344 -KFDEALKLSVTINMLGTKRLVELCHEMTHLEALIHVSTA 382
              D  +K    +N+ GT+ +++ C +   +  L++ S+ 
Sbjct: 83  AYRDTIMK----VNVKGTQNVLDACVK-AGVRVLVYTSSM 117


>gnl|CDD|187671 cd09811, 3b-HSD_HSDB1_like_SDR_e, human 3beta-HSD (hydroxysteroid
           dehydrogenase) and HSD3B1(delta 5-delta
           4-isomerase)-like, extended (e) SDRs.  This extended-SDR
           subgroup includes human 3 beta-HSD/HSD3B1 and C(27)
           3beta-HSD/ [3beta-hydroxy-delta(5)-C(27)-steroid
           oxidoreductase; HSD3B7], and related proteins. These
           proteins have the characteristic active site tetrad and
           NAD(P)-binding motif of extended SDRs. 3 beta-HSD
           catalyzes the oxidative conversion of delta 5-3
           beta-hydroxysteroids to the delta 4-3-keto
           configuration; this activity is essential for the
           biosynthesis of all classes of hormonal steroids. C(27)
           3beta-HSD is a membrane-bound enzyme of the endoplasmic
           reticulum, it catalyzes the isomerization and oxidation
           of 7alpha-hydroxylated sterol intermediates, an early
           step in bile acid biosynthesis. Mutations in the human
           gene encoding C(27) 3beta-HSD underlie a rare autosomal
           recessive form of neonatal cholestasis. Extended SDRs
           are distinct from classical SDRs. In addition to the
           Rossmann fold (alpha/beta folding pattern with a central
           beta-sheet) core region typical of all SDRs, extended
           SDRs have a less conserved C-terminal extension of
           approximately 100 amino acids. Extended SDRs are a
           diverse collection of proteins, and include isomerases,
           epimerases, oxidoreductases, and lyases; they typically
           have a TGXXGXXG cofactor binding motif. SDRs are a
           functionally diverse family of oxidoreductases that have
           a single domain with a structurally conserved Rossmann
           fold, an NAD(P)(H)-binding region, and a structurally
           diverse C-terminal region. Sequence identity between
           different SDR enzymes is typically in the 15-30% range;
           they catalyze a wide range of activities including the
           metabolism of steroids, cofactors, carbohydrates,
           lipids, aromatic compounds, and amino acids, and act in
           redox sensing. Classical SDRs have an TGXXX[AG]XG
           cofactor binding motif and a YXXXK active site motif,
           with the Tyr residue of the active site motif serving as
           a critical catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase numbering). In
           addition to the Tyr and Lys, there is often an upstream
           Ser and/or an Asn, contributing to the active site;
           while substrate binding is in the C-terminal region,
           which determines specificity. The standard reaction
           mechanism is a 4-pro-S hydride transfer and proton relay
           involving the conserved Tyr and Lys, a water molecule
           stabilized by Asn, and nicotinamide. Atypical SDRs
           generally lack the catalytic residues characteristic of
           the SDRs, and their glycine-rich NAD(P)-binding motif is
           often different from the forms normally seen in
           classical or extended SDRs. Complex (multidomain) SDRs
           such as ketoreductase domains of fatty acid sythase have
           a GGXGXXG NAD(P)-binding motif and an altered active
           site motif (YXXXN). Fungal type ketoacyl reductases have
           a TGXXXGX(1-2)G NAD(P)-binding motif.
          Length = 354

 Score = 33.2 bits (76), Expect = 0.20
 Identities = 28/113 (24%), Positives = 46/113 (40%), Gaps = 22/113 (19%)

Query: 306 KIKPILGDIT-EPELGISQNDQKILKQNVSVVFHSAATVK-FDEALKLSV-TINMLGTKR 362
            +  I GDI     L  +          VSVV H+AA V  F       +  +N+ GT+ 
Sbjct: 52  YVTDIEGDIKDLSFLFRACQG-------VSVVIHTAAIVDVFGPPNYEELEEVNVNGTQA 104

Query: 363 LVELCHEMTHLEALIHVST---AYCNC--------DREEVREIIYSPPYDPQK 404
           ++E C +  +++ L++ S+   A  N           +   E   +PPY   K
Sbjct: 105 VLEACVQ-NNVKRLVYTSSIEVAGPNFKGRPIFNGVEDTPYEDTSTPPYASSK 156


>gnl|CDD|218026 pfam04321, RmlD_sub_bind, RmlD substrate binding domain.
           L-rhamnose is a saccharide required for the virulence of
           some bacteria. Its precursor, dTDP-L-rhamnose, is
           synthesised by four different enzymes the final one of
           which is RmlD. The RmlD substrate binding domain is
           responsible for binding a sugar nucleotide.
          Length = 284

 Score = 33.0 bits (76), Expect = 0.22
 Identities = 24/76 (31%), Positives = 35/76 (46%), Gaps = 11/76 (14%)

Query: 312 GDITEPELGISQNDQKILKQNVSVVFHSAATVKFDEALK---LSVTINMLGTKRLVELCH 368
            D+T+PE  +      + +    VV ++AA    D+A     L+  +N LG   L E C 
Sbjct: 33  LDLTDPE-AV---AALVREARPDVVVNAAAYTAVDKAESEPELAYAVNALGPGNLAEAC- 87

Query: 369 EMTHLEA-LIHVSTAY 383
                 A LIH+ST Y
Sbjct: 88  --AARGAPLIHISTDY 101


>gnl|CDD|233954 TIGR02622, CDP_4_6_dhtase, CDP-glucose 4,6-dehydratase.  Members of
           this protein family are CDP-glucose 4,6-dehydratase from
           a variety of Gram-negative and Gram-positive bacteria.
           Members typically are encoded next to a gene that
           encodes a glucose-1-phosphate cytidylyltransferase,
           which produces the substrate, CDP-D-glucose, used by
           this enzyme to produce CDP-4-keto-6-deoxyglucose [Cell
           envelope, Biosynthesis and degradation of surface
           polysaccharides and lipopolysaccharides].
          Length = 349

 Score = 33.0 bits (76), Expect = 0.29
 Identities = 25/106 (23%), Positives = 49/106 (46%), Gaps = 16/106 (15%)

Query: 292 LFDQLRQERSNELAKIKPILGDITEPELGISQNDQKILKQ-NVSVVFHSAA----TVKFD 346
           LF+ L   +     KI+   GDI +         +K + +    +VFH AA       + 
Sbjct: 44  LFELLNLAK-----KIEDHFGDIRDAA-----KLRKAIAEFKPEIVFHLAAQPLVRKSYA 93

Query: 347 EALKLSVTINMLGTKRLVELCHEMTHLEALIHVSTAYCNCDREEVR 392
           + L+ +   N++GT  L+E    +  ++A+++V++  C  + E V 
Sbjct: 94  DPLE-TFETNVMGTVNLLEAIRAIGSVKAVVNVTSDKCYRNDEWVW 138


>gnl|CDD|187562 cd05252, CDP_GD_SDR_e, CDP-D-glucose 4,6-dehydratase, extended (e)
           SDRs.  This subgroup contains CDP-D-glucose
           4,6-dehydratase, an extended SDR, which catalyzes the
           conversion of CDP-D-glucose to
           CDP-4-keto-6-deoxy-D-glucose. This subgroup has the
           characteristic active site tetrad and NAD-binding motif
           of the extended SDRs. Extended SDRs are distinct from
           classical SDRs. In addition to the Rossmann fold
           (alpha/beta folding pattern with a central beta-sheet)
           core region typical of all SDRs, extended SDRs have a
           less conserved C-terminal extension of approximately 100
           amino acids. Extended SDRs are a diverse collection of
           proteins, and include isomerases, epimerases,
           oxidoreductases, and lyases; they typically have a
           TGXXGXXG cofactor binding motif. SDRs are a functionally
           diverse family of oxidoreductases that have a single
           domain with a structurally conserved Rossmann fold, an
           NAD(P)(H)-binding region, and a structurally diverse
           C-terminal region. Sequence identity between different
           SDR enzymes is typically in the 15-30% range; they
           catalyze a wide range of activities including the
           metabolism of steroids, cofactors, carbohydrates,
           lipids, aromatic compounds, and amino acids, and act in
           redox sensing. Classical SDRs have an TGXXX[AG]XG
           cofactor binding motif and a YXXXK active site motif,
           with the Tyr residue of the active site motif serving as
           a critical catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase numbering). In
           addition to the Tyr and Lys, there is often an upstream
           Ser and/or an Asn, contributing to the active site;
           while substrate binding is in the C-terminal region,
           which determines specificity. The standard reaction
           mechanism is a 4-pro-S hydride transfer and proton relay
           involving the conserved Tyr and Lys, a water molecule
           stabilized by Asn, and nicotinamide. Atypical SDRs
           generally lack the catalytic residues characteristic of
           the SDRs, and their glycine-rich NAD(P)-binding motif is
           often different from the forms normally seen in
           classical or extended SDRs. Complex (multidomain) SDRs
           such as ketoreductase domains of fatty acid synthase
           have a GGXGXXG NAD(P)-binding motif and an altered
           active site motif (YXXXN). Fungal type ketoacyl
           reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
          Length = 336

 Score = 32.3 bits (74), Expect = 0.46
 Identities = 29/117 (24%), Positives = 51/117 (43%), Gaps = 15/117 (12%)

Query: 292 LFDQLRQERSNELAKIKPILGDITEPELGISQNDQKILKQNVSVVFHSAATVKFDEALKL 351
           LF+    +      KI    GDI +    ++   + I +    +VFH AA      + K 
Sbjct: 44  LFELANLDN-----KISSTRGDIRD----LNALREAIREYEPEIVFHLAAQPLVRLSYKD 94

Query: 352 SVT---INMLGTKRLVELCHEMTHLEALIHVSTAYCNCDREEV---REIIYSPPYDP 402
            V     N++GT  L+E   E   ++A+++V++  C  ++E     RE      +DP
Sbjct: 95  PVETFETNVMGTVNLLEAIRETGSVKAVVNVTSDKCYENKEWGWGYRENDPLGGHDP 151


>gnl|CDD|217199 pfam02719, Polysacc_synt_2, Polysaccharide biosynthesis protein.
           This is a family of diverse bacterial polysaccharide
           biosynthesis proteins including the CapD protein, WalL
           protein mannosyl-transferase and several putative
           epimerases (e.g. WbiI).
          Length = 280

 Score = 31.3 bits (72), Expect = 0.83
 Identities = 28/98 (28%), Positives = 51/98 (52%), Gaps = 20/98 (20%)

Query: 294 DQLRQERSNELA--KIKPILGDITEPELGISQNDQKILKQN-VSVVFHSAA-----TVKF 345
            ++RQE   E    K++  +GD+ + E       ++ ++Q+ V  VFH+AA      V++
Sbjct: 37  YEIRQELRQEYNDPKLRFFIGDVRDRERL-----ERAMEQHGVDTVFHAAALKHVPLVEY 91

Query: 346 D--EALKLSVTINMLGTKRLVELCHEMTHLEALIHVST 381
           +  EA+K     N+LGT+ + E   E   +E  + +ST
Sbjct: 92  NPMEAIK----TNVLGTENVAEAAIEN-GVEKFVLIST 124


>gnl|CDD|187564 cd05254, dTDP_HR_like_SDR_e, dTDP-6-deoxy-L-lyxo-4-hexulose
           reductase and related proteins, extended (e) SDRs.
           dTDP-6-deoxy-L-lyxo-4-hexulose reductase, an extended
           SDR, synthesizes dTDP-L-rhamnose from
           alpha-D-glucose-1-phosphate,  providing the precursor of
           L-rhamnose, an essential cell wall component of many
           pathogenic bacteria. This subgroup has the
           characteristic active site tetrad and NADP-binding
           motif. This subgroup also contains human MAT2B, the
           regulatory subunit of methionine adenosyltransferase
           (MAT); MAT catalyzes S-adenosylmethionine synthesis. The
           human gene encoding MAT2B encodes two major splicing
           variants which are induced in human cell liver cancer
           and regulate HuR, an mRNA-binding protein which
           stabilizes the mRNA of several cyclins, to affect cell
           proliferation. Both MAT2B variants include this extended
           SDR domain. Extended SDRs are distinct from classical
           SDRs. In addition to the Rossmann fold (alpha/beta
           folding pattern with a central beta-sheet) core region
           typical of all SDRs, extended SDRs have a less conserved
           C-terminal extension of approximately 100 amino acids.
           Extended SDRs are a diverse collection of proteins, and
           include isomerases, epimerases, oxidoreductases, and
           lyases; they typically have a TGXXGXXG cofactor binding
           motif. SDRs are a functionally diverse family of
           oxidoreductases that have a single domain with a
           structurally conserved Rossmann fold, an
           NAD(P)(H)-binding region, and a structurally diverse
           C-terminal region. Sequence identity between different
           SDR enzymes is typically in the 15-30% range; they
           catalyze a wide range of activities including the
           metabolism of steroids, cofactors, carbohydrates,
           lipids, aromatic compounds, and amino acids, and act in
           redox sensing. Classical SDRs have an TGXXX[AG]XG
           cofactor binding motif and a YXXXK active site motif,
           with the Tyr residue of the active site motif serving as
           a critical catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase numbering). In
           addition to the Tyr and Lys, there is often an upstream
           Ser and/or an Asn, contributing to the active site;
           while substrate binding is in the C-terminal region,
           which determines specificity. The standard reaction
           mechanism is a 4-pro-S hydride transfer and proton relay
           involving the conserved Tyr and Lys, a water molecule
           stabilized by Asn, and nicotinamide. Atypical SDRs
           generally lack the catalytic residues characteristic of
           the SDRs, and their glycine-rich NAD(P)-binding motif is
           often different from the forms normally seen in
           classical or extended SDRs. Complex (multidomain) SDRs
           such as ketoreductase domains of fatty acid synthase
           have a GGXGXXG NAD(P)-binding motif and an altered
           active site motif (YXXXN). Fungal type ketoacyl
           reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
          Length = 280

 Score = 31.1 bits (71), Expect = 0.94
 Identities = 15/53 (28%), Positives = 26/53 (49%), Gaps = 7/53 (13%)

Query: 335 VVFHSAATVKFDEA---LKLSVTINMLGTKRLVELCHEM-THLEALIHVSTAY 383
           V+ + AA  + D+     +L+  +N+L  + L     E+      LIH+ST Y
Sbjct: 58  VIINCAAYTRVDKCESDPELAYRVNVLAPENLARAAKEVGAR---LIHISTDY 107


>gnl|CDD|130249 TIGR01181, dTDP_gluc_dehyt, dTDP-glucose 4,6-dehydratase.  This
           protein is related to UDP-glucose 4-epimerase (GalE) and
           likewise has an NAD cofactor [Cell envelope,
           Biosynthesis and degradation of surface polysaccharides
           and lipopolysaccharides].
          Length = 317

 Score = 31.2 bits (71), Expect = 1.1
 Identities = 19/73 (26%), Positives = 30/73 (41%), Gaps = 7/73 (9%)

Query: 312 GDITEPELGISQNDQKILKQNVSVVFHSAATVKFDEAL---KLSVTINMLGTKRLVELCH 368
           GDI + EL      +   +     V H AA    D ++      +  N++GT  L+E   
Sbjct: 57  GDIGDREL----VSRLFTEHQPDAVVHFAAESHVDRSISGPAAFIETNVVGTYTLLEAVR 112

Query: 369 EMTHLEALIHVST 381
           +  H     H+ST
Sbjct: 113 KYWHEFRFHHIST 125


>gnl|CDD|224011 COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases
           [Cell envelope biogenesis, outer membrane / Carbohydrate
           transport and metabolism].
          Length = 588

 Score = 29.9 bits (68), Expect = 3.2
 Identities = 19/83 (22%), Positives = 39/83 (46%), Gaps = 17/83 (20%)

Query: 294 DQLRQE--RSNELAKIKPILGDITEPELGISQNDQKILKQNVSVVFHSAA-----TVKF- 345
             +  E        K++  +GD+ + +    + ++ +    V +VFH+AA      V++ 
Sbjct: 289 YLIDMELREKFPELKLRFYIGDVRDRD----RVERAMEGHKVDIVFHAAALKHVPLVEYN 344

Query: 346 -DEALKLSVTINMLGTKRLVELC 367
            +EA+K     N+LGT+ + E  
Sbjct: 345 PEEAIK----TNVLGTENVAEAA 363


>gnl|CDD|187551 cd05240, UDP_G4E_3_SDR_e, UDP-glucose 4 epimerase (G4E), subgroup
           3, extended (e) SDRs.  Members of this bacterial
           subgroup are identified as possible sugar epimerases,
           such as UDP-glucose 4 epimerase. However, while the
           NAD(P)-binding motif is fairly well conserved, not all
           members retain the canonical active site tetrad of the
           extended SDRs. UDP-glucose 4 epimerase (aka
           UDP-galactose-4-epimerase), is a homodimeric extended
           SDR. It catalyzes the NAD-dependent conversion of
           UDP-galactose to UDP-glucose, the final step in Leloir
           galactose synthesis. Extended SDRs are distinct from
           classical SDRs. In addition to the Rossmann fold
           (alpha/beta folding pattern with a central beta-sheet)
           core region typical of all SDRs, extended SDRs have a
           less conserved C-terminal extension of approximately 100
           amino acids. Extended SDRs are a diverse collection of
           proteins, and include isomerases, epimerases,
           oxidoreductases, and lyases; they typically have a
           TGXXGXXG cofactor binding motif. SDRs are a functionally
           diverse family of oxidoreductases that have a single
           domain with a structurally conserved Rossmann fold, an
           NAD(P)(H)-binding region, and a structurally diverse
           C-terminal region. Sequence identity between different
           SDR enzymes is typically in the 15-30% range; they
           catalyze a wide range of activities including the
           metabolism of steroids, cofactors, carbohydrates,
           lipids, aromatic compounds, and amino acids, and act in
           redox sensing. Classical SDRs have an TGXXX[AG]XG
           cofactor binding motif and a YXXXK active site motif,
           with the Tyr residue of the active site motif serving as
           a critical catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase numbering). In
           addition to the Tyr and Lys, there is often an upstream
           Ser and/or an Asn, contributing to the active site;
           while substrate binding is in the C-terminal region,
           which determines specificity. The standard reaction
           mechanism is a 4-pro-S hydride transfer and proton relay
           involving the conserved Tyr and Lys, a water molecule
           stabilized by Asn, and nicotinamide. Atypical SDRs
           generally lack the catalytic residues characteristic of
           the SDRs, and their glycine-rich NAD(P)-binding motif is
           often different from the forms normally seen in
           classical or extended SDRs. Complex (multidomain) SDRs
           such as ketoreductase domains of fatty acid synthase
           have a GGXGXXG NAD(P)-binding motif and an altered
           active site motif (YXXXN). Fungal type ketoacyl
           reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
          Length = 306

 Score = 29.3 bits (66), Expect = 3.4
 Identities = 12/71 (16%), Positives = 25/71 (35%), Gaps = 6/71 (8%)

Query: 312 GDITEPELGISQNDQKILKQNVSVVFHSAATVKFDEALKLSVTINMLGTKRLVELCHEMT 371
            DI +P            ++    V H A  +           IN+ GT+ +++ C    
Sbjct: 47  LDIRDPAA-----ADVFREREADAVVHLAFILDPPRDGAERHRINVDGTQNVLDACAA-A 100

Query: 372 HLEALIHVSTA 382
            +  ++  S+ 
Sbjct: 101 GVPRVVVTSSV 111


>gnl|CDD|187539 cd05228, AR_FR_like_1_SDR_e, uncharacterized subgroup of aldehyde
           reductase and flavonoid reductase related proteins,
           extended (e) SDRs.  This subgroup contains proteins of
           unknown function related to aldehyde reductase and
           flavonoid reductase of the extended SDR-type. Aldehyde
           reductase I (aka carbonyl reductase) is an NADP-binding
           SDR; it has an NADP-binding motif consensus that is
           slightly different from the canonical SDR form and lacks
           the Asn of the extended SDR active site tetrad. Aldehyde
           reductase I catalyzes the NADP-dependent  reduction of
           ethyl 4-chloro-3-oxobutanoate to ethyl
           (R)-4-chloro-3-hydroxybutanoate. The related flavonoid
           reductases act in the NADP-dependent reduction of
           flavonoids, ketone-containing plant secondary
           metabolites. Extended SDRs are distinct from classical
           SDRs. In addition to the Rossmann fold (alpha/beta
           folding pattern with a central beta-sheet) core region
           typical of all SDRs, extended SDRs have a less conserved
           C-terminal extension of approximately 100 amino acids.
           Extended SDRs are a diverse collection of proteins, and
           include isomerases, epimerases, oxidoreductases, and
           lyases; they typically have a TGXXGXXG cofactor binding
           motif. SDRs are a functionally diverse family of
           oxidoreductases that have a single domain with a
           structurally conserved Rossmann fold, an
           NAD(P)(H)-binding region, and a structurally diverse
           C-terminal region. Sequence identity between different
           SDR enzymes is typically in the 15-30% range; they
           catalyze a wide range of activities including the
           metabolism of steroids, cofactors, carbohydrates,
           lipids, aromatic compounds, and amino acids, and act in
           redox sensing. Classical SDRs have an TGXXX[AG]XG
           cofactor binding motif and a YXXXK active site motif,
           with the Tyr residue of the active site motif serving as
           a critical catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase numbering). In
           addition to the Tyr and Lys, there is often an upstream
           Ser and/or an Asn, contributing to the active site;
           while substrate binding is in the C-terminal region,
           which determines specificity. The standard reaction
           mechanism is a 4-pro-S hydride transfer and proton relay
           involving the conserved Tyr and Lys, a water molecule
           stabilized by Asn, and nicotinamide. Atypical SDRs
           generally lack the catalytic residues characteristic of
           the SDRs, and their glycine-rich NAD(P)-binding motif is
           often different from the forms normally seen in
           classical or extended SDRs. Complex (multidomain) SDRs
           such as ketoreductase domains of fatty acid synthase
           have a GGXGXXG NAD(P)-binding motif and an altered
           active site motif (YXXXN). Fungal type ketoacyl
           reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
          Length = 318

 Score = 29.2 bits (66), Expect = 4.6
 Identities = 12/52 (23%), Positives = 23/52 (44%), Gaps = 2/52 (3%)

Query: 331 QNVSVVFHSAATVKF-DEALKLSVTINMLGTKRLVELCHEMTHLEALIHVST 381
           +    VFH AA      +  K     N+ GT+ +++   E   +  ++H S+
Sbjct: 61  KGCDRVFHLAAFTSLWAKDRKELYRTNVEGTRNVLDAALE-AGVRRVVHTSS 111


>gnl|CDD|165284 PHA02980, PHA02980, hypothetical protein; Provisional.
          Length = 160

 Score = 28.2 bits (63), Expect = 5.4
 Identities = 17/75 (22%), Positives = 35/75 (46%), Gaps = 4/75 (5%)

Query: 223 PKYIANYVLGIRTFIFKEQASSLPQARKRLYKMLWIHRLSKLLM----ILLVWRLLMLRS 278
           P Y   + L I ++IF      L   R+R    L  + + +L +     ++ + + MLR 
Sbjct: 43  PPYYVRFSLYIISYIFMGHGMYLIHTRRRDSNELIAYYILQLSINLSWFIVFYEMHMLRL 102

Query: 279 SVARSSWHLFIDLLF 293
           S+A S++ + + +  
Sbjct: 103 SIAISTFSICLSVYN 117


>gnl|CDD|178094 PLN02476, PLN02476, O-methyltransferase.
          Length = 278

 Score = 28.5 bits (63), Expect = 7.0
 Identities = 14/57 (24%), Positives = 31/57 (54%), Gaps = 6/57 (10%)

Query: 310 ILGDITEPELGISQNDQKILKQNVSVVFHSAATVKFDEALKLSVTINMLGTKRLVEL 366
           +L ++ EP++       + L++  S +  S   V  D+A  L++ + +LG +R +E+
Sbjct: 75  VLSNVREPKI------LRQLREETSKMRGSQMQVSPDQAQLLAMLVQILGAERCIEV 125


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.323    0.136    0.418 

Gapped
Lambda     K      H
   0.267   0.0752    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 21,641,081
Number of extensions: 2096949
Number of successful extensions: 1773
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1758
Number of HSP's successfully gapped: 51
Length of query: 422
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 323
Effective length of database: 6,546,556
Effective search space: 2114537588
Effective search space used: 2114537588
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 60 (26.8 bits)