RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy3027
         (300 letters)



>gnl|CDD|187547 cd05236, FAR-N_SDR_e, fatty acyl CoA reductases (FARs), extended
           (e) SDRs.  SDRs are Rossmann-fold NAD(P)H-binding
           proteins, many of which may function as fatty acyl CoA
           reductases (FAR), acting on medium and long chain fatty
           acids, and have been reported to be involved in diverse
           processes such as biosynthesis of insect pheromones,
           plant cuticular wax production, and mammalian wax
           biosynthesis. In Arabidopsis thaliana, proteins with
           this particular architecture have also been identified
           as the MALE STERILITY 2 (MS2) gene product, which is
           implicated in male gametogenesis. Mutations in MS2
           inhibit the synthesis of exine (sporopollenin),
           rendering plants unable to reduce pollen wall fatty
           acids to corresponding alcohols. This N-terminal domain
           shares the catalytic triad (but not the upstream Asn)
           and characteristic NADP-binding motif of the extended
           SDR family. Extended SDRs are distinct from classical
           SDRs. In addition to the Rossmann fold (alpha/beta
           folding pattern with a central beta-sheet) core region
           typical of all SDRs, extended SDRs have a less conserved
           C-terminal extension of approximately 100 amino acids.
           Extended SDRs are a diverse collection of proteins, and
           include isomerases, epimerases, oxidoreductases, and
           lyases; they typically have a TGXXGXXG cofactor binding
           motif. SDRs are a functionally diverse family of
           oxidoreductases that have a single domain with a
           structurally conserved Rossmann fold, an
           NAD(P)(H)-binding region, and a structurally diverse
           C-terminal region. Sequence identity between different
           SDR enzymes is typically in the 15-30% range; they
           catalyze a wide range of activities including the
           metabolism of steroids, cofactors, carbohydrates,
           lipids, aromatic compounds, and amino acids, and act in
           redox sensing. Classical SDRs have an TGXXX[AG]XG
           cofactor binding motif and a YXXXK active site motif,
           with the Tyr residue of the active site motif serving as
           a critical catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase numbering). In
           addition to the Tyr and Lys, there is often an upstream
           Ser and/or an Asn, contributing to the active site;
           while substrate binding is in the C-terminal region,
           which determines specificity. The standard reaction
           mechanism is a 4-pro-S hydride transfer and proton relay
           involving the conserved Tyr and Lys, a water molecule
           stabilized by Asn, and nicotinamide. Atypical SDRs
           generally lack the catalytic residues characteristic of
           the SDRs, and their glycine-rich NAD(P)-binding motif is
           often different from the forms normally seen in
           classical or extended SDRs. Complex (multidomain) SDRs
           such as ketoreductase domains of fatty acid synthase
           have a GGXGXXG NAD(P)-binding motif and an altered
           active site motif (YXXXN). Fungal type ketoacyl
           reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
          Length = 320

 Score =  126 bits (319), Expect = 3e-34
 Identities = 50/133 (37%), Positives = 75/133 (56%), Gaps = 3/133 (2%)

Query: 1   LLHEWPNTYTFTKALAEDVLCKEGKGLPLAVLRPAVIVSTYREPVRGWIDNVYGPIGMLV 60
           LL   PNTYTFTKALAE ++ KE   LPL ++RP+++ +T +EP  GWIDN  GP G+ +
Sbjct: 186 LLGGHPNTYTFTKALAERLVLKERGNLPLVIVRPSIVGATLKEPFPGWIDNFNGPDGLFL 245

Query: 61  GIATGVLHTHLINLNTVTDMVPSTVSDGPSEGVWQKKFPTSVFRPDPIPIYNFVGSARNP 120
               G+L T   + N V D++P    D  +  +      + V +P  + +Y+   S  NP
Sbjct: 246 AYGKGILRTMNADPNAVADIIP---VDVVANALLAAAAYSGVRKPRELEVYHCGSSDVNP 302

Query: 121 ISWGKLDELATVC 133
            +WG+ +EL    
Sbjct: 303 FTWGEAEELINQY 315


>gnl|CDD|176924 cd09071, FAR_C, C-terminal domain of fatty acyl CoA reductases.
           C-terminal domain of fatty acyl CoA reductases, a family
           of SDR-like proteins. SDRs or short-chain
           dehydrogenases/reductases are Rossmann-fold
           NAD(P)H-binding proteins. Many proteins in this FAR_C
           family may function as fatty acyl-CoA reductases (FARs),
           acting on medium and long chain fatty acids, and have
           been reported to be involved in diverse processes such
           as the biosynthesis of insect pheromones, plant
           cuticular wax production, and mammalian wax
           biosynthesis. In Arabidopsis thaliana, proteins with
           this particular architecture have also been identified
           as the MALE STERILITY 2 (MS2) gene product, which is
           implicated in male gametogenesis. Mutations in MS2
           inhibit the synthesis of exine (sporopollenin),
           rendering plants unable to reduce pollen wall fatty
           acids to corresponding alcohols. The function of this
           C-terminal domain is unclear.
          Length = 92

 Score =  116 bits (293), Expect = 5e-33
 Identities = 42/91 (46%), Positives = 58/91 (63%)

Query: 163 FLHTIPGFILDSLASLSGRKPMLMAIYKKIDKASDTLEYFAHRQWVWTDHNVQALFNKLD 222
           FLH +P ++LD L  L GRKP L+ +Y+KI K  D LEYF   +W + + N +AL+ +L 
Sbjct: 1   FLHLLPAYLLDLLLRLLGRKPRLLKLYRKIHKLLDLLEYFTTNEWRFDNDNTRALWERLS 60

Query: 223 PEDQELFFFDMGQMKWEYYAEAMGLGLRVYL 253
            ED+ELF FD+  + W+ Y E    GLR YL
Sbjct: 61  EEDRELFNFDIRSIDWDDYFENYIPGLRKYL 91


>gnl|CDD|111859 pfam03015, Sterile, Male sterility protein.  This family represents
           the C-terminal region of the male sterility protein in a
           number of arabidopsis and drosophila. A sequence-related
           jojoba acyl CoA reductase is also included.
          Length = 94

 Score = 96.2 bits (240), Expect = 4e-25
 Identities = 37/95 (38%), Positives = 60/95 (63%), Gaps = 2/95 (2%)

Query: 163 FLHTIPGFILDSLASLSGRKPMLMAIYKKIDKASDTLEYFAHRQWVWTDHNVQALFNKLD 222
           F HT+P + LD L  L G+KP L+ +Y+KI K  + L+ F+  +W++ + N + L  K+ 
Sbjct: 1   FYHTLPAYFLDLLLRLYGQKPRLVKLYRKIHKGLEVLQPFSLNEWIFDNKNTRELREKMS 60

Query: 223 PEDQELFFFDMGQMKW-EYYAEAMGLGLRVYLVHD 256
            ED++LF FDM  + W EY+  A+  G+R YL+ +
Sbjct: 61  EEDKKLFNFDMESLDWDEYFRNAI-RGIRKYLLKE 94


>gnl|CDD|219687 pfam07993, NAD_binding_4, Male sterility protein.  This family
           represents the C-terminal region of the male sterility
           protein in a number of arabidopsis and drosophila. A
           sequence-related jojoba acyl CoA reductase is also
           included.
          Length = 245

 Score = 90.4 bits (225), Expect = 2e-21
 Identities = 32/83 (38%), Positives = 45/83 (54%), Gaps = 4/83 (4%)

Query: 1   LLHEWPNTYTFTKALAEDVLCKEGKGLPLAVLRPAVIVSTYREPVRGWIDNVY-GPIGML 59
           LL   PN YT +K LAE ++ +   GLP+ + RP++I     E   GWI+    GP G+L
Sbjct: 158 LLGGLPNGYTQSKWLAEQLVREAAGGLPVVIYRPSIITG---ESRTGWINGDDFGPRGLL 214

Query: 60  VGIATGVLHTHLINLNTVTDMVP 82
            G   GVL   L + +   D+VP
Sbjct: 215 GGAGLGVLPDILGDPDARLDLVP 237


>gnl|CDD|215538 PLN02996, PLN02996, fatty acyl-CoA reductase.
          Length = 491

 Score = 87.1 bits (216), Expect = 3e-19
 Identities = 45/143 (31%), Positives = 69/143 (48%), Gaps = 32/143 (22%)

Query: 2   LHEWPNTYTFTKALAEDVLCKEGKGLPLAVLRPAVIVSTYREPVRGWIDNVYGPIGMLVG 61
           LH WPNTY FTKA+ E +L    + LPL ++RP +I STY+EP  GWI+ +     ++VG
Sbjct: 229 LHGWPNTYVFTKAMGEMLLGNFKENLPLVIIRPTMITSTYKEPFPGWIEGLRTIDSVIVG 288

Query: 62  IATGVLHTHLINLNTVTDMVPSTV-------------SDGPSEGVWQKKFPTSVFRPDPI 108
              G L   L + N+V D++P+ +                 SE ++              
Sbjct: 289 YGKGKLTCFLADPNSVLDVIPADMVVNAMIVAMAAHAGGQGSEIIYH------------- 335

Query: 109 PIYNFVGSA-RNPISWGKLDELA 130
                VGS+ +NP+ +  L + A
Sbjct: 336 -----VGSSLKNPVKFSNLHDFA 353


>gnl|CDD|215279 PLN02503, PLN02503, fatty acyl-CoA reductase 2.
          Length = 605

 Score = 71.4 bits (175), Expect = 6e-14
 Identities = 68/257 (26%), Positives = 111/257 (43%), Gaps = 29/257 (11%)

Query: 2   LHEWPNTYTFTKALAEDVLCKEGKGLPLAVLRPAVIVSTYREPVRGWIDN--VYGPIGML 59
           L+ W +TY FTKA+ E V+      +P+ ++RP+VI ST+++P  GW++   +  PI + 
Sbjct: 343 LYGWQDTYVFTKAMGEMVINSMRGDIPVVIIRPSVIESTWKDPFPGWMEGNRMMDPIVLY 402

Query: 60  VGIATGVLHTHLINLNTVTDMVP------STVSDGPSEGVWQKKFPTSVFRPDPIPIYNF 113
            G   G L   L + N V D+VP      +T++     G   K        P+ I +Y  
Sbjct: 403 YG--KGQLTGFLADPNGVLDVVPADMVVNATLAAMAKHGGAAK--------PE-INVYQI 451

Query: 114 VGSARNPISWGKLDELATVCHPIWPTIRAIWYFSFIPASNKYTYYILSFFLHTIPGFILD 173
             S  NP+ +  L  L    +   P + +      +P   K    +  F  H     +L 
Sbjct: 452 ASSVVNPLVFQDLARLLYEHYKSSPYMDSKGRPIHVP-PMKLFSSMEDFSSHLWRDALLR 510

Query: 174 S-LASLSGRKPMLMAIYKKIDKAS-DTLEYFAHRQWVWT-------DHNVQALFNKLDPE 224
           S LA +S     L    + I   S +  +Y A     +T       + N Q L  ++  E
Sbjct: 511 SGLAGMSSSDRKLSQKLENICAKSVEQAKYLASIYEPYTFYGGRFDNSNTQRLMERMSEE 570

Query: 225 DQELFFFDMGQMKWEYY 241
           ++  F FD+G + W  Y
Sbjct: 571 EKAEFGFDVGSIDWRDY 587


>gnl|CDD|187573 cd05263, MupV_like_SDR_e, Pseudomonas fluorescens MupV-like,
           extended (e) SDRs.  This subgroup of extended SDR family
           domains have the characteristic active site tetrad and a
           well-conserved NAD(P)-binding motif. This subgroup is
           not well characterized, its members are annotated as
           having a variety of putative functions. One
           characterized member is Pseudomonas fluorescens MupV a
           protein  involved in the biosynthesis of Mupirocin, a
           polyketide-derived antibiotic. Extended SDRs are
           distinct from classical SDRs. In addition to the
           Rossmann fold (alpha/beta folding pattern with a central
           beta-sheet) core region typical of all SDRs, extended
           SDRs have a less conserved C-terminal extension of
           approximately 100 amino acids. Extended SDRs are a
           diverse collection of proteins, and include isomerases,
           epimerases, oxidoreductases, and lyases; they typically
           have a TGXXGXXG cofactor binding motif. SDRs are a
           functionally diverse family of oxidoreductases that have
           a single domain with a structurally conserved Rossmann
           fold, an NAD(P)(H)-binding region, and a structurally
           diverse C-terminal region. Sequence identity between
           different SDR enzymes is typically in the 15-30% range;
           they catalyze a wide range of activities including the
           metabolism of steroids, cofactors, carbohydrates,
           lipids, aromatic compounds, and amino acids, and act in
           redox sensing. Classical SDRs have an TGXXX[AG]XG
           cofactor binding motif and a YXXXK active site motif,
           with the Tyr residue of the active site motif serving as
           a critical catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase numbering). In
           addition to the Tyr and Lys, there is often an upstream
           Ser and/or an Asn, contributing to the active site;
           while substrate binding is in the C-terminal region,
           which determines specificity. The standard reaction
           mechanism is a 4-pro-S hydride transfer and proton relay
           involving the conserved Tyr and Lys, a water molecule
           stabilized by Asn, and nicotinamide. Atypical SDRs
           generally lack the catalytic residues characteristic of
           the SDRs, and their glycine-rich NAD(P)-binding motif is
           often different from the forms normally seen in
           classical or extended SDRs. Complex (multidomain) SDRs
           such as ketoreductase domains of fatty acid synthase
           have a GGXGXXG NAD(P)-binding motif and an altered
           active site motif (YXXXN). Fungal type ketoacyl
           reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
          Length = 293

 Score = 34.7 bits (80), Expect = 0.038
 Identities = 17/58 (29%), Positives = 29/58 (50%), Gaps = 3/58 (5%)

Query: 6   PNTYTFTKALAEDVLCKEGKGLPLAVLRPAVIVSTYREPVRGWIDNVYGPIGMLVGIA 63
            N Y  +KA AE ++      +PL V RP+++V   +    G I+ + G   +L  +A
Sbjct: 147 KNPYEQSKAEAEQLVRAAATQIPLTVYRPSIVVGDSKT---GRIEKIDGLYELLNLLA 201


>gnl|CDD|220927 pfam10989, DUF2808, Protein of unknown function (DUF2808).  This
           family of proteins with unknown function appears to be
           restricted to Cyanobacteria.
          Length = 142

 Score = 30.0 bits (68), Expect = 0.72
 Identities = 15/44 (34%), Positives = 21/44 (47%), Gaps = 3/44 (6%)

Query: 75  NTVTDMVPSTVSDGPSEGVWQKKFPTSVFRPDPIPIYNFVGSAR 118
            TVT +V   V +  S G +Q  F  +   P   PI  ++GS R
Sbjct: 99  TTVT-VVLKNVRNPRSGGTYQ--FNATAQPPGDNPISRYLGSWR 139


>gnl|CDD|221256 pfam11837, DUF3357, Domain of unknown function (DUF3357).  This
          domain is functionally uncharacterized. This domain is
          found in eukaryotes. This presumed domain is typically
          between 96 to 119 amino acids in length.
          Length = 106

 Score = 28.1 bits (63), Expect = 2.0
 Identities = 11/49 (22%), Positives = 18/49 (36%), Gaps = 5/49 (10%)

Query: 56 IGMLVGIATGVLHTH-----LINLNTVTDMVPSTVSDGPSEGVWQKKFP 99
          +  LV +             L++    +    + VS G SEGV +K   
Sbjct: 42 LLSLVALLILNGQPPDGDVVLVSSPETSPETAAPVSRGVSEGVSEKSNG 90


>gnl|CDD|238952 cd01994, Alpha_ANH_like_IV, This is a subfamily of Adenine
           nucleotide alpha hydrolases superfamily.Adeninosine
           nucleotide alpha hydrolases superfamily  includes N type
           ATP PPases and ATP sulphurylases. It forms a
           apha/beta/apha fold which  binds to Adenosine group.
           This subfamily   of proteins is predicted to  bind ATP.
           This domainhas  a strongly conserved motif SGGKD at the
           N terminus.
          Length = 194

 Score = 29.1 bits (66), Expect = 2.1
 Identities = 18/73 (24%), Positives = 29/73 (39%), Gaps = 16/73 (21%)

Query: 175 LASLSGRKPMLMAIYKKIDKASDTLEYFAHRQWVWTDHNVQALFNKLDPEDQELFFFDMG 234
           +A +SG K    A+Y+ +++                 H V AL N    E   + +  + 
Sbjct: 3   VALISGGKDSCYALYRALEE----------------GHEVVALLNLTPEEGSSMMYHTVN 46

Query: 235 QMKWEYYAEAMGL 247
               E  AEAMG+
Sbjct: 47  HELLELQAEAMGI 59


>gnl|CDD|177223 MTH00168, COX2, cytochrome c oxidase subunit II; Provisional.
          Length = 225

 Score = 28.8 bits (65), Expect = 3.2
 Identities = 26/77 (33%), Positives = 37/77 (48%), Gaps = 14/77 (18%)

Query: 142 AIWYFSFIPASNKYTYYILSF-----FLHT-IPGFILDSLA--SLSGRKPMLMAIYKKID 193
            + Y   +  ++KYT   L       F+ T IP FIL SLA  SL      L+ +  +ID
Sbjct: 37  LVLYSLLVLVTSKYTNRFLLDSQMIEFVWTIIPAFILISLALPSL-----RLLYLMDEID 91

Query: 194 KASDTLEYFAHRQWVWT 210
           K   T++   H QW W+
Sbjct: 92  KPDLTIKAVGH-QWYWS 107


>gnl|CDD|225857 COG3320, COG3320, Putative dehydrogenase domain of multifunctional
           non-ribosomal peptide synthetases and related enzymes
           [Secondary metabolites biosynthesis, transport, and
           catabolism].
          Length = 382

 Score = 28.5 bits (64), Expect = 4.1
 Identities = 13/40 (32%), Positives = 19/40 (47%), Gaps = 3/40 (7%)

Query: 5   WPNTYTFTKALAEDVLCKE--GKGLPLAVLRPAVIVSTYR 42
               Y  +K +AE  L +E   +GLP+ + RP  I    R
Sbjct: 164 LAGGYGRSKWVAE-KLVREAGDRGLPVTIFRPGYITGDSR 202


>gnl|CDD|173241 PRK14780, PRK14780, lipoprotein signal peptidase; Provisional.
          Length = 263

 Score = 28.3 bits (63), Expect = 4.2
 Identities = 17/67 (25%), Positives = 31/67 (46%), Gaps = 2/67 (2%)

Query: 107 PIPIYNFVGSARNPISWGKLDELATVCHPIWPTIRAIWYFSFIPASNKYTYYILSFFLHT 166
             PI++F  +  N I+ G+++  + + + +      + + SFI   N   Y  L F L  
Sbjct: 89  HGPIFHFTLN--NGIALGRINNNSGIVYFLQSIPIILGFLSFIFLKNSLLYIPLLFLLFG 146

Query: 167 IPGFILD 173
             G I+D
Sbjct: 147 GLGNIID 153


>gnl|CDD|220119 pfam09126, NaeI, Restriction endonuclease NaeI.  Members of this
           family adopt a secondary structure consisting of nine
           alpha-helices, six 3-10 helices and 13 beta-strands.
           They bind two GCC-CGG recognition sequences to cleave
           DNA into blunt-ended products.
          Length = 291

 Score = 28.3 bits (63), Expect = 5.2
 Identities = 15/58 (25%), Positives = 21/58 (36%), Gaps = 1/58 (1%)

Query: 174 SLASLSGRKPMLMAIYKKIDKASDTLEYFAHRQWVWTDHNVQA-LFNKLDPEDQELFF 230
               +  R   L     +  K S T E  A   W+W DH +   L   L  + +E  F
Sbjct: 117 YAGLIVVRPDYLTRGQNRDGKRSLTAEGRAAVLWLWRDHPLPENLLLHLPDDVREQIF 174


>gnl|CDD|165182 PHA02839, PHA02839, Il-24-like protein; Provisional.
          Length = 156

 Score = 26.9 bits (59), Expect = 9.0
 Identities = 16/64 (25%), Positives = 31/64 (48%), Gaps = 3/64 (4%)

Query: 162 FFLHTIPGFILDSLASLSGRKPMLMAIYKKIDK---ASDTLEYFAHRQWVWTDHNVQALF 218
           F  +T     L+ L SLS      +AI+ K+ K     + +     ++ +  D+N + LF
Sbjct: 83  FVKYTNTALKLNILKSLSSVANNFLAIFNKVKKRRVKKNNVNVLEIKKLLLIDNNCKKLF 142

Query: 219 NKLD 222
           +++D
Sbjct: 143 SEID 146


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.326    0.142    0.462 

Gapped
Lambda     K      H
   0.267   0.0764    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 16,147,911
Number of extensions: 1578582
Number of successful extensions: 2037
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2031
Number of HSP's successfully gapped: 35
Length of query: 300
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 204
Effective length of database: 6,679,618
Effective search space: 1362642072
Effective search space used: 1362642072
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 59 (26.4 bits)