RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy3027
(300 letters)
>gnl|CDD|187547 cd05236, FAR-N_SDR_e, fatty acyl CoA reductases (FARs), extended
(e) SDRs. SDRs are Rossmann-fold NAD(P)H-binding
proteins, many of which may function as fatty acyl CoA
reductases (FAR), acting on medium and long chain fatty
acids, and have been reported to be involved in diverse
processes such as biosynthesis of insect pheromones,
plant cuticular wax production, and mammalian wax
biosynthesis. In Arabidopsis thaliana, proteins with
this particular architecture have also been identified
as the MALE STERILITY 2 (MS2) gene product, which is
implicated in male gametogenesis. Mutations in MS2
inhibit the synthesis of exine (sporopollenin),
rendering plants unable to reduce pollen wall fatty
acids to corresponding alcohols. This N-terminal domain
shares the catalytic triad (but not the upstream Asn)
and characteristic NADP-binding motif of the extended
SDR family. Extended SDRs are distinct from classical
SDRs. In addition to the Rossmann fold (alpha/beta
folding pattern with a central beta-sheet) core region
typical of all SDRs, extended SDRs have a less conserved
C-terminal extension of approximately 100 amino acids.
Extended SDRs are a diverse collection of proteins, and
include isomerases, epimerases, oxidoreductases, and
lyases; they typically have a TGXXGXXG cofactor binding
motif. SDRs are a functionally diverse family of
oxidoreductases that have a single domain with a
structurally conserved Rossmann fold, an
NAD(P)(H)-binding region, and a structurally diverse
C-terminal region. Sequence identity between different
SDR enzymes is typically in the 15-30% range; they
catalyze a wide range of activities including the
metabolism of steroids, cofactors, carbohydrates,
lipids, aromatic compounds, and amino acids, and act in
redox sensing. Classical SDRs have an TGXXX[AG]XG
cofactor binding motif and a YXXXK active site motif,
with the Tyr residue of the active site motif serving as
a critical catalytic residue (Tyr-151, human
15-hydroxyprostaglandin dehydrogenase numbering). In
addition to the Tyr and Lys, there is often an upstream
Ser and/or an Asn, contributing to the active site;
while substrate binding is in the C-terminal region,
which determines specificity. The standard reaction
mechanism is a 4-pro-S hydride transfer and proton relay
involving the conserved Tyr and Lys, a water molecule
stabilized by Asn, and nicotinamide. Atypical SDRs
generally lack the catalytic residues characteristic of
the SDRs, and their glycine-rich NAD(P)-binding motif is
often different from the forms normally seen in
classical or extended SDRs. Complex (multidomain) SDRs
such as ketoreductase domains of fatty acid synthase
have a GGXGXXG NAD(P)-binding motif and an altered
active site motif (YXXXN). Fungal type ketoacyl
reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
Length = 320
Score = 126 bits (319), Expect = 3e-34
Identities = 50/133 (37%), Positives = 75/133 (56%), Gaps = 3/133 (2%)
Query: 1 LLHEWPNTYTFTKALAEDVLCKEGKGLPLAVLRPAVIVSTYREPVRGWIDNVYGPIGMLV 60
LL PNTYTFTKALAE ++ KE LPL ++RP+++ +T +EP GWIDN GP G+ +
Sbjct: 186 LLGGHPNTYTFTKALAERLVLKERGNLPLVIVRPSIVGATLKEPFPGWIDNFNGPDGLFL 245
Query: 61 GIATGVLHTHLINLNTVTDMVPSTVSDGPSEGVWQKKFPTSVFRPDPIPIYNFVGSARNP 120
G+L T + N V D++P D + + + V +P + +Y+ S NP
Sbjct: 246 AYGKGILRTMNADPNAVADIIP---VDVVANALLAAAAYSGVRKPRELEVYHCGSSDVNP 302
Query: 121 ISWGKLDELATVC 133
+WG+ +EL
Sbjct: 303 FTWGEAEELINQY 315
>gnl|CDD|176924 cd09071, FAR_C, C-terminal domain of fatty acyl CoA reductases.
C-terminal domain of fatty acyl CoA reductases, a family
of SDR-like proteins. SDRs or short-chain
dehydrogenases/reductases are Rossmann-fold
NAD(P)H-binding proteins. Many proteins in this FAR_C
family may function as fatty acyl-CoA reductases (FARs),
acting on medium and long chain fatty acids, and have
been reported to be involved in diverse processes such
as the biosynthesis of insect pheromones, plant
cuticular wax production, and mammalian wax
biosynthesis. In Arabidopsis thaliana, proteins with
this particular architecture have also been identified
as the MALE STERILITY 2 (MS2) gene product, which is
implicated in male gametogenesis. Mutations in MS2
inhibit the synthesis of exine (sporopollenin),
rendering plants unable to reduce pollen wall fatty
acids to corresponding alcohols. The function of this
C-terminal domain is unclear.
Length = 92
Score = 116 bits (293), Expect = 5e-33
Identities = 42/91 (46%), Positives = 58/91 (63%)
Query: 163 FLHTIPGFILDSLASLSGRKPMLMAIYKKIDKASDTLEYFAHRQWVWTDHNVQALFNKLD 222
FLH +P ++LD L L GRKP L+ +Y+KI K D LEYF +W + + N +AL+ +L
Sbjct: 1 FLHLLPAYLLDLLLRLLGRKPRLLKLYRKIHKLLDLLEYFTTNEWRFDNDNTRALWERLS 60
Query: 223 PEDQELFFFDMGQMKWEYYAEAMGLGLRVYL 253
ED+ELF FD+ + W+ Y E GLR YL
Sbjct: 61 EEDRELFNFDIRSIDWDDYFENYIPGLRKYL 91
>gnl|CDD|111859 pfam03015, Sterile, Male sterility protein. This family represents
the C-terminal region of the male sterility protein in a
number of arabidopsis and drosophila. A sequence-related
jojoba acyl CoA reductase is also included.
Length = 94
Score = 96.2 bits (240), Expect = 4e-25
Identities = 37/95 (38%), Positives = 60/95 (63%), Gaps = 2/95 (2%)
Query: 163 FLHTIPGFILDSLASLSGRKPMLMAIYKKIDKASDTLEYFAHRQWVWTDHNVQALFNKLD 222
F HT+P + LD L L G+KP L+ +Y+KI K + L+ F+ +W++ + N + L K+
Sbjct: 1 FYHTLPAYFLDLLLRLYGQKPRLVKLYRKIHKGLEVLQPFSLNEWIFDNKNTRELREKMS 60
Query: 223 PEDQELFFFDMGQMKW-EYYAEAMGLGLRVYLVHD 256
ED++LF FDM + W EY+ A+ G+R YL+ +
Sbjct: 61 EEDKKLFNFDMESLDWDEYFRNAI-RGIRKYLLKE 94
>gnl|CDD|219687 pfam07993, NAD_binding_4, Male sterility protein. This family
represents the C-terminal region of the male sterility
protein in a number of arabidopsis and drosophila. A
sequence-related jojoba acyl CoA reductase is also
included.
Length = 245
Score = 90.4 bits (225), Expect = 2e-21
Identities = 32/83 (38%), Positives = 45/83 (54%), Gaps = 4/83 (4%)
Query: 1 LLHEWPNTYTFTKALAEDVLCKEGKGLPLAVLRPAVIVSTYREPVRGWIDNVY-GPIGML 59
LL PN YT +K LAE ++ + GLP+ + RP++I E GWI+ GP G+L
Sbjct: 158 LLGGLPNGYTQSKWLAEQLVREAAGGLPVVIYRPSIITG---ESRTGWINGDDFGPRGLL 214
Query: 60 VGIATGVLHTHLINLNTVTDMVP 82
G GVL L + + D+VP
Sbjct: 215 GGAGLGVLPDILGDPDARLDLVP 237
>gnl|CDD|215538 PLN02996, PLN02996, fatty acyl-CoA reductase.
Length = 491
Score = 87.1 bits (216), Expect = 3e-19
Identities = 45/143 (31%), Positives = 69/143 (48%), Gaps = 32/143 (22%)
Query: 2 LHEWPNTYTFTKALAEDVLCKEGKGLPLAVLRPAVIVSTYREPVRGWIDNVYGPIGMLVG 61
LH WPNTY FTKA+ E +L + LPL ++RP +I STY+EP GWI+ + ++VG
Sbjct: 229 LHGWPNTYVFTKAMGEMLLGNFKENLPLVIIRPTMITSTYKEPFPGWIEGLRTIDSVIVG 288
Query: 62 IATGVLHTHLINLNTVTDMVPSTV-------------SDGPSEGVWQKKFPTSVFRPDPI 108
G L L + N+V D++P+ + SE ++
Sbjct: 289 YGKGKLTCFLADPNSVLDVIPADMVVNAMIVAMAAHAGGQGSEIIYH------------- 335
Query: 109 PIYNFVGSA-RNPISWGKLDELA 130
VGS+ +NP+ + L + A
Sbjct: 336 -----VGSSLKNPVKFSNLHDFA 353
>gnl|CDD|215279 PLN02503, PLN02503, fatty acyl-CoA reductase 2.
Length = 605
Score = 71.4 bits (175), Expect = 6e-14
Identities = 68/257 (26%), Positives = 111/257 (43%), Gaps = 29/257 (11%)
Query: 2 LHEWPNTYTFTKALAEDVLCKEGKGLPLAVLRPAVIVSTYREPVRGWIDN--VYGPIGML 59
L+ W +TY FTKA+ E V+ +P+ ++RP+VI ST+++P GW++ + PI +
Sbjct: 343 LYGWQDTYVFTKAMGEMVINSMRGDIPVVIIRPSVIESTWKDPFPGWMEGNRMMDPIVLY 402
Query: 60 VGIATGVLHTHLINLNTVTDMVP------STVSDGPSEGVWQKKFPTSVFRPDPIPIYNF 113
G G L L + N V D+VP +T++ G K P+ I +Y
Sbjct: 403 YG--KGQLTGFLADPNGVLDVVPADMVVNATLAAMAKHGGAAK--------PE-INVYQI 451
Query: 114 VGSARNPISWGKLDELATVCHPIWPTIRAIWYFSFIPASNKYTYYILSFFLHTIPGFILD 173
S NP+ + L L + P + + +P K + F H +L
Sbjct: 452 ASSVVNPLVFQDLARLLYEHYKSSPYMDSKGRPIHVP-PMKLFSSMEDFSSHLWRDALLR 510
Query: 174 S-LASLSGRKPMLMAIYKKIDKAS-DTLEYFAHRQWVWT-------DHNVQALFNKLDPE 224
S LA +S L + I S + +Y A +T + N Q L ++ E
Sbjct: 511 SGLAGMSSSDRKLSQKLENICAKSVEQAKYLASIYEPYTFYGGRFDNSNTQRLMERMSEE 570
Query: 225 DQELFFFDMGQMKWEYY 241
++ F FD+G + W Y
Sbjct: 571 EKAEFGFDVGSIDWRDY 587
>gnl|CDD|187573 cd05263, MupV_like_SDR_e, Pseudomonas fluorescens MupV-like,
extended (e) SDRs. This subgroup of extended SDR family
domains have the characteristic active site tetrad and a
well-conserved NAD(P)-binding motif. This subgroup is
not well characterized, its members are annotated as
having a variety of putative functions. One
characterized member is Pseudomonas fluorescens MupV a
protein involved in the biosynthesis of Mupirocin, a
polyketide-derived antibiotic. Extended SDRs are
distinct from classical SDRs. In addition to the
Rossmann fold (alpha/beta folding pattern with a central
beta-sheet) core region typical of all SDRs, extended
SDRs have a less conserved C-terminal extension of
approximately 100 amino acids. Extended SDRs are a
diverse collection of proteins, and include isomerases,
epimerases, oxidoreductases, and lyases; they typically
have a TGXXGXXG cofactor binding motif. SDRs are a
functionally diverse family of oxidoreductases that have
a single domain with a structurally conserved Rossmann
fold, an NAD(P)(H)-binding region, and a structurally
diverse C-terminal region. Sequence identity between
different SDR enzymes is typically in the 15-30% range;
they catalyze a wide range of activities including the
metabolism of steroids, cofactors, carbohydrates,
lipids, aromatic compounds, and amino acids, and act in
redox sensing. Classical SDRs have an TGXXX[AG]XG
cofactor binding motif and a YXXXK active site motif,
with the Tyr residue of the active site motif serving as
a critical catalytic residue (Tyr-151, human
15-hydroxyprostaglandin dehydrogenase numbering). In
addition to the Tyr and Lys, there is often an upstream
Ser and/or an Asn, contributing to the active site;
while substrate binding is in the C-terminal region,
which determines specificity. The standard reaction
mechanism is a 4-pro-S hydride transfer and proton relay
involving the conserved Tyr and Lys, a water molecule
stabilized by Asn, and nicotinamide. Atypical SDRs
generally lack the catalytic residues characteristic of
the SDRs, and their glycine-rich NAD(P)-binding motif is
often different from the forms normally seen in
classical or extended SDRs. Complex (multidomain) SDRs
such as ketoreductase domains of fatty acid synthase
have a GGXGXXG NAD(P)-binding motif and an altered
active site motif (YXXXN). Fungal type ketoacyl
reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
Length = 293
Score = 34.7 bits (80), Expect = 0.038
Identities = 17/58 (29%), Positives = 29/58 (50%), Gaps = 3/58 (5%)
Query: 6 PNTYTFTKALAEDVLCKEGKGLPLAVLRPAVIVSTYREPVRGWIDNVYGPIGMLVGIA 63
N Y +KA AE ++ +PL V RP+++V + G I+ + G +L +A
Sbjct: 147 KNPYEQSKAEAEQLVRAAATQIPLTVYRPSIVVGDSKT---GRIEKIDGLYELLNLLA 201
>gnl|CDD|220927 pfam10989, DUF2808, Protein of unknown function (DUF2808). This
family of proteins with unknown function appears to be
restricted to Cyanobacteria.
Length = 142
Score = 30.0 bits (68), Expect = 0.72
Identities = 15/44 (34%), Positives = 21/44 (47%), Gaps = 3/44 (6%)
Query: 75 NTVTDMVPSTVSDGPSEGVWQKKFPTSVFRPDPIPIYNFVGSAR 118
TVT +V V + S G +Q F + P PI ++GS R
Sbjct: 99 TTVT-VVLKNVRNPRSGGTYQ--FNATAQPPGDNPISRYLGSWR 139
>gnl|CDD|221256 pfam11837, DUF3357, Domain of unknown function (DUF3357). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 96 to 119 amino acids in length.
Length = 106
Score = 28.1 bits (63), Expect = 2.0
Identities = 11/49 (22%), Positives = 18/49 (36%), Gaps = 5/49 (10%)
Query: 56 IGMLVGIATGVLHTH-----LINLNTVTDMVPSTVSDGPSEGVWQKKFP 99
+ LV + L++ + + VS G SEGV +K
Sbjct: 42 LLSLVALLILNGQPPDGDVVLVSSPETSPETAAPVSRGVSEGVSEKSNG 90
>gnl|CDD|238952 cd01994, Alpha_ANH_like_IV, This is a subfamily of Adenine
nucleotide alpha hydrolases superfamily.Adeninosine
nucleotide alpha hydrolases superfamily includes N type
ATP PPases and ATP sulphurylases. It forms a
apha/beta/apha fold which binds to Adenosine group.
This subfamily of proteins is predicted to bind ATP.
This domainhas a strongly conserved motif SGGKD at the
N terminus.
Length = 194
Score = 29.1 bits (66), Expect = 2.1
Identities = 18/73 (24%), Positives = 29/73 (39%), Gaps = 16/73 (21%)
Query: 175 LASLSGRKPMLMAIYKKIDKASDTLEYFAHRQWVWTDHNVQALFNKLDPEDQELFFFDMG 234
+A +SG K A+Y+ +++ H V AL N E + + +
Sbjct: 3 VALISGGKDSCYALYRALEE----------------GHEVVALLNLTPEEGSSMMYHTVN 46
Query: 235 QMKWEYYAEAMGL 247
E AEAMG+
Sbjct: 47 HELLELQAEAMGI 59
>gnl|CDD|177223 MTH00168, COX2, cytochrome c oxidase subunit II; Provisional.
Length = 225
Score = 28.8 bits (65), Expect = 3.2
Identities = 26/77 (33%), Positives = 37/77 (48%), Gaps = 14/77 (18%)
Query: 142 AIWYFSFIPASNKYTYYILSF-----FLHT-IPGFILDSLA--SLSGRKPMLMAIYKKID 193
+ Y + ++KYT L F+ T IP FIL SLA SL L+ + +ID
Sbjct: 37 LVLYSLLVLVTSKYTNRFLLDSQMIEFVWTIIPAFILISLALPSL-----RLLYLMDEID 91
Query: 194 KASDTLEYFAHRQWVWT 210
K T++ H QW W+
Sbjct: 92 KPDLTIKAVGH-QWYWS 107
>gnl|CDD|225857 COG3320, COG3320, Putative dehydrogenase domain of multifunctional
non-ribosomal peptide synthetases and related enzymes
[Secondary metabolites biosynthesis, transport, and
catabolism].
Length = 382
Score = 28.5 bits (64), Expect = 4.1
Identities = 13/40 (32%), Positives = 19/40 (47%), Gaps = 3/40 (7%)
Query: 5 WPNTYTFTKALAEDVLCKE--GKGLPLAVLRPAVIVSTYR 42
Y +K +AE L +E +GLP+ + RP I R
Sbjct: 164 LAGGYGRSKWVAE-KLVREAGDRGLPVTIFRPGYITGDSR 202
>gnl|CDD|173241 PRK14780, PRK14780, lipoprotein signal peptidase; Provisional.
Length = 263
Score = 28.3 bits (63), Expect = 4.2
Identities = 17/67 (25%), Positives = 31/67 (46%), Gaps = 2/67 (2%)
Query: 107 PIPIYNFVGSARNPISWGKLDELATVCHPIWPTIRAIWYFSFIPASNKYTYYILSFFLHT 166
PI++F + N I+ G+++ + + + + + + SFI N Y L F L
Sbjct: 89 HGPIFHFTLN--NGIALGRINNNSGIVYFLQSIPIILGFLSFIFLKNSLLYIPLLFLLFG 146
Query: 167 IPGFILD 173
G I+D
Sbjct: 147 GLGNIID 153
>gnl|CDD|220119 pfam09126, NaeI, Restriction endonuclease NaeI. Members of this
family adopt a secondary structure consisting of nine
alpha-helices, six 3-10 helices and 13 beta-strands.
They bind two GCC-CGG recognition sequences to cleave
DNA into blunt-ended products.
Length = 291
Score = 28.3 bits (63), Expect = 5.2
Identities = 15/58 (25%), Positives = 21/58 (36%), Gaps = 1/58 (1%)
Query: 174 SLASLSGRKPMLMAIYKKIDKASDTLEYFAHRQWVWTDHNVQA-LFNKLDPEDQELFF 230
+ R L + K S T E A W+W DH + L L + +E F
Sbjct: 117 YAGLIVVRPDYLTRGQNRDGKRSLTAEGRAAVLWLWRDHPLPENLLLHLPDDVREQIF 174
>gnl|CDD|165182 PHA02839, PHA02839, Il-24-like protein; Provisional.
Length = 156
Score = 26.9 bits (59), Expect = 9.0
Identities = 16/64 (25%), Positives = 31/64 (48%), Gaps = 3/64 (4%)
Query: 162 FFLHTIPGFILDSLASLSGRKPMLMAIYKKIDK---ASDTLEYFAHRQWVWTDHNVQALF 218
F +T L+ L SLS +AI+ K+ K + + ++ + D+N + LF
Sbjct: 83 FVKYTNTALKLNILKSLSSVANNFLAIFNKVKKRRVKKNNVNVLEIKKLLLIDNNCKKLF 142
Query: 219 NKLD 222
+++D
Sbjct: 143 SEID 146
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.326 0.142 0.462
Gapped
Lambda K H
0.267 0.0764 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 16,147,911
Number of extensions: 1578582
Number of successful extensions: 2037
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2031
Number of HSP's successfully gapped: 35
Length of query: 300
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 204
Effective length of database: 6,679,618
Effective search space: 1362642072
Effective search space used: 1362642072
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 59 (26.4 bits)