RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy4705
(164 letters)
>gnl|CDD|237987 cd00020, ARM, Armadillo/beta-catenin-like repeats. An approximately
40 amino acid long tandemly repeated sequence motif
first identified in the Drosophila segment polarity gene
armadillo; these repeats were also found in the
mammalian armadillo homolog beta-catenin, the junctional
plaque protein plakoglobin, the adenomatous polyposis
coli (APC) tumor suppressor protein, and a number of
other proteins. ARM has been implicated in mediating
protein-protein interactions, but no common features
among the target proteins recognized by the ARM repeats
have been identified; related to the HEAT domain; three
consecutive copies of the repeat are represented by this
alignment model.
Length = 120
Score = 29.6 bits (67), Expect = 0.38
Identities = 22/104 (21%), Positives = 38/104 (36%), Gaps = 3/104 (2%)
Query: 31 ESDNTRELLARILNALASQQELRGLMVQQGASKALIHLAHNGTVKGKRQAAQGLARLGIT 90
+ + RE + N A + +V+ G AL+ L + + + A L +
Sbjct: 20 DENVQREAAWALSNLSAGNNDNIQAVVEAGGLPALVQLLKSEDEEVVKAALWALRN--LA 77
Query: 91 LNPEVAFPGERSLEVVRPLLSLLHPEATALETFEGLMALCNLAA 134
PE V L++LL ++ AL NLA+
Sbjct: 78 AGPEDNKLIVLEAGGVPKLVNLLDSSNEDIQK-NATGALSNLAS 120
>gnl|CDD|227623 COG5307, COG5307, SEC7 domain proteins [General function prediction
only].
Length = 1024
Score = 30.5 bits (69), Expect = 0.46
Identities = 10/65 (15%), Positives = 23/65 (35%)
Query: 8 QTMTNMIVSEDVIKIYEANKDKVESDNTRELLARILNALASQQELRGLMVQQGASKALIH 67
++ ++ V E +I +D++ L ++ N L + + L
Sbjct: 572 DSLLSISVLELAARIGRGFEDQLSDVKLFRLRSKSGNKEILSSSLLSNIENKSLKLYLSS 631
Query: 68 LAHNG 72
L +G
Sbjct: 632 LIEDG 636
>gnl|CDD|187624 cd05366, meso-BDH-like_SDR_c, meso-2,3-butanediol
dehydrogenase-like, classical (c) SDRs. 2,3-butanediol
dehydrogenases (BDHs) catalyze the NAD+ dependent
conversion of 2,3-butanediol to acetonin; BDHs are
classified into types according to their
stereospecificity as to substrates and products.
Included in this subgroup are Klebsiella pneumonia
meso-BDH which catalyzes meso-2,3-butanediol to
D(-)-acetonin, and Corynebacterium glutamicum L-BDH
which catalyzes lX+)-2,3-butanediol to L(+)-acetonin.
This subgroup is comprised of classical SDRs with the
characteristic catalytic triad and NAD-binding motif.
SDRs are a functionally diverse family of
oxidoreductases that have a single domain with a
structurally conserved Rossmann fold (alpha/beta folding
pattern with a central beta-sheet), an NAD(P)(H)-binding
region, and a structurally diverse C-terminal region.
Classical SDRs are typically about 250 residues long,
while extended SDRs are approximately 350 residues.
Sequence identity between different SDR enzymes are
typically in the 15-30% range, but the enzymes share the
Rossmann fold NAD-binding motif and characteristic
NAD-binding and catalytic sequence patterns. These
enzymes catalyze a wide range of activities including
the metabolism of steroids, cofactors, carbohydrates,
lipids, aromatic compounds, and amino acids, and act in
redox sensing. Classical SDRs have an TGXXX[AG]XG
cofactor binding motif and a YXXXK active site motif,
with the Tyr residue of the active site motif serving as
a critical catalytic residue (Tyr-151, human
15-hydroxyprostaglandin dehydrogenase (15-PGDH)
numbering). In addition to the Tyr and Lys, there is
often an upstream Ser (Ser-138, 15-PGDH numbering)
and/or an Asn (Asn-107, 15-PGDH numbering) contributing
to the active site; while substrate binding is in the
C-terminal region, which determines specificity. The
standard reaction mechanism is a 4-pro-S hydride
transfer and proton relay involving the conserved Tyr
and Lys, a water molecule stabilized by Asn, and
nicotinamide. Extended SDRs have additional elements in
the C-terminal region, and typically have a TGXXGXXG
cofactor binding motif. Complex (multidomain) SDRs such
as ketoreductase domains of fatty acid synthase have a
GGXGXXG NAD(P)-binding motif and an altered active site
motif (YXXXN). Fungal type ketoacyl reductases have a
TGXXXGX(1-2)G NAD(P)-binding motif. Some atypical SDRs
have lost catalytic activity and/or have an unusual
NAD(P)-binding motif and missing or unusual active site
residues. Reactions catalyzed within the SDR family
include isomerization, decarboxylation, epimerization,
C=N bond reduction, dehydratase activity,
dehalogenation, Enoyl-CoA reduction, and
carbonyl-alcohol oxidoreduction.
Length = 257
Score = 29.7 bits (67), Expect = 0.70
Identities = 20/55 (36%), Positives = 27/55 (49%), Gaps = 12/55 (21%)
Query: 41 RILNALASQQELRGLMVQQ--GASKALIHLAHNGTVKGKRQ-AAQGLARLGITLN 92
+I+NA +S ++G ASK V+G Q AAQ LA GIT+N
Sbjct: 134 KIINA-SSIAGVQGFPNLGAYSASKF--------AVRGLTQTAAQELAPKGITVN 179
>gnl|CDD|215629 PLN03200, PLN03200, cellulose synthase-interactive protein;
Provisional.
Length = 2102
Score = 29.7 bits (67), Expect = 0.73
Identities = 30/119 (25%), Positives = 59/119 (49%), Gaps = 12/119 (10%)
Query: 17 EDVIKIYEANKDKVESDNTRELLARILNALASQQELRGLMVQQGASKALIHLAHNGTVKG 76
+I++ ++K++ + + +LA I +S+Q+L + I L N T
Sbjct: 612 RTLIQLLSSSKEETQ-EKAASVLADIF---SSRQDLCESLATDEIINPCIKLLTNNTEAV 667
Query: 77 KRQAAQGLARL--GITLNPEVAFPGERSLEVVRPLLSLLHPEATALETFEGLM-ALCNL 132
Q+A+ LA L I N +V++ E + ++PL+ L +++++E E + AL NL
Sbjct: 668 ATQSARALAALSRSIKENRKVSYAAE---DAIKPLIKL--AKSSSIEVAEQAVCALANL 721
Score = 27.4 bits (61), Expect = 4.9
Identities = 20/57 (35%), Positives = 28/57 (49%), Gaps = 1/57 (1%)
Query: 32 SDNTRELLARILNALASQQE-LRGLMVQQGASKALIHLAHNGTVKGKRQAAQGLARL 87
S +E A +L L E +R + GA AL+ L NG KG+ AA+ L +L
Sbjct: 501 SQKAKEDSATVLWNLCCHSEDIRACVESAGAVPALLWLLKNGGPKGQEIAAKTLTKL 557
>gnl|CDD|225858 COG3321, COG3321, Polyketide synthase modules and related proteins
[Secondary metabolites biosynthesis, transport, and
catabolism].
Length = 1061
Score = 29.3 bits (66), Expect = 0.91
Identities = 10/39 (25%), Positives = 12/39 (30%)
Query: 34 NTRELLARILNALASQQELRGLMVQQGASKALIHLAHNG 72
N RE L L A A+ + V S G
Sbjct: 495 NDREELEAGLRAFAAGKAKALSGVGADDSGKKTVFVFPG 533
>gnl|CDD|236618 PRK09736, PRK09736, 5-methylcytosine-specific restriction enzyme
subunit McrC; Provisional.
Length = 352
Score = 29.2 bits (66), Expect = 0.94
Identities = 12/35 (34%), Positives = 17/35 (48%), Gaps = 3/35 (8%)
Query: 24 EANKDKV---ESDNTRELLARILNALASQQELRGL 55
E + + E DN +LLA IL ++ RGL
Sbjct: 24 EQDYKDIATEEFDNLPDLLAEILAKGVERRLKRGL 58
>gnl|CDD|79056 PRK12581, PRK12581, oxaloacetate decarboxylase; Provisional.
Length = 468
Score = 28.5 bits (63), Expect = 1.5
Identities = 28/87 (32%), Positives = 39/87 (44%), Gaps = 16/87 (18%)
Query: 78 RQAAQGLARLGITLNPEVAFPGERSLEVVRP------LLSLLHPEATALETFEGLMALCN 131
RQA Q G L+P + FP R+L+ P +LS L +A A E ++A
Sbjct: 280 RQARQKYLADG-ILDPSLLFPDPRTLQYQVPGGMLSNMLSQLK-QANAESKLEEVLA--- 334
Query: 132 LAAIGEKQRQRKKLNFRPFVSCLGGVV 158
E R RK L + P V+ L +V
Sbjct: 335 -----EVPRVRKDLGYPPLVTPLSQMV 356
>gnl|CDD|182406 PRK10358, PRK10358, putative rRNA methylase; Provisional.
Length = 157
Score = 28.0 bits (62), Expect = 1.8
Identities = 13/31 (41%), Positives = 19/31 (61%)
Query: 35 TRELLARILNALASQQELRGLMVQQGASKAL 65
TR L A IL+AL ++Q++R M+ S L
Sbjct: 103 TRGLPASILDALPAEQKIRIPMMPDSRSMNL 133
>gnl|CDD|223199 COG0121, COG0121, Predicted glutamine amidotransferase [General
function prediction only].
Length = 252
Score = 28.1 bits (63), Expect = 1.8
Identities = 15/78 (19%), Positives = 22/78 (28%), Gaps = 18/78 (23%)
Query: 68 LAHNGTVKGKRQAAQG-----------LARLGITLNPEVAFPGERSLEVVRPLLSLLHPE 116
AHNG + + A GIT + + + L +L E
Sbjct: 104 FAHNGQLDKFKLLEGRKLEPVGYTDSEAAFCGITDSERIFLLILKP-----GLDALFQAE 158
Query: 117 ATALETFEGLMALCNLAA 134
AL F L +
Sbjct: 159 IRALGLF--NTLLSDGNY 174
>gnl|CDD|164623 MTH00052, COX3, cytochrome c oxidase subunit III; Provisional.
Length = 262
Score = 27.1 bits (60), Expect = 4.7
Identities = 17/45 (37%), Positives = 27/45 (60%), Gaps = 4/45 (8%)
Query: 61 ASKALIHLAHNGTVKGKRQAA-QGLARLGITLNPEVAFPGERSLE 104
+S A + AH+G + GKR+ A GLA L + L + F G +++E
Sbjct: 140 SSGATVTWAHHGIISGKRKEAIIGLA-LTVALG--LLFTGLQAME 181
>gnl|CDD|182923 PRK11041, PRK11041, DNA-binding transcriptional regulator CytR;
Provisional.
Length = 309
Score = 26.9 bits (60), Expect = 4.8
Identities = 22/71 (30%), Positives = 34/71 (47%), Gaps = 12/71 (16%)
Query: 82 QGLARLGITLNPEVAFPGERSLE----VVRPLLSLLHPEATALETFEGLMALCNLAAIGE 137
Q L R GIT++P+ G+ + E ++ LL L P TA+ +MAL L+
Sbjct: 176 QALRRCGITVDPQYIARGDFTFEAGAKALKQLLDLPQP-PTAVFCHSDVMALGALS---- 230
Query: 138 KQRQRKKLNFR 148
Q K++ R
Sbjct: 231 ---QAKRMGLR 238
>gnl|CDD|111382 pfam02477, Nairo_nucleo, Nucleocapsid N protein. The nucleoprotein
of the ssRNA negative-strand Nairovirus is an internal
part of the virus particle.
Length = 442
Score = 27.2 bits (60), Expect = 5.0
Identities = 12/36 (33%), Positives = 16/36 (44%)
Query: 17 EDVIKIYEANKDKVESDNTRELLARILNALASQQEL 52
D+ E NKD V+ EL+ + LA EL
Sbjct: 256 LDLKGWVEDNKDAVDDGKANELVQTMTKHLAKAVEL 291
>gnl|CDD|184255 PRK13699, PRK13699, putative methylase; Provisional.
Length = 227
Score = 26.7 bits (59), Expect = 6.2
Identities = 13/38 (34%), Positives = 20/38 (52%), Gaps = 1/38 (2%)
Query: 98 PGERSLEVVRPLL-SLLHPEATALETFEGLMALCNLAA 134
P E+ + ++PL+ S HP A L+ F G + C A
Sbjct: 145 PTEKPVTSLQPLIESFTHPNAIVLDPFAGSGSTCVAAL 182
>gnl|CDD|201276 pfam00514, Arm, Armadillo/beta-catenin-like repeat. Approx. 40
amino acid repeat. Tandem repeats form super-helix of
helices that is proposed to mediate interaction of
beta-catenin with its ligands. CAUTION: This family
does not contain all known armadillo repeats.
Length = 41
Score = 24.3 bits (54), Expect = 7.1
Identities = 9/38 (23%), Positives = 19/38 (50%)
Query: 50 QELRGLMVQQGASKALIHLAHNGTVKGKRQAAQGLARL 87
E + +++ GA L+ L + + + +AA L+ L
Sbjct: 2 PENKQAVIEAGAVPPLVQLLSSPDEEVQEEAAWALSNL 39
>gnl|CDD|238448 cd00883, beta_CA_cladeA, Carbonic anhydrases (CA) are
zinc-containing enzymes that catalyze the reversible
hydration of carbon dioxide in a two-step mechanism in
which the nucleophilic attack of a zinc-bound hydroxide
ion on carbon dioxide is followed by the regeneration of
an active site by ionization of the zinc-bound water
molecule and removal of a proton from the active site.
CAs are ubiquitous enzymes involved in fundamental
processes like photosynthesis, respiration, pH
homeostasis and ion transport. There are three
evolutionarily distinct families of CAs (the alpha-,
beta-, and gamma-CAs) which show no significant sequence
identity or structural similarity. Within the beta-CA
family there are four evolutionarily distinct clades (A
through D). The beta-CAs are multimeric enzymes (forming
dimers,tetramers,hexamers and octamers) which are
present in higher plants, algae, fungi, archaea and
prokaryotes.
Length = 182
Score = 26.4 bits (59), Expect = 7.2
Identities = 11/50 (22%), Positives = 21/50 (42%)
Query: 35 TRELLARILNALASQQELRGLMVQQGASKALIHLAHNGTVKGKRQAAQGL 84
L A L+AL ++E +V+ + + +L V+ + Q L
Sbjct: 116 VYRLHAAELDALEDEEERVDRLVELNVVEQVKNLCKTPIVQDAWKRGQEL 165
>gnl|CDD|237755 PRK14555, PRK14555, hypothetical protein; Provisional.
Length = 145
Score = 26.0 bits (58), Expect = 8.0
Identities = 12/40 (30%), Positives = 19/40 (47%), Gaps = 4/40 (10%)
Query: 17 EDVIKIYEANKDKVESDNTRELLARILNALASQQELRGLM 56
+ I I A ++E RE+L +L L + EL L+
Sbjct: 47 GNPITILSA---RLEKKEAREVLKHLLELL-DETELEKLL 82
>gnl|CDD|214547 smart00185, ARM, Armadillo/beta-catenin-like repeats. Approx. 40
amino acid repeat. Tandem repeats form superhelix of
helices that is proposed to mediate interaction of
beta-catenin with its ligands. Involved in transducing
the Wingless/Wnt signal. In plakoglobin arm repeats
bind alpha-catenin and N-cadherin.
Length = 41
Score = 23.9 bits (53), Expect = 8.8
Identities = 10/38 (26%), Positives = 18/38 (47%)
Query: 50 QELRGLMVQQGASKALIHLAHNGTVKGKRQAAQGLARL 87
E + +V G AL+ L + + ++AA L+ L
Sbjct: 2 DENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNL 39
>gnl|CDD|204266 pfam09537, DUF2383, Domain of unknown function (DUF2383).
Members of this protein family are found mostly in the
Proteobacteria, although one member is found in the the
marine planctomycete Pirellula sp. strain 1. The
function is unknown.
Length = 87
Score = 24.8 bits (55), Expect = 9.5
Identities = 12/64 (18%), Positives = 20/64 (31%), Gaps = 6/64 (9%)
Query: 15 VSEDVIKIYEANKDKVESDNTRELLARILNALASQ-QELRGLMVQQGASKALIHLAHNGT 73
D + YE +KV+ + L R EL+ + G G+
Sbjct: 12 TLYDGEEGYEKAAEKVKDPELKALFQRRAAERRQAAAELQAEIRALGGE-----PEDGGS 66
Query: 74 VKGK 77
+ G
Sbjct: 67 LAGT 70
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.133 0.368
Gapped
Lambda K H
0.267 0.0877 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 8,177,256
Number of extensions: 754947
Number of successful extensions: 867
Number of sequences better than 10.0: 1
Number of HSP's gapped: 867
Number of HSP's successfully gapped: 48
Length of query: 164
Length of database: 10,937,602
Length adjustment: 90
Effective length of query: 74
Effective length of database: 6,945,742
Effective search space: 513984908
Effective search space used: 513984908
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 55 (24.7 bits)