RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy4705
         (164 letters)



>gnl|CDD|237987 cd00020, ARM, Armadillo/beta-catenin-like repeats. An approximately
           40 amino acid long tandemly repeated sequence motif
           first identified in the Drosophila segment polarity gene
           armadillo; these repeats were also found in the
           mammalian armadillo homolog beta-catenin, the junctional
           plaque protein plakoglobin, the adenomatous polyposis
           coli (APC) tumor suppressor protein, and a number of
           other proteins. ARM has been implicated in mediating
           protein-protein interactions, but no common features
           among the target proteins recognized by the ARM repeats
           have been identified; related to the HEAT domain; three
           consecutive copies of the repeat are represented by this
           alignment model.
          Length = 120

 Score = 29.6 bits (67), Expect = 0.38
 Identities = 22/104 (21%), Positives = 38/104 (36%), Gaps = 3/104 (2%)

Query: 31  ESDNTRELLARILNALASQQELRGLMVQQGASKALIHLAHNGTVKGKRQAAQGLARLGIT 90
           + +  RE    + N  A   +    +V+ G   AL+ L  +   +  + A   L    + 
Sbjct: 20  DENVQREAAWALSNLSAGNNDNIQAVVEAGGLPALVQLLKSEDEEVVKAALWALRN--LA 77

Query: 91  LNPEVAFPGERSLEVVRPLLSLLHPEATALETFEGLMALCNLAA 134
             PE           V  L++LL      ++      AL NLA+
Sbjct: 78  AGPEDNKLIVLEAGGVPKLVNLLDSSNEDIQK-NATGALSNLAS 120


>gnl|CDD|227623 COG5307, COG5307, SEC7 domain proteins [General function prediction
           only].
          Length = 1024

 Score = 30.5 bits (69), Expect = 0.46
 Identities = 10/65 (15%), Positives = 23/65 (35%)

Query: 8   QTMTNMIVSEDVIKIYEANKDKVESDNTRELLARILNALASQQELRGLMVQQGASKALIH 67
            ++ ++ V E   +I    +D++       L ++  N       L   +  +     L  
Sbjct: 572 DSLLSISVLELAARIGRGFEDQLSDVKLFRLRSKSGNKEILSSSLLSNIENKSLKLYLSS 631

Query: 68  LAHNG 72
           L  +G
Sbjct: 632 LIEDG 636


>gnl|CDD|187624 cd05366, meso-BDH-like_SDR_c, meso-2,3-butanediol
           dehydrogenase-like, classical (c) SDRs.  2,3-butanediol
           dehydrogenases (BDHs) catalyze the NAD+ dependent
           conversion of 2,3-butanediol to acetonin; BDHs are
           classified into types according to their
           stereospecificity as to substrates and products.
           Included in this subgroup are Klebsiella pneumonia
           meso-BDH which catalyzes meso-2,3-butanediol to
           D(-)-acetonin, and Corynebacterium glutamicum L-BDH
           which catalyzes lX+)-2,3-butanediol to L(+)-acetonin.
           This subgroup is comprised of classical SDRs with the
           characteristic catalytic triad and NAD-binding motif.
           SDRs are a functionally diverse family of
           oxidoreductases that have a single domain with a
           structurally conserved Rossmann fold (alpha/beta folding
           pattern with a central beta-sheet), an NAD(P)(H)-binding
           region, and a structurally diverse C-terminal region.
           Classical SDRs are typically about 250 residues long,
           while extended SDRs are approximately 350 residues.
           Sequence identity between different SDR enzymes are
           typically in the 15-30% range, but the enzymes share the
           Rossmann fold NAD-binding motif and characteristic
           NAD-binding and catalytic sequence patterns. These
           enzymes catalyze a wide range of activities including
           the metabolism of steroids, cofactors, carbohydrates,
           lipids, aromatic compounds, and amino acids, and act in
           redox sensing. Classical SDRs have an TGXXX[AG]XG
           cofactor binding motif and a YXXXK active site motif,
           with the Tyr residue of the active site motif serving as
           a critical catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase (15-PGDH)
           numbering). In addition to the Tyr and Lys, there is
           often an upstream Ser (Ser-138, 15-PGDH numbering)
           and/or an Asn (Asn-107, 15-PGDH numbering) contributing
           to the active site; while substrate binding is in the
           C-terminal region, which determines specificity. The
           standard reaction mechanism is a 4-pro-S hydride
           transfer and proton relay involving the conserved Tyr
           and Lys, a water molecule stabilized by Asn, and
           nicotinamide. Extended SDRs have additional elements in
           the C-terminal region, and typically have a TGXXGXXG
           cofactor binding motif. Complex (multidomain) SDRs such
           as ketoreductase domains of fatty acid synthase have a
           GGXGXXG NAD(P)-binding motif and an altered active site
           motif (YXXXN). Fungal type ketoacyl reductases have a
           TGXXXGX(1-2)G NAD(P)-binding motif. Some atypical SDRs
           have lost catalytic activity and/or have an unusual
           NAD(P)-binding motif and missing or unusual active site
           residues. Reactions catalyzed within the SDR family
           include isomerization, decarboxylation, epimerization,
           C=N bond reduction, dehydratase activity,
           dehalogenation, Enoyl-CoA reduction, and
           carbonyl-alcohol oxidoreduction.
          Length = 257

 Score = 29.7 bits (67), Expect = 0.70
 Identities = 20/55 (36%), Positives = 27/55 (49%), Gaps = 12/55 (21%)

Query: 41  RILNALASQQELRGLMVQQ--GASKALIHLAHNGTVKGKRQ-AAQGLARLGITLN 92
           +I+NA +S   ++G        ASK          V+G  Q AAQ LA  GIT+N
Sbjct: 134 KIINA-SSIAGVQGFPNLGAYSASKF--------AVRGLTQTAAQELAPKGITVN 179


>gnl|CDD|215629 PLN03200, PLN03200, cellulose synthase-interactive protein;
           Provisional.
          Length = 2102

 Score = 29.7 bits (67), Expect = 0.73
 Identities = 30/119 (25%), Positives = 59/119 (49%), Gaps = 12/119 (10%)

Query: 17  EDVIKIYEANKDKVESDNTRELLARILNALASQQELRGLMVQQGASKALIHLAHNGTVKG 76
             +I++  ++K++ + +    +LA I    +S+Q+L   +         I L  N T   
Sbjct: 612 RTLIQLLSSSKEETQ-EKAASVLADIF---SSRQDLCESLATDEIINPCIKLLTNNTEAV 667

Query: 77  KRQAAQGLARL--GITLNPEVAFPGERSLEVVRPLLSLLHPEATALETFEGLM-ALCNL 132
             Q+A+ LA L   I  N +V++  E   + ++PL+ L   +++++E  E  + AL NL
Sbjct: 668 ATQSARALAALSRSIKENRKVSYAAE---DAIKPLIKL--AKSSSIEVAEQAVCALANL 721



 Score = 27.4 bits (61), Expect = 4.9
 Identities = 20/57 (35%), Positives = 28/57 (49%), Gaps = 1/57 (1%)

Query: 32  SDNTRELLARILNALASQQE-LRGLMVQQGASKALIHLAHNGTVKGKRQAAQGLARL 87
           S   +E  A +L  L    E +R  +   GA  AL+ L  NG  KG+  AA+ L +L
Sbjct: 501 SQKAKEDSATVLWNLCCHSEDIRACVESAGAVPALLWLLKNGGPKGQEIAAKTLTKL 557


>gnl|CDD|225858 COG3321, COG3321, Polyketide synthase modules and related proteins
           [Secondary metabolites biosynthesis, transport, and
           catabolism].
          Length = 1061

 Score = 29.3 bits (66), Expect = 0.91
 Identities = 10/39 (25%), Positives = 12/39 (30%)

Query: 34  NTRELLARILNALASQQELRGLMVQQGASKALIHLAHNG 72
           N RE L   L A A+ +      V    S         G
Sbjct: 495 NDREELEAGLRAFAAGKAKALSGVGADDSGKKTVFVFPG 533


>gnl|CDD|236618 PRK09736, PRK09736, 5-methylcytosine-specific restriction enzyme
          subunit McrC; Provisional.
          Length = 352

 Score = 29.2 bits (66), Expect = 0.94
 Identities = 12/35 (34%), Positives = 17/35 (48%), Gaps = 3/35 (8%)

Query: 24 EANKDKV---ESDNTRELLARILNALASQQELRGL 55
          E +   +   E DN  +LLA IL     ++  RGL
Sbjct: 24 EQDYKDIATEEFDNLPDLLAEILAKGVERRLKRGL 58


>gnl|CDD|79056 PRK12581, PRK12581, oxaloacetate decarboxylase; Provisional.
          Length = 468

 Score = 28.5 bits (63), Expect = 1.5
 Identities = 28/87 (32%), Positives = 39/87 (44%), Gaps = 16/87 (18%)

Query: 78  RQAAQGLARLGITLNPEVAFPGERSLEVVRP------LLSLLHPEATALETFEGLMALCN 131
           RQA Q     G  L+P + FP  R+L+   P      +LS L  +A A    E ++A   
Sbjct: 280 RQARQKYLADG-ILDPSLLFPDPRTLQYQVPGGMLSNMLSQLK-QANAESKLEEVLA--- 334

Query: 132 LAAIGEKQRQRKKLNFRPFVSCLGGVV 158
                E  R RK L + P V+ L  +V
Sbjct: 335 -----EVPRVRKDLGYPPLVTPLSQMV 356


>gnl|CDD|182406 PRK10358, PRK10358, putative rRNA methylase; Provisional.
          Length = 157

 Score = 28.0 bits (62), Expect = 1.8
 Identities = 13/31 (41%), Positives = 19/31 (61%)

Query: 35  TRELLARILNALASQQELRGLMVQQGASKAL 65
           TR L A IL+AL ++Q++R  M+    S  L
Sbjct: 103 TRGLPASILDALPAEQKIRIPMMPDSRSMNL 133


>gnl|CDD|223199 COG0121, COG0121, Predicted glutamine amidotransferase [General
           function prediction only].
          Length = 252

 Score = 28.1 bits (63), Expect = 1.8
 Identities = 15/78 (19%), Positives = 22/78 (28%), Gaps = 18/78 (23%)

Query: 68  LAHNGTVKGKRQAAQG-----------LARLGITLNPEVAFPGERSLEVVRPLLSLLHPE 116
            AHNG +   +                 A  GIT +  +     +       L +L   E
Sbjct: 104 FAHNGQLDKFKLLEGRKLEPVGYTDSEAAFCGITDSERIFLLILKP-----GLDALFQAE 158

Query: 117 ATALETFEGLMALCNLAA 134
             AL  F     L +   
Sbjct: 159 IRALGLF--NTLLSDGNY 174


>gnl|CDD|164623 MTH00052, COX3, cytochrome c oxidase subunit III; Provisional.
          Length = 262

 Score = 27.1 bits (60), Expect = 4.7
 Identities = 17/45 (37%), Positives = 27/45 (60%), Gaps = 4/45 (8%)

Query: 61  ASKALIHLAHNGTVKGKRQAA-QGLARLGITLNPEVAFPGERSLE 104
           +S A +  AH+G + GKR+ A  GLA L + L   + F G +++E
Sbjct: 140 SSGATVTWAHHGIISGKRKEAIIGLA-LTVALG--LLFTGLQAME 181


>gnl|CDD|182923 PRK11041, PRK11041, DNA-binding transcriptional regulator CytR;
           Provisional.
          Length = 309

 Score = 26.9 bits (60), Expect = 4.8
 Identities = 22/71 (30%), Positives = 34/71 (47%), Gaps = 12/71 (16%)

Query: 82  QGLARLGITLNPEVAFPGERSLE----VVRPLLSLLHPEATALETFEGLMALCNLAAIGE 137
           Q L R GIT++P+    G+ + E     ++ LL L  P  TA+     +MAL  L+    
Sbjct: 176 QALRRCGITVDPQYIARGDFTFEAGAKALKQLLDLPQP-PTAVFCHSDVMALGALS---- 230

Query: 138 KQRQRKKLNFR 148
              Q K++  R
Sbjct: 231 ---QAKRMGLR 238


>gnl|CDD|111382 pfam02477, Nairo_nucleo, Nucleocapsid N protein.  The nucleoprotein
           of the ssRNA negative-strand Nairovirus is an internal
           part of the virus particle.
          Length = 442

 Score = 27.2 bits (60), Expect = 5.0
 Identities = 12/36 (33%), Positives = 16/36 (44%)

Query: 17  EDVIKIYEANKDKVESDNTRELLARILNALASQQEL 52
            D+    E NKD V+     EL+  +   LA   EL
Sbjct: 256 LDLKGWVEDNKDAVDDGKANELVQTMTKHLAKAVEL 291


>gnl|CDD|184255 PRK13699, PRK13699, putative methylase; Provisional.
          Length = 227

 Score = 26.7 bits (59), Expect = 6.2
 Identities = 13/38 (34%), Positives = 20/38 (52%), Gaps = 1/38 (2%)

Query: 98  PGERSLEVVRPLL-SLLHPEATALETFEGLMALCNLAA 134
           P E+ +  ++PL+ S  HP A  L+ F G  + C  A 
Sbjct: 145 PTEKPVTSLQPLIESFTHPNAIVLDPFAGSGSTCVAAL 182


>gnl|CDD|201276 pfam00514, Arm, Armadillo/beta-catenin-like repeat.  Approx. 40
          amino acid repeat. Tandem repeats form super-helix of
          helices that is proposed to mediate interaction of
          beta-catenin with its ligands. CAUTION: This family
          does not contain all known armadillo repeats.
          Length = 41

 Score = 24.3 bits (54), Expect = 7.1
 Identities = 9/38 (23%), Positives = 19/38 (50%)

Query: 50 QELRGLMVQQGASKALIHLAHNGTVKGKRQAAQGLARL 87
           E +  +++ GA   L+ L  +   + + +AA  L+ L
Sbjct: 2  PENKQAVIEAGAVPPLVQLLSSPDEEVQEEAAWALSNL 39


>gnl|CDD|238448 cd00883, beta_CA_cladeA, Carbonic anhydrases (CA) are
           zinc-containing enzymes that catalyze the reversible
           hydration of carbon dioxide in a two-step mechanism in
           which the nucleophilic attack of a zinc-bound hydroxide
           ion on carbon dioxide is followed by the regeneration of
           an active site by ionization of the zinc-bound water
           molecule and removal of a proton from the active site.
           CAs are ubiquitous enzymes involved in fundamental
           processes like photosynthesis, respiration, pH
           homeostasis and ion transport. There are three
           evolutionarily distinct families of CAs (the alpha-,
           beta-, and gamma-CAs) which show no significant sequence
           identity or structural similarity.  Within the beta-CA
           family there are four evolutionarily distinct clades (A
           through D). The beta-CAs are multimeric enzymes (forming
           dimers,tetramers,hexamers and octamers) which are
           present in higher plants, algae, fungi, archaea and
           prokaryotes.
          Length = 182

 Score = 26.4 bits (59), Expect = 7.2
 Identities = 11/50 (22%), Positives = 21/50 (42%)

Query: 35  TRELLARILNALASQQELRGLMVQQGASKALIHLAHNGTVKGKRQAAQGL 84
              L A  L+AL  ++E    +V+    + + +L     V+   +  Q L
Sbjct: 116 VYRLHAAELDALEDEEERVDRLVELNVVEQVKNLCKTPIVQDAWKRGQEL 165


>gnl|CDD|237755 PRK14555, PRK14555, hypothetical protein; Provisional.
          Length = 145

 Score = 26.0 bits (58), Expect = 8.0
 Identities = 12/40 (30%), Positives = 19/40 (47%), Gaps = 4/40 (10%)

Query: 17 EDVIKIYEANKDKVESDNTRELLARILNALASQQELRGLM 56
           + I I  A   ++E    RE+L  +L  L  + EL  L+
Sbjct: 47 GNPITILSA---RLEKKEAREVLKHLLELL-DETELEKLL 82


>gnl|CDD|214547 smart00185, ARM, Armadillo/beta-catenin-like repeats.  Approx. 40
          amino acid repeat. Tandem repeats form superhelix of
          helices that is proposed to mediate interaction of
          beta-catenin with its ligands. Involved in transducing
          the Wingless/Wnt signal. In plakoglobin arm repeats
          bind alpha-catenin and N-cadherin.
          Length = 41

 Score = 23.9 bits (53), Expect = 8.8
 Identities = 10/38 (26%), Positives = 18/38 (47%)

Query: 50 QELRGLMVQQGASKALIHLAHNGTVKGKRQAAQGLARL 87
           E +  +V  G   AL+ L  +   +  ++AA  L+ L
Sbjct: 2  DENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNL 39


>gnl|CDD|204266 pfam09537, DUF2383, Domain of unknown function (DUF2383).
          Members of this protein family are found mostly in the
          Proteobacteria, although one member is found in the the
          marine planctomycete Pirellula sp. strain 1. The
          function is unknown.
          Length = 87

 Score = 24.8 bits (55), Expect = 9.5
 Identities = 12/64 (18%), Positives = 20/64 (31%), Gaps = 6/64 (9%)

Query: 15 VSEDVIKIYEANKDKVESDNTRELLARILNALASQ-QELRGLMVQQGASKALIHLAHNGT 73
             D  + YE   +KV+    + L  R          EL+  +   G           G+
Sbjct: 12 TLYDGEEGYEKAAEKVKDPELKALFQRRAAERRQAAAELQAEIRALGGE-----PEDGGS 66

Query: 74 VKGK 77
          + G 
Sbjct: 67 LAGT 70


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.133    0.368 

Gapped
Lambda     K      H
   0.267   0.0877    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 8,177,256
Number of extensions: 754947
Number of successful extensions: 867
Number of sequences better than 10.0: 1
Number of HSP's gapped: 867
Number of HSP's successfully gapped: 48
Length of query: 164
Length of database: 10,937,602
Length adjustment: 90
Effective length of query: 74
Effective length of database: 6,945,742
Effective search space: 513984908
Effective search space used: 513984908
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 55 (24.7 bits)