RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy13239
         (182 letters)



>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score =  185 bits (472), Expect = 1e-56
 Identities = 83/179 (46%), Positives = 119/179 (66%), Gaps = 7/179 (3%)

Query: 7   QVESSPFVERLLRKGYEILYLIEAVDEYTLSSLPEFEGKKFQNIAKEGLSISAN---KEK 63
           QVE SPF+ERL +KGYE+LY+ + +DEY +  L EFEGKK  N+ KEGL +  +   K+K
Sbjct: 298 QVEKSPFLERLKKKGYEVLYMTDPIDEYAMQQLKEFEGKKLVNVTKEGLKLEESEEEKKK 357

Query: 64  MEALKTQYEPLLKWLSEKALKDQIAKAEISERLSDSPCALVASMFGWTGNMERLAMSNAH 123
            E LK ++E L KW+ +  L D++ K  +S RL DSPCALV S +GW+ NMER+  + A 
Sbjct: 358 REELKKEFEELCKWM-KDILGDKVEKVVVSNRLVDSPCALVTSQYGWSANMERIMKAQAL 416

Query: 124 QKADDPQRKYYLSQKKTMEVNPRHPLIRELYRRVKDDPEDAKATEIATMMFQTGTLREG 182
           +   D     Y+S KKT+E+NPRHP+I+EL +RV+ D  D    ++A ++++T  L  G
Sbjct: 417 R---DSSMSAYMSSKKTLEINPRHPIIKELRKRVEADKNDKTVKDLARLLYETALLTSG 472


>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
          Length = 701

 Score =  134 bits (337), Expect = 9e-37
 Identities = 74/179 (41%), Positives = 108/179 (60%), Gaps = 7/179 (3%)

Query: 7   QVESSPFVERLLRKGYEILYLIEAVDEYTLSSLPEFEGKKFQNIAKEGLSISAN---KEK 63
           ++E+SPF+E+  R+G E+L++ E +DEY +  + +FE KKF  + KEG+    +   K++
Sbjct: 473 KLETSPFIEQARRRGLEVLFMTEPIDEYVMQQVKDFEDKKFACLTKEGVHFEESEEEKQQ 532

Query: 64  MEALKTQYEPLLKWLSEKALKDQIAKAEISERLSDSPCALVASMFGWTGNMERLAMSNAH 123
            E  K   E L K + E  L D++ K  +SERLS SPC LV S FGW+ +ME++ M N  
Sbjct: 533 REEEKAACEKLCKTMKE-VLGDKVEKVIVSERLSTSPCILVTSEFGWSAHMEQI-MRN-- 588

Query: 124 QKADDPQRKYYLSQKKTMEVNPRHPLIRELYRRVKDDPEDAKATEIATMMFQTGTLREG 182
           Q   D     Y+  KKTME+NPRHP+I+EL RRV  D  D    ++  ++F T  L  G
Sbjct: 589 QALRDSSMAQYMMSKKTMELNPRHPIIKELRRRVGADENDKAVKDLVFLLFDTSLLTSG 647


>gnl|CDD|235366 PRK05218, PRK05218, heat shock protein 90; Provisional.
          Length = 613

 Score =  125 bits (318), Expect = 4e-34
 Identities = 57/180 (31%), Positives = 103/180 (57%), Gaps = 18/180 (10%)

Query: 7   QVESSPFVERLLRKGYEILYLIEAVDEYTLSSLPEFEGKKFQNIAKEGLSIS----ANKE 62
             ++SP +E   +KG E+L L + +DE+ +S L EF+GK F+++A+  L +       KE
Sbjct: 426 AAKNSPHLELFKKKGIEVLLLTDPIDEFWISYLEEFDGKPFKSVARGDLDLGKEDEEEKE 485

Query: 63  KMEALKTQYEPLLKWLSEKALKDQIAKAEISERLSDSPCALVASMFGWTGNMERLAMSNA 122
           + E  + +++PLL+ L ++AL D++    +S RL+DSP  LVA     +  ME+L +  A
Sbjct: 486 EKEEAEEEFKPLLERL-KEALGDKVKDVRLSHRLTDSPACLVADEGDMSTQMEKL-LKAA 543

Query: 123 HQKADDPQRKYYLSQKKTMEVNPRHPLIRELYRRVKDDPEDAKATEIATMMFQTGTLREG 182
            Q+  +         K  +E+NP HPL+    +++ D+ ++AK  ++A +++    L EG
Sbjct: 544 GQEVPES--------KPILEINPNHPLV----KKLADEADEAKFKDLAELLYDQALLAEG 591


>gnl|CDD|223403 COG0326, HtpG, Molecular chaperone, HSP90 family [Posttranslational
           modification, protein turnover, chaperones].
          Length = 623

 Score =  117 bits (295), Expect = 5e-31
 Identities = 57/180 (31%), Positives = 95/180 (52%), Gaps = 18/180 (10%)

Query: 7   QVESSPFVERLLRKGYEILYLIEAVDEYTLSSLPEFEGKKFQNIAKEGLSISANKEKMEA 66
             + SP +E    KG E+L L + +DE+ L+ LPEFEGK F++I K  L +   +E+ EA
Sbjct: 436 AAKGSPHLELFKAKGIEVLLLTDRIDEFMLTMLPEFEGKPFKSITKGDLDLELLEEEDEA 495

Query: 67  ----LKTQYEPLLKWLSEKALKDQIAKAEISERLSDSPCALVASMFGWTGNMERLAMSNA 122
                K +++PLL+ + ++ L D++    +S RL+DSP  L       +  MERL +   
Sbjct: 496 DSEEEKKEFKPLLERV-KEILGDKVKDVRLSHRLTDSPACLTTDGADLSTQMERL-LKAQ 553

Query: 123 HQKADDPQRKYYLSQKKTMEVNPRHPLIRELYRRVKDDPEDAKATEIATMMFQTGTLREG 182
            Q+   P+ K  L      E+NP HPL+++L        ++A   ++  +++    L EG
Sbjct: 554 GQE--VPESKKIL------EINPNHPLVKKL----ASLEDEASVADLVELLYDQALLAEG 601


>gnl|CDD|185466 PTZ00130, PTZ00130, heat shock protein 90; Provisional.
          Length = 814

 Score =  102 bits (254), Expect = 2e-25
 Identities = 57/174 (32%), Positives = 107/174 (61%), Gaps = 8/174 (4%)

Query: 12  PFVERLLRKGYEILYLIEAVDEYTLSSLPEFEGKKFQNIAKEGLSISAN---KEKMEALK 68
           P ++   +K  ++++L E+VDE  +  + E++GKKF++I K  ++       K+K E +K
Sbjct: 567 PQLQIFKKKNIDVVFLTESVDESCVQRVQEYDGKKFKSIQKGEITFELTEDEKKKEEKVK 626

Query: 69  TQYEPLLKWLSEKALKDQIAKAEISERLSDSPCALVASMFGWTGNMERLAMSNAHQKADD 128
             Y+ L+  +S+  L++QI K EIS RL D+PCA+V++ +G +G ME+L   N     ++
Sbjct: 627 KMYKALIDVISD-TLRNQIFKVEISRRLVDAPCAVVSTEWGLSGQMEKLMKIN----VNN 681

Query: 129 PQRKYYLSQKKTMEVNPRHPLIRELYRRVKDDPEDAKATEIATMMFQTGTLREG 182
             +   +S +K +E+NP HP++ +L +R   +P+D++ TE   +++Q+  L  G
Sbjct: 682 SDQIKAMSGQKILEINPDHPIMIDLLKRSVSNPKDSQLTESIKIIYQSAKLASG 735


>gnl|CDD|216564 pfam01548, DEDD_Tnp_IS110, Transposase.  Transposase proteins are
           necessary for efficient DNA transposition. This family
           includes an amino-terminal region of the pilin gene
           inverting protein (PIVML) and members of the
           IS111A/IS1328/IS1533 family of transposases.
          Length = 142

 Score = 30.7 bits (70), Expect = 0.22
 Identities = 17/41 (41%), Positives = 21/41 (51%), Gaps = 4/41 (9%)

Query: 142 EVNPRH-PLIRELYRRVKDDPEDAKATEIATMMFQTGTLRE 181
            VNPR     R+  RR K D  DA+A  IA +  + G LR 
Sbjct: 74  VVNPRQVKAFRKSLRRAKTDAIDAEA--IARLA-RRGRLRP 111


>gnl|CDD|150245 pfam09506, Salt_tol_Pase, Glucosylglycerol-phosphate phosphatase
           (Salt_tol_Pase).  Proteins in this family are
           glucosylglycerol-phosphate phosphatases, with the gene
           symbol stpA (Salt Tolerance Protein A). A motif
           characteristic of acid phosphatases is found, but
           otherwise this family shows little sequence similarity
           to other phosphatases. This enzyme acts on the
           glucosylglycerol phosphate, product of glucosylglycerol
           phosphate synthase and immediate precursor of the
           osmoprotectant glucosylglycerol.
          Length = 381

 Score = 31.5 bits (72), Expect = 0.23
 Identities = 21/69 (30%), Positives = 34/69 (49%), Gaps = 9/69 (13%)

Query: 47  FQNIAKEGLS------ISANKEKMEA-LKTQYEPLLKWLSEKALKDQIAKAEISERLSDS 99
           F NI+  G+S      ++A   +MEA LK +   +L  LSE+ +      A +  R+  S
Sbjct: 89  FGNISHPGVSDAEIAFLAAVPARMEALLKERLPSILPELSEEEIAKLAQAAILDTRV--S 146

Query: 100 PCALVASMF 108
           P   + S+F
Sbjct: 147 PTINLNSLF 155


>gnl|CDD|224237 COG1318, COG1318, Predicted transcriptional regulators
           [Transcription].
          Length = 182

 Score = 29.1 bits (65), Expect = 0.99
 Identities = 17/56 (30%), Positives = 30/56 (53%)

Query: 43  EGKKFQNIAKEGLSISANKEKMEALKTQYEPLLKWLSEKALKDQIAKAEISERLSD 98
              + + + KEGL I    E ++ +K ++ P+ K L E+ LK ++ K E  E + D
Sbjct: 107 VEVEIEKLEKEGLKIRWAVEVLKKIKGEHFPMDKELLEEKLKGEVIKGEPLEEVID 162


>gnl|CDD|234096 TIGR03031, cas_csx12, CRISPR-associated protein Cas9/Csx12, subtype
           II-B/NMENI.  Members of this family of CRISPR-associated
           (cas) protein are found, so far, in CRISPR/cas loci in
           Wolinella succinogenes DSM 1740, Legionella pneumophila
           str. Paris, and Francisella tularensis, where the last
           probably is an example of a degenerate CRISPR locus,
           having neither repeats nor a functional Cas1. The
           characteristic repeat length is 37 base pairs and period
           is about 72. One region of this large protein shows
           sequence similarity to pfam01844, HNH endonuclease
           [Mobile and extrachromosomal element functions, Other].
          Length = 802

 Score = 28.4 bits (63), Expect = 2.8
 Identities = 13/78 (16%), Positives = 30/78 (38%), Gaps = 1/78 (1%)

Query: 19  RKGYEILYLIEAVDEYTLSSLPEFEGKKFQNIAKEG-LSISANKEKMEALKTQYEPLLKW 77
           RK  ++  L+  + + +       + + F N  K G   + +  +++E  +       K 
Sbjct: 499 RKQKQLPNLVAGILQVSSEDFINNKDEWFWNAHKVGRTKLKSYCKEIEEARKNSGNTFKI 558

Query: 78  LSEKALKDQIAKAEISER 95
             E+AL         +E+
Sbjct: 559 DIEEALSKDPEHLTSNEK 576


>gnl|CDD|162974 TIGR02683, upstrm_HI1419, putative addiction module killer
          protein.  Members of this strictly bacterial protein
          family are small, at roughly 100 amino acids. The gene
          is almost invariably the upstream member of a gene
          pair, where the downstream member is a predicted
          DNA-binding protein from a clade within Pfam
          helix-turn-helix family pfam01381. These gene pairs,
          when found on the bacterial chromosome, often are
          located with prophage regions, but also in both
          integrated plasmid regions and near housekeeping genes.
          Analysis suggests that the gene pair may serve as an
          addiction module.
          Length = 95

 Score = 26.4 bits (59), Expect = 3.6
 Identities = 11/21 (52%), Positives = 12/21 (57%), Gaps = 2/21 (9%)

Query: 76 KWLSEKALKDQIAKAEISERL 96
          KWL    LKD  AKA I  R+
Sbjct: 11 KWL--DGLKDPRAKARILARI 29


>gnl|CDD|213293 cd05927, LC-FACS_euk, Eukaryotic long-chain fatty acid CoA
           synthetase (LC-FACS).  The members of this family are
           eukaryotic fatty acid CoA synthetases that activate
           fatty acids with chain lengths of 12 to 20. LC-FACS
           catalyzes the formation of fatty acyl-CoA in a two-step
           reaction: the formation of a fatty acyl-AMP molecule as
           an intermediate, and the formation of a fatty acyl-CoA.
           This is a required step before free fatty acids can
           participate in most catabolic and anabolic reactions.
           Organisms tend to have multiple isoforms of LC-FACS
           genes with multiple splice variants. For example, nine
           genes are found in Arabidopsis and six genes are
           expressed in mammalian cells.
          Length = 539

 Score = 26.7 bits (60), Expect = 7.5
 Identities = 10/29 (34%), Positives = 18/29 (62%)

Query: 150 IRELYRRVKDDPEDAKATEIATMMFQTGT 178
           + EL ++ K  P   K  ++AT+M+ +GT
Sbjct: 94  LEELGKKNKVPPTPPKPEDLATIMYTSGT 122


>gnl|CDD|187541 cd05230, UGD_SDR_e, UDP-glucuronate decarboxylase (UGD) and
          related proteins, extended (e) SDRs.  UGD catalyzes the
          formation of UDP-xylose from UDP-glucuronate; it is an
          extended-SDR, and has the characteristic glycine-rich
          NAD-binding pattern, TGXXGXXG, and active site tetrad. 
          Extended SDRs are distinct from classical SDRs. In
          addition to the Rossmann fold (alpha/beta folding
          pattern with a central beta-sheet) core region typical
          of all SDRs, extended SDRs have a less conserved
          C-terminal extension of approximately 100 amino acids.
          Extended SDRs are a diverse collection of proteins, and
          include isomerases, epimerases, oxidoreductases, and
          lyases; they typically have a TGXXGXXG cofactor binding
          motif. SDRs are a functionally diverse family of
          oxidoreductases that have a single domain with a
          structurally conserved Rossmann fold, an
          NAD(P)(H)-binding region, and a structurally diverse
          C-terminal region. Sequence identity between different
          SDR enzymes is typically in the 15-30% range; they
          catalyze a wide range of activities including the
          metabolism of steroids, cofactors, carbohydrates,
          lipids, aromatic compounds, and amino acids, and act in
          redox sensing. Classical SDRs have an TGXXX[AG]XG
          cofactor binding motif and a YXXXK active site motif,
          with the Tyr residue of the active site motif serving
          as a critical catalytic residue (Tyr-151, human
          15-hydroxyprostaglandin dehydrogenase numbering). In
          addition to the Tyr and Lys, there is often an upstream
          Ser and/or an Asn, contributing to the active site;
          while substrate binding is in the C-terminal region,
          which determines specificity. The standard reaction
          mechanism is a 4-pro-S hydride transfer and proton
          relay involving the conserved Tyr and Lys, a water
          molecule stabilized by Asn, and nicotinamide. Atypical
          SDRs generally lack the catalytic residues
          characteristic of the SDRs, and their glycine-rich
          NAD(P)-binding motif is often different from the forms
          normally seen in classical or extended SDRs. Complex
          (multidomain) SDRs such as ketoreductase domains of
          fatty acid synthase have a GGXGXXG NAD(P)-binding motif
          and an altered active site motif (YXXXN). Fungal type
          ketoacyl reductases have a TGXXXGX(1-2)G NAD(P)-binding
          motif.
          Length = 305

 Score = 26.4 bits (59), Expect = 8.2
 Identities = 13/42 (30%), Positives = 20/42 (47%), Gaps = 12/42 (28%)

Query: 11 SPFVERLLRKGYEILYLIEAVDEYTLSSLPEFEGKKFQNIAK 52
          S   +RLL  G+E++     VD +       F G+K +NI  
Sbjct: 14 SHLCDRLLEDGHEVI----CVDNF-------FTGRK-RNIEH 43


>gnl|CDD|226183 COG3657, COG3657, Uncharacterized protein conserved in bacteria
          [Function unknown].
          Length = 100

 Score = 25.4 bits (56), Expect = 9.4
 Identities = 12/21 (57%), Positives = 16/21 (76%), Gaps = 2/21 (9%)

Query: 76 KWLSEKALKDQIAKAEISERL 96
          +WL  K LKD+ AKA+I+ RL
Sbjct: 14 EWL--KKLKDRRAKAKIAARL 32


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.314    0.130    0.361 

Gapped
Lambda     K      H
   0.267   0.0609    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,271,053
Number of extensions: 838953
Number of successful extensions: 767
Number of sequences better than 10.0: 1
Number of HSP's gapped: 755
Number of HSP's successfully gapped: 30
Length of query: 182
Length of database: 10,937,602
Length adjustment: 91
Effective length of query: 91
Effective length of database: 6,901,388
Effective search space: 628026308
Effective search space used: 628026308
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 56 (25.6 bits)