RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy12933
         (65 letters)



>gnl|CDD|222269 pfam13621, Cupin_8, Cupin-like domain.  This cupin like domain
           shares similarity to the JmjC domain.
          Length = 247

 Score = 58.5 bits (142), Expect = 2e-12
 Identities = 19/37 (51%), Positives = 24/37 (64%), Gaps = 1/37 (2%)

Query: 30  HLIECTLNPGDMLYIPPKVWHHVRSLS-TSISVSFWF 65
             +   L PGD LYIP   WHHVRSL   +I+V++WF
Sbjct: 205 KALVAELEPGDALYIPAGWWHHVRSLDDFNIAVNYWF 241


>gnl|CDD|219693 pfam08007, Cupin_4, Cupin superfamily protein.  This family
           contains many hypothetical proteins that belong to the
           cupin superfamily.
          Length = 319

 Score = 47.7 bits (114), Expect = 2e-08
 Identities = 13/36 (36%), Positives = 18/36 (50%)

Query: 30  HLIECTLNPGDMLYIPPKVWHHVRSLSTSISVSFWF 65
            + +  L PGDMLY+P    H   +L  S+  S  F
Sbjct: 174 PVHDFVLEPGDMLYLPRGFIHQGVALDDSLHYSVGF 209


>gnl|CDD|225406 COG2850, COG2850, Uncharacterized conserved protein [Function
           unknown].
          Length = 383

 Score = 38.9 bits (91), Expect = 3e-05
 Identities = 12/25 (48%), Positives = 15/25 (60%), Gaps = 1/25 (4%)

Query: 28  PGHL-IECTLNPGDMLYIPPKVWHH 51
           P    I+  L PGD+LYIPP   H+
Sbjct: 174 PFEPDIDEVLEPGDILYIPPGFPHY 198


>gnl|CDD|219618 pfam07883, Cupin_2, Cupin domain.  This family represents the
          conserved barrel domain of the 'cupin' superfamily
          ('cupa' is the Latin term for a small barrel).
          Length = 70

 Score = 34.9 bits (81), Expect = 2e-04
 Identities = 9/25 (36%), Positives = 11/25 (44%)

Query: 33 ECTLNPGDMLYIPPKVWHHVRSLST 57
          E  L  GD +Y P  V H  R+   
Sbjct: 37 EVVLKAGDSVYFPAGVPHRFRNTGD 61


>gnl|CDD|202224 pfam02373, JmjC, JmjC domain, hydroxylase.  The JmjC domain belongs
           to the Cupin superfamily. JmjC-domain proteins may be
           protein hydroxylases that catalyze a novel histone
           modification. This is confirmed to be a hydroxylase: the
           human JmjC protein named Tyw5p unexpectedly acts in the
           biosynthesis of a hypermodified nucleoside,
           hydroxy-wybutosine, in tRNA-Phe by catalyzing
           hydroxylation.
          Length = 114

 Score = 32.7 bits (75), Expect = 0.004
 Identities = 8/34 (23%), Positives = 16/34 (47%)

Query: 32  IECTLNPGDMLYIPPKVWHHVRSLSTSISVSFWF 65
                 PG+ ++  P  +H V +L  +I+ +  F
Sbjct: 81  YRFVQKPGEFVFTFPGWYHQVFNLGFNIAEAVNF 114


>gnl|CDD|224829 COG1917, COG1917, Uncharacterized conserved protein, contains
           double-stranded beta-helix domain [Function unknown].
          Length = 131

 Score = 30.6 bits (69), Expect = 0.026
 Identities = 9/27 (33%), Positives = 15/27 (55%)

Query: 35  TLNPGDMLYIPPKVWHHVRSLSTSISV 61
            L  GD++ IPP V H ++++     V
Sbjct: 85  ELKAGDVIIIPPGVVHGLKAVEDEPMV 111


>gnl|CDD|223734 COG0662, {ManC}, Mannose-6-phosphate isomerase [Carbohydrate
          transport and metabolism].
          Length = 127

 Score = 30.0 bits (68), Expect = 0.035
 Identities = 8/20 (40%), Positives = 11/20 (55%)

Query: 35 TLNPGDMLYIPPKVWHHVRS 54
           +  GD +YIP    H VR+
Sbjct: 78 EVKAGDSVYIPAGTPHRVRN 97


>gnl|CDD|204196 pfam09313, DUF1971, Domain of unknown function (DUF1971).
          Members of this family of functionally uncharacterized
          domains are predominantly found in bacterial Tellurite
          resistance protein.
          Length = 82

 Score = 28.3 bits (64), Expect = 0.094
 Identities = 9/21 (42%), Positives = 11/21 (52%)

Query: 36 LNPGDMLYIPPKVWHHVRSLS 56
          +  G    IPP+ WH V  LS
Sbjct: 54 IAAGQPAVIPPQYWHKVEPLS 74


>gnl|CDD|226343 COG3822, COG3822, ABC-type sugar transport system, auxiliary
           component [General function prediction only].
          Length = 225

 Score = 29.1 bits (65), Expect = 0.11
 Identities = 11/41 (26%), Positives = 19/41 (46%), Gaps = 13/41 (31%)

Query: 24  HNQTPGHLIECTLNPGDMLYIPPKVWHHVRSLSTSISVSFW 64
              T G  ++  L+PG+ + +PP ++H           SFW
Sbjct: 147 QTHTAGSQLK--LSPGESITLPPGLYH-----------SFW 174


>gnl|CDD|226586 COG4101, COG4101, Predicted mannose-6-phosphate isomerase
           [Carbohydrate transport and metabolism].
          Length = 142

 Score = 27.9 bits (62), Expect = 0.21
 Identities = 12/27 (44%), Positives = 14/27 (51%)

Query: 35  TLNPGDMLYIPPKVWHHVRSLSTSISV 61
            + PGD  YIPP V H   +LST    
Sbjct: 91  EVGPGDFFYIPPGVPHQPANLSTEPLS 117


>gnl|CDD|225796 COG3257, GlxB, Uncharacterized protein, possibly involved in
           glyoxylate utilization [General function prediction
           only].
          Length = 264

 Score = 28.2 bits (63), Expect = 0.21
 Identities = 8/31 (25%), Positives = 11/31 (35%)

Query: 35  TLNPGDMLYIPPKVWHHVRSLSTSISVSFWF 65
            L  G   Y+PP     +R+     S   W 
Sbjct: 104 ALREGGYAYLPPGSGWTLRNAQKEDSRFHWI 134


>gnl|CDD|203266 pfam05523, FdtA, WxcM-like, C-terminal.  This family includes FdtA
           from Aneurinibacillus thermoaerophilus, which has been
           characterized as a dtdp-6-deoxy-3,4-keto-hexulose
           isomerase. It also includes WxcM from Xanthomonas
           campestris (pv. campestris).
          Length = 131

 Score = 27.9 bits (63), Expect = 0.24
 Identities = 9/23 (39%), Positives = 13/23 (56%), Gaps = 1/23 (4%)

Query: 35  TLN-PGDMLYIPPKVWHHVRSLS 56
            L+ P   LYIPP +W  + + S
Sbjct: 79  LLDSPDKGLYIPPMIWREMYNFS 101


>gnl|CDD|183011 PRK11171, PRK11171, hypothetical protein; Provisional.
          Length = 266

 Score = 27.9 bits (63), Expect = 0.24
 Identities = 7/31 (22%), Positives = 12/31 (38%)

Query: 35  TLNPGDMLYIPPKVWHHVRSLSTSISVSFWF 65
            L+ G   Y+PP     +R+     +   W 
Sbjct: 104 ALSEGGYAYLPPGSDWTLRNAGAEDARFHWI 134


>gnl|CDD|236717 PRK10572, PRK10572, DNA-binding transcriptional regulator AraC;
          Provisional.
          Length = 290

 Score = 27.6 bits (62), Expect = 0.33
 Identities = 8/15 (53%), Positives = 9/15 (60%)

Query: 37 NPGDMLYIPPKVWHH 51
           PGD+L  PP   HH
Sbjct: 72 RPGDLLLFPPGEIHH 86


>gnl|CDD|224810 COG1898, RfbC, dTDP-4-dehydrorhamnose 3,5-epimerase and related
           enzymes [Cell envelope biogenesis, outer membrane].
          Length = 173

 Score = 26.9 bits (60), Expect = 0.59
 Identities = 12/44 (27%), Positives = 18/44 (40%), Gaps = 2/44 (4%)

Query: 23  PHNQTPGHLIECTLNPGD--MLYIPPKVWHHVRSLSTSISVSFW 64
             + T G  +   L+  +   LYIPP   H  + LS    V + 
Sbjct: 87  KDSPTYGKWVGVVLSAENKRQLYIPPGFAHGFQVLSDDAEVVYK 130


>gnl|CDD|214845 smart00835, Cupin_1, Cupin.  This family represents the conserved
          barrel domain of the 'cupin' superfamily ('cupa' is the
          Latin term for a small barrel). This family contains
          11S and 7S plant seed storage proteins, and germins.
          Plant seed storage proteins provide the major nitrogen
          source for the developing plant.
          Length = 146

 Score = 26.5 bits (59), Expect = 0.70
 Identities = 5/25 (20%), Positives = 10/25 (40%)

Query: 30 HLIECTLNPGDMLYIPPKVWHHVRS 54
           + +  L  GD+  +P    H   +
Sbjct: 73 KVYDARLREGDVFVVPQGHPHFQVN 97


>gnl|CDD|234250 TIGR03540, DapC_direct, LL-diaminopimelate aminotransferase.  This
           clade of the pfam00155 superfamily of aminotransferases
           includes several which are adjacent to elements of the
           lysine biosynthesis via diaminopimelate pathway
           (GenProp0125). Every member of this clade is from a
           genome which possesses most of the lysine biosynthesis
           pathway but lacks any of the known aminotransferases,
           succinylases, desuccinylases, acetylases or deacetylases
           typical of the acylated versions of this pathway nor do
           they have the direct, NADPH-dependent enzyme (ddh).
           Although there is no experimental characterization of
           any of the sequences in this clade, a direct pathway is
           known in plants and Chlamydia and the clade containing
           the Chlamydia gene is a neighboring one in the same
           pfam00155 superfamily so it seems quite reasonable that
           these enzymes catalyze the same transformation.
          Length = 383

 Score = 26.2 bits (58), Expect = 1.2
 Identities = 6/18 (33%), Positives = 11/18 (61%)

Query: 29  GHLIECTLNPGDMLYIPP 46
            H+    +NPGD++ +P 
Sbjct: 105 AHIPLAFVNPGDIVLVPD 122


>gnl|CDD|219393 pfam07385, DUF1498, Protein of unknown function (DUF1498).  This
           family consists of several hypothetical bacterial
           proteins of around 225 residues in length. The function
           of this family is unknown.
          Length = 225

 Score = 25.9 bits (57), Expect = 1.2
 Identities = 11/41 (26%), Positives = 17/41 (41%), Gaps = 13/41 (31%)

Query: 24  HNQTPGHLIECTLNPGDMLYIPPKVWHHVRSLSTSISVSFW 64
                G L++  L PG  + +PP ++H           SFW
Sbjct: 148 VTMKAGGLLK--LKPGQSICLPPGLYH-----------SFW 175


>gnl|CDD|225051 COG2140, COG2140, Thermophilic glucose-6-phosphate isomerase and
           related metalloenzymes [Carbohydrate transport and
           metabolism / General function prediction only].
          Length = 209

 Score = 25.9 bits (57), Expect = 1.3
 Identities = 7/26 (26%), Positives = 12/26 (46%)

Query: 28  PGHLIECTLNPGDMLYIPPKVWHHVR 53
            G      +  GD++Y+PP   H+  
Sbjct: 122 EGEARVIAVRAGDVIYVPPGYGHYTI 147


>gnl|CDD|219088 pfam06560, GPI, Glucose-6-phosphate isomerase (GPI).  This family
           consists of several bacterial and archaeal
           glucose-6-phosphate isomerase (GPI) proteins
           (EC:5.3.1.9).
          Length = 181

 Score = 26.0 bits (57), Expect = 1.3
 Identities = 6/30 (20%), Positives = 12/30 (40%), Gaps = 1/30 (3%)

Query: 23  PHNQTPGHLIECTLNPGDMLYIPPKVWHHV 52
           P       +    +  G ++Y+PP  + H 
Sbjct: 100 PEGGAVDDVRVIEMEKGTVVYVPPY-YGHR 128


>gnl|CDD|183450 PRK12335, PRK12335, tellurite resistance protein TehB;
          Provisional.
          Length = 287

 Score = 25.7 bits (57), Expect = 1.6
 Identities = 5/17 (29%), Positives = 9/17 (52%)

Query: 40 DMLYIPPKVWHHVRSLS 56
             +I P+ WH + + S
Sbjct: 66 QPPFIEPQAWHRIEAAS 82


>gnl|CDD|181219 PRK08068, PRK08068, transaminase; Reviewed.
          Length = 389

 Score = 25.7 bits (57), Expect = 1.7
 Identities = 8/18 (44%), Positives = 12/18 (66%), Gaps = 3/18 (16%)

Query: 31  LIE---CTLNPGDMLYIP 45
           L+E   C +NPGD + +P
Sbjct: 107 LVELPQCLMNPGDTILVP 124


>gnl|CDD|145456 pfam02311, AraC_binding, AraC-like ligand binding domain.  This
          family represents the arabinose-binding and
          dimerisation domain of the bacterial gene regulatory
          protein AraC. The domain is found in conjunction with
          the helix-turn-helix (HTH) DNA-binding motif pfam00165.
          This domain is distantly related to the Cupin domain
          pfam00190.
          Length = 134

 Score = 25.5 bits (56), Expect = 1.8
 Identities = 7/22 (31%), Positives = 9/22 (40%)

Query: 33 ECTLNPGDMLYIPPKVWHHVRS 54
             L PGD+  +PP   H    
Sbjct: 42 TYHLGPGDLFLLPPGEPHDYEP 63


>gnl|CDD|226142 COG3615, TehB, Uncharacterized protein/domain, possibly involved
          in tellurite resistance [Inorganic ion transport and
          metabolism].
          Length = 99

 Score = 25.1 bits (55), Expect = 2.1
 Identities = 9/43 (20%), Positives = 17/43 (39%), Gaps = 5/43 (11%)

Query: 24 HNQTPGHLIECTLNPGDMLYIPPKVWHHVRSLS--TSISVSFW 64
              P H+             PP+ WH V +++     ++SF+
Sbjct: 56 GETEPEHVFSIEAQFP---VFPPQAWHRVEAMTDDAEFNLSFY 95


>gnl|CDD|200251 TIGR03214, ura-cupin, putative allantoin catabolism protein.  This
           model represents a protein containing a tandem
           arrangement of cupin domains (N-terminal part of
           pfam07883 and C-terminal more distantly related to
           pfam00190). This protein is found in the vicinity of
           genes involved in the catabolism of allantoin, a
           breakdown product of urate and sometimes of urate
           iteslf. The distribution of pathway components in the
           genomes in which this family is observed suggests that
           the function is linked to the allantoate catabolism to
           glyoxylate pathway (GenProp0686) since it is sometimes
           found in genomes lacking any elements of the
           xanthine-to-allantoin pathways (e.g. in Enterococcus
           faecalis).
          Length = 252

 Score = 25.4 bits (56), Expect = 2.1
 Identities = 6/31 (19%), Positives = 11/31 (35%)

Query: 35  TLNPGDMLYIPPKVWHHVRSLSTSISVSFWF 65
            L  G   Y+PP     + +     +  F +
Sbjct: 93  ELEEGGYAYLPPGSKWTLENAQDEDARFFLY 123


>gnl|CDD|183930 PRK13264, PRK13264, 3-hydroxyanthranilate 3,4-dioxygenase;
          Provisional.
          Length = 177

 Score = 25.3 bits (56), Expect = 2.3
 Identities = 8/25 (32%), Positives = 12/25 (48%)

Query: 26 QTPGHLIECTLNPGDMLYIPPKVWH 50
          Q  G   +  +  G+M  +PP V H
Sbjct: 70 QEDGKRRDVPIREGEMFLLPPHVPH 94


>gnl|CDD|232879 TIGR00218, manA, mannose-6-phosphate isomerase, class I.  The names
           phosphomannose isomerase and mannose-6-phosphate
           isomerase are synonomous. This family contains two
           rather deeply branched groups. One group contains an
           experimentally determined phosphomannose isomerase of
           Streptococcus mutans as well as three uncharacterized
           paralogous proteins of Bacillus subtilis, all at more
           than 50 % identity to each other, plus a more distant
           homolog from Archaeoglobus fulgidus. The other group
           contains members from E. coli, budding yeast, Borrelia
           burgdorferi, etc [Energy metabolism, Sugars].
          Length = 303

 Score = 24.7 bits (54), Expect = 3.4
 Identities = 6/17 (35%), Positives = 7/17 (41%)

Query: 34  CTLNPGDMLYIPPKVWH 50
             L PGD  Y+     H
Sbjct: 154 IKLKPGDFFYVRSGTPH 170


>gnl|CDD|183298 PRK11752, PRK11752, putative S-transferase; Provisional.
          Length = 264

 Score = 24.5 bits (54), Expect = 4.1
 Identities = 6/9 (66%), Positives = 6/9 (66%)

Query: 43 YIPPKVWHH 51
          Y PPKVW  
Sbjct: 6  YQPPKVWTW 14


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.315    0.127    0.403 

Gapped
Lambda     K      H
   0.267   0.0745    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 2,893,671
Number of extensions: 180549
Number of successful extensions: 202
Number of sequences better than 10.0: 1
Number of HSP's gapped: 202
Number of HSP's successfully gapped: 28
Length of query: 65
Length of database: 10,937,602
Length adjustment: 36
Effective length of query: 29
Effective length of database: 9,340,858
Effective search space: 270884882
Effective search space used: 270884882
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 53 (24.2 bits)