RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= 031643
         (156 letters)



>gnl|CDD|215466 PLN02865, PLN02865, galactokinase.
          Length = 423

 Score =  239 bits (612), Expect = 6e-79
 Identities = 104/152 (68%), Positives = 122/152 (80%), Gaps = 10/152 (6%)

Query: 1   MRNKVSEMSGRDAEVVRVVVSPYRICPLGAHIDHQGGTVSAMTINKGILLGFVPSGDTEV 60
           +R +V+ MSGR++  VRVVVSPYRICPLGAHIDHQGGTVSAMTINKGILLGFVPSGD EV
Sbjct: 15  IRERVAAMSGRNSGEVRVVVSPYRICPLGAHIDHQGGTVSAMTINKGILLGFVPSGDPEV 74

Query: 61  VLRSGQFDGEVRFSIDEIQQPRNSVKKHHVVHASDSAKIKEECKWGNYARGALYALQSRG 120
           +LRS QF+GEVRF +DEIQ P            + S+  KEE  WG+YARGA+YALQSRG
Sbjct: 75  LLRSAQFEGEVRFRVDEIQHPIA----------NVSSDSKEESNWGDYARGAVYALQSRG 124

Query: 121 NNLTQGIIGYICGSDNLDSSGLSSSAAVSMSF 152
           + L+QGI GYI GS+ LDSSGLSSSAAV +++
Sbjct: 125 HALSQGITGYISGSEGLDSSGLSSSAAVGVAY 156


>gnl|CDD|223231 COG0153, GalK, Galactokinase [Carbohydrate transport and
           metabolism].
          Length = 390

 Score = 63.5 bits (155), Expect = 1e-12
 Identities = 42/137 (30%), Positives = 54/137 (39%), Gaps = 21/137 (15%)

Query: 12  DAEVVRVVVSPYRICPLGAHIDHQGGTVSAMTINKGILLGFVPSGDTEVVLRSGQFDGEV 71
             E      +P R+  +G H D+ GG V    IN G  +      D +V L S  F    
Sbjct: 19  YVEPTVTAFAPGRVNLIGEHTDYNGGFVLPCAINYGTYVAVAKRDDGKVRLYSANFG--- 75

Query: 72  RFSIDEIQQPRNSVKKHHVVHASDSAKIKEECKWGNYARGALYALQSRGNNLTQGIIGYI 131
                                  D AK K +  W NY +G + ALQ RG   T G+   I
Sbjct: 76  -------------NAGDIFFLLLDIAKEKID-DWANYVKGVIKALQKRGYAFT-GLDIVI 120

Query: 132 CGSDNL-DSSGLSSSAA 147
            G  N+   +GLSSSAA
Sbjct: 121 SG--NIPIGAGLSSSAA 135


>gnl|CDD|235407 PRK05322, PRK05322, galactokinase; Provisional.
          Length = 387

 Score = 63.3 bits (155), Expect = 2e-12
 Identities = 46/153 (30%), Positives = 67/153 (43%), Gaps = 26/153 (16%)

Query: 1   MRNKVSEMSGRDAEVVRVVVSPYRICPLGAHIDHQGGTVSAMTINKGILLGFVPSGDTEV 60
           ++ K +E+ G +AE   V  SP RI  +G H D+ GG V    I  G         D +V
Sbjct: 6   LKKKFAEVFGEEAE--DVFFSPGRINLIGEHTDYNGGHVFPAAITLGTYGAARKRDDKKV 63

Query: 61  VLRSGQFD--GEVRFSIDEIQQPRNSVKKHHVVHASDSAKIKEECKWGNYARGALYALQS 118
            L S  F+  G + F +D++     S  K                 W NY +G L  LQ 
Sbjct: 64  RLYSANFEDLGIIEFDLDDL-----SFDKED--------------DWANYPKGVLKFLQE 104

Query: 119 RGNNLTQGIIGYICGSDNL-DSSGLSSSAAVSM 150
            G  +  G    I G  N+ + +GLSSSA++ +
Sbjct: 105 AGYKIDHGFDILIYG--NIPNGAGLSSSASIEL 135


>gnl|CDD|232841 TIGR00131, gal_kin, galactokinase.  Galactokinase is a member of
           the GHMP kinases (Galactokinase, Homoserine kinase,
           Mevalonate kinase, Phosphomevalonate kinase) and shares
           with them an amino-terminal domain probably related to
           ATP binding. The galactokinases found by this model are
           divided into two sets. Prokaryotic forms are generally
           shorter. The eukaryotic forms are longer because of
           additional central regions and in some cases are known
           to be bifunctional, with regulatory activities that are
           independent of galactokinase activity [Energy
           metabolism, Sugars].
          Length = 386

 Score = 55.6 bits (134), Expect = 7e-10
 Identities = 35/132 (26%), Positives = 50/132 (37%), Gaps = 20/132 (15%)

Query: 18  VVVSPYRICPLGAHIDHQGGTVSAMTINKGILLGFVPSGDTEVVLRSGQFDGEV-RFSID 76
              +P R+  +G H D+  G+V    I+ G L       D  V +     D +    S+D
Sbjct: 19  TARAPGRVNLIGEHTDYNDGSVLPCAIDFGTLCAVAVRDDKNVRIYLANADNKFAERSLD 78

Query: 77  EIQQPRNSVKKHHVVHASDSAKIKEECKWGNYARGALYALQSRGNNLTQGIIGYICGSDN 136
               P +                 E   W NY +G L+  Q R N+   G    +C  + 
Sbjct: 79  L---PLDG---------------SEVSDWANYFKGVLHVAQERFNSFPLG-ADIVCSGNV 119

Query: 137 LDSSGLSSSAAV 148
              SGLSSSAA 
Sbjct: 120 PTGSGLSSSAAF 131


>gnl|CDD|235163 PRK03817, PRK03817, galactokinase; Provisional.
          Length = 351

 Score = 50.8 bits (122), Expect = 3e-08
 Identities = 35/134 (26%), Positives = 57/134 (42%), Gaps = 27/134 (20%)

Query: 18  VVVSPYRICPLGAHIDHQGGTVSAMTINKGILLGFVPSGDTEVVLRSGQFDGEVRFSIDE 77
            V SP R+  +G H D+  G V    IN    L    S   + +  S  F+ E  F +D 
Sbjct: 2   KVKSPGRVNLIGEHTDYNDGYVLPFAINLYTFLEIEKSE--KFIFYSENFNEEKTFELD- 58

Query: 78  IQQPRNSVKKHHVVHASDSAKIKEECKWGNYARGALYALQSRGNNLTQGIIGYICGSDNL 137
                               K+++   W +Y +G ++ L+ RG  +  G+ G +    NL
Sbjct: 59  --------------------KLEKLNSWADYIKGVIWVLEKRGYEV-GGVKGKVSS--NL 95

Query: 138 D-SSGLSSSAAVSM 150
              +GLSSSA++ +
Sbjct: 96  PIGAGLSSSASLEV 109


>gnl|CDD|179937 PRK05101, PRK05101, galactokinase; Provisional.
          Length = 382

 Score = 44.5 bits (106), Expect = 6e-06
 Identities = 34/132 (25%), Positives = 53/132 (40%), Gaps = 23/132 (17%)

Query: 18  VVVSPYRICPLGAHIDHQGGTVSAMTINKGILLGFVPSGDTEVVLRSGQFDGEV-RFSID 76
            + +P R+  +G H D+  G V    I+   ++      D  V + +  +D +   FS+D
Sbjct: 22  TIQAPGRVNLIGEHTDYNDGFVLPCAIDYQTVISCAKRDDRIVRVIAADYDNQQDEFSLD 81

Query: 77  EIQQPRNSVKKHHVVHASDSAKIKEECKWGNYARGALYALQSRGNNLTQGIIGYICGSDN 136
               P                    E +W NY RG +  LQ R  +   G    I G  N
Sbjct: 82  APIVPH------------------PEQQWANYVRGVVKHLQERNPDF-GGADLVISG--N 120

Query: 137 L-DSSGLSSSAA 147
           +   +GLSSSA+
Sbjct: 121 VPQGAGLSSSAS 132


>gnl|CDD|179063 PRK00555, PRK00555, galactokinase; Provisional.
          Length = 363

 Score = 41.0 bits (96), Expect = 1e-04
 Identities = 32/128 (25%), Positives = 50/128 (39%), Gaps = 22/128 (17%)

Query: 21  SPYRICPLGAHIDHQGGTVSAMTINKGILLGFVPSGDTEVVLRSGQFDGEVRFSIDEIQQ 80
           +P RI  +G H D+  G    + + +  ++ F P     +   S + DG  R  +D    
Sbjct: 7   APGRINLIGEHTDYNLGFALPIALPQRTVVTFTPEHTDAITASSDRADGSARIPLDTTPG 66

Query: 81  PRNSVKKHHVVHASDSAKIKEECKWGNYARGALYALQSRGNNLTQGIIGYICGSDNLDSS 140
                                   W  YA G ++AL+  G+ +  G +     SD    S
Sbjct: 67  QVTG--------------------WAAYAAGVIWALRGAGHPVPGGAMS--ITSDVEIGS 104

Query: 141 GLSSSAAV 148
           GLSSSAA+
Sbjct: 105 GLSSSAAL 112


>gnl|CDD|204502 pfam10509, GalKase_gal_bdg, Galactokinase galactose-binding
          signature.  This is the highly conserved galactokinase
          signature sequence which appears to be present in all
          galactokinases irrespective of how many other ATP
          binding sites, etc., that they carry. The function of
          this domain appears to be to bind galactose, and the
          domain is normally at the N-terminus of the enzymes,
          EC:2.7.1.6. This domain is associated with the families
          GHMP_kinases_C, pfam08544 and GHMP_kinases_N,
          pfam00288.
          Length = 52

 Score = 32.5 bits (75), Expect = 0.009
 Identities = 13/46 (28%), Positives = 19/46 (41%), Gaps = 2/46 (4%)

Query: 10 GRDAEVVRVVVSPYRICPLGAHIDHQGGTVSAMTINKGILLGFVPS 55
          G + E V    +P R+  +G H D+ GG V    IN    +     
Sbjct: 9  GVEPEGV--ASAPGRVNLIGEHTDYNGGFVLPAAINLDTYVAVSKR 52


>gnl|CDD|215285 PLN02521, PLN02521, galactokinase.
          Length = 497

 Score = 33.9 bits (78), Expect = 0.022
 Identities = 40/153 (26%), Positives = 59/153 (38%), Gaps = 54/153 (35%)

Query: 21  SPYRICPLGAHIDHQGGTVSAMTINKGILLGFVPSGDTEVVLRSGQFDGEVR-------- 72
           SP R+  +G HID++G +V  M I +          DT V +R  +   ++R        
Sbjct: 53  SPGRVNLIGEHIDYEGYSVLPMAIRQ----------DTIVAIRRAEGSKKLRIANVNDKY 102

Query: 73  ----FSIDEIQQPRNSVKKHHVVHASDSAKIKEECKWGNYA----RGALYALQSRGNNLT 124
               F  D  Q+    +  H                WGNY     +G    L+S+G ++ 
Sbjct: 103 TTCTFPADPDQEV--DLANHK---------------WGNYFICGYKGVFEFLKSKGVDV- 144

Query: 125 QGIIGYICGSDNL------DSSGLSSSAAVSMS 151
               G   G D +        SGLSSSAA+  S
Sbjct: 145 ----GPPVGLDVVVDGTVPTGSGLSSSAALVCS 173


>gnl|CDD|181437 PRK08470, PRK08470, adenylosuccinate lyase; Provisional.
          Length = 442

 Score = 32.4 bits (74), Expect = 0.065
 Identities = 18/62 (29%), Positives = 31/62 (50%), Gaps = 9/62 (14%)

Query: 50  LGFVPSGDTEVVLRSGQFDGEVRFSIDEIQQPRNSVKKHHVVH--ASDSAKIKEECKWGN 107
           LG +P  D E + ++ +FD      IDEI++      KH ++    S S  + EE ++ +
Sbjct: 37  LGLIPDSDCEKICKNAKFDIA---RIDEIEK----TTKHDLIAFLTSVSESLGEESRFVH 89

Query: 108 YA 109
           Y 
Sbjct: 90  YG 91


>gnl|CDD|177574 PHA03278, PHA03278, envelope glycoprotein K; Provisional.
          Length = 347

 Score = 28.9 bits (65), Expect = 0.94
 Identities = 14/67 (20%), Positives = 23/67 (34%), Gaps = 6/67 (8%)

Query: 88  HHVVHAS---DSAKIKEECKWGNYARGALYALQSRGNNLTQGIIGYICGSDNLDSS---G 141
           H  V+A          +   W  Y    +Y      N L       IC +D ++ +   G
Sbjct: 35  HGCVYAVLPLGELSDGKNFTWEAYNSTLIYVPLGNKNALDFSGFDDICRTDLVNRTAIGG 94

Query: 142 LSSSAAV 148
           L+   A+
Sbjct: 95  LAGDEAL 101


>gnl|CDD|237304 PRK13206, ureC, urease subunit alpha; Reviewed.
          Length = 573

 Score = 27.8 bits (62), Expect = 2.9
 Identities = 14/29 (48%), Positives = 17/29 (58%), Gaps = 1/29 (3%)

Query: 12  DAEVVRVVVSPYRICPLGAH-IDHQGGTV 39
           D    R  V+ Y ICP  AH IDH+ G+V
Sbjct: 402 DNNRARRYVAKYTICPAVAHGIDHEIGSV 430


>gnl|CDD|131387 TIGR02334, prpF, probable AcnD-accessory protein PrpF.  The
          2-methylcitrate cycle is one of at least five
          degradation pathways for propionate via propionyl-CoA.
          Degradation of propionate toward pyruvate consumes
          oxaloacetate and releases succinate. Oxidation of
          succinate back into oxaloacetate by the TCA cycle makes
          the 2-methylcitrate pathway a cycle. This family
          consists of PrpF, an incompletely characterized protein
          that appears to be an essential accessory protein for
          the Fe/S-dependent 2-methylisocitrate dehydratase AcnD
          (TIGR02333). This protein is related to but distinct
          from FldA (part of Pfam family pfam04303), a putative
          fluorene degradation protein of Sphingomonas sp. LB126
          [Energy metabolism, Fermentation].
          Length = 390

 Score = 27.5 bits (61), Expect = 3.1
 Identities = 13/33 (39%), Positives = 17/33 (51%), Gaps = 3/33 (9%)

Query: 9  SGRDAEVVRVVVSPYRICPLGAHIDHQGGTVSA 41
            RD  ++RV+ SP    P G  ID  GG  S+
Sbjct: 35 EARDKLLLRVIGSPD---PYGKQIDGMGGATSS 64


>gnl|CDD|99858 cd06105, ScCit1-2_like, Saccharomyces cerevisiae (Sc) citrate
           synthases Cit1-2_like. Citrate synthases (CS) catalyze
           the condensation of acetyl coenzyme A (AcCoA) with
           oxaloacetate (OAA) to form citrate and coenzyme A (CoA),
           the first step in the citric acid cycle (TCA or Krebs
           cycle). Some CS proteins function as 2-methylcitrate
           synthase (2MCS). 2MCS catalyzes the condensation of
           propionyl-coenzyme A (PrCoA) and OAA to form
           2-methylcitrate and CoA during propionate metabolism.
           The overall CS reaction is thought to proceed through
           three partial reactions and involves both closed and
           open conformational forms of the enzyme: a) the
           carbanion or equivalent is generated from AcCoA by base
           abstraction of a proton, b) the nucleophilic attack of
           this carbanion on OAA to generate citryl-CoA, and c) the
           hydrolysis of citryl-CoA to produce citrate and CoA.
           There are two types of CSs: type I CS and type II CSs.
           Type I CSs are found in eukarya, gram-positive bacteria,
           archaea, and in some gram-negative bacteria and are
           homodimers with both subunits participating in the
           active site.  Type II CSs are unique to gram-negative
           bacteria and are homohexamers of identical subunits
           (approximated as a trimer of dimers).  ScCit1 is a
           nuclear-encoded mitochondrial CS with high specificity
           for AcCoA. In addition to its CS function, ScCit1 plays
           a part in the construction of the TCA cycle metabolon.
           Yeast cells deleted for Cit1 are hyper-susceptible to
           apoptosis induced by heat and aging stress. ScCit2 is a
           peroxisomal CS involved in the glyoxylate cycle; in
           addition to having activity with AcCoA, it may have
           activity with PrCoA. Chicken and pig heart CS, two
           Arabidopsis thaliana (Ath) CSs, CSY4 and -5, and
           Aspergillus niger (An) CS also belong to this group. Ath
           CSY4 has a mitochondrial targeting sequence; AthCSY5 has
           no identifiable targeting sequence. AnCS encoded by the
           citA gene has both an N-terminal mitochondrial import
           signal and a C-terminal peroxisomal target sequence; it
           is not known if both these signals are functional in
           vivo. This group contains proteins which function
           exclusively as either a CS or a 2MCS, as well as those
           with relaxed specificity which have dual functions as
           both a CS and a 2MCS.
          Length = 427

 Score = 27.3 bits (61), Expect = 4.2
 Identities = 9/13 (69%), Positives = 10/13 (76%)

Query: 31  HIDHQGGTVSAMT 43
           H DH+GG VSA T
Sbjct: 229 HSDHEGGNVSAHT 241


>gnl|CDD|99859 cd06106, ScCit3_like, Saccharomyces cerevisiae (Sc) 2-methylcitrate
           synthase Cit3-like. 2-methylcitrate synthase (2MCS)
           catalyzes the condensation of propionyl-coenzyme A
           (PrCoA) and oxaloacetate (OAA) to form 2-methylcitrate
           and CoA. Citrate synthase (CS) catalyzes the
           condensation of acetyl coenzyme A (AcCoA) with OAA to
           form citrate and CoA, the first step in the citric acid
           cycle (TCA or Krebs cycle). The overall CS reaction is
           thought to proceed through three partial reactions and
           involves both closed and open conformational forms of
           the enzyme: a) the carbanion or equivalent is generated
           from AcCoA by base abstraction of a proton, b) the
           nucleophilic attack of this carbanion on OAA to generate
           citryl-CoA, and c) the hydrolysis of citryl-CoA to
           produce citrate and CoA. There are two types of CSs:
           type I CS and type II CSs.  Type I CSs are found in
           eukarya, gram-positive bacteria, archaea, and in some
           gram-negative bacteria and are homodimers with both
           subunits participating in the active site.  Type II CSs
           are unique to gram-negative bacteria and are
           homohexamers of identical subunits (approximated as a
           trimer of dimers). ScCit3 is mitochondrial and functions
           in the metabolism of PrCoA; it is a dual specificity CS
           and 2MCS, having similar catalytic efficiency with both
           AcCoA and PrCoA. The pattern of expression of the ScCIT3
           gene follows that of the major mitochondrial CS gene
           (CIT1, not included in this group) and its expression is
           increased in the presence of a CIT1 deletion. This group
           also contains Aspergillus nidulans 2MCS; a deletion of
           the gene encoding this protein results in a strain
           unable to grow on propionate. This group contains
           proteins which function exclusively as either a CS or a
           2MCS, as well as those with relaxed specificity which
           have dual functions as both a CS and a 2MCS.
          Length = 428

 Score = 27.1 bits (60), Expect = 4.8
 Identities = 9/16 (56%), Positives = 11/16 (68%)

Query: 28  LGAHIDHQGGTVSAMT 43
           +  H DH+GG VSA T
Sbjct: 228 IALHGDHEGGNVSAHT 243


>gnl|CDD|223782 COG0710, AroD, 3-dehydroquinate dehydratase [Amino acid transport
           and metabolism].
          Length = 231

 Score = 26.2 bits (58), Expect = 6.8
 Identities = 5/22 (22%), Positives = 13/22 (59%)

Query: 1   MRNKVSEMSGRDAEVVRVVVSP 22
           +  ++ +M    A++V++ V P
Sbjct: 135 IIERLDKMESLGADIVKIAVMP 156


>gnl|CDD|233571 TIGR01778, TonB-copper, TonB-dependent copper receptor.  This model
           represents a family of proteobacterial TonB-dependent
           outer membrane receptor/transporters which bind and
           translocate copper ions. Two characterized members of
           this family exist, outer membrane protein C (OprC) from
           Pseudomonas aeruginosa and NosA from Pseudomonas
           stutzeri which is responsible for providing copper for
           the copper-containing N2O reductase [Transport and
           binding proteins, Cations and iron carrying compounds,
           Transport and binding proteins, Porins].
          Length = 636

 Score = 26.4 bits (58), Expect = 9.3
 Identities = 12/30 (40%), Positives = 16/30 (53%)

Query: 45  NKGILLGFVPSGDTEVVLRSGQFDGEVRFS 74
           N  + LG+ P  DT V L  G   GE R++
Sbjct: 172 NGNLALGWTPDADTVVELSHGASSGEARYA 201


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.317    0.133    0.388 

Gapped
Lambda     K      H
   0.267   0.0836    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 7,630,325
Number of extensions: 669143
Number of successful extensions: 562
Number of sequences better than 10.0: 1
Number of HSP's gapped: 550
Number of HSP's successfully gapped: 29
Length of query: 156
Length of database: 10,937,602
Length adjustment: 89
Effective length of query: 67
Effective length of database: 6,990,096
Effective search space: 468336432
Effective search space used: 468336432
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 55 (24.8 bits)
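
The figures in the statistics footer above are related by simple arithmetic, and the sketch below re-derives them as a consistency check. It assumes the standard Karlin-Altschul conversion from raw score to bits, bits = (lambda * S - ln K) / ln 2, and it assumes that S2, X2 and X3 use the gapped lambda/K pair while S1 and X1 use the ungapped pair; both assumptions happen to match the printed values. The function names are illustrative only and are not part of BLAST. The Expect values are deliberately not recomputed, because the report does not state how they were adjusted.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LEN = 156
DB_LETTERS = 10_937_602
DB_SEQS = 44_354
LENGTH_ADJUSTMENT = 89
LAMBDA_UNGAPPED, K_UNGAPPED = 0.317, 0.133
LAMBDA_GAPPED, K_GAPPED = 0.267, 0.0836


def effective_search_space(query_len, db_letters, db_seqs, adjustment):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - adjustment
    # The adjustment is subtracted once per database sequence.
    eff_db = db_letters - db_seqs * adjustment
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits (Karlin-Altschul form)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def dropoff_bits(raw_dropoff, lam):
    """Convert an X-dropoff value to bits (no K term involved)."""
    return lam * raw_dropoff / math.log(2)


if __name__ == "__main__":
    print(effective_search_space(QUERY_LEN, DB_LETTERS, DB_SEQS, LENGTH_ADJUSTMENT))
    # Raw score 612 is the top hit (PLN02865), reported as 239 bits.
    print(round(bit_score(612, LAMBDA_GAPPED, K_GAPPED)))
    print(round(bit_score(41, LAMBDA_UNGAPPED, K_UNGAPPED), 1))  # S1
    print(round(bit_score(55, LAMBDA_GAPPED, K_GAPPED), 1))      # S2
    print(round(dropoff_bits(16, LAMBDA_UNGAPPED), 1))           # X1
    print(round(dropoff_bits(38, LAMBDA_GAPPED), 1))             # X2
    print(round(dropoff_bits(64, LAMBDA_GAPPED), 1))             # X3
```

Under these assumptions the effective lengths, the search space, the bit score of the top hit, and the bit equivalents printed for X1 to X3 and S1 to S2 all come out as reported.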