RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= 001944
         (993 letters)



>gnl|CDD|215860 pfam00328, His_Phos_2, Histidine phosphatase superfamily (branch
           2).  The histidine phosphatase superfamily is so named
           because catalysis centres on a conserved His residue
           that is transiently phosphorylated during the catalytic
           cycle. Other conserved residues contribute to a
           'phosphate pocket' and interact with the phospho group
           of substrate before, during and after its transfer to
           the His residue. Structure and sequence analyses show
           that different families contribute different additional
           residues to the 'phosphate pocket' and, more
           surprisingly, differ in the position, in sequence and in
           three dimensions, of a catalytically essential acidic
           residue. The superfamily may be divided into two main
           branches.The smaller branch 2 contains predominantly
           eukaryotic proteins. The catalytic functions in members
           include phytase, glucose-1-phosphatase and multiple
           inositol polyphosphate phosphatase. The in vivo roles of
           the mammalian acid phosphatases in branch 2 are not
           fully understood, although activity against
           lysophosphatidic acid and tyrosine-phosphorylated
           proteins has been demonstrated.
          Length = 327

 Score = 80.6 bits (199), Expect = 3e-16
 Identities = 55/272 (20%), Positives = 88/272 (32%), Gaps = 78/272 (28%)

Query: 468 GLLRLHSTYRHDLKIYSSDEGRVQMSAAAFAKGLLDLEGQLTPILVSLVSKDSSMLDGLD 527
           GLL        ++ I SSD  R   SA AF  GL   EG            D+ +LD   
Sbjct: 73  GLLPDGYPP-SEVYIRSSDSNRTLASAQAFLAGLFPPEG----------DIDNDLLDWQP 121

Query: 528 NASIEMEEAKARLNEIIKSGSKMIHSNGSSDCPWMADGVGLPPNASELLPKLVKLTKKVT 587
              + + E K  L  ++          G   CP   +                 + K+V 
Sbjct: 122 IPVVTLPEVKKALANLLLL--------GYDSCPAFDE---------------SLVEKRVD 158

Query: 588 EQVRQLAKDEDEDLAETNPYDVIPPYDQAKALGKTNIDVDRIAAGLPCGSEGFLLMYARW 647
            ++ +L     E +A+                G+TN+                      W
Sbjct: 159 PELAKLLAVYLEPIAKRL---------SQLLPGETNLT-----------------GLDVW 192

Query: 648 RKLERDLYNERKERFDITQIPDVYDSCKYDLLHNAHLNLEGLDELFKVAQLLADGVIPNE 707
             L+  L+       D +  PD++    YD LH  +L+   L+E + ++           
Sbjct: 193 ALLDLCLFE--TNASDNSPFPDLFTG--YDALHLEYLSD--LEEYYGLS----------- 235

Query: 708 YGINPKQKLKIGSKIARRLLGKLLIDLRNTRE 739
            GI P+    IG  +   LL +L   L   + 
Sbjct: 236 -GIGPELAKLIGGPLLNELLKRLTNALVPDQT 266



 Score = 45.1 bits (107), Expect = 1e-04
 Identities = 14/41 (34%), Positives = 17/41 (41%)

Query: 314 ELRCVIAVMRHGDRTPKQKVKLKVTEEKLLNLMLKYNGGRP 354
           EL  V  V RHGDRTP + +           L  K+N    
Sbjct: 1   ELEFVQVVSRHGDRTPTKSLPKDPEAWPFKLLSEKHNDFLN 41



 Score = 41.7 bits (98), Expect = 0.001
 Identities = 16/90 (17%), Positives = 32/90 (35%), Gaps = 23/90 (25%)

Query: 808 ANVKTPERHVRTRLYFTSESHIHSLMNVLRYCNLDESLQGEDSLVCHSALERLYKTKELD 867
                    ++  LYF  ++ I  L++ L    L + L G  S              E  
Sbjct: 261 LVPDQTRLDLKLYLYFAHDTTIAPLLSAL---GLFDQLPGLSS-------------GEPP 304

Query: 868 YMSYIVLRMFENTAVALEDPKRFRIELTFS 897
           Y + +V  ++++        K F + L ++
Sbjct: 305 YAARLVFELWQDG-------KEFYVRLLYN 327


>gnl|CDD|132717 cd07061, HP_HAP_like, Histidine phosphatase domain found in
           histidine acid phosphatases and phytases; contains a His
           residue which is phosphorylated during the reaction.
           Catalytic domain of HAP (histidine acid phosphatases)
           and phytases (myo-inositol hexakisphosphate
           phosphohydrolases). The conserved catalytic core of this
           domain contains a His residue which is phosphorylated in
           the reaction. Functions in this subgroup include roles
           in metabolism, signaling, or regulation, for example
           Escherichia coli glucose-1-phosphatase functions to
           scavenge glucose from glucose-1-phosphate and the
           signaling molecules inositol 1,3,4,5,6-pentakisphosphate
           (InsP5) and inositol hexakisphosphate (InsP6) are in
           vivo substrates for eukaryotic multiple inositol
           polyphosphate phosphatase 1 (Minpp1). Phytases scavenge
           phosphate from extracellular sources and are added to
           animal feed while prostatic acid phosphatase (PAP) has
           been used for many years as a serum marker for prostate
           cancer. Recently PAP has been shown in mouse models to
           suppress pain by functioning as an
           ecto-5prime-nucleotidase. In vivo it dephosphorylates
           extracellular adenosine monophosphate (AMP) generating
           adenosine,and leading to the activation of A1-adenosine
           receptors in dorsal spinal cord.
          Length = 242

 Score = 56.2 bits (136), Expect = 2e-08
 Identities = 22/65 (33%), Positives = 31/65 (47%)

Query: 468 GLLRLHSTYRHDLKIYSSDEGRVQMSAAAFAKGLLDLEGQLTPILVSLVSKDSSMLDGLD 527
            LL LHS  R DL I SSD  R   SA AF  GL   +G     + ++  ++  + +  D
Sbjct: 40  ELLLLHSYNRSDLYIRSSDSQRTLQSAQAFLAGLFPPDGWQPIAVHTIPEEEDDVSNLFD 99

Query: 528 NASIE 532
             + E
Sbjct: 100 LCAYE 104



 Score = 37.0 bits (86), Expect = 0.031
 Identities = 9/16 (56%), Positives = 10/16 (62%)

Query: 314 ELRCVIAVMRHGDRTP 329
           EL  V  + RHGDR P
Sbjct: 1   ELEQVQVLSRHGDRYP 16



 Score = 34.7 bits (80), Expect = 0.18
 Identities = 14/81 (17%), Positives = 25/81 (30%), Gaps = 10/81 (12%)

Query: 798 ETQYRLDPKYANVKTPERHVRTRLYFTSESHIHSLMNVLRYCNLDESLQGEDSLVCHSAL 857
           E   RL          +     R  +   SH  +++ +L    L +  +           
Sbjct: 155 ELLARLTNG---PSGSQTFPLDRKLYLYFSHDTTILPLLTALGLFDFAEPLP-------P 204

Query: 858 ERLYKTKELDYMSYIVLRMFE 878
           + L    E DY  +    +FE
Sbjct: 205 DFLRGFSESDYPPFAARLVFE 225


>gnl|CDD|117020 pfam08443, RimK, RimK-like ATP-grasp domain.  This ATP-grasp domain
           is found in the ribosomal S6 modification enzyme RimK.
          Length = 190

 Score = 46.7 bits (111), Expect = 1e-05
 Identities = 50/206 (24%), Positives = 77/206 (37%), Gaps = 34/206 (16%)

Query: 55  DRRKVYEQLEKYGIPVPRYALVNREVPYQELDYFIEEEDFVEVHGNRFWKPFVEKPVHGD 114
           D+ K ++ L K+GIPVP   L              + E F+E     F  P V K V G 
Sbjct: 3   DKAKSHQLLAKHGIPVPNTGLAWSPE---------DAEKFIE-QIKGF--PVVVKSVFGS 50

Query: 115 --DHSIMIYYPSSAGGGMKELFRKVGNRSSEFHPDVRRVRREGSYIYEEFMPT-GGTDVK 171
                 +     S    + E F+ + N                  + +EF+   G  D++
Sbjct: 51  QGIGVFLAEDEQSLEQLL-EAFKWLKN----------------QILVQEFIAEAGNRDIR 93

Query: 172 VYTVGPEYAHAEARKSPVVDGVVMRNPDGKEVRYPVLLTPNEKQMAREVCIAFRQAVCGF 231
              VG E   A  R+S   +G    N     V  P  L+  E+++A +   A    V G 
Sbjct: 94  CLVVGGEVVGAIHRQSN--EGDFRTNLHRGGVAEPYQLSQEEEEIAIKAAQAMGLDVLGV 151

Query: 232 DLLRCEGRSYVCDVNGWSFVKNSYKY 257
           D+ R +    VC+VN    +K   + 
Sbjct: 152 DIFRSKRGLLVCEVNSSPGLKGIERT 177


>gnl|CDD|233121 TIGR00768, rimK_fam, alpha-L-glutamate ligases, RimK family.  This
           family, related to bacterial glutathione synthetases,
           contains at least two different alpha-L-glutamate
           ligases. One is RimK, as in E. coli, which adds
           additional Glu residues to the native Glu-Glu C-terminus
           of ribosomal protein S6, but not to Lys-Glu mutants.
           Most species with a member of this subfamily lack an S6
           homolog ending in Glu-Glu, however. Members in
           Methanococcus jannaschii act instead as a
           tetrahydromethanopterin:alpha-l-glutamate ligase
           (MJ0620) and a gamma-F420-2:alpha-l-glutamate ligase
           (MJ1001).
          Length = 277

 Score = 46.6 bits (111), Expect = 3e-05
 Identities = 53/217 (24%), Positives = 75/217 (34%), Gaps = 54/217 (24%)

Query: 55  DRRKVYEQLEKYGIPVPRYALVNREVPYQELDYFIEEEDFVEVHGNRFWKPFVEKPVHGD 114
           D+    + L K G+P PR  L        E    IEE  F          P V KPV G 
Sbjct: 88  DKFLTSQLLAKAGLPQPRTGLAGSPE---EALKLIEEIGF----------PVVLKPVFGS 134

Query: 115 --DHSIMIYYPSSAGGGMKELFRKVGNRSSEFHPDVRRVRREGSYIYEEFMP-TGGTDVK 171
                 +     +A   + E F ++                +  +  +E++   GG D++
Sbjct: 135 WGRLVSLARDKQAAETLL-EHFEQLNGP-------------QNLFYVQEYIKKPGGRDIR 180

Query: 172 VYTVGPEYAHAEARKSPVVDGVVMRNPD----------GKEVRYPVLLTPNEKQMAREVC 221
           V+ VG E           V   + R             GK    P  LT   +++A +  
Sbjct: 181 VFVVGDE-----------VIAAIYRITSGHWRTNLARGGKAEPCP--LTEEIEELAIKAA 227

Query: 222 IAFRQAVCGFDLLRCEGRSY-VCDVNGWSFVKNSYKY 257
            A    V G DLL  E R   V +VN     KNS K 
Sbjct: 228 KALGLDVVGIDLLESEDRGLLVNEVNPNPEFKNSVKT 264


>gnl|CDD|223267 COG0189, RimK, Glutathione synthase/Ribosomal protein S6
           modification enzyme (glutaminyl transferase) [Coenzyme
           metabolism / Translation, ribosomal structure and
           biogenesis].
          Length = 318

 Score = 41.6 bits (98), Expect = 0.001
 Identities = 52/243 (21%), Positives = 79/243 (32%), Gaps = 54/243 (22%)

Query: 18  CLIAFYSSGYPLEKAESYATLRKPFLVNELEPQHLLHDRRKVYEQ--LEKYGIPVPRYAL 75
               F  +   L  AE         ++N+  PQ +   R K+Y    L K GIPVP   +
Sbjct: 86  KDPPFDFATRFLRLAERKGV----PVIND--PQSIRRCRNKLYTTQLLAKAGIPVPPTLI 139

Query: 76  VNREVPYQELDYFIEEEDFVEVHGNRFWKPFVEKPVHGDDHSIMIYYPSSAGGGMKELFR 135
                   E   F+ E             P V KP+ G           S G G+     
Sbjct: 140 TRDPD---EAAEFVAE---------HLGFPVVLKPLDG-----------SGGRGVF---- 172

Query: 136 KVGNRSSEF--HPDVRRVRREGSYIYEEFMPTGGTDVKVYTVGPEYAHAEARKSPVVDGV 193
            V +   E     +          I +E++P    D +   VG            V    
Sbjct: 173 LVEDADPELLSLLETLTQEGRKLIIVQEYIPKAKRDDRRVLVG--------GGEVVAIYA 224

Query: 194 VMRNPDGKEVR---------YPVLLTPNEKQMAREVCIAFRQAVCGFDLLRCEGRSYVCD 244
           + R P   + R          P  LT  E+++A +   A    + G D++  +   YV +
Sbjct: 225 LARIPASGDFRSNLARGGRAEPCELTEEEEELAVKAAPALGLGLVGVDIIEDKDGLYVTE 284

Query: 245 VNG 247
           VN 
Sbjct: 285 VNV 287


>gnl|CDD|132716 cd07040, HP, Histidine phosphatase domain found in a functionally
           diverse set of proteins, mostly phosphatases; contains a
           His residue which is phosphorylated during the reaction.
            Catalytic domain of a functionally diverse set of
           proteins, most of which are phosphatases. The conserved
           catalytic core of this domain contains a His residue
           which is phosphorylated in the reaction. This set of
           proteins includes cofactor-dependent and
           cofactor-independent phosphoglycerate mutases (dPGM, and
           BPGM respectively), fructose-2,6-bisphosphatase
           (F26BP)ase, Sts-1, SixA, histidine acid phosphatases,
           phytases, and related proteins. Functions include roles
           in metabolism, signaling, or regulation, for example
           F26BPase affects glycolysis and gluconeogenesis through
           controlling the concentration of F26BP; BPGM controls
           the concentration of 2,3-BPG (the main allosteric
           effector of hemoglobin in human blood cells); human
           Sts-1 is a T-cell regulator; Escherichia coli Six A
           participates in the ArcB-dependent His-to-Asp
           phosphorelay signaling system; phytases scavenge
           phosphate from extracellular sources. Deficiency and
           mutation in many of the human members result in disease,
           for example erythrocyte BPGM deficiency is a disease
           associated with a decrease in the concentration of
           2,3-BPG. Clinical applications include the use of
           prostatic acid phosphatase (PAP) as a serum marker for
           prostate cancer. Agricultural applications include the
           addition of phytases to animal feed.
          Length = 153

 Score = 39.7 bits (93), Expect = 0.002
 Identities = 10/33 (30%), Positives = 14/33 (42%)

Query: 471 RLHSTYRHDLKIYSSDEGRVQMSAAAFAKGLLD 503
            L   Y    +IYSS   R   +A    +GL +
Sbjct: 40  ALRERYIKFDRIYSSPLKRAIQTAEIILEGLFE 72


>gnl|CDD|222206 pfam13535, ATP-grasp_4, ATP-grasp domain.  This family includes a
           diverse set of enzymes that possess ATP-dependent
           carboxylate-amine ligase activity.
          Length = 183

 Score = 37.3 bits (87), Expect = 0.016
 Identities = 42/224 (18%), Positives = 67/224 (29%), Gaps = 76/224 (33%)

Query: 52  LLHDRRKVYEQLEKYGIPVPRYALVNREVPYQELDYFIEEEDFVEVHGNRFWKPFVEKPV 111
              D+  + E L   G+PVP + LV+     ++LD   EE  F          P V KP 
Sbjct: 1   AFRDKALMRELLRAAGLPVPPFFLVD---DEEDLDAAAEEIGF----------PVVLKPR 47

Query: 112 HGDDHSIMIYYPSSAGGGMKELFRKVGNRS---SEFHPDVRRVRREGSYIYEEFMPTGGT 168
            G           S G        +V + +   +        V     Y+ EE++     
Sbjct: 48  DGA---------GSLG------VFRVDSAAELEAALAALAAEVEDTREYLVEEYID---- 88

Query: 169 DVKVYTVGPEYAHAEARKSPVVDGVV------------------MRNPDGKEVRYPVLLT 210
                  G EY          VDG+V                      +G E+       
Sbjct: 89  -------GDEYH---------VDGLVDDGELVFLGVSRYLGPPPPDFSEGVELGSVSPGE 132

Query: 211 PNEKQMAREVC------IAFRQAVCGFDLLRC-EGRSYVCDVNG 247
               +  RE+       +  R  V   +     +GR  + ++N 
Sbjct: 133 DPLPEALRELAERVLKALGLRNGVFHLEFFLTPDGRPVLLEINP 176


>gnl|CDD|165256 PHA02946, PHA02946, ankyin-like protein; Provisional.
          Length = 446

 Score = 35.0 bits (80), Expect = 0.17
 Identities = 30/96 (31%), Positives = 47/96 (48%), Gaps = 11/96 (11%)

Query: 637 SEGFLLMYARWRKLERDLYNERKERFDITQIPDVYDSCKYDLLHNAHLNLEGLDELFKVA 696
           +E +L +YA++     D++           +  +  S  Y +LH A+  ++GLDE F V 
Sbjct: 7   AEYYLSLYAKYNSKNLDVFR--------NMLQAIEPSGNYHILH-AYCGIKGLDERF-VE 56

Query: 697 QLLADGVIPNEYGINPKQKLKIGSKI-ARRLLGKLL 731
           +LL  G  PNE   +    L I SKI   R++  LL
Sbjct: 57  ELLHRGYSPNETDDDGNYPLHIASKINNNRIVAMLL 92


>gnl|CDD|216956 pfam02274, Amidinotransf, Amidinotransferase.  This family contains
           glycine (EC:2.1.4.1) and inosamine (EC:2.1.4.2)
           amidinotransferases, enzymes involved in creatine and
           streptomycin biosynthesis respectively. This family also
           includes arginine deiminases, EC:3.5.3.6. These enzymes
           catalyze the reaction: arginine + H2O <=> citrulline +
           NH3. Also found in this family is the Streptococcus anti
           tumour glycoprotein.
          Length = 286

 Score = 33.4 bits (77), Expect = 0.50
 Identities = 13/51 (25%), Positives = 20/51 (39%), Gaps = 4/51 (7%)

Query: 643 MYARWRKLERDLYNERKERFDITQIPDVYDSCKYDLLHNAHLNLEGLDELF 693
           M+A  R+ ER L           ++  V D      L + +  LEG D + 
Sbjct: 81  MFAPNRRRERRLD-ILILLKHPYKVQLVLD---ISELEDENAFLEGGDMVL 127


>gnl|CDD|181145 PRK07850, PRK07850, acetyl-CoA acetyltransferase; Provisional.
          Length = 387

 Score = 33.5 bits (77), Expect = 0.59
 Identities = 57/195 (29%), Positives = 81/195 (41%), Gaps = 41/195 (21%)

Query: 486 DEGRVQMSAAAFAKGLLDLEGQLTPILVSLVSKDSSMLDGLDNASIEMEEAKARLNEIIK 545
            EGR     +     +LD EGQ T     LV++D     GL + ++E     A L  +++
Sbjct: 181 AEGRFDREISPVQAPVLDEEGQPTGETR-LVTRDQ----GLRDTTME---GLAGLKPVLE 232

Query: 546 SGSKMIH--------SNGSSDCPWM----ADGVGLPPNA---------SE---LLPKLVK 581
            G   IH        S+G++   WM    A  +GL P A         +E    L   V+
Sbjct: 233 GG---IHTAGTSSQISDGAAAVLWMDEDRARALGLRPRARIVAQALVGAEPYYHLDGPVQ 289

Query: 582 LTKKVTEQVRQLAKDEDEDLAETNPY--DVIPPYDQAKA--LGKTNIDVDRIAAGLPCGS 637
            T KV E  +   K  D DL E N     V+  + Q     + K N++   IA G P GS
Sbjct: 290 ATAKVLE--KAGMKIGDIDLVEINEAFASVVLSWAQVHEPDMDKVNVNGGAIALGHPVGS 347

Query: 638 EGFLLMYARWRKLER 652
            G  L+     +LER
Sbjct: 348 TGARLITTALHELER 362


>gnl|CDD|235876 PRK06849, PRK06849, hypothetical protein; Provisional.
          Length = 389

 Score = 32.7 bits (75), Expect = 1.1
 Identities = 19/73 (26%), Positives = 29/73 (39%), Gaps = 12/73 (16%)

Query: 42  FLVNELEPQHLLHDRRKVYEQLEKYGIPVPRYALVNREVPYQELDYFIEEEDFVEVHGNR 101
            L  + E   LLH++ +  EQ    G+ VP+  L+    P       I    F   H   
Sbjct: 103 VLHFDFELLLLLHNKWEFAEQARSLGLSVPKTYLITD--PEA-----IRNFMFKTPH--- 152

Query: 102 FWKPFVEKPVHGD 114
              P+V KP++  
Sbjct: 153 --TPYVLKPIYSR 163


>gnl|CDD|131199 TIGR02144, LysX_arch, Lysine biosynthesis enzyme LysX.  The family
           of proteins found in this equivalog include the
           characterized LysX from Thermus thermophilus which is
           part of a well-organized lysine biosynthesis gene
           cluster. LysX is believed to carry out an ATP-dependent
           acylation of the amino group of alpha-aminoadipate in
           the prokaryotic version of the fungal AAA lysine
           biosynthesis pathway. No species having a sequence in
           this equivalog contains the elements of the more common
           diaminopimelate lysine biosythesis pathway, and none has
           been shown to be a lysine auxotroph. These sequences
           have mainly recieved the name of the related enzyme,
           "ribosomal protein S6 modification protein RimK". RimK
           has been characterized in E. coli, and acts by
           ATP-dependent condensation of S6 with glutamate
           residues.
          Length = 280

 Score = 31.6 bits (72), Expect = 1.6
 Identities = 44/203 (21%), Positives = 71/203 (34%), Gaps = 31/203 (15%)

Query: 54  HDRRKVYEQLEKYGIPVPRYALVNREVPYQELDYFIEEEDFVEVHGNRFWKPFVEKPVHG 113
            D+   Y +L K G+P PR  L       +      E   +          P V KPV G
Sbjct: 86  GDKIFTYLKLAKAGVPTPRTYLAFDR---EAALKLAEALGY----------PVVLKPVIG 132

Query: 114 DDHSIMIYYPSSAGGGMKELFRKVGNRSSEF-HPDVRRVRREGSYIYEEFMPTGGTDVKV 172
                       + G +  L R      S   H +V    +   +  +E++   G D++V
Sbjct: 133 ------------SWGRLVALIRDKDELESLLEHKEVLGGSQHKLFYIQEYINKPGRDIRV 180

Query: 173 YTVGPEYAHAEARKSPVVDGVVMRNPDGKEVRYPVLLTPNEKQMAREVCIAFRQAVCGFD 232
           + +G E   A  R S   +        G +   P  L    +++A +   A    V   D
Sbjct: 181 FVIGDEAIAAIYRYS---NHWRTNTARGGKA-EPCPLDEEVEELAVKAAEAVGGGVVAID 236

Query: 233 LLRCEGRSYVCD-VNGWSFVKNS 254
           +   + R  + + VN     KNS
Sbjct: 237 IFESKERGLLVNEVNHVPEFKNS 259


>gnl|CDD|129443 TIGR00343, TIGR00343, pyridoxal 5'-phosphate synthase, synthase
           subunit Pdx1.  This protein had been believed to be a
           singlet oxygen resistance protein. Subsequent work
           showed that it is a protein of pyridoxine (vitamin B6)
           biosynthesis, and that pyridoxine quenches the highly
           toxic singlet form of oxygen produced by light in the
           presence of certain chemicals [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Pyridoxine].
          Length = 287

 Score = 30.1 bits (68), Expect = 5.5
 Identities = 28/112 (25%), Positives = 50/112 (44%), Gaps = 22/112 (19%)

Query: 535 EAKARLNEIIKSGSKMIHSNGSSDCPWMADGVGLPPNASELLPKLVKLTKKVTEQVRQLA 594
           EA  R+NE    G+ MI + G +       G G   N  E     V+  +K+ E++RQ+ 
Sbjct: 126 EALRRINE----GAAMIRTKGEA-------GTG---NIVEA----VRHMRKINEEIRQIQ 167

Query: 595 KDEDEDLAETNPYDVIPPYD---QAKALGKTNIDVDRIAAGLPCGSEGFLLM 643
              +E+       ++  P +   +   LGK  + V+  A G+   ++  L+M
Sbjct: 168 NMLEEEDLAAVAKELRVPVELLLEVLKLGKLPV-VNFAAGGVATPADAALMM 218


>gnl|CDD|223292 COG0214, SNZ1, Pyridoxine biosynthesis enzyme [Coenzyme
           metabolism].
          Length = 296

 Score = 30.0 bits (68), Expect = 5.5
 Identities = 25/103 (24%), Positives = 45/103 (43%), Gaps = 28/103 (27%)

Query: 535 EAKARLNEIIKSGSKMIHSNGSSDCPWMADGVGLPPNASELLPKLVKLTKKVTEQVRQLA 594
           EA  R++E    G+ MI + G +       G G   N  E     V+  +K+  ++R+L 
Sbjct: 136 EALRRISE----GAAMIRTKGEA-------GTG---NVVEA----VRHMRKINGEIRRLQ 177

Query: 595 KDEDEDLAE-----TNPYDVIPPYDQAKALGKTNIDVDRIAAG 632
              +++L         PY+++    +   LG+  + V   AAG
Sbjct: 178 SMTEDELYVVAKELQAPYELV---KEVAKLGR--LPVVNFAAG 215


>gnl|CDD|226304 COG3781, COG3781, Predicted membrane protein [Function unknown].
          Length = 306

 Score = 30.1 bits (68), Expect = 6.2
 Identities = 12/34 (35%), Positives = 17/34 (50%), Gaps = 1/34 (2%)

Query: 723 ARRLLGKLLIDLRN-TREEAISVAELKSSQDQVS 755
           AR+L G+L+  +RN  R+    V E   S D   
Sbjct: 82  ARKLWGQLVNAIRNLARQIWTLVPETLDSADVRE 115


>gnl|CDD|239172 cd02771, MopB_NDH-1_NuoG2-N7, MopB_NDH-1_NuoG2-N7: The second
           domain of the NuoG subunit (with a [4Fe-4S] cluster, N7)
           of the NADH-quinone oxidoreductase/NADH dehydrogenase-1
           (NDH-1) found in various bacteria. The NDH-1 is the
           first energy-transducting complex in the respiratory
           chain and functions as a redox pump that uses the redox
           energy to translocate H+ ions across the membrane,
           resulting in a significant contribution to energy
           production. In Escherichia coli NDH-1, the largest
           subunit is encoded by the nuoG gene, and is part of the
           14 distinct subunits constituting the functional enzyme.
           The NuoG subunit is made of two domains: the first
           contains three binding sites for FeS clusters (the fer2
           domain), the second domain (this CD), is of unknown
           function or, as postulated, has lost an ancestral
           formate dehydrogenase activity that became redundant
           during the evolution of the complex I enzyme. Unique to
           this group, compared to the other prokaryotic and
           eukaryotic groups in this domain protein family
           (NADH-Q-OR-NuoG2), is an N-terminal [4Fe-4S] cluster
           (N7/N1c) present in the second domain. Although only
           vestigial sequence evidence remains of a molybdopterin
           binding site, this protein domain belongs to the
           molybdopterin_binding (MopB) superfamily of proteins.
          Length = 472

 Score = 30.0 bits (68), Expect = 6.4
 Identities = 23/135 (17%), Positives = 39/135 (28%), Gaps = 15/135 (11%)

Query: 494 AAAFAKGLLDLEGQLTPILVSLVSKDSSMLDGLDNASIEMEEAKARLNEIIKSGSKMIHS 553
           +AA   GL   E           +K   ++ G  + S+E+ +A A L + +K   +    
Sbjct: 242 SAAGVSGLAPKEKAARIAARLTGAKKPLIVSGTLSGSLELIKAAANLAKALKRRGEN--- 298

Query: 554 NGSSDCPWMADGVGLPPNASELLP--KLVKLTKKVTEQVRQLAKDEDED---LAETNPYD 608
                            N+  LL     V       +      +D   D   +   + Y 
Sbjct: 299 -----AGLT--LAVEEGNSPGLLLLGGHVTEPGLDLDGALAALEDGSADALIVLGNDLYR 351

Query: 609 VIPPYDQAKALGKTN 623
             P      AL    
Sbjct: 352 SAPERRVEAALDAAE 366


>gnl|CDD|151671 pfam11229, DUF3028, Protein of unknown function (DUF3028).  This
           eukaryotic family of proteins has no known function.
          Length = 589

 Score = 30.0 bits (67), Expect = 8.5
 Identities = 19/68 (27%), Positives = 36/68 (52%), Gaps = 10/68 (14%)

Query: 512 LVSLVSKDSSMLDGLDNASIEMEEAKARLNEIIKSGSKMIHSNGSSDCPWMADGVGLPPN 571
           +V+LV  +  ++  L + +I+    + RLNE+I++ +++I  +G          +GL  N
Sbjct: 68  MVALVGSEGDVMQ-LKSEAIQNSHFQGRLNEVIRTLTQVISVSGV---------IGLQSN 117

Query: 572 ASELLPKL 579
           A  LL  L
Sbjct: 118 AIWLLGHL 125


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.318    0.136    0.402 

Gapped
Lambda     K      H
   0.267   0.0747    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 51,468,653
Number of extensions: 5209315
Number of successful extensions: 5043
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5031
Number of HSP's successfully gapped: 37
Length of query: 993
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 887
Effective length of database: 6,236,078
Effective search space: 5531401186
Effective search space used: 5531401186
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 64 (28.4 bits)