RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy8911
         (280 letters)



>gnl|CDD|221713 pfam12689, Acid_PPase, Acid Phosphatase.  This family contains
           phosphatase enzymes and other proteins of the HAD
           superfamily. It includes MDP-1 which is a eukaryotic
           magnesium-dependent acid phosphatase.
          Length = 164

 Score =  192 bits (489), Expect = 1e-61
 Identities = 67/155 (43%), Positives = 91/155 (58%), Gaps = 8/155 (5%)

Query: 133 LDYTLWPLHVHDLV-APFKKIGQ-KVMDAKGTLIKYYRGVPEILRYLKENKCLVAAASRT 190
           LDYTLWP      V  PFKK     V+D +G  +  Y  VP IL+ LKE    +A ASRT
Sbjct: 10  LDYTLWPFWCDTHVGPPFKKDKNGTVVDRRGEELSLYPDVPSILQELKEKGITIAVASRT 69

Query: 191 SEILHAKQILNLINLN------QYFSNKEIYPGPKTTHFESLKKATGIEYKDMVFFDDEE 244
                A+++L L+++N       YF   EIYPG KTTHF+ + K +G+ Y DM+FFDDE 
Sbjct: 70  DAPDWARELLKLLHINDGPPAIDYFDYLEIYPGSKTTHFKKIHKKSGVPYSDMLFFDDES 129

Query: 245 RNSHDVSPLGVTCIHVKKGMSHAVLQKGLKQWASK 279
           RN   VS LGVT + V  G++    ++GL++W  +
Sbjct: 130 RNIDVVSRLGVTFVLVPDGLTREEFERGLRKWRKR 164



 Score =  171 bits (435), Expect = 1e-53
 Identities = 58/118 (49%), Positives = 74/118 (62%), Gaps = 6/118 (5%)

Query: 1   DKVVDARGAHIKYYPDVPGILKYLKQNNCLVAAASRTSEIQGAQQLLDLFNWN------Q 54
             VVD RG  +  YPDVP IL+ LK+    +A ASRT     A++LL L + N       
Sbjct: 33  GTVVDRRGEELSLYPDVPSILQELKEKGITIAVASRTDAPDWARELLKLLHINDGPPAID 92

Query: 55  YFDHKQIFPGQKTTHFANLKKATGIEYKDMVFFDDEERNSHDVSPLGVTCILVEDGMT 112
           YFD+ +I+PG KTTHF  + K +G+ Y DM+FFDDE RN   VS LGVT +LV DG+T
Sbjct: 93  YFDYLEIYPGSKTTHFKKIHKKSGVPYSDMLFFDDESRNIDVVSRLGVTFVLVPDGLT 150


>gnl|CDD|233529 TIGR01685, MDP-1, magnesium-dependent phosphatase-1.  This model
           represents two closely related clades of sequences from
           eukaryotes and archaea. The mouse enzyme has been
           characterized as a phosphatase and has been positively
           identified as a member of the haloacid dehalogenase
           (HAD) superfamily by site-directed mutagenesis of the
           active site residues.
          Length = 174

 Score = 65.3 bits (159), Expect = 6e-13
 Identities = 42/166 (25%), Positives = 69/166 (41%), Gaps = 20/166 (12%)

Query: 133 LDYTLWPLH-VHDLVAPFKKIGQK---VMDAKGTLIKYYRGVPEILRYLKENKCLVAAAS 188
           LD TLW  + +  L  PFK + Q    ++D  GT +   + V ++L+ LK+    +A AS
Sbjct: 9   LDGTLWDHYMISLLGGPFKPVKQNNSIIIDKSGTEVTLIKEVRDVLQTLKDAGTYLATAS 68

Query: 189 RTSEILHAKQILNL---------INLNQYFSNKEIYPGPKTT--HFESLKK--ATG---I 232
                  A +IL           + ++  F ++     P         L+K        +
Sbjct: 69  WNDVPEWAYEILGTFEITYAGKTVPMHSLFDDRIEIYKPNKAKQLEMILQKVNKVDPSVL 128

Query: 233 EYKDMVFFDDEERNSHDVSPLGVTCIHVKKGMSHAVLQKGLKQWAS 278
           +   ++FFDD   N  +V   GVT  +   GM     +K L+ W S
Sbjct: 129 KPAQILFFDDRTDNVREVWGYGVTSCYCPSGMDKGTFKKILEMWRS 174



 Score = 65.3 bits (159), Expect = 7e-13
 Identities = 26/126 (20%), Positives = 45/126 (35%), Gaps = 16/126 (12%)

Query: 3   VVDARGAHIKYYPDVPGILKYLKQNNCLVAAASRTSEIQGAQQLLDLFN---------WN 53
           ++D  G  +    +V  +L+ LK     +A AS     + A ++L  F           +
Sbjct: 36  IIDKSGTEVTLIKEVRDVLQTLKDAGTYLATASWNDVPEWAYEILGTFEITYAGKTVPMH 95

Query: 54  QYFDHKQIFP-GQKTTHFANL------KKATGIEYKDMVFFDDEERNSHDVSPLGVTCIL 106
             FD +       K      +         + ++   ++FFDD   N  +V   GVT   
Sbjct: 96  SLFDDRIEIYKPNKAKQLEMILQKVNKVDPSVLKPAQILFFDDRTDNVREVWGYGVTSCY 155

Query: 107 VEDGMT 112
              GM 
Sbjct: 156 CPSGMD 161


>gnl|CDD|233526 TIGR01681, HAD-SF-IIIC, HAD-superfamily phosphatase, subfamily
           IIIC.  This model represents the IIIC subfamily of the
           Haloacid Dehalogenase (HAD) superfamily of aspartate
           nucleophile hydrolases. Subfamily III (also including
           IIIA - TIGR01662 and IIIB - pfam03767) contains
           sequences which do not contain either of the insert
           domains (between the 1st and 2nd conserved catalytic
           motifs, subfamily I - TIGR01493, TIGR01509, TIGR01549,
           TIGR01488, TIGR01494, TIGR01658, TIGR01544 and
           TIGR01545, or between the 2nd and 3rd, subfamily II -
           TIGR01460 and TIGR01484). Subfamily IIIC contains five
           relatively distantly related clades: a family of viral
           proteins (TIGR01684), a family of eukaryotic proteins
           called MDP-1 and a family of archaeal proteins most
           closely related to MDP-1 (TIGR01685), a family of
           bacteria including the Streptomyces FkbH protein
           (TIGR01686), and a small clade including the Pasteurella
           BcbF and EcbF proteins. The overall lack of species
           overlap among these clades may indicate a conserved
           function, but the degree of divergence between the
           clades and the differences in archetecture outside of
           the domain in some clades warns against such a
           conclusion. No member of this subfamily is characterized
           with respect to function, however the MDP-1 protein is a
           characterized phosphatase. All of the characterized
           enzymes within subfamily III are phosphatases, and all
           of the active site residues characteristic of
           HAD-superfamily phosphatases are present in subfamily
           IIIC.
          Length = 128

 Score = 44.7 bits (106), Expect = 5e-06
 Identities = 34/129 (26%), Positives = 53/129 (41%), Gaps = 20/129 (15%)

Query: 133 LDYTLWPLHVHDLVAPFKKIGQKVMDAKGTLIKYYRGVPEILRYLKENKCLVAAASRTS- 191
           LD TLW             +G+  +      IK    + + L+ LK+N  L+A AS    
Sbjct: 7   LDNTLWTGE-------NIVVGEDPIIDLEVTIKE---IRDKLQTLKKNGFLLALASYNDD 56

Query: 192 -----EILHAKQILNLIN-LNQYFSN-KEIYPGPKT-THFESLKKATG-IEYKDMVFFDD 242
                E+L   +   +I  L +YF      Y  PK+    E   K  G ++ K ++F DD
Sbjct: 57  PHVAYELLKIFEDFGIIFPLAEYFDPLTIGYWLPKSPRLVEIALKLNGVLKPKSILFVDD 116

Query: 243 EERNSHDVS 251
              N+ +V 
Sbjct: 117 RPDNNEEVD 125



 Score = 40.5 bits (95), Expect = 1e-04
 Identities = 25/95 (26%), Positives = 41/95 (43%), Gaps = 10/95 (10%)

Query: 14  YPDVPGILKYLKQNNCLVAAASRTSEIQGAQQLLDLF------NW-NQYFDHKQIFP-GQ 65
             ++   L+ LK+N  L+A AS   +   A +LL +F          +YFD   I     
Sbjct: 31  IKEIRDKLQTLKKNGFLLALASYNDDPHVAYELLKIFEDFGIIFPLAEYFDPLTIGYWLP 90

Query: 66  KTTHFAN--LKKATGIEYKDMVFFDDEERNSHDVS 98
           K+       LK    ++ K ++F DD   N+ +V 
Sbjct: 91  KSPRLVEIALKLNGVLKPKSILFVDDRPDNNEEVD 125


>gnl|CDD|227329 COG4996, COG4996, Predicted phosphatase [General function
           prediction only].
          Length = 164

 Score = 37.5 bits (87), Expect = 0.002
 Identities = 22/97 (22%), Positives = 42/97 (43%), Gaps = 8/97 (8%)

Query: 2   KVVDARGAHIKYYPDVPGILKYLKQNNCLVAAASRTSEIQGAQQLLDLFNWNQYFDHKQI 61
            + D++G  +  +PDV   LK+ + +  ++  AS   E   A + L   +  QYF +  I
Sbjct: 31  TIEDSKGREVHLFPDVKETLKWARNSGYILGLASWNFE-DKAIKALRALDLLQYFHYIVI 89

Query: 62  FPG-------QKTTHFANLKKATGIEYKDMVFFDDEE 91
            P         +     N ++   I+  ++V+ DD  
Sbjct: 90  EPHPYKFLMLSQLLREINTERNQKIKPSEIVYLDDRR 126


>gnl|CDD|233675 TIGR01993, Pyr-5-nucltdase, pyrimidine 5'-nucleotidase.  This
           family of proteins includes the SDT1/SSM1 gene from
           yeast which has been shown to code for a pyrimidine
           (UMP/CMP) 5'nucleotidase. The family spans plants, fungi
           and a small number of bacteria. These enzymes are
           members of the haloacid dehalogenase (HAD) superfamily
           of hydrolases, specifically the IA subfamily (variant 3,
           TIGR01509).
          Length = 183

 Score = 31.6 bits (72), Expect = 0.26
 Identities = 22/97 (22%), Positives = 39/97 (40%), Gaps = 13/97 (13%)

Query: 172 EILRYLKENKCLVAAASRTSEILHAKQILNLINLNQYF--------SNKEIYPGPKTTHF 223
            +L  L   K +     R     HA++ L  + +   F        +N ++ P P    +
Sbjct: 91  NLLLRLPGRKIIFTNGDRA----HARRALRRLGIEDCFDGIFCFDTANPDLLPKPSPQAY 146

Query: 224 ESLKKATGIEYKDMVFFDDEERNSHDVSPLGV-TCIH 259
           E   +  G++ +  +FFDD  RN      LG+ T + 
Sbjct: 147 EKALREAGVDPERAIFFDDSARNIAAGKALGMKTVLV 183


>gnl|CDD|222115 pfam13419, HAD_2, Haloacid dehalogenase-like hydrolase. 
          Length = 176

 Score = 31.5 bits (72), Expect = 0.26
 Identities = 21/102 (20%), Positives = 40/102 (39%), Gaps = 16/102 (15%)

Query: 169 GVPEILRYLKENKCLVAAASRTSEILHAKQILNLINLNQYFSN-------KEIYPGPKTT 221
            V E+LR LK     +   S        +++L  + L   F             P P+  
Sbjct: 81  DVVELLRRLKAKGVKLVILS-NGSREAVERLLEKLGLLDLFDAVFTSDDVGARKPDPEA- 138

Query: 222 HFESLKKATGIEYKDMVFFDDEERNSHDVSP---LGVTCIHV 260
            +E + +  G+  ++++F DD   +  D+      G+  +HV
Sbjct: 139 -YERVLERLGLPPEEILFIDD---SPEDLEAARAAGIKTVHV 176



 Score = 30.4 bits (69), Expect = 0.60
 Identities = 19/104 (18%), Positives = 40/104 (38%), Gaps = 16/104 (15%)

Query: 14  YPDVPGILKYLKQNNCLVAAASRTSEIQGAQQLLDLFNWNQYFDHKQIFPGQKTTH---- 69
           +PDV  +L+ LK     +   S     +  ++LL+       FD   +F           
Sbjct: 79  FPDVVELLRRLKAKGVKLVILS-NGSREAVERLLEKLGLLDLFD--AVFTSDDVGARKPD 135

Query: 70  ---FANLKKATGIEYKDMVFFDDEERNSHDVSP---LGVTCILV 107
              +  + +  G+  ++++F DD   +  D+      G+  + V
Sbjct: 136 PEAYERVLERLGLPPEEILFIDD---SPEDLEAARAAGIKTVHV 176


>gnl|CDD|226825 COG4390, COG4390, Uncharacterized protein conserved in bacteria
          [Function unknown].
          Length = 106

 Score = 29.9 bits (67), Expect = 0.47
 Identities = 16/48 (33%), Positives = 21/48 (43%)

Query: 3  VVDARGAHIKYYPDVPGILKYLKQNNCLVAAASRTSEIQGAQQLLDLF 50
          V D  G  + + P V G L  LK  N L  A     + + A+Q L  F
Sbjct: 21 VQDGDGKTVHHIPAVQGKLGSLKVYNALAEAFDGKLDKEAAEQGLIWF 68


>gnl|CDD|233443 TIGR01509, HAD-SF-IA-v3, haloacid dehalogenase superfamily,
           subfamily IA, variant 3 with third motif having DD or
           ED.  This model represents part of one structural
           subfamily of the Haloacid Dehalogenase (HAD) superfamily
           of aspartate-nucleophile hydrolases. The superfamily is
           defined by the presence of three short catalytic motifs.
           The subfamilies are defined based on the location and
           the observed or predicted fold of a so-called "capping
           domain", or the absence of such a domain. Subfamily I
           consists of sequences in which the capping domain is
           found in between the first and second catalytic motifs.
           Subfamily II consists of sequences in which the capping
           domain is found between the second and third motifs.
           Subfamily III sequences have no capping domain in either
           of these positions.The Subfamily IA and IB capping
           domains are predicted by PSI-PRED to consist of an alpha
           helical bundle. Subfamily I encompasses such a wide
           region of sequence space (the sequences are highly
           divergent) that representing it with a single model is
           impossible, resulting in an overly broad description
           which allows in many unrelated sequences. Subfamily IA
           and IB are separated based on an aparrent phylogenetic
           bifurcation. Subfamily IA is still too broad to model,
           but cannot be further subdivided into large chunks based
           on phylogenetic trees. Of the three motifs defining the
           HAD superfamily, the third has three variant forms : (1)
           hhhhsDxxx(x)D, (2) hhhhssxxx(x)D and (3) hhhhDDxxx(x)s
           where _s_ refers to a small amino acid and _h_ to a
           hydrophobic one. All three of these variants are found
           in subfamily IA. Individual models were made based on
           seeds exhibiting only one of the variants each. Variant
           3 (this model) is found in the enzymes
           beta-phosphoglucomutase (TIGR01990) and
           deoxyglucose-6-phosphatase, while many other enzymes of
           subfamily IA exhibit this variant as well as variant 1
           (TIGR01549). These three variant models were created
           withthe knowledge that there will be overlap among them
           - this is by design and serves the purpose of
           eliminating the overlap with models of more distantly
           relatedHAD subfamilies caused by an overly broad single
           model [Unknown function, Enzymes of unknown
           specificity].
          Length = 177

 Score = 30.5 bits (69), Expect = 0.68
 Identities = 16/90 (17%), Positives = 28/90 (31%), Gaps = 11/90 (12%)

Query: 11  IKYYPDVPGILKYLKQNNCLVAAASRTSEIQGAQ-QLLDLFNWNQYFDHKQIFPGQKTTH 69
           +K  P V  +L+ L+     +A  + +          L L      FD   I        
Sbjct: 78  LKPLPGVRALLEALRARGKKLALLTNSPRADAKLVLELGLR---ALFDVV-IDSSDVGLG 133

Query: 70  ------FANLKKATGIEYKDMVFFDDEERN 93
                 +    K  G++  + +F DD    
Sbjct: 134 KPDPDIYLQALKKLGLKPSECLFVDDSPAG 163


>gnl|CDD|223943 COG1011, COG1011, Predicted hydrolase (HAD superfamily) [General
           function prediction only].
          Length = 229

 Score = 30.3 bits (68), Expect = 0.84
 Identities = 21/117 (17%), Positives = 39/117 (33%), Gaps = 5/117 (4%)

Query: 134 DYTLWPLHVHDLVAPFKKIGQKVMDAKGTLIKYYRGVPEILRYLKENKCLVA----AASR 189
              L  L    L     ++ ++++ A   L+  Y    E L+ L +   L      A   
Sbjct: 68  LERLLELLERLLGDEDAELVEELLAALAKLLPDYPEALEALKELGKKYKLGILTNGARPH 127

Query: 190 TSEILHAKQILNLINLNQYFSNKEIYPGPKTTHFESLKKATGIEYKDMVFFDDEERN 246
               L    +L+  +   + S       P    FE   +  G+  ++ +F  D   N
Sbjct: 128 QERKLRQLGLLDYFDA-VFISEDVGVAKPDPEIFEYALEKLGVPPEEALFVGDSLEN 183


>gnl|CDD|181872 PRK09456, PRK09456, ?-D-glucose-1-phosphatase; Provisional.
          Length = 199

 Score = 30.0 bits (68), Expect = 0.93
 Identities = 16/34 (47%), Positives = 18/34 (52%)

Query: 76  ATGIEYKDMVFFDDEERNSHDVSPLGVTCILVED 109
           A G    D VFFDD   N    + LG+T ILV D
Sbjct: 153 AEGFSAADAVFFDDNADNIEAANALGITSILVTD 186



 Score = 30.0 bits (68), Expect = 0.99
 Identities = 14/32 (43%), Positives = 16/32 (50%)

Query: 229 ATGIEYKDMVFFDDEERNSHDVSPLGVTCIHV 260
           A G    D VFFDD   N    + LG+T I V
Sbjct: 153 AEGFSAADAVFFDDNADNIEAANALGITSILV 184


>gnl|CDD|136007 PRK06698, PRK06698, bifunctional
           5'-methylthioadenosine/S-adenosylhomocysteine
           nucleosidase/phosphatase; Validated.
          Length = 459

 Score = 29.2 bits (65), Expect = 2.7
 Identities = 18/51 (35%), Positives = 26/51 (50%), Gaps = 3/51 (5%)

Query: 14  YPDVPGILKYLKQNNCLVAAASR--TSEIQGAQQLLDLFNW-NQYFDHKQI 61
           YP+V  I  Y+K+NNC +  AS   T  ++      DL  W  + F  +QI
Sbjct: 332 YPNVKEIFTYIKENNCSIYIASNGLTEYLRAIVSYYDLDQWVTETFSIEQI 382



 Score = 28.8 bits (64), Expect = 3.9
 Identities = 20/63 (31%), Positives = 34/63 (53%), Gaps = 6/63 (9%)

Query: 149 FKKIGQKVMDAKGTLIKYYRGVPEILRYLKENKCLVAAASR-TSEILHAKQILNLINLNQ 207
            +++ + +   KG L   Y  V EI  Y+KEN C +  AS   +E L A  I++  +L+Q
Sbjct: 317 LERLIENIKSGKGAL---YPNVKEIFTYIKENNCSIYIASNGLTEYLRA--IVSYYDLDQ 371

Query: 208 YFS 210
           + +
Sbjct: 372 WVT 374


>gnl|CDD|185197 PRK15297, PRK15297, putative fimbrial protein StiH; Provisional.
          Length = 359

 Score = 28.9 bits (64), Expect = 2.8
 Identities = 15/43 (34%), Positives = 23/43 (53%)

Query: 79  IEYKDMVFFDDEERNSHDVSPLGVTCILVEDGMTNAITLYGRS 121
           IE KD+    D  +NSH  + +  TC  V DG   +++L G +
Sbjct: 240 IEGKDIRHLGDSPQNSHVTTQVNFTCSNVADGTNLSMSLNGAT 282


>gnl|CDD|119389 cd01427, HAD_like, Haloacid dehalogenase-like hydrolases. The
          haloacid dehalogenase-like (HAD) superfamily includes
          L-2-haloacid dehalogenase, epoxide hydrolase,
          phosphoserine phosphatase, phosphomannomutase,
          phosphoglycolate phosphatase, P-type ATPase, and many
          others, all of which use a nucleophilic aspartate in
          their phosphoryl transfer reaction. All members possess
          a highly conserved alpha/beta core domain, and many
          also possess a small cap domain, the fold and function
          of which is variable. Members of this superfamily are
          sometimes referred to as belonging to the DDDD
          superfamily of phosphohydrolases.
          Length = 139

 Score = 28.1 bits (63), Expect = 3.3
 Identities = 16/58 (27%), Positives = 26/58 (44%), Gaps = 1/58 (1%)

Query: 1  DKVVDARGAHIKYYPDVPGILKYLKQNNCLVAAASRTSEIQGAQQLLDLFNWNQYFDH 58
           +   A    ++ YP V   LK LK+    +A A+  S  +   +LL+    + YFD 
Sbjct: 13 SEPGIAEIEELELYPGVKEALKELKEKGIKLALATNKS-RREVLELLEELGLDDYFDP 69


>gnl|CDD|233397 TIGR01405, polC_Gram_pos, DNA polymerase III, alpha chain,
           Gram-positive type.  This model describes a polypeptide
           chain of DNA polymerase III. Full-length homologs of
           this protein are restricted to the Gram-positive
           lineages, including the Mycoplasmas. This protein is
           designated alpha chain and given the gene symbol polC,
           but is not a full-length homolog of other polC genes.
           The N-terminal region of about 200 amino acids is rich
           in low-complexity sequence, poorly alignable, and not
           included n this model [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 1213

 Score = 28.5 bits (64), Expect = 4.4
 Identities = 37/146 (25%), Positives = 59/146 (40%), Gaps = 21/146 (14%)

Query: 59  KQIFPGQKTTHFANLKKATGIEYKDMVFFDDEERNSHDVSPLG-VTCILVEDGMTNAITL 117
           + + P  K+    N+ K  G++       DD  R  +D      V  ++VE      IT 
Sbjct: 309 RALNPEYKSHRLGNICKKLGVDL------DDHHRADYDAEATAKVFKVMVEQLKEKGITN 362

Query: 118 YGRSVFIISNHLSKKLDYT-LWPLHVHDLV---APFKKIGQKVMDAKGTLIKYYRGVPEI 173
                  ++N LS +  Y  L P H+       A  K + + V  +   L KY+   P I
Sbjct: 363 LEE----LNNKLSSEELYKRLRPNHIIIYAKNQAGLKNLYKLVSIS---LTKYFYTRPRI 415

Query: 174 LR-YLKENK--CLVAAASRTSEILHA 196
           LR  LK+ +   L+ +A    E+  A
Sbjct: 416 LRSLLKKYREGLLIGSACSEGELFDA 441


>gnl|CDD|223710 COG0637, COG0637, Predicted phosphatase/phosphohexomutase [General
           function prediction only].
          Length = 221

 Score = 28.2 bits (63), Expect = 4.5
 Identities = 15/41 (36%), Positives = 22/41 (53%), Gaps = 1/41 (2%)

Query: 169 GVPEILRYLKENKCLVAAASRTSEILHAKQILNLINLNQYF 209
           GV E+L  LK     +A AS +S    A+++L  + L  YF
Sbjct: 90  GVVELLEQLKARGIPLAVAS-SSPRRAAERVLARLGLLDYF 129


>gnl|CDD|240135 cd04865, LigD_Pol_like_2, LigD_Pol_like_2: Polymerase (Pol) domain
           of bacterial LigD proteins similar to Pseudomonas
           aeruginosa (Pae) LigD, subgroup 2. The LigD Pol domain
           belongs to the archaeal/eukaryal primase (AEP)
           superfamily. In prokaryotes, LigD along with Ku is
           required for non-homologous end joining (NHEJ)-mediated
           repair of DNA double-strand breaks (DSB). NHEJ-mediated
           DNA DSB repair is error-prone. It has been suggested
           that LigD Pol contributes to NHEJ-mediated DNA DSB
           repair in vivo, by filling in short 5'-overhangs with
           ribonucleotides; the filled in termini would then be
           sealed by the associated LigD ligase domain, resulting
           in short stretches of RNA incorporated into the genomic
           DNA. The Pol domains of PaeLigD and Mycobacterium
           tuberculosis (Mt)LigD are stimulated by manganese, are
           error-prone, and prefer adding rNTPs to dNTPs in vitro;
           however PaeLigD and MtLigD belong to other subgroups,
           proteins in this subgroup await functional
           characterization.
          Length = 228

 Score = 28.0 bits (63), Expect = 4.7
 Identities = 11/21 (52%), Positives = 15/21 (71%), Gaps = 1/21 (4%)

Query: 160 KGTLIKYYRGV-PEILRYLKE 179
           K  LI+YYR V P +L YL++
Sbjct: 1   KRDLIRYYRAVAPYLLPYLRD 21


>gnl|CDD|220488 pfam09950, DUF2184, Uncharacterized protein conserved in bacteria
           (DUF2184).  This domain, found in various hypothetical
           bacterial proteins, has no known function.
          Length = 247

 Score = 28.0 bits (63), Expect = 5.5
 Identities = 10/27 (37%), Positives = 17/27 (62%)

Query: 20  ILKYLKQNNCLVAAASRTSEIQGAQQL 46
           IL++L++NN   A   R   I+G ++L
Sbjct: 155 ILEFLQENNIYTAQTGRPLTIRGVKEL 181


>gnl|CDD|222551 pfam14113, DUF4285, Domain of unknown function (DUF4285).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria and eukaryotes.
           Proteins in this family are typically between 157 and
           206 amino acids in length.
          Length = 117

 Score = 27.0 bits (60), Expect = 6.0
 Identities = 10/32 (31%), Positives = 13/32 (40%), Gaps = 2/32 (6%)

Query: 150 KKIGQKVMDAKGTLIKYYRGVPEILRYLKENK 181
             IG+ V    G   KY   V E+  +LK   
Sbjct: 25  FSIGRIVGGGDGK--KYIIRVKELADWLKLKP 54


>gnl|CDD|118668 pfam10140, YukC, WXG100 protein secretion system (Wss), protein
           YukC.  Members of this family of proteins include
           predicted membrane proteins homologous to YukC in B.
           subtilis. The YukC protein family would participate to
           the formation of a translocon required for the secretion
           of WXG100 proteins (pfam06013) in monoderm bacteria, the
           WXG100 protein secretion system (Wss). This family
           includes EssB in Staphylococcus aureus.
          Length = 359

 Score = 27.6 bits (62), Expect = 7.7
 Identities = 21/78 (26%), Positives = 31/78 (39%), Gaps = 21/78 (26%)

Query: 118 YGRSVFIISNHLSKKLDYTLWPLHVHDLVAPFKKIGQKVMDAKGTLIKYYRGVPEIL--- 174
              +V  +  HL+K+L + L P ++             V D   T    +RGV E L   
Sbjct: 64  LALNVAALEEHLNKRLTFILHPENL-------------VFDKGLTPKFIHRGVKESLPPY 110

Query: 175 -----RYLKENKCLVAAA 187
                R+LKE K  + A 
Sbjct: 111 ELDEERFLKEYKAFIIAL 128


>gnl|CDD|236233 PRK08311, PRK08311, putative RNA polymerase sigma factor SigI;
           Reviewed.
          Length = 237

 Score = 27.5 bits (62), Expect = 7.9
 Identities = 16/59 (27%), Positives = 23/59 (38%), Gaps = 9/59 (15%)

Query: 128 HLSKKLDYTLWPLHVHDLVAPFKKIGQKVMDAKGTLIKYYRGV---PEILRYLKENKCL 183
              K+L    + +   DLV    K   K  D +   IK  + +    E+L  LK  K L
Sbjct: 136 EFKKEL--KEFGITFEDLV----KESPKHRDTRENAIKIAKTIAENEELLEKLKRKKKL 188


>gnl|CDD|236298 PRK08574, PRK08574, cystathionine gamma-synthase; Provisional.
          Length = 385

 Score = 27.3 bits (61), Expect = 9.9
 Identities = 12/35 (34%), Positives = 18/35 (51%)

Query: 145 LVAPFKKIGQKVMDAKGTLIKYYRGVPEILRYLKE 179
           L A F++ G+     +G  +KY R     LR L+E
Sbjct: 26  LTALFRQEGEAYPSDRGFDLKYSREENPTLRPLEE 60


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.136    0.406 

Gapped
Lambda     K      H
   0.267   0.0663    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 14,328,963
Number of extensions: 1355516
Number of successful extensions: 1116
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1105
Number of HSP's successfully gapped: 42
Length of query: 280
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 184
Effective length of database: 6,679,618
Effective search space: 1229049712
Effective search space used: 1229049712
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 58 (26.3 bits)