RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy16054
         (563 letters)



>gnl|CDD|200596 cd10974, CE4_CDA_like_1, Putative catalytic domain of chitin
           deacetylase-like proteins with additional chitin-binding
           peritrophin-A domain (ChBD) and/or a low-density
           lipoprotein receptor class A domain (LDLa).  Chitin
           deacetylases (CDAs, EC 3.5.1.41) are secreted
           metalloproteins belonging to a family of extracellular
           chitin-modifying enzymes that catalyze the
           N-deacetylation of chitin, a beta-1,4-linked
           N-acetylglucosamine polymer, to form chitosan, a polymer
           of beta-(1,4)-linked d-glucosamine residues. CDAs have
           been isolated and characterized from various bacterial
           and fungal species and belong to the larger carbohydrate
           esterase 4 (CE4) superfamily. This family includes many
           CDA-like proteins mainly from insects, which contain a
           putative CDA-like catalytic domain similar to the
           catalytic NodB homology domain of CE4 esterases. In
           addition to the CDA-like domain, family members contain
           two additional domains, a chitin-binding peritrophin-A
           domain (ChBD) and a low-density lipoprotein receptor
           class A domain (LDLa), or have the ChBD domain but do
           not have the LDLa domain.
          Length = 269

 Score =  470 bits (1211), Expect = e-165
 Identities = 166/277 (59%), Positives = 209/277 (75%), Gaps = 9/277 (3%)

Query: 218 PQMITITFNGAVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHKYTNYASVQELHRKGH 277
           PQMIT+TF+ A+N +N +LY +IFNG R NPNGC IKGTFFVSH+YTNY +VQ+LHRKGH
Sbjct: 1   PQMITLTFDDAINDNNIELYKKIFNGKRNNPNGCPIKGTFFVSHEYTNYQAVQKLHRKGH 60

Query: 278 EISVFSLTHKDDPKYWSGGSYDDWLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGGNK 337
           EI+V S+TH DD       +Y+DW+ EM G R I+E+FANI+D  ++G+RAP+LRVGGN+
Sbjct: 61  EIAVHSITHNDDEN---NATYEDWVKEMVGMREILEKFANITDNEIVGMRAPFLRVGGNR 117

Query: 338 QFSMMTDQFFVYDASITASLGRVPIWPYTLYFRMPHKCNGNAHNCPSKSHPT-WEMVMNE 396
           QF MM +  F+YD+SITA    VP+WPYTL ++MPH+C+G   NCP++S P  WEMV+NE
Sbjct: 118 QFEMMEEFGFLYDSSITAPPSNVPLWPYTLDYKMPHECHG--QNCPTRSFPGVWEMVLNE 175

Query: 397 LDRRDDPTFDESLPGCHMVDSCSNIQTGEQFGRLLRHNFNRHFSTNRAPLGLHFHASWLK 456
           LD RDDP  DE L    M DSC NI +G+Q    L+HNF RH+ TNRAP GL+FH +WLK
Sbjct: 176 LDVRDDPQGDEPL---AMDDSCLNILSGDQVYEWLQHNFERHYLTNRAPYGLYFHTNWLK 232

Query: 457 SKKEFKDELIKFIEEMLDRNDVYFVTMLQVIQWMQNP 493
           +K E    L KF++E+L   DVYFVTM Q IQWMQNP
Sbjct: 233 TKNELLRALQKFLDEILQLPDVYFVTMTQAIQWMQNP 269


>gnl|CDD|200597 cd10975, CE4_CDA_like_2, Putative catalytic domain of chitin
           deacetylase-like proteins.  Chitin deacetylases (CDAs,
           EC 3.5.1.41) are secreted metalloproteins belonging to a
           family of extracellular chitin-modifying enzymes that
           catalyze the N-deacetylation of chitin, a
           beta-1,4-linked N-acetylglucosamine polymer, to form
           chitosan, a polymer of beta-(1,4)-linked d-glucosamine
           residues. CDAs have been isolated and characterized from
           various bacterial and fungal species and belong to the
           larger carbohydrate esterase 4 (CE4) superfamily. This
           family includes many midgut-specific CDA-like proteins
           mainly from insects, such as Tribolium castaneum CDAs
           (TcCDA6-9). These proteins contain a putative CDA-like
           catalytic domain similar to the catalytic NodB homology
           domain of CE4 esterases. In addition to the CDA-like
           domain, some family members have an additional
           chitin-binding peritrophin-A domain (ChBD).
          Length = 268

 Score =  279 bits (715), Expect = 1e-90
 Identities = 113/278 (40%), Positives = 161/278 (57%), Gaps = 12/278 (4%)

Query: 218 PQMITITFNGAVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHKYTNYASVQELHRKGH 277
           PQ++T+TF+ AVN  N   Y+++F  NR+NPNGC I  TFFVSH+YT+Y  VQEL+  GH
Sbjct: 1   PQLVTLTFDDAVNTLNYPYYEKLFG-NRKNPNGCPIGATFFVSHEYTDYRLVQELYNDGH 59

Query: 278 EISVFSLTHKDDPKYWSGGSYDDWLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGGNK 337
           EI++ S++H+    YW   S D+W  E  G R I+  FANI    + G RAP+L++GG+ 
Sbjct: 60  EIALHSISHRSPQDYWRNASVDEWEREFGGQREILAHFANIPAEDIKGFRAPFLQLGGDA 119

Query: 338 QFSMMTDQFFVYDAS-ITASLGRVPIWPYTLYFRMPHKCNGNAHNCPSKSHP-TWEMVMN 395
            F  +      YD+S  T S    P+WPYTL +     C      CP+ S+P  W + M 
Sbjct: 120 TFKALKQLGLTYDSSWPTQSFTNPPLWPYTLDYGSTQDC--VIPPCPTDSYPGFWVVPMV 177

Query: 396 ELDRRDDPTFDESLPGCHMVDSCSNIQTGEQFGRLLRHNFNRHFSTNRAPLGLHFHASWL 455
           +         D +   C M+ +C    T ++    L  NF RH++TNRAP GL+ HASW 
Sbjct: 178 DWQ-------DLNGVPCSMLAACPPPGTADEVYDWLLSNFERHYNTNRAPFGLYLHASWF 230

Query: 456 KSKKEFKDELIKFIEEMLDRNDVYFVTMLQVIQWMQNP 493
           +      +   KF++E+L  +DVY VT+ Q I+WM+NP
Sbjct: 231 EFTPNRLEGFKKFLDELLSLDDVYLVTISQAIEWMRNP 268


>gnl|CDD|200545 cd10919, CE4_CDA_like, Putative catalytic domain of chitin
           deacetylase-like proteins from insects and similar
           proteins.  Chitin deacetylases (CDAs, EC 3.5.1.41) are
           secreted metalloproteins belonging to a family of
           extracellular chitin-modifying enzymes that catalyze the
           N-deacetylation of chitin, a beta-1,4-linked
           N-acetylglucosamine polymer, to form chitosan, a polymer
           of beta-(1,4)-linked d-glucosamine residues. CDAs have
           been isolated and characterized from various bacterial
           and fungal species and belong to the larger carbohydrate
           esterase family 4 (CE4). This family includes many
           CDA-like proteins, mainly from insects, which contain a
           putative CDA-like catalytic domain similar to the
           catalytic NodB homology domain of CE4 esterases. Some
           family members have an additional chitin binding domain
           (ChBD), or an additional low-density lipoprotein
           receptor class A domain (LDLa), or both. Due to the lack
           of some catalytically relevant residues, several insect
           CDA-like proteins are devoid of enzymatic activity and
           may simply bind to chitin and thus influence the
           mechanical or permeability properties of
           chitin-containing structures such as the cuticle or the
           peritrophic membrane. This family also includes many
           uncharacterized hypothetical proteins from bacteria,
           exhibiting high sequence similarity to insect CDA-like
           proteins.
          Length = 273

 Score =  264 bits (678), Expect = 6e-85
 Identities = 101/283 (35%), Positives = 149/283 (52%), Gaps = 17/283 (6%)

Query: 218 PQMITITFNGAVNVDNSDLYD-EIFNGNRQNPNGCQIKGTFFVSHKYTNYASVQELHRKG 276
           PQ +  TF+ A+N  N+D    EI +G   N  GC I  TFFVS  YT+ + V++L R+G
Sbjct: 1   PQFVLFTFDDAINELNTDAVIQEIADG-TNNNGGCPIPATFFVSTNYTDCSLVKQLWREG 59

Query: 277 HEISVFSLTHKDDPKYWSGGSYDDWLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGGN 336
           HEI+  ++TH  D       S D+W  E+AG R  + +   I    V+G RAPYL     
Sbjct: 60  HEIATHTVTHVPDDSNA---SVDEWEEEIAGQREWLNKTCGIPLEKVVGFRAPYL-AYNP 115

Query: 337 KQFSMMTDQFFVYDASITASL---GRVPIWPYTLYFRMPHKCNGNAHNCPS--KSHPTWE 391
               ++ +  F+YD+SI       G   +WPYTL + +P  CN    +C    +    WE
Sbjct: 116 NTREVLEENGFLYDSSIPEPYTPSGTNRLWPYTLDYGIPQDCNLVPGSCSPTERYPGLWE 175

Query: 392 MVMNELDRRDDPTFDESLPGCHMVDSCSNIQTGEQFGRLLRHNFNRHFSTNRAPLGLHFH 451
           + +  L   +D T D      +          G+ F  LL++NF+RH++ NRAP G++ H
Sbjct: 176 VPLYTLQDGNDTTGDS-----YYCTPDDGPLNGDSFYALLKYNFDRHYNGNRAPFGIYLH 230

Query: 452 ASWLKSKK-EFKDELIKFIEEMLDRNDVYFVTMLQVIQWMQNP 493
           A+WL     E +  L KF++  L + DV+FVT  Q++ WMQNP
Sbjct: 231 AAWLSPPYSERRAALEKFLDYALSKPDVWFVTNSQLLDWMQNP 273


>gnl|CDD|200598 cd10976, CE4_CDA_like_3, Putative catalytic domain of
           uncharacterized bacterial hypothetical proteins similar
           to insect chitin deacetylase-like proteins.  The family
           includes many uncharacterized bacterial hypothetical
           proteins that show high sequence similarity to insect
           chitin deacetylase-like proteins. Chitin deacetylases
           (CDAs, EC 3.5.1.41) are secreted metalloproteins
           belonging to a family of extracellular chitin-modifying
           enzymes that catalyze the N-deacetylation of chitin, a
           beta-1,4-linked N-acetylglucosamine polymer, to form
           chitosan, a polymer of beta-(1,4)-linked d-glucosamine
           residues.
          Length = 299

 Score = 63.5 bits (155), Expect = 5e-11
 Identities = 55/239 (23%), Positives = 85/239 (35%), Gaps = 43/239 (17%)

Query: 273 HRKGHEISVFSLTHKDDPKYWSGGSYDDWLAEMAGGRLIIERF---ANISDG-------- 321
           +R+GHEI   +  H D        S  DW  E       +E       I           
Sbjct: 82  YREGHEIGSHANGHFDGKGGGGRWSVADWKREFDQFYRFVENAYAINGIEGAPPWPAFAP 141

Query: 322 -SVIGVRAPYLRVGGNKQFSMMTDQFFVYDASITASLGRVPIWPYTLYFRMPHKCNGNAH 380
            S+ G RAP L      Q ++     F YDAS   S+ + P W        P K +G  +
Sbjct: 142 NSIKGFRAPCLEGSKGLQPALKKHG-FTYDAS---SVTQGPYW--------PQKVDGIWN 189

Query: 381 N------CPSKSHPTWEMVMNELDRRDDPTFDESLPGCHMVDSCSNIQTGEQFGRLLRHN 434
                      S P   M  N L  R     +               +   +     R+ 
Sbjct: 190 FPLPLVPEGPTSRPVIAMDYN-LFVRHSGGVEAPAK---------AAEFEARMLATYRNA 239

Query: 435 FNRHFSTNRAPLGLHFH-ASWLKSKKEFKDELIKFIEEMLDRNDVYFVTMLQVIQWMQN 492
           F+R ++ NRAPL L  H   W      + + L +F EE+  + +V  VT  +++ +++ 
Sbjct: 240 FDRAYNGNRAPLQLGNHFVKWNGGA--YWNALERFAEEVCTKPEVKCVTYRELVDFLEA 296


>gnl|CDD|238060 cd00112, LDLa, Low Density Lipoprotein Receptor Class A domain, a
           cysteine-rich repeat that plays a central role in
           mammalian cholesterol metabolism; the receptor protein
           binds LDL and transports it into cells by endocytosis; 7
           successive cysteine-rich repeats of about 40 amino acids
           are present in the N-terminal of this multidomain
           membrane protein; other homologous domains occur in
           related receptors, including the very low-density
           lipoprotein receptor and the LDL receptor-related
           protein/alpha 2-macroglobulin receptor, and in proteins
           which are functionally unrelated, such as the C9
           component of complement; the binding of calcium is
           required for in vitro formation of the native disulfide
           isomer and is necessary in establishing and maintaining
           the modular structure.
          Length = 35

 Score = 51.4 bits (124), Expect = 5e-09
 Identities = 16/35 (45%), Positives = 19/35 (54%)

Query: 143 CPEGKLSCGNGECIDKTLFCDDKPDCKDESDENAC 177
           CP  +  C NG CI  +  CD + DC D SDE  C
Sbjct: 1   CPPNEFRCANGRCIPSSWVCDGEDDCGDGSDEENC 35


>gnl|CDD|213020 cd10585, CE4_SF, Catalytic NodB homology domain of the carbohydrate
           esterase 4 superfamily.  The carbohydrate esterase 4
           (CE4) superfamily mainly includes chitin deacetylases
           (EC 3.5.1.41), bacterial peptidoglycan
           N-acetylglucosamine deacetylases (EC 3.5.1.-), and
           acetylxylan esterases (EC 3.1.1.72), which catalyze the
           N- or O-deacetylation of substrates such as acetylated
           chitin, peptidoglycan, and acetylated xylan,
           respectively. Members in this superfamily contain a NodB
           homology domain that adopts a deformed (beta/alpha)8
           barrel fold, which encompasses a mononuclear
           metalloenzyme employing a conserved His-His-Asp
           zinc-binding triad, closely associated with the
           conserved catalytic base (aspartic acid) and acid
           (histidine) to carry out acid/base catalysis. The NodB
           homology domain of CE4 superfamily is remotely related
           to the 7-stranded beta/alpha barrel catalytic domain of
           the superfamily consisting of family 38 glycoside
           hydrolases (GH38), family 57 heat stable retaining
           glycoside hydrolases (GH57), lactam utilization protein
           LamB/YcsF family proteins, and YdjC-family proteins.
          Length = 142

 Score = 53.2 bits (128), Expect = 2e-08
 Identities = 24/96 (25%), Positives = 34/96 (35%), Gaps = 15/96 (15%)

Query: 249 NGCQIKGTFFVSHKY----------TNYASVQELHRKGHEISVFSLTHKDDPKYWSGGSY 298
            G  I  T FV               N+  ++EL   GHEI +   TH D        S 
Sbjct: 27  EGYGIPATLFVIPGNANPDKLMKSPLNWDLLRELLAYGHEIGLHGYTHPDLAYGN--LSP 84

Query: 299 DDWLAEMAGGRLIIERFANISDGSVIGVRAPYLRVG 334
           ++ L ++   R I+E           G RAP   + 
Sbjct: 85  EEVLEDLLRARRILEEAGGQ---PPKGFRAPGGNLS 117


>gnl|CDD|200964 pfam00057, Ldl_recept_a, Low-density lipoprotein receptor domain
           class A. 
          Length = 37

 Score = 46.9 bits (112), Expect = 2e-07
 Identities = 17/37 (45%), Positives = 21/37 (56%)

Query: 141 PVCPEGKLSCGNGECIDKTLFCDDKPDCKDESDENAC 177
             C   +  CG+GECI  +  CD  PDC+D SDE  C
Sbjct: 1   STCGPDEFQCGSGECIPMSWVCDGDPDCEDGSDEKNC 37


>gnl|CDD|197566 smart00192, LDLa, Low-density lipoprotein receptor domain class A. 
           Cysteine-rich repeat in the low-density lipoprotein
           (LDL) receptor that plays a central role in mammalian
           cholesterol metabolism. The N-terminal type A repeats in
           LDL receptor bind the lipoproteins. Other homologous
           domains occur in related receptors, including the very
           low-density lipoprotein receptor and the LDL
           receptor-related protein/alpha 2-macroglobulin receptor,
           and in proteins which are functionally unrelated, such
           as the C9 component of complement. Mutations in the LDL
           receptor gene cause familial hypercholesterolemia.
          Length = 33

 Score = 46.5 bits (111), Expect = 3e-07
 Identities = 16/32 (50%), Positives = 18/32 (56%)

Query: 143 CPEGKLSCGNGECIDKTLFCDDKPDCKDESDE 174
           CP G+  C NG CI  +  CD   DC D SDE
Sbjct: 2   CPPGEFQCDNGRCIPSSWVCDGVDDCGDGSDE 33


>gnl|CDD|216601 pfam01607, CBM_14, Chitin binding Peritrophin-A domain.  This
          domain is called the Peritrophin-A domain and is found
          in chitin binding proteins particularly peritrophic
          matrix proteins of insects and animal chitinases.
          Copies of the domain are also found in some
          baculoviruses. Relevant references that describe
          proteins with this domain include. It is an
          extracellular domain that contains six conserved
          cysteines that probably form three disulphide bridges.
          Chitin binding has been demonstrated for a protein
          containing only two of these domains.
          Length = 53

 Score = 46.3 bits (110), Expect = 6e-07
 Identities = 24/62 (38%), Positives = 28/62 (45%), Gaps = 9/62 (14%)

Query: 29 CEGRPPEEYFRLTIEGDCRDVVRCDQAGENGITRLAGVRCPNGLAFDIDRQTCDWKTNVK 88
          CEGRP   Y      GDC    +C     NG   +    CP GL FD    TCD+  NV 
Sbjct: 1  CEGRPDGLYP---DPGDCSKYYQCS----NGKAVVF--TCPAGLVFDPALGTCDYPDNVV 51

Query: 89 NC 90
          +C
Sbjct: 52 DC 53



 Score = 36.6 bits (85), Expect = 0.001
 Identities = 11/24 (45%), Positives = 15/24 (62%)

Query: 100 SCPSGLAFDVDKQTCDWKAKVTNC 123
           +CP+GL FD    TCD+   V +C
Sbjct: 30  TCPAGLVFDPALGTCDYPDNVVDC 53


>gnl|CDD|213022 cd10917, CE4_NodB_like_6s_7s, Catalytic NodB homology domain of
           rhizobial NodB-like proteins.  This family belongs to
           the large and functionally diverse carbohydrate esterase
           4 (CE4) superfamily, whose members show strong sequence
           similarity with some variability due to their distinct
           carbohydrate substrates. It includes many rhizobial NodB
           chitooligosaccharide N-deacetylase (EC 3.5.1.-)-like
           proteins, mainly from bacteria and eukaryotes, such as
           chitin deacetylases (EC 3.5.1.41), bacterial
           peptidoglycan N-acetylglucosamine deacetylases (EC
           3.5.1.-), and acetylxylan esterases (EC 3.1.1.72), which
           catalyze the N- or O-deacetylation of substrates such as
           acetylated chitin, peptidoglycan, and acetylated xylan.
           All members of this family contain a catalytic NodB
           homology domain with the same overall topology and a
           deformed (beta/alpha)8 barrel fold with 6- or 7 strands.
           Their catalytic activity is dependent on the presence of
           a divalent cation, preferably cobalt or zinc, and they
           employ a conserved His-His-Asp zinc-binding triad
           closely associated with the conserved catalytic base
           (aspartic acid) and acid (histidine) to carry out
           acid/base catalysis. Several family members show
           diversity both in metal ion specificities and in the
           residues that coordinate the metal.
          Length = 171

 Score = 43.4 bits (103), Expect = 6e-05
 Identities = 24/114 (21%), Positives = 43/114 (37%), Gaps = 21/114 (18%)

Query: 221 ITITFNGAVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFVS-HKYTNYAS-VQELHRKGHE 278
           + +TF+   + + +    +I            +K TFFV       +   V+ +  +GHE
Sbjct: 3   VALTFDDGPDPEYTPKILDILAEY-------GVKATFFVVGENVEKHPDLVRRIVAEGHE 55

Query: 279 ISVFSLTHKDDPKYWSGGSYDDWLAEMAGGRLIIERFANISDGSVIGVRAPYLR 332
           I   + +H D     +  S ++  AE+   +  IE           GVR    R
Sbjct: 56  IGNHTYSHPD----LTKLSPEEIRAEIERTQDAIEE--------ATGVRPRLFR 97


>gnl|CDD|200566 cd10941, CE4_PuuE_HpPgdA_like_2, Putative catalytic domain of
           uncharacterized prokaryotic polysaccharide deacetylases
           similar to bacterial PuuE allantoinases and Helicobacter
           pylori peptidoglycan deacetylase (HpPgdA).  This family
           contains many uncharacterized prokaryotic polysaccharide
           deacetylases (DCAs) that show high sequence similarity
           to the catalytic domain of bacterial PuuE allantoinases
           and Helicobacter pylori peptidoglycan deacetylase
           (HpPgdA). PuuE allantoinase appears to be
           metal-independent and specifically catalyzes the
           hydrolysis of (S)-allantoin into allantoic acid.
           Different from PuuE allantoinase, HpPgdA has the ability
           to bind a metal ion at the active site and is
           responsible for a peptidoglycan modification that
           counteracts the host immune response. Both PuuE
           allantoinase and HpPgdA function as homotetramers. The
           monomer is composed of a 7-stranded barrel with
           detectable sequence similarity to the 6-stranded barrel
           NodB homology domain of DCA-like proteins in the CE4
           superfamily, which removes N-linked or O-linked acetyl
           groups from cell wall polysaccharides. In contrast to
           typical NodB-like DCAs, PuuE allantoinase and HpPgdA do
           not exhibit a solvent-accessible polysaccharide binding
           groove and might only bind a small molecule at the
           active site.
          Length = 258

 Score = 43.8 bits (104), Expect = 1e-04
 Identities = 24/146 (16%), Positives = 47/146 (32%), Gaps = 24/146 (16%)

Query: 252 QIKGTFF----VSHKYTNYASVQELHRKGHEISVFSLTHKDDPKYWSGGSYDDWLAEMAG 307
            +K TFF    V+ +Y     ++ +   GHEI+     H+   +       +D       
Sbjct: 45  GVKATFFVLGEVAERYP--DLIRRIAEAGHEIASHGYAHERVDRLTPEEFREDLRRS--- 99

Query: 308 GRLIIERFANISDGSVIGVRAPYLRVGGNKQ-FSMMTDQFFVYDASITASLGRVPIWPYT 366
                +   +I+   V+G RAP   +        ++ +  ++YD+S+       P     
Sbjct: 100 ----KKILEDITGQKVVGFRAPNFSI--TPWALDILAEAGYLYDSSV------FPTKRP- 146

Query: 367 LYFRMPHKCNGNAHNCPSKSHPTWEM 392
             +  P                  E 
Sbjct: 147 -GYGGPLAPKSEPLPPIRAKGGILEF 171


>gnl|CDD|200589 cd10967, CE4_GLA_like_6s, Putative catalytic NodB homology domain
           of gellan lyase and similar proteins.  This family is
           represented by the extracellular
           polysaccharide-degrading enzyme, gellan lyase
           (gellanase, EC 4.2.2.-), from Bacillus sp. The enzyme
           acts on gellan exolytically and releases a
           tetrasaccharide of glucuronyl-glucosyl-rhamnosyl-glucose
           with unsaturated glucuronic acid at the nonreducing
           terminus. The family also includes many uncharacterized
           prokaryotic polysaccharide deacetylases, which show high
           sequence similarity to Bacillus sp. gellan lyase.
           Although their biological functions remain unknown, all
           members of the family contain a conserved domain with a
           6-stranded beta/alpha barrel, which is similar to the
           catalytic NodB homology domain of rhizobial NodB-like
           proteins, belonging to the larger carbohydrate esterase
           4 (CE4) superfamily.
          Length = 202

 Score = 42.8 bits (101), Expect = 2e-04
 Identities = 20/67 (29%), Positives = 30/67 (44%), Gaps = 10/67 (14%)

Query: 253 IKGTFFVS------HKYTNYASVQELHRKGHEISVFSLTHKDDPKYWSGGSYDDWLAEMA 306
           +KGTFFV+        Y +   ++EL   GHEI   ++TH D           +   E+A
Sbjct: 26  LKGTFFVNSGLLGRRGYLDLEELRELAAAGHEIGSHTVTHPDLTSL----PPAELRREIA 81

Query: 307 GGRLIIE 313
             R  +E
Sbjct: 82  ESRAALE 88


>gnl|CDD|216550 pfam01522, Polysacc_deac_1, Polysaccharide deacetylase.  This
           domain is found in polysaccharide deacetylase. This
           family of polysaccharide deacetylases includes NodB
           (nodulation protein B from Rhizobium) which is a
           chitooligosaccharide deacetylase. It also includes
           chitin deacetylase from yeast, and endoxylanases which
           hydrolyses glucosidic bonds in xylan.
          Length = 124

 Score = 40.3 bits (95), Expect = 3e-04
 Identities = 24/113 (21%), Positives = 45/113 (39%), Gaps = 17/113 (15%)

Query: 220 MITITFNGAVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFV--SHKYTNYASVQELHRKGH 277
            + +TF+        D    I +  ++      +K TFFV   +  T    V+ +  +GH
Sbjct: 8   SVALTFDDGP----PDNTPRILDVLKKY----GVKATFFVIGGNAETYPDLVRRMVAEGH 59

Query: 278 EISVFSLTHKDDPKYWSGGSYDDWLAEMAGGRLIIERFANISDGSVIGVRAPY 330
           EI   + +H D     +  S ++  AE+   +  I++    +       R PY
Sbjct: 60  EIGNHTWSHPD----LTALSPEEIRAEILRSQEAIKK---ATGKRPRLFRPPY 105


>gnl|CDD|200574 cd10950, CE4_BsYlxY_like, Putative catalytic NodB homology domain
           of uncharacterized protein YlxY from Bacillus subtilis
           and its bacterial homologs.  The Bacillus subtilis
           genome contains six polysaccharide deacetylase gene
           homologs: pdaA, pdaB (previously known as ybaN), yheN,
           yjeA, yxkH and ylxY. This family is represented by
           Bacillus subtilis putative polysaccharide deacetylase
           BsYlxY, encoded by the ylxY gene, which is a member of
           the carbohydrate esterase 4 (CE4) superfamily. Although
           its biological function still remains unknown, BsYlxY
           shows high sequence homology to the catalytic domain of
           Bacillus subtilis pdaB gene encoding a putative
           polysaccharide deacetylase (BsPdaB), which is essential
           for the maintenance of spores after the late stage of
           sporulation and is highly conserved in spore-forming
           bacteria. However, disruption of the ylxY gene in B.
           subtilis did not cause any sporulation defect. Moreover,
           the Asp residue in the classical His-His-Asp
           zinc-binding motif of CE4 esterases is mutated to a Val
           residue in this family. Other catalytically relevant
           residues of CE4 esterases are also not conserved, which
           suggest that members of this family may be inactive.
          Length = 188

 Score = 37.6 bits (88), Expect = 0.007
 Identities = 22/103 (21%), Positives = 42/103 (40%), Gaps = 15/103 (14%)

Query: 219 QMITITFNGAVNVDNSDLY-DEIFNGNRQNPNGCQIKGTFFVSHKYT--NYASVQELHRK 275
           +M+ +     +NV   + Y   +     ++     +K TFF+  ++   N   V+++ + 
Sbjct: 6   KMVALL----INVAWGEEYLPAMLTILEKH----DVKATFFLEGRWAKKNPDLVRKIAKD 57

Query: 276 GHEISVFSLTHKDDPKYWSGGSYDDWLAEMAGGRLIIERFANI 318
           GHEI     +H D  +     SY+    E+     IIE     
Sbjct: 58  GHEIGNHGYSHPDPSQL----SYEQNREEIRKTNEIIEEITGE 96


>gnl|CDD|213757 TIGR03006, pepcterm_polyde, polysaccharide deacetylase family
           protein, PEP-CTERM locus subfamily.  Members of this
           protein family belong to the family of polysaccharide
           deacetylases (pfam01522). All are found in species that
           encode the PEP-CTERM/exosortase system predicted to act
           in protein sorting in a number of Gram-negative
           bacteria, and are found near the epsH homolog that is
           the putative exosortase gene. The highest scoring
           homologs below the trusted cutoff for this model are
           found in several species of Methanosarcina, an archaeal
           genus [Cell envelope, Biosynthesis and degradation of
           surface polysaccharides and lipopolysaccharides].
          Length = 272

 Score = 36.5 bits (85), Expect = 0.026
 Identities = 24/108 (22%), Positives = 50/108 (46%), Gaps = 15/108 (13%)

Query: 252 QIKGTFF----VSHKYTNYASVQELHRKGHEISVFSLTHKDDPKYWSGGSYDDWLAEMAG 307
            +K TFF    V+ +Y     V+ +   GHE++    +H    +  +  + + + A++  
Sbjct: 47  GVKATFFTLGWVAERYPEL--VRRIVAAGHELA----SHGYGHERVTTQTPEAFRADIRR 100

Query: 308 GRLIIERFANISDGSVIGVRAPYLRVG-GNK-QFSMMTDQFFVYDASI 353
            + ++E   ++S   V G RAP   +G  N     ++ +  + Y +SI
Sbjct: 101 SKALLE---DLSGQPVRGYRAPSFSIGKKNLWALDVLAEAGYRYSSSI 145


>gnl|CDD|234001 TIGR02764, spore_ybaN_pdaB, polysaccharide deacetylase family
           sporulation protein PdaB.  This model describes the YbaN
           protein family, also called PdaB and SpoVIE, of
           Gram-positive bacteria. Although ybaN null mutants have
           only a mild sporulation defect, ybaN/ytrI double mutants
           show drastically reducted sporulation efficiencies. This
           synthetic defect suggests the role of this
           sigmaE-controlled gene in sporulation had been masked by
           functional redundancy. Members of this family are
           homologous to a characterized polysaccharide
           deacetylase; the exact function this protein family is
           unknown [Cellular processes, Sporulation and
           germination].
          Length = 191

 Score = 35.0 bits (81), Expect = 0.049
 Identities = 19/70 (27%), Positives = 35/70 (50%), Gaps = 9/70 (12%)

Query: 221 ITITFNGAVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHKYT-NYAS-VQELHRKGHE 278
           I +TF+  ++  N D  + I +  ++      +K TFF+S  +   +   V+E+ + GHE
Sbjct: 8   IALTFD--ISWGN-DYTEPILDTLKEY----DVKATFFLSGSWAERHPELVKEIVKDGHE 60

Query: 279 ISVFSLTHKD 288
           I      HK+
Sbjct: 61  IGSHGYRHKN 70


>gnl|CDD|214696 smart00494, ChtBD2, Chitin-binding domain type 2. 
          Length = 49

 Score = 31.6 bits (72), Expect = 0.065
 Identities = 15/40 (37%), Positives = 18/40 (45%), Gaps = 6/40 (15%)

Query: 45 DCRDVVRCDQAGENGITRLAGVRCPNGLAFDIDRQTCDWK 84
          DC    +C     NG   +    CP GL F+   QTCDW 
Sbjct: 16 DCSKYYQCS----NGRPIVG--SCPAGLVFNPATQTCDWP 49



 Score = 31.6 bits (72), Expect = 0.079
 Identities = 11/18 (61%), Positives = 13/18 (72%)

Query: 100 SCPSGLAFDVDKQTCDWK 117
           SCP+GL F+   QTCDW 
Sbjct: 32  SCPAGLVFNPATQTCDWP 49


>gnl|CDD|200569 cd10944, CE4_SmPgdA_like, Catalytic NodB homology domain of
           Streptococcus mutans polysaccharide deacetylase PgdA,
           Bacillus subtilis YheN, and similar proteins.  This
           family is represented by a putative polysaccharide
           deacetylase PgdA from the oral pathogen Streptococcus
           mutans (SmPgdA) and Bacillus subtilis YheN (BsYheN),
           which are members of the carbohydrate esterase 4 (CE4)
           superfamily. SmPgdA is an extracellular metal-dependent
           polysaccharide deacetylase with a typical CE4 fold, with
           metal bound to a His-His-Asp triad. It possesses
           de-N-acetylase activity toward a hexamer of
           chitooligosaccharide N-acetylglucosamine, but not
           shorter chitooligosaccharides or a synthetic
           peptidoglycan tetrasaccharide. SmPgdA plays a role in
           tuning cell surface properties and in interactions with
           (salivary) agglutinin, an essential component of the
           innate immune system, most likely through deacetylation
           of an as-yet-unidentified polysaccharide. SmPgdA shows
           significant homology to the catalytic domains of
           peptidoglycan deacetylases from Streptococcus pneumoniae
           (SpPgdA) and Listeria monocytogenes (LmPgdA), both of
           which are involved in the bacterial defense mechanism
           against human mucosal lysozyme. The Bacillus subtilis
           genome contains six polysaccharide deacetylase gene
           homologs: pdaA, pdaB (previously known as ybaN), yheN,
           yjeA, yxkH and ylxY. The biological function of BsYheN
           is still unknown. This family also includes many
           uncharacterized polysaccharide deacetylases mainly found
           in bacteria.
          Length = 189

 Score = 34.4 bits (80), Expect = 0.071
 Identities = 22/97 (22%), Positives = 42/97 (43%), Gaps = 14/97 (14%)

Query: 252 QIKGTFFV--SHKYTNYASVQELHRKGHEISVFSLTHKDDPKYWSGGSYDDWLAEMAGGR 309
            +K TFFV  S+       V+ + ++GH I + S TH     Y    S + ++ ++   +
Sbjct: 26  NVKATFFVIGSNVEKYPELVKRIVKEGHAIGLHSYTHDYKKLY---SSPEAFIKDLNKTQ 82

Query: 310 LIIERFANISDGSVIGVRAPYLRV-GGNKQFSMMTDQ 345
            +I+         + GV+   +R  GG+    +M   
Sbjct: 83  DLIK--------KITGVKTKLIRFPGGSSNTGLMKAL 111


>gnl|CDD|200582 cd10959, CE4_NodB_like_3, Catalytic NodB homology domain of
           uncharacterized bacterial polysaccharide deacetylases.
           This family includes many uncharacterized bacterial
           polysaccharide deacetylases. Although their biological
           function still remains unknown, members in this family
           show high sequence homology to the catalytic NodB
           homology domain of Streptococcus pneumoniae
           polysaccharide deacetylase PgdA (SpPgdA), which is an
           extracellular metal-dependent polysaccharide deacetylase
           with de-N-acetylase activity toward a hexamer of
           chitooligosaccharide N-acetylglucosamine, but not
           shorter chitooligosaccharides or a synthetic
           peptidoglycan tetrasaccharide. Like SpPgdA, this family
           is a member of the carbohydrate esterase 4 (CE4)
           superfamily.
          Length = 187

 Score = 33.7 bits (78), Expect = 0.13
 Identities = 16/82 (19%), Positives = 29/82 (35%), Gaps = 9/82 (10%)

Query: 251 CQIKGTFFV--SHKYTNYASVQELHRKGHEISVFSLTHKDDPKYWSGGSYDDWLAEMAGG 308
              K TFFV       +   ++ +  +GHEI      H+         S    + ++   
Sbjct: 26  HGAKATFFVVGERAERHPDLIRRIVDEGHEIGNHGYRHRHPWLR----SPWKAIRDLRRA 81

Query: 309 RLIIERFANISDGSVIGVRAPY 330
             IIE+   ++       R P+
Sbjct: 82  ARIIEQ---LTGRPPRYYRPPW 100


>gnl|CDD|200578 cd10954, CE4_CtAXE_like, Catalytic NodB homology domain of
           Clostridium thermocellum acetylxylan esterase and its
           bacterial homologs.  This family is represented by
           Clostridium thermocellum acetylxylan esterase (CtAXE, EC
           3.1.1.72), a member of the carbohydrate esterase 4 (CE4)
           superfamily. CtAXE deacetylates O-acetylated xylan, a
           key component of plant cell walls. It shows no
           detectable activity on generic esterase substrates
           including para-nitrophenyl acetate. It is specific for
           sugar-based substrates and will precipitate acetylxylan,
           as a consequence of deacetylation. CtAXE is a monomeric
           protein containing a catalytic NodB homology domain with
           the same overall topology and a deformed (beta/alpha)8
           barrel fold as other CE4 esterases. However, due to
           differences in the topography of the substrate-binding
           groove, the chemistry of the active center, and metal
           ion coordination, CtAXE has different metal ion
           preference and lacks activity on N-acetyl substrates. It
           is significantly activated by Co2+. Moreover, CtAXE
           displays distinctly different ligand coordination to the
           metal ion, utilizing an aspartate, a histidine, and four
           water molecules, as opposed to the conserved His-His-Asp
           zinc-binding triad of other CE4 esterases.
          Length = 180

 Score = 33.7 bits (78), Expect = 0.13
 Identities = 25/119 (21%), Positives = 39/119 (32%), Gaps = 16/119 (13%)

Query: 220 MITITFNGAVNVDNSDLYDEIFNGNRQNPNGCQIKGTFF-VSHKYTNYAS-VQELHRKGH 277
           M+ +TF+   N   +    ++            ++ TFF V          V+ +   G 
Sbjct: 2   MVALTFDDGPNAKYTPRLLDVLEKYN-------VRATFFLVGQNVNGNKEIVKRMVEMGC 54

Query: 278 EISVFSLTHKDDPKYWSGGSYDDWLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGGN 336
           EI   S TH D  K     S  +   E+      I+    I+       R PY  V   
Sbjct: 55  EIGNHSYTHPDLTKL----SPSEIKKEIEKTNEAIK---KITGKRPKLFRPPYGAVNDT 106


>gnl|CDD|200575 cd10951, CE4_ClCDA_like, Catalytic NodB homology domain of
           Colletotrichum lindemuthianum chitin deacetylase and
           similar proteins.  This family is represented by the
           chitin deacetylase (endo-chitin de-N-acetylase, ClCDA,
           EC 3.5.1.41) from Colletotrichum lindemuthianum (also
           known as Glomerella lindemuthiana), which is a member of
           the carbohydrate esterase 4 (CE4) superfamily. ClCDA
           catalyzes the hydrolysis of N-acetamido groups of
           N-acetyl-D-glucosamine residues in chitin, converting it
           to chitosan in fungal cell walls. It consists of a
           single catalytic domain similar to the deformed
           (alpha/beta)8 barrel fold adopted by other CE4
           esterases, which encompasses a mononuclear metalloenzyme
           employing a conserved His-His-Asp zinc-binding triad
           closely associated with the conserved catalytic base
           (aspartic acid) and acid (histidine), to carry out
           acid/base catalysis. It possesses a highly conserved
           substrate-binding groove, with subtle alterations that
           influence substrate specificity and subsite affinity.
           Unlike its bacterial homologs, ClCDA contains two
           intramolecular disulfide bonds that may add stability to
           this secreted protein. The family also includes many
           uncharacterized deacetylases and hypothetical proteins
           mainly from eukaryotes, which show high sequence
           similarity to ClCDA.
          Length = 197

 Score = 33.4 bits (77), Expect = 0.21
 Identities = 27/136 (19%), Positives = 43/136 (31%), Gaps = 41/136 (30%)

Query: 217 VPQMITITFNGAVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHKYTNYAS------VQ 270
           VP  + +TF+   +     L D +             K TFFV+    N         ++
Sbjct: 6   VPGTVALTFDDGPSTYTPQLLD-LLKEA-------GAKATFFVNGNNFNGCIYDYADVLR 57

Query: 271 ELHRKGHEISVFSLTHKDDPKYWSGGSYDD-----WLAEMAGGRLIIERFANISDGSVIG 325
            ++ +GH+I+  + +H D  K     S           E A           I     +G
Sbjct: 58  RMYNEGHQIASHTWSHPDLTKL----SAAQIRDEMTKLEDA--------LRKI-----LG 100

Query: 326 V-----RAPYLRVGGN 336
           V     R PY      
Sbjct: 101 VKPTYMRPPYGECNDE 116


>gnl|CDD|234837 PRK00783, PRK00783, DNA-directed RNA polymerase subunit D;
           Provisional.
          Length = 263

 Score = 33.7 bits (78), Expect = 0.21
 Identities = 12/48 (25%), Positives = 16/48 (33%), Gaps = 1/48 (2%)

Query: 139 DEPVCPEGKLSCGNGECIDKTLF-CDDKPDCKDESDENACSVEQDPNR 185
               CP G L    G+ +   L  C     C+      A  V  D N+
Sbjct: 177 CVEACPRGVLELKEGKLVVTDLLNCSLCKLCERACPGKAIRVSDDENK 224


>gnl|CDD|200563 cd10938, CE4_HpPgdA_like, Catalytic domain of Helicobacter pylori
           peptidoglycan deacetylase (HpPgdA) and similar proteins.
            This family is represented by a peptidoglycan
           deacetylase (HP0310, HpPgdA) from the gram-negative
           pathogen Helicobacter pylori. HpPgdA has the ability to
           bind a metal ion at the active site and is responsible
           for a peptidoglycan modification that counteracts the
           host immune response. It functions as a homotetramer.
           The monomer is composed of a 7-stranded barrel with
           detectable sequence similarity to the 6-stranded barrel
           NodB homology domain of polysaccharide deacetylase
           (DCA)-like proteins in the CE4 superfamily, which
           removes N-linked or O-linked acetyl groups from cell
           wall polysaccharides. In contrast to typical NodB-like
           DCAs, HpPgdA does not exhibit a solvent-accessible
           polysaccharide binding groove, suggesting that the
           enzyme binds a small molecule at the active site.
          Length = 258

 Score = 32.1 bits (74), Expect = 0.58
 Identities = 25/107 (23%), Positives = 44/107 (41%), Gaps = 20/107 (18%)

Query: 253 IKGTFFVS-HKYTNY-ASVQELHRKGHEISVFSLTHKDDPKYWSGGSYDDWLAEMAGGRL 310
           +K TFFV  H    +  +V+ +   GHEI      H    +  +G + ++    +  G  
Sbjct: 51  VKATFFVPGHTAETFPEAVEAILAAGHEIGHHGYLH----ENPTGLTPEEERELLERGLE 106

Query: 311 IIERFANISDGSVIGVRAPYLRVGGNKQFSMMTDQF-----FVYDAS 352
           ++E+         +G R+P        +FS  T        F+YD+S
Sbjct: 107 LLEKLTGK---RPVGYRSPSW------EFSPNTLDLLLEHGFLYDSS 144



 Score = 28.7 bits (65), Expect = 8.6
 Identities = 5/25 (20%), Positives = 12/25 (48%)

Query: 467 KFIEEMLDRNDVYFVTMLQVIQWMQ 491
           + IE +     V+F T  ++  + +
Sbjct: 234 RLIEHIKAHGGVWFATGEEIADYWR 258


>gnl|CDD|222448 pfam13905, Thioredoxin_8, Thioredoxin-like.  Thioredoxins are small
           enzymes that participate in redox reactions, via the
           reversible oxidation of an active centre disulfide bond.
          Length = 94

 Score = 30.0 bits (68), Expect = 0.86
 Identities = 10/38 (26%), Positives = 19/38 (50%), Gaps = 1/38 (2%)

Query: 445 PLGLHFHASWLKSKKEFKDELIKFIEEMLDRN-DVYFV 481
            + L+F ASW    + F  EL +  E++     ++ +V
Sbjct: 3   VVLLYFWASWCPPCRAFTPELKELYEKLKKPKVEIVYV 40


>gnl|CDD|187791 cd09660, Csx1_III-U, CRISPR/Cas system-associated protein Csx1.
           CRISPR (Clustered Regularly Interspaced Short
           Palindromic Repeats) and associated Cas proteins
           comprise a system for heritable host defense by
           prokaryotic cells against phage and other foreign DNA;
           Protein of this family often fused to HTH domain; Some
           proteins could have an additional fusion with
           RecB-family nuclease domain; Core domain appears to have
           a Rossmann-like fold; loosely associated with CRISPR/Cas
           systems; also known as MJ1666 family.
          Length = 394

 Score = 32.0 bits (73), Expect = 0.92
 Identities = 16/74 (21%), Positives = 30/74 (40%), Gaps = 5/74 (6%)

Query: 435 FNRHFSTNRAPLGLHFHASWLKSKKEFKDELIKFIEEMLDRNDVYFVTMLQVIQWMQNPT 494
           F   + T      L+FH+  L+  K    +    +EE+    + +F  +      ++N  
Sbjct: 268 FKEDYLTILK---LYFHSLLLEILKFIDYKNEVKLEEIKKLAETFFDKIEVTRVIIEN-- 322

Query: 495 ELTSLRDFQEWKEK 508
           E+  +   Q W EK
Sbjct: 323 EVDKIEKGQNWSEK 336


>gnl|CDD|213023 cd10918, CE4_NodB_like_5s_6s, Putative catalytic NodB homology
           domain of PgaB, IcaB, and similar proteins which consist
           of a deformed (beta/alpha)8 barrel fold with 5- or
           6-strands.  This family belongs to the large and
           functionally diverse carbohydrate esterase 4 (CE4)
           superfamily, whose members show strong sequence
           similarity with some variability due to their distinct
           carbohydrate substrates. It includes bacterial
           poly-beta-1,6-N-acetyl-D-glucosamine N-deacetylase PgaB,
           hemin storage system HmsF protein in gram-negative
           species, intercellular adhesion proteins IcaB, and many
           uncharacterized prokaryotic polysaccharide deacetylases.
           It also includes a putative polysaccharide deacetylase
           YxkH encoded by the Bacillus subtilis yxkH gene, which
           is one of six polysaccharide deacetylase gene homologs
           present in the Bacillus subtilis genome. Sequence
           comparison shows all family members contain a conserved
           domain similar to the catalytic NodB homology domain of
           rhizobial NodB-like proteins, which consists of a
           deformed (beta/alpha)8 barrel fold with 6 or 7 strands.
           However, in this family, most proteins have 5 strands
           and some have 6 strands. Moreover, long insertions are
           found in many family members, whose function remains
           unknown.
          Length = 157

 Score = 29.1 bits (66), Expect = 3.4
 Identities = 17/80 (21%), Positives = 27/80 (33%), Gaps = 21/80 (26%)

Query: 252 QIKGTFFVS-----------------HKYTNYASVQELHRKGHEISVFSLTHKDDPKYWS 294
            +  TFFV                    Y  +  ++EL   G EI   + TH D      
Sbjct: 25  GLPATFFVITGYIGGGNPWWAPAPPRPPYLTWDQLRELAASGVEIGSHTHTHPDLTTL-- 82

Query: 295 GGSYDDWLAEMAGGRLIIER 314
             S ++   E+A  +  +E 
Sbjct: 83  --SDEELRRELAESKERLEE 100


>gnl|CDD|200576 cd10952, CE4_MrCDA_like, Catalytic NodB homology domain of Mucor
           rouxii chitin deacetylase and similar proteins.  This
           family is represented by the chitin deacetylase (MrCDA,
           EC 3.5.1.41) encoded from the fungus Mucor rouxii (also
           known as Amylomyces rouxii). MrCDA is an acidic
           glycoprotein with a very stringent specificity for
           beta1-4-linked N-acetylglucosamine homopolymers. It
           requires at least four residues (chitotetraose) for
           catalysis, and can achieve extensive deacetylation on
           chitin polymers. MrCDA shows high sequence similarity to
           Colletotrichum lindemuthianum chitin deacetylase
           (endo-chitin de-N-acetylase, ClCDA), which consists of a
           single catalytic domain similar to the deformed
           (beta/alpha)8 barrel fold adopted by the carbohydrate
           esterase 4 (CE4) superfamily, which encompasses a
           mononuclear metalloenzyme employing a conserved
           His-His-Asp zinc-binding triad closely associated with
           the conserved catalytic base (aspartic acid) and acid
           (histidine) to carry out acid/base catalysis. The family
           also includes some uncharacterized eukaryotic and
           bacterial homologs of MrCDA.
          Length = 178

 Score = 29.3 bits (66), Expect = 3.5
 Identities = 14/37 (37%), Positives = 18/37 (48%), Gaps = 2/37 (5%)

Query: 253 IKGTFFV--SHKYTNYASVQELHRKGHEISVFSLTHK 287
            K TFFV  S+   N   +Q     GHEI V + +H 
Sbjct: 27  QKATFFVIGSNVVNNPDILQRALEAGHEIGVHTWSHP 63


>gnl|CDD|99987 cd03817, GT1_UGDG_like, This family is most closely related to the
           GT1 family of glycosyltransferases.
           UDP-glucose-diacylglycerol glucosyltransferase (UGDG;
           also known as 1,2-diacylglycerol 3-glucosyltransferase)
           catalyzes the transfer of glucose from UDP-glucose to
           1,2-diacylglycerol forming
           3-D-glucosyl-1,2-diacylglycerol.
          Length = 374

 Score = 29.5 bits (67), Expect = 5.6
 Identities = 7/24 (29%), Positives = 15/24 (62%)

Query: 270 QELHRKGHEISVFSLTHKDDPKYW 293
           +EL ++GHE+ V + ++   P+  
Sbjct: 25  EELEKRGHEVYVVAPSYPGAPEEE 48


>gnl|CDD|238481 cd00981, arch_gltB, Archaeal-type gltB domain. This domain shares
           sequence similarity with a region of unknown function
           found in the large subunit of glutamate synthase, which
           is encoded by gltB and found in most bacteria and
           eukaryotes.  It is predicted to be homologous to the
           C-terminal domain of glutamate synthase based upon
           sequence similarity coupled with genome organization
           data, showing that this domain is found in a gene
           cluster with other domains of Glts, which are annotated.
           This domain is found primarily in archaea, but is also
           present in a few bacteria, likely as a result of lateral
           gene transfer.
          Length = 232

 Score = 28.8 bits (65), Expect = 5.8
 Identities = 12/32 (37%), Positives = 18/32 (56%), Gaps = 3/32 (9%)

Query: 295 GGSYDDWLAE-MAGGRLIIERFANISDGSVIG 325
           GG+  D+L E MAGG +I+      +D   +G
Sbjct: 133 GGTAGDFLGEYMAGGVIIV--LGLGTDEEPVG 162


>gnl|CDD|99968 cd03794, GT1_wbuB_like, This family is most closely related to the
           GT1 family of glycosyltransferases. wbuB in E. coli is
           involved in the biosynthesis of the O26 O-antigen.  It
           has been proposed to function as an
           N-acetyl-L-fucosamine (L-FucNAc) transferase.
          Length = 394

 Score = 29.1 bits (66), Expect = 5.9
 Identities = 13/40 (32%), Positives = 20/40 (50%), Gaps = 4/40 (10%)

Query: 269 VQELHRKGHEISVFSLTHKDDPKYWSGGSYDDWLAEMAGG 308
            +EL ++GHE++V +      P Y SG  Y  +  E   G
Sbjct: 24  AEELVKRGHEVTVITG----SPNYPSGKIYKGYKREEVDG 59


>gnl|CDD|99990 cd03820, GT1_amsD_like, This family is most closely related to the
           GT1 family of glycosyltransferases. AmSD in Erwinia
           amylovora has been shown to be involved in the
           biosynthesis of amylovoran, the acidic exopolysaccharide
           acting as a virulence factor. This enzyme may be
           responsible for the formation of  galactose alpha-1,6
           linkages in amylovoran.
          Length = 348

 Score = 29.1 bits (66), Expect = 6.8
 Identities = 9/22 (40%), Positives = 13/22 (59%)

Query: 271 ELHRKGHEISVFSLTHKDDPKY 292
            L  KGHE+++ SL   + P Y
Sbjct: 25  ALAEKGHEVTIISLDKGEPPFY 46


>gnl|CDD|200557 cd10931, CE4_u7, Putative catalytic domain of uncharacterized
           bacterial proteins from the carbohydrate esterase 4
           superfamily.  This family corresponds to a group of
           uncharacterized bacterial proteins with high sequence
           similarity to the catalytic domain of the six-stranded
           barrel rhizobial NodB-like proteins, which remove
           N-linked or O-linked acetyl groups from cell wall
           polysaccharides and belong to the larger carbohydrate
           esterase 4 (CE4) superfamily.
          Length = 224

 Score = 28.3 bits (64), Expect = 9.0
 Identities = 26/119 (21%), Positives = 42/119 (35%), Gaps = 33/119 (27%)

Query: 229 VNVDNS-DLYDEIFNGNRQNPNGCQIKGTFF-------------VSHKYTNYASV-QELH 273
            +VD+  D +D + +  ++      ++ TFF              S+      S+ +E+ 
Sbjct: 7   HDVDDPYDTFDWLMDLEKKY----GVRSTFFFLAGDYSPYDDGNYSYNDPKIRSLIKEIA 62

Query: 274 RKGHEISVFSLTHKDDPKYWSGGSYDDWLAEMAGGRLIIERFANISDGSVIGVRAPYLR 332
            +G EI +    H     Y S    +    E        ER   I    V G R  YLR
Sbjct: 63  DRGWEIGL----H---GSYNSYTDPEKLKKEK-------ERLEKILGRPVTGGRQHYLR 107


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.137    0.445 

Gapped
Lambda     K      H
   0.267   0.0665    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 28,520,624
Number of extensions: 2706928
Number of successful extensions: 2297
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2274
Number of HSP's successfully gapped: 37
Length of query: 563
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 461
Effective length of database: 6,413,494
Effective search space: 2956620734
Effective search space used: 2956620734
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 62 (27.8 bits)