RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy10186
         (124 letters)



>gnl|CDD|201077 pfam00201, UDPGT, UDP-glucoronosyl and UDP-glucosyl transferase. 
          Length = 500

 Score =  113 bits (285), Expect = 1e-30
 Identities = 42/113 (37%), Positives = 68/113 (60%)

Query: 9   LVTAFSRTGLTVLWRYEGDSIENLPGNVHIRKWIPQQDVLAHPNCRLFISHGGVNSALEA 68
           + +A ++    VLWR++G     L  N  + KW+PQ D+L HP  R F++H G N   EA
Sbjct: 298 IASALAQIPQKVLWRFDGTKPSTLGRNTRLVKWLPQNDLLGHPKTRAFVTHAGSNGVYEA 357

Query: 69  IHYGIPIIGVPFYGDQLSHVRHIVDLGAGVELSYFNITLESIAWATSIVLNNP 121
           I +G+P++G+P +GDQ+ + +H+   GA V L+   +T E +  A   V+N+P
Sbjct: 358 ICHGVPMVGMPLFGDQMDNAKHMEAKGAAVTLNVLTMTSEDLLNALKTVINDP 410


>gnl|CDD|223071 PHA03392, egt, ecdysteroid UDP-glucosyltransferase; Provisional.
          Length = 507

 Score =  110 bits (278), Expect = 1e-29
 Identities = 44/123 (35%), Positives = 67/123 (54%), Gaps = 1/123 (0%)

Query: 1   MSDGMRTLLVTAFSRTGLTVLWRYEGD-SIENLPGNVHIRKWIPQQDVLAHPNCRLFISH 59
           M +    +L+  F +    VLW+Y+G+    NLP NV  +KW PQ+ VL H N + F++ 
Sbjct: 312 MDNEFLQMLLRTFKKLPYNVLWKYDGEVEAINLPANVLTQKWFPQRAVLKHKNVKAFVTQ 371

Query: 60  GGVNSALEAIHYGIPIIGVPFYGDQLSHVRHIVDLGAGVELSYFNITLESIAWATSIVLN 119
           GGV S  EAI   +P++G+P  GDQ  +    V+LG G  L    ++   +  A   V+ 
Sbjct: 372 GGVQSTDEAIDALVPMVGLPMMGDQFYNTNKYVELGIGRALDTVTVSAAQLVLAIVDVIE 431

Query: 120 NPR 122
           NP+
Sbjct: 432 NPK 434


>gnl|CDD|233407 TIGR01426, MGT, glycosyltransferase, MGT family.  This model
           describes the MGT (macroside glycosyltransferase)
           subfamily of the UDP-glucuronosyltransferase family.
           Members include a number of glucosyl transferases for
           macrolide antibiotic inactivation, but also include
           transferases of glucose-related sugars for macrolide
           antibiotic production [Cellular processes, Toxin
           production and resistance].
          Length = 392

 Score = 84.3 bits (209), Expect = 2e-20
 Identities = 37/96 (38%), Positives = 56/96 (58%), Gaps = 2/96 (2%)

Query: 27  DSIENLPGNVHIRKWIPQQDVLAHPNCRLFISHGGVNSALEAIHYGIPIIGVPFYGDQLS 86
             +  LP NV +R+W+PQ ++L   +   FI+HGG+NS +EA+  G+P++ VP   DQ  
Sbjct: 268 ADLGELPPNVEVRQWVPQLEILKKAD--AFITHGGMNSTMEALFNGVPMVAVPQGADQPM 325

Query: 87  HVRHIVDLGAGVELSYFNITLESIAWATSIVLNNPR 122
             R I +LG G  L    +T E +  A   VL++PR
Sbjct: 326 TARRIAELGLGRHLPPEEVTAEKLREAVLAVLSDPR 361


>gnl|CDD|99960 cd03784, GT1_Gtf_like, This family includes the Gtfs, a group of
           homologous glycosyltransferases involved in the final
           stages of the biosynthesis of antibiotics vancomycin and
           related chloroeremomycin. Gtfs transfer sugar moieties
           from an activated NDP-sugar donor to the oxidatively
           cross-linked heptapeptide core of vancomycin group
           antibiotics. The core structure is important for the
           bioactivity of the antibiotics.
          Length = 401

 Score = 79.7 bits (197), Expect = 1e-18
 Identities = 31/88 (35%), Positives = 46/88 (52%), Gaps = 2/88 (2%)

Query: 26  GDSIENLPGNVHIRKWIPQQDVLAHPNCRLFISHGGVNSALEAIHYGIPIIGVPFYGDQL 85
           G   E+LP NV +  ++P   +L  P C   + HGG  +   A+  G+P + VPF+GDQ 
Sbjct: 280 GLGAEDLPDNVRVVDFVPHDWLL--PRCAAVVHHGGAGTTAAALRAGVPQLVVPFFGDQP 337

Query: 86  SHVRHIVDLGAGVELSYFNITLESIAWA 113
                + +LGAG  L    +T E +A A
Sbjct: 338 FWAARVAELGAGPALDPRELTAERLAAA 365


>gnl|CDD|224732 COG1819, COG1819, Glycosyl transferases, related to
           UDP-glucuronosyltransferase [Carbohydrate transport and
           metabolism / Signal transduction mechanisms].
          Length = 406

 Score = 78.6 bits (194), Expect = 3e-18
 Identities = 29/95 (30%), Positives = 53/95 (55%), Gaps = 2/95 (2%)

Query: 27  DSIENLPGNVHIRKWIPQQDVLAHPNCRLFISHGGVNSALEAIHYGIPIIGVPFYGDQLS 86
           D++ N+P NV +  ++PQ ++L        I HGG  +  EA++ G+P++ +P   DQ  
Sbjct: 277 DTLVNVPDNVIVADYVPQLELLPR--ADAVIHHGGAGTTSEALYAGVPLVVIPDGADQPL 334

Query: 87  HVRHIVDLGAGVELSYFNITLESIAWATSIVLNNP 121
           +   + +LGAG+ L +  +T E +  A + VL + 
Sbjct: 335 NAERVEELGAGIALPFEELTEERLRAAVNEVLADD 369


>gnl|CDD|178170 PLN02555, PLN02555, limonoid glucosyltransferase.
          Length = 480

 Score = 64.0 bits (156), Expect = 4e-13
 Identities = 28/73 (38%), Positives = 43/73 (58%), Gaps = 1/73 (1%)

Query: 30  ENLPGNVHIRKWIPQQDVLAHPNCRLFISHGGVNSALEAIHYGIPIIGVPFYGDQLSHVR 89
           E       I +W PQ+ VLAHP+   F++H G NS +EA+  G+P++  P +GDQ++   
Sbjct: 333 EKAGDKGKIVQWCPQEKVLAHPSVACFVTHCGWNSTMEALSSGVPVVCFPQWGDQVTDAV 392

Query: 90  HIVD-LGAGVELS 101
           ++VD    GV L 
Sbjct: 393 YLVDVFKTGVRLC 405


>gnl|CDD|178032 PLN02410, PLN02410, UDP-glucoronosyl/UDP-glucosyl transferase
           family protein.
          Length = 451

 Score = 63.9 bits (155), Expect = 5e-13
 Identities = 31/71 (43%), Positives = 42/71 (59%), Gaps = 8/71 (11%)

Query: 29  IENLP--------GNVHIRKWIPQQDVLAHPNCRLFISHGGVNSALEAIHYGIPIIGVPF 80
           IE+LP        G  +I KW PQ++VL+HP    F SH G NS LE+I  G+P+I  PF
Sbjct: 311 IESLPKEFSKIISGRGYIVKWAPQKEVLSHPAVGGFWSHCGWNSTLESIGEGVPMICKPF 370

Query: 81  YGDQLSHVRHI 91
             DQ  + R++
Sbjct: 371 SSDQKVNARYL 381


>gnl|CDD|215304 PLN02554, PLN02554, UDP-glycosyltransferase family protein.
          Length = 481

 Score = 55.2 bits (133), Expect = 4e-10
 Identities = 24/61 (39%), Positives = 36/61 (59%), Gaps = 1/61 (1%)

Query: 41  WIPQQDVLAHPNCRLFISHGGVNSALEAIHYGIPIIGVPFYGDQLSHVRHIVD-LGAGVE 99
           W PQ  VLA P    F++H G NS LE++ +G+P+   P Y +Q  +   +V+ LG  VE
Sbjct: 349 WAPQVAVLAKPAIGGFVTHCGWNSILESLWFGVPMAAWPLYAEQKFNAFEMVEELGLAVE 408

Query: 100 L 100
           +
Sbjct: 409 I 409


>gnl|CDD|215084 PLN00164, PLN00164, glucosyltransferase; Provisional.
          Length = 480

 Score = 54.7 bits (132), Expect = 6e-10
 Identities = 22/63 (34%), Positives = 37/63 (58%), Gaps = 1/63 (1%)

Query: 39  RKWIPQQDVLAHPNCRLFISHGGVNSALEAIHYGIPIIGVPFYGDQ-LSHVRHIVDLGAG 97
             W PQ+++LAH     F++H G NS LE++ +G+P+   P Y +Q L+    + D+G  
Sbjct: 344 PTWAPQKEILAHAAVGGFVTHCGWNSVLESLWHGVPMAPWPLYAEQHLNAFELVADMGVA 403

Query: 98  VEL 100
           V +
Sbjct: 404 VAM 406


>gnl|CDD|215112 PLN02167, PLN02167, UDP-glycosyltransferase family protein.
          Length = 475

 Score = 54.0 bits (130), Expect = 1e-09
 Identities = 26/61 (42%), Positives = 37/61 (60%), Gaps = 1/61 (1%)

Query: 41  WIPQQDVLAHPNCRLFISHGGVNSALEAIHYGIPIIGVPFYGD-QLSHVRHIVDLGAGVE 99
           W PQ ++LAH     F+SH G NS LE++ +G+PI   P Y + QL+    + +LG  VE
Sbjct: 347 WAPQVEILAHKAIGGFVSHCGWNSVLESLWFGVPIATWPMYAEQQLNAFTMVKELGLAVE 406

Query: 100 L 100
           L
Sbjct: 407 L 407


>gnl|CDD|215127 PLN02210, PLN02210, UDP-glucosyl transferase.
          Length = 456

 Score = 52.0 bits (124), Expect = 8e-09
 Identities = 28/102 (27%), Positives = 51/102 (50%), Gaps = 10/102 (9%)

Query: 9   LVTAFSRTGLTVLW----RYEGDSIENL-----PGNVHIRKWIPQQDVLAHPNCRLFISH 59
           +  A    G+  LW    + +  +++ L      G   + +W PQ+ +L+H     F++H
Sbjct: 290 IAKALKNRGVPFLWVIRPKEKAQNVQVLQEMVKEGQGVVLEWSPQEKILSHMAISCFVTH 349

Query: 60  GGVNSALEAIHYGIPIIGVPFYGDQLSHVRHIVDL-GAGVEL 100
            G NS +E +  G+P++  P + DQ    R +VD+ G GV +
Sbjct: 350 CGWNSTIETVVAGVPVVAYPSWTDQPIDARLLVDVFGIGVRM 391


>gnl|CDD|215247 PLN02448, PLN02448, UDP-glycosyltransferase family protein.
          Length = 459

 Score = 50.8 bits (122), Expect = 2e-08
 Identities = 20/53 (37%), Positives = 30/53 (56%)

Query: 41  WIPQQDVLAHPNCRLFISHGGVNSALEAIHYGIPIIGVPFYGDQLSHVRHIVD 93
           W  Q  VL H +   F +H G NS LEA+  G+P++  P + DQ  + + IV+
Sbjct: 330 WCDQLKVLCHSSVGGFWTHCGWNSTLEAVFAGVPMLTFPLFWDQPLNSKLIVE 382


>gnl|CDD|177830 PLN02173, PLN02173, UDP-glucosyl transferase family protein.
          Length = 449

 Score = 49.6 bits (118), Expect = 4e-08
 Identities = 21/64 (32%), Positives = 38/64 (59%), Gaps = 1/64 (1%)

Query: 38  IRKWIPQQDVLAHPNCRLFISHGGVNSALEAIHYGIPIIGVPFYGDQLSHVRHIVDL-GA 96
           + KW PQ  VL++     F++H G NS +E +  G+P++ +P + DQ  + ++I D+   
Sbjct: 321 VLKWSPQLQVLSNKAIGCFMTHCGWNSTMEGLSLGVPMVAMPQWTDQPMNAKYIQDVWKV 380

Query: 97  GVEL 100
           GV +
Sbjct: 381 GVRV 384


>gnl|CDD|177857 PLN02207, PLN02207, UDP-glycosyltransferase.
          Length = 468

 Score = 49.7 bits (118), Expect = 4e-08
 Identities = 26/73 (35%), Positives = 42/73 (57%), Gaps = 1/73 (1%)

Query: 29  IENLPGNVHIRKWIPQQDVLAHPNCRLFISHGGVNSALEAIHYGIPIIGVPFYGD-QLSH 87
           ++ + G   I  W PQ ++LAH     F+SH G NS +E++ +G+PI+  P Y + QL+ 
Sbjct: 327 LDRVSGRGMICGWSPQVEILAHKAVGGFVSHCGWNSIVESLWFGVPIVTWPMYAEQQLNA 386

Query: 88  VRHIVDLGAGVEL 100
              + +L   VEL
Sbjct: 387 FLMVKELKLAVEL 399


>gnl|CDD|215293 PLN02534, PLN02534, UDP-glycosyltransferase.
          Length = 491

 Score = 49.5 bits (118), Expect = 6e-08
 Identities = 24/64 (37%), Positives = 38/64 (59%), Gaps = 1/64 (1%)

Query: 38  IRKWIPQQDVLAHPNCRLFISHGGVNSALEAIHYGIPIIGVPFYGDQLSHVRHIVD-LGA 96
           I+ W PQ  +L+HP    F++H G NS +E I  G+P+I  P + +Q  + + IV+ L  
Sbjct: 348 IKGWAPQVLILSHPAIGGFLTHCGWNSTIEGICSGVPMITWPLFAEQFLNEKLIVEVLRI 407

Query: 97  GVEL 100
           GV +
Sbjct: 408 GVRV 411


>gnl|CDD|215465 PLN02863, PLN02863, UDP-glucoronosyl/UDP-glucosyl transferase
           family protein.
          Length = 477

 Score = 48.7 bits (116), Expect = 8e-08
 Identities = 21/56 (37%), Positives = 31/56 (55%)

Query: 38  IRKWIPQQDVLAHPNCRLFISHGGVNSALEAIHYGIPIIGVPFYGDQLSHVRHIVD 93
           IR W PQ  +L+H     F++H G NS LE +  G+P++  P   DQ  +   +VD
Sbjct: 347 IRGWAPQVAILSHRAVGAFLTHCGWNSVLEGLVAGVPMLAWPMAADQFVNASLLVD 402


>gnl|CDD|178589 PLN03015, PLN03015, UDP-glucosyl transferase.
          Length = 470

 Score = 48.2 bits (114), Expect = 2e-07
 Identities = 18/47 (38%), Positives = 31/47 (65%)

Query: 38  IRKWIPQQDVLAHPNCRLFISHGGVNSALEAIHYGIPIIGVPFYGDQ 84
           + +W PQ ++L+H +   F+SH G +S LE++  G+PI+  P Y +Q
Sbjct: 339 VTQWAPQVEILSHRSIGGFLSHCGWSSVLESLTKGVPIVAWPLYAEQ 385


>gnl|CDD|222200 pfam13528, Glyco_trans_1_3, Glycosyl transferase family 1. 
          Length = 317

 Score = 47.3 bits (113), Expect = 2e-07
 Identities = 21/91 (23%), Positives = 39/91 (42%), Gaps = 4/91 (4%)

Query: 24  YEGDSIENLPGNVHIRKWIPQQDVLAH-PNCRLFISHGGVNSALEAIHYGIPIIGVPFYG 82
           Y  +  E   GN+  R    ++  L     C   I + G     EA++ G P++ VP  G
Sbjct: 221 YGPNKEEGREGNLTFRP-FSEEGFLRDLAECSAVICNAGFELLSEALYLGKPLLLVPLDG 279

Query: 83  --DQLSHVRHIVDLGAGVELSYFNITLESIA 111
             +Q S+  ++  LG G+ +   ++    + 
Sbjct: 280 QFEQTSNALYLERLGYGIVMDMEDLDPAVLR 310


>gnl|CDD|178581 PLN03004, PLN03004, UDP-glycosyltransferase.
          Length = 451

 Score = 47.0 bits (111), Expect = 3e-07
 Identities = 22/56 (39%), Positives = 32/56 (57%)

Query: 38  IRKWIPQQDVLAHPNCRLFISHGGVNSALEAIHYGIPIIGVPFYGDQLSHVRHIVD 93
           ++ W PQ  VL H     F++H G NS LEA+  G+P++  P Y +Q  +   IVD
Sbjct: 338 VKSWAPQVPVLNHKAVGGFVTHCGWNSILEAVCAGVPMVAWPLYAEQRFNRVMIVD 393


>gnl|CDD|215305 PLN02562, PLN02562, UDP-glycosyltransferase.
          Length = 448

 Score = 46.4 bits (110), Expect = 6e-07
 Identities = 33/104 (31%), Positives = 47/104 (45%), Gaps = 9/104 (8%)

Query: 9   LVTAFSRTGLTVLWRYEGDSIENLP-GNVH-------IRKWIPQQDVLAHPNCRLFISHG 60
           L  A   +G   +W       E LP G V        +  W PQ +VL H     +++H 
Sbjct: 295 LALALEASGRPFIWVLNPVWREGLPPGYVERVSKQGKVVSWAPQLEVLKHQAVGCYLTHC 354

Query: 61  GVNSALEAIHYGIPIIGVPFYGDQLSHVRHIVDL-GAGVELSYF 103
           G NS +EAI     ++  P  GDQ  +  +IVD+   GV +S F
Sbjct: 355 GWNSTMEAIQCQKRLLCYPVAGDQFVNCAYIVDVWKIGVRISGF 398


>gnl|CDD|217893 pfam04101, Glyco_tran_28_C, Glycosyltransferase family 28
           C-terminal domain.  The glycosyltransferase family 28
           includes monogalactosyldiacylglycerol synthase (EC
           2.4.1.46) and UDP-N-acetylglucosamine transferase (EC
           2.4.1.-). Structural analysis suggests the C-terminal
           domain contains the UDP-GlcNAc binding site.
          Length = 167

 Score = 44.6 bits (106), Expect = 1e-06
 Identities = 21/75 (28%), Positives = 31/75 (41%), Gaps = 5/75 (6%)

Query: 55  LFISHGGVNSALEAIHYGIPIIGVP----FYGDQLSHVRHIVDLGAGVELSYFNITLESI 110
           L IS  G  +  E +  G P I VP        Q ++   +V  GA + L    +T E +
Sbjct: 75  LVISRAGAGTIAELLALGKPAILVPRPKAAGEHQDNNALELVKAGAALVLLQKELTPEKL 134

Query: 111 AWA-TSIVLNNPRRY 124
             A   ++L   R Y
Sbjct: 135 VEALLKLLLKPLRLY 149


>gnl|CDD|178572 PLN02992, PLN02992, coniferyl-alcohol glucosyltransferase.
          Length = 481

 Score = 45.0 bits (106), Expect = 2e-06
 Identities = 21/64 (32%), Positives = 35/64 (54%), Gaps = 1/64 (1%)

Query: 38  IRKWIPQQDVLAHPNCRLFISHGGVNSALEAIHYGIPIIGVPFYGDQLSHVRHIVD-LGA 96
           +  W PQ ++LAH     F++H G +S LE++  G+P+I  P + +Q  +   + D LG 
Sbjct: 342 VPSWAPQAEILAHQAVGGFLTHCGWSSTLESVVGGVPMIAWPLFAEQNMNAALLSDELGI 401

Query: 97  GVEL 100
            V  
Sbjct: 402 AVRS 405


>gnl|CDD|177813 PLN02152, PLN02152, indole-3-acetate beta-glucosyltransferase.
          Length = 455

 Score = 43.9 bits (103), Expect = 4e-06
 Identities = 22/72 (30%), Positives = 37/72 (51%), Gaps = 7/72 (9%)

Query: 25  EGDSIENLPGNVH-------IRKWIPQQDVLAHPNCRLFISHGGVNSALEAIHYGIPIIG 77
           E   IE + G  H       I  W  Q +VL H     F++H G +S+LE++  G+P++ 
Sbjct: 311 EETEIEKIAGFRHELEEVGMIVSWCSQIEVLRHRAVGCFVTHCGWSSSLESLVLGVPVVA 370

Query: 78  VPFYGDQLSHVR 89
            P + DQ ++ +
Sbjct: 371 FPMWSDQPANAK 382


>gnl|CDD|178584 PLN03007, PLN03007, UDP-glucosyltransferase family protein.
          Length = 482

 Score = 42.2 bits (99), Expect = 2e-05
 Identities = 20/53 (37%), Positives = 29/53 (54%), Gaps = 5/53 (9%)

Query: 38  IRKWIPQQDVLAHPNCRLFISHGGVNSALEAIHYGIPIIGVP-----FYGDQL 85
           IR W PQ  +L H     F++H G NS LE +  G+P++  P     FY ++L
Sbjct: 349 IRGWAPQVLILDHQATGGFVTHCGWNSLLEGVAAGLPMVTWPVGAEQFYNEKL 401


>gnl|CDD|178275 PLN02670, PLN02670, transferase, transferring glycosyl groups.
          Length = 472

 Score = 40.7 bits (95), Expect = 6e-05
 Identities = 20/68 (29%), Positives = 37/68 (54%), Gaps = 1/68 (1%)

Query: 34  GNVHIRKWIPQQDVLAHPNCRLFISHGGVNSALEAIHYGIPIIGVPFYGDQLSHVRHIVD 93
           G +H+  W+PQ  +L+H +   F++H G NS +E + +G  +I  P   +Q  + R +  
Sbjct: 340 GMIHV-GWVPQVKILSHESVGGFLTHCGWNSVVEGLGFGRVLILFPVLNEQGLNTRLLHG 398

Query: 94  LGAGVELS 101
              G+E+ 
Sbjct: 399 KKLGLEVP 406


>gnl|CDD|178364 PLN02764, PLN02764, glycosyltransferase family protein.
          Length = 453

 Score = 40.0 bits (93), Expect = 1e-04
 Identities = 22/62 (35%), Positives = 35/62 (56%), Gaps = 1/62 (1%)

Query: 41  WIPQQDVLAHPNCRLFISHGGVNSALEAIHYGIPIIGVPFYGDQLSHVRHIVD-LGAGVE 99
           W+ Q  +L+HP+   F+SH G  S  E++     I+ VP  GDQ+ + R + D L   VE
Sbjct: 324 WVQQPLILSHPSVGCFVSHCGFGSMWESLLSDCQIVLVPQLGDQVLNTRLLSDELKVSVE 383

Query: 100 LS 101
           ++
Sbjct: 384 VA 385


>gnl|CDD|233077 TIGR00661, MJ1255, conserved hypothetical protein.  This model
           represents nearly the full length of MJ1255 from
           Methanococcus jannaschii and of an unpublished protein
           from Vibrio cholerae, as well as the C-terminal half of
           a protein from Methanobacterium thermoautotrophicum. A
           small region (~50 amino acids) within the domain appears
           related to a family of sugar transferases [Hypothetical
           proteins, Conserved].
          Length = 321

 Score = 39.9 bits (93), Expect = 1e-04
 Identities = 26/80 (32%), Positives = 39/80 (48%), Gaps = 4/80 (5%)

Query: 32  LPGNVHIRKWIPQQDVLAH-PNCRLFISHGGVNSALEAIHYGIPIIGVPFYG--DQLSHV 88
              NV IR+ I   +      N  L I+HGG +   EA+  G P+I +P  G  +Q ++ 
Sbjct: 227 YNENVEIRR-ITTDNFKELIKNAELVITHGGFSLISEALSLGKPLIVIPDLGQFEQGNNA 285

Query: 89  RHIVDLGAGVELSYFNITLE 108
             + DLG G+ L Y  + L 
Sbjct: 286 VKLEDLGCGIALEYKELRLL 305


>gnl|CDD|177858 PLN02208, PLN02208, glycosyltransferase family protein.
          Length = 442

 Score = 35.8 bits (82), Expect = 0.003
 Identities = 14/49 (28%), Positives = 26/49 (53%)

Query: 41  WIPQQDVLAHPNCRLFISHGGVNSALEAIHYGIPIIGVPFYGDQLSHVR 89
           W+ Q  +L HP+   F++H G  +  E++     ++ +PF  DQ+   R
Sbjct: 318 WVQQPLILDHPSIGCFVNHCGPGTIWESLVSDCQMVLIPFLSDQVLFTR 366


>gnl|CDD|223779 COG0707, MurG, UDP-N-acetylglucosamine:LPS N-acetylglucosamine
           transferase [Cell envelope biogenesis, outer membrane].
          Length = 357

 Score = 34.6 bits (80), Expect = 0.008
 Identities = 20/74 (27%), Positives = 34/74 (45%), Gaps = 4/74 (5%)

Query: 55  LFISHGGVNSALEAIHYGIPIIGVPF----YGDQLSHVRHIVDLGAGVELSYFNITLESI 110
           L IS  G  +  E +  G+P I VP+     G Q  + + +   GA + +    +T E +
Sbjct: 255 LVISRAGALTIAELLALGVPAILVPYPPGADGHQEYNAKFLEKAGAALVIRQSELTPEKL 314

Query: 111 AWATSIVLNNPRRY 124
           A     +L+NP + 
Sbjct: 315 AELILRLLSNPEKL 328


>gnl|CDD|177807 PLN00414, PLN00414, glycosyltransferase family protein.
          Length = 446

 Score = 33.1 bits (75), Expect = 0.022
 Identities = 15/49 (30%), Positives = 26/49 (53%)

Query: 41  WIPQQDVLAHPNCRLFISHGGVNSALEAIHYGIPIIGVPFYGDQLSHVR 89
           W+ Q  +L+HP+   F++H G  S  E++     I+ +P   DQ+   R
Sbjct: 319 WVEQPLILSHPSVGCFVNHCGFGSMWESLVSDCQIVFIPQLADQVLITR 367


>gnl|CDD|233284 TIGR01133, murG, undecaprenyldiphospho-muramoylpentapeptide
           beta-N-acetylglucosaminyltransferase.  RM 8449890 RT The
           final step of peptidoglycan subunit assembly in
           Escherichia coli occurs in the cytoplasm. RA Bupp K, van
           Heijenoort J. RL J Bacteriol 1993 Mar;175(6):1841-3
           [Cell envelope, Biosynthesis and degradation of murein
           sacculus and peptidoglycan].
          Length = 348

 Score = 31.5 bits (72), Expect = 0.069
 Identities = 16/46 (34%), Positives = 24/46 (52%), Gaps = 3/46 (6%)

Query: 55  LFISHGGVNSALEAIHYGIPIIGVPF---YGDQLSHVRHIVDLGAG 97
           L IS  G ++  E    G+P I +P+     DQ  + + + DLGAG
Sbjct: 253 LVISRAGASTVAELAAAGVPAILIPYPYAADDQYYNAKFLEDLGAG 298


>gnl|CDD|226488 COG3980, spsG, Spore coat polysaccharide biosynthesis protein,
           predicted glycosyltransferase [Cell envelope biogenesis,
           outer membrane].
          Length = 318

 Score = 31.3 bits (71), Expect = 0.097
 Identities = 18/71 (25%), Positives = 29/71 (40%), Gaps = 1/71 (1%)

Query: 53  CRLFISHGGVNSALEAIHYGIPIIGVPFYGDQLSHVRHIVDLGAGVELSYFNITLESIAW 112
             L IS  G  +  EA+  G+P + +P   +Q++  +    LG   +L Y    L     
Sbjct: 228 ADLAISAAGS-TLYEALLLGVPSLVLPLAENQIATAKEFEALGIIKQLGYHLKDLAKDYE 286

Query: 113 ATSIVLNNPRR 123
              I  +  RR
Sbjct: 287 ILQIQKDYARR 297


>gnl|CDD|215979 pfam00534, Glycos_transf_1, Glycosyl transferases group 1.
           Mutations in this domain of human PIGA lead to disease
           (Paroxysmal Nocturnal haemoglobinuria). Members of this
           family transfer activated sugars to a variety of
           substrates, including glycogen, Fructose-6-phosphate and
           lipopolysaccharides. Members of this family transfer
           UDP, ADP, GDP or CMP linked sugars. The eukaryotic
           glycogen synthases may be distant members of this
           family.
          Length = 158

 Score = 29.5 bits (67), Expect = 0.26
 Identities = 22/97 (22%), Positives = 41/97 (42%), Gaps = 11/97 (11%)

Query: 31  NLPGNVHIRKWIPQQDVLAH-PNCRLFI--SH--GGVNSALEAIHYGIPIIGVPFYGDQL 85
            L  NV    ++P +D++       LF+  S   G     LEA+  G+P+I     G   
Sbjct: 56  GLEDNVIFVGFVPDEDLIELYRIADLFVLPSRYEGFGLVLLEAMAAGVPVIATDVGG--P 113

Query: 86  SHVRHIVDLGAGVELSYFNITLESIAWATSIVLNNPR 122
           + +  + D   G+ +   +   E++A A   +L +  
Sbjct: 114 AEI--VKDGETGLLVDPGDA--EALAEAIEKLLKDEE 146


>gnl|CDD|149883 pfam08953, DUF1899, Domain of unknown function (DUF1899).  This set
           of domains is found in various eukaryotic proteins.
           Function is unknown.
          Length = 65

 Score = 27.6 bits (62), Expect = 0.49
 Identities = 11/37 (29%), Positives = 17/37 (45%)

Query: 86  SHVRHIVDLGAGVELSYFNITLESIAWATSIVLNNPR 122
           S  RH+    A  E  Y +I +   AW ++    NP+
Sbjct: 6   SKFRHVFGQPAKKEQCYEDIRISKNAWDSNFCAVNPK 42


>gnl|CDD|222322 pfam13692, Glyco_trans_1_4, Glycosyl transferases group 1. 
          Length = 134

 Score = 28.5 bits (64), Expect = 0.59
 Identities = 11/60 (18%), Positives = 25/60 (41%), Gaps = 8/60 (13%)

Query: 27  DSIENLPGNVHIRKWIPQ-QDVLAHPNCRLFISHGGVNS-----ALEAIHYGIPIIGVPF 80
           + +  L  NVH   ++     +LA  +  + ++     +      LEA+  G+P++    
Sbjct: 45  EELARLAPNVHFLGFVEDLAALLA--SADVALAPLRFGAGSPLKLLEALAAGLPVVATDI 102


>gnl|CDD|184624 PRK14327, PRK14327, (dimethylallyl)adenosine tRNA
           methylthiotransferase; Provisional.
          Length = 509

 Score = 28.1 bits (63), Expect = 1.2
 Identities = 15/34 (44%), Positives = 20/34 (58%), Gaps = 6/34 (17%)

Query: 13  FSRTGLTVLWRYEGDSIENLP----GNVHIRKWI 42
           FS+  +  +W  EGD IENLP    GN  I+ W+
Sbjct: 185 FSKEMVVEVWSKEGDVIENLPKVREGN--IKAWV 216


>gnl|CDD|99969 cd03795, GT1_like_4, This family is most closely related to the GT1
           family of glycosyltransferases. Glycosyltransferases
           catalyze the transfer of sugar moieties from activated
           donor molecules to specific acceptor molecules, forming
           glycosidic bonds. The acceptor molecule can be a lipid,
           a protein, a heterocyclic compound, or another
           carbohydrate residue. This group of glycosyltransferases
           is most closely related to the previously defined
           glycosyltransferase family 1 (GT1). The members of this
           family may transfer UDP, ADP, GDP, or CMP-linked sugars.
           The diverse enzymatic activities among members of this
           family reflect a wide range of biological functions. The
           protein structure available for this family has the GTB
           topology, one of the two protein topologies observed for
           nucleotide-sugar-dependent glycosyltransferases. GTB
           proteins have distinct N- and C- terminal domains each
           containing a typical Rossmann fold. The two domains have
           high structural homology despite minimal sequence
           homology. The large cleft that separates the two domains
           includes the catalytic center and permits a high degree
           of flexibility. The members of this family are found
           mainly in bacteria and eukaryotes.
          Length = 357

 Score = 27.9 bits (63), Expect = 1.4
 Identities = 13/55 (23%), Positives = 22/55 (40%), Gaps = 9/55 (16%)

Query: 30  ENLPGNVHIRKWIPQQDVLA-HPNCRLFI--SHG-----GVNSALEAIHYGIPII 76
             L   V     +  ++  A    C +F+  S       G+   LEA+ +G P+I
Sbjct: 240 LGLLDRVRFLGRLDDEEKAALLAACDVFVFPSVERSEAFGI-VLLEAMAFGKPVI 293


>gnl|CDD|218956 pfam06248, Zw10, Centromere/kinetochore Zw10.  Zw10 and rough deal
           proteins are both required for correct metaphase
           check-pointing during mitosis. These proteins bind to
           the centromere/kinetochore.
          Length = 593

 Score = 27.8 bits (62), Expect = 1.4
 Identities = 9/18 (50%), Positives = 12/18 (66%), Gaps = 1/18 (5%)

Query: 42  IPQQDVLAHPNCRLFISH 59
           IPQ   L H NC +++SH
Sbjct: 507 IPQLSALFHNNC-MYLSH 523


>gnl|CDD|179352 PRK01911, ppnK, inorganic polyphosphate/ATP-NAD kinase;
          Provisional.
          Length = 292

 Score = 27.6 bits (62), Expect = 1.5
 Identities = 10/29 (34%), Positives = 14/29 (48%), Gaps = 4/29 (13%)

Query: 54 RLFISHGGVNSALEAIHY----GIPIIGV 78
           + IS GG  + L    Y     IPI+G+
Sbjct: 66 DMVISIGGDGTFLRTATYVGNSNIPILGI 94


>gnl|CDD|100001 cd04955, GT1_like_6, This family is most closely related to the GT1
           family of glycosyltransferases. Glycosyltransferases
           catalyze the transfer of sugar moieties from activated
           donor molecules to specific acceptor molecules, forming
           glycosidic bonds. The acceptor molecule can be a lipid,
           a protein, a heterocyclic compound, or another
           carbohydrate residue. This group of glycosyltransferases
           is most closely related to the previously defined
           glycosyltransferase family 1 (GT1). The members of this
           family may transfer UDP, ADP, GDP, or CMP linked sugars.
           The diverse enzymatic activities among members of this
           family reflect a wide range of biological functions. The
           protein structure available for this family has the GTB
           topology, one of the two protein topologies observed for
           nucleotide-sugar-dependent glycosyltransferases. GTB
           proteins have distinct N- and C- terminal domains each
           containing a typical Rossmann fold. The two domains have
           high structural homology despite minimal sequence
           homology. The large cleft that separates the two domains
           includes the catalytic center and permits a high degree
           of flexibility. The members of this family are found in
           certain bacteria and Archaea.
          Length = 363

 Score = 27.2 bits (61), Expect = 2.2
 Identities = 14/44 (31%), Positives = 22/44 (50%), Gaps = 7/44 (15%)

Query: 43  PQQDVLAHPNCRLFI-SH--GGVN-SALEAIHYGIPII--GVPF 80
              ++L +     ++  H  GG N S LEA+ YG P++    PF
Sbjct: 260 ELLELLRY-AALFYLHGHSVGGTNPSLLEAMAYGCPVLASDNPF 302


>gnl|CDD|227429 COG5098, COG5098, Chromosome condensation complex Condensin,
           subunit D2 [Chromatin structure and dynamics / Cell
           division and chromosome partitioning].
          Length = 1128

 Score = 27.3 bits (60), Expect = 3.0
 Identities = 21/99 (21%), Positives = 38/99 (38%), Gaps = 13/99 (13%)

Query: 28  SIENLPGNVHIRKWIPQQDVLAHPNCRLFISHGGVNSALEAIHYGIPIIGVPFYGDQ--- 84
           S+E L G ++ R++I +  +       ++ S    + +L      I  IG+    D    
Sbjct: 625 SLEQLLGMMYARRYIDKTVI--SELWSMY-SLQKFDFSLLQASGSIIHIGMLLLVDMFHS 681

Query: 85  LSHVRHIVDLGAGVE------LS-YFNITLESIAWATSI 116
           L  +  ++ +G GVE      LS Y  I    +      
Sbjct: 682 LVGLSTLLLIGLGVEGLLDLMLSRYKCIAQIRVRQRVKF 720


>gnl|CDD|182419 PRK10377, PRK10377, PTS system glucitol/sorbitol-specific
           transporter subunit IIA; Provisional.
          Length = 120

 Score = 26.2 bits (58), Expect = 3.3
 Identities = 8/25 (32%), Positives = 13/25 (52%)

Query: 20  VLWRYEGDSIENLPGNVHIRKWIPQ 44
           V  R++G +    PG VH+   +P 
Sbjct: 84  VTLRFDGLNEAEFPGTVHVAGPVPD 108


>gnl|CDD|227350 COG5017, COG5017, Uncharacterized conserved protein [Function
           unknown].
          Length = 161

 Score = 26.3 bits (58), Expect = 3.7
 Identities = 21/73 (28%), Positives = 31/73 (42%), Gaps = 9/73 (12%)

Query: 54  RLFISHGGVNSALEAIHYGIPIIGVPFYGDQLSHV-RHIVDLG-AGVELSYFNITLESIA 111
           R+ ISH G  S L  +    P+I VP        V  H V+L     E++Y       +A
Sbjct: 67  RIVISHAGEGSILLLLRLDKPLIVVPRSSQYQELVDDHQVELALKLAEINYV------VA 120

Query: 112 WA-TSIVLNNPRR 123
            + T +VL    +
Sbjct: 121 CSPTELVLQAGLQ 133


>gnl|CDD|204088 pfam08888, HopJ, HopJ type III effector protein.  Pathovars of
          Pseudomonas syringae interact with their plant hosts
          via the action of Hrp outer protein (Hop) effector
          proteins, injected into plant cells by the type III
          secretion system. The proteins in this family are
          called HopJ after the original member HopPmaJ.
          Length = 110

 Score = 26.0 bits (58), Expect = 3.9
 Identities = 9/23 (39%), Positives = 12/23 (52%), Gaps = 6/23 (26%)

Query: 44 QQDVLA------HPNCRLFISHG 60
          ++DVLA      H N R F+  G
Sbjct: 77 REDVLANPEGTDHQNIRNFMKTG 99


>gnl|CDD|165451 PHA03180, PHA03180, helicase-primase primase subunit; Provisional.
          Length = 1071

 Score = 26.7 bits (59), Expect = 4.4
 Identities = 10/23 (43%), Positives = 13/23 (56%), Gaps = 1/23 (4%)

Query: 53  CRLFISHGGVNSALEAIHYGIPI 75
           CR F S   +NS   +I +G PI
Sbjct: 111 CR-FTSESEINSIYASIMHGKPI 132


>gnl|CDD|226254 COG3731, SrlB, Phosphotransferase system sorbitol-specific
           component IIA [Carbohydrate transport and metabolism].
          Length = 123

 Score = 25.5 bits (56), Expect = 5.6
 Identities = 11/38 (28%), Positives = 17/38 (44%), Gaps = 3/38 (7%)

Query: 20  VLWRYEGDSIENLPGNVHIRKWIPQQDVLAHPNCRLFI 57
           V  R++G +    PG +H+   +P    L      LFI
Sbjct: 84  VTIRFDGATEAEFPGTIHVSGPVPD---LVAGGTILFI 118


>gnl|CDD|99972 cd03799, GT1_amsK_like, This is a family of GT1
           glycosyltransferases found specifically in certain
           bacteria. amsK in Erwinia amylovora, has been reported
           to be involved in the biosynthesis of amylovoran, a
           exopolysaccharide acting as a virulence factor.
          Length = 355

 Score = 26.0 bits (58), Expect = 5.7
 Identities = 15/65 (23%), Positives = 24/65 (36%), Gaps = 16/65 (24%)

Query: 31  NLPGNVHIRKWIPQQDVLAH-PNCRLFI--SH--------GGVNSALEAIHYGIPII--- 76
            L   V +     Q++V        LF+  S         G     +EA+  G+P+I   
Sbjct: 233 GLEDRVTLLGAKSQEEVRELLRAADLFVLPSVTAADGDREGLPVVLMEAMAMGLPVISTD 292

Query: 77  --GVP 79
             G+P
Sbjct: 293 VSGIP 297


>gnl|CDD|217556 pfam03435, Saccharop_dh, Saccharopine dehydrogenase.  This family
           comprised of three structural domains that can not be
           separated in the linear sequence. In some organisms this
           enzyme is found as a bifunctional polypeptide with
           lysine ketoglutarate reductase. The saccharopine
           dehydrogenase can also function as a saccharopine
           reductase.
          Length = 380

 Score = 26.1 bits (58), Expect = 5.8
 Identities = 10/79 (12%), Positives = 25/79 (31%), Gaps = 8/79 (10%)

Query: 22  WRYEGDSIENLPGNVHIRKWIPQQDVLAHPNCRLFISHGGVNSALEAIHYGIP-----II 76
             +E      +    H R       +       ++++  G  + +      +P     + 
Sbjct: 184 RYWENGKWVEVGWGSHERTIPFPYGIGPGYAYEIYLNRPGTLTRVR---SWLPEADGVLF 240

Query: 77  GVPFYGDQLSHVRHIVDLG 95
           G       L+ ++ + DLG
Sbjct: 241 GTIRIPGYLTVIKLLGDLG 259


>gnl|CDD|240086 cd04735, OYE_like_4_FMN, Old yellow enzyme (OYE)-related FMN
           binding domain, group 4.  Each monomer of OYE contains
           FMN as a non-covalently bound cofactor, uses NADPH as a
           reducing agent with oxygens, quinones, and
           alpha,beta-unsaturated aldehydes and ketones, and can
           act as electron acceptors in the catalytic reaction.
           Other members of OYE family include trimethylamine
           dehydrogenase, 2,4-dienoyl-CoA reductase, enoate
           reductase, pentaerythriol tetranitrate reductase,
           xenobiotic reductase, and morphinone reductase.
          Length = 353

 Score = 25.6 bits (57), Expect = 7.4
 Identities = 11/36 (30%), Positives = 18/36 (50%), Gaps = 4/36 (11%)

Query: 47  VLAHPNCRL-FISHGGVNS---ALEAIHYGIPIIGV 78
           V      RL  I+ G +N+   ALEA+  G  ++ +
Sbjct: 277 VKERIAGRLPLIAVGSINTPDDALEALETGADLVAI 312


>gnl|CDD|99997 cd04946, GT1_AmsK_like, This family is most closely related to the
           GT1 family of glycosyltransferases. AmsK is involved in
           the biosynthesis of amylovoran, which functions as a
           virulence factor. It functions as a glycosyl transferase
           which transfers galactose from UDP-galactose to a
           lipid-linked amylovoran-subunit precursor.  The members
           of this family are found mainly in bacteria and Archaea.
          Length = 407

 Score = 25.8 bits (57), Expect = 7.6
 Identities = 15/53 (28%), Positives = 23/53 (43%), Gaps = 13/53 (24%)

Query: 31  NLPG---NVHIRKWIPQQDVLAHPNCRLFI----SHGGVNSALEAIHYGIPII 76
           N  G   N  + K   +  V       +F+    S G   S +EA+ +GIP+I
Sbjct: 292 NFTGELSNSEVYKLYKENPV------DVFVNLSESEGLPVSIMEAMSFGIPVI 338


>gnl|CDD|172952 PRK14477, PRK14477, bifunctional nitrogenase molybdenum-cofactor
           biosynthesis protein NifE/NifN; Provisional.
          Length = 917

 Score = 25.9 bits (57), Expect = 8.9
 Identities = 13/35 (37%), Positives = 17/35 (48%), Gaps = 6/35 (17%)

Query: 71  YGIPIIGVPFYG-----DQLSHV-RHIVDLGAGVE 99
           YGIP +   FYG       L  + R + D G G+E
Sbjct: 254 YGIPYLEESFYGMTDTAKALRDIARELDDAGGGLE 288


>gnl|CDD|226807 COG4370, COG4370, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 412

 Score = 25.6 bits (56), Expect = 9.5
 Identities = 16/59 (27%), Positives = 21/59 (35%), Gaps = 15/59 (25%)

Query: 41  WIPQQDVLAHPNCRLFISHGGVNSAL---------------EAIHYGIPIIGVPFYGDQ 84
           W P  D     NC L++S       L               +A+  G P+IG P  G Q
Sbjct: 284 WQPLADRFGKDNCSLWLSQQSFADILHAADAALGMAGTATEQAVGLGKPVIGFPGQGPQ 342


>gnl|CDD|184699 PRK14478, PRK14478, nitrogenase molybdenum-cofactor biosynthesis
           protein NifE; Provisional.
          Length = 475

 Score = 25.4 bits (56), Expect = 9.7
 Identities = 15/36 (41%), Positives = 18/36 (50%), Gaps = 6/36 (16%)

Query: 71  YGIPIIGVPFYG-----DQLSHV-RHIVDLGAGVEL 100
           YGIP     FYG     D L  + R +V+ GA  EL
Sbjct: 261 YGIPFFEGSFYGIEDTSDSLRQIARLLVERGADAEL 296


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.323    0.141    0.444 

Gapped
Lambda     K      H
   0.267   0.0711    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 6,593,021
Number of extensions: 586388
Number of successful extensions: 608
Number of sequences better than 10.0: 1
Number of HSP's gapped: 599
Number of HSP's successfully gapped: 65
Length of query: 124
Length of database: 10,937,602
Length adjustment: 85
Effective length of query: 39
Effective length of database: 7,167,512
Effective search space: 279532968
Effective search space used: 279532968
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 53 (24.2 bits)