RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy16938
         (403 letters)



>gnl|CDD|201077 pfam00201, UDPGT, UDP-glucoronosyl and UDP-glucosyl transferase. 
          Length = 500

 Score =  157 bits (400), Expect = 2e-43
 Identities = 69/167 (41%), Positives = 104/167 (62%)

Query: 220 TLPDNVFIQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVL 279
           TL  N  + KW PQ D+L HP  R F+TH G + + EA   GVP++G+P FGDQ  N   
Sbjct: 320 TLGRNTRLVKWLPQNDLLGHPKTRAFVTHAGSNGVYEAICHGVPMVGMPLFGDQMDNAKH 379

Query: 280 LRHRGYALIEPIQTLTKQSFLKNAQTMLNDPSFKQNAKKWASIANDEIVSPLERVVYWTE 339
           +  +G A+   + T+T +  L   +T++NDPS+K+N  + +SI +D+ V PL+R V+W E
Sbjct: 380 MEAKGAAVTLNVLTMTSEDLLNALKTVINDPSYKENIMRLSSIHHDQPVKPLDRAVFWIE 439

Query: 340 YVLRHKGAPHLSASSRQLTWYQMYCIDIILVILGILYAVVKLLSMCC 386
           +V+RHKGA HL  ++  LTWYQ + +D+I  +L  +  V  +   CC
Sbjct: 440 FVMRHKGAKHLRPAAHDLTWYQYHSLDVIGFLLACVATVAFITFKCC 486



 Score = 51.6 bits (124), Expect = 4e-07
 Identities = 26/81 (32%), Positives = 37/81 (45%), Gaps = 7/81 (8%)

Query: 2   SFGSVVDPTKLSEETKLGFLEVFKQLKLPIFWKIDITNDPVLNAKTLPDNVFIQKWYPQT 61
           S GS+V  + + EE          Q+   + W+ D T        TL  N  + KW PQ 
Sbjct: 282 SLGSMV--SNIPEEKANEIASALAQIPQKVLWRFDGTKPS-----TLGRNTRLVKWLPQN 334

Query: 62  DILAHPNLRLFITHGGISSLM 82
           D+L HP  R F+TH G + + 
Sbjct: 335 DLLGHPKTRAFVTHAGSNGVY 355


>gnl|CDD|223071 PHA03392, egt, ecdysteroid UDP-glucosyltransferase; Provisional.
          Length = 507

 Score =  133 bits (337), Expect = 1e-34
 Identities = 53/199 (26%), Positives = 97/199 (48%), Gaps = 10/199 (5%)

Query: 175 IDARMIDPTKLSEETKLGFLEVFKQLKLPIFWKIDITNDPVLNAKTLPDNVFIQKWYPQT 234
           ID   +D   L        L  FK+L   + WK D      + A  LP NV  QKW+PQ 
Sbjct: 307 IDTNDMDNEFLQM-----LLRTFKKLPYNVLWKYD----GEVEAINLPANVLTQKWFPQR 357

Query: 235 DILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVLLRHRGYALIEPIQTL 294
            +L H N++ F+T GG+ S  EA    VP++G+P  GDQ+ N       G        T+
Sbjct: 358 AVLKHKNVKAFVTQGGVQSTDEAIDALVPMVGLPMMGDQFYNTNKYVELGIGRALDTVTV 417

Query: 295 TKQSFLKNAQTMLNDPSFKQNAKKWASIANDEIVSPLERVVYWTEYVLRHKG-APHLSAS 353
           +    +     ++ +P +++N K+   +   + ++PL + +++TE+V+R+K     L   
Sbjct: 418 SAAQLVLAIVDVIENPKYRKNLKELRHLIRHQPMTPLHKAIWYTEHVIRNKHGNTSLKTK 477

Query: 354 SRQLTWYQMYCIDIILVIL 372
           +  +++   +   I++ ++
Sbjct: 478 AANVSYSDYFMSYILVPLV 496



 Score = 88.5 bits (220), Expect = 5e-19
 Identities = 31/79 (39%), Positives = 42/79 (53%), Gaps = 4/79 (5%)

Query: 2   SFGSVVDPTKLSEETKLGFLEVFKQLKLPIFWKIDITNDPVLNAKTLPDNVFIQKWYPQT 61
           SFGS +D   +  E     L  FK+L   + WK D      + A  LP NV  QKW+PQ 
Sbjct: 302 SFGSSIDTNDMDNEFLQMLLRTFKKLPYNVLWKYD----GEVEAINLPANVLTQKWFPQR 357

Query: 62  DILAHPNLRLFITHGGISS 80
            +L H N++ F+T GG+ S
Sbjct: 358 AVLKHKNVKAFVTQGGVQS 376



 Score = 48.4 bits (116), Expect = 4e-06
 Identities = 24/77 (31%), Positives = 34/77 (44%), Gaps = 5/77 (6%)

Query: 104 FIILLSFSFLGTLSCDKILVFLPLPIWSHQMQYEKVWT-ALAERGHEVTIYTKFTPKSNS 162
            I+LL    L  +   +IL   P P +SH   + KV+  ALAERGH VT+          
Sbjct: 6   IILLLLLLLLSGVRAARILAVFPTPAYSHHSVF-KVYVEALAERGHNVTVIKPTL--RVY 62

Query: 163 TNLKHVPIRLPKIDARM 179
                    + +IDA +
Sbjct: 63  YASHLCG-NITEIDASL 78


>gnl|CDD|224732 COG1819, COG1819, Glycosyl transferases, related to
           UDP-glucuronosyltransferase [Carbohydrate transport and
           metabolism / Signal transduction mechanisms].
          Length = 406

 Score = 82.5 bits (204), Expect = 3e-17
 Identities = 30/99 (30%), Positives = 52/99 (52%), Gaps = 4/99 (4%)

Query: 221 LPDNVFIQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVLL 280
           +PDNV +  + PQ ++L   +    I HGG  +  EA   GVP++ +P   DQ  N   +
Sbjct: 282 VPDNVIVADYVPQLELLPRADA--VIHHGGAGTTSEALYAGVPLVVIPDGADQPLNAERV 339

Query: 281 RHRGYALIEPIQTLTKQSFLKNA-QTMLNDPSFKQNAKK 318
              G  +  P + LT++  L+ A   +L D S+++ A++
Sbjct: 340 EELGAGIALPFEELTEER-LRAAVNEVLADDSYRRAAER 377



 Score = 39.0 bits (91), Expect = 0.003
 Identities = 11/35 (31%), Positives = 18/35 (51%), Gaps = 2/35 (5%)

Query: 48  LPDNVFIQKWYPQTDILAHPNLRLFITHGGISSLM 82
           +PDNV +  + PQ ++L   +    I HGG  +  
Sbjct: 282 VPDNVIVADYVPQLELLPRAD--AVIHHGGAGTTS 314



 Score = 28.5 bits (64), Expect = 6.7
 Identities = 15/80 (18%), Positives = 25/80 (31%), Gaps = 8/80 (10%)

Query: 142 ALAERGHEVTIYT--KFTPKSNST--NLKHVPIR--LPKIDARMIDPTKLSEETKLGFLE 195
            L  RGHEV   +  KF     +        PIR      +       K        F +
Sbjct: 24  ELRRRGHEVVFASTGKFKEFVEAAGLAFVAYPIRDSELATEDGKFAGVKSFRRLLQQFKK 83

Query: 196 VFKQLKLPI--FWKIDITND 213
           + ++L   +       + +D
Sbjct: 84  LIRELLELLRELEPDLVVDD 103


>gnl|CDD|233407 TIGR01426, MGT, glycosyltransferase, MGT family.  This model
           describes the MGT (macroside glycosyltransferase)
           subfamily of the UDP-glucuronosyltransferase family.
           Members include a number of glucosyl transferases for
           macrolide antibiotic inactivation, but also include
           transferases of glucose-related sugars for macrolide
           antibiotic production [Cellular processes, Toxin
           production and resistance].
          Length = 392

 Score = 73.2 bits (180), Expect = 3e-14
 Identities = 33/98 (33%), Positives = 52/98 (53%), Gaps = 2/98 (2%)

Query: 221 LPDNVFIQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVLL 280
           LP NV +++W PQ +IL   +   FITHGG++S MEA   GVP++ VP   DQ      +
Sbjct: 273 LPPNVEVRQWVPQLEILKKADA--FITHGGMNSTMEALFNGVPMVAVPQGADQPMTARRI 330

Query: 281 RHRGYALIEPIQTLTKQSFLKNAQTMLNDPSFKQNAKK 318
              G     P + +T +   +    +L+DP + +  +K
Sbjct: 331 AELGLGRHLPPEEVTAEKLREAVLAVLSDPRYAERLRK 368



 Score = 48.1 bits (115), Expect = 4e-06
 Identities = 17/35 (48%), Positives = 24/35 (68%), Gaps = 2/35 (5%)

Query: 48  LPDNVFIQKWYPQTDILAHPNLRLFITHGGISSLM 82
           LP NV +++W PQ +IL   +   FITHGG++S M
Sbjct: 273 LPPNVEVRQWVPQLEILKKAD--AFITHGGMNSTM 305


>gnl|CDD|99960 cd03784, GT1_Gtf_like, This family includes the Gtfs, a group of
           homologous glycosyltransferases involved in the final
           stages of the biosynthesis of antibiotics vancomycin and
           related chloroeremomycin. Gtfs transfer sugar moieties
           from an activated NDP-sugar donor to the oxidatively
           cross-linked heptapeptide core of vancomycin group
           antibiotics. The core structure is important for the
           bioactivity of the antibiotics.
          Length = 401

 Score = 60.8 bits (148), Expect = 4e-10
 Identities = 30/97 (30%), Positives = 43/97 (44%), Gaps = 2/97 (2%)

Query: 221 LPDNVFIQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVLL 280
           LPDNV +  + P   +L  P     + HGG  +   A   GVP L VPFFGDQ      +
Sbjct: 286 LPDNVRVVDFVPHDWLL--PRCAAVVHHGGAGTTAAALRAGVPQLVVPFFGDQPFWAARV 343

Query: 281 RHRGYALIEPIQTLTKQSFLKNAQTMLNDPSFKQNAK 317
              G       + LT +      + +L+ PS ++ A 
Sbjct: 344 AELGAGPALDPRELTAERLAAALRRLLDPPSRRRAAA 380



 Score = 31.9 bits (73), Expect = 0.54
 Identities = 13/43 (30%), Positives = 19/43 (44%), Gaps = 3/43 (6%)

Query: 142 ALAERGHEVTIYT--KFTPKSNSTNLKHVPIRLPKIDARMIDP 182
           AL   GHEV + T  +F     +  L+ VP+     D  +  P
Sbjct: 23  ALRAAGHEVRVATPPEFADLVEAAGLEFVPVGGD-PDELLASP 64



 Score = 30.0 bits (68), Expect = 2.3
 Identities = 23/84 (27%), Positives = 33/84 (39%), Gaps = 11/84 (13%)

Query: 1   MSFGSVV--DPTKLSEETKLGFLEVFKQLKLPIFWKIDITNDPVLNAKTLPDNVFIQKWY 58
           + FGS+V  DP  L+       +E    L       I       L A+ LPDNV +  + 
Sbjct: 244 VGFGSMVVRDPEALARL----DVEAVATLGQRA---ILSLGWGGLGAEDLPDNVRVVDFV 296

Query: 59  PQTDILAHPNLRLFITHGGISSLM 82
           P   +L  P     + HGG  +  
Sbjct: 297 PHDWLL--PRCAAVVHHGGAGTTA 318


>gnl|CDD|178170 PLN02555, PLN02555, limonoid glucosyltransferase.
          Length = 480

 Score = 58.3 bits (141), Expect = 3e-09
 Identities = 27/54 (50%), Positives = 36/54 (66%)

Query: 227 IQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVLL 280
           I +W PQ  +LAHP++  F+TH G +S MEA S GVPV+  P +GDQ  + V L
Sbjct: 341 IVQWCPQEKVLAHPSVACFVTHCGWNSTMEALSSGVPVVCFPQWGDQVTDAVYL 394



 Score = 35.2 bits (81), Expect = 0.059
 Identities = 14/29 (48%), Positives = 20/29 (68%)

Query: 54  IQKWYPQTDILAHPNLRLFITHGGISSLM 82
           I +W PQ  +LAHP++  F+TH G +S M
Sbjct: 341 IVQWCPQEKVLAHPSVACFVTHCGWNSTM 369


>gnl|CDD|215084 PLN00164, PLN00164, glucosyltransferase; Provisional.
          Length = 480

 Score = 56.2 bits (136), Expect = 1e-08
 Identities = 22/59 (37%), Positives = 34/59 (57%), Gaps = 1/59 (1%)

Query: 229 KWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRN-MVLLRHRGYA 286
            W PQ +ILAH  +  F+TH G +S++E+   GVP+   P + +Q+ N   L+   G A
Sbjct: 345 TWAPQKEILAHAAVGGFVTHCGWNSVLESLWHGVPMAPWPLYAEQHLNAFELVADMGVA 403



 Score = 39.3 bits (92), Expect = 0.003
 Identities = 12/27 (44%), Positives = 18/27 (66%)

Query: 56  KWYPQTDILAHPNLRLFITHGGISSLM 82
            W PQ +ILAH  +  F+TH G +S++
Sbjct: 345 TWAPQKEILAHAAVGGFVTHCGWNSVL 371


>gnl|CDD|177813 PLN02152, PLN02152, indole-3-acetate beta-glucosyltransferase.
          Length = 455

 Score = 52.7 bits (126), Expect = 1e-07
 Identities = 31/103 (30%), Positives = 54/103 (52%), Gaps = 6/103 (5%)

Query: 227 IQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVLLRH---R 283
           I  W  Q ++L H  +  F+TH G SS +E+  LGVPV+  P + DQ  N  LL      
Sbjct: 331 IVSWCSQIEVLRHRAVGCFVTHCGWSSSLESLVLGVPVVAFPMWSDQPANAKLLEEIWKT 390

Query: 284 GYALIEPIQTLTKQSFLKNA-QTMLNDPS--FKQNAKKWASIA 323
           G  + E  + L ++  ++   + ++ + S   +++A+KW  +A
Sbjct: 391 GVRVRENSEGLVERGEIRRCLEAVMEEKSVELRESAEKWKRLA 433


>gnl|CDD|178589 PLN03015, PLN03015, UDP-glucosyl transferase.
          Length = 470

 Score = 52.0 bits (124), Expect = 3e-07
 Identities = 20/56 (35%), Positives = 38/56 (67%)

Query: 225 VFIQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVLL 280
           + + +W PQ +IL+H ++  F++H G SS++E+ + GVP++  P + +Q+ N  LL
Sbjct: 337 LVVTQWAPQVEILSHRSIGGFLSHCGWSSVLESLTKGVPIVAWPLYAEQWMNATLL 392



 Score = 30.8 bits (69), Expect = 1.2
 Identities = 11/31 (35%), Positives = 22/31 (70%)

Query: 52  VFIQKWYPQTDILAHPNLRLFITHGGISSLM 82
           + + +W PQ +IL+H ++  F++H G SS++
Sbjct: 337 LVVTQWAPQVEILSHRSIGGFLSHCGWSSVL 367


>gnl|CDD|178572 PLN02992, PLN02992, coniferyl-alcohol glucosyltransferase.
          Length = 481

 Score = 51.5 bits (123), Expect = 5e-07
 Identities = 23/54 (42%), Positives = 32/54 (59%)

Query: 227 IQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVLL 280
           +  W PQ +ILAH  +  F+TH G SS +E+   GVP++  P F +Q  N  LL
Sbjct: 342 VPSWAPQAEILAHQAVGGFLTHCGWSSTLESVVGGVPMIAWPLFAEQNMNAALL 395



 Score = 32.3 bits (73), Expect = 0.48
 Identities = 13/29 (44%), Positives = 18/29 (62%)

Query: 54  IQKWYPQTDILAHPNLRLFITHGGISSLM 82
           +  W PQ +ILAH  +  F+TH G SS +
Sbjct: 342 VPSWAPQAEILAHQAVGGFLTHCGWSSTL 370


>gnl|CDD|178032 PLN02410, PLN02410, UDP-glucoronosyl/UDP-glucosyl transferase
           family protein.
          Length = 451

 Score = 51.2 bits (122), Expect = 5e-07
 Identities = 24/67 (35%), Positives = 38/67 (56%)

Query: 210 ITNDPVLNAKTLPDNVFIQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPF 269
           I + P   +K +    +I KW PQ ++L+HP +  F +H G +S +E+   GVP++  PF
Sbjct: 311 IESLPKEFSKIISGRGYIVKWAPQKEVLSHPAVGGFWSHCGWNSTLESIGEGVPMICKPF 370

Query: 270 FGDQYRN 276
             DQ  N
Sbjct: 371 SSDQKVN 377



 Score = 35.4 bits (81), Expect = 0.049
 Identities = 18/61 (29%), Positives = 31/61 (50%), Gaps = 4/61 (6%)

Query: 37  ITNDPVLNAKTLPDNVFIQKWYPQTDILAHPNLRLFITHGGISSLMNWNRYGTGGALPNL 96
           I + P   +K +    +I KW PQ ++L+HP +  F +H G +S +       G  +P +
Sbjct: 311 IESLPKEFSKIISGRGYIVKWAPQKEVLSHPAVGGFWSHCGWNSTLE----SIGEGVPMI 366

Query: 97  C 97
           C
Sbjct: 367 C 367


>gnl|CDD|215465 PLN02863, PLN02863, UDP-glucoronosyl/UDP-glucosyl transferase
           family protein.
          Length = 477

 Score = 50.6 bits (121), Expect = 7e-07
 Identities = 23/54 (42%), Positives = 32/54 (59%)

Query: 227 IQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVLL 280
           I+ W PQ  IL+H  +  F+TH G +S++E    GVP+L  P   DQ+ N  LL
Sbjct: 347 IRGWAPQVAILSHRAVGAFLTHCGWNSVLEGLVAGVPMLAWPMAADQFVNASLL 400



 Score = 31.0 bits (70), Expect = 1.1
 Identities = 12/29 (41%), Positives = 19/29 (65%)

Query: 54  IQKWYPQTDILAHPNLRLFITHGGISSLM 82
           I+ W PQ  IL+H  +  F+TH G +S++
Sbjct: 347 IRGWAPQVAILSHRAVGAFLTHCGWNSVL 375


>gnl|CDD|215293 PLN02534, PLN02534, UDP-glycosyltransferase.
          Length = 491

 Score = 48.7 bits (116), Expect = 4e-06
 Identities = 22/56 (39%), Positives = 34/56 (60%)

Query: 225 VFIQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVLL 280
           + I+ W PQ  IL+HP +  F+TH G +S +E    GVP++  P F +Q+ N  L+
Sbjct: 346 LLIKGWAPQVLILSHPAIGGFLTHCGWNSTIEGICSGVPMITWPLFAEQFLNEKLI 401



 Score = 31.4 bits (71), Expect = 1.0
 Identities = 13/29 (44%), Positives = 19/29 (65%)

Query: 52  VFIQKWYPQTDILAHPNLRLFITHGGISS 80
           + I+ W PQ  IL+HP +  F+TH G +S
Sbjct: 346 LLIKGWAPQVLILSHPAIGGFLTHCGWNS 374


>gnl|CDD|177830 PLN02173, PLN02173, UDP-glucosyl transferase family protein.
          Length = 449

 Score = 48.1 bits (114), Expect = 5e-06
 Identities = 35/116 (30%), Positives = 53/116 (45%), Gaps = 22/116 (18%)

Query: 223 DNVFIQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVLL-- 280
           D   + KW PQ  +L++  +  F+TH G +S ME  SLGVP++ +P + DQ  N   +  
Sbjct: 317 DKSLVLKWSPQLQVLSNKAIGCFMTHCGWNSTMEGLSLGVPMVAMPQWTDQPMNAKYIQD 376

Query: 281 -----------RHRGYALIEPIQTLTKQSFLKNAQTMLNDPS--FKQNAKKWASIA 323
                      +  G A  E I+   K       + M  + S   K+NA KW  +A
Sbjct: 377 VWKVGVRVKAEKESGIAKREEIEFSIK-------EVMEGEKSKEMKENAGKWRDLA 425



 Score = 29.2 bits (65), Expect = 4.9
 Identities = 12/33 (36%), Positives = 19/33 (57%)

Query: 50  DNVFIQKWYPQTDILAHPNLRLFITHGGISSLM 82
           D   + KW PQ  +L++  +  F+TH G +S M
Sbjct: 317 DKSLVLKWSPQLQVLSNKAIGCFMTHCGWNSTM 349


>gnl|CDD|178581 PLN03004, PLN03004, UDP-glycosyltransferase.
          Length = 451

 Score = 46.6 bits (110), Expect = 2e-05
 Identities = 20/63 (31%), Positives = 37/63 (58%)

Query: 218 AKTLPDNVFIQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNM 277
           ++T    + ++ W PQ  +L H  +  F+TH G +S++EA   GVP++  P + +Q  N 
Sbjct: 329 SRTEDKGMVVKSWAPQVPVLNHKAVGGFVTHCGWNSILEAVCAGVPMVAWPLYAEQRFNR 388

Query: 278 VLL 280
           V++
Sbjct: 389 VMI 391



 Score = 28.1 bits (62), Expect = 9.5
 Identities = 11/38 (28%), Positives = 22/38 (57%)

Query: 45  AKTLPDNVFIQKWYPQTDILAHPNLRLFITHGGISSLM 82
           ++T    + ++ W PQ  +L H  +  F+TH G +S++
Sbjct: 329 SRTEDKGMVVKSWAPQVPVLNHKAVGGFVTHCGWNSIL 366


>gnl|CDD|177857 PLN02207, PLN02207, UDP-glycosyltransferase.
          Length = 468

 Score = 45.8 bits (108), Expect = 3e-05
 Identities = 27/76 (35%), Positives = 43/76 (56%), Gaps = 5/76 (6%)

Query: 209 DITNDPVLNAKTLPDNV----FIQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPV 264
           ++TND +L    L D V     I  W PQ +ILAH  +  F++H G +S++E+   GVP+
Sbjct: 315 EVTNDDLLPEGFL-DRVSGRGMICGWSPQVEILAHKAVGGFVSHCGWNSIVESLWFGVPI 373

Query: 265 LGVPFFGDQYRNMVLL 280
           +  P + +Q  N  L+
Sbjct: 374 VTWPMYAEQQLNAFLM 389



 Score = 29.6 bits (66), Expect = 3.5
 Identities = 19/51 (37%), Positives = 29/51 (56%), Gaps = 5/51 (9%)

Query: 36  DITNDPVLNAKTLPDNV----FIQKWYPQTDILAHPNLRLFITHGGISSLM 82
           ++TND +L    L D V     I  W PQ +ILAH  +  F++H G +S++
Sbjct: 315 EVTNDDLLPEGFL-DRVSGRGMICGWSPQVEILAHKAVGGFVSHCGWNSIV 364


>gnl|CDD|215127 PLN02210, PLN02210, UDP-glucosyl transferase.
          Length = 456

 Score = 45.0 bits (106), Expect = 4e-05
 Identities = 28/88 (31%), Positives = 39/88 (44%), Gaps = 4/88 (4%)

Query: 197 FKQLKLPIFWKI----DITNDPVLNAKTLPDNVFIQKWYPQTDILAHPNLRLFITHGGIS 252
            K   +P  W I       N  VL          + +W PQ  IL+H  +  F+TH G +
Sbjct: 294 LKNRGVPFLWVIRPKEKAQNVQVLQEMVKEGQGVVLEWSPQEKILSHMAISCFVTHCGWN 353

Query: 253 SLMEASSLGVPVLGVPFFGDQYRNMVLL 280
           S +E    GVPV+  P + DQ  +  LL
Sbjct: 354 STIETVVAGVPVVAYPSWTDQPIDARLL 381



 Score = 34.2 bits (78), Expect = 0.11
 Identities = 24/86 (27%), Positives = 38/86 (44%), Gaps = 7/86 (8%)

Query: 1   MSFGSVVDPTKLSEETKLGFLEVFKQLKLPIFWKI----DITNDPVLNAKTLPDNVFIQK 56
           +SFGS+++  +   ET     +  K   +P  W I       N  VL          + +
Sbjct: 274 ISFGSMLESLENQVET---IAKALKNRGVPFLWVIRPKEKAQNVQVLQEMVKEGQGVVLE 330

Query: 57  WYPQTDILAHPNLRLFITHGGISSLM 82
           W PQ  IL+H  +  F+TH G +S +
Sbjct: 331 WSPQEKILSHMAISCFVTHCGWNSTI 356


>gnl|CDD|215304 PLN02554, PLN02554, UDP-glycosyltransferase family protein.
          Length = 481

 Score = 44.8 bits (106), Expect = 6e-05
 Identities = 20/52 (38%), Positives = 30/52 (57%), Gaps = 3/52 (5%)

Query: 230 WYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRN---MV 278
           W PQ  +LA P +  F+TH G +S++E+   GVP+   P + +Q  N   MV
Sbjct: 349 WAPQVAVLAKPAIGGFVTHCGWNSILESLWFGVPMAAWPLYAEQKFNAFEMV 400



 Score = 32.5 bits (74), Expect = 0.43
 Identities = 11/26 (42%), Positives = 17/26 (65%)

Query: 57  WYPQTDILAHPNLRLFITHGGISSLM 82
           W PQ  +LA P +  F+TH G +S++
Sbjct: 349 WAPQVAVLAKPAIGGFVTHCGWNSIL 374


>gnl|CDD|215112 PLN02167, PLN02167, UDP-glycosyltransferase family protein.
          Length = 475

 Score = 43.6 bits (103), Expect = 1e-04
 Identities = 18/47 (38%), Positives = 29/47 (61%)

Query: 230 WYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRN 276
           W PQ +ILAH  +  F++H G +S++E+   GVP+   P + +Q  N
Sbjct: 347 WAPQVEILAHKAIGGFVSHCGWNSVLESLWFGVPIATWPMYAEQQLN 393



 Score = 32.1 bits (73), Expect = 0.48
 Identities = 12/32 (37%), Positives = 20/32 (62%)

Query: 57  WYPQTDILAHPNLRLFITHGGISSLMNWNRYG 88
           W PQ +ILAH  +  F++H G +S++    +G
Sbjct: 347 WAPQVEILAHKAIGGFVSHCGWNSVLESLWFG 378


>gnl|CDD|217893 pfam04101, Glyco_tran_28_C, Glycosyltransferase family 28
           C-terminal domain.  The glycosyltransferase family 28
           includes monogalactosyldiacylglycerol synthase (EC
           2.4.1.46) and UDP-N-acetylglucosamine transferase (EC
           2.4.1.-). Structural analysis suggests the C-terminal
           domain contains the UDP-GlcNAc binding site.
          Length = 167

 Score = 41.9 bits (99), Expect = 1e-04
 Identities = 21/85 (24%), Positives = 33/85 (38%), Gaps = 5/85 (5%)

Query: 244 LFITHGGISSLMEASSLGVPVLGVP----FFGDQYRNMVLLRHRGYALIEPIQTLTKQSF 299
           L I+  G  ++ E  +LG P + VP        Q  N + L   G AL+   + LT +  
Sbjct: 75  LVISRAGAGTIAELLALGKPAILVPRPKAAGEHQDNNALELVKAGAALVLLQKELTPEK- 133

Query: 300 LKNAQTMLNDPSFKQNAKKWASIAN 324
           L  A   L     +      A+  +
Sbjct: 134 LVEALLKLLLKPLRLYEMNKAAKGS 158


>gnl|CDD|178584 PLN03007, PLN03007, UDP-glucosyltransferase family protein.
          Length = 482

 Score = 42.2 bits (99), Expect = 4e-04
 Identities = 23/84 (27%), Positives = 39/84 (46%), Gaps = 9/84 (10%)

Query: 206 WKIDITNDPVLNAKTLPDN---------VFIQKWYPQTDILAHPNLRLFITHGGISSLME 256
           W +    +     + LP+          + I+ W PQ  IL H     F+TH G +SL+E
Sbjct: 319 WVVRKNENQGEKEEWLPEGFEERTKGKGLIIRGWAPQVLILDHQATGGFVTHCGWNSLLE 378

Query: 257 ASSLGVPVLGVPFFGDQYRNMVLL 280
             + G+P++  P   +Q+ N  L+
Sbjct: 379 GVAAGLPMVTWPVGAEQFYNEKLV 402



 Score = 30.6 bits (69), Expect = 1.7
 Identities = 19/71 (26%), Positives = 28/71 (39%), Gaps = 13/71 (18%)

Query: 33  WKIDITNDPVLNAKTLPDN---------VFIQKWYPQTDILAHPNLRLFITHGGISSLMN 83
           W +    +     + LP+          + I+ W PQ  IL H     F+TH G +SL+ 
Sbjct: 319 WVVRKNENQGEKEEWLPEGFEERTKGKGLIIRGWAPQVLILDHQATGGFVTHCGWNSLLE 378

Query: 84  WNRYGTGGALP 94
               G    LP
Sbjct: 379 ----GVAAGLP 385


>gnl|CDD|215247 PLN02448, PLN02448, UDP-glycosyltransferase family protein.
          Length = 459

 Score = 41.5 bits (98), Expect = 5e-04
 Identities = 20/47 (42%), Positives = 26/47 (55%)

Query: 230 WYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRN 276
           W  Q  +L H ++  F TH G +S +EA   GVP+L  P F DQ  N
Sbjct: 330 WCDQLKVLCHSSVGGFWTHCGWNSTLEAVFAGVPMLTFPLFWDQPLN 376


>gnl|CDD|223779 COG0707, MurG, UDP-N-acetylglucosamine:LPS N-acetylglucosamine
           transferase [Cell envelope biogenesis, outer membrane].
          Length = 357

 Score = 40.0 bits (94), Expect = 0.002
 Identities = 25/83 (30%), Positives = 39/83 (46%), Gaps = 8/83 (9%)

Query: 244 LFITHGGISSLMEASSLGVPVLGVPF----FGDQYRNMVLLRHRGYALIEPIQTLTKQSF 299
           L I+  G  ++ E  +LGVP + VP+     G Q  N   L   G AL+     LT +  
Sbjct: 255 LVISRAGALTIAELLALGVPAILVPYPPGADGHQEYNAKFLEKAGAALVIRQSELTPEKL 314

Query: 300 LKNAQTMLNDP----SFKQNAKK 318
            +    +L++P    +  +NAKK
Sbjct: 315 AELILRLLSNPEKLKAMAENAKK 337


>gnl|CDD|178275 PLN02670, PLN02670, transferase, transferring glycosyl groups.
          Length = 472

 Score = 39.5 bits (92), Expect = 0.002
 Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 2/59 (3%)

Query: 230 WYPQTDILAHPNLRLFITHGGISSLMEASSLG-VPVLGVPFFGDQYRNMVLLRHRGYAL 287
           W PQ  IL+H ++  F+TH G +S++E    G V +L  P   +Q  N  LL  +   L
Sbjct: 346 WVPQVKILSHESVGGFLTHCGWNSVVEGLGFGRVLIL-FPVLNEQGLNTRLLHGKKLGL 403



 Score = 28.3 bits (63), Expect = 8.5
 Identities = 11/26 (42%), Positives = 18/26 (69%)

Query: 57  WYPQTDILAHPNLRLFITHGGISSLM 82
           W PQ  IL+H ++  F+TH G +S++
Sbjct: 346 WVPQVKILSHESVGGFLTHCGWNSVV 371


>gnl|CDD|234825 PRK00726, murG, undecaprenyldiphospho-muramoylpentapeptide beta-N-
           acetylglucosaminyltransferase; Provisional.
          Length = 357

 Score = 38.6 bits (91), Expect = 0.005
 Identities = 23/94 (24%), Positives = 39/94 (41%), Gaps = 9/94 (9%)

Query: 244 LFITHGGISSLMEASSLGVPVLGVPF----FGDQYRNMVLLRHRGYALIEPIQTLTKQSF 299
           L I   G S++ E ++ G+P + VP        Q  N   L   G AL+ P   LT +  
Sbjct: 255 LVICRAGASTVAELAAAGLPAILVPLPHAADDHQTANARALVDAGAALLIPQSDLTPEKL 314

Query: 300 LKNAQTMLNDP----SFKQNAKKWASI-ANDEIV 328
            +    +L+DP    +  + A+      A + + 
Sbjct: 315 AEKLLELLSDPERLEAMAEAARALGKPDAAERLA 348


>gnl|CDD|215305 PLN02562, PLN02562, UDP-glycosyltransferase.
          Length = 448

 Score = 38.7 bits (90), Expect = 0.005
 Identities = 18/50 (36%), Positives = 27/50 (54%)

Query: 227 IQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRN 276
           +  W PQ ++L H  +  ++TH G +S MEA      +L  P  GDQ+ N
Sbjct: 332 VVSWAPQLEVLKHQAVGCYLTHCGWNSTMEAIQCQKRLLCYPVAGDQFVN 381


>gnl|CDD|178364 PLN02764, PLN02764, glycosyltransferase family protein.
          Length = 453

 Score = 38.1 bits (88), Expect = 0.007
 Identities = 28/99 (28%), Positives = 44/99 (44%), Gaps = 9/99 (9%)

Query: 230 WYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVLLRHRGYALIE 289
           W  Q  IL+HP++  F++H G  S+ E+      ++ VP  GDQ  N  LL       +E
Sbjct: 324 WVQQPLILSHPSVGCFVSHCGFGSMWESLLSDCQIVLVPQLGDQVLNTRLLSDELKVSVE 383

Query: 290 PIQTLT----KQSFLKNAQTMLNDPS-----FKQNAKKW 319
             +  T    K+S      +++   S      K+N  KW
Sbjct: 384 VAREETGWFSKESLRDAINSVMKRDSEIGNLVKKNHTKW 422


>gnl|CDD|233077 TIGR00661, MJ1255, conserved hypothetical protein.  This model
           represents nearly the full length of MJ1255 from
           Methanococcus jannaschii and of an unpublished protein
           from Vibrio cholerae, as well as the C-terminal half of
           a protein from Methanobacterium thermoautotrophicum. A
           small region (~50 amino acids) within the domain appears
           related to a family of sugar transferases [Hypothetical
           proteins, Conserved].
          Length = 321

 Score = 37.5 bits (87), Expect = 0.009
 Identities = 21/48 (43%), Positives = 26/48 (54%), Gaps = 2/48 (4%)

Query: 241 NLRLFITHGGISSLMEASSLGVPVLGVPFFG--DQYRNMVLLRHRGYA 286
           N  L ITHGG S + EA SLG P++ +P  G  +Q  N V L   G  
Sbjct: 247 NAELVITHGGFSLISEALSLGKPLIVIPDLGQFEQGNNAVKLEDLGCG 294


>gnl|CDD|99961 cd03785, GT1_MurG, MurG is an N-acetylglucosaminyltransferase, the
           last enzyme involved in the intracellular phase of
           peptidoglycan biosynthesis. It transfers
           N-acetyl-D-glucosamine (GlcNAc) from UDP-GlcNAc to the
           C4 hydroxyl of a lipid-linked N-acetylmuramoyl
           pentapeptide (NAM). The resulting disaccharide is then
           transported across the cell membrane, where it is
           polymerized into NAG-NAM cell-wall repeat structure.
           MurG belongs to the GT-B structural superfamily of
           glycoslytransferases, which have characteristic N- and
           C-terminal domains, each containing a typical Rossmann
           fold. The two domains have high structural homology
           despite minimal sequence homology.  The large cleft that
           separates the two domains includes the catalytic center
           and permits a high degree of flexibility.
          Length = 350

 Score = 36.3 bits (85), Expect = 0.020
 Identities = 21/83 (25%), Positives = 38/83 (45%), Gaps = 8/83 (9%)

Query: 244 LFITHGGISSLMEASSLGVPVLGVPF----FGDQYRNMVLLRHRGYALIEPIQTLTKQSF 299
           L I+  G S++ E ++LG+P + +P        Q  N   L   G A++ P + LT +  
Sbjct: 255 LVISRAGASTVAELAALGLPAILIPLPYAADDHQTANARALVKAGAAVLIPQEELTPERL 314

Query: 300 LKNAQTMLNDP----SFKQNAKK 318
                 +L+DP    +  + A+ 
Sbjct: 315 AAALLELLSDPERLKAMAEAARS 337


>gnl|CDD|177858 PLN02208, PLN02208, glycosyltransferase family protein.
          Length = 442

 Score = 33.8 bits (77), Expect = 0.14
 Identities = 14/44 (31%), Positives = 23/44 (52%)

Query: 230 WYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQ 273
           W  Q  IL HP++  F+ H G  ++ E+      ++ +PF  DQ
Sbjct: 318 WVQQPLILDHPSIGCFVNHCGPGTIWESLVSDCQMVLIPFLSDQ 361


>gnl|CDD|222200 pfam13528, Glyco_trans_1_3, Glycosyl transferase family 1. 
          Length = 317

 Score = 33.8 bits (78), Expect = 0.14
 Identities = 20/67 (29%), Positives = 29/67 (43%), Gaps = 5/67 (7%)

Query: 243 RLFITHGGISSLMEASSLGVPVLGVPFFG--DQYRNMVLLRHRGYALIEPIQTLTKQ--- 297
              I + G   L EA  LG P+L VP  G  +Q  N + L   GY ++  ++ L      
Sbjct: 251 SAVICNAGFELLSEALYLGKPLLLVPLDGQFEQTSNALYLERLGYGIVMDMEDLDPAVLR 310

Query: 298 SFLKNAQ 304
            FL+   
Sbjct: 311 RFLERLP 317


>gnl|CDD|215979 pfam00534, Glycos_transf_1, Glycosyl transferases group 1.
           Mutations in this domain of human PIGA lead to disease
           (Paroxysmal Nocturnal haemoglobinuria). Members of this
           family transfer activated sugars to a variety of
           substrates, including glycogen, Fructose-6-phosphate and
           lipopolysaccharides. Members of this family transfer
           UDP, ADP, GDP or CMP linked sugars. The eukaryotic
           glycogen synthases may be distant members of this
           family.
          Length = 158

 Score = 31.1 bits (71), Expect = 0.51
 Identities = 22/112 (19%), Positives = 41/112 (36%), Gaps = 23/112 (20%)

Query: 221 LPDNVFIQ---------KWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFG 271
           L DNV            + Y   D+   P+        G+  L+EA + GVPV+     G
Sbjct: 57  LEDNVIFVGFVPDEDLIELYRIADLFVLPSRY---EGFGLV-LLEAMAAGVPVIATDVGG 112

Query: 272 DQYRNMVLLRHRGYALIEPIQTLTKQSFLKNAQTMLNDPS----FKQNAKKW 319
                  +++     L+  +     ++  +  + +L D        +NA+K 
Sbjct: 113 ----PAEIVKDGETGLL--VDPGDAEALAEAIEKLLKDEELRERLGENARKR 158


>gnl|CDD|177807 PLN00414, PLN00414, glycosyltransferase family protein.
          Length = 446

 Score = 31.5 bits (71), Expect = 0.81
 Identities = 14/44 (31%), Positives = 23/44 (52%)

Query: 230 WYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQ 273
           W  Q  IL+HP++  F+ H G  S+ E+      ++ +P   DQ
Sbjct: 319 WVEQPLILSHPSVGCFVNHCGFGSMWESLVSDCQIVFIPQLADQ 362



 Score = 28.8 bits (64), Expect = 6.0
 Identities = 36/115 (31%), Positives = 50/115 (43%), Gaps = 19/115 (16%)

Query: 130 WSHQMQYEKVWTALAERGHEVTIYTKFTPKSNSTNLKHVPIRL-PKIDARMIDPTKLSEE 188
           + H + Y  +   LAE+GH VT    F PK     L+  P+ L P  D+ + +P  L   
Sbjct: 15  FGHMIPYLHLANKLAEKGHRVTF---FLPKKAHKQLQ--PLNLFP--DSIVFEPLTLPPV 67

Query: 189 TKLGF-LEVFKQL----KLPIFWKIDITNDPVLNAKTL---PDNVFIQ--KWYPQ 233
             L F  E    L    K PIF  +D+  D +  AK     PD +F     W P+
Sbjct: 68  DGLPFGAETASDLPNSTKKPIFDAMDLLRDQI-EAKVRALKPDLIFFDFVHWVPE 121


>gnl|CDD|99977 cd03805, GT1_ALG2_like, This family is most closely related to the
           GT1 family of glycosyltransferases.  ALG2, a
           1,3-mannosyltransferase, in yeast catalyzes the
           mannosylation of Man(2)GlcNAc(2)-dolichol diphosphate
           and Man(1)GlcNAc(2)-dolichol diphosphate to form
           Man(3)GlcNAc(2)-dolichol diphosphate. A deficiency of
           this enzyme causes an abnormal accumulation of
           Man1GlcNAc2-PP-dolichol and Man2GlcNAc2-PP-dolichol,
           which is associated with a type of congenital disorders
           of glycosylation (CDG), designated CDG-Ii, in humans.
          Length = 392

 Score = 30.6 bits (70), Expect = 1.5
 Identities = 24/69 (34%), Positives = 27/69 (39%), Gaps = 26/69 (37%)

Query: 142 ALAERGHEVTIYTKFTPKSNSTNLKHVPIRLPKIDARMIDPTKLSEETKLGFLEVF-KQL 200
           AL  RGHEVTIYT                          DP+   EETK G L V  +  
Sbjct: 25  ALQSRGHEVTIYTS-----------------------HHDPSHCFEETKDGTLPVRVRGD 61

Query: 201 KLP--IFWK 207
            LP  IF +
Sbjct: 62  WLPRSIFGR 70


>gnl|CDD|238656 cd01346, Maltoporin-like, The Maltoporin-like channels (LamB porin)
           form a trimeric structure which facilitate the diffusion
           of maltodextrins and other sugars across the outer
           membrane of Gram-negative bacteria. The membrane channel
           is formed by an 18-strand antiparallel beta-barrel
           (18,22). Loop 3 folds into the core to constrict pore
           size. Long irregular loops are found on the
           extracelllular side, while short turns are in the
           periplasm.Tightly-bound water molecules are found in the
           eyelet of the passage, and only substrates that can
           displace and replace the broken hydrogen bonds are
           likely to enter the pore.  In the MPR structure, loops
           4,6, and 9 have the greatest mobility and are highly
           variable; these are postulated to attract maltodextrins.
          Length = 392

 Score = 30.4 bits (69), Expect = 1.6
 Identities = 8/38 (21%), Positives = 13/38 (34%), Gaps = 2/38 (5%)

Query: 55  QKWYPQTDILAHPNLRLFITHGGISSLMNWNRYGTGGA 92
             +   TD  + P LR + T+   +      R  T   
Sbjct: 337 PTFSAGTDFWSRPELRFYATY--SNWNDAALRAFTAFG 372


>gnl|CDD|175987 cd04020, C2B_SLP_1-2-3-4, C2 domain second repeat present in
           Synaptotagmin-like proteins 1-4.  All Slp members
           basically share an N-terminal Slp homology domain (SHD)
           and C-terminal tandem C2 domains (named the C2A domain
           and the C2B domain) with the SHD and C2 domains being
           separated by a linker sequence of various length.
           Slp1/JFC1 and Slp2/exophilin 4 promote granule docking
           to the plasma membrane.  Additionally, their C2A domains
           are both Ca2+ independent, unlike the case in Slp3 and
           Slp4/granuphilin in which their C2A domains are Ca2+
           dependent.  It is thought that SHD (except for the
           Slp4-SHD) functions as a specific Rab27A/B-binding
           domain. In addition to Slps, rabphilin, Noc2, and
           Munc13-4 also function as Rab27-binding proteins. It has
           been demonstrated that Slp3 and Slp4/granuphilin promote
           dense-core vesicle exocytosis. C2 domains fold into an
           8-standed beta-sandwich that can adopt 2 structural
           arrangements: Type I and Type II, distinguished by a
           circular permutation involving their N- and C-terminal
           beta strands. Many C2 domains are Ca2+-dependent
           membrane-targeting modules that bind a wide variety of
           substances including bind phospholipids, inositol
           polyphosphates, and intracellular proteins.  Most C2
           domain proteins are either signal transduction enzymes
           that contain a single C2 domain, such as protein kinase
           C, or membrane trafficking proteins which contain at
           least two C2 domains, such as synaptotagmin 1.  However,
           there are a few exceptions to this including RIM
           isoforms and some splice variants of piccolo/aczonin and
           intersectin which only have a single C2 domain.  C2
           domains with a calcium binding region have negatively
           charged residues, primarily aspartates, that serve as
           ligands for calcium ions.   This cd contains the second
           C2 repeat, C2B, and has a type-I topology.
          Length = 162

 Score = 29.6 bits (67), Expect = 1.7
 Identities = 10/35 (28%), Positives = 18/35 (51%), Gaps = 1/35 (2%)

Query: 128 PIWSHQMQYEKVWTA-LAERGHEVTIYTKFTPKSN 161
           P+W+H   Y+ V    L++   E+T++      SN
Sbjct: 77  PVWNHTFVYDGVSPEDLSQACLELTVWDHDKLSSN 111


>gnl|CDD|235226 PRK04148, PRK04148, hypothetical protein; Provisional.
          Length = 134

 Score = 29.2 bits (66), Expect = 2.1
 Identities = 32/115 (27%), Positives = 50/115 (43%), Gaps = 22/115 (19%)

Query: 188 ETKLGF-LEVFKQLKLPIF--WKIDITNDPVLNAKTLPDNVFIQKWYPQTDILAHPNLRL 244
           E  +GF  +V K+LK   F    IDI    V  AK L  N F+       D L +PNL +
Sbjct: 22  ELGIGFYFKVAKKLKESGFDVIVIDINEKAVEKAKKLGLNAFV-------DDLFNPNLEI 74

Query: 245 FITHGGISS------LME-----ASSLGVPVLGVPFFGDQ-YRNMVLLRHRGYAL 287
           +     I S      L       A  + VP++  P  G++  + + L+ ++G  +
Sbjct: 75  YKNAKLIYSIRPPRDLQPFILELAKKINVPLIIKPLSGEEPIKELKLINYKGKPI 129


>gnl|CDD|99973 cd03800, GT1_Sucrose_synthase, This family is most closely related
           to the GT1 family of glycosyltransferases. The
           sucrose-phosphate synthases in this family may be unique
           to plants and photosynthetic bacteria. This enzyme
           catalyzes the synthesis of sucrose 6-phosphate from
           fructose 6-phosphate and uridine 5'-diphosphate-glucose,
           a key regulatory step of sucrose metabolism. The
           activity of this enzyme is regulated by phosphorylation
           and moderated by the concentration of various
           metabolites and light.
          Length = 398

 Score = 30.2 bits (69), Expect = 2.1
 Identities = 9/13 (69%), Positives = 10/13 (76%)

Query: 142 ALAERGHEVTIYT 154
           ALA  GHEV I+T
Sbjct: 33  ALARLGHEVDIFT 45


>gnl|CDD|99972 cd03799, GT1_amsK_like, This is a family of GT1
           glycosyltransferases found specifically in certain
           bacteria. amsK in Erwinia amylovora, has been reported
           to be involved in the biosynthesis of amylovoran, a
           exopolysaccharide acting as a virulence factor.
          Length = 355

 Score = 30.2 bits (69), Expect = 2.1
 Identities = 12/52 (23%), Positives = 22/52 (42%), Gaps = 13/52 (25%)

Query: 227 IQKWYPQTDILAHPNLRLFITHGGIS-----SLMEASSLGVPVL-----GVP 268
           +++     D+   P+        G        LMEA ++G+PV+     G+P
Sbjct: 249 VRELLRAADLFVLPS---VTAADGDREGLPVVLMEAMAMGLPVISTDVSGIP 297


>gnl|CDD|233284 TIGR01133, murG, undecaprenyldiphospho-muramoylpentapeptide
           beta-N-acetylglucosaminyltransferase.  RM 8449890 RT The
           final step of peptidoglycan subunit assembly in
           Escherichia coli occurs in the cytoplasm. RA Bupp K, van
           Heijenoort J. RL J Bacteriol 1993 Mar;175(6):1841-3
           [Cell envelope, Biosynthesis and degradation of murein
           sacculus and peptidoglycan].
          Length = 348

 Score = 29.9 bits (68), Expect = 2.3
 Identities = 16/48 (33%), Positives = 25/48 (52%), Gaps = 3/48 (6%)

Query: 244 LFITHGGISSLMEASSLGVPVLGVPF---FGDQYRNMVLLRHRGYALI 288
           L I+  G S++ E ++ GVP + +P+     DQY N   L   G  L+
Sbjct: 253 LVISRAGASTVAELAAAGVPAILIPYPYAADDQYYNAKFLEDLGAGLV 300


>gnl|CDD|99975 cd03802, GT1_AviGT4_like, This family is most closely related to
           the GT1 family of glycosyltransferases. aviGT4 in
           Streptomyces viridochromogenes has been shown to be
           involved in biosynthesis of oligosaccharide antibiotic
           avilamycin A. Inactivation of aviGT4 resulted in a
           mutant that accumulated a novel avilamycin derivative
           lacking the terminal eurekanate residue.
          Length = 335

 Score = 29.5 bits (67), Expect = 2.9
 Identities = 12/35 (34%), Positives = 15/35 (42%), Gaps = 6/35 (17%)

Query: 139 VWT---ALAERGHEVTIYTKFTPKSNSTNLKHVPI 170
           V      L  RGHEVT+   F    + T    VP+
Sbjct: 25  VAALTEGLVARGHEVTL---FASGDSKTAAPLVPV 56


>gnl|CDD|226807 COG4370, COG4370, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 412

 Score = 29.8 bits (67), Expect = 3.0
 Identities = 28/111 (25%), Positives = 40/111 (36%), Gaps = 27/111 (24%)

Query: 199 QLKLPIFWKIDITNDPVLNAKTLPDNVFIQKWYPQTDILAHPNLRLFITHGGISSLM--- 255
              L +FW       P+L   TL +    Q W P  D     N  L+++    + ++   
Sbjct: 256 MFALVVFWAAIAPELPLLLLWTLEER---QGWQPLADRFGKDNCSLWLSQQSFADILHAA 312

Query: 256 ------------EASSLGVPVLGVPFFGDQY------RNMVLLRHRGYALI 288
                       +A  LG PV+G P  G QY      R   LL   G +L 
Sbjct: 313 DAALGMAGTATEQAVGLGKPVIGFPGQGPQYNPGFAERQQRLL---GASLT 360


>gnl|CDD|99974 cd03801, GT1_YqgM_like, This family is most closely related to the
           GT1 family of glycosyltransferases and named after YqgM
           in Bacillus licheniformis about which little is known.
           Glycosyltransferases catalyze the transfer of sugar
           moieties from activated donor molecules to specific
           acceptor molecules, forming glycosidic bonds. The
           acceptor molecule can be a lipid, a protein, a
           heterocyclic compound, or another carbohydrate residue.
           This group of glycosyltransferases is most closely
           related to the previously defined glycosyltransferase
           family 1 (GT1). The members of this family may transfer
           UDP, ADP, GDP, or CMP linked sugars. The diverse
           enzymatic activities among members of this family
           reflect a wide range of biological functions. The
           protein structure available for this family has the GTB
           topology, one of the two protein topologies observed for
           nucleotide-sugar-dependent glycosyltransferases. GTB
           proteins have distinct N- and C- terminal domains each
           containing a typical Rossmann fold. The two domains have
           high structural homology despite minimal sequence
           homology. The large cleft that separates the two domains
           includes the catalytic center and permits a high degree
           of flexibility. The members of this family are found
           mainly in certain bacteria and archaea.
          Length = 374

 Score = 28.9 bits (65), Expect = 4.8
 Identities = 14/69 (20%), Positives = 22/69 (31%), Gaps = 6/69 (8%)

Query: 142 ALAERGHEVTIYTKFTPKSNSTNLKHVPIRLPKIDARMIDPTKLSEETKLGFLEVFKQLK 201
           ALA RGHEVT+ T               + +       +          L  L   +  +
Sbjct: 26  ALAARGHEVTVLTPGDGGLPDEEEVGGIVVVRPPPLLRVRR------LLLLLLLALRLRR 79

Query: 202 LPIFWKIDI 210
           L    + D+
Sbjct: 80  LLRRERFDV 88


>gnl|CDD|222237 pfam13579, Glyco_trans_4_4, Glycosyl transferase 4-like domain. 
          Length = 158

 Score = 28.2 bits (63), Expect = 4.8
 Identities = 10/13 (76%), Positives = 11/13 (84%)

Query: 142 ALAERGHEVTIYT 154
           ALA RGHEVT+ T
Sbjct: 13  ALAARGHEVTVVT 25


>gnl|CDD|238458 cd00916, Npc2_like, Niemann-Pick type C2 (Npc2) is a lysosomal
           protein in which a mutation in the gene causes a rare
           form of Niemann-Pick type C disease, an autosomal
           recessive lipid storage disorder characterized by
           accumulation of low-density lipoprotein-derived
           cholesterol in lysosomes. Although Npc2 is known to bind
           cholesterol, the function of this protein is unknown.
           These proteins belong to the ML domain family. .
          Length = 123

 Score = 27.3 bits (61), Expect = 6.7
 Identities = 9/23 (39%), Positives = 12/23 (52%)

Query: 146 RGHEVTIYTKFTPKSNSTNLKHV 168
           RG    +   FTP  +ST+LK  
Sbjct: 28  RGSTAKVSIDFTPNFDSTSLKTE 50


>gnl|CDD|130050 TIGR00977, LeuA_rel, 2-isopropylmalate synthase/homocitrate
           synthase family protein.  This model represents
           uncharacterized proteins related to 2-isopropylmalate
           synthases and homocitrate synthases but phylogenetically
           distint. Each species represented in the seed alignment
           also has a member of a known family of 2-isopropylmalate
           synthases [Unknown function, General].
          Length = 526

 Score = 28.7 bits (64), Expect = 7.0
 Identities = 18/53 (33%), Positives = 24/53 (45%), Gaps = 4/53 (7%)

Query: 313 KQNAKKWASIAN--DEIVS--PLERVVYWTEYVLRHKGAPHLSASSRQLTWYQ 361
            +N KK  S A    EIV+  P + + Y       HKG  H+SA  R    Y+
Sbjct: 261 PENLKKLTSTARLVAEIVNLPPDDNMPYVGRSAFAHKGGVHVSAVQRNPFTYE 313


>gnl|CDD|99980 cd03808, GT1_cap1E_like, This family is most closely related to the
           GT1 family of glycosyltransferases. cap1E in
           Streptococcus pneumoniae is required for the synthesis
           of type 1 capsular polysaccharides.
          Length = 359

 Score = 28.3 bits (64), Expect = 8.6
 Identities = 10/39 (25%), Positives = 18/39 (46%), Gaps = 6/39 (15%)

Query: 227 IQKWYPQTDILAHPNLRLFITHGGIS-SLMEASSLGVPV 264
           + +     D+   P+ R      G+   L+EA ++G PV
Sbjct: 257 VPELLAAADVFVLPSYR-----EGLPRVLLEAMAMGRPV 290


>gnl|CDD|239055 cd02140, Nitroreductase_4, Nitroreductase-like family 4.  A
           subfamily of the nitroreductase family containing
           uncharacterized proteins that are similar to
           nitroreductase. Nitroreductase catalyzes the reduction
           of nitroaromatic compounds such as nitrotoluenes,
           nitrofurans and nitroimidazoles. This process requires
           NAD(P)H as electron donor in an obligatory two-electron
           transfer and uses FMN as cofactor.  The enzyme is
           typically a homodimer. Members of this family are also
           called NADH dehydrogenase, oxygen-insensitive NAD(P)H
           nitrogenase or dihydropteridine reductase.
          Length = 192

 Score = 27.6 bits (62), Expect = 8.7
 Identities = 12/24 (50%), Positives = 13/24 (54%), Gaps = 3/24 (12%)

Query: 128 PIWSHQ---MQYEKVWTALAERGH 148
           P+WS Q   M    VWTALA  G 
Sbjct: 116 PVWSEQSSGMAQIAVWTALAAEGI 139


>gnl|CDD|217329 pfam03033, Glyco_transf_28, Glycosyltransferase family 28
           N-terminal domain.  The glycosyltransferase family 28
           includes monogalactosyldiacylglycerol synthase (EC
           2.4.1.46) and UDP-N-acetylglucosamine transferase (EC
           2.4.1.-). This N-terminal domain contains the acceptor
           binding site and likely membrane association site. This
           family also contains a large number of proteins that
           probably have quite distinct activities.
          Length = 136

 Score = 27.3 bits (61), Expect = 8.8
 Identities = 14/68 (20%), Positives = 20/68 (29%), Gaps = 12/68 (17%)

Query: 142 ALAERGHEVTIYTK--FTPKSNSTNLKHVPIRLPKIDARMIDPTKLSEETKLGFLEVFKQ 199
           AL  RGHEV + T            L  VPI    +  + +              E  + 
Sbjct: 21  ALRRRGHEVRLGTPPGLEEFVEEAGLPFVPIGGDGLRRKSLK----------NLKEPLEG 70

Query: 200 LKLPIFWK 207
            +     K
Sbjct: 71  GRALRQAK 78


>gnl|CDD|238885 cd01902, Ntn_CGH, Choloylglycine hydrolase (CGH) is a bile
           salt-modifying enzyme that hydrolyzes non-peptide
           carbon-nitrogen bonds in choloylglycine and
           choloyltaurine, both of which are present in bile.  CGH
           is present in a number of probiotic microbial organisms
           that inhabit the gut.  CGH has an N-terminal
           nucleophilic cysteine, as do other members of the Ntn
           hydrolase family to which CGH belongs.
          Length = 291

 Score = 27.7 bits (62), Expect = 9.9
 Identities = 9/34 (26%), Positives = 13/34 (38%), Gaps = 3/34 (8%)

Query: 301 KNAQTMLNDPSFKQ---NAKKWASIANDEIVSPL 331
           K  Q M N P++ Q     K W    +    + L
Sbjct: 167 KQYQVMTNSPTYDQQLALNKYWKQEKDIGGTTGL 200


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.323    0.138    0.428 

Gapped
Lambda     K      H
   0.267   0.0667    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 20,648,039
Number of extensions: 2012135
Number of successful extensions: 2017
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2005
Number of HSP's successfully gapped: 92
Length of query: 403
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 304
Effective length of database: 6,546,556
Effective search space: 1990153024
Effective search space used: 1990153024
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 60 (27.0 bits)