RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy16938
(403 letters)
>gnl|CDD|201077 pfam00201, UDPGT, UDP-glucoronosyl and UDP-glucosyl transferase.
Length = 500
Score = 157 bits (400), Expect = 2e-43
Identities = 69/167 (41%), Positives = 104/167 (62%)
Query: 220 TLPDNVFIQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVL 279
TL N + KW PQ D+L HP R F+TH G + + EA GVP++G+P FGDQ N
Sbjct: 320 TLGRNTRLVKWLPQNDLLGHPKTRAFVTHAGSNGVYEAICHGVPMVGMPLFGDQMDNAKH 379
Query: 280 LRHRGYALIEPIQTLTKQSFLKNAQTMLNDPSFKQNAKKWASIANDEIVSPLERVVYWTE 339
+ +G A+ + T+T + L +T++NDPS+K+N + +SI +D+ V PL+R V+W E
Sbjct: 380 MEAKGAAVTLNVLTMTSEDLLNALKTVINDPSYKENIMRLSSIHHDQPVKPLDRAVFWIE 439
Query: 340 YVLRHKGAPHLSASSRQLTWYQMYCIDIILVILGILYAVVKLLSMCC 386
+V+RHKGA HL ++ LTWYQ + +D+I +L + V + CC
Sbjct: 440 FVMRHKGAKHLRPAAHDLTWYQYHSLDVIGFLLACVATVAFITFKCC 486
Score = 51.6 bits (124), Expect = 4e-07
Identities = 26/81 (32%), Positives = 37/81 (45%), Gaps = 7/81 (8%)
Query: 2 SFGSVVDPTKLSEETKLGFLEVFKQLKLPIFWKIDITNDPVLNAKTLPDNVFIQKWYPQT 61
S GS+V + + EE Q+ + W+ D T TL N + KW PQ
Sbjct: 282 SLGSMV--SNIPEEKANEIASALAQIPQKVLWRFDGTKPS-----TLGRNTRLVKWLPQN 334
Query: 62 DILAHPNLRLFITHGGISSLM 82
D+L HP R F+TH G + +
Sbjct: 335 DLLGHPKTRAFVTHAGSNGVY 355
>gnl|CDD|223071 PHA03392, egt, ecdysteroid UDP-glucosyltransferase; Provisional.
Length = 507
Score = 133 bits (337), Expect = 1e-34
Identities = 53/199 (26%), Positives = 97/199 (48%), Gaps = 10/199 (5%)
Query: 175 IDARMIDPTKLSEETKLGFLEVFKQLKLPIFWKIDITNDPVLNAKTLPDNVFIQKWYPQT 234
ID +D L L FK+L + WK D + A LP NV QKW+PQ
Sbjct: 307 IDTNDMDNEFLQM-----LLRTFKKLPYNVLWKYD----GEVEAINLPANVLTQKWFPQR 357
Query: 235 DILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVLLRHRGYALIEPIQTL 294
+L H N++ F+T GG+ S EA VP++G+P GDQ+ N G T+
Sbjct: 358 AVLKHKNVKAFVTQGGVQSTDEAIDALVPMVGLPMMGDQFYNTNKYVELGIGRALDTVTV 417
Query: 295 TKQSFLKNAQTMLNDPSFKQNAKKWASIANDEIVSPLERVVYWTEYVLRHKG-APHLSAS 353
+ + ++ +P +++N K+ + + ++PL + +++TE+V+R+K L
Sbjct: 418 SAAQLVLAIVDVIENPKYRKNLKELRHLIRHQPMTPLHKAIWYTEHVIRNKHGNTSLKTK 477
Query: 354 SRQLTWYQMYCIDIILVIL 372
+ +++ + I++ ++
Sbjct: 478 AANVSYSDYFMSYILVPLV 496
Score = 88.5 bits (220), Expect = 5e-19
Identities = 31/79 (39%), Positives = 42/79 (53%), Gaps = 4/79 (5%)
Query: 2 SFGSVVDPTKLSEETKLGFLEVFKQLKLPIFWKIDITNDPVLNAKTLPDNVFIQKWYPQT 61
SFGS +D + E L FK+L + WK D + A LP NV QKW+PQ
Sbjct: 302 SFGSSIDTNDMDNEFLQMLLRTFKKLPYNVLWKYD----GEVEAINLPANVLTQKWFPQR 357
Query: 62 DILAHPNLRLFITHGGISS 80
+L H N++ F+T GG+ S
Sbjct: 358 AVLKHKNVKAFVTQGGVQS 376
Score = 48.4 bits (116), Expect = 4e-06
Identities = 24/77 (31%), Positives = 34/77 (44%), Gaps = 5/77 (6%)
Query: 104 FIILLSFSFLGTLSCDKILVFLPLPIWSHQMQYEKVWT-ALAERGHEVTIYTKFTPKSNS 162
I+LL L + +IL P P +SH + KV+ ALAERGH VT+
Sbjct: 6 IILLLLLLLLSGVRAARILAVFPTPAYSHHSVF-KVYVEALAERGHNVTVIKPTL--RVY 62
Query: 163 TNLKHVPIRLPKIDARM 179
+ +IDA +
Sbjct: 63 YASHLCG-NITEIDASL 78
>gnl|CDD|224732 COG1819, COG1819, Glycosyl transferases, related to
UDP-glucuronosyltransferase [Carbohydrate transport and
metabolism / Signal transduction mechanisms].
Length = 406
Score = 82.5 bits (204), Expect = 3e-17
Identities = 30/99 (30%), Positives = 52/99 (52%), Gaps = 4/99 (4%)
Query: 221 LPDNVFIQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVLL 280
+PDNV + + PQ ++L + I HGG + EA GVP++ +P DQ N +
Sbjct: 282 VPDNVIVADYVPQLELLPRADA--VIHHGGAGTTSEALYAGVPLVVIPDGADQPLNAERV 339
Query: 281 RHRGYALIEPIQTLTKQSFLKNA-QTMLNDPSFKQNAKK 318
G + P + LT++ L+ A +L D S+++ A++
Sbjct: 340 EELGAGIALPFEELTEER-LRAAVNEVLADDSYRRAAER 377
Score = 39.0 bits (91), Expect = 0.003
Identities = 11/35 (31%), Positives = 18/35 (51%), Gaps = 2/35 (5%)
Query: 48 LPDNVFIQKWYPQTDILAHPNLRLFITHGGISSLM 82
+PDNV + + PQ ++L + I HGG +
Sbjct: 282 VPDNVIVADYVPQLELLPRAD--AVIHHGGAGTTS 314
Score = 28.5 bits (64), Expect = 6.7
Identities = 15/80 (18%), Positives = 25/80 (31%), Gaps = 8/80 (10%)
Query: 142 ALAERGHEVTIYT--KFTPKSNST--NLKHVPIR--LPKIDARMIDPTKLSEETKLGFLE 195
L RGHEV + KF + PIR + K F +
Sbjct: 24 ELRRRGHEVVFASTGKFKEFVEAAGLAFVAYPIRDSELATEDGKFAGVKSFRRLLQQFKK 83
Query: 196 VFKQLKLPI--FWKIDITND 213
+ ++L + + +D
Sbjct: 84 LIRELLELLRELEPDLVVDD 103
>gnl|CDD|233407 TIGR01426, MGT, glycosyltransferase, MGT family. This model
describes the MGT (macroside glycosyltransferase)
subfamily of the UDP-glucuronosyltransferase family.
Members include a number of glucosyl transferases for
macrolide antibiotic inactivation, but also include
transferases of glucose-related sugars for macrolide
antibiotic production [Cellular processes, Toxin
production and resistance].
Length = 392
Score = 73.2 bits (180), Expect = 3e-14
Identities = 33/98 (33%), Positives = 52/98 (53%), Gaps = 2/98 (2%)
Query: 221 LPDNVFIQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVLL 280
LP NV +++W PQ +IL + FITHGG++S MEA GVP++ VP DQ +
Sbjct: 273 LPPNVEVRQWVPQLEILKKADA--FITHGGMNSTMEALFNGVPMVAVPQGADQPMTARRI 330
Query: 281 RHRGYALIEPIQTLTKQSFLKNAQTMLNDPSFKQNAKK 318
G P + +T + + +L+DP + + +K
Sbjct: 331 AELGLGRHLPPEEVTAEKLREAVLAVLSDPRYAERLRK 368
Score = 48.1 bits (115), Expect = 4e-06
Identities = 17/35 (48%), Positives = 24/35 (68%), Gaps = 2/35 (5%)
Query: 48 LPDNVFIQKWYPQTDILAHPNLRLFITHGGISSLM 82
LP NV +++W PQ +IL + FITHGG++S M
Sbjct: 273 LPPNVEVRQWVPQLEILKKAD--AFITHGGMNSTM 305
>gnl|CDD|99960 cd03784, GT1_Gtf_like, This family includes the Gtfs, a group of
homologous glycosyltransferases involved in the final
stages of the biosynthesis of antibiotics vancomycin and
related chloroeremomycin. Gtfs transfer sugar moieties
from an activated NDP-sugar donor to the oxidatively
cross-linked heptapeptide core of vancomycin group
antibiotics. The core structure is important for the
bioactivity of the antibiotics.
Length = 401
Score = 60.8 bits (148), Expect = 4e-10
Identities = 30/97 (30%), Positives = 43/97 (44%), Gaps = 2/97 (2%)
Query: 221 LPDNVFIQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVLL 280
LPDNV + + P +L P + HGG + A GVP L VPFFGDQ +
Sbjct: 286 LPDNVRVVDFVPHDWLL--PRCAAVVHHGGAGTTAAALRAGVPQLVVPFFGDQPFWAARV 343
Query: 281 RHRGYALIEPIQTLTKQSFLKNAQTMLNDPSFKQNAK 317
G + LT + + +L+ PS ++ A
Sbjct: 344 AELGAGPALDPRELTAERLAAALRRLLDPPSRRRAAA 380
Score = 31.9 bits (73), Expect = 0.54
Identities = 13/43 (30%), Positives = 19/43 (44%), Gaps = 3/43 (6%)
Query: 142 ALAERGHEVTIYT--KFTPKSNSTNLKHVPIRLPKIDARMIDP 182
AL GHEV + T +F + L+ VP+ D + P
Sbjct: 23 ALRAAGHEVRVATPPEFADLVEAAGLEFVPVGGD-PDELLASP 64
Score = 30.0 bits (68), Expect = 2.3
Identities = 23/84 (27%), Positives = 33/84 (39%), Gaps = 11/84 (13%)
Query: 1 MSFGSVV--DPTKLSEETKLGFLEVFKQLKLPIFWKIDITNDPVLNAKTLPDNVFIQKWY 58
+ FGS+V DP L+ +E L I L A+ LPDNV + +
Sbjct: 244 VGFGSMVVRDPEALARL----DVEAVATLGQRA---ILSLGWGGLGAEDLPDNVRVVDFV 296
Query: 59 PQTDILAHPNLRLFITHGGISSLM 82
P +L P + HGG +
Sbjct: 297 PHDWLL--PRCAAVVHHGGAGTTA 318
>gnl|CDD|178170 PLN02555, PLN02555, limonoid glucosyltransferase.
Length = 480
Score = 58.3 bits (141), Expect = 3e-09
Identities = 27/54 (50%), Positives = 36/54 (66%)
Query: 227 IQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVLL 280
I +W PQ +LAHP++ F+TH G +S MEA S GVPV+ P +GDQ + V L
Sbjct: 341 IVQWCPQEKVLAHPSVACFVTHCGWNSTMEALSSGVPVVCFPQWGDQVTDAVYL 394
Score = 35.2 bits (81), Expect = 0.059
Identities = 14/29 (48%), Positives = 20/29 (68%)
Query: 54 IQKWYPQTDILAHPNLRLFITHGGISSLM 82
I +W PQ +LAHP++ F+TH G +S M
Sbjct: 341 IVQWCPQEKVLAHPSVACFVTHCGWNSTM 369
>gnl|CDD|215084 PLN00164, PLN00164, glucosyltransferase; Provisional.
Length = 480
Score = 56.2 bits (136), Expect = 1e-08
Identities = 22/59 (37%), Positives = 34/59 (57%), Gaps = 1/59 (1%)
Query: 229 KWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRN-MVLLRHRGYA 286
W PQ +ILAH + F+TH G +S++E+ GVP+ P + +Q+ N L+ G A
Sbjct: 345 TWAPQKEILAHAAVGGFVTHCGWNSVLESLWHGVPMAPWPLYAEQHLNAFELVADMGVA 403
Score = 39.3 bits (92), Expect = 0.003
Identities = 12/27 (44%), Positives = 18/27 (66%)
Query: 56 KWYPQTDILAHPNLRLFITHGGISSLM 82
W PQ +ILAH + F+TH G +S++
Sbjct: 345 TWAPQKEILAHAAVGGFVTHCGWNSVL 371
>gnl|CDD|177813 PLN02152, PLN02152, indole-3-acetate beta-glucosyltransferase.
Length = 455
Score = 52.7 bits (126), Expect = 1e-07
Identities = 31/103 (30%), Positives = 54/103 (52%), Gaps = 6/103 (5%)
Query: 227 IQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVLLRH---R 283
I W Q ++L H + F+TH G SS +E+ LGVPV+ P + DQ N LL
Sbjct: 331 IVSWCSQIEVLRHRAVGCFVTHCGWSSSLESLVLGVPVVAFPMWSDQPANAKLLEEIWKT 390
Query: 284 GYALIEPIQTLTKQSFLKNA-QTMLNDPS--FKQNAKKWASIA 323
G + E + L ++ ++ + ++ + S +++A+KW +A
Sbjct: 391 GVRVRENSEGLVERGEIRRCLEAVMEEKSVELRESAEKWKRLA 433
>gnl|CDD|178589 PLN03015, PLN03015, UDP-glucosyl transferase.
Length = 470
Score = 52.0 bits (124), Expect = 3e-07
Identities = 20/56 (35%), Positives = 38/56 (67%)
Query: 225 VFIQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVLL 280
+ + +W PQ +IL+H ++ F++H G SS++E+ + GVP++ P + +Q+ N LL
Sbjct: 337 LVVTQWAPQVEILSHRSIGGFLSHCGWSSVLESLTKGVPIVAWPLYAEQWMNATLL 392
Score = 30.8 bits (69), Expect = 1.2
Identities = 11/31 (35%), Positives = 22/31 (70%)
Query: 52 VFIQKWYPQTDILAHPNLRLFITHGGISSLM 82
+ + +W PQ +IL+H ++ F++H G SS++
Sbjct: 337 LVVTQWAPQVEILSHRSIGGFLSHCGWSSVL 367
>gnl|CDD|178572 PLN02992, PLN02992, coniferyl-alcohol glucosyltransferase.
Length = 481
Score = 51.5 bits (123), Expect = 5e-07
Identities = 23/54 (42%), Positives = 32/54 (59%)
Query: 227 IQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVLL 280
+ W PQ +ILAH + F+TH G SS +E+ GVP++ P F +Q N LL
Sbjct: 342 VPSWAPQAEILAHQAVGGFLTHCGWSSTLESVVGGVPMIAWPLFAEQNMNAALL 395
Score = 32.3 bits (73), Expect = 0.48
Identities = 13/29 (44%), Positives = 18/29 (62%)
Query: 54 IQKWYPQTDILAHPNLRLFITHGGISSLM 82
+ W PQ +ILAH + F+TH G SS +
Sbjct: 342 VPSWAPQAEILAHQAVGGFLTHCGWSSTL 370
>gnl|CDD|178032 PLN02410, PLN02410, UDP-glucoronosyl/UDP-glucosyl transferase
family protein.
Length = 451
Score = 51.2 bits (122), Expect = 5e-07
Identities = 24/67 (35%), Positives = 38/67 (56%)
Query: 210 ITNDPVLNAKTLPDNVFIQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPF 269
I + P +K + +I KW PQ ++L+HP + F +H G +S +E+ GVP++ PF
Sbjct: 311 IESLPKEFSKIISGRGYIVKWAPQKEVLSHPAVGGFWSHCGWNSTLESIGEGVPMICKPF 370
Query: 270 FGDQYRN 276
DQ N
Sbjct: 371 SSDQKVN 377
Score = 35.4 bits (81), Expect = 0.049
Identities = 18/61 (29%), Positives = 31/61 (50%), Gaps = 4/61 (6%)
Query: 37 ITNDPVLNAKTLPDNVFIQKWYPQTDILAHPNLRLFITHGGISSLMNWNRYGTGGALPNL 96
I + P +K + +I KW PQ ++L+HP + F +H G +S + G +P +
Sbjct: 311 IESLPKEFSKIISGRGYIVKWAPQKEVLSHPAVGGFWSHCGWNSTLE----SIGEGVPMI 366
Query: 97 C 97
C
Sbjct: 367 C 367
>gnl|CDD|215465 PLN02863, PLN02863, UDP-glucoronosyl/UDP-glucosyl transferase
family protein.
Length = 477
Score = 50.6 bits (121), Expect = 7e-07
Identities = 23/54 (42%), Positives = 32/54 (59%)
Query: 227 IQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVLL 280
I+ W PQ IL+H + F+TH G +S++E GVP+L P DQ+ N LL
Sbjct: 347 IRGWAPQVAILSHRAVGAFLTHCGWNSVLEGLVAGVPMLAWPMAADQFVNASLL 400
Score = 31.0 bits (70), Expect = 1.1
Identities = 12/29 (41%), Positives = 19/29 (65%)
Query: 54 IQKWYPQTDILAHPNLRLFITHGGISSLM 82
I+ W PQ IL+H + F+TH G +S++
Sbjct: 347 IRGWAPQVAILSHRAVGAFLTHCGWNSVL 375
>gnl|CDD|215293 PLN02534, PLN02534, UDP-glycosyltransferase.
Length = 491
Score = 48.7 bits (116), Expect = 4e-06
Identities = 22/56 (39%), Positives = 34/56 (60%)
Query: 225 VFIQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVLL 280
+ I+ W PQ IL+HP + F+TH G +S +E GVP++ P F +Q+ N L+
Sbjct: 346 LLIKGWAPQVLILSHPAIGGFLTHCGWNSTIEGICSGVPMITWPLFAEQFLNEKLI 401
Score = 31.4 bits (71), Expect = 1.0
Identities = 13/29 (44%), Positives = 19/29 (65%)
Query: 52 VFIQKWYPQTDILAHPNLRLFITHGGISS 80
+ I+ W PQ IL+HP + F+TH G +S
Sbjct: 346 LLIKGWAPQVLILSHPAIGGFLTHCGWNS 374
>gnl|CDD|177830 PLN02173, PLN02173, UDP-glucosyl transferase family protein.
Length = 449
Score = 48.1 bits (114), Expect = 5e-06
Identities = 35/116 (30%), Positives = 53/116 (45%), Gaps = 22/116 (18%)
Query: 223 DNVFIQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVLL-- 280
D + KW PQ +L++ + F+TH G +S ME SLGVP++ +P + DQ N +
Sbjct: 317 DKSLVLKWSPQLQVLSNKAIGCFMTHCGWNSTMEGLSLGVPMVAMPQWTDQPMNAKYIQD 376
Query: 281 -----------RHRGYALIEPIQTLTKQSFLKNAQTMLNDPS--FKQNAKKWASIA 323
+ G A E I+ K + M + S K+NA KW +A
Sbjct: 377 VWKVGVRVKAEKESGIAKREEIEFSIK-------EVMEGEKSKEMKENAGKWRDLA 425
Score = 29.2 bits (65), Expect = 4.9
Identities = 12/33 (36%), Positives = 19/33 (57%)
Query: 50 DNVFIQKWYPQTDILAHPNLRLFITHGGISSLM 82
D + KW PQ +L++ + F+TH G +S M
Sbjct: 317 DKSLVLKWSPQLQVLSNKAIGCFMTHCGWNSTM 349
>gnl|CDD|178581 PLN03004, PLN03004, UDP-glycosyltransferase.
Length = 451
Score = 46.6 bits (110), Expect = 2e-05
Identities = 20/63 (31%), Positives = 37/63 (58%)
Query: 218 AKTLPDNVFIQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNM 277
++T + ++ W PQ +L H + F+TH G +S++EA GVP++ P + +Q N
Sbjct: 329 SRTEDKGMVVKSWAPQVPVLNHKAVGGFVTHCGWNSILEAVCAGVPMVAWPLYAEQRFNR 388
Query: 278 VLL 280
V++
Sbjct: 389 VMI 391
Score = 28.1 bits (62), Expect = 9.5
Identities = 11/38 (28%), Positives = 22/38 (57%)
Query: 45 AKTLPDNVFIQKWYPQTDILAHPNLRLFITHGGISSLM 82
++T + ++ W PQ +L H + F+TH G +S++
Sbjct: 329 SRTEDKGMVVKSWAPQVPVLNHKAVGGFVTHCGWNSIL 366
>gnl|CDD|177857 PLN02207, PLN02207, UDP-glycosyltransferase.
Length = 468
Score = 45.8 bits (108), Expect = 3e-05
Identities = 27/76 (35%), Positives = 43/76 (56%), Gaps = 5/76 (6%)
Query: 209 DITNDPVLNAKTLPDNV----FIQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPV 264
++TND +L L D V I W PQ +ILAH + F++H G +S++E+ GVP+
Sbjct: 315 EVTNDDLLPEGFL-DRVSGRGMICGWSPQVEILAHKAVGGFVSHCGWNSIVESLWFGVPI 373
Query: 265 LGVPFFGDQYRNMVLL 280
+ P + +Q N L+
Sbjct: 374 VTWPMYAEQQLNAFLM 389
Score = 29.6 bits (66), Expect = 3.5
Identities = 19/51 (37%), Positives = 29/51 (56%), Gaps = 5/51 (9%)
Query: 36 DITNDPVLNAKTLPDNV----FIQKWYPQTDILAHPNLRLFITHGGISSLM 82
++TND +L L D V I W PQ +ILAH + F++H G +S++
Sbjct: 315 EVTNDDLLPEGFL-DRVSGRGMICGWSPQVEILAHKAVGGFVSHCGWNSIV 364
>gnl|CDD|215127 PLN02210, PLN02210, UDP-glucosyl transferase.
Length = 456
Score = 45.0 bits (106), Expect = 4e-05
Identities = 28/88 (31%), Positives = 39/88 (44%), Gaps = 4/88 (4%)
Query: 197 FKQLKLPIFWKI----DITNDPVLNAKTLPDNVFIQKWYPQTDILAHPNLRLFITHGGIS 252
K +P W I N VL + +W PQ IL+H + F+TH G +
Sbjct: 294 LKNRGVPFLWVIRPKEKAQNVQVLQEMVKEGQGVVLEWSPQEKILSHMAISCFVTHCGWN 353
Query: 253 SLMEASSLGVPVLGVPFFGDQYRNMVLL 280
S +E GVPV+ P + DQ + LL
Sbjct: 354 STIETVVAGVPVVAYPSWTDQPIDARLL 381
Score = 34.2 bits (78), Expect = 0.11
Identities = 24/86 (27%), Positives = 38/86 (44%), Gaps = 7/86 (8%)
Query: 1 MSFGSVVDPTKLSEETKLGFLEVFKQLKLPIFWKI----DITNDPVLNAKTLPDNVFIQK 56
+SFGS+++ + ET + K +P W I N VL + +
Sbjct: 274 ISFGSMLESLENQVET---IAKALKNRGVPFLWVIRPKEKAQNVQVLQEMVKEGQGVVLE 330
Query: 57 WYPQTDILAHPNLRLFITHGGISSLM 82
W PQ IL+H + F+TH G +S +
Sbjct: 331 WSPQEKILSHMAISCFVTHCGWNSTI 356
>gnl|CDD|215304 PLN02554, PLN02554, UDP-glycosyltransferase family protein.
Length = 481
Score = 44.8 bits (106), Expect = 6e-05
Identities = 20/52 (38%), Positives = 30/52 (57%), Gaps = 3/52 (5%)
Query: 230 WYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRN---MV 278
W PQ +LA P + F+TH G +S++E+ GVP+ P + +Q N MV
Sbjct: 349 WAPQVAVLAKPAIGGFVTHCGWNSILESLWFGVPMAAWPLYAEQKFNAFEMV 400
Score = 32.5 bits (74), Expect = 0.43
Identities = 11/26 (42%), Positives = 17/26 (65%)
Query: 57 WYPQTDILAHPNLRLFITHGGISSLM 82
W PQ +LA P + F+TH G +S++
Sbjct: 349 WAPQVAVLAKPAIGGFVTHCGWNSIL 374
>gnl|CDD|215112 PLN02167, PLN02167, UDP-glycosyltransferase family protein.
Length = 475
Score = 43.6 bits (103), Expect = 1e-04
Identities = 18/47 (38%), Positives = 29/47 (61%)
Query: 230 WYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRN 276
W PQ +ILAH + F++H G +S++E+ GVP+ P + +Q N
Sbjct: 347 WAPQVEILAHKAIGGFVSHCGWNSVLESLWFGVPIATWPMYAEQQLN 393
Score = 32.1 bits (73), Expect = 0.48
Identities = 12/32 (37%), Positives = 20/32 (62%)
Query: 57 WYPQTDILAHPNLRLFITHGGISSLMNWNRYG 88
W PQ +ILAH + F++H G +S++ +G
Sbjct: 347 WAPQVEILAHKAIGGFVSHCGWNSVLESLWFG 378
>gnl|CDD|217893 pfam04101, Glyco_tran_28_C, Glycosyltransferase family 28
C-terminal domain. The glycosyltransferase family 28
includes monogalactosyldiacylglycerol synthase (EC
2.4.1.46) and UDP-N-acetylglucosamine transferase (EC
2.4.1.-). Structural analysis suggests the C-terminal
domain contains the UDP-GlcNAc binding site.
Length = 167
Score = 41.9 bits (99), Expect = 1e-04
Identities = 21/85 (24%), Positives = 33/85 (38%), Gaps = 5/85 (5%)
Query: 244 LFITHGGISSLMEASSLGVPVLGVP----FFGDQYRNMVLLRHRGYALIEPIQTLTKQSF 299
L I+ G ++ E +LG P + VP Q N + L G AL+ + LT +
Sbjct: 75 LVISRAGAGTIAELLALGKPAILVPRPKAAGEHQDNNALELVKAGAALVLLQKELTPEK- 133
Query: 300 LKNAQTMLNDPSFKQNAKKWASIAN 324
L A L + A+ +
Sbjct: 134 LVEALLKLLLKPLRLYEMNKAAKGS 158
>gnl|CDD|178584 PLN03007, PLN03007, UDP-glucosyltransferase family protein.
Length = 482
Score = 42.2 bits (99), Expect = 4e-04
Identities = 23/84 (27%), Positives = 39/84 (46%), Gaps = 9/84 (10%)
Query: 206 WKIDITNDPVLNAKTLPDN---------VFIQKWYPQTDILAHPNLRLFITHGGISSLME 256
W + + + LP+ + I+ W PQ IL H F+TH G +SL+E
Sbjct: 319 WVVRKNENQGEKEEWLPEGFEERTKGKGLIIRGWAPQVLILDHQATGGFVTHCGWNSLLE 378
Query: 257 ASSLGVPVLGVPFFGDQYRNMVLL 280
+ G+P++ P +Q+ N L+
Sbjct: 379 GVAAGLPMVTWPVGAEQFYNEKLV 402
Score = 30.6 bits (69), Expect = 1.7
Identities = 19/71 (26%), Positives = 28/71 (39%), Gaps = 13/71 (18%)
Query: 33 WKIDITNDPVLNAKTLPDN---------VFIQKWYPQTDILAHPNLRLFITHGGISSLMN 83
W + + + LP+ + I+ W PQ IL H F+TH G +SL+
Sbjct: 319 WVVRKNENQGEKEEWLPEGFEERTKGKGLIIRGWAPQVLILDHQATGGFVTHCGWNSLLE 378
Query: 84 WNRYGTGGALP 94
G LP
Sbjct: 379 ----GVAAGLP 385
>gnl|CDD|215247 PLN02448, PLN02448, UDP-glycosyltransferase family protein.
Length = 459
Score = 41.5 bits (98), Expect = 5e-04
Identities = 20/47 (42%), Positives = 26/47 (55%)
Query: 230 WYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRN 276
W Q +L H ++ F TH G +S +EA GVP+L P F DQ N
Sbjct: 330 WCDQLKVLCHSSVGGFWTHCGWNSTLEAVFAGVPMLTFPLFWDQPLN 376
>gnl|CDD|223779 COG0707, MurG, UDP-N-acetylglucosamine:LPS N-acetylglucosamine
transferase [Cell envelope biogenesis, outer membrane].
Length = 357
Score = 40.0 bits (94), Expect = 0.002
Identities = 25/83 (30%), Positives = 39/83 (46%), Gaps = 8/83 (9%)
Query: 244 LFITHGGISSLMEASSLGVPVLGVPF----FGDQYRNMVLLRHRGYALIEPIQTLTKQSF 299
L I+ G ++ E +LGVP + VP+ G Q N L G AL+ LT +
Sbjct: 255 LVISRAGALTIAELLALGVPAILVPYPPGADGHQEYNAKFLEKAGAALVIRQSELTPEKL 314
Query: 300 LKNAQTMLNDP----SFKQNAKK 318
+ +L++P + +NAKK
Sbjct: 315 AELILRLLSNPEKLKAMAENAKK 337
>gnl|CDD|178275 PLN02670, PLN02670, transferase, transferring glycosyl groups.
Length = 472
Score = 39.5 bits (92), Expect = 0.002
Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 2/59 (3%)
Query: 230 WYPQTDILAHPNLRLFITHGGISSLMEASSLG-VPVLGVPFFGDQYRNMVLLRHRGYAL 287
W PQ IL+H ++ F+TH G +S++E G V +L P +Q N LL + L
Sbjct: 346 WVPQVKILSHESVGGFLTHCGWNSVVEGLGFGRVLIL-FPVLNEQGLNTRLLHGKKLGL 403
Score = 28.3 bits (63), Expect = 8.5
Identities = 11/26 (42%), Positives = 18/26 (69%)
Query: 57 WYPQTDILAHPNLRLFITHGGISSLM 82
W PQ IL+H ++ F+TH G +S++
Sbjct: 346 WVPQVKILSHESVGGFLTHCGWNSVV 371
>gnl|CDD|234825 PRK00726, murG, undecaprenyldiphospho-muramoylpentapeptide beta-N-
acetylglucosaminyltransferase; Provisional.
Length = 357
Score = 38.6 bits (91), Expect = 0.005
Identities = 23/94 (24%), Positives = 39/94 (41%), Gaps = 9/94 (9%)
Query: 244 LFITHGGISSLMEASSLGVPVLGVPF----FGDQYRNMVLLRHRGYALIEPIQTLTKQSF 299
L I G S++ E ++ G+P + VP Q N L G AL+ P LT +
Sbjct: 255 LVICRAGASTVAELAAAGLPAILVPLPHAADDHQTANARALVDAGAALLIPQSDLTPEKL 314
Query: 300 LKNAQTMLNDP----SFKQNAKKWASI-ANDEIV 328
+ +L+DP + + A+ A + +
Sbjct: 315 AEKLLELLSDPERLEAMAEAARALGKPDAAERLA 348
>gnl|CDD|215305 PLN02562, PLN02562, UDP-glycosyltransferase.
Length = 448
Score = 38.7 bits (90), Expect = 0.005
Identities = 18/50 (36%), Positives = 27/50 (54%)
Query: 227 IQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRN 276
+ W PQ ++L H + ++TH G +S MEA +L P GDQ+ N
Sbjct: 332 VVSWAPQLEVLKHQAVGCYLTHCGWNSTMEAIQCQKRLLCYPVAGDQFVN 381
>gnl|CDD|178364 PLN02764, PLN02764, glycosyltransferase family protein.
Length = 453
Score = 38.1 bits (88), Expect = 0.007
Identities = 28/99 (28%), Positives = 44/99 (44%), Gaps = 9/99 (9%)
Query: 230 WYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVLLRHRGYALIE 289
W Q IL+HP++ F++H G S+ E+ ++ VP GDQ N LL +E
Sbjct: 324 WVQQPLILSHPSVGCFVSHCGFGSMWESLLSDCQIVLVPQLGDQVLNTRLLSDELKVSVE 383
Query: 290 PIQTLT----KQSFLKNAQTMLNDPS-----FKQNAKKW 319
+ T K+S +++ S K+N KW
Sbjct: 384 VAREETGWFSKESLRDAINSVMKRDSEIGNLVKKNHTKW 422
>gnl|CDD|233077 TIGR00661, MJ1255, conserved hypothetical protein. This model
represents nearly the full length of MJ1255 from
Methanococcus jannaschii and of an unpublished protein
from Vibrio cholerae, as well as the C-terminal half of
a protein from Methanobacterium thermoautotrophicum. A
small region (~50 amino acids) within the domain appears
related to a family of sugar transferases [Hypothetical
proteins, Conserved].
Length = 321
Score = 37.5 bits (87), Expect = 0.009
Identities = 21/48 (43%), Positives = 26/48 (54%), Gaps = 2/48 (4%)
Query: 241 NLRLFITHGGISSLMEASSLGVPVLGVPFFG--DQYRNMVLLRHRGYA 286
N L ITHGG S + EA SLG P++ +P G +Q N V L G
Sbjct: 247 NAELVITHGGFSLISEALSLGKPLIVIPDLGQFEQGNNAVKLEDLGCG 294
>gnl|CDD|99961 cd03785, GT1_MurG, MurG is an N-acetylglucosaminyltransferase, the
last enzyme involved in the intracellular phase of
peptidoglycan biosynthesis. It transfers
N-acetyl-D-glucosamine (GlcNAc) from UDP-GlcNAc to the
C4 hydroxyl of a lipid-linked N-acetylmuramoyl
pentapeptide (NAM). The resulting disaccharide is then
transported across the cell membrane, where it is
polymerized into NAG-NAM cell-wall repeat structure.
MurG belongs to the GT-B structural superfamily of
glycoslytransferases, which have characteristic N- and
C-terminal domains, each containing a typical Rossmann
fold. The two domains have high structural homology
despite minimal sequence homology. The large cleft that
separates the two domains includes the catalytic center
and permits a high degree of flexibility.
Length = 350
Score = 36.3 bits (85), Expect = 0.020
Identities = 21/83 (25%), Positives = 38/83 (45%), Gaps = 8/83 (9%)
Query: 244 LFITHGGISSLMEASSLGVPVLGVPF----FGDQYRNMVLLRHRGYALIEPIQTLTKQSF 299
L I+ G S++ E ++LG+P + +P Q N L G A++ P + LT +
Sbjct: 255 LVISRAGASTVAELAALGLPAILIPLPYAADDHQTANARALVKAGAAVLIPQEELTPERL 314
Query: 300 LKNAQTMLNDP----SFKQNAKK 318
+L+DP + + A+
Sbjct: 315 AAALLELLSDPERLKAMAEAARS 337
>gnl|CDD|177858 PLN02208, PLN02208, glycosyltransferase family protein.
Length = 442
Score = 33.8 bits (77), Expect = 0.14
Identities = 14/44 (31%), Positives = 23/44 (52%)
Query: 230 WYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQ 273
W Q IL HP++ F+ H G ++ E+ ++ +PF DQ
Sbjct: 318 WVQQPLILDHPSIGCFVNHCGPGTIWESLVSDCQMVLIPFLSDQ 361
>gnl|CDD|222200 pfam13528, Glyco_trans_1_3, Glycosyl transferase family 1.
Length = 317
Score = 33.8 bits (78), Expect = 0.14
Identities = 20/67 (29%), Positives = 29/67 (43%), Gaps = 5/67 (7%)
Query: 243 RLFITHGGISSLMEASSLGVPVLGVPFFG--DQYRNMVLLRHRGYALIEPIQTLTKQ--- 297
I + G L EA LG P+L VP G +Q N + L GY ++ ++ L
Sbjct: 251 SAVICNAGFELLSEALYLGKPLLLVPLDGQFEQTSNALYLERLGYGIVMDMEDLDPAVLR 310
Query: 298 SFLKNAQ 304
FL+
Sbjct: 311 RFLERLP 317
>gnl|CDD|215979 pfam00534, Glycos_transf_1, Glycosyl transferases group 1.
Mutations in this domain of human PIGA lead to disease
(Paroxysmal Nocturnal haemoglobinuria). Members of this
family transfer activated sugars to a variety of
substrates, including glycogen, Fructose-6-phosphate and
lipopolysaccharides. Members of this family transfer
UDP, ADP, GDP or CMP linked sugars. The eukaryotic
glycogen synthases may be distant members of this
family.
Length = 158
Score = 31.1 bits (71), Expect = 0.51
Identities = 22/112 (19%), Positives = 41/112 (36%), Gaps = 23/112 (20%)
Query: 221 LPDNVFIQ---------KWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFG 271
L DNV + Y D+ P+ G+ L+EA + GVPV+ G
Sbjct: 57 LEDNVIFVGFVPDEDLIELYRIADLFVLPSRY---EGFGLV-LLEAMAAGVPVIATDVGG 112
Query: 272 DQYRNMVLLRHRGYALIEPIQTLTKQSFLKNAQTMLNDPS----FKQNAKKW 319
+++ L+ + ++ + + +L D +NA+K
Sbjct: 113 ----PAEIVKDGETGLL--VDPGDAEALAEAIEKLLKDEELRERLGENARKR 158
>gnl|CDD|177807 PLN00414, PLN00414, glycosyltransferase family protein.
Length = 446
Score = 31.5 bits (71), Expect = 0.81
Identities = 14/44 (31%), Positives = 23/44 (52%)
Query: 230 WYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQ 273
W Q IL+HP++ F+ H G S+ E+ ++ +P DQ
Sbjct: 319 WVEQPLILSHPSVGCFVNHCGFGSMWESLVSDCQIVFIPQLADQ 362
Score = 28.8 bits (64), Expect = 6.0
Identities = 36/115 (31%), Positives = 50/115 (43%), Gaps = 19/115 (16%)
Query: 130 WSHQMQYEKVWTALAERGHEVTIYTKFTPKSNSTNLKHVPIRL-PKIDARMIDPTKLSEE 188
+ H + Y + LAE+GH VT F PK L+ P+ L P D+ + +P L
Sbjct: 15 FGHMIPYLHLANKLAEKGHRVTF---FLPKKAHKQLQ--PLNLFP--DSIVFEPLTLPPV 67
Query: 189 TKLGF-LEVFKQL----KLPIFWKIDITNDPVLNAKTL---PDNVFIQ--KWYPQ 233
L F E L K PIF +D+ D + AK PD +F W P+
Sbjct: 68 DGLPFGAETASDLPNSTKKPIFDAMDLLRDQI-EAKVRALKPDLIFFDFVHWVPE 121
>gnl|CDD|99977 cd03805, GT1_ALG2_like, This family is most closely related to the
GT1 family of glycosyltransferases. ALG2, a
1,3-mannosyltransferase, in yeast catalyzes the
mannosylation of Man(2)GlcNAc(2)-dolichol diphosphate
and Man(1)GlcNAc(2)-dolichol diphosphate to form
Man(3)GlcNAc(2)-dolichol diphosphate. A deficiency of
this enzyme causes an abnormal accumulation of
Man1GlcNAc2-PP-dolichol and Man2GlcNAc2-PP-dolichol,
which is associated with a type of congenital disorders
of glycosylation (CDG), designated CDG-Ii, in humans.
Length = 392
Score = 30.6 bits (70), Expect = 1.5
Identities = 24/69 (34%), Positives = 27/69 (39%), Gaps = 26/69 (37%)
Query: 142 ALAERGHEVTIYTKFTPKSNSTNLKHVPIRLPKIDARMIDPTKLSEETKLGFLEVF-KQL 200
AL RGHEVTIYT DP+ EETK G L V +
Sbjct: 25 ALQSRGHEVTIYTS-----------------------HHDPSHCFEETKDGTLPVRVRGD 61
Query: 201 KLP--IFWK 207
LP IF +
Sbjct: 62 WLPRSIFGR 70
>gnl|CDD|238656 cd01346, Maltoporin-like, The Maltoporin-like channels (LamB porin)
form a trimeric structure which facilitate the diffusion
of maltodextrins and other sugars across the outer
membrane of Gram-negative bacteria. The membrane channel
is formed by an 18-strand antiparallel beta-barrel
(18,22). Loop 3 folds into the core to constrict pore
size. Long irregular loops are found on the
extracelllular side, while short turns are in the
periplasm.Tightly-bound water molecules are found in the
eyelet of the passage, and only substrates that can
displace and replace the broken hydrogen bonds are
likely to enter the pore. In the MPR structure, loops
4,6, and 9 have the greatest mobility and are highly
variable; these are postulated to attract maltodextrins.
Length = 392
Score = 30.4 bits (69), Expect = 1.6
Identities = 8/38 (21%), Positives = 13/38 (34%), Gaps = 2/38 (5%)
Query: 55 QKWYPQTDILAHPNLRLFITHGGISSLMNWNRYGTGGA 92
+ TD + P LR + T+ + R T
Sbjct: 337 PTFSAGTDFWSRPELRFYATY--SNWNDAALRAFTAFG 372
>gnl|CDD|175987 cd04020, C2B_SLP_1-2-3-4, C2 domain second repeat present in
Synaptotagmin-like proteins 1-4. All Slp members
basically share an N-terminal Slp homology domain (SHD)
and C-terminal tandem C2 domains (named the C2A domain
and the C2B domain) with the SHD and C2 domains being
separated by a linker sequence of various length.
Slp1/JFC1 and Slp2/exophilin 4 promote granule docking
to the plasma membrane. Additionally, their C2A domains
are both Ca2+ independent, unlike the case in Slp3 and
Slp4/granuphilin in which their C2A domains are Ca2+
dependent. It is thought that SHD (except for the
Slp4-SHD) functions as a specific Rab27A/B-binding
domain. In addition to Slps, rabphilin, Noc2, and
Munc13-4 also function as Rab27-binding proteins. It has
been demonstrated that Slp3 and Slp4/granuphilin promote
dense-core vesicle exocytosis. C2 domains fold into an
8-standed beta-sandwich that can adopt 2 structural
arrangements: Type I and Type II, distinguished by a
circular permutation involving their N- and C-terminal
beta strands. Many C2 domains are Ca2+-dependent
membrane-targeting modules that bind a wide variety of
substances including bind phospholipids, inositol
polyphosphates, and intracellular proteins. Most C2
domain proteins are either signal transduction enzymes
that contain a single C2 domain, such as protein kinase
C, or membrane trafficking proteins which contain at
least two C2 domains, such as synaptotagmin 1. However,
there are a few exceptions to this including RIM
isoforms and some splice variants of piccolo/aczonin and
intersectin which only have a single C2 domain. C2
domains with a calcium binding region have negatively
charged residues, primarily aspartates, that serve as
ligands for calcium ions. This cd contains the second
C2 repeat, C2B, and has a type-I topology.
Length = 162
Score = 29.6 bits (67), Expect = 1.7
Identities = 10/35 (28%), Positives = 18/35 (51%), Gaps = 1/35 (2%)
Query: 128 PIWSHQMQYEKVWTA-LAERGHEVTIYTKFTPKSN 161
P+W+H Y+ V L++ E+T++ SN
Sbjct: 77 PVWNHTFVYDGVSPEDLSQACLELTVWDHDKLSSN 111
>gnl|CDD|235226 PRK04148, PRK04148, hypothetical protein; Provisional.
Length = 134
Score = 29.2 bits (66), Expect = 2.1
Identities = 32/115 (27%), Positives = 50/115 (43%), Gaps = 22/115 (19%)
Query: 188 ETKLGF-LEVFKQLKLPIF--WKIDITNDPVLNAKTLPDNVFIQKWYPQTDILAHPNLRL 244
E +GF +V K+LK F IDI V AK L N F+ D L +PNL +
Sbjct: 22 ELGIGFYFKVAKKLKESGFDVIVIDINEKAVEKAKKLGLNAFV-------DDLFNPNLEI 74
Query: 245 FITHGGISS------LME-----ASSLGVPVLGVPFFGDQ-YRNMVLLRHRGYAL 287
+ I S L A + VP++ P G++ + + L+ ++G +
Sbjct: 75 YKNAKLIYSIRPPRDLQPFILELAKKINVPLIIKPLSGEEPIKELKLINYKGKPI 129
>gnl|CDD|99973 cd03800, GT1_Sucrose_synthase, This family is most closely related
to the GT1 family of glycosyltransferases. The
sucrose-phosphate synthases in this family may be unique
to plants and photosynthetic bacteria. This enzyme
catalyzes the synthesis of sucrose 6-phosphate from
fructose 6-phosphate and uridine 5'-diphosphate-glucose,
a key regulatory step of sucrose metabolism. The
activity of this enzyme is regulated by phosphorylation
and moderated by the concentration of various
metabolites and light.
Length = 398
Score = 30.2 bits (69), Expect = 2.1
Identities = 9/13 (69%), Positives = 10/13 (76%)
Query: 142 ALAERGHEVTIYT 154
ALA GHEV I+T
Sbjct: 33 ALARLGHEVDIFT 45
>gnl|CDD|99972 cd03799, GT1_amsK_like, This is a family of GT1
glycosyltransferases found specifically in certain
bacteria. amsK in Erwinia amylovora, has been reported
to be involved in the biosynthesis of amylovoran, a
exopolysaccharide acting as a virulence factor.
Length = 355
Score = 30.2 bits (69), Expect = 2.1
Identities = 12/52 (23%), Positives = 22/52 (42%), Gaps = 13/52 (25%)
Query: 227 IQKWYPQTDILAHPNLRLFITHGGIS-----SLMEASSLGVPVL-----GVP 268
+++ D+ P+ G LMEA ++G+PV+ G+P
Sbjct: 249 VRELLRAADLFVLPS---VTAADGDREGLPVVLMEAMAMGLPVISTDVSGIP 297
>gnl|CDD|233284 TIGR01133, murG, undecaprenyldiphospho-muramoylpentapeptide
beta-N-acetylglucosaminyltransferase. RM 8449890 RT The
final step of peptidoglycan subunit assembly in
Escherichia coli occurs in the cytoplasm. RA Bupp K, van
Heijenoort J. RL J Bacteriol 1993 Mar;175(6):1841-3
[Cell envelope, Biosynthesis and degradation of murein
sacculus and peptidoglycan].
Length = 348
Score = 29.9 bits (68), Expect = 2.3
Identities = 16/48 (33%), Positives = 25/48 (52%), Gaps = 3/48 (6%)
Query: 244 LFITHGGISSLMEASSLGVPVLGVPF---FGDQYRNMVLLRHRGYALI 288
L I+ G S++ E ++ GVP + +P+ DQY N L G L+
Sbjct: 253 LVISRAGASTVAELAAAGVPAILIPYPYAADDQYYNAKFLEDLGAGLV 300
>gnl|CDD|99975 cd03802, GT1_AviGT4_like, This family is most closely related to
the GT1 family of glycosyltransferases. aviGT4 in
Streptomyces viridochromogenes has been shown to be
involved in biosynthesis of oligosaccharide antibiotic
avilamycin A. Inactivation of aviGT4 resulted in a
mutant that accumulated a novel avilamycin derivative
lacking the terminal eurekanate residue.
Length = 335
Score = 29.5 bits (67), Expect = 2.9
Identities = 12/35 (34%), Positives = 15/35 (42%), Gaps = 6/35 (17%)
Query: 139 VWT---ALAERGHEVTIYTKFTPKSNSTNLKHVPI 170
V L RGHEVT+ F + T VP+
Sbjct: 25 VAALTEGLVARGHEVTL---FASGDSKTAAPLVPV 56
>gnl|CDD|226807 COG4370, COG4370, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 412
Score = 29.8 bits (67), Expect = 3.0
Identities = 28/111 (25%), Positives = 40/111 (36%), Gaps = 27/111 (24%)
Query: 199 QLKLPIFWKIDITNDPVLNAKTLPDNVFIQKWYPQTDILAHPNLRLFITHGGISSLM--- 255
L +FW P+L TL + Q W P D N L+++ + ++
Sbjct: 256 MFALVVFWAAIAPELPLLLLWTLEER---QGWQPLADRFGKDNCSLWLSQQSFADILHAA 312
Query: 256 ------------EASSLGVPVLGVPFFGDQY------RNMVLLRHRGYALI 288
+A LG PV+G P G QY R LL G +L
Sbjct: 313 DAALGMAGTATEQAVGLGKPVIGFPGQGPQYNPGFAERQQRLL---GASLT 360
>gnl|CDD|99974 cd03801, GT1_YqgM_like, This family is most closely related to the
GT1 family of glycosyltransferases and named after YqgM
in Bacillus licheniformis about which little is known.
Glycosyltransferases catalyze the transfer of sugar
moieties from activated donor molecules to specific
acceptor molecules, forming glycosidic bonds. The
acceptor molecule can be a lipid, a protein, a
heterocyclic compound, or another carbohydrate residue.
This group of glycosyltransferases is most closely
related to the previously defined glycosyltransferase
family 1 (GT1). The members of this family may transfer
UDP, ADP, GDP, or CMP linked sugars. The diverse
enzymatic activities among members of this family
reflect a wide range of biological functions. The
protein structure available for this family has the GTB
topology, one of the two protein topologies observed for
nucleotide-sugar-dependent glycosyltransferases. GTB
proteins have distinct N- and C- terminal domains each
containing a typical Rossmann fold. The two domains have
high structural homology despite minimal sequence
homology. The large cleft that separates the two domains
includes the catalytic center and permits a high degree
of flexibility. The members of this family are found
mainly in certain bacteria and archaea.
Length = 374
Score = 28.9 bits (65), Expect = 4.8
Identities = 14/69 (20%), Positives = 22/69 (31%), Gaps = 6/69 (8%)
Query: 142 ALAERGHEVTIYTKFTPKSNSTNLKHVPIRLPKIDARMIDPTKLSEETKLGFLEVFKQLK 201
ALA RGHEVT+ T + + + L L + +
Sbjct: 26 ALAARGHEVTVLTPGDGGLPDEEEVGGIVVVRPPPLLRVRR------LLLLLLLALRLRR 79
Query: 202 LPIFWKIDI 210
L + D+
Sbjct: 80 LLRRERFDV 88
>gnl|CDD|222237 pfam13579, Glyco_trans_4_4, Glycosyl transferase 4-like domain.
Length = 158
Score = 28.2 bits (63), Expect = 4.8
Identities = 10/13 (76%), Positives = 11/13 (84%)
Query: 142 ALAERGHEVTIYT 154
ALA RGHEVT+ T
Sbjct: 13 ALAARGHEVTVVT 25
>gnl|CDD|238458 cd00916, Npc2_like, Niemann-Pick type C2 (Npc2) is a lysosomal
protein in which a mutation in the gene causes a rare
form of Niemann-Pick type C disease, an autosomal
recessive lipid storage disorder characterized by
accumulation of low-density lipoprotein-derived
cholesterol in lysosomes. Although Npc2 is known to bind
cholesterol, the function of this protein is unknown.
These proteins belong to the ML domain family. .
Length = 123
Score = 27.3 bits (61), Expect = 6.7
Identities = 9/23 (39%), Positives = 12/23 (52%)
Query: 146 RGHEVTIYTKFTPKSNSTNLKHV 168
RG + FTP +ST+LK
Sbjct: 28 RGSTAKVSIDFTPNFDSTSLKTE 50
>gnl|CDD|130050 TIGR00977, LeuA_rel, 2-isopropylmalate synthase/homocitrate
synthase family protein. This model represents
uncharacterized proteins related to 2-isopropylmalate
synthases and homocitrate synthases but phylogenetically
distint. Each species represented in the seed alignment
also has a member of a known family of 2-isopropylmalate
synthases [Unknown function, General].
Length = 526
Score = 28.7 bits (64), Expect = 7.0
Identities = 18/53 (33%), Positives = 24/53 (45%), Gaps = 4/53 (7%)
Query: 313 KQNAKKWASIAN--DEIVS--PLERVVYWTEYVLRHKGAPHLSASSRQLTWYQ 361
+N KK S A EIV+ P + + Y HKG H+SA R Y+
Sbjct: 261 PENLKKLTSTARLVAEIVNLPPDDNMPYVGRSAFAHKGGVHVSAVQRNPFTYE 313
>gnl|CDD|99980 cd03808, GT1_cap1E_like, This family is most closely related to the
GT1 family of glycosyltransferases. cap1E in
Streptococcus pneumoniae is required for the synthesis
of type 1 capsular polysaccharides.
Length = 359
Score = 28.3 bits (64), Expect = 8.6
Identities = 10/39 (25%), Positives = 18/39 (46%), Gaps = 6/39 (15%)
Query: 227 IQKWYPQTDILAHPNLRLFITHGGIS-SLMEASSLGVPV 264
+ + D+ P+ R G+ L+EA ++G PV
Sbjct: 257 VPELLAAADVFVLPSYR-----EGLPRVLLEAMAMGRPV 290
>gnl|CDD|239055 cd02140, Nitroreductase_4, Nitroreductase-like family 4. A
subfamily of the nitroreductase family containing
uncharacterized proteins that are similar to
nitroreductase. Nitroreductase catalyzes the reduction
of nitroaromatic compounds such as nitrotoluenes,
nitrofurans and nitroimidazoles. This process requires
NAD(P)H as electron donor in an obligatory two-electron
transfer and uses FMN as cofactor. The enzyme is
typically a homodimer. Members of this family are also
called NADH dehydrogenase, oxygen-insensitive NAD(P)H
nitrogenase or dihydropteridine reductase.
Length = 192
Score = 27.6 bits (62), Expect = 8.7
Identities = 12/24 (50%), Positives = 13/24 (54%), Gaps = 3/24 (12%)
Query: 128 PIWSHQ---MQYEKVWTALAERGH 148
P+WS Q M VWTALA G
Sbjct: 116 PVWSEQSSGMAQIAVWTALAAEGI 139
>gnl|CDD|217329 pfam03033, Glyco_transf_28, Glycosyltransferase family 28
N-terminal domain. The glycosyltransferase family 28
includes monogalactosyldiacylglycerol synthase (EC
2.4.1.46) and UDP-N-acetylglucosamine transferase (EC
2.4.1.-). This N-terminal domain contains the acceptor
binding site and likely membrane association site. This
family also contains a large number of proteins that
probably have quite distinct activities.
Length = 136
Score = 27.3 bits (61), Expect = 8.8
Identities = 14/68 (20%), Positives = 20/68 (29%), Gaps = 12/68 (17%)
Query: 142 ALAERGHEVTIYTK--FTPKSNSTNLKHVPIRLPKIDARMIDPTKLSEETKLGFLEVFKQ 199
AL RGHEV + T L VPI + + + E +
Sbjct: 21 ALRRRGHEVRLGTPPGLEEFVEEAGLPFVPIGGDGLRRKSLK----------NLKEPLEG 70
Query: 200 LKLPIFWK 207
+ K
Sbjct: 71 GRALRQAK 78
>gnl|CDD|238885 cd01902, Ntn_CGH, Choloylglycine hydrolase (CGH) is a bile
salt-modifying enzyme that hydrolyzes non-peptide
carbon-nitrogen bonds in choloylglycine and
choloyltaurine, both of which are present in bile. CGH
is present in a number of probiotic microbial organisms
that inhabit the gut. CGH has an N-terminal
nucleophilic cysteine, as do other members of the Ntn
hydrolase family to which CGH belongs.
Length = 291
Score = 27.7 bits (62), Expect = 9.9
Identities = 9/34 (26%), Positives = 13/34 (38%), Gaps = 3/34 (8%)
Query: 301 KNAQTMLNDPSFKQ---NAKKWASIANDEIVSPL 331
K Q M N P++ Q K W + + L
Sbjct: 167 KQYQVMTNSPTYDQQLALNKYWKQEKDIGGTTGL 200
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.323 0.138 0.428
Gapped
Lambda K H
0.267 0.0667 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 20,648,039
Number of extensions: 2012135
Number of successful extensions: 2017
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2005
Number of HSP's successfully gapped: 92
Length of query: 403
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 304
Effective length of database: 6,546,556
Effective search space: 1990153024
Effective search space used: 1990153024
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 60 (27.0 bits)