RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy16939
(174 letters)
>gnl|CDD|223071 PHA03392, egt, ecdysteroid UDP-glucosyltransferase; Provisional.
Length = 507
Score = 122 bits (308), Expect = 4e-33
Identities = 45/119 (37%), Positives = 65/119 (54%), Gaps = 4/119 (3%)
Query: 21 SDLQQRADAAKGGFVYMSFGSVVDPTKLSEETKLGFLEVFKQLKLPIFWKIDITNDPVLN 80
L++ + + G VY+SFGS +D + E L FK+L + WK D +
Sbjct: 285 DYLEEFLNNSTNGVVYVSFGSSIDTNDMDNEFLQMLLRTFKKLPYNVLWKYD----GEVE 340
Query: 81 AKTLPDNVFIQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRN 139
A LP NV QKW+PQ +L H N++ F+T GG+ S EA VP++G+P GDQ+ N
Sbjct: 341 AINLPANVLTQKWFPQRAVLKHKNVKAFVTQGGVQSTDEAIDALVPMVGLPMMGDQFYN 399
>gnl|CDD|201077 pfam00201, UDPGT, UDP-glucoronosyl and UDP-glucosyl transferase.
Length = 500
Score = 95.2 bits (237), Expect = 2e-23
Identities = 49/141 (34%), Positives = 72/141 (51%), Gaps = 7/141 (4%)
Query: 33 GFVYMSFGSVVDPTKLSEETKLGFLEVFKQLKLPIFWKIDITNDPVLNAKTLPDNVFIQK 92
G V S GS+V + + EE Q+ + W+ D T TL N + K
Sbjct: 277 GVVVFSLGSMV--SNIPEEKANEIASALAQIPQKVLWRFDGTKPS-----TLGRNTRLVK 329
Query: 93 WYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVLLRHRGYALIE 152
W PQ D+L HP R F+TH G + + EA GVP++G+P FGDQ N + +G A+
Sbjct: 330 WLPQNDLLGHPKTRAFVTHAGSNGVYEAICHGVPMVGMPLFGDQMDNAKHMEAKGAAVTL 389
Query: 153 PIQTLTKQSFLKNAQTMLNDP 173
+ T+T + L +T++NDP
Sbjct: 390 NVLTMTSEDLLNALKTVINDP 410
>gnl|CDD|224732 COG1819, COG1819, Glycosyl transferases, related to
UDP-glucuronosyltransferase [Carbohydrate transport and
metabolism / Signal transduction mechanisms].
Length = 406
Score = 74.8 bits (184), Expect = 3e-16
Identities = 38/144 (26%), Positives = 65/144 (45%), Gaps = 19/144 (13%)
Query: 35 VYMSFGSVVDPTKLSEETKLGFLEVFKQLKLPIFWKIDITNDPVLNAKT----LPDNVFI 90
VY+S G+V + +L LE L ++ ++ + A+ +PDNV +
Sbjct: 240 VYVSLGTVGNAVELLAI----VLEALADLD----VRVIVS---LGGARDTLVNVPDNVIV 288
Query: 91 QKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVLLRHRGYAL 150
+ PQ ++L + I HGG + EA GVP++ +P DQ N + G +
Sbjct: 289 ADYVPQLELLPRADA--VIHHGGAGTTSEALYAGVPLVVIPDGADQPLNAERVEELGAGI 346
Query: 151 IEPIQTLTKQSFLKNA-QTMLNDP 173
P + LT++ L+ A +L D
Sbjct: 347 ALPFEELTEER-LRAAVNEVLADD 369
>gnl|CDD|233407 TIGR01426, MGT, glycosyltransferase, MGT family. This model
describes the MGT (macroside glycosyltransferase)
subfamily of the UDP-glucuronosyltransferase family.
Members include a number of glucosyl transferases for
macrolide antibiotic inactivation, but also include
transferases of glucose-related sugars for macrolide
antibiotic production [Cellular processes, Toxin
production and resistance].
Length = 392
Score = 70.5 bits (173), Expect = 9e-15
Identities = 44/141 (31%), Positives = 67/141 (47%), Gaps = 8/141 (5%)
Query: 35 VYMSFGSVVDPTKLSEETKLGFLEVFKQLKLPIFWKIDITNDP-VLNAKTLPDNVFIQKW 93
V +S G+V + T +E F+ L + + DP L LP NV +++W
Sbjct: 228 VLISLGTVFNNQPSFYRT---CVEAFRDLDWHVVLSVGRGVDPADLG--ELPPNVEVRQW 282
Query: 94 YPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVLLRHRGYALIEP 153
PQ +IL + FITHGG++S MEA GVP++ VP DQ + G P
Sbjct: 283 VPQLEILKKADA--FITHGGMNSTMEALFNGVPMVAVPQGADQPMTARRIAELGLGRHLP 340
Query: 154 IQTLTKQSFLKNAQTMLNDPR 174
+ +T + + +L+DPR
Sbjct: 341 PEEVTAEKLREAVLAVLSDPR 361
>gnl|CDD|178170 PLN02555, PLN02555, limonoid glucosyltransferase.
Length = 480
Score = 61.7 bits (150), Expect = 1e-11
Identities = 42/123 (34%), Positives = 60/123 (48%), Gaps = 22/123 (17%)
Query: 35 VYMSFGSVVDPTKLS-EETKLGFLEVFKQLKLPIFWKI-----DITNDPVLNAKTLPDNV 88
VY+SFG+VV + +E G L + W + D +P + LP+
Sbjct: 280 VYISFGTVVYLKQEQIDEIAYGVLNS----GVSFLWVMRPPHKDSGVEPHV----LPEEF 331
Query: 89 F--------IQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNM 140
I +W PQ +LAHP++ F+TH G +S MEA S GVPV+ P +GDQ +
Sbjct: 332 LEKAGDKGKIVQWCPQEKVLAHPSVACFVTHCGWNSTMEALSSGVPVVCFPQWGDQVTDA 391
Query: 141 VLL 143
V L
Sbjct: 392 VYL 394
>gnl|CDD|99960 cd03784, GT1_Gtf_like, This family includes the Gtfs, a group of
homologous glycosyltransferases involved in the final
stages of the biosynthesis of antibiotics vancomycin and
related chloroeremomycin. Gtfs transfer sugar moieties
from an activated NDP-sugar donor to the oxidatively
cross-linked heptapeptide core of vancomycin group
antibiotics. The core structure is important for the
bioactivity of the antibiotics.
Length = 401
Score = 59.7 bits (145), Expect = 5e-11
Identities = 42/141 (29%), Positives = 58/141 (41%), Gaps = 11/141 (7%)
Query: 35 VYMSFGSVV--DPTKLSEETKLGFLEVFKQLKLPIFWKIDITNDPVLNAKTLPDNVFIQK 92
VY+ FGS+V DP L+ +E L I L A+ LPDNV +
Sbjct: 242 VYVGFGSMVVRDPEALARL----DVEAVATLGQRA---ILSLGWGGLGAEDLPDNVRVVD 294
Query: 93 WYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVLLRHRGYALIE 152
+ P +L P + HGG + A GVP L VPFFGDQ + G
Sbjct: 295 FVPHDWLL--PRCAAVVHHGGAGTTAAALRAGVPQLVVPFFGDQPFWAARVAELGAGPAL 352
Query: 153 PIQTLTKQSFLKNAQTMLNDP 173
+ LT + + +L+ P
Sbjct: 353 DPRELTAERLAAALRRLLDPP 373
>gnl|CDD|215127 PLN02210, PLN02210, UDP-glucosyl transferase.
Length = 456
Score = 56.6 bits (136), Expect = 7e-10
Identities = 37/118 (31%), Positives = 55/118 (46%), Gaps = 7/118 (5%)
Query: 30 AKGGFVYMSFGSVVDPTKLSEETKLGFLEVFKQLKLPIFWKI----DITNDPVLNAKTLP 85
A+ VY+SFGS+++ + ET + K +P W I N VL
Sbjct: 267 ARSSVVYISFGSMLESLENQVET---IAKALKNRGVPFLWVIRPKEKAQNVQVLQEMVKE 323
Query: 86 DNVFIQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVLL 143
+ +W PQ IL+H + F+TH G +S +E GVPV+ P + DQ + LL
Sbjct: 324 GQGVVLEWSPQEKILSHMAISCFVTHCGWNSTIETVVAGVPVVAYPSWTDQPIDARLL 381
>gnl|CDD|215084 PLN00164, PLN00164, glucosyltransferase; Provisional.
Length = 480
Score = 55.1 bits (133), Expect = 2e-09
Identities = 22/59 (37%), Positives = 34/59 (57%), Gaps = 1/59 (1%)
Query: 92 KWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRN-MVLLRHRGYA 149
W PQ +ILAH + F+TH G +S++E+ GVP+ P + +Q+ N L+ G A
Sbjct: 345 TWAPQKEILAHAAVGGFVTHCGWNSVLESLWHGVPMAPWPLYAEQHLNAFELVADMGVA 403
>gnl|CDD|177813 PLN02152, PLN02152, indole-3-acetate beta-glucosyltransferase.
Length = 455
Score = 53.5 bits (128), Expect = 8e-09
Identities = 40/127 (31%), Positives = 57/127 (44%), Gaps = 24/127 (18%)
Query: 35 VYMSFGSVVDPTKLS-EETKLGFLEVFKQLKLPIFWKIDITNDPVLNAKT---------- 83
+Y+SFG++V+ +K EE +E K P W I T+ AK
Sbjct: 264 IYVSFGTMVELSKKQIEELARALIEG----KRPFLWVI--TDKLNREAKIEGEEETEIEK 317
Query: 84 -------LPDNVFIQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQ 136
L + I W Q ++L H + F+TH G SS +E+ LGVPV+ P + DQ
Sbjct: 318 IAGFRHELEEVGMIVSWCSQIEVLRHRAVGCFVTHCGWSSSLESLVLGVPVVAFPMWSDQ 377
Query: 137 YRNMVLL 143
N LL
Sbjct: 378 PANAKLL 384
>gnl|CDD|177830 PLN02173, PLN02173, UDP-glucosyl transferase family protein.
Length = 449
Score = 52.7 bits (126), Expect = 1e-08
Identities = 34/113 (30%), Positives = 54/113 (47%), Gaps = 9/113 (7%)
Query: 31 KGGFVYMSFGSVVDPTKLSEETKLGFLEVFKQLKLPIFWKIDITNDPVLNAKTL----PD 86
+G VY++FGS+ + E + F L W + + + L L D
Sbjct: 263 QGSVVYIAFGSMAKLSSEQMEEIASAISNFSYL-----WVVRASEESKLPPGFLETVDKD 317
Query: 87 NVFIQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRN 139
+ KW PQ +L++ + F+TH G +S ME SLGVP++ +P + DQ N
Sbjct: 318 KSLVLKWSPQLQVLSNKAIGCFMTHCGWNSTMEGLSLGVPMVAMPQWTDQPMN 370
>gnl|CDD|178589 PLN03015, PLN03015, UDP-glucosyl transferase.
Length = 470
Score = 52.4 bits (125), Expect = 2e-08
Identities = 20/56 (35%), Positives = 38/56 (67%)
Query: 88 VFIQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVLL 143
+ + +W PQ +IL+H ++ F++H G SS++E+ + GVP++ P + +Q+ N LL
Sbjct: 337 LVVTQWAPQVEILSHRSIGGFLSHCGWSSVLESLTKGVPIVAWPLYAEQWMNATLL 392
>gnl|CDD|178032 PLN02410, PLN02410, UDP-glucoronosyl/UDP-glucosyl transferase
family protein.
Length = 451
Score = 51.6 bits (123), Expect = 3e-08
Identities = 31/109 (28%), Positives = 51/109 (46%), Gaps = 4/109 (3%)
Query: 32 GGFVYMSFGSVVDPTKLSEETKLGFLEVFKQLKLP-IFWKIDITNDPVLNAKTLPDNVFI 90
G M V++ + + FL V + + W I + P +K + +I
Sbjct: 272 GSLALMEINEVMETASGLDSSNQQFLWVIRPGSVRGSEW---IESLPKEFSKIISGRGYI 328
Query: 91 QKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRN 139
KW PQ ++L+HP + F +H G +S +E+ GVP++ PF DQ N
Sbjct: 329 VKWAPQKEVLSHPAVGGFWSHCGWNSTLESIGEGVPMICKPFSSDQKVN 377
>gnl|CDD|215465 PLN02863, PLN02863, UDP-glucoronosyl/UDP-glucosyl transferase
family protein.
Length = 477
Score = 51.4 bits (123), Expect = 4e-08
Identities = 23/54 (42%), Positives = 32/54 (59%)
Query: 90 IQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVLL 143
I+ W PQ IL+H + F+TH G +S++E GVP+L P DQ+ N LL
Sbjct: 347 IRGWAPQVAILSHRAVGAFLTHCGWNSVLEGLVAGVPMLAWPMAADQFVNASLL 400
>gnl|CDD|178572 PLN02992, PLN02992, coniferyl-alcohol glucosyltransferase.
Length = 481
Score = 51.1 bits (122), Expect = 4e-08
Identities = 23/54 (42%), Positives = 32/54 (59%)
Query: 90 IQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVLL 143
+ W PQ +ILAH + F+TH G SS +E+ GVP++ P F +Q N LL
Sbjct: 342 VPSWAPQAEILAHQAVGGFLTHCGWSSTLESVVGGVPMIAWPLFAEQNMNAALL 395
>gnl|CDD|215293 PLN02534, PLN02534, UDP-glycosyltransferase.
Length = 491
Score = 48.7 bits (116), Expect = 3e-07
Identities = 22/56 (39%), Positives = 34/56 (60%)
Query: 88 VFIQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVLL 143
+ I+ W PQ IL+HP + F+TH G +S +E GVP++ P F +Q+ N L+
Sbjct: 346 LLIKGWAPQVLILSHPAIGGFLTHCGWNSTIEGICSGVPMITWPLFAEQFLNEKLI 401
>gnl|CDD|178584 PLN03007, PLN03007, UDP-glucosyltransferase family protein.
Length = 482
Score = 47.2 bits (112), Expect = 9e-07
Identities = 32/124 (25%), Positives = 52/124 (41%), Gaps = 18/124 (14%)
Query: 32 GGFVYMSFGSVVDPTKLSEETKLGFLEVFKQLKLP---IFWKIDITNDPVLNAKTLPDN- 87
+Y+SFGSV E E+ L+ W + + + LP+
Sbjct: 285 DSVIYLSFGSV---ASFKNEQ---LFEIAAGLEGSGQNFIWVVRKNENQGEKEEWLPEGF 338
Query: 88 --------VFIQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRN 139
+ I+ W PQ IL H F+TH G +SL+E + G+P++ P +Q+ N
Sbjct: 339 EERTKGKGLIIRGWAPQVLILDHQATGGFVTHCGWNSLLEGVAAGLPMVTWPVGAEQFYN 398
Query: 140 MVLL 143
L+
Sbjct: 399 EKLV 402
>gnl|CDD|178581 PLN03004, PLN03004, UDP-glycosyltransferase.
Length = 451
Score = 47.0 bits (111), Expect = 1e-06
Identities = 20/63 (31%), Positives = 37/63 (58%)
Query: 81 AKTLPDNVFIQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNM 140
++T + ++ W PQ +L H + F+TH G +S++EA GVP++ P + +Q N
Sbjct: 329 SRTEDKGMVVKSWAPQVPVLNHKAVGGFVTHCGWNSILEAVCAGVPMVAWPLYAEQRFNR 388
Query: 141 VLL 143
V++
Sbjct: 389 VMI 391
>gnl|CDD|177857 PLN02207, PLN02207, UDP-glycosyltransferase.
Length = 468
Score = 45.8 bits (108), Expect = 4e-06
Identities = 27/76 (35%), Positives = 43/76 (56%), Gaps = 5/76 (6%)
Query: 72 DITNDPVLNAKTLPDNV----FIQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPV 127
++TND +L L D V I W PQ +ILAH + F++H G +S++E+ GVP+
Sbjct: 315 EVTNDDLLPEGFL-DRVSGRGMICGWSPQVEILAHKAVGGFVSHCGWNSIVESLWFGVPI 373
Query: 128 LGVPFFGDQYRNMVLL 143
+ P + +Q N L+
Sbjct: 374 VTWPMYAEQQLNAFLM 389
>gnl|CDD|215304 PLN02554, PLN02554, UDP-glycosyltransferase family protein.
Length = 481
Score = 44.4 bits (105), Expect = 9e-06
Identities = 20/52 (38%), Positives = 30/52 (57%), Gaps = 3/52 (5%)
Query: 93 WYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRN---MV 141
W PQ +LA P + F+TH G +S++E+ GVP+ P + +Q N MV
Sbjct: 349 WAPQVAVLAKPAIGGFVTHCGWNSILESLWFGVPMAAWPLYAEQKFNAFEMV 400
>gnl|CDD|215112 PLN02167, PLN02167, UDP-glycosyltransferase family protein.
Length = 475
Score = 43.6 bits (103), Expect = 2e-05
Identities = 18/47 (38%), Positives = 29/47 (61%)
Query: 93 WYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRN 139
W PQ +ILAH + F++H G +S++E+ GVP+ P + +Q N
Sbjct: 347 WAPQVEILAHKAIGGFVSHCGWNSVLESLWFGVPIATWPMYAEQQLN 393
>gnl|CDD|217893 pfam04101, Glyco_tran_28_C, Glycosyltransferase family 28
C-terminal domain. The glycosyltransferase family 28
includes monogalactosyldiacylglycerol synthase (EC
2.4.1.46) and UDP-N-acetylglucosamine transferase (EC
2.4.1.-). Structural analysis suggests the C-terminal
domain contains the UDP-GlcNAc binding site.
Length = 167
Score = 41.5 bits (98), Expect = 4e-05
Identities = 17/56 (30%), Positives = 25/56 (44%), Gaps = 4/56 (7%)
Query: 107 LFITHGGISSLMEASSLGVPVLGVP----FFGDQYRNMVLLRHRGYALIEPIQTLT 158
L I+ G ++ E +LG P + VP Q N + L G AL+ + LT
Sbjct: 75 LVISRAGAGTIAELLALGKPAILVPRPKAAGEHQDNNALELVKAGAALVLLQKELT 130
>gnl|CDD|215305 PLN02562, PLN02562, UDP-glycosyltransferase.
Length = 448
Score = 42.2 bits (99), Expect = 5e-05
Identities = 32/109 (29%), Positives = 48/109 (44%), Gaps = 7/109 (6%)
Query: 35 VYMSFGSVVDPTKLSEETKLGF-LEVFKQLKLPIFWKIDIT---NDPVLNAKTLPDNVFI 90
+Y+SFGS V P S L LE + P W ++ P + + +
Sbjct: 276 IYISFGSWVSPIGESNVRTLALALEASGR---PFIWVLNPVWREGLPPGYVERVSKQGKV 332
Query: 91 QKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRN 139
W PQ ++L H + ++TH G +S MEA +L P GDQ+ N
Sbjct: 333 VSWAPQLEVLKHQAVGCYLTHCGWNSTMEAIQCQKRLLCYPVAGDQFVN 381
>gnl|CDD|215247 PLN02448, PLN02448, UDP-glycosyltransferase family protein.
Length = 459
Score = 41.5 bits (98), Expect = 8e-05
Identities = 20/47 (42%), Positives = 26/47 (55%)
Query: 93 WYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRN 139
W Q +L H ++ F TH G +S +EA GVP+L P F DQ N
Sbjct: 330 WCDQLKVLCHSSVGGFWTHCGWNSTLEAVFAGVPMLTFPLFWDQPLN 376
>gnl|CDD|178275 PLN02670, PLN02670, transferase, transferring glycosyl groups.
Length = 472
Score = 39.9 bits (93), Expect = 3e-04
Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 2/59 (3%)
Query: 93 WYPQTDILAHPNLRLFITHGGISSLMEASSLG-VPVLGVPFFGDQYRNMVLLRHRGYAL 150
W PQ IL+H ++ F+TH G +S++E G V +L P +Q N LL + L
Sbjct: 346 WVPQVKILSHESVGGFLTHCGWNSVVEGLGFGRVLIL-FPVLNEQGLNTRLLHGKKLGL 403
>gnl|CDD|223779 COG0707, MurG, UDP-N-acetylglucosamine:LPS N-acetylglucosamine
transferase [Cell envelope biogenesis, outer membrane].
Length = 357
Score = 39.2 bits (92), Expect = 5e-04
Identities = 21/71 (29%), Positives = 33/71 (46%), Gaps = 4/71 (5%)
Query: 107 LFITHGGISSLMEASSLGVPVLGVPF----FGDQYRNMVLLRHRGYALIEPIQTLTKQSF 162
L I+ G ++ E +LGVP + VP+ G Q N L G AL+ LT +
Sbjct: 255 LVISRAGALTIAELLALGVPAILVPYPPGADGHQEYNAKFLEKAGAALVIRQSELTPEKL 314
Query: 163 LKNAQTMLNDP 173
+ +L++P
Sbjct: 315 AELILRLLSNP 325
>gnl|CDD|233077 TIGR00661, MJ1255, conserved hypothetical protein. This model
represents nearly the full length of MJ1255 from
Methanococcus jannaschii and of an unpublished protein
from Vibrio cholerae, as well as the C-terminal half of
a protein from Methanobacterium thermoautotrophicum. A
small region (~50 amino acids) within the domain appears
related to a family of sugar transferases [Hypothetical
proteins, Conserved].
Length = 321
Score = 37.9 bits (88), Expect = 0.001
Identities = 21/48 (43%), Positives = 26/48 (54%), Gaps = 2/48 (4%)
Query: 104 NLRLFITHGGISSLMEASSLGVPVLGVPFFG--DQYRNMVLLRHRGYA 149
N L ITHGG S + EA SLG P++ +P G +Q N V L G
Sbjct: 247 NAELVITHGGFSLISEALSLGKPLIVIPDLGQFEQGNNAVKLEDLGCG 294
>gnl|CDD|178364 PLN02764, PLN02764, glycosyltransferase family protein.
Length = 453
Score = 37.0 bits (85), Expect = 0.003
Identities = 19/51 (37%), Positives = 28/51 (54%)
Query: 93 WYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVLL 143
W Q IL+HP++ F++H G S+ E+ ++ VP GDQ N LL
Sbjct: 324 WVQQPLILSHPSVGCFVSHCGFGSMWESLLSDCQIVLVPQLGDQVLNTRLL 374
>gnl|CDD|234825 PRK00726, murG, undecaprenyldiphospho-muramoylpentapeptide beta-N-
acetylglucosaminyltransferase; Provisional.
Length = 357
Score = 35.5 bits (83), Expect = 0.009
Identities = 21/71 (29%), Positives = 32/71 (45%), Gaps = 4/71 (5%)
Query: 107 LFITHGGISSLMEASSLGVPVLGVPF----FGDQYRNMVLLRHRGYALIEPIQTLTKQSF 162
L I G S++ E ++ G+P + VP Q N L G AL+ P LT +
Sbjct: 255 LVICRAGASTVAELAAAGLPAILVPLPHAADDHQTANARALVDAGAALLIPQSDLTPEKL 314
Query: 163 LKNAQTMLNDP 173
+ +L+DP
Sbjct: 315 AEKLLELLSDP 325
>gnl|CDD|222200 pfam13528, Glyco_trans_1_3, Glycosyl transferase family 1.
Length = 317
Score = 34.2 bits (79), Expect = 0.022
Identities = 33/149 (22%), Positives = 55/149 (36%), Gaps = 25/149 (16%)
Query: 27 ADAAKGGF--VYMSFGSVVDPTKLSEETKLGFLEVFKQLKLPIFWKIDITND-PVLNAKT 83
A +GG VY+ F D ++E L+ F + ++ L +
Sbjct: 186 ARPEEGGKILVYLPFE---DLEDVAE-----LLKPFPDYEFIVYGPNKEEGREGNLTFRP 237
Query: 84 LPDNVFIQKWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFG--DQYRNMV 141
+ F++ D+ I + G L EA LG P+L VP G +Q N +
Sbjct: 238 FSEEGFLR------DLAEC---SAVICNAGFELLSEALYLGKPLLLVPLDGQFEQTSNAL 288
Query: 142 LLRHRGYALIEPIQTLTKQ---SFLKNAQ 167
L GY ++ ++ L FL+
Sbjct: 289 YLERLGYGIVMDMEDLDPAVLRRFLERLP 317
>gnl|CDD|177858 PLN02208, PLN02208, glycosyltransferase family protein.
Length = 442
Score = 34.2 bits (78), Expect = 0.025
Identities = 14/44 (31%), Positives = 23/44 (52%)
Query: 93 WYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQ 136
W Q IL HP++ F+ H G ++ E+ ++ +PF DQ
Sbjct: 318 WVQQPLILDHPSIGCFVNHCGPGTIWESLVSDCQMVLIPFLSDQ 361
>gnl|CDD|99961 cd03785, GT1_MurG, MurG is an N-acetylglucosaminyltransferase, the
last enzyme involved in the intracellular phase of
peptidoglycan biosynthesis. It transfers
N-acetyl-D-glucosamine (GlcNAc) from UDP-GlcNAc to the
C4 hydroxyl of a lipid-linked N-acetylmuramoyl
pentapeptide (NAM). The resulting disaccharide is then
transported across the cell membrane, where it is
polymerized into NAG-NAM cell-wall repeat structure.
MurG belongs to the GT-B structural superfamily of
glycoslytransferases, which have characteristic N- and
C-terminal domains, each containing a typical Rossmann
fold. The two domains have high structural homology
despite minimal sequence homology. The large cleft that
separates the two domains includes the catalytic center
and permits a high degree of flexibility.
Length = 350
Score = 33.6 bits (78), Expect = 0.033
Identities = 20/71 (28%), Positives = 34/71 (47%), Gaps = 4/71 (5%)
Query: 107 LFITHGGISSLMEASSLGVPVLGVPF----FGDQYRNMVLLRHRGYALIEPIQTLTKQSF 162
L I+ G S++ E ++LG+P + +P Q N L G A++ P + LT +
Sbjct: 255 LVISRAGASTVAELAALGLPAILIPLPYAADDHQTANARALVKAGAAVLIPQEELTPERL 314
Query: 163 LKNAQTMLNDP 173
+L+DP
Sbjct: 315 AAALLELLSDP 325
>gnl|CDD|177807 PLN00414, PLN00414, glycosyltransferase family protein.
Length = 446
Score = 31.5 bits (71), Expect = 0.19
Identities = 14/44 (31%), Positives = 23/44 (52%)
Query: 93 WYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQ 136
W Q IL+HP++ F+ H G S+ E+ ++ +P DQ
Sbjct: 319 WVEQPLILSHPSVGCFVNHCGFGSMWESLVSDCQIVFIPQLADQ 362
>gnl|CDD|235226 PRK04148, PRK04148, hypothetical protein; Provisional.
Length = 134
Score = 29.6 bits (67), Expect = 0.46
Identities = 32/115 (27%), Positives = 50/115 (43%), Gaps = 22/115 (19%)
Query: 51 ETKLGF-LEVFKQLKLPIF--WKIDITNDPVLNAKTLPDNVFIQKWYPQTDILAHPNLRL 107
E +GF +V K+LK F IDI V AK L N F+ D L +PNL +
Sbjct: 22 ELGIGFYFKVAKKLKESGFDVIVIDINEKAVEKAKKLGLNAFV-------DDLFNPNLEI 74
Query: 108 FITHGGISS------LME-----ASSLGVPVLGVPFFGDQ-YRNMVLLRHRGYAL 150
+ I S L A + VP++ P G++ + + L+ ++G +
Sbjct: 75 YKNAKLIYSIRPPRDLQPFILELAKKINVPLIIKPLSGEEPIKELKLINYKGKPI 129
>gnl|CDD|99972 cd03799, GT1_amsK_like, This is a family of GT1
glycosyltransferases found specifically in certain
bacteria. amsK in Erwinia amylovora, has been reported
to be involved in the biosynthesis of amylovoran, a
exopolysaccharide acting as a virulence factor.
Length = 355
Score = 29.8 bits (68), Expect = 0.71
Identities = 12/52 (23%), Positives = 22/52 (42%), Gaps = 13/52 (25%)
Query: 90 IQKWYPQTDILAHPNLRLFITHGGIS-----SLMEASSLGVPVL-----GVP 131
+++ D+ P+ G LMEA ++G+PV+ G+P
Sbjct: 249 VRELLRAADLFVLPS---VTAADGDREGLPVVLMEAMAMGLPVISTDVSGIP 297
>gnl|CDD|233284 TIGR01133, murG, undecaprenyldiphospho-muramoylpentapeptide
beta-N-acetylglucosaminyltransferase. RM 8449890 RT The
final step of peptidoglycan subunit assembly in
Escherichia coli occurs in the cytoplasm. RA Bupp K, van
Heijenoort J. RL J Bacteriol 1993 Mar;175(6):1841-3
[Cell envelope, Biosynthesis and degradation of murein
sacculus and peptidoglycan].
Length = 348
Score = 29.5 bits (67), Expect = 0.85
Identities = 16/48 (33%), Positives = 25/48 (52%), Gaps = 3/48 (6%)
Query: 107 LFITHGGISSLMEASSLGVPVLGVPF---FGDQYRNMVLLRHRGYALI 151
L I+ G S++ E ++ GVP + +P+ DQY N L G L+
Sbjct: 253 LVISRAGASTVAELAAAGVPAILIPYPYAADDQYYNAKFLEDLGAGLV 300
>gnl|CDD|226807 COG4370, COG4370, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 412
Score = 28.7 bits (64), Expect = 1.4
Identities = 23/91 (25%), Positives = 34/91 (37%), Gaps = 18/91 (19%)
Query: 62 QLKLPIFWKIDITNDPVLNAKTLPDNVFIQKWYPQTDILAHPNLRLFITHGGISSLM--- 118
L +FW P+L TL + Q W P D N L+++ + ++
Sbjct: 256 MFALVVFWAAIAPELPLLLLWTLEER---QGWQPLADRFGKDNCSLWLSQQSFADILHAA 312
Query: 119 ------------EASSLGVPVLGVPFFGDQY 137
+A LG PV+G P G QY
Sbjct: 313 DAALGMAGTATEQAVGLGKPVIGFPGQGPQY 343
>gnl|CDD|236119 PRK07865, PRK07865, N-succinyldiaminopimelate aminotransferase;
Reviewed.
Length = 364
Score = 28.7 bits (65), Expect = 1.5
Identities = 11/29 (37%), Positives = 15/29 (51%)
Query: 18 DGLSDLQQRADAAKGGFVYMSFGSVVDPT 46
D L+ + A A G V +S G+ VDP
Sbjct: 13 DTLAPAKATAAAHPDGIVDLSVGTPVDPV 41
>gnl|CDD|215979 pfam00534, Glycos_transf_1, Glycosyl transferases group 1.
Mutations in this domain of human PIGA lead to disease
(Paroxysmal Nocturnal haemoglobinuria). Members of this
family transfer activated sugars to a variety of
substrates, including glycogen, Fructose-6-phosphate and
lipopolysaccharides. Members of this family transfer
UDP, ADP, GDP or CMP linked sugars. The eukaryotic
glycogen synthases may be distant members of this
family.
Length = 158
Score = 28.0 bits (63), Expect = 1.7
Identities = 16/60 (26%), Positives = 23/60 (38%), Gaps = 13/60 (21%)
Query: 84 LPDNVFIQ---------KWYPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLGVPFFG 134
L DNV + Y D+ P+ G+ L+EA + GVPV+ G
Sbjct: 57 LEDNVIFVGFVPDEDLIELYRIADLFVLPSRY---EGFGLV-LLEAMAAGVPVIATDVGG 112
>gnl|CDD|240319 PTZ00226, PTZ00226, fumarate hydratase; Provisional.
Length = 570
Score = 28.5 bits (64), Expect = 2.2
Identities = 23/76 (30%), Positives = 37/76 (48%), Gaps = 13/76 (17%)
Query: 27 ADAAK--GGFVYMSFGSVVDPTKLSEETKLGFLEV--FKQLKLPIFWKIDITNDP---VL 79
+A K GGF Y+ GS+ P + + + +EV F +L + WKI++ N P V+
Sbjct: 501 TNACKKYGGF-YL--GSIGGPAAILAKDNIKKVEVLDFPELGMEAVWKIEVENFPAFIVV 557
Query: 80 NAKTLPDNVFIQKWYP 95
+ K N F +W
Sbjct: 558 DDK---GNDFYSQWLG 570
>gnl|CDD|99980 cd03808, GT1_cap1E_like, This family is most closely related to the
GT1 family of glycosyltransferases. cap1E in
Streptococcus pneumoniae is required for the synthesis
of type 1 capsular polysaccharides.
Length = 359
Score = 27.9 bits (63), Expect = 3.1
Identities = 10/39 (25%), Positives = 18/39 (46%), Gaps = 6/39 (15%)
Query: 90 IQKWYPQTDILAHPNLRLFITHGGIS-SLMEASSLGVPV 127
+ + D+ P+ R G+ L+EA ++G PV
Sbjct: 257 VPELLAAADVFVLPSYR-----EGLPRVLLEAMAMGRPV 290
>gnl|CDD|99997 cd04946, GT1_AmsK_like, This family is most closely related to the
GT1 family of glycosyltransferases. AmsK is involved in
the biosynthesis of amylovoran, which functions as a
virulence factor. It functions as a glycosyl transferase
which transfers galactose from UDP-galactose to a
lipid-linked amylovoran-subunit precursor. The members
of this family are found mainly in bacteria and Archaea.
Length = 407
Score = 27.7 bits (62), Expect = 3.4
Identities = 15/37 (40%), Positives = 20/37 (54%), Gaps = 6/37 (16%)
Query: 113 GIS-SLMEASSLGVPVL-----GVPFFGDQYRNMVLL 143
G+ S+MEA S G+PV+ G P D N +LL
Sbjct: 322 GLPVSIMEAMSFGIPVIATNVGGTPEIVDNGGNGLLL 358
>gnl|CDD|221875 pfam12965, DUF3854, Domain of unknown function (DUF3854). A family
of uncharacterized proteins found by clustering human
gut metagenomic sequences. This domain is likely to be
related to the Toprim domain.
Length = 130
Score = 26.4 bits (59), Expect = 5.3
Identities = 11/42 (26%), Positives = 17/42 (40%), Gaps = 10/42 (23%)
Query: 93 WYPQTDILAHPNLRLFITHGG---ISSLMEASSLGVPVLGVP 131
W +L HP + + IT G + L S G + +P
Sbjct: 2 WQW---VLDHPEIPIIITEGAKKAAALL----SQGYAAIALP 36
>gnl|CDD|99971 cd03798, GT1_wlbH_like, This family is most closely related to the
GT1 family of glycosyltransferases. wlbH in Bordetella
parapertussis has been shown to be required for the
biosynthesis of a trisaccharide that, when attached to
the B. pertussis lipopolysaccharide (LPS) core (band B),
generates band A LPS.
Length = 377
Score = 26.9 bits (60), Expect = 5.9
Identities = 12/41 (29%), Positives = 20/41 (48%), Gaps = 6/41 (14%)
Query: 90 IQKWYPQTDILAHPNLRLFITHGGIS-SLMEASSLGVPVLG 129
+ +Y D+ P+LR G L+EA + G+PV+
Sbjct: 272 VPAYYAAADVFVLPSLR-----EGFGLVLLEAMACGLPVVA 307
>gnl|CDD|130682 TIGR01621, RluA-like, pseudouridine synthase Rlu family protein,
TIGR01621. This model represents a clade of sequences
within the pseudouridine synthase superfamily
(pfam00849). The superfamily includes E. coli proteins:
RluA, RluB, RluC, RluD, and RsuA. The sequences modeled
here are most closely related to RluA. Neisseria, among
those species hitting this model, does not appear to
have an RluA homolog. It is presumed that these
sequences function as pseudouridine synthases, although
perhaps with different specificity [Protein synthesis,
tRNA and rRNA base modification].
Length = 217
Score = 26.8 bits (59), Expect = 6.4
Identities = 18/53 (33%), Positives = 27/53 (50%), Gaps = 7/53 (13%)
Query: 86 DNVFIQKWYPQTDILAHPNLRLFIT--HGGISSLMEAS--SLGVPVLGVPFFG 134
+N I +++ A LRLFI H G + + + SLG P+LG P +G
Sbjct: 117 ENPAITRFF---SASAATGLRLFILKPHTGKTHQLRVAMKSLGSPILGDPLYG 166
>gnl|CDD|240646 cd12169, PGDH_like_1, Putative D-3-Phosphoglycerate Dehydrogenases.
Phosphoglycerate dehydrogenases (PGDHs) catalyze the
initial step in the biosynthesis of L-serine from
D-3-phosphoglycerate. PGDHs come in 3 distinct
structural forms, with this first group being related to
2-hydroxy acid dehydrogenases, sharing structural
similarity to formate and glycerate dehydrogenases of
the D-specific 2-hydroxyacid dehydrogenase superfamily,
which also include groups such as L-alanine
dehydrogenase and S-adenosylhomocysteine hydrolase.
Despite often low sequence identity, these proteins
typically have a characteristic arrangement of 2 similar
subdomains of the alpha/beta Rossmann fold NAD+ binding
form. The NAD+ binding domain is inserted within the
linear sequence of the mostly N-terminal catalytic
domain, which has a similar domain structure to the
internal NAD binding domain. Structurally, these domains
are connected by extended alpha helices and create a
cleft in which NAD is bound, primarily to the C-terminal
portion of the 2nd (internal) domain. Some related
proteins have similar structural subdomain but with a
tandem arrangement of the catalytic and NAD-binding
subdomains in the linear sequence. Many, not all,
members of this family are dimeric.
Length = 308
Score = 26.7 bits (60), Expect = 6.5
Identities = 11/32 (34%), Positives = 16/32 (50%), Gaps = 3/32 (9%)
Query: 103 PNLRLFITHGGISS---LMEASSLGVPVLGVP 131
PNL+L +T G ++ L A G+ V G
Sbjct: 68 PNLKLLVTTGMRNASIDLAAAKERGIVVCGTG 99
>gnl|CDD|183292 PRK11728, PRK11728, hydroxyglutarate oxidase; Provisional.
Length = 393
Score = 26.7 bits (60), Expect = 7.2
Identities = 7/27 (25%), Positives = 13/27 (48%)
Query: 141 VLLRHRGYALIEPIQTLTKQSFLKNAQ 167
+ +H L E +L+K +L+ Q
Sbjct: 301 LAQKHWRSGLGEMKNSLSKSGYLRLVQ 327
>gnl|CDD|132578 TIGR03539, DapC_actino, succinyldiaminopimelate transaminase.
This family of actinobacterial succinyldiaminopimelate
transaminase enzymes (DapC) are members of the
pfam00155 superfamily. Many of these genes appear
adjacent to other genes encoding enzymes of the lysine
biosynthesis via diaminopimelate pathway (GenProp0125).
Length = 357
Score = 26.6 bits (59), Expect = 7.4
Identities = 10/29 (34%), Positives = 16/29 (55%)
Query: 18 DGLSDLQQRADAAKGGFVYMSFGSVVDPT 46
D L+ + +A + G V +S G+ VDP
Sbjct: 7 DSLAPYKAKAASHPDGIVDLSVGTPVDPV 35
>gnl|CDD|181713 PRK09230, PRK09230, cytosine deaminase; Provisional.
Length = 426
Score = 26.6 bits (59), Expect = 8.3
Identities = 16/41 (39%), Positives = 22/41 (53%), Gaps = 9/41 (21%)
Query: 94 YPQTDILAHPNLRLFITHGGISSLMEASSLGVPVLG-VPFF 133
+PQ IL++PN G + L EA LG V+G +P F
Sbjct: 154 FPQEGILSYPN--------GEALLEEALRLGADVVGAIPHF 186
>gnl|CDD|213276 cd05908, A_NRPS_MycA_like, The adenylation domain of nonribosomal
peptide synthetases (NRPS) similar to mycosubtilin
synthase subunit A (MycA). The adenylation (A) domain
of NRPS recognizes a specific amino acid or hydroxy acid
and activates it as (amino)-acyl adenylate by hydrolysis
of ATP. The activated acyl moiety then forms thioester
to the enzyme-bound cofactor phosphopantetheine of a
peptidyl carrier protein domain. This family includes
NRPS similar to mycosubtilin synthase subunit A (MycA).
Mycosubtilin, which is characterized by a beta-amino
fatty acid moiety linked to the circular heptapeptide
Asn-Tyr-Asn-Gln-Pro-Ser-Asn, belongs to the iturin
family of lipopeptide antibiotics. The mycosubtilin
synthase subunit A (MycA) combines functional domains
derived from peptide synthetases, amino transferases,
and fatty acid synthases. Nonribosomal peptide
synthetases are large multifunction enzymes that
synthesize many therapeutically useful peptides. NRPS
has a distinct modular structure in which each module is
responsible for the recognition, activation, and, in
some cases, modification of a single amino acid residue
of the final peptide product. The modules can be
subdivided into domains that catalyze specific
biochemical reactions.
Length = 499
Score = 26.5 bits (59), Expect = 9.0
Identities = 26/81 (32%), Positives = 40/81 (49%), Gaps = 17/81 (20%)
Query: 49 SEETKLGFLEVFKQLKLPIFWKIDITNDPVLNAKTLPDN-VFIQ-----KWYPQTDILAH 102
++E KL L V+ L P IT+ L + LPD+ FIQ P+ IL H
Sbjct: 75 TDEHKLKLLRVWNILNNPFL----ITDRDSLLS--LPDDIAFIQFSSGSTGEPKGVILTH 128
Query: 103 PNLRLFITHGGISSLMEASSL 123
NL +T+ I +++EA+ +
Sbjct: 129 KNL---LTN--IEAIIEAAEI 144
>gnl|CDD|100087 cd03085, PGM1, Phosphoglucomutase 1 (PGM1) catalyzes the
bidirectional interconversion of glucose-1-phosphate
(G-1-P) and glucose-6-phosphate (G-6-P) via a glucose
1,6-diphosphate intermediate, an important metabolic
step in prokaryotes and eukaryotes. In one direction,
G-1-P produced from sucrose catabolism is converted to
G-6-P, the first intermediate in glycolysis. In the
other direction, conversion of G-6-P to G-1-P generates
a substrate for synthesis of UDP-glucose which is
required for synthesis of a variety of cellular
constituents including cell wall polymers and
glycoproteins. The PGM1 family also includes a
non-enzymatic PGM-related protein (PGM-RP) thought to
play a structural role in eukaryotes, as well as
pp63/parafusin, a phosphoglycoprotein that plays an
important role in calcium-regulated exocytosis in
ciliated protozoans. PGM1 belongs to the
alpha-D-phosphohexomutase superfamily which includes
several related enzymes that catalyze a reversible
intramolecular phosphoryl transfer on their sugar
substrates. Other members of this superfamily include
phosphoglucosamine mutase (PNGM),
phosphoacetylglucosamine mutase (PAGM), the bacterial
phosphomannomutase ManB, the bacterial
phosphoglucosamine mutase GlmM, and the bifunctional
phosphomannomutase/phosphoglucomutase (PMM/PGM). Each of
these enzymes has four domains with a centrally located
active site formed by four loops, one from each domain.
All four domains are included in this alignment model.
Length = 548
Score = 26.4 bits (59), Expect = 9.2
Identities = 15/40 (37%), Positives = 18/40 (45%), Gaps = 6/40 (15%)
Query: 103 PNLRLFITHGGISSLMEASSLGVPVLGVPFFGDQYRNMVL 142
PNL T+ L+E G P G GD RNM+L
Sbjct: 254 PNL----TYA--KDLVELMKSGEPDFGAASDGDGDRNMIL 287
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.324 0.140 0.419
Gapped
Lambda K H
0.267 0.0672 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 8,859,551
Number of extensions: 813680
Number of successful extensions: 781
Number of sequences better than 10.0: 1
Number of HSP's gapped: 766
Number of HSP's successfully gapped: 66
Length of query: 174
Length of database: 10,937,602
Length adjustment: 90
Effective length of query: 84
Effective length of database: 6,945,742
Effective search space: 583442328
Effective search space used: 583442328
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 56 (25.5 bits)