BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 038300
(401 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q5NTH0|UGAT_BELPE Cyanidin-3-O-glucoside 2-O-glucuronosyltransferase OS=Bellis
perennis GN=UGAT PE=1 SV=1
Length = 438
Score = 337 bits (863), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 184/409 (44%), Positives = 251/409 (61%), Gaps = 33/409 (8%)
Query: 3 NFHICFCSTPSILNSIKQ--LDKFSLSIQLIELHLPSLPELPPQYHTTKGLPPHLMPTLK 60
NFHI CS+ + + +K ++S SIQLIEL+LPS ELP QYHTT GLPPHL TL
Sbjct: 37 NFHIYICSSQTNMQYLKNNLTSQYSKSIQLIELNLPSSSELPLQYHTTHGLPPHLTKTLS 96
Query: 61 EAFDMASPSFFNILKNLSPDLLIYDLIQPWAPALASSLNIPAVYFLVSSAATSAFMFHAI 120
+ + + P F IL L+P L+IYD Q WAP +AS+L+IP++ L A A H
Sbjct: 97 DDYQKSGPDFETILIKLNPHLVIYDFNQLWAPEVASTLHIPSIQLLSGCVALYALDAHLY 156
Query: 121 KKNSLGDANDDDEEFPSSSIFIHDYYMKSYFSNMVESPTTKRLLQCFERSCNIVLIKSFR 180
K +++ +FP I+ + + S +E R + C RSC I+L++S
Sbjct: 157 TK----PLDENLAKFPFPEIYPKNRDIPKGGSKYIE-----RFVDCMRRSCEIILVRSTM 207
Query: 181 ELEGKYIDYLSDLIKKKVVPVGPLVQDP-VEQTDH--------EKGATEII-----HEYF 226
ELEGKYIDYLS + KKV+PVGPLVQ+ + Q DH +K + ++ EY
Sbjct: 208 ELEGKYIDYLSKTLGKKVLPVGPLVQEASLLQDDHIWIMKWLDKKEESSVVFVCFGSEYI 267
Query: 227 LSKEEMEDIALGLELSGVNFIWVVRFPCGAKVKVDEELPESFLERTKERAMVIEGWAPQM 286
LS E+EDIA GLELS V+F+W +R A F++R ++ +VI+ W PQ
Sbjct: 268 LSDNEIEDIAYGLELSQVSFVWAIRAKTSAL--------NGFIDRVGDKGLVIDKWVPQA 319
Query: 287 KILGHPSIGGFVSHCGWSSVMESMRLGVPIIAMPMHVDQPLNARLVEDVGIGLEVRRNKC 346
IL H S GGF+SHCGWSS MES+R GVPIIAMPM DQP NARL+E VG G+EV R+
Sbjct: 320 NILSHSSTGGFISHCGWSSTMESIRYGVPIIAMPMQFDQPYNARLMETVGAGIEVGRDGE 379
Query: 347 GRIQREEMARVIKEVVMEREGEKIKRKTREMGEKIKEKGEEEIEWVADE 395
GR++REE+A V+++VV+E GE I+ K +E+GE +K+ E E++ + E
Sbjct: 380 GRLKREEIAAVVRKVVVEDSGESIREKAKELGEIMKKNMEAEVDGIVIE 428
>sp|Q8GVE3|FLRT_CITMA Flavanone 7-O-glucoside 2''-O-beta-L-rhamnosyltransferase OS=Citrus
maxima GN=C12RT1 PE=1 SV=2
Length = 452
Score = 331 bits (848), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 188/423 (44%), Positives = 261/423 (61%), Gaps = 37/423 (8%)
Query: 3 NFHICFCSTPSILNSI-KQLDK-FSLSIQLIELHLP-SLPELPPQYHTTKGLPPHLMPTL 59
NFHI FCSTP+ L S + ++K FS SIQLIEL LP + PELP Q TTK LPPHL+ TL
Sbjct: 36 NFHIYFCSTPNNLQSFGRNVEKNFSSSIQLIELQLPNTFPELPSQNQTTKNLPPHLIYTL 95
Query: 60 KEAFDMASPSFFNILKNLSPDLLIYDLIQPWAPALASSLNIPAVYFLVSSAATSAFMFHA 119
AF+ A P+F NIL+ L P L++YDL QPWA A +I A+ FL SA +F+ H
Sbjct: 96 VGAFEDAKPAFCNILETLKPTLVMYDLFQPWAAEAAYQYDIAAILFLPLSAVACSFLLHN 155
Query: 120 IKKNSLGDANDDDEEFPSSSIFIHDYYMK-----SYFSNMVESPTTK--RLLQCFERSCN 172
I SL ++P F DY + +YF ++ + T R L+ FE SC
Sbjct: 156 IVNPSL--------KYP---FFESDYQDRESKNINYFLHLTANGTLNKDRFLKAFELSCK 204
Query: 173 IVLIKSFRELEGKYIDYLSDLIKKKVVPVGPLVQDPVEQTD--------HEKGATEIIH- 223
V IK+ RE+E KY+DY L+ +++PVGPL+Q+P + D +K +++
Sbjct: 205 FVFIKTSREIESKYLDYFPSLMGNEIIPVGPLIQEPTFKEDDTKIMDWLSQKEPRSVVYA 264
Query: 224 ----EYFLSKEEMEDIALGLELSGVNFIWVVRFPCGAKVKVDEELPESFLERTK--ERAM 277
EYF SK+E+ +IA GL LS VNFIW R K+ ++E LP+ F E + + M
Sbjct: 265 SFGSEYFPSKDEIHEIASGLLLSEVNFIWAFRLHPDEKMTIEEALPQGFAEEIERNNKGM 324
Query: 278 VIEGWAPQMKILGHPSIGGFVSHCGWSSVMESMRLGVPIIAMPMHVDQPLNARLVEDVGI 337
+++GW PQ KIL H SIGGF+SHCGW SV+E M GVPII +PM +QP NA++V D G+
Sbjct: 325 IVQGWVPQAKILRHGSIGGFLSHCGWGSVVEGMVFGVPIIGVPMAYEQPSNAKVVVDNGM 384
Query: 338 GLEVRRNKCG-RIQREEMARVIKEVVMEREGEKIKRKTREMGEKIKEKGEEEIEWVADEL 396
G+ V R+K R+ EE+ARVIK VV++ E ++I+RK E+ E +K+ G+ E+ V ++L
Sbjct: 385 GMVVPRDKINQRLGGEEVARVIKHVVLQEEAKQIRRKANEISESMKKIGDAEMSVVVEKL 444
Query: 397 IHL 399
+ L
Sbjct: 445 LQL 447
>sp|Q66PF2|URT1_FRAAN Putative UDP-rhamnose:rhamnosyltransferase 1 OS=Fragaria ananassa
GN=GT4 PE=2 SV=1
Length = 478
Score = 190 bits (483), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 126/417 (30%), Positives = 205/417 (49%), Gaps = 40/417 (9%)
Query: 6 ICFCSTPSILNSIKQL-DKFSLSIQLIELHLPSLPELPPQYHTTKGLPPHLMPTLKEAFD 64
+ F STP + + ++ + + I L+++ LP + LP T +P ++P LK A D
Sbjct: 42 VSFISTPRNIQRLPKIPETLTPLINLVQIPLPHVENLPENAEATMDVPHDVIPYLKIAHD 101
Query: 65 MASPSFFNILKNLSPDLLIYDLIQPWAPALASSLNIPAVYFLVSSAATSAFMFHAIKKN- 123
L+ SPD +I+D W P +A+ L I +F + +A++ F F + N
Sbjct: 102 GLEQGISEFLQAQSPDWIIHDFAPHWLPPIATKLGISNAHFSIFNASSMCF-FGSTSPNR 160
Query: 124 --------SLGDANDDDEEFPSSSIFIHDYYMKSYFSNMVESPTTKRLLQCFE-----RS 170
L E P S H + + +P + F +
Sbjct: 161 VSRYAPRKKLEQFTSPPEWIPFPSKIYHRPFEAKRLMDGTLTPNASGVTDRFRLESTIQG 220
Query: 171 CNIVLIKSFRELEGKYIDYLSDLIKKKVVPVGPLVQDPVEQTDHEKGAT----------- 219
C + I+S RE+EG+++D L DL +K +V L+ + ++D + G
Sbjct: 221 CQVYFIRSCREIEGEWLDLLEDLHEKPIVLPTGLLPPSLPRSDEDGGKDSNWSKIAVWLD 280
Query: 220 -----EIIHEYF-----LSKEEMEDIALGLELSGVNFIWVVRFPCGAKVKVDE-ELPESF 268
++++ F LS+E ++ALGLELSG+ F WV+R P D +LP+ F
Sbjct: 281 KQEKGKVVYAAFGSELNLSQEVFNELALGLELSGLPFFWVLRKPSHGSGDGDSVKLPDGF 340
Query: 269 LERTKERAMVIEGWAPQMKILGHPSIGGFVSHCGWSSVMESMRLGVPIIAMPMHVDQPLN 328
+R K R +V WAPQ+KIL H S+GGF++HCGWSS++ES++ G P+I +P DQ L
Sbjct: 341 EDRVKGRGLVWTTWAPQLKILSHESVGGFLTHCGWSSIIESLQYGCPLIMLPFMYDQGLI 400
Query: 329 ARLVEDVGIGLEVRRN-KCGRIQREEMARVIKEVVMEREGEKIKRKTREMGEKIKEK 384
AR D IG EV R+ + G R E+A +K +V++ EG++ + E + ++K
Sbjct: 401 ARFW-DNKIGAEVPRDEETGWFTRNELANSLKLIVVDEEGKQYRDGANEYSKLFRDK 456
>sp|Q43716|UFOG_PETHY Anthocyanidin 3-O-glucosyltransferase OS=Petunia hybrida GN=RT PE=2
SV=1
Length = 473
Score = 189 bits (480), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 124/405 (30%), Positives = 208/405 (51%), Gaps = 28/405 (6%)
Query: 6 ICFCSTPSILNSIKQLDKFSLSIQLIELHLPSLPELPPQYHTTKGLPPHLMPTLKEAFDM 65
+ F + + +K + + + ++ L LP + LPP +T L P LK A D+
Sbjct: 42 VSFFTASGNASRVKSMLNSAPTTHIVPLTLPHVEGLPPGAESTAELTPASAELLKVALDL 101
Query: 66 ASPSFFNILKNLSPDLLIYDLIQPWAPALASSLNIPAVYFLVSSAATSAFM------FHA 119
P +L +L P +++D Q W P +A+ L I VY+ V A ++AF+
Sbjct: 102 MQPQIKTLLSHLKPHFVLFDFAQEWLPKMANGLGIKTVYYSVVVALSTAFLTCPARVLEP 161
Query: 120 IKKNSLGDANDDDEEFPSSSIF-IHDYYMKSY---FSNMVESPTTKRLLQCFERSCNIVL 175
K SL D FP +S+ + + + + F + PT +Q R C+ +L
Sbjct: 162 KKYPSLEDMKKPPLGFPQTSVTSVRTFEARDFLYVFKSFHNGPTLYDRIQSGLRGCSAIL 221
Query: 176 IKSFRELEGKYIDYLSDLIKKKVVPVGPLVQDPVEQTDHEKGAT--------EIIH---- 223
K+ ++EG YI Y+ K V +GP+V DP EK AT +I+
Sbjct: 222 AKTCSQMEGPYIKYVEAQFNKPVFLIGPVVPDPPSGKLEEKWATWLNKFEGGTVIYCSFG 281
Query: 224 -EYFLSKEEMEDIALGLELSGVNFIWVVRFPCGAKV--KVDEELPESFLERTKERAMVIE 280
E FL+ ++++++ALGLE +G+ F V+ FP V +++ LPE FLER K++ ++
Sbjct: 282 SETFLTDDQVKELALGLEQTGLPFFLVLNFPANVDVSAELNRALPEGFLERVKDKGIIHS 341
Query: 281 GWAPQMKILGHPSIGGFVSHCGWSSVMESMRLGVPIIAMPMHVDQPLNARLVE-DVGIGL 339
GW Q IL H S+G +V H G+SSV+E++ ++ +P DQ LNA+LV D+ G+
Sbjct: 342 GWVQQQNILAHSSVGCYVCHAGFSSVIEALVNDCQVVMLPQKGDQILNAKLVSGDMEAGV 401
Query: 340 EV-RRNKCGRIQREEMARVIKEVVMEREGEKIKRKTREMGEKIKE 383
E+ RR++ G +E++ +++V+++ E + K RE +K KE
Sbjct: 402 EINRRDEDGYFGKEDIKEAVEKVMVDVEKDPGKL-IRENQKKWKE 445
>sp|D4Q9Z5|SGT3_SOYBN Soyasaponin III rhamnosyltransferase OS=Glycine max GN=GmSGT3 PE=1
SV=1
Length = 472
Score = 179 bits (455), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 124/404 (30%), Positives = 211/404 (52%), Gaps = 42/404 (10%)
Query: 13 SILNSIKQLDKFSLS-------IQLIELHLPSLPELPPQYHTTKGLPPHLMPTLKEAFDM 65
+ +NS K +D+ + I+L++L LP + LP +T +P LK+A++
Sbjct: 46 TFINSPKNIDRMPKTPKHLEPFIKLVKLPLPKIEHLPEGAESTMDIPSKKNCFLKKAYEG 105
Query: 66 ASPSFFNILKNLSPDLLIYDLIQPWAPALASSLNIPAVYFLVSSAATSAFM---FHAIKK 122
+ +LK +PD ++YD W +A S NIP ++ ++ A F +K
Sbjct: 106 LQYAVSKLLKTSNPDWVLYDFAAAWVIPIAKSYNIPCAHYNITPAFNKVFFDPPKDKMKD 165
Query: 123 NSLGDANDDDEEFP-SSSIFIHDYYMKSYFSNMVESPTTKRL---LQCFERSCNIVLIKS 178
SL P +++I I Y + + T +R L SC++ L+++
Sbjct: 166 YSLASICGPPTWLPFTTTIHIRPYEFLRAYEGTKDEETGERASFDLNKAYSSCDLFLLRT 225
Query: 179 FRELEGKYIDYLSDLIKKKVVPVGPL-----VQDPVEQTDHE------------KGATEI 221
RELEG ++DYL+ K VVPVG L ++D VE+ D+ + ++ +
Sbjct: 226 SRELEGDWLDYLAGNYKVPVVPVGLLPPSMQIRD-VEEEDNNPDWVRIKDWLDTQESSSV 284
Query: 222 IH-----EYFLSKEEMEDIALGLELSGVNFIWVVRFPCGAKVKVDEELPESFLERTKERA 276
++ E LS+E++ ++A G+ELS + F W ++ K V ELPE F ERTKER
Sbjct: 285 VYIGFGSELKLSQEDLTELAHGIELSNLPFFWALK---NLKEGV-LELPEGFEERTKERG 340
Query: 277 MVIEGWAPQMKILGHPSIGGFVSHCGWSSVMESMRLGVPIIAMPMHVDQPLNARLVEDVG 336
+V + WAPQ+KIL H +IGG +SHCG SV+E + G ++ +P +DQ L +R++E+
Sbjct: 341 IVWKTWAPQLKILAHGAIGGCMSHCGSGSVIEKVHFGHVLVTLPYLLDQCLFSRVLEEKQ 400
Query: 337 IGLEV-RRNKCGRIQREEMARVIKEVVMEREGEKIKRKTREMGE 379
+ +EV R K G R ++A+ ++ +++ EG ++ +EMG+
Sbjct: 401 VAVEVPRSEKDGSFTRVDVAKTLRFAIVDEEGSALRENAKEMGK 444
>sp|Q9LTA3|U91C1_ARATH UDP-glycosyltransferase 91C1 OS=Arabidopsis thaliana GN=UGT91C1
PE=2 SV=1
Length = 460
Score = 179 bits (454), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 128/425 (30%), Positives = 210/425 (49%), Gaps = 44/425 (10%)
Query: 6 ICFCSTPSILNSIKQLD-KFSLSIQLIELHLPSLPELPPQYHTTKGLPPHLMPTLKEAFD 64
I F STP + + +L + SI + LP + LPP ++ +P + +LK AFD
Sbjct: 39 ISFISTPRNIERLPKLQSNLASSITFVSFPLPPISGLPPSSESSMDVPYNKQQSLKAAFD 98
Query: 65 MASPSFFNILKNLSPDLLIYDLIQPWAPALASSLNIPAVYFLVSSAATSAFMFHAIKKNS 124
+ P L+ SPD +IYD W P++A+ L I +F + +AAT FM +
Sbjct: 99 LLQPPLKEFLRRSSPDWIIYDYASHWLPSIAAELGISKAFFSLFNAATLCFM--GPSSSL 156
Query: 125 LGDANDDDEEF-------PSSSIFIHDYYMKSYFSNMVESPTTK-----RLLQCFERSCN 172
+ + E+F P S + Y+ + + E T R + S +
Sbjct: 157 IEEIRSTPEDFTVVPPWVPFKSNIVFRYHEVTRYVEKTEEDVTGVSDSVRFGYSIDES-D 215
Query: 173 IVLIKSFRELEGKYIDYLSDLIKKKVVPVG---PLVQDP---------VEQTDHEKGATE 220
V ++S E E ++ L DL +K V P+G P+++D +++ ++
Sbjct: 216 AVFVRSCPEFEPEWFGLLKDLYRKPVFPIGFLPPVIEDDDAVDTTWVRIKKWLDKQRLNS 275
Query: 221 IIH-----EYFLSKEEMEDIALGLELSGVNFIWVVRFPCGAKVKVDEELPESFLERTKER 275
+++ E L EE+ ++ALGLE S F WV+R + ++P+ F R K R
Sbjct: 276 VVYVSLGTEASLRHEEVTELALGLEKSETPFFWVLR--------NEPKIPDGFKTRVKGR 327
Query: 276 AMVIEGWAPQMKILGHPSIGGFVSHCGWSSVMESMRLGVPIIAMPMHVDQPLNARLVEDV 335
MV GW PQ+KIL H S+GGF++HCGW+SV+E + G I P+ +Q LN RL+
Sbjct: 328 GMVHVGWVPQVKILSHESVGGFLTHCGWNSVVEGLGFGKVPIFFPVLNEQGLNTRLLHGK 387
Query: 336 GIGLEVRRN-KCGRIQREEMARVIKEVVMEREGEKIKRKTREMGEKIKEKGEEEIEWVAD 394
G+G+EV R+ + G + +A I+ V+++ GE+I+ K + M + +E I +V D
Sbjct: 388 GLGVEVSRDERDGSFDSDSVADSIRLVMIDDAGEEIRAKAKVMKDLFGNM-DENIRYV-D 445
Query: 395 ELIHL 399
EL+
Sbjct: 446 ELVRF 450
>sp|Q9LSM0|U91B1_ARATH UDP-glycosyltransferase 91B1 OS=Arabidopsis thaliana GN=UGT91B1
PE=2 SV=1
Length = 466
Score = 175 bits (444), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 122/409 (29%), Positives = 192/409 (46%), Gaps = 38/409 (9%)
Query: 6 ICFCSTPSILNSIKQLDKFSLSIQLIELHLP-SLPELPPQYHTTKGLPPHLMPTLKEAFD 64
+ F ST ++ + + LS+ + L L ++ LP T +P + LK+AFD
Sbjct: 38 VSFISTARNISRLPNISS-DLSVNFVSLPLSQTVDHLPENAEATTDVPETHIAYLKKAFD 96
Query: 65 MASPSFFNILKNLSPDLLIYDLIQPWAPALASSLNIPAVYFLVSSAATSAFMFHAIKKNS 124
S +F L+ P+ ++YD++ W P +A L + F +AA+ +
Sbjct: 97 GLSEAFTEFLEASKPNWIVYDILHHWVPPIAEKLGVRRAIFCTFNAASIIIIGGPASVMI 156
Query: 125 LG-DANDDDEEF----PSSSIFIHDYYMKSYFSNMVESPTTKRLLQCFERSCN------- 172
G D E+ P + Y ++E PT +C
Sbjct: 157 QGHDPRKTAEDLIVPPPWVPFETNIVYRLFEAKRIMEYPTAGVTGVELNDNCRLGLAYVG 216
Query: 173 --IVLIKSFRELEGKYIDYLSDLIKKKVVPVGPLVQDPVEQTDHEKG------------A 218
+++I+S ELE ++I LS L K V+P+G L P++ D E A
Sbjct: 217 SEVIVIRSCMELEPEWIQLLSKLQGKPVIPIGLLPATPMDDADDEGTWLDIREWLDRHQA 276
Query: 219 TEIIH-----EYFLSKEEMEDIALGLELSGVNFIWVVRFPCGAKVKVDEELPESFLERTK 273
+++ E +S EE++ +A GLEL + F W +R + + LP+ F ER K
Sbjct: 277 KSVVYVALGTEVTISNEEIQGLAHGLELCRLPFFWTLR----KRTRASMLLPDGFKERVK 332
Query: 274 ERAMVIEGWAPQMKILGHPSIGGFVSHCGWSSVMESMRLGVPIIAMPMHVDQPLNARLVE 333
ER ++ W PQ KIL H S+GGFV+HCGW S +E + GVP+I P ++DQPL ARL+
Sbjct: 333 ERGVIWTEWVPQTKILSHGSVGGFVTHCGWGSAVEGLSFGVPLIMFPCNLDQPLVARLLS 392
Query: 334 DVGIGLEVRRN-KCGRIQREEMARVIKEVVMEREGEKIKRKTREMGEKI 381
+ IGLE+ RN + G +A I+ VV+E EG+ + +KI
Sbjct: 393 GMNIGLEIPRNERDGLFTSASVAETIRHVVVEEEGKIYRNNAASQQKKI 441
>sp|Q9FN26|U79B6_ARATH UDP-glycosyltransferase 79B6 OS=Arabidopsis thaliana GN=UGT79B6
PE=2 SV=1
Length = 453
Score = 159 bits (402), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 109/372 (29%), Positives = 189/372 (50%), Gaps = 28/372 (7%)
Query: 19 KQLDKFSL---SIQLIELHLPSLPELPPQYHTTKGLPPHLMPTLKEAFDMASPSFFNILK 75
KQL+ +L I L +PS+ LP TT +P L L A D +
Sbjct: 45 KQLESLNLFPDCIVFQTLTIPSVDGLPDGAETTSDIPISLGSFLASAMDRTRIQVKEAVS 104
Query: 76 NLSPDLLIYDLIQPWAPALASSLNIPAVYFLVSSAATSAFMFHAIKKNSLGDANDDDEEF 135
PDL+ +D W P +A + +V F+ SAA A F + S D +
Sbjct: 105 VGKPDLIFFDFAH-WIPEIAREYGVKSVNFITISAACVAISF--VPGRSQDDLGSTPPGY 161
Query: 136 PSSSIFI--HDYYMKSYFSNMVESPTT--KRLLQCFERSCNIVLIKSFRELEGKYIDYLS 191
PSS + + H+ S+ S T+ +R++ ++C+++ I++ +E+EGK+ D++
Sbjct: 162 PSSKVLLRGHETNSLSFLSYPFGDGTSFYERIMIGL-KNCDVISIRTCQEMEGKFCDFIE 220
Query: 192 DLIKKKVVPVGPLVQDPVEQTDHEKGATEIIHEY--------------FLSKEEMEDIAL 237
+ ++KV+ GP++ +P E + + ++ L K++ +++ L
Sbjct: 221 NQFQRKVLLTGPMLPEPDNSKPLEDQWRQWLSKFDPGSVIYCALGSQIILEKDQFQELCL 280
Query: 238 GLELSGVNFIWVVRFPCGAKVKVDEELPESFLERTKERAMVIEGWAPQMKILGHPSIGGF 297
G+EL+G+ F+ V+ P G+ + E LP+ F ER K R +V GW Q IL HPSIG F
Sbjct: 281 GMELTGLPFLVAVKPPKGSST-IQEALPKGFEERVKARGVVWGGWVQQPLILAHPSIGCF 339
Query: 298 VSHCGWSSVMESMRLGVPIIAMPMHVDQPLNARLV-EDVGIGLEVRRNKCGRIQREEMAR 356
VSHCG+ S+ E++ I+ +P +Q LN RL+ E++ + +EV+R + G +E ++
Sbjct: 340 VSHCGFGSMWEALVNDCQIVFIPHLGEQILNTRLMSEELKVSVEVKREETGWFSKESLSG 399
Query: 357 VIKEVVMEREGE 368
++ VM+R+ E
Sbjct: 400 AVRS-VMDRDSE 410
>sp|Q9LJA6|U79B4_ARATH UDP-glycosyltransferase 79B4 OS=Arabidopsis thaliana GN=UGT79B4
PE=2 SV=1
Length = 448
Score = 157 bits (398), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 116/408 (28%), Positives = 202/408 (49%), Gaps = 43/408 (10%)
Query: 6 ICFCSTPSILNSIKQLDKFSLSIQLIELHLPSLPELPPQYHTTKGLPPHLMPTLKEAFDM 65
+ F + ++ L+ F SI + LP + LP TT LP L +A D+
Sbjct: 35 VTFLAPKKAQKQLEPLNLFPNSIHFENVTLPHVDGLPVGAETTADLPNSSKRVLADAMDL 94
Query: 66 ASPSFFNILKNLSPDLLIYDLIQPWAPALASSLNIPAVYFLVSSAATSAFMFHAIKKNSL 125
+++L PDL+ +D + W P +A L I +V + + SAA A F + L
Sbjct: 95 LREQIEVKIRSLKPDLIFFDFVD-WIPQMAKELGIKSVSYQIISAAFIAMFF--APRAEL 151
Query: 126 GDANDDDEEFPSSSIFI--HDYYMKSYFSNMVESPTTKRLLQCFER------SCNIVLIK 177
G FPSS + + HD + S F+N T++ L F+R +C+++ I+
Sbjct: 152 GSPPPG---FPSSKVALRGHDANIYSLFAN------TRKFL--FDRVTTGLKNCDVIAIR 200
Query: 178 SFRELEGKYIDYLSDLIKKKVVPVGPLVQDPVEQTD-----------HEKGATEIIH--- 223
+ E+EG D++ ++KV+ GP+ DP ++ + + +++
Sbjct: 201 TCAEIEGNLCDFIERQCQRKVLLTGPMFLDPQGKSGKPLEDRWNNWLNGFEPSSVVYCAF 260
Query: 224 --EYFLSKEEMEDIALGLELSGVNFIWVVRFPCGAKVKVDEELPESFLERTKERAMVIEG 281
+F ++ +++ LG+EL+G+ F+ V P G+ + E LPE F ER K R +V G
Sbjct: 261 GTHFFFEIDQFQELCLGMELTGLPFLVAVMPPRGSST-IQEALPEGFEERIKGRGIVWGG 319
Query: 282 WAPQMKILGHPSIGGFVSHCGWSSVMESMRLGVPIIAMPMHVDQPLNARLV-EDVGIGLE 340
W Q IL HPSIG FV+HCG+ S+ ES+ I+ +P VDQ L RL+ E++ + ++
Sbjct: 320 WVEQPLILSHPSIGCFVNHCGFGSMWESLVSDCQIVFIPQLVDQVLTTRLLTEELEVSVK 379
Query: 341 VRRNK-CGRIQREEMARVIKEVVMERE--GEKIKRKTREMGEKIKEKG 385
V+R++ G +E + +K V+ + G ++R +++ E + G
Sbjct: 380 VKRDEITGWFSKESLRDTVKSVMDKNSEIGNLVRRNHKKLKETLVSPG 427
>sp|Q9T081|U79B3_ARATH UDP-glycosyltransferase 79B3 OS=Arabidopsis thaliana GN=UGT79B3
PE=2 SV=1
Length = 453
Score = 157 bits (396), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 113/382 (29%), Positives = 186/382 (48%), Gaps = 43/382 (11%)
Query: 16 NSIKQLDKFSL---SIQLIELHLPSLPELPPQYHTTKGLPPHLMPTLKEAFDMASPSFFN 72
S+KQL+ F+L +I + +P + LP T +P L A D+
Sbjct: 43 KSLKQLEHFNLFPHNIVFRSVTVPHVDGLPVGTETASEIPVTSTDLLMSAMDLTRDQVEA 102
Query: 73 ILKNLSPDLLIYDLIQPWAPALASSLNIPAVYFLVSSAATSAFMFHAIKKNSLGDANDDD 132
+++ + PDL+ +D W P +A + V ++V SA+T A M + LG
Sbjct: 103 VVRAVEPDLIFFDFAH-WIPEVARDFGLKTVKYVVVSASTIASML--VPGGELGVPPPG- 158
Query: 133 EEFPSSSIFIHD---YYMK--------SYFSNMVESPTTKRLLQCFERSCNIVLIKSFRE 181
+PSS + + Y MK N++E TT + + +++ I++ RE
Sbjct: 159 --YPSSKVLLRKQDAYTMKKLEPTNTIDVGPNLLERVTTSLM------NSDVIAIRTARE 210
Query: 182 LEGKYIDYLSDLIKKKVVPVGPLVQDPVEQTDHEKGATEIIHEY--------------FL 227
+EG + DY+ +KKV+ GP+ +P + + E+ + + Y L
Sbjct: 211 IEGNFCDYIEKHCRKKVLLTGPVFPEPDKTRELEERWVKWLSGYEPDSVVFCALGSQVIL 270
Query: 228 SKEEMEDIALGLELSGVNFIWVVRFPCGAKVKVDEELPESFLERTKERAMVIEGWAPQMK 287
K++ +++ LG+EL+G F+ V+ P G+ + E LPE F ER K R +V GW Q
Sbjct: 271 EKDQFQELCLGMELTGSPFLVAVKPPRGSST-IQEALPEGFEERVKGRGLVWGGWVQQPL 329
Query: 288 ILGHPSIGGFVSHCGWSSVMESMRLGVPIIAMPMHVDQPLNARLVED-VGIGLEVRRNKC 346
IL HPS+G FVSHCG+ S+ ES+ I+ +P DQ LN RL+ D + + +EV R +
Sbjct: 330 ILSHPSVGCFVSHCGFGSMWESLLSDCQIVLVPQLGDQVLNTRLLSDELKVSVEVAREET 389
Query: 347 GRIQREEMARVIKEVVMEREGE 368
G +E + + VM+R+ E
Sbjct: 390 GWFSKESLCDAVNS-VMKRDSE 410
>sp|Q940V3|U91A1_ARATH UDP-glycosyltransferase 91A1 OS=Arabidopsis thaliana GN=UGT91A1
PE=2 SV=1
Length = 470
Score = 155 bits (393), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 115/404 (28%), Positives = 193/404 (47%), Gaps = 34/404 (8%)
Query: 6 ICFCSTPSILNSI--KQLDKFSLSIQLIELHLP-SLPELPPQYHTTKGLPPHLMPTLKEA 62
+ F STP ++ + + + S I ++L LP +LP T +P L+P LK A
Sbjct: 44 VSFISTPRNIDRLLPRLPENLSSVINFVKLSLPVGDNKLPEDGEATTDVPFELIPYLKIA 103
Query: 63 FDMASPSFFNILKNLSPDLLIYDLIQPWAPALASSLNIPAVYFLVSSAATSAFM----FH 118
+D L++ PD ++ D W P ++ L I +F + AT + F
Sbjct: 104 YDGLKVPVTEFLESSKPDWVLQDFAGFWLPPISRRLGIKTGFFSAFNGATLGILKPPGFE 163
Query: 119 AIKKNSLGDANDDDEEFPSSS-----IFIHDYYMKSYFSNMVES--PTTKRLLQCFERSC 171
+ S D + P + +F + K + + E P R+ + C
Sbjct: 164 EYR-TSPADFMKPPKWVPFETSVAFKLFECRFIFKGFMAETTEGNVPDIHRVGGVID-GC 221
Query: 172 NIVLIKSFRELEGKYIDYLSDLIKKKVVPVGPLVQDP---VEQTD---------HEKGAT 219
+++ ++S E E +++ +L +K V+PVG L P E TD + +
Sbjct: 222 DVIFVRSCYEYEAEWLGLTQELHRKPVIPVGVLPPKPDEKFEDTDTWLSVKKWLDSRKSK 281
Query: 220 EIIHEYFLS-----KEEMEDIALGLELSGVNFIWVVRFPCGAKVKVDEELPESFLERTKE 274
I++ F S + E+ +IALGLELSG+ F WV++ G ELPE F ERT +
Sbjct: 282 SIVYVAFGSEAKPSQTELNEIALGLELSGLPFFWVLKTRRGPWDTEPVELPEGFEERTAD 341
Query: 275 RAMVIEGWAPQMKILGHPSIGGFVSHCGWSSVMESMRLGVPIIAMPMHVDQPLNARLVED 334
R MV GW Q++ L H SIG ++H GW +++E++R P+ + DQ LNAR++E+
Sbjct: 342 RGMVWRGWVEQLRTLSHDSIGLVLTHPGWGTIIEAIRFAKPMAMLVFVYDQGLNARVIEE 401
Query: 335 VGIGLEVRRNKC-GRIQREEMARVIKEVVMEREGEKIKRKTREM 377
IG + R++ G +E +A ++ V++E EG+ + +EM
Sbjct: 402 KKIGYMIPRDETEGFFTKESVANSLRLVMVEEEGKVYRENVKEM 445
>sp|Q9M0P3|U79B7_ARATH UDP-glycosyltransferase 79B7 OS=Arabidopsis thaliana GN=UGT79B7
PE=2 SV=1
Length = 442
Score = 154 bits (390), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 117/388 (30%), Positives = 189/388 (48%), Gaps = 37/388 (9%)
Query: 19 KQLDKFSL---SIQLIELHLPSLPELPPQYHTTKGLPPHLMPTLKEAFDMASPSFFNILK 75
KQL+ +L SI L +P + LP TT +P L L +A D+ ++
Sbjct: 45 KQLEHHNLFPDSIVFHPLTVPPVNGLPAGAETTSDIPISLDNLLSKALDLTRDQVEAAVR 104
Query: 76 NLSPDLLIYDLIQPWAPALASSLNIPAVYFLVSSAATSAFMFHAIKKNSLGDANDDDEEF 135
L PDL+ +D Q W P +A I +V +++ SA T A + LG +
Sbjct: 105 ALRPDLIFFDFAQ-WIPDMAKEHMIKSVSYIIVSATTIAHT--HVPGGKLGVRPPG---Y 158
Query: 136 PSSSIF-----IHDYYMKSYFSNMVESPTTKRLLQCFERSCNIVLIKSFRELEGKYIDYL 190
PSS + +H S F + T L +SC+++ +++ +E+EG + D++
Sbjct: 159 PSSKVMFRENDVHALATLSIFYKRLYHQITTGL-----KSCDVIALRTCKEVEGMFCDFI 213
Query: 191 SDLIKKKVVPVGPLVQDPVEQTDHEKGATEIIHEY--------------FLSKEEMEDIA 236
S KKV+ GP+ +P E+ + + L K++ +++
Sbjct: 214 SRQYHKKVLLTGPMFPEPDTSKPLEERWNHFLSGFAPKSVVFCSPGSQVILEKDQFQELC 273
Query: 237 LGLELSGVNFIWVVRFPCGAKVKVDEELPESFLERTKERAMVIEGWAPQMKILGHPSIGG 296
LG+EL+G+ F+ V+ P G+ V E LPE F ER K+R +V GW Q IL HPSIG
Sbjct: 274 LGMELTGLPFLLAVKPPRGSST-VQEGLPEGFEERVKDRGVVWGGWVQQPLILAHPSIGC 332
Query: 297 FVSHCGWSSVMESMRLGVPIIAMPMHVDQPLNARLV-EDVGIGLEVRRNKCGRIQREEMA 355
FV+HCG ++ ES+ ++ +P DQ L RL+ E+ + +EV R K G +E ++
Sbjct: 333 FVNHCGPGTIWESLVSDCQMVLIPFLSDQVLFTRLMTEEFEVSVEVPREKTGWFSKESLS 392
Query: 356 RVIKEVVMEREGEKIKRKTREMGEKIKE 383
IK VM+++ + I + R K+KE
Sbjct: 393 NAIKS-VMDKDSD-IGKLVRSNHTKLKE 418
>sp|O81010|U79B8_ARATH UDP-glycosyltransferase 79B8 OS=Arabidopsis thaliana GN=UGT79B8
PE=2 SV=1
Length = 442
Score = 152 bits (385), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 115/386 (29%), Positives = 186/386 (48%), Gaps = 33/386 (8%)
Query: 19 KQLDKFSL---SIQLIELHLPSLPELPPQYHTTKGLPPHLMPTLKEAFDMASPSFFNILK 75
KQL+ +L SI L +P + LP TT + + L EA D+ ++
Sbjct: 45 KQLEHHNLFPDSIVFHPLTIPHVNGLPAGAETTSDISISMDNLLSEALDLTRDQVEAAVR 104
Query: 76 NLSPDLLIYDLIQPWAPALASSLNIPAVYFLVSSAATSAFMFHAIKKNSLGDANDDDEEF 135
L PDL+ +D W P +A I +V +++ SA T A+ F G +
Sbjct: 105 ALRPDLIFFDFAH-WIPEIAKEHMIKSVSYMIVSATTIAYTF-----APGGVLGVPPPGY 158
Query: 136 PSSSIFIHDYYMKSYFSNMVESPTTKRLLQCFE---RSCNIVLIKSFRELEGKYIDYLSD 192
PSS + Y + S KRL +SC+I+ +++ E+EGK+ DY+S
Sbjct: 159 PSSKVL---YRENDAHALATLSIFYKRLYHQITTGFKSCDIIALRTCNEIEGKFCDYISS 215
Query: 193 LIKKKVVPVGPLVQDPVEQTDHEKGATEIIHEY--------------FLSKEEMEDIALG 238
KKV+ GP++ + E+ + + + L K++ +++ LG
Sbjct: 216 QYHKKVLLTGPMLPEQDTSKPLEEQLSHFLSRFPPRSVVFCALGSQIVLEKDQFQELCLG 275
Query: 239 LELSGVNFIWVVRFPCGAKVKVDEELPESFLERTKERAMVIEGWAPQMKILGHPSIGGFV 298
+EL+G+ F+ V+ P G+ V+E LPE F ER K R +V GW Q IL HPSIG FV
Sbjct: 276 MELTGLPFLIAVKPPRGSST-VEEGLPEGFQERVKGRGVVWGGWVQQPLILDHPSIGCFV 334
Query: 299 SHCGWSSVMESMRLGVPIIAMPMHVDQPLNARLV-EDVGIGLEVRRNKCGRIQREEMARV 357
+HCG ++ E + ++ +P DQ L RL+ E+ + +EV R K G +E ++
Sbjct: 335 NHCGPGTIWECLMTDCQMVLLPFLGDQVLFTRLMTEEFKVSVEVSREKTGWFSKESLSDA 394
Query: 358 IKEVVMEREGEKIKRKTREMGEKIKE 383
IK VM+++ + + + R K+KE
Sbjct: 395 IKS-VMDKDSD-LGKLVRSNHAKLKE 418
>sp|Q9XIQ5|U7B10_ARATH UDP-glycosyltransferase 79B10 OS=Arabidopsis thaliana GN=UGT79B10
PE=2 SV=1
Length = 447
Score = 152 bits (384), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 110/381 (28%), Positives = 185/381 (48%), Gaps = 24/381 (6%)
Query: 18 IKQLDKFSLSIQLIELHLPSLPELPPQYHTTKGLPPHLMPTLKEAFDMASPSFFNILKNL 77
++ L+ F SI L +P + LP T +P L L A D+ + L
Sbjct: 47 LEHLNLFPDSIVFHSLTIPHVDGLPAGAETFSDIPMPLWKFLPPAIDLTRDQVEAAVSAL 106
Query: 78 SPDLLIYDLIQPWAPALASSLNIPAVYFLVSSAATSAFMFHAIKKNSLGDANDDDEEFPS 137
SPDL+++D I W P +A + ++ + + SA + A F + LG +PS
Sbjct: 107 SPDLILFD-IASWVPEVAKEYRVKSMLYNIISATSIAHDF--VPGGELGVPPPG---YPS 160
Query: 138 SSIFIHDYYMKSYFSNMVESPTTKRLLQCFERSCNIVLIKSFRELEGKYIDYLSDLIKKK 197
S + + + S V L +C+ + I++ +E+EGK+ +YL KK
Sbjct: 161 SKLLYRKHDAHALLSFSVYYKRFSHRLITGLMNCDFISIRTCKEIEGKFCEYLERQYHKK 220
Query: 198 VVPVGPLVQDPVEQ-----------TDHEKGAT---EIIHEYFLSKEEMEDIALGLELSG 243
V GP++ +P + E+G+ + + L K++ +++ LG+EL+G
Sbjct: 221 VFLTGPMLPEPNKGKPLEDRWSHWLNGFEQGSVVFCALGSQVTLEKDQFQELCLGIELTG 280
Query: 244 VNFIWVVRFPCGAKVKVDEELPESFLERTKERAMVIEGWAPQMKILGHPSIGGFVSHCGW 303
+ F V P GAK + + LPE F ER K+R +V+ W Q +L HPS+G F+SHCG+
Sbjct: 281 LPFFVAVTPPKGAKT-IQDALPEGFEERVKDRGVVLGEWVQQPLLLAHPSVGCFLSHCGF 339
Query: 304 SSVMESMRLGVPIIAMPMHVDQPLNARLV-EDVGIGLEVRRNKCGRIQREEMARVIKEVV 362
S+ ES+ I+ +P DQ LN RL+ E++ + +EV+R + G +E ++ I V+
Sbjct: 340 GSMWESIMSDCQIVLLPFLADQVLNTRLMTEELKVSVEVQREETGWFSKESLSVAITSVM 399
Query: 363 MEREGEKIKRKTREMGEKIKE 383
+ +I R K+KE
Sbjct: 400 --DQASEIGNLVRRNHSKLKE 418
>sp|Q9T080|U79B2_ARATH UDP-glycosyltransferase 79B2 OS=Arabidopsis thaliana GN=UGT79B2
PE=2 SV=1
Length = 455
Score = 151 bits (382), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 114/409 (27%), Positives = 193/409 (47%), Gaps = 43/409 (10%)
Query: 6 ICFCSTPSILNSIKQLDKFSLSIQLIELHLPSLPELPPQYHTTKGLPPHLMPTLKEAFDM 65
+ F L ++ L+ F +I + +P + LP T +P L A D+
Sbjct: 36 VTFLIPKKALKQLENLNLFPHNIVFRSVTVPHVDGLPVGTETVSEIPVTSADLLMSAMDL 95
Query: 66 ASPSFFNILKNLSPDLLIYDLIQPWAPALASSLNIPAVYFLVSSAATSAFMFHAIKKNSL 125
+++ + PDL+ +D W P +A + V ++V SA+T A M + L
Sbjct: 96 TRDQVEGVVRAVEPDLIFFDFAH-WIPEVARDFGLKTVKYVVVSASTIASML--VPGGEL 152
Query: 126 GDANDDDEEFPSSSIFIHD---YYMKSYFS--------NMVESPTTKRLLQCFERSCNIV 174
G +PSS + + Y MK+ S N++E TT + + +++
Sbjct: 153 GVPPPG---YPSSKVLLRKQDAYTMKNLESTNTINVGPNLLERVTTSLM------NSDVI 203
Query: 175 LIKSFRELEGKYIDYLSDLIKKKVVPVGPLVQDPVEQTDHEKGATEIIHEY--------- 225
I++ RE+EG + DY+ +KKV+ GP+ +P + + E+ + + Y
Sbjct: 204 AIRTAREIEGNFCDYIEKHCRKKVLLTGPVFPEPDKTRELEERWVKWLSGYEPDSVVFCA 263
Query: 226 -----FLSKEEMEDIALGLELSGVNFIWVVRFPCGAKVKVDEELPESFLERTKERAMVIE 280
L K++ +++ LG+EL+G F+ V+ P G+ + E LPE F ER K R +V
Sbjct: 264 LGSQVILEKDQFQELCLGMELTGSPFLVAVKPPRGSST-IQEALPEGFEERVKGRGVVWG 322
Query: 281 GWAPQMKILGHPSIGGFVSHCGWSSVMESMRLGVPIIAMPMHVDQPLNARLVED-VGIGL 339
W Q +L HPS+G FVSHCG+ S+ ES+ I+ +P DQ LN RL+ D + + +
Sbjct: 323 EWVQQPLLLSHPSVGCFVSHCGFGSMWESLLSDCQIVLVPQLGDQVLNTRLLSDELKVSV 382
Query: 340 EVRRNKCGRIQREEMARVIKEVVMERE---GEKIKRKTREMGEKIKEKG 385
EV R + G +E + I VM+R+ G +K+ + E + G
Sbjct: 383 EVAREETGWFSKESLFDAINS-VMKRDSEIGNLVKKNHTKWRETLTSPG 430
>sp|Q9FN28|U79B9_ARATH UDP-glycosyltransferase 79B9 OS=Arabidopsis thaliana GN=UGT79B9
PE=2 SV=1
Length = 447
Score = 151 bits (381), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 112/385 (29%), Positives = 188/385 (48%), Gaps = 31/385 (8%)
Query: 19 KQLDKFSLSIQLIELH---LPSLPELPPQYHTTKGLPPHLMPTLKEAFDMASPSFFNILK 75
KQL+ +L I H +P + LP T +P L L A D+ ++
Sbjct: 45 KQLEHHNLFPDRIIFHSLTIPHVDGLPAGAETASDIPISLGKFLTAAMDLTRDQVEAAVR 104
Query: 76 NLSPDLLIYDLIQPWAPALASSLNIPAVYFLVSSAATSAFMFHAIKKNSLGDANDDDEEF 135
L PDL+ +D W P +A + +V + V SA + A H + G+ +
Sbjct: 105 ALRPDLIFFDTAY-WVPEMAKEHRVKSVIYFVISANSIA---HELVPG--GELGVPPPGY 158
Query: 136 PSSSIFI--HDYYMKSYFSNMVESPTTKRLLQCFERSCNIVLIKSFRELEGKYIDYLSDL 193
PSS + HD + FS E + + ++C+ + I++ +E+EGK+ DY+
Sbjct: 159 PSSKVLYRGHDAHALLTFSIFYERLHYR--ITTGLKNCDFISIRTCKEIEGKFCDYIERQ 216
Query: 194 IKKKVVPVGPLVQDPVEQTDHEKGATEIIHEY--------------FLSKEEMEDIALGL 239
++KV+ GP++ +P E ++++ L K++ +++ LG+
Sbjct: 217 YQRKVLLTGPMLPEPDNSRPLEDRWNHWLNQFKPGSVIYCALGSQITLEKDQFQELCLGM 276
Query: 240 ELSGVNFIWVVRFPCGAKVKVDEELPESFLERTKERAMVIEGWAPQMKILGHPSIGGFVS 299
EL+G+ F+ V+ P GAK + E LPE F ER K +V W Q IL HPS+G FV+
Sbjct: 277 ELTGLPFLVAVKPPKGAKT-IQEALPEGFEERVKNHGVVWGEWVQQPLILAHPSVGCFVT 335
Query: 300 HCGWSSVMESMRLGVPIIAMPMHVDQPLNARLV-EDVGIGLEVRRNKCGRIQREEMARVI 358
HCG+ S+ ES+ I+ +P DQ LN RL+ E++ + +EV+R + G +E ++ I
Sbjct: 336 HCGFGSMWESLVSDCQIVLLPYLCDQILNTRLMSEELEVSVEVKREETGWFSKESLSVAI 395
Query: 359 KEVVMEREGEKIKRKTREMGEKIKE 383
VM+++ E + R K+KE
Sbjct: 396 TS-VMDKDSE-LGNLVRRNHAKLKE 418
>sp|Q9LVW3|U79B1_ARATH UDP-glycosyltransferase 79B1 OS=Arabidopsis thaliana GN=UGT79B1
PE=2 SV=1
Length = 468
Score = 150 bits (379), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 119/411 (28%), Positives = 192/411 (46%), Gaps = 63/411 (15%)
Query: 17 SIKQLDKFSLSIQLIELHLPSLPE---LPPQYHTTKGLPPHLMPTLKEAFDMASPSFFNI 73
++ QL+ +L LI H S+P+ LPP T +P L L A D P I
Sbjct: 50 ALNQLEPLNLYPNLITFHTISIPQVKGLPPGAETNSDVPFFLTHLLAVAMDQTRPEVETI 109
Query: 74 LKNLSPDLLIYDLIQPWAPALASSLNIPAVYFLVSSAATSAFMFHAIKKNSLGDANDDDE 133
+ + PDL+ YD W P +A + V F + SAA+ A + + D +
Sbjct: 110 FRTIKPDLVFYDSAH-WIPEIAKPIGAKTVCFNIVSAASIALSLVPSAEREVIDGKEMSG 168
Query: 134 E--------FPSSSIFIHDYYMK-------------SYFSNMVESPTTKRLLQCFERSCN 172
E +PSS + + + K S+F V + R+C+
Sbjct: 169 EELAKTPLGYPSSKVVLRPHEAKSLSFVWRKHEAIGSFFDGKVTA----------MRNCD 218
Query: 173 IVLIKSFRELEGKYIDYLSDLIKKKVVPVGPLVQ---------DP-----VEQTDHEK-- 216
+ I++ RE EGK+ DY+S K V GP++ DP + + +H
Sbjct: 219 AIAIRTCRETEGKFCDYISRQYSKPVYLTGPVLPGSQPNQPSLDPQWAEWLAKFNHGSVV 278
Query: 217 ----GATEIIHEYFLSKEEMEDIALGLELSGVNFIWVVRFPCGAKVKVDEELPESFLERT 272
G+ ++++ ++ +++ LGLE +G F+ ++ P G V+E LPE F ER
Sbjct: 279 FCAFGSQPVVNKI----DQFQELCLGLESTGFPFLVAIKPPSGVST-VEEALPEGFKERV 333
Query: 273 KERAMVIEGWAPQMKILGHPSIGGFVSHCGWSSVMESMRLGVPIIAMPMHVDQPLNARLV 332
+ R +V GW Q +L HPS+G FVSHCG+ S+ ES+ I+ +P H +Q LNARL+
Sbjct: 334 QGRGVVFGGWIQQPLVLNHPSVGCFVSHCGFGSMWESLMSDCQIVLVPQHGEQILNARLM 393
Query: 333 E-DVGIGLEVRRNKCGRIQREEMARVIKEVVMEREGEKIKRKTREMGEKIK 382
++ + +EV R K G R+ + +K V+ EG +I K R+ +K +
Sbjct: 394 TEEMEVAVEVEREKKGWFSRQSLENAVKSVM--EEGSEIGEKVRKNHDKWR 442
>sp|Q9XIQ4|U7B11_ARATH UDP-glycosyltransferase 79B11 OS=Arabidopsis thaliana GN=UGT79B11
PE=3 SV=1
Length = 452
Score = 148 bits (374), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 115/396 (29%), Positives = 189/396 (47%), Gaps = 41/396 (10%)
Query: 19 KQLDKFSLSIQLIELH---LPSLPELPPQYHTTKGLPPHLMPTLKEAFDMASPSFFNILK 75
KQL+ +L I H +P + LP T +P L+ L A D+ +
Sbjct: 45 KQLEHQNLFPHGIVFHPLVIPHVDGLPAGAETASDIPISLVKFLSIAMDLTRDQIEAAIG 104
Query: 76 NLSPDLLIYDLIQPWAPALASSLNIPAVYFLVSSAATSAFMFHAIKKNSLGDANDDDEEF 135
L PDL+++DL W P +A +L + ++ + V SA + A + LG A +
Sbjct: 105 ALRPDLILFDLAH-WVPEMAKALKVKSMLYNVMSATSIAHDL--VPGGELGVAPPG---Y 158
Query: 136 PSSSIFI--HDYYMKSYFSNMVESPTTKRLLQCFER---SCNIVLIKSFRELEGKYIDYL 190
PSS HD + FS KR F +C+ + I++ E+EGK+ DY+
Sbjct: 159 PSSKALYREHDAHALLTFSGFY-----KRFYHRFTTGLMNCDFISIRTCEEIEGKFCDYI 213
Query: 191 SDLIKKKVVPVGPLVQDPVEQTDHEK---------GATEII-----HEYFLSKEEMEDIA 236
KKKV+ GP++ +P + E G ++ + L K + +++
Sbjct: 214 ESQYKKKVLLTGPMLPEPDKSKPLEDQWSHWLSGFGQGSVVFCALGSQTILEKNQFQELC 273
Query: 237 LGLELSGVNFIWVVRFPCGAKVKVDEELPESFLERTKERAMVIEGWAPQMK----ILGHP 292
LG+EL+G+ F+ V+ P GA + E LPE F ER K R +V W Q IL HP
Sbjct: 274 LGIELTGLPFLVAVKPPKGANT-IHEALPEGFEERVKGRGIVWGEWVQQPSWQPLILAHP 332
Query: 293 SIGGFVSHCGWSSVMESMRLGVPIIAMPMHVDQPLNAR-LVEDVGIGLEVRRNKCGRIQR 351
S+G FVSHCG+ S+ ES+ I+ +P+ DQ L R + E++ + +EV+R + G +
Sbjct: 333 SVGCFVSHCGFGSMWESLMSDCQIVFIPVLNDQVLTTRVMTEELEVSVEVQREETGWFSK 392
Query: 352 EEMARVIKEVVMERE--GEKIKRKTREMGEKIKEKG 385
E ++ I ++ + G +++R ++ E + G
Sbjct: 393 ENLSGAIMSLMDQDSEIGNQVRRNHSKLKETLASPG 428
>sp|Q9LNI1|U72B3_ARATH UDP-glycosyltransferase 72B3 OS=Arabidopsis thaliana GN=UGT72B3
PE=2 SV=1
Length = 481
Score = 147 bits (370), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 84/234 (35%), Positives = 130/234 (55%), Gaps = 28/234 (11%)
Query: 174 VLIKSFRELEGKYIDYLSDLIKKK--VVPVGPLVQDPVEQTD-----------HEKGATE 220
+L+ SF +LE I + + K V +GPLV D +
Sbjct: 210 ILVNSFVDLEPNTIKIVQEPAPDKPPVYLIGPLVNSGSHDADVNDEYKCLNWLDNQPFGS 269
Query: 221 IIHEYF-----LSKEEMEDIALGLELSGVNFIWVVRFPCG----------AKVKVDEELP 265
+++ F L+ E+ ++ALGL SG F+WV+R P G ++ LP
Sbjct: 270 VLYVSFGSGGTLTFEQFIELALGLAESGKRFLWVIRSPSGIASSSYFNPQSRNDPFSFLP 329
Query: 266 ESFLERTKERAMVIEGWAPQMKILGHPSIGGFVSHCGWSSVMESMRLGVPIIAMPMHVDQ 325
+ FL+RTKE+ +V+ WAPQ +IL H SIGGF++HCGW+S +ES+ GVP+IA P++ +Q
Sbjct: 330 QGFLDRTKEKGLVVGSWAPQAQILTHTSIGGFLTHCGWNSSLESIVNGVPLIAWPLYAEQ 389
Query: 326 PLNARLVEDVGIGLEVRRNKCGRIQREEMARVIKEVVMEREGEKIKRKTREMGE 379
+NA L+ DVG L R + G + REE+ARV+K ++ EG +++K +E+ E
Sbjct: 390 KMNALLLVDVGAALRARLGEDGVVGREEVARVVKGLIEGEEGNAVRKKMKELKE 443
>sp|Q9M156|U72B1_ARATH UDP-glycosyltransferase 72B1 OS=Arabidopsis thaliana GN=UGT72B1
PE=1 SV=1
Length = 480
Score = 146 bits (368), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 87/235 (37%), Positives = 134/235 (57%), Gaps = 29/235 (12%)
Query: 174 VLIKSFRELEGKYIDYLSD--LIKKKVVPVGPLV---QDPVEQTDH--------EKGATE 220
+L+ +F ELE I L + L K V PVGPLV + +QT+ +
Sbjct: 210 ILVNTFFELEPNAIKALQEPGLDKPPVYPVGPLVNIGKQEAKQTEESECLKWLDNQPLGS 269
Query: 221 IIHEYF-----LSKEEMEDIALGLELSGVNFIWVVRFPCG----------AKVKVDEELP 265
+++ F L+ E++ ++ALGL S F+WV+R P G ++ LP
Sbjct: 270 VLYVSFGSGGTLTCEQLNELALGLADSEQRFLWVIRSPSGIANSSYFDSHSQTDPLTFLP 329
Query: 266 ESFLERTKERAMVIEGWAPQMKILGHPSIGGFVSHCGWSSVMESMRLGVPIIAMPMHVDQ 325
FLERTK+R VI WAPQ ++L HPS GGF++HCGW+S +ES+ G+P+IA P++ +Q
Sbjct: 330 PGFLERTKKRGFVIPFWAPQAQVLAHPSTGGFLTHCGWNSTLESVVSGIPLIAWPLYAEQ 389
Query: 326 PLNARLV-EDVGIGLEVRRNKCGRIQREEMARVIKEVVMEREGEKIKRKTREMGE 379
+NA L+ ED+ L R G ++REE+ARV+K ++ EG+ ++ K +E+ E
Sbjct: 390 KMNAVLLSEDIRAALRPRAGDDGLVRREEVARVVKGLMEGEEGKGVRNKMKELKE 444
>sp|Q9LPS8|U79B5_ARATH UDP-glycosyltransferase 79B5 OS=Arabidopsis thaliana GN=UGT79B5
PE=2 SV=1
Length = 448
Score = 145 bits (367), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 114/391 (29%), Positives = 193/391 (49%), Gaps = 35/391 (8%)
Query: 18 IKQLDKFSLSIQLIELHLPSLPELPPQYHTTKGLPPHLMPTLKEAFDMASPSFFNILKNL 77
++ L+ F SI L LP + LP T LP + A D+ ++ L
Sbjct: 47 LQPLNLFPDSIVFEPLTLPPVDGLPFGAETASDLPNSTKKPIFVAMDLLRDQIEAKVRAL 106
Query: 78 SPDLLIYDLIQPWAPALASSLNIPAVYFLVSSAATSAFMFHAIKKNSLGDANDDDEEFPS 137
PDL+ +D + W P +A I +V + + SAA A + + LG D +P
Sbjct: 107 KPDLIFFDFVH-WVPEMAEEFGIKSVNYQIISAACVAMVLAP--RAELGFPPPD---YPL 160
Query: 138 SSIFI--HDYYMKSYFSNMVE--SPTTKRLLQCFERSCNIVLIKSFRELEGKYIDYLSDL 193
S + + H+ + S F+N E TK L ++C++V I++ ELEGK ++
Sbjct: 161 SKVALRGHEANVCSLFANSHELFGLITKGL-----KNCDVVSIRTCVELEGKLCGFIEKE 215
Query: 194 IKKKVVPVGPLVQDPVEQTDH-------------EKGATEIIH---EYFLSKEEMEDIAL 237
+KK++ GP++ +P ++ E G+ ++F K++ ++ L
Sbjct: 216 CQKKLLLTGPMLPEPQNKSGKFLEDRWNHWLNGFEPGSVVFCAFGTQFFFEKDQFQEFCL 275
Query: 238 GLELSGVNFIWVVRFPCGAKVKVDEELPESFLERTKERAMVIEGWAPQMKILGHPSIGGF 297
G+EL G+ F+ V P G+ V E LP+ F ER K+ +V EGW Q IL HPS+G F
Sbjct: 276 GMELMGLPFLISVMPPKGSPT-VQEALPKGFEERVKKHGIVWEGWLEQPLILSHPSVGCF 334
Query: 298 VSHCGWSSVMESMRLGVPIIAMPMHVDQPLNARLV-EDVGIGLEVRRNKCGRIQREEMAR 356
V+HCG+ S+ ES+ I+ +P DQ L RL+ E++ + ++V+R G +E++
Sbjct: 335 VNHCGFGSMWESLVSDCQIVFIPQLADQVLITRLLTEELEVSVKVQREDSGWFSKEDLRD 394
Query: 357 VIKEVV-MERE-GEKIKRKTREMGEKIKEKG 385
+K V+ ++ E G +KR +++ E + G
Sbjct: 395 TVKSVMDIDSEIGNLVKRNHKKLKETLVSPG 425
>sp|Q9ZU72|U72D1_ARATH UDP-glycosyltransferase 72D1 OS=Arabidopsis thaliana GN=UGT72D1
PE=2 SV=1
Length = 470
Score = 144 bits (362), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 90/235 (38%), Positives = 138/235 (58%), Gaps = 32/235 (13%)
Query: 174 VLIKSFRELEGKYI------DYLSDLIKKKVVPVGPLVQDPVEQTDHEKGATEIIHEY-- 225
VL+ ++ EL+G + + LS ++K V P+GP+V+ + D E + E
Sbjct: 208 VLVNTWEELQGNTLAALREDEELSRVMKVPVYPIGPIVRTN-QHVDKPNSIFEWLDEQRE 266
Query: 226 ------------FLSKEEMEDIALGLELSGVNFIWVVRFPC---GAKVKVDEE----LPE 266
L+ E+ ++ALGLELSG F+WV+R P GA DE+ LPE
Sbjct: 267 RSVVFVCLGSGGTLTFEQTVELALGLELSGQRFVWVLRRPASYLGAISSDDEQVSASLPE 326
Query: 267 SFLERTKERAMVIEGWAPQMKILGHPSIGGFVSHCGWSSVMESMRLGVPIIAMPMHVDQP 326
FL+RT+ +V+ WAPQ++IL H SIGGF+SHCGWSS +ES+ GVPIIA P++ +Q
Sbjct: 327 GFLDRTRGVGIVVTQWAPQVEILSHRSIGGFLSHCGWSSALESLTKGVPIIAWPLYAEQW 386
Query: 327 LNARLV-EDVGIGLEVRRNKCGR-IQREEMARVIKEVVME--REGEKIKRKTREM 377
+NA L+ E++G+ + R I REE+A ++++++ E EG+KI+ K E+
Sbjct: 387 MNATLLTEEIGVAVRTSELPSERVIGREEVASLVRKIMAEEDEEGQKIRAKAEEV 441
>sp|Q4R1I9|ANGLT_ROSHC Anthocyanidin 5,3-O-glucosyltransferase OS=Rosa hybrid cultivar
GN=RhGT1 PE=2 SV=1
Length = 473
Score = 143 bits (360), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 116/401 (28%), Positives = 190/401 (47%), Gaps = 72/401 (17%)
Query: 47 TTKGLPPHLMPTLKEAFDMAS---PSFFNILKNLSPDL--LIYDLIQPWAPALASSLNIP 101
T LP H+ L F+ A P+ +L+ L L LI D+ + LNIP
Sbjct: 82 TISSLPEHI-EKLNLPFEYARLQIPNILQVLQTLKSSLKALILDMFCDALFDVTKDLNIP 140
Query: 102 AVYFLVSSAATSAFM-----FHAIKKNSLGDANDDDEEFPSSSIFIHDYYMKSYFSNMVE 156
YF S+ + A + FH NSL D D P S S M
Sbjct: 141 TFYFYTSAGRSLAVLLNIPTFHR-TTNSLSDFGD----VPIS------------ISGMPP 183
Query: 157 SPTTKRLLQCFERSCNI----------------VLIKSFRELEGKYIDYLSDLI------ 194
P + F+RS N +++ +F LE + + L +
Sbjct: 184 IPVSAMPKLLFDRSTNFYKSFLSTSTHMAKSNGIILNTFDLLEERALKALRAGLCLPNQP 243
Query: 195 KKKVVPVGPLVQDPVEQTDHEKGATEIIHE-----YFL--------SKEEMEDIALGLEL 241
+ VGPL+ D + + ++ FL S +++E +ALGLE
Sbjct: 244 TPPIFTVGPLISGKSGDNDEHESLKWLNNQPKDSVVFLCFGSMGVFSIKQLEAMALGLEK 303
Query: 242 SGVNFIWVVRFPCGAKVKVDEE-----LPESFLERTKERAMVIEGWAPQMKILGHPSIGG 296
SG F+WVVR P ++ V+E LP+ F+ERTK+R +V+ WAPQ+++L H S+GG
Sbjct: 304 SGQRFLWVVRNPPIEELPVEEPSLEEILPKGFVERTKDRGLVVRKWAPQVEVLSHDSVGG 363
Query: 297 FVSHCGWSSVMESMRLGVPIIAMPMHVDQPLN-ARLVEDVGIGLEVRRNKCGRIQREEMA 355
FV+HCGW+SV+E++ GVP++A P++ +Q L LVE++ + + V+ ++ G + +E+
Sbjct: 364 FVTHCGWNSVLEAVCNGVPMVAWPLYAEQKLGRVFLVEEMKVAVGVKESETGFVSADELE 423
Query: 356 RVIKEVVMEREGEKIKRKTREM---GEKIKEKGEEEIEWVA 393
+ ++E++ G++I+ + E G K KE+G + +A
Sbjct: 424 KRVRELMDSESGDEIRGRVSEFSNGGVKAKEEGGSSVASLA 464
>sp|Q8W491|U73B3_ARATH UDP-glycosyltransferase 73B3 OS=Arabidopsis thaliana GN=UGT73B3
PE=2 SV=1
Length = 481
Score = 141 bits (355), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 99/347 (28%), Positives = 163/347 (46%), Gaps = 43/347 (12%)
Query: 73 ILKNLSPDLLIYDLIQPWAPALASSLNIPAVYFLVSSAATSAFMFHAIKKNSLGDANDDD 132
+L+ PD LI D+ PWA A N+P + F + + + N
Sbjct: 120 LLETTRPDCLIADMFFPWATEAAEKFNVPRLVFHGTGYFSLCSEYCIRVHNPQNIVASRY 179
Query: 133 EEF-----PSSSIFIHDYYMKSYFSNMVESPTTKRLLQCFERSCNI--VLIKSFRELEGK 185
E F P + + + ES K +++ E V++ SF ELE
Sbjct: 180 EPFVIPDLPGNIVITQEQIA----DRDEESEMGKFMIEVKESDVKSSGVIVNSFYELEPD 235
Query: 186 YIDYLSDLIKKKVVPVGPL-VQDPVEQTDHEKGATEIIHEY------------------- 225
Y D+ ++ K+ +GPL V + + E+G I+E
Sbjct: 236 YADFYKSVVLKRAWHIGPLSVYNRGFEEKAERGKKASINEVECLKWLDSKKPDSVIYISF 295
Query: 226 ----FLSKEEMEDIALGLELSGVNFIWVVRFPCGAKVKVDEELPESFLERTKERAMVIEG 281
E++ +IA GLE SG NFIWVVR G ++ +E LPE F ER K + M+I G
Sbjct: 296 GSVACFKNEQLFEIAAGLETSGANFIWVVRKNIG--IEKEEWLPEGFEERVKGKGMIIRG 353
Query: 282 WAPQMKILGHPSIGGFVSHCGWSSVMESMRLGVPIIAMPMHVDQPLNARLVEDV---GIG 338
WAPQ+ IL H + GFV+HCGW+S++E + G+P++ P+ +Q N +LV V G+
Sbjct: 354 WAPQVLILDHQATCGFVTHCGWNSLLEGVAAGLPMVTWPVAAEQFYNEKLVTQVLRTGVS 413
Query: 339 LEVRRN---KCGRIQREEMARVIKEVVMEREGEKIKRKTREMGEKIK 382
+ ++N I RE++ + ++EV++ E ++ + + +++ E K
Sbjct: 414 VGAKKNVRTTGDFISREKVVKAVREVLVGEEADERRERAKKLAEMAK 460
>sp|Q9SY84|U90A2_ARATH UDP-glycosyltransferase 90A2 OS=Arabidopsis thaliana GN=UGT90A2
PE=2 SV=1
Length = 467
Score = 141 bits (355), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 119/398 (29%), Positives = 191/398 (47%), Gaps = 50/398 (12%)
Query: 30 LIELHLP-SLPELPPQYHTTKGLPPHLMPTLKEAFDMASPS----FFNILKNL-SPDLLI 83
++++ P ++PE+PP T LP L +L F A+ S F L +L ++
Sbjct: 63 IVDVPFPDNVPEIPPGVECTDKLPA-LSSSLFVPFTRATKSMQADFERELMSLPRVSFMV 121
Query: 84 YDLIQPWAPALASSLNIPAVYFLVSSAATSAFMFHAIKKNSLGDANDDDE-----EFPSS 138
D W A L P + F + A++ + L + + E EFP
Sbjct: 122 SDGFLWWTQESARKLGFPRLVFFGMNCASTVICDSVFQNQLLSNVKSETEPVSVPEFPWI 181
Query: 139 SIFIHDYYMKSYFSNMVESPTTKRLLQ---CFERSCNIVLIKSFRELEGKYIDYLSDLIK 195
+ D+ + P K +L +S I+ +F +LE +ID+ K
Sbjct: 182 KVRKCDFVKDMFDPKTTTDPGFKLILDQVTSMNQSQGIIF-NTFDDLEPVFIDFYKRKRK 240
Query: 196 KKVVPVGPL------VQDPVEQT-----------DHEKGATEIIHEYF-----LSKEEME 233
K+ VGPL + D VE+ +KG +++ F +S+E++E
Sbjct: 241 LKLWAVGPLCYVNNFLDDEVEEKVKPSWMKWLDEKRDKGCN-VLYVAFGSQAEISREQLE 299
Query: 234 DIALGLELSGVNFIWVVRFPCGAKVKVDEELPESFLERTKERAMVI-EGWAPQMKILGHP 292
+IALGLE S VNF+WVV+ E+ + F ER ER M++ + W Q KIL H
Sbjct: 300 EIALGLEESKVNFLWVVK---------GNEIGKGFEERVGERGMMVRDEWVDQRKILEHE 350
Query: 293 SIGGFVSHCGWSSVMESMRLGVPIIAMPMHVDQPLNARL-VEDVGIGLEVRRNKCGRIQR 351
S+ GF+SHCGW+S+ ES+ VPI+A P+ +QPLNA L VE++ + V G ++R
Sbjct: 351 SVRGFLSHCGWNSLTESICSEVPILAFPLAAEQPLNAILVVEELRVAERVVAASEGVVRR 410
Query: 352 EEMARVIKEVVMEREGEKIKRKTREMGEKIKEKGEEEI 389
EE+A +KE++ +G++++R G+ K+ EE I
Sbjct: 411 EEIAEKVKELMEGEKGKELRRNVEAYGKMAKKALEEGI 448
>sp|Q8VZE9|U73B1_ARATH UDP-glycosyltransferase 73B1 OS=Arabidopsis thaliana GN=UGT73B1
PE=2 SV=1
Length = 488
Score = 139 bits (351), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 102/351 (29%), Positives = 174/351 (49%), Gaps = 36/351 (10%)
Query: 73 ILKNLSPDLLIYDLIQPWAPALASSLNIPAVYFLVSSAATSAFMFHAIKKNSLGDANDDD 132
+L + PD L+ ++ PW+ +A +P + F + S H I+ + +
Sbjct: 123 LLVTMRPDCLVGNMFFPWSTKVAEKFGVPRLVFH-GTGYFSLCASHCIRLPKNVATSSEP 181
Query: 133 EEFPS--SSIFIHDYYMKSYFSNMVESPTTKRLLQCFERSCNIVLIKSFRELEGKYIDYL 190
P I I + + V K + ER VL+ SF ELE Y DY
Sbjct: 182 FVIPDLPGDILITEEQVMETEEESVMGRFMKAIRDS-ERDSFGVLVNSFYELEQAYSDYF 240
Query: 191 SDLIKKKVVPVGPL------VQDPVEQ------TDHE-------KGATEIIHEYF----- 226
+ K+ +GPL ++ E+ +HE K +I+ F
Sbjct: 241 KSFVAKRAWHIGPLSLGNRKFEEKAERGKKASIDEHECLKWLDSKKCDSVIYMAFGTMSS 300
Query: 227 LSKEEMEDIALGLELSGVNFIWVVRFPCGAKVKVDEELPESFLERTKERAMVIEGWAPQM 286
E++ +IA GL++SG +F+WVV G++V+ ++ LPE F E+TK + ++I GWAPQ+
Sbjct: 301 FKNEQLIEIAAGLDMSGHDFVWVVNRK-GSQVEKEDWLPEGFEEKTKGKGLIIRGWAPQV 359
Query: 287 KILGHPSIGGFVSHCGWSSVMESMRLGVPIIAMPMHVDQPLNARLVEDV---GIGLEVRR 343
IL H +IGGF++HCGW+S++E + G+P++ P+ +Q N +LV V G+ + V++
Sbjct: 360 LILEHKAIGGFLTHCGWNSLLEGVAAGLPMVTWPVGAEQFYNEKLVTQVLKTGVSVGVKK 419
Query: 344 --NKCGR-IQREEMARVIKEVVMEREGEKIKRKTREMGEK-IKEKGEEEIE 390
G I RE++ ++EV++ E K ++ EM + +KE G ++E
Sbjct: 420 MMQVVGDFISREKVEGAVREVMVGEERRKRAKELAEMAKNAVKEGGSSDLE 470
>sp|Q94A84|U72E1_ARATH UDP-glycosyltransferase 72E1 OS=Arabidopsis thaliana GN=UGT72E1
PE=1 SV=1
Length = 487
Score = 138 bits (348), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 85/275 (30%), Positives = 152/275 (55%), Gaps = 46/275 (16%)
Query: 170 SCNIVLIKSFRELEGKYIDYLSD------LIKKKVVPVGPLVQ--DPVEQTDH------- 214
+C+ +++ ++ ++E K + L D + V P+GPL + DP +T+H
Sbjct: 205 TCDGIIVNTWDDMEPKTLKSLQDPKLLGRIAGVPVYPIGPLSRPVDP-SKTNHPVLDWLN 263
Query: 215 EKGATEIIHEYF-----LSKEEMEDIALGLELSGVNFIWVVRFP-----CGAKVKVD--- 261
++ +++ F LS +++ ++A GLE+S F+WVVR P C A + +
Sbjct: 264 KQPDESVLYISFGSGGSLSAKQLTELAWGLEMSQQRFVWVVRPPVDGSACSAYLSANSGK 323
Query: 262 ------EELPESFLERTKERAMVIEGWAPQMKILGHPSIGGFVSHCGWSSVMESMRLGVP 315
+ LPE F+ RT ER ++ WAPQ +IL H ++GGF++HCGW+S++ES+ GVP
Sbjct: 324 IRDGTPDYLPEGFVSRTHERGFMVSSWAPQAEILAHQAVGGFLTHCGWNSILESVVGGVP 383
Query: 316 IIAMPMHVDQPLNARLVEDVGIGLEVRRNKC---GRIQREEMARVIKEVVMEREGEKIKR 372
+IA P+ +Q +NA L+ + +G+ VR K G I R E+ +++++++E EG ++++
Sbjct: 384 MIAWPLFAEQMMNATLLNE-ELGVAVRSKKLPSEGVITRAEIEALVRKIMVEEEGAEMRK 442
Query: 373 KTREMGEKIKEK-------GEEEIEWVADELIHLF 400
K +++ E E E + +ADE HL
Sbjct: 443 KIKKLKETAAESLSCDGGVAHESLSRIADESEHLL 477
>sp|P0C7P7|U74E1_ARATH UDP-glycosyltransferase 74E1 OS=Arabidopsis thaliana GN=UGT74E1
PE=3 SV=1
Length = 453
Score = 137 bits (344), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 102/348 (29%), Positives = 168/348 (48%), Gaps = 52/348 (14%)
Query: 79 PDLLIYDLIQPWAPALASSLNIPAVYFLVSSAATSAFMFHAIKKN----SLGDANDDDEE 134
P L+YD PW +A S + F SA +H K + S +
Sbjct: 103 PRALVYDSTMPWLLDVAHSYGLSGAVFFTQPWLVSAIYYHVFKGSFSVPSTKYGHSTLAS 162
Query: 135 FPSSSIFIHDYYMKSYFSNMVESPTTKRL----LQCFERSCNIVLIKSFRELEGKYIDYL 190
FPS I ++ + S+ P R L +R +IVL +F +LE K + ++
Sbjct: 163 FPSLPI-LNANDLPSFLCESSSYPYILRTVIDQLSNIDR-VDIVLCNTFDKLEEKLLKWI 220
Query: 191 SDLIKKKVVPVGPLV------QDPVEQTDH-----------------EKGATEIIHEYF- 226
+ V+ +GP V + E ++ K + +++ F
Sbjct: 221 KSVWP--VLNIGPTVPSMYLDKRLAEDKNYGFSLFGAKIAECMEWLNSKQPSSVVYVSFG 278
Query: 227 ----LSKEEMEDIALGLELSGVNFIWVVRFPCGAKVKVDEELPESFLERTKERAMVIEGW 282
L K+++ ++A GL+ SG F+WVVR K LPE+++E E+ + + W
Sbjct: 279 SLVVLKKDQLIELAAGLKQSGHFFLWVVRETERRK------LPENYIEEIGEKGLTV-SW 331
Query: 283 APQMKILGHPSIGGFVSHCGWSSVMESMRLGVPIIAMPMHVDQPLNARLVEDV-GIGLEV 341
+PQ+++L H SIG FV+HCGW+S +E + LGVP+I MP DQP NA+ +EDV +G+ V
Sbjct: 332 SPQLEVLTHKSIGCFVTHCGWNSTLEGLSLGVPMIGMPHWADQPTNAKFMEDVWKVGVRV 391
Query: 342 RRNKCGRIQREEMARVIKEVVMEREGEKIKRKTREMGEKIKEKGEEEI 389
+ + G ++REE R ++EV+ +G++I R+ EK K +E +
Sbjct: 392 KADSDGFVRREEFVRRVEEVMEAEQGKEI----RKNAEKWKVLAQEAV 435
>sp|Q94C57|U73B2_ARATH UDP-glucosyl transferase 73B2 OS=Arabidopsis thaliana GN=UGT73B2
PE=1 SV=1
Length = 483
Score = 136 bits (343), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 104/331 (31%), Positives = 154/331 (46%), Gaps = 51/331 (15%)
Query: 73 ILKNLSPDLLIYDLIQPWAPALASSLNIPAVYF----LVSSAATSAFMFHAIKKNSLGDA 128
+L PD LI D+ PWA A N+P + F S A H +K +
Sbjct: 121 LLGTTRPDCLIADMFFPWATEAAGKFNVPRLVFHGTGYFSLCAGYCIGVHKPQKRVASSS 180
Query: 129 NDDD-EEFPSSSIFIHDYYMKSYFSNMVESPTTKRLLQCFE---RSCNIVLIKSFRELEG 184
E P + + + + ES K + + E +S +VL SF ELE
Sbjct: 181 EPFVIPELPGNIVITEEQII----DGDGESDMGKFMTEVRESEVKSSGVVL-NSFYELEH 235
Query: 185 KYIDYLSDLIKKKVVPVGPL-VQDPVEQTDHEKGATEIIHE------------------- 224
Y D+ ++K+ +GPL V + + E+G I E
Sbjct: 236 DYADFYKSCVQKRAWHIGPLSVYNRGFEEKAERGKKANIDEAECLKWLDSKKPNSVIYVS 295
Query: 225 ----YFLSKEEMEDIALGLELSGVNFIWVVRFPCGAKVKVDEE--LPESFLERTKERAMV 278
F E++ +IA GLE SG +FIWVVR K K D E LPE F ER K + M+
Sbjct: 296 FGSVAFFKNEQLFEIAAGLEASGTSFIWVVR-----KTKDDREEWLPEGFEERVKGKGMI 350
Query: 279 IEGWAPQMKILGHPSIGGFVSHCGWSSVMESMRLGVPIIAMPMHVDQPLNARLVEDV-GI 337
I GWAPQ+ IL H + GGFV+HCGW+S++E + G+P++ P+ +Q N +LV V
Sbjct: 351 IRGWAPQVLILDHQATGGFVTHCGWNSLLEGVAAGLPMVTWPVGAEQFYNEKLVTQVLRT 410
Query: 338 GLEVRRNKCGR------IQREEMARVIKEVV 362
G+ V +K + I RE++ + ++EV+
Sbjct: 411 GVSVGASKHMKVMMGDFISREKVDKAVREVL 441
>sp|Q7Y232|U73B4_ARATH UDP-glycosyltransferase 73B4 OS=Arabidopsis thaliana GN=UGT73B4
PE=2 SV=1
Length = 484
Score = 136 bits (342), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 107/357 (29%), Positives = 170/357 (47%), Gaps = 48/357 (13%)
Query: 72 NILKNLSPDLLIYDLIQPWAPALASSLNIPAVYFLVSSAATSAFMFHAIKKNSLGDANDD 131
+ ++ P L+ D+ PWA A + +P + F TS+F + +
Sbjct: 116 SFIETTKPSALVADMFFPWATESAEKIGVPRLVF----HGTSSFALCCSYNMRIHKPHKK 171
Query: 132 DEEFPSSSIFI-----HDYYMKSYFSNMV--ESPTTK--RLLQCFERSCNIVLIKSFREL 182
SS+ F+ D + +N+ E+P K + ++ E S VL+ SF EL
Sbjct: 172 VAS--SSTPFVIPGLPGDIVITEDQANVTNEETPFGKFWKEVRESETSSFGVLVNSFYEL 229
Query: 183 EGKYIDYLSDLIKKKVVPVGPLV---QDPVEQTDHEKGAT----------------EIIH 223
E Y D+ + KK +GPL + E+ K A +++
Sbjct: 230 ESSYADFYRSFVAKKAWHIGPLSLSNRGIAEKAGRGKKANIDEQECLKWLDSKTPGSVVY 289
Query: 224 EYF-----LSKEEMEDIALGLELSGVNFIWVVRFPCGAKVKVDEE---LPESFLERTKER 275
F L E++ +IA GLE SG NFIWVV +V E LP+ F ER K +
Sbjct: 290 LSFGSGTGLPNEQLLEIAFGLEGSGQNFIWVVS-KNENQVGTGENEDWLPKGFEERNKGK 348
Query: 276 AMVIEGWAPQMKILGHPSIGGFVSHCGWSSVMESMRLGVPIIAMPMHVDQPLNARLVEDV 335
++I GWAPQ+ IL H +IGGFV+HCGW+S +E + G+P++ PM +Q N +L+ V
Sbjct: 349 GLIIRGWAPQVLILDHKAIGGFVTHCGWNSTLEGIAAGLPMVTWPMGAEQFYNEKLLTKV 408
Query: 336 -GIGLEVRRN---KCGR-IQREEMARVIKEVVMEREGEKIKRKTREMGEKIKEKGEE 387
IG+ V K G+ I R ++ + ++EV+ + E+ + + +E+GE K EE
Sbjct: 409 LRIGVNVGATELVKKGKLISRAQVEKAVREVIGGEKAEERRLRAKELGEMAKAAVEE 465
>sp|Q2V6K0|UFOG6_FRAAN UDP-glucose flavonoid 3-O-glucosyltransferase 6 OS=Fragaria
ananassa GN=GT6 PE=1 SV=1
Length = 479
Score = 136 bits (342), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 107/337 (31%), Positives = 167/337 (49%), Gaps = 46/337 (13%)
Query: 94 LASSLNIPAVYFLVSSAATSAFMFH--AIKKNSLGDAN---DDDEEFPSSSIFIHDYYMK 148
LA+ +P+ F S AA MFH A++ D D D E SS F++
Sbjct: 130 LANEFGLPSYVFYTSGAADLGLMFHLQALRDEENKDCTEFKDSDAELVVSS-FVNPLPAA 188
Query: 149 SYFSNMVESPTTKRLLQCFE---RSCNIVLIKSFRELEGKYIDYLSDLIK-KKVVPVGPL 204
++V F R +L+ +F ELE I LS K V PVGP+
Sbjct: 189 RVLPSVVFEKEGGNFFLNFAKRYRETKGILVNTFLELEPHAIQSLSSDGKILPVYPVGPI 248
Query: 205 --VQDPVEQTDHEKGA--------------TEIIHEYFLS-----KEEMEDIALGLELSG 243
V+ Q EK + ++ F S ++++++IA LE G
Sbjct: 249 LNVKSEGNQVSSEKSKQKSDILEWLDDQPPSSVVFLCFGSMGCFGEDQVKEIAHALEQGG 308
Query: 244 VNFIWVVRFPCGAKVKVDEE-------LPESFLERTKERAMVIEGWAPQMKILGHPSIGG 296
+ F+W +R P K+ + LPE FL+RT + VI GWAPQ+ IL HP++GG
Sbjct: 309 IRFLWSLRQPSKEKIGFPSDYTDYKAVLPEGFLDRTTDLGKVI-GWAPQLAILAHPAVGG 367
Query: 297 FVSHCGWSSVMESMRLGVPIIAMPMHVDQPLNA-RLVEDVGIGLEV----RRNKCGRIQR 351
FVSHCGW+S +ES+ GVPI P + +Q +NA LV+++ + +E+ R++ + R
Sbjct: 368 FVSHCGWNSTLESIWYGVPIATWPFYAEQQVNAFELVKELKLAVEIDMGYRKDSGVIVSR 427
Query: 352 EEMARVIKEVVMEREGEKIKRKTREMGEKIKEKGEEE 388
E + + IKE VME+E E ++++ +EM + ++ EE+
Sbjct: 428 ENIEKGIKE-VMEQESE-LRKRVKEMSQMSRKALEED 462
>sp|O82382|U71C2_ARATH UDP-glycosyltransferase 71C2 OS=Arabidopsis thaliana GN=UGT71C2
PE=1 SV=1
Length = 474
Score = 135 bits (340), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 92/320 (28%), Positives = 164/320 (51%), Gaps = 34/320 (10%)
Query: 94 LASSLNIPAVYFLVSSAATSAFMFHAIKKNSLGDA---NDDDEEFPSSSIFIHDYYMKSY 150
+ + N+P+ FL SA+ M + +++N DEE S F++ +K
Sbjct: 140 VGNEFNLPSYIFLTCSASFLGMMKYLLERNRETKPELNRSSDEETISVPGFVNSVPVKVL 199
Query: 151 FSNMVESPTTKRLLQCFER--SCNIVLIKSFRELEGKYIDYLSDLIKK--KVVPVGPLV- 205
+ + + + ++ ER +L+ SF LE DY V P+GP++
Sbjct: 200 PPGLFTTESYEAWVEMAERFPEAKGILVNSFESLERNAFDYFDRRPDNYPPVYPIGPILC 259
Query: 206 -----------QDPVEQTDHEKGATEIIHEYF-----LSKEEMEDIALGLELSGVNFIWV 249
+D + + ++ + ++ F L+ ++++IA LEL G+ F+W
Sbjct: 260 SNDRPNLDLSERDRILKWLDDQPESSVVFLCFGSLKSLAASQIKEIAQALELVGIRFLWS 319
Query: 250 VRFPCGAKVKVDEELPESFLERTKERAMVIEGWAPQMKILGHPSIGGFVSHCGWSSVMES 309
+R +E LP+ F+ R +V GWAPQ++IL H +IGGFVSHCGW+S++ES
Sbjct: 320 IRTDPKEYASPNEILPDGFMNRVMGLGLVC-GWAPQVEILAHKAIGGFVSHCGWNSILES 378
Query: 310 MRLGVPIIAMPMHVDQPLNA-RLVEDVGIGLEVRRNKCGR----IQREEMARVIKEVVME 364
+R GVPI PM+ +Q LNA +V+++G+ LE+R + ++ +E+A ++ ++
Sbjct: 379 LRFGVPIATWPMYAEQQLNAFTIVKELGLALEMRLDYVSEYGEIVKADEIAGAVRSLM-- 436
Query: 365 REGEKI-KRKTREMGEKIKE 383
+GE + +RK +E+ E KE
Sbjct: 437 -DGEDVPRRKLKEIAEAGKE 455
>sp|Q9ZQG4|U73B5_ARATH UDP-glycosyltransferase 73B5 OS=Arabidopsis thaliana GN=UGT73B5
PE=2 SV=1
Length = 484
Score = 135 bits (340), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 103/354 (29%), Positives = 170/354 (48%), Gaps = 45/354 (12%)
Query: 72 NILKNLSPDLLIYDLIQPWAPALASSLNIPAVYFLVSSAATSAFMFHAIKKNSLGDANDD 131
+ ++ P L+ D+ PWA A L +P + F +S F ++
Sbjct: 119 SFIETTKPSALVADMFFPWATESAEKLGVPRLVFHGTS------FFSLCCSYNMRIHKPH 172
Query: 132 DEEFPSSSIFI-----HDYYMKSYFSNMV--ESPTTKRLLQCFERSCNI--VLIKSFREL 182
+ SS+ F+ D + +N+ E+P K + + E N VL+ SF EL
Sbjct: 173 KKVATSSTPFVIPGLPGDIVITEDQANVAKEETPMGKFMKEVRESETNSFGVLVNSFYEL 232
Query: 183 EGKYIDYLSDLIKKKVVPVGPLV---QDPVEQTDHEKGAT----------------EIIH 223
E Y D+ + K+ +GPL ++ E+ K A +++
Sbjct: 233 ESAYADFYRSFVAKRAWHIGPLSLSNRELGEKARRGKKANIDEQECLKWLDSKTPGSVVY 292
Query: 224 EYF-----LSKEEMEDIALGLELSGVNFIWVVRFPCGAKVKVDEELPESFLERTKERAMV 278
F + +++ +IA GLE SG +FIWVVR + +E LPE F ERT + ++
Sbjct: 293 LSFGSGTNFTNDQLLEIAFGLEGSGQSFIWVVR-KNENQGDNEEWLPEGFKERTTGKGLI 351
Query: 279 IEGWAPQMKILGHPSIGGFVSHCGWSSVMESMRLGVPIIAMPMHVDQPLNARLVEDV-GI 337
I GWAPQ+ IL H +IGGFV+HCGW+S +E + G+P++ PM +Q N +L+ V I
Sbjct: 352 IPGWAPQVLILDHKAIGGFVTHCGWNSAIEGIAAGLPMVTWPMGAEQFYNEKLLTKVLRI 411
Query: 338 GLEVRRN---KCGR-IQREEMARVIKEVVMEREGEKIKRKTREMGEKIKEKGEE 387
G+ V K G+ I R ++ + ++EV+ + E+ + +++GE K EE
Sbjct: 412 GVNVGATELVKKGKLISRAQVEKAVREVIGGEKAEERRLWAKKLGEMAKAAVEE 465
>sp|O22182|U84B1_ARATH UDP-glycosyltransferase 84B1 OS=Arabidopsis thaliana GN=UGT84B1
PE=2 SV=1
Length = 456
Score = 135 bits (339), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 121/416 (29%), Positives = 200/416 (48%), Gaps = 66/416 (15%)
Query: 3 NFHICFCSTPSILNSIKQLDKFSLSIQLIELHLPSLPELPPQYHTTKGLPPHLMPTLKEA 62
N HI + S + + ++K + L+ LP+ P+ P L+ +L +
Sbjct: 38 NLHINLATIESARDLLSTVEKPRYPVDLV-FFSDGLPKEDPKA------PETLLKSLNKV 90
Query: 63 FDMASPSFFNILKNLSPDLLIYDLIQPWAPALASSLNIP-AVYFLVSSAATSAFMFHAIK 121
M + I++ +I PW PA+A+S NI A+ ++ + A S + + +K
Sbjct: 91 GAM---NLSKIIEEKRYSCIISSPFTPWVPAVAASHNISCAILWIQACGAYSVYYRYYMK 147
Query: 122 KNSLGDANDDDE--EFPSSSIF-IHDY--YM----KSYFSNMVESPTTKRLLQCFERSCN 172
NS D D ++ E P+ + + D +M ++F N++ C R
Sbjct: 148 TNSFPDLEDLNQTVELPALPLLEVRDLPSFMLPSGGAHFYNLM-----AEFADCL-RYVK 201
Query: 173 IVLIKSFRELEGKYIDYLSDLIKKKVVPVGPLVQ-----DPVEQTDHEK----------- 216
VL+ SF ELE + I+ ++DL K V+P+GPLV D E+T K
Sbjct: 202 WVLVNSFYELESEIIESMADL--KPVIPIGPLVSPFLLGDGEEETLDGKNLDFCKSDDCC 259
Query: 217 -------GATEIIHEYFLS-----KEEMEDIALGLELSGVNFIWVVRFPCGAK-VKVDEE 263
+ +++ F S + ++E IA L+ G+ F+WV+R A+ V V +E
Sbjct: 260 MEWLDKQARSSVVYISFGSMLETLENQVETIAKALKNRGLPFLWVIRPKEKAQNVAVLQE 319
Query: 264 LPESFLERTKERAMVIEGWAPQMKILGHPSIGGFVSHCGWSSVMESMRLGVPIIAMPMHV 323
+ KE V+ W+PQ KIL H +I FV+HCGW+S ME++ GVP++A P
Sbjct: 320 M-------VKEGQGVVLEWSPQEKILSHEAISCFVTHCGWNSTMETVVAGVPVVAYPSWT 372
Query: 324 DQPLNARLVEDV-GIGLEVRRNKC-GRIQREEMARVIKEVVMEREGEKIKRKTREM 377
DQP++ARL+ DV GIG+ +R + G ++ EE+ R I+ V I+R+ E+
Sbjct: 373 DQPIDARLLVDVFGIGVRMRNDSVDGELKVEEVERCIEAVTEGPAAVDIRRRAAEL 428
>sp|O22183|U84B2_ARATH UDP-glycosyltransferase 84B2 OS=Arabidopsis thaliana GN=UGT84B2
PE=3 SV=1
Length = 438
Score = 134 bits (337), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 99/331 (29%), Positives = 166/331 (50%), Gaps = 45/331 (13%)
Query: 80 DLLIYDLIQPWAPALASSLNIP-AVYFLVSSAATSAFMFHAIKKNSLGDANDDDE--EFP 136
D +I PW PA+A++ NIP A+ ++ + A S + + +K N D D ++ E P
Sbjct: 92 DCIISVPFTPWVPAVAAAHNIPCAILWIQACGAFSVYYRYYMKTNPFPDLEDLNQTVELP 151
Query: 137 SSSIF----IHDYYMKSYFSNMVESPTTKRLLQCFERSCNIVLIKSFRELEGKYIDYLSD 192
+ + + + S +N+ + C + VL+ SF ELE + I+ +SD
Sbjct: 152 ALPLLEVRDLPSLMLPSQGANV--NTLMAEFADCL-KDVKWVLVNSFYELESEIIESMSD 208
Query: 193 LIKKKVVPVGPLVQDPVEQTDHEK------------------GATEIIHEYFLS-----K 229
L K ++P+GPLV + D EK + +++ F S +
Sbjct: 209 L--KPIIPIGPLVSPFLLGNDEEKTLDMWKVDDYCMEWLDKQARSSVVYISFGSILKSLE 266
Query: 230 EEMEDIALGLELSGVNFIWVVR-FPCGAKVKVDEELPESFLERTKERAMVIEGWAPQMKI 288
++E IA L+ GV F+WV+R G V+V +E+ KE V+ W Q KI
Sbjct: 267 NQVETIATALKNRGVPFLWVIRPKEKGENVQVLQEM-------VKEGKGVVTEWGQQEKI 319
Query: 289 LGHPSIGGFVSHCGWSSVMESMRLGVPIIAMPMHVDQPLNARLVEDV-GIGLEVRRNKC- 346
L H +I F++HCGW+S +E++ GVP++A P +DQPL+ARL+ DV GIG+ ++ +
Sbjct: 320 LSHMAISCFITHCGWNSTIETVVTGVPVVAYPTWIDQPLDARLLVDVFGIGVRMKNDAID 379
Query: 347 GRIQREEMARVIKEVVMEREGEKIKRKTREM 377
G ++ E+ R I+ V ++R+ E+
Sbjct: 380 GELKVAEVERCIEAVTEGPAAADMRRRATEL 410
>sp|Q9LML6|U71C4_ARATH UDP-glycosyltransferase 71C4 OS=Arabidopsis thaliana GN=UGT71C4
PE=2 SV=2
Length = 479
Score = 134 bits (337), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 92/321 (28%), Positives = 164/321 (51%), Gaps = 35/321 (10%)
Query: 94 LASSLNIPAVYFLVSSAATSAFMFHAIKKN----SLGDANDDDEEFPSSSIFIHDYYMKS 149
+ + LN+P+ +L +A M + ++ S D + DEE P FI+ K
Sbjct: 137 VGNELNLPSYIYLTCNARYLGMMKYIPDRHRKIASEFDLSSGDEELPVPG-FINAIPTKF 195
Query: 150 YFSNMVESPTTKRLLQCFERSCNI--VLIKSFRELEGKYIDYLSDLIK-KKVVPVGPLVQ 206
+ + ++ R + +L+ SF ELE DY S L K V PVGP++
Sbjct: 196 MPPGLFNKEAYEAYVELAPRFADAKGILVNSFTELEPHPFDYFSHLEKFPPVYPVGPILS 255
Query: 207 DPVEQTDHEKGA--------------TEIIHEYFLSK-----EEMEDIALGLELSGVNFI 247
+ +E+ + ++ F S+ ++++IA LEL G F+
Sbjct: 256 LKDRASPNEEAVDRDQIVGWLDDQPESSVVFLCFGSRGSVDEPQVKEIARALELVGCRFL 315
Query: 248 WVVRFPCGAKVKVDEELPESFLERTKERAMVIEGWAPQMKILGHPSIGGFVSHCGWSSVM 307
W +R + ++ LPE F+ R R +V GWAPQ+++L H +IGGFVSHCGW+S +
Sbjct: 316 WSIRTSGDVETNPNDVLPEGFMGRVAGRGLVC-GWAPQVEVLAHKAIGGFVSHCGWNSTL 374
Query: 308 ESMRLGVPIIAMPMHVDQPLNA-RLVEDVGIGLEVRRN----KCGRIQREEMARVIKEVV 362
ES+ GVP+ PM+ +Q LNA LV+++G+ +++R + + G + +E+AR ++ ++
Sbjct: 375 ESLWFGVPVATWPMYAEQQLNAFTLVKELGLAVDLRMDYVSSRGGLVTCDEIARAVRSLM 434
Query: 363 MEREGEKIKRKTREMGEKIKE 383
G++ ++K +EM + ++
Sbjct: 435 --DGGDEKRKKVKEMADAARK 453
>sp|O81498|U72E3_ARATH UDP-glycosyltransferase 72E3 OS=Arabidopsis thaliana GN=UGT72E3
PE=1 SV=1
Length = 481
Score = 134 bits (336), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 103/344 (29%), Positives = 174/344 (50%), Gaps = 51/344 (14%)
Query: 78 SPDLLIYDLIQPWAPALASSLNIPAVYFLVSSAATSAFMFHAIKKNSLGDANDDDEE--- 134
+P LI DL A LA+ LN+ F+ S+A ++ +I +L + ++
Sbjct: 104 NPTALIIDLFGTDALCLAAELNMLTYVFIASNAR---YLGVSIYYPTLDEVIKEEHTVQR 160
Query: 135 ----FPSSSIFIHDYYMKSYFSNMVESPTTKRLLQ-CFER-SCNIVLIKSFRELEGKYID 188
P + M +Y + + P L++ C + +L+ ++ E+E K +
Sbjct: 161 KPLTIPGCEPVRFEDIMDAYL--VPDEPVYHDLVRHCLAYPKADGILVNTWEEMEPKSLK 218
Query: 189 YLSD------LIKKKVVPVGPLVQDPVEQ--TDH-------EKGATEIIHEYF-----LS 228
L D + + V PVGPL + P++ TDH ++ +++ F L+
Sbjct: 219 SLQDPKLLGRVARVPVYPVGPLCR-PIQSSTTDHPVFDWLNKQPNESVLYISFGSGGSLT 277
Query: 229 KEEMEDIALGLELSGVNFIWVVRFPCGAKVKVD--------------EELPESFLERTKE 274
+++ ++A GLE S FIWVVR P D E LPE F+ RT +
Sbjct: 278 AQQLTELAWGLEESQQRFIWVVRPPVDGSSCSDYFSAKGGVTKDNTPEYLPEGFVTRTCD 337
Query: 275 RAMVIEGWAPQMKILGHPSIGGFVSHCGWSSVMESMRLGVPIIAMPMHVDQPLNARLVED 334
R +I WAPQ +IL H ++GGF++HCGWSS +ES+ GVP+IA P+ +Q +NA L+ D
Sbjct: 338 RGFMIPSWAPQAEILAHQAVGGFLTHCGWSSTLESVLCGVPMIAWPLFAEQNMNAALLSD 397
Query: 335 -VGIGLEVRRNKCGRIQREEMARVIKEVVMEREGEKIKRKTREM 377
+GI + V K I R ++ ++++V+ E EGE+++RK +++
Sbjct: 398 ELGISVRVDDPK-EAISRSKIEAMVRKVMAEDEGEEMRRKVKKL 440
>sp|Q9SKC1|U74C1_ARATH UDP-glycosyltransferase 74C1 OS=Arabidopsis thaliana GN=UGT74C1
PE=2 SV=1
Length = 457
Score = 134 bits (336), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 105/385 (27%), Positives = 184/385 (47%), Gaps = 85/385 (22%)
Query: 74 LKNLSPDLLIYDLIQPWAPALASSLNIPAVYFLVSSAATSAFMFHAIKKNSLGDANDDDE 133
L + P LIYD P+A +A L++ V + S +H I + + D D
Sbjct: 99 LSDNPPKALIYDPFMPFALDIAKDLDLYVVAYFTQPWLASLVYYH-INEGTY-DVPVDRH 156
Query: 134 EFPSSSIF-----------------------IHDYYMKSYFSNMVESPTTKRLLQCFERS 170
E P+ + F +H++ ++ FSN++++
Sbjct: 157 ENPTLASFPGFPLLSQDDLPSFACEKGSYPLLHEFVVRQ-FSNLLQA------------- 202
Query: 171 CNIVLIKSFRELEGKYIDYLSDLIKKKVVPVGPLVQDPV------EQTDHE--------- 215
+ +L +F +LE K + +++D + V +GP+V E D+E
Sbjct: 203 -DCILCNTFDQLEPKVVKWMND--QWPVKNIGPVVPSKFLDNRLPEDKDYELENSKTEPD 259
Query: 216 ---------KGATEIIHEYF-----LSKEEMEDIALGLELSGVNFIWVVRFPCGAKVKVD 261
+ A +++ F LS+++M++IA+ + +G +F+W VR +K
Sbjct: 260 ESVLKWLGNRPAKSVVYVAFGTLVALSEKQMKEIAMAISQTGYHFLWSVRESERSK---- 315
Query: 262 EELPESFLERTKER-AMVIEGWAPQMKILGHPSIGGFVSHCGWSSVMESMRLGVPIIAMP 320
LP F+E +E+ + ++ W PQ+++L H SIG FVSHCGW+S +E++ LGVP++ +P
Sbjct: 316 --LPSGFIEEAEEKDSGLVAKWVPQLEVLAHESIGCFVSHCGWNSTLEALCLGVPMVGVP 373
Query: 321 MHVDQPLNARLVEDV-GIGLEVRRNKCGRIQREEMARVIKEVVMEREGEKIKRKTREMG- 378
DQP NA+ +EDV IG+ VR + G +EE+AR I EV+ G++I++ ++
Sbjct: 374 QWTDQPTNAKFIEDVWKIGVRVRTDGEGLSSKEEIARCIVEVMEGERGKEIRKNVEKLKV 433
Query: 379 ---EKIKEKGEEEIEWVADELIHLF 400
E I E G + + DE + L
Sbjct: 434 LAREAISEGGSSDKK--IDEFVALL 456
>sp|Q9SYK9|U74E2_ARATH UDP-glycosyltransferase 74E2 OS=Arabidopsis thaliana GN=UGT74E2
PE=1 SV=1
Length = 453
Score = 132 bits (333), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 100/348 (28%), Positives = 166/348 (47%), Gaps = 52/348 (14%)
Query: 79 PDLLIYDLIQPWAPALASSLNIPAVYFLVSSAATSAFMFHAIKKN----SLGDANDDDEE 134
P ++YD PW +A S + F +A +H K + S +
Sbjct: 103 PRAIVYDSTMPWLLDVAHSYGLSGAVFFTQPWLVTAIYYHVFKGSFSVPSTKYGHSTLAS 162
Query: 135 FPSSSIFIHDYYMKSYFSNMVESPTTKRL----LQCFERSCNIVLIKSFRELEGKYIDYL 190
FPS + + + S+ P R+ L +R +IVL +F +LE K + ++
Sbjct: 163 FPSFPMLTAND-LPSFLCESSSYPNILRIVVDQLSNIDR-VDIVLCNTFDKLEEKLLKWV 220
Query: 191 SDLIKKKVVPVGPLVQ----DPVEQTDHEKGAT-------------------EIIHEYF- 226
L V+ +GP V D D G + +++ F
Sbjct: 221 QSLWP--VLNIGPTVPSMYLDKRLSEDKNYGFSLFNAKVAECMEWLNSKEPNSVVYLSFG 278
Query: 227 ----LSKEEMEDIALGLELSGVNFIWVVRFPCGAKVKVDEELPESFLERTKERAMVIEGW 282
L +++M ++A GL+ SG F+WVVR +LP +++E E+ +++ W
Sbjct: 279 SLVILKEDQMLELAAGLKQSGRFFLWVVR------ETETHKLPRNYVEEIGEKGLIVS-W 331
Query: 283 APQMKILGHPSIGGFVSHCGWSSVMESMRLGVPIIAMPMHVDQPLNARLVEDV-GIGLEV 341
+PQ+ +L H SIG F++HCGW+S +E + LGVP+I MP DQP NA+ ++DV +G+ V
Sbjct: 332 SPQLDVLAHKSIGCFLTHCGWNSTLEGLSLGVPMIGMPHWTDQPTNAKFMQDVWKVGVRV 391
Query: 342 RRNKCGRIQREEMARVIKEVVMEREGEKIKRKTREMGEKIKEKGEEEI 389
+ G ++REE+ R ++EV+ EGEK K + R+ EK K +E +
Sbjct: 392 KAEGDGFVRREEIMRSVEEVM---EGEKGK-EIRKNAEKWKVLAQEAV 435
>sp|Q9LVR1|U72E2_ARATH UDP-glycosyltransferase 72E2 OS=Arabidopsis thaliana GN=UGT72E2
PE=1 SV=1
Length = 481
Score = 132 bits (332), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 80/239 (33%), Positives = 135/239 (56%), Gaps = 37/239 (15%)
Query: 174 VLIKSFRELEGKYID------YLSDLIKKKVVPVGPLVQDPVE--QTDH-------EKGA 218
+L+ ++ E+E K + L + + V P+GPL + P++ +TDH E+
Sbjct: 204 ILVNTWEEMEPKSLKSLLNPKLLGRVARVPVYPIGPLCR-PIQSSETDHPVLDWLNEQPN 262
Query: 219 TEIIHEYF-----LSKEEMEDIALGLELSGVNFIWVVRFPC--------------GAKVK 259
+++ F LS +++ ++A GLE S F+WVVR P G +
Sbjct: 263 ESVLYISFGSGGCLSAKQLTELAWGLEQSQQRFVWVVRPPVDGSCCSEYVSANGGGTEDN 322
Query: 260 VDEELPESFLERTKERAMVIEGWAPQMKILGHPSIGGFVSHCGWSSVMESMRLGVPIIAM 319
E LPE F+ RT +R V+ WAPQ +IL H ++GGF++HCGWSS +ES+ GVP+IA
Sbjct: 323 TPEYLPEGFVSRTSDRGFVVPSWAPQAEILSHRAVGGFLTHCGWSSTLESVVGGVPMIAW 382
Query: 320 PMHVDQPLNARLVEDVGIGLEVRRNKCGR-IQREEMARVIKEVVMEREGEKIKRKTREM 377
P+ +Q +NA L+ D +G+ VR + I R ++ ++++V+ E+EGE ++RK +++
Sbjct: 383 PLFAEQNMNAALLSD-ELGIAVRLDDPKEDISRWKIEALVRKVMTEKEGEAMRRKVKKL 440
>sp|Q9C9B0|U89B1_ARATH UDP-glycosyltransferase 89B1 OS=Arabidopsis thaliana GN=UGT89B1
PE=2 SV=2
Length = 473
Score = 131 bits (330), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 114/423 (26%), Positives = 192/423 (45%), Gaps = 44/423 (10%)
Query: 1 GSNFHICFCSTPSILNSIKQLDKFSLSIQLIELHLPSLPELPPQYHTTKGLPPHLMPTLK 60
G+ I TP L + L ++I+ + L PS P +P + LPP P +
Sbjct: 41 GAALKITVLVTPKNLPFLSPLLSAVVNIEPLILPFPSHPSIPSGVENVQDLPPSGFPLMI 100
Query: 61 EAF-DMASPSFFNILKNLSPDL-LIYDLIQPWAPALASSLNIPAVYFLVSSAATSAFMFH 118
A ++ +P I + SP + ++ D W +L IP F S+A T +
Sbjct: 101 HALGNLHAPLISWITSHPSPPVAIVSDFFLGWT----KNLGIPRFDFSPSAAITCCILNT 156
Query: 119 AIKKNSLGDANDDDEEF------PSSSIFIHDYYMKSYFSNMVESPTTKRLLQCFERSCN 172
+ DDD E P+ + D Y S + P + + F +
Sbjct: 157 LWIEMPTKINEDDDNEILHFPKIPNCPKYRFDQISSLYRSYVHGDPAWEFIRDSFRDNVA 216
Query: 173 I--VLIKSFRELEGKYIDYLS-DLIKKKVVPVGPLVQDPVEQTDHEKGATEIIHEYFLS- 228
+++ SF +EG Y+++L ++ +V VGP++ P+ D+ G T + ++ +S
Sbjct: 217 SWGLVVNSFTAMEGVYLEHLKREMGHDRVWAVGPII--PLSG-DNRGGPTSVSVDHVMSW 273
Query: 229 ---------------------KEEMEDIALGLELSGVNFIWVVRFPCGAKVKVDEELPES 267
KE+ +A GLE SGV+FIW V+ P K + +
Sbjct: 274 LDAREDNHVVYVCFGSQVVLTKEQTLALASGLEKSGVHFIWAVKEPV-EKDSTRGNILDG 332
Query: 268 FLERTKERAMVIEGWAPQMKILGHPSIGGFVSHCGWSSVMESMRLGVPIIAMPMHVDQPL 327
F +R R +VI GWAPQ+ +L H ++G F++HCGW+SV+E++ GV ++ PM DQ
Sbjct: 333 FDDRVAGRGLVIRGWAPQVAVLRHRAVGAFLTHCGWNSVVEAVVAGVLMLTWPMRADQYT 392
Query: 328 NARLVED-VGIGLEVRRNKCGRIQREEMARVIKEVVMEREGEKIKR-KTREMG-EKIKEK 384
+A LV D + +G+ +E+ARV + V + E+IK + R+ + I+E+
Sbjct: 393 DASLVVDELKVGVRACEGPDTVPDPDELARVFADSVTGNQTERIKAVELRKAALDAIQER 452
Query: 385 GEE 387
G
Sbjct: 453 GSS 455
>sp|Q9ZVX4|U90A1_ARATH UDP-glycosyltransferase 90A1 OS=Arabidopsis thaliana GN=UGT90A1
PE=2 SV=1
Length = 478
Score = 131 bits (329), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 109/393 (27%), Positives = 180/393 (45%), Gaps = 49/393 (12%)
Query: 28 IQLIELHLP-SLPELPPQYHTTKGLPP-HLMPTLKEAFDMASPSFFNILKNLSP-DLLIY 84
I++I L P ++ +PP T+ LP L A + P F LK L ++
Sbjct: 65 IKVISLPFPENITGIPPGVENTEKLPSMSLFVPFTRATKLLQPFFEETLKTLPKVSFMVS 124
Query: 85 DLIQPWAPALASSLNIPAVYFLVSSAATSAFMFHAIKKNSLGDANDDDEEFPSSSIFIHD 144
D W A+ NIP ++ ++A K + + P + + D
Sbjct: 125 DGFLWWTSESAAKFNIPRFVSYGMNSYSAAVSISVFKHELFTEPESKSDTEP---VTVPD 181
Query: 145 Y----YMKSYFSNMVESPTTKRLLQCFERSCNIV---------LIKSFRELEGKYIDYLS 191
+ K F + P E S + + L+ SF ELE ++DY +
Sbjct: 182 FPWIKVKKCDFDHGTTEPEESG--AALELSMDQIKSTTTSHGFLVNSFYELESAFVDYNN 239
Query: 192 DLIKK-KVVPVGPL-VQDPVEQTDHEKG-----------ATEIIHEYF-----LSKEEME 233
+ K K VGPL + DP +Q + +++ F +S +++
Sbjct: 240 NSGDKPKSWCVGPLCLTDPPKQGSAKPAWIHWLDQKREEGRPVLYVAFGTQAEISNKQLM 299
Query: 234 DIALGLELSGVNFIWVVRFPCGAKVKVDEELPESFLERTKERAMVIEGWAPQMKILGHPS 293
++A GLE S VNF+WV R V+E + E F +R +E M++ W Q +IL H S
Sbjct: 300 ELAFGLEDSKVNFLWVTR------KDVEEIIGEGFNDRIRESGMIVRDWVDQWEILSHES 353
Query: 294 IGGFVSHCGWSSVMESMRLGVPIIAMPMHVDQPLNARL-VEDVGIGLEVRRNKC---GRI 349
+ GF+SHCGW+S ES+ +GVP++A PM +QPLNA++ VE++ +G+ V G +
Sbjct: 354 VKGFLSHCGWNSAQESICVGVPLLAWPMMAEQPLNAKMVVEEIKVGVRVETEDGSVKGFV 413
Query: 350 QREEMARVIKEVVMEREGEKIKRKTREMGEKIK 382
REE++ IKE++ G+ ++ +E + K
Sbjct: 414 TREELSGKIKELMEGETGKTARKNVKEYSKMAK 446
>sp|Q9LK73|U88A1_ARATH UDP-glycosyltransferase 88A1 OS=Arabidopsis thaliana GN=UGT88A1
PE=2 SV=1
Length = 462
Score = 131 bits (329), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 89/333 (26%), Positives = 165/333 (49%), Gaps = 48/333 (14%)
Query: 69 SFFNILKNLSPDLLIYDLIQPWAPALASSLNIPAVYFLVSSAATSAFMFH--AIKKNSLG 126
+ F++ +N + +I D + + P +F S AA AF F+ I + + G
Sbjct: 103 TLFSLSRNFNVRAMIIDFFCTAVLDITADFTFPVYFFYTSGAACLAFSFYLPTIDETTPG 162
Query: 127 DANDDDEEFPSSSIFIHDYYMKSYFSNMVESPTTKRLLQCFERSCNI------------- 173
D P+ I M S K +L+ + ++
Sbjct: 163 KNLKD---IPTVHI--------PGVPPMKGSDMPKAVLERDDEVYDVFIMFGKQLSKSSG 211
Query: 174 VLIKSFRELEGKYIDYLSD-LIKKKVVPVGPL-VQDPVEQTDHEKGAT-----------E 220
++I +F LE + I +++ L + + P+GPL V +E + K +
Sbjct: 212 IIINTFDALENRAIKAITEELCFRNIYPIGPLIVNGRIEDRNDNKAVSCLNWLDSQPEKS 271
Query: 221 IIHEYF-----LSKEEMEDIALGLELSGVNFIWVVRFPC---GAKVKVDEELPESFLERT 272
++ F SKE++ +IA+GLE SG F+WVVR P ++ + LPE FL RT
Sbjct: 272 VVFLCFGSLGLFSKEQVIEIAVGLEKSGQRFLWVVRNPPELEKTELDLKSLLPEGFLSRT 331
Query: 273 KERAMVIEGWAPQMKILGHPSIGGFVSHCGWSSVMESMRLGVPIIAMPMHVDQPLN-ARL 331
+++ MV++ WAPQ+ +L H ++GGFV+HCGW+S++E++ GVP++A P++ +Q N +
Sbjct: 332 EDKGMVVKSWAPQVPVLNHKAVGGFVTHCGWNSILEAVCAGVPMVAWPLYAEQRFNRVMI 391
Query: 332 VEDVGIGLEVRRNKCGRIQREEMARVIKEVVME 364
V+++ I + + ++ G + E+ + ++E++ E
Sbjct: 392 VDEIKIAISMNESETGFVSSTEVEKRVQEIIGE 424
>sp|Q40287|UFOG5_MANES Anthocyanidin 3-O-glucosyltransferase 5 OS=Manihot esculenta GN=GT5
PE=2 SV=1
Length = 487
Score = 130 bits (326), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 87/271 (32%), Positives = 142/271 (52%), Gaps = 45/271 (16%)
Query: 150 YFSNMVESPTTKRLLQCFERSCNIVLIKSFRELEGKYIDYLSDLIKKKVVPVGPLVQ--- 206
YF +E PT +L + + R+ + +L + K V P+GPL +
Sbjct: 199 YFRLGIEIPTADGILMNTWEALEPTTFGALRD-----VKFLGRVAKVPVFPIGPLRRQAG 253
Query: 207 ---------DPVEQTDHEKGATEIIHEYF-----LSKEEMEDIALGLELSGVNFIWVVRF 252
D ++Q E +++ F LS E+M ++A GLE S FIWVVR
Sbjct: 254 PCGSNCELLDWLDQQPKES----VVYVSFGSGGTLSLEQMIELAWGLERSQQRFIWVVRQ 309
Query: 253 PC-------------GAKVKVDEELPESFLERTKERAMVIEGWAPQMKILGHPSIGGFVS 299
P GA + PE FL R + +V+ W+PQ+ I+ HPS+G F+S
Sbjct: 310 PTVKTGDAAFFTQGDGAD-DMSGYFPEGFLTRIQNVGLVVPQWSPQIHIMSHPSVGVFLS 368
Query: 300 HCGWSSVMESMRLGVPIIAMPMHVDQPLNARLV-EDVGIGLEVRRNKCGR-IQREEMARV 357
HCGW+SV+ES+ GVPIIA P++ +Q +NA L+ E++G+ + + ++REE+ R+
Sbjct: 369 HCGWNSVLESITAGVPIIAWPIYAEQRMNATLLTEELGVAVRPKNLPAKEVVKREEIERM 428
Query: 358 IKEVVMEREGEKIKRKTREM---GEKIKEKG 385
I+ ++++ EG +I+++ RE+ GEK +G
Sbjct: 429 IRRIMVDEEGSEIRKRVRELKDSGEKALNEG 459
>sp|Q66PF3|UFOG3_FRAAN Putative UDP-glucose flavonoid 3-O-glucosyltransferase 3
OS=Fragaria ananassa GN=GT3 PE=2 SV=1
Length = 478
Score = 129 bits (325), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 105/337 (31%), Positives = 163/337 (48%), Gaps = 55/337 (16%)
Query: 94 LASSLNIPAVYFLVSSAATSAFMFHAIKKNSLGDA-NDDDEEFPSSSIFIHDYYMKSYFS 152
+A+ L +P+ F S AAT +FH L D N D EF S + + S+F+
Sbjct: 131 VANQLGVPSYVFFTSGAATLGLLFHL---QELRDQYNKDCTEFKDSDA---ELIIPSFFN 184
Query: 153 NMVESPTTKRLLQCFERSCNIVLIKSFRELEGKYIDYLSDLIKKK------------VVP 200
+ R+L + +IK FRE +G ++ +DL V P
Sbjct: 185 PLPAKVLPGRMLVKDSAEPFLNVIKRFRETKGILVNTFTDLESHALHALSSDAEIPPVYP 244
Query: 201 VGPLVQ----DPVEQTDHEKGATEIIH---------EYFL--------SKEEMEDIALGL 239
VGPL+ + +D K +I+ FL + ++ +IA L
Sbjct: 245 VGPLLNLNSNESRVDSDEVKKKNDILKWLDDQPPLSVVFLCFGSMGSFDESQVREIANAL 304
Query: 240 ELSGVNFIWVVR-FPCGAKVKVDEE-------LPESFLERTKERAMVIEGWAPQMKILGH 291
E +G F+W +R P KV + LPE FL+RT VI GWAPQ+ +L H
Sbjct: 305 EHAGHRFLWSLRRSPPTGKVAFPSDYDDHTGVLPEGFLDRTGGIGKVI-GWAPQVAVLAH 363
Query: 292 PSIGGFVSHCGWSSVMESMRLGVPIIAMPMHVDQPLNA-RLVEDVGIGLEV----RRNKC 346
PS+GGFVSHCGW+S +ES+ GVP+ P++ +Q LNA + V+++ + +E+ R
Sbjct: 364 PSVGGFVSHCGWNSTLESLWHGVPVATWPLYAEQQLNAFQPVKELELAVEIDMSYRSKSP 423
Query: 347 GRIQREEMARVIKEVVMEREGEKIKRKTREMGEKIKE 383
+ +E+ R I+E VME + I+++ +EM EK K+
Sbjct: 424 VLVSAKEIERGIRE-VMELDSSDIRKRVKEMSEKGKK 459
>sp|Q9MB73|LGT_CITUN Limonoid UDP-glucosyltransferase OS=Citrus unshiu PE=2 SV=1
Length = 511
Score = 129 bits (324), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 164/335 (48%), Gaps = 47/335 (14%)
Query: 82 LIYDLIQPWAPALASSLNIPAVYFLVSSAATSA---FMFHAIKKNSLGDANDDDEEFPSS 138
LI + PW +A SL +P+ V S A A FH + + D + P
Sbjct: 118 LINNPFIPWVSDVAESLGLPSAMLWVQSCACFAAYYHYFHGLVPFPSEKEPEIDVQLPCM 177
Query: 139 SIFIHDYYMKSYFSNMVESPTTKR-LLQCFERSCN--IVLIKSFRELEGKYIDYLSDLIK 195
+ HD M S+ P +R +L +E +L+ +F ELE + IDY++ +
Sbjct: 178 PLLKHDE-MPSFLHPSTPYPFLRRAILGQYENLGKPFCILLDTFYELEKEIIDYMAKICP 236
Query: 196 KKVVPVGPLVQDP-----------------VEQTDHEKGATEIIHEY----FLSKEEMED 234
K PVGPL ++P ++ D + ++ + + +L +E++E+
Sbjct: 237 IK--PVGPLFKNPKAPTLTVRDDCMKPDECIDWLDKKPPSSVVYISFGTVVYLKQEQVEE 294
Query: 235 IALGLELSGVNFIWVVRFP---CGAKVKVDEELPESFLERTKERAMVIEGWAPQMKILGH 291
I L SG++F+WV++ P G K+ VD LP+ FLE+ ++ V++ W+PQ K+L H
Sbjct: 295 IGYALLNSGISFLWVMKPPPEDSGVKI-VD--LPDGFLEKVGDKGKVVQ-WSPQEKVLAH 350
Query: 292 PSIGGFVSHCGWSSVMESMRLGVPIIAMPMHVDQPLNARLVEDV-GIGLEVRRNKCGR-- 348
PS+ FV+HCGW+S MES+ GVP+I P DQ +A + DV GL + R +
Sbjct: 351 PSVACFVTHCGWNSTMESLASGVPVITFPQWGDQVTDAMYLCDVFKTGLRLCRGEAENRI 410
Query: 349 IQREEMARVI-------KEVVMEREGEKIKRKTRE 376
I R+E+ + + K V +E K K++ E
Sbjct: 411 ISRDEVEKCLLEATAGPKAVALEENALKWKKEAEE 445
>sp|Q8W4C2|U72B2_ARATH UDP-glycosyltransferase 72B2 OS=Arabidopsis thaliana GN=UGT72B2
PE=2 SV=1
Length = 480
Score = 128 bits (321), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 84/237 (35%), Positives = 129/237 (54%), Gaps = 29/237 (12%)
Query: 174 VLIKSFRELEGKYIDYLSDLI--KKKVVPVGPLVQDPVEQTDHE-----------KGATE 220
+L+ SF +LE I L + K V P+GPLV + E +
Sbjct: 210 ILVNSFVDLESNAIKALQEPAPDKPTVYPIGPLVNTSSSNVNLEDKFGCLSWLDNQPFGS 269
Query: 221 IIHEYF-----LSKEEMEDIALGLELSGVNFIWVVRFPC----------GAKVKVDEELP 265
+++ F L+ E+ ++A+GL SG FIWV+R P ++ LP
Sbjct: 270 VLYISFGSGGTLTCEQFNELAIGLAESGKRFIWVIRSPSEIVSSSYFNPHSETDPFSFLP 329
Query: 266 ESFLERTKERAMVIEGWAPQMKILGHPSIGGFVSHCGWSSVMESMRLGVPIIAMPMHVDQ 325
FL+RTKE+ +V+ WAPQ++IL HPS GF++HCGW+S +ES+ GVP+IA P+ +Q
Sbjct: 330 IGFLDRTKEKGLVVPSWAPQVQILAHPSTCGFLTHCGWNSTLESIVNGVPLIAWPLFAEQ 389
Query: 326 PLNA-RLVEDVGIGLEVRRNKCGRIQREEMARVIKEVVMEREGEKIKRKTREMGEKI 381
+N LVEDVG L + + G ++REE+ RV+K ++ EG+ I K +E+ E +
Sbjct: 390 KMNTLLLVEDVGAALRIHAGEDGIVRREEVVRVVKALMEGEEGKAIGNKVKELKEGV 446
>sp|O23205|U72C1_ARATH UDP-glycosyltransferase 72C1 OS=Arabidopsis thaliana GN=UGT72C1
PE=2 SV=3
Length = 457
Score = 128 bits (321), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 76/211 (36%), Positives = 124/211 (58%), Gaps = 26/211 (12%)
Query: 198 VVPVGPLVQDPVEQTDHE-------KGATEIIHEYF-----LSKEEMEDIALGLELSGVN 245
V PVGPLV+ H + +++ F L+ E+ ++A GLEL+G
Sbjct: 235 VYPVGPLVRPAEPGLKHGVLDWLDLQPKESVVYVSFGSGGALTFEQTNELAYGLELTGHR 294
Query: 246 FIWVVRFPCGA--------KVKVDEE----LPESFLERTKERAMVIEGWAPQMKILGHPS 293
F+WVVR P K K + E LP FL+RTK+ +V+ WAPQ +IL H S
Sbjct: 295 FVWVVRPPAEDDPSASMFDKTKNETEPLDFLPNGFLDRTKDIGLVVRTWAPQEEILAHKS 354
Query: 294 IGGFVSHCGWSSVMESMRLGVPIIAMPMHVDQPLNARLVE-DVGIGLEVRRNKCGRIQRE 352
GGFV+HCGW+SV+ES+ GVP++A P++ +Q +NAR+V ++ I L++ G +++E
Sbjct: 355 TGGFVTHCGWNSVLESIVNGVPMVAWPLYSEQKMNARMVSGELKIALQINVAD-GIVKKE 413
Query: 353 EMARVIKEVVMEREGEKIKRKTREMGEKIKE 383
+A ++K V+ E EG+++++ +E+ + +E
Sbjct: 414 VIAEMVKRVMDEEEGKEMRKNVKELKKTAEE 444
>sp|Q9LXV0|U92A1_ARATH UDP-glycosyltransferase 92A1 OS=Arabidopsis thaliana GN=UGT92A1
PE=2 SV=1
Length = 488
Score = 127 bits (319), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 119/421 (28%), Positives = 197/421 (46%), Gaps = 51/421 (12%)
Query: 6 ICFCSTPSILNSIKQLDKFSLSIQLIELHLPSLPE-LPPQYHTTKGLPPHLMPTLKEAFD 64
I +TPS + I+ SI LIEL S LP LP L+ +L EA
Sbjct: 46 ISMINTPSNIPKIRSNLPPESSISLIELPFNSSDHGLPHDGENFDSLPYSLVISLLEASR 105
Query: 65 MASPSFFNILKNL------SPDLLIYDLIQPWAPALASSLNIPAVYFLVSSAATSAFMFH 118
F + + + S ++I D W + + + +V F +S A +
Sbjct: 106 SLREPFRDFMTKILKEEGQSSVIVIGDFFLGWIGKVCKEVGVYSVIF-SASGAFGLGCYR 164
Query: 119 AIKKNSLGDANDDDE----EFPSSSIFIHDYYMKSYFSNMVESPTT-------KRLLQCF 167
+I N D+ +FP + I + S+ M+E+ T K+++ +
Sbjct: 165 SIWLNLPHKETKQDQFLLDDFPEAG-EIEKTQLNSF---MLEADGTDDWSVFMKKIIPGW 220
Query: 168 ERSCNIVLIKSFRELEGKYIDYLSDLIKKKVVPVGPLVQDPVEQTDHEKGATEIIHEYFL 227
+ L + E++ + Y + V PVGP+++ P ++ + E + +
Sbjct: 221 S-DFDGFLFNTVAEIDQMGLSYFRRITGVPVWPVGPVLKSPDKKVG-SRSTEEAVKSWLD 278
Query: 228 SKEE------------------MEDIALGLELSGVNFIWVVRFPCGAKVK----VDEELP 265
SK + M ++A+ LE S NFIWVVR P G +VK V LP
Sbjct: 279 SKPDHSVVYVCFGSMNSILQTHMLELAMALESSEKNFIWVVRPPIGVEVKSEFDVKGYLP 338
Query: 266 ESFLERT--KERAMVIEGWAPQMKILGHPSIGGFVSHCGWSSVMESMRLGVPIIAMPMHV 323
E F ER ER ++++ WAPQ+ IL H + F+SHCGW+S++ES+ GVP++ PM
Sbjct: 339 EGFEERITRSERGLLVKKWAPQVDILSHKATCVFLSHCGWNSILESLSHGVPLLGWPMAA 398
Query: 324 DQPLNARLVED-VGIGLEVRRNKCGRIQREEMARVIKEVVMERE-GEKIKRKTREMGEKI 381
+Q N+ L+E +G+ +EV R K I+ +++ IK V+ E E G++I++K RE+ E +
Sbjct: 399 EQFFNSILMEKHIGVSVEVARGKRCEIKCDDIVSKIKLVMEETEVGKEIRKKAREVKELV 458
Query: 382 K 382
+
Sbjct: 459 R 459
>sp|Q76MR7|UBGAT_SCUBA Baicalein 7-O-glucuronosyltransferase OS=Scutellaria baicalensis
GN=UBGAT-I PE=1 SV=1
Length = 441
Score = 127 bits (319), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 113/395 (28%), Positives = 177/395 (44%), Gaps = 57/395 (14%)
Query: 27 SIQLIELHLPSLPELPPQYHTTKGLPPHLMPTLKEAFDMASPSFFNILKNLSPDL----L 82
+I I H LPE+PP T + + E +++P+ L+ +S +
Sbjct: 33 AIPSISYHRLPLPEIPPDMTTDR------VELFFELPRLSNPNLLTALQQISQKTRIRAV 86
Query: 83 IYDLIQPWAPALASSLNIPAVYFLVSSAATS--AFMFHAIKKN---SLGDANDDDEEFPS 137
I D A + +SLNIP Y+ + T+ F I + L D ND +
Sbjct: 87 ILDFFCNAAFEVPTSLNIPTYYYFSAGTPTAILTLYFETIDETIPVDLQDLNDYVDIPGL 146
Query: 138 SSIFIHDYYMKSYFSNMVESPTTKRLLQCFERSCNIVLIKSFRELEGKYIDYLSDL---I 194
I D + + ++ + + RS I L+ F LE + I S
Sbjct: 147 PPIHCLDIPVALSPRKSLVYKSSVDISKNLRRSAGI-LVNGFDALEFRAIGSHSQRPMHF 205
Query: 195 KKKVVPV---GPLVQDPVEQTDHEKGATEIIHEYF---------------------LSKE 230
K PV GPLV D D + G+ E HE S +
Sbjct: 206 KGPTPPVYFIGPLVGD----VDTKAGSEE--HECLRWLDTQPSKSVVFLCFGRRGVFSAK 259
Query: 231 EMEDIALGLELSGVNFIWVVRFPCGAKVK-------VDEELPESFLERTKERAMVIEGWA 283
++++ A LE SG F+W VR P K +DE LPE FLERTK+R VI+ WA
Sbjct: 260 QLKETAAALENSGHRFLWSVRNPPELKKATGSDEPDLDELLPEGFLERTKDRGFVIKSWA 319
Query: 284 PQMKILGHPSIGGFVSHCGWSSVMESMRLGVPIIAMPMHVDQPLN-ARLVEDVGIGLEVR 342
PQ ++L H S+GGFV+HCG SSV E + GVP+I P+ + LN A +V+D+ + L +
Sbjct: 320 PQKEVLAHDSVGGFVTHCGRSSVSEGVWFGVPMIGWPVDAELRLNRAVMVDDLQVALPLE 379
Query: 343 RNKCGRIQREEMARVIKEVVMEREGEKIKRKTREM 377
G + E+ + ++E++ + G+ ++++ E+
Sbjct: 380 EEAGGFVTAAELEKRVRELMETKAGKAVRQRVTEL 414
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.320 0.137 0.408
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 153,190,484
Number of Sequences: 539616
Number of extensions: 6683743
Number of successful extensions: 24835
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 277
Number of HSP's successfully gapped in prelim test: 17
Number of HSP's that attempted gapping in prelim test: 24153
Number of HSP's gapped (non-prelim): 395
length of query: 401
length of database: 191,569,459
effective HSP length: 120
effective length of query: 281
effective length of database: 126,815,539
effective search space: 35635166459
effective search space used: 35635166459
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 62 (28.5 bits)