BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 045570
(468 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|O82383|U71D1_ARATH UDP-glycosyltransferase 71D1 OS=Arabidopsis thaliana GN=UGT71D1
PE=2 SV=1
Length = 467
Score = 461 bits (1186), Expect = e-129, Method: Compositional matrix adjust.
Identities = 241/471 (51%), Positives = 328/471 (69%), Gaps = 19/471 (4%)
Query: 1 MKKAELIFVPSPGIGHLVSTLEFAKHLTDRDDRISVTILSMKLAVAPWVDAYTKSLTDSQ 60
M+ ELIF+P+P +GHLV LEFA+ L ++DDRI +TIL MKL +D Y KS+ SQ
Sbjct: 1 MRNVELIFIPTPTVGHLVPFLEFARRLIEQDDRIRITILLMKLQGQSHLDTYVKSIASSQ 60
Query: 61 PRICVIDLPPVDPPLPDVLKKSPEYFISLVVESHLPNVKNIVSSRSNSGSL---QVTGLV 117
P + ID+P ++ +S E ++ V+E ++P V+NIV S +L +V GLV
Sbjct: 61 PFVRFIDVPELEEKPTLGSTQSVEAYVYDVIERNIPLVRNIVMDILTSLALDGVKVKGLV 120
Query: 118 LDFFCVSMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRISTVFESSDDELL-IPGI 176
+DFFC+ M+D+AK++SLP Y+FLT+N GFL +M YL R R ++VF + +E+L IPG
Sbjct: 121 VDFFCLPMIDVAKDISLPFYVFLTTNSGFLAMMQYLADRHSRDTSVFVRNSEEMLSIPGF 180
Query: 177 TSPVPVCVMPSCLFNKDGGHATLVKLAQRFKDVDGIIVNTFHELEPYAVNAFSGDLN-PP 235
+PVP V+PS LF +DG A VKLA F +GI+VN+ ++EPY+VN F + N P
Sbjct: 181 VNPVPANVLPSALFVEDGYDA-YVKLAILFTKANGILVNSSFDIEPYSVNHFLQEQNYPS 239
Query: 236 LYTAGPVLHLKSQPNPDLDEAQYQKIFQWLDDLAESSVVFLCFGSSGSFDVAQVKEIAIG 295
+Y GP+ LK+QP+P+ D + ++ +WLDD E+SVVFLCFGS + VKEIA G
Sbjct: 240 VYAVGPIFDLKAQPHPEQDLTRRDELMKWLDDQPEASVVFLCFGSMARLRGSLVKEIAHG 299
Query: 296 LERSGYNFLWSLRVSSPKDEVSAHRYVTNNGVFPEGFLERIKGRGMIWGWVPQVEILAHK 355
LE Y FLWSLR K+EV+ PEGFL+R+ GRGMI GW PQVEILAHK
Sbjct: 300 LELCQYRFLWSLR----KEEVTKDD-------LPEGFLDRVDGRGMICGWSPQVEILAHK 348
Query: 356 AIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKELGLALDLRLDYRVGSD- 414
A+GGFVSHCGWNSI+ESLW+GVPI TWP+YAEQQLNAF MVKEL LA++L+LDYRV SD
Sbjct: 349 AVGGFVSHCGWNSIVESLWFGVPIVTWPMYAEQQLNAFLMVKELKLAVELKLDYRVHSDE 408
Query: 415 LVMAGDIESAVRCLMDGENK-IRKKVKEMAEISRKSLMEGGSSFNSIGQFI 464
+V A +IE+A+R +MD +N +RK+V +++++ +++ GGSSF +I +FI
Sbjct: 409 IVNANEIETAIRYVMDTDNNVVRKRVMDISQMIQRATKNGGSSFAAIEKFI 459
>sp|Q9LML6|U71C4_ARATH UDP-glycosyltransferase 71C4 OS=Arabidopsis thaliana GN=UGT71C4
PE=2 SV=2
Length = 479
Score = 459 bits (1181), Expect = e-128, Method: Compositional matrix adjust.
Identities = 244/482 (50%), Positives = 327/482 (67%), Gaps = 24/482 (4%)
Query: 1 MKKAELIFVPSPGIGHLVSTLEFAKHLTDRDDRI-SVTILSMKLAVAPWVDAYTKSLTDS 59
+K+ ELIF+P P GH++ +EFAK L + D RI ++TIL++ +P + +SL S
Sbjct: 2 VKETELIFIPVPSTGHILVHIEFAKRLINLDHRIHTITILNLSSPSSPHASVFARSLIAS 61
Query: 60 QPRICVIDLPPV-DPPLPDVLKKSPEYFISLVVESHLPNVKNIVSS-----RSNSGSLQV 113
QP+I + DLPP+ DPP D+ +++PE +I +++ + P +K+ VSS R S S+QV
Sbjct: 62 QPKIRLHDLPPIQDPPPFDLYQRAPEAYIVKLIKKNTPLIKDAVSSIVASRRGGSDSVQV 121
Query: 114 TGLVLDFFCVSMV-DIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRISTVFE--SSDDE 170
GLVLD FC S+V D+ EL+LPSY++LT N +L +M Y+P R +I++ F+ S D+E
Sbjct: 122 AGLVLDLFCNSLVKDVGNELNLPSYIYLTCNARYLGMMKYIPDRHRKIASEFDLSSGDEE 181
Query: 171 LLIPGITSPVPVCVMPSCLFNKDGGHATLVKLAQRFKDVDGIIVNTFHELEPYAVNAFSG 230
L +PG + +P MP LFNK+ + V+LA RF D GI+VN+F ELEP+ + FS
Sbjct: 182 LPVPGFINAIPTKFMPPGLFNKEA-YEAYVELAPRFADAKGILVNSFTELEPHPFDYFSH 240
Query: 231 -DLNPPLYTAGPVLHLKSQPNPDLDEAQYQKIFQWLDDLAESSVVFLCFGSSGSFDVAQV 289
+ PP+Y GP+L LK + +P+ + +I WLDD ESSVVFLCFGS GS D QV
Sbjct: 241 LEKFPPVYPVGPILSLKDRASPNEEAVDRDQIVGWLDDQPESSVVFLCFGSRGSVDEPQV 300
Query: 290 KEIAIGLERSGYNFLWSLRVSSPKDEVSAHRYVTNNGVFPEGFLERIKGRGMIWGWVPQV 349
KEIA LE G FLWS+R S + N V PEGF+ R+ GRG++ GW PQV
Sbjct: 301 KEIARALELVGCRFLWSIRTSGDVE-------TNPNDVLPEGFMGRVAGRGLVCGWAPQV 353
Query: 350 EILAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKELGLALDLRLDY 409
E+LAHKAIGGFVSHCGWNS LESLW+GVP+ATWP+YAEQQLNAF +VKELGLA+DLR+DY
Sbjct: 354 EVLAHKAIGGFVSHCGWNSTLESLWFGVPVATWPMYAEQQLNAFTLVKELGLAVDLRMDY 413
Query: 410 ---RVGSDLVMAGDIESAVRCLMDGENKIRKKVKEMAEISRKSLMEGGSSFNSIGQFISL 466
R G LV +I AVR LMDG ++ RKKVKEMA+ +RK+LM+GGSS + +FI+
Sbjct: 414 VSSRGG--LVTCDEIARAVRSLMDGGDEKRKKVKEMADAARKALMDGGSSSLATARFIAE 471
Query: 467 NF 468
F
Sbjct: 472 LF 473
>sp|Q9LML7|U71C3_ARATH UDP-glycosyltransferase 71C3 OS=Arabidopsis thaliana GN=UGT71C3
PE=2 SV=1
Length = 476
Score = 443 bits (1139), Expect = e-123, Method: Compositional matrix adjust.
Identities = 238/474 (50%), Positives = 327/474 (68%), Gaps = 21/474 (4%)
Query: 3 KAELIFVPSPGIGHLVSTLEFAKHLTDRDDRI-SVTILSMKLAVAPWVDAYTKSLTDSQP 61
+AE+IFV P GHL+ ++EFAK L RDDRI ++TIL L +AP + KSL SQP
Sbjct: 4 EAEIIFVTYPSPGHLLVSIEFAKSLIKRDDRIHTITILYWALPLAPQAHLFAKSLVASQP 63
Query: 62 RICVIDLPPV-DPPLPDVLKKSPEYFISLVVESHLPNVKN----IVSSRSNSGSLQVTGL 116
RI ++ LP V +PP ++ K+PE +I + +P V++ +VSSR SGS++V GL
Sbjct: 64 RIRLLALPDVQNPPPLELFFKAPEAYILESTKKTVPLVRDALSTLVSSRKESGSVRVVGL 123
Query: 117 VLDFFCVSMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRISTV---FESSDDELLI 173
V+DFFCV M+++A EL+LPSY+FLT N GFL +M YLP R RI+T S + E I
Sbjct: 124 VIDFFCVPMIEVANELNLPSYIFLTCNAGFLSMMKYLPERH-RITTSELDLSSGNVEHPI 182
Query: 174 PGITSPVPVCVMPSCLFNKDGGHATLVKLAQRFKDVDGIIVNTFHELEPYAVNAFSG-DL 232
PG VP V+P LF ++ A V++A++F GI+VN+ LE A + F+ D
Sbjct: 183 PGYVCSVPTKVLPPGLFVRESYEA-WVEIAEKFPGAKGILVNSVTCLEQNAFDYFARLDE 241
Query: 233 N-PPLYTAGPVLHLKSQPNPDLDEAQYQKIFQWLDDLAESSVVFLCFGSSGSFDVAQVKE 291
N PP+Y GPVL LK +P+P+LD + +I +WL+D ESS+V++CFGS G Q++E
Sbjct: 242 NYPPVYPVGPVLSLKDRPSPNLDASDRDRIMRWLEDQPESSIVYICFGSLGIIGKLQIEE 301
Query: 292 IAIGLERSGYNFLWSLRVSSPKDEVSAHRYVTNNGVFPEGFLERIKGRGMIWGWVPQVEI 351
IA LE +G+ FLWS+R ++P ++ S + + PEGFL+R +G++ W PQVE+
Sbjct: 302 IAEALELTGHRFLWSIR-TNPTEKASPY------DLLPEGFLDRTASKGLVCDWAPQVEV 354
Query: 352 LAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKELGLALDLRLDY-R 410
LAHKA+GGFVSHCGWNS+LESLW+GVPIATWP+YAEQQLNAF MVKELGLA++LRLDY
Sbjct: 355 LAHKALGGFVSHCGWNSVLESLWFGVPIATWPMYAEQQLNAFSMVKELGLAVELRLDYVS 414
Query: 411 VGSDLVMAGDIESAVRCLMDGENKIRKKVKEMAEISRKSLMEGGSSFNSIGQFI 464
++V A +I A+R LMDGE+ RK+VKEMAE +R +LM+GGSSF ++ +F+
Sbjct: 415 AYGEIVKAEEIAGAIRSLMDGEDTPRKRVKEMAEAARNALMDGGSSFVAVKRFL 468
>sp|O82385|U71D2_ARATH UDP-glycosyltransferase 71D2 OS=Arabidopsis thaliana GN=UGT71D2
PE=2 SV=1
Length = 467
Score = 442 bits (1136), Expect = e-123, Method: Compositional matrix adjust.
Identities = 233/472 (49%), Positives = 324/472 (68%), Gaps = 21/472 (4%)
Query: 1 MKKAELIFVPSPGIGHLVSTLEFAKHLTDRDDRISVTILSMKLAVAPWVDAYTKSLTDSQ 60
M+ AELIF+P+P +GHLV LEFA+ L ++DDRI +T L MK +D+Y K+++ S
Sbjct: 1 MRNAELIFIPTPTVGHLVPFLEFARRLIEQDDRIRITFLLMKQQGQSHLDSYVKTISSSL 60
Query: 61 PRICVIDLPPVDPPLPDVLKKSPEYFISLVVESHLPNVKNIV----SSRSNSGSLQVTGL 116
P + ID+P ++ P + +S E ++ +E+++P V+NI+ SS + G + V G
Sbjct: 61 PFVRFIDVPELEEK-PTLGTQSVEAYVYDFIETNVPLVQNIIMGILSSPAFDG-VTVKGF 118
Query: 117 VLDFFCVSMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRISTVFESSDDELL-IPG 175
V DFFC+ M+D+AK+ SLP Y+FLTSN GFL +M YL + ++VF + +E+L IPG
Sbjct: 119 VADFFCLPMIDVAKDASLPFYVFLTSNSGFLAMMQYLAYGHKKDTSVFARNSEEMLSIPG 178
Query: 176 ITSPVPVCVMPSCLFNKDGGHATLVKLAQRFKDVDGIIVNTFHELEPYAVNAFSGDLN-P 234
+PVP V+PS LF +DG A VKLA F +GI+VNT ++EP ++N F G+ N P
Sbjct: 179 FVNPVPAKVLPSALFIEDGYDAD-VKLAILFTKANGILVNTSFDIEPTSLNHFLGEENYP 237
Query: 235 PLYTAGPVLHLKSQPNPDLDEAQYQKIFQWLDDLAESSVVFLCFGSSGSFDVAQVKEIAI 294
+Y GP+ + K+ P+PD D A + +WLD E+SVVFLCFGS GS VKEIA
Sbjct: 238 SVYAVGPIFNPKAHPHPDQDLACCDESMKWLDAQPEASVVFLCFGSMGSLRGPLVKEIAH 297
Query: 295 GLERSGYNFLWSLRVSSPKDEVSAHRYVTNNGVFPEGFLERIKGRGMIWGWVPQVEILAH 354
GLE Y FLWSLR VTN+ + PEGF++R+ GRGMI GW PQVEILAH
Sbjct: 298 GLELCQYRFLWSLRTEE----------VTNDDLLPEGFMDRVSGRGMICGWSPQVEILAH 347
Query: 355 KAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKELGLALDLRLDYRVGS- 413
KA+GGFVSHCGWNSI+ESLW+GVPI TWP+YAEQQLNAF MVKEL LA++L+LDY V S
Sbjct: 348 KAVGGFVSHCGWNSIVESLWFGVPIVTWPMYAEQQLNAFLMVKELKLAVELKLDYSVHSG 407
Query: 414 DLVMAGDIESAVRCLMDGENK-IRKKVKEMAEISRKSLMEGGSSFNSIGQFI 464
++V A +IE+A+ C+M+ +N +RK+V +++++ +++ GGSSF +I +FI
Sbjct: 408 EIVSANEIETAISCVMNKDNNVVRKRVMDISQMIQRATKNGGSSFAAIEKFI 459
>sp|Q9FE68|U71C5_ARATH UDP-glycosyltransferase 71C5 OS=Arabidopsis thaliana GN=UGT71C5
PE=2 SV=1
Length = 480
Score = 434 bits (1117), Expect = e-121, Method: Compositional matrix adjust.
Identities = 235/480 (48%), Positives = 315/480 (65%), Gaps = 24/480 (5%)
Query: 1 MKKAELIFVPSPGIGHLVSTLEFAKHLTDRDDRIS-VTILSMKLAVAPWVDAYTKSLTDS 59
MK AELIFVP P GHL+ST+EF K L + D RIS +TILSM L AP DA SLT S
Sbjct: 1 MKTAELIFVPLPETGHLLSTIEFGKRLLNLDRRISMITILSMNLPYAPHADASLASLTAS 60
Query: 60 QPRICVIDLPPV-DPPLPDVLKKSPEYFISLVVESHLPNVKNIVSS------RSNSGSLQ 112
+P I +I LP + DPP +L S E +I + ++P ++ + S GS
Sbjct: 61 EPGIRIISLPEIHDPPPIKLLDTSSETYILDFIHKNIPCLRKTIQDLVSSSSSSGGGSSH 120
Query: 113 VTGLVLDFFCVSMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRISTVFE--SSDDE 170
V GL+LDFFCV ++DI +E++LPSY+F+TSN GFL ++ YLP RQ + F+ S ++E
Sbjct: 121 VAGLILDFFCVGLIDIGREVNLPSYIFMTSNFGFLGVLQYLPERQRLTPSEFDESSGEEE 180
Query: 171 LLIPGITSPVPVCVMPSCLFNKDGGHATLVKLAQRFKDVDGIIVNTFHELEPYAVNAFS- 229
L IP + VP V+P +F+K + +LVK+ +R + GI+VN+F ++EPYA FS
Sbjct: 181 LHIPAFVNRVPAKVLPPGVFDKLS-YGSLVKIGERLHEAKGILVNSFTQVEPYAAEHFSQ 239
Query: 230 GDLNPPLYTAGPVLHLKSQPNPDLDEAQYQKIFQWLDDLAESSVVFLCFGSSGSFDVAQV 289
G P +Y GPVL+L + NP L AQY+++ +WLD+ +SSV+FLCFGS G F Q+
Sbjct: 240 GRDYPHVYPVGPVLNLTGRTNPGLASAQYKEMMKWLDEQPDSSVLFLCFGSMGVFPAPQI 299
Query: 290 KEIAIGLERSGYNFLWSLRVSSPKDEVSAHRYVTNNGVFPEGFLERIKGRGMIWGWVPQV 349
EIA LE G F+W++R + D PEGF++R GRG++ W PQV
Sbjct: 300 TEIAHALELIGCRFIWAIRTNMAGDGDPQEP-------LPEGFVDRTMGRGIVCSWAPQV 352
Query: 350 EILAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKELGLALDLRLDY 409
+ILAHKA GGFVSHCGWNS+ ESLWYGVPIATWP+YAEQQLNAF MVKELGLA+++RLDY
Sbjct: 353 DILAHKATGGFVSHCGWNSVQESLWYGVPIATWPMYAEQQLNAFEMVKELGLAVEIRLDY 412
Query: 410 -----RVGSDLVMAGDIESAVRCLMDGENKIRKKVKEMAEISRKSLMEGGSSFNSIGQFI 464
RV ++V A +I +AVR LMD +N +RKKV E + ++RK++ +GGSS + FI
Sbjct: 413 VADGDRVTLEIVSADEIATAVRSLMDSDNPVRKKVIEKSSVARKAVGDGGSSTVATCNFI 472
>sp|Q66PF3|UFOG3_FRAAN Putative UDP-glucose flavonoid 3-O-glucosyltransferase 3
OS=Fragaria ananassa GN=GT3 PE=2 SV=1
Length = 478
Score = 418 bits (1074), Expect = e-116, Method: Compositional matrix adjust.
Identities = 229/477 (48%), Positives = 316/477 (66%), Gaps = 18/477 (3%)
Query: 2 KKAELIFVPSPGIGHLVSTLEFAKHLTDRDDRISVTILSMKL-AVAPWVDAYTKSLTDSQ 60
K AEL+ +PSPGIGHLVSTLE AK L RDD++ +T+L M AV+ DAY +SL DS
Sbjct: 3 KPAELVLIPSPGIGHLVSTLEIAKLLVSRDDKLFITVLIMHFPAVSKGTDAYVQSLADSS 62
Query: 61 P----RICVIDLPPVDPPLPDVLKKSPEYFISLVVESHLPNVKNIVSSRSNSGSLQVTGL 116
RI I+LP + D + S + VES P+VK+ V++ +S + ++ G
Sbjct: 63 SPISQRINFINLPHTNM---DHTEGSVRNSLVGFVESQQPHVKDAVANLRDSKTTRLAGF 119
Query: 117 VLDFFCVSMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRIS---TVFESSDDELLI 173
V+D FC +M+++A +L +PSY+F TS L L+ +L +D+ + T F+ SD EL+I
Sbjct: 120 VVDMFCTTMINVANQLGVPSYVFFTSGAATLGLLFHLQELRDQYNKDCTEFKDSDAELII 179
Query: 174 PGITSPVPVCVMPSCLFNKDGGHATLVKLAQRFKDVDGIIVNTFHELEPYAVNAFSGDLN 233
P +P+P V+P + KD L + +RF++ GI+VNTF +LE +A++A S D
Sbjct: 180 PSFFNPLPAKVLPGRMLVKDSAEPFL-NVIKRFRETKGILVNTFTDLESHALHALSSDAE 238
Query: 234 -PPLYTAGPVLHLKS-QPNPDLDEAQYQK-IFQWLDDLAESSVVFLCFGSSGSFDVAQVK 290
PP+Y GP+L+L S + D DE + + I +WLDD SVVFLCFGS GSFD +QV+
Sbjct: 239 IPPVYPVGPLLNLNSNESRVDSDEVKKKNDILKWLDDQPPLSVVFLCFGSMGSFDESQVR 298
Query: 291 EIAIGLERSGYNFLWSLRVSSPKDEVS-AHRYVTNNGVFPEGFLERIKGRGMIWGWVPQV 349
EIA LE +G+ FLWSLR S P +V+ Y + GV PEGFL+R G G + GW PQV
Sbjct: 299 EIANALEHAGHRFLWSLRRSPPTGKVAFPSDYDDHTGVLPEGFLDRTGGIGKVIGWAPQV 358
Query: 350 EILAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKELGLALDLRLDY 409
+LAH ++GGFVSHCGWNS LESLW+GVP+ATWP+YAEQQLNAF+ VKEL LA+++ + Y
Sbjct: 359 AVLAHPSVGGFVSHCGWNSTLESLWHGVPVATWPLYAEQQLNAFQPVKELELAVEIDMSY 418
Query: 410 RVGSD-LVMAGDIESAVRCLMD-GENKIRKKVKEMAEISRKSLMEGGSSFNSIGQFI 464
R S LV A +IE +R +M+ + IRK+VKEM+E +K+LM+GGSS+ S+G FI
Sbjct: 419 RSKSPVLVSAKEIERGIREVMELDSSDIRKRVKEMSEKGKKALMDGGSSYTSLGHFI 475
>sp|O82382|U71C2_ARATH UDP-glycosyltransferase 71C2 OS=Arabidopsis thaliana GN=UGT71C2
PE=1 SV=1
Length = 474
Score = 414 bits (1063), Expect = e-114, Method: Compositional matrix adjust.
Identities = 232/477 (48%), Positives = 315/477 (66%), Gaps = 24/477 (5%)
Query: 2 KKAELIFVPSPGIGHLVSTLEFAKHL-TDRDDRI-SVTILSMKLAVAPWVD--AYTKSLT 57
++AELIF+P P GH+++T+E AK L + + RI ++TIL L P D A+ KSL
Sbjct: 5 QEAELIFIPFPIPGHILATIELAKRLISHQPSRIHTITILHWSLPFLPQSDTIAFLKSLI 64
Query: 58 DSQPRICVIDLPPV-DPPLPDVLKKSPEYFISLVVESHLPNVKN----IVSSRSNSGSLQ 112
+++ RI +I LP V +PP ++ K+ E +I V+ +P V+N ++SSR S S+
Sbjct: 65 ETESRIRLITLPDVQNPPPMELFVKASESYILEYVKKMVPLVRNALSTLLSSRDESDSVH 124
Query: 113 VTGLVLDFFCVSMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRISTVFESSDDE-- 170
V GLVLDFFCV ++D+ E +LPSY+FLT + FL +M YL R S DE
Sbjct: 125 VAGLVLDFFCVPLIDVGNEFNLPSYIFLTCSASFLGMMKYLLERNRETKPELNRSSDEET 184
Query: 171 LLIPGITSPVPVCVMPSCLFNKDGGHATLVKLAQRFKDVDGIIVNTFHELEPYAVNAFSG 230
+ +PG + VPV V+P LF + A V++A+RF + GI+VN+F LE A + F
Sbjct: 185 ISVPGFVNSVPVKVLPPGLFTTESYEA-WVEMAERFPEAKGILVNSFESLERNAFDYFDR 243
Query: 231 --DLNPPLYTAGPVLHLKSQPNPDLDEAQYQKIFQWLDDLAESSVVFLCFGSSGSFDVAQ 288
D PP+Y GP+L +PN DL E +I +WLDD ESSVVFLCFGS S +Q
Sbjct: 244 RPDNYPPVYPIGPILCSNDRPNLDLSERD--RILKWLDDQPESSVVFLCFGSLKSLAASQ 301
Query: 289 VKEIAIGLERSGYNFLWSLRVSSPKDEVSAHRYVTNNGVFPEGFLERIKGRGMIWGWVPQ 348
+KEIA LE G FLWS+R + PK+ Y + N + P+GF+ R+ G G++ GW PQ
Sbjct: 302 IKEIAQALELVGIRFLWSIR-TDPKE------YASPNEILPDGFMNRVMGLGLVCGWAPQ 354
Query: 349 VEILAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKELGLALDLRLD 408
VEILAHKAIGGFVSHCGWNSILESL +GVPIATWP+YAEQQLNAF +VKELGLAL++RLD
Sbjct: 355 VEILAHKAIGGFVSHCGWNSILESLRFGVPIATWPMYAEQQLNAFTIVKELGLALEMRLD 414
Query: 409 Y-RVGSDLVMAGDIESAVRCLMDGENKIRKKVKEMAEISRKSLMEGGSSFNSIGQFI 464
Y ++V A +I AVR LMDGE+ R+K+KE+AE ++++M+GGSSF ++ +FI
Sbjct: 415 YVSEYGEIVKADEIAGAVRSLMDGEDVPRRKLKEIAEAGKEAVMDGGSSFVAVKRFI 471
>sp|O82381|U71C1_ARATH UDP-glycosyltransferase 71C1 OS=Arabidopsis thaliana GN=UGT71C1
PE=1 SV=1
Length = 481
Score = 399 bits (1024), Expect = e-110, Method: Compositional matrix adjust.
Identities = 226/477 (47%), Positives = 311/477 (65%), Gaps = 25/477 (5%)
Query: 2 KKAELIFVPSPGIGHLVSTLEFAKHLTDRDD-RI-SVTILSMKLAVAPWVD--AYTKSLT 57
+ AEL+ +P P GH+++T+E AK L +D+ RI ++TIL L P D A+ +SL
Sbjct: 5 EDAELVIIPFPFSGHILATIELAKRLISQDNPRIHTITILYWGLPFIPQADTIAFLRSLV 64
Query: 58 DSQPRICVIDLPPV-DPPLPDVLKKSPEYFISLVVESHLPNVK----NIVSSRSNSGSLQ 112
++PRI ++ LP V DPP ++ + E +I V+ +P ++ ++SSR SGS++
Sbjct: 65 KNEPRIRLVTLPEVQDPPPMELFVEFAESYILEYVKKMVPIIREALSTLLSSRDESGSVR 124
Query: 113 VTGLVLDFFCVSMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRISTVFESS-DDEL 171
V GLVLDFFCV M+D+ E +LPSY+FLT + GFL +M YLP R I + F S ++EL
Sbjct: 125 VAGLVLDFFCVPMIDVGNEFNLPSYIFLTCSAGFLGMMKYLPERHREIKSEFNRSFNEEL 184
Query: 172 -LIPGITSPVPVCVMPSCLFNKDGGHATLVKLAQRFKDVDGIIVNTFHELEPYAVNAFSG 230
LIPG + VP V+PS LF K+ + V+LA+RF + GI+VN++ LEP F
Sbjct: 185 NLIPGYVNSVPTKVLPSGLFMKET-YEPWVELAERFPEAKGILVNSYTALEPNGFKYFDR 243
Query: 231 --DLNPPLYTAGPVLHLKSQPNPDLDEAQYQKIFQWLDDLAESSVVFLCFGSSGSFDVAQ 288
D P +Y GP+L +PN LD ++ +I WLDD ESSVVFLCFGS + Q
Sbjct: 244 CPDNYPTIYPIGPILCSNDRPN--LDSSERDRIITWLDDQPESSVVFLCFGSLKNLSATQ 301
Query: 289 VKEIAIGLERSGYNFLWSLRVSSPKDEVSAHRYVTNNGVFPEGFLERIKGRGMIWGWVPQ 348
+ EIA LE F+WS R ++PK+ S + P GF++R+ +G++ GW PQ
Sbjct: 302 INEIAQALEIVDCKFIWSFR-TNPKEYASPYE------ALPHGFMDRVMDQGIVCGWAPQ 354
Query: 349 VEILAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKELGLALDLRLD 408
VEILAHKA+GGFVSHCGWNSILESL +GVPIATWP+YAEQQLNAF MVKELGLAL++RLD
Sbjct: 355 VEILAHKAVGGFVSHCGWNSILESLGFGVPIATWPMYAEQQLNAFTMVKELGLALEMRLD 414
Query: 409 Y-RVGSDLVMAGDIESAVRCLMDGENKIRKKVKEMAEISRKSLMEGGSSFNSIGQFI 464
Y D+V A +I VR LMDG + + KVKE+AE ++++ +GGSSF ++ +FI
Sbjct: 415 YVSEDGDIVKADEIAGTVRSLMDGVDVPKSKVKEIAEAGKEAV-DGGSSFLAVKRFI 470
>sp|Q2V6K0|UFOG6_FRAAN UDP-glucose flavonoid 3-O-glucosyltransferase 6 OS=Fragaria
ananassa GN=GT6 PE=1 SV=1
Length = 479
Score = 399 bits (1024), Expect = e-110, Method: Compositional matrix adjust.
Identities = 221/476 (46%), Positives = 304/476 (63%), Gaps = 15/476 (3%)
Query: 1 MKKA-ELIFVPSPGIGHLVSTLEFAKHLTDRDDRISVTILSMKLA-VAPWVDAYTKSLTD 58
MKKA ELIF+P PGIGH+VST+E AK L RDD + +TIL MK A D Y KSL
Sbjct: 1 MKKASELIFIPIPGIGHIVSTVEIAKLLLCRDDNLFITILIMKFPFTADGSDVYIKSLA- 59
Query: 59 SQPRICVIDLPPVDPPLPDVLKKSPEYFISLVVESHLPNVKNIVSS--RSNSGSLQVTGL 116
P + + V+ P F + + +SH +VK+ V+ + S + ++ G
Sbjct: 60 VDPSLKTQRIRFVNLPQEHFQGTGATGFFTFI-DSHKSHVKDAVTRLMETKSETTRIAGF 118
Query: 117 VLDFFCVSMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQD---RISTVFESSDDELLI 173
V+D FC M+D+A E LPSY+F TS L LM +L +D + T F+ SD EL++
Sbjct: 119 VIDMFCTGMIDLANEFGLPSYVFYTSGAADLGLMFHLQALRDEENKDCTEFKDSDAELVV 178
Query: 174 PGITSPVPVC-VMPSCLFNKDGGHATLVKLAQRFKDVDGIIVNTFHELEPYAVNAFSGDL 232
+P+P V+PS +F K+GG+ L A+R+++ GI+VNTF ELEP+A+ + S D
Sbjct: 179 SSFVNPLPAARVLPSVVFEKEGGNFFL-NFAKRYRETKGILVNTFLELEPHAIQSLSSDG 237
Query: 233 NP-PLYTAGPVLHLKSQPNPDLDEAQYQK--IFQWLDDLAESSVVFLCFGSSGSFDVAQV 289
P+Y GP+L++KS+ N E QK I +WLDD SSVVFLCFGS G F QV
Sbjct: 238 KILPVYPVGPILNVKSEGNQVSSEKSKQKSDILEWLDDQPPSSVVFLCFGSMGCFGEDQV 297
Query: 290 KEIAIGLERSGYNFLWSLRVSSPKDEVSAHRYVTNNGVFPEGFLERIKGRGMIWGWVPQV 349
KEIA LE+ G FLWSLR S + Y V PEGFL+R G + GW PQ+
Sbjct: 298 KEIAHALEQGGIRFLWSLRQPSKEKIGFPSDYTDYKAVLPEGFLDRTTDLGKVIGWAPQL 357
Query: 350 EILAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKELGLALDLRLDY 409
ILAH A+GGFVSHCGWNS LES+WYGVPIATWP YAEQQ+NAF +VKEL LA+++ + Y
Sbjct: 358 AILAHPAVGGFVSHCGWNSTLESIWYGVPIATWPFYAEQQVNAFELVKELKLAVEIDMGY 417
Query: 410 RVGSDLVMAGD-IESAVRCLMDGENKIRKKVKEMAEISRKSLMEGGSSFNSIGQFI 464
R S ++++ + IE ++ +M+ E+++RK+VKEM+++SRK+L E GSS++S+G+F+
Sbjct: 418 RKDSGVIVSRENIEKGIKEVMEQESELRKRVKEMSQMSRKALEEDGSSYSSLGRFL 473
>sp|Q9LSY8|U71B2_ARATH UDP-glycosyltransferase 71B2 OS=Arabidopsis thaliana GN=UGT71B2
PE=1 SV=1
Length = 485
Score = 373 bits (958), Expect = e-102, Method: Compositional matrix adjust.
Identities = 213/488 (43%), Positives = 296/488 (60%), Gaps = 36/488 (7%)
Query: 3 KAELIFVPSPGIGHLVSTLEFAKHLTDRDDRISVTILSMKLAVAPWVDAYTKSLT----- 57
K EL+F+PSPG GHL +E AK DRDD +S+TI+ + P + ++ S +
Sbjct: 2 KLELVFIPSPGDGHLRPLVEVAKLHVDRDDHLSITII-----IIPQMHGFSSSNSSSYIA 56
Query: 58 ----DSQPRICVIDLPPVDPPLPDVLKKSPEYFISLVVESHLPNVKNIVSSRSNSGS--- 110
DS+ R+ L D P D K P +F +++ P VK V ++ G
Sbjct: 57 SLSSDSEERLSYNVLSVPDKPDSDDTK--PHFFD--YIDNFKPQVKATVEKLTDPGPPDS 112
Query: 111 -LQVTGLVLDFFCVSMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQD----RISTVFE 165
++ G V+D FC+ M+D+A E +PSYMF TSN FL L +++ D +S + +
Sbjct: 113 PSRLAGFVVDMFCMMMIDVANEFGVPSYMFYTSNATFLGLQVHVEYLYDVKNYDVSDLKD 172
Query: 166 SSDDELLIPGITSPVPVCVMPSCLFNKDGGHATLVKLAQRFKDVDGIIVNTFHELEPYAV 225
S EL +P +T P+PV PS L K+ + + +RF++ GI+VNTF ELEP A+
Sbjct: 173 SDTTELEVPCLTRPLPVKCFPSVLLTKEW-LPVMFRQTRRFRETKGILVNTFAELEPQAM 231
Query: 226 NAFSGDLNP--PLYTAGPVLHLKSQPNPDLDEAQYQKIFQWLDDLAESSVVFLCFGSSGS 283
FSG +P +YT GPV++LK P+ + + +I +WLD+ SVVFLCFGS G
Sbjct: 232 KFFSGVDSPLPTVYTVGPVMNLKIN-GPNSSDDKQSEILRWLDEQPRKSVVFLCFGSMGG 290
Query: 284 FDVAQVKEIAIGLERSGYNFLWSLRVSSPKDEVSAHRYVTN-NGVFPEGFLERIKGRGMI 342
F Q KEIAI LERSG+ F+WSLR + PK + TN + PEGFLER G I
Sbjct: 291 FREGQAKEIAIALERSGHRFVWSLRRAQPKGSIGPPEEFTNLEEILPEGFLERTAEIGKI 350
Query: 343 WGWVPQVEILAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKELGLA 402
GW PQ ILA+ AIGGFVSHCGWNS LESLW+GVP+ATWP+YAEQQ+NAF MV+ELGLA
Sbjct: 351 VGWAPQSAILANPAIGGFVSHCGWNSTLESLWFGVPMATWPLYAEQQVNAFEMVEELGLA 410
Query: 403 LDLRLDYR-----VGSDLVMAGDIESAVRCLMDGENKIRKKVKEMAEISRKSLMEGGSSF 457
+++R +R +L+ A +IE +RCLM+ ++ +R +VKEM+E S +LM+GGSS
Sbjct: 411 VEVRNSFRGDFMAADDELMTAEEIERGIRCLMEQDSDVRSRVKEMSEKSHVALMDGGSSH 470
Query: 458 NSIGQFIS 465
++ +FI
Sbjct: 471 VALLKFIQ 478
>sp|Q9LSY5|U71B7_ARATH UDP-glycosyltransferase 71B7 OS=Arabidopsis thaliana GN=UGT71B7
PE=2 SV=2
Length = 495
Score = 363 bits (932), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 206/482 (42%), Positives = 288/482 (59%), Gaps = 28/482 (5%)
Query: 3 KAELIFVPSPGIGHLVSTLEFAKHLTDRDDRISVTILSMKLAVAPWVDA--YTKSLT-DS 59
K EL+F+P PGIGHL ST+E AK L DR+ R+S++++ + V A Y +L+ S
Sbjct: 2 KFELVFIPYPGIGHLRSTVEMAKLLVDRETRLSISVIILPFISEGEVGASDYIAALSASS 61
Query: 60 QPRICVIDLPPVDPPLPDVLKKSPEYFISLVVESHLPNVKNIVSS-----RSNSGSLQVT 114
R+ + VD P ++ I + +++ P V++ V+ S S ++
Sbjct: 62 NNRLRYEVISAVDQPTIEMTT------IEIHMKNQEPKVRSTVAKLLEDYSSKPDSPKIA 115
Query: 115 GLVLDFFCVSMVDIAKELSLPSYMFLTSNMGFLRL-----MLYLPTRQDRISTVFESSDD 169
G VLD FC SMVD+A E PSYMF TS+ G L + ML + D + S+
Sbjct: 116 GFVLDMFCTSMVDVANEFGFPSYMFYTSSAGILSVTYHVQMLCDENKYDVSENDYADSEA 175
Query: 170 ELLIPGITSPVPVCVMPSCLFNKDGGHATLVKLAQRFKDVDGIIVNTFHELEPYAVNAFS 229
L P ++ P PV +P L + V A++F+++ GI+VNT ELEPY + S
Sbjct: 176 VLNFPSLSRPYPVKCLPHAL-AANMWLPVFVNQARKFREMKGILVNTVAELEPYVLKFLS 234
Query: 230 GDLNPPLYTAGPVLHLKSQPNPDLDEAQYQKIFQWLDDLAESSVVFLCFGSSGSFDVAQV 289
PP+Y GP+LHL++Q + DE + + I +WLD SSVVFLCFGS G F QV
Sbjct: 235 SSDTPPVYPVGPLLHLENQRDDSKDEKRLE-IIRWLDQQPPSSVVFLCFGSMGGFGEEQV 293
Query: 290 KEIAIGLERSGYNFLWSLRVSSPKDEVSAHRYVTN-NGVFPEGFLERIKGRGMIWGWVPQ 348
+EIAI LERSG+ FLWSLR +SP TN V PEGF +R K G + GW PQ
Sbjct: 294 REIAIALERSGHRFLWSLRRASPNIFKELPGEFTNLEEVLPEGFFDRTKDIGKVIGWAPQ 353
Query: 349 VEILAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKELGLALDLRLD 408
V +LA+ AIGGFV+HCGWNS LESLW+GVP A WP+YAEQ+ NAF MV+ELGLA+++R
Sbjct: 354 VAVLANPAIGGFVTHCGWNSTLESLWFGVPTAAWPLYAEQKFNAFLMVEELGLAVEIRKY 413
Query: 409 YR------VGSDLVMAGDIESAVRCLMDGENKIRKKVKEMAEISRKSLMEGGSSFNSIGQ 462
+R + + V A +IE A+ CLM+ ++ +RK+VK+M+E +LM+GGSS ++ +
Sbjct: 414 WRGEHLAGLPTATVTAEEIEKAIMCLMEQDSDVRKRVKDMSEKCHVALMDGGSSRTALQK 473
Query: 463 FI 464
FI
Sbjct: 474 FI 475
>sp|O23382|U71B5_ARATH UDP-glycosyltransferase 71B5 OS=Arabidopsis thaliana GN=UGT71B5
PE=3 SV=1
Length = 478
Score = 353 bits (906), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 207/483 (42%), Positives = 285/483 (59%), Gaps = 33/483 (6%)
Query: 3 KAELIFVPSPGIGHLVSTLEFAKHLTDRDDRISVTILSMKLAV-APWVDAYTKSLTD-SQ 60
K EL+F+P PGIGHL T++ AK L ++R+S+TI+ + A A SLT SQ
Sbjct: 2 KIELVFIPLPGIGHLRPTVKLAKQLIGSENRLSITIIIIPSRFDAGDASACIASLTTLSQ 61
Query: 61 ------PRICVIDLPPVDPPLPDVLKKSPEYFISLVVESHLPNVKNIVSSRSNSGSLQVT 114
I V PP P P + +E V++ V++R + ++
Sbjct: 62 DDRLHYESISVAKQPPTSDPDP--------VPAQVYIEKQKTKVRDAVAARIVDPTRKLA 113
Query: 115 GLVLDFFCVSMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRIS---TVFESSDDEL 171
G V+D FC SM+D+A E +P YM TSN FL ML++ D+ + E+S EL
Sbjct: 114 GFVVDMFCSSMIDVANEFGVPCYMVYTSNATFLGTMLHVQQMYDQKKYDVSELENSVTEL 173
Query: 172 LIPGITSPVPVCVMPSCLFNKDGGHATLVKLAQRFKDVDGIIVNTFHELEPYAVNAFS-- 229
P +T P PV +P L +K+ +L + A+ F+ + GI+VNT ELEP+A+ F+
Sbjct: 174 EFPSLTRPYPVKCLPHILTSKEWLPLSLAQ-ARCFRKMKGILVNTVAELEPHALKMFNIN 232
Query: 230 GDLNPPLYTAGPVLHLKSQPNPDLDEAQYQKIFQWLDDLAESSVVFLCFGSSGSFDVAQV 289
GD P +Y GPVLHL+ N + D+ + +I +WLD+ SVVFLCFGS G F Q
Sbjct: 233 GDDLPQVYPVGPVLHLE---NGNDDDEKQSEILRWLDEQPSKSVVFLCFGSLGGFTEEQT 289
Query: 290 KEIAIGLERSGYNFLWSLRVSSPKDEVSAHRYVTN-NGVFPEGFLERIKGRGMIWGWVPQ 348
+E A+ L+RSG FLW LR +SP + R TN V PEGFLER RG + GW PQ
Sbjct: 290 RETAVALDRSGQRFLWCLRHASPNIKTDRPRDYTNLEEVLPEGFLERTLDRGKVIGWAPQ 349
Query: 349 VEILAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKELGLALDLRLD 408
V +L AIGGFV+HCGWNSILESLW+GVP+ TWP+YAEQ++NAF MV+ELGLA+++R
Sbjct: 350 VAVLEKPAIGGFVTHCGWNSILESLWFGVPMVTWPLYAEQKVNAFEMVEELGLAVEIR-K 408
Query: 409 YRVGS------DLVMAGDIESAVRCLMDGENKIRKKVKEMAEISRKSLMEGGSSFNSIGQ 462
Y G + V A DIE A+R +M+ ++ +R VKEMAE +LM+GGSS ++ +
Sbjct: 409 YLKGDLFAGEMETVTAEDIERAIRRVMEQDSDVRNNVKEMAEKCHFALMDGGSSKAALEK 468
Query: 463 FIS 465
FI
Sbjct: 469 FIQ 471
>sp|Q9LSY4|U71B8_ARATH UDP-glycosyltransferase 71B8 OS=Arabidopsis thaliana GN=UGT71B8
PE=3 SV=1
Length = 480
Score = 351 bits (900), Expect = 9e-96, Method: Compositional matrix adjust.
Identities = 204/484 (42%), Positives = 293/484 (60%), Gaps = 30/484 (6%)
Query: 1 MKKAELIFVPSPGIGHLVSTLEFAKHLTDRDDRISVTILSMKLAVAPWVDAYTKSLTDSQ 60
M K L+FVP P +GHL ST E AK L +++ R+S++I+ + L V A
Sbjct: 1 MNKFALVFVPFPILGHLKSTAEMAKLLVEQETRLSISIIILPLLSGDDVSA--------S 52
Query: 61 PRICVIDLPPVDPPLPDVLKKSPEYFISLVVESHLPNVKNIVSS-----RSNSGSLQVTG 115
I + D +V+ + + L V++H+P VK V+ S ++ G
Sbjct: 53 AYISALSAASNDRLHYEVISDGDQPTVGLHVDNHIPMVKRTVAKLVDDYSRRPDSPRLAG 112
Query: 116 LVLDFFCVSMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRIS-----TVFESSDDE 170
LV+D FC+S++D+A E+S+P Y+F TSN+G L L L++ D+ T FE S+
Sbjct: 113 LVVDMFCISVIDVANEVSVPCYLFYTSNVGILALGLHIQMLFDKKEYSVSETDFEDSEVV 172
Query: 171 LLIPGITSPVPVCVMPSCLFNKDGGHATLVKLAQRFKDVDGIIVNTFHELEPYAVNAF-- 228
L +P +T P PV +P L K+ L + +RF+++ GI+VNTF ELEPYA+ +
Sbjct: 173 LDVPSLTCPYPVKCLPYGLATKEWLPMYLNQ-GRRFREMKGILVNTFAELEPYALESLHS 231
Query: 229 SGDLNPPLYTAGPVLHLKSQPNPDLDEAQYQKIFQWLDDLAESSVVFLCFGSSGSFDVAQ 288
SGD P Y GP+LHL++ + DE + I +WLD+ SVVFLCFGS G F+ Q
Sbjct: 232 SGD-TPRAYPVGPLLHLENHVDGSKDE-KGSDILRWLDEQPPKSVVFLCFGSIGGFNEEQ 289
Query: 289 VKEIAIGLERSGYNFLWSLRVSSPK-DEVSAHRYVTNNGVFPEGFLERIKGRGMIWGWVP 347
+E+AI LERSG+ FLWSLR +S D+ + + PEGF +R K +G + GW P
Sbjct: 290 AREMAIALERSGHRFLWSLRRASRDIDKELPGEFKNLEEILPEGFFDRTKDKGKVIGWAP 349
Query: 348 QVEILAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKELGLALDLRL 407
QV +LA AIGGFV+HCGWNSILESLW+GVPIA WP+YAEQ+ NAF MV+ELGLA+ +R
Sbjct: 350 QVAVLAKPAIGGFVTHCGWNSILESLWFGVPIAPWPLYAEQKFNAFVMVEELGLAVKIRK 409
Query: 408 DYR----VGSD--LVMAGDIESAVRCLMDGENKIRKKVKEMAEISRKSLMEGGSSFNSIG 461
+R VG+ +V A +IE +RCLM+ ++ +R +VKEM++ +L +GGSS +++
Sbjct: 410 YWRGDQLVGTATVIVTAEEIERGIRCLMEQDSDVRNRVKEMSKKCHMALKDGGSSQSALK 469
Query: 462 QFIS 465
FI
Sbjct: 470 LFIQ 473
>sp|Q40284|UFOG1_MANES Anthocyanidin 3-O-glucosyltransferase 1 OS=Manihot esculenta GN=GT1
PE=2 SV=1
Length = 449
Score = 349 bits (895), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 201/461 (43%), Positives = 273/461 (59%), Gaps = 41/461 (8%)
Query: 14 IGHLVSTLEFAKHLTDRDDRISVTILSMKLAV-APWVDAYTKS-LTDSQPRICVIDLPPV 71
+GHLVS +E AK L R +S+T+L +V V Y S + S R+ I LP
Sbjct: 1 MGHLVSAVETAKLLLSRCHSLSITVLIFNNSVVTSKVHNYVDSQIASSSNRLRFIYLPRD 60
Query: 72 DPPLPDVLKKSPEYFISLVVESHLPNVKNIVSSRSNSGSL----QVTGLVLDFFCVSMVD 127
+ + S ++E P+VK V + GS ++ G ++D FC +M+D
Sbjct: 61 ETGISS---------FSSLIEKQKPHVKESVMKITEFGSSVESPRLVGFIVDMFCTAMID 111
Query: 128 IAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRIS---TVFESSDDELLIPGITSPVPVCV 184
+A E +PSY+F TS FL ML++ D + T F +SD EL +PG+ + P
Sbjct: 112 VANEFGVPSYIFYTSGAAFLNFMLHVQKIHDEENFNPTEFNASDGELQVPGLVNSFPSKA 171
Query: 185 MPSCLFNKDGGHATLVKLAQRFKDVDGIIVNTFHELEPYAVNAFSGDLNPPLYTAGPVLH 244
MP+ + +K L++ +R+ + G+I+NTF ELE +A+ +F +PP+Y GP+L
Sbjct: 172 MPTAILSKQW-FPPLLENTRRYGEAKGVIINTFFELESHAIESFK---DPPIYPVGPILD 227
Query: 245 LKSQPNPDLDEAQYQKIFQWLDDLAESSVVFLCFGSSGSFDVAQVKEIAIGLERSGYNFL 304
++S Q+I QWLDD SSVVFLCFGS+GSF QVKEIA LE SG+ FL
Sbjct: 228 VRSN-----GRNTNQEIMQWLDDQPPSSVVFLCFGSNGSFSKDQVKEIACALEDSGHRFL 282
Query: 305 WSLR-------VSSPKDEVSAHRYVTNNGVFPEGFLERIKGRGMIWGWVPQVEILAHKAI 357
WSL + SP D Y V PEGFLER G + GW PQV +LAH A
Sbjct: 283 WSLADHRAPGFLESPSD------YEDLQEVLPEGFLERTSGIEKVIGWAPQVAVLAHPAT 336
Query: 358 GGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKELGLALDLRLDYRVGS-DLV 416
GG VSH GWNSILES+W+GVP+ATWP+YAEQQ NAF+MV ELGLA+++++DYR S ++V
Sbjct: 337 GGLVSHSGWNSILESIWFGVPVATWPMYAEQQFNAFQMVIELGLAVEIKMDYRNDSGEIV 396
Query: 417 MAGDIESAVRCLMDGENKIRKKVKEMAEISRKSLMEGGSSF 457
IE +RCLM ++ RKKVKEM+E SR +LMEGGSS+
Sbjct: 397 KCDQIERGIRCLMKHDSDRRKKVKEMSEKSRGALMEGGSSY 437
>sp|Q9LSY6|U71B6_ARATH UDP-glycosyltransferase 71B6 OS=Arabidopsis thaliana GN=UGT71B6
PE=1 SV=1
Length = 479
Score = 346 bits (887), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 195/478 (40%), Positives = 291/478 (60%), Gaps = 30/478 (6%)
Query: 3 KAELIFVPSPGIGHLVSTLEFAKHLTDRDDRISVTILSMKLAVAPWVDAYTKSLTDSQPR 62
K EL+F+PSP I HL++T+E A+ L D++D +S+T++ + + + T ++++ R
Sbjct: 2 KIELVFIPSPAISHLMATVEMAEQLVDKNDNLSITVIIISFS-SKNTSMITSLTSNNRLR 60
Query: 63 ICVI---DLPPVDPPLPDVLKKSPEYFISLVVESHLPNVKNIVSSRSNS---GSLQVTGL 116
+I D P + LK + + ++S P V++ V+ +S + ++ G
Sbjct: 61 YEIISGGDQQPTE------LKATDSH-----IQSLKPLVRDAVAKLVDSTLPDAPRLAGF 109
Query: 117 VLDFFCVSMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRIS----TVFESSDDELL 172
V+D +C SM+D+A E +PSY+F TSN GFL L+L++ D + E SD EL+
Sbjct: 110 VVDMYCTSMIDVANEFGVPSYLFYTSNAGFLGLLLHIQFMYDAEDIYDMSELEDSDVELV 169
Query: 173 IPGITSPVPVCVMPSCLFNKDGGHATLVKLAQRFKDVDGIIVNTFHELEPYAVNAFSGDL 232
+P +TSP P+ +P +F V A+RF++ GI+VNT +LEP A+ S
Sbjct: 170 VPSLTSPYPLKCLP-YIFKSKEWLTFFVTQARRFRETKGILVNTVPDLEPQALTFLSNGN 228
Query: 233 NPPLYTAGPVLHLKSQPNPDLDEAQYQKIFQWLDDLAESSVVFLCFGSSGSFDVAQVKEI 292
P Y GP+LHLK+ N D + + +I +WLD+ SVVFLCFGS G F QV+E
Sbjct: 229 IPRAYPVGPLLHLKNV-NCDYVDKKQSEILRWLDEQPPRSVVFLCFGSMGGFSEEQVRET 287
Query: 293 AIGLERSGYNFLWSLRVSSPKDEVSAHRYVTN-NGVFPEGFLERIKGRGMIWGWVPQVEI 351
A+ L+RSG+ FLWSLR +SP TN + PEGF +R RG + GW QV I
Sbjct: 288 ALALDRSGHRFLWSLRRASPNILREPPGEFTNLEEILPEGFFDRTANRGKVIGWAEQVAI 347
Query: 352 LAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKELGLALDLRLDYR- 410
LA AIGGFVSH GWNS LESLW+GVP+A WP+YAEQ+ NAF MV+ELGLA++++ +R
Sbjct: 348 LAKPAIGGFVSHGGWNSTLESLWFGVPMAIWPLYAEQKFNAFEMVEELGLAVEIKKHWRG 407
Query: 411 ---VG-SDLVMAGDIESAVRCLMDGENKIRKKVKEMAEISRKSLMEGGSSFNSIGQFI 464
+G S++V A +IE + CLM+ ++ +RK+V E++E +LM+GGSS ++ +FI
Sbjct: 408 DLLLGRSEIVTAEEIEKGIICLMEQDSDVRKRVNEISEKCHVALMDGGSSETALKRFI 465
>sp|Q40288|UFOG6_MANES Anthocyanidin 3-O-glucosyltransferase 6 (Fragment) OS=Manihot
esculenta GN=GT6 PE=2 SV=1
Length = 394
Score = 345 bits (884), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 175/373 (46%), Positives = 244/373 (65%), Gaps = 10/373 (2%)
Query: 91 VESHLPNVKNIVSSRSNSGSLQVTGLVLDFFCVSMVDIAKELSLPSYMFLTSNMGFLRLM 150
++ +VK VS + + G VLD FC SM+D+AKEL +P Y+F TS FL +
Sbjct: 9 IDKQKAHVKEAVSKLTARSDSSLAGFVLDMFCTSMIDVAKELGVPYYIFFTSGAAFLGFL 68
Query: 151 LY---LPTRQDRISTVFESSDDELLIPGITSPVPVCVMPSCLFNKDGGHATLVKLAQRFK 207
Y + QD T F+ SD EL +P + + +P V+P+ + KD +A +++ + +
Sbjct: 69 FYVQLIHDEQDADLTQFKDSDAELSVPSLANSLPARVLPASMLVKDRFYA-FIRIIRGLR 127
Query: 208 DVDGIIVNTFHELEPYAVNAFSGDLN--PPLYTAGPVLHLKSQPNPDLDEAQYQKIFQWL 265
+ GI+VNTF ELE +A+N+ D + PP+Y GP+L L +Q N E +I +WL
Sbjct: 128 EAKGIMVNTFMELESHALNSLKDDQSKIPPIYPVGPILKLSNQENDVGPEGS--EIIEWL 185
Query: 266 DDLAESSVVFLCFGSSGSFDVAQVKEIAIGLERSGYNFLWSLRVSSPKDEV-SAHRYVTN 324
DD SSVVFLCFGS G FD+ Q KEIA LE+S + FLWSLR PK ++ ++ Y
Sbjct: 186 DDQPPSSVVFLCFGSMGGFDMDQAKEIACALEQSRHRFLWSLRRPPPKGKIETSTDYENL 245
Query: 325 NGVFPEGFLERIKGRGMIWGWVPQVEILAHKAIGGFVSHCGWNSILESLWYGVPIATWPI 384
+ P GF ER G G + GW PQV IL H AIGGFVSHCGWNSILES+W+ VPIATWP+
Sbjct: 246 QEILPVGFSERTAGMGKVVGWAPQVAILEHPAIGGFVSHCGWNSILESIWFSVPIATWPL 305
Query: 385 YAEQQLNAFRMVKELGLALDLRLDYRVGSDLVM-AGDIESAVRCLMDGENKIRKKVKEMA 443
YAEQQ NAF MV ELGLA+++++DY+ S++++ A DIE ++C+M+ ++IRK+VKEM+
Sbjct: 306 YAEQQFNAFTMVTELGLAVEIKMDYKKESEIILSADDIERGIKCVMEHHSEIRKRVKEMS 365
Query: 444 EISRKSLMEGGSS 456
+ SRK+LM+ SS
Sbjct: 366 DKSRKALMDDESS 378
>sp|Q9LSY9|U71B1_ARATH UDP-glycosyltransferase 71B1 OS=Arabidopsis thaliana GN=UGT71B1
PE=2 SV=1
Length = 473
Score = 343 bits (879), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 193/484 (39%), Positives = 290/484 (59%), Gaps = 42/484 (8%)
Query: 3 KAELIFVPSPGIGHLVSTLEFAKHLTDRDDRISVTILSMKLAVAPWVDAYTKSLTDSQPR 62
K EL+F+PSPG+GH+ +T AK L D+R+SVT++ + V+ DA + T+S+ R
Sbjct: 2 KVELVFIPSPGVGHIRATTALAKLLVASDNRLSVTLIVIPSRVSD--DASSSVYTNSEDR 59
Query: 63 ICVIDLPPVDPPLPDVLKKSPEYFISLVVESHLPNVKNIVS-------SRSNSGSLQVTG 115
+ I LP D D++ ++S P V+ +VS +RS+S ++ G
Sbjct: 60 LRYILLPARDQTT-DLVS---------YIDSQKPQVRAVVSKVAGDVSTRSDS---RLAG 106
Query: 116 LVLDFFCVSMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRIS---TVFESSDDELL 172
+V+D FC SM+DIA E +L +Y+F TSN +L L ++ + D + F+ ++ +
Sbjct: 107 IVVDMFCTSMIDIADEFNLSAYIFYTSNASYLGLQFHVQSLYDEKELDVSEFKDTEMKFD 166
Query: 173 IPGITSPVPVCVMPSCLFNKDGGHATLVKLAQRFKDVDGIIVNTFHELEPYAVNAFSG-- 230
+P +T P P +PS + NK L + A+ F+ GI+VN+ ++EP A++ FSG
Sbjct: 167 VPTLTQPFPAKCLPSVMLNKKWFPYVLGR-ARSFRATKGILVNSVADMEPQALSFFSGGN 225
Query: 231 -DLN-PPLYTAGPVLHLKSQPNPDLDEAQYQKIFQWLDDLAESSVVFLCFGSSGSFDVAQ 288
+ N PP+Y GP++ L+S DE + ++I WL + SVVFLCFGS G F Q
Sbjct: 226 GNTNIPPVYAVGPIMDLESSG----DEEKRKEILHWLKEQPTKSVVFLCFGSMGGFSEEQ 281
Query: 289 VKEIAIGLERSGYNFLWSLRVSSP---KDEVSAHRYVTNNGVFPEGFLERIKGRGMIWGW 345
+EIA+ LERSG+ FLWSLR +SP K + + P+GFL+R G I W
Sbjct: 282 AREIAVALERSGHRFLWSLRRASPVGNKSNPPPGEFTNLEEILPKGFLDRTVEIGKIISW 341
Query: 346 VPQVEILAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKELGLALDL 405
PQV++L AIG FV+HCGWNSILESLW+GVP+A WPIYAEQQ NAF MV ELGLA ++
Sbjct: 342 APQVDVLNSPAIGAFVTHCGWNSILESLWFGVPMAAWPIYAEQQFNAFHMVDELGLAAEV 401
Query: 406 RLDYRVG-----SDLVMAGDIESAVRCLMDGENKIRKKVKEMAEISRKSLMEGGSSFNSI 460
+ +YR ++V A +IE ++C M+ ++K+RK+V EM + +L++GGSS ++
Sbjct: 402 KKEYRRDFLVEEPEIVTADEIERGIKCAMEQDSKMRKRVMEMKDKLHVALVDGGSSNCAL 461
Query: 461 GQFI 464
+F+
Sbjct: 462 KKFV 465
>sp|Q40285|UFOG2_MANES Anthocyanidin 3-O-glucosyltransferase 2 (Fragment) OS=Manihot
esculenta GN=GT2 PE=2 SV=1
Length = 346
Score = 341 bits (874), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 177/349 (50%), Positives = 229/349 (65%), Gaps = 14/349 (4%)
Query: 121 FCVSMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRIS---TVFESSDDELLIPGIT 177
FC M+D+A E +PSY+F S GFL MLY+ D + F+ SD EL++P +
Sbjct: 1 FCTPMMDLADEFGIPSYIFFASGGGFLGFMLYVQKIHDEENFNPIEFKDSDTELIVPSLV 60
Query: 178 SPVPVCVMPSCLFNKDGGHATLVKLAQRFKDVDGIIVNTFHELEPYAVNAFSGDLNPPLY 237
+P P ++PS + NK+ L+ +A++F+ GIIVNTF ELE A+ +F PPLY
Sbjct: 61 NPFPTRILPSSILNKER-FGQLLAIAKKFRQAKGIIVNTFLELESRAIESFK---VPPLY 116
Query: 238 TAGPVLHLKSQPNPDLDEAQYQKIFQWLDDLAESSVVFLCFGSSGSFDVAQVKEIAIGLE 297
GP+L +KS + +I QWLDD E SVVFLCFGS GSF Q+KEIA LE
Sbjct: 117 HVGPILDVKSD-----GRNTHPEIMQWLDDQPEGSVVFLCFGSMGSFSEDQLKEIAYALE 171
Query: 298 RSGYNFLWSLRVSSPKDEV-SAHRYVTNNGVFPEGFLERIKGRGMIWGWVPQVEILAHKA 356
SG+ FLWS+R P D++ S Y V PEGFLER G + GW PQV +LAH A
Sbjct: 172 NSGHRFLWSIRRPPPPDKIASPTDYEDPRDVLPEGFLERTVAVGKVIGWAPQVAVLAHPA 231
Query: 357 IGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKELGLALDLRLDYRVGSDLV 416
IGGFVSHCGWNS+LESLW+GVPIATWP+YAEQQ NAF MV ELGL +++ + YR S ++
Sbjct: 232 IGGFVSHCGWNSVLESLWFGVPIATWPMYAEQQFNAFEMVVELGLGVEIDMGYRKESGII 291
Query: 417 MAGD-IESAVRCLMDGENKIRKKVKEMAEISRKSLMEGGSSFNSIGQFI 464
+ D IE A+R LM+ ++ RKKVKEM E S+ +L++GGSSF S+G FI
Sbjct: 292 VNSDKIERAIRKLMENSDEKRKKVKEMREKSKMALIDGGSSFISLGDFI 340
>sp|Q4R1I9|ANGLT_ROSHC Anthocyanidin 5,3-O-glucosyltransferase OS=Rosa hybrid cultivar
GN=RhGT1 PE=2 SV=1
Length = 473
Score = 261 bits (668), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 165/485 (34%), Positives = 268/485 (55%), Gaps = 45/485 (9%)
Query: 6 LIFVPSPGIGHLVSTLEFAKHLTDRDDRISVTILSM----------KLAVAP--WVDAYT 53
++ P PG+GHL+S +E K L S+TIL+ KL + + Y
Sbjct: 6 IVLYPYPGLGHLISMVELGKLLLTHHPSFSITILASTAPTTIAATAKLVASSNDQLTNYI 65
Query: 54 KSLTDSQPRICVIDLPPVDPPLPDVLKKS--PEYFISLVVESHLPNVKNIVSSRSNSGSL 111
K+++ P I LP + LP+ ++K P + L +PN+ ++ + +S
Sbjct: 66 KAVSADNPAINFHHLPTISS-LPEHIEKLNLPFEYARL----QIPNILQVLQTLKSS--- 117
Query: 112 QVTGLVLDFFCVSMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRISTVFESSDDEL 171
+ L+LD FC ++ D+ K+L++P++ F TS L ++L +PT +++ + D +
Sbjct: 118 -LKALILDMFCDALFDVTKDLNIPTFYFYTSAGRSLAVLLNIPTFHRTTNSLSDFGDVPI 176
Query: 172 LIPGITSPVPVCVMPSCLFNKDGG-HATLVKLAQRFKDVDGIIVNTFHELEPYAVNAFSG 230
I G+ P+PV MP LF++ + + + + +GII+NTF LE A+ A
Sbjct: 177 SISGM-PPIPVSAMPKLLFDRSTNFYKSFLSTSTHMAKSNGIILNTFDLLEERALKALRA 235
Query: 231 DL------NPPLYTAGPVLHLKSQPNPDLDEAQYQKIFQWLDDLAESSVVFLCFGSSGSF 284
L PP++T GP++ KS N DE + K WL++ + SVVFLCFGS G F
Sbjct: 236 GLCLPNQPTPPIFTVGPLISGKSGDN---DEHESLK---WLNNQPKDSVVFLCFGSMGVF 289
Query: 285 DVAQVKEIAIGLERSGYNFLWSLRVSSPKDEVSAHRYVTNNGVFPEGFLERIKGRGMI-W 343
+ Q++ +A+GLE+SG FLW +R + P +E+ + P+GF+ER K RG++
Sbjct: 290 SIKQLEAMALGLEKSGQRFLWVVR-NPPIEELPVEEPSLEE-ILPKGFVERTKDRGLVVR 347
Query: 344 GWVPQVEILAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKELGLAL 403
W PQVE+L+H ++GGFV+HCGWNS+LE++ GVP+ WP+YAEQ+L +V+E+ +A+
Sbjct: 348 KWAPQVEVLSHDSVGGFVTHCGWNSVLEAVCNGVPMVAWPLYAEQKLGRVFLVEEMKVAV 407
Query: 404 DLRLDYRVGSDLVMAGDIESAVRCLMDGE--NKIRKKVKEMAEISRKSLMEGGSSFNSIG 461
++ + V A ++E VR LMD E ++IR +V E + K+ EGGSS S+
Sbjct: 408 GVK---ESETGFVSADELEKRVRELMDSESGDEIRGRVSEFSNGGVKAKEEGGSSVASLA 464
Query: 462 QFISL 466
+ L
Sbjct: 465 KLAQL 469
>sp|Q9M156|U72B1_ARATH UDP-glycosyltransferase 72B1 OS=Arabidopsis thaliana GN=UGT72B1
PE=1 SV=1
Length = 480
Score = 249 bits (636), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 161/470 (34%), Positives = 246/470 (52%), Gaps = 25/470 (5%)
Query: 2 KKAELIFVPSPGIGHLVSTLEFAKHLTDRDDRISVTILSMKLAVAPWVDAYTKSLTDSQP 61
K + +PSPG+GHL+ +EFAK L +T+ + P A +++ DS P
Sbjct: 5 KTPHVAIIPSPGMGHLIPLVEFAKRLVHLH---GLTVTFVIAGEGPPSKA-QRTVLDSLP 60
Query: 62 R-ICVIDLPPVDPPLPDVLKKSP-EYFISLVVESHLPNVKNIVSSRSNSGSLQVTGLVLD 119
I + LPPVD L D+ + E ISL V P ++ + S G L T LV+D
Sbjct: 61 SSISSVFLPPVD--LTDLSSSTRIESRISLTVTRSNPELRKVFDSFVEGGRLP-TALVVD 117
Query: 120 FFCVSMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRISTVFESSDDELLIPGITSP 179
F D+A E +P Y+F + L L+LP + +S F + L++PG
Sbjct: 118 LFGTDAFDVAVEFHVPPYIFYPTTANVLSFFLHLPKLDETVSCEFRELTEPLMLPGCVPV 177
Query: 180 VPVCVMPSCLFNKDGGHATLVKLAQRFKDVDGIIVNTFHELEPYAVNAFS--GDLNPPLY 237
+ KD + L+ +R+K+ +GI+VNTF ELEP A+ A G PP+Y
Sbjct: 178 AGKDFLDPAQDRKDDAYKWLLHNTKRYKEAEGILVNTFFELEPNAIKALQEPGLDKPPVY 237
Query: 238 TAGPVLHLKSQPNPDLDEAQYQKIFQWLDDLAESSVVFLCFGSSGSFDVAQVKEIAIGLE 297
GP++++ Q +E++ +WLD+ SV+++ FGS G+ Q+ E+A+GL
Sbjct: 238 PVGPLVNIGKQEAKQTEESE---CLKWLDNQPLGSVLYVSFGSGGTLTCEQLNELALGLA 294
Query: 298 RSGYNFLWSLRVSSPKDEVS---AHRYVTNNGVFPEGFLERIKGRG-MIWGWVPQVEILA 353
S FLW +R S S +H P GFLER K RG +I W PQ ++LA
Sbjct: 295 DSEQRFLWVIRSPSGIANSSYFDSHSQTDPLTFLPPGFLERTKKRGFVIPFWAPQAQVLA 354
Query: 354 HKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKELGLALDLRLDYRVGS 413
H + GGF++HCGWNS LES+ G+P+ WP+YAEQ++NA + +++ A L R G
Sbjct: 355 HPSTGGFLTHCGWNSTLESVVSGIPLIAWPLYAEQKMNAVLLSEDIRAA----LRPRAGD 410
Query: 414 D-LVMAGDIESAVRCLMDGE--NKIRKKVKEMAEISRKSLMEGGSSFNSI 460
D LV ++ V+ LM+GE +R K+KE+ E + + L + G+S ++
Sbjct: 411 DGLVRREEVARVVKGLMEGEEGKGVRNKMKELKEAACRVLKDDGTSTKAL 460
>sp|Q9LNI1|U72B3_ARATH UDP-glycosyltransferase 72B3 OS=Arabidopsis thaliana GN=UGT72B3
PE=2 SV=1
Length = 481
Score = 248 bits (634), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 165/464 (35%), Positives = 240/464 (51%), Gaps = 24/464 (5%)
Query: 9 VPSPGIGHLVSTLEFAKHLTDRDDRISVTILSMKLAVAPWVDAYTKSLTDSQPRICVIDL 68
+PSPGIGHL+ +E AK L D T+ + +P A L I + L
Sbjct: 12 IPSPGIGHLIPLVELAKRLLDNH---GFTVTFIIPGDSPPSKAQRSVLNSLPSSIASVFL 68
Query: 69 PPVDPPLPDVLKKSP-EYFISLVVESHLPNVKNIVSSRSNSGSLQVTGLVLDFFCVSMVD 127
PP D L DV + E ISL V P ++ + S S L LV+D F D
Sbjct: 69 PPAD--LSDVPSTARIETRISLTVTRSNPALRELFGSLSAEKRLPAV-LVVDLFGTDAFD 125
Query: 128 IAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRISTVFESSDDELLIPGITSPVPVCVMPS 187
+A E + Y+F SN L +L+LP + +S F + ++IPG +
Sbjct: 126 VAAEFHVSPYIFYASNANVLTFLLHLPKLDETVSCEFRELTEPVIIPGCVPITGKDFVDP 185
Query: 188 CLFNKDGGHATLVKLAQRFKDVDGIIVNTFHELEPYAVNAFSGDL--NPPLYTAGPVLHL 245
C KD + L+ +RFK+ +GI+VN+F +LEP + PP+Y GP+++
Sbjct: 186 CQDRKDESYKWLLHNVKRFKEAEGILVNSFVDLEPNTIKIVQEPAPDKPPVYLIGPLVNS 245
Query: 246 KSQPNPDLDEAQYQKIFQWLDDLAESSVVFLCFGSSGSFDVAQVKEIAIGLERSGYNFLW 305
S D D K WLD+ SV+++ FGS G+ Q E+A+GL SG FLW
Sbjct: 246 GSH---DADVNDEYKCLNWLDNQPFGSVLYVSFGSGGTLTFEQFIELALGLAESGKRFLW 302
Query: 306 SLRVSSPKDEVSAHRYVTNNGVF---PEGFLERIKGRGMIWG-WVPQVEILAHKAIGGFV 361
+R S S + N F P+GFL+R K +G++ G W PQ +IL H +IGGF+
Sbjct: 303 VIRSPSGIASSSYFNPQSRNDPFSFLPQGFLDRTKEKGLVVGSWAPQAQILTHTSIGGFL 362
Query: 362 SHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKELGLALDLRLDYRVGSDLVMA-GD 420
+HCGWNS LES+ GVP+ WP+YAEQ++NA +V ++G AL RL G D V+ +
Sbjct: 363 THCGWNSSLESIVNGVPLIAWPLYAEQKMNALLLV-DVGAALRARL----GEDGVVGREE 417
Query: 421 IESAVRCLMDGE--NKIRKKVKEMAEISRKSLMEGGSSFNSIGQ 462
+ V+ L++GE N +RKK+KE+ E S + L + G S S+ +
Sbjct: 418 VARVVKGLIEGEEGNAVRKKMKELKEGSVRVLRDDGFSTKSLNE 461
>sp|Q9AR73|HQGT_RAUSE Hydroquinone glucosyltransferase OS=Rauvolfia serpentina GN=AS PE=1
SV=1
Length = 470
Score = 237 bits (604), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 159/472 (33%), Positives = 249/472 (52%), Gaps = 31/472 (6%)
Query: 5 ELIFVPSPGIGHLVSTLEFAKHLTDRDDRISVTILSMKLAVAPWVDAYTKSLTDSQPR-I 63
+ VP+PG+GHL+ +EFAK L R + VT + P KS D+ P +
Sbjct: 6 HIAMVPTPGMGHLIPLVEFAKRLVLRHN-FGVTFIIPTDGPLP---KAQKSFLDALPAGV 61
Query: 64 CVIDLPPVD-PPLPDVLKKSPEYFISLVVESHLPNVKNIVSSRSNSGSLQVTGLVLDFFC 122
+ LPPV LP ++ E I L + LP V++ V ++ + ++ LV+D F
Sbjct: 62 NYVLLPPVSFDDLPADVRI--ETRICLTITRSLPFVRDAV--KTLLATTKLAALVVDLFG 117
Query: 123 VSMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRISTVFESSDDELLIPGITSPVPV 182
D+A E + Y+F + L L +LP +S + + L IPG
Sbjct: 118 TDAFDVAIEFKVSPYIFYPTTAMCLSLFFHLPKLDQMVSCEYRDVPEPLQIPGCIPIHGK 177
Query: 183 CVMPSCLFNKDGGHATLVKLAQRFKDVDGIIVNTFHELEPYAVNAFSGD--LNPPLYTAG 240
+ K+ + L+ A+R++ +GI+VNTF++LEP + A + PP+Y G
Sbjct: 178 DFLDPAQDRKNDAYKCLLHQAKRYRLAEGIMVNTFNDLEPGPLKALQEEDQGKPPVYPIG 237
Query: 241 PVLHLKSQPNPDLDEAQYQKIFQWLDDLAESSVVFLCFGSSGSFDVAQVKEIAIGLERSG 300
P++ S D E +WLDD SV+F+ FGS G+ Q E+A+GLE S
Sbjct: 238 PLIRADSSSKVDDCEC-----LKWLDDQPRGSVLFISFGSGGAVSHNQFIELALGLEMSE 292
Query: 301 YNFLWSLRVSSPKDEVSAHRYVT----NNGV--FPEGFLERIKGRGM-IWGWVPQVEILA 353
FLW +R SP D+++ Y + N+ + PEGFLER KGR + + W PQ EIL+
Sbjct: 293 QRFLWVVR--SPNDKIANATYFSIQNQNDALAYLPEGFLERTKGRCLLVPSWAPQTEILS 350
Query: 354 HKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKELGLALDLRLDYRVGS 413
H + GGF++HCGWNSILES+ GVP+ WP+YAEQ++NA + + L +AL + +
Sbjct: 351 HGSTGGFLTHCGWNSILESVVNGVPLIAWPLYAEQKMNAVMLTEGLKVALRPKAGE---N 407
Query: 414 DLVMAGDIESAVRCLMDGE--NKIRKKVKEMAEISRKSLMEGGSSFNSIGQF 463
L+ +I +AV+ LM+GE K R +K++ + + ++L + GSS ++ +
Sbjct: 408 GLIGRVEIANAVKGLMEGEEGKKFRSTMKDLKDAASRALSDDGSSTKALAEL 459
>sp|Q9LK73|U88A1_ARATH UDP-glycosyltransferase 88A1 OS=Arabidopsis thaliana GN=UGT88A1
PE=2 SV=1
Length = 462
Score = 229 bits (584), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 153/469 (32%), Positives = 242/469 (51%), Gaps = 18/469 (3%)
Query: 1 MKKAELIFVPSPGIGHLVSTLEFAKHLTDRDDRISVTILSMKLAVAPWVDA-YTKSLTDS 59
M + ++ P+P IGHLVS +E K + ++ +S+ I+ + P A Y S++ S
Sbjct: 1 MGEEAIVLYPAPPIGHLVSMVELGKTILSKNPSLSIHIILVPPPYQPESTATYISSVSSS 60
Query: 60 QPRICVIDLPPVDPPLPDVLKKSPEYFISL-VVESHLPNVKNIVSSRSNSGSLQVTGLVL 118
P I LP V P + + L ++ P+V + S S + V +++
Sbjct: 61 FPSITFHHLPAVTPYSSSSTSRHHHESLLLEILCFSNPSVHRTLFSLSRN--FNVRAMII 118
Query: 119 DFFCVSMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRISTVFESSDDELLIPGITS 178
DFFC +++DI + + P Y F TS L YLPT + + IPG+
Sbjct: 119 DFFCTAVLDITADFTFPVYFFYTSGAACLAFSFYLPTIDETTPGKNLKDIPTVHIPGV-P 177
Query: 179 PVPVCVMPSCLFNKDGG-HATLVKLAQRFKDVDGIIVNTFHELEPYAVNAFSGDLN-PPL 236
P+ MP + +D + + ++ GII+NTF LE A+ A + +L +
Sbjct: 178 PMKGSDMPKAVLERDDEVYDVFIMFGKQLSKSSGIIINTFDALENRAIKAITEELCFRNI 237
Query: 237 YTAGPVLHLKSQPNPDLDEAQYQKIFQWLDDLAESSVVFLCFGSSGSFDVAQVKEIAIGL 296
Y GP++ + D ++ + WLD E SVVFLCFGS G F QV EIA+GL
Sbjct: 238 YPIGPLI--VNGRIEDRNDNKAVSCLNWLDSQPEKSVVFLCFGSLGLFSKEQVIEIAVGL 295
Query: 297 ERSGYNFLWSLRVSSPKDEVSAHRYVTNNGVFPEGFLERIKGRGMIW-GWVPQVEILAHK 355
E+SG FLW +R + P+ E + + + PEGFL R + +GM+ W PQV +L HK
Sbjct: 296 EKSGQRFLWVVR-NPPELEKTE---LDLKSLLPEGFLSRTEDKGMVVKSWAPQVPVLNHK 351
Query: 356 AIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKELGLALDLRLDYRVGSDL 415
A+GGFV+HCGWNSILE++ GVP+ WP+YAEQ+ N +V E+ +A+ + +
Sbjct: 352 AVGGFVTHCGWNSILEAVCAGVPMVAWPLYAEQRFNRVMIVDEIKIAISMN---ESETGF 408
Query: 416 VMAGDIESAVRCLMDGENKIRKKVKEMAEISRKSLMEGGSSFNSIGQFI 464
V + ++E V+ ++ GE +R++ M + +L E GSS ++ +
Sbjct: 409 VSSTEVEKRVQEII-GECPVRERTMAMKNAAELALTETGSSHTALTTLL 456
>sp|Q9ZU72|U72D1_ARATH UDP-glycosyltransferase 72D1 OS=Arabidopsis thaliana GN=UGT72D1
PE=2 SV=1
Length = 470
Score = 228 bits (580), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 157/487 (32%), Positives = 261/487 (53%), Gaps = 49/487 (10%)
Query: 1 MKKAELIFVPSPGIGHLVSTLEFAKHLTDRDDRISVTILSMKLAVAPWVDAYTKSLTDSQ 60
M + + V SPG+GHL+ LE L+ + I VTIL++ + + T+++ +
Sbjct: 1 MDQPHALLVASPGLGHLIPILELGNRLSSVLN-IHVTILAVTSGSSSPTE--TEAIHAAA 57
Query: 61 PR-ICVI-DLPPVDPPLPDVLKKSPEYFISLVVE--SHLPNVKNIVSSRSNSGSLQVTGL 116
R IC I ++P VD + ++++ F +VV+ + P V++ V + T +
Sbjct: 58 ARTICQITEIPSVD--VDNLVEPDATIFTKMVVKMRAMKPAVRDAVKLMKR----KPTVM 111
Query: 117 VLDFFCVSMVDIAKELSLPS-YMFLTSNMGFLRLMLYLPTRQDRISTVFESSDDELLIPG 175
++DF ++ +A ++ + + Y+++ ++ FL +M+YLP + + + L IPG
Sbjct: 112 IVDFLGTELMSVADDVGMTAKYVYVPTHAWFLAVMVYLPVLDTVVEGEYVDIKEPLKIPG 171
Query: 176 ITSPVPVCVMPSCLFNKDGGHATLVKLAQRFKDVDGIIVNTFHELEPYAVNAFSGD---- 231
P +M + L + V+ DG++VNT+ EL+ + A D
Sbjct: 172 CKPVGPKELMETMLDRSGQQYKECVRAGLEVPMSDGVLVNTWEELQGNTLAALREDEELS 231
Query: 232 --LNPPLYTAGPVLHLKSQPNPDLDEAQYQKIFQWLDDLAESSVVFLCFGSSGSFDVAQV 289
+ P+Y GP++ + N +D+ IF+WLD+ E SVVF+C GS G+ Q
Sbjct: 232 RVMKVPVYPIGPIV----RTNQHVDKPN--SIFEWLDEQRERSVVFVCLGSGGTLTFEQT 285
Query: 290 KEIAIGLERSGYNFLWSLR--------VSSPKDEVSAHRYVTNNGVFPEGFLERIKGRGM 341
E+A+GLE SG F+W LR +SS ++VSA PEGFL+R +G G+
Sbjct: 286 VELALGLELSGQRFVWVLRRPASYLGAISSDDEQVSAS--------LPEGFLDRTRGVGI 337
Query: 342 I-WGWVPQVEILAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKELG 400
+ W PQVEIL+H++IGGF+SHCGW+S LESL GVPI WP+YAEQ +NA + +E+G
Sbjct: 338 VVTQWAPQVEILSHRSIGGFLSHCGWSSALESLTKGVPIIAWPLYAEQWMNATLLTEEIG 397
Query: 401 LALDLRLDYRVGSDLVMAGDIESAVRCLMDGEN----KIRKKVKEMAEISRKSLMEGGSS 456
+A +R ++ ++ S VR +M E+ KIR K +E+ S ++ + GSS
Sbjct: 398 VA--VRTSELPSERVIGREEVASLVRKIMAEEDEEGQKIRAKAEEVRVSSERAWSKDGSS 455
Query: 457 FNSIGQF 463
+NS+ ++
Sbjct: 456 YNSLFEW 462
>sp|Q40287|UFOG5_MANES Anthocyanidin 3-O-glucosyltransferase 5 OS=Manihot esculenta GN=GT5
PE=2 SV=1
Length = 487
Score = 225 bits (573), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 153/476 (32%), Positives = 247/476 (51%), Gaps = 40/476 (8%)
Query: 3 KAELIFVPSPGIGHLVSTLEFAKHLTDRDDRISVTIL---SMKLAVAPWVDAYTKSLTDS 59
K ++ + SPG+GHL+ LE K + + VTI S A P V ++T
Sbjct: 9 KPHIVLLSSPGLGHLIPVLELGKRIVTLCN-FDVTIFMVGSDTSAAEPQV--LRSAMT-- 63
Query: 60 QPRIC-VIDLPPVDPPLPDVLKKSPEYFISLVVESHLPNVKNIVSSRSNSGSLQVTGLVL 118
P++C +I LPP P + ++ PE + + + ++ + ++ + +++
Sbjct: 64 -PKLCEIIQLPP--PNISCLI--DPEATVCTRLFVLMREIRPAFRAAVSALKFRPAAIIV 118
Query: 119 DFFCVSMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRISTVFESSDDELLIPGITS 178
D F +++AKEL + Y+++ SN FL L +Y+P + F + + IPG
Sbjct: 119 DLFGTESLEVAKELGIAKYVYIASNAWFLALTIYVPILDKEVEGEFVLQKEPMKIPGCRP 178
Query: 179 PVPVCVMPSCLFNKDGGHATLVKLAQRFKDVDGIIVNTFHELEPYAVNA-----FSGDLN 233
V+ L + ++ +L DGI++NT+ LEP A F G +
Sbjct: 179 VRTEEVVDPMLDRTNQQYSEYFRLGIEIPTADGILMNTWEALEPTTFGALRDVKFLGRVA 238
Query: 234 P-PLYTAGPVLHLKSQPNPDLDEAQYQKIFQWLDDLAESSVVFLCFGSSGSFDVAQVKEI 292
P++ GP L+ Q P + + WLD + SVV++ FGS G+ + Q+ E+
Sbjct: 239 KVPVFPIGP---LRRQAGPCGSNCE---LLDWLDQQPKESVVYVSFGSGGTLSLEQMIEL 292
Query: 293 AIGLERSGYNFLWSLRVSSPKDEVSAHRYVTN-------NGVFPEGFLERIKGRGMIW-G 344
A GLERS F+W +R P + + T +G FPEGFL RI+ G++
Sbjct: 293 AWGLERSQQRFIWVVR--QPTVKTGDAAFFTQGDGADDMSGYFPEGFLTRIQNVGLVVPQ 350
Query: 345 WVPQVEILAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKELGLALD 404
W PQ+ I++H ++G F+SHCGWNS+LES+ GVPI WPIYAEQ++NA + +ELG+A
Sbjct: 351 WSPQIHIMSHPSVGVFLSHCGWNSVLESITAGVPIIAWPIYAEQRMNATLLTEELGVA-- 408
Query: 405 LRLDYRVGSDLVMAGDIESAVRCLMDGE--NKIRKKVKEMAEISRKSLMEGGSSFN 458
+R ++V +IE +R +M E ++IRK+V+E+ + K+L EGGSSFN
Sbjct: 409 VRPKNLPAKEVVKREEIERMIRRIMVDEEGSEIRKRVRELKDSGEKALNEGGSSFN 464
>sp|Q33DV3|4CGT_ANTMA Chalcone 4'-O-glucosyltransferase OS=Antirrhinum majus PE=1 SV=1
Length = 457
Score = 221 bits (562), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 129/363 (35%), Positives = 199/363 (54%), Gaps = 30/363 (8%)
Query: 113 VTGLVLDFFCVSMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRISTVFESSDDELL 172
+ L++DFFC + +++ +++P+Y ++ L L+ PT + +D +
Sbjct: 111 IKALIIDFFCNAAFEVSTSMNIPTYFDVSGGAFLLCTFLHHPTLHQTVRGDIADLNDSVE 170
Query: 173 IPGI----TSPVPVCVMPSCLFNKDGGHATLVKLAQRFKDVDGIIVNTFHELEPYAVNAF 228
+PG +S +P+ S + K + + + + GI+VNTF LE A A
Sbjct: 171 MPGFPLIHSSDLPM----SLFYRKTNVYKHFLDTSLNMRKSSGILVNTFVALEFRAKEAL 226
Query: 229 SGDL---NPPLYTAGPVLHLKSQPNPDLDEAQYQKIFQWLDDLAESSVVFLCFGSSGSFD 285
S L PPLY H ++P+ + WLD SV+FLCFG G+F
Sbjct: 227 SNGLYGPTPPLYLLS---HTIAEPHDTKVLVNQHECLSWLDLQPSKSVIFLCFGRRGAFS 283
Query: 286 VAQVKEIAIGLERSGYNFLWSLRVSSPKDEVSAHRYVTNNGVFPEGFLERIKGRGMIWG- 344
Q+KEIAIGLE+SG FLW R+S D N + PEGFL R KG G +
Sbjct: 284 AQQLKEIAIGLEKSGCRFLWLARISPEMDL---------NALLPEGFLSRTKGVGFVTNT 334
Query: 345 WVPQVEILAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKELGLALD 404
WVPQ E+L+H A+GGFV+HCGW+S+LE+L +GVP+ WP+YAEQ++N MV+E+ +AL
Sbjct: 335 WVPQKEVLSHDAVGGFVTHCGWSSVLEALSFGVPMIGWPLYAEQRINRVFMVEEIKVALP 394
Query: 405 LRLDYRVGSDLVMAGDIESAVRCLMDG--ENKIRKKVKEMAEISRKSLMEGGSSFNSIGQ 462
LD G V A ++E VR LM+ +++++V E+ ++ ++ +GGSS S+ +
Sbjct: 395 --LDEEDG--FVTAMELEKRVRELMESVKGKEVKRRVAELKISTKAAVSKGGSSLASLEK 450
Query: 463 FIS 465
FI+
Sbjct: 451 FIN 453
>sp|Q8W4C2|U72B2_ARATH UDP-glycosyltransferase 72B2 OS=Arabidopsis thaliana GN=UGT72B2
PE=2 SV=1
Length = 480
Score = 217 bits (553), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 155/472 (32%), Positives = 236/472 (50%), Gaps = 27/472 (5%)
Query: 5 ELIFVPSPGIGHLVSTLEFAKHLTDRDDRISVTILSMKLAVAPWVDAYTKSLTDSQPRIC 64
+ +PSPG+GHL+ +E AK L D I+S + + + + SL S I
Sbjct: 8 HIAIMPSPGMGHLIPFVELAKRLVQHDCFTVTMIISGETSPSKAQRSVLNSLPSS---IA 64
Query: 65 VIDLPPVDPPLPDVLKKSP-EYFISLVVESHLPNVKNIVSSRSNSGSLQVTGLVLDFFCV 123
+ LPP D L DV + E L + P ++ + S S SL LV+D F
Sbjct: 65 SVFLPPAD--LSDVPSTARIETRAMLTMTRSNPALRELFGSLSTKKSLPAV-LVVDMFGA 121
Query: 124 SMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRISTVFESSDDELLIPGITSPVPVC 183
D+A + + Y+F SN L L+LP +S F + L IPG
Sbjct: 122 DAFDVAVDFHVSPYIFYASNANVLSFFLHLPKLDKTVSCEFRYLTEPLKIPGCVPITGKD 181
Query: 184 VMPSCLFNKDGGHATLVKLAQRFKDVDGIIVNTFHELEPYAVNAFS--GDLNPPLYTAGP 241
+ + D + L+ +R+K+ GI+VN+F +LE A+ A P +Y GP
Sbjct: 182 FLDTVQDRNDDAYKLLLHNTKRYKEAKGILVNSFVDLESNAIKALQEPAPDKPTVYPIGP 241
Query: 242 VLHLKSQPNPDLDEAQYQKIFQWLDDLAESSVVFLCFGSSGSFDVAQVKEIAIGLERSGY 301
+++ S N +L++ WLD+ SV+++ FGS G+ Q E+AIGL SG
Sbjct: 242 LVNTSSS-NVNLEDKF--GCLSWLDNQPFGSVLYISFGSGGTLTCEQFNELAIGLAESGK 298
Query: 302 NFLWSLRVSSPKDEVSA-----HRYVTNNGVFPEGFLERIKGRGMIW-GWVPQVEILAHK 355
F+W +R SP + VS+ H P GFL+R K +G++ W PQV+ILAH
Sbjct: 299 RFIWVIR--SPSEIVSSSYFNPHSETDPFSFLPIGFLDRTKEKGLVVPSWAPQVQILAHP 356
Query: 356 AIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKELGLALDLRLDYRVGSD- 414
+ GF++HCGWNS LES+ GVP+ WP++AEQ++N +V+++G AL + G D
Sbjct: 357 STCGFLTHCGWNSTLESIVNGVPLIAWPLFAEQKMNTLLLVEDVGAALRI----HAGEDG 412
Query: 415 LVMAGDIESAVRCLMDGE--NKIRKKVKEMAEISRKSLMEGGSSFNSIGQFI 464
+V ++ V+ LM+GE I KVKE+ E + L + G S S G+ +
Sbjct: 413 IVRREEVVRVVKALMEGEEGKAIGNKVKELKEGVVRVLGDDGLSSKSFGEVL 464
>sp|Q76MR7|UBGAT_SCUBA Baicalein 7-O-glucuronosyltransferase OS=Scutellaria baicalensis
GN=UBGAT-I PE=1 SV=1
Length = 441
Score = 214 bits (545), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 138/418 (33%), Positives = 216/418 (51%), Gaps = 30/418 (7%)
Query: 61 PRICVIDLPPVDPPLPDVLKKSPEYFISLVVESHLPNVKNIVSSRSNSGSLQVTGLVLDF 120
P I LP + P PD+ E F L L N + + + S ++ ++LDF
Sbjct: 35 PSISYHRLPLPEIP-PDMTTDRVELFFEL---PRLSNPNLLTALQQISQKTRIRAVILDF 90
Query: 121 FCVSMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRISTVFESSDDELLIPGITSPV 180
FC + ++ L++P+Y + ++ L LY T + I + +D + IPG+ P+
Sbjct: 91 FCNAAFEVPTSLNIPTYYYFSAGTPTAILTLYFETIDETIPVDLQDLNDYVDIPGL-PPI 149
Query: 181 PVCVMPSCLF-NKDGGHATLVKLAQRFKDVDGIIVNTFHELEPYAVNA-------FSGDL 232
+P L K + + V +++ + GI+VN F LE A+ + F G
Sbjct: 150 HCLDIPVALSPRKSLVYKSSVDISKNLRRSAGILVNGFDALEFRAIGSHSQRPMHFKGP- 208
Query: 233 NPPLYTAGPVLHLKSQPNPDLDE---AQYQKIFQWLDDLAESSVVFLCFGSSGSFDVAQV 289
PP+Y GP++ D+D ++ + +WLD SVVFLCFG G F Q+
Sbjct: 209 TPPVYFIGPLVG-------DVDTKAGSEEHECLRWLDTQPSKSVVFLCFGRRGVFSAKQL 261
Query: 290 KEIAIGLERSGYNFLWSLRVSSPKDEVSAHRYVTNNGVFPEGFLERIKGRG-MIWGWVPQ 348
KE A LE SG+ FLWS+R + + + + PEGFLER K RG +I W PQ
Sbjct: 262 KETAAALENSGHRFLWSVRNPPELKKATGSDEPDLDELLPEGFLERTKDRGFVIKSWAPQ 321
Query: 349 VEILAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKELGLALDLRLD 408
E+LAH ++GGFV+HCG +S+ E +W+GVP+ WP+ AE +LN MV +L +AL L +
Sbjct: 322 KEVLAHDSVGGFVTHCGRSSVSEGVWFGVPMIGWPVDAELRLNRAVMVDDLQVALPLEEE 381
Query: 409 YRVGSDLVMAGDIESAVRCLMDGE--NKIRKKVKEMAEISRKSLMEGGSSFNSIGQFI 464
V A ++E VR LM+ + +R++V E+ +R ++ E GSS N + +F+
Sbjct: 382 ---AGGFVTAAELEKRVRELMETKAGKAVRQRVTELKLSARAAVAENGSSLNDLKKFL 436
>sp|O81498|U72E3_ARATH UDP-glycosyltransferase 72E3 OS=Arabidopsis thaliana GN=UGT72E3
PE=1 SV=1
Length = 481
Score = 207 bits (526), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 151/475 (31%), Positives = 240/475 (50%), Gaps = 52/475 (10%)
Query: 11 SPGIGHLVSTLEFAKHLTDRDDRISVTILSMKLAVAPWVDAYTKSLTDSQPRICVIDLPP 70
SPG+GH++ +E AK L+ + VT+ ++ A +K L + + +++LP
Sbjct: 13 SPGMGHVLPVIELAKRLS-ANHGFHVTVFVLETDAA---SVQSKLLNSTG--VDIVNLPS 66
Query: 71 ------VDPPLPDVLKKSPEYFISLVVESHLPNVKN-IVSSRSNSGSLQVTGLVLDFFCV 123
VDP V K I +++ +P +++ IV+ N T L++D F
Sbjct: 67 PDISGLVDPNAHVVTK------IGVIMREAVPTLRSKIVAMHQNP-----TALIIDLFGT 115
Query: 124 SMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRISTVFESSDDELLIPGITSPVPVC 183
+ +A EL++ +Y+F+ SN +L + +Y PT + I L IPG
Sbjct: 116 DALCLAAELNMLTYVFIASNARYLGVSIYYPTLDEVIKEEHTVQRKPLTIPGCEPVRFED 175
Query: 184 VMPSCLFNKDGGHATLVKLAQRFKDVDGIIVNTFHELEPYAVNAFSGD------LNPPLY 237
+M + L + + LV+ + DGI+VNT+ E+EP ++ + P+Y
Sbjct: 176 IMDAYLVPDEPVYHDLVRHCLAYPKADGILVNTWEEMEPKSLKSLQDPKLLGRVARVPVY 235
Query: 238 TAGPVLHLKSQPNPDLDEAQYQKIFQWLDDLAESSVVFLCFGSSGSFDVAQVKEIAIGLE 297
GP+ P +F WL+ SV+++ FGS GS Q+ E+A GLE
Sbjct: 236 PVGPLC------RPIQSSTTDHPVFDWLNKQPNESVLYISFGSGGSLTAQQLTELAWGLE 289
Query: 298 RSGYNFLWSLRV----SSPKDEVSAHRYVTNNGV---FPEGFLERIKGRG-MIWGWVPQV 349
S F+W +R SS D SA VT + PEGF+ R RG MI W PQ
Sbjct: 290 ESQQRFIWVVRPPVDGSSCSDYFSAKGGVTKDNTPEYLPEGFVTRTCDRGFMIPSWAPQA 349
Query: 350 EILAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKELGLALDLRLDY 409
EILAH+A+GGF++HCGW+S LES+ GVP+ WP++AEQ +NA + ELG++ +R+D
Sbjct: 350 EILAHQAVGGFLTHCGWSSTLESVLCGVPMIAWPLFAEQNMNAALLSDELGIS--VRVDD 407
Query: 410 RVGSDLVMAGDIESAVRCLM--DGENKIRKKVKEMAEISRKSL--MEGGSSFNSI 460
+ + IE+ VR +M D ++R+KVK++ + + SL GGS+ S+
Sbjct: 408 P--KEAISRSKIEAMVRKVMAEDEGEEMRRKVKKLRDTAEMSLSIHGGGSAHESL 460
>sp|Q94A84|U72E1_ARATH UDP-glycosyltransferase 72E1 OS=Arabidopsis thaliana GN=UGT72E1
PE=1 SV=1
Length = 487
Score = 204 bits (519), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 141/485 (29%), Positives = 239/485 (49%), Gaps = 40/485 (8%)
Query: 1 MKKAELIFVPSPGIGHLVSTLEFAKHLTDRDDRISVTILSMKLAVAPWVDAYTKSLTDSQ 60
+ K + SPG+GH++ +E K L VTI ++ A + S
Sbjct: 3 ITKPHVAMFASPGMGHIIPVIELGKRLAGSHG-FDVTIFVLETDAASAQSQFLNSPGCDA 61
Query: 61 PRICVIDLPPVDPPLPDVLKKSPEYFISLVV--ESHLPNVKNIVSSRSNSGSLQVTGLVL 118
+ ++ LP P + ++ S + I L+V +P +++ + + + T L++
Sbjct: 62 ALVDIVGLP--TPDISGLVDPSAFFGIKLLVMMRETIPTIRSKIEEMQH----KPTALIV 115
Query: 119 DFFCVSMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRISTVFESSDDELLIPGITS 178
D F + + + E ++ +Y+F+ SN FL + L+ PT + +++PG
Sbjct: 116 DLFGLDAIPLGGEFNMLTYIFIASNARFLAVALFFPTLDKDMEEEHIIKKQPMVMPGCEP 175
Query: 179 PVPVCVMPSCLFNKDGGHATLVKLAQRFKDVDGIIVNTFHELEPYAVNAFSGD------L 232
+ + L + V F DGIIVNT+ ++EP + +
Sbjct: 176 VRFEDTLETFLDPNSQLYREFVPFGSVFPTCDGIIVNTWDDMEPKTLKSLQDPKLLGRIA 235
Query: 233 NPPLYTAGPVLHLKSQPNPDLDEAQYQK-IFQWLDDLAESSVVFLCFGSSGSFDVAQVKE 291
P+Y GP+ S+P +D ++ + WL+ + SV+++ FGS GS Q+ E
Sbjct: 236 GVPVYPIGPL----SRP---VDPSKTNHPVLDWLNKQPDESVLYISFGSGGSLSAKQLTE 288
Query: 292 IAIGLERSGYNFLWSLRVSSPKDEVSAHRYVTNNG---------VFPEGFLERIKGRG-M 341
+A GLE S F+W +R P D + Y++ N PEGF+ R RG M
Sbjct: 289 LAWGLEMSQQRFVWVVR--PPVDGSACSAYLSANSGKIRDGTPDYLPEGFVSRTHERGFM 346
Query: 342 IWGWVPQVEILAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKELGL 401
+ W PQ EILAH+A+GGF++HCGWNSILES+ GVP+ WP++AEQ +NA + +ELG+
Sbjct: 347 VSSWAPQAEILAHQAVGGFLTHCGWNSILESVVGGVPMIAWPLFAEQMMNATLLNEELGV 406
Query: 402 ALDLRLDYRVGSDLVMAGDIESAVRCLMDGEN--KIRKKVKEMAEISRKSL-MEGGSSFN 458
A +R ++ +IE+ VR +M E ++RKK+K++ E + +SL +GG +
Sbjct: 407 A--VRSKKLPSEGVITRAEIEALVRKIMVEEEGAEMRKKIKKLKETAAESLSCDGGVAHE 464
Query: 459 SIGQF 463
S+ +
Sbjct: 465 SLSRI 469
>sp|Q9LVR1|U72E2_ARATH UDP-glycosyltransferase 72E2 OS=Arabidopsis thaliana GN=UGT72E2
PE=1 SV=1
Length = 481
Score = 198 bits (504), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 144/471 (30%), Positives = 236/471 (50%), Gaps = 57/471 (12%)
Query: 11 SPGIGHLVSTLEFAKHLTDRDDRISVTILSMKLAVAPWVDAYTKSLTDSQPRICVIDLPP 70
SPG+GH++ +E K L+ ++ VT+ ++ A A +K L + I + P
Sbjct: 13 SPGMGHVIPVIELGKRLS-ANNGFHVTVFVLETDAA---SAQSKFLNSTGVDIVKLPSPD 68
Query: 71 ----VDPPLPDVLKKSPEYFISLVVESHLPNVKNIVSSRSNSGSLQVTGLVLDFFCVSMV 126
VDP V K I +++ + +P +++ +++ + T L++D F +
Sbjct: 69 IYGLVDPDDHVVTK------IGVIMRAAVPALRSKIAAMHQ----KPTALIVDLFGTDAL 118
Query: 127 DIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRISTVFESSDDELLIPGITSPVPVCVMP 186
+AKE ++ SY+F+ +N FL + +Y P I + L IPG +
Sbjct: 119 CLAKEFNMLSYVFIPTNARFLGVSIYYPNLDKDIKEEHTVQRNPLAIPGCEPVRFEDTLD 178
Query: 187 SCLFNKDGGHATLVKLAQRFKDVDGIIVNTFHELEPYAVNAFSGDLNP---------PLY 237
+ L + + V+ + DGI+VNT+ E+EP ++ + LNP P+Y
Sbjct: 179 AYLVPDEPVYRDFVRHGLAYPKADGILVNTWEEMEPKSLKSL---LNPKLLGRVARVPVY 235
Query: 238 TAGPVLHLKSQPNPDLDEAQYQKIFQWLDDLAESSVVFLCFGSSGSFDVAQVKEIAIGLE 297
GP+ D + WL++ SV+++ FGS G Q+ E+A GLE
Sbjct: 236 PIGPLCRPIQSSETD------HPVLDWLNEQPNESVLYISFGSGGCLSAKQLTELAWGLE 289
Query: 298 RSGYNFLWSLRVSSPKDEVSAHRYVTNNG---------VFPEGFLERIKGRGMIW-GWVP 347
+S F+W +R P D YV+ NG PEGF+ R RG + W P
Sbjct: 290 QSQQRFVWVVR--PPVDGSCCSEYVSANGGGTEDNTPEYLPEGFVSRTSDRGFVVPSWAP 347
Query: 348 QVEILAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKELGLALDLRL 407
Q EIL+H+A+GGF++HCGW+S LES+ GVP+ WP++AEQ +NA + ELG+A +RL
Sbjct: 348 QAEILSHRAVGGFLTHCGWSSTLESVVGGVPMIAWPLFAEQNMNAALLSDELGIA--VRL 405
Query: 408 DYRVGSDLVMAGDIESAVRCLM---DGENKIRKKVKEMAEISRKSL-MEGG 454
D + + IE+ VR +M +GE +R+KVK++ + + SL ++GG
Sbjct: 406 DD--PKEDISRWKIEALVRKVMTEKEGE-AMRRKVKKLRDSAEMSLSIDGG 453
>sp|O23205|U72C1_ARATH UDP-glycosyltransferase 72C1 OS=Arabidopsis thaliana GN=UGT72C1
PE=2 SV=3
Length = 457
Score = 194 bits (492), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 148/468 (31%), Positives = 227/468 (48%), Gaps = 53/468 (11%)
Query: 8 FVPSPGIGHLVSTLEFAKHLTDRDDRISVTILSMKLAVAPWVDAYTKSLTDSQPRICVID 67
V SPG+GH V LE KHL + VT+ + V+ K+L + P+ VI
Sbjct: 7 LVASPGMGHAVPILELGKHLLNHHGFDRVTVFLVTDDVSRSKSLIGKTLMEEDPKF-VIR 65
Query: 68 LPPVDPPLPDVLKKSPEYFISLVVESHLPNVKNIVSSRSNSGSLQVTGLVLDFFCVSMVD 127
P+D D L S ++ ++ LP +K+ V + V+D ++
Sbjct: 66 FIPLDVSGQD-LSGSLLTKLAEMMRKALPEIKSSVMELEPRPRV----FVVDLLGTEALE 120
Query: 128 IAKELS-LPSYMFLTSNMGFLRLMLYLPTRQDRISTVFESSDDELLIPGITSPVPVCVMP 186
+AKEL + ++ +T++ FL +Y+ + + SS LLIPG + P
Sbjct: 121 VAKELGIMRKHVLVTTSAWFLAFTVYMASLDKQELYKQLSSIGALLIPGCS--------P 172
Query: 187 SCLFNKDGGHATLVKLA--QRFKD----VDGIIVNTFHELEPYAVNAFSGDLNP------ 234
+ +LA QR D DG+ VNT+H LE + +F L+P
Sbjct: 173 VKFERAQDPRKYIRELAESQRIGDEVITADGVFVNTWHSLEQVTIGSF---LDPENLGRV 229
Query: 235 ----PLYTAGPVLHLKSQPNPDLDEAQYQKIFQWLDDLAESSVVFLCFGSSGSFDVAQVK 290
P+Y GP++ P L + WLD + SVV++ FGS G+ Q
Sbjct: 230 MRGVPVYPVGPLVR---PAEPGLKHG----VLDWLDLQPKESVVYVSFGSGGALTFEQTN 282
Query: 291 EIAIGLERSGYNFLWSLRVSSPKDEVSAHRYVTNNGV-----FPEGFLERIKGRGMI-WG 344
E+A GLE +G+ F+W +R + D ++ T N P GFL+R K G++
Sbjct: 283 ELAYGLELTGHRFVWVVRPPAEDDPSASMFDKTKNETEPLDFLPNGFLDRTKDIGLVVRT 342
Query: 345 WVPQVEILAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKELGLALD 404
W PQ EILAHK+ GGFV+HCGWNS+LES+ GVP+ WP+Y+EQ++NA + EL +AL
Sbjct: 343 WAPQEEILAHKSTGGFVTHCGWNSVLESIVNGVPMVAWPLYSEQKMNARMVSGELKIALQ 402
Query: 405 LRLDYRVGSDLVMAGDIESAVRCLMDGE--NKIRKKVKEMAEISRKSL 450
+ V +V I V+ +MD E ++RK VKE+ + + ++L
Sbjct: 403 I----NVADGIVKKEVIAEMVKRVMDEEEGKEMRKNVKELKKTAEEAL 446
>sp|Q8W491|U73B3_ARATH UDP-glycosyltransferase 73B3 OS=Arabidopsis thaliana GN=UGT73B3
PE=2 SV=1
Length = 481
Score = 185 bits (470), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 141/489 (28%), Positives = 238/489 (48%), Gaps = 45/489 (9%)
Query: 2 KKAELIFVPSPGIGHLVSTLEFAKHLTDRDDRISV--TILSMKLAVAPWVDAYTKSLTDS 59
+K ++F P GH++ TL+ AK + R + ++ T L+ K+ P ++ +
Sbjct: 7 RKLHVVFFPFMAYGHMIPTLDMAKLFSSRGAKSTILTTPLNSKIFQKP-IERFKNLNPSF 65
Query: 60 QPRICVIDLPPVDPPLPDVLKKSPEYFISLVVESHLPNVKNIVSSRSNSGSLQV------ 113
+ I + D P VD LP+ + + + + +K S+R L+
Sbjct: 66 EIDIQIFDFPCVDLGLPEGCENVDFFTSNNNDDRQYLTLKFFKSTRFFKDQLEKLLETTR 125
Query: 114 -TGLVLDFFCVSMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRISTVFESSDDELL 172
L+ D F + A++ ++P +F G+ L R + S + +
Sbjct: 126 PDCLIADMFFPWATEAAEKFNVPRLVF--HGTGYFSLCSEYCIRVHNPQNIVASRYEPFV 183
Query: 173 IPGITSPVPVCVMPSCLFNKDGGHATLVKLAQRFKDVD----GIIVNTFHELEPYAVNAF 228
IP + P + + + ++D + + K K+ D G+IVN+F+ELEP + +
Sbjct: 184 IPDL--PGNIVITQEQIADRDE-ESEMGKFMIEVKESDVKSSGVIVNSFYELEPDYADFY 240
Query: 229 SGDLNPPLYTAGPVLHLKSQPNPDLDEAQYQKI-----FQWLDDLAESSVVFLCFGSSGS 283
+ + GP+ + + I +WLD SV+++ FGS
Sbjct: 241 KSVVLKRAWHIGPLSVYNRGFEEKAERGKKASINEVECLKWLDSKKPDSVIYISFGSVAC 300
Query: 284 FDVAQVKEIAIGLERSGYNFLWSLR--VSSPKDEVSAHRYVTNNGVFPEGFLERIKGRGM 341
F Q+ EIA GLE SG NF+W +R + K+E PEGF ER+KG+GM
Sbjct: 301 FKNEQLFEIAAGLETSGANFIWVVRKNIGIEKEEW-----------LPEGFEERVKGKGM 349
Query: 342 I-WGWVPQVEILAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKEL- 399
I GW PQV IL H+A GFV+HCGWNS+LE + G+P+ TWP+ AEQ N ++V ++
Sbjct: 350 IIRGWAPQVLILDHQATCGFVTHCGWNSLLEGVAAGLPMVTWPVAAEQFYNE-KLVTQVL 408
Query: 400 --GLALDLRLDYRVGSDLVMAGDIESAVRCLMDGE--NKIRKKVKEMAEISRKSLMEGGS 455
G+++ + + R D + + AVR ++ GE ++ R++ K++AE++ K+ +EGGS
Sbjct: 409 RTGVSVGAKKNVRTTGDFISREKVVKAVREVLVGEEADERRERAKKLAEMA-KAAVEGGS 467
Query: 456 SFNSIGQFI 464
SFN + FI
Sbjct: 468 SFNDLNSFI 476
>sp|Q9ZQG4|U73B5_ARATH UDP-glycosyltransferase 73B5 OS=Arabidopsis thaliana GN=UGT73B5
PE=2 SV=1
Length = 484
Score = 183 bits (465), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 137/488 (28%), Positives = 243/488 (49%), Gaps = 42/488 (8%)
Query: 2 KKAELIFVPSPGIGHLVSTLEFAKHLTDRDDRISV--TILSMKLAVAPWVDAYTKSLTDS 59
++ ++F P GH++ L+ AK + R + ++ T ++ K+ P ++A+ D
Sbjct: 7 ERIHILFFPFMAQGHMIPILDMAKLFSRRGAKSTLLTTPINAKIFEKP-IEAFKNQNPDL 65
Query: 60 QPRICVIDLPPVDPPLPDVLKKSPEYFISLVVESHLPNV--KNIVSSRSNSGSLQV---- 113
+ I + + P V+ LP+ + + FI+ +S ++ K + S++ L+
Sbjct: 66 EIGIKIFNFPCVELGLPEGCENAD--FINSYQKSDSGDLFLKFLFSTKYMKQQLESFIET 123
Query: 114 ---TGLVLDFFCVSMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRISTVFESSDDE 170
+ LV D F + A++L +P +F ++ F L R + +S
Sbjct: 124 TKPSALVADMFFPWATESAEKLGVPRLVFHGTS--FFSLCCSYNMRIHKPHKKVATSSTP 181
Query: 171 LLIPGITSPVPVCVMPSCLFNKDGGHATLVK-LAQRFKDVDGIIVNTFHELEPYAVNAFS 229
+IPG+ + + + + ++ +K + + + G++VN+F+ELE + +
Sbjct: 182 FVIPGLPGDIVITEDQANVAKEETPMGKFMKEVRESETNSFGVLVNSFYELESAYADFYR 241
Query: 230 GDLNPPLYTAGPVLHLKSQPNPDLDEAQY---------QKIFQWLDDLAESSVVFLCFGS 280
+ + GP+ S N +L E Q+ +WLD SVV+L FGS
Sbjct: 242 SFVAKRAWHIGPL----SLSNRELGEKARRGKKANIDEQECLKWLDSKTPGSVVYLSFGS 297
Query: 281 SGSFDVAQVKEIAIGLERSGYNFLWSLRVSSPKDEVSAHRYVTNNGVFPEGFLERIKGRG 340
+F Q+ EIA GLE SG +F+W +R + + + N PEGF ER G+G
Sbjct: 298 GTNFTNDQLLEIAFGLEGSGQSFIWVVRKNENQGD--------NEEWLPEGFKERTTGKG 349
Query: 341 MIW-GWVPQVEILAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKEL 399
+I GW PQV IL HKAIGGFV+HCGWNS +E + G+P+ TWP+ AEQ N + K L
Sbjct: 350 LIIPGWAPQVLILDHKAIGGFVTHCGWNSAIEGIAAGLPMVTWPMGAEQFYNEKLLTKVL 409
Query: 400 GLALDLRLDYRV-GSDLVMAGDIESAVRCLMDGENKIRKKV--KEMAEISRKSLMEGGSS 456
+ +++ V L+ +E AVR ++ GE +++ K++ E+++ ++ EGGSS
Sbjct: 410 RIGVNVGATELVKKGKLISRAQVEKAVREVIGGEKAEERRLWAKKLGEMAKAAVEEGGSS 469
Query: 457 FNSIGQFI 464
+N + +F+
Sbjct: 470 YNDVNKFM 477
>sp|Q7Y232|U73B4_ARATH UDP-glycosyltransferase 73B4 OS=Arabidopsis thaliana GN=UGT73B4
PE=2 SV=1
Length = 484
Score = 181 bits (458), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 142/484 (29%), Positives = 240/484 (49%), Gaps = 39/484 (8%)
Query: 6 LIFVPSPGIGHLVSTLEFAKHLTDRDDRISV--TILSMKLAVAPWVDAYTKSLTDSQPRI 63
++F P GH++ L+ AK R + ++ T ++ K+ P ++A+ D + I
Sbjct: 8 ILFFPFMAHGHMIPLLDMAKLFARRGAKSTLLTTPINAKILEKP-IEAFKVQNPDLEIGI 66
Query: 64 CVIDLPPVDPPLPDVLKKSPEYFISLVVESHLPNV--KNIVSSRSNSGSLQV-------T 114
+++ P V+ LP+ + FI+ +S ++ K + S++ L+ +
Sbjct: 67 KILNFPCVELGLPEGCENRD--FINSYQKSDSFDLFLKFLFSTKYMKQQLESFIETTKPS 124
Query: 115 GLVLDFFCVSMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRISTVFESSDDELLIP 174
LV D F + A+++ +P +F ++ L R + SS +IP
Sbjct: 125 ALVADMFFPWATESAEKIGVPRLVFHGTSS--FALCCSYNMRIHKPHKKVASSSTPFVIP 182
Query: 175 GITSPVPVCVMPSCLFNKDGGHATLVKLAQRFKDVD-GIIVNTFHELEPYAVNAFSGDLN 233
G+ + + + + N++ K + + G++VN+F+ELE + + +
Sbjct: 183 GLPGDIVITEDQANVTNEETPFGKFWKEVRESETSSFGVLVNSFYELESSYADFYRSFVA 242
Query: 234 PPLYTAGPVLHLKSQ---------PNPDLDEAQYQKIFQWLDDLAESSVVFLCFGSSGSF 284
+ GP L L ++ ++DE Q+ +WLD SVV+L FGS
Sbjct: 243 KKAWHIGP-LSLSNRGIAEKAGRGKKANIDE---QECLKWLDSKTPGSVVYLSFGSGTGL 298
Query: 285 DVAQVKEIAIGLERSGYNFLWSLRVSSPKDEVSAHRYVTNNGVFPEGFLERIKGRGMI-W 343
Q+ EIA GLE SG NF+W VS +++V N P+GF ER KG+G+I
Sbjct: 299 PNEQLLEIAFGLEGSGQNFIWV--VSKNENQVGTGE---NEDWLPKGFEERNKGKGLIIR 353
Query: 344 GWVPQVEILAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKELGLAL 403
GW PQV IL HKAIGGFV+HCGWNS LE + G+P+ TWP+ AEQ N + K L + +
Sbjct: 354 GWAPQVLILDHKAIGGFVTHCGWNSTLEGIAAGLPMVTWPMGAEQFYNEKLLTKVLRIGV 413
Query: 404 DLRLDYRV-GSDLVMAGDIESAVRCLMDGEN--KIRKKVKEMAEISRKSLMEGGSSFNSI 460
++ V L+ +E AVR ++ GE + R + KE+ E+++ ++ EGGSS+N +
Sbjct: 414 NVGATELVKKGKLISRAQVEKAVREVIGGEKAEERRLRAKELGEMAKAAVEEGGSSYNDV 473
Query: 461 GQFI 464
+F+
Sbjct: 474 NKFM 477
>sp|Q8VZE9|U73B1_ARATH UDP-glycosyltransferase 73B1 OS=Arabidopsis thaliana GN=UGT73B1
PE=2 SV=1
Length = 488
Score = 170 bits (431), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 130/475 (27%), Positives = 224/475 (47%), Gaps = 37/475 (7%)
Query: 3 KAELIFVPSPGIGHLVSTLEFAKHLTDRDDRISVTILSMKLAVAPWVDAYTKSLTDSQP- 61
K + P GH++ TL+ AK + + TIL+ L + + KS P
Sbjct: 9 KLHFLLFPFMAHGHMIPTLDMAKLFATKGAK--STILTTPLNAKLFFEKPIKSFNQDNPG 66
Query: 62 ----RICVIDLPPVDPPLPDVLKKS------PEYFISLVVESHLPNVKNIVSSRSNS-GS 110
I +++ P + LPD + + P+ + + + L +K +
Sbjct: 67 LEDITIQILNFPCTELGLPDGCENTDFIFSTPDLNVGDLSQKFLLAMKYFEEPLEELLVT 126
Query: 111 LQVTGLVLDFFCVSMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRISTVFESSDDE 170
++ LV + F +A++ +P +F G+ L + R+ +S +
Sbjct: 127 MRPDCLVGNMFFPWSTKVAEKFGVPRLVF--HGTGYFSL---CASHCIRLPKNVATSSEP 181
Query: 171 LLIPGITSPVPVCVMPSCLFNKDGGHATLVK-LAQRFKDVDGIIVNTFHELEPYAVNAFS 229
+IP + + + ++ +K + +D G++VN+F+ELE + F
Sbjct: 182 FVIPDLPGDILITEEQVMETEEESVMGRFMKAIRDSERDSFGVLVNSFYELEQAYSDYFK 241
Query: 230 GDLNPPLYTAGPVLHLKSQPNPDLDEAQYQKI-----FQWLDDLAESSVVFLCFGSSGSF 284
+ + GP+ + + + I +WLD SV+++ FG+ SF
Sbjct: 242 SFVAKRAWHIGPLSLGNRKFEEKAERGKKASIDEHECLKWLDSKKCDSVIYMAFGTMSSF 301
Query: 285 DVAQVKEIAIGLERSGYNFLWSLRVSSPKDEVSAHRYVTNNGVFPEGFLERIKGRGMI-W 343
Q+ EIA GL+ SG++F+W + + V PEGF E+ KG+G+I
Sbjct: 302 KNEQLIEIAAGLDMSGHDFVWVVNRKGSQ--------VEKEDWLPEGFEEKTKGKGLIIR 353
Query: 344 GWVPQVEILAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNA--FRMVKELGL 401
GW PQV IL HKAIGGF++HCGWNS+LE + G+P+ TWP+ AEQ N V + G+
Sbjct: 354 GWAPQVLILEHKAIGGFLTHCGWNSLLEGVAAGLPMVTWPVGAEQFYNEKLVTQVLKTGV 413
Query: 402 ALDLRLDYRVGSDLVMAGDIESAVRCLMDGENKIRKKVKEMAEISRKSLMEGGSS 456
++ ++ +V D + +E AVR +M GE + RK+ KE+AE+++ ++ EGGSS
Sbjct: 414 SVGVKKMMQVVGDFISREKVEGAVREVMVGEER-RKRAKELAEMAKNAVKEGGSS 467
>sp|Q9ZQ97|U73C4_ARATH UDP-glycosyltransferase 73C4 OS=Arabidopsis thaliana GN=UGT73C4
PE=2 SV=1
Length = 496
Score = 168 bits (425), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 140/488 (28%), Positives = 237/488 (48%), Gaps = 54/488 (11%)
Query: 6 LIFVPSPGIGHLVSTLEFAKHLTDRDDRISVTILSMKLAVAPWVDAYTKSLTDSQP-RIC 64
I P GH++ ++ A+ L R +VTI++ + + + ++++ P I
Sbjct: 15 FILFPFMAQGHMIPMIDIARLLAQRG--ATVTIVTTRYNAGRFENVLSRAMESGLPINIV 72
Query: 65 VIDLPPVDPPLPDVLKKSPEY--------FISLVVESHLPNVKNIVSSRSNSGSLQVTGL 116
++ P + LP+ + Y F V P +K ++ S ++ L
Sbjct: 73 HVNFPYQEFGLPEGKENIDSYDSMELMVPFFQAVNMLEDPVMK-LMEEMKPRPSCIISDL 131
Query: 117 VLDFFCVSMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRISTVFESSDDELLIPGI 176
+L + IA++ S+P +F + F L +++ R I +S D L+P
Sbjct: 132 LLPY----TSKIARKFSIPKIVFHGTGC-FNLLCMHVLRRNLEILKNLKSDKDYFLVPSF 186
Query: 177 ------TSP-VPVCVMPSCLFNKDGGHATLVKLAQRFKDVDGIIVNTFHELEPYAVNAFS 229
T P VPV S + A L ++ + G+IVNTF ELEP V ++
Sbjct: 187 PDRVEFTKPQVPVETTASGDWK-----AFLDEMVEAEYTSYGVIVNTFQELEPAYVKDYT 241
Query: 230 GDLNPPLYTAGPVLHLKSQPNPDLDEAQYQ------KIFQWLDDLAESSVVFLCFGSSGS 283
+++ GPV L ++ D E Q + QWLD + SV+++C GS +
Sbjct: 242 KARAGKVWSIGPV-SLCNKAGADKAERGNQAAIDQDECLQWLDSKEDGSVLYVCLGSICN 300
Query: 284 FDVAQVKEIAIGLERSGYNFLWSLRVSSPKDEVSAHRYVTNNGVFPEGFLERIKGRGM-I 342
++Q+KE+ +GLE+S +F+W +R +E+ + GF ERIK RG+ I
Sbjct: 301 LPLSQLKELGLGLEKSQRSFIWVIRGWEKYNELY-------EWMMESGFEERIKERGLLI 353
Query: 343 WGWVPQVEILAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKEL--G 400
GW PQV IL+H ++GGF++HCGWNS LE + G+P+ TWP++ +Q N +V+ L G
Sbjct: 354 KGWSPQVLILSHPSVGGFLTHCGWNSTLEGITSGIPLITWPLFGDQFCNQKLVVQVLKAG 413
Query: 401 LALDLRLDYRVGSD-----LVMAGDIESAVRCLM---DGENKIRKKVKEMAEISRKSLME 452
++ + + G + LV ++ AV LM D + R++VKE+ E + K++ E
Sbjct: 414 VSAGVEEVMKWGEEEKIGVLVDKEGVKKAVEELMGASDDAKERRRRVKELGESAHKAVEE 473
Query: 453 GGSSFNSI 460
GGSS ++I
Sbjct: 474 GGSSHSNI 481
>sp|Q2V6J9|UFOG7_FRAAN UDP-glucose flavonoid 3-O-glucosyltransferase 7 OS=Fragaria
ananassa GN=GT7 PE=1 SV=1
Length = 487
Score = 166 bits (421), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 135/494 (27%), Positives = 220/494 (44%), Gaps = 73/494 (14%)
Query: 2 KKAELIFVPSPGIGHLVSTLEFAKHLTDRDDRISVTILSMKLAVAPWVDAYTKSLTDSQP 61
++ + F+P GH + + AK + R ++ + AP ++K+ +
Sbjct: 9 QQLHIFFLPFMARGHSIPLTDIAKLFSSHGARCTIVTTPLN---AP---LFSKATQRGEI 62
Query: 62 RICVIDLPPVDPPLP------------DVLKKSPEYFISLVVESHLPNVKNIVSSRSNSG 109
+ +I P + LP D+L K + + ++E H + + R +
Sbjct: 63 ELVLIKFPSAEAGLPQDCESADLITTQDMLGKFVK--ATFLIEPHFEKILD--EHRPHC- 117
Query: 110 SLQVTGLVLDFFCVSMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRISTVFESSDD 169
LV D F D+A + +P F GF L L + + S +
Sbjct: 118 ------LVADAFFTWATDVAAKFRIPRLYF--HGTGFFALCASLSVMMYQPHSNLSSDSE 169
Query: 170 ELLIPGITSPVPVCVMPSCLFNKDGGHATLVKLAQRFKDVD-GIIVNTFHELEPYAVNAF 228
+IP + + + +F + ++K + ++ G+IVN+F+ELEP N +
Sbjct: 170 SFVIPNLPDEIKMTRSQLPVFPDESEFMKMLKASIEIEERSYGVIVNSFYELEPAYANHY 229
Query: 229 SGDLNPPLYTAGPVLHLKSQPNPDLDE-------AQYQKIFQWLDDLAESSVVFLCFGSS 281
+ GPV + A+ + +WLD SVV++ FGS
Sbjct: 230 RKVFGRKAWHIGPVSFCNKAIEDKAERGSIKSSTAEKHECLKWLDSKKPRSVVYVSFGSM 289
Query: 282 GSFDVAQVKEIAIGLERSGYNFLWSLRVSSPKDEVSAHRYVTNNGVFPEGFLERIKGRGM 341
F +Q+ EIA GLE SG +F+W ++ PEGF +R++G+G+
Sbjct: 290 VRFADSQLLEIATGLEASGQDFIWVVKKEK----------KEVEEWLPEGFEKRMEGKGL 339
Query: 342 I-WGWVPQVEILAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKELG 400
I W PQV IL H+AIG FV+HCGWNSILE++ GVP+ TWP++ EQ N ++V E+
Sbjct: 340 IIRDWAPQVLILEHEAIGAFVTHCGWNSILEAVSAGVPMITWPVFGEQFYNE-KLVTEIH 398
Query: 401 LALDLRLDYRVGSD---------------LVMAGDIESAVRCLMDGENKI--RKKVKEMA 443
R+ VGS+ V IE AV +M G+ + R +VKE+
Sbjct: 399 -----RIGVPVGSEKWALSFVDVNAETEGRVRREAIEEAVTRIMVGDEAVETRSRVKELG 453
Query: 444 EISRKSLMEGGSSF 457
E +R+++ EGGSSF
Sbjct: 454 ENARRAVEEGGSSF 467
>sp|Q9SCP5|U73C7_ARATH UDP-glycosyltransferase 73C7 OS=Arabidopsis thaliana GN=UGT73C7
PE=2 SV=1
Length = 490
Score = 163 bits (413), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 117/356 (32%), Positives = 179/356 (50%), Gaps = 27/356 (7%)
Query: 128 IAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRISTVFESSDDELLIPGITSPVPVCVMPS 187
+AK+ +P +F LM R+ I + ES+D+ +PG+ V
Sbjct: 134 LAKKFKIPKLIF--HGFSCFSLMSIQVVRESGILKMIESNDEYFDLPGLPDKVEFTKPQV 191
Query: 188 CLFNKDGGH--ATLVKLAQRFKDVDGIIVNTFHELEPYAVNAFSGDLNPPLYTAGPV--- 242
+ G+ + K+ + D G+IVNTF ELE + ++ GPV
Sbjct: 192 SVLQPVEGNMKESTAKIIEADNDSYGVIVNTFEELEVDYAREYRKARAGKVWCVGPVSLC 251
Query: 243 --LHLKSQPNPDLDEAQYQKIFQWLDDLAESSVVFLCFGSSGSFDVAQVKEIAIGLERSG 300
L L D + QWLD SV+++C GS + +AQ+KE+ +GLE S
Sbjct: 252 NRLGLDKAKRGDKASIGQDQCLQWLDSQETGSVLYVCLGSLCNLPLAQLKELGLGLEASN 311
Query: 301 YNFLWSLRVSSPKDEVSAHRYVTNNGVFPEGFLERIKGRGM-IWGWVPQVEILAHKAIGG 359
F+W +R +++ N + GF ERIK RG+ I GW PQV IL+H +IGG
Sbjct: 312 KPFIWVIREWGKYGDLA-------NWMQQSGFEERIKDRGLVIKGWAPQVFILSHASIGG 364
Query: 360 FVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKEL--GLALDLRLDYRVGSD--- 414
F++HCGWNS LE + GVP+ TWP++AEQ LN +V+ L GL + + + G +
Sbjct: 365 FLTHCGWNSTLEGITAGVPLLTWPLFAEQFLNEKLVVQILKAGLKIGVEKLMKYGKEEEI 424
Query: 415 --LVMAGDIESAVRCLM-DGE--NKIRKKVKEMAEISRKSLMEGGSSFNSIGQFIS 465
+V + AV LM D E + R+KV E+++++ K+L +GGSS ++I I
Sbjct: 425 GAMVSRECVRKAVDELMGDSEEAEERRRKVTELSDLANKALEKGGSSDSNITLLIQ 480
>sp|Q8H0F2|ANGT_GENTR Anthocyanin 3'-O-beta-glucosyltransferase OS=Gentiana triflora PE=1
SV=1
Length = 482
Score = 163 bits (412), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 137/488 (28%), Positives = 228/488 (46%), Gaps = 39/488 (7%)
Query: 1 MKKAELIFVPSPGIGHLVSTLEFAKHLTDRDDRISVTILSMKLAVAPWVDAYTKS-LTDS 59
M + + F P GH++ T++ AK + R + T+++ A ++ A +S +
Sbjct: 1 MDQLHVFFFPFLANGHILPTIDMAKLFSSRG--VKATLITTHNNSAIFLKAINRSKILGF 58
Query: 60 QPRICVIDLPPVDPPLPDVLKKSPE-YFISLVVESHLPNVKNIVSSRSNSGSLQVTGLVL 118
+ I P + LP+ + + + I ++ E + + LV
Sbjct: 59 DISVLTIKFPSAEFGLPEGYETADQARSIDMMDEFFRACILLQEPLEELLKEHRPQALVA 118
Query: 119 DFFCVSMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRISTVFESSDDELLIPGITS 178
D F D A + +P +F S+ ++ R+++ S D ++P I
Sbjct: 119 DLFFYWANDAAAKFGIPRLLFHGSSS--FAMIAAESVRRNKPYKNLSSDSDPFVVPDI-- 174
Query: 179 PVPVCVMPSCLFNKDGGHATLVKLAQRFKDVD-------GIIVNTFHELEPYAVNAFSGD 231
P + + S + D + + +K++ G+IVN+F+ELEP V+
Sbjct: 175 PDKIILTKSQVPTPDETEENNTHITEMWKNISESENDCYGVIVNSFYELEPDYVDYCKNV 234
Query: 232 LNPPLYTAGPVLHLKSQPNPDLDEA------QYQKIFQWLDDLAESSVVFLCFGSSGSFD 285
L + GP L L + D+ E + WLD SVV++CFGS +F+
Sbjct: 235 LGRRAWHIGP-LSLCNNEGEDVAERGKKSDIDAHECLNWLDSKNPDSVVYVCFGSMANFN 293
Query: 286 VAQVKEIAIGLERSGYNFLWSLRVSSPKDEVSAHRYVTNNGVFPEGFLERIK--GRGMIW 343
AQ+ E+A+GLE SG F+W +R +++ S FP+GF +R++ +G+I
Sbjct: 294 AAQLHELAMGLEESGQEFIWVVRTCVDEEDESKW--------FPDGFEKRVQENNKGLII 345
Query: 344 -GWVPQVEILAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKEL--G 400
GW PQV IL H+A+G FVSHCGWNS LE + GV + TWP++AEQ N M L G
Sbjct: 346 KGWAPQVLILEHEAVGAFVSHCGWNSTLEGICGGVAMVTWPLFAEQFYNEKLMTDILRTG 405
Query: 401 LALDLRLDYRVGSDLVMAG--DIESAVRCLMDGEN--KIRKKVKEMAEISRKSLMEGGSS 456
+++ RV + V+ I AVR LM E IR + K + E ++K++ GGSS
Sbjct: 406 VSVGSLQWSRVTTSAVVVKRESISKAVRRLMAEEEGVDIRNRAKALKEKAKKAVEGGGSS 465
Query: 457 FNSIGQFI 464
++ + +
Sbjct: 466 YSDLSALL 473
>sp|Q9LXV0|U92A1_ARATH UDP-glycosyltransferase 92A1 OS=Arabidopsis thaliana GN=UGT92A1
PE=2 SV=1
Length = 488
Score = 162 bits (411), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 117/363 (32%), Positives = 184/363 (50%), Gaps = 33/363 (9%)
Query: 119 DFFCVSMVDIAKELSLPSYMFLTSN---MGFLR-LMLYLPTRQDRISTVFESSDDELLIP 174
DFF + + KE+ + S +F S +G R + L LP + E+ D+ L+
Sbjct: 132 DFFLGWIGKVCKEVGVYSVIFSASGAFGLGCYRSIWLNLPHK--------ETKQDQFLLD 183
Query: 175 GI--TSPVPVCVMPSCLFNKDGG---HATLVKLAQRFKDVDGIIVNTFHELEPYAVNAFS 229
+ + S + DG + K+ + D DG + NT E++ ++ F
Sbjct: 184 DFPEAGEIEKTQLNSFMLEADGTDDWSVFMKKIIPGWSDFDGFLFNTVAEIDQMGLSYFR 243
Query: 230 GDLNPPLYTAGPVLHLKSQPNPDLDEAQYQKIFQWLDDLAESSVVFLCFGSSGSFDVAQV 289
P++ GPVL KS + + + WLD + SVV++CFGS S +
Sbjct: 244 RITGVPVWPVGPVL--KSPDKKVGSRSTEEAVKSWLDSKPDHSVVYVCFGSMNSILQTHM 301
Query: 290 KEIAIGLERSGYNFLWSLRVSSPKDEVSAHRYVTNNGVFPEGFLERI--KGRGM-IWGWV 346
E+A+ LE S NF+W +R P V G PEGF ERI RG+ + W
Sbjct: 302 LELAMALESSEKNFIWVVR---PPIGVEVKSEFDVKGYLPEGFEERITRSERGLLVKKWA 358
Query: 347 PQVEILAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKELGLALDLR 406
PQV+IL+HKA F+SHCGWNSILESL +GVP+ WP+ AEQ N+ M K +G+++++
Sbjct: 359 PQVDILSHKATCVFLSHCGWNSILESLSHGVPLLGWPMAAEQFFNSILMEKHIGVSVEVA 418
Query: 407 LDYRVGSDLVMAGDIESAVRCLMDGE---NKIRKKVKEMAEISRKSLMEG--GSSFNSIG 461
R + DI S ++ +M+ +IRKK +E+ E+ R+++++G GSS +
Sbjct: 419 RGKRCE---IKCDDIVSKIKLVMEETEVGKEIRKKAREVKELVRRAMVDGVKGSSVIGLE 475
Query: 462 QFI 464
+F+
Sbjct: 476 EFL 478
>sp|Q94C57|U73B2_ARATH UDP-glucosyl transferase 73B2 OS=Arabidopsis thaliana GN=UGT73B2
PE=1 SV=1
Length = 483
Score = 162 bits (409), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 140/496 (28%), Positives = 242/496 (48%), Gaps = 58/496 (11%)
Query: 2 KKAELIFVPSPGIGHLVSTLEFAKHLTDRDDRISV--TILSMKLAVAPWVDAYTKSLTDS 59
+K ++F P GH++ TL+ AK + R + ++ T L+ K+ P +D +
Sbjct: 8 RKLHVMFFPFMAYGHMIPTLDMAKLFSSRGAKSTILTTSLNSKILQKP-IDTFKNLNPGL 66
Query: 60 QPRICVIDLPPVDPPLPDVLKKSPEYFISLVVESHLPNVKNIVSSRSNS-------GSLQ 112
+ I + + P V+ LP+ + + + + + VK S+R G+ +
Sbjct: 67 EIDIQIFNFPCVELGLPEGCENVDFFTSNNNDDKNEMIVKFFFSTRFFKDQLEKLLGTTR 126
Query: 113 VTGLVLDFFCVSMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRISTVFESSDDELL 172
L+ D F + A + ++P +F G+ L + SS + +
Sbjct: 127 PDCLIADMFFPWATEAAGKFNVPRLVF--HGTGYFSLCAGYCIGVHKPQKRVASSSEPFV 184
Query: 173 IPGITSPVPVCVMPSCLFNKDGGHATLVKLAQRFKDVD----GIIVNTFHELEPYAVNAF 228
IP + P + + + + DG + + K ++ + G+++N+F+ELE + +
Sbjct: 185 IPEL--PGNIVITEEQIIDGDG-ESDMGKFMTEVRESEVKSSGVVLNSFYELEHDYADFY 241
Query: 229 SGDLNPPLYTAGPV------LHLKSQ--PNPDLDEAQYQKIFQWLDDLAESSVVFLCFGS 280
+ + GP+ K++ ++DEA+ +WLD +SV+++ FGS
Sbjct: 242 KSCVQKRAWHIGPLSVYNRGFEEKAERGKKANIDEAE---CLKWLDSKKPNSVIYVSFGS 298
Query: 281 SGSFDVAQVKEIAIGLERSGYNFLWSLRVSSPKDEVSAHRYVTNNGVFPEGFLERIKGRG 340
F Q+ EIA GLE SG +F+W +R + E PEGF ER+KG+G
Sbjct: 299 VAFFKNEQLFEIAAGLEASGTSFIWVVRKTKDDREE----------WLPEGFEERVKGKG 348
Query: 341 MI-WGWVPQVEILAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKEL 399
MI GW PQV IL H+A GGFV+HCGWNS+LE + G+P+ TWP+ AEQ N ++V ++
Sbjct: 349 MIIRGWAPQVLILDHQATGGFVTHCGWNSLLEGVAAGLPMVTWPVGAEQFYNE-KLVTQV 407
Query: 400 GLALDLRLDYRVGSD---LVMAGD------IESAVRCLMDG--ENKIRKKVKEMAEISRK 448
LR VG+ VM GD ++ AVR ++ G + R++ K++A +++
Sbjct: 408 -----LRTGVSVGASKHMKVMMGDFISREKVDKAVREVLAGEAAEERRRRAKKLAAMAKA 462
Query: 449 SLMEGGSSFNSIGQFI 464
++ EGGSSFN + F+
Sbjct: 463 AVEEGGSSFNDLNSFM 478
>sp|Q9SK82|U85A1_ARATH UDP-glycosyltransferase 85A1 OS=Arabidopsis thaliana GN=UGT85A1
PE=1 SV=1
Length = 489
Score = 161 bits (407), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 137/480 (28%), Positives = 230/480 (47%), Gaps = 59/480 (12%)
Query: 2 KKAELIFVPSPGIGHLVSTLEFAKHLTDRDDRISVTILSMKLAVAPWVDAYTKSLTDSQP 61
+K ++ VP P GH+ + AK L R VT ++ ++ + + D P
Sbjct: 10 QKPHVVCVPYPAQGHINPMMRVAKLLHARG--FYVTFVNTVYNHNRFLRSRGSNALDGLP 67
Query: 62 RICVIDLPPVDPPLPDVLKKSPEYFISLVVESHLPNV----KNIVSSRSNSGSLQVTGLV 117
+ LP+ + + I+ + ES + N + ++ + ++ +
Sbjct: 68 SF---RFESIADGLPETDMDATQD-ITALCESTMKNCLAPFRELLQRINAGDNVPPVSCI 123
Query: 118 LDFFCVSM-VDIAKELSLPSYMFLTSN----MGFLRLMLY----LPTRQDRISTVFESSD 168
+ C+S +D+A+EL +P +F T++ + +L L+ L +D E +
Sbjct: 124 VSDGCMSFTLDVAEELGVPEVLFWTTSGCAFLAYLHFYLFIEKGLCPLKDESYLTKEYLE 183
Query: 169 DELL--IPGITSPVPVCVMPSCLFNKDGGHATL---VKLAQRFKDVDGIIVNTFHELEPY 223
D ++ IP + + V + +PS + + + ++ +R K II+NTF +LE
Sbjct: 184 DTVIDFIPTMKN-VKLKDIPSFIRTTNPDDVMISFALRETERAKRASAIILNTFDDLEHD 242
Query: 224 AVNAFSGDLNPPLYTAGPVLHLKSQPNPDLDEA------------QYQKIFQWLDDLAES 271
V+A L PP+Y+ GP LHL + N +++E + + WLD ++
Sbjct: 243 VVHAMQSIL-PPVYSVGP-LHLLA--NREIEEGSEIGMMSSNLWKEEMECLDWLDTKTQN 298
Query: 272 SVVFLCFGSSGSFDVAQVKEIAIGLERSGYNFLWSLRVSSPKDEVSAHRYVTNNGVFPEG 331
SV+++ FGS V Q+ E A GL SG FLW +R D V+ + P
Sbjct: 299 SVIYINFGSITVLSVKQLVEFAWGLAGSGKEFLWVIR----PDLVAGEE-----AMVPPD 349
Query: 332 FLERIKGRGMIWGWVPQVEILAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLN 391
FL K R M+ W PQ ++L+H AIGGF++HCGWNSILESL GVP+ WP +A+QQ+N
Sbjct: 350 FLMETKDRSMLASWCPQEKVLSHPAIGGFLTHCGWNSILESLSCGVPMVCWPFFADQQMN 409
Query: 392 AFRMVKELGLALDLRLDYRVGSDLVMAGDIESAVRCLMDGE--NKIRKKVKEMAEISRKS 449
E + ++ +G D V ++E+ VR LMDGE K+R+K E ++ K+
Sbjct: 410 CKFCCDEWDVGIE------IGGD-VKREEVEAVVRELMDGEKGKKMREKAVEWQRLAEKA 462
>sp|Q9ZQ96|U73C3_ARATH UDP-glycosyltransferase 73C3 OS=Arabidopsis thaliana GN=UGT73C3
PE=2 SV=1
Length = 496
Score = 161 bits (407), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 111/353 (31%), Positives = 184/353 (52%), Gaps = 30/353 (8%)
Query: 128 IAKELSLPSYMFLTSNMGFLRLM-LYLPTRQDRISTVFESSDDELLIPGITSPVPVCVMP 186
IAK ++P +F MG L+ +++ R I +S ++ L+P V +
Sbjct: 139 IAKNFNIPKIVF--HGMGCFNLLCMHVLRRNLEILENVKSDEEYFLVPSFPDRVEFTKLQ 196
Query: 187 SCLFNKDGG--HATLVKLAQRFKDVDGIIVNTFHELEPYAVNAFSGDLNPPLYTAGPVLH 244
+ G + ++ + G+IVNTF ELEP V + ++ +++ GPV
Sbjct: 197 LPVKANASGDWKEIMDEMVKAEYTSYGVIVNTFQELEPPYVKDYKEAMDGKVWSIGPV-S 255
Query: 245 LKSQPNPDLDEA------QYQKIFQWLDDLAESSVVFLCFGSSGSFDVAQVKEIAIGLER 298
L ++ D E + QWLD E SV+++C GS + ++Q+KE+ +GLE
Sbjct: 256 LCNKAGADKAERGSKAAIDQDECLQWLDSKEEGSVLYVCLGSICNLPLSQLKELGLGLEE 315
Query: 299 SGYNFLWSLRVSSPKDEVSAHRYVTNNGVFPEGFLERIKGRGM-IWGWVPQVEILAHKAI 357
S +F+W +R S E+ + GF ERIK RG+ I GW PQV IL+H ++
Sbjct: 316 SRRSFIWVIRGSEKYKELF-------EWMLESGFEERIKERGLLIKGWAPQVLILSHPSV 368
Query: 358 GGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKEL--GLALDLRLDYRVGSD- 414
GGF++HCGWNS LE + G+P+ TWP++ +Q N +V+ L G++ + + G +
Sbjct: 369 GGFLTHCGWNSTLEGITSGIPLITWPLFGDQFCNQKLVVQVLKAGVSAGVEEVMKWGEED 428
Query: 415 ----LVMAGDIESAVRCLM---DGENKIRKKVKEMAEISRKSLMEGGSSFNSI 460
LV ++ AV LM D + R++VKE+ E++ K++ +GGSS ++I
Sbjct: 429 KIGVLVDKEGVKKAVEELMGDSDDAKERRRRVKELGELAHKAVEKGGSSHSNI 481
>sp|D4Q9Z4|SGT2_SOYBN Soyasapogenol B glucuronide galactosyltransferase OS=Glycine max
GN=GmSGT2 PE=1 SV=1
Length = 495
Score = 160 bits (405), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 134/503 (26%), Positives = 235/503 (46%), Gaps = 70/503 (13%)
Query: 2 KKAEL--IFVPSPGIGHLVSTLEFAKHLTDRDDRISVTILSMKLAVAPWVDAYTKSLTDS 59
KK EL IF+P H++ ++ A+ D + VTI++ + + +
Sbjct: 4 KKGELKSIFLPFLSTSHIIPLVDMARLFALHD--VDVTIITTAHNATVFQKSIDLDASRG 61
Query: 60 QP-RICVIDLPPVDPPLP--------DVLKK-SPEYFISLVVESHLPNVKNIVSSRSNSG 109
+P R V++ P LP D ++ +P ++ L + ++ + +
Sbjct: 62 RPIRTHVVNFPAAQVGLPVGIEAFNVDTPREMTPRIYMGLSL------LQQVFEKLFHD- 114
Query: 110 SLQVTGLVLDFFCVSMVDIAKELSLPSYMFLTSNM----GFLRLMLYLPTRQDRISTVFE 165
LQ +V D F VD A +L +P MF ++ + Y P + + T
Sbjct: 115 -LQPDFIVTDMFHPWSVDAAAKLGIPRIMFHGASYLARSAAHSVEQYAPHLEAKFDT--- 170
Query: 166 SSDDELLIPGITSPVPVC--VMPSCLFNKDGGHATLVKLAQRFKDVDGIIVNTFHELEPY 223
D+ ++PG+ + + +P L + + + + Q K G + N+F++LE
Sbjct: 171 ---DKFVLPGLPDNLEMTRLQLPDWLRSPNQYTELMRTIKQSEKKSYGSLFNSFYDLESA 227
Query: 224 AVNAFSGDLNPPLYTAGPVLHLKSQPNPDL-------DEAQYQKIFQWLDDLAESSVVFL 276
+ + + GPV +Q D +E + + +WL+ AESSV+++
Sbjct: 228 YYEHYKSIMGTKSWGIGPVSLWANQDAQDKAARGYAKEEEEKEGWLKWLNSKAESSVLYV 287
Query: 277 CFGSSGSFDVAQVKEIAIGLERSGYNFLWSLRVSSPKDEVSAHRYVTNNGVFPEGFLERI 336
FGS F +Q+ EIA LE SG++F+W +R N+G + FLE
Sbjct: 288 SFGSINKFPYSQLVEIARALEDSGHDFIWVVR--------------KNDGGEGDNFLEEF 333
Query: 337 KGRG-------MIWGWVPQVEILAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQ 389
+ R +IWGW PQ+ IL + AIGG V+HCGWN+++ES+ G+P+ATWP++AE
Sbjct: 334 EKRMKESNKGYLIWGWAPQLLILENPAIGGLVTHCGWNTVVESVNAGLPMATWPLFAEHF 393
Query: 390 LNAFRMVKELGL-----ALDLRLDYRVGSDLVMAGDIESAVRCLMDGENK---IRKKVKE 441
N +V L + A + R GS++V +I +A+ LM E + +RK+ KE
Sbjct: 394 FNEKLVVDVLKIGVPVGAKEWRNWNEFGSEVVKREEIGNAIASLMSEEEEDGGMRKRAKE 453
Query: 442 MAEISRKSLMEGGSSFNSIGQFI 464
++ ++ ++ GGSS N++ + I
Sbjct: 454 LSVAAKSAIKVGGSSHNNMKELI 476
>sp|Q9ZQ99|U73C1_ARATH UDP-glycosyltransferase 73C1 OS=Arabidopsis thaliana GN=UGT73C1
PE=2 SV=1
Length = 491
Score = 155 bits (392), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 132/487 (27%), Positives = 223/487 (45%), Gaps = 53/487 (10%)
Query: 6 LIFVPSPGIGHLVSTLEFAKHLTDRDDRISVTILSMKLAVAPWVDAYTKSLTDSQP-RIC 64
+ P GH++ ++ A+ L R +++TI++ + + ++++ P +
Sbjct: 11 FVLFPFMAQGHMIPMVDIARLLAQRG--VTITIVTTPQNAGRFKNVLSRAIQSGLPINLV 68
Query: 65 VIDLPPVDPPLP---------DVLKKSPEYFISL-VVESHLPNVKNIVSSRSNSGSLQVT 114
+ P + P D L S +F + ++E + + + R N
Sbjct: 69 QVKFPSQESGSPEGQENLDLLDSLGASLTFFKAFSLLEEPVEKLLKEIQPRPNC------ 122
Query: 115 GLVLDFFCVSMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRISTVFESSDDELLIP 174
++ D IAK L +P +F F L ++ + ES + IP
Sbjct: 123 -IIADMCLPYTNRIAKNLGIPKIIF-HGMCCFNLLCTHIMHQNHEFLETIESDKEYFPIP 180
Query: 175 GITSPVPVC--VMPSCLFNKDGGHATLVKLAQRFKDVDGIIVNTFHELEPYAVNAFSGDL 232
V +P L D L + + G+IVNTF ELEP V +
Sbjct: 181 NFPDRVEFTKSQLPMVLVAGDW-KDFLDGMTEGDNTSYGVIVNTFEELEPAYVRDYKKVK 239
Query: 233 NPPLYTAGPVL--------HLKSQPNPDLDEAQYQKIFQWLDDLAESSVVFLCFGSSGSF 284
+++ GPV + D+D+ + +WLD E SV+++C GS +
Sbjct: 240 AGKIWSIGPVSLCNKLGEDQAERGNKADIDQDE---CIKWLDSKEEGSVLYVCLGSICNL 296
Query: 285 DVAQVKEIAIGLERSGYNFLWSLRVSSPKDEVSAHRYVTNNGVFPEGFLERIKGRGM-IW 343
++Q+KE+ +GLE S F+W +R +E+ +++ +G + ERIK RG+ I
Sbjct: 297 PLSQLKELGLGLEESQRPFIWVIRGWEKYNELL--EWISESG-----YKERIKERGLLIT 349
Query: 344 GWVPQVEILAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKEL--GL 401
GW PQ+ IL H A+GGF++HCGWNS LE + GVP+ TWP++ +Q N V+ L G+
Sbjct: 350 GWSPQMLILTHPAVGGFLTHCGWNSTLEGITSGVPLLTWPLFGDQFCNEKLAVQILKAGV 409
Query: 402 ALDLRLDYRVGSD-----LVMAGDIESAVRCLMDGEN---KIRKKVKEMAEISRKSLMEG 453
+ R G + LV ++ AV LM N + RK+VKE+ E++ K++ EG
Sbjct: 410 RAGVEESMRWGEEEKIGVLVDKEGVKKAVEELMGDSNDAKERRKRVKELGELAHKAVEEG 469
Query: 454 GSSFNSI 460
GSS ++I
Sbjct: 470 GSSHSNI 476
>sp|Q9SKC1|U74C1_ARATH UDP-glycosyltransferase 74C1 OS=Arabidopsis thaliana GN=UGT74C1
PE=2 SV=1
Length = 457
Score = 154 bits (389), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 138/496 (27%), Positives = 222/496 (44%), Gaps = 76/496 (15%)
Query: 2 KKAELIFVPSPGIGHLVSTLEFAKHLTDRDDRISVTILSMKLAVAPWV-DAYTKSLT--- 57
KK ++F P P GH+ ++ AK L+ + S I++ K P+ D Y+ ++
Sbjct: 5 KKGHVLFFPYPLQGHINPMIQLAKRLSKKG-ITSTLIIASKDHREPYTSDDYSITVHTIH 63
Query: 58 -----DSQPRICVIDLPPVDPPLPDVLKKSPEYFISLVVESHLPNVKNIVSSRSNSGSLQ 112
P +DL + +S FIS S P
Sbjct: 64 DGFFPHEHPHAKFVDLDRFH----NSTSRSLTDFISSAKLSDNPP--------------- 104
Query: 113 VTGLVLDFFCVSMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRISTVFESSDDELL 172
L+ D F +DIAK+L L Y+ +L ++Y + + ++ L
Sbjct: 105 -KALIYDPFMPFALDIAKDLDL--YVVAYFTQPWLASLVYYHINEGTYDVPVDRHENPTL 161
Query: 173 --IPGITSPVPVCV---MPSCLFNKDGG---HATLVKLAQRFKDVDGIIVNTFHELEPYA 224
PG P+ +PS K H +V+ D I+ NTF +LEP
Sbjct: 162 ASFPGF----PLLSQDDLPSFACEKGSYPLLHEFVVRQFSNLLQADCILCNTFDQLEPKV 217
Query: 225 VNAFSGDLNPPLYTAGPVLHLKSQPNPDLDEAQYQ----------KIFQWLDDLAESSVV 274
V + P+ GPV+ K N ++ Y+ + +WL + SVV
Sbjct: 218 VKWMNDQW--PVKNIGPVVPSKFLDNRLPEDKDYELENSKTEPDESVLKWLGNRPAKSVV 275
Query: 275 FLCFGSSGSFDVAQVKEIAIGLERSGYNFLWSLRVSSPKDEVSAHRYVTNNGVFPEGFLE 334
++ FG+ + Q+KEIA+ + ++GY+FLWS+R S P GF+E
Sbjct: 276 YVAFGTLVALSEKQMKEIAMAISQTGYHFLWSVRES-------------ERSKLPSGFIE 322
Query: 335 RI--KGRGMIWGWVPQVEILAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNA 392
K G++ WVPQ+E+LAH++IG FVSHCGWNS LE+L GVP+ P + +Q NA
Sbjct: 323 EAEEKDSGLVAKWVPQLEVLAHESIGCFVSHCGWNSTLEALCLGVPMVGVPQWTDQPTNA 382
Query: 393 FRMVKELGLALDLRLDYRVGSDLVMAGDIESAVRCLMDGE--NKIRKKVKEMAEISRKSL 450
+ + + +R D G L +I + +M+GE +IRK V+++ ++R+++
Sbjct: 383 KFIEDVWKIGVRVRTD---GEGLSSKEEIARCIVEVMEGERGKEIRKNVEKLKVLAREAI 439
Query: 451 MEGGSSFNSIGQFISL 466
EGGSS I +F++L
Sbjct: 440 SEGGSSDKKIDEFVAL 455
>sp|Q9LMF1|U85A3_ARATH UDP-glycosyltransferase 85A3 OS=Arabidopsis thaliana GN=UGT85A3
PE=2 SV=2
Length = 488
Score = 154 bits (389), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 128/419 (30%), Positives = 195/419 (46%), Gaps = 72/419 (17%)
Query: 55 SLTDSQPRICVIDLPPVDPPLPDVLKKSPEYFISLVVESHLPNVKNIVSSRSNSGSLQVT 114
+L++S + C++ P +L++ +V +P V IVS S S +L
Sbjct: 91 ALSESTTKNCLV-------PFKKLLQR-------IVTREDVPPVSCIVSDGSMSFTL--- 133
Query: 115 GLVLDFFCVSMVDIAKELSLPSYMFLTSN----MGFLRLMLYLPTR----QDRISTVFES 166
D+A+EL +P F T++ M +L L++ +D E
Sbjct: 134 ------------DVAEELGVPEIHFWTTSACGFMAYLHFYLFIEKGLCPVKDASCLTKEY 181
Query: 167 SDDEL-LIPGITSPVPVCVMPSCLFNKDGGHATL---VKLAQRFKDVDGIIVNTFHELEP 222
D + IP + + V + +PS + + L V+ A R K II+NTF +LE
Sbjct: 182 LDTVIDWIPSMNN-VKLKDIPSFIRTTNPNDIMLNFVVREACRTKRASAIILNTFDDLEH 240
Query: 223 YAVNAFSGDLNPPLYTAGPVLHLKSQPNPDLDE----------AQYQKIFQWLDDLAESS 272
+ + L PP+Y GP LHL + D + + WL+ + +S
Sbjct: 241 DIIQSMQSIL-PPVYPIGP-LHLLVNREIEEDSEIGRMGSNLWKEETECLGWLNTKSRNS 298
Query: 273 VVFLCFGSSGSFDVAQVKEIAIGLERSGYNFLWSLRVSSPKDEVSAHRYVTNNGVFPEGF 332
VV++ FGS AQ+ E A GL +G FLW +R S E V P+ F
Sbjct: 299 VVYVNFGSITIMTTAQLLEFAWGLAATGKEFLWVMRPDSVAGE---------EAVIPKEF 349
Query: 333 LERIKGRGMIWGWVPQVEILAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNA 392
L R M+ W PQ ++L+H A+GGF++HCGWNS LESL GVP+ WP +AEQQ N
Sbjct: 350 LAETADRRMLTSWCPQEKVLSHPAVGGFLTHCGWNSTLESLSCGVPMVCWPFFAEQQTNC 409
Query: 393 FRMVKELGLALDLRLDYRVGSDLVMAGDIESAVRCLMDGE--NKIRKKVKEMAEISRKS 449
E + ++ +G D V G++E+ VR LMDGE K+R+K E ++ K+
Sbjct: 410 KFSCDEWEVGIE------IGGD-VKRGEVEAVVRELMDGEKGKKMREKAVEWRRLAEKA 461
>sp|Q9ZQ98|U73C2_ARATH UDP-glycosyltransferase 73C2 OS=Arabidopsis thaliana GN=UGT73C2
PE=3 SV=1
Length = 496
Score = 153 bits (387), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 125/479 (26%), Positives = 226/479 (47%), Gaps = 36/479 (7%)
Query: 6 LIFVPSPGIGHLVSTLEFAKHLTDRDDRISVTILSMKLAVAPWVDAYTKSLTDS-QPRIC 64
+ P GH++ ++ A+ L R +++TI++ A + D +++ R+
Sbjct: 15 FVLFPFMAQGHMIPMVDIARILAQRG--VTITIVTTPHNAARFKDVLNRAIQSGLHIRVE 72
Query: 65 VIDLPPVDPPLPDVLKKSPEYFISLVVESHLPNVKNI----VSSRSNSGSLQVTGLVLDF 120
+ P + L + +++ ++ S+ + H N+ V + + L+ DF
Sbjct: 73 HVKFPFQEAGLQEG-QENVDFLDSMELMVHFFKAVNMLENPVMKLMEEMKPKPSCLISDF 131
Query: 121 FCVSMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRISTVFESSDDELLIPGITSPV 180
IAK ++P +F + F L +++ R I +S + L+P V
Sbjct: 132 CLPYTSKIAKRFNIPKIVFHGVSC-FCLLSMHILHRNHNILHALKSDKEYFLVPSFPDRV 190
Query: 181 PVCVMPSCLFNKDGGHATLVKLAQRFKDVD--GIIVNTFHELEPYAVNAFSGDLNPPLYT 238
+ + G + Q D G+IVNTF +LE V ++ +++
Sbjct: 191 EFTKLQVTVKTNFSGDWKEIMDEQVDADDTSYGVIVNTFQDLESAYVKNYTEARAGKVWS 250
Query: 239 AGPVLHLKSQPNPDLDEA------QYQKIFQWLDDLAESSVVFLCFGSSGSFDVAQVKEI 292
GPV L ++ D E + +WLD SV+++C GS + +AQ++E+
Sbjct: 251 IGPV-SLCNKVGEDKAERGNKAAIDQDECIKWLDSKDVESVLYVCLGSICNLPLAQLREL 309
Query: 293 AIGLERSGYNFLWSLRVSSPKDEVSAHRYVTNNGVFPEGFLERIKGRGM-IWGWVPQVEI 351
+GLE + F+W +R E++ + GF ER K R + I GW PQ+ I
Sbjct: 310 GLGLEATKRPFIWVIRGGGKYHELA-------EWILESGFEERTKERSLLIKGWSPQMLI 362
Query: 352 LAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKEL--GLALDLRLDY 409
L+H A+GGF++HCGWNS LE + GVP+ TWP++ +Q N +V+ L G+++ +
Sbjct: 363 LSHPAVGGFLTHCGWNSTLEGITSGVPLITWPLFGDQFCNQKLIVQVLKAGVSVGVEEVM 422
Query: 410 RVGSD-----LVMAGDIESAVRCLM---DGENKIRKKVKEMAEISRKSLMEGGSSFNSI 460
+ G + LV ++ AV +M D + RK+V+E+ E++ K++ EGGSS ++I
Sbjct: 423 KWGEEESIGVLVDKEGVKKAVDEIMGESDEAKERRKRVRELGELAHKAVEEGGSSHSNI 481
>sp|Q9SY84|U90A2_ARATH UDP-glycosyltransferase 90A2 OS=Arabidopsis thaliana GN=UGT90A2
PE=2 SV=1
Length = 467
Score = 153 bits (386), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 132/489 (26%), Positives = 225/489 (46%), Gaps = 55/489 (11%)
Query: 1 MKKAELIFVPSPGIGHLVSTLEFAKHLTDRD--DRISVTILSMKLAVAPWVDAYTKSLTD 58
++K ++ P GH++ L+ A+ L ISVT+ + L VD+ + +
Sbjct: 3 LEKVHVVLFPYLSKGHMIPMLQLARLLLSHSFAGDISVTVFTTPLNRPFIVDSLSGT--- 59
Query: 59 SQPRICVIDLP-----PVDPPLPDVLKKSPEYFISLVVESHLPNVKNIVSSRSNSGSL-Q 112
+ ++D+P P PP + K P SL V SL +
Sbjct: 60 ---KATIVDVPFPDNVPEIPPGVECTDKLPALSSSLFVPFTRATKSMQADFERELMSLPR 116
Query: 113 VTGLVLDFFCVSMVDIAKELSLPSYMFLTSNMGFLRLMLYLPTRQDRISTVFESSDDELL 172
V+ +V D F + A++L P +F N ++ Q+++ + +S + +
Sbjct: 117 VSFMVSDGFLWWTQESARKLGFPRLVFFGMNCA--STVICDSVFQNQLLSNVKSETEPVS 174
Query: 173 IPGITSPVPVCVMPSCLFNKD---------GGHATLVKLAQRFKDVDGIIVNTFHELEPY 223
+P P + C F KD G ++ GII NTF +LEP
Sbjct: 175 VPEF----PWIKVRKCDFVKDMFDPKTTTDPGFKLILDQVTSMNQSQGIIFNTFDDLEPV 230
Query: 224 AVNAFSGDLNPPLYTAGPVLHLKSQPNPDLDEAQYQKIFQWLDDLAES--SVVFLCFGSS 281
++ + L+ GP+ ++ + + +++E +WLD+ + +V+++ FGS
Sbjct: 231 FIDFYKRKRKLKLWAVGPLCYVNNFLDDEVEEKVKPSWMKWLDEKRDKGCNVLYVAFGSQ 290
Query: 282 GSFDVAQVKEIAIGLERSGYNFLWSLRVSSPKDEVSAHRYVTNNGVFPEGFLERIKGRGM 341
Q++EIA+GLE S NFLW V +GF ER+ RGM
Sbjct: 291 AEISREQLEEIALGLEESKVNFLW----------------VVKGNEIGKGFEERVGERGM 334
Query: 342 IW--GWVPQVEILAHKAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFRMVKEL 399
+ WV Q +IL H+++ GF+SHCGWNS+ ES+ VPI +P+ AEQ LNA +V+EL
Sbjct: 335 MVRDEWVDQRKILEHESVRGFLSHCGWNSLTESICSEVPILAFPLAAEQPLNAILVVEEL 394
Query: 400 GLALDLRLDYRVGSDLVMAGDIESAVRCLMDGE--NKIRKKVKEMAEISRKSLMEG-GSS 456
+A + +V +I V+ LM+GE ++R+ V+ ++++K+L EG GSS
Sbjct: 395 RVAERV---VAASEGVVRREEIAEKVKELMEGEKGKELRRNVEAYGKMAKKALEEGIGSS 451
Query: 457 FNSIGQFIS 465
++ I+
Sbjct: 452 RKNLDNLIN 460
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.321 0.138 0.416
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 173,313,836
Number of Sequences: 539616
Number of extensions: 7347197
Number of successful extensions: 17763
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 234
Number of HSP's successfully gapped in prelim test: 14
Number of HSP's that attempted gapping in prelim test: 16975
Number of HSP's gapped (non-prelim): 294
length of query: 468
length of database: 191,569,459
effective HSP length: 121
effective length of query: 347
effective length of database: 126,275,923
effective search space: 43817745281
effective search space used: 43817745281
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 63 (28.9 bits)
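The effective-length figures in the statistics block above can be cross-checked with simple arithmetic. The sketch below is not BLAST's own code; it assumes the usual relationship in which the effective HSP length is subtracted once from the query and once per database sequence, and the function name `effective_lengths` is a hypothetical helper introduced only for illustration. All input values are copied from this report.

```python
# Values copied from the header and statistics footer of this report.
QUERY_LENGTH = 468            # "length of query"
DB_LETTERS = 191_569_459      # "length of database"
DB_SEQUENCES = 539_616        # "Number of Sequences"
EFFECTIVE_HSP_LENGTH = 121    # "effective HSP length"


def effective_lengths(query_len, db_len, n_seqs, hsp_len):
    """Return (effective query length, effective database length, search space).

    Hypothetical helper: assumes the HSP length correction is applied once
    to the query and once to every database sequence.
    """
    eff_query = query_len - hsp_len
    eff_db = db_len - n_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


eff_query, eff_db, search_space = effective_lengths(
    QUERY_LENGTH, DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH
)
print("effective length of query:   ", eff_query)
print("effective length of database:", eff_db)
print("effective search space:      ", search_space)
```

Under these assumptions the three computed values agree with the "effective length of query", "effective length of database", and "effective search space" lines reported above.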