BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 042987
(481 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q4R1I9|ANGLT_ROSHC Anthocyanidin 5,3-O-glucosyltransferase OS=Rosa hybrid cultivar
GN=RhGT1 PE=2 SV=1
Length = 473
Score = 446 bits (1146), Expect = e-124, Method: Compositional matrix adjust.
Identities = 236/483 (48%), Positives = 314/483 (65%), Gaps = 25/483 (5%)
Query: 3 DTIVFYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTA--------PFVSSAGTD--D 52
D IV Y PG GHL SMVELGKL+LT+HP FSI I+ TA V+S+ +
Sbjct: 4 DAIVLYPYPGLGHLISMVELGKLLLTHHPSFSITILASTAPTTIAATAKLVASSNDQLTN 63
Query: 53 YIASVSATAPSVTFHQLPPPVSGLPDTLRSPADFPALVYELGELNNPKLHETLITISKRS 112
YI +VSA P++ FH LP +S LP+ + + P +E L P + + L T+ +S
Sbjct: 64 YIKAVSADNPAINFHHLPT-ISSLPEHIEK-LNLP---FEYARLQIPNILQVLQTL--KS 116
Query: 113 NLKAFVIDFFCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGST 172
+LKA ++D FC+ F V+ L+IPT+Y++T+AG LA L +PT H+ TT S + G
Sbjct: 117 SLKALILDMFCDALFDVTKD-LNIPTFYFYTSAGRSLAVLLNIPTFHR-TTNSLSDFGDV 174
Query: 173 LLNFPGFPPFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGVIVNTFELLEERAIKAML 232
++ G PP P M + DR YK + T MAKS G+I+NTF+LLEERA+KA+
Sbjct: 175 PISISGMPPIPVSAMPKLLFDRSTNFYKSFLSTSTHMAKSNGIILNTFDLLEERALKALR 234
Query: 233 EGQCTPGETSPPLYCIGPVVGRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSS 292
G C P + +PP++ +GP++ +G+N D HE L WL+++P SV+FLCFGS+G FS
Sbjct: 235 AGLCLPNQPTPPIFTVGPLISGKSGDN---DEHESLKWLNNQPKDSVVFLCFGSMGVFSI 291
Query: 293 KQLKEMAIGLERSGVKFLWVVRAPAPDSIE-NRSSLESLLPEGFLDRTKDRGLVVESWAP 351
KQL+ MA+GLE+SG +FLWVVR P + + SLE +LP+GF++RTKDRGLVV WAP
Sbjct: 292 KQLEAMALGLEKSGQRFLWVVRNPPIEELPVEEPSLEEILPKGFVERTKDRGLVVRKWAP 351
Query: 352 QVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVVEEMKVGLAVTR 411
QVEVL+H+SVGGFVTHCGWNSVLE VC GVPM+AWPLYAEQK+ R +VEEMKV + V
Sbjct: 352 QVEVLSHDSVGGFVTHCGWNSVLEAVCNGVPMVAWPLYAEQKLGRVFLVEEMKVAVGVKE 411
Query: 412 SEEKDRLVSAAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSSRVALDNLVES 471
SE VSA ELE+RV ELMDSE G ++ R A +GGSS +L L +
Sbjct: 412 SETG--FVSADELEKRVRELMDSESGDEIRGRVSEFSNGGVKAKEEGGSSVASLAKLAQL 469
Query: 472 FKR 474
+K+
Sbjct: 470 WKQ 472
>sp|Q9LK73|U88A1_ARATH UDP-glycosyltransferase 88A1 OS=Arabidopsis thaliana GN=UGT88A1
PE=2 SV=1
Length = 462
Score = 402 bits (1032), Expect = e-111, Method: Compositional matrix adjust.
Identities = 215/473 (45%), Positives = 318/473 (67%), Gaps = 18/473 (3%)
Query: 2 KDTIVFYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVSSAGTDDYIASVSATA 61
++ IV Y +P GHL SMVELGK IL+ +P SI II+ P+ T YI+SVS++
Sbjct: 3 EEAIVLYPAPPIGHLVSMVELGKTILSKNPSLSIHIILVPPPY-QPESTATYISSVSSSF 61
Query: 62 PSVTFHQLPPPVSGLPDTLRSPADFPALVYELGELNNPKLHETLITISKRSNLKAFVIDF 121
PS+TFH LP V+ + S +L+ E+ +NP +H TL ++S+ N++A +IDF
Sbjct: 62 PSITFHHLPA-VTPYSSSSTSRHHHESLLLEILCFSNPSVHRTLFSLSRNFNVRAMIIDF 120
Query: 122 FCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTT-KSFRELGSTLLNFPGFP 180
FC +++ + P Y+++T+ + LA + YLPT+ + T K+ +++ + ++ PG P
Sbjct: 121 FCTAVLDITAD-FTFPVYFFYTSGAACLAFSFYLPTIDETTPGKNLKDIPT--VHIPGVP 177
Query: 181 PFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGVIVNTFELLEERAIKAMLEGQCTPGE 240
P DM + +R+ +VY + G Q++KS+G+I+NTF+ LE RAIKA+ E C
Sbjct: 178 PMKGSDMPKAVLERDDEVYDVFIMFGKQLSKSSGIIINTFDALENRAIKAITEELCFRN- 236
Query: 241 TSPPLYCIGPVVGRGNGENRGRDRH-ECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEMA 299
+Y IGP++ G E+R ++ CL+WLDS+P +SV+FLCFGSLG FS +Q+ E+A
Sbjct: 237 ----IYPIGPLIVNGRIEDRNDNKAVSCLNWLDSQPEKSVVFLCFGSLGLFSKEQVIEIA 292
Query: 300 IGLERSGVKFLWVVRAPAPDSIENRSSLESLLPEGFLDRTKDRGLVVESWAPQVEVLNHE 359
+GLE+SG +FLWVVR P P+ + L+SLLPEGFL RT+D+G+VV+SWAPQV VLNH+
Sbjct: 293 VGLEKSGQRFLWVVRNP-PELEKTELDLKSLLPEGFLSRTEDKGMVVKSWAPQVPVLNHK 351
Query: 360 SVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVVEEMKVGLAVTRSEEKDRLV 419
+VGGFVTHCGWNS+LE VCAGVPM+AWPLYAEQ+ R ++V+E+K+ +++ SE V
Sbjct: 352 AVGGFVTHCGWNSILEAVCAGVPMVAWPLYAEQRFNRVMIVDEIKIAISMNESETG--FV 409
Query: 420 SAAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSSRVALDNLVESF 472
S+ E+E+RV E++ V+ER +AMK AA A+ + GSS AL L++S+
Sbjct: 410 SSTEVEKRVQEIIGE---CPVRERTMAMKNAAELALTETGSSHTALTTLLQSW 459
>sp|Q33DV3|4CGT_ANTMA Chalcone 4'-O-glucosyltransferase OS=Antirrhinum majus PE=1 SV=1
Length = 457
Score = 370 bits (950), Expect = e-101, Method: Compositional matrix adjust.
Identities = 211/473 (44%), Positives = 292/473 (61%), Gaps = 27/473 (5%)
Query: 4 TIVFYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVSSAGTDDYIASVSATAPS 63
TIVF+TS HLNS + L K I T H II TAP SS +A + PS
Sbjct: 10 TIVFHTS--EEHLNSSIALAKFI-TKHHSSISITIISTAPAESSE-----VAKI-INNPS 60
Query: 64 VTFHQLPPPVSGLPDTLRSPADFP--ALVYELGELNNPKLHETLITISKRSNLKAFVIDF 121
+T+ L LP+ L S + L +E+ L N L E L+ IS++S++KA +IDF
Sbjct: 61 ITYRGLT--AVALPENLTSNINKNPVELFFEIPRLQNANLREALLDISRKSDIKALIIDF 118
Query: 122 FCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSTLLNFPGFPP 181
FCN AF+VS+S ++IPTY+ + +L L+ PTLH+ +L ++ PGFP
Sbjct: 119 FCNAAFEVSTS-MNIPTYFDVSGGAFLLCTFLHHPTLHQTVRGDIADLNDSV-EMPGFPL 176
Query: 182 FPARDMALPMHDREGKVYKGLVDTGIQMAKSAGVIVNTFELLEERAIKAMLEGQCTPGET 241
+ D+ + + R+ VYK +DT + M KS+G++VNTF LE RA +A+ G P
Sbjct: 177 IHSSDLPMSLFYRKTNVYKHFLDTSLNMRKSSGILVNTFVALEFRAKEALSNGLYGP--- 233
Query: 242 SPPLYCIGPVVGRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEMAIG 301
+PPLY + + + ++HECLSWLD +PS+SV+FLCFG G+FS++QLKE+AIG
Sbjct: 234 TPPLYLLSHTIAEPHDTKVLVNQHECLSWLDLQPSKSVIFLCFGRRGAFSAQQLKEIAIG 293
Query: 302 LERSGVKFLWVVRAPAPDSIENRSSLESLLPEGFLDRTKDRGLVVESWAPQVEVLNHESV 361
LE+SG +FLW+ R I L +LLPEGFL RTK G V +W PQ EVL+H++V
Sbjct: 294 LEKSGCRFLWLAR------ISPEMDLNALLPEGFLSRTKGVGFVTNTWVPQKEVLSHDAV 347
Query: 362 GGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVVEEMKVGLAVTRSEEKDRLVSA 421
GGFVTHCGW+SVLE + GVPM+ WPLYAEQ++ R +VEE+KV L + +E+D V+A
Sbjct: 348 GGFVTHCGWSSVLEALSFGVPMIGWPLYAEQRINRVFMVEEIKVALPL---DEEDGFVTA 404
Query: 422 AELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSSRVALDNLVESFKR 474
ELE+RV ELM+S KG+ VK R +K + AA+ GGSS +L+ + S R
Sbjct: 405 MELEKRVRELMESVKGKEVKRRVAELKISTKAAVSKGGSSLASLEKFINSVTR 457
>sp|Q76MR7|UBGAT_SCUBA Baicalein 7-O-glucuronosyltransferase OS=Scutellaria baicalensis
GN=UBGAT-I PE=1 SV=1
Length = 441
Score = 352 bits (904), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 197/446 (44%), Positives = 278/446 (62%), Gaps = 21/446 (4%)
Query: 19 MVELGKLILTYHPCFSIDIIIPTAPFVSSAGTDDYIASVSATAPSVTFHQLPPPVSGLPD 78
M L K I HP I III AP ++A A PS+++H+LP P +P
Sbjct: 1 MAVLAKFISKNHPSVPI-IIISNAPESAAASV--------AAIPSISYHRLPLP--EIPP 49
Query: 79 TLRSPADFPALVYELGELNNPKLHETLITISKRSNLKAFVIDFFCNPAFQVSSSTLSIPT 138
+ + D L +EL L+NP L L IS+++ ++A ++DFFCN AF+V +S L+IPT
Sbjct: 50 DMTT--DRVELFFELPRLSNPNLLTALQQISQKTRIRAVILDFFCNAAFEVPTS-LNIPT 106
Query: 139 YYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSTLLNFPGFPPFPARDMALPMHDREGKV 198
YYYF+ LY T+ + ++L ++ PG PP D+ + + R+ V
Sbjct: 107 YYYFSAGTPTAILTLYFETIDETIPVDLQDLND-YVDIPGLPPIHCLDIPVALSPRKSLV 165
Query: 199 YKGLVDTGIQMAKSAGVIVNTFELLEERAIKAMLEGQCTPGETSPPLYCIGPVVGRGNGE 258
YK VD + +SAG++VN F+ LE RAI + + +PP+Y IGP+VG + +
Sbjct: 166 YKSSVDISKNLRRSAGILVNGFDALEFRAIGSHSQRPMHFKGPTPPVYFIGPLVGDVDTK 225
Query: 259 NRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEMAIGLERSGVKFLWVVRAP-- 316
G + HECL WLD++PS+SV+FLCFG G FS+KQLKE A LE SG +FLW VR P
Sbjct: 226 A-GSEEHECLRWLDTQPSKSVVFLCFGRRGVFSAKQLKETAAALENSGHRFLWSVRNPPE 284
Query: 317 -APDSIENRSSLESLLPEGFLDRTKDRGLVVESWAPQVEVLNHESVGGFVTHCGWNSVLE 375
+ + L+ LLPEGFL+RTKDRG V++SWAPQ EVL H+SVGGFVTHCG +SV E
Sbjct: 285 LKKATGSDEPDLDELLPEGFLERTKDRGFVIKSWAPQKEVLAHDSVGGFVTHCGRSSVSE 344
Query: 376 GVCAGVPMLAWPLYAEQKMIRAVVVEEMKVGLAVTRSEEKDRLVSAAELEQRVSELMDSE 435
GV GVPM+ WP+ AE ++ RAV+V++++V L + EE V+AAELE+RV ELM+++
Sbjct: 345 GVWFGVPMIGWPVDAELRLNRAVMVDDLQVALPL--EEEAGGFVTAAELEKRVRELMETK 402
Query: 436 KGRAVKERAVAMKEAAAAAMRDGGSS 461
G+AV++R +K +A AA+ + GSS
Sbjct: 403 AGKAVRQRVTELKLSARAAVAENGSS 428
>sp|Q9LNI1|U72B3_ARATH UDP-glycosyltransferase 72B3 OS=Arabidopsis thaliana GN=UGT72B3
PE=2 SV=1
Length = 481
Score = 291 bits (746), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 180/478 (37%), Positives = 266/478 (55%), Gaps = 28/478 (5%)
Query: 5 IVFYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVSSAGTDDYIASVSATAPSV 64
+ SPG GHL +VEL K +L H F++ IIP S A SV + PS
Sbjct: 9 VAIIPSPGIGHLIPLVELAKRLLDNH-GFTVTFIIPGDSPPSKAQR-----SVLNSLPSS 62
Query: 65 TFHQLPPP--VSGLPDTLRSPADFPALVYELGELNNPKLHETLITISKRSNLKA-FVIDF 121
PP +S +P T R V +NP L E ++S L A V+D
Sbjct: 63 IASVFLPPADLSDVPSTARIETRISLTVTR----SNPALRELFGSLSAEKRLPAVLVVDL 118
Query: 122 FCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSTLLNFPGFPP 181
F AF V++ + Y ++ + +VL L+LP L + + FREL ++ PG P
Sbjct: 119 FGTDAFDVAAE-FHVSPYIFYASNANVLTFLLHLPKLDETVSCEFRELTEPVI-IPGCVP 176
Query: 182 FPARDMALPMHDREGKVYKGLVDTGIQMAKSAGVIVNTFELLEERAIKAMLEGQCTPGET 241
+D P DR+ + YK L+ + ++ G++VN+F LE IK + E P
Sbjct: 177 ITGKDFVDPCQDRKDESYKWLLHNVKRFKEAEGILVNSFVDLEPNTIKIVQE----PAPD 232
Query: 242 SPPLYCIGPVVGRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEMAIG 301
PP+Y IGP+V G+ + D ++CL+WLD++P SVL++ FGS G+ + +Q E+A+G
Sbjct: 233 KPPVYLIGPLVNSGSHDADVNDEYKCLNWLDNQPFGSVLYVSFGSGGTLTFEQFIELALG 292
Query: 302 LERSGVKFLWVVRAP---APDSIEN---RSSLESLLPEGFLDRTKDRGLVVESWAPQVEV 355
L SG +FLWV+R+P A S N R+ S LP+GFLDRTK++GLVV SWAPQ ++
Sbjct: 293 LAESGKRFLWVIRSPSGIASSSYFNPQSRNDPFSFLPQGFLDRTKEKGLVVGSWAPQAQI 352
Query: 356 LNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVVEEMKVGLAVTRSEEK 415
L H S+GGF+THCGWNS LE + GVP++AWPLYAEQKM ++V+ VG A+ +
Sbjct: 353 LTHTSIGGFLTHCGWNSSLESIVNGVPLIAWPLYAEQKMNALLLVD---VGAALRARLGE 409
Query: 416 DRLVSAAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSSRVALDNLVESFK 473
D +V E+ + V L++ E+G AV+++ +KE + +RD G S +L+ + +K
Sbjct: 410 DGVVGREEVARVVKGLIEGEEGNAVRKKMKELKEGSVRVLRDDGFSTKSLNEVSLKWK 467
>sp|Q9M156|U72B1_ARATH UDP-glycosyltransferase 72B1 OS=Arabidopsis thaliana GN=UGT72B1
PE=1 SV=1
Length = 480
Score = 288 bits (736), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 176/470 (37%), Positives = 256/470 (54%), Gaps = 27/470 (5%)
Query: 5 IVFYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVSSAGTDDYIASVSATAPSV 64
+ SPG GHL +VE K ++ H +I P S + S+ ++ SV
Sbjct: 9 VAIIPSPGMGHLIPLVEFAKRLVHLHGLTVTFVIAGEGP--PSKAQRTVLDSLPSSISSV 66
Query: 65 TFHQLPPPVSGLPDTLRSPADFPALVYELGELNNPKLHETLITISKRSNL-KAFVIDFFC 123
PPV L D L S + + +NP+L + + + L A V+D F
Sbjct: 67 FL----PPVD-LTD-LSSSTRIESRISLTVTRSNPELRKVFDSFVEGGRLPTALVVDLFG 120
Query: 124 NPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSTLLNFPGFPPFP 183
AF V+ +P Y ++ T +VL+ L+LP L + + FREL L+ PG P
Sbjct: 121 TDAFDVAVE-FHVPPYIFYPTTANVLSFFLHLPKLDETVSCEFRELTEPLM-LPGCVPVA 178
Query: 184 ARDMALPMHDREGKVYKGLVDTGIQMAKSAGVIVNTFELLEERAIKAMLEGQCTPGETSP 243
+D P DR+ YK L+ + ++ G++VNTF LE AIKA+ E PG P
Sbjct: 179 GKDFLDPAQDRKDDAYKWLLHNTKRYKEAEGILVNTFFELEPNAIKALQE----PGLDKP 234
Query: 244 PLYCIGPVVGRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEMAIGLE 303
P+Y +GP+V G E + + ECL WLD++P SVL++ FGS G+ + +QL E+A+GL
Sbjct: 235 PVYPVGPLVNIGKQEAKQTEESECLKWLDNQPLGSVLYVSFGSGGTLTCEQLNELALGLA 294
Query: 304 RSGVKFLWVVRAPAPDSIENRSSLES--------LLPEGFLDRTKDRGLVVESWAPQVEV 355
S +FLWV+R+P+ I N S +S LP GFL+RTK RG V+ WAPQ +V
Sbjct: 295 DSEQRFLWVIRSPS--GIANSSYFDSHSQTDPLTFLPPGFLERTKKRGFVIPFWAPQAQV 352
Query: 356 LNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVVEEMKVGLAVTRSEEK 415
L H S GGF+THCGWNS LE V +G+P++AWPLYAEQKM ++ E+++ L +
Sbjct: 353 LAHPSTGGFLTHCGWNSTLESVVSGIPLIAWPLYAEQKMNAVLLSEDIRAALRPRAGD-- 410
Query: 416 DRLVSAAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSSRVAL 465
D LV E+ + V LM+ E+G+ V+ + +KEAA ++D G+S AL
Sbjct: 411 DGLVRREEVARVVKGLMEGEEGKGVRNKMKELKEAACRVLKDDGTSTKAL 460
>sp|Q9AR73|HQGT_RAUSE Hydroquinone glucosyltransferase OS=Rauvolfia serpentina GN=AS PE=1
SV=1
Length = 470
Score = 276 bits (706), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 175/473 (36%), Positives = 252/473 (53%), Gaps = 29/473 (6%)
Query: 5 IVFYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVSSAGTDDYIASVSATAPSV 64
I +PG GHL +VE K ++ H F + IIPT + A S P+
Sbjct: 7 IAMVPTPGMGHLIPLVEFAKRLVLRH-NFGVTFIIPTDGPLPKAQK-----SFLDALPAG 60
Query: 65 TFHQLPPPVS--GLPDTLRSPADFPALVYELGELNNPKLHETLITISKRSNLKAFVIDFF 122
+ L PPVS LP +R + + P + + + T+ + L A V+D F
Sbjct: 61 VNYVLLPPVSFDDLPADVRIETRICLTITR----SLPFVRDAVKTLLATTKLAALVVDLF 116
Query: 123 CNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSTLLNFPGFPPF 182
AF V+ + Y ++ T L+ +LP L + + +R++ L PG P
Sbjct: 117 GTDAFDVAIE-FKVSPYIFYPTTAMCLSLFFHLPKLDQMVSCEYRDVPEPL-QIPGCIPI 174
Query: 183 PARDMALPMHDREGKVYKGLVDTGIQMAKSAGVIVNTFELLEERAIKAMLEGQCTPGETS 242
+D P DR+ YK L+ + + G++VNTF LE +KA+ E +
Sbjct: 175 HGKDFLDPAQDRKNDAYKCLLHQAKRYRLAEGIMVNTFNDLEPGPLKALQEED----QGK 230
Query: 243 PPLYCIGPVVGRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEMAIGL 302
PP+Y IGP++ R + ++ D ECL WLD +P SVLF+ FGS G+ S Q E+A+GL
Sbjct: 231 PPVYPIGPLI-RADSSSK-VDDCECLKWLDDQPRGSVLFISFGSGGAVSHNQFIELALGL 288
Query: 303 ERSGVKFLWVVRAPAPD-------SIENRSSLESLLPEGFLDRTKDRGLVVESWAPQVEV 355
E S +FLWVVR+P SI+N++ + LPEGFL+RTK R L+V SWAPQ E+
Sbjct: 289 EMSEQRFLWVVRSPNDKIANATYFSIQNQNDALAYLPEGFLERTKGRCLLVPSWAPQTEI 348
Query: 356 LNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVVEEMKVGLAVTRSEEK 415
L+H S GGF+THCGWNS+LE V GVP++AWPLYAEQKM ++ E +KV L E
Sbjct: 349 LSHGSTGGFLTHCGWNSILESVVNGVPLIAWPLYAEQKMNAVMLTEGLKVALRPKAGE-- 406
Query: 416 DRLVSAAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSSRVALDNL 468
+ L+ E+ V LM+ E+G+ + +K+AA+ A+ D GSS AL L
Sbjct: 407 NGLIGRVEIANAVKGLMEGEEGKKFRSTMKDLKDAASRALSDDGSSTKALAEL 459
>sp|Q8W4C2|U72B2_ARATH UDP-glycosyltransferase 72B2 OS=Arabidopsis thaliana GN=UGT72B2
PE=2 SV=1
Length = 480
Score = 273 bits (697), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 178/479 (37%), Positives = 260/479 (54%), Gaps = 29/479 (6%)
Query: 5 IVFYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIP--TAPFVSSAGTDDYIASVSATAP 62
I SPG GHL VEL K L H CF++ +II T+P S + S+ ++
Sbjct: 9 IAIMPSPGMGHLIPFVELAKR-LVQHDCFTVTMIISGETSP---SKAQRSVLNSLPSSIA 64
Query: 63 SVTFHQLPPP-VSGLPDTLRSPADFPALVYELGELNNPKLHETLITISKRSNLKA-FVID 120
SV LPP +S +P T R + +NP L E ++S + +L A V+D
Sbjct: 65 SVF---LPPADLSDVPSTARIETRAMLTMTR----SNPALRELFGSLSTKKSLPAVLVVD 117
Query: 121 FFCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSTLLNFPGFP 180
F AF V+ + Y ++ + +VL+ L+LP L K + FR L + L PG
Sbjct: 118 MFGADAFDVAVD-FHVSPYIFYASNANVLSFFLHLPKLDKTVSCEFRYL-TEPLKIPGCV 175
Query: 181 PFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGVIVNTFELLEERAIKAMLEGQCTPGE 240
P +D + DR YK L+ + ++ G++VN+F LE AIKA+ E P
Sbjct: 176 PITGKDFLDTVQDRNDDAYKLLLHNTKRYKEAKGILVNSFVDLESNAIKALQE----PAP 231
Query: 241 TSPPLYCIGPVVGRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEMAI 300
P +Y IGP+V + D+ CLSWLD++P SVL++ FGS G+ + +Q E+AI
Sbjct: 232 DKPTVYPIGPLVNTSSSNVNLEDKFGCLSWLDNQPFGSVLYISFGSGGTLTCEQFNELAI 291
Query: 301 GLERSGVKFLWVVRAPA---PDSIENRSSLE---SLLPEGFLDRTKDRGLVVESWAPQVE 354
GL SG +F+WV+R+P+ S N S S LP GFLDRTK++GLVV SWAPQV+
Sbjct: 292 GLAESGKRFIWVIRSPSEIVSSSYFNPHSETDPFSFLPIGFLDRTKEKGLVVPSWAPQVQ 351
Query: 355 VLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVVEEMKVGLAVTRSEE 414
+L H S GF+THCGWNS LE + GVP++AWPL+AEQKM ++VE++ L + E
Sbjct: 352 ILAHPSTCGFLTHCGWNSTLESIVNGVPLIAWPLFAEQKMNTLLLVEDVGAALRIHAGE- 410
Query: 415 KDRLVSAAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSSRVALDNLVESFK 473
D +V E+ + V LM+ E+G+A+ + +KE + D G S + ++ +K
Sbjct: 411 -DGIVRREEVVRVVKALMEGEEGKAIGNKVKELKEGVVRVLGDDGLSSKSFGEVLLKWK 468
>sp|Q9LSY5|U71B7_ARATH UDP-glycosyltransferase 71B7 OS=Arabidopsis thaliana GN=UGT71B7
PE=2 SV=2
Length = 495
Score = 266 bits (681), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 172/509 (33%), Positives = 264/509 (51%), Gaps = 56/509 (11%)
Query: 1 MKDTIVFYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVSSA--GTDDYIASVS 58
MK +VF PG GHL S VE+ KL++ SI +II PF+S G DYIA++S
Sbjct: 1 MKFELVFIPYPGIGHLRSTVEMAKLLVDRETRLSISVII--LPFISEGEVGASDYIAALS 58
Query: 59 ATAPSVTFHQLPPPVSGLPDTLRSPADFPALVYELGELN----NPKLHETLITISKRSNL 114
A++ + +++ S D P + E++ PK+ T+ + + +
Sbjct: 59 ASSNNRLRYEVI-----------SAVDQPTIEMTTIEIHMKNQEPKVRSTVAKLLEDYSS 107
Query: 115 K-------AFVIDFFCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFR 167
K FV+D FC V++ P+Y ++T++ +L+ ++ L
Sbjct: 108 KPDSPKIAGFVLDMFCTSMVDVANE-FGFPSYMFYTSSAGILSVTYHVQMLCDENKYDVS 166
Query: 168 EL----GSTLLNFPGFP-PFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGVIVNTFEL 222
E +LNFP P+P + LP V+ + + G++VNT
Sbjct: 167 ENDYADSEAVLNFPSLSRPYPVK--CLPHALAANMWLPVFVNQARKFREMKGILVNTVAE 224
Query: 223 LEERAIKAMLEGQCTPGETSPPLYCIGPVVGRGNGENRGRD--RHECLSWLDSKPSRSVL 280
LE +K + +PP+Y +GP++ N + +D R E + WLD +P SV+
Sbjct: 225 LEPYVLKFL------SSSDTPPVYPVGPLLHLENQRDDSKDEKRLEIIRWLDQQPPSSVV 278
Query: 281 FLCFGSLGSFSSKQLKEMAIGLERSGVKFLWVVRAPAPDSIENR----SSLESLLPEGFL 336
FLCFGS+G F +Q++E+AI LERSG +FLW +R +P+ + ++LE +LPEGF
Sbjct: 279 FLCFGSMGGFGEEQVREIAIALERSGHRFLWSLRRASPNIFKELPGEFTNLEEVLPEGFF 338
Query: 337 DRTKDRGLVVESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIR 396
DRTKD G V+ WAPQV VL + ++GGFVTHCGWNS LE + GVP AWPLYAEQK
Sbjct: 339 DRTKDIGKVI-GWAPQVAVLANPAIGGFVTHCGWNSTLESLWFGVPTAAWPLYAEQKFNA 397
Query: 397 AVVVEEMKVGLAVTRSEEKDRL-------VSAAELEQRVSELMDSEKGRAVKERAVAMKE 449
++VEE+ + + + + + L V+A E+E+ + LM E+ V++R M E
Sbjct: 398 FLMVEELGLAVEIRKYWRGEHLAGLPTATVTAEEIEKAIMCLM--EQDSDVRKRVKDMSE 455
Query: 450 AAAAAMRDGGSSRVALDNLVESFKRGCIA 478
A+ DGGSSR AL +E + ++
Sbjct: 456 KCHVALMDGGSSRTALQKFIEEVAKNIVS 484
>sp|O23382|U71B5_ARATH UDP-glycosyltransferase 71B5 OS=Arabidopsis thaliana GN=UGT71B5
PE=3 SV=1
Length = 478
Score = 262 bits (670), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 167/494 (33%), Positives = 257/494 (52%), Gaps = 45/494 (9%)
Query: 1 MKDTIVFYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVSSAGTDDYIASVSAT 60
MK +VF PG GHL V+L K ++ SI III + F + IAS++
Sbjct: 1 MKIELVFIPLPGIGHLRPTVKLAKQLIGSENRLSITIIIIPSRF-DAGDASACIASLTTL 59
Query: 61 APSVTFHQLPPPVSGLPDTLRSPADFPALVYELGELNNPKLHETLIT--ISKRSNLKAFV 118
+ H V+ P T P PA VY E K+ + + + L FV
Sbjct: 60 SQDDRLHYESISVAKQPPT-SDPDPVPAQVY--IEKQKTKVRDAVAARIVDPTRKLAGFV 116
Query: 119 IDFFCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSTL--LNF 176
+D FC+ V++ +P Y +T+ + L L++ ++ EL +++ L F
Sbjct: 117 VDMFCSSMIDVANE-FGVPCYMVYTSNATFLGTMLHVQQMYDQKKYDVSELENSVTELEF 175
Query: 177 PGFP-PFPARDMA--------LPMHDREGKVYKGLVDTGIQMAKSAGVIVNTFELLEERA 227
P P+P + + LP+ + + ++ K G++VNT LE A
Sbjct: 176 PSLTRPYPVKCLPHILTSKEWLPLSLAQARCFR----------KMKGILVNTVAELEPHA 225
Query: 228 IKAMLEGQCTPGETSPPLYCIGPVVGRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSL 287
+K G+ P +Y +GPV+ NG + + E L WLD +PS+SV+FLCFGSL
Sbjct: 226 LKMF----NINGDDLPQVYPVGPVLHLENGNDDDEKQSEILRWLDEQPSKSVVFLCFGSL 281
Query: 288 GSFSSKQLKEMAIGLERSGVKFLWVVRAPAPDSIENR----SSLESLLPEGFLDRTKDRG 343
G F+ +Q +E A+ L+RSG +FLW +R +P+ +R ++LE +LPEGFL+RT DRG
Sbjct: 282 GGFTEEQTRETAVALDRSGQRFLWCLRHASPNIKTDRPRDYTNLEEVLPEGFLERTLDRG 341
Query: 344 LVVESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVVEEM 403
V+ WAPQV VL ++GGFVTHCGWNS+LE + GVPM+ WPLYAEQK+ +VEE+
Sbjct: 342 KVI-GWAPQVAVLEKPAIGGFVTHCGWNSILESLWFGVPMVTWPLYAEQKVNAFEMVEEL 400
Query: 404 KVGLAVTRSEEKD------RLVSAAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRD 457
+ + + + + D V+A ++E+ + +M E+ V+ M E A+ D
Sbjct: 401 GLAVEIRKYLKGDLFAGEMETVTAEDIERAIRRVM--EQDSDVRNNVKEMAEKCHFALMD 458
Query: 458 GGSSRVALDNLVES 471
GGSS+ AL+ ++
Sbjct: 459 GGSSKAALEKFIQD 472
>sp|Q66PF3|UFOG3_FRAAN Putative UDP-glucose flavonoid 3-O-glucosyltransferase 3
OS=Fragaria ananassa GN=GT3 PE=2 SV=1
Length = 478
Score = 247 bits (630), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 164/488 (33%), Positives = 257/488 (52%), Gaps = 36/488 (7%)
Query: 5 IVFYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVSSAGTDDYIASV----SAT 60
+V SPG GHL S +E+ KL+++ I ++I P VS GTD Y+ S+ S
Sbjct: 7 LVLIPSPGIGHLVSTLEIAKLLVSRDDKLFITVLIMHFPAVSK-GTDAYVQSLADSSSPI 65
Query: 61 APSVTFHQLPPPVSGLPDTLRSPADFPALVYELGELNNPKLHETLITI--SKRSNLKAFV 118
+ + F LP + + T S + +LV E P + + + + SK + L FV
Sbjct: 66 SQRINFINLPH--TNMDHTEGSVRN--SLV-GFVESQQPHVKDAVANLRDSKTTRLAGFV 120
Query: 119 IDFFCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKS---FRELGSTLLN 175
+D FC V++ L +P+Y +FT+ + L +L L K F++ + L+
Sbjct: 121 VDMFCTTMINVANQ-LGVPSYVFFTSGAATLGLLFHLQELRDQYNKDCTEFKDSDAELII 179
Query: 176 FPGFPPFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGVIVNTFELLEERAIKAMLEGQ 235
F P PA+ LP + ++ + ++ G++VNTF LE A+ A+
Sbjct: 180 PSFFNPLPAK--VLPGRMLVKDSAEPFLNVIKRFRETKGILVNTFTDLESHALHAL---- 233
Query: 236 CTPGETSPPLYCIGPVVGRGNGENRG-----RDRHECLSWLDSKPSRSVLFLCFGSLGSF 290
+ PP+Y +GP++ + E+R + +++ L WLD +P SV+FLCFGS+GSF
Sbjct: 234 -SSDAEIPPVYPVGPLLNLNSNESRVDSDEVKKKNDILKWLDDQPPLSVVFLCFGSMGSF 292
Query: 291 SSKQLKEMAIGLERSGVKFLWVVRAPAPDSI----ENRSSLESLLPEGFLDRTKDRGLVV 346
Q++E+A LE +G +FLW +R P + +LPEGFLDRT G V+
Sbjct: 293 DESQVREIANALEHAGHRFLWSLRRSPPTGKVAFPSDYDDHTGVLPEGFLDRTGGIGKVI 352
Query: 347 ESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVVEEMKVG 406
WAPQV VL H SVGGFV+HCGWNS LE + GVP+ WPLYAEQ++ V+E+++
Sbjct: 353 -GWAPQVAVLAHPSVGGFVSHCGWNSTLESLWHGVPVATWPLYAEQQLNAFQPVKELELA 411
Query: 407 LAVTRSEEKDR--LVSAAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSSRVA 464
+ + S LVSA E+E+ + E+M+ + +++R M E A+ DGGSS +
Sbjct: 412 VEIDMSYRSKSPVLVSAKEIERGIREVMELDSSD-IRKRVKEMSEKGKKALMDGGSSYTS 470
Query: 465 LDNLVESF 472
L + ++
Sbjct: 471 LGHFIDQI 478
>sp|O82383|U71D1_ARATH UDP-glycosyltransferase 71D1 OS=Arabidopsis thaliana GN=UGT71D1
PE=2 SV=1
Length = 467
Score = 245 bits (626), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 152/476 (31%), Positives = 243/476 (51%), Gaps = 33/476 (6%)
Query: 5 IVFYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVSSAGTDDYIASVSATAPSV 64
++F +P GHL +E + ++ I I++ + D Y+ S++++ P V
Sbjct: 6 LIFIPTPTVGHLVPFLEFARRLIEQDDRIRITILL--MKLQGQSHLDTYVKSIASSQPFV 63
Query: 65 TFHQLPPPVSGLPDTLRSPADFPALVYELGELNNPKLHETLITISKRSNL-----KAFVI 119
F +P + P TL S A VY++ E N P + ++ I L K V+
Sbjct: 64 RFIDVPE-LEEKP-TLGSTQSVEAYVYDVIERNIPLVRNIVMDILTSLALDGVKVKGLVV 121
Query: 120 DFFCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSTLLNFPGF 179
DFFC P V+ +S+P Y + TT LA YL H T F +L+ PGF
Sbjct: 122 DFFCLPMIDVAKD-ISLPFYVFLTTNSGFLAMMQYLADRHSRDTSVFVRNSEEMLSIPGF 180
Query: 180 -PPFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGVIVNTFELLEERAIKAMLEGQCTP 238
P PA + + +G Y V I K+ G++VN+ +E ++ L+ Q
Sbjct: 181 VNPVPANVLPSALFVEDG--YDAYVKLAILFTKANGILVNSSFDIEPYSVNHFLQEQ--- 235
Query: 239 GETSPPLYCIGPVV---GRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQL 295
P +Y +GP+ + + E R E + WLD +P SV+FLCFGS+ +
Sbjct: 236 --NYPSVYAVGPIFDLKAQPHPEQDLTRRDELMKWLDDQPEASVVFLCFGSMARLRGSLV 293
Query: 296 KEMAIGLERSGVKFLWVVRAPAPDSIENRSSLESLLPEGFLDRTKDRGLVVESWAPQVEV 355
KE+A GLE +FLW S+ + LPEGFLDR RG++ W+PQVE+
Sbjct: 294 KEIAHGLELCQYRFLW--------SLRKEEVTKDDLPEGFLDRVDGRGMIC-GWSPQVEI 344
Query: 356 LNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVVEEMK--VGLAVTRSE 413
L H++VGGFV+HCGWNS++E + GVP++ WP+YAEQ++ ++V+E+K V L +
Sbjct: 345 LAHKAVGGFVSHCGWNSIVESLWFGVPIVTWPMYAEQQLNAFLMVKELKLAVELKLDYRV 404
Query: 414 EKDRLVSAAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSSRVALDNLV 469
D +V+A E+E + +MD++ V++R + + + A ++GGSS A++ +
Sbjct: 405 HSDEIVNANEIETAIRYVMDTDNN-VVRKRVMDISQMIQRATKNGGSSFAAIEKFI 459
>sp|Q9LML6|U71C4_ARATH UDP-glycosyltransferase 71C4 OS=Arabidopsis thaliana GN=UGT71C4
PE=2 SV=2
Length = 479
Score = 244 bits (624), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 167/491 (34%), Positives = 249/491 (50%), Gaps = 41/491 (8%)
Query: 5 IVFYTSPGRGHLNSMVELGK-LILTYHPCFSIDIIIPTAPFVSSAGTDDYIASVSATAPS 63
++F P GH+ +E K LI H +I I+ ++P SS + S+ A+ P
Sbjct: 7 LIFIPVPSTGHILVHIEFAKRLINLDHRIHTITILNLSSP--SSPHASVFARSLIASQPK 64
Query: 64 VTFHQLPPPVSGLPDTL--RSPADFPALVYELGELNNPKLHETLITI-------SKRSNL 114
+ H LPP P L R+P A + +L + N P + + + +I S +
Sbjct: 65 IRLHDLPPIQDPPPFDLYQRAPE---AYIVKLIKKNTPLIKDAVSSIVASRRGGSDSVQV 121
Query: 115 KAFVIDFFCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFR-ELGSTL 173
V+D FCN + + L++P+Y Y T L Y+P H+ F G
Sbjct: 122 AGLVLDLFCNSLVKDVGNELNLPSYIYLTCNARYLGMMKYIPDRHRKIASEFDLSSGDEE 181
Query: 174 LNFPGF-PPFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGVIVNTFELLEERAIKAML 232
L PGF P + M + ++E Y+ V+ + A + G++VN+F LE
Sbjct: 182 LPVPGFINAIPTKFMPPGLFNKEA--YEAYVELAPRFADAKGILVNSFTELEPHPFDYF- 238
Query: 233 EGQCTPGETSPPLYCIGPVVG---RGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGS 289
+ E PP+Y +GP++ R + DR + + WLD +P SV+FLCFGS GS
Sbjct: 239 ----SHLEKFPPVYPVGPILSLKDRASPNEEAVDRDQIVGWLDDQPESSVVFLCFGSRGS 294
Query: 290 FSSKQLKEMAIGLERSGVKFLWVVRAPAPDSIENRSSLESLLPEGFLDRTKDRGLVVESW 349
Q+KE+A LE G +FLW +R S + ++ +LPEGF+ R RGLV W
Sbjct: 295 VDEPQVKEIARALELVGCRFLWSIRT----SGDVETNPNDVLPEGFMGRVAGRGLVC-GW 349
Query: 350 APQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVVEE--MKVGL 407
APQVEVL H+++GGFV+HCGWNS LE + GVP+ WP+YAEQ++ +V+E + V L
Sbjct: 350 APQVEVLAHKAIGGFVSHCGWNSTLESLWFGVPVATWPMYAEQQLNAFTLVKELGLAVDL 409
Query: 408 AVTRSEEKDRLVSAAELEQRVSELMD--SEKGRAVKERAVAMKEAAAAAMRDGGSSRVAL 465
+ + LV+ E+ + V LMD EK + VKE M +AA A+ DGGSS +A
Sbjct: 410 RMDYVSSRGGLVTCDEIARAVRSLMDGGDEKRKKVKE----MADAARKALMDGGSSSLAT 465
Query: 466 DNLV-ESFKRG 475
+ E F+ G
Sbjct: 466 ARFIAELFEDG 476
>sp|Q9LSY4|U71B8_ARATH UDP-glycosyltransferase 71B8 OS=Arabidopsis thaliana GN=UGT71B8
PE=3 SV=1
Length = 480
Score = 243 bits (619), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 170/508 (33%), Positives = 262/508 (51%), Gaps = 68/508 (13%)
Query: 2 KDTIVFYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVS--SAGTDDYIASVSA 59
K +VF P GHL S E+ KL++ SI III P +S YI+++SA
Sbjct: 3 KFALVFVPFPILGHLKSTAEMAKLLVEQETRLSISIII--LPLLSGDDVSASAYISALSA 60
Query: 60 TAPSVTFHQLPPPVSGLPDTLRSPADFPALVYELGELNNPKLHETLITI----SKRSN-- 113
+ +++ S D P + + + P + T+ + S+R +
Sbjct: 61 ASNDRLHYEVI-----------SDGDQPTVGLHVDN-HIPMVKRTVAKLVDDYSRRPDSP 108
Query: 114 -LKAFVIDFFCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFREL--- 169
L V+D FC V++ +S+P Y ++T+ +LA L++ L S E
Sbjct: 109 RLAGLVVDMFCISVIDVANE-VSVPCYLFYTSNVGILALGLHIQMLFDKKEYSVSETDFE 167
Query: 170 -GSTLLNFPGFP-PFPARDMA--------LPMHDREGKVYKGLVDTGIQMAKSAGVIVNT 219
+L+ P P+P + + LPM+ +G+ ++ + G++VNT
Sbjct: 168 DSEVVLDVPSLTCPYPVKCLPYGLATKEWLPMYLNQGRRFREM----------KGILVNT 217
Query: 220 FELLEERAIKAMLEGQCTPGETSPPLYCIGPVVGRGNGENRGRDRH--ECLSWLDSKPSR 277
F LE A LE + G+T P Y +GP++ N + +D + L WLD +P +
Sbjct: 218 FAELEPYA----LESLHSSGDT-PRAYPVGPLLHLENHVDGSKDEKGSDILRWLDEQPPK 272
Query: 278 SVLFLCFGSLGSFSSKQLKEMAIGLERSGVKFLWVVRAPAPDSIENR----SSLESLLPE 333
SV+FLCFGS+G F+ +Q +EMAI LERSG +FLW +R + D + +LE +LPE
Sbjct: 273 SVVFLCFGSIGGFNEEQAREMAIALERSGHRFLWSLRRASRDIDKELPGEFKNLEEILPE 332
Query: 334 GFLDRTKDRGLVVESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQK 393
GF DRTKD+G V+ WAPQV VL ++GGFVTHCGWNS+LE + GVP+ WPLYAEQK
Sbjct: 333 GFFDRTKDKGKVI-GWAPQVAVLAKPAIGGFVTHCGWNSILESLWFGVPIAPWPLYAEQK 391
Query: 394 MIRAVVVEEMKVGLAVTRSEEKDRL-------VSAAELEQRVSELMDSEKGRAVKERAVA 446
V+VEE+ + + + + D+L V+A E+E+ + LM E+ V+ R
Sbjct: 392 FNAFVMVEELGLAVKIRKYWRGDQLVGTATVIVTAEEIERGIRCLM--EQDSDVRNRVKE 449
Query: 447 MKEAAAAAMRDGGSSRVALDNLVESFKR 474
M + A++DGGSS+ AL ++ +
Sbjct: 450 MSKKCHMALKDGGSSQSALKLFIQDVTK 477
>sp|O82385|U71D2_ARATH UDP-glycosyltransferase 71D2 OS=Arabidopsis thaliana GN=UGT71D2
PE=2 SV=1
Length = 467
Score = 241 bits (616), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 154/481 (32%), Positives = 246/481 (51%), Gaps = 41/481 (8%)
Query: 5 IVFYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVSSAGTDDYIASVSATAPSV 64
++F +P GHL +E + ++ I ++ + D Y+ ++S++ P V
Sbjct: 6 LIFIPTPTVGHLVPFLEFARRLIEQDDRIRITFLLMKQQ--GQSHLDSYVKTISSSLPFV 63
Query: 65 TFHQLP----PPVSGLPDTLRSPADFPALVYELGELNNPKLHETLITISKR-----SNLK 115
F +P P G A VY+ E N P + ++ I +K
Sbjct: 64 RFIDVPELEEKPTLGTQSV-------EAYVYDFIETNVPLVQNIIMGILSSPAFDGVTVK 116
Query: 116 AFVIDFFCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSTLLN 175
FV DFFC P V+ S+P Y + T+ LA YL HK T F +L+
Sbjct: 117 GFVADFFCLPMIDVAKDA-SLPFYVFLTSNSGFLAMMQYLAYGHKKDTSVFARNSEEMLS 175
Query: 176 FPGF-PPFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGVIVNTFELLEERAIKAMLEG 234
PGF P PA+ + + +G Y V I K+ G++VNT +E ++ L G
Sbjct: 176 IPGFVNPVPAKVLPSALFIEDG--YDADVKLAILFTKANGILVNTSFDIEPTSLNHFL-G 232
Query: 235 QCTPGETSPPLYCIGPVVGRGNGENRGRDRHEC---LSWLDSKPSRSVLFLCFGSLGSFS 291
+ E P +Y +GP+ + +D C + WLD++P SV+FLCFGS+GS
Sbjct: 233 E----ENYPSVYAVGPIFNPKAHPHPDQDLACCDESMKWLDAQPEASVVFLCFGSMGSLR 288
Query: 292 SKQLKEMAIGLERSGVKFLWVVRAPAPDSIENRSSLESLLPEGFLDRTKDRGLVVESWAP 351
+KE+A GLE +FLW +R + + N + LLPEGF+DR RG++ W+P
Sbjct: 289 GPLVKEIAHGLELCQYRFLWSLRT---EEVTN----DDLLPEGFMDRVSGRGMIC-GWSP 340
Query: 352 QVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVVEEMK--VGLAV 409
QVE+L H++VGGFV+HCGWNS++E + GVP++ WP+YAEQ++ ++V+E+K V L +
Sbjct: 341 QVEILAHKAVGGFVSHCGWNSIVESLWFGVPIVTWPMYAEQQLNAFLMVKELKLAVELKL 400
Query: 410 TRSEEKDRLVSAAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSSRVALDNLV 469
S +VSA E+E +S +M+ + V++R + + + A ++GGSS A++ +
Sbjct: 401 DYSVHSGEIVSANEIETAISCVMNKDN-NVVRKRVMDISQMIQRATKNGGSSFAAIEKFI 459
Query: 470 E 470
Sbjct: 460 H 460
>sp|Q94A84|U72E1_ARATH UDP-glycosyltransferase 72E1 OS=Arabidopsis thaliana GN=UGT72E1
PE=1 SV=1
Length = 487
Score = 239 bits (611), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 157/486 (32%), Positives = 252/486 (51%), Gaps = 36/486 (7%)
Query: 2 KDTIVFYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVSSAGTDDYIASVSATA 61
K + + SPG GH+ ++ELGK + H D+ I +++ ++ S A
Sbjct: 5 KPHVAMFASPGMGHIIPVIELGKRLAGSH---GFDVTIFVLETDAASAQSQFLNSPGCDA 61
Query: 62 PSVTFHQLPPP-VSGLPDTLRSPADFPALVYELGELNNPKLHETLITI-SKRSNLK---- 115
V LP P +SGL D P+ + + L + ET+ TI SK ++
Sbjct: 62 ALVDIVGLPTPDISGLVD--------PSAFFGIKLLV--MMRETIPTIRSKIEEMQHKPT 111
Query: 116 AFVIDFFCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSTLLN 175
A ++D F A + ++ TY + + LA L+ PTL K+ + + +
Sbjct: 112 ALIVDLFGLDAIPLGGE-FNMLTYIFIASNARFLAVALFFPTLDKDMEEE-HIIKKQPMV 169
Query: 176 FPGFPPFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGVIVNTFELLEERAIKAMLEGQ 235
PG P D D ++Y+ V G G+IVNT++ +E + +K++ + +
Sbjct: 170 MPGCEPVRFEDTLETFLDPNSQLYREFVPFGSVFPTCDGIIVNTWDDMEPKTLKSLQDPK 229
Query: 236 CTPGETSPPLYCIGPVVGRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQL 295
P+Y IGP+ + + H L WL+ +P SVL++ FGS GS S+KQL
Sbjct: 230 LLGRIAGVPVYPIGPL---SRPVDPSKTNHPVLDWLNKQPDESVLYISFGSGGSLSAKQL 286
Query: 296 KEMAIGLERSGVKFLWVVRAP----------APDSIENRSSLESLLPEGFLDRTKDRGLV 345
E+A GLE S +F+WVVR P + +S + R LPEGF+ RT +RG +
Sbjct: 287 TELAWGLEMSQQRFVWVVRPPVDGSACSAYLSANSGKIRDGTPDYLPEGFVSRTHERGFM 346
Query: 346 VESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVVEEMKV 405
V SWAPQ E+L H++VGGF+THCGWNS+LE V GVPM+AWPL+AEQ M ++ EE+ V
Sbjct: 347 VSSWAPQAEILAHQAVGGFLTHCGWNSILESVVGGVPMIAWPLFAEQMMNATLLNEELGV 406
Query: 406 GLAVTRSEEKDRLVSAAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMR-DGGSSRVA 464
+ ++ + +++ AE+E V ++M E+G ++++ +KE AA ++ DGG + +
Sbjct: 407 AVR-SKKLPSEGVITRAEIEALVRKIMVEEEGAEMRKKIKKLKETAAESLSCDGGVAHES 465
Query: 465 LDNLVE 470
L + +
Sbjct: 466 LSRIAD 471
>sp|Q2V6K0|UFOG6_FRAAN UDP-glucose flavonoid 3-O-glucosyltransferase 6 OS=Fragaria
ananassa GN=GT6 PE=1 SV=1
Length = 479
Score = 239 bits (611), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 154/489 (31%), Positives = 258/489 (52%), Gaps = 38/489 (7%)
Query: 5 IVFYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVSSAGTDDYIASVSATAPS- 63
++F PG GH+ S VE+ KL+L I I+I PF ++ G+D YI S+ A PS
Sbjct: 7 LIFIPIPGIGHIVSTVEIAKLLLCRDDNLFITILIMKFPF-TADGSDVYIKSL-AVDPSL 64
Query: 64 ----VTFHQLPPP-VSGLPDTLRSPADFPALVYELGELNNPKLHETLITISKRSNLKAFV 118
+ F LP G T F + + + T S+ + + FV
Sbjct: 65 KTQRIRFVNLPQEHFQGTGAT-----GFFTFIDSHKSHVKDAVTRLMETKSETTRIAGFV 119
Query: 119 IDFFCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKS---FRELGSTLLN 175
ID FC +++ +P+Y ++T+ + L +L L K F++ + L+
Sbjct: 120 IDMFCTGMIDLANE-FGLPSYVFYTSGAADLGLMFHLQALRDEENKDCTEFKDSDAELVV 178
Query: 176 FPGFPPFPA-RDMALPMHDREGKVYKGLVDTGIQMAKSAGVIVNTFELLEERAIKAMLEG 234
P PA R + + ++EG + ++ + ++ G++VNTF LE AI+++
Sbjct: 179 SSFVNPLPAARVLPSVVFEKEGGNF--FLNFAKRYRETKGILVNTFLELEPHAIQSL--- 233
Query: 235 QCTPGETSPPLYCIGPVV-----GRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGS 289
+ G+ P +Y +GP++ G + + + + L WLD +P SV+FLCFGS+G
Sbjct: 234 -SSDGKILP-VYPVGPILNVKSEGNQVSSEKSKQKSDILEWLDDQPPSSVVFLCFGSMGC 291
Query: 290 FSSKQLKEMAIGLERSGVKFLWVVRAPAPDSI---ENRSSLESLLPEGFLDRTKDRGLVV 346
F Q+KE+A LE+ G++FLW +R P+ + I + + +++LPEGFLDRT D G V+
Sbjct: 292 FGEDQVKEIAHALEQGGIRFLWSLRQPSKEKIGFPSDYTDYKAVLPEGFLDRTTDLGKVI 351
Query: 347 ESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVVEEMKVG 406
WAPQ+ +L H +VGGFV+HCGWNS LE + GVP+ WP YAEQ++ +V+E+K+
Sbjct: 352 -GWAPQLAILAHPAVGGFVSHCGWNSTLESIWYGVPIATWPFYAEQQVNAFELVKELKLA 410
Query: 407 LAVTRSEEKDR--LVSAAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSSRVA 464
+ + KD +VS +E+ + E+M+ E +++R M + + A+ + GSS +
Sbjct: 411 VEIDMGYRKDSGVIVSRENIEKGIKEVMEQES--ELRKRVKEMSQMSRKALEEDGSSYSS 468
Query: 465 LDNLVESFK 473
L ++ +
Sbjct: 469 LGRFLDQIQ 477
>sp|Q9LSY9|U71B1_ARATH UDP-glycosyltransferase 71B1 OS=Arabidopsis thaliana GN=UGT71B1
PE=2 SV=1
Length = 473
Score = 239 bits (610), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 157/494 (31%), Positives = 255/494 (51%), Gaps = 52/494 (10%)
Query: 1 MKDTIVFYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVSSAGTDDYIASVSAT 60
MK +VF SPG GH+ + L KL++ S+ +I+ + S +DD +SV
Sbjct: 1 MKVELVFIPSPGVGHIRATTALAKLLVASDNRLSVTLIV-----IPSRVSDDASSSVYTN 55
Query: 61 APSVTFHQLPPPVSGLPDTLRSPADFPALVYELGELNNPKLHETL------ITISKRSNL 114
+ + L P D LV + + P++ + ++ S L
Sbjct: 56 SEDRLRYILLPARDQTTD----------LVSYI-DSQKPQVRAVVSKVAGDVSTRSDSRL 104
Query: 115 KAFVIDFFCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSTLL 174
V+D FC ++ ++ Y ++T+ S L ++ +L+ E T +
Sbjct: 105 AGIVVDMFCTSMIDIADE-FNLSAYIFYTSNASYLGLQFHVQSLYDEKELDVSEFKDTEM 163
Query: 175 NF--PGF-PPFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGVIVNTFELLEERAIKAM 231
F P PFPA+ LP K + ++ + G++VN+ +E +A+
Sbjct: 164 KFDVPTLTQPFPAK--CLPSVMLNKKWFPYVLGRARSFRATKGILVNSVADMEPQALSFF 221
Query: 232 LEGQCTPGETS-PPLYCIGPVVGRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSF 290
G G T+ PP+Y +GP++ + + + R E L WL +P++SV+FLCFGS+G F
Sbjct: 222 SGGN---GNTNIPPVYAVGPIMDLESSGDEEK-RKEILHWLKEQPTKSVVFLCFGSMGGF 277
Query: 291 SSKQLKEMAIGLERSGVKFLWVVRAPAPDSIENRS--------SLESLLPEGFLDRTKDR 342
S +Q +E+A+ LERSG +FLW +R +P + N+S +LE +LP+GFLDRT +
Sbjct: 278 SEEQAREIAVALERSGHRFLWSLRRASP--VGNKSNPPPGEFTNLEEILPKGFLDRTVEI 335
Query: 343 GLVVESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVVEE 402
G ++ SWAPQV+VLN ++G FVTHCGWNS+LE + GVPM AWP+YAEQ+ +V+E
Sbjct: 336 GKII-SWAPQVDVLNSPAIGAFVTHCGWNSILESLWFGVPMAAWPIYAEQQFNAFHMVDE 394
Query: 403 MKVGLAVTRSEEKD------RLVSAAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMR 456
+ + V + +D +V+A E+E+ + M E+ +++R + MK+ A+
Sbjct: 395 LGLAAEVKKEYRRDFLVEEPEIVTADEIERGIKCAM--EQDSKMRKRVMEMKDKLHVALV 452
Query: 457 DGGSSRVALDNLVE 470
DGGSS AL V+
Sbjct: 453 DGGSSNCALKKFVQ 466
>sp|Q40287|UFOG5_MANES Anthocyanidin 3-O-glucosyltransferase 5 OS=Manihot esculenta GN=GT5
PE=2 SV=1
Length = 487
Score = 239 bits (609), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 156/471 (33%), Positives = 242/471 (51%), Gaps = 28/471 (5%)
Query: 2 KDTIVFYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVSSAGTDDYIASVSATA 61
K IV +SPG GHL ++ELGK I+T C + D+ I +SA + S + T
Sbjct: 9 KPHIVLLSSPGLGHLIPVLELGKRIVTL--C-NFDVTIFMVGSDTSAAEPQVLRS-AMTP 64
Query: 62 PSVTFHQLPPP-VSGLPDTLRSPADFPALVYELGELNNPKLHETLITISKRSNLKAFVID 120
QLPPP +S L D A ++ L P + + R A ++D
Sbjct: 65 KLCEIIQLPPPNISCLID---PEATVCTRLFVLMREIRPAFRAAVSALKFRP--AAIIVD 119
Query: 121 FFCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSTLLNFPGFP 180
F + +V+ L I Y Y + LA +Y+P L K F L + PG
Sbjct: 120 LFGTESLEVAKE-LGIAKYVYIASNAWFLALTIYVPILDKEVEGEFV-LQKEPMKIPGCR 177
Query: 181 PFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGVIVNTFELLEERAIKAMLEGQCTPGE 240
P ++ PM DR + Y GI++ + G+++NT+E LE A+ + +
Sbjct: 178 PVRTEEVVDPMLDRTNQQYSEYFRLGIEIPTADGILMNTWEALEPTTFGALRDVKFLGRV 237
Query: 241 TSPPLYCIGPVVGRGN--GENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEM 298
P++ IGP+ + G N E L WLD +P SV+++ FGS G+ S +Q+ E+
Sbjct: 238 AKVPVFPIGPLRRQAGPCGSN-----CELLDWLDQQPKESVVYVSFGSGGTLSLEQMIEL 292
Query: 299 AIGLERSGVKFLWVVRAPAPDSIE--------NRSSLESLLPEGFLDRTKDRGLVVESWA 350
A GLERS +F+WVVR P + + + PEGFL R ++ GLVV W+
Sbjct: 293 AWGLERSQQRFIWVVRQPTVKTGDAAFFTQGDGADDMSGYFPEGFLTRIQNVGLVVPQWS 352
Query: 351 PQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVVEEMKVGLAVT 410
PQ+ +++H SVG F++HCGWNSVLE + AGVP++AWP+YAEQ+M ++ EE+ V +
Sbjct: 353 PQIHIMSHPSVGVFLSHCGWNSVLESITAGVPIIAWPIYAEQRMNATLLTEELGVAVRPK 412
Query: 411 RSEEKDRLVSAAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSS 461
K+ +V E+E+ + +M E+G +++R +K++ A+ +GGSS
Sbjct: 413 NLPAKE-VVKREEIERMIRRIMVDEEGSEIRKRVRELKDSGEKALNEGGSS 462
>sp|Q9LSY8|U71B2_ARATH UDP-glycosyltransferase 71B2 OS=Arabidopsis thaliana GN=UGT71B2
PE=1 SV=1
Length = 485
Score = 238 bits (608), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 173/507 (34%), Positives = 265/507 (52%), Gaps = 56/507 (11%)
Query: 1 MKDTIVFYTSPGRGHLNSMVELGKLILTYHPCFSIDIII--PTAPFVSSAGTDDYIASVS 58
MK +VF SPG GHL +VE+ KL + SI III F SS + + S
Sbjct: 1 MKLELVFIPSPGDGHLRPLVEVAKLHVDRDDHLSITIIIIPQMHGFSSSNSSSYIASLSS 60
Query: 59 ATAPSVTFHQLPPPVSGLPDTLRSPADFPALVYELGELNNPKLHETLITISKR------S 112
+ ++++ L P PD+ + F ++ + P++ T+ ++ S
Sbjct: 61 DSEERLSYNVLSVPDK--PDSDDTKPHF----FDYIDNFKPQVKATVEKLTDPGPPDSPS 114
Query: 113 NLKAFVIDFFCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLH--KNTTKS-FREL 169
L FV+D FC V++ +P+Y ++T+ + L +++ L+ KN S ++
Sbjct: 115 RLAGFVVDMFCMMMIDVANE-FGVPSYMFYTSNATFLGLQVHVEYLYDVKNYDVSDLKDS 173
Query: 170 GSTLLNFPG---------FPPFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGVIVNTF 220
+T L P FP LP+ R+ + ++ ++ G++VNTF
Sbjct: 174 DTTELEVPCLTRPLPVKCFPSVLLTKEWLPVMFRQTRRFR----------ETKGILVNTF 223
Query: 221 ELLEERAIKAMLEGQCTPGETSPPLYCIGPVVG-RGNGENRGRDRH-ECLSWLDSKPSRS 278
LE +A+K G +P P +Y +GPV+ + NG N D+ E L WLD +P +S
Sbjct: 224 AELEPQAMK-FFSGVDSP---LPTVYTVGPVMNLKINGPNSSDDKQSEILRWLDEQPRKS 279
Query: 279 VLFLCFGSLGSFSSKQLKEMAIGLERSGVKFLWVVRAPAPD-SI---ENRSSLESLLPEG 334
V+FLCFGS+G F Q KE+AI LERSG +F+W +R P SI E ++LE +LPEG
Sbjct: 280 VVFLCFGSMGGFREGQAKEIAIALERSGHRFVWSLRRAQPKGSIGPPEEFTNLEEILPEG 339
Query: 335 FLDRTKDRGLVVESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKM 394
FL+RT + G +V WAPQ +L + ++GGFV+HCGWNS LE + GVPM WPLYAEQ++
Sbjct: 340 FLERTAEIGKIV-GWAPQSAILANPAIGGFVSHCGWNSTLESLWFGVPMATWPLYAEQQV 398
Query: 395 IRAVVVEEMKVGLAVTRS------EEKDRLVSAAELEQRVSELMDSEKGRAVKERAVAMK 448
+VEE+ + + V S D L++A E+E+ + LM E+ V+ R M
Sbjct: 399 NAFEMVEELGLAVEVRNSFRGDFMAADDELMTAEEIERGIRCLM--EQDSDVRSRVKEMS 456
Query: 449 EAAAAAMRDGGSSRVALDNLVESFKRG 475
E + A+ DGGSS VAL ++ +
Sbjct: 457 EKSHVALMDGGSSHVALLKFIQDVTKN 483
>sp|Q9ZU72|U72D1_ARATH UDP-glycosyltransferase 72D1 OS=Arabidopsis thaliana GN=UGT72D1
PE=2 SV=1
Length = 470
Score = 234 bits (598), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 158/483 (32%), Positives = 252/483 (52%), Gaps = 28/483 (5%)
Query: 6 VFYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVSSAGTDDYIASVSATAPSVT 65
+ SPG GHL ++ELG + + +I + I SS+ T+ +A
Sbjct: 7 LLVASPGLGHLIPILELGNRLSS---VLNIHVTILAVTSGSSSPTETEAIHAAAARTICQ 63
Query: 66 FHQLPPPVSGLPDTLRSP--ADFPALVYELGELNNPKLHETLITISKRSNLKAFVIDFFC 123
++P S D L P F +V ++ + P + + + + ++ + ++DF
Sbjct: 64 ITEIP---SVDVDNLVEPDATIFTKMVVKMRAMK-PAVRDAVKLMKRKPTV--MIVDFLG 117
Query: 124 NPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSTLLNFPGFPPFP 183
V+ Y Y T LA +YLP L + ++ L PG P
Sbjct: 118 TELMSVADDVGMTAKYVYVPTHAWFLAVMVYLPVLDTVVEGEYVDIKEPL-KIPGCKPVG 176
Query: 184 ARDMALPMHDREGKVYKGLVDTGIQMAKSAGVIVNTFELLEERAIKAMLEGQCTPGETSP 243
+++ M DR G+ YK V G+++ S GV+VNT+E L+ + A+ E +
Sbjct: 177 PKELMETMLDRSGQQYKECVRAGLEVPMSDGVLVNTWEELQGNTLAALREDEELSRVMKV 236
Query: 244 PLYCIGPVVGRGNGENRGRDR-HECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEMAIGL 302
P+Y IGP+V N+ D+ + WLD + RSV+F+C GS G+ + +Q E+A+GL
Sbjct: 237 PVYPIGPIVRT----NQHVDKPNSIFEWLDEQRERSVVFVCLGSGGTLTFEQTVELALGL 292
Query: 303 ERSGVKFLWVVRAPAPDSIENRSSLESL---LPEGFLDRTKDRGLVVESWAPQVEVLNHE 359
E SG +F+WV+R PA S E + LPEGFLDRT+ G+VV WAPQVE+L+H
Sbjct: 293 ELSGQRFVWVLRRPASYLGAISSDDEQVSASLPEGFLDRTRGVGIVVTQWAPQVEILSHR 352
Query: 360 SVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVVEEMKVGLAVTRSE-EKDRL 418
S+GGF++HCGW+S LE + GVP++AWPLYAEQ M ++ EE +G+AV SE +R+
Sbjct: 353 SIGGFLSHCGWSSALESLTKGVPIIAWPLYAEQWMNATLLTEE--IGVAVRTSELPSERV 410
Query: 419 VSAAELEQRVSELM--DSEKGRAVKERAVAMKEAAAAAMRDGGSSRVALDNLVESFKRGC 476
+ E+ V ++M + E+G+ ++ +A ++ ++ A GSS ++L E KR
Sbjct: 411 IGREEVASLVRKIMAEEDEEGQKIRAKAEEVRVSSERAWSKDGSS---YNSLFEWAKRCY 467
Query: 477 IAP 479
+ P
Sbjct: 468 LVP 470
>sp|Q40285|UFOG2_MANES Anthocyanidin 3-O-glucosyltransferase 2 (Fragment) OS=Manihot
esculenta GN=GT2 PE=2 SV=1
Length = 346
Score = 234 bits (596), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 138/366 (37%), Positives = 213/366 (58%), Gaps = 32/366 (8%)
Query: 122 FCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKS---FRELGSTLLNFPG 178
FC P ++ IP+Y +F + G L LY+ +H + F++ + L+
Sbjct: 1 FCTPMMDLADE-FGIPSYIFFASGGGFLGFMLYVQKIHDEENFNPIEFKDSDTELIVPSL 59
Query: 179 FPPFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGVIVNTFELLEERAIKAMLEGQCTP 238
PFP R + + ++E + L+ + ++ G+IVNTF LE RAI++
Sbjct: 60 VNPFPTRILPSSILNKER--FGQLLAIAKKFRQAKGIIVNTFLELESRAIESF------- 110
Query: 239 GETSPPLYCIGPVVGRGNGENRGRDRH-ECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKE 297
PPLY +GP++ + ++ GR+ H E + WLD +P SV+FLCFGS+GSFS QLKE
Sbjct: 111 --KVPPLYHVGPIL---DVKSDGRNTHPEIMQWLDDQPEGSVVFLCFGSMGSFSEDQLKE 165
Query: 298 MAIGLERSGVKFLWVVR-APAPDSIENRSSLES---LLPEGFLDRTKDRGLVVESWAPQV 353
+A LE SG +FLW +R P PD I + + E +LPEGFL+RT G V+ WAPQV
Sbjct: 166 IAYALENSGHRFLWSIRRPPPPDKIASPTDYEDPRDVLPEGFLERTVAVGKVI-GWAPQV 224
Query: 354 EVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVVEEMKVGLAVTRSE 413
VL H ++GGFV+HCGWNSVLE + GVP+ WP+YAEQ+ +V E+ +G+ +
Sbjct: 225 AVLAHPAIGGFVSHCGWNSVLESLWFGVPIATWPMYAEQQFNAFEMVVELGLGVEIDMGY 284
Query: 414 EKDR--LVSAAELEQRVSELMDS--EKGRAVKERAVAMKEAAAAAMRDGGSSRVALDNLV 469
K+ +V++ ++E+ + +LM++ EK + VKE M+E + A+ DGGSS ++L + +
Sbjct: 285 RKESGIIVNSDKIERAIRKLMENSDEKRKKVKE----MREKSKMALIDGGSSFISLGDFI 340
Query: 470 ESFKRG 475
+ G
Sbjct: 341 KDAMEG 346
>sp|Q40284|UFOG1_MANES Anthocyanidin 3-O-glucosyltransferase 1 OS=Manihot esculenta GN=GT1
PE=2 SV=1
Length = 449
Score = 234 bits (596), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 156/475 (32%), Positives = 253/475 (53%), Gaps = 49/475 (10%)
Query: 14 GHLNSMVELGKLILTYHPCFSIDIIIPTAPFVSSAGTDDYIASVSATAPSVTFHQLPPPV 73
GHL S VE KL+L+ SI ++I V+S + + +++++ + F LP
Sbjct: 2 GHLVSAVETAKLLLSRCHSLSITVLIFNNSVVTSKVHNYVDSQIASSSNRLRFIYLPRDE 61
Query: 74 SGLPDTLRSPADFPALVYELGELNNPKLHETLITISKRSN------LKAFVIDFFCNPAF 127
+G+ + F +L+ E P + E+++ I++ + L F++D FC
Sbjct: 62 TGI-------SSFSSLI----EKQKPHVKESVMKITEFGSSVESPRLVGFIVDMFCTAMI 110
Query: 128 QVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGST--LLNFPGF-PPFPA 184
V++ +P+Y ++T+ + L L++ +H + E ++ L PG FP+
Sbjct: 111 DVANE-FGVPSYIFYTSGAAFLNFMLHVQKIHDEENFNPTEFNASDGELQVPGLVNSFPS 169
Query: 185 RDMALPMHDREGKVYKGLVDTGIQMAKSAGVIVNTFELLEERAIKAMLEGQCTPGETSPP 244
+ A+P + + L++ + ++ GVI+NTF LE AI++ PP
Sbjct: 170 K--AMPTAILSKQWFPPLLENTRRYGEAKGVIINTFFELESHAIESF---------KDPP 218
Query: 245 LYCIGPVVG-RGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEMAIGLE 303
+Y +GP++ R NG N + E + WLD +P SV+FLCFGS GSFS Q+KE+A LE
Sbjct: 219 IYPVGPILDVRSNGRNTNQ---EIMQWLDDQPPSSVVFLCFGSNGSFSKDQVKEIACALE 275
Query: 304 RSGVKFLWVV---RAPA-PDSIENRSSLESLLPEGFLDRTKDRGLVVESWAPQVEVLNHE 359
SG +FLW + RAP +S + L+ +LPEGFL+RT V+ WAPQV VL H
Sbjct: 276 DSGHRFLWSLADHRAPGFLESPSDYEDLQEVLPEGFLERTSGIEKVI-GWAPQVAVLAHP 334
Query: 360 SVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVVEEMKVGLAVTRSEEKD--R 417
+ GG V+H GWNS+LE + GVP+ WP+YAEQ+ +V E+ + + + D
Sbjct: 335 ATGGLVSHSGWNSILESIWFGVPVATWPMYAEQQFNAFQMVIELGLAVEIKMDYRNDSGE 394
Query: 418 LVSAAELEQRVSELM--DSEKGRAVKERAVAMKEAAAAAMRDGGSSRVALDNLVE 470
+V ++E+ + LM DS++ + VKE M E + A+ +GGSS LDNL++
Sbjct: 395 IVKCDQIERGIRCLMKHDSDRRKKVKE----MSEKSRGALMEGGSSYCWLDNLIK 445
>sp|Q9LML7|U71C3_ARATH UDP-glycosyltransferase 71C3 OS=Arabidopsis thaliana GN=UGT71C3
PE=2 SV=1
Length = 476
Score = 229 bits (585), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 162/490 (33%), Positives = 245/490 (50%), Gaps = 41/490 (8%)
Query: 5 IVFYTSPGRGHLNSMVELGK-LILTYHPCFSIDIIIPTAPFVSSAGTDDYIASVSATAPS 63
I+F T P GHL +E K LI +I I+ P A + S+ A+ P
Sbjct: 7 IIFVTYPSPGHLLVSIEFAKSLIKRDDRIHTITILYWALPLAPQAHL--FAKSLVASQPR 64
Query: 64 VTFHQLP-----PPVSGLPDTLRSPADFPALVYELGELNNPKLHETLITI------SKRS 112
+ LP PP+ ++P A + E + P + + L T+ S
Sbjct: 65 IRLLALPDVQNPPPLELF---FKAPE---AYILESTKKTVPLVRDALSTLVSSRKESGSV 118
Query: 113 NLKAFVIDFFCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFR-ELGS 171
+ VIDFFC P +V++ L++P+Y + T L+ YLP H+ TT G+
Sbjct: 119 RVVGLVIDFFCVPMIEVANE-LNLPSYIFLTCNAGFLSMMKYLPERHRITTSELDLSSGN 177
Query: 172 TLLNFPGFP-PFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGVIVNTFELLEERAIKA 230
PG+ P + + + RE Y+ V+ + + G++VN+ LE+ A
Sbjct: 178 VEHPIPGYVCSVPTKVLPPGLFVRES--YEAWVEIAEKFPGAKGILVNSVTCLEQNAFDY 235
Query: 231 MLEGQCTPGETSPPLYCIGPVVG---RGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSL 287
E PP+Y +GPV+ R + DR + WL+ +P S++++CFGSL
Sbjct: 236 F----ARLDENYPPVYPVGPVLSLKDRPSPNLDASDRDRIMRWLEDQPESSIVYICFGSL 291
Query: 288 GSFSSKQLKEMAIGLERSGVKFLWVVRAPAPDSIENRSSLESLLPEGFLDRTKDRGLVVE 347
G Q++E+A LE +G +FLW +R + ++S LLPEGFLDRT +GLV +
Sbjct: 292 GIIGKLQIEEIAEALELTGHRFLWSIRT----NPTEKASPYDLLPEGFLDRTASKGLVCD 347
Query: 348 SWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVVEE--MKV 405
WAPQVEVL H+++GGFV+HCGWNSVLE + GVP+ WP+YAEQ++ +V+E + V
Sbjct: 348 -WAPQVEVLAHKALGGFVSHCGWNSVLESLWFGVPIATWPMYAEQQLNAFSMVKELGLAV 406
Query: 406 GLAVTRSEEKDRLVSAAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSSRVAL 465
L + +V A E+ + LMD E ++R M EAA A+ DGGSS VA+
Sbjct: 407 ELRLDYVSAYGEIVKAEEIAGAIRSLMDGED--TPRKRVKEMAEAARNALMDGGSSFVAV 464
Query: 466 DNLVESFKRG 475
++ G
Sbjct: 465 KRFLDELIGG 474
>sp|O82382|U71C2_ARATH UDP-glycosyltransferase 71C2 OS=Arabidopsis thaliana GN=UGT71C2
PE=1 SV=1
Length = 474
Score = 228 bits (582), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 156/481 (32%), Positives = 247/481 (51%), Gaps = 32/481 (6%)
Query: 5 IVFYTSPGRGHLNSMVELGKLILTYHP--CFSIDIIIPTAPFVSSAGTDDYIASVSATAP 62
++F P GH+ + +EL K ++++ P +I I+ + PF+ + T ++ S+ T
Sbjct: 9 LIFIPFPIPGHILATIELAKRLISHQPSRIHTITILHWSLPFLPQSDTIAFLKSLIETES 68
Query: 63 SVTFHQLPPPVSGLPDTLRSPADFPALVYELGELNNPKLHETLITI------SKRSNLKA 116
+ LP + P L A + + E + P + L T+ S ++
Sbjct: 69 RIRLITLPDVQNPPPMELFVKAS-ESYILEYVKKMVPLVRNALSTLLSSRDESDSVHVAG 127
Query: 117 FVIDFFCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSF-RELGSTLLN 175
V+DFFC P V + ++P+Y + T + S L YL ++ T R ++
Sbjct: 128 LVLDFFCVPLIDVGNE-FNLPSYIFLTCSASFLGMMKYLLERNRETKPELNRSSDEETIS 186
Query: 176 FPGF-PPFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGVIVNTFELLEERAIKAMLEG 234
PGF P + LP + Y+ V+ + ++ G++VN+FE LE A
Sbjct: 187 VPGFVNSVPVK--VLPPGLFTTESYEAWVEMAERFPEAKGILVNSFESLERNAFDYFDRR 244
Query: 235 QCTPGETSPPLYCIGPVV---GRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFS 291
+ PP+Y IGP++ R N + RDR L WLD +P SV+FLCFGSL S +
Sbjct: 245 P----DNYPPVYPIGPILCSNDRPNLDLSERDR--ILKWLDDQPESSVVFLCFGSLKSLA 298
Query: 292 SKQLKEMAIGLERSGVKFLWVVRAPAPDSIENRSSLESLLPEGFLDRTKDRGLVVESWAP 351
+ Q+KE+A LE G++FLW +R + +S +LP+GF++R GLV WAP
Sbjct: 299 ASQIKEIAQALELVGIRFLWSIRTDP----KEYASPNEILPDGFMNRVMGLGLVC-GWAP 353
Query: 352 QVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVVEEMKVGLAVTR 411
QVE+L H+++GGFV+HCGWNS+LE + GVP+ WP+YAEQ++ +V+E+ + L +
Sbjct: 354 QVEILAHKAIGGFVSHCGWNSILESLRFGVPIATWPMYAEQQLNAFTIVKELGLALEMRL 413
Query: 412 S--EEKDRLVSAAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSSRVALDNLV 469
E +V A E+ V LMD E K + +A EA A+ DGGSS VA+ +
Sbjct: 414 DYVSEYGEIVKADEIAGAVRSLMDGEDVPRRKLKEIA--EAGKEAVMDGGSSFVAVKRFI 471
Query: 470 E 470
+
Sbjct: 472 D 472
>sp|Q9FE68|U71C5_ARATH UDP-glycosyltransferase 71C5 OS=Arabidopsis thaliana GN=UGT71C5
PE=2 SV=1
Length = 480
Score = 228 bits (582), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 160/492 (32%), Positives = 247/492 (50%), Gaps = 46/492 (9%)
Query: 5 IVFYTSPGRGHLNSMVELGKLILTYHPCFS-IDIIIPTAPFVSSAGTDDYIASVSATAPS 63
++F P GHL S +E GK +L S I I+ P+ A D +AS++A+ P
Sbjct: 6 LIFVPLPETGHLLSTIEFGKRLLNLDRRISMITILSMNLPYAPHA--DASLASLTASEPG 63
Query: 64 VTFHQLP----PPVSGLPDTLRSPADFPALVYELGELNNPKLHETLITI--------SKR 111
+ LP PP L DT + + N P L +T+ +
Sbjct: 64 IRIISLPEIHDPPPIKLLDTSSE-----TYILDFIHKNIPCLRKTIQDLVSSSSSSGGGS 118
Query: 112 SNLKAFVIDFFCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFREL-G 170
S++ ++DFFC + +++P+Y + T+ L YLP + T F E G
Sbjct: 119 SHVAGLILDFFCVGLIDIGRE-VNLPSYIFMTSNFGFLGVLQYLPERQRLTPSEFDESSG 177
Query: 171 STLLNFPGF-PPFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGVIVNTFELLEERAIK 229
L+ P F PA+ + + D+ Y LV G ++ ++ G++VN+F +E A +
Sbjct: 178 EEELHIPAFVNRVPAKVLPPGVFDKLS--YGSLVKIGERLHEAKGILVNSFTQVEPYAAE 235
Query: 230 AMLEGQCTPGETSPPLYCIGPVV---GRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGS 286
+G+ P +Y +GPV+ GR N E + WLD +P SVLFLCFGS
Sbjct: 236 HFSQGR-----DYPHVYPVGPVLNLTGRTNPGLASAQYKEMMKWLDEQPDSSVLFLCFGS 290
Query: 287 LGSFSSKQLKEMAIGLERSGVKFLWVVRAPAPDSIENRSSLESLLPEGFLDRTKDRGLVV 346
+G F + Q+ E+A LE G +F+W +R ++ + LPEGF+DRT RG+V
Sbjct: 291 MGVFPAPQITEIAHALELIGCRFIWAIRT----NMAGDGDPQEPLPEGFVDRTMGRGIVC 346
Query: 347 ESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVVEEMKVG 406
SWAPQV++L H++ GGFV+HCGWNSV E + GVP+ WP+YAEQ++ +V+E+ +
Sbjct: 347 -SWAPQVDILAHKATGGFVSHCGWNSVQESLWYGVPIATWPMYAEQQLNAFEMVKELGLA 405
Query: 407 LAVTRSEEKD------RLVSAAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGS 460
+ + D +VSA E+ V LMDS+ V+++ + A A+ DGGS
Sbjct: 406 VEIRLDYVADGDRVTLEIVSADEIATAVRSLMDSDN--PVRKKVIEKSSVARKAVGDGGS 463
Query: 461 SRVALDNLVESF 472
S VA N ++
Sbjct: 464 STVATCNFIKDI 475
>sp|Q9LVR1|U72E2_ARATH UDP-glycosyltransferase 72E2 OS=Arabidopsis thaliana GN=UGT72E2
PE=1 SV=1
Length = 481
Score = 226 bits (577), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 152/475 (32%), Positives = 245/475 (51%), Gaps = 53/475 (11%)
Query: 7 FYTSPGRGHLNSMVELGKLILTYHPCFSIDIII-------PTAPFVSSAGTDDYIASVSA 59
++SPG GH+ ++ELGK L+ + F + + + + F++S G D
Sbjct: 10 MFSSPGMGHVIPVIELGKR-LSANNGFHVTVFVLETDAASAQSKFLNSTGVD-------- 60
Query: 60 TAPSVTFHQLPPP-VSGLPDTLRSPADFPALVYELGELNN---PKLHETLITISKRSNLK 115
+LP P + GL D P D +V ++G + P L + + ++
Sbjct: 61 ------IVKLPSPDIYGLVD----PDDH--VVTKIGVIMRAAVPALRSKIAAMHQKPT-- 106
Query: 116 AFVIDFFCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSTLLN 175
A ++D F A ++ ++ +Y + T L ++Y P L K+ K + L
Sbjct: 107 ALIVDLFGTDALCLAKE-FNMLSYVFIPTNARFLGVSIYYPNLDKDI-KEEHTVQRNPLA 164
Query: 176 FPGFPPFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGVIVNTFELLEERAIKAMLEGQ 235
PG P D + VY+ V G+ K+ G++VNT+E +E +++K++L +
Sbjct: 165 IPGCEPVRFEDTLDAYLVPDEPVYRDFVRHGLAYPKADGILVNTWEEMEPKSLKSLLNPK 224
Query: 236 CTPGETSPPLYCIGPVVGRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQL 295
P+Y IGP+ H L WL+ +P+ SVL++ FGS G S+KQL
Sbjct: 225 LLGRVARVPVYPIGPLC---RPIQSSETDHPVLDWLNEQPNESVLYISFGSGGCLSAKQL 281
Query: 296 KEMAIGLERSGVKFLWVVRAPAPDSI----------ENRSSLESLLPEGFLDRTKDRGLV 345
E+A GLE+S +F+WVVR P S + LPEGF+ RT DRG V
Sbjct: 282 TELAWGLEQSQQRFVWVVRPPVDGSCCSEYVSANGGGTEDNTPEYLPEGFVSRTSDRGFV 341
Query: 346 VESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVVEEMKV 405
V SWAPQ E+L+H +VGGF+THCGW+S LE V GVPM+AWPL+AEQ M A++ +E+
Sbjct: 342 VPSWAPQAEILSHRAVGGFLTHCGWSSTLESVVGGVPMIAWPLFAEQNMNAALLSDEL-- 399
Query: 406 GLAVTRSEEKDRLVSAAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMR-DGG 459
G+AV + K+ +S ++E V ++M ++G A++ + ++++A ++ DGG
Sbjct: 400 GIAVRLDDPKED-ISRWKIEALVRKVMTEKEGEAMRRKVKKLRDSAEMSLSIDGG 453
>sp|O23205|U72C1_ARATH UDP-glycosyltransferase 72C1 OS=Arabidopsis thaliana GN=UGT72C1
PE=2 SV=3
Length = 457
Score = 224 bits (571), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 153/466 (32%), Positives = 237/466 (50%), Gaps = 49/466 (10%)
Query: 10 SPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVSSAGTDDYIASVSATAPSVTFHQL 69
SPG GH ++ELGK +L +H + + + T S ++ P +
Sbjct: 10 SPGMGHAVPILELGKHLLNHHGFDRVTVFLVTDDVSRSKSLIG--KTLMEEDPKFVIRFI 67
Query: 70 PPPVSGLPDTLRSPADFP-ALVYELGELNN---PKLHETLITISKRSNLKAFVIDFFCNP 125
P VSG D +L+ +L E+ P++ +++ + R + FV+D
Sbjct: 68 PLDVSG--------QDLSGSLLTKLAEMMRKALPEIKSSVMELEPRP--RVFVVDLLGTE 117
Query: 126 AFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTT-KSFRELGSTLLNFPGFPPFPA 184
A +V+ + + TT+ LA +Y+ +L K K +G+ L+ PG P
Sbjct: 118 ALEVAKELGIMRKHVLVTTSAWFLAFTVYMASLDKQELYKQLSSIGALLI--PGCSPVKF 175
Query: 185 RDMALPMHDREGKVYKGLVDT---GIQMAKSAGVIVNTFELLEERAIKAMLE----GQCT 237
P K + L ++ G ++ + GV VNT+ LE+ I + L+ G+
Sbjct: 176 ERAQDPR-----KYIRELAESQRIGDEVITADGVFVNTWHSLEQVTIGSFLDPENLGRVM 230
Query: 238 PGETSPPLYCIGPVVGRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKE 297
G P+Y +GP+V + H L WLD +P SV+++ FGS G+ + +Q E
Sbjct: 231 RG---VPVYPVGPLVRPAEPGLK----HGVLDWLDLQPKESVVYVSFGSGGALTFEQTNE 283
Query: 298 MAIGLERSGVKFLWVVRAPAPDS--------IENRSSLESLLPEGFLDRTKDRGLVVESW 349
+A GLE +G +F+WVVR PA D +N + LP GFLDRTKD GLVV +W
Sbjct: 284 LAYGLELTGHRFVWVVRPPAEDDPSASMFDKTKNETEPLDFLPNGFLDRTKDIGLVVRTW 343
Query: 350 APQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVVEEMKVGLAV 409
APQ E+L H+S GGFVTHCGWNSVLE + GVPM+AWPLY+EQKM +V E+K+ L +
Sbjct: 344 APQEEILAHKSTGGFVTHCGWNSVLESIVNGVPMVAWPLYSEQKMNARMVSGELKIALQI 403
Query: 410 TRSEEKDRLVSAAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAM 455
+ D +V + + V +MD E+G+ +++ +K+ A A+
Sbjct: 404 NVA---DGIVKKEVIAEMVKRVMDEEEGKEMRKNVKELKKTAEEAL 446
>sp|O81498|U72E3_ARATH UDP-glycosyltransferase 72E3 OS=Arabidopsis thaliana GN=UGT72E3
PE=1 SV=1
Length = 481
Score = 221 bits (564), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 146/493 (29%), Positives = 250/493 (50%), Gaps = 42/493 (8%)
Query: 7 FYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVSSAGTDDYIASVSATAPSVTF 66
++SPG GH+ ++EL K + H F + + + S + S + V
Sbjct: 10 MFSSPGMGHVLPVIELAKRLSANH-GFHVTVFVLETDAAS-------VQSKLLNSTGVDI 61
Query: 67 HQLPPP-VSGLPDTLRSPADFPALVYELGELNN---PKLHETLITISKRSNLKAFVIDFF 122
LP P +SGL D +V ++G + P L ++ + + N A +ID F
Sbjct: 62 VNLPSPDISGLVDP------NAHVVTKIGVIMREAVPTLRSKIVAMHQ--NPTALIIDLF 113
Query: 123 CNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSTLLNFPGFPPF 182
A +++ L++ TY + + L ++Y PTL + K + L PG P
Sbjct: 114 GTDALCLAAE-LNMLTYVFIASNARYLGVSIYYPTLDE-VIKEEHTVQRKPLTIPGCEPV 171
Query: 183 PARDMALPMHDREGKVYKGLVDTGIQMAKSAGVIVNTFELLEERAIKAMLEGQCTPGETS 242
D+ + VY LV + K+ G++VNT+E +E +++K++ + +
Sbjct: 172 RFEDIMDAYLVPDEPVYHDLVRHCLAYPKADGILVNTWEEMEPKSLKSLQDPKLLGRVAR 231
Query: 243 PPLYCIGPVVGRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEMAIGL 302
P+Y +GP+ H WL+ +P+ SVL++ FGS GS +++QL E+A GL
Sbjct: 232 VPVYPVGPLC---RPIQSSTTDHPVFDWLNKQPNESVLYISFGSGGSLTAQQLTELAWGL 288
Query: 303 ERSGVKFLWVVRAPAPDSI----------ENRSSLESLLPEGFLDRTKDRGLVVESWAPQ 352
E S +F+WVVR P S + + LPEGF+ RT DRG ++ SWAPQ
Sbjct: 289 EESQQRFIWVVRPPVDGSSCSDYFSAKGGVTKDNTPEYLPEGFVTRTCDRGFMIPSWAPQ 348
Query: 353 VEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVVEEMKVGLAVTRS 412
E+L H++VGGF+THCGW+S LE V GVPM+AWPL+AEQ M A++ +E+ + + R
Sbjct: 349 AEILAHQAVGGFLTHCGWSSTLESVLCGVPMIAWPLFAEQNMNAALLSDELGISV---RV 405
Query: 413 EEKDRLVSAAELEQRVSELMDSEKGRAVKERAVAMKEAA--AAAMRDGGSSRVALDNLVE 470
++ +S +++E V ++M ++G ++ + +++ A + ++ GGS+ +L + +
Sbjct: 406 DDPKEAISRSKIEAMVRKVMAEDEGEEMRRKVKKLRDTAEMSLSIHGGGSAHESLCRVTK 465
Query: 471 SFKR--GCIAPFG 481
+R C+ G
Sbjct: 466 ECQRFLECVGDLG 478
>sp|Q40288|UFOG6_MANES Anthocyanidin 3-O-glucosyltransferase 6 (Fragment) OS=Manihot
esculenta GN=GT6 PE=2 SV=1
Length = 394
Score = 219 bits (557), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 137/396 (34%), Positives = 213/396 (53%), Gaps = 28/396 (7%)
Query: 92 ELGELNNPKLH--ETLITISKRSN--LKAFVIDFFCNPAFQVSSSTLSIPTYYYFTTAGS 147
+LG ++ K H E + ++ RS+ L FV+D FC V+ L +P Y +FT+ +
Sbjct: 5 DLGFIDKQKAHVKEAVSKLTARSDSSLAGFVLDMFCTSMIDVAKE-LGVPYYIFFTSGAA 63
Query: 148 VLAANLYLPTLHKNTTKSFRELGST--LLNFPGFP-PFPARDMALPMHDREGKVYKGLVD 204
L Y+ +H + + L+ P PAR + M ++ +
Sbjct: 64 FLGFLFYVQLIHDEQDADLTQFKDSDAELSVPSLANSLPARVLPASMLVKDRFYAFIRII 123
Query: 205 TGIQMAKSAGVIVNTFELLEERAIKAMLEGQCTPGETSPPLYCIGPVVGRGNGENR-GRD 263
G++ AK G++VNTF LE A+ ++ + Q PP+Y +GP++ N EN G +
Sbjct: 124 RGLREAK--GIMVNTFMELESHALNSLKDDQSK----IPPIYPVGPILKLSNQENDVGPE 177
Query: 264 RHECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEMAIGLERSGVKFLWVVRAPAP----D 319
E + WLD +P SV+FLCFGS+G F Q KE+A LE+S +FLW +R P P +
Sbjct: 178 GSEIIEWLDDQPPSSVVFLCFGSMGGFDMDQAKEIACALEQSRHRFLWSLRRPPPKGKIE 237
Query: 320 SIENRSSLESLLPEGFLDRTKDRGLVVESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCA 379
+ + +L+ +LP GF +RT G VV WAPQV +L H ++GGFV+HCGWNS+LE +
Sbjct: 238 TSTDYENLQEILPVGFSERTAGMGKVV-GWAPQVAILEHPAIGGFVSHCGWNSILESIWF 296
Query: 380 GVPMLAWPLYAEQKMIRAVVVEEMKVGLAV----TRSEEKDRLVSAAELEQRVSELMDSE 435
VP+ WPLYAEQ+ +V E+ GLAV +E + ++SA ++E+ + +M E
Sbjct: 297 SVPIATWPLYAEQQFNAFTMVTEL--GLAVEIKMDYKKESEIILSADDIERGIKCVM--E 352
Query: 436 KGRAVKERAVAMKEAAAAAMRDGGSSRVALDNLVES 471
+++R M + + A+ D SS LD L+E
Sbjct: 353 HHSEIRKRVKEMSDKSRKALMDDESSSFWLDRLIED 388
>sp|O82381|U71C1_ARATH UDP-glycosyltransferase 71C1 OS=Arabidopsis thaliana GN=UGT71C1
PE=1 SV=1
Length = 481
Score = 218 bits (556), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 164/494 (33%), Positives = 245/494 (49%), Gaps = 49/494 (9%)
Query: 5 IVFYTSPGRGHLNSMVELGK-LILTYHP-CFSIDIIIPTAPFVSSAGTDDYIASVSATAP 62
+V P GH+ + +EL K LI +P +I I+ PF+ A T ++ S+ P
Sbjct: 9 LVIIPFPFSGHILATIELAKRLISQDNPRIHTITILYWGLPFIPQADTIAFLRSLVKNEP 68
Query: 63 SVTFHQLP-----PPVSGLPDTLRSPADFPALVYELGELNNPKLHETLITI------SKR 111
+ LP PP+ + S + E + P + E L T+ S
Sbjct: 69 RIRLVTLPEVQDPPPMELFVEFAES------YILEYVKKMVPIIREALSTLLSSRDESGS 122
Query: 112 SNLKAFVIDFFCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSF-RELG 170
+ V+DFFC P V + ++P+Y + T + L YLP H+ F R
Sbjct: 123 VRVAGLVLDFFCVPMIDVGNE-FNLPSYIFLTCSAGFLGMMKYLPERHREIKSEFNRSFN 181
Query: 171 STLLNFPGF-PPFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGVIVNTFELLEERAIK 229
L PG+ P + LP + Y+ V+ + ++ G++VN++ LE K
Sbjct: 182 EELNLIPGYVNSVPTK--VLPSGLFMKETYEPWVELAERFPEAKGILVNSYTALEPNGFK 239
Query: 230 AMLEGQCTPGETSPPLYCIGPVV---GRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGS 286
+C + P +Y IGP++ R N ++ RDR ++WLD +P SV+FLCFGS
Sbjct: 240 YF--DRCP--DNYPTIYPIGPILCSNDRPNLDSSERDR--IITWLDDQPESSVVFLCFGS 293
Query: 287 LGSFSSKQLKEMAIGLERSGVKFLWVVRAPAPDSIENRSSLESLLPEGFLDRTKDRGLVV 346
L + S+ Q+ E+A LE KF+W R + E S E+L P GF+DR D+G+V
Sbjct: 294 LKNLSATQINEIAQALEIVDCKFIWSFRT---NPKEYASPYEAL-PHGFMDRVMDQGIVC 349
Query: 347 ESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVVEEMKVG 406
WAPQVE+L H++VGGFV+HCGWNS+LE + GVP+ WP+YAEQ++ +V+E+ +
Sbjct: 350 -GWAPQVEILAHKAVGGFVSHCGWNSILESLGFGVPIATWPMYAEQQLNAFTMVKELGLA 408
Query: 407 LAVTR---SEEKDRLVSAAELEQRVSELMDSEK--GRAVKERAVAMKEAAAAAMRDGGSS 461
L + SE+ D +V A E+ V LMD VKE A A KEA DGGSS
Sbjct: 409 LEMRLDYVSEDGD-IVKADEIAGTVRSLMDGVDVPKSKVKEIAEAGKEAV-----DGGSS 462
Query: 462 RVALDNLVESFKRG 475
+A+ + G
Sbjct: 463 FLAVKRFIGDLIDG 476
>sp|Q9LSY6|U71B6_ARATH UDP-glycosyltransferase 71B6 OS=Arabidopsis thaliana GN=UGT71B6
PE=1 SV=1
Length = 479
Score = 214 bits (545), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 151/486 (31%), Positives = 243/486 (50%), Gaps = 36/486 (7%)
Query: 1 MKDTIVFYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVSSAGTDDYIASVSAT 60
MK +VF SP HL + VE+ + ++ + SI +II + SS T S T
Sbjct: 1 MKIELVFIPSPAISHLMATVEMAEQLVDKNDNLSITVIIIS---FSSKNTS---MITSLT 54
Query: 61 APSVTFHQLPPPVSGLPDTLRSPADFPALVYELGELNNPKLHETLITISKRSNLKAFVID 120
+ + +++ P L++ + L KL ++ T+ L FV+D
Sbjct: 55 SNNRLRYEIISGGDQQPTELKATDSHIQSLKPLVRDAVAKLVDS--TLPDAPRLAGFVVD 112
Query: 121 FFCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTT---KSFRELGSTLLNFP 177
+C V++ +P+Y ++T+ L L++ ++ S E L P
Sbjct: 113 MYCTSMIDVANE-FGVPSYLFYTSNAGFLGLLLHIQFMYDAEDIYDMSELEDSDVELVVP 171
Query: 178 GF-PPFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGVIVNTFELLEERAIKAMLEGQC 236
P+P + + +E + V + ++ G++VNT LE +A+ + G
Sbjct: 172 SLTSPYPLKCLPYIFKSKEWLTF--FVTQARRFRETKGILVNTVPDLEPQALTFLSNGNI 229
Query: 237 TPGETSPPLYCIGPVV--GRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQ 294
P Y +GP++ N + + + E L WLD +P RSV+FLCFGS+G FS +Q
Sbjct: 230 ------PRAYPVGPLLHLKNVNCDYVDKKQSEILRWLDEQPPRSVVFLCFGSMGGFSEEQ 283
Query: 295 LKEMAIGLERSGVKFLWVVRAPAPDSIEN----RSSLESLLPEGFLDRTKDRGLVVESWA 350
++E A+ L+RSG +FLW +R +P+ + ++LE +LPEGF DRT +RG V+ WA
Sbjct: 284 VRETALALDRSGHRFLWSLRRASPNILREPPGEFTNLEEILPEGFFDRTANRGKVI-GWA 342
Query: 351 PQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVVEEMKVGLAVT 410
QV +L ++GGFV+H GWNS LE + GVPM WPLYAEQK +VEE+ + + +
Sbjct: 343 EQVAILAKPAIGGFVSHGGWNSTLESLWFGVPMAIWPLYAEQKFNAFEMVEELGLAVEIK 402
Query: 411 RSEEKDRL------VSAAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSSRVA 464
+ D L V+A E+E+ + LM E+ V++R + E A+ DGGSS A
Sbjct: 403 KHWRGDLLLGRSEIVTAEEIEKGIICLM--EQDSDVRKRVNEISEKCHVALMDGGSSETA 460
Query: 465 LDNLVE 470
L ++
Sbjct: 461 LKRFIQ 466
>sp|Q9ZWJ3|U85A2_ARATH UDP-glycosyltransferase 85A2 OS=Arabidopsis thaliana GN=UGT85A2
PE=2 SV=1
Length = 481
Score = 197 bits (500), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 145/497 (29%), Positives = 239/497 (48%), Gaps = 59/497 (11%)
Query: 2 KDTIVFYTSPGRGHLNSMVELGKLILT--YHPCFSIDIIIPTAPFVSSAGTDDYIASVSA 59
K +V P +GH+N M+++ KL+ +H F ++ + + S G +
Sbjct: 8 KQHVVCVPYPAQGHINPMMKVAKLLYAKGFHITF-VNTVYNHNRLLRSRGPN-----AVD 61
Query: 60 TAPSVTFHQLPPPVSGLPDT-LRSPADFPALVYELGELNNPKLHETLITISKRSNLK--A 116
PS F +P GLP+T + D P L + E L I+ R ++ +
Sbjct: 62 GLPSFRFESIP---DGLPETDVDVTQDIPTLCESTMKHCLAPFKELLRQINARDDVPPVS 118
Query: 117 FVIDFFCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLY--------LPTLHKNTTKSFRE 168
++ C ++ L +P ++TT+ A LY L + + +
Sbjct: 119 CIVSDGCMSFTLDAAEELGVPEVLFWTTSACGFLAYLYYYRFIEKGLSPIKDESYLTKEH 178
Query: 169 LGSTLLNFPGFPPFPARDMA--LPMHDREGKVYKGLVDTGIQMAKSAGVIVNTFELLEER 226
L + + P +D+ + + + + ++ + +++ +I+NTF+ LE
Sbjct: 179 LDTKIDWIPSMKNLRLKDIPSFIRTTNPDDIMLNFIIREADRAKRASAIILNTFDDLEHD 238
Query: 227 AIKAMLEGQCTPGETSPPLYCIGPV-------------VGRGNGENRGRDRHECLSWLDS 273
I++M PP+Y IGP+ +GR G N R+ ECL WL++
Sbjct: 239 VIQSM-------KSIVPPVYSIGPLHLLEKQESGEYSEIGR-TGSNLWREETECLDWLNT 290
Query: 274 KPSRSVLFLCFGSLGSFSSKQLKEMAIGLERSGVKFLWVVRAPAPDSIENRSSLESLLPE 333
K SV+++ FGS+ S+KQL E A GL +G +FLWV+R PD + + E+++P
Sbjct: 291 KARNSVVYVNFGSITVLSAKQLVEFAWGLAATGKEFLWVIR---PDLV---AGDEAMVPP 344
Query: 334 GFLDRTKDRGLVVESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQK 393
FL T DR ++ SW PQ +VL+H ++GGF+THCGWNS LE +C GVPM+ WP +AEQ+
Sbjct: 345 EFLTATADRRMLA-SWCPQEKVLSHPAIGGFLTHCGWNSTLESLCGGVPMVCWPFFAEQQ 403
Query: 394 MIRAVVVEEMKVGLAVTRSEEKDRLVSAAELEQRVSELMDSEKGRAVKERAVAMKE-AAA 452
+E +VG+ + V E+E V ELMD EKG+ ++E+A + A
Sbjct: 404 TNCKFSRDEWEVGIEIGGD------VKREEVEAVVRELMDEEKGKNMREKAEEWRRLANE 457
Query: 453 AAMRDGGSSRVALDNLV 469
A GSS++ + LV
Sbjct: 458 ATEHKHGSSKLNFEMLV 474
>sp|Q9SK82|U85A1_ARATH UDP-glycosyltransferase 85A1 OS=Arabidopsis thaliana GN=UGT85A1
PE=1 SV=1
Length = 489
Score = 195 bits (495), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 150/505 (29%), Positives = 239/505 (47%), Gaps = 62/505 (12%)
Query: 2 KDTIVFYTSPGRGHLNSMVELGKLILT--YHPCFSIDIIIPTAPFVSSAGTDDYIASVSA 59
K +V P +GH+N M+ + KL+ ++ F ++ + F+ S G++
Sbjct: 11 KPHVVCVPYPAQGHINPMMRVAKLLHARGFYVTF-VNTVYNHNRFLRSRGSNALDG---- 65
Query: 60 TAPSVTFHQLPPPVSGLPDT-LRSPADFPALVYELGELNNPKLHETLITISKRSNLK--A 116
PS F + GLP+T + + D AL + E L I+ N+ +
Sbjct: 66 -LPSFRFESI---ADGLPETDMDATQDITALCESTMKNCLAPFRELLQRINAGDNVPPVS 121
Query: 117 FVIDFFCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLY--------LPTLHKNTTKSFRE 168
++ C + L +P ++TT+G A L+ L L + +
Sbjct: 122 CIVSDGCMSFTLDVAEELGVPEVLFWTTSGCAFLAYLHFYLFIEKGLCPLKDESYLTKEY 181
Query: 169 LGSTLLNF-PGFPPFPARDMALPMHDREGKVYKGLVDTGI----QMAKSAGVIVNTFELL 223
L T+++F P +D +P R ++ + + +++ +I+NTF+ L
Sbjct: 182 LEDTVIDFIPTMKNVKLKD--IPSFIRTTNPDDVMISFALRETERAKRASAIILNTFDDL 239
Query: 224 EERAIKAMLEGQCTPGETSPPLYCIGPVVGRGNGE------------NRGRDRHECLSWL 271
E + AM PP+Y +GP+ N E N ++ ECL WL
Sbjct: 240 EHDVVHAM-------QSILPPVYSVGPLHLLANREIEEGSEIGMMSSNLWKEEMECLDWL 292
Query: 272 DSKPSRSVLFLCFGSLGSFSSKQLKEMAIGLERSGVKFLWVVRAPAPDSIENRSSLESLL 331
D+K SV+++ FGS+ S KQL E A GL SG +FLWV+R PD + E+++
Sbjct: 293 DTKTQNSVIYINFGSITVLSVKQLVEFAWGLAGSGKEFLWVIR---PDLVAGE---EAMV 346
Query: 332 PEGFLDRTKDRGLVVESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAE 391
P FL TKDR ++ SW PQ +VL+H ++GGF+THCGWNS+LE + GVPM+ WP +A+
Sbjct: 347 PPDFLMETKDRSMLA-SWCPQEKVLSHPAIGGFLTHCGWNSILESLSCGVPMVCWPFFAD 405
Query: 392 QKMIRAVVVEEMKVGLAVTRSEEKDRLVSAAELEQRVSELMDSEKGRAVKERAVAMKEAA 451
Q+M +E VG+ + V E+E V ELMD EKG+ ++E+AV + A
Sbjct: 406 QQMNCKFCCDEWDVGIEIGGD------VKREEVEAVVRELMDGEKGKKMREKAVEWQRLA 459
Query: 452 AAAMRDG-GSSRVALDNLVESFKRG 475
A GSS + + +V F G
Sbjct: 460 EKATEHKLGSSVMNFETVVSKFLLG 484
>sp|Q9ZVX4|U90A1_ARATH UDP-glycosyltransferase 90A1 OS=Arabidopsis thaliana GN=UGT90A1
PE=2 SV=1
Length = 478
Score = 194 bits (492), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 143/488 (29%), Positives = 240/488 (49%), Gaps = 54/488 (11%)
Query: 5 IVFYTSPGRGHLNSMVELGKLILTYH---PCFSIDIII--PTAPFVSSAGTDDYIASVSA 59
+V + +GH+ +++ G+L+L +H P ++ + PF+S D+++
Sbjct: 10 VVLFPFMSKGHIIPLLQFGRLLLRHHRKEPTITVTVFTTPKNQPFIS-----DFLSD--- 61
Query: 60 TAPSVTFHQLPPP--VSGLPDTLRSPADFPAL-----VYELGELNNPKLHETLITISKRS 112
P + LP P ++G+P + + P++ +L P ETL T+ K S
Sbjct: 62 -TPEIKVISLPFPENITGIPPGVENTEKLPSMSLFVPFTRATKLLQPFFEETLKTLPKVS 120
Query: 113 NLKAFVIDFFCNPAFQVSSSTLSIPTY--YYFTTAGSVLAANLYLPTLHKNTTKSFRELG 170
+ V D F + S++ +IP + Y + + ++ +++ H+ T+ +
Sbjct: 121 FM---VSDGFLWWTSE-SAAKFNIPRFVSYGMNSYSAAVSISVFK---HELFTEPESKSD 173
Query: 171 STLLNFPGFPPFPAR----DMALPMHDREGKVYKGLVDTGIQMAKSAGVIVNTFELLEER 226
+ + P FP + D + G + +D S G +VN+F LE
Sbjct: 174 TEPVTVPDFPWIKVKKCDFDHGTTEPEESGAALELSMDQIKSTTTSHGFLVNSFYELE-- 231
Query: 227 AIKAMLEGQCTPGETSPPLYCIGPVVGRGNGENRGRDRHECLSWLDSK--PSRSVLFLCF 284
A ++ G+ P +C+GP+ + +G + + WLD K R VL++ F
Sbjct: 232 --SAFVDYNNNSGD-KPKSWCVGPLC-LTDPPKQGSAKPAWIHWLDQKREEGRPVLYVAF 287
Query: 285 GSLGSFSSKQLKEMAIGLERSGVKFLWVVRAPAPDSIENRSSLESLLPEGFLDRTKDRGL 344
G+ S+KQL E+A GLE S V FLWV R +E ++ EGF DR ++ G+
Sbjct: 288 GTQAEISNKQLMELAFGLEDSKVNFLWVTR----------KDVEEIIGEGFNDRIRESGM 337
Query: 345 VVESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVVEEMK 404
+V W Q E+L+HESV GF++HCGWNS E +C GVP+LAWP+ AEQ + +VVEE+K
Sbjct: 338 IVRDWVDQWEILSHESVKGFLSHCGWNSAQESICVGVPLLAWPMMAEQPLNAKMVVEEIK 397
Query: 405 VGLAV-TRSEEKDRLVSAAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDG-GSSR 462
VG+ V T V+ EL ++ ELM+ E G+ ++ + A AA+ +G GSS
Sbjct: 398 VGVRVETEDGSVKGFVTREELSGKIKELMEGETGKTARKNVKEYSKMAKAALVEGTGSSW 457
Query: 463 VALDNLVE 470
LD +++
Sbjct: 458 KNLDMILK 465
>sp|Q9ZQG4|U73B5_ARATH UDP-glycosyltransferase 73B5 OS=Arabidopsis thaliana GN=UGT73B5
PE=2 SV=1
Length = 484
Score = 193 bits (491), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 136/380 (35%), Positives = 198/380 (52%), Gaps = 31/380 (8%)
Query: 103 ETLITISKRSNLKAFVIDFFCNPAFQVSSSTLSIPTYYYFTTAGSVL--AANLYLPTLHK 160
E+ I +K S A V D F P S+ L +P + T+ L + N+ + HK
Sbjct: 118 ESFIETTKPS---ALVADMFF-PWATESAEKLGVPRLVFHGTSFFSLCCSYNMRIHKPHK 173
Query: 161 NTTKSFRELGSTLLNFPGFPP--FPARDMALPMHDREGKVYKGLVDTGIQMAKSAGVIVN 218
S ST PG P D A + E + K + + S GV+VN
Sbjct: 174 KVATS-----STPFVIPGLPGDIVITEDQA-NVAKEETPMGKFMKEVRESETNSFGVLVN 227
Query: 219 TFELLEE------RAIKAMLEGQCTPGETSPPLYCIGPVVGRGNGENRGRDRHECLSWLD 272
+F LE R+ A P S +G RG N D ECL WLD
Sbjct: 228 SFYELESAYADFYRSFVAKRAWHIGPLSLSN--RELGEKARRGKKANI--DEQECLKWLD 283
Query: 273 SKPSRSVLFLCFGSLGSFSSKQLKEMAIGLERSGVKFLWVVRAPAPDSIENRSSLESLLP 332
SK SV++L FGS +F++ QL E+A GLE SG F+WVVR EN+ E LP
Sbjct: 284 SKTPGSVVYLSFGSGTNFTNDQLLEIAFGLEGSGQSFIWVVRKN-----ENQGDNEEWLP 338
Query: 333 EGFLDRTKDRGLVVESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQ 392
EGF +RT +GL++ WAPQV +L+H+++GGFVTHCGWNS +EG+ AG+PM+ WP+ AEQ
Sbjct: 339 EGFKERTTGKGLIIPGWAPQVLILDHKAIGGFVTHCGWNSAIEGIAAGLPMVTWPMGAEQ 398
Query: 393 KMIRAVVVEEMKVGLAVTRSE--EKDRLVSAAELEQRVSELMDSEKGRAVKERAVAMKEA 450
++ + +++G+ V +E +K +L+S A++E+ V E++ EK + A + E
Sbjct: 399 FYNEKLLTKVLRIGVNVGATELVKKGKLISRAQVEKAVREVIGGEKAEERRLWAKKLGEM 458
Query: 451 AAAAMRDGGSSRVALDNLVE 470
A AA+ +GGSS ++ +E
Sbjct: 459 AKAAVEEGGSSYNDVNKFME 478
>sp|Q7Y232|U73B4_ARATH UDP-glycosyltransferase 73B4 OS=Arabidopsis thaliana GN=UGT73B4
PE=2 SV=1
Length = 484
Score = 192 bits (487), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 135/387 (34%), Positives = 200/387 (51%), Gaps = 42/387 (10%)
Query: 103 ETLITISKRSNLKAFVIDFFCNPAFQVSSSTLSIPTYYYFTTAGSVL--AANLYLPTLHK 160
E+ I +K S A V D F P S+ + +P + T+ L + N+ + HK
Sbjct: 115 ESFIETTKPS---ALVADMFF-PWATESAEKIGVPRLVFHGTSSFALCCSYNMRIHKPHK 170
Query: 161 NTTKSFRELGSTLLNFPGFPP--FPARDMALPMHDRE--GKVYKGLVDTGIQMAKSAGVI 216
S ST PG P D A ++ GK +K + ++ S GV+
Sbjct: 171 KVASS-----STPFVIPGLPGDIVITEDQANVTNEETPFGKFWKEVRES---ETSSFGVL 222
Query: 217 VNTFELLEERAIKAMLEGQCTPGETSPPLYCIGPVV--GRGNGENRGR------DRHECL 268
VN+F LE + + IGP+ RG E GR D ECL
Sbjct: 223 VNSFYELESSY------ADFYRSFVAKKAWHIGPLSLSNRGIAEKAGRGKKANIDEQECL 276
Query: 269 SWLDSKPSRSVLFLCFGSLGSFSSKQLKEMAIGLERSGVKFLWVVRAPAPDSIENRSSL- 327
WLDSK SV++L FGS ++QL E+A GLE SG F+WVV EN+
Sbjct: 277 KWLDSKTPGSVVYLSFGSGTGLPNEQLLEIAFGLEGSGQNFIWVVSKN-----ENQVGTG 331
Query: 328 --ESLLPEGFLDRTKDRGLVVESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLA 385
E LP+GF +R K +GL++ WAPQV +L+H+++GGFVTHCGWNS LEG+ AG+PM+
Sbjct: 332 ENEDWLPKGFEERNKGKGLIIRGWAPQVLILDHKAIGGFVTHCGWNSTLEGIAAGLPMVT 391
Query: 386 WPLYAEQKMIRAVVVEEMKVGLAVTRSE--EKDRLVSAAELEQRVSELMDSEKGRAVKER 443
WP+ AEQ ++ + +++G+ V +E +K +L+S A++E+ V E++ EK + R
Sbjct: 392 WPMGAEQFYNEKLLTKVLRIGVNVGATELVKKGKLISRAQVEKAVREVIGGEKAEERRLR 451
Query: 444 AVAMKEAAAAAMRDGGSSRVALDNLVE 470
A + E A AA+ +GGSS ++ +E
Sbjct: 452 AKELGEMAKAAVEEGGSSYNDVNKFME 478
>sp|Q9LMF1|U85A3_ARATH UDP-glycosyltransferase 85A3 OS=Arabidopsis thaliana GN=UGT85A3
PE=2 SV=2
Length = 488
Score = 191 bits (484), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 149/506 (29%), Positives = 247/506 (48%), Gaps = 61/506 (12%)
Query: 2 KDTIVFYTSPGRGHLNSMVELGKLILT--YHPCFSIDIIIPTAPFVSSAGTDDYIASVSA 59
K +V P +GH+N M+++ KL+ +H F ++ + + S G A+
Sbjct: 11 KPHVVCVPYPAQGHINPMMKVAKLLHVKGFHVTF-VNTVYNHNRLLRSRG-----ANALD 64
Query: 60 TAPSVTFHQLPPPVSGLPDT-LRSPADFPALVYELGE---LNNPKLHETLITISKRSNLK 115
PS F +P GLP+T + + D PAL + + KL + ++T +
Sbjct: 65 GLPSFQFESIP---DGLPETGVDATQDIPALSESTTKNCLVPFKKLLQRIVTREDVPPVS 121
Query: 116 AFVIDFFCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLH-------KNTTKSFRE 168
V D + V+ L +P +++TT+ A L+ K+ + +E
Sbjct: 122 CIVSDGSMSFTLDVAEE-LGVPEIHFWTTSACGFMAYLHFYLFIEKGLCPVKDASCLTKE 180
Query: 169 LGSTLLNF-PGFPPFPARDMA--LPMHDREGKVYKGLVDTGIQMAKSAGVIVNTFELLEE 225
T++++ P +D+ + + + +V + +++ +I+NTF+ LE
Sbjct: 181 YLDTVIDWIPSMNNVKLKDIPSFIRTTNPNDIMLNFVVREACRTKRASAIILNTFDDLEH 240
Query: 226 RAIKAMLEGQCTPGETSPPLYCIGPV-------------VGRGNGENRGRDRHECLSWLD 272
I++M PP+Y IGP+ +GR G N ++ ECL WL+
Sbjct: 241 DIIQSM-------QSILPPVYPIGPLHLLVNREIEEDSEIGR-MGSNLWKEETECLGWLN 292
Query: 273 SKPSRSVLFLCFGSLGSFSSKQLKEMAIGLERSGVKFLWVVRAPAPDSIENRSSLESLLP 332
+K SV+++ FGS+ ++ QL E A GL +G +FLWV+R PDS+ E+++P
Sbjct: 293 TKSRNSVVYVNFGSITIMTTAQLLEFAWGLAATGKEFLWVMR---PDSVAGE---EAVIP 346
Query: 333 EGFLDRTKDRGLVVESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQ 392
+ FL T DR ++ SW PQ +VL+H +VGGF+THCGWNS LE + GVPM+ WP +AEQ
Sbjct: 347 KEFLAETADRRMLT-SWCPQEKVLSHPAVGGFLTHCGWNSTLESLSCGVPMVCWPFFAEQ 405
Query: 393 KMIRAVVVEEMKVGLAVTRSEEKDRLVSAAELEQRVSELMDSEKGRAVKERAVAMKEAAA 452
+ +E +VG+ + V E+E V ELMD EKG+ ++E+AV + A
Sbjct: 406 QTNCKFSCDEWEVGIEIGGD------VKRGEVEAVVRELMDGEKGKKMREKAVEWRRLAE 459
Query: 453 AAMR-DGGSSRVALDNLVESFKRGCI 477
A + GSS + + +V G I
Sbjct: 460 KATKLPCGSSVINFETIVNKVLLGKI 485
>sp|Q9ZQ96|U73C3_ARATH UDP-glycosyltransferase 73C3 OS=Arabidopsis thaliana GN=UGT73C3
PE=2 SV=1
Length = 496
Score = 189 bits (480), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 144/496 (29%), Positives = 237/496 (47%), Gaps = 52/496 (10%)
Query: 6 VFYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVSSAGTDDYIASV-SATAPSV 64
V + +GH+ M+++ +L+ + I I T P ++ + ++ S A ++
Sbjct: 16 VLFPFMAQGHMIPMIDIARLLAQR----GVTITIVTTPHNAARFKNVLNRAIESGLAINI 71
Query: 65 TFHQLPPPVSGLP------DTLRSPADFPALVYELGELNNPKLHETLITISKRSNLKAFV 118
+ P GLP D+L S + L +P + + + + +
Sbjct: 72 LHVKFPYQEFGLPEGKENIDSLDSTELMVPFFKAVNLLEDP----VMKLMEEMKPRPSCL 127
Query: 119 IDFFCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSTLLNFPG 178
I +C P + + +IP + NL + + + + S F
Sbjct: 128 ISDWCLPYTSIIAKNFNIPKIVFHGMG----CFNLLCMHVLRRNLEILENVKSDEEYFL- 182
Query: 179 FPPFPAR----DMALPMHDREGKVYKGLVDTGIQMA-KSAGVIVNTFELLEERAIKAMLE 233
P FP R + LP+ +K ++D ++ S GVIVNTF+ LE +K E
Sbjct: 183 VPSFPDRVEFTKLQLPVKANASGDWKEIMDEMVKAEYTSYGVIVNTFQELEPPYVKDYKE 242
Query: 234 GQCTPGETSPPLYCIGPV-----VGRGNGENRGR---DRHECLSWLDSKPSRSVLFLCFG 285
++ IGPV G E + D+ ECL WLDSK SVL++C G
Sbjct: 243 A------MDGKVWSIGPVSLCNKAGADKAERGSKAAIDQDECLQWLDSKEEGSVLYVCLG 296
Query: 286 SLGSFSSKQLKEMAIGLERSGVKFLWVVRAPAPDSIENRSSLESLLPEGFLDRTKDRGLV 345
S+ + QLKE+ +GLE S F+WV+R S + + E +L GF +R K+RGL+
Sbjct: 297 SICNLPLSQLKELGLGLEESRRSFIWVIRG----SEKYKELFEWMLESGFEERIKERGLL 352
Query: 346 VESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVVEEMKV 405
++ WAPQV +L+H SVGGF+THCGWNS LEG+ +G+P++ WPL+ +Q + +VV+ +K
Sbjct: 353 IKGWAPQVLILSHPSVGGFLTHCGWNSTLEGITSGIPLITWPLFGDQFCNQKLVVQVLKA 412
Query: 406 GLA-----VTRSEEKDR---LVSAAELEQRVSELM-DSEKGRAVKERAVAMKEAAAAAMR 456
G++ V + E+D+ LV +++ V ELM DS+ + + R + E A A+
Sbjct: 413 GVSAGVEEVMKWGEEDKIGVLVDKEGVKKAVEELMGDSDDAKERRRRVKELGELAHKAVE 472
Query: 457 DGGSSRVALDNLVESF 472
GGSS + L++
Sbjct: 473 KGGSSHSNITLLLQDI 488
>sp|Q9SCP5|U73C7_ARATH UDP-glycosyltransferase 73C7 OS=Arabidopsis thaliana GN=UGT73C7
PE=2 SV=1
Length = 490
Score = 187 bits (476), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 109/281 (38%), Positives = 160/281 (56%), Gaps = 33/281 (11%)
Query: 212 SAGVIVNTFELLEERAIKAMLEGQCTPGETSPPLYCIGPV----------VGRGNGENRG 261
S GVIVNTFE LE + + + + ++C+GPV RG+ + G
Sbjct: 215 SYGVIVNTFEELEVDYAREYRKAR------AGKVWCVGPVSLCNRLGLDKAKRGDKASIG 268
Query: 262 RDRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEMAIGLERSGVKFLWVVRAPAPDSI 321
+D +CL WLDS+ + SVL++C GSL + QLKE+ +GLE S F+WV+R
Sbjct: 269 QD--QCLQWLDSQETGSVLYVCLGSLCNLPLAQLKELGLGLEASNKPFIWVIREWG---- 322
Query: 322 ENRSSLESLLPE-GFLDRTKDRGLVVESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAG 380
L + + + GF +R KDRGLV++ WAPQV +L+H S+GGF+THCGWNS LEG+ AG
Sbjct: 323 -KYGDLANWMQQSGFEERIKDRGLVIKGWAPQVFILSHASIGGFLTHCGWNSTLEGITAG 381
Query: 381 VPMLAWPLYAEQKMIRAVVVEEMKVGLAV--------TRSEEKDRLVSAAELEQRVSELM 432
VP+L WPL+AEQ + +VV+ +K GL + + EE +VS + + V ELM
Sbjct: 382 VPLLTWPLFAEQFLNEKLVVQILKAGLKIGVEKLMKYGKEEEIGAMVSRECVRKAVDELM 441
Query: 433 -DSEKGRAVKERAVAMKEAAAAAMRDGGSSRVALDNLVESF 472
DSE+ + + + + A A+ GGSS + L++
Sbjct: 442 GDSEEAEERRRKVTELSDLANKALEKGGSSDSNITLLIQDI 482
>sp|Q9SY84|U90A2_ARATH UDP-glycosyltransferase 90A2 OS=Arabidopsis thaliana GN=UGT90A2
PE=2 SV=1
Length = 467
Score = 186 bits (472), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 142/484 (29%), Positives = 236/484 (48%), Gaps = 45/484 (9%)
Query: 5 IVFYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVSSAGTDDYIASVSATAPSV 64
+V + +GH+ M++L +L+L++ I + + T P D S+S T ++
Sbjct: 8 VVLFPYLSKGHMIPMLQLARLLLSHSFAGDISVTVFTTPLNRPFIVD----SLSGTKATI 63
Query: 65 TFHQLPPPVSGLPDTLRSPADFPALVYELGELNNPKLHETLITISKRSNLKAFVIDFFCN 124
P V +P + PAL L + + +++ +R + + F +
Sbjct: 64 VDVPFPDNVPEIPPGVECTDKLPALSSSLF-VPFTRATKSMQADFERELMSLPRVSFMVS 122
Query: 125 PAF----QVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSTLLNFPGFP 180
F Q S+ L P +F G A+ + ++ +N S + + ++ P FP
Sbjct: 123 DGFLWWTQESARKLGFPRLVFF---GMNCASTVICDSVFQNQLLSNVKSETEPVSVPEFP 179
Query: 181 PFPAR--DMALPMHDREGKV---YKGLVDTGIQMAKSAGVIVNTFELLEERAIKAMLEGQ 235
R D M D + +K ++D M +S G+I NTF+ LE I +
Sbjct: 180 WIKVRKCDFVKDMFDPKTTTDPGFKLILDQVTSMNQSQGIIFNTFDDLEPVFIDFYKRKR 239
Query: 236 CTPGETSPPLYCIGPVVGRGN---GENRGRDRHECLSWLDSKPSR--SVLFLCFGSLGSF 290
L+ +GP+ N E + + + WLD K + +VL++ FGS
Sbjct: 240 ------KLKLWAVGPLCYVNNFLDDEVEEKVKPSWMKWLDEKRDKGCNVLYVAFGSQAEI 293
Query: 291 SSKQLKEMAIGLERSGVKFLWVVRAPAPDSIENRSSLESLLPEGFLDRTKDRGLVV-ESW 349
S +QL+E+A+GLE S V FLWVV+ + + +GF +R +RG++V + W
Sbjct: 294 SREQLEEIALGLEESKVNFLWVVKG-------------NEIGKGFEERVGERGMMVRDEW 340
Query: 350 APQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVVEEMKVGLAV 409
Q ++L HESV GF++HCGWNS+ E +C+ VP+LA+PL AEQ + +VVEE++V V
Sbjct: 341 VDQRKILEHESVRGFLSHCGWNSLTESICSEVPILAFPLAAEQPLNAILVVEELRVAERV 400
Query: 410 TRSEEKDRLVSAAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDG-GSSRVALDNL 468
+ E +V E+ ++V ELM+ EKG+ ++ A + A A+ +G GSSR LDNL
Sbjct: 401 VAASEG--VVRREEIAEKVKELMEGEKGKELRRNVEAYGKMAKKALEEGIGSSRKNLDNL 458
Query: 469 VESF 472
+ F
Sbjct: 459 INEF 462
>sp|Q8W491|U73B3_ARATH UDP-glycosyltransferase 73B3 OS=Arabidopsis thaliana GN=UGT73B3
PE=2 SV=1
Length = 481
Score = 182 bits (463), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 111/292 (38%), Positives = 162/292 (55%), Gaps = 24/292 (8%)
Query: 192 HDREGKVYKGLVDTGIQMAKSAGVIVNTFELLEERAIKAMLEGQCTPGETSPPLYCIGP- 250
D E ++ K +++ KS+GVIVN+F LE + IGP
Sbjct: 201 RDEESEMGKFMIEVKESDVKSSGVIVNSFYELEPDY------ADFYKSVVLKRAWHIGPL 254
Query: 251 -VVGRGNGENRGRDRH------ECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEMAIGLE 303
V RG E R + ECL WLDSK SV+++ FGS+ F ++QL E+A GLE
Sbjct: 255 SVYNRGFEEKAERGKKASINEVECLKWLDSKKPDSVIYISFGSVACFKNEQLFEIAAGLE 314
Query: 304 RSGVKFLWVVRAPAPDSIENRSSLESLLPEGFLDRTKDRGLVVESWAPQVEVLNHESVGG 363
SG F+WVVR IE E LPEGF +R K +G+++ WAPQV +L+H++ G
Sbjct: 315 TSGANFIWVVRKNI--GIEK----EEWLPEGFEERVKGKGMIIRGWAPQVLILDHQATCG 368
Query: 364 FVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVVEEMKVGLAVTRSEE---KDRLVS 420
FVTHCGWNS+LEGV AG+PM+ WP+ AEQ +V + ++ G++V + +S
Sbjct: 369 FVTHCGWNSLLEGVAAGLPMVTWPVAAEQFYNEKLVTQVLRTGVSVGAKKNVRTTGDFIS 428
Query: 421 AAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSSRVALDNLVESF 472
++ + V E++ E+ +ERA + E A AA+ +GGSS L++ +E F
Sbjct: 429 REKVVKAVREVLVGEEADERRERAKKLAEMAKAAV-EGGSSFNDLNSFIEEF 479
>sp|Q9ZQ94|U73C5_ARATH UDP-glycosyltransferase 73C5 OS=Arabidopsis thaliana GN=UGT73C5
PE=2 SV=1
Length = 495
Score = 182 bits (463), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 128/380 (33%), Positives = 193/380 (50%), Gaps = 42/380 (11%)
Query: 118 VIDFFCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKN-----TTKSFRELGST 172
+I FC P + +IP + L L + L KN KS +EL T
Sbjct: 125 LISDFCLPYTSKIAKKFNIPKILFHGMGCFCL---LCMHVLRKNREILDNLKSDKEL-FT 180
Query: 173 LLNFPGFPPFPARDMALPMHDREGK---VYKGLVDTGIQMAKSAGVIVNTFELLEERAIK 229
+ +FP F + + + G ++ G+V+ S GVIVN+F+ LE K
Sbjct: 181 VPDFPDRVEFTRTQVPVETYVPAGDWKDIFDGMVEAN---ETSYGVIVNSFQELEPAYAK 237
Query: 230 AMLEGQCTPGETSPPLYCIGPV-----VGRGNGENRGR---DRHECLSWLDSKPSRSVLF 281
E + S + IGPV VG E + D+ ECL WLDSK SVL+
Sbjct: 238 DYKEVR------SGKAWTIGPVSLCNKVGADKAERGNKSDIDQDECLKWLDSKKHGSVLY 291
Query: 282 LCFGSLGSFSSKQLKEMAIGLERSGVKFLWVVRAPAPDSIENRSSLESLLPEGFLDRTKD 341
+C GS+ + QLKE+ +GLE S F+WV+R + + +E GF DR +D
Sbjct: 292 VCLGSICNLPLSQLKELGLGLEESQRPFIWVIRGWE----KYKELVEWFSESGFEDRIQD 347
Query: 342 RGLVVESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVVE 401
RGL+++ W+PQ+ +L+H SVGGF+THCGWNS LEG+ AG+P+L WPL+A+Q +VVE
Sbjct: 348 RGLLIKGWSPQMLILSHPSVGGFLTHCGWNSTLEGITAGLPLLTWPLFADQFCNEKLVVE 407
Query: 402 EMKVG--------LAVTRSEEKDRLVSAAELEQRVSELM-DSEKGRAVKERAVAMKEAAA 452
+K G + E+ LV +++ V ELM +S+ + + RA + ++A
Sbjct: 408 VLKAGVRSGVEQPMKWGEEEKIGVLVDKEGVKKAVEELMGESDDAKERRRRAKELGDSAH 467
Query: 453 AAMRDGGSSRVALDNLVESF 472
A+ +GGSS + L++
Sbjct: 468 KAVEEGGSSHSNISFLLQDI 487
>sp|Q8VZE9|U73B1_ARATH UDP-glycosyltransferase 73B1 OS=Arabidopsis thaliana GN=UGT73B1
PE=2 SV=1
Length = 488
Score = 182 bits (463), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 107/275 (38%), Positives = 162/275 (58%), Gaps = 31/275 (11%)
Query: 212 SAGVIVNTFELLEE---RAIKAMLEGQCTPGETSPPLYCIGPV-VGRGNGEN---RGR-- 262
S GV+VN+F LE+ K+ + + + IGP+ +G E RG+
Sbjct: 221 SFGVLVNSFYELEQAYSDYFKSFVAKRA---------WHIGPLSLGNRKFEEKAERGKKA 271
Query: 263 --DRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEMAIGLERSGVKFLWVVRAPAPDS 320
D HECL WLDSK SV+++ FG++ SF ++QL E+A GL+ SG F+WVV
Sbjct: 272 SIDEHECLKWLDSKKCDSVIYMAFGTMSSFKNEQLIEIAAGLDMSGHDFVWVVNRKG--- 328
Query: 321 IENRSSLESLLPEGFLDRTKDRGLVVESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAG 380
++ E LPEGF ++TK +GL++ WAPQV +L H+++GGF+THCGWNS+LEGV AG
Sbjct: 329 --SQVEKEDWLPEGFEEKTKGKGLIIRGWAPQVLILEHKAIGGFLTHCGWNSLLEGVAAG 386
Query: 381 VPMLAWPLYAEQKMIRAVVVEEMKVGLAVTRSEEKD---RLVSAAELEQRVSELMDSEKG 437
+PM+ WP+ AEQ +V + +K G++V + +S ++E V E+M E+
Sbjct: 387 LPMVTWPVGAEQFYNEKLVTQVLKTGVSVGVKKMMQVVGDFISREKVEGAVREVMVGEER 446
Query: 438 RAVKERAVAMKEAAAAAMRDGGSSRVALDNLVESF 472
R +RA + E A A+++GGSS + +D L+E
Sbjct: 447 R---KRAKELAEMAKNAVKEGGSSDLEVDRLMEEL 478
>sp|Q9ZQ97|U73C4_ARATH UDP-glycosyltransferase 73C4 OS=Arabidopsis thaliana GN=UGT73C4
PE=2 SV=1
Length = 496
Score = 181 bits (458), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 111/318 (34%), Positives = 176/318 (55%), Gaps = 38/318 (11%)
Query: 180 PPFPAR----DMALPMHDREGKVYKGLVDTGIQMA-KSAGVIVNTFELLEERAIKAMLEG 234
P FP R +P+ +K +D ++ S GVIVNTF+ LE +K +
Sbjct: 184 PSFPDRVEFTKPQVPVETTASGDWKAFLDEMVEAEYTSYGVIVNTFQELEPAYVKDYTKA 243
Query: 235 QCTPGETSPPLYCIGPV----------VGRGNGENRGRDRHECLSWLDSKPSRSVLFLCF 284
+ + ++ IGPV RGN D+ ECL WLDSK SVL++C
Sbjct: 244 R------AGKVWSIGPVSLCNKAGADKAERGN--QAAIDQDECLQWLDSKEDGSVLYVCL 295
Query: 285 GSLGSFSSKQLKEMAIGLERSGVKFLWVVRAPAPDSIENRSSL-ESLLPEGFLDRTKDRG 343
GS+ + QLKE+ +GLE+S F+WV+R E + L E ++ GF +R K+RG
Sbjct: 296 GSICNLPLSQLKELGLGLEKSQRSFIWVIRG-----WEKYNELYEWMMESGFEERIKERG 350
Query: 344 LVVESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVVEEM 403
L+++ W+PQV +L+H SVGGF+THCGWNS LEG+ +G+P++ WPL+ +Q + +VV+ +
Sbjct: 351 LLIKGWSPQVLILSHPSVGGFLTHCGWNSTLEGITSGIPLITWPLFGDQFCNQKLVVQVL 410
Query: 404 KVGLA-----VTRSEEKDR---LVSAAELEQRVSELMD-SEKGRAVKERAVAMKEAAAAA 454
K G++ V + E+++ LV +++ V ELM S+ + + R + E+A A
Sbjct: 411 KAGVSAGVEEVMKWGEEEKIGVLVDKEGVKKAVEELMGASDDAKERRRRVKELGESAHKA 470
Query: 455 MRDGGSSRVALDNLVESF 472
+ +GGSS + L++
Sbjct: 471 VEEGGSSHSNITYLLQDI 488
>sp|P56725|ZOX_PHAVU Zeatin O-xylosyltransferase OS=Phaseolus vulgaris GN=ZOX1 PE=2 SV=1
Length = 454
Score = 181 bits (458), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 84/211 (39%), Positives = 134/211 (63%), Gaps = 1/211 (0%)
Query: 265 HECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEMAIGLERSGVKFLWVVR-APAPDSIEN 323
H C+ WLD + SV+++ FG+ + +Q++E+A GLE+S KF+WV+R A D +
Sbjct: 244 HPCMEWLDKQEPSSVIYVSFGTTTALRDEQIQELATGLEQSKQKFIWVLRDADKGDIFDG 303
Query: 324 RSSLESLLPEGFLDRTKDRGLVVESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPM 383
+ LPEGF +R + GLVV WAPQ+E+L+H S GGF++HCGWNS LE + GVPM
Sbjct: 304 SEAKRYELPEGFEERVEGMGLVVRDWAPQMEILSHSSTGGFMSHCGWNSCLESLTRGVPM 363
Query: 384 LAWPLYAEQKMIRAVVVEEMKVGLAVTRSEEKDRLVSAAELEQRVSELMDSEKGRAVKER 443
W ++++Q +V + +KVGL V E++ LVSA+ +E V LM++++G +++R
Sbjct: 364 ATWAMHSDQPRNAVLVTDVLKVGLIVKDWEQRKSLVSASVIENAVRRLMETKEGDEIRKR 423
Query: 444 AVAMKEAAAAAMRDGGSSRVALDNLVESFKR 474
AV +K+ +M +GG SR+ + + + R
Sbjct: 424 AVKLKDEIHRSMDEGGVSRMEMASFIAHISR 454
>sp|O48676|U74B1_ARATH UDP-glycosyltransferase 74B1 OS=Arabidopsis thaliana GN=UGT74B1
PE=1 SV=1
Length = 460
Score = 179 bits (455), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 149/494 (30%), Positives = 232/494 (46%), Gaps = 61/494 (12%)
Query: 1 MKDTIVFYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVSSAGTDDYIASVSAT 60
+K +V P +GHLN MV+ K +++ + + + I T + +S+ T S+S
Sbjct: 8 VKGHVVILPYPVQGHLNPMVQFAKRLVSKN----VKVTIATTTYTASSIT---TPSLSVE 60
Query: 61 APSVTFHQLPPPVSGLPDTLRSPADFPALVYELGELNNPKLHETLITISKRSN--LKAFV 118
S F +P + G S E +LN + LI K ++ + +
Sbjct: 61 PISDGFDFIPIGIPGFSVDTYS---------ESFKLNGSETLTLLIEKFKSTDSPIDCLI 111
Query: 119 IDFFCNPAFQVSSSTLSIPTYYYFT---TAGSVLA--ANLYLPTLHKNTTKSFRELGSTL 173
D F +V+ S + + +FT T SVL +N P + FR G
Sbjct: 112 YDSFLPWGLEVARS-MELSAASFFTNNLTVCSVLRKFSNGDFPLPADPNSAPFRIRGLPS 170
Query: 174 LNFPGFPPFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGVIVNTFELLEERAIKAMLE 233
L++ P F R H G+V L++ + + VN FE LEE
Sbjct: 171 LSYDELPSFVGRHWL--THPEHGRV---LLNQFPNHENADWLFVNGFEGLEETQ------ 219
Query: 234 GQCTPGETSP-PLYCIGPVVGRGNGENRGRD------------RHECLSWLDSKPSRSVL 280
C GE+ IGP++ ++R D EC+ WL++K ++SV
Sbjct: 220 -DCENGESDAMKATLIGPMIPSAYLDDRMEDDKDYGASLLKPISKECMEWLETKQAQSVA 278
Query: 281 FLCFGSLGSFSSKQLKEMAIGLERSGVKFLWVVRAPAPDSIENRSSLESLLPEGFLDRTK 340
F+ FGS G KQL E+AI L+ S + FLWV++ + + LPEGF++ TK
Sbjct: 279 FVSFGSFGILFEKQLAEVAIALQESDLNFLWVIK----------EAHIAKLPEGFVESTK 328
Query: 341 DRGLVVESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVV 400
DR L+V SW Q+EVL HES+G F+THCGWNS LEG+ GVPM+ P +++Q V
Sbjct: 329 DRALLV-SWCNQLEVLAHESIGCFLTHCGWNSTLEGLSLGVPMVGVPQWSDQMNDAKFVE 387
Query: 401 EEMKVGLAVTRSEEKDRLVSAAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGS 460
E KVG + E + +V + EL + + +M+ E ++E + K+ A AM +GGS
Sbjct: 388 EVWKVGYR-AKEEAGEVIVKSEELVRCLKGVMEGESSVKIRESSKKWKDLAVKAMSEGGS 446
Query: 461 SRVALDNLVESFKR 474
S +++ +ES +
Sbjct: 447 SDRSINEFIESLGK 460
>sp|Q9ZSK5|ZOG_PHALU Zeatin O-glucosyltransferase OS=Phaseolus lunatus GN=ZOG1 PE=2 SV=1
Length = 459
Score = 179 bits (453), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 82/207 (39%), Positives = 133/207 (64%), Gaps = 1/207 (0%)
Query: 264 RHECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEMAIGLERSGVKFLWVVR-APAPDSIE 322
RH C+ WLD + SV+++ FG+ + +Q++++A GLE+S KF+WV+R A D
Sbjct: 248 RHPCMEWLDKQEPSSVIYISFGTTTALRDEQIQQIATGLEQSKQKFIWVLREADKGDIFA 307
Query: 323 NRSSLESLLPEGFLDRTKDRGLVVESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVP 382
+ LP+GF +R + GLVV WAPQ+E+L+H S GGF++HCGWNS LE + GVP
Sbjct: 308 GSEAKRYELPKGFEERVEGMGLVVRDWAPQLEILSHSSTGGFMSHCGWNSCLESITMGVP 367
Query: 383 MLAWPLYAEQKMIRAVVVEEMKVGLAVTRSEEKDRLVSAAELEQRVSELMDSEKGRAVKE 442
+ WP++++Q +V E +KVGL V +++ LVSA+ +E V LM++++G +++
Sbjct: 368 IATWPMHSDQPRNAVLVTEVLKVGLVVKDWAQRNSLVSASVVENGVRRLMETKEGDEMRQ 427
Query: 443 RAVAMKEAAAAAMRDGGSSRVALDNLV 469
RAV +K A +M +GG S + + + +
Sbjct: 428 RAVRLKNAIHRSMDEGGVSHMEMGSFI 454
>sp|Q2V6J9|UFOG7_FRAAN UDP-glucose flavonoid 3-O-glucosyltransferase 7 OS=Fragaria
ananassa GN=GT7 PE=1 SV=1
Length = 487
Score = 179 bits (453), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 150/483 (31%), Positives = 214/483 (44%), Gaps = 39/483 (8%)
Query: 5 IVFYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVSSAGTDDYIASVSATAPSV 64
I F RGH + ++ KL ++ +I AP S A I V PS
Sbjct: 13 IFFLPFMARGHSIPLTDIAKLFSSHGARCTIVTTPLNAPLFSKATQRGEIELVLIKFPSA 72
Query: 65 TFHQLPPPVSGLPDTLRSPADFPALVYELGELNNPKL----HETLITISKRSNLKAFVID 120
+GLP S AD LG+ H I R + V D
Sbjct: 73 E--------AGLPQDCES-ADLITTQDMLGKFVKATFLIEPHFEKILDEHRPH--CLVAD 121
Query: 121 FFCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSTLLNFPGFP 180
F A V++ IP Y+ T L A+L + ++ S + N P
Sbjct: 122 AFFTWATDVAAK-FRIPRLYFHGTGFFALCASLSVMMYQPHSNLSSDSESFVIPNLPD-- 178
Query: 181 PFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGVIVNTFELLE----ERAIKAMLEGQC 236
LP+ E + K L + +S GVIVN+F LE K
Sbjct: 179 EIKMTRSQLPVFPDESEFMKMLKASIEIEERSYGVIVNSFYELEPAYANHYRKVFGRKAW 238
Query: 237 TPGETSPPLYCIGPVVGRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQLK 296
G S I RG+ ++ ++HECL WLDSK RSV+++ FGS+ F+ QL
Sbjct: 239 HIGPVSFCNKAIEDKAERGSIKSSTAEKHECLKWLDSKKPRSVVYVSFGSMVRFADSQLL 298
Query: 297 EMAIGLERSGVKFLWVVRAPAPDSIENRSSLESLLPEGFLDRTKDRGLVVESWAPQVEVL 356
E+A GLE SG F+WVV+ +E LPEGF R + +GL++ WAPQV +L
Sbjct: 299 EIATGLEASGQDFIWVVKKEK-------KEVEEWLPEGFEKRMEGKGLIIRDWAPQVLIL 351
Query: 357 NHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVVEEMKVGLAVTRSE--- 413
HE++G FVTHCGWNS+LE V AGVPM+ WP++ EQ +V E ++G+ V +
Sbjct: 352 EHEAIGAFVTHCGWNSILEAVSAGVPMITWPVFGEQFYNEKLVTEIHRIGVPVGSEKWAL 411
Query: 414 -------EKDRLVSAAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSSRVALD 466
E + V +E+ V+ +M ++ + R + E A A+ +GGSS + L
Sbjct: 412 SFVDVNAETEGRVRREAIEEAVTRIMVGDEAVETRSRVKELGENARRAVEEGGSSFLDLS 471
Query: 467 NLV 469
LV
Sbjct: 472 ALV 474
>sp|Q9C9B0|U89B1_ARATH UDP-glycosyltransferase 89B1 OS=Arabidopsis thaliana GN=UGT89B1
PE=2 SV=2
Length = 473
Score = 178 bits (452), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 145/490 (29%), Positives = 243/490 (49%), Gaps = 58/490 (11%)
Query: 2 KDTIVFYTSPGRGHLNSMVEL-GKLILTYHPCFSIDIII--PTAPFVSSAGTDDYIASVS 58
K ++ + P +GH+ +++ +L L I +++ PF+S +++V
Sbjct: 12 KTHVLIFPFPAQGHMIPLLDFTHRLALRGGAALKITVLVTPKNLPFLSP-----LLSAVV 66
Query: 59 ATAPSVT-FHQLPPPVSGLPDTLR-SPADFPALVYELGELNNPKLHETLITISKRSNLKA 116
P + F P SG+ + P+ FP +++ LG L+ P + + IT S S A
Sbjct: 67 NIEPLILPFPSHPSIPSGVENVQDLPPSGFPLMIHALGNLHAPLI--SWIT-SHPSPPVA 123
Query: 117 FVIDFFCNPAFQVSSSTLSIPTYYYFTTAG--SVLAANLYLPTLHKNTTKSFRELGSTLL 174
V DFF + L IP + + +A + L++ + TK + + +L
Sbjct: 124 IVSDFFLG-----WTKNLGIPRFDFSPSAAITCCILNTLWI----EMPTKINEDDDNEIL 174
Query: 175 NFPGFPPFPARDMALPMHDREGKVYKGLV----------DTGIQMAKSAGVIVNTFELLE 224
+FP P P D+ +Y+ V D+ S G++VN+F +E
Sbjct: 175 HFPKIPNCPKYRF-----DQISSLYRSYVHGDPAWEFIRDSFRDNVASWGLVVNSFTAME 229
Query: 225 ERAIKAMLEGQCTPGETSPPLYCIGPVVGRGNGENRGR----DRHECLSWLDSKPSRSVL 280
++ + G ++ +GP++ +G+NRG +SWLD++ V+
Sbjct: 230 GVYLEHLKREM---GHDR--VWAVGPIIPL-SGDNRGGPTSVSVDHVMSWLDAREDNHVV 283
Query: 281 FLCFGSLGSFSSKQLKEMAIGLERSGVKFLWVVRAPAPDSIENRSSLESLLPEGFLDRTK 340
++CFGS + +Q +A GLE+SGV F+W V+ P +E S+ ++L +GF DR
Sbjct: 284 YVCFGSQVVLTKEQTLALASGLEKSGVHFIWAVKEP----VEKDSTRGNIL-DGFDDRVA 338
Query: 341 DRGLVVESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIRAVVV 400
RGLV+ WAPQV VL H +VG F+THCGWNSV+E V AGV ML WP+ A+Q ++VV
Sbjct: 339 GRGLVIRGWAPQVAVLRHRAVGAFLTHCGWNSVVEAVVAGVLMLTWPMRADQYTDASLVV 398
Query: 401 EEMKVGLAVTRSEEKDRLVSAAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGS 460
+E+KVG V E D + EL + ++ + + +K AV +++AA A+++ GS
Sbjct: 399 DELKVG--VRACEGPDTVPDPDELARVFADSVTGNQTERIK--AVELRKAALDAIQERGS 454
Query: 461 SRVALDNLVE 470
S LD ++
Sbjct: 455 SVNDLDGFIQ 464
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.319 0.135 0.401
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 179,378,141
Number of Sequences: 539616
Number of extensions: 7703441
Number of successful extensions: 21418
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 242
Number of HSP's successfully gapped in prelim test: 17
Number of HSP's that attempted gapping in prelim test: 20664
Number of HSP's gapped (non-prelim): 309
length of query: 481
length of database: 191,569,459
effective HSP length: 121
effective length of query: 360
effective length of database: 126,275,923
effective search space: 45459332280
effective search space used: 45459332280
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 63 (28.9 bits)
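
The footer figures above are related by simple arithmetic, and the sketch below reproduces them from the values printed in this report. It assumes the usual Karlin-Altschul relations (bit score = (Lambda * S - ln K) / ln 2, Expect = search space * 2^-bits) and assumes that the second Lambda/K line (0.267, 0.0410) holds the gapped parameters. The function names are illustrative and are not part of BLAST. Because Lambda and K are printed to only a few digits and the compositional matrix adjustment rescales scores per subject, the results should be read as approximations of the reported values rather than exact reproductions.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LEN = 481
DB_LETTERS = 191_569_459
DB_SEQUENCES = 539_616
EFFECTIVE_HSP_LEN = 121
GAPPED_LAMBDA = 0.267   # assumed: second "Lambda K H" line is the gapped set
GAPPED_K = 0.0410


def effective_lengths():
    """Return (effective query length, effective db length, search space)."""
    eff_query = QUERY_LEN - EFFECTIVE_HSP_LEN
    # One effective HSP length is subtracted per database sequence.
    eff_db = DB_LETTERS - DB_SEQUENCES * EFFECTIVE_HSP_LEN
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score):
    """Convert a raw alignment score to bits (Karlin-Altschul normalization)."""
    return (GAPPED_LAMBDA * raw_score - math.log(GAPPED_K)) / math.log(2)


def expect(raw_score):
    """Approximate Expect value for a raw score over the whole search space."""
    _, _, search_space = effective_lengths()
    return search_space * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    eff_query, eff_db, space = effective_lengths()
    print("effective length of query:   ", eff_query)
    print("effective length of database:", eff_db)
    print("effective search space:      ", space)
    # Raw score 492 is the UGT90A1 hit reported as "194 bits (492)".
    print("bits for raw score 492:      ", bit_score(492))
    print("approx. Expect for 492:      ", expect(492))
```

The three effective-length figures come out identical to the footer. For the bit scores, the integer part of the computed value appears to match what the report prints for the hits checked here (for example raw scores 492 and 463), and the computed Expect lands in the same order of magnitude as the reported one.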