BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 036436
(485 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q4R1I9|ANGLT_ROSHC Anthocyanidin 5,3-O-glucosyltransferase OS=Rosa hybrid cultivar
GN=RhGT1 PE=2 SV=1
Length = 473
Score = 445 bits (1145), Expect = e-124, Method: Compositional matrix adjust.
Identities = 235/483 (48%), Positives = 315/483 (65%), Gaps = 25/483 (5%)
Query: 3 DTIVLYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTA--------PFVTSAGTD--D 52
D IVLY PG GHL SMVELGKL+LT+HP FSI I+ TA V S+ +
Sbjct: 4 DAIVLYPYPGLGHLISMVELGKLLLTHHPSFSITILASTAPTTIAATAKLVASSNDQLTN 63
Query: 53 YIASVSATAPSVTFHQLPPPVSRIPDTLRSPADFPALVYELGELNNPNLHETLITISKRS 112
YI +VSA P++ FH LP +S +P+ + + P +E L PN+ + L T+ +S
Sbjct: 64 YIKAVSADNPAINFHHLPT-ISSLPEHIEK-LNLP---FEYARLQIPNILQVLQTL--KS 116
Query: 113 NLKAFVIDFLCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSA 172
+LKA ++D C+ F V+ L+IPT+Y++T+AG LA L +PT H+ TT S + G
Sbjct: 117 SLKALILDMFCDALFDVTKD-LNIPTFYFYTSAGRSLAVLLNIPTFHR-TTNSLSDFGDV 174
Query: 173 LLNFPGFPPFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGIIVNTFELLQERAIKAML 232
++ G PP P M + DR YK + T MAKS GII+NTF+LL+ERA+KA+
Sbjct: 175 PISISGMPPIPVSAMPKLLFDRSTNFYKSFLSTSTHMAKSNGIILNTFDLLEERALKALR 234
Query: 233 EGQCIPGETLPPLYCIGPVVGRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSS 292
G C+P + PP++ +GP++ +G+N D HE L WL+++P SV+FLCFGS+G FS
Sbjct: 235 AGLCLPNQPTPPIFTVGPLISGKSGDN---DEHESLKWLNNQPKDSVVFLCFGSMGVFSI 291
Query: 293 KQLKEMAIGLERSGVKFLWVVRAPAPDSVE-NRSSLESLLPEGFLDRTKDRGLVVESWAP 351
KQL+ MA+GLE+SG +FLWVVR P + + SLE +LP+GF++RTKDRGLVV WAP
Sbjct: 292 KQLEAMALGLEKSGQRFLWVVRNPPIEELPVEEPSLEEILPKGFVERTKDRGLVVRKWAP 351
Query: 352 QVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIKAVVVEEMKVGLAVTR 411
QVEVL+H+SVGGFVTHCGWNSVLE VC GVPM+AWPLYAEQK+ + +VEEMKV + V
Sbjct: 352 QVEVLSHDSVGGFVTHCGWNSVLEAVCNGVPMVAWPLYAEQKLGRVFLVEEMKVAVGVKE 411
Query: 412 SEEGDGLVSSAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSSRVALDNLVES 471
SE G VS+ ELE+RV ELMDSE G ++ R A +GGSS +L L +
Sbjct: 412 SE--TGFVSADELEKRVRELMDSESGDEIRGRVSEFSNGGVKAKEEGGSSVASLAKLAQL 469
Query: 472 FKR 474
+K+
Sbjct: 470 WKQ 472
>sp|Q9LK73|U88A1_ARATH UDP-glycosyltransferase 88A1 OS=Arabidopsis thaliana GN=UGT88A1
PE=2 SV=1
Length = 462
Score = 404 bits (1039), Expect = e-112, Method: Compositional matrix adjust.
Identities = 216/473 (45%), Positives = 320/473 (67%), Gaps = 18/473 (3%)
Query: 2 KDTIVLYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVTSAGTDDYIASVSATA 61
++ IVLY +P GHL SMVELGK IL+ +P SI II+ P+ T YI+SVS++
Sbjct: 3 EEAIVLYPAPPIGHLVSMVELGKTILSKNPSLSIHIILVPPPY-QPESTATYISSVSSSF 61
Query: 62 PSVTFHQLPPPVSRIPDTLRSPADFPALVYELGELNNPNLHETLITISKRSNLKAFVIDF 121
PS+TFH LP V+ + S +L+ E+ +NP++H TL ++S+ N++A +IDF
Sbjct: 62 PSITFHHLPA-VTPYSSSSTSRHHHESLLLEILCFSNPSVHRTLFSLSRNFNVRAMIIDF 120
Query: 122 LCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTT-KSFRELGSALLNFPGFP 180
C +++ + P Y+++T+ + LA + YLPT+ + T K+ +++ + ++ PG P
Sbjct: 121 FCTAVLDITAD-FTFPVYFFYTSGAACLAFSFYLPTIDETTPGKNLKDIPT--VHIPGVP 177
Query: 181 PFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGIIVNTFELLQERAIKAMLEGQCIPGE 240
P DM + +R+ +VY + G Q++KS+GII+NTF+ L+ RAIKA+ E C
Sbjct: 178 PMKGSDMPKAVLERDDEVYDVFIMFGKQLSKSSGIIINTFDALENRAIKAITEELCFRN- 236
Query: 241 TLPPLYCIGPVVGRGNGENRGRDRH-ECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEMA 299
+Y IGP++ G E+R ++ CL+WLDS+P +SV+FLCFGSLG FS +Q+ E+A
Sbjct: 237 ----IYPIGPLIVNGRIEDRNDNKAVSCLNWLDSQPEKSVVFLCFGSLGLFSKEQVIEIA 292
Query: 300 IGLERSGVKFLWVVRAPAPDSVENRSSLESLLPEGFLDRTKDRGLVVESWAPQVEVLNHE 359
+GLE+SG +FLWVVR P P+ + L+SLLPEGFL RT+D+G+VV+SWAPQV VLNH+
Sbjct: 293 VGLEKSGQRFLWVVRNP-PELEKTELDLKSLLPEGFLSRTEDKGMVVKSWAPQVPVLNHK 351
Query: 360 SVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIKAVVVEEMKVGLAVTRSEEGDGLV 419
+VGGFVTHCGWNS+LE VCAGVPM+AWPLYAEQ+ + ++V+E+K+ +++ SE G V
Sbjct: 352 AVGGFVTHCGWNSILEAVCAGVPMVAWPLYAEQRFNRVMIVDEIKIAISMNESE--TGFV 409
Query: 420 SSAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSSRVALDNLVESF 472
SS E+E+RV E++ V+ER +AMK AA A+ + GSS AL L++S+
Sbjct: 410 SSTEVEKRVQEIIGE---CPVRERTMAMKNAAELALTETGSSHTALTTLLQSW 459
>sp|Q33DV3|4CGT_ANTMA Chalcone 4'-O-glucosyltransferase OS=Antirrhinum majus PE=1 SV=1
Length = 457
Score = 364 bits (934), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 207/475 (43%), Positives = 293/475 (61%), Gaps = 31/475 (6%)
Query: 4 TIVLYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVTSAGTDDYIASVSATAPS 63
TIV +TS HLNS + L K I T H II TAP +S +A + PS
Sbjct: 10 TIVFHTS--EEHLNSSIALAKFI-TKHHSSISITIISTAPAESSE-----VAKI-INNPS 60
Query: 64 VTFHQLPP---PVSRIPDTLRSPADFPALVYELGELNNPNLHETLITISKRSNLKAFVID 120
+T+ L P + + ++P + L +E+ L N NL E L+ IS++S++KA +ID
Sbjct: 61 ITYRGLTAVALPENLTSNINKNPVE---LFFEIPRLQNANLREALLDISRKSDIKALIID 117
Query: 121 FLCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSALLNFPGFP 180
F CN AF+VS+S ++IPTY+ + +L L+ PTLH+ +L ++ PGFP
Sbjct: 118 FFCNAAFEVSTS-MNIPTYFDVSGGAFLLCTFLHHPTLHQTVRGDIADLNDSV-EMPGFP 175
Query: 181 PFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGIIVNTFELLQERAIKAMLEGQCIPGE 240
+ D+ + + R+ VYK +DT + M KS+GI+VNTF L+ RA +A+ G P
Sbjct: 176 LIHSSDLPMSLFYRKTNVYKHFLDTSLNMRKSSGILVNTFVALEFRAKEALSNGLYGPT- 234
Query: 241 TLPPLYCIGPVVGRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEMAI 300
PPLY + + + ++HECLSWLD +PS+SV+FLCFG G+FS++QLKE+AI
Sbjct: 235 --PPLYLLSHTIAEPHDTKVLVNQHECLSWLDLQPSKSVIFLCFGRRGAFSAQQLKEIAI 292
Query: 301 GLERSGVKFLWVVR-APAPDSVENRSSLESLLPEGFLDRTKDRGLVVESWAPQVEVLNHE 359
GLE+SG +FLW+ R +P D L +LLPEGFL RTK G V +W PQ EVL+H+
Sbjct: 293 GLEKSGCRFLWLARISPEMD-------LNALLPEGFLSRTKGVGFVTNTWVPQKEVLSHD 345
Query: 360 SVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIKAVVVEEMKVGLAVTRSEEGDGLV 419
+VGGFVTHCGW+SVLE + GVPM+ WPLYAEQ++ + +VEE+KV L + +E DG V
Sbjct: 346 AVGGFVTHCGWSSVLEALSFGVPMIGWPLYAEQRINRVFMVEEIKVALPL---DEEDGFV 402
Query: 420 SSAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSSRVALDNLVESFKR 474
++ ELE+RV ELM+S KG+ VK R +K + AA+ GGSS +L+ + S R
Sbjct: 403 TAMELEKRVRELMESVKGKEVKRRVAELKISTKAAVSKGGSSLASLEKFINSVTR 457
>sp|Q76MR7|UBGAT_SCUBA Baicalein 7-O-glucuronosyltransferase OS=Scutellaria baicalensis
GN=UBGAT-I PE=1 SV=1
Length = 441
Score = 354 bits (909), Expect = 7e-97, Method: Compositional matrix adjust.
Identities = 199/446 (44%), Positives = 280/446 (62%), Gaps = 21/446 (4%)
Query: 19 MVELGKLILTYHPCFSIDIIIPTAPFVTSAGTDDYIASVSATAPSVTFHQLPPPVSRIPD 78
M L K I HP I III AP + ASV+A PS+++H+LP P IP
Sbjct: 1 MAVLAKFISKNHPSVPI-IIISNAP-------ESAAASVAAI-PSISYHRLPLP--EIPP 49
Query: 79 TLRSPADFPALVYELGELNNPNLHETLITISKRSNLKAFVIDFLCNPAFQVSSSTLSIPT 138
+ + D L +EL L+NPNL L IS+++ ++A ++DF CN AF+V +S L+IPT
Sbjct: 50 DMTT--DRVELFFELPRLSNPNLLTALQQISQKTRIRAVILDFFCNAAFEVPTS-LNIPT 106
Query: 139 YYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSALLNFPGFPPFPARDMALPMHDREGKV 198
YYYF+ LY T+ + ++L ++ PG PP D+ + + R+ V
Sbjct: 107 YYYFSAGTPTAILTLYFETIDETIPVDLQDLND-YVDIPGLPPIHCLDIPVALSPRKSLV 165
Query: 199 YKGLVDTGIQMAKSAGIIVNTFELLQERAIKAMLEGQCIPGETLPPLYCIGPVVGRGNGE 258
YK VD + +SAGI+VN F+ L+ RAI + + PP+Y IGP+VG + +
Sbjct: 166 YKSSVDISKNLRRSAGILVNGFDALEFRAIGSHSQRPMHFKGPTPPVYFIGPLVGDVDTK 225
Query: 259 NRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEMAIGLERSGVKFLWVVRAP-- 316
G + HECL WLD++PS+SV+FLCFG G FS+KQLKE A LE SG +FLW VR P
Sbjct: 226 A-GSEEHECLRWLDTQPSKSVVFLCFGRRGVFSAKQLKETAAALENSGHRFLWSVRNPPE 284
Query: 317 -APDSVENRSSLESLLPEGFLDRTKDRGLVVESWAPQVEVLNHESVGGFVTHCGWNSVLE 375
+ + L+ LLPEGFL+RTKDRG V++SWAPQ EVL H+SVGGFVTHCG +SV E
Sbjct: 285 LKKATGSDEPDLDELLPEGFLERTKDRGFVIKSWAPQKEVLAHDSVGGFVTHCGRSSVSE 344
Query: 376 GVCAGVPMLAWPLYAEQKMIKAVVVEEMKVGLAVTRSEEGDGLVSSAELEQRVSELMDSE 435
GV GVPM+ WP+ AE ++ +AV+V++++V L + EE G V++AELE+RV ELM+++
Sbjct: 345 GVWFGVPMIGWPVDAELRLNRAVMVDDLQVALPL--EEEAGGFVTAAELEKRVRELMETK 402
Query: 436 KGRAVKERAVAMKEAAAAAMRDGGSS 461
G+AV++R +K +A AA+ + GSS
Sbjct: 403 AGKAVRQRVTELKLSARAAVAENGSS 428
>sp|Q9LNI1|U72B3_ARATH UDP-glycosyltransferase 72B3 OS=Arabidopsis thaliana GN=UGT72B3
PE=2 SV=1
Length = 481
Score = 291 bits (746), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 180/478 (37%), Positives = 270/478 (56%), Gaps = 28/478 (5%)
Query: 5 IVLYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIP-TAPFVTSAGTDDYIASVSATAPS 63
+ + SPG GHL +VEL K +L H F++ IIP +P S + S+ ++ S
Sbjct: 9 VAIIPSPGIGHLIPLVELAKRLLDNH-GFTVTFIIPGDSP--PSKAQRSVLNSLPSSIAS 65
Query: 64 VTFHQLPPP-VSRIPDTLRSPADFPALVYELGELNNPNLHETLITISKRSNLKA-FVIDF 121
V LPP +S +P T R V +NP L E ++S L A V+D
Sbjct: 66 VF---LPPADLSDVPSTARIETRISLTVTR----SNPALRELFGSLSAEKRLPAVLVVDL 118
Query: 122 LCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSALLNFPGFPP 181
AF V++ + Y ++ + +VL L+LP L + + FREL ++ PG P
Sbjct: 119 FGTDAFDVAAE-FHVSPYIFYASNANVLTFLLHLPKLDETVSCEFRELTEPVI-IPGCVP 176
Query: 182 FPARDMALPMHDREGKVYKGLVDTGIQMAKSAGIIVNTFELLQERAIKAMLEGQCIPGET 241
+D P DR+ + YK L+ + ++ GI+VN+F L+ IK + E P
Sbjct: 177 ITGKDFVDPCQDRKDESYKWLLHNVKRFKEAEGILVNSFVDLEPNTIKIVQE----PAPD 232
Query: 242 LPPLYCIGPVVGRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEMAIG 301
PP+Y IGP+V G+ + D ++CL+WLD++P SVL++ FGS G+ + +Q E+A+G
Sbjct: 233 KPPVYLIGPLVNSGSHDADVNDEYKCLNWLDNQPFGSVLYVSFGSGGTLTFEQFIELALG 292
Query: 302 LERSGVKFLWVVRAP---APDSVEN---RSSLESLLPEGFLDRTKDRGLVVESWAPQVEV 355
L SG +FLWV+R+P A S N R+ S LP+GFLDRTK++GLVV SWAPQ ++
Sbjct: 293 LAESGKRFLWVIRSPSGIASSSYFNPQSRNDPFSFLPQGFLDRTKEKGLVVGSWAPQAQI 352
Query: 356 LNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIKAVVVEEMKVGLAVTRSEEG 415
L H S+GGF+THCGWNS LE + GVP++AWPLYAEQKM ++V+ VG A+
Sbjct: 353 LTHTSIGGFLTHCGWNSSLESIVNGVPLIAWPLYAEQKMNALLLVD---VGAALRARLGE 409
Query: 416 DGLVSSAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSSRVALDNLVESFK 473
DG+V E+ + V L++ E+G AV+++ +KE + +RD G S +L+ + +K
Sbjct: 410 DGVVGREEVARVVKGLIEGEEGNAVRKKMKELKEGSVRVLRDDGFSTKSLNEVSLKWK 467
>sp|Q9M156|U72B1_ARATH UDP-glycosyltransferase 72B1 OS=Arabidopsis thaliana GN=UGT72B1
PE=1 SV=1
Length = 480
Score = 288 bits (737), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 175/471 (37%), Positives = 255/471 (54%), Gaps = 29/471 (6%)
Query: 5 IVLYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVTSAGTDDYIASVSATAPSV 64
+ + SPG GHL +VE K ++ H +I P S + S+ ++ SV
Sbjct: 9 VAIIPSPGMGHLIPLVEFAKRLVHLHGLTVTFVIAGEGP--PSKAQRTVLDSLPSSISSV 66
Query: 65 TFHQLPPPVSRIPDTLRSPADFPALVYELGELNNPNLHETLITISKRSNL-KAFVIDFLC 123
PPV L S + + +NP L + + + L A V+D
Sbjct: 67 FL----PPVDLT--DLSSSTRIESRISLTVTRSNPELRKVFDSFVEGGRLPTALVVDLFG 120
Query: 124 NPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSALLNFPGFPPFP 183
AF V+ +P Y ++ T +VL+ L+LP L + + FREL L+ PG P
Sbjct: 121 TDAFDVAVE-FHVPPYIFYPTTANVLSFFLHLPKLDETVSCEFRELTEPLM-LPGCVPVA 178
Query: 184 ARDMALPMHDREGKVYKGLVDTGIQMAKSAGIIVNTFELLQERAIKAMLEGQCIPGETLP 243
+D P DR+ YK L+ + ++ GI+VNTF L+ AIKA+ E PG P
Sbjct: 179 GKDFLDPAQDRKDDAYKWLLHNTKRYKEAEGILVNTFFELEPNAIKALQE----PGLDKP 234
Query: 244 PLYCIGPVVGRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEMAIGLE 303
P+Y +GP+V G E + + ECL WLD++P SVL++ FGS G+ + +QL E+A+GL
Sbjct: 235 PVYPVGPLVNIGKQEAKQTEESECLKWLDNQPLGSVLYVSFGSGGTLTCEQLNELALGLA 294
Query: 304 RSGVKFLWVVRAPAPDSVENRSSLES--------LLPEGFLDRTKDRGLVVESWAPQVEV 355
S +FLWV+R+P+ + N S +S LP GFL+RTK RG V+ WAPQ +V
Sbjct: 295 DSEQRFLWVIRSPS--GIANSSYFDSHSQTDPLTFLPPGFLERTKKRGFVIPFWAPQAQV 352
Query: 356 LNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIKAVVVEEMKVGLAVTRSEEG 415
L H S GGF+THCGWNS LE V +G+P++AWPLYAEQKM ++ E+++ L R G
Sbjct: 353 LAHPSTGGFLTHCGWNSTLESVVSGIPLIAWPLYAEQKMNAVLLSEDIRAAL---RPRAG 409
Query: 416 -DGLVSSAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSSRVAL 465
DGLV E+ + V LM+ E+G+ V+ + +KEAA ++D G+S AL
Sbjct: 410 DDGLVRREEVARVVKGLMEGEEGKGVRNKMKELKEAACRVLKDDGTSTKAL 460
>sp|Q9AR73|HQGT_RAUSE Hydroquinone glucosyltransferase OS=Rauvolfia serpentina GN=AS PE=1
SV=1
Length = 470
Score = 278 bits (710), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 176/478 (36%), Positives = 254/478 (53%), Gaps = 39/478 (8%)
Query: 5 IVLYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPT---APFVTSAGTDDYIASVS-AT 60
I + +PG GHL +VE K ++ H F + IIPT P + D A V+
Sbjct: 7 IAMVPTPGMGHLIPLVEFAKRLVLRH-NFGVTFIIPTDGPLPKAQKSFLDALPAGVNYVL 65
Query: 61 APSVTFHQLPPPV---SRIPDTLRSPADFPALVYELGELNNPNLHETLITISKRSNLKAF 117
P V+F LP V +RI T+ F + + + T+ + L A
Sbjct: 66 LPPVSFDDLPADVRIETRICLTITRSLPF--------------VRDAVKTLLATTKLAAL 111
Query: 118 VIDFLCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSALLNFP 177
V+D AF V+ + Y ++ T L+ +LP L + + +R++ L P
Sbjct: 112 VVDLFGTDAFDVAIE-FKVSPYIFYPTTAMCLSLFFHLPKLDQMVSCEYRDVPEPL-QIP 169
Query: 178 GFPPFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGIIVNTFELLQERAIKAMLEGQCI 237
G P +D P DR+ YK L+ + + GI+VNTF L+ +KA+ E
Sbjct: 170 GCIPIHGKDFLDPAQDRKNDAYKCLLHQAKRYRLAEGIMVNTFNDLEPGPLKALQEED-- 227
Query: 238 PGETLPPLYCIGPVVGRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKE 297
+ PP+Y IGP++ R + ++ D ECL WLD +P SVLF+ FGS G+ S Q E
Sbjct: 228 --QGKPPVYPIGPLI-RADSSSK-VDDCECLKWLDDQPRGSVLFISFGSGGAVSHNQFIE 283
Query: 298 MAIGLERSGVKFLWVVRAPAPD-------SVENRSSLESLLPEGFLDRTKDRGLVVESWA 350
+A+GLE S +FLWVVR+P S++N++ + LPEGFL+RTK R L+V SWA
Sbjct: 284 LALGLEMSEQRFLWVVRSPNDKIANATYFSIQNQNDALAYLPEGFLERTKGRCLLVPSWA 343
Query: 351 PQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIKAVVVEEMKVGLAVT 410
PQ E+L+H S GGF+THCGWNS+LE V GVP++AWPLYAEQKM ++ E +KV L
Sbjct: 344 PQTEILSHGSTGGFLTHCGWNSILESVVNGVPLIAWPLYAEQKMNAVMLTEGLKVALRPK 403
Query: 411 RSEEGDGLVSSAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSSRVALDNL 468
E +GL+ E+ V LM+ E+G+ + +K+AA+ A+ D GSS AL L
Sbjct: 404 AGE--NGLIGRVEIANAVKGLMEGEEGKKFRSTMKDLKDAASRALSDDGSSTKALAEL 459
>sp|Q8W4C2|U72B2_ARATH UDP-glycosyltransferase 72B2 OS=Arabidopsis thaliana GN=UGT72B2
PE=2 SV=1
Length = 480
Score = 273 bits (699), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 178/479 (37%), Positives = 260/479 (54%), Gaps = 29/479 (6%)
Query: 5 IVLYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIP--TAPFVTSAGTDDYIASVSATAP 62
I + SPG GHL VEL K L H CF++ +II T+P S + S+ ++
Sbjct: 9 IAIMPSPGMGHLIPFVELAKR-LVQHDCFTVTMIISGETSP---SKAQRSVLNSLPSSIA 64
Query: 63 SVTFHQLPPP-VSRIPDTLRSPADFPALVYELGELNNPNLHETLITISKRSNLKA-FVID 120
SV LPP +S +P T R + +NP L E ++S + +L A V+D
Sbjct: 65 SVF---LPPADLSDVPSTARIETRAMLTMTR----SNPALRELFGSLSTKKSLPAVLVVD 117
Query: 121 FLCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSALLNFPGFP 180
AF V+ + Y ++ + +VL+ L+LP L K + FR L L PG
Sbjct: 118 MFGADAFDVAVD-FHVSPYIFYASNANVLSFFLHLPKLDKTVSCEFRYLTEPL-KIPGCV 175
Query: 181 PFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGIIVNTFELLQERAIKAMLEGQCIPGE 240
P +D + DR YK L+ + ++ GI+VN+F L+ AIKA+ E P
Sbjct: 176 PITGKDFLDTVQDRNDDAYKLLLHNTKRYKEAKGILVNSFVDLESNAIKALQE----PAP 231
Query: 241 TLPPLYCIGPVVGRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEMAI 300
P +Y IGP+V + D+ CLSWLD++P SVL++ FGS G+ + +Q E+AI
Sbjct: 232 DKPTVYPIGPLVNTSSSNVNLEDKFGCLSWLDNQPFGSVLYISFGSGGTLTCEQFNELAI 291
Query: 301 GLERSGVKFLWVVRAPA---PDSVENRSSLE---SLLPEGFLDRTKDRGLVVESWAPQVE 354
GL SG +F+WV+R+P+ S N S S LP GFLDRTK++GLVV SWAPQV+
Sbjct: 292 GLAESGKRFIWVIRSPSEIVSSSYFNPHSETDPFSFLPIGFLDRTKEKGLVVPSWAPQVQ 351
Query: 355 VLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIKAVVVEEMKVGLAVTRSEE 414
+L H S GF+THCGWNS LE + GVP++AWPL+AEQKM ++VE++ L + E
Sbjct: 352 ILAHPSTCGFLTHCGWNSTLESIVNGVPLIAWPLFAEQKMNTLLLVEDVGAALRIHAGE- 410
Query: 415 GDGLVSSAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSSRVALDNLVESFK 473
DG+V E+ + V LM+ E+G+A+ + +KE + D G S + ++ +K
Sbjct: 411 -DGIVRREEVVRVVKALMEGEEGKAIGNKVKELKEGVVRVLGDDGLSSKSFGEVLLKWK 468
>sp|O23382|U71B5_ARATH UDP-glycosyltransferase 71B5 OS=Arabidopsis thaliana GN=UGT71B5
PE=3 SV=1
Length = 478
Score = 263 bits (672), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 165/492 (33%), Positives = 255/492 (51%), Gaps = 41/492 (8%)
Query: 1 MKDTIVLYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVTSAGTDDYIASVSAT 60
MK +V PG GHL V+L K ++ SI III + F + IAS++
Sbjct: 1 MKIELVFIPLPGIGHLRPTVKLAKQLIGSENRLSITIIIIPSRF-DAGDASACIASLTTL 59
Query: 61 APSVTFHQLPPPVSRIPDTLRSPADFPALVYELGELNNPNLHETLITISKRSNLKAFVID 120
+ H V++ P T P PA VY + + L FV+D
Sbjct: 60 SQDDRLHYESISVAKQPPT-SDPDPVPAQVYIEKQKTKVRDAVAARIVDPTRKLAGFVVD 118
Query: 121 FLCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSAL--LNFPG 178
C+ V++ +P Y +T+ + L L++ ++ EL +++ L FP
Sbjct: 119 MFCSSMIDVANE-FGVPCYMVYTSNATFLGTMLHVQQMYDQKKYDVSELENSVTELEFPS 177
Query: 179 FP-PFPARDMA--------LPMHDREGKVYKGLVDTGIQMAKSAGIIVNTFELLQERAIK 229
P+P + + LP+ + + ++ K GI+VNT L+ A+K
Sbjct: 178 LTRPYPVKCLPHILTSKEWLPLSLAQARCFR----------KMKGILVNTVAELEPHALK 227
Query: 230 AMLEGQCIPGETLPPLYCIGPVVGRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGS 289
I G+ LP +Y +GPV+ NG + + E L WLD +PS+SV+FLCFGSLG
Sbjct: 228 MF----NINGDDLPQVYPVGPVLHLENGNDDDEKQSEILRWLDEQPSKSVVFLCFGSLGG 283
Query: 290 FSSKQLKEMAIGLERSGVKFLWVVRAPAPDSVENR----SSLESLLPEGFLDRTKDRGLV 345
F+ +Q +E A+ L+RSG +FLW +R +P+ +R ++LE +LPEGFL+RT DRG V
Sbjct: 284 FTEEQTRETAVALDRSGQRFLWCLRHASPNIKTDRPRDYTNLEEVLPEGFLERTLDRGKV 343
Query: 346 VESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIKAVVVEEMKV 405
+ WAPQV VL ++GGFVTHCGWNS+LE + GVPM+ WPLYAEQK+ +VEE+ +
Sbjct: 344 I-GWAPQVAVLEKPAIGGFVTHCGWNSILESLWFGVPMVTWPLYAEQKVNAFEMVEELGL 402
Query: 406 GLAVTRSEEGDGL------VSSAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGG 459
+ + + +GD V++ ++E+ + +M E+ V+ M E A+ DGG
Sbjct: 403 AVEIRKYLKGDLFAGEMETVTAEDIERAIRRVM--EQDSDVRNNVKEMAEKCHFALMDGG 460
Query: 460 SSRVALDNLVES 471
SS+ AL+ ++
Sbjct: 461 SSKAALEKFIQD 472
>sp|Q9LSY5|U71B7_ARATH UDP-glycosyltransferase 71B7 OS=Arabidopsis thaliana GN=UGT71B7
PE=2 SV=2
Length = 495
Score = 261 bits (668), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 170/506 (33%), Positives = 257/506 (50%), Gaps = 56/506 (11%)
Query: 1 MKDTIVLYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVTS--AGTDDYIASVS 58
MK +V PG GHL S VE+ KL++ SI +II PF++ G DYIA++S
Sbjct: 1 MKFELVFIPYPGIGHLRSTVEMAKLLVDRETRLSISVII--LPFISEGEVGASDYIAALS 58
Query: 59 ATAPSVTFHQLPPPVSRIPDTLRSPADFPALVYELGELNNPN-----------LHETLIT 107
A++ + R+ + S D P + E++ N L E +
Sbjct: 59 ASSNN-----------RLRYEVISAVDQPTIEMTTIEIHMKNQEPKVRSTVAKLLEDYSS 107
Query: 108 ISKRSNLKAFVIDFLCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFR 167
+ FV+D C V++ P+Y ++T++ +L+ ++ L
Sbjct: 108 KPDSPKIAGFVLDMFCTSMVDVANE-FGFPSYMFYTSSAGILSVTYHVQMLCDENKYDVS 166
Query: 168 EL----GSALLNFPGFP-PFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGIIVNTFEL 222
E A+LNFP P+P + LP V+ + + GI+VNT
Sbjct: 167 ENDYADSEAVLNFPSLSRPYPVK--CLPHALAANMWLPVFVNQARKFREMKGILVNTVAE 224
Query: 223 LQERAIKAMLEGQCIPGETLPPLYCIGPVVGRGNGENRGRD--RHECLSWLDSKPSRSVL 280
L+ +K + PP+Y +GP++ N + +D R E + WLD +P SV+
Sbjct: 225 LEPYVLKFLSSSDT------PPVYPVGPLLHLENQRDDSKDEKRLEIIRWLDQQPPSSVV 278
Query: 281 FLCFGSLGSFSSKQLKEMAIGLERSGVKFLWVVRAPAPDSVENR----SSLESLLPEGFL 336
FLCFGS+G F +Q++E+AI LERSG +FLW +R +P+ + ++LE +LPEGF
Sbjct: 279 FLCFGSMGGFGEEQVREIAIALERSGHRFLWSLRRASPNIFKELPGEFTNLEEVLPEGFF 338
Query: 337 DRTKDRGLVVESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIK 396
DRTKD G V+ WAPQV VL + ++GGFVTHCGWNS LE + GVP AWPLYAEQK
Sbjct: 339 DRTKDIGKVI-GWAPQVAVLANPAIGGFVTHCGWNSTLESLWFGVPTAAWPLYAEQKFNA 397
Query: 397 AVVVEEMKVGLAVTRSEEGDGL-------VSSAELEQRVSELMDSEKGRAVKERAVAMKE 449
++VEE+ + + + + G+ L V++ E+E+ + LM E+ V++R M E
Sbjct: 398 FLMVEELGLAVEIRKYWRGEHLAGLPTATVTAEEIEKAIMCLM--EQDSDVRKRVKDMSE 455
Query: 450 AAAAAMRDGGSSRVALDNLVESFKRG 475
A+ DGGSSR AL +E +
Sbjct: 456 KCHVALMDGGSSRTALQKFIEEVAKN 481
>sp|Q66PF3|UFOG3_FRAAN Putative UDP-glucose flavonoid 3-O-glucosyltransferase 3
OS=Fragaria ananassa GN=GT3 PE=2 SV=1
Length = 478
Score = 246 bits (629), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 163/487 (33%), Positives = 259/487 (53%), Gaps = 38/487 (7%)
Query: 5 IVLYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVTSAGTDDYIASV----SAT 60
+VL SPG GHL S +E+ KL+++ I ++I P V S GTD Y+ S+ S
Sbjct: 7 LVLIPSPGIGHLVSTLEIAKLLVSRDDKLFITVLIMHFPAV-SKGTDAYVQSLADSSSPI 65
Query: 61 APSVTFHQLPPP-VSRIPDTLRSPADFPALVYELGELNNPNLHETLITI--SKRSNLKAF 117
+ + F LP + ++R+ + E P++ + + + SK + L F
Sbjct: 66 SQRINFINLPHTNMDHTEGSVRNS------LVGFVESQQPHVKDAVANLRDSKTTRLAGF 119
Query: 118 VIDFLCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKS---FRELGSALL 174
V+D C V++ L +P+Y +FT+ + L +L L K F++ + L+
Sbjct: 120 VVDMFCTTMINVANQ-LGVPSYVFFTSGAATLGLLFHLQELRDQYNKDCTEFKDSDAELI 178
Query: 175 NFPGFPPFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGIIVNTFELLQERAIKAMLEG 234
F P PA+ LP + ++ + ++ GI+VNTF L+ A+ A+
Sbjct: 179 IPSFFNPLPAK--VLPGRMLVKDSAEPFLNVIKRFRETKGILVNTFTDLESHALHALSSD 236
Query: 235 QCIPGETLPPLYCIGPVVGRGNGENRG-----RDRHECLSWLDSKPSRSVLFLCFGSLGS 289
IP P+Y +GP++ + E+R + +++ L WLD +P SV+FLCFGS+GS
Sbjct: 237 AEIP-----PVYPVGPLLNLNSNESRVDSDEVKKKNDILKWLDDQPPLSVVFLCFGSMGS 291
Query: 290 FSSKQLKEMAIGLERSGVKFLWVVR-APAPDSVENRSSLES---LLPEGFLDRTKDRGLV 345
F Q++E+A LE +G +FLW +R +P V S + +LPEGFLDRT G V
Sbjct: 292 FDESQVREIANALEHAGHRFLWSLRRSPPTGKVAFPSDYDDHTGVLPEGFLDRTGGIGKV 351
Query: 346 VESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIKAVVVEEMKV 405
+ WAPQV VL H SVGGFV+HCGWNS LE + GVP+ WPLYAEQ++ V+E+++
Sbjct: 352 I-GWAPQVAVLAHPSVGGFVSHCGWNSTLESLWHGVPVATWPLYAEQQLNAFQPVKELEL 410
Query: 406 GLAVTRSEEGDG--LVSSAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSSRV 463
+ + S LVS+ E+E+ + E+M+ + +++R M E A+ DGGSS
Sbjct: 411 AVEIDMSYRSKSPVLVSAKEIERGIREVMELDSSD-IRKRVKEMSEKGKKALMDGGSSYT 469
Query: 464 ALDNLVE 470
+L + ++
Sbjct: 470 SLGHFID 476
>sp|Q9LML6|U71C4_ARATH UDP-glycosyltransferase 71C4 OS=Arabidopsis thaliana GN=UGT71C4
PE=2 SV=2
Length = 479
Score = 241 bits (614), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 167/496 (33%), Positives = 251/496 (50%), Gaps = 42/496 (8%)
Query: 1 MKDTIVLYTS-PGRGHLNSMVELGK-LILTYHPCFSIDIIIPTAPFVTSAGTDDYIASVS 58
+K+T +++ P GH+ +E K LI H +I I+ ++P +S + S+
Sbjct: 2 VKETELIFIPVPSTGHILVHIEFAKRLINLDHRIHTITILNLSSP--SSPHASVFARSLI 59
Query: 59 ATAPSVTFHQLPPPVSRIPDTL--RSPADFPALVYELGELNNPNLHETLITI-------S 109
A+ P + H LPP P L R+P A + +L + N P + + + +I S
Sbjct: 60 ASQPKIRLHDLPPIQDPPPFDLYQRAPE---AYIVKLIKKNTPLIKDAVSSIVASRRGGS 116
Query: 110 KRSNLKAFVIDFLCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFR-E 168
+ V+D CN + + L++P+Y Y T L Y+P H+ F
Sbjct: 117 DSVQVAGLVLDLFCNSLVKDVGNELNLPSYIYLTCNARYLGMMKYIPDRHRKIASEFDLS 176
Query: 169 LGSALLNFPGF-PPFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGIIVNTFELLQERA 227
G L PGF P + M + ++E Y+ V+ + A + GI+VN+F L+
Sbjct: 177 SGDEELPVPGFINAIPTKFMPPGLFNKEA--YEAYVELAPRFADAKGILVNSFTELEPHP 234
Query: 228 IKAMLEGQCIPGETLPPLYCIGPVVG---RGNGENRGRDRHECLSWLDSKPSRSVLFLCF 284
E PP+Y +GP++ R + DR + + WLD +P SV+FLCF
Sbjct: 235 FDYFSHL-----EKFPPVYPVGPILSLKDRASPNEEAVDRDQIVGWLDDQPESSVVFLCF 289
Query: 285 GSLGSFSSKQLKEMAIGLERSGVKFLWVVRAPAPDSVENRSSLESLLPEGFLDRTKDRGL 344
GS GS Q+KE+A LE G +FLW +R S + ++ +LPEGF+ R RGL
Sbjct: 290 GSRGSVDEPQVKEIARALELVGCRFLWSIRT----SGDVETNPNDVLPEGFMGRVAGRGL 345
Query: 345 VVESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIKAVVVEE-- 402
V WAPQVEVL H+++GGFV+HCGWNS LE + GVP+ WP+YAEQ++ +V+E
Sbjct: 346 VC-GWAPQVEVLAHKAIGGFVSHCGWNSTLESLWFGVPVATWPMYAEQQLNAFTLVKELG 404
Query: 403 MKVGLAVTRSEEGDGLVSSAELEQRVSELMD--SEKGRAVKERAVAMKEAAAAAMRDGGS 460
+ V L + GLV+ E+ + V LMD EK + VKE M +AA A+ DGGS
Sbjct: 405 LAVDLRMDYVSSRGGLVTCDEIARAVRSLMDGGDEKRKKVKE----MADAARKALMDGGS 460
Query: 461 SRVALDNLV-ESFKRG 475
S +A + E F+ G
Sbjct: 461 SSLATARFIAELFEDG 476
>sp|Q9LSY4|U71B8_ARATH UDP-glycosyltransferase 71B8 OS=Arabidopsis thaliana GN=UGT71B8
PE=3 SV=1
Length = 480
Score = 239 bits (611), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 169/506 (33%), Positives = 261/506 (51%), Gaps = 64/506 (12%)
Query: 2 KDTIVLYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVTSAGTDDYIASVSATA 61
K +V P GHL S E+ KL++ SI III P ++ DD AS +A
Sbjct: 3 KFALVFVPFPILGHLKSTAEMAKLLVEQETRLSISIII--LPLLSG---DDVSASAYISA 57
Query: 62 PSVTFHQLPPPVSRIPDTLRSPADFPALVYELGELNNPNLHETLITI----SKRSN---L 114
S + R+ + S D P + + + P + T+ + S+R + L
Sbjct: 58 LSAASND------RLHYEVISDGDQPTVGLHVDN-HIPMVKRTVAKLVDDYSRRPDSPRL 110
Query: 115 KAFVIDFLCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFREL----G 170
V+D C V++ +S+P Y ++T+ +LA L++ L S E
Sbjct: 111 AGLVVDMFCISVIDVANE-VSVPCYLFYTSNVGILALGLHIQMLFDKKEYSVSETDFEDS 169
Query: 171 SALLNFPGFP-PFPARDMA--------LPMHDREGKVYKGLVDTGIQMAKSAGIIVNTFE 221
+L+ P P+P + + LPM+ +G+ ++ + GI+VNTF
Sbjct: 170 EVVLDVPSLTCPYPVKCLPYGLATKEWLPMYLNQGRRFREM----------KGILVNTFA 219
Query: 222 LLQERAIKAMLEGQCIPGETLPPLYCIGPVVGRGNGENRGRDRH--ECLSWLDSKPSRSV 279
L+ A++++ G+T P Y +GP++ N + +D + L WLD +P +SV
Sbjct: 220 ELEPYALESLHSS----GDT-PRAYPVGPLLHLENHVDGSKDEKGSDILRWLDEQPPKSV 274
Query: 280 LFLCFGSLGSFSSKQLKEMAIGLERSGVKFLWVVRAPAPDSVENR----SSLESLLPEGF 335
+FLCFGS+G F+ +Q +EMAI LERSG +FLW +R + D + +LE +LPEGF
Sbjct: 275 VFLCFGSIGGFNEEQAREMAIALERSGHRFLWSLRRASRDIDKELPGEFKNLEEILPEGF 334
Query: 336 LDRTKDRGLVVESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMI 395
DRTKD+G V+ WAPQV VL ++GGFVTHCGWNS+LE + GVP+ WPLYAEQK
Sbjct: 335 FDRTKDKGKVI-GWAPQVAVLAKPAIGGFVTHCGWNSILESLWFGVPIAPWPLYAEQKFN 393
Query: 396 KAVVVEEMKVGLAVTRSEEGDGLVSSA-------ELEQRVSELMDSEKGRAVKERAVAMK 448
V+VEE+ + + + + GD LV +A E+E+ + LM E+ V+ R M
Sbjct: 394 AFVMVEELGLAVKIRKYWRGDQLVGTATVIVTAEEIERGIRCLM--EQDSDVRNRVKEMS 451
Query: 449 EAAAAAMRDGGSSRVALDNLVESFKR 474
+ A++DGGSS+ AL ++ +
Sbjct: 452 KKCHMALKDGGSSQSALKLFIQDVTK 477
>sp|O82383|U71D1_ARATH UDP-glycosyltransferase 71D1 OS=Arabidopsis thaliana GN=UGT71D1
PE=2 SV=1
Length = 467
Score = 238 bits (608), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 148/476 (31%), Positives = 240/476 (50%), Gaps = 33/476 (6%)
Query: 5 IVLYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVTSAGTDDYIASVSATAPSV 64
++ +P GHL +E + ++ I I++ + D Y+ S++++ P V
Sbjct: 6 LIFIPTPTVGHLVPFLEFARRLIEQDDRIRITILL--MKLQGQSHLDTYVKSIASSQPFV 63
Query: 65 TFHQLPPPVSRIPDTLRSPADFPALVYELGELNNPNLHETLITISKRSNL-----KAFVI 119
F +P + TL S A VY++ E N P + ++ I L K V+
Sbjct: 64 RFIDVPELEEK--PTLGSTQSVEAYVYDVIERNIPLVRNIVMDILTSLALDGVKVKGLVV 121
Query: 120 DFLCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSALLNFPGF 179
DF C P V+ +S+P Y + TT LA YL H T F +L+ PGF
Sbjct: 122 DFFCLPMIDVAKD-ISLPFYVFLTTNSGFLAMMQYLADRHSRDTSVFVRNSEEMLSIPGF 180
Query: 180 -PPFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGIIVNTFELLQERAIKAMLEGQCIP 238
P PA + + +G Y V I K+ GI+VN+ ++ ++ L+ Q
Sbjct: 181 VNPVPANVLPSALFVEDG--YDAYVKLAILFTKANGILVNSSFDIEPYSVNHFLQEQ--- 235
Query: 239 GETLPPLYCIGPVV---GRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQL 295
P +Y +GP+ + + E R E + WLD +P SV+FLCFGS+ +
Sbjct: 236 --NYPSVYAVGPIFDLKAQPHPEQDLTRRDELMKWLDDQPEASVVFLCFGSMARLRGSLV 293
Query: 296 KEMAIGLERSGVKFLWVVRAPAPDSVENRSSLESLLPEGFLDRTKDRGLVVESWAPQVEV 355
KE+A GLE +FLW S+ + LPEGFLDR RG++ W+PQVE+
Sbjct: 294 KEIAHGLELCQYRFLW--------SLRKEEVTKDDLPEGFLDRVDGRGMIC-GWSPQVEI 344
Query: 356 LNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIKAVVVEEMK--VGLAVTRSE 413
L H++VGGFV+HCGWNS++E + GVP++ WP+YAEQ++ ++V+E+K V L +
Sbjct: 345 LAHKAVGGFVSHCGWNSIVESLWFGVPIVTWPMYAEQQLNAFLMVKELKLAVELKLDYRV 404
Query: 414 EGDGLVSSAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSSRVALDNLV 469
D +V++ E+E + +MD++ V++R + + + A ++GGSS A++ +
Sbjct: 405 HSDEIVNANEIETAIRYVMDTDNN-VVRKRVMDISQMIQRATKNGGSSFAAIEKFI 459
>sp|Q40287|UFOG5_MANES Anthocyanidin 3-O-glucosyltransferase 5 OS=Manihot esculenta GN=GT5
PE=2 SV=1
Length = 487
Score = 238 bits (608), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 156/471 (33%), Positives = 243/471 (51%), Gaps = 28/471 (5%)
Query: 2 KDTIVLYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVTSAGTDDYIASVSATA 61
K IVL +SPG GHL ++ELGK I+T C + D+ I TSA + S + T
Sbjct: 9 KPHIVLLSSPGLGHLIPVLELGKRIVTL--C-NFDVTIFMVGSDTSAAEPQVLRS-AMTP 64
Query: 62 PSVTFHQLPPPVSRIPDTLRSPADFPALVYELGELNNPNLHETLITISKRSNLKAFVIDF 121
QLPPP I + A ++ L P + + R A ++D
Sbjct: 65 KLCEIIQLPPP--NISCLIDPEATVCTRLFVLMREIRPAFRAAVSALKFRP--AAIIVDL 120
Query: 122 LCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSALLNFPGFPP 181
+ +V+ L I Y Y + LA +Y+P L K F L + PG P
Sbjct: 121 FGTESLEVAKE-LGIAKYVYIASNAWFLALTIYVPILDKEVEGEFV-LQKEPMKIPGCRP 178
Query: 182 FPARDMALPMHDREGKVYKGLVDTGIQMAKSAGIIVNTFELLQERAIKAMLEGQCIPGET 241
++ PM DR + Y GI++ + GI++NT+E L+ A+ + + +
Sbjct: 179 VRTEEVVDPMLDRTNQQYSEYFRLGIEIPTADGILMNTWEALEPTTFGALRDVKFLGRVA 238
Query: 242 LPPLYCIGPVVGRGN--GENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEMA 299
P++ IGP+ + G N E L WLD +P SV+++ FGS G+ S +Q+ E+A
Sbjct: 239 KVPVFPIGPLRRQAGPCGSN-----CELLDWLDQQPKESVVYVSFGSGGTLSLEQMIELA 293
Query: 300 IGLERSGVKFLWVVRAPAPDSVE--------NRSSLESLLPEGFLDRTKDRGLVVESWAP 351
GLERS +F+WVVR P + + + PEGFL R ++ GLVV W+P
Sbjct: 294 WGLERSQQRFIWVVRQPTVKTGDAAFFTQGDGADDMSGYFPEGFLTRIQNVGLVVPQWSP 353
Query: 352 QVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIKAVVVEEMKVGLAVT- 410
Q+ +++H SVG F++HCGWNSVLE + AGVP++AWP+YAEQ+M ++ EE+ G+AV
Sbjct: 354 QIHIMSHPSVGVFLSHCGWNSVLESITAGVPIIAWPIYAEQRMNATLLTEEL--GVAVRP 411
Query: 411 RSEEGDGLVSSAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSS 461
++ +V E+E+ + +M E+G +++R +K++ A+ +GGSS
Sbjct: 412 KNLPAKEVVKREEIERMIRRIMVDEEGSEIRKRVRELKDSGEKALNEGGSS 462
>sp|Q94A84|U72E1_ARATH UDP-glycosyltransferase 72E1 OS=Arabidopsis thaliana GN=UGT72E1
PE=1 SV=1
Length = 487
Score = 236 bits (602), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 155/486 (31%), Positives = 252/486 (51%), Gaps = 36/486 (7%)
Query: 2 KDTIVLYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVTSAGTDDYIASVSATA 61
K + ++ SPG GH+ ++ELGK + H D+ I ++ ++ S A
Sbjct: 5 KPHVAMFASPGMGHIIPVIELGKRLAGSH---GFDVTIFVLETDAASAQSQFLNSPGCDA 61
Query: 62 PSVTFHQLPPP-VSRIPDTLRSPADFPALVYELGELNNPNLHETLITI-SKRSNLK---- 115
V LP P +S + D P+ + + L + ET+ TI SK ++
Sbjct: 62 ALVDIVGLPTPDISGLVD--------PSAFFGIKLLVM--MRETIPTIRSKIEEMQHKPT 111
Query: 116 AFVIDFLCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSALLN 175
A ++D A + ++ TY + + LA L+ PTL K+ + + +
Sbjct: 112 ALIVDLFGLDAIPLGGE-FNMLTYIFIASNARFLAVALFFPTLDKDMEEE-HIIKKQPMV 169
Query: 176 FPGFPPFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGIIVNTFELLQERAIKAMLEGQ 235
PG P D D ++Y+ V G GIIVNT++ ++ + +K++ + +
Sbjct: 170 MPGCEPVRFEDTLETFLDPNSQLYREFVPFGSVFPTCDGIIVNTWDDMEPKTLKSLQDPK 229
Query: 236 CIPGETLPPLYCIGPVVGRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQL 295
+ P+Y IGP+ + + H L WL+ +P SVL++ FGS GS S+KQL
Sbjct: 230 LLGRIAGVPVYPIGPL---SRPVDPSKTNHPVLDWLNKQPDESVLYISFGSGGSLSAKQL 286
Query: 296 KEMAIGLERSGVKFLWVVRAP----------APDSVENRSSLESLLPEGFLDRTKDRGLV 345
E+A GLE S +F+WVVR P + +S + R LPEGF+ RT +RG +
Sbjct: 287 TELAWGLEMSQQRFVWVVRPPVDGSACSAYLSANSGKIRDGTPDYLPEGFVSRTHERGFM 346
Query: 346 VESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIKAVVVEEMKV 405
V SWAPQ E+L H++VGGF+THCGWNS+LE V GVPM+AWPL+AEQ M ++ EE+ V
Sbjct: 347 VSSWAPQAEILAHQAVGGFLTHCGWNSILESVVGGVPMIAWPLFAEQMMNATLLNEELGV 406
Query: 406 GLAVTRSEEGDGLVSSAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMR-DGGSSRVA 464
+ ++ +G+++ AE+E V ++M E+G ++++ +KE AA ++ DGG + +
Sbjct: 407 AVR-SKKLPSEGVITRAEIEALVRKIMVEEEGAEMRKKIKKLKETAAESLSCDGGVAHES 465
Query: 465 LDNLVE 470
L + +
Sbjct: 466 LSRIAD 471
>sp|Q9ZU72|U72D1_ARATH UDP-glycosyltransferase 72D1 OS=Arabidopsis thaliana GN=UGT72D1
PE=2 SV=1
Length = 470
Score = 235 bits (600), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 156/483 (32%), Positives = 254/483 (52%), Gaps = 28/483 (5%)
Query: 6 VLYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVTSAGTDDYIASVSATAPSVT 65
+L SPG GHL ++ELG + + +I + I +S+ T+ +A
Sbjct: 7 LLVASPGLGHLIPILELGNRLSS---VLNIHVTILAVTSGSSSPTETEAIHAAAARTICQ 63
Query: 66 FHQLPPPVSRIPDTLRSP--ADFPALVYELGELNNPNLHETLITISKRSNLKAFVIDFLC 123
++P S D L P F +V ++ + P + + + + ++ + ++DFL
Sbjct: 64 ITEIP---SVDVDNLVEPDATIFTKMVVKMRAMK-PAVRDAVKLMKRKPTV--MIVDFLG 117
Query: 124 NPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSALLNFPGFPPFP 183
V+ Y Y T LA +YLP L + ++ L PG P
Sbjct: 118 TELMSVADDVGMTAKYVYVPTHAWFLAVMVYLPVLDTVVEGEYVDIKEPL-KIPGCKPVG 176
Query: 184 ARDMALPMHDREGKVYKGLVDTGIQMAKSAGIIVNTFELLQERAIKAMLEGQCIPGETLP 243
+++ M DR G+ YK V G+++ S G++VNT+E LQ + A+ E + +
Sbjct: 177 PKELMETMLDRSGQQYKECVRAGLEVPMSDGVLVNTWEELQGNTLAALREDEELSRVMKV 236
Query: 244 PLYCIGPVVGRGNGENRGRDR-HECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEMAIGL 302
P+Y IGP+V N+ D+ + WLD + RSV+F+C GS G+ + +Q E+A+GL
Sbjct: 237 PVYPIGPIVRT----NQHVDKPNSIFEWLDEQRERSVVFVCLGSGGTLTFEQTVELALGL 292
Query: 303 ERSGVKFLWVVRAPAP---DSVENRSSLESLLPEGFLDRTKDRGLVVESWAPQVEVLNHE 359
E SG +F+WV+R PA + + + LPEGFLDRT+ G+VV WAPQVE+L+H
Sbjct: 293 ELSGQRFVWVLRRPASYLGAISSDDEQVSASLPEGFLDRTRGVGIVVTQWAPQVEILSHR 352
Query: 360 SVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIKAVVVEEMKVGLAVTRSE-EGDGL 418
S+GGF++HCGW+S LE + GVP++AWPLYAEQ M ++ EE +G+AV SE + +
Sbjct: 353 SIGGFLSHCGWSSALESLTKGVPIIAWPLYAEQWMNATLLTEE--IGVAVRTSELPSERV 410
Query: 419 VSSAELEQRVSELM--DSEKGRAVKERAVAMKEAAAAAMRDGGSSRVALDNLVESFKRGR 476
+ E+ V ++M + E+G+ ++ +A ++ ++ A GSS ++L E KR
Sbjct: 411 IGREEVASLVRKIMAEEDEEGQKIRAKAEEVRVSSERAWSKDGSS---YNSLFEWAKRCY 467
Query: 477 MAP 479
+ P
Sbjct: 468 LVP 470
>sp|O82385|U71D2_ARATH UDP-glycosyltransferase 71D2 OS=Arabidopsis thaliana GN=UGT71D2
PE=2 SV=1
Length = 467
Score = 235 bits (599), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 152/477 (31%), Positives = 247/477 (51%), Gaps = 33/477 (6%)
Query: 5 IVLYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVTSAGTDDYIASVSATAPSV 64
++ +P GHL +E + ++ I ++ + D Y+ ++S++ P V
Sbjct: 6 LIFIPTPTVGHLVPFLEFARRLIEQDDRIRITFLLMKQQ--GQSHLDSYVKTISSSLPFV 63
Query: 65 TFHQLPPPVSRIPDTLRSPADFPALVYELGELNNPNLHETLITISKR-----SNLKAFVI 119
F +P + TL + + A VY+ E N P + ++ I +K FV
Sbjct: 64 RFIDVPELEEK--PTLGTQS-VEAYVYDFIETNVPLVQNIIMGILSSPAFDGVTVKGFVA 120
Query: 120 DFLCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSALLNFPGF 179
DF C P V+ S+P Y + T+ LA YL HK T F +L+ PGF
Sbjct: 121 DFFCLPMIDVAKDA-SLPFYVFLTSNSGFLAMMQYLAYGHKKDTSVFARNSEEMLSIPGF 179
Query: 180 -PPFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGIIVNTFELLQERAIKAMLEGQCIP 238
P PA+ + + +G Y V I K+ GI+VNT ++ ++ L G+
Sbjct: 180 VNPVPAKVLPSALFIEDG--YDADVKLAILFTKANGILVNTSFDIEPTSLNHFL-GE--- 233
Query: 239 GETLPPLYCIGPVVGRGNGENRGRDRHEC---LSWLDSKPSRSVLFLCFGSLGSFSSKQL 295
E P +Y +GP+ + +D C + WLD++P SV+FLCFGS+GS +
Sbjct: 234 -ENYPSVYAVGPIFNPKAHPHPDQDLACCDESMKWLDAQPEASVVFLCFGSMGSLRGPLV 292
Query: 296 KEMAIGLERSGVKFLWVVRAPAPDSVENRSSLESLLPEGFLDRTKDRGLVVESWAPQVEV 355
KE+A GLE +FLW +R + V N + LLPEGF+DR RG++ W+PQVE+
Sbjct: 293 KEIAHGLELCQYRFLWSLRT---EEVTN----DDLLPEGFMDRVSGRGMIC-GWSPQVEI 344
Query: 356 LNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIKAVVVEEMK--VGLAVTRSE 413
L H++VGGFV+HCGWNS++E + GVP++ WP+YAEQ++ ++V+E+K V L + S
Sbjct: 345 LAHKAVGGFVSHCGWNSIVESLWFGVPIVTWPMYAEQQLNAFLMVKELKLAVELKLDYSV 404
Query: 414 EGDGLVSSAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSSRVALDNLVE 470
+VS+ E+E +S +M+ + V++R + + + A ++GGSS A++ +
Sbjct: 405 HSGEIVSANEIETAISCVMNKDN-NVVRKRVMDISQMIQRATKNGGSSFAAIEKFIH 460
>sp|Q2V6K0|UFOG6_FRAAN UDP-glucose flavonoid 3-O-glucosyltransferase 6 OS=Fragaria
ananassa GN=GT6 PE=1 SV=1
Length = 479
Score = 234 bits (597), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 149/480 (31%), Positives = 252/480 (52%), Gaps = 32/480 (6%)
Query: 11 PGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVTSAGTDDYIASVSATAPSVTFHQL- 69
PG GH+ S VE+ KL+L I I+I PF T+ G+D YI S+ A PS+ ++
Sbjct: 13 PGIGHIVSTVEIAKLLLCRDDNLFITILIMKFPF-TADGSDVYIKSL-AVDPSLKTQRIR 70
Query: 70 --PPPVSRIPDTLRSPADFPALVYELGELNNPNLHETLITISKRSNLKAFVIDFLCNPAF 127
P T F + + + T S+ + + FVID C
Sbjct: 71 FVNLPQEHFQGT--GATGFFTFIDSHKSHVKDAVTRLMETKSETTRIAGFVIDMFCTGMI 128
Query: 128 QVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKS---FRELGSALLNFPGFPPFPA 184
+++ +P+Y ++T+ + L +L L K F++ + L+ P PA
Sbjct: 129 DLANE-FGLPSYVFYTSGAADLGLMFHLQALRDEENKDCTEFKDSDAELVVSSFVNPLPA 187
Query: 185 -RDMALPMHDREGKVYKGLVDTGIQMAKSAGIIVNTFELLQERAIKAMLEGQCIPGETLP 243
R + + ++EG + ++ + ++ GI+VNTF L+ AI+++ G+ LP
Sbjct: 188 ARVLPSVVFEKEGGNF--FLNFAKRYRETKGILVNTFLELEPHAIQSL----SSDGKILP 241
Query: 244 PLYCIGPVV-----GRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEM 298
+Y +GP++ G + + + + L WLD +P SV+FLCFGS+G F Q+KE+
Sbjct: 242 -VYPVGPILNVKSEGNQVSSEKSKQKSDILEWLDDQPPSSVVFLCFGSMGCFGEDQVKEI 300
Query: 299 AIGLERSGVKFLWVVRAPAPDSV---ENRSSLESLLPEGFLDRTKDRGLVVESWAPQVEV 355
A LE+ G++FLW +R P+ + + + + +++LPEGFLDRT D G V+ WAPQ+ +
Sbjct: 301 AHALEQGGIRFLWSLRQPSKEKIGFPSDYTDYKAVLPEGFLDRTTDLGKVI-GWAPQLAI 359
Query: 356 LNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIKAVVVEEMKVGLAVTRSEEG 415
L H +VGGFV+HCGWNS LE + GVP+ WP YAEQ++ +V+E+K+ + +
Sbjct: 360 LAHPAVGGFVSHCGWNSTLESIWYGVPIATWPFYAEQQVNAFELVKELKLAVEIDMGYRK 419
Query: 416 DG--LVSSAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSSRVALDNLVESFK 473
D +VS +E+ + E+M+ E +++R M + + A+ + GSS +L ++ +
Sbjct: 420 DSGVIVSRENIEKGIKEVMEQES--ELRKRVKEMSQMSRKALEEDGSSYSSLGRFLDQIQ 477
>sp|Q9LSY8|U71B2_ARATH UDP-glycosyltransferase 71B2 OS=Arabidopsis thaliana GN=UGT71B2
PE=1 SV=1
Length = 485
Score = 233 bits (594), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 168/507 (33%), Positives = 260/507 (51%), Gaps = 56/507 (11%)
Query: 1 MKDTIVLYTSPGRGHLNSMVELGKLILTYHPCFSIDIII--PTAPFVTSAGTDDYIASVS 58
MK +V SPG GHL +VE+ KL + SI III F +S + + S
Sbjct: 1 MKLELVFIPSPGDGHLRPLVEVAKLHVDRDDHLSITIIIIPQMHGFSSSNSSSYIASLSS 60
Query: 59 ATAPSVTFHQLPPPVSRIPDTLRSPADFPALVYELGELNNPNLHETLITISKR------S 112
+ ++++ L P PD+ + F ++ + P + T+ ++ S
Sbjct: 61 DSEERLSYNVLSVPDK--PDSDDTKPHF----FDYIDNFKPQVKATVEKLTDPGPPDSPS 114
Query: 113 NLKAFVIDFLCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLH--KNTTKS-FREL 169
L FV+D C V++ +P+Y ++T+ + L +++ L+ KN S ++
Sbjct: 115 RLAGFVVDMFCMMMIDVANE-FGVPSYMFYTSNATFLGLQVHVEYLYDVKNYDVSDLKDS 173
Query: 170 GSALLNFPG---------FPPFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGIIVNTF 220
+ L P FP LP+ R+ + ++ ++ GI+VNTF
Sbjct: 174 DTTELEVPCLTRPLPVKCFPSVLLTKEWLPVMFRQTRRFR----------ETKGILVNTF 223
Query: 221 ELLQERAIKAMLEGQCIPGETLPPLYCIGPVVG-RGNGENRGRDRH-ECLSWLDSKPSRS 278
L+ +A+K G P LP +Y +GPV+ + NG N D+ E L WLD +P +S
Sbjct: 224 AELEPQAMK-FFSGVDSP---LPTVYTVGPVMNLKINGPNSSDDKQSEILRWLDEQPRKS 279
Query: 279 VLFLCFGSLGSFSSKQLKEMAIGLERSGVKFLWVVRAPAPDSV----ENRSSLESLLPEG 334
V+FLCFGS+G F Q KE+AI LERSG +F+W +R P E ++LE +LPEG
Sbjct: 280 VVFLCFGSMGGFREGQAKEIAIALERSGHRFVWSLRRAQPKGSIGPPEEFTNLEEILPEG 339
Query: 335 FLDRTKDRGLVVESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKM 394
FL+RT + G +V WAPQ +L + ++GGFV+HCGWNS LE + GVPM WPLYAEQ++
Sbjct: 340 FLERTAEIGKIV-GWAPQSAILANPAIGGFVSHCGWNSTLESLWFGVPMATWPLYAEQQV 398
Query: 395 IKAVVVEEMKVGLAVTRSEEG------DGLVSSAELEQRVSELMDSEKGRAVKERAVAMK 448
+VEE+ + + V S G D L+++ E+E+ + LM E+ V+ R M
Sbjct: 399 NAFEMVEELGLAVEVRNSFRGDFMAADDELMTAEEIERGIRCLM--EQDSDVRSRVKEMS 456
Query: 449 EAAAAAMRDGGSSRVALDNLVESFKRG 475
E + A+ DGGSS VAL ++ +
Sbjct: 457 EKSHVALMDGGSSHVALLKFIQDVTKN 483
>sp|Q40285|UFOG2_MANES Anthocyanidin 3-O-glucosyltransferase 2 (Fragment) OS=Manihot
esculenta GN=GT2 PE=2 SV=1
Length = 346
Score = 233 bits (593), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 139/367 (37%), Positives = 214/367 (58%), Gaps = 34/367 (9%)
Query: 122 LCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKS---FRELGSALLNFPG 178
C P ++ IP+Y +F + G L LY+ +H + F++ + L+
Sbjct: 1 FCTPMMDLADE-FGIPSYIFFASGGGFLGFMLYVQKIHDEENFNPIEFKDSDTELIVPSL 59
Query: 179 FPPFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGIIVNTFELLQERAIKAMLEGQCIP 238
PFP R + + ++E + L+ + ++ GIIVNTF L+ RAI++
Sbjct: 60 VNPFPTRILPSSILNKER--FGQLLAIAKKFRQAKGIIVNTFLELESRAIESF------- 110
Query: 239 GETLPPLYCIGPVVGRGNGENRGRDRH-ECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKE 297
+PPLY +GP++ + ++ GR+ H E + WLD +P SV+FLCFGS+GSFS QLKE
Sbjct: 111 --KVPPLYHVGPIL---DVKSDGRNTHPEIMQWLDDQPEGSVVFLCFGSMGSFSEDQLKE 165
Query: 298 MAIGLERSGVKFLWVVR-APAPDSVENRSSLES---LLPEGFLDRTKDRGLVVESWAPQV 353
+A LE SG +FLW +R P PD + + + E +LPEGFL+RT G V+ WAPQV
Sbjct: 166 IAYALENSGHRFLWSIRRPPPPDKIASPTDYEDPRDVLPEGFLERTVAVGKVI-GWAPQV 224
Query: 354 EVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIKAVVVEEMKVGLAVT--- 410
VL H ++GGFV+HCGWNSVLE + GVP+ WP+YAEQ+ +V E+ +G+ +
Sbjct: 225 AVLAHPAIGGFVSHCGWNSVLESLWFGVPIATWPMYAEQQFNAFEMVVELGLGVEIDMGY 284
Query: 411 RSEEGDGLVSSAELEQRVSELMDS--EKGRAVKERAVAMKEAAAAAMRDGGSSRVALDNL 468
R E G +V+S ++E+ + +LM++ EK + VKE M+E + A+ DGGSS ++L +
Sbjct: 285 RKESGI-IVNSDKIERAIRKLMENSDEKRKKVKE----MREKSKMALIDGGSSFISLGDF 339
Query: 469 VESFKRG 475
++ G
Sbjct: 340 IKDAMEG 346
>sp|Q9LSY9|U71B1_ARATH UDP-glycosyltransferase 71B1 OS=Arabidopsis thaliana GN=UGT71B1
PE=2 SV=1
Length = 473
Score = 231 bits (590), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 155/494 (31%), Positives = 251/494 (50%), Gaps = 52/494 (10%)
Query: 1 MKDTIVLYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVTSAGTDDYIASVSAT 60
MK +V SPG GH+ + L KL++ S+ +I+ + S +DD +SV
Sbjct: 1 MKVELVFIPSPGVGHIRATTALAKLLVASDNRLSVTLIV-----IPSRVSDDASSSVYTN 55
Query: 61 APSVTFHQLPPPVSRIPDTLRSPADFPALVYELGELNNPNLHETL------ITISKRSNL 114
+ + L P + D LV + + P + + ++ S L
Sbjct: 56 SEDRLRYILLPARDQTTD----------LVSYI-DSQKPQVRAVVSKVAGDVSTRSDSRL 104
Query: 115 KAFVIDFLCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSALL 174
V+D C ++ ++ Y ++T+ S L ++ +L+ E +
Sbjct: 105 AGIVVDMFCTSMIDIADE-FNLSAYIFYTSNASYLGLQFHVQSLYDEKELDVSEFKDTEM 163
Query: 175 NF--PGF-PPFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGIIVNTFELLQERAIKAM 231
F P PFPA+ LP K + ++ + GI+VN+ ++ +A+
Sbjct: 164 KFDVPTLTQPFPAK--CLPSVMLNKKWFPYVLGRARSFRATKGILVNSVADMEPQALSFF 221
Query: 232 LEGQCIPGET-LPPLYCIGPVVGRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSF 290
G G T +PP+Y +GP++ + + + R E L WL +P++SV+FLCFGS+G F
Sbjct: 222 SGGN---GNTNIPPVYAVGPIMDLESSGDEEK-RKEILHWLKEQPTKSVVFLCFGSMGGF 277
Query: 291 SSKQLKEMAIGLERSGVKFLWVVRAPAPDSVENRSS--------LESLLPEGFLDRTKDR 342
S +Q +E+A+ LERSG +FLW +R +P V N+S+ LE +LP+GFLDRT +
Sbjct: 278 SEEQAREIAVALERSGHRFLWSLRRASP--VGNKSNPPPGEFTNLEEILPKGFLDRTVEI 335
Query: 343 GLVVESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIKAVVVEE 402
G ++ SWAPQV+VLN ++G FVTHCGWNS+LE + GVPM AWP+YAEQ+ +V+E
Sbjct: 336 GKII-SWAPQVDVLNSPAIGAFVTHCGWNSILESLWFGVPMAAWPIYAEQQFNAFHMVDE 394
Query: 403 MKVGLAVTRSEEGDGL------VSSAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMR 456
+ + V + D L V++ E+E+ + M E+ +++R + MK+ A+
Sbjct: 395 LGLAAEVKKEYRRDFLVEEPEIVTADEIERGIKCAM--EQDSKMRKRVMEMKDKLHVALV 452
Query: 457 DGGSSRVALDNLVE 470
DGGSS AL V+
Sbjct: 453 DGGSSNCALKKFVQ 466
>sp|O23205|U72C1_ARATH UDP-glycosyltransferase 72C1 OS=Arabidopsis thaliana GN=UGT72C1
PE=2 SV=3
Length = 457
Score = 229 bits (585), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 156/469 (33%), Positives = 243/469 (51%), Gaps = 49/469 (10%)
Query: 7 LYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVTSAGTDDYIASVSATAPSVTF 66
L SPG GH ++ELGK +L +H + + + T DD S S ++
Sbjct: 7 LVASPGMGHAVPILELGKHLLNHHGFDRVTVFLVT---------DDVSRSKSLIGKTL-M 56
Query: 67 HQLPPPVSR-IPDTLRSPADFPALVYELGELNN---PNLHETLITISKRSNLKAFVIDFL 122
+ P V R IP + +L+ +L E+ P + +++ + R + FV+D L
Sbjct: 57 EEDPKFVIRFIPLDVSGQDLSGSLLTKLAEMMRKALPEIKSSVMELEPRP--RVFVVDLL 114
Query: 123 CNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTT-KSFRELGSALLNFPGFPP 181
A +V+ + + TT+ LA +Y+ +L K K +G+ L+ PG P
Sbjct: 115 GTEALEVAKELGIMRKHVLVTTSAWFLAFTVYMASLDKQELYKQLSSIGALLI--PGCSP 172
Query: 182 FPARDMALPMHDREGKVYKGLVDT---GIQMAKSAGIIVNTFELLQERAIKAMLE----G 234
P K + L ++ G ++ + G+ VNT+ L++ I + L+ G
Sbjct: 173 VKFERAQDPR-----KYIRELAESQRIGDEVITADGVFVNTWHSLEQVTIGSFLDPENLG 227
Query: 235 QCIPGETLPPLYCIGPVVGRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQ 294
+ + G P+Y +GP+V + H L WLD +P SV+++ FGS G+ + +Q
Sbjct: 228 RVMRG---VPVYPVGPLVRPAEPGLK----HGVLDWLDLQPKESVVYVSFGSGGALTFEQ 280
Query: 295 LKEMAIGLERSGVKFLWVVRAPAPDS--------VENRSSLESLLPEGFLDRTKDRGLVV 346
E+A GLE +G +F+WVVR PA D +N + LP GFLDRTKD GLVV
Sbjct: 281 TNELAYGLELTGHRFVWVVRPPAEDDPSASMFDKTKNETEPLDFLPNGFLDRTKDIGLVV 340
Query: 347 ESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIKAVVVEEMKVG 406
+WAPQ E+L H+S GGFVTHCGWNSVLE + GVPM+AWPLY+EQKM +V E+K+
Sbjct: 341 RTWAPQEEILAHKSTGGFVTHCGWNSVLESIVNGVPMVAWPLYSEQKMNARMVSGELKIA 400
Query: 407 LAVTRSEEGDGLVSSAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAM 455
L + + DG+V + + V +MD E+G+ +++ +K+ A A+
Sbjct: 401 LQINVA---DGIVKKEVIAEMVKRVMDEEEGKEMRKNVKELKKTAEEAL 446
>sp|Q40284|UFOG1_MANES Anthocyanidin 3-O-glucosyltransferase 1 OS=Manihot esculenta GN=GT1
PE=2 SV=1
Length = 449
Score = 229 bits (585), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 159/478 (33%), Positives = 258/478 (53%), Gaps = 55/478 (11%)
Query: 14 GHLNSMVELGKLILTYHPCFSIDIIIPTAPFVTSAGTDDYIASVSATAPSVTFHQLPPPV 73
GHL S VE KL+L+ SI ++I VTS + + +++++ + F LP
Sbjct: 2 GHLVSAVETAKLLLSRCHSLSITVLIFNNSVVTSKVHNYVDSQIASSSNRLRFIYLPRDE 61
Query: 74 SRIPDTLRSPADFPALVYELGELNNPNLHETLITISKRSN------LKAFVIDFLCNPAF 127
+ I + F +L+ E P++ E+++ I++ + L F++D C
Sbjct: 62 TGI-------SSFSSLI----EKQKPHVKESVMKITEFGSSVESPRLVGFIVDMFCTAMI 110
Query: 128 QVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSA--LLNFPGF-PPFPA 184
V++ +P+Y ++T+ + L L++ +H + E ++ L PG FP+
Sbjct: 111 DVANE-FGVPSYIFYTSGAAFLNFMLHVQKIHDEENFNPTEFNASDGELQVPGLVNSFPS 169
Query: 185 RDMALPMHDREGKVYKGLVDTGIQMAKSAGIIVNTFELLQERAIKAMLEGQCIPGETLPP 244
+ A+P + + L++ + ++ G+I+NTF L+ AI++ + PP
Sbjct: 170 K--AMPTAILSKQWFPPLLENTRRYGEAKGVIINTFFELESHAIESFKD---------PP 218
Query: 245 LYCIGPVVG-RGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEMAIGLE 303
+Y +GP++ R NG N + E + WLD +P SV+FLCFGS GSFS Q+KE+A LE
Sbjct: 219 IYPVGPILDVRSNGRNTNQ---EIMQWLDDQPPSSVVFLCFGSNGSFSKDQVKEIACALE 275
Query: 304 RSGVKFLWVV---RAPA-PDSVENRSSLESLLPEGFLDRTKDRGLVVESWAPQVEVLNHE 359
SG +FLW + RAP +S + L+ +LPEGFL+RT V+ WAPQV VL H
Sbjct: 276 DSGHRFLWSLADHRAPGFLESPSDYEDLQEVLPEGFLERTSGIEKVI-GWAPQVAVLAHP 334
Query: 360 SVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIKAVVVEEMKVGLAVT-----RSEE 414
+ GG V+H GWNS+LE + GVP+ WP+YAEQ+ +V E+ GLAV R++
Sbjct: 335 ATGGLVSHSGWNSILESIWFGVPVATWPMYAEQQFNAFQMVIEL--GLAVEIKMDYRNDS 392
Query: 415 GDGLVSSAELEQRVSELM--DSEKGRAVKERAVAMKEAAAAAMRDGGSSRVALDNLVE 470
G+ +V ++E+ + LM DS++ + VKE M E + A+ +GGSS LDNL++
Sbjct: 393 GE-IVKCDQIERGIRCLMKHDSDRRKKVKE----MSEKSRGALMEGGSSYCWLDNLIK 445
>sp|Q9FE68|U71C5_ARATH UDP-glycosyltransferase 71C5 OS=Arabidopsis thaliana GN=UGT71C5
PE=2 SV=1
Length = 480
Score = 226 bits (576), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 157/486 (32%), Positives = 245/486 (50%), Gaps = 46/486 (9%)
Query: 11 PGRGHLNSMVELGKLILTYHPCFS-IDIIIPTAPFVTSAGTDDYIASVSATAPSVTFHQL 69
P GHL S +E GK +L S I I+ P+ A D +AS++A+ P + L
Sbjct: 12 PETGHLLSTIEFGKRLLNLDRRISMITILSMNLPYAPHA--DASLASLTASEPGIRIISL 69
Query: 70 P----PPVSRIPDTLRSPADFPALVYELGELNNPNLHETLITI--------SKRSNLKAF 117
P PP ++ DT + + N P L +T+ + S++
Sbjct: 70 PEIHDPPPIKLLDTSSE-----TYILDFIHKNIPCLRKTIQDLVSSSSSSGGGSSHVAGL 124
Query: 118 VIDFLCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFREL-GSALLNF 176
++DF C + +++P+Y + T+ L YLP + T F E G L+
Sbjct: 125 ILDFFCVGLIDIGRE-VNLPSYIFMTSNFGFLGVLQYLPERQRLTPSEFDESSGEEELHI 183
Query: 177 PGF-PPFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGIIVNTFELLQERAIKAMLEGQ 235
P F PA+ + + D+ Y LV G ++ ++ GI+VN+F ++ A + +G+
Sbjct: 184 PAFVNRVPAKVLPPGVFDKLS--YGSLVKIGERLHEAKGILVNSFTQVEPYAAEHFSQGR 241
Query: 236 CIPGETLPPLYCIGPVV---GRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSS 292
P +Y +GPV+ GR N E + WLD +P SVLFLCFGS+G F +
Sbjct: 242 -----DYPHVYPVGPVLNLTGRTNPGLASAQYKEMMKWLDEQPDSSVLFLCFGSMGVFPA 296
Query: 293 KQLKEMAIGLERSGVKFLWVVRAPAPDSVENRSSLESLLPEGFLDRTKDRGLVVESWAPQ 352
Q+ E+A LE G +F+W +R ++ + LPEGF+DRT RG+V SWAPQ
Sbjct: 297 PQITEIAHALELIGCRFIWAIRT----NMAGDGDPQEPLPEGFVDRTMGRGIVC-SWAPQ 351
Query: 353 VEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIKAVVVEEMKVGLAVTRS 412
V++L H++ GGFV+HCGWNSV E + GVP+ WP+YAEQ++ +V+E+ + + +
Sbjct: 352 VDILAHKATGGFVSHCGWNSVQESLWYGVPIATWPMYAEQQLNAFEMVKELGLAVEIRLD 411
Query: 413 EEGDG------LVSSAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSSRVALD 466
DG +VS+ E+ V LMDS+ V+++ + A A+ DGGSS VA
Sbjct: 412 YVADGDRVTLEIVSADEIATAVRSLMDSDN--PVRKKVIEKSSVARKAVGDGGSSTVATC 469
Query: 467 NLVESF 472
N ++
Sbjct: 470 NFIKDI 475
>sp|Q9LML7|U71C3_ARATH UDP-glycosyltransferase 71C3 OS=Arabidopsis thaliana GN=UGT71C3
PE=2 SV=1
Length = 476
Score = 223 bits (568), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 163/493 (33%), Positives = 247/493 (50%), Gaps = 47/493 (9%)
Query: 5 IVLYTSPGRGHLNSMVELGK-LILTYHPCFSIDIIIPTAPFVTSAGTDDYIASVSATAPS 63
I+ T P GHL +E K LI +I I+ P A + S+ A+ P
Sbjct: 7 IIFVTYPSPGHLLVSIEFAKSLIKRDDRIHTITILYWALPLAPQAHL--FAKSLVASQPR 64
Query: 64 VTFHQLP-----PPVSRIPDTLRSPADFPALVYELGELNNPNLHETLITI------SKRS 112
+ LP PP+ ++P A + E + P + + L T+ S
Sbjct: 65 IRLLALPDVQNPPPLELF---FKAPE---AYILESTKKTVPLVRDALSTLVSSRKESGSV 118
Query: 113 NLKAFVIDFLCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFR-ELGS 171
+ VIDF C P +V++ L++P+Y + T L+ YLP H+ TT G+
Sbjct: 119 RVVGLVIDFFCVPMIEVANE-LNLPSYIFLTCNAGFLSMMKYLPERHRITTSELDLSSGN 177
Query: 172 ALLNFPGFP-PFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGIIVNTFELLQERAIKA 230
PG+ P + + + RE Y+ V+ + + GI+VN+ L++ A
Sbjct: 178 VEHPIPGYVCSVPTKVLPPGLFVRES--YEAWVEIAEKFPGAKGILVNSVTCLEQNAFDY 235
Query: 231 MLEGQCIPGETLPPLYCIGPVVG---RGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSL 287
E PP+Y +GPV+ R + DR + WL+ +P S++++CFGSL
Sbjct: 236 FARLD----ENYPPVYPVGPVLSLKDRPSPNLDASDRDRIMRWLEDQPESSIVYICFGSL 291
Query: 288 GSFSSKQLKEMAIGLERSGVKFLWVVRAPAPDSVENRSSLESLLPEGFLDRTKDRGLVVE 347
G Q++E+A LE +G +FLW +R + ++S LLPEGFLDRT +GLV +
Sbjct: 292 GIIGKLQIEEIAEALELTGHRFLWSIRT----NPTEKASPYDLLPEGFLDRTASKGLVCD 347
Query: 348 SWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIKAVVVEEMKVGL 407
WAPQVEVL H+++GGFV+HCGWNSVLE + GVP+ WP+YAEQ++ +V+E+ GL
Sbjct: 348 -WAPQVEVLAHKALGGFVSHCGWNSVLESLWFGVPIATWPMYAEQQLNAFSMVKEL--GL 404
Query: 408 AVTR-----SEEGDGLVSSAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSSR 462
AV S G+ +V + E+ + LMD E ++R M EAA A+ DGGSS
Sbjct: 405 AVELRLDYVSAYGE-IVKAEEIAGAIRSLMDGED--TPRKRVKEMAEAARNALMDGGSSF 461
Query: 463 VALDNLVESFKRG 475
VA+ ++ G
Sbjct: 462 VAVKRFLDELIGG 474
>sp|O82382|U71C2_ARATH UDP-glycosyltransferase 71C2 OS=Arabidopsis thaliana GN=UGT71C2
PE=1 SV=1
Length = 474
Score = 223 bits (568), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 158/482 (32%), Positives = 249/482 (51%), Gaps = 34/482 (7%)
Query: 5 IVLYTSPGRGHLNSMVELGKLILTYHP--CFSIDIIIPTAPFVTSAGTDDYIASVSATAP 62
++ P GH+ + +EL K ++++ P +I I+ + PF+ + T ++ S+ T
Sbjct: 9 LIFIPFPIPGHILATIELAKRLISHQPSRIHTITILHWSLPFLPQSDTIAFLKSLIETES 68
Query: 63 SVTFHQLPPPVSRIPDTLRSPADFPALVYELGELNNPNLHETLITI------SKRSNLKA 116
+ LP + P L A + + E + P + L T+ S ++
Sbjct: 69 RIRLITLPDVQNPPPMELFVKAS-ESYILEYVKKMVPLVRNALSTLLSSRDESDSVHVAG 127
Query: 117 FVIDFLCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSF-RELGSALLN 175
V+DF C P V + ++P+Y + T + S L YL ++ T R ++
Sbjct: 128 LVLDFFCVPLIDVGNE-FNLPSYIFLTCSASFLGMMKYLLERNRETKPELNRSSDEETIS 186
Query: 176 FPGF-PPFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGIIVNTFELLQERAIKAMLEG 234
PGF P + LP + Y+ V+ + ++ GI+VN+FE L+ A
Sbjct: 187 VPGFVNSVPVK--VLPPGLFTTESYEAWVEMAERFPEAKGILVNSFESLERNAFDYFDRR 244
Query: 235 QCIPGETLPPLYCIGPVV---GRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFS 291
+ PP+Y IGP++ R N + RDR L WLD +P SV+FLCFGSL S +
Sbjct: 245 P----DNYPPVYPIGPILCSNDRPNLDLSERDR--ILKWLDDQPESSVVFLCFGSLKSLA 298
Query: 292 SKQLKEMAIGLERSGVKFLWVVRAPAPDSVENRSSLESLLPEGFLDRTKDRGLVVESWAP 351
+ Q+KE+A LE G++FLW +R D E S E +LP+GF++R GLV WAP
Sbjct: 299 ASQIKEIAQALELVGIRFLWSIRT---DPKEYASPNE-ILPDGFMNRVMGLGLVC-GWAP 353
Query: 352 QVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIKAVVVEEMKVGLAVTR 411
QVE+L H+++GGFV+HCGWNS+LE + GVP+ WP+YAEQ++ +V+E+ + L +
Sbjct: 354 QVEILAHKAIGGFVSHCGWNSILESLRFGVPIATWPMYAEQQLNAFTIVKELGLALEMRL 413
Query: 412 ---SEEGDGLVSSAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSSRVALDNL 468
SE G+ +V + E+ V LMD E K + +A EA A+ DGGSS VA+
Sbjct: 414 DYVSEYGE-IVKADEIAGAVRSLMDGEDVPRRKLKEIA--EAGKEAVMDGGSSFVAVKRF 470
Query: 469 VE 470
++
Sbjct: 471 ID 472
>sp|Q9LVR1|U72E2_ARATH UDP-glycosyltransferase 72E2 OS=Arabidopsis thaliana GN=UGT72E2
PE=1 SV=1
Length = 481
Score = 222 bits (565), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 147/475 (30%), Positives = 240/475 (50%), Gaps = 53/475 (11%)
Query: 7 LYTSPGRGHLNSMVELGKLILTYHPCFSIDIII-------PTAPFVTSAGTDDYIASVSA 59
+++SPG GH+ ++ELGK L+ + F + + + + F+ S G D
Sbjct: 10 MFSSPGMGHVIPVIELGKR-LSANNGFHVTVFVLETDAASAQSKFLNSTGVD------IV 62
Query: 60 TAPSVTFHQLPPP----VSRIPDTLRSPADFPALVYELGELNNPNLHETLITISKRSNLK 115
PS + L P V++I +R A PAL ++ ++
Sbjct: 63 KLPSPDIYGLVDPDDHVVTKIGVIMR--AAVPALRSKIAAMHQ--------------KPT 106
Query: 116 AFVIDFLCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSALLN 175
A ++D A ++ ++ +Y + T L ++Y P L K+ K + L
Sbjct: 107 ALIVDLFGTDALCLAKE-FNMLSYVFIPTNARFLGVSIYYPNLDKDI-KEEHTVQRNPLA 164
Query: 176 FPGFPPFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGIIVNTFELLQERAIKAMLEGQ 235
PG P D + VY+ V G+ K+ GI+VNT+E ++ +++K++L +
Sbjct: 165 IPGCEPVRFEDTLDAYLVPDEPVYRDFVRHGLAYPKADGILVNTWEEMEPKSLKSLLNPK 224
Query: 236 CIPGETLPPLYCIGPVVGRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQL 295
+ P+Y IGP+ H L WL+ +P+ SVL++ FGS G S+KQL
Sbjct: 225 LLGRVARVPVYPIGPLC---RPIQSSETDHPVLDWLNEQPNESVLYISFGSGGCLSAKQL 281
Query: 296 KEMAIGLERSGVKFLWVVRAPAPDSV----------ENRSSLESLLPEGFLDRTKDRGLV 345
E+A GLE+S +F+WVVR P S + LPEGF+ RT DRG V
Sbjct: 282 TELAWGLEQSQQRFVWVVRPPVDGSCCSEYVSANGGGTEDNTPEYLPEGFVSRTSDRGFV 341
Query: 346 VESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIKAVVVEEMKV 405
V SWAPQ E+L+H +VGGF+THCGW+S LE V GVPM+AWPL+AEQ M A++ +E+ +
Sbjct: 342 VPSWAPQAEILSHRAVGGFLTHCGWSSTLESVVGGVPMIAWPLFAEQNMNAALLSDELGI 401
Query: 406 GLAVTRSEEGDGLVSSAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMR-DGG 459
+ + +E +S ++E V ++M ++G A++ + ++++A ++ DGG
Sbjct: 402 AVRLDDPKED---ISRWKIEALVRKVMTEKEGEAMRRKVKKLRDSAEMSLSIDGG 453
>sp|O82381|U71C1_ARATH UDP-glycosyltransferase 71C1 OS=Arabidopsis thaliana GN=UGT71C1
PE=1 SV=1
Length = 481
Score = 218 bits (556), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 163/494 (32%), Positives = 246/494 (49%), Gaps = 49/494 (9%)
Query: 5 IVLYTSPGRGHLNSMVELGK-LILTYHP-CFSIDIIIPTAPFVTSAGTDDYIASVSATAP 62
+V+ P GH+ + +EL K LI +P +I I+ PF+ A T ++ S+ P
Sbjct: 9 LVIIPFPFSGHILATIELAKRLISQDNPRIHTITILYWGLPFIPQADTIAFLRSLVKNEP 68
Query: 63 SVTFHQLP-----PPVSRIPDTLRSPADFPALVYELGELNNPNLHETLITI------SKR 111
+ LP PP+ + S + E + P + E L T+ S
Sbjct: 69 RIRLVTLPEVQDPPPMELFVEFAES------YILEYVKKMVPIIREALSTLLSSRDESGS 122
Query: 112 SNLKAFVIDFLCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSF-RELG 170
+ V+DF C P V + ++P+Y + T + L YLP H+ F R
Sbjct: 123 VRVAGLVLDFFCVPMIDVGNE-FNLPSYIFLTCSAGFLGMMKYLPERHREIKSEFNRSFN 181
Query: 171 SALLNFPGF-PPFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGIIVNTFELLQERAIK 229
L PG+ P + LP + Y+ V+ + ++ GI+VN++ L+ K
Sbjct: 182 EELNLIPGYVNSVPTK--VLPSGLFMKETYEPWVELAERFPEAKGILVNSYTALEPNGFK 239
Query: 230 AMLEGQCIPGETLPPLYCIGPVV---GRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGS 286
+C + P +Y IGP++ R N ++ RDR ++WLD +P SV+FLCFGS
Sbjct: 240 YF--DRC--PDNYPTIYPIGPILCSNDRPNLDSSERDR--IITWLDDQPESSVVFLCFGS 293
Query: 287 LGSFSSKQLKEMAIGLERSGVKFLWVVRAPAPDSVENRSSLESLLPEGFLDRTKDRGLVV 346
L + S+ Q+ E+A LE KF+W R + E S E+L P GF+DR D+G+V
Sbjct: 294 LKNLSATQINEIAQALEIVDCKFIWSFRT---NPKEYASPYEAL-PHGFMDRVMDQGIVC 349
Query: 347 ESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIKAVVVEEMKVG 406
WAPQVE+L H++VGGFV+HCGWNS+LE + GVP+ WP+YAEQ++ +V+E+ +
Sbjct: 350 -GWAPQVEILAHKAVGGFVSHCGWNSILESLGFGVPIATWPMYAEQQLNAFTMVKELGLA 408
Query: 407 LAVTR---SEEGDGLVSSAELEQRVSELMDSEK--GRAVKERAVAMKEAAAAAMRDGGSS 461
L + SE+GD +V + E+ V LMD VKE A A KEA DGGSS
Sbjct: 409 LEMRLDYVSEDGD-IVKADEIAGTVRSLMDGVDVPKSKVKEIAEAGKEAV-----DGGSS 462
Query: 462 RVALDNLVESFKRG 475
+A+ + G
Sbjct: 463 FLAVKRFIGDLIDG 476
>sp|Q40288|UFOG6_MANES Anthocyanidin 3-O-glucosyltransferase 6 (Fragment) OS=Manihot
esculenta GN=GT6 PE=2 SV=1
Length = 394
Score = 217 bits (552), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 136/397 (34%), Positives = 215/397 (54%), Gaps = 30/397 (7%)
Query: 92 ELGELNNPNLH--ETLITISKRSN--LKAFVIDFLCNPAFQVSSSTLSIPTYYYFTTAGS 147
+LG ++ H E + ++ RS+ L FV+D C V+ L +P Y +FT+ +
Sbjct: 5 DLGFIDKQKAHVKEAVSKLTARSDSSLAGFVLDMFCTSMIDVAKE-LGVPYYIFFTSGAA 63
Query: 148 VLAANLYLPTLHKNTTKSFREL--GSALLNFPGFP-PFPARDMALPMHDREGKVYKGL-V 203
L Y+ +H + A L+ P PAR + M ++ + Y + +
Sbjct: 64 FLGFLFYVQLIHDEQDADLTQFKDSDAELSVPSLANSLPARVLPASMLVKD-RFYAFIRI 122
Query: 204 DTGIQMAKSAGIIVNTFELLQERAIKAMLEGQCIPGETLPPLYCIGPVVGRGNGENR-GR 262
G++ AK GI+VNTF L+ A+ ++ + Q +PP+Y +GP++ N EN G
Sbjct: 123 IRGLREAK--GIMVNTFMELESHALNSLKDDQS----KIPPIYPVGPILKLSNQENDVGP 176
Query: 263 DRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEMAIGLERSGVKFLWVVRAPAP---- 318
+ E + WLD +P SV+FLCFGS+G F Q KE+A LE+S +FLW +R P P
Sbjct: 177 EGSEIIEWLDDQPPSSVVFLCFGSMGGFDMDQAKEIACALEQSRHRFLWSLRRPPPKGKI 236
Query: 319 DSVENRSSLESLLPEGFLDRTKDRGLVVESWAPQVEVLNHESVGGFVTHCGWNSVLEGVC 378
++ + +L+ +LP GF +RT G VV WAPQV +L H ++GGFV+HCGWNS+LE +
Sbjct: 237 ETSTDYENLQEILPVGFSERTAGMGKVV-GWAPQVAILEHPAIGGFVSHCGWNSILESIW 295
Query: 379 AGVPMLAWPLYAEQKMIKAVVVEEMKVGLAV----TRSEEGDGLVSSAELEQRVSELMDS 434
VP+ WPLYAEQ+ +V E+ GLAV +E + ++S+ ++E+ + +M
Sbjct: 296 FSVPIATWPLYAEQQFNAFTMVTEL--GLAVEIKMDYKKESEIILSADDIERGIKCVM-- 351
Query: 435 EKGRAVKERAVAMKEAAAAAMRDGGSSRVALDNLVES 471
E +++R M + + A+ D SS LD L+E
Sbjct: 352 EHHSEIRKRVKEMSDKSRKALMDDESSSFWLDRLIED 388
>sp|O81498|U72E3_ARATH UDP-glycosyltransferase 72E3 OS=Arabidopsis thaliana GN=UGT72E3
PE=1 SV=1
Length = 481
Score = 217 bits (552), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 143/484 (29%), Positives = 249/484 (51%), Gaps = 40/484 (8%)
Query: 7 LYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVTSAGTDDYIASVSATAPSVTF 66
+++SPG GH+ ++EL K + H F + + + T A + + S + V
Sbjct: 10 MFSSPGMGHVLPVIELAKRLSANH-GFHVTVFV----LETDAAS---VQSKLLNSTGVDI 61
Query: 67 HQLPPP-VSRIPDTLRSPADFPALVYELGELNN---PNLHETLITISKRSNLKAFVIDFL 122
LP P +S + D +V ++G + P L ++ + + N A +ID
Sbjct: 62 VNLPSPDISGLVDP------NAHVVTKIGVIMREAVPTLRSKIVAMHQ--NPTALIIDLF 113
Query: 123 CNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSALLNFPGFPPF 182
A +++ L++ TY + + L ++Y PTL + K + L PG P
Sbjct: 114 GTDALCLAAE-LNMLTYVFIASNARYLGVSIYYPTLDE-VIKEEHTVQRKPLTIPGCEPV 171
Query: 183 PARDMALPMHDREGKVYKGLVDTGIQMAKSAGIIVNTFELLQERAIKAMLEGQCIPGETL 242
D+ + VY LV + K+ GI+VNT+E ++ +++K++ + + +
Sbjct: 172 RFEDIMDAYLVPDEPVYHDLVRHCLAYPKADGILVNTWEEMEPKSLKSLQDPKLLGRVAR 231
Query: 243 PPLYCIGPVVGRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEMAIGL 302
P+Y +GP+ H WL+ +P+ SVL++ FGS GS +++QL E+A GL
Sbjct: 232 VPVYPVGPLC---RPIQSSTTDHPVFDWLNKQPNESVLYISFGSGGSLTAQQLTELAWGL 288
Query: 303 ERSGVKFLWVVRAPAPDSV----------ENRSSLESLLPEGFLDRTKDRGLVVESWAPQ 352
E S +F+WVVR P S + + LPEGF+ RT DRG ++ SWAPQ
Sbjct: 289 EESQQRFIWVVRPPVDGSSCSDYFSAKGGVTKDNTPEYLPEGFVTRTCDRGFMIPSWAPQ 348
Query: 353 VEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIKAVVVEEMKVGLAVTRS 412
E+L H++VGGF+THCGW+S LE V GVPM+AWPL+AEQ M A++ +E+ + + V
Sbjct: 349 AEILAHQAVGGFLTHCGWSSTLESVLCGVPMIAWPLFAEQNMNAALLSDELGISVRVDDP 408
Query: 413 EEGDGLVSSAELEQRVSELMDSEKGRAVKERAVAMKEAA--AAAMRDGGSSRVALDNLVE 470
+E +S +++E V ++M ++G ++ + +++ A + ++ GGS+ +L + +
Sbjct: 409 KEA---ISRSKIEAMVRKVMAEDEGEEMRRKVKKLRDTAEMSLSIHGGGSAHESLCRVTK 465
Query: 471 SFKR 474
+R
Sbjct: 466 ECQR 469
>sp|Q9LSY6|U71B6_ARATH UDP-glycosyltransferase 71B6 OS=Arabidopsis thaliana GN=UGT71B6
PE=1 SV=1
Length = 479
Score = 211 bits (536), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 147/486 (30%), Positives = 242/486 (49%), Gaps = 36/486 (7%)
Query: 1 MKDTIVLYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVTSAGTDDYIASVSAT 60
MK +V SP HL + VE+ + ++ + SI +II S + + S T
Sbjct: 1 MKIELVFIPSPAISHLMATVEMAEQLVDKNDNLSITVII------ISFSSKNTSMITSLT 54
Query: 61 APSVTFHQLPPPVSRIPDTLRSPADFPALVYELGELNNPNLHETLITISKRSNLKAFVID 120
+ + +++ + P L++ + L L ++ T+ L FV+D
Sbjct: 55 SNNRLRYEIISGGDQQPTELKATDSHIQSLKPLVRDAVAKLVDS--TLPDAPRLAGFVVD 112
Query: 121 FLCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTT---KSFRELGSALLNFP 177
C V++ +P+Y ++T+ L L++ ++ S E L P
Sbjct: 113 MYCTSMIDVANE-FGVPSYLFYTSNAGFLGLLLHIQFMYDAEDIYDMSELEDSDVELVVP 171
Query: 178 GF-PPFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGIIVNTFELLQERAIKAMLEGQC 236
P+P + + +E + V + ++ GI+VNT L+ +A+ + G
Sbjct: 172 SLTSPYPLKCLPYIFKSKEWLTF--FVTQARRFRETKGILVNTVPDLEPQALTFLSNGN- 228
Query: 237 IPGETLPPLYCIGPVV--GRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQ 294
+P Y +GP++ N + + + E L WLD +P RSV+FLCFGS+G FS +Q
Sbjct: 229 -----IPRAYPVGPLLHLKNVNCDYVDKKQSEILRWLDEQPPRSVVFLCFGSMGGFSEEQ 283
Query: 295 LKEMAIGLERSGVKFLWVVRAPAPDSVEN----RSSLESLLPEGFLDRTKDRGLVVESWA 350
++E A+ L+RSG +FLW +R +P+ + ++LE +LPEGF DRT +RG V+ WA
Sbjct: 284 VRETALALDRSGHRFLWSLRRASPNILREPPGEFTNLEEILPEGFFDRTANRGKVI-GWA 342
Query: 351 PQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIKAVVVEEMKVGLAVT 410
QV +L ++GGFV+H GWNS LE + GVPM WPLYAEQK +VEE+ + + +
Sbjct: 343 EQVAILAKPAIGGFVSHGGWNSTLESLWFGVPMAIWPLYAEQKFNAFEMVEELGLAVEIK 402
Query: 411 RSEEGDGL------VSSAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSSRVA 464
+ GD L V++ E+E+ + LM E+ V++R + E A+ DGGSS A
Sbjct: 403 KHWRGDLLLGRSEIVTAEEIEKGIICLM--EQDSDVRKRVNEISEKCHVALMDGGSSETA 460
Query: 465 LDNLVE 470
L ++
Sbjct: 461 LKRFIQ 466
>sp|Q9ZVX4|U90A1_ARATH UDP-glycosyltransferase 90A1 OS=Arabidopsis thaliana GN=UGT90A1
PE=2 SV=1
Length = 478
Score = 199 bits (506), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 142/497 (28%), Positives = 243/497 (48%), Gaps = 60/497 (12%)
Query: 5 IVLYTSPGRGHLNSMVELGKLILTYH---PCFSIDIIIPTAPFVTSAGTDDYIASVSATA 61
+VL+ +GH+ +++ G+L+L +H P ++ + T+ +I+ +
Sbjct: 10 VVLFPFMSKGHIIPLLQFGRLLLRHHRKEPTITVTVF-------TTPKNQPFISDFLSDT 62
Query: 62 PSVTFHQLPPP--VSRIPDTLRSPADFPAL-----VYELGELNNPNLHETLITISKRSNL 114
P + LP P ++ IP + + P++ +L P ETL T+ K
Sbjct: 63 PEIKVISLPFPENITGIPPGVENTEKLPSMSLFVPFTRATKLLQPFFEETLKTLPK---- 118
Query: 115 KAFVIDFLCNPAF----QVSSSTLSIPTY--YYFTTAGSVLAANLYLPTLHKNTTKSFRE 168
+ F+ + F S++ +IP + Y + + ++ +++ H+ T+ +
Sbjct: 119 ----VSFMVSDGFLWWTSESAAKFNIPRFVSYGMNSYSAAVSISVFK---HELFTEPESK 171
Query: 169 LGSALLNFPGFPPFPAR----DMALPMHDREGKVYKGLVDTGIQMAKSAGIIVNTFELLQ 224
+ + P FP + D + G + +D S G +VN+F L+
Sbjct: 172 SDTEPVTVPDFPWIKVKKCDFDHGTTEPEESGAALELSMDQIKSTTTSHGFLVNSFYELE 231
Query: 225 ERAIKAMLEGQCIPGETLPPLYCIGPVVGRGNGENRGRDRHECLSWLDSK--PSRSVLFL 282
A ++ G+ P +C+GP+ + +G + + WLD K R VL++
Sbjct: 232 ----SAFVDYNNNSGDK-PKSWCVGPLC-LTDPPKQGSAKPAWIHWLDQKREEGRPVLYV 285
Query: 283 CFGSLGSFSSKQLKEMAIGLERSGVKFLWVVRAPAPDSVENRSSLESLLPEGFLDRTKDR 342
FG+ S+KQL E+A GLE S V FLWV R +E ++ EGF DR ++
Sbjct: 286 AFGTQAEISNKQLMELAFGLEDSKVNFLWVTR----------KDVEEIIGEGFNDRIRES 335
Query: 343 GLVVESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIKAVVVEE 402
G++V W Q E+L+HESV GF++HCGWNS E +C GVP+LAWP+ AEQ + +VVEE
Sbjct: 336 GMIVRDWVDQWEILSHESVKGFLSHCGWNSAQESICVGVPLLAWPMMAEQPLNAKMVVEE 395
Query: 403 MKVGLAVTRSEEGD--GLVSSAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDG-G 459
+KVG+ V +E+G G V+ EL ++ ELM+ E G+ ++ + A AA+ +G G
Sbjct: 396 IKVGVRV-ETEDGSVKGFVTREELSGKIKELMEGETGKTARKNVKEYSKMAKAALVEGTG 454
Query: 460 SSRVALDNLVESFKRGR 476
SS LD +++ + R
Sbjct: 455 SSWKNLDMILKELCKSR 471
>sp|Q9ZWJ3|U85A2_ARATH UDP-glycosyltransferase 85A2 OS=Arabidopsis thaliana GN=UGT85A2
PE=2 SV=1
Length = 481
Score = 196 bits (498), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 145/497 (29%), Positives = 240/497 (48%), Gaps = 59/497 (11%)
Query: 2 KDTIVLYTSPGRGHLNSMVELGKLILT--YHPCFSIDIIIPTAPFVTSAGTDDYIASVSA 59
K +V P +GH+N M+++ KL+ +H F ++ + + S G +
Sbjct: 8 KQHVVCVPYPAQGHINPMMKVAKLLYAKGFHITF-VNTVYNHNRLLRSRGPN-----AVD 61
Query: 60 TAPSVTFHQLPPPVSRIPDT-LRSPADFPALVYELGELNNPNLHETLITISKRSNLK--A 116
PS F +P +P+T + D P L + E L I+ R ++ +
Sbjct: 62 GLPSFRFESIP---DGLPETDVDVTQDIPTLCESTMKHCLAPFKELLRQINARDDVPPVS 118
Query: 117 FVIDFLCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLY--------LPTLHKNTTKSFRE 168
++ C ++ L +P ++TT+ A LY L + + +
Sbjct: 119 CIVSDGCMSFTLDAAEELGVPEVLFWTTSACGFLAYLYYYRFIEKGLSPIKDESYLTKEH 178
Query: 169 LGSALLNFPGFPPFPARDMA--LPMHDREGKVYKGLVDTGIQMAKSAGIIVNTFELLQER 226
L + + P +D+ + + + + ++ + +++ II+NTF+ L+
Sbjct: 179 LDTKIDWIPSMKNLRLKDIPSFIRTTNPDDIMLNFIIREADRAKRASAIILNTFDDLEHD 238
Query: 227 AIKAMLEGQCIPGETLPPLYCIGPV-------------VGRGNGENRGRDRHECLSWLDS 273
I++M +PP+Y IGP+ +GR G N R+ ECL WL++
Sbjct: 239 VIQSM-------KSIVPPVYSIGPLHLLEKQESGEYSEIGR-TGSNLWREETECLDWLNT 290
Query: 274 KPSRSVLFLCFGSLGSFSSKQLKEMAIGLERSGVKFLWVVRAPAPDSVENRSSLESLLPE 333
K SV+++ FGS+ S+KQL E A GL +G +FLWV+R PD V + E+++P
Sbjct: 291 KARNSVVYVNFGSITVLSAKQLVEFAWGLAATGKEFLWVIR---PDLV---AGDEAMVPP 344
Query: 334 GFLDRTKDRGLVVESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQK 393
FL T DR ++ SW PQ +VL+H ++GGF+THCGWNS LE +C GVPM+ WP +AEQ+
Sbjct: 345 EFLTATADRRMLA-SWCPQEKVLSHPAIGGFLTHCGWNSTLESLCGGVPMVCWPFFAEQQ 403
Query: 394 MIKAVVVEEMKVGLAVTRSEEGDGLVSSAELEQRVSELMDSEKGRAVKERAVAMKE-AAA 452
+E +VG+ + G V E+E V ELMD EKG+ ++E+A + A
Sbjct: 404 TNCKFSRDEWEVGIEI------GGDVKREEVEAVVRELMDEEKGKNMREKAEEWRRLANE 457
Query: 453 AAMRDGGSSRVALDNLV 469
A GSS++ + LV
Sbjct: 458 ATEHKHGSSKLNFEMLV 474
>sp|Q9SK82|U85A1_ARATH UDP-glycosyltransferase 85A1 OS=Arabidopsis thaliana GN=UGT85A1
PE=1 SV=1
Length = 489
Score = 194 bits (494), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 154/507 (30%), Positives = 242/507 (47%), Gaps = 64/507 (12%)
Query: 2 KDTIVLYTSPGRGHLNSMVELGKLILT--YHPCFSIDIIIPTAPFVTSAGTDDYIASVSA 59
K +V P +GH+N M+ + KL+ ++ F ++ + F+ S G++
Sbjct: 11 KPHVVCVPYPAQGHINPMMRVAKLLHARGFYVTF-VNTVYNHNRFLRSRGSNALDG---- 65
Query: 60 TAPSVTFHQLPPPVSRIPDT-LRSPADFPALVYELGELNNPNLHETLITISKRSNL---K 115
PS F + +P+T + + D AL + E L I+ N+
Sbjct: 66 -LPSFRFESI---ADGLPETDMDATQDITALCESTMKNCLAPFRELLQRINAGDNVPPVS 121
Query: 116 AFVIDFLCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLY--------LPTLHKNTTKSFR 167
V D + V+ L +P ++TT+G A L+ L L + +
Sbjct: 122 CIVSDGCMSFTLDVAEE-LGVPEVLFWTTSGCAFLAYLHFYLFIEKGLCPLKDESYLTKE 180
Query: 168 ELGSALLNF-PGFPPFPARDMALPMHDREGKVYKGLVDTGIQ----MAKSAGIIVNTFEL 222
L +++F P +D +P R ++ ++ +++ II+NTF+
Sbjct: 181 YLEDTVIDFIPTMKNVKLKD--IPSFIRTTNPDDVMISFALRETERAKRASAIILNTFDD 238
Query: 223 LQERAIKAMLEGQCIPGETLPPLYCIGPVVGRGNGE------------NRGRDRHECLSW 270
L+ + AM Q I LPP+Y +GP+ N E N ++ ECL W
Sbjct: 239 LEHDVVHAM---QSI----LPPVYSVGPLHLLANREIEEGSEIGMMSSNLWKEEMECLDW 291
Query: 271 LDSKPSRSVLFLCFGSLGSFSSKQLKEMAIGLERSGVKFLWVVRAPAPDSVENRSSLESL 330
LD+K SV+++ FGS+ S KQL E A GL SG +FLWV+R PD V E++
Sbjct: 292 LDTKTQNSVIYINFGSITVLSVKQLVEFAWGLAGSGKEFLWVIR---PDLVAGE---EAM 345
Query: 331 LPEGFLDRTKDRGLVVESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYA 390
+P FL TKDR ++ SW PQ +VL+H ++GGF+THCGWNS+LE + GVPM+ WP +A
Sbjct: 346 VPPDFLMETKDRSMLA-SWCPQEKVLSHPAIGGFLTHCGWNSILESLSCGVPMVCWPFFA 404
Query: 391 EQKMIKAVVVEEMKVGLAVTRSEEGDGLVSSAELEQRVSELMDSEKGRAVKERAVAMKEA 450
+Q+M +E VG+ + G V E+E V ELMD EKG+ ++E+AV +
Sbjct: 405 DQQMNCKFCCDEWDVGIEI------GGDVKREEVEAVVRELMDGEKGKKMREKAVEWQRL 458
Query: 451 AAAAMRDG-GSSRVALDNLVESFKRGR 476
A A GSS + + +V F G+
Sbjct: 459 AEKATEHKLGSSVMNFETVVSKFLLGQ 485
>sp|Q9SY84|U90A2_ARATH UDP-glycosyltransferase 90A2 OS=Arabidopsis thaliana GN=UGT90A2
PE=2 SV=1
Length = 467
Score = 193 bits (490), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 145/484 (29%), Positives = 239/484 (49%), Gaps = 45/484 (9%)
Query: 5 IVLYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVTSAGTDDYIASVSATAPSV 64
+VL+ +GH+ M++L +L+L++ I + + T P D S+S T ++
Sbjct: 8 VVLFPYLSKGHMIPMLQLARLLLSHSFAGDISVTVFTTPLNRPFIVD----SLSGTKATI 63
Query: 65 TFHQLPPPVSRIPDTLRSPADFPALVYELGELNNPNLHETLITISKRSNLKAFVIDFLCN 124
P V IP + PAL L + +++ +R + + F+ +
Sbjct: 64 VDVPFPDNVPEIPPGVECTDKLPALSSSLF-VPFTRATKSMQADFERELMSLPRVSFMVS 122
Query: 125 PAF----QVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSFRELGSALLNFPGFP 180
F Q S+ L P +F G A+ + ++ +N S + + ++ P FP
Sbjct: 123 DGFLWWTQESARKLGFPRLVFF---GMNCASTVICDSVFQNQLLSNVKSETEPVSVPEFP 179
Query: 181 PFPAR--DMALPMHDREGKV---YKGLVDTGIQMAKSAGIIVNTFELLQERAIKAMLEGQ 235
R D M D + +K ++D M +S GII NTF+ L+ I +
Sbjct: 180 WIKVRKCDFVKDMFDPKTTTDPGFKLILDQVTSMNQSQGIIFNTFDDLEPVFIDFYKRKR 239
Query: 236 CIPGETLPPLYCIGPVVGRGN---GENRGRDRHECLSWLDSKPSR--SVLFLCFGSLGSF 290
+ L+ +GP+ N E + + + WLD K + +VL++ FGS
Sbjct: 240 KLK------LWAVGPLCYVNNFLDDEVEEKVKPSWMKWLDEKRDKGCNVLYVAFGSQAEI 293
Query: 291 SSKQLKEMAIGLERSGVKFLWVVRAPAPDSVENRSSLESLLPEGFLDRTKDRGLVV-ESW 349
S +QL+E+A+GLE S V FLWVV+ + + +GF +R +RG++V + W
Sbjct: 294 SREQLEEIALGLEESKVNFLWVVKG-------------NEIGKGFEERVGERGMMVRDEW 340
Query: 350 APQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIKAVVVEEMKVGLAV 409
Q ++L HESV GF++HCGWNS+ E +C+ VP+LA+PL AEQ + +VVEE++V V
Sbjct: 341 VDQRKILEHESVRGFLSHCGWNSLTESICSEVPILAFPLAAEQPLNAILVVEELRVAERV 400
Query: 410 TRSEEGDGLVSSAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDG-GSSRVALDNL 468
+ E G+V E+ ++V ELM+ EKG+ ++ A + A A+ +G GSSR LDNL
Sbjct: 401 VAASE--GVVRREEIAEKVKELMEGEKGKELRRNVEAYGKMAKKALEEGIGSSRKNLDNL 458
Query: 469 VESF 472
+ F
Sbjct: 459 INEF 462
>sp|Q9LMF1|U85A3_ARATH UDP-glycosyltransferase 85A3 OS=Arabidopsis thaliana GN=UGT85A3
PE=2 SV=2
Length = 488
Score = 190 bits (482), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 151/510 (29%), Positives = 249/510 (48%), Gaps = 69/510 (13%)
Query: 2 KDTIVLYTSPGRGHLNSMVELGKLILT--YHPCFSIDIIIPTAPFVTSAGTDDYIASVSA 59
K +V P +GH+N M+++ KL+ +H F ++ + + S G A+
Sbjct: 11 KPHVVCVPYPAQGHINPMMKVAKLLHVKGFHVTF-VNTVYNHNRLLRSRG-----ANALD 64
Query: 60 TAPSVTFHQLPPPVSRIPDT-LRSPADFPALVYELGELNNPN-------LHETLITISKR 111
PS F +P +P+T + + D PAL E N L + ++T
Sbjct: 65 GLPSFQFESIP---DGLPETGVDATQDIPAL----SESTTKNCLVPFKKLLQRIVTREDV 117
Query: 112 SNLKAFVIDFLCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLH-------KNTTK 164
+ V D + V+ L +P +++TT+ A L+ K+ +
Sbjct: 118 PPVSCIVSDGSMSFTLDVAEE-LGVPEIHFWTTSACGFMAYLHFYLFIEKGLCPVKDASC 176
Query: 165 SFRELGSALLNF-PGFPPFPARDMA--LPMHDREGKVYKGLVDTGIQMAKSAGIIVNTFE 221
+E ++++ P +D+ + + + +V + +++ II+NTF+
Sbjct: 177 LTKEYLDTVIDWIPSMNNVKLKDIPSFIRTTNPNDIMLNFVVREACRTKRASAIILNTFD 236
Query: 222 LLQERAIKAMLEGQCIPGETLPPLYCIGPV-------------VGRGNGENRGRDRHECL 268
L+ I++M Q I LPP+Y IGP+ +GR G N ++ ECL
Sbjct: 237 DLEHDIIQSM---QSI----LPPVYPIGPLHLLVNREIEEDSEIGR-MGSNLWKEETECL 288
Query: 269 SWLDSKPSRSVLFLCFGSLGSFSSKQLKEMAIGLERSGVKFLWVVRAPAPDSVENRSSLE 328
WL++K SV+++ FGS+ ++ QL E A GL +G +FLWV+R PDSV E
Sbjct: 289 GWLNTKSRNSVVYVNFGSITIMTTAQLLEFAWGLAATGKEFLWVMR---PDSVAGE---E 342
Query: 329 SLLPEGFLDRTKDRGLVVESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPL 388
+++P+ FL T DR ++ SW PQ +VL+H +VGGF+THCGWNS LE + GVPM+ WP
Sbjct: 343 AVIPKEFLAETADRRMLT-SWCPQEKVLSHPAVGGFLTHCGWNSTLESLSCGVPMVCWPF 401
Query: 389 YAEQKMIKAVVVEEMKVGLAVTRSEEGDGLVSSAELEQRVSELMDSEKGRAVKERAVAMK 448
+AEQ+ +E +VG+ + G V E+E V ELMD EKG+ ++E+AV +
Sbjct: 402 FAEQQTNCKFSCDEWEVGIEI------GGDVKRGEVEAVVRELMDGEKGKKMREKAVEWR 455
Query: 449 EAAAAAMR-DGGSSRVALDNLVESFKRGRM 477
A A + GSS + + +V G++
Sbjct: 456 RLAEKATKLPCGSSVINFETIVNKVLLGKI 485
>sp|Q9ZQG4|U73B5_ARATH UDP-glycosyltransferase 73B5 OS=Arabidopsis thaliana GN=UGT73B5
PE=2 SV=1
Length = 484
Score = 186 bits (472), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 93/216 (43%), Positives = 139/216 (64%), Gaps = 7/216 (3%)
Query: 257 GENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEMAIGLERSGVKFLWVVRAP 316
G+ D ECL WLDSK SV++L FGS +F++ QL E+A GLE SG F+WVVR
Sbjct: 268 GKKANIDEQECLKWLDSKTPGSVVYLSFGSGTNFTNDQLLEIAFGLEGSGQSFIWVVRKN 327
Query: 317 APDSVENRSSLESLLPEGFLDRTKDRGLVVESWAPQVEVLNHESVGGFVTHCGWNSVLEG 376
EN+ E LPEGF +RT +GL++ WAPQV +L+H+++GGFVTHCGWNS +EG
Sbjct: 328 -----ENQGDNEEWLPEGFKERTTGKGLIIPGWAPQVLILDHKAIGGFVTHCGWNSAIEG 382
Query: 377 VCAGVPMLAWPLYAEQKMIKAVVVEEMKVGLAVTRSE--EGDGLVSSAELEQRVSELMDS 434
+ AG+PM+ WP+ AEQ + ++ + +++G+ V +E + L+S A++E+ V E++
Sbjct: 383 IAAGLPMVTWPMGAEQFYNEKLLTKVLRIGVNVGATELVKKGKLISRAQVEKAVREVIGG 442
Query: 435 EKGRAVKERAVAMKEAAAAAMRDGGSSRVALDNLVE 470
EK + A + E A AA+ +GGSS ++ +E
Sbjct: 443 EKAEERRLWAKKLGEMAKAAVEEGGSSYNDVNKFME 478
>sp|Q9SCP5|U73C7_ARATH UDP-glycosyltransferase 73C7 OS=Arabidopsis thaliana GN=UGT73C7
PE=2 SV=1
Length = 490
Score = 186 bits (472), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 109/284 (38%), Positives = 161/284 (56%), Gaps = 39/284 (13%)
Query: 212 SAGIIVNTFELLQ---ERAIKAMLEGQCIPGETLPPLYCIGPV----------VGRGNGE 258
S G+IVNTFE L+ R + G+ ++C+GPV RG+
Sbjct: 215 SYGVIVNTFEELEVDYAREYRKARAGK---------VWCVGPVSLCNRLGLDKAKRGDKA 265
Query: 259 NRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEMAIGLERSGVKFLWVVRAPAP 318
+ G+D +CL WLDS+ + SVL++C GSL + QLKE+ +GLE S F+WV+R
Sbjct: 266 SIGQD--QCLQWLDSQETGSVLYVCLGSLCNLPLAQLKELGLGLEASNKPFIWVIREWG- 322
Query: 319 DSVENRSSLESLLPE-GFLDRTKDRGLVVESWAPQVEVLNHESVGGFVTHCGWNSVLEGV 377
L + + + GF +R KDRGLV++ WAPQV +L+H S+GGF+THCGWNS LEG+
Sbjct: 323 ----KYGDLANWMQQSGFEERIKDRGLVIKGWAPQVFILSHASIGGFLTHCGWNSTLEGI 378
Query: 378 CAGVPMLAWPLYAEQKMIKAVVVEEMKVGLAV--------TRSEEGDGLVSSAELEQRVS 429
AGVP+L WPL+AEQ + + +VV+ +K GL + + EE +VS + + V
Sbjct: 379 TAGVPLLTWPLFAEQFLNEKLVVQILKAGLKIGVEKLMKYGKEEEIGAMVSRECVRKAVD 438
Query: 430 ELM-DSEKGRAVKERAVAMKEAAAAAMRDGGSSRVALDNLVESF 472
ELM DSE+ + + + + A A+ GGSS + L++
Sbjct: 439 ELMGDSEEAEERRRKVTELSDLANKALEKGGSSDSNITLLIQDI 482
>sp|Q8W491|U73B3_ARATH UDP-glycosyltransferase 73B3 OS=Arabidopsis thaliana GN=UGT73B3
PE=2 SV=1
Length = 481
Score = 185 bits (470), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 111/293 (37%), Positives = 166/293 (56%), Gaps = 26/293 (8%)
Query: 192 HDREGKVYKGLVDTGIQMAKSAGIIVNTFELLQERAIKAMLEGQCIPGETLPPLYCIGP- 250
D E ++ K +++ KS+G+IVN+F L+ L + IGP
Sbjct: 201 RDEESEMGKFMIEVKESDVKSSGVIVNSFYELEPDY------ADFYKSVVLKRAWHIGPL 254
Query: 251 -VVGRGNGENRGRDRH------ECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEMAIGLE 303
V RG E R + ECL WLDSK SV+++ FGS+ F ++QL E+A GLE
Sbjct: 255 SVYNRGFEEKAERGKKASINEVECLKWLDSKKPDSVIYISFGSVACFKNEQLFEIAAGLE 314
Query: 304 RSGVKFLWVVRAPAPDSVENRSSLESLLPEGFLDRTKDRGLVVESWAPQVEVLNHESVGG 363
SG F+WVVR +E E LPEGF +R K +G+++ WAPQV +L+H++ G
Sbjct: 315 TSGANFIWVVRKNI--GIEK----EEWLPEGFEERVKGKGMIIRGWAPQVLILDHQATCG 368
Query: 364 FVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIKAVVVEEMKVGLAVTRSEE----GDGLV 419
FVTHCGWNS+LEGV AG+PM+ WP+ AEQ + +V + ++ G++V + GD +
Sbjct: 369 FVTHCGWNSLLEGVAAGLPMVTWPVAAEQFYNEKLVTQVLRTGVSVGAKKNVRTTGD-FI 427
Query: 420 SSAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSSRVALDNLVESF 472
S ++ + V E++ E+ +ERA + E A AA+ +GGSS L++ +E F
Sbjct: 428 SREKVVKAVREVLVGEEADERRERAKKLAEMAKAAV-EGGSSFNDLNSFIEEF 479
>sp|Q9ZQ96|U73C3_ARATH UDP-glycosyltransferase 73C3 OS=Arabidopsis thaliana GN=UGT73C3
PE=2 SV=1
Length = 496
Score = 184 bits (467), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 114/315 (36%), Positives = 175/315 (55%), Gaps = 32/315 (10%)
Query: 180 PPFPAR----DMALPMHDREGKVYKGLVDTGIQMA-KSAGIIVNTFELLQERAIKAMLEG 234
P FP R + LP+ +K ++D ++ S G+IVNTF+ L+ +K E
Sbjct: 184 PSFPDRVEFTKLQLPVKANASGDWKEIMDEMVKAEYTSYGVIVNTFQELEPPYVKDYKEA 243
Query: 235 QCIPGETLPPLYCIGPV-----VGRGNGENRGR---DRHECLSWLDSKPSRSVLFLCFGS 286
+ G+ ++ IGPV G E + D+ ECL WLDSK SVL++C GS
Sbjct: 244 --MDGK----VWSIGPVSLCNKAGADKAERGSKAAIDQDECLQWLDSKEEGSVLYVCLGS 297
Query: 287 LGSFSSKQLKEMAIGLERSGVKFLWVVRAPAPDSVENRSSLESLLPEGFLDRTKDRGLVV 346
+ + QLKE+ +GLE S F+WV+R S + + E +L GF +R K+RGL++
Sbjct: 298 ICNLPLSQLKELGLGLEESRRSFIWVIRG----SEKYKELFEWMLESGFEERIKERGLLI 353
Query: 347 ESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIKAVVVEEMKVG 406
+ WAPQV +L+H SVGGF+THCGWNS LEG+ +G+P++ WPL+ +Q + +VV+ +K G
Sbjct: 354 KGWAPQVLILSHPSVGGFLTHCGWNSTLEGITSGIPLITWPLFGDQFCNQKLVVQVLKAG 413
Query: 407 LA-----VTRSEEGDG---LVSSAELEQRVSELM-DSEKGRAVKERAVAMKEAAAAAMRD 457
++ V + E D LV +++ V ELM DS+ + + R + E A A+
Sbjct: 414 VSAGVEEVMKWGEEDKIGVLVDKEGVKKAVEELMGDSDDAKERRRRVKELGELAHKAVEK 473
Query: 458 GGSSRVALDNLVESF 472
GGSS + L++
Sbjct: 474 GGSSHSNITLLLQDI 488
>sp|Q8VZE9|U73B1_ARATH UDP-glycosyltransferase 73B1 OS=Arabidopsis thaliana GN=UGT73B1
PE=2 SV=1
Length = 488
Score = 182 bits (463), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 94/218 (43%), Positives = 141/218 (64%), Gaps = 13/218 (5%)
Query: 257 GENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEMAIGLERSGVKFLWVVRAP 316
G+ D HECL WLDSK SV+++ FG++ SF ++QL E+A GL+ SG F+WVV
Sbjct: 268 GKKASIDEHECLKWLDSKKCDSVIYMAFGTMSSFKNEQLIEIAAGLDMSGHDFVWVVNRK 327
Query: 317 APDSVENRSSLESLLPEGFLDRTKDRGLVVESWAPQVEVLNHESVGGFVTHCGWNSVLEG 376
++ E LPEGF ++TK +GL++ WAPQV +L H+++GGF+THCGWNS+LEG
Sbjct: 328 G-----SQVEKEDWLPEGFEEKTKGKGLIIRGWAPQVLILEHKAIGGFLTHCGWNSLLEG 382
Query: 377 VCAGVPMLAWPLYAEQKMIKAVVVEEMKVGLAVTRSEE----GDGLVSSAELEQRVSELM 432
V AG+PM+ WP+ AEQ + +V + +K G++V + GD +S ++E V E+M
Sbjct: 383 VAAGLPMVTWPVGAEQFYNEKLVTQVLKTGVSVGVKKMMQVVGD-FISREKVEGAVREVM 441
Query: 433 DSEKGRAVKERAVAMKEAAAAAMRDGGSSRVALDNLVE 470
E+ R +RA + E A A+++GGSS + +D L+E
Sbjct: 442 VGEERR---KRAKELAEMAKNAVKEGGSSDLEVDRLME 476
>sp|Q7Y232|U73B4_ARATH UDP-glycosyltransferase 73B4 OS=Arabidopsis thaliana GN=UGT73B4
PE=2 SV=1
Length = 484
Score = 182 bits (462), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 128/384 (33%), Positives = 194/384 (50%), Gaps = 36/384 (9%)
Query: 103 ETLITISKRSNLKAFVIDFLCNPAFQVSSSTLSIPTYYYFTTAGSVLAA--NLYLPTLHK 160
E+ I +K S A V D P S+ + +P + T+ L N+ + HK
Sbjct: 115 ESFIETTKPS---ALVADMFF-PWATESAEKIGVPRLVFHGTSSFALCCSYNMRIHKPHK 170
Query: 161 NTTKSFRELGSALLNFPGFPP--FPARDMALPMHDRE--GKVYKGLVDTGIQMAKSAGII 216
S S PG P D A ++ GK +K + ++ S G++
Sbjct: 171 KVASS-----STPFVIPGLPGDIVITEDQANVTNEETPFGKFWKEVRES---ETSSFGVL 222
Query: 217 VNTFELLQERAIKAMLEGQCIPGETLPPLYCIGPVV--GRGNGENRGR------DRHECL 268
VN+F L+ + IGP+ RG E GR D ECL
Sbjct: 223 VNSFYELESSY------ADFYRSFVAKKAWHIGPLSLSNRGIAEKAGRGKKANIDEQECL 276
Query: 269 SWLDSKPSRSVLFLCFGSLGSFSSKQLKEMAIGLERSGVKFLWVVRAPAPDSVENRSSLE 328
WLDSK SV++L FGS ++QL E+A GLE SG F+WVV ++ E
Sbjct: 277 KWLDSKTPGSVVYLSFGSGTGLPNEQLLEIAFGLEGSGQNFIWVV--SKNENQVGTGENE 334
Query: 329 SLLPEGFLDRTKDRGLVVESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPL 388
LP+GF +R K +GL++ WAPQV +L+H+++GGFVTHCGWNS LEG+ AG+PM+ WP+
Sbjct: 335 DWLPKGFEERNKGKGLIIRGWAPQVLILDHKAIGGFVTHCGWNSTLEGIAAGLPMVTWPM 394
Query: 389 YAEQKMIKAVVVEEMKVGLAVTRSE--EGDGLVSSAELEQRVSELMDSEKGRAVKERAVA 446
AEQ + ++ + +++G+ V +E + L+S A++E+ V E++ EK + RA
Sbjct: 395 GAEQFYNEKLLTKVLRIGVNVGATELVKKGKLISRAQVEKAVREVIGGEKAEERRLRAKE 454
Query: 447 MKEAAAAAMRDGGSSRVALDNLVE 470
+ E A AA+ +GGSS ++ +E
Sbjct: 455 LGEMAKAAVEEGGSSYNDVNKFME 478
>sp|Q9C9B0|U89B1_ARATH UDP-glycosyltransferase 89B1 OS=Arabidopsis thaliana GN=UGT89B1
PE=2 SV=2
Length = 473
Score = 179 bits (455), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 148/499 (29%), Positives = 244/499 (48%), Gaps = 76/499 (15%)
Query: 2 KDTIVLYTSPGRGHLNSMVEL-GKLILTYHPCFSIDIII--PTAPFVTSAGTDDYIASVS 58
K ++++ P +GH+ +++ +L L I +++ PF++ +++V
Sbjct: 12 KTHVLIFPFPAQGHMIPLLDFTHRLALRGGAALKITVLVTPKNLPFLSP-----LLSAVV 66
Query: 59 ATAPSV----TFHQLPPPVSRIPDTLRSPADFPALVYELGELNNPNLHETLIT--ISKRS 112
P + + +P V + D P+ FP +++ LG NLH LI+ S S
Sbjct: 67 NIEPLILPFPSHPSIPSGVENVQDL--PPSGFPLMIHALG-----NLHAPLISWITSHPS 119
Query: 113 NLKAFVIDFLCNPAFQVSSSTLSIPTYYYFTTAG--SVLAANLYLPTLHKNTTKSFRELG 170
A V DF F + L IP + + +A + L++ + TK +
Sbjct: 120 PPVAIVSDF-----FLGWTKNLGIPRFDFSPSAAITCCILNTLWI----EMPTKINEDDD 170
Query: 171 SALLNFPGFPPFPARDMALPMHDREGKVYKGLV----------DTGIQMAKSAGIIVNTF 220
+ +L+FP P P D+ +Y+ V D+ S G++VN+F
Sbjct: 171 NEILHFPKIPNCPKYRF-----DQISSLYRSYVHGDPAWEFIRDSFRDNVASWGLVVNSF 225
Query: 221 ELLQ----ERAIKAMLEGQCIPGETLPPLYCIGPVVGRGNGENRG----RDRHECLSWLD 272
++ E + M + ++ +GP++ +G+NRG +SWLD
Sbjct: 226 TAMEGVYLEHLKREMGHDR---------VWAVGPIIPL-SGDNRGGPTSVSVDHVMSWLD 275
Query: 273 SKPSRSVLFLCFGSLGSFSSKQLKEMAIGLERSGVKFLWVVRAPAPDSVENRSSLESLLP 332
++ V+++CFGS + +Q +A GLE+SGV F+W V+ P VE S+ ++L
Sbjct: 276 AREDNHVVYVCFGSQVVLTKEQTLALASGLEKSGVHFIWAVKEP----VEKDSTRGNIL- 330
Query: 333 EGFLDRTKDRGLVVESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQ 392
+GF DR RGLV+ WAPQV VL H +VG F+THCGWNSV+E V AGV ML WP+ A+Q
Sbjct: 331 DGFDDRVAGRGLVIRGWAPQVAVLRHRAVGAFLTHCGWNSVVEAVVAGVLMLTWPMRADQ 390
Query: 393 KMIKAVVVEEMKVGLAVTRSEEGDGLVSSAELEQRVSELMDSEKGRAVKE-RAVAMKEAA 451
++VV+E+KVG+ R+ EG V + RV DS G + +AV +++AA
Sbjct: 391 YTDASLVVDELKVGV---RACEGPDTVPDPDELARV--FADSVTGNQTERIKAVELRKAA 445
Query: 452 AAAMRDGGSSRVALDNLVE 470
A+++ GSS LD ++
Sbjct: 446 LDAIQERGSSVNDLDGFIQ 464
>sp|O48676|U74B1_ARATH UDP-glycosyltransferase 74B1 OS=Arabidopsis thaliana GN=UGT74B1
PE=1 SV=1
Length = 460
Score = 179 bits (454), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 152/498 (30%), Positives = 232/498 (46%), Gaps = 69/498 (13%)
Query: 1 MKDTIVLYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVTSAGTDDYIASVSAT 60
+K +V+ P +GHLN MV+ K +++ + + + I T + S S T
Sbjct: 8 VKGHVVILPYPVQGHLNPMVQFAKRLVSKN----VKVTIATTTYTAS----------SIT 53
Query: 61 APSVTFHQLPPPVSRIPDTLRSPADFPAL-VYELGELNNPNLHETLITI-----SKRSNL 114
PS++ P+S D + P P V E N ETL + S S +
Sbjct: 54 TPSLSVE----PISDGFDFI--PIGIPGFSVDTYSESFKLNGSETLTLLIEKFKSTDSPI 107
Query: 115 KAFVIDFLCNPAFQVSSSTLSIPTYYYFT---TAGSVLA--ANLYLPTLHKNTTKSFREL 169
+ D +V+ S + + +FT T SVL +N P + FR
Sbjct: 108 DCLIYDSFLPWGLEVARS-MELSAASFFTNNLTVCSVLRKFSNGDFPLPADPNSAPFRIR 166
Query: 170 GSALLNFPGFPPFPARDMALPMHDREGKVYKGLVDTGIQMAKSAGIIVNTFELLQERAIK 229
G L++ P F R H G+V L++ + + VN FE L+E
Sbjct: 167 GLPSLSYDELPSFVGRHWL--THPEHGRV---LLNQFPNHENADWLFVNGFEGLEETQ-- 219
Query: 230 AMLEGQCIPGET-LPPLYCIGPVVGRGNGENRGRD------------RHECLSWLDSKPS 276
C GE+ IGP++ ++R D EC+ WL++K +
Sbjct: 220 -----DCENGESDAMKATLIGPMIPSAYLDDRMEDDKDYGASLLKPISKECMEWLETKQA 274
Query: 277 RSVLFLCFGSLGSFSSKQLKEMAIGLERSGVKFLWVVRAPAPDSVENRSSLESLLPEGFL 336
+SV F+ FGS G KQL E+AI L+ S + FLWV++ + + LPEGF+
Sbjct: 275 QSVAFVSFGSFGILFEKQLAEVAIALQESDLNFLWVIK----------EAHIAKLPEGFV 324
Query: 337 DRTKDRGLVVESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIK 396
+ TKDR L+V SW Q+EVL HES+G F+THCGWNS LEG+ GVPM+ P +++Q
Sbjct: 325 ESTKDRALLV-SWCNQLEVLAHESIGCFLTHCGWNSTLEGLSLGVPMVGVPQWSDQMNDA 383
Query: 397 AVVVEEMKVGLAVTRSEEGDGLVSSAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMR 456
V E KVG + E G+ +V S EL + + +M+ E ++E + K+ A AM
Sbjct: 384 KFVEEVWKVGYR-AKEEAGEVIVKSEELVRCLKGVMEGESSVKIRESSKKWKDLAVKAMS 442
Query: 457 DGGSSRVALDNLVESFKR 474
+GGSS +++ +ES +
Sbjct: 443 EGGSSDRSINEFIESLGK 460
>sp|Q94C57|U73B2_ARATH UDP-glucosyl transferase 73B2 OS=Arabidopsis thaliana GN=UGT73B2
PE=1 SV=1
Length = 483
Score = 179 bits (453), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 112/293 (38%), Positives = 167/293 (56%), Gaps = 27/293 (9%)
Query: 193 DREGKVYKGLVDTGIQMAKSAGIIVNTFELLQERAIKAMLEGQCIPGETLPPLYCIGP-- 250
D E + K + + KS+G+++N+F L+ A C+ + IGP
Sbjct: 203 DGESDMGKFMTEVRESEVKSSGVVLNSFYELEHDY--ADFYKSCVQKRA----WHIGPLS 256
Query: 251 VVGRGNGEN--RGR----DRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEMAIGLER 304
V RG E RG+ D ECL WLDSK SV+++ FGS+ F ++QL E+A GLE
Sbjct: 257 VYNRGFEEKAERGKKANIDEAECLKWLDSKKPNSVIYVSFGSVAFFKNEQLFEIAAGLEA 316
Query: 305 SGVKFLWVVRAPAPDSVENRSSLESLLPEGFLDRTKDRGLVVESWAPQVEVLNHESVGGF 364
SG F+WVVR D E LPEGF +R K +G+++ WAPQV +L+H++ GGF
Sbjct: 317 SGTSFIWVVRKTKDDR-------EEWLPEGFEERVKGKGMIIRGWAPQVLILDHQATGGF 369
Query: 365 VTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIKAVVVEEMKVGLAVTRSEE-----GDGLV 419
VTHCGWNS+LEGV AG+PM+ WP+ AEQ + +V + ++ G++V S+ GD +
Sbjct: 370 VTHCGWNSLLEGVAAGLPMVTWPVGAEQFYNEKLVTQVLRTGVSVGASKHMKVMMGD-FI 428
Query: 420 SSAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSSRVALDNLVESF 472
S ++++ V E++ E + RA + A AA+ +GGSS L++ +E F
Sbjct: 429 SREKVDKAVREVLAGEAAEERRRRAKKLAAMAKAAVEEGGSSFNDLNSFMEEF 481
>sp|Q9LME8|U85A7_ARATH UDP-glycosyltransferase 85A7 OS=Arabidopsis thaliana GN=UGT85A7
PE=2 SV=1
Length = 487
Score = 178 bits (452), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 108/273 (39%), Positives = 157/273 (57%), Gaps = 34/273 (12%)
Query: 211 KSAGIIVNTFELLQERAIKAMLEGQCIPGETLPPLYCIGPV-------------VGRGNG 257
+++ II+NTF+ L+ I++M Q I LPP+Y IGP+ +G+ G
Sbjct: 226 RASAIILNTFDELEHDVIQSM---QSI----LPPVYSIGPLHLLVKEEINEASEIGQM-G 277
Query: 258 ENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEMAIGLERSGVKFLWVVRAPA 317
N R+ ECL WLD+K SVLF+ FG + S+KQL+E A GL S +FLWV+R
Sbjct: 278 LNLWREEMECLDWLDTKTPNSVLFVNFGCITVMSAKQLEEFAWGLAASRKEFLWVIR--- 334
Query: 318 PDSVENRSSLESLLPEGFLDRTKDRGLVVESWAPQVEVLNHESVGGFVTHCGWNSVLEGV 377
P+ V + + +LP+ FL T DR ++ SW PQ +VL+H ++GGF+THCGWNS LE +
Sbjct: 335 PNLVVGEAMV--VLPQEFLAETIDRRMLA-SWCPQEKVLSHPAIGGFLTHCGWNSTLESL 391
Query: 378 CAGVPMLAWPLYAEQKMIKAVVVEEMKVGLAVTRSEEGDGLVSSAELEQRVSELMDSEKG 437
GVPM+ WP ++EQ +E VG+ + + V E+E V ELMD EKG
Sbjct: 392 AGGVPMICWPCFSEQPTNCKFCCDEWGVGIEIGKD------VKREEVETVVRELMDGEKG 445
Query: 438 RAVKERAVAMKEAAAAAMR-DGGSSRVALDNLV 469
+ ++E+A + A A R GSS + L+ L+
Sbjct: 446 KKLREKAEEWRRLAEEATRYKHGSSVMNLETLI 478
>sp|P56725|ZOX_PHAVU Zeatin O-xylosyltransferase OS=Phaseolus vulgaris GN=ZOX1 PE=2 SV=1
Length = 454
Score = 178 bits (451), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 83/211 (39%), Positives = 133/211 (63%), Gaps = 1/211 (0%)
Query: 265 HECLSWLDSKPSRSVLFLCFGSLGSFSSKQLKEMAIGLERSGVKFLWVVR-APAPDSVEN 323
H C+ WLD + SV+++ FG+ + +Q++E+A GLE+S KF+WV+R A D +
Sbjct: 244 HPCMEWLDKQEPSSVIYVSFGTTTALRDEQIQELATGLEQSKQKFIWVLRDADKGDIFDG 303
Query: 324 RSSLESLLPEGFLDRTKDRGLVVESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPM 383
+ LPEGF +R + GLVV WAPQ+E+L+H S GGF++HCGWNS LE + GVPM
Sbjct: 304 SEAKRYELPEGFEERVEGMGLVVRDWAPQMEILSHSSTGGFMSHCGWNSCLESLTRGVPM 363
Query: 384 LAWPLYAEQKMIKAVVVEEMKVGLAVTRSEEGDGLVSSAELEQRVSELMDSEKGRAVKER 443
W ++++Q +V + +KVGL V E+ LVS++ +E V LM++++G +++R
Sbjct: 364 ATWAMHSDQPRNAVLVTDVLKVGLIVKDWEQRKSLVSASVIENAVRRLMETKEGDEIRKR 423
Query: 444 AVAMKEAAAAAMRDGGSSRVALDNLVESFKR 474
AV +K+ +M +GG SR+ + + + R
Sbjct: 424 AVKLKDEIHRSMDEGGVSRMEMASFIAHISR 454
>sp|Q9ZSK5|ZOG_PHALU Zeatin O-glucosyltransferase OS=Phaseolus lunatus GN=ZOG1 PE=2 SV=1
Length = 459
Score = 177 bits (450), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 132/479 (27%), Positives = 235/479 (49%), Gaps = 48/479 (10%)
Query: 2 KDTIVLYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVTSAGTDDYI--ASVSA 59
K ++L P +GHLN + L +LI+ + IP V GT +I A++
Sbjct: 13 KVVVLLIPFPAQGHLNQFLHLSRLIVAQN--------IP----VHYVGTVTHIRQATLRY 60
Query: 60 TAPSVTFHQLPPPVSRIPDTLRSPAD-FPALV---YELGELNNPNLHETLITISKRSNLK 115
P+ H V +P D FP+ + +E + + L ++S ++
Sbjct: 61 NNPTSNIHFHAFQVPPFVSPPPNPEDDFPSHLIPSFEASAHLREPVGKLLQSLSSQAKRV 120
Query: 116 AFVIDFLCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKNTTKSF-RELGSALL 174
+ D L Q +++ ++ Y + + + NT+ F E+G +
Sbjct: 121 VVINDSLMASVAQDAANISNVENYTFHSFSAF-------------NTSGDFWEEMGKPPV 167
Query: 175 NFPGFPPFPARDMALPMHDREGKVYKGLVDTGIQMAK-SAGIIVNTFELLQERAIKAMLE 233
FP FP+ + + +KG + K + G I NT +++ ++ +
Sbjct: 168 GDFHFPEFPSLEGCIAAQ------FKGFRTAQYEFRKFNNGDIYNTSRVIEGPYVELL-- 219
Query: 234 GQCIPGETLPPLYCIGPV--VGRGNGENRGRDRHECLSWLDSKPSRSVLFLCFGSLGSFS 291
+ G ++ +GP + ++ G RH C+ WLD + SV+++ FG+ +
Sbjct: 220 -ELFNGGK--KVWALGPFNPLAVEKKDSIGF-RHPCMEWLDKQEPSSVIYISFGTTTALR 275
Query: 292 SKQLKEMAIGLERSGVKFLWVVR-APAPDSVENRSSLESLLPEGFLDRTKDRGLVVESWA 350
+Q++++A GLE+S KF+WV+R A D + LP+GF +R + GLVV WA
Sbjct: 276 DEQIQQIATGLEQSKQKFIWVLREADKGDIFAGSEAKRYELPKGFEERVEGMGLVVRDWA 335
Query: 351 PQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIKAVVVEEMKVGLAVT 410
PQ+E+L+H S GGF++HCGWNS LE + GVP+ WP++++Q +V E +KVGL V
Sbjct: 336 PQLEILSHSSTGGFMSHCGWNSCLESITMGVPIATWPMHSDQPRNAVLVTEVLKVGLVVK 395
Query: 411 RSEEGDGLVSSAELEQRVSELMDSEKGRAVKERAVAMKEAAAAAMRDGGSSRVALDNLV 469
+ + LVS++ +E V LM++++G +++RAV +K A +M +GG S + + + +
Sbjct: 396 DWAQRNSLVSASVVENGVRRLMETKEGDEMRQRAVRLKNAIHRSMDEGGVSHMEMGSFI 454
>sp|Q9ZQ94|U73C5_ARATH UDP-glycosyltransferase 73C5 OS=Arabidopsis thaliana GN=UGT73C5
PE=2 SV=1
Length = 495
Score = 177 bits (450), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 144/500 (28%), Positives = 236/500 (47%), Gaps = 59/500 (11%)
Query: 5 IVLYTSPGRGHLNSMVELGKLILTYHPCFSIDIIIPTAPFVTSAGTDDYIASVSATAP-- 62
VL+ +GH+ MV++ +L+ + I I T P + + ++ + P
Sbjct: 13 FVLFPFMAQGHMIPMVDIARLLAQR----GVIITIVTTPHNAARFKNVLNRAIESGLPIN 68
Query: 63 --SVTFHQLPPPVSRIPDTLRSPADFPALVYELGELNNPNLHETLITISKRSNLK-AFVI 119
V F L + + + S ++ +N L E + + + N + + +I
Sbjct: 69 LVQVKFPYLEAGLQEGQENIDSLDTMERMIPFFKAVNF--LEEPVQKLIEEMNPRPSCLI 126
Query: 120 DFLCNPAFQVSSSTLSIPTYYYFTTAGSVLAANLYLPTLHKN-----TTKSFRELGSALL 174
C P + +IP + L L + L KN KS +EL +
Sbjct: 127 SDFCLPYTSKIAKKFNIPKILFHGMGCFCL---LCMHVLRKNREILDNLKSDKELFTV-- 181
Query: 175 NFPGFPPFPAR----------DMALPMHDREGKVYKGLVDTGIQMAKSAGIIVNTFELLQ 224
P FP R + +P D + ++ G+V+ S G+IVN+F+ L+
Sbjct: 182 -----PDFPDRVEFTRTQVPVETYVPAGDWK-DIFDGMVEAN---ETSYGVIVNSFQELE 232
Query: 225 ERAIKAMLEGQCIPGETLPPLYCIGPV----VGRGNGENRGRDRHECLSWLDSKPSRSVL 280
K E + T+ P+ V RGN + D+ ECL WLDSK SVL
Sbjct: 233 PAYAKDYKEVRSGKAWTIGPVSLCNKVGADKAERGNKSDI--DQDECLKWLDSKKHGSVL 290
Query: 281 FLCFGSLGSFSSKQLKEMAIGLERSGVKFLWVVRAPAPDSVENRSSLESLLPEGFLDRTK 340
++C GS+ + QLKE+ +GLE S F+WV+R + + +E GF DR +
Sbjct: 291 YVCLGSICNLPLSQLKELGLGLEESQRPFIWVIRGWE----KYKELVEWFSESGFEDRIQ 346
Query: 341 DRGLVVESWAPQVEVLNHESVGGFVTHCGWNSVLEGVCAGVPMLAWPLYAEQKMIKAVVV 400
DRGL+++ W+PQ+ +L+H SVGGF+THCGWNS LEG+ AG+P+L WPL+A+Q + +VV
Sbjct: 347 DRGLLIKGWSPQMLILSHPSVGGFLTHCGWNSTLEGITAGLPLLTWPLFADQFCNEKLVV 406
Query: 401 EEMKVG--------LAVTRSEEGDGLVSSAELEQRVSELM-DSEKGRAVKERAVAMKEAA 451
E +K G + E+ LV +++ V ELM +S+ + + RA + ++A
Sbjct: 407 EVLKAGVRSGVEQPMKWGEEEKIGVLVDKEGVKKAVEELMGESDDAKERRRRAKELGDSA 466
Query: 452 AAAMRDGGSSRVALDNLVES 471
A+ +GGSS + L++
Sbjct: 467 HKAVEEGGSSHSNISFLLQD 486
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda     K      H
   0.318    0.135    0.397

Gapped
Lambda     K      H
   0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 178,685,461
Number of Sequences: 539616
Number of extensions: 7639617
Number of successful extensions: 20985
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 241
Number of HSP's successfully gapped in prelim test: 11
Number of HSP's that attempted gapping in prelim test: 20245
Number of HSP's gapped (non-prelim): 297
length of query: 485
length of database: 191,569,459
effective HSP length: 121
effective length of query: 364
effective length of database: 126,275,923
effective search space: 45964435972
effective search space used: 45964435972
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 63 (28.9 bits)
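
Note on the statistics above: the effective lengths and the bit values in parentheses can be re-derived from the other footer numbers. The sketch below is only a consistency check, not BLAST's own code. It assumes the standard Karlin-Altschul conversion, bits = (Lambda * raw - ln K) / ln 2, and a drop-off conversion of Lambda * raw / ln 2. It also assumes that the first Lambda/K pair is the ungapped set (used for X1 and S1) and the second is the gapped set (used for X2, X3 and S2). The function and constant names are hypothetical and chosen for illustration.

```python
import math

# Values copied from the report footer.
QUERY_LEN = 485
DB_LETTERS = 191_569_459
DB_SEQS = 539_616
HSP_LEN = 121  # "effective HSP length"

# (Lambda, K) pairs; which block is ungapped vs. gapped is an assumption.
UNGAPPED = (0.318, 0.135)
GAPPED = (0.267, 0.0410)


def effective_lengths(query_len, db_letters, db_seqs, hsp_len):
    """Subtract the HSP length correction from the query and from each database sequence."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw, lam, k):
    """Convert a raw score to bits: (Lambda * S - ln K) / ln 2."""
    return (lam * raw - math.log(k)) / math.log(2)


def dropoff_bits(raw, lam):
    """Convert an X drop-off value to bits: Lambda * X / ln 2 (no K term)."""
    return lam * raw / math.log(2)


if __name__ == "__main__":
    print(effective_lengths(QUERY_LEN, DB_LETTERS, DB_SEQS, HSP_LEN))
    print("S1", round(bit_score(41, *UNGAPPED), 1))
    print("S2", round(bit_score(63, *GAPPED), 1))
    print("X1", round(dropoff_bits(16, UNGAPPED[0]), 1))
    print("X2", round(dropoff_bits(38, GAPPED[0]), 1))
    print("X3", round(dropoff_bits(64, GAPPED[0]), 1))
```

Under these assumptions the computed values agree with the footer: 364 and 126,275,923 for the effective lengths, 45,964,435,972 for the search space, and the bit values printed beside X1, X2, X3, S1 and S2. The bit scores of individual hits are not checked here, because compositional matrix adjustment may rescale the scoring for each subject sequence.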