BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 044031
(468 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q4R1I9|ANGLT_ROSHC Anthocyanidin 5,3-O-glucosyltransferase OS=Rosa hybrid cultivar
GN=RhGT1 PE=2 SV=1
Length = 473
Score = 381 bits (978), Expect = e-105, Method: Compositional matrix adjust.
Identities = 214/484 (44%), Positives = 290/484 (59%), Gaps = 39/484 (8%)
Query: 5 IALYPGPAFHHMISMVELGKLILQHRSDVSITILVPS-----------MPLEESKTCSYI 53
I LYP P H+ISMVELGKL+L H SITIL + + + +YI
Sbjct: 6 IVLYPYPGLGHLISMVELGKLLLTHHPSFSITILASTAPTTIAATAKLVASSNDQLTNYI 65
Query: 54 NSISHRLNPIISFYYLPAIQMPSETLSRADIAIESIKLNSSNVFQALENI--SLTSKILS 111
++S NP I+F++LP I E + + ++ E +L N+ Q L+ + SL + IL
Sbjct: 66 KAVSAD-NPAINFHHLPTISSLPEHIEKLNLPFEYARLQIPNILQVLQTLKSSLKALILD 124
Query: 112 FIITSTTSFSYHPNIPTYTYFNSCASTLAAILYLPTLHNQITSSFKDHPSSLLFIPGLPP 171
+ + NIPT+ ++ S +LA +L +PT H + T+S D + I G+PP
Sbjct: 125 MFCDALFDVTKDLNIPTFYFYTSAGRSLAVLLNIPTFH-RTTNSLSDFGDVPISISGMPP 183
Query: 172 VKSSFMPEPVLDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAIVNGDCVTNGT 231
+ S MP+ + DR Y FL+ ST ++KSNGII+NTFD LE++A+KA+ G C+ N
Sbjct: 184 IPVSAMPKLLFDRSTNFYKSFLSTSTHMAKSNGIILNTFDLLEERALKALRAGLCLPNQP 243
Query: 232 TPPLHCIGPLIVDAKDRAGGVSDDVSSDCLTWLDSQPSGSVVFLCFGSRGTFSAPQLKEI 291
TPP+ +GPLI +G D+ + L WL++QP SVVFLCFGS G FS QL+ +
Sbjct: 244 TPPIFTVGPLI------SGKSGDNDEHESLKWLNNQPKDSVVFLCFGSMGVFSIKQLEAM 297
Query: 292 AIGLERSNQRFLWVVRNP--------SNAAEAELPEGFLERTKERGLVVKSWAPQSTILG 343
A+GLE+S QRFLWVVRNP + E LP+GF+ERTK+RGLVV+ WAPQ +L
Sbjct: 298 ALGLEKSGQRFLWVVRNPPIEELPVEEPSLEEILPKGFVERTKDRGLVVRKWAPQVEVLS 357
Query: 344 HESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAMPMFLNGEEET 403
H+SVGGFVTHCGW+SV+EAV GVPM+AWPLYAEQ L V LV+EMKVA+ + E ET
Sbjct: 358 HDSVGGFVTHCGWNSVLEAVCNGVPMVAWPLYAEQKLGRVFLVEEMKVAVGV---KESET 414
Query: 404 IGNGEGVVSAERVEERVRELMMGSEGKALRERSLEMRMMAATAWNNNDGGSSFTAFSNLF 463
G VSA+ +E+RVRELM G +R R E A +GGSS + + L
Sbjct: 415 -----GFVSADELEKRVRELMDSESGDEIRGRVSEFSNGGVKA--KEEGGSSVASLAKLA 467
Query: 464 DLWQ 467
LW+
Sbjct: 468 QLWK 471
>sp|Q9LK73|U88A1_ARATH UDP-glycosyltransferase 88A1 OS=Arabidopsis thaliana GN=UGT88A1
PE=2 SV=1
Length = 462
Score = 356 bits (914), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 210/483 (43%), Positives = 293/483 (60%), Gaps = 44/483 (9%)
Query: 2 KKTIALYPGPAFHHMISMVELGKLILQHRSDVSITILVPSMPLEESKTCSYINSISHRLN 61
++ I LYP P H++SMVELGK IL +SI I++ P + T +YI+S+S
Sbjct: 3 EEAIVLYPAPPIGHLVSMVELGKTILSKNPSLSIHIILVPPPYQPESTATYISSVSSSFP 62
Query: 62 PIISFYYLPAIQ----MPSETLSRADIAIESIKLNSSNVFQAL----ENISLTSKILSFI 113
I+F++LPA+ + + +E + ++ +V + L N ++ + I+ F
Sbjct: 63 -SITFHHLPAVTPYSSSSTSRHHHESLLLEILCFSNPSVHRTLFSLSRNFNVRAMIIDFF 121
Query: 114 ITSTTSFSYHPNIPTYTYFNSCASTLAAILYLPTLHNQIT-SSFKDHPSSLLFIPGLPPV 172
T+ + P Y ++ S A+ LA YLPT+ + KD P+ + IPG+PP+
Sbjct: 122 CTAVLDITADFTFPVYFFYTSGAACLAFSFYLPTIDETTPGKNLKDIPT--VHIPGVPPM 179
Query: 173 KSSFMPEPVLDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAIVNGDCVTNGTT 232
K S MP+ VL+R +YD F+ + LSKS+GIIINTFD LE +AIKAI C N
Sbjct: 180 KGSDMPKAVLERDDEVYDVFIMFGKQLSKSSGIIINTFDALENRAIKAITEELCFRN--- 236
Query: 233 PPLHCIGPLIVDAK--DRAGGVSDDVSSDCLTWLDSQPSGSVVFLCFGSRGTFSAPQLKE 290
++ IGPLIV+ + DR +D+ + CL WLDSQP SVVFLCFGS G FS Q+ E
Sbjct: 237 --IYPIGPLIVNGRIEDR----NDNKAVSCLNWLDSQPEKSVVFLCFGSLGLFSKEQVIE 290
Query: 291 IAIGLERSNQRFLWVVRNPSNAAEAEL------PEGFLERTKERGLVVKSWAPQSTILGH 344
IA+GLE+S QRFLWVVRNP + EL PEGFL RT+++G+VVKSWAPQ +L H
Sbjct: 291 IAVGLEKSGQRFLWVVRNPPELEKTELDLKSLLPEGFLSRTEDKGMVVKSWAPQVPVLNH 350
Query: 345 ESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAMPMFLNGEEETI 404
++VGGFVTHCGW+S++EAV GVPM+AWPLYAEQ N V +V E+K+A+ M E ET
Sbjct: 351 KAVGGFVTHCGWNSILEAVCAGVPMVAWPLYAEQRFNRVMIVDEIKIAISM---NESET- 406
Query: 405 GNGEGVVSAERVEERVRELMMGSEGKA-LRERSLEMRMMAATAWNNNDGGSSFTAFSNLF 463
G VS+ VE+RV+E++ G+ +RER++ M+ A A + GSS TA + L
Sbjct: 407 ----GFVSSTEVEKRVQEII----GECPVRERTMAMKNAAELAL--TETGSSHTALTTLL 456
Query: 464 DLW 466
W
Sbjct: 457 QSW 459
>sp|Q33DV3|4CGT_ANTMA Chalcone 4'-O-glucosyltransferase OS=Antirrhinum majus PE=1 SV=1
Length = 457
Score = 311 bits (797), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 194/473 (41%), Positives = 274/473 (57%), Gaps = 35/473 (7%)
Query: 2 KKTIALYPGPAFHHMISMVELGKLILQHRSDVSITILVPSMPLEESKTCSYINSISHRLN 61
KKT + + H+ S + L K I +H S +SITI+ + P E S+ IN N
Sbjct: 6 KKTHTIVFHTSEEHLNSSIALAKFITKHHSSISITIISTA-PAESSEVAKIIN------N 58
Query: 62 PIISFYYLPAIQMPSETLSR-----ADIAIESIKLNSSNVFQALENISLTSKILSFII-- 114
P I++ L A+ +P S ++ E +L ++N+ +AL +IS S I + II
Sbjct: 59 PSITYRGLTAVALPENLTSNINKNPVELFFEIPRLQNANLREALLDISRKSDIKALIIDF 118
Query: 115 --TSTTSFSYHPNIPTYTYFNSCASTLAAILYLPTLHNQITSSFKDHPSSLLFIPGLPPV 172
+ S NIPTY + A L L+ PTLH + D S+ +PG P +
Sbjct: 119 FCNAAFEVSTSMNIPTYFDVSGGAFLLCTFLHHPTLHQTVRGDIADLNDSVE-MPGFPLI 177
Query: 173 KSSFMPEPVLDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAIVNGDCVTNGTT 232
SS +P + R+ +Y FL+ S ++ KS+GI++NTF LE +A +A+ NG G T
Sbjct: 178 HSSDLPMSLFYRKTNVYKHFLDTSLNMRKSSGILVNTFVALEFRAKEALSNG---LYGPT 234
Query: 233 PPLHCIGPLIVDAKDRAGGVSDDVSSDCLTWLDSQPSGSVVFLCFGSRGTFSAPQLKEIA 292
PPL+ + I + D V+ +CL+WLD QPS SV+FLCFG RG FSA QLKEIA
Sbjct: 235 PPLYLLSHTIAEPHDTKVLVNQH---ECLSWLDLQPSKSVIFLCFGRRGAFSAQQLKEIA 291
Query: 293 IGLERSNQRFLWVVR-NPSNAAEAELPEGFLERTKERGLVVKSWAPQSTILGHESVGGFV 351
IGLE+S RFLW+ R +P A LPEGFL RTK G V +W PQ +L H++VGGFV
Sbjct: 292 IGLEKSGCRFLWLARISPEMDLNALLPEGFLSRTKGVGFVTNTWVPQKEVLSHDAVGGFV 351
Query: 352 THCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAMPMFLNGEEETIGNGEGVV 411
THCGWSSV+EA+++GVPMI WPLYAEQ +N V +V+E+KVA+P+ +EE +G V
Sbjct: 352 THCGWSSVLEALSFGVPMIGWPLYAEQRINRVFMVEEIKVALPL----DEE-----DGFV 402
Query: 412 SAERVEERVRELMMGSEGKALRERSLEMRMMAATAWNNNDGGSSFTAFSNLFD 464
+A +E+RVRELM +GK ++ R E+++ A + GGSS + +
Sbjct: 403 TAMELEKRVRELMESVKGKEVKRRVAELKISTKAAVSK--GGSSLASLEKFIN 453
>sp|Q76MR7|UBGAT_SCUBA Baicalein 7-O-glucuronosyltransferase OS=Scutellaria baicalensis
GN=UBGAT-I PE=1 SV=1
Length = 441
Score = 309 bits (791), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 186/460 (40%), Positives = 255/460 (55%), Gaps = 39/460 (8%)
Query: 19 MVELGKLILQHRSDVSITILVPSMPLEESKTCSYINSISHRLNPIISFYYLPAIQMPSE- 77
M L K I ++ V I I++ + P + + + I P IS++ LP ++P +
Sbjct: 1 MAVLAKFISKNHPSVPI-IIISNAPESAAASVAAI--------PSISYHRLPLPEIPPDM 51
Query: 78 TLSRADIAIESIKLNSSNVFQALENISLTSKI----LSFIITSTTSFSYHPNIPTYTYFN 133
T R ++ E +L++ N+ AL+ IS ++I L F + NIPTY YF+
Sbjct: 52 TTDRVELFFELPRLSNPNLLTALQQISQKTRIRAVILDFFCNAAFEVPTSLNIPTYYYFS 111
Query: 134 SCASTLAAILYLPTLHNQITSSFKDHPSSLLFIPGLPPVKSSFMPEPVLDRQKPIYDFFL 193
+ T LY T+ I +D + + IPGLPP+ +P + R+ +Y +
Sbjct: 112 AGTPTAILTLYFETIDETIPVDLQDL-NDYVDIPGLPPIHCLDIPVALSPRKSLVYKSSV 170
Query: 194 NYSTSLSKSNGIIINTFDFLEQQAIKAIVNGDCVTNGTTPPLHCIGPLIVDAKDRAGGVS 253
+ S +L +S GI++N FD LE +AI + G TPP++ IGPL+ D +AG
Sbjct: 171 DISKNLRRSAGILVNGFDALEFRAIGSHSQRPMHFKGPTPPVYFIGPLVGDVDTKAGSEE 230
Query: 254 DDVSSDCLTWLDSQPSGSVVFLCFGSRGTFSAPQLKEIAIGLERSNQRFLWVVRNPSNAA 313
+CL WLD+QPS SVVFLCFG RG FSA QLKE A LE S RFLW VRNP
Sbjct: 231 ----HECLRWLDTQPSKSVVFLCFGRRGVFSAKQLKETAAALENSGHRFLWSVRNPPELK 286
Query: 314 EAE----------LPEGFLERTKERGLVVKSWAPQSTILGHESVGGFVTHCGWSSVVEAV 363
+A LPEGFLERTK+RG V+KSWAPQ +L H+SVGGFVTHCG SSV E V
Sbjct: 287 KATGSDEPDLDELLPEGFLERTKDRGFVIKSWAPQKEVLAHDSVGGFVTHCGRSSVSEGV 346
Query: 364 TYGVPMIAWPLYAEQFLNSVALVQEMKVAMPMFLNGEEETIGNGEGVVSAERVEERVREL 423
+GVPMI WP+ AE LN +V +++VA+P+ EE G G V+A +E+RVREL
Sbjct: 347 WFGVPMIGWPVDAELRLNRAVMVDDLQVALPL-----EEEAG---GFVTAAELEKRVREL 398
Query: 424 MMGSEGKALRERSLEMRMMAATAWNNNDGGSSFTAFSNLF 463
M GKA+R+R E+++ A A N GSS
Sbjct: 399 METKAGKAVRQRVTELKLSARAAVAEN--GSSLNDLKKFL 436
>sp|Q9AR73|HQGT_RAUSE Hydroquinone glucosyltransferase OS=Rauvolfia serpentina GN=AS PE=1
SV=1
Length = 470
Score = 273 bits (698), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 179/492 (36%), Positives = 263/492 (53%), Gaps = 63/492 (12%)
Query: 5 IALYPGPAFHHMISMVELGK-LILQHRSDVSITILVPS-MPLEESKTCSYINSISHRLN- 61
IA+ P P H+I +VE K L+L+H + +T ++P+ PL +++ S+++++ +N
Sbjct: 7 IAMVPTPGMGHLIPLVEFAKRLVLRH--NFGVTFIIPTDGPLPKAQK-SFLDALPAGVNY 63
Query: 62 ---PIISFYYLPAIQMPSETLSRADIAIES-----IKLNSSNVFQALENISLTSKILSFI 113
P +SF LPA D+ IE+ I + V A++ + T+K+ + +
Sbjct: 64 VLLPPVSFDDLPA-----------DVRIETRICLTITRSLPFVRDAVKTLLATTKLAALV 112
Query: 114 I----TSTTSFSYHPNIPTYTYFNSCASTLAAILYLPTLHNQITSSFKDHPSSLLFIPGL 169
+ T + + Y ++ + A L+ +LP L ++ ++D P L IPG
Sbjct: 113 VDLFGTDAFDVAIEFKVSPYIFYPTTAMCLSLFFHLPKLDQMVSCEYRDVPEPLQ-IPGC 171
Query: 170 PPVKSSFMPEPVLDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAIVNGDCVTN 229
P+ +P DR+ Y L+ + + GI++NTF+ LE +KA+ D
Sbjct: 172 IPIHGKDFLDPAQDRKNDAYKCLLHQAKRYRLAEGIMVNTFNDLEPGPLKALQEED---- 227
Query: 230 GTTPPLHCIGPLIVDAKDRAGGVSDDVSSDCLTWLDSQPSGSVVFLCFGSRGTFSAPQLK 289
PP++ IGPLI RA S +CL WLD QP GSV+F+ FGS G S Q
Sbjct: 228 QGKPPVYPIGPLI-----RADSSSKVDDCECLKWLDDQPRGSVLFISFGSGGAVSHNQFI 282
Query: 290 EIAIGLERSNQRFLWVVRNPS--------------NAAEAELPEGFLERTKERGLVVKSW 335
E+A+GLE S QRFLWVVR+P+ N A A LPEGFLERTK R L+V SW
Sbjct: 283 ELALGLEMSEQRFLWVVRSPNDKIANATYFSIQNQNDALAYLPEGFLERTKGRCLLVPSW 342
Query: 336 APQSTILGHESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAMPM 395
APQ+ IL H S GGF+THCGW+S++E+V GVP+IAWPLYAEQ +N+V L + +KVA+
Sbjct: 343 APQTEILSHGSTGGFLTHCGWNSILESVVNGVPLIAWPLYAEQKMNAVMLTEGLKVAL-R 401
Query: 396 FLNGEEETIGNGEGVVSAERVEERVRELMMGSEGKALRERSLEMRMMAATAWNNNDGGSS 455
GE IG E + V+ LM G EGK R +++ A+ A +D GSS
Sbjct: 402 PKAGENGLIGRVE-------IANAVKGLMEGEEGKKFRSTMKDLKDAASRAL--SDDGSS 452
Query: 456 FTAFSNLFDLWQ 467
A + L W+
Sbjct: 453 TKALAELACKWE 464
>sp|Q9M156|U72B1_ARATH UDP-glycosyltransferase 72B1 OS=Arabidopsis thaliana GN=UGT72B1
PE=1 SV=1
Length = 480
Score = 263 bits (673), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 161/483 (33%), Positives = 254/483 (52%), Gaps = 43/483 (8%)
Query: 5 IALYPGPAFHHMISMVELGKLILQHRSDVSITILVPSMPLEESKTCSYINSISHRLNPII 64
+A+ P P H+I +VE K ++ H +++T ++ E ++ L I
Sbjct: 9 VAIIPSPGMGHLIPLVEFAKRLV-HLHGLTVTFVIAG----EGPPSKAQRTVLDSLPSSI 63
Query: 65 SFYYLPAIQM---PSETLSRADIAIESIKLNSS--NVFQA-LENISL-TSKILSFIITST 117
S +LP + + S T + I++ + N VF + +E L T+ ++ T
Sbjct: 64 SSVFLPPVDLTDLSSSTRIESRISLTVTRSNPELRKVFDSFVEGGRLPTALVVDLFGTDA 123
Query: 118 TSFSYHPNIPTYTYFNSCASTLAAILYLPTLHNQITSSFKDHPSSLLFIPGLPPVKSSFM 177
+ ++P Y ++ + A+ L+ L+LP L ++ F++ L+ +PG PV
Sbjct: 124 FDVAVEFHVPPYIFYPTTANVLSFFLHLPKLDETVSCEFRELTEPLM-LPGCVPVAGKDF 182
Query: 178 PEPVLDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAIVNGDCVTNGTTPPLHC 237
+P DR+ Y + L+ + ++ GI++NTF LE AIKA+ PP++
Sbjct: 183 LDPAQDRKDDAYKWLLHNTKRYKEAEGILVNTFFELEPNAIKALQEPGL----DKPPVYP 238
Query: 238 IGPLIVDAKDRAGGVSDDVSSDCLTWLDSQPSGSVVFLCFGSRGTFSAPQLKEIAIGLER 297
+GPL+ K A + S+CL WLD+QP GSV+++ FGS GT + QL E+A+GL
Sbjct: 239 VGPLVNIGKQEAKQTEE---SECLKWLDNQPLGSVLYVSFGSGGTLTCEQLNELALGLAD 295
Query: 298 SNQRFLWVVRNPSNAAEAE-------------LPEGFLERTKERGLVVKSWAPQSTILGH 344
S QRFLWV+R+PS A + LP GFLERTK+RG V+ WAPQ+ +L H
Sbjct: 296 SEQRFLWVIRSPSGIANSSYFDSHSQTDPLTFLPPGFLERTKKRGFVIPFWAPQAQVLAH 355
Query: 345 ESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAMPMFLNGEEETI 404
S GGF+THCGW+S +E+V G+P+IAWPLYAEQ +N+V L ++++ A+
Sbjct: 356 PSTGGFLTHCGWNSTLESVVSGIPLIAWPLYAEQKMNAVLLSEDIRAAL--------RPR 407
Query: 405 GNGEGVVSAERVEERVRELMMGSEGKALRERSLEMRMMAATAWNNNDGGSSFTAFSNLFD 464
+G+V E V V+ LM G EGK +R + E++ A D G+S A S +
Sbjct: 408 AGDDGLVRREEVARVVKGLMEGEEGKGVRNKMKELKEAACRVL--KDDGTSTKALSLVAL 465
Query: 465 LWQ 467
W+
Sbjct: 466 KWK 468
>sp|Q2V6K0|UFOG6_FRAAN UDP-glucose flavonoid 3-O-glucosyltransferase 6 OS=Fragaria
ananassa GN=GT6 PE=1 SV=1
Length = 479
Score = 259 bits (661), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 171/499 (34%), Positives = 269/499 (53%), Gaps = 54/499 (10%)
Query: 1 MKKT--IALYPGPAFHHMISMVELGKLILQHRSDVSITILVPSMPLEESKTCSYINSISH 58
MKK + P P H++S VE+ KL+L ++ ITIL+ P + YI S++
Sbjct: 1 MKKASELIFIPIPGIGHIVSTVEIAKLLLCRDDNLFITILIMKFPFTADGSDVYIKSLA- 59
Query: 59 RLNPIISFYYLPAIQMPSETLSRADIA-----IESIKLNSSN-VFQALENISLTSKILSF 112
++P + + + +P E I+S K + + V + +E S T++I F
Sbjct: 60 -VDPSLKTQRIRFVNLPQEHFQGTGATGFFTFIDSHKSHVKDAVTRLMETKSETTRIAGF 118
Query: 113 II----TSTTSFSYHPNIPTYTYFNSCASTLAAILYLPTLH---NQITSSFKDHPSSLL- 164
+I T + +P+Y ++ S A+ L + +L L N+ + FKD + L+
Sbjct: 119 VIDMFCTGMIDLANEFGLPSYVFYTSGAADLGLMFHLQALRDEENKDCTEFKDSDAELVV 178
Query: 165 --FIPGLPPVKSSFMPEPVLDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAIV 222
F+ LP + +P V +++ +FFLN++ ++ GI++NTF LE AI+++
Sbjct: 179 SSFVNPLPAAR--VLPSVVFEKEGG--NFFLNFAKRYRETKGILVNTFLELEPHAIQSLS 234
Query: 223 NGDCVTNGTTPPLHCIGPLIVDAKDRAGGVSDDVS---SDCLTWLDSQPSGSVVFLCFGS 279
++G P++ +GP I++ K VS + S SD L WLD QP SVVFLCFGS
Sbjct: 235 -----SDGKILPVYPVGP-ILNVKSEGNQVSSEKSKQKSDILEWLDDQPPSSVVFLCFGS 288
Query: 280 RGTFSAPQLKEIAIGLERSNQRFLWVVRNPSNAA----------EAELPEGFLERTKERG 329
G F Q+KEIA LE+ RFLW +R PS +A LPEGFL+RT + G
Sbjct: 289 MGCFGEDQVKEIAHALEQGGIRFLWSLRQPSKEKIGFPSDYTDYKAVLPEGFLDRTTDLG 348
Query: 330 LVVKSWAPQSTILGHESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEM 389
V+ WAPQ IL H +VGGFV+HCGW+S +E++ YGVP+ WP YAEQ +N+ LV+E+
Sbjct: 349 KVI-GWAPQLAILAHPAVGGFVSHCGWNSTLESIWYGVPIATWPFYAEQQVNAFELVKEL 407
Query: 390 KVAMPMFLNGEEETIGNGEGV-VSAERVEERVRELMMGSEGKALRERSLEMRMMAATAWN 448
K+A+ + + +++ GV VS E +E+ ++E+M + LR+R EM M+ A
Sbjct: 408 KLAVEIDMGYRKDS-----GVIVSRENIEKGIKEVM--EQESELRKRVKEMSQMSRKALE 460
Query: 449 NNDGGSSFTAFSNLFDLWQ 467
+ GSS+++ D Q
Sbjct: 461 ED--GSSYSSLGRFLDQIQ 477
>sp|Q66PF3|UFOG3_FRAAN Putative UDP-glucose flavonoid 3-O-glucosyltransferase 3
OS=Fragaria ananassa GN=GT3 PE=2 SV=1
Length = 478
Score = 253 bits (645), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 162/487 (33%), Positives = 263/487 (54%), Gaps = 44/487 (9%)
Query: 5 IALYPGPAFHHMISMVELGKLILQHRSDVSITILVPSMPLEESKTCSYINSISHRLNPI- 63
+ L P P H++S +E+ KL++ + IT+L+ P T +Y+ S++ +PI
Sbjct: 7 LVLIPSPGIGHLVSTLEIAKLLVSRDDKLFITVLIMHFPAVSKGTDAYVQSLADSSSPIS 66
Query: 64 --ISFYYLPAIQMPSETLSRADIAIESIKLNSSNVFQALENI--SLTSKILSFII----T 115
I+F LP M S + + ++ +V A+ N+ S T+++ F++ T
Sbjct: 67 QRINFINLPHTNMDHTEGSVRNSLVGFVESQQPHVKDAVANLRDSKTTRLAGFVVDMFCT 126
Query: 116 STTSFSYHPNIPTYTYFNSCASTLAAILYLPTLHNQIT---SSFKDHPSSLLFIPGLPPV 172
+ + + +P+Y +F S A+TL + +L L +Q + FKD + L+ P+
Sbjct: 127 TMINVANQLGVPSYVFFTSGAATLGLLFHLQELRDQYNKDCTEFKDSDAELIIPSFFNPL 186
Query: 173 KSSFMPEPVL--DRQKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAIVNGDCVTNG 230
+ +P +L D +P FLN ++ GI++NTF LE A+ A+ ++
Sbjct: 187 PAKVLPGRMLVKDSAEP----FLNVIKRFRETKGILVNTFTDLESHALHALS-----SDA 237
Query: 231 TTPPLHCIGPLIVDAKDRAGGVSDDVS--SDCLTWLDSQPSGSVVFLCFGSRGTFSAPQL 288
PP++ +GPL+ + + SD+V +D L WLD QP SVVFLCFGS G+F Q+
Sbjct: 238 EIPPVYPVGPLLNLNSNESRVDSDEVKKKNDILKWLDDQPPLSVVFLCFGSMGSFDESQV 297
Query: 289 KEIAIGLERSNQRFLWVVRN---------PSNAAE--AELPEGFLERTKERGLVVKSWAP 337
+EIA LE + RFLW +R PS+ + LPEGFL+RT G V+ WAP
Sbjct: 298 REIANALEHAGHRFLWSLRRSPPTGKVAFPSDYDDHTGVLPEGFLDRTGGIGKVI-GWAP 356
Query: 338 QSTILGHESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAMPMFL 397
Q +L H SVGGFV+HCGW+S +E++ +GVP+ WPLYAEQ LN+ V+E+++A+ + +
Sbjct: 357 QVAVLAHPSVGGFVSHCGWNSTLESLWHGVPVATWPLYAEQQLNAFQPVKELELAVEIDM 416
Query: 398 NGEEETIGNGEGVVSAERVEERVRELMMGSEGKALRERSLEMRMMAATAWNNNDGGSSFT 457
+ ++ +VSA+ +E +RE+M + +R+R EM A DGGSS+T
Sbjct: 417 SYRSKS----PVLVSAKEIERGIREVME-LDSSDIRKRVKEMSEKGKKAL--MDGGSSYT 469
Query: 458 AFSNLFD 464
+ + D
Sbjct: 470 SLGHFID 476
>sp|Q9LNI1|U72B3_ARATH UDP-glycosyltransferase 72B3 OS=Arabidopsis thaliana GN=UGT72B3
PE=2 SV=1
Length = 481
Score = 249 bits (636), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 158/456 (34%), Positives = 248/456 (54%), Gaps = 42/456 (9%)
Query: 5 IALYPGPAFHHMISMVELGKLILQHRSDVSITILVPSMPLEESKTCSYINSISHRLNPII 64
+A+ P P H+I +VEL K +L + ++T ++P S +NS+ + I
Sbjct: 9 VAIIPSPGIGHLIPLVELAKRLLDNHG-FTVTFIIPGDSPPSKAQRSVLNSLP---SSIA 64
Query: 65 SFYYLPAIQMPSETLSRADIAIESIKLNSSN-----VFQALENISLTSKILSFIITSTTS 119
S + PA + +R + I S+ + SN +F +L +L + T +
Sbjct: 65 SVFLPPADLSDVPSTARIETRI-SLTVTRSNPALRELFGSLSAEKRLPAVLVVDLFGTDA 123
Query: 120 FSYHP--NIPTYTYFNSCASTLAAILYLPTLHNQITSSFKDHPSSLLFIPGLPPVKSSFM 177
F ++ Y ++ S A+ L +L+LP L ++ F++ ++ IPG P+
Sbjct: 124 FDVAAEFHVSPYIFYASNANVLTFLLHLPKLDETVSCEFRELTEPVI-IPGCVPITGKDF 182
Query: 178 PEPVLDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAIVNGDCVTNGTTPPLHC 237
+P DR+ Y + L+ ++ GI++N+F LE IK + PP++
Sbjct: 183 VDPCQDRKDESYKWLLHNVKRFKEAEGILVNSFVDLEPNTIKIVQE----PAPDKPPVYL 238
Query: 238 IGPLIVDAKDRAGGVSDDVSSDCLTWLDSQPSGSVVFLCFGSRGTFSAPQLKEIAIGLER 297
IGPL V++ V+D+ CL WLD+QP GSV+++ FGS GT + Q E+A+GL
Sbjct: 239 IGPL-VNSGSHDADVNDEYK--CLNWLDNQPFGSVLYVSFGSGGTLTFEQFIELALGLAE 295
Query: 298 SNQRFLWVVRNPSNAAEAE-------------LPEGFLERTKERGLVVKSWAPQSTILGH 344
S +RFLWV+R+PS A + LP+GFL+RTKE+GLVV SWAPQ+ IL H
Sbjct: 296 SGKRFLWVIRSPSGIASSSYFNPQSRNDPFSFLPQGFLDRTKEKGLVVGSWAPQAQILTH 355
Query: 345 ESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAMPMFLNGEEETI 404
S+GGF+THCGW+S +E++ GVP+IAWPLYAEQ +N++ LV ++ A+ L GE
Sbjct: 356 TSIGGFLTHCGWNSSLESIVNGVPLIAWPLYAEQKMNALLLV-DVGAALRARL-GE---- 409
Query: 405 GNGEGVVSAERVEERVRELMMGSEGKALRERSLEMR 440
+GVV E V V+ L+ G EG A+R++ E++
Sbjct: 410 ---DGVVGREEVARVVKGLIEGEEGNAVRKKMKELK 442
>sp|Q8W4C2|U72B2_ARATH UDP-glycosyltransferase 72B2 OS=Arabidopsis thaliana GN=UGT72B2
PE=2 SV=1
Length = 480
Score = 235 bits (600), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 162/492 (32%), Positives = 253/492 (51%), Gaps = 61/492 (12%)
Query: 5 IALYPGPAFHHMISMVELGKLILQHRSDVSITILVPSMPLEESKTCSYINSISHRLNPII 64
IA+ P P H+I VEL K ++QH ++T+++ E+ S+ + L I
Sbjct: 9 IAIMPSPGMGHLIPFVELAKRLVQHDC-FTVTMIISG----ETSPSKAQRSVLNSLPSSI 63
Query: 65 SFYYLPAIQMPSETLSRADIAIES-IKLNSSN-----VFQALENISLTSKILSFIITSTT 118
+ +LP + S+ S A I + + + SN +F +L +L +
Sbjct: 64 ASVFLPPADL-SDVPSTARIETRAMLTMTRSNPALRELFGSLSTKKSLPAVLVVDMFGAD 122
Query: 119 SFS----YHPNIPTYTYFNSCASTLAAILYLPTLHNQITSSFKDHPSSLLFIPGLPPVKS 174
+F +H + Y ++ S A+ L+ L+LP L ++ F+ + + L IPG P+
Sbjct: 123 AFDVAVDFH--VSPYIFYASNANVLSFFLHLPKLDKTVSCEFR-YLTEPLKIPGCVPITG 179
Query: 175 SFMPEPVLDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAIVNGDCVTNGTTPP 234
+ V DR Y L+ + ++ GI++N+F LE AIKA+ P
Sbjct: 180 KDFLDTVQDRNDDAYKLLLHNTKRYKEAKGILVNSFVDLESNAIKALQE----PAPDKPT 235
Query: 235 LHCIGPLI------VDAKDRAGGVSDDVSSDCLTWLDSQPSGSVVFLCFGSRGTFSAPQL 288
++ IGPL+ V+ +D+ G CL+WLD+QP GSV+++ FGS GT + Q
Sbjct: 236 VYPIGPLVNTSSSNVNLEDKFG---------CLSWLDNQPFGSVLYISFGSGGTLTCEQF 286
Query: 289 KEIAIGLERSNQRFLWVVRNPSNAAEAE-------------LPEGFLERTKERGLVVKSW 335
E+AIGL S +RF+WV+R+PS + LP GFL+RTKE+GLVV SW
Sbjct: 287 NELAIGLAESGKRFIWVIRSPSEIVSSSYFNPHSETDPFSFLPIGFLDRTKEKGLVVPSW 346
Query: 336 APQSTILGHESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAMPM 395
APQ IL H S GF+THCGW+S +E++ GVP+IAWPL+AEQ +N++ LV+++ A+ +
Sbjct: 347 APQVQILAHPSTCGFLTHCGWNSTLESIVNGVPLIAWPLFAEQKMNTLLLVEDVGAALRI 406
Query: 396 FLNGEEETIGNGEGVVSAERVEERVRELMMGSEGKALRERSLEMRMMAATAWNNNDGGSS 455
GE +G+V E V V+ LM G EGKA+ + E++ D G S
Sbjct: 407 HA-GE-------DGIVRREEVVRVVKALMEGEEGKAIGNKVKELKEGVVRVL--GDDGLS 456
Query: 456 FTAFSNLFDLWQ 467
+F + W+
Sbjct: 457 SKSFGEVLLKWK 468
>sp|Q94A84|U72E1_ARATH UDP-glycosyltransferase 72E1 OS=Arabidopsis thaliana GN=UGT72E1
PE=1 SV=1
Length = 487
Score = 234 bits (596), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 160/502 (31%), Positives = 249/502 (49%), Gaps = 74/502 (14%)
Query: 2 KKTIALYPGPAFHHMISMVELGK-LILQHRSDVSITILVPSMPLEESKTCSYINS----- 55
K +A++ P H+I ++ELGK L H DV+I +L +S+ ++NS
Sbjct: 5 KPHVAMFASPGMGHIIPVIELGKRLAGSHGFDVTIFVLETDAASAQSQ---FLNSPGCDA 61
Query: 56 ------------ISHRLNPIISFYYLPAIQMPSETLSRADIAIESIKLNSSNVFQALENI 103
IS ++P +F+ + + M ET+ IE ++
Sbjct: 62 ALVDIVGLPTPDISGLVDPS-AFFGIKLLVMMRETIPTIRSKIEEMQHKP---------- 110
Query: 104 SLTSKILSFIITSTTSFSYHPNIPTYTYFNSCASTLAAILYLPTLHNQITSS--FKDHPS 161
T+ I+ N+ TY + S A LA L+ PTL + K P
Sbjct: 111 --TALIVDLFGLDAIPLGGEFNMLTYIFIASNARFLAVALFFPTLDKDMEEEHIIKKQP- 167
Query: 162 SLLFIPGLPPVKSSFMPEPVLDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAI 221
+ +PG PV+ E LD +Y F+ + + +GII+NT+D +E + +K++
Sbjct: 168 --MVMPGCEPVRFEDTLETFLDPNSQLYREFVPFGSVFPTCDGIIVNTWDDMEPKTLKSL 225
Query: 222 VNGDCVTNGTTPPLHCIGPLI--VDAKDRAGGVSDDVSSDCLTWLDSQPSGSVVFLCFGS 279
+ + P++ IGPL VD + L WL+ QP SV+++ FGS
Sbjct: 226 QDPKLLGRIAGVPVYPIGPLSRPVDPSK--------TNHPVLDWLNKQPDESVLYISFGS 277
Query: 280 RGTFSAPQLKEIAIGLERSNQRFLWVVRNPSNAAEAE-----------------LPEGFL 322
G+ SA QL E+A GLE S QRF+WVVR P + + LPEGF+
Sbjct: 278 GGSLSAKQLTELAWGLEMSQQRFVWVVRPPVDGSACSAYLSANSGKIRDGTPDYLPEGFV 337
Query: 323 ERTKERGLVVKSWAPQSTILGHESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNS 382
RT ERG +V SWAPQ+ IL H++VGGF+THCGW+S++E+V GVPMIAWPL+AEQ +N+
Sbjct: 338 SRTHERGFMVSSWAPQAEILAHQAVGGFLTHCGWNSILESVVGGVPMIAWPLFAEQMMNA 397
Query: 383 VALVQEMKVAMPMFLNGEEETIGNGEGVVSAERVEERVRELMMGSEGKALRERSLEMRMM 442
L +E+ VA+ EGV++ +E VR++M+ EG +R++ +++
Sbjct: 398 TLLNEELGVAV-------RSKKLPSEGVITRAEIEALVRKIMVEEEGAEMRKKIKKLKET 450
Query: 443 AATAWNNNDGGSSFTAFSNLFD 464
AA + + DGG + + S + D
Sbjct: 451 AAESL-SCDGGVAHESLSRIAD 471
>sp|Q40287|UFOG5_MANES Anthocyanidin 3-O-glucosyltransferase 5 OS=Manihot esculenta GN=GT5
PE=2 SV=1
Length = 487
Score = 234 bits (596), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 158/487 (32%), Positives = 246/487 (50%), Gaps = 42/487 (8%)
Query: 2 KKTIALYPGPAFHHMISMVELGKLILQHRSDVSITILVPSMPLEESKTCSYINSISHRLN 61
K I L P H+I ++ELGK I+ + +TI + ++ ++++ +L
Sbjct: 9 KPHIVLLSSPGLGHLIPVLELGKRIVTL-CNFDVTIFMVGSDTSAAEPQVLRSAMTPKLC 67
Query: 62 PIISFY--YLPAIQMPSETL-SRADIAIESIKLNSSNVFQALENISLTSKILSFIITSTT 118
II + + P T+ +R + + I+ AL+ + I+ T +
Sbjct: 68 EIIQLPPPNISCLIDPEATVCTRLFVLMREIRPAFRAAVSALK-FRPAAIIVDLFGTESL 126
Query: 119 SFSYHPNIPTYTYFNSCASTLAAILYLPTLHNQITSSF--KDHPSSLLFIPGLPPVKSSF 176
+ I Y Y S A LA +Y+P L ++ F + P + IPG PV++
Sbjct: 127 EVAKELGIAKYVYIASNAWFLALTIYVPILDKEVEGEFVLQKEP---MKIPGCRPVRTEE 183
Query: 177 MPEPVLDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAIVNGDCVTNGTTPPLH 236
+ +P+LDR Y + + ++GI++NT++ LE A+ + + P+
Sbjct: 184 VVDPMLDRTNQQYSEYFRLGIEIPTADGILMNTWEALEPTTFGALRDVKFLGRVAKVPVF 243
Query: 237 CIGPLIVDAKDRAGGVSDDVSSDCLTWLDSQPSGSVVFLCFGSRGTFSAPQLKEIAIGLE 296
IGPL + +AG + + L WLD QP SVV++ FGS GT S Q+ E+A GLE
Sbjct: 244 PIGPL----RRQAGPCGSN--CELLDWLDQQPKESVVYVSFGSGGTLSLEQMIELAWGLE 297
Query: 297 RSNQRFLWVVRNPS---------------NAAEAELPEGFLERTKERGLVVKSWAPQSTI 341
RS QRF+WVVR P+ + PEGFL R + GLVV W+PQ I
Sbjct: 298 RSQQRFIWVVRQPTVKTGDAAFFTQGDGADDMSGYFPEGFLTRIQNVGLVVPQWSPQIHI 357
Query: 342 LGHESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAM-PMFLNGE 400
+ H SVG F++HCGW+SV+E++T GVP+IAWP+YAEQ +N+ L +E+ VA+ P L +
Sbjct: 358 MSHPSVGVFLSHCGWNSVLESITAGVPIIAWPIYAEQRMNATLLTEELGVAVRPKNLPAK 417
Query: 401 EETIGNGEGVVSAERVEERVRELMMGSEGKALRERSLEMRMMAATAWNNNDGGSSFTAFS 460
E VV E +E +R +M+ EG +R+R E++ A N+GGSSF S
Sbjct: 418 E--------VVKREEIERMIRRIMVDEEGSEIRKRVRELKDSGEKAL--NEGGSSFNYMS 467
Query: 461 NLFDLWQ 467
L + W+
Sbjct: 468 ALGNEWE 474
>sp|Q9LSY4|U71B8_ARATH UDP-glycosyltransferase 71B8 OS=Arabidopsis thaliana GN=UGT71B8
PE=3 SV=1
Length = 480
Score = 227 bits (578), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 152/484 (31%), Positives = 250/484 (51%), Gaps = 42/484 (8%)
Query: 2 KKTIALYPGPAFHHMISMVELGKLILQHRSDVSITILV-PSMPLEESKTCSYINSISHRL 60
K + P P H+ S E+ KL+++ + +SI+I++ P + ++ +YI+++S
Sbjct: 3 KFALVFVPFPILGHLKSTAEMAKLLVEQETRLSISIIILPLLSGDDVSASAYISALSAAS 62
Query: 61 NPIISFYYLPAIQMPSETLSRADIAIESIKLNSSNVFQAL----ENISLTSKILSFIITS 116
N + + + P+ L D I +K + + ++ L ++ S
Sbjct: 63 NDRLHYEVISDGDQPTVGL-HVDNHIPMVKRTVAKLVDDYSRRPDSPRLAGLVVDMFCIS 121
Query: 117 TTSFSYHPNIPTYTYFNSCASTLAAILYLPTLHNQITSS-----FKDHPSSLLFIPGLP- 170
+ ++P Y ++ S LA L++ L ++ S F+D +L +P L
Sbjct: 122 VIDVANEVSVPCYLFYTSNVGILALGLHIQMLFDKKEYSVSETDFEDS-EVVLDVPSLTC 180
Query: 171 PVKSSFMPEPVLDRQ-KPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAIVNGDCVTN 229
P +P + ++ P+Y LN + GI++NTF LE A++++ ++
Sbjct: 181 PYPVKCLPYGLATKEWLPMY---LNQGRRFREMKGILVNTFAELEPYALESLH-----SS 232
Query: 230 GTTPPLHCIGPLIVDAKDRAGGVSDDVSSDCLTWLDSQPSGSVVFLCFGSRGTFSAPQLK 289
G TP + +GPL+ ++ G D+ SD L WLD QP SVVFLCFGS G F+ Q +
Sbjct: 233 GDTPRAYPVGPLL-HLENHVDGSKDEKGSDILRWLDEQPPKSVVFLCFGSIGGFNEEQAR 291
Query: 290 EIAIGLERSNQRFLWVVRNPSNAAEAE-----------LPEGFLERTKERGLVVKSWAPQ 338
E+AI LERS RFLW +R S + E LPEGF +RTK++G V+ WAPQ
Sbjct: 292 EMAIALERSGHRFLWSLRRASRDIDKELPGEFKNLEEILPEGFFDRTKDKGKVI-GWAPQ 350
Query: 339 STILGHESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAMPM--F 396
+L ++GGFVTHCGW+S++E++ +GVP+ WPLYAEQ N+ +V+E+ +A+ + +
Sbjct: 351 VAVLAKPAIGGFVTHCGWNSILESLWFGVPIAPWPLYAEQKFNAFVMVEELGLAVKIRKY 410
Query: 397 LNGEEETIGNGEGVVSAERVEERVRELMMGSEGKALRERSLEMRMMAATAWNNNDGGSSF 456
G ++ +G +V+AE +E +R LM + +R R EM A DGGSS
Sbjct: 411 WRG-DQLVGTATVIVTAEEIERGIRCLM--EQDSDVRNRVKEMSKKCHMAL--KDGGSSQ 465
Query: 457 TAFS 460
+A
Sbjct: 466 SALK 469
>sp|O82383|U71D1_ARATH UDP-glycosyltransferase 71D1 OS=Arabidopsis thaliana GN=UGT71D1
PE=2 SV=1
Length = 467
Score = 227 bits (578), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 143/469 (30%), Positives = 247/469 (52%), Gaps = 35/469 (7%)
Query: 9 PGPAFHHMISMVELGKLILQHRSDVSITILVPSMPLE-ESKTCSYINSISHRLNPIISFY 67
P P H++ +E + +++ + ITIL+ M L+ +S +Y+ SI+ P + F
Sbjct: 10 PTPTVGHLVPFLEFARRLIEQDDRIRITILL--MKLQGQSHLDTYVKSIASS-QPFVRFI 66
Query: 68 YLPAIQ-MPSETLSRA------DIAIESIKLNSSNVFQ-----ALENISLTSKILSFIIT 115
+P ++ P+ +++ D+ +I L + V AL+ + + ++ F
Sbjct: 67 DVPELEEKPTLGSTQSVEAYVYDVIERNIPLVRNIVMDILTSLALDGVKVKGLVVDFFCL 126
Query: 116 STTSFSYHPNIPTYTYFNSCASTLAAILYLPTLHNQITSSFKDHPSSLLFIPG-LPPVKS 174
+ ++P Y + + + LA + YL H++ TS F + +L IPG + PV +
Sbjct: 127 PMIDVAKDISLPFYVFLTTNSGFLAMMQYLADRHSRDTSVFVRNSEEMLSIPGFVNPVPA 186
Query: 175 SFMPEPVLDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAIVNGDCVTNGTTPP 234
+ +P + YD ++ + +K+NGI++N+ +E ++ + P
Sbjct: 187 NVLPSALFVEDG--YDAYVKLAILFTKANGILVNSSFDIEPYSVNHFLQ-----EQNYPS 239
Query: 235 LHCIGPLIVDAKDRAGGVSDDVSSD-CLTWLDSQPSGSVVFLCFGSRGTFSAPQLKEIAI 293
++ +GP I D K + D D + WLD QP SVVFLCFGS +KEIA
Sbjct: 240 VYAVGP-IFDLKAQPHPEQDLTRRDELMKWLDDQPEASVVFLCFGSMARLRGSLVKEIAH 298
Query: 294 GLERSNQRFLWVVRNPSNAAEAELPEGFLERTKERGLVVKSWAPQSTILGHESVGGFVTH 353
GLE RFLW +R + +LPEGFL+R RG++ W+PQ IL H++VGGFV+H
Sbjct: 299 GLELCQYRFLWSLRK-EEVTKDDLPEGFLDRVDGRGMIC-GWSPQVEILAHKAVGGFVSH 356
Query: 354 CGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAMPMFLNGEEETIGNGEGVVSA 413
CGW+S+VE++ +GVP++ WP+YAEQ LN+ +V+E+K+A+ + L+ + + +V+A
Sbjct: 357 CGWNSIVESLWFGVPIVTWPMYAEQQLNAFLMVKELKLAVELKLDYRVHS----DEIVNA 412
Query: 414 ERVEERVRELMMGSEGKALRERSLEMRMMAATAWNNNDGGSSFTAFSNL 462
+E +R +M ++ +R+R +++ M A N GGSSF A
Sbjct: 413 NEIETAIR-YVMDTDNNVVRKRVMDISQMIQRATKN--GGSSFAAIEKF 458
>sp|O82385|U71D2_ARATH UDP-glycosyltransferase 71D2 OS=Arabidopsis thaliana GN=UGT71D2
PE=2 SV=1
Length = 467
Score = 226 bits (575), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 144/468 (30%), Positives = 238/468 (50%), Gaps = 33/468 (7%)
Query: 9 PGPAFHHMISMVELGKLILQHRSDVSITILVPSMPLEESKTCSYINSISHRLNPIISFYY 68
P P H++ +E + +++ + IT L+ +S SY+ +IS L P + F
Sbjct: 10 PTPTVGHLVPFLEFARRLIEQDDRIRITFLLMKQQ-GQSHLDSYVKTISSSL-PFVRFID 67
Query: 69 LPAIQMPSETLSRADIAIESIKLNSSNV------------FQALENISLTSKILSFIITS 116
+P ++ TL + +NV A + +++ + F
Sbjct: 68 VPELE-EKPTLGTQSVEAYVYDFIETNVPLVQNIIMGILSSPAFDGVTVKGFVADFFCLP 126
Query: 117 TTSFSYHPNIPTYTYFNSCASTLAAILYLPTLHNQITSSFKDHPSSLLFIPG-LPPVKSS 175
+ ++P Y + S + LA + YL H + TS F + +L IPG + PV +
Sbjct: 127 MIDVAKDASLPFYVFLTSNSGFLAMMQYLAYGHKKDTSVFARNSEEMLSIPGFVNPVPAK 186
Query: 176 FMPEPVLDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAIVNGDCVTNGTTPPL 235
+P + YD + + +K+NGI++NT +E ++ + + P +
Sbjct: 187 VLPSALFIEDG--YDADVKLAILFTKANGILVNTSFDIEPTSLNHFLGEE-----NYPSV 239
Query: 236 HCIGPLIVDAKDRAGGVSDDVSSD-CLTWLDSQPSGSVVFLCFGSRGTFSAPQLKEIAIG 294
+ +GP I + K D D + WLD+QP SVVFLCFGS G+ P +KEIA G
Sbjct: 240 YAVGP-IFNPKAHPHPDQDLACCDESMKWLDAQPEASVVFLCFGSMGSLRGPLVKEIAHG 298
Query: 295 LERSNQRFLWVVRNPSNAAEAELPEGFLERTKERGLVVKSWAPQSTILGHESVGGFVTHC 354
LE RFLW +R + LPEGF++R RG++ W+PQ IL H++VGGFV+HC
Sbjct: 299 LELCQYRFLWSLRTEEVTNDDLLPEGFMDRVSGRGMIC-GWSPQVEILAHKAVGGFVSHC 357
Query: 355 GWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAMPMFLNGEEETIGNGEGVVSAE 414
GW+S+VE++ +GVP++ WP+YAEQ LN+ +V+E+K+A+ + L + ++ +GE +VSA
Sbjct: 358 GWNSIVESLWFGVPIVTWPMYAEQQLNAFLMVKELKLAVELKL---DYSVHSGE-IVSAN 413
Query: 415 RVEERVRELMMGSEGKALRERSLEMRMMAATAWNNNDGGSSFTAFSNL 462
+E + +M + +R+R +++ M A N GGSSF A
Sbjct: 414 EIETAI-SCVMNKDNNVVRKRVMDISQMIQRATKN--GGSSFAAIEKF 458
>sp|Q9LVR1|U72E2_ARATH UDP-glycosyltransferase 72E2 OS=Arabidopsis thaliana GN=UGT72E2
PE=1 SV=1
Length = 481
Score = 225 bits (573), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 147/471 (31%), Positives = 243/471 (51%), Gaps = 40/471 (8%)
Query: 2 KKTIALYPGPAFHHMISMVELGKLILQHRSDVSITILVPSMPLEESKTCSYINSISHRLN 61
K A++ P H+I ++ELGK L + +T+ V +++ ++NS +
Sbjct: 5 KPHAAMFSSPGMGHVIPVIELGKR-LSANNGFHVTVFVLETDAASAQS-KFLNSTGVDIV 62
Query: 62 PIISFYYLPAIQMPSETLSRADIAIESIKLNSSNVFQALENISLTSKILSFIITSTTSFS 121
+ S + +++ + + + + A+ T+ I+ T +
Sbjct: 63 KLPSPDIYGLVDPDDHVVTKIGVIMRAAVPALRSKIAAMHQ-KPTALIVDLFGTDALCLA 121
Query: 122 YHPNIPTYTYFNSCASTLAAILYLPTLHNQITS--SFKDHPSSLLFIPGLPPVKSSFMPE 179
N+ +Y + + A L +Y P L I + + +P L IPG PV+ +
Sbjct: 122 KEFNMLSYVFIPTNARFLGVSIYYPNLDKDIKEEHTVQRNP---LAIPGCEPVRFEDTLD 178
Query: 180 PVLDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAIVNGDCVTNGTTPPLHCIG 239
L +P+Y F+ + + K++GI++NT++ +E +++K+++N + P++ IG
Sbjct: 179 AYLVPDEPVYRDFVRHGLAYPKADGILVNTWEEMEPKSLKSLLNPKLLGRVARVPVYPIG 238
Query: 240 PLIVDAKDRAGGVSDDVSSDCLTWLDSQPSGSVVFLCFGSRGTFSAPQLKEIAIGLERSN 299
PL + S + L WL+ QP+ SV+++ FGS G SA QL E+A GLE+S
Sbjct: 239 PLCRPIQ------SSETDHPVLDWLNEQPNESVLYISFGSGGCLSAKQLTELAWGLEQSQ 292
Query: 300 QRFLWVVRNP-----------SNAAEAE------LPEGFLERTKERGLVVKSWAPQSTIL 342
QRF+WVVR P +N E LPEGF+ RT +RG VV SWAPQ+ IL
Sbjct: 293 QRFVWVVRPPVDGSCCSEYVSANGGGTEDNTPEYLPEGFVSRTSDRGFVVPSWAPQAEIL 352
Query: 343 GHESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAMPMFLNGEEE 402
H +VGGF+THCGWSS +E+V GVPMIAWPL+AEQ +N+ L E+ +A + L+ +E
Sbjct: 353 SHRAVGGFLTHCGWSSTLESVVGGVPMIAWPLFAEQNMNAALLSDELGIA--VRLDDPKE 410
Query: 403 TIGNGEGVVSAERVEERVRELMMGSEGKALRERSLEMRMMAATAWNNNDGG 453
I S ++E VR++M EG+A+R + ++R A + + + GG
Sbjct: 411 DI-------SRWKIEALVRKVMTEKEGEAMRRKVKKLRDSAEMSLSIDGGG 454
>sp|Q9ZU72|U72D1_ARATH UDP-glycosyltransferase 72D1 OS=Arabidopsis thaliana GN=UGT72D1
PE=2 SV=1
Length = 470
Score = 223 bits (568), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 148/472 (31%), Positives = 241/472 (51%), Gaps = 39/472 (8%)
Query: 7 LYPGPAFHHMISMVELGKLILQHRSDVSITILVPSMPLEESKTCSYINSISHRLNPIISF 66
L P H+I ++ELG L ++ +TIL + I++ + R I
Sbjct: 8 LVASPGLGHLIPILELGNR-LSSVLNIHVTILAVTSGSSSPTETEAIHAAAART--ICQI 64
Query: 67 YYLPAIQM-----PSETL-SRADIAIESIKLNSSNVFQALENISLTSKILSFIITSTTSF 120
+P++ + P T+ ++ + + ++K + + ++ T I+ F+ T S
Sbjct: 65 TEIPSVDVDNLVEPDATIFTKMVVKMRAMKPAVRDAVKLMKR-KPTVMIVDFLGTELMSV 123
Query: 121 SYHPNI-PTYTYFNSCASTLAAILYLPTLHNQITSSFKDHPSSLLFIPGLPPVKSSFMPE 179
+ + Y Y + A LA ++YLP L + + D L IPG PV + E
Sbjct: 124 ADDVGMTAKYVYVPTHAWFLAVMVYLPVLDTVVEGEYVDIKEPL-KIPGCKPVGPKELME 182
Query: 180 PVLDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAIVNGDCVTNGTTPPLHCIG 239
+LDR Y + + S+G+++NT++ L+ + A+ + ++ P++ IG
Sbjct: 183 TMLDRSGQQYKECVRAGLEVPMSDGVLVNTWEELQGNTLAALREDEELSRVMKVPVYPIG 242
Query: 240 PLIVDAKDRAGGVSDDVSSDCLTWLDSQPSGSVVFLCFGSRGTFSAPQLKEIAIGLERSN 299
P++ + D + WLD Q SVVF+C GS GT + Q E+A+GLE S
Sbjct: 243 PIVRTNQHV------DKPNSIFEWLDEQRERSVVFVCLGSGGTLTFEQTVELALGLELSG 296
Query: 300 QRFLWVVRNPSN----------AAEAELPEGFLERTKERGLVVKSWAPQSTILGHESVGG 349
QRF+WV+R P++ A LPEGFL+RT+ G+VV WAPQ IL H S+GG
Sbjct: 297 QRFVWVLRRPASYLGAISSDDEQVSASLPEGFLDRTRGVGIVVTQWAPQVEILSHRSIGG 356
Query: 350 FVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAMPMFLNGEEETIGNGEG 409
F++HCGWSS +E++T GVP+IAWPLYAEQ++N+ L +E+ VA+ E IG
Sbjct: 357 FLSHCGWSSALESLTKGVPIIAWPLYAEQWMNATLLTEEIGVAVRTSELPSERVIGR--- 413
Query: 410 VVSAERVEERVRELMM--GSEGKALRERSLEMRMMAATAWNNNDGGSSFTAF 459
E V VR++M EG+ +R ++ E+R+ + AW+ + GSS+ +
Sbjct: 414 ----EEVASLVRKIMAEEDEEGQKIRAKAEEVRVSSERAWSKD--GSSYNSL 459
>sp|Q9LSY5|U71B7_ARATH UDP-glycosyltransferase 71B7 OS=Arabidopsis thaliana GN=UGT71B7
PE=2 SV=2
Length = 495
Score = 222 bits (566), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 159/494 (32%), Positives = 254/494 (51%), Gaps = 48/494 (9%)
Query: 1 MKKTIALYPGPAFHHMISMVELGKLILQHRSDVSITILV-PSMPLEESKTCSYINSISHR 59
MK + P P H+ S VE+ KL++ + +SI++++ P + E YI ++S
Sbjct: 1 MKFELVFIPYPGIGHLRSTVEMAKLLVDRETRLSISVIILPFISEGEVGASDYIAALSAS 60
Query: 60 LNPIISFYYLPAIQMPSETLSRADIAIESIKLN-SSNVFQALENISL---TSKILSFII- 114
N + + + A+ P+ ++ +I +++ + S V + LE+ S + KI F++
Sbjct: 61 SNNRLRYEVISAVDQPTIEMTTIEIHMKNQEPKVRSTVAKLLEDYSSKPDSPKIAGFVLD 120
Query: 115 ---TSTTSFSYHPNIPTYTYFNSCASTLAAILYLPTL--HNQITSSFKDHPSS--LLFIP 167
TS + P+Y ++ S A L+ ++ L N+ S D+ S +L P
Sbjct: 121 MFCTSMVDVANEFGFPSYMFYTSSAGILSVTYHVQMLCDENKYDVSENDYADSEAVLNFP 180
Query: 168 GLP---PVKSSFMPEPVL-DRQKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAIVN 223
L PVK +P + + P+ F+N + + GI++NT LE +K + +
Sbjct: 181 SLSRPYPVKC--LPHALAANMWLPV---FVNQARKFREMKGILVNTVAELEPYVLKFLSS 235
Query: 224 GDCVTNGTTPPLHCIGPLIVDAKDRAGGVSDDVSSDCLTWLDSQPSGSVVFLCFGSRGTF 283
D TPP++ +GPL+ +++ D+ + + WLD QP SVVFLCFGS G F
Sbjct: 236 SD------TPPVYPVGPLL-HLENQRDDSKDEKRLEIIRWLDQQPPSSVVFLCFGSMGGF 288
Query: 284 SAPQLKEIAIGLERSNQRFLWVVRNPS-----------NAAEAELPEGFLERTKERGLVV 332
Q++EIAI LERS RFLW +R S E LPEGF +RTK+ G V+
Sbjct: 289 GEEQVREIAIALERSGHRFLWSLRRASPNIFKELPGEFTNLEEVLPEGFFDRTKDIGKVI 348
Query: 333 KSWAPQSTILGHESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVA 392
WAPQ +L + ++GGFVTHCGW+S +E++ +GVP AWPLYAEQ N+ +V+E+ +A
Sbjct: 349 -GWAPQVAVLANPAIGGFVTHCGWNSTLESLWFGVPTAAWPLYAEQKFNAFLMVEELGLA 407
Query: 393 MPM--FLNGEEETIGNGEGVVSAERVEERVRELMMGSEGKALRERSLEMRMMAATAWNNN 450
+ + + G E G V+AE +E+ + LM + +R+R +M A
Sbjct: 408 VEIRKYWRG-EHLAGLPTATVTAEEIEKAIMCLM--EQDSDVRKRVKDMSEKCHVAL--M 462
Query: 451 DGGSSFTAFSNLFD 464
DGGSS TA +
Sbjct: 463 DGGSSRTALQKFIE 476
>sp|Q9LML6|U71C4_ARATH UDP-glycosyltransferase 71C4 OS=Arabidopsis thaliana GN=UGT71C4
PE=2 SV=2
Length = 479
Score = 221 bits (562), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 162/481 (33%), Positives = 244/481 (50%), Gaps = 58/481 (12%)
Query: 9 PGPAFHHMISMVELGKLI--LQHRSDVSITIL---VPSMPLEESKTCSYINSISHRLNPI 63
P P+ H++ +E K + L HR +ITIL PS P S I S P
Sbjct: 11 PVPSTGHILVHIEFAKRLINLDHRIH-TITILNLSSPSSPHASVFARSLIAS-----QPK 64
Query: 64 ISFYYLPAIQMPS--ETLSRADIA--IESIKLNSSNVFQALENISLTSK----------- 108
I + LP IQ P + RA A ++ IK N+ + A+ +I + +
Sbjct: 65 IRLHDLPPIQDPPPFDLYQRAPEAYIVKLIKKNTPLIKDAVSSIVASRRGGSDSVQVAGL 124
Query: 109 ILSFIITS-TTSFSYHPNIPTYTYFNSCASTLAAILYLPTLHNQITSSFK-DHPSSLLFI 166
+L S N+P+Y Y A L + Y+P H +I S F L +
Sbjct: 125 VLDLFCNSLVKDVGNELNLPSYIYLTCNARYLGMMKYIPDRHRKIASEFDLSSGDEELPV 184
Query: 167 PG-LPPVKSSFMPEPVLDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAIVNGD 225
PG + + + FMP + +++ Y+ ++ + + + GI++N+F LE + +
Sbjct: 185 PGFINAIPTKFMPPGLFNKEA--YEAYVELAPRFADAKGILVNSFTELEPHPFDYFSHLE 242
Query: 226 CVTNGTTPPLHCIGPLIVDAKDRAGGVSDDVSSDCLT-WLDSQPSGSVVFLCFGSRGTFS 284
PP++ +GP I+ KDRA + V D + WLD QP SVVFLCFGSRG+
Sbjct: 243 -----KFPPVYPVGP-ILSLKDRASPNEEAVDRDQIVGWLDDQPESSVVFLCFGSRGSVD 296
Query: 285 APQLKEIAIGLERSNQRFLWVVR-------NPSNAAEAELPEGFLERTKERGLVVKSWAP 337
PQ+KEIA LE RFLW +R NP++ LPEGF+ R RGLV WAP
Sbjct: 297 EPQVKEIARALELVGCRFLWSIRTSGDVETNPNDV----LPEGFMGRVAGRGLVC-GWAP 351
Query: 338 QSTILGHESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAMPMFL 397
Q +L H+++GGFV+HCGW+S +E++ +GVP+ WP+YAEQ LN+ LV+E+ +A+ + +
Sbjct: 352 QVEVLAHKAIGGFVSHCGWNSTLESLWFGVPVATWPMYAEQQLNAFTLVKELGLAVDLRM 411
Query: 398 NGEEETIGNGEGVVSAERVEERVRELMMGSEGKALRERSLEMRMMAATAWNNNDGGSSFT 457
+ + + G+V+ + + VR LM G + K R++ EM A A DGGSS
Sbjct: 412 D----YVSSRGGLVTCDEIARAVRSLMDGGDEK--RKKVKEMADAARKAL--MDGGSSSL 463
Query: 458 A 458
A
Sbjct: 464 A 464
>sp|O81498|U72E3_ARATH UDP-glycosyltransferase 72E3 OS=Arabidopsis thaliana GN=UGT72E3
PE=1 SV=1
Length = 481
Score = 220 bits (561), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 142/480 (29%), Positives = 241/480 (50%), Gaps = 40/480 (8%)
Query: 2 KKTIALYPGPAFHHMISMVELGK-LILQHRSDVSITILVPSMPLEESKTCSYIN-SISHR 59
K A++ P H++ ++EL K L H V++ +L +SK + I +
Sbjct: 5 KPHAAMFSSPGMGHVLPVIELAKRLSANHGFHVTVFVLETDAASVQSKLLNSTGVDIVNL 64
Query: 60 LNPIISFYYLPAIQMPSETLSRADIAIESIKLNSSNVFQALENISLTSKILSFIITSTTS 119
+P IS P + ++ I E++ S + +N T+ I+ T
Sbjct: 65 PSPDISGLVDPNAHVVTKI---GVIMREAVPTLRSKIVAMHQNP--TALIIDLFGTDALC 119
Query: 120 FSYHPNIPTYTYFNSCASTLAAILYLPTLHNQITSSFKDHPSSLLFIPGLPPVKSSFMPE 179
+ N+ TY + S A L +Y PTL I L IPG PV+ + +
Sbjct: 120 LAAELNMLTYVFIASNARYLGVSIYYPTLDEVIKEEHTVQRKPLT-IPGCEPVRFEDIMD 178
Query: 180 PVLDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAIVNGDCVTNGTTPPLHCIG 239
L +P+Y + + + K++GI++NT++ +E +++K++ + + P++ +G
Sbjct: 179 AYLVPDEPVYHDLVRHCLAYPKADGILVNTWEEMEPKSLKSLQDPKLLGRVARVPVYPVG 238
Query: 240 PLIVDAKDRAGGVSDDVSSDCLTWLDSQPSGSVVFLCFGSRGTFSAPQLKEIAIGLERSN 299
PL + S WL+ QP+ SV+++ FGS G+ +A QL E+A GLE S
Sbjct: 239 PLCRPIQ------SSTTDHPVFDWLNKQPNESVLYISFGSGGSLTAQQLTELAWGLEESQ 292
Query: 300 QRFLWVVRNPSNAAEAE-----------------LPEGFLERTKERGLVVKSWAPQSTIL 342
QRF+WVVR P + + LPEGF+ RT +RG ++ SWAPQ+ IL
Sbjct: 293 QRFIWVVRPPVDGSSCSDYFSAKGGVTKDNTPEYLPEGFVTRTCDRGFMIPSWAPQAEIL 352
Query: 343 GHESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAMPMFLNGEEE 402
H++VGGF+THCGWSS +E+V GVPMIAWPL+AEQ +N+ L E+ +++
Sbjct: 353 AHQAVGGFLTHCGWSSTLESVLCGVPMIAWPLFAEQNMNAALLSDELGISV--------- 403
Query: 403 TIGNGEGVVSAERVEERVRELMMGSEGKALRERSLEMRMMAATAWNNNDGGSSFTAFSNL 462
+ + + +S ++E VR++M EG+ +R + ++R A + + + GGS+ + +
Sbjct: 404 RVDDPKEAISRSKIEAMVRKVMAEDEGEEMRRKVKKLRDTAEMSLSIHGGGSAHESLCRV 463
>sp|Q9LSY9|U71B1_ARATH UDP-glycosyltransferase 71B1 OS=Arabidopsis thaliana GN=UGT71B1
PE=2 SV=1
Length = 473
Score = 219 bits (559), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 152/484 (31%), Positives = 248/484 (51%), Gaps = 42/484 (8%)
Query: 1 MKKTIALYPGPAFHHMISMVELGKLILQHRSDVSIT-ILVPSMPLEESKTCSYINSISHR 59
MK + P P H+ + L KL++ + +S+T I++PS +++ + Y NS R
Sbjct: 1 MKVELVFIPSPGVGHIRATTALAKLLVASDNRLSVTLIVIPSRVSDDASSSVYTNS-EDR 59
Query: 60 LNPIISFYYLPAIQMPSETLSRADIAIESIKLNSSNVFQALENIS---LTSKILSFIITS 116
L I+ LPA ++ +S D ++ S V + S L ++ TS
Sbjct: 60 LRYIL----LPARDQTTDLVSYIDSQKPQVRAVVSKVAGDVSTRSDSRLAGIVVDMFCTS 115
Query: 117 TTSFSYHPNIPTYTYFNSCASTLAAILYLPTLHNQI---TSSFKDHPSSLLF-IPGL-PP 171
+ N+ Y ++ S AS L ++ +L+++ S FKD + + F +P L P
Sbjct: 116 MIDIADEFNLSAYIFYTSNASYLGLQFHVQSLYDEKELDVSEFKD--TEMKFDVPTLTQP 173
Query: 172 VKSSFMPEPVLDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAIVNGDCVTNGT 231
+ +P +L+++ + + L + S + GI++N+ +E QA+ G+ TN
Sbjct: 174 FPAKCLPSVMLNKK--WFPYVLGRARSFRATKGILVNSVADMEPQALSFFSGGNGNTN-- 229
Query: 232 TPPLHCIGPLIVDAKDRAGGVSDDVSSDCLTWLDSQPSGSVVFLCFGSRGTFSAPQLKEI 291
PP++ +GP++ D ++ + L WL QP+ SVVFLCFGS G FS Q +EI
Sbjct: 230 IPPVYAVGPIM----DLESSGDEEKRKEILHWLKEQPTKSVVFLCFGSMGGFSEEQAREI 285
Query: 292 AIGLERSNQRFLWVVR------NPSNAAEAE-------LPEGFLERTKERGLVVKSWAPQ 338
A+ LERS RFLW +R N SN E LP+GFL+RT E G ++ SWAPQ
Sbjct: 286 AVALERSGHRFLWSLRRASPVGNKSNPPPGEFTNLEEILPKGFLDRTVEIGKII-SWAPQ 344
Query: 339 STILGHESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAMPMFLN 398
+L ++G FVTHCGW+S++E++ +GVPM AWP+YAEQ N+ +V E+ +A +
Sbjct: 345 VDVLNSPAIGAFVTHCGWNSILESLWFGVPMAAWPIYAEQQFNAFHMVDELGLAAEVKKE 404
Query: 399 GEEETIGNGEGVVSAERVEERVRELMMGSEGKALRERSLEMRMMAATAWNNNDGGSSFTA 458
+ + +V+A+ +E ++ M + +R+R +EM+ A DGGSS A
Sbjct: 405 YRRDFLVEEPEIVTADEIERGIKCAM--EQDSKMRKRVMEMKDKLHVAL--VDGGSSNCA 460
Query: 459 FSNL 462
Sbjct: 461 LKKF 464
>sp|Q40285|UFOG2_MANES Anthocyanidin 3-O-glucosyltransferase 2 (Fragment) OS=Manihot
esculenta GN=GT2 PE=2 SV=1
Length = 346
Score = 218 bits (556), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 128/352 (36%), Positives = 200/352 (56%), Gaps = 39/352 (11%)
Query: 125 NIPTYTYFNSCASTLAAILYLPTLHNQITSS---FKDHPSSLLFIPGLPPVKSSFMPEPV 181
IP+Y +F S L +LY+ +H++ + FKD + L+ + P + +P +
Sbjct: 13 GIPSYIFFASGGGFLGFMLYVQKIHDEENFNPIEFKDSDTELIVPSLVNPFPTRILPSSI 72
Query: 182 LDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAIVNGDCVTNGTTPPLHCIGPL 241
L++++ + L + ++ GII+NTF LE +AI++ PPL+ +GP
Sbjct: 73 LNKER--FGQLLAIAKKFRQAKGIIVNTFLELESRAIESF---------KVPPLYHVGP- 120
Query: 242 IVDAKDRAGGVSDDVSSDCLTWLDSQPSGSVVFLCFGSRGTFSAPQLKEIAIGLERSNQR 301
I+D K + + + WLD QP GSVVFLCFGS G+FS QLKEIA LE S R
Sbjct: 121 ILDVKSDG----RNTHPEIMQWLDDQPEGSVVFLCFGSMGSFSEDQLKEIAYALENSGHR 176
Query: 302 FLWVVRNP------SNAAEAE-----LPEGFLERTKERGLVVKSWAPQSTILGHESVGGF 350
FLW +R P ++ + E LPEGFLERT G V+ WAPQ +L H ++GGF
Sbjct: 177 FLWSIRRPPPPDKIASPTDYEDPRDVLPEGFLERTVAVGKVI-GWAPQVAVLAHPAIGGF 235
Query: 351 VTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAMPMFLNGEEETIGNGEGV 410
V+HCGW+SV+E++ +GVP+ WP+YAEQ N+ +V E+ + + + + +E+ +
Sbjct: 236 VSHCGWNSVLESLWFGVPIATWPMYAEQQFNAFEMVVELGLGVEIDMGYRKES----GII 291
Query: 411 VSAERVEERVRELMMGSEGKALRERSLEMRMMAATAWNNNDGGSSFTAFSNL 462
V+++++E +R+LM S+ K R++ EMR + A DGGSSF + +
Sbjct: 292 VNSDKIERAIRKLMENSDEK--RKKVKEMREKSKMAL--IDGGSSFISLGDF 339
>sp|Q9LML7|U71C3_ARATH UDP-glycosyltransferase 71C3 OS=Arabidopsis thaliana GN=UGT71C3
PE=2 SV=1
Length = 476
Score = 216 bits (551), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 157/483 (32%), Positives = 241/483 (49%), Gaps = 51/483 (10%)
Query: 8 YPGPAFHHMISMVELGKLILQHRSDV-SITILVPSMPLEESKTCSYINSISHRLNPIISF 66
YP P H++ +E K +++ + +ITIL ++PL + ++ + P I
Sbjct: 12 YPSPG--HLLVSIEFAKSLIKRDDRIHTITILYWALPLAPQAHLFAKSLVASQ--PRIRL 67
Query: 67 YYLPAIQMPS--ETLSRADIA--IESIKLNSSNVFQALENISLTSK----------ILSF 112
LP +Q P E +A A +ES K V AL + + K ++ F
Sbjct: 68 LALPDVQNPPPLELFFKAPEAYILESTKKTVPLVRDALSTLVSSRKESGSVRVVGLVIDF 127
Query: 113 IITSTTSFSYHPNIPTYTYFNSCASTLAAILYLPTLHNQITSSFK------DHPSSLLFI 166
+ N+P+Y + A L+ + YLP H TS +HP I
Sbjct: 128 FCVPMIEVANELNLPSYIFLTCNAGFLSMMKYLPERHRITTSELDLSSGNVEHP-----I 182
Query: 167 PG-LPPVKSSFMPEPVLDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAIVNGD 225
PG + V + +P + R+ Y+ ++ + + GI++N+ LEQ A D
Sbjct: 183 PGYVCSVPTKVLPPGLFVRES--YEAWVEIAEKFPGAKGILVNSVTCLEQNAFDYFARLD 240
Query: 226 CVTNGTTPPLHCIGPLIVDAKDRAGGVSDDVSSD-CLTWLDSQPSGSVVFLCFGSRGTFS 284
PP++ +GP ++ KDR D D + WL+ QP S+V++CFGS G
Sbjct: 241 ----ENYPPVYPVGP-VLSLKDRPSPNLDASDRDRIMRWLEDQPESSIVYICFGSLGIIG 295
Query: 285 APQLKEIAIGLERSNQRFLWVVR-NPSNAAEAE--LPEGFLERTKERGLVVKSWAPQSTI 341
Q++EIA LE + RFLW +R NP+ A LPEGFL+RT +GLV WAPQ +
Sbjct: 296 KLQIEEIAEALELTGHRFLWSIRTNPTEKASPYDLLPEGFLDRTASKGLVC-DWAPQVEV 354
Query: 342 LGHESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAMPMFLNGEE 401
L H+++GGFV+HCGW+SV+E++ +GVP+ WP+YAEQ LN+ ++V+E+ +A+ + L+
Sbjct: 355 LAHKALGGFVSHCGWNSVLESLWFGVPIATWPMYAEQQLNAFSMVKELGLAVELRLD--- 411
Query: 402 ETIGNGEGVVSAERVEERVRELMMGSEGKALRERSLEMRMMAATAWNNNDGGSSFTAFSN 461
GE +V AE + +R LM G + R+R EM A A DGGSSF A
Sbjct: 412 YVSAYGE-IVKAEEIAGAIRSLMDGEDTP--RKRVKEMAEAARNAL--MDGGSSFVAVKR 466
Query: 462 LFD 464
D
Sbjct: 467 FLD 469
>sp|O23382|U71B5_ARATH UDP-glycosyltransferase 71B5 OS=Arabidopsis thaliana GN=UGT71B5
PE=3 SV=1
Length = 478
Score = 210 bits (534), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 157/493 (31%), Positives = 242/493 (49%), Gaps = 53/493 (10%)
Query: 1 MKKTIALYPGPAFHHMISMVELGKLILQHRSDVSITILVPSMPLEESKTCSYINSIS--- 57
MK + P P H+ V+L K ++ + +SITI++ + + I S++
Sbjct: 1 MKIELVFIPLPGIGHLRPTVKLAKQLIGSENRLSITIIIIPSRFDAGDASACIASLTTLS 60
Query: 58 --HRLN-PIISFYYLPAIQMPSETLSRADIAIESIKLNSSNVFQALENISLTSKILSFII 114
RL+ IS P P ++ I + K+ + A + T K+ F++
Sbjct: 61 QDDRLHYESISVAKQPPTSDPDPVPAQVYIEKQKTKVRDA---VAARIVDPTRKLAGFVV 117
Query: 115 ----TSTTSFSYHPNIPTYTYFNSCASTLAAILYLPTLHNQITSSFKDHPSSL--LFIPG 168
+S + +P Y + S A+ L +L++ +++Q + +S+ L P
Sbjct: 118 DMFCSSMIDVANEFGVPCYMVYTSNATFLGTMLHVQQMYDQKKYDVSELENSVTELEFPS 177
Query: 169 LP---PVKSSFMPEPVLDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAI-VNG 224
L PVK P + K L + K GI++NT LE A+K +NG
Sbjct: 178 LTRPYPVKCL----PHILTSKEWLPLSLAQARCFRKMKGILVNTVAELEPHALKMFNING 233
Query: 225 DCVTNGTTPPLHCIGPLIVDAKDRAGGVSDDVSSDCLTWLDSQPSGSVVFLCFGSRGTFS 284
D P ++ +GP++ G D+ S+ L WLD QPS SVVFLCFGS G F+
Sbjct: 234 D-----DLPQVYPVGPVL---HLENGNDDDEKQSEILRWLDEQPSKSVVFLCFGSLGGFT 285
Query: 285 APQLKEIAIGLERSNQRFLWVVRNPS-----------NAAEAELPEGFLERTKERGLVVK 333
Q +E A+ L+RS QRFLW +R+ S E LPEGFLERT +RG V+
Sbjct: 286 EEQTRETAVALDRSGQRFLWCLRHASPNIKTDRPRDYTNLEEVLPEGFLERTLDRGKVI- 344
Query: 334 SWAPQSTILGHESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAM 393
WAPQ +L ++GGFVTHCGW+S++E++ +GVPM+ WPLYAEQ +N+ +V+E+ +A+
Sbjct: 345 GWAPQVAVLEKPAIGGFVTHCGWNSILESLWFGVPMVTWPLYAEQKVNAFEMVEELGLAV 404
Query: 394 PM--FLNGEEETIGNGE-GVVSAERVEERVRELMMGSEGKALRERSLEMRMMAATAWNNN 450
+ +L G+ + GE V+AE +E +R +M + +R EM A
Sbjct: 405 EIRKYLKGD---LFAGEMETVTAEDIERAIRRVM--EQDSDVRNNVKEMAEKCHFAL--M 457
Query: 451 DGGSSFTAFSNLF 463
DGGSS A
Sbjct: 458 DGGSSKAALEKFI 470
>sp|Q9LSY6|U71B6_ARATH UDP-glycosyltransferase 71B6 OS=Arabidopsis thaliana GN=UGT71B6
PE=1 SV=1
Length = 479
Score = 209 bits (533), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 156/489 (31%), Positives = 247/489 (50%), Gaps = 52/489 (10%)
Query: 1 MKKTIALYPGPAFHHMISMVELGKLILQHRSDVSITILVPSMPLEESKTCSYINSIS--H 58
MK + P PA H+++ VE+ + ++ ++SIT+++ S SK S I S++ +
Sbjct: 1 MKIELVFIPSPAISHLMATVEMAEQLVDKNDNLSITVIIISF---SSKNTSMITSLTSNN 57
Query: 59 RLN-PIISFYYLPAIQMPSETLSRADIAIESIKLNSSNVFQALENISLTS--KILSFII- 114
RL IIS Q P+E L D I+S+K + L + +L ++ F++
Sbjct: 58 RLRYEIIS----GGDQQPTE-LKATDSHIQSLKPLVRDAVAKLVDSTLPDAPRLAGFVVD 112
Query: 115 ---TSTTSFSYHPNIPTYTYFNSCASTLAAILYLPTLHNQ----ITSSFKDHPSSLLFIP 167
TS + +P+Y ++ S A L +L++ +++ S +D L+ +P
Sbjct: 113 MYCTSMIDVANEFGVPSYLFYTSNAGFLGLLLHIQFMYDAEDIYDMSELEDSDVELV-VP 171
Query: 168 GLP---PVKSSFMPEPVLDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAIVNG 224
L P+K P + + K FF+ + ++ GI++NT LE QA+ +
Sbjct: 172 SLTSPYPLKCL----PYIFKSKEWLTFFVTQARRFRETKGILVNTVPDLEPQALTFL--- 224
Query: 225 DCVTNGTTPPLHCIGPLIVDAKDRAGGVSDDVSSDCLTWLDSQPSGSVVFLCFGSRGTFS 284
+NG P + +GPL+ K+ D S+ L WLD QP SVVFLCFGS G FS
Sbjct: 225 ---SNGNIPRAYPVGPLL-HLKNVNCDYVDKKQSEILRWLDEQPPRSVVFLCFGSMGGFS 280
Query: 285 APQLKEIAIGLERSNQRFLWVVRNPS-----------NAAEAELPEGFLERTKERGLVVK 333
Q++E A+ L+RS RFLW +R S E LPEGF +RT RG V+
Sbjct: 281 EEQVRETALALDRSGHRFLWSLRRASPNILREPPGEFTNLEEILPEGFFDRTANRGKVI- 339
Query: 334 SWAPQSTILGHESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAM 393
WA Q IL ++GGFV+H GW+S +E++ +GVPM WPLYAEQ N+ +V+E+ +A+
Sbjct: 340 GWAEQVAILAKPAIGGFVSHGGWNSTLESLWFGVPMAIWPLYAEQKFNAFEMVEELGLAV 399
Query: 394 PMFLNGEEETIGNGEGVVSAERVEERVRELMMGSEGKALRERSLEMRMMAATAWNNNDGG 453
+ + + + +V+AE +E+ + LM + +R+R E+ A DGG
Sbjct: 400 EIKKHWRGDLLLGRSEIVTAEEIEKGIICLM--EQDSDVRKRVNEISEKCHVAL--MDGG 455
Query: 454 SSFTAFSNL 462
SS TA
Sbjct: 456 SSETALKRF 464
>sp|O82381|U71C1_ARATH UDP-glycosyltransferase 71C1 OS=Arabidopsis thaliana GN=UGT71C1
PE=1 SV=1
Length = 481
Score = 209 bits (531), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 151/476 (31%), Positives = 241/476 (50%), Gaps = 41/476 (8%)
Query: 5 IALYPGPAFHHMISMVELGK-LILQHRSDV-SITILVPSMP-LEESKTCSYINSISHRLN 61
+ + P P H+++ +EL K LI Q + +ITIL +P + ++ T +++ S+
Sbjct: 9 LVIIPFPFSGHILATIELAKRLISQDNPRIHTITILYWGLPFIPQADTIAFLRSLVKN-E 67
Query: 62 PIISFYYLPAIQMPSETLSRADIA----IESIKLNSSNVFQALE----------NISLTS 107
P I LP +Q P + A +E +K + +AL ++ +
Sbjct: 68 PRIRLVTLPEVQDPPPMELFVEFAESYILEYVKKMVPIIREALSTLLSSRDESGSVRVAG 127
Query: 108 KILSFIITSTTSFSYHPNIPTYTYFNSCASTLAAILYLPTLHNQITSSF-KDHPSSLLFI 166
+L F N+P+Y + A L + YLP H +I S F + L I
Sbjct: 128 LVLDFFCVPMIDVGNEFNLPSYIFLTCSAGFLGMMKYLPERHREIKSEFNRSFNEELNLI 187
Query: 167 PG-LPPVKSSFMPEPVLDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAIVNGD 225
PG + V + +P + K Y+ ++ + ++ GI++N++ LE K
Sbjct: 188 PGYVNSVPTKVLPSGLF--MKETYEPWVELAERFPEAKGILVNSYTALEPNGFKYF--DR 243
Query: 226 CVTNGTTPPLHCIGPLIVDAKDRAGGVSDDVSSDCLTWLDSQPSGSVVFLCFGSRGTFSA 285
C N P ++ IGP++ + DR S + +TWLD QP SVVFLCFGS SA
Sbjct: 244 CPDN--YPTIYPIGPILC-SNDRPNLDSSE-RDRIITWLDDQPESSVVFLCFGSLKNLSA 299
Query: 286 PQLKEIAIGLERSNQRFLWVVR-NPSNAAE--AELPEGFLERTKERGLVVKSWAPQSTIL 342
Q+ EIA LE + +F+W R NP A LP GF++R ++G+V WAPQ IL
Sbjct: 300 TQINEIAQALEIVDCKFIWSFRTNPKEYASPYEALPHGFMDRVMDQGIVC-GWAPQVEIL 358
Query: 343 GHESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAMPMFLNGEEE 402
H++VGGFV+HCGW+S++E++ +GVP+ WP+YAEQ LN+ +V+E+ +A+ M L+ E
Sbjct: 359 AHKAVGGFVSHCGWNSILESLGFGVPIATWPMYAEQQLNAFTMVKELGLALEMRLDYVSE 418
Query: 403 TIGNGEGVVSAERVEERVRELMMGSEGKALRERSLEMRMMAATAWNNNDGGSSFTA 458
+G+ +V A+ + VR LM G + + ++ +A DGGSSF A
Sbjct: 419 ---DGD-IVKADEIAGTVRSLMDGVDVPKSK-----VKEIAEAGKEAVDGGSSFLA 465
>sp|Q9LSY8|U71B2_ARATH UDP-glycosyltransferase 71B2 OS=Arabidopsis thaliana GN=UGT71B2
PE=1 SV=1
Length = 485
Score = 208 bits (529), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 159/486 (32%), Positives = 242/486 (49%), Gaps = 40/486 (8%)
Query: 1 MKKTIALYPGPAFHHMISMVELGKLILQHRSDVSITILV-PSM-PLEESKTCSYINSISH 58
MK + P P H+ +VE+ KL + +SITI++ P M S + SYI S+S
Sbjct: 1 MKLELVFIPSPGDGHLRPLVEVAKLHVDRDDHLSITIIIIPQMHGFSSSNSSSYIASLSS 60
Query: 59 RLNPIISFYYLPAIQMPSETLSRADI--AIESIKLNSSNVFQALENI---SLTSKILSFI 113
+S+ L P ++ I++ K + L + S++ F+
Sbjct: 61 DSEERLSYNVLSVPDKPDSDDTKPHFFDYIDNFKPQVKATVEKLTDPGPPDSPSRLAGFV 120
Query: 114 ITS----TTSFSYHPNIPTYTYFNSCASTLA---AILYLPTLHNQITSSFKDHPSSLLFI 166
+ + +P+Y ++ S A+ L + YL + N S KD ++ L +
Sbjct: 121 VDMFCMMMIDVANEFGVPSYMFYTSNATFLGLQVHVEYLYDVKNYDVSDLKDSDTTELEV 180
Query: 167 PGLP-PVKSSFMPEPVLDRQ-KPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAIVNG 224
P L P+ P +L ++ P+ + ++ GI++NTF LE QA+K
Sbjct: 181 PCLTRPLPVKCFPSVLLTKEWLPV---MFRQTRRFRETKGILVNTFAELEPQAMKFFSGV 237
Query: 225 DCVTNGTTPPLHCIGPLIVDAKDRAGGVSDDVSSDCLTWLDSQPSGSVVFLCFGSRGTFS 284
D P ++ +GP +++ K SDD S+ L WLD QP SVVFLCFGS G F
Sbjct: 238 D----SPLPTVYTVGP-VMNLKINGPNSSDDKQSEILRWLDEQPRKSVVFLCFGSMGGFR 292
Query: 285 APQLKEIAIGLERSNQRFLWVVRNPSNAA-----------EAELPEGFLERTKERGLVVK 333
Q KEIAI LERS RF+W +R E LPEGFLERT E G +V
Sbjct: 293 EGQAKEIAIALERSGHRFVWSLRRAQPKGSIGPPEEFTNLEEILPEGFLERTAEIGKIV- 351
Query: 334 SWAPQSTILGHESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAM 393
WAPQS IL + ++GGFV+HCGW+S +E++ +GVPM WPLYAEQ +N+ +V+E+ +A+
Sbjct: 352 GWAPQSAILANPAIGGFVSHCGWNSTLESLWFGVPMATWPLYAEQQVNAFEMVEELGLAV 411
Query: 394 PMFLNGEEETIGNGEGVVSAERVEERVRELMMGSEGKALRERSLEMRMMAATAWNNNDGG 453
+ + + + + +++AE +E +R LM + +R R EM + A DGG
Sbjct: 412 EVRNSFRGDFMAADDELMTAEEIERGIRCLM--EQDSDVRSRVKEMSEKSHVAL--MDGG 467
Query: 454 SSFTAF 459
SS A
Sbjct: 468 SSHVAL 473
>sp|Q40284|UFOG1_MANES Anthocyanidin 3-O-glucosyltransferase 1 OS=Manihot esculenta GN=GT1
PE=2 SV=1
Length = 449
Score = 208 bits (529), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 141/466 (30%), Positives = 241/466 (51%), Gaps = 43/466 (9%)
Query: 15 HMISMVELGKLILQHRSDVSITILVPSMPLEESKTCSYINSISHRLNPIISFYYLP---- 70
H++S VE KL+L +SIT+L+ + + SK +Y++S + + F YLP
Sbjct: 3 HLVSAVETAKLLLSRCHSLSITVLIFNNSVVTSKVHNYVDSQIASSSNRLRFIYLPRDET 62
Query: 71 AIQMPSETLSRADIAIESIKLNSSNVFQALENISLTSKILSFIITSTTSFSYHPNIPTYT 130
I S + + ++ + + ++E+ L I+ T+ + +P+Y
Sbjct: 63 GISSFSSLIEKQKPHVKESVMKITEFGSSVESPRLVGFIVDMFCTAMIDVANEFGVPSYI 122
Query: 131 YFNSCASTLAAILYLPTLHNQITSSFKDHPSS--LLFIPGL-PPVKSSFMPEPVLDRQKP 187
++ S A+ L +L++ +H++ + + +S L +PGL S MP +L +Q
Sbjct: 123 FYTSGAAFLNFMLHVQKIHDEENFNPTEFNASDGELQVPGLVNSFPSKAMPTAILSKQ-- 180
Query: 188 IYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAIVNGDCVTNGTTPPLHCIGPLIVDAKD 247
+ L + ++ G+IINTF LE AI++ + PP++ +GP I+D +
Sbjct: 181 WFPPLLENTRRYGEAKGVIINTFFELESHAIESFKD---------PPIYPVGP-ILDVRS 230
Query: 248 RAGGVSDDVSSDCLTWLDSQPSGSVVFLCFGSRGTFSAPQLKEIAIGLERSNQRFLW--- 304
+ ++ + WLD QP SVVFLCFGS G+FS Q+KEIA LE S RFLW
Sbjct: 231 NGRNTNQEI----MQWLDDQPPSSVVFLCFGSNGSFSKDQVKEIACALEDSGHRFLWSLA 286
Query: 305 ------VVRNPSNAAEAE--LPEGFLERTKERGLVVKSWAPQSTILGHESVGGFVTHCGW 356
+ +PS+ + + LPEGFLERT V+ WAPQ +L H + GG V+H GW
Sbjct: 287 DHRAPGFLESPSDYEDLQEVLPEGFLERTSGIEKVI-GWAPQVAVLAHPATGGLVSHSGW 345
Query: 357 SSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAMPMFLNGEEETIGNGEGVVSAERV 416
+S++E++ +GVP+ WP+YAEQ N+ +V E+ +A+ + ++ ++ GE +V +++
Sbjct: 346 NSILESIWFGVPVATWPMYAEQQFNAFQMVIELGLAVEIKMDYRNDS---GE-IVKCDQI 401
Query: 417 EERVRELMMGSEGKALRERSLEMRMMAATAWNNNDGGSSFTAFSNL 462
E +R LM + + + + + A +GGSS+ NL
Sbjct: 402 ERGIRCLMKHDSDRRKKVKEMSEKSRGALM----EGGSSYCWLDNL 443
>sp|Q9FE68|U71C5_ARATH UDP-glycosyltransferase 71C5 OS=Arabidopsis thaliana GN=UGT71C5
PE=2 SV=1
Length = 480
Score = 204 bits (518), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 155/486 (31%), Positives = 234/486 (48%), Gaps = 56/486 (11%)
Query: 9 PGPAFHHMISMVELGKLILQHRSDVS-ITIL---VPSMPLEESKTCSYINSISHRLNPII 64
P P H++S +E GK +L +S ITIL +P P ++ S S P I
Sbjct: 10 PLPETGHLLSTIEFGKRLLNLDRRISMITILSMNLPYAPHADASLASLTAS-----EPGI 64
Query: 65 SFYYLPAIQMPS-----ETLSRADIAIESIKLNSSNVFQALENI------------SLTS 107
LP I P +T S I ++ I N + + ++++ +
Sbjct: 65 RIISLPEIHDPPPIKLLDTSSETYI-LDFIHKNIPCLRKTIQDLVSSSSSSGGGSSHVAG 123
Query: 108 KILSFIITSTTSFSYHPNIPTYTYFNSCASTLAAILYLPTLHNQITSSFKDHPSSL-LFI 166
IL F N+P+Y + S L + YLP S F + L I
Sbjct: 124 LILDFFCVGLIDIGREVNLPSYIFMTSNFGFLGVLQYLPERQRLTPSEFDESSGEEELHI 183
Query: 167 PG-LPPVKSSFMPEPVLDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAIVNGD 225
P + V + +P V D+ Y + L ++ GI++N+F +E A + G
Sbjct: 184 PAFVNRVPAKVLPPGVFDKLS--YGSLVKIGERLHEAKGILVNSFTQVEPYAAEHFSQGR 241
Query: 226 CVTNGTTPPLHCIGPLIVDAKDRAGGVSDDVSSDCLTWLDSQPSGSVVFLCFGSRGTFSA 285
P ++ +GP++ G++ + + WLD QP SV+FLCFGS G F A
Sbjct: 242 -----DYPHVYPVGPVLNLTGRTNPGLASAQYKEMMKWLDEQPDSSVLFLCFGSMGVFPA 296
Query: 286 PQLKEIAIGLERSNQRFLWVVRNPSNAA-----EAELPEGFLERTKERGLVVKSWAPQST 340
PQ+ EIA LE RF+W +R +N A + LPEGF++RT RG+V SWAPQ
Sbjct: 297 PQITEIAHALELIGCRFIWAIR--TNMAGDGDPQEPLPEGFVDRTMGRGIVC-SWAPQVD 353
Query: 341 ILGHESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAMPMFLNGE 400
IL H++ GGFV+HCGW+SV E++ YGVP+ WP+YAEQ LN+ +V+E+ +A+ + L+
Sbjct: 354 ILAHKATGGFVSHCGWNSVQESLWYGVPIATWPMYAEQQLNAFEMVKELGLAVEIRLD-- 411
Query: 401 EETIGNGEGV----VSAERVEERVRELMMGSEGKALRERSLEMRMMAATAWNNNDGGSSF 456
+ +G+ V VSA+ + VR LM +R++ +E +A A DGGSS
Sbjct: 412 --YVADGDRVTLEIVSADEIATAVRSLM--DSDNPVRKKVIEKSSVARKA--VGDGGSST 465
Query: 457 TAFSNL 462
A N
Sbjct: 466 VATCNF 471
>sp|O23205|U72C1_ARATH UDP-glycosyltransferase 72C1 OS=Arabidopsis thaliana GN=UGT72C1
PE=2 SV=3
Length = 457
Score = 201 bits (510), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 144/469 (30%), Positives = 229/469 (48%), Gaps = 53/469 (11%)
Query: 6 ALYPGPAFHHMISMVELGKLILQHRSDVSITILVPSMPLEESKTCSYINSISHRLNPIIS 65
AL P H + ++ELGK +L H +T+ + + + SK S I +P
Sbjct: 6 ALVASPGMGHAVPILELGKHLLNHHGFDRVTVFLVTDDVSRSK--SLIGKTLMEEDPKFV 63
Query: 66 FYYLPAIQMPSETLSRADIA--IESIKLNSSNVFQALENISLTSKILSFIITSTTSFSYH 123
++P + + + LS + + E ++ + ++ + ++ + T +
Sbjct: 64 IRFIP-LDVSGQDLSGSLLTKLAEMMRKALPEIKSSVMELEPRPRVFVVDLLGTEALEVA 122
Query: 124 PNI---PTYTYFNSCASTLAAILYLPTLHNQITSSFKDHPS-SLLFIPGLPPVKSSFMPE 179
+ + + A LA +Y+ +L Q +K S L IPG PVK E
Sbjct: 123 KELGIMRKHVLVTTSAWFLAFTVYMASLDKQ--ELYKQLSSIGALLIPGCSPVKF----E 176
Query: 180 PVLDRQKPIYDFF--LNYSTSLSKSNGIIINTFDFLEQQAIKAIVNGDC---VTNGTTPP 234
D +K I + + ++G+ +NT+ LEQ I + ++ + V G P
Sbjct: 177 RAQDPRKYIRELAESQRIGDEVITADGVFVNTWHSLEQVTIGSFLDPENLGRVMRGV--P 234
Query: 235 LHCIGPLIVDAKDRAGGVSDDVSSDCLTWLDSQPSGSVVFLCFGSRGTFSAPQLKEIAIG 294
++ +GPL+ A+ + L WLD QP SVV++ FGS G + Q E+A G
Sbjct: 235 VYPVGPLVRPAEP-------GLKHGVLDWLDLQPKESVVYVSFGSGGALTFEQTNELAYG 287
Query: 295 LERSNQRFLWVVRNPSN-----------AAEAE----LPEGFLERTKERGLVVKSWAPQS 339
LE + RF+WVVR P+ E E LP GFL+RTK+ GLVV++WAPQ
Sbjct: 288 LELTGHRFVWVVRPPAEDDPSASMFDKTKNETEPLDFLPNGFLDRTKDIGLVVRTWAPQE 347
Query: 340 TILGHESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAMPMFLNG 399
IL H+S GGFVTHCGW+SV+E++ GVPM+AWPLY+EQ +N+ + E+K+A+
Sbjct: 348 EILAHKSTGGFVTHCGWNSVLESIVNGVPMVAWPLYSEQKMNARMVSGELKIAL------ 401
Query: 400 EEETIGNGEGVVSAERVEERVRELMMGSEGKALRERSLEMRMMAATAWN 448
I +G+V E + E V+ +M EGK +R+ E++ A A N
Sbjct: 402 ---QINVADGIVKKEVIAEMVKRVMDEEEGKEMRKNVKELKKTAEEALN 447
>sp|O82382|U71C2_ARATH UDP-glycosyltransferase 71C2 OS=Arabidopsis thaliana GN=UGT71C2
PE=1 SV=1
Length = 474
Score = 200 bits (509), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 152/480 (31%), Positives = 239/480 (49%), Gaps = 44/480 (9%)
Query: 9 PGPAFHHMISMVELGKLILQHRSDV--SITILVPSMP-LEESKTCSYINSISHRLNPIIS 65
P P H+++ +EL K ++ H+ +ITIL S+P L +S T +++ S+ I
Sbjct: 13 PFPIPGHILATIELAKRLISHQPSRIHTITILHWSLPFLPQSDTIAFLKSLIET-ESRIR 71
Query: 66 FYYLPAIQMPS--ETLSRADIA--IESIKLNSSNVFQAL----------ENISLTSKILS 111
LP +Q P E +A + +E +K V AL +++ + +L
Sbjct: 72 LITLPDVQNPPPMELFVKASESYILEYVKKMVPLVRNALSTLLSSRDESDSVHVAGLVLD 131
Query: 112 FIITSTTSFSYHPNIPTYTYFNSCASTLAAILYLPTLHNQITSSF-KDHPSSLLFIPGLP 170
F N+P+Y + AS L + YL + + + + +PG
Sbjct: 132 FFCVPLIDVGNEFNLPSYIFLTCSASFLGMMKYLLERNRETKPELNRSSDEETISVPGFV 191
Query: 171 ---PVKSSFMPEPVLDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAIVNGDCV 227
PVK +P + + Y+ ++ + ++ GI++N+F+ LE+ A
Sbjct: 192 NSVPVK--VLPPGLFTTES--YEAWVEMAERFPEAKGILVNSFESLERNAFDYFDR---- 243
Query: 228 TNGTTPPLHCIGPLIVDAKDRAGGVSDDVSSDCLTWLDSQPSGSVVFLCFGSRGTFSAPQ 287
PP++ IGP++ + DR + L WLD QP SVVFLCFGS + +A Q
Sbjct: 244 RPDNYPPVYPIGPILC-SNDRPN-LDLSERDRILKWLDDQPESSVVFLCFGSLKSLAASQ 301
Query: 288 LKEIAIGLERSNQRFLWVVR-NPSNAAEAE--LPEGFLERTKERGLVVKSWAPQSTILGH 344
+KEIA LE RFLW +R +P A LP+GF+ R GLV WAPQ IL H
Sbjct: 302 IKEIAQALELVGIRFLWSIRTDPKEYASPNEILPDGFMNRVMGLGLVC-GWAPQVEILAH 360
Query: 345 ESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAMPMFLNGEEETI 404
+++GGFV+HCGW+S++E++ +GVP+ WP+YAEQ LN+ +V+E+ +A+ M L+ E
Sbjct: 361 KAIGGFVSHCGWNSILESLRFGVPIATWPMYAEQQLNAFTIVKELGLALEMRLDYVSEY- 419
Query: 405 GNGEGVVSAERVEERVRELMMGSEGKALRERSLEMRMMAATAWNNNDGGSSFTAFSNLFD 464
GE +V A+ + VR LM +G+ + R L+ + A DGGSSF A D
Sbjct: 420 --GE-IVKADEIAGAVRSLM---DGEDVPRRKLK-EIAEAGKEAVMDGGSSFVAVKRFID 472
>sp|Q40288|UFOG6_MANES Anthocyanidin 3-O-glucosyltransferase 6 (Fragment) OS=Manihot
esculenta GN=GT6 PE=2 SV=1
Length = 394
Score = 196 bits (499), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 122/374 (32%), Positives = 198/374 (52%), Gaps = 37/374 (9%)
Query: 99 ALENISLTSKILSFIITSTTSFSYHPNIPTYTYFNSCASTLAAILYLPTLHNQITSS--- 155
A + SL +L TS + +P Y +F S A+ L + Y+ +H++ +
Sbjct: 25 ARSDSSLAGFVLDMFCTSMIDVAKELGVPYYIFFTSGAAFLGFLFYVQLIHDEQDADLTQ 84
Query: 156 FKDHPSSLLFIPGLP-PVKSSFMPEPVLDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLE 214
FKD + L +P L + + +P +L + + + F+ L ++ GI++NTF LE
Sbjct: 85 FKDSDAELS-VPSLANSLPARVLPASMLVKDR--FYAFIRIIRGLREAKGIMVNTFMELE 141
Query: 215 QQAIKAIVNGDCVTNGTTPPLHCIGPLIVDAKDRAGGVSDDVS---SDCLTWLDSQPSGS 271
A+ ++ + PP++ +GP++ + +DV S+ + WLD QP S
Sbjct: 142 SHALNSLKD----DQSKIPPIYPVGPIL-----KLSNQENDVGPEGSEIIEWLDDQPPSS 192
Query: 272 VVFLCFGSRGTFSAPQLKEIAIGLERSNQRFLWVVRNPSNAAEAE-----------LPEG 320
VVFLCFGS G F Q KEIA LE+S RFLW +R P + E LP G
Sbjct: 193 VVFLCFGSMGGFDMDQAKEIACALEQSRHRFLWSLRRPPPKGKIETSTDYENLQEILPVG 252
Query: 321 FLERTKERGLVVKSWAPQSTILGHESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFL 380
F ERT G VV WAPQ IL H ++GGFV+HCGW+S++E++ + VP+ WPLYAEQ
Sbjct: 253 FSERTAGMGKVV-GWAPQVAILEHPAIGGFVSHCGWNSILESIWFSVPIATWPLYAEQQF 311
Query: 381 NSVALVQEMKVAMPMFLNGEEETIGNGEGVVSAERVEERVRELMMGSEGKALRERSLEMR 440
N+ +V E+ +A+ + ++ ++E+ E ++SA+ +E ++ +M +R+R EM
Sbjct: 312 NAFTMVTELGLAVEIKMDYKKES----EIILSADDIERGIKCVM--EHHSEIRKRVKEMS 365
Query: 441 MMAATAWNNNDGGS 454
+ A +++ S
Sbjct: 366 DKSRKALMDDESSS 379
>sp|Q9ZQG4|U73B5_ARATH UDP-glycosyltransferase 73B5 OS=Arabidopsis thaliana GN=UGT73B5
PE=2 SV=1
Length = 484
Score = 192 bits (487), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 149/487 (30%), Positives = 228/487 (46%), Gaps = 46/487 (9%)
Query: 5 IALYPGPAFHHMISMVELGKLILQHRSDVSITILVPSMPLEESKTCSYINSISHRLNPII 64
I +P A HMI ++++ KL R S + P K + + L I
Sbjct: 11 ILFFPFMAQGHMIPILDMAKL-FSRRGAKSTLLTTPINAKIFEKPIEAFKNQNPDLEIGI 69
Query: 65 SFYYLPAIQMP-SETLSRADIAIESIKLNSSNVFQAL--ENISLTSKILSFIITSTTSFS 121
+ P +++ E AD K +S ++F + ++ SFI T+
Sbjct: 70 KIFNFPCVELGLPEGCENADFINSYQKSDSGDLFLKFLFSTKYMKQQLESFIETT----- 124
Query: 122 YHPNIPTYTYFNSCASTLAAILYLPTLHNQITSSF-----------KDHP-----SSLLF 165
P+ F A+ A L +P L TS F K H S+
Sbjct: 125 -KPSALVADMFFPWATESAEKLGVPRLVFHGTSFFSLCCSYNMRIHKPHKKVATSSTPFV 183
Query: 166 IPGLP-PVKSSFMPEPVLDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAIVNG 224
IPGLP + + V + P+ F S + S G+++N+F LE
Sbjct: 184 IPGLPGDIVITEDQANVAKEETPMGKFMKEVRESETNSFGVLVNSFYELES------AYA 237
Query: 225 DCVTNGTTPPLHCIGPLIVDAKD-----RAGGVSDDVSSDCLTWLDSQPSGSVVFLCFGS 279
D + IGPL + ++ R G ++ +CL WLDS+ GSVV+L FGS
Sbjct: 238 DFYRSFVAKRAWHIGPLSLSNRELGEKARRGKKANIDEQECLKWLDSKTPGSVVYLSFGS 297
Query: 280 RGTFSAPQLKEIAIGLERSNQRFLWVVRNPSNAAEAE--LPEGFLERTKERGLVVKSWAP 337
F+ QL EIA GLE S Q F+WVVR N + E LPEGF ERT +GL++ WAP
Sbjct: 298 GTNFTNDQLLEIAFGLEGSGQSFIWVVRKNENQGDNEEWLPEGFKERTTGKGLIIPGWAP 357
Query: 338 QSTILGHESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAMPMFL 397
Q IL H+++GGFVTHCGW+S +E + G+PM+ WP+ AEQF N L + +++ + +
Sbjct: 358 QVLILDHKAIGGFVTHCGWNSAIEGIAAGLPMVTWPMGAEQFYNEKLLTKVLRIGVNV-- 415
Query: 398 NGEEETIGNGEGVVSAERVEERVRELMMGSEGKALRERSLEMRMMAATAWNNNDGGSSFT 457
G E + G+ ++S +VE+ VRE++ G + + R + ++ MA A +GGSS+
Sbjct: 416 -GATELVKKGK-LISRAQVEKAVREVIGGEKAEERRLWAKKLGEMAKAA--VEEGGSSYN 471
Query: 458 AFSNLFD 464
+ +
Sbjct: 472 DVNKFME 478
>sp|Q9SY84|U90A2_ARATH UDP-glycosyltransferase 90A2 OS=Arabidopsis thaliana GN=UGT90A2
PE=2 SV=1
Length = 467
Score = 188 bits (477), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 150/476 (31%), Positives = 229/476 (48%), Gaps = 52/476 (10%)
Query: 2 KKTIALYPGPAFHHMISMVELGKLILQHRSDVSITILVPSMPLEESKTCSYINSISHRLN 61
K + L+P + HMI M++L +L+L H I++ V + PL ++S+S
Sbjct: 5 KVHVVLFPYLSKGHMIPMLQLARLLLSHSFAGDISVTVFTTPLNRP---FIVDSLSGTKA 61
Query: 62 PIISFYY---LPAIQMPSETLSRADIAIESIKL---NSSNVFQA-LENISLTSKILSFII 114
I+ + +P I E + S+ + ++ QA E ++ +SF++
Sbjct: 62 TIVDVPFPDNVPEIPPGVECTDKLPALSSSLFVPFTRATKSMQADFERELMSLPRVSFMV 121
Query: 115 TS-----TTSFSYHPNIPTYTYFN-SCASTLAAILYLPTLHNQITSSFKDH--PSSLLFI 166
+ T + P +F +CAST ++ NQ+ S+ K P S+
Sbjct: 122 SDGFLWWTQESARKLGFPRLVFFGMNCAST---VICDSVFQNQLLSNVKSETEPVSVPEF 178
Query: 167 PGLPP-----VKSSFMPEPVLDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAI 221
P + VK F P+ D P + L+ TS+++S GII NTFD LE I
Sbjct: 179 PWIKVRKCDFVKDMFDPKTTTD---PGFKLILDQVTSMNQSQGIIFNTFDDLEPVFI--- 232
Query: 222 VNGDCVTNGTTPPLHCIGPLIVDAKDRAGGVSDDVSSDCLTWLDSQPSG--SVVFLCFGS 279
D L +GPL V + V + WLD + +V+++ FGS
Sbjct: 233 ---DFYKRKRKLKLWAVGPLCYVNNFLDDEVEEKVKPSWMKWLDEKRDKGCNVLYVAFGS 289
Query: 280 RGTFSAPQLKEIAIGLERSNQRFLWVVRNPSNAAEAELPEGFLERTKERGLVVKS-WAPQ 338
+ S QL+EIA+GLE S FLWVV+ E+ +GF ER ERG++V+ W Q
Sbjct: 290 QAEISREQLEEIALGLEESKVNFLWVVKG------NEIGKGFEERVGERGMMVRDEWVDQ 343
Query: 339 STILGHESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAMPMFLN 398
IL HESV GF++HCGW+S+ E++ VP++A+PL AEQ LN++ +V+E++VA
Sbjct: 344 RKILEHESVRGFLSHCGWNSLTESICSEVPILAFPLAAEQPLNAILVVEELRVA------ 397
Query: 399 GEEETIGNGEGVVSAERVEERVRELMMGSEGKALRERSLEMRMMAATAWNNNDGGS 454
E + EGVV E + E+V+ELM G +GK LR MA A G S
Sbjct: 398 --ERVVAASEGVVRREEIAEKVKELMEGEKGKELRRNVEAYGKMAKKALEEGIGSS 451
>sp|Q9ZQ98|U73C2_ARATH UDP-glycosyltransferase 73C2 OS=Arabidopsis thaliana GN=UGT73C2
PE=3 SV=1
Length = 496
Score = 185 bits (470), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 143/485 (29%), Positives = 228/485 (47%), Gaps = 56/485 (11%)
Query: 7 LYPGPAFHHMISMVELGKLILQHRSDVSITILVPSMPLEESKTCSYIN-SISHRLNPIIS 65
L+P A HMI MV++ +++ Q +TI + + P ++ +N +I L+ +
Sbjct: 17 LFPFMAQGHMIPMVDIARILAQR----GVTITIVTTPHNAARFKDVLNRAIQSGLHIRVE 72
Query: 66 FYYLP----AIQMPSETLSRADIA------IESIKLNSSNVFQALENISLTSKIL--SFI 113
P +Q E + D +++ + + V + +E + L F
Sbjct: 73 HVKFPFQEAGLQEGQENVDFLDSMELMVHFFKAVNMLENPVMKLMEEMKPKPSCLISDFC 132
Query: 114 ITSTTSFSYHPNIPTYTYFNSCASTLAAILYLPTLHNQITSSFKDHPSSLLFIPGLPP-- 171
+ T+ + NIP + L ++ L HN I + K L +P P
Sbjct: 133 LPYTSKIAKRFNIPKIVFHGVSCFCLLSMHILHRNHN-ILHALKSDKEYFL-VPSFPDRV 190
Query: 172 --------VKSSFMPE--PVLDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAI 221
VK++F + ++D Q D S G+I+NTF LE +K
Sbjct: 191 EFTKLQVTVKTNFSGDWKEIMDEQVDADD----------TSYGVIVNTFQDLESAYVKNY 240
Query: 222 VNGDCVTNGTTPPLHCIGPLIVDAKDRAGGVSDDVSSDCLTWLDSQPSGSVVFLCFGSRG 281
+ P+ + D +R + D +C+ WLDS+ SV+++C GS
Sbjct: 241 TEARAGKVWSIGPVSLCNKVGEDKAERGNKAAID-QDECIKWLDSKDVESVLYVCLGSIC 299
Query: 282 TFSAPQLKEIAIGLERSNQRFLWVVRNPSN---AAEAELPEGFLERTKERGLVVKSWAPQ 338
QL+E+ +GLE + + F+WV+R AE L GF ERTKER L++K W+PQ
Sbjct: 300 NLPLAQLRELGLGLEATKRPFIWVIRGGGKYHELAEWILESGFEERTKERSLLIKGWSPQ 359
Query: 339 STILGHESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAMPMFLN 398
IL H +VGGF+THCGW+S +E +T GVP+I WPL+ +QF N +VQ +K + + +
Sbjct: 360 MLILSHPAVGGFLTHCGWNSTLEGITSGVPLITWPLFGDQFCNQKLIVQVLKAGVSVGVE 419
Query: 399 -----GEEETIGNGEGVVSAERVEERVRELMMGS-EGKALRERSLEMRMMAATAWNNNDG 452
GEEE+IG +V E V++ V E+M S E K R+R E+ +A A +G
Sbjct: 420 EVMKWGEEESIGV---LVDKEGVKKAVDEIMGESDEAKERRKRVRELGELAHKA--VEEG 474
Query: 453 GSSFT 457
GSS +
Sbjct: 475 GSSHS 479
>sp|Q9SCP5|U73C7_ARATH UDP-glycosyltransferase 73C7 OS=Arabidopsis thaliana GN=UGT73C7
PE=2 SV=1
Length = 490
Score = 185 bits (469), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 150/476 (31%), Positives = 229/476 (48%), Gaps = 46/476 (9%)
Query: 9 PGPAFHHMISMVELGKLILQHRSDVSITILVPSMPLEESKTCSYINSISHRLNPIISFYY 68
P A HMI +V++ +L L R V++ I+ + + + KT +S+ +N + +
Sbjct: 13 PFMAQGHMIPLVDISRL-LSQRQGVTVCIITTTQNVAKIKTSLSFSSLFATINIVEVKFL 71
Query: 69 LPAIQMPS--ETL----SRADI-----AIESIKLNSSNVFQALENISLTSKILSFIITST 117
+P E+L S D+ A S++ + + + I + T
Sbjct: 72 SQQTGLPEGCESLDMLASMGDMVKFFDAANSLEEQVEKAMEEMVQPRPSCIIGDMSLPFT 131
Query: 118 TSFSYHPNIPTYTYFN-SCASTLAAILYLPTLHNQITSSFKDHPSSLLFIPGLPPVKSSF 176
+ + IP + SC S ++ + + ++ S ++ +PGLP K F
Sbjct: 132 SRLAKKFKIPKLIFHGFSCFSLMSIQVVRESGILKMIESNDEY----FDLPGLPD-KVEF 186
Query: 177 MPEPVLDRQKPIYDFFLNYSTSLSK-------SNGIIINTFDFLEQQAIKAIVNGDCVTN 229
+P + +P+ N S +K S G+I+NTF+ LE +
Sbjct: 187 -TKPQVSVLQPVEG---NMKESTAKIIEADNDSYGVIVNTFEELEVDYAREYRKARAGKV 242
Query: 230 GTTPPLHCIGPLIVDAKDRAGGVSDDVSSD-CLTWLDSQPSGSVVFLCFGSRGTFSAPQL 288
P+ L +D R S + D CL WLDSQ +GSV+++C GS QL
Sbjct: 243 WCVGPVSLCNRLGLDKAKRGDKAS--IGQDQCLQWLDSQETGSVLYVCLGSLCNLPLAQL 300
Query: 289 KEIAIGLERSNQRFLWVVR---NPSNAAEAELPEGFLERTKERGLVVKSWAPQSTILGHE 345
KE+ +GLE SN+ F+WV+R + A GF ER K+RGLV+K WAPQ IL H
Sbjct: 301 KELGLGLEASNKPFIWVIREWGKYGDLANWMQQSGFEERIKDRGLVIKGWAPQVFILSHA 360
Query: 346 SVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAMPMFLN-----GE 400
S+GGF+THCGW+S +E +T GVP++ WPL+AEQFLN +VQ +K + + + G+
Sbjct: 361 SIGGFLTHCGWNSTLEGITAGVPLLTWPLFAEQFLNEKLVVQILKAGLKIGVEKLMKYGK 420
Query: 401 EETIGNGEGVVSAERVEERVRELMMGSEGKALRERSL-EMRMMAATAWNNNDGGSS 455
EE IG +VS E V + V ELM SE R R + E+ +A A GGSS
Sbjct: 421 EEEIG---AMVSRECVRKAVDELMGDSEEAEERRRKVTELSDLANKALEK--GGSS 471
>sp|Q9ZQ94|U73C5_ARATH UDP-glycosyltransferase 73C5 OS=Arabidopsis thaliana GN=UGT73C5
PE=2 SV=1
Length = 495
Score = 184 bits (466), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 153/494 (30%), Positives = 237/494 (47%), Gaps = 61/494 (12%)
Query: 7 LYPGPAFHHMISMVELGKLILQHRSDVSITILVPSMPLEESKTCSYINSISHRLNPI--- 63
L+P A HMI MV++ +L+ Q + I + + P ++ + +N PI
Sbjct: 15 LFPFMAQGHMIPMVDIARLLAQR----GVIITIVTTPHNAARFKNVLNRAIESGLPINLV 70
Query: 64 -ISFYYLPA-IQMPSETLSRADIA------IESIKLNSSNVFQALENISLTSKIL--SFI 113
+ F YL A +Q E + D +++ V + +E ++ L F
Sbjct: 71 QVKFPYLEAGLQEGQENIDSLDTMERMIPFFKAVNFLEEPVQKLIEEMNPRPSCLISDFC 130
Query: 114 ITSTTSFSYHPNIPTYTYFNSCASTLAAILYLPTLHNQITSSFK---------DHPSSLL 164
+ T+ + NIP F+ +++ + +I + K D P +
Sbjct: 131 LPYTSKIAKKFNIPK-ILFHGMGCFCLLCMHVLRKNREILDNLKSDKELFTVPDFPDRVE 189
Query: 165 FIPGLPPVKSSFMPEPVLDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLE---QQAIKAI 221
F PV++ P D K I+D + + + S G+I+N+F LE + K +
Sbjct: 190 FTRTQVPVETYV---PAGD-WKDIFDGMVEANET---SYGVIVNSFQELEPAYAKDYKEV 242
Query: 222 VNGDCVTNGTTPPLHCIGPLIVDAKDRAGGVSDDVSSDCLTWLDSQPSGSVVFLCFGSRG 281
+G T G P+ + D +R G SD +CL WLDS+ GSV+++C GS
Sbjct: 243 RSGKAWTIG---PVSLCNKVGADKAER-GNKSDIDQDECLKWLDSKKHGSVLYVCLGSIC 298
Query: 282 TFSAPQLKEIAIGLERSNQRFLWVVRNPSNAAEAELPE-----GFLERTKERGLVVKSWA 336
QLKE+ +GLE S + F+WV+R E L E GF +R ++RGL++K W+
Sbjct: 299 NLPLSQLKELGLGLEESQRPFIWVIRGWEKYKE--LVEWFSESGFEDRIQDRGLLIKGWS 356
Query: 337 PQSTILGHESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMK------ 390
PQ IL H SVGGF+THCGW+S +E +T G+P++ WPL+A+QF N +V+ +K
Sbjct: 357 PQMLILSHPSVGGFLTHCGWNSTLEGITAGLPLLTWPLFADQFCNEKLVVEVLKAGVRSG 416
Query: 391 VAMPMFLNGEEETIGNGEGVVSAERVEERVRELMMGS-EGKALRERSLEMRMMAATAWNN 449
V PM GEEE IG +V E V++ V ELM S + K R R+ E+ A A
Sbjct: 417 VEQPMKW-GEEEKIGV---LVDKEGVKKAVEELMGESDDAKERRRRAKELGDSAHKA--V 470
Query: 450 NDGGSSFTAFSNLF 463
+GGSS + S L
Sbjct: 471 EEGGSSHSNISFLL 484
>sp|Q9ZQ96|U73C3_ARATH UDP-glycosyltransferase 73C3 OS=Arabidopsis thaliana GN=UGT73C3
PE=2 SV=1
Length = 496
Score = 183 bits (465), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 148/492 (30%), Positives = 235/492 (47%), Gaps = 47/492 (9%)
Query: 7 LYPGPAFHHMISMVELGKLILQHRSDVSITILVPSMPLEESKTCSYIN---------SIS 57
L+P A HMI M+++ +L+ Q +TI + + P ++ + +N +I
Sbjct: 17 LFPFMAQGHMIPMIDIARLLAQR----GVTITIVTTPHNAARFKNVLNRAIESGLAINIL 72
Query: 58 HRLNPIISFYYLPAIQMPSETLSRADIAI---ESIKLNSSNVFQALENISLTSKIL--SF 112
H P F LP + ++L ++ + +++ L V + +E + L +
Sbjct: 73 HVKFPYQEFG-LPEGKENIDSLDSTELMVPFFKAVNLLEDPVMKLMEEMKPRPSCLISDW 131
Query: 113 IITSTTSFSYHPNIPTYTYFN-SCASTLAAILYLPTLHNQITSSFKDHPSSLLFIPGLPP 171
+ T+ + + NIP + C + L +++ + +I + K L +P P
Sbjct: 132 CLPYTSIIAKNFNIPKIVFHGMGCFNLLC--MHVLRRNLEILENVKSDEEYFL-VPSFPD 188
Query: 172 -VKSSFMPEPVLDRQ----KPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAIVNGDC 226
V+ + + PV K I D + + S G+I+NTF LE +K
Sbjct: 189 RVEFTKLQLPVKANASGDWKEIMDEMVKAEYT---SYGVIVNTFQELEPPYVKDYKEAMD 245
Query: 227 VTNGTTPPLHCIGPLIVDAKDRAGGVSDDVSSDCLTWLDSQPSGSVVFLCFGSRGTFSAP 286
+ P+ D +R + D +CL WLDS+ GSV+++C GS
Sbjct: 246 GKVWSIGPVSLCNKAGADKAERGSKAAID-QDECLQWLDSKEEGSVLYVCLGSICNLPLS 304
Query: 287 QLKEIAIGLERSNQRFLWVVRNPSNAAEA---ELPEGFLERTKERGLVVKSWAPQSTILG 343
QLKE+ +GLE S + F+WV+R E L GF ER KERGL++K WAPQ IL
Sbjct: 305 QLKELGLGLEESRRSFIWVIRGSEKYKELFEWMLESGFEERIKERGLLIKGWAPQVLILS 364
Query: 344 HESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAMPMFLN----- 398
H SVGGF+THCGW+S +E +T G+P+I WPL+ +QF N +VQ +K + +
Sbjct: 365 HPSVGGFLTHCGWNSTLEGITSGIPLITWPLFGDQFCNQKLVVQVLKAGVSAGVEEVMKW 424
Query: 399 GEEETIGNGEGVVSAERVEERVRELMMGS-EGKALRERSLEMRMMAATAWNNNDGGSSFT 457
GEE+ IG +V E V++ V ELM S + K R R E+ +A A GGSS +
Sbjct: 425 GEEDKIGV---LVDKEGVKKAVEELMGDSDDAKERRRRVKELGELAHKA--VEKGGSSHS 479
Query: 458 AFSNLF-DLWQI 468
+ L D+ Q+
Sbjct: 480 NITLLLQDIMQL 491
>sp|Q7Y232|U73B4_ARATH UDP-glycosyltransferase 73B4 OS=Arabidopsis thaliana GN=UGT73B4
PE=2 SV=1
Length = 484
Score = 182 bits (462), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 148/491 (30%), Positives = 224/491 (45%), Gaps = 51/491 (10%)
Query: 5 IALYPGPAFHHMISMVELGKLILQHRSDVSITILVPSMPLEESKTCSYINSISHRLNPII 64
I +P A HMI ++++ KL + R S + P K + L I
Sbjct: 8 ILFFPFMAHGHMIPLLDMAKLFAR-RGAKSTLLTTPINAKILEKPIEAFKVQNPDLEIGI 66
Query: 65 SFYYLPAIQMP-SETLSRADIAIESIKLNSSNVFQAL--ENISLTSKILSFIITSTTSFS 121
P +++ E D K +S ++F + ++ SFI T+
Sbjct: 67 KILNFPCVELGLPEGCENRDFINSYQKSDSFDLFLKFLFSTKYMKQQLESFIETT----- 121
Query: 122 YHPNIPTYTYFNSCASTLAAILYLPTLHNQITSSF-----------KDHP-----SSLLF 165
P+ F A+ A + +P L TSSF K H S+
Sbjct: 122 -KPSALVADMFFPWATESAEKIGVPRLVFHGTSSFALCCSYNMRIHKPHKKVASSSTPFV 180
Query: 166 IPGLP-PVKSSFMPEPVLDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAIVNG 224
IPGLP + + V + + P F+ S + S G+++N+F LE
Sbjct: 181 IPGLPGDIVITEDQANVTNEETPFGKFWKEVRESETSSFGVLVNSFYELESSY------A 234
Query: 225 DCVTNGTTPPLHCIGPL------IVDAKDRAGGVSDDVSSDCLTWLDSQPSGSVVFLCFG 278
D + IGPL I + R + D +CL WLDS+ GSVV+L FG
Sbjct: 235 DFYRSFVAKKAWHIGPLSLSNRGIAEKAGRGKKANID-EQECLKWLDSKTPGSVVYLSFG 293
Query: 279 SRGTFSAPQLKEIAIGLERSNQRFLWVVRNPSNAA-----EAELPEGFLERTKERGLVVK 333
S QL EIA GLE S Q F+WVV N E LP+GF ER K +GL+++
Sbjct: 294 SGTGLPNEQLLEIAFGLEGSGQNFIWVVSKNENQVGTGENEDWLPKGFEERNKGKGLIIR 353
Query: 334 SWAPQSTILGHESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAM 393
WAPQ IL H+++GGFVTHCGW+S +E + G+PM+ WP+ AEQF N L + +++ +
Sbjct: 354 GWAPQVLILDHKAIGGFVTHCGWNSTLEGIAAGLPMVTWPMGAEQFYNEKLLTKVLRIGV 413
Query: 394 PMFLNGEEETIGNGEGVVSAERVEERVRELMMGSEGKALRERSLEMRMMAATAWNNNDGG 453
+ G E + G+ ++S +VE+ VRE++ G + + R R+ E+ MA A +GG
Sbjct: 414 NV---GATELVKKGK-LISRAQVEKAVREVIGGEKAEERRLRAKELGEMAKAA--VEEGG 467
Query: 454 SSFTAFSNLFD 464
SS+ + +
Sbjct: 468 SSYNDVNKFME 478
>sp|Q9ZQ97|U73C4_ARATH UDP-glycosyltransferase 73C4 OS=Arabidopsis thaliana GN=UGT73C4
PE=2 SV=1
Length = 496
Score = 182 bits (462), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 142/478 (29%), Positives = 227/478 (47%), Gaps = 30/478 (6%)
Query: 7 LYPGPAFHHMISMVELGKLILQHRSDVSITILVPSMPLEESKTCSYINS-----ISHRLN 61
L+P A HMI M+++ +L+ Q + V+I + E+ + S I H
Sbjct: 17 LFPFMAQGHMIPMIDIARLLAQRGATVTIVTTRYNAGRFENVLSRAMESGLPINIVHVNF 76
Query: 62 PIISFYYLPAIQMPSETLSRADIAI---ESIKLNSSNVFQALENIS-LTSKILS-FIITS 116
P F LP + ++ ++ + +++ + V + +E + S I+S ++
Sbjct: 77 PYQEFG-LPEGKENIDSYDSMELMVPFFQAVNMLEDPVMKLMEEMKPRPSCIISDLLLPY 135
Query: 117 TTSFSYHPNIPTYTYFNSCASTLAAILYLPTLHNQITSSFKDHPSSLLFIPGLPPVKSSF 176
T+ + +IP + + L +++ + +I + K L +P P
Sbjct: 136 TSKIARKFSIPKIVFHGTGCFNLLC-MHVLRRNLEILKNLKSDKDYFL-VPSFPDRVEFT 193
Query: 177 MPE-PVLDRQKPIYDFFLNYSTSLS-KSNGIIINTFDFLEQQAIKAIVNGDCVTNGTTPP 234
P+ PV + FL+ S G+I+NTF LE +K + P
Sbjct: 194 KPQVPVETTASGDWKAFLDEMVEAEYTSYGVIVNTFQELEPAYVKDYTKARAGKVWSIGP 253
Query: 235 LHCIGPLIVDAKDRAGGVSDDVSSDCLTWLDSQPSGSVVFLCFGSRGTFSAPQLKEIAIG 294
+ D +R + D +CL WLDS+ GSV+++C GS QLKE+ +G
Sbjct: 254 VSLCNKAGADKAERGNQAAID-QDECLQWLDSKEDGSVLYVCLGSICNLPLSQLKELGLG 312
Query: 295 LERSNQRFLWVVR---NPSNAAEAELPEGFLERTKERGLVVKSWAPQSTILGHESVGGFV 351
LE+S + F+WV+R + E + GF ER KERGL++K W+PQ IL H SVGGF+
Sbjct: 313 LEKSQRSFIWVIRGWEKYNELYEWMMESGFEERIKERGLLIKGWSPQVLILSHPSVGGFL 372
Query: 352 THCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAMPMFLN-----GEEETIGN 406
THCGW+S +E +T G+P+I WPL+ +QF N +VQ +K + + GEEE IG
Sbjct: 373 THCGWNSTLEGITSGIPLITWPLFGDQFCNQKLVVQVLKAGVSAGVEEVMKWGEEEKIGV 432
Query: 407 GEGVVSAERVEERVRELMMGS-EGKALRERSLEMRMMAATAWNNNDGGSSFTAFSNLF 463
+V E V++ V ELM S + K R R E+ A A +GGSS + + L
Sbjct: 433 ---LVDKEGVKKAVEELMGASDDAKERRRRVKELGESAHKA--VEEGGSSHSNITYLL 485
>sp|Q8W491|U73B3_ARATH UDP-glycosyltransferase 73B3 OS=Arabidopsis thaliana GN=UGT73B3
PE=2 SV=1
Length = 481
Score = 181 bits (459), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 146/493 (29%), Positives = 220/493 (44%), Gaps = 53/493 (10%)
Query: 2 KKTIALYPGPAFHHMISMVELGKLILQHRSDVSITILVPSMPLEESKTCSYINSISHRLN 61
K + +P A+ HMI +++ KL R S + P K +++
Sbjct: 8 KLHVVFFPFMAYGHMIPTLDMAKL-FSSRGAKSTILTTPLNSKIFQKPIERFKNLNPSFE 66
Query: 62 PIISFYYLPAIQMP-SETLSRADIAIE---------SIKLNSSNVF--QALENISLTSK- 108
I + P + + E D ++K S F LE + T++
Sbjct: 67 IDIQIFDFPCVDLGLPEGCENVDFFTSNNNDDRQYLTLKFFKSTRFFKDQLEKLLETTRP 126
Query: 109 ---ILSFIITSTTSFSYHPNIPTYT-----YFNSCASTLAAILYLPTLHN-QITSSFKDH 159
I T + N+P YF+ C+ Y +HN Q + +
Sbjct: 127 DCLIADMFFPWATEAAEKFNVPRLVFHGTGYFSLCSE------YCIRVHNPQNIVASRYE 180
Query: 160 PSSLLFIPGLPPVKSSFMPEPVLDR--QKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQA 217
P IP LP E + DR + + F + S KS+G+I+N+F LE
Sbjct: 181 P---FVIPDLPG-NIVITQEQIADRDEESEMGKFMIEVKESDVKSSGVIVNSFYELEPDY 236
Query: 218 IKAIVNGDCVTNGTTPPLHCIGPLIV-----DAKDRAGGVSDDVSSDCLTWLDSQPSGSV 272
D + IGPL V + K G + +CL WLDS+ SV
Sbjct: 237 ------ADFYKSVVLKRAWHIGPLSVYNRGFEEKAERGKKASINEVECLKWLDSKKPDSV 290
Query: 273 VFLCFGSRGTFSAPQLKEIAIGLERSNQRFLWVVR-NPSNAAEAELPEGFLERTKERGLV 331
+++ FGS F QL EIA GLE S F+WVVR N E LPEGF ER K +G++
Sbjct: 291 IYISFGSVACFKNEQLFEIAAGLETSGANFIWVVRKNIGIEKEEWLPEGFEERVKGKGMI 350
Query: 332 VKSWAPQSTILGHESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKV 391
++ WAPQ IL H++ GFVTHCGW+S++E V G+PM+ WP+ AEQF N + Q ++
Sbjct: 351 IRGWAPQVLILDHQATCGFVTHCGWNSLLEGVAAGLPMVTWPVAAEQFYNEKLVTQVLRT 410
Query: 392 AMPMFLNGEEETIGNGEGVVSAERVEERVRELMMGSEGKALRERSLEMRMMAATAWNNND 451
+ + T G+ +S E+V + VRE+++G E RER+ ++ MA A +
Sbjct: 411 GVSVGAKKNVRTTGD---FISREKVVKAVREVLVGEEADERRERAKKLAEMAKAA---VE 464
Query: 452 GGSSFTAFSNLFD 464
GGSSF ++ +
Sbjct: 465 GGSSFNDLNSFIE 477
>sp|Q9M9E7|U85A4_ARATH UDP-glycosyltransferase 85A4 OS=Arabidopsis thaliana GN=UGT85A4
PE=2 SV=1
Length = 489
Score = 177 bits (450), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 119/395 (30%), Positives = 203/395 (51%), Gaps = 44/395 (11%)
Query: 89 IKLNSSNVFQALENISLTSKILSFIITSTTSFSYHPNIPTYTYFNSCASTLAAIL-YLPT 147
++LNS + + I ++ +SF I + IP + + A+ L L Y
Sbjct: 109 LRLNSGSDIPPVSCI-ISDASMSFTIDAAEEL----KIPVVLLWTNSATALILYLHYQKL 163
Query: 148 LHNQI-----TSSFKDH-PSSLLFIPGLPPVKSSFMPEPVL--DRQKPIYDFFLNYSTSL 199
+ +I +S K H + + +IP + +K P+ V + Q P+ F L+ + +
Sbjct: 164 IEKEIIPLKDSSDLKKHLETEIDWIPSMKKIKLKDFPDFVTTTNPQDPMISFILHVTGRI 223
Query: 200 SKSNGIIINTFDFLEQQAIKAIVNGDCVTNGTTPPLHCIGPLIV--------DAKDRAGG 251
+++ I INTF+ LE + ++ P ++ +GP + +++ R G
Sbjct: 224 KRASAIFINTFEKLEHNVLLSL-------RSLLPQIYSVGPFQILENREIDKNSEIRKLG 276
Query: 252 VSD-DVSSDCLTWLDSQPSGSVVFLCFGSRGTFSAPQLKEIAIGLERSNQRFLWVVRNPS 310
++ + ++ L WLD++ +V+++ FGS ++ Q+ E A GL RS + FLWVVR+
Sbjct: 277 LNLWEEETESLDWLDTKAEKAVIYVNFGSLTVLTSEQILEFAWGLARSGKEFLWVVRSGM 336
Query: 311 -NAAEAELPEGFLERTKERGLVVKSWAPQSTILGHESVGGFVTHCGWSSVVEAVTYGVPM 369
+ ++ LP FL TK RG+++K W Q +L H ++GGF+THCGW+S +E++ GVPM
Sbjct: 337 VDGDDSILPAEFLSETKNRGMLIKGWCSQEKVLSHPAIGGFLTHCGWNSTLESLYAGVPM 396
Query: 370 IAWPLYAEQFLNSVALVQEMKVAMPMFLNGEEETIGNGEGVVSAERVEERVRELMMGSEG 429
I WP +A+Q N ++ + M + GEE V ERVE V+ELM G +G
Sbjct: 397 ICWPFFADQLTNRKFCCEDWGIGMEI---GEE---------VKRERVETVVKELMDGEKG 444
Query: 430 KALRERSLEMRMMAATAWNNNDGGSSFTAFSNLFD 464
K LRE+ +E R +A A + GSS+ F + +
Sbjct: 445 KRLREKVVEWRRLAEEA-SAPPLGSSYVNFETVVN 478
>sp|Q2V6J9|UFOG7_FRAAN UDP-glucose flavonoid 3-O-glucosyltransferase 7 OS=Fragaria
ananassa GN=GT7 PE=1 SV=1
Length = 487
Score = 177 bits (449), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 122/342 (35%), Positives = 172/342 (50%), Gaps = 24/342 (7%)
Query: 131 YFNSCASTLAAILYLPTLHNQITSSFKDHPSSLLFIPGLP-PVKSSFMPEPVLDRQKPIY 189
+F CAS L+ ++Y P H+ ++S S IP LP +K + PV +
Sbjct: 146 FFALCAS-LSVMMYQP--HSNLSSD-----SESFVIPNLPDEIKMTRSQLPVFPDESEFM 197
Query: 190 DFFLNYSTSLSKSNGIIINTFDFLEQQAIKAIVNGDCVTNGTTPPLHCIGPLIVDAKDRA 249
+S G+I+N+F LE P+ I D +R
Sbjct: 198 KMLKASIEIEERSYGVIVNSFYELEPAYANHYRKVFGRKAWHIGPVSFCNKAIEDKAERG 257
Query: 250 GGVSDDVSS-DCLTWLDSQPSGSVVFLCFGSRGTFSAPQLKEIAIGLERSNQRFLWVVRN 308
S +CL WLDS+ SVV++ FGS F+ QL EIA GLE S Q F+WVV+
Sbjct: 258 SIKSSTAEKHECLKWLDSKKPRSVVYVSFGSMVRFADSQLLEIATGLEASGQDFIWVVKK 317
Query: 309 PSNAAEAELPEGFLERTKERGLVVKSWAPQSTILGHESVGGFVTHCGWSSVVEAVTYGVP 368
E LPEGF +R + +GL+++ WAPQ IL HE++G FVTHCGW+S++EAV+ GVP
Sbjct: 318 EKKEVEEWLPEGFEKRMEGKGLIIRDWAPQVLILEHEAIGAFVTHCGWNSILEAVSAGVP 377
Query: 369 MIAWPLYAEQFLNSVALVQEMKVAMPM--------FLNGEEETIGNGEGVVSAERVEERV 420
MI WP++ EQF N + + ++ +P+ F++ ET EG V E +EE V
Sbjct: 378 MITWPVFGEQFYNEKLVTEIHRIGVPVGSEKWALSFVDVNAET----EGRVRREAIEEAV 433
Query: 421 RELMMGSEGKALRERSLEMRMMAATAWNNNDGGSSFTAFSNL 462
+M+G E R R E+ A A +GGSSF S L
Sbjct: 434 TRIMVGDEAVETRSRVKELGENARRA--VEEGGSSFLDLSAL 473
>sp|Q8VZE9|U73B1_ARATH UDP-glycosyltransferase 73B1 OS=Arabidopsis thaliana GN=UGT73B1
PE=2 SV=1
Length = 488
Score = 176 bits (447), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 100/261 (38%), Positives = 149/261 (57%), Gaps = 21/261 (8%)
Query: 202 SNGIIINTFDFLEQQAIKAIVNGDCVTNGTTPPLHCIGPLIV-----DAKDRAGGVSDDV 256
S G+++N+F LEQ D + IGPL + + K G +
Sbjct: 221 SFGVLVNSFYELEQ------AYSDYFKSFVAKRAWHIGPLSLGNRKFEEKAERGKKASID 274
Query: 257 SSDCLTWLDSQPSGSVVFLCFGSRGTFSAPQLKEIAIGLERSNQRFLWVVRNPSNAAEAE 316
+CL WLDS+ SV+++ FG+ +F QL EIA GL+ S F+WVV + E E
Sbjct: 275 EHECLKWLDSKKCDSVIYMAFGTMSSFKNEQLIEIAAGLDMSGHDFVWVVNRKGSQVEKE 334
Query: 317 --LPEGFLERTKERGLVVKSWAPQSTILGHESVGGFVTHCGWSSVVEAVTYGVPMIAWPL 374
LPEGF E+TK +GL+++ WAPQ IL H+++GGF+THCGW+S++E V G+PM+ WP+
Sbjct: 335 DWLPEGFEEKTKGKGLIIRGWAPQVLILEHKAIGGFLTHCGWNSLLEGVAAGLPMVTWPV 394
Query: 375 YAEQFLNSVALVQEMKVAMPMFLNGEEETIGNGEGVVSAERVEERVRELMMGSEGKALRE 434
AEQF N + Q +K + + + + +G+ +S E+VE VRE+M+G E R+
Sbjct: 395 GAEQFYNEKLVTQVLKTGVSVGVKKMMQVVGD---FISREKVEGAVREVMVGEE---RRK 448
Query: 435 RSLEMRMMAATAWNNNDGGSS 455
R+ E+ MA A +GGSS
Sbjct: 449 RAKELAEMAKNA--VKEGGSS 467
>sp|Q9ZQ99|U73C1_ARATH UDP-glycosyltransferase 73C1 OS=Arabidopsis thaliana GN=UGT73C1
PE=2 SV=1
Length = 491
Score = 174 bits (442), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 143/492 (29%), Positives = 225/492 (45%), Gaps = 48/492 (9%)
Query: 7 LYPGPAFHHMISMVELGKLILQHRSDVSITILVPSMPLEESKTCSYINSISHRLNPIISF 66
L+P A HMI MV++ +L+ Q +TI + + P + + ++ PI
Sbjct: 13 LFPFMAQGHMIPMVDIARLLAQR----GVTITIVTTPQNAGRFKNVLSRAIQSGLPI--- 65
Query: 67 YYLPAIQMPSE---------------TLSRADIAIESIKLNSSNVFQALENISLTSK--I 109
L ++ PS+ +L + ++ L V + L+ I I
Sbjct: 66 -NLVQVKFPSQESGSPEGQENLDLLDSLGASLTFFKAFSLLEEPVEKLLKEIQPRPNCII 124
Query: 110 LSFIITSTTSFSYHPNIPTYTYFNSCASTLAAILYLPTLHNQITSSFKDHPSSLLFIPGL 169
+ T + + IP + C L + H + + D IP
Sbjct: 125 ADMCLPYTNRIAKNLGIPKIIFHGMCCFNLLCTHIMHQNHEFLETIESD--KEYFPIPNF 182
Query: 170 PP-VKSSFMPEPVLDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAIVNGDCVT 228
P V+ + P++ DF + + S G+I+NTF+ LE ++
Sbjct: 183 PDRVEFTKSQLPMVLVAGDWKDFLDGMTEGDNTSYGVIVNTFEELEPAYVRDYKKVKAGK 242
Query: 229 NGTTPPLHCIGPLIVDAKDRAGGVSDDVSSDCLTWLDSQPSGSVVFLCFGSRGTFSAPQL 288
+ P+ L D +R G +D +C+ WLDS+ GSV+++C GS QL
Sbjct: 243 IWSIGPVSLCNKLGEDQAER-GNKADIDQDECIKWLDSKEEGSVLYVCLGSICNLPLSQL 301
Query: 289 KEIAIGLERSNQRFLWVVRNPSNAAEAELPE-----GFLERTKERGLVVKSWAPQSTILG 343
KE+ +GLE S + F+WV+R E L E G+ ER KERGL++ W+PQ IL
Sbjct: 302 KELGLGLEESQRPFIWVIRGWEKYNE--LLEWISESGYKERIKERGLLITGWSPQMLILT 359
Query: 344 HESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAMPMFLN----- 398
H +VGGF+THCGW+S +E +T GVP++ WPL+ +QF N VQ +K + +
Sbjct: 360 HPAVGGFLTHCGWNSTLEGITSGVPLLTWPLFGDQFCNEKLAVQILKAGVRAGVEESMRW 419
Query: 399 GEEETIGNGEGVVSAERVEERVRELMMGS-EGKALRERSLEMRMMAATAWNNNDGGSSFT 457
GEEE IG +V E V++ V ELM S + K R+R E+ +A A +GGSS +
Sbjct: 420 GEEEKIGV---LVDKEGVKKAVEELMGDSNDAKERRKRVKELGELAHKA--VEEGGSSHS 474
Query: 458 AFSNLF-DLWQI 468
+ L D+ Q+
Sbjct: 475 NITFLLQDIMQL 486
>sp|Q9SK82|U85A1_ARATH UDP-glycosyltransferase 85A1 OS=Arabidopsis thaliana GN=UGT85A1
PE=1 SV=1
Length = 489
Score = 173 bits (438), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 105/302 (34%), Positives = 160/302 (52%), Gaps = 32/302 (10%)
Query: 165 FIPGLPPVKSSFMPEPV--LDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAIV 222
FIP + VK +P + + + F L + +++ II+NTFD LE + A+
Sbjct: 189 FIPTMKNVKLKDIPSFIRTTNPDDVMISFALRETERAKRASAIILNTFDDLEHDVVHAM- 247
Query: 223 NGDCVTNGTTPPLHCIGPLI------VDAKDRAGGVSDDV---SSDCLTWLDSQPSGSVV 273
PP++ +GPL ++ G +S ++ +CL WLD++ SV+
Sbjct: 248 ------QSILPPVYSVGPLHLLANREIEEGSEIGMMSSNLWKEEMECLDWLDTKTQNSVI 301
Query: 274 FLCFGSRGTFSAPQLKEIAIGLERSNQRFLWVVRNPSNAAE-AELPEGFLERTKERGLVV 332
++ FGS S QL E A GL S + FLWV+R A E A +P FL TK+R ++
Sbjct: 302 YINFGSITVLSVKQLVEFAWGLAGSGKEFLWVIRPDLVAGEEAMVPPDFLMETKDRSMLA 361
Query: 333 KSWAPQSTILGHESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVA 392
SW PQ +L H ++GGF+THCGW+S++E+++ GVPM+ WP +A+Q +N E V
Sbjct: 362 -SWCPQEKVLSHPAIGGFLTHCGWNSILESLSCGVPMVCWPFFADQQMNCKFCCDEWDVG 420
Query: 393 MPMFLNGEEETIGNGEGVVSAERVEERVRELMMGSEGKALRERSLEMRMMAATAWNNNDG 452
+ IG G V E VE VRELM G +GK +RE+++E + +A A + G
Sbjct: 421 I---------EIG---GDVKREEVEAVVRELMDGEKGKKMREKAVEWQRLAEKATEHKLG 468
Query: 453 GS 454
S
Sbjct: 469 SS 470
>sp|Q8H0F2|ANGT_GENTR Anthocyanin 3'-O-beta-glucosyltransferase OS=Gentiana triflora PE=1
SV=1
Length = 482
Score = 172 bits (435), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 107/296 (36%), Positives = 158/296 (53%), Gaps = 20/296 (6%)
Query: 178 PEPVLDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAIVNGDCVTNGTTPPLHC 237
P+ + I + + N S S + G+I+N+F LE + D N
Sbjct: 188 PDETEENNTHITEMWKNISESENDCYGVIVNSFYELEPDYV------DYCKNVLGRRAWH 241
Query: 238 IGPLIV------DAKDRAGGVSDDVSSDCLTWLDSQPSGSVVFLCFGSRGTFSAPQLKEI 291
IGPL + D +R G SD + +CL WLDS+ SVV++CFGS F+A QL E+
Sbjct: 242 IGPLSLCNNEGEDVAER-GKKSDIDAHECLNWLDSKNPDSVVYVCFGSMANFNAAQLHEL 300
Query: 292 AIGLERSNQRFLWVVRNPSNAAEAE--LPEGFLERTKE--RGLVVKSWAPQSTILGHESV 347
A+GLE S Q F+WVVR + + P+GF +R +E +GL++K WAPQ IL HE+V
Sbjct: 301 AMGLEESGQEFIWVVRTCVDEEDESKWFPDGFEKRVQENNKGLIIKGWAPQVLILEHEAV 360
Query: 348 GGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAMPMFLNGEEETIGNG 407
G FV+HCGW+S +E + GV M+ WPL+AEQF N + ++ + + + + +
Sbjct: 361 GAFVSHCGWNSTLEGICGGVAMVTWPLFAEQFYNEKLMTDILRTGVSVG-SLQWSRVTTS 419
Query: 408 EGVVSAERVEERVRELMMGSEGKALRERSLEMRMMAATAWNNNDGGSSFTAFSNLF 463
VV E + + VR LM EG +R R+ ++ A A GGSS++ S L
Sbjct: 420 AVVVKRESISKAVRRLMAEEEGVDIRNRAKALKEKAKKAVEG--GGSSYSDLSALL 473
>sp|Q9LME8|U85A7_ARATH UDP-glycosyltransferase 85A7 OS=Arabidopsis thaliana GN=UGT85A7
PE=2 SV=1
Length = 487
Score = 169 bits (429), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 115/371 (30%), Positives = 179/371 (48%), Gaps = 45/371 (12%)
Query: 105 LTSKILSFIITSTTSFSYHPNIPTYTYFNSCASTLAAILYLPTLHNQITSSFKD------ 158
++ ++SF + + +P ++ + A IL+ + S FKD
Sbjct: 124 VSDGVMSFTLDAAEELG----VPEVIFWTNSACGFMTILHFYLFIEKGLSPFKDESYMSK 179
Query: 159 -HPSSLL-FIPGLPPVKSSFMPEPV--LDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLE 214
H +++ +IP + ++ +P + + + +F + +++ II+NTFD LE
Sbjct: 180 EHLDTVIDWIPSMKNLRLKDIPSYIRTTNPDNIMLNFLIREVERSKRASAIILNTFDELE 239
Query: 215 QQAIKAIVNGDCVTNGTTPPLHCIGPLIVDAKDRAGGVSD---------DVSSDCLTWLD 265
I+++ PP++ IGPL + K+ S+ +CL WLD
Sbjct: 240 HDVIQSM-------QSILPPVYSIGPLHLLVKEEINEASEIGQMGLNLWREEMECLDWLD 292
Query: 266 SQPSGSVVFLCFGSRGTFSAPQLKEIAIGLERSNQRFLWVVRNPSNAAEAE--LPEGFLE 323
++ SV+F+ FG SA QL+E A GL S + FLWV+R EA LP+ FL
Sbjct: 293 TKTPNSVLFVNFGCITVMSAKQLEEFAWGLAASRKEFLWVIRPNLVVGEAMVVLPQEFLA 352
Query: 324 RTKERGLVVKSWAPQSTILGHESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSV 383
T +R ++ SW PQ +L H ++GGF+THCGW+S +E++ GVPMI WP ++EQ N
Sbjct: 353 ETIDRRMLA-SWCPQEKVLSHPAIGGFLTHCGWNSTLESLAGGVPMICWPCFSEQPTNCK 411
Query: 384 ALVQEMKVAMPMFLNGEEETIGNGEGVVSAERVEERVRELMMGSEGKALRERSLEMRMMA 443
E V + IG V E VE VRELM G +GK LRE++ E R +A
Sbjct: 412 FCCDEWGVGI---------EIGKD---VKREEVETVVRELMDGEKGKKLREKAEEWRRLA 459
Query: 444 ATAWNNNDGGS 454
A G S
Sbjct: 460 EEATRYKHGSS 470
>sp|Q9ZVX4|U90A1_ARATH UDP-glycosyltransferase 90A1 OS=Arabidopsis thaliana GN=UGT90A1
PE=2 SV=1
Length = 478
Score = 169 bits (427), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 133/475 (28%), Positives = 225/475 (47%), Gaps = 53/475 (11%)
Query: 5 IALYPGPAFHHMISMVELGKLILQH-RSDVSITILVPSMPLEESKTCSYINSISHRLNPI 63
+ L+P + H+I +++ G+L+L+H R + +IT+ V + P + +++ P
Sbjct: 10 VVLFPFMSKGHIIPLLQFGRLLLRHHRKEPTITVTVFTTPKNQPFISDFLSD-----TPE 64
Query: 64 ISFYYLPAIQMPSETLSRADIAIESI-KLNSSNVFQAL-----------ENISLTSKILS 111
I LP E ++ +E+ KL S ++F E T +S
Sbjct: 65 IKVISLPF----PENITGIPPGVENTEKLPSMSLFVPFTRATKLLQPFFEETLKTLPKVS 120
Query: 112 FIITS-----TTSFSYHPNIPTYTYF--NSCASTLAAILYLPTLHNQITSSFKDHPSSLL 164
F+++ T+ + NIP + + NS ++ ++ ++ L + S P ++
Sbjct: 121 FMVSDGFLWWTSESAAKFNIPRFVSYGMNSYSAAVSISVFKHELFTEPESKSDTEPVTVP 180
Query: 165 FIPGLPPVKSSF---MPEPVLDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAI 221
P + K F EP + + ++ S + S+G ++N+F LE +
Sbjct: 181 DFPWIKVKKCDFDHGTTEP--EESGAALELSMDQIKSTTTSHGFLVNSFYELESAFVDYN 238
Query: 222 VNGDCVTNGTTPPLHCIGPLIVDAKDRAGGVSDDVSSDCLTWLDSQPSGS--VVFLCFGS 279
N +G P C+GPL + + G + WLD + V+++ FG+
Sbjct: 239 NN-----SGDKPKSWCVGPLCLTDPPKQGSAK----PAWIHWLDQKREEGRPVLYVAFGT 289
Query: 280 RGTFSAPQLKEIAIGLERSNQRFLWVVRNPSNAAEAELPEGFLERTKERGLVVKSWAPQS 339
+ S QL E+A GLE S FLWV R E + EGF +R +E G++V+ W Q
Sbjct: 290 QAEISNKQLMELAFGLEDSKVNFLWVTRK---DVEEIIGEGFNDRIRESGMIVRDWVDQW 346
Query: 340 TILGHESVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAMPMFLNG 399
IL HESV GF++HCGW+S E++ GVP++AWP+ AEQ LN+ +V+E+KV + +
Sbjct: 347 EILSHESVKGFLSHCGWNSAQESICVGVPLLAWPMMAEQPLNAKMVVEEIKVGVRV---- 402
Query: 400 EEETIGNGEGVVSAERVEERVRELMMGSEGKALRERSLEMRMMAATAWNNNDGGS 454
E G+ +G V+ E + +++ELM G GK R+ E MA A G S
Sbjct: 403 -ETEDGSVKGFVTREELSGKIKELMEGETGKTARKNVKEYSKMAKAALVEGTGSS 456
>sp|Q9ZWJ3|U85A2_ARATH UDP-glycosyltransferase 85A2 OS=Arabidopsis thaliana GN=UGT85A2
PE=2 SV=1
Length = 481
Score = 168 bits (426), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 111/351 (31%), Positives = 173/351 (49%), Gaps = 40/351 (11%)
Query: 126 IPTYTYFNSCASTLAAILYLPTLHNQITSSFKDH--------PSSLLFIPGLPPVKSSFM 177
+P ++ + A A LY + S KD + + +IP + ++ +
Sbjct: 138 VPEVLFWTTSACGFLAYLYYYRFIEKGLSPIKDESYLTKEHLDTKIDWIPSMKNLRLKDI 197
Query: 178 PEPV--LDRQKPIYDFFLNYSTSLSKSNGIIINTFDFLEQQAIKAIVNGDCVTNGTTPPL 235
P + + + +F + + +++ II+NTFD LE I+++ PP+
Sbjct: 198 PSFIRTTNPDDIMLNFIIREADRAKRASAIILNTFDDLEHDVIQSM-------KSIVPPV 250
Query: 236 HCIGPLIVDAKDRAGGVSD---------DVSSDCLTWLDSQPSGSVVFLCFGSRGTFSAP 286
+ IGPL + K +G S+ ++CL WL+++ SVV++ FGS SA
Sbjct: 251 YSIGPLHLLEKQESGEYSEIGRTGSNLWREETECLDWLNTKARNSVVYVNFGSITVLSAK 310
Query: 287 QLKEIAIGLERSNQRFLWVVRNPSNAA-EAELPEGFLERTKERGLVVKSWAPQSTILGHE 345
QL E A GL + + FLWV+R A EA +P FL T +R ++ SW PQ +L H
Sbjct: 311 QLVEFAWGLAATGKEFLWVIRPDLVAGDEAMVPPEFLTATADRRMLA-SWCPQEKVLSHP 369
Query: 346 SVGGFVTHCGWSSVVEAVTYGVPMIAWPLYAEQFLNSVALVQEMKVAMPMFLNGEEETIG 405
++GGF+THCGW+S +E++ GVPM+ WP +AEQ N E +V + IG
Sbjct: 370 AIGGFLTHCGWNSTLESLCGGVPMVCWPFFAEQQTNCKFSRDEWEVGI---------EIG 420
Query: 406 NGEGVVSAERVEERVRELMMGSEGKALRERSLEMRMMAATAWNNNDGGSSF 456
G V E VE VRELM +GK +RE++ E R +A A + G S
Sbjct: 421 ---GDVKREEVEAVVRELMDEEKGKNMREKAEEWRRLANEATEHKHGSSKL 468
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.319 0.133 0.395
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 170,389,725
Number of Sequences: 539616
Number of extensions: 7025363
Number of successful extensions: 17675
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 236
Number of HSP's successfully gapped in prelim test: 17
Number of HSP's that attempted gapping in prelim test: 17031
Number of HSP's gapped (non-prelim): 276
length of query: 468
length of database: 191,569,459
effective HSP length: 121
effective length of query: 347
effective length of database: 126,275,923
effective search space: 43817745281
effective search space used: 43817745281
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 63 (28.9 bits)