BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 044624
(358 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q9LXV0|U92A1_ARATH UDP-glycosyltransferase 92A1 OS=Arabidopsis thaliana GN=UGT92A1
PE=2 SV=1
Length = 488
Score = 164 bits (415), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 80/181 (44%), Positives = 108/181 (59%), Gaps = 7/181 (3%)
Query: 1 MLPFMAQGHVIPFLALAHHLES------TKNYTIAFVNTHLNIKKIKSSLPQSSPIHFLE 54
M PFM QGH+IPF+ALA LE TI+ +NT NI KI+S+LP S I +E
Sbjct: 13 MFPFMGQGHIIPFVALALRLEKIMIMNRANKTTISMINTPSNIPKIRSNLPPESSISLIE 72
Query: 55 TPFNIIDHDLPPCTENTDSHPFDVVRKLLEATLSFKPHFKKLIIDLIDEQNGHKPLCIIT 114
PFN DH LP EN DS P+ +V LLEA+ S + F+ + ++ E+ G + +I
Sbjct: 73 LPFNSSDHGLPHDGENFDSLPYSLVISLLEASRSLREPFRDFMTKILKEE-GQSSVIVIG 131
Query: 115 DMFFGWCKEIAQEYGIFHAIFIEGGGFGFACYYSLWVDLPHRNTDSDEFLLLDFPEASTI 174
D F GW ++ +E G++ IF G FG CY S+W++LPH+ T D+FLL DFPEA I
Sbjct: 132 DFFLGWIGKVCKEVGVYSVIFSASGAFGLGCYRSIWLNLPHKETKQDQFLLDDFPEAGEI 191
Query: 175 H 175
Sbjct: 192 E 192
Score = 158 bits (399), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 91/214 (42%), Positives = 121/214 (56%), Gaps = 43/214 (20%)
Query: 174 IHPVLRFTGSKAGAGKEHGISAELCKNWLDRKPCRSVLYVSFGSQDTIAVSQMVQLAMAL 233
+ PVL+ K G+ + E K+WLD KP SV+YV FGS ++I + M++LAMAL
Sbjct: 253 VGPVLKSPDKKVGSRS----TEEAVKSWLDSKPDHSVVYVCFGSMNSILQTHMLELAMAL 308
Query: 234 EASGKNFIWIVRPPIGFDINSEF------------RANDAD------------------- 262
E+S KNFIW+VRPPIG ++ SEF R ++
Sbjct: 309 ESSEKNFIWVVRPPIGVEVKSEFDVKGYLPEGFEERITRSERGLLVKKWAPQVDILSHKA 368
Query: 263 --------GTQSALEALSHGVPINGWPLAAEQFYNSNLLGEEVGVCAEVARGMNCAVLKE 314
G S LE+LSHGVP+ GWP+AAEQF+NS L+ + +GV EVARG C + +
Sbjct: 369 TCVFLSHCGWNSILESLSHGVPLLGWPMAAEQFFNSILMEKHIGVSVEVARGKRCEIKCD 428
Query: 315 HIVVKIELVMNETEKGKPMRMKDLEVKEIIDNAF 348
IV KI+LVM ETE GK +R K EVKE++ A
Sbjct: 429 DIVSKIKLVMEETEVGKEIRKKAREVKELVRRAM 462
>sp|Q6WFW1|GLT3_CROSA Crocetin glucosyltransferase 3 OS=Crocus sativus GN=GLT3 PE=1 SV=1
Length = 475
Score = 142 bits (359), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 73/172 (42%), Positives = 98/172 (56%), Gaps = 2/172 (1%)
Query: 1 MLPFMAQGHVIPFLALAHHL-ESTKNYTIAFVNTHLNIKKIKSSLPQSSPIHFLETPFNI 59
+ PFM+QGH+IPFL+LA + E YTI +NT LNI ++S+LP +S IH P+
Sbjct: 8 LFPFMSQGHIIPFLSLAKLISERHPTYTITLLNTPLNILNLQSTLPPNSNIHLKSLPYRS 67
Query: 60 IDHDLPPCTENTDSHPFDVVRKLLEATLSFKPHFKKLIIDLIDEQNGHKPLCIITDMFFG 119
D LPP ENTDS PF +V ++ S HF + DL + + PL I+ D+FFG
Sbjct: 68 SDFGLPPDRENTDSLPFPLVLSFYQSGESLATHFTHFVSDLTRQNHDTPPLLIVADVFFG 127
Query: 120 WCKEIAQEYGIFHAIFIEGGGFGFACYYSLWVDLPHRNTDSDEFLLLDFPEA 171
W EIA+ H F G +G A Y+S+W+ LPH TD +F FPE
Sbjct: 128 WTAEIAKRLNT-HVSFSTCGAYGTAAYFSVWLHLPHAETDLPDFTAPGFPET 178
Score = 59.3 bits (142), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 53/161 (32%), Positives = 66/161 (40%), Gaps = 42/161 (26%)
Query: 176 PVLRFTGSKAGAGKEHGISAELCKNWLDRKPCRSVLYVSFGS-QDTIAVSQMVQLAMALE 234
P L S +G++ G+ WLD P SV+YVSFGS DT A QM LA+ L
Sbjct: 248 PSLPPNSSLGRSGRKSGMEVSYIMKWLDSHPPGSVVYVSFGSIHDTAA--QMTSLAVGLA 305
Query: 235 A---------SGKNFIWIVRP---PIGFDINSEFRANDAD-------------------- 262
SG+ F P G E R +
Sbjct: 306 VELATRSCGHSGRRFGGNRNRNSNPNGVPDEFEARMRGSGRGILIHGWAPQLEILEHEST 365
Query: 263 -------GTQSALEALSHGVPINGWPLAAEQFYNSNLLGEE 296
G S LE+LS GV + GWPLAAEQFYNS ++ E+
Sbjct: 366 GAFVSHCGWNSTLESLSRGVCMIGWPLAAEQFYNSKMVEED 406
>sp|Q2V6J9|UFOG7_FRAAN UDP-glucose flavonoid 3-O-glucosyltransferase 7 OS=Fragaria
ananassa GN=GT7 PE=1 SV=1
Length = 487
Score = 119 bits (299), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 112/390 (28%), Positives = 163/390 (41%), Gaps = 105/390 (26%)
Query: 2 LPFMAQGHVIPFLALAHHLESTKNYTIAFVNTHLNIKKIKSSLPQSSPIHFLETPFNIID 61
LPFMA+GH IP +A L S+ V T LN + S Q I + F +
Sbjct: 16 LPFMARGHSIPLTDIAK-LFSSHGARCTIVTTPLN-APLFSKATQRGEIELVLIKFPSAE 73
Query: 62 HDLPPCTENTD-SHPFDVVRKLLEATLSFKPHFKKLIIDLIDEQNGHKPLCIITDMFFGW 120
LP E+ D D++ K ++AT +PHF+K++ DE H+P C++ D FF W
Sbjct: 74 AGLPQDCESADLITTQDMLGKFVKATFLIEPHFEKIL----DE---HRPHCLVADAFFTW 126
Query: 121 CKEIAQEYGIFHAIFIEGGGFGFACYYSLWVDLPHRN--TDSDEFLLLD----------- 167
++A ++ I F G F S+ + PH N +DS+ F++ +
Sbjct: 127 ATDVAAKFRIPRLYFHGTGFFALCASLSVMMYQPHSNLSSDSESFVIPNLPDEIKMTRSQ 186
Query: 168 ---FPEASTIHPVLRFT------------------------------GSKA--------- 185
FP+ S +L+ + G KA
Sbjct: 187 LPVFPDESEFMKMLKASIEIEERSYGVIVNSFYELEPAYANHYRKVFGRKAWHIGPVSFC 246
Query: 186 ----------GAGKEHGISAELCKNWLDRKPCRSVLYVSFGSQDTIAVSQMVQLAMALEA 235
G+ K C WLD K RSV+YVSFGS A SQ++++A LEA
Sbjct: 247 NKAIEDKAERGSIKSSTAEKHECLKWLDSKKPRSVVYVSFGSMVRFADSQLLEIATGLEA 306
Query: 236 SGKNFIWIVRP---------PIGFDINSEFRA--------------NDADGT-------Q 265
SG++FIW+V+ P GF+ E + ++A G
Sbjct: 307 SGQDFIWVVKKEKKEVEEWLPEGFEKRMEGKGLIIRDWAPQVLILEHEAIGAFVTHCGWN 366
Query: 266 SALEALSHGVPINGWPLAAEQFYNSNLLGE 295
S LEA+S GVP+ WP+ EQFYN L+ E
Sbjct: 367 SILEAVSAGVPMITWPVFGEQFYNEKLVTE 396
>sp|Q8VZE9|U73B1_ARATH UDP-glycosyltransferase 73B1 OS=Arabidopsis thaliana GN=UGT73B1
PE=2 SV=1
Length = 488
Score = 110 bits (276), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 118/461 (25%), Positives = 189/461 (40%), Gaps = 124/461 (26%)
Query: 1 MLPFMAQGHVIPFLALAHHLESTKNYTIAFVNTHLNIK----KIKSSLPQSSP------I 50
+ PFMA GH+IP L +A L +TK + T LN K K S Q +P I
Sbjct: 14 LFPFMAHGHMIPTLDMAK-LFATKGAKSTILTTPLNAKLFFEKPIKSFNQDNPGLEDITI 72
Query: 51 HFLETPFNIIDHDLPPCTENTD---SHP----FDVVRKLLEATLSFKPHFKKLIIDLIDE 103
L P + LP ENTD S P D+ +K L A F+ ++L++ +
Sbjct: 73 QILNFP--CTELGLPDGCENTDFIFSTPDLNVGDLSQKFLLAMKYFEEPLEELLVTM--- 127
Query: 104 QNGHKPLCIITDMFFGWCKEIAQEYGIFHAIFIEGGGFGFACYYSLWV------------ 151
+P C++ +MFF W ++A+++G+ +F G F + + +
Sbjct: 128 ----RPDCLVGNMFFPWSTKVAEKFGVPRLVFHGTGYFSLCASHCIRLPKNVATSSEPFV 183
Query: 152 --DLPH-------------------------RNTDSDEFLLL------------DF---- 168
DLP R+++ D F +L D+
Sbjct: 184 IPDLPGDILITEEQVMETEEESVMGRFMKAIRDSERDSFGVLVNSFYELEQAYSDYFKSF 243
Query: 169 --PEASTIHPVL---RFTGSKAGAGKEHGISAELCKNWLDRKPCRSVLYVSFGSQDTIAV 223
A I P+ R KA GK+ I C WLD K C SV+Y++FG+ +
Sbjct: 244 VAKRAWHIGPLSLGNRKFEEKAERGKKASIDEHECLKWLDSKKCDSVIYMAFGTMSSFKN 303
Query: 224 SQMVQLAMALEASGKNFIWIVRP-----------PIGFDINSEFRANDAD---------- 262
Q++++A L+ SG +F+W+V P GF+ ++ +
Sbjct: 304 EQLIEIAAGLDMSGHDFVWVVNRKGSQVEKEDWLPEGFEEKTKGKGLIIRGWAPQVLILE 363
Query: 263 -----------GTQSALEALSHGVPINGWPLAAEQFYNSNLLGE--EVGVCAEVARGMNC 309
G S LE ++ G+P+ WP+ AEQFYN L+ + + GV V + M
Sbjct: 364 HKAIGGFLTHCGWNSLLEGVAAGLPMVTWPVGAEQFYNEKLVTQVLKTGVSVGVKKMMQ- 422
Query: 310 AVLKEHIVV-KIELVMNETEKGKPMRMKDLEVKEIIDNAFR 349
V+ + I K+E + E G+ R + E+ E+ NA +
Sbjct: 423 -VVGDFISREKVEGAVREVMVGEERRKRAKELAEMAKNAVK 462
>sp|Q9SCP5|U73C7_ARATH UDP-glycosyltransferase 73C7 OS=Arabidopsis thaliana GN=UGT73C7
PE=2 SV=1
Length = 490
Score = 110 bits (274), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 112/467 (23%), Positives = 189/467 (40%), Gaps = 127/467 (27%)
Query: 1 MLPFMAQGHVIPFLALAHHLESTKNYTIAFVNTHLNIKKIKSSLPQSS---PIHFLETPF 57
++PFMAQGH+IP + ++ L + T+ + T N+ KIK+SL SS I+ +E F
Sbjct: 11 VIPFMAQGHMIPLVDISRLLSQRQGVTVCIITTTQNVAKIKTSLSFSSLFATINIVEVKF 70
Query: 58 NIIDHDLPPCTENTD--SHPFDVVRKLLEATLSFKPHFKKLIIDLIDEQNGHKPLCIITD 115
LP E+ D + D+V K +A S + +K + +++ +P CII D
Sbjct: 71 LSQQTGLPEGCESLDMLASMGDMV-KFFDAANSLEEQVEKAMEEMVQ----PRPSCIIGD 125
Query: 116 MFFGWCKEIAQEYGIFHAIFIEGGGFGFACYYSLWVDLPHRN------TDSDEFLLL--- 166
M + +A+++ I IF GF+C+ + + + + +DE+ L
Sbjct: 126 MSLPFTSRLAKKFKIPKLIF-----HGFSCFSLMSIQVVRESGILKMIESNDEYFDLPGL 180
Query: 167 ----DF--PEASTIHPV----------------------------------LRFTGSKAG 186
+F P+ S + PV + ++AG
Sbjct: 181 PDKVEFTKPQVSVLQPVEGNMKESTAKIIEADNDSYGVIVNTFEELEVDYAREYRKARAG 240
Query: 187 A--------------------GKEHGISAELCKNWLDRKPCRSVLYVSFGSQDTIAVSQM 226
G + I + C WLD + SVLYV GS + ++Q+
Sbjct: 241 KVWCVGPVSLCNRLGLDKAKRGDKASIGQDQCLQWLDSQETGSVLYVCLGSLCNLPLAQL 300
Query: 227 VQLAMALEASGKNFIWIVRPPIGF-DI-----NSEFRANDAD------------------ 262
+L + LEAS K FIW++R + D+ S F D
Sbjct: 301 KELGLGLEASNKPFIWVIREWGKYGDLANWMQQSGFEERIKDRGLVIKGWAPQVFILSHA 360
Query: 263 ---------GTQSALEALSHGVPINGWPLAAEQFYNSNLL------GEEVGVCAEVARG- 306
G S LE ++ GVP+ WPL AEQF N L+ G ++GV + G
Sbjct: 361 SIGGFLTHCGWNSTLEGITAGVPLLTWPLFAEQFLNEKLVVQILKAGLKIGVEKLMKYGK 420
Query: 307 ---MNCAVLKEHIVVKIELVMNETEKGKPMRMKDLEVKEIIDNAFRN 350
+ V +E + ++ +M ++E+ + R K E+ ++ + A
Sbjct: 421 EEEIGAMVSRECVRKAVDELMGDSEEAEERRRKVTELSDLANKALEK 467
>sp|Q7Y232|U73B4_ARATH UDP-glycosyltransferase 73B4 OS=Arabidopsis thaliana GN=UGT73B4
PE=2 SV=1
Length = 484
Score = 107 bits (267), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 113/456 (24%), Positives = 171/456 (37%), Gaps = 130/456 (28%)
Query: 2 LPFMAQGHVIPFLALAHHLESTKNYTIAFVNTHLNIKKIKSSLPQ---SSP-----IHFL 53
PFMA GH+IP L +A L + + + T +N K ++ + +P I L
Sbjct: 11 FPFMAHGHMIPLLDMAK-LFARRGAKSTLLTTPINAKILEKPIEAFKVQNPDLEIGIKIL 69
Query: 54 ETPFNIIDHDLPPCTENTD-------SHPFDVVRKLLEATLSFKPHFKKLIIDLIDEQNG 106
P ++ LP EN D S FD+ K L +T K + I
Sbjct: 70 NFP--CVELGLPEGCENRDFINSYQKSDSFDLFLKFLFSTKYMKQQLESFI-------ET 120
Query: 107 HKPLCIITDMFFGWCKEIAQEYGIFHAIFIEGGGFGFAC--------------------- 145
KP ++ DMFF W E A++ G+ +F F C
Sbjct: 121 TKPSALVADMFFPWATESAEKIGVPRLVFHGTSSFALCCSYNMRIHKPHKKVASSSTPFV 180
Query: 146 -----------------------YYSLWVDLPHRNTDSDEFLLLDFPEAST--------- 173
+ W ++ T S L+ F E +
Sbjct: 181 IPGLPGDIVITEDQANVTNEETPFGKFWKEVRESETSSFGVLVNSFYELESSYADFYRSF 240
Query: 174 -------IHPVL---RFTGSKAGAGKEHGISAELCKNWLDRKPCRSVLYVSFGSQDTIAV 223
I P+ R KAG GK+ I + C WLD K SV+Y+SFGS +
Sbjct: 241 VAKKAWHIGPLSLSNRGIAEKAGRGKKANIDEQECLKWLDSKTPGSVVYLSFGSGTGLPN 300
Query: 224 SQMVQLAMALEASGKNFIWIVRP--------------PIGFDINSEFRA----------- 258
Q++++A LE SG+NFIW+V P GF+ ++ +
Sbjct: 301 EQLLEIAFGLEGSGQNFIWVVSKNENQVGTGENEDWLPKGFEERNKGKGLIIRGWAPQVL 360
Query: 259 ---NDADGT-------QSALEALSHGVPINGWPLAAEQFYNSNLL------GEEVGVCAE 302
+ A G S LE ++ G+P+ WP+ AEQFYN LL G VG
Sbjct: 361 ILDHKAIGGFVTHCGWNSTLEGIAAGLPMVTWPMGAEQFYNEKLLTKVLRIGVNVGATEL 420
Query: 303 VARGMNCAVLKEHIVVKIELVMNETEKGKPMRMKDL 338
V +G + + V+ E++ E + + +R K+L
Sbjct: 421 VKKGKLISRAQVEKAVR-EVIGGEKAEERRLRAKEL 455
>sp|Q9ZQ97|U73C4_ARATH UDP-glycosyltransferase 73C4 OS=Arabidopsis thaliana GN=UGT73C4
PE=2 SV=1
Length = 496
Score = 107 bits (266), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 110/465 (23%), Positives = 183/465 (39%), Gaps = 122/465 (26%)
Query: 1 MLPFMAQGHVIPFLALAHHLESTKNYTIAFVNTHLNIKKIKSSLPQSS----PIHFLETP 56
+ PFMAQGH+IP + +A L + + T+ V T N + ++ L ++ PI+ +
Sbjct: 17 LFPFMAQGHMIPMIDIARLL-AQRGATVTIVTTRYNAGRFENVLSRAMESGLPINIVHVN 75
Query: 57 FNIIDHDLPPCTENTDSH-PFDVVRKLLEATLSFKPHFKKLIIDLIDEQNGHKPLCIITD 115
F + LP EN DS+ +++ +A + KL+ E+ +P CII+D
Sbjct: 76 FPYQEFGLPEGKENIDSYDSMELMVPFFQAVNMLEDPVMKLM-----EEMKPRPSCIISD 130
Query: 116 MFFGWCKEIAQEYGIFHAIFIEGGGFGFACYYSLWVD---LPHRNTDSDEFLLLDFPEA- 171
+ + +IA+++ I +F G F C + L + L + +D D FL+ FP+
Sbjct: 131 LLLPYTSKIARKFSIPKIVFHGTGCFNLLCMHVLRRNLEILKNLKSDKDYFLVPSFPDRV 190
Query: 172 ---------------------------------------STIHPVLRFTGSKAGAGK--- 189
+ P +KA AGK
Sbjct: 191 EFTKPQVPVETTASGDWKAFLDEMVEAEYTSYGVIVNTFQELEPAYVKDYTKARAGKVWS 250
Query: 190 -------------------EHGISAELCKNWLDRKPCRSVLYVSFGSQDTIAVSQMVQLA 230
+ I + C WLD K SVLYV GS + +SQ+ +L
Sbjct: 251 IGPVSLCNKAGADKAERGNQAAIDQDECLQWLDSKEDGSVLYVCLGSICNLPLSQLKELG 310
Query: 231 MALEASGKNFIWIVRPPIGFD------INSEFRANDAD---------------------- 262
+ LE S ++FIW++R ++ + S F +
Sbjct: 311 LGLEKSQRSFIWVIRGWEKYNELYEWMMESGFEERIKERGLLIKGWSPQVLILSHPSVGG 370
Query: 263 -----GTQSALEALSHGVPINGWPLAAEQFYNSNLLGE--EVGVCAEVARGMNCA----- 310
G S LE ++ G+P+ WPL +QF N L+ + + GV A V M
Sbjct: 371 FLTHCGWNSTLEGITSGIPLITWPLFGDQFCNQKLVVQVLKAGVSAGVEEVMKWGEEEKI 430
Query: 311 ---VLKEHIVVKIELVMNETEKGKPMRMKDLEVKEIIDNAFRNDE 352
V KE + +E +M ++ K R + VKE+ ++A + E
Sbjct: 431 GVLVDKEGVKKAVEELMGASDDAKERRRR---VKELGESAHKAVE 472
>sp|Q9ZQ99|U73C1_ARATH UDP-glycosyltransferase 73C1 OS=Arabidopsis thaliana GN=UGT73C1
PE=2 SV=1
Length = 491
Score = 95.1 bits (235), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 115/468 (24%), Positives = 179/468 (38%), Gaps = 136/468 (29%)
Query: 1 MLPFMAQGHVIPFLALAHHLESTKNYTIAFVNTHLNIKKIKSSLPQSS----PIHFLETP 56
+ PFMAQGH+IP + +A L + + TI V T N + K+ L ++ PI+ ++
Sbjct: 13 LFPFMAQGHMIPMVDIARLL-AQRGVTITIVTTPQNAGRFKNVLSRAIQSGLPINLVQVK 71
Query: 57 FNIIDHDLPPCTENTDSHPFDVVRKLLEATLSFKPHFKKL--IIDLIDEQNGHKPLCIIT 114
F + P EN D + L A+L+F F L ++ + ++ +P CII
Sbjct: 72 FPSQESGSPEGQENLD------LLDSLGASLTFFKAFSLLEEPVEKLLKEIQPRPNCIIA 125
Query: 115 DMFFGWCKEIAQEYGIFHAIFIEGGGFGFACYYSLWVDLPHRN--------TDSDEFLLL 166
DM + IA+ GI IF G C+ L + H+N +D + F +
Sbjct: 126 DMCLPYTNRIAKNLGIPKIIF-----HGMCCFNLLCTHIMHQNHEFLETIESDKEYFPIP 180
Query: 167 DFPE----ASTIHPVLRFTG-----------------------------------SKAGA 187
+FP+ + P++ G K A
Sbjct: 181 NFPDRVEFTKSQLPMVLVAGDWKDFLDGMTEGDNTSYGVIVNTFEELEPAYVRDYKKVKA 240
Query: 188 GK------------------EHGISAEL----CKNWLDRKPCRSVLYVSFGSQDTIAVSQ 225
GK E G A++ C WLD K SVLYV GS + +SQ
Sbjct: 241 GKIWSIGPVSLCNKLGEDQAERGNKADIDQDECIKWLDSKEEGSVLYVCLGSICNLPLSQ 300
Query: 226 MVQLAMALEASGKNFIWIVRPPIGFDINSEF-----------RANDAD------------ 262
+ +L + LE S + FIW++R G++ +E R +
Sbjct: 301 LKELGLGLEESQRPFIWVIR---GWEKYNELLEWISESGYKERIKERGLLITGWSPQMLI 357
Query: 263 -------------GTQSALEALSHGVPINGWPLAAEQFYNSNLLGE--EVGVCAEVARGM 307
G S LE ++ GVP+ WPL +QF N L + + GV A V M
Sbjct: 358 LTHPAVGGFLTHCGWNSTLEGITSGVPLLTWPLFGDQFCNEKLAVQILKAGVRAGVEESM 417
Query: 308 NCA--------VLKEHIVVKIELVMNETEKGKPMRMKDLEVKEIIDNA 347
V KE + +E +M ++ K R + E+ E+ A
Sbjct: 418 RWGEEEKIGVLVDKEGVKKAVEELMGDSNDAKERRKRVKELGELAHKA 465
>sp|Q9LNI1|U72B3_ARATH UDP-glycosyltransferase 72B3 OS=Arabidopsis thaliana GN=UGT72B3
PE=2 SV=1
Length = 481
Score = 90.9 bits (224), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 67/228 (29%), Positives = 98/228 (42%), Gaps = 50/228 (21%)
Query: 167 DFPEASTIHPVLRFTGSKAGAGKEHGISAELCKNWLDRKPCRSVLYVSFGSQDTIAVSQM 226
D P I P++ A E+ C NWLD +P SVLYVSFGS T+ Q
Sbjct: 232 DKPPVYLIGPLVNSGSHDADVNDEYK-----CLNWLDNQPFGSVLYVSFGSGGTLTFEQF 286
Query: 227 VQLAMALEASGKNFIWIVRPPIGFDINSEFRANDAD------------------------ 262
++LA+ L SGK F+W++R P G +S F +
Sbjct: 287 IELALGLAESGKRFLWVIRSPSGIASSSYFNPQSRNDPFSFLPQGFLDRTKEKGLVVGSW 346
Query: 263 -------------------GTQSALEALSHGVPINGWPLAAEQFYNSNLLGEEVGVCAEV 303
G S+LE++ +GVP+ WPL AEQ N+ LL +VG
Sbjct: 347 APQAQILTHTSIGGFLTHCGWNSSLESIVNGVPLIAWPLYAEQKMNALLL-VDVGAALRA 405
Query: 304 ARGMNCAVLKEHIVVKIELVMNETEKGKPMRMKDLEVKEIIDNAFRND 351
G + V +E + ++ ++ E E+G +R K E+KE R+D
Sbjct: 406 RLGEDGVVGREEVARVVKGLI-EGEEGNAVRKKMKELKEGSVRVLRDD 452
>sp|Q9M156|U72B1_ARATH UDP-glycosyltransferase 72B1 OS=Arabidopsis thaliana GN=UGT72B1
PE=1 SV=1
Length = 480
Score = 88.2 bits (217), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 67/229 (29%), Positives = 101/229 (44%), Gaps = 49/229 (21%)
Query: 166 LDFPEASTIHPVLRFTGSKAGAGKEHGISAELCKNWLDRKPCRSVLYVSFGSQDTIAVSQ 225
LD P + P++ +A +E C WLD +P SVLYVSFGS T+ Q
Sbjct: 231 LDKPPVYPVGPLVNIGKQEAKQTEESE-----CLKWLDNQPLGSVLYVSFGSGGTLTCEQ 285
Query: 226 MVQLAMALEASGKNFIWIVRPPIG------FDINSE-----------------------F 256
+ +LA+ L S + F+W++R P G FD +S+ F
Sbjct: 286 LNELALGLADSEQRFLWVIRSPSGIANSSYFDSHSQTDPLTFLPPGFLERTKKRGFVIPF 345
Query: 257 RANDAD--------------GTQSALEALSHGVPINGWPLAAEQFYNSNLLGEEVGVCAE 302
A A G S LE++ G+P+ WPL AEQ N+ LL E++
Sbjct: 346 WAPQAQVLAHPSTGGFLTHCGWNSTLESVVSGIPLIAWPLYAEQKMNAVLLSEDIRAALR 405
Query: 303 VARGMNCAVLKEHIVVKIELVMNETEKGKPMRMKDLEVKEIIDNAFRND 351
G + V +E + ++ +M E E+GK +R K E+KE ++D
Sbjct: 406 PRAGDDGLVRREEVARVVKGLM-EGEEGKGVRNKMKELKEAACRVLKDD 453
>sp|Q9ZU72|U72D1_ARATH UDP-glycosyltransferase 72D1 OS=Arabidopsis thaliana GN=UGT72D1
PE=2 SV=1
Length = 470
Score = 88.2 bits (217), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 60/194 (30%), Positives = 91/194 (46%), Gaps = 42/194 (21%)
Query: 200 NWLDRKPCRSVLYVSFGSQDTIAVSQMVQLAMALEASGKNFIWIVRPPIGF--DINSEFR 257
WLD + RSV++V GS T+ Q V+LA+ LE SG+ F+W++R P + I+S+
Sbjct: 259 EWLDEQRERSVVFVCLGSGGTLTFEQTVELALGLELSGQRFVWVLRRPASYLGAISSDDE 318
Query: 258 ANDAD--------------------------------------GTQSALEALSHGVPING 279
A G SALE+L+ GVPI
Sbjct: 319 QVSASLPEGFLDRTRGVGIVVTQWAPQVEILSHRSIGGFLSHCGWSSALESLTKGVPIIA 378
Query: 280 WPLAAEQFYNSNLLGEEVGVCAEVARGMNCAVL-KEHIVVKIELVM-NETEKGKPMRMKD 337
WPL AEQ+ N+ LL EE+GV + + V+ +E + + +M E E+G+ +R K
Sbjct: 379 WPLYAEQWMNATLLTEEIGVAVRTSELPSERVIGREEVASLVRKIMAEEDEEGQKIRAKA 438
Query: 338 LEVKEIIDNAFRND 351
EV+ + A+ D
Sbjct: 439 EEVRVSSERAWSKD 452
>sp|Q94A84|U72E1_ARATH UDP-glycosyltransferase 72E1 OS=Arabidopsis thaliana GN=UGT72E1
PE=1 SV=1
Length = 487
Score = 85.5 bits (210), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 93/200 (46%), Gaps = 49/200 (24%)
Query: 200 NWLDRKPCRSVLYVSFGSQDTIAVSQMVQLAMALEASGKNFIWIVRPPI----------- 248
+WL+++P SVLY+SFGS +++ Q+ +LA LE S + F+W+VRPP+
Sbjct: 260 DWLNKQPDESVLYISFGSGGSLSAKQLTELAWGLEMSQQRFVWVVRPPVDGSACSAYLSA 319
Query: 249 ------------------------GFDINS-----EFRANDADGT-------QSALEALS 272
GF ++S E A+ A G S LE++
Sbjct: 320 NSGKIRDGTPDYLPEGFVSRTHERGFMVSSWAPQAEILAHQAVGGFLTHCGWNSILESVV 379
Query: 273 HGVPINGWPLAAEQFYNSNLLGEEVGVCAEVARGMNCAVL-KEHIVVKIELVMNETEKGK 331
GVP+ WPL AEQ N+ LL EE+GV + + V+ + I + +M E E+G
Sbjct: 380 GGVPMIAWPLFAEQMMNATLLNEELGVAVRSKKLPSEGVITRAEIEALVRKIMVE-EEGA 438
Query: 332 PMRMKDLEVKEIIDNAFRND 351
MR K ++KE + D
Sbjct: 439 EMRKKIKKLKETAAESLSCD 458
>sp|Q9ZQ98|U73C2_ARATH UDP-glycosyltransferase 73C2 OS=Arabidopsis thaliana GN=UGT73C2
PE=3 SV=1
Length = 496
Score = 83.2 bits (204), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 111/467 (23%), Positives = 185/467 (39%), Gaps = 133/467 (28%)
Query: 1 MLPFMAQGHVIPFLALAHHLESTKNYTIAFVNTHLNIKKIKSSLPQS--SPIHF----LE 54
+ PFMAQGH+IP + +A L + + TI V T N + K L ++ S +H ++
Sbjct: 17 LFPFMAQGHMIPMVDIARIL-AQRGVTITIVTTPHNAARFKDVLNRAIQSGLHIRVEHVK 75
Query: 55 TPFNIIDHDLPPCTENTD-SHPFDVVRKLLEATLSFKPHFKKLIIDLIDEQNGHKPLCII 113
PF + L EN D +++ +A + KL+ E+ KP C+I
Sbjct: 76 FPFQ--EAGLQEGQENVDFLDSMELMVHFFKAVNMLENPVMKLM-----EEMKPKPSCLI 128
Query: 114 TDMFFGWCKEIAQEYGIFHAIFIEGGGFGFACYYSLWVDLPHRN--------TDSDEFLL 165
+D + +IA+ + I +F G +C+ L + + HRN +D + FL+
Sbjct: 129 SDFCLPYTSKIAKRFNIPKIVF-----HGVSCFCLLSMHILHRNHNILHALKSDKEYFLV 183
Query: 166 LDFP---EASTIHPVLR--FTG-----------------------------------SKA 185
FP E + + ++ F+G ++A
Sbjct: 184 PSFPDRVEFTKLQVTVKTNFSGDWKEIMDEQVDADDTSYGVIVNTFQDLESAYVKNYTEA 243
Query: 186 GAGKEHGIS-AELC------------KNWLDRKPC---------RSVLYVSFGSQDTIAV 223
AGK I LC K +D+ C SVLYV GS + +
Sbjct: 244 RAGKVWSIGPVSLCNKVGEDKAERGNKAAIDQDECIKWLDSKDVESVLYVCLGSICNLPL 303
Query: 224 SQMVQLAMALEASGKNFIWIVRP------------PIGFDINSEFRANDADGTQ------ 265
+Q+ +L + LEA+ + FIW++R GF+ ++ R+ G
Sbjct: 304 AQLRELGLGLEATKRPFIWVIRGGGKYHELAEWILESGFEERTKERSLLIKGWSPQMLIL 363
Query: 266 ---------------SALEALSHGVPINGWPLAAEQFYNSNLL------GEEVGVCAEVA 304
S LE ++ GVP+ WPL +QF N L+ G VGV +
Sbjct: 364 SHPAVGGFLTHCGWNSTLEGITSGVPLITWPLFGDQFCNQKLIVQVLKAGVSVGVEEVMK 423
Query: 305 RG----MNCAVLKEHIVVKIELVMNETEKGKPMRMKDLEVKEIIDNA 347
G + V KE + ++ +M E+++ K R + E+ E+ A
Sbjct: 424 WGEEESIGVLVDKEGVKKAVDEIMGESDEAKERRKRVRELGELAHKA 470
>sp|O81498|U72E3_ARATH UDP-glycosyltransferase 72E3 OS=Arabidopsis thaliana GN=UGT72E3
PE=1 SV=1
Length = 481
Score = 83.2 bits (204), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 64/195 (32%), Positives = 90/195 (46%), Gaps = 52/195 (26%)
Query: 200 NWLDRKPCRSVLYVSFGSQDTIAVSQMVQLAMALEASGKNFIWIVRPPI----------- 248
+WL+++P SVLY+SFGS ++ Q+ +LA LE S + FIW+VRPP+
Sbjct: 255 DWLNKQPNESVLYISFGSGGSLTAQQLTELAWGLEESQQRFIWVVRPPVDGSSCSDYFSA 314
Query: 249 ------------------------GFDINS-----EFRANDADGT-------QSALEALS 272
GF I S E A+ A G S LE++
Sbjct: 315 KGGVTKDNTPEYLPEGFVTRTCDRGFMIPSWAPQAEILAHQAVGGFLTHCGWSSTLESVL 374
Query: 273 HGVPINGWPLAAEQFYNSNLLGEEVGVCAEVARGMNCAVLKEHIVVKIELVMNETEKGKP 332
GVP+ WPL AEQ N+ LL +E+G+ V A+ + I + VM E E G+
Sbjct: 375 CGVPMIAWPLFAEQNMNAALLSDELGISVRVDDPKE-AISRSKIEAMVRKVMAEDE-GEE 432
Query: 333 MRMKDLEVKEIIDNA 347
MR K VK++ D A
Sbjct: 433 MRRK---VKKLRDTA 444
>sp|Q40287|UFOG5_MANES Anthocyanidin 3-O-glucosyltransferase 5 OS=Manihot esculenta GN=GT5
PE=2 SV=1
Length = 487
Score = 83.2 bits (204), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 92/198 (46%), Gaps = 53/198 (26%)
Query: 200 NWLDRKPCRSVLYVSFGSQDTIAVSQMVQLAMALEASGKNFIWIVRPPIGFDINSEF--R 257
+WLD++P SV+YVSFGS T+++ QM++LA LE S + FIW+VR P ++ F +
Sbjct: 263 DWLDQQPKESVVYVSFGSGGTLSLEQMIELAWGLERSQQRFIWVVRQPTVKTGDAAFFTQ 322
Query: 258 ANDAD-------------------------------------------GTQSALEALSHG 274
+ AD G S LE+++ G
Sbjct: 323 GDGADDMSGYFPEGFLTRIQNVGLVVPQWSPQIHIMSHPSVGVFLSHCGWNSVLESITAG 382
Query: 275 VPINGWPLAAEQFYNSNLLGEEVGVCAEVARGMNC----AVLKEHIVVKIELVMNETEKG 330
VPI WP+ AEQ N+ LL EE+GV R N V +E I I +M + E+G
Sbjct: 383 VPIIAWPIYAEQRMNATLLTEELGVA---VRPKNLPAKEVVKREEIERMIRRIMVD-EEG 438
Query: 331 KPMRMKDLEVKEIIDNAF 348
+R + E+K+ + A
Sbjct: 439 SEIRKRVRELKDSGEKAL 456
>sp|Q8W4C2|U72B2_ARATH UDP-glycosyltransferase 72B2 OS=Arabidopsis thaliana GN=UGT72B2
PE=2 SV=1
Length = 480
Score = 82.8 bits (203), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 67/228 (29%), Positives = 99/228 (43%), Gaps = 49/228 (21%)
Query: 167 DFPEASTIHPVLRFTGSKAGAGKEHGISAELCKNWLDRKPCRSVLYVSFGSQDTIAVSQM 226
D P I P++ + S + G C +WLD +P SVLY+SFGS T+ Q
Sbjct: 232 DKPTVYPIGPLVNTSSSNVNLEDKFG-----CLSWLDNQPFGSVLYISFGSGGTLTCEQF 286
Query: 227 VQLAMALEASGKNFIWIVRPPIGFDINSEFRANDAD------------------------ 262
+LA+ L SGK FIW++R P +S F +
Sbjct: 287 NELAIGLAESGKRFIWVIRSPSEIVSSSYFNPHSETDPFSFLPIGFLDRTKEKGLVVPSW 346
Query: 263 -------------------GTQSALEALSHGVPINGWPLAAEQFYNSNLLGEEVGVCAEV 303
G S LE++ +GVP+ WPL AEQ N+ LL E+VG +
Sbjct: 347 APQVQILAHPSTCGFLTHCGWNSTLESIVNGVPLIAWPLFAEQKMNTLLLVEDVGAALRI 406
Query: 304 ARGMNCAVLKEHIVVKIELVMNETEKGKPMRMKDLEVKEIIDNAFRND 351
G + V +E +V ++ +M E E+GK + K E+KE + +D
Sbjct: 407 HAGEDGIVRREEVVRVVKALM-EGEEGKAIGNKVKELKEGVVRVLGDD 453
Score = 32.3 bits (72), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 44/171 (25%), Positives = 73/171 (42%), Gaps = 15/171 (8%)
Query: 1 MLPFMAQGHVIPFLALAHHLESTKNYTIAFVNTHLNIKKIKSSLPQSSPIHFLETPFNII 60
++P GH+IPF+ LA L +T+ + + + S Q S ++ L P +I
Sbjct: 11 IMPSPGMGHLIPFVELAKRLVQHDCFTVTMIIS----GETSPSKAQRSVLNSL--PSSIA 64
Query: 61 DHDLPPCT-ENTDSHPFDVVRKLLEATLSFKPHFKKLIIDLIDEQNGHKPLCIITDMFFG 119
LPP + S R +L T S P ++L L +++ P ++ DMF
Sbjct: 65 SVFLPPADLSDVPSTARIETRAMLTMTRS-NPALRELFGSLSTKKS--LPAVLVVDMFGA 121
Query: 120 WCKEIAQEYGIFHAIFIEGGGFGFACYYSLWVDLPHRN-TDSDEFLLLDFP 169
++A ++ + IF A S ++ LP + T S EF L P
Sbjct: 122 DAFDVAVDFHVSPYIFYASN----ANVLSFFLHLPKLDKTVSCEFRYLTEP 168
>sp|Q9LVR1|U72E2_ARATH UDP-glycosyltransferase 72E2 OS=Arabidopsis thaliana GN=UGT72E2
PE=1 SV=1
Length = 481
Score = 82.0 bits (201), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 57/195 (29%), Positives = 88/195 (45%), Gaps = 52/195 (26%)
Query: 200 NWLDRKPCRSVLYVSFGSQDTIAVSQMVQLAMALEASGKNFIWIVRPPIGFDINSEFRAN 259
+WL+ +P SVLY+SFGS ++ Q+ +LA LE S + F+W+VRPP+ SE+ +
Sbjct: 255 DWLNEQPNESVLYISFGSGGCLSAKQLTELAWGLEQSQQRFVWVVRPPVDGSCCSEYVSA 314
Query: 260 DADGTQ-----------------------------------------------SALEALS 272
+ GT+ S LE++
Sbjct: 315 NGGGTEDNTPEYLPEGFVSRTSDRGFVVPSWAPQAEILSHRAVGGFLTHCGWSSTLESVV 374
Query: 273 HGVPINGWPLAAEQFYNSNLLGEEVGVCAEVARGMNCAVLKEHIVVKIELVMNETEKGKP 332
GVP+ WPL AEQ N+ LL +E+G+ + + + I + VM E E G+
Sbjct: 375 GGVPMIAWPLFAEQNMNAALLSDELGIAVRLDDPKE-DISRWKIEALVRKVMTEKE-GEA 432
Query: 333 MRMKDLEVKEIIDNA 347
MR K VK++ D+A
Sbjct: 433 MRRK---VKKLRDSA 444
>sp|Q9ZQG4|U73B5_ARATH UDP-glycosyltransferase 73B5 OS=Arabidopsis thaliana GN=UGT73B5
PE=2 SV=1
Length = 484
Score = 81.6 bits (200), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 55/166 (33%), Positives = 76/166 (45%), Gaps = 38/166 (22%)
Query: 179 RFTGSKAGAGKEHGISAELCKNWLDRKPCRSVLYVSFGSQDTIAVSQMVQLAMALEASGK 238
R G KA GK+ I + C WLD K SV+Y+SFGS Q++++A LE SG+
Sbjct: 259 RELGEKARRGKKANIDEQECLKWLDSKTPGSVVYLSFGSGTNFTNDQLLEIAFGLEGSGQ 318
Query: 239 NFIWIVRP-----------PIGFDINSEFRA--------------NDADGT-------QS 266
+FIW+VR P GF + + + A G S
Sbjct: 319 SFIWVVRKNENQGDNEEWLPEGFKERTTGKGLIIPGWAPQVLILDHKAIGGFVTHCGWNS 378
Query: 267 ALEALSHGVPINGWPLAAEQFYNSNLL------GEEVGVCAEVARG 306
A+E ++ G+P+ WP+ AEQFYN LL G VG V +G
Sbjct: 379 AIEGIAAGLPMVTWPMGAEQFYNEKLLTKVLRIGVNVGATELVKKG 424
Score = 62.4 bits (150), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 52/183 (28%), Positives = 78/183 (42%), Gaps = 23/183 (12%)
Query: 2 LPFMAQGHVIPFLALAHHLESTKNYTIAFVNTHLNIK----KIKSSLPQSSPIHFLETPF 57
PFMAQGH+IP L +A L S + + T +N K I++ Q+ + F
Sbjct: 14 FPFMAQGHMIPILDMAK-LFSRRGAKSTLLTTPINAKIFEKPIEAFKNQNPDLEIGIKIF 72
Query: 58 NI--IDHDLPPCTENTD-------SHPFDVVRKLLEATLSFKPHFKKLIIDLIDEQNGHK 108
N ++ LP EN D S D+ K L +T K + I K
Sbjct: 73 NFPCVELGLPEGCENADFINSYQKSDSGDLFLKFLFSTKYMKQQLESFI-------ETTK 125
Query: 109 PLCIITDMFFGWCKEIAQEYGIFHAIFIEGGGFGFACYYSLWVDLPHRN--TDSDEFLLL 166
P ++ DMFF W E A++ G+ +F F C Y++ + PH+ T S F++
Sbjct: 126 PSALVADMFFPWATESAEKLGVPRLVFHGTSFFSLCCSYNMRIHKPHKKVATSSTPFVIP 185
Query: 167 DFP 169
P
Sbjct: 186 GLP 188
>sp|Q9SK82|U85A1_ARATH UDP-glycosyltransferase 85A1 OS=Arabidopsis thaliana GN=UGT85A1
PE=1 SV=1
Length = 489
Score = 81.3 bits (199), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 63/217 (29%), Positives = 98/217 (45%), Gaps = 42/217 (19%)
Query: 168 FPEASTIHPVLRFTGSKAGAGKEHGI-SAEL------CKNWLDRKPCRSVLYVSFGSQDT 220
P ++ P+ + G E G+ S+ L C +WLD K SV+Y++FGS
Sbjct: 251 LPPVYSVGPLHLLANREIEEGSEIGMMSSNLWKEEMECLDWLDTKTQNSVIYINFGSITV 310
Query: 221 IAVSQMVQLAMALEASGKNFIWIVRP----------PIGFDINSEFRANDAD-------- 262
++V Q+V+ A L SGK F+W++RP P F + ++ R+ A
Sbjct: 311 LSVKQLVEFAWGLAGSGKEFLWVIRPDLVAGEEAMVPPDFLMETKDRSMLASWCPQEKVL 370
Query: 263 ------------GTQSALEALSHGVPINGWPLAAEQFYNSNLLGEEVGVCAEVARGMNCA 310
G S LE+LS GVP+ WP A+Q N +E V E+
Sbjct: 371 SHPAIGGFLTHCGWNSILESLSCGVPMVCWPFFADQQMNCKFCCDEWDVGIEIGGD---- 426
Query: 311 VLKEHIVVKIELVMNETEKGKPMRMKDLEVKEIIDNA 347
V +E + + +M+ EKGK MR K +E + + + A
Sbjct: 427 VKREEVEAVVRELMD-GEKGKKMREKAVEWQRLAEKA 462
Score = 44.7 bits (104), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/152 (24%), Positives = 66/152 (43%), Gaps = 8/152 (5%)
Query: 2 LPFMAQGHVIPFLALAHHLESTKNYTIAFVNTHLNIKKIKSSLPQSSPIHFLETPFNIID 61
+P+ AQGH+ P + +A L + + + + FVNT N + S ++ F I
Sbjct: 17 VPYPAQGHINPMMRVAKLLHA-RGFYVTFVNTVYNHNRFLRSRGSNALDGLPSFRFESIA 75
Query: 62 HDLPPCTENTDSHPFDVVRKLLEATLS--FKPHFKKLIIDLIDEQNGHKPLCIITDMFFG 119
LP TD + L E+T+ P F++L+ + N CI++D
Sbjct: 76 DGLP----ETDMDATQDITALCESTMKNCLAP-FRELLQRINAGDNVPPVSCIVSDGCMS 130
Query: 120 WCKEIAQEYGIFHAIFIEGGGFGFACYYSLWV 151
+ ++A+E G+ +F G F Y ++
Sbjct: 131 FTLDVAEELGVPEVLFWTTSGCAFLAYLHFYL 162
>sp|O23205|U72C1_ARATH UDP-glycosyltransferase 72C1 OS=Arabidopsis thaliana GN=UGT72C1
PE=2 SV=3
Length = 457
Score = 80.9 bits (198), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 64/211 (30%), Positives = 97/211 (45%), Gaps = 56/211 (26%)
Query: 185 AGAGKEHGISAELCKNWLDRKPCRSVLYVSFGSQDTIAVSQMVQLAMALEASGKNFIWIV 244
A G +HG+ +WLD +P SV+YVSFGS + Q +LA LE +G F+W+V
Sbjct: 245 AEPGLKHGVL-----DWLDLQPKESVVYVSFGSGGALTFEQTNELAYGLELTGHRFVWVV 299
Query: 245 RPP---------------------------------IGFDINS-----EFRANDADGT-- 264
RPP IG + + E A+ + G
Sbjct: 300 RPPAEDDPSASMFDKTKNETEPLDFLPNGFLDRTKDIGLVVRTWAPQEEILAHKSTGGFV 359
Query: 265 -----QSALEALSHGVPINGWPLAAEQFYNSNLLGEEVGVCAE--VARGMNCAVLKEHIV 317
S LE++ +GVP+ WPL +EQ N+ ++ E+ + + VA G+ V KE I
Sbjct: 360 THCGWNSVLESIVNGVPMVAWPLYSEQKMNARMVSGELKIALQINVADGI---VKKEVIA 416
Query: 318 VKIELVMNETEKGKPMRMKDLEVKEIIDNAF 348
++ VM+E E+GK MR E+K+ + A
Sbjct: 417 EMVKRVMDE-EEGKEMRKNVKELKKTAEEAL 446
>sp|Q8W491|U73B3_ARATH UDP-glycosyltransferase 73B3 OS=Arabidopsis thaliana GN=UGT73B3
PE=2 SV=1
Length = 481
Score = 80.9 bits (198), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 62/211 (29%), Positives = 89/211 (42%), Gaps = 39/211 (18%)
Query: 184 KAGAGKEHGISAELCKNWLDRKPCRSVLYVSFGSQDTIAVSQMVQLAMALEASGKNFIWI 243
KA GK+ I+ C WLD K SV+Y+SFGS Q+ ++A LE SG NFIW+
Sbjct: 264 KAERGKKASINEVECLKWLDSKKPDSVIYISFGSVACFKNEQLFEIAAGLETSGANFIWV 323
Query: 244 VRPPIGFDINS------EFRANDAD-------------------------GTQSALEALS 272
VR IG + E R G S LE ++
Sbjct: 324 VRKNIGIEKEEWLPEGFEERVKGKGMIIRGWAPQVLILDHQATCGFVTHCGWNSLLEGVA 383
Query: 273 HGVPINGWPLAAEQFYNSNLLGEEVGVCAEVARGMNCA-----VLKEHIVVKI-ELVMNE 326
G+P+ WP+AAEQFYN L+ + + V N + +E +V + E+++ E
Sbjct: 384 AGLPMVTWPVAAEQFYNEKLVTQVLRTGVSVGAKKNVRTTGDFISREKVVKAVREVLVGE 443
Query: 327 TEKGKPMRMKDLEVKEIIDNAFRNDENLRDL 357
+ R K L E+ A + DL
Sbjct: 444 EADERRERAKKL--AEMAKAAVEGGSSFNDL 472
Score = 59.7 bits (143), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 53/184 (28%), Positives = 79/184 (42%), Gaps = 23/184 (12%)
Query: 1 MLPFMAQGHVIPFLALAHHLESTKNYTIAFVNTHLNIKKIKSSLPQ------SSPIHFLE 54
PFMA GH+IP L +A L S++ + T LN K + + + S I
Sbjct: 13 FFPFMAYGHMIPTLDMAK-LFSSRGAKSTILTTPLNSKIFQKPIERFKNLNPSFEIDIQI 71
Query: 55 TPFNIIDHDLPPCTENTD---SHPFD----VVRKLLEATLSFKPHFKKLIIDLIDEQNGH 107
F +D LP EN D S+ D + K ++T FK +KL+
Sbjct: 72 FDFPCVDLGLPEGCENVDFFTSNNNDDRQYLTLKFFKSTRFFKDQLEKLL-------ETT 124
Query: 108 KPLCIITDMFFGWCKEIAQEYGIFHAIFIEGGGFGFACYYSLWVDLPHRNTDS--DEFLL 165
+P C+I DMFF W E A+++ + +F G F Y + V P S + F++
Sbjct: 125 RPDCLIADMFFPWATEAAEKFNVPRLVFHGTGYFSLCSEYCIRVHNPQNIVASRYEPFVI 184
Query: 166 LDFP 169
D P
Sbjct: 185 PDLP 188
>sp|Q9LMF1|U85A3_ARATH UDP-glycosyltransferase 85A3 OS=Arabidopsis thaliana GN=UGT85A3
PE=2 SV=2
Length = 488
Score = 79.7 bits (195), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 61/188 (32%), Positives = 88/188 (46%), Gaps = 47/188 (25%)
Query: 198 CKNWLDRKPCRSVLYVSFGSQDTIAVSQMVQLAMALEASGKNFIWIVRPP--IGFD--IN 253
C WL+ K SV+YV+FGS + +Q+++ A L A+GK F+W++RP G + I
Sbjct: 287 CLGWLNTKSRNSVVYVNFGSITIMTTAQLLEFAWGLAATGKEFLWVMRPDSVAGEEAVIP 346
Query: 254 SEFRANDAD--------------------------GTQSALEALSHGVPINGWPLAAEQF 287
EF A AD G S LE+LS GVP+ WP AEQ
Sbjct: 347 KEFLAETADRRMLTSWCPQEKVLSHPAVGGFLTHCGWNSTLESLSCGVPMVCWPFFAEQQ 406
Query: 288 YNSNL------LGEEVGVCAEVARGMNCAVLKEHIVVKIELVMNETEKGKPMRMKDLEVK 341
N +G E+G +V RG AV++E + + EKGK MR K +E +
Sbjct: 407 TNCKFSCDEWEVGIEIG--GDVKRGEVEAVVRE---------LMDGEKGKKMREKAVEWR 455
Query: 342 EIIDNAFR 349
+ + A +
Sbjct: 456 RLAEKATK 463
Score = 42.7 bits (99), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 36/150 (24%), Positives = 63/150 (42%), Gaps = 4/150 (2%)
Query: 2 LPFMAQGHVIPFLALAHHLESTKNYTIAFVNTHLNIKKIKSSLPQSSPIHFLETPFNIID 61
+P+ AQGH+ P + +A L K + + FVNT N ++ S ++ F I
Sbjct: 17 VPYPAQGHINPMMKVAKLLH-VKGFHVTFVNTVYNHNRLLRSRGANALDGLPSFQFESIP 75
Query: 62 HDLPPCTENTDSHPFDVVRKLLEATLSFKPHFKKLIIDLIDEQNGHKPLCIITDMFFGWC 121
LP E D+ T + FKKL+ ++ ++ CI++D +
Sbjct: 76 DGLP---ETGVDATQDIPALSESTTKNCLVPFKKLLQRIVTREDVPPVSCIVSDGSMSFT 132
Query: 122 KEIAQEYGIFHAIFIEGGGFGFACYYSLWV 151
++A+E G+ F GF Y ++
Sbjct: 133 LDVAEELGVPEIHFWTTSACGFMAYLHFYL 162
>sp|Q8H0F2|ANGT_GENTR Anthocyanin 3'-O-beta-glucosyltransferase OS=Gentiana triflora PE=1
SV=1
Length = 482
Score = 78.6 bits (192), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 57/175 (32%), Positives = 74/175 (42%), Gaps = 38/175 (21%)
Query: 185 AGAGKEHGISAELCKNWLDRKPCRSVLYVSFGSQDTIAVSQMVQLAMALEASGKNFIWIV 244
A GK+ I A C NWLD K SV+YV FGS +Q+ +LAM LE SG+ FIW+V
Sbjct: 256 AERGKKSDIDAHECLNWLDSKNPDSVVYVCFGSMANFNAAQLHELAMGLEESGQEFIWVV 315
Query: 245 RP-----------PIGFDINSEFRANDAD-------------------------GTQSAL 268
R P GF+ + N+ G S L
Sbjct: 316 RTCVDEEDESKWFPDGFE--KRVQENNKGLIIKGWAPQVLILEHEAVGAFVSHCGWNSTL 373
Query: 269 EALSHGVPINGWPLAAEQFYNSNLLGEEVGVCAEVARGMNCAVLKEHIVVKIELV 323
E + GV + WPL AEQFYN L+ + + V V +VVK E +
Sbjct: 374 EGICGGVAMVTWPLFAEQFYNEKLMTDILRTGVSVGSLQWSRVTTSAVVVKRESI 428
Score = 59.3 bits (142), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 45/176 (25%), Positives = 79/176 (44%), Gaps = 15/176 (8%)
Query: 2 LPFMAQGHVIPFLALAHHLESTKNYTIAFVNTHLN----IKKIKSSLPQSSPIHFLETPF 57
PF+A GH++P + +A L S++ + TH N +K I S I L F
Sbjct: 9 FPFLANGHILPTIDMAK-LFSSRGVKATLITTHNNSAIFLKAINRSKILGFDISVLTIKF 67
Query: 58 NIIDHDLPPCTENTD-SHPFDVVRKLLEATLSFKPHFKKLIIDLIDEQNGHKPLCIITDM 116
+ LP E D + D++ + A + + ++L H+P ++ D+
Sbjct: 68 PSAEFGLPEGYETADQARSIDMMDEFFRACILLQEPLEEL-------LKEHRPQALVADL 120
Query: 117 FFGWCKEIAQEYGIFHAIFIEGGGFGFACYYSLWVDLPHRN--TDSDEFLLLDFPE 170
FF W + A ++GI +F F S+ + P++N +DSD F++ D P+
Sbjct: 121 FFYWANDAAAKFGIPRLLFHGSSSFAMIAAESVRRNKPYKNLSSDSDPFVVPDIPD 176
>sp|Q9SKC5|U74D1_ARATH UDP-glycosyltransferase 74D1 OS=Arabidopsis thaliana GN=UGT74D1
PE=1 SV=1
Length = 456
Score = 78.6 bits (192), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 60/206 (29%), Positives = 90/206 (43%), Gaps = 32/206 (15%)
Query: 180 FTGSKAGAGKEHGIS-----AELCKNWLDRKPCRSVLYVSFGSQDTIAVSQMVQLAMALE 234
+ + K++GI+ C +WLD KP SV+YVSFGS + QM+++A L+
Sbjct: 237 YLDKRLAGDKDYGINLFNAQVNECLDWLDSKPPGSVIYVSFGSLAVLKDDQMIEVAAGLK 296
Query: 235 ASGKNFIWIVRPPIGFDINSEFRANDAD--------------------------GTQSAL 268
+G NF+W+VR + S + + D G S L
Sbjct: 297 QTGHNFLWVVRETETKKLPSNYIEDICDKGLIVNWSPQLQVLAHKSIGCFMTHCGWNSTL 356
Query: 269 EALSHGVPINGWPLAAEQFYNSNLLGEEVGVCAEVARGMNCAVLKEHIVVKIELVMNE-T 327
EALS GV + G P ++Q N+ + + V V N V KE IV + VM + +
Sbjct: 357 EALSLGVALIGMPAYSDQPTNAKFIEDVWKVGVRVKADQNGFVPKEEIVRCVGEVMEDMS 416
Query: 328 EKGKPMRMKDLEVKEIIDNAFRNDEN 353
EKGK +R + E A + N
Sbjct: 417 EKGKEIRKNARRLMEFAREALSDGGN 442
>sp|Q9ZWJ3|U85A2_ARATH UDP-glycosyltransferase 85A2 OS=Arabidopsis thaliana GN=UGT85A2
PE=2 SV=1
Length = 481
Score = 77.8 bits (190), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 56/180 (31%), Positives = 82/180 (45%), Gaps = 35/180 (19%)
Query: 198 CKNWLDRKPCRSVLYVSFGSQDTIAVSQMVQLAMALEASGKNFIWIVRPPIGFD----IN 253
C +WL+ K SV+YV+FGS ++ Q+V+ A L A+GK F+W++RP + +
Sbjct: 284 CLDWLNTKARNSVVYVNFGSITVLSAKQLVEFAWGLAATGKEFLWVIRPDLVAGDEAMVP 343
Query: 254 SEFRANDAD--------------------------GTQSALEALSHGVPINGWPLAAEQF 287
EF AD G S LE+L GVP+ WP AEQ
Sbjct: 344 PEFLTATADRRMLASWCPQEKVLSHPAIGGFLTHCGWNSTLESLCGGVPMVCWPFFAEQQ 403
Query: 288 YNSNLLGEEVGVCAEVARGMNCAVLKEHIVVKIELVMNETEKGKPMRMKDLEVKEIIDNA 347
N +E V E+ V +E + + +M+E EKGK MR K E + + + A
Sbjct: 404 TNCKFSRDEWEVGIEIGGD----VKREEVEAVVRELMDE-EKGKNMREKAEEWRRLANEA 458
Score = 43.9 bits (102), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/146 (26%), Positives = 65/146 (44%), Gaps = 6/146 (4%)
Query: 2 LPFMAQGHVIPFLALAHHLESTKNYTIAFVNTHLNIKKIKSSLPQSSPIHFLETPFNIID 61
+P+ AQGH+ P + +A L + K + I FVNT N ++ S ++ F I
Sbjct: 14 VPYPAQGHINPMMKVAKLLYA-KGFHITFVNTVYNHNRLLRSRGPNAVDGLPSFRFESIP 72
Query: 62 HDLPPCTENTDSHPFDVVRKLLEATLSFKPHFKKLIIDLIDEQNGHKPL-CIITDMFFGW 120
LP TD + L E+T+ K ++ I+ ++ P+ CI++D +
Sbjct: 73 DGLP----ETDVDVTQDIPTLCESTMKHCLAPFKELLRQINARDDVPPVSCIVSDGCMSF 128
Query: 121 CKEIAQEYGIFHAIFIEGGGFGFACY 146
+ A+E G+ +F GF Y
Sbjct: 129 TLDAAEELGVPEVLFWTTSACGFLAY 154
>sp|Q9ZQ95|U73C6_ARATH UDP-glycosyltransferase 73C6 OS=Arabidopsis thaliana GN=UGT73C6
PE=2 SV=1
Length = 495
Score = 77.8 bits (190), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 110/464 (23%), Positives = 182/464 (39%), Gaps = 137/464 (29%)
Query: 1 MLPFMAQGHVIPFLALAHHLESTKNYTIAFVNTHLNIKKIKSSLPQSS----PIHFLETP 56
+ PFMAQGH+IP + +A L + + I V T N + K+ L ++ PI+ ++
Sbjct: 16 LFPFMAQGHMIPMVDIARLL-AQRGVLITIVTTPHNAARFKNVLNRAIESGLPINLVQVK 74
Query: 57 FNIIDHDLPPCTENTDSHPFDVVRKLLEATLSFKP--HFKKLIIDLIDEQNGHKPLCIIT 114
F + L EN D++ + + T FK K+ + +LI+E + +P C+I+
Sbjct: 75 FPYQEAGLQEGQEN-----MDLLTTMEQITSFFKAVNLLKEPVQNLIEEMS-PRPSCLIS 128
Query: 115 DMFFGWCKEIAQEYGIFHAIFIEGGGFGFACYYSLWVDLPHRN--------TDSDEFLLL 166
DM + EIA+++ I +F G C+ L V++ +N +D + F++
Sbjct: 129 DMCLSYTSEIAKKFKIPKILF-----HGMGCFCLLCVNVLRKNREILDNLKSDKEYFIVP 183
Query: 167 DFPEA----------------------------------------STIHPVLRFTGSKAG 186
FP+ + P +A
Sbjct: 184 YFPDRVEFTRPQVPVETYVPAGWKEILEDMVEADKTSYGVIVNSFQELEPAYAKDFKEAR 243
Query: 187 AGKEHGIS-AELC------------KNWLDRKPCRSVL---------YVSFGSQDTIAVS 224
+GK I LC K+ +D+ C L YV GS + +S
Sbjct: 244 SGKAWTIGPVSLCNKVGVDKAERGNKSDIDQDECLEWLDSKEPGSVLYVCLGSICNLPLS 303
Query: 225 QMVQLAMALEASGKNFIWIVRPPIGFDINSEF-----------RANDAD----------- 262
Q+++L + LE S + FIW++R G++ E R D
Sbjct: 304 QLLELGLGLEESQRPFIWVIR---GWEKYKELVEWFSESGFEDRIQDRGLLIKGWSPQML 360
Query: 263 --------------GTQSALEALSHGVPINGWPLAAEQFYNSNLLGE--EVGVCAEVARG 306
G S LE ++ G+P+ WPL A+QF N L+ + +VGV AEV
Sbjct: 361 ILSHPSVGGFLTHCGWNSTLEGITAGLPMLTWPLFADQFCNEKLVVQILKVGVSAEVKEV 420
Query: 307 MNCA--------VLKEHIVVKIELVMNETEKGKPMRMKDLEVKE 342
M V KE + +E +M E++ K R + E+ E
Sbjct: 421 MKWGEEEKIGVLVDKEGVKKAVEELMGESDDAKERRRRAKELGE 464
>sp|Q9AR73|HQGT_RAUSE Hydroquinone glucosyltransferase OS=Rauvolfia serpentina GN=AS PE=1
SV=1
Length = 470
Score = 76.6 bits (187), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 56/198 (28%), Positives = 87/198 (43%), Gaps = 45/198 (22%)
Query: 198 CKNWLDRKPCRSVLYVSFGSQDTIAVSQMVQLAMALEASGKNFIWIVRPPIGFDINSEF- 256
C WLD +P SVL++SFGS ++ +Q ++LA+ LE S + F+W+VR P N+ +
Sbjct: 253 CLKWLDDQPRGSVLFISFGSGGAVSHNQFIELALGLEMSEQRFLWVVRSPNDKIANATYF 312
Query: 257 ---RANDA----------------------------------------DGTQSALEALSH 273
NDA G S LE++ +
Sbjct: 313 SIQNQNDALAYLPEGFLERTKGRCLLVPSWAPQTEILSHGSTGGFLTHCGWNSILESVVN 372
Query: 274 GVPINGWPLAAEQFYNSNLLGEEVGVCAEVARGMNCAVLKEHIVVKIELVMNETEKGKPM 333
GVP+ WPL AEQ N+ +L E + V G N + + I ++ +M E E+GK
Sbjct: 373 GVPLIAWPLYAEQKMNAVMLTEGLKVALRPKAGENGLIGRVEIANAVKGLM-EGEEGKKF 431
Query: 334 RMKDLEVKEIIDNAFRND 351
R ++K+ A +D
Sbjct: 432 RSTMKDLKDAASRALSDD 449
>sp|Q33DV3|4CGT_ANTMA Chalcone 4'-O-glucosyltransferase OS=Antirrhinum majus PE=1 SV=1
Length = 457
Score = 76.6 bits (187), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 54/177 (30%), Positives = 84/177 (47%), Gaps = 37/177 (20%)
Query: 198 CKNWLDRKPCRSVLYVSFGSQDTIAVSQMVQLAMALEASGKNFIWIVRPPIGFDINS--- 254
C +WLD +P +SV+++ FG + + Q+ ++A+ LE SG F+W+ R D+N+
Sbjct: 259 CLSWLDLQPSKSVIFLCFGRRGAFSAQQLKEIAIGLEKSGCRFLWLARISPEMDLNALLP 318
Query: 255 ---------------------EFRANDADGT-------QSALEALSHGVPINGWPLAAEQ 286
E ++DA G S LEALS GVP+ GWPL AEQ
Sbjct: 319 EGFLSRTKGVGFVTNTWVPQKEVLSHDAVGGFVTHCGWSSVLEALSFGVPMIGWPLYAEQ 378
Query: 287 FYNSNLLGEEVGVCAEVAR--GMNCAVLKEHIVVKIELVMNETEKGKPMRMKDLEVK 341
N + EE+ V + G A+ E V + + E+ KGK ++ + E+K
Sbjct: 379 RINRVFMVEEIKVALPLDEEDGFVTAMELEKRVRE----LMESVKGKEVKRRVAELK 431
>sp|Q94C57|U73B2_ARATH UDP-glucosyl transferase 73B2 OS=Arabidopsis thaliana GN=UGT73B2
PE=1 SV=1
Length = 483
Score = 76.3 bits (186), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 52/156 (33%), Positives = 72/156 (46%), Gaps = 32/156 (20%)
Query: 184 KAGAGKEHGISAELCKNWLDRKPCRSVLYVSFGSQDTIAVSQMVQLAMALEASGKNFIWI 243
KA GK+ I C WLD K SV+YVSFGS Q+ ++A LEASG +FIW+
Sbjct: 265 KAERGKKANIDEAECLKWLDSKKPNSVIYVSFGSVAFFKNEQLFEIAAGLEASGTSFIWV 324
Query: 244 VRP---------PIGFDINSEFRA--------------NDADGT-------QSALEALSH 273
VR P GF+ + + + A G S LE ++
Sbjct: 325 VRKTKDDREEWLPEGFEERVKGKGMIIRGWAPQVLILDHQATGGFVTHCGWNSLLEGVAA 384
Query: 274 GVPINGWPLAAEQFYNSNLLGE--EVGVCAEVARGM 307
G+P+ WP+ AEQFYN L+ + GV ++ M
Sbjct: 385 GLPMVTWPVGAEQFYNEKLVTQVLRTGVSVGASKHM 420
Score = 57.8 bits (138), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 49/183 (26%), Positives = 80/183 (43%), Gaps = 23/183 (12%)
Query: 2 LPFMAQGHVIPFLALAHHLESTKNYTIAFVNTHLNIKKIKSSLPQSSPIH-FLETPFNI- 59
PFMA GH+IP L +A L S++ + T LN K ++ + ++ LE I
Sbjct: 15 FPFMAYGHMIPTLDMAK-LFSSRGAKSTILTTSLNSKILQKPIDTFKNLNPGLEIDIQIF 73
Query: 60 ----IDHDLPPCTENTD-------SHPFDVVRKLLEATLSFKPHFKKLIIDLIDEQNGHK 108
++ LP EN D +++ K +T FK +KL+ +
Sbjct: 74 NFPCVELGLPEGCENVDFFTSNNNDDKNEMIVKFFFSTRFFKDQLEKLL-------GTTR 126
Query: 109 PLCIITDMFFGWCKEIAQEYGIFHAIFIEGGGFGFACYYSLWVDLPHRN--TDSDEFLLL 166
P C+I DMFF W E A ++ + +F G F Y + V P + + S+ F++
Sbjct: 127 PDCLIADMFFPWATEAAGKFNVPRLVFHGTGYFSLCAGYCIGVHKPQKRVASSSEPFVIP 186
Query: 167 DFP 169
+ P
Sbjct: 187 ELP 189
>sp|Q9ZVX4|U90A1_ARATH UDP-glycosyltransferase 90A1 OS=Arabidopsis thaliana GN=UGT90A1
PE=2 SV=1
Length = 478
Score = 75.9 bits (185), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 72/241 (29%), Positives = 104/241 (43%), Gaps = 44/241 (18%)
Query: 140 GFGFACYYSLWVDLPHRNTDSDEFLLLDFPEASTIHPVLRFTGSKAGAGKEHGISAELCK 199
GF +Y L N +S D P++ + P+ K G+ K I
Sbjct: 221 GFLVNSFYELESAFVDYNNNSG-----DKPKSWCVGPLCLTDPPKQGSAKPAWI------ 269
Query: 200 NWLD--RKPCRSVLYVSFGSQDTIAVSQMVQLAMALEASGKNFIWIVRPP----IGFDIN 253
+WLD R+ R VLYV+FG+Q I+ Q+++LA LE S NF+W+ R IG N
Sbjct: 270 HWLDQKREEGRPVLYVAFGTQAEISNKQLMELAFGLEDSKVNFLWVTRKDVEEIIGEGFN 329
Query: 254 SEFRANDAD-----------------------GTQSALEALSHGVPINGWPLAAEQFYNS 290
R + G SA E++ GVP+ WP+ AEQ N+
Sbjct: 330 DRIRESGMIVRDWVDQWEILSHESVKGFLSHCGWNSAQESICVGVPLLAWPMMAEQPLNA 389
Query: 291 NLLGEE--VGVCAEVARG-MNCAVLKEHIVVKIELVMNETEKGKPMRMKDLEVKEIIDNA 347
++ EE VGV E G + V +E + KI+ +M E E GK R E ++ A
Sbjct: 390 KMVVEEIKVGVRVETEDGSVKGFVTREELSGKIKELM-EGETGKTARKNVKEYSKMAKAA 448
Query: 348 F 348
Sbjct: 449 L 449
Score = 50.4 bits (119), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/180 (25%), Positives = 72/180 (40%), Gaps = 19/180 (10%)
Query: 1 MLPFMAQGHVIPFLA----LAHHLESTKNYTIAFVNTHLNIKKIKSSLPQSSPIHFLETP 56
+ PFM++GH+IP L L H T+ T N I L + I + P
Sbjct: 12 LFPFMSKGHIIPLLQFGRLLLRHHRKEPTITVTVFTTPKNQPFISDFLSDTPEIKVISLP 71
Query: 57 FNIIDHDLPPCTENTDSHP-FDVVRKLLEATLSFKPHFKKLIIDLIDEQNGHKPLCIITD 115
F +PP ENT+ P + AT +P F++ + L K +++D
Sbjct: 72 FPENITGIPPGVENTEKLPSMSLFVPFTRATKLLQPFFEETLKTL------PKVSFMVSD 125
Query: 116 MFFGWCKEIAQEYGIFHAIFIEGGGFGFACYYSLWV------DLPHRNTDSDEFLLLDFP 169
F W E A ++ I F+ G ++ S+ V P +D++ + DFP
Sbjct: 126 GFLWWTSESAAKFNI--PRFVSYGMNSYSAAVSISVFKHELFTEPESKSDTEPVTVPDFP 183
>sp|Q9LME8|U85A7_ARATH UDP-glycosyltransferase 85A7 OS=Arabidopsis thaliana GN=UGT85A7
PE=2 SV=1
Length = 487
Score = 75.1 bits (183), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 54/183 (29%), Positives = 82/183 (44%), Gaps = 36/183 (19%)
Query: 198 CKNWLDRKPCRSVLYVSFGSQDTIAVSQMVQLAMALEASGKNFIWIVRPPIGFD-----I 252
C +WLD K SVL+V+FG ++ Q+ + A L AS K F+W++RP + +
Sbjct: 287 CLDWLDTKTPNSVLFVNFGCITVMSAKQLEEFAWGLAASRKEFLWVIRPNLVVGEAMVVL 346
Query: 253 NSEFRANDAD--------------------------GTQSALEALSHGVPINGWPLAAEQ 286
EF A D G S LE+L+ GVP+ WP +EQ
Sbjct: 347 PQEFLAETIDRRMLASWCPQEKVLSHPAIGGFLTHCGWNSTLESLAGGVPMICWPCFSEQ 406
Query: 287 FYNSNLLGEEVGVCAEVARGMNCAVLKEHIVVKIELVMNETEKGKPMRMKDLEVKEIIDN 346
N +E GV E+ + V +E + + +M+ EKGK +R K E + + +
Sbjct: 407 PTNCKFCCDEWGVGIEIGKD----VKREEVETVVRELMD-GEKGKKLREKAEEWRRLAEE 461
Query: 347 AFR 349
A R
Sbjct: 462 ATR 464
Score = 47.4 bits (111), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/142 (26%), Positives = 64/142 (45%), Gaps = 4/142 (2%)
Query: 2 LPFMAQGHVIPFLALAHHLESTKNYTIAFVNTHLNIKKIKSSLPQSSPIHFLETPFNIID 61
+P+ AQGH+ P L +A L + K + + FVNT N ++ S ++ F F I
Sbjct: 17 VPYPAQGHINPMLKVAKLLYA-KGFHVTFVNTLYNHNRLLRSRGPNALDGFPSFRFESIP 75
Query: 62 HDLPPCTENTDSHPFDVVRKLLEATLSFKPHFKKLIIDLIDEQNGHKPLCIITDMFFGWC 121
LP + H V + + L+ FK+++ + D+ + CI++D +
Sbjct: 76 DGLPETDGDRTQHTPTVCMSIEKNCLA---PFKEILRRINDKDDVPPVSCIVSDGVMSFT 132
Query: 122 KEIAQEYGIFHAIFIEGGGFGF 143
+ A+E G+ IF GF
Sbjct: 133 LDAAEELGVPEVIFWTNSACGF 154
>sp|Q94AB5|U7E12_ARATH UDP-glycosyltransferase 76E12 OS=Arabidopsis thaliana GN=UGT76E12
PE=2 SV=1
Length = 458
Score = 74.7 bits (182), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 107/448 (23%), Positives = 170/448 (37%), Gaps = 122/448 (27%)
Query: 1 MLPFMAQGHVIPFLALAHHLESTKNYTIAFVNTHLNIKKIKSSLPQSSPIHFLETPFNII 60
++PF AQGH+ P + LA L K ++I V T N S + F+ P ++
Sbjct: 17 LVPFPAQGHISPMMQLAKTLH-LKGFSITVVQTKFNY--FSPSDDFTHDFQFVTIPESLP 73
Query: 61 DHDLPPCTENTDSHPFDVVRKL-LEATLSFKPHFKKLIIDLIDEQNGHKPLCIITDMFFG 119
+ D + P + KL E +SFK +L++ Q ++ C+I D F
Sbjct: 74 ESDF------KNLGPIQFLFKLNKECKVSFKDCLGQLVL-----QQSNEISCVIYDEFMY 122
Query: 120 WCKEIAQEYGIFHAIFIEGGGFGFAC------YYSLWVDLPHRNTDSDEFLLL------- 166
+ + A+E + + IF FAC Y+ V P + T + L+
Sbjct: 123 FAEAAAKECKLPNIIFSTTSATAFACRSVFDKLYANNVQAPLKETKGQQEELVPEFYPLR 182
Query: 167 --DFPEA-----STIHPVLRFTGSKAGA-------------------------------G 188
DFP + +I V R T K A G
Sbjct: 183 YKDFPVSRFASLESIMEVYRNTVDKRTASSVIINTASCLESSSLSFLQQQQLQIPVYPIG 242
Query: 189 KEHGISA---------ELCKNWLDRKPCRSVLYVSFGSQDTIAVSQMVQLAMALEASGKN 239
H +++ + C WL+++ SV+Y+S GS + +++++++A L AS ++
Sbjct: 243 PLHMVASAPTSLLEENKSCIEWLNKQKVNSVIYISMGSIALMEINEIMEVASGLAASNQH 302
Query: 240 FIWIVRP---PIGFDINS---EFRANDAD--------------------------GTQSA 267
F+W++RP P I S EF D G S
Sbjct: 303 FLWVIRPGSIPGSEWIESMPEEFSKMVLDRGYIVKWAPQKEVLSHPAVGGFWSHCGWNST 362
Query: 268 LEALSHGVPINGWPLAAEQFYNSNLLGEEVGVCAEVARGMNCAVLKEHIVVKIELVMNET 327
LE++ GVP+ P + +Q N AR + C V K I V+ EL
Sbjct: 363 LESIGQGVPMICRPFSGDQKVN--------------ARYLEC-VWKIGIQVEGELDRGVV 407
Query: 328 EKGKPMRMKDLEVKEIIDNAFRNDENLR 355
E+ M D E +E+ AF E LR
Sbjct: 408 ERAVKRLMVDEEGEEMRKRAFSLKEQLR 435
>sp|Q9LZD8|U89A2_ARATH UDP-glycosyltransferase 89A2 OS=Arabidopsis thaliana GN=UGT89A2
PE=2 SV=1
Length = 465
Score = 73.6 bits (179), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/166 (30%), Positives = 70/166 (42%), Gaps = 24/166 (14%)
Query: 200 NWLDRKPCRSVLYVSFGSQDTIAVSQMVQLAMALEASGKNFIWIVRP---PIGFDINSEF 256
+WLD P SVLYV FGSQ + Q LA+ LE S F+W+V+ P GF+
Sbjct: 274 SWLDGSPNGSVLYVCFGSQKALTKDQCDALALGLEKSMTRFVWVVKKDPIPDGFEDRVSG 333
Query: 257 RANDAD---------------------GTQSALEALSHGVPINGWPLAAEQFYNSNLLGE 295
R G S LE ++ G I GWP+ A+QF N+ LL E
Sbjct: 334 RGLVVRGWVSQLAVLRHVAVGGFLSHCGWNSVLEGITSGAVILGWPMEADQFVNARLLVE 393
Query: 296 EVGVCAEVARGMNCAVLKEHIVVKIELVMNETEKGKPMRMKDLEVK 341
+GV V G + + I M E + R +++ K
Sbjct: 394 HLGVAVRVCEGGETVPDSDELGRVIAETMGEGGREVAARAEEIRRK 439
Score = 45.8 bits (107), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 44/176 (25%), Positives = 73/176 (41%), Gaps = 17/176 (9%)
Query: 1 MLPFMAQGHVIPFLALAHHLESTKNYTIAFVNTHLNIKKIKSSLPQSSPIHFLETPFNII 60
+ PF AQGH++P L L H L + + ++ + T N+ + S L + P F
Sbjct: 22 VFPFPAQGHLLPLLDLTHQL-CLRGFNVSVIVTPGNLTYL-SPLLSAHPSSVTSVVFPFP 79
Query: 61 DH-DLPPCTENTDSHPFDVVRKLLEATLSFKPHFKKLIIDLIDEQNGHKPLCIITDMFFG 119
H L P EN DV ++ ++ II+ + + + P+ +I+D F G
Sbjct: 80 PHPSLSPGVENVK----DVGNSGNLPIMASLRQLREPIINWF-QSHPNPPIALISDFFLG 134
Query: 120 WCKEIAQEYGIFHAIFIEGGGFGFACYYSLWVDLPHRNTD----SDEFLLLDFPEA 171
W ++ + GI F F + + + N D +D LLD P A
Sbjct: 135 WTHDLCNQIGIPRFAF-----FSISFFLVSVLQFCFENIDLIKSTDPIHLLDLPRA 185
>sp|Q9ZQ94|U73C5_ARATH UDP-glycosyltransferase 73C5 OS=Arabidopsis thaliana GN=UGT73C5
PE=2 SV=1
Length = 495
Score = 73.2 bits (178), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 66/229 (28%), Positives = 98/229 (42%), Gaps = 55/229 (24%)
Query: 170 EASTIHPVL---RFTGSKAGAGKEHGISAELCKNWLDRKPCRSVLYVSFGSQDTIAVSQM 226
+A TI PV + KA G + I + C WLD K SVLYV GS + +SQ+
Sbjct: 246 KAWTIGPVSLCNKVGADKAERGNKSDIDQDECLKWLDSKKHGSVLYVCLGSICNLPLSQL 305
Query: 227 VQLAMALEASGKNFIWIVRPPIGFDINSEF-----------RANDAD------------- 262
+L + LE S + FIW++R G++ E R D
Sbjct: 306 KELGLGLEESQRPFIWVIR---GWEKYKELVEWFSESGFEDRIQDRGLLIKGWSPQMLIL 362
Query: 263 ------------GTQSALEALSHGVPINGWPLAAEQFYNSNLLGE--EVGVCAEVARGMN 308
G S LE ++ G+P+ WPL A+QF N L+ E + GV + V + M
Sbjct: 363 SHPSVGGFLTHCGWNSTLEGITAGLPLLTWPLFADQFCNEKLVVEVLKAGVRSGVEQPMK 422
Query: 309 CA--------VLKEHIVVKIELVMNETEKGKPMRMKDLEVKEIIDNAFR 349
V KE + +E +M E++ K R + KE+ D+A +
Sbjct: 423 WGEEEKIGVLVDKEGVKKAVEELMGESDDAKERRRR---AKELGDSAHK 468
Score = 64.3 bits (155), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 51/183 (27%), Positives = 89/183 (48%), Gaps = 24/183 (13%)
Query: 1 MLPFMAQGHVIPFLALAHHLESTKNYTIAFVNTHLNIKKIKSSLPQSS----PIHFLETP 56
+ PFMAQGH+IP + +A L + + I V T N + K+ L ++ PI+ ++
Sbjct: 15 LFPFMAQGHMIPMVDIARLL-AQRGVIITIVTTPHNAARFKNVLNRAIESGLPINLVQVK 73
Query: 57 FNIIDHDLPPCTENTDSHPFDVVRKLLEATLSFKPHFKKL------IIDLIDEQNGHKPL 110
F ++ L EN DS D + +++ P FK + + LI+E N +P
Sbjct: 74 FPYLEAGLQEGQENIDS--LDTMERMI-------PFFKAVNFLEEPVQKLIEEMN-PRPS 123
Query: 111 CIITDMFFGWCKEIAQEYGIFHAIFIEGGGFGFACYYSLWVD---LPHRNTDSDEFLLLD 167
C+I+D + +IA+++ I +F G F C + L + L + +D + F + D
Sbjct: 124 CLISDFCLPYTSKIAKKFNIPKILFHGMGCFCLLCMHVLRKNREILDNLKSDKELFTVPD 183
Query: 168 FPE 170
FP+
Sbjct: 184 FPD 186
>sp|O22822|U74F2_ARATH UDP-glycosyltransferase 74F2 OS=Arabidopsis thaliana GN=UGT74F2
PE=1 SV=1
Length = 449
Score = 73.2 bits (178), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 102/444 (22%), Positives = 173/444 (38%), Gaps = 127/444 (28%)
Query: 2 LPFMAQGHVIPFLALAHHLESTKNYTIAFVNTHLNIKKIKSSLPQSSPIHFLETPFNIID 61
+P+ QGH+ PF L + K +K++L ++ + FN I+
Sbjct: 11 VPYPTQGHITPFRQFCKRL---------------HFKGLKTTLALTTFV------FNSIN 49
Query: 62 HDLP------PCTENTDSHPFDVVRKLLEATLSFKPHFKKLIIDLIDE-QNGHKPL-CII 113
DL ++ D F+ + + FK K I D+I + Q P+ CI+
Sbjct: 50 PDLSGPISIATISDGYDHGGFETADSIDDYLKDFKTSGSKTIADIIQKHQTSDNPITCIV 109
Query: 114 TDMFFGWCKEIAQEYG---------------IFHAIFIEGGG------------------ 140
D F W ++A+E+G +++ +I G
Sbjct: 110 YDAFLPWALDVAREFGLVATPFFTQPCAVNYVYYLSYINNGSLQLPIEELPFLELQDLPS 169
Query: 141 -FGFACYYSLWVDLPHR---NTDSDEFLLLDFPEASTIH---------PVLRFTGSKAGA 187
F + Y + ++ + N + +F+L++ + +H PVL +
Sbjct: 170 FFSVSGSYPAYFEMVLQQFINFEKADFVLVNSFQELELHENELWSKACPVLTIGPTIPSI 229
Query: 188 GKEHGISAE--------------LCKNWLDRKPCRSVLYVSFGSQDTIAVSQMVQLAMAL 233
+ I ++ C NWLD +P SV+YV+FGS + QM +LA A+
Sbjct: 230 YLDQRIKSDTGYDLNLFESKDDSFCINWLDTRPQGSVVYVAFGSMAQLTNVQMEELASAV 289
Query: 234 EASGKNFIWIVRP------PIGF--DINSE------------FRANDADGT-------QS 266
S +F+W+VR P GF +N E +N A G S
Sbjct: 290 --SNFSFLWVVRSSEEEKLPSGFLETVNKEKSLVLKWSPQLQVLSNKAIGCFLTHCGWNS 347
Query: 267 ALEALSHGVPINGWPLAAEQFYNSNLLGE--EVGVCAEVARGMNCAVLKEHIVVKIELVM 324
+EAL+ GVP+ P +Q N+ + + + GV + + A +E I I+ VM
Sbjct: 348 TMEALTFGVPMVAMPQWTDQPMNAKYIQDVWKAGVRVKTEKESGIAK-REEIEFSIKEVM 406
Query: 325 NETEKGKPM-----RMKDLEVKEI 343
E E+ K M + +DL VK +
Sbjct: 407 -EGERSKEMKKNVKKWRDLAVKSL 429
>sp|O22820|U74F1_ARATH UDP-glycosyltransferase 74F1 OS=Arabidopsis thaliana GN=UGT74F1
PE=1 SV=1
Length = 449
Score = 72.8 bits (177), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 105/439 (23%), Positives = 168/439 (38%), Gaps = 117/439 (26%)
Query: 2 LPFMAQGHVIPFLALAHHLESTKNYTIAFVNTHLNIKKIKSS--LPQSSPIHFLETPFNI 59
+PF +QGH+ P L S F TH I ++ L SSPI
Sbjct: 11 VPFPSQGHITPIRQFCKRLHSK-----GFKTTHTLTTFIFNTIHLDPSSPI--------- 56
Query: 60 IDHDLPPCTENTDSHPFDVVRKLLEATLSFKPHFKKLIIDLIDE-QNGHKPL-CIITDMF 117
+ ++ D F + E +FK K + D+I + Q+ P+ CI+ D F
Sbjct: 57 ---SIATISDGYDQGGFSSAGSVPEYLQNFKTFGSKTVADIIRKHQSTDNPITCIVYDSF 113
Query: 118 FGWCKEIAQEYGIFHA-IFIEGGGFGFACYYSL-----------------WVDLP----- 154
W ++A ++G+ A F + + Y S DLP
Sbjct: 114 MPWALDLAMDFGLAAAPFFTQSCAVNYINYLSYINNGSLTLPIKDLPLLELQDLPTFVTP 173
Query: 155 --------------HRNTDSDEFLL------LDFPEA---STIHPVLR--------FTGS 183
N D +F+L LD E S + PVL +
Sbjct: 174 TGSHLAYFEMVLQQFTNFDKADFVLVNSFHDLDLHEEELLSKVCPVLTIGPTVPSMYLDQ 233
Query: 184 KAGAGKEHGIS------AELCKNWLDRKPCRSVLYVSFGSQDTIAVSQMVQLAMALEASG 237
+ + ++ ++ A LC +WLD++P SV+Y++FGS ++ QM ++A A+ S
Sbjct: 234 QIKSDNDYDLNLFDLKEAALCTDWLDKRPEGSVVYIAFGSMAKLSSEQMEEIASAI--SN 291
Query: 238 KNFIWIVR-------PP-------------IGFDINSEFRANDADGT-------QSALEA 270
+++W+VR PP + + + +N A G S +E
Sbjct: 292 FSYLWVVRASEESKLPPGFLETVDKDKSLVLKWSPQLQVLSNKAIGCFMTHCGWNSTMEG 351
Query: 271 LSHGVPINGWPLAAEQFYNSNLLGEEVGVCAEVARGMNCAVLK-EHIVVKIELVMNETEK 329
LS GVP+ P +Q N+ + + V V + K E I I+ VM E EK
Sbjct: 352 LSLGVPMVAMPQWTDQPMNAKYIQDVWKVGVRVKAEKESGICKREEIEFSIKEVM-EGEK 410
Query: 330 GKPM-----RMKDLEVKEI 343
K M + +DL VK +
Sbjct: 411 SKEMKENAGKWRDLAVKSL 429
>sp|Q9LK73|U88A1_ARATH UDP-glycosyltransferase 88A1 OS=Arabidopsis thaliana GN=UGT88A1
PE=2 SV=1
Length = 462
Score = 72.4 bits (176), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 47/180 (26%), Positives = 81/180 (45%), Gaps = 40/180 (22%)
Query: 198 CKNWLDRKPCRSVLYVSFGSQDTIAVSQMVQLAMALEASGKNFIWIVRPP-----IGFDI 252
C NWLD +P +SV+++ FGS + Q++++A+ LE SG+ F+W+VR P D+
Sbjct: 260 CLNWLDSQPEKSVVFLCFGSLGLFSKEQVIEIAVGLEKSGQRFLWVVRNPPELEKTELDL 319
Query: 253 NSEF------RANDAD-------------------------GTQSALEALSHGVPINGWP 281
S R D G S LEA+ GVP+ WP
Sbjct: 320 KSLLPEGFLSRTEDKGMVVKSWAPQVPVLNHKAVGGFVTHCGWNSILEAVCAGVPMVAWP 379
Query: 282 LAAEQFYNSNLLGEEVGVCAEVARGMNCAVLKEHIVVKIELVMNETEKGKPMRMKDLEVK 341
L AEQ +N ++ +E+ + + V + +++ ++ E P+R + + +K
Sbjct: 380 LYAEQRFNRVMIVDEIKIAISMNESETGFVSSTEVEKRVQEIIGEC----PVRERTMAMK 435
>sp|Q9SY84|U90A2_ARATH UDP-glycosyltransferase 90A2 OS=Arabidopsis thaliana GN=UGT90A2
PE=2 SV=1
Length = 467
Score = 72.0 bits (175), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 60/193 (31%), Positives = 90/193 (46%), Gaps = 42/193 (21%)
Query: 200 NWLDRKPCR--SVLYVSFGSQDTIAVSQMVQLAMALEASGKNFIWIVRPPIGFDINSEFR 257
WLD K + +VLYV+FGSQ I+ Q+ ++A+ LE S NF+W+V+ G +I F
Sbjct: 270 KWLDEKRDKGCNVLYVAFGSQAEISREQLEEIALGLEESKVNFLWVVK---GNEIGKGFE 326
Query: 258 ANDAD----------------------------GTQSALEALSHGVPINGWPLAAEQFYN 289
+ G S E++ VPI +PLAAEQ N
Sbjct: 327 ERVGERGMMVRDEWVDQRKILEHESVRGFLSHCGWNSLTESICSEVPILAFPLAAEQPLN 386
Query: 290 SNLLGEEVGVCAEVARGMNCAVLKEHIVVKIELVMNETEKGKPMR--------MKDLEVK 341
+ L+ EE+ V V V +E I K++ +M E EKGK +R M ++
Sbjct: 387 AILVVEELRVAERVVAASEGVVRREEIAEKVKELM-EGEKGKELRRNVEAYGKMAKKALE 445
Query: 342 EIIDNAFRNDENL 354
E I ++ +N +NL
Sbjct: 446 EGIGSSRKNLDNL 458
Score = 48.1 bits (113), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 41/163 (25%), Positives = 72/163 (44%), Gaps = 18/163 (11%)
Query: 1 MLPFMAQGHVIPFLALAHHLES---TKNYTIAFVNTHLNIKKIKSSLPQSSPIHFLETPF 57
+ P++++GH+IP L LA L S + ++ T LN I SL + ++ PF
Sbjct: 10 LFPYLSKGHMIPMLQLARLLLSHSFAGDISVTVFTTPLNRPFIVDSL-SGTKATIVDVPF 68
Query: 58 NIIDHDLPPCTENTDSHPF---DVVRKLLEATLSFKPHFKKLIIDLIDEQNGHKPLCIIT 114
++PP E TD P + AT S + F++ ++ L + +++
Sbjct: 69 PDNVPEIPPGVECTDKLPALSSSLFVPFTRATKSMQADFERELMSL------PRVSFMVS 122
Query: 115 DMFFGWCKEIAQEYGIFHAIFIEGGGFGFACYYSLWVDLPHRN 157
D F W +E A++ G +F FG C ++ D +N
Sbjct: 123 DGFLWWTQESARKLGFPRLVF-----FGMNCASTVICDSVFQN 160
>sp|D4Q9Z4|SGT2_SOYBN Soyasapogenol B glucuronide galactosyltransferase OS=Glycine max
GN=GmSGT2 PE=1 SV=1
Length = 495
Score = 72.0 bits (175), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 96/202 (47%), Gaps = 44/202 (21%)
Query: 201 WLDRKPCRSVLYVSFGSQDTIAVSQMVQLAMALEASGKNFIWIVRPPIGFDINS-----E 255
WL+ K SVLYVSFGS + SQ+V++A ALE SG +FIW+VR G + ++ E
Sbjct: 275 WLNSKAESSVLYVSFGSINKFPYSQLVEIARALEDSGHDFIWVVRKNDGGEGDNFLEEFE 334
Query: 256 FRANDAD---------------------------GTQSALEALSHGVPINGWPLAAEQFY 288
R +++ G + +E+++ G+P+ WPL AE F+
Sbjct: 335 KRMKESNKGYLIWGWAPQLLILENPAIGGLVTHCGWNTVVESVNAGLPMATWPLFAEHFF 394
Query: 289 NSNLLGE----EVGVCAEVARGMN----CAVLKEHIVVKIELVMNETEKGKPMRMKDLEV 340
N L+ + V V A+ R N V +E I I +M+E E+ MR + E+
Sbjct: 395 NEKLVVDVLKIGVPVGAKEWRNWNEFGSEVVKREEIGNAIASLMSEEEEDGGMRKRAKEL 454
Query: 341 KEIIDNAFR----NDENLRDLL 358
+A + + N+++L+
Sbjct: 455 SVAAKSAIKVGGSSHNNMKELI 476
Score = 44.7 bits (104), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 44/177 (24%), Positives = 75/177 (42%), Gaps = 17/177 (9%)
Query: 2 LPFMAQGHVIPFLALAHHLESTKNYTIAFVNTHLNIKKIKSSL----PQSSPIHFLETPF 57
LPF++ H+IP + +A L + + + + T N + S+ + PI F
Sbjct: 13 LPFLSTSHIIPLVDMAR-LFALHDVDVTIITTAHNATVFQKSIDLDASRGRPIRTHVVNF 71
Query: 58 NIIDHDLPPCTE--NTDSHPFDVVRKLLEATLSFKPHFKKLIIDLIDEQNGHKPLCIITD 115
LP E N D+ P ++ ++ + F+KL DL +P I+TD
Sbjct: 72 PAAQVGLPVGIEAFNVDT-PREMTPRIYMGLSLLQQVFEKLFHDL-------QPDFIVTD 123
Query: 116 MFFGWCKEIAQEYGIFHAIFIEGGGFGFACYYSLWVDLPHRNT--DSDEFLLLDFPE 170
MF W + A + GI +F + +S+ PH D+D+F+L P+
Sbjct: 124 MFHPWSVDAAAKLGIPRIMFHGASYLARSAAHSVEQYAPHLEAKFDTDKFVLPGLPD 180
>sp|Q9LSY5|U71B7_ARATH UDP-glycosyltransferase 71B7 OS=Arabidopsis thaliana GN=UGT71B7
PE=2 SV=2
Length = 495
Score = 72.0 bits (175), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 59/223 (26%), Positives = 95/223 (42%), Gaps = 51/223 (22%)
Query: 167 DFPEASTIHPVLRFTGSKAGAGKEHGISAELCKNWLDRKPCRSVLYVSFGSQDTIAVSQM 226
D P + P+L + + E + E+ + WLD++P SV+++ FGS Q+
Sbjct: 237 DTPPVYPVGPLLHLENQRDDSKDEKRL--EIIR-WLDQQPPSSVVFLCFGSMGGFGEEQV 293
Query: 227 VQLAMALEASGKNFIWIVR---PPIGFDINSEF-------------RANDAD-------- 262
++A+ALE SG F+W +R P I ++ EF R D
Sbjct: 294 REIAIALERSGHRFLWSLRRASPNIFKELPGEFTNLEEVLPEGFFDRTKDIGKVIGWAPQ 353
Query: 263 ----------------GTQSALEALSHGVPINGWPLAAEQFYNSNLLGEEVGVCAEVAR- 305
G S LE+L GVP WPL AEQ +N+ L+ EE+G+ E+ +
Sbjct: 354 VAVLANPAIGGFVTHCGWNSTLESLWFGVPTAAWPLYAEQKFNAFLMVEELGLAVEIRKY 413
Query: 306 -------GMNCAVLKEHIVVKIELVMNETEKGKPMRMKDLEVK 341
G+ A + + K + + E + R+KD+ K
Sbjct: 414 WRGEHLAGLPTATVTAEEIEKAIMCLMEQDSDVRKRVKDMSEK 456
>sp|Q9LMF0|U85A5_ARATH UDP-glycosyltransferase 85A5 OS=Arabidopsis thaliana GN=UGT85A5
PE=2 SV=1
Length = 479
Score = 71.6 bits (174), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 54/182 (29%), Positives = 82/182 (45%), Gaps = 35/182 (19%)
Query: 198 CKNWLDRKPCRSVLYVSFGSQDTIAVSQMVQLAMALEASGKNFIWIVRP----------P 247
C +WLD K SV+YV+FGS ++ Q+V+ A L A+ K+F+W++RP P
Sbjct: 282 CLDWLDTKSPNSVVYVNFGSITVMSAKQLVEFAWGLAATKKDFLWVIRPDLVAGDVPMLP 341
Query: 248 IGFDINSEFRANDA--------------------DGTQSALEALSHGVPINGWPLAAEQF 287
F I + R A G S LE+LS GVP+ WP AEQ
Sbjct: 342 PDFLIETANRRMLASWCPQEKVLSHPAVGGFLTHSGWNSTLESLSGGVPMVCWPFFAEQQ 401
Query: 288 YNSNLLGEEVGVCAEVARGMNCAVLKEHIVVKIELVMNETEKGKPMRMKDLEVKEIIDNA 347
N +E V E+ + ++E + + + +KGK MR K E + + + A
Sbjct: 402 TNCKYCCDEWEVGMEIGGDVRREEVEELVRELM-----DGDKGKKMRQKAEEWQRLAEEA 456
Query: 348 FR 349
+
Sbjct: 457 TK 458
Score = 46.6 bits (109), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 67/147 (45%), Gaps = 8/147 (5%)
Query: 2 LPFMAQGHVIPFLALAHHLESTKNYTIAFVNTHLNIKKIKSSLPQSSPIHFLETPFNIID 61
+PF AQGH+ P L +A L + + + + FVNT+ N ++ S +S F I
Sbjct: 17 IPFPAQGHINPMLKVAKLLYA-RGFHVTFVNTNYNHNRLIRSRGPNSLDGLPSFRFESIP 75
Query: 62 HDLPPCTENTDSHPFDVVRKLLEATLS--FKPHFKKLIIDLIDEQNGHKPLCIITDMFFG 119
LP EN D V L E+T+ P FK+L+ + ++ CI++D
Sbjct: 76 DGLP--EENKDV--MQDVPTLCESTMKNCLAP-FKELLRRINTTKDVPPVSCIVSDGVMS 130
Query: 120 WCKEIAQEYGIFHAIFIEGGGFGFACY 146
+ + A+E G+ +F GF Y
Sbjct: 131 FTLDAAEELGVPDVLFWTPSACGFLAY 157
>sp|Q93XP7|CZOG1_MAIZE Cis-zeatin O-glucosyltransferase 1 OS=Zea mays GN=CISZOG1 PE=1 SV=1
Length = 467
Score = 71.6 bits (174), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 55/205 (26%), Positives = 89/205 (43%), Gaps = 54/205 (26%)
Query: 198 CKNWLDRKPCRSVLYVSFGSQDTIAVSQMVQLAMALEASGKNFIWIVRPPIGFDIN---- 253
C WLDR+P SVLYVSFG+ + Q+ +LA AL+ S + F+W++R DI
Sbjct: 258 CLEWLDRQPPESVLYVSFGTTSCLHADQVAELAAALKGSKQRFVWVLRDADRADIYAESG 317
Query: 254 --------SEFRANDADGT----------------------------QSALEALSHGVPI 277
SEF + +GT S +E+LSHG P+
Sbjct: 318 ESRHAMFLSEF-TRETEGTGLVITGWAPQLEILAHGATAAFMSHCGWNSTIESLSHGKPV 376
Query: 278 NGWPLAAEQFYNSNLLGEEVGVCAEVARGMNCAVLKEHIVV----KIELVMNE---TEKG 330
WP+ ++Q ++S LL C G+ ++H + I+ V+ E ++ G
Sbjct: 377 LAWPMHSDQPWDSELL------CKYFKAGLLVRPWEKHAEIVPAQAIQKVIEEAMLSDSG 430
Query: 331 KPMRMKDLEVKEIIDNAFRNDENLR 355
+R + E+ E + + + N R
Sbjct: 431 MAVRQRAKELGEAVRASVADGGNSR 455
>sp|Q9M9E7|U85A4_ARATH UDP-glycosyltransferase 85A4 OS=Arabidopsis thaliana GN=UGT85A4
PE=2 SV=1
Length = 489
Score = 71.6 bits (174), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 51/179 (28%), Positives = 84/179 (46%), Gaps = 36/179 (20%)
Query: 200 NWLDRKPCRSVLYVSFGSQDTIAVSQMVQLAMALEASGKNFIWIVRPPI--GFD--INSE 255
+WLD K ++V+YV+FGS + Q+++ A L SGK F+W+VR + G D + +E
Sbjct: 288 DWLDTKAEKAVIYVNFGSLTVLTSEQILEFAWGLARSGKEFLWVVRSGMVDGDDSILPAE 347
Query: 256 FRANDAD---------------------------GTQSALEALSHGVPINGWPLAAEQFY 288
F + + G S LE+L GVP+ WP A+Q
Sbjct: 348 FLSETKNRGMLIKGWCSQEKVLSHPAIGGFLTHCGWNSTLESLYAGVPMICWPFFADQLT 407
Query: 289 NSNLLGEEVGVCAEVARGMNCAVLKEHIVVKIELVMNETEKGKPMRMKDLEVKEIIDNA 347
N E+ G+ E+ V +E + ++ +M+ EKGK +R K +E + + + A
Sbjct: 408 NRKFCCEDWGIGMEIGE----EVKRERVETVVKELMD-GEKGKRLREKVVEWRRLAEEA 461
Score = 41.2 bits (95), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 39/131 (29%), Positives = 62/131 (47%), Gaps = 8/131 (6%)
Query: 2 LPFMAQGHVIPFLALAHHLESTKNYTIAFVNTHLNIKKIKSSLPQSSPIHFLETPFNIID 61
+P+ AQGH+ P L LA L + + + + FVNT N ++I QS H L +
Sbjct: 17 IPYPAQGHINPMLKLAKLLHA-RGFHVTFVNTDYNHRRIL----QSRGPHALNGLPSFRF 71
Query: 62 HDLPPCTENTDSHPFDVVRKLLEATLS--FKPHFKKLIIDLIDEQNGHKPLCIITDMFFG 119
+P TD + KL+++T++ P FK LI+ L + CII+D
Sbjct: 72 ETIPDGLPWTDVDAKQDMLKLIDSTINNCLAP-FKDLILRLNSGSDIPPVSCIISDASMS 130
Query: 120 WCKEIAQEYGI 130
+ + A+E I
Sbjct: 131 FTIDAAEELKI 141
>sp|Q9ZQ96|U73C3_ARATH UDP-glycosyltransferase 73C3 OS=Arabidopsis thaliana GN=UGT73C3
PE=2 SV=1
Length = 496
Score = 70.9 bits (172), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 87/207 (42%), Gaps = 43/207 (20%)
Query: 184 KAGAGKEHGISAELCKNWLDRKPCRSVLYVSFGSQDTIAVSQMVQLAMALEASGKNFIWI 243
KA G + I + C WLD K SVLYV GS + +SQ+ +L + LE S ++FIW+
Sbjct: 264 KAERGSKAAIDQDECLQWLDSKEEGSVLYVCLGSICNLPLSQLKELGLGLEESRRSFIWV 323
Query: 244 VRPPIGFD------INSEFRANDAD---------------------------GTQSALEA 270
+R + + S F + G S LE
Sbjct: 324 IRGSEKYKELFEWMLESGFEERIKERGLLIKGWAPQVLILSHPSVGGFLTHCGWNSTLEG 383
Query: 271 LSHGVPINGWPLAAEQFYNSNLLGE--EVGVCAEVARGMNCA--------VLKEHIVVKI 320
++ G+P+ WPL +QF N L+ + + GV A V M V KE + +
Sbjct: 384 ITSGIPLITWPLFGDQFCNQKLVVQVLKAGVSAGVEEVMKWGEEDKIGVLVDKEGVKKAV 443
Query: 321 ELVMNETEKGKPMRMKDLEVKEIIDNA 347
E +M +++ K R + E+ E+ A
Sbjct: 444 EELMGDSDDAKERRRRVKELGELAHKA 470
Score = 62.8 bits (151), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 55/183 (30%), Positives = 85/183 (46%), Gaps = 24/183 (13%)
Query: 1 MLPFMAQGHVIPFLALAHHLESTKNYTIAFVNTHLNIKKIKSSLPQSS----PIHFLETP 56
+ PFMAQGH+IP + +A L + + TI V T N + K+ L ++ I+ L
Sbjct: 17 LFPFMAQGHMIPMIDIARLL-AQRGVTITIVTTPHNAARFKNVLNRAIESGLAINILHVK 75
Query: 57 FNIIDHDLPPCTENTDSHPFDVVRKLLEATLSFKPHFKKL------IIDLIDEQNGHKPL 110
F + LP EN DS L++T P FK + ++ L++E +P
Sbjct: 76 FPYQEFGLPEGKENIDS---------LDSTELMVPFFKAVNLLEDPVMKLMEEMK-PRPS 125
Query: 111 CIITDMFFGWCKEIAQEYGIFHAIFIEGGGFGFACYYSLWVDLP-HRNTDSDE--FLLLD 167
C+I+D + IA+ + I +F G F C + L +L N SDE FL+
Sbjct: 126 CLISDWCLPYTSIIAKNFNIPKIVFHGMGCFNLLCMHVLRRNLEILENVKSDEEYFLVPS 185
Query: 168 FPE 170
FP+
Sbjct: 186 FPD 188
>sp|Q9FE68|U71C5_ARATH UDP-glycosyltransferase 71C5 OS=Arabidopsis thaliana GN=UGT71C5
PE=2 SV=1
Length = 480
Score = 70.5 bits (171), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 53/169 (31%), Positives = 75/169 (44%), Gaps = 34/169 (20%)
Query: 167 DFPEASTIHPVLRFTGSKAGAGKEHGISAELCKNWLDRKPCRSVLYVSFGSQDTIAVSQM 226
D+P + PVL TG + G E+ K WLD +P SVL++ FGS Q+
Sbjct: 242 DYPHVYPVGPVLNLTG-RTNPGLASAQYKEMMK-WLDEQPDSSVLFLCFGSMGVFPAPQI 299
Query: 227 VQLAMALEASGKNFIWIVRP------------PIGFDINSEFR-------------ANDA 261
++A ALE G FIW +R P GF + R A+ A
Sbjct: 300 TEIAHALELIGCRFIWAIRTNMAGDGDPQEPLPEGFVDRTMGRGIVCSWAPQVDILAHKA 359
Query: 262 DGT-------QSALEALSHGVPINGWPLAAEQFYNSNLLGEEVGVCAEV 303
G S E+L +GVPI WP+ AEQ N+ + +E+G+ E+
Sbjct: 360 TGGFVSHCGWNSVQESLWYGVPIATWPMYAEQQLNAFEMVKELGLAVEI 408
>sp|Q8RXA5|CZOG2_MAIZE Cis-zeatin O-glucosyltransferase 2 OS=Zea mays GN=CISZOG2 PE=1 SV=1
Length = 463
Score = 70.5 bits (171), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 55/205 (26%), Positives = 89/205 (43%), Gaps = 54/205 (26%)
Query: 198 CKNWLDRKPCRSVLYVSFGSQDTIAVSQMVQLAMALEASGKNFIWIVRPPIGFDIN---- 253
C WLDR+P SVLYVSFG+ + Q+ +LA AL+ S + F+W++R DI
Sbjct: 254 CLEWLDRQPPESVLYVSFGTTSCLHADQVAELAAALKGSKQRFVWVLRDADRADIYAESG 313
Query: 254 --------SEFRANDADGT----------------------------QSALEALSHGVPI 277
SEF + +GT S +E+LSHG P+
Sbjct: 314 DSRHAKFLSEF-TRETEGTGLVVTGWAPQLEILAHGATAAFMSHCGWNSIIESLSHGKPV 372
Query: 278 NGWPLAAEQFYNSNLLGEEVGVCAEVARGMNCAVLKEHIVV----KIELVMNE---TEKG 330
WP+ ++Q ++S LL C G+ ++H + I+ V+ E ++ G
Sbjct: 373 LAWPMHSDQPWDSELL------CNYFKAGLLVRPWEKHAEIIPAQAIQKVIEEAMLSDSG 426
Query: 331 KPMRMKDLEVKEIIDNAFRNDENLR 355
+R + E+ E + + + N R
Sbjct: 427 MAVRQRAKELGEAVRASVADGGNSR 451
>sp|Q5NTH0|UGAT_BELPE Cyanidin-3-O-glucoside 2-O-glucuronosyltransferase OS=Bellis
perennis GN=UGAT PE=1 SV=1
Length = 438
Score = 69.3 bits (168), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 96/413 (23%), Positives = 158/413 (38%), Gaps = 81/413 (19%)
Query: 1 MLPFMAQGHVIPFLALAHHLESTKNYTIAFVNTHLNIKKIKSSLPQ--SSPIHFLETPFN 58
MLP++A H+ FL A L + N+ I ++ N++ +K++L S I +E N
Sbjct: 14 MLPWLAYSHISRFLVFAKRL-TNHNFHIYICSSQTNMQYLKNNLTSQYSKSIQLIE--LN 70
Query: 59 I-IDHDLPPCTENTDSHPFDVVRKLLEATLSFKPHFKKLIIDLIDEQNGHKPLCIITDMF 117
+ +LP T P + + L + P F+ ++I L P +I D
Sbjct: 71 LPSSSELPLQYHTTHGLPPHLTKTLSDDYQKSGPDFETILIKL-------NPHLVIYDFN 123
Query: 118 FGWCKEIAQEYGIFHAIFIEG--------------------GGFGFACYYSLWVDLPHRN 157
W E+A I + G F F Y D+P
Sbjct: 124 QLWAPEVASTLHIPSIQLLSGCVALYALDAHLYTKPLDENLAKFPFPEIYPKNRDIPKGG 183
Query: 158 TD-----------SDEFLLLDFP---EASTIHPVLRFTGSKAGAGKEHGISAELCKN--- 200
+ S E +L+ E I + + G K A L ++
Sbjct: 184 SKYIERFVDCMRRSCEIILVRSTMELEGKYIDYLSKTLGKKVLPVGPLVQEASLLQDDHI 243
Query: 201 ----WLDRKPCRSVLYVSFGSQDTIAVSQMVQLAMALEASGKNFIWIVRPPIGFDINSEF 256
WLD+K SV++V FGS+ ++ +++ +A LE S +F+W +R
Sbjct: 244 WIMKWLDKKEESSVVFVCFGSEYILSDNEIEDIAYGLELSQVSFVWAIRAKTSALNGFID 303
Query: 257 RANDAD-------------------------GTQSALEALSHGVPINGWPLAAEQFYNSN 291
R D G S +E++ +GVPI P+ +Q YN+
Sbjct: 304 RVGDKGLVIDKWVPQANILSHSSTGGFISHCGWSSTMESIRYGVPIIAMPMQFDQPYNAR 363
Query: 292 LLGEEVGVCAEVARGMNCAVLKEHIVVKIELVMNETEKGKPMRMKDLEVKEII 344
L+ E VG EV R + +E I + V+ E + G+ +R K E+ EI+
Sbjct: 364 LM-ETVGAGIEVGRDGEGRLKREEIAAVVRKVVVE-DSGESIREKAKELGEIM 414
>sp|Q9SGA8|U83A1_ARATH UDP-glycosyltransferase 83A1 OS=Arabidopsis thaliana GN=UGT83A1
PE=2 SV=1
Length = 464
Score = 68.9 bits (167), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 45/163 (27%), Positives = 81/163 (49%), Gaps = 6/163 (3%)
Query: 1 MLPFMAQGHVIPFLALAHHLESTKNYTIAFVNTHLNIKKIKSSLPQSSPIHFLETPFNII 60
++P+ AQGHV+P ++ + +L + + I F+NT N +I SSLP S ++ N++
Sbjct: 16 VIPYPAQGHVLPLISFSRYL-AKQGIQITFINTEFNHNRIISSLPNSPHEDYVGDQINLV 74
Query: 61 DHDLPPCTENTDSHPFDVVRKLLEATLSFKPH-FKKLIIDLIDEQNGHKPL-CIITDMFF 118
+P E++ ++ KL E+ L F P ++LI ++ E +G + C++ D
Sbjct: 75 S--IPDGLEDSPEE-RNIPGKLSESVLRFMPKKVEELIERMMAETSGGTIISCVVADQSL 131
Query: 119 GWCKEIAQEYGIFHAIFIEGGGFGFACYYSLWVDLPHRNTDSD 161
GW E+A ++GI F +S+ + DSD
Sbjct: 132 GWAIEVAAKFGIRRTAFCPAAAASMVLGFSIQKLIDDGLIDSD 174
Score = 53.1 bits (126), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 44/173 (25%), Positives = 77/173 (44%), Gaps = 22/173 (12%)
Query: 198 CKNWLDRKPCRSVLYVSFGSQDTIAVSQMVQLAMALEASGKNFIWIV--RPPIGFDIN-- 253
C +WLDR+ SV+YV+FGS + Q+ +LA+ LE + + +W+ + PI +
Sbjct: 276 CLDWLDRQIPGSVIYVAFGSFGVMGNPQLEELAIGLELTKRPVLWVTGDQQPIKLGSDRV 335
Query: 254 --------SEFRANDADGT-------QSALEALSHGVPINGWPLAAEQFYNSNLLGEEVG 298
E ++ A G S LE +G+P P A+QF N + +
Sbjct: 336 KVVRWAPQREVLSSGAIGCFVSHCGWNSTLEGAQNGIPFLCIPYFADQFINKAYICDVWK 395
Query: 299 VCAEVARGMNCAVLKEHIVVKIELVMNETEKGKPMRMKDLEVKEIIDNAFRND 351
+ + R V + + KI+ +M + G + ++VKEI+ + D
Sbjct: 396 IGLGLERDARGVVPRLEVKKKIDEIMRD---GGEYEERAMKVKEIVMKSVAKD 445
>sp|P14726|UFOG_HORVU Anthocyanidin 3-O-glucosyltransferase OS=Hordeum vulgare GN=BZ1
PE=3 SV=1
Length = 455
Score = 68.6 bits (166), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 50/181 (27%), Positives = 74/181 (40%), Gaps = 28/181 (15%)
Query: 198 CKNWLDRKPCRSVLYVSFGSQDTIAVSQMVQLAMALEASGKNFIWIVR------------ 245
C WLDR+P RSV YVSFG+ T ++ +LA LEASG F+W +R
Sbjct: 267 CLAWLDRRPARSVAYVSFGTNATARPDELQELAAGLEASGAPFLWSLRGVVAAAPRGFLE 326
Query: 246 ----------PPIGFDINSEFRA-NDADGTQSALEALSHGVPINGWPLAAEQFYNSNLLG 294
P +G ++ A G S +E +S GVP+ P +Q N+ +
Sbjct: 327 RAPGLVVPWAPQVGVLRHAAVGAFVTHAGWASVMEGVSSGVPMACRPFFGDQTMNARSVA 386
Query: 295 EEVGVCAEVARGMNCAVLKEHIVVKIELVMNETEKGKPMRMKDLEVKEIIDNAFRNDENL 354
G M + + + E G+ MR K E++ ++ AF D
Sbjct: 387 SVWGFGTAFDGPMTRGAVANAVATLL-----RGEDGERMRAKAQELQAMVGKAFEPDGGC 441
Query: 355 R 355
R
Sbjct: 442 R 442
>sp|P0C7P7|U74E1_ARATH UDP-glycosyltransferase 74E1 OS=Arabidopsis thaliana GN=UGT74E1
PE=3 SV=1
Length = 453
Score = 68.6 bits (166), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 106/458 (23%), Positives = 168/458 (36%), Gaps = 117/458 (25%)
Query: 1 MLPFMAQGHVIPFLALAHHLESTKNYTIAFVNTHLNIKKIKSSLPQSSPIHFLETPFNII 60
+LPF AQGH+ P L S L I + S S P ++
Sbjct: 9 VLPFPAQGHITPMSQFCKRLASKS----------LKITLVLVSDKPSPPYKTEHDTITVV 58
Query: 61 DHDLPPCTENTDSHPFDVVRKLLEATLSFKPHFKKLIIDLIDEQNGHKPLCIITDMFFGW 120
S D + +E+ S K KLI D+ + +G+ P ++ D W
Sbjct: 59 PISNGFQEGQERSEDLDEYMERVES--SIKNRLPKLIEDM--KLSGNPPRALVYDSTMPW 114
Query: 121 CKEIAQEYG---------------IFHAIF--------IEGGGFGFACYYSLWV----DL 153
++A YG I++ +F + G A + SL + DL
Sbjct: 115 LLDVAHSYGLSGAVFFTQPWLVSAIYYHVFKGSFSVPSTKYGHSTLASFPSLPILNANDL 174
Query: 154 PH-------------------RNTDSDEFLLLD---------FPEASTIHPVLR------ 179
P N D + +L + ++ PVL
Sbjct: 175 PSFLCESSSYPYILRTVIDQLSNIDRVDIVLCNTFDKLEEKLLKWIKSVWPVLNIGPTVP 234
Query: 180 --FTGSKAGAGKEHGIS------AELCKNWLDRKPCRSVLYVSFGSQDTIAVSQMVQLAM 231
+ + K +G S AE C WL+ K SV+YVSFGS + Q+++LA
Sbjct: 235 SMYLDKRLAEDKNYGFSLFGAKIAE-CMEWLNSKQPSSVVYVSFGSLVVLKKDQLIELAA 293
Query: 232 ALEASGKNFIWIVRP-------------------PIGFDINSEFRANDADGT-------Q 265
L+ SG F+W+VR + + E + + G
Sbjct: 294 GLKQSGHFFLWVVRETERRKLPENYIEEIGEKGLTVSWSPQLEVLTHKSIGCFVTHCGWN 353
Query: 266 SALEALSHGVPINGWPLAAEQFYNSNLLGEEVGVCAEVARGMNCAVLKEHIVVKIELVMN 325
S LE LS GVP+ G P A+Q N+ + + V V + V +E V ++E VM
Sbjct: 354 STLEGLSLGVPMIGMPHWADQPTNAKFMEDVWKVGVRVKADSDGFVRREEFVRRVEEVM- 412
Query: 326 ETEKGKPMRMKDLE-----VKEIIDNAFRNDENLRDLL 358
E E+GK +R K+ E +E + +D+N+ + +
Sbjct: 413 EAEQGKEIR-KNAEKWKVLAQEAVSEGGSSDKNINEFV 449
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.322 0.139 0.427
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 138,317,756
Number of Sequences: 539616
Number of extensions: 5904206
Number of successful extensions: 12651
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 153
Number of HSP's successfully gapped in prelim test: 10
Number of HSP's that attempted gapping in prelim test: 12212
Number of HSP's gapped (non-prelim): 366
length of query: 358
length of database: 191,569,459
effective HSP length: 119
effective length of query: 239
effective length of database: 127,355,155
effective search space: 30437882045
effective search space used: 30437882045
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 62 (28.5 bits)