BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 023401
(282 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224133786|ref|XP_002327680.1| predicted protein [Populus trichocarpa]
gi|222836765|gb|EEE75158.1| predicted protein [Populus trichocarpa]
Length = 481
Score = 402 bits (1032), Expect = e-109, Method: Compositional matrix adjust.
Identities = 218/281 (77%), Positives = 240/281 (85%), Gaps = 6/281 (2%)
Query: 3 KHSATGWWVP--LTKRWILALLIMLSISTAIAFFIRAALDPCD----RHLEVSDKKRVQS 56
+HS T W P + KRW+L LLIMLS+ST IAFFI++A D CD + +V+ +
Sbjct: 7 QHSNTAGWAPPSVQKRWLLTLLIMLSVSTLIAFFIKSAFDSCDPPHPHNFDVAASNKPAK 66
Query: 57 QSVPRIATKSSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPS 116
I T SPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLR VGT+V WITIQKPS
Sbjct: 67 VFSNSIKTAPSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRSVGTEVFWITIQKPS 126
Query: 117 EEDEVIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRV 176
E DEV+YSLE KM RGVQV+SAKGQE I+TA KADL+VLNTAVAGKWLDAVLKE+VPRV
Sbjct: 127 ETDEVVYSLEQKMLVRGVQVLSAKGQEAIDTAFKADLVVLNTAVAGKWLDAVLKENVPRV 186
Query: 177 LPNVLWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVH 236
LP VLWWIHEMRGHYFKLDYVKHLPLV GAMIDSHVTAEYWKNRT+ERLRIKMP+TYVVH
Sbjct: 187 LPKVLWWIHEMRGHYFKLDYVKHLPLVGGAMIDSHVTAEYWKNRTQERLRIKMPETYVVH 246
Query: 237 LGNSKELMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
LGNSKELMEVAED+VAKRVLREH+RESLGVR+ED+LFAIIN
Sbjct: 247 LGNSKELMEVAEDSVAKRVLREHIRESLGVRDEDILFAIIN 287
>gi|302142552|emb|CBI19755.3| unnamed protein product [Vitis vinifera]
Length = 463
Score = 380 bits (975), Expect = e-103, Method: Compositional matrix adjust.
Identities = 206/277 (74%), Positives = 233/277 (84%), Gaps = 8/277 (2%)
Query: 1 MGKHSATGWWVPLTKRWILALLIMLSISTAIAFFIRAALDPCDRHLEVSDKKRVQSQSVP 60
M K S + W+ KRW L L+ +LS ST I IRAA D C+ + R+Q S
Sbjct: 1 MAKQSTS--WLTFHKRWPLLLVALLSTSTVIVLLIRAASDSCN----TNSNTRIQVTS-- 52
Query: 61 RIATKSSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDE 120
++ T S+PL FMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVG +V W+TIQKP++ DE
Sbjct: 53 QVETPSNPLRFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGAEVVWLTIQKPTDSDE 112
Query: 121 VIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNV 180
VIYSLEH+M DRGV+V AKGQE I+TALKADL+VLNTAVAGKWLD+V+KE+VPR+LP V
Sbjct: 113 VIYSLEHRMLDRGVKVFPAKGQEAIDTALKADLVVLNTAVAGKWLDSVVKENVPRILPKV 172
Query: 181 LWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNS 240
LWWIHEMRGHYFKL+YVKHLP VAGAMIDSH TAEYWKNRTRERL IKMP+TYVVHLGNS
Sbjct: 173 LWWIHEMRGHYFKLEYVKHLPYVAGAMIDSHTTAEYWKNRTRERLGIKMPETYVVHLGNS 232
Query: 241 KELMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
KELME+AE+NVAKRVLREHVRESLGVRNEDLLFA+IN
Sbjct: 233 KELMEIAENNVAKRVLREHVRESLGVRNEDLLFAVIN 269
>gi|225458171|ref|XP_002281084.1| PREDICTED: uncharacterized protein LOC100257473 [Vitis vinifera]
Length = 479
Score = 374 bits (960), Expect = e-101, Method: Compositional matrix adjust.
Identities = 206/289 (71%), Positives = 235/289 (81%), Gaps = 16/289 (5%)
Query: 1 MGKHSATGWWVPLTKRWILALLIMLSISTAIAFFIRAALDPCDRHLEVS----------- 49
M K S + W+ KRW L L+ +LS ST I IRAA D C+ + +
Sbjct: 1 MAKQSTS--WLTFHKRWPLLLVALLSTSTVIVLLIRAASDSCNTNSVTTTTTITTTPHYS 58
Query: 50 -DKKRVQSQSVPRIATKSSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVN 108
+ R+Q S ++ T S+PL FMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVG +V
Sbjct: 59 YENTRIQVTS--QVETPSNPLRFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGAEVV 116
Query: 109 WITIQKPSEEDEVIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAV 168
W+TIQKP++ DEVIYSLEH+M DRGV+V AKGQE I+TALKADL+VLNTAVAGKWLD+V
Sbjct: 117 WLTIQKPTDSDEVIYSLEHRMLDRGVKVFPAKGQEAIDTALKADLVVLNTAVAGKWLDSV 176
Query: 169 LKEDVPRVLPNVLWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIK 228
+KE+VPR+LP VLWWIHEMRGHYFKL+YVKHLP VAGAMIDSH TAEYWKNRTRERL IK
Sbjct: 177 VKENVPRILPKVLWWIHEMRGHYFKLEYVKHLPYVAGAMIDSHTTAEYWKNRTRERLGIK 236
Query: 229 MPDTYVVHLGNSKELMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
MP+TYVVHLGNSKELME+AE+NVAKRVLREHVRESLGVRNEDLLFA+IN
Sbjct: 237 MPETYVVHLGNSKELMEIAENNVAKRVLREHVRESLGVRNEDLLFAVIN 285
>gi|147771878|emb|CAN73426.1| hypothetical protein VITISV_033235 [Vitis vinifera]
Length = 495
Score = 374 bits (959), Expect = e-101, Method: Compositional matrix adjust.
Identities = 207/289 (71%), Positives = 235/289 (81%), Gaps = 16/289 (5%)
Query: 1 MGKHSATGWWVPLTKRWILALLIMLSISTAIAFFIRAALDPCDRHLEVS----------- 49
M K S + W+ KRW L L+ +LS ST I IRAA D C+ + +
Sbjct: 1 MAKQSTS--WLTFHKRWPLLLVALLSTSTVIVLLIRAASDSCNTNSVTTTTTITTTPHYS 58
Query: 50 -DKKRVQSQSVPRIATKSSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVN 108
+ R+Q S ++ T S+PL FMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVG +V
Sbjct: 59 YENTRIQVTS--QVETPSNPLRFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGAEVV 116
Query: 109 WITIQKPSEEDEVIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAV 168
W+TIQKP++ DEVIYSLEH+M DRGV+V AKGQE I+TALKADL+VLNTAVAGKWLD+V
Sbjct: 117 WLTIQKPTDSDEVIYSLEHRMLDRGVKVFPAKGQEAIDTALKADLVVLNTAVAGKWLDSV 176
Query: 169 LKEDVPRVLPNVLWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIK 228
+KE+VPR+LP VLWWIHEMRGHYFKL+YVKHLP VAGAMIDSH TAEYWKNRTRERL IK
Sbjct: 177 VKENVPRILPKVLWWIHEMRGHYFKLEYVKHLPYVAGAMIDSHTTAEYWKNRTRERLGIK 236
Query: 229 MPDTYVVHLGNSKELMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
MP+TYVVHLGNSKELME+AE+NVAKRVLREHVRESLGVRNEDLLFAIIN
Sbjct: 237 MPETYVVHLGNSKELMEIAENNVAKRVLREHVRESLGVRNEDLLFAIIN 285
>gi|15222229|ref|NP_177675.1| UDP-glycosyltransferase-like protein [Arabidopsis thaliana]
gi|30793985|gb|AAP40442.1| unknown protein [Arabidopsis thaliana]
gi|110739259|dbj|BAF01543.1| hypothetical protein [Arabidopsis thaliana]
gi|332197597|gb|AEE35718.1| UDP-glycosyltransferase-like protein [Arabidopsis thaliana]
Length = 463
Score = 355 bits (912), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 193/263 (73%), Positives = 218/263 (82%), Gaps = 3/263 (1%)
Query: 15 KRWILALLIMLSISTAIAFFIRAALDPCDRHLEVSDKKRVQSQSVPRIATKSSPLSFMKS 74
KRW L +L+ LS+ST +R++ + C + ++K +S + +S+PL FMKS
Sbjct: 10 KRWALMVLLFLSVSTVCMILVRSSFETCSISSQFVEEKNGESSAAK---FQSNPLDFMKS 66
Query: 75 KLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRGV 134
KLVLLVSHELSLSGGPLLLMELAFLLRGVG V WIT QKP E+DEV+YSLEHKM DRGV
Sbjct: 67 KLVLLVSHELSLSGGPLLLMELAFLLRGVGADVVWITNQKPLEDDEVVYSLEHKMLDRGV 126
Query: 135 QVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFKL 194
QVISAKGQ+ ++T+LKADLIVLNTAVAGKWLDAVLKE+V +VLP +LWWIHEMRGHYF
Sbjct: 127 QVISAKGQKAVDTSLKADLIVLNTAVAGKWLDAVLKENVVKVLPKILWWIHEMRGHYFNA 186
Query: 195 DYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVAEDNVAKR 254
D VKHLP VAGAMIDSH TA YWKNRT+ RL IKMP TYVVHLGNSKELMEVAED+VAKR
Sbjct: 187 DLVKHLPFVAGAMIDSHATAGYWKNRTQARLGIKMPKTYVVHLGNSKELMEVAEDSVAKR 246
Query: 255 VLREHVRESLGVRNEDLLFAIIN 277
VLREHVRESLGVRNEDLLF IIN
Sbjct: 247 VLREHVRESLGVRNEDLLFGIIN 269
>gi|449436130|ref|XP_004135847.1| PREDICTED: uncharacterized protein LOC101206589 [Cucumis sativus]
Length = 472
Score = 355 bits (910), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 198/267 (74%), Positives = 223/267 (83%), Gaps = 4/267 (1%)
Query: 15 KRWILALLIMLSISTAIAFFIRAALDPC---DRHLEVSDKKRVQSQSVPRIATKS-SPLS 70
KRW L +L ++SISTA+ FF+R D C D V +K P+ K+ +PL
Sbjct: 14 KRWPLMILALVSISTAMVFFMRTTFDSCTGNDNRRFVEEKGIGSKILSPQNERKAPNPLD 73
Query: 71 FMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMW 130
FMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGT+V WIT QKP E DEV+YSLE KM
Sbjct: 74 FMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTQVVWITNQKPPEPDEVVYSLERKML 133
Query: 131 DRGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGH 190
DRGVQV+SAK QE + TALKA L+VLNTAVAGKWLDAVLKE+VPRVLP VLWWIHEMRG+
Sbjct: 134 DRGVQVLSAKEQEAVETALKAHLVVLNTAVAGKWLDAVLKENVPRVLPKVLWWIHEMRGN 193
Query: 191 YFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVAEDN 250
YFK++YVKHLP VAGAMIDSH TAEYWKNRT +RL I+MP+TYVVHLGNSK+LMEVAE+N
Sbjct: 194 YFKVEYVKHLPFVAGAMIDSHTTAEYWKNRTWDRLGIQMPETYVVHLGNSKDLMEVAENN 253
Query: 251 VAKRVLREHVRESLGVRNEDLLFAIIN 277
VAKRVLREH+RESLGVRNED+LFAIIN
Sbjct: 254 VAKRVLREHIRESLGVRNEDILFAIIN 280
>gi|297839425|ref|XP_002887594.1| glycosyl transferase family 1 protein [Arabidopsis lyrata subsp.
lyrata]
gi|297333435|gb|EFH63853.1| glycosyl transferase family 1 protein [Arabidopsis lyrata subsp.
lyrata]
Length = 458
Score = 353 bits (907), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 193/263 (73%), Positives = 217/263 (82%), Gaps = 8/263 (3%)
Query: 15 KRWILALLIMLSISTAIAFFIRAALDPCDRHLEVSDKKRVQSQSVPRIATKSSPLSFMKS 74
KRW L +L+ LS+ST +R++ + C + D ++ QS +PL+FMKS
Sbjct: 10 KRWALMVLLFLSVSTVCVILVRSSFESCSVGGQFVDSSALRFQS--------NPLAFMKS 61
Query: 75 KLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRGV 134
KLVLLVSHELSLSGGPLLLMELAFLLRGVG +V WIT QKP EEDEV+YSLEHKM DRGV
Sbjct: 62 KLVLLVSHELSLSGGPLLLMELAFLLRGVGAEVVWITNQKPLEEDEVVYSLEHKMLDRGV 121
Query: 135 QVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFKL 194
QVISAKGQ+ ++ ALKADLIVLNTAVAGKWLDAVLKE+V +VLP +LWWIHEMRGHYF
Sbjct: 122 QVISAKGQKAVDIALKADLIVLNTAVAGKWLDAVLKENVFKVLPKILWWIHEMRGHYFNP 181
Query: 195 DYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVAEDNVAKR 254
D VKHLP VAGAMIDSH TAEYW+NRT+ RL IKMP TYVVHLGNSK+LMEVAED+VAKR
Sbjct: 182 DLVKHLPFVAGAMIDSHATAEYWQNRTQARLGIKMPKTYVVHLGNSKDLMEVAEDSVAKR 241
Query: 255 VLREHVRESLGVRNEDLLFAIIN 277
VLREHVRESLGVRNEDLLF IIN
Sbjct: 242 VLREHVRESLGVRNEDLLFGIIN 264
>gi|356518559|ref|XP_003527946.1| PREDICTED: uncharacterized protein LOC100791337 [Glycine max]
Length = 464
Score = 350 bits (899), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 197/279 (70%), Positives = 223/279 (79%), Gaps = 7/279 (2%)
Query: 1 MGKHSATGWWVPLTKRWILALLIMLSISTAIAFFIRAA-LDPCD-RHLEVSDKKRVQSQS 58
M KHS KRW + L LS+ST +R+ D C+ H V+ +++S
Sbjct: 1 MAKHSVA----MAKKRWPIMLAAFLSVSTVTVLLMRSNNSDSCNTNHFTVAQDNQIRS-P 55
Query: 59 VPRIATKSSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEE 118
V SSPL FMKSKLVL+VSHELSLSGGPLLLMELAFLLRGVG+ V WI+ QKPSE
Sbjct: 56 VQLTNAASSPLIFMKSKLVLMVSHELSLSGGPLLLMELAFLLRGVGSDVVWISNQKPSEH 115
Query: 119 DEVIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLP 178
D V+YSLE KM DRGVQV+SAKG+ I+TALKAD+++LNTAVAGKWLDA+LKE V VLP
Sbjct: 116 DRVVYSLESKMLDRGVQVLSAKGENAIDTALKADMVILNTAVAGKWLDAILKEKVAHVLP 175
Query: 179 NVLWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLG 238
VLWWIHEMRGHYFK++YVKHLP VAGAMIDSH TAEYWKNRTRERL I+MP+TYVVHLG
Sbjct: 176 KVLWWIHEMRGHYFKVEYVKHLPFVAGAMIDSHTTAEYWKNRTRERLGIEMPETYVVHLG 235
Query: 239 NSKELMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
NSKELMEVAED+VAKRVLREHVRESLGVRN+DLLFAIIN
Sbjct: 236 NSKELMEVAEDSVAKRVLREHVRESLGVRNDDLLFAIIN 274
>gi|297844940|ref|XP_002890351.1| glycosyl transferase family 1 protein [Arabidopsis lyrata subsp.
lyrata]
gi|297336193|gb|EFH66610.1| glycosyl transferase family 1 protein [Arabidopsis lyrata subsp.
lyrata]
Length = 480
Score = 347 bits (891), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 195/280 (69%), Positives = 223/280 (79%), Gaps = 3/280 (1%)
Query: 1 MGKHSATGWWVPLTKRWILALLIMLSISTAIAFFIRAALDPCDRHLEVSDKKRVQSQSVP 60
MGK S + W KRW L +L++LS+ST +R+ D C + +++ + S
Sbjct: 1 MGKPSTSVWATLQKKRWPLMILLVLSVSTLGMILVRSTFDSCSVSGKRCGREKEDNNSDV 60
Query: 61 RIATKS---SPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSE 117
+I + S +PL FMKSKLVLLVSHELSLSGGPLLLMELAFLLRGV ++V WIT QKP E
Sbjct: 61 KIQSVSGSLNPLGFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVESEVVWITNQKPVE 120
Query: 118 EDEVIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVL 177
EDEVI LEHKM DRGVQVISAK Q+ I+TALK+DL+VLNTAVAGKWLDAVLK++VP+VL
Sbjct: 121 EDEVIKVLEHKMLDRGVQVISAKSQKAIDTALKSDLVVLNTAVAGKWLDAVLKDNVPKVL 180
Query: 178 PNVLWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHL 237
P VLWWIHEMRGHYFK D VKHLP VAGAMIDSH TAEYWKNRT +RL I+MP TYVVHL
Sbjct: 181 PKVLWWIHEMRGHYFKPDLVKHLPFVAGAMIDSHATAEYWKNRTHDRLGIEMPKTYVVHL 240
Query: 238 GNSKELMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
GNSKELMEVAED+ AK VLRE VRESLGVRNED+LF IIN
Sbjct: 241 GNSKELMEVAEDSFAKNVLREQVRESLGVRNEDILFGIIN 280
>gi|255538754|ref|XP_002510442.1| glycosyltransferase, putative [Ricinus communis]
gi|223551143|gb|EEF52629.1| glycosyltransferase, putative [Ricinus communis]
Length = 477
Score = 346 bits (888), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 205/294 (69%), Positives = 225/294 (76%), Gaps = 24/294 (8%)
Query: 1 MGKHSATGWWVPLTKRWILALLIMLSISTAIAFFIRAALDP---------------CDRH 45
M K S + W RW L +L ++ST I F IR+A D DR
Sbjct: 1 MAKQS-SAWITKHKNRWPLMILAFFTLSTLIVFSIRSASDSCHSSSNITTTATTANVDRF 59
Query: 46 LE--VSDKKRVQSQSVPRIATKSSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGV 103
E V K ++ S P +PL FMKSKLVLLVSHELSLSGGPLLLMELAFLLRGV
Sbjct: 60 GEPKVDSKPQIHSSVAP------NPLDFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGV 113
Query: 104 GTKVNWITIQKPSEEDEVIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAGK 163
G +V WIT QKP+E DEVIYSLE+KM DRGVQV SAKGQ+ I+TALKADL+VLNTAVAGK
Sbjct: 114 GAEVVWITNQKPTETDEVIYSLENKMLDRGVQVFSAKGQKAIDTALKADLVVLNTAVAGK 173
Query: 164 WLDAVLKEDVPRVLPNVLWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRE 223
WLDA LKE V +VLP VLWWIHEMRGHYFKL+YVKHLP VAGAMIDSH TAEYWKNRTRE
Sbjct: 174 WLDATLKESVQQVLPKVLWWIHEMRGHYFKLEYVKHLPFVAGAMIDSHTTAEYWKNRTRE 233
Query: 224 RLRIKMPDTYVVHLGNSKELMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
RL IKMP+TYVVHLGNSK+LMEVAED+VAKRVL EHVRESLGVRN+DLLFAIIN
Sbjct: 234 RLGIKMPETYVVHLGNSKDLMEVAEDSVAKRVLCEHVRESLGVRNDDLLFAIIN 287
>gi|15223628|ref|NP_173401.1| glycosyl transferase family protein [Arabidopsis thaliana]
gi|51970562|dbj|BAD43973.1| hypothetical protein [Arabidopsis thaliana]
gi|51970668|dbj|BAD44026.1| hypothetical protein [Arabidopsis thaliana]
gi|115646733|gb|ABJ17098.1| At1g19710 [Arabidopsis thaliana]
gi|332191766|gb|AEE29887.1| glycosyl transferase family protein [Arabidopsis thaliana]
Length = 479
Score = 346 bits (887), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 197/282 (69%), Positives = 220/282 (78%), Gaps = 8/282 (2%)
Query: 1 MGKHSATGWWVPLTKRWILALLIMLSISTAIAFFIRAALDPCD-----RHLEVSDKKRVQ 55
M K S + W KRW L +L++LS+ST +R+ D C E D ++
Sbjct: 1 MAKPSTSMWATLQKKRWPLMILLVLSVSTVGMILVRSTFDSCSVSGKRCSREKEDNSDIK 60
Query: 56 SQSVPRIATKSSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKP 115
QSV + +PL FMKSKLVLLVSHELSLSGGPLLLMELAFLLRGV ++V WIT QKP
Sbjct: 61 IQSV---SGSLNPLEFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVESEVVWITNQKP 117
Query: 116 SEEDEVIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPR 175
EEDEVI LEHKM DRGVQVISAK Q+ I+TALK+DL+VLNTAVAGKWLDAVLK++VP+
Sbjct: 118 VEEDEVIKVLEHKMLDRGVQVISAKSQKAIDTALKSDLVVLNTAVAGKWLDAVLKDNVPK 177
Query: 176 VLPNVLWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVV 235
VLP VLWWIHEMRGHYFK D VKHLP VAGAMIDSH TAEYWKNRT +RL IKMP TYVV
Sbjct: 178 VLPKVLWWIHEMRGHYFKPDLVKHLPFVAGAMIDSHATAEYWKNRTHDRLGIKMPKTYVV 237
Query: 236 HLGNSKELMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
HLGNSKELMEVAED+ AK VLRE VRESLGVRNED+LF IIN
Sbjct: 238 HLGNSKELMEVAEDSFAKNVLREQVRESLGVRNEDILFGIIN 279
>gi|449491021|ref|XP_004158776.1| PREDICTED: uncharacterized LOC101206589 [Cucumis sativus]
Length = 278
Score = 341 bits (874), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 192/265 (72%), Positives = 218/265 (82%), Gaps = 4/265 (1%)
Query: 15 KRWILALLIMLSISTAIAFFIRAALDPC---DRHLEVSDKKRVQSQSVPRIATKS-SPLS 70
KRW L +L ++SISTA+ FF+R D C D V +K P+ K+ +PL
Sbjct: 14 KRWPLMILALVSISTAMVFFMRTTFDSCTGNDNRRFVEEKGIGSKILSPQNERKAPNPLD 73
Query: 71 FMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMW 130
FMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGT+V WIT QKP E DEV+YSLE KM
Sbjct: 74 FMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTQVVWITNQKPPEPDEVVYSLERKML 133
Query: 131 DRGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGH 190
DRGVQV+SAK QE + TALKA L+VLNTAVAGKWLDAVLKE+VPRVLP VLWWIHEMRG+
Sbjct: 134 DRGVQVLSAKEQEAVETALKAHLVVLNTAVAGKWLDAVLKENVPRVLPKVLWWIHEMRGN 193
Query: 191 YFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVAEDN 250
YFK++YVKHLP VAGAMIDSH TAEYWKNRT +RL I+MP+TYVVHLGNSK+LMEVAE+N
Sbjct: 194 YFKVEYVKHLPFVAGAMIDSHTTAEYWKNRTWDRLGIQMPETYVVHLGNSKDLMEVAENN 253
Query: 251 VAKRVLREHVRESLGVRNEDLLFAI 275
VAKRVLREH+RESLGVRNED+ +
Sbjct: 254 VAKRVLREHIRESLGVRNEDICLQL 278
>gi|225431277|ref|XP_002268739.1| PREDICTED: uncharacterized protein LOC100243789 [Vitis vinifera]
Length = 466
Score = 341 bits (874), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 189/277 (68%), Positives = 221/277 (79%), Gaps = 5/277 (1%)
Query: 1 MGKHSATGWWVPLTKRWILALLIMLSISTAIAFFIRAALDPCDRHLEVSDKKRVQSQSVP 60
M +HSA + L K+++ AL+ S+ T + RA CDRH SD + + +P
Sbjct: 1 MARHSAPRF--SLRKKFLFALVFFFSVPTVVFLIKRAPSISCDRH---SDAGVKRFEPLP 55
Query: 61 RIATKSSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDE 120
+ SPLSFMKSK VLLVSHELSLSGGPLLLMELAFLLR VG +V WIT KPSE DE
Sbjct: 56 QFGAAPSPLSFMKSKRVLLVSHELSLSGGPLLLMELAFLLRSVGAEVCWITNHKPSETDE 115
Query: 121 VIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNV 180
VIYSLE+KM RGVQV+ AKG+E IN ALKADL+VLNT +AGKWLD VLKE+VP VLP V
Sbjct: 116 VIYSLENKMQHRGVQVLPAKGREAINRALKADLLVLNTVMAGKWLDDVLKENVPHVLPKV 175
Query: 181 LWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNS 240
LWWIHE++GHYF+L+YV+HLPLVA AMIDSHV AEYWK TR+ LRIKMPDTYVVHLGNS
Sbjct: 176 LWWIHEIQGHYFQLEYVRHLPLVAAAMIDSHVAAEYWKTGTRDYLRIKMPDTYVVHLGNS 235
Query: 241 KELMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
K+LM++AED+VAKRVLREHVRESLGVR+ED+LFA+IN
Sbjct: 236 KDLMDIAEDSVAKRVLREHVRESLGVRDEDVLFAMIN 272
>gi|125525278|gb|EAY73392.1| hypothetical protein OsI_01273 [Oryza sativa Indica Group]
Length = 482
Score = 327 bits (839), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 179/259 (69%), Positives = 208/259 (80%), Gaps = 12/259 (4%)
Query: 26 SISTAIAFFIRAA-LDPCDRHLEVSDKKRVQSQSVPRIATK------SSPLSFMKSKLVL 78
+ ++ F +R A LDPCD D +R S +AT +PL FM+SKLVL
Sbjct: 35 ACASTAGFLLRGAMLDPCD-----VDARRGSGSSAAAVATTRTGAVAGNPLEFMRSKLVL 89
Query: 79 LVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRGVQVIS 138
LVSHELSLSGGPLLLMELAFLLR VG++V WIT Q+ E ++V YSLEHKM GVQV+
Sbjct: 90 LVSHELSLSGGPLLLMELAFLLRQVGSQVVWITNQRSEETNDVTYSLEHKMLSHGVQVLP 149
Query: 139 AKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFKLDYVK 198
A+G E I+TALKADL++LNTAVAGKWLDAVL + VP+VLP +LWWIHEMRGHYFKL+YVK
Sbjct: 150 ARGHEAIDTALKADLVILNTAVAGKWLDAVLNDHVPQVLPKILWWIHEMRGHYFKLEYVK 209
Query: 199 HLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVAEDNVAKRVLRE 258
HLPLVAGAMIDSH TAEYWK RT +RL+I+MP TYVVHLGNSKELMEVAEDNVA+RVLRE
Sbjct: 210 HLPLVAGAMIDSHTTAEYWKTRTHDRLKIQMPQTYVVHLGNSKELMEVAEDNVARRVLRE 269
Query: 259 HVRESLGVRNEDLLFAIIN 277
H+RE LGVR+EDL+FAIIN
Sbjct: 270 HIREFLGVRSEDLVFAIIN 288
>gi|115435790|ref|NP_001042653.1| Os01g0262600 [Oryza sativa Japonica Group]
gi|56783832|dbj|BAD81244.1| unknown protein [Oryza sativa Japonica Group]
gi|113532184|dbj|BAF04567.1| Os01g0262600 [Oryza sativa Japonica Group]
gi|125569811|gb|EAZ11326.1| hypothetical protein OsJ_01190 [Oryza sativa Japonica Group]
Length = 482
Score = 326 bits (835), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 179/256 (69%), Positives = 205/256 (80%), Gaps = 11/256 (4%)
Query: 28 STAIAFFIRAALDPCDRHLEVSDKKRVQSQSVPRIATK------SSPLSFMKSKLVLLVS 81
STA A LDPCD D +R S +AT +PL FM+SKL+LLVS
Sbjct: 38 STAGFLLCGAMLDPCD-----VDARRGSGSSAAAVATTRTGAVAGNPLEFMRSKLMLLVS 92
Query: 82 HELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRGVQVISAKG 141
HELSLSGGPLLLMELAFLLR VG++V WIT Q+ E ++V YSLEHKM GVQV+ A+G
Sbjct: 93 HELSLSGGPLLLMELAFLLRQVGSQVVWITNQRSEETNDVTYSLEHKMLSHGVQVLPARG 152
Query: 142 QETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFKLDYVKHLP 201
E I+TALKADL++LNTAVAGKWLDAVL + VP+VLP +LWWIHEMRGHYFKL+YVKHLP
Sbjct: 153 HEAIDTALKADLVILNTAVAGKWLDAVLNDHVPQVLPKILWWIHEMRGHYFKLEYVKHLP 212
Query: 202 LVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVAEDNVAKRVLREHVR 261
LVAGAMIDSH TAEYWK RT +RL+I+MP TYVVHLGNSKELMEVAEDNVA+RVLREH+R
Sbjct: 213 LVAGAMIDSHTTAEYWKTRTHDRLKIQMPQTYVVHLGNSKELMEVAEDNVARRVLREHIR 272
Query: 262 ESLGVRNEDLLFAIIN 277
E LGVR+EDL+FAIIN
Sbjct: 273 EFLGVRSEDLVFAIIN 288
>gi|356564312|ref|XP_003550399.1| PREDICTED: proteasome-activating nucleotidase-like [Glycine max]
Length = 833
Score = 321 bits (823), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 173/219 (78%), Positives = 195/219 (89%), Gaps = 1/219 (0%)
Query: 60 PRIATK-SSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEE 118
PR +K +PL FMKSKLVL+VSHELSLSGGPLLLMELAFLLR G+ V WIT QKP +
Sbjct: 54 PRTDSKLPNPLEFMKSKLVLMVSHELSLSGGPLLLMELAFLLRSAGSDVVWITNQKPPKP 113
Query: 119 DEVIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLP 178
D+VIY+LE+KM DRGVQV+ A+G++ ++TA ADL++LNTAVAGKWLDAVLKE V VLP
Sbjct: 114 DDVIYTLENKMLDRGVQVVDARGEKAVDTARNADLVILNTAVAGKWLDAVLKEKVLEVLP 173
Query: 179 NVLWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLG 238
VLWWIHEMRGHYFK++YVKHLP VAGAMIDSH TAEYWKNRTRERL IKMP+TYVVHLG
Sbjct: 174 KVLWWIHEMRGHYFKVEYVKHLPFVAGAMIDSHTTAEYWKNRTRERLGIKMPETYVVHLG 233
Query: 239 NSKELMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
NSKELMEVAED+VAKRVLREHVR+SLGVRN+DLLFAIIN
Sbjct: 234 NSKELMEVAEDSVAKRVLREHVRQSLGVRNDDLLFAIIN 272
>gi|414876968|tpg|DAA54099.1| TPA: hypothetical protein ZEAMMB73_503773 [Zea mays]
Length = 473
Score = 321 bits (822), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 173/247 (70%), Positives = 202/247 (81%), Gaps = 9/247 (3%)
Query: 32 AFFIRAAL-DPCDRHLEVSDKKRVQSQSVPRIATKSSPLSFMKSKLVLLVSHELSLSGGP 90
F +R AL DPCD R ++ SPL FM+SKLVLLVSHELSLSGGP
Sbjct: 41 GFLLRGALRDPCD--------GRGDPAALNTAVASGSPLGFMRSKLVLLVSHELSLSGGP 92
Query: 91 LLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRGVQVISAKGQETINTALK 150
LLLMELAFLLR VG++V WIT Q+ E ++V YSLEH+M + GVQV+ A+GQE ++ ALK
Sbjct: 93 LLLMELAFLLRHVGSQVVWITNQRSQETNDVTYSLEHRMLNHGVQVLPARGQEAVDIALK 152
Query: 151 ADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFKLDYVKHLPLVAGAMIDS 210
ADL++LNTAVAGKWLD VLK+ VP+VLP +LWWIHEMRGHYFK++YVKHLP VAGAMIDS
Sbjct: 153 ADLVILNTAVAGKWLDPVLKDHVPKVLPKILWWIHEMRGHYFKVEYVKHLPFVAGAMIDS 212
Query: 211 HVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVAEDNVAKRVLREHVRESLGVRNED 270
H TAEYW +RT +RL+I+MP TYVVHLGNSKELMEVAEDNVA+RVLREH+RESLGVR+ED
Sbjct: 213 HTTAEYWNSRTSDRLKIQMPQTYVVHLGNSKELMEVAEDNVARRVLREHIRESLGVRSED 272
Query: 271 LLFAIIN 277
LLFAIIN
Sbjct: 273 LLFAIIN 279
>gi|357130079|ref|XP_003566684.1| PREDICTED: uncharacterized protein LOC100834487 [Brachypodium
distachyon]
Length = 470
Score = 320 bits (820), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 174/248 (70%), Positives = 205/248 (82%), Gaps = 14/248 (5%)
Query: 32 AFFIRAAL-DPCDRHLEVSDKKRVQSQSVPRI-ATKSSPLSFMKSKLVLLVSHELSLSGG 89
+ +R AL PCD +Q P + A +SPL FM+SK+VLLVSHELSLSGG
Sbjct: 41 GYLLRGALVPPCD------------AQRDPVVLAAPASPLGFMRSKIVLLVSHELSLSGG 88
Query: 90 PLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRGVQVISAKGQETINTAL 149
PLLLMELAFLLR VG++V WIT Q+P ++ YSLEHKM + GVQ++ A+GQE I+TAL
Sbjct: 89 PLLLMELAFLLRQVGSQVVWITNQQPEGTNDASYSLEHKMLNHGVQILPARGQEAIDTAL 148
Query: 150 KADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFKLDYVKHLPLVAGAMID 209
KADL++LNTAVAGKWLDAVLK+ VP+VLP +LWWIHEMRGHYFKL+YVKHLPLVAGAMID
Sbjct: 149 KADLVILNTAVAGKWLDAVLKDHVPQVLPKILWWIHEMRGHYFKLEYVKHLPLVAGAMID 208
Query: 210 SHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVAEDNVAKRVLREHVRESLGVRNE 269
SH TAEYWK RT +RL I+MP TYVVHLGNS+ELMEVAEDNVA+RVLREH+RESLGVR+E
Sbjct: 209 SHTTAEYWKTRTHDRLNIQMPQTYVVHLGNSEELMEVAEDNVARRVLREHIRESLGVRSE 268
Query: 270 DLLFAIIN 277
DL+FA+IN
Sbjct: 269 DLIFAVIN 276
>gi|255648383|gb|ACU24642.1| unknown [Glycine max]
Length = 463
Score = 316 bits (810), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 172/219 (78%), Positives = 194/219 (88%), Gaps = 1/219 (0%)
Query: 60 PRIATK-SSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEE 118
PR +K +PL FMKSKLVL+VSHELSLSGGPLLLMELAFLLR G+ V IT QKP +
Sbjct: 54 PRTDSKLPNPLEFMKSKLVLMVSHELSLSGGPLLLMELAFLLRSAGSDVVRITNQKPPKP 113
Query: 119 DEVIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLP 178
D+VIY+LE+KM DRGVQV+ A+G++ ++TA ADL++LNTAVAGKWLDAVLKE V VLP
Sbjct: 114 DDVIYTLENKMLDRGVQVVDARGEKAVDTARNADLVILNTAVAGKWLDAVLKEKVLEVLP 173
Query: 179 NVLWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLG 238
VLWWIHEMRGHYFK++YVKHLP VAGAMIDSH TAEYWKNRTRERL IKMP+TYVVHLG
Sbjct: 174 KVLWWIHEMRGHYFKVEYVKHLPFVAGAMIDSHTTAEYWKNRTRERLGIKMPETYVVHLG 233
Query: 239 NSKELMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
NSKELMEVAED+VAKRVLREHVR+SLGVRN+DLLFAIIN
Sbjct: 234 NSKELMEVAEDSVAKRVLREHVRQSLGVRNDDLLFAIIN 272
>gi|10086496|gb|AAG12556.1|AC007797_16 Unknown Protein [Arabidopsis thaliana]
Length = 458
Score = 310 bits (795), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 180/262 (68%), Positives = 202/262 (77%), Gaps = 8/262 (3%)
Query: 1 MGKHSATGWWVPLTKRWILALLIMLSISTAIAFFIRAALDPCD-----RHLEVSDKKRVQ 55
M K S + W KRW L +L++LS+ST +R+ D C E D ++
Sbjct: 1 MAKPSTSMWATLQKKRWPLMILLVLSVSTVGMILVRSTFDSCSVSGKRCSREKEDNSDIK 60
Query: 56 SQSVPRIATKSSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKP 115
QSV + +PL FMKSKLVLLVSHELSLSGGPLLLMELAFLLRGV ++V WIT QKP
Sbjct: 61 IQSV---SGSLNPLEFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVESEVVWITNQKP 117
Query: 116 SEEDEVIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPR 175
EEDEVI LEHKM DRGVQVISAK Q+ I+TALK+DL+VLNTAVAGKWLDAVLK++VP+
Sbjct: 118 VEEDEVIKVLEHKMLDRGVQVISAKSQKAIDTALKSDLVVLNTAVAGKWLDAVLKDNVPK 177
Query: 176 VLPNVLWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVV 235
VLP VLWWIHEMRGHYFK D VKHLP VAGAMIDSH TAEYWKNRT +RL IKMP TYVV
Sbjct: 178 VLPKVLWWIHEMRGHYFKPDLVKHLPFVAGAMIDSHATAEYWKNRTHDRLGIKMPKTYVV 237
Query: 236 HLGNSKELMEVAEDNVAKRVLR 257
HLGNSKELMEVAED+ AK VLR
Sbjct: 238 HLGNSKELMEVAEDSFAKNVLR 259
>gi|10120445|gb|AAG13070.1|AC023754_8 Hypothetical protein [Arabidopsis thaliana]
Length = 402
Score = 310 bits (794), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 160/191 (83%), Positives = 171/191 (89%)
Query: 87 SGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRGVQVISAKGQETIN 146
S GPLLLMELAFLLRGVG V WIT QKP E+DEV+YSLEHKM DRGVQVISAKGQ+ ++
Sbjct: 18 SRGPLLLMELAFLLRGVGADVVWITNQKPLEDDEVVYSLEHKMLDRGVQVISAKGQKAVD 77
Query: 147 TALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFKLDYVKHLPLVAGA 206
T+LKADLIVLNTAVAGKWLDAVLKE+V +VLP +LWWIHEMRGHYF D VKHLP VAGA
Sbjct: 78 TSLKADLIVLNTAVAGKWLDAVLKENVVKVLPKILWWIHEMRGHYFNADLVKHLPFVAGA 137
Query: 207 MIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVAEDNVAKRVLREHVRESLGV 266
MIDSH TA YWKNRT+ RL IKMP TYVVHLGNSKELMEVAED+VAKRVLREHVRESLGV
Sbjct: 138 MIDSHATAGYWKNRTQARLGIKMPKTYVVHLGNSKELMEVAEDSVAKRVLREHVRESLGV 197
Query: 267 RNEDLLFAIIN 277
RNEDLLF IIN
Sbjct: 198 RNEDLLFGIIN 208
>gi|297735089|emb|CBI17451.3| unnamed protein product [Vitis vinifera]
Length = 399
Score = 301 bits (772), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 165/206 (80%), Positives = 185/206 (89%)
Query: 72 MKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWD 131
MKSK VLLVSHELSLSGGPLLLMELAFLLR VG +V WIT KPSE DEVIYSLE+KM
Sbjct: 1 MKSKRVLLVSHELSLSGGPLLLMELAFLLRSVGAEVCWITNHKPSETDEVIYSLENKMQH 60
Query: 132 RGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHY 191
RGVQV+ AKG+E IN ALKADL+VLNT +AGKWLD VLKE+VP VLP VLWWIHE++GHY
Sbjct: 61 RGVQVLPAKGREAINRALKADLLVLNTVMAGKWLDDVLKENVPHVLPKVLWWIHEIQGHY 120
Query: 192 FKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVAEDNV 251
F+L+YV+HLPLVA AMIDSHV AEYWK TR+ LRIKMPDTYVVHLGNSK+LM++AED+V
Sbjct: 121 FQLEYVRHLPLVAAAMIDSHVAAEYWKTGTRDYLRIKMPDTYVVHLGNSKDLMDIAEDSV 180
Query: 252 AKRVLREHVRESLGVRNEDLLFAIIN 277
AKRVLREHVRESLGVR+ED+LFA+IN
Sbjct: 181 AKRVLREHVRESLGVRDEDVLFAMIN 206
>gi|212275998|ref|NP_001130447.1| uncharacterized protein LOC100191545 [Zea mays]
gi|194689152|gb|ACF78660.1| unknown [Zea mays]
gi|414876969|tpg|DAA54100.1| TPA: hypothetical protein ZEAMMB73_503773 [Zea mays]
Length = 399
Score = 251 bits (642), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 114/143 (79%), Positives = 132/143 (92%)
Query: 135 QVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFKL 194
QV+ A+GQE ++ ALKADL++LNTAVAGKWLD VLK+ VP+VLP +LWWIHEMRGHYFK+
Sbjct: 63 QVLPARGQEAVDIALKADLVILNTAVAGKWLDPVLKDHVPKVLPKILWWIHEMRGHYFKV 122
Query: 195 DYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVAEDNVAKR 254
+YVKHLP VAGAMIDSH TAEYW +RT +RL+I+MP TYVVHLGNSKELMEVAEDNVA+R
Sbjct: 123 EYVKHLPFVAGAMIDSHTTAEYWNSRTSDRLKIQMPQTYVVHLGNSKELMEVAEDNVARR 182
Query: 255 VLREHVRESLGVRNEDLLFAIIN 277
VLREH+RESLGVR+EDLLFAIIN
Sbjct: 183 VLREHIRESLGVRSEDLLFAIIN 205
>gi|224062083|ref|XP_002300746.1| predicted protein [Populus trichocarpa]
gi|222842472|gb|EEE80019.1| predicted protein [Populus trichocarpa]
Length = 222
Score = 239 bits (611), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 112/129 (86%), Positives = 123/129 (95%)
Query: 149 LKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFKLDYVKHLPLVAGAMI 208
+KADL+VLNTAVAGKWL+ VLKE+V +VLP VLWWIHEMRGHYFKL+YVKHLP VAGAMI
Sbjct: 1 MKADLVVLNTAVAGKWLEGVLKENVKQVLPKVLWWIHEMRGHYFKLEYVKHLPFVAGAMI 60
Query: 209 DSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVAEDNVAKRVLREHVRESLGVRN 268
DSH TAEYWKNRTRERL IKMP+TYVVHLGNSK+LMEVAED+VAKRVLREHVRESLGVR+
Sbjct: 61 DSHTTAEYWKNRTRERLGIKMPETYVVHLGNSKDLMEVAEDSVAKRVLREHVRESLGVRD 120
Query: 269 EDLLFAIIN 277
+DLLFAIIN
Sbjct: 121 DDLLFAIIN 129
>gi|302786982|ref|XP_002975261.1| glycosyltransferase, CAZy family GT4 [Selaginella moellendorffii]
gi|300156835|gb|EFJ23462.1| glycosyltransferase, CAZy family GT4 [Selaginella moellendorffii]
Length = 452
Score = 236 bits (602), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 140/275 (50%), Positives = 189/275 (68%), Gaps = 12/275 (4%)
Query: 8 GWWVPLTKRWILALLIMLSISTAIAFFIRAALDPCDRHLEVSDKKRVQSQSVPRIATKSS 67
G +P ++ L L+ L ++ A F L L S R + + + + +
Sbjct: 2 GGAIPGKRKAGLRSLLKLGVA-AFGFLAGIGL------LVGSYSVRHPAAGISGVDRRDA 54
Query: 68 P---LSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQK-PSEEDEVIY 123
P L FM+ K V+LVSHEL+L+GGPLLLMELA LL+ G V W+TI K EV
Sbjct: 55 PPHGLHFMRGKNVVLVSHELTLTGGPLLLMELAVLLKNAGATVQWMTINKRDGAGSEVTD 114
Query: 124 SLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKE-DVPRVLPNVLW 182
+LE ++ ++G+ ++ AKG+ET+ A+ +DL+VLNTAVAGKW+D+ LKE D RVLP VLW
Sbjct: 115 NLEQRLQNKGILLVPAKGEETVRAAVDSDLVVLNTAVAGKWIDSTLKESDQQRVLPKVLW 174
Query: 183 WIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKE 242
WIHEMRGHYF L+YVKH+P VA MIDSH TAEYWKNRT++RL IK+P +VVHLGNSK+
Sbjct: 175 WIHEMRGHYFTLNYVKHMPEVAAVMIDSHATAEYWKNRTQQRLGIKIPKVHVVHLGNSKD 234
Query: 243 LMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
L E AE+ +A+ +LR+HVRESLG+ + D++F+ IN
Sbjct: 235 LTEAAENPLARHLLRQHVRESLGISDRDVMFSAIN 269
>gi|168008346|ref|XP_001756868.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162692106|gb|EDQ78465.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 452
Score = 214 bits (545), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 116/223 (52%), Positives = 158/223 (70%), Gaps = 5/223 (2%)
Query: 56 SQSVPRIATKSSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWIT-IQK 114
S S+P K L+FM+ K V++VSHELSLSGGPLLLMEL +LR G V W+T +K
Sbjct: 43 SSSLP----KPPGLNFMEGKSVVVVSHELSLSGGPLLLMELGHILRRSGAFVYWVTGNKK 98
Query: 115 PSEEDEVIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVP 174
+ D V+ LE K+ + G+QVI A+G T++ ADL++LNTAVAGKW+ + K D+
Sbjct: 99 ENTSDPVVVFLEEKLLNHGLQVIPARGTRTVSALTTADLVILNTAVAGKWVSSAFKADIK 158
Query: 175 RVLPNVLWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYV 234
++L LWWIHEMRGHYF +YVK LP VAG + DSH TA+YW+ RTR+RLR+ +P +V
Sbjct: 159 KLLAKTLWWIHEMRGHYFAPEYVKFLPEVAGVITDSHATADYWRTRTRDRLRMTLPKMHV 218
Query: 235 VHLGNSKELMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
VHLGNS++LM AED V + +R+ VR+ +G+ D++FA+IN
Sbjct: 219 VHLGNSQQLMLDAEDAVGRASMRQRVRQIVGIFENDIVFAMIN 261
>gi|72536739|gb|AAZ73389.1| At1g19710 [Arabidopsis thaliana]
gi|72536741|gb|AAZ73390.1| At1g19710 [Arabidopsis thaliana]
gi|72536743|gb|AAZ73391.1| At1g19710 [Arabidopsis thaliana]
gi|72536745|gb|AAZ73392.1| At1g19710 [Arabidopsis thaliana]
gi|72536747|gb|AAZ73393.1| At1g19710 [Arabidopsis thaliana]
gi|72536749|gb|AAZ73394.1| At1g19710 [Arabidopsis thaliana]
gi|72536753|gb|AAZ73396.1| At1g19710 [Arabidopsis thaliana]
gi|72536755|gb|AAZ73397.1| At1g19710 [Arabidopsis thaliana]
gi|72536757|gb|AAZ73398.1| At1g19710 [Arabidopsis thaliana]
gi|72536759|gb|AAZ73399.1| At1g19710 [Arabidopsis thaliana]
gi|72536761|gb|AAZ73400.1| At1g19710 [Arabidopsis thaliana]
gi|72536763|gb|AAZ73401.1| At1g19710 [Arabidopsis thaliana]
gi|72536765|gb|AAZ73402.1| At1g19710 [Arabidopsis thaliana]
gi|72536767|gb|AAZ73403.1| At1g19710 [Arabidopsis thaliana]
gi|72536769|gb|AAZ73404.1| At1g19710 [Arabidopsis thaliana]
gi|72536771|gb|AAZ73405.1| At1g19710 [Arabidopsis thaliana]
Length = 231
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 89/105 (84%), Positives = 93/105 (88%)
Query: 173 VPRVLPNVLWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDT 232
VP+VLP VLWWIHEMRGHYFK D VKHLP VAGAMIDSH TAEYWKNRT +RL IKMP T
Sbjct: 1 VPKVLPKVLWWIHEMRGHYFKPDLVKHLPFVAGAMIDSHATAEYWKNRTHDRLGIKMPKT 60
Query: 233 YVVHLGNSKELMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
YVVHLGNSKELMEVAED+ AK VLRE VRESLGVRNED+LF IIN
Sbjct: 61 YVVHLGNSKELMEVAEDSFAKNVLREQVRESLGVRNEDILFGIIN 105
>gi|72536751|gb|AAZ73395.1| At1g19710 [Arabidopsis thaliana]
Length = 231
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 89/105 (84%), Positives = 93/105 (88%)
Query: 173 VPRVLPNVLWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDT 232
VP+VLP VLWWIHEMRGHYFK D VKHLP VAGAMIDSH TAEYWKNRT +RL IKMP T
Sbjct: 1 VPKVLPKVLWWIHEMRGHYFKPDLVKHLPFVAGAMIDSHATAEYWKNRTHDRLGIKMPKT 60
Query: 233 YVVHLGNSKELMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
YVVHLGNSKELMEVAED+ AK VLRE VRESLGVRNED+LF IIN
Sbjct: 61 YVVHLGNSKELMEVAEDSFAKNVLREQVRESLGVRNEDILFGIIN 105
>gi|72536773|gb|AAZ73406.1| At1g19710-like protein [Arabidopsis lyrata]
Length = 231
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 87/105 (82%), Positives = 93/105 (88%)
Query: 173 VPRVLPNVLWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDT 232
VP+VLP VLWWIHEMRGHYF+ D VKHLP VAGAMIDSH TAEYWKNRT +RL I+MP T
Sbjct: 1 VPKVLPKVLWWIHEMRGHYFEPDLVKHLPFVAGAMIDSHATAEYWKNRTHDRLGIEMPKT 60
Query: 233 YVVHLGNSKELMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
YVVHLGNSKELMEVAED+ AK VLRE VRESLGVRNED+LF IIN
Sbjct: 61 YVVHLGNSKELMEVAEDSFAKNVLREQVRESLGVRNEDILFGIIN 105
>gi|388505262|gb|AFK40697.1| unknown [Medicago truncatula]
Length = 223
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 80/91 (87%), Positives = 84/91 (92%)
Query: 187 MRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEV 246
MRGHYFK YVKHLP VAGAMIDSH TA YWKNRT+E L IKMP+TYVVHLGNSKELMEV
Sbjct: 1 MRGHYFKEAYVKHLPFVAGAMIDSHTTAGYWKNRTKECLGIKMPETYVVHLGNSKELMEV 60
Query: 247 AEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
AED+VAKRVLREHVRESLGVRN+DLLFAIIN
Sbjct: 61 AEDSVAKRVLREHVRESLGVRNDDLLFAIIN 91
>gi|302811486|ref|XP_002987432.1| hypothetical protein SELMODRAFT_426176 [Selaginella moellendorffii]
gi|300144838|gb|EFJ11519.1| hypothetical protein SELMODRAFT_426176 [Selaginella moellendorffii]
Length = 240
Score = 87.0 bits (214), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 67/182 (36%), Positives = 82/182 (45%), Gaps = 58/182 (31%)
Query: 69 LSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHK 128
L F++ ++V+LVSHEL+L+G L + G V WITI+K
Sbjct: 34 LQFIRGRIVVLVSHELTLTG-VLSAHGARCASQNAGATVQWITIEKKE------------ 80
Query: 129 MWDRGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVL-PNVLWWIHEM 187
K L A + +V R P VLWWI
Sbjct: 81 ----------------------------------KRLSA--QREVDRFYAPKVLWWIP-- 102
Query: 188 RGHYFKLDYVKHLP--LVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELME 245
GHYF L+YV P LVA MIDSH AEYW NRT++ L IK+P VHLGNSK L E
Sbjct: 103 -GHYFTLNYVNVCPRLLVAAVMIDSHAMAEYWTNRTQQHLGIKIPK---VHLGNSKHLTE 158
Query: 246 VA 247
A
Sbjct: 159 AA 160
>gi|302782081|ref|XP_002972814.1| glycosyltransferase, CAZy family GT4 [Selaginella moellendorffii]
gi|300159415|gb|EFJ26035.1| glycosyltransferase, CAZy family GT4 [Selaginella moellendorffii]
Length = 614
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 68/257 (26%), Positives = 127/257 (49%), Gaps = 29/257 (11%)
Query: 33 FFIRAALDPCD----RHLEVSDKKRVQSQSVPRIATKSSPLSFMKSKLVLLVSHELSLSG 88
+F+ + P D + L +S +K+ +++ + + F K V+++ HELS++G
Sbjct: 190 YFLGRVVGPYDELEQKFLGMSTEKKTPARTCS--SDEGRFKEFASGKRVVVLMHELSMTG 247
Query: 89 GPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRGVQVISAKGQETINTA 148
PL +MELA + G G KV+ + + + L +++ R + V++ K ++ A
Sbjct: 248 SPLAMMELASEIIGCGGKVSVVVLDRRG-------GLLNELVQRRIPVLADKAAKSWRAA 300
Query: 149 LKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFK-----LDYVKHLPLV 203
KADL++ +A+ W+ L+ + V+WW+ E R YF LD V+ L +
Sbjct: 301 AKADLVIAGSALCASWIGEYLRYH-KKGASKVVWWVMENRRLYFDRSKRILDKVRALVFL 359
Query: 204 AGAMIDSHVTAEYWKNRTRERLRIKMPD-TYVVHLGNSKELMEVA--EDNVAKRVLREHV 260
S A+ W+ +R I +P T +V L + ++ A +D + LRE V
Sbjct: 360 ------SKTQADQWREWSRGE-NISLPSLTTIVSLSVNDAVLSAAGIDDALKMAKLREEV 412
Query: 261 RESLGVRNEDLLFAIIN 277
R+ LG++ +D+L A ++
Sbjct: 413 RKDLGLKPDDVLLATLS 429
>gi|449464260|ref|XP_004149847.1| PREDICTED: uncharacterized protein LOC101207532 [Cucumis sativus]
gi|449496350|ref|XP_004160111.1| PREDICTED: uncharacterized protein LOC101223486 [Cucumis sativus]
Length = 682
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 64/219 (29%), Positives = 104/219 (47%), Gaps = 35/219 (15%)
Query: 74 SKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRG 133
S+ +L+ HELS++G P+ +MELA L G V+ + + K L ++ R
Sbjct: 242 SRRFVLIFHELSMTGAPISMMELATELLSCGASVSAVALSKKG-------GLMSELSRRR 294
Query: 134 VQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFK 193
++V+ K + TA+KADL++ +AV W+D + E P V WWI E R YF
Sbjct: 295 IKVLDDKADLSFKTAMKADLVIAGSAVCASWIDGYI-EHFPAGASQVAWWIMENRREYFN 353
Query: 194 -----LDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKM-PDTYVVHLGNSKELMEVA 247
LD VK L + S + ++ W N ++E IK+ +V L + EL VA
Sbjct: 354 RSKVVLDRVKMLIFI------SELQSKQWLNWSQEE-NIKLRSQPAIVPLSVNDELAFVA 406
Query: 248 EDNVA--------------KRVLREHVRESLGVRNEDLL 272
+ + K++LR R+ +GV + D++
Sbjct: 407 GISCSLNTESSSPEKMLEKKQLLRNTTRKEMGVGDNDVV 445
>gi|15218994|ref|NP_175651.1| UDP-glycosyltransferase-like protein [Arabidopsis thaliana]
gi|5903062|gb|AAD55621.1|AC008016_31 Is a member of PF|00534 Glycosyl transferases group 1. EST
gb|N96702 comes from this gene [Arabidopsis thaliana]
gi|12323120|gb|AAG51540.1|AC037424_5 glycosyl transferase, putative; 4406-2038 [Arabidopsis thaliana]
gi|27311817|gb|AAO00874.1| glycosyl transferase, putative [Arabidopsis thaliana]
gi|30725510|gb|AAP37777.1| At1g52420 [Arabidopsis thaliana]
gi|110742556|dbj|BAE99193.1| putative glycosyl transferase [Arabidopsis thaliana]
gi|332194684|gb|AEE32805.1| UDP-glycosyltransferase-like protein [Arabidopsis thaliana]
Length = 670
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 64/218 (29%), Positives = 102/218 (46%), Gaps = 33/218 (15%)
Query: 74 SKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRG 133
S+ +L+ HELS++G P+ +MELA L G V+ + + + L ++ R
Sbjct: 233 SRRFVLLFHELSMTGAPISMMELASELLSCGATVSAVVLSRRG-------GLMQELSRRR 285
Query: 134 VQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFK 193
++V+ KG+ + TA+KADLI+ +AV W+D + P + WWI E R YF
Sbjct: 286 IKVVEDKGELSFKTAMKADLIIAGSAVCTSWIDQYMNHH-PAGGSQIAWWIMENRREYFD 344
Query: 194 -----LDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVA- 247
LD VK L ++ + +T W +LR +V L + EL VA
Sbjct: 345 RAKPVLDRVKMLIFLSESQSRQWLT---WCEEEHIKLR---SQPVIVPLSVNDELAFVAG 398
Query: 248 -------------EDNVAKRVLREHVRESLGVRNEDLL 272
+ V +++LRE VR LG+ + D+L
Sbjct: 399 IPSSLNTPTLSPEKMRVKRQILRESVRTELGITDSDML 436
>gi|255574530|ref|XP_002528176.1| glycosyltransferase, putative [Ricinus communis]
gi|223532388|gb|EEF34183.1| glycosyltransferase, putative [Ricinus communis]
Length = 686
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 68/265 (25%), Positives = 121/265 (45%), Gaps = 38/265 (14%)
Query: 28 STAIAFFIRAALDPCDRHLEVSDKKRVQSQSVPRIATKSSPLSFMKSKLVLLVSHELSLS 87
+T F + DR LE S +KR + K + S+ +L+ HELS++
Sbjct: 208 NTTYGFLVGPFGSTEDRILEWSPEKRTGT-----CDRKGDFARLVWSRKFVLIFHELSMT 262
Query: 88 GGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRGVQVISAKGQETINT 147
G PL +MELA G V+ + + K L ++ R ++V+ K + T
Sbjct: 263 GAPLSMMELATEFLSCGATVSAVVLSKKG-------GLMSELNRRRIKVLEDKADLSFKT 315
Query: 148 ALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFK-----LDYVKHLPL 202
A+KADL++ +AV W+D + P ++WWI E R YF L+ VK L
Sbjct: 316 AMKADLVIAGSAVCASWIDQYMTR-FPAGGSQIVWWIMENRREYFDRSKIVLNRVKMLVF 374
Query: 203 VAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVAEDNVA---------- 252
++ + + ++ W + E+++++ P +V L + EL VA +
Sbjct: 375 LSESQTEQWLS---WCD--EEKIKLRAPPA-IVPLSINDELAFVAGIACSLNTPSSSPEK 428
Query: 253 ----KRVLREHVRESLGVRNEDLLF 273
+R+L + VR+ +G+ ++D+L
Sbjct: 429 MLEKRRLLADSVRKEMGLTDDDVLL 453
>gi|224054196|ref|XP_002298139.1| predicted protein [Populus trichocarpa]
gi|222845397|gb|EEE82944.1| predicted protein [Populus trichocarpa]
Length = 681
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 73/250 (29%), Positives = 119/250 (47%), Gaps = 40/250 (16%)
Query: 43 DRHLEVSDKKRVQSQSVPRIATKSSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRG 102
DR LE S +KR S + R K + + S+ +L+ HELS++G PL ++ELA
Sbjct: 222 DRILEWSPEKR--SGTCDR---KGAFARLVWSRKFVLIFHELSMTGAPLSMLELATEFLS 276
Query: 103 VGTKVNWITIQKPSEEDEVIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAG 162
G V+ + + K L ++ R ++V+ + + TA+KADL++ +AV
Sbjct: 277 CGATVSAVVLSKKG-------GLMPELARRRIKVLEDRADLSFKTAMKADLVIAGSAVCT 329
Query: 163 KWLDAVLKEDVPRVLPNVLWWIHEMRGHYFK-----LDYVKHLPLVAGAMIDSHVTAEYW 217
W+D + P V+WWI E R YF L+ VK L ++ + + + W
Sbjct: 330 SWIDQYIAR-FPAGGSQVVWWIMENRREYFDRSKIILNRVKMLVFLSESQM------KQW 382
Query: 218 KN-RTRERLRIKMPDTYVVHLGNSKELMEVA-------------EDNVAKR-VLREHVRE 262
+ E +R++ P VV L + EL VA E + KR +LRE VR+
Sbjct: 383 QTWCEEENIRLRSPPA-VVQLSVNDELAFVAGIACSLNTPTSSSEKMLEKRQLLRESVRK 441
Query: 263 SLGVRNEDLL 272
+G+ + D+L
Sbjct: 442 EMGLTDNDML 451
>gi|297847636|ref|XP_002891699.1| glycosyl transferase family 1 protein [Arabidopsis lyrata subsp.
lyrata]
gi|297337541|gb|EFH67958.1| glycosyl transferase family 1 protein [Arabidopsis lyrata subsp.
lyrata]
Length = 670
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 58/212 (27%), Positives = 106/212 (50%), Gaps = 21/212 (9%)
Query: 74 SKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRG 133
S+ +L+ HELS++G P+ +MELA L G V+ + + + L ++ R
Sbjct: 233 SRRFVLLFHELSMTGAPISMMELASELLSCGATVSAVVLSRRG-------GLMQELTRRR 285
Query: 134 VQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFK 193
++V+ KG+ + T++KADL++ +AV W+D + P + WWI E R YF
Sbjct: 286 IKVVEDKGELSFKTSMKADLVIAGSAVCTSWIDQYMNHH-PAGGSQIAWWIMENRREYFD 344
Query: 194 -----LDYVKHLPLVAGAMIDSHVT---AEYWKNRTRE---RLRIKMPDTYVVHLGNSKE 242
LD+VK L ++ + +T E+ K R++ L + +V + +S
Sbjct: 345 RAKPVLDHVKMLIFLSESQSRQWLTWCEEEHIKLRSQPVIVPLSVNDELAFVAGIPSSLN 404
Query: 243 LMEVAEDNV--AKRVLREHVRESLGVRNEDLL 272
++ + + +++LRE VR LG+ + D+L
Sbjct: 405 TPTLSPEKMREKRQILRESVRMELGLTDADML 436
>gi|356547414|ref|XP_003542107.1| PREDICTED: uncharacterized protein LOC100795000 [Glycine max]
Length = 701
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 65/221 (29%), Positives = 106/221 (47%), Gaps = 39/221 (17%)
Query: 74 SKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRG 133
S+ +L+ HELS++G PL +MELA L G V+ + + + L ++ R
Sbjct: 246 SRRFILIFHELSMTGAPLSMMELATELLSCGATVSAVVLSRKG-------GLMSELARRR 298
Query: 134 VQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFK 193
++V+ K + TA+KADL++ +AV W++ + + P V WWI E R YF
Sbjct: 299 IKVLEDKSDLSFKTAMKADLVIAGSAVCASWIEQYI-DHFPAGASQVAWWIMENRREYFD 357
Query: 194 -----LDYVKHLPLVAGAMIDSHVTAEYWKNRTRE---RLRIKMPDTYVVHLGNSKELME 245
L VK L + S ++ W+ E +LR +P+ +V L ++EL
Sbjct: 358 RSKDILHRVKMLVFL------SESQSKQWQKWCEEESIKLR-SLPE--IVALSVNEELAF 408
Query: 246 VA-------------EDNVAKR-VLREHVRESLGVRNEDLL 272
VA E V K+ +LRE VR+ +G+ + D+L
Sbjct: 409 VAGIPSTLNTPSFSTEKMVEKKQLLRESVRKEMGLTDNDML 449
>gi|356555993|ref|XP_003546312.1| PREDICTED: uncharacterized protein LOC100793827 [Glycine max]
Length = 701
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 99/218 (45%), Gaps = 33/218 (15%)
Query: 74 SKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRG 133
S+ +L+ HELS++G PL +MELA L G V+ + + + L ++ R
Sbjct: 246 SRRFILIFHELSMTGAPLSMMELATELLSCGATVSAVVLSRKG-------GLMSELARRR 298
Query: 134 VQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFK 193
++V+ K + TA+KADL++ +AV W++ + E P V WWI E R YF
Sbjct: 299 IKVLEDKADLSFKTAMKADLVIAGSAVCASWIEQYI-EHFPAGASQVAWWIMENRREYFD 357
Query: 194 -----LDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVA- 247
L VK L + S ++ W+ E +V L + EL VA
Sbjct: 358 RSKDVLHRVKMLVFL------SESQSKQWQKWCEEESIKLRSHPEIVPLSVNDELAFVAG 411
Query: 248 ------------EDNVAKR-VLREHVRESLGVRNEDLL 272
E V K+ +LRE VR+ +G+ + D+L
Sbjct: 412 IPSTLNTPSFSTEKMVEKKQLLRESVRKEMGLTDNDML 449
>gi|15233237|ref|NP_188215.1| UDP-glycosyltransferase-like protein [Arabidopsis thaliana]
gi|334185383|ref|NP_001189906.1| UDP-glycosyltransferase-like protein [Arabidopsis thaliana]
gi|9294599|dbj|BAB02880.1| glycosyl transferases-like protein [Arabidopsis thaliana]
gi|20147191|gb|AAM10311.1| AT3g15940/MVC8_7 [Arabidopsis thaliana]
gi|22796166|emb|CAD45267.1| putative glycosyltransferase [Arabidopsis thaliana]
gi|332642228|gb|AEE75749.1| UDP-glycosyltransferase-like protein [Arabidopsis thaliana]
gi|332642229|gb|AEE75750.1| UDP-glycosyltransferase-like protein [Arabidopsis thaliana]
Length = 697
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 71/249 (28%), Positives = 112/249 (44%), Gaps = 38/249 (15%)
Query: 43 DRHLEVSDKKRVQSQSVPRIATKSSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRG 102
DR LE S +KR S + R KS + S+ +L+ HELS++G P+ +MELA L
Sbjct: 209 DRILEWSPQKR--SGTCDR---KSDFKRLVWSRRFVLLFHELSMTGAPISMMELASELLS 263
Query: 103 VGTKVNWITIQKPSEEDEVIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAG 162
G V + + + L ++ R ++V+ KG+ + TA+KADL++ +AV
Sbjct: 264 CGATVYAVVLSRRG-------GLLQELTRRRIKVVEDKGELSFKTAMKADLVIAGSAVCA 316
Query: 163 KWLDAVLKEDVPRVLPNVLWWIHEMRGHYFK-----LDYVKHLPLVAGAMIDSHVTAEYW 217
W+D + P + WW+ E R YF LD VK L + S V ++ W
Sbjct: 317 SWIDQYMDHH-PAGGSQIAWWVMENRREYFDRAKPVLDRVKLLIFL------SEVQSKQW 369
Query: 218 KNRTRERLRIKMPDTYVVHLGNSKELMEVAEDNVA--------------KRVLREHVRES 263
E +V L + EL VA + + ++ LRE VR
Sbjct: 370 LTWCEEDHVKLRSQPVIVPLSVNDELAFVAGVSSSLNTPTLTQETMKEKRQKLRESVRTE 429
Query: 264 LGVRNEDLL 272
G+ ++D+L
Sbjct: 430 FGLTDKDML 438
>gi|297834468|ref|XP_002885116.1| glycosyl transferase family 1 protein [Arabidopsis lyrata subsp.
lyrata]
gi|297330956|gb|EFH61375.1| glycosyl transferase family 1 protein [Arabidopsis lyrata subsp.
lyrata]
Length = 696
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 76/250 (30%), Positives = 114/250 (45%), Gaps = 40/250 (16%)
Query: 43 DRHLEVSDKKRVQSQSVPRIATKSSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRG 102
DR LE S +KR S + R KS + S+ +L+ HELS++G P+ +MELA L
Sbjct: 209 DRVLEWSPQKR--SGTCDR---KSDFKRLVWSRRFVLLFHELSMTGAPISMMELASELLS 263
Query: 103 VGTKVNWITIQKPSEEDEVIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAG 162
G V + + + L ++ R ++V+ KG+ + TA+KADL++ +AV
Sbjct: 264 CGATVYAVVLSRRG-------GLLQELTRRRIKVVEDKGELSFKTAMKADLVIAGSAVCA 316
Query: 163 KWLDAVLKEDVPRVLPNVLWWIHEMRGHYFK-----LDYVKHLPLVAGAMIDSHVTAEYW 217
W+D + P + WW+ E R YF LD VK L + S V ++ W
Sbjct: 317 SWIDQYMDHH-PAGGSQIAWWVMENRREYFDRAKPVLDRVKLLIFL------SEVQSKQW 369
Query: 218 KNRTRERLRIKM-PDTYVVHLGNSKELMEVA-------------EDNVAKRV-LREHVRE 262
E IK+ +V L + EL VA E KR LRE VR
Sbjct: 370 LTWCEED-HIKLRSQPVIVPLSVNDELAFVAGIYSSLNTPTLTQEMMKEKRQKLRESVRT 428
Query: 263 SLGVRNEDLL 272
G+ ++D+L
Sbjct: 429 EFGLTDKDML 438
>gi|357473203|ref|XP_003606886.1| Glycosyl transferases-like protein [Medicago truncatula]
gi|355507941|gb|AES89083.1| Glycosyl transferases-like protein [Medicago truncatula]
Length = 634
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 58/233 (24%), Positives = 108/233 (46%), Gaps = 34/233 (14%)
Query: 57 QSVPRIATKSSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPS 116
+ + KS + SK +L+ HELS++G PL +MELA L G V+ + + +
Sbjct: 177 KGCGKCDKKSEFAQVVMSKSFVLIFHELSMTGAPLSMMELATELLSCGANVSAVVLSRKG 236
Query: 117 EEDEVIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRV 176
L ++ R ++VI K + T++ A L++ +AV W++ + E P
Sbjct: 237 -------GLMQELVRRQIKVIDDKVDHSFKTSMNAHLVIAGSAVCASWIEQYI-EYSPAA 288
Query: 177 LPNVLWWIHEMRGHYFK-----LDYVKHLPLVAGAMIDSHVTAEYWKN-RTRERLRIKMP 230
+V+WWI E R YF L+ V+ L + S + ++ W+ E +++++
Sbjct: 289 ANHVVWWIMENRREYFDRSKDVLNKVRMLIFL------SELQSKKWQKWCDEESIKLRLQ 342
Query: 231 DTYV-----------VHLGNSKELMEVAEDNVAKRVLREHVRESLGVRNEDLL 272
+V L +S + ++ E +++LR VR LG+ + D+L
Sbjct: 343 PAHVPLSVNDKLAFSAGLHSSSDAEKIDE---KRKLLRASVRRELGLNDNDML 392
>gi|326532396|dbj|BAK05127.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 231
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 32/36 (88%), Positives = 35/36 (97%)
Query: 242 ELMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
EL EVAEDNVA+RVLREH+RESLGVR+EDLLFAIIN
Sbjct: 1 ELTEVAEDNVARRVLREHIRESLGVRSEDLLFAIIN 36
>gi|225442687|ref|XP_002284822.1| PREDICTED: uncharacterized protein LOC100246448 [Vitis vinifera]
Length = 691
Score = 67.0 bits (162), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 59/219 (26%), Positives = 101/219 (46%), Gaps = 33/219 (15%)
Query: 74 SKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRG 133
S+ +L+ HELS++G PL +MELA L G V+ + + K L ++ R
Sbjct: 235 SRKFVLIFHELSMTGAPLSMMELATELLSCGATVSAVVLSKKG-------GLMPELARRR 287
Query: 134 VQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFK 193
++V+ + + TA+KADL++ +AV W++ + ++WWI E R YF
Sbjct: 288 IKVLEDRADLSFKTAMKADLVIAGSAVCASWIEQYIAHFTAGS-SQIVWWIMENRREYFD 346
Query: 194 -----LDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVAE 248
++ VK L ++ + +T W RL I P VV L + EL VA
Sbjct: 347 RSKLVINRVKMLIFLSESQSKQWLT---WCKEENIRL-ISQP--AVVPLSVNDELAFVAG 400
Query: 249 DNVA--------------KRVLREHVRESLGVRNEDLLF 273
+ +R+LR+ +R+ +G+ + D+L
Sbjct: 401 ITCSLNTPSFTTEKMQEKRRLLRDSIRKEMGLTDTDMLL 439
>gi|147838732|emb|CAN71826.1| hypothetical protein VITISV_013841 [Vitis vinifera]
Length = 734
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 59/219 (26%), Positives = 101/219 (46%), Gaps = 33/219 (15%)
Query: 74 SKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRG 133
S+ +L+ HELS++G PL +MELA L G V+ + + K L ++ R
Sbjct: 246 SRKFVLIFHELSMTGAPLSMMELATELLSCGATVSAVVLSKKG-------GLMPELARRR 298
Query: 134 VQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFK 193
++V+ + + TA+KADL++ +AV W++ + ++WWI E R YF
Sbjct: 299 IKVLEDRADLSFKTAMKADLVIAGSAVCASWIEQYIAHFTAGS-SQIVWWIMENRREYFD 357
Query: 194 -----LDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVAE 248
++ VK L ++ + +T W RL I P VV L + EL VA
Sbjct: 358 RSKLVINRVKMLIFLSESQSKQWLT---WCKEENIRL-ISQP--AVVPLSVNDELAFVAG 411
Query: 249 DNVA--------------KRVLREHVRESLGVRNEDLLF 273
+ +R+LR+ +R+ +G+ + D+L
Sbjct: 412 ITCSLNTPSFTTEKMQEKRRLLRDSIRKEMGLTDTDMLL 450
>gi|297743306|emb|CBI36173.3| unnamed protein product [Vitis vinifera]
Length = 683
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 59/219 (26%), Positives = 101/219 (46%), Gaps = 33/219 (15%)
Query: 74 SKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRG 133
S+ +L+ HELS++G PL +MELA L G V+ + + K L ++ R
Sbjct: 246 SRKFVLIFHELSMTGAPLSMMELATELLSCGATVSAVVLSKKG-------GLMPELARRR 298
Query: 134 VQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFK 193
++V+ + + TA+KADL++ +AV W++ + ++WWI E R YF
Sbjct: 299 IKVLEDRADLSFKTAMKADLVIAGSAVCASWIEQYIAHFTAGS-SQIVWWIMENRREYFD 357
Query: 194 -----LDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVAE 248
++ VK L ++ + +T W RL I P VV L + EL VA
Sbjct: 358 RSKLVINRVKMLIFLSESQSKQWLT---WCKEENIRL-ISQP--AVVPLSVNDELAFVAG 411
Query: 249 DNVA--------------KRVLREHVRESLGVRNEDLLF 273
+ +R+LR+ +R+ +G+ + D+L
Sbjct: 412 ITCSLNTPSFTTEKMQEKRRLLRDSIRKEMGLTDTDMLL 450
>gi|356528940|ref|XP_003533055.1| PREDICTED: uncharacterized protein LOC100793124 [Glycine max]
Length = 675
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 43/150 (28%), Positives = 73/150 (48%), Gaps = 13/150 (8%)
Query: 43 DRHLEVSDKKRVQSQSVPRIATKSSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRG 102
DR L++S ++R ++ K + S+ +L+ HELS++G PL +MELA L
Sbjct: 208 DRILQLSPQRRYET-----CDKKGEFARLVWSRRFVLIFHELSMTGAPLSMMELATELLS 262
Query: 103 VGTKVNWITIQKPSEEDEVIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAG 162
G V+ + + + L ++ R ++V+ K + A K+DL++ +AV
Sbjct: 263 CGASVSAVVLSRKG-------GLMQELARRRIKVLDDKAYLSFKIAKKSDLVIAGSAVCA 315
Query: 163 KWLDAVLKEDVPRVLPNVLWWIHEMRGHYF 192
W++ + E P V WWI E R YF
Sbjct: 316 SWIEQYI-EHFPAGASQVAWWIMENRREYF 344
>gi|356522373|ref|XP_003529821.1| PREDICTED: uncharacterized protein LOC100796443 [Glycine max]
Length = 681
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 60/119 (50%), Gaps = 8/119 (6%)
Query: 74 SKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRG 133
S+ +L+ HELS++G PL +MELA L G V+ + + + L ++ R
Sbjct: 240 SRRFVLIFHELSMTGAPLSMMELATELLSCGASVSAVVLSRKG-------GLMQELARRR 292
Query: 134 VQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYF 192
++V+ K + A KADL++ +AV W++ + E P V WWI E R YF
Sbjct: 293 IKVLDDKAYLSFKIANKADLVIAGSAVCTSWIEQYI-EHFPAGANQVAWWIMENRREYF 350
>gi|302786980|ref|XP_002975260.1| hypothetical protein SELMODRAFT_415394 [Selaginella moellendorffii]
gi|300156834|gb|EFJ23461.1| hypothetical protein SELMODRAFT_415394 [Selaginella moellendorffii]
Length = 125
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 35/49 (71%), Gaps = 3/49 (6%)
Query: 200 LPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVAE 248
+P +A MIDSH TAEYW NRT++ L IK+P VHLGNSK L E A+
Sbjct: 1 MPEIAAVMIDSHATAEYWNNRTQQHLGIKIPK---VHLGNSKHLTEAAQ 46
>gi|147853759|emb|CAN83817.1| hypothetical protein VITISV_002382 [Vitis vinifera]
Length = 97
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 43/101 (42%), Positives = 57/101 (56%), Gaps = 5/101 (4%)
Query: 1 MGKHSATGWWVPLTKRWILALLIMLSISTAIAFFIRAALDPCDRHLEVSDKKRVQSQSVP 60
M +HSA + L K+++ AL+ S+ T + RA CDRH SD + + +P
Sbjct: 1 MARHSAPRFS--LXKKFLFALVFFFSVXTVVFLIKRAPSISCDRH---SDAGVKRFEPLP 55
Query: 61 RIATKSSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLR 101
+ SPL FMKSK VLLVSHELSLSG L EL+ +R
Sbjct: 56 QFGAAPSPLXFMKSKRVLLVSHELSLSGNILESPELSVFMR 96
>gi|295104016|emb|CBL01560.1| Glycosyltransferase [Faecalibacterium prausnitzii SL3/3]
Length = 535
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 47/172 (27%), Positives = 78/172 (45%), Gaps = 29/172 (16%)
Query: 73 KSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDR 132
K + L++SHEL+++G P++L+ +LR +G +V + PS+E + L D
Sbjct: 178 KGRRALILSHELTMTGAPIVLVSAVPVLRSLGFEV---VVLGPSDEGSLPLFL-----DA 229
Query: 133 GVQVISAKGQETINT----ALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMR 188
G V++ T + A AD ++ NT V ++ + VP VLWW+H+
Sbjct: 230 GAAVVTRPDCVTSSALWGLATSADFVLANTVVEAPVVNTLNGSFVP-----VLWWLHDAF 284
Query: 189 GHYFKLDYVKH-LPLVAG-----AMIDSHVTAEYWKNRTR---ERLRIKMPD 231
Y ++ H +P G + SH TA R E+L +PD
Sbjct: 285 AGY---PFISHSIPKALGKNVHLCAVGSHATAAMHSVRPDFAIEQLIYGLPD 333
>gi|160943782|ref|ZP_02091013.1| hypothetical protein FAEPRAM212_01280 [Faecalibacterium prausnitzii
M21/2]
gi|158444956|gb|EDP21959.1| glycosyltransferase, group 1 family protein [Faecalibacterium
prausnitzii M21/2]
Length = 535
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 47/172 (27%), Positives = 78/172 (45%), Gaps = 29/172 (16%)
Query: 73 KSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDR 132
K + L++SHEL+++G P++L+ +LR +G +V + PS+E + L D
Sbjct: 178 KGRRALILSHELTMTGAPIVLVSAVPVLRSLGFEV---VVLGPSDEGSLPLFL-----DA 229
Query: 133 GVQVISAKGQETINT----ALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMR 188
G V++ T + A AD ++ NT V ++ + VP VLWW+H+
Sbjct: 230 GAAVVTRPDCVTSSALWGLATSADFVLANTVVEAPVVNTLNGSFVP-----VLWWLHDAF 284
Query: 189 GHYFKLDYVKH-LPLVAG-----AMIDSHVTAEYWKNRTR---ERLRIKMPD 231
Y ++ H +P G + SH TA R E+L +PD
Sbjct: 285 AGY---PFISHSIPKALGKNVHLCAVGSHATAAMHSVRPDFEIEQLIYGLPD 333
>gi|295102222|emb|CBK99767.1| Glycosyltransferase [Faecalibacterium prausnitzii L2-6]
Length = 532
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 62/124 (50%), Gaps = 17/124 (13%)
Query: 72 MKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWD 131
+K K V ++SH L ++G P++L+ +LR +G +V + P + + LE
Sbjct: 177 VKGKRVFIMSHLLDMTGAPIVLVSAVPVLRSMGYEV---VVLGPEDGGSMPLFLE----- 228
Query: 132 RGVQVISAKG--QETI--NTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEM 187
G VI+ KG Q + AL ADL++ NT V + + A+ VP VLWW+H+
Sbjct: 229 AGATVITRKGCVQSPLLWGLALCADLVLANTVVEARAVRALSGARVP-----VLWWLHDA 283
Query: 188 RGHY 191
Y
Sbjct: 284 FAGY 287
>gi|257440307|ref|ZP_05616062.1| glycosyl transferase, group 1 family [Faecalibacterium prausnitzii
A2-165]
gi|257197341|gb|EEU95625.1| glycosyltransferase, group 1 family protein [Faecalibacterium
prausnitzii A2-165]
Length = 533
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/122 (27%), Positives = 57/122 (46%), Gaps = 17/122 (13%)
Query: 74 SKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRG 133
K ++SH L ++G P++L+ +LR +G +V + PS+ + H D G
Sbjct: 179 GKRAFIMSHVLDMTGAPIVLVSAVPVLRSMGYEV---LVLGPSDGGSL-----HLFLDAG 230
Query: 134 VQVISAKGQETINT----ALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRG 189
VI+ ++ AL AD ++ NT V + + A+ VP VLWW+H+
Sbjct: 231 ASVITRSSCRNVSDAWGMALCADFVIANTVVMARAVRALSGTAVP-----VLWWLHDAFA 285
Query: 190 HY 191
Y
Sbjct: 286 GY 287
>gi|313114821|ref|ZP_07800321.1| glycosyltransferase, group 1 family [Faecalibacterium cf.
prausnitzii KLE1255]
gi|310622876|gb|EFQ06331.1| glycosyltransferase, group 1 family [Faecalibacterium cf.
prausnitzii KLE1255]
Length = 531
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 45/171 (26%), Positives = 80/171 (46%), Gaps = 31/171 (18%)
Query: 75 KLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRGV 134
K L++SHEL+++G P++L +LR +G +V + P+++ + L D G
Sbjct: 180 KRALILSHELTMTGAPIVLTSAVPVLRSMGFEV---VVLGPADDGSLPLFL-----DAGA 231
Query: 135 QVISAKGQETINTAL-----KADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRG 189
V++ + +N++L AD ++ NT V + + VP VLWW+H+
Sbjct: 232 AVVT-RSDCVMNSSLWGLATSADFVLANTVVEAAAVSTLNGSFVP-----VLWWLHDAFA 285
Query: 190 HYFKLDYVKH-LPLVAGA-----MIDSHVTAEYWKNR---TRERLRIKMPD 231
Y ++ H +P G+ + SH TA R + E+L +PD
Sbjct: 286 GY---PFIAHKIPKTLGSNVHVCAVGSHATAAMHSVRPDFSIEQLIYGLPD 333
>gi|392579068|gb|EIW72195.1| hypothetical protein TREMEDRAFT_36511 [Tremella mesenterica DSM
1558]
Length = 718
Score = 41.2 bits (95), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 32/118 (27%), Positives = 54/118 (45%), Gaps = 15/118 (12%)
Query: 37 AALDPCDRHLEVSDKKRVQSQSVPRIATKSSP---------LSFMKSKLVLLVSHELSLS 87
A LDP D H++V ++V+ S+ + +SP + ++ K + + +
Sbjct: 174 AVLDPVDHHVQVDCNRQVRLCSIVQYQQSTSPELWKRFTALATQLREKDISIAFFSATPQ 233
Query: 88 GGPLLLMELAF--LLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRGVQVISAKGQE 143
GG + LM A LLR VG KVNW E V++ + + +Q ++ KG E
Sbjct: 234 GGGVALMRHALIRLLRMVGVKVNWYV----PEGHPVVFDITKRKIHNVLQGVAEKGME 287
>gi|423301147|ref|ZP_17279171.1| hypothetical protein HMPREF1057_02312 [Bacteroides finegoldii
CL09T03C10]
gi|408472482|gb|EKJ91010.1| hypothetical protein HMPREF1057_02312 [Bacteroides finegoldii
CL09T03C10]
Length = 824
Score = 40.8 bits (94), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 41/157 (26%), Positives = 70/157 (44%), Gaps = 26/157 (16%)
Query: 73 KSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDR 132
K+K VLL+SHELSL+G P L+ +A +L+ +G ++++ E E+ D
Sbjct: 472 KTKTVLLISHELSLTGAPRALLNMAIMLKKIGATPVILSLKHGPMEKEIS--------DL 523
Query: 133 GVQVISAKGQETINTALK----------ADLIVLNTAVAGKWLDAV-LKEDVPRVLPNVL 181
G++++ + +N L+ D++V NT L+ V L E + +
Sbjct: 524 GIKLL-VEPFLLMNYNLRHWSLSGFLSNFDVVVFNT------LETVWLIEHFSEIKARKI 576
Query: 182 WWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWK 218
W+HE R Y K L ++ + EY K
Sbjct: 577 CWLHEGRYSYAGWTRFKDLSVLFSLFDKVYAVGEYSK 613
>gi|375090843|ref|ZP_09737151.1| dihydrolipoyl dehydrogenase [Helcococcus kunzii ATCC 51366]
gi|374565151|gb|EHR36426.1| dihydrolipoyl dehydrogenase [Helcococcus kunzii ATCC 51366]
Length = 564
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 22/75 (29%), Positives = 42/75 (56%), Gaps = 3/75 (4%)
Query: 84 LSLSGGPLLLMELAFLLRGVGTKVNWITIQK---PSEEDEVIYSLEHKMWDRGVQVISAK 140
L++ GG ++ +E AF+ G+ V I Q P + EV +EH ++G++V+++
Sbjct: 275 LTIIGGGVIGLEFAFIYNAFGSDVKLIEYQDGLLPMFDREVAEEIEHIATEKGIKVMTSS 334
Query: 141 GQETINTALKADLIV 155
+ I+ ++ DLIV
Sbjct: 335 KVKEISETVEGDLIV 349
>gi|255692926|ref|ZP_05416601.1| glycosyltransferase [Bacteroides finegoldii DSM 17565]
gi|260621330|gb|EEX44201.1| glycosyltransferase, group 2 family protein [Bacteroides finegoldii
DSM 17565]
Length = 825
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 39/149 (26%), Positives = 68/149 (45%), Gaps = 10/149 (6%)
Query: 74 SKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRG 133
+K VLL+SHELSL+G P L+ +A +L+ G + ++++ E E I L+ K+
Sbjct: 473 TKTVLLISHELSLTGAPRALVNMAIMLKKSGIRPVILSLKHGPMEKE-IADLDIKLLVEP 531
Query: 134 VQVISA--KGQETINTALKADLIVLNTAVAGKWLDAV-LKEDVPRVLPNVLWWIHEMRGH 190
+++ + + D++V NT L+ V L E + + W+HE R
Sbjct: 532 FLLMNCGLRHRALFRFLSVFDVVVFNT------LETVWLIEHFSEIEARKICWLHEGRYS 585
Query: 191 YFKLDYVKHLPLVAGAMIDSHVTAEYWKN 219
Y K L ++ + EY K+
Sbjct: 586 YMGWTRFKDLSMLFSKFDKIYAVGEYSKS 614
>gi|83582735|ref|YP_425041.1| glycosyl transferase [Rhodospirillum rubrum ATCC 11170]
gi|83578051|gb|ABC24601.1| Glycosyl transferase [Rhodospirillum rubrum ATCC 11170]
Length = 708
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 31/59 (52%)
Query: 64 TKSSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVI 122
TK L + + V LVSHELS +G P+LL ++A LR G N I++ E I
Sbjct: 300 TKQPVLPGLDGRHVTLVSHELSYTGAPVLLRDVALALRREGVSSNVISLSSGPLAKEFI 358
>gi|15079154|ref|NP_149906.1| 443R [Invertebrate iridescent virus 6]
gi|34223713|sp|P18305.2|Y43R_IIV6 RecName: Full=Uncharacterized protein 443R
gi|15042523|gb|AAK82303.1|AF303741_444 443R [Invertebrate iridescent virus 6]
Length = 2432
Score = 38.9 bits (89), Expect = 2.5, Method: Composition-based stats.
Identities = 32/108 (29%), Positives = 50/108 (46%), Gaps = 5/108 (4%)
Query: 110 ITIQKPSEEDEVIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAV-AGKWLDAV 168
IT+ + D VI+ ++ + D GV V GQ TA+ DL V + V G+ +D+
Sbjct: 13 ITVVRGIPGDIVIFGIDGTIADSGVPVSIGSGQAKQPTAVPGDLAVFGSGVNQGQTIDSG 72
Query: 169 LKEDVPRV-LPNVLWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAE 215
L D V LP VLW ++ + LPL G M + ++ +
Sbjct: 73 LTVDDSSVSLPTVLWSSQKIES---LIPTGSFLPLAGGTMTGTIISKD 117
>gi|383459706|ref|YP_005373695.1| glycosyl transferase family protein [Corallococcus coralloides DSM
2259]
gi|380734839|gb|AFE10841.1| glycosyl transferase family protein [Corallococcus coralloides DSM
2259]
Length = 397
Score = 37.7 bits (86), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 33/127 (25%), Positives = 54/127 (42%), Gaps = 24/127 (18%)
Query: 100 LRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRGVQVIS-------AKGQETINTALKAD 152
LR +G V W+ I +P+ + E + GV+V+S G ET AL
Sbjct: 25 LRRLGHTVGWLCIPEPAPQLESL----------GVEVLSLPHVEAAPPGIETGGEALAK- 73
Query: 153 LIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHV 212
+VL+ GKW+ +L + VP +L V + R LD +++ A++ +H
Sbjct: 74 -LVLDDVALGKWIRGLLLDSVPTMLEPVREVVRAFRPDVMALDGMQY-----AAVLAAHT 127
Query: 213 TAEYWKN 219
W
Sbjct: 128 EGIPWAG 134
>gi|423614907|ref|ZP_17590741.1| inosine/xanthosine triphosphatase [Bacillus cereus VD115]
gi|401262126|gb|EJR68271.1| inosine/xanthosine triphosphatase [Bacillus cereus VD115]
Length = 169
Score = 37.0 bits (84), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 20/57 (35%), Positives = 35/57 (61%), Gaps = 5/57 (8%)
Query: 226 RIKMPDTYVVHLGNSKELMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIINRHRLY 282
RIK+PD ++V L + KEL EV E+ V ++ +R H ++G+ +D ++R L+
Sbjct: 100 RIKLPDDFLVPLEDGKELSEVMEEFVERKDIRSH-EGAIGIFTDDY----VDRTDLF 151
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.322 0.136 0.410
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,200,553,708
Number of Sequences: 23463169
Number of extensions: 160541553
Number of successful extensions: 405383
Number of sequences better than 100.0: 66
Number of HSP's better than 100.0 without gapping: 42
Number of HSP's successfully gapped in prelim test: 24
Number of HSP's that attempted gapping in prelim test: 405257
Number of HSP's gapped (non-prelim): 71
length of query: 282
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 141
effective length of database: 9,050,888,538
effective search space: 1276175283858
effective search space used: 1276175283858
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 76 (33.9 bits)