BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 023401
         (282 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224133786|ref|XP_002327680.1| predicted protein [Populus trichocarpa]
 gi|222836765|gb|EEE75158.1| predicted protein [Populus trichocarpa]
          Length = 481

 Score =  402 bits (1032), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 218/281 (77%), Positives = 240/281 (85%), Gaps = 6/281 (2%)

Query: 3   KHSATGWWVP--LTKRWILALLIMLSISTAIAFFIRAALDPCD----RHLEVSDKKRVQS 56
           +HS T  W P  + KRW+L LLIMLS+ST IAFFI++A D CD     + +V+   +   
Sbjct: 7   QHSNTAGWAPPSVQKRWLLTLLIMLSVSTLIAFFIKSAFDSCDPPHPHNFDVAASNKPAK 66

Query: 57  QSVPRIATKSSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPS 116
                I T  SPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLR VGT+V WITIQKPS
Sbjct: 67  VFSNSIKTAPSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRSVGTEVFWITIQKPS 126

Query: 117 EEDEVIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRV 176
           E DEV+YSLE KM  RGVQV+SAKGQE I+TA KADL+VLNTAVAGKWLDAVLKE+VPRV
Sbjct: 127 ETDEVVYSLEQKMLVRGVQVLSAKGQEAIDTAFKADLVVLNTAVAGKWLDAVLKENVPRV 186

Query: 177 LPNVLWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVH 236
           LP VLWWIHEMRGHYFKLDYVKHLPLV GAMIDSHVTAEYWKNRT+ERLRIKMP+TYVVH
Sbjct: 187 LPKVLWWIHEMRGHYFKLDYVKHLPLVGGAMIDSHVTAEYWKNRTQERLRIKMPETYVVH 246

Query: 237 LGNSKELMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
           LGNSKELMEVAED+VAKRVLREH+RESLGVR+ED+LFAIIN
Sbjct: 247 LGNSKELMEVAEDSVAKRVLREHIRESLGVRDEDILFAIIN 287


>gi|302142552|emb|CBI19755.3| unnamed protein product [Vitis vinifera]
          Length = 463

 Score =  380 bits (975), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 206/277 (74%), Positives = 233/277 (84%), Gaps = 8/277 (2%)

Query: 1   MGKHSATGWWVPLTKRWILALLIMLSISTAIAFFIRAALDPCDRHLEVSDKKRVQSQSVP 60
           M K S +  W+   KRW L L+ +LS ST I   IRAA D C+     +   R+Q  S  
Sbjct: 1   MAKQSTS--WLTFHKRWPLLLVALLSTSTVIVLLIRAASDSCN----TNSNTRIQVTS-- 52

Query: 61  RIATKSSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDE 120
           ++ T S+PL FMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVG +V W+TIQKP++ DE
Sbjct: 53  QVETPSNPLRFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGAEVVWLTIQKPTDSDE 112

Query: 121 VIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNV 180
           VIYSLEH+M DRGV+V  AKGQE I+TALKADL+VLNTAVAGKWLD+V+KE+VPR+LP V
Sbjct: 113 VIYSLEHRMLDRGVKVFPAKGQEAIDTALKADLVVLNTAVAGKWLDSVVKENVPRILPKV 172

Query: 181 LWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNS 240
           LWWIHEMRGHYFKL+YVKHLP VAGAMIDSH TAEYWKNRTRERL IKMP+TYVVHLGNS
Sbjct: 173 LWWIHEMRGHYFKLEYVKHLPYVAGAMIDSHTTAEYWKNRTRERLGIKMPETYVVHLGNS 232

Query: 241 KELMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
           KELME+AE+NVAKRVLREHVRESLGVRNEDLLFA+IN
Sbjct: 233 KELMEIAENNVAKRVLREHVRESLGVRNEDLLFAVIN 269


>gi|225458171|ref|XP_002281084.1| PREDICTED: uncharacterized protein LOC100257473 [Vitis vinifera]
          Length = 479

 Score =  374 bits (960), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 206/289 (71%), Positives = 235/289 (81%), Gaps = 16/289 (5%)

Query: 1   MGKHSATGWWVPLTKRWILALLIMLSISTAIAFFIRAALDPCDRHLEVS----------- 49
           M K S +  W+   KRW L L+ +LS ST I   IRAA D C+ +   +           
Sbjct: 1   MAKQSTS--WLTFHKRWPLLLVALLSTSTVIVLLIRAASDSCNTNSVTTTTTITTTPHYS 58

Query: 50  -DKKRVQSQSVPRIATKSSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVN 108
            +  R+Q  S  ++ T S+PL FMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVG +V 
Sbjct: 59  YENTRIQVTS--QVETPSNPLRFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGAEVV 116

Query: 109 WITIQKPSEEDEVIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAV 168
           W+TIQKP++ DEVIYSLEH+M DRGV+V  AKGQE I+TALKADL+VLNTAVAGKWLD+V
Sbjct: 117 WLTIQKPTDSDEVIYSLEHRMLDRGVKVFPAKGQEAIDTALKADLVVLNTAVAGKWLDSV 176

Query: 169 LKEDVPRVLPNVLWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIK 228
           +KE+VPR+LP VLWWIHEMRGHYFKL+YVKHLP VAGAMIDSH TAEYWKNRTRERL IK
Sbjct: 177 VKENVPRILPKVLWWIHEMRGHYFKLEYVKHLPYVAGAMIDSHTTAEYWKNRTRERLGIK 236

Query: 229 MPDTYVVHLGNSKELMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
           MP+TYVVHLGNSKELME+AE+NVAKRVLREHVRESLGVRNEDLLFA+IN
Sbjct: 237 MPETYVVHLGNSKELMEIAENNVAKRVLREHVRESLGVRNEDLLFAVIN 285


>gi|147771878|emb|CAN73426.1| hypothetical protein VITISV_033235 [Vitis vinifera]
          Length = 495

 Score =  374 bits (959), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 207/289 (71%), Positives = 235/289 (81%), Gaps = 16/289 (5%)

Query: 1   MGKHSATGWWVPLTKRWILALLIMLSISTAIAFFIRAALDPCDRHLEVS----------- 49
           M K S +  W+   KRW L L+ +LS ST I   IRAA D C+ +   +           
Sbjct: 1   MAKQSTS--WLTFHKRWPLLLVALLSTSTVIVLLIRAASDSCNTNSVTTTTTITTTPHYS 58

Query: 50  -DKKRVQSQSVPRIATKSSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVN 108
            +  R+Q  S  ++ T S+PL FMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVG +V 
Sbjct: 59  YENTRIQVTS--QVETPSNPLRFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGAEVV 116

Query: 109 WITIQKPSEEDEVIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAV 168
           W+TIQKP++ DEVIYSLEH+M DRGV+V  AKGQE I+TALKADL+VLNTAVAGKWLD+V
Sbjct: 117 WLTIQKPTDSDEVIYSLEHRMLDRGVKVFPAKGQEAIDTALKADLVVLNTAVAGKWLDSV 176

Query: 169 LKEDVPRVLPNVLWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIK 228
           +KE+VPR+LP VLWWIHEMRGHYFKL+YVKHLP VAGAMIDSH TAEYWKNRTRERL IK
Sbjct: 177 VKENVPRILPKVLWWIHEMRGHYFKLEYVKHLPYVAGAMIDSHTTAEYWKNRTRERLGIK 236

Query: 229 MPDTYVVHLGNSKELMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
           MP+TYVVHLGNSKELME+AE+NVAKRVLREHVRESLGVRNEDLLFAIIN
Sbjct: 237 MPETYVVHLGNSKELMEIAENNVAKRVLREHVRESLGVRNEDLLFAIIN 285


>gi|15222229|ref|NP_177675.1| UDP-glycosyltransferase-like protein [Arabidopsis thaliana]
 gi|30793985|gb|AAP40442.1| unknown protein [Arabidopsis thaliana]
 gi|110739259|dbj|BAF01543.1| hypothetical protein [Arabidopsis thaliana]
 gi|332197597|gb|AEE35718.1| UDP-glycosyltransferase-like protein [Arabidopsis thaliana]
          Length = 463

 Score =  355 bits (912), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 193/263 (73%), Positives = 218/263 (82%), Gaps = 3/263 (1%)

Query: 15  KRWILALLIMLSISTAIAFFIRAALDPCDRHLEVSDKKRVQSQSVPRIATKSSPLSFMKS 74
           KRW L +L+ LS+ST     +R++ + C    +  ++K  +S +      +S+PL FMKS
Sbjct: 10  KRWALMVLLFLSVSTVCMILVRSSFETCSISSQFVEEKNGESSAAK---FQSNPLDFMKS 66

Query: 75  KLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRGV 134
           KLVLLVSHELSLSGGPLLLMELAFLLRGVG  V WIT QKP E+DEV+YSLEHKM DRGV
Sbjct: 67  KLVLLVSHELSLSGGPLLLMELAFLLRGVGADVVWITNQKPLEDDEVVYSLEHKMLDRGV 126

Query: 135 QVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFKL 194
           QVISAKGQ+ ++T+LKADLIVLNTAVAGKWLDAVLKE+V +VLP +LWWIHEMRGHYF  
Sbjct: 127 QVISAKGQKAVDTSLKADLIVLNTAVAGKWLDAVLKENVVKVLPKILWWIHEMRGHYFNA 186

Query: 195 DYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVAEDNVAKR 254
           D VKHLP VAGAMIDSH TA YWKNRT+ RL IKMP TYVVHLGNSKELMEVAED+VAKR
Sbjct: 187 DLVKHLPFVAGAMIDSHATAGYWKNRTQARLGIKMPKTYVVHLGNSKELMEVAEDSVAKR 246

Query: 255 VLREHVRESLGVRNEDLLFAIIN 277
           VLREHVRESLGVRNEDLLF IIN
Sbjct: 247 VLREHVRESLGVRNEDLLFGIIN 269


>gi|449436130|ref|XP_004135847.1| PREDICTED: uncharacterized protein LOC101206589 [Cucumis sativus]
          Length = 472

 Score =  355 bits (910), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 198/267 (74%), Positives = 223/267 (83%), Gaps = 4/267 (1%)

Query: 15  KRWILALLIMLSISTAIAFFIRAALDPC---DRHLEVSDKKRVQSQSVPRIATKS-SPLS 70
           KRW L +L ++SISTA+ FF+R   D C   D    V +K        P+   K+ +PL 
Sbjct: 14  KRWPLMILALVSISTAMVFFMRTTFDSCTGNDNRRFVEEKGIGSKILSPQNERKAPNPLD 73

Query: 71  FMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMW 130
           FMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGT+V WIT QKP E DEV+YSLE KM 
Sbjct: 74  FMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTQVVWITNQKPPEPDEVVYSLERKML 133

Query: 131 DRGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGH 190
           DRGVQV+SAK QE + TALKA L+VLNTAVAGKWLDAVLKE+VPRVLP VLWWIHEMRG+
Sbjct: 134 DRGVQVLSAKEQEAVETALKAHLVVLNTAVAGKWLDAVLKENVPRVLPKVLWWIHEMRGN 193

Query: 191 YFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVAEDN 250
           YFK++YVKHLP VAGAMIDSH TAEYWKNRT +RL I+MP+TYVVHLGNSK+LMEVAE+N
Sbjct: 194 YFKVEYVKHLPFVAGAMIDSHTTAEYWKNRTWDRLGIQMPETYVVHLGNSKDLMEVAENN 253

Query: 251 VAKRVLREHVRESLGVRNEDLLFAIIN 277
           VAKRVLREH+RESLGVRNED+LFAIIN
Sbjct: 254 VAKRVLREHIRESLGVRNEDILFAIIN 280


>gi|297839425|ref|XP_002887594.1| glycosyl transferase family 1 protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297333435|gb|EFH63853.1| glycosyl transferase family 1 protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 458

 Score =  353 bits (907), Expect = 4e-95,   Method: Compositional matrix adjust.
 Identities = 193/263 (73%), Positives = 217/263 (82%), Gaps = 8/263 (3%)

Query: 15  KRWILALLIMLSISTAIAFFIRAALDPCDRHLEVSDKKRVQSQSVPRIATKSSPLSFMKS 74
           KRW L +L+ LS+ST     +R++ + C    +  D   ++ QS        +PL+FMKS
Sbjct: 10  KRWALMVLLFLSVSTVCVILVRSSFESCSVGGQFVDSSALRFQS--------NPLAFMKS 61

Query: 75  KLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRGV 134
           KLVLLVSHELSLSGGPLLLMELAFLLRGVG +V WIT QKP EEDEV+YSLEHKM DRGV
Sbjct: 62  KLVLLVSHELSLSGGPLLLMELAFLLRGVGAEVVWITNQKPLEEDEVVYSLEHKMLDRGV 121

Query: 135 QVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFKL 194
           QVISAKGQ+ ++ ALKADLIVLNTAVAGKWLDAVLKE+V +VLP +LWWIHEMRGHYF  
Sbjct: 122 QVISAKGQKAVDIALKADLIVLNTAVAGKWLDAVLKENVFKVLPKILWWIHEMRGHYFNP 181

Query: 195 DYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVAEDNVAKR 254
           D VKHLP VAGAMIDSH TAEYW+NRT+ RL IKMP TYVVHLGNSK+LMEVAED+VAKR
Sbjct: 182 DLVKHLPFVAGAMIDSHATAEYWQNRTQARLGIKMPKTYVVHLGNSKDLMEVAEDSVAKR 241

Query: 255 VLREHVRESLGVRNEDLLFAIIN 277
           VLREHVRESLGVRNEDLLF IIN
Sbjct: 242 VLREHVRESLGVRNEDLLFGIIN 264


>gi|356518559|ref|XP_003527946.1| PREDICTED: uncharacterized protein LOC100791337 [Glycine max]
          Length = 464

 Score =  350 bits (899), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 197/279 (70%), Positives = 223/279 (79%), Gaps = 7/279 (2%)

Query: 1   MGKHSATGWWVPLTKRWILALLIMLSISTAIAFFIRAA-LDPCD-RHLEVSDKKRVQSQS 58
           M KHS         KRW + L   LS+ST     +R+   D C+  H  V+   +++S  
Sbjct: 1   MAKHSVA----MAKKRWPIMLAAFLSVSTVTVLLMRSNNSDSCNTNHFTVAQDNQIRS-P 55

Query: 59  VPRIATKSSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEE 118
           V      SSPL FMKSKLVL+VSHELSLSGGPLLLMELAFLLRGVG+ V WI+ QKPSE 
Sbjct: 56  VQLTNAASSPLIFMKSKLVLMVSHELSLSGGPLLLMELAFLLRGVGSDVVWISNQKPSEH 115

Query: 119 DEVIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLP 178
           D V+YSLE KM DRGVQV+SAKG+  I+TALKAD+++LNTAVAGKWLDA+LKE V  VLP
Sbjct: 116 DRVVYSLESKMLDRGVQVLSAKGENAIDTALKADMVILNTAVAGKWLDAILKEKVAHVLP 175

Query: 179 NVLWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLG 238
            VLWWIHEMRGHYFK++YVKHLP VAGAMIDSH TAEYWKNRTRERL I+MP+TYVVHLG
Sbjct: 176 KVLWWIHEMRGHYFKVEYVKHLPFVAGAMIDSHTTAEYWKNRTRERLGIEMPETYVVHLG 235

Query: 239 NSKELMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
           NSKELMEVAED+VAKRVLREHVRESLGVRN+DLLFAIIN
Sbjct: 236 NSKELMEVAEDSVAKRVLREHVRESLGVRNDDLLFAIIN 274


>gi|297844940|ref|XP_002890351.1| glycosyl transferase family 1 protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297336193|gb|EFH66610.1| glycosyl transferase family 1 protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 480

 Score =  347 bits (891), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 195/280 (69%), Positives = 223/280 (79%), Gaps = 3/280 (1%)

Query: 1   MGKHSATGWWVPLTKRWILALLIMLSISTAIAFFIRAALDPCDRHLEVSDKKRVQSQSVP 60
           MGK S + W     KRW L +L++LS+ST     +R+  D C    +   +++  + S  
Sbjct: 1   MGKPSTSVWATLQKKRWPLMILLVLSVSTLGMILVRSTFDSCSVSGKRCGREKEDNNSDV 60

Query: 61  RIATKS---SPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSE 117
           +I + S   +PL FMKSKLVLLVSHELSLSGGPLLLMELAFLLRGV ++V WIT QKP E
Sbjct: 61  KIQSVSGSLNPLGFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVESEVVWITNQKPVE 120

Query: 118 EDEVIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVL 177
           EDEVI  LEHKM DRGVQVISAK Q+ I+TALK+DL+VLNTAVAGKWLDAVLK++VP+VL
Sbjct: 121 EDEVIKVLEHKMLDRGVQVISAKSQKAIDTALKSDLVVLNTAVAGKWLDAVLKDNVPKVL 180

Query: 178 PNVLWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHL 237
           P VLWWIHEMRGHYFK D VKHLP VAGAMIDSH TAEYWKNRT +RL I+MP TYVVHL
Sbjct: 181 PKVLWWIHEMRGHYFKPDLVKHLPFVAGAMIDSHATAEYWKNRTHDRLGIEMPKTYVVHL 240

Query: 238 GNSKELMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
           GNSKELMEVAED+ AK VLRE VRESLGVRNED+LF IIN
Sbjct: 241 GNSKELMEVAEDSFAKNVLREQVRESLGVRNEDILFGIIN 280


>gi|255538754|ref|XP_002510442.1| glycosyltransferase, putative [Ricinus communis]
 gi|223551143|gb|EEF52629.1| glycosyltransferase, putative [Ricinus communis]
          Length = 477

 Score =  346 bits (888), Expect = 5e-93,   Method: Compositional matrix adjust.
 Identities = 205/294 (69%), Positives = 225/294 (76%), Gaps = 24/294 (8%)

Query: 1   MGKHSATGWWVPLTKRWILALLIMLSISTAIAFFIRAALDP---------------CDRH 45
           M K S + W      RW L +L   ++ST I F IR+A D                 DR 
Sbjct: 1   MAKQS-SAWITKHKNRWPLMILAFFTLSTLIVFSIRSASDSCHSSSNITTTATTANVDRF 59

Query: 46  LE--VSDKKRVQSQSVPRIATKSSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGV 103
            E  V  K ++ S   P      +PL FMKSKLVLLVSHELSLSGGPLLLMELAFLLRGV
Sbjct: 60  GEPKVDSKPQIHSSVAP------NPLDFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGV 113

Query: 104 GTKVNWITIQKPSEEDEVIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAGK 163
           G +V WIT QKP+E DEVIYSLE+KM DRGVQV SAKGQ+ I+TALKADL+VLNTAVAGK
Sbjct: 114 GAEVVWITNQKPTETDEVIYSLENKMLDRGVQVFSAKGQKAIDTALKADLVVLNTAVAGK 173

Query: 164 WLDAVLKEDVPRVLPNVLWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRE 223
           WLDA LKE V +VLP VLWWIHEMRGHYFKL+YVKHLP VAGAMIDSH TAEYWKNRTRE
Sbjct: 174 WLDATLKESVQQVLPKVLWWIHEMRGHYFKLEYVKHLPFVAGAMIDSHTTAEYWKNRTRE 233

Query: 224 RLRIKMPDTYVVHLGNSKELMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
           RL IKMP+TYVVHLGNSK+LMEVAED+VAKRVL EHVRESLGVRN+DLLFAIIN
Sbjct: 234 RLGIKMPETYVVHLGNSKDLMEVAEDSVAKRVLCEHVRESLGVRNDDLLFAIIN 287


>gi|15223628|ref|NP_173401.1| glycosyl transferase family protein [Arabidopsis thaliana]
 gi|51970562|dbj|BAD43973.1| hypothetical protein [Arabidopsis thaliana]
 gi|51970668|dbj|BAD44026.1| hypothetical protein [Arabidopsis thaliana]
 gi|115646733|gb|ABJ17098.1| At1g19710 [Arabidopsis thaliana]
 gi|332191766|gb|AEE29887.1| glycosyl transferase family protein [Arabidopsis thaliana]
          Length = 479

 Score =  346 bits (887), Expect = 7e-93,   Method: Compositional matrix adjust.
 Identities = 197/282 (69%), Positives = 220/282 (78%), Gaps = 8/282 (2%)

Query: 1   MGKHSATGWWVPLTKRWILALLIMLSISTAIAFFIRAALDPCD-----RHLEVSDKKRVQ 55
           M K S + W     KRW L +L++LS+ST     +R+  D C         E  D   ++
Sbjct: 1   MAKPSTSMWATLQKKRWPLMILLVLSVSTVGMILVRSTFDSCSVSGKRCSREKEDNSDIK 60

Query: 56  SQSVPRIATKSSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKP 115
            QSV   +   +PL FMKSKLVLLVSHELSLSGGPLLLMELAFLLRGV ++V WIT QKP
Sbjct: 61  IQSV---SGSLNPLEFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVESEVVWITNQKP 117

Query: 116 SEEDEVIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPR 175
            EEDEVI  LEHKM DRGVQVISAK Q+ I+TALK+DL+VLNTAVAGKWLDAVLK++VP+
Sbjct: 118 VEEDEVIKVLEHKMLDRGVQVISAKSQKAIDTALKSDLVVLNTAVAGKWLDAVLKDNVPK 177

Query: 176 VLPNVLWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVV 235
           VLP VLWWIHEMRGHYFK D VKHLP VAGAMIDSH TAEYWKNRT +RL IKMP TYVV
Sbjct: 178 VLPKVLWWIHEMRGHYFKPDLVKHLPFVAGAMIDSHATAEYWKNRTHDRLGIKMPKTYVV 237

Query: 236 HLGNSKELMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
           HLGNSKELMEVAED+ AK VLRE VRESLGVRNED+LF IIN
Sbjct: 238 HLGNSKELMEVAEDSFAKNVLREQVRESLGVRNEDILFGIIN 279


>gi|449491021|ref|XP_004158776.1| PREDICTED: uncharacterized LOC101206589 [Cucumis sativus]
          Length = 278

 Score =  341 bits (874), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 192/265 (72%), Positives = 218/265 (82%), Gaps = 4/265 (1%)

Query: 15  KRWILALLIMLSISTAIAFFIRAALDPC---DRHLEVSDKKRVQSQSVPRIATKS-SPLS 70
           KRW L +L ++SISTA+ FF+R   D C   D    V +K        P+   K+ +PL 
Sbjct: 14  KRWPLMILALVSISTAMVFFMRTTFDSCTGNDNRRFVEEKGIGSKILSPQNERKAPNPLD 73

Query: 71  FMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMW 130
           FMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGT+V WIT QKP E DEV+YSLE KM 
Sbjct: 74  FMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTQVVWITNQKPPEPDEVVYSLERKML 133

Query: 131 DRGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGH 190
           DRGVQV+SAK QE + TALKA L+VLNTAVAGKWLDAVLKE+VPRVLP VLWWIHEMRG+
Sbjct: 134 DRGVQVLSAKEQEAVETALKAHLVVLNTAVAGKWLDAVLKENVPRVLPKVLWWIHEMRGN 193

Query: 191 YFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVAEDN 250
           YFK++YVKHLP VAGAMIDSH TAEYWKNRT +RL I+MP+TYVVHLGNSK+LMEVAE+N
Sbjct: 194 YFKVEYVKHLPFVAGAMIDSHTTAEYWKNRTWDRLGIQMPETYVVHLGNSKDLMEVAENN 253

Query: 251 VAKRVLREHVRESLGVRNEDLLFAI 275
           VAKRVLREH+RESLGVRNED+   +
Sbjct: 254 VAKRVLREHIRESLGVRNEDICLQL 278


>gi|225431277|ref|XP_002268739.1| PREDICTED: uncharacterized protein LOC100243789 [Vitis vinifera]
          Length = 466

 Score =  341 bits (874), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 189/277 (68%), Positives = 221/277 (79%), Gaps = 5/277 (1%)

Query: 1   MGKHSATGWWVPLTKRWILALLIMLSISTAIAFFIRAALDPCDRHLEVSDKKRVQSQSVP 60
           M +HSA  +   L K+++ AL+   S+ T +    RA    CDRH   SD    + + +P
Sbjct: 1   MARHSAPRF--SLRKKFLFALVFFFSVPTVVFLIKRAPSISCDRH---SDAGVKRFEPLP 55

Query: 61  RIATKSSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDE 120
           +     SPLSFMKSK VLLVSHELSLSGGPLLLMELAFLLR VG +V WIT  KPSE DE
Sbjct: 56  QFGAAPSPLSFMKSKRVLLVSHELSLSGGPLLLMELAFLLRSVGAEVCWITNHKPSETDE 115

Query: 121 VIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNV 180
           VIYSLE+KM  RGVQV+ AKG+E IN ALKADL+VLNT +AGKWLD VLKE+VP VLP V
Sbjct: 116 VIYSLENKMQHRGVQVLPAKGREAINRALKADLLVLNTVMAGKWLDDVLKENVPHVLPKV 175

Query: 181 LWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNS 240
           LWWIHE++GHYF+L+YV+HLPLVA AMIDSHV AEYWK  TR+ LRIKMPDTYVVHLGNS
Sbjct: 176 LWWIHEIQGHYFQLEYVRHLPLVAAAMIDSHVAAEYWKTGTRDYLRIKMPDTYVVHLGNS 235

Query: 241 KELMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
           K+LM++AED+VAKRVLREHVRESLGVR+ED+LFA+IN
Sbjct: 236 KDLMDIAEDSVAKRVLREHVRESLGVRDEDVLFAMIN 272


>gi|125525278|gb|EAY73392.1| hypothetical protein OsI_01273 [Oryza sativa Indica Group]
          Length = 482

 Score =  327 bits (839), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 179/259 (69%), Positives = 208/259 (80%), Gaps = 12/259 (4%)

Query: 26  SISTAIAFFIRAA-LDPCDRHLEVSDKKRVQSQSVPRIATK------SSPLSFMKSKLVL 78
           + ++   F +R A LDPCD      D +R    S   +AT        +PL FM+SKLVL
Sbjct: 35  ACASTAGFLLRGAMLDPCD-----VDARRGSGSSAAAVATTRTGAVAGNPLEFMRSKLVL 89

Query: 79  LVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRGVQVIS 138
           LVSHELSLSGGPLLLMELAFLLR VG++V WIT Q+  E ++V YSLEHKM   GVQV+ 
Sbjct: 90  LVSHELSLSGGPLLLMELAFLLRQVGSQVVWITNQRSEETNDVTYSLEHKMLSHGVQVLP 149

Query: 139 AKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFKLDYVK 198
           A+G E I+TALKADL++LNTAVAGKWLDAVL + VP+VLP +LWWIHEMRGHYFKL+YVK
Sbjct: 150 ARGHEAIDTALKADLVILNTAVAGKWLDAVLNDHVPQVLPKILWWIHEMRGHYFKLEYVK 209

Query: 199 HLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVAEDNVAKRVLRE 258
           HLPLVAGAMIDSH TAEYWK RT +RL+I+MP TYVVHLGNSKELMEVAEDNVA+RVLRE
Sbjct: 210 HLPLVAGAMIDSHTTAEYWKTRTHDRLKIQMPQTYVVHLGNSKELMEVAEDNVARRVLRE 269

Query: 259 HVRESLGVRNEDLLFAIIN 277
           H+RE LGVR+EDL+FAIIN
Sbjct: 270 HIREFLGVRSEDLVFAIIN 288


>gi|115435790|ref|NP_001042653.1| Os01g0262600 [Oryza sativa Japonica Group]
 gi|56783832|dbj|BAD81244.1| unknown protein [Oryza sativa Japonica Group]
 gi|113532184|dbj|BAF04567.1| Os01g0262600 [Oryza sativa Japonica Group]
 gi|125569811|gb|EAZ11326.1| hypothetical protein OsJ_01190 [Oryza sativa Japonica Group]
          Length = 482

 Score =  326 bits (835), Expect = 8e-87,   Method: Compositional matrix adjust.
 Identities = 179/256 (69%), Positives = 205/256 (80%), Gaps = 11/256 (4%)

Query: 28  STAIAFFIRAALDPCDRHLEVSDKKRVQSQSVPRIATK------SSPLSFMKSKLVLLVS 81
           STA      A LDPCD      D +R    S   +AT        +PL FM+SKL+LLVS
Sbjct: 38  STAGFLLCGAMLDPCD-----VDARRGSGSSAAAVATTRTGAVAGNPLEFMRSKLMLLVS 92

Query: 82  HELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRGVQVISAKG 141
           HELSLSGGPLLLMELAFLLR VG++V WIT Q+  E ++V YSLEHKM   GVQV+ A+G
Sbjct: 93  HELSLSGGPLLLMELAFLLRQVGSQVVWITNQRSEETNDVTYSLEHKMLSHGVQVLPARG 152

Query: 142 QETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFKLDYVKHLP 201
            E I+TALKADL++LNTAVAGKWLDAVL + VP+VLP +LWWIHEMRGHYFKL+YVKHLP
Sbjct: 153 HEAIDTALKADLVILNTAVAGKWLDAVLNDHVPQVLPKILWWIHEMRGHYFKLEYVKHLP 212

Query: 202 LVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVAEDNVAKRVLREHVR 261
           LVAGAMIDSH TAEYWK RT +RL+I+MP TYVVHLGNSKELMEVAEDNVA+RVLREH+R
Sbjct: 213 LVAGAMIDSHTTAEYWKTRTHDRLKIQMPQTYVVHLGNSKELMEVAEDNVARRVLREHIR 272

Query: 262 ESLGVRNEDLLFAIIN 277
           E LGVR+EDL+FAIIN
Sbjct: 273 EFLGVRSEDLVFAIIN 288


>gi|356564312|ref|XP_003550399.1| PREDICTED: proteasome-activating nucleotidase-like [Glycine max]
          Length = 833

 Score =  321 bits (823), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 173/219 (78%), Positives = 195/219 (89%), Gaps = 1/219 (0%)

Query: 60  PRIATK-SSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEE 118
           PR  +K  +PL FMKSKLVL+VSHELSLSGGPLLLMELAFLLR  G+ V WIT QKP + 
Sbjct: 54  PRTDSKLPNPLEFMKSKLVLMVSHELSLSGGPLLLMELAFLLRSAGSDVVWITNQKPPKP 113

Query: 119 DEVIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLP 178
           D+VIY+LE+KM DRGVQV+ A+G++ ++TA  ADL++LNTAVAGKWLDAVLKE V  VLP
Sbjct: 114 DDVIYTLENKMLDRGVQVVDARGEKAVDTARNADLVILNTAVAGKWLDAVLKEKVLEVLP 173

Query: 179 NVLWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLG 238
            VLWWIHEMRGHYFK++YVKHLP VAGAMIDSH TAEYWKNRTRERL IKMP+TYVVHLG
Sbjct: 174 KVLWWIHEMRGHYFKVEYVKHLPFVAGAMIDSHTTAEYWKNRTRERLGIKMPETYVVHLG 233

Query: 239 NSKELMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
           NSKELMEVAED+VAKRVLREHVR+SLGVRN+DLLFAIIN
Sbjct: 234 NSKELMEVAEDSVAKRVLREHVRQSLGVRNDDLLFAIIN 272


>gi|414876968|tpg|DAA54099.1| TPA: hypothetical protein ZEAMMB73_503773 [Zea mays]
          Length = 473

 Score =  321 bits (822), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 173/247 (70%), Positives = 202/247 (81%), Gaps = 9/247 (3%)

Query: 32  AFFIRAAL-DPCDRHLEVSDKKRVQSQSVPRIATKSSPLSFMKSKLVLLVSHELSLSGGP 90
            F +R AL DPCD         R    ++       SPL FM+SKLVLLVSHELSLSGGP
Sbjct: 41  GFLLRGALRDPCD--------GRGDPAALNTAVASGSPLGFMRSKLVLLVSHELSLSGGP 92

Query: 91  LLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRGVQVISAKGQETINTALK 150
           LLLMELAFLLR VG++V WIT Q+  E ++V YSLEH+M + GVQV+ A+GQE ++ ALK
Sbjct: 93  LLLMELAFLLRHVGSQVVWITNQRSQETNDVTYSLEHRMLNHGVQVLPARGQEAVDIALK 152

Query: 151 ADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFKLDYVKHLPLVAGAMIDS 210
           ADL++LNTAVAGKWLD VLK+ VP+VLP +LWWIHEMRGHYFK++YVKHLP VAGAMIDS
Sbjct: 153 ADLVILNTAVAGKWLDPVLKDHVPKVLPKILWWIHEMRGHYFKVEYVKHLPFVAGAMIDS 212

Query: 211 HVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVAEDNVAKRVLREHVRESLGVRNED 270
           H TAEYW +RT +RL+I+MP TYVVHLGNSKELMEVAEDNVA+RVLREH+RESLGVR+ED
Sbjct: 213 HTTAEYWNSRTSDRLKIQMPQTYVVHLGNSKELMEVAEDNVARRVLREHIRESLGVRSED 272

Query: 271 LLFAIIN 277
           LLFAIIN
Sbjct: 273 LLFAIIN 279


>gi|357130079|ref|XP_003566684.1| PREDICTED: uncharacterized protein LOC100834487 [Brachypodium
           distachyon]
          Length = 470

 Score =  320 bits (820), Expect = 5e-85,   Method: Compositional matrix adjust.
 Identities = 174/248 (70%), Positives = 205/248 (82%), Gaps = 14/248 (5%)

Query: 32  AFFIRAAL-DPCDRHLEVSDKKRVQSQSVPRI-ATKSSPLSFMKSKLVLLVSHELSLSGG 89
            + +R AL  PCD            +Q  P + A  +SPL FM+SK+VLLVSHELSLSGG
Sbjct: 41  GYLLRGALVPPCD------------AQRDPVVLAAPASPLGFMRSKIVLLVSHELSLSGG 88

Query: 90  PLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRGVQVISAKGQETINTAL 149
           PLLLMELAFLLR VG++V WIT Q+P   ++  YSLEHKM + GVQ++ A+GQE I+TAL
Sbjct: 89  PLLLMELAFLLRQVGSQVVWITNQQPEGTNDASYSLEHKMLNHGVQILPARGQEAIDTAL 148

Query: 150 KADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFKLDYVKHLPLVAGAMID 209
           KADL++LNTAVAGKWLDAVLK+ VP+VLP +LWWIHEMRGHYFKL+YVKHLPLVAGAMID
Sbjct: 149 KADLVILNTAVAGKWLDAVLKDHVPQVLPKILWWIHEMRGHYFKLEYVKHLPLVAGAMID 208

Query: 210 SHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVAEDNVAKRVLREHVRESLGVRNE 269
           SH TAEYWK RT +RL I+MP TYVVHLGNS+ELMEVAEDNVA+RVLREH+RESLGVR+E
Sbjct: 209 SHTTAEYWKTRTHDRLNIQMPQTYVVHLGNSEELMEVAEDNVARRVLREHIRESLGVRSE 268

Query: 270 DLLFAIIN 277
           DL+FA+IN
Sbjct: 269 DLIFAVIN 276


>gi|255648383|gb|ACU24642.1| unknown [Glycine max]
          Length = 463

 Score =  316 bits (810), Expect = 6e-84,   Method: Compositional matrix adjust.
 Identities = 172/219 (78%), Positives = 194/219 (88%), Gaps = 1/219 (0%)

Query: 60  PRIATK-SSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEE 118
           PR  +K  +PL FMKSKLVL+VSHELSLSGGPLLLMELAFLLR  G+ V  IT QKP + 
Sbjct: 54  PRTDSKLPNPLEFMKSKLVLMVSHELSLSGGPLLLMELAFLLRSAGSDVVRITNQKPPKP 113

Query: 119 DEVIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLP 178
           D+VIY+LE+KM DRGVQV+ A+G++ ++TA  ADL++LNTAVAGKWLDAVLKE V  VLP
Sbjct: 114 DDVIYTLENKMLDRGVQVVDARGEKAVDTARNADLVILNTAVAGKWLDAVLKEKVLEVLP 173

Query: 179 NVLWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLG 238
            VLWWIHEMRGHYFK++YVKHLP VAGAMIDSH TAEYWKNRTRERL IKMP+TYVVHLG
Sbjct: 174 KVLWWIHEMRGHYFKVEYVKHLPFVAGAMIDSHTTAEYWKNRTRERLGIKMPETYVVHLG 233

Query: 239 NSKELMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
           NSKELMEVAED+VAKRVLREHVR+SLGVRN+DLLFAIIN
Sbjct: 234 NSKELMEVAEDSVAKRVLREHVRQSLGVRNDDLLFAIIN 272


>gi|10086496|gb|AAG12556.1|AC007797_16 Unknown Protein [Arabidopsis thaliana]
          Length = 458

 Score =  310 bits (795), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 180/262 (68%), Positives = 202/262 (77%), Gaps = 8/262 (3%)

Query: 1   MGKHSATGWWVPLTKRWILALLIMLSISTAIAFFIRAALDPCD-----RHLEVSDKKRVQ 55
           M K S + W     KRW L +L++LS+ST     +R+  D C         E  D   ++
Sbjct: 1   MAKPSTSMWATLQKKRWPLMILLVLSVSTVGMILVRSTFDSCSVSGKRCSREKEDNSDIK 60

Query: 56  SQSVPRIATKSSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKP 115
            QSV   +   +PL FMKSKLVLLVSHELSLSGGPLLLMELAFLLRGV ++V WIT QKP
Sbjct: 61  IQSV---SGSLNPLEFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVESEVVWITNQKP 117

Query: 116 SEEDEVIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPR 175
            EEDEVI  LEHKM DRGVQVISAK Q+ I+TALK+DL+VLNTAVAGKWLDAVLK++VP+
Sbjct: 118 VEEDEVIKVLEHKMLDRGVQVISAKSQKAIDTALKSDLVVLNTAVAGKWLDAVLKDNVPK 177

Query: 176 VLPNVLWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVV 235
           VLP VLWWIHEMRGHYFK D VKHLP VAGAMIDSH TAEYWKNRT +RL IKMP TYVV
Sbjct: 178 VLPKVLWWIHEMRGHYFKPDLVKHLPFVAGAMIDSHATAEYWKNRTHDRLGIKMPKTYVV 237

Query: 236 HLGNSKELMEVAEDNVAKRVLR 257
           HLGNSKELMEVAED+ AK VLR
Sbjct: 238 HLGNSKELMEVAEDSFAKNVLR 259


>gi|10120445|gb|AAG13070.1|AC023754_8 Hypothetical protein [Arabidopsis thaliana]
          Length = 402

 Score =  310 bits (794), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 160/191 (83%), Positives = 171/191 (89%)

Query: 87  SGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRGVQVISAKGQETIN 146
           S GPLLLMELAFLLRGVG  V WIT QKP E+DEV+YSLEHKM DRGVQVISAKGQ+ ++
Sbjct: 18  SRGPLLLMELAFLLRGVGADVVWITNQKPLEDDEVVYSLEHKMLDRGVQVISAKGQKAVD 77

Query: 147 TALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFKLDYVKHLPLVAGA 206
           T+LKADLIVLNTAVAGKWLDAVLKE+V +VLP +LWWIHEMRGHYF  D VKHLP VAGA
Sbjct: 78  TSLKADLIVLNTAVAGKWLDAVLKENVVKVLPKILWWIHEMRGHYFNADLVKHLPFVAGA 137

Query: 207 MIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVAEDNVAKRVLREHVRESLGV 266
           MIDSH TA YWKNRT+ RL IKMP TYVVHLGNSKELMEVAED+VAKRVLREHVRESLGV
Sbjct: 138 MIDSHATAGYWKNRTQARLGIKMPKTYVVHLGNSKELMEVAEDSVAKRVLREHVRESLGV 197

Query: 267 RNEDLLFAIIN 277
           RNEDLLF IIN
Sbjct: 198 RNEDLLFGIIN 208


>gi|297735089|emb|CBI17451.3| unnamed protein product [Vitis vinifera]
          Length = 399

 Score =  301 bits (772), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 165/206 (80%), Positives = 185/206 (89%)

Query: 72  MKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWD 131
           MKSK VLLVSHELSLSGGPLLLMELAFLLR VG +V WIT  KPSE DEVIYSLE+KM  
Sbjct: 1   MKSKRVLLVSHELSLSGGPLLLMELAFLLRSVGAEVCWITNHKPSETDEVIYSLENKMQH 60

Query: 132 RGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHY 191
           RGVQV+ AKG+E IN ALKADL+VLNT +AGKWLD VLKE+VP VLP VLWWIHE++GHY
Sbjct: 61  RGVQVLPAKGREAINRALKADLLVLNTVMAGKWLDDVLKENVPHVLPKVLWWIHEIQGHY 120

Query: 192 FKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVAEDNV 251
           F+L+YV+HLPLVA AMIDSHV AEYWK  TR+ LRIKMPDTYVVHLGNSK+LM++AED+V
Sbjct: 121 FQLEYVRHLPLVAAAMIDSHVAAEYWKTGTRDYLRIKMPDTYVVHLGNSKDLMDIAEDSV 180

Query: 252 AKRVLREHVRESLGVRNEDLLFAIIN 277
           AKRVLREHVRESLGVR+ED+LFA+IN
Sbjct: 181 AKRVLREHVRESLGVRDEDVLFAMIN 206


>gi|212275998|ref|NP_001130447.1| uncharacterized protein LOC100191545 [Zea mays]
 gi|194689152|gb|ACF78660.1| unknown [Zea mays]
 gi|414876969|tpg|DAA54100.1| TPA: hypothetical protein ZEAMMB73_503773 [Zea mays]
          Length = 399

 Score =  251 bits (642), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 114/143 (79%), Positives = 132/143 (92%)

Query: 135 QVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFKL 194
           QV+ A+GQE ++ ALKADL++LNTAVAGKWLD VLK+ VP+VLP +LWWIHEMRGHYFK+
Sbjct: 63  QVLPARGQEAVDIALKADLVILNTAVAGKWLDPVLKDHVPKVLPKILWWIHEMRGHYFKV 122

Query: 195 DYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVAEDNVAKR 254
           +YVKHLP VAGAMIDSH TAEYW +RT +RL+I+MP TYVVHLGNSKELMEVAEDNVA+R
Sbjct: 123 EYVKHLPFVAGAMIDSHTTAEYWNSRTSDRLKIQMPQTYVVHLGNSKELMEVAEDNVARR 182

Query: 255 VLREHVRESLGVRNEDLLFAIIN 277
           VLREH+RESLGVR+EDLLFAIIN
Sbjct: 183 VLREHIRESLGVRSEDLLFAIIN 205


>gi|224062083|ref|XP_002300746.1| predicted protein [Populus trichocarpa]
 gi|222842472|gb|EEE80019.1| predicted protein [Populus trichocarpa]
          Length = 222

 Score =  239 bits (611), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 112/129 (86%), Positives = 123/129 (95%)

Query: 149 LKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFKLDYVKHLPLVAGAMI 208
           +KADL+VLNTAVAGKWL+ VLKE+V +VLP VLWWIHEMRGHYFKL+YVKHLP VAGAMI
Sbjct: 1   MKADLVVLNTAVAGKWLEGVLKENVKQVLPKVLWWIHEMRGHYFKLEYVKHLPFVAGAMI 60

Query: 209 DSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVAEDNVAKRVLREHVRESLGVRN 268
           DSH TAEYWKNRTRERL IKMP+TYVVHLGNSK+LMEVAED+VAKRVLREHVRESLGVR+
Sbjct: 61  DSHTTAEYWKNRTRERLGIKMPETYVVHLGNSKDLMEVAEDSVAKRVLREHVRESLGVRD 120

Query: 269 EDLLFAIIN 277
           +DLLFAIIN
Sbjct: 121 DDLLFAIIN 129


>gi|302786982|ref|XP_002975261.1| glycosyltransferase, CAZy family GT4 [Selaginella moellendorffii]
 gi|300156835|gb|EFJ23462.1| glycosyltransferase, CAZy family GT4 [Selaginella moellendorffii]
          Length = 452

 Score =  236 bits (602), Expect = 8e-60,   Method: Compositional matrix adjust.
 Identities = 140/275 (50%), Positives = 189/275 (68%), Gaps = 12/275 (4%)

Query: 8   GWWVPLTKRWILALLIMLSISTAIAFFIRAALDPCDRHLEVSDKKRVQSQSVPRIATKSS 67
           G  +P  ++  L  L+ L ++ A  F     L      L  S   R  +  +  +  + +
Sbjct: 2   GGAIPGKRKAGLRSLLKLGVA-AFGFLAGIGL------LVGSYSVRHPAAGISGVDRRDA 54

Query: 68  P---LSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQK-PSEEDEVIY 123
           P   L FM+ K V+LVSHEL+L+GGPLLLMELA LL+  G  V W+TI K      EV  
Sbjct: 55  PPHGLHFMRGKNVVLVSHELTLTGGPLLLMELAVLLKNAGATVQWMTINKRDGAGSEVTD 114

Query: 124 SLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKE-DVPRVLPNVLW 182
           +LE ++ ++G+ ++ AKG+ET+  A+ +DL+VLNTAVAGKW+D+ LKE D  RVLP VLW
Sbjct: 115 NLEQRLQNKGILLVPAKGEETVRAAVDSDLVVLNTAVAGKWIDSTLKESDQQRVLPKVLW 174

Query: 183 WIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKE 242
           WIHEMRGHYF L+YVKH+P VA  MIDSH TAEYWKNRT++RL IK+P  +VVHLGNSK+
Sbjct: 175 WIHEMRGHYFTLNYVKHMPEVAAVMIDSHATAEYWKNRTQQRLGIKIPKVHVVHLGNSKD 234

Query: 243 LMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
           L E AE+ +A+ +LR+HVRESLG+ + D++F+ IN
Sbjct: 235 LTEAAENPLARHLLRQHVRESLGISDRDVMFSAIN 269


>gi|168008346|ref|XP_001756868.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162692106|gb|EDQ78465.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 452

 Score =  214 bits (545), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 116/223 (52%), Positives = 158/223 (70%), Gaps = 5/223 (2%)

Query: 56  SQSVPRIATKSSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWIT-IQK 114
           S S+P    K   L+FM+ K V++VSHELSLSGGPLLLMEL  +LR  G  V W+T  +K
Sbjct: 43  SSSLP----KPPGLNFMEGKSVVVVSHELSLSGGPLLLMELGHILRRSGAFVYWVTGNKK 98

Query: 115 PSEEDEVIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVP 174
            +  D V+  LE K+ + G+QVI A+G  T++    ADL++LNTAVAGKW+ +  K D+ 
Sbjct: 99  ENTSDPVVVFLEEKLLNHGLQVIPARGTRTVSALTTADLVILNTAVAGKWVSSAFKADIK 158

Query: 175 RVLPNVLWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYV 234
           ++L   LWWIHEMRGHYF  +YVK LP VAG + DSH TA+YW+ RTR+RLR+ +P  +V
Sbjct: 159 KLLAKTLWWIHEMRGHYFAPEYVKFLPEVAGVITDSHATADYWRTRTRDRLRMTLPKMHV 218

Query: 235 VHLGNSKELMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
           VHLGNS++LM  AED V +  +R+ VR+ +G+   D++FA+IN
Sbjct: 219 VHLGNSQQLMLDAEDAVGRASMRQRVRQIVGIFENDIVFAMIN 261


>gi|72536739|gb|AAZ73389.1| At1g19710 [Arabidopsis thaliana]
 gi|72536741|gb|AAZ73390.1| At1g19710 [Arabidopsis thaliana]
 gi|72536743|gb|AAZ73391.1| At1g19710 [Arabidopsis thaliana]
 gi|72536745|gb|AAZ73392.1| At1g19710 [Arabidopsis thaliana]
 gi|72536747|gb|AAZ73393.1| At1g19710 [Arabidopsis thaliana]
 gi|72536749|gb|AAZ73394.1| At1g19710 [Arabidopsis thaliana]
 gi|72536753|gb|AAZ73396.1| At1g19710 [Arabidopsis thaliana]
 gi|72536755|gb|AAZ73397.1| At1g19710 [Arabidopsis thaliana]
 gi|72536757|gb|AAZ73398.1| At1g19710 [Arabidopsis thaliana]
 gi|72536759|gb|AAZ73399.1| At1g19710 [Arabidopsis thaliana]
 gi|72536761|gb|AAZ73400.1| At1g19710 [Arabidopsis thaliana]
 gi|72536763|gb|AAZ73401.1| At1g19710 [Arabidopsis thaliana]
 gi|72536765|gb|AAZ73402.1| At1g19710 [Arabidopsis thaliana]
 gi|72536767|gb|AAZ73403.1| At1g19710 [Arabidopsis thaliana]
 gi|72536769|gb|AAZ73404.1| At1g19710 [Arabidopsis thaliana]
 gi|72536771|gb|AAZ73405.1| At1g19710 [Arabidopsis thaliana]
          Length = 231

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 89/105 (84%), Positives = 93/105 (88%)

Query: 173 VPRVLPNVLWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDT 232
           VP+VLP VLWWIHEMRGHYFK D VKHLP VAGAMIDSH TAEYWKNRT +RL IKMP T
Sbjct: 1   VPKVLPKVLWWIHEMRGHYFKPDLVKHLPFVAGAMIDSHATAEYWKNRTHDRLGIKMPKT 60

Query: 233 YVVHLGNSKELMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
           YVVHLGNSKELMEVAED+ AK VLRE VRESLGVRNED+LF IIN
Sbjct: 61  YVVHLGNSKELMEVAEDSFAKNVLREQVRESLGVRNEDILFGIIN 105


>gi|72536751|gb|AAZ73395.1| At1g19710 [Arabidopsis thaliana]
          Length = 231

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 89/105 (84%), Positives = 93/105 (88%)

Query: 173 VPRVLPNVLWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDT 232
           VP+VLP VLWWIHEMRGHYFK D VKHLP VAGAMIDSH TAEYWKNRT +RL IKMP T
Sbjct: 1   VPKVLPKVLWWIHEMRGHYFKPDLVKHLPFVAGAMIDSHATAEYWKNRTHDRLGIKMPKT 60

Query: 233 YVVHLGNSKELMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
           YVVHLGNSKELMEVAED+ AK VLRE VRESLGVRNED+LF IIN
Sbjct: 61  YVVHLGNSKELMEVAEDSFAKNVLREQVRESLGVRNEDILFGIIN 105


>gi|72536773|gb|AAZ73406.1| At1g19710-like protein [Arabidopsis lyrata]
          Length = 231

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 87/105 (82%), Positives = 93/105 (88%)

Query: 173 VPRVLPNVLWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDT 232
           VP+VLP VLWWIHEMRGHYF+ D VKHLP VAGAMIDSH TAEYWKNRT +RL I+MP T
Sbjct: 1   VPKVLPKVLWWIHEMRGHYFEPDLVKHLPFVAGAMIDSHATAEYWKNRTHDRLGIEMPKT 60

Query: 233 YVVHLGNSKELMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
           YVVHLGNSKELMEVAED+ AK VLRE VRESLGVRNED+LF IIN
Sbjct: 61  YVVHLGNSKELMEVAEDSFAKNVLREQVRESLGVRNEDILFGIIN 105


>gi|388505262|gb|AFK40697.1| unknown [Medicago truncatula]
          Length = 223

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 80/91 (87%), Positives = 84/91 (92%)

Query: 187 MRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEV 246
           MRGHYFK  YVKHLP VAGAMIDSH TA YWKNRT+E L IKMP+TYVVHLGNSKELMEV
Sbjct: 1   MRGHYFKEAYVKHLPFVAGAMIDSHTTAGYWKNRTKECLGIKMPETYVVHLGNSKELMEV 60

Query: 247 AEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
           AED+VAKRVLREHVRESLGVRN+DLLFAIIN
Sbjct: 61  AEDSVAKRVLREHVRESLGVRNDDLLFAIIN 91


>gi|302811486|ref|XP_002987432.1| hypothetical protein SELMODRAFT_426176 [Selaginella moellendorffii]
 gi|300144838|gb|EFJ11519.1| hypothetical protein SELMODRAFT_426176 [Selaginella moellendorffii]
          Length = 240

 Score = 87.0 bits (214), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 67/182 (36%), Positives = 82/182 (45%), Gaps = 58/182 (31%)

Query: 69  LSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHK 128
           L F++ ++V+LVSHEL+L+G  L         +  G  V WITI+K              
Sbjct: 34  LQFIRGRIVVLVSHELTLTG-VLSAHGARCASQNAGATVQWITIEKKE------------ 80

Query: 129 MWDRGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVL-PNVLWWIHEM 187
                                             K L A  + +V R   P VLWWI   
Sbjct: 81  ----------------------------------KRLSA--QREVDRFYAPKVLWWIP-- 102

Query: 188 RGHYFKLDYVKHLP--LVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELME 245
            GHYF L+YV   P  LVA  MIDSH  AEYW NRT++ L IK+P    VHLGNSK L E
Sbjct: 103 -GHYFTLNYVNVCPRLLVAAVMIDSHAMAEYWTNRTQQHLGIKIPK---VHLGNSKHLTE 158

Query: 246 VA 247
            A
Sbjct: 159 AA 160


>gi|302782081|ref|XP_002972814.1| glycosyltransferase, CAZy family GT4 [Selaginella moellendorffii]
 gi|300159415|gb|EFJ26035.1| glycosyltransferase, CAZy family GT4 [Selaginella moellendorffii]
          Length = 614

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 68/257 (26%), Positives = 127/257 (49%), Gaps = 29/257 (11%)

Query: 33  FFIRAALDPCD----RHLEVSDKKRVQSQSVPRIATKSSPLSFMKSKLVLLVSHELSLSG 88
           +F+   + P D    + L +S +K+  +++    + +     F   K V+++ HELS++G
Sbjct: 190 YFLGRVVGPYDELEQKFLGMSTEKKTPARTCS--SDEGRFKEFASGKRVVVLMHELSMTG 247

Query: 89  GPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRGVQVISAKGQETINTA 148
            PL +MELA  + G G KV+ + + +          L +++  R + V++ K  ++   A
Sbjct: 248 SPLAMMELASEIIGCGGKVSVVVLDRRG-------GLLNELVQRRIPVLADKAAKSWRAA 300

Query: 149 LKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFK-----LDYVKHLPLV 203
            KADL++  +A+   W+   L+    +    V+WW+ E R  YF      LD V+ L  +
Sbjct: 301 AKADLVIAGSALCASWIGEYLRYH-KKGASKVVWWVMENRRLYFDRSKRILDKVRALVFL 359

Query: 204 AGAMIDSHVTAEYWKNRTRERLRIKMPD-TYVVHLGNSKELMEVA--EDNVAKRVLREHV 260
                 S   A+ W+  +R    I +P  T +V L  +  ++  A  +D +    LRE V
Sbjct: 360 ------SKTQADQWREWSRGE-NISLPSLTTIVSLSVNDAVLSAAGIDDALKMAKLREEV 412

Query: 261 RESLGVRNEDLLFAIIN 277
           R+ LG++ +D+L A ++
Sbjct: 413 RKDLGLKPDDVLLATLS 429


>gi|449464260|ref|XP_004149847.1| PREDICTED: uncharacterized protein LOC101207532 [Cucumis sativus]
 gi|449496350|ref|XP_004160111.1| PREDICTED: uncharacterized protein LOC101223486 [Cucumis sativus]
          Length = 682

 Score = 78.2 bits (191), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 64/219 (29%), Positives = 104/219 (47%), Gaps = 35/219 (15%)

Query: 74  SKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRG 133
           S+  +L+ HELS++G P+ +MELA  L   G  V+ + + K          L  ++  R 
Sbjct: 242 SRRFVLIFHELSMTGAPISMMELATELLSCGASVSAVALSKKG-------GLMSELSRRR 294

Query: 134 VQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFK 193
           ++V+  K   +  TA+KADL++  +AV   W+D  + E  P     V WWI E R  YF 
Sbjct: 295 IKVLDDKADLSFKTAMKADLVIAGSAVCASWIDGYI-EHFPAGASQVAWWIMENRREYFN 353

Query: 194 -----LDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKM-PDTYVVHLGNSKELMEVA 247
                LD VK L  +      S + ++ W N ++E   IK+     +V L  + EL  VA
Sbjct: 354 RSKVVLDRVKMLIFI------SELQSKQWLNWSQEE-NIKLRSQPAIVPLSVNDELAFVA 406

Query: 248 EDNVA--------------KRVLREHVRESLGVRNEDLL 272
             + +              K++LR   R+ +GV + D++
Sbjct: 407 GISCSLNTESSSPEKMLEKKQLLRNTTRKEMGVGDNDVV 445


>gi|15218994|ref|NP_175651.1| UDP-glycosyltransferase-like protein [Arabidopsis thaliana]
 gi|5903062|gb|AAD55621.1|AC008016_31 Is a member of PF|00534 Glycosyl transferases group 1. EST
           gb|N96702 comes from this gene [Arabidopsis thaliana]
 gi|12323120|gb|AAG51540.1|AC037424_5 glycosyl transferase, putative; 4406-2038 [Arabidopsis thaliana]
 gi|27311817|gb|AAO00874.1| glycosyl transferase, putative [Arabidopsis thaliana]
 gi|30725510|gb|AAP37777.1| At1g52420 [Arabidopsis thaliana]
 gi|110742556|dbj|BAE99193.1| putative glycosyl transferase [Arabidopsis thaliana]
 gi|332194684|gb|AEE32805.1| UDP-glycosyltransferase-like protein [Arabidopsis thaliana]
          Length = 670

 Score = 78.2 bits (191), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 64/218 (29%), Positives = 102/218 (46%), Gaps = 33/218 (15%)

Query: 74  SKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRG 133
           S+  +L+ HELS++G P+ +MELA  L   G  V+ + + +          L  ++  R 
Sbjct: 233 SRRFVLLFHELSMTGAPISMMELASELLSCGATVSAVVLSRRG-------GLMQELSRRR 285

Query: 134 VQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFK 193
           ++V+  KG+ +  TA+KADLI+  +AV   W+D  +    P     + WWI E R  YF 
Sbjct: 286 IKVVEDKGELSFKTAMKADLIIAGSAVCTSWIDQYMNHH-PAGGSQIAWWIMENRREYFD 344

Query: 194 -----LDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVA- 247
                LD VK L  ++ +     +T   W      +LR       +V L  + EL  VA 
Sbjct: 345 RAKPVLDRVKMLIFLSESQSRQWLT---WCEEEHIKLR---SQPVIVPLSVNDELAFVAG 398

Query: 248 -------------EDNVAKRVLREHVRESLGVRNEDLL 272
                        +  V +++LRE VR  LG+ + D+L
Sbjct: 399 IPSSLNTPTLSPEKMRVKRQILRESVRTELGITDSDML 436


>gi|255574530|ref|XP_002528176.1| glycosyltransferase, putative [Ricinus communis]
 gi|223532388|gb|EEF34183.1| glycosyltransferase, putative [Ricinus communis]
          Length = 686

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 68/265 (25%), Positives = 121/265 (45%), Gaps = 38/265 (14%)

Query: 28  STAIAFFIRAALDPCDRHLEVSDKKRVQSQSVPRIATKSSPLSFMKSKLVLLVSHELSLS 87
           +T   F +       DR LE S +KR  +        K      + S+  +L+ HELS++
Sbjct: 208 NTTYGFLVGPFGSTEDRILEWSPEKRTGT-----CDRKGDFARLVWSRKFVLIFHELSMT 262

Query: 88  GGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRGVQVISAKGQETINT 147
           G PL +MELA      G  V+ + + K          L  ++  R ++V+  K   +  T
Sbjct: 263 GAPLSMMELATEFLSCGATVSAVVLSKKG-------GLMSELNRRRIKVLEDKADLSFKT 315

Query: 148 ALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFK-----LDYVKHLPL 202
           A+KADL++  +AV   W+D  +    P     ++WWI E R  YF      L+ VK L  
Sbjct: 316 AMKADLVIAGSAVCASWIDQYMTR-FPAGGSQIVWWIMENRREYFDRSKIVLNRVKMLVF 374

Query: 203 VAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVAEDNVA---------- 252
           ++ +  +  ++   W +   E+++++ P   +V L  + EL  VA    +          
Sbjct: 375 LSESQTEQWLS---WCD--EEKIKLRAPPA-IVPLSINDELAFVAGIACSLNTPSSSPEK 428

Query: 253 ----KRVLREHVRESLGVRNEDLLF 273
               +R+L + VR+ +G+ ++D+L 
Sbjct: 429 MLEKRRLLADSVRKEMGLTDDDVLL 453


>gi|224054196|ref|XP_002298139.1| predicted protein [Populus trichocarpa]
 gi|222845397|gb|EEE82944.1| predicted protein [Populus trichocarpa]
          Length = 681

 Score = 75.1 bits (183), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 73/250 (29%), Positives = 119/250 (47%), Gaps = 40/250 (16%)

Query: 43  DRHLEVSDKKRVQSQSVPRIATKSSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRG 102
           DR LE S +KR  S +  R   K +    + S+  +L+ HELS++G PL ++ELA     
Sbjct: 222 DRILEWSPEKR--SGTCDR---KGAFARLVWSRKFVLIFHELSMTGAPLSMLELATEFLS 276

Query: 103 VGTKVNWITIQKPSEEDEVIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAG 162
            G  V+ + + K          L  ++  R ++V+  +   +  TA+KADL++  +AV  
Sbjct: 277 CGATVSAVVLSKKG-------GLMPELARRRIKVLEDRADLSFKTAMKADLVIAGSAVCT 329

Query: 163 KWLDAVLKEDVPRVLPNVLWWIHEMRGHYFK-----LDYVKHLPLVAGAMIDSHVTAEYW 217
            W+D  +    P     V+WWI E R  YF      L+ VK L  ++ + +      + W
Sbjct: 330 SWIDQYIAR-FPAGGSQVVWWIMENRREYFDRSKIILNRVKMLVFLSESQM------KQW 382

Query: 218 KN-RTRERLRIKMPDTYVVHLGNSKELMEVA-------------EDNVAKR-VLREHVRE 262
           +     E +R++ P   VV L  + EL  VA             E  + KR +LRE VR+
Sbjct: 383 QTWCEEENIRLRSPPA-VVQLSVNDELAFVAGIACSLNTPTSSSEKMLEKRQLLRESVRK 441

Query: 263 SLGVRNEDLL 272
            +G+ + D+L
Sbjct: 442 EMGLTDNDML 451


>gi|297847636|ref|XP_002891699.1| glycosyl transferase family 1 protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297337541|gb|EFH67958.1| glycosyl transferase family 1 protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 670

 Score = 73.9 bits (180), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 58/212 (27%), Positives = 106/212 (50%), Gaps = 21/212 (9%)

Query: 74  SKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRG 133
           S+  +L+ HELS++G P+ +MELA  L   G  V+ + + +          L  ++  R 
Sbjct: 233 SRRFVLLFHELSMTGAPISMMELASELLSCGATVSAVVLSRRG-------GLMQELTRRR 285

Query: 134 VQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFK 193
           ++V+  KG+ +  T++KADL++  +AV   W+D  +    P     + WWI E R  YF 
Sbjct: 286 IKVVEDKGELSFKTSMKADLVIAGSAVCTSWIDQYMNHH-PAGGSQIAWWIMENRREYFD 344

Query: 194 -----LDYVKHLPLVAGAMIDSHVT---AEYWKNRTRE---RLRIKMPDTYVVHLGNSKE 242
                LD+VK L  ++ +     +T    E+ K R++     L +     +V  + +S  
Sbjct: 345 RAKPVLDHVKMLIFLSESQSRQWLTWCEEEHIKLRSQPVIVPLSVNDELAFVAGIPSSLN 404

Query: 243 LMEVAEDNV--AKRVLREHVRESLGVRNEDLL 272
              ++ + +   +++LRE VR  LG+ + D+L
Sbjct: 405 TPTLSPEKMREKRQILRESVRMELGLTDADML 436


>gi|356547414|ref|XP_003542107.1| PREDICTED: uncharacterized protein LOC100795000 [Glycine max]
          Length = 701

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 65/221 (29%), Positives = 106/221 (47%), Gaps = 39/221 (17%)

Query: 74  SKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRG 133
           S+  +L+ HELS++G PL +MELA  L   G  V+ + + +          L  ++  R 
Sbjct: 246 SRRFILIFHELSMTGAPLSMMELATELLSCGATVSAVVLSRKG-------GLMSELARRR 298

Query: 134 VQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFK 193
           ++V+  K   +  TA+KADL++  +AV   W++  + +  P     V WWI E R  YF 
Sbjct: 299 IKVLEDKSDLSFKTAMKADLVIAGSAVCASWIEQYI-DHFPAGASQVAWWIMENRREYFD 357

Query: 194 -----LDYVKHLPLVAGAMIDSHVTAEYWKNRTRE---RLRIKMPDTYVVHLGNSKELME 245
                L  VK L  +      S   ++ W+    E   +LR  +P+  +V L  ++EL  
Sbjct: 358 RSKDILHRVKMLVFL------SESQSKQWQKWCEEESIKLR-SLPE--IVALSVNEELAF 408

Query: 246 VA-------------EDNVAKR-VLREHVRESLGVRNEDLL 272
           VA             E  V K+ +LRE VR+ +G+ + D+L
Sbjct: 409 VAGIPSTLNTPSFSTEKMVEKKQLLRESVRKEMGLTDNDML 449


>gi|356555993|ref|XP_003546312.1| PREDICTED: uncharacterized protein LOC100793827 [Glycine max]
          Length = 701

 Score = 71.6 bits (174), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 63/218 (28%), Positives = 99/218 (45%), Gaps = 33/218 (15%)

Query: 74  SKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRG 133
           S+  +L+ HELS++G PL +MELA  L   G  V+ + + +          L  ++  R 
Sbjct: 246 SRRFILIFHELSMTGAPLSMMELATELLSCGATVSAVVLSRKG-------GLMSELARRR 298

Query: 134 VQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFK 193
           ++V+  K   +  TA+KADL++  +AV   W++  + E  P     V WWI E R  YF 
Sbjct: 299 IKVLEDKADLSFKTAMKADLVIAGSAVCASWIEQYI-EHFPAGASQVAWWIMENRREYFD 357

Query: 194 -----LDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVA- 247
                L  VK L  +      S   ++ W+    E          +V L  + EL  VA 
Sbjct: 358 RSKDVLHRVKMLVFL------SESQSKQWQKWCEEESIKLRSHPEIVPLSVNDELAFVAG 411

Query: 248 ------------EDNVAKR-VLREHVRESLGVRNEDLL 272
                       E  V K+ +LRE VR+ +G+ + D+L
Sbjct: 412 IPSTLNTPSFSTEKMVEKKQLLRESVRKEMGLTDNDML 449


>gi|15233237|ref|NP_188215.1| UDP-glycosyltransferase-like protein [Arabidopsis thaliana]
 gi|334185383|ref|NP_001189906.1| UDP-glycosyltransferase-like protein [Arabidopsis thaliana]
 gi|9294599|dbj|BAB02880.1| glycosyl transferases-like protein [Arabidopsis thaliana]
 gi|20147191|gb|AAM10311.1| AT3g15940/MVC8_7 [Arabidopsis thaliana]
 gi|22796166|emb|CAD45267.1| putative glycosyltransferase [Arabidopsis thaliana]
 gi|332642228|gb|AEE75749.1| UDP-glycosyltransferase-like protein [Arabidopsis thaliana]
 gi|332642229|gb|AEE75750.1| UDP-glycosyltransferase-like protein [Arabidopsis thaliana]
          Length = 697

 Score = 71.2 bits (173), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 71/249 (28%), Positives = 112/249 (44%), Gaps = 38/249 (15%)

Query: 43  DRHLEVSDKKRVQSQSVPRIATKSSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRG 102
           DR LE S +KR  S +  R   KS     + S+  +L+ HELS++G P+ +MELA  L  
Sbjct: 209 DRILEWSPQKR--SGTCDR---KSDFKRLVWSRRFVLLFHELSMTGAPISMMELASELLS 263

Query: 103 VGTKVNWITIQKPSEEDEVIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAG 162
            G  V  + + +          L  ++  R ++V+  KG+ +  TA+KADL++  +AV  
Sbjct: 264 CGATVYAVVLSRRG-------GLLQELTRRRIKVVEDKGELSFKTAMKADLVIAGSAVCA 316

Query: 163 KWLDAVLKEDVPRVLPNVLWWIHEMRGHYFK-----LDYVKHLPLVAGAMIDSHVTAEYW 217
            W+D  +    P     + WW+ E R  YF      LD VK L  +      S V ++ W
Sbjct: 317 SWIDQYMDHH-PAGGSQIAWWVMENRREYFDRAKPVLDRVKLLIFL------SEVQSKQW 369

Query: 218 KNRTRERLRIKMPDTYVVHLGNSKELMEVAEDNVA--------------KRVLREHVRES 263
                E          +V L  + EL  VA  + +              ++ LRE VR  
Sbjct: 370 LTWCEEDHVKLRSQPVIVPLSVNDELAFVAGVSSSLNTPTLTQETMKEKRQKLRESVRTE 429

Query: 264 LGVRNEDLL 272
            G+ ++D+L
Sbjct: 430 FGLTDKDML 438


>gi|297834468|ref|XP_002885116.1| glycosyl transferase family 1 protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297330956|gb|EFH61375.1| glycosyl transferase family 1 protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 696

 Score = 70.1 bits (170), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 76/250 (30%), Positives = 114/250 (45%), Gaps = 40/250 (16%)

Query: 43  DRHLEVSDKKRVQSQSVPRIATKSSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRG 102
           DR LE S +KR  S +  R   KS     + S+  +L+ HELS++G P+ +MELA  L  
Sbjct: 209 DRVLEWSPQKR--SGTCDR---KSDFKRLVWSRRFVLLFHELSMTGAPISMMELASELLS 263

Query: 103 VGTKVNWITIQKPSEEDEVIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAG 162
            G  V  + + +          L  ++  R ++V+  KG+ +  TA+KADL++  +AV  
Sbjct: 264 CGATVYAVVLSRRG-------GLLQELTRRRIKVVEDKGELSFKTAMKADLVIAGSAVCA 316

Query: 163 KWLDAVLKEDVPRVLPNVLWWIHEMRGHYFK-----LDYVKHLPLVAGAMIDSHVTAEYW 217
            W+D  +    P     + WW+ E R  YF      LD VK L  +      S V ++ W
Sbjct: 317 SWIDQYMDHH-PAGGSQIAWWVMENRREYFDRAKPVLDRVKLLIFL------SEVQSKQW 369

Query: 218 KNRTRERLRIKM-PDTYVVHLGNSKELMEVA-------------EDNVAKRV-LREHVRE 262
                E   IK+     +V L  + EL  VA             E    KR  LRE VR 
Sbjct: 370 LTWCEED-HIKLRSQPVIVPLSVNDELAFVAGIYSSLNTPTLTQEMMKEKRQKLRESVRT 428

Query: 263 SLGVRNEDLL 272
             G+ ++D+L
Sbjct: 429 EFGLTDKDML 438


>gi|357473203|ref|XP_003606886.1| Glycosyl transferases-like protein [Medicago truncatula]
 gi|355507941|gb|AES89083.1| Glycosyl transferases-like protein [Medicago truncatula]
          Length = 634

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 58/233 (24%), Positives = 108/233 (46%), Gaps = 34/233 (14%)

Query: 57  QSVPRIATKSSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPS 116
           +   +   KS     + SK  +L+ HELS++G PL +MELA  L   G  V+ + + +  
Sbjct: 177 KGCGKCDKKSEFAQVVMSKSFVLIFHELSMTGAPLSMMELATELLSCGANVSAVVLSRKG 236

Query: 117 EEDEVIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRV 176
                   L  ++  R ++VI  K   +  T++ A L++  +AV   W++  + E  P  
Sbjct: 237 -------GLMQELVRRQIKVIDDKVDHSFKTSMNAHLVIAGSAVCASWIEQYI-EYSPAA 288

Query: 177 LPNVLWWIHEMRGHYFK-----LDYVKHLPLVAGAMIDSHVTAEYWKN-RTRERLRIKMP 230
             +V+WWI E R  YF      L+ V+ L  +      S + ++ W+     E +++++ 
Sbjct: 289 ANHVVWWIMENRREYFDRSKDVLNKVRMLIFL------SELQSKKWQKWCDEESIKLRLQ 342

Query: 231 DTYV-----------VHLGNSKELMEVAEDNVAKRVLREHVRESLGVRNEDLL 272
             +V             L +S +  ++ E    +++LR  VR  LG+ + D+L
Sbjct: 343 PAHVPLSVNDKLAFSAGLHSSSDAEKIDE---KRKLLRASVRRELGLNDNDML 392


>gi|326532396|dbj|BAK05127.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 231

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 32/36 (88%), Positives = 35/36 (97%)

Query: 242 ELMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIIN 277
           EL EVAEDNVA+RVLREH+RESLGVR+EDLLFAIIN
Sbjct: 1   ELTEVAEDNVARRVLREHIRESLGVRSEDLLFAIIN 36


>gi|225442687|ref|XP_002284822.1| PREDICTED: uncharacterized protein LOC100246448 [Vitis vinifera]
          Length = 691

 Score = 67.0 bits (162), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 59/219 (26%), Positives = 101/219 (46%), Gaps = 33/219 (15%)

Query: 74  SKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRG 133
           S+  +L+ HELS++G PL +MELA  L   G  V+ + + K          L  ++  R 
Sbjct: 235 SRKFVLIFHELSMTGAPLSMMELATELLSCGATVSAVVLSKKG-------GLMPELARRR 287

Query: 134 VQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFK 193
           ++V+  +   +  TA+KADL++  +AV   W++  +          ++WWI E R  YF 
Sbjct: 288 IKVLEDRADLSFKTAMKADLVIAGSAVCASWIEQYIAHFTAGS-SQIVWWIMENRREYFD 346

Query: 194 -----LDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVAE 248
                ++ VK L  ++ +     +T   W      RL I  P   VV L  + EL  VA 
Sbjct: 347 RSKLVINRVKMLIFLSESQSKQWLT---WCKEENIRL-ISQP--AVVPLSVNDELAFVAG 400

Query: 249 DNVA--------------KRVLREHVRESLGVRNEDLLF 273
              +              +R+LR+ +R+ +G+ + D+L 
Sbjct: 401 ITCSLNTPSFTTEKMQEKRRLLRDSIRKEMGLTDTDMLL 439


>gi|147838732|emb|CAN71826.1| hypothetical protein VITISV_013841 [Vitis vinifera]
          Length = 734

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 59/219 (26%), Positives = 101/219 (46%), Gaps = 33/219 (15%)

Query: 74  SKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRG 133
           S+  +L+ HELS++G PL +MELA  L   G  V+ + + K          L  ++  R 
Sbjct: 246 SRKFVLIFHELSMTGAPLSMMELATELLSCGATVSAVVLSKKG-------GLMPELARRR 298

Query: 134 VQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFK 193
           ++V+  +   +  TA+KADL++  +AV   W++  +          ++WWI E R  YF 
Sbjct: 299 IKVLEDRADLSFKTAMKADLVIAGSAVCASWIEQYIAHFTAGS-SQIVWWIMENRREYFD 357

Query: 194 -----LDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVAE 248
                ++ VK L  ++ +     +T   W      RL I  P   VV L  + EL  VA 
Sbjct: 358 RSKLVINRVKMLIFLSESQSKQWLT---WCKEENIRL-ISQP--AVVPLSVNDELAFVAG 411

Query: 249 DNVA--------------KRVLREHVRESLGVRNEDLLF 273
              +              +R+LR+ +R+ +G+ + D+L 
Sbjct: 412 ITCSLNTPSFTTEKMQEKRRLLRDSIRKEMGLTDTDMLL 450


>gi|297743306|emb|CBI36173.3| unnamed protein product [Vitis vinifera]
          Length = 683

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 59/219 (26%), Positives = 101/219 (46%), Gaps = 33/219 (15%)

Query: 74  SKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRG 133
           S+  +L+ HELS++G PL +MELA  L   G  V+ + + K          L  ++  R 
Sbjct: 246 SRKFVLIFHELSMTGAPLSMMELATELLSCGATVSAVVLSKKG-------GLMPELARRR 298

Query: 134 VQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFK 193
           ++V+  +   +  TA+KADL++  +AV   W++  +          ++WWI E R  YF 
Sbjct: 299 IKVLEDRADLSFKTAMKADLVIAGSAVCASWIEQYIAHFTAGS-SQIVWWIMENRREYFD 357

Query: 194 -----LDYVKHLPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVAE 248
                ++ VK L  ++ +     +T   W      RL I  P   VV L  + EL  VA 
Sbjct: 358 RSKLVINRVKMLIFLSESQSKQWLT---WCKEENIRL-ISQP--AVVPLSVNDELAFVAG 411

Query: 249 DNVA--------------KRVLREHVRESLGVRNEDLLF 273
              +              +R+LR+ +R+ +G+ + D+L 
Sbjct: 412 ITCSLNTPSFTTEKMQEKRRLLRDSIRKEMGLTDTDMLL 450


>gi|356528940|ref|XP_003533055.1| PREDICTED: uncharacterized protein LOC100793124 [Glycine max]
          Length = 675

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 43/150 (28%), Positives = 73/150 (48%), Gaps = 13/150 (8%)

Query: 43  DRHLEVSDKKRVQSQSVPRIATKSSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRG 102
           DR L++S ++R ++        K      + S+  +L+ HELS++G PL +MELA  L  
Sbjct: 208 DRILQLSPQRRYET-----CDKKGEFARLVWSRRFVLIFHELSMTGAPLSMMELATELLS 262

Query: 103 VGTKVNWITIQKPSEEDEVIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAVAG 162
            G  V+ + + +          L  ++  R ++V+  K   +   A K+DL++  +AV  
Sbjct: 263 CGASVSAVVLSRKG-------GLMQELARRRIKVLDDKAYLSFKIAKKSDLVIAGSAVCA 315

Query: 163 KWLDAVLKEDVPRVLPNVLWWIHEMRGHYF 192
            W++  + E  P     V WWI E R  YF
Sbjct: 316 SWIEQYI-EHFPAGASQVAWWIMENRREYF 344


>gi|356522373|ref|XP_003529821.1| PREDICTED: uncharacterized protein LOC100796443 [Glycine max]
          Length = 681

 Score = 65.1 bits (157), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 60/119 (50%), Gaps = 8/119 (6%)

Query: 74  SKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRG 133
           S+  +L+ HELS++G PL +MELA  L   G  V+ + + +          L  ++  R 
Sbjct: 240 SRRFVLIFHELSMTGAPLSMMELATELLSCGASVSAVVLSRKG-------GLMQELARRR 292

Query: 134 VQVISAKGQETINTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYF 192
           ++V+  K   +   A KADL++  +AV   W++  + E  P     V WWI E R  YF
Sbjct: 293 IKVLDDKAYLSFKIANKADLVIAGSAVCTSWIEQYI-EHFPAGANQVAWWIMENRREYF 350


>gi|302786980|ref|XP_002975260.1| hypothetical protein SELMODRAFT_415394 [Selaginella moellendorffii]
 gi|300156834|gb|EFJ23461.1| hypothetical protein SELMODRAFT_415394 [Selaginella moellendorffii]
          Length = 125

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 29/49 (59%), Positives = 35/49 (71%), Gaps = 3/49 (6%)

Query: 200 LPLVAGAMIDSHVTAEYWKNRTRERLRIKMPDTYVVHLGNSKELMEVAE 248
           +P +A  MIDSH TAEYW NRT++ L IK+P    VHLGNSK L E A+
Sbjct: 1   MPEIAAVMIDSHATAEYWNNRTQQHLGIKIPK---VHLGNSKHLTEAAQ 46


>gi|147853759|emb|CAN83817.1| hypothetical protein VITISV_002382 [Vitis vinifera]
          Length = 97

 Score = 60.8 bits (146), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 43/101 (42%), Positives = 57/101 (56%), Gaps = 5/101 (4%)

Query: 1   MGKHSATGWWVPLTKRWILALLIMLSISTAIAFFIRAALDPCDRHLEVSDKKRVQSQSVP 60
           M +HSA  +   L K+++ AL+   S+ T +    RA    CDRH   SD    + + +P
Sbjct: 1   MARHSAPRFS--LXKKFLFALVFFFSVXTVVFLIKRAPSISCDRH---SDAGVKRFEPLP 55

Query: 61  RIATKSSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLR 101
           +     SPL FMKSK VLLVSHELSLSG  L   EL+  +R
Sbjct: 56  QFGAAPSPLXFMKSKRVLLVSHELSLSGNILESPELSVFMR 96


>gi|295104016|emb|CBL01560.1| Glycosyltransferase [Faecalibacterium prausnitzii SL3/3]
          Length = 535

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 47/172 (27%), Positives = 78/172 (45%), Gaps = 29/172 (16%)

Query: 73  KSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDR 132
           K +  L++SHEL+++G P++L+    +LR +G +V    +  PS+E  +   L     D 
Sbjct: 178 KGRRALILSHELTMTGAPIVLVSAVPVLRSLGFEV---VVLGPSDEGSLPLFL-----DA 229

Query: 133 GVQVISAKGQETINT----ALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMR 188
           G  V++     T +     A  AD ++ NT V    ++ +    VP     VLWW+H+  
Sbjct: 230 GAAVVTRPDCVTSSALWGLATSADFVLANTVVEAPVVNTLNGSFVP-----VLWWLHDAF 284

Query: 189 GHYFKLDYVKH-LPLVAG-----AMIDSHVTAEYWKNRTR---ERLRIKMPD 231
             Y    ++ H +P   G       + SH TA     R     E+L   +PD
Sbjct: 285 AGY---PFISHSIPKALGKNVHLCAVGSHATAAMHSVRPDFAIEQLIYGLPD 333


>gi|160943782|ref|ZP_02091013.1| hypothetical protein FAEPRAM212_01280 [Faecalibacterium prausnitzii
           M21/2]
 gi|158444956|gb|EDP21959.1| glycosyltransferase, group 1 family protein [Faecalibacterium
           prausnitzii M21/2]
          Length = 535

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 47/172 (27%), Positives = 78/172 (45%), Gaps = 29/172 (16%)

Query: 73  KSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDR 132
           K +  L++SHEL+++G P++L+    +LR +G +V    +  PS+E  +   L     D 
Sbjct: 178 KGRRALILSHELTMTGAPIVLVSAVPVLRSLGFEV---VVLGPSDEGSLPLFL-----DA 229

Query: 133 GVQVISAKGQETINT----ALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMR 188
           G  V++     T +     A  AD ++ NT V    ++ +    VP     VLWW+H+  
Sbjct: 230 GAAVVTRPDCVTSSALWGLATSADFVLANTVVEAPVVNTLNGSFVP-----VLWWLHDAF 284

Query: 189 GHYFKLDYVKH-LPLVAG-----AMIDSHVTAEYWKNRTR---ERLRIKMPD 231
             Y    ++ H +P   G       + SH TA     R     E+L   +PD
Sbjct: 285 AGY---PFISHSIPKALGKNVHLCAVGSHATAAMHSVRPDFEIEQLIYGLPD 333


>gi|295102222|emb|CBK99767.1| Glycosyltransferase [Faecalibacterium prausnitzii L2-6]
          Length = 532

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 39/124 (31%), Positives = 62/124 (50%), Gaps = 17/124 (13%)

Query: 72  MKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWD 131
           +K K V ++SH L ++G P++L+    +LR +G +V    +  P +   +   LE     
Sbjct: 177 VKGKRVFIMSHLLDMTGAPIVLVSAVPVLRSMGYEV---VVLGPEDGGSMPLFLE----- 228

Query: 132 RGVQVISAKG--QETI--NTALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEM 187
            G  VI+ KG  Q  +    AL ADL++ NT V  + + A+    VP     VLWW+H+ 
Sbjct: 229 AGATVITRKGCVQSPLLWGLALCADLVLANTVVEARAVRALSGARVP-----VLWWLHDA 283

Query: 188 RGHY 191
              Y
Sbjct: 284 FAGY 287


>gi|257440307|ref|ZP_05616062.1| glycosyl transferase, group 1 family [Faecalibacterium prausnitzii
           A2-165]
 gi|257197341|gb|EEU95625.1| glycosyltransferase, group 1 family protein [Faecalibacterium
           prausnitzii A2-165]
          Length = 533

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 34/122 (27%), Positives = 57/122 (46%), Gaps = 17/122 (13%)

Query: 74  SKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRG 133
            K   ++SH L ++G P++L+    +LR +G +V    +  PS+   +     H   D G
Sbjct: 179 GKRAFIMSHVLDMTGAPIVLVSAVPVLRSMGYEV---LVLGPSDGGSL-----HLFLDAG 230

Query: 134 VQVISAKGQETINT----ALKADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRG 189
             VI+      ++     AL AD ++ NT V  + + A+    VP     VLWW+H+   
Sbjct: 231 ASVITRSSCRNVSDAWGMALCADFVIANTVVMARAVRALSGTAVP-----VLWWLHDAFA 285

Query: 190 HY 191
            Y
Sbjct: 286 GY 287


>gi|313114821|ref|ZP_07800321.1| glycosyltransferase, group 1 family [Faecalibacterium cf.
           prausnitzii KLE1255]
 gi|310622876|gb|EFQ06331.1| glycosyltransferase, group 1 family [Faecalibacterium cf.
           prausnitzii KLE1255]
          Length = 531

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 45/171 (26%), Positives = 80/171 (46%), Gaps = 31/171 (18%)

Query: 75  KLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRGV 134
           K  L++SHEL+++G P++L     +LR +G +V    +  P+++  +   L     D G 
Sbjct: 180 KRALILSHELTMTGAPIVLTSAVPVLRSMGFEV---VVLGPADDGSLPLFL-----DAGA 231

Query: 135 QVISAKGQETINTAL-----KADLIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRG 189
            V++ +    +N++L      AD ++ NT V    +  +    VP     VLWW+H+   
Sbjct: 232 AVVT-RSDCVMNSSLWGLATSADFVLANTVVEAAAVSTLNGSFVP-----VLWWLHDAFA 285

Query: 190 HYFKLDYVKH-LPLVAGA-----MIDSHVTAEYWKNR---TRERLRIKMPD 231
            Y    ++ H +P   G+      + SH TA     R   + E+L   +PD
Sbjct: 286 GY---PFIAHKIPKTLGSNVHVCAVGSHATAAMHSVRPDFSIEQLIYGLPD 333


>gi|392579068|gb|EIW72195.1| hypothetical protein TREMEDRAFT_36511 [Tremella mesenterica DSM
           1558]
          Length = 718

 Score = 41.2 bits (95), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 32/118 (27%), Positives = 54/118 (45%), Gaps = 15/118 (12%)

Query: 37  AALDPCDRHLEVSDKKRVQSQSVPRIATKSSP---------LSFMKSKLVLLVSHELSLS 87
           A LDP D H++V   ++V+  S+ +    +SP          + ++ K + +     +  
Sbjct: 174 AVLDPVDHHVQVDCNRQVRLCSIVQYQQSTSPELWKRFTALATQLREKDISIAFFSATPQ 233

Query: 88  GGPLLLMELAF--LLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRGVQVISAKGQE 143
           GG + LM  A   LLR VG KVNW       E   V++ +  +     +Q ++ KG E
Sbjct: 234 GGGVALMRHALIRLLRMVGVKVNWYV----PEGHPVVFDITKRKIHNVLQGVAEKGME 287


>gi|423301147|ref|ZP_17279171.1| hypothetical protein HMPREF1057_02312 [Bacteroides finegoldii
           CL09T03C10]
 gi|408472482|gb|EKJ91010.1| hypothetical protein HMPREF1057_02312 [Bacteroides finegoldii
           CL09T03C10]
          Length = 824

 Score = 40.8 bits (94), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 41/157 (26%), Positives = 70/157 (44%), Gaps = 26/157 (16%)

Query: 73  KSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDR 132
           K+K VLL+SHELSL+G P  L+ +A +L+ +G     ++++    E E+         D 
Sbjct: 472 KTKTVLLISHELSLTGAPRALLNMAIMLKKIGATPVILSLKHGPMEKEIS--------DL 523

Query: 133 GVQVISAKGQETINTALK----------ADLIVLNTAVAGKWLDAV-LKEDVPRVLPNVL 181
           G++++  +    +N  L+           D++V NT      L+ V L E    +    +
Sbjct: 524 GIKLL-VEPFLLMNYNLRHWSLSGFLSNFDVVVFNT------LETVWLIEHFSEIKARKI 576

Query: 182 WWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAEYWK 218
            W+HE R  Y      K L ++       +   EY K
Sbjct: 577 CWLHEGRYSYAGWTRFKDLSVLFSLFDKVYAVGEYSK 613


>gi|375090843|ref|ZP_09737151.1| dihydrolipoyl dehydrogenase [Helcococcus kunzii ATCC 51366]
 gi|374565151|gb|EHR36426.1| dihydrolipoyl dehydrogenase [Helcococcus kunzii ATCC 51366]
          Length = 564

 Score = 39.7 bits (91), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 22/75 (29%), Positives = 42/75 (56%), Gaps = 3/75 (4%)

Query: 84  LSLSGGPLLLMELAFLLRGVGTKVNWITIQK---PSEEDEVIYSLEHKMWDRGVQVISAK 140
           L++ GG ++ +E AF+    G+ V  I  Q    P  + EV   +EH   ++G++V+++ 
Sbjct: 275 LTIIGGGVIGLEFAFIYNAFGSDVKLIEYQDGLLPMFDREVAEEIEHIATEKGIKVMTSS 334

Query: 141 GQETINTALKADLIV 155
             + I+  ++ DLIV
Sbjct: 335 KVKEISETVEGDLIV 349


>gi|255692926|ref|ZP_05416601.1| glycosyltransferase [Bacteroides finegoldii DSM 17565]
 gi|260621330|gb|EEX44201.1| glycosyltransferase, group 2 family protein [Bacteroides finegoldii
           DSM 17565]
          Length = 825

 Score = 39.3 bits (90), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 39/149 (26%), Positives = 68/149 (45%), Gaps = 10/149 (6%)

Query: 74  SKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRG 133
           +K VLL+SHELSL+G P  L+ +A +L+  G +   ++++    E E I  L+ K+    
Sbjct: 473 TKTVLLISHELSLTGAPRALVNMAIMLKKSGIRPVILSLKHGPMEKE-IADLDIKLLVEP 531

Query: 134 VQVISA--KGQETINTALKADLIVLNTAVAGKWLDAV-LKEDVPRVLPNVLWWIHEMRGH 190
             +++   + +         D++V NT      L+ V L E    +    + W+HE R  
Sbjct: 532 FLLMNCGLRHRALFRFLSVFDVVVFNT------LETVWLIEHFSEIEARKICWLHEGRYS 585

Query: 191 YFKLDYVKHLPLVAGAMIDSHVTAEYWKN 219
           Y      K L ++       +   EY K+
Sbjct: 586 YMGWTRFKDLSMLFSKFDKIYAVGEYSKS 614


>gi|83582735|ref|YP_425041.1| glycosyl transferase [Rhodospirillum rubrum ATCC 11170]
 gi|83578051|gb|ABC24601.1| Glycosyl transferase [Rhodospirillum rubrum ATCC 11170]
          Length = 708

 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 23/59 (38%), Positives = 31/59 (52%)

Query: 64  TKSSPLSFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTKVNWITIQKPSEEDEVI 122
           TK   L  +  + V LVSHELS +G P+LL ++A  LR  G   N I++       E I
Sbjct: 300 TKQPVLPGLDGRHVTLVSHELSYTGAPVLLRDVALALRREGVSSNVISLSSGPLAKEFI 358


>gi|15079154|ref|NP_149906.1| 443R [Invertebrate iridescent virus 6]
 gi|34223713|sp|P18305.2|Y43R_IIV6 RecName: Full=Uncharacterized protein 443R
 gi|15042523|gb|AAK82303.1|AF303741_444 443R [Invertebrate iridescent virus 6]
          Length = 2432

 Score = 38.9 bits (89), Expect = 2.5,   Method: Composition-based stats.
 Identities = 32/108 (29%), Positives = 50/108 (46%), Gaps = 5/108 (4%)

Query: 110 ITIQKPSEEDEVIYSLEHKMWDRGVQVISAKGQETINTALKADLIVLNTAV-AGKWLDAV 168
           IT+ +    D VI+ ++  + D GV V    GQ    TA+  DL V  + V  G+ +D+ 
Sbjct: 13  ITVVRGIPGDIVIFGIDGTIADSGVPVSIGSGQAKQPTAVPGDLAVFGSGVNQGQTIDSG 72

Query: 169 LKEDVPRV-LPNVLWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHVTAE 215
           L  D   V LP VLW   ++      +     LPL  G M  + ++ +
Sbjct: 73  LTVDDSSVSLPTVLWSSQKIES---LIPTGSFLPLAGGTMTGTIISKD 117


>gi|383459706|ref|YP_005373695.1| glycosyl transferase family protein [Corallococcus coralloides DSM
           2259]
 gi|380734839|gb|AFE10841.1| glycosyl transferase family protein [Corallococcus coralloides DSM
           2259]
          Length = 397

 Score = 37.7 bits (86), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 33/127 (25%), Positives = 54/127 (42%), Gaps = 24/127 (18%)

Query: 100 LRGVGTKVNWITIQKPSEEDEVIYSLEHKMWDRGVQVIS-------AKGQETINTALKAD 152
           LR +G  V W+ I +P+ + E +          GV+V+S         G ET   AL   
Sbjct: 25  LRRLGHTVGWLCIPEPAPQLESL----------GVEVLSLPHVEAAPPGIETGGEALAK- 73

Query: 153 LIVLNTAVAGKWLDAVLKEDVPRVLPNVLWWIHEMRGHYFKLDYVKHLPLVAGAMIDSHV 212
            +VL+    GKW+  +L + VP +L  V   +   R     LD +++      A++ +H 
Sbjct: 74  -LVLDDVALGKWIRGLLLDSVPTMLEPVREVVRAFRPDVMALDGMQY-----AAVLAAHT 127

Query: 213 TAEYWKN 219
               W  
Sbjct: 128 EGIPWAG 134


>gi|423614907|ref|ZP_17590741.1| inosine/xanthosine triphosphatase [Bacillus cereus VD115]
 gi|401262126|gb|EJR68271.1| inosine/xanthosine triphosphatase [Bacillus cereus VD115]
          Length = 169

 Score = 37.0 bits (84), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 20/57 (35%), Positives = 35/57 (61%), Gaps = 5/57 (8%)

Query: 226 RIKMPDTYVVHLGNSKELMEVAEDNVAKRVLREHVRESLGVRNEDLLFAIINRHRLY 282
           RIK+PD ++V L + KEL EV E+ V ++ +R H   ++G+  +D     ++R  L+
Sbjct: 100 RIKLPDDFLVPLEDGKELSEVMEEFVERKDIRSH-EGAIGIFTDDY----VDRTDLF 151


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.322    0.136    0.410 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,200,553,708
Number of Sequences: 23463169
Number of extensions: 160541553
Number of successful extensions: 405383
Number of sequences better than 100.0: 66
Number of HSP's better than 100.0 without gapping: 42
Number of HSP's successfully gapped in prelim test: 24
Number of HSP's that attempted gapping in prelim test: 405257
Number of HSP's gapped (non-prelim): 71
length of query: 282
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 141
effective length of database: 9,050,888,538
effective search space: 1276175283858
effective search space used: 1276175283858
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 76 (33.9 bits)