BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 019788
         (336 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|359481095|ref|XP_002265438.2| PREDICTED: probable glycosyltransferase At5g03795-like [Vitis
           vinifera]
          Length = 336

 Score =  591 bits (1524), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 280/333 (84%), Positives = 301/333 (90%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           MNRSF++Y YPH+R+DPFAN LLPVDFEP GNYASESYFKK  MKSHF+TKDPSKADLFF
Sbjct: 1   MNRSFKIYCYPHKRDDPFANALLPVDFEPGGNYASESYFKKVLMKSHFITKDPSKADLFF 60

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           LPFSIAR+RHD R+G  GI DFI  YIFNISQ YPYWN+TGGADHFYVACHSIGRSAM+K
Sbjct: 61  LPFSIARLRHDPRVGVGGIQDFIRDYIFNISQNYPYWNQTGGADHFYVACHSIGRSAMEK 120

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAVNS 180
           A EVKLNAIQVVCSSSYF+SG+IAHKD SLPQIWPRQ DPP L  S+R KLAFFAG++NS
Sbjct: 121 ADEVKLNAIQVVCSSSYFLSGYIAHKDASLPQIWPRQGDPPDLALSERKKLAFFAGSINS 180

Query: 181 PVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
           PVRE+LLQVWRNDSEI  H GRL TPYADELLGSKFCLHVKGFE+NTARIADSLYYGCVP
Sbjct: 181 PVRERLLQVWRNDSEISVHFGRLTTPYADELLGSKFCLHVKGFEINTARIADSLYYGCVP 240

Query: 241 VIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
           VIIANHYDLPFAD+LNWKSFSIVVATLDIPLLK++LKGIS  EYL+LQS VLKVR HFQW
Sbjct: 241 VIIANHYDLPFADILNWKSFSIVVATLDIPLLKQVLKGISLNEYLMLQSNVLKVRNHFQW 300

Query: 301 HVFPSDYDAFYMVMYELWLRRSSVRVQWSTSLD 333
           HV P DYDAFYMVMYELWLRRSSVRV  ST +D
Sbjct: 301 HVSPVDYDAFYMVMYELWLRRSSVRVPLSTPMD 333


>gi|296089301|emb|CBI39073.3| unnamed protein product [Vitis vinifera]
          Length = 467

 Score =  587 bits (1513), Expect = e-165,   Method: Compositional matrix adjust.
 Identities = 277/326 (84%), Positives = 297/326 (91%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           MNRSF++Y YPH+R+DPFAN LLPVDFEP GNYASESYFKK  MKSHF+TKDPSKADLFF
Sbjct: 120 MNRSFKIYCYPHKRDDPFANALLPVDFEPGGNYASESYFKKVLMKSHFITKDPSKADLFF 179

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           LPFSIAR+RHD R+G  GI DFI  YIFNISQ YPYWN+TGGADHFYVACHSIGRSAM+K
Sbjct: 180 LPFSIARLRHDPRVGVGGIQDFIRDYIFNISQNYPYWNQTGGADHFYVACHSIGRSAMEK 239

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAVNS 180
           A EVKLNAIQVVCSSSYF+SG+IAHKD SLPQIWPRQ DPP L  S+R KLAFFAG++NS
Sbjct: 240 ADEVKLNAIQVVCSSSYFLSGYIAHKDASLPQIWPRQGDPPDLALSERKKLAFFAGSINS 299

Query: 181 PVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
           PVRE+LLQVWRNDSEI  H GRL TPYADELLGSKFCLHVKGFE+NTARIADSLYYGCVP
Sbjct: 300 PVRERLLQVWRNDSEISVHFGRLTTPYADELLGSKFCLHVKGFEINTARIADSLYYGCVP 359

Query: 241 VIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
           VIIANHYDLPFAD+LNWKSFSIVVATLDIPLLK++LKGIS  EYL+LQS VLKVR HFQW
Sbjct: 360 VIIANHYDLPFADILNWKSFSIVVATLDIPLLKQVLKGISLNEYLMLQSNVLKVRNHFQW 419

Query: 301 HVFPSDYDAFYMVMYELWLRRSSVRV 326
           HV P DYDAFYMVMYELWLRRSSVRV
Sbjct: 420 HVSPVDYDAFYMVMYELWLRRSSVRV 445


>gi|255569522|ref|XP_002525728.1| catalytic, putative [Ricinus communis]
 gi|223535028|gb|EEF36711.1| catalytic, putative [Ricinus communis]
          Length = 336

 Score =  584 bits (1506), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 272/331 (82%), Positives = 300/331 (90%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           MNRSF++YVYPHR+NDPFANVLLPVDFEP GNYASESYFKK  MKSHF+TKDP+KADLFF
Sbjct: 1   MNRSFKIYVYPHRQNDPFANVLLPVDFEPGGNYASESYFKKVLMKSHFITKDPTKADLFF 60

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           LPFSIAR+RHD RIG EGI DFI  Y++NISQKYPYWNRTGG DHFYVACHSIGR+AM+K
Sbjct: 61  LPFSIARLRHDPRIGVEGIQDFIRAYVYNISQKYPYWNRTGGTDHFYVACHSIGRTAMEK 120

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAVNS 180
           A EVK NAIQVVCSSSY++SG+IAHKD SLPQ+WPRQ DPP L SS+R KLAFFAG++NS
Sbjct: 121 AEEVKFNAIQVVCSSSYYLSGYIAHKDASLPQVWPRQGDPPNLASSERQKLAFFAGSINS 180

Query: 181 PVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
           PVRE+LLQVWRNDSEIY H GRL T YADELLGSKFCLHVKGFEVNTARIADSLYYGCVP
Sbjct: 181 PVRERLLQVWRNDSEIYVHYGRLNTSYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240

Query: 241 VIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
           +IIANHYDLPF D+LNW+SFS+VVATLDI  LKKIL+G+SS+ Y++LQS VLKVRKHFQW
Sbjct: 241 IIIANHYDLPFTDILNWESFSVVVATLDILYLKKILQGVSSDRYVMLQSNVLKVRKHFQW 300

Query: 301 HVFPSDYDAFYMVMYELWLRRSSVRVQWSTS 331
           H  P DYDAF+MVMYELWLRRSSVRV W  S
Sbjct: 301 HFPPVDYDAFHMVMYELWLRRSSVRVLWHAS 331


>gi|224144815|ref|XP_002325425.1| predicted protein [Populus trichocarpa]
 gi|222862300|gb|EEE99806.1| predicted protein [Populus trichocarpa]
          Length = 332

 Score =  550 bits (1417), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 264/335 (78%), Positives = 289/335 (86%), Gaps = 4/335 (1%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           MNRSF++YVYPHRRNDPFANVLLPVDFEP GNYASESYFKKA MKSHF+TKDP+KADLFF
Sbjct: 1   MNRSFKIYVYPHRRNDPFANVLLPVDFEPGGNYASESYFKKALMKSHFITKDPAKADLFF 60

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           LPFSI R+RHD R+G  GI DFI  YI NIS+KYP+WNRTGGADHFY ACHSIGRSAM+K
Sbjct: 61  LPFSITRLRHDPRVGVGGIQDFIRDYILNISRKYPFWNRTGGADHFYAACHSIGRSAMEK 120

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAVNS 180
           + EVK NAIQVVCSSSYF+SG+IAHKDVS P     Q     +    R KLAFFAG++NS
Sbjct: 121 SEEVKFNAIQVVCSSSYFLSGYIAHKDVSFPGCHLSQV----VKCDYRKKLAFFAGSINS 176

Query: 181 PVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
           PVRE+LL  WRNDSEI+AH GRL TPYADELLGSKFCLHVKGFEVNTARI DSLYYGCVP
Sbjct: 177 PVRERLLHSWRNDSEIFAHFGRLTTPYADELLGSKFCLHVKGFEVNTARIGDSLYYGCVP 236

Query: 241 VIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
           VIIANHYDLPFAD+LNWKSFS+VVATLDIPLLKKILKGISS++YL+ Q  VL+VRKHFQW
Sbjct: 237 VIIANHYDLPFADILNWKSFSVVVATLDIPLLKKILKGISSDQYLMFQKKVLEVRKHFQW 296

Query: 301 HVFPSDYDAFYMVMYELWLRRSSVRVQWSTSLDSN 335
           H  P DYDAFYMVMYELWLRR+SVRV    S + N
Sbjct: 297 HCPPVDYDAFYMVMYELWLRRTSVRVSLPVSKNPN 331


>gi|357508695|ref|XP_003624636.1| Exostosin-like protein [Medicago truncatula]
 gi|87162615|gb|ABD28410.1| Exostosin-like [Medicago truncatula]
 gi|116831751|gb|ABK28848.1| exostosin-like protein [Medicago truncatula]
 gi|355499651|gb|AES80854.1| Exostosin-like protein [Medicago truncatula]
          Length = 486

 Score =  545 bits (1405), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 254/327 (77%), Positives = 286/327 (87%), Gaps = 1/327 (0%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           MNRSF++YVYPH+++DPFANVLLPV  EP GNYASESYFKKA MKSHF+TKDP+KADLFF
Sbjct: 155 MNRSFKIYVYPHKKDDPFANVLLPVKTEPSGNYASESYFKKALMKSHFITKDPTKADLFF 214

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           +PFSIA +RHDRR+G  GI DFI  Y+ N+  KYPYWNRT GADHFYVACHSIGRSAM K
Sbjct: 215 MPFSIASLRHDRRVGVGGIQDFIRDYVQNMIHKYPYWNRTNGADHFYVACHSIGRSAMDK 274

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAVNS 180
           A +VK NAIQVVCSSSYF+SG+IAHKD  LPQIWPR E+PP L SS R KLAFFAG VNS
Sbjct: 275 APDVKFNAIQVVCSSSYFLSGYIAHKDACLPQIWPRNENPPNLVSSNRKKLAFFAGEVNS 334

Query: 181 PVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
           PVR  L++ W+ND+EI+ H+GRLKTPY DELLGSKFC HV+G+EVNTARI DSLYYGCVP
Sbjct: 335 PVRINLVETWKNDTEIFVHNGRLKTPYGDELLGSKFCFHVRGYEVNTARIGDSLYYGCVP 394

Query: 241 VIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGI-SSEEYLLLQSYVLKVRKHFQ 299
           VIIAN+YDLPFADVLNWKSFS+VV TLDIPLLKKILKGI +S EYL+LQ  VLKVR+HFQ
Sbjct: 395 VIIANYYDLPFADVLNWKSFSVVVTTLDIPLLKKILKGIVNSGEYLMLQKNVLKVREHFQ 454

Query: 300 WHVFPSDYDAFYMVMYELWLRRSSVRV 326
           WH  P D+DAFYMVMYELWLRRSS+ +
Sbjct: 455 WHSPPIDFDAFYMVMYELWLRRSSIPI 481


>gi|356571729|ref|XP_003554026.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
           max]
          Length = 487

 Score =  544 bits (1402), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 259/333 (77%), Positives = 286/333 (85%), Gaps = 1/333 (0%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           MNRS ++YVYPHR +DPFANVLLPV+ EP GNY SESYFKK  MKSHF+TKDP +ADLFF
Sbjct: 153 MNRSLKIYVYPHREDDPFANVLLPVESEPGGNYTSESYFKKVLMKSHFITKDPPEADLFF 212

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           LPFS+AR+ HDRR+G  GI DFI  YI NIS +YPYWN TGGADHFYVACHSIGRSAM K
Sbjct: 213 LPFSMARLWHDRRVGVGGIQDFIRDYIHNISHRYPYWNNTGGADHFYVACHSIGRSAMDK 272

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAVNS 180
           A + K NAIQVVCSSSYF++G+ AHKD  LPQIWPR+ +PP L SSKR +LAFFAG VNS
Sbjct: 273 APDEKFNAIQVVCSSSYFLTGYFAHKDACLPQIWPRKGNPPNLVSSKRKRLAFFAGGVNS 332

Query: 181 PVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
           PVR KLL+ W+NDSEI+ H GRLKTPYADELLGSKFCLHVKGFEVNTARI DSLYYGCVP
Sbjct: 333 PVRVKLLETWKNDSEIFVHHGRLKTPYADELLGSKFCLHVKGFEVNTARIGDSLYYGCVP 392

Query: 241 VIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKG-ISSEEYLLLQSYVLKVRKHFQ 299
           VIIAN+YDLPFADVLNWKSFS+VV TLDIPLLKKILK  ISS +YL+LQS VLKVRKHFQ
Sbjct: 393 VIIANYYDLPFADVLNWKSFSVVVTTLDIPLLKKILKDIISSNKYLMLQSNVLKVRKHFQ 452

Query: 300 WHVFPSDYDAFYMVMYELWLRRSSVRVQWSTSL 332
           WH  P D+DAFYMVMYELWLRRSS++  W  S 
Sbjct: 453 WHSPPQDFDAFYMVMYELWLRRSSIKNTWVDSF 485


>gi|449527873|ref|XP_004170933.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
           sativus]
          Length = 466

 Score =  521 bits (1343), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 238/326 (73%), Positives = 282/326 (86%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           MN+SF++YVYPH+R+DPFA  LLP +FEP GNYASESYFKK+ +KSHF+T DP +AD FF
Sbjct: 137 MNKSFKIYVYPHKRSDPFARSLLPENFEPHGNYASESYFKKSLIKSHFITNDPKEADFFF 196

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           LPFSI  +R+DRR+   GIP+FI  YIF++S KYPYWNRTGGADHFYVACHS+GRSAM K
Sbjct: 197 LPFSITGLRNDRRVSVSGIPNFIRDYIFDVSHKYPYWNRTGGADHFYVACHSVGRSAMDK 256

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAVNS 180
           + E K + +QVVCSSSYF++G+I+HKD +LPQIWPR+EDP  L SSKR +LAFFAGA+NS
Sbjct: 257 SSEAKSSIVQVVCSSSYFLTGYISHKDAALPQIWPRKEDPSNLASSKRTRLAFFAGAMNS 316

Query: 181 PVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
           P R+ L+QVW  DSEI+A+SGRLKTPYADELL SKFCLHVKGFEVNTAR+ DS++YGCVP
Sbjct: 317 PTRQALVQVWGKDSEIFAYSGRLKTPYADELLRSKFCLHVKGFEVNTARVGDSIFYGCVP 376

Query: 241 VIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
           VIIAN+YDLPF D+LNWKSFSIVV T DIP LK+ILKGI+ EEY  LQS VLKVRKHF+W
Sbjct: 377 VIIANYYDLPFGDILNWKSFSIVVTTSDIPRLKEILKGINDEEYARLQSNVLKVRKHFKW 436

Query: 301 HVFPSDYDAFYMVMYELWLRRSSVRV 326
           H  P DYD F+MVMY+LWLRR+SVR+
Sbjct: 437 HSSPVDYDTFHMVMYQLWLRRTSVRL 462


>gi|449446746|ref|XP_004141132.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
           sativus]
          Length = 466

 Score =  518 bits (1334), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 237/326 (72%), Positives = 281/326 (86%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           MN+SF++YVYPH+R+DPFA  LLP +FEP GNYASESYFKK+ +KSHF+T DP +AD F 
Sbjct: 137 MNKSFKIYVYPHKRSDPFARSLLPENFEPHGNYASESYFKKSLIKSHFITNDPKEADFFS 196

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           LPFSI  +R+DRR+   GIP+FI  YIF++S KYPYWNRTGGADHFYVACHS+GRSAM K
Sbjct: 197 LPFSITGLRNDRRVSVSGIPNFIRDYIFDVSHKYPYWNRTGGADHFYVACHSVGRSAMDK 256

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAVNS 180
           + E K + +QVVCSSSYF++G+I+HKD +LPQIWPR+EDP  L SSKR +LAFFAGA+NS
Sbjct: 257 SSEAKSSIVQVVCSSSYFLTGYISHKDAALPQIWPRKEDPSNLASSKRTRLAFFAGAMNS 316

Query: 181 PVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
           P R+ L+QVW  DSEI+A+SGRLKTPYADELL SKFCLHVKGFEVNTAR+ DS++YGCVP
Sbjct: 317 PTRQALVQVWGKDSEIFAYSGRLKTPYADELLRSKFCLHVKGFEVNTARVGDSIFYGCVP 376

Query: 241 VIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
           VIIAN+YDLPF D+LNWKSFSIVV T DIP LK+ILKGI+ EEY  LQS VLKVRKHF+W
Sbjct: 377 VIIANYYDLPFGDILNWKSFSIVVTTSDIPRLKEILKGINDEEYARLQSNVLKVRKHFKW 436

Query: 301 HVFPSDYDAFYMVMYELWLRRSSVRV 326
           H  P DYD F+MVMY+LWLRR+SVR+
Sbjct: 437 HSSPVDYDTFHMVMYQLWLRRTSVRL 462


>gi|302804767|ref|XP_002984135.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300147984|gb|EFJ14645.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 376

 Score =  427 bits (1097), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 196/326 (60%), Positives = 250/326 (76%), Gaps = 1/326 (0%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M  + +V+VYP    DPF+++ LP    P GNYASE+YFKKA   S  VT DPS+ADLFF
Sbjct: 47  MVNTMKVFVYPCSPRDPFSHIFLPTSSAPSGNYASEAYFKKALAGSGMVTDDPSQADLFF 106

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           +PFSI R+R+D ++G   +P F+  Y+ NIS ++PYWNRTGG+DHFYVACHSIG+ A++K
Sbjct: 107 MPFSITRLRNDPKVGVGRMPAFVRDYVKNISHRWPYWNRTGGSDHFYVACHSIGKVALEK 166

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS-SKRNKLAFFAGAVN 179
           A  VKLNAIQVVCSS+Y++ G I HKDV++PQIWPR E   ++ +  +R  LAFFAG  N
Sbjct: 167 AQHVKLNAIQVVCSSNYYVQGFIPHKDVAIPQIWPRSESFREIKTIEQRKVLAFFAGGSN 226

Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCV 239
           SPVR  +++ WRND++I+A+  R++  YA+ LL SKFCLHVKG+EVNTAR+ D+ +YGCV
Sbjct: 227 SPVRANVVRTWRNDTQIHAYPSRIQGSYAEALLRSKFCLHVKGYEVNTARLGDAFFYGCV 286

Query: 240 PVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQ 299
           PV+IANHYDLPF+ VLNWKSFS+VV T +IP LK IL GIS E+Y  +   VL  R+HFQ
Sbjct: 287 PVVIANHYDLPFSSVLNWKSFSVVVTTANIPKLKAILSGISREDYSQMHRLVLDARRHFQ 346

Query: 300 WHVFPSDYDAFYMVMYELWLRRSSVR 325
           WH  P +YDAFYMVMY+LWLRR  VR
Sbjct: 347 WHAPPREYDAFYMVMYQLWLRRHVVR 372


>gi|302780894|ref|XP_002972221.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300159688|gb|EFJ26307.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 376

 Score =  425 bits (1092), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 194/326 (59%), Positives = 251/326 (76%), Gaps = 1/326 (0%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M  + +V+VYP    DPF+++ LP    P GNYASE+YFKKA  +S  VT DPS+ADLFF
Sbjct: 47  MVNTMKVFVYPCSPRDPFSHIFLPTSSAPSGNYASEAYFKKALAESGMVTDDPSQADLFF 106

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           +PFSI R+R+D ++G   +P F+  Y+ NIS ++PYWNRTGG+DHFYVACHSIG+ A++K
Sbjct: 107 MPFSITRLRNDPKVGVGRMPAFVRDYVKNISHRWPYWNRTGGSDHFYVACHSIGKVALEK 166

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS-SKRNKLAFFAGAVN 179
           A  V+LNAIQVVCSS+Y++ G I HKDV++PQIWPR E   ++ +  +R  LAFFAG  N
Sbjct: 167 AQHVRLNAIQVVCSSNYYVQGFIPHKDVAMPQIWPRSESFREIKTIEQRKVLAFFAGGSN 226

Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCV 239
           SPVR  +++ WRND++I+A+  R++  YA+ LL SKFCLHVKG+EVNTAR+ D+ +YGCV
Sbjct: 227 SPVRANVVRTWRNDTQIHAYPSRIQGSYAEALLRSKFCLHVKGYEVNTARLGDAFFYGCV 286

Query: 240 PVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQ 299
           PV+IAN+YDLPF+ VLNWKSFS+VV T +IP LK IL GIS E+Y  +   VL  R+HFQ
Sbjct: 287 PVVIANYYDLPFSSVLNWKSFSVVVTTANIPKLKAILSGISREDYSQMHRLVLDARRHFQ 346

Query: 300 WHVFPSDYDAFYMVMYELWLRRSSVR 325
           WH  P +YDAFYMVMY+LWLRR  VR
Sbjct: 347 WHAPPREYDAFYMVMYQLWLRRHVVR 372


>gi|356560377|ref|XP_003548469.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
           At5g03795-like [Glycine max]
          Length = 334

 Score =  421 bits (1082), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 204/271 (75%), Positives = 227/271 (83%), Gaps = 9/271 (3%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           MNRS  ++VYPHR +D FANVLLPV+ +P GNYASESYFKK  MKSHF+TKDP++ADLFF
Sbjct: 63  MNRSLXIHVYPHREDDSFANVLLPVESKPGGNYASESYFKKVPMKSHFITKDPTEADLFF 122

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           LPFSIAR+RH+RR+G  G  DFI  YI NIS KYPYWNRTGGADHFYVACHSIGRSAM K
Sbjct: 123 LPFSIARLRHNRRVGVGGKQDFIRDYIQNISHKYPYWNRTGGADHFYVACHSIGRSAMDK 182

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAVNS 180
           A +VK NAIQVVCSSSYF++G+IAHKD  LPQIWPR+ +PP L SSKR +LAFFAG VNS
Sbjct: 183 APDVKFNAIQVVCSSSYFLTGNIAHKDTCLPQIWPRKGNPPILVSSKRKRLAFFAGGVNS 242

Query: 181 PVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
           PVR KLL+ W+NDSEI+ H GRLKTPYADELLGSKF LHVKGFEVNT RI          
Sbjct: 243 PVRVKLLETWKNDSEIFVHHGRLKTPYADELLGSKFGLHVKGFEVNTTRIG--------- 293

Query: 241 VIIANHYDLPFADVLNWKSFSIVVATLDIPL 271
           VIIAN+YDLPFADVLNWKSFS+VV TLDI L
Sbjct: 294 VIIANYYDLPFADVLNWKSFSVVVTTLDIQL 324


>gi|168059172|ref|XP_001781578.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666988|gb|EDQ53629.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 344

 Score =  393 bits (1010), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 177/327 (54%), Positives = 242/327 (74%), Gaps = 2/327 (0%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M R  R++VYPH R DPF  +    +  P GNYASE +F+++ + S F+TK  S+AD FF
Sbjct: 14  MKRKLRIFVYPHDRKDPFHMIFESGNKVPSGNYASEEFFQQSLLTSTFLTKTASEADFFF 73

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           +P SI + R D+RI   G+  F ++YI ++  ++ YWNR+ GADHFY++CHSI R+AM +
Sbjct: 74  MPVSITKARMDKRINVGGLQSFCANYITDVRSQWSYWNRSNGADHFYLSCHSIARNAMDR 133

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPR-QEDPPKLGS-SKRNKLAFFAGAV 178
             +V+ NAIQ++C +SYF+  +I HKD S+PQIWPR  ++P ++ + ++R +LAFFAGA+
Sbjct: 134 VPDVRQNAIQLLCPASYFLPSYITHKDASVPQIWPRLGKEPEEVRTITQRKRLAFFAGAL 193

Query: 179 NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGC 238
           NSPVR+ L + W NDS+I  H GR+  PY++ LL +KFCLH KGFEVNTAR+ D++YYGC
Sbjct: 194 NSPVRKDLERTWANDSKILVHKGRVPYPYSEALLTTKFCLHAKGFEVNTARLGDAMYYGC 253

Query: 239 VPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
           VPV+IAN+YDLPF D+L+W  FSIVV++LDIPLLKK L+ ++ E+Y  L   VL  RKHF
Sbjct: 254 VPVVIANYYDLPFQDILDWTKFSIVVSSLDIPLLKKTLEAVTDEQYAELHRQVLLARKHF 313

Query: 299 QWHVFPSDYDAFYMVMYELWLRRSSVR 325
           QWH  P +YDAF+ VMYELW RR  VR
Sbjct: 314 QWHAPPEEYDAFHTVMYELWKRRHIVR 340


>gi|302790387|ref|XP_002976961.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300155439|gb|EFJ22071.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 408

 Score =  333 bits (853), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 169/331 (51%), Positives = 228/331 (68%), Gaps = 6/331 (1%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLP-VDFEPR--GNYASESYFKKAFMKSHFVTKDPSKAD 57
           M +SF+VYVYP   +D ++ V LP  D   R  GN+ SE  FK   + S F T+DP +A 
Sbjct: 77  MVKSFKVYVYPFGNSD-YSQVFLPHPDPYDRKLGNFFSEHMFKINLLNSTFATRDPGEAH 135

Query: 58  LFFLPFSIARMRHDRRIGTEG-IPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRS 116
           LFF+PFSI  MR+  RI +E  I  F+  Y+  ISQ+Y +WNRT G DHFYV CHS+GR+
Sbjct: 136 LFFMPFSINAMRNHPRIRSEAMISSFVESYVEEISQRYKFWNRTEGVDHFYVGCHSVGRN 195

Query: 117 AMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAG 176
           A   +  ++ NAIQV CS++Y+   ++ HKDV+LPQ+WPR  D   +   KR KLAFF+G
Sbjct: 196 AASNSRALQQNAIQVTCSANYYQKLYVPHKDVALPQVWPRPLDTFIVPPEKRTKLAFFSG 255

Query: 177 -AVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLY 235
            A NS +RE LL++W NDS++   +G ++  Y D L  SKFCLHVKG+EVNTARI+D+L+
Sbjct: 256 RAQNSHLRETLLKLWSNDSDMDIFAGTMQGSYEDALSRSKFCLHVKGYEVNTARISDALH 315

Query: 236 YGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVR 295
           +GCVPVII+N YDLP ++VLNW+SFSIV++   IP LK  L+ ++ +EY  L S   +V+
Sbjct: 316 FGCVPVIISNQYDLPLSNVLNWRSFSIVLSYTQIPALKAKLQSVTHDEYARLWSNGRRVK 375

Query: 296 KHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
           +HF WH  P +YD+F M MYELW +R  VR 
Sbjct: 376 RHFGWHHSPREYDSFQMTMYELWSKRHFVRA 406


>gi|255543306|ref|XP_002512716.1| catalytic, putative [Ricinus communis]
 gi|223548677|gb|EEF50168.1| catalytic, putative [Ricinus communis]
          Length = 426

 Score =  331 bits (848), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 162/336 (48%), Positives = 223/336 (66%), Gaps = 8/336 (2%)

Query: 1   MNRSFRVYVYPHRRN--DPFANVLLPVD---FEPR-GNYASESYFKKAFMKSHFVTKDPS 54
           M +  +++VY    N   PFAN+ LP++     P+ GNY SE  FK A ++S  VT DP+
Sbjct: 91  MKQQLKIFVYSDVSNKSSPFANIFLPIENPFHHPKLGNYFSEHIFKVALLRSSLVTLDPA 150

Query: 55  KADLFFLPFSIARMRHDRRIGTE-GIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
           KA  FFLPFSI  +R+D R  +E  I +F++HY   ISQ++ YWN + GADHFYV CHS+
Sbjct: 151 KALFFFLPFSINNLRNDPRFHSEESISEFVAHYTTTISQRFSYWNASAGADHFYVCCHSV 210

Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAF 173
           GR A  +   +  NAIQ+ CSSSYF    ++HKDV LPQ+WPR           R++L +
Sbjct: 211 GRQAASRHPALHNNAIQLTCSSSYFQRFFVSHKDVGLPQVWPRPPQTALNPPHARHRLVY 270

Query: 174 FAGAV-NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIAD 232
           FAG V NS VR +L+ +W ND+E+   +G    PY +    SK+CLHVKG+EVNTAR++D
Sbjct: 271 FAGRVQNSQVRRELVNLWGNDTEMDIINGSPSFPYEEGFKRSKYCLHVKGYEVNTARVSD 330

Query: 233 SLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVL 292
           S++YGC+PVII+N+YDLPFA VL+W  FS+V+   DIP LK  L  I+ + Y+ +   + 
Sbjct: 331 SIHYGCIPVIISNYYDLPFATVLDWSKFSVVINQADIPFLKTTLLAITRKTYITMFQNLC 390

Query: 293 KVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQW 328
           +VR+HF+WH  P  YD+FYM  Y+LWLRRS  R+ +
Sbjct: 391 RVRRHFEWHTTPKGYDSFYMTAYQLWLRRSIHRLSY 426


>gi|224053619|ref|XP_002297901.1| predicted protein [Populus trichocarpa]
 gi|222845159|gb|EEE82706.1| predicted protein [Populus trichocarpa]
          Length = 457

 Score =  320 bits (820), Expect = 5e-85,   Method: Compositional matrix adjust.
 Identities = 160/348 (45%), Positives = 224/348 (64%), Gaps = 20/348 (5%)

Query: 1   MNRSFRVYVYPH--RRNDPFANVLLP----VDFEPRGNYASESYFKKAFMKSHFVTKDPS 54
           M R  +++VYP    R+ PFAN+ LP    ++    GNY SE  FK + + S  +T  P 
Sbjct: 110 MMRHLKIFVYPDTFNRSSPFANIFLPHENPLNNPKLGNYFSEHMFKVSLLHSPLLTATPE 169

Query: 55  KADLFFLPFSIARMRHDRRIGTEG-IPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
           KA  FFLPFSI  +R+D R+ +E  I  F++ Y  +IS  + +WN +GGADHFYV CHS+
Sbjct: 170 KAHFFFLPFSINDLRNDPRVHSEAKISQFVAQYTSSISSSFRFWNASGGADHFYVCCHSV 229

Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQE----DPPKLGSS--- 166
           GR A  +   ++ NAIQ+ C SSYF   +++HKDV LPQ+WPR +    +PP        
Sbjct: 230 GREAPSRHHGLRNNAIQLTCCSSYFQRFYLSHKDVGLPQVWPRTDQTALNPPHASVCYLD 289

Query: 167 -----KRNKLAFFAGAV-NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHV 220
                 R++L +FAG V NS VR++L+ +W ND++    +G    PY +    SKFCLHV
Sbjct: 290 VNTYRCRHRLVYFAGRVQNSQVRQQLVNLWGNDTQFDIFNGNPTFPYEEGFKRSKFCLHV 349

Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
           KG+EVNTAR++D+++YGC+PVII+N+YDLPFA+VL+W  FS+V+   DI  LK  L  I 
Sbjct: 350 KGYEVNTARVSDAIHYGCIPVIISNYYDLPFANVLDWSKFSVVINQRDIAFLKTKLLSIK 409

Query: 281 SEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQW 328
            E YL +   + KVR+HF WH  P  YD+FYM  Y+LWLRRS++R+ +
Sbjct: 410 REMYLRMYHNLFKVRRHFVWHTTPRGYDSFYMTAYQLWLRRSTLRLSY 457


>gi|359473867|ref|XP_002268876.2| PREDICTED: probable glycosyltransferase At5g03795-like [Vitis
           vinifera]
 gi|296085575|emb|CBI29307.3| unnamed protein product [Vitis vinifera]
          Length = 449

 Score =  319 bits (818), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 161/341 (47%), Positives = 225/341 (65%), Gaps = 17/341 (4%)

Query: 1   MNRSFRVYVYP----HRRNDPFANVLLPVD--FEPR-GNYASESYFKKAFMKSHFVTKDP 53
           M    +++VYP    ++ + PFA V LP    F P+  NY SE  FK A ++S  +T  P
Sbjct: 113 MLHKLKIFVYPDASMNQSSSPFARVFLPNPNPFHPKLANYFSEHMFKVALLRSSLLTPHP 172

Query: 54  SKADLFFLPFSIARMRHDRRIGTEG-IPDFISHYIFNISQKYPYWNRTGGADHFYVACHS 112
             A  FFLPFS+  +R+D R+ +E  I DF++ Y   IS +Y +WN + G DHFY+ CHS
Sbjct: 173 QDAHFFFLPFSVNTLRNDPRVHSEASISDFVTQYTTRISWEYKFWNASRGTDHFYICCHS 232

Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQED----PPKLGSSKR 168
           +GR A  K  ++  NAIQV CSSSYF   +I+HKDV LPQ+WPR  +    PP+L    R
Sbjct: 233 VGREAASKHHDLHNNAIQVTCSSSYFQRLYISHKDVGLPQVWPRPPEKLLNPPEL----R 288

Query: 169 NKLAFFAGAV-NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNT 227
           +KL FFAG V NS +R++L+ VW ND++I   SG    PY + L  SK+CLHVKG+EVNT
Sbjct: 289 HKLVFFAGRVQNSHIRQELMAVWGNDTDIDLFSGSPPFPYEEGLRKSKYCLHVKGYEVNT 348

Query: 228 ARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLL 287
           AR+ D+++YGC+PVI++N+YDLPF++VL+W  FS++++   I  LKKIL  IS ++YL +
Sbjct: 349 ARVCDAIHYGCIPVIVSNYYDLPFSNVLDWSKFSVIISHKSIATLKKILLSISKQKYLSM 408

Query: 288 QSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQW 328
              +  VR+HF WH  P  YD+F+M  Y+LWLRR   R+ +
Sbjct: 409 YQNLCLVRRHFAWHTTPRGYDSFHMTAYQLWLRRGVHRLSY 449


>gi|357138738|ref|XP_003570946.1| PREDICTED: probable glycosyltransferase At5g11130-like
           [Brachypodium distachyon]
          Length = 405

 Score =  271 bits (692), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 148/332 (44%), Positives = 207/332 (62%), Gaps = 14/332 (4%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M RSF+VY+YP    DP      P      G YASE YF +   +S F T+DP  ADLFF
Sbjct: 80  MERSFKVYIYPD--GDPKTFYQTPRKLT--GKYASEGYFFQNIRESRFRTEDPDSADLFF 135

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           +P S  +MR  +    E +   +  Y+  +  KYPYWNRT GADHF+V CH +G  A + 
Sbjct: 136 VPISPHKMR-GKGTSYENMTIIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGVRAFEG 194

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNK--LAFFAGAV 178
              +  N+I+VVCS SY +   I HKDV+LPQ+  +    PK G+   N+  L F+AG  
Sbjct: 195 LPFMVKNSIRVVCSPSYNVD-FIPHKDVALPQVL-QPFALPKGGNDVENRTNLGFWAGHR 252

Query: 179 NSPVREKLLQVWRNDSEIYAHSGRL-----KTPYADELLGSKFCLHVKGFEVNTARIADS 233
           NS +R  L +VW ND+E+   + R+     +  Y  +   +KFC+   G +VN+ARI+DS
Sbjct: 253 NSKIRVILARVWENDTELAISNNRISRAIGELVYQKQFYRTKFCICPGGSQVNSARISDS 312

Query: 234 LYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK 293
           ++YGCVPVI++++YDLPF DVL+WK F+IV+   D+  LK ILK IS EE++ L + +++
Sbjct: 313 IHYGCVPVILSDYYDLPFNDVLDWKKFAIVLKERDVYELKSILKSISQEEFVALHNSLVQ 372

Query: 294 VRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           ++KHF WH  P  YDAF+MVMYELWLR   ++
Sbjct: 373 IQKHFVWHSPPIPYDAFHMVMYELWLRHHVIK 404


>gi|224084457|ref|XP_002307304.1| predicted protein [Populus trichocarpa]
 gi|222856753|gb|EEE94300.1| predicted protein [Populus trichocarpa]
          Length = 406

 Score =  270 bits (690), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 142/331 (42%), Positives = 204/331 (61%), Gaps = 12/331 (3%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M R+F++++YP    DP  N       +  G YASE YF +   +S F T+DP +A LFF
Sbjct: 81  MERNFKIFIYPD--GDP--NTFYQTPRKLTGKYASEGYFFQNIRESRFQTQDPDQAHLFF 136

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           +P S  +MR  + I  E +   + +Y+ ++  KYPYWNRT GADHF+V CH +G  A + 
Sbjct: 137 IPISCHKMR-GKGISYENMTIIVDNYVESLKSKYPYWNRTLGADHFFVTCHDVGVRATEG 195

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS-SKRNKLAFFAGAVN 179
              +  NAI+VVCS SY + G I HKD++LPQ+      P       KR  L F+AG  N
Sbjct: 196 VPFLIKNAIRVVCSPSYDV-GFIPHKDIALPQVLQPFALPAGGNDVEKRTTLGFWAGHRN 254

Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIADSL 234
           S +R  L +VW ND+E+   + R+        Y     GSK+C+   G +VN+ARIADS+
Sbjct: 255 SRIRVILARVWENDTELDISNNRINRATGHLVYQKRFYGSKYCICPGGSQVNSARIADSI 314

Query: 235 YYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
           +YGC+PVI++N+YDLPF D+L+W  FS+++   D+  LK+ILK I   + + L   ++KV
Sbjct: 315 HYGCIPVILSNYYDLPFNDILDWHKFSVILKEQDVYRLKQILKDIPDNKLVSLHKNLVKV 374

Query: 295 RKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           +KHFQW+  P  YDAF+MVMY+LWLR   ++
Sbjct: 375 QKHFQWNSPPVKYDAFHMVMYDLWLRHHVIK 405


>gi|359495862|ref|XP_002266299.2| PREDICTED: probable glycosyltransferase At5g03795 [Vitis vinifera]
          Length = 594

 Score =  270 bits (690), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 146/332 (43%), Positives = 206/332 (62%), Gaps = 14/332 (4%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M ++F+VY+YP    DP  N       +  G YASE YF +    S F T DP +A LFF
Sbjct: 269 MEKNFKVYIYPD--GDP--NTFYQTPRKLTGKYASEGYFFQNIRDSRFRTNDPDQAHLFF 324

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           +P S  +MR  +    E +   + +Y+ ++  KYPYWNRT GADHF+V CH +G  A + 
Sbjct: 325 IPISCHKMR-GKGTSYENMTVIVQNYVGSLISKYPYWNRTLGADHFFVTCHDVGVRATEG 383

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS-SKRNKLAFFAGAVN 179
              +  N+I+VVCS SY + G I HKDV+LPQ+      P        R  L F+AG  N
Sbjct: 384 VPFLVKNSIRVVCSPSYDV-GFIPHKDVALPQVLQPFALPAGGNDIENRTTLGFWAGHRN 442

Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTPYADELL------GSKFCLHVKGFEVNTARIADS 233
           S +R  L +VW ND+E+   + R+    A ELL       +KFC+   G +VN+ARIADS
Sbjct: 443 SKIRVILARVWENDTELDIMNNRINRA-AGELLYQKRFYRTKFCICPGGSQVNSARIADS 501

Query: 234 LYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK 293
           ++YGCVPVI++N+YDLPF D+L+W+ FS+V+  LD+  LK+ILK I   E++ L + +++
Sbjct: 502 IHYGCVPVILSNYYDLPFNDILDWRKFSVVLKELDVYQLKQILKDIPDAEFVALHNNLVQ 561

Query: 294 VRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           V+KHFQW+  P  YDAF+M+MYELWLR   ++
Sbjct: 562 VQKHFQWNSPPIRYDAFHMIMYELWLRHQVIK 593


>gi|296090519|emb|CBI40850.3| unnamed protein product [Vitis vinifera]
          Length = 416

 Score =  270 bits (690), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 146/332 (43%), Positives = 206/332 (62%), Gaps = 14/332 (4%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M ++F+VY+YP    DP  N       +  G YASE YF +    S F T DP +A LFF
Sbjct: 91  MEKNFKVYIYPD--GDP--NTFYQTPRKLTGKYASEGYFFQNIRDSRFRTNDPDQAHLFF 146

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           +P S  +MR  +    E +   + +Y+ ++  KYPYWNRT GADHF+V CH +G  A + 
Sbjct: 147 IPISCHKMR-GKGTSYENMTVIVQNYVGSLISKYPYWNRTLGADHFFVTCHDVGVRATEG 205

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS-SKRNKLAFFAGAVN 179
              +  N+I+VVCS SY + G I HKDV+LPQ+      P        R  L F+AG  N
Sbjct: 206 VPFLVKNSIRVVCSPSYDV-GFIPHKDVALPQVLQPFALPAGGNDIENRTTLGFWAGHRN 264

Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTPYADELL------GSKFCLHVKGFEVNTARIADS 233
           S +R  L +VW ND+E+   + R+    A ELL       +KFC+   G +VN+ARIADS
Sbjct: 265 SKIRVILARVWENDTELDIMNNRINRA-AGELLYQKRFYRTKFCICPGGSQVNSARIADS 323

Query: 234 LYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK 293
           ++YGCVPVI++N+YDLPF D+L+W+ FS+V+  LD+  LK+ILK I   E++ L + +++
Sbjct: 324 IHYGCVPVILSNYYDLPFNDILDWRKFSVVLKELDVYQLKQILKDIPDAEFVALHNNLVQ 383

Query: 294 VRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           V+KHFQW+  P  YDAF+M+MYELWLR   ++
Sbjct: 384 VQKHFQWNSPPIRYDAFHMIMYELWLRHQVIK 415


>gi|255563758|ref|XP_002522880.1| catalytic, putative [Ricinus communis]
 gi|223537865|gb|EEF39480.1| catalytic, putative [Ricinus communis]
          Length = 406

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 142/331 (42%), Positives = 205/331 (61%), Gaps = 12/331 (3%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M R+F+VY+YP    DP  N       +  G YASE YF +   +S F T DP +A LFF
Sbjct: 81  MERNFKVYIYPD--GDP--NTFYQTPRKLTGKYASEGYFFQNIRESKFRTNDPDQAHLFF 136

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           +P S  +MR  +    E +   + +Y+ +++ KYPYWNRT GADHF+V CH +G  A + 
Sbjct: 137 IPISCHKMR-GKGTSYENMTIIVQNYVESLAVKYPYWNRTLGADHFFVTCHDVGVRATEG 195

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS-SKRNKLAFFAGAVN 179
              +  NAI+VVCS SY + G I HKDV+LPQ+      P        R  L F+AG  N
Sbjct: 196 VPFLVKNAIRVVCSPSYDV-GFIPHKDVALPQVLQPFALPAGGNDLENRTTLGFWAGHRN 254

Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIADSL 234
           S +R  L +VW ND+E+   + R+        Y      +KFC+   G +VN+ARIADS+
Sbjct: 255 SKIRVILARVWENDTELDISNNRINRATGHLVYQKRFYRTKFCICPGGSQVNSARIADSI 314

Query: 235 YYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
           +YGCVPVI++++YDLPF D+L+WK FS+++   D+  LK++LK IS +E++ L   +++V
Sbjct: 315 HYGCVPVILSDYYDLPFNDILDWKRFSVILKEKDVYRLKQVLKDISDDEFVALHENLVEV 374

Query: 295 RKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           +KHFQW+  P  YDAF+MVM++LWLR   ++
Sbjct: 375 QKHFQWNSPPIKYDAFHMVMFDLWLRHHVIK 405


>gi|356546741|ref|XP_003541781.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
           max]
          Length = 410

 Score =  268 bits (684), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 145/332 (43%), Positives = 202/332 (60%), Gaps = 13/332 (3%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M   F+V+VYP    DP      P      G YASE YF K   +S F T DP +A LFF
Sbjct: 84  MEEEFKVFVYPD--GDPETYFHTPRKLT--GKYASEGYFFKNIRESRFFTDDPRRAHLFF 139

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           LP S  +MR  R +  E + D +  Y+ ++  +YPYWNRT GADHF+V CH IG  A + 
Sbjct: 140 LPISCHKMR-GRGLTNERMIDEVEKYVEHLKFEYPYWNRTLGADHFFVTCHDIGVKATKG 198

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK-RNKLAFFAGAVN 179
              +  N+I+V+CSS Y   G+I HKDV+LPQ+      PP     K RN LAF+AG  +
Sbjct: 199 VPHMMKNSIRVICSSRYDDDGYIPHKDVTLPQVQLPFFHPPGGNDIKNRNTLAFWAGRSD 258

Query: 180 SPVREKLLQVWRNDSEIYAHSGRLK------TPYADELLGSKFCLHVKGFEVNTARIADS 233
           S ++E L+ +W ND+EI   + R+         Y ++L  SKFCL   G  + ++RIADS
Sbjct: 259 SRLKEDLIAIWDNDTEIDIQNSRVDLRATGPVVYMEKLYKSKFCLCPHG-PIGSSRIADS 317

Query: 234 LYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK 293
           +++GCVPVI++ +YDLPF D+L+W  FSIV+   D+  LK  L+ IS + ++ L   ++K
Sbjct: 318 IHFGCVPVIMSKYYDLPFNDILDWTQFSIVLKETDVYQLKYTLRSISEKHFITLNHNIVK 377

Query: 294 VRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           ++KHF+W+  P   DAF+MVMYELW RR  +R
Sbjct: 378 IQKHFKWNTPPVRQDAFHMVMYELWRRRHLIR 409


>gi|297802132|ref|XP_002868950.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297314786|gb|EFH45209.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 428

 Score =  267 bits (682), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 140/331 (42%), Positives = 201/331 (60%), Gaps = 12/331 (3%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M + F+VY+YP    DP  N       +  G YASE YF +   +S F T DP +ADLFF
Sbjct: 103 MEKRFKVYIYPD--GDP--NTFYQTPRKVTGKYASEGYFFQNIRESRFRTLDPEEADLFF 158

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           +P S  +MR  +    E +   + +Y+  +  KYPYWNRT GADHF+V CH +G  A + 
Sbjct: 159 IPISCHKMR-GKGTSYENMTVIVQNYVDGLIAKYPYWNRTLGADHFFVTCHDVGVRAFEG 217

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS-SKRNKLAFFAGAVN 179
           +  +  N I+VVCS SY + G I HKDV+LPQ+      P        R  L F+AG  N
Sbjct: 218 SLLLIKNTIRVVCSPSYNV-GFIPHKDVALPQVLQPFALPAGGNDVENRTTLGFWAGHRN 276

Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIADSL 234
           S +R  L +VW ND+E+   + R+        Y      +KFC+   G +VN+ARI DS+
Sbjct: 277 SKIRVILARVWENDTELDISNNRINRATGHLVYQKRFYRTKFCICPGGSQVNSARITDSI 336

Query: 235 YYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
           +YGC+PVI++++YDLPF D+LNW+ F++V+   D+  LK+ILK I   E++ L + ++KV
Sbjct: 337 HYGCIPVILSDYYDLPFNDILNWRKFAVVLREQDVYNLKQILKNIPHSEFISLHNNLVKV 396

Query: 295 RKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           +KHFQW+  P  +DAF+M+MYELWLR   ++
Sbjct: 397 QKHFQWNSPPVKFDAFHMIMYELWLRHHVIK 427


>gi|449431924|ref|XP_004133750.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
           sativus]
          Length = 412

 Score =  267 bits (682), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 143/331 (43%), Positives = 203/331 (61%), Gaps = 12/331 (3%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M   F+VY+YP    DP  N       +  G YASE YF +   +S F T+DP +A LFF
Sbjct: 87  MESKFKVYIYPD--GDP--NTFYQTPRKLTGKYASEGYFFQNIRESRFRTEDPDQAHLFF 142

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           +P S  +MR  +    E +   + +Y+  +  KYPYWNRT GADHF+V CH +G  A + 
Sbjct: 143 IPISCHKMR-GKGTSYENMTVIVQNYVEGLISKYPYWNRTLGADHFFVTCHDVGVRASEG 201

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK-RNKLAFFAGAVN 179
              +  NAI+VVCS SY + G I HKDV+LPQ+      P     ++ R  L F+AG  N
Sbjct: 202 LPFLIKNAIRVVCSPSYDV-GFIPHKDVALPQVLQPFALPAGGNDTENRTTLGFWAGHRN 260

Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIADSL 234
           S +R  L +VW ND+E+   + R+        Y      +KFC+   G +VN+ARIADS+
Sbjct: 261 SKIRVILARVWENDTELDISNNRISRATGHLLYQKRFYKTKFCICPGGSQVNSARIADSI 320

Query: 235 YYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
           +YGCVPVI++++YDLPF D+L+W+ FS++V   D+  LK+ILK IS  E++ L   +++V
Sbjct: 321 HYGCVPVILSDYYDLPFNDILDWRKFSVIVKERDVYQLKQILKDISDIEFIKLHKNLMQV 380

Query: 295 RKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           +KHFQW+  P  YDAF+MVMY+LWLR   ++
Sbjct: 381 QKHFQWNSPPIKYDAFHMVMYDLWLRHHVIK 411


>gi|115444695|ref|NP_001046127.1| Os02g0187200 [Oryza sativa Japonica Group]
 gi|46390046|dbj|BAD15422.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
 gi|46390077|dbj|BAD15452.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
 gi|113535658|dbj|BAF08041.1| Os02g0187200 [Oryza sativa Japonica Group]
 gi|215694561|dbj|BAG89554.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 402

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 145/332 (43%), Positives = 205/332 (61%), Gaps = 14/332 (4%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M RSF+VY+YP    DP      P      G YASE YF +   +S F T DP KA LFF
Sbjct: 77  MERSFKVYMYPD--GDPKTFYQTPRKLT--GKYASEGYFFQNIRESRFRTGDPDKAHLFF 132

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           +P S  +MR  +    E +   +  Y+  +  KYPYWNRT GADHF+V CH +G  A + 
Sbjct: 133 VPISPHKMR-GKGTSYENMTIIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGVRAFEG 191

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNK--LAFFAGAV 178
              +  N+I+VVCS SY +   I HKD++LPQ+  +    P+ G+   N+  L F+AG  
Sbjct: 192 LPFMVKNSIRVVCSPSYNVD-FIPHKDIALPQVL-QPFALPEGGNDVENRTILGFWAGHR 249

Query: 179 NSPVREKLLQVWRNDSEIYAHSGRL-----KTPYADELLGSKFCLHVKGFEVNTARIADS 233
           NS +R  L QVW ND+E+   + R+     +  Y  +   +KFC+   G +VN+ARI+DS
Sbjct: 250 NSKIRVILAQVWENDTELAISNNRISRAIGELVYQKQFYHTKFCICPGGSQVNSARISDS 309

Query: 234 LYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK 293
           ++YGCVPVI++++YDLPF D+L+W+ F++V+   D+  LK ILK IS EE++ L   +++
Sbjct: 310 IHYGCVPVILSDYYDLPFNDILDWRKFAVVLKERDVYQLKSILKSISQEEFVELHKSLVQ 369

Query: 294 VRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           V+KHF WH  P  YDAF+MVMYELWLR   ++
Sbjct: 370 VQKHFVWHSPPLPYDAFHMVMYELWLRHHVIK 401


>gi|15233650|ref|NP_195517.1| Exostosin family protein [Arabidopsis thaliana]
 gi|4467110|emb|CAB37544.1| putative protein [Arabidopsis thaliana]
 gi|7270787|emb|CAB80469.1| putative protein [Arabidopsis thaliana]
 gi|15293111|gb|AAK93666.1| unknown protein [Arabidopsis thaliana]
 gi|21280961|gb|AAM45007.1| unknown protein [Arabidopsis thaliana]
 gi|332661466|gb|AEE86866.1| Exostosin family protein [Arabidopsis thaliana]
          Length = 425

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 141/331 (42%), Positives = 200/331 (60%), Gaps = 12/331 (3%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M + F+VY+YP    DP  N       +  G YASE YF +   +S F T DP +ADLFF
Sbjct: 100 MEKRFKVYIYPD--GDP--NTFYQTPRKVTGKYASEGYFFQNIRESRFRTLDPDEADLFF 155

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           +P S  +MR  +    E +   + +Y+  +  KYPYWNRT GADHF+V CH +G  A + 
Sbjct: 156 IPISCHKMR-GKGTSYENMTVIVQNYVDGLIAKYPYWNRTLGADHFFVTCHDVGVRAFEG 214

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS-SKRNKLAFFAGAVN 179
           +  +  N I+VVCS SY + G I HKDV+LPQ+      P        R  L F+AG  N
Sbjct: 215 SPLLIKNTIRVVCSPSYNV-GFIPHKDVALPQVLQPFALPAGGNDVENRTTLGFWAGHRN 273

Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIADSL 234
           S +R  L  VW ND+E+   + R+        Y      +KFC+   G +VN+ARI DS+
Sbjct: 274 SKIRVILAHVWENDTELDISNNRINRATGHLVYQKRFYRTKFCICPGGSQVNSARITDSI 333

Query: 235 YYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
           +YGC+PVI++++YDLPF D+LNW+ F++V+   D+  LK+ILK I   E++ L + ++KV
Sbjct: 334 HYGCIPVILSDYYDLPFNDILNWRKFAVVLREQDVYNLKQILKNIPHSEFVSLHNNLVKV 393

Query: 295 RKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           +KHFQW+  P  +DAF+M+MYELWLR   V+
Sbjct: 394 QKHFQWNSPPVKFDAFHMIMYELWLRHHVVK 424


>gi|356496874|ref|XP_003517290.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
           max]
          Length = 404

 Score =  265 bits (677), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 139/331 (41%), Positives = 203/331 (61%), Gaps = 12/331 (3%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M + F+VY+YP    DP  N       +  G YASE YF +   +S F T++P +A LFF
Sbjct: 79  MEKKFKVYIYPD--GDP--NTFYQTPRKLTGKYASEGYFFQNIRESRFCTENPDEAHLFF 134

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           +P S  +MR  +    E +   + +Y+ ++  KYPYWNRT GADHF+V CH +G  A + 
Sbjct: 135 IPISCHKMR-GKGTSYENMTIIVQNYVESLISKYPYWNRTLGADHFFVTCHDVGVRATEG 193

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS-SKRNKLAFFAGAVN 179
              +  N+I+ VCS SY + G I HKDV+LPQ+      P        R  L F+AG  N
Sbjct: 194 LEFLVKNSIRAVCSPSYDV-GFIPHKDVALPQVLQPFALPAGGNDIENRTTLGFWAGHRN 252

Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIADSL 234
           S +R  L +VW ND+E+   + R+        Y      SKFC+   G +VN+ARIADS+
Sbjct: 253 SKIRVILARVWENDTELDISNNRISRATGHLVYQKRFYRSKFCICPGGSQVNSARIADSI 312

Query: 235 YYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
           +YGC+PVI++N+YDLPF D+L+W  F++V+   D+  LK+ILK IS  E++ L + ++KV
Sbjct: 313 HYGCIPVILSNYYDLPFNDILDWNKFAVVLKESDVYQLKQILKNISDAEFVTLHNNLVKV 372

Query: 295 RKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           +KHFQW+  P  +DAF++VMY+LWLR  +++
Sbjct: 373 QKHFQWNSPPIRFDAFHLVMYDLWLRHHTIK 403


>gi|356557595|ref|XP_003547101.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
           At5g03795-like [Glycine max]
          Length = 412

 Score =  265 bits (677), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 143/332 (43%), Positives = 201/332 (60%), Gaps = 13/332 (3%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M   F+++VYP    DP      P      G YASE YF K   +S F T DP +A LFF
Sbjct: 86  MEEEFKIFVYPD--GDPETYFHTPRKLT--GKYASEGYFFKNIRESRFFTDDPRRAHLFF 141

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           LP S  +MR  R +  E + D +  Y+ ++  KYPYWNRT GADHF+V CH IG  A + 
Sbjct: 142 LPISCHKMR-GRGLTIERMIDEVEKYVEHLKLKYPYWNRTLGADHFFVTCHDIGVKATKG 200

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK-RNKLAFFAGAVN 179
              +  N+I+V CSSSY    ++ HKDV+LPQ+      PP     K RN  AF+AG  +
Sbjct: 201 VPHLTKNSIRVACSSSYDDDDYVPHKDVTLPQVQLPFFHPPGENDIKNRNTFAFWAGRSD 260

Query: 180 SPVREKLLQVWRNDSEIYAHSGRLK------TPYADELLGSKFCLHVKGFEVNTARIADS 233
           S +++ L+ +W ND+E+   + R+         Y ++L  SKFCL   G  V  + IADS
Sbjct: 261 SRLKDDLMAMWDNDTELDIQNXRVDLRATGPVVYMEKLYKSKFCLCPHG-PVGNSLIADS 319

Query: 234 LYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK 293
           +++GCVPVI+ N+YDLPF D+L+W  FS+V+   +I LLK IL+ IS + ++ L   ++K
Sbjct: 320 IHFGCVPVIMPNYYDLPFNDILDWSQFSVVLKETNIYLLKDILRSISEKHFISLNRNIVK 379

Query: 294 VRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           ++KHF+W+  P   DAF+MVMYE+WLRR  +R
Sbjct: 380 IQKHFKWNTPPVRQDAFHMVMYEIWLRRHLIR 411


>gi|357485421|ref|XP_003612998.1| Exostosin-like protein [Medicago truncatula]
 gi|355514333|gb|AES95956.1| Exostosin-like protein [Medicago truncatula]
          Length = 415

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 139/334 (41%), Positives = 203/334 (60%), Gaps = 18/334 (5%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPR---GNYASESYFKKAFMKSHFVTKDPSKAD 57
           M + F+VY+YP   +  F          PR   G YASE YF +   +S F T DP +A 
Sbjct: 90  MEKKFKVYIYPDGDSKTFYQT-------PRKLTGKYASEGYFFQNIRESRFRTLDPDEAH 142

Query: 58  LFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSA 117
           LFF+P S  +MR  +    E +   + +Y+ ++  KYPYWNRT GADHF+V CH +G  A
Sbjct: 143 LFFIPISCHKMR-GKGTSYENMTIIVQNYVESLISKYPYWNRTLGADHFFVTCHDVGVRA 201

Query: 118 MQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS-SKRNKLAFFAG 176
            +    +  N+I+ VCS SY + G I HKDV+LPQ+      P        R  L F+AG
Sbjct: 202 TEGLPLLVKNSIRAVCSPSYDV-GFIPHKDVALPQVLQPFALPAGGNDVENRTSLGFWAG 260

Query: 177 AVNSPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIA 231
             NS +R  L +VW ND+E+   + R+        Y      +KFC+   G +VN+ARIA
Sbjct: 261 HRNSKIRVILARVWENDTELDISNNRISRATGHLVYQKRFYSTKFCICPGGSQVNSARIA 320

Query: 232 DSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYV 291
           DS++YGC+PVI++N+YDLPF D+L+W+ F++V+   D+  LK+ILK  S +E++ L + +
Sbjct: 321 DSIHYGCIPVILSNYYDLPFNDILDWRKFAVVLKESDVYQLKQILKNKSQDEFVALHNNL 380

Query: 292 LKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           +K++KHFQW+  P  YDAF+MVMY+LWLR  +++
Sbjct: 381 VKIQKHFQWNSPPVRYDAFHMVMYDLWLRHHTIQ 414


>gi|296088784|emb|CBI38234.3| unnamed protein product [Vitis vinifera]
          Length = 326

 Score =  264 bits (674), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 141/331 (42%), Positives = 201/331 (60%), Gaps = 12/331 (3%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M   F+V+VYP    DP      P      G YASE YF +   +S F T DP +A LFF
Sbjct: 1   MEMKFKVFVYPD--GDPNTYYQTPRKLT--GKYASEGYFFQNIRQSQFRTDDPDQAHLFF 56

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           +P S  +MR  +    E +   + +Y+ ++  KYPYWNRT GADHF++ CH +G  A + 
Sbjct: 57  IPISCHKMR-GKGTSYENMTIIVQNYVQSLMSKYPYWNRTLGADHFFLTCHDVGVRATEG 115

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK-RNKLAFFAGAVN 179
              +  N+I+VVCS SY + G I HKDV+LPQ+      P      K R  L F+AG  N
Sbjct: 116 VPLLVKNSIRVVCSPSYDV-GFIPHKDVALPQVLQPFALPTGGRDIKNRTTLGFWAGHRN 174

Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIADSL 234
           S +R  L ++W ND+E+   + R+        Y ++   +KFC+   G +VN+ARIADS+
Sbjct: 175 SKIRVILARIWENDTELDIKNNRINRATGHLVYQNKFYRTKFCICPGGSQVNSARIADSI 234

Query: 235 YYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
           +YGCVPVI++++YDLPF D+L+W+ FS+++   D+  LK ILK I   E++ L   ++KV
Sbjct: 235 HYGCVPVILSDYYDLPFNDILDWRKFSVILKERDVYRLKYILKDIPDAEFIALHDNLVKV 294

Query: 295 RKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           +KHFQW+  P  YDAF+MVMYELWLR   ++
Sbjct: 295 QKHFQWNTPPIKYDAFHMVMYELWLRHHVIK 325


>gi|168019297|ref|XP_001762181.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162686585|gb|EDQ72973.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 344

 Score =  264 bits (674), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 149/334 (44%), Positives = 203/334 (60%), Gaps = 17/334 (5%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M   F+VYVYP    DP      P      G Y+SE YF +   +S FVT D + ADLFF
Sbjct: 18  MEEKFKVYVYPD--GDPETYYQTPRKLT--GKYSSEGYFFQNLRESRFVTNDSAAADLFF 73

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           LP S  +MR  + +  E + D +  Y+ ++  KYP+WNRT GADHF+V CH +G  A  K
Sbjct: 74  LPVSCHKMR-GKGLSYEKMADIVRAYVESLIIKYPFWNRTVGADHFFVTCHDVGVRATAK 132

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSS---KRNKLAFFAGA 177
              +  N+I+VVCS SY  S  I HKDV+LPQ+   Q  P   G      R  L F+AG 
Sbjct: 133 VEHLVKNSIRVVCSPSYNGS-FIPHKDVALPQVL--QPFPLPAGGDDIHNRTVLGFWAGH 189

Query: 178 VNSPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIAD 232
            NS VR  L   W+ D  ++  + RL        Y ++   SKFC+   G +VN+ARIA+
Sbjct: 190 RNSKVRVNLADAWQYDPILFVANNRLNRSTGDYIYQNQFYRSKFCICPAGSQVNSARIAE 249

Query: 233 SLYYGCVP-VIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYV 291
           S++YGCVP VI+A+ YDLPF D+L+W+ FS+VV   +   LKKIL+ ++ ++Y +L + V
Sbjct: 250 SIHYGCVPEVIMADFYDLPFNDILDWRKFSLVVREREYDNLKKILQAVTVQKYRMLHAGV 309

Query: 292 LKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
            +VR+HF+WH  P  YDAF+MVMYELWLRR ++R
Sbjct: 310 RQVRRHFEWHSPPIKYDAFHMVMYELWLRRFTIR 343


>gi|225464075|ref|XP_002268836.1| PREDICTED: probable glycosyltransferase At5g03795-like [Vitis
           vinifera]
          Length = 417

 Score =  263 bits (673), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 141/331 (42%), Positives = 201/331 (60%), Gaps = 12/331 (3%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M   F+V+VYP    DP      P      G YASE YF +   +S F T DP +A LFF
Sbjct: 92  MEMKFKVFVYPD--GDPNTYYQTPRKLT--GKYASEGYFFQNIRQSQFRTDDPDQAHLFF 147

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           +P S  +MR  +    E +   + +Y+ ++  KYPYWNRT GADHF++ CH +G  A + 
Sbjct: 148 IPISCHKMR-GKGTSYENMTIIVQNYVQSLMSKYPYWNRTLGADHFFLTCHDVGVRATEG 206

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK-RNKLAFFAGAVN 179
              +  N+I+VVCS SY + G I HKDV+LPQ+      P      K R  L F+AG  N
Sbjct: 207 VPLLVKNSIRVVCSPSYDV-GFIPHKDVALPQVLQPFALPTGGRDIKNRTTLGFWAGHRN 265

Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIADSL 234
           S +R  L ++W ND+E+   + R+        Y ++   +KFC+   G +VN+ARIADS+
Sbjct: 266 SKIRVILARIWENDTELDIKNNRINRATGHLVYQNKFYRTKFCICPGGSQVNSARIADSI 325

Query: 235 YYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
           +YGCVPVI++++YDLPF D+L+W+ FS+++   D+  LK ILK I   E++ L   ++KV
Sbjct: 326 HYGCVPVILSDYYDLPFNDILDWRKFSVILKERDVYRLKYILKDIPDAEFIALHDNLVKV 385

Query: 295 RKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           +KHFQW+  P  YDAF+MVMYELWLR   ++
Sbjct: 386 QKHFQWNTPPIKYDAFHMVMYELWLRHHVIK 416


>gi|224071347|ref|XP_002303415.1| predicted protein [Populus trichocarpa]
 gi|222840847|gb|EEE78394.1| predicted protein [Populus trichocarpa]
          Length = 326

 Score =  263 bits (673), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 142/331 (42%), Positives = 200/331 (60%), Gaps = 12/331 (3%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M  +F++Y+YP    DP  N       +  G YASE YF +   +S F T DP +A LFF
Sbjct: 1   MEHNFKIYIYPD--GDP--NTFYQTPRKLTGKYASEGYFFQNIRESRFRTLDPDQAHLFF 56

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           +P S  +MR  +    E +   + +Y+ ++  KY YWNRT GADHF+V CH +G  A + 
Sbjct: 57  IPISCHKMR-GKGTSYENMTVIVENYVESLIAKYSYWNRTLGADHFFVTCHDVGVRATEG 115

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS-SKRNKLAFFAGAVN 179
              +  NAI+VVCS SY + G I HKDV+LPQ+      P        R  L F+AG  N
Sbjct: 116 VPFLIKNAIRVVCSPSYDV-GFIPHKDVALPQVLQPFALPAGGNDVENRTTLGFWAGHRN 174

Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIADSL 234
           S +R  L +VW ND+E+   S R+        Y     G+KFC+   G +VN+ARIADS+
Sbjct: 175 SRIRVILARVWENDTELDISSNRINRATGHLVYQKRFYGTKFCICPGGSQVNSARIADSI 234

Query: 235 YYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
           +YGCVPVI++N+YDLPF D+L+W  FS+++   D+  LK+ILK I   +++ L   ++KV
Sbjct: 235 HYGCVPVILSNYYDLPFNDILDWHKFSVILKEQDVYQLKQILKDIPDNQFVSLHKNLIKV 294

Query: 295 RKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           +KHFQW+  P   DAF+MVMY+LWLRR  ++
Sbjct: 295 QKHFQWNSPPIKNDAFHMVMYDLWLRRHVIK 325


>gi|242064274|ref|XP_002453426.1| hypothetical protein SORBIDRAFT_04g005890 [Sorghum bicolor]
 gi|241933257|gb|EES06402.1| hypothetical protein SORBIDRAFT_04g005890 [Sorghum bicolor]
          Length = 403

 Score =  263 bits (673), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 142/332 (42%), Positives = 204/332 (61%), Gaps = 14/332 (4%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M RSF+VY+YP    DP      P      G YASE YF +   +S F T DP +A LFF
Sbjct: 78  MERSFKVYIYPD--GDPKTFYQTPRKLT--GKYASEGYFFQNIRESRFRTDDPDQAHLFF 133

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           +P S  +MR  +    E +   +  Y+  +  KYPYWNRT GADHF+V CH +G  A + 
Sbjct: 134 VPISPHKMR-GKGTTYENMTVIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGVRAFEG 192

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNK--LAFFAGAV 178
              +  N+I+VVCS SY +   I HKD++LPQ+  +    P+ G+   N+  L F+AG  
Sbjct: 193 LPFMVKNSIRVVCSPSYNVD-FIPHKDIALPQVL-QPFALPEGGNDIENRTILGFWAGHR 250

Query: 179 NSPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIADS 233
           NS +R  L +VW ND+E+   + R+        Y  +   +KFC+   G +VN+ARI+DS
Sbjct: 251 NSKIRVILARVWENDTELAISNNRISRAIGELVYQKQFYRTKFCICPGGSQVNSARISDS 310

Query: 234 LYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK 293
           ++YGCVPVI++++YDLPF D L+W+ F++++   D+  LK ILK IS EE++ L   +++
Sbjct: 311 IHYGCVPVILSDYYDLPFNDALDWRKFAVILRERDVYQLKNILKSISQEEFVSLHKSLVQ 370

Query: 294 VRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           V+KHF WH  P  YDAF+MVMYELWLR + ++
Sbjct: 371 VQKHFVWHSPPVSYDAFHMVMYELWLRHNVIK 402


>gi|212721192|ref|NP_001132213.1| exostosin-like protein [Zea mays]
 gi|194693780|gb|ACF80974.1| unknown [Zea mays]
 gi|413935957|gb|AFW70508.1| exostosin-like protein [Zea mays]
          Length = 403

 Score =  263 bits (672), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 141/332 (42%), Positives = 204/332 (61%), Gaps = 14/332 (4%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M RSF+VY+YP    DP      P      G YASE YF +   +S F T DP +A LFF
Sbjct: 78  MERSFKVYIYPD--GDPKTFYQTPRKLT--GKYASEGYFFQNIRESRFRTDDPDQAHLFF 133

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           +P S  +MR  +    E +   +  Y+  +  KYPYWNRT GADHF+V CH +G  A + 
Sbjct: 134 VPISPHKMR-GKGTSYENMTVIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGVRAFEG 192

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNK--LAFFAGAV 178
              +  N+I+VVCS SY +   I HKD++LPQ+  +    P+ G+   N+  L F+AG  
Sbjct: 193 LPFMVKNSIRVVCSPSYNVD-FIPHKDIALPQVL-QPFALPEGGNDIENRTTLGFWAGHR 250

Query: 179 NSPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIADS 233
           NS +R  L +VW ND+E+   + R+        Y  +   +KFC+   G +VN+ARI+DS
Sbjct: 251 NSKIRVILARVWENDTELAISNNRISRAIGELVYQKQFYRTKFCICPGGSQVNSARISDS 310

Query: 234 LYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK 293
           ++YGCVPVI++++YDLPF D L+W+ F++++   D+  LK ILK IS EE++ L   +++
Sbjct: 311 IHYGCVPVILSDYYDLPFNDALDWRKFAVILRERDVYQLKNILKSISQEEFVSLHKSLVQ 370

Query: 294 VRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           V+KHF WH  P  YDAF+M+MYELWLR + ++
Sbjct: 371 VQKHFVWHSPPVSYDAFHMIMYELWLRHNVIK 402


>gi|195624236|gb|ACG33948.1| exostosin-like [Zea mays]
          Length = 403

 Score =  261 bits (668), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 140/332 (42%), Positives = 202/332 (60%), Gaps = 14/332 (4%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M R F+VY+YP    DP      P      G YASE YF +   +S F T DP +A LFF
Sbjct: 78  MERXFKVYIYPD--GDPKTFYQTPRKLT--GKYASEGYFFQNIRESRFRTDDPDQAHLFF 133

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           +P S  +MR  +    E +   +  Y+  +  KYPYWNRT GADHF+V CH +G  A + 
Sbjct: 134 VPISPHKMR-GKGTSYENMTVIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGVRAFEG 192

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS--SKRNKLAFFAGAV 178
              +  N+I+VVCS SY +   I HKD++LPQ+  +    P+ G+    R  L F+AG  
Sbjct: 193 LPFMVKNSIRVVCSPSYNVD-FIPHKDIALPQVL-QPFALPEGGNDIENRTTLGFWAGHR 250

Query: 179 NSPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIADS 233
           NS +R  L +VW ND+E+   + R+        Y  +   +KFC+   G +VN+ARI+DS
Sbjct: 251 NSKIRVILARVWENDTELAISNNRISRAIGELVYQKQFYRTKFCICPGGSQVNSARISDS 310

Query: 234 LYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK 293
           ++YGCVPVI++++YDLPF D L+W+ F++++   D+  LK ILK IS EE++ L   +++
Sbjct: 311 IHYGCVPVILSDYYDLPFNDALDWRKFAVILRERDVYQLKNILKSISQEEFVSLHKSLVQ 370

Query: 294 VRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           V+KHF WH  P  YDAF+M+MYELWLR + ++
Sbjct: 371 VQKHFVWHSPPVSYDAFHMIMYELWLRHNVIK 402


>gi|226498050|ref|NP_001147269.1| exostosin-like [Zea mays]
 gi|195609318|gb|ACG26489.1| exostosin-like [Zea mays]
 gi|413926327|gb|AFW66259.1| exostosin-like protein [Zea mays]
          Length = 403

 Score =  261 bits (667), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 141/331 (42%), Positives = 200/331 (60%), Gaps = 12/331 (3%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M RSF+VY+YP    DP      P      G YASE YF +   +S F T DP +A LFF
Sbjct: 78  MERSFKVYIYPD--GDPKTFYQTPRKLT--GKYASEGYFFQNIRESRFRTDDPDQAHLFF 133

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           +P S  +MR  +    E +   +  Y+  +  KYPYWNRT GADHF+V CH +G  A + 
Sbjct: 134 VPISPHKMR-GKGTSYENMTVIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGVRAFEG 192

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW-PRQEDPPKLGSSKRNKLAFFAGAVN 179
              +  N+I+VVCS SY +   I HKD++LPQ+  P            R  L F+AG  N
Sbjct: 193 LKFMVKNSIRVVCSPSYNVD-FIPHKDIALPQVLQPFALHEGGNDIDNRVILGFWAGHRN 251

Query: 180 SPVREKLLQVWRNDSEIYAHSGRL-----KTPYADELLGSKFCLHVKGFEVNTARIADSL 234
           S +R  L +VW ND+E+   + R+     +  Y  +   +KFC+   G +VN+ARI+DS+
Sbjct: 252 SKIRVILARVWENDTELAISNNRISRAIGELVYQKQFYRTKFCICPGGSQVNSARISDSI 311

Query: 235 YYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
           +YGCVPVI++++YDLPF D L+W+ F++++   D+  LK ILK IS EE++ L   +++V
Sbjct: 312 HYGCVPVILSDYYDLPFNDALDWRKFAVILRERDVYQLKNILKSISQEEFISLHKSLVQV 371

Query: 295 RKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           +KHF WH  P  YDAF+MVMYELWLR + ++
Sbjct: 372 QKHFVWHSPPVSYDAFHMVMYELWLRHNVIK 402


>gi|125581086|gb|EAZ22017.1| hypothetical protein OsJ_05674 [Oryza sativa Japonica Group]
          Length = 408

 Score =  259 bits (662), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 145/338 (42%), Positives = 204/338 (60%), Gaps = 20/338 (5%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M RSF+VY+YP    DP      P      G YASE YF +   +S F T DP KA LFF
Sbjct: 77  MERSFKVYMYPD--GDPKTFYQTPRKLT--GKYASEGYFFQNIRESRFRTGDPDKAHLFF 132

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           +P S  +MR  +    E +   +  Y+  +  KYPYWNRT GADHF+V CH +G  A + 
Sbjct: 133 VPISPHKMR-GKGTSYENMTIIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGVRAFEG 191

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNK--LAFFAGAV 178
              +  N+I+VVCS SY +   I HKD++LPQ+  +    P+ G+   N+  L F+AG  
Sbjct: 192 LPFMVKNSIRVVCSPSYNVD-FIPHKDIALPQVL-QPFALPEGGNDVENRTILGFWAGHR 249

Query: 179 NSPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIADS 233
           NS +R  L QVW ND+E+   + R+        Y  +   +KFC+   G +VN+ARI+DS
Sbjct: 250 NSKIRVILAQVWENDTELAISNNRISRAIGELVYQKQFYHTKFCICPGGSQVNSARISDS 309

Query: 234 LYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVL- 292
           ++YGCVPVI++++YDLPF D+L+W+ F++V+   D+  LK ILK IS EE++ L   ++ 
Sbjct: 310 IHYGCVPVILSDYYDLPFNDILDWRKFAVVLKERDVYQLKSILKSISQEEFVELHKSLVQ 369

Query: 293 -----KVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
                +V+KHF WH  P  YDAF+MVMYELWLR   ++
Sbjct: 370 NISCPQVQKHFVWHSPPLPYDAFHMVMYELWLRHHVIK 407


>gi|363808264|ref|NP_001242494.1| uncharacterized protein LOC100777368 [Glycine max]
 gi|255635742|gb|ACU18220.1| unknown [Glycine max]
          Length = 409

 Score =  259 bits (662), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 137/331 (41%), Positives = 201/331 (60%), Gaps = 12/331 (3%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M + F+VY+YP    DP  N       +  G YASE YF +    S F T++P +A LFF
Sbjct: 84  MEKKFKVYIYPD--GDP--NTFYQTPRKLTGKYASEGYFFQNIRDSRFRTENPDEAHLFF 139

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           +P S  +MR  +    E +   + +Y+ ++  KYPYWNRT GADHF+V CH +G  A + 
Sbjct: 140 IPISCHKMR-GKGTSYENMTIIVQNYVESLISKYPYWNRTLGADHFFVTCHDVGVRATEG 198

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS-SKRNKLAFFAGAVN 179
              +  N+I+ VCS SY + G I HKDV+LPQ+      P        R  L F+AG  N
Sbjct: 199 LEFLVKNSIRAVCSPSYDV-GFIPHKDVALPQVLQPFALPAGGNDIENRTTLGFWAGHRN 257

Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIADSL 234
           S +R  L +VW ND+E+   + R+        Y      SKFC+   G +VN+ARIADS+
Sbjct: 258 SKIRVILARVWENDTELDISNNRISRATGHLVYQKRFYRSKFCICPGGSQVNSARIADSI 317

Query: 235 YYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
           +YGC+PV+++N+YDLPF D+L+W  F++V+   D+  LK+ILK IS  E++ L + ++KV
Sbjct: 318 HYGCIPVMLSNYYDLPFNDILDWNKFAVVLKESDVYQLKQILKNISDAEFVTLHNNLVKV 377

Query: 295 RKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           +KHFQW+     +DAF++VMY+LWLR  +++
Sbjct: 378 QKHFQWNSPSIRFDAFHLVMYDLWLRHHTIK 408


>gi|125538390|gb|EAY84785.1| hypothetical protein OsI_06153 [Oryza sativa Indica Group]
          Length = 332

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 145/338 (42%), Positives = 205/338 (60%), Gaps = 20/338 (5%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M RSF+VY+YP    DP      P      G YASE YF +   +S F T DP KA LFF
Sbjct: 1   MERSFKVYMYPD--GDPKTFYQTPRKLT--GKYASEGYFFQNIRESRFRTGDPDKAHLFF 56

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           +P S  +MR  +    E +   +  Y+  +  KYPYWNRT GADHF+V CH +G  A + 
Sbjct: 57  VPISPHKMR-GKGTSYENMTIIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGVRAFEG 115

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNK--LAFFAGAV 178
              +  N+I+VVCS SY +   I HKD++LPQ+  +    P+ G+   N+  L F+AG  
Sbjct: 116 LPFMVKNSIRVVCSPSYNVD-FIPHKDIALPQVL-QPFALPEGGNDVENRTILGFWAGHR 173

Query: 179 NSPVREKLLQVWRNDSEIYAHSGRL-----KTPYADELLGSKFCLHVKGFEVNTARIADS 233
           NS +R  L QVW ND+E+   + R+     +  Y  +   +KFC+   G +VN+ARI+DS
Sbjct: 174 NSKIRVILAQVWENDTELAISNNRISRAIGELVYQKQFYRTKFCICPGGSQVNSARISDS 233

Query: 234 LYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVL- 292
           ++YGCVPVI++++YDLPF D+L+W+ F++V+   D+  LK ILK IS EE++ L   ++ 
Sbjct: 234 IHYGCVPVILSDYYDLPFNDILDWRKFAVVLKERDVYQLKSILKSISQEEFVELHKSLVQ 293

Query: 293 -----KVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
                +V+KHF WH  P  YDAF+MVMYELWLR   ++
Sbjct: 294 NISCPQVQKHFVWHSPPLPYDAFHMVMYELWLRHHVIK 331


>gi|358343604|ref|XP_003635890.1| Exostosin-like protein [Medicago truncatula]
 gi|358344094|ref|XP_003636128.1| Exostosin-like protein [Medicago truncatula]
 gi|355501825|gb|AES83028.1| Exostosin-like protein [Medicago truncatula]
 gi|355502063|gb|AES83266.1| Exostosin-like protein [Medicago truncatula]
          Length = 435

 Score =  253 bits (647), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 137/329 (41%), Positives = 199/329 (60%), Gaps = 24/329 (7%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPR---GNYASESYFKKAFMKSHFVTKDPSKAD 57
           M + F+VY+YP   +  F          PR   G YASE YF +   +S F T DP +A 
Sbjct: 126 MEKKFKVYIYPDGDSKTFYQT-------PRKLTGKYASEGYFFQNIRESRFRTLDPDQAH 178

Query: 58  LFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSA 117
           LFF+P S  +MR  +    + +   + +Y+ ++  KYPYWNRT GADHF+V CH +G  A
Sbjct: 179 LFFIPISCHKMR-GKGTSYDNMTIIVQNYVESLISKYPYWNRTLGADHFFVTCHDVGVRA 237

Query: 118 MQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS-SKRNKLAFFAG 176
            +    +  N+I+ VCS SY + G I HKDV+LPQ+      P        R  L F+AG
Sbjct: 238 TEGLPLLVKNSIRAVCSPSYDV-GFIPHKDVALPQVLQPFALPAGGNDVENRTSLGFWAG 296

Query: 177 AVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYY 236
             NS +R  L +VW ND+E+   + R+ T         KFC+   G +VN+ARIADS++Y
Sbjct: 297 HRNSKIRVILARVWENDTELDISNNRIYT---------KFCICPGGSQVNSARIADSIHY 347

Query: 237 GCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRK 296
           GC+PVI++N+YDLPF D+L+W+ F++V    D+  LK+ILK  S +E++ L + +  ++K
Sbjct: 348 GCIPVILSNYYDLPFNDILDWRKFAVVHNESDVYQLKQILKNKSQDEFIALHNNL--IQK 405

Query: 297 HFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           HFQW+  P  YDAF+MVMY+LWLR  +++
Sbjct: 406 HFQWNSPPVRYDAFHMVMYDLWLRHHTIQ 434


>gi|302823230|ref|XP_002993269.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300138939|gb|EFJ05690.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 353

 Score =  252 bits (643), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 142/335 (42%), Positives = 197/335 (58%), Gaps = 33/335 (9%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M +  +V+VYP    DP     L       G YASE YF +  M+S FVT DP KA LFF
Sbjct: 41  MEQQLKVFVYP----DPVVYTKLA------GKYASEGYFFRNLMESRFVTTDPEKAQLFF 90

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           +P S AR+R +  +  + I D ++ ++ ++  K+PYWNRT GADHF+V CH IG  A  K
Sbjct: 91  VPISCARLR-EEGLDHDEISDNVASFVESVIAKFPYWNRTMGADHFFVTCHEIGTRATSK 149

Query: 121 AWEVKLNAIQVVCSSSYFISG-HIAHKDVSLPQIW-----PRQEDPPKLGSSKRNKLAFF 174
              +  N+I+VVC+SSY  SG  I HKDV+LPQI      PR  D     + KR  L F+
Sbjct: 150 VAHLVKNSIRVVCASSY--SGPFIPHKDVALPQILQPFPSPRGGD----DTEKRETLGFW 203

Query: 175 AGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSL 234
           AG  NS  R  L + W+ DS++   +  +      +   SKFC+   G  V+TARI +S+
Sbjct: 204 AGPANSKTRILLTKTWQEDSDMVISTKHVGM---QQFYRSKFCICPSGTRVSTARIVESI 260

Query: 235 YYGCVP----VIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSY 290
           ++GCVP    +I+++HYDLPF DVL+W+ F++++   D   LK  L+      Y  L   
Sbjct: 261 HFGCVPGKSFLILSDHYDLPFNDVLDWRKFAVILPEQDAGTLKDALE---LAPYATLHRN 317

Query: 291 VLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           +L+ + HF+WH  P  YD F+MVMYELWLR S+VR
Sbjct: 318 LLQAQAHFEWHSPPIKYDTFHMVMYELWLRHSTVR 352


>gi|302820946|ref|XP_002992138.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300140064|gb|EFJ06793.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 342

 Score =  251 bits (640), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 140/345 (40%), Positives = 205/345 (59%), Gaps = 26/345 (7%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M + F++++YP    DP  N       +  G YASE YF +   +S FVTK+P+KA LFF
Sbjct: 1   MLKRFKIFIYPD--GDP--NTYYQTPRKITGKYASEGYFFQNLRESKFVTKNPNKAHLFF 56

Query: 61  LPFSIARMRHD------------RRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYV 108
           +P S  +MR              + I  E + D +  Y+  +  KYPYWNRT GADHF+V
Sbjct: 57  IPISCHKMRGKVPYYLTSNWNKMQGISYEKMADIVQEYVEGLIVKYPYWNRTLGADHFFV 116

Query: 109 ACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKR 168
            CH +G  A  K   +  N+I+VVCS SY     I HKD+++PQ+  +    P+ G+  R
Sbjct: 117 TCHDVGARATNKVANLVKNSIRVVCSPSYN-GDFIPHKDIAMPQVL-QPFALPRGGNDVR 174

Query: 169 NK--LAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL-----KTPYADELLGSKFCLHVK 221
           N+  L F+AG  NS +R  L ++W  D  +   + R+     +  Y  +   SKFC+   
Sbjct: 175 NRTILGFWAGHRNSKIRVVLAKLWEEDDVLAISNNRISRATGELVYQKQFYRSKFCICPG 234

Query: 222 GFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISS 281
           G +VN+ARI DS++YGCVPVI+++HYDLPF DVL+WK F++++   D+  LK  L+ +S 
Sbjct: 235 GSQVNSARIVDSIHYGCVPVILSDHYDLPFNDVLDWKRFALLLRERDVGDLKLKLQSVSK 294

Query: 282 EEYLLLQSYVLK-VRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           E+YL L   +++ V+  F+WH  P  YDAF+MV+YELWLR  +V+
Sbjct: 295 EQYLSLHRGLVEVVQDRFEWHTPPRPYDAFHMVVYELWLRHFTVQ 339


>gi|302824287|ref|XP_002993788.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300138384|gb|EFJ05154.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 353

 Score =  249 bits (635), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 139/335 (41%), Positives = 197/335 (58%), Gaps = 33/335 (9%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M +  +V+VYP    DP     L       G YASE YF +  M+S FVT DP +A LFF
Sbjct: 41  MEQQLKVFVYP----DPVVYTKLA------GKYASEGYFFRNLMESRFVTTDPEEAQLFF 90

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           +P S AR++ +  +  + I D ++ ++ ++  K+PYWNRT GADHF+V CH IG  A  K
Sbjct: 91  VPISCARLK-EEGLDHDEISDNVASFVESVIAKFPYWNRTMGADHFFVTCHEIGTRATSK 149

Query: 121 AWEVKLNAIQVVCSSSYFISG-HIAHKDVSLPQIW-----PRQEDPPKLGSSKRNKLAFF 174
              +  N+I+VVC+SSY  SG  I HKDV+LPQI      PR  D     + KR  L F+
Sbjct: 150 VAHLVKNSIRVVCASSY--SGPFIPHKDVALPQILQPFPSPRGGD----DTEKRETLGFW 203

Query: 175 AGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSL 234
           AG  NS  R  L + W+ DS++   +  +      +   SKFC+   G  V+TARI +S+
Sbjct: 204 AGPANSKTRILLTKTWQEDSDMVISTKHVGM---QQFYRSKFCICPSGTRVSTARIVESI 260

Query: 235 YYGCVP----VIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSY 290
           ++GCVP    +I+++HYDLPF DVL+W+ F++++   D   LK  L+      Y  L   
Sbjct: 261 HFGCVPGKSYLILSDHYDLPFNDVLDWRKFAVILPEQDAGTLKDALE---LAPYATLHRN 317

Query: 291 VLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           +L+ + HF+WH  P  YD F+MVMYELWLR S++R
Sbjct: 318 LLQAQAHFEWHSPPIKYDTFHMVMYELWLRHSTIR 352


>gi|302790752|ref|XP_002977143.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300155119|gb|EFJ21752.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 342

 Score =  239 bits (610), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 136/345 (39%), Positives = 198/345 (57%), Gaps = 26/345 (7%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M + F++++YP    DP  N       +  G YASE YF +   +S FVTK+P+KA LFF
Sbjct: 1   MLKRFKIFIYPD--GDP--NTYYQTPRKITGKYASEGYFFQNLRESKFVTKNPNKAHLFF 56

Query: 61  LPFSIARMRHD------------RRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYV 108
           +P S  +MR              + I  E + D +  Y+  +  KYPYWNRT GADHF+V
Sbjct: 57  IPISCHKMRGKVPYYLTSNWNKMQGISYEKMADIVQEYVEGLIVKYPYWNRTLGADHFFV 116

Query: 109 ACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKR 168
            CH +G  A  K   +  N+I+VVCS SY     I HKD+++PQ+  +    P+ G+  R
Sbjct: 117 TCHDVGARATNKVANLVKNSIRVVCSPSYN-GDFIPHKDIAMPQVL-QPFALPRGGNDVR 174

Query: 169 NK--LAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL-----KTPYADELLGSKFCLHVK 221
           N+  L F+AG  NS +R  L ++W  D  +   + R+     +  Y  +   SKFC+   
Sbjct: 175 NRTILGFWAGHRNSKIRVVLAKLWEEDDVLAISNNRISRATGELVYQKQFYRSKFCICPG 234

Query: 222 GFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISS 281
           G +VN+ARI DS++YGCVPVI+++HYDLPF DVL+WK F++++   D+  LK        
Sbjct: 235 GSQVNSARIVDSIHYGCVPVILSDHYDLPFNDVLDWKRFALLLRERDVGDLKLFFFSFFL 294

Query: 282 EEY-LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
               +L+ S   +V+  F+WH  P  YDAF+MV+YELWLR  +V+
Sbjct: 295 FSSRVLIASLFRQVQDRFEWHTPPRPYDAFHMVVYELWLRHFTVQ 339


>gi|224093268|ref|XP_002309859.1| predicted protein [Populus trichocarpa]
 gi|222852762|gb|EEE90309.1| predicted protein [Populus trichocarpa]
          Length = 420

 Score =  236 bits (602), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 139/354 (39%), Positives = 197/354 (55%), Gaps = 45/354 (12%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGN----YASESYFKKAFMKSHFVTKDPSKA 56
           M   F+V+VYP R  DP         ++PR      YASE YF K  + S F T DP+ A
Sbjct: 82  MENDFKVFVYPGR--DPTTC------YDPRDKLKRKYASEHYFLKNLIPSSFFTDDPTVA 133

Query: 57  DLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRS 116
            LF +P S       ++ G     D I +Y+ ++   YPYWNRT GADHFY +CH I   
Sbjct: 134 HLFLIPLSC------KKTGGREEKD-IENYVKSLISSYPYWNRTLGADHFYFSCHGIDSG 186

Query: 117 AMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ-----------IWPR---QEDP-- 160
            +++   +  N I++VCS SY  S +I HKD+SLPQ           +W R      P  
Sbjct: 187 TIEEVPLLMKNVIRLVCSPSY-DSKYIPHKDISLPQTLELSLHDGDDVWSRSTVMSRPLM 245

Query: 161 --PKLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAH------SGRLKTPYADELL 212
             P++   +R KL F+AG++NS VR+ L   ++   E   H         +   Y +EL 
Sbjct: 246 IYPEMMLPRRTKLGFWAGSLNSDVRKNLQVFYKGAPEFNFHFFDKMKKAAILDAYENELY 305

Query: 213 GSKFCLHVKGFE-VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPL 271
           GSKFC+  +G   V +  + +S+ +GCVPVI+ ++YD PF DVL+W +FS+++    +P 
Sbjct: 306 GSKFCICPRGNNHVGSVCLTESMTFGCVPVILHDYYDFPFNDVLDWNNFSVILKEEHVPD 365

Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           L+KILKGI  E Y  +   +L+VRKHFQW+  P  YD F M+MYELWLRR  ++
Sbjct: 366 LEKILKGIPEENYKKMHQNLLQVRKHFQWNSLPVKYDLFRMIMYELWLRRHIIK 419


>gi|413947133|gb|AFW79782.1| putative homeodomain-like transcription factor superfamily protein
           [Zea mays]
          Length = 544

 Score =  233 bits (594), Expect = 8e-59,   Method: Compositional matrix adjust.
 Identities = 138/341 (40%), Positives = 196/341 (57%), Gaps = 30/341 (8%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAF-MKSHFVTKDPSKADLF 59
           M R  +V++Y H    P  +       E +G YASE +F K      +FV +DP+ A LF
Sbjct: 209 MERLLKVFIY-HDGAKPIFH-----SPELKGIYASEGWFMKLMETNQNFVVRDPNTAHLF 262

Query: 60  FLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
           +LP+S  ++ H+  +      E +  F+ +YI  IS KYPYWNRT GADHF+VACH  G 
Sbjct: 263 YLPYSSRQLEHNLYVPGSNTIEPLSIFVKNYIDLISAKYPYWNRTKGADHFFVACHDWGP 322

Query: 116 SAMQKAWEVKLNAIQVVC----SSSYFISGHIAHKDVSLPQIWPRQEDPP--KLG---SS 166
              +   E++ N I+ +C    S  +FI G    KDVSLP+ + R    P   +G   ++
Sbjct: 323 YTTKLHDELRKNTIKALCNADLSEGFFIRG----KDVSLPETFLRSPRRPLRDIGGRPAA 378

Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYA---HSGRLKTPYADELLGSKFCLHV 220
           +R  LAFFAG ++  VR  LL+ W N   D  IY+   H    K  Y   +  SK+C+  
Sbjct: 379 QRTILAFFAGQMHGRVRPVLLKYWGNKDDDMRIYSRLPHRITRKRNYVQHMKSSKYCICP 438

Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
            G+EVN+ RI +++YY CVPVIIA+H+ LPF   LNW +FS+VV   D+P LK+IL  I 
Sbjct: 439 MGYEVNSPRIVEAIYYECVPVIIADHFVLPFDAALNWSTFSVVVPESDVPKLKQILLAIP 498

Query: 281 SEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
              Y+ LQ+ V +V+KHF WH  P  YD F+M+++ +W  R
Sbjct: 499 ESRYITLQANVKRVQKHFMWHPNPVKYDIFHMILHSVWFSR 539


>gi|115434052|ref|NP_001041784.1| Os01g0107700 [Oryza sativa Japonica Group]
 gi|52076197|dbj|BAD44851.1| pectin-glucuronyltransferase -like [Oryza sativa Japonica Group]
 gi|52076236|dbj|BAD44890.1| pectin-glucuronyltransferase -like [Oryza sativa Japonica Group]
 gi|113531315|dbj|BAF03698.1| Os01g0107700 [Oryza sativa Japonica Group]
 gi|215697737|dbj|BAG91731.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222617591|gb|EEE53723.1| hypothetical protein OsJ_00064 [Oryza sativa Japonica Group]
          Length = 550

 Score =  233 bits (594), Expect = 9e-59,   Method: Compositional matrix adjust.
 Identities = 133/337 (39%), Positives = 195/337 (57%), Gaps = 22/337 (6%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFM-KSHFVTKDPSKADLF 59
           M R  +V+VY H    P  +       E +G YASE +F K      HFV +DP++A LF
Sbjct: 213 MERLLKVFVY-HDGAKPIFH-----SPELKGIYASEGWFMKLMEGNQHFVVRDPNRAHLF 266

Query: 60  FLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
           +LP+S  ++ H+  +      E +  F+  YI  IS K+PYWNRT GADHF+VACH  G 
Sbjct: 267 YLPYSSRQLEHNLYVPGSNTIEPLSIFVKKYIDFISTKFPYWNRTKGADHFFVACHDWGP 326

Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLG-----SSKRNK 170
              +   E++ N I+ +C++       I  +DVSLP+ + R    P  G     +++R+ 
Sbjct: 327 YTTKLHDELRKNTIKALCNADLSEGVFIHGRDVSLPETFLRSPRRPLRGIGGKPAAERSI 386

Query: 171 LAFFAGAVNSPVREKLLQVWRN---DSEIY---AHSGRLKTPYADELLGSKFCLHVKGFE 224
           LAFFAG ++  VR  LLQ W     D  IY    H    +  Y   +  SK+C+   G+E
Sbjct: 387 LAFFAGQMHGRVRPVLLQYWGGKDADMRIYDRLPHRITRRMNYIQHMKSSKYCICPMGYE 446

Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
           VN+ RI +++YY CVPVIIA+++ LPF D LNW +FS+V+   D+P LK+IL  I  ++Y
Sbjct: 447 VNSPRIVEAIYYECVPVIIADNFVLPFDDALNWSAFSVVIPEKDVPKLKQILLAIPDDQY 506

Query: 285 LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
           + +QS V +V+KHF WH  P  YD F+M+++ +W  R
Sbjct: 507 MAMQSNVQRVQKHFIWHPNPIKYDIFHMILHSIWYSR 543


>gi|224093262|ref|XP_002309856.1| predicted protein [Populus trichocarpa]
 gi|222852759|gb|EEE90306.1| predicted protein [Populus trichocarpa]
          Length = 326

 Score =  232 bits (592), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 128/329 (38%), Positives = 191/329 (58%), Gaps = 19/329 (5%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGN--YASESYFKKAFMKSHFVTKDPSKADL 58
           M R F+V+VY  R      N+    D   + N  Y SE YF      S F+T D ++A L
Sbjct: 1   MEREFKVFVYQDR------NITKHCDLPSKHNSRYESEEYFFSNLKMSPFLTDDAAEAHL 54

Query: 59  FFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAM 118
           FF+P    +M   R      I   +  ++ ++  KYPYWNRT GADHF+V C  I  +A 
Sbjct: 55  FFIPIFSQKMTKKRSEDERAIA--VEDFVKSLISKYPYWNRTLGADHFFVTCADINVTAT 112

Query: 119 QKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ-IWPRQEDPPKLGSSKRNKLAFFAGA 177
            +   +  N+I+V+C+ SY    ++ HKDVSLPQ + P    P     + R  LAF+ G 
Sbjct: 113 ARIANLMKNSIKVMCTPSY-NDEYVPHKDVSLPQRVPPLALTPAGNNITNRITLAFWRGL 171

Query: 178 VNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELL------GSKFCLHVKGFEVNTARIA 231
            NS +R+KLL+ W ND E++   GR  +    +L+       SK+C+   G E++   IA
Sbjct: 172 NNSDIRQKLLEAWENDLELFIQKGRKPSLEQGDLVHHEAFNNSKYCICPGGPELDRT-IA 230

Query: 232 DSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYV 291
            +++YGCVPVI++++YDLPF D+L+W+ FSI++    +  L++ LK +   EY  +Q+  
Sbjct: 231 LAIHYGCVPVIMSDYYDLPFKDILDWRKFSIILEESQVYYLREHLKEMLEHEYRAMQTNT 290

Query: 292 LKVRKHFQWHVFPSDYDAFYMVMYELWLR 320
           + VRKHFQW++ P+ YDAF+M MY+LWLR
Sbjct: 291 VMVRKHFQWNLVPAKYDAFHMTMYDLWLR 319


>gi|102139904|gb|ABF70053.1| exostosin family protein [Musa acuminata]
          Length = 585

 Score =  231 bits (588), Expect = 5e-58,   Method: Compositional matrix adjust.
 Identities = 124/307 (40%), Positives = 184/307 (59%), Gaps = 16/307 (5%)

Query: 31  GNYASESYFKKAFMK-SHFVTKDPSKADLFFLPFSIARMRHDRRIG----TEGIPDFISH 85
           G YASE +F K   + + FV KDP+KA LF+LP+S  ++R    +      + +  F+  
Sbjct: 273 GIYASEGWFMKLMEENTQFVVKDPNKAHLFYLPYSSRQLRTHLYVAGSRSMQPLSIFLRD 332

Query: 86  YIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH 145
           Y+ +I+ KYP+WNRT GADHF VACH           +++ N I+VVC++       +  
Sbjct: 333 YVNSIAAKYPFWNRTRGADHFLVACHDWATYTTNLHEDLRKNTIKVVCNADVSEGVFVRG 392

Query: 146 KDVSLPQIWPRQEDPPKLG-----SSKRNKLAFFAGAVNSPVREKLLQVWRNDS------ 194
           KDVSL + + R  + P+       +S+R+ LAFFAG ++  VR  LL+ WR         
Sbjct: 393 KDVSLAETYVRTPNSPRKAIGGRPASRRSILAFFAGQMHGRVRPILLRHWRGRDRDMRIY 452

Query: 195 EIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
           E+       K  Y + +  SKFC+   G+EVN+ RI +++YY CVPVIIAN++ LPF +V
Sbjct: 453 EVLPDEIAAKMSYIEHMKSSKFCICPMGYEVNSPRIVEAIYYDCVPVIIANNFVLPFEEV 512

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
           L+W +FS+VVA  DIP LK+IL GIS   Y+ +Q+ V ++RKHF W+  P  YD F+M++
Sbjct: 513 LDWGAFSVVVAEKDIPKLKQILLGISGRRYVRMQTNVRRLRKHFLWNDKPVKYDLFHMIL 572

Query: 315 YELWLRR 321
           + +W  R
Sbjct: 573 HSIWFNR 579


>gi|102140005|gb|ABF70140.1| exostosin family protein [Musa balbisiana]
          Length = 533

 Score =  230 bits (587), Expect = 5e-58,   Method: Compositional matrix adjust.
 Identities = 135/339 (39%), Positives = 195/339 (57%), Gaps = 26/339 (7%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPR--GNYASESYFKKAFMK-SHFVTKDPSKAD 57
           M R  +VY+Y    + P      P+   P   G YASE +F K   + + FV  DP+KA 
Sbjct: 197 MERILKVYIY---EDGP-----KPLCHTPHLDGIYASEGWFMKLMEENTQFVVNDPNKAH 248

Query: 58  LFFLPFSIARMR-HDRRIGTEGIPD---FISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
           LF+LP+S  ++R H    G+  +     F+  Y+ +IS KYP+WNRT GADHF VACH  
Sbjct: 249 LFYLPYSSRQLRTHLYVAGSRSMRPLSIFLRDYVNSISAKYPFWNRTRGADHFLVACHDW 308

Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLG-----SSKR 168
                    E++ N I+VVC++       +  KDVSL + + R  + P+       +S+R
Sbjct: 309 ATYTTNLHEELRKNTIKVVCNADVSEGVFVRGKDVSLAETYVRTPNSPRKAIGGRPASRR 368

Query: 169 NKLAFFAGAVNSPVREKLLQVWRNDS------EIYAHSGRLKTPYADELLGSKFCLHVKG 222
           + LAFFAG ++  VR  LL+ WR         E+       K  Y + +  SKFC+   G
Sbjct: 369 SILAFFAGQMHGRVRPILLRYWRGRDRDMRIYEVLPDEIAAKMSYIEHMKSSKFCICPMG 428

Query: 223 FEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSE 282
           +EVN+ RI +++YY CVPVIIAN++ LPF +VL+W +FS+VVA  DIP LK+IL GIS  
Sbjct: 429 YEVNSPRIVEAIYYDCVPVIIANNFVLPFEEVLDWGAFSVVVAEKDIPKLKQILLGISGR 488

Query: 283 EYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
            Y+ +Q  V ++RKHF W+  P  YD F+M+++ +W  R
Sbjct: 489 RYVRMQRNVRRLRKHFLWNDKPVKYDLFHMILHSIWFNR 527


>gi|218187361|gb|EEC69788.1| hypothetical protein OsI_00072 [Oryza sativa Indica Group]
          Length = 536

 Score =  230 bits (587), Expect = 6e-58,   Method: Compositional matrix adjust.
 Identities = 132/337 (39%), Positives = 194/337 (57%), Gaps = 22/337 (6%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFM-KSHFVTKDPSKADLF 59
           M R  +V+VY H    P  +       E +G YASE +F K      HFV +DP++A LF
Sbjct: 199 MERLLKVFVY-HDGAKPIFH-----SPELKGIYASEGWFMKLMEGNQHFVVRDPNRAHLF 252

Query: 60  FLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
           +LP+S  ++ H+  +      E +  F+  YI  IS K+PYWNRT GADHF+VACH  G 
Sbjct: 253 YLPYSSRQLEHNLYVPGSNTIEPLSIFVKKYIDFISTKFPYWNRTKGADHFFVACHDWGP 312

Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLG-----SSKRNK 170
              +   E++ N I+ +C++       I  +DVSLP+ + R    P  G     +++R+ 
Sbjct: 313 YTTKLHDELRKNTIKALCNADLSEGVFIHGRDVSLPETFLRSPRRPLRGIGGKPAAERSI 372

Query: 171 LAFFAGAVNSPVREKLLQVWRN---DSEIY---AHSGRLKTPYADELLGSKFCLHVKGFE 224
           LAFFAG ++  VR  LLQ W     D  IY    H    +  Y   +  SK+C+   G+E
Sbjct: 373 LAFFAGQMHGRVRPVLLQYWGGKDADMRIYDRLPHRITRRMNYIQHMKSSKYCICPMGYE 432

Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
           VN+ RI +++YY CVPVIIA+++ LPF D  NW +FS+V+   D+P LK+IL  I  ++Y
Sbjct: 433 VNSPRIVEAIYYECVPVIIADNFVLPFDDAFNWSAFSVVILEKDVPKLKQILLEIPDDQY 492

Query: 285 LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
           + +QS V +V+KHF WH  P  YD F+M+++ +W  R
Sbjct: 493 MAMQSNVQRVQKHFIWHPNPIKYDIFHMILHSIWYSR 529


>gi|224068931|ref|XP_002326234.1| predicted protein [Populus trichocarpa]
 gi|222833427|gb|EEE71904.1| predicted protein [Populus trichocarpa]
          Length = 328

 Score =  229 bits (584), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 139/336 (41%), Positives = 194/336 (57%), Gaps = 20/336 (5%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M + F+V+VY       F N      +   GN+       +  +  HF TKDP KA ++F
Sbjct: 1   MEKQFKVFVYGEGEPPVFHNGPCRSIYSMEGNF-----IHRMEIDGHFRTKDPDKAHVYF 55

Query: 61  LPFSIARM------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
           LPFS+A M      R  R  G   I   +S YI  IS KYP+WNR+ GADHF +ACH  G
Sbjct: 56  LPFSVAMMVRFVYERESRDFGP--IRRTVSDYINLISGKYPFWNRSLGADHFMLACHDWG 113

Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPK----LGSSKRNK 170
             A      +   +I+ +C+++     +   KDVSLP+I  R          L  SKR+ 
Sbjct: 114 PEASFSVPHLGKISIRALCNANTSEKFNPI-KDVSLPEINLRTGSIKGFVGGLSPSKRSI 172

Query: 171 LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEVNTA 228
           LAFFAG ++ P+R  +L+ W N D +I  H    K   Y + + GSKFCL   G+EV + 
Sbjct: 173 LAFFAGRLHGPIRPVVLEHWENKDDDIKVHQQLPKGVSYYEMMRGSKFCLCPSGYEVASP 232

Query: 229 RIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
           RI ++LY GCVPV+I++HY  PF+DVLNWKSFS+ V   DIP LKKIL  IS  +Y+ +Q
Sbjct: 233 RIVEALYAGCVPVLISDHYVPPFSDVLNWKSFSVEVPVSDIPSLKKILTSISPRQYIRMQ 292

Query: 289 SYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSV 324
             VL+VR+HF+ +  P  +D F+M+++ +WLRR +V
Sbjct: 293 RRVLQVRRHFEVNSPPKRFDVFHMILHSIWLRRLNV 328


>gi|357125316|ref|XP_003564340.1| PREDICTED: probable glycosyltransferase At5g03795-like
           [Brachypodium distachyon]
          Length = 553

 Score =  229 bits (584), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 132/337 (39%), Positives = 195/337 (57%), Gaps = 22/337 (6%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFM-KSHFVTKDPSKADLF 59
           M R  +V++Y H    P  +       E +G YASE +F K      +FV +DP++A LF
Sbjct: 216 MERLLKVFIY-HDGAKPIFH-----SPELKGIYASEGWFMKLIEGNQNFVVRDPNRAHLF 269

Query: 60  FLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
           +LP+S  ++ H+  +      E +  F+ +YI  IS K+PYWNRT GADHF+VACH  G 
Sbjct: 270 YLPYSSRQLEHNLYVPGSNTLEPLSIFVKNYIDMISAKFPYWNRTKGADHFFVACHDWGP 329

Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLG---SSKRNK 170
              +   E++ N I+ +C++       I  +DVSLP+ + R    P   +G   +++R+ 
Sbjct: 330 YTTKLHDELRRNTIKALCNADLSEGVFIRGRDVSLPETFVRSPRRPLRDIGGKPATERSI 389

Query: 171 LAFFAGAVNSPVREKLLQVW---RNDSEIYAHSGRLKT---PYADELLGSKFCLHVKGFE 224
           LAFFAG ++  VR  LLQ W     D  IY    R  T    Y   +  SK+C+   G+E
Sbjct: 390 LAFFAGQMHGRVRPILLQYWGGKDTDMRIYGRLPRRITRRMNYVQHMKSSKYCICPMGYE 449

Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
           VN+ RI +++YY C+PVIIA+++ LPF D L+W +FS+VV   D+P LK+IL  I    Y
Sbjct: 450 VNSPRIVEAIYYECIPVIIADNFVLPFDDALDWSTFSVVVPEKDVPRLKEILLRIPESRY 509

Query: 285 LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
           + +QS V KV+KHF WH  P  YD F+M+++ +W  R
Sbjct: 510 ITMQSNVKKVQKHFLWHAKPVKYDIFHMILHSVWFSR 546


>gi|242056743|ref|XP_002457517.1| hypothetical protein SORBIDRAFT_03g008610 [Sorghum bicolor]
 gi|241929492|gb|EES02637.1| hypothetical protein SORBIDRAFT_03g008610 [Sorghum bicolor]
          Length = 521

 Score =  229 bits (583), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 133/337 (39%), Positives = 195/337 (57%), Gaps = 22/337 (6%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAF-MKSHFVTKDPSKADLF 59
           M R  +V++Y H    P  +       E +G YASE +F +      +FV +DP++A LF
Sbjct: 186 MERLLKVFIY-HDGAKPIFH-----SPELKGIYASEGWFMRLMETNQNFVVRDPNRAHLF 239

Query: 60  FLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
           +LP+S  ++ H+  +      E +  F+ +YI  IS KYPYWNRT GADHF+VACH  G 
Sbjct: 240 YLPYSSRQLEHNLYVPGSNSIEPLSIFVKNYIDLISAKYPYWNRTKGADHFFVACHDWGP 299

Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLG---SSKRNK 170
              +   E++ N I+ +C++       I  KDVSLP+ + R    P   +G   +++R  
Sbjct: 300 YTTKLHDELRKNTIKALCNADLSEGIFIRGKDVSLPETFLRSPRRPLRDIGGRPAAQRTI 359

Query: 171 LAFFAGAVNSPVREKLLQVWRN---DSEIYA---HSGRLKTPYADELLGSKFCLHVKGFE 224
           LAFFAG ++  VR  LL+ W +   D  IY+   H    K  Y   +  SK+C+   G+E
Sbjct: 360 LAFFAGQMHGRVRPVLLKYWGDKDADMRIYSRLPHRITRKRNYVQHMKSSKYCICPMGYE 419

Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
           VN+ RI +++YY CVPVIIA+++ LPF   LNW +FS+VV   D+P LK+IL  I    Y
Sbjct: 420 VNSPRIVEAIYYECVPVIIADNFVLPFDAALNWSAFSVVVPESDVPKLKEILLAIPESRY 479

Query: 285 LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
           + LQS V +V+KHF WH  P  YD F+M+++ +W  R
Sbjct: 480 ITLQSNVKRVQKHFLWHPNPVKYDIFHMILHSVWFSR 516


>gi|30683790|ref|NP_567512.2| Exostosin family protein [Arabidopsis thaliana]
 gi|19347795|gb|AAL86348.1| unknown protein [Arabidopsis thaliana]
 gi|26983908|gb|AAN86206.1| unknown protein [Arabidopsis thaliana]
 gi|332658395|gb|AEE83795.1| Exostosin family protein [Arabidopsis thaliana]
          Length = 542

 Score =  229 bits (583), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 140/353 (39%), Positives = 206/353 (58%), Gaps = 32/353 (9%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEP--RGNYASESYFKKAFMKSH--FVTKDPSKA 56
           M    +VY+YP            P+  EP   G YASE +F K  M+S+  FVTK+P +A
Sbjct: 196 MELILKVYIYPDGDK--------PIFHEPHLNGIYASEGWFMK-LMESNKQFVTKNPERA 246

Query: 57  DLFFLPFSIARMRHDRRI-GTEGIPD---FISHYIFNISQKYPYWNRTGGADHFYVACHS 112
            LF++P+S+ +++    + G+  I     F+  Y+  +S KYP+WNRT G+DHF VACH 
Sbjct: 247 HLFYMPYSVKQLQKSIFVPGSHNIKPLSIFLRDYVNMLSIKYPFWNRTHGSDHFLVACHD 306

Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS----S 166
            G   + +  E+K NAI+ +C++       +  KDVSLP+   R    P   +G+    S
Sbjct: 307 WGPYTVNEHPELKRNAIKALCNADLSDGIFVPGKDVSLPETSIRNAGRPLRNIGNGNRVS 366

Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYA---HSGRLKTPYADELLGSKFCLHV 220
           +R  LAFFAG ++  VR KLL+ WRN   D +IY    H+   K  Y   +  SK+CL  
Sbjct: 367 QRPILAFFAGNLHGRVRPKLLKHWRNKDEDMKIYGPLPHNVARKMTYVQHMKSSKYCLCP 426

Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
            G+EVN+ RI +++YY CVPV+IA+++ LPF+DVL+W +FS+VV   +IP LK+IL  I 
Sbjct: 427 MGYEVNSPRIVEAIYYECVPVVIADNFMLPFSDVLDWSAFSVVVPEKEIPRLKEILLEIP 486

Query: 281 SEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWSTSLD 333
              YL +QS V  V++HF W   P  YD F+M+++ +W    ++  Q  TS D
Sbjct: 487 MRRYLKMQSNVKMVQRHFLWSPKPRKYDVFHMILHSIWF---NLLNQNQTSFD 536


>gi|302768010|ref|XP_002967425.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300165416|gb|EFJ32024.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 372

 Score =  228 bits (581), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 132/337 (39%), Positives = 197/337 (58%), Gaps = 23/337 (6%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYF-KKAFMKSHFVTKDPSKADLF 59
           M R F+VYVY  +  +P      P+     G Y+SE  F  +    S FVT DP +A +F
Sbjct: 28  MERLFKVYVY--KEGEPRLVHKGPLT----GIYSSEGRFIHEMNQNSRFVTHDPQEAHMF 81

Query: 60  FLPFSIARMRHDRRI-GTEG---IPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
           FLP+S+A M  D  + G+     +  FI  Y+  I+ K+P+WN T G+DHF+ +CH  G 
Sbjct: 82  FLPYSVAHMVLDLYVPGSHSMLPLATFIKDYVNLIASKHPFWNLTRGSDHFFASCHDWGP 141

Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQ-EDPPKLGS---SKRNKL 171
           +  +   E++ N+++VVC+S       +  KD SLP+ +    + P KLG    SKR  L
Sbjct: 142 ATARDHPELRKNSVKVVCNSD-LTEEFVPDKDASLPETYLHAVKLPTKLGGPGPSKRPIL 200

Query: 172 AFFAGAVNSPVREKLLQVWRNDS-------EIYAHSGRLKTPYADELLGSKFCLHVKGFE 224
           AFFAG ++  VR  L++ W++         E+       +T Y   +  SKFC+   GFE
Sbjct: 201 AFFAGQMHGRVRPALIKHWKDRGDPDMRIYEVLPPDVARRTSYVQHMKSSKFCICAMGFE 260

Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
           VN+ RI +S+YY CVPV+IA+++ LPF+DVLNW SFS+ V+  D+P LK++L  +S + Y
Sbjct: 261 VNSPRIVESIYYDCVPVLIADNFVLPFSDVLNWGSFSLTVSEKDVPRLKELLLAVSEDRY 320

Query: 285 LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
             +QS + KVRKHF WH     +D F+M+++ +W RR
Sbjct: 321 RKMQSRLKKVRKHFLWHDSAERFDMFHMILHSVWTRR 357


>gi|302753728|ref|XP_002960288.1| glycosylransferase-like protein [Selaginella moellendorffii]
 gi|300171227|gb|EFJ37827.1| glycosylransferase-like protein [Selaginella moellendorffii]
          Length = 372

 Score =  228 bits (581), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 132/337 (39%), Positives = 197/337 (58%), Gaps = 23/337 (6%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYF-KKAFMKSHFVTKDPSKADLF 59
           M R F+VYVY  +  +P      P+     G Y+SE  F  +    S FVT DP +A +F
Sbjct: 28  MERLFKVYVY--KEGEPRLVHKGPLT----GIYSSEGRFIHEMNQNSRFVTHDPQEAHMF 81

Query: 60  FLPFSIARMRHDRRI-GTEG---IPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
           FLP+S+A M  D  + G+     +  FI  Y+  I+ K+P+WN T G+DHF+ +CH  G 
Sbjct: 82  FLPYSVAHMVLDLYVPGSHTMLPLATFIKDYVNLIASKHPFWNLTRGSDHFFTSCHDWGP 141

Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQ-EDPPKLGS---SKRNKL 171
           +  +   E++ N+++VVC+S       +  KD SLP+ +    + P KLG    SKR  L
Sbjct: 142 ATARDHPELRKNSVKVVCNSD-LTEEFVPDKDASLPETYLHAVKLPTKLGGPGPSKRPIL 200

Query: 172 AFFAGAVNSPVREKLLQVWRNDS-------EIYAHSGRLKTPYADELLGSKFCLHVKGFE 224
           AFFAG ++  VR  L++ W++         E+       +T Y   +  SKFC+   GFE
Sbjct: 201 AFFAGQMHGRVRPALIKHWKDRGDPDMRIYEVLPPEVARRTSYVQHMKSSKFCICAMGFE 260

Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
           VN+ RI +S+YY CVPV+IA+++ LPF+DVLNW SFS+ V+  D+P LK++L  +S + Y
Sbjct: 261 VNSPRIVESIYYDCVPVLIADNFVLPFSDVLNWGSFSLTVSEKDVPRLKELLLAVSEDRY 320

Query: 285 LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
             +QS + KVRKHF WH     +D F+M+++ +W RR
Sbjct: 321 RKMQSRLKKVRKHFLWHDSAERFDMFHMILHSVWTRR 357


>gi|224133768|ref|XP_002321656.1| predicted protein [Populus trichocarpa]
 gi|222868652|gb|EEF05783.1| predicted protein [Populus trichocarpa]
          Length = 407

 Score =  228 bits (581), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 135/355 (38%), Positives = 199/355 (56%), Gaps = 39/355 (10%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M +  +V+VYP    +P       +D + + NYASE YF        F+T++P +A LFF
Sbjct: 61  MEKDLKVFVYP--GGNP-KTCYHSIDKKLKSNYASEHYFFMNLRNGSFLTENPDEAHLFF 117

Query: 61  LPFSIARMRHDRRIGTEGIPDF----ISHYIFNISQKYPYWNRTGGADHFYVACHSIGRS 116
           +P S   M        + +P +    I +Y+  ++ KYPYWNRT GADHF+V+CH IG  
Sbjct: 118 IPLSCQPME-----DQDALPRYKEMVIQNYVRALTIKYPYWNRTLGADHFFVSCHGIGNR 172

Query: 117 AMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI----WPRQED------------- 159
           A      +  NAI++VCS SY  S +I HKDVSLPQI    +P + D             
Sbjct: 173 ATAAFPFLLKNAIRLVCSPSY-DSNYIPHKDVSLPQILELSFPPEGDGMWNDSTMESLPI 231

Query: 160 --PPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAH------SGRLKTPYADEL 211
              P      R KL F+AG+ NS VR+ L   ++   E   H         +   +  E+
Sbjct: 232 QLSPVETHPSRTKLCFWAGSPNSEVRKNLRVHYKGLEEFEIHFVENVKRALVLDTFQKEI 291

Query: 212 LGSKFCLHVKG-FEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIP 270
             SKFC+  +G  +V    +A+S+ +GCVPVI++++YDLPF D+L+W +FS+++   D+P
Sbjct: 292 HRSKFCICPRGKTQVGGVCLAESMAFGCVPVIMSDYYDLPFNDILDWNAFSVILKEHDVP 351

Query: 271 LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           ++ +ILKGI  + +  ++  VLKV K+F+WH  P  YD F+MVMYELW RR  +R
Sbjct: 352 IMGEILKGIPEDMFEKMRQNVLKVSKYFKWHFRPVKYDEFHMVMYELWKRRHIIR 406


>gi|297804498|ref|XP_002870133.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297315969|gb|EFH46392.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 540

 Score =  228 bits (580), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 136/352 (38%), Positives = 202/352 (57%), Gaps = 30/352 (8%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEP--RGNYASESYFKKAF-MKSHFVTKDPSKAD 57
           M    +VY+YP            P+  +P   G YASE +F K     + FVTK+P +A 
Sbjct: 194 MELILKVYIYPDGEK--------PIFHQPHLNGIYASEGWFMKLMESNTQFVTKNPERAH 245

Query: 58  LFFLPFSIARMRHDRRI-GTEGIPD---FISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
           LF++P+S+ +++    + G+  I     F+  Y+  +S KYP+WNRT G+DHF VACH  
Sbjct: 246 LFYMPYSVKQLQTSIFVPGSHNIKPLSIFLRDYVNMLSTKYPFWNRTHGSDHFLVACHDW 305

Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS----SK 167
           G   + +  E++ N I+ +C++       I  KDVSLP+   R    P   +G+    S+
Sbjct: 306 GPYTVNEHPELRRNTIKALCNADLADGIFIPGKDVSLPETSIRNAGKPLRNIGNGNRVSQ 365

Query: 168 RNKLAFFAGAVNSPVREKLLQVWRN---DSEIYA---HSGRLKTPYADELLGSKFCLHVK 221
           R  LAFFAG ++  VR KLL+ WRN   D +IY    H+   K  Y   +  SK+CL   
Sbjct: 366 RPILAFFAGNLHGRVRPKLLKHWRNKDDDMKIYGPLPHNVARKMTYVQHMKSSKYCLCPM 425

Query: 222 GFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISS 281
           G+EVN+ RI +++YY CVPV+IA+++ LPF+DVL+W +FS+VV   +IP LK+IL  I  
Sbjct: 426 GYEVNSPRIVEAIYYECVPVVIADNFMLPFSDVLDWSAFSVVVPEKEIPRLKEILLEIPM 485

Query: 282 EEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWSTSLD 333
             YL +QS V  V++HF W   P  YD F+M+++ +W    ++  Q  TS D
Sbjct: 486 RRYLKMQSNVKMVQRHFLWSPKPRKYDVFHMILHSIWF---NLLNQNQTSFD 534


>gi|356537024|ref|XP_003537031.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
           max]
          Length = 472

 Score =  228 bits (580), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 134/338 (39%), Positives = 191/338 (56%), Gaps = 20/338 (5%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M + F+V+VY       F N      +   GN+          M  HF TKDP KA +FF
Sbjct: 139 MEKQFKVFVYEEGETPVFHNGPCKSIYSMEGNF-----IHAIEMNDHFRTKDPKKAHVFF 193

Query: 61  LPFSIARM------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
           LPFS+  M      R  R  G   I   +  YI  I+ +Y YWNR+ GADHF +ACH  G
Sbjct: 194 LPFSVVMMVRFVYQRDSRDFGP--IRKTVIDYINLIAARYSYWNRSLGADHFMLACHDWG 251

Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKL----GSSKRNK 170
             A      +  N+I+V+C+++    G    KDVS P+I  +            +SKR+ 
Sbjct: 252 PEASLSLPYLHKNSIRVLCNANT-SEGFKPAKDVSFPEINLQTGSINGFIGGPSASKRSI 310

Query: 171 LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEVNTA 228
           LAFFAG V+ P+R  LL+ W N D +I  H    K   Y D+L  SKFCL   G+EV + 
Sbjct: 311 LAFFAGGVHGPIRPILLEHWENKDEDIQVHKYLPKGVSYYDKLRNSKFCLCPSGYEVASP 370

Query: 229 RIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
           R+ +++Y GCVPV+I+ HY  PF+DVLNWKSFS+ ++  DIP LK IL  IS  +Y+ +Q
Sbjct: 371 RVVEAIYTGCVPVLISEHYVPPFSDVLNWKSFSVELSVKDIPNLKDILMSISPRQYIRMQ 430

Query: 289 SYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
             V+++++HF+ H  P  +D F+M+++ +WLRR + R+
Sbjct: 431 RRVIQIQRHFEVHSPPKRFDVFHMILHSVWLRRLNFRM 468


>gi|356546040|ref|XP_003541440.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
           max]
          Length = 491

 Score =  226 bits (577), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 134/339 (39%), Positives = 191/339 (56%), Gaps = 20/339 (5%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M + F+V+VY       F N      +   GN+          M  HF TKDP KA +FF
Sbjct: 158 MEKQFKVFVYEEGETPVFHNGPCKSIYSMEGNF-----IHAIEMNDHFRTKDPKKAHVFF 212

Query: 61  LPFSIARM------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
           LPFS+  M      R  R  G   I   +  Y+  I+ +YPYWNR+ GADHF +ACH  G
Sbjct: 213 LPFSVVMMVRFVYERDSRDFGP--IKKTVIDYVNLIATRYPYWNRSLGADHFMLACHDWG 270

Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKL----GSSKRNK 170
             A      +  N+I+V+C+++    G    KDVS P+I  +            +SKR+ 
Sbjct: 271 PEASFSLPYLHKNSIRVLCNANT-SEGFKPAKDVSFPEINLQTGSINGFVGGPSASKRSI 329

Query: 171 LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEVNTA 228
           LAFFAG V+ P+R  LL+ W N D +I  H    K   Y   L  SKFCL   G+EV + 
Sbjct: 330 LAFFAGGVHGPIRPILLEHWENKDEDIQVHKYLPKGVSYYGMLRKSKFCLCPSGYEVASP 389

Query: 229 RIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
           R+ +++Y GCVPV+I+ HY  PF+DVLNWKSFS+ ++  DIP+LK IL  IS  +++ +Q
Sbjct: 390 RVVEAIYTGCVPVLISEHYVPPFSDVLNWKSFSVELSVKDIPILKDILMSISPRQHIRMQ 449

Query: 289 SYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
             V ++R+HF+ H  P  +D F+M+++ +WLRR + RV+
Sbjct: 450 RRVGQIRRHFEVHSPPKRFDVFHMILHSVWLRRLNFRVR 488


>gi|356570806|ref|XP_003553575.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
           max]
          Length = 537

 Score =  226 bits (576), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 134/339 (39%), Positives = 192/339 (56%), Gaps = 22/339 (6%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAF-MKSHFVTKDPSKADLF 59
           M + F+V+VY       F N         +  Y+ E  F  A  M   F T+DP +A +F
Sbjct: 204 MEKQFKVFVYEEGEPPVFHNGPC------KSIYSMEGNFIHAIEMNDQFRTRDPEEAHVF 257

Query: 60  FLPFSIARM------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
           FLPFS+A +      R     G   I   ++ Y+  I  +YPYWNR+ GADHFY+ACH  
Sbjct: 258 FLPFSVAMLVQFVYVRDSHDFGP--IKKTVTDYVNVIGGRYPYWNRSLGADHFYLACHDW 315

Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKL----GSSKRN 169
           G    +    +  N+I+V+C+++    G    KDVS P+I  +            +S+R 
Sbjct: 316 GPETSRSIPNLNKNSIRVLCNANTS-EGFKPSKDVSFPEINLQTGSINGFIGGPSASRRP 374

Query: 170 KLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEVNT 227
            LAFFAG ++ P+R  LL+ W N D +I  H    K   Y + L  SKFCL   G+EV +
Sbjct: 375 LLAFFAGGLHGPIRPVLLEHWENKDEDIQVHKYLPKGVSYYEMLRKSKFCLCPSGYEVAS 434

Query: 228 ARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLL 287
            R+ +++Y GCVPV+I++HY  PF DVLNWKSFS+ V+  DIP LK+IL  IS  +Y+ +
Sbjct: 435 PRVVEAIYTGCVPVLISDHYVPPFNDVLNWKSFSVEVSVKDIPRLKEILLSISPRQYIRM 494

Query: 288 QSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
           Q  V +VR+HF+ H  P  YD F+M+++ +WLRR + RV
Sbjct: 495 QRRVGQVRRHFEVHSPPKRYDVFHMILHSVWLRRLNFRV 533


>gi|227202794|dbj|BAH56870.1| AT4G38040 [Arabidopsis thaliana]
          Length = 407

 Score =  226 bits (576), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 123/302 (40%), Positives = 178/302 (58%), Gaps = 12/302 (3%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M + F+VY+YP    DP  N       +  G YASE YF +   +S F T DP +ADLFF
Sbjct: 100 MEKRFKVYIYPD--GDP--NTFYQTPRKVTGKYASEGYFFQNIRESRFRTLDPDEADLFF 155

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           +P S  +MR  +    E +   + +Y+  +  KYPYWNRT GADHF+V CH +G  A + 
Sbjct: 156 IPISCHKMR-GKGTSYENMTVIVQNYVDGLIAKYPYWNRTLGADHFFVTCHDVGVRAFEG 214

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS-SKRNKLAFFAGAVN 179
           +  +  N I+VVCS SY + G I HKDV+LPQ+      P        R  L F+AG  N
Sbjct: 215 SPLLIKNTIRVVCSPSYNV-GFIPHKDVALPQVLQPFALPAGGNDVENRTTLGFWAGHRN 273

Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIADSL 234
           S +R  L  VW ND+E+   + R+        Y      +KFC+   G +VN+ARI DS+
Sbjct: 274 SKIRVILAHVWENDTELDISNNRINRATGHLVYQKRFYRTKFCICPGGSQVNSARITDSI 333

Query: 235 YYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
           +YGC+PVI++++YDLPF D+LNW+ F++V+   D+  LK+ILK I   E++ L + ++KV
Sbjct: 334 HYGCIPVILSDYYDLPFNDILNWRKFAVVLREQDVYNLKQILKNIPHSEFVSLHNNLVKV 393

Query: 295 RK 296
           ++
Sbjct: 394 KR 395


>gi|168059393|ref|XP_001781687.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666856|gb|EDQ53500.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 376

 Score =  225 bits (574), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 129/339 (38%), Positives = 194/339 (57%), Gaps = 26/339 (7%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPR-GNYASESYF-KKAFMKSHFVTKDPSKADL 58
           M + F+VY+Y   R  P       V   P+ G YASE  F ++    S F+T DPS+A +
Sbjct: 29  MEKVFKVYIYKDGRK-PL------VHSGPQLGIYASEGQFIERMEAASEFLTDDPSRAHM 81

Query: 59  FFLPFSIARMRHDRRIGTE----GIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
           FFLP+S+ RM     +        +  FI  Y+  ++++YPYWNRT GADHF+V+CH  G
Sbjct: 82  FFLPYSVYRMVTHLYVPNSRSMLPLATFIKDYVEALARQYPYWNRTKGADHFFVSCHDWG 141

Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW---PRQEDPPKLGS---SKR 168
            +  +    ++ NA++VVC++       +  KD SLP+++    + + P KLG     +R
Sbjct: 142 PATARDHPTLRSNAVKVVCNAD-LTEEFVVGKDASLPEVYMHKSKTKAPIKLGGPGYDER 200

Query: 169 NKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTP------YADELLGSKFCLHVKG 222
             LAFFAG ++  VR  LL  W++        G L  P      Y   +  SK+C+   G
Sbjct: 201 PYLAFFAGQMHGRVRPILLDHWKDKDPDLMIYGVLPKPIAKQISYVQHMKMSKYCICAAG 260

Query: 223 FEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSE 282
           +EVN+ RI +S++Y CVPVIIA+++ LPF+DVLNW +FS+ +   DIP LK IL  I  +
Sbjct: 261 YEVNSPRIVESIHYDCVPVIIADNFVLPFSDVLNWDAFSVTMPESDIPKLKAILNDIPEK 320

Query: 283 EYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
            Y  +Q  + K+R+HF WH  P  YD F+M+++ +W+ R
Sbjct: 321 TYRSMQIRLRKIRQHFVWHKKPEKYDVFHMILHSVWMSR 359


>gi|356533401|ref|XP_003535253.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
           max]
          Length = 523

 Score =  225 bits (573), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 139/345 (40%), Positives = 197/345 (57%), Gaps = 32/345 (9%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPV--DFEPRGNYASESYFKKAF-MKSHFVTKDPSKAD 57
           M + F+V+VY            LPV  D      Y++E  F  A  M  HF T+DP KA+
Sbjct: 182 MEKQFKVFVYEEGE--------LPVFHDGPCSSIYSTEGSFIHAIEMNEHFRTRDPKKAN 233

Query: 58  LFFLPFSIARM------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
           +FFLPFSIA M      R+    G   I   +  Y+  I+ +YPYWNR+ GADHF ++CH
Sbjct: 234 VFFLPFSIAWMVRYVYIRNSYDFGP--IKRTVRDYVNVIATRYPYWNRSLGADHFMLSCH 291

Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI--WPRQED-----PPKLG 164
             G    +    ++ N+I+V+C+++    G    KD S P+I   P  +D     PP   
Sbjct: 292 DWGPETSKSIPYLRKNSIRVLCNANT-SEGFDPIKDASFPEINLQPGLKDSFVGGPP--- 347

Query: 165 SSKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFCLHVKG 222
           +SKR+ LAFFAG  + P+R  LL+ W N D +I  H    K   Y   L  SKFCL   G
Sbjct: 348 ASKRSILAFFAGGNHGPIRPILLEHWENKDEDIQVHKYLPKGVSYYGMLRNSKFCLCPSG 407

Query: 223 FEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSE 282
           +EV + R+ +++Y GCVPV+I+ HY  PF+DVLNWK FS+ V+  +IP LK IL  IS  
Sbjct: 408 YEVASPRVVEAIYTGCVPVLISEHYVPPFSDVLNWKMFSVNVSVKEIPNLKDILTSISPR 467

Query: 283 EYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
           +Y+ +Q  V ++R+HF+ H  P  YD F+M+++ +WLRR + RV 
Sbjct: 468 QYIRMQKRVGQIRRHFEVHSPPKRYDVFHMILHSVWLRRLNFRVH 512


>gi|356548353|ref|XP_003542567.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
           max]
          Length = 505

 Score =  223 bits (569), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 138/341 (40%), Positives = 199/341 (58%), Gaps = 26/341 (7%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFE-PRGN-YASESYFKKAF-MKSHFVTKDPSKAD 57
           M + F+V+VY            LPV  E P  + Y++E  F  A  M  HF T+DP KA 
Sbjct: 172 MEKQFKVFVYEEGE--------LPVFHEGPCASIYSTEGSFIHAIEMNEHFRTRDPKKAH 223

Query: 58  LFFLPFSIARM------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
           +FFLPFS+  M      R     G   I   +  YI  I+ +YPYWNR+ GADHF ++CH
Sbjct: 224 VFFLPFSVVMMVRYVYIRDSHDFGP--IKRTVRDYINVIAARYPYWNRSLGADHFMLSCH 281

Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQEDPPKLG---SSK 167
             G  A + +  ++ N+I+V+C+++    G    KDVS P+I   R      LG   +S+
Sbjct: 282 DWGPEASKFSPYLRKNSIRVLCNANT-SEGFDPRKDVSFPEINLQRGPIDGLLGGPSASQ 340

Query: 168 RNKLAFFAGAVNSPVREKLLQVW-RNDSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEV 225
           R+ LAFFAG ++ P+R  LL+ W + D +I  H    K   Y   L  SKFCL   G+EV
Sbjct: 341 RSILAFFAGGIHGPIRPILLEHWEKKDEDIQVHQYLPKGVSYYGMLRKSKFCLCPSGYEV 400

Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
            + R+ +++Y GCVPV+I++HY  PF+DVLNWK FS+ V+  +IP LK IL  IS  +Y+
Sbjct: 401 ASPRVVEAIYTGCVPVLISDHYVPPFSDVLNWKMFSVEVSMKEIPNLKDILMNISPRKYI 460

Query: 286 LLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
            +Q  V ++R+HF+ H  P  YD F+M+++ +WLRR + RV
Sbjct: 461 RMQKRVRQIRRHFEVHSPPKRYDVFHMILHSVWLRRLNFRV 501


>gi|356505300|ref|XP_003521429.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
           max]
          Length = 534

 Score =  223 bits (569), Expect = 8e-56,   Method: Compositional matrix adjust.
 Identities = 134/339 (39%), Positives = 190/339 (56%), Gaps = 22/339 (6%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAF-MKSHFVTKDPSKADLF 59
           M + F+V+VY       F N         +  Y+ E  F  A  M   F T+DP KA +F
Sbjct: 201 MEKQFKVFVYEEGEPPVFHNGPC------KSIYSMEGNFIHAIEMNDQFRTRDPEKAHVF 254

Query: 60  FLPFSIARM------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
           FLPFS+A +      R     G   I   ++ Y+  I+ +YPYWNR+ GADHFY+ACH  
Sbjct: 255 FLPFSVAMLVQFVYVRDSHDFGP--IKKTVTDYVNVIAGRYPYWNRSLGADHFYLACHDW 312

Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKL----GSSKRN 169
           G    +    +  N+I+V+C+++    G    KDVS P+I  +            +S R 
Sbjct: 313 GPETSRSIPNLNENSIRVLCNANTS-EGFKPSKDVSFPEINLQTGSINGFIGGPSASGRP 371

Query: 170 KLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEVNT 227
            LAFFAG ++ P+R  LL+ W N D +I  H    K   Y + L  S+FCL   G+EV +
Sbjct: 372 LLAFFAGGLHGPIRPVLLEHWENRDEDIQVHKYLPKGVSYYEMLRKSRFCLCPSGYEVAS 431

Query: 228 ARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLL 287
            R+ +++Y GCVPV+I++HY  PF DVLNWKSFS+ V+  DIP LK+IL  IS   Y+ +
Sbjct: 432 PRVVEAIYTGCVPVLISDHYVPPFNDVLNWKSFSVEVSVKDIPRLKEILLSISPRHYIRM 491

Query: 288 QSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
           Q  V  VR+HF+ H  P  YD F+M+++ +WLRR + RV
Sbjct: 492 QRRVGLVRRHFEVHSPPKRYDVFHMILHSVWLRRLNFRV 530


>gi|414881250|tpg|DAA58381.1| TPA: hypothetical protein ZEAMMB73_758303 [Zea mays]
          Length = 474

 Score =  223 bits (568), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 137/351 (39%), Positives = 198/351 (56%), Gaps = 36/351 (10%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH-FVTKDPSKADLF 59
           M R  +VY+Y   R   F    L       G YASE +F K   +S   V  D  KA LF
Sbjct: 133 MERILKVYIYQDGRRPIFHTPPL------SGIYASEGWFMKLLKESRRHVVADAGKAHLF 186

Query: 60  FLPFSIARMR--------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
           +LP+S  ++R        H+ R     +  ++ +++  ++ KYP+WNRT GADHF VACH
Sbjct: 187 YLPYSSQQLRLTLYEAGSHNLR----PLAAYLRNFVRGLASKYPFWNRTRGADHFLVACH 242

Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP-----KLGSS 166
             G        +++ NAI+ +C++          KDVSLP+   R    P      L  S
Sbjct: 243 DWGPYTTTAHRDLRKNAIKALCNADSSEGIFTPGKDVSLPETTIRTPRRPLRYVGGLPVS 302

Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRN----DSEIYAH-----SGRLKTPYADELLGSKFC 217
           +R+ LAFFAG V+  VR  LL+ W N    D  +Y+      S R+   Y   +  S+FC
Sbjct: 303 RRSILAFFAGNVHGRVRPVLLRHWGNGQDDDMRVYSLLPSRVSRRMN--YIQHMKNSRFC 360

Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILK 277
           L   G+EVN+ RI ++LYY CVPVIIA+++ LPF++VL+W +FS+V+A  DIP LKKILK
Sbjct: 361 LCPMGYEVNSPRIVEALYYECVPVIIADNFVLPFSEVLDWSAFSVVIAEKDIPDLKKILK 420

Query: 278 GISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR-SSVRVQ 327
           GIS   Y+ +   V ++++HF WH  P  YD F+M+++ +WL R + V+VQ
Sbjct: 421 GISLRRYVAMHDSVKRLQRHFLWHARPIKYDLFHMILHSIWLSRVNQVQVQ 471


>gi|359481952|ref|XP_002284018.2| PREDICTED: probable glycosyltransferase At5g03795 [Vitis vinifera]
          Length = 546

 Score =  223 bits (567), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 133/339 (39%), Positives = 187/339 (55%), Gaps = 20/339 (5%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M +  +VYVY       F N      +   GN+       K  M SHF TKDP KA LFF
Sbjct: 210 MEKQLKVYVYDEGEPPLFHNGPCKSIYSMEGNF-----IHKMEMDSHFRTKDPEKAHLFF 264

Query: 61  LPFSIARM------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
           LPFS+A +      R    +G   I   +  Y+  +S KYPYWNR+ GADHF +ACH  G
Sbjct: 265 LPFSVAMLVRFVYVRDSHDLGP--IKQTVIDYVNVVSTKYPYWNRSLGADHFMLACHDWG 322

Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKL----GSSKRNK 170
                    +  N+I+V+C+++    G    KDVS P+I                S R  
Sbjct: 323 PETSFSIPYLHKNSIRVLCNANTS-EGFNPSKDVSFPEINLLTGSTDSFIGGPSPSHRTL 381

Query: 171 LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEVNTA 228
           LAFFAG ++ P+R  LL+ W N D ++  H    K   Y + +  SK+CL   G+EV + 
Sbjct: 382 LAFFAGGLHGPIRPILLEHWENKDEDVKVHKYLPKGVSYYEMMRKSKYCLCPSGYEVASP 441

Query: 229 RIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
           R+ ++LY GCVPV+I++HY  PF+DVLNWKSFS+ V   +IP LK+IL  IS  +Y+ +Q
Sbjct: 442 RVVEALYTGCVPVLISDHYVPPFSDVLNWKSFSVEVPVREIPNLKRILMDISPRQYIRMQ 501

Query: 289 SYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
              ++ R+HF+ +  P  YD F+M+++ LWLRR + RV 
Sbjct: 502 RRGIQARRHFEVNSPPKRYDVFHMILHSLWLRRLNFRVH 540


>gi|326519558|dbj|BAK00152.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 564

 Score =  222 bits (566), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 130/337 (38%), Positives = 196/337 (58%), Gaps = 23/337 (6%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAF-MKSHFVTKDPSKADLF 59
           M R  +V++Y H    P  +       E +G YASE +F K      +FV +DP++A LF
Sbjct: 226 MERLLKVFIY-HDGAKPIFH-----SPELKGIYASEGWFMKLIEADQNFVVRDPNRAHLF 279

Query: 60  FLPFSIARMRHDRRIGTEGIPD----FISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
           +LP+S  ++ H+  +      D    F+ +YI  IS K+PYWNRT GADHF+VACH  G 
Sbjct: 280 YLPYSSRQLEHNLYVPGSNTLDPLSVFVKNYIDMISAKFPYWNRTKGADHFFVACHDWGP 339

Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLG---SSKRNK 170
              +   E++ N I+ +C++       I  +DVSLP+ + +    P   +G   +++R+ 
Sbjct: 340 YTTKLHDELRKNTIKALCNADVSEGVFIRGRDVSLPETYIKSARRPVRDIGGKPAAERSI 399

Query: 171 LAFFAGAVNSPVREKLLQVW---RNDSEIYAHSGRLKT---PYADELLGSKFCLHVKGFE 224
           LAFFAG ++  VR  +L+ W     D  IY+   R  T    YA  +  SK+C+   G+E
Sbjct: 400 LAFFAGQMHGRVR-PVLKYWGGKDTDMRIYSRIPRQITRRMNYAKHMKSSKYCICPMGYE 458

Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
           VN+ RI +++YY CVPVIIA+++ LPF D L+W +FS+VVA  D+P LK IL  I    Y
Sbjct: 459 VNSPRIVEAIYYECVPVIIADNFVLPFDDALDWSAFSVVVAEKDVPKLKAILLAIPESRY 518

Query: 285 LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
           + ++S V KV++HF WH  P  YD F+M+++ +W  R
Sbjct: 519 ITMRSNVKKVQRHFLWHAKPVKYDIFHMILHSVWFSR 555


>gi|255575457|ref|XP_002528630.1| catalytic, putative [Ricinus communis]
 gi|223531919|gb|EEF33733.1| catalytic, putative [Ricinus communis]
          Length = 574

 Score =  222 bits (566), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 130/339 (38%), Positives = 193/339 (56%), Gaps = 20/339 (5%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M + F+V+VY       F N      +   GN+       +  +   F TKDP KA ++F
Sbjct: 236 MEKQFKVFVYEEGEPPVFHNGPCKSIYSMEGNF-----IHRMEIDDQFRTKDPEKAHVYF 290

Query: 61  LPFSIARM------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
           LPFS+A M      R     G   I   +  Y+  ++ KYPYWNR+ GADHF +ACH  G
Sbjct: 291 LPFSVAMMVQFVYVRDSHDFGP--IKRTVRDYVNLVAGKYPYWNRSLGADHFMLACHDWG 348

Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKL----GSSKRNK 170
                   ++  N+I+ +C+++     +   KDVS P+I  +             SKR+ 
Sbjct: 349 PETSFSLPDLAKNSIRALCNANTSERFNPI-KDVSFPEINLQTGTTKGFIGGPSPSKRSI 407

Query: 171 LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEVNTA 228
           LAFFAG ++ P+R  LL+ W N D+++  H    K   Y + +  SKFCL   G+EV + 
Sbjct: 408 LAFFAGGLHGPIRPILLEHWENKDNDMKVHRYLPKGVSYYEMMRKSKFCLCPSGYEVASP 467

Query: 229 RIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
           R+ ++LY GCVPV+I++HY  PF+DVLNWKSFS+ V   DIP LK+IL  ISS +Y+ +Q
Sbjct: 468 RVVEALYTGCVPVLISDHYVPPFSDVLNWKSFSVEVPVSDIPNLKRILTSISSRQYIRMQ 527

Query: 289 SYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
             VL+VR+HF+ +  P  YD F+M+++ +WLRR +V++ 
Sbjct: 528 RRVLQVRRHFEVNSPPKRYDVFHMILHSIWLRRLNVKIH 566


>gi|147784355|emb|CAN72733.1| hypothetical protein VITISV_033460 [Vitis vinifera]
          Length = 321

 Score =  222 bits (565), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 112/255 (43%), Positives = 163/255 (63%), Gaps = 7/255 (2%)

Query: 77  EGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSS 136
           E +   + +Y+ ++  KYPYWNRT GADHF++ CH  G  A +    +  N+I+VVCS S
Sbjct: 67  ENMTIIVQNYVQSLMSKYPYWNRTLGADHFFLTCHDXGVRATEGVPLLVKNSIRVVCSPS 126

Query: 137 YFISGHIAHKDVSLPQIWPRQEDPPKLGSSK-RNKLAFFAGAVNSPVREKLLQVWRNDSE 195
           Y + G I HKDV+LPQ+      P      K R  L F+AG  NS +R  L ++W ND+E
Sbjct: 127 YDV-GFIPHKDVALPQVLQPFALPTGGRDIKNRTTLGFWAGHRNSKIRVILARIWENDTE 185

Query: 196 IYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLP 250
           +   + R+        Y ++   +KFC+   G +VN+ARIADS++YGCVPVI++++YDLP
Sbjct: 186 LDIKNNRINRATGHLVYQNKFYRTKFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLP 245

Query: 251 FADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAF 310
           F D+L+W+ FS+++   D+  LK ILK I   E++ L   ++KV+KHFQW+  P  YDAF
Sbjct: 246 FNDILDWRKFSVILKERDVYRLKYILKDIPDAEFIALHDNLVKVQKHFQWNTPPIKYDAF 305

Query: 311 YMVMYELWLRRSSVR 325
           +MVMYELWLR   ++
Sbjct: 306 HMVMYELWLRHHVIK 320


>gi|297740031|emb|CBI30213.3| unnamed protein product [Vitis vinifera]
          Length = 337

 Score =  221 bits (564), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 133/339 (39%), Positives = 187/339 (55%), Gaps = 20/339 (5%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M +  +VYVY       F N      +   GN+       K  M SHF TKDP KA LFF
Sbjct: 1   MEKQLKVYVYDEGEPPLFHNGPCKSIYSMEGNF-----IHKMEMDSHFRTKDPEKAHLFF 55

Query: 61  LPFSIARM------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
           LPFS+A +      R    +G   I   +  Y+  +S KYPYWNR+ GADHF +ACH  G
Sbjct: 56  LPFSVAMLVRFVYVRDSHDLGP--IKQTVIDYVNVVSTKYPYWNRSLGADHFMLACHDWG 113

Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKL----GSSKRNK 170
                    +  N+I+V+C+++    G    KDVS P+I                S R  
Sbjct: 114 PETSFSIPYLHKNSIRVLCNANT-SEGFNPSKDVSFPEINLLTGSTDSFIGGPSPSHRTL 172

Query: 171 LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEVNTA 228
           LAFFAG ++ P+R  LL+ W N D ++  H    K   Y + +  SK+CL   G+EV + 
Sbjct: 173 LAFFAGGLHGPIRPILLEHWENKDEDVKVHKYLPKGVSYYEMMRKSKYCLCPSGYEVASP 232

Query: 229 RIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
           R+ ++LY GCVPV+I++HY  PF+DVLNWKSFS+ V   +IP LK+IL  IS  +Y+ +Q
Sbjct: 233 RVVEALYTGCVPVLISDHYVPPFSDVLNWKSFSVEVPVREIPNLKRILMDISPRQYIRMQ 292

Query: 289 SYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
              ++ R+HF+ +  P  YD F+M+++ LWLRR + RV 
Sbjct: 293 RRGIQARRHFEVNSPPKRYDVFHMILHSLWLRRLNFRVH 331


>gi|51535578|dbj|BAD37522.1| pectin-glucuronyltransferase-like [Oryza sativa Japonica Group]
          Length = 342

 Score =  221 bits (563), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 136/347 (39%), Positives = 197/347 (56%), Gaps = 28/347 (8%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M RSFRV+VYP    DP      P      G YASE YF +   +S F T D  KA LFF
Sbjct: 1   MERSFRVFVYPD--GDPGTFYQTPRKLT--GKYASEGYFFQNIRESRFRTDDLEKAHLFF 56

Query: 61  LPFSIARMRHD-----RRIGTEGIPDFISHYIFNISQKYPYW----------NRTGGADH 105
           +P S  +MR         +    +   I  Y  +I     YW          +   GADH
Sbjct: 57  VPISPHKMRGKVPSSLLLVTYAWLILHIRSYDRSILFLDLYWWCPLCSSFRGHWGVGADH 116

Query: 106 FYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS 165
           F+V CH +G  A +    +  N+I+VVCS SY  +G+I HKDV+LPQI  +    P  G+
Sbjct: 117 FFVTCHDVGVRAFEGLPFIIKNSIRVVCSPSY-NAGYIPHKDVALPQIL-QPFALPAGGN 174

Query: 166 SKRNK--LAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCL 218
              N+  L F+AG  NS +R  L ++W ND+E+   + R+        Y      +KFC+
Sbjct: 175 DIENRTILGFWAGHRNSKIRVILARIWENDTELAISNNRINRAIGNLVYQKHFFRTKFCV 234

Query: 219 HVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKG 278
              G +VN+ARI+DS++YGC+PVI++++YDL F+ +LNW+ F++V+   D+  LK ILK 
Sbjct: 235 CPGGSQVNSARISDSIHYGCMPVILSDYYDLRFSGILNWRKFAVVLKESDVYELKSILKS 294

Query: 279 ISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           +S +E++ L   +++V+KHF+WH  P  YDAF+M+MYELWLR   ++
Sbjct: 295 LSQKEFVSLHKSLVQVQKHFEWHSPPVPYDAFHMIMYELWLRHHVIK 341


>gi|326529697|dbj|BAK04795.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 348

 Score =  221 bits (563), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 137/342 (40%), Positives = 190/342 (55%), Gaps = 31/342 (9%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH-FVTKDPSKADLF 59
           M R  +VY+Y   R   F    L       G YASE +F K   +S  FV  D +KA LF
Sbjct: 9   MERILKVYIYQDGRRPIFHTPPLS------GIYASEGWFMKLLKESRRFVVADAAKAHLF 62

Query: 60  FLPFSIARMR--------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
           +LP+S   +R        H+ R     + DF+      ++ KYP+WNRT GADHF VACH
Sbjct: 63  YLPYSSQHLRLSLYVPDSHNLRPLAVYLRDFVK----GLAAKYPFWNRTRGADHFLVACH 118

Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP-----KLGSS 166
             G        ++  N+I+ +C++          KDVSLP+   R    P      L  S
Sbjct: 119 DWGPYTTTAHRDLSKNSIKALCNADSSEGIFTPGKDVSLPETTIRTPKRPLRYVGGLPVS 178

Query: 167 KRNKLAFFAGAVNSPVREKLLQVW----RNDSEIYAH-SGRLKTP--YADELLGSKFCLH 219
           +R  LAFFAG V+  VR  LLQ W     +D  +YA   GR+     Y   +  SKFCL 
Sbjct: 179 RRRILAFFAGNVHGRVRPVLLQHWGKGQDDDMRVYALLPGRVSRTMNYIQHMKNSKFCLC 238

Query: 220 VKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGI 279
             G+EVN+ RI ++LYY CVPVIIA+++ LPF+DVL+W +FS+VVA  DIP LK+IL+GI
Sbjct: 239 PMGYEVNSPRIVEALYYECVPVIIADNFVLPFSDVLDWSAFSVVVAEKDIPELKRILQGI 298

Query: 280 SSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
           S   Y+ +   V ++++HF W+  P  YD F+M+++ +WL R
Sbjct: 299 SLRRYVAMHDCVKRLQRHFLWYDRPLRYDLFHMILHSIWLSR 340


>gi|242053777|ref|XP_002456034.1| hypothetical protein SORBIDRAFT_03g029210 [Sorghum bicolor]
 gi|241928009|gb|EES01154.1| hypothetical protein SORBIDRAFT_03g029210 [Sorghum bicolor]
          Length = 517

 Score =  219 bits (559), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 132/342 (38%), Positives = 193/342 (56%), Gaps = 31/342 (9%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH-FVTKDPSKADLF 59
           M R  +VY+Y   R   F    L       G YASE +F K   +S   V  D  KA LF
Sbjct: 178 MERILKVYIYQDGRRPIFHTPPL------SGIYASEGWFMKLLKESRRHVVADAGKAHLF 231

Query: 60  FLPFSIARMR--------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
           +LP+S  ++R        H+ R     +  ++ +++  ++ KYP+WNRT GADHF VACH
Sbjct: 232 YLPYSSQQLRLTLYQADSHNLR----PLAAYLRNFVRGLANKYPFWNRTRGADHFLVACH 287

Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP-----KLGSS 166
             G        +++ NAI+ +C++          KDVSLP+   R    P      L  S
Sbjct: 288 DWGPYTTAAHRDLRKNAIKALCNADSSEGIFTPGKDVSLPETTIRNPRRPLRYVGGLPVS 347

Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRN--DSEIYAHS---GRL--KTPYADELLGSKFCLH 219
           +R+ LAFFAG V+  VR  LL+ W +  D E+  +S    R+  +  Y   +  S+FCL 
Sbjct: 348 RRSILAFFAGNVHGRVRPVLLRHWGDGQDDEMRVYSLLPNRVSRRMNYIQHMKNSRFCLC 407

Query: 220 VKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGI 279
             G+EVN+ RI ++ YY CVPVIIA+++ LP ++VL+W +FS+VVA  DIP LKKIL+GI
Sbjct: 408 PMGYEVNSPRIVEAFYYECVPVIIADNFVLPLSEVLDWSAFSVVVAEKDIPDLKKILQGI 467

Query: 280 SSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
           S   Y+ + S V ++++HF WH  P  YD F+M+++ +WL R
Sbjct: 468 SPRRYVAMHSCVKRLQRHFLWHARPIKYDLFHMILHSIWLSR 509


>gi|218188734|gb|EEC71161.1| hypothetical protein OsI_03019 [Oryza sativa Indica Group]
          Length = 499

 Score =  219 bits (558), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 130/343 (37%), Positives = 190/343 (55%), Gaps = 33/343 (9%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH-FVTKDPSKADLF 59
           M R  +VY+Y   R   F    L       G YASE +F K   +S  F   DP+KA LF
Sbjct: 160 MERILKVYIYQDGRRPIFHTPPL------SGIYASEGWFMKLLKESRRFAVTDPAKAHLF 213

Query: 60  FLPFSIARMR--------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
           +LP+S  ++R        H+ R     + DF+      ++ KYP+WNRT GADHF VACH
Sbjct: 214 YLPYSSQQLRISLYVPDSHNLRPLAAYLRDFVK----GLAAKYPFWNRTRGADHFLVACH 269

Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP-----KLGSS 166
             G        +++ N ++ +C++          +DVSLP+   R    P      L  S
Sbjct: 270 DWGSYTTTAHGDLRRNTVKALCNADSSEGIFTPGRDVSLPETTIRTPRRPLRYVGGLPVS 329

Query: 167 KRNKLAFFAGAVNSPVREKLLQVW----RNDSEIY----AHSGRLKTPYADELLGSKFCL 218
           +R  LAFFAG V+  VR  LL+ W     +D  +Y    A   R +  Y   +  S+FCL
Sbjct: 330 RRGILAFFAGNVHGRVRPVLLKHWGDGRDDDMRVYGPLPARVSR-RMSYIQHMKNSRFCL 388

Query: 219 HVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKG 278
              G+EVN+ RI ++LYY CVPVIIA+++ LP +DVL+W +F++VVA  D+P LKKIL+G
Sbjct: 389 CPMGYEVNSPRIVEALYYECVPVIIADNFVLPLSDVLDWSAFAVVVAEKDVPDLKKILQG 448

Query: 279 ISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
           I+  +Y+ +   V ++++HF WH  P  YD F+M+++ +WL R
Sbjct: 449 ITLRKYVAMHGCVKRLQRHFLWHARPLRYDLFHMILHSIWLSR 491


>gi|296083497|emb|CBI23466.3| unnamed protein product [Vitis vinifera]
          Length = 585

 Score =  219 bits (558), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 128/337 (37%), Positives = 187/337 (55%), Gaps = 22/337 (6%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKS-HFVTKDPSKADLF 59
           M    +VY+YP      F    L      RG YASE +F K   ++  FVT+DP KA LF
Sbjct: 199 METILKVYIYPDGARPIFHAPHL------RGIYASEGWFMKLMEENRQFVTRDPKKAHLF 252

Query: 60  FLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
           +LP+S  ++     +        +  F+  ++  I+ KYP+WNRT G+DHF VACH  G 
Sbjct: 253 YLPYSARQLETALYVPNSHNIRPLSIFLRDHVNMIAAKYPFWNRTHGSDHFLVACHDWGP 312

Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS---SKRNK 170
             + +  E+  N I+ +C++       +A KDVSLP+   R    P   +G    S+R  
Sbjct: 313 YTVNEHQELSRNTIKALCNADLSEGIFVAGKDVSLPETTIRNPRRPLRNVGGRRVSQRPI 372

Query: 171 LAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL------KTPYADELLGSKFCLHVKGFE 224
           LAFFAG ++  VR  LL+ W +  E     G L      K  Y   +  S+FC+   G+E
Sbjct: 373 LAFFAGNMHGRVRPTLLKYWSDKDEDMRIYGPLPNRISRKMSYIQHMKSSRFCICPMGYE 432

Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
           VN+ RI +++YY CVPVIIA+++  P  DVL+W +FS++VA  DIP LK+IL  I    Y
Sbjct: 433 VNSPRIVEAIYYECVPVIIADNFVPPLNDVLDWTAFSVIVAEKDIPKLKEILLAIPLRRY 492

Query: 285 LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
           L++Q+ V  V+KHF W+  P  YD F+M+++ +W  R
Sbjct: 493 LVMQTNVKMVQKHFLWNPKPVRYDLFHMILHSIWFSR 529


>gi|356565368|ref|XP_003550913.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
           max]
          Length = 534

 Score =  219 bits (558), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 128/315 (40%), Positives = 185/315 (58%), Gaps = 29/315 (9%)

Query: 30  RGNYASESYFKKAFMKS-HFVTKDPSKADLFFLPFSIARM--------RHDRRIGTEGIP 80
           +G YASE +F K   ++  FVTKDP KA LF+LP+S  +M         HD     + + 
Sbjct: 217 KGIYASEGWFMKLMEENKQFVTKDPEKAHLFYLPYSARQMGLTLYVPGSHD----LKPLS 272

Query: 81  DFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS 140
            F+  Y+  I+ KYP+WNRT G+DHF VACH  G   +    E+K N I+ +C++     
Sbjct: 273 IFLRDYVNKIAAKYPFWNRTQGSDHFLVACHDWGPYTVTGHEELKRNTIKALCNADLSEG 332

Query: 141 GHIAHKDVSLPQIWPRQEDPPK--LGSSK---RNKLAFFAGAVNSPVREKLLQVW----R 191
             +A +DVSLP+   R    P   LG ++   R  LAFFAG+++  VR  LL  W     
Sbjct: 333 VFVAGRDVSLPETTIRAPRRPLRYLGGNRVSLRPILAFFAGSMHGRVRPTLLTYWGGGKD 392

Query: 192 NDSEIYAH-----SGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANH 246
            D +IY       S R+   Y   +  SK+C+   GFEVN+ RI +++YY CVPVIIA++
Sbjct: 393 EDMKIYKRLPLRVSQRMT--YIQHMKSSKYCVCPMGFEVNSPRIVEAIYYECVPVIIADN 450

Query: 247 YDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSD 306
           + LPF++VL+W +FS+VVA  DIP LK+IL  I   +YL +Q+ V  V+KHF W+  P  
Sbjct: 451 FVLPFSEVLDWSAFSVVVAEKDIPRLKEILLSIPLRKYLTMQNNVKMVQKHFLWNPRPIR 510

Query: 307 YDAFYMVMYELWLRR 321
           YD F+M+++ +W  +
Sbjct: 511 YDLFHMILHSIWFNK 525


>gi|413926326|gb|AFW66258.1| hypothetical protein ZEAMMB73_148778 [Zea mays]
          Length = 267

 Score =  218 bits (555), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 110/255 (43%), Positives = 161/255 (63%), Gaps = 7/255 (2%)

Query: 77  EGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSS 136
           E +   +  Y+  +  KYPYWNRT GADHF+V CH +G  A +    +  N+I+VVCS S
Sbjct: 13  ENMTVIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGVRAFEGLKFMVKNSIRVVCSPS 72

Query: 137 YFISGHIAHKDVSLPQIW-PRQEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSE 195
           Y +   I HKD++LPQ+  P            R  L F+AG  NS +R  L +VW ND+E
Sbjct: 73  YNVD-FIPHKDIALPQVLQPFALHEGGNDIDNRVILGFWAGHRNSKIRVILARVWENDTE 131

Query: 196 IYAHSGRL-----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLP 250
           +   + R+     +  Y  +   +KFC+   G +VN+ARI+DS++YGCVPVI++++YDLP
Sbjct: 132 LAISNNRISRAIGELVYQKQFYRTKFCICPGGSQVNSARISDSIHYGCVPVILSDYYDLP 191

Query: 251 FADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAF 310
           F D L+W+ F++++   D+  LK ILK IS EE++ L   +++V+KHF WH  P  YDAF
Sbjct: 192 FNDALDWRKFAVILRERDVYQLKNILKSISQEEFISLHKSLVQVQKHFVWHSPPVSYDAF 251

Query: 311 YMVMYELWLRRSSVR 325
           +MVMYELWLR + ++
Sbjct: 252 HMVMYELWLRHNVIK 266


>gi|359496494|ref|XP_002269459.2| PREDICTED: probable glycosyltransferase At5g03795-like [Vitis
           vinifera]
          Length = 554

 Score =  218 bits (554), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 127/335 (37%), Positives = 187/335 (55%), Gaps = 22/335 (6%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKS-HFVTKDPSKADLF 59
           M    +VY+YP      F    L      RG YASE +F K   ++  FVT+DP KA LF
Sbjct: 199 METILKVYIYPDGARPIFHAPHL------RGIYASEGWFMKLMEENRQFVTRDPKKAHLF 252

Query: 60  FLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
           +LP+S  ++     +        +  F+  ++  I+ KYP+WNRT G+DHF VACH  G 
Sbjct: 253 YLPYSARQLETALYVPNSHNIRPLSIFLRDHVNMIAAKYPFWNRTHGSDHFLVACHDWGP 312

Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS---SKRNK 170
             + +  E+  N I+ +C++       +A KDVSLP+   R    P   +G    S+R  
Sbjct: 313 YTVNEHQELSRNTIKALCNADLSEGIFVAGKDVSLPETTIRNPRRPLRNVGGRRVSQRPI 372

Query: 171 LAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL------KTPYADELLGSKFCLHVKGFE 224
           LAFFAG ++  VR  LL+ W +  E     G L      K  Y   +  S+FC+   G+E
Sbjct: 373 LAFFAGNMHGRVRPTLLKYWSDKDEDMRIYGPLPNRISRKMSYIQHMKSSRFCICPMGYE 432

Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
           VN+ RI +++YY CVPVIIA+++  P  DVL+W +FS++VA  DIP LK+IL  I    Y
Sbjct: 433 VNSPRIVEAIYYECVPVIIADNFVPPLNDVLDWTAFSVIVAEKDIPKLKEILLAIPLRRY 492

Query: 285 LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWL 319
           L++Q+ V  V+KHF W+  P  YD F+M+++ +W+
Sbjct: 493 LVMQTNVKMVQKHFLWNPKPVRYDLFHMILHSIWV 527


>gi|356511373|ref|XP_003524401.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
           max]
          Length = 643

 Score =  218 bits (554), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 131/337 (38%), Positives = 193/337 (57%), Gaps = 26/337 (7%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKS-HFVTKDPSKADLF 59
           M R+ +VY+Y       F   ++      +G YASE +F K   ++ HFV KDP+KA LF
Sbjct: 310 MERTLKVYIYKDGNKPIFHQPIM------KGLYASEGWFMKLMEENKHFVLKDPAKAHLF 363

Query: 60  FLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
           ++PFS   + H   +        +  F+  Y   IS KY Y+NRTGGADHF VACH    
Sbjct: 364 YMPFSSRMLEHALYVRNSHNRTNLRQFLKDYTDKISAKYRYFNRTGGADHFLVACHDWAP 423

Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQ-EDPPK-LGSS---KRNK 170
              +   E     I+ +C++     G    +DVSLP+ + R   DP + LG     +R  
Sbjct: 424 YETRHHMEY---CIKALCNAD-VTQGFKIGRDVSLPEAYVRSVRDPQRDLGGKPPHQRPI 479

Query: 171 LAFFAGAVNSPVREKLLQVWRN---DSEIYA---HSGRLKTPYADELLGSKFCLHVKGFE 224
           LAF+AG ++  +R  LL+ W++   D +IY    H    K  Y + +  SK+C+  KG+E
Sbjct: 480 LAFYAGNMHGYLRPILLKHWKDKDPDMKIYGPMPHGAASKMNYINHMKNSKYCICPKGYE 539

Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
           VN+ R+ ++++Y CVPVII++++  PF +VLNW +FSI++A  DIP LK+IL  +S E+Y
Sbjct: 540 VNSPRVVEAIFYECVPVIISDNFVPPFFEVLNWDAFSIILAEKDIPNLKQILLSVSQEKY 599

Query: 285 LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
           L LQ  V K +KHF WHV P  YD F+M ++ +W  R
Sbjct: 600 LKLQLGVRKAQKHFFWHVKPLKYDLFHMTLHSIWYNR 636


>gi|357476811|ref|XP_003608691.1| hypothetical protein MTR_4g100730 [Medicago truncatula]
 gi|355509746|gb|AES90888.1| hypothetical protein MTR_4g100730 [Medicago truncatula]
          Length = 535

 Score =  216 bits (551), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 133/345 (38%), Positives = 193/345 (55%), Gaps = 37/345 (10%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKS-HFVTKDPSKADLF 59
           M    +VY+Y       F N  L      +G YASE +F K   ++  FVTKDP +A LF
Sbjct: 200 METVLKVYIYRDGSRPIFHNPSL------KGIYASEGWFMKLMQENKQFVTKDPERAHLF 253

Query: 60  FLPFSIARMR--------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
           +LP+S  +M         HD     + +  F+  Y+  I+ KYP+WNRT G+DHF VACH
Sbjct: 254 YLPYSARQMEVTLYVPGSHD----LKPLSIFLRDYVNKIAAKYPFWNRTHGSDHFLVACH 309

Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPK--LGSSK-- 167
             G   + +  E+  N ++ +C++       I  +DVSLP+   R    P   LG ++  
Sbjct: 310 DWGPYTVTEHEELARNTLKALCNADLSERIFIEGRDVSLPETTIRAPRRPLRYLGGNRAS 369

Query: 168 -RNKLAFFAGAVNSPVREKLLQVWRN----DSEIYAHSGRL------KTPYADELLGSKF 216
            R  LAFFAG+++  VR  LL+ W      D +IY    RL      K  Y   +  SK+
Sbjct: 370 LRPILAFFAGSMHGRVRPTLLKYWGGEKYEDMKIYK---RLPLRVSKKMTYIQHMKSSKY 426

Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKIL 276
           CL   GFEVN+ RI +++YY CVPVIIA+++ LP ++VL+W +FS+VVA  DIP LK IL
Sbjct: 427 CLCPMGFEVNSPRIVEAIYYECVPVIIADNFVLPLSEVLDWSAFSVVVAEKDIPRLKDIL 486

Query: 277 KGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
             I   +Y+ +Q+ V  V+KHF W+  P  YD F+M+++ +WL +
Sbjct: 487 LSIPMRKYVAMQNNVKMVQKHFLWNPKPIRYDLFHMILHSIWLNK 531


>gi|222618931|gb|EEE55063.1| hypothetical protein OsJ_02778 [Oryza sativa Japonica Group]
          Length = 482

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 126/335 (37%), Positives = 184/335 (54%), Gaps = 34/335 (10%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH-FVTKDPSKADLF 59
           M R  +VY+Y   R   F    L       G YASE +F K   +S  F   DP+KA LF
Sbjct: 160 MERILKVYIYQDGRRPIFHTPPL------SGIYASEGWFMKLLKESRRFAVTDPAKAHLF 213

Query: 60  FLPFSIARMR--------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
           +LP+S  ++R        H+ R     + DF+      ++ KYP+WNRT GADHF VACH
Sbjct: 214 YLPYSSQQLRISLYVPDSHNLRPLAAYLRDFVK----GLAAKYPFWNRTRGADHFLVACH 269

Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP-----KLGSS 166
             G        +++ N ++ +C++          +DVSLP+   R    P      L  S
Sbjct: 270 DWGSYTTTAHGDLRRNTVKALCNADSSEGIFTPGRDVSLPETTIRTPRRPLRYVGGLPVS 329

Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVN 226
           +R  LAFFAG V+  VR  LL+ W          G  +      +  S+FCL   G+EVN
Sbjct: 330 RRGILAFFAGNVHGRVRPVLLKHW----------GDGRDDDMRHMKNSRFCLCPMGYEVN 379

Query: 227 TARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLL 286
           + RI ++LYY CVPVIIA+++ LP +DVL+W +F++VVA  D+P LKKIL+GI+  +Y+ 
Sbjct: 380 SPRIVEALYYECVPVIIADNFVLPLSDVLDWSAFAVVVAEKDVPDLKKILQGITLRKYVA 439

Query: 287 LQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
           +   V ++++HF WH  P  YD F+M+++ +WL R
Sbjct: 440 MHGCVKRLQRHFLWHARPLRYDLFHMILHSIWLSR 474


>gi|168033546|ref|XP_001769276.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162679541|gb|EDQ65988.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 408

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 130/348 (37%), Positives = 194/348 (55%), Gaps = 35/348 (10%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH--FVTKDPSKADL 58
           MN++F++YVY     D F  ++     +  G YA+E  F K    S+  F   +PSKA +
Sbjct: 61  MNKTFKIYVY----RDGFKPLVHGA--KTGGIYATEGLFLKRMEDSNNRFTVSEPSKAHM 114

Query: 59  FFLPFSIARM---------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVA 109
           F LP+S+ +M         R  R + T     FIS+Y+  ++ KYPYWNRT GADHF+V+
Sbjct: 115 FLLPYSVRQMVDILQDPYSRSMRPLKT-----FISNYVDTLASKYPYWNRTHGADHFFVS 169

Query: 110 CHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQE----DPPKLGS 165
           CH     +     E+  N+++VVC++   ++  I  KDVS+PQ          D   LG 
Sbjct: 170 CHDWAPLSTMLHGELHTNSMKVVCNADLTVNFDI-EKDVSIPQTLKGGNQSDLDVGSLGP 228

Query: 166 SKRNKLAFFAGAVNSPVREKLLQVWRNDS------EIYAHSGRLKTPYADELLGSKFCLH 219
            +R+ LAF+AG ++  VR  LL  W+         E+      +   YA  +  S++CL 
Sbjct: 229 EERDFLAFYAGQMHGTVRPVLLDYWKGKDPTMKVYEVLPSDIAVNISYAQHMKRSRYCLC 288

Query: 220 VKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGI 279
            KGFEVN+ RI +++  GCVPVIIA+++ LP+ DVL+W  FS+ V   DIP LKKIL  I
Sbjct: 289 PKGFEVNSPRIVEAILSGCVPVIIADNFVLPYNDVLDWTKFSVTVPEEDIPDLKKILSSI 348

Query: 280 SSEEYLLLQSYVLKVRKHFQWHVFPSD--YDAFYMVMYELWLRRSSVR 325
           S+  Y  +Q  +  +R+HF W   P D  YD+F+M +Y +W +  ++R
Sbjct: 349 SNVTYRSMQRRLRYIRRHFLWLEDPEDTQYDSFHMTLYSIWRQSMNLR 396


>gi|449482564|ref|XP_004156325.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
           sativus]
          Length = 516

 Score =  216 bits (549), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 132/330 (40%), Positives = 189/330 (57%), Gaps = 20/330 (6%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAF-MKSHFVTKDPSKADLF 59
           M +  +++VY       F N         +  Y++E  F  A  M S F TKDP+KA +F
Sbjct: 186 MEKEMKIFVYEEGEPPLFHNGPC------KSIYSTEGNFIHAIEMDSQFRTKDPNKAHVF 239

Query: 60  FLPFSIARMR-----HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
           FLP S+A +      HD    T  I   +  YI  I  KYP+WNR+ GADHF ++CH  G
Sbjct: 240 FLPLSVAMLVRFVYVHDSHDFTP-IRHTVVDYINVIGTKYPFWNRSLGADHFMLSCHDWG 298

Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPK-LGS---SKRNK 170
             A +    +  N+I+V+C+++    G    KDVS P+I  +       LG    S R  
Sbjct: 299 PEASKSVPNLYKNSIRVLCNANT-SEGFNPSKDVSFPEINLQTGHLTGFLGGPSPSHRPI 357

Query: 171 LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEVNTA 228
           LAFFAG ++ P+R  L+Q W N D +I  H    K   Y D +  SKFCL   G+EV + 
Sbjct: 358 LAFFAGGLHGPIRPILIQQWENQDQDIQVHQYLPKGVSYIDMMRKSKFCLCPSGYEVASP 417

Query: 229 RIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
           RI +++Y GCVPV+I++HY  PF+DV+NWKSFS+ V+  DIP LK IL GIS+ +YL + 
Sbjct: 418 RIVEAIYTGCVPVLISDHYVPPFSDVINWKSFSVEVSVDDIPNLKTILTGISTRQYLRMY 477

Query: 289 SYVLKVRKHFQWHVFPSDYDAFYMVMYELW 318
             V+KVR+HF+ +  P  YD ++M+++ +W
Sbjct: 478 RRVVKVRRHFEVNSPPKRYDVYHMILHSVW 507


>gi|115438799|ref|NP_001043679.1| Os01g0640600 [Oryza sativa Japonica Group]
 gi|55297176|dbj|BAD68851.1| pectin-glucuronyltransferase-like [Oryza sativa Japonica Group]
 gi|113533210|dbj|BAF05593.1| Os01g0640600 [Oryza sativa Japonica Group]
          Length = 501

 Score =  215 bits (548), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 130/345 (37%), Positives = 190/345 (55%), Gaps = 35/345 (10%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH-FVTKDPSKADLF 59
           M R  +VY+Y   R   F    L       G YASE +F K   +S  F   DP+KA LF
Sbjct: 160 MERILKVYIYQDGRRPIFHTPPL------SGIYASEGWFMKLLKESRRFAVTDPAKAHLF 213

Query: 60  FLPFSIARMR--------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
           +LP+S  ++R        H+ R     + DF+      ++ KYP+WNRT GADHF VACH
Sbjct: 214 YLPYSSQQLRISLYVPDSHNLRPLAAYLRDFVK----GLAAKYPFWNRTRGADHFLVACH 269

Query: 112 S--IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP-----KLG 164
               G        +++ N ++ +C++          +DVSLP+   R    P      L 
Sbjct: 270 DWLQGSYTTTAHGDLRRNTVKALCNADSSEGIFTPGRDVSLPETTIRTPRRPLRYVGGLP 329

Query: 165 SSKRNKLAFFAGAVNSPVREKLLQVW----RNDSEIY----AHSGRLKTPYADELLGSKF 216
            S+R  LAFFAG V+  VR  LL+ W     +D  +Y    A   R +  Y   +  S+F
Sbjct: 330 VSRRGILAFFAGNVHGRVRPVLLKHWGDGRDDDMRVYGPLPARVSR-RMSYIQHMKNSRF 388

Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKIL 276
           CL   G+EVN+ RI ++LYY CVPVIIA+++ LP +DVL+W +F++VVA  D+P LKKIL
Sbjct: 389 CLCPMGYEVNSPRIVEALYYECVPVIIADNFVLPLSDVLDWSAFAVVVAEKDVPDLKKIL 448

Query: 277 KGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
           +GI+  +Y+ +   V ++++HF WH  P  YD F+M+++ +WL R
Sbjct: 449 QGITLRKYVAMHGCVKRLQRHFLWHARPLRYDLFHMILHSIWLSR 493


>gi|356563757|ref|XP_003550126.1| PREDICTED: probable glycosyltransferase At5g03795 [Glycine max]
          Length = 645

 Score =  215 bits (548), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 127/340 (37%), Positives = 188/340 (55%), Gaps = 33/340 (9%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEP--RGNYASESYFKKAF-MKSHFVTKDPSKAD 57
           M ++ +VYVY             P+   P   G YASE +F K       F+T+DP+KA 
Sbjct: 313 MEQTLKVYVYREGAR--------PIMHSPFFTGLYASEGWFMKQMEANKRFLTRDPNKAH 364

Query: 58  LFFLPFSIARMRHDRRIGT----EGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH-- 111
           LF+LPFS   +     +      + +  ++ +Y+  I+ KY +WNRTGGADHF V CH  
Sbjct: 365 LFYLPFSSRMLEETLYVQNSHNHKNLVQYLHNYVEMIAGKYTFWNRTGGADHFLVGCHDW 424

Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKL-----GSS 166
           + G + +  A     N I+ +C++     G +  KD SLP+ + R    P        +S
Sbjct: 425 APGETKVDMA-----NCIRSLCNAD-VKEGFVFGKDASLPETYVRDAKIPTKDLSGNSAS 478

Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVK 221
           KR  LAFFAG+++  VR  LLQ W N        GRL        Y   +  SK+C+  K
Sbjct: 479 KRTTLAFFAGSMHGYVRPILLQHWENKDPDMKIFGRLPKSKGNRNYIQYMKSSKYCICAK 538

Query: 222 GFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISS 281
           G+EVN+ R+ ++++Y CVPVII++++  PF +VLNW+SF+++V   DIP LK IL  I  
Sbjct: 539 GYEVNSPRVVEAIFYECVPVIISDNFVPPFLEVLNWESFAVIVLEKDIPNLKNILLSIPE 598

Query: 282 EEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
           ++YL LQ  V KV++HF WH  P  YD F+M+++ +W  R
Sbjct: 599 KQYLRLQMRVKKVQQHFLWHKNPVKYDIFHMILHSVWYNR 638


>gi|224092294|ref|XP_002309547.1| predicted protein [Populus trichocarpa]
 gi|222855523|gb|EEE93070.1| predicted protein [Populus trichocarpa]
          Length = 471

 Score =  215 bits (547), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 134/341 (39%), Positives = 195/341 (57%), Gaps = 33/341 (9%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFM-KSHFVTKDPSKADLF 59
           M    +VY+Y       F   LL       G YASE +F K       FVTKD  KA LF
Sbjct: 137 MEDILKVYIYKEGEMPIFHQPLL------NGIYASEGWFMKLLEGNKKFVTKDSKKAHLF 190

Query: 60  FLPFSI----ARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
           +LPFS      R+        + + +++  Y+  IS+KYP+WNRT GADHF  ACH    
Sbjct: 191 YLPFSSRYLEIRLYVPNSHSHKNLIEYLKKYLDMISEKYPFWNRTQGADHFLAACHDWAP 250

Query: 116 SAMQKAWEVKLNAIQVVCSSSY---FISGHIAHKDVSLPQIWP-RQEDPPK-LG---SSK 167
           S  ++      N I+ +C+S     F+ G    KD SLP+ +   QE+P + LG   +SK
Sbjct: 251 SETRQHMA---NCIRALCNSDAKEDFVYG----KDASLPETYVLTQENPLRDLGGNRASK 303

Query: 168 RNKLAFFAGAVNSPVREKLLQVWRN---DSEIYAH----SGRLKTPYADELLGSKFCLHV 220
           R+ LAFFAG+++  +R  LLQ W N   D +I+       GR K  YA  +  SK+C+  
Sbjct: 304 RSILAFFAGSMHGYLRPILLQHWENKDPDMKIFGRLPKVKGRGKMNYARYMKSSKYCICA 363

Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
           KG+EVN+ R+ ++++Y CVPVII++++  PF +VLNW+SF++ V   DIP LKKIL  I 
Sbjct: 364 KGYEVNSPRVVEAIFYECVPVIISDNFVPPFLEVLNWESFAVFVLEKDIPNLKKILLSIP 423

Query: 281 SEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
           +++Y  +Q  V +V++HF WH  P  YD F+M+++ +W  R
Sbjct: 424 AKKYRRMQMRVKRVQQHFLWHARPVKYDVFHMILHSIWYNR 464


>gi|449451243|ref|XP_004143371.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
           sativus]
          Length = 516

 Score =  215 bits (547), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 131/330 (39%), Positives = 189/330 (57%), Gaps = 20/330 (6%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAF-MKSHFVTKDPSKADLF 59
           M +  +++VY       F N         +  Y++E  F  A  M S F TKDP+KA +F
Sbjct: 186 MEKEMKIFVYEEGEPPLFHNGPC------KSIYSTEGNFIHAIEMDSQFRTKDPNKAHVF 239

Query: 60  FLPFSIARMR-----HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
           FLP S+A +      HD    T  I   +  YI  I  KYP+WNR+ GADHF ++CH  G
Sbjct: 240 FLPLSVAMLVRFVYVHDSHDFTP-IRHTVVDYINVIGTKYPFWNRSLGADHFMLSCHDWG 298

Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPK-LGS---SKRNK 170
             A +    +  N+I+V+C+++    G    KDVS P+I  +       LG    S R  
Sbjct: 299 PEASKSVPNLYKNSIRVLCNANT-SEGFNPSKDVSFPEINLQTGHLTGFLGGPSPSHRPI 357

Query: 171 LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEVNTA 228
           +AFFAG ++ P+R  L+Q W N D +I  H    K   Y D +  SKFCL   G+EV + 
Sbjct: 358 MAFFAGGLHGPIRPILIQRWENQDQDIQVHQYLPKGVSYIDMMRKSKFCLCPSGYEVASP 417

Query: 229 RIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
           RI +++Y GCVPV+I++HY  PF+DV+NWKSFS+ V+  DIP LK IL GIS+ +YL + 
Sbjct: 418 RIVEAIYTGCVPVLISDHYVPPFSDVINWKSFSVEVSVDDIPNLKTILTGISTRQYLRMY 477

Query: 289 SYVLKVRKHFQWHVFPSDYDAFYMVMYELW 318
             V+KVR+HF+ +  P  YD ++M+++ +W
Sbjct: 478 RRVVKVRRHFEVNSPPKRYDVYHMILHSVW 507


>gi|357119191|ref|XP_003561329.1| PREDICTED: probable glycosyltransferase At5g03795-like
           [Brachypodium distachyon]
          Length = 569

 Score =  214 bits (546), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 130/342 (38%), Positives = 190/342 (55%), Gaps = 27/342 (7%)

Query: 1   MNRSFRVYVY-----PHRRNDPFANVLLPVDFEPRGNYASESYFKKAF-MKSHFVTKDPS 54
           M + F++YVY     P   + P  N+           Y++E  F  A  M++   T DP 
Sbjct: 233 MEKLFKIYVYDEGEPPIYHDGPCHNI-----------YSTEGRFIHAMEMENRMRTTDPG 281

Query: 55  KADLFFLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVAC 110
            A +FFLPFSIA+M     +      E +   +  YI  +S K+PYWNR+ GADHF ++C
Sbjct: 282 LAHVFFLPFSIAKMEKTIYVPGSHTMEPLRRTVFDYIDVLSTKHPYWNRSQGADHFMLSC 341

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLG---SSK 167
           H  G         +  N+I+V+C+++    G I  KDVSLP+I    +    +G   +S 
Sbjct: 342 HDWGPYVSSVDGNLFSNSIRVLCNANTS-EGFIPSKDVSLPEINHLNDFKKDIGGPSASG 400

Query: 168 RNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKT--PYADELLGSKFCLHVKGFEV 225
           R  LAFFAG  + PVR  LL+ W+        S  L     Y + +  SKFCL   GFEV
Sbjct: 401 RPILAFFAGGNHGPVRPLLLKHWKGKDPDVQVSEYLPAGVSYVETMRRSKFCLCPSGFEV 460

Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
            + R+A+++Y  CVPV+IA+ Y LPF+DVL+W +FS+ VA  DIP +K+IL  +S   Y+
Sbjct: 461 ASPRVAEAIYVECVPVVIADDYVLPFSDVLSWPAFSLRVAVRDIPDIKRILSAVSPRRYI 520

Query: 286 LLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
            +Q  V  VR+HF  +  P  YD F+M+++ +WLRR +VR+ 
Sbjct: 521 RMQRRVRAVRRHFMLNGVPQRYDVFHMILHSIWLRRLNVRIH 562


>gi|168029668|ref|XP_001767347.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|168029742|ref|XP_001767384.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162681411|gb|EDQ67838.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162681448|gb|EDQ67875.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 333

 Score =  214 bits (545), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 129/345 (37%), Positives = 197/345 (57%), Gaps = 28/345 (8%)

Query: 1   MNRSFRVYVY-----PHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH-FVTKDPS 54
           M R F+VYVY     P   N P   +           YA E  F +     + FVT DP 
Sbjct: 1   MERRFKVYVYSEGEEPLVHNGPCKEI-----------YAVEGRFIQELQGDNPFVTHDPD 49

Query: 55  KADLFFLPFSIARM---RHDRRIG-TEGIPDFISHYIFNISQKYPYWNRTGGADHFYVAC 110
            A ++FLPFS+A M    +++  G  + +  F+  Y+  +  KYP+WNR+GGADHF ++C
Sbjct: 50  NAHVYFLPFSVAMMVAYLYEKESGDMDPLRLFVGDYVDVLMHKYPFWNRSGGADHFMLSC 109

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQEDPPKLG---SS 166
           H  G    ++   +   +I+V+C+++    G++  KDVSLP+I       P +LG   + 
Sbjct: 110 HDWGPLITRENMNLGTRSIRVLCNANS-SEGYVPWKDVSLPEIHLVGGHIPAELGGPPAK 168

Query: 167 KRNKLAFFAGAVNSPVREKLLQVW--RNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFE 224
            R  LAFFAG  + PVR +L + W  ++D  I          Y + +  S++C+   G+E
Sbjct: 169 DRPHLAFFAGRDHGPVRPQLFKHWEGKDDDVIVYQWLPAHLKYHELMKTSRYCICPGGYE 228

Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
           VN+ RI +++Y  CVPVIIA+ + LPF+DVLNW+SFS+ V   DIP LK IL+ ++ E Y
Sbjct: 229 VNSPRIVEAIYNECVPVIIADSFILPFSDVLNWESFSLHVKESDIPNLKSILQNVTMETY 288

Query: 285 LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWS 329
             +Q  V +V++HF  H  P  YD F+M+++ +WLRR ++RV +S
Sbjct: 289 TSMQERVSQVQRHFVLHQPPKRYDVFHMILHSVWLRRLNLRVGFS 333


>gi|79326862|ref|NP_001031828.1| putative glycosyltransferase [Arabidopsis thaliana]
 gi|292630778|sp|Q9FFN2.2|GLYT3_ARATH RecName: Full=Probable glycosyltransferase At5g03795
 gi|332003272|gb|AED90655.1| putative glycosyltransferase [Arabidopsis thaliana]
          Length = 518

 Score =  214 bits (544), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 130/340 (38%), Positives = 193/340 (56%), Gaps = 22/340 (6%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASE-SYFKKAFMKSHFVTKDPSKADLF 59
           M + F++YVY       F       D   +  Y+ E S+  +    + F T +P KA +F
Sbjct: 186 MEKQFKIYVYKEGEPPLFH------DGPCKSIYSMEGSFIYEIETDTRFRTNNPDKAHVF 239

Query: 60  FLPFSIARM------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
           +LPFS+ +M      R+ R      I + +  YI  +  KYPYWNR+ GADHF ++CH  
Sbjct: 240 YLPFSVVKMVRYVYERNSRDFSP--IRNTVKDYINLVGDKYPYWNRSIGADHFILSCHDW 297

Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKL----GSSKRN 169
           G  A      +  N+I+ +C+++         KDVS+P+I  R      L      S R 
Sbjct: 298 GPEASFSHPHLGHNSIRALCNANTS-ERFKPRKDVSIPEINLRTGSLTGLVGGPSPSSRP 356

Query: 170 KLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEVNT 227
            LAFFAG V+ PVR  LLQ W N D++I  H    + T Y+D +  SKFC+   G+EV +
Sbjct: 357 ILAFFAGGVHGPVRPVLLQHWENKDNDIRVHKYLPRGTSYSDMMRNSKFCICPSGYEVAS 416

Query: 228 ARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLL 287
            RI ++LY GCVPV+I + Y  PF+DVLNW+SFS++V+  DIP LK IL  IS  +YL +
Sbjct: 417 PRIVEALYSGCVPVLINSGYVPPFSDVLNWRSFSVIVSVEDIPNLKTILTSISPRQYLRM 476

Query: 288 QSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
              VLKVR+HF+ +     +D F+M+++ +W+RR +V+++
Sbjct: 477 YRRVLKVRRHFEVNSPAKRFDVFHMILHSIWVRRLNVKIR 516


>gi|297808029|ref|XP_002871898.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297317735|gb|EFH48157.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 610

 Score =  214 bits (544), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 127/342 (37%), Positives = 191/342 (55%), Gaps = 36/342 (10%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFM-KSHFVTKDPSKADLF 59
           M R  +VYVY       F   +L      +G YASE +F K       +  KDP KA L+
Sbjct: 277 MERILKVYVYKEGNRPIFHTPIL------KGLYASEGWFMKLMEGNKQYTVKDPRKAHLY 330

Query: 60  FLPFSIARM---------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVAC 110
           ++PFS ARM          H+R      +  F+  Y  +IS KYP++NRT GADHF VAC
Sbjct: 331 YMPFS-ARMLEYTLYVRNSHNR----TNLRQFLKEYTEHISSKYPFFNRTDGADHFLVAC 385

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS--- 165
           H       +   E   + I+ +C++    +G    +D+SLP+ + R    P   LG    
Sbjct: 386 HDWAPYETRHHME---HCIKALCNAD-VTAGFKIGRDISLPETYVRAAKNPLRDLGGKPP 441

Query: 166 SKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL------KTPYADELLGSKFCLH 219
           S+R  LAF+AG+++  +R+ LLQ W++        GR+      K  Y +++  SK+C+ 
Sbjct: 442 SQRRTLAFYAGSMHGYLRQILLQHWKDKDPDMKIFGRMPFGVASKMNYIEQMKSSKYCIC 501

Query: 220 VKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGI 279
            KG+EVN+ R+ +S++Y CVPVII++++  PF +VL+W +FS++VA  DIP LK IL  I
Sbjct: 502 PKGYEVNSPRVVESIFYECVPVIISDNFVPPFFEVLDWSAFSVIVAEKDIPRLKDILSSI 561

Query: 280 SSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
             E+Y+ +Q  V K ++HF WH  P  YD F+MV++ +W  R
Sbjct: 562 PEEKYVKMQMAVRKAQRHFLWHAKPEKYDLFHMVLHSIWYNR 603


>gi|334186614|ref|NP_001190743.1| Exostosin family protein [Arabidopsis thaliana]
 gi|332658396|gb|AEE83796.1| Exostosin family protein [Arabidopsis thaliana]
          Length = 589

 Score =  214 bits (544), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 132/330 (40%), Positives = 191/330 (57%), Gaps = 29/330 (8%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEP--RGNYASESYFKKAFMKSH--FVTKDPSKA 56
           M    +VY+YP            P+  EP   G YASE +F K  M+S+  FVTK+P +A
Sbjct: 196 MELILKVYIYPDGDK--------PIFHEPHLNGIYASEGWFMK-LMESNKQFVTKNPERA 246

Query: 57  DLFFLPFSIARMRHDRRI-GTEGIPD---FISHYIFNISQKYPYWNRTGGADHFYVACHS 112
            LF++P+S+ +++    + G+  I     F+  Y+  +S KYP+WNRT G+DHF VACH 
Sbjct: 247 HLFYMPYSVKQLQKSIFVPGSHNIKPLSIFLRDYVNMLSIKYPFWNRTHGSDHFLVACHD 306

Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS----S 166
            G   + +  E+K NAI+ +C++       +  KDVSLP+   R    P   +G+    S
Sbjct: 307 WGPYTVNEHPELKRNAIKALCNADLSDGIFVPGKDVSLPETSIRNAGRPLRNIGNGNRVS 366

Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYA---HSGRLKTPYADELLGSKFCLHV 220
           +R  LAFFAG ++  VR KLL+ WRN   D +IY    H+   K  Y   +  SK+CL  
Sbjct: 367 QRPILAFFAGNLHGRVRPKLLKHWRNKDEDMKIYGPLPHNVARKMTYVQHMKSSKYCLCP 426

Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
            G+EVN+ RI +++YY CVPV+IA+++ LPF+DVL+W +FS+VV   +IP LK+IL  I 
Sbjct: 427 MGYEVNSPRIVEAIYYECVPVVIADNFMLPFSDVLDWSAFSVVVPEKEIPRLKEILLEIP 486

Query: 281 SEEYLLLQSYVLKVRKHFQWHVFPSDYDAF 310
              YL +QS V  V++HF W   P     F
Sbjct: 487 MRRYLKMQSNVKMVQRHFLWSPKPRKIKPF 516


>gi|449461995|ref|XP_004148727.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
           sativus]
 gi|449501299|ref|XP_004161331.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
           sativus]
          Length = 664

 Score =  214 bits (544), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 127/341 (37%), Positives = 189/341 (55%), Gaps = 34/341 (9%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFM-KSHFVTKDPSKADLF 59
           M R+ ++YVY   +   F   +L      +G YASE +F K       FV KDP KA LF
Sbjct: 331 MERTLKIYVYRDGKKPIFHQPIL------KGLYASEGWFMKLMEGNKRFVVKDPRKAHLF 384

Query: 60  FLPFS--------IARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
           ++PFS          R  H+R      +  F+  Y  NI+ KYPYWNRTGGADHF   CH
Sbjct: 385 YMPFSSRMLEYTLYVRNSHNR----TNLRQFLKEYAENIAAKYPYWNRTGGADHFLAGCH 440

Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLG---SS 166
                  +   E   + I+ +C++   +   I  +DVSLP+ + R    P   LG   +S
Sbjct: 441 DWAPYETRHHME---HCIKALCNADVTVGFKIG-RDVSLPETYVRSARNPLRDLGGKPAS 496

Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL------KTPYADELLGSKFCLHV 220
           +R+ LAF+AG ++  VR  LL+ W++ +      G +      K  Y   +  SK+C+  
Sbjct: 497 QRHILAFYAGNMHGYVRPILLKYWKDKNPDMKIFGPMPPGVASKMNYIQHMKSSKYCICP 556

Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
           KG+EVN+ R+ ++++Y CVPVII++++  PF +VL+W++FS++VA  DIP L+ IL  I 
Sbjct: 557 KGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLDWEAFSVIVAEKDIPNLQDILLSIP 616

Query: 281 SEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
            + YL +Q  V KV+KHF WH  P  YD F+M ++ +W  R
Sbjct: 617 KDRYLEMQLRVRKVQKHFLWHAKPLKYDLFHMTLHSIWYNR 657


>gi|9758008|dbj|BAB08605.1| unnamed protein product [Arabidopsis thaliana]
          Length = 408

 Score =  213 bits (543), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 130/340 (38%), Positives = 193/340 (56%), Gaps = 22/340 (6%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASE-SYFKKAFMKSHFVTKDPSKADLF 59
           M + F++YVY       F       D   +  Y+ E S+  +    + F T +P KA +F
Sbjct: 76  MEKQFKIYVYKEGEPPLFH------DGPCKSIYSMEGSFIYEIETDTRFRTNNPDKAHVF 129

Query: 60  FLPFSIARM------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
           +LPFS+ +M      R+ R      I + +  YI  +  KYPYWNR+ GADHF ++CH  
Sbjct: 130 YLPFSVVKMVRYVYERNSRDFSP--IRNTVKDYINLVGDKYPYWNRSIGADHFILSCHDW 187

Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKL----GSSKRN 169
           G  A      +  N+I+ +C+++         KDVS+P+I  R      L      S R 
Sbjct: 188 GPEASFSHPHLGHNSIRALCNANTS-ERFKPRKDVSIPEINLRTGSLTGLVGGPSPSSRP 246

Query: 170 KLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEVNT 227
            LAFFAG V+ PVR  LLQ W N D++I  H    + T Y+D +  SKFC+   G+EV +
Sbjct: 247 ILAFFAGGVHGPVRPVLLQHWENKDNDIRVHKYLPRGTSYSDMMRNSKFCICPSGYEVAS 306

Query: 228 ARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLL 287
            RI ++LY GCVPV+I + Y  PF+DVLNW+SFS++V+  DIP LK IL  IS  +YL +
Sbjct: 307 PRIVEALYSGCVPVLINSGYVPPFSDVLNWRSFSVIVSVEDIPNLKTILTSISPRQYLRM 366

Query: 288 QSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
              VLKVR+HF+ +     +D F+M+++ +W+RR +V+++
Sbjct: 367 YRRVLKVRRHFEVNSPAKRFDVFHMILHSIWVRRLNVKIR 406


>gi|357440259|ref|XP_003590407.1| hypothetical protein MTR_1g061760 [Medicago truncatula]
 gi|355479455|gb|AES60658.1| hypothetical protein MTR_1g061760 [Medicago truncatula]
          Length = 508

 Score =  213 bits (543), Expect = 8e-53,   Method: Compositional matrix adjust.
 Identities = 129/340 (37%), Positives = 190/340 (55%), Gaps = 21/340 (6%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M + F+V+VY       F N      +   GN+          +   F T+DP KA ++F
Sbjct: 174 MEKQFKVFVYEEGEPPVFHNGPCKSIYSMEGNF-----IHAIELNDQFRTRDPQKAHVYF 228

Query: 61  LPFSIARM------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
           LPFS+  +      R  R  G   I   ++ YI  I+ KYPYWNR+ GADHF +ACH  G
Sbjct: 229 LPFSVVMLVRFVYLRDSRDFGP--IRKTVTDYINVIAGKYPYWNRSLGADHFMLACHDWG 286

Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDP----PKLGSSKRNK 170
                    +  N+I+V+C+++     + A KDVS P+I  +          L +SKR  
Sbjct: 287 PETSFSVPYLHKNSIRVLCNANTSERFNPA-KDVSFPEINLQTGSINGFLGGLSASKRPI 345

Query: 171 LAFFAGAVNSPVREKLLQVWRN--DSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEVNT 227
           LAFFAG ++  +R  LL+ W N  D ++       K   Y + L  SKFCL   G+EV +
Sbjct: 346 LAFFAGGLHGHIRAILLEHWENNKDQDMMIQKYLPKGVSYYEMLRKSKFCLCPSGYEVAS 405

Query: 228 ARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLL 287
            RI +++Y GCVPV+I++HY  PF+DVLNWKSFS+ ++  DIP LK IL  IS  +Y+ +
Sbjct: 406 PRIVEAIYTGCVPVLISDHYVPPFSDVLNWKSFSVEISVEDIPKLKDILMRISPTQYIRM 465

Query: 288 QSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
           Q  V+++R+HF+ H  P  +D F+M+++ +WLRR + RV 
Sbjct: 466 QRRVVQIRRHFEVHSPPKRFDVFHMILHSVWLRRLNFRVH 505


>gi|297810461|ref|XP_002873114.1| hypothetical protein ARALYDRAFT_487158 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297318951|gb|EFH49373.1| hypothetical protein ARALYDRAFT_487158 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 519

 Score =  213 bits (543), Expect = 9e-53,   Method: Compositional matrix adjust.
 Identities = 130/340 (38%), Positives = 192/340 (56%), Gaps = 22/340 (6%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASE-SYFKKAFMKSHFVTKDPSKADLF 59
           M + F++YVY       F       D   +  Y+ E S+  +    + F T +P KA  F
Sbjct: 187 MEKQFKIYVYKEGEPPLFH------DGPCKSIYSMEGSFIYEMETDTRFRTNNPDKAHAF 240

Query: 60  FLPFSIARM------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
           +LPFS+ +M      R+ R      I + +  YI  +  KYPYWNR+ GADHF ++CH  
Sbjct: 241 YLPFSVVKMVRYVYERNSRDFSP--IRNTVRDYINLVGDKYPYWNRSIGADHFILSCHDW 298

Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKL----GSSKRN 169
           G  A      +  N+I+ +C+++         KDVS+P+I  R      L      S R 
Sbjct: 299 GPEASFSHPHLGHNSIRALCNANTS-EKFKPRKDVSIPEINLRTGSLTGLVGGPSPSSRP 357

Query: 170 KLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEVNT 227
            LAFFAG V+ PVR  LL+ W N D++I  H    + T Y+D +  SKFC+   G+EV +
Sbjct: 358 ILAFFAGGVHGPVRPVLLEHWENKDNDIRVHKYLPRGTSYSDMMRNSKFCICPSGYEVAS 417

Query: 228 ARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLL 287
            RI ++LY GCVPV+I + Y  PF+DVLNW+SFS++V+  DIP LK IL  IS  +YL +
Sbjct: 418 PRIVEALYSGCVPVLINSGYVPPFSDVLNWRSFSVIVSVEDIPNLKTILTAISPRQYLRM 477

Query: 288 QSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
              VLKVR+HF+ +     +D F+M+++ +W+RR +VR++
Sbjct: 478 YRRVLKVRRHFEVNSPAKRFDVFHMILHSIWVRRLNVRIR 517


>gi|359484345|ref|XP_002280595.2| PREDICTED: probable glycosyltransferase At5g03795-like [Vitis
           vinifera]
 gi|297738776|emb|CBI28021.3| unnamed protein product [Vitis vinifera]
          Length = 665

 Score =  213 bits (541), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 129/340 (37%), Positives = 197/340 (57%), Gaps = 32/340 (9%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPR--GNYASESYFKKAFM-KSHFVTKDPSKAD 57
           M R  ++Y+Y             P+  +PR  G YASE +F K       FV +DP KA 
Sbjct: 332 MERVLKIYIYREGEK--------PIFHQPRLRGIYASEGWFMKLIEGNKRFVVRDPRKAH 383

Query: 58  LFFLPFSIARMR---HDRRIGT-EGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHS- 112
           LF++PFS   +R   +++   T   +  +  +Y+  I+ KY +WNRTGGADH  VACH  
Sbjct: 384 LFYVPFSSKMLRTVFYEQNSSTPRDLEKYFKNYVGLIAGKYRFWNRTGGADHLIVACHDW 443

Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQ-EDPPK-LGS---SK 167
             R   Q +W    N+I+ +C+S+   SG    KD +LP  + R+ EDP K LG    S+
Sbjct: 444 APRITRQCSW----NSIRALCNSN-IASGFKIGKDTTLPVTYIRKSEDPLKYLGGKPPSQ 498

Query: 168 RNKLAFFAGAVNSPVREKLLQVWRN---DSEIYAHSGR---LKTPYADELLGSKFCLHVK 221
           R  LAFFAG+++  +R  LLQ W N   D +I+    R    K+ Y D +  SK+C+  +
Sbjct: 499 RPILAFFAGSMHGYLRPILLQYWENKEQDIKIFGPMSRDDGGKSRYRDHMKSSKYCICAR 558

Query: 222 GFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISS 281
           G+EV+T R+ ++++Y CVPVII+++Y  PF ++LNW++F++ +   D+P L+ IL  I  
Sbjct: 559 GYEVHTPRVVEAIFYECVPVIISDNYVPPFFEILNWEAFAVFILEKDVPNLRNILLSIPE 618

Query: 282 EEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
           E+YL +Q  V  V++HF WH  P  YD F+M+++ +W  R
Sbjct: 619 EKYLQMQMRVKMVQQHFLWHKKPVKYDLFHMILHSVWYNR 658


>gi|225429942|ref|XP_002281263.1| PREDICTED: probable glycosyltransferase At5g03795 [Vitis vinifera]
          Length = 675

 Score =  212 bits (539), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 131/343 (38%), Positives = 189/343 (55%), Gaps = 38/343 (11%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEP--RGNYASESYFKKAFMKSH-FVTKDPSKAD 57
           M  + +VY Y             PV  +P  +G YASE +F K    +  FVTK+  KA 
Sbjct: 336 MENTLKVYTYREGER--------PVFHQPPIKGIYASEGWFMKLMQANKKFVTKNGRKAH 387

Query: 58  LFFLPFSIARMR--------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVA 109
           LF+LPFS   +         H R+     +  ++ +Y+  I  KYP+WNRTGGADHF VA
Sbjct: 388 LFYLPFSSLMLEEALYVPNSHSRK----NLEQYLKNYLDMIGAKYPFWNRTGGADHFLVA 443

Query: 110 CHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS-- 165
           CH    S   K   +  N+I+ +C+S     G    KDVSLP+   R    P  +LG   
Sbjct: 444 CHDWAPSETLK---LMANSIRALCNSD-IREGFKLGKDVSLPETCVRIPQNPLRQLGGKP 499

Query: 166 -SKRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYAHSGRLKTP---YADELLGSKFCL 218
            S+R  LAFFAG+++  VR  LL+ W N   D +IY    + K     Y   +  SK+C+
Sbjct: 500 PSQRRILAFFAGSMHGYVRPILLKYWENKDPDMKIYGRMPKAKKGTMNYIQHMKSSKYCI 559

Query: 219 HVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKG 278
             KG+EVN+ R+ ++++Y CVPVII++++  PF  VLNW+SF++ +   DIP LK IL  
Sbjct: 560 CAKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFGVLNWESFAVFILEKDIPNLKSILLS 619

Query: 279 ISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
           I  + YL +Q  V +V++HF WH  P  YD F+M+++ +W  R
Sbjct: 620 IPEKSYLEIQMRVKQVQQHFLWHAKPVKYDVFHMILHSVWYNR 662


>gi|224071129|ref|XP_002303362.1| predicted protein [Populus trichocarpa]
 gi|222840794|gb|EEE78341.1| predicted protein [Populus trichocarpa]
          Length = 386

 Score =  212 bits (539), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 130/345 (37%), Positives = 188/345 (54%), Gaps = 37/345 (10%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPR--GNYASESYFKKAFMKS--HFVTKDPSKA 56
           M    +VY+YP            P+  +P   G YASE +F K FM++   FV++DP KA
Sbjct: 46  METILKVYIYPDGDK--------PIFHQPHLYGIYASEGWFMK-FMEASREFVSRDPEKA 96

Query: 57  DLFFLPFSIARMR--------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYV 108
            LF+LP+S  ++         H+ R     +  F+  Y   I+ KYPYWNRT G DHF V
Sbjct: 97  HLFYLPYSARQLEVAVYVPNSHNLR----PLSIFMRDYANMIAAKYPYWNRTHGRDHFLV 152

Query: 109 ACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP------K 162
           ACH  G  A+    E+  N ++ +C++        A +DVSLP+   R    P       
Sbjct: 153 ACHDWGPYALTMHEELTKNTMKALCNADVSEGIFTAGQDVSLPETTIRSPKRPLRNVGGG 212

Query: 163 LGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL------KTPYADELLGSKF 216
           +  S+R  LAFFAG ++  VR  LL+ W N  +     G L      K  Y   +  SK+
Sbjct: 213 IRVSQRPILAFFAGNLHGRVRPTLLKYWHNKDDDMKIYGPLPIGISRKMTYVQHMKSSKY 272

Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKIL 276
           C+   G+EVN+ RI +++YY CVPVIIA+++ LPF +VL+W +FS+VVA  DIP LK+IL
Sbjct: 273 CICPMGYEVNSPRIVEAIYYECVPVIIADNFVLPFNEVLDWSAFSVVVAEKDIPKLKEIL 332

Query: 277 KGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
             I    YL + + +  V+KHF W+  P  YD F+M+++ +W  R
Sbjct: 333 LAIPLRRYLTMLANLKTVQKHFLWNPRPLRYDLFHMILHSIWFSR 377


>gi|334187775|ref|NP_197468.2| Exostosin family protein [Arabidopsis thaliana]
 gi|332005353|gb|AED92736.1| Exostosin family protein [Arabidopsis thaliana]
          Length = 610

 Score =  211 bits (538), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 126/342 (36%), Positives = 191/342 (55%), Gaps = 36/342 (10%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFM-KSHFVTKDPSKADLF 59
           M R  +VYVY       F   +L      +G YASE +F K       +  KDP KA L+
Sbjct: 277 MERILKVYVYKEGNRPIFHTPIL------KGLYASEGWFMKLMEGNKQYTVKDPRKAHLY 330

Query: 60  FLPFSIARM---------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVAC 110
           ++PFS ARM          H+R      +  F+  Y  +IS KYP++NRT GADHF VAC
Sbjct: 331 YMPFS-ARMLEYTLYVRNSHNR----TNLRQFLKEYTEHISSKYPFFNRTDGADHFLVAC 385

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS--- 165
           H       +   E   + I+ +C++    +G    +D+SLP+ + R    P   LG    
Sbjct: 386 HDWAPYETRHHME---HCIKALCNAD-VTAGFKIGRDISLPETYVRAAKNPLRDLGGKPP 441

Query: 166 SKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL------KTPYADELLGSKFCLH 219
           S+R  LAF+AG+++  +R+ LLQ W++        GR+      K  Y +++  SK+C+ 
Sbjct: 442 SQRRTLAFYAGSMHGYLRQILLQHWKDKDPDMKIFGRMPFGVASKMNYIEQMKSSKYCIC 501

Query: 220 VKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGI 279
            KG+EVN+ R+ +S++Y CVPVII++++  PF +VL+W +FS++VA  DIP LK IL  I
Sbjct: 502 PKGYEVNSPRVVESIFYECVPVIISDNFVPPFFEVLDWSAFSVIVAEKDIPRLKDILLSI 561

Query: 280 SSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
             ++Y+ +Q  V K ++HF WH  P  YD F+MV++ +W  R
Sbjct: 562 PEDKYVKMQMAVRKAQRHFLWHAKPEKYDLFHMVLHSIWYNR 603


>gi|255579651|ref|XP_002530666.1| catalytic, putative [Ricinus communis]
 gi|223529799|gb|EEF31735.1| catalytic, putative [Ricinus communis]
          Length = 528

 Score =  211 bits (537), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 129/339 (38%), Positives = 185/339 (54%), Gaps = 25/339 (7%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH--FVTKDPSKADL 58
           M    +VY+YP  +   F    L       G YASE +F K FM+ +  FVT+DP KA L
Sbjct: 188 MELILKVYIYPDGKRPIFHVPHL------NGIYASEGWFMK-FMEENRQFVTRDPEKAHL 240

Query: 59  FFLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
           F+LP+S  +++    +        +  F+  Y   I+ KYP+WNRT G DHF VACH  G
Sbjct: 241 FYLPYSARQLQMALYVPNSHNLRPLSIFMRDYANMIATKYPFWNRTHGRDHFLVACHDWG 300

Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP------KLGSSKR 168
              +    E+  N I+ +C++          KDVSLP+   R    P       +  S+R
Sbjct: 301 PYTLTMHEELTKNTIKALCNADASEGIFDPTKDVSLPETTIRIPRRPLKNVGGGIRVSQR 360

Query: 169 NKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL------KTPYADELLGSKFCLHVKG 222
             LAFFAG ++  VR  LLQ W+N  E     G L      K  Y   +  S++C+   G
Sbjct: 361 PILAFFAGNMHGRVRPTLLQYWQNKDEDLKIYGPLPARISRKMNYVQHMKSSRYCICPMG 420

Query: 223 FEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSE 282
            EVN+ RI +++YY CVPVIIA+++ LPF+DVL+W +FSIVVA  DIP LK+IL  I   
Sbjct: 421 HEVNSPRIVEAIYYECVPVIIADNFVLPFSDVLDWSAFSIVVAEKDIPKLKEILLAIPLR 480

Query: 283 EYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
            YL + + +  +++HF W+  P  YD F+M+++ +W  R
Sbjct: 481 RYLTMLTNLKMLQRHFLWNPRPLRYDLFHMILHSIWFSR 519


>gi|359484343|ref|XP_002283936.2| PREDICTED: uncharacterized protein LOC100268163 [Vitis vinifera]
          Length = 738

 Score =  211 bits (536), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 129/341 (37%), Positives = 190/341 (55%), Gaps = 34/341 (9%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKS-HFVTKDPSKADLF 59
           M R  +VYVY       F   +L      +G YASE +F K   ++ HFV KDP +A LF
Sbjct: 405 MERILKVYVYKDGEKPIFHQPIL------KGLYASEGWFMKLMERNKHFVVKDPRQAQLF 458

Query: 60  FLPFS--------IARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
           ++PFS          R  H+R      +  ++  Y   I+ KY +WNRTGGADHF VACH
Sbjct: 459 YMPFSSRMLEYKLYVRNSHNR----TNLRQYLKQYSEKIAAKYRFWNRTGGADHFLVACH 514

Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS---S 166
                  +   E     I+ +C++    +G    +DVSLP+ + R    P   LG    S
Sbjct: 515 DWAPYETRHHME---QCIKALCNAD-VTAGFKIGRDVSLPETYVRSARNPLRDLGGKPPS 570

Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYAH---SGRLKTPYADELLGSKFCLHV 220
           +R+ LAF+AG ++  +R  LL+ W++   D +IY         K  Y   +  SKFC+  
Sbjct: 571 ERHILAFYAGNMHGYLRPILLKYWKDKDPDMKIYGPMPPGVASKMNYIQHMKSSKFCICP 630

Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
           KG+EVN+ R+ ++++Y CVPVII++++  PF DVL+W +FSI++A  DIP LK +L  I 
Sbjct: 631 KGYEVNSPRVVEAIFYECVPVIISDNFVPPFFDVLDWGAFSIILAEKDIPNLKDVLLSIP 690

Query: 281 SEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
           +++YL +Q  V KV+KHF WH  P  YD F+M ++ +W  R
Sbjct: 691 NDKYLQMQLGVRKVQKHFLWHAKPLKYDLFHMTLHSIWYNR 731


>gi|357130589|ref|XP_003566930.1| PREDICTED: probable glycosyltransferase At5g03795-like
           [Brachypodium distachyon]
          Length = 510

 Score =  209 bits (533), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 132/349 (37%), Positives = 187/349 (53%), Gaps = 38/349 (10%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH-FVTKDPSKADLF 59
           M R  +VY+Y   R   F    L       G YASE +F K   +S  FV  D +KA LF
Sbjct: 164 MERILKVYIYQDGRRPIFHTPPL------SGIYASEGWFMKLLKESRRFVVADGAKAHLF 217

Query: 60  FLPFSIARMR--------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
           +LP+S   +R        H+ R     + DF+      ++ KYP+WNR  GADHF VACH
Sbjct: 218 YLPYSSQHLRLSLYVPDSHNLRPLAVYLRDFVQ----GLAAKYPFWNRNRGADHFLVACH 273

Query: 112 S-------IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--- 161
                    G        +++ N+I+ +C++          KDVSLP+   R    P   
Sbjct: 274 DWRKFYYLQGPYTTTAHRDLRRNSIKALCNADSSERIFSPGKDVSLPETTIRTPKRPLRY 333

Query: 162 --KLGSSKRNKLAFFAGAVNSPVREKLLQVW----RNDSEIYA---HSGRLKTPYADELL 212
              L  S+R  LAFFAG V+  VR  LL+ W     +D  +Y    +    +  Y   + 
Sbjct: 334 VGGLPVSRRRILAFFAGNVHGRVRPVLLKHWGDGRDDDMRVYGPLPNRVSRQMSYIQHMK 393

Query: 213 GSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLL 272
            S+FCL   G EVN+ RI ++LYY CVPV+IA+++ LPF+DVL+W +FS+VVA  DIP L
Sbjct: 394 NSRFCLCPMGHEVNSPRIVEALYYECVPVVIADNFVLPFSDVLDWTAFSVVVAEKDIPDL 453

Query: 273 KKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
           KKIL+GIS   Y+ +   V ++++HF WH  P  YD F+M+++ +WL R
Sbjct: 454 KKILQGISLRRYVAMHDCVKRLQRHFLWHARPLRYDLFHMILHSIWLSR 502


>gi|356518344|ref|XP_003527839.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
           max]
          Length = 637

 Score =  209 bits (532), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 129/339 (38%), Positives = 187/339 (55%), Gaps = 31/339 (9%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEP--RGNYASESYFKKAFMKS-HFVTKDPSKAD 57
           M ++ +VYVY  R  D       P+   P   G YASE +F +    S  FVTKDP KA 
Sbjct: 305 MEKTLKVYVY--REGDK------PIMHSPYLLGIYASEGWFMRLMEASKQFVTKDPKKAH 356

Query: 58  LFFLPFSIARMRHD-----RRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHS 112
           LF+LPFS +RM  +         +  +  ++ +Y+  I+ K+ +WNRTGGADHF VACH 
Sbjct: 357 LFYLPFS-SRMLEETLYVPNSHSSRNLIQYLKNYVDMIAGKHRFWNRTGGADHFLVACHD 415

Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS---SK 167
              +  ++     L A+   C++     G +  KD+SLP+ + R    P   +G    SK
Sbjct: 416 WAPTETRQHMARCLRAL---CNAD-VKEGFVLGKDISLPETYVRNAQKPTRNIGGNRVSK 471

Query: 168 RNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKG 222
           R  LAFFAG ++  VR  LLQ W N        G L        Y   +  SK+C+  KG
Sbjct: 472 RKTLAFFAGGMHGYVRPILLQHWENKDPAMKIFGILPKSKGNRNYIQYMKSSKYCICAKG 531

Query: 223 FEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSE 282
           +EVN+ R+ +++ Y CVPVI+++++  PF ++LNW+SF++ V   DIP LK IL  I  +
Sbjct: 532 YEVNSPRVVEAILYECVPVILSDNFVPPFFEMLNWESFAVFVLEKDIPNLKNILLSIPQK 591

Query: 283 EYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
            YL +Q  V KV++HF WH  P  YD F+MV++ +W  R
Sbjct: 592 RYLQMQMMVRKVQQHFLWHRSPVKYDIFHMVLHSIWYNR 630


>gi|115470965|ref|NP_001059081.1| Os07g0188700 [Oryza sativa Japonica Group]
 gi|34394611|dbj|BAC83913.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
 gi|50508944|dbj|BAD31848.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
 gi|113610617|dbj|BAF20995.1| Os07g0188700 [Oryza sativa Japonica Group]
 gi|125599387|gb|EAZ38963.1| hypothetical protein OsJ_23384 [Oryza sativa Japonica Group]
 gi|215707162|dbj|BAG93622.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 606

 Score =  209 bits (532), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 135/349 (38%), Positives = 193/349 (55%), Gaps = 38/349 (10%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPV--DFEPRGNYASESYFKKAF-MKSHFVTKDPSKAD 57
           M + F+V+VY     +P      PV  D   R  Y++E  F  A  M++   T+DP +A 
Sbjct: 268 MEKVFKVFVY--EEGEP------PVFHDGPCRSIYSTEGRFIYAMEMENRMRTRDPDQAH 319

Query: 58  LFFLPFSIARM--------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVA 109
           +FFLPFS+ +M         HD     + +   IS YI  +S KYP+WNR+ GADHF ++
Sbjct: 320 VFFLPFSVVKMVKMIYEPNSHD----MDPLRRTISDYINVVSTKYPHWNRSLGADHFMLS 375

Query: 110 CHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI------WPRQEDPPKL 163
           CH  G         +  N+I+V+C+++    G    +DVSLP+I        RQ   P  
Sbjct: 376 CHDWGPYVSSANGHLFSNSIRVLCNANTS-EGFDPSRDVSLPEINLRSDVVDRQVGGPS- 433

Query: 164 GSSKRNKLAFFAGAVNSPVREKLLQVWR--NDSEIYAHS---GRLKTPYADELLGSKFCL 218
            +S R  LAFFAG  + PVR  LLQ W    D++I        R    Y D +  S+FCL
Sbjct: 434 -ASHRPILAFFAGGDHGPVRPLLLQHWGKGQDADIQVSEYLPRRHGMSYTDMMRRSRFCL 492

Query: 219 HVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKG 278
              G+EV + R+ +++Y  CVPV+I + Y LPFADVLNW +FS+ VA  DIP LK+IL  
Sbjct: 493 CPSGYEVASPRVVEAIYLECVPVVIGDDYTLPFADVLNWAAFSVRVAVGDIPRLKEILAA 552

Query: 279 ISSEEYLLLQSYVLKVRKHFQW-HVFPSDYDAFYMVMYELWLRRSSVRV 326
           +S  +Y+ +Q  V  VR+HF      P  +D F+M+++ +WLRR +VRV
Sbjct: 553 VSPRQYIRMQRRVRAVRRHFMVSDGAPRRFDVFHMILHSIWLRRLNVRV 601


>gi|224142972|ref|XP_002324801.1| predicted protein [Populus trichocarpa]
 gi|222866235|gb|EEF03366.1| predicted protein [Populus trichocarpa]
          Length = 398

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 122/309 (39%), Positives = 180/309 (58%), Gaps = 21/309 (6%)

Query: 30  RGNYASESYFKKAF-MKSHFVTKDPSKADLFFLPFSIARMRHDRRIGT----EGIPDFIS 84
           +G YASE +F K       FVTKDP K+ LF+LPFS   +  +  +      + +  ++ 
Sbjct: 87  KGIYASEGWFMKLLETNKKFVTKDPKKSHLFYLPFSSRNLEVNLYVPNSHSHKNLIQYLK 146

Query: 85  HYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
           +Y+  IS KYP+WNRT GADHF VACH    +  ++      N I+ +C+S     G + 
Sbjct: 147 NYLDMISAKYPFWNRTRGADHFLVACHDWAPTETRQHMA---NCIRALCNSDA-KGGFVF 202

Query: 145 HKDVSLPQIWPR--QEDPPKLG---SSKRNKLAFFAGAVNSPVREKLLQVWRN---DSEI 196
            KD +LP+   R  Q     LG   +SKR+ LAFFAG+++  +R  LLQ W N   D ++
Sbjct: 203 GKDAALPETTVRTPQNLLRDLGGKPASKRSILAFFAGSMHGYLRPILLQHWGNKDPDVKV 262

Query: 197 YAH----SGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFA 252
           +       GR K  Y   +  SK+C+  KGFEVN+ R+ ++++Y CVPVII++++  PF 
Sbjct: 263 FGKLPKVKGRGKMNYPQYMKSSKYCICAKGFEVNSPRVVEAIFYECVPVIISDNFVPPFF 322

Query: 253 DVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYM 312
           +VLNW+SF++ V   DIP LK IL  I   +Y  +Q  V KV++HF WH  P  YD F+M
Sbjct: 323 EVLNWESFAVFVLEKDIPNLKNILLSIPENKYREMQMRVKKVQQHFLWHARPVKYDIFHM 382

Query: 313 VMYELWLRR 321
           +++ +W  R
Sbjct: 383 ILHSVWYNR 391


>gi|255546377|ref|XP_002514248.1| catalytic, putative [Ricinus communis]
 gi|223546704|gb|EEF48202.1| catalytic, putative [Ricinus communis]
          Length = 676

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 125/341 (36%), Positives = 190/341 (55%), Gaps = 34/341 (9%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFM-KSHFVTKDPSKADLF 59
           M R+ +VY+Y   +   F    LP+    +G YASE +F K      HF+ KDP +A LF
Sbjct: 343 MERTLKVYIYKDGKKPIFH---LPI---MKGLYASEGWFMKLMQGNKHFLVKDPRRAHLF 396

Query: 60  FLPFS--------IARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
           ++PFS          R  H+R      +  ++  Y   I+ KYP+WNRT GADHF VACH
Sbjct: 397 YMPFSSRMLEYTLYVRNSHNR----TNLRQYLKDYSEKIAAKYPFWNRTDGADHFLVACH 452

Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS---S 166
                  +   E   + I+ +C++    +G    +D+SLP+ + R    P   LG    S
Sbjct: 453 DWAPYETRHHME---HCIKALCNAD-VTAGFKIGRDISLPETYVRSARNPLRDLGGKPPS 508

Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL------KTPYADELLGSKFCLHV 220
           +R+ LAF+AG+++  +R  LL+ W++        G +      K  Y   +  SK+C+  
Sbjct: 509 QRHILAFYAGSMHGYLRPILLKYWKDKDPSMKIFGPMPPGVASKMNYIQHMKSSKYCICP 568

Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
           KG+EVN+ R+ ++++Y CVPVII++++  PF +V NW +FS+++A  DIP LK+IL  I 
Sbjct: 569 KGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVFNWGAFSLILAEKDIPNLKEILLSIP 628

Query: 281 SEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
            E+YL +Q  V KV+KHF WH  P  YD FYM ++ +W  R
Sbjct: 629 EEKYLEMQLGVRKVQKHFLWHPSPMKYDLFYMTLHAIWYNR 669


>gi|302761638|ref|XP_002964241.1| xylosyltransferase-like protein [Selaginella moellendorffii]
 gi|300167970|gb|EFJ34574.1| xylosyltransferase-like protein [Selaginella moellendorffii]
          Length = 332

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 130/341 (38%), Positives = 188/341 (55%), Gaps = 36/341 (10%)

Query: 1   MNRSFRVYVYPHRR-----NDPFANVLLPVDFEPRGNYASESYFKKAFM-KSHFVTKDPS 54
           M +  +V+ YP        N P   +           YA E  F +    K+ ++T DP 
Sbjct: 1   MEKRLKVFAYPEGEEPLVHNGPCKEI-----------YAIEGRFIQELQGKNSYLTSDPE 49

Query: 55  KADLFFLPFSIARM--------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHF 106
           KA LFFLPFS+A M         HD  +G  G   F   YI  IS +Y  WNR+ GADHF
Sbjct: 50  KAHLFFLPFSVAMMVTYLYTPGSHD--MGPLG--RFTRDYIDVISHRYSAWNRSRGADHF 105

Query: 107 YVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQEDPPKLGS 165
            V+CH  G    +   ++  N+I+V+C+++    G++  KD SLP+I     + P  LG 
Sbjct: 106 MVSCHDWGPHISRAHPDLMANSIRVLCNANT-SEGYVPSKDASLPEIHLVGGQVPSVLGG 164

Query: 166 ---SKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL--KTPYADELLGSKFCLHV 220
               +R  LAFFAG  + PVR  L + W+   E      +L  +  Y D +  SK+CL  
Sbjct: 165 PPPEERRYLAFFAGGDHGPVRPVLFKYWKEKDEDVRVFEKLPSRDAYLDYMSHSKYCLCP 224

Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
            G+EVN+ RI +++Y  CVPV+IA+ + LPF+DVL+W +FS+ V   DIP LK IL+ I 
Sbjct: 225 GGYEVNSPRIVEAIYNDCVPVVIADDFVLPFSDVLDWDAFSVKVLERDIPRLKTILQAIP 284

Query: 281 SEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
           +  YL +Q+ V KVR+HF+++  P  YD F M+++ +WLRR
Sbjct: 285 TARYLEMQARVSKVRRHFRFNQPPERYDVFNMILHSVWLRR 325


>gi|224092292|ref|XP_002309546.1| predicted protein [Populus trichocarpa]
 gi|222855522|gb|EEE93069.1| predicted protein [Populus trichocarpa]
          Length = 417

 Score =  208 bits (529), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 129/340 (37%), Positives = 185/340 (54%), Gaps = 33/340 (9%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAF-MKSHFVTKDPSKADLF 59
           M +  +VYVY       F   +L       G YASE +F K      +FVTKDP KA LF
Sbjct: 85  MEKMLKVYVYQDGEKPIFHQPIL------DGIYASEGWFMKHMEANENFVTKDPGKAHLF 138

Query: 60  FLPFS--------IARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
           +LPFS          R  H R      + +++ +Y   I+ KY +WNRTGGADHF  ACH
Sbjct: 139 YLPFSSRLLELTLYVRHSHSR----TNLIEYMRNYAGMIAAKYHFWNRTGGADHFVAACH 194

Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLG-----SS 166
               +  +      LN I+ +C++   +   I  KDVSLP+ + R    P         S
Sbjct: 195 DWAPAETRGPL---LNCIRALCNADIEVGFSIG-KDVSLPETYVRSAQNPLKNLEGNPPS 250

Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYAHSGRLK--TPYADELLGSKFCLHVK 221
           +R  LAFFAG ++  VR  LL  W N   D +I+     +K  T Y   +  SKFC+  +
Sbjct: 251 QRPILAFFAGNMHGYVRPVLLDYWGNKDPDMKIFGPMPHVKGNTNYIQHMKSSKFCICPR 310

Query: 222 GFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISS 281
           G EVN+ RI ++++  CVPVII++++  PF +VL+W+SF+++V   DIP LK IL  IS 
Sbjct: 311 GHEVNSPRIVEAIFLECVPVIISDNFVPPFFEVLDWESFAVIVLEKDIPNLKNILVSISE 370

Query: 282 EEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
           E+Y+ +   V KV++HF WH  P  YD F+M+++ +W  R
Sbjct: 371 EKYIEMHKRVKKVQQHFLWHSKPEKYDLFHMILHSVWYNR 410


>gi|255546379|ref|XP_002514249.1| catalytic, putative [Ricinus communis]
 gi|223546705|gb|EEF48203.1| catalytic, putative [Ricinus communis]
          Length = 334

 Score =  208 bits (529), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 128/341 (37%), Positives = 188/341 (55%), Gaps = 34/341 (9%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFM-KSHFVTKDPSKADLF 59
           M R  +VYVY       F    +      RG YASE +F K       F+ KDP KA LF
Sbjct: 1   MERLLKVYVYKEGEKPIFHQSKM------RGIYASEGWFMKHIEGNKKFLVKDPRKAHLF 54

Query: 60  FLPFSIARMRH----DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHS--- 112
           FLPFS   +R      +    + + +++ +Y+  +++KY +WNRTGG DHF V CH    
Sbjct: 55  FLPFSPQMLRTVIFGQKLQSQKDLEEYLKNYVDLVARKYSFWNRTGGTDHFLVGCHDWAS 114

Query: 113 -IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS---S 166
            I R  MQ       N I+V+C+++    G    KD +LP  + R  + P   +G    S
Sbjct: 115 RITRKYMQ-------NCIRVLCNAN-VAKGFKIGKDTTLPVTYIRSAENPLKDVGGKHPS 166

Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYAHSGR---LKTPYADELLGSKFCLHV 220
           +R  LAFFAG ++  +R  L+Q W N   D +I+    R    K  Y + +  SK+C+  
Sbjct: 167 ERYTLAFFAGGMHGYLRPILVQFWENKESDMKIFGPMPRDIEGKRLYREYMKSSKYCICA 226

Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
           +G+EV+T RI +++ Y CVPVII+++Y  PF +VLNW++FS+ V   DIP L+ IL  I 
Sbjct: 227 RGYEVHTPRIVEAILYECVPVIISDNYVPPFFEVLNWEAFSVFVQEKDIPNLRSILLSIP 286

Query: 281 SEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
            E+YL +Q  V  V++HF WH  P  YD F+M+++ +W  R
Sbjct: 287 EEKYLEMQLRVKMVQQHFLWHKNPVKYDLFHMILHSVWHNR 327


>gi|357520911|ref|XP_003630744.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
 gi|355524766|gb|AET05220.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
          Length = 653

 Score =  208 bits (529), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 128/341 (37%), Positives = 189/341 (55%), Gaps = 34/341 (9%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKS-HFVTKDPSKADLF 59
           M  + +VY+Y       F   +L      +G YASE +F K   ++  FV KDP+KA LF
Sbjct: 318 MELTLKVYIYMEGNKPIFHQPIL------KGLYASEGWFMKLMEENKQFVVKDPAKAHLF 371

Query: 60  FLPFS--------IARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
           ++PFS          R  H+R      +  ++  Y   IS KY Y+NRTGGADHF VACH
Sbjct: 372 YMPFSSRMLEFSVYVRNSHNR----TNLRQYLKEYTDKISAKYRYFNRTGGADHFLVACH 427

Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPK--LGS---S 166
                  +   E     I+ +C+S     G    +DVSLP+   R    P+  LG     
Sbjct: 428 DWAPYETRHHMEY---CIKALCNSD-VTQGFKIGRDVSLPETMVRSVRNPQRDLGGKPPQ 483

Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYA---HSGRLKTPYADELLGSKFCLHV 220
           +R+ LAF+AG ++  +R  LL+ W+    D +I+    H    K  Y + +  SK+C+  
Sbjct: 484 QRSILAFYAGNMHGYLRSILLKHWKEKDPDMKIFGPMPHGVAHKMNYIEHMKSSKYCICP 543

Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
           KG+EVN+ R+ ++++Y CVPVII++++  PF +VLNW SFS+++A  DIP LK+IL  + 
Sbjct: 544 KGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLNWDSFSLILAEKDIPNLKQILLSVP 603

Query: 281 SEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
            E+YL LQ  V +V+KHF WH  P  YD F+M ++ +W  R
Sbjct: 604 EEKYLKLQLGVRRVQKHFLWHTKPLKYDLFHMTLHSIWYNR 644


>gi|297802718|ref|XP_002869243.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297315079|gb|EFH45502.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 583

 Score =  207 bits (528), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 129/341 (37%), Positives = 189/341 (55%), Gaps = 34/341 (9%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEP--RGNYASESYFKKAFMKSH-FVTKDPSKAD 57
           M +  +VYVY   +         PV  +P  +G YASE +F K    S  FVTKDP KA 
Sbjct: 253 MEKKLKVYVYREGKR--------PVLHKPVLKGIYASEGWFMKQLKSSRTFVTKDPRKAH 304

Query: 58  LFFLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
           LF+LPFS   +     +      + +  F+ +Y+  IS KY +WN+TGG+DHF VACH  
Sbjct: 305 LFYLPFSSKMLEETLYVPGSHSDQNLIQFLKNYLDMISSKYNFWNKTGGSDHFLVACHDW 364

Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ---IWPRQEDPPKLGS--SKR 168
             S  ++        I+ +C+S     G +  KDV+LP+   + PR+      G   S+R
Sbjct: 365 APSETRQYMA---KCIRALCNSD-VSEGFVFGKDVALPETTILVPRRPLRALGGKPVSQR 420

Query: 169 NKLAFFAGAVNSPVREKLLQVWRND--------SEIYAHSGRLKTPYADELLGSKFCLHV 220
             LAFFAG ++  +R  LL+ W  +        SEI    G  K  Y + +  SKFC+  
Sbjct: 421 QILAFFAGGMHGYLRPLLLRNWGGNRDPDMKIFSEIPKSKG--KKSYMEYMKSSKFCICP 478

Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
           KG EVN+ R+ ++L+Y CVPVII++++  PF +VLNW++F++ V   DIP LK IL  I+
Sbjct: 479 KGHEVNSPRVVEALFYECVPVIISDNFVPPFFEVLNWEAFAVFVLEKDIPDLKNILVSIT 538

Query: 281 SEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
            E Y  +Q+ V  V+KHF WH  P  +D F+M+++ +W  R
Sbjct: 539 EERYREMQTRVKMVQKHFLWHSKPERFDIFHMILHSIWYNR 579


>gi|449442160|ref|XP_004138850.1| PREDICTED: probable glycosyltransferase At5g25310-like [Cucumis
           sativus]
 gi|449530249|ref|XP_004172108.1| PREDICTED: probable glycosyltransferase At5g25310-like [Cucumis
           sativus]
          Length = 465

 Score =  207 bits (528), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 133/343 (38%), Positives = 191/343 (55%), Gaps = 26/343 (7%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFE-PRGN-YASESYFKKAFMK--SHFVTKDPSKA 56
           M + F+VYVYP           LP+    P  N Y  E  F        + F T DPS+A
Sbjct: 131 MEKRFKVYVYPEGE--------LPITHAGPCKNIYTIEGRFIHEMEDGGNGFRTVDPSRA 182

Query: 57  DLFFLPFSIARM----RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHS 112
            + F+PFS+A M      D       +  F+S Y+  +S+KYP+WN+T GADHF +ACH 
Sbjct: 183 HVLFMPFSVAWMVKYLYKDGSYDQTPLRMFVSDYVEVVSKKYPFWNKTNGADHFILACHD 242

Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQ-EDPPKLGSSK---- 167
            G  A +    +   +I+V+C+++    G    KDVSLP+I     E  PKL S+     
Sbjct: 243 WGPIATEGNRFLYNTSIRVLCNANS-SEGFNPQKDVSLPEIHLYDGEISPKLLSASNSHH 301

Query: 168 -RNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL---KTPYADELLGSKFCLHVKGF 223
            R  LAFFAG ++ P+R  LL  W+N +    H       +  Y DE+L S+FCL   G+
Sbjct: 302 HRPHLAFFAGGLHGPIRPILLNHWKNRTHTNIHVYEYLPKQLDYYDEMLQSRFCLCPSGY 361

Query: 224 EVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEE 283
           EV + RI +++Y  CVPVII+  Y LPF+DVL W+ FSI V+  +IP L++IL G+S E 
Sbjct: 362 EVASPRIVEAIYAECVPVIISERYVLPFSDVLRWEGFSIEVSVSEIPRLEEILMGVSEER 421

Query: 284 YLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
           Y  L   +  VRKHF  +     +DAF+M+++ +WLRR +V++
Sbjct: 422 YEKLIQGLRTVRKHFVLNRPAKRFDAFHMILHSVWLRRLNVKL 464


>gi|255550904|ref|XP_002516500.1| catalytic, putative [Ricinus communis]
 gi|223544320|gb|EEF45841.1| catalytic, putative [Ricinus communis]
          Length = 456

 Score =  207 bits (528), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 127/340 (37%), Positives = 186/340 (54%), Gaps = 33/340 (9%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH-FVTKDPSKADLF 59
           M    +V++Y       F   +L       G YASE +F K    +  FVTKDP +A LF
Sbjct: 124 MENMLKVFIYQEGEKPIFHQSIL------EGIYASEGWFIKLMEANEKFVTKDPKEAHLF 177

Query: 60  FLPFS--------IARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
           ++PFS          R  H R      + +++ +Y   I+ KYP+W+RTGGADHF  ACH
Sbjct: 178 YIPFSSRLLELTLYVRKSHSRN----NLIEYMKNYTDMIAAKYPFWSRTGGADHFVAACH 233

Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLG-----SS 166
               +  +      LN I+ +C++   +   I  KDVSLP+ + R    P         S
Sbjct: 234 DWAPAETRGRM---LNCIRALCNADIDVGFRIG-KDVSLPETYVRSAQNPLKNLDGNPPS 289

Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYAHSGRLK--TPYADELLGSKFCLHVK 221
           +R  LAFFAG V+  VR  LL+ W N   + +I+    R+K  T Y   +  SK+C+  +
Sbjct: 290 QRPILAFFAGNVHGFVRPILLEYWENKDPEMKIFGPMPRVKGNTNYIQLMKSSKYCICPR 349

Query: 222 GFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISS 281
           G EVN+ RI +S++Y CVPVII+++Y  PF +VL+W+SF++ V   DIP LKKIL  I  
Sbjct: 350 GHEVNSPRIVESIFYECVPVIISDNYVPPFFEVLDWESFAVFVLEKDIPNLKKILLSIPE 409

Query: 282 EEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
           E Y+ +   V KV++HF WH  P  +D F+M+++ +W  R
Sbjct: 410 ETYVEMHKRVKKVQQHFLWHSEPEKHDLFHMILHSVWYNR 449


>gi|147799433|emb|CAN76867.1| hypothetical protein VITISV_012309 [Vitis vinifera]
          Length = 1908

 Score =  207 bits (528), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 128/341 (37%), Positives = 188/341 (55%), Gaps = 34/341 (9%)

Query: 1    MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH-FVTKDPSKADLF 59
            M R  +VYVY       F   +L      +G YASE +F K   ++  FV KDP +A LF
Sbjct: 1061 MERILKVYVYKDGEKPIFHQPIL------KGLYASEGWFMKLMERNKXFVVKDPRQAQLF 1114

Query: 60   FLPFS--------IARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
            ++PFS          R  H+R      +  ++  Y   I+ KY +WNRTGG DHF VACH
Sbjct: 1115 YMPFSSRMLEYKLYVRNSHNR----TNLRQYLKQYSEKIAAKYRFWNRTGGXDHFLVACH 1170

Query: 112  SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS---S 166
                   +   E     I+ +C++    +G    +DVSLP+ + R    P   LG    S
Sbjct: 1171 DWAPYETRHHME---QCIKALCNAD-VTAGFKIGRDVSLPETYVRSARNPLRDLGGKPPS 1226

Query: 167  KRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYAH---SGRLKTPYADELLGSKFCLHV 220
            +R+ LAF+AG ++  +R  LL+ W++   D +IY         K  Y   +  SKFC+  
Sbjct: 1227 ERHILAFYAGNMHGYLRPILLKYWKDKDPDMKIYGPMPPGVASKMNYIQHMKSSKFCICP 1286

Query: 221  KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
            KG+EVN+ R+ ++++Y CVPVII++++  PF DVL+W +FSI++A  DIP LK +L  I 
Sbjct: 1287 KGYEVNSPRVVEAIFYECVPVIISDNFVPPFFDVLDWGAFSIILAEKDIPNLKDVLLSIP 1346

Query: 281  SEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
            +E+YL +Q  V KV+KHF WH  P  YD F+M ++ +W  R
Sbjct: 1347 NEKYLQMQLGVRKVQKHFLWHAKPLKYDLFHMTLHSIWYNR 1387



 Score =  195 bits (496), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 129/384 (33%), Positives = 199/384 (51%), Gaps = 72/384 (18%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPR--GNYASESYFKKAFMKS-HFVTKDPSKAD 57
           M R  ++Y+Y             P+  +PR  G YASE +F K    +  FV +DP KA 
Sbjct: 332 MERVLKIYIYREGEK--------PIFHQPRLRGIYASEGWFMKLIEGNKRFVVRDPRKAH 383

Query: 58  LFFLPFSIARMR---HDRRIGT-EGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHS- 112
           LF++PFS   +R   +++   T   +  +  +Y+  I+ KY +WNRTGGADH  VACH  
Sbjct: 384 LFYVPFSSKMLRTVFYEQNSSTPRDLEKYFKNYVGLIAGKYRFWNRTGGADHLIVACHDW 443

Query: 113 --IGRSAMQKAWEVKL------------------------------------------NA 128
             I R+       +K                                           N+
Sbjct: 444 NPIYRTISTNTIRIKSQAITMPPFIFVGGESTYDLVSGTFSNKGFNSQAPRITRQCSWNS 503

Query: 129 IQVVCSSSYFISGHIAHKDVSLPQIWPRQ-EDPPK-LGS---SKRNKLAFFAGAVNSPVR 183
           I+ +C+S+   SG    KD +LP  + R+ EDP K LG    S+R  LAFFAG+++  +R
Sbjct: 504 IRALCNSN-IASGFKIGKDTTLPVTYIRKSEDPLKYLGGKPPSQRPILAFFAGSMHGYLR 562

Query: 184 EKLLQVWRN---DSEIYAHSGR---LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYG 237
             LLQ W N   D +I+    R    K+ Y D +  SK+C+  +G+EV+T R+ ++++Y 
Sbjct: 563 PILLQYWENKEQDIKIFGPMSRDDGGKSRYRDHMKSSKYCICARGYEVHTPRVVEAIFYE 622

Query: 238 CVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKH 297
           CVPVII+++Y  PF ++LNW++F++ +   D+P L+ IL  I  E+YL +Q  V  V++H
Sbjct: 623 CVPVIISDNYVPPFFEILNWEAFAVFILEKDVPNLRNILLSIPEEKYLQMQMRVKMVQQH 682

Query: 298 FQWHVFPSDYDAFYMVMYELWLRR 321
           F WH  P  YD F+M+++ +W  R
Sbjct: 683 FLWHKKPVKYDLFHMILHSVWYNR 706


>gi|15233924|ref|NP_195005.1| Exostosin family protein [Arabidopsis thaliana]
 gi|3063691|emb|CAA18582.1| putative protein [Arabidopsis thaliana]
 gi|7270226|emb|CAB79996.1| putative protein [Arabidopsis thaliana]
 gi|332660719|gb|AEE86119.1| Exostosin family protein [Arabidopsis thaliana]
          Length = 593

 Score =  207 bits (528), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 130/341 (38%), Positives = 188/341 (55%), Gaps = 34/341 (9%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEP--RGNYASESYFKKAFMKSH-FVTKDPSKAD 57
           M +  +VYVY   +         PV  +P  +G YASE +F K    S  FVTKDP KA 
Sbjct: 263 MEKKLKVYVYREGKR--------PVLHKPVLKGIYASEGWFMKQLKSSRTFVTKDPRKAH 314

Query: 58  LFFLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
           LF+LPFS   +     +      + +  F+ +Y+  IS KY +WN+TGG+DHF VACH  
Sbjct: 315 LFYLPFSSKMLEETLYVPGSHSDKNLIQFLKNYLDMISSKYSFWNKTGGSDHFLVACHDW 374

Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ---IWPRQEDPPKLGS--SKR 168
             S  ++        I+ +C+S     G +  KDV+LP+   + PR+      G   S+R
Sbjct: 375 APSETRQYMA---KCIRALCNSD-VSEGFVFGKDVALPETTILVPRRPLRALGGKPVSQR 430

Query: 169 NKLAFFAGAVNSPVREKLLQVWRND--------SEIYAHSGRLKTPYADELLGSKFCLHV 220
             LAFFAG ++  +R  LLQ W  +        SEI    G  K  Y + +  SK+C+  
Sbjct: 431 QILAFFAGGMHGYLRPLLLQNWGGNRDPDMKIFSEIPKSKG--KKSYMEYMKSSKYCICP 488

Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
           KG EVN+ R+ ++L+Y CVPVII++++  PF +VLNW+SF++ V   DIP LK IL  I+
Sbjct: 489 KGHEVNSPRVVEALFYECVPVIISDNFVPPFFEVLNWESFAVFVLEKDIPDLKNILVSIT 548

Query: 281 SEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
            E Y  +Q  V  V+KHF WH  P  +D F+M+++ +W  R
Sbjct: 549 EERYREMQMRVKMVQKHFLWHSKPERFDIFHMILHSIWYNR 589


>gi|302815783|ref|XP_002989572.1| xylosyltransferase-like protein [Selaginella moellendorffii]
 gi|300142750|gb|EFJ09448.1| xylosyltransferase-like protein [Selaginella moellendorffii]
          Length = 332

 Score =  207 bits (526), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 129/341 (37%), Positives = 187/341 (54%), Gaps = 36/341 (10%)

Query: 1   MNRSFRVYVYPHRR-----NDPFANVLLPVDFEPRGNYASESYFKKAFM-KSHFVTKDPS 54
           M +  +V+ YP        N P   +           YA E  F +    K+ ++T DP 
Sbjct: 1   MEKRLKVFAYPEGEEPLVHNGPCKEI-----------YAIEGRFIQELQGKNSYLTSDPE 49

Query: 55  KADLFFLPFSIARM--------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHF 106
           KA LFFLPFS+A M         HD  +G  G   F   YI  IS +Y  WNR+ GADHF
Sbjct: 50  KAHLFFLPFSVAMMVTYLYTPGSHD--MGPLG--RFTRDYIDVISHRYSSWNRSRGADHF 105

Query: 107 YVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQEDPPKLGS 165
            V+CH  G    +   ++  N+I+V+C+++    G++  KD SLP+I     + P  LG 
Sbjct: 106 MVSCHDWGPHISRAHPDLMANSIRVLCNANT-SEGYVPSKDASLPEIHLVGGQVPSVLGG 164

Query: 166 ---SKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL--KTPYADELLGSKFCLHV 220
               +R  LAFFAG  + PVR  L + W+   E      +L  +  Y D +  SK+CL  
Sbjct: 165 PPPEERRYLAFFAGGDHGPVRPVLFKYWKEKDEDVRVFEKLPSRDAYLDYMSHSKYCLCP 224

Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
            G+EVN+ RI +++Y  CVPV+IA+ + LPF+DVL+W +FS+ V   DIP LK IL+ I 
Sbjct: 225 GGYEVNSPRIVEAIYNDCVPVVIADDFVLPFSDVLDWDAFSVKVLERDIPRLKTILQAIP 284

Query: 281 SEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
           +  YL +Q+   KVR+HF+++  P  YD F M+++ +WLRR
Sbjct: 285 TARYLEMQARASKVRRHFRFNQPPERYDVFNMILHSVWLRR 325


>gi|224142181|ref|XP_002324437.1| predicted protein [Populus trichocarpa]
 gi|222865871|gb|EEF03002.1| predicted protein [Populus trichocarpa]
          Length = 334

 Score =  206 bits (524), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 126/337 (37%), Positives = 187/337 (55%), Gaps = 26/337 (7%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFM-KSHFVTKDPSKADLF 59
           M R  +VYVY       F    +      RG YASE +F K       FV +DP KA LF
Sbjct: 1   MERMLKVYVYKEGEKPIFHQSKM------RGIYASEGWFMKLIEGNKKFVVRDPRKAHLF 54

Query: 60  FLPFSIARMR----HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
           +LPFS   +R        +  + + +F+ +Y+  +++KY +WNRTGG DHF V CH    
Sbjct: 55  YLPFSPHMLRTALFDHNSLNQKELAEFLKNYVDLVAKKYSFWNRTGGTDHFLVGCHDW-- 112

Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS---SKRNK 170
            A Q       N I+V+C+S+    G    KD +LP  + R  + P  +LG    S+R  
Sbjct: 113 -ASQMTRHHMRNCIRVLCNSN-VAKGFKIGKDTTLPVTYIRSVENPLKELGGKSPSERPI 170

Query: 171 LAFFAGAVNSPVREKLLQVWRN---DSEIYAHSGR---LKTPYADELLGSKFCLHVKGFE 224
           LAFFAG ++  +R  LL+ W N   D +I     R    K  Y + +  SK+C+  +G+E
Sbjct: 171 LAFFAGNMHGYLRPILLEYWENKEPDMKILGPMSRDIAGKRRYREYMKRSKYCICARGYE 230

Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
           V+T R+ +S++Y CVPVII+++Y  P  +VLNW++FS+ +   DIP L+ IL  I  E+Y
Sbjct: 231 VHTPRVVESIFYECVPVIISDNYVPPLFEVLNWEAFSVFIQEKDIPNLRNILLSIPQEKY 290

Query: 285 LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
           + +Q  V KV++HF WH  P  YD F+M+++ +W  R
Sbjct: 291 VAMQLGVKKVQQHFLWHKKPVKYDLFHMILHSVWHSR 327


>gi|356510025|ref|XP_003523741.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
           max]
          Length = 619

 Score =  206 bits (523), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 121/336 (36%), Positives = 183/336 (54%), Gaps = 25/336 (7%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAF-MKSHFVTKDPSKADLF 59
           M    +VY+Y       F   LL       G YASE +F K       FVT+DP KA LF
Sbjct: 287 MENMLKVYIYQDGDRPIFHEPLL------DGIYASEGWFMKLMEANKQFVTRDPGKAHLF 340

Query: 60  FLPFSIARMRHDRRIGTE----GIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
           ++PFS   ++    +        + +++ +Y+  I+ KYP+WNRT GADHF VACH    
Sbjct: 341 YIPFSSRLLQQTLYVRNSHRRSNLIEYMKNYVDMIAGKYPFWNRTSGADHFVVACHDWAP 400

Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS---SKRNK 170
           +  +      L+ I+ +C++   +   I  KDVSLP+ + R  + P   +G    SKR  
Sbjct: 401 AETRGRM---LSCIRALCNADIEVGFKIG-KDVSLPETYIRSSENPVKNIGGDPPSKRPI 456

Query: 171 LAFFAGAVNSPVREKLLQVWRNDSEIYAHSG-----RLKTPYADELLGSKFCLHVKGFEV 225
           LAFFAG ++  VR  LL+ W N       SG     R    Y   +  SKFC+  +G EV
Sbjct: 457 LAFFAGGLHGYVRPILLKHWENKEPDMKISGPLPHVRGNVNYIQLMKSSKFCICARGHEV 516

Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
           N+ R+ +++++ C+PVII++++  PF ++LNW+SF++ V   +IP L+ IL  IS E YL
Sbjct: 517 NSPRVVEAIFHECIPVIISDNFIPPFFEILNWESFAVFVKEEEIPNLRNILLSISEERYL 576

Query: 286 LLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
            +     KV++HF WH  P  YD F+M+++ +W  R
Sbjct: 577 EMHKRAKKVQEHFLWHAEPVKYDLFHMLLHSIWYNR 612


>gi|125557509|gb|EAZ03045.1| hypothetical protein OsI_25187 [Oryza sativa Indica Group]
          Length = 601

 Score =  205 bits (522), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 134/349 (38%), Positives = 193/349 (55%), Gaps = 38/349 (10%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPV--DFEPRGNYASESYFKKAF-MKSHFVTKDPSKAD 57
           M + F+V+VY     +P      PV  D      Y++E  F  A  M++   T+DP++A 
Sbjct: 263 MEKVFKVFVY--EEGEP------PVFHDGPCHSIYSTEGRFIYAMEMENRMRTRDPNQAH 314

Query: 58  LFFLPFSIARM--------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVA 109
           +FFLPFS+ +M         HD     + +   IS YI  +S KYP+WNR+ GADHF ++
Sbjct: 315 VFFLPFSVVKMVKMIYEPNSHD----MDPLRRTISDYINVVSTKYPHWNRSLGADHFMLS 370

Query: 110 CHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI------WPRQEDPPKL 163
           CH  G         +  N+I+V+C+++    G    +DVSLP+I        RQ   P  
Sbjct: 371 CHDWGPYVSSANGHLFSNSIRVLCNANTS-EGFDPSRDVSLPEINLRSDVVARQVGGPS- 428

Query: 164 GSSKRNKLAFFAGAVNSPVREKLLQVWR--NDSEIYAHS---GRLKTPYADELLGSKFCL 218
            +S R  LAFFAG  + PVR  LLQ W    D++I        R    Y D +  S+FCL
Sbjct: 429 -ASHRPILAFFAGGDHGPVRPLLLQHWGKGQDADIQVSEYLPRRHSMSYTDMMRRSRFCL 487

Query: 219 HVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKG 278
              G+EV + R+ +++Y  CVPV+I + Y LPFADVLNW +FS+ VA  DIP LK+IL  
Sbjct: 488 CPSGYEVASPRVVEAIYLECVPVVIGDDYALPFADVLNWAAFSVRVAVGDIPRLKEILAA 547

Query: 279 ISSEEYLLLQSYVLKVRKHFQW-HVFPSDYDAFYMVMYELWLRRSSVRV 326
           +S  +Y+ +Q  V  VR+HF      P  +D F+M+++ +WLRR +VRV
Sbjct: 548 VSPRQYIRMQRRVRAVRRHFMVSDGAPWRFDVFHMILHSIWLRRLNVRV 596


>gi|359484716|ref|XP_002263848.2| PREDICTED: probable glycosyltransferase At5g20260-like [Vitis
           vinifera]
 gi|296084516|emb|CBI25537.3| unnamed protein product [Vitis vinifera]
          Length = 477

 Score =  205 bits (521), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 124/335 (37%), Positives = 185/335 (55%), Gaps = 17/335 (5%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M + F+V+ Y       F    L   +   G +  E  F     KS F+ K P +A  FF
Sbjct: 139 MVKRFKVWTYREGAQPIFHEGPLTNIYAIEGQFIDEMDFIVG--KSPFIAKHPDEAHAFF 196

Query: 61  LPFSIARMRHDRRIGTEGIPDF--------ISHYIFNISQKYPYWNRTGGADHFYVACHS 112
           LP S+ ++     +      D+        ++ Y+  ++ KYPYWNR+GGADHF V+CH 
Sbjct: 197 LPLSVVKVVQFLYLPITSPEDYSRKRLQRIVTDYVKVVADKYPYWNRSGGADHFMVSCHD 256

Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQE-DPPKLGSSKRNK 170
              S      E+  N I+V+C+++    G    +DVSLP++  P  E  PP LG    N+
Sbjct: 257 WAPSVSYANPELFKNFIRVLCNANS-SEGFRPGRDVSLPEVNLPAGELGPPHLGQPSNNR 315

Query: 171 --LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELLG-SKFCLHVKGFEVN 226
             LAFFAG  +  +R+ L + W++ D+E+  H    K     +L+G SKFCL   G+EV 
Sbjct: 316 PVLAFFAGRAHGNIRKILFEHWKDQDNEVLVHERLHKGQNYAKLMGQSKFCLCPSGYEVA 375

Query: 227 TARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLL 286
           + R+ ++++ GCVPVII+N+Y LPF DVL+W  FSI +    IP +K IL GIS  +YL 
Sbjct: 376 SPRVVEAIHAGCVPVIISNNYSLPFNDVLDWSQFSIQIPVAKIPEIKTILLGISKNKYLK 435

Query: 287 LQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
           +Q  VL+VR+HF  +     +D  +M+++ LWLRR
Sbjct: 436 MQERVLRVRRHFVLNRPARPFDIIHMILHSLWLRR 470


>gi|357466145|ref|XP_003603357.1| hypothetical protein MTR_3g106690 [Medicago truncatula]
 gi|355492405|gb|AES73608.1| hypothetical protein MTR_3g106690 [Medicago truncatula]
          Length = 393

 Score =  204 bits (520), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 129/340 (37%), Positives = 191/340 (56%), Gaps = 21/340 (6%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKS--HFVTKDPSKADL 58
           M + F+VYVYP        ++ +  D   +  Y++E  F     +    F T DP+ A +
Sbjct: 59  MEKIFKVYVYPD------GDLPIVHDGPCKDIYSTEGRFLHEMERGVGKFRTNDPNAAHV 112

Query: 59  FFLPFSIARMRH----DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
           +FLPFS+  M              +  F+S Y+  +S +YP+WNRT GADHF +ACH  G
Sbjct: 113 YFLPFSVTWMVKYLYTPSSYDITPLTQFVSDYVRVVSMRYPFWNRTHGADHFMLACHDWG 172

Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQEDPPKLGS-----SKR 168
             A +    +   +I+V+C+++    G    KDVSLP+I     E  PKL S     + R
Sbjct: 173 PHASKGNPFLYNTSIRVLCNANT-SEGFNPLKDVSLPEIHLYGGEVSPKLLSLPPENAPR 231

Query: 169 NKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEVN 226
             LAFFAG ++ P+R  LLQ W+N D +I  +    K   Y   +L SKFCL   GFEV 
Sbjct: 232 RYLAFFAGGMHGPIRPILLQHWKNRDKDILVNEYLPKGIDYYSIMLSSKFCLCPSGFEVA 291

Query: 227 TARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLL 286
           + RI +S+Y  CVPVI++N+Y LPF+DVL W++FS+ V   DIP LK+IL  I   +Y  
Sbjct: 292 SPRIVESIYAECVPVILSNYYVLPFSDVLQWEAFSVQVDVSDIPRLKEILSAIPESKYKK 351

Query: 287 LQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
           L+  V  VR+HF  +     +D F+M+++ +WLRR ++++
Sbjct: 352 LKQGVRAVRRHFTLNQPAKRFDVFHMILHSIWLRRLNIKL 391


>gi|15239118|ref|NP_196722.1| Exostosin family protein [Arabidopsis thaliana]
 gi|7573418|emb|CAB87721.1| putative protein [Arabidopsis thaliana]
 gi|332004317|gb|AED91700.1| Exostosin family protein [Arabidopsis thaliana]
          Length = 546

 Score =  204 bits (520), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 126/339 (37%), Positives = 191/339 (56%), Gaps = 31/339 (9%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPR----GNYASESYFKKAFMKSH-FVTKDPSK 55
           M ++ +VYVY             P+  +P     G YASE +F K    SH F+TKDP+K
Sbjct: 216 MEQTLKVYVYSEGDR--------PIFHQPEAIMEGIYASEGWFMKLMESSHRFLTKDPTK 267

Query: 56  ADLFFLPFSIARMR-----HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVAC 110
           A LF++PFS   ++     HD       +  ++ +YI  I+  YP WNRT G+DHF+ AC
Sbjct: 268 AHLFYIPFSSRILQQKLYVHDSH-SRNNLVKYLGNYIDLIASNYPSWNRTCGSDHFFTAC 326

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ--IWPRQEDPPKLGSS-- 166
           H    +  +  +   +N I+ +C++   I   +  KDVSLP+  +   Q    K+G S  
Sbjct: 327 HDWAPTETRGPY---INCIRALCNADVGID-FVVGKDVSLPETKVSSLQNPNGKIGGSRP 382

Query: 167 -KRNKLAFFAGAVNSPVREKLLQVW--RNDSEIYAHSGRLKTPYADELLGSKFCLHVKGF 223
            KR  LAFFAG+++  VR  LL  W  R + ++   +      Y   +  S+FC+  KG+
Sbjct: 383 SKRTILAFFAGSLHGYVRPILLNQWSSRPEQDMKIFNRIDHKSYIRYMKRSRFCVCAKGY 442

Query: 224 EVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEE 283
           EVN+ R+ +S+ YGCVPVII++++  PF ++LNW+SF++ V   +IP L+KIL  I    
Sbjct: 443 EVNSPRVVESILYGCVPVIISDNFVPPFLEILNWESFAVFVPEKEIPNLRKILISIPVRR 502

Query: 284 YLLLQSYVLKVRKHFQWH-VFPSDYDAFYMVMYELWLRR 321
           Y+ +Q  VLKV+KHF WH   P  YD F+M+++ +W  R
Sbjct: 503 YVEMQKRVLKVQKHFMWHDGEPVRYDIFHMILHSVWYNR 541


>gi|26450497|dbj|BAC42362.1| unknown protein [Arabidopsis thaliana]
          Length = 546

 Score =  204 bits (520), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 126/339 (37%), Positives = 191/339 (56%), Gaps = 31/339 (9%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPR----GNYASESYFKKAFMKSH-FVTKDPSK 55
           M ++ +VYVY             P+  +P     G YASE +F K    SH F+TKDP+K
Sbjct: 216 MEQTLKVYVYSEGDR--------PIFHQPEAIMEGIYASEGWFMKLMESSHRFLTKDPTK 267

Query: 56  ADLFFLPFSIARMR-----HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVAC 110
           A LF++PFS   ++     HD       +  ++ +YI  I+  YP WNRT G+DHF+ AC
Sbjct: 268 AHLFYIPFSSRILQQKLYVHDSH-SRNNLVKYLGNYIDLIASNYPSWNRTCGSDHFFTAC 326

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ--IWPRQEDPPKLGSS-- 166
           H    +  +  +   +N I+ +C++   I   +  KDVSLP+  +   Q    K+G S  
Sbjct: 327 HDWAPTETRGPY---INCIRALCNADVGID-FVVGKDVSLPETKVSSLQNPNGKIGGSRP 382

Query: 167 -KRNKLAFFAGAVNSPVREKLLQVW--RNDSEIYAHSGRLKTPYADELLGSKFCLHVKGF 223
            KR  LAFFAG+++  VR  LL  W  R + ++   +      Y   +  S+FC+  KG+
Sbjct: 383 SKRTILAFFAGSLHGYVRPILLNQWSSRPEQDMKIFNRIDHKSYIRYMKRSRFCVCAKGY 442

Query: 224 EVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEE 283
           EVN+ R+ +S+ YGCVPVII++++  PF ++LNW+SF++ V   +IP L+KIL  I    
Sbjct: 443 EVNSPRVVESILYGCVPVIISDNFVPPFLEILNWESFAVFVPEKEIPNLRKILISIPVRR 502

Query: 284 YLLLQSYVLKVRKHFQWH-VFPSDYDAFYMVMYELWLRR 321
           Y+ +Q  VLKV+KHF WH   P  YD F+M+++ +W  R
Sbjct: 503 YVEMQKRVLKVQKHFMWHDGEPVRYDIFHMILHSVWYNR 541


>gi|449469432|ref|XP_004152424.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
           sativus]
          Length = 472

 Score =  204 bits (519), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 120/308 (38%), Positives = 177/308 (57%), Gaps = 16/308 (5%)

Query: 30  RGNYASESYFKKAFMKS-HFVTKDPSKADLFFLPFSIARMRHDRRI----GTEGIPDFIS 84
           RG YASE +F K   ++  FVTKDP KA LF+L +S  +++    +      + +  ++ 
Sbjct: 156 RGIYASEGWFMKLMEENRQFVTKDPEKAHLFYLAYSSRQLQTALYVPDSHNMKPLSIYLR 215

Query: 85  HYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
            ++  I+ KYPYWNRT G DHF VACH  G   + +  E+  + I+ +C++         
Sbjct: 216 DHVNWIAGKYPYWNRTHGYDHFLVACHDWGPYTVNEHRELSQHTIKALCNADLSEGVFKL 275

Query: 145 HKDVSLPQIWPRQEDPP--KLGS---SKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAH 199
            KDVSLP+   R    P   +G    S+R  LAFFAG ++  VR  LL+ W +  +    
Sbjct: 276 GKDVSLPETTIRTPRKPLRNVGGKRVSQRPILAFFAGNMHGRVRPILLKHWNDKDDDIRV 335

Query: 200 SGRL------KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFAD 253
            G L      K  Y   +  SK+C+   G+EVN+ RI +++YY CVPVIIA+++ LPF++
Sbjct: 336 YGPLPLRVSRKMTYIQHMKSSKYCICPMGYEVNSPRIIEAIYYECVPVIIADNFVLPFSE 395

Query: 254 VLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
            L+W +FS+VVA  DIP LK+IL  I  + YL +Q  V  V+KHF W+  P  YD F+MV
Sbjct: 396 FLDWSAFSVVVAEKDIPKLKEILTAIPLKRYLTMQINVKMVQKHFLWNPKPLKYDLFHMV 455

Query: 314 MYELWLRR 321
           ++ +W  R
Sbjct: 456 LHSIWFSR 463


>gi|15239502|ref|NP_197954.1| Exostosin family protein [Arabidopsis thaliana]
 gi|110738111|dbj|BAF00988.1| hypothetical protein [Arabidopsis thaliana]
 gi|332006108|gb|AED93491.1| Exostosin family protein [Arabidopsis thaliana]
          Length = 654

 Score =  204 bits (519), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 132/344 (38%), Positives = 184/344 (53%), Gaps = 38/344 (11%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH--FVTKDPSKADL 58
           M +  +VY Y         + +L      RG YASE +F      ++  FVTKDP+KA L
Sbjct: 320 MEKILKVYAYKEGNKPIMHSPIL------RGIYASEGWFMNIIESNNNKFVTKDPAKAHL 373

Query: 59  FFLPFSIARM---------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVA 109
           F+LPFS +RM          H  R     +  ++  YI  IS KYP+WNRT GADHF  A
Sbjct: 374 FYLPFS-SRMLEVTLYVQDSHSHR----NLIKYLKDYIDFISAKYPFWNRTSGADHFLAA 428

Query: 110 CHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGSSK 167
           CH    S  +K       +I+ +C+S     G +  KD SLP+ + R    P   +G   
Sbjct: 429 CHDWAPSETRKHMA---KSIRALCNSD-VKEGFVFGKDTSLPETFVRDPKKPLSNMGGKS 484

Query: 168 RNK---LAFFAGAVNSP-VREKLLQVWRN----DSEIYAHSGRLK--TPYADELLGSKFC 217
            N+   LAFFAG  +   +R  LL  W N    D +I+    R K    Y   +  SK+C
Sbjct: 485 ANQRPILAFFAGKPDHGYLRPILLSYWGNNKDPDLKIFGKLPRTKGNKNYLQFMKTSKYC 544

Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILK 277
           +  KGFEVN+ R+ ++++Y CVPVII++++  PF +VLNW+SF+I +   DIP LKKIL 
Sbjct: 545 ICAKGFEVNSPRVVEAIFYDCVPVIISDNFVPPFFEVLNWESFAIFIPEKDIPNLKKILM 604

Query: 278 GISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
            I    Y  +Q  V KV+KHF WH  P  YD F+M+++ +W  R
Sbjct: 605 SIPESRYRSMQMRVKKVQKHFLWHAKPEKYDMFHMILHSIWYNR 648


>gi|9758523|dbj|BAB08970.1| unnamed protein product [Arabidopsis thaliana]
          Length = 559

 Score =  204 bits (518), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 123/342 (35%), Positives = 191/342 (55%), Gaps = 32/342 (9%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH--FVTKDPSKADL 58
           M R  ++YVY       F   +      PRG YASE +F K  M+S+  FV KDP KA L
Sbjct: 222 MERKLKIYVYKEGGKPIFHTPM------PRGIYASEGWFMK-LMESNKKFVVKDPRKAHL 274

Query: 59  FFLPFSIARMRHDRRIG---TEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
           F++P SI  +R    +     + + D +  Y+  I+ KY +WNRTGGADHF VACH  G 
Sbjct: 275 FYIPISIKALRSSLGLDFQTPKSLADHLKEYVDLIAGKYKFWNRTGGADHFLVACHDWGN 334

Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPK--LG---SSKRNK 170
               K  +   N+++ +C+S+    G     D +LP  + R  + P   LG   SS+R  
Sbjct: 335 KLTTKTMK---NSVRSLCNSN-VAQGFRIGTDTALPVTYIRSSEAPLEYLGGKTSSERKI 390

Query: 171 LAFFAGAVNSPVREKLLQVWRN---DSEIYA---HSGRLKTPYADELLGS-----KFCLH 219
           LAFFAG+++  +R  L+++W N   D +I+       + K  Y + +  S     ++C+ 
Sbjct: 391 LAFFAGSMHGYLRPILVKLWENKEPDMKIFGPMPRDPKSKKQYREYMKSSSSHFNRYCIC 450

Query: 220 VKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGI 279
            +G+EV+T R+ +++   CVPVIIA++Y  PF +VLNW+ F++ V   DIP L+ IL  I
Sbjct: 451 ARGYEVHTPRVVEAIINECVPVIIADNYVPPFFEVLNWEEFAVFVEEKDIPNLRNILLSI 510

Query: 280 SSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
             + Y+ +Q+ V  V++HF WH  P  +D F+M+++ +W  R
Sbjct: 511 PEDRYIGMQARVKAVQQHFLWHKKPVKFDQFHMILHSIWYSR 552


>gi|224129250|ref|XP_002320538.1| predicted protein [Populus trichocarpa]
 gi|222861311|gb|EEE98853.1| predicted protein [Populus trichocarpa]
          Length = 332

 Score =  203 bits (517), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 123/340 (36%), Positives = 190/340 (55%), Gaps = 22/340 (6%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPV--DFEPRGNYASESYFKKAFMKSH-FVTKDPSKAD 57
           M + F++Y+Y  +  DP      P+  D   +  Y+SE  F     K   F T DP +A 
Sbjct: 1   MEKLFKIYIY--KEGDP------PMFHDGPCKSIYSSEGRFIHELEKGKSFTTTDPDEAL 52

Query: 58  LFFLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
           ++FLPFS+  +     +      + I + +  YI  I+ KYP+WNR+ GADHF ++CH  
Sbjct: 53  VYFLPFSVVMLVQYLYVPGSHEIDAIGNTVVDYINVIADKYPFWNRSLGADHFILSCHDW 112

Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKL----GSSKRN 169
           G         +  N+I+V+C+++    G    KD S P+I  R  +   L      S+R+
Sbjct: 113 GPRTSSYVPHLFNNSIRVLCNANTS-EGFNPKKDASFPEIHLRTGEITGLVGGPSPSRRS 171

Query: 170 KLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSG-RLKTPYADELLGSKFCLHVKGFEVNT 227
            LAFFAG ++  +R  LL+ W++ D ++  H   R    Y   L  S+FCL   G+EV +
Sbjct: 172 ILAFFAGRLHGHIRRLLLEQWKDKDQDVQVHDQLRNGMSYDSMLKNSRFCLCPSGYEVAS 231

Query: 228 ARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLL 287
            RI +++Y  CVPV+I++ Y  PF+DVLNWK+FSI V   DIP +K IL GIS  +YL +
Sbjct: 232 PRIVEAIYAECVPVLISDGYVPPFSDVLNWKAFSIQVQVKDIPKIKDILMGISQRQYLRM 291

Query: 288 QSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
           Q  V +V++HF  +  P  +D F+M ++ +WLRR ++R+ 
Sbjct: 292 QRRVKQVQRHFVVNGIPKRFDVFHMTIHSIWLRRLNIRIH 331


>gi|449531189|ref|XP_004172570.1| PREDICTED: probable glycosyltransferase At5g03795-like, partial
           [Cucumis sativus]
          Length = 344

 Score =  203 bits (517), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 120/308 (38%), Positives = 177/308 (57%), Gaps = 16/308 (5%)

Query: 30  RGNYASESYFKKAFMKS-HFVTKDPSKADLFFLPFSIARMRHDRRI----GTEGIPDFIS 84
           RG YASE +F K   ++  FVTKDP KA LF+L +S  +++    +      + +  ++ 
Sbjct: 28  RGIYASEGWFMKLMEENRQFVTKDPEKAHLFYLAYSSRQLQTALYVPDSHNMKPLSIYLR 87

Query: 85  HYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
            ++  I+ KYPYWNRT G DHF VACH  G   + +  E+  + I+ +C++         
Sbjct: 88  DHVNWIAGKYPYWNRTHGYDHFLVACHDWGPYTVNEHRELSQHTIKALCNADLSEGVFKL 147

Query: 145 HKDVSLPQIWPRQEDPP--KLGS---SKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAH 199
            KDVSLP+   R    P   +G    S+R  LAFFAG ++  VR  LL+ W +  +    
Sbjct: 148 GKDVSLPETTIRTPRKPLRNVGGKRVSQRPILAFFAGNMHGRVRPILLKHWNDKDDDIRV 207

Query: 200 SGRL------KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFAD 253
            G L      K  Y   +  SK+C+   G+EVN+ RI +++YY CVPVIIA+++ LPF++
Sbjct: 208 YGPLPLRVSRKMTYIQHMKSSKYCICPMGYEVNSPRIIEAIYYECVPVIIADNFVLPFSE 267

Query: 254 VLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
            L+W +FS+VVA  DIP LK+IL  I  + YL +Q  V  V+KHF W+  P  YD F+MV
Sbjct: 268 FLDWSAFSVVVAEKDIPKLKEILTAIPLKRYLTMQINVKMVQKHFLWNPKPLKYDLFHMV 327

Query: 314 MYELWLRR 321
           ++ +W  R
Sbjct: 328 LHSIWFSR 335


>gi|51477380|gb|AAU04753.1| EXO [Cucumis melo]
          Length = 343

 Score =  202 bits (515), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 122/338 (36%), Positives = 189/338 (55%), Gaps = 18/338 (5%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH-FVTKDPSKADLF 59
           M R  ++YVY  +  +P      P     +  Y++E  F     K + + T DP +A L+
Sbjct: 12  MERLLKIYVY--KEGEPPMFHGGPC----KSIYSTEGRFIHEMEKGNLYTTNDPDQALLY 65

Query: 60  FLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
           FLPFS+  +     +        I   I+ YI  IS+K+P+W+R+ GADHF ++CH  G 
Sbjct: 66  FLPFSVVNLVQYLYVPNSHEVNAIGRAITDYINVISKKHPFWDRSLGADHFMLSCHDWGP 125

Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKL----GSSKRNKL 171
                   +  N+I+V+C+++    G +  KD S P+I  R  +   L      S+R+ L
Sbjct: 126 RTTSYVPLLFNNSIRVLCNAN-VSEGFLPSKDASFPEIHLRTGEIDGLIGGLSPSRRSVL 184

Query: 172 AFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKT--PYADELLGSKFCLHVKGFEVNTAR 229
           AFFAG ++  +R  LLQ W+   E       L +   Y   L  S+FCL   G+EV + R
Sbjct: 185 AFFAGRLHGHIRYLLLQEWKEKDEDVLVYEELPSGISYNSMLKKSRFCLCPSGYEVASPR 244

Query: 230 IADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQS 289
           + +++Y  CVPV+I+  Y  PF+DVLNWKSFS+ +   DIP +KKILKGIS  +YL +Q 
Sbjct: 245 VVEAIYAECVPVLISESYVPPFSDVLNWKSFSVQIQVKDIPNIKKILKGISQTQYLRMQR 304

Query: 290 YVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
            V +V++HF  +  P  +DAF+M+++ +WLRR ++ +Q
Sbjct: 305 RVKQVQRHFALNGTPKRFDAFHMILHSIWLRRLNIHIQ 342


>gi|358348291|ref|XP_003638181.1| hypothetical protein MTR_121s0013 [Medicago truncatula]
 gi|355504116|gb|AES85319.1| hypothetical protein MTR_121s0013 [Medicago truncatula]
          Length = 550

 Score =  202 bits (515), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 127/341 (37%), Positives = 189/341 (55%), Gaps = 35/341 (10%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEP--RGNYASESYFKKAFM-KSHFVTKDPSKAD 57
           M R  +VY+Y  R  +       P+  +P  RG YASE +F K       F+ KDP KA 
Sbjct: 222 MERKLKVYIY--REGEK------PIFHQPKMRGIYASEGWFMKLMEGNKRFIVKDPKKAH 273

Query: 58  LFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHS----I 113
           LF+LPFS   +R +     + +  ++  Y+  I+ KY +WNRTGGADHF VACH     I
Sbjct: 274 LFYLPFSSQMLRANLS-DNKKMEQYLDKYVNIIAGKYRFWNRTGGADHFLVACHDWASRI 332

Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS---SKR 168
            R  M+       N I+ +C+++    G    KD +LP  +      P  K+     S+R
Sbjct: 333 TRQPMK-------NCIRSLCNAN-VAKGFQIGKDTTLPATYIHSVMNPLRKIAGKHPSER 384

Query: 169 NKLAFFAGAVNSPVREKLLQVWRN---DSEIY---AHSGRLKTPYADELLGSKFCLHVKG 222
             LAFFAG+++  +R  LL+ W N   D +I+   A     K  Y D +  SK+C+  +G
Sbjct: 385 TILAFFAGSMHGYLRPILLKHWENKEPDMKIFGAMARDAEGKRIYMDYMNSSKYCICARG 444

Query: 223 FEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSE 282
           +EV + RI ++++  CVPVII+++Y  PF +VL W++FS+ V   D+P L+ IL  I+ E
Sbjct: 445 YEVYSPRIVEAIFSECVPVIISDNYVPPFFEVLKWEAFSVFVRERDVPNLRSILLSITEE 504

Query: 283 EYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSS 323
           +YL L   V KV++HF WH  P  YD F+M+++ +W  R S
Sbjct: 505 KYLALHLGVKKVQQHFLWHKVPVKYDLFHMILHSIWNNRLS 545


>gi|297805260|ref|XP_002870514.1| hypothetical protein ARALYDRAFT_915839 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297316350|gb|EFH46773.1| hypothetical protein ARALYDRAFT_915839 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 536

 Score =  201 bits (512), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 122/334 (36%), Positives = 185/334 (55%), Gaps = 41/334 (12%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH--FVTKDPSKADL 58
           M R  +VYVY       F   L      PRG YASE +F K  M+S+  FV +DP KA L
Sbjct: 224 MERKLKVYVYKEGGKPIFHKPL------PRGIYASEGWFMK-LMESNKKFVVRDPRKAHL 276

Query: 59  FFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAM 118
           F++P                  + +  Y+  I+ KY +WNRTGGADHF VACH  G    
Sbjct: 277 FYIPI-----------------NHLKEYVDLIAGKYKFWNRTGGADHFIVACHDWGNKLT 319

Query: 119 QKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPK--LGS---SKRNKLAF 173
           +K  +   N+++ +C+S+    G     D +LP  + R  + P   LG    SKR  LAF
Sbjct: 320 KKTMK---NSVRALCNSN-VAQGFRIGTDTALPVTYIRSAESPLEYLGGKTPSKRKILAF 375

Query: 174 FAGAVNSPVREKLLQVWRN---DSEIYAHSGR---LKTPYADELLGSKFCLHVKGFEVNT 227
           FAG+++  +R  L+++W N   D +I     R    KT Y + +  SK+C+  +G+EV+T
Sbjct: 376 FAGSMHGYLRPILVKLWENKEPDMKIVGPMPRDPESKTQYREYMKSSKYCICARGYEVHT 435

Query: 228 ARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLL 287
            R+ +++   CVPVIIA++Y  PF ++LNW+ F++ V   +I  L+ IL  IS E Y+++
Sbjct: 436 PRVVEAIINECVPVIIADNYVPPFFEILNWEEFAVFVEEKEIANLRNILLSISEERYIVM 495

Query: 288 QSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
           Q+ V  V++HF WH  P  +D F+M+++ +W  R
Sbjct: 496 QARVKAVQQHFLWHKKPVKFDLFHMILHSIWHSR 529


>gi|449462352|ref|XP_004148905.1| PREDICTED: probable glycosyltransferase At5g25310-like [Cucumis
           sativus]
 gi|449523501|ref|XP_004168762.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
           At5g25310-like [Cucumis sativus]
          Length = 684

 Score =  200 bits (509), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 128/336 (38%), Positives = 185/336 (55%), Gaps = 25/336 (7%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH-FVTKDPSKADLF 59
           M  + +VY+Y       F    L      +  YASE +F K    +  FVTK+P KA LF
Sbjct: 351 MESTLKVYIYREGARPIFHQGPL------QSIYASEGWFMKILESNKKFVTKNPRKAHLF 404

Query: 60  FLPFSIARMRHDRRI-GTEGIPDFISH---YIFNISQKYPYWNRTGGADHFYVACHSIGR 115
           +LPFS  ++     +  +    + I H   Y+  I+ KYP+WNRTGGADHF VACH    
Sbjct: 405 YLPFSSRQLEEVLYVRDSHSHKNLIQHLKNYLDFIAAKYPHWNRTGGADHFLVACHDWAP 464

Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLG---SSKRNK 170
           +  +K        I+ +C+S     G +  KDVSLP+ + R    P   +G   SSKR  
Sbjct: 465 AETRKYMA---KCIRALCNSD-VKEGFVFGKDVSLPETFVRVARNPLRDVGGNPSSKRPI 520

Query: 171 LAFFAGAVNSPVREKLLQVW-RNDSEIYAHSGRLKTPYADELL----GSKFCLHVKGFEV 225
           LAFFAG+++  +R  LL+ W R D ++       K   +   L     SK+C+  KG+EV
Sbjct: 521 LAFFAGSMHGYLRSTLLEYWERKDPDMKISGPMPKVKGSKNYLWHMKNSKYCICAKGYEV 580

Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
           N+ R+ +S+ Y CVPVII++++  P  +VLNW+SF++ VA  DIP LKKIL  I  + Y 
Sbjct: 581 NSPRVVESILYECVPVIISDNFVPPLFEVLNWESFAVFVAEKDIPNLKKILLSIPEKRYR 640

Query: 286 LLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
            +Q  V K++ HF WH  P  YD F+M+++ +W  R
Sbjct: 641 EMQMRVKKLQPHFLWHAKPQKYDMFHMILHSIWYNR 676


>gi|168062371|ref|XP_001783154.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162665352|gb|EDQ52040.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 351

 Score =  200 bits (509), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 126/341 (36%), Positives = 186/341 (54%), Gaps = 35/341 (10%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMK--SHFVTKDPSKADL 58
           M ++F+VYVY     D +  ++     +  G YA+E  F K      + +   DP++A +
Sbjct: 1   MQKTFKVYVY----KDGYKPLVHAA--KTGGIYATEGLFLKRMDDPGNRYTVSDPTQAHM 54

Query: 59  FFLPFSIARM---------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVA 109
           F LP+S+ ++         R  R + T     FI++Y+  I+ KYPYWNRT GADHF+V+
Sbjct: 55  FLLPYSVRQLVDFIQDPYSRSMRPLKT-----FIANYVERITSKYPYWNRTRGADHFFVS 109

Query: 110 CHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWP----RQEDPPKLGS 165
           CH     +     E+  N+++VVC++    +  I  KDVS+PQ        + D   L  
Sbjct: 110 CHDWAPLSTILHDELHNNSMKVVCNADLTANFDI-QKDVSIPQAVKGGNQSELDIDNLPP 168

Query: 166 SKRNKLAFFAGAVNSPVREKLLQVWRNDS------EIYAHSGRLKTPYADELLGSKFCLH 219
            KR+ LAF+AG ++  VR  L+Q WR         E+          YA  +  SKFCL 
Sbjct: 169 GKRDYLAFYAGQMHGLVRPVLIQHWRGKDSSMKVYEVLPPEIAKNISYAQHMKRSKFCLC 228

Query: 220 VKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGI 279
            KGFEVN+ RI +++  GCVPVIIA+++ LPF++VL+W  FSI V   DIP LK+IL  +
Sbjct: 229 PKGFEVNSPRIVEAILSGCVPVIIADNFVLPFSNVLDWSKFSITVEEKDIPNLKRILTNV 288

Query: 280 SSEEYLLLQSYVLKVRKHFQWHVFPSD--YDAFYMVMYELW 318
               Y  +QS +  +R+HF W     D  YD+F+M MY +W
Sbjct: 289 PDGTYRSMQSCLKYIRRHFVWLEDQEDTQYDSFHMTMYSIW 329


>gi|224120358|ref|XP_002331028.1| predicted protein [Populus trichocarpa]
 gi|222872958|gb|EEF10089.1| predicted protein [Populus trichocarpa]
          Length = 334

 Score =  199 bits (506), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 124/344 (36%), Positives = 188/344 (54%), Gaps = 30/344 (8%)

Query: 1   MNRSFRVYVY-----PHRRNDPFANVLLPVDFEPRGNYASESYFKKAFM--KSHFVTKDP 53
           M + F+++VY     P   N P  ++           Y+ E  F       KS F+ ++ 
Sbjct: 1   MVKRFKIWVYREGETPMVHNGPMKHI-----------YSIEGQFIDEMESGKSPFLARNH 49

Query: 54  SKADLFFLPFSIARMRHDRRIGT-----EGIPDFISHYIFNISQKYPYWNRTGGADHFYV 108
            +A  FFLP S+A +     +       E +      Y+  ++ KYPYWNR+ G DHF V
Sbjct: 50  DEAHAFFLPISVAYIVEFVYLPITTYHRERLVRIFKDYVTVVANKYPYWNRSRGGDHFMV 109

Query: 109 ACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI--WPRQEDPP--KLG 164
           +CH       +   E+  N I+V+C+++    G    +D +LP++   P +  P    L 
Sbjct: 110 SCHDWAPQVSRDDPELYKNLIRVMCNANTS-EGFRPRRDATLPELNCPPLKLTPACRGLA 168

Query: 165 SSKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELLG-SKFCLHVKG 222
             +R   AFFAG  +  +R+ LL+ W+  D EI  H    K     EL+G SKFCL   G
Sbjct: 169 PHERKIFAFFAGGAHGDIRKILLRHWKEKDDEIQVHEYLPKDQDYMELMGQSKFCLCPSG 228

Query: 223 FEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSE 282
           FEV + R+A+S+Y GCVPVII++HY+LPF+DVL+W  FS+ +    IP +K IL+GIS +
Sbjct: 229 FEVASPRVAESIYSGCVPVIISDHYNLPFSDVLDWSQFSVQIPVEKIPEIKTILRGISYD 288

Query: 283 EYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
           EYL +Q  V+KV++HF  +     YD  +MV++ +WLRR ++RV
Sbjct: 289 EYLKMQKGVMKVQRHFVLNRPAKPYDVLHMVLHSVWLRRLNIRV 332


>gi|356518348|ref|XP_003527841.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
           At5g03795-like [Glycine max]
          Length = 549

 Score =  199 bits (506), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 115/318 (36%), Positives = 177/318 (55%), Gaps = 28/318 (8%)

Query: 24  PVDFEP--RGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTE---- 77
           P+  EP   G YASE +F K    +  VT DP KA LF++PFS   ++    +       
Sbjct: 233 PIFHEPLLDGIYASEGWFMKLMEANKXVTGDPGKAHLFYIPFSSRLLQQTLYVRNSHRHS 292

Query: 78  GIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSY 137
            + +++ +Y+  I+ KYP+WNRT GADHF VACH    +  +      L++I+ +C++  
Sbjct: 293 NLIEYMKNYVKMIAGKYPFWNRTSGADHFVVACHDWAPAETRGRM---LSSIRALCNADI 349

Query: 138 FISGHIAHKDVSLPQIWPR---------QEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQ 188
            +   I  KDVSLP+ + R         + DPP    S+R  LAFFAG ++  V   LL+
Sbjct: 350 EVGFKIG-KDVSLPETYIRSSENPVKNIEGDPP----SQRPILAFFAGGLHVYVXPILLK 404

Query: 189 VWRNDSEIYAHSG-----RLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVII 243
            W N       SG     R    Y   +  SKFC+H +G EVN+ R+ +++++ C+PVII
Sbjct: 405 HWENKEPDMKISGPLPHVRGNVNYIQFMKSSKFCIHARGHEVNSPRVVEAIFHECIPVII 464

Query: 244 ANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVF 303
           ++++  PF ++LNW+SF++ V   +IP L+ IL  IS E YL +   V KV++HF WH  
Sbjct: 465 SDNFIPPFFEILNWESFAVFVTEEEIPNLRNILLSISEERYLEMHKRVKKVQEHFPWHAE 524

Query: 304 PSDYDAFYMVMYELWLRR 321
           P   D  +M+++ +W  R
Sbjct: 525 PVKDDLSHMLLHSIWYNR 542


>gi|356518346|ref|XP_003527840.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
           max]
          Length = 633

 Score =  199 bits (505), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 128/336 (38%), Positives = 184/336 (54%), Gaps = 25/336 (7%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKS-HFVTKDPSKADLF 59
           M R+ +VYVY   R    A +  P+     G YASE +F K    S  FVT DP  A LF
Sbjct: 301 MERTLKVYVY---REGDKAIMHSPI---LSGLYASEGWFMKHMEASKQFVTTDPKNAHLF 354

Query: 60  FLPFSIARMRHDR---RIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRS 116
           +LPFS  R+       +     +  ++S+Y+  I+ K+ +WNRTGGADHF VACH    +
Sbjct: 355 YLPFSSQRLVDALWVPKSSYGNLIQYLSNYVDMIAGKHHFWNRTGGADHFLVACHDWAPA 414

Query: 117 AMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS---SKRNKL 171
             ++     L A+   C++     G +  KD+SLP+   R    P   +G    SKR  L
Sbjct: 415 ETKQHMAKCLRAL---CNAD-VKQGFVFGKDMSLPETVVRSPRNPTRSIGGNQVSKRKTL 470

Query: 172 AFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVN 226
           AFFAG ++  VR  LLQ W N        GRL        Y   +  SK+C+  KG+EVN
Sbjct: 471 AFFAGQMHGYVRPILLQHWENKDPDMKIFGRLPKSKGNRNYIQYMKSSKYCICAKGYEVN 530

Query: 227 TARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLL 286
           + R+ +++ Y CVPVI+++++  PF ++LNW+SF++ V   DIP LK IL  I  + YL 
Sbjct: 531 SPRVVEAILYECVPVILSDNFVPPFFEMLNWESFAVFVLEKDIPNLKNILLSIPRKRYLQ 590

Query: 287 LQSYVLKVRKHFQWH-VFPSDYDAFYMVMYELWLRR 321
           +Q  V KV++HF WH   P  YD F+M+++ +W  R
Sbjct: 591 MQMMVRKVQQHFLWHNKSPVKYDIFHMILHSIWYNR 626


>gi|356575096|ref|XP_003555678.1| PREDICTED: probable glycosyltransferase At3g07620-like [Glycine
           max]
          Length = 493

 Score =  199 bits (505), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 126/344 (36%), Positives = 188/344 (54%), Gaps = 30/344 (8%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGN-YASESYFKKAF-MKSHFVTKDPSKADL 58
           M + F+++VY       F        + P  N Y+ E  F  +  + S F T++P +A +
Sbjct: 163 MEKVFKIFVYEEGEPPLF-------HYGPCKNIYSMEGIFINSLEINSQFRTQNPDEAHV 215

Query: 59  FFLPFSIARMRH--------DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVAC 110
           +FLPFS+  +          D+ +    I D    Y+  IS KY YWNR+ GADHF ++C
Sbjct: 216 YFLPFSVVMILEHLFHPVIRDKAVLERTIGD----YVHIISHKYKYWNRSYGADHFMLSC 271

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDVSLPQI-WPRQEDPPKLGS--- 165
           H  G  A     E+   AI+V+C+++  IS H    KD S P+I     E    +G    
Sbjct: 272 HDWGPRATWYVKELYFIAIRVLCNAN--ISEHFNPKKDASFPEINLVNGETRGLIGGYPP 329

Query: 166 SKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL--KTPYADELLGSKFCLHVKGF 223
             R  LAFFAG ++  +R  L Q W    +      +L    PY + +  SK+C+   GF
Sbjct: 330 CNRTILAFFAGQMHGRIRPVLFQHWEGKDKDVLVYEKLPDGVPYHETMKKSKYCICPSGF 389

Query: 224 EVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEE 283
           EV + RI +++Y  CVPVII+  Y LPF+DVLNW SFS+ +   D+P LK+IL GIS ++
Sbjct: 390 EVASPRIVEAIYAQCVPVIISQQYVLPFSDVLNWDSFSVQILVSDVPKLKEILLGISEDK 449

Query: 284 YLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
           Y+ LQ  V +V++HF  +  P  YD F+M+++ +WLRR +VRV+
Sbjct: 450 YMRLQEGVKQVQRHFVVNNPPKRYDVFHMIIHSIWLRRLNVRVK 493


>gi|297811281|ref|XP_002873524.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297319361|gb|EFH49783.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 546

 Score =  198 bits (504), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 122/342 (35%), Positives = 190/342 (55%), Gaps = 37/342 (10%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPR----GNYASESYFKKAFMKSH-FVTKDPSK 55
           M ++ +VY+Y             P+  +P     G YASE +F K    +H F+TKDP+ 
Sbjct: 216 MEQTLKVYIYSEGDR--------PIFHQPEAIMEGIYASEGWFMKLMESNHRFLTKDPNI 267

Query: 56  ADLFFLPFSIARMR--------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFY 107
           A LF+LPFS   ++        H RR     +  ++ +Y+  I+  YP+WNRT G+DHF+
Sbjct: 268 AHLFYLPFSTRILQQKLYVHDSHSRR----NLVKYLKNYLDLIASNYPFWNRTRGSDHFF 323

Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ--IWPRQEDPPKLGS 165
            ACH    +  +  +   +N I+ +C++   +   +  KDVSLP+  I   Q     +G 
Sbjct: 324 TACHDWAPAETRGPY---INCIRSLCNADVGVD-FVVGKDVSLPETKISSSQNPNGNIGG 379

Query: 166 ---SKRNKLAFFAGAVNSPVREKLLQVW--RNDSEIYAHSGRLKTPYADELLGSKFCLHV 220
              SKR  LAFFAG ++  VR  LL  W  R + ++   +      Y   +  S+FC+  
Sbjct: 380 NRPSKRTILAFFAGNLHGYVRPILLNQWSSRPEPDMKIFNRIDHKSYIRYMKRSRFCVCA 439

Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
           KG+EVN+ R+ +S+ YGCVPVII++++  PF ++LNW+SF++ V   +IP L+KIL  I 
Sbjct: 440 KGYEVNSPRVVESVLYGCVPVIISDNFVPPFLEILNWESFAVFVPEKEIPNLRKILISIP 499

Query: 281 SEEYLLLQSYVLKVRKHFQWH-VFPSDYDAFYMVMYELWLRR 321
              Y+ +Q  V+KV+KHF WH   P  YD F+M+++ +W  R
Sbjct: 500 VRRYVEMQKRVMKVQKHFMWHDGEPVRYDVFHMILHSVWYNR 541


>gi|356554237|ref|XP_003545455.1| PREDICTED: probable glycosyltransferase At5g25310-like [Glycine
           max]
          Length = 483

 Score =  198 bits (504), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 122/344 (35%), Positives = 191/344 (55%), Gaps = 26/344 (7%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYF--KKAFMKSHFVTKDPSKADL 58
           M + F+VYVYP        ++ +  D   +  Y+ E  F  +       F T DP+ A +
Sbjct: 149 MEKIFKVYVYPD------GDLPIAHDGPCKDIYSIEGRFLHEMEHGAGRFRTNDPNAAHV 202

Query: 59  FFLPFSIARMRH----DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
           +FLPFS+  M              +  F+S Y+  IS ++P+WN T GADHF +ACH  G
Sbjct: 203 YFLPFSVTWMVKYLYTPLSFNVTPLKQFVSDYVRVISTRHPFWNITHGADHFMLACHDWG 262

Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW-------PRQEDPPKLGSSK 167
             A Q    +   +I+V+C+++    G    KDVSLP+I        P+   PP   ++ 
Sbjct: 263 PHASQGNPFLYNTSIRVLCNANT-SEGFNPRKDVSLPEIHLYGGEVSPKLLSPPP-DTAP 320

Query: 168 RNKLAFFAGAVNSPVREKLLQVWRNDSE----IYAHSGRLKTPYADELLGSKFCLHVKGF 223
           R  LAFF+G ++ P+R  LL+ W+ND++    +Y +  +    Y   +L SKFCL   G 
Sbjct: 321 RRYLAFFSGGLHGPIRPALLRHWKNDNDDDIRVYEYLPK-DLDYYSFMLNSKFCLCPSGH 379

Query: 224 EVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEE 283
           EV + RI +++Y  CVPVI++ +Y LPF+DVL W++FS+ V   DIP LK+IL  IS ++
Sbjct: 380 EVASPRIVEAIYAECVPVILSEYYVLPFSDVLQWEAFSVQVDVSDIPRLKEILSAISEDK 439

Query: 284 YLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
           Y  L+  V  VR HF  +     +D F+M+++ +WLRR +++++
Sbjct: 440 YRKLKEGVKAVRGHFTLNRPAKRFDVFHMILHSIWLRRLNIKLR 483


>gi|356565157|ref|XP_003550811.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
           [Glycine max]
          Length = 411

 Score =  198 bits (503), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 116/300 (38%), Positives = 177/300 (59%), Gaps = 15/300 (5%)

Query: 46  SHFVTKDPSKADLFFLPFSIARMRH------DRRIGTEGI--PDFISHYIFNISQKYPYW 97
           SHF  + P +A +FFLPFSIA + H       +    E I     +  YI  I+ KYPYW
Sbjct: 112 SHFRAEHPDQAQVFFLPFSIANVVHYVYKPIRKHSDYEPIRLQRLVEDYIGVIANKYPYW 171

Query: 98  NRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW-PR 156
           NR+ GADHF ++CH  G        ++  N I+V+C+++    G + +KDVS+P+++ P+
Sbjct: 172 NRSEGADHFLLSCHDWGPKVSYGNPKLFKNFIRVLCNANTS-EGFLPNKDVSIPEVYLPK 230

Query: 157 QE-DPPKLGS--SKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELL 212
            +  PP LG   + R+ LAFFAG  +  +R+ LL  W+  D++I  H    K     +L+
Sbjct: 231 GKLGPPNLGQRPNDRSILAFFAGREHGDIRKILLNHWKGKDNDIQVHEYLPKGKNYTQLM 290

Query: 213 G-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPL 271
           G SKFCL   G+EV + R+ ++++ GCVPV+I++ Y  PF DVLNW  FS+ +    I  
Sbjct: 291 GQSKFCLCPSGYEVASPRVVEAIHAGCVPVLISSSYSPPFTDVLNWSQFSVEIPVEKISE 350

Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWSTS 331
           +K IL+ IS   YL L   VL+VR+HF  +     +D  +M+++ +WLRR ++R+  STS
Sbjct: 351 IKTILQSISRNRYLRLHMNVLRVRRHFMLNRPAKPFDLMHMILHSIWLRRLNLRLIGSTS 410


>gi|147805375|emb|CAN76361.1| hypothetical protein VITISV_035438 [Vitis vinifera]
          Length = 459

 Score =  197 bits (502), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 115/300 (38%), Positives = 174/300 (58%), Gaps = 24/300 (8%)

Query: 45  KSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDF--------ISHYIFNISQKYPY 96
           KS F+ K P +A  FFLP S+ ++     +      D+        ++ Y+  ++ KYPY
Sbjct: 154 KSPFIAKHPDEAHAFFLPLSVVKVVQFLYLPITSPEDYSRKRLQRVVTDYVKVVADKYPY 213

Query: 97  WNRTGGADHFYVACHS-IGRSAMQKAW--------EVKLNAIQVVCSSSYFISGHIAHKD 147
           WNR+GGADHF V+CH  +  S+++           E+  N I+V+C+++    G    +D
Sbjct: 214 WNRSGGADHFMVSCHDWVSSSSLEPMAPSVSYANPELFKNFIRVLCNANS-SEGFRPGRD 272

Query: 148 VSLPQI-WPRQE-DPPKLGSSKRNK--LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGR 202
           VSLP++  P  E  PP LG    N+  LAFFAG  +  +R+ L + W++ D+E+  H   
Sbjct: 273 VSLPEVNLPAGELGPPHLGQPSNNRPILAFFAGRAHGNIRKILFEHWKDQDNEVLVHERL 332

Query: 203 LKTPYADELLG-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFS 261
            K     +L+G SKFCL   G+EV + R+ ++++ GCVPVII+N Y LPF DVL+W  FS
Sbjct: 333 HKGQNYAKLMGQSKFCLCPSGYEVASPRVVEAIHAGCVPVIISNXYSLPFNDVLDWSQFS 392

Query: 262 IVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
           I +    IP +K IL GIS  +YL +Q  VL+VR+HF  +     +D  +M+++ LWLRR
Sbjct: 393 IQIPEAKIPEIKTILLGISKNKYLKMQERVLRVRRHFVLNRPARPFDIIHMILHSLWLRR 452


>gi|356548516|ref|XP_003542647.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
           [Glycine max]
          Length = 340

 Score =  197 bits (501), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 112/299 (37%), Positives = 176/299 (58%), Gaps = 24/299 (8%)

Query: 46  SHFVTKDPSKADLFFLPFSIARM------RHDRRIGTEGIPDFISHYIFNISQKYPYWNR 99
           SHF  ++P++A +F +PFSI  +      R+ R+ G++ I   +  YI  I+ KYPYWNR
Sbjct: 44  SHFRARNPNQAHVFLIPFSIVNIVQYVYNRNLRQPGSQSIQLLVEDYIRVIAHKYPYWNR 103

Query: 100 TGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI--WPRQ 157
           T GADHF ++CH  G +      ++  N I+V+C+++    G   +KDVS+P++   PR 
Sbjct: 104 TEGADHFLLSCHDWGPTISYANPKLFKNFIRVLCNANTS-EGFRPNKDVSIPEVNLLPRG 162

Query: 158 EDPPKLGSSKRNK-------LAFFAGAVNSPVREKLLQVWR---NDSEIYAHSGRLKTPY 207
                LGS  R +       LAFFAG  +  +R  LL  W+   ND +IY    + K  Y
Sbjct: 163 ----TLGSPNRGQHPNDRTILAFFAGREHGAIRTILLNHWKDKDNDVQIYESLPKGKV-Y 217

Query: 208 ADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATL 267
              +  SKFCL   G+EV + R+ +++Y GCVPV+I++ Y  PF DVLNW  FS+ +   
Sbjct: 218 TKLMGQSKFCLCPSGYEVASPRVVEAIYAGCVPVLISSSYSPPFTDVLNWSQFSVEIPVE 277

Query: 268 DIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
            IP +K IL+ +S ++YL LQ  VL+V++HF  +     +D  +M+++ +WLRR ++++
Sbjct: 278 KIPEIKTILQSVSPKKYLKLQMNVLRVQRHFTINRPAKPFDLMHMILHSIWLRRLNLKL 336


>gi|255567220|ref|XP_002524591.1| catalytic, putative [Ricinus communis]
 gi|223536144|gb|EEF37799.1| catalytic, putative [Ricinus communis]
          Length = 507

 Score =  197 bits (500), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 120/339 (35%), Positives = 189/339 (55%), Gaps = 20/339 (5%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH-FVTKDPSKADLF 59
           M + F++YVY      P  +     D   +  Y+SE  F     K   + T DP +A ++
Sbjct: 173 MEKQFKIYVY-EEGGPPMYH-----DGPCKSIYSSEGRFIHELEKGKLYRTLDPDEALVY 226

Query: 60  FLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
           FLPFS+  M     +     T  I   I  YI  IS K+P+WNR+ GADHF ++CH  G 
Sbjct: 227 FLPFSVVMMVEYLYVPDSHETNAIGRAIVDYIHVISNKHPFWNRSLGADHFMLSCHDWGP 286

Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKL----GSSKRNKL 171
            A      +  ++I+V+C+++    G    KD S P+I  +  +   L      S+R+ L
Sbjct: 287 RASSYVPHLFNSSIRVLCNANTS-EGFNPSKDASFPEIHLKTGEISGLLGGVSPSRRSIL 345

Query: 172 AFFAGAVNSPVREKLLQVWRN---DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTA 228
           AFFAG ++  +R+ LL+ W+N   D ++Y         Y   L  S+FCL   G+EV + 
Sbjct: 346 AFFAGRLHGHIRQILLEQWKNKDEDVQVYDQMPN-GVSYESMLKTSRFCLCPSGYEVASP 404

Query: 229 RIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
           RI +++Y  CVPV+I+++Y  PF+DVLNWK+FS+ +   DIP +K+IL GIS  +YL +Q
Sbjct: 405 RIVEAIYTECVPVLISDNYVPPFSDVLNWKAFSVQIQVRDIPKIKEILMGISQRQYLRMQ 464

Query: 289 SYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
             + +V++HF  +  P  +D F+M ++ +WLRR ++ +Q
Sbjct: 465 RRLKQVQRHFVVNGPPKRFDMFHMTIHSIWLRRLNIHIQ 503


>gi|334187609|ref|NP_196674.2| putative glycosyltransferase [Arabidopsis thaliana]
 gi|292630779|sp|Q9LFP3.2|GLYT4_ARATH RecName: Full=Probable glycosyltransferase At5g11130
 gi|332004254|gb|AED91637.1| putative glycosyltransferase [Arabidopsis thaliana]
          Length = 480

 Score =  196 bits (499), Expect = 9e-48,   Method: Compositional matrix adjust.
 Identities = 118/314 (37%), Positives = 174/314 (55%), Gaps = 22/314 (7%)

Query: 33  YASESYFKKAFMK--SHFVTKDPSKADLFFLPFSIARM-----RHDRRIGTEGIPDFISH 85
           YA E  F        S F    P +A +F++P  I  +     R       + + + +  
Sbjct: 171 YAIEGQFMDEIENGNSRFKAASPEEATVFYIPVGIVNIIRFVYRPYTSYARDRLQNIVKD 230

Query: 86  YIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH 145
           YI  IS +YPYWNR+ GADHF+++CH           E+  + I+ +C+++    G    
Sbjct: 231 YISLISNRYPYWNRSRGADHFFLSCHDWAPDVSAVDPELYKHFIRALCNANS-SEGFTPM 289

Query: 146 KDVSLPQI--------WPRQEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRN-DSEI 196
           +DVSLP+I        +    +PP+     R  LAFFAG  +  VR+ L Q W+  D ++
Sbjct: 290 RDVSLPEINIPHSQLGFVHTGEPPQ----NRKLLAFFAGGSHGDVRKILFQHWKEKDKDV 345

Query: 197 YAHSGRLKT-PYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
             +    KT  Y   +  +KFCL   G+EV + RI +SLY GCVPVIIA++Y LPF+DVL
Sbjct: 346 LVYENLPKTMNYTKMMDKAKFCLCPSGWEVASPRIVESLYSGCVPVIIADYYVLPFSDVL 405

Query: 256 NWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMY 315
           NWK+FS+ +    +P +KKIL+ I+ EEYL +Q  VL+VRKHF  +     YD  +M+M+
Sbjct: 406 NWKTFSVHIPISKMPDIKKILEAITEEEYLNMQRRVLEVRKHFVINRPSKPYDMLHMIMH 465

Query: 316 ELWLRRSSVRVQWS 329
            +WLRR +VR+  S
Sbjct: 466 SIWLRRLNVRIPLS 479


>gi|8953375|emb|CAB96648.1| putative protein [Arabidopsis thaliana]
          Length = 336

 Score =  196 bits (499), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 124/351 (35%), Positives = 185/351 (52%), Gaps = 38/351 (10%)

Query: 1   MNRSFRVYVY-----PHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMK--SHFVTKDP 53
           M + F+++ Y     P     P  N+           YA E  F        S F    P
Sbjct: 1   MEKRFKIWTYREGEAPLFHKGPLNNI-----------YAIEGQFMDEIENGNSRFKAASP 49

Query: 54  SKADLFFLPFSIARM-----RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYV 108
            +A +F++P  I  +     R       + + + +  YI  IS +YPYWNR+ GADHF++
Sbjct: 50  EEATVFYIPVGIVNIIRFVYRPYTSYARDRLQNIVKDYISLISNRYPYWNRSRGADHFFL 109

Query: 109 ACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI--------WPRQEDP 160
           +CH           E+  + I+ +C+++    G    +DVSLP+I        +    +P
Sbjct: 110 SCHDWAPDVSAVDPELYKHFIRALCNANSS-EGFTPMRDVSLPEINIPHSQLGFVHTGEP 168

Query: 161 PKLGSSKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKT-PYADELLGSKFCL 218
           P+     R  LAFFAG  +  VR+ L Q W+  D ++  +    KT  Y   +  +KFCL
Sbjct: 169 PQ----NRKLLAFFAGGSHGDVRKILFQHWKEKDKDVLVYENLPKTMNYTKMMDKAKFCL 224

Query: 219 HVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKG 278
              G+EV + RI +SLY GCVPVIIA++Y LPF+DVLNWK+FS+ +    +P +KKIL+ 
Sbjct: 225 CPSGWEVASPRIVESLYSGCVPVIIADYYVLPFSDVLNWKTFSVHIPISKMPDIKKILEA 284

Query: 279 ISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWS 329
           I+ EEYL +Q  VL+VRKHF  +     YD  +M+M+ +WLRR +VR+  S
Sbjct: 285 ITEEEYLNMQRRVLEVRKHFVINRPSKPYDMLHMIMHSIWLRRLNVRIPLS 335


>gi|356562965|ref|XP_003549738.1| PREDICTED: probable glycosyltransferase At3g42180-like [Glycine
           max]
          Length = 473

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 111/298 (37%), Positives = 176/298 (59%), Gaps = 15/298 (5%)

Query: 46  SHFVTKDPSKADLFFLPFSIARM----RHDRRIGTEGIPD----FISHYIFNISQKYPYW 97
           S F    P +A LF LP+S++++       RR  ++  PD     ++ YI  ++ +YPYW
Sbjct: 176 SPFKATHPEQAHLFLLPYSVSKVIRYVYKPRRSRSDYDPDRLQRLVADYINILANRYPYW 235

Query: 98  NRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW--P 155
           NR+ GADHF V+CH  G        E+    I+ +C+++    G   ++DVS+P+++   
Sbjct: 236 NRSKGADHFLVSCHDWGPRISDANPELFKYFIRALCNANTS-EGFQPNRDVSIPEVYLPS 294

Query: 156 RQEDPPKLGSSKRNK--LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELL 212
            +  PP +G    N+  LAFFAG  +  +R+KLL+ W+N D E+  H    K     +L+
Sbjct: 295 GKLGPPNMGQHPNNRTILAFFAGGAHGKIRKKLLKRWKNKDKEVQVHEYLPKGQDYTKLM 354

Query: 213 G-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPL 271
           G SKFCL   G EV + R+ +++Y GCVPVII ++Y LPF DVLNW+ FS+ +A   +P 
Sbjct: 355 GLSKFCLCPSGHEVASPRVVEAIYAGCVPVIICDNYSLPFIDVLNWRKFSMEIAVERMPE 414

Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWS 329
           +K IL+ +S ++YL L S V +VR+HF  +     +D  +M+++ LWLRR + ++  S
Sbjct: 415 IKTILQSVSKDKYLELYSNVRRVRRHFVINRPAKPFDLIHMILHSLWLRRLNFKLTAS 472


>gi|356562054|ref|XP_003549290.1| PREDICTED: probable glycosyltransferase At5g25310-like [Glycine
           max]
          Length = 482

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 122/345 (35%), Positives = 188/345 (54%), Gaps = 27/345 (7%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYF--KKAFMKSHFVTKDPSKADL 58
           M + F+VYVYP        ++ +  D   +  Y+ E  F  +       F T DP+ A +
Sbjct: 147 MEKIFKVYVYPD------GDLPIAHDGPCKDIYSIEGRFLHEMEHGAGRFRTNDPNAAHV 200

Query: 59  FFLPFSIARMRH----DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
           FFLPFS+  M              +  F+S Y+  +S ++P+WN T GADHF +ACH  G
Sbjct: 201 FFLPFSVTWMVKYLYTPLSFNVTPLKKFVSDYVRVVSTRHPFWNITHGADHFMLACHDWG 260

Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW-------PRQEDPPKLGSSK 167
             A Q    +   +I+V+C+++    G    KDVSLP+I        P+   PP   ++ 
Sbjct: 261 PHASQGNPFLYNTSIRVLCNANT-SEGFNPRKDVSLPEIHLYGGEVSPKLLSPPP-DTAP 318

Query: 168 RNKLAFFAGAVNSPVREKLLQVWRNDSE-----IYAHSGRLKTPYADELLGSKFCLHVKG 222
           R  LAFF+G ++ P+R  LL  W+N  E     +Y +  +    Y   +L SKFCL   G
Sbjct: 319 RRYLAFFSGGLHGPIRPALLGHWKNHDENDVIRVYEYLPK-DLDYYSFMLTSKFCLCPSG 377

Query: 223 FEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSE 282
            EV + RI +++Y  CVPVI++ +Y LPF+DVL W++FS+ V   DIP LK+IL  IS +
Sbjct: 378 HEVASPRIVEAIYAECVPVILSEYYVLPFSDVLQWEAFSVQVDVSDIPRLKEILSAISED 437

Query: 283 EYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
           +Y  L+  V  VR+HF  +     +D F+M+++ +WLRR ++ ++
Sbjct: 438 KYRKLKEGVKAVRRHFTLNRPAKRFDVFHMILHSIWLRRLNIELR 482


>gi|297808545|ref|XP_002872156.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297317993|gb|EFH48415.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 334

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 121/341 (35%), Positives = 192/341 (56%), Gaps = 23/341 (6%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAF--MKSHFVTKDPSKADL 58
           M + F+VYVY      P  +     D   +  YA E  F      M++ F T D ++A +
Sbjct: 1   MEKRFKVYVY-EEGEPPLVH-----DGPCKSVYAVEGRFITEIEKMRTKFRTYDANQAYV 54

Query: 59  FFLPFSIA---RMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
           +FLPFS+    R  ++     + +  F+S YI  +S  +P+WNRT GADHF +ACH  G 
Sbjct: 55  YFLPFSVTWLVRYLYEGNSDAKPLRTFVSDYIRLVSTNHPFWNRTNGADHFMLACHDWGP 114

Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI--WPRQEDPP-----KLGSSKR 168
              Q   ++   +I+V+C+++    G    KDV+LP+I  +  + DP       L +S R
Sbjct: 115 LTSQADNDLFNTSIRVMCNANS-SEGFNPSKDVTLPEIKLYGGEVDPKLRLSKTLSASPR 173

Query: 169 NKLAFFAGAVNSPVREKLLQVWRN---DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEV 225
             L FFAG V+ PVR  LL  W+    D  +Y +  +    Y D +  SKFC    G+EV
Sbjct: 174 PYLGFFAGGVHGPVRPILLNHWKQRDPDMPVYEYLPK-HLNYYDFMRSSKFCFCPSGYEV 232

Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
            + R+ +++Y  C+PVI++ ++ LPF DVL W++FS++V   +IP LK+IL  IS E+Y 
Sbjct: 233 ASPRVIEAIYSECIPVILSVNFVLPFTDVLRWETFSVLVDVSEIPRLKEILMSISDEKYE 292

Query: 286 LLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
            L+  +  VR+HF+ +  P  +DAF++ ++ +WLRR ++R+
Sbjct: 293 WLKRNLRYVRRHFELNDPPKRFDAFHLTLHSIWLRRLNLRL 333


>gi|357142838|ref|XP_003572711.1| PREDICTED: probable glycosyltransferase At5g20260-like
           [Brachypodium distachyon]
          Length = 484

 Score =  195 bits (496), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 118/352 (33%), Positives = 186/352 (52%), Gaps = 35/352 (9%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMK--SHFVTKDPSKADL 58
           M +  +++ Y      P A++    D      Y+ E  F     +  + F  + P +A++
Sbjct: 144 MEKRLKIWTY-SEGEPPLAHLAPGTDI-----YSIEGQFLAEMEEPLNRFAARHPDEANV 197

Query: 59  FFLPFSIARMRH-DRRIGTEG----IPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
           F LP S+  + H   R+ T      +   ++ Y+  IS K+PYWNR+GGADH  V+CH  
Sbjct: 198 FLLPISVCNLVHYVYRLNTTAHLAPLRKLLADYVAVISDKHPYWNRSGGADHVLVSCHDW 257

Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI--------WPRQEDPPKLGS 165
                + + E++ NAI+V+C+++    G +  KD +LP++         P Q  P     
Sbjct: 258 APLVSEGSPELRDNAIRVLCNAN-VSEGFVPRKDATLPEVNLADGVLRLPTQGLP----R 312

Query: 166 SKRNKLAFFAGAVNSPVREKLLQVW--RNDSEIYAHS---GRLKTPYADE---LLG-SKF 216
             R  LAFFAG +   +R  LL+ W  R D E+  H         P  D+   L+G ++F
Sbjct: 313 QNRTTLAFFAGGMLGEIRRALLEQWAGREDPEMDVHEYLPPHGGGPGYDDYHALMGRARF 372

Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKIL 276
           CL   GFEV + R+ +S++ GCVPVII+  Y LPF DVL+W   S+ V    IP LK IL
Sbjct: 373 CLCPSGFEVASPRVVESVFAGCVPVIISEGYPLPFGDVLDWSKMSVAVPAARIPELKAIL 432

Query: 277 KGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQW 328
           +G+S   Y +L++ VL+ ++HF  H     +D  +MV++ +WLRR +VR+ +
Sbjct: 433 RGVSERRYRVLRARVLQAQRHFVLHRPARRFDMIHMVLHSIWLRRLNVRLPY 484


>gi|359484706|ref|XP_002266646.2| PREDICTED: probable glycosyltransferase At5g20260-like [Vitis
           vinifera]
          Length = 480

 Score =  195 bits (496), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 118/338 (34%), Positives = 186/338 (55%), Gaps = 19/338 (5%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFM--KSHFVTKDPSKADL 58
           M + F+++ Y    + P  +     D    G YA E  F       KSHF+ + P +A+ 
Sbjct: 148 MEKRFKIWAY-REGDQPLMH-----DGPSNGIYAIEGQFMDDIESGKSHFLARRPDEANA 201

Query: 59  FFLPFSIARMRH----DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
           F++P S+ ++ H          + IP  ++ YI  ++ KYPYWNR+ GADHF V+CH   
Sbjct: 202 FYIPMSLTKIVHFIYEPPHYYGKWIPRLVTDYINFVADKYPYWNRSKGADHFLVSCHDWA 261

Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQE-DPPKLGS--SKRNK 170
                   ++  + I+ +C+++     H   +D+S+P+I  PR +  PP L    +KR  
Sbjct: 262 PDVSALKPDLYKHFIRALCNANTSERFHPI-RDISIPEINIPRGKLGPPHLDQPPNKRPI 320

Query: 171 LAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL--KTPYADELLGSKFCLHVKGFEVNTA 228
           LAFFAG  +  VR  L + W+   +      RL     Y+  +  SKFCL   G+EV + 
Sbjct: 321 LAFFAGGAHGYVRSVLFKYWKEKDDEVQVFERLPGNRNYSKSMGDSKFCLCPSGYEVASP 380

Query: 229 RIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
           RI +++  GCVP+II +HY LPF+DVL+W  FSI + +  IP +KKILK + +E YL +Q
Sbjct: 381 RIVEAIAAGCVPMIICDHYSLPFSDVLDWSKFSIYITSDKIPEIKKILKAVPTETYLEMQ 440

Query: 289 SYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
             V +V++HF  +     YD  +M+++ +WLRR +VR+
Sbjct: 441 KRVKQVQRHFAINRPARPYDMLHMILHSVWLRRLNVRL 478


>gi|240256346|ref|NP_197913.4| putative glycosyltransferase [Arabidopsis thaliana]
 gi|292630776|sp|Q3E7Q9.2|GLYT6_ARATH RecName: Full=Probable glycosyltransferase At5g25310
 gi|332006042|gb|AED93425.1| putative glycosyltransferase [Arabidopsis thaliana]
          Length = 480

 Score =  195 bits (496), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 120/341 (35%), Positives = 193/341 (56%), Gaps = 23/341 (6%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMK--SHFVTKDPSKADL 58
           M + F+VYVY      P  +     D   +  YA E  F     K  + F T DP++A +
Sbjct: 147 MEKRFKVYVY-EEGEPPLVH-----DGPCKSVYAVEGRFITEMEKRRTKFRTYDPNQAYV 200

Query: 59  FFLPFSIA---RMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
           +FLPFS+    R  ++     + +  F+S YI  +S  +P+WNRT GADHF + CH  G 
Sbjct: 201 YFLPFSVTWLVRYLYEGNSDAKPLKTFVSDYIRLVSTNHPFWNRTNGADHFMLTCHDWGP 260

Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI--WPRQED-----PPKLGSSKR 168
              Q   ++   +I+V+C+++    G    KDV+LP+I  +  + D        L +S R
Sbjct: 261 LTSQANRDLFNTSIRVMCNANS-SEGFNPTKDVTLPEIKLYGGEVDHKLRLSKTLSASPR 319

Query: 169 NKLAFFAGAVNSPVREKLLQVWRN---DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEV 225
             L FFAG V+ PVR  LL+ W+    D  +Y +  +    Y D +  SKFC    G+EV
Sbjct: 320 PYLGFFAGGVHGPVRPILLKHWKQRDLDMPVYEYLPK-HLNYYDFMRSSKFCFCPSGYEV 378

Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
            + R+ +++Y  C+PVI++ ++ LPF DVL W++FS++V   +IP LK+IL  IS+E+Y 
Sbjct: 379 ASPRVIEAIYSECIPVILSVNFVLPFTDVLRWETFSVLVDVSEIPRLKEILMSISNEKYE 438

Query: 286 LLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
            L+S +  VR+HF+ +  P  +DAF++ ++ +WLRR ++++
Sbjct: 439 WLKSNLRYVRRHFELNDPPQRFDAFHLTLHSIWLRRLNLKL 479


>gi|356565155|ref|XP_003550810.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
           [Glycine max]
          Length = 496

 Score =  195 bits (495), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 122/349 (34%), Positives = 191/349 (54%), Gaps = 35/349 (10%)

Query: 1   MNRSFRVYVY-----PHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMK----SHFVTK 51
           M + F+V+VY     P   + P  N+           YA E  F          S F  +
Sbjct: 157 MVKRFKVWVYQEGEQPLVHDGPVNNI-----------YAIEGQFMDEIDNNDKWSQFRAR 205

Query: 52  DPSKADLFFLPFSIARMRHDRRIGTEGIPDF--------ISHYIFNISQKYPYWNRTGGA 103
            P +A +FFLPFSIA + H          D+        +  YI  I  KYPYWNR+ GA
Sbjct: 206 HPEEAHVFFLPFSIANVVHYVYKPILKQSDYEPVRLQLLVEDYISVIEDKYPYWNRSKGA 265

Query: 104 DHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW--PRQEDPP 161
           DHF ++CH           E+  + I+ +C+++    G   ++DVS+P+++    +  PP
Sbjct: 266 DHFLLSCHDWAPKVSNGNPELFQSFIRALCNANTS-EGFHPNRDVSIPEVYLPVGKLGPP 324

Query: 162 KLGS--SKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELLG-SKFC 217
            LG   + R  LAFFAG V+  +R+ LL+ W++ D+E+  H    K+    +L+G SKFC
Sbjct: 325 SLGQHPNSRTILAFFAGGVHGEIRKILLKHWKDKDNEVRVHEYLPKSQNYTKLMGQSKFC 384

Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILK 277
           L   G EV + R+ ++++ GCVPVII ++Y LPF+DVL+W  FS+ V+   IP +K IL+
Sbjct: 385 LCPSGHEVASPRVVEAIHAGCVPVIICDNYSLPFSDVLHWSQFSVKVSVQKIPEIKSILQ 444

Query: 278 GISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
            IS ++YL L   VL+VR+HF  +     +D  +M+++ +WLRR ++++
Sbjct: 445 SISRKKYLRLHMNVLRVRRHFMINRPAKPFDMMHMILHSIWLRRLNIKL 493


>gi|15239475|ref|NP_198516.1| Exostosin family protein [Arabidopsis thaliana]
 gi|332006752|gb|AED94135.1| Exostosin family protein [Arabidopsis thaliana]
          Length = 547

 Score =  194 bits (494), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 119/320 (37%), Positives = 180/320 (56%), Gaps = 27/320 (8%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH--FVTKDPSKADL 58
           M R  ++YVY       F   +      PRG YASE +F K  M+S+  FV KDP KA L
Sbjct: 236 MERKLKIYVYKEGGKPIFHTPM------PRGIYASEGWFMK-LMESNKKFVVKDPRKAHL 288

Query: 59  FFLPFSIARMRHDRRIG---TEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
           F++P SI  +R    +     + + D +  Y+  I+ KY +WNRTGGADHF VACH  G 
Sbjct: 289 FYIPISIKALRSSLGLDFQTPKSLADHLKEYVDLIAGKYKFWNRTGGADHFLVACHDWGN 348

Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPK--LG---SSKRNK 170
               K  +   N+++ +C+S+    G     D +LP  + R  + P   LG   SS+R  
Sbjct: 349 KLTTKTMK---NSVRSLCNSN-VAQGFRIGTDTALPVTYIRSSEAPLEYLGGKTSSERKI 404

Query: 171 LAFFAGAVNSPVREKLLQVWRN---DSEIYAHSGR---LKTPYADELLGSKFCLHVKGFE 224
           LAFFAG+++  +R  L+++W N   D +I+    R    K  Y + +  S++C+  +G+E
Sbjct: 405 LAFFAGSMHGYLRPILVKLWENKEPDMKIFGPMPRDPKSKKQYREYMKSSRYCICARGYE 464

Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
           V+T R+ +++   CVPVIIA++Y  PF +VLNW+ F++ V   DIP L+ IL  I  + Y
Sbjct: 465 VHTPRVVEAIINECVPVIIADNYVPPFFEVLNWEEFAVFVEEKDIPNLRNILLSIPEDRY 524

Query: 285 LLLQSYVLKVRKHFQWHVFP 304
           + +Q+ V  V++HF WH  P
Sbjct: 525 IGMQARVKAVQQHFLWHKKP 544


>gi|414883870|tpg|DAA59884.1| TPA: hypothetical protein ZEAMMB73_690759, partial [Zea mays]
          Length = 577

 Score =  194 bits (494), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 133/359 (37%), Positives = 192/359 (53%), Gaps = 46/359 (12%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPV--DFEPRGNYASESYF-------KKAFMKSHFVTK 51
           M R  +VYVY     +P      PV  D   R  Y++E  F        +A       T+
Sbjct: 230 MERQLKVYVY--EEGEP------PVFHDGPCRSIYSTEGRFIHSMETETEAEEGRRLRTR 281

Query: 52  DPSKADLFFLPFSIARMRHDRRIGTEGIPDF------ISHYIFNISQKYPYWNRTGGADH 105
           DP++A +FFLPFS+ +M     I   G  D       ++ Y+  +S KYPYWNR+ GADH
Sbjct: 282 DPARAHVFFLPFSVVKMVQT--IYEPGSRDMAPLKRTVADYVRVLSSKYPYWNRSLGADH 339

Query: 106 FYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI------WPRQED 159
           F ++CH  G        ++  N+I+V+C+++    G    +DVSLPQ+        RQ  
Sbjct: 340 FMLSCHDWGPYVSSANAQLFGNSIRVLCNANT-SEGFDPARDVSLPQVNLRSDAVERQVG 398

Query: 160 PPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEI----------YAHSGRLKTPYAD 209
            P   +S+R  LAFFAG  + PVR  LL  W                Y   G     YAD
Sbjct: 399 GPS--ASRRPVLAFFAGGNHGPVRPALLAHWGPGGRRGGDPDVRVSEYLPRGGGAPSYAD 456

Query: 210 ELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIAN-HYDLPFADVLNWKSFSIVVATLD 268
            +  S+FCL   G+EV + R+A++LY GCVPV++ +  Y LPFADVL+W +F++ +   D
Sbjct: 457 MMRRSRFCLCPGGYEVASPRLAEALYLGCVPVVVDDGEYALPFADVLDWDAFALRLRVAD 516

Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPS-DYDAFYMVMYELWLRRSSVRV 326
           IP LK+IL  +S  +Y+ +Q  V  VR+HF  H  P   YDAF+M+++ +WLRR +VR+
Sbjct: 517 IPRLKEILAAVSPRQYIRMQRRVRMVRRHFMLHGGPPRRYDAFHMILHSVWLRRLNVRI 575


>gi|296085067|emb|CBI28482.3| unnamed protein product [Vitis vinifera]
          Length = 634

 Score =  194 bits (493), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 120/342 (35%), Positives = 187/342 (54%), Gaps = 28/342 (8%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M + F++++Y       F N      +   G      +F      +HF T+DP +A ++F
Sbjct: 305 MEKLFKIFIYKEGEPPLFHNGPCKSIYSIEG-----VFFSLMEGDTHFRTQDPDEAHVYF 359

Query: 61  LPFSIARMRH--------DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHS 112
           LPFS+  + H        D+ +    +   +S Y+  ISQKY YWNR+ GADHF ++CH 
Sbjct: 360 LPFSVVMIIHHLFDPIVRDKYV----MKHVVSDYVKVISQKYRYWNRSLGADHFMLSCHD 415

Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLG------SS 166
            G  A     ++  N+I+++C+++     +   KD S+P+I     D   +G       S
Sbjct: 416 WGPRATWYVPQLYYNSIRLLCNANTSECFN-PRKDASIPEI--NLIDGETIGLTGGLPPS 472

Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKT--PYADELLGSKFCLHVKGFE 224
           KR  LAFFAG ++  +R  LLQ W+   E       L     Y D +  SK+C+   G E
Sbjct: 473 KRTILAFFAGGLHGRIRPALLQHWKEKDEQVQVYETLPEGLSYPDLMKKSKYCICPSGHE 532

Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
           V + RI +++Y  CVPV+I+ HY LPF+DVL+W SFSI V+  +IP LKKIL GI  + Y
Sbjct: 533 VASPRIVEAIYAECVPVLISQHYVLPFSDVLDWGSFSIQVSVNEIPNLKKILLGIPQDRY 592

Query: 285 LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
           + +Q  V +V++HF  +  P  +D F+M+++ +WLRR +V +
Sbjct: 593 IRMQERVKQVQQHFVVNNPPKRFDVFHMIIHSIWLRRLNVAI 634


>gi|296084507|emb|CBI25528.3| unnamed protein product [Vitis vinifera]
          Length = 862

 Score =  194 bits (493), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 118/338 (34%), Positives = 186/338 (55%), Gaps = 19/338 (5%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFM--KSHFVTKDPSKADL 58
           M + F+++ Y    + P  +     D    G YA E  F       KSHF+ + P +A+ 
Sbjct: 530 MEKRFKIWAY-REGDQPLMH-----DGPSNGIYAIEGQFMDDIESGKSHFLARRPDEANA 583

Query: 59  FFLPFSIARMRH----DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
           F++P S+ ++ H          + IP  ++ YI  ++ KYPYWNR+ GADHF V+CH   
Sbjct: 584 FYIPMSLTKIVHFIYEPPHYYGKWIPRLVTDYINFVADKYPYWNRSKGADHFLVSCHDWA 643

Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQE-DPPKLGS--SKRNK 170
                   ++  + I+ +C+++     H   +D+S+P+I  PR +  PP L    +KR  
Sbjct: 644 PDVSALKPDLYKHFIRALCNANTSERFHPI-RDISIPEINIPRGKLGPPHLDQPPNKRPI 702

Query: 171 LAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL--KTPYADELLGSKFCLHVKGFEVNTA 228
           LAFFAG  +  VR  L + W+   +      RL     Y+  +  SKFCL   G+EV + 
Sbjct: 703 LAFFAGGAHGYVRSVLFKYWKEKDDEVQVFERLPGNRNYSKSMGDSKFCLCPSGYEVASP 762

Query: 229 RIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
           RI +++  GCVP+II +HY LPF+DVL+W  FSI + +  IP +KKILK + +E YL +Q
Sbjct: 763 RIVEAIAAGCVPMIICDHYSLPFSDVLDWSKFSIYITSDKIPEIKKILKAVPTETYLEMQ 822

Query: 289 SYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
             V +V++HF  +     YD  +M+++ +WLRR +VR+
Sbjct: 823 KRVKQVQRHFAINRPARPYDMLHMILHSVWLRRLNVRL 860


>gi|359485860|ref|XP_002264111.2| PREDICTED: probable glycosyltransferase At3g07620-like [Vitis
           vinifera]
          Length = 410

 Score =  194 bits (493), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 120/342 (35%), Positives = 187/342 (54%), Gaps = 28/342 (8%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M + F++++Y       F N      +   G      +F      +HF T+DP +A ++F
Sbjct: 81  MEKLFKIFIYKEGEPPLFHNGPCKSIYSIEG-----VFFSLMEGDTHFRTQDPDEAHVYF 135

Query: 61  LPFSIARMRH--------DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHS 112
           LPFS+  + H        D+ +    +   +S Y+  ISQKY YWNR+ GADHF ++CH 
Sbjct: 136 LPFSVVMIIHHLFDPIVRDKYV----MKHVVSDYVKVISQKYRYWNRSLGADHFMLSCHD 191

Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLG------SS 166
            G  A     ++  N+I+++C+++     +   KD S+P+I     D   +G       S
Sbjct: 192 WGPRATWYVPQLYYNSIRLLCNANTSECFN-PRKDASIPEI--NLIDGETIGLTGGLPPS 248

Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKT--PYADELLGSKFCLHVKGFE 224
           KR  LAFFAG ++  +R  LLQ W+   E       L     Y D +  SK+C+   G E
Sbjct: 249 KRTILAFFAGGLHGRIRPALLQHWKEKDEQVQVYETLPEGLSYPDLMKKSKYCICPSGHE 308

Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
           V + RI +++Y  CVPV+I+ HY LPF+DVL+W SFSI V+  +IP LKKIL GI  + Y
Sbjct: 309 VASPRIVEAIYAECVPVLISQHYVLPFSDVLDWGSFSIQVSVNEIPNLKKILLGIPQDRY 368

Query: 285 LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
           + +Q  V +V++HF  +  P  +D F+M+++ +WLRR +V +
Sbjct: 369 IRMQERVKQVQQHFVVNNPPKRFDVFHMIIHSIWLRRLNVAI 410


>gi|357462311|ref|XP_003601437.1| hypothetical protein MTR_3g080690 [Medicago truncatula]
 gi|355490485|gb|AES71688.1| hypothetical protein MTR_3g080690 [Medicago truncatula]
          Length = 450

 Score =  193 bits (490), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 125/344 (36%), Positives = 189/344 (54%), Gaps = 30/344 (8%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGN-YASESYFKKAFMKSH-FVTKDPSKADL 58
           M + F+++VY       F        + P  N Y+ E  F      +  F T++P++A +
Sbjct: 120 MEKLFKIFVYEEGEPPLF-------HYGPCKNIYSMEGIFINLLENNTLFRTQNPNEAHV 172

Query: 59  FFLPFSIARMRH--------DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVAC 110
           +FLPFS+  +          D+ +    I D    Y+  IS KY YWNR+ GADHF ++C
Sbjct: 173 YFLPFSVVMILEHLFHPVIRDKAVLGRTIGD----YVHIISHKYAYWNRSYGADHFMLSC 228

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDVSLPQI-WPRQEDPPKLGSS-- 166
           H  G  A     E+   AI+V+C+++  IS H    KD S P+I     E    LG    
Sbjct: 229 HDWGPRATWYVKELYFIAIRVLCNAN--ISEHFNPKKDASFPEINLVSGETTGLLGGYPT 286

Query: 167 -KRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGR-LKTPYADELLGSKFCLHVKGF 223
             R  LAFFAG +N  +R  L Q W+N D ++  +     K  Y + +  SK+C+   G+
Sbjct: 287 WNRTILAFFAGQMNGRIRPVLFQHWKNKDKDVLVYEKLPEKISYHETMKMSKYCICPSGW 346

Query: 224 EVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEE 283
           EV + RI +++Y  CVP++I+  Y LPF+DVLNW SFS+ +   +IP LK+IL GIS E+
Sbjct: 347 EVASPRIVEAIYAECVPILISQQYVLPFSDVLNWDSFSVQIEVSEIPKLKEILLGISEEK 406

Query: 284 YLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
           Y+ LQ  V +V++HF  +  P  YD F+M+++ +WLRR +V V+
Sbjct: 407 YMRLQEGVKQVQRHFVVNNPPKKYDVFHMIIHSIWLRRLNVLVK 450


>gi|359484710|ref|XP_002266490.2| PREDICTED: probable glycosyltransferase At5g20260 [Vitis vinifera]
          Length = 480

 Score =  193 bits (490), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 111/307 (36%), Positives = 177/307 (57%), Gaps = 15/307 (4%)

Query: 33  YASESYFKKAFM--KSHFVTKDPSKADLFFLPFSIARMRH----DRRIGTEGIPDFISHY 86
           YA E  F       KS F+ + P +A+ F++P S+ R+ H          + IP  ++ Y
Sbjct: 174 YAIEGQFMDEIESGKSQFLARHPDEANAFYIPMSLTRVVHFIYEPPHYHGKWIPRLVTDY 233

Query: 87  IFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHK 146
           I  ++ KYPYWNR+ GADHF V+CH           ++  + I+ +C+++     H   +
Sbjct: 234 INFVADKYPYWNRSKGADHFLVSCHDWAPDVSALKPDLYKHFIRALCNANTSERFHPI-R 292

Query: 147 DVSLPQI-WPRQE-DPPKLGS--SKRNKLAFFAGAVNSPVREKLLQVWR---NDSEIYAH 199
           D+S+P+I  PR +  PP L    +KR  LAFFAG  +  VR  L + W+   ++ +++  
Sbjct: 293 DISIPEINIPRGKLGPPHLDQPPNKRPILAFFAGGAHGYVRSVLFKYWKEKDDEVQVFER 352

Query: 200 SGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS 259
             R +  Y+  +  SKFCL   G+EV + RI +++  GCVP+II +HY LPF+DVL+W  
Sbjct: 353 LPRNRN-YSKSMGDSKFCLCPSGYEVASPRIVEAIAAGCVPMIICDHYSLPFSDVLDWSK 411

Query: 260 FSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWL 319
           FSI + +  IP +KKILK + +E YL +Q  V +V++HF  +     YD  +M+++ +WL
Sbjct: 412 FSIYITSDKIPEIKKILKAVPTETYLEMQKRVKQVQRHFAINRPARPYDMLHMILHSVWL 471

Query: 320 RRSSVRV 326
           RR +VR+
Sbjct: 472 RRLNVRL 478


>gi|255548866|ref|XP_002515489.1| catalytic, putative [Ricinus communis]
 gi|223545433|gb|EEF46938.1| catalytic, putative [Ricinus communis]
          Length = 481

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 121/343 (35%), Positives = 191/343 (55%), Gaps = 27/343 (7%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPV--DFEPRGNYASESYF--KKAFMKSHFVTKDPSKA 56
           M R F+VYVY     DP      P+  D   +  Y  E  F  +       + T+DP +A
Sbjct: 148 MERRFKVYVY--SEGDP------PIVHDGPCKDIYTIEGRFIHEMEHGARRYRTRDPKRA 199

Query: 57  DLFFLPFSIARMRHD--RRIGTEGIP--DFISHYIFNISQKYPYWNRTGGADHFYVACHS 112
            ++F+PFS+  M     + +  +  P   F++ Y+  +S KYP+WNRT GADHF +ACH 
Sbjct: 200 HVYFMPFSVTWMVKYLYKPLTYDHSPLRQFVADYVRVLSTKYPFWNRTHGADHFMLACHD 259

Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW------PRQEDPPKLGSS 166
            G  A +    +   +I+V+C+++    G    KDVSLP+I       P Q   P   ++
Sbjct: 260 WGPHASRGDHLLYNTSIRVLCNANT-SEGFNPRKDVSLPEIHLYGGNVPPQLLSPPPANT 318

Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYAHSGRLKTPYADELLGSKFCLHVKGF 223
            R  LAFFAG ++ P+R  LL+ W++   D  ++ +  +    Y   +L SKFCL   G 
Sbjct: 319 TRPHLAFFAGGLHGPIRPLLLKHWKDRESDLRVFEYLPK-HLDYYSFMLRSKFCLCPSGH 377

Query: 224 EVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEE 283
           EV + RI +S+Y  CVPVI+++HY LPF+DVL W +FSI +   +IP L+++L+ +  E+
Sbjct: 378 EVASPRIVESIYAECVPVILSDHYVLPFSDVLRWDAFSIQLNVSEIPRLEEVLRSVPEEK 437

Query: 284 YLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
           Y  L+  +  VR HF  +     +D F+M+++ +WLRR ++R+
Sbjct: 438 YERLKEGLRTVRTHFMLNQPAKRFDVFHMILHSIWLRRLNLRL 480


>gi|449443279|ref|XP_004139407.1| PREDICTED: probable glycosyltransferase At3g42180-like [Cucumis
           sativus]
 gi|449494024|ref|XP_004159425.1| PREDICTED: probable glycosyltransferase At3g42180-like [Cucumis
           sativus]
          Length = 447

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 115/305 (37%), Positives = 170/305 (55%), Gaps = 17/305 (5%)

Query: 33  YASESYFKKAF--MKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDF-------- 82
           YA E  F       KS F    P +A +F LP SI  + H          D+        
Sbjct: 137 YAIEGQFIDELDCSKSPFRASHPDEAHVFLLPLSITNIIHFIYRPITSPADYNRDRMHRV 196

Query: 83  ISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGH 142
            + YI  ++ +YPYWNR+ GADHF V+CH           ++  N I+VVC+++    G 
Sbjct: 197 TTDYIRVVANRYPYWNRSNGADHFVVSCHDWAPEISDANPQLFKNFIRVVCNAN-ITEGF 255

Query: 143 IAHKDVSLPQI--WPRQEDPPKLGS--SKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIY 197
             + D+ LP+I   P    PP LG    +R  LAFFAG  +  +R+ L++ W+  D+E+ 
Sbjct: 256 RPNIDIPLPEINIHPGTLGPPDLGQPPERRPILAFFAGGAHGYIRKILIKHWKEKDNEVQ 315

Query: 198 AHSGRLKTPYADELLG-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLN 256
            H    KT    +L+G SKFCL   G+EV + R+ +++Y GCVPVII+++Y LPF+DVL+
Sbjct: 316 VHEYLPKTQNYTKLIGESKFCLCPSGYEVASPRVVEAIYGGCVPVIISDNYSLPFSDVLD 375

Query: 257 WKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYE 316
           W  FS+ +    IP +K ILK IS E+YL L   V+KV++HF+ +     +D  +M+++ 
Sbjct: 376 WSRFSVQIPVQRIPEIKTILKAISEEKYLKLYKGVIKVKRHFKINRPAKPFDVIHMLLHS 435

Query: 317 LWLRR 321
           LWLRR
Sbjct: 436 LWLRR 440


>gi|296084509|emb|CBI25530.3| unnamed protein product [Vitis vinifera]
          Length = 387

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 111/307 (36%), Positives = 177/307 (57%), Gaps = 15/307 (4%)

Query: 33  YASESYFKKAFM--KSHFVTKDPSKADLFFLPFSIARMRH----DRRIGTEGIPDFISHY 86
           YA E  F       KS F+ + P +A+ F++P S+ R+ H          + IP  ++ Y
Sbjct: 81  YAIEGQFMDEIESGKSQFLARHPDEANAFYIPMSLTRVVHFIYEPPHYHGKWIPRLVTDY 140

Query: 87  IFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHK 146
           I  ++ KYPYWNR+ GADHF V+CH           ++  + I+ +C+++     H   +
Sbjct: 141 INFVADKYPYWNRSKGADHFLVSCHDWAPDVSALKPDLYKHFIRALCNANTSERFHPI-R 199

Query: 147 DVSLPQI-WPRQE-DPPKLGS--SKRNKLAFFAGAVNSPVREKLLQVWR---NDSEIYAH 199
           D+S+P+I  PR +  PP L    +KR  LAFFAG  +  VR  L + W+   ++ +++  
Sbjct: 200 DISIPEINIPRGKLGPPHLDQPPNKRPILAFFAGGAHGYVRSVLFKYWKEKDDEVQVFER 259

Query: 200 SGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS 259
             R +  Y+  +  SKFCL   G+EV + RI +++  GCVP+II +HY LPF+DVL+W  
Sbjct: 260 LPRNRN-YSKSMGDSKFCLCPSGYEVASPRIVEAIAAGCVPMIICDHYSLPFSDVLDWSK 318

Query: 260 FSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWL 319
           FSI + +  IP +KKILK + +E YL +Q  V +V++HF  +     YD  +M+++ +WL
Sbjct: 319 FSIYITSDKIPEIKKILKAVPTETYLEMQKRVKQVQRHFAINRPARPYDMLHMILHSVWL 378

Query: 320 RRSSVRV 326
           RR +VR+
Sbjct: 379 RRLNVRL 385


>gi|242043198|ref|XP_002459470.1| hypothetical protein SORBIDRAFT_02g005180 [Sorghum bicolor]
 gi|241922847|gb|EER95991.1| hypothetical protein SORBIDRAFT_02g005180 [Sorghum bicolor]
          Length = 584

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 135/358 (37%), Positives = 194/358 (54%), Gaps = 45/358 (12%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPV--DFEPRGNYASESYFKKAF-MKSHFVTKDPSKAD 57
           M +  +VYVY     +P      PV  D   R  Y++E  F  A    +   T DPS+A 
Sbjct: 234 MEKQLKVYVY--EEGEP------PVFHDGPCRSIYSTEGRFIHAMETATRLRTSDPSQAH 285

Query: 58  LFFLPFSIARMRHDRRIGTEGIPDF------ISHYIFNISQKYPYWNRTGGADHFYVACH 111
           +FFLPFS+ +M   + I   G  D       ++ Y+  IS KYP+WNR+ GADHF ++CH
Sbjct: 286 VFFLPFSVVKMV--KTIYEPGSHDMAPLKRTVADYLRVISDKYPFWNRSAGADHFMLSCH 343

Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI------WPRQEDPPKLGS 165
             G        E+  N+I+V+C+++      +A KDVSLP+I        RQ   P   +
Sbjct: 344 DWGPYVSSANAELFGNSIRVLCNANTSEGFDLA-KDVSLPEINLRSDAVERQVGGPS--A 400

Query: 166 SKRNKLAFFAGAVNSPVREKLLQVW-----RNDSEIYAHSGRLKTP----------YADE 210
           S+R  LAFFAG  + PVR  LL  W     R D      S  L T           Y D 
Sbjct: 401 SRRPFLAFFAGGNHGPVRPALLAHWGPGSGREDDPDVRVSEYLPTRGGRAGASAAAYTDM 460

Query: 211 LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIAN-HYDLPFADVLNWKSFSIVVATLDI 269
           +  S+FCL   G+EV + R+A+++Y  CVPV++ +  Y LPFADVLNW +F++ V   D+
Sbjct: 461 MRRSRFCLCPGGYEVASPRLAEAVYLECVPVVVDDGEYALPFADVLNWDAFAVRVRVADV 520

Query: 270 PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPS-DYDAFYMVMYELWLRRSSVRV 326
           P +K+IL  +S  +Y+ +Q  V  VR+HF  H  P   YDAF+M+++ +WLRR +VR+
Sbjct: 521 PRIKEILSAVSPRQYIRMQRRVRMVRRHFMVHGGPPRRYDAFHMILHSVWLRRLNVRI 578


>gi|356495238|ref|XP_003516486.1| PREDICTED: probable glycosyltransferase At3g07620-like [Glycine
           max]
          Length = 610

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 125/345 (36%), Positives = 187/345 (54%), Gaps = 43/345 (12%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEP--RGNYASESYFKKAFM-KSHFVTKDPSKAD 57
           M R  +V++Y             P+  +P  RG YASE +F K       F+ KDP KA 
Sbjct: 282 MERKLKVFIYREGAK--------PIFHQPKMRGIYASEGWFMKLMEGNKRFIVKDPRKAH 333

Query: 58  LFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHS----I 113
           LF+LPFS ++M        + +   +  Y+  I+ +Y +WNRT GADHF VACH     I
Sbjct: 334 LFYLPFS-SQMLRVTLSNPKQMEQHLEKYVELIAGRYRFWNRTDGADHFLVACHDWASRI 392

Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW------PRQE---DPPKLG 164
            R  M+         I+ +C+S+    G    KD +LP  +      P +E    PP   
Sbjct: 393 TRQPMK-------GCIRSLCNSN-VAKGFQIGKDTTLPVTYIHSVMDPLKECAGKPP--- 441

Query: 165 SSKRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYAHSGR---LKTPYADELLGSKFCL 218
            S+R+ LAFFAG+++  +R  LL+ W N   D +I+    R    K  Y + +  SK+C+
Sbjct: 442 -SERSALAFFAGSMHGYLRPILLKHWANKEPDMKIFGPMPRDLEGKKMYMEYMNSSKYCI 500

Query: 219 HVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKG 278
             +G+EV+T RI ++++ GCVPVII+++Y  P  +VL W++FS+ V   D+P L+ IL  
Sbjct: 501 CARGYEVHTPRIIEAIFSGCVPVIISDNYVPPLFEVLKWEAFSLFVRERDVPSLRDILLS 560

Query: 279 ISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSS 323
           I  E+YL L   V KV++HF WH  P  YD F+M+++ +W  R S
Sbjct: 561 IPEEKYLALHLGVKKVQQHFLWHKVPVKYDLFHMILHAIWKNRLS 605


>gi|224143183|ref|XP_002324873.1| predicted protein [Populus trichocarpa]
 gi|222866307|gb|EEF03438.1| predicted protein [Populus trichocarpa]
          Length = 335

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 112/295 (37%), Positives = 171/295 (57%), Gaps = 15/295 (5%)

Query: 45  KSHFVTKDPSKADLFFLPFSIARMRH-----DRRIGTEGIPDFISHYIFNISQKYPYWNR 99
           KS F  K+P +A  FF+P SIA + H           + I D    YI  IS KYPYWNR
Sbjct: 41  KSPFSAKNPDEALAFFIPVSIASILHFIYRPYVTYSRKQIQDIAEDYIGLISSKYPYWNR 100

Query: 100 TGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQE 158
           + GADHF ++CH           ++  N I+V+C+++    G    +DVSLP+   PR +
Sbjct: 101 SSGADHFMISCHDWAPDVSAANPDLYRNFIRVLCNANSS-EGFKPARDVSLPEFKLPRGK 159

Query: 159 DPP----KLGSSKRNKLAFFAGAVNSPVREKLLQVWR---NDSEIYAHSGRLKTPYADEL 211
             P    +   + R+ LAFFAG  +  VR+ L + W+   ND ++Y +       Y +++
Sbjct: 160 LEPEHILQPCDNNRSILAFFAGGSHGSVRKILFKHWKEKDNDIQVYKYLPETLN-YTEQM 218

Query: 212 LGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPL 271
             S++CL   G+EV + R+ +++Y GCVPVII+++Y LPF+DVL+W  FS+ +    IP 
Sbjct: 219 SKSRYCLCPSGWEVASPRVVEAIYSGCVPVIISDYYVLPFSDVLDWIKFSVHIPVSGIPE 278

Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
           +K IL+ I  EEYL  Q  VL+V++HF+ H     +D  +MVM+ +WLRR ++R+
Sbjct: 279 IKTILQSIPVEEYLEKQKRVLQVQQHFKLHRPAKPFDVVHMVMHSVWLRRLNIRL 333


>gi|147775379|emb|CAN73458.1| hypothetical protein VITISV_022180 [Vitis vinifera]
          Length = 333

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 116/342 (33%), Positives = 184/342 (53%), Gaps = 27/342 (7%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFM--KSHFVTKDPSKADL 58
           M + F+++ Y    + P  +     D    G YA E  F       KSHF+ + P +A+ 
Sbjct: 1   MEKRFKIWAY-REGDQPLMH-----DGPSNGIYAIEGQFMDDIESGKSHFLARRPDEANA 54

Query: 59  FFLPFSIARMRH----DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
           F++P S+ ++ H          + IP  ++ YI  ++ KYPYWNR+ GADHF V+CH   
Sbjct: 55  FYIPMSLTKIVHFIYEPPHYYGKWIPRLVTDYINFVADKYPYWNRSKGADHFLVSCHDWA 114

Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI--------WPRQEDPPKLGSS 166
                   ++  + I+ +C+++     H   +D+S+P+I         P  + PP    +
Sbjct: 115 PDVSALKPDLYKHFIRALCNANTSERFHPI-RDISIPEINIPXGKLGPPHLDQPP----N 169

Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL--KTPYADELLGSKFCLHVKGFE 224
           KR  LAFFAG  +  VR  L + W+   +      RL     Y+  +  SKFCL   G+E
Sbjct: 170 KRPILAFFAGGAHGYVRSVLFKYWKEKDDEVQVFERLPGNRNYSKSMGDSKFCLCPSGYE 229

Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
           V + RI +++  GCVP+II +HY LPF+DVL+W  FSI + +  IP +KKILK + +E Y
Sbjct: 230 VASPRIVEAIAAGCVPMIICDHYSLPFSDVLDWSKFSIYITSDKIPEIKKILKAVPTETY 289

Query: 285 LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
           L +Q  V +V++HF  +     YD  +M+++ +WLRR +VR+
Sbjct: 290 LEMQKRVKQVQRHFAINRPARPYDMLHMILHSVWLRRLNVRL 331


>gi|297807149|ref|XP_002871458.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297317295|gb|EFH47717.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 336

 Score =  191 bits (486), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 124/347 (35%), Positives = 185/347 (53%), Gaps = 30/347 (8%)

Query: 1   MNRSFRVYVY-----PHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMK--SHFVTKDP 53
           M + F+++ Y     P     P  N+           YA E  F        S +    P
Sbjct: 1   MEKRFKIWTYREGEAPLFHKGPLNNI-----------YAIEGQFMDEIENGNSRYKAASP 49

Query: 54  SKADLFFLPFSIARM-----RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYV 108
            +A +F++P  I  +     R       + + + +  YI  IS +YPYWNR+ GADHF++
Sbjct: 50  EEATVFYIPVGIVNIIRFVYRPYTSYARDRLQNIVKDYISLISNRYPYWNRSRGADHFFL 109

Query: 109 ACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPR-QEDPPKLGSS 166
           +CH           E+  + I+ +C+++    G    +DVSLP+I  P  Q      G +
Sbjct: 110 SCHDWAPDVSAVDPELYRHFIRALCNANAS-EGFTPMRDVSLPEINIPHSQLGFVHTGEA 168

Query: 167 KRNK--LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKT-PYADELLGSKFCLHVKG 222
            +N+  LAFFAG  +  VR+ L + W+  D ++  +    KT  Y   +  +KFCL   G
Sbjct: 169 PQNRKLLAFFAGGSHGEVRKILFEQWKEKDKDVLVYEYLPKTMNYTKMMDKAKFCLCPSG 228

Query: 223 FEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSE 282
           +EV + RI +SLY GCVPVIIA+ Y LPF+DVLNWK+FS+ +    +P +KKIL+ IS E
Sbjct: 229 WEVASPRIVESLYSGCVPVIIADSYVLPFSDVLNWKTFSVHIPISKMPDIKKILEAISEE 288

Query: 283 EYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWS 329
           EYL +Q  VL+VRKHF  +     YD  +M+M+ +WLRR +VR+  S
Sbjct: 289 EYLEMQRRVLEVRKHFVINRPSKPYDMLHMIMHSIWLRRLNVRIPLS 335


>gi|356546126|ref|XP_003541482.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
           [Glycine max]
          Length = 490

 Score =  191 bits (485), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 120/349 (34%), Positives = 185/349 (53%), Gaps = 35/349 (10%)

Query: 1   MNRSFRVYVY-----PHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMK----SHFVTK 51
           M + F+V+VY     P   + P  N+           YA E  F          S F  +
Sbjct: 151 MVKRFKVWVYQEGEQPLVHDGPVNNI-----------YAIEGQFMDEMDNNGKWSQFRAR 199

Query: 52  DPSKADLFFLPFSIARMRH--------DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGA 103
            P +A +FFLP SIA + H                +   +  YI  I  KYPYWNR+ GA
Sbjct: 200 HPEEAHVFFLPISIANVVHYVYKPILKQSDYEPVRLQHLVEDYIGVIQDKYPYWNRSIGA 259

Query: 104 DHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW--PRQEDPP 161
           DHF ++CH  G        E+    I+ +C+++     H  ++DVS+P+++    +  P 
Sbjct: 260 DHFLLSCHDWGPKVSYGNPELFQTFIRALCNANTSEGFH-PNRDVSIPEVYLPVGKLGPA 318

Query: 162 KLGS--SKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELLG-SKFC 217
            LG   + R  LAFFAG V+  +R+ LL+ W++ D+E+  H    K     +L+G SKFC
Sbjct: 319 SLGQHPNSRTTLAFFAGGVHGEIRKILLKHWKDKDNEVLVHEYLPKGQDYTKLMGQSKFC 378

Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILK 277
           L   G EV + R+ ++++ GCVPVII ++Y LPF+DVLNW  FS+ +    IP +K IL+
Sbjct: 379 LCPSGHEVASPRVVEAIHAGCVPVIICDNYSLPFSDVLNWSQFSVEIPVEKIPEIKSILQ 438

Query: 278 GISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
            IS  +YL L   VL+VR+HF  +     +D  +M+++ +WLRR ++++
Sbjct: 439 SISRNKYLRLHMNVLRVRRHFMINRPTKPFDMMHMILHSIWLRRLNIKL 487


>gi|296082060|emb|CBI21065.3| unnamed protein product [Vitis vinifera]
          Length = 402

 Score =  191 bits (484), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 119/343 (34%), Positives = 182/343 (53%), Gaps = 25/343 (7%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M + FRV+ Y       F +  + + +   G +  E   K       F  + P +A LF+
Sbjct: 66  MEKRFRVWTYQEGEKPLFHSGPMNLIYGIEGQFMDELESKNC----PFSARHPDEAMLFY 121

Query: 61  LPFSIARM-----------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVA 109
           +P S+  +               R+  +G+   ++ YI  IS+KYP+WNR+ GADHF V+
Sbjct: 122 IPISVVNIIRYVYQPYTNSASYSRVRLQGL---VADYIDVISRKYPFWNRSSGADHFMVS 178

Query: 110 CHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWP--RQEDPPKLGSSK 167
           CH           +   + I+V+C+++    G    +DVSLP+I    R   PP LG   
Sbjct: 179 CHDWAPEISAANPKFFRHFIRVLCNANT-SEGFKPVRDVSLPEILVPYRMLGPPYLGQPP 237

Query: 168 RNK--LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELLG-SKFCLHVKGF 223
            N+  LAFFAG  +  VR  L   W+  D +I  H     T    EL+G SKFCL   GF
Sbjct: 238 TNRSILAFFAGGAHGKVRSILFHYWKEKDEDIQVHEYLPTTLNYTELMGRSKFCLCPSGF 297

Query: 224 EVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEE 283
           EV + R+ +S+Y GCVPVII+++Y LPF+DVL+W  FS+ +    IP  K IL+ I  +E
Sbjct: 298 EVASPRVVESIYAGCVPVIISDNYSLPFSDVLDWSQFSVHIPIARIPETKTILQAIPIQE 357

Query: 284 YLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
           YL  Q  V++V++HF  +     +D  +MV++ +WLRR ++++
Sbjct: 358 YLTKQKTVMQVQRHFTLNRPAKRFDVLHMVLHSIWLRRINIQL 400


>gi|359476148|ref|XP_002285239.2| PREDICTED: probable glycosyltransferase At5g11130 [Vitis vinifera]
          Length = 472

 Score =  191 bits (484), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 119/343 (34%), Positives = 182/343 (53%), Gaps = 25/343 (7%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M + FRV+ Y       F +  + + +   G +  E   K       F  + P +A LF+
Sbjct: 136 MEKRFRVWTYQEGEKPLFHSGPMNLIYGIEGQFMDELESKNC----PFSARHPDEAMLFY 191

Query: 61  LPFSIARM-----------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVA 109
           +P S+  +               R+  +G+   ++ YI  IS+KYP+WNR+ GADHF V+
Sbjct: 192 IPISVVNIIRYVYQPYTNSASYSRVRLQGL---VADYIDVISRKYPFWNRSSGADHFMVS 248

Query: 110 CHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWP--RQEDPPKLGSSK 167
           CH           +   + I+V+C+++    G    +DVSLP+I    R   PP LG   
Sbjct: 249 CHDWAPEISAANPKFFRHFIRVLCNANT-SEGFKPVRDVSLPEILVPYRMLGPPYLGQPP 307

Query: 168 RNK--LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELLG-SKFCLHVKGF 223
            N+  LAFFAG  +  VR  L   W+  D +I  H     T    EL+G SKFCL   GF
Sbjct: 308 TNRSILAFFAGGAHGKVRSILFHYWKEKDEDIQVHEYLPTTLNYTELMGRSKFCLCPSGF 367

Query: 224 EVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEE 283
           EV + R+ +S+Y GCVPVII+++Y LPF+DVL+W  FS+ +    IP  K IL+ I  +E
Sbjct: 368 EVASPRVVESIYAGCVPVIISDNYSLPFSDVLDWSQFSVHIPIARIPETKTILQAIPIQE 427

Query: 284 YLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
           YL  Q  V++V++HF  +     +D  +MV++ +WLRR ++++
Sbjct: 428 YLTKQKTVMQVQRHFTLNRPAKRFDVLHMVLHSIWLRRINIQL 470


>gi|296085068|emb|CBI28483.3| unnamed protein product [Vitis vinifera]
          Length = 416

 Score =  190 bits (483), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 120/344 (34%), Positives = 185/344 (53%), Gaps = 30/344 (8%)

Query: 1   MNRSFRVYVY-----PHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMK-SHFVTKDPS 54
           M + F++YVY     P   N P  ++           Y++E  F     K S + T DP 
Sbjct: 85  MEKLFKIYVYEEGEPPMFHNGPCKSI-----------YSTEGRFIHEMEKGSVYRTTDPD 133

Query: 55  KADLFFLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVAC 110
           +A L+FLPFS+  M     +        I   +  YI  IS  +P+WNR+ GADHF ++C
Sbjct: 134 QALLYFLPFSVVMMVQYLYVPDSHEIHAIEKTVIDYINLISHNHPFWNRSLGADHFMLSC 193

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPK----LGSS 166
           H  G  A      +  N+I+V+C+++    G    KDVS P+I  R  +       L  S
Sbjct: 194 HDWGPRASTSVPYLYNNSIRVLCNANT-SEGFNPSKDVSFPEIHLRTGEMSGPLGGLSPS 252

Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYAHSGRLKTPYADELLGSKFCLHVKGF 223
           +R  L FFAG ++  +R  LL+ W++   D ++Y         Y   L  S+FCL   G+
Sbjct: 253 RRPILGFFAGRLHGHIRYLLLEQWKDKDKDLQVYDQLPN-GLSYDSMLKKSRFCLCPSGY 311

Query: 224 EVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEE 283
           EV + R+ +++Y  CVPV+I+++Y  PF DVLNWKSF++ V   DI  +K+IL GIS  +
Sbjct: 312 EVASPRVVEAIYAECVPVLISDNYVPPFNDVLNWKSFAVQVQVRDIANIKRILMGISQTQ 371

Query: 284 YLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
           YL +   V +V++HF  +  P  +D F+M ++ +WLRR ++R+Q
Sbjct: 372 YLRMYRRVKQVQRHFMVNAAPQRFDVFHMTIHSIWLRRLNIRIQ 415


>gi|359485862|ref|XP_002264076.2| PREDICTED: probable glycosyltransferase At5g03795-like [Vitis
           vinifera]
          Length = 484

 Score =  190 bits (483), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 120/344 (34%), Positives = 185/344 (53%), Gaps = 30/344 (8%)

Query: 1   MNRSFRVYVY-----PHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMK-SHFVTKDPS 54
           M + F++YVY     P   N P  ++           Y++E  F     K S + T DP 
Sbjct: 153 MEKLFKIYVYEEGEPPMFHNGPCKSI-----------YSTEGRFIHEMEKGSVYRTTDPD 201

Query: 55  KADLFFLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVAC 110
           +A L+FLPFS+  M     +        I   +  YI  IS  +P+WNR+ GADHF ++C
Sbjct: 202 QALLYFLPFSVVMMVQYLYVPDSHEIHAIEKTVIDYINLISHNHPFWNRSLGADHFMLSC 261

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPK----LGSS 166
           H  G  A      +  N+I+V+C+++    G    KDVS P+I  R  +       L  S
Sbjct: 262 HDWGPRASTSVPYLYNNSIRVLCNANTS-EGFNPSKDVSFPEIHLRTGEMSGPLGGLSPS 320

Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYAHSGRLKTPYADELLGSKFCLHVKGF 223
           +R  L FFAG ++  +R  LL+ W++   D ++Y         Y   L  S+FCL   G+
Sbjct: 321 RRPILGFFAGRLHGHIRYLLLEQWKDKDKDLQVYDQLPN-GLSYDSMLKKSRFCLCPSGY 379

Query: 224 EVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEE 283
           EV + R+ +++Y  CVPV+I+++Y  PF DVLNWKSF++ V   DI  +K+IL GIS  +
Sbjct: 380 EVASPRVVEAIYAECVPVLISDNYVPPFNDVLNWKSFAVQVQVRDIANIKRILMGISQTQ 439

Query: 284 YLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
           YL +   V +V++HF  +  P  +D F+M ++ +WLRR ++R+Q
Sbjct: 440 YLRMYRRVKQVQRHFMVNAAPQRFDVFHMTIHSIWLRRLNIRIQ 483


>gi|449511061|ref|XP_004163851.1| PREDICTED: probable glycosyltransferase At3g07620-like [Cucumis
           sativus]
          Length = 341

 Score =  189 bits (481), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 115/336 (34%), Positives = 181/336 (53%), Gaps = 25/336 (7%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH-FVTKDPSKADLF 59
           M +  +VYVY       F   +L       G YASE +F K    +  FV KDP KA LF
Sbjct: 9   MEKMLKVYVYDEGEKPIFHQPIL------TGIYASEGWFMKLLEDNKKFVVKDPEKAHLF 62

Query: 60  FLPFSIARMRH---DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRS 116
           +LPFS   +R    ++      +   + +YI  I +KY +WN+ GG+DHF VACH     
Sbjct: 63  YLPFSSQFLRSAFGNKFRNKRDLQKPLKNYIDVIGKKYRFWNKNGGSDHFLVACHDWAPK 122

Query: 117 AMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDP--PKLGS---SKRNKL 171
             ++  +   N I+ +C+++      I  KD SLP  +    +    K+G    S+R  L
Sbjct: 123 LTKRLVK---NCIRALCNANGAGDFEIG-KDTSLPVTFVHSTEDLITKIGGKPPSERTTL 178

Query: 172 AFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL------KTPYADELLGSKFCLHVKGFEV 225
           AFFAG+++  +R  LL  W N        G +      K  Y +++  SK+C+  +G++V
Sbjct: 179 AFFAGSMHGYLRPILLHYWENKEPDMMIVGPMPNSIEGKNAYMEQMKSSKYCICARGYQV 238

Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
           ++ R+ +++   C+PVII+++Y  P  +VLNW+SFS+ V   +IP L+ IL  I  E Y 
Sbjct: 239 HSPRVIEAILNECIPVIISDNYVPPLFEVLNWESFSVFVKEREIPYLRDILLSIPEENYR 298

Query: 286 LLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
           ++ S V  V++HF WH  P+ YDAF+M+++ +W  R
Sbjct: 299 VMHSRVKMVQQHFLWHEKPAKYDAFHMILHSIWYTR 334


>gi|449465629|ref|XP_004150530.1| PREDICTED: probable glycosyltransferase At5g20260-like [Cucumis
           sativus]
          Length = 299

 Score =  189 bits (481), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 109/298 (36%), Positives = 172/298 (57%), Gaps = 21/298 (7%)

Query: 45  KSHFVTKDPSKADLFFLPFSIARM-----RHDRRIGTEGIPDFISHYIFNISQKYPYWNR 99
           KS F   +P +A +FFLP SI  +     +       + +    + Y+  ++ KYPYWNR
Sbjct: 5   KSPFSAHEPEEAQVFFLPISIVYIVDYIYKPITTYARDRLVRIFTDYVRVVANKYPYWNR 64

Query: 100 TGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI------ 153
           T GADHF V+CH       ++   +    I+V+C+++    G    +D SLP+I      
Sbjct: 65  TRGADHFMVSCHDWAPEVTKEDPNLFKYFIRVLCNANT-SEGFNPMRDASLPEINLPPTF 123

Query: 154 ---WPRQEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYAD 209
               PR   PP+     R+ LAFFAG  +  +R  L+Q W++ D EI  H     +    
Sbjct: 124 HLNLPRLGQPPQ----NRSILAFFAGGAHGFIRHILMQHWKDKDHEIQVHEYLPPSQNYT 179

Query: 210 ELLG-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
           EL+  SKFCL   G+EV + R+ ++++ GCVPV+I+++Y LPF DVL+W  FS+ + +  
Sbjct: 180 ELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVISDYYSLPFDDVLDWSKFSMRIPSER 239

Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
           IP +K IL+G+S ++YL LQ  V+KV++HF+ H     +D F+MV++ +WLRR +V++
Sbjct: 240 IPEIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPAKAFDMFHMVLHSVWLRRLNVKL 297


>gi|224142559|ref|XP_002324623.1| predicted protein [Populus trichocarpa]
 gi|222866057|gb|EEF03188.1| predicted protein [Populus trichocarpa]
          Length = 337

 Score =  189 bits (480), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 109/295 (36%), Positives = 169/295 (57%), Gaps = 15/295 (5%)

Query: 46  SHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDF--------ISHYIFNISQKYPYW 97
           S F  +DP +A +FFLP S+A + H   +      D+        ++ Y+  +++KYPYW
Sbjct: 42  SPFRAQDPDEAHVFFLPVSVASIVHFIYLPITAAADYSRDRLRRVVTDYVHIVAKKYPYW 101

Query: 98  NRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWP-- 155
           NR+ GADHF V+CH           E+    I+V+C+++  I G    +DV LP+I+   
Sbjct: 102 NRSNGADHFMVSCHDWAPDVSIANSELFNKFIRVLCNANISI-GFRPPRDVLLPEIYLPF 160

Query: 156 RQEDPPKLGSSKRNK--LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELL 212
                  +G +  N+  LAFF G  +  +R+ L + W+N D+E+  H    K      L+
Sbjct: 161 SGLGTTHMGQAPNNRPILAFFEGRAHGYIRQVLFKHWKNKDNEVQVHELLPKGKNYTRLM 220

Query: 213 G-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPL 271
           G SKFCL   GFEV + R+ +++Y GCVPVII+N+Y LPF+DVLNW  FS+ +    IP 
Sbjct: 221 GQSKFCLCPSGFEVASPRVVEAIYQGCVPVIISNNYSLPFSDVLNWSQFSVQIPVEKIPE 280

Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
           +K IL+ IS+ +YL +   V +V++HF  +     +D  +MV++ LWLRR + R+
Sbjct: 281 IKMILQRISNSKYLRMHERVKRVQRHFVLNRPAKPFDVIHMVLHSLWLRRLNFRL 335


>gi|224143179|ref|XP_002324871.1| predicted protein [Populus trichocarpa]
 gi|222866305|gb|EEF03436.1| predicted protein [Populus trichocarpa]
          Length = 334

 Score =  189 bits (480), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 119/343 (34%), Positives = 188/343 (54%), Gaps = 27/343 (7%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPV--DFEPRGNYASESYF--KKAFMKSHFVTKDPSKA 56
           M + F+V+VY        +   LP+  D   +  Y  E  F  +       F T+DP +A
Sbjct: 1   MEKRFKVHVY--------SEGELPIVHDGPCKDIYTIEGRFIHEMEHGAKRFKTRDPRRA 52

Query: 57  DLFFLPFSIARM----RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHS 112
            ++F+PFS+  M              +  F++ Y+  +S KYP+WNRT GADHF ++CH 
Sbjct: 53  HVYFMPFSVTWMVKYLYKPLTYDHTAMKQFVADYVRVVSSKYPFWNRTQGADHFMLSCHD 112

Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQED-PPKL-----GSS 166
            G  A      +   +I+V+C+++    G    KDVSLP+I     + PPKL      +S
Sbjct: 113 WGPHASHGNPFLYNTSIRVLCNANS-SEGFSPRKDVSLPEIHLYGGNVPPKLISPPPATS 171

Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYAHSGRLKTPYADELLGSKFCLHVKGF 223
            R  LAFF+G ++ P+R  LL  W+    D ++Y +  +    Y   +L SKFCL   G 
Sbjct: 172 PRPYLAFFSGGLHGPIRPILLDHWKGRDPDLQVYEYLPK-DLDYYSFMLRSKFCLCPSGH 230

Query: 224 EVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEE 283
           EV + RI +++Y  CVPVI+++HY LPF+DVL W++F+I V   +IP LK++L  +  E 
Sbjct: 231 EVASPRIVEAIYAECVPVILSDHYVLPFSDVLRWEAFAIQVNVSEIPRLKEVLISVPEER 290

Query: 284 YLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
           Y  L+  +  +RKHF  +     +D F+M+++ +WLRR ++R+
Sbjct: 291 YRRLKEGLRAIRKHFVLNQPAKRFDVFHMILHSIWLRRLNLRL 333


>gi|255582309|ref|XP_002531945.1| catalytic, putative [Ricinus communis]
 gi|223528391|gb|EEF30427.1| catalytic, putative [Ricinus communis]
          Length = 453

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 106/297 (35%), Positives = 172/297 (57%), Gaps = 13/297 (4%)

Query: 45  KSHFVTKDPSKADLFFLPFSIARM-----RHDRRIGTEGIPDFISHYIFNISQKYPYWNR 99
           +SHF+ + P +A +FFLP S+A +     +       + +   ++ Y+  ++ KYPYWNR
Sbjct: 158 RSHFIARRPEEAQVFFLPVSVANIINFIYKPIVTYSRDQLQRLVTDYVGTVANKYPYWNR 217

Query: 100 TGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQE 158
           + GADHF V+CH           E+  N I+V+C+++     +   +DVS+P+I  P  +
Sbjct: 218 SSGADHFLVSCHDWAPDISTANPELYRNFIRVLCNANTSERFN-PKRDVSIPEINIPSGK 276

Query: 159 DPPKL----GSSKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELLG 213
             P L      S R  LAFFAG  +  +R+ LL+ W+  DSEI  H    K     +L+G
Sbjct: 277 LGPPLHQASSPSNRTILAFFAGGSHGYIRKLLLEHWKGKDSEIQVHEYLDKKQNYFKLMG 336

Query: 214 -SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLL 272
            S+FCL   G+EV + R+  ++  GCVPV I+++Y LPF+D+L+W  FS+ + +  I  +
Sbjct: 337 QSRFCLCPSGYEVASPRVVTAIQLGCVPVTISDNYTLPFSDILDWSKFSVHIPSGKIQEI 396

Query: 273 KKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWS 329
           K ILKGIS  +YL +   V+ VR+HF  +     +D  +M+++ +WLRR ++R+ +S
Sbjct: 397 KTILKGISPRQYLKMHKRVMLVRRHFMLNRPAQPFDMIHMMLHSIWLRRLNIRLPFS 453


>gi|449518133|ref|XP_004166098.1| PREDICTED: probable glycosyltransferase At5g20260-like, partial
           [Cucumis sativus]
          Length = 362

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 109/298 (36%), Positives = 172/298 (57%), Gaps = 21/298 (7%)

Query: 45  KSHFVTKDPSKADLFFLPFSIARM-----RHDRRIGTEGIPDFISHYIFNISQKYPYWNR 99
           KS F   +P +A +FFLP SI  +     +       + +    + Y+  ++ KYPYWNR
Sbjct: 68  KSPFSAHEPEEAQVFFLPISIVYIVDYIYKPITTYARDRLVRIFTDYVRVVANKYPYWNR 127

Query: 100 TGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI------ 153
           T GADHF V+CH       ++   +    I+V+C+++    G    +D SLP+I      
Sbjct: 128 TRGADHFMVSCHDWAPEVTKEDPNLFKYFIRVLCNANT-SEGFNPMRDASLPEINLPPTF 186

Query: 154 ---WPRQEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYAD 209
               PR   PP+     R+ LAFFAG  +  +R  L+Q W++ D EI  H     +    
Sbjct: 187 HLNLPRLGQPPQ----NRSILAFFAGGAHGFIRHILMQHWKDKDHEIQVHEYLPPSQNYT 242

Query: 210 ELLG-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
           EL+  SKFCL   G+EV + R+ ++++ GCVPV+I+++Y LPF DVL+W  FS+ + +  
Sbjct: 243 ELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVISDYYSLPFDDVLDWSKFSMRIPSER 302

Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
           IP +K IL+G+S ++YL LQ  V+KV++HF+ H     +D F+MV++ +WLRR +V++
Sbjct: 303 IPEIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPAKAFDMFHMVLHSVWLRRLNVKL 360


>gi|224142567|ref|XP_002324627.1| predicted protein [Populus trichocarpa]
 gi|222866061|gb|EEF03192.1| predicted protein [Populus trichocarpa]
          Length = 342

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 111/301 (36%), Positives = 170/301 (56%), Gaps = 24/301 (7%)

Query: 45  KSHFVTKDPSKADLFFLPFSIARMRH---DRRI--GTEGIPDFISHYIFNISQKYPYWNR 99
           KS F+ + P +A  FFLP S+A + H     RI    + +   ++ Y+  ++ KY YWNR
Sbjct: 41  KSPFIARHPDEAHAFFLPISVAYIIHYVYKPRITFARDQLQRLVTDYVRVVADKYTYWNR 100

Query: 100 TGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW----- 154
           T GADHF V+CH       +   E+    ++V+C+++    G    +DVS+P+I+     
Sbjct: 101 TQGADHFSVSCHDWAPDVSRANPELFRYFVRVLCNAN-ISEGFRPQRDVSIPEIFLPVGK 159

Query: 155 ---PRQEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHS-----GRLKT 205
              PR+   P    SKR+ LAFFAG  +  +R+ LL  W+  D E+  H       +  T
Sbjct: 160 LGPPREYTKP---PSKRSILAFFAGGAHGHIRKVLLTHWKEKDDEVQVHEYLTQRNKKNT 216

Query: 206 PYADELLG-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV 264
               EL+G SKFCL   G EV + R+  ++  GCVPV I+ +Y LPF+DVL+W  FS+ +
Sbjct: 217 NLYFELMGQSKFCLCPSGHEVASPRVVTAIQLGCVPVTISANYSLPFSDVLDWSKFSVDI 276

Query: 265 ATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSV 324
               IP +K ILKGISS  YL +Q  V+++++HF  +     YD  +M+++ +WLRR +V
Sbjct: 277 PPEKIPEIKTILKGISSRRYLTMQRRVMQIQRHFMLNRPAQPYDMLHMILHSVWLRRLNV 336

Query: 325 R 325
           +
Sbjct: 337 K 337


>gi|224129246|ref|XP_002320537.1| predicted protein [Populus trichocarpa]
 gi|222861310|gb|EEE98852.1| predicted protein [Populus trichocarpa]
          Length = 395

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 113/294 (38%), Positives = 167/294 (56%), Gaps = 19/294 (6%)

Query: 46  SHFVTKDPSKADLFFLPFS----IARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTG 101
           + F T +P +A ++FLPFS    I  + H        +   +S Y+  IS KY YWNR+ 
Sbjct: 104 TKFRTSNPDEAHVYFLPFSVVMIIEHLFHPIIRDKAVLERTVSDYVRIISHKYLYWNRSL 163

Query: 102 GADHFYVACHSIGRSAMQKAWEVKLNAIQVVC---SSSYFISGHIAHKDVSLPQIWPRQE 158
           GADHF ++CH  G  A     ++  N+I+V+C   +S YF       KD S P+I  +  
Sbjct: 164 GADHFMLSCHDWGPRATWYVRQLYYNSIRVLCNANTSEYFN----PKKDASFPEINLKTG 219

Query: 159 DPPKLGS----SKRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYAHSGRLKTPYADEL 211
           +   L      S R  LAFFAG ++  +R  LLQ W     D ++Y    +    Y + +
Sbjct: 220 EITGLTGGLPPSNRTVLAFFAGKMHGKLRPALLQHWMGKDKDVQVYETLPQ-GISYHEMM 278

Query: 212 LGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPL 271
             SK+C+   G EV + RIA+++Y  CVPV+I+ HY  PF+DVLNW SF+I V   +IP 
Sbjct: 279 KKSKYCICPSGHEVASPRIAEAIYAECVPVLISQHYIFPFSDVLNWDSFTIQVPVTEIPN 338

Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           LK IL+GI  ++YL +Q  V +V++HF  +  P  YD F+M+++ +WLRR +VR
Sbjct: 339 LKNILEGIPEDQYLRMQERVRQVQRHFVVNNPPRRYDVFHMIIHSIWLRRLNVR 392


>gi|357478279|ref|XP_003609425.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
 gi|355510480|gb|AES91622.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
          Length = 617

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 104/295 (35%), Positives = 168/295 (56%), Gaps = 15/295 (5%)

Query: 46  SHFVTKDPSKADLFFLPFSIAR--------MRHDRRIGTEGIPDFISHYIFNISQKYPYW 97
           S F  K P++A +FFLPFS+A         +   +    + +   +  Y+  ++ KYPYW
Sbjct: 320 SPFKAKHPNEAHIFFLPFSVANVVQYVYKPIMSKKDFNRDRLHRMVEDYVNVVAHKYPYW 379

Query: 98  NRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW--P 155
           NR+ GADHF ++CH            +  N  +V+C+++    G    +DVS+P+++   
Sbjct: 380 NRSNGADHFLLSCHDWAPEISDANPNLFKNFTRVLCNANTS-EGFQPKRDVSIPEVYLPV 438

Query: 156 RQEDPPKLGSSKRNK--LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELL 212
            +  PP LG S  N+  LAFF+G  +  +R+ LL  W+N D+++  H    K     EL+
Sbjct: 439 GKLGPPNLGQSPLNRTILAFFSGGAHGDIRKLLLNHWKNKDAQVQVHEYLPKGQNYTELM 498

Query: 213 G-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPL 271
           G SKFCL   G+EV + RI +++  GCVPVII+ +Y LPF DVLNW  FS+ +    I  
Sbjct: 499 GLSKFCLCPSGYEVASPRIVEAINAGCVPVIISQNYSLPFNDVLNWSEFSVEIPVEKIVE 558

Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
           +K IL+ ++ ++Y+ L   V+KV+KHF  +     +D  +M+++ +WLRR + R+
Sbjct: 559 IKNILQNVTKDKYMKLHMNVMKVQKHFVMNRPAKPFDVMHMILHSIWLRRLNFRL 613


>gi|449466083|ref|XP_004150756.1| PREDICTED: probable glycosyltransferase At3g07620-like [Cucumis
           sativus]
          Length = 341

 Score =  188 bits (477), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 115/336 (34%), Positives = 180/336 (53%), Gaps = 25/336 (7%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH-FVTKDPSKADLF 59
           M +  +VYVY       F   +L       G YASE +F K    +  FV KDP KA LF
Sbjct: 9   MEKMLKVYVYDEGEKPIFHQPIL------TGIYASEGWFMKLLEDNKKFVVKDPEKAHLF 62

Query: 60  FLPFSIARMRH---DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRS 116
           +LPFS   +R    ++      +   + +YI  I +KY +WN+ GG+DHF VACH     
Sbjct: 63  YLPFSSQFLRSAFGNKFRNKRDLQKPLKNYIDVIGKKYRFWNKNGGSDHFLVACHDWAPK 122

Query: 117 AMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDP--PKLGS---SKRNKL 171
             ++  +   N I+ +C+++      I  KD SLP  +    +    K+G    S+R  L
Sbjct: 123 LTKRLVK---NCIRALCNANGAGDFEIG-KDTSLPVTFVHSTEDLITKIGGKPPSERTTL 178

Query: 172 AFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL------KTPYADELLGSKFCLHVKGFEV 225
           AFFAG+++  +R  LL  W N        G +      K  Y +++  SK+C+  +G++V
Sbjct: 179 AFFAGSMHGYLRPILLHYWENKEPDMMIVGPMPNSIEGKNAYMEQMKSSKYCICARGYQV 238

Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
           ++ R+ +++   C+PVII+++Y  P  +VLNW+SFS+ V   +IP L+ IL  I  E Y 
Sbjct: 239 HSPRVIEAILNECIPVIISDNYVPPLFEVLNWESFSVFVKEREIPNLRDILLSIPEENYR 298

Query: 286 LLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
            + S V  V++HF WH  P+ YDAF+M+++ +W  R
Sbjct: 299 AMHSRVKMVQQHFLWHEKPAKYDAFHMILHSIWYTR 334


>gi|255548864|ref|XP_002515488.1| catalytic, putative [Ricinus communis]
 gi|223545432|gb|EEF46937.1| catalytic, putative [Ricinus communis]
          Length = 474

 Score =  187 bits (476), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 116/338 (34%), Positives = 180/338 (53%), Gaps = 16/338 (4%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M + F+V+ Y       F N  +   +   G +  E  F+    KS F  + P +A  FF
Sbjct: 137 MEKRFKVWTYKEGEPPLFHNGPMKEIYSTEGQFIDE--FESG--KSLFSARRPDEAHAFF 192

Query: 61  LPFSIARM-----RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
           LP SI  +     R         + + +  Y+  IS KYP+WNR+ GADHF  +CH    
Sbjct: 193 LPVSIVSIVKYVYRPYSDYSRIRLQNVVKDYVGVISSKYPFWNRSDGADHFLTSCHDWAP 252

Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQE----DPPKLGSSKRNKL 171
                  E+     +V+C+++    G +  +DVSLP+I  R      +P  L    R  L
Sbjct: 253 DVSAGHPELYKYFTRVLCNANT-SEGFVPERDVSLPEIRLRDRKLSPEPHSLPPKDRRIL 311

Query: 172 AFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELLG-SKFCLHVKGFEVNTAR 229
           AFFAG  +  VR KL + W+  D ++  +    KT    EL+  SKFCL   G+EV + R
Sbjct: 312 AFFAGGEHGHVRTKLFEHWKGKDRDVQVYEYLPKTLNYTELMSHSKFCLCPSGWEVASPR 371

Query: 230 IADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQS 289
           + +++Y GCVPVII+++Y LPF+DVL+W  FS+ +    IP +K +L+ I   +YL +Q 
Sbjct: 372 VPEAIYSGCVPVIISDYYYLPFSDVLDWSKFSVHIPVARIPEIKTVLQKIPMRKYLTMQK 431

Query: 290 YVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
            V++V++HF+ +     YD  +MV++ +WLRR +VR+ 
Sbjct: 432 RVIQVQRHFKLNRPAKPYDVLHMVLHSIWLRRLNVRLN 469


>gi|356546130|ref|XP_003541484.1| PREDICTED: probable glycosyltransferase At5g20260-like [Glycine
           max]
          Length = 468

 Score =  187 bits (475), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 118/349 (33%), Positives = 190/349 (54%), Gaps = 30/349 (8%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGN-YASESYFKKAF---MKSHFVTKDPSKA 56
           M + F+V+VY    + P       V + P  + YA E  F        +S F  K+P +A
Sbjct: 129 MVKRFKVWVY-EEGDQPL------VHYGPVNDIYAIEGQFIDEMDNSKRSPFKAKNPDEA 181

Query: 57  DLFFLPFSIARMRH----------DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHF 106
             FFLPFS+  + H          D R   + +   +  YI  ++ KYPYWNR+ GADHF
Sbjct: 182 HAFFLPFSVVNVVHYAYKPYMSQNDYR--RDRLQRLVEDYIVVVADKYPYWNRSNGADHF 239

Query: 107 YVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPR--QEDPPKLG 164
            ++CH           ++  N I+V+C+++    G    +DVS+P+++    +  PP LG
Sbjct: 240 LLSCHDWAPEISHANPDLFKNFIRVLCNANNS-EGFQPKRDVSIPEVYLSVGKLGPPNLG 298

Query: 165 SSKRNK--LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELLG-SKFCLHV 220
               N+  LAFF+G  +  +R+ LL+ W++ D+++  H    K     EL+G SKFCL  
Sbjct: 299 QHPMNRTILAFFSGGAHGDIRKLLLKHWKDKDNQVQVHEYLPKGQNYTELMGLSKFCLCP 358

Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
            G+EV + R+ +++   CVPVII+ +Y LP +DVLNW  FSI ++  +IP +K IL+ ++
Sbjct: 359 SGYEVASPRVVEAINAVCVPVIISENYSLPLSDVLNWSQFSIQISVENIPDIKTILQNVT 418

Query: 281 SEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWS 329
            ++Y  L   V +VR+HF  H     +D  +M+++ +WLRR + R+  S
Sbjct: 419 QKKYKKLYRNVRRVRRHFVMHRPAKPFDLMHMIIHSIWLRRLNFRLMAS 467


>gi|334187794|ref|NP_197526.5| Exostosin family protein [Arabidopsis thaliana]
 gi|332005439|gb|AED92822.1| Exostosin family protein [Arabidopsis thaliana]
          Length = 458

 Score =  187 bits (474), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 124/341 (36%), Positives = 183/341 (53%), Gaps = 23/341 (6%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGN-YASESYFKKAFMK--SHFVTKDPSKAD 57
           M + F+V+VY      P       V   P  N Y+ E  F        S F   +P +A 
Sbjct: 125 MEKKFKVWVY-REGETPL------VHMGPMNNIYSIEGQFMDEIETGMSPFAANNPEEAH 177

Query: 58  LFFLPFSIARMRHD--RRIGT---EGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHS 112
            F LP S+A + H   R + T   E +      Y+  ++ KYPYWNR+ GADHFYV+CH 
Sbjct: 178 AFLLPVSVANIVHYLYRPLVTYSREQLHKVFLDYVDVVAHKYPYWNRSLGADHFYVSCHD 237

Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQE-DPPKLGSSK--- 167
                     E+  N I+V+C+++    G +  +DVS+P+I  P     PP+L  S    
Sbjct: 238 WAPDVSGSNPELMKNLIRVLCNANTS-EGFMPQRDVSIPEINIPGGHLGPPRLSRSSGHD 296

Query: 168 RNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTP-YADELLGSKFCLHVKGFEV 225
           R  LAFFAG  +  +R  LLQ W++ D E+  H    K   Y   +  ++FCL   G+EV
Sbjct: 297 RPILAFFAGGSHGYIRRILLQHWKDKDEEVQVHEYLAKNKDYFKLMATARFCLCPSGYEV 356

Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
            + R+  ++  GCVPVII++HY LPF+DVL+W  F+I V +  IP +K ILK IS   Y 
Sbjct: 357 ASPRVVAAINLGCVPVIISDHYALPFSDVLDWTKFTIHVPSKKIPEIKTILKSISWRRYR 416

Query: 286 LLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
           +LQ  VL+V++HF  +     +D   M+++ +WLRR ++R+
Sbjct: 417 VLQRRVLQVQRHFVINRPSQPFDMLRMLLHSVWLRRLNLRL 457


>gi|224087074|ref|XP_002308055.1| predicted protein [Populus trichocarpa]
 gi|222854031|gb|EEE91578.1| predicted protein [Populus trichocarpa]
          Length = 338

 Score =  187 bits (474), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 119/352 (33%), Positives = 187/352 (53%), Gaps = 41/352 (11%)

Query: 1   MNRSFRVYVY-----PHRRNDPFANVLLPVDFEPRGNYASESYFKKAFM--KSHFVTKDP 53
           M + F+++VY     P     P  N+           Y+ E  F       KSHF+ + P
Sbjct: 1   MEKRFKIWVYKEGELPVLHGGPVNNI-----------YSVEGQFLDEIERGKSHFIARHP 49

Query: 54  SKADLFFLPFSIARMRH---DRRI--GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYV 108
            +A  F LP S+A + H     R+      +   ++ Y+  I+ KY YWNRT GADHF +
Sbjct: 50  DEAHAFLLPLSVAYIMHYIYKPRVTFSRHQLQTLVTDYVRVIADKYTYWNRTNGADHFSI 109

Query: 109 ACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW--------PRQEDP 160
           +CH  G    +   E+    I+ +C+++    G    +DVS+P+I+        PR+   
Sbjct: 110 SCHDWGPDISRTNPELFKYFIRALCNANTS-EGFQPQRDVSVPEIFLHVGKLGLPREGAQ 168

Query: 161 PKLGSSKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYAD----ELLG-S 214
           P    SKR  LAFFAG  +  +R+ LL+ W++ D EI  H    +    +    +L+G S
Sbjct: 169 P---PSKRPILAFFAGGAHGRIRKVLLKRWKDKDGEIQVHEYVTQRKKNNNLYFKLMGQS 225

Query: 215 KFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKK 274
           KFCL   G EV + R+  ++  GCVPVII+++Y LPF+DVL+W  FS+ + +  I  +K 
Sbjct: 226 KFCLCPSGHEVASPRVVTAIQLGCVPVIISDNYSLPFSDVLDWSKFSVNIPSEKIQEIKT 285

Query: 275 ILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
           ILKGIS + YL +Q  V++ ++HF  +     YD  +M+++ +WLRR + R+
Sbjct: 286 ILKGISHKRYLTMQRRVIQAQRHFTLNRPAKPYDMIHMILHSIWLRRLNHRM 337


>gi|292630785|sp|Q3E9A4.3|GLYT5_ARATH RecName: Full=Probable glycosyltransferase At5g20260
          Length = 466

 Score =  187 bits (474), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 124/341 (36%), Positives = 183/341 (53%), Gaps = 23/341 (6%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGN-YASESYFKKAFMK--SHFVTKDPSKAD 57
           M + F+V+VY      P       V   P  N Y+ E  F        S F   +P +A 
Sbjct: 133 MEKKFKVWVY-REGETPL------VHMGPMNNIYSIEGQFMDEIETGMSPFAANNPEEAH 185

Query: 58  LFFLPFSIARMRHD--RRIGT---EGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHS 112
            F LP S+A + H   R + T   E +      Y+  ++ KYPYWNR+ GADHFYV+CH 
Sbjct: 186 AFLLPVSVANIVHYLYRPLVTYSREQLHKVFLDYVDVVAHKYPYWNRSLGADHFYVSCHD 245

Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQE-DPPKLGSSK--- 167
                     E+  N I+V+C+++    G +  +DVS+P+I  P     PP+L  S    
Sbjct: 246 WAPDVSGSNPELMKNLIRVLCNANTS-EGFMPQRDVSIPEINIPGGHLGPPRLSRSSGHD 304

Query: 168 RNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTP-YADELLGSKFCLHVKGFEV 225
           R  LAFFAG  +  +R  LLQ W++ D E+  H    K   Y   +  ++FCL   G+EV
Sbjct: 305 RPILAFFAGGSHGYIRRILLQHWKDKDEEVQVHEYLAKNKDYFKLMATARFCLCPSGYEV 364

Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
            + R+  ++  GCVPVII++HY LPF+DVL+W  F+I V +  IP +K ILK IS   Y 
Sbjct: 365 ASPRVVAAINLGCVPVIISDHYALPFSDVLDWTKFTIHVPSKKIPEIKTILKSISWRRYR 424

Query: 286 LLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
           +LQ  VL+V++HF  +     +D   M+++ +WLRR ++R+
Sbjct: 425 VLQRRVLQVQRHFVINRPSQPFDMLRMLLHSVWLRRLNLRL 465


>gi|224120366|ref|XP_002331030.1| predicted protein [Populus trichocarpa]
 gi|222872960|gb|EEF10091.1| predicted protein [Populus trichocarpa]
          Length = 339

 Score =  186 bits (473), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 112/303 (36%), Positives = 171/303 (56%), Gaps = 26/303 (8%)

Query: 45  KSHFVTKDPSKADLFFLPFSIARMRH---DRRI--GTEGIPDFISHYIFNISQKYPYWNR 99
           KS F+ + P +A  F LP S+A + H     RI    + +   ++ Y+  I+ K+PYWNR
Sbjct: 41  KSQFIARHPEEAQAFLLPVSVAYIIHYVYRPRITFSRDQLQRLVTDYVRVIADKHPYWNR 100

Query: 100 TGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW----- 154
           T GADHF V+CH       +    +    I+ +C+++    G    +DVS+P+I+     
Sbjct: 101 THGADHFSVSCHDWAPDVSRADPGLFKYFIRALCNANTS-EGFQPQRDVSIPEIFLPVGK 159

Query: 155 --PRQE--DPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYAD 209
             P QE   PP    SKR+ LAFFAG  +  +R+ LL+ W+  D EI  H    +    +
Sbjct: 160 LGPPQEYAQPP----SKRSILAFFAGGAHGHIRKILLERWKEKDDEIQVHEYLTRKNKKN 215

Query: 210 -----ELLG-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIV 263
                EL+G SKFCL   G EV + R+  ++  GCVPV I+++Y LPF+DVL+W  FS+ 
Sbjct: 216 NNLYFELMGQSKFCLCPSGHEVASPRVVTAIQLGCVPVTISDNYSLPFSDVLDWSKFSVD 275

Query: 264 VATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSS 323
           + +  IP +K ILKGIS   YL +Q  V+++R+HF  +     YD  +M+++ +WLRR  
Sbjct: 276 IPSEKIPDIKIILKGISVRRYLTMQRRVMQIRRHFTLNRPAQPYDMLHMILHSVWLRRLD 335

Query: 324 VRV 326
           V++
Sbjct: 336 VKL 338


>gi|302799042|ref|XP_002981280.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300150820|gb|EFJ17468.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 352

 Score =  186 bits (472), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 113/336 (33%), Positives = 185/336 (55%), Gaps = 23/336 (6%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH-FVTKDPSKADLF 59
           M R FR++VY     D +  ++   +    G YASE  F     + + F   +P  A +F
Sbjct: 21  MQRVFRIFVY----KDGYKPLMH--EGPKTGIYASEGLFIATMERGNPFAVTEPKIATMF 74

Query: 60  FLPFSIARM---RHDRRI-GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
           F+PFS+ +M    +D      + I  +I+ Y+  ++ KYPY N T G DHF+V+CH    
Sbjct: 75  FIPFSLKQMVDYMYDTNSHSMKNIQSYIAGYLRRLASKYPYMNATNGIDHFFVSCHDWAL 134

Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQ--EDPPKLGSSKRNK--L 171
            A++K  + + N ++VVC++     G    +DVSLP+   RQ    P    +S  ++  L
Sbjct: 135 MALEKQ-DCQRNIVKVVCNADSS-RGFNTSRDVSLPETRVRQGKHSPIIRDTSGMDRPYL 192

Query: 172 AFFAGAVNSPVREKLLQVWRNDS------EIYAHSGRLKTPYADELLGSKFCLHVKGFEV 225
           AFFAG ++  +R  LL  W++        E+   S   +  Y++ +  SK+C+   GFEV
Sbjct: 193 AFFAGQMHGKLRPVLLAHWKDKDPEMKIYEVLPPSVAERISYSEHMRLSKYCICAAGFEV 252

Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
           N+ R+ +++   CVPVI+A+++ LPF++V+NW S S+ VA  D+  LK IL GI    Y 
Sbjct: 253 NSPRLVEAIVNECVPVILADNFVLPFSEVINWDSISVTVAEKDVANLKAILAGIPLRTYK 312

Query: 286 LLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
            +Q+ +  V++HF+W   P  YD F M+++ LW ++
Sbjct: 313 EMQARLKHVKRHFEWKNSPEKYDIFNMIVHSLWTQQ 348


>gi|224087070|ref|XP_002308053.1| predicted protein [Populus trichocarpa]
 gi|222854029|gb|EEE91576.1| predicted protein [Populus trichocarpa]
          Length = 339

 Score =  186 bits (472), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 112/303 (36%), Positives = 171/303 (56%), Gaps = 26/303 (8%)

Query: 45  KSHFVTKDPSKADLFFLPFSIARMRH---DRRI--GTEGIPDFISHYIFNISQKYPYWNR 99
           KS F+ + P +A  F LP S+A + H     RI    + +   ++ Y+  I+ K+PYWNR
Sbjct: 41  KSQFIARHPEEAQAFLLPVSVAYIIHYVYRPRITFSRDQLQRLVTDYVRVIADKHPYWNR 100

Query: 100 TGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW----- 154
           T GADHF V+CH       +    +    I+ +C+++    G    +DVS+P+I+     
Sbjct: 101 THGADHFSVSCHDWAPDVSRADPGLFKYFIRALCNANTS-EGFQPQRDVSIPEIFLPVGK 159

Query: 155 --PRQE--DPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYAD 209
             P QE   PP    SKR+ LAFFAG  +  +R+ LL+ W+  D EI  H    +    +
Sbjct: 160 LGPPQEYAQPP----SKRSILAFFAGGAHGHIRKILLERWKEKDDEIQVHEYLTQKNKKN 215

Query: 210 -----ELLG-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIV 263
                EL+G SKFCL   G EV + R+  ++  GCVPV I+++Y LPF+DVL+W  FS+ 
Sbjct: 216 NNLYFELMGQSKFCLCPSGHEVASPRVVTAIQLGCVPVTISDNYSLPFSDVLDWSKFSVD 275

Query: 264 VATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSS 323
           + +  IP +K ILKGIS   YL +Q  V+++R+HF  +     YD  +M+++ +WLRR  
Sbjct: 276 IPSEKIPDIKIILKGISVRRYLTMQRRVMQIRRHFTLNRPAQPYDMLHMILHSVWLRRLD 335

Query: 324 VRV 326
           V++
Sbjct: 336 VKL 338


>gi|224120362|ref|XP_002331029.1| predicted protein [Populus trichocarpa]
 gi|222872959|gb|EEF10090.1| predicted protein [Populus trichocarpa]
          Length = 336

 Score =  186 bits (471), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 119/339 (35%), Positives = 183/339 (53%), Gaps = 18/339 (5%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M + FR++VY         +  L + +   G +  E        KS F    P +A  F 
Sbjct: 1   MEKRFRIWVYREGELPLVHSAPLNLIYSIEGQFLDEM----ESGKSPFAASHPDEAHTFL 56

Query: 61  LPFSIARMRHD--RRIGTEG---IPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
           LP S+A + H   R + T     +   +  Y+  ++ KYPYWNRT GADHF V+CH    
Sbjct: 57  LPISVAYIIHYIYRPLVTFSRVELQRLVQDYVTVVAGKYPYWNRTEGADHFLVSCHDWAP 116

Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQE-DPPKLG--SSKRNKL 171
              +    +  N I+V+C+++         +DVS+P+I  P  +  PP  G   SKR+  
Sbjct: 117 DISRANPRLYKNFIRVLCNANT-SERFEPRRDVSIPEINIPFGKFGPPGKGLPPSKRSIF 175

Query: 172 AFFAGAVNSPVREKLLQVWRN-DSEIYAHS---GRLKTPYADELLGSKFCLHVKGFEVNT 227
           AFFAG  +  +R+ LL+ W++ D EI  H       K  Y   +  SKFCL   G+EV +
Sbjct: 176 AFFAGGAHGYIRKLLLEHWKDKDDEIQVHEYLDHNKKNDYFKLMGQSKFCLCPSGYEVAS 235

Query: 228 ARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLL 287
            R+  ++  GC+PV I+++Y LPF+DVL+W  FS+ + +  IP +K ILK IS   YL+L
Sbjct: 236 PRVVTAIQSGCIPVTISDNYTLPFSDVLDWSKFSVNIPSEKIPEIKTILKKISFRRYLIL 295

Query: 288 QSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
           Q  V+K+R+HF+ +     YD  +M+++ +WLRR +VR+
Sbjct: 296 QGRVIKIRRHFKLNRPAQPYDMLHMILHSIWLRRLNVRL 334


>gi|356518828|ref|XP_003528079.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
           max]
          Length = 505

 Score =  185 bits (470), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 118/339 (34%), Positives = 186/339 (54%), Gaps = 20/339 (5%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKS-HFVTKDPSKADLF 59
           M + F+++VY       F N L       +  YA+E  F     K  ++ T DP +A ++
Sbjct: 173 MEKVFKIFVYEEGEPPLFHNGL------SKDIYATEGRFIHEMEKGRYYRTYDPDEAFVY 226

Query: 60  FLPFSIARMR---HDR--RIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
           +LPFS+  +    +DR      + +   +  YI  I+ K+P+WNR+ G DH  ++CH  G
Sbjct: 227 YLPFSVVMLVEYVYDRGSNYNLDPLGLVVKDYIQIIAHKHPFWNRSLGHDHVMLSCHDWG 286

Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS---SKRNKL 171
                    +  NAI+V+C+++    G    KDVS P+I   + +   LG    S+R  L
Sbjct: 287 PLVSSYVDHLYNNAIRVLCNANT-SEGFKPAKDVSFPEIKLIKGEVKGLGGYPPSQRTIL 345

Query: 172 AFFAGAVNSPVREKLLQVWRN---DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTA 228
           AFFAG ++  +R  LL  W+N   D +IY         Y  +L  SKFCL   G+EV + 
Sbjct: 346 AFFAGHLHGYIRYLLLSTWKNKDQDMQIYEELPE-GISYYTKLRSSKFCLCPSGYEVASP 404

Query: 229 RIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
           R+ ++++  CVPV+I++ Y  PF+DVLNW SFS+ V   DIP +K+IL  IS ++YL + 
Sbjct: 405 RVVEAIFAECVPVLISDSYVPPFSDVLNWNSFSVQVNVKDIPNIKRILMEISEKQYLRMH 464

Query: 289 SYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
             V +V++HF  +  P  YD F+M ++ +WLRR ++ +Q
Sbjct: 465 KRVKQVQRHFVPNEPPKRYDMFHMTVHSIWLRRLNINIQ 503


>gi|15231488|ref|NP_187419.1| putative glycosyltransferase [Arabidopsis thaliana]
 gi|75207507|sp|Q9SSE8.1|GLYT1_ARATH RecName: Full=Probable glycosyltransferase At3g07620
 gi|6466945|gb|AAF13080.1|AC009176_7 hypothetical protein [Arabidopsis thaliana]
 gi|332641055|gb|AEE74576.1| putative glycosyltransferase [Arabidopsis thaliana]
          Length = 470

 Score =  185 bits (470), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 119/340 (35%), Positives = 187/340 (55%), Gaps = 23/340 (6%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFV---TKDPSKAD 57
           M + F++YVY    + P  +  L  D      Y+ E  F   FM++  +   T+DP KA 
Sbjct: 139 MEKMFKIYVY-EEGDPPIFHYGLCKDI-----YSMEGLFLN-FMENDVLKYRTRDPDKAH 191

Query: 58  LFFLPFSIARMRH---DRRIGTEGIPD-FISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
           ++FLPFS+  + H   D  +  + + +  I+ Y+  IS+KYPYWN + G DHF ++CH  
Sbjct: 192 VYFLPFSVVMILHHLFDPVVRDKAVLERVIADYVQIISKKYPYWNTSDGFDHFMLSCHDW 251

Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDVSLPQIWPRQEDPPKLGSS----KR 168
           G  A     ++  N+I+V+C+++  IS +    KD   P+I     D   L        R
Sbjct: 252 GHRATWYVKKLFFNSIRVLCNAN--ISEYFNPEKDAPFPEINLLTGDINNLTGGLDPISR 309

Query: 169 NKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEVN 226
             LAFFAG  +  +R  LL  W+  D +I  +        Y + +  S+FC+   G EV 
Sbjct: 310 TTLAFFAGKSHGKIRPVLLNHWKEKDKDILVYENLPDGLDYTEMMRKSRFCICPSGHEVA 369

Query: 227 TARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLL 286
           + R+ +++Y GCVPV+I+ +Y LPF+DVLNW+ FS+ V+  +IP LK+IL  I  E Y+ 
Sbjct: 370 SPRVPEAIYSGCVPVLISENYVLPFSDVLNWEKFSVSVSVKEIPELKRILMDIPEERYMR 429

Query: 287 LQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
           L   V KV++H   +  P  YD F M+++ +WLRR +V++
Sbjct: 430 LYEGVKKVKRHILVNDPPKRYDVFNMIIHSIWLRRLNVKL 469


>gi|359476146|ref|XP_002285237.2| PREDICTED: probable glycosyltransferase At5g25310-like [Vitis
           vinifera]
 gi|296082059|emb|CBI21064.3| unnamed protein product [Vitis vinifera]
          Length = 478

 Score =  185 bits (470), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 118/341 (34%), Positives = 184/341 (53%), Gaps = 23/341 (6%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYF--KKAFMKSHFVTKDPSKADL 58
           M +  +VYVY          V +  D   +  Y  E  F  +       F T+D  +A +
Sbjct: 145 MEKRLKVYVYEE------GEVPMIHDGPCKDIYTIEGRFIHEIEHGDGKFRTRDAERAHV 198

Query: 59  FFLPFSIARM----RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
           +F+PFS+  M              +  F+S Y+  IS KYP+WN T GADHF +ACH  G
Sbjct: 199 YFMPFSVTWMVKYLYKPLTYNLTPLRQFVSDYVSVISTKYPFWNTTQGADHFMLACHDWG 258

Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI--WPRQEDPPKLGSSKRNK-- 170
             A +    +   +I+V+C+++    G    KDVSLP+I  +    +P  L     N   
Sbjct: 259 PHASRGHPVLYNTSIRVLCNANT-SEGFNPQKDVSLPEIHLYGGNVNPKLLSPPPPNSPR 317

Query: 171 --LAFFAGAVNSPVREKLLQVWRN---DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEV 225
             LAFFAG ++ P+R  L+Q W     D  +Y +  +    Y   +L SK+CL   G EV
Sbjct: 318 PFLAFFAGGLHGPIRPILIQHWMGRDTDLRVYEYLPK-DMDYYSLMLQSKYCLCPSGHEV 376

Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
            + RI +++Y  CVPVI+++HY LPF+DVL W++FS+ V   +IP LK++L+ IS E+Y 
Sbjct: 377 ASPRIVEAIYSECVPVILSDHYVLPFSDVLRWEAFSVKVEASEIPRLKEVLQAISEEKYT 436

Query: 286 LLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
            L+  V  VR+HF+ +     +D F+M+++ +WLRR ++++
Sbjct: 437 RLKEGVRAVRRHFELNQPAKRFDVFHMILHSVWLRRINLKL 477


>gi|224142563|ref|XP_002324625.1| predicted protein [Populus trichocarpa]
 gi|222866059|gb|EEF03190.1| predicted protein [Populus trichocarpa]
          Length = 336

 Score =  185 bits (470), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 115/347 (33%), Positives = 187/347 (53%), Gaps = 31/347 (8%)

Query: 1   MNRSFRVYVY-----PHRRNDPFANVLLPVDFEPRGNYASESYF--KKAFMKSHFVTKDP 53
           M + F+++VY     P     P  N+           Y  E  F  +    KS F    P
Sbjct: 1   MEKRFKIWVYKEGERPLVHGGPLNNI-----------YGVEGQFLDEMEHGKSPFAASHP 49

Query: 54  SKADLFFLPFSIARM-----RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYV 108
            +A +F LP S+A +     +       + +   +  Y+  ++ KYPYWNR+ GADHF V
Sbjct: 50  DEAHMFLLPISVAYIISYVYKPIVTYSRDELQRLVQDYVGVVADKYPYWNRSKGADHFLV 109

Query: 109 ACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQE-DPPKLG-- 164
           +CH           ++  N I+V+C+++         +DVS+P+I  P  +  PP  G  
Sbjct: 110 SCHDWAPDISGANPDLYKNFIRVLCNANTS-ERFEPRRDVSIPEINIPNGKLGPPHKGLP 168

Query: 165 SSKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHS--GRLKTPYADELLGSKFCLHVK 221
            SKR+  AFFAG  +  +R+ LL+ W++ D EI  H    +  T Y + +  SKFCL   
Sbjct: 169 PSKRSIFAFFAGGAHGYIRKVLLENWKDKDDEIQVHEYLDKKGTDYFELMGQSKFCLCPS 228

Query: 222 GFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISS 281
           G+EV + R+  ++  GCVPV I+++Y LPF+DVL+W  FS+ + +  IP +K ILK IS 
Sbjct: 229 GYEVASPRVVTAIQLGCVPVTISDNYTLPFSDVLDWSKFSVHIPSEKIPEIKTILKKISP 288

Query: 282 EEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQW 328
           + YL++Q  V++V++HF+ +     YD  +M+++ +W+RR +V+V +
Sbjct: 289 QRYLMMQMRVIQVQRHFELNRPARPYDLLHMLLHSVWVRRLNVKVPY 335


>gi|302772499|ref|XP_002969667.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300162178|gb|EFJ28791.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 352

 Score =  185 bits (469), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 113/336 (33%), Positives = 183/336 (54%), Gaps = 23/336 (6%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH-FVTKDPSKADLF 59
           M R FR++VY     D +  ++   +    G YASE  F     + + F   +P  A +F
Sbjct: 21  MQRVFRIFVY----KDGYKPLMH--EGPKTGIYASEGLFIATMERGNPFAVTEPKIATMF 74

Query: 60  FLPFSIARM---RHDRRI-GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
           F+PFS+ +M    +D      + I  +I+ Y+  ++ KYPY N T G DHF+V+CH    
Sbjct: 75  FIPFSLKQMVDYMYDTNSHSMKNIQSYIAGYLRRLASKYPYMNATNGIDHFFVSCHDWAL 134

Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQ-EDPP---KLGSSKRNKL 171
            A++K  + + N ++VVC++     G    +DVSLP+   RQ +  P    +    R  L
Sbjct: 135 MALEKQ-DCQRNIVKVVCNADSS-RGFNTSRDVSLPETRVRQGKHSPIIRDISGMDRPYL 192

Query: 172 AFFAGAVNSPVREKLLQVWRNDS------EIYAHSGRLKTPYADELLGSKFCLHVKGFEV 225
           AFFAG ++  +R  LL  W++        E+   S   +  Y++ +  SK+C+   GFEV
Sbjct: 193 AFFAGQMHGKLRPVLLAHWKDKDPEMKIYEVLPPSVAERISYSEHMRLSKYCICAAGFEV 252

Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
           N+ R+ +++   CVPVI+A+++ LPF++V+NW S S+ VA  D+  LK IL GI    Y 
Sbjct: 253 NSPRLVEAIVNECVPVILADNFVLPFSEVINWDSISVTVAEKDVANLKAILAGIPLRRYK 312

Query: 286 LLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
            +Q+ +  V++HF W   P  YD F M+++ LW ++
Sbjct: 313 EMQARLKHVKRHFVWKNSPEKYDIFNMIVHSLWTQQ 348


>gi|168043245|ref|XP_001774096.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674642|gb|EDQ61148.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 351

 Score =  184 bits (468), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 112/338 (33%), Positives = 176/338 (52%), Gaps = 17/338 (5%)

Query: 1   MNRSFRVYVYPHRRNDPFAN-------VLLPVDFEPRGNYASESYFKKAFMKSHFVTKDP 53
           M ++ R+Y+YP  +N  F           +  +     +  ++++F        FVT D 
Sbjct: 17  MAKNLRIYLYPASQNYNFTQYEYGMNPSEMVSELGVETSSTTDTFFNLLVESKRFVTDDA 76

Query: 54  SKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
             A L+FLP SI R+     +G   + + + HY+  +   Y  W+ + GADHFY + H+ 
Sbjct: 77  DGAHLYFLPISIDRVW--AAVGPAKVGEHLRHYLQWLRNTYKLWDLSLGADHFYFSSHAY 134

Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQ--EDPPKLGSSKRNKL 171
                +   E+  NAIQV  S          HKD+SLP    +   E    +G+S+R KL
Sbjct: 135 DPINHRNNLELTKNAIQVASSPLRRNQNFFPHKDISLPSYKSQHIAEVQNLVGASQRPKL 194

Query: 172 AFFAGAVNS--PVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTAR 229
            F +       P+   ++Q W +DS+ +  S    +P  ++LL S+FC+ V    +    
Sbjct: 195 VFVSSPPEDIDPIVASVIQKWTSDSDFHVESADQPSPPFEKLLSSRFCVSVSPQAM--LN 252

Query: 230 IADSLYYGCVPVIIANH--YDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLL 287
           + DSL  GCVPV+IA+   YDLPF DVLNWK FS+V+   + P LK +L  IS++EY  +
Sbjct: 253 VVDSLRLGCVPVLIADSIIYDLPFQDVLNWKEFSVVLGVKESPNLKTLLSSISTDEYRKM 312

Query: 288 QSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           Q    +  KH +W+  P  +DAF+M ++ELW+RR S++
Sbjct: 313 QYLGHQASKHMEWNDPPKPWDAFHMTLHELWVRRHSIK 350


>gi|297833558|ref|XP_002884661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297330501|gb|EFH60920.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 470

 Score =  184 bits (467), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 119/340 (35%), Positives = 187/340 (55%), Gaps = 23/340 (6%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFV---TKDPSKAD 57
           M R F++YVY    + P  +  L  D      Y+ E  F   FM++  +   T+DP KA 
Sbjct: 139 MERMFKIYVY-EEGDPPIFHYGLCKDI-----YSMEGLFLN-FMENDVLKYRTRDPDKAH 191

Query: 58  LFFLPFSIARMRH---DRRIGTEGIPD-FISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
           ++FLPFS+  + H   D  +  + + +  I+ Y+  IS+KYPYWN + G DHF ++CH  
Sbjct: 192 VYFLPFSVVMILHHLFDPVVRDKAVLESVIADYVQIISKKYPYWNTSDGFDHFMLSCHDW 251

Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDVSLPQIWPRQEDPPKLGSS----KR 168
           G  A     ++  N+I+V+C+++  IS +    KD   P+I     +   L        R
Sbjct: 252 GHRATWYVKKLFFNSIRVLCNAN--ISEYFNPEKDAPFPEINLLTGEINNLTGGLDPISR 309

Query: 169 NKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKT-PYADELLGSKFCLHVKGFEVN 226
             LAFFAG  +  +R  LL  W+  D +I  +    +   Y + +  ++FC+   G EV 
Sbjct: 310 KTLAFFAGKSHGKIRPVLLNHWKEKDKDILVYENLPEDLNYTEMMRKTRFCICPSGHEVA 369

Query: 227 TARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLL 286
           + RI +++Y GCVPV+I+ +Y LPF+DVLNW+ FS+ V+  +IP LK+IL  I  E Y+ 
Sbjct: 370 SPRIPEAIYSGCVPVLISENYVLPFSDVLNWEKFSVSVSVKEIPELKRILMDIPEERYMR 429

Query: 287 LQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
           L   V KV+ H   +  P  YD F M+++ +WLRR +V++
Sbjct: 430 LYEGVKKVKMHILVNDPPKRYDVFNMIIHSIWLRRLNVKL 469


>gi|2245029|emb|CAB10449.1| limonene cyclase like protein [Arabidopsis thaliana]
 gi|7268424|emb|CAB78716.1| limonene cyclase like protein [Arabidopsis thaliana]
          Length = 1024

 Score =  184 bits (467), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 120/334 (35%), Positives = 182/334 (54%), Gaps = 27/334 (8%)

Query: 1    MNRSFRVYVYPHRRNDPFANVLLPVDFEP--RGNYASESYFKKAFMKSH--FVTKDPSKA 56
            M    +VY+YP            P+  EP   G YASE +F K  M+S+  FVTK+P +A
Sbjct: 684  MELILKVYIYPDGDK--------PIFHEPHLNGIYASEGWFMK-LMESNKQFVTKNPERA 734

Query: 57   DLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRS 116
             LF++P+S+ +++  ++  +   P         + Q       T G    +      G  
Sbjct: 735  HLFYMPYSVKQLQ--KKTTSTCSPSNTPSGTALMGQIISLSLATIGYRKCFYVKDQWGPY 792

Query: 117  AMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS----SKRNK 170
             + +  E+K NAI+ +C++       +  KDVSLP+   R    P   +G+    S+R  
Sbjct: 793  TVNEHPELKRNAIKALCNADLSDGIFVPGKDVSLPETSIRNAGRPLRNIGNGNRVSQRPI 852

Query: 171  LAFFAGAVNSPVREKLLQVWRN---DSEIYA---HSGRLKTPYADELLGSKFCLHVKGFE 224
            LAFFAG ++  VR KLL+ WRN   D +IY    H+   K  Y   +  SK+CL   G+E
Sbjct: 853  LAFFAGNLHGRVRPKLLKHWRNKDEDMKIYGPLPHNVARKMTYVQHMKSSKYCLCPMGYE 912

Query: 225  VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
            VN+ RI +++YY CVPV+IA+++ LPF+DVL+W +FS+VV   +IP LK+IL  I    Y
Sbjct: 913  VNSPRIVEAIYYECVPVVIADNFMLPFSDVLDWSAFSVVVPEKEIPRLKEILLEIPMRRY 972

Query: 285  LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELW 318
            L +QS V  V++HF W   P  YD F+M+++ +W
Sbjct: 973  LKMQSNVKMVQRHFLWSPKPRKYDVFHMILHSIW 1006


>gi|147789666|emb|CAN71921.1| hypothetical protein VITISV_031190 [Vitis vinifera]
          Length = 310

 Score =  184 bits (466), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 104/291 (35%), Positives = 167/291 (57%), Gaps = 13/291 (4%)

Query: 45  KSHFVTKDPSKADLFFLPFSIARMRH----DRRIGTEGIPDFISHYIFNISQKYPYWNRT 100
           KS F+ + P +A+ F++P S+ R+ H          + IP  +  YI  ++ KYPYWNR+
Sbjct: 18  KSQFLARHPDEANAFYIPMSLTRVVHFIYEPPHYXGKWIPRLVXDYINFVADKYPYWNRS 77

Query: 101 GGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQE- 158
            GADHF V+CH           ++  + I+ +C+++     H   +D+S+P+I  PR + 
Sbjct: 78  KGADHFLVSCHDWAPDVSALKPDLYKHFIRALCNANTSERFHPI-RDISIPEINIPRGKL 136

Query: 159 DPPKLGS--SKRNKLAFFAGAVNSPVREKLLQVWR---NDSEIYAHSGRLKTPYADELLG 213
            PP L    +KR  LAFFAG  +  VR  L + W+   ++ +++    R +  Y+  +  
Sbjct: 137 GPPHLDQPPNKRPILAFFAGGAHGYVRSVLFKYWKEKDDEVQVFERLPRNRN-YSKSMGD 195

Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLK 273
           SKFCL   G+EV + RI  ++  GCVP+II +HY LPF+D L+W  FSI + +  IP +K
Sbjct: 196 SKFCLCPSGYEVASPRIVKAIAAGCVPMIICDHYSLPFSDXLDWSKFSIYITSDKIPEIK 255

Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSV 324
           KILK + +E YL +Q  V +V++HF  +     YD  +M+++ +W RR +V
Sbjct: 256 KILKAVPTETYLEMQKRVKQVQRHFAINRPAXPYDMLHMILHSVWXRRLNV 306


>gi|359484704|ref|XP_002264922.2| PREDICTED: probable glycosyltransferase At5g20260-like [Vitis
           vinifera]
          Length = 467

 Score =  183 bits (465), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 108/294 (36%), Positives = 174/294 (59%), Gaps = 14/294 (4%)

Query: 46  SHFVTKDPSKADLFFLPFSIARMRH---DRRIGTEG--IPDFISHYIFNISQKYPYWNRT 100
           SHFV   P +A +F++P S+ R+ H      +   G  +   ++ YI+ +S KYPYWNR+
Sbjct: 175 SHFVAGHPDEAHVFYIPISVTRIAHYIYSPPVDYSGHMLQRLVTDYIYVVSDKYPYWNRS 234

Query: 101 GGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQE- 158
            GADHF V+CH           ++  + I+V+C+++         +D+SLP++  P+ + 
Sbjct: 235 NGADHFLVSCHDWAPEISIVTPDLYKHFIRVLCNANTSERFQPI-RDISLPEVNIPKGKL 293

Query: 159 DPPKLGS--SKRNKLAFFAGAVNSPVREKLLQVWR-NDSEI--YAHSGRLKTPYADELLG 213
            PP L    ++R+ LAFFAG  +  +R  L + W+ ND E+  Y H    +  YA  +  
Sbjct: 294 GPPHLDKPPNQRHILAFFAGRESGYMRTLLFRSWKENDDEVQVYEHLPSNRD-YAKSMGD 352

Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLK 273
           SKFCL   G+EV + R+ +++  GCVPVII ++Y LPF++VL W  FSI + +  IP +K
Sbjct: 353 SKFCLCPSGWEVASPRVVEAIAAGCVPVIICDYYVLPFSEVLVWSKFSINITSDKIPEIK 412

Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
           KILK + +E YL +Q  V +V++HF  +     YD  +M+++ +WLRR +VR++
Sbjct: 413 KILKAVPNERYLRMQKRVKQVQRHFVINRPAQPYDMLHMILHSVWLRRLNVRLR 466


>gi|356565153|ref|XP_003550809.1| PREDICTED: probable glycosyltransferase At5g20260-like [Glycine
           max]
          Length = 398

 Score =  183 bits (465), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 102/299 (34%), Positives = 169/299 (56%), Gaps = 15/299 (5%)

Query: 45  KSHFVTKDPSKADLFFLPFSIARMRH--------DRRIGTEGIPDFISHYIFNISQKYPY 96
           +S F  ++P +A  FFLP S+  + H              + +   +  YI  ++ KYPY
Sbjct: 100 RSPFKARNPDEAHAFFLPLSVVNVVHYVYKPYMSQNDYSRDRLQRLVEDYIGVVADKYPY 159

Query: 97  WNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW-- 154
           WNR+ GADHF ++CH           ++  N I+V+C+++    G    +DVS+P+++  
Sbjct: 160 WNRSNGADHFLLSCHDWAPEISHANPDLFKNFIRVLCNANNS-EGFQPKRDVSIPEVYLP 218

Query: 155 PRQEDPPKLGSSKRNK--LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADEL 211
             +  PP LG    N+  LAFF+G  +  +R+ LL+ W++ D+ +  H    K     EL
Sbjct: 219 VGKLGPPNLGQHPMNRTILAFFSGGAHGDIRKLLLKHWKDKDNHVQVHEYLPKGQNYTEL 278

Query: 212 LG-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIP 270
           +G SKFCL   G+EV + R+ +++  GCVPVII+ +Y LPF+DVLNW  FSI ++  +I 
Sbjct: 279 MGLSKFCLCPSGYEVASPRVVEAINAGCVPVIISENYSLPFSDVLNWSQFSIQISVENIS 338

Query: 271 LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWS 329
            +K IL+ ++ ++Y  L   V +V++HF  +     +D  +M+++ +WLRR + RV  S
Sbjct: 339 DIKTILQNVTQKKYKKLHRNVRRVQRHFVMNRPAKPFDLMHMILHSIWLRRLNFRVMAS 397


>gi|359484708|ref|XP_002264880.2| PREDICTED: probable glycosyltransferase At5g20260-like [Vitis
           vinifera]
          Length = 467

 Score =  183 bits (465), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 106/294 (36%), Positives = 175/294 (59%), Gaps = 14/294 (4%)

Query: 46  SHFVTKDPSKADLFFLPFSIARMRH---DRRIGTEG--IPDFISHYIFNISQKYPYWNRT 100
           SHF+   P  A +F++P S+ R+ H      +   G  +   ++ YI+ +S KYPYWNR+
Sbjct: 175 SHFMAGHPDVAHVFYIPISVTRIAHYIYSPPVDYSGHMLQRLVTDYIYVVSNKYPYWNRS 234

Query: 101 GGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQE- 158
            GADHF V+CH           ++  + I+V+C+++         +D+SLP++  P+ + 
Sbjct: 235 NGADHFLVSCHDWAPEISIVTPDLYKHFIRVLCNANTSERFQPI-RDISLPEVNIPKGKL 293

Query: 159 DPPKLGS--SKRNKLAFFAGAVNSPVREKLLQVWR-NDSEI--YAHSGRLKTPYADELLG 213
            PP L    ++R+ LAFF+G  +  +R  L + W+ ND E+  Y H    +  YA  ++ 
Sbjct: 294 GPPHLDKPPNQRHILAFFSGRESGYMRTLLFRSWKENDDEVQVYEHLPSNRD-YAKSMVD 352

Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLK 273
           SKFCL   G+EV + R+ +++  GCVPVII ++Y LPF++VL+W  FSI + +  IP +K
Sbjct: 353 SKFCLCPSGWEVASPRVVEAIAAGCVPVIICDYYVLPFSEVLDWSKFSINITSDKIPEIK 412

Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
           KILK + +E YL +Q  V +V++HF  +     YD  +M+++ +WLRR +VR++
Sbjct: 413 KILKAVPNERYLRMQKRVKQVQRHFVINRPAQPYDMLHMILHSVWLRRLNVRLR 466


>gi|413952956|gb|AFW85605.1| hypothetical protein ZEAMMB73_586394 [Zea mays]
          Length = 509

 Score =  183 bits (465), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 103/291 (35%), Positives = 166/291 (57%), Gaps = 15/291 (5%)

Query: 50  TKDPSKADLFFLPFSIARM-RHDRRIGT---EGIPDFISHYIFNISQKYPYWNRTGGADH 105
           T +P++A  FFLPFS+++M +   R  T     +   ++ Y+  ++ ++PYWNR+ GADH
Sbjct: 218 TSEPARAHAFFLPFSVSQMVQFAYRPNTYDKTPLRAIVADYVRVVASRHPYWNRSAGADH 277

Query: 106 FYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI------WPRQED 159
           F +ACH  G  A     E+  N I+ +C+++    G    +DVS+P I       PRQ  
Sbjct: 278 FMLACHDWGPEASTGHPELHANGIRALCNANSS-EGFRPWQDVSVPDINLYDGDMPRQLL 336

Query: 160 PPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSE----IYAHSGRLKTPYADELLGSK 215
            P  G + R  LAFFAG  +  +R+ LL+ W+        +Y H       Y   +  ++
Sbjct: 337 APAPGVTSRPFLAFFAGGRHGHIRDLLLRHWKGRDPDFFPVYEHRHEDGFDYYSFMRRAR 396

Query: 216 FCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKI 275
           FCL   G++V + R+ +S++  CVPVI+++ Y LPFADVL W++FS+ V   DIP L+++
Sbjct: 397 FCLCPSGYQVASPRVVESIHAECVPVILSDGYVLPFADVLRWEAFSVAVPVADIPRLREV 456

Query: 276 LKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
           L+ I + E   LQ  V  V++HF  +  P   D F M+++ +WLRR ++R+
Sbjct: 457 LERIPAPEVEKLQQGVRLVKRHFMLNQPPQRLDMFNMILHSVWLRRLNLRL 507


>gi|296084508|emb|CBI25529.3| unnamed protein product [Vitis vinifera]
          Length = 388

 Score =  182 bits (463), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 106/294 (36%), Positives = 175/294 (59%), Gaps = 14/294 (4%)

Query: 46  SHFVTKDPSKADLFFLPFSIARMRH---DRRIGTEG--IPDFISHYIFNISQKYPYWNRT 100
           SHF+   P  A +F++P S+ R+ H      +   G  +   ++ YI+ +S KYPYWNR+
Sbjct: 96  SHFMAGHPDVAHVFYIPISVTRIAHYIYSPPVDYSGHMLQRLVTDYIYVVSNKYPYWNRS 155

Query: 101 GGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQE- 158
            GADHF V+CH           ++  + I+V+C+++         +D+SLP++  P+ + 
Sbjct: 156 NGADHFLVSCHDWAPEISIVTPDLYKHFIRVLCNANTSERFQPI-RDISLPEVNIPKGKL 214

Query: 159 DPPKLGS--SKRNKLAFFAGAVNSPVREKLLQVWR-NDSEI--YAHSGRLKTPYADELLG 213
            PP L    ++R+ LAFF+G  +  +R  L + W+ ND E+  Y H    +  YA  ++ 
Sbjct: 215 GPPHLDKPPNQRHILAFFSGRESGYMRTLLFRSWKENDDEVQVYEHLPSNRD-YAKSMVD 273

Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLK 273
           SKFCL   G+EV + R+ +++  GCVPVII ++Y LPF++VL+W  FSI + +  IP +K
Sbjct: 274 SKFCLCPSGWEVASPRVVEAIAAGCVPVIICDYYVLPFSEVLDWSKFSINITSDKIPEIK 333

Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
           KILK + +E YL +Q  V +V++HF  +     YD  +M+++ +WLRR +VR++
Sbjct: 334 KILKAVPNERYLRMQKRVKQVQRHFVINRPAQPYDMLHMILHSVWLRRLNVRLR 387


>gi|357478275|ref|XP_003609423.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
 gi|355510478|gb|AES91620.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
          Length = 398

 Score =  182 bits (463), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 108/297 (36%), Positives = 171/297 (57%), Gaps = 15/297 (5%)

Query: 45  KSHFVTKDPSKADLFFLPFSIARM----RHDRRIGTEGIPD----FISHYIFNISQKYPY 96
           KS F    P  A +FFLPFS++++       R+  ++  P      +  YI  ++ KYPY
Sbjct: 96  KSPFKATHPELAHVFFLPFSVSKVIRYVYKPRKSRSDYNPHRLQLLVEDYIKIVANKYPY 155

Query: 97  WNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WP 155
           WN + GADHF ++CH  G        ++  + I+ +C+++    G   ++DVS+PQ+  P
Sbjct: 156 WNISQGADHFLLSCHDWGPRVSYANPKLFKHFIRALCNANTS-EGFWPNRDVSIPQLNLP 214

Query: 156 RQE-DPPKLGSSKRNK--LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADEL 211
             +  PP       N+  L FFAG  +  +R+KLL+ W++ D E+  H    K     +L
Sbjct: 215 VGKLGPPNTDQHPNNRTILTFFAGGAHGKIRKKLLKSWKDKDEEVQVHEYLPKGQDYTKL 274

Query: 212 LG-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIP 270
           +G SKFCL   G EV + R+ +++Y GCVPVII ++Y LPF+DVLNW  FS+ +A   IP
Sbjct: 275 MGLSKFCLCPSGHEVASPRVVEAIYAGCVPVIICDNYSLPFSDVLNWSQFSMEIAVDRIP 334

Query: 271 LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
            +K IL+ I+  +Y +L S V +VRKHF+ +     +D  +M+++ +WLRR + R+ 
Sbjct: 335 EIKTILQNITETKYRVLYSNVRRVRKHFEMNRPAKPFDLIHMILHSVWLRRLNFRLH 391


>gi|413937765|gb|AFW72316.1| hypothetical protein ZEAMMB73_504926 [Zea mays]
          Length = 472

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 114/346 (32%), Positives = 180/346 (52%), Gaps = 29/346 (8%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFM--KSHFVTKDPSKADL 58
           M R F+++ Y      P A++    D      Y+ E  F +     ++ F  +DP +A  
Sbjct: 138 MERKFKIWTY-REGEPPLAHLGPSADI-----YSIEGQFLEEIEDPRNPFAARDPGEAHA 191

Query: 59  FFLPFSIARMRH-DRRIGTEG----IPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
           F LP S+  + H   R+ T      +   ++ Y+  ++ KYPYWNR+ GADH  V+CH  
Sbjct: 192 FLLPVSVCNLVHYIYRLNTTAYMAHMRRALADYVDVVAHKYPYWNRSRGADHVIVSCHDW 251

Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI--------WPRQEDPPKLGS 165
                +   E+  NAI+V+C+++    G    KD +LP++         P    PP+   
Sbjct: 252 APLVSEANRELYANAIRVLCNANTS-EGFRPRKDATLPEVNLADGLLRRPTLGLPPE--- 307

Query: 166 SKRNKLAFFAGAVNSPVREKLLQVW--RNDSEIYAHS-GRLKTPYADELLGSKFCLHVKG 222
             R  LAFFAG ++  +R  LL  W  R D ++  H        Y   +  ++FCL   G
Sbjct: 308 -NRTTLAFFAGGMHGHIRRALLGYWLGRKDPDMDIHEYLPAGQDYHALMARARFCLCPSG 366

Query: 223 FEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSE 282
           FEV + R+ +S++ GCVPVII++ Y  PF+DVL+W   S+ V    IP LK +LKG+S  
Sbjct: 367 FEVASPRVVESVFTGCVPVIISDGYPPPFSDVLDWSKMSVTVPPARIPELKAVLKGVSER 426

Query: 283 EYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQW 328
            Y +L++ VL+ ++HF  H     +D   MV++ +WLRR +VR+ +
Sbjct: 427 RYRVLRARVLQAQRHFVVHRPARRFDMIRMVLHSIWLRRINVRLPY 472


>gi|255582307|ref|XP_002531944.1| catalytic, putative [Ricinus communis]
 gi|223528390|gb|EEF30426.1| catalytic, putative [Ricinus communis]
          Length = 363

 Score =  182 bits (462), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 115/347 (33%), Positives = 184/347 (53%), Gaps = 62/347 (17%)

Query: 33  YASESYFKKAFM--KSHFVTKDPSKADLFFLPFSIARM---------RHDRRIGTEGIPD 81
           YA E  F       +S F+ +   +A +FFLP S+A +          +DR    + +  
Sbjct: 26  YAIEGQFIDEMESGESPFIARHADEAHVFFLPVSVAHIVEYIYLPITSYDR----DKLIR 81

Query: 82  FISHYIFNISQKYPYWNRTGGADHFYVACH------------------SIGRSAM----- 118
               Y+  ++ KYP+WNR+ G+DHF ++CH                  ++GR+       
Sbjct: 82  VFKDYVKVVADKYPFWNRSSGSDHFMLSCHDWAMVHINSIPVELVRAFALGRNLKFEISF 141

Query: 119 ---------QKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPK------- 162
                    Q   E+  N I+V+C+++    G +  +DV+LP++      PP        
Sbjct: 142 HSHRAPEISQHDPELYKNLIRVICNANTS-EGFLPTRDVTLPEL----NIPPGGFDHVHH 196

Query: 163 -LGSSKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELLG-SKFCLH 219
            L S KR  LAFFAG  +  +R+ LL  W+N D E+  H    K     +L+G SKFCL 
Sbjct: 197 CLPSHKRRILAFFAGGAHGYIRKILLHHWKNKDDEVQVHEYLSKDEDYRKLMGQSKFCLC 256

Query: 220 VKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGI 279
             G+EV + RI +S+Y GC+PVII++HY+LPF+DVL+W   S+ +    IP +K ILKG+
Sbjct: 257 PSGYEVASPRIVESIYAGCIPVIISDHYNLPFSDVLDWSQISVQIPVEKIPEIKTILKGV 316

Query: 280 SSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
           S+++YL +Q  V +V++HF+ +     +D  +MV++ +WLRR  +R+
Sbjct: 317 SNDKYLRMQKRVRRVQRHFEINRPSKPFDVLHMVLHSVWLRRLDIRL 363


>gi|326517848|dbj|BAK03842.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 451

 Score =  181 bits (459), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 108/309 (34%), Positives = 168/309 (54%), Gaps = 32/309 (10%)

Query: 50  TKDPSKADLFFLPFSIARMRH--DRRIGTEGIP--DFISHYIFNISQKYPYWNRTGGADH 105
           T+D  +A  FFLPFS+A+M     R++  +  P    +  Y+  ++ ++P+WNR+ GADH
Sbjct: 143 TRDADRAHAFFLPFSVAQMMQFAYRQLSYDRGPLLSLVGDYVRVVASRHPFWNRSAGADH 202

Query: 106 FYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI------WPRQED 159
           F ++CH  G  A +   E+  N I+ +C+++    G    KDVS+P+I       PRQ  
Sbjct: 203 FMLSCHDWGPDASKGDPELYANGIRALCNANT-SEGFRPGKDVSIPEINLYDGDTPRQLL 261

Query: 160 PPKLGSSKRNKLAFFAGAVNSPVREKLLQVWR--------------------NDSEIYAH 199
            P  G S R  LAFFAG  +  VR+ LL+ W+                    N S  +  
Sbjct: 262 GPSPGLSARPYLAFFAGGRHGHVRDLLLRHWKGRDPATFPVYEYDIPSTTGGNSSGRHNR 321

Query: 200 SGR-LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWK 258
            GR  ++ Y   +  S+FCL   G EV + R+ ++++  CVPV+++  Y  PFADVL W+
Sbjct: 322 RGRDRQSDYFAYMHRSRFCLCPSGHEVASPRVVEAIHAECVPVLVSEGYAPPFADVLRWE 381

Query: 259 SFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELW 318
           SFS+ V  +DIP LK++L+GI   E   L+  V  V++HF     P   D F+M+++ +W
Sbjct: 382 SFSVSVPVVDIPRLKEVLEGIPMAEVERLREGVRLVKRHFTLRQPPERLDMFHMILHSVW 441

Query: 319 LRRSSVRVQ 327
           LRR + R+ 
Sbjct: 442 LRRLNFRLD 450


>gi|356507416|ref|XP_003522463.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
           At3g07620-like [Glycine max]
          Length = 471

 Score =  180 bits (457), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 117/335 (34%), Positives = 178/335 (53%), Gaps = 24/335 (7%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKS-HFVTKDPSKADLF 59
           M + F+++VY       F N L       +  YA+E  F     K  ++ T DP +A ++
Sbjct: 151 MEKVFKIFVYEEGEPPLFHNGL------SKDIYATEGRFIHEMEKGRYYRTYDPDEAFVY 204

Query: 60  FLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQ 119
           +LPFS         +  + +   +  YI  I+ K+P+WNR+ G DHF ++CH  G     
Sbjct: 205 YLPFS--------GVYVDPLGLVVKDYIQVIAHKHPFWNRSLGYDHFMLSCHDWGPLVSS 256

Query: 120 KAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS----SKRNKLAFFA 175
                  NAI+V+C+++    G    KDVS P+I   + +   L      S+R  LAFFA
Sbjct: 257 YVDHFYNNAIRVLCNAN-VSEGFKPAKDVSFPEIKLIKGEVTNLVGGYPPSQRTILAFFA 315

Query: 176 GAVNSPVREKLLQVWRN---DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIAD 232
           G  +  +R  L   W+N   D +IY         Y  +L  SKFCL   G+EV + R+  
Sbjct: 316 GHQHGYIRXLLQSTWKNKDQDMQIYEELPE-GISYYTKLRSSKFCLCPSGYEVASPRVVK 374

Query: 233 SLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVL 292
           +++  CVPV+I++ Y  PF+DVLNW SFS+ V   DIP +KKIL GIS  +YL +   V 
Sbjct: 375 AIFAECVPVLISDGYVPPFSDVLNWNSFSVQVDVKDIPNIKKILMGISERQYLRMYKRVK 434

Query: 293 KVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
           +V++HF  +  P  YD F+M ++ +WLRR ++ +Q
Sbjct: 435 QVQRHFVPNEPPKRYDMFHMTVHSIWLRRLNIHIQ 469


>gi|297818734|ref|XP_002877250.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323088|gb|EFH53509.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 422

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 103/293 (35%), Positives = 166/293 (56%), Gaps = 15/293 (5%)

Query: 48  FVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFIS---HYIFN-----ISQKYPYWNR 99
           F    P +A  FFLPFS+A + H          DF     H IFN     +++K+P+WN+
Sbjct: 129 FRASRPEEAHAFFLPFSVANIVHYVYQPITSPADFNRARLHRIFNDYVDVVARKHPFWNQ 188

Query: 100 TGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW--PRQ 157
           + GADHF V+CH           E   + ++ +C+++    G   + D+S+P+I    R+
Sbjct: 189 SNGADHFMVSCHDWAPDVADSKPEFFKDFMRGLCNANT-TEGFRPNIDISIPEINIPKRK 247

Query: 158 EDPPKLGSSKRNK--LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELLG- 213
             PP +G +  N+  LAFFAG  +  +RE L   W+  D ++  +    K     EL+G 
Sbjct: 248 LKPPFMGQTPENRTILAFFAGRAHGYIREVLFTHWKGKDKDVQVYDHLTKGQNYHELIGH 307

Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLK 273
           SKFCL   G+EV + R  +++Y GCVPV+I+++Y LPF DVL+W  FS+ +    IP +K
Sbjct: 308 SKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLPFNDVLDWSKFSVEIPVDKIPDIK 367

Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
           KIL+ I  ++Y+ +   V+KVR+HF  +     +D  +M+++ +WLRR ++R+
Sbjct: 368 KILQEIPHDKYIRMYQNVMKVRRHFVVNRPAQPFDVIHMILHSVWLRRLNIRL 420


>gi|297812207|ref|XP_002873987.1| hypothetical protein ARALYDRAFT_326421 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319824|gb|EFH50246.1| hypothetical protein ARALYDRAFT_326421 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 654

 Score =  178 bits (452), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 120/332 (36%), Positives = 179/332 (53%), Gaps = 21/332 (6%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMK--SHFVTKDPSKADL 58
           M + F+V+VY  R  D     + PV+      Y+ E  F        S F    P +A  
Sbjct: 320 MEKKFKVWVY--REGDTPLVHMGPVN----NIYSIEGQFMDEIETGMSPFAASGPEEAHA 373

Query: 59  FFLPFSIARMRHD--RRIGT---EGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
           F LP SIA + H   R + T   E +      Y+  ++ KYPYWNR+ GADHF+V+CH  
Sbjct: 374 FLLPVSIANVVHYLYRPLVTYSREQLHKVFLDYVNVVAHKYPYWNRSLGADHFFVSCHDW 433

Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPR-QEDPPKLGSSK---R 168
                    E+  N I+V+C+++    G +  +DVS+P+I  PR Q  PP+L  S    R
Sbjct: 434 APDVSGSNPEMMKNLIRVLCNANT-SEGFMPQRDVSIPEINIPRGQLGPPQLSRSSGHDR 492

Query: 169 NKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTP-YADELLGSKFCLHVKGFEVN 226
             LAFF+G  +  +R+ LLQ W++ D E+  H        Y   +  ++FCL   G+EV 
Sbjct: 493 PILAFFSGGSHGYIRKILLQHWKDKDEEVQVHEYLTNNKDYFKLMATARFCLCPSGYEVA 552

Query: 227 TARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLL 286
           + R+  ++  GCVPVII++HY LPF+DVL+W  F+I V +  IP +K ILK IS   Y +
Sbjct: 553 SPRVVAAINLGCVPVIISDHYALPFSDVLDWTKFTIHVPSEKIPEIKTILKSISWRRYKV 612

Query: 287 LQSYVLKVRKHFQWHVFPSDYDAFYMVMYELW 318
           LQ  VL+V++HF  +     +D   M+++ +W
Sbjct: 613 LQRRVLQVQRHFVINRPSQPFDMLRMLLHSVW 644


>gi|145339080|ref|NP_189804.4| putative glycosyltransferase [Arabidopsis thaliana]
 gi|292630777|sp|Q3EAR7.2|GLYT2_ARATH RecName: Full=Probable glycosyltransferase At3g42180
 gi|332644208|gb|AEE77729.1| putative glycosyltransferase [Arabidopsis thaliana]
          Length = 470

 Score =  178 bits (452), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 107/311 (34%), Positives = 168/311 (54%), Gaps = 15/311 (4%)

Query: 30  RGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFIS---HY 86
            G +  E  +        F    P +A  FFLPFS+A + H          DF     H 
Sbjct: 159 EGQFIDELSYVMGGPSGRFRASRPEEAHAFFLPFSVANIVHYVYQPITSPADFNRARLHR 218

Query: 87  IFN-----ISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISG 141
           IFN     ++ K+P+WN++ GADHF V+CH           E   N ++ +C+++    G
Sbjct: 219 IFNDYVDVVAHKHPFWNQSNGADHFMVSCHDWAPDVPDSKPEFFKNFMRGLCNANT-SEG 277

Query: 142 HIAHKDVSLPQIW--PRQEDPPKLGSSKRNK--LAFFAGAVNSPVREKLLQVWRN-DSEI 196
              + D S+P+I    R+  PP +G +  N+  LAFFAG  +  +RE L   W+  D ++
Sbjct: 278 FRRNIDFSIPEINIPKRKLKPPFMGQNPENRTILAFFAGRAHGYIREVLFSHWKGKDKDV 337

Query: 197 YAHSGRLKTPYADELLG-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
             +    K     EL+G SKFCL   G+EV + R  +++Y GCVPV+I+++Y LPF DVL
Sbjct: 338 QVYDHLTKGQNYHELIGHSKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLPFNDVL 397

Query: 256 NWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMY 315
           +W  FS+ +    IP +KKIL+ I  ++YL +   V+KVR+HF  +     +D  +M+++
Sbjct: 398 DWSKFSVEIPVDKIPDIKKILQEIPHDKYLRMYRNVMKVRRHFVVNRPAQPFDVIHMILH 457

Query: 316 ELWLRRSSVRV 326
            +WLRR ++R+
Sbjct: 458 SVWLRRLNIRL 468


>gi|109946595|gb|ABG48476.1| At3g42180 [Arabidopsis thaliana]
          Length = 425

 Score =  178 bits (452), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 107/311 (34%), Positives = 168/311 (54%), Gaps = 15/311 (4%)

Query: 30  RGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFIS---HY 86
            G +  E  +        F    P +A  FFLPFS+A + H          DF     H 
Sbjct: 114 EGQFIDELSYVMGGPSGRFRASRPEEAHAFFLPFSVANIVHYVYQPITSPADFNRARLHR 173

Query: 87  IFN-----ISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISG 141
           IFN     ++ K+P+WN++ GADHF V+CH           E   N ++ +C+++    G
Sbjct: 174 IFNDYVDVVAHKHPFWNQSNGADHFMVSCHDWAPDVPDSKPEFFKNFMRGLCNANT-SEG 232

Query: 142 HIAHKDVSLPQIW--PRQEDPPKLGSSKRNK--LAFFAGAVNSPVREKLLQVWRN-DSEI 196
              + D S+P+I    R+  PP +G +  N+  LAFFAG  +  +RE L   W+  D ++
Sbjct: 233 FRRNIDFSIPEINIPKRKLKPPFMGQNPENRTILAFFAGRAHGYIREVLFSHWKGKDKDV 292

Query: 197 YAHSGRLKTPYADELLG-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
             +    K     EL+G SKFCL   G+EV + R  +++Y GCVPV+I+++Y LPF DVL
Sbjct: 293 QVYDHLTKGQNYHELIGHSKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLPFNDVL 352

Query: 256 NWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMY 315
           +W  FS+ +    IP +KKIL+ I  ++YL +   V+KVR+HF  +     +D  +M+++
Sbjct: 353 DWSKFSVEIPVDKIPDIKKILQEIPHDKYLRMYRNVMKVRRHFVVNRPAQPFDVIHMILH 412

Query: 316 ELWLRRSSVRV 326
            +WLRR ++R+
Sbjct: 413 SVWLRRLNIRL 423


>gi|6735291|emb|CAB68119.1| putative protein [Arabidopsis thaliana]
          Length = 340

 Score =  178 bits (451), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 107/311 (34%), Positives = 168/311 (54%), Gaps = 15/311 (4%)

Query: 30  RGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFIS---HY 86
            G +  E  +        F    P +A  FFLPFS+A + H          DF     H 
Sbjct: 29  EGQFIDELSYVMGGPSGRFRASRPEEAHAFFLPFSVANIVHYVYQPITSPADFNRARLHR 88

Query: 87  IFN-----ISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISG 141
           IFN     ++ K+P+WN++ GADHF V+CH           E   N ++ +C+++    G
Sbjct: 89  IFNDYVDVVAHKHPFWNQSNGADHFMVSCHDWAPDVPDSKPEFFKNFMRGLCNANT-SEG 147

Query: 142 HIAHKDVSLPQI--WPRQEDPPKLGSSKRNK--LAFFAGAVNSPVREKLLQVWRN-DSEI 196
              + D S+P+I    R+  PP +G +  N+  LAFFAG  +  +RE L   W+  D ++
Sbjct: 148 FRRNIDFSIPEINIPKRKLKPPFMGQNPENRTILAFFAGRAHGYIREVLFSHWKGKDKDV 207

Query: 197 YAHSGRLKTPYADELLG-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
             +    K     EL+G SKFCL   G+EV + R  +++Y GCVPV+I+++Y LPF DVL
Sbjct: 208 QVYDHLTKGQNYHELIGHSKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLPFNDVL 267

Query: 256 NWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMY 315
           +W  FS+ +    IP +KKIL+ I  ++YL +   V+KVR+HF  +     +D  +M+++
Sbjct: 268 DWSKFSVEIPVDKIPDIKKILQEIPHDKYLRMYRNVMKVRRHFVVNRPAQPFDVIHMILH 327

Query: 316 ELWLRRSSVRV 326
            +WLRR ++R+
Sbjct: 328 SVWLRRLNIRL 338


>gi|90265152|emb|CAH67778.1| H0201G08.5 [Oryza sativa Indica Group]
 gi|125546968|gb|EAY92790.1| hypothetical protein OsI_14593 [Oryza sativa Indica Group]
          Length = 441

 Score =  178 bits (451), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 113/347 (32%), Positives = 182/347 (52%), Gaps = 26/347 (7%)

Query: 1   MNRSFRVYVY-----PHRRNDPFANVLLPVDFEPRGNYASE-SYFKKAFMKSHFVTKDPS 54
           M R F+VYVY     P     P  N+     +   G++  +      +       T DP+
Sbjct: 100 MERRFKVYVYEEGEPPILHEGPCKNI-----YTIEGSFIEQLELMSPSDAGGGVRTWDPT 154

Query: 55  KADLFFLPFSIARM-RHDRRIGTEGIPDF---ISHYIFNISQKYPYWNRTGGADHFYVAC 110
           +A  FFLPFS+++M +   R  ++  P     ++ Y+  ++ ++P+WNR+ GADHF ++C
Sbjct: 155 RAHAFFLPFSVSQMVKFVYRPPSQDRPPLRAIVADYVRVVAARHPFWNRSAGADHFMLSC 214

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI------WPRQEDPPKLG 164
           H  G  A +   E+  NAI+ +C+++    G    KDVS+P+I       PR+   P  G
Sbjct: 215 HDWGPYASRGQPELYTNAIRALCNANT-SEGFRPGKDVSVPEINLYDGDMPRELLAPAPG 273

Query: 165 SSKRNKLAFFAGAVNSPVREKLLQVWRNDSE----IYAHSGRLKTPYADELLGSKFCLHV 220
              R  LAFFAG  +  VR+ LL+ W+        +Y +       Y   +  ++FCL  
Sbjct: 274 LESRPLLAFFAGGRHGHVRDLLLRHWKGRDAATFPVYEYDLPAAGDYYSFMRRARFCLCP 333

Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
            G EV + R+ +++   CVPV+IA+ Y LPFADVL W++FS+ VA  DIP L++ L+ I 
Sbjct: 334 SGHEVASPRVVEAIQAECVPVVIADGYALPFADVLRWEAFSVAVAVGDIPRLRERLERIP 393

Query: 281 SEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
           + E   L+  V  V++H      P   D F M+++ +WLR  ++R+ 
Sbjct: 394 AAEVERLRRGVRLVKRHLMLQQPPRRLDMFNMILHSVWLRGLNLRLH 440


>gi|115456850|ref|NP_001052025.1| Os04g0109900 [Oryza sativa Japonica Group]
 gi|38345576|emb|CAE01775.2| OSJNBa0027H06.12 [Oryza sativa Japonica Group]
 gi|113563596|dbj|BAF13939.1| Os04g0109900 [Oryza sativa Japonica Group]
 gi|125589112|gb|EAZ29462.1| hypothetical protein OsJ_13536 [Oryza sativa Japonica Group]
          Length = 441

 Score =  177 bits (448), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 113/351 (32%), Positives = 180/351 (51%), Gaps = 34/351 (9%)

Query: 1   MNRSFRVYVY-----PHRRNDPFANVLLPVDFEPRGNYASE-SYFKKAFMKSHFVTKDPS 54
           M R F+VYVY     P     P  N+     +   G++  +      +       T DP+
Sbjct: 100 MERRFKVYVYEEGEPPILHEGPCKNI-----YTIEGSFIEQLELMSPSDAGGGVRTWDPT 154

Query: 55  KADLFFLPFSIARM--------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHF 106
           +A  FFLPFS+++M          DR      +   ++ Y+  ++ ++P+WNR+ GADHF
Sbjct: 155 RAHAFFLPFSVSQMVKFVYRPPSQDR----APLRAIVADYVRVVAARHPFWNRSAGADHF 210

Query: 107 YVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI------WPRQEDP 160
            ++CH  G  A +   E+  NAI+ +C+++    G    KDVS+P+I       PR+   
Sbjct: 211 MLSCHDWGPYASRGQPELYTNAIRALCNANT-SEGFRPGKDVSVPEINLYDGDMPRELLA 269

Query: 161 PKLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSE----IYAHSGRLKTPYADELLGSKF 216
           P  G   R  LAFFAG  +  VR+ LL+ W+        +Y +       Y   +  ++F
Sbjct: 270 PAPGLESRPLLAFFAGGRHGHVRDLLLRHWKGRDAATFPVYEYDLPAAGDYYSFMRRARF 329

Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKIL 276
           CL   G EV + R+ +++   CVPV+IA+ Y LPFADVL W++FS+ VA  DIP L++ L
Sbjct: 330 CLCPSGHEVASPRVVEAIQAECVPVVIADGYALPFADVLRWEAFSVAVAVGDIPRLRERL 389

Query: 277 KGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
           + I + E   L+  V  V++H      P   D F M+++ +WLR  ++R+ 
Sbjct: 390 ERIPAAEVERLRRGVRLVKRHLMLQQPPRRLDMFNMILHSVWLRGLNLRLH 440


>gi|242065760|ref|XP_002454169.1| hypothetical protein SORBIDRAFT_04g025885 [Sorghum bicolor]
 gi|241934000|gb|EES07145.1| hypothetical protein SORBIDRAFT_04g025885 [Sorghum bicolor]
          Length = 338

 Score =  177 bits (448), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 115/350 (32%), Positives = 180/350 (51%), Gaps = 34/350 (9%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLP-VDFEPRGNYASESYFKKAFM--KSHFVTKDPSKAD 57
           M R F+++ Y  +  +P    L P  D      Y+ E  F +     ++ F  + P +A 
Sbjct: 1   MERKFKIWTY--KEGEPPLTHLGPSADI-----YSIEGQFLEEIEDPRNPFAARHPGEAH 53

Query: 58  LFFLPFSIARM--------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVA 109
            F LP S+  +        R +       +   ++ Y+  ++ +YPYWNR+ GADH  V+
Sbjct: 54  AFLLPVSVCNLVQYIYPFYRRNTTAYMAHMRRALADYVDVVAGRYPYWNRSRGADHVMVS 113

Query: 110 CHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI--------WPRQEDPP 161
           CH       +   E+  NAI+V+C+++   S     KD +LP++         P    PP
Sbjct: 114 CHDWAPLVSEANGELYANAIRVLCNANTSESFR-PRKDATLPEVNLGDGLLRRPTFGMPP 172

Query: 162 KLGSSKRNKLAFFAGAVNSPVREKLLQVW--RNDSEIYAHSGRLKTPYADELLGS-KFCL 218
           +     R  LAFFAG ++  +R+ LL  W  R D ++  H    K      L+ S +FCL
Sbjct: 173 E----NRTTLAFFAGGMHGHIRKALLGYWLGRKDPDMDIHEYLPKGQDYHALMASARFCL 228

Query: 219 HVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKG 278
              GFEV + R+ +S++ GCVPVII++ Y  PF+DVL+W   S+ V    IP LK ILKG
Sbjct: 229 CPSGFEVASPRVVESVFSGCVPVIISDGYPPPFSDVLDWSKMSVTVPPARIPELKDILKG 288

Query: 279 ISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQW 328
           +S   Y +L++ VL+ ++HF  H     +D   MVM+ +WLRR +VR+ +
Sbjct: 289 VSERRYRVLRARVLQAQRHFVVHRPSQRFDMIRMVMHSIWLRRLNVRLPY 338


>gi|255567222|ref|XP_002524592.1| catalytic, putative [Ricinus communis]
 gi|223536145|gb|EEF37800.1| catalytic, putative [Ricinus communis]
          Length = 388

 Score =  176 bits (447), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 116/349 (33%), Positives = 183/349 (52%), Gaps = 42/349 (12%)

Query: 1   MNRSFRVYVY-----PHRRNDPFANVLLPVDFEPRGNYASESYFKKAF-MKSHFVTKDPS 54
           M + F+++VY     P     P  N+           Y+ E  F       + F T +P 
Sbjct: 57  MEKLFKIFVYEEGEPPLLHYGPCKNI-----------YSMEGLFLSLMETDTKFRTLNPD 105

Query: 55  KADLFFLPFSIARMRH--------DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHF 106
           +A +FFLPFS+  +          D+ +    + D    Y+  IS KY YWNR+ GADHF
Sbjct: 106 EAQVFFLPFSVVMIIEHLFHPIIRDKAVLERTVVD----YVRIISHKYTYWNRSLGADHF 161

Query: 107 YVACHSIGRSAMQKAWEVKLNAIQVVC---SSSYFISGHIAHKDVSLPQIWPRQEDPPKL 163
            ++CH  G  A     ++  N+I+V+C   +S YF       KD S P+I     +   L
Sbjct: 162 MLSCHDWGPRATWYERQLYFNSIRVLCNANTSEYFN----PKKDASFPEINLITGEIADL 217

Query: 164 GS----SKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFC 217
                 S R  LAFF+G ++  +R  L Q W+  D ++  +    +   Y + +  S++C
Sbjct: 218 TGGLPPSNRTILAFFSGKMHGKLRPLLFQHWKEKDKDVLVYETFPEGLSYQEMMKKSRYC 277

Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILK 277
           +   G EV + RI +++Y  CVPV+I+ +Y  PF+DVLNW+SFSI V+  DI  LK IL 
Sbjct: 278 ICPSGHEVASPRIVEAIYAECVPVLISQNYVFPFSDVLNWESFSIQVSVSDISNLKNILL 337

Query: 278 GISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
           GI  ++YL ++  V +V++HF  +  P  YD F+M+++ +WLRR +V++
Sbjct: 338 GIPEDQYLRMRERVKQVQQHFLINNPPKRYDVFHMIIHSIWLRRLNVKL 386


>gi|356565159|ref|XP_003550812.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
           At5g20260-like [Glycine max]
          Length = 488

 Score =  176 bits (447), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 101/293 (34%), Positives = 165/293 (56%), Gaps = 14/293 (4%)

Query: 48  FVTKDPSKADLFFLPFSIARM-----RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGG 102
           F+ + P +A +F LP S+ ++             + +      Y   I+ +YPYWNRT G
Sbjct: 196 FLARYPDEAHVFMLPISVTQIVRYVYNPLTTYSRDQLMRITVDYTNIIAHRYPYWNRTKG 255

Query: 103 ADHFYVACH--SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI----WPR 156
           ADHF  +CH  +   S  +   E+  N I+V+C+++    G    KDV +P++    +  
Sbjct: 256 ADHFLASCHDWAPDISREESGRELFKNIIRVLCNANTS-EGFKPEKDVPMPEMNLQGFKL 314

Query: 157 QEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELLG-S 214
               P    + R+ LAFFAG  +  +R+ LL+ W++ D E+  H    K      L+G S
Sbjct: 315 SSPIPGFDLNNRSILAFFAGGAHGRIRKILLEHWKDKDEEVQVHEYLPKGVDYQGLMGQS 374

Query: 215 KFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKK 274
           KFCL   G+EV + RI +S+  GCVPVI++++Y LPF+DVL+W  FS+ + +  I  +K 
Sbjct: 375 KFCLCPSGYEVASPRIVESINIGCVPVIVSDYYQLPFSDVLDWSKFSLHIPSRRIAEIKT 434

Query: 275 ILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
           ILK +   +YL LQ  V+KV++HF+ +     +D F+M+++ +WLRR ++R+ 
Sbjct: 435 ILKNVPHAKYLKLQKRVMKVQRHFELNRPAKPFDVFHMILHSIWLRRLNIRLH 487


>gi|42568129|ref|NP_198314.2| xylogalacturonan beta-1,3-xylosyltransferase [Arabidopsis thaliana]
 gi|292630939|sp|Q94AA9.2|XGD1_ARATH RecName: Full=Xylogalacturonan beta-1,3-xylosyltransferase;
           AltName: Full=Protein XYLOGALACTURONAN DEFICIENT 1
 gi|332006509|gb|AED93892.1| xylogalacturonan beta-1,3-xylosyltransferase [Arabidopsis thaliana]
          Length = 500

 Score =  175 bits (443), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 115/341 (33%), Positives = 183/341 (53%), Gaps = 18/341 (5%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           MNR F+V+ Y       F +  +   +   G +  E        +S F    P  A +FF
Sbjct: 161 MNR-FKVWTYTEGEVPLFHDGPVNDIYGIEGQFMDEMCVDGPKSRSRFRADRPENAHVFF 219

Query: 61  LPFSIARMRH---DRRIGTEG-----IPDFISHYIFNISQKYPYWNRTGGADHFYVACHS 112
           +PFS+A++ H         EG     +   I  Y+  ++ K+PYWNR+ G DHF V+CH 
Sbjct: 220 IPFSVAKVIHFVYKPITSVEGFSRARLHRLIEDYVDVVATKHPYWNRSQGGDHFMVSCHD 279

Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW-PRQE-DPPKLGSSKRNK 170
                +    ++    I+ +C+++    G   + DVS+P+I+ P+ +  P  LG S R +
Sbjct: 280 WAPDVIDGNPKLFEKFIRGLCNANTS-EGFRPNVDVSIPEIYLPKGKLGPSFLGKSPRVR 338

Query: 171 --LAFFAGAVNSPVREKLLQVWR---NDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEV 225
             LAFFAG  +  +R+ L Q W+   N+ ++Y      K  Y   +  SKFCL   G+EV
Sbjct: 339 SILAFFAGRSHGEIRKILFQHWKEMDNEVQVYDRLPPGKD-YTKTMGMSKFCLCPSGWEV 397

Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
            + R  +++Y GCVPVII+++Y LPF+DVLNW SFSI +    I  +K IL+ +S   YL
Sbjct: 398 ASPREVEAIYAGCVPVIISDNYSLPFSDVLNWDSFSIQIPVSRIKEIKTILQSVSLVRYL 457

Query: 286 LLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
            +   VL+V++HF  +     YD  +M+++ +WLRR ++R+
Sbjct: 458 KMYKRVLEVKQHFVLNRPAKPYDVMHMMLHSIWLRRLNLRL 498


>gi|15146187|gb|AAK83577.1| AT5g33290/F19N2_10 [Arabidopsis thaliana]
 gi|18252279|gb|AAL62020.1| AT5g33290/F19N2_10 [Arabidopsis thaliana]
          Length = 341

 Score =  174 bits (442), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 116/341 (34%), Positives = 184/341 (53%), Gaps = 18/341 (5%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           MNR F+V+ Y       F +  +   +   G +  E        +S F    P  A +FF
Sbjct: 2   MNR-FKVWTYTEGEVPLFHDGPVNDIYGIEGQFMDEMCVDGPKSRSRFRADRPENAHVFF 60

Query: 61  LPFSIARMRH---DRRIGTEG-----IPDFISHYIFNISQKYPYWNRTGGADHFYVACHS 112
           +PFS+A++ H         EG     +   I  Y+  ++ K+PYWNR+ G DHF V+CH 
Sbjct: 61  IPFSVAKVIHFVYKPITSVEGFSRARLHRLIEDYVDVVATKHPYWNRSQGGDHFMVSCHD 120

Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW-PRQE-DPPKLGSSKRNK 170
                +    ++    I+ +C+++    G   + DVS+P+I+ P+ +  P  LG S R +
Sbjct: 121 WAPDEIDGNPKLFEKFIRGLCNANT-SEGFRPNVDVSIPEIYLPKGKLGPSFLGKSPRVR 179

Query: 171 --LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRL--KTPYADELLGSKFCLHVKGFEV 225
             LAFFAG  +  +R+ L Q W+  D+E+  +  RL     Y   +  SKFCL   G+EV
Sbjct: 180 SILAFFAGRSHGEIRKILFQHWKEMDNEVQVYD-RLPPGKDYTKTMGMSKFCLCPSGWEV 238

Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
            + R  +++Y GCVPVII+++Y LPF+DVLNW SFSI +    I  +K IL+ +S   YL
Sbjct: 239 ASPREVEAIYAGCVPVIISDNYSLPFSDVLNWDSFSIQIPVSRIKEIKTILQSVSLVRYL 298

Query: 286 LLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
            +   VL+V++HF  +     YD  +M+++ +WLRR ++R+
Sbjct: 299 KMYKRVLEVKQHFVLNRPAKPYDVMHMMLHSIWLRRLNLRL 339


>gi|75223627|gb|ABA18110.1| exostosin family protein [Arabidopsis arenosa]
          Length = 340

 Score =  174 bits (441), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 103/293 (35%), Positives = 163/293 (55%), Gaps = 15/293 (5%)

Query: 48  FVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFIS---HYIFN-----ISQKYPYWNR 99
           F    P +A  FFLPFS+A + H          DF     H IFN     ++ K+P+WN+
Sbjct: 47  FRASRPEEAHAFFLPFSVANIVHYVYQPITSPADFNRARLHRIFNDYVDVVACKHPFWNQ 106

Query: 100 TGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI--WPRQ 157
           + GADHF V+CH           E   + ++ +C+++    G   + D S+P+I    R+
Sbjct: 107 SNGADHFMVSCHDWAPDVADSKPEFFKDFMRGLCNANT-TEGFRPNIDFSIPEINIPKRK 165

Query: 158 EDPPKLGSSKRNK--LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELLG- 213
             PP +G +  N+  LAFFAG  +  +RE L   W+  D ++  +    K     EL+G 
Sbjct: 166 LKPPFMGQTPENRTILAFFAGRAHGYIREVLFTHWKGKDKDVQVYDHLTKGQNYHELIGH 225

Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLK 273
           SKFCL   G+EV + R  +++Y GCVPV+I+++Y LPF DVL+W  FS+ +    IP +K
Sbjct: 226 SKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLPFKDVLDWSKFSVEIPVDKIPDIK 285

Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
           KIL+ I  ++Y  +   V+KVR+HF  +     +D  +M+++ +WLRR ++R+
Sbjct: 286 KILQEIPHDKYRRMYQNVMKVRRHFVVNRPAQPFDVIHMILHSVWLRRLNIRL 338


>gi|125540270|gb|EAY86665.1| hypothetical protein OsI_08049 [Oryza sativa Indica Group]
          Length = 482

 Score =  174 bits (441), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 110/359 (30%), Positives = 177/359 (49%), Gaps = 45/359 (12%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAF--MKSHFVTKDPSKADL 58
           M R F+++ Y      P A++    D      Y+ E  F       +S F  + P  A  
Sbjct: 138 MERRFKIWTY-REGEPPVAHIGPGTDI-----YSIEGQFMYEMDDPRSRFAARRPDDAHA 191

Query: 59  FFLPFSIARMRH-DRRIGTEG----IPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
           F LP S+  + H   R+   G    +   ++ Y+  ++++YPYWNR+ GADH  V+CH  
Sbjct: 192 FLLPISVCNLVHYVYRLNATGDLAPLRGLVADYVRVVAERYPYWNRSRGADHVIVSCHDW 251

Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI--------WPRQEDPPKLGS 165
                    ++  NAI+V+C+++    G    KD +LP++         P    PP+   
Sbjct: 252 APMVTSAHRQLYGNAIRVLCNANTS-EGFRPRKDATLPEVNLADGVLRRPTAGLPPE--- 307

Query: 166 SKRNKLAFFAGAVNSPVREKLLQVWRNDSEI----------------YAHSGRLKTPYAD 209
             R  LAFFAG  +  +RE LL+ W   ++                 Y  +G     Y  
Sbjct: 308 -NRTTLAFFAGGRHGHIRESLLRHWLIGNKGGAAADGDGDGDMRVHEYLPAG---EDYHA 363

Query: 210 ELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI 269
           ++  ++FCL   GFEV + R+ +S++ GCVPVII+  Y  PF DVL+W   S+ V    I
Sbjct: 364 QMAAARFCLCPSGFEVASPRVVESVFAGCVPVIISEGYPPPFGDVLDWGKMSVAVPAARI 423

Query: 270 PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQW 328
           P L+ IL+ +S   Y +L++ VL+ ++HF  H     +D  +MV++ +WLRR +VR+ +
Sbjct: 424 PELRAILRRVSERRYRVLRARVLQAQRHFVLHRPARRFDMIHMVLHSIWLRRLNVRLPY 482


>gi|357478261|ref|XP_003609416.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
 gi|355510471|gb|AES91613.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
          Length = 380

 Score =  174 bits (440), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 109/311 (35%), Positives = 168/311 (54%), Gaps = 20/311 (6%)

Query: 33  YASESYFKKAFMK--SHFVTKDPSKADLFFLPFSIARMRH-----DRRIGTEGIPDFISH 85
           Y  E +F        S F T +P +A +F LP S+  M H           + I      
Sbjct: 71  YGIEGHFMTEIENRLSPFSTHNPDEAHVFMLPLSVTNMVHYLYNPLTTYSRDQIMHVTID 130

Query: 86  YIFNISQKYPYWNRTGGADHFYVACH----SIGRSAMQKAWEVKLNAIQVVCSSSYFISG 141
           Y   I+ KY YWNR+ GADH  V+CH     I R +  K  E+  N I+V+C+++    G
Sbjct: 131 YTNIIAHKYRYWNRSKGADHLLVSCHDWAPEISRESSGK--ELFKNLIKVLCNANTS-EG 187

Query: 142 HIAHKDVSLPQI----WPRQEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRN-DSEI 196
               +DVS+P++    +      P   S+ R+ LAFFAG  +  +R+ LL  W+  D E+
Sbjct: 188 FDPKRDVSMPEMNLQGYKLSSPIPSKESNNRSILAFFAGGEHGMIRKTLLDQWKGKDKEV 247

Query: 197 YAHSGRLKTPYADELLG-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
             +    K     +L+G SKFCL   G+EV + R+ +S+  GCVPVI++++Y LPF+DVL
Sbjct: 248 LVYEYLPKKLKYFKLMGKSKFCLCPSGYEVASPRLVESINTGCVPVIVSDNYQLPFSDVL 307

Query: 256 NWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMY 315
           +W  FS+ + +  I  +K ILK +    YL L   VLKV++HF  +     +D F+M+++
Sbjct: 308 DWSKFSLHIPSKRISEIKTILKSVPHARYLKLHRRVLKVQRHFVLNPPAKPFDVFHMILH 367

Query: 316 ELWLRRSSVRV 326
            +WLRR ++R+
Sbjct: 368 SIWLRRLNIRL 378


>gi|443732873|gb|ELU17436.1| hypothetical protein CAPTEDRAFT_228334 [Capitella teleta]
          Length = 506

 Score =  173 bits (439), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 108/339 (31%), Positives = 171/339 (50%), Gaps = 21/339 (6%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M +  ++Y+Y  +       +   +++     Y  E  F      S ++T+DPS+A  FF
Sbjct: 173 MTKELKIYMYTTK-------IDAHINYVNDWKYGVEELFIHLLKSSPYITQDPSEATFFF 225

Query: 61  LPFSIARMRH---DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSA 117
           LPF     R    DR        + +S  ++ I   Y +W+RT GADHFYV  H  G + 
Sbjct: 226 LPFRCFAYRKTISDRDRAQRFTEEMVSKILYEIKSNYSFWDRTLGADHFYVCAHDFGPAI 285

Query: 118 MQKAWE-VKLNAIQVVCSSSYFISGHIAHKDVSLP--------QIWPRQEDPPKLGSSKR 168
           +  +   +  NAI +V ++ Y    ++ HKD+SLP         +    +    L  S R
Sbjct: 286 VAGSDPFLHKNAIAMVNTADYEHIYYVPHKDISLPPHPSHGKNSLANIGKGGHGLNPSDR 345

Query: 169 NKLAFFAGAVN-SPVREKLLQVWRNDSEIYAHSGRLKTP-YADELLGSKFCLHVKGFEVN 226
             LAF+AG ++   +R  +   W  D +     G L    Y   L  SKFCL ++G E  
Sbjct: 346 TVLAFYAGNLDRGRIRPSIKDFWSTDIDFRIFMGHLTDERYQHYLKTSKFCLILRGNEAW 405

Query: 227 TARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLL 286
           +  + D++++GCVPVII+++YDLP   +L+W  F++V+    +  LK+IL  +S ++   
Sbjct: 406 SPCLMDAIWFGCVPVIISDYYDLPLHGMLDWNQFAVVIRESKVKSLKEILLAVSPQKLTS 465

Query: 287 LQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           +Q  + +V  HF W+  P  YDAF  VMY+LW RR  VR
Sbjct: 466 MQEKLKQVYGHFVWNDPPRPYDAFQSVMYQLWKRRGVVR 504


>gi|47497631|dbj|BAD19700.1| exostosin-like protein [Oryza sativa Japonica Group]
          Length = 345

 Score =  173 bits (439), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 110/359 (30%), Positives = 177/359 (49%), Gaps = 45/359 (12%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAF--MKSHFVTKDPSKADL 58
           M R F+++ Y      P A++    D      Y+ E  F       +S F  + P  A  
Sbjct: 1   MERRFKIWTY-REGEPPVAHIGPGTDI-----YSIEGQFMYEMDDPRSRFAARRPDDAHA 54

Query: 59  FFLPFSIARMRH-DRRIGTEG----IPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
           F LP S+  + H   R+   G    +   ++ Y+  ++++YPYWNR+ GADH  V+CH  
Sbjct: 55  FLLPISVCNLVHYVYRLNATGDLAPLRGLVADYVRVVAERYPYWNRSRGADHVIVSCHDW 114

Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI--------WPRQEDPPKLGS 165
                    ++  NAI+V+C+++    G    KD +LP++         P    PP+   
Sbjct: 115 APMVTSAHRQLYGNAIRVLCNANTS-EGFRPRKDATLPEVNLADGVLRRPTAGLPPE--- 170

Query: 166 SKRNKLAFFAGAVNSPVREKLLQVWRNDSEI----------------YAHSGRLKTPYAD 209
             R  LAFFAG  +  +RE LL+ W   ++                 Y  +G     Y  
Sbjct: 171 -NRTTLAFFAGGRHGHIRESLLRHWLIGNKGGAAADGDGDGDMRVHEYLPAG---EDYHA 226

Query: 210 ELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI 269
           ++  ++FCL   GFEV + R+ +S++ GCVPVII+  Y  PF DVL+W   S+ V    I
Sbjct: 227 QMAAARFCLCPSGFEVASPRVVESVFAGCVPVIISEGYPPPFGDVLDWGKMSVAVPAARI 286

Query: 270 PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQW 328
           P L+ IL+ +S   Y +L++ VL+ ++HF  H     +D  +MV++ +WLRR +VR+ +
Sbjct: 287 PELRAILRRVSERRYRVLRARVLQAQRHFVLHRPARRFDMIHMVLHSIWLRRLNVRLPY 345


>gi|297805042|ref|XP_002870405.1| hypothetical protein ARALYDRAFT_493575 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297316241|gb|EFH46664.1| hypothetical protein ARALYDRAFT_493575 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 509

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 115/341 (33%), Positives = 184/341 (53%), Gaps = 18/341 (5%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           MNR F+V+ Y       F +  +   +   G +  E        +S F    P  A +FF
Sbjct: 170 MNR-FKVWTYTEGEVPLFHDGPVNDIYGIEGQFMDEMCVDGPKSRSRFRADHPEDAHVFF 228

Query: 61  LPFSIARMRH---DRRIGTEG-----IPDFISHYIFNISQKYPYWNRTGGADHFYVACHS 112
           +PFS+A++ H         EG     +   I  Y+  ++ K+ YWNR+ G DHF V+CH 
Sbjct: 229 IPFSVAKVIHFVYKPITSVEGFSRARLHRLIEDYVDVVATKHRYWNRSKGGDHFMVSCHD 288

Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW-PRQE-DPPKLGSSKRNK 170
                +    ++    I+ +C+++    G   + DVS+P+I+ P+ +  P  LG S R +
Sbjct: 289 WAPDVIDGNPKLFEKFIRALCNANT-SEGFRPNVDVSIPEIYLPKGKLGPSFLGKSPRIR 347

Query: 171 --LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK--TPYADELLGSKFCLHVKGFEV 225
             LAFFAG  +  +R+ L + W+  D+E+  +  RL     Y   +  SKFCL   G+EV
Sbjct: 348 SILAFFAGRSHGEIRKILFKHWKEMDNEVQVYD-RLPPGKDYTKTMGMSKFCLCPSGWEV 406

Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
            + R  +++Y GCVPVII+++Y LPF+DVLNW SFSI +    IP +K IL+ +S   YL
Sbjct: 407 ASPREVEAIYAGCVPVIISDNYSLPFSDVLNWDSFSIQIPVSRIPEIKTILQSVSLVRYL 466

Query: 286 LLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
            +   VL+V++HF  +     YD  +M+++ +WLRR ++R+
Sbjct: 467 KMYKRVLEVKQHFVLNRPAKPYDVMHMMLHSIWLRRLNLRL 507


>gi|26451681|dbj|BAC42936.1| unknown protein [Arabidopsis thaliana]
          Length = 270

 Score =  173 bits (438), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 98/252 (38%), Positives = 148/252 (58%), Gaps = 15/252 (5%)

Query: 81  DFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS 140
            F+ +Y+  IS KY +WN+TGG+DHF VACH    S  +   +     I+ +C+S     
Sbjct: 19  QFLKNYLDMISSKYSFWNKTGGSDHFLVACHDWAPSETR---QYMAKCIRALCNSD-VSE 74

Query: 141 GHIAHKDVSLPQ---IWPRQEDPPKLGS--SKRNKLAFFAGAVNSPVREKLLQVWRN--- 192
           G +  KDV+LP+   + PR+      G   S+R  LAFFAG ++  +R  LLQ W     
Sbjct: 75  GFVFGKDVALPETTILVPRRPLRALGGKPVSQRQILAFFAGGMHGYLRPLLLQNWGGNRD 134

Query: 193 -DSEIYAH--SGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
            D +I++     + K  Y + +  SK+C+  KG EVN+ R+ ++L+Y CVPVII++++  
Sbjct: 135 PDMKIFSEIPKSKGKKSYMEYMKSSKYCICPKGHEVNSPRVVEALFYECVPVIISDNFVP 194

Query: 250 PFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDA 309
           PF +VLNW+SF++ V   DIP LK IL  I+ E Y  +Q  V  V+KHF WH  P  +D 
Sbjct: 195 PFFEVLNWESFAVFVLEKDIPDLKNILVSITEERYREMQMRVKMVQKHFLWHSKPERFDI 254

Query: 310 FYMVMYELWLRR 321
           F+M+++ +W  R
Sbjct: 255 FHMILHSIWYNR 266


>gi|356527767|ref|XP_003532479.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
           At5g03795-like [Glycine max]
          Length = 467

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 106/288 (36%), Positives = 160/288 (55%), Gaps = 20/288 (6%)

Query: 49  VTKDPSKADLFFLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGAD 104
           + KDP+KA LF++PFS   + H   +        +  F+  Y   IS K PY+NRTGGAD
Sbjct: 178 IMKDPAKAHLFYMPFSSRMLEHSLYVRNSHNRTNLRQFLKDYTDKISAKIPYFNRTGGAD 237

Query: 105 HFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPK-- 162
           HF  ACH       +   E     I+ +C++     G    +DVSLP+ + R    P+  
Sbjct: 238 HFLAACHDWAPYETRHHMEY---CIKALCNAD-VTQGFKIGRDVSLPEAYVRSVRDPQRD 293

Query: 163 LGSSKRNK---LAFFAGAVNSPVREKLLQVWRN---DSEIYA---HSGRLKTPYADELLG 213
           LG    ++   LAF AG ++  +   LL+ W++   D +IY    H    K  Y + +  
Sbjct: 294 LGGKPPHQXPILAFHAGNMHGYLHPILLKHWKDKDPDMKIYGPMPHGVTSKMNYINHMKN 353

Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLK 273
           SK+C+  KG+EVN+ R+ ++++  CVPVII++++   F +VLNW  FSI++A  DIP LK
Sbjct: 354 SKYCICPKGYEVNSPRMVEAIFXECVPVIISDNFVPHFFEVLNWDVFSIILAEKDIPNLK 413

Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
           +IL  +   +YL LQ  V K +KHF WHV P  YD F++ ++ +W  R
Sbjct: 414 QILLSV-XHKYLELQLGVRKAQKHFFWHVKPLKYDLFHITLHLIWYNR 460


>gi|75214755|gb|ABA18105.1| exostosin family protein [Capsella rubella]
          Length = 423

 Score =  172 bits (435), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 104/295 (35%), Positives = 165/295 (55%), Gaps = 15/295 (5%)

Query: 46  SHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFIS---HYIFN-----ISQKYPYW 97
           S F    P +A +FFLPFS+A + H          DF     H IFN     +++KYP+W
Sbjct: 128 SRFRAVRPEEAHVFFLPFSVANIVHYVYKPIASPADFNRARLHRIFNDYVDVVARKYPFW 187

Query: 98  NRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPR 156
            ++ GADHF V+CH           E   + ++ +C+++    G     D S+P+I  P+
Sbjct: 188 KQSNGADHFMVSCHDWAPDVPDSKPEFFKDFMRGLCNANT-SEGFKPSIDFSIPEINIPK 246

Query: 157 QE-DPPKLGSSKRNK--LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELL 212
            +  PP +G +  N+  LAFFAG  +  +RE L   W+  D ++  +    K     EL 
Sbjct: 247 GKLKPPFMGQNPENRTILAFFAGRAHGYIREVLFTHWKGKDKDVQVYDHLTKGQNYHELT 306

Query: 213 G-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPL 271
           G SKFCL   G+EV + R  +++Y GCVPV+I+++Y LPF+DVL+W  FS+ +    IP 
Sbjct: 307 GHSKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLPFSDVLDWSKFSVEIPVDRIPD 366

Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
           +K IL+ I  ++Y+ +   VLKVRKHF  +     +D  +M+++ +WLRR ++++
Sbjct: 367 IKNILQEIPHDKYIRMYQNVLKVRKHFVVNRPAQPFDVIHMILHSVWLRRLNIKL 421


>gi|443705810|gb|ELU02170.1| hypothetical protein CAPTEDRAFT_209273 [Capitella teleta]
          Length = 473

 Score =  171 bits (434), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 100/273 (36%), Positives = 155/273 (56%), Gaps = 11/273 (4%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRH---DRRIGTEGIPDFISHYIFN 89
           Y  ES F K   KS FVTKD  +A  FFLPF  A  R+   DR        + +S+ + +
Sbjct: 201 YGVESLFIKLLSKSSFVTKDAEEAHFFFLPFQCATYRNVIRDRAAAQNFTENLVSNILKD 260

Query: 90  ISQKYPYWNRTGGADHFYVACHSIGRSAMQKA-WEVKLNAIQVVCSSSYFISGHIAHKDV 148
           IS +Y YW+R+ GADHFYV  H +G S++  A   ++ NAI +V ++ Y    ++ HKD+
Sbjct: 261 ISSRYTYWDRSLGADHFYVCAHDMGASSVAAADANLQKNAIALVNTADYADPFYVPHKDI 320

Query: 149 SLPQIWPRQEDP-PKLG-----SSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGR 202
           +LP      +   P +G     S++R  LAF+AG ++S     + + W NDS+I+ H G 
Sbjct: 321 ALPPHPAHGKGSLPDIGRGGGKSTERPNLAFYAGNLDSGQLRPVFKDWLNDSDIHIHHGH 380

Query: 203 LK-TPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFS 261
           +    Y   L  +KFCL  +G  V +  + D+++ GCVPVII+++YDLP   +++W  F+
Sbjct: 381 MSDNVYIKNLQSAKFCLVPRGHRVWSPVVMDAVWTGCVPVIISDYYDLPLHGLIDWTHFA 440

Query: 262 IVVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
           + +   ++  LK  LK I  E+   +QSY+ KV
Sbjct: 441 VFLKEKEVLSLKSKLKSIPEEKLRRMQSYIKKV 473


>gi|75214619|gb|ABA18091.1| exostosin family protein [Olimarabidopsis pumila]
          Length = 343

 Score =  171 bits (433), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 104/296 (35%), Positives = 165/296 (55%), Gaps = 15/296 (5%)

Query: 48  FVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFIS---HYIFN-----ISQKYPYWNR 99
           F    P +A  FFLPFS+A + H          DF     H IFN     +++K+P+WN+
Sbjct: 48  FRAGRPEEAHAFFLPFSVANIVHYVSQPIASPADFNRARLHRIFNDYVEVVARKHPFWNQ 107

Query: 100 TGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQE 158
           + GADHF V+CH           E   + I+ +C+++    G     D S+P+I  P+ +
Sbjct: 108 SNGADHFMVSCHDWAPDVAGSKPEFFKDFIRGLCNANT-SEGFRPSIDFSIPEINIPKGK 166

Query: 159 -DPPKLGSSKRNK--LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELLG- 213
             PP +G +  N+  LAFFAG  +  +RE L   W+  D ++  +    K     EL+G 
Sbjct: 167 LKPPFMGQTPENRTILAFFAGRAHGYIREVLFTHWKGKDKDVQVYDHLTKGQNYHELIGH 226

Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLK 273
           SKFCL   G+EV + R  +++Y GCVPV+I+++Y LPF DVL+W  FS+ +    IP +K
Sbjct: 227 SKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLPFNDVLDWSKFSVEIPVDKIPDIK 286

Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWS 329
           KIL+ I  E+Y+ +   V+KV +HF  +     +D  +M+++ +WLRR ++++  S
Sbjct: 287 KILQEIPHEKYIKMYHNVMKVGRHFVVNRPAQPFDVIHMILHSVWLRRLNIKLPSS 342


>gi|242094892|ref|XP_002437936.1| hypothetical protein SORBIDRAFT_10g005120 [Sorghum bicolor]
 gi|241916159|gb|EER89303.1| hypothetical protein SORBIDRAFT_10g005120 [Sorghum bicolor]
          Length = 412

 Score =  171 bits (432), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 101/295 (34%), Positives = 162/295 (54%), Gaps = 28/295 (9%)

Query: 50  TKDPSKADLFFLPFSIARM-----RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGAD 104
           T DP++A  FFLPFS+++M     R D    T  +   ++ Y+  ++ ++PYWNR+ GAD
Sbjct: 110 TSDPARAHAFFLPFSVSKMVQFAYRPDTYDKTP-LRAIVADYVRVVASRHPYWNRSAGAD 168

Query: 105 HFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPK-- 162
           HF ++CH  G  A +   E+  N I+ +C+++    G    +DVS+P+I     D P+  
Sbjct: 169 HFMLSCHDWGPEASRGHPELHANGIRALCNANT-SEGFRPGQDVSVPEINLYTGDMPRQL 227

Query: 163 -----LGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSE----IYAHS-----GRLKTPYA 208
                   + R  LAFFAG  +  VR+ LL+ W+        +Y H      GR +   A
Sbjct: 228 LAPPAPPLASRPFLAFFAGGRHGHVRDLLLRRWKGHDPDVFPVYEHEHEHSHGRRQQDGA 287

Query: 209 D-----ELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIV 263
                  +  ++FCL   G+EV + R+ ++++  CVPVI+++ Y LPFADVL W++FS+ 
Sbjct: 288 PLDYYWYMRRARFCLCPSGYEVASPRVVEAIHAECVPVILSDGYALPFADVLRWEAFSVA 347

Query: 264 VATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELW 318
           V   DIP L+++L+ I + E   LQ  V  V++HF  H  P   D F M+++ +W
Sbjct: 348 VPVADIPRLREVLERIPAPEVERLQRGVRLVKRHFMLHQPPERLDMFNMILHSVW 402


>gi|255550902|ref|XP_002516499.1| catalytic, putative [Ricinus communis]
 gi|223544319|gb|EEF45840.1| catalytic, putative [Ricinus communis]
          Length = 601

 Score =  170 bits (430), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 109/332 (32%), Positives = 160/332 (48%), Gaps = 69/332 (20%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEP--RGNYASESYFKKAFMKSH-FVTKDPSKAD 57
           M  + +VY+Y             P+  +P  +G YASE +F K    S  FVTK   KA 
Sbjct: 321 MEENLKVYIYKEGEK--------PILHQPVLKGIYASEGWFMKQLEASKKFVTKKSRKAH 372

Query: 58  LFFLPFSIARMR--------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVA 109
           LF+LPFS   +         H R+     +  ++ +Y+  I  KYP+WNRT G DHF VA
Sbjct: 373 LFYLPFSSRNLELQLYVPDSHSRK----NLIKYLKNYLDLIVAKYPFWNRTEGVDHFLVA 428

Query: 110 CHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRN 169
           CH    S  +   ++  N I+ +C++     G I  KD SLP+                 
Sbjct: 429 CHDWAASETE---QLMFNCIRALCNAD-VKEGFIFGKDASLPE----------------- 467

Query: 170 KLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTAR 229
                    NS  +                  R K  Y   +  S++C+  +G+EVN+ R
Sbjct: 468 --------TNSDAK-----------------ARGKMNYVQHMKSSRYCICARGYEVNSPR 502

Query: 230 IADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQS 289
           I +++ Y CVPVII+++Y  PF +VLNW+SF++ V   DIP LK IL  I  + Y  +Q 
Sbjct: 503 IVEAILYECVPVIISDNYVPPFFEVLNWESFAVFVLEKDIPNLKNILLSIPEKRYREMQM 562

Query: 290 YVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
            V  V++HF WH  P  YD F+M+++ +W  R
Sbjct: 563 RVKMVQQHFLWHARPVKYDLFHMILHSVWYNR 594


>gi|302814814|ref|XP_002989090.1| hypothetical protein SELMODRAFT_447545 [Selaginella moellendorffii]
 gi|300143191|gb|EFJ09884.1| hypothetical protein SELMODRAFT_447545 [Selaginella moellendorffii]
          Length = 1522

 Score =  169 bits (428), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 106/321 (33%), Positives = 166/321 (51%), Gaps = 17/321 (5%)

Query: 5   FRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFS 64
           F+VYVYP  +N    ++    D +     + +  F  + + S FVT DP  ADLF LP S
Sbjct: 56  FKVYVYPMIQNASAPDLR---DGKAARPGSIDRVFVDSLLASGFVTDDPEAADLFLLPAS 112

Query: 65  IARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEV 124
           I+ +   +R   +GI   +  YI  +   YPYW R+ GADHF+V+CH I     +   E+
Sbjct: 113 ISAI-WKKRPDPKGIAHSLKSYIQQLRDLYPYWQRSLGADHFFVSCHDITSDWSRNVLEL 171

Query: 125 KLNAIQVVC--SSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAVNSPV 182
           K NAIQ+ C   + +     +AHKD+++P       DPP+    +R  LA +  +     
Sbjct: 172 KKNAIQIACFPLARHGAQEFLAHKDITMPPAG-GSIDPPQ---RRRWNLAVYDSSSQGYA 227

Query: 183 REKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVI 242
              +   W++D    A + ++       L+ ++FCL +   + +   +  ++  GC+PVI
Sbjct: 228 ASDVPASWKSDESFVAGAVKMDLQL---LVTTRFCLSLGSSDRHL--VIPAVRSGCIPVI 282

Query: 243 IANHY--DLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
            +     DLPF D+L+W SF+IV++   +   K IL+ I  E+   LQ    +  KH +W
Sbjct: 283 FSAGKLSDLPFQDILDWNSFAIVLSRDQLHQTKAILESIDEEKLSRLQENGARAAKHMEW 342

Query: 301 HVFPSDYDAFYMVMYELWLRR 321
           H  P   DAFYMV+Y+LW RR
Sbjct: 343 HSPPQPEDAFYMVLYQLWRRR 363


>gi|302804107|ref|XP_002983806.1| hypothetical protein SELMODRAFT_445668 [Selaginella moellendorffii]
 gi|300148643|gb|EFJ15302.1| hypothetical protein SELMODRAFT_445668 [Selaginella moellendorffii]
          Length = 1068

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 107/321 (33%), Positives = 165/321 (51%), Gaps = 17/321 (5%)

Query: 5   FRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFS 64
           F+VYVYP  +N    ++    D +     + +  F  + + S FVT DP  ADLF LP S
Sbjct: 56  FKVYVYPMIQNASAPDLR---DGKAARPGSIDRVFVDSLLASGFVTDDPEAADLFLLPAS 112

Query: 65  IARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEV 124
           I+ +   +R   +GI   +  YI  +   YPYW R+ GADHF+V+CH I     +   E+
Sbjct: 113 ISAI-WKKRPDPKGIAHSLKSYIQQLRDLYPYWQRSLGADHFFVSCHDITSDWSRNVLEL 171

Query: 125 KLNAIQVVC--SSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAVNSPV 182
           K NAIQ+ C   + +     +AHKD+++P       DPP+    +R  LA +  +     
Sbjct: 172 KKNAIQIACFPLARHGAQEFLAHKDITMPPAG-GSIDPPQ---RRRWNLAVYDSSSQGYA 227

Query: 183 REKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVI 242
              +   W++D    A +  L       L+ ++FCL +   + +   +  ++  GC+PVI
Sbjct: 228 ARDVPASWKSDESFVAGAVALDLQL---LVTTRFCLSLGSSDRHL--VIPAVRSGCIPVI 282

Query: 243 IANH--YDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
            +     DLPF D+L+W SF+IV++   +   K IL+ I  E+   LQ    +  KH +W
Sbjct: 283 FSAGKLSDLPFQDILDWNSFAIVLSRDQLHQTKGILESIDEEKRSRLQENGARAAKHMEW 342

Query: 301 HVFPSDYDAFYMVMYELWLRR 321
           H  P   DAFYMV+Y+LW RR
Sbjct: 343 HSPPQPEDAFYMVLYQLWRRR 363


>gi|168000699|ref|XP_001753053.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162695752|gb|EDQ82094.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 471

 Score =  167 bits (423), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 99/302 (32%), Positives = 159/302 (52%), Gaps = 15/302 (4%)

Query: 34  ASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQK 93
           ++   F +   +S FVT+   +A LF LPFSI  +  D  +G   + + +  Y+  +   
Sbjct: 162 STADLFFRLLTRSEFVTEKAKRAQLFLLPFSIDVLWVD--LGPTQVAEKLRRYLEKVRTN 219

Query: 94  YPYWNRTGGADHFYVACHSIGRSAMQK-AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ 152
           YPYW  + GADHFY++CH+   ++  +   E+  N+IQ  C+          HKDV  PQ
Sbjct: 220 YPYWESSLGADHFYLSCHAFEHNSKHRNILELGKNSIQAACAPLRHNQKFYPHKDVVFPQ 279

Query: 153 IWPRQEDPPK---LGSSKRNKLAFFAGA--VNSPVREKLLQVWRNDSE--IYAHSGRLKT 205
             P  E+  +   LG   R  LA+F+G   V +P+       W  D +  + A+    + 
Sbjct: 280 YKPVGEEDVRQAILGRRNRTSLAYFSGCPDVTTPLLSAF-HTWETDPDFIVEANPSPHRL 338

Query: 206 PYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIA--NHYDLPFADVLNWKSFSIV 263
                L  S+FC+ V     +T  + D+L +GCVPV+++    +DLPF   LNW  F++V
Sbjct: 339 SVYRNLARSRFCVSV--LPHDTFSLVDALRFGCVPVLLSKLTFHDLPFQGFLNWGQFAVV 396

Query: 264 VATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSS 323
           +   D+P LK+IL  +SS ++  +Q    +  KH +W+  P  YDAF+M + ELW+RR S
Sbjct: 397 LGIEDLPNLKQILANVSSTKHREMQYLGHQAIKHLEWNNPPVAYDAFHMTLLELWVRRHS 456

Query: 324 VR 325
           ++
Sbjct: 457 IK 458


>gi|357118563|ref|XP_003561022.1| PREDICTED: probable glycosyltransferase At5g25310-like
           [Brachypodium distachyon]
          Length = 479

 Score =  167 bits (423), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 101/306 (33%), Positives = 167/306 (54%), Gaps = 33/306 (10%)

Query: 50  TKDPSKADLFFLPFSIARM--------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTG 101
           T DP +A  FFLPFS+++M         +DR      +   ++ Y+  ++ ++ +WNR+ 
Sbjct: 174 TWDPERAHAFFLPFSVSQMVQFAYVPLSYDR----APLRALVADYVRVVAARHRFWNRSS 229

Query: 102 GADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI------WP 155
           GADHF ++CH  G  A +   E+  N I+ +C+++    G    KDVS+P+I       P
Sbjct: 230 GADHFMLSCHDWGPEASRGDPELYGNGIRALCNANT-SEGFRPGKDVSIPEINLYDGDTP 288

Query: 156 RQEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWR----NDSEIYAHS----------G 201
           RQ   P  G S+R  LAFFAG  +  VR+ LL+ W+    ++  +Y +           G
Sbjct: 289 RQLLLPAPGLSERPYLAFFAGGRHGHVRDLLLREWKGRDPDNFPVYEYDLPTTTNTTGGG 348

Query: 202 RLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFS 261
             +  Y   +  ++FCL   G EV + R+ ++++ GCVPV++++ Y  PFADVL W+ FS
Sbjct: 349 DKQHDYYSYMQRARFCLCPSGHEVASPRVVEAIHAGCVPVLVSDGYAPPFADVLRWEGFS 408

Query: 262 IVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
           + V   DIP L+++L+ I + E   L+     V++HF     P   D F+M+++ +WLRR
Sbjct: 409 VSVPVADIPRLREVLESIPAAEVERLRDGGRLVKQHFTLRQPPERLDMFHMILHSVWLRR 468

Query: 322 SSVRVQ 327
            + R+ 
Sbjct: 469 LNFRLN 474


>gi|357436207|ref|XP_003588379.1| Exostosin-2 [Medicago truncatula]
 gi|355477427|gb|AES58630.1| Exostosin-2 [Medicago truncatula]
          Length = 551

 Score =  167 bits (422), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 112/341 (32%), Positives = 171/341 (50%), Gaps = 65/341 (19%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEP--RGNYASESYFKKAF-MKSHFVTKDPSKAD 57
           M    +VYVY             P+   P   G YASE +F K       FVTK+P KA 
Sbjct: 250 MEERLKVYVYREGAR--------PILHSPFLTGIYASEGWFMKLMEANKRFVTKNPKKAH 301

Query: 58  LFFLPFSIARMRHDRRIGT----EGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
           LF+LPFS   +     +      + +  ++  Y+  I+ ++ +WNRTGGADHF V CH  
Sbjct: 302 LFYLPFSSRMLEEALYVKNSHSHKNLIQYLHDYVDMIAARHSFWNRTGGADHFLVGCHDW 361

Query: 114 GRSAMQKAWEVKL---NAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPK--LGS--- 165
             S      E KL   N I+ +C++     G +  KD SLP+ + R    P   LG    
Sbjct: 362 APS------ETKLRLANCIRSLCNAD-VKEGFVFGKDASLPETYVRNAQIPTRDLGGNSF 414

Query: 166 SKRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYAH--SGRLKTPYADELLGSKFCLHV 220
           SK+  LAFFAG+++  VR  LL+ W N   D +I+    + +  + Y   +  SK+C+  
Sbjct: 415 SKKTTLAFFAGSMHGYVRPILLKHWENKDPDMKIFGKLPNSKGNSNYIHYMKSSKYCICA 474

Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
           KG+EVN+ R+ ++++Y CVPVII++++  PF +VL+W+SFS++                 
Sbjct: 475 KGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLDWESFSVI----------------- 517

Query: 281 SEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
                        V+KHF W+  P  YD F+M+++ +W  R
Sbjct: 518 -------------VQKHFLWNKNPVKYDIFHMILHSIWYNR 545


>gi|388493832|gb|AFK34982.1| unknown [Lotus japonicus]
          Length = 229

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 92/226 (40%), Positives = 138/226 (61%), Gaps = 7/226 (3%)

Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKL---- 163
           ++CH  G  A Q    +  N+I+V+C+++    G    KD S P+I  R      L    
Sbjct: 2   LSCHDWGPMASQAVPNLYKNSIRVLCNANTS-EGFKHGKDASFPEINLRTSSIDDLVGGP 60

Query: 164 GSSKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFCLHVK 221
             SKR+ LA FAG ++ P+R  LL+ W N D ++  +S   K   Y D L  SKFCL   
Sbjct: 61  SPSKRSVLASFAGRLHGPIRPPLLEHWENKDGDMQVYSSLPKGVSYYDMLRKSKFCLCPS 120

Query: 222 GFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISS 281
           G+EV + R+ +++Y GCVPV+I+ HY  PF+DVLNWKSFS+ V+  DIP LK+IL  +++
Sbjct: 121 GYEVASPRVVEAIYTGCVPVLISEHYVPPFSDVLNWKSFSLEVSVKDIPRLKEILLSVNT 180

Query: 282 EEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
            +Y+ +Q  V ++R+HF+ H  P  +D F+MV++ +WLRR + RV 
Sbjct: 181 RQYIRMQRRVGQIRRHFEIHSPPKRFDVFHMVLHSVWLRRLNFRVH 226


>gi|320163884|gb|EFW40783.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
          Length = 478

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 109/352 (30%), Positives = 168/352 (47%), Gaps = 64/352 (18%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMR---HDRRIGTEGIPDFISHYIFN 89
           Y +E  F      S F T  P KA++FF+PF     R    +R +G     +  + Y   
Sbjct: 127 YGAEQLFINLLATSAFHTTAPDKANMFFMPFRCTAYRRSVQERVLGDIVAKNVTAQYFDV 186

Query: 90  ISQKYPYWNRTGGADHFYVACHSIGR--SAMQKAWEVKLNAIQVVCSSSY---------- 137
           +  KY +WN + G DHFY+  H +G   +A+     VK NAI +V ++ Y          
Sbjct: 187 VMNKYRWWNVSSGTDHFYICGHDMGTAVTALSHPALVK-NAIGLVNTADYDDARYIPHKD 245

Query: 138 ---------FISGHIA----------------------------HKDVSLPQIWPRQEDP 160
                      S H+A                            H D++   +  R+   
Sbjct: 246 ISLPPNIDVLPSAHVATEEEITADLIRLEMARDRLYRATRQKVAHPDMNFEPLMERRMGK 305

Query: 161 P---KLGS-----SKRNKLAFFAGAVN-SPVREKLLQVWRNDSEIYAHSGRLKTP--YAD 209
                LG       KR KLA+FAG ++   VR K+   + ND++I    GR   P  Y +
Sbjct: 306 LVQYGLGGLIHPREKRTKLAYFAGPLHYGRVRPKVRDAFANDTDIVLFEGRHAQPILYYN 365

Query: 210 ELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI 269
           EL  SKFCL ++G+   + R+ D+++ GC+PVII++HYDLP   +L+W  FSI +    I
Sbjct: 366 ELATSKFCLFLRGYRAWSPRLMDAVFMGCIPVIISDHYDLPLGQLLDWSEFSITIPEARI 425

Query: 270 PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
           P LK+ L  +S  +   +Q+ + +V +HF W+  P  +DAF+MV+++LW RR
Sbjct: 426 PRLKQTLLAVSDAQLSRMQNRLAEVYQHFVWNDPPKPFDAFHMVLWQLWRRR 477


>gi|356546124|ref|XP_003541481.1| PREDICTED: probable glycosyltransferase At5g20260-like, partial
           [Glycine max]
          Length = 290

 Score =  164 bits (416), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 104/289 (35%), Positives = 166/289 (57%), Gaps = 15/289 (5%)

Query: 52  DPSKADLFFLPFSIARM-RHDRRIGTEGIPD----FISHYIFNISQKYPYWNRTGGADHF 106
           +P +A +F LP S+A++ R+     T    D        Y   I+ +YPYWNRT GADHF
Sbjct: 1   NPDEAHVFMLPISVAQIVRYVYNPLTTYSRDQLMWITIDYTNIIAHRYPYWNRTRGADHF 60

Query: 107 YVACHSIGRSAMQKAW---EVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPR--QEDPP 161
             +CH      + +A    E+  N I+V+ +++    G    KDV +P++  +  +   P
Sbjct: 61  LASCHDWAPPDISRAESGKELFKNIIRVLYNANKS-EGFKPEKDVPMPEVNLQGFKLSSP 119

Query: 162 KLGSSKRNK--LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELLG-SKFC 217
            LG    N+  LAFFAG V+  +RE LLQ W++ D E+  H    K      L+G SKFC
Sbjct: 120 ILGLDPNNRSILAFFAGGVHGRIREILLQHWKDKDEEVQVHEYLPKGVDYHGLMGQSKFC 179

Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILK 277
           L   G+EV + RI +S+  GCVPVI++++Y LPF+DVL+   FS+ + +  I  +K +LK
Sbjct: 180 LCPSGYEVASPRIVESINIGCVPVIVSDYYQLPFSDVLDRSKFSLHIPSRRIAEIKTMLK 239

Query: 278 GISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
            +   +YL LQ  V+KV++HF  +     ++ F+M+++ +WLR+ ++R+
Sbjct: 240 NVPHAKYLKLQKRVMKVQRHFVLNRPAKSFNVFHMILHSIWLRQLNIRL 288


>gi|167517883|ref|XP_001743282.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163778381|gb|EDQ91996.1| predicted protein [Monosiga brevicollis MX1]
          Length = 503

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 110/337 (32%), Positives = 171/337 (50%), Gaps = 28/337 (8%)

Query: 3   RSFRVYVYP-HRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFL 61
           R  R++ YP H  + P A    P D++    Y  E    +    S +  + P +A  F +
Sbjct: 175 RQLRIFTYPVHLGSMPRA----PDDYK----YGVERRLPQVLASSPYAVQQPEEATHFLI 226

Query: 62  PFSIARMRH---DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAM 118
           PF     R+   DR  G       ++ +I +IS  YPYWNR+ GA+HFYV  H +G SA+
Sbjct: 227 PFQCTAHRYTVADRAGGQNAAEAGLASWIASISAAYPYWNRSAGANHFYVCSHDMGSSAV 286

Query: 119 -QKAWEVKLNAIQVVCSSSYFISGHIAHKDVSL-PQIWPRQEDPPKLGS----------- 165
            Q +   + N I +V ++         H+D++  P I    +   + G+           
Sbjct: 287 AQLSRAAQQNLIGLVNTADRRDGFFNVHRDLATAPHIGDGCDTCLQGGTRLSVTREAWAG 346

Query: 166 SKRNKLAFFAGAVN-SPVREKLLQVWRNDSEIYAHSGRLKTP-YADELLGSKFCLHVKGF 223
           + RN+LAF AG +   PVR +L Q +  D +     G L    Y   L  S+FCL V+GF
Sbjct: 347 TPRNRLAFMAGNLQRGPVRPRLRQFFDGDPDFLLVDGTLAAAHYRQALAESEFCLVVRGF 406

Query: 224 EVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEE 283
            V T R+ D+++ GC+PVIIA+ Y+LPF+ +L+W SF++ V   D+P LK IL    S+ 
Sbjct: 407 RVWTPRLMDAVWSGCIPVIIADGYELPFSSLLHWPSFAVFVPEHDVPRLKDILLAKLSQA 466

Query: 284 YLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLR 320
             LL++ +L   ++  +H      DAF ++M +L  R
Sbjct: 467 P-LLRANLLAASQYLTYHSNWVPLDAFDILMLQLAAR 502


>gi|449528899|ref|XP_004171439.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
           At3g07620-like, partial [Cucumis sativus]
          Length = 173

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 75/163 (46%), Positives = 110/163 (67%), Gaps = 5/163 (3%)

Query: 168 RNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKG 222
           R  L F+AG  NS +R  L +VW ND+E+   + R+        Y      +KFC+   G
Sbjct: 10  RTTLGFWAGHRNSKIRVILARVWENDTELDISNNRISRATGHLLYQKRFYKTKFCICPGG 69

Query: 223 FEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSE 282
            +VN+ARIADS++YGCVPVI++++YDLPF D+L+W+ FS++V   D+  LK+ILK IS  
Sbjct: 70  SQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVIVKERDVYQLKQILKDISDI 129

Query: 283 EYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           E++ L   +++V+KHFQW+  P  YDAF+MVMY+LWLR   ++
Sbjct: 130 EFIKLHKNLMQVQKHFQWNSXPIKYDAFHMVMYDLWLRHHVIK 172


>gi|147775380|emb|CAN73459.1| hypothetical protein VITISV_022181 [Vitis vinifera]
          Length = 319

 Score =  157 bits (398), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 96/299 (32%), Positives = 152/299 (50%), Gaps = 39/299 (13%)

Query: 46  SHFVTKDPSKADLFFLPFSIARMRH---DRRIGTEG--IPDFISHYIFNISQKYPYWNRT 100
           SHF+   P  A +F++P S+ R+ H      +   G  +   ++ YI+ +S KYPYWNR+
Sbjct: 42  SHFMAGHPDVAHVFYIPISVTRIAHYIYSPPVDYSGHMLQRLVTDYIYVVSDKYPYWNRS 101

Query: 101 GGADHFYVACHSIGRS-AMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLP-------- 151
            GADHF V+CH   +S AM    +    +   +C  S +     AH   + P        
Sbjct: 102 NGADHFLVSCHDWVQSFAMPTPPKGSSPSETSLCQKSTYPKAXSAHLTSTNPLTSVTSSP 161

Query: 152 ---QIWPRQEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYA 208
               + P   +P   G  K+    F                     ++Y H    +  YA
Sbjct: 162 SSPAVNPATCEPSCSGPGKKMMTKF---------------------QVYEHLPSNRD-YA 199

Query: 209 DELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
             +  SKFCL   G+EV + R+ +++  GCVPVII ++Y LPF++VL+W  FSI + +  
Sbjct: 200 KSMGDSKFCLCPSGWEVASPRVVEAIAAGCVPVIICDYYVLPFSEVLDWSKFSINITSDK 259

Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
           IP +KKILK + +E YL +Q  V +V++HF  +     YD  +M+++ +WLRR +VR++
Sbjct: 260 IPEIKKILKAVPNERYLRMQKRVKQVQRHFVINRPAQPYDMLHMILHSVWLRRLNVRLR 318


>gi|224098485|ref|XP_002311191.1| predicted protein [Populus trichocarpa]
 gi|222851011|gb|EEE88558.1| predicted protein [Populus trichocarpa]
          Length = 345

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 106/333 (31%), Positives = 166/333 (49%), Gaps = 42/333 (12%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M  SF++Y+Y   +   F++              +ES F  +   S FVT++P +A LFF
Sbjct: 40  MLNSFKIYIYTPSKPFSFSS-------------PTESLFFTSLQASPFVTQNPEEAHLFF 86

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           +PF+         + T  I  FI     ++  ++PYWNRT GADHFYV+C  +G  + + 
Sbjct: 87  VPFA-------SNLSTRSIARFIR----DLRMEFPYWNRTLGADHFYVSCAGLGYESDRN 135

Query: 121 AWEVKLNAIQVVC---SSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGA 177
             E+K N++Q+ C       F    + HKD+SLP +          G+     L    G 
Sbjct: 136 LVELKKNSVQISCFPVPEGKF----VPHKDISLPPLARITRASHAPGNRTVRYLVRHGGV 191

Query: 178 VNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSK-FCLHVKGFEVNTARIADSLYY 236
            +S    KL    RNDS+    S         E LGS  FCL   G +++   I ++L +
Sbjct: 192 KDS----KLANELRNDSDFLMESEPSNEMTLVERLGSSMFCLFEDGADISG--IGEALRF 245

Query: 237 GCVPVIIANH--YDLPFADVLNWKSFSIVVAT-LDIPLLKKIL-KGISSEEYLLLQSYVL 292
           GCVPV++ +    DLP  DVL+W+  ++ V +   I  +K++L +    +E    +   +
Sbjct: 246 GCVPVMVTDRPMQDLPLMDVLSWQKIAVFVGSGGGIKEMKRVLDRTCKDDECEGTRRLGV 305

Query: 293 KVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
              +HF W+  P  YD+FYMV+Y+LWLRR ++R
Sbjct: 306 AASQHFGWNEIPQPYDSFYMVVYQLWLRRHTIR 338


>gi|296081850|emb|CBI20855.3| unnamed protein product [Vitis vinifera]
          Length = 618

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 120/358 (33%), Positives = 167/358 (46%), Gaps = 76/358 (21%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEP--RGNYASESYFKKAFMKSH-FVTKDPSKAD 57
           M  + +VY Y             PV  +P  +G YASE +F K    +  FVTK+  KA 
Sbjct: 308 MENTLKVYTYREGER--------PVFHQPPIKGIYASEGWFMKLMQANKKFVTKNGRKAH 359

Query: 58  LFFLPFSIARMR--------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVA 109
           LF+LPFS   +         H R+     +  ++ +Y+  I  KYP+WNRTGGADHF VA
Sbjct: 360 LFYLPFSSLMLEEALYVPNSHSRK----NLEQYLKNYLDMIGAKYPFWNRTGGADHFLVA 415

Query: 110 CHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS-- 165
           CH    S   K   +  N+I+ +C+S     G    KDVSLP+   R    P  +LG   
Sbjct: 416 CHDWAPSETLK---LMANSIRALCNSD-IREGFKLGKDVSLPETCVRIPQNPLRQLGGKP 471

Query: 166 -SKRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYAHSGRLKTP---YADELLGSKFCL 218
            S+R  LAFFAG+++  VR  LL+ W N   D +IY    + K     Y   +  SK+C+
Sbjct: 472 PSQRRILAFFAGSMHGYVRPILLKYWENKDPDMKIYGRMPKAKKGTMNYIQHMKSSKYCI 531

Query: 219 HVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKG 278
             KG+EVN+ R                                      DIP LK IL  
Sbjct: 532 CAKGYEVNSPRK-------------------------------------DIPNLKSILLS 554

Query: 279 ISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRS-SVRVQWSTSLDSN 335
           I  + YL +Q  V +V++HF WH  P  YD F+M+++ +W  R   +RV+    L  N
Sbjct: 555 IPEKSYLEIQMRVKQVQQHFLWHAKPVKYDVFHMILHSVWYNRVLQIRVRPKQLLGRN 612


>gi|218198616|gb|EEC81043.1| hypothetical protein OsI_23835 [Oryza sativa Indica Group]
          Length = 250

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 97/292 (33%), Positives = 146/292 (50%), Gaps = 60/292 (20%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M RSFRV+VYP    DP      P      G YASE YF +   +S F T D  +A LFF
Sbjct: 1   MERSFRVFVYPD--GDPGTFYQTPRKLT--GKYASEGYFFQNIRESRFRTDDLEQAHLFF 56

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           +P S  +MR  +    E +   + +Y+ ++  KYPYWNRT G        HS       K
Sbjct: 57  VPISPHKMR-GKGTSYENMTIIVQNYVESLINKYPYWNRTLGE-------HS-------K 101

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAVNS 180
            + + L  +                                    S+   L F+AG  NS
Sbjct: 102 GFPLSLRTL------------------------------------SEWTILGFWAGHCNS 125

Query: 181 PVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIADSLY 235
            +R  L ++W ND+E+   + R+        Y  +   +KFC+   G +VN+ARI+DS++
Sbjct: 126 KIRVILARIWENDTELAISNNRINRAIGNLVYQKQFFWTKFCVCPGGSQVNSARISDSIH 185

Query: 236 YGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLL 287
           YGCVPVI++++YDLPF+ +LNW+ F++V+   D+  LK ILK +S +E++ +
Sbjct: 186 YGCVPVILSDYYDLPFSGILNWRKFAVVLKESDVYELKSILKSLSQKEFVFI 237


>gi|255566500|ref|XP_002524235.1| conserved hypothetical protein [Ricinus communis]
 gi|223536512|gb|EEF38159.1| conserved hypothetical protein [Ricinus communis]
          Length = 337

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 103/331 (31%), Positives = 165/331 (49%), Gaps = 48/331 (14%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M +SF++Y Y   +   F +   PV          ES F  +   SHF+T +P +A LFF
Sbjct: 42  MLQSFKIYTYTPPQPFSFTS---PV----------ESLFFTSLQNSHFITLNPEQAHLFF 88

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           +PF            ++  P  ++  I ++  ++PYWNRT GADHFY++C  +G  + + 
Sbjct: 89  IPFP-----------SDLSPRSLARVIRDLRTEFPYWNRTLGADHFYISCTGLGYESDRN 137

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAVNS 180
             E+K N++Q+ C  S      + HKD++LP + P      K  + +R   AF       
Sbjct: 138 LVELKKNSVQISCFPSP-NGKFVPHKDITLPPLVPSTIH--KSSNKRRPYKAF------- 187

Query: 181 PVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
            V+   ++  R D E+   S       +DE   S+FCL    +  N + I ++L  GCVP
Sbjct: 188 -VKYDGVEELRGDLEVLIESQP-----SDEKTRSEFCLF--DYAANISGIGEALSSGCVP 239

Query: 241 VIIANH--YDLPFADVLNWKSFSIVVATLD--IPLLKKILKGISS--EEYLLLQSYVLKV 294
           ++I      DLP  DVL W+  +++V + D     +K++L G  S  +    ++      
Sbjct: 240 LVITERPIQDLPLMDVLRWQEIAVIVGSSDDGFKWVKRVLNGTCSRGDTCERMRRLGAGA 299

Query: 295 RKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
            +H  W+  P  YDAF+MVMY+LWLRR ++R
Sbjct: 300 SQHLVWNETPEPYDAFHMVMYQLWLRRHTIR 330


>gi|222635945|gb|EEE66077.1| hypothetical protein OsJ_22095 [Oryza sativa Japonica Group]
          Length = 265

 Score =  151 bits (382), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 103/326 (31%), Positives = 157/326 (48%), Gaps = 63/326 (19%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M RSFRV+VYP    DP      P      G YASE YF +   +S F T D  KA LFF
Sbjct: 1   MERSFRVFVYPD--GDPGTFYQTPRKLT--GKYASEGYFFQNIRESRFRTDDLEKAHLFF 56

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           +P S  +MR  +  G  G+  F  H +  + +          +  F ++  ++    +  
Sbjct: 57  VPISPHKMR-GKDTGGWGLTIFSLHAMMLVLEH---------SKGFPLSLRTLSEWTILG 106

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAVN 179
            W                 +GH   K  V L +IW    +           LA     +N
Sbjct: 107 FW-----------------AGHRNSKIRVILARIWENDTE-----------LAISNNRIN 138

Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCV 239
             +   + Q                         +KFC+   G +VN+ARI+DS++YGC+
Sbjct: 139 RAIGNLVYQ--------------------KHFFRTKFCVCPGGSQVNSARISDSIHYGCM 178

Query: 240 PVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQ 299
           PVI++++YDL F+ +LNW+ F++V+   D+  LK ILK +S +E++ L   +++V+KHF+
Sbjct: 179 PVILSDYYDLRFSGILNWRKFAVVLKESDVYELKSILKSLSQKEFVSLHKSLVQVQKHFE 238

Query: 300 WHVFPSDYDAFYMVMYELWLRRSSVR 325
           WH  P  YDAF+M+MYELWLR   ++
Sbjct: 239 WHSPPVPYDAFHMIMYELWLRHHVIK 264


>gi|52075645|dbj|BAD44815.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
 gi|52075653|dbj|BAD44823.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
          Length = 514

 Score =  148 bits (373), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 113/386 (29%), Positives = 182/386 (47%), Gaps = 66/386 (17%)

Query: 1   MNRSFRVYVY-----PHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSK 55
           M R F+VYVY     P     P  N+     +   G +  E       +     T DP++
Sbjct: 134 MERRFKVYVYEEGEPPIAHEGPCKNI-----YAVEGRFIEELELMAPPLGG-VRTWDPAR 187

Query: 56  ADLFFLPFSIARMRHD--RRIGTEGIP--DFISHYIFNISQKYPYWNRTGGADHFYVACH 111
           A   FLP S+++M     R +  +  P    ++ Y+  ++ ++ +WNR+ GADHF ++CH
Sbjct: 188 AHALFLPLSVSQMVQLAYRPLSYDLSPLRAIVADYVAVVASRHRFWNRSAGADHFMLSCH 247

Query: 112 S------------------------IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKD 147
                                    IG  A +   E+  NAI+ +C+++    G    KD
Sbjct: 248 DWAIHTPSVQRDSISGFPTFRVQRLIGPHASRGHPELYANAIRALCNANTS-EGFRPDKD 306

Query: 148 VSLPQIWPRQED-----PPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYA--- 198
           VS+P+I     D             R  LAFFAG  +  VR+ LL+ W+  D  ++    
Sbjct: 307 VSIPEINLYDGDMPPELLSPAPPPPRPFLAFFAGGRHGHVRDLLLRHWKGRDPAVFPVYE 366

Query: 199 -----------------HSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPV 241
                              G    PY   +  S+FCL   G EV + R+ ++++ GCVPV
Sbjct: 367 YDLPSIPVSVSGDGDTDAGGEGGNPYYWYMRRSRFCLCPSGHEVASPRVVEAIHAGCVPV 426

Query: 242 IIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
           ++A+ Y  PFADVL W++FS+ VA  D+P L+++L+ I + E   L+  V  V++HF  H
Sbjct: 427 VVADGYAPPFADVLRWEAFSVAVAVADVPRLRELLERIPAPEVERLRDGVRLVKRHFMLH 486

Query: 302 VFPSDYDAFYMVMYELWLRRSSVRVQ 327
             P   D F+M+++ +WLRR ++R+ 
Sbjct: 487 QPPERLDMFHMILHSVWLRRLNLRLN 512


>gi|357478277|ref|XP_003609424.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
 gi|355510479|gb|AES91621.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
          Length = 229

 Score =  142 bits (357), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 79/205 (38%), Positives = 125/205 (60%), Gaps = 7/205 (3%)

Query: 129 IQVVCSSSYFISGHIAHKDVSLPQI-WPRQE-DPPKLGSSKRNK--LAFFAGAVNSPVRE 184
           I+ +C+++    G   ++DVS+PQ+  P  +  PP       N+  L FFAG  +  +R+
Sbjct: 19  IRALCNANTS-EGFWPNRDVSIPQLNLPVGKLGPPNTDQHPNNRTILTFFAGGAHGKIRK 77

Query: 185 KLLQVWRN-DSEIYAHSGRLKTPYADELLG-SKFCLHVKGFEVNTARIADSLYYGCVPVI 242
           KLL+ W++ D E+  H    K     +L+G SKFCL   G EV + R+ +++Y GCVPVI
Sbjct: 78  KLLKSWKDKDEEVQVHEYLPKGQDYTKLMGLSKFCLCPSGHEVASPRVVEAIYAGCVPVI 137

Query: 243 IANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHV 302
           I ++Y LPF+DVLNW  FS+ +A   IP +K IL+ I+  +Y +L S V +VRKHF+ + 
Sbjct: 138 ICDNYSLPFSDVLNWSQFSMEIAVDRIPEIKTILQNITETKYRVLYSNVRRVRKHFEMNR 197

Query: 303 FPSDYDAFYMVMYELWLRRSSVRVQ 327
               +D  +M+++ +WLRR + R+ 
Sbjct: 198 PAKPFDLIHMILHSVWLRRLNFRLH 222


>gi|224112673|ref|XP_002316258.1| predicted protein [Populus trichocarpa]
 gi|222865298|gb|EEF02429.1| predicted protein [Populus trichocarpa]
          Length = 339

 Score =  140 bits (354), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 95/299 (31%), Positives = 151/299 (50%), Gaps = 35/299 (11%)

Query: 35  SESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKY 94
           +ES F      S FVT++P +A L+F+PFS         + T  +  FI     ++  ++
Sbjct: 61  TESNFFTCLQNSPFVTQNPEEAHLYFVPFS-------SNLSTRSVARFIR----DLRMEF 109

Query: 95  PYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGH-IAHKDVSLPQI 153
           PYWNRT GADHFYV+C  +G  + +   E+K N++Q+ C  +    G  + HKD++ P  
Sbjct: 110 PYWNRTLGADHFYVSCAGLGYESDRNLVELKKNSVQISCFPT--TEGRFVPHKDITFPP- 166

Query: 154 WPRQEDPPKLGSSKRNKLAFFAGAV--NSPVREKLLQVWRNDSEIYAHSGRLKTPYADEL 211
                       ++ N+ A + G V  N      L+   R DS+    S           
Sbjct: 167 -----------HAQGNRTAKYLGFVRYNEVKESNLVNELRKDSDFLIESEPSNGMTLVGR 215

Query: 212 LGSK-FCLHVKGFEVNTARIADSLYYGCVPVIIANH--YDLPFADVLNWKSFSIVVATL- 267
           LGS  FCL   G +V+   I ++L +GCVPV++ +    DLP  DV+ W+  +I V +  
Sbjct: 216 LGSSVFCLFEYGADVSG--IGEALRFGCVPVMVMDRPMQDLPLMDVIGWQKIAIFVGSRG 273

Query: 268 DIPLLKKIL-KGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
            +  +K+ L +    +E    +   +   +HF W+  P  YD+F+MVMY+LWLRR ++R
Sbjct: 274 GVKEVKRELDRTCKDDECAGRRRLGVVASQHFVWNHMPQPYDSFHMVMYQLWLRRHAIR 332


>gi|224142185|ref|XP_002324439.1| predicted protein [Populus trichocarpa]
 gi|222865873|gb|EEF03004.1| predicted protein [Populus trichocarpa]
          Length = 208

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 76/206 (36%), Positives = 121/206 (58%), Gaps = 12/206 (5%)

Query: 127 NAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS---SKRNKLAFFAGAVNSP 181
           + I+ +C++    +G    +DVS P+ + R    P   LG    S+RN LAF+AG ++  
Sbjct: 3   HCIKALCNAD-VTAGFKIGRDVSFPETYVRSARNPLRDLGGKPPSQRNILAFYAGNMHGY 61

Query: 182 VREKLLQVWRNDSEIYAHSGRL------KTPYADELLGSKFCLHVKGFEVNTARIADSLY 235
           +R  LL+ W++        G +      K  Y   +  SK+C+  KG+EVN+ R+ ++++
Sbjct: 62  LRPILLKYWKDKDPDMKIFGPMPPGVASKMNYIHHMQRSKYCICPKGYEVNSPRVVEAIF 121

Query: 236 YGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVR 295
           Y CVPVII++++  PF DVL+W +FS+++A  DI  LK+IL  I  E+YL +Q  V K +
Sbjct: 122 YECVPVIISDNFVPPFFDVLDWGAFSLILAEKDISNLKEILLSIPKEKYLQMQLGVRKAQ 181

Query: 296 KHFQWHVFPSDYDAFYMVMYELWLRR 321
           +HF WH  P  YD FYM ++ +W  R
Sbjct: 182 RHFLWHASPMKYDLFYMTLHSIWYNR 207


>gi|125544214|gb|EAY90353.1| hypothetical protein OsI_11935 [Oryza sativa Indica Group]
          Length = 218

 Score =  135 bits (339), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 75/188 (39%), Positives = 115/188 (61%), Gaps = 18/188 (9%)

Query: 146 KDVSLPQIWPRQEDPP-----KLGSSKRNKLAFFAGAVNSPVREKLLQVWRN--DSEIYA 198
           +DVSLP+   R    P      L  S+R  LAFFAG V+  VR  LL+ W +  D +++ 
Sbjct: 29  RDVSLPETTIRTPRRPLRYVGGLPVSRRGILAFFAGNVHGRVRPVLLKHWGDGRDDDMWV 88

Query: 199 HS---GRL--KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFAD 253
           +     R+  +  Y   +  S+FCL   G+EVN+ RI ++LYY CVPVIIA+++      
Sbjct: 89  YGPLPARVSRRMSYIQHMKNSRFCLCPMGYEVNSPRIVEALYYECVPVIIADNF------ 142

Query: 254 VLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
           VL+W +F++VVA  D+P LKKIL+GI+  +Y+ +   V ++++HF WH  P  YD F+M+
Sbjct: 143 VLDWSAFAVVVAEKDVPDLKKILQGITLRKYVAMHGCVKRLQRHFLWHARPLRYDLFHMI 202

Query: 314 MYELWLRR 321
           ++ +WL R
Sbjct: 203 LHSIWLSR 210


>gi|297725119|ref|NP_001174923.1| Os06g0638350 [Oryza sativa Japonica Group]
 gi|255677258|dbj|BAH93651.1| Os06g0638350 [Oryza sativa Japonica Group]
          Length = 257

 Score =  134 bits (337), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 99/270 (36%), Positives = 136/270 (50%), Gaps = 44/270 (16%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M RSFRV+VYP    DP      P      G YASE YF +   +S F T D  KA LFF
Sbjct: 1   MERSFRVFVYPD--GDPGTFYQTPRKLT--GKYASEGYFFQNIRESRFRTDDLEKAHLFF 56

Query: 61  LPFSIARMR---------------------HDRRIGTEGIPDFISHYIFN--ISQKYPYW 97
           +P S  +MR                     +DR I       F+  Y +    S    +W
Sbjct: 57  VPISPHKMRGKVPSSLLLVTYAWLILHIRSYDRSI------LFLDLYWWCPLCSSFRGHW 110

Query: 98  NRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQ 157
               GADHF+V CH +G  A +    +  N+I+VVCS SY  +G+I HKDV+LPQI  + 
Sbjct: 111 GV--GADHFFVTCHDVGVRAFEGLPFIIKNSIRVVCSPSY-NAGYIPHKDVALPQIL-QP 166

Query: 158 EDPPKLGSSKRNK--LAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTP-----YADE 210
              P  G+   N+  L F+AG  NS +R  L ++W ND+E+   + R+        Y   
Sbjct: 167 FALPAGGNDIENRTILGFWAGHRNSKIRVILARIWENDTELAISNNRINRAIGNLVYQKH 226

Query: 211 LLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
              +KFC+   G +VN+ARI+DS++YGC+P
Sbjct: 227 FFRTKFCVCPGGSQVNSARISDSIHYGCMP 256


>gi|449465854|ref|XP_004150642.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
           sativus]
          Length = 312

 Score =  134 bits (336), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 98/297 (32%), Positives = 142/297 (47%), Gaps = 65/297 (21%)

Query: 35  SESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKY 94
           +ES F K+ + S + T DP +A LFF+PFS         I T  +    +  I  +    
Sbjct: 65  AESLFYKSLLNSPYTTHDPDQAHLFFIPFS-------PHISTRSL----ARLIRTLRTDL 113

Query: 95  PYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS--GHIAHKDVSLPQ 152
           PYWNRT GADHF+++   IG  + +   E+K NAIQV   SS+ +S    I HKDVSLP 
Sbjct: 114 PYWNRTLGADHFFLSSSGIGYISDRNVVELKKNAIQV---SSFPVSPGKFIPHKDVSLP- 169

Query: 153 IWPRQEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELL 212
             P    PP+  S                                         Y D+L 
Sbjct: 170 --PVSTLPPRTPSC----------------------------------------YGDKLA 187

Query: 213 GSKFCL-HVKGFEVNTARIADSLYYGCVPVIIANHY--DLPFADVLNWKSFSIVVA-TLD 268
            S FCL   +G +V+   I ++L +GCVPV+I++ +  DLP  DV+ W+  ++ VA    
Sbjct: 188 KSDFCLFEYEGGDVSG--IGEALRFGCVPVVISDRWIQDLPLMDVVRWEEMAVFVAGGGG 245

Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           I  +KK+L+ +  E    ++       +HF W+  P   DAF  V Y+LW+RR +VR
Sbjct: 246 IEGVKKVLRRVDGERLDRMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQLWVRRHAVR 302


>gi|449503277|ref|XP_004161922.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
           sativus]
          Length = 310

 Score =  133 bits (335), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 98/297 (32%), Positives = 142/297 (47%), Gaps = 65/297 (21%)

Query: 35  SESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKY 94
           +ES F K+ + S + T DP +A LFF+PFS         I T  +    +  I  +    
Sbjct: 63  AESLFFKSLLNSPYATHDPDQAHLFFIPFS-------PHISTRSL----ARLIRTLRTDL 111

Query: 95  PYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS--GHIAHKDVSLPQ 152
           PYWNRT GADHF+++   IG  + +   E+K NAIQV   SS+ +S    I HKDVSLP 
Sbjct: 112 PYWNRTLGADHFFLSSSGIGYISDRNVVELKKNAIQV---SSFPVSPGKFIPHKDVSLP- 167

Query: 153 IWPRQEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELL 212
             P    PP+  S                                         Y D+L 
Sbjct: 168 --PVSTLPPRTPSC----------------------------------------YGDKLA 185

Query: 213 GSKFCL-HVKGFEVNTARIADSLYYGCVPVIIANHY--DLPFADVLNWKSFSIVVA-TLD 268
            S FCL   +G +V+   I ++L +GCVPV+I++ +  DLP  DV+ W+  ++ VA    
Sbjct: 186 KSDFCLFEYEGGDVSG--IGEALRFGCVPVVISDRWIQDLPLMDVVRWEEMAVFVAGGGG 243

Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           I  +KK+L+ +  E    ++       +HF W+  P   DAF  V Y+LW+RR +VR
Sbjct: 244 IEGVKKVLRRVDGERLDRMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQLWVRRHAVR 300


>gi|297738775|emb|CBI28020.3| unnamed protein product [Vitis vinifera]
          Length = 665

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 101/335 (30%), Positives = 147/335 (43%), Gaps = 80/335 (23%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKS-HFVTKDPSKADLF 59
           M R  +VYVY       F   +L      +G YASE +F K   ++ HFV KDP +A LF
Sbjct: 390 MERILKVYVYKDGEKPIFHQPIL------KGLYASEGWFMKLMERNKHFVVKDPRQAQLF 443

Query: 60  FLPFS--------IARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
           ++PFS          R  H+R      +  ++  Y   I+ KY +WNRTGGADHF VACH
Sbjct: 444 YMPFSSRMLEYKLYVRNSHNR----TNLRQYLKQYSEKIAAKYRFWNRTGGADHFLVACH 499

Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS---S 166
                  +   E     I+ +C++    +G    +DVSLP+ + R    P   LG    S
Sbjct: 500 DWAPYETRHHME---QCIKALCNAD-VTAGFKIGRDVSLPETYVRSARNPLRDLGGKPPS 555

Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVN 226
           +R+ LAF+AG ++  +R  LL+ W++           K P                    
Sbjct: 556 ERHILAFYAGNMHGYLRPILLKYWKD-----------KDP-------------------- 584

Query: 227 TARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLL 286
                D   YG +P  +         D+ N K                +L  I +++YL 
Sbjct: 585 -----DMKIYGPMPPGLRGQQPTE-KDIPNLKD---------------VLLSIPNDKYLQ 623

Query: 287 LQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
           +Q  V KV+KHF WH  P  YD F+M ++ +W  R
Sbjct: 624 MQLGVRKVQKHFLWHAKPLKYDLFHMTLHSIWYNR 658


>gi|449531319|ref|XP_004172634.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
           sativus]
          Length = 307

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 67/153 (43%), Positives = 90/153 (58%), Gaps = 6/153 (3%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M   F+VY+YP    DP  N       +  G YASE YF +   +S F T+DP +A LFF
Sbjct: 147 MESKFKVYIYPD--GDP--NTFYQTPRKLTGKYASEGYFFQNIRESRFRTEDPDQAHLFF 202

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           +P S  +MR  +    E +   + +Y+  +  KYPYWNRT GADHF+V CH +G  A + 
Sbjct: 203 IPISCHKMR-GKGTSYENMTVIVQNYVEGLISKYPYWNRTLGADHFFVTCHDVGVRASEG 261

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI 153
              +  NAI+VVCS SY + G I HKDV+LPQ+
Sbjct: 262 LPFLIKNAIRVVCSPSYDV-GFIPHKDVALPQV 293


>gi|218197695|gb|EEC80122.1| hypothetical protein OsI_21883 [Oryza sativa Indica Group]
          Length = 545

 Score =  120 bits (302), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 112/417 (26%), Positives = 181/417 (43%), Gaps = 97/417 (23%)

Query: 1   MNRSFRVYVY-----PHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSK 55
           M R F+VYVY     P     P  N+     +   G +  E       +     T DP++
Sbjct: 134 MERRFKVYVYEEGEPPIAHEGPCKNI-----YAVEGRFIEELELMAPPLGG-VRTWDPAR 187

Query: 56  ADLFFLPFSIARMRHD--RRIGTEGIP--DFISHYIFNISQKYPYWNRTGGA-------- 103
           A   FLP S+++M     R +  +  P    ++ Y+  ++ ++ +WNR+ GA        
Sbjct: 188 AHALFLPLSVSQMVQLAYRPLSYDLSPLRAIVADYVAVVASRHRFWNRSAGADHFMLSCH 247

Query: 104 -----DHFY---------------------------------VACHSI---------GRS 116
                +HF+                                 V   SI         G  
Sbjct: 248 DWASTNHFFPLFFRRKIFSQHQMISIQISKRVQIINSVHTPSVQRDSISGFPTFRVNGPH 307

Query: 117 AMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQED-----PPKLGSSKRNKL 171
           A +   E+  NAI+ +C+++    G    KDVS+P+I     D             R  L
Sbjct: 308 ASRGHPELYANAIRALCNANT-SEGFRPDKDVSIPEINLYDGDMPPELLSPAPPPPRPFL 366

Query: 172 AFFAGAVNSPVREKLLQVWRN-DSEIYA--------------------HSGRLKTPYADE 210
           AFFAG  +  VR+ LL+ W+  D  ++                       G    PY   
Sbjct: 367 AFFAGGRHGHVRDLLLRHWKGRDPAVFPVYEYDLPSIPVSVSGDGDTDAGGEGGNPYYWY 426

Query: 211 LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIP 270
           +  S+FCL   G EV + R+ ++++ GCVPV++A+ Y  PFADVL W++FS+ VA  D+P
Sbjct: 427 MRRSRFCLCPSGHEVASPRVVEAIHAGCVPVVVADGYAPPFADVLRWEAFSVAVAVADVP 486

Query: 271 LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
            L+++L+ I + E   L+  V  V++HF  H  P   D F+M+++ +WLRR ++R+ 
Sbjct: 487 RLRELLERIPAPEVERLRDGVRLVKRHFMLHQPPERLDMFHMILHSVWLRRLNLRLN 543


>gi|115469550|ref|NP_001058374.1| Os06g0680900 [Oryza sativa Japonica Group]
 gi|113596414|dbj|BAF20288.1| Os06g0680900 [Oryza sativa Japonica Group]
 gi|125556494|gb|EAZ02100.1| hypothetical protein OsI_24187 [Oryza sativa Indica Group]
          Length = 477

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 94/289 (32%), Positives = 148/289 (51%), Gaps = 18/289 (6%)

Query: 50  TKDPSKADLFFLPFSIARM-----RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGAD 104
            + P +A  FFLP S+A +     R D     +     ++ Y+  ++  YP+WNR+ GAD
Sbjct: 183 ARHPGEAHAFFLPISVASIAGYVYRRDMIDFWDPQLRLVAGYVDGLAAMYPFWNRSRGAD 242

Query: 105 HFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLG 164
           HF V+CH           E++ NAI+V+C +     G     DV+LP +       P  G
Sbjct: 243 HFLVSCHQWAPILSAAKAELRGNAIRVMCDAD-MSDGFDPATDVALPPVVASARATPPQG 301

Query: 165 --SSKRNKLAFFAGAVNSP--VREKLLQVWRNDSEIYAHSGRL--KTPYADELLGSKFCL 218
             +S+R  LAFFA        VRE LL  W    +     GRL     + + +  ++FCL
Sbjct: 302 RVASERTVLAFFAAGGGGGGAVREALLARWEGRDDRVVVYGRLPAGVDHGELMRRARFCL 361

Query: 219 ----HVKGFEVNTARIADSLYYGCVPVIIANH-YDLPFADVLNWKSFSIVVATLDIPLLK 273
                 +G    + R+ +++  GCVPV++ +  Y  PF+DVL+W  FS+ V    +  +K
Sbjct: 362 CPCGGGEGAAAASRRVVEAITAGCVPVLVDDGGYSPPFSDVLDWARFSVAVPAERVGEIK 421

Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSD-YDAFYMVMYELWLRR 321
            IL G+S   Y +L+  VL+VR+HF+ +  P+  +D   MV++ +WLRR
Sbjct: 422 DILGGVSDRRYGVLRRRVLRVRRHFRLNRPPAKRFDVVNMVIHSIWLRR 470


>gi|222635067|gb|EEE65199.1| hypothetical protein OsJ_20321 [Oryza sativa Japonica Group]
          Length = 551

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 79/241 (32%), Positives = 125/241 (51%), Gaps = 27/241 (11%)

Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQED-----PPKLGSSK 167
           IG  A +   E+  NAI+ +C+++    G    KDVS+P+I     D             
Sbjct: 310 IGPHASRGHPELYANAIRALCNANTS-EGFRPDKDVSIPEINLYDGDMPPELLSPAPPPP 368

Query: 168 RNKLAFFAGAVNSPVREKLLQVWRN-DSEIYA--------------------HSGRLKTP 206
           R  LAFFAG  +  VR+ LL+ W+  D  ++                       G    P
Sbjct: 369 RPFLAFFAGGRHGHVRDLLLRHWKGRDPAVFPVYEYDLPSIPVSVSGDGDTDAGGEGGNP 428

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   +  S+FCL   G EV + R+ ++++ GCVPV++A+ Y  PFADVL W++FS+ VA 
Sbjct: 429 YYWYMRRSRFCLCPSGHEVASPRVVEAIHAGCVPVVVADGYAPPFADVLRWEAFSVAVAV 488

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
            D+P L+++L+ I + E   L+  V  V++HF  H  P   D F+M+++ +WLRR ++R+
Sbjct: 489 ADVPRLRELLERIPAPEVERLRDGVRLVKRHFMLHQPPERLDMFHMILHSVWLRRLNLRL 548

Query: 327 Q 327
            
Sbjct: 549 N 549



 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 34/120 (28%), Positives = 57/120 (47%), Gaps = 15/120 (12%)

Query: 1   MNRSFRVYVY-----PHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSK 55
           M R F+VYVY     P     P  N+     +   G +  E       +     T DP++
Sbjct: 137 MERRFKVYVYEEGEPPIAHEGPCKNI-----YAVEGRFIEELELMAPPLGG-VRTWDPAR 190

Query: 56  ADLFFLPFSIARMRHD--RRIGTEGIP--DFISHYIFNISQKYPYWNRTGGADHFYVACH 111
           A   FLP S+++M     R +  +  P    ++ Y+  ++ ++ +WNR+ GADHF ++CH
Sbjct: 191 AHALFLPLSVSQMVQLAYRPLSYDLSPLRAIVADYVAVVASRHRFWNRSAGADHFMLSCH 250


>gi|326490197|dbj|BAJ94172.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326526653|dbj|BAK00715.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 422

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 92/345 (26%), Positives = 156/345 (45%), Gaps = 51/345 (14%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISHY 86
           +A+E +  +  + S   TK+P +AD F+ P  +        +  +G P        +   
Sbjct: 83  FAAEIFMHQFLLASAVRTKNPEEADWFYTPVYVTC-----DLTQQGFPLPFRAPRIMRSA 137

Query: 87  IFNISQKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISG 141
           I  I+  +PYWNRT GADHF++A H  G       ++A E  +  +  +     ++    
Sbjct: 138 IQYIAATWPYWNRTEGADHFFLAPHDFGACFHYQEERAIERGILPLLRRATLVQTFGQRN 197

Query: 142 HIAHKDVSLPQIWPRQEDPPKLGS------SKRNKLAFFAGAV----NSP-----VREKL 186
           H+  ++ S+    P   +P K+ +      + R+  A+F G      N P      R   
Sbjct: 198 HVCMQEGSI--TIPPYANPQKMQAHLISPGTPRSIFAYFRGLFYDMGNDPEGGYYARGAR 255

Query: 187 LQVWRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIA 244
             VW N  D+ ++  S    + Y +++  + FCL   G+   + R+ +++ +GC+PVIIA
Sbjct: 256 ASVWENFKDNPLFDMSSEHPSTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 315

Query: 245 NHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL----LLQSYVLKVRKHFQW 300
           +   LPFAD + W+  S+ VA  D+P L  IL  ++ E+ L    LL S  +K    F  
Sbjct: 316 DDIVLPFADAIPWEQISVFVAEADVPRLDSILASVAPEDVLRKQRLLASPAMKQAVLFHQ 375

Query: 301 HVFPSDYDAFYMVMYEL----------WLRRSSVRVQWSTSLDSN 335
              P   DAF  V+  L          +L+     + W+  LDS+
Sbjct: 376 PARPG--DAFDQVLNGLARKLPHPDGAFLKPGQKVLDWNAGLDSD 418


>gi|357492457|ref|XP_003616517.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
 gi|355517852|gb|AES99475.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
          Length = 259

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 90/332 (27%), Positives = 141/332 (42%), Gaps = 83/332 (25%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           M ++F+V++Y    N          +         ES F  +   S ++T+ P +A LFF
Sbjct: 1   MAQNFKVFMYQPNTNTNITQFSFKTE--------QESLFYSSLQNSSYLTQHPEQAHLFF 52

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           LPFS           T  +  FIS     I   +PYWNR+ GADHFY++C  I     + 
Sbjct: 53  LPFS-------SDTSTRSLARFISR----IRNDFPYWNRSLGADHFYLSCDGISHVNDRN 101

Query: 121 AWEVKLNAIQVVCSSSYFISGH---IAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGA 177
             E+K NAIQ+ C    F + H   I HKD++LP I     +P                 
Sbjct: 102 IVELKKNAIQIAC----FPTRHRSFIPHKDITLPPI----TNP----------------- 136

Query: 178 VNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYG 237
            ++PVR                           L   +FC+ V+    +   + ++L  G
Sbjct: 137 -HAPVR---------------------------LSNEEFCV-VEYQNDDVLWLGEALRLG 167

Query: 238 CVPVIIANHY--DLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVR 295
           CVPV++      D+PF +VL W+  ++ V +        I      E    ++   +   
Sbjct: 168 CVPVVVTEEAVNDMPFMEVLRWREMAVFVKS-----GVNIETDTWRERKGNMRRLGVVGS 222

Query: 296 KHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
           KH +W+     +DAF  +MY+LWLRR ++R +
Sbjct: 223 KHLRWNRPAQPFDAFNTIMYQLWLRRHTIRYE 254


>gi|168043163|ref|XP_001774055.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674601|gb|EDQ61107.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 541

 Score =  111 bits (278), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 78/271 (28%), Positives = 129/271 (47%), Gaps = 22/271 (8%)

Query: 5   FRVYVYPHRRNDPFANVLLPVDFEPRGNYASE-----SYFKKAFMKSHFVTKDPSKADLF 59
            +++VYP +      N      F+ R +   E       F +  ++S F T+ P KA LF
Sbjct: 147 LQIWVYPSQA----GNTSYEHKFDGRKDVMEELSSTADLFSRLLLRSKFSTELPQKAQLF 202

Query: 60  FLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI-GRSAM 118
            LPFSI  +R D  +G   I D +  Y+ N+   YPYWN + GA+HFY++  +    +  
Sbjct: 203 LLPFSIDALRVD--LGPSRISDHLRRYVQNVRTSYPYWNLSLGANHFYLSSQAFENNNKH 260

Query: 119 QKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQED---PPKLGSSKRNKLAFFA 175
           +   E++ N+IQ  C+          HKD   P+  P  +        G + R  LA+F 
Sbjct: 261 RNVLELEKNSIQAACAPLRQNQNFYPHKDFIFPRYKPITQTEFYAALEGRTSRTVLAYFG 320

Query: 176 GAV-NSPVREKLLQVWRNDS--EIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIAD 232
           G + ++P    +L  W++D   E+       +     +L  SKFC++V     +T    D
Sbjct: 321 GTLADTPALVFILDAWKSDPDFEVEVDPSPHRISVYRQLARSKFCVNVPS--RDTFDFVD 378

Query: 233 SLYYGCVPVIIANH--YDLPFADVLNWKSFS 261
           ++ +GCV V+++     DLPF   L+W+ F+
Sbjct: 379 AIRFGCVLVLLSKSVFLDLPFQGFLDWRQFA 409


>gi|356546128|ref|XP_003541483.1| PREDICTED: LOW QUALITY PROTEIN: xylogalacturonan
           beta-1,3-xylosyltransferase-like [Glycine max]
          Length = 252

 Score =  111 bits (278), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 84/286 (29%), Positives = 137/286 (47%), Gaps = 50/286 (17%)

Query: 45  KSHFVTKDPSKADLFFLPFSIARMRH----DRRIGTEGIPD----FISHYIFNISQKYPY 96
           KS F    P +A LF LPFS++++       RR  ++  PD     +  YI  ++ +YP 
Sbjct: 4   KSPFKATHPEQAHLFLLPFSVSKVIRYVYKPRRSRSDYDPDRLQRLVLDYINIVANRYPN 63

Query: 97  WNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPR 156
           WNR+ GADHF V+ H      +    EV    I+ +C+++    G    +DVS+ +++  
Sbjct: 64  WNRSRGADHFLVSFHD----WLDANPEVFKYFIRALCNANTS-EGFQPSRDVSITEVY-- 116

Query: 157 QEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGS-K 215
                                         L   + D E+  H    K     +L+G  K
Sbjct: 117 ------------------------------LPSRKLDKEVQVHEYLPKGLEYTKLMGQRK 146

Query: 216 FCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKI 275
           FCL     +V + R+ +++Y GCVPVII ++Y LPF+DV+    F   +A   IP  K I
Sbjct: 147 FCL-CPSXQVASPRVVEAIYVGCVPVIICDNYSLPFSDVVKRSKF---IAVERIPETKTI 202

Query: 276 LKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
           L+ +S ++Y+ L S V +VR+HF  +     +D  +M+++ L  +R
Sbjct: 203 LQNVSKDKYMELYSNVKRVRRHFVINRPAKPFDLIHMILHSLRNKR 248


>gi|23821292|dbj|BAC20928.1| pectin-glucuronyltransferase [Nicotiana plumbaginifolia]
          Length = 341

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 86/309 (27%), Positives = 143/309 (46%), Gaps = 25/309 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E Y  +  + S   T +P +AD F+ P ++   +  +        P  +   I  I+
Sbjct: 2   FAAEIYMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIA 61

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
             +PYWNRT GADHF++  H  G       +KA E  +  +  +     ++    H+  K
Sbjct: 62  SNWPYWNRTEGADHFFITPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLK 121

Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
           D  +++P   P Q+    L    + R+   +F G      N P      R     VW N 
Sbjct: 122 DGSITIPPYAPPQKMQSHLIPPDTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENF 181

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            D+ ++  S    T Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 182 KDNPLFDISTEHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPF 241

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + VA  D+P L  IL  I  EE +L +  +L      Q  +FP      D
Sbjct: 242 ADAIPWEDIGVFVAEKDVPNLDTILTSIPPEE-ILRKQRLLANPSMKQAMLFPQPAQSGD 300

Query: 309 AFYMVMYEL 317
           AF+ ++  L
Sbjct: 301 AFHQILNGL 309


>gi|386576430|gb|AFJ12119.1| glycosyltransferase, partial [Nicotiana tabacum]
          Length = 317

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 86/309 (27%), Positives = 143/309 (46%), Gaps = 25/309 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E Y  +  + S   T +P +AD F+ P ++   +  +        P  +   I  I+
Sbjct: 2   FAAEIYMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIA 61

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
             +PYWNRT GADHF++  H  G       +KA E  +  +  +     ++    H+  K
Sbjct: 62  SNWPYWNRTEGADHFFITPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLK 121

Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
           D  +++P   P Q+    L    + R+   +F G      N P      R     VW N 
Sbjct: 122 DGSITIPPYAPPQKMQSHLIPPDTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENF 181

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            D+ ++  S    T Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 182 KDNPLFDISTEHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPF 241

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + VA  D+P L  IL  I  EE +L +  +L      Q  +FP      D
Sbjct: 242 ADAIPWEDIGVFVAEKDVPNLDTILTSIPPEE-ILRKQRLLANPSMKQAMLFPQPAQSGD 300

Query: 309 AFYMVMYEL 317
           AF+ ++  L
Sbjct: 301 AFHQILNGL 309


>gi|42573339|ref|NP_974766.1| Exostosin family protein [Arabidopsis thaliana]
 gi|332004318|gb|AED91701.1| Exostosin family protein [Arabidopsis thaliana]
          Length = 453

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 81/242 (33%), Positives = 123/242 (50%), Gaps = 30/242 (12%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPR----GNYASESYFKKAFMKSH-FVTKDPSK 55
           M ++ +VYVY             P+  +P     G YASE +F K    SH F+TKDP+K
Sbjct: 216 MEQTLKVYVYSEGDR--------PIFHQPEAIMEGIYASEGWFMKLMESSHRFLTKDPTK 267

Query: 56  ADLFFLPFSIARMR-----HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVAC 110
           A LF++PFS   ++     HD       +  ++ +YI  I+  YP WNRT G+DHF+ AC
Sbjct: 268 AHLFYIPFSSRILQQKLYVHDSH-SRNNLVKYLGNYIDLIASNYPSWNRTCGSDHFFTAC 326

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ--IWPRQEDPPKLGS--- 165
           H    +  +  +   +N I+ +C++   I   +  KDVSLP+  +   Q    K+G    
Sbjct: 327 HDWAPTETRGPY---INCIRALCNADVGID-FVVGKDVSLPETKVSSLQNPNGKIGGSRP 382

Query: 166 SKRNKLAFFAGAVNSPVREKLLQVW--RNDSEIYAHSGRLKTPYADELLGSKFCLHVKGF 223
           SKR  LAFFAG+++  VR  LL  W  R + ++   +      Y   +  S+FC+  KG+
Sbjct: 383 SKRTILAFFAGSLHGYVRPILLNQWSSRPEQDMKIFNRIDHKSYIRYMKRSRFCVCAKGY 442

Query: 224 EV 225
           E 
Sbjct: 443 EC 444


>gi|449459136|ref|XP_004147302.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase F8H-like
           [Cucumis sativus]
          Length = 447

 Score =  108 bits (270), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 88/311 (28%), Positives = 135/311 (43%), Gaps = 28/311 (9%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGI---PDFISHYIFN 89
           +ASE    KA + SH  T DPS+AD FF+P  ++        G   I   P  ++  +  
Sbjct: 123 FASEVAIHKALLTSHVRTLDPSEADFFFVPVYVS-CNFSSFNGFPAIAHAPSLLASAVDV 181

Query: 90  ISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH---- 145
           IS ++P+WNR+ G DH +VA H  G            N I     +S  +          
Sbjct: 182 ISGQFPFWNRSRGFDHVFVASHDYGACFHSLEDMAIANGIPEFLKNSIILQTFGVKYKHP 241

Query: 146 -KDVS----LPQIWPRQEDPPKLGSSKRNKLAFFAGAV------------NSPVREKLLQ 188
            +DV      P I P   +P  +   +R+  AFF G +               VR  + +
Sbjct: 242 CQDVENILIPPYISPEFMEPAVVDGRRRDIFAFFRGKMEVNPKNVGGRFYGKRVRTTIWK 301

Query: 189 VWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
            +  D   Y    R    Y  E+  S FCL   G+   + R+ +S+  GCVPVIIA+   
Sbjct: 302 KFHRDRRFYLRRHRFAG-YRSEIARSVFCLCPLGWAPWSPRLVESVALGCVPVIIADGIR 360

Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVL--KVRKHFQWHVFPSD 306
           LPF   ++W   S+ VA  D+  L+KIL+ +++     +Q  +   K R+   +H     
Sbjct: 361 LPFPSAVDWPGISLTVAEKDVGKLRKILERVAATNLTAIQKNLWDPKNRRALLFHNPTQP 420

Query: 307 YDAFYMVMYEL 317
            DA + V+  L
Sbjct: 421 QDATWQVLSAL 431


>gi|39725600|dbj|BAD04923.1| pectin-glucuronyltransferase [Nicotiana tabacum]
          Length = 317

 Score =  108 bits (269), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 80/282 (28%), Positives = 130/282 (46%), Gaps = 21/282 (7%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E Y  +  + S   T +P +AD F+ P ++   +  +        P  +   I  I+
Sbjct: 2   FAAEIYMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIA 61

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
             +PYWNRT GADHF++  H  G       +KA E  +  +  +     ++    H+  K
Sbjct: 62  SNWPYWNRTEGADHFFITPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLK 121

Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
           D  +++P   P Q+    L    + R+   +F G      N P      R     VW N 
Sbjct: 122 DGSITIPPYAPPQKMQSHLIPPDTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENF 181

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            D+  +  S    T Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 182 KDNPPFDISTEHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPF 241

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK 293
           AD + W+   + VA  D+P L  IL  I  EE L  Q  + K
Sbjct: 242 ADAIPWEDIGVFVAEKDVPTLDTILTSIPPEEILRKQRLLAK 283


>gi|150246869|emb|CAM91796.1| putative glycosyltransferase family 47 [Triticum aestivum]
          Length = 420

 Score =  107 bits (266), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 84/333 (25%), Positives = 148/333 (44%), Gaps = 35/333 (10%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P +    +           P  +   I  I+
Sbjct: 81  FAAEIFMHRFLLSSAVRTTNPEEADWFYTPVYPTCDLTPSGLPLPFKSPRMMRSAIELIA 140

Query: 92  QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS-----GHIAHK 146
            K+PYWNR+ GADHF+V  H  G     +  +     I  +   +  +       H+  K
Sbjct: 141 TKWPYWNRSEGADHFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLK 200

Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
           +  +++P   P Q+    L  G + R+   +F G      N P      R     VW N 
Sbjct: 201 EGSITIPPFAPPQKMQNHLIPGETPRSIFVYFRGLFYDTGNDPEGGYYARGARASVWENF 260

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            ++ ++  S      Y +++  S FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 261 KNNPLFDISTDHPPTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 320

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + V   D+P L  IL  I +E+ +L +  +L      Q  +FP      D
Sbjct: 321 ADAIPWEEIGVFVPEEDVPRLDSILTSIPTED-ILRKQRLLANPSMKQAMLFPQPAQAGD 379

Query: 309 AFYMVM----------YELWLRRSSVRVQWSTS 331
           AF+ ++           +++L+   VR+ W+  
Sbjct: 380 AFHQILNGLARKLPHGEDVFLKPGQVRLNWTAG 412


>gi|242059739|ref|XP_002459015.1| hypothetical protein SORBIDRAFT_03g044510 [Sorghum bicolor]
 gi|241930990|gb|EES04135.1| hypothetical protein SORBIDRAFT_03g044510 [Sorghum bicolor]
          Length = 415

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 88/340 (25%), Positives = 150/340 (44%), Gaps = 41/340 (12%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T DP +AD F+ P ++   +           P  +   I  ++
Sbjct: 76  FAAEIFMHQFLLSSAVRTLDPEEADWFYTPVYTTCDLTPQGFPLPFRAPRIMRSAIRYVA 135

Query: 92  QKYPYWNRTGGADHFYVACHSIG-------RSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
             +P+WNRT GADHF++  H  G         AM++     L    +V   ++    H+ 
Sbjct: 136 TTWPFWNRTDGADHFFLTPHDFGACFHYQEERAMERGILPLLRRATLV--QTFGQRNHVC 193

Query: 145 HKDVSLPQIWPRQEDPPKLGS------SKRNKLAFFAGAV----NSP-----VREKLLQV 189
            +D S+    P   DP K+ +      + R+   +F G      N P      R     V
Sbjct: 194 LQDGSI--TVPPYADPHKMQAHLISPGTPRSIFVYFRGLFYDMGNDPEGGYYARGARASV 251

Query: 190 WRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHY 247
           W N  D+ ++  S      Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+  
Sbjct: 252 WENFKDNPLFDISTEHPYTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 311

Query: 248 DLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK--VRKHFQWHVFPS 305
            LPFAD + W+  S+ VA  D+P L  IL  I   + L  Q  + +  V++   +H    
Sbjct: 312 VLPFADAIPWEDISVFVAERDVPRLDSILTSIPLADILRRQRLLARESVKQALLFHQPAR 371

Query: 306 DYDAFYMVMYEL----------WLRRSSVRVQWSTSLDSN 335
             DAF+ V+  L          +L+     + W   LDS+
Sbjct: 372 TGDAFHQVLNGLARKLPHGDGVFLQPGEKVLDWDAGLDSD 411


>gi|386576428|gb|AFJ12118.1| glycosyltransferase, partial [Nicotiana tabacum]
          Length = 294

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 79/276 (28%), Positives = 128/276 (46%), Gaps = 21/276 (7%)

Query: 34  ASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNISQ 92
           A+E Y  +  + S   T +P +AD F+ P ++   +  +        P  +   I  I+ 
Sbjct: 1   AAEIYMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIAS 60

Query: 93  KYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHKD 147
            +PYWNRT GADHF++  H  G       +KA E  +  +  +     ++    H+  KD
Sbjct: 61  NWPYWNRTEGADHFFITPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLKD 120

Query: 148 --VSLPQIWPRQEDPPKL--GSSKRNKLAFFAGAV----NSP-----VREKLLQVWRN-- 192
             +++P   P Q+    L    + R+   +F G      N P      R     VW N  
Sbjct: 121 GSITIPPYAPPQKMQSHLIPPDTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENFK 180

Query: 193 DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFA 252
           D+ ++  S    T Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   LPFA
Sbjct: 181 DNPLFDISTEHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPFA 240

Query: 253 DVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
           D + W+   + VA  D+P L  IL  I  EE L  Q
Sbjct: 241 DAIPWEDIGVFVAEKDVPNLDTILTSIPPEEILRKQ 276


>gi|326488903|dbj|BAJ98063.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 421

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 85/333 (25%), Positives = 146/333 (43%), Gaps = 35/333 (10%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P +    +           P  +   I  I+
Sbjct: 82  FAAEIFMHRFLLSSAVRTTNPEEADWFYTPVYPTCDLTPSGLPLPFKSPRMMRSAIELIA 141

Query: 92  QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS-----GHIAHK 146
            K+PYWNR+ GADHF+V  H  G     +  +     I  +   +  +       H+  K
Sbjct: 142 TKWPYWNRSEGADHFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLK 201

Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
           D  +++P   P Q+    L  G + R+   +F G      N P      R     VW N 
Sbjct: 202 DGSITIPPFAPPQKMQNHLIPGETPRSIFVYFRGLFYDTGNDPEGGYYARGARASVWENF 261

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            ++ ++  S      Y +++  S FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 262 KNNPLFDISTDHPPTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 321

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + V   D+P L  IL  I +E+ +L +  +L      Q  +FP      D
Sbjct: 322 ADAIPWEEIGVFVPEEDVPRLDSILTSIPTED-ILRKQRLLANPSMKQAMLFPQPAQAGD 380

Query: 309 AFYMVMYEL----------WLRRSSVRVQWSTS 331
           AF+ ++  L          +L+    R+ W+  
Sbjct: 381 AFHQILNGLARKLPHGESVFLKPGQARLNWTAG 413


>gi|414878904|tpg|DAA56035.1| TPA: hypothetical protein ZEAMMB73_301237 [Zea mays]
          Length = 360

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 87/340 (25%), Positives = 150/340 (44%), Gaps = 41/340 (12%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T DP +AD F+ P ++   +           P  +   I  ++
Sbjct: 21  FAAEIFMHQFLLSSAVRTLDPEEADWFYTPAYTTCDLTPQGFPLPFRAPRIMRSAIRYVA 80

Query: 92  QKYPYWNRTGGADHFYVACHSIG-------RSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
             +P+WNRT GADHF++  H  G         AM++     L    +V   ++    H+ 
Sbjct: 81  TTWPFWNRTDGADHFFLTPHDFGACFHYQEERAMERGILPLLRRATLV--QTFGQRNHVC 138

Query: 145 HKDVSLPQIWPRQEDPPKLGS------SKRNKLAFFAGAV----NSP-----VREKLLQV 189
            +D S+    P   DP K+ +      + R+   +F G      N P      R     V
Sbjct: 139 LQDGSI--TVPPYADPGKMQAHLISPGTPRSIFVYFRGLFYDMGNDPEGGYYARGARASV 196

Query: 190 WRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHY 247
           W N  D+ ++  S    + Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+  
Sbjct: 197 WENFKDNPLFDISTEHPSTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 256

Query: 248 DLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK--VRKHFQWHVFPS 305
            LPFAD + W+  S+ VA  D+P L  IL  I   + L  Q  + +  V++   +H    
Sbjct: 257 VLPFADAIPWEDISVFVAERDVPRLDSILTSIPLPDILRRQRLLARDSVKQALLFHQPAR 316

Query: 306 DYDAFYMVMYEL----------WLRRSSVRVQWSTSLDSN 335
             DAF+ V+  L          +L+     + W   L S+
Sbjct: 317 PGDAFHQVLNGLARKLPHGEGVFLQPGEKALDWDAGLQSD 356


>gi|414878902|tpg|DAA56033.1| TPA: hypothetical protein ZEAMMB73_301237 [Zea mays]
          Length = 387

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 87/340 (25%), Positives = 150/340 (44%), Gaps = 41/340 (12%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T DP +AD F+ P ++   +           P  +   I  ++
Sbjct: 48  FAAEIFMHQFLLSSAVRTLDPEEADWFYTPAYTTCDLTPQGFPLPFRAPRIMRSAIRYVA 107

Query: 92  QKYPYWNRTGGADHFYVACHSIG-------RSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
             +P+WNRT GADHF++  H  G         AM++     L    +V   ++    H+ 
Sbjct: 108 TTWPFWNRTDGADHFFLTPHDFGACFHYQEERAMERGILPLLRRATLV--QTFGQRNHVC 165

Query: 145 HKDVSLPQIWPRQEDPPKLGS------SKRNKLAFFAGAV----NSP-----VREKLLQV 189
            +D S+    P   DP K+ +      + R+   +F G      N P      R     V
Sbjct: 166 LQDGSI--TVPPYADPGKMQAHLISPGTPRSIFVYFRGLFYDMGNDPEGGYYARGARASV 223

Query: 190 WRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHY 247
           W N  D+ ++  S    + Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+  
Sbjct: 224 WENFKDNPLFDISTEHPSTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 283

Query: 248 DLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK--VRKHFQWHVFPS 305
            LPFAD + W+  S+ VA  D+P L  IL  I   + L  Q  + +  V++   +H    
Sbjct: 284 VLPFADAIPWEDISVFVAERDVPRLDSILTSIPLPDILRRQRLLARDSVKQALLFHQPAR 343

Query: 306 DYDAFYMVMYEL----------WLRRSSVRVQWSTSLDSN 335
             DAF+ V+  L          +L+     + W   L S+
Sbjct: 344 PGDAFHQVLNGLARKLPHGEGVFLQPGEKALDWDAGLQSD 383


>gi|242090769|ref|XP_002441217.1| hypothetical protein SORBIDRAFT_09g022500 [Sorghum bicolor]
 gi|241946502|gb|EES19647.1| hypothetical protein SORBIDRAFT_09g022500 [Sorghum bicolor]
          Length = 422

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 93/351 (26%), Positives = 150/351 (42%), Gaps = 48/351 (13%)

Query: 24  PVDFEPRG---NYASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGI 79
           PV+ +PR     +A+E +  ++ + S   T DP +AD F+ P ++   +           
Sbjct: 69  PVEKDPRCLTHMFATEIFVHRSLLSSAVRTLDPEEADWFYTPVYTTCDLTASGHPMPFDS 128

Query: 80  PDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFI 139
           P  +   I  I+ ++PYWNR+ GADHF+V  H  G     +  +     I  V   +  +
Sbjct: 129 PRMMRSAIRLIADRWPYWNRSEGADHFFVTPHDFGACFHFQEEKAMARGILPVLRRATLV 188

Query: 140 S-----GHIAHKD----VSLPQIWP-----RQEDPPKLGSSKRNKLAFFAG----AVNSP 181
                  H+  KD    +++P   P      Q  PP   ++ R+   +F G    A N P
Sbjct: 189 QTFGQRNHVCLKDGGGSITIPPYAPPWKMEAQLLPP---ATPRSIFVYFRGLFYDAGNDP 245

Query: 182 -----VREKLLQVWRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSL 234
                 R     VW N   + ++  S    T Y  ++  + FCL   G+   + R+ +++
Sbjct: 246 EGGYYARGARASVWENFKSNPLFDISTAHPTTYYQDMQRAVFCLCPLGWAPWSPRLVEAV 305

Query: 235 YYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL----LLQSY 290
            +GC+PV+IA+   LPFAD + W    + VA  D+P L  IL  I  E  L    LL S 
Sbjct: 306 VFGCIPVVIADDIVLPFADAIPWADIGVFVAEDDVPRLDTILTSIPVEVVLRKQRLLASP 365

Query: 291 VLKVRKHFQWHVFPSDYDAFYMVMYEL----------WLRRSSVRVQWSTS 331
            +K    F     P   DAF+ ++  L          +LR     + W+  
Sbjct: 366 AMKRAVLFPQPAQPG--DAFHQILNGLARKLPHGDDAFLRNGQTVLNWTAG 414


>gi|414878903|tpg|DAA56034.1| TPA: hypothetical protein ZEAMMB73_301237 [Zea mays]
          Length = 393

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 87/340 (25%), Positives = 150/340 (44%), Gaps = 41/340 (12%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T DP +AD F+ P ++   +           P  +   I  ++
Sbjct: 54  FAAEIFMHQFLLSSAVRTLDPEEADWFYTPAYTTCDLTPQGFPLPFRAPRIMRSAIRYVA 113

Query: 92  QKYPYWNRTGGADHFYVACHSIG-------RSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
             +P+WNRT GADHF++  H  G         AM++     L    +V   ++    H+ 
Sbjct: 114 TTWPFWNRTDGADHFFLTPHDFGACFHYQEERAMERGILPLLRRATLV--QTFGQRNHVC 171

Query: 145 HKDVSLPQIWPRQEDPPKLGS------SKRNKLAFFAGAV----NSP-----VREKLLQV 189
            +D S+    P   DP K+ +      + R+   +F G      N P      R     V
Sbjct: 172 LQDGSI--TVPPYADPGKMQAHLISPGTPRSIFVYFRGLFYDMGNDPEGGYYARGARASV 229

Query: 190 WRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHY 247
           W N  D+ ++  S    + Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+  
Sbjct: 230 WENFKDNPLFDISTEHPSTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 289

Query: 248 DLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK--VRKHFQWHVFPS 305
            LPFAD + W+  S+ VA  D+P L  IL  I   + L  Q  + +  V++   +H    
Sbjct: 290 VLPFADAIPWEDISVFVAERDVPRLDSILTSIPLPDILRRQRLLARDSVKQALLFHQPAR 349

Query: 306 DYDAFYMVMYEL----------WLRRSSVRVQWSTSLDSN 335
             DAF+ V+  L          +L+     + W   L S+
Sbjct: 350 PGDAFHQVLNGLARKLPHGEGVFLQPGEKALDWDAGLQSD 389


>gi|414878901|tpg|DAA56032.1| TPA: secondary cell wall glycosyltransferase family 47 [Zea mays]
          Length = 417

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 87/340 (25%), Positives = 150/340 (44%), Gaps = 41/340 (12%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T DP +AD F+ P ++   +           P  +   I  ++
Sbjct: 78  FAAEIFMHQFLLSSAVRTLDPEEADWFYTPAYTTCDLTPQGFPLPFRAPRIMRSAIRYVA 137

Query: 92  QKYPYWNRTGGADHFYVACHSIG-------RSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
             +P+WNRT GADHF++  H  G         AM++     L    +V   ++    H+ 
Sbjct: 138 TTWPFWNRTDGADHFFLTPHDFGACFHYQEERAMERGILPLLRRATLV--QTFGQRNHVC 195

Query: 145 HKDVSLPQIWPRQEDPPKLGS------SKRNKLAFFAGAV----NSP-----VREKLLQV 189
            +D S+    P   DP K+ +      + R+   +F G      N P      R     V
Sbjct: 196 LQDGSI--TVPPYADPGKMQAHLISPGTPRSIFVYFRGLFYDMGNDPEGGYYARGARASV 253

Query: 190 WRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHY 247
           W N  D+ ++  S    + Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+  
Sbjct: 254 WENFKDNPLFDISTEHPSTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 313

Query: 248 DLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK--VRKHFQWHVFPS 305
            LPFAD + W+  S+ VA  D+P L  IL  I   + L  Q  + +  V++   +H    
Sbjct: 314 VLPFADAIPWEDISVFVAERDVPRLDSILTSIPLPDILRRQRLLARDSVKQALLFHQPAR 373

Query: 306 DYDAFYMVMYEL----------WLRRSSVRVQWSTSLDSN 335
             DAF+ V+  L          +L+     + W   L S+
Sbjct: 374 PGDAFHQVLNGLARKLPHGEGVFLQPGEKALDWDAGLQSD 413


>gi|168000965|ref|XP_001753186.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162695885|gb|EDQ82227.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 449

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 86/313 (27%), Positives = 141/313 (45%), Gaps = 33/313 (10%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A E Y      +S   T +P +AD F+ P ++   +  +        P  +   I  IS
Sbjct: 106 FAVEEYLHDFLTESPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRVMRSAISYIS 165

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
             +PYWNRT GADHF+V  H          +KA E  +  +  +     ++  + H+  K
Sbjct: 166 SHWPYWNRTDGADHFFVVPHDFAACFHYQEEKAIERGILPLLKRATLIQTFGQNHHVCLK 225

Query: 147 DVSL-------PQIWPRQEDPPKLGSSKRNKLAFFAGAVNSP---------VREKLLQVW 190
           + S+       P+    + +PP   S+ R+  A+F G    P          R     +W
Sbjct: 226 EDSIVIPPYAPPERMQTRLNPP---STPRSIFAYFRGLFYDPGNDPEGGYYARGARAAIW 282

Query: 191 RN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
            N  D+ ++  S      Y +++  + FCL   G+   + R+ + + +GC+PVIIA+   
Sbjct: 283 ENFKDNPLFDISTEHPATYYEDMQRAIFCLCPLGWAPWSPRLVEGVIFGCIPVIIADDIV 342

Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL----LLQSYVLKVRKHFQWHVFP 304
           LPFAD + W+   + V   D+P+L KIL  I+ EE L    LL +  +K    F     P
Sbjct: 343 LPFADAIPWEKIGVFVEEKDVPILDKILCTINHEEVLEKQRLLANPAMKQAMLFPRPAKP 402

Query: 305 SDYDAFYMVMYEL 317
              DAF+ ++  L
Sbjct: 403 G--DAFHQILNGL 413


>gi|195648072|gb|ACG43504.1| secondary cell wall-related glycosyltransferase family 47 [Zea
           mays]
          Length = 420

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 87/340 (25%), Positives = 150/340 (44%), Gaps = 41/340 (12%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T DP +AD F+ P ++   +           P  +   I  ++
Sbjct: 81  FAAEIFMHQFLLSSAVRTLDPEEADWFYTPAYTTCDLTPQGFPLPFRAPRIMRSAIRYVA 140

Query: 92  QKYPYWNRTGGADHFYVACHSIG-------RSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
             +P+WNRT GADHF++  H  G         AM++     L    +V   ++    H+ 
Sbjct: 141 TTWPFWNRTDGADHFFLTPHDFGACFHYQEERAMERGILPLLRRATLV--QTFGQRNHVC 198

Query: 145 HKDVSLPQIWPRQEDPPKLGS------SKRNKLAFFAGAV----NSP-----VREKLLQV 189
            +D S+    P   DP K+ +      + R+   +F G      N P      R     V
Sbjct: 199 LQDGSI--TVPPYADPRKMQAHLISPGTPRSIFVYFRGLFYDMGNDPEGGYYARGARASV 256

Query: 190 WRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHY 247
           W N  D+ ++  S    + Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+  
Sbjct: 257 WENFKDNPLFDISTEHPSTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 316

Query: 248 DLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK--VRKHFQWHVFPS 305
            LPFAD + W+  S+ VA  D+P L  IL  I   + L  Q  + +  V++   +H    
Sbjct: 317 VLPFADAIPWEDISVFVAERDVPRLDSILTSIPLPDILRRQRLLARDSVKQALLFHQPAR 376

Query: 306 DYDAFYMVMYEL----------WLRRSSVRVQWSTSLDSN 335
             DAF+ V+  L          +L+     + W   L S+
Sbjct: 377 PGDAFHQVLNGLARKLPHGEGVFLQPGEKALDWDAGLQSD 416


>gi|10176877|dbj|BAB10084.1| unnamed protein product [Arabidopsis thaliana]
 gi|23821294|dbj|BAC20929.1| NpGUT1 homolog [Arabidopsis thaliana]
          Length = 341

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 84/309 (27%), Positives = 140/309 (45%), Gaps = 25/309 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E Y ++  + S   T +P +AD F++P ++   +  +        P  +   I  I+
Sbjct: 2   FAAEIYMQRFLLSSPVRTLNPEEADWFYVPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIA 61

Query: 92  QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS-----GHIAHK 146
             +PYWNRT GADHF+V  H  G     +  +     I  +   +  +       H+  K
Sbjct: 62  SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQRNHVCLK 121

Query: 147 D--VSLPQIWPRQEDPPKLGSSK--RNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
           +  +++P   P Q+    L   K  R+   +F G      N P      R     VW N 
Sbjct: 122 EGSITVPPYAPPQKMQSHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENF 181

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            D+ ++  S    T Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 182 KDNPLFDISTEHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPF 241

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + V   D+P L  IL  I   E +L +  +L      Q  +FP      D
Sbjct: 242 ADAIPWEDIGVFVDEKDVPYLDTILTSIPP-EVILRKQRLLANPSMKQAMLFPQPAQPGD 300

Query: 309 AFYMVMYEL 317
           AF+ V+  L
Sbjct: 301 AFHQVLNGL 309


>gi|357126594|ref|XP_003564972.1| PREDICTED: probable glucuronosyltransferase Os01g0926400-like
           [Brachypodium distachyon]
          Length = 423

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 84/312 (26%), Positives = 141/312 (45%), Gaps = 31/312 (9%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T DP +AD F+ P ++   +           P  +   +  I+
Sbjct: 84  FAAEIFMHQFLLASAVRTMDPEEADWFYTPVYTTCDLTQQGFPLPFRAPRMMRSAVQYIA 143

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
             +PYWNRT GADHF++A H  G       ++A E  +  +  +     ++    H+  +
Sbjct: 144 ATWPYWNRTEGADHFFLAPHDFGACFHYQEERAIERGILPVLRRATLVQTFGQRNHVCMQ 203

Query: 147 DVSLPQIWPRQEDPPKLGS------SKRNKLAFFAGAV----NSP-----VREKLLQVWR 191
           + S+    P   +P K+ +      + R+  A+F G      N P      R     VW 
Sbjct: 204 EGSI--TIPPYANPQKMQAHLISPGTPRSIFAYFRGLFYDMGNDPEGGYYARGARASVWE 261

Query: 192 N--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
           N  D+ ++  S    + Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   L
Sbjct: 262 NFKDNPLFDMSTEHPSTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 321

Query: 250 PFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL----LLQSYVLKVRKHFQWHVFPS 305
           PFAD + W   S+ V   D+P L  IL  +  E+ L    LL S  +K    F     P 
Sbjct: 322 PFADAIPWDQISVFVPEADVPRLDSILASVPPEDVLRKQRLLASPAMKQAVLFHQPAQPR 381

Query: 306 DYDAFYMVMYEL 317
             DAF  V+  L
Sbjct: 382 --DAFDQVLNGL 391


>gi|297797147|ref|XP_002866458.1| hypothetical protein ARALYDRAFT_496352 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297312293|gb|EFH42717.1| hypothetical protein ARALYDRAFT_496352 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 417

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 84/309 (27%), Positives = 140/309 (45%), Gaps = 25/309 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E Y ++  + S   T +P +AD F++P ++   +  +        P  +   I  I+
Sbjct: 78  FAAEIYMQRFLLSSPVRTLNPEEADWFYVPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIA 137

Query: 92  QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS-----GHIAHK 146
             +PYWNRT GADHF+V  H  G     +  +     I  +   +  +       H+  K
Sbjct: 138 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQRNHVCLK 197

Query: 147 D--VSLPQIWPRQEDPPKLGSSK--RNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
           +  +++P   P Q+    L   K  R+   +F G      N P      R     VW N 
Sbjct: 198 EGSITVPPYAPPQKMQSHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENF 257

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            D+ ++  S    T Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 258 KDNPLFDISTEHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPF 317

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + V   D+P L  IL  I   E +L +  +L      Q  +FP      D
Sbjct: 318 ADAIPWEDIGVFVDEKDVPYLDTILTSIPP-EVILRKQRLLANPSMKQAMLFPQPAQPGD 376

Query: 309 AFYMVMYEL 317
           AF+ V+  L
Sbjct: 377 AFHQVLNGL 385


>gi|317106607|dbj|BAJ53114.1| JHL07K02.4 [Jatropha curcas]
          Length = 417

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 88/310 (28%), Positives = 141/310 (45%), Gaps = 27/310 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +  +        P  +   I  IS
Sbjct: 78  FAAEIFMHRFLLSSPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIS 137

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
             +PYWNRT GADHF+V  H  G       +KA E  +  +  +     ++    H+  K
Sbjct: 138 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLK 197

Query: 147 D--VSLPQIWPRQEDPPKLGSSK--RNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
           D  +++P   P Q+    L   K  R+   +F G      N P      R     VW N 
Sbjct: 198 DGSITIPPYAPPQKMQAHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENF 257

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            D+ ++  S    + Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 258 KDNPLFDISTEHPSTYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPF 317

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL----LLQSYVLKVRKHFQWHVFPSDY 307
           AD + W+   + V   D+P L  IL  I  E  L    LL +  +K    F   V P   
Sbjct: 318 ADAIPWEQIGVFVDEKDVPNLDAILTSIPPEVILRKQRLLANPSMKQAMLFPQPVQPG-- 375

Query: 308 DAFYMVMYEL 317
           DAF+ V+  L
Sbjct: 376 DAFHQVLNGL 385


>gi|301072486|gb|ADK56173.1| glycosyltransferase 47 [Triticum aestivum]
          Length = 422

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 83/333 (24%), Positives = 147/333 (44%), Gaps = 35/333 (10%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P +    +           P  +   I  I+
Sbjct: 83  FAAEIFMHRFLLSSAVRTTNPEEADWFYTPVYPTCDLTPSGLPLPFKSPRMMRSAIELIA 142

Query: 92  QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS-----GHIAHK 146
            K+PYWNR+ GADHF+V  H  G     +  +     I  +   +  +       H+  K
Sbjct: 143 TKWPYWNRSEGADHFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLK 202

Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
           +  +++P   P Q+    L  G + R+   +F G      N P      R     VW N 
Sbjct: 203 EGSITIPPFAPPQKMQNHLIPGETPRSIFVYFRGLFYDTGNDPEGGYYARGARASVWENF 262

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            ++ ++  S      Y +++  S FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 263 KNNPLFDISTDHPPTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 322

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + V   D+P L  IL  I +E+ +L +  +L      Q  +FP      D
Sbjct: 323 ADAIPWEEIGVFVPEEDVPRLDSILTSIPTED-ILRKQRLLANPSMKQAMLFPQPAQAGD 381

Query: 309 AFYMVM----------YELWLRRSSVRVQWSTS 331
           AF+ ++           +++L+    R+ W+  
Sbjct: 382 AFHQILNGLARKLPHGEDVFLKPGQARLNWTAG 414


>gi|18424516|ref|NP_568941.1| Exostosin family protein [Arabidopsis thaliana]
 gi|75163931|sp|Q940Q8.1|IX10L_ARATH RecName: Full=Probable beta-1,4-xylosyltransferase IRX10L; AltName:
           Full=Glucuronoxylan glucuronosyltransferase 1;
           Short=AtGUT1; AltName: Full=Glucuronoxylan
           glucuronosyltransferase 2; Short=AtGUT2; AltName:
           Full=Protein IRREGULAR XYLEM 10-like; AltName:
           Full=Xylan xylosyltransferase IRX10L
 gi|15809826|gb|AAL06841.1| AT5g61840/mac9_140 [Arabidopsis thaliana]
 gi|17978869|gb|AAL47406.1| AT5g61840/mac9_140 [Arabidopsis thaliana]
 gi|332010141|gb|AED97524.1| Exostosin family protein [Arabidopsis thaliana]
          Length = 415

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 84/309 (27%), Positives = 140/309 (45%), Gaps = 25/309 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E Y ++  + S   T +P +AD F++P ++   +  +        P  +   I  I+
Sbjct: 76  FAAEIYMQRFLLSSPVRTLNPEEADWFYVPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIA 135

Query: 92  QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS-----GHIAHK 146
             +PYWNRT GADHF+V  H  G     +  +     I  +   +  +       H+  K
Sbjct: 136 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQRNHVCLK 195

Query: 147 D--VSLPQIWPRQEDPPKLGSSK--RNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
           +  +++P   P Q+    L   K  R+   +F G      N P      R     VW N 
Sbjct: 196 EGSITVPPYAPPQKMQSHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENF 255

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            D+ ++  S    T Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 256 KDNPLFDISTEHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPF 315

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + V   D+P L  IL  I   E +L +  +L      Q  +FP      D
Sbjct: 316 ADAIPWEDIGVFVDEKDVPYLDTILTSIPP-EVILRKQRLLANPSMKQAMLFPQPAQPGD 374

Query: 309 AFYMVMYEL 317
           AF+ V+  L
Sbjct: 375 AFHQVLNGL 383


>gi|357126610|ref|XP_003564980.1| PREDICTED: probable glucuronosyltransferase Os01g0926700-like
           [Brachypodium distachyon]
          Length = 418

 Score =  104 bits (260), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 82/309 (26%), Positives = 142/309 (45%), Gaps = 25/309 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +           P  +   I  I+
Sbjct: 79  FAAEIFMHRFLLSSAVRTFNPEEADWFYTPVYATCDLTPSGLPLPFKSPRMVRSAIQLIA 138

Query: 92  QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS-----GHIAHK 146
           +K+PYWNR+ GADHF+VA H  G     +  +     I  +   +  +       H+  K
Sbjct: 139 EKWPYWNRSEGADHFFVAPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLK 198

Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
           D  +++P   P Q+    L  G + R+   +F G      N P      R     VW N 
Sbjct: 199 DGSITIPPYAPPQKMQNHLIPGDTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWENF 258

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            ++ ++  S      Y +++  S FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 259 KNNPLFDISTDHPPTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 318

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + V+  D+P L  IL  I + + +L +  +L      Q  +FP      D
Sbjct: 319 ADAIPWEDIGVFVSEDDVPKLDSILTSIPT-DVILRKQRLLANPSMKQAMLFPQPAEAGD 377

Query: 309 AFYMVMYEL 317
           AF+ ++  L
Sbjct: 378 AFHQILNGL 386


>gi|34394612|dbj|BAC83914.1| limonene cyclase like protein [Oryza sativa Japonica Group]
 gi|50508943|dbj|BAD31847.1| limonene cyclase like protein [Oryza sativa Japonica Group]
          Length = 332

 Score =  104 bits (260), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 53/122 (43%), Positives = 73/122 (59%), Gaps = 1/122 (0%)

Query: 206 PYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVA 265
           P A      K  L   G     A I +++Y  CVPV+I + Y LPFADVLNW +FS+ VA
Sbjct: 206 PQAQRRRDVKLELLELGLAKARATIREAIYLECVPVVIGDDYTLPFADVLNWAAFSVRVA 265

Query: 266 TLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH-VFPSDYDAFYMVMYELWLRRSSV 324
             DIP LK+IL  +S  +Y+ +Q  V  VR+HF      P  +D F+M+++ +WLRR +V
Sbjct: 266 VGDIPRLKEILAAVSPRQYIRMQRRVRAVRRHFMVSDGAPRRFDVFHMILHSIWLRRLNV 325

Query: 325 RV 326
           RV
Sbjct: 326 RV 327


>gi|356508876|ref|XP_003523179.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
           [Glycine max]
          Length = 416

 Score =  104 bits (260), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 86/309 (27%), Positives = 141/309 (45%), Gaps = 25/309 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +  +        P  +   I  IS
Sbjct: 77  FAAEIFMHRFLLSSPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIS 136

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
             +PYWNRT GADHF+V  H  G       +KA E  +  +  +     ++    H+  K
Sbjct: 137 SNWPYWNRTEGADHFFVTPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLK 196

Query: 147 D--VSLPQIWPRQEDPPKLGSSK--RNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
           +  +++P   P Q+    L   K  R+   +F G      N P      R     VW N 
Sbjct: 197 EGSITIPPYAPPQKMHTHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENF 256

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            D+ ++  S    T Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 257 KDNPLFDISTEHPTTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 316

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + V   D+P L  IL  I   E +L +  +L      Q  +FP      D
Sbjct: 317 ADAIPWEEIGVFVDEKDVPQLDTILTSIPP-EVILRKQRLLANPSMKQAMLFPQPAQPGD 375

Query: 309 AFYMVMYEL 317
           AF+ V+  L
Sbjct: 376 AFHQVLNGL 384


>gi|356516468|ref|XP_003526916.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
           [Glycine max]
          Length = 416

 Score =  104 bits (260), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 86/309 (27%), Positives = 141/309 (45%), Gaps = 25/309 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +  +        P  +   I  IS
Sbjct: 77  FAAEIFMHRFLLSSPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIS 136

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
             +PYWNRT GADHF+V  H  G       +KA E  +  +  +     ++    H+  K
Sbjct: 137 SNWPYWNRTEGADHFFVTPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLK 196

Query: 147 D--VSLPQIWPRQEDPPKLGSSK--RNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
           +  +++P   P Q+    L   K  R+   +F G      N P      R     VW N 
Sbjct: 197 EGSITIPPYAPPQKMHTHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENF 256

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            D+ ++  S    T Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 257 KDNPLFDISTEHPTTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 316

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + V   D+P L  IL  I   E +L +  +L      Q  +FP      D
Sbjct: 317 ADAIPWEEIGVFVDEKDVPQLDTILTSIPP-EVILRKQRLLANPSMKQAMLFPQPAQPGD 375

Query: 309 AFYMVMYEL 317
           AF+ V+  L
Sbjct: 376 AFHQVLNGL 384


>gi|255540029|ref|XP_002511079.1| catalytic, putative [Ricinus communis]
 gi|223550194|gb|EEF51681.1| catalytic, putative [Ricinus communis]
          Length = 417

 Score =  104 bits (260), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 86/309 (27%), Positives = 141/309 (45%), Gaps = 25/309 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +  +        P  +   I  IS
Sbjct: 78  FAAEIFMHRFLLSSPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIS 137

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
             +PYWNRT GADHF+V  H  G       +KA E  +  +  +     ++    H+  K
Sbjct: 138 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPMLQRATLVQTFGQRNHVCLK 197

Query: 147 D--VSLPQIWPRQEDPPKLGSSK--RNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
           +  +++P   P Q+    L   K  R+   +F G      N P      R     VW N 
Sbjct: 198 EGSITIPPYAPPQKMQAHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENF 257

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            D+ ++  S    T Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 258 KDNPLFDISTEHPTTYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPF 317

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + V   D+P L  IL  I   E +L +  +L      Q  +FP      D
Sbjct: 318 ADAIPWEEIGVFVDEKDVPNLDTILTSIPP-EVILRKQRLLANPSMKQAMLFPQPAQAGD 376

Query: 309 AFYMVMYEL 317
           AF+ V+  L
Sbjct: 377 AFHQVLNGL 385


>gi|359473830|ref|XP_003631364.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like [Vitis
           vinifera]
          Length = 412

 Score =  103 bits (258), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 84/309 (27%), Positives = 141/309 (45%), Gaps = 25/309 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +  +        P  +   I  IS
Sbjct: 73  FAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYTTCDLTPNGLPLPFKSPRMMRSAIQLIS 132

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
             +PYWNRT GADHF+V  H  G       +KA E  +  +  +     ++    H+   
Sbjct: 133 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLN 192

Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
           +  +++P   P Q+    L    + R+   +F G      N P      R     VW N 
Sbjct: 193 EGSITIPPYAPPQKMQAHLIPQETPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENF 252

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            D+ ++  S    T Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 253 KDNPLFDISTEHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPF 312

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + VA  D+P L  IL  I   E +L +  +L      Q  +FP      D
Sbjct: 313 ADAIPWEEIGVFVAEEDVPNLDTILTSIPP-EVILRKQRLLANPSMKQAMLFPQPAQSGD 371

Query: 309 AFYMVMYEL 317
           AF+ ++  L
Sbjct: 372 AFHQILNGL 380


>gi|412993472|emb|CCO13983.1| predicted protein [Bathycoccus prasinos]
          Length = 614

 Score =  103 bits (258), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 93/356 (26%), Positives = 152/356 (42%), Gaps = 35/356 (9%)

Query: 5   FRVYVYPHRRNDPFANVLLPVDFEPR---GNYASESYFKKAFMKSHFVTKDPSKADLFFL 61
           F+++VY  +   P  N  L  D +PR     Y +E  F +  +    +T DP +A+ FF+
Sbjct: 260 FKIFVYDLK---PEFNADLARD-QPRCRTDQYGTEIRFHENLLHHSVLTNDPEEAEFFFV 315

Query: 62  PF--SIARMRHDRRIGTEGIPDFISHY----IFNISQKYPYWNRTGGADHFYVACHSIGR 115
           P        R  +  GT       + +    +  I  +YPYWNRT G DH +    + G 
Sbjct: 316 PIYGECYLFRETQNSGTNNAMKVTNLWYRDALKTIQTEYPYWNRTDGRDHVWSFPGARG- 374

Query: 116 SAMQKAWE--VKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQE----DPPKLGSSKRN 169
             + + W+  +K +          F       KD+ +P + P  E       K  S K++
Sbjct: 375 PHIFRDWKKLIKKSIFLTPEGDRSFGEQFNTWKDIVIPGLEPDSEFIDGKLRKQSSLKKD 434

Query: 170 KLAFFAGAV---------NSPVREKLLQVWRNDSEIYAHS---GRLKTPYADELLGSKFC 217
             AFF G +         +  +R K+   ++   ++          +  Y  EL  S FC
Sbjct: 435 IFAFFRGTILNKAGILAYSRGIRPKMEAAFKKHKDVIFTEEIPSCDRDCYRKELRKSTFC 494

Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILK 277
           L  +G+   T R   ++  GC+PVIIA+  +LP+ + L+W   S+ +A +D      ILK
Sbjct: 495 LCPRGWSPWTLRAYQAMMVGCIPVIIADEIELPYENSLDWTKLSVKIAEVDAEKTIDILK 554

Query: 278 GISSEEYLLLQSYVLKVRKHFQWHVFPSD---YDAFYMVMYELWLRRSSVRVQWST 330
            IS  E    Q  + KV K   W   P      DA   V++EL  ++ +++    T
Sbjct: 555 QISKSEIRNKQKAIEKVWKSVAWGSNPKKLDPMDAMECVLHELGRKKRAMKASTQT 610


>gi|412986347|emb|CCO14773.1| predicted protein [Bathycoccus prasinos]
          Length = 445

 Score =  103 bits (258), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 96/320 (30%), Positives = 146/320 (45%), Gaps = 32/320 (10%)

Query: 33  YASESYFKKAFMKS-HFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           Y S+ +     +KS    T++PS AD FFLP     M      G     D ++  +  + 
Sbjct: 118 YGSQVHIADFLLKSKELRTENPSDADFFFLPGWPKCMLDAPPNGAGLTDDELAKRLNGVI 177

Query: 92  QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH------ 145
           +K PY  ++GG DH +V     G + + K W  K+     +    ++   +         
Sbjct: 178 EKLPYIKKSGGRDHVFVWPSGRGPT-LYKNWRCKIPNSIFLTPEGFYTDPYRTLAPYFDP 236

Query: 146 -KDVSLPQIWPRQED---PPKLGSSKRNKLAFFAGAV-------------NSPVREKLLQ 188
            KDV LP     ++D        +SKR KLA FAG V              +  RE+LL+
Sbjct: 237 WKDVVLPGFMDGRKDSYLETNKRTSKRTKLASFAGTVPDGQALKGDEKHVKAHPRERLLK 296

Query: 189 VWRN-DSEIYAHSGRLKTPYADELLG-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANH 246
           + +    ++ A SGR  TP   E+LG SKFC+  +G    T R  ++ + GCVPVII++ 
Sbjct: 297 LSKKYPDDLLAISGR--TPKYAEILGDSKFCIVPRGLSPWTLRTYETFFAGCVPVIISDS 354

Query: 247 YDLPFADVLNWKSFSIV--VATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFP 304
             LPF + L+W   SI    A +D  LL   LK I  EE   +     +VR  F +    
Sbjct: 355 VRLPFQEFLDWSLISIKWPEAKIDESLL-TYLKSIPDEEIEKIVRRGEQVRCVFAYQADA 413

Query: 305 SDYDAFYMVMYELWLRRSSV 324
           +  +AF  +M+ L L+  +V
Sbjct: 414 TKCNAFSAIMWALSLKDRNV 433


>gi|224119858|ref|XP_002318180.1| predicted protein [Populus trichocarpa]
 gi|224122230|ref|XP_002318783.1| predicted protein [Populus trichocarpa]
 gi|222858853|gb|EEE96400.1| predicted protein [Populus trichocarpa]
 gi|222859456|gb|EEE97003.1| predicted protein [Populus trichocarpa]
          Length = 417

 Score =  103 bits (258), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 86/309 (27%), Positives = 140/309 (45%), Gaps = 25/309 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +  +        P  +   I  IS
Sbjct: 78  FAAEIFMHRFLLSSPVRTLNPDEADWFYTPVYTTCDLTTNGLPLPFKSPRMMRSAIQLIS 137

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
             +PYWNRT GADHF+V  H  G       +KA E  +  +  +     ++    H+  K
Sbjct: 138 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLK 197

Query: 147 D--VSLPQIWPRQEDPPKLGSSK--RNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
           D  +++P   P Q+    L   K  R+   +F G      N P      R     VW N 
Sbjct: 198 DGSITVPPYAPPQKMQTHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENF 257

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            D+ ++  S      Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 258 KDNPLFDISTEHPATYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPF 317

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + V   D+P L  IL  I   E +L +  +L      Q  +FP      D
Sbjct: 318 ADAIPWEEIGVYVDEEDVPNLDTILTSIPP-EVILRKQRLLANPSMKQAMLFPQPAQPGD 376

Query: 309 AFYMVMYEL 317
           AF+ V+  L
Sbjct: 377 AFHQVLNGL 385


>gi|18642697|gb|AAL76189.1|AC092173_1 Unknown protein [Oryza sativa Japonica Group]
 gi|23821296|dbj|BAC20930.1| NpGUT1 homolog [Oryza sativa Japonica Group]
          Length = 401

 Score =  103 bits (258), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 85/312 (27%), Positives = 143/312 (45%), Gaps = 31/312 (9%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +       T   P  +   I  IS
Sbjct: 62  FATEIFMHRFLLSSAIRTSNPDEADWFYTPVYTTCDLTPWGHPLTTKSPRMMRSAIKFIS 121

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS-AMQKAWEVKLNAIQVV----CSSSYFISGHIAHK 146
           + +PYWNRT GADHF+V  H        Q+A  ++   + V+       ++    H   K
Sbjct: 122 KYWPYWNRTEGADHFFVVPHDFAACFYFQEAKAIERGILPVLRRATLVQTFGQKNHACLK 181

Query: 147 D--VSLPQIWPRQED-----PPKLGSSKRNKLAFFAG----AVNSP-----VREKLLQVW 190
           D  +++P   P  +      PP+   + R+   +F G      N P      R     VW
Sbjct: 182 DGSITVPPYTPAHKIRAHLVPPE---TPRSIFVYFRGLFYDTSNDPEGGYYARGARASVW 238

Query: 191 RN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
            N  ++ ++  S      Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+  D
Sbjct: 239 ENFKNNPMFDISTDHPQTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDID 298

Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY- 307
           LP +D + W+  ++ VA  D+P L  IL  I +E  L  Q+ +L      Q  +FP    
Sbjct: 299 LPLSDAIPWEEIAVFVAEDDVPQLDTILTSIPTEVILRKQA-MLAEPSMKQTMLFPQPAE 357

Query: 308 --DAFYMVMYEL 317
             D F+ VM  L
Sbjct: 358 PGDGFHQVMNAL 369


>gi|356512936|ref|XP_003525170.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
           [Glycine max]
          Length = 416

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 86/309 (27%), Positives = 141/309 (45%), Gaps = 25/309 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +  +        P  +   I  IS
Sbjct: 77  FAAEIFMHRFLLSSPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIS 136

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
             +PYWNRT GADHF+V  H  G       +KA E  +  +  +     ++    H+  K
Sbjct: 137 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILTLLRRATLVQTFGQRNHVCLK 196

Query: 147 D--VSLPQIWPRQEDPPKLGSSK--RNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
           +  +++P   P Q+    L   K  R+   +F G      N P      R     VW N 
Sbjct: 197 EGSITIPPYAPPQKMHTHLIPDKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENF 256

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            D+ ++  S    T Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 257 KDNLLFDISTEHPTTYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPF 316

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + V   D+P L  IL  I   E +L +  +L      Q  +FP      D
Sbjct: 317 ADAIPWEEIGVFVDEEDVPKLDTILTSIPP-EVILRKQRLLANPSMKQAMLFPQPAQPGD 375

Query: 309 AFYMVMYEL 317
           AF+ V+  L
Sbjct: 376 AFHQVLNGL 384


>gi|224119530|ref|XP_002318097.1| predicted protein [Populus trichocarpa]
 gi|222858770|gb|EEE96317.1| predicted protein [Populus trichocarpa]
          Length = 243

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 68/192 (35%), Positives = 96/192 (50%), Gaps = 29/192 (15%)

Query: 1   MNRSFRVYVYP--------HRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKD 52
           M + F+V+VYP        H  N+             + N+ASE YF      S F+TK+
Sbjct: 55  MEKDFKVFVYPGGNPGTCYHSTNNTL-----------KSNHASEHYFFMNLRDSPFLTKN 103

Query: 53  PSKADLFFLPFSIARMRHDRRIGTEGIPDF----ISHYIFNISQKYPYWNRTGGADHFYV 108
           P +A LFF+  S   +  +     E +P +    I  Y+  +   YPYWNRT GADHF+V
Sbjct: 104 PQEAHLFFIFISCLPLSDE-----EPLPGYRERVIKRYVKGLISTYPYWNRTLGADHFFV 158

Query: 109 ACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKR 168
           +CH+IG +A ++   +  NAI++VCS SY  S +I  KDV+LPQI      P       R
Sbjct: 159 SCHNIGSTATKEIPFLLKNAIRLVCSPSY-DSSYIPQKDVALPQILELSLPPDGDDMWNR 217

Query: 169 NKLAFFAGAVNS 180
           N +      V S
Sbjct: 218 NLMQLARSTVES 229


>gi|242088705|ref|XP_002440185.1| hypothetical protein SORBIDRAFT_09g027450 [Sorghum bicolor]
 gi|241945470|gb|EES18615.1| hypothetical protein SORBIDRAFT_09g027450 [Sorghum bicolor]
          Length = 416

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 86/314 (27%), Positives = 146/314 (46%), Gaps = 35/314 (11%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +       T   P  +   I  IS
Sbjct: 77  FATEIFMHRFLLTSAVRTLNPDEADWFYTPVYTTCDLTPWGHPLTTKSPRMMRSAIQYIS 136

Query: 92  QKYPYWNRTGGADHFYVACHSIG-------RSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
           +++PYWNRT GADHF+V  H  G        +A+Q+     L    +V   ++    H+ 
Sbjct: 137 KRWPYWNRTEGADHFFVTPHDFGACFYFQEETAIQRGVLPVLRRATLV--QTFGQKHHVC 194

Query: 145 HKD--VSLPQIWPRQED-----PPKLGSSKRNKLAFFAG----AVNSP-----VREKLLQ 188
            K+  +++P   P  +      PP+   + R+   +F G      N P      R     
Sbjct: 195 LKEGSITIPPYAPPHKIRTHIVPPE---TPRSIFVYFRGLFYDTANDPEGGYYARGARAS 251

Query: 189 VWRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANH 246
           VW N  ++ ++  S      Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+ 
Sbjct: 252 VWENFKNNALFDISTEHPPTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 311

Query: 247 YDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSD 306
             LPFAD + W+  ++ VA  D+  L  IL  I  EE +L +  +L      Q  +FP  
Sbjct: 312 IVLPFADAIPWEEIAVFVAEDDVLKLDTILTSIPMEE-ILRKQRLLANPSMKQAMLFPQP 370

Query: 307 Y---DAFYMVMYEL 317
               DAF+ V+  L
Sbjct: 371 AEPRDAFHQVLNGL 384


>gi|116793780|gb|ABK26875.1| unknown [Picea sitchensis]
          Length = 420

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 83/309 (26%), Positives = 137/309 (44%), Gaps = 25/309 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +  +        P  +   I  IS
Sbjct: 81  FATEIFMNRFLLGSPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQYIS 140

Query: 92  QKYPYWNRTGGADHFYVACHSIG-------RSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
             +PYWNRT GADHF+V  H  G         A+++     L    +V +        + 
Sbjct: 141 TNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRYHVCLK 200

Query: 145 HKDVSLPQIWPRQEDPPKL--GSSKRNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
              +++P   P Q+    L   S+ R+   +F G      N P      R     VW N 
Sbjct: 201 KGSITVPPYAPPQKMQAHLIPPSTPRSIFVYFRGLFYDVGNDPEGGYYARGARASVWENF 260

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            D+ ++  S      Y +++  + FCL   G+   + R+ + + +GC+PVIIA+   LPF
Sbjct: 261 KDNPLFDISTEHPATYYEDMQRAIFCLCPLGWAPWSPRLVEGVIFGCIPVIIADDIVLPF 320

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + VA  D+P L  IL  I   E +L +  +L      Q  +FP      D
Sbjct: 321 ADAIPWEEIGVFVAEKDVPNLDTILTSIPP-EVILRKQRLLANPAMKQAMLFPQPAQPGD 379

Query: 309 AFYMVMYEL 317
           AF+ ++  L
Sbjct: 380 AFHQILNGL 388


>gi|225456511|ref|XP_002284685.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L [Vitis
           vinifera]
 gi|147860849|emb|CAN83159.1| hypothetical protein VITISV_022554 [Vitis vinifera]
 gi|297734521|emb|CBI15768.3| unnamed protein product [Vitis vinifera]
          Length = 416

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 86/309 (27%), Positives = 141/309 (45%), Gaps = 25/309 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +  +        P  +   I  IS
Sbjct: 77  FAAEIFMHRFLLSSPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIS 136

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVK-LNAIQ-VVCSSSYFISGHIAHK 146
             +PYWNRT GADHF+V  H  G       +KA E   L+ +Q      ++    H+  K
Sbjct: 137 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILHLLQRATLVQTFGQRNHVCLK 196

Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
           +  +++P   P Q+    L    + R+   +F G      N P      R     VW N 
Sbjct: 197 EGSITIPPYAPPQKMQSHLIPPDTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENF 256

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            D+ ++  S      Y +++  + FCL   G+   + R+ + + +GC+PVIIA+   LPF
Sbjct: 257 KDNPLFDISTEHPMTYYEDMQRAVFCLCPLGWAPWSPRLVEGVIFGCIPVIIADDIVLPF 316

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + VA  D+P L  IL  I   E +L +  +L      Q  +FP      D
Sbjct: 317 ADAIPWEEIGVFVAEEDVPNLDTILTSIPP-EVILRKQRLLANPSMKQAMLFPQPAQPGD 375

Query: 309 AFYMVMYEL 317
           AF+ ++  L
Sbjct: 376 AFHQILNGL 384


>gi|21592991|gb|AAM64940.1| unknown [Arabidopsis thaliana]
          Length = 415

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 86/316 (27%), Positives = 141/316 (44%), Gaps = 39/316 (12%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E Y ++  + S   T +P +AD F++P ++   +  +        P  +   I  I+
Sbjct: 76  FAAEIYMQRFLLSSPVRTLNPEEADWFYVPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIA 135

Query: 92  QKYPYWNRTGGADHFYVACH------------SIGRSAMQKAWEVKLNAIQVVCSSSYFI 139
             +PYWNRT GADHF+V  H            +IGR  +       L         ++  
Sbjct: 136 SNWPYWNRTEGADHFFVVPHDFRACFHYQEEKAIGRGILPLLQRATL-------VQTFGQ 188

Query: 140 SGHIAHKD--VSLPQIWPRQEDPPKLGSSK--RNKLAFFAGAV----NSP-----VREKL 186
             H+  K+  +++P   P Q+    L   K  R+   +F G      N P      R   
Sbjct: 189 RNHVCLKEGSITVPPYAPPQKMQSHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGAR 248

Query: 187 LQVWRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIA 244
             VW N  D+ ++  S    T Y +++  + FCL   G+   + R+ +++ +GC+PVIIA
Sbjct: 249 AAVWENFKDNPLFDISTEHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIA 308

Query: 245 NHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFP 304
           +   LPFAD + W+   + V   D+P L  IL  I   E +L +  +L      Q  +FP
Sbjct: 309 DDIVLPFADAIPWEDIGVFVDEKDVPYLDTILTSIPP-EVILRKQRLLANPSMKQAMLFP 367

Query: 305 SDY---DAFYMVMYEL 317
                 DAF+ V+  L
Sbjct: 368 QPAQPGDAFHQVLNGL 383


>gi|302783378|ref|XP_002973462.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300159215|gb|EFJ25836.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 437

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 88/309 (28%), Positives = 141/309 (45%), Gaps = 25/309 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +  +        P  +   I  IS
Sbjct: 70  FATEIFVHEFLLGSAVRTLNPEEADWFYTPVYTTCDLTRNGLPLPFKSPRMMRSVIQYIS 129

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVK-LNAIQ-VVCSSSYFISGHIAHK 146
            ++PYWNRT GADHF+V  H  G       +KA E   L+ +Q      ++    H+  K
Sbjct: 130 NQWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILHLLQRATLVQTFGQRYHVCLK 189

Query: 147 DVSL--PQIWPRQEDPPKL--GSSKRNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
           + S+  P   P Q+    L   S  R+   +F G      N P      R     VW N 
Sbjct: 190 EGSIVVPPYCPPQKMQAHLIPPSIPRSIFVYFRGLFYDYGNDPEGGYYARGARAAVWENF 249

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            D+ ++  S      Y +++  + FCL   G+   + R+ + + +GC+PVIIA+   LPF
Sbjct: 250 KDNPLFDISTEHPITYYEDMQRAIFCLCPLGWAPWSPRLVEGVIFGCIPVIIADDIVLPF 309

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + VA  D+P L  IL  I   E +L +  +L      Q  +FP      D
Sbjct: 310 ADAIPWEEIGVFVAEKDVPKLDTILTSIPP-EVILKKQRLLATPAMKQAMLFPQPAQPGD 368

Query: 309 AFYMVMYEL 317
           AF+ ++  L
Sbjct: 369 AFHQILNGL 377


>gi|449496198|ref|XP_004160070.1| PREDICTED: LOW QUALITY PROTEIN: probable glucuronoxylan
           glucuronosyltransferase IRX7-like [Cucumis sativus]
          Length = 459

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 87/317 (27%), Positives = 133/317 (41%), Gaps = 37/317 (11%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISHY 86
           +ASE    +A + SH+ T DP +AD FF+P  ++           G P        IS  
Sbjct: 132 FASEVAIHRALLNSHYRTFDPLEADFFFVPVYVS----CNFSTVNGFPAIGHARSLISSA 187

Query: 87  IFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISG----- 141
           + +IS  Y +WNRT G+DH +VA H               + +     +S  +       
Sbjct: 188 VSHISSHYSFWNRTNGSDHVFVASHDFASCFHTMEHVAIADGVPSFLKNSIILQTFGVKY 247

Query: 142 -HIAH--KDVSLPQIWPRQEDPPKLGSS----KRNKLAFFAGAV------------NSPV 182
            H     + V +P   P +     L  S    +R+  AFF G +            +  V
Sbjct: 248 KHPCQDVEHVVIPPYIPPESIENTLERSPVTGRRDIFAFFRGKMEMNPKNVSGRFYSKKV 307

Query: 183 REKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVI 242
           R  + + +  D   Y    R    Y  E++ S FCL   G+   + R+ +S+  GCVPVI
Sbjct: 308 RTMIWRKFNGDRRFYLQRHRFPG-YQSEIVRSVFCLCPLGWAPWSPRLVESVALGCVPVI 366

Query: 243 IANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVL--KVRKHFQW 300
           IA+   LPF   +NW   SI VA  DI  L +IL  ++      +Q  +   + R+   +
Sbjct: 367 IADGIRLPFPSAVNWPEISITVAEKDIGKLGRILDHVAGSNLTTIQKNLWDPRNRRALLF 426

Query: 301 HVFPSDYDAFYMVMYEL 317
           H    D DA + V+  L
Sbjct: 427 HNQVEDGDATWQVIGAL 443


>gi|223974777|gb|ACN31576.1| unknown [Zea mays]
          Length = 412

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 83/336 (24%), Positives = 149/336 (44%), Gaps = 33/336 (9%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T DP  AD F+ P ++   +           P  +   I  ++
Sbjct: 73  FAAEIFMHQFLLSSAVRTLDPEAADWFYTPVYTTCDLTPQGFPLPFRAPRMMRSAIRYVA 132

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
             +P+WNRT GADHF++  H  G       ++A E  +  +  +     ++    H+  +
Sbjct: 133 ATWPFWNRTDGADHFFLTPHDFGACFHYQEERAVERGILPLLRRATLVQTFGQRNHVCLQ 192

Query: 147 DVSL---PQIWPRQEDPPKLG-SSKRNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
           D S+   P   P +     +G  + R+   +F G      N P      R     VW N 
Sbjct: 193 DGSITVPPYASPHRLQAHLVGPGTPRSIFVYFRGLFYDMGNDPEGGYYARGARASVWENF 252

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            D+ ++  S      Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 253 KDNPLFDISTEHPATYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 312

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK--VRKHFQWHVFPSDYDA 309
           AD + W+  S+ VA  D+P L  IL  I   + L  Q  + +  V++   +H      DA
Sbjct: 313 ADAIPWEDMSVFVAERDVPRLDSILTSIPLPDILRRQRLLARDSVKRALLFHQPARPGDA 372

Query: 310 FYMVM----------YELWLRRSSVRVQWSTSLDSN 335
           F+ V+           +++L+     + W   L+S+
Sbjct: 373 FHQVLNGLARKLPHGEDVFLQPGEKALDWDAGLESD 408


>gi|110288702|gb|ABB46945.2| exostosin family protein, putative, expressed [Oryza sativa
           Japonica Group]
 gi|215766610|dbj|BAG98714.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 341

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 85/312 (27%), Positives = 143/312 (45%), Gaps = 31/312 (9%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +       T   P  +   I  IS
Sbjct: 2   FATEIFMHRFLLSSAIRTSNPDEADWFYTPVYTTCDLTPWGHPLTTKSPRMMRSAIKFIS 61

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS-AMQKAWEVKLNAIQVV----CSSSYFISGHIAHK 146
           + +PYWNRT GADHF+V  H        Q+A  ++   + V+       ++    H   K
Sbjct: 62  KYWPYWNRTEGADHFFVVPHDFAACFYFQEAKAIERGILPVLRRATLVQTFGQKNHACLK 121

Query: 147 D--VSLPQIWPRQED-----PPKLGSSKRNKLAFFAG----AVNSP-----VREKLLQVW 190
           D  +++P   P  +      PP+   + R+   +F G      N P      R     VW
Sbjct: 122 DGSITVPPYTPAHKIRAHLVPPE---TPRSIFVYFRGLFYDTSNDPEGGYYARGARASVW 178

Query: 191 RN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
            N  ++ ++  S      Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   
Sbjct: 179 ENFKNNPMFDISTDHPQTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 238

Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY- 307
           LPF+D + W+  ++ VA  D+P L  IL  I +E  L  Q+ +L      Q  +FP    
Sbjct: 239 LPFSDAIPWEEIAVFVAEDDVPQLDTILTSIPTEVILRKQA-MLAEPSMKQTMLFPQPAE 297

Query: 308 --DAFYMVMYEL 317
             D F+ VM  L
Sbjct: 298 PGDGFHQVMNAL 309


>gi|293336613|ref|NP_001170701.1| uncharacterized protein LOC100384778 precursor [Zea mays]
 gi|238007038|gb|ACR34554.1| unknown [Zea mays]
 gi|413951433|gb|AFW84082.1| hypothetical protein ZEAMMB73_582447 [Zea mays]
          Length = 412

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 84/336 (25%), Positives = 148/336 (44%), Gaps = 33/336 (9%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T DP  AD F+ P ++   +           P  +   I  ++
Sbjct: 73  FAAEIFMHQFLLSSAVRTLDPEAADWFYTPVYTTCDLTPQGFPLPFRAPRMMRSAIRYVA 132

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
             +P+WNRT GADHF++  H  G       ++A E  +  +  +     ++    H+  +
Sbjct: 133 ATWPFWNRTDGADHFFLTPHDFGACFHYQEERAVERGILPLLRRATLVQTFGQRNHVCLQ 192

Query: 147 DVSL---PQIWPRQEDPPKLG-SSKRNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
           D S+   P   P +     +G  + R+   +F G      N P      R     VW N 
Sbjct: 193 DGSITVPPYASPHRLQAHLVGPGTPRSIFVYFRGLFYDMGNDPEGGYYARGARASVWENF 252

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            D+ ++  S      Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 253 KDNPLFDISTEHPATYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 312

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK--VRKHFQWHVFPSDYDA 309
           AD + W+  S+ VA  D+P L  IL  I   + L  Q  + +  V++   +H      DA
Sbjct: 313 ADAIPWEDMSVFVAERDVPRLDSILTSIPLPDILRRQRLLARDSVKRALLFHQPARPGDA 372

Query: 310 FYMVMYEL----------WLRRSSVRVQWSTSLDSN 335
           F+ V+  L          +L+     + W   L+S+
Sbjct: 373 FHQVLNGLARKLPHREGVFLQPGEKALDWDAGLESD 408


>gi|115481310|ref|NP_001064248.1| Os10g0180000 [Oryza sativa Japonica Group]
 gi|122212636|sp|Q33AH8.2|GT101_ORYSJ RecName: Full=Probable glucuronosyltransferase GUT1; AltName:
           Full=Glucuronoxylan glucuronosyltransferase 1;
           Short=OsGUT1
 gi|110288701|gb|ABB46947.2| exostosin family protein, putative, expressed [Oryza sativa
           Japonica Group]
 gi|113638857|dbj|BAF26162.1| Os10g0180000 [Oryza sativa Japonica Group]
          Length = 417

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 85/312 (27%), Positives = 143/312 (45%), Gaps = 31/312 (9%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +       T   P  +   I  IS
Sbjct: 78  FATEIFMHRFLLSSAIRTSNPDEADWFYTPVYTTCDLTPWGHPLTTKSPRMMRSAIKFIS 137

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS-AMQKAWEVKLNAIQVV----CSSSYFISGHIAHK 146
           + +PYWNRT GADHF+V  H        Q+A  ++   + V+       ++    H   K
Sbjct: 138 KYWPYWNRTEGADHFFVVPHDFAACFYFQEAKAIERGILPVLRRATLVQTFGQKNHACLK 197

Query: 147 D--VSLPQIWPRQED-----PPKLGSSKRNKLAFFAG----AVNSP-----VREKLLQVW 190
           D  +++P   P  +      PP+   + R+   +F G      N P      R     VW
Sbjct: 198 DGSITVPPYTPAHKIRAHLVPPE---TPRSIFVYFRGLFYDTSNDPEGGYYARGARASVW 254

Query: 191 RN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
            N  ++ ++  S      Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   
Sbjct: 255 ENFKNNPMFDISTDHPQTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 314

Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY- 307
           LPF+D + W+  ++ VA  D+P L  IL  I +E  L  Q+ +L      Q  +FP    
Sbjct: 315 LPFSDAIPWEEIAVFVAEDDVPQLDTILTSIPTEVILRKQA-MLAEPSMKQTMLFPQPAE 373

Query: 308 --DAFYMVMYEL 317
             D F+ VM  L
Sbjct: 374 PGDGFHQVMNAL 385


>gi|21671946|gb|AAM74308.1|AC083944_26 Unknown protein similar to exostosin-2 [Oryza sativa Japonica
           Group]
 gi|110288700|gb|ABB46946.2| exostosin family protein, putative, expressed [Oryza sativa
           Japonica Group]
          Length = 401

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 85/312 (27%), Positives = 143/312 (45%), Gaps = 31/312 (9%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +       T   P  +   I  IS
Sbjct: 62  FATEIFMHRFLLSSAIRTSNPDEADWFYTPVYTTCDLTPWGHPLTTKSPRMMRSAIKFIS 121

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS-AMQKAWEVKLNAIQVV----CSSSYFISGHIAHK 146
           + +PYWNRT GADHF+V  H        Q+A  ++   + V+       ++    H   K
Sbjct: 122 KYWPYWNRTEGADHFFVVPHDFAACFYFQEAKAIERGILPVLRRATLVQTFGQKNHACLK 181

Query: 147 D--VSLPQIWPRQED-----PPKLGSSKRNKLAFFAG----AVNSP-----VREKLLQVW 190
           D  +++P   P  +      PP+   + R+   +F G      N P      R     VW
Sbjct: 182 DGSITVPPYTPAHKIRAHLVPPE---TPRSIFVYFRGLFYDTSNDPEGGYYARGARASVW 238

Query: 191 RN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
            N  ++ ++  S      Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   
Sbjct: 239 ENFKNNPMFDISTDHPQTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 298

Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY- 307
           LPF+D + W+  ++ VA  D+P L  IL  I +E  L  Q+ +L      Q  +FP    
Sbjct: 299 LPFSDAIPWEEIAVFVAEDDVPQLDTILTSIPTEVILRKQA-MLAEPSMKQTMLFPQPAE 357

Query: 308 --DAFYMVMYEL 317
             D F+ VM  L
Sbjct: 358 PGDGFHQVMNAL 369


>gi|302823891|ref|XP_002993593.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300138521|gb|EFJ05285.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 437

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 88/309 (28%), Positives = 141/309 (45%), Gaps = 25/309 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +  +        P  +   I  IS
Sbjct: 70  FATEIFVHEFLLGSAVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSVIQYIS 129

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVK-LNAIQ-VVCSSSYFISGHIAHK 146
            ++PYWNRT GADHF+V  H  G       +KA E   L+ +Q      ++    H+  K
Sbjct: 130 NQWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILHLLQRATLVQTFGQRYHVCLK 189

Query: 147 DVSL--PQIWPRQEDPPKL--GSSKRNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
           + S+  P   P Q+    L   S  R+   +F G      N P      R     VW N 
Sbjct: 190 EGSIVVPPYCPPQKMQAHLIPPSIPRSIFVYFRGLFYDYGNDPEGGYYARGARAAVWENF 249

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            D+ ++  S      Y +++  + FCL   G+   + R+ + + +GC+PVIIA+   LPF
Sbjct: 250 KDNPLFDISTEHPITYYEDMQRAIFCLCPLGWAPWSPRLVEGVIFGCIPVIIADDIVLPF 309

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + VA  D+P L  IL  I   E +L +  +L      Q  +FP      D
Sbjct: 310 ADAIPWEEIGVFVAEKDVPKLDTILTSIPP-EVILKKQRLLATPAMKQAMLFPQPAQPGD 368

Query: 309 AFYMVMYEL 317
           AF+ ++  L
Sbjct: 369 AFHQILNGL 377


>gi|218184216|gb|EEC66643.1| hypothetical protein OsI_32904 [Oryza sativa Indica Group]
 gi|222612524|gb|EEE50656.1| hypothetical protein OsJ_30886 [Oryza sativa Japonica Group]
          Length = 400

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 85/312 (27%), Positives = 143/312 (45%), Gaps = 31/312 (9%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +       T   P  +   I  IS
Sbjct: 61  FATEIFMHRFLLSSAIRTSNPDEADWFYTPVYTTCDLTPWGHPLTTKSPRMMRSAIKFIS 120

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS-AMQKAWEVKLNAIQVV----CSSSYFISGHIAHK 146
           + +PYWNRT GADHF+V  H        Q+A  ++   + V+       ++    H   K
Sbjct: 121 KYWPYWNRTEGADHFFVVPHDFAACFYFQEAKAIERGILPVLRRATLVQTFGQKNHACLK 180

Query: 147 D--VSLPQIWPRQED-----PPKLGSSKRNKLAFFAG----AVNSP-----VREKLLQVW 190
           D  +++P   P  +      PP+   + R+   +F G      N P      R     VW
Sbjct: 181 DGSITVPPYTPAHKIRAHLVPPE---TPRSIFVYFRGLFYDTSNDPEGGYYARGARASVW 237

Query: 191 RN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
            N  ++ ++  S      Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   
Sbjct: 238 ENFKNNPMFDISTDHPQTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 297

Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY- 307
           LPF+D + W+  ++ VA  D+P L  IL  I +E  L  Q+ +L      Q  +FP    
Sbjct: 298 LPFSDAIPWEEIAVFVAEDDVPQLDTILTSIPTEVILRKQA-MLAEPSMKQTMLFPQPAE 356

Query: 308 --DAFYMVMYEL 317
             D F+ VM  L
Sbjct: 357 PGDGFHQVMNAL 368


>gi|75122905|sp|Q6H4N0.1|GT21_ORYSJ RecName: Full=Probable glucuronosyltransferase Os02g0520750
 gi|49389081|dbj|BAD26319.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
 gi|215769122|dbj|BAH01351.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 434

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 81/309 (26%), Positives = 142/309 (45%), Gaps = 25/309 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   + H         P  +   I  +S
Sbjct: 94  FAAEIFMHRFLLSSAVRTLNPEQADWFYAPVYTTCDLTHAGLPLPFKSPRMMRSAIQFLS 153

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
           +K+P+WNRT GADHF+V  H  G       +KA E  +  +  +     ++    H+  K
Sbjct: 154 RKWPFWNRTDGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKNHVCLK 213

Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
           +  +++P   P Q+    L    + R+   +F G      N P      R     +W N 
Sbjct: 214 EGSITIPPYAPPQKMQAHLIPPDTPRSIFVYFRGLFYDNGNDPEGGYYARGARASLWENF 273

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            ++ ++  S      Y +++  S FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 274 KNNPLFDISTEHPATYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 333

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W    + V   D+P L  IL  I  ++ +L +  +L      Q  +FP      D
Sbjct: 334 ADAIPWDEIGVFVDEEDVPRLDSILTSIPIDD-ILRKQRLLANPSMKQAMLFPQPAQPRD 392

Query: 309 AFYMVMYEL 317
           AF+ ++  L
Sbjct: 393 AFHQILNGL 401


>gi|125539672|gb|EAY86067.1| hypothetical protein OsI_07437 [Oryza sativa Indica Group]
          Length = 342

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 81/309 (26%), Positives = 142/309 (45%), Gaps = 25/309 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   + H         P  +   I  +S
Sbjct: 2   FAAEIFMHRFLLSSAVRTLNPEQADWFYAPVYTTCDLTHAGLPLPFKSPRMMRSAIQFLS 61

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
           +K+P+WNRT GADHF+V  H  G       +KA E  +  +  +     ++    H+  K
Sbjct: 62  RKWPFWNRTDGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKNHVCLK 121

Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
           +  +++P   P Q+    L    + R+   +F G      N P      R     +W N 
Sbjct: 122 EGSITIPPYAPPQKMQAHLIPPDTPRSIFVYFRGLFYDNGNDPEGGYYARGARASLWENF 181

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            ++ ++  S      Y +++  S FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 182 KNNPLFDISTEHPATYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 241

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W    + V   D+P L  IL  I  ++ +L +  +L      Q  +FP      D
Sbjct: 242 ADAIPWDEIGVFVDEEDVPRLDSILTSIPIDD-ILRKQRLLANPSMKQAMLFPQPAQPRD 300

Query: 309 AFYMVMYEL 317
           AF+ ++  L
Sbjct: 301 AFHQILNGL 309


>gi|297851174|ref|XP_002893468.1| hypothetical protein ARALYDRAFT_472948 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297339310|gb|EFH69727.1| hypothetical protein ARALYDRAFT_472948 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 412

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 85/313 (27%), Positives = 140/313 (44%), Gaps = 33/313 (10%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T++P +AD F+ P +    +           P  +   I  IS
Sbjct: 73  FAAEIFMHRFLLSSPVRTRNPDEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLIS 132

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIA-- 144
             +PYWNRT GADHF+V  H  G       +KA E  +  +  +     ++    H+   
Sbjct: 133 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLD 192

Query: 145 HKDVSLPQIWPRQED-----PPKLGSSKRNKLAFFAGAV----NSP-----VREKLLQVW 190
              +++P   P Q+      PP +    R+   +F G      N P      R     VW
Sbjct: 193 EGSITIPPFAPPQKMQAHFIPPDI---PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVW 249

Query: 191 RN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
            N  ++ ++  S    T Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   
Sbjct: 250 ENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 309

Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL----LLQSYVLKVRKHFQWHVFP 304
           LPFAD + W+   + VA  D+P L  IL  I +E  L    LL +  +K    F     P
Sbjct: 310 LPFADAIPWEEIGVFVAEKDVPELDTILTSIPTEVILRKQRLLANPSMKRAMLFPQPAQP 369

Query: 305 SDYDAFYMVMYEL 317
              DAF+ ++  L
Sbjct: 370 G--DAFHQILNGL 380


>gi|15223522|ref|NP_174064.1| putative glycosyltransferase [Arabidopsis thaliana]
 gi|75272002|sp|Q9FZJ1.1|IRX10_ARATH RecName: Full=Probable beta-1,4-xylosyltransferase IRX10; AltName:
           Full=Glucuronoxylan glucuronosyltransferase 1;
           Short=AtGUT1; AltName: Full=Glucuronoxylan
           glucuronosyltransferase 2; AltName: Full=Protein
           IRREGULAR XYLEM 10; AltName: Full=Xylan
           xylosyltransferase IRX10
 gi|9802541|gb|AAF99743.1|AC004557_22 F17L21.23 [Arabidopsis thaliana]
 gi|63003872|gb|AAY25465.1| At1g27440 [Arabidopsis thaliana]
 gi|98960979|gb|ABF58973.1| At1g27440 [Arabidopsis thaliana]
 gi|332192709|gb|AEE30830.1| putative glycosyltransferase [Arabidopsis thaliana]
          Length = 412

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 85/313 (27%), Positives = 140/313 (44%), Gaps = 33/313 (10%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T++P +AD F+ P +    +           P  +   I  IS
Sbjct: 73  FAAEIFMHRFLLSSPVRTRNPDEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSSIQLIS 132

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIA-- 144
             +PYWNRT GADHF+V  H  G       +KA E  +  +  +     ++    H+   
Sbjct: 133 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLD 192

Query: 145 HKDVSLPQIWPRQED-----PPKLGSSKRNKLAFFAGAV----NSP-----VREKLLQVW 190
              +++P   P Q+      PP +    R+   +F G      N P      R     VW
Sbjct: 193 EGSITIPPFAPPQKMQAHFIPPDI---PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVW 249

Query: 191 RN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
            N  ++ ++  S    T Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   
Sbjct: 250 ENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 309

Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL----LLQSYVLKVRKHFQWHVFP 304
           LPFAD + W+   + VA  D+P L  IL  I +E  L    LL +  +K    F     P
Sbjct: 310 LPFADAIPWEEIGVFVAEKDVPELDTILTSIPTEVILRKQRLLANPSMKRAMLFPQPAQP 369

Query: 305 SDYDAFYMVMYEL 317
              DAF+ ++  L
Sbjct: 370 G--DAFHQILNGL 380


>gi|115441965|ref|NP_001045262.1| Os01g0926400 [Oryza sativa Japonica Group]
 gi|75159223|sp|Q8S1X9.1|GT13_ORYSJ RecName: Full=Probable glucuronosyltransferase Os01g0926400;
           AltName: Full=OsGT47D
 gi|20160726|dbj|BAB89668.1| P0482D04.15 [Oryza sativa Japonica Group]
 gi|20805224|dbj|BAB92891.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
 gi|113534793|dbj|BAF07176.1| Os01g0926400 [Oryza sativa Japonica Group]
 gi|215686656|dbj|BAG88909.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215707279|dbj|BAG93739.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 422

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 85/338 (25%), Positives = 146/338 (43%), Gaps = 37/338 (10%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T DP +AD F+ P ++   +           P  +   +  ++
Sbjct: 83  FAAEIFMHQFLLSSPVRTLDPEEADWFYTPAYTTCDLTPQGFPLPFRAPRIMRSAVRYVA 142

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS-AMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSL 150
             +PYWNRT GADHF++A H  G     Q+   ++   + V+  ++   +    H     
Sbjct: 143 ATWPYWNRTDGADHFFLAPHDFGACFHYQEERAIERGILPVLRRATLVQTFGQRHHPCLQ 202

Query: 151 PQ--IWPRQEDPPKL------GSSKRNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
           P     P   DP K+       ++ R+   +F G      N P      R     VW N 
Sbjct: 203 PGSITVPPYADPRKMEAHRISPATPRSIFVYFRGLFYDMGNDPEGGYYARGARASVWENF 262

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            D+ ++  S      Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 263 KDNPLFDISTEHPATYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 322

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL----LLQSYVLKVRKHFQWHVFPSDY 307
           AD + W   S+ VA  D+P L  IL  +  +E +    LL S  +K    F     P   
Sbjct: 323 ADAIPWGEISVFVAEEDVPRLDTILASVPLDEVIRKQRLLASPAMKQAVLFHQPARPG-- 380

Query: 308 DAFYMVMYEL----------WLRRSSVRVQWSTSLDSN 335
           DAF+ ++  L          +L      + W   L+++
Sbjct: 381 DAFHQILNGLARKLPHPKGVFLEPGEKGIDWDQGLEND 418


>gi|125582313|gb|EAZ23244.1| hypothetical protein OsJ_06939 [Oryza sativa Japonica Group]
          Length = 434

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 81/309 (26%), Positives = 142/309 (45%), Gaps = 25/309 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   + H         P  +   I  +S
Sbjct: 94  FAAEIFMHRFLLSSAVRTLNPEQADWFYAPVYTTCDLTHAGLPLPFKSPRMMRSAIQFLS 153

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
           +K+P+WNRT GADHF+V  H  G       +KA E  +  +  +     ++    H+  K
Sbjct: 154 RKWPFWNRTDGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKNHVCLK 213

Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
           +  +++P   P Q+    L    + R+   +F G      N P      R     +W N 
Sbjct: 214 EGSITIPPYAPPQKMQAHLIPPDTPRSIFVYFRGLFYDNGNDPEGGYYARGARASLWENF 273

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            ++ ++  S      Y +++  S FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 274 KNNPLFDISTEHPATYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 333

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W    + V   D+P L  IL  I  ++ +L +  +L      Q  +FP      D
Sbjct: 334 ADAIPWDEIGVFVDEEDVPRLDSILTSIPIDD-ILRKQRLLANPSMKQAMLFPQPAQPRD 392

Query: 309 AFYMVMYEL 317
           AF+ ++  L
Sbjct: 393 AFHQILNGL 401


>gi|125528940|gb|EAY77054.1| hypothetical protein OsI_05011 [Oryza sativa Indica Group]
          Length = 422

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 85/338 (25%), Positives = 146/338 (43%), Gaps = 37/338 (10%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T DP +AD F+ P ++   +           P  +   +  ++
Sbjct: 83  FAAEIFMHQFLLSSPVRTLDPEEADWFYTPAYTTCDLTPQGFPLPFRAPRIMRSAVRYVA 142

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS-AMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSL 150
             +PYWNRT GADHF++A H  G     Q+   ++   + V+  ++   +    H     
Sbjct: 143 ATWPYWNRTDGADHFFLAPHDFGACFHYQEERAIERGILPVLRRATLVQTFGQRHHPCLQ 202

Query: 151 PQ--IWPRQEDPPKL------GSSKRNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
           P     P   DP K+       ++ R+   +F G      N P      R     VW N 
Sbjct: 203 PGSITVPPYADPRKMEAHRISPATPRSIFVYFRGLFYDMGNDPEGGYYARGARASVWENF 262

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            D+ ++  S      Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 263 KDNPLFDISTEHPATYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 322

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL----LLQSYVLKVRKHFQWHVFPSDY 307
           AD + W   S+ VA  D+P L  IL  +  +E +    LL S  +K    F     P   
Sbjct: 323 ADAIPWGEISVFVAEEDVPRLDTILASVPLDEVIRKQRLLASPAMKQAVLFHQPARPG-- 380

Query: 308 DAFYMVMYEL----------WLRRSSVRVQWSTSLDSN 335
           DAF+ ++  L          +L      + W   L+++
Sbjct: 381 DAFHQILNGLARKLPHPKGVFLEPGEKGIDWDQGLEND 418


>gi|357141002|ref|XP_003572039.1| PREDICTED: probable glucuronosyltransferase Os03g0107900-like
           [Brachypodium distachyon]
          Length = 465

 Score =  101 bits (251), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 88/340 (25%), Positives = 149/340 (43%), Gaps = 58/340 (17%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPF----------SIARMRHDRRIGTEGIPDF 82
           +A+E    +A ++ H   + P +ADLF +P            +  ++H R          
Sbjct: 138 FAAEVAVHEALLRRHLRAR-PEEADLFLVPVYVSCNFSTPTGLPSLKHAR--------GL 188

Query: 83  ISHYIFNISQKYPYWNRTGGADHFYVACHSIGRS--AMQK---AWEVKLNAIQVVCSSSY 137
           ++  +  + +  PYWNR+ G DH +VA H  G    AM+    A  +     + +   ++
Sbjct: 189 LAEAVELVRRDMPYWNRSAGTDHVFVASHDFGACFHAMEDVAIAGGIPEFLKRSILLQTF 248

Query: 138 FISGHIAHKDVS--------LPQIWPRQEDPPKLGSSKRNKLAFFAGAV----------- 178
            + G    ++V         LP++     +P K   S R+  AFF G +           
Sbjct: 249 GVQGRHTCQEVEHVVIPPHVLPEVARELPEPEK---SHRDIFAFFRGKMEVHPKNMSGRF 305

Query: 179 -NSPVREKLLQVWRNDSEIY----AHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADS 233
               VR KLLQ++ ++ + Y     H G     Y  E+  S FCL   G+   + R+ +S
Sbjct: 306 YGKKVRTKLLQLYGHNRKFYLKRKQHDG-----YRLEMARSLFCLCPLGWAPWSPRLVES 360

Query: 234 LYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK 293
           +  GC+PVIIA++  LPF  VL W   S+ VA  DI  L+ +L  ++S     +Q  +  
Sbjct: 361 VLLGCIPVIIADNIRLPFPGVLRWPDISLQVAERDIANLEAMLDHVASTNLTTIQGNLWD 420

Query: 294 V--RKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWSTS 331
              RK   ++    + DA + V+ EL  +   +R +   S
Sbjct: 421 PVKRKALVFNQPMEEGDATWQVLKELEAKLGHLRQKGRIS 460


>gi|212275360|ref|NP_001130059.1| uncharacterized protein LOC100191151 precursor [Zea mays]
 gi|194688192|gb|ACF78180.1| unknown [Zea mays]
          Length = 418

 Score =  101 bits (251), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 80/311 (25%), Positives = 140/311 (45%), Gaps = 29/311 (9%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +           P  +   I  ++
Sbjct: 78  FAAEIFMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPSGLPLPFKSPRMMRSAIERVA 137

Query: 92  QKYPYWNRTGGADHFYVACHSIG-------RSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
             +PYWNR+ GADHF+V  H  G         A+ +     L    +V   ++    H+ 
Sbjct: 138 TNWPYWNRSEGADHFFVTPHDFGACFHYQEEKAIGRGIPPLLQRATLV--QTFGQKNHVC 195

Query: 145 HKD--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWR 191
            KD  +++P   P Q+    L    + R+   +F G      N P      R     VW 
Sbjct: 196 LKDGSITIPPFAPPQKMQAHLIPADTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWE 255

Query: 192 N--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
           N  ++ ++  S      Y +++  S FCL   G+   + R+ +++ +GC+PVI+A+   L
Sbjct: 256 NFKNNPLFDISTDHPATYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIVADDIVL 315

Query: 250 PFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY-- 307
           PFAD + W+   + VA  D+P L  IL  I + + +L +  +L      Q  +FP     
Sbjct: 316 PFADAIPWEDIGVFVAEEDVPRLDTILTSIPT-DVVLRKQRLLANPSMKQAMLFPQPAQA 374

Query: 308 -DAFYMVMYEL 317
            DAF+ ++  L
Sbjct: 375 GDAFHQILNGL 385


>gi|242059741|ref|XP_002459016.1| hypothetical protein SORBIDRAFT_03g044520 [Sorghum bicolor]
 gi|241930991|gb|EES04136.1| hypothetical protein SORBIDRAFT_03g044520 [Sorghum bicolor]
          Length = 417

 Score =  101 bits (251), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 83/309 (26%), Positives = 141/309 (45%), Gaps = 25/309 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +           P  +   I  IS
Sbjct: 78  FAAEIFMHRFLLSSAIRTLNPEEADWFYTPVYTTCDLTPWGHPLPFKSPRIMRSAIQFIS 137

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
            ++PYWNRT GADHF+V  H  G       +KA E  +  +  +     ++    H+  K
Sbjct: 138 NRWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGVLPLLRRATLVQTFGQKDHVCLK 197

Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
           +  +++P   P Q+    L    + R+   +F G      N P      R     VW N 
Sbjct: 198 EGSITIPPYAPPQKMKTHLVPPGTPRSIFVYFRGLFYDTANDPEGGYYARGARASVWENF 257

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            ++ ++  S      Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 258 KNNPLFDISTDHPPTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 317

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + VA  D+P L  IL  I   E +L +  +L      Q  +FP      D
Sbjct: 318 ADAIPWEEIGVFVAEDDVPKLDTILTSIPM-EVILRKQRLLANPSMKQAMLFPQPAQPGD 376

Query: 309 AFYMVMYEL 317
           AF+ ++  L
Sbjct: 377 AFHQILNGL 385


>gi|168003473|ref|XP_001754437.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162694539|gb|EDQ80887.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 353

 Score =  100 bits (250), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 84/316 (26%), Positives = 132/316 (41%), Gaps = 52/316 (16%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISHY 86
           +A+E    K  M S   T DP +AD FF+P  ++     +     G P       F+   
Sbjct: 31  FAAEVAIHKVLMTSPIRTLDPCEADFFFIPVYVS----CKFTPKTGFPWLGQARKFMEAA 86

Query: 87  IFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFI------- 139
           + ++S +  +WNR+GG DH +VA H  G        E   + I      S  +       
Sbjct: 87  VNHVSTRMEFWNRSGGRDHIFVASHDYGACFHTLETEAIAHGIPEFMRKSLILQTFGVQD 146

Query: 140 ------SGHIAHKDVSLPQIWPRQ-EDPPKLGSSKRNKLAFFAGAV------------NS 180
                 + HI       P +     +DPP+    KRN  AFF G +            + 
Sbjct: 147 FHPCQAAEHIQIPPYVSPSVAASYIKDPPE--RQKRNIFAFFRGKMEINPKNVSGLVYSR 204

Query: 181 PVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
            VR  L + + ++        R    Y  E+L S FCL   G+   + RI +++ +GCVP
Sbjct: 205 GVRTVLYKKFSHNRRFLLKRHRTDN-YQLEMLRSTFCLCPVGWAPWSPRIVEAVVHGCVP 263

Query: 241 VIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK------- 293
           VIIA++  LP++  ++W   S+ V   D+P L KIL  +++     +Q  + K       
Sbjct: 264 VIIADNISLPYSHAIDWTGISLSVREHDVPKLDKILLNVAATNLSTIQHNLWKEENRRAL 323

Query: 294 ------VRKHFQWHVF 303
                 V+    WHVF
Sbjct: 324 LFTDPLVKGDATWHVF 339


>gi|195619204|gb|ACG31432.1| secondary cell wall-related glycosyltransferase family 47 [Zea
           mays]
          Length = 419

 Score =  100 bits (250), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 83/309 (26%), Positives = 141/309 (45%), Gaps = 25/309 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +           P  +   I  IS
Sbjct: 80  FAAEIFMHRFLLSSAIRTLNPEEADWFYTPVYTTCDLTPWGHPLPFKSPRIMRSAIQFIS 139

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
            ++PYWNRT GADHF+V  H  G       +KA E  +  +  +     ++    H+  K
Sbjct: 140 NRWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGVLPLLRRATLVQTFGQKDHVCLK 199

Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
           +  +++P   P Q+    L    + R+   +F G      N P      R     VW N 
Sbjct: 200 EGSITIPPYAPPQKMKTHLVPPGTPRSIFVYFRGLFYDTANDPEGGYYARGARASVWENF 259

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            ++ ++  S      Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 260 KNNPLFDISTDHPPTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 319

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + VA  D+P L  IL  I   E +L +  +L      Q  +FP      D
Sbjct: 320 ADAIPWEEIGVFVAEDDVPRLDTILTSIPM-EVILRKQRLLANPSMKQAMLFPQPAQAGD 378

Query: 309 AFYMVMYEL 317
           AF+ ++  L
Sbjct: 379 AFHQILNGL 387


>gi|242059743|ref|XP_002459017.1| hypothetical protein SORBIDRAFT_03g044530 [Sorghum bicolor]
 gi|241930992|gb|EES04137.1| hypothetical protein SORBIDRAFT_03g044530 [Sorghum bicolor]
          Length = 420

 Score =  100 bits (249), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 80/309 (25%), Positives = 138/309 (44%), Gaps = 25/309 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +           P  +   I  I+
Sbjct: 81  FAAEIFMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPSGLPLPFKSPRMMRSAIELIA 140

Query: 92  QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS-----GHIAHK 146
             +PYWNR+ GADHF+V  H  G     +  +     I  +   +  +       H+  K
Sbjct: 141 TNWPYWNRSEGADHFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLK 200

Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
           D  +++P   P Q+    L    + R+   +F G      N P      R     VW N 
Sbjct: 201 DGSITIPPFAPPQKMQAHLIPADTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWENF 260

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            ++ ++  S      Y +++  S FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 261 KNNPLFDISTDHPPTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 320

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + VA  D+P L  IL  I + + +L +  +L      Q  +FP      D
Sbjct: 321 ADAIPWEEIGVFVAEEDVPKLDSILTSIPT-DVILRKQRLLANPSMKQAMLFPQPAQAGD 379

Query: 309 AFYMVMYEL 317
           AF+ ++  L
Sbjct: 380 AFHQILNGL 388


>gi|357126596|ref|XP_003564973.1| PREDICTED: probable glucuronosyltransferase Os01g0926600-like
           [Brachypodium distachyon]
          Length = 411

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 84/312 (26%), Positives = 141/312 (45%), Gaps = 31/312 (9%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +           P  +   I  IS
Sbjct: 72  FAAEIFMHRFLLSSAIRTMNPEEADWFYTPVYTTCDLTPWGHPLPFKSPRIMRSAIQFIS 131

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
             +PYWNRT GADHF+V  H  G       +KA E  +  +  +     ++    H+  K
Sbjct: 132 SHWPYWNRTAGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKDHVCLK 191

Query: 147 D--VSLPQIWPRQED-----PPKLGSSKRNKLAFFAG----AVNSP-----VREKLLQVW 190
           +  +++P   P Q+      PP+   + R+   +F G      N P      R     VW
Sbjct: 192 EGSINIPPYAPPQKMKTHLVPPE---TPRSIFVYFRGLFYDTANDPEGGYYARGARASVW 248

Query: 191 RN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
            N  ++ ++  S      Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   
Sbjct: 249 ENFKNNPLFDISTDHPPTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 308

Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY- 307
           LPFAD + W    + VA  D+P L  IL  I   E +L +  +L      Q  +FP    
Sbjct: 309 LPFADAIPWDEIGVFVAEDDVPKLDTILTSIPM-EVILRKQRLLANPSMKQAMLFPQPAQ 367

Query: 308 --DAFYMVMYEL 317
             DAF+ ++  L
Sbjct: 368 PGDAFHQILNGL 379


>gi|302826405|ref|XP_002994684.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300137154|gb|EFJ04250.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 332

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 74/275 (26%), Positives = 126/275 (45%), Gaps = 28/275 (10%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISHY 86
           +A+E    +  ++S   T DP +AD FF+P  ++     R     G P      D +   
Sbjct: 32  FAAEVAIHQNLLRSPVRTLDPDEADFFFMPVYVSCNFTSR----SGFPTLFHASDILQAA 87

Query: 87  IFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHK 146
           +  +S+  P+W+R  G DH +VA H  G         V +   Q + +S   I      K
Sbjct: 88  VGLVSRNMPFWDRHQGRDHVFVATHDFGACFHAMDLAVTMGIPQFLRNS--IILQTFGEK 145

Query: 147 DVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAV------------NSPVREKLLQVWRNDS 194
           +    Q     + PP +   +R  LAFF G +            +  VR  + + + +D 
Sbjct: 146 NKHPCQNVDHIQIPPYV---RRKILAFFRGKMEIHPKNVSGHMYSRGVRTTIWRRFSHDR 202

Query: 195 EIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
             +    R    Y  E+L S FCL   G+   + RI +S+  GC+PVIIA++  LP++ V
Sbjct: 203 RFFIKRKRSDN-YKAEMLRSVFCLCPLGWAPWSPRIVESVIQGCIPVIIADNIQLPYSHV 261

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQS 289
           ++W+  S+ VA  D+  L +IL  +++    ++Q+
Sbjct: 262 IDWRKISVTVAERDVHKLDRILSRVAATNVSMIQA 296


>gi|297808283|ref|XP_002872025.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297317862|gb|EFH48284.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 465

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 88/324 (27%), Positives = 145/324 (44%), Gaps = 49/324 (15%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIA----------RMRHDRRIGTEGIPDF 82
           +A+E    +A + S   T DP +AD FF+P  ++           + H R + +  + DF
Sbjct: 137 FAAEVAIHRALLSSDVRTLDPEEADFFFVPVYVSCNFSTSNGFPSLSHARSLLSSAV-DF 195

Query: 83  ISHYIFNISQKYPYWNRTGGADHFYVACHSIGRS--AMQK-AWE------VKLNAIQVVC 133
           +S +       YP+WNRT G+DH +VA H  G    AM+  A E      +K + I    
Sbjct: 196 LSDH-------YPFWNRTQGSDHVFVASHDFGACFHAMEDMAIEEGIPEFMKKSIILQTF 248

Query: 134 SSSYFISGHIAHKDVSLPQIWPRQ-----EDPPKLGSSKRNKLAFFAGAV---------- 178
              Y          V  P I P       E  P  G  +R+  AFF G +          
Sbjct: 249 GVKYKHPCQEVEHVVIPPYIPPESVQRAIEKAPANG--RRDIWAFFRGKMEVNPKNISGR 306

Query: 179 --NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYY 236
             +  VR  +L+ +      Y +  R    Y  E++ S FCL   G+   + R+ +S   
Sbjct: 307 FYSKGVRTAILKKFGGRRRFYLNRHRFAG-YRSEIVRSVFCLCPLGWAPWSPRLVESAVL 365

Query: 237 GCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK--V 294
           GCVPV+IA+   LPF++ + W   S+ VA  D+  L+KIL+ +++    ++Q  +     
Sbjct: 366 GCVPVVIADGIKLPFSETVRWPEISLTVAEKDVRSLRKILEHVAATNLSVIQRNLHGPVF 425

Query: 295 RKHFQWHVFPSDYDAFYMVMYELW 318
           ++   ++V   + DA + ++  LW
Sbjct: 426 KRALLYNVPMKEGDATWHILESLW 449


>gi|307135997|gb|ADN33853.1| glucuronoxylan glucuronosyltransferase [Cucumis melo subsp. melo]
          Length = 416

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 84/309 (27%), Positives = 140/309 (45%), Gaps = 25/309 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +  +        P  +   I  IS
Sbjct: 77  FAAEIFMHRFLLTSPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIS 136

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
             +PYWNRT GADHF+V  H  G       +KA E  +  +  +     ++    H+  K
Sbjct: 137 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLK 196

Query: 147 D--VSLPQIWPRQEDPPKLGSSK--RNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
           +  +++P   P Q+    L   K  R+   +F G      N P      R     VW N 
Sbjct: 197 EGSITIPPYAPPQKMHAHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENF 256

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            D+ ++  S    T Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 257 KDNPLFDISTEHPTTYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPF 316

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + +   D+  L  IL  I   E +L +  +L      Q  +FP      D
Sbjct: 317 ADAIPWEEIGVFLDEKDVANLDTILTSIPL-EMILRKQRLLANPSMKQAMLFPQPAQPGD 375

Query: 309 AFYMVMYEL 317
           AF+ V+  L
Sbjct: 376 AFHQVLNGL 384


>gi|194697530|gb|ACF82849.1| unknown [Zea mays]
          Length = 418

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 78/309 (25%), Positives = 138/309 (44%), Gaps = 25/309 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +           P  +   I  ++
Sbjct: 78  FAAEIFMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPSGLPLPFKSPRMMRSAIELVA 137

Query: 92  QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS-----GHIAHK 146
             +PYWNR+ GADHF+V  H  G     +  +     I  +   +  +       H+  K
Sbjct: 138 TNWPYWNRSEGADHFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLK 197

Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
           D  +++P   P Q+    L    + R+   +F G      N P      R     VW N 
Sbjct: 198 DGSITIPPFAPPQKMQAHLIPADTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWENF 257

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            ++ ++  S      Y +++  S FCL   G+   + R+ +++ +GC+PVI+A+   LPF
Sbjct: 258 KNNPLFDISTDHPATYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIVADDIVLPF 317

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + VA  D+P L  IL  I + + +L +  +L      Q  +FP      D
Sbjct: 318 ADAIPWEDIGVFVAEEDVPRLDTILTSIPT-DVVLRKQRLLANPSMKQAMLFPQPAQAGD 376

Query: 309 AFYMVMYEL 317
           AF+ ++  L
Sbjct: 377 AFHQILNGL 385


>gi|301072484|gb|ADK56172.1| glycosyltransferase 47 [Triticum aestivum]
          Length = 415

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 83/312 (26%), Positives = 141/312 (45%), Gaps = 31/312 (9%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +           P  +   +  IS
Sbjct: 76  FAAEIFMHRFLLSSAIRTMNPEEADWFYTPVYTTCDLTPWGHPLPFKSPRIMRSAVQFIS 135

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
             +PYWNRT GADHF+V  H  G       +KA E  +  +  +     ++    H+  K
Sbjct: 136 SHWPYWNRTAGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKDHVCLK 195

Query: 147 D--VSLPQIWPRQED-----PPKLGSSKRNKLAFFAG----AVNSP-----VREKLLQVW 190
           +  +++P   P Q+      PP+   + R+   +F G      N P      R     VW
Sbjct: 196 EGSINIPPYAPPQKMKTHLVPPE---TPRSIFVYFRGLFYDTANDPEGGYYARGARASVW 252

Query: 191 RN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
            N  ++ ++  S      Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   
Sbjct: 253 ENFKNNPLFDISTDHPPTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 312

Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY- 307
           LPFAD + W    + VA  D+P L  IL  I   E +L +  +L      Q  +FP    
Sbjct: 313 LPFADAIPWDEIGVFVAEDDVPKLDTILTSIPM-EVILRKQRLLANPSMKQAMLFPQPAQ 371

Query: 308 --DAFYMVMYEL 317
             DAF+ ++  L
Sbjct: 372 PGDAFHQILNGL 383


>gi|363543255|ref|NP_001241842.1| uncharacterized protein LOC100857042 precursor [Zea mays]
 gi|194704652|gb|ACF86410.1| unknown [Zea mays]
 gi|224034207|gb|ACN36179.1| unknown [Zea mays]
 gi|414878896|tpg|DAA56027.1| TPA: hypothetical protein ZEAMMB73_615997 [Zea mays]
          Length = 418

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 78/309 (25%), Positives = 138/309 (44%), Gaps = 25/309 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +           P  +   I  ++
Sbjct: 78  FAAEIFMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPSGLPLPFKSPRMMRSAIELVA 137

Query: 92  QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS-----GHIAHK 146
             +PYWNR+ GADHF+V  H  G     +  +     I  +   +  +       H+  K
Sbjct: 138 TNWPYWNRSEGADHFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLK 197

Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
           D  +++P   P Q+    L    + R+   +F G      N P      R     VW N 
Sbjct: 198 DGSITIPPFAPPQKMQAHLIPADTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWENF 257

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            ++ ++  S      Y +++  S FCL   G+   + R+ +++ +GC+PVI+A+   LPF
Sbjct: 258 KNNPLFDISTDHPATYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIVADDIVLPF 317

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + VA  D+P L  IL  I + + +L +  +L      Q  +FP      D
Sbjct: 318 ADAIPWEDIGVFVAEEDVPRLDTILTSIPT-DVVLRKQRLLANPSMKQAMLFPQPAQAGD 376

Query: 309 AFYMVMYEL 317
           AF+ ++  L
Sbjct: 377 AFHQILNGL 385


>gi|219363227|ref|NP_001136572.1| hypothetical protein precursor [Zea mays]
 gi|194696226|gb|ACF82197.1| unknown [Zea mays]
 gi|414866559|tpg|DAA45116.1| TPA: hypothetical protein ZEAMMB73_757616 [Zea mays]
          Length = 453

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 81/294 (27%), Positives = 139/294 (47%), Gaps = 26/294 (8%)

Query: 54  SKADLFFLPF--SIARMRHDRRI--GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVA 109
           + AD+ F+PF  S++  RH R +          +   +       P W RTGG DH  +A
Sbjct: 136 ADADVVFVPFFASLSFNRHSRVVPPARNSEDRALQRRLLEFLAARPEWRRTGGRDHVVLA 195

Query: 110 CHSIGR-SAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLP---QIWPRQEDPPKLG 164
            H  G   A  + W      + V+C    Y  S     KD+  P    +     D    G
Sbjct: 196 HHPNGMLDARYRFWPC----VFVLCDFGRYPPSVANLDKDIIAPYRHLVANFAND--TAG 249

Query: 165 SSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKF 216
              R  L +F GA+       +R++L  + +++ +++   G +     ++    +  SKF
Sbjct: 250 YDDRPTLLYFQGAIYRKDGGSIRQELYYLLKDEKDVHFSFGSVAGNGIEQATQGMRSSKF 309

Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLK 273
           CL++ G   ++ R+ DS+   CVPVII++  +LPF DVL++  FS++V   D      LK
Sbjct: 310 CLNIAGDTPSSNRLFDSIVSHCVPVIISDEIELPFEDVLDYSKFSVIVRGADAVKKGFLK 369

Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
            ++KGIS EE+  + + + +V KHF++       DA  M+   +  +  S+R++
Sbjct: 370 SLIKGISQEEWTRMWNKLKEVEKHFEYQYPSQTDDAVQMIWKAIARKVPSIRLK 423


>gi|326509565|dbj|BAJ86998.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326509967|dbj|BAJ87200.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 415

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 83/312 (26%), Positives = 141/312 (45%), Gaps = 31/312 (9%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +           P  +   +  IS
Sbjct: 76  FAAEIFMHRFLLSSAIRTMNPEEADWFYTPVYTTCDLTPWGHPLPFKSPRIMRSAVQFIS 135

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
             +PYWNRT GADHF+V  H  G       +KA E  +  +  +     ++    H+  K
Sbjct: 136 SHWPYWNRTAGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKDHVCLK 195

Query: 147 D--VSLPQIWPRQED-----PPKLGSSKRNKLAFFAG----AVNSP-----VREKLLQVW 190
           +  +++P   P Q+      PP+   + R+   +F G      N P      R     VW
Sbjct: 196 EGSINIPPYAPPQKMKTHLVPPE---TPRSIFVYFRGLFYDTANDPEGGYYARGARASVW 252

Query: 191 RN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
            N  ++ ++  S      Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   
Sbjct: 253 ENFKNNPLFDISTDHPPTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 312

Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY- 307
           LPFAD + W    + VA  D+P L  IL  I   E +L +  +L      Q  +FP    
Sbjct: 313 LPFADAIPWDEIGVFVAEDDVPKLDTILTSIPM-EVILRKQRLLANPSMKQAMLFPQPAQ 371

Query: 308 --DAFYMVMYEL 317
             DAF+ ++  L
Sbjct: 372 AGDAFHQILNGL 383


>gi|242088703|ref|XP_002440184.1| hypothetical protein SORBIDRAFT_09g027440 [Sorghum bicolor]
 gi|241945469|gb|EES18614.1| hypothetical protein SORBIDRAFT_09g027440 [Sorghum bicolor]
          Length = 415

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 80/309 (25%), Positives = 138/309 (44%), Gaps = 25/309 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +           P  +   I  I+
Sbjct: 76  FAAEIFMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPKGLPLPFKSPRMMRSAIQLIA 135

Query: 92  QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS-----GHIAHK 146
             +PYWNR+ GADHF+V  H  G     +  +     I  +   +  +       H+  K
Sbjct: 136 TNWPYWNRSEGADHFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLK 195

Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
           D  +++P   P Q+    L    + R+   +F G      N P      R     VW N 
Sbjct: 196 DGSITIPPYAPPQKMQTHLIPADTPRSIFVYFRGLFYDTGNDPEGGYYARGARASVWENF 255

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            ++ ++  S      Y +++  S FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 256 KNNPLFDISTDHPPTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 315

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + VA  D+P L  IL  I + + +L +  +L      Q  +FP      D
Sbjct: 316 ADAIPWEEIGVFVAEEDVPQLDSILTSIPT-DVVLRKQRLLANPSMKQAMLFPQPAQPGD 374

Query: 309 AFYMVMYEL 317
           AF+ ++  L
Sbjct: 375 AFHQILNGL 383


>gi|414878899|tpg|DAA56030.1| TPA: secondary cell wall glycosyltransferase family 47 isoform 1
           [Zea mays]
 gi|414878900|tpg|DAA56031.1| TPA: secondary cell wall glycosyltransferase family 47 isoform 2
           [Zea mays]
          Length = 419

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 83/309 (26%), Positives = 140/309 (45%), Gaps = 25/309 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P  AD F+ P ++   +           P  +   I  IS
Sbjct: 80  FAAEIFMHRFLLSSAIRTLNPEVADWFYTPVYTTCDLTPWGHPLPFKSPRIMRSAIQFIS 139

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
            ++PYWNRT GADHF+V  H  G       +KA E  +  +  +     ++    H+  K
Sbjct: 140 NRWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGVLPLLRRATLVQTFGQKDHVCLK 199

Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
           +  +++P   P Q+    L    + R+   +F G      N P      R     VW N 
Sbjct: 200 EGSITIPPYAPPQKMKTHLVPPGTPRSIFVYFRGLFYDTANDPEGGYYARGARASVWENF 259

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            ++ ++  S      Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 260 KNNPLFDISTDHPPTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 319

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + VA  D+P L  IL  I   E +L +  +L      Q  +FP      D
Sbjct: 320 ADAIPWEEIGVFVAEDDVPRLDTILTSIPM-EVILRKQRLLANPSMKQAMLFPQPAQAGD 378

Query: 309 AFYMVMYEL 317
           AF+ ++  L
Sbjct: 379 AFHQILNGL 387


>gi|449456052|ref|XP_004145764.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
           IRX7-like [Cucumis sativus]
          Length = 459

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 85/323 (26%), Positives = 134/323 (41%), Gaps = 49/323 (15%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISHY 86
           +ASE    +A + S + T DP +AD FF+P  ++           G P        IS  
Sbjct: 132 FASEVAIHRALLNSDYRTFDPLEADFFFVPVYVS----CNFSTVNGFPAIGHARSLISSA 187

Query: 87  IFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHK 146
           + +IS  Y +WNRT G+DH +VA H              +  + +      F+   I  +
Sbjct: 188 VSHISSHYSFWNRTNGSDHVFVASHDFASCFHT------MEHVAIADGVPSFLKNSIILQ 241

Query: 147 DVSLPQIWPRQE-----DPPKLG-------------SSKRNKLAFFAGAV---------- 178
              +    P Q+      PP +              + +R+  AFF G +          
Sbjct: 242 TFGVKYKHPCQDVEHVVIPPYISPESIENTLERSPVTGRRDIFAFFRGKMEMNPKNVSGR 301

Query: 179 --NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYY 236
             +  VR  + + +  D   Y    R    Y  E++ S FCL   G+   + R+ +S+  
Sbjct: 302 FYSKKVRTMIWRKFNGDRRFYLQRHRFPG-YQSEIVRSVFCLCPLGWAPWSPRLVESVAL 360

Query: 237 GCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVL--KV 294
           GCVPVIIA+   LPF   +NW   SI VA  DI  L +IL  +++     +Q  +   + 
Sbjct: 361 GCVPVIIADGIRLPFPSAVNWPEISITVAEKDIGKLGRILDHVAASNLTTIQKNLWDPRN 420

Query: 295 RKHFQWHVFPSDYDAFYMVMYEL 317
           R+   +H    D DA + V+  L
Sbjct: 421 RRALLFHNQVEDGDATWQVIGAL 443


>gi|449440484|ref|XP_004138014.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
           [Cucumis sativus]
 gi|449528307|ref|XP_004171146.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
           [Cucumis sativus]
          Length = 416

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 84/309 (27%), Positives = 140/309 (45%), Gaps = 25/309 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +  +        P  +   I  IS
Sbjct: 77  FAAEIFMHRFLLTSPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIS 136

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
             +PYWNRT GADHF+V  H  G       +KA E  +  +  +     ++    H+  K
Sbjct: 137 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLK 196

Query: 147 D--VSLPQIWPRQEDPPKLGSSK--RNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
           +  +++P   P Q+    L   K  R+   +F G      N P      R     VW N 
Sbjct: 197 EGSITIPPYAPPQKMHAHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENF 256

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            D+ ++  S    T Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 257 KDNPLFDISTEHPTTYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPF 316

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + +   D+  L  IL  I   E +L +  +L      Q  +FP      D
Sbjct: 317 ADAIPWEEIGVFLDEKDVANLDTILTSIPL-EMILRKQRLLANPSMKQAMLFPQPAQPGD 375

Query: 309 AFYMVMYEL 317
           AF+ V+  L
Sbjct: 376 AFHQVLNGL 384


>gi|326530125|dbj|BAK08342.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 81/296 (27%), Positives = 140/296 (47%), Gaps = 22/296 (7%)

Query: 50  TKDPSKADLFFLPF--SIARMRHDRRIGTEGIPD--FISHYIFNISQKYPYWNRTGGADH 105
            +DP++AD+ F+PF  S++  RH + +      +   +   +       P W R+GG DH
Sbjct: 140 VRDPARADVVFVPFFASLSFNRHSKVVPPARTSEDRTLQRRLIEFLAARPEWRRSGGRDH 199

Query: 106 FYVACHSIGR-SAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLP-QIWPRQEDPPK 162
             +A H  G   A  K W      + V+C    Y  S     KDV  P Q          
Sbjct: 200 VVLAHHPNGMLDARYKLWP----CVFVLCDFGRYPHSVANIDKDVIAPYQHVVDDFLNDS 255

Query: 163 LGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGS 214
            G   R  L +F GA+       +R++L  + +++ +++   G +     +E    +  S
Sbjct: 256 TGYDDRPTLLYFQGAIYRKDGGFIRQELYYLLKDEKDVHFSFGSVAGNGIEESTRGMRAS 315

Query: 215 KFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PL 271
           KFCL++ G   ++ R+ DS+   CVPVII++  +LPF D+L++  F I+V   D      
Sbjct: 316 KFCLNIAGDTPSSNRLFDSIVSHCVPVIISDEIELPFEDMLDYSKFCIIVRGADAVKKGF 375

Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
           L  ++KGIS EE+  + + + +V  HF++       DA  M+   +  +  S+R++
Sbjct: 376 LINLIKGISPEEWTSMWNKLREVEGHFEYQYPSQPEDAVQMIWKTIARKVPSIRLK 431


>gi|75159221|sp|Q8S1X7.1|GT15_ORYSJ RecName: Full=Probable glucuronosyltransferase Os01g0926700
 gi|20160728|dbj|BAB89670.1| P0482D04.17 [Oryza sativa Japonica Group]
 gi|20805226|dbj|BAB92893.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
 gi|125528942|gb|EAY77056.1| hypothetical protein OsI_05014 [Oryza sativa Indica Group]
 gi|125573178|gb|EAZ14693.1| hypothetical protein OsJ_04618 [Oryza sativa Japonica Group]
          Length = 417

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 80/309 (25%), Positives = 138/309 (44%), Gaps = 25/309 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +           P  +   I  I+
Sbjct: 78  FAAEIFMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPSGLPLPFKSPRMMRSAIELIA 137

Query: 92  QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS-----GHIAHK 146
             +PYWNR+ GADHF+V  H  G     +  +     I  +   +  +       H+  K
Sbjct: 138 TNWPYWNRSEGADHFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLK 197

Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
           D  +++P   P Q+    L    + R+   +F G      N P      R     VW N 
Sbjct: 198 DGSITIPPYAPPQKMQAHLIPPDTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWENF 257

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            ++ ++  S      Y +++  S FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 258 KNNPLFDISTDHPPTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 317

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + VA  D+P L  IL  I + + +L +  +L      Q  +FP      D
Sbjct: 318 ADAIPWEEIGVFVAEEDVPKLDSILTSIPT-DVILRKQRLLANPSMKQAMLFPQPAQAGD 376

Query: 309 AFYMVMYEL 317
           AF+ ++  L
Sbjct: 377 AFHQILNGL 385


>gi|115441967|ref|NP_001045263.1| Os01g0926600 [Oryza sativa Japonica Group]
 gi|75159222|sp|Q8S1X8.1|GT14_ORYSJ RecName: Full=Probable glucuronosyltransferase Os01g0926600
 gi|20160727|dbj|BAB89669.1| P0482D04.16 [Oryza sativa Japonica Group]
 gi|20805225|dbj|BAB92892.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
 gi|113534794|dbj|BAF07177.1| Os01g0926600 [Oryza sativa Japonica Group]
 gi|125528941|gb|EAY77055.1| hypothetical protein OsI_05013 [Oryza sativa Indica Group]
 gi|125573177|gb|EAZ14692.1| hypothetical protein OsJ_04617 [Oryza sativa Japonica Group]
 gi|215687152|dbj|BAG90922.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 415

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 84/312 (26%), Positives = 141/312 (45%), Gaps = 31/312 (9%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +           P  +   I  IS
Sbjct: 76  FAAEIFMHRFLLSSAIRTLNPEEADWFYTPVYTTCDLTPWGHPLPFKSPRIMRSAIQFIS 135

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
             +PYWNRT GADHF+V  H  G       +KA E  +  +  +     ++    H+  K
Sbjct: 136 SHWPYWNRTDGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKDHVCLK 195

Query: 147 D--VSLPQIWPRQED-----PPKLGSSKRNKLAFFAG----AVNSP-----VREKLLQVW 190
           +  +++P   P Q+      PP+   + R+   +F G      N P      R     VW
Sbjct: 196 EGSITIPPYAPPQKMKTHLVPPE---TPRSIFVYFRGLFYDTANDPEGGYYARGARASVW 252

Query: 191 RN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
            N  ++ ++  S      Y +++  S FCL   G+   + R+ +++ +GC+PVIIA+   
Sbjct: 253 ENFKNNPLFDISTDHPPTYYEDMQRSIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 312

Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY- 307
           LPFAD + W    + VA  D+P L  IL  I   + +L +  +L      Q  +FP    
Sbjct: 313 LPFADAIPWDEIGVFVAEDDVPKLDTILTSIPM-DVILRKQRLLANPSMKQAMLFPQPAQ 371

Query: 308 --DAFYMVMYEL 317
             DAF+ ++  L
Sbjct: 372 PGDAFHQILNGL 383


>gi|226499284|ref|NP_001142343.1| uncharacterized protein LOC100274514 precursor [Zea mays]
 gi|194708308|gb|ACF88238.1| unknown [Zea mays]
          Length = 419

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 83/309 (26%), Positives = 140/309 (45%), Gaps = 25/309 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P  AD F+ P ++   +           P  +   I  IS
Sbjct: 80  FAAEIFMHRFLLSSAIRTLNPEVADWFYTPVYTTCDLTPWGHPLPFRSPRIMRSAIQFIS 139

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
            ++PYWNRT GADHF+V  H  G       +KA E  +  +  +     ++    H+  K
Sbjct: 140 NRWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGVLPLLRRATLVQTFGQKDHVCLK 199

Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
           +  +++P   P Q+    L    + R+   +F G      N P      R     VW N 
Sbjct: 200 EGSITIPPYAPPQKMKTHLVPPGTPRSIFVYFRGLFYDTANDPEGGYYARGARASVWENF 259

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            ++ ++  S      Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 260 KNNPLFDISTDHPPTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 319

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + VA  D+P L  IL  I   E +L +  +L      Q  +FP      D
Sbjct: 320 ADAIPWEEIGVFVAEDDVPRLDTILTSIPM-EVILRKQRLLANPSMKQAMLFPQPAQAGD 378

Query: 309 AFYMVMYEL 317
           AF+ ++  L
Sbjct: 379 AFHQILNGL 387


>gi|198429575|ref|XP_002120379.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 482

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 76/251 (30%), Positives = 121/251 (48%), Gaps = 28/251 (11%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPR--GNYASESYFKKAFMKSHFVTKDPSKADL 58
           M  + +VY+Y  +         +  D  P   G YA E  F++   KS+F T+ P+ A  
Sbjct: 224 MVNNLKVYIYETK---------IGTDHHPHRVGGYAVERVFQELLEKSNFRTQHPNLATF 274

Query: 59  FFLPFSIARMRHDRRIGTEGIPD---FISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
           FF+P   +    D     EG+ +     ++ +  I  +YPYW+++ GA+HFY+  H +G 
Sbjct: 275 FFIPIRCSSYILDYPTEHEGLMEAKRVTANILHEIQTQYPYWSQSSGANHFYICSHDVGA 334

Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLP--------QIWPRQEDPPKLGSSK 167
              +   +   NAI +V ++ Y     I HKD+S+P         I    +    +    
Sbjct: 335 KVAEGLMK---NAIGLVSTADYDDPYFIPHKDISIPPTPSSGLSNIHLIGKGGALVDVRG 391

Query: 168 RNKLAFFAGAVNSP-VREKLLQVWRNDSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEV 225
           RN LAFFAG + S  +R    + W +D +I   +  LK + Y ++L  +KFCL  +G EV
Sbjct: 392 RNILAFFAGDITSGRIRPLAWRTWYSDQDIEIINRILKPSAYIEKLKKAKFCLIFRGKEV 451

Query: 226 NTARIADSLYY 236
            T  +  SLYY
Sbjct: 452 ITV-MCYSLYY 461


>gi|326511078|dbj|BAJ91886.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 415

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 83/312 (26%), Positives = 141/312 (45%), Gaps = 31/312 (9%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +           P  +   +  IS
Sbjct: 76  FAAEIFMHRFLLSSAIRTMNPEEADWFYTPVYTTCDLTPWGHPLPFKSPRIMRSAVQFIS 135

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
             +PYWNRT GADHF+V  H  G       +KA E  +  +  +     ++    H+  K
Sbjct: 136 SHWPYWNRTAGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKDHVCLK 195

Query: 147 D--VSLPQIWPRQED-----PPKLGSSKRNKLAFFAG----AVNSP-----VREKLLQVW 190
           +  +++P   P Q+      PP+   + R+   +F G      N P      R     VW
Sbjct: 196 EGSINIPPYAPPQKMKTHLVPPE---TPRSIFVYFRGLFYDTANDPEGGYYARGARASVW 252

Query: 191 RN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
            N  ++ ++  S      Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   
Sbjct: 253 ENFKNNPLFDISTDHPPTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 312

Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY- 307
           LPFAD + W    + VA  D+P L  IL  I   E +L +  +L      Q  +FP    
Sbjct: 313 LPFADAIPWDEIGMFVAEDDVPKLDTILTSIPM-EVILRKQRLLANPSMKQAMLFPQPAQ 371

Query: 308 --DAFYMVMYEL 317
             DAF+ ++  L
Sbjct: 372 AGDAFHQILNGL 383


>gi|226499780|ref|NP_001150738.1| secondary cell wall-related glycosyltransferase family 47 precursor
           [Zea mays]
 gi|195641374|gb|ACG40155.1| secondary cell wall-related glycosyltransferase family 47 [Zea
           mays]
 gi|223974207|gb|ACN31291.1| unknown [Zea mays]
          Length = 415

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 80/309 (25%), Positives = 138/309 (44%), Gaps = 25/309 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +           P  +   I  I+
Sbjct: 76  FAAEIFMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPKGLPLPFKSPRMMRSAIQLIA 135

Query: 92  QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS-----GHIAHK 146
             +PYWNR+ GADHF+V  H  G     +  +     I  +   +  +       H+  K
Sbjct: 136 TNWPYWNRSEGADHFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLK 195

Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
           D  +++P   P Q+    L    + R+   +F G      N P      R     VW N 
Sbjct: 196 DGSITIPPYAPPQKMQTHLIPADTPRSIFVYFRGLFYDTGNDPEGGYYARGARASVWENF 255

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            ++ ++  S      Y +++  S FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 256 KNNPLFDISTDHPPTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 315

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + VA  D+P L  IL  I + + +L +  +L      Q  +FP      D
Sbjct: 316 ADAIPWEEIGVFVAEEDVPRLDSILTSIPT-DVVLRKQRLLANPSMKQAMLFPQPAQAGD 374

Query: 309 AFYMVMYEL 317
           AF+ ++  L
Sbjct: 375 AFHQILNGL 383


>gi|357121010|ref|XP_003562215.1| PREDICTED: probable glucuronosyltransferase Os03g0107900-like
           [Brachypodium distachyon]
          Length = 441

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 86/323 (26%), Positives = 149/323 (46%), Gaps = 46/323 (14%)

Query: 30  RGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIA----------RMRHDRRIGTEGI 79
           R  +A+E    +A +  H+    P  ADLFF+P  ++           + H R +  + +
Sbjct: 113 RHLFAAEVALHEALL-GHYSAVRPEDADLFFVPVYVSCNFSTPNGFPSLSHARGLLADAV 171

Query: 80  PDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKL-NAI-----QVVC 133
            D +        ++ PYWNR+ GADH +VA H  G +      +V + + I     + + 
Sbjct: 172 -DLVR-------REAPYWNRSAGADHVFVASHDFG-ACFHPMEDVAIADGIPDFLKRSIL 222

Query: 134 SSSYFISG-HIAHKD---VSLPQIWPRQE-DPPKLGSSKRNKLAFFAGAV---------- 178
             ++ + G H+  +    V  P + P    +  +L  ++R+  AFF G +          
Sbjct: 223 LQTFGVQGPHVCQEAEHVVIPPHVPPEVALEILELEKTRRDIFAFFRGKMEVHPKNISGR 282

Query: 179 --NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYY 236
             +  VR +LLQ +  +S+ Y    R    Y  E+  S FCL   G+   + R+ +S+  
Sbjct: 283 FYSKKVRTELLQRYGRNSKFYLKRKRYDN-YRSEMARSLFCLCPLGWAPWSPRLVESVLL 341

Query: 237 GCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKV-- 294
           GC+PVIIA++  LPF  VL W   S+ VA  D+  L+K+L  + +    ++Q  +     
Sbjct: 342 GCIPVIIADNIRLPFPSVLRWSDISLQVAEKDVASLEKVLDHVVATNLTVIQKNLWDPVK 401

Query: 295 RKHFQWHVFPSDYDAFYMVMYEL 317
           RK   ++    + DA + V+ EL
Sbjct: 402 RKALVFNRRLEEGDATWQVLREL 424


>gi|357112427|ref|XP_003558010.1| PREDICTED: probable glycosyltransferase At3g07620-like
           [Brachypodium distachyon]
          Length = 464

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 80/297 (26%), Positives = 146/297 (49%), Gaps = 24/297 (8%)

Query: 50  TKDPSKADLFFLPF--SIARMRHDRRIGTEGIPD--FISHYIFNISQKYPYWNRTGGADH 105
            +DP++AD+ F+PF  S++  RH + +      +   +   +       P W R+GG DH
Sbjct: 143 VRDPARADVVFVPFFASLSFNRHSKVVPPARTSEDRALQRRLIEFLAARPEWRRSGGRDH 202

Query: 106 FYVACHSIGR-SAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLPQIWPRQED-PPK 162
             +A H  G   A  K W      + V+C    Y  S     KDV  P +          
Sbjct: 203 VVLAHHPNGMLDARYKLWP----CVFVLCDFGRYPHSVANIDKDVIAPYLHVVGNFFNDS 258

Query: 163 LGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGS 214
            G   R  L +F GA+       +R++L  + +++ +++   G +     ++    +  S
Sbjct: 259 AGYDARPTLLYFQGAIYRKDGGFIRQELYYLLKDEKDVHFSFGSVAGNGIEQSTQGMRAS 318

Query: 215 KFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PL 271
           KFCL++ G   ++ R+ DS+   CVP+II++  +LPF DVL++  F I+V  +D      
Sbjct: 319 KFCLNIAGDTPSSNRLFDSIVSHCVPIIISDEIELPFEDVLDYSKFCIIVRGVDAVKKGF 378

Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY-DAFYMVMYELWLRRSSVRVQ 327
           L  ++KGIS +E+  + + + +V +HF++  +PS + DA  M+   +  +  S+R++
Sbjct: 379 LINLIKGISRQEWTSMWNKLKEVERHFEYQ-YPSQHDDAVQMIWKTIARKVPSIRLK 434


>gi|302791649|ref|XP_002977591.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300154961|gb|EFJ21595.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 345

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 76/285 (26%), Positives = 132/285 (46%), Gaps = 35/285 (12%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISHY 86
           +A+E    +  ++S   T DP +AD FF+P  ++     R     G P      D +   
Sbjct: 32  FAAEVAIHQNLLRSPVRTLDPDEADFFFMPVYVSCNFTSR----SGFPTLFHASDILQAA 87

Query: 87  IFNISQKYPYWNRTGGADHFYVACHSIGRS--AMQKAW------EVKLNAI--QVVCSSS 136
           +  +S+  P+W+R  G DH +VA H  G    AM+         +   N+I  Q     +
Sbjct: 88  VGLVSRNMPFWDRHQGRDHVFVATHDFGACFHAMEDLAVAMGIPQFLRNSIILQTFGEKN 147

Query: 137 YFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAV------------NSPVRE 184
                ++ H  +  P + P ++ P   G  +R  LAFF G +            +  VR 
Sbjct: 148 KHPCQNVDHIQIP-PYVVPAKKLPDPRGQ-RRKILAFFRGKMEIHPKNVSGHMYSRGVRT 205

Query: 185 KLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIA 244
            + + + +D   +    R    Y  E+L S FCL   G+   + RI +S+  GC+PVIIA
Sbjct: 206 TIWRRFSHDRRFFIKRKRSDN-YKAEMLRSVFCLCPLGWAPWSPRIVESVIQGCIPVIIA 264

Query: 245 NHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQS 289
           ++  LP++ V++W+  S+ VA  D+  L +IL  +++    ++Q+
Sbjct: 265 DNIQLPYSHVIDWRKISVTVAERDVHKLDRILSKVAATNVSMIQA 309


>gi|60657598|gb|AAX33320.1| secondary cell wall-related glycosyltransferase family 47 [Populus
           tremula x Populus tremuloides]
          Length = 412

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 84/313 (26%), Positives = 132/313 (42%), Gaps = 33/313 (10%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P +    +           P  +   I  IS
Sbjct: 73  FAAEIFMHRFLLSSPVRTLNPDEADWFYSPIYPTCDLTPMGLPLPFKSPRMMRSAIQLIS 132

Query: 92  QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFI------------ 139
             +PYWNRT GADHF+V  H  G     +  +     I  +   S  +            
Sbjct: 133 SNWPYWNRTEGADHFFVVPHDFGACFHCQEEKAVERGILPLLQRSTLVQTFGRRNHVCLN 192

Query: 140 SGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAV----NSP-----VREKLLQVW 190
            G I     + PQ     + PP +    R+   +F G      N P      R     VW
Sbjct: 193 EGSITIPPFAPPQKMQAHQIPPDI---PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVW 249

Query: 191 RN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
            N  ++ ++  S    T Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   
Sbjct: 250 ENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 309

Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL----LLQSYVLKVRKHFQWHVFP 304
           LPFAD + W+   + VA  D+P L  IL  I  E  L    LL +  +K    F     P
Sbjct: 310 LPFADAIPWEEIGVFVAEEDVPNLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 369

Query: 305 SDYDAFYMVMYEL 317
              DAF+ ++  L
Sbjct: 370 G--DAFHQILNGL 380


>gi|212276027|ref|NP_001130448.1| uncharacterized protein LOC100191546 precursor [Zea mays]
 gi|194689154|gb|ACF78661.1| unknown [Zea mays]
 gi|413946366|gb|AFW79015.1| putative Secondary cell wall glycosyltransferase family 47 [Zea
           mays]
          Length = 415

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 80/309 (25%), Positives = 138/309 (44%), Gaps = 25/309 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +           P  +   I  I+
Sbjct: 76  FAAEIFMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPKGLPLPFKSPRMMRSAIQLIA 135

Query: 92  QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS-----GHIAHK 146
             +PYWNR+ GADHF+V  H  G     +  +     I  +   +  +       H+  K
Sbjct: 136 TNWPYWNRSEGADHFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLK 195

Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
           D  +++P   P Q+    L    + R+   +F G      N P      R     VW N 
Sbjct: 196 DGSITIPPYAPPQKMQTHLIPPDTPRSIFVYFRGLFYDTGNDPEGGYYARGARASVWENF 255

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            ++ ++  S      Y +++  S FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 256 KNNPLFDISTDHPPTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 315

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + VA  D+P L  IL  I + + +L +  +L      Q  +FP      D
Sbjct: 316 ADAIPWEEIGVFVAEDDVPQLDSILTSIPT-DVVLRKQRLLANPSMKQAMLFPQPAQPGD 374

Query: 309 AFYMVMYEL 317
           AF+ ++  L
Sbjct: 375 AFHQILNGL 383


>gi|302786830|ref|XP_002975186.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300157345|gb|EFJ23971.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 405

 Score = 98.2 bits (243), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 85/324 (26%), Positives = 152/324 (46%), Gaps = 41/324 (12%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISHY 86
           +A+E    +  ++S   T DP +AD FF+P  ++     R     G P      D +   
Sbjct: 81  FAAEVAIHQNLLRSPVRTLDPDEADFFFMPVYVSCNFTSR----SGFPTLFHASDILQAA 136

Query: 87  IFNISQKYPYWNRTGGADHFYVACHSIGRS--AMQKAWEVKLNAIQVVCSSSYFIS---- 140
           +  +S+  P+W+R  G DH +VA H  G    AM+    V +   Q + +S    +    
Sbjct: 137 VGLVSRNMPFWDRHQGRDHVFVATHDFGACFHAMED-LAVTMGIPQFLRNSIILQTFGEK 195

Query: 141 -----GHIAHKDVSLPQIWPRQEDP-PKLGSSKRNKLAFFAGAV------------NSPV 182
                 ++ H  +  P + P ++ P P+  S +R  LAFF G +            +  V
Sbjct: 196 NKHPCQNVDHIQIP-PYVVPAKKLPDPR--SQRRKILAFFRGKMEIHPKNVSGHMYSRGV 252

Query: 183 REKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVI 242
           R  + + + +D   +    R    Y  E+L S FCL   G+   + RI +S+  GC+PVI
Sbjct: 253 RTTIWRRFSHDRRFFIKRKRSDN-YKAEMLRSVFCLCPLGWAPWSPRIVESVIQGCIPVI 311

Query: 243 IANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK--VRKHFQW 300
           IA++  LP++ V++W+  S+ VA  D+  L +IL  +++    ++Q+ + +  VR+   +
Sbjct: 312 IADNIQLPYSHVIDWRKISVTVAERDVHKLDRILSRVAATNVSMIQANLWRDEVRQALVY 371

Query: 301 HVFPSDYDAFYMVMYELWLRRSSV 324
           +      DA + V+  L  R++ V
Sbjct: 372 NQPLVRGDATWQVLDLLSKRKNKV 395


>gi|194706890|gb|ACF87529.1| unknown [Zea mays]
          Length = 418

 Score = 98.2 bits (243), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 82/314 (26%), Positives = 144/314 (45%), Gaps = 35/314 (11%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +       T   P  +   I  +S
Sbjct: 79  FATEIFMHRFLLASAVRTLNPDEADWFYTPVYTTCDLTPWGHPLTVKSPRMMRSAIQYVS 138

Query: 92  QKYPYWNRTGGADHFYVACHSIG-------RSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
           +++PYWNRT GADHF+V  H  G         A+Q+     L    +V   ++    H+ 
Sbjct: 139 KRWPYWNRTEGADHFFVTPHDFGACFYFQEEKAIQRGVLPVLRRATLV--QTFGQKNHVC 196

Query: 145 HKD--VSLPQIWPRQED-----PPKLGSSKRNKLAFFAG----AVNSP-----VREKLLQ 188
            ++  +++P   P  +      PP+   + R+   +F G      N P      R     
Sbjct: 197 LREGSITIPPYAPPHKIRAHIVPPE---TPRSIFVYFRGLFYDTANDPEGGYYARGARAS 253

Query: 189 VWRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANH 246
           VW N  ++ ++  S      Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+ 
Sbjct: 254 VWENFKNNALFDISTEHPPTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 313

Query: 247 YDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSD 306
             LPFAD + W+  ++ V   D+  L  IL  I  +E +L +  +L      Q  +FP  
Sbjct: 314 IVLPFADAIPWEEIAVFVPEDDVLRLDTILTSIPMDE-ILRKQRLLANPSMKQAMLFPQP 372

Query: 307 Y---DAFYMVMYEL 317
               DAF+ V+  L
Sbjct: 373 AEPRDAFHQVLNGL 386


>gi|413946367|gb|AFW79016.1| secondary cell wall glycosyltransferase family 47 [Zea mays]
          Length = 418

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 82/314 (26%), Positives = 144/314 (45%), Gaps = 35/314 (11%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +       T   P  +   I  +S
Sbjct: 79  FATEIFMHRFLLASAVRTLNPDEADWFYTPVYTTCDLTPWGHPLTVKSPRMMRSAIQYVS 138

Query: 92  QKYPYWNRTGGADHFYVACHSIG-------RSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
           +++PYWNRT GADHF+V  H  G         A+Q+     L    +V   ++    H+ 
Sbjct: 139 KRWPYWNRTEGADHFFVTPHDFGACFYFQEEKAIQRGVLPVLRRATLV--QTFGQKNHVC 196

Query: 145 HKD--VSLPQIWPRQED-----PPKLGSSKRNKLAFFAG----AVNSP-----VREKLLQ 188
            ++  +++P   P  +      PP+   + R+   +F G      N P      R     
Sbjct: 197 LREGSITIPPYAPPHKIRAHIVPPE---TPRSIFVYFRGLFYDTANDPEGGYYARGARAS 253

Query: 189 VWRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANH 246
           VW N  ++ ++  S      Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+ 
Sbjct: 254 VWENFKNNALFDISTEHPPTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 313

Query: 247 YDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSD 306
             LPFAD + W+  ++ V   D+  L  IL  I  +E +L +  +L      Q  +FP  
Sbjct: 314 IVLPFADAIPWEEIAVFVPEDDVLRLDTILTSIPMDE-ILRKQRLLANPSMKQAMLFPQP 372

Query: 307 Y---DAFYMVMYEL 317
               DAF+ V+  L
Sbjct: 373 AEPRDAFHQVLNGL 386


>gi|224166016|ref|XP_002338879.1| predicted protein [Populus trichocarpa]
 gi|222873817|gb|EEF10948.1| predicted protein [Populus trichocarpa]
          Length = 208

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 62/201 (30%), Positives = 102/201 (50%), Gaps = 46/201 (22%)

Query: 83  ISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGH 142
           + H++ ++  +YPYWNRT GADHF++ C  I   A ++ W +  N+I+V+CS SY +  +
Sbjct: 11  VQHFVNSLISEYPYWNRTLGADHFFITCADIHVIASERIWNLMKNSIRVMCSPSYNVE-Y 69

Query: 143 IAHKDVSLPQ-------------------IWPRQEDPPKLGSSKRN-------------- 169
           + HKDVSLPQ                   I P  + P  L ++K N              
Sbjct: 70  VPHKDVSLPQSVQPFNVSVSQIMPPLYAFIAPTTQ-PLTLPAAKYNMKSRYRYLLCPWII 128

Query: 170 ---KLAFFAGAVNSPVREKLLQVWRNDSEI-------YAHSGRLKTPYADELLGSKFCLH 219
              + +F+ G   + +R+ L+  W NDSE+        A +  ++  Y ++   SKFC+ 
Sbjct: 129 LEQEYSFWRGLKENYIRKSLVNAWENDSELDIKEIQTEASTTEIRRLYHEKFYSSKFCIC 188

Query: 220 VKGFEVNTARIADSLYYGCVP 240
             G +++ A IA +++YGCVP
Sbjct: 189 PGGPQIDGA-IAVAIHYGCVP 208


>gi|224053575|ref|XP_002297880.1| predicted protein [Populus trichocarpa]
 gi|222845138|gb|EEE82685.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 84/313 (26%), Positives = 132/313 (42%), Gaps = 33/313 (10%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P +    +           P  +   I  IS
Sbjct: 73  FAAEIFMHRFLLSSPVRTLNPDEADWFYSPIYPTCDLTPMGLPLPFKSPRMMRSAIQLIS 132

Query: 92  QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFI------------ 139
             +PYWNRT GADHF+V  H  G     +  +     I  +   S  +            
Sbjct: 133 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRSTLVQTFGQRNHVCLN 192

Query: 140 SGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAV----NSP-----VREKLLQVW 190
            G I     + PQ     + PP +    R+   +F G      N P      R     VW
Sbjct: 193 EGSITIPPFAPPQKMQAHQIPPDI---PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVW 249

Query: 191 RN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
            N  ++ ++  S    T Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   
Sbjct: 250 ENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 309

Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL----LLQSYVLKVRKHFQWHVFP 304
           LPFAD + W+   + VA  D+P L  IL  I  E  L    LL +  +K    F     P
Sbjct: 310 LPFADAIPWEEIGVFVAEEDVPNLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 369

Query: 305 SDYDAFYMVMYEL 317
              DAF+ ++  L
Sbjct: 370 G--DAFHQILNGL 380


>gi|115458146|ref|NP_001052673.1| Os04g0398600 [Oryza sativa Japonica Group]
 gi|75143148|sp|Q7XLG3.2|GT42_ORYSJ RecName: Full=Probable glucuronosyltransferase Os04g0398600
 gi|38347167|emb|CAE05157.2| OSJNBa0039C07.13 [Oryza sativa Japonica Group]
 gi|113564244|dbj|BAF14587.1| Os04g0398600 [Oryza sativa Japonica Group]
 gi|116309465|emb|CAH66536.1| H0209H04.3 [Oryza sativa Indica Group]
 gi|125548129|gb|EAY93951.1| hypothetical protein OsI_15724 [Oryza sativa Indica Group]
 gi|125590243|gb|EAZ30593.1| hypothetical protein OsJ_14642 [Oryza sativa Japonica Group]
 gi|215706942|dbj|BAG93402.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215741563|dbj|BAG98058.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 420

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 83/333 (24%), Positives = 149/333 (44%), Gaps = 35/333 (10%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +           P  +   I  IS
Sbjct: 81  FAAEIFMHRFLLSSAVRTLNPKEADWFYTPVYTTCDLTPAGLPLPFKSPRVMRSAIQYIS 140

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
            K+P+WNRT GADHF+V  H  G       +KA E  +  +  +     ++    H+  K
Sbjct: 141 HKWPFWNRTDGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQENHVCLK 200

Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
           +  +++P   P Q+    L    + R+   +F G      N P      R     +W N 
Sbjct: 201 EGSITIPPYAPPQKMQAHLIPPDTPRSIFVYFRGLFYDTGNDPEGGYYARGARASLWENF 260

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            ++ ++  S      Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 261 KNNPLFDISTDHPPTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 320

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + V   D+P L  IL  +  ++ +L +  +L      Q  +FP      D
Sbjct: 321 ADAIPWEEIGVFVEEKDVPKLDTILTSMPIDD-ILRKQRLLANPSMKQAMLFPQPAQPRD 379

Query: 309 AFYMVMYEL----------WLRRSSVRVQWSTS 331
           AF+ ++  L          +L+ S  R+ W+  
Sbjct: 380 AFHQILNGLARKLPHPEGVYLQPSDKRLNWTAG 412


>gi|226529361|ref|NP_001147714.1| LOC100281324 precursor [Zea mays]
 gi|195613258|gb|ACG28459.1| secondary cell wall-related glycosyltransferase family 47 [Zea
           mays]
          Length = 418

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 82/314 (26%), Positives = 144/314 (45%), Gaps = 35/314 (11%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +       T   P  +   I  +S
Sbjct: 79  FATEIFMHRFLLASAVRTLNPDEADWFYTPVYTTCDLTPWGHPLTVKSPRMMRSAIQYVS 138

Query: 92  QKYPYWNRTGGADHFYVACHSIG-------RSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
           +++PYWNRT GADHF+V  H  G         A+Q+     L    +V   ++    H+ 
Sbjct: 139 KRWPYWNRTEGADHFFVTPHDFGACFYFQEEKAIQRGVLPVLRRATLV--QTFGQKNHVC 196

Query: 145 HKD--VSLPQIWPRQED-----PPKLGSSKRNKLAFFAG----AVNSP-----VREKLLQ 188
            ++  +++P   P  +      PP+   + R+   +F G      N P      R     
Sbjct: 197 LREGSITIPPYAPPHKIRAHIVPPE---TPRSIFVYFRGLFYDTANDPEGGYYARGARAS 253

Query: 189 VWRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANH 246
           VW N  ++ ++  S      Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+ 
Sbjct: 254 VWENFKNNALFDISTEHPPTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 313

Query: 247 YDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSD 306
             LPFAD + W+  ++ V   D+  L  IL  I  +E +L +  +L      Q  +FP  
Sbjct: 314 IVLPFADAIPWEEIAVFVPEDDVLRLDTILTSIPMDE-ILRKQRLLANPSMKQAMLFPQP 372

Query: 307 Y---DAFYMVMYEL 317
               DAF+ V+  L
Sbjct: 373 AEPRDAFHQVLNGL 386


>gi|363543513|ref|NP_001241766.1| secondary cell wall-related glycosyltransferase family 47 precursor
           [Zea mays]
 gi|195634597|gb|ACG36767.1| secondary cell wall-related glycosyltransferase family 47 [Zea
           mays]
          Length = 417

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 82/314 (26%), Positives = 144/314 (45%), Gaps = 35/314 (11%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +       T   P  +   I  +S
Sbjct: 78  FATEIFMHRFLLASAVRTLNPDEADWFYTPVYTTCDLTPWGHPLTVKSPRMMRSAIQYVS 137

Query: 92  QKYPYWNRTGGADHFYVACHSIG-------RSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
           +++PYWNRT GADHF+V  H  G         A+Q+     L    +V   ++    H+ 
Sbjct: 138 KRWPYWNRTEGADHFFVTPHDFGACFYFQEEKAIQRGVLPVLRRATLV--QTFGQKNHVC 195

Query: 145 HKD--VSLPQIWPRQED-----PPKLGSSKRNKLAFFAG----AVNSP-----VREKLLQ 188
            ++  +++P   P  +      PP+   + R+   +F G      N P      R     
Sbjct: 196 LREGSITIPPYAPPHKIRAHIVPPE---TPRSIFVYFRGLFYDTANDPEGGYYARGARAS 252

Query: 189 VWRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANH 246
           VW N  ++ ++  S      Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+ 
Sbjct: 253 VWENFKNNALFDISTEHPPTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 312

Query: 247 YDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSD 306
             LPFAD + W+  ++ V   D+  L  IL  I  +E +L +  +L      Q  +FP  
Sbjct: 313 IVLPFADAIPWEEIAVFVPEDDVLRLDTILTSIPMDE-ILRKQRLLANPSMKQAMLFPQP 371

Query: 307 Y---DAFYMVMYEL 317
               DAF+ V+  L
Sbjct: 372 AEPRDAFHQVLNGL 385


>gi|242042569|ref|XP_002468679.1| hypothetical protein SORBIDRAFT_01g050110 [Sorghum bicolor]
 gi|241922533|gb|EER95677.1| hypothetical protein SORBIDRAFT_01g050110 [Sorghum bicolor]
          Length = 429

 Score = 97.8 bits (242), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 78/292 (26%), Positives = 131/292 (44%), Gaps = 43/292 (14%)

Query: 30  RGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIA----------RMRHDRRIGTEGI 79
           R  +A+E    +A +        P  ADLFF+P  ++           + H R +  E +
Sbjct: 99  RHLFAAEVAVHEALLAYAGRAARPEDADLFFVPVYVSCNFSTPNGFPSLSHARGLLAEAV 158

Query: 80  PDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKL-NAI-----QVVC 133
            D +         + PYWNR+ GADH +VA H  G +      +V + + I     + + 
Sbjct: 159 -DLVR-------VRMPYWNRSAGADHVFVASHDFG-ACFHPMEDVAIADGIPEFLKRSIL 209

Query: 134 SSSYFISGHIAHKDVS----LPQIWPR-QEDPPKLGSSKRNKLAFFAGAV---------- 178
             ++ + GH   ++V      P + P    + P+   ++R+  AFF G +          
Sbjct: 210 LQTFGVQGHHVCQEVEHVVIPPHVPPEVAHELPEPEKAQRDIFAFFRGKMEVHPKNISGR 269

Query: 179 --NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYY 236
             +  VR +LLQ +  + + Y    R    Y  E+  S FCL   G+   + R+ +S+  
Sbjct: 270 FYSKKVRTELLQHYGRNRKFYLKRKRFDN-YRSEMARSLFCLCPLGWAPWSPRLVESVLL 328

Query: 237 GCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
           GC+PVIIA++  LPF  VL W   S+ VA  DI  L+ +L  + +    ++Q
Sbjct: 329 GCIPVIIADNIRLPFPSVLQWPEISLQVAEKDIANLEMVLDHVVATNLTMIQ 380


>gi|242061766|ref|XP_002452172.1| hypothetical protein SORBIDRAFT_04g021180 [Sorghum bicolor]
 gi|241932003|gb|EES05148.1| hypothetical protein SORBIDRAFT_04g021180 [Sorghum bicolor]
          Length = 432

 Score = 97.8 bits (242), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 83/312 (26%), Positives = 139/312 (44%), Gaps = 31/312 (9%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +           P  +   I  IS
Sbjct: 93  FAAEIFMHRFLLSSAVRTLNPEEADWFYTPVYTTCDLTPAGLPLPFKSPRMMRSAIKFIS 152

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
            K+P+WNRT GADHF+V  H  G       +KA E  +  +  +     ++    H+  K
Sbjct: 153 NKWPFWNRTDGADHFFVVPHDFGACFHYQEEKATERGILPMLRRATLVQTFGQKNHVCLK 212

Query: 147 DVSLPQIWPRQEDPPKLGS------SKRNKLAFFAGAV----NSP-----VREKLLQVWR 191
           + S+  I P    P K+ +      + R+   +F G      N P      R     +W 
Sbjct: 213 EGSI--IIPPYAPPQKMQAHLVPPDTPRSIFVYFRGLFYDNGNDPEGGYYARGARASLWE 270

Query: 192 N--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
           N  ++ ++  S      Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   L
Sbjct: 271 NFKNNPLFDISTDHPVTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 330

Query: 250 PFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL----LLQSYVLKVRKHFQWHVFPS 305
           PFAD + W+   + V   D+P L  IL  I  E  L    LL +  +K    F     P 
Sbjct: 331 PFADAIPWEDIGVFVDEEDVPKLDSILTSIPIENILRKQRLLANPSMKKAMLFPQPAQPR 390

Query: 306 DYDAFYMVMYEL 317
             DAF+ ++  L
Sbjct: 391 --DAFHQILNGL 400


>gi|357141481|ref|XP_003572240.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
           [Brachypodium distachyon]
          Length = 543

 Score = 97.4 bits (241), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 81/297 (27%), Positives = 145/297 (48%), Gaps = 23/297 (7%)

Query: 54  SKADLFFLPF--SIARMRHDRRIGTEGIPD--FISHYIFNISQKYPYWNRTGGADHFYVA 109
           S+AD+FF+PF  S++  RH +  G E +     +   +     +   W R GG DH  V 
Sbjct: 241 SQADVFFVPFFASLSYNRHSKLQGKEKMSRNRLLQAELVKYLARQEEWRRWGGKDHLVVP 300

Query: 110 CHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLP--QIWPRQEDPPKLGS 165
            H    ++M +A + KL+A   V S        +A+  KDV  P   +     D      
Sbjct: 301 HHP---NSMMQARK-KLSAAMYVLSDFGRYPPDVANLKKDVVAPYKHVVRSLRDDESPTF 356

Query: 166 SKRNKLAFFAGAVN----SPVREKLLQVWRNDSEIYAHSGRLKTP----YADELLGSKFC 217
            +R  LA+F GA++      VR+KL Q+ +++ +++   G ++          +  SKFC
Sbjct: 357 DQRPVLAYFQGAIHRKDGGKVRQKLYQLLKDEKDVHFTYGSVRQNGIRRATKGMASSKFC 416

Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLKK 274
           L++ G   ++ R+ D++   CVPV+I++  +LPF DVL++  F + V   D      L +
Sbjct: 417 LNIAGDTPSSNRLFDAIVSHCVPVMISDDIELPFEDVLDYSEFCVFVRASDAVRKGFLLR 476

Query: 275 ILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWSTS 331
           +L+GI+ +E+  +   + +V  HF++       DA  M+   +  +  S+++Q   S
Sbjct: 477 LLRGITRDEWNTMWERLKEVAHHFEYQYPSKPDDAVQMIWGAVARKMHSLKLQLHKS 533


>gi|356541948|ref|XP_003539434.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
           IRX7-like [Glycine max]
          Length = 459

 Score = 97.4 bits (241), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 78/291 (26%), Positives = 127/291 (43%), Gaps = 45/291 (15%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDF---------I 83
           +ASE    +A + S   T DP +AD FF+P  ++            + DF         I
Sbjct: 126 FASEVAIHRALLTSEVRTFDPYEADFFFVPVYVS-------CNFSAVNDFPAIGHARTLI 178

Query: 84  SHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISG-H 142
           S  +  +S +YP+WNR+ G+DH +VA H  G            + I ++  +S  +    
Sbjct: 179 SSAVNLVSTEYPFWNRSRGSDHVFVASHDFGACFHTLEDVAMADGIPIILKNSIVLQTFG 238

Query: 143 IAHKD--------VSLPQIWPRQ-----EDPPKLGSSKRNKLAFFAGAV----------- 178
           + H+         V  P + P       E  P  G  +R+  AFF G +           
Sbjct: 239 VIHQHPCQEVENVVIPPYVSPESVRSTLEKFPVTG--RRDIFAFFRGKMEVHPKNVSRRF 296

Query: 179 -NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYG 237
            +  VR ++ + +  D   Y    R    Y  E+  S FCL   G+   + R+ +S+  G
Sbjct: 297 YSKRVRTEIWRKFNGDRRFYLQRHRFAG-YQLEIARSVFCLCPLGWAPWSPRLVESVALG 355

Query: 238 CVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
           CVPV+IA+   LPF+  + W   S+ VA  D+  L KIL+ +++    ++Q
Sbjct: 356 CVPVVIADGIQLPFSSAVRWSEISLSVAERDVGKLGKILERVAATNLSVIQ 406


>gi|242079279|ref|XP_002444408.1| hypothetical protein SORBIDRAFT_07g021480 [Sorghum bicolor]
 gi|241940758|gb|EES13903.1| hypothetical protein SORBIDRAFT_07g021480 [Sorghum bicolor]
          Length = 539

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 88/313 (28%), Positives = 151/313 (48%), Gaps = 43/313 (13%)

Query: 52  DPSKADLFFLPF--SIARMRHDRRIGTEGIPD--FISHYIFNISQKYPYWNRTGGADHFY 107
           + S AD+FF+PF  S++  RH +    E +    F+   +     +   W R GG +H  
Sbjct: 219 NASLADVFFVPFFASLSYNRHSKLRRGEKVNRNRFLQAELVRYLMRKEEWRRWGGKNHLI 278

Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLPQIWPRQE----DPP 161
           V  H    ++M +A + KL+A   V S     S H+A+  KDV  P +   +     D P
Sbjct: 279 VPHHP---NSMMEARK-KLSAAMFVLSDFGRYSPHVANLKKDVIAPYMHVVRSFGDGDSP 334

Query: 162 KLGSSKRNKLAFFAGAV--------------NSP------VREKLLQVWRNDSEIYAHSG 201
                +R  LA+F GA+              N P      VR+KL Q+ +++ +++   G
Sbjct: 335 AF--DQRPILAYFQGAIHRKAVRALCSVLVANRPAFQGGKVRQKLYQLLKDERDVHFTYG 392

Query: 202 RLKTP----YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
            ++          +  SKFCL++ G   ++ R+ D++   CVPVII++  +LPF DVL++
Sbjct: 393 SVRQNGIRRATAGMSTSKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPFEDVLDY 452

Query: 258 KSFSIVVATLDIP---LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
             F + V + D      L ++L+GIS +E+  +   + KV +HF++       DA  M+ 
Sbjct: 453 SEFCVFVRSADAAKRGFLLRLLRGISRDEWTKMWMRLKKVTRHFEYQYPSRSGDAVQMIW 512

Query: 315 YELWLRRSSVRVQ 327
             +  +  SV++Q
Sbjct: 513 SAVARKMHSVQLQ 525


>gi|357472125|ref|XP_003606347.1| Secondary cell wall-related glycosyltransferase family [Medicago
           truncatula]
 gi|355507402|gb|AES88544.1| Secondary cell wall-related glycosyltransferase family [Medicago
           truncatula]
          Length = 427

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 88/319 (27%), Positives = 141/319 (44%), Gaps = 41/319 (12%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISHY 86
           +ASE    +A + S   T DP +AD FF+P  ++           G P        IS  
Sbjct: 95  FASEVAIHRALLTSDVRTFDPYEADFFFVPVYVS----CNFSTVNGFPAIGHARSLISSA 150

Query: 87  IFNISQKYPYWNRTGGADHFYVACHSIGRS-------AMQKAW-EVKLNAIQV-VCSSSY 137
           +  IS +YP+WNR+ G+DH +VA H  G         AM+    E+  N+I +     +Y
Sbjct: 151 VKLISTEYPFWNRSTGSDHVFVASHDFGSCFHTLEDVAMKDGVPEIMKNSIVLQTFGVTY 210

Query: 138 FISGHIAHKDVSLPQIWPRQ-----EDPPKLGSSKRNKLAFFAGAV------------NS 180
                     V  P + P       E+ P  G  +R+   FF G +            + 
Sbjct: 211 DHPCQKVEHVVIPPFVSPESVRNTLENFPVNG--RRDIWVFFRGKMEVHPKNVSGRFYSK 268

Query: 181 PVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
            VR  + + +  D   Y    R    Y  E+  S FCL   G+   + R+ +S+  GCVP
Sbjct: 269 KVRTVIWKKFNGDRRFYLRRHRFAG-YQSEIARSVFCLCPLGWAPWSPRLVESVALGCVP 327

Query: 241 VIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVL--KVRKHF 298
           VIIA+   LPF+  +NW   S+ VA  D+  L +IL+ +++    ++Q  +   + RK  
Sbjct: 328 VIIADSIRLPFSSAVNWPEISVTVAEKDVWRLGEILEKVAATNLSIIQRNLWDPRTRKAL 387

Query: 299 QWHVFPSDYDAFYMVMYEL 317
            ++    + DA + V++ L
Sbjct: 388 LFNSRVHEGDATWQVLHSL 406


>gi|255565439|ref|XP_002523710.1| transferase, putative [Ricinus communis]
 gi|223537014|gb|EEF38650.1| transferase, putative [Ricinus communis]
          Length = 461

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 93/331 (28%), Positives = 141/331 (42%), Gaps = 42/331 (12%)

Query: 33  YASESYFKKAFMKSHFV-TKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISH 85
           +ASE    KA   S  + T DP +AD FF+P  ++           G P        +S 
Sbjct: 133 FASEVAIHKAISNSDDIRTFDPYEADFFFVPVYVSCNFST----INGFPAIGHARSLLSS 188

Query: 86  YIFNISQKYPYWNRTGGADHFYVACHSIG-------RSAMQKAWE--VKLNAIQVVCSSS 136
            +  IS  YP+WNR+ GADH +VA H  G         AMQ      +K + I       
Sbjct: 189 AVTFISTNYPFWNRSQGADHVFVASHDFGSCFHTLEERAMQDGVPEFLKKSIILQTFGVK 248

Query: 137 YFISGHIAHKDVSLPQIWP---RQEDPPKLGSSKRNKLAFFAGAV------------NSP 181
           Y          V  P I P   R        + +R+   FF G +            +  
Sbjct: 249 YDHPCQQVENVVIPPYISPVSVRSTLKKAPLTGRRDIWVFFRGKMEVHPKNVSGRFYSKK 308

Query: 182 VREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPV 241
           VR ++ + +  D   Y    R    Y  E+  S FCL   G+   + R+ +S+  GCVPV
Sbjct: 309 VRTEIWRRFNGDRRFYLQRHRFAG-YQSEIARSVFCLCPLGWAPWSPRLVESVALGCVPV 367

Query: 242 IIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVL--KVRKHFQ 299
           IIA+   LPF   + W + S+ VA  D+  L +IL+ +++    L+Q  +    VR+   
Sbjct: 368 IIADGIRLPFPSAVPWPAISLTVAEKDVAKLGRILEDVAATNLTLIQKNIWDPTVRRALL 427

Query: 300 WHVFPSDYDAFYMVMYELWLR----RSSVRV 326
           ++    + DA + V+Y L  +    R +VRV
Sbjct: 428 FNDQIEEGDATWQVLYALTKKLDRSRRTVRV 458


>gi|357163140|ref|XP_003579636.1| PREDICTED: probable glucuronosyltransferase Os04g0398600-like
           [Brachypodium distachyon]
          Length = 429

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 79/311 (25%), Positives = 141/311 (45%), Gaps = 29/311 (9%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +           P  +   I  IS
Sbjct: 90  FAAEIFMHRFLLSSAVRTLNPKEADWFYAPVYTTCDLTPAGLPLPFKSPRVMRSAIQYIS 149

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
            K+P+WN+T GADHF+V  H  G       +KA E  +  +  +     ++    H+  K
Sbjct: 150 NKWPFWNKTDGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQENHVCLK 209

Query: 147 DVSLPQIWPRQEDPPKLGS------SKRNKLAFFAG----AVNSP-----VREKLLQVWR 191
           + S+  I P    P K+ +      + R+   +F G      N P      R     +W 
Sbjct: 210 EGSI--IIPPFAPPQKMQAHLIPPDTPRSIFVYFRGLFYDTGNDPEGGYYARGARASLWE 267

Query: 192 N--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
           N  ++ ++  S      Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   L
Sbjct: 268 NFKNNPLFDISTEHPATYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 327

Query: 250 PFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY-- 307
           PFAD + W+   + +   D+P L  IL  +  E+ +L +  +L      Q  +FP     
Sbjct: 328 PFADAIPWEEIGVFIEEKDVPKLDTILTSMPIED-ILRKQRLLANPSMKQAMLFPQPAQA 386

Query: 308 -DAFYMVMYEL 317
            DAF+ ++  L
Sbjct: 387 RDAFHQILNGL 397


>gi|413951430|gb|AFW84079.1| secondary cell wall glycosyltransferase family 47 [Zea mays]
          Length = 421

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 79/309 (25%), Positives = 137/309 (44%), Gaps = 25/309 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +           P  +   I  I+
Sbjct: 82  FAAEIFMHRFLLSSAVRTFNPEEADWFYTPVYATCDLTPSGLPLPFKSPRMMRSAIELIA 141

Query: 92  QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS-----GHIAHK 146
             +PYWNR+ GADHF+V  H  G     +  +     I  +   +  +       H+  K
Sbjct: 142 TNWPYWNRSEGADHFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLK 201

Query: 147 --DVSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
              +++P   P Q+    L    + R+   +F G      N P      R     VW N 
Sbjct: 202 GGSITIPPFAPPQKMQAHLIPLDTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWENF 261

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            ++ ++  S      Y +++  S FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 262 KNNPLFDISTDHPPTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 321

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + VA  D+P L  IL  I + + +L +  +L      Q  +FP      D
Sbjct: 322 ADAIPWEEIGVFVAEEDVPKLDSILTSIPT-DVILRKQRLLANPAMKQAMLFPQPAQAGD 380

Query: 309 AFYMVMYEL 317
           AF+ ++  L
Sbjct: 381 AFHQILNGL 389


>gi|42568020|ref|NP_197685.2| FRA8-like protein [Arabidopsis thaliana]
 gi|75127070|sp|Q6NMM8.1|F8H_ARATH RecName: Full=Probable glucuronoxylan glucuronosyltransferase F8H;
           AltName: Full=FRA8 homolog; AltName: Full=Protein
           FRAGILE FIBER 8 homolog
 gi|44681390|gb|AAS47635.1| At5g22940 [Arabidopsis thaliana]
 gi|48958521|gb|AAT47813.1| At5g22940 [Arabidopsis thaliana]
 gi|332005716|gb|AED93099.1| FRA8-like protein [Arabidopsis thaliana]
          Length = 469

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 86/324 (26%), Positives = 145/324 (44%), Gaps = 49/324 (15%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIA----------RMRHDRRIGTEGIPDF 82
           +A+E    +A + S   T DP +AD FF+P  ++           + H R + +  + DF
Sbjct: 136 FAAEVAIHRALLSSDVRTLDPDEADYFFVPVYVSCNFSTSNGFPSLSHARSLLSSAV-DF 194

Query: 83  ISHYIFNISQKYPYWNRTGGADHFYVACHSIGRS--AMQK-AWE------VKLNAIQVVC 133
           +S +       YP+WNR+ G+DH +VA H  G    AM+  A E      +K + I    
Sbjct: 195 LSDH-------YPFWNRSQGSDHVFVASHDFGACFHAMEDMAIEEGIPKFMKRSIILQTF 247

Query: 134 SSSYFISGHIAHKDVSLPQIWPRQ-----EDPPKLGSSKRNKLAFFAGAV---------- 178
              Y          V  P I P       E  P  G  +R+  AFF G +          
Sbjct: 248 GVKYKHPCQEVEHVVIPPYIPPESVQKAIEKAPVNG--RRDIWAFFRGKMEVNPKNISGR 305

Query: 179 --NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYY 236
             +  VR  +L+ +      Y +  R    Y  E++ S FCL   G+   + R+ +S   
Sbjct: 306 FYSKGVRTAILKKFGGRRRFYLNRHRFAG-YRSEIVRSVFCLCPLGWAPWSPRLVESAVL 364

Query: 237 GCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK--V 294
           GCVPV+IA+   LPF++ + W   S+ VA  D+  L+K+L+ +++     +Q  + +   
Sbjct: 365 GCVPVVIADGIQLPFSETVQWPEISLTVAEKDVRNLRKVLEHVAATNLSAIQRNLHEPVF 424

Query: 295 RKHFQWHVFPSDYDAFYMVMYELW 318
           ++   ++V   + DA + ++  LW
Sbjct: 425 KRALLYNVPMKEGDATWHILESLW 448


>gi|414882087|tpg|DAA59218.1| TPA: hypothetical protein ZEAMMB73_484283 [Zea mays]
          Length = 479

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 81/293 (27%), Positives = 140/293 (47%), Gaps = 20/293 (6%)

Query: 52  DPSKADLFFLPF--SIARMRHDRRIGTE--GIPDFISHYIFNISQKYPYWNRTGGADHFY 107
           DP  ADL F+PF  S++  RH R +  E  G    +   +       P W R GGADH  
Sbjct: 176 DPRDADLVFVPFFASLSYNRHSRPLPPEKVGRDKALQEKLVGYLTARPEWRRFGGADHVI 235

Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLP-QIWPRQEDPPKLGS 165
           VA H    +++  A      A+ V+     Y        KDV  P +   +       G 
Sbjct: 236 VAHHP---NSLLHARAALSPAVFVLSDFGRYPPRVASLEKDVIAPYKHMAKTFVNDSAGF 292

Query: 166 SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKFC 217
             R  L +F GA+       +R++L  + +++ ++Y   G ++   A +    +  SKFC
Sbjct: 293 DDRPTLLYFRGAIYRKEGGTIRQELYYMLKDEKDVYFSFGSVQDHGASKASQGMHSSKFC 352

Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLKK 274
           L++ G   ++ R+ D++   CVPVII++  +LP+ DVL++  FSI V + D      L +
Sbjct: 353 LNIAGDTPSSNRMFDAIVSHCVPVIISDDIELPYEDVLDYSKFSIFVRSSDAVEKGHLMR 412

Query: 275 ILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
           +L G+S + +  + S + +V +HF++       DA  M+   L  +  S++++
Sbjct: 413 LLSGVSKQRWTEMWSRLREVDRHFEYQYPSQKDDAVQMIWRSLSRKVPSIKLK 465


>gi|168059925|ref|XP_001781950.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666596|gb|EDQ53246.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 356

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 78/289 (26%), Positives = 131/289 (45%), Gaps = 31/289 (10%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGI---PDFISHYIFN 89
           +ASE    K  + S   T +P +AD FF+P   +  +   + G   +   P  +   + +
Sbjct: 31  FASEVAIHKILLSSPIKTLNPYEADFFFMPVYFS-CKFSSKTGFPRLGHAPKLMEDAVNH 89

Query: 90  ISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH---- 145
           +S    +WNR+GG DH +VA H  G        E   + I  +  SS  +     H    
Sbjct: 90  VSSMMEFWNRSGGKDHVFVAAHDFGACFHSLESEAIAHGIPEIVQSSLILQTFGVHGFHP 149

Query: 146 ----KDVSLP------QIWPRQEDPPKLGSSKRNKLAFFAG--AVNSPVREKLLQVWRND 193
               +++ +P       ++   + PP+    +RN  AFF G   +N      L+      
Sbjct: 150 CQAAENIQIPPYISPSTVFSYVKKPPE--EQRRNIFAFFRGKMEINPKNVSGLVYSRGVR 207

Query: 194 SEIYAHSGR-----LKTPYAD----ELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIA 244
           + IY    R     LK   AD    +LL S FCL   G+   + RI +++ YGCVPVIIA
Sbjct: 208 TYIYKKFSRNRRFFLKRHRADNYQLDLLRSTFCLCPLGWAPWSPRIVEAVAYGCVPVIIA 267

Query: 245 NHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK 293
           ++  LP++  ++W + S+ +   D+  L KIL  ++++    +Q  + K
Sbjct: 268 DNIRLPYSHAIDWSNMSLNIREHDVHKLYKILLNVAAKNLSSIQKNLWK 316


>gi|108863947|gb|ABA91286.2| secondary cell wall-related glycosyltransferase family 47,
           putative, expressed [Oryza sativa Japonica Group]
 gi|215769393|dbj|BAH01622.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 476

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 79/297 (26%), Positives = 144/297 (48%), Gaps = 28/297 (9%)

Query: 52  DPSKADLFFLPF--SIARMRHDRRIGTE------GIPDFISHYIFNISQKYPYWNRTGGA 103
           D   AD+ F+PF  S++  RH R +  E      G+ + +  Y+       P W R+GGA
Sbjct: 172 DSRDADVVFVPFFASLSYNRHSRVVPPEKVSRDKGLQERLVRYLM----AQPEWKRSGGA 227

Query: 104 DHFYVACHSIGRSAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLP-QIWPRQEDPP 161
           DH  VA H    +++  A  V   A+ V+     Y        KDV  P +   +     
Sbjct: 228 DHVIVAHHP---NSLLHARSVLFPAVFVLSDFGRYHPRVASLEKDVIAPYKHMAKTFVND 284

Query: 162 KLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLG 213
             G   R  L +F GA+       +R++L  + +++ ++Y   G ++   A +    +  
Sbjct: 285 SAGFDDRPTLLYFRGAIFRKEGGNIRQELHYMLKDEKDVYFAFGSVQDHGASKASQGMHA 344

Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---P 270
           SKFCL++ G   ++ R+ D++   CVPVII++  +LP+ D L++  FSI V + D     
Sbjct: 345 SKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPYEDALDYSKFSIFVRSSDAVKKG 404

Query: 271 LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
            L ++++G+S  ++ ++   + +V KHF++       DA  M+   L  +  ++R++
Sbjct: 405 YLMRLIRGVSKHQWTMMWRRLKEVDKHFEYQYPSQKDDAVQMIWQTLARKVPAIRLK 461


>gi|414864293|tpg|DAA42850.1| TPA: hypothetical protein ZEAMMB73_024068 [Zea mays]
          Length = 434

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 85/318 (26%), Positives = 140/318 (44%), Gaps = 48/318 (15%)

Query: 6   RVYVY--PHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPF 63
           R+YVY  P R N  +A          R  +A+E    +A +        P  ADLFF+P 
Sbjct: 79  RIYVYDLPARFNRDWAAADARCS---RHLFAAEVAVHEALLAYAGRAARPEDADLFFVPV 135

Query: 64  SIA----------RMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
            ++           + H R +  + + D +         + PYWNR+ GADH +VA H  
Sbjct: 136 YVSCNFSTPNGFPSLSHARGLLADAV-DLVR-------ARMPYWNRSAGADHVFVASHDF 187

Query: 114 GRSAMQKAWEVKL-NAI-----QVVCSSSYFISGHIAHKDVS----LPQIWPR-QEDPPK 162
           G +      +V + + I     + +   ++ + GH   ++V      P + P    + P+
Sbjct: 188 G-ACFHPMEDVAIADGIPEFLKRSILLQTFGVQGHHVCQEVEHVVIPPHVPPEVAHELPE 246

Query: 163 LGSSKRNKLAFFAGAV------------NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE 210
              ++R+  AFF G +            +  VR +LLQ +  + + Y    R    Y  E
Sbjct: 247 PEKAQRDIFAFFRGKMEVHPKNISGRFYSKKVRTELLQHYGRNRKFYLKRKRFDN-YRSE 305

Query: 211 LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIP 270
           +  S FCL   G+   + R+ +S+  GC+PVIIA+   LPF  VL W+  S+ VA  DI 
Sbjct: 306 MARSLFCLCPLGWAPWSPRLVESVLLGCIPVIIADDIRLPFPPVLQWQEISLQVAEKDIA 365

Query: 271 LLKKILKGISSEEYLLLQ 288
            L  +L  + +    ++Q
Sbjct: 366 SLGMVLDHVVATNLTVIQ 383


>gi|10177241|dbj|BAB10615.1| unnamed protein product [Arabidopsis thaliana]
          Length = 498

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 86/324 (26%), Positives = 145/324 (44%), Gaps = 49/324 (15%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIA----------RMRHDRRIGTEGIPDF 82
           +A+E    +A + S   T DP +AD FF+P  ++           + H R + +  + DF
Sbjct: 165 FAAEVAIHRALLSSDVRTLDPDEADYFFVPVYVSCNFSTSNGFPSLSHARSLLSSAV-DF 223

Query: 83  ISHYIFNISQKYPYWNRTGGADHFYVACHSIGRS--AMQK-AWE------VKLNAIQVVC 133
           +S +       YP+WNR+ G+DH +VA H  G    AM+  A E      +K + I    
Sbjct: 224 LSDH-------YPFWNRSQGSDHVFVASHDFGACFHAMEDMAIEEGIPKFMKRSIILQTF 276

Query: 134 SSSYFISGHIAHKDVSLPQIWPRQ-----EDPPKLGSSKRNKLAFFAGAV---------- 178
              Y          V  P I P       E  P  G  +R+  AFF G +          
Sbjct: 277 GVKYKHPCQEVEHVVIPPYIPPESVQKAIEKAPVNG--RRDIWAFFRGKMEVNPKNISGR 334

Query: 179 --NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYY 236
             +  VR  +L+ +      Y +  R    Y  E++ S FCL   G+   + R+ +S   
Sbjct: 335 FYSKGVRTAILKKFGGRRRFYLNRHRFAG-YRSEIVRSVFCLCPLGWAPWSPRLVESAVL 393

Query: 237 GCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK--V 294
           GCVPV+IA+   LPF++ + W   S+ VA  D+  L+K+L+ +++     +Q  + +   
Sbjct: 394 GCVPVVIADGIQLPFSETVQWPEISLTVAEKDVRNLRKVLEHVAATNLSAIQRNLHEPVF 453

Query: 295 RKHFQWHVFPSDYDAFYMVMYELW 318
           ++   ++V   + DA + ++  LW
Sbjct: 454 KRALLYNVPMKEGDATWHILESLW 477


>gi|212722274|ref|NP_001131151.1| uncharacterized protein LOC100192459 [Zea mays]
 gi|194690716|gb|ACF79442.1| unknown [Zea mays]
 gi|413955856|gb|AFW88505.1| hypothetical protein ZEAMMB73_716681 [Zea mays]
          Length = 391

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 80/294 (27%), Positives = 138/294 (46%), Gaps = 26/294 (8%)

Query: 54  SKADLFFLPF--SIARMRHDRRI--GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVA 109
           + AD+ F+PF  S++  RH R +    +     +   +       P W RTGG DH  +A
Sbjct: 73  ADADVVFVPFFASLSFNRHSRVVPPARDSEDRALQRRLLEFLAARPEWRRTGGRDHVVLA 132

Query: 110 CHSIGR-SAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLP---QIWPRQEDPPKLG 164
            H  G   A  + W      + V+C    Y  S     KDV  P    +     D    G
Sbjct: 133 HHPNGMLDARYRFWP----CVFVLCDFGRYPPSVANLDKDVIAPYRHLVANFAND--TAG 186

Query: 165 SSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKF 216
              R  L +F GA+       +R++L  + +++ +++   G +     ++    +  SKF
Sbjct: 187 YDDRPTLLYFQGAIYRKDGGFIRQELYYLLKDEKDVHFSFGSVAGNGIEQATQGMRSSKF 246

Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLK 273
           CL++ G   ++ R+ DS+   CVPV I++  +LPF DVL++  FS++V   D      L 
Sbjct: 247 CLNIAGDTPSSNRLFDSIVSHCVPVTISDEIELPFEDVLDYSKFSVIVRGADAVKKGFLM 306

Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
            ++KGIS EE+  + + + +V KHF++       DA  M+   +  +  S+R++
Sbjct: 307 NLIKGISREEWTRMWNRLKEVEKHFEYQYPSQTDDAVQMIWKAIARKVPSIRLK 360


>gi|224075447|ref|XP_002304637.1| predicted protein [Populus trichocarpa]
 gi|222842069|gb|EEE79616.1| predicted protein [Populus trichocarpa]
          Length = 413

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 82/312 (26%), Positives = 136/312 (43%), Gaps = 31/312 (9%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P +    +           P  +   I  +S
Sbjct: 74  FAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFNSPRMMRSAIQLLS 133

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
             +PYWNRT GADHF+V  H  G       +KA E  +  +  +     ++    H+   
Sbjct: 134 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQRNHVCLN 193

Query: 147 DVSLPQIWPRQEDPPKLGS------SKRNKLAFFAGAV----NSP-----VREKLLQVWR 191
           + S+    P    P K+ +      + R+   +F G      N P      R     VW 
Sbjct: 194 EGSI--TIPPYAPPQKMQAHQIPLDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWE 251

Query: 192 N--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
           N  ++ ++  S    T Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   L
Sbjct: 252 NFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 311

Query: 250 PFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL----LLQSYVLKVRKHFQWHVFPS 305
           PFAD + W+   + VA  D+P L   L  I  E  L    LL +  +K    F     P 
Sbjct: 312 PFADAIPWEEIGVFVAEEDVPHLDTFLTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPG 371

Query: 306 DYDAFYMVMYEL 317
             DAF+ ++  L
Sbjct: 372 --DAFHQILNGL 381


>gi|226503833|ref|NP_001149319.1| secondary cell wall-related glycosyltransferase family 47 precursor
           [Zea mays]
 gi|195626366|gb|ACG35013.1| secondary cell wall-related glycosyltransferase family 47 [Zea
           mays]
          Length = 421

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 79/309 (25%), Positives = 136/309 (44%), Gaps = 25/309 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +           P  +   I  I+
Sbjct: 82  FAAEIFMHRFLLSSAVRTFNPEEADWFYTPVYATCDLTPSGLPLPFKSPRMMRSAIELIA 141

Query: 92  QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS-----GHIAHK 146
             +PYWNR+ GADHF+V  H  G     +  +     I  +   +  +       H+  K
Sbjct: 142 TNWPYWNRSEGADHFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLK 201

Query: 147 --DVSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
              + +P   P Q+    L    + R+   +F G      N P      R     VW N 
Sbjct: 202 GGSIXIPPFAPPQKMQAHLIPLDTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWENF 261

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            ++ ++  S      Y +++  S FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 262 KNNPLFDISTDHPPTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 321

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + VA  D+P L  IL  I + + +L +  +L      Q  +FP      D
Sbjct: 322 ADAIPWEEIGVFVAEEDVPKLDSILTSIPT-DVILRKQRLLANPAMKQAMLFPQPAQAGD 380

Query: 309 AFYMVMYEL 317
           AF+ ++  L
Sbjct: 381 AFHQILNGL 389


>gi|194701034|gb|ACF84601.1| unknown [Zea mays]
 gi|413955857|gb|AFW88506.1| hypothetical protein ZEAMMB73_716681 [Zea mays]
          Length = 462

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 80/294 (27%), Positives = 138/294 (46%), Gaps = 26/294 (8%)

Query: 54  SKADLFFLPF--SIARMRHDRRI--GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVA 109
           + AD+ F+PF  S++  RH R +    +     +   +       P W RTGG DH  +A
Sbjct: 144 ADADVVFVPFFASLSFNRHSRVVPPARDSEDRALQRRLLEFLAARPEWRRTGGRDHVVLA 203

Query: 110 CHSIGR-SAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLP---QIWPRQEDPPKLG 164
            H  G   A  + W      + V+C    Y  S     KDV  P    +     D    G
Sbjct: 204 HHPNGMLDARYRFWP----CVFVLCDFGRYPPSVANLDKDVIAPYRHLVANFAND--TAG 257

Query: 165 SSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKF 216
              R  L +F GA+       +R++L  + +++ +++   G +     ++    +  SKF
Sbjct: 258 YDDRPTLLYFQGAIYRKDGGFIRQELYYLLKDEKDVHFSFGSVAGNGIEQATQGMRSSKF 317

Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLK 273
           CL++ G   ++ R+ DS+   CVPV I++  +LPF DVL++  FS++V   D      L 
Sbjct: 318 CLNIAGDTPSSNRLFDSIVSHCVPVTISDEIELPFEDVLDYSKFSVIVRGADAVKKGFLM 377

Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
            ++KGIS EE+  + + + +V KHF++       DA  M+   +  +  S+R++
Sbjct: 378 NLIKGISREEWTRMWNRLKEVEKHFEYQYPSQTDDAVQMIWKAIARKVPSIRLK 431


>gi|356516800|ref|XP_003527081.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
           [Glycine max]
          Length = 493

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 77/295 (26%), Positives = 140/295 (47%), Gaps = 23/295 (7%)

Query: 50  TKDPSKADLFFLPF--SIARMRHDRRIGTEGIP--DFISHYIFNISQKYPYWNRTGGADH 105
            +D S+AD+ F+PF  S++  RH +  G E +     +   +         W R+GG DH
Sbjct: 184 VQDSSQADVIFVPFFSSLSYNRHSKLNGEEKVSLNKMLQDRLVQFLMGQKEWKRSGGKDH 243

Query: 106 FYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLPQIWPRQEDPPKL 163
             VA H       ++    KL A  +V +        +A+  KD+  P        P   
Sbjct: 244 LIVAHHPNSLLDARR----KLGAAMLVLADFGRYPTELANIKKDIIAPYRHLVSTIPKAK 299

Query: 164 GSS--KRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLG 213
            +S  KR  L +F GA+       +R++L  + +++ +++   G +     ++    +  
Sbjct: 300 SASFEKRTTLVYFQGAIYRKDGGAIRQELYYLLKDEKDVHFTFGSIGGNGINQASQGMAM 359

Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---P 270
           SKFCL++ G   ++ R+ D++   CVPVII++  +LPF DVL++  FSI V   D     
Sbjct: 360 SKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDEIELPFEDVLDYSDFSIFVRASDSMKKG 419

Query: 271 LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
            L  +L+ I+ +E+  +   + ++  HF++       DA  M+  ++  + SS+R
Sbjct: 420 YLLNLLRSITQKEWSKMWERLKQITHHFEYQYPSQPGDAVNMIWQQVERKISSIR 474


>gi|194700342|gb|ACF84255.1| unknown [Zea mays]
 gi|414587472|tpg|DAA38043.1| TPA: secondary cell wall glycosyltransferase family 47 [Zea mays]
          Length = 427

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 79/311 (25%), Positives = 140/311 (45%), Gaps = 29/311 (9%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   + +         P  +   I  IS
Sbjct: 88  FAAEIFMHRFLLSSAVRTLNPKEADWFYTPVYTTCDLTNAGLPLPFKSPRVMRSAIQYIS 147

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
            K+P+WNRT GADHF+V  H          +KA E  +  +  +     ++    H+  K
Sbjct: 148 NKWPFWNRTDGADHFFVVPHDFAACFHYQEEKAIERGILPLLRRATLVQTFGQENHVCLK 207

Query: 147 DVSLPQIWPRQEDPPKLGS------SKRNKLAFFAG----AVNSP-----VREKLLQVWR 191
           + S+  I P    P K+ +      + R+   +F G      N P      R     +W 
Sbjct: 208 EGSI--IIPPYAPPQKMQAHLISPDTPRSIFVYFRGLFYDTGNDPEGGYYARGARASLWE 265

Query: 192 N--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
           N   + ++  S      Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   L
Sbjct: 266 NFKSNPLFDISTDHPATYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 325

Query: 250 PFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY-- 307
           PFAD + W+   + V   D+P L  IL  +  ++ +L +  +L      Q  +FP     
Sbjct: 326 PFADAIPWEEIGVFVEEKDVPKLDTILTSMPIDD-ILRKQRLLANPSMKQAMLFPQPAQP 384

Query: 308 -DAFYMVMYEL 317
            DAF+ ++  L
Sbjct: 385 RDAFHQILNGL 395


>gi|226499262|ref|NP_001149267.1| secondary cell wall-related glycosyltransferase family 47 precursor
           [Zea mays]
 gi|195625922|gb|ACG34791.1| secondary cell wall-related glycosyltransferase family 47 [Zea
           mays]
          Length = 427

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 79/311 (25%), Positives = 140/311 (45%), Gaps = 29/311 (9%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   + +         P  +   I  IS
Sbjct: 88  FAAEIFMHRFLLSSAVRTLNPKEADWFYTPVYTTCDLTNAGLPLPFKSPRVMRSAIQYIS 147

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
            K+P+WNRT GADHF+V  H          +KA E  +  +  +     ++    H+  K
Sbjct: 148 NKWPFWNRTDGADHFFVVPHDFAACFHYQEEKAIERGILPLLRRATLVQTFGQENHVCLK 207

Query: 147 DVSLPQIWPRQEDPPKLGS------SKRNKLAFFAG----AVNSP-----VREKLLQVWR 191
           + S+  I P    P K+ +      + R+   +F G      N P      R     +W 
Sbjct: 208 EGSI--IIPPYAPPQKMQAHLISPDTPRSIFVYFRGLFYDTGNDPEGGYYARGARASLWE 265

Query: 192 N--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
           N   + ++  S      Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   L
Sbjct: 266 NFKSNPLFDISTDHPATYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 325

Query: 250 PFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY-- 307
           PFAD + W+   + V   D+P L  IL  +  ++ +L +  +L      Q  +FP     
Sbjct: 326 PFADAIPWEEIGVFVEEKDVPKLDTILTSMPIDD-ILRKQRLLANPSMKQAMLFPQPAQP 384

Query: 308 -DAFYMVMYEL 317
            DAF+ ++  L
Sbjct: 385 RDAFHQILNGL 395


>gi|242072854|ref|XP_002446363.1| hypothetical protein SORBIDRAFT_06g014730 [Sorghum bicolor]
 gi|241937546|gb|EES10691.1| hypothetical protein SORBIDRAFT_06g014730 [Sorghum bicolor]
          Length = 430

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 80/309 (25%), Positives = 140/309 (45%), Gaps = 25/309 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   + +         P  +   I  IS
Sbjct: 91  FAAEIFMHRFLLSSAVRTLNPKEADWFYTPVYTTCDLTNAGLPLPFKSPRVMRSAIQYIS 150

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
            K+P+WNRT GADHF+V  H          +KA E  +  +  +     ++    H+  K
Sbjct: 151 NKWPFWNRTDGADHFFVVPHDFAACFHYQEEKAIERGILPLLRRATLVQTFGQENHVCLK 210

Query: 147 DVSL--PQIWPRQEDPPKLGS--SKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
           + S+  P   P Q+    L S  + R+   +F G      N P      R     +W N 
Sbjct: 211 EGSIIIPPFAPPQKMQAHLISPDTPRSIFVYFRGLFYDTGNDPEGGYYARGARASLWENF 270

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
             + ++  S      Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   LPF
Sbjct: 271 KSNPLFDISTDHPATYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 330

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + V   D+P L  IL  +  ++ +L +  +L      Q  +FP      D
Sbjct: 331 ADAIPWEEIGVFVEEKDVPKLDTILTSMPIDD-ILRKQRLLANPSMKQAMLFPQPAQPRD 389

Query: 309 AFYMVMYEL 317
           AF+ ++  L
Sbjct: 390 AFHQILNGL 398


>gi|293334733|ref|NP_001169191.1| uncharacterized protein LOC100383044 [Zea mays]
 gi|223975431|gb|ACN31903.1| unknown [Zea mays]
 gi|413957212|gb|AFW89861.1| hypothetical protein ZEAMMB73_311893 [Zea mays]
          Length = 428

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 75/291 (25%), Positives = 127/291 (43%), Gaps = 41/291 (14%)

Query: 30  RGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIA----------RMRHDRRIGTEGI 79
           R  +A+E    +A +        P  ADLFF+P  ++           + H R +  + +
Sbjct: 98  RHLFAAEVAVHEALLAYTGRAARPEDADLFFVPVYVSCNFSTPNGFPSLSHARGMLADAV 157

Query: 80  PDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAI-----QVVCS 134
            D +           PYWNR+ GADH +VA H  G            + I     + +  
Sbjct: 158 -DLVQ-------AGMPYWNRSAGADHVFVASHDFGACFHPMEDVAIADGIPEFLKRSILL 209

Query: 135 SSYFISGHIAHKDVS----LPQIWPRQE-DPPKLGSSKRNKLAFFAGAV----------- 178
            ++ + GH   ++V      P + P  E + P+   ++R+  AFF G +           
Sbjct: 210 QTFGVQGHHTCQEVEHVVIPPHVPPEVEHELPEPEKAQRDIFAFFRGKMEVHPKNISGRF 269

Query: 179 -NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYG 237
            +  VR +LLQ +  + + Y    R    Y  E+  S FCL   G+   + R+ +S+  G
Sbjct: 270 YSKKVRTELLQHYGRNRKFYLKRKRFDN-YRSEMARSLFCLCPLGWAPWSPRLVESVLLG 328

Query: 238 CVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
           C+PVIIA++  +PF  VL W   S+ VA  D+  L+ +L  + +    ++Q
Sbjct: 329 CIPVIIADNIRMPFPSVLQWPEISLQVAEKDVANLEVVLDHVVATNLSVIQ 379


>gi|357122399|ref|XP_003562903.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
           [Brachypodium distachyon]
          Length = 498

 Score = 94.4 bits (233), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 85/283 (30%), Positives = 137/283 (48%), Gaps = 44/283 (15%)

Query: 52  DPSKADLFFLPF--SIARMRHDRR---------------IGTEGIPDFISHYIFNISQKY 94
           DP++ADLF++PF  S++ + +  R                  E + D +  ++    ++ 
Sbjct: 173 DPAEADLFYVPFFSSLSLVVNPIRPLAAANASVAAAEPAYSDEAMQDELVEWL----ERQ 228

Query: 95  PYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLP- 151
           PYW R  G DH ++ C     +A+ +  +   NA+ +V S    + G  A   KDV LP 
Sbjct: 229 PYWRRHRGRDHVFI-CQDP--NALYRVVDRISNAVLLV-SDFGRLRGDQASLVKDVILPY 284

Query: 152 --QIWPRQEDPPKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIYAHSG---R 202
             +I P Q D   +    R  L FF G         VR+ L QV  N+ ++    G   R
Sbjct: 285 SHRINPFQGD---VSIEARPALLFFMGNRYRKEGGKVRDTLFQVLENEGDVIIKHGTQSR 341

Query: 203 LKTPYADELL-GSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFS 261
           +    A + +  SKFCLH  G   +  R+ D+L   CVPVII++H +LPF DV+++ + S
Sbjct: 342 VSRRMATQGMHSSKFCLHPAGDTPSACRLFDALVSLCVPVIISDHIELPFEDVIDYSNIS 401

Query: 262 IVVAT---LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
           I V T   +    L  +L+ +SSE  L  Q  + +V+ +F++ 
Sbjct: 402 IFVDTSKAVQPGFLTSMLRRVSSERILEYQREIKRVKHYFEYE 444


>gi|242041079|ref|XP_002467934.1| hypothetical protein SORBIDRAFT_01g036720 [Sorghum bicolor]
 gi|241921788|gb|EER94932.1| hypothetical protein SORBIDRAFT_01g036720 [Sorghum bicolor]
          Length = 462

 Score = 94.4 bits (233), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 80/294 (27%), Positives = 137/294 (46%), Gaps = 26/294 (8%)

Query: 54  SKADLFFLPF--SIARMRHDRRI--GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVA 109
           + AD+ F+PF  S++  RH R +    +     +   +       P W RTGG DH  +A
Sbjct: 144 ADADVVFVPFFASLSFNRHSRVVPPARDSEDRALQRRLLEFLAARPEWRRTGGRDHVVLA 203

Query: 110 CHSIGR-SAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLP---QIWPRQEDPPKLG 164
            H  G   A  + W      + V+C    Y  S     KDV  P    +     D    G
Sbjct: 204 HHPNGMLDARYRFWP----CVFVLCDFGRYPPSVANLDKDVIAPYRHLVANFAND--TAG 257

Query: 165 SSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKF 216
              R  L +F GA+       +R++L  + +++ +++   G +     ++    +  SKF
Sbjct: 258 YDDRPTLLYFQGAIYRKDGGSIRQELYYLLKDEKDVHFSFGSVAGNGIEQSTHGMRSSKF 317

Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLK 273
           CL++ G   ++ R+ DS+   CVPVII++  +LPF DVL++  FS++V   D      L 
Sbjct: 318 CLNIAGDTPSSNRLFDSIVSHCVPVIISDEIELPFEDVLDYSKFSVIVRGADAVKKGFLM 377

Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
            ++ GIS EE+  + + + +V KHF +       DA  M+   +  +  S+R++
Sbjct: 378 SLITGISQEEWAHMWNKLKEVEKHFVYQYPSQTDDAVQMIWKAIARKVPSIRLK 431


>gi|356547155|ref|XP_003541982.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
           IRX7-like [Glycine max]
          Length = 458

 Score = 94.4 bits (233), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 80/289 (27%), Positives = 127/289 (43%), Gaps = 41/289 (14%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISHY 86
           +ASE    +A + S   T DP +AD FF+P  ++           G P        IS  
Sbjct: 124 FASEVAIHRALLTSEVRTFDPYEADFFFVPVYVS----CNFSAVNGFPAIGHARTLISSA 179

Query: 87  IFNISQKYPYWNRTGGADHFYVACHSIGRS-------AMQKAWEVKLNAIQVVCSSSYFI 139
           +  +S +YP+WNR+ G+DH +VA H  G         AM      K+    +V  +   I
Sbjct: 180 VNLVSTEYPFWNRSRGSDHVFVASHDFGACFHTLEDVAMADGIP-KILKNSIVLQTFGVI 238

Query: 140 SGHIAH--KDVSLP------QIWPRQEDPPKLGSSKRNKLAFFAGAV------------N 179
             H     ++V +P       +    E  P  G  +R+  AFF G +            +
Sbjct: 239 HPHPCQDVENVVIPPYVAPESVRSTLEKFPVNG--RRDIWAFFRGKMEVHPKNVSGQFYS 296

Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCV 239
             VR ++ + +  D   Y    R    Y  E+  S FCL   G+   + R+ +S+  GCV
Sbjct: 297 KRVRTEIWRKFNGDRRFYLQRRRFAG-YQLEIARSVFCLCPLGWAPWSPRLVESVALGCV 355

Query: 240 PVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
           PV+IA+   LPF+  + W   S+ VA  D+  L KIL+ +++    ++Q
Sbjct: 356 PVVIADGIRLPFSSAVRWSEISLTVAERDVGKLGKILERVAATNLSVIQ 404


>gi|168049543|ref|XP_001777222.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162671450|gb|EDQ58002.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 368

 Score = 94.0 bits (232), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 80/285 (28%), Positives = 128/285 (44%), Gaps = 33/285 (11%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGI---PDFISHYIFN 89
           +ASE    K  + S   T DP +AD FF+P  ++  +   + G   +   P  +   + +
Sbjct: 46  FASEVAIHKILLTSPIRTLDPYEADFFFMPVYVS-CKFSPKTGFPWLGHAPKLMQAAVNH 104

Query: 90  ISQKYPYWNRTGGADHFYVACHSIG--------RSAMQKAWEVKLNAI--QVVCSSSYFI 139
           +S K  +WNR+ G DH +VA H  G        ++  Q   +   N++  Q      +  
Sbjct: 105 VSTKMEFWNRSWGRDHIFVAAHDYGACFHTLETQAIAQGIPQFMRNSLILQTFGVKGFHP 164

Query: 140 SGHIAHKDVSLPQIWPRQ-----EDPPKLGSSKRNKLAFFAG--AVNSPVREKLLQVWRN 192
                H  +  P I P       +DP  L   +R+  A+F G   +N      LL     
Sbjct: 165 CQAAEHIQIP-PYISPSVAVSYVKDP--LEHQQRDIFAYFRGKMEINPKNVSGLLYSKGI 221

Query: 193 DSEIYAHSGR-----LKTPYAD----ELLGSKFCLHVKGFEVNTARIADSLYYGCVPVII 243
            + +Y    R     LK    D    E+L S FCL   G+   + RI +++ YGC+PVII
Sbjct: 222 RTVLYKRFSRNKRFVLKRHRVDNSQQEMLRSTFCLCPLGWAPWSPRIVEAVTYGCIPVII 281

Query: 244 ANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
           A++  LP++  ++W S S+ V   D+P L KIL G++      +Q
Sbjct: 282 ADNISLPYSHTIDWSSISLTVPEHDVPKLDKILIGVAVTNLTAIQ 326


>gi|357149222|ref|XP_003575040.1| PREDICTED: probable glucuronosyltransferase Os02g0520750-like
           [Brachypodium distachyon]
          Length = 428

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 81/312 (25%), Positives = 136/312 (43%), Gaps = 31/312 (9%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +    S   T +P +AD F+ P ++   +           P  +   I  IS
Sbjct: 89  FAAEIFMHRFLFSSAVRTVNPEEADWFYTPVYTTCDLTRAGLPLPFKSPRMMRSAIQFIS 148

Query: 92  QKYPYWNRTGGADHFYVACHSIG-------RSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
            K+P+WNRT G DHF+V  H           +A+ +     L    +V   ++    H+ 
Sbjct: 149 NKWPFWNRTDGGDHFFVVPHDFAACFHYQEENAIARGILPLLRHATLV--QTFGQKNHVC 206

Query: 145 HKD--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAGAV----NSP-----VREKLLQVWR 191
            KD  +++P   P Q+    L    + R+   +F G      N P      R     +W 
Sbjct: 207 LKDGSITIPPYAPPQKMQAHLIPPDTPRSIFVYFRGLFYDNGNDPEGGYYARGARASLWE 266

Query: 192 N--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
           N  ++ ++  S      Y +++  S FCL   G+   + R+ +++ +GC+PVIIA+   L
Sbjct: 267 NFKNNPLFDISTEHPATYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 326

Query: 250 PFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL----LLQSYVLKVRKHFQWHVFPS 305
           PFAD + W    + V   D+P L  IL  I  ++ L    LL +  +K    F     P 
Sbjct: 327 PFADAIPWDEIGVFVDEEDVPKLDSILTSIPIDDILRKQRLLANPSMKKAMLFPQPAQPR 386

Query: 306 DYDAFYMVMYEL 317
             DAF+ ++  L
Sbjct: 387 --DAFHQILNGL 396


>gi|225436482|ref|XP_002275679.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase IRX7
           [Vitis vinifera]
 gi|297734915|emb|CBI17149.3| unnamed protein product [Vitis vinifera]
          Length = 460

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 82/319 (25%), Positives = 140/319 (43%), Gaps = 39/319 (12%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPD------FISHY 86
           +ASE    KA  +S   T DP +AD FF+P  ++           G P        ++  
Sbjct: 125 FASEVALHKALQESDVRTFDPWEADFFFVPVYVS----CNFSTVNGFPAIGHARPLLASA 180

Query: 87  IFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAI-----QVVCSSSYFISG 141
           I +IS + P+WNR+ GADH +VA H  G          + + I     + +   ++ +  
Sbjct: 181 IQHISTQLPFWNRSLGADHVFVASHDYGACFHAMEDVARADGIPEFLKKSIILQTFGVKH 240

Query: 142 HIAHKDVSLPQIWPRQEDPPKLGSS--------KRNKLAFFAGAV------------NSP 181
               +DV    I P    P K+ S+        +R+   FF G +            +  
Sbjct: 241 QHPCQDVENVLI-PPYVSPEKVQSTLDSAPANGQRDIWVFFRGKMEVHPKNISGRFYSKA 299

Query: 182 VREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPV 241
           VR  + Q +  + + Y    R    Y  E++ S FCL   G+   + R+ +S+  GCVPV
Sbjct: 300 VRTAIWQKYGGNRKFYLKRHRFAG-YQSEIVRSVFCLCPLGWAPWSPRLVESVVLGCVPV 358

Query: 242 IIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVL--KVRKHFQ 299
           IIA+   LPF++ + W   S+ VA  D+  L  IL+ +++     +Q  +   + ++   
Sbjct: 359 IIADGIRLPFSEAIRWPEISLTVAEKDVGKLGMILEDVAATNLSTIQKNLWDPENKRALL 418

Query: 300 WHVFPSDYDAFYMVMYELW 318
           ++    + DA + V+  LW
Sbjct: 419 FNNQVQEGDATWQVLNALW 437


>gi|413915952|gb|AFW55884.1| secondary cell wall glycosyltransferase family 47 [Zea mays]
          Length = 484

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 80/293 (27%), Positives = 139/293 (47%), Gaps = 20/293 (6%)

Query: 52  DPSKADLFFLPF--SIARMRHDRRIGTEGIPD--FISHYIFNISQKYPYWNRTGGADHFY 107
           D   ADL F+PF  S++  RH R +  E +     +   +       P W R GGADH  
Sbjct: 181 DSRDADLVFVPFFASLSYNRHYRPVPPEKVSRDRVLQEKLVRYLAARPEWRRYGGADHVI 240

Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLP-QIWPRQEDPPKLGS 165
           VA H    +++  A  V   A+ V+     Y        KDV  P +   +       G 
Sbjct: 241 VAHHP---NSLLHARAVLHPAVFVLSDFGRYPPRVASLEKDVIAPYKHMAKTYANDSAGF 297

Query: 166 SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKFC 217
             R  L +F GA+       +R++L  + + + ++Y   G ++   A +    +  SKFC
Sbjct: 298 DDRPTLLYFRGAIYRKEGGSIRQELYYMLKEEKDVYFSFGSVQDHGASKASQGMHSSKFC 357

Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLKK 274
           L++ G   ++ R+ D++   CVPVII++  +LP+ DVL++  FSI V + D      L +
Sbjct: 358 LNIAGDTPSSNRLFDAIVTHCVPVIISDDIELPYEDVLDYSKFSIFVRSSDAVKKGYLMR 417

Query: 275 ILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
           +L G+S +++  +   + +V KHF++       DA  M+   L  +  S++++
Sbjct: 418 LLSGVSKQQWTKMWDRLKEVDKHFEYQYPSQKDDAVQMIWQALSRKVPSIKLK 470


>gi|218185155|gb|EEC67582.1| hypothetical protein OsI_34941 [Oryza sativa Indica Group]
          Length = 483

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 79/304 (25%), Positives = 144/304 (47%), Gaps = 35/304 (11%)

Query: 52  DPSKADLFFLPF--SIARMRHDRRIGTE------GIPDFISHYIFNISQKYPYWNRTGGA 103
           D   AD+ F+PF  S++  RH R +  E      G+ + +  Y+       P W R+GGA
Sbjct: 172 DSRDADVVFVPFFASLSYNRHSRVVPPEKVSRDKGLQERLVRYLM----AQPEWKRSGGA 227

Query: 104 DHFYVACHSIGRSAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLP-QIWPRQEDPP 161
           DH  VA H    +++  A  V   A+ V+     Y        KDV  P +   +     
Sbjct: 228 DHVIVAHHP---NSLLHARSVLFPAVFVLSDFGRYHPRVASLEKDVIAPYKHMAKTFVND 284

Query: 162 KLGSSKRNKLAFFAGAV-----------NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE 210
             G   R  L +F GA+              +R++L  + +++ ++Y   G ++   A +
Sbjct: 285 SAGFDDRPTLLYFRGAIFRKEVKIDSWKGGNIRQELHYMLKDEKDVYFAFGSVQDHGASK 344

Query: 211 ----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
               +  SKFCL++ G   ++ R+ D++   CVPVII++  +LP+ D L++  FSI V +
Sbjct: 345 ASQGMHASKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPYEDALDYSKFSIFVRS 404

Query: 267 LDI---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSS 323
            D      L ++++G+S  ++ ++   + +V KHF++       DA  M+   L  +  +
Sbjct: 405 SDAVKKGYLMRLIRGVSKHQWTMMWRRLKEVDKHFEYQYPSQKDDAVQMIWQTLARKVPA 464

Query: 324 VRVQ 327
           +R++
Sbjct: 465 IRLK 468


>gi|356539378|ref|XP_003538175.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
           IRX7-like [Glycine max]
          Length = 460

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 81/290 (27%), Positives = 124/290 (42%), Gaps = 43/290 (14%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISHY 86
           +ASE    +A + S   T DP  AD FF+P  ++           G P        I+  
Sbjct: 128 FASEVAIHRALLTSEVRTFDPYDADFFFVPVYVS----CNFSTVNGFPAIGHARSLIASA 183

Query: 87  IFNISQKYPYWNRTGGADHFYVACHSIGRS-------AMQKAW-EVKLNAIQVVCSSSYF 138
           +  IS +YP+WNR+ G+DH +VA H  G         AM     E+  N+I  V  +   
Sbjct: 184 VNLISSEYPFWNRSRGSDHVFVASHDFGSCFHTLEDVAMADGVPEIVRNSI--VLQTFGV 241

Query: 139 ISGHIAHKD--------VSLPQIWPRQEDPPKLGSSKRNKLAFFAGAV------------ 178
           +  H   K         VS   +    E+ P  G  +R+  AFF G +            
Sbjct: 242 VFDHPCQKVEHVVIPPYVSPESVRDTMENFPVDG--RRDIWAFFRGKMEVHPKNVSGRFY 299

Query: 179 NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGC 238
           +  VR  + + +  D   Y    R    Y  E+  S FCL   G+   + R+ +S+  GC
Sbjct: 300 SKEVRTVIWRKFNGDRRFYLQRHRFAG-YQSEIARSVFCLCPLGWAPWSPRLVESVALGC 358

Query: 239 VPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
           VPV+IA+   LPF   + W   S+ VA  D+  L +IL+ +++     +Q
Sbjct: 359 VPVVIADGIRLPFVSAVKWSEISVTVAEKDVGRLAEILERVAATNLSTIQ 408


>gi|449484890|ref|XP_004157009.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10-like [Cucumis
           sativus]
          Length = 417

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 81/314 (25%), Positives = 140/314 (44%), Gaps = 35/314 (11%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISHY 86
           +A+E Y  +  + S   T +P +AD F+ P  +        +   G+P        +   
Sbjct: 78  FAAEIYMHRFLLNSPVRTLNPDEADWFYTPIYVTC-----DLTPNGLPLPFKSPRMMRSA 132

Query: 87  IFNISQKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISG 141
           I  IS  +PYWNRT GADHF+V  H  G       +KA +  +  +  +     ++    
Sbjct: 133 IQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIDRGILPLLQRATLVQTFGQRN 192

Query: 142 HIAHKD--VSLPQIWPRQEDPPKLGSSK--RNKLAFFAGAV----NSP-----VREKLLQ 188
           H+   +  +++P   P Q+    L  S+  R+   +F G      N P      R     
Sbjct: 193 HVCLNEGSITIPPYCPPQKMKTHLIPSETPRSIFVYFRGLFYDVNNDPEGGYYARGARAA 252

Query: 189 VWRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANH 246
           VW N  ++ ++  S    T Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+ 
Sbjct: 253 VWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 312

Query: 247 YDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSD 306
             LPFAD + W+   + V   D+  L  IL  I   + +L +  +L      +  +FP  
Sbjct: 313 IVLPFADAIPWEEIGVFVDEKDVSNLDTILTSIPP-DVILRKQRLLANPSMKRAMMFPQP 371

Query: 307 Y---DAFYMVMYEL 317
               DAF+ ++  L
Sbjct: 372 AQSGDAFHQILNGL 385


>gi|115487106|ref|NP_001066040.1| Os12g0124400 [Oryza sativa Japonica Group]
 gi|77552918|gb|ABA95714.1| secondary cell wall-related glycosyltransferase family 47,
           putative, expressed [Oryza sativa Japonica Group]
 gi|113648547|dbj|BAF29059.1| Os12g0124400 [Oryza sativa Japonica Group]
 gi|215678612|dbj|BAG92267.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218186350|gb|EEC68777.1| hypothetical protein OsI_37312 [Oryza sativa Indica Group]
          Length = 475

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 76/293 (25%), Positives = 140/293 (47%), Gaps = 20/293 (6%)

Query: 52  DPSKADLFFLPF--SIARMRHDRRIGTEGIP--DFISHYIFNISQKYPYWNRTGGADHFY 107
           D   AD+ F+PF  S++  RH R +  E +     +   +       P W R+GGADH  
Sbjct: 172 DSRDADVVFVPFFASLSYNRHSRVVPPEKVSRDKELQEKLVRYLMAQPEWKRSGGADHVI 231

Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLP-QIWPRQEDPPKLGS 165
           VA H    +++  A  V    + V+     Y        KDV  P +   +       G 
Sbjct: 232 VAHHP---NSLLHARSVLFPVVFVLSDFGRYHPRVASLEKDVIAPYKHMAKTFVNDSAGF 288

Query: 166 SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKFC 217
             R  L +F GA+       +R++L  + +++ ++Y   G ++   A +    +  SKFC
Sbjct: 289 DDRPTLLYFRGAIFRKEGGNIRQELYYMLKDEKDVYFAFGSVQDHGASKASKGMHASKFC 348

Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLKK 274
           L++ G   ++ R+ D++   CVPVII++  +LP+ D L++  FSI V + D      L +
Sbjct: 349 LNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPYEDALDYSKFSIFVRSSDAVKKGYLMR 408

Query: 275 ILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
           +++G+S  ++  + + + +V KHF++       DA  M+   L  +  ++R++
Sbjct: 409 LIRGVSKHQWTRMWNRLKEVDKHFEYQYPSQKDDAVQMIWQALARKVPAIRLK 461


>gi|356542668|ref|XP_003539788.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
           IRX7-like [Glycine max]
          Length = 461

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 79/288 (27%), Positives = 124/288 (43%), Gaps = 39/288 (13%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISHY 86
           +ASE    +A + S   T DP  AD FF+P  ++           G P        I+  
Sbjct: 129 FASEVAIHRALLTSEVRTFDPYDADFFFVPVYVS----CNFSTVNGFPAIGHARSLIASA 184

Query: 87  IFNISQKYPYWNRTGGADHFYVACHSIGRS-------AMQKAW-EVKLNAIQVVCSSSYF 138
           +  +S +YP+WNR+ G+DH +VA H  G         AM     E+  N+I +      +
Sbjct: 185 VSLVSSEYPFWNRSRGSDHVFVASHDFGSCFHTLEDVAMADGVPEIMRNSIVLQTFGVVY 244

Query: 139 ISGHIAHKDVSLP------QIWPRQEDPPKLGSSKRNKLAFFAGAV------------NS 180
                + + V +P       +    E+ P  G  +R+  AFF G +            + 
Sbjct: 245 DHPCQSVEHVVIPPYVSPESVRDTMENFPVNG--RRDIWAFFRGKMELHPKNVSGRFYSK 302

Query: 181 PVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
            VR  + + +  D   Y    R    Y  E+  S FCL   G+   + R+ +S+  GCVP
Sbjct: 303 KVRTVIWRKFNGDRRFYLQRQRFAG-YQSEIARSVFCLCPLGWAPWSPRLVESVALGCVP 361

Query: 241 VIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
           VIIA+   LPF   + W   SI VA  D+  L +IL+ +++     +Q
Sbjct: 362 VIIADGIRLPFISAVKWPEISITVAEKDVGRLAEILERVAATNLSTIQ 409


>gi|359491711|ref|XP_002284930.2| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase isoform 1
           [Vitis vinifera]
          Length = 498

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 76/294 (25%), Positives = 141/294 (47%), Gaps = 21/294 (7%)

Query: 50  TKDPSKADLFFLPF--SIARMRHDRRIGTE--GIPDFISHYIFNISQKYPYWNRTGGADH 105
            K+ S+AD+ F+PF  S++  RH R  G E   +   +   + N       W + GG +H
Sbjct: 189 VKNSSQADIIFVPFFSSLSYNRHSRLHGKEKVSVNKMLQDKLVNFLMGQDEWKQLGGKNH 248

Query: 106 FYVACHSIGRSAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLPQIWPRQEDPPKLG 164
             VA H    ++M  A +   +A+ V+     Y +      KDV  P     + +P    
Sbjct: 249 LIVAHHP---NSMLDARKKLGSAMFVLADFGRYPVEIANIDKDVIAPYKHVLRSNPVADS 305

Query: 165 SS--KRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLK----TPYADELLGS 214
           ++   R  L +F GA+       +R++L  + R++ +++   G ++       ++ +  S
Sbjct: 306 ATFEGRPLLVYFQGAIYRKDGGAIRQELYYLLRDEKDVHFTFGSVRGNGINGASEGMASS 365

Query: 215 KFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PL 271
           KFCL++ G   ++ R+ D++   CVPVII++  +LPF DVL++  F I V   D      
Sbjct: 366 KFCLNIAGDTPSSNRLFDAIVSHCVPVIISDEIELPFEDVLDYSEFCIFVRASDAVKNGF 425

Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           L  +L+GI  E++  +   + ++  HF++       DA  M+   +  + SS++
Sbjct: 426 LLNLLRGIKREKWTKMWERLKEIAHHFEYQYPSQAGDAVDMIWGAVSRKISSIQ 479


>gi|297733976|emb|CBI15223.3| unnamed protein product [Vitis vinifera]
          Length = 455

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 76/294 (25%), Positives = 141/294 (47%), Gaps = 21/294 (7%)

Query: 50  TKDPSKADLFFLPF--SIARMRHDRRIGTE--GIPDFISHYIFNISQKYPYWNRTGGADH 105
            K+ S+AD+ F+PF  S++  RH R  G E   +   +   + N       W + GG +H
Sbjct: 146 VKNSSQADIIFVPFFSSLSYNRHSRLHGKEKVSVNKMLQDKLVNFLMGQDEWKQLGGKNH 205

Query: 106 FYVACHSIGRSAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLPQIWPRQEDPPKLG 164
             VA H    ++M  A +   +A+ V+     Y +      KDV  P     + +P    
Sbjct: 206 LIVAHHP---NSMLDARKKLGSAMFVLADFGRYPVEIANIDKDVIAPYKHVLRSNPVADS 262

Query: 165 SS--KRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLK----TPYADELLGS 214
           ++   R  L +F GA+       +R++L  + R++ +++   G ++       ++ +  S
Sbjct: 263 ATFEGRPLLVYFQGAIYRKDGGAIRQELYYLLRDEKDVHFTFGSVRGNGINGASEGMASS 322

Query: 215 KFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PL 271
           KFCL++ G   ++ R+ D++   CVPVII++  +LPF DVL++  F I V   D      
Sbjct: 323 KFCLNIAGDTPSSNRLFDAIVSHCVPVIISDEIELPFEDVLDYSEFCIFVRASDAVKNGF 382

Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
           L  +L+GI  E++  +   + ++  HF++       DA  M+   +  + SS++
Sbjct: 383 LLNLLRGIKREKWTKMWERLKEIAHHFEYQYPSQAGDAVDMIWGAVSRKISSIQ 436


>gi|226529660|ref|NP_001151894.1| secondary cell wall-related glycosyltransferase family 47 [Zea
           mays]
 gi|195650683|gb|ACG44809.1| secondary cell wall-related glycosyltransferase family 47 [Zea
           mays]
          Length = 488

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 80/293 (27%), Positives = 139/293 (47%), Gaps = 20/293 (6%)

Query: 52  DPSKADLFFLPF--SIARMRHDRRIGTEGIPD--FISHYIFNISQKYPYWNRTGGADHFY 107
           D   ADL F+PF  S++  RH R +  E +     +   +       P W R GGADH  
Sbjct: 185 DSRDADLVFVPFFASLSYNRHYRPVPPEKVSRDRALQEKLVRYLAARPEWRRFGGADHVI 244

Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLP-QIWPRQEDPPKLGS 165
           VA H    +++  A  V   A+ V+     Y        KDV  P +   +       G 
Sbjct: 245 VAHHP---NSLLHARAVLHPAVFVLSDFGRYPPRVASLEKDVIAPYKHMAKTYANDSAGF 301

Query: 166 SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKFC 217
             R  L +F GA+       +R++L  + + + ++Y   G ++   A +    +  SKFC
Sbjct: 302 DDRPTLLYFRGAIYRKEGGSIRQELYYMLKEEKDVYFSFGSVQDHGASKASQGMHSSKFC 361

Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLKK 274
           L++ G   ++ R+ D++   CVPVII++  +LP+ DVL++  FSI V + D      L +
Sbjct: 362 LNIAGDTPSSNRLFDAIVTHCVPVIISDDIELPYEDVLDYSKFSIFVRSSDAVKKGYLMR 421

Query: 275 ILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
           +L G+S +++  +   + +V KHF++       DA  M+   L  +  S++++
Sbjct: 422 LLSGVSKQQWTKMWDRLKEVDKHFEYQYPSQKDDAVQMIWQALSRKVPSIKLK 474


>gi|413932967|gb|AFW67518.1| hypothetical protein ZEAMMB73_420245 [Zea mays]
          Length = 386

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 80/311 (25%), Positives = 140/311 (45%), Gaps = 30/311 (9%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +           P  +   I  I+
Sbjct: 48  FAAEIFMHRLLLSSAVRTFNPEEADWFYTPVYATCDLTPSGLPLPFKSPRMMLSAIELIA 107

Query: 92  QKYPYWNRTGGADHFYVACHSIG-------RSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
             +PYWNR+ GADHF+V  H  G         A+ +     L    +V   ++    H+ 
Sbjct: 108 TNWPYWNRSEGADHFFVTPHDFGACFHYQDEKAIGRGILPLLQHATLV--QTFGQKNHVC 165

Query: 145 HK--DVSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWR 191
            K   +++P   P Q+    L    + R+   +F G      N P      R     VW 
Sbjct: 166 LKGGSITIPPFAPPQKMQAHLIPADTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWE 225

Query: 192 N--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
           N  ++ ++  S    + Y +++  S FCL   G+   + R+ +++ +GC+P+IIA+   L
Sbjct: 226 NFKNNPLFDISTDHPSTYYEDMERSVFCLCPLGWAPWSPRLVEAVVFGCIPLIIADIV-L 284

Query: 250 PFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY-- 307
           PFAD + W+   + VA  D+P L  IL  I + + +L +  +L      Q  +FP     
Sbjct: 285 PFADAIPWEEIGVFVAEEDVPKLDSILTSIPT-DVILRKQRLLANPSMKQAMLFPQPAQA 343

Query: 308 -DAFYMVMYEL 317
            DAF+ ++  L
Sbjct: 344 GDAFHQILNGL 354


>gi|222616547|gb|EEE52679.1| hypothetical protein OsJ_35064 [Oryza sativa Japonica Group]
          Length = 528

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 76/293 (25%), Positives = 140/293 (47%), Gaps = 20/293 (6%)

Query: 52  DPSKADLFFLPF--SIARMRHDRRIGTEGIP--DFISHYIFNISQKYPYWNRTGGADHFY 107
           D   AD+ F+PF  S++  RH R +  E +     +   +       P W R+GGADH  
Sbjct: 225 DSRDADVVFVPFFASLSYNRHSRVVPPEKVSRDKELQEKLVRYLMAQPEWKRSGGADHVI 284

Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLP-QIWPRQEDPPKLGS 165
           VA H    +++  A  V    + V+     Y        KDV  P +   +       G 
Sbjct: 285 VAHHP---NSLLHARSVLFPVVFVLSDFGRYHPRVASLEKDVIAPYKHMAKTFVNDSAGF 341

Query: 166 SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKFC 217
             R  L +F GA+       +R++L  + +++ ++Y   G ++   A +    +  SKFC
Sbjct: 342 DDRPTLLYFRGAIFRKEGGNIRQELYYMLKDEKDVYFAFGSVQDHGASKASKGMHASKFC 401

Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLKK 274
           L++ G   ++ R+ D++   CVPVII++  +LP+ D L++  FSI V + D      L +
Sbjct: 402 LNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPYEDALDYSKFSIFVRSSDAVKKGYLMR 461

Query: 275 ILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
           +++G+S  ++  + + + +V KHF++       DA  M+   L  +  ++R++
Sbjct: 462 LIRGVSKHQWTRMWNRLKEVDKHFEYQYPSQKDDAVQMIWQALARKVPAIRLK 514


>gi|356507133|ref|XP_003522325.1| PREDICTED: probable glycosyltransferase At3g42180-like [Glycine
           max]
          Length = 484

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 77/302 (25%), Positives = 142/302 (47%), Gaps = 37/302 (12%)

Query: 50  TKDPSKADLFFLPF--SIARMRHDRRIGTEGIP------DFISHYIFNISQKYPYWNRTG 101
            +D S+AD+ F+PF  S++  RH +  G E +       D +  ++    +    W R+G
Sbjct: 182 VQDSSQADVIFVPFFSSLSYNRHSKLNGQEKVSLNKRLQDRLVQFLMGRKE----WKRSG 237

Query: 102 GADHFYVACHSIGRSAMQKAWEVKLNAIQVVCS--SSYFISGHIAHKDVSLP-----QIW 154
           G DH  VA H       ++    +L A  +V +    Y +      KD+  P        
Sbjct: 238 GKDHLIVAHHPNSLLDARR----RLGAAMLVLADFGRYPVELANIKKDIIAPYRHLVGTI 293

Query: 155 PRQEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE 210
           PR E        KR  L +F GA+       +R++L  + +++++++   G +     ++
Sbjct: 294 PRAESA---SFEKRTTLVYFQGAIYRKDGGAIRQELYYLLKDENDVHFTFGSIGGNGINQ 350

Query: 211 ----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
               +  SKFCL++ G   ++ R+ D++   CVPVII++  +LPF D L++  FSI+V  
Sbjct: 351 ASQGMALSKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDEIELPFEDDLDYSDFSIIVHA 410

Query: 267 LDI---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSS 323
            D      L  +L+ I  +E+  +   + ++  HF++       DA  M+  ++  + SS
Sbjct: 411 SDAMKKGYLLNLLRSIKRDEWNKMWERLKQITHHFEYQYPSQPGDAVNMIWQQVEHKISS 470

Query: 324 VR 325
           +R
Sbjct: 471 IR 472


>gi|115476598|ref|NP_001061895.1| Os08g0438600 [Oryza sativa Japonica Group]
 gi|42408650|dbj|BAD09870.1| Exostosin family-like protein [Oryza sativa Japonica Group]
 gi|42408898|dbj|BAD10156.1| Exostosin family-like protein [Oryza sativa Japonica Group]
 gi|113623864|dbj|BAF23809.1| Os08g0438600 [Oryza sativa Japonica Group]
 gi|215715329|dbj|BAG95080.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215740942|dbj|BAG97437.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 566

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 70/232 (30%), Positives = 115/232 (49%), Gaps = 19/232 (8%)

Query: 97  WNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLP--Q 152
           W R GGADH  V  H    ++M  A   +L+A   V S        +A+  KDV  P   
Sbjct: 311 WRRWGGADHLVVPHHP---NSMMDARR-RLSAAMFVLSDFGRYPPDVANLRKDVIAPYKH 366

Query: 153 IWPRQEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTP-- 206
           + P   D    G  +R  LA+F GA+       VR++L Q+ +++ +++   G ++    
Sbjct: 367 VVPSLGDGDSPGFEQRPVLAYFQGAIHRKNGGRVRQRLYQLIKDEKDVHFTYGSVRQNGI 426

Query: 207 --YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV 264
                 +  SKFCL++ G   ++ R+ D++   CVPVII++  +LPF DVL++ +F + V
Sbjct: 427 RRATKGMASSKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPFEDVLDYSAFCVFV 486

Query: 265 ATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
              D      L  +L+GIS EE+  +   + +V  HF++       DA  M+
Sbjct: 487 RASDAVKRGFLLHLLRGISQEEWTAMWRRLKEVAHHFEYQYPSQPGDAVQMI 538


>gi|42570324|ref|NP_850113.2| exostosin-like protein [Arabidopsis thaliana]
 gi|75216857|sp|Q9ZUV3.1|IRX7_ARATH RecName: Full=Probable glucuronoxylan glucuronosyltransferase IRX7;
           AltName: Full=Protein FRAGILE FIBER 8; AltName:
           Full=Protein IRREGULAR XYLEM 7
 gi|4063747|gb|AAC98455.1| hypothetical protein [Arabidopsis thaliana]
 gi|77022037|gb|ABA60868.1| putative glucuronyltransferase [Arabidopsis thaliana]
 gi|77022039|gb|ABA60869.1| putative glucuronyltransferase [Arabidopsis thaliana]
 gi|330252987|gb|AEC08081.1| exostosin-like protein [Arabidopsis thaliana]
          Length = 448

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 82/321 (25%), Positives = 135/321 (42%), Gaps = 43/321 (13%)

Query: 33  YASESYFKKAFM--KSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFIS 84
           +A+E    KAF+  +    T+DP +AD FF+P  ++           G P        I+
Sbjct: 120 FAAEVALHKAFLSLEGDVRTEDPYEADFFFVPVYVS----CNFSTINGFPAIGHARSLIN 175

Query: 85  HYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISG--- 141
             I  +S +YP+WNRT G+DH + A H  G            + + +   +S  +     
Sbjct: 176 DAIKLVSTQYPFWNRTSGSDHVFTATHDFGSCFHTMEDRAIADGVPIFLRNSIILQTFGV 235

Query: 142 ------HIAHKDVSLPQIWPR-----QEDPPKLGSSKRNKLAFFAGAV------------ 178
                       V  P I P      Q++ P   + +R+   FF G +            
Sbjct: 236 TFNHPCQEVENVVIPPYISPESLHKTQKNIPV--TKERDIWVFFRGKMELHPKNISGRFY 293

Query: 179 NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGC 238
           +  VR  + + +  D   Y    R    Y  E+  S FCL   G+   + R+ +S+  GC
Sbjct: 294 SKRVRTNIWRSYGGDRRFYLQRQRFAG-YQSEIARSVFCLCPLGWAPWSPRLVESVALGC 352

Query: 239 VPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVL--KVRK 296
           VPVIIA+   LPF   + W   S+ VA  D+  L  IL+ +++    ++Q  +    VR+
Sbjct: 353 VPVIIADGIRLPFPSTVRWPDISLTVAERDVGKLGDILEHVAATNLSVIQRNLEDPSVRR 412

Query: 297 HFQWHVFPSDYDAFYMVMYEL 317
              ++V   + DA + V+  L
Sbjct: 413 ALMFNVPSREGDATWQVLEAL 433


>gi|27817890|dbj|BAC55656.1| exostosin family protein-like protein [Oryza sativa Japonica Group]
          Length = 453

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 69/225 (30%), Positives = 111/225 (49%), Gaps = 21/225 (9%)

Query: 92  QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-KDVSL 150
           ++ PYW R  G DH ++ C     +A+ +  +   NA+ ++       S   +  KDV L
Sbjct: 182 ERQPYWRRHQGRDHVFI-CQDP--NALYRVVDRISNAVLLISDFGRLRSEQASLVKDVIL 238

Query: 151 P---QIWPRQEDPPKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIY----AH 199
           P   +I   Q D   +G   R  L FF G         VR+ L QV  N++++     A 
Sbjct: 239 PYAHRINSFQGD---VGVESRPSLLFFMGNRYRKEGGKVRDTLFQVLENEADVIIKHGAQ 295

Query: 200 SGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS 259
           S   +      +  SKFCLH  G   +  R+ D+L   CVPVI++++ +LPF DV+++++
Sbjct: 296 SRESRRMATRGMHSSKFCLHPAGDTPSACRLFDALVSLCVPVIVSDYIELPFEDVIDYRN 355

Query: 260 FSIVVAT---LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
            SI V T   +    L   L+GISS+  L  Q  + KV+ +F++ 
Sbjct: 356 ISIFVETSKAVQPGFLTSTLRGISSQRILEYQREIKKVKHYFEYE 400


>gi|326499682|dbj|BAJ86152.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 304

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 73/254 (28%), Positives = 118/254 (46%), Gaps = 30/254 (11%)

Query: 90  ISQKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIA 144
           IS  +PYWNRT GADHF+V  H  G       +KA E  +  +  +     ++    H+ 
Sbjct: 23  ISSHWPYWNRTAGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKDHVC 82

Query: 145 HKD--VSLPQIWPRQED-----PPKLGSSKRNKLAFFAG----AVNSP-----VREKLLQ 188
            K+  +++P   P Q+      PP+   + R+   +F G      N P      R     
Sbjct: 83  LKEGSINIPPYAPPQKMKTHLVPPE---TPRSIFVYFRGLFYDTANDPEGGYYARGARAS 139

Query: 189 VWRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANH 246
           VW N  ++ ++  S      Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+ 
Sbjct: 140 VWENFKNNPLFDISTDHPPTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 199

Query: 247 YDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSD 306
             LPFAD + W    + VA  D+P L  IL  I   E +L +  +L      Q  +FP  
Sbjct: 200 IVLPFADAIPWDEIGVFVAEDDVPKLDTILTSIPM-EVILRKQRLLANPSMKQAMLFPQP 258

Query: 307 Y---DAFYMVMYEL 317
               DAF+ ++  L
Sbjct: 259 AQAGDAFHQILNGL 272


>gi|218199855|gb|EEC82282.1| hypothetical protein OsI_26516 [Oryza sativa Indica Group]
          Length = 393

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 66/222 (29%), Positives = 107/222 (48%), Gaps = 15/222 (6%)

Query: 92  QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-KDVSL 150
           ++ PYW R  G DH ++ C     +A+ +  +   NA+ ++       S   +  KDV L
Sbjct: 122 ERQPYWRRHQGRDHVFI-CQDP--NALYRVVDRISNAVLLISDFGRLRSEQASLVKDVIL 178

Query: 151 PQIWPRQEDPPKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIY----AHSGR 202
           P           +G   R  L FF G         VR+ L QV  N++++     A S  
Sbjct: 179 PYAHRINSFQGDVGVESRPSLLFFMGNRYRKEGGKVRDTLFQVLENEADVIIKHGAQSRE 238

Query: 203 LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
            +      +  SKFCLH  G   +  R+ D+L   CVPVI++++ +LPF DV+++++ SI
Sbjct: 239 SRRMATRGMHSSKFCLHPAGDTPSACRLFDALVSLCVPVIVSDYIELPFEDVIDYRNISI 298

Query: 263 VVAT---LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
            V T   +    L   L+GISS+  L  Q  + KV+ +F++ 
Sbjct: 299 FVETSKAVQPGFLTSTLRGISSQRILEYQREIKKVKHYFEYE 340


>gi|222637299|gb|EEE67431.1| hypothetical protein OsJ_24780 [Oryza sativa Japonica Group]
          Length = 500

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 69/225 (30%), Positives = 111/225 (49%), Gaps = 21/225 (9%)

Query: 92  QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-KDVSL 150
           ++ PYW R  G DH ++ C     +A+ +  +   NA+ ++       S   +  KDV L
Sbjct: 229 ERQPYWRRHQGRDHVFI-CQDP--NALYRVVDRISNAVLLISDFGRLRSEQASLVKDVIL 285

Query: 151 P---QIWPRQEDPPKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIY----AH 199
           P   +I   Q D   +G   R  L FF G         VR+ L QV  N++++     A 
Sbjct: 286 PYAHRINSFQGD---VGVESRPSLLFFMGNRYRKEGGKVRDTLFQVLENEADVIIKHGAQ 342

Query: 200 SGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS 259
           S   +      +  SKFCLH  G   +  R+ D+L   CVPVI++++ +LPF DV+++++
Sbjct: 343 SRESRRMATRGMHSSKFCLHPAGDTPSACRLFDALVSLCVPVIVSDYIELPFEDVIDYRN 402

Query: 260 FSIVVAT---LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
            SI V T   +    L   L+GISS+  L  Q  + KV+ +F++ 
Sbjct: 403 ISIFVETSKAVQPGFLTSTLRGISSQRILEYQREIKKVKHYFEYE 447


>gi|115472841|ref|NP_001060019.1| Os07g0567000 [Oryza sativa Japonica Group]
 gi|113611555|dbj|BAF21933.1| Os07g0567000 [Oryza sativa Japonica Group]
          Length = 500

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 69/225 (30%), Positives = 111/225 (49%), Gaps = 21/225 (9%)

Query: 92  QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-KDVSL 150
           ++ PYW R  G DH ++ C     +A+ +  +   NA+ ++       S   +  KDV L
Sbjct: 229 ERQPYWRRHQGRDHVFI-CQDP--NALYRVVDRISNAVLLISDFGRLRSEQASLVKDVIL 285

Query: 151 P---QIWPRQEDPPKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIY----AH 199
           P   +I   Q D   +G   R  L FF G         VR+ L QV  N++++     A 
Sbjct: 286 PYAHRINSFQGD---VGVESRPSLLFFMGNRYRKEGGKVRDTLFQVLENEADVIIKHGAQ 342

Query: 200 SGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS 259
           S   +      +  SKFCLH  G   +  R+ D+L   CVPVI++++ +LPF DV+++++
Sbjct: 343 SRESRRMATRGMHSSKFCLHPAGDTPSACRLFDALVSLCVPVIVSDYIELPFEDVIDYRN 402

Query: 260 FSIVVAT---LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
            SI V T   +    L   L+GISS+  L  Q  + KV+ +F++ 
Sbjct: 403 ISIFVETSKAVQPGFLTSTLRGISSQRILEYQREIKKVKHYFEYE 447


>gi|218201205|gb|EEC83632.1| hypothetical protein OsI_29363 [Oryza sativa Indica Group]
          Length = 566

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 70/232 (30%), Positives = 114/232 (49%), Gaps = 19/232 (8%)

Query: 97  WNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLP--Q 152
           W R GGADH  V  H    ++M  A   +L+A   V S        +A+  KDV  P   
Sbjct: 311 WRRWGGADHLVVPHHP---NSMMDARR-RLSAAMFVLSDFGRYPPDVANLRKDVIAPYKH 366

Query: 153 IWPRQEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTP-- 206
           + P   D    G  +R  LA+F GA+       VR++L Q+ +++ +++   G ++    
Sbjct: 367 VVPSLGDGDSPGFEQRPVLAYFQGAIHRKNGGRVRQRLYQLIKDEKDVHFTYGSVRQNGI 426

Query: 207 --YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV 264
                 +  SKFCL++ G   ++ R+ D++   CVPVII++  +LPF DVL++  F + V
Sbjct: 427 RRATKGMASSKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPFEDVLDYSDFCVFV 486

Query: 265 ATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
              D      L  +L+GIS EE+  +   + +V  HF++       DA  M+
Sbjct: 487 RASDAVKRGFLLHLLRGISQEEWTAMWRRLKEVAHHFEYQYPSQPGDAVQMI 538


>gi|357157638|ref|XP_003577864.1| PREDICTED: probable glycosyltransferase At3g07620-like
           [Brachypodium distachyon]
          Length = 485

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 77/294 (26%), Positives = 142/294 (48%), Gaps = 22/294 (7%)

Query: 52  DPSKADLFFLPF--SIARMRHDRRIGTEGIPD--FISHYIFNISQKYPYWNRTGGADHFY 107
           D   AD+ F+PF  S++  RH + +  E +     +   +       P W R+GGADH  
Sbjct: 181 DAQDADVVFVPFFASLSYNRHSKPVPPEKVSRDRALQEKLVRYLAARPEWKRSGGADHVI 240

Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLP-QIWPRQEDPPKLGS 165
           VA H    +++  A      A+ V+     Y        KD+  P +   +       G 
Sbjct: 241 VAHHP---NSLLHARSALFPAVFVLSDFGRYHPRVASLEKDLVAPYRHMAKTFVNDTAGF 297

Query: 166 SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLK----TPYADELLGSKFC 217
             R  L +F GA+       +R++L  + +++ +++   G ++    +  +  +  SKFC
Sbjct: 298 DDRPTLLYFRGAIYRKEGGNIRQELYNMLKDEKDVFFSFGSVQDHGVSKASQGMHSSKFC 357

Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLKK 274
           L++ G   ++ R+ D++   CVPVII++  +LP+ DVL++  FSI V + D      L K
Sbjct: 358 LNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPYEDVLDYSKFSIFVRSSDAVKRGYLMK 417

Query: 275 ILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY-DAFYMVMYELWLRRSSVRVQ 327
           +++G++   +  +   + +V KHF++  FPS   DA  M+   L  +  S+R++
Sbjct: 418 LIRGVTKHRWTRMWKRLKEVDKHFEYQ-FPSRKDDAVQMIWQALARKVPSIRLK 470


>gi|412994079|emb|CCO14590.1| predicted protein [Bathycoccus prasinos]
          Length = 553

 Score = 91.7 bits (226), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 80/323 (24%), Positives = 152/323 (47%), Gaps = 42/323 (13%)

Query: 30  RGNYASESYFKKAFM--KSHFVTKDPSKADLFFLP-------FSIARMRHDRR-IGTEGI 79
           R  Y +E +F + F   K+   T +P +ADLFF+P       +S   +RH+ +    E  
Sbjct: 222 RDQYGTEIWFHRNFRDDKNGVRTMNPEEADLFFVPQYGECFLWSREMLRHENQGQAMEET 281

Query: 80  PDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKL-NAIQVVCSSSYF 138
            ++    + ++  K PY+NRT G DH +V   + G +   + W+ ++ ++I +       
Sbjct: 282 NEYFLEVLSHVKGKLPYFNRTDGRDHIFVFAGARGPTIF-RDWQKEIPHSIYLTPEGDRT 340

Query: 139 ISGHIAHKDVSLPQIW---------PRQE---DPPKLGSSKRNKLAFFAGAVNSP----- 181
           +      KD+ +P +           R E   +PPK     R  LA F G ++ P     
Sbjct: 341 LPQFDTWKDIVIPGLEYDKRMYLEEHRNELVTNPPK-----RKILAMFRGTIDHPAGFAY 395

Query: 182 ---VREKLLQVWRNDSEIYAHSGRLKTP----YADELLGSKFCLHVKGFEVNTARIADSL 234
              +R KL ++++N +++  +  ++K      Y  E+  S FCL+  G+   T R   ++
Sbjct: 396 SKGLRPKLKKIFQNATDV-IYDTKIKDCDRDCYVREMTESVFCLNPLGWTPWTLRFYQAV 454

Query: 235 YYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
              C+P+IIA++ + PF   +N+  F++ +   D+  + + ++ +  EE    + Y+ K+
Sbjct: 455 MTRCIPIIIADNIEFPFESEINYSEFALKIPEKDVSDILETMRHMPEEERERRRRYMDKI 514

Query: 295 RKHFQWHVFPSDYDAFYMVMYEL 317
            K F +       DA+Y  + EL
Sbjct: 515 WKQFTYQRPAEIGDAYYSTVKEL 537


>gi|125586098|gb|EAZ26762.1| hypothetical protein OsJ_10674 [Oryza sativa Japonica Group]
          Length = 364

 Score = 91.7 bits (226), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 75/290 (25%), Positives = 138/290 (47%), Gaps = 24/290 (8%)

Query: 57  DLFFLPF--SIARMRHDRRIGTEGIPD--FISHYIFNISQKYPYWNRTGGADHFYVACHS 112
           D+ F+PF  S++  RH + +      +   +   + +     P W R+GG DH  +A H 
Sbjct: 50  DVVFVPFFASLSFNRHSKVVPPARASEDRALQRRLLDYLAARPEWRRSGGRDHVVLAHHP 109

Query: 113 IGR-SAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLP--QIWPRQEDPPKLGSSKR 168
            G   A  K W      + V+C    Y  S     KDV  P   + P   +    G   R
Sbjct: 110 NGMLDARYKLWP----CVFVLCDFGRYPPSVAGLDKDVIAPYRHVVPNFAND-SAGYDDR 164

Query: 169 NKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKFCLHV 220
             L +F GA+       +R++L  + +++ +++   G +     ++    +  SKFCL++
Sbjct: 165 PTLLYFQGAIYRKDGGFIRQELYYLLKDEKDVHFSFGSVVGNGIEQATQGMRASKFCLNI 224

Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLKKILK 277
            G   ++ R+ DS+   CVP+II++  +LPF DVL++  F I+V   D      L  ++ 
Sbjct: 225 AGDTPSSNRLFDSIVSHCVPIIISDEIELPFEDVLDYSKFCIIVRGADAVKKGFLMNLIN 284

Query: 278 GISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
           GIS E++  + + + +V +HF++     + DA  M+   +  +  S+R++
Sbjct: 285 GISREDWTRMWNRLKEVERHFEYQYPSQNDDAVQMIWKAIARKAPSIRLK 334


>gi|307111456|gb|EFN59690.1| hypothetical protein CHLNCDRAFT_133230 [Chlorella variabilis]
          Length = 569

 Score = 91.7 bits (226), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 80/306 (26%), Positives = 133/306 (43%), Gaps = 28/306 (9%)

Query: 39  FKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWN 98
           F + F++S   T+DPS+A LF++P  +     +   G E     + H    I   +PYW+
Sbjct: 271 FMEEFLESPVRTEDPSEASLFYIPAFLYSYSGNMAGGDEHTQLLLDH----IRATWPYWD 326

Query: 99  RTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS----GHIAHKDVS----L 150
           R GG DHF       G       +   +  +     S+        GH  H +      L
Sbjct: 327 RHGGRDHFLFVPADRGTCPWGSRFSDLIRIVHFGMHSTRTNHNPHFGHQGHPEFGCYNPL 386

Query: 151 PQIWPRQEDPPKLGSSKRNKLAFFAGAVNSP----------VREKLLQVWRNDSEIYAHS 200
             I       P L       L FFAG++ +           +  +L+  W ND E ++ S
Sbjct: 387 RDIVAAGTGAP-LSLPWAGWLFFFAGSIRTDDNVYSGRTRLILSELVAQW-NDPE-FSFS 443

Query: 201 GRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSF 260
           G     Y      +KFCL   G+     R+  S+  GCVPV+I  H   P+ +VL +++F
Sbjct: 444 GGYVNNYPAGFREAKFCLAPWGYGFGM-RLHQSILGGCVPVVIQEHVFQPYEEVLPYETF 502

Query: 261 SIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW--HVFPSDYDAFYMVMYELW 318
           S+ ++  D+P L++ L+ ++ E+Y  L   V++ ++ F W  H+    +D     +   W
Sbjct: 503 SLRLSNEDLPQLRETLRSVTDEQYRELLEGVVRYKEAFSWERHLGGRAFDYTIASLRRRW 562

Query: 319 LRRSSV 324
           L   S+
Sbjct: 563 LNSLSL 568


>gi|242046000|ref|XP_002460871.1| hypothetical protein SORBIDRAFT_02g036640 [Sorghum bicolor]
 gi|241924248|gb|EER97392.1| hypothetical protein SORBIDRAFT_02g036640 [Sorghum bicolor]
          Length = 500

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 76/273 (27%), Positives = 124/273 (45%), Gaps = 26/273 (9%)

Query: 52  DPSKADLFFLPF--SIARMRHDRRIGTEGI---------PDFISHYIFNISQKYPYWNRT 100
           DPS ADLF++PF  S++ + +  R                D +   +    ++ PYW R 
Sbjct: 176 DPSDADLFYVPFFSSLSLVVNPIRSPPAANASGAAAAYSDDAMQEELLEWLERQPYWRRH 235

Query: 101 GGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-KDVSLPQIWPRQED 159
            G DH ++ C     +A+ +  +   NA+ +V       S   +  KDV LP        
Sbjct: 236 MGRDHVFI-CQDP--NALYRVIDRISNAVLLVSDFGRLRSDQASLVKDVILPYSHRINSF 292

Query: 160 PPKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIYAHSG----RLKTPYADEL 211
             ++G   R  L FF G         VR+ L Q+  N+ ++    G      +      +
Sbjct: 293 KGEVGVDGRPLLLFFMGNRYRKEGGKVRDALFQILENEDDVTIKHGTQSRESRRAARQGM 352

Query: 212 LGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT---LD 268
             SKFCLH  G   +  R+ D+L   CVPVI++++ +LPF D++++   SI V T   + 
Sbjct: 353 HSSKFCLHPAGDTPSACRLFDALVSLCVPVIVSDYIELPFEDIIDYNKISIFVGTSKAVQ 412

Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
              L  +L+ ISSE  L  Q    KV+++F++ 
Sbjct: 413 PGYLTSMLRRISSERILEYQRETKKVKRYFEYE 445


>gi|115452759|ref|NP_001049980.1| Os03g0324700 [Oryza sativa Japonica Group]
 gi|108707908|gb|ABF95703.1| secondary cell wall-related glycosyltransferase family 47,
           putative, expressed [Oryza sativa Japonica Group]
 gi|113548451|dbj|BAF11894.1| Os03g0324700 [Oryza sativa Japonica Group]
 gi|125543687|gb|EAY89826.1| hypothetical protein OsI_11372 [Oryza sativa Indica Group]
 gi|215768347|dbj|BAH00576.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 468

 Score = 91.3 bits (225), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 75/290 (25%), Positives = 138/290 (47%), Gaps = 24/290 (8%)

Query: 57  DLFFLPF--SIARMRHDRRIGTEGIPD--FISHYIFNISQKYPYWNRTGGADHFYVACHS 112
           D+ F+PF  S++  RH + +      +   +   + +     P W R+GG DH  +A H 
Sbjct: 154 DVVFVPFFASLSFNRHSKVVPPARASEDRALQRRLLDYLAARPEWRRSGGRDHVVLAHHP 213

Query: 113 IGR-SAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLP--QIWPRQEDPPKLGSSKR 168
            G   A  K W      + V+C    Y  S     KDV  P   + P   +    G   R
Sbjct: 214 NGMLDARYKLWP----CVFVLCDFGRYPPSVAGLDKDVIAPYRHVVPNFAND-SAGYDDR 268

Query: 169 NKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKFCLHV 220
             L +F GA+       +R++L  + +++ +++   G +     ++    +  SKFCL++
Sbjct: 269 PTLLYFQGAIYRKDGGFIRQELYYLLKDEKDVHFSFGSVVGNGIEQATQGMRASKFCLNI 328

Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLKKILK 277
            G   ++ R+ DS+   CVP+II++  +LPF DVL++  F I+V   D      L  ++ 
Sbjct: 329 AGDTPSSNRLFDSIVSHCVPIIISDEIELPFEDVLDYSKFCIIVRGADAVKKGFLMNLIN 388

Query: 278 GISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
           GIS E++  + + + +V +HF++     + DA  M+   +  +  S+R++
Sbjct: 389 GISREDWTRMWNRLKEVERHFEYQYPSQNDDAVQMIWKAIARKAPSIRLK 438


>gi|159478058|ref|XP_001697121.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158274595|gb|EDP00376.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 802

 Score = 91.3 bits (225), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 85/342 (24%), Positives = 149/342 (43%), Gaps = 75/342 (21%)

Query: 34  ASESYFKKAFMKSHFVTKDPSKADLFFLPFSI-----------------ARMRHDRRIGT 76
           A E+Y  +  ++S   T DP +AD F++P  +                 A + H R   T
Sbjct: 400 AVEAYLHETLLQSEHRTFDPDEADFFYVPVYVTCYMWPILGWADGPWWYAPLAHTR---T 456

Query: 77  EGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKA----------W---E 123
             + + +S    +IS  +P+WNR GG DH ++     G   M  A          W   +
Sbjct: 457 MHVSNMLSEVHAHISSTFPWWNRRGGRDHIWLMAADEGACYMPTAIYNTSIILTHWGRMD 516

Query: 124 VKLNAIQVVCSSSYFIS------------------GHIAH---KDVSLPQIWPRQ--EDP 160
           ++  +       +Y ++                  GH  +   KD+ +P         D 
Sbjct: 517 LEHQSNTAYQQDNYNMAMPGEFKAWPGMDWQSRMRGHPCYDPRKDLVIPAFKSVDHFRDS 576

Query: 161 PKLGSSK--RNKLAFFAGAVNSP--------VREKLLQVW-RND----SEIYAHSGRL-K 204
           P LG +   R+ L +F G +           +R+KL  +W +ND     +IY  +G + +
Sbjct: 577 PLLGGAPLVRDLLCYFRGDIGQARFPQYSRGLRQKLFHLWHKNDWAAKHKIYIGNGEMVR 636

Query: 205 TPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV 264
            PY++ LL S+FCL + G +  + R  D++ +GC+PV+I ++    F  +L+W+SFSI +
Sbjct: 637 GPYSEHLLRSRFCLVLPG-DGWSPRAEDAVLHGCIPVVIMDNVHAVFESILDWESFSIRI 695

Query: 265 ATLDIPL--LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFP 304
              D  L  L ++L+ +  E    +Q  + +V   F +   P
Sbjct: 696 REDDAALEALPQLLEAVPPERVAKMQRNLARVWHRFAYATGP 737


>gi|326514612|dbj|BAJ96293.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 428

 Score = 90.9 bits (224), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 77/324 (23%), Positives = 140/324 (43%), Gaps = 51/324 (15%)

Query: 33  YASESYFKKAFM-KSHFVTKDPSKADLFFLPF----------SIARMRHDRRIGTEGIPD 81
           +A+E    +A + +       P +ADLF +P            +  + H R         
Sbjct: 100 FAAEVAVHEALLLRQRRAGLRPEEADLFLVPVYACCNFSTPTGLPSLAHAR--------G 151

Query: 82  FISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNA------IQVVCSS 135
            ++  +  +  + P+WNR+ GADH +VA H  G +      +V + A         +   
Sbjct: 152 LLADAVGLVRAQMPFWNRSAGADHVFVASHDFG-ACFHPMEDVAMAAGIPEFLKGSILLQ 210

Query: 136 SYFISGHIAHKDVS--------LPQIWPRQEDPPKLGSSKRNKLAFFAGAV--------- 178
           ++ + G    +DV          P++ PR+   P+   + R+  AFF G +         
Sbjct: 211 TFGVQGRHPCQDVEHVVIPPYVPPELAPRELPEPE--KAHRDIFAFFRGKMEVHPKNISG 268

Query: 179 ---NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLY 235
              +  VR +LL+++  + + Y    R    Y  E+  S FC+   G+   + R+ +S+ 
Sbjct: 269 HFYSRKVRTELLRLYGRNRKFYLKRKR-NDGYRSEMARSLFCICPLGWAPWSPRLVESVL 327

Query: 236 YGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKV- 294
            GC+PV+IA+   LPF  VL W   S+ VA  D+  L+ +L  +++     +Q  +    
Sbjct: 328 LGCIPVVIADDIRLPFPGVLRWPDISLQVAERDVAGLEAVLDHVAATNLTTIQGNLWDPV 387

Query: 295 -RKHFQWHVFPSDYDAFYMVMYEL 317
            RK   ++    + DA + V+ EL
Sbjct: 388 KRKALVFNRPMEEGDATWQVLKEL 411


>gi|356565097|ref|XP_003550781.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
           [Glycine max]
          Length = 472

 Score = 90.9 bits (224), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 77/300 (25%), Positives = 143/300 (47%), Gaps = 33/300 (11%)

Query: 50  TKDPSKADLFFLPF--SIARMRHDRRIGTE--GIPDFISHYIFNISQKYPYWNRTGGADH 105
            ++  +AD+ F+PF  S++  RH +  G E   +   +   +  +  +   W R+GG DH
Sbjct: 170 VQNSRQADVVFVPFFSSLSYNRHSKIHGKEKVSVNRMLQQRLVQLLMEREEWKRSGGRDH 229

Query: 106 FYVACH--SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLP-----QIWPR 156
             VA H  SI R+        KL +  +V +        +A+  KD+  P        PR
Sbjct: 230 VIVAHHPNSILRARR------KLGSAMLVLADFGRYPSQLANIKKDIIAPYRHLVSTVPR 283

Query: 157 QEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE-- 210
            E        +R+ L +F GA+       +R+KL  + +++ +++   G ++    ++  
Sbjct: 284 AESA---SYEERSTLLYFQGAIYRKDGGAIRQKLYYLLKDEKDVHFAFGSIRKNGINQAS 340

Query: 211 --LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
             +  SKFCL+V G   ++ R+ D++   CVPVII++  +LPF DVL++  F + V   D
Sbjct: 341 QGMALSKFCLNVAGDTPSSNRLFDAIVSHCVPVIISDEIELPFEDVLDYSEFGLFVHASD 400

Query: 269 I---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
                 L  +L+ I  E++  +   +  + +HF++       DA  M+  E+  + SS++
Sbjct: 401 AVRKGYLLNLLRSIKPEKWTQMWERLKDITQHFEYQYPSQPGDAVNMIWEEVAHKISSLQ 460


>gi|297822463|ref|XP_002879114.1| hypothetical protein ARALYDRAFT_481698 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297324953|gb|EFH55373.1| hypothetical protein ARALYDRAFT_481698 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 452

 Score = 90.9 bits (224), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 82/321 (25%), Positives = 136/321 (42%), Gaps = 43/321 (13%)

Query: 33  YASESYFKKAFM--KSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFIS 84
           +A+E    KAF+  +    T+DP +AD FF+P  ++           G P        I+
Sbjct: 124 FAAEVALHKAFLSLEGDIRTEDPYEADFFFVPVYVSCNFST----INGFPAIGHARTLIN 179

Query: 85  HYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS---- 140
             I  +S +YP+WNR  G+DH + A H  G            + +  +  SS  +     
Sbjct: 180 DAIKFVSTQYPFWNRNNGSDHVFTATHDFGSCFHTMEDRAIADGVPKILRSSIVLQTFGV 239

Query: 141 --GHIAHKDVSLPQIWPRQEDPPKLGSS--------KRNKLAFFAGAV------------ 178
              H   +  ++  + P    P  L  +        +R+  AFF G +            
Sbjct: 240 TFNHPCQEVENV--VIPPYISPESLHKTLKNIPVNKERDIWAFFRGKMELHPKNISGRFY 297

Query: 179 NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGC 238
           +  VR K+ + +  D   Y    R  + Y  E+  S FCL   G+   + R+ +S+  GC
Sbjct: 298 SKRVRTKIWRSYGGDRRFYLQRQRF-SGYQLEIARSVFCLCPLGWAPWSPRLVESVALGC 356

Query: 239 VPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVL--KVRK 296
           VPVIIA+   LPF   + W   S+ VA  D+  L  IL+ + +    ++Q  +    VR+
Sbjct: 357 VPVIIADGIRLPFPSAVRWPDISLTVAERDVGKLGDILEHVVATNLSVIQRNLEDPSVRR 416

Query: 297 HFQWHVFPSDYDAFYMVMYEL 317
              ++V   + DA + V+  L
Sbjct: 417 ALMFNVPSREGDATWQVLEAL 437


>gi|307106650|gb|EFN54895.1| hypothetical protein CHLNCDRAFT_135016 [Chlorella variabilis]
          Length = 584

 Score = 90.9 bits (224), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 87/328 (26%), Positives = 138/328 (42%), Gaps = 74/328 (22%)

Query: 32  NYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPD---------- 81
           NY +   F + F+ S   T+DPS+A+LF++P  +         GT G P           
Sbjct: 244 NYIAYQQFLEQFLLSPVRTEDPSEANLFYIPMLLYGYS-----GTPGGPSRAPQVDSLCN 298

Query: 82  ------FISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSS 135
                  I   +  I+ K+PYWNRT G DHFY A       A + A   K  A Q +  S
Sbjct: 299 MMPGQAHIDLVLDQIAHKWPYWNRTRGRDHFYWA------PADRGACYHKGLAEQAIKVS 352

Query: 136 SYFISGHIAHKDVSLPQIWPRQEDPP-----------------KLGSS------------ 166
            + +  H  +  + L  ++   +  P                 KL +S            
Sbjct: 353 HFGL--HATNNSIDLGDLYSHNQMSPDHGCYHPLRDVVAPPFEKLAASWLNTTLRLGLDG 410

Query: 167 ---KRNKLAFFAGAV-------NSPVREKL---LQVWRNDSEIYAHSGRLKT-PYADELL 212
               +N   +F+G V       +   R+KL   ++ W +D E     GRL+   Y   + 
Sbjct: 411 NIKGKNATFYFSGNVQGINLMYSGGTRQKLQALIKQW-DDPEFGFVEGRLQEGAYEQRIR 469

Query: 213 GSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLL 272
            S+FCL   G      R+   ++ G +PVI+  H   P  DVL +++FSI +   D+P L
Sbjct: 470 ESRFCLAPYGHGYGM-RLGQCIFAGSIPVIVQEHVFQPLEDVLPYEAFSIRLTNDDLPQL 528

Query: 273 KKILKGISSEEYLLLQSYVLKVRKHFQW 300
           ++IL+GI+  +Y  L + +L+      W
Sbjct: 529 REILRGITEAQYRELMTGLLRYSLALSW 556


>gi|114325715|gb|ABI64067.1| glycosyltransferase GT47C [Populus tremula x Populus alba]
          Length = 442

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 87/330 (26%), Positives = 134/330 (40%), Gaps = 62/330 (18%)

Query: 33  YASESYFKKAFMKSHFV-TKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISH 85
           +ASE    KA   S  V T DP +AD FF+P  ++           G P        +S 
Sbjct: 114 FASEVAIHKALSSSLDVRTFDPYEADFFFVPVYVSCNFST----VNGFPAIGHARSLLSS 169

Query: 86  YIFNISQKYPYWNRTGGADHFYVACHSIG--------------------RSAMQKAWEVK 125
            +  IS  YP+WNR+ G+DH +VA H  G                    RS + + + VK
Sbjct: 170 AVQLISSNYPFWNRSQGSDHVFVASHDYGACFHAMEERAMEDGIPEFLKRSIILQTFGVK 229

Query: 126 LN----AIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAV--- 178
            N     ++ V    Y   G +       P             + +R+  AFF G +   
Sbjct: 230 FNHPCQDVENVVIPPYISPGSVRATLEKYPL------------TGRRDIWAFFRGKMEVH 277

Query: 179 ---------NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTAR 229
                    +  VR  +L+ +  D   Y    R    Y  E++ S FCL   G+   + R
Sbjct: 278 PKNISGRYYSKKVRTVILRKYSGDRRFYLQRHRFAG-YQSEIVRSVFCLCPLGWAPWSPR 336

Query: 230 IADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQS 289
           + +S+  GCVPVIIA+   LPF   + W   S+ VA  D+  L  +L  +++     +Q 
Sbjct: 337 LVESVALGCVPVIIADGIRLPFPTAVRWSEISLTVAEKDVANLGTLLDQVAATNLSAIQK 396

Query: 290 YVL--KVRKHFQWHVFPSDYDAFYMVMYEL 317
            +    VR+   ++      DA + V+Y L
Sbjct: 397 NLWDPDVRRALLFNDPVQGGDATWQVLYAL 426


>gi|168059257|ref|XP_001781620.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666934|gb|EDQ53576.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 460

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 81/303 (26%), Positives = 139/303 (45%), Gaps = 41/303 (13%)

Query: 52  DPSKADLFFLPF--SIARMRHDRRIGTEGIPD-------------------FISHYIFNI 90
           DP +AD+FF+PF  +++   H   +      D                    +   + +I
Sbjct: 139 DPEQADVFFVPFFSALSFNSHGHGMSEGAAADKRLQIVLLTFGRHVNASCHLVQAGLVDI 198

Query: 91  SQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVS 149
             K  +W  + G DH  VA H    +A++   ++   +I +V     Y  +     KDV 
Sbjct: 199 LSKSKWWQASQGRDHILVAHHP---NALRHYRDMLNQSIFIVADFGRYDKTVARLSKDVV 255

Query: 150 LP--QIWPR-QEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIY----- 197
            P   + P   +D P    S R  L FF G +    +  VR KL ++  N+S+++     
Sbjct: 256 APYVHVLPSYDQDNPADPFSLRKTLLFFQGRIHRKGDGIVRTKLAELLANNSDVHYVDSL 315

Query: 198 AHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
           A +  + T  A  +  S+FCLH  G   ++ R+ D++   CVPVII++  +LPF D LN+
Sbjct: 316 ASAEAIATSTAG-MRTSRFCLHPAGDTPSSCRLFDAIVSHCVPVIISDRIELPFEDDLNY 374

Query: 258 KSFSIVVA---TLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
           K FSI  +   ++    L + L+ I+ E +L + + +  V  HF++   P   DA  M+ 
Sbjct: 375 KDFSIFFSSEESVKPGHLLRTLRSITRERWLRMWNALKTVSHHFEYQHPPKKDDAVNMIF 434

Query: 315 YEL 317
            ++
Sbjct: 435 KQV 437


>gi|224106838|ref|XP_002314302.1| predicted protein [Populus trichocarpa]
 gi|222850710|gb|EEE88257.1| predicted protein [Populus trichocarpa]
          Length = 462

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 87/320 (27%), Positives = 137/320 (42%), Gaps = 42/320 (13%)

Query: 33  YASESYFKKAFMKSHFV-TKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISH 85
           +ASE    KA   S  + T DP +AD FF+P  ++           G P        +S 
Sbjct: 134 FASEVAIHKALSNSLDIRTFDPYEADFFFVPVYVS----CNFSTVNGFPAIGHARSLLSS 189

Query: 86  YIFNISQKYPYWNRTGGADHFYVACHSIG---RSAMQKAWE------VKLNAIQVVCSSS 136
            +  IS  YP+WNR+ G+DH +VA H  G    +  ++A E      +K + I       
Sbjct: 190 AVQLISSNYPFWNRSQGSDHVFVASHDYGACFHAMEERAMEDGIPEFLKRSIILQTFGVK 249

Query: 137 YFISGHIAHKDVSLPQIWPRQ-----EDPPKLGSSKRNKLAFFAGAV------------N 179
           +          V  P I P +     E+ P  G  +R+  AFF G +            +
Sbjct: 250 FNHPCQDVENVVIPPYISPERVRTTLENYPLNG--RRDIWAFFRGKMEVHPKNISGRYYS 307

Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCV 239
             VR  + + +  D   Y    R    Y  E++ S FCL   G+   + R+ +S+  GCV
Sbjct: 308 KKVRTVIWRKYSGDRRFYLQRHRF-AGYQSEIVRSVFCLCPLGWAPWSPRLVESVALGCV 366

Query: 240 PVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVL--KVRKH 297
           PVIIA+   LPF   + W   S+ VA  D+  L  +L  +++     +Q  +    VR+ 
Sbjct: 367 PVIIADGIRLPFPTAVRWSEISLTVAEKDVANLGTLLDHVAATNLSAIQKNLWDPDVRRA 426

Query: 298 FQWHVFPSDYDAFYMVMYEL 317
             ++    + DA + V+Y L
Sbjct: 427 LLFNDRVQEGDATWQVLYAL 446


>gi|115450193|ref|NP_001048697.1| Os03g0107900 [Oryza sativa Japonica Group]
 gi|122247627|sp|Q10SX7.1|GT31_ORYSJ RecName: Full=Probable glucuronosyltransferase Os03g0107900
 gi|108705764|gb|ABF93559.1| exostosin family protein, putative, expressed [Oryza sativa
           Japonica Group]
 gi|113547168|dbj|BAF10611.1| Os03g0107900 [Oryza sativa Japonica Group]
 gi|125542077|gb|EAY88216.1| hypothetical protein OsI_09667 [Oryza sativa Indica Group]
 gi|215766485|dbj|BAG98793.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222624042|gb|EEE58174.1| hypothetical protein OsJ_09104 [Oryza sativa Japonica Group]
          Length = 427

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 81/320 (25%), Positives = 141/320 (44%), Gaps = 45/320 (14%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIA----------RMRHDRRIGTEGIPDF 82
           +A+E    +A +        P  A LFF+P  ++           + H R +  + + D 
Sbjct: 101 FAAEVALHEALLAYAGRAARPDDATLFFVPVYVSCNFSTDNGFPSLSHARALLADAV-DL 159

Query: 83  ISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKL-NAI-----QVVCSSS 136
           +         + PYWNR+ GADH +VA H  G +      +V + + I     + +   +
Sbjct: 160 VR-------AQMPYWNRSAGADHVFVASHDFG-ACFHPMEDVAIADGIPEFLKRSILLQT 211

Query: 137 YFISG-HIAHKD---VSLPQIWPRQE-DPPKLGSSKRNKLAFFAGAV------------N 179
           + + G H+  +    V  P + P    + P+   ++R+  AFF G +            +
Sbjct: 212 FGVQGTHVCQEADHVVIPPHVPPEVALELPEPEKAQRDIFAFFRGKMEVHPKNISGRFYS 271

Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCV 239
             VR +LLQ +  + + Y    R    Y  E+  S FCL   G+   + R+ +S+  GC+
Sbjct: 272 KKVRTELLQKYGRNRKFYLKRKRYGN-YRSEMARSLFCLCPLGWAPWSPRLVESVLLGCI 330

Query: 240 PVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK--VRKH 297
           PVIIA+   LPF  VL W   S+ VA  D+  L+ +L  + +    ++Q  +     RK 
Sbjct: 331 PVIIADDIRLPFPSVLQWLDISLQVAEKDVASLEMVLDHVVATNLTVIQKNLWDPVKRKA 390

Query: 298 FQWHVFPSDYDAFYMVMYEL 317
             ++    + DA + V+ EL
Sbjct: 391 LVFNRPMEEGDATWQVLREL 410


>gi|428181360|gb|EKX50224.1| hypothetical protein GUITHDRAFT_104038 [Guillardia theta CCMP2712]
          Length = 723

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 94/344 (27%), Positives = 146/344 (42%), Gaps = 61/344 (17%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARM--RHDRRIGTEGIPD--------- 81
           YA+E Y  +  + S  +T DP  ADLFF+P   A     H  R G  G PD         
Sbjct: 372 YAAEVYIHEQLLLSDSLTLDPGAADLFFIPLYAACFLSSHFVRPGP-GWPDNDVDIGKTY 430

Query: 82  -FISHYIFNISQKYPYWNRTGGADHFYV------ACH--------------SIGRSAMQK 120
             +   + ++ Q YP+++R+ GADH  V      +C               S  R+ ++ 
Sbjct: 431 QAVQLVLEHVRQTYPFFDRSAGADHVLVLSSDWGSCQGPFLELHNSILLVTSGDRTLVRP 490

Query: 121 AWEVKLNAIQVVCSSSYFISGHIA----HKDVSLPQIWPRQEDPPKLGSS-------KRN 169
           AW     A  +  S  + +   +      KDV +P + P     P L +S        R+
Sbjct: 491 AWYAARAADHMGSSEEFAVRSRLPCFQLFKDVVIPPLVPH----PALTASYMGERTRGRD 546

Query: 170 KLAFFAGAVNSPVREKLLQ----------VWRNDSEI--YAHSGRLKTP-YADELLGSKF 216
            L +F G     V+  L            + R  S +  +  S R+ +  Y DELL S F
Sbjct: 547 ILVYFRGTAAGSVKALLYNKDYSLGIRQLLLRRYSRVRGWVVSDRINSSSYHDELLRSVF 606

Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKIL 276
           CL   G+E+ + R  +++  GC+PV++ +   LPF   L++  F++ V    I  L+ IL
Sbjct: 607 CLAPAGWELWSVRFFEAILLGCIPVLLTDDVQLPFQQRLDYSRFTVKVEQRRILELESIL 666

Query: 277 KGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLR 320
             I+       Q  + +V K   +   P D DAF  +M EL  R
Sbjct: 667 SSINETVIRRKQEGLKEVWKRMTYQRPPEDGDAFTGIMDELARR 710


>gi|307111406|gb|EFN59640.1| hypothetical protein CHLNCDRAFT_133110 [Chlorella variabilis]
          Length = 767

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 83/337 (24%), Positives = 139/337 (41%), Gaps = 72/337 (21%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFN--- 89
           Y  ES F +  M+S   T DP +AD F++P   +   +  R G   + D+    + N   
Sbjct: 372 YGLESAFLEMLMQSEHRTLDPEEADFFYVPVFTSCFIYPVRSGASSLQDWFYAPVQNRVQ 431

Query: 90  ------------ISQKYPYWNRTGGADHFYVACHSIG----------------------- 114
                       I   +P+W R GG DH ++  H  G                       
Sbjct: 432 GAANMLLEAFHWIQSHHPWWERRGGRDHIWLVTHDEGSCWVPAAIRPSIILSHWGRMDLN 491

Query: 115 RSAMQKAWE-----VKLNAIQVVCSSSYF---------ISGHIAH---KDVSLPQIWP-- 155
            S+    WE          +Q      +          I+GH  +   KD+ +P I    
Sbjct: 492 HSSTTGYWEDDYRQANARKLQHAEQQQHLFEPDGFQQKIAGHACYDPVKDLVVPLIKTPN 551

Query: 156 RQEDPPKLGSSKRNK--LAFFAGAVNSP-------VREKLLQVWRNDSEIYAHSGR---- 202
           R +  P  G+  RN+  LAF  G VN         VR+++    R    +  +  +    
Sbjct: 552 RNKHSPLFGAPTRNRTWLAFHRGRVNHEFPRYSRGVRQRVDNASREHQWLENYGSKFGDE 611

Query: 203 -LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFS 261
            L+  Y++ L  S FCL ++G +  +AR+ D++ +GC+PV+I +   + F  VL+   FS
Sbjct: 612 SLQGDYSELLASSIFCLVLQG-DGWSARMDDAMSHGCIPVVIIDDVHVSFESVLDLSQFS 670

Query: 262 IVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
           + V + D+  L +IL+ +S E    LQ  + +V + +
Sbjct: 671 LRVKSADVERLPEILQAVSQERREELQRNLARVWQRY 707


>gi|224136520|ref|XP_002322350.1| predicted protein [Populus trichocarpa]
 gi|222869346|gb|EEF06477.1| predicted protein [Populus trichocarpa]
          Length = 343

 Score = 88.2 bits (217), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 79/274 (28%), Positives = 120/274 (43%), Gaps = 29/274 (10%)

Query: 72  RRIGTEGI-----PDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRS---AMQKAWE 123
           RR+  E +     P  I      IS  +PYWNRT GADHF+V  H  G       +KA E
Sbjct: 40  RRVKEEPLFAFQSPRMIRSATQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIE 99

Query: 124 VKLNAI--QVVCSSSYFISGHIAHKD--VSLPQIWPRQEDPPKLGSSK--RNKLAFFAGA 177
             +  +        ++    H+  KD  +++P   P Q+    L   K  R+   +F G 
Sbjct: 100 RGILPLLQHATLVQTFGQRNHVCLKDGSITVPSYAPPQKMQTHLIPEKTPRSIFVYFRGL 159

Query: 178 V----NSP-----VREKLLQVWRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVN 226
                N P      R     VW N  D+ ++  S    T Y +++  + FCL    +   
Sbjct: 160 FYDVGNDPEGGYYARGARAAVWENFKDNPLFDISTEHPTTYYEDMQQAVFCLCPLSWAPW 219

Query: 227 TARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLL 286
           + R+ ++L +G +PVII +   LPFAD + W+   + V   D+P L  IL  I   E +L
Sbjct: 220 SPRLVEALIFGYIPVIIVDDIVLPFADAIPWEEIGVFVDEKDVPNLDTILTSIPP-EVIL 278

Query: 287 LQSYVLKVRKHFQWHVFP---SDYDAFYMVMYEL 317
            +  +L      Q  +FP      DAF+ V+  L
Sbjct: 279 RKQRLLANPSMKQAMLFPQLAQAGDAFHQVLNGL 312


>gi|168042843|ref|XP_001773896.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674740|gb|EDQ61244.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 373

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 73/301 (24%), Positives = 134/301 (44%), Gaps = 24/301 (7%)

Query: 56  ADLFFLPFSIAR-----MRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVAC 110
           AD+ F+PF  +       R ++R   E     +   +    +K P W  +GG DH  V  
Sbjct: 75  ADVIFVPFFASLSYNKFTRAEQRALGEDKNQELQEKLMQFLEKQPAWQASGGVDHVIVIH 134

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLP--QIWPRQEDPPKLGSS 166
           H      M+        A+ VV     + S  +A+  KD+  P   +    E    +   
Sbjct: 135 HPNSGYFMRDHLR---KAMFVVADFGRYASD-VANIGKDIVAPYKHVVNDFEAEATISYE 190

Query: 167 KRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPY----ADELLGSKFCL 218
           KR  L FF GA+       +R +L ++   + +++   G          ++ +  SKFCL
Sbjct: 191 KRKTLLFFQGAIMRKEGGIIRLQLYKLLNGEPDVHFEGGNTTNSAIRSASEGMQNSKFCL 250

Query: 219 HVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLKKI 275
           ++ G   ++ R+ D++   CVPVII++  ++PF D LN+ +FSI + + D      +  +
Sbjct: 251 NLAGDTPSSNRLFDAIASHCVPVIISDDIEVPFEDTLNYSTFSIFIKSSDALKSNFIIDL 310

Query: 276 LKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWSTSLDSN 335
           L+G+S E++  + + + +V  HF++       DA +M    +  +   VR+  +      
Sbjct: 311 LRGVSREKWTKMWATLKQVEHHFKYQYPTQPDDAVHMTWKAIARKIHKVRLHLNKERRYQ 370

Query: 336 W 336
           W
Sbjct: 371 W 371


>gi|159490314|ref|XP_001703124.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158270754|gb|EDO96589.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 490

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 90/338 (26%), Positives = 136/338 (40%), Gaps = 59/338 (17%)

Query: 30  RGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFN 89
           R + ++  +F +  + S     D  KAD +++P    RM  D R        F+S  +  
Sbjct: 122 RLDRSTHIHFYQRLLGSGARIADGDKADWYYIPIR-QRMTADSR--------FLSEAVAY 172

Query: 90  ISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS--------- 140
           IS  YP+WNRTGG+ HF +    +G    Q     +L A  +   + + ++         
Sbjct: 173 ISATYPWWNRTGGSRHFVIHTGDLGADETQLG--ARLQAPNITWLTHWGLTMDKVFSGWK 230

Query: 141 -GHIAHKDVSLP--------QIWPRQEDP--PKLGSSKRNKLAFFAGAVNSPVREKLLQV 189
             H   KDV +P        + +  +  P  P +   +R    FFAG +    +      
Sbjct: 231 KAHRPDKDVVIPVFLTPGHFKHFGLERTPLHPLMDKQERTTTFFFAGRICGDRKPPKTGS 290

Query: 190 WRN---DSEIYAHSGR----------------LKTP-YADELLGSKFCLHVKGFEVNTAR 229
           W N    S  Y+   R                L  P Y   L  SKFCL   G      +
Sbjct: 291 WPNCGPRSPGYSAGVRQLVHHHHWDPPGFKVVLHEPNYGAALGSSKFCLAPLGGGHGQRQ 350

Query: 230 IADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQS 289
           I  S + GC+PV IA+    PF    NW  F +  A  DIP L  IL+ +S++EY   Q 
Sbjct: 351 IIVS-FMGCLPVCIADDVYEPFEPQYNWTQFGVRPAESDIPELHTILESVSAKEYAAKQR 409

Query: 290 YVLKVRKHFQWH-----VFPSD--YDAFYMVMYELWLR 320
            +    +HF +      +F  D  YDAF   +  L ++
Sbjct: 410 ALRCAAQHFVYSSIVGGLFGEDGRYDAFETTLEVLRVK 447


>gi|412991334|emb|CCO16179.1| predicted protein [Bathycoccus prasinos]
          Length = 558

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 77/311 (24%), Positives = 134/311 (43%), Gaps = 27/311 (8%)

Query: 32  NYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHY----- 86
            Y +E       M+S   T DP +A+ F++P      +    I T G    +        
Sbjct: 234 QYGTEIRIHANIMQSKMYTLDPLEAEFFYVPV-YGECKLFENIATLGAKKGLQETNAWWL 292

Query: 87  --IFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLN-AIQVVCSSSYFISGHI 143
             +  ++ +YP+WNRT G DH +    + G   + K W+  +  +I +       +S   
Sbjct: 293 EAMKLVTDQYPFWNRTQGRDHVFTFAGARG-PHIFKDWKRHIKKSIFLTPEGDRSLSEQF 351

Query: 144 AH-KDVSLPQIWPRQE----DPPKLGSSKRNK-LAFFAGAV--------NSPVREKLLQV 189
              KD+ +P + P +        K    KR K  A+F G +        +  +R K+ + 
Sbjct: 352 NTWKDIVIPGLEPEKAFWSGSLRKQKEVKRAKTFAYFRGTIANKLGKQYSKGIRIKMKEA 411

Query: 190 WRNDSEIY---AHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANH 246
           +++  ++     HS   KT Y +E+  S FCL  +G+   T R   +L  GC+PVIIA+ 
Sbjct: 412 FKDIKDVVFTEQHSSCDKTCYREEMRASTFCLCPRGWSPWTLRAYQALMVGCIPVIIADE 471

Query: 247 YDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSD 306
            + P+ +  +W+  SI +          IL+ +  +     +  + K      W    +D
Sbjct: 472 IEFPYENSFDWRQVSIKIPEKRHLETIDILRSVPDDVVERKRKAMAKFWPSVAWKKPAAD 531

Query: 307 YDAFYMVMYEL 317
            DAF++VM EL
Sbjct: 532 DDAFHLVMKEL 542


>gi|168032656|ref|XP_001768834.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162679946|gb|EDQ66387.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 368

 Score = 87.8 bits (216), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 67/236 (28%), Positives = 115/236 (48%), Gaps = 21/236 (8%)

Query: 95  PYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLP- 151
           P W  + GADH  +  H    +AM    E   +A+ VV     +    +A+  KDV  P 
Sbjct: 109 PAWRASNGADHVLIIHHP---NAMVYKREQFRSAMFVVADFGRY-DAEVANIAKDVVAPY 164

Query: 152 -QIWPRQEDPPKLGSS--KRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLK 204
             I P  +D     SS   R  L FF GA+       +R+KL ++ R++S++   +G   
Sbjct: 165 KHIIPNFDDDIDSVSSFNTRTTLLFFQGAIVRKEGGIIRQKLYELLRDESDVVFVNGTTT 224

Query: 205 TP----YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSF 260
           +         +  SKFCLH++G   ++ R+ D++   CVP+I+++  +LPF DV+N+  F
Sbjct: 225 SAGIRSATSGMRQSKFCLHMEGDTPSSNRLFDAVASHCVPLIVSDDIELPFEDVINYTEF 284

Query: 261 SIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
            + V + D      L  +L+    +E+  +   + +V+KHF++ +     DA  M 
Sbjct: 285 CLFVNSSDALRKGFLTNLLRNFGEKEWTRMHDRMREVQKHFEYQLPSEIGDAVQMT 340


>gi|326490259|dbj|BAJ84793.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 498

 Score = 87.8 bits (216), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 82/285 (28%), Positives = 137/285 (48%), Gaps = 46/285 (16%)

Query: 52  DPSKADLFFLPF--SIARMRHDRR-------------IGT----EGIPDFISHYIFNISQ 92
           DP +ADLF++PF  S++ + +  R             +G     E + D +  ++    +
Sbjct: 171 DPVEADLFYVPFFSSLSLVVNPIRPPAAANASEAAATVGPWYSDEAMQDELVEWL----E 226

Query: 93  KYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSL 150
           +  YW R  G DH ++ C     +A+ +  +   NA+ +V S    + G  A   KDV L
Sbjct: 227 RQSYWRRYRGRDHVFI-CQDP--NALYRVVDRISNAVLLV-SDFGRLRGDQASLVKDVIL 282

Query: 151 P---QIWPRQEDPPKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIYAHSG-- 201
           P   +I P + D   +    R  L FF G         +R+ L QV  N+ ++    G  
Sbjct: 283 PYSHRINPFKGD---VNVDSRPALLFFMGNRYRKEGGKIRDTLFQVLENEGDVIIKHGAQ 339

Query: 202 -RLKTPYADELL-GSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS 259
            R+    A + +  SKFCLH  G   +  R+ D+L   CVPVI+++H +LPF DV+++ +
Sbjct: 340 SRVSRRMATQGMHSSKFCLHPAGDTPSACRLFDALVSLCVPVIVSDHIELPFEDVIDYSN 399

Query: 260 FSIVVAT---LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
            SI V T   +    L  +L+ +SSE  L  Q  + +V+ +F++ 
Sbjct: 400 ISIFVDTSKAIQPGFLTSMLRKVSSERILEYQREIQRVKHYFEYE 444


>gi|326437066|gb|EGD82636.1| hypothetical protein PTSG_03293 [Salpingoeca sp. ATCC 50818]
          Length = 366

 Score = 87.8 bits (216), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 54/157 (34%), Positives = 80/157 (50%), Gaps = 15/157 (9%)

Query: 2   NRSFRVYVYPHRRNDPFANVLLPV-DFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           +R  RVY+YP     P +  L P  D++    YA+E+ F +    S F T  P +A LFF
Sbjct: 103 SRHLRVYMYPL----PESLQLPPTRDYK----YAAEATFTRMLRASTFSTDSPEEAQLFF 154

Query: 61  LPFSIARMR---HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG--- 114
           +  S A  R    DR  G        +  + ++ Q+YPYWNRT G DHF+V  H +G   
Sbjct: 155 VRVSCAEARFTQRDREAGQRAADAHATAVLAHVQQRYPYWNRTQGRDHFFVCGHDMGAAP 214

Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLP 151
           R+A  + +    N I +V ++      ++ HKD+SLP
Sbjct: 215 RTAAARMFPSARNMIALVNTADVTEPDYVVHKDISLP 251



 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 35/106 (33%), Positives = 52/106 (49%), Gaps = 7/106 (6%)

Query: 222 GFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISS 281
           G    T R+ D+++ GCVPV IA+HYD P A  ++W   ++ +A  D   +K  L+  + 
Sbjct: 255 GDGCPTPRLMDAIWAGCVPVFIADHYDPPLAKYVDWALLAVFIAEADAAHIKAHLEMDAR 314

Query: 282 EEYLLLQSYVLKVRKHFQW-------HVFPSDYDAFYMVMYELWLR 320
             Y    +Y+ +VR    W       H       AF +VM EL LR
Sbjct: 315 TMYAHRSAYIARVRDRLTWWDPAQRQHTMGRSTSAFDLVMLELALR 360


>gi|60657602|gb|AAX33322.1| secondary cell wall-related glycosyltransferase family 47 [Populus
           tremula x Populus tremuloides]
          Length = 442

 Score = 87.4 bits (215), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 86/330 (26%), Positives = 133/330 (40%), Gaps = 62/330 (18%)

Query: 33  YASESYFKKAFMKSHFV-TKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISH 85
           +ASE    KA   S  V T DP +AD FF+P  ++           G P        +S 
Sbjct: 114 FASEVAIHKALSSSLDVRTFDPYEADFFFVPVYVS----CNFSTVNGFPAIGHARSLLSS 169

Query: 86  YIFNISQKYPYWNRTGGADHFYVACHSIG--------------------RSAMQKAWEVK 125
            +  IS  YP+WNR+ G+DH +VA H  G                    RS + + + VK
Sbjct: 170 AVQLISSNYPFWNRSQGSDHVFVASHDYGACFHAMEERAMEDGIPEFLKRSIILQTFGVK 229

Query: 126 LN----AIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAV--- 178
            N     ++ V    Y   G +       P             + +R+  AFF G +   
Sbjct: 230 FNHPCQDVENVVIPPYISPGSVRTTLEKYPL------------TGRRDIWAFFRGKMEVH 277

Query: 179 ---------NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTAR 229
                    +  VR  + + +  D   Y    R    Y  E++ S FCL   G+   + R
Sbjct: 278 PKNISGRYYSKKVRTVIWRKYSGDRRFYLQRHRF-AGYQSEIVRSVFCLCPLGWAPWSPR 336

Query: 230 IADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQS 289
           + +S+  GCVPVIIA+   LPF   + W   S+ VA  D+  L  +L  +++     +Q 
Sbjct: 337 LVESVALGCVPVIIADGIRLPFPTAVRWSEISLTVAEKDVANLGTLLDQVAATNLSAIQK 396

Query: 290 YVL--KVRKHFQWHVFPSDYDAFYMVMYEL 317
            +    VR+   ++      DA + V+Y L
Sbjct: 397 NLWDPDVRRALLFNDPVQGGDATWQVLYAL 426


>gi|222619798|gb|EEE55930.1| hypothetical protein OsJ_04615 [Oryza sativa Japonica Group]
          Length = 401

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 82/331 (24%), Positives = 136/331 (41%), Gaps = 44/331 (13%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T DP +AD F+ P ++   +           P  +   +  ++
Sbjct: 83  FAAEIFMHQFLLSSPVRTLDPEEADWFYTPAYTTCDLTPQGFPLPFRAPRIMRSAVRYVA 142

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS-AMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSL 150
             +PYWNRT GADHF++A H  G     Q+   ++   + V+  ++   +    H     
Sbjct: 143 ATWPYWNRTDGADHFFLAPHDFGACFHYQEERAIERGILPVLRRATLVQTFGQRHHPCLQ 202

Query: 151 PQ--IWPRQEDPPKL------GSSKRNKLAFFAGAVNSPVREKLLQVWRNDSE--IYAHS 200
           P     P   DP K+       ++ R+   +F G         L     ND E   YA  
Sbjct: 203 PGSITVPPYADPRKMEAHRISPATPRSIFVYFRG---------LFYDMGNDPEGGYYARG 253

Query: 201 GRLKT--PYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWK 258
            R      + D  L       +      T R+ +++ +GC+PVIIA+   LPFAD + W 
Sbjct: 254 ARASVWENFKDNPL-----FDISTEHPATPRLVEAVVFGCIPVIIADDIVLPFADAIPWG 308

Query: 259 SFSIVVATLDIPLLKKILKGISSEEYL----LLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
             S+ VA  D+P L  IL  +  +E +    LL S  +K    F     P   DAF+ ++
Sbjct: 309 EISVFVAEEDVPRLDTILASVPLDEVIRKQRLLASPAMKQAVLFHQPARPG--DAFHQIL 366

Query: 315 YEL----------WLRRSSVRVQWSTSLDSN 335
             L          +L      + W   L+++
Sbjct: 367 NGLARKLPHPKGVFLEPGEKGIDWDQGLEND 397


>gi|357475227|ref|XP_003607899.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
 gi|355508954|gb|AES90096.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
          Length = 481

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 69/272 (25%), Positives = 129/272 (47%), Gaps = 27/272 (9%)

Query: 52  DPSKADLFFLPF--SIARMRHD-RRIGTEGIPDFISH-------YIFNISQKYPYWNRTG 101
           DP +ADLFF+PF  S++ + +  R  G+  +P+  ++        +    +   +W R+ 
Sbjct: 163 DPEEADLFFVPFFSSLSLIVNPVRPAGSGSVPEKTAYSDEENQEALMEWLEMQEFWKRSK 222

Query: 102 GADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYF--ISGHIAHKDVSLPQIWPRQED 159
           G DH  VA      +AM +  +   N + +V          G +  KDV +P     +  
Sbjct: 223 GRDHVIVASDP---NAMYRVVDRVKNCVLLVSDFGRLRPDQGSLV-KDVIVPYSHRIRTY 278

Query: 160 PPKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIY----AHSGRLKTPYADEL 211
              +G  KRN L FF G         +R+ L Q+   + ++     A S   +   +  +
Sbjct: 279 DGGIGVDKRNTLLFFMGNRYRKEGGKIRDTLFQILEKEDDVIIKHGAQSRESRRAASQGM 338

Query: 212 LGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT---LD 268
             SKFCLH  G   +  R+ D++   CVPVI+++  +LPF D ++++  ++ V T   + 
Sbjct: 339 HTSKFCLHPAGDTPSACRLFDAIVSLCVPVIVSDSIELPFEDTIDYRKIAVFVETAAAIQ 398

Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
              L  IL+G++ +  +  Q  + +V+++F++
Sbjct: 399 PGYLVSILRGMAPDRIVEYQKELKEVKRYFKY 430


>gi|223975191|gb|ACN31783.1| unknown [Zea mays]
 gi|414887161|tpg|DAA63175.1| TPA: exostosin-like protein [Zea mays]
          Length = 500

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 78/273 (28%), Positives = 123/273 (45%), Gaps = 26/273 (9%)

Query: 52  DPSKADLFFLPF--SIARMRHDRR---------IGTEGIPDFISHYIFNISQKYPYWNRT 100
           DPS ADLF++PF  S++ + +  R          GT    + +   +    ++  YW R 
Sbjct: 176 DPSDADLFYVPFFSSLSLVVNPIRSPLAANASGAGTAYSDEALQEELLEWLERQLYWQRH 235

Query: 101 GGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-KDVSLPQIWPRQED 159
            G DH ++ C     +A+ +  +   NA+ +V       S   +  KDV LP        
Sbjct: 236 RGRDHVFI-CQDP--NALYRVVDRISNAVLLVSDFGRLRSDQASLVKDVILPYSHRINSF 292

Query: 160 PPKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----L 211
             ++G   R  L FF G         VR+ L Q+  N+ ++    G        E    +
Sbjct: 293 KGEVGVDGRPSLLFFMGNRYRKEGGKVRDALFQILENEDDVTIKHGTQSRESRREATQGM 352

Query: 212 LGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT---LD 268
             SKFCLH  G   +  R+ D+L   CVPVI +++ +LPF D++++   SI V T   + 
Sbjct: 353 HSSKFCLHPAGDTPSACRLFDALVSLCVPVIASDYIELPFEDIIDYNKISIFVGTSKAVQ 412

Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
              L   L+ ISSE  L  Q  + KVR +F++ 
Sbjct: 413 PGYLTSTLRRISSERILEYQREIKKVRHYFEYE 445


>gi|449469226|ref|XP_004152322.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10-like [Cucumis
           sativus]
          Length = 388

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 78/306 (25%), Positives = 129/306 (42%), Gaps = 48/306 (15%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISHY 86
           +A+E Y  +  + S   T +P +AD F+ P  +        +   G+P        +   
Sbjct: 78  FAAEIYMHRFLLNSPVRTLNPDEADWFYTPIYVTC-----DLTPNGLPLPFKSPRMMRSA 132

Query: 87  IFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHK 146
           I  IS  +PYWNRT GADHF+V  H  G                     + F        
Sbjct: 133 IQLISSNWPYWNRTEGADHFFVVPHDFG---------------------ACFHYQEEKAI 171

Query: 147 DVSLPQIWPRQEDPPKLGS-SKRNKLAFFAGAV----NSP-----VREKLLQVWRN--DS 194
           D  +P   P++     + S + R+   +F G      N P      R     VW N  ++
Sbjct: 172 DRGIPYCPPQKMKTHLIPSETPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN 231

Query: 195 EIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
            ++  S    T Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   LPFAD 
Sbjct: 232 PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 291

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---DAFY 311
           + W+   + V   D+  L  IL  I   + +L +  +L      +  +FP      DAF+
Sbjct: 292 IPWEEIGVFVDEKDVSNLDTILTSIPP-DVILRKQRLLANPSMKRAMMFPQPAQSGDAFH 350

Query: 312 MVMYEL 317
            ++  L
Sbjct: 351 QILNGL 356


>gi|449439621|ref|XP_004137584.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
           IRX7-like [Cucumis sativus]
          Length = 494

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 73/283 (25%), Positives = 133/283 (46%), Gaps = 23/283 (8%)

Query: 50  TKDPSKADLFFLPF--SIARMRHDRRIGTEGIP--DFISHYIFNISQKYPYWNRTGGADH 105
            KD S+AD+ F+PF  S++  +H +  G E I     +   + +       W RTGG +H
Sbjct: 185 VKDSSQADVIFVPFFSSLSYNQHSKSHGKEKINVNKILQQKLIDFLFGQKEWRRTGGKNH 244

Query: 106 FYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLPQIWPRQEDPPKL 163
             +A H    ++M  A + KL +   V +        IA+  KD+  P     +  P   
Sbjct: 245 LVIAHHP---NSMLDARK-KLGSAMFVLADFGRYPAAIANIEKDIIAPYRHIVKTVPSSK 300

Query: 164 GSS--KRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLK----TPYADELLG 213
            ++  +R  L +F GA+       VR++L  + +++ +++   G +K          +  
Sbjct: 301 SATFDERPILVYFQGAIYRKDGGVVRQELYYLLKDEEDVHFTFGSVKGNGINKAGQGMAS 360

Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---P 270
           SKFCL++ G   ++ R+ DS+   CVPVII++  +LP+ D+L++  F + V   D     
Sbjct: 361 SKFCLNIAGDTPSSNRLFDSIASHCVPVIISDDIELPYEDILDYSEFCVFVRAADSIRKG 420

Query: 271 LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
            L  +L+GI  E +  +   + ++   F++       DA  M+
Sbjct: 421 YLLNLLRGIGRERWTKMWDRIKEIVHEFEYQYPSQSGDAVDMI 463


>gi|449533617|ref|XP_004173769.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
           IRX7-like, partial [Cucumis sativus]
          Length = 330

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 73/283 (25%), Positives = 133/283 (46%), Gaps = 23/283 (8%)

Query: 50  TKDPSKADLFFLPF--SIARMRHDRRIGTEGIP--DFISHYIFNISQKYPYWNRTGGADH 105
            KD S+AD+ F+PF  S++  +H +  G E I     +   + +       W RTGG +H
Sbjct: 21  VKDSSQADVIFVPFFSSLSYNQHSKSHGKEKINVNKILQQKLIDFLFGQKEWRRTGGKNH 80

Query: 106 FYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLPQIWPRQEDPPKL 163
             +A H    ++M  A + KL +   V +        IA+  KD+  P     +  P   
Sbjct: 81  LVIAHHP---NSMLDARK-KLGSAMFVLADFGRYPAAIANIEKDIIAPYRHIVKTVPSSK 136

Query: 164 GSS--KRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLK----TPYADELLG 213
            ++  +R  L +F GA+       VR++L  + +++ +++   G +K          +  
Sbjct: 137 SATFDERPILVYFQGAIYRKDGGVVRQELYYLLKDEEDVHFTFGSVKGNGINKAGQGMAS 196

Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---P 270
           SKFCL++ G   ++ R+ DS+   CVPVII++  +LP+ D+L++  F + V   D     
Sbjct: 197 SKFCLNIAGDTPSSNRLFDSIASHCVPVIISDDIELPYEDILDYSEFCVFVRAADSIRKG 256

Query: 271 LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
            L  +L+GI  E +  +   + ++   F++       DA  M+
Sbjct: 257 YLLNLLRGIGRERWTKMWDRIKEIVHEFEYQYPSQSGDAVDMI 299


>gi|195611602|gb|ACG27631.1| exostosin-like [Zea mays]
          Length = 456

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 77/273 (28%), Positives = 123/273 (45%), Gaps = 26/273 (9%)

Query: 52  DPSKADLFFLPF--SIARMRHDRR---------IGTEGIPDFISHYIFNISQKYPYWNRT 100
           DPS ADLF++PF  S++ + +  R          GT    + +   +    ++  YW R 
Sbjct: 132 DPSDADLFYVPFFSSLSLVVNPIRSPLAANASGAGTAYSDEALQEELLEWLERQLYWQRH 191

Query: 101 GGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-KDVSLPQIWPRQED 159
            G DH ++ C     +A+ +  +   NA+ +V       S   +  KDV LP        
Sbjct: 192 RGRDHVFI-CQDP--NALYRVVDRISNAVLLVSDFGRLRSDQASLVKDVILPYSHRINSF 248

Query: 160 PPKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIYAHSG----RLKTPYADEL 211
             ++G   R  L FF G         VR+ L Q+  N+ ++    G      +      +
Sbjct: 249 KGEVGVDGRPSLLFFMGNRYRKEGGKVRDALFQILENEDDVTIKHGTQSRESRRAATQGM 308

Query: 212 LGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT---LD 268
             SKFCLH  G   +  R+ D+L   CVPVI +++ +LPF D++++   SI V T   + 
Sbjct: 309 HSSKFCLHPAGDTPSACRLFDALVSLCVPVIASDYIELPFEDIIDYNKISIFVGTSKAVQ 368

Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
              L   L+ ISSE  L  Q  + KVR +F++ 
Sbjct: 369 PGYLTSTLRRISSERILEYQREIKKVRHYFEYE 401


>gi|255547405|ref|XP_002514760.1| catalytic, putative [Ricinus communis]
 gi|223546364|gb|EEF47866.1| catalytic, putative [Ricinus communis]
          Length = 490

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 70/280 (25%), Positives = 135/280 (48%), Gaps = 21/280 (7%)

Query: 52  DPSKADLFFLPF--SIARMRHDRRIGTEGIP--DFISHYIFNISQKYPYWNRTGGADHFY 107
           + S+AD+ F+P+  S++  RH +  G E +     + + +         W R+GG DH  
Sbjct: 187 NSSQADIIFVPYFSSLSYNRHSKLHGKEKVSMNKMLQNRLVEFLMGQDEWKRSGGRDHLI 246

Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLPQIWPRQEDPPKLGS- 165
           VA H    ++M  A ++   A+ V+     Y +      KDV  P     +  P    + 
Sbjct: 247 VAHHP---NSMLDARKMLGAAMFVLADFGRYPVEIANLKKDVIAPYKHVVRTIPSGESAQ 303

Query: 166 -SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKF 216
             +R  L FF GA+       +R++L  + +++ +++   G ++    ++    +  SKF
Sbjct: 304 FEERPILVFFQGAIYRKDGGIIRQELYYLLKDEKDVHFTFGTVRKNGVNKAGQGMASSKF 363

Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLK 273
           CL++ G   ++ R+ D++   CVPVII++  +LPF DVL++  FS+ V   D      L 
Sbjct: 364 CLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPFEDVLDYSEFSVFVRASDAVKEGYLL 423

Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
            +L+ I  +++ ++   + ++  HF++       DA  M+
Sbjct: 424 NLLQSIDRDKWTMMWERLKEIAPHFEYQYPSQSGDAVDMI 463


>gi|255565307|ref|XP_002523645.1| catalytic, putative [Ricinus communis]
 gi|223537097|gb|EEF38731.1| catalytic, putative [Ricinus communis]
          Length = 452

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 79/291 (27%), Positives = 131/291 (45%), Gaps = 37/291 (12%)

Query: 52  DPSKADLFFLP-FSIARMRHD--RRIGTEGIPDFISHY--------IFNISQKYPYWNRT 100
           DP +ADLF++P FS   +  +  R  GTE  P  + HY        +    ++  YW R 
Sbjct: 131 DPDEADLFYVPVFSSLSLIVNPVRPAGTE--PGLVQHYSDEEMQEQLVEWLEQQEYWKRN 188

Query: 101 GGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYF--ISGHIAHKDVSLPQIWPRQE 158
            G DH  +A      +A+ +  +   NAI ++          G +  KD+ +P       
Sbjct: 189 NGRDHVIIAGDP---NALYRVLDRVKNAILLLSDFGRVRPDQGSLV-KDIIVPYSHRINV 244

Query: 159 DPPKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIYAHSGRL----KTPYADE 210
               +G   RN L FF G         +R+ L Q+  ++ ++    G      +   +  
Sbjct: 245 YNGDIGVRDRNTLLFFMGNRYRKDGGKIRDLLFQMLESEEDVVIKHGTQSRENRRAASRG 304

Query: 211 LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI- 269
           +  SKFCL+  G   +  R+ DS+   CVPVI+++  +LPF DV+++   +I V T D  
Sbjct: 305 MHTSKFCLNPAGDTPSACRLFDSIVSLCVPVIVSDSIELPFEDVIDYTKIAIFVETTDSL 364

Query: 270 --PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELW 318
               L K+L+ ++SE  L  Q  + KV ++F+       YD     + E+W
Sbjct: 365 KPGYLVKLLREVTSERILEYQKELKKVTRYFE-------YDNSNGTVNEIW 408


>gi|224129654|ref|XP_002320639.1| predicted protein [Populus trichocarpa]
 gi|222861412|gb|EEE98954.1| predicted protein [Populus trichocarpa]
          Length = 375

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 77/297 (25%), Positives = 138/297 (46%), Gaps = 23/297 (7%)

Query: 50  TKDPSKADLFFLPF--SIARMRHDRRIGTE--GIPDFISHYIFNISQKYPYWNRTGGADH 105
            ++ S+AD+ F+PF  S++  RH +  G E   +   +   +         W R GG DH
Sbjct: 73  VQNSSQADIVFVPFFSSLSYNRHSKLHGKEKVSVNKMLQTKLVQFLTARDEWKRFGGNDH 132

Query: 106 FYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLPQIWPRQEDPPKL 163
             VA H    ++M  A + KL +   V +        IA+  KD+  P     +  P   
Sbjct: 133 LIVAHHP---NSMLHARK-KLGSAMFVLADFGRYPVEIANLGKDIIAPYKHVVRTIPSGE 188

Query: 164 GSS--KRNKLAFFAGAV----NSPVREKLLQVWRNDSEIY----AHSGRLKTPYADELLG 213
            +   +R  L  F GA+       +R++L  + +++ +++     + G      A  +  
Sbjct: 189 SAQFDRRPILMHFQGAIYRKDGGAIRQELYYLLKDEKDVHFTFGTYRGNGIKKAAQGMAS 248

Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---P 270
           SKFCL++ G   ++ R+ D++   CVPVII++  +LPF DVL++  F + V   D     
Sbjct: 249 SKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIELPFEDVLDYSEFCLFVRASDAVKKG 308

Query: 271 LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
            L  +L+GI  +++  L   + ++  HF++       DA  MV   +  + SSV+ +
Sbjct: 309 YLLDLLRGIEKDQWTKLWERLKEIAPHFEYSYPSQPGDAVDMVWKAVLRKTSSVQFK 365


>gi|168051833|ref|XP_001778357.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162670236|gb|EDQ56808.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 406

 Score = 84.3 bits (207), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 74/313 (23%), Positives = 144/313 (46%), Gaps = 23/313 (7%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEG----IPDFISHYIF 88
           Y  + + +K   ++    +DP +AD+FF+PF  +   ++   G EG    +   +   + 
Sbjct: 85  YLLDGWDRKDGRRAAIRVRDPYQADVFFVPFFASLSFNNYGYGMEGPGAELDKNLQECVV 144

Query: 89  NISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIA--HK 146
           NI     +W  + G DH  V  H        + +   LN+  ++ +     S  +A   K
Sbjct: 145 NILLNSKWWKASQGRDHVIVLHHPNAF----RHYRHLLNSSMLIVADFGRFSTDVACLQK 200

Query: 147 DVSLP--QIWPRQEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEI-YAH 199
           D+  P   +     D      S+R+ L +F G +    +  VR KL +   N+ ++ Y  
Sbjct: 201 DIVAPYEHVVQSYVDDHSNSFSQRHILLYFQGRIHRKADGIVRAKLAKALMNEKDVHYMD 260

Query: 200 SGRLKTPYADELLG---SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLN 256
           S       A+   G   S+FCLH  G   ++ R+ D++   CVPVI+++  +LPF D ++
Sbjct: 261 SEASSEALAEATSGMRSSRFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDRIELPFEDDID 320

Query: 257 WKSFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
           +  FS+  ++ +      L +IL+GI+  ++  + + +  V  HF++       DA  M+
Sbjct: 321 YNEFSLFFSSEEAVRPQYLLRILRGINETKWTQMWTKLKAVSHHFEFQHPAKKDDAVNMI 380

Query: 314 MYELWLRRSSVRV 326
             ++  +  S+++
Sbjct: 381 FKQVQRKLPSMKL 393


>gi|168010648|ref|XP_001758016.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162690893|gb|EDQ77258.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 481

 Score = 84.3 bits (207), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 77/292 (26%), Positives = 133/292 (45%), Gaps = 32/292 (10%)

Query: 48  FVTKDPSKADLFFLPFSIARMRHDRRIGTE----GIPDFIS-------HYIFNISQKYPY 96
           F   D   ADL F+PF  A + ++R   +E    G  D +          +    ++ P 
Sbjct: 169 FRVNDWRAADLMFVPF-FASLAYNRYTKSEHKVGGELDLVGDKNQKLQEKLLKFLEQQPA 227

Query: 97  WNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLP--Q 152
           W  +GG+DH  V  H     AM+  +     AI +V     + S  +A+  KDV  P   
Sbjct: 228 WQASGGSDHIVVIHHPNSFHAMRNFFS---KAIFIVADFGRYPS-EVANLRKDVVAPYKH 283

Query: 153 IWPRQEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTP-- 206
           + P   D       +R  L FF G +       +R++L ++ +N+  ++   G   +   
Sbjct: 284 VIPSFVDD-STPFEEREILLFFQGTIVRKQGGVIRQQLYEMLKNEKGVHFEEGSAGSAGI 342

Query: 207 --YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV 264
                 +  SK CL++ G   ++ R+ D++   CVPVII++  +LPF D L++  FSI +
Sbjct: 343 HSATTGMRRSKCCLNIAGDTPSSNRLFDAIASHCVPVIISDEIELPFEDELDYSGFSIFI 402

Query: 265 ATLDIPLLK---KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
            + D    K    +++ +S +E++ L   + +V  HF++      YDA  MV
Sbjct: 403 NSTDAVQEKFVINLIRSVSRKEWMRLWKRLKEVSLHFEYQHPTKPYDAVNMV 454


>gi|356577103|ref|XP_003556667.1| PREDICTED: probable glucuronosyltransferase Os02g0520750-like
           [Glycine max]
          Length = 459

 Score = 84.0 bits (206), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 71/274 (25%), Positives = 126/274 (45%), Gaps = 29/274 (10%)

Query: 52  DPSKADLFFLPF--SIARMRHDRRI--GTEGIPDFI------SHYIFNISQKYPYWNRTG 101
           DP +ADLFF+PF  S++ + +  R      G+   +         +    +K  YW R  
Sbjct: 140 DPEEADLFFVPFFSSLSLIVNPVRPPGSNSGLEKPVYSDEENQEALVEWLEKQEYWKRNN 199

Query: 102 GADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYF--ISGHIAHKDVSLPQIWPRQED 159
           G DH  VA      +AM +  +   NA+ +V          G +  KDV +P     +  
Sbjct: 200 GRDHVIVASDP---NAMYRVIDRVRNAVLLVSDFGRLRPDQGSLV-KDVVVPYSHRIRTY 255

Query: 160 PPKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIY----AHSGRLKTPYADEL 211
           P  +G   R  L FF G         +R+ L Q+  N+ ++     A S   +   +  +
Sbjct: 256 PGDVGVEDRKTLLFFMGNRYRKEGGKIRDLLFQILENEKDVIIKHGAQSRESRRAASHGM 315

Query: 212 LGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI-- 269
             SKFCLH  G   +  R+ D++   C+PVI++++ +LPF D ++++  ++ V T     
Sbjct: 316 HTSKFCLHPAGDTPSACRLFDAIVSLCIPVIVSDNIELPFEDTIDYRKIAVFVETSSAIK 375

Query: 270 --PLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
              LL K L+ ++ +  L  Q  + +V+++F++ 
Sbjct: 376 PGHLLSK-LRAVTPDRVLEYQKKLKEVKRYFEYE 408


>gi|168014691|ref|XP_001759885.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689015|gb|EDQ75389.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 383

 Score = 84.0 bits (206), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 68/250 (27%), Positives = 122/250 (48%), Gaps = 20/250 (8%)

Query: 92  QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSL 150
           +K+P W  + G +H  V  H    ++MQ   +   NA+ VV     Y        KDV  
Sbjct: 131 EKHPAWKASNGKNHVMVIHHP---NSMQAVRDRLRNALYVVSDFGRYENETANIRKDVVA 187

Query: 151 P--QIWPRQEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLK 204
           P   + P   D        R+ + +F G++       +R +L  + +++ +++  +G   
Sbjct: 188 PYKHVLPTFTDDSS-SFHTRSTVVYFQGSIVRKEGGKIRHELYDLLKDEPDVHFTTGITA 246

Query: 205 TP----YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSF 260
           +         +  S+FCL++ G   ++ R+ DS+   CVPVII++  +LPF D LN+ SF
Sbjct: 247 SEGFHSATRGMRSSRFCLNLAGDTPSSNRLFDSIASHCVPVIISDDLELPFEDDLNYSSF 306

Query: 261 SIVVAT---LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPS-DYDAFYMVMYE 316
            I + +   L    +  +L+ +SSEE+ L+   +L V +HF++  FPS   DA  MV   
Sbjct: 307 CIFINSTRALQPGYVINLLRNVSSEEWTLMWERLLVVERHFEYQ-FPSVANDAVNMVWKA 365

Query: 317 LWLRRSSVRV 326
           +  +  ++R+
Sbjct: 366 IARKLPAIRL 375


>gi|224120296|ref|XP_002318294.1| predicted protein [Populus trichocarpa]
 gi|222858967|gb|EEE96514.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 69/269 (25%), Positives = 126/269 (46%), Gaps = 23/269 (8%)

Query: 52  DPSKADLFFLPF--SIARMRHDRRIGTEGIPDFI------SHYIFNISQKYPYWNRTGGA 103
           DP +ADLF++P   S++   +  ++G   + D +         +    +K  YW R  G 
Sbjct: 62  DPEEADLFYVPVFSSLSLTVNPVQVGKVPVSDPVYSDEKMQDELVEWLEKQEYWRRNNGR 121

Query: 104 DHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-KDVSLPQIWPRQEDPPK 162
           DH   A      +A+ +  +   NA+ ++       S   +  KDV +P           
Sbjct: 122 DHVLFAGDP---NALYRVLDRVKNAVLLLSDFGRVRSDQGSLVKDVIVPYAHRINVYNGD 178

Query: 163 LGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIY-AHSGRLKTPYADELLG---S 214
           +G  +R  L FF G         +R+ L Q+   + ++  +H  + +       LG   S
Sbjct: 179 IGVDERKTLLFFMGNRYRKDGGKIRDMLFQLLEKEEDVLISHGTQSRESRRTATLGMHTS 238

Query: 215 KFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT---LDIPL 271
           KFCL+  G   +  R+ DS+   CVP+I+++  +LPF DV++++  +I V T   L    
Sbjct: 239 KFCLNPAGDTPSACRLFDSIVSLCVPLIVSDSIELPFEDVIDYRKIAIFVDTESSLKPGY 298

Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQW 300
           L K+L+ +S+E  L  Q  + +V+++F++
Sbjct: 299 LVKLLRAVSTERILEYQKEMREVKRYFEY 327


>gi|15230692|ref|NP_190126.1| exostosin family protein [Arabidopsis thaliana]
 gi|6967106|emb|CAB72489.1| putative protein [Arabidopsis thaliana]
 gi|332644505|gb|AEE78026.1| exostosin family protein [Arabidopsis thaliana]
          Length = 475

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 65/240 (27%), Positives = 122/240 (50%), Gaps = 21/240 (8%)

Query: 97  WNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLPQ--- 152
           W R+GG DH  +A H    ++M  A      A+ ++     Y  +     KDV  P    
Sbjct: 206 WKRSGGRDHVVLAHHP---NSMLDARNKLFPAMFILSDFGRYPPTVANVEKDVIAPYKHV 262

Query: 153 IWPRQEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYA 208
           I   + D    G   R  L +F GA+       VR++L  + +++ +++   G ++    
Sbjct: 263 IKAYENDTS--GFDSRPILLYFQGAIYRKDGGFVRQELFYLLQDEKDVHFSFGSVRNGGI 320

Query: 209 DE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV 264
           ++    +  SKFCL++ G   ++ R+ D++   CVPVII++  +LPF DV+++  FS+ V
Sbjct: 321 NKASQGMHNSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIELPFEDVIDYSEFSVFV 380

Query: 265 ATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
            T D      L  +++GI+ EE+  + + + +V K++++H FPS  D    ++++   R+
Sbjct: 381 RTSDALKENFLVNLIRGITKEEWTRMWNRLKEVEKYYEFH-FPSKVDDAVQMIWQAIARK 439


>gi|24476038|gb|AAN62780.1| Unknown protein [Oryza sativa Japonica Group]
          Length = 449

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 82/341 (24%), Positives = 141/341 (41%), Gaps = 65/341 (19%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIA----------RMRHDRRIGTEGIPDF 82
           +A+E    +A +        P  A LFF+P  ++           + H R +  + + D 
Sbjct: 101 FAAEVALHEALLAYAGRAARPDDATLFFVPVYVSCNFSTDNGFPSLSHARALLADAV-DL 159

Query: 83  ISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKA------WEVKLNAI------- 129
           +         + PYWNR+ GADH +VA H  G             +E+ +NA        
Sbjct: 160 VR-------AQMPYWNRSAGADHVFVASHDFGACFHPMELFVIIHFELGVNAKSNLALGQ 212

Query: 130 --------------QVVCSSSYFISG-HIAHKD---VSLPQIWPRQE-DPPKLGSSKRNK 170
                         + +   ++ + G H+  +    V  P + P    + P+   ++R+ 
Sbjct: 213 EDVAIADGIPEFLKRSILLQTFGVQGTHVCQEADHVVIPPHVPPEVALELPEPEKAQRDI 272

Query: 171 LAFFAGAV------------NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCL 218
            AFF G +            +  VR +LLQ +  + + Y    R    Y  E+  S FCL
Sbjct: 273 FAFFRGKMEVHPKNISGRFYSKKVRTELLQKYGRNRKFYLKRKRYGN-YRSEMARSLFCL 331

Query: 219 HVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKG 278
              G+   + R+ +S+  GC+PVIIA+   LPF  VL W   S+ VA  D+  L+ +L  
Sbjct: 332 CPLGWAPWSPRLVESVLLGCIPVIIADDIRLPFPSVLQWLDISLQVAEKDVASLEMVLDH 391

Query: 279 ISSEEYLLLQSYVLK--VRKHFQWHVFPSDYDAFYMVMYEL 317
           + +    ++Q  +     RK   ++    + DA + V+ EL
Sbjct: 392 VVATNLTVIQKNLWDPVKRKALVFNRPMEEGDATWQVLREL 432


>gi|302782213|ref|XP_002972880.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
 gi|300159481|gb|EFJ26101.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
          Length = 459

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 77/286 (26%), Positives = 132/286 (46%), Gaps = 30/286 (10%)

Query: 52  DPSKADLFFLPF--SIARMRHDRR----IGTEGI--PDFISHYIFNISQKYPYWNRTGGA 103
           DP+ AD+FF+PF  S++  R+ R      G  G    D +   +    +    W R GGA
Sbjct: 156 DPATADIFFVPFFSSLSYNRYCRTGHRFQGGRGCVENDRLEKRLVEFLRGQELWRRNGGA 215

Query: 104 DHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLP--QIWPR-QE 158
           DH  V  H    +++  A  +   A+ VV     F S  +A+  KD+  P   + P    
Sbjct: 216 DHVIVMHHP---NSLMVARSLLKEAMFVVADFGRF-SRAVANMRKDIVAPYKHVIPSFAR 271

Query: 159 DPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKT----PYADE 210
           D     S  R  L FF GA+       +R+KL ++ ++   ++  +G  +          
Sbjct: 272 DATTFES--RETLLFFQGAIVRKEGGIIRQKLYEILKDSPGVHFVTGNTQKDGIRSATAG 329

Query: 211 LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT---L 267
           +  +KFCLH+ G   ++ R+ D++   CVPVII++  +LPF D L++  F + V +   L
Sbjct: 330 MRNAKFCLHLAGDTPSSNRLFDAIASHCVPVIISDEIELPFEDELDYSQFCVFVESDKAL 389

Query: 268 DIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
               + + L+ I  +E+    + +  V +HF++       DA +M 
Sbjct: 390 RKGFVVRALERIGRDEWTRKWAMLKSVERHFEYQHPSLPEDAVHMT 435


>gi|356553158|ref|XP_003544925.1| PREDICTED: probable glucuronosyltransferase Os01g0926700-like
           [Glycine max]
          Length = 440

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 92/362 (25%), Positives = 159/362 (43%), Gaps = 40/362 (11%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEP------RGNYASESYFKKAFMKSHFVTKDPSKAD 57
           S ++YVY     D    +L   D +       +G + S+    K  ++S   T    +AD
Sbjct: 66  SLKIYVYQEDEIDGLKELLRGRDAKITDEACLKGQWGSQVKIHKLLLQSKQRTWKKEEAD 125

Query: 58  LFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSA 117
           LFF+P   + ++  R +G     +  S Y+  ISQ  PY+  +GG +H +V     G + 
Sbjct: 126 LFFVP---SYVKCARMMGGLNDKEINSTYVKVISQ-MPYFRLSGGRNHIFVFPSGAG-AH 180

Query: 118 MQKAWEVKLNAIQVVC-----SSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS------- 165
           + K+W   +N   ++      +     S     KD+ +P      +   K G        
Sbjct: 181 LFKSWATYINRSIILTPEGDRTDKRDTSAFNTWKDIIIPG--NIDDGMTKTGDTTVQPLP 238

Query: 166 -SKRNKLAFFAG-AVNSPVREKLLQVWRN--------DSEIYAHSGRLKTPYADELLGSK 215
            SKR  LA + G A     R KL+++ +         D +        +  Y + L  SK
Sbjct: 239 LSKRKYLANYLGRAQGKAGRLKLIELSKQFPEKLECPDLKFSGPDKLGRKEYFEHLRNSK 298

Query: 216 FCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI-PLLKK 274
           FCL  +G    T R  +S +  CVPVI+++  +LPF +V+++   SI   +  I P L +
Sbjct: 299 FCLAPRGESSWTLRFYESFFVECVPVILSDQIELPFQNVIDYSQISIKWPSSQIGPELLQ 358

Query: 275 ILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWSTSLDS 334
            L+ I  EE   ++  + + R+   W V+ SD ++   +   +W  +  VR Q+  S ++
Sbjct: 359 YLESIPDEE---IEKIIARGRQVRCWWVYASDSESCSAMRGIMWELQRKVR-QFHHSAET 414

Query: 335 NW 336
            W
Sbjct: 415 FW 416


>gi|297826983|ref|XP_002881374.1| hypothetical protein ARALYDRAFT_482474 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327213|gb|EFH57633.1| hypothetical protein ARALYDRAFT_482474 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 446

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 78/289 (26%), Positives = 126/289 (43%), Gaps = 37/289 (12%)

Query: 52  DPSKADLFFLP-FS----IARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHF 106
           DP  ADLF++P FS    I         G+    + +   +    +   +W R GG DH 
Sbjct: 129 DPDDADLFYVPVFSSLSLIVNAGRPVEPGSGYSDEKMQEGLMEWLEGQEWWRRNGGRDHV 188

Query: 107 YVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH------KDVSLPQIWPRQEDP 160
             A      +A+ +  +   N++ +V        G + H      KDV +P         
Sbjct: 189 IPAGDP---NALYRILDRVKNSVLLVADF-----GRLRHDQGSFVKDVVIPYSHRVNLFN 240

Query: 161 PKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIYAHSGRL----KTPYADELL 212
            ++G   RN L FF G         VR+ L QV   + ++    G      +      + 
Sbjct: 241 GEIGVQDRNTLLFFMGNRYRKDGGKVRDLLFQVLEKEDDVTIKHGTQSRENRRAATKGMH 300

Query: 213 GSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV---ATLDI 269
            SKFCL+  G   +  R+ DS+   CVPVI+++  +LPF DV++++ FSI V   A L  
Sbjct: 301 TSKFCLNPAGDTPSACRLFDSIVSLCVPVIVSDSIELPFEDVIDYRKFSIFVEANAALQP 360

Query: 270 PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELW 318
             L ++L+ I +++ L  Q  +  VR++F       DYD     + E+W
Sbjct: 361 GFLVQMLRKIKTKKILEYQREMQPVRRYF-------DYDNPNGAVKEIW 402


>gi|302852375|ref|XP_002957708.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
 gi|300257002|gb|EFJ41257.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
          Length = 600

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 77/312 (24%), Positives = 137/312 (43%), Gaps = 55/312 (17%)

Query: 33  YASESYFKKAFMKSHFV-TKDPSKADLFFLP----FSIARMRHDRRIGTEGIPDFISHYI 87
           YA+  YF   F++ H V T++P +A+LF++P    F IA +R+        +P  ++  +
Sbjct: 260 YAAYEYFMTYFLQDHAVRTENPYEANLFYIPMLAYFYIANVRNP-------VPQ-VTLAL 311

Query: 88  FNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVV---CSSSYFISGHIA 144
             +  K+P++NRTGG DHFY    +  R A      ++ + I++V             + 
Sbjct: 312 DYVRTKWPFYNRTGGRDHFYFL--TGDRGACSTPRWLQDSCIKLVHFGLQGEELPGTGVP 369

Query: 145 HKDVSLPQIWPRQEDPP---------------------KLGSSKRNKLAFF--------- 174
           +++    Q+      PP                     K G     KL FF         
Sbjct: 370 NREYGCVQVKRDLVIPPINLFTDLVPSETQAYYKWLVSKKGYDSNRKLLFFFAGGVGQVP 429

Query: 175 --AGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIAD 232
             +G V   ++  L  +     ++    GR+   Y + L  SKFC+   GF     R+  
Sbjct: 430 EYSGGVRQAIKGLLSSLTPKPEDVEFFEGRVHN-YKELLQSSKFCIAPYGFGWGL-RLIQ 487

Query: 233 SLYYGCVPVIIANHYDLPFA---DVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQS 289
           ++ YGC+P+II +H   PF    D L ++ FS+ +  +DIP + ++L+  +  +   L+ 
Sbjct: 488 AIEYGCIPLIIQDHVYQPFERPKDFLPYEEFSVRMGLVDIPYMIELLRSYTEAQLAQLRL 547

Query: 290 YVLKVRKHFQWH 301
            + K  + F W+
Sbjct: 548 GMAKYYQAFIWN 559


>gi|297815694|ref|XP_002875730.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297321568|gb|EFH51989.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 479

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 73/293 (24%), Positives = 144/293 (49%), Gaps = 33/293 (11%)

Query: 52  DPSKADLFFLPF--SIARMR------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGA 103
           + S+AD+ F+PF  S++  R      H +    + +   +  ++    +    W R+GG 
Sbjct: 161 NSSEADVIFVPFFSSLSYNRFSKVNPHQKTSRNKDLQGKLVTFLTGQEE----WKRSGGR 216

Query: 104 DHFYVACHSIGRSAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLPQ---IWPRQED 159
           DH  +A H    ++M  A      A+ ++     Y  +     KD+  P    I   + D
Sbjct: 217 DHVVLAHHP---NSMLDARNKLFPAMFILSDFGRYPPTVANVEKDIIAPYKHVIKAYEND 273

Query: 160 PPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----L 211
               G   R  L +F GA+       VR++L  + +++ +++   G ++    ++    +
Sbjct: 274 TS--GFDSRPILLYFQGAIYRKDGGFVRQELFYLLQDEKDVHFSFGSVRNGGINKASQGM 331

Query: 212 LGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI-- 269
             SKFCL++ G   ++ R+ D++   CVPVII++  +LPF DV+++  F++ V T D   
Sbjct: 332 HNSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIELPFEDVIDYSEFAVFVRTSDALK 391

Query: 270 -PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
              L  +++GIS EE+  + + + +V K++++H FPS  D    ++++   R+
Sbjct: 392 ENFLVNLIRGISKEEWTRMWNRLKEVEKYYEFH-FPSKVDDAVQMIWQAIARK 443


>gi|159485716|ref|XP_001700890.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158281389|gb|EDP07144.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 704

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 83/334 (24%), Positives = 134/334 (40%), Gaps = 62/334 (18%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRR-------IGTEGIP----- 80
           Y  E+   +  + S   T DP +AD F++PF  A M +            T G P     
Sbjct: 310 YGLEAALHEYLLISEHRTFDPEEADYFYVPFYGACMIYPVAGWADYPWFWTPGGPRVMQV 369

Query: 81  -DFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRS-----AMQKAWEVKLNAIQVV-C 133
            + I   +  I ++YP+W R GG DH ++  H  G            W      +     
Sbjct: 370 INMIREIVEWIDKQYPFWKRRGGRDHIWLFTHDEGACWAPSVIKDSVWLTHWGRLDPEHT 429

Query: 134 SSSYFISGHIAHKDVSLPQ--------------------IWPRQEDPP--------KLGS 165
           S++ F+  +  H  V+  Q                    + P  + PP           S
Sbjct: 430 SNTAFVGDNYTHDMVNWRQPEGYIKYIKGHPCYDPQKDLVVPNFKSPPHYVRSPLQSTPS 489

Query: 166 SKRNKLAFFAGAVNS--------PVREKLL-----QVWRNDSE-IYAHSGRLKTPYADEL 211
             R+   FF G V           +R+K+      Q W N  + +    G +   Y+D L
Sbjct: 490 KPRDIFFFFKGDVGKHRLSHYSRGIRQKIYKMAMEQDWANTQKSLIGDGGNVHGDYSDLL 549

Query: 212 LGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPL 271
             S FCL   G +  + R+ D++ +GC+PVIIA+     F  VL+  SF++ VA  D+P 
Sbjct: 550 SRSLFCLVAPG-DGWSPRLEDAVLHGCIPVIIADRVHAVFESVLDIDSFAVRVAEADVPR 608

Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPS 305
           +  IL+ +S  +  L QS + +V   +++   P 
Sbjct: 609 VMDILRAVSDIKIRLKQSRLGQVWHRYRYGALPG 642


>gi|168017794|ref|XP_001761432.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162687438|gb|EDQ73821.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 377

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 78/304 (25%), Positives = 131/304 (43%), Gaps = 30/304 (9%)

Query: 48  FVTKDPSKADLFFLPFSIARMRHDRRIGTE----GIPDFISHYIFNISQKY-------PY 96
           F   D   AD  F+PF  A + +++   TE    G  D +      + +K        P 
Sbjct: 71  FRVSDWKAADYMFVPF-FASVAYNKYTKTEHHAGGELDLVGDKNQKLQEKLLEYLKQQPA 129

Query: 97  WNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLP--QI 153
           W  + G DH  V  H     AM+ ++    N + V+     Y        KDV  P   I
Sbjct: 130 WQASDGCDHILVMHHPNSMHAMRDSFR---NVLFVLADFGRYPPDVANVEKDVVAPYKHI 186

Query: 154 WPRQEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTP--- 206
            P   D        R  L FF G +       +R++L ++ +++  ++   G   +    
Sbjct: 187 IP-SFDNDSSSFEDRETLLFFQGTIVRKQGGVIRQQLYEMLKDEEGVHFEEGSSGSEGVH 245

Query: 207 -YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVA 265
                + GSKFCL++ G   ++ R+ DS+   CVPVII++  +LPF D L++  F + + 
Sbjct: 246 SATSGMRGSKFCLNIAGDTPSSNRLFDSIASHCVPVIISDDIELPFEDELDYSEFCVFIK 305

Query: 266 TLDIPLLK---KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRS 322
           + D    K    +L+ I+  ++  L   +  V +HF++      YDA  MV   +  R  
Sbjct: 306 SEDALKEKYVINLLRSITRVQWTFLWKRLKAVARHFEYQHPTKPYDAVNMVWRAIARRAP 365

Query: 323 SVRV 326
           SV++
Sbjct: 366 SVKL 369


>gi|42571577|ref|NP_973879.1| exostosin-like protein [Arabidopsis thaliana]
 gi|332191986|gb|AEE30107.1| exostosin-like protein [Arabidopsis thaliana]
          Length = 410

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 75/285 (26%), Positives = 126/285 (44%), Gaps = 31/285 (10%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPR------GNYASESYFKKAFMKSHFVTKDPSKAD 57
           S ++YVY     D    +L   D   +      G + S+    K  ++S F T    +AD
Sbjct: 88  SLKIYVYDENEIDGLKELLYGRDGSVKTTACLKGQWGSQVKIHKLLLESKFRTIKKDEAD 147

Query: 58  LFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSA 117
           LFF+P   A ++  R +G     +    Y+  +SQ  PY+ R+GG DH +V     G + 
Sbjct: 148 LFFVP---AYVKCVRMLGGLNDKEINQTYVKVLSQ-MPYFRRSGGRDHIFVFPSGAG-AH 202

Query: 118 MQKAWEVKLNAIQVVC-----SSSYFISGHIAHKDVSLP------QIWPRQEDPPKLGSS 166
           + ++W   +N   ++      +     +   + KD+ +P           Q D   L  S
Sbjct: 203 LFRSWSTFINRSIILTPEADRTDKKDTTAFNSWKDIIIPGNVDDAMTKNGQPDVQPLPLS 262

Query: 167 KRNKLAFFAG-AVNSPVREKLLQVWRN--------DSEIYAHSGRLKTPYADELLGSKFC 217
           KR  LA + G A     R KL+ + +         D +        +T Y + L  +KFC
Sbjct: 263 KRKYLANYLGRAQGKAGRLKLIDLSKQFPDKLECPDLKFSGTEKFGRTTYFEHLRNAKFC 322

Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
           L  +G    T R  +S +  CVPV++++H +LPF +V+++   SI
Sbjct: 323 LAPRGESSWTLRFYESFFVECVPVLLSDHAELPFQNVIDYAQVSI 367


>gi|224075401|ref|XP_002304618.1| predicted protein [Populus trichocarpa]
 gi|222842050|gb|EEE79597.1| predicted protein [Populus trichocarpa]
          Length = 105

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 35/74 (47%), Positives = 52/74 (70%)

Query: 211 LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIP 270
           L G      ++G+EVNT+R++D+++YGC+PV+I+N+ DLPFADVL+W  FS+V+   DI 
Sbjct: 12  LRGENIASMLEGYEVNTSRVSDAIHYGCIPVVISNNRDLPFADVLDWSKFSVVINQRDIA 71

Query: 271 LLKKILKGISSEEY 284
            LK  L   + E Y
Sbjct: 72  FLKTKLLSRTRETY 85


>gi|21537341|gb|AAM61682.1| unknown [Arabidopsis thaliana]
          Length = 462

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 75/285 (26%), Positives = 126/285 (44%), Gaps = 31/285 (10%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPR------GNYASESYFKKAFMKSHFVTKDPSKAD 57
           S ++YVY     D    +L   D   +      G + S+    K  ++S F T    +AD
Sbjct: 88  SLKIYVYDENEIDGLKELLYGRDGSVKTTACLKGQWGSQVKIHKLLLESKFRTIKKDEAD 147

Query: 58  LFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSA 117
           LFF+P   A ++  R +G     +    Y+  +SQ  PY+ R+GG DH +V     G + 
Sbjct: 148 LFFVP---AYVKCVRMLGGLNDKEINQTYVKVLSQ-MPYFRRSGGRDHIFVFPSGAG-AH 202

Query: 118 MQKAWEVKLNAIQVVC-----SSSYFISGHIAHKDVSLP------QIWPRQEDPPKLGSS 166
           + ++W   +N   ++      +     +   + KD+ +P           Q D   L  S
Sbjct: 203 LFRSWSTFINRSIILTPEADRTDKKDTTAFNSWKDIIIPGNVDDAMTKNGQPDVQPLPLS 262

Query: 167 KRNKLAFFAG-AVNSPVREKLLQVWRN--------DSEIYAHSGRLKTPYADELLGSKFC 217
           KR  LA + G A     R KL+ + +         D +        +T Y + L  +KFC
Sbjct: 263 KRKYLANYLGRAQGKAGRLKLIDLSKQFPDKLECPDLKFSGTEKFGRTTYFEHLRNAKFC 322

Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
           L  +G    T R  +S +  CVPV++++H +LPF +V+++   SI
Sbjct: 323 LAPRGESSWTLRFYESFFVECVPVLLSDHAELPFQNVIDYAQVSI 367


>gi|18394994|ref|NP_564141.1| exostosin-like protein [Arabidopsis thaliana]
 gi|332191985|gb|AEE30106.1| exostosin-like protein [Arabidopsis thaliana]
          Length = 462

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 75/285 (26%), Positives = 126/285 (44%), Gaps = 31/285 (10%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPR------GNYASESYFKKAFMKSHFVTKDPSKAD 57
           S ++YVY     D    +L   D   +      G + S+    K  ++S F T    +AD
Sbjct: 88  SLKIYVYDENEIDGLKELLYGRDGSVKTTACLKGQWGSQVKIHKLLLESKFRTIKKDEAD 147

Query: 58  LFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSA 117
           LFF+P   A ++  R +G     +    Y+  +SQ  PY+ R+GG DH +V     G + 
Sbjct: 148 LFFVP---AYVKCVRMLGGLNDKEINQTYVKVLSQ-MPYFRRSGGRDHIFVFPSGAG-AH 202

Query: 118 MQKAWEVKLNAIQVVC-----SSSYFISGHIAHKDVSLP------QIWPRQEDPPKLGSS 166
           + ++W   +N   ++      +     +   + KD+ +P           Q D   L  S
Sbjct: 203 LFRSWSTFINRSIILTPEADRTDKKDTTAFNSWKDIIIPGNVDDAMTKNGQPDVQPLPLS 262

Query: 167 KRNKLAFFAG-AVNSPVREKLLQVWRN--------DSEIYAHSGRLKTPYADELLGSKFC 217
           KR  LA + G A     R KL+ + +         D +        +T Y + L  +KFC
Sbjct: 263 KRKYLANYLGRAQGKAGRLKLIDLSKQFPDKLECPDLKFSGTEKFGRTTYFEHLRNAKFC 322

Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
           L  +G    T R  +S +  CVPV++++H +LPF +V+++   SI
Sbjct: 323 LAPRGESSWTLRFYESFFVECVPVLLSDHAELPFQNVIDYAQVSI 367


>gi|357508931|ref|XP_003624754.1| Secondary cell wall-related glycosyltransferase family [Medicago
           truncatula]
 gi|355499769|gb|AES80972.1| Secondary cell wall-related glycosyltransferase family [Medicago
           truncatula]
          Length = 510

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 74/287 (25%), Positives = 137/287 (47%), Gaps = 34/287 (11%)

Query: 51  KDPSKADLFFLPF--SIARMRH------DRRIGTEGIPDFISHYIFNISQKYPYWNRTGG 102
           ++ ++AD+ F+PF  S+   RH      +RR   + + + +  Y+ N  +    W R+GG
Sbjct: 190 RNSTEADVIFVPFFSSLTYNRHSKTGPHERRSRNKVLQEKLVRYLMNQEE----WKRSGG 245

Query: 103 ADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLPQ---IWPRQ 157
            DH  +A H    ++M  A  +KL     + S       +IA+  KDV  P    I    
Sbjct: 246 RDHLILAHHP---NSMLDA-RMKLWPATFILSDFGRYPPNIANVDKDVIAPYKHVIASYV 301

Query: 158 EDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTP----YAD 209
           +D     S  R  L +F GA+        R++L  + + + +++   G ++        +
Sbjct: 302 DDQSTFDS--RKTLLYFQGAIYRKDGGYARQELFYLLKEEKDVHFSFGSVQKGGVRNATN 359

Query: 210 ELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI 269
            +  SKFCL++ G   ++ R+ D++   CVPVII++  +LP+ DVL++  F + V T D 
Sbjct: 360 GMRSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDEIELPYEDVLDYSKFCVFVRTRDA 419

Query: 270 ---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
                L   ++ I  +E+  + + + +V K F++     + DA  M+
Sbjct: 420 VKKKYLINFIRSIGKDEWTRMWNRLKEVEKFFEFQFPSKEGDAVEMI 466


>gi|18650643|gb|AAL75891.1| At1g21480/F24J8_23 [Arabidopsis thaliana]
          Length = 462

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 75/285 (26%), Positives = 126/285 (44%), Gaps = 31/285 (10%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPR------GNYASESYFKKAFMKSHFVTKDPSKAD 57
           S ++YVY     D    +L   D   +      G + S+    K  ++S F T    +AD
Sbjct: 88  SLKIYVYDENEIDGLKELLYGRDGSVKTTACLKGQWGSQVKIHKLLLESKFRTIKKDEAD 147

Query: 58  LFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSA 117
           LFF+P   A ++  R +G     +    Y+  +SQ  PY+ R+GG DH +V     G + 
Sbjct: 148 LFFVP---AYVKCVRMLGGLNDKEINQTYVKVLSQ-MPYFRRSGGRDHIFVFPSGAG-AH 202

Query: 118 MQKAWEVKLNAIQVVC-----SSSYFISGHIAHKDVSLP------QIWPRQEDPPKLGSS 166
           + ++W   +N   ++      +     +   + KD+ +P           Q D   L  S
Sbjct: 203 LFRSWSTFINRSIILTPEADRTDKKDTTAFNSWKDIIIPGNVDDAMTKNGQPDVQPLPLS 262

Query: 167 KRNKLAFFAG-AVNSPVREKLLQVWRN--------DSEIYAHSGRLKTPYADELLGSKFC 217
           KR  LA + G A     R KL+ + +         D +        +T Y + L  +KFC
Sbjct: 263 KRKYLANYLGRAQGKAGRLKLIDLSKQFPDKLECPDLKFSGTEKFGRTTYFEHLRNAKFC 322

Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
           L  +G    T R  +S +  CVPV++++H +LPF +V+++   SI
Sbjct: 323 LAPRGESSWTLRFYESFFVECVPVLLSDHAELPFQNVIDYAQVSI 367


>gi|359472749|ref|XP_002276440.2| PREDICTED: probable glycosyltransferase At5g11130-like [Vitis
           vinifera]
 gi|147815974|emb|CAN68074.1| hypothetical protein VITISV_007510 [Vitis vinifera]
 gi|297737986|emb|CBI27187.3| unnamed protein product [Vitis vinifera]
          Length = 461

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 67/270 (24%), Positives = 120/270 (44%), Gaps = 23/270 (8%)

Query: 52  DPSKADLFFLPF--------SIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGA 103
           DP +ADLF++ F        +  R  +    GT    + +   +    ++  YW R  G 
Sbjct: 141 DPEEADLFYVSFFSSLSLVVNPIRPANGEGAGTGYSDEEMQESLMEWLEQQEYWKRNNGR 200

Query: 104 DHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-KDVSLPQIWPRQEDPPK 162
           DH ++ C     +A+    +   N + +V       S   +  KDV LP     +    +
Sbjct: 201 DHVFI-CQD--PNALHLIVDRVKNGVLLVSDFGRLRSDTASLVKDVILPYAHRIKSYSGE 257

Query: 163 LGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIY----AHSGRLKTPYADELLGS 214
           +G   R  L FF G         +R+ L Q+   + ++     A S   +   +  +  S
Sbjct: 258 IGVENRKSLLFFMGNRYRKEGGKIRDLLFQILEQEEDVIIKHGAQSRESRRMASQGMHSS 317

Query: 215 KFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV---ATLDIPL 271
           KFCLH  G   +  R+ D++   CVPVI+++  +LPF DV++++  +I V   + +    
Sbjct: 318 KFCLHPAGDTPSACRLFDAIVSLCVPVIVSDQIELPFEDVIDYRKIAIFVDSTSAVKPGF 377

Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
           L K L+ I+ E  L  Q  + +V ++F++ 
Sbjct: 378 LVKNLRKITRERILEYQREMQEVTRYFEYE 407


>gi|326511980|dbj|BAJ95971.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 784

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 85/336 (25%), Positives = 140/336 (41%), Gaps = 72/336 (21%)

Query: 30  RGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIA------------RMRHDRRIGTE 77
           R  Y ++    ++ + S   T +  +AD F++P   +            RM  D R+ + 
Sbjct: 379 RQLYGAQMALYESILASPHRTLNGDEADYFYVPVLDSCLITRSDDAPHLRMPEDLRLRSY 438

Query: 78  GIPDFISHYIFNISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNA 128
              ++      +I+Q+YPYWNRT G DH +       AC++   I  S M   W    N 
Sbjct: 439 HTLEYYRKAYDHIAQRYPYWNRTSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWG-NTNT 497

Query: 129 IQVVCSSSYFI-----------SGHIA---HKDVSLPQ--------IWPRQEDPPKLGSS 166
                +++Y+              H      KD+ LP         IW +    PK+   
Sbjct: 498 KHEKSTTAYWADNWDDIPFDRRGNHPCFDPRKDLVLPAWKEPNPGAIWLKLWARPKI--- 554

Query: 167 KRNKLAFFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGRLKTP------- 206
            R  L +F G +                +R+KL   + +        GR +T        
Sbjct: 555 NRTTLFYFNGNLGPAYEEGRPEDTYSMGIRQKLAAEFGSTPNKQGKLGRQQTANVTVTYL 614

Query: 207 ----YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
               Y +EL  S FC  + G +  + R+ DS+  GC+PVII +   LP+ +VLN+ SFS+
Sbjct: 615 KSEMYYEELASSIFCGVLPG-DGWSGRMEDSMLQGCIPVIIQDGIFLPYENVLNYNSFSV 673

Query: 263 VVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
            +   DIP L K+L+G++  +   +   V +V + F
Sbjct: 674 RIQEDDIPNLIKVLQGLNGTQIDFMLGNVRQVWQRF 709


>gi|297850568|ref|XP_002893165.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297339007|gb|EFH69424.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 462

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 75/285 (26%), Positives = 125/285 (43%), Gaps = 31/285 (10%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPR------GNYASESYFKKAFMKSHFVTKDPSKAD 57
           S ++YVY     D    +L   D   +      G + S+    K  ++S F T    +AD
Sbjct: 88  SLKIYVYDENEIDGLKELLYGRDGSVKTTACLKGQWGSQVKIHKLLLESKFRTIKKDEAD 147

Query: 58  LFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSA 117
           LFF+P   A ++  R +G     +    Y+  +SQ  PY+ R+GG DH +V     G + 
Sbjct: 148 LFFVP---AYVKCVRMLGGLNDKEINQTYVKVLSQ-MPYFRRSGGRDHIFVFPSGAG-AH 202

Query: 118 MQKAWEVKLNAIQVVC-----SSSYFISGHIAHKDVSLP------QIWPRQEDPPKLGSS 166
           + ++W   +N   ++      +     +     KD+ +P           Q D   L  S
Sbjct: 203 LFRSWSTFINRSIILTPEADRTDKKDTTAFNTWKDIIIPGNVDDAMTKNGQPDVQPLPLS 262

Query: 167 KRNKLAFFAG-AVNSPVREKLLQVWRN--------DSEIYAHSGRLKTPYADELLGSKFC 217
           KR  LA + G A     R KL+ + +         D +        +T Y + L  +KFC
Sbjct: 263 KRKYLANYLGRAQGKAGRLKLIDLSKQYPDKLECPDLKFSGTEKFGRTTYFEHLRNAKFC 322

Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
           L  +G    T R  +S +  CVPV++++H +LPF +V+++   SI
Sbjct: 323 LAPRGESSWTLRFYESFFVECVPVLLSDHAELPFQNVIDYAQVSI 367


>gi|159470363|ref|XP_001693329.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158277587|gb|EDP03355.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 626

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 81/316 (25%), Positives = 140/316 (44%), Gaps = 61/316 (19%)

Query: 33  YASESYFKKAFMKSHFV-TKDPSKADLFFLP----FSIARMRHDRRIGTEGIPDFISHYI 87
           Y++   F + F++ +   T++P +A+LF++P    F I  +R       + +P   +  I
Sbjct: 271 YSAYELFMRYFLQDNVTRTENPWEANLFYVPMLLYFYIGNVR-------DAVPQ-TAWAI 322

Query: 88  FNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVV-----CSSSYFISGH 142
            ++  ++P+W+R+GG DHFY      G   + +  E++  AI+VV      + + +I   
Sbjct: 323 NHVRSRWPFWDRSGGRDHFYFMTGDRGTCHLPR--ELQDQAIKVVHWGMQVAGTDWIG-- 378

Query: 143 IAHKDVSLPQIWPRQEDPP---------------------KLGSS-KRNKLAFFAGAV-- 178
           + +KD +  Q+      PP                       G    R  L FFAG +  
Sbjct: 379 LDNKDYACIQLKRDLVVPPINMFAEILPTDTVKHYQTVVANGGQDFGRTLLFFFAGGIAQ 438

Query: 179 --------NSPVREKLLQVW-----RNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEV 225
                      ++E L  V         +++    GR +  Y   LL SKFC+   GF  
Sbjct: 439 SMEYSGGTRQAIKELLTSVHIANGNSTPADVVFVEGRTQE-YKKLLLTSKFCIAPYGFGW 497

Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
              R+  ++ +GC+PVII +H    F D L ++ FS+ +   D+P L  IL+  S E+  
Sbjct: 498 GL-RLVQAIEFGCIPVIIQDHVYQAFEDFLPYEEFSVRLPLRDVPRLLDILRSYSPEQQA 556

Query: 286 LLQSYVLKVRKHFQWH 301
            L+  + K  + F WH
Sbjct: 557 ALRLGMAKYYRAFVWH 572


>gi|356519776|ref|XP_003528545.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
           [Glycine max]
          Length = 452

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 71/279 (25%), Positives = 129/279 (46%), Gaps = 33/279 (11%)

Query: 49  VTKDPSKADLFFLPF--SIARMRHDRR-----IGTEG---IPDFISHYIFNISQKYPYWN 98
           +  DP +ADLFF+PF  S++ + +  R      G+E      +     +    +K  YW 
Sbjct: 131 LVADPEEADLFFVPFFSSLSLIVNPVRPPGSNSGSEKPVYSDEENQEALVEWLEKQEYWK 190

Query: 99  RTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYF--ISGHIAHKDVSLP---QI 153
           R  G DH  VA      +AM +  +   NA+ +V          G +  KDV +P   +I
Sbjct: 191 RNSGRDHVIVASDP---NAMYRVIDRVRNAVLLVSDFGRLRPDQGSLV-KDVVVPYSHRI 246

Query: 154 WPRQEDPPKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIY----AHSGRLKT 205
              Q D    G   RN L FF G         +R+ L ++  N+ ++     A S   + 
Sbjct: 247 RTYQGDA---GVEDRNTLLFFMGNRYRKEGGKIRDILFKILENEKDVIIKHGAQSRESRR 303

Query: 206 PYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVA 265
             +  +  SKFCLH  G   +  R+ D++   C+PVI++++ +LPF D ++++  ++ + 
Sbjct: 304 AASQGMHTSKFCLHPAGDTPSACRLFDAIVSLCIPVIVSDNIELPFEDTIDYRKLAVFIE 363

Query: 266 T---LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
           T   +    L   L+ ++ +  L  Q  + +V+++F++ 
Sbjct: 364 TSSAIKPGYLVSKLRALTPDRVLAYQKELKEVKRYFEYE 402


>gi|168004467|ref|XP_001754933.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162694037|gb|EDQ80387.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 378

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 72/296 (24%), Positives = 133/296 (44%), Gaps = 40/296 (13%)

Query: 50  TKDPSKADLFFLPF-------------SIARMRHDRRIGTEGIPDFISHYIFNISQKYPY 96
            KDP +A++FF+PF             +      DR +  EG+ + +S+          +
Sbjct: 68  VKDPEQAEVFFVPFFASLSFNSFGRNMAAPNAAKDREL-QEGVVEMLSNS--------KW 118

Query: 97  WNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLP--QI 153
           W ++ G DH  V  H    +A +   ++   ++ +V     Y  +     KD+  P   +
Sbjct: 119 WQKSQGRDHIIVIHHP---NAFRYYRDMMNQSMFIVADFGRYNQTVARLKKDIVAPYAHV 175

Query: 154 WPR-QEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPY- 207
            P   ED P    S R  L FF G V    +  +R KL ++  N +++Y      +T   
Sbjct: 176 VPSYNEDNPSDPFSARKTLLFFQGRVRRKADGVIRAKLGKLLMNQTDVYYEDSLARTEAI 235

Query: 208 ---ADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV 264
                 +  S+FCLH  G   ++ R+ D++   CVPVI+++  +LPF D L++  FSI  
Sbjct: 236 AMSTQGMRFSRFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDRIELPFEDDLDYSEFSIFF 295

Query: 265 ATLDIPL---LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
           +  +  +   L   L+ I+ E +L + + +  +  HF++     + DA  ++  ++
Sbjct: 296 SAKEAIIPGHLLGTLRSITRERWLQMWNKLKAISHHFEYQNPSKEDDAVNLIFKQV 351


>gi|449462180|ref|XP_004148819.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase F8H-like
           [Cucumis sativus]
 gi|449524512|ref|XP_004169266.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase F8H-like
           [Cucumis sativus]
          Length = 458

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 84/328 (25%), Positives = 150/328 (45%), Gaps = 30/328 (9%)

Query: 30  RGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFN 89
           +G + ++    +  ++S F T++  +AD FF+P   A ++  R +G     +    YI  
Sbjct: 116 KGQWGTQVKIHRLLLQSRFRTRNKEEADFFFVP---AYVKCVRMLGGLNDKEINEAYIQV 172

Query: 90  ISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVC-----SSSYFISGHIA 144
           + Q  PY+  +GG DH +V     G + + K+W   +N   ++      +     S    
Sbjct: 173 LGQ-MPYFRLSGGRDHIFVFPSGAG-AHLFKSWATYINRSIILTPEGDRTDKKDFSAFNT 230

Query: 145 HKDVSLP------QIWPRQEDPPKLGSSKRNKLAFFAGAVNSPV-REKLLQVWRN----- 192
            KD+ +P         P  +    L  SKR  LA + G     V R KL+++ +      
Sbjct: 231 WKDIIIPGNVDDGMTSPGAKIVQPLPLSKRKHLANYLGRDQGKVGRLKLIELAKQFPEKL 290

Query: 193 DSEIYAHSG--RL-KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
           +S +   SG  +L K  Y + L  +KFCL  +G    T R  +S +  CVPV++++  +L
Sbjct: 291 ESPVLKFSGPDKLGKLEYFEHLRNAKFCLAPRGESSWTLRFYESFFVECVPVVLSDQVEL 350

Query: 250 PFADVLNWKSFSIVVATLDI-PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYD 308
           PF +V+++   SI   + +I P L + L+ I  E    +   + + R+     V+ SD +
Sbjct: 351 PFQNVIDYSQISIKWPSSEIGPQLLEYLESIPDET---IDKMIARGRRVRCLWVYASDSE 407

Query: 309 AFYMVMYELWLRRSSVRVQWSTSLDSNW 336
               +   LW  +  VR Q+  S ++ W
Sbjct: 408 PCSTMQGILWELQRKVR-QFHQSTETFW 434


>gi|302852377|ref|XP_002957709.1| hypothetical protein VOLCADRAFT_119761 [Volvox carteri f.
           nagariensis]
 gi|300257003|gb|EFJ41258.1| hypothetical protein VOLCADRAFT_119761 [Volvox carteri f.
           nagariensis]
          Length = 1481

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 85/354 (24%), Positives = 151/354 (42%), Gaps = 64/354 (18%)

Query: 12  HRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHD 71
           HRR+D   N  +  D      YA+++   +  + S   T DP +AD F++P S + +   
Sbjct: 332 HRRHDLPGNQTVWSDGW---VYAADTLLHELLLISEHRTFDPEEADFFYVPHSASCLPFP 388

Query: 72  RR--------IGTEG-----IPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAM 118
                     +G  G     + + +   +  I + YP+W R GG DH ++  H  G    
Sbjct: 389 MGSWADYPWFLGPGGPRIRQMVNMLREVVDWIDKTYPFWRRRGGRDHIWLFTHDEGACWA 448

Query: 119 QKAWE------------VKLNAIQVVCSSSY---FISGH-----IAH----------KDV 148
            K  E            ++  +     +  Y   F+S H     + H          KD+
Sbjct: 449 PKVLENSTWLTHWGRMGLEHRSGTAFLADKYDIDFVSPHQPEGFLTHIKGHPCYDSTKDL 508

Query: 149 SLPQI-WPRQ-EDPPKLGSS--KRNKLAFFAGAVNS--------PVREKLLQV-----WR 191
            +P    PR     P LGS+  +R+   FF G V           VR+KL ++     W+
Sbjct: 509 VVPAFKQPRHYRSSPLLGSATKQRDIFLFFRGDVGKHRMAHYSRGVRQKLYKLSVENNWK 568

Query: 192 NDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
           + + +   +  ++  Y+D L  S+FCL   G +  +AR+ D++ +GC+PVI+ +   + F
Sbjct: 569 SKNVLIGGTHEVRGEYSDLLSRSQFCLVAAG-DGWSARLEDAVLHGCIPVIVIDEVHVVF 627

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPS 305
             +LN  SF++ +    +P +  IL  I   +    Q+++  V   F++   P 
Sbjct: 628 ESILNVDSFAVRIDEQQLPQILDILAAIPERKIRAKQAHLGHVWHRFRYGSLPG 681



 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 66/292 (22%), Positives = 116/292 (39%), Gaps = 29/292 (9%)

Query: 33   YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARM----------RHDRRIGTEGIPDF 82
            Y +ES   +  + S   T DP +AD F++P+    M           +    G   I   
Sbjct: 1043 YGAESALHEYLLLSEHRTFDPEEADFFYVPYYGTCMIWPVLHWADFPYFHTTGGPRILQV 1102

Query: 83   ISHYIFN---ISQKYPYWNRTGGADHFYVACHSIGRS-----AMQKAWEVKLNAIQVVCS 134
            I+  I     I++ YP+W R GG DH ++  H  G        +   W        ++  
Sbjct: 1103 INMLIDTVDWINKMYPFWGRRGGRDHIFLFPHDEGACWAPNVLVNATWLTHWGRTDMIHE 1162

Query: 135  SSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDS 194
            S          +D     +  RQ  P    +  R    +    +    +E     W++  
Sbjct: 1163 SKTSFDADNYTRDY----VGWRQ--PGGFVNLIRGHPCYDPVKIYRLAKEN---NWQDKH 1213

Query: 195  EIY-AHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFAD 253
             I    +  +   Y+D L  S FCL   G +  +AR  D++ +GC+PVII +   + F  
Sbjct: 1214 NILIGDAADVPGDYSDLLSRSLFCLVATG-DGWSARTEDAVLHGCIPVIIIDGVHIKFET 1272

Query: 254  VLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPS 305
            V +   FSI +   +   + +ILK I   +   +Q+++ +V   +++   P 
Sbjct: 1273 VFSVDEFSIRIPEANASRILEILKEIPKTKIRSIQAHLGRVWHRYRYANLPG 1324


>gi|168060641|ref|XP_001782303.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666233|gb|EDQ52893.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 426

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 64/237 (27%), Positives = 112/237 (47%), Gaps = 23/237 (9%)

Query: 95  PYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLP- 151
           P W  + G++H  V  H    +AM    E K  ++  V +        +A+  KDV  P 
Sbjct: 162 PAWRASEGSNHVVVIHHP---NAMLHTRE-KFRSVMFVVADFGRYGAEVANMAKDVVAPY 217

Query: 152 -QIWPR--QEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGR-- 202
             + P   ++    L    R  L FF GA+       +R++L ++   +  I   +G   
Sbjct: 218 KHVIPNFDEDVDAALSFKSRTTLLFFQGAIARKEGGIIRQQLYELLGEEPNIIFSNGTTS 277

Query: 203 ---LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS 259
              +++  A  +  SKFCLH+ G   ++ R+ D++   CVP+II+N  +LPF DVLN+  
Sbjct: 278 NAGIRSATAG-MRQSKFCLHLAGDTPSSNRLFDAVASHCVPLIISNEIELPFEDVLNYSE 336

Query: 260 FSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
           FS+ V + D      +  +L  +  +E+  +   + +V +HFQ+ +     DA +M 
Sbjct: 337 FSLFVNSSDALRKGFVTDLLSNVGEKEWTRMHDRLRQVERHFQYQLPAQIGDAVHMT 393


>gi|302812737|ref|XP_002988055.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
 gi|300144161|gb|EFJ10847.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
          Length = 459

 Score = 81.3 bits (199), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 76/286 (26%), Positives = 131/286 (45%), Gaps = 30/286 (10%)

Query: 52  DPSKADLFFLPF--SIARMRHDRR----IGTEGI--PDFISHYIFNISQKYPYWNRTGGA 103
           DP+ AD+FF+PF  S++  R+ R      G  G    D +   +    +    W R GG 
Sbjct: 156 DPATADIFFVPFFSSLSYNRYCRTGHRFQGGRGCVENDRLEKRLVEFLRGQELWRRNGGV 215

Query: 104 DHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLP--QIWPR-QE 158
           DH  V  H    +++  A  +   A+ VV     F S  +A+  KD+  P   + P    
Sbjct: 216 DHVIVMHHP---NSLMVARSLLKEAMFVVADFGRF-SRAVANMRKDIVAPYKHVIPSFAR 271

Query: 159 DPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKT----PYADE 210
           D     S  R  L FF GA+       +R+KL ++ ++   ++  +G  +          
Sbjct: 272 DATTFES--RETLLFFQGAIVRKEGGIIRQKLYEILKDSPGVHFVTGNTQKDGIRSATAG 329

Query: 211 LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT---L 267
           +  +KFCLH+ G   ++ R+ D++   CVPVII++  +LPF D L++  F + V +   L
Sbjct: 330 MRNAKFCLHLAGDTPSSNRLFDAIASHCVPVIISDEIELPFEDELDYSQFCVFVESDKAL 389

Query: 268 DIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
               + + L+ I  +E+    + +  V +HF++       DA +M 
Sbjct: 390 RKGFVVRALERIGRDEWTRKWAMLKSVERHFEYQHPSLPEDAVHMT 435


>gi|302773760|ref|XP_002970297.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
 gi|300161813|gb|EFJ28427.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
          Length = 437

 Score = 81.3 bits (199), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 76/287 (26%), Positives = 129/287 (44%), Gaps = 45/287 (15%)

Query: 52  DPSKADLFFLPFSIA-------------RMRHDRRIGTEGIPDFISHYIFNISQKYPYWN 98
           DP +AD+FF+P+  +                HD+++   G+ +++S        K P++ 
Sbjct: 128 DPGEADVFFVPYFASLSFNVFGVSMRDPETEHDKKLQV-GMIEYLS--------KSPWYQ 178

Query: 99  RTGGADHFYVACHSIGRSAMQKAWEVKLN-AIQVVCSSSYFISGHIA-HKDVSLP---QI 153
           R+GG DH  V  H      ++     +LN ++ VV     F  G  A HKDV  P    +
Sbjct: 179 RSGGRDHVLVLHHPNAFRFLKD----RLNLSLLVVADFGRFPKGVAALHKDVVAPYSHMV 234

Query: 154 WPRQEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYAD 209
                D       +R  L FF G V    +  VR +L  +  N   ++   G + T +  
Sbjct: 235 PTYNGDDGTDPFEERTTLLFFQGRVKRKDDGVVRTQLAAILENQPRVHFEEG-IATNFTV 293

Query: 210 E-----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI-- 262
           E     +  S+FCLH  G   ++ R+ D++   CVPVI+++  +LPF D L++  FS+  
Sbjct: 294 EQAMQGMRSSRFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDKIELPFEDELDYSEFSLFF 353

Query: 263 -VVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYD 308
            V   +    L   L+  S   ++ +   + +V +HF++   PS  D
Sbjct: 354 SVDEAVRPGFLLGALEKFSKRRWMKMWRRLKQVTRHFEYQ-HPSQRD 399


>gi|302793388|ref|XP_002978459.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
 gi|300153808|gb|EFJ20445.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
          Length = 437

 Score = 81.3 bits (199), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 76/287 (26%), Positives = 129/287 (44%), Gaps = 45/287 (15%)

Query: 52  DPSKADLFFLPFSIA-------------RMRHDRRIGTEGIPDFISHYIFNISQKYPYWN 98
           DP +AD+FF+P+  +                HD+++   G+ +++S        K P++ 
Sbjct: 128 DPGEADVFFVPYFASLSFNVFGVSMRDPETEHDKKLQV-GMIEYLS--------KSPWYQ 178

Query: 99  RTGGADHFYVACHSIGRSAMQKAWEVKLNA-IQVVCSSSYFISGHIA-HKDVSLP---QI 153
           R+GG DH  V  H      ++     +LN+ + VV     F  G  A HKDV  P    +
Sbjct: 179 RSGGRDHVLVLHHPNAFRFLKD----RLNSSLLVVADFGRFPKGVAALHKDVVAPYSHMV 234

Query: 154 WPRQEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYAD 209
                D       +R  L FF G V    +  VR +L  +  N   ++   G + T +  
Sbjct: 235 PTYNGDDGSDPFEERTTLLFFQGRVKRKDDGVVRTQLAAILENQPRVHFEEG-IATNFTV 293

Query: 210 E-----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI-- 262
           E     +  S+FCLH  G   ++ R+ D++   CVPVI+++  +LPF D L++  FS+  
Sbjct: 294 EQAMQGMRSSRFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDKIELPFEDELDYSEFSLFF 353

Query: 263 -VVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYD 308
            V   +    L   L+  S   ++ +   + +V +HF++   PS  D
Sbjct: 354 SVDEAVRPGFLLGALEKFSKRRWMKMWRRLKQVTRHFEYQ-HPSQRD 399


>gi|5882750|gb|AAD55303.1|AC008263_34 F25A4.34 [Arabidopsis thaliana]
 gi|12324821|gb|AAG52383.1|AC011765_35 unknown protein; 115857-117304 [Arabidopsis thaliana]
          Length = 458

 Score = 81.3 bits (199), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 74/283 (26%), Positives = 135/283 (47%), Gaps = 25/283 (8%)

Query: 51  KDPSKADLFFLPF--SIARMRHDRRIGTEGIPD--FISHYIFNISQKYPYWNRTGGADHF 106
           K+ ++AD+ F+PF  S++  R  +  G E   D   +   +    +    W R  G DH 
Sbjct: 148 KNSNEADIVFVPFFASLSYNRKSKLRGNETSSDDRLLQERLVEFLKSQDEWKRFDGKDHL 207

Query: 107 YVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLPQIWPRQEDPPKLG 164
            VA H    +++  A    L +   V S     S  IA+  KD+  P +   +       
Sbjct: 208 IVAHHP---NSLLYARNF-LGSAMFVLSDFGRYSSAIANLEKDIIAPYVHVVKTISNNES 263

Query: 165 SS--KRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGS 214
           +S  KR  LA+F GA+       +R++L  + +++ +++   G ++     +    +  S
Sbjct: 264 ASFEKRPVLAYFQGAIYRKDGGTIRQELYNLLKDEKDVHFAFGTVRGNGTKQTGKGMASS 323

Query: 215 KFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV---ATLDIPL 271
           KFCL++ G   ++ R+ D++   CVPVII++  +LPF D L++  FS+ V     +    
Sbjct: 324 KFCLNIAGDTPSSNRLFDAIVSHCVPVIISDQIELPFEDTLDYSGFSVFVHASEAVKKEF 383

Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY-DAFYMV 313
           L  IL+GI+ +++      + +V   F++  FPS   D+  M+
Sbjct: 384 LVNILRGITEDQWKKKWGRLKEVAGCFEYR-FPSQVGDSVNMI 425


>gi|18410670|ref|NP_565089.1| exostosin-like protein [Arabidopsis thaliana]
 gi|16209709|gb|AAL14411.1| At1g74680/F1M20_36 [Arabidopsis thaliana]
 gi|27363220|gb|AAO11529.1| At1g74680/F1M20_36 [Arabidopsis thaliana]
 gi|332197500|gb|AEE35621.1| exostosin-like protein [Arabidopsis thaliana]
          Length = 461

 Score = 81.3 bits (199), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 74/283 (26%), Positives = 135/283 (47%), Gaps = 25/283 (8%)

Query: 51  KDPSKADLFFLPF--SIARMRHDRRIGTEGIPD--FISHYIFNISQKYPYWNRTGGADHF 106
           K+ ++AD+ F+PF  S++  R  +  G E   D   +   +    +    W R  G DH 
Sbjct: 151 KNSNEADIVFVPFFASLSYNRKSKLRGNETSSDDRLLQERLVEFLKSQDEWKRFDGKDHL 210

Query: 107 YVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLPQIWPRQEDPPKLG 164
            VA H    +++  A    L +   V S     S  IA+  KD+  P +   +       
Sbjct: 211 IVAHHP---NSLLYARNF-LGSAMFVLSDFGRYSSAIANLEKDIIAPYVHVVKTISNNES 266

Query: 165 SS--KRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGS 214
           +S  KR  LA+F GA+       +R++L  + +++ +++   G ++     +    +  S
Sbjct: 267 ASFEKRPVLAYFQGAIYRKDGGTIRQELYNLLKDEKDVHFAFGTVRGNGTKQTGKGMASS 326

Query: 215 KFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV---ATLDIPL 271
           KFCL++ G   ++ R+ D++   CVPVII++  +LPF D L++  FS+ V     +    
Sbjct: 327 KFCLNIAGDTPSSNRLFDAIVSHCVPVIISDQIELPFEDTLDYSGFSVFVHASEAVKKEF 386

Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY-DAFYMV 313
           L  IL+GI+ +++      + +V   F++  FPS   D+  M+
Sbjct: 387 LVNILRGITEDQWKKKWGRLKEVAGCFEYR-FPSQVGDSVNMI 428


>gi|255570432|ref|XP_002526175.1| catalytic, putative [Ricinus communis]
 gi|223534552|gb|EEF36251.1| catalytic, putative [Ricinus communis]
          Length = 453

 Score = 80.9 bits (198), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 86/345 (24%), Positives = 149/345 (43%), Gaps = 40/345 (11%)

Query: 4   SFRVYVYPHRRNDPFANVL------LPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKAD 57
           S ++YVY  +  D    +L      +  D   +G + ++    +    S + T+   +AD
Sbjct: 79  SMKIYVYEEKEIDGLKELLRGRDGKISADTCVKGQWGTQVKIHRLLQNSRYRTRKKEEAD 138

Query: 58  LFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSA 117
           LFF+P   A ++  R +G     +    Y+  +SQ  PY+ R+GG DH +V     G + 
Sbjct: 139 LFFVP---AYVKCVRMLGGLNDKEINLTYVKVLSQ-MPYFRRSGGRDHIFVFPSGAG-AH 193

Query: 118 MQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW-----PRQEDP--PKLGS----- 165
           + ++W   +N   ++             KD S    W     P   D    K+G+     
Sbjct: 194 LFRSWATYINRSVILTPEG----DRTDKKDTSAFNTWKDIIIPGNVDDGMTKIGTTIVKP 249

Query: 166 ---SKRNKLAFFAGAVNSPV-REKLLQVWRN-----DSEIYAHSGRLK---TPYADELLG 213
              SKR  LA + G     V R KL+++ +      +      SG  K     Y + L  
Sbjct: 250 LPLSKRKFLANYLGRAQGKVGRLKLIELAKQYPDKLECPELKFSGPEKFGKMEYFEHLRN 309

Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI-PLL 272
           +KFCL  +G    T R  +S +  CVPV++++  +LPF +V+++   SI   +  I P L
Sbjct: 310 AKFCLAPRGESSWTLRFYESFFVECVPVLLSDQAELPFQNVIDYTHVSIKWPSTKIGPEL 369

Query: 273 KKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
            + L+ I  E+   + +   +VR  + +        A   +M+EL
Sbjct: 370 LEYLESIPDEDIERMIANGRQVRCLWVYAPESEQCSAMQGIMWEL 414


>gi|223943427|gb|ACN25797.1| unknown [Zea mays]
          Length = 380

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 82/332 (24%), Positives = 143/332 (43%), Gaps = 62/332 (18%)

Query: 43  FMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQ---------- 92
            +KS    +   +AD+F++PF                   IS+++    +          
Sbjct: 55  LLKSVIRVQQQEEADIFYVPFFTT----------------ISYFLLEKQECKALYREALK 98

Query: 93  ---KYPYWNRTGGADHFYVACH-----SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI- 143
                P W R+ G DH     H     S+ RS  +  W +      +  + +++  G + 
Sbjct: 99  WVTDQPAWQRSEGRDHVIPVHHPWSFKSVRRSVKKAIWLLP----DMDSTGNWYKPGQVY 154

Query: 144 AHKDVSLPQIWPRQEDPPKL---GSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEI 196
             KDV LP +        K      SKR+ L FF G +       +R KL++  ++  +I
Sbjct: 155 LEKDVILPYVPNVDLCDHKCVLETQSKRSILLFFRGRLKRNAGGKIRSKLVEELKSAKDI 214

Query: 197 YAHSGRL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFA 252
               G      K    D +  S FCL   G   ++AR+ D++  GC+PVII++  +LPF 
Sbjct: 215 VIEEGSTGAQGKAAAQDGMRKSFFCLSPAGDTPSSARLFDAIVSGCIPVIISDELELPFE 274

Query: 253 DVLNWKSFSIVVATLDIPL---LKKILKGISSEEYLLLQSYVLKVRKHFQWH-----VFP 304
            +L+++  ++ V+  D      L K L+GI+++    +QS ++K  +HF +      + P
Sbjct: 275 GILDYREIALFVSASDAVQPGWLLKYLRGINAKRIREIQSNLVKYSRHFLYSSPAQPLGP 334

Query: 305 SDYD----AFYMVMYELWLRRSSVRVQWSTSL 332
            D      A  +V  +L +RRS   V+ S S+
Sbjct: 335 EDLTWRMIAGKLVNIKLQIRRSQRLVKESRSI 366


>gi|226496701|ref|NP_001141657.1| uncharacterized protein LOC100273782 [Zea mays]
 gi|194705444|gb|ACF86806.1| unknown [Zea mays]
          Length = 497

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 82/332 (24%), Positives = 143/332 (43%), Gaps = 62/332 (18%)

Query: 43  FMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQ---------- 92
            +KS    +   +AD+F++PF                   IS+++    +          
Sbjct: 172 LLKSVIRVQQQEEADIFYVPFFTT----------------ISYFLLEKQECKALYREALK 215

Query: 93  ---KYPYWNRTGGADHFYVACH-----SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI- 143
                P W R+ G DH     H     S+ RS  +  W +      +  + +++  G + 
Sbjct: 216 WVTDQPAWQRSEGRDHVIPVHHPWSFKSVRRSVKKAIWLLP----DMDSTGNWYKPGQVY 271

Query: 144 AHKDVSLPQIWPRQEDPPKL---GSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEI 196
             KDV LP +        K      SKR+ L FF G +       +R KL++  ++  +I
Sbjct: 272 LEKDVILPYVPNVDLCDHKCVLETQSKRSILLFFRGRLKRNAGGKIRSKLVEELKSAKDI 331

Query: 197 YAHSGRL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFA 252
               G      K    D +  S FCL   G   ++AR+ D++  GC+PVII++  +LPF 
Sbjct: 332 VIEEGSTGAQGKAAAQDGMRKSFFCLSPAGDTPSSARLFDAIVTGCIPVIISDELELPFE 391

Query: 253 DVLNWKSFSIVVATLDIPL---LKKILKGISSEEYLLLQSYVLKVRKHFQWH-----VFP 304
            +L+++  ++ V+  D      L K L+GI+++    +QS ++K  +HF +      + P
Sbjct: 392 GILDYREIALFVSASDAVQPGWLLKYLRGINAKRIREIQSNLVKYSRHFLYSSPAQPLGP 451

Query: 305 SDYD----AFYMVMYELWLRRSSVRVQWSTSL 332
            D      A  +V  +L +RRS   V+ S S+
Sbjct: 452 EDLTWRMIAGKLVNIKLQIRRSQRLVKESRSI 483


>gi|224062946|ref|XP_002300942.1| predicted protein [Populus trichocarpa]
 gi|222842668|gb|EEE80215.1| predicted protein [Populus trichocarpa]
          Length = 460

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 89/365 (24%), Positives = 157/365 (43%), Gaps = 46/365 (12%)

Query: 4   SFRVYVYPHRRNDPFANVL------LPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKAD 57
           S ++YVY     D    +L      +  D   +G + ++       ++S F T+   +AD
Sbjct: 86  SLKIYVYEEDEIDGLKELLRGRDGKISADACLKGQWGTQVKIHGLLLESRFRTRKKEEAD 145

Query: 58  LFFLPFSIARMRHDRRIGTEGIPD-FISHYIFNISQKYPYWNRTGGADHFYVACHSIGRS 116
           LFF+P   A ++  R +G  G+ D  I+H    +  + PY+ R+GG DH +V     G +
Sbjct: 146 LFFVP---AYVKCVRMMG--GLNDKEINHTYVKVLSQMPYFRRSGGRDHIFVFPSGAG-A 199

Query: 117 AMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPK-------------- 162
            + ++W   +N   ++ + +         KD S    W     P                
Sbjct: 200 HLFRSWATYINRSIILTTEA----DRTDKKDTSAFNTWKDIIIPGNVEDGMTKRRIAMVQ 255

Query: 163 -LGSSKRNKLAFFAGAVNSPV-REKLLQVWRN-----DSEIYAHSGRLK---TPYADELL 212
            L  SKR  LA + G     V R KL+++ +      +S     SG  K     Y   L 
Sbjct: 256 PLPLSKRKYLANYLGRAQGKVGRLKLIELAKQYPDKLESPELKFSGPGKFGRMEYFQHLR 315

Query: 213 GSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPL- 271
            +KFCL  +G    T R  +S +  CVPVI+++  + PF +V+++   SI   +  I L 
Sbjct: 316 NAKFCLAPRGESSWTLRFYESFFVECVPVILSDQAEFPFQNVIDYTQISIKWPSTRIGLE 375

Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWSTS 331
           L + L+ I  E    ++  +   R+     V+  ++++   +   +W  +  VR Q+  S
Sbjct: 376 LLEYLESIPDEN---IEQMIAAGRQIRCLWVYAPEFESCSAMQGIMWELQRKVR-QFHQS 431

Query: 332 LDSNW 336
            ++ W
Sbjct: 432 AETFW 436


>gi|168016669|ref|XP_001760871.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162687880|gb|EDQ74260.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 378

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 77/290 (26%), Positives = 132/290 (45%), Gaps = 29/290 (10%)

Query: 48  FVTKDPSKADLFFLPF--SIARMRHDRRIGTEGIPDFISHY-------IFNISQKYPYWN 98
           F   D   AD+ F+PF  S++  R  +    + + D I          +    ++ P W 
Sbjct: 67  FRVADWRDADVIFVPFFASLSYNRFGKASEEKRLTDLIKDQNDVLQLKLVKFLEEQPAWK 126

Query: 99  RTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLP--QIW 154
            +GG DH +V  H    ++MQ       N++ +V     + S  +A+  KDV  P   + 
Sbjct: 127 ASGGRDHVFVIHHP---NSMQATRNRLRNSLFIVSDFGRYDS-EVANIQKDVVAPYKHVI 182

Query: 155 PRQEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTP-YAD 209
           P   D        R  L FF GA+       +R +L ++ ++   +   +G      +  
Sbjct: 183 P-TFDFDDSSFHTRKILLFFQGAIVRKEGGKIRHELYRLLKDKPGVRFTTGNTALDGFQS 241

Query: 210 ELLG---SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
             +G   SKFCL++ G   ++ R+ DS+   CVPVII++  +LPF D L++ +F I + +
Sbjct: 242 ATIGMRSSKFCLNMAGDTPSSNRLFDSIVSHCVPVIISDDIELPFEDTLDYSNFCIFINS 301

Query: 267 ---LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
              L    +  +L+ +S EE+  L + +L V  HF++       DA  MV
Sbjct: 302 SLALKPGYVINMLRNVSEEEWTQLWNQLLLVEHHFEYQHPTRKNDAVNMV 351


>gi|15241619|ref|NP_199306.1| Exostosin family protein [Arabidopsis thaliana]
 gi|30694651|ref|NP_851132.1| Exostosin family protein [Arabidopsis thaliana]
 gi|10177484|dbj|BAB10875.1| unnamed protein product [Arabidopsis thaliana]
 gi|15081733|gb|AAK82521.1| AT5g44930/K21C13_11 [Arabidopsis thaliana]
 gi|27363262|gb|AAO11550.1| At5g44930/K21C13_11 [Arabidopsis thaliana]
 gi|332007793|gb|AED95176.1| Exostosin family protein [Arabidopsis thaliana]
 gi|332007794|gb|AED95177.1| Exostosin family protein [Arabidopsis thaliana]
          Length = 443

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 70/281 (24%), Positives = 126/281 (44%), Gaps = 24/281 (8%)

Query: 52  DPSKADLFFLPF--SIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVA 109
           DP++ADLF++    S++ +    R G     + +   + +  +   +W R  G DH  VA
Sbjct: 129 DPAEADLFYVSAFSSLSLIVDSGRPGFGYSDEEMQESLVSWLESQEWWRRNNGRDHVIVA 188

Query: 110 CHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-KDVSLPQIWPRQEDPPKLGSSKR 168
                 +A+++  +   NA+ +V       +   +  KDV +P          +LG  +R
Sbjct: 189 GDP---NALKRVMDRVKNAVLLVTDFDRLRADQGSLVKDVIIPYSHRIDAYEGELGVKQR 245

Query: 169 NKLAFFAG----AVNSPVREKLLQVWRNDSEIYAHSGRLKT----PYADELLGSKFCLHV 220
             L FF G         VR+ L ++   + ++    G             +  SKFCLH+
Sbjct: 246 TNLLFFMGNRYRKDGGKVRDLLFKLLEKEEDVVIKRGTQSRENMRAVKQGMHTSKFCLHL 305

Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV---ATLDIPLLKKILK 277
            G   +  R+ D++   CVPVI+++  +LPF DV++++ FSI +   A L    + K L+
Sbjct: 306 AGDTSSACRLFDAIASLCVPVIVSDGIELPFEDVIDYRKFSIFLRRDAALKPGFVVKKLR 365

Query: 278 GISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELW 318
            +   + L  Q  + +VR++F       DY      + E+W
Sbjct: 366 KVKPGKILKYQKVMKEVRRYF-------DYTHLNGSVNEIW 399


>gi|30686300|ref|NP_850241.1| Exostosin family protein [Arabidopsis thaliana]
 gi|50253448|gb|AAT71926.1| At2g35100 [Arabidopsis thaliana]
 gi|53828615|gb|AAU94417.1| At2g35100 [Arabidopsis thaliana]
 gi|110737872|dbj|BAF00874.1| hypothetical protein [Arabidopsis thaliana]
 gi|330253970|gb|AEC09064.1| Exostosin family protein [Arabidopsis thaliana]
          Length = 447

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 75/284 (26%), Positives = 124/284 (43%), Gaps = 27/284 (9%)

Query: 52  DPSKADLFFLP-FS----IARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHF 106
           DP+ ADLF++P FS    I         G+    + +   +    +   +W R  G DH 
Sbjct: 130 DPADADLFYVPVFSSLSLIVNAGRPVEAGSGYSDEKMQEGLVEWLEGQEWWRRNAGRDHV 189

Query: 107 YVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-KDVSLPQIWPRQEDPPKLGS 165
             A      +A+ +  +   NA+ +V           +  KDV +P          ++G 
Sbjct: 190 IPAGDP---NALYRILDRVKNAVLLVSDFGRLRPDQGSFVKDVVIPYSHRVNLFNGEIGV 246

Query: 166 SKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIYAHSGRL----KTPYADELLGSKFC 217
             RN L FF G         VR+ L QV   + ++    G      +      +  SKFC
Sbjct: 247 EDRNTLLFFMGNRYRKDGGKVRDLLFQVLEKEDDVTIKHGTQSRENRRAATKGMHTSKFC 306

Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV---ATLDIPLLKK 274
           L+  G   +  R+ DS+   CVP+I+++  +LPF DV++++ FSI V   A L    L +
Sbjct: 307 LNPAGDTPSACRLFDSIVSLCVPLIVSDSIELPFEDVIDYRKFSIFVEANAALQPGFLVQ 366

Query: 275 ILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELW 318
           +L+ I +++ L  Q  +  VR++F       DYD     + E+W
Sbjct: 367 MLRKIKTKKILEYQREMKSVRRYF-------DYDNPNGAVKEIW 403


>gi|414880038|tpg|DAA57169.1| TPA: hypothetical protein ZEAMMB73_490377 [Zea mays]
          Length = 497

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 82/332 (24%), Positives = 143/332 (43%), Gaps = 62/332 (18%)

Query: 43  FMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQ---------- 92
            +KS    +   +AD+F++PF                   IS+++    +          
Sbjct: 172 LLKSVIRVQQQEEADIFYVPFFTT----------------ISYFLLEKQECKALYREALK 215

Query: 93  ---KYPYWNRTGGADHFYVACH-----SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI- 143
                P W R+ G DH     H     S+ RS  +  W +      +  + +++  G + 
Sbjct: 216 WVTDQPAWQRSEGRDHVIPVHHPWSFKSVRRSVKKAIWLLP----DMDSTGNWYKPGQVY 271

Query: 144 AHKDVSLPQIWPRQEDPPKL---GSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEI 196
             KDV LP +        K      SKR+ L FF G +       +R KL++  ++  +I
Sbjct: 272 LEKDVILPYVPNVDLCDHKCVLETQSKRSILLFFRGRLKRNAGGKIRSKLVEELKSAKDI 331

Query: 197 YAHSGRL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFA 252
               G      K    D +  S FCL   G   ++AR+ D++  GC+PVII++  +LPF 
Sbjct: 332 VIEEGSTGAQGKAAAQDGMRKSFFCLSPAGDTPSSARLFDAIVSGCIPVIISDELELPFE 391

Query: 253 DVLNWKSFSIVVATLDIPL---LKKILKGISSEEYLLLQSYVLKVRKHFQWH-----VFP 304
            +L+++  ++ V+  D      L K L+GI+++    +QS ++K  +HF +      + P
Sbjct: 392 GILDYREIALFVSASDAVQPGWLLKYLRGINAKRIREIQSNLVKYSRHFLYSSPAQPLGP 451

Query: 305 SDYD----AFYMVMYELWLRRSSVRVQWSTSL 332
            D      A  +V  +L +RRS   V+ S S+
Sbjct: 452 EDLTWRMIAGKLVNIKLQIRRSQRLVKESRSI 483


>gi|356570361|ref|XP_003553358.1| PREDICTED: probable glycosyltransferase At3g07620-like [Glycine
           max]
          Length = 520

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 77/289 (26%), Positives = 137/289 (47%), Gaps = 38/289 (13%)

Query: 51  KDPSKADLFFLPF--SIARMR------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGG 102
           ++ S+AD+ F+PF  S+   R      H++R   + + + +  Y+    +    W R+GG
Sbjct: 202 RNSSEADVIFVPFFSSLCYNRLSKTGPHEKRSRNKLLQEKLVKYVTAQEE----WKRSGG 257

Query: 103 ADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLPQ---IWPRQ 157
            DH  +A H    ++M  A  +KL     + S       +IA+  KDV  P    +    
Sbjct: 258 KDHVILAHHP---NSMLDA-RMKLWPGTFILSDFGRYPTNIANVEKDVIAPYKHVVGSYD 313

Query: 158 EDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKT----PYAD 209
            D     S  R  L +F GA+       VR +L  + +N+ +++   G ++        +
Sbjct: 314 NDQSSFDS--RTTLLYFQGAIYRKDGGHVRHELYYLLKNEKDVHFSFGSVQKGGVRKATE 371

Query: 210 ELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI 269
            +  SKFCL++ G   ++ R+ D++   CVPVII++  +LP+ DVL++  F I V T D 
Sbjct: 372 GMRSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIELPYEDVLDYSQFCIFVRTRDA 431

Query: 270 PLLKK-----ILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
             LKK      ++ I  EE+  + + + +V   F++     + DA  M+
Sbjct: 432 --LKKRYLINFIRSIGKEEWTRMWNRLKEVESFFEFQFPSKEGDAVQMI 478


>gi|356497977|ref|XP_003517832.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
           [Glycine max]
          Length = 509

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 77/302 (25%), Positives = 145/302 (48%), Gaps = 34/302 (11%)

Query: 50  TKDPSKADLFFLPF--SIARMR------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTG 101
            ++ S+AD+ F+PF  S++  R      H ++   + + + +  Y+    +    W R+G
Sbjct: 190 VQNSSEADIIFVPFFSSLSYNRYSKSKPHVKKSKNKILQEKLVTYLMAQEE----WKRSG 245

Query: 102 GADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLP---QIWPR 156
           G DH  +A H    ++M  A  +KL     + S       +IA+  KDV  P    I   
Sbjct: 246 GKDHLILAHHP---NSMLDA-RMKLWPATFILSDFGRYPPNIANVEKDVIAPYKHLISSY 301

Query: 157 QEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEI---YAHSGRLKTPYAD 209
             D     S  R  L +F GA+        R++L  + +++ ++   +   G+     A 
Sbjct: 302 VNDNSNFDS--RPTLLYFQGAIYRKDGGLARQELFYLLKDEKDVHFSFGSIGKDGIKKAT 359

Query: 210 E-LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
           E +  SKFCL++ G   ++ R+ D++   CVPVII++  +LP+ DV+++  F I V T D
Sbjct: 360 EGMRASKFCLNIAGDTPSSNRLFDAIASHCVPVIISDKIELPYEDVIDYSEFCIFVRTSD 419

Query: 269 I---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
                 L   ++GI+ EE+  + + + +V   F++H    + DA  M+   +  +  ++R
Sbjct: 420 AIKEKFLINFIRGIAKEEWTRMWNKLKEVEHFFEFHFPSKENDAVQMIWQAVARKVPAMR 479

Query: 326 VQ 327
           ++
Sbjct: 480 LK 481


>gi|297839341|ref|XP_002887552.1| F25A4.34 [Arabidopsis lyrata subsp. lyrata]
 gi|297333393|gb|EFH63811.1| F25A4.34 [Arabidopsis lyrata subsp. lyrata]
          Length = 458

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 69/276 (25%), Positives = 130/276 (47%), Gaps = 28/276 (10%)

Query: 51  KDPSKADLFFLPF--SIARMRHDRRIGTEGIPD--FISHYIFNISQKYPYWNRTGGADHF 106
           K+ ++AD+ F+PF  S++  R  +  G E I     +   +    +    W R  G DH 
Sbjct: 148 KNSNEADIVFVPFFASLSYNRKSKLRGNETISGDRLLQERLVEFLKSQDEWKRFDGKDHL 207

Query: 107 YVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIA-HKDVSLP-----QIWPRQEDP 160
            +A H    +++  A     +A+ V+     + S +    KD+  P     +     E  
Sbjct: 208 IIAHHP---NSLLYAKNFLGSAMFVLSDFGRYSSANANLEKDIIAPYLHVVKTISNNESA 264

Query: 161 PKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LL 212
           P     KR  LA+F GA+       +R++L  + R++ +++   G ++     +    + 
Sbjct: 265 P---FEKRPVLAYFQGAIYRKDGGTIRQELYNLLRDEKDVHFAFGTVRRNGTKQTGKGMA 321

Query: 213 GSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV---ATLDI 269
            SKFCL++ G   ++ R+ D++   CVPVII++  +LPF D L++  FS+ V     +  
Sbjct: 322 SSKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDQIELPFEDSLDYSGFSVFVHASEAVKK 381

Query: 270 PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPS 305
             L  +L+GI+ +++      + +V   F++  FPS
Sbjct: 382 GFLVNLLRGITEDQWKKKWGRLKEVAGCFEYR-FPS 416


>gi|384251887|gb|EIE25364.1| exostosin-like glycosyltransferase [Coccomyxa subellipsoidea C-169]
          Length = 705

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 86/336 (25%), Positives = 133/336 (39%), Gaps = 65/336 (19%)

Query: 32  NYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRH-------DRRIGTEGIPDFIS 84
            Y  E    +  ++S   T DP  AD F++P   +   H              G P  + 
Sbjct: 316 TYQIEPALHEMLLQSPHRTLDPEAADFFYVPVYTSCFIHPVYGWADTPWFHNPGSPRVMH 375

Query: 85  HYIFNISQK------YPYWNRTGGADHFYVACHSIGR---------SAMQKAWEVKL--- 126
                +  K       PYWNRTGG DH ++  H  G          S +   W  K    
Sbjct: 376 AATMMLEAKRWLETELPYWNRTGGRDHIWLISHDEGSCWAPSEIRSSIILSHWGRKALDH 435

Query: 127 -------------NAIQVVCSS---SYFISGHIAH---KDVSLPQIWP--RQEDPPKLGS 165
                        NA+          + I GH  +   KD+ +P   P  R    P  G+
Sbjct: 436 ESYSAYPFDNYSDNAVHPEWRPHGWRHIIEGHPCYDPDKDLIIPAFVPPARIVPSPLTGA 495

Query: 166 SK--RNKLAFFAGAV--------NSPVREKLL-----QVWRNDSEIYAHSGRLKTP--YA 208
            +  R  L FF G V        +  +R+++      Q WR    I+  + +  TP  Y+
Sbjct: 496 REDPRPLLLFFRGDVGLNRRPHYSRGIRQRIYALSKEQRWREKYRIWIGT-KEDTPGGYS 554

Query: 209 DELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
           + L  SKFCL V G +  + R  D++ +GCVPV++ +  D  F  +L+W+ F++ +   +
Sbjct: 555 ELLSSSKFCLVVPG-DGWSPRAEDAMLHGCVPVVVNDGVDQVFETLLDWEEFAVRIPERE 613

Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFP 304
           +  L +IL  IS      LQ  V +V   F +   P
Sbjct: 614 MEFLPEILLSISPSRLQQLQKGVRRVWHRFMYRALP 649


>gi|307110230|gb|EFN58466.1| hypothetical protein CHLNCDRAFT_140476 [Chlorella variabilis]
          Length = 632

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 78/341 (22%), Positives = 137/341 (40%), Gaps = 74/341 (21%)

Query: 32  NYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDF---ISHYIF 88
            YA E+ F +  ++S   T DP +AD F++P   +   +  R G + + DF   +SH   
Sbjct: 235 TYALEAGFLEMLLQSEHRTLDPEEADFFYVPVFTSCFINPVRDGADSLRDFFYGVSHNRV 294

Query: 89  N------------ISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCS-- 134
                        +   +PYW R GG DH ++  H      +  A    + +  ++ S  
Sbjct: 295 QGAANMLLEAYHWVQAMFPYWERRGGRDHIWLVTHDEASCWVPAA----IRSTSIILSHW 350

Query: 135 --------SSYFISGHIAHKDVSLPQIWPRQ-----------EDP--------------- 160
                   S    S  +   DV+ PQ  P              DP               
Sbjct: 351 GRMDAHHTSGTGYSADVYSNDVTHPQFEPDGFLGKLNLTQPCYDPVKDLVVPLMKTPEHY 410

Query: 161 ---PKLGSSKRNK--LAFFAGAV---NSP----VREKLLQVWRNDSEIYAHS------GR 202
              P +G+  R +  LAF  G V   N P    +R++L +       +  H         
Sbjct: 411 RLSPLVGAPPRQRTWLAFHRGRVQADNPPYSRGIRQRLAKAAAEGGWLEKHKIAVGEYDT 470

Query: 203 LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
           L+  Y++ L  S FC  + G +  +AR+ D++ +GC+PV+I +   + F  V++  +F+I
Sbjct: 471 LQGDYSELLASSVFCPVIPG-DGWSARMDDAMLHGCIPVLIMDEVQVSFESVVDLSTFTI 529

Query: 263 VVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVF 303
            +   D   L  IL+ ++ E    +Q  + +V + F +  +
Sbjct: 530 RIPEADAEKLPDILQAVTQERREEMQRALARVWQRFTYSSY 570


>gi|145352087|ref|XP_001420390.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580624|gb|ABO98683.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 517

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 47/116 (40%), Positives = 69/116 (59%), Gaps = 6/116 (5%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  SK+CL+V G   +TAR+ D + +GCVPVI+A+ YDLPF+ + +W  FS+ V  
Sbjct: 401 YMTMLSKSKYCLYVYGDRAHTARLYDIITFGCVPVIVADGYDLPFSWLFDWSKFSVRVLE 460

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY-DAFYMVMYELWLRR 321
            D+  L  IL      +Y  L+  ++KV   FQ+H   S + DAF++ M  L +RR
Sbjct: 461 DDVATLPSIL---DRADYDSLRRELVKVHSFFQYHNRGSIFGDAFWITM--LGVRR 511


>gi|168017876|ref|XP_001761473.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162687479|gb|EDQ73862.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 399

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 67/252 (26%), Positives = 115/252 (45%), Gaps = 24/252 (9%)

Query: 30  RGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFN 89
           RG + ++    K  MKS + T D S+A+ FF+P  +  +R    +  + +    + +   
Sbjct: 47  RGQWGTQVKIHKLLMKSRYRTLDKSRANFFFVPVYVKCVRIFGGLNEKEV----NEHFLK 102

Query: 90  ISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVC-----SSSYFISGHIA 144
           I ++ PY++R+GG DH +V     G + + K W   LN    +      +     S    
Sbjct: 103 ILRQMPYFHRSGGRDHIFVFPSGAG-AHLVKGWPNFLNRSIFLTPEGDRTDKKAFSSFNT 161

Query: 145 HKDVSLPQI-----WPRQEDPPKLGSSKRNKLAFFAG-AVNSPVREKLLQVWRN-----D 193
            KD+ +P        P       L  SKR  +A + G A     R +L+++ +      D
Sbjct: 162 WKDIIIPGNVDIINHPSNSATSPLPLSKRKYVANYLGRAQGKKGRLQLIELAKQFPAELD 221

Query: 194 SEIYAHSGRLK---TPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLP 250
           +   A  G  K     Y + L  +KFCL  +G    T R  ++ +  CVPVI+++  +LP
Sbjct: 222 APELAFQGSAKLGRIEYYNRLRNAKFCLAPRGESSWTLRFYEAFFVECVPVILSDEIELP 281

Query: 251 FADVLNWKSFSI 262
           + +VL++  FSI
Sbjct: 282 YQNVLDYSGFSI 293


>gi|356506196|ref|XP_003521873.1| PREDICTED: probable glucuronosyltransferase GUT1-like [Glycine max]
          Length = 505

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 76/290 (26%), Positives = 138/290 (47%), Gaps = 39/290 (13%)

Query: 51  KDPSKADLFFLPF--SIARMR-------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTG 101
           ++ S++D+ F+PF  S+   R       H++R   + + + +  Y+    +    W R+G
Sbjct: 196 RNSSESDVVFVPFFSSLCYNRFSSKTNPHEKRSMNKVLQEKLVKYVTEQEE----WKRSG 251

Query: 102 GADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLPQ---IWPR 156
           G DH  VA H    ++M  A  +KL     + S       +IA+  KDV  P    +   
Sbjct: 252 GKDHVIVAHHP---NSMLDA-RMKLWPGTFILSDFGRYPTNIANVEKDVIAPYKHVVGSY 307

Query: 157 QEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTP----YA 208
             D     S  R  L +F GA+       VR +L  + +N+ +++   G ++       A
Sbjct: 308 DNDQSSFDS--RPTLLYFQGAIYRKDGGHVRHELYYLVKNEKDVHFSFGNVEKGGVRNAA 365

Query: 209 DELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
           + +  SKFCL++ G   ++ R+ D++   CVPVII++  +LP+ DV+++  F + V T D
Sbjct: 366 EGMRSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDEIELPYEDVIDYSQFCVFVRTRD 425

Query: 269 IPLLKK-----ILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
              LKK      ++ I  EE+  + + + +V   F++     + DA  M+
Sbjct: 426 A--LKKRYLINFIRSIGKEEWTRMWNRLKEVESFFEFQFPSKEGDAVQMI 473


>gi|307103001|gb|EFN51266.1| hypothetical protein CHLNCDRAFT_141180 [Chlorella variabilis]
          Length = 404

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 72/294 (24%), Positives = 132/294 (44%), Gaps = 37/294 (12%)

Query: 39  FKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWN 98
           F +  + S   T+DPS+A+LFF+P +++       +    + D ++ +I   +  YPYWN
Sbjct: 90  FIEQLLMSPVRTEDPSEANLFFVP-ALSWSYGGNALNAVHL-DLVADHI---ASHYPYWN 144

Query: 99  RTGGADHFY------VACHSIGRS-AMQKAWEVKLNAIQVV------CSSSYFISGHIAH 145
           R+ G DH +       AC   GR+ A  K     LN I +        +++   + +   
Sbjct: 145 RSQGRDHIFWLTNDRGACALTGRTEAAIKLTHFGLNTINISVGWGPGAATNPENACYNPL 204

Query: 146 KDVSLPQIWPRQEDPPKLGS--------SKRNKLAFFAGAVNS---------PVREKLLQ 188
           +DV  P       +  ++          + +  L FF+GAV++          +  +L++
Sbjct: 205 RDVVAPPFDDMARELMEVSRKLSVEDIIAAKTSLFFFSGAVSNDSEYSGNTRQLLRELVK 264

Query: 189 VWRNDSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHY 247
            W +   I+   G      Y   L  SKFC  V G+     R+   ++ G VP++I    
Sbjct: 265 RWNDPEIIFETEGDTGLGDYVKRLRASKFCPAVFGYGFGM-RLLTCVFSGSVPLVIQERV 323

Query: 248 DLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
             P  D+L +++FS+ +    +P L +IL+ I+ ++Y  L   +++ R  F W 
Sbjct: 324 AQPLEDLLPYETFSLRLNNGHLPDLPRILRSITDQQYQRLVQGLVRYRDAFHWE 377


>gi|60657600|gb|AAX33321.1| secondary cell wall-related glycosyltransferase family 47 [Populus
           tremula x Populus tremuloides]
          Length = 509

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 76/302 (25%), Positives = 145/302 (48%), Gaps = 34/302 (11%)

Query: 50  TKDPSKADLFFLPF--SIARMR------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTG 101
            ++ S+AD+ F+PF  SI+  R      H ++   + + + +  ++   SQK   W R+G
Sbjct: 190 VQNSSEADVIFVPFFSSISYNRYSKVNPHQKKSKNKSLEEKLVKFV--TSQKE--WKRSG 245

Query: 102 GADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLPQ---IWPR 156
           G DH  +A H    ++M  A  +KL     + +     S +IA+  KDV  P    I   
Sbjct: 246 GRDHIILAHHP---NSMLYA-RMKLWTAMFILADFGRYSPNIANVGKDVIAPYKHVIKSY 301

Query: 157 QEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLK----TPYA 208
             D     S  R  L +F GA+        R++L    +++ +++   G ++    +  +
Sbjct: 302 ANDSSNFDS--RPTLLYFQGAIYRKDGGFARQELFYALKDEKDVHFQFGSVQKDGVSKAS 359

Query: 209 DELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
             +  SKFCL++ G   ++ R+ D++   CVPVII++  +LP+ DVL++  F I V T D
Sbjct: 360 QGMHSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIELPYEDVLDYSQFCIFVRTSD 419

Query: 269 I---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
                 L  +++ I  +E+  +   + +V   F++     + DA  M+   +  +  ++R
Sbjct: 420 AVREKFLINLVRSIKKDEWTRMWQRLKEVENFFEFQYPSKEGDAVQMIWQAVARKVPAIR 479

Query: 326 VQ 327
           ++
Sbjct: 480 LK 481


>gi|159479930|ref|XP_001698039.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158273838|gb|EDO99624.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 810

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 92/350 (26%), Positives = 143/350 (40%), Gaps = 95/350 (27%)

Query: 32  NYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIA---------------------RMRH 70
            Y  E+   +  ++S   T DP +AD F++P  I                      R+ H
Sbjct: 413 TYGIETLMHELMLQSEHRTFDPEEADFFYVPMYITCYFWPILGWADGPWWHAPNGLRVMH 472

Query: 71  DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKA--------- 121
              + TE + D++         K PYW+R GG DH ++     G   M KA         
Sbjct: 473 GANMITE-LHDWLR-------TKLPYWDRRGGRDHIWLMAADEGACWMPKAVYDTSIVLT 524

Query: 122 -W-----EVKLNAIQV-----VCSSSYF-----------ISGHIA---HKDVSLPQI--- 153
            W     E K N   +         S F           I GH      KD+ +P     
Sbjct: 525 HWGRLDPEHKSNTAYLQDNYTAKPESAFDAWRGVDFGDRIKGHPCFDPRKDLVVPAFKSP 584

Query: 154 --WPRQEDPPKLGSS--KRNKLAFFAGAVNSP--------VREKLLQV-----WRNDSEI 196
             +PR    P +G+   +R+ L FF G V +         +R++L  +     W N  +I
Sbjct: 585 NHFPRS---PLIGAPPLERDLLLFFRGDVGASRLPHYSRGIRQRLFHLAHKHDWYNRFKI 641

Query: 197 YAHSG-RLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
              SG  LK  Y+++L  SKFCL   G +  + R  D++ +GC+PV++ +     F  +L
Sbjct: 642 AIGSGDSLKGDYSEQLARSKFCLVAPG-DGWSPRAEDAILHGCIPVVVMDGVQAVFESIL 700

Query: 256 NWKSFSIVVATLDIPL--LKKILKGISSEEYLLLQSYVLKVRKHFQWHVF 303
           +W SFS+ +   D  L  L ++L  IS E    +Q ++ +V     WH F
Sbjct: 701 DWDSFSLRIREDDAALEALPQLLASISPERLAHMQRHLARV-----WHRF 745


>gi|224123938|ref|XP_002319201.1| predicted protein [Populus trichocarpa]
 gi|222857577|gb|EEE95124.1| predicted protein [Populus trichocarpa]
          Length = 450

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 76/302 (25%), Positives = 145/302 (48%), Gaps = 34/302 (11%)

Query: 50  TKDPSKADLFFLPF--SIARMR------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTG 101
            ++ S+AD+ F+PF  SI+  R      H ++   + + + +  ++   SQK   W R+G
Sbjct: 151 VQNSSEADVIFVPFFSSISYNRYSKVNPHQKKSKNKSLEEKLVKFV--TSQKE--WKRSG 206

Query: 102 GADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLPQ---IWPR 156
           G DH  +A H    ++M  A  +KL     + +     S +IA+  KDV  P    I   
Sbjct: 207 GRDHIILAHHP---NSMLYA-RMKLWTAMFILADFGRYSPNIANVGKDVIAPYKHVIKSY 262

Query: 157 QEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLK----TPYA 208
             D     S  R  L +F GA+        R++L    +++ +++   G ++    +  +
Sbjct: 263 ANDSSNFDS--RPTLLYFQGAIYRKDGGFARQELFYALKDEKDVHFQFGSVQKDGVSKAS 320

Query: 209 DELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
             +  SKFCL++ G   ++ R+ D++   CVPVII++  +LP+ DVL++  F I V T D
Sbjct: 321 QGMHSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIELPYEDVLDYSQFCIFVRTSD 380

Query: 269 I---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
                 L  +++ I  +E+  +   + +V   F++     + DA  M+   +  +  ++R
Sbjct: 381 AVREKFLINLVRSIKKDEWTRMWQRLKEVENFFEFQYPSKEGDAVQMIWQAVARKVPAIR 440

Query: 326 VQ 327
           ++
Sbjct: 441 LK 442


>gi|125547211|gb|EAY93033.1| hypothetical protein OsI_14833 [Oryza sativa Indica Group]
          Length = 429

 Score = 78.6 bits (192), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 70/287 (24%), Positives = 128/287 (44%), Gaps = 30/287 (10%)

Query: 2   NRSFRVYVYPHRRNDPFANVLLPVDFEP------RGNYASESYFKKAFMKSHFVTKDPSK 55
           +R  R+YVY     D    +L   + +       +G + ++    +  + S F T D  +
Sbjct: 52  HRDLRIYVYAEDEVDGLRALLRGRNDDVTAATCIKGQWGTQVKIHQLLLSSRFRTFDKDE 111

Query: 56  ADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
           ADLFF+P  +  +R   ++  + I      Y+  +  + PY+ R+GG DH +V     G 
Sbjct: 112 ADLFFVPTYVKCVRMTGKLNDKEIN---QTYVKVVLSQMPYFRRSGGRDHIFVFPSGAG- 167

Query: 116 SAMQKAWEVKLNAIQVVC-----SSSYFISGHIAHKDVSLP------QIWPRQEDPPKLG 164
           + + ++W   LN   ++      +    IS     KD+ +P       +   +     + 
Sbjct: 168 AHLFRSWATFLNRSIILTPEGDRTDKRGISAFNTWKDIIIPGNVDDSMVKSDRLAVKPIP 227

Query: 165 SSKRNKLAFFAGAVNSPV-REKLLQVWRN-DSEIYAHSGRLKTP-------YADELLGSK 215
            +KR  LA F G     V R +L+++ +    ++ +   +L  P       Y   L  +K
Sbjct: 228 LTKRKYLANFLGRAQGKVGRLQLVKLAKQYPDKLESPELKLSGPDKLGRIDYFKHLRNAK 287

Query: 216 FCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
           FCL  +G    T R  +S +  CVPVI+++  +LPF +V+++   SI
Sbjct: 288 FCLAPRGESSWTLRFYESFFVECVPVILSDEVELPFQNVIDYTEVSI 334


>gi|22330483|ref|NP_176908.2| exostosin-like protein [Arabidopsis thaliana]
 gi|115311405|gb|ABI93883.1| At1g67410 [Arabidopsis thaliana]
 gi|332196520|gb|AEE34641.1| exostosin-like protein [Arabidopsis thaliana]
          Length = 430

 Score = 78.2 bits (191), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 70/285 (24%), Positives = 130/285 (45%), Gaps = 27/285 (9%)

Query: 52  DPSKADLFFLPF--SIARMRHDRRIGTEGIPDF---ISHYIFNISQKYPYWNRTGGADHF 106
           DP  AD+F++PF  S++   H + + T+   +F   +   +    +   YWNR+GG DH 
Sbjct: 122 DPDLADVFYVPFFSSLSFNTHGKNM-TDPDTEFDRLLQVELMEFLENSKYWNRSGGKDHV 180

Query: 107 YVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWP----RQEDPPK 162
               H      +++  +V  + + VV    Y        KDV  P +       +E    
Sbjct: 181 IPMTHPNAFRFLRQ--QVNASILIVVDFGRYSKDMARLSKDVVSPYVHVVESLNEEGDDG 238

Query: 163 LGS--SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIY-----AHSGRLKTPYADEL 211
           +G     R  L +F G         +R +L ++   +S+++     A +  +K    + +
Sbjct: 239 MGDPFEARTTLLYFRGNTVRKDEGKIRLRLEKLLAGNSDVHFEKSVATTQNIKVS-TEGM 297

Query: 212 LGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI---VVATLD 268
             SKFCLH  G   ++ R+ D++   C+PVII++  +LPF D +++  FS+   +  +L+
Sbjct: 298 RSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIISDKIELPFEDEIDYSEFSLFFSIKESLE 357

Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
              +   L+    E++L +   +  V  HF++   P   DA  M+
Sbjct: 358 PGYILNNLRQFPKEKWLEMWKRLKNVSHHFEFQYPPKREDAVNML 402


>gi|21592312|gb|AAM64263.1| unknown [Arabidopsis thaliana]
          Length = 273

 Score = 78.2 bits (191), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 46/118 (38%), Positives = 66/118 (55%), Gaps = 15/118 (12%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEP--RGNYASESYFKKAF-MKSHFVTKDPSKAD 57
           M    +VY+YP            P+  EP   G YASE +F K     + FVTK+P KA 
Sbjct: 155 MELILKVYIYPDGDK--------PIFHEPHLNGIYASEGWFMKLMESNTQFVTKNPEKAH 206

Query: 58  LFFLPFSIARMRHDRRI-GTEGIPD---FISHYIFNISQKYPYWNRTGGADHFYVACH 111
           LF++P+S+ +++    + G+  I     F+  Y+  +S KYP+WNRT G+DHF VACH
Sbjct: 207 LFYMPYSVKQLQKSIFVPGSHNIKPLSIFLRDYVNMLSIKYPFWNRTHGSDHFLVACH 264


>gi|15228598|ref|NP_187015.1| exostosin-like protein [Arabidopsis thaliana]
 gi|6091757|gb|AAF03467.1|AC009327_6 hypothetical protein [Arabidopsis thaliana]
 gi|30102722|gb|AAP21279.1| At3g03650 [Arabidopsis thaliana]
 gi|110736553|dbj|BAF00242.1| hypothetical protein [Arabidopsis thaliana]
 gi|332640446|gb|AEE73967.1| exostosin-like protein [Arabidopsis thaliana]
          Length = 499

 Score = 78.2 bits (191), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 73/288 (25%), Positives = 136/288 (47%), Gaps = 34/288 (11%)

Query: 50  TKDPSKADLFFLPF--SIARMR------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTG 101
            K+ S+AD+ F+PF  S++  R        ++   + + + +  Y+   SQK   W  +G
Sbjct: 194 VKNSSEADVVFVPFFSSLSYNRFSKVNQKQKKSQDKELQENVVKYV--TSQKE--WKTSG 249

Query: 102 GADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP 161
           G DH  +A H    ++M  A      A+ VV     + S H+A+ D  +  + P +   P
Sbjct: 250 GKDHVIMAHHP---NSMSTARHKLFPAMFVVADFGRY-SPHVANVDKDI--VAPYKHLVP 303

Query: 162 KL-----GSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLK----TPYA 208
                  G   R  L +F GA+       VR++L  + + + +++   G ++    +   
Sbjct: 304 SYVNDTSGFDGRPILLYFQGAIYRKAGGFVRQELYNLLKEEKDVHFSFGSVRNHGISKAG 363

Query: 209 DELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
           + +  SKFCL++ G   ++ R+ D++   C+PVII++  +LP+ DVLN+  F + V + D
Sbjct: 364 EGMRSSKFCLNIAGDTPSSNRLFDAIASHCIPVIISDDIELPYEDVLNYNEFCLFVRSSD 423

Query: 269 I---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
                 L  +++ I  EEY  +   + +V ++F       D +  Y V
Sbjct: 424 ALKKGFLMGLVRSIGREEYNKMWLRLKEVERYFDLRFPVKDDEGDYAV 471


>gi|225457795|ref|XP_002265632.1| PREDICTED: probable glucuronosyltransferase Os03g0107900 [Vitis
           vinifera]
 gi|302142755|emb|CBI19958.3| unnamed protein product [Vitis vinifera]
          Length = 459

 Score = 78.2 bits (191), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 86/348 (24%), Positives = 148/348 (42%), Gaps = 46/348 (13%)

Query: 4   SFRVYVYPHRRNDPFANVL------LPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKAD 57
           S ++YVY     D   ++L      +P +    G + ++    +  +KS F T+   +AD
Sbjct: 85  SLKIYVYEEDEIDGLKSLLYGRDGSIPTEVCVTGQWGTQVKIHRLLLKSRFRTRRKEEAD 144

Query: 58  LFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSA 117
           LFF+P  I  +R    +  + I      Y+  +SQ  PY+  +GG +H +V     G   
Sbjct: 145 LFFVPTYIKCVRMKGGLNDKEIDQM---YVKVLSQ-MPYFRLSGGRNHIFVFPSGAG-PH 199

Query: 118 MQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS------------ 165
           + K+W   LN   ++             KD S    W     P  +              
Sbjct: 200 LFKSWATYLNRSIILTPEG----DRTDKKDTSAFNTWKDIIIPGNVADEMTTNGATFVQP 255

Query: 166 ---SKRNKLAFFAGAVNSPV-REKLLQVWRN-----DSEIYAHSGRLK---TPYADELLG 213
              SKR  LA F G     + R +L+++ +      +S     SG  K     Y   L  
Sbjct: 256 LPLSKRKFLANFLGRAQRKLGRLQLIELAKQYPDKLESPELQFSGPDKLGRIEYFHHLRN 315

Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI-PLL 272
           +KFC   +G    T R  +S +  CVPVI+++  +LPF +V+++   SI   +  I P L
Sbjct: 316 AKFCFAPRGESSWTLRFYESFFVECVPVILSDQVELPFQNVIDYTQVSIKWPSSQIGPQL 375

Query: 273 KKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYD---AFYMVMYEL 317
            + L+ I  +   +++  + + R+   W V+ S+ +   A   +M+EL
Sbjct: 376 LEYLESIPDK---VIEEMISRGREVRCWWVYASESEPCSAMRGIMWEL 420


>gi|297841429|ref|XP_002888596.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297334437|gb|EFH64855.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 429

 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 71/285 (24%), Positives = 129/285 (45%), Gaps = 27/285 (9%)

Query: 52  DPSKADLFFLPF--SIARMRHDRRIGTEGIPDF---ISHYIFNISQKYPYWNRTGGADHF 106
           DP  AD F++PF  S++   H + + T+   +F   +   +    +   YWNR+GG DH 
Sbjct: 121 DPDLADAFYVPFFSSLSFNTHGKNM-TDPDTEFDRQLQVELMEFLEGSEYWNRSGGKDHV 179

Query: 107 YVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWP----RQEDPPK 162
               H      +++  +V  + + VV    Y        KDV  P +       +ED   
Sbjct: 180 IPMTHPNAFRFLRQ--QVNASILIVVDFGRYAKDMARLSKDVVSPYVHVVESLNEEDDDG 237

Query: 163 LGS--SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIY-----AHSGRLKTPYADEL 211
           L      R  L +F G         +R +L ++   +S+++     A +  +K    + +
Sbjct: 238 LTDPFEARTTLLYFRGNTVRKDEGKIRLRLEKLLAGNSDVHFEKSVATTQNIKVS-TEGM 296

Query: 212 LGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI---VVATLD 268
             SKFCLH  G   ++ R+ D++   C+PVII++  +LPF D +++  FS+   +  +L+
Sbjct: 297 RSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIISDKIELPFEDEIDYSEFSLFFSIKESLE 356

Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
              +   L+    E++L +   +  V  HF++   P   DA  M+
Sbjct: 357 PGYILNKLRQFPKEKWLEMWKRLKNVSHHFEFQYPPKREDAVNML 401


>gi|326491429|dbj|BAJ94192.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 400

 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 81/326 (24%), Positives = 140/326 (42%), Gaps = 31/326 (9%)

Query: 5   FRVYVYPHRRNDPFANVLLPVDFEP------RGNYASESYFKKAFMKSHFVTKDPSKADL 58
            RVYVY     D    +L   D         +G + ++    +  ++S + T D  +A+L
Sbjct: 54  LRVYVYAEDEVDGLRALLRGRDGAVSAATCLKGQWGTQVKIHQLLLRSRYRTLDKDEANL 113

Query: 59  FFLPFSIARMRHDRRIGTEGIPD-FISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSA 117
           FF+P  +  +R      T G+ D  I+     +  + PY+ R+GG DH +V     G + 
Sbjct: 114 FFVPSYVKCVRM-----TGGLTDKEINQTYVKVLSQMPYFRRSGGRDHIFVFPSGAG-AH 167

Query: 118 MQKAWEVKLNAIQVVC-----SSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLA 172
           + ++W   LN   ++      +    IS     KD+ +P      +D   +G   R KL 
Sbjct: 168 LFRSWATFLNRSIILTPEGDRTDKRGISAFNTWKDIIIPG---NVDD--SMGKVGRLKLV 222

Query: 173 FFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIAD 232
             A      +    L++   D       GR+   Y   L  +KFCL  +G    T R  +
Sbjct: 223 ELAKQYPDKLESPELKLSGPDK-----LGRID--YFKHLRNAKFCLAPRGESSWTLRFYE 275

Query: 233 SLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI-PLLKKILKGISSEEYLLLQSYV 291
           S +  CVPV++++  +LPF +V+++   SI      I P L + L+ I  E    + +  
Sbjct: 276 SFFVECVPVLLSDEVELPFQNVIDYTKISIKWPASKIGPELFQYLESIPEERIEEMIARG 335

Query: 292 LKVRKHFQWHVFPSDYDAFYMVMYEL 317
            +VR  + + +      A   +M+EL
Sbjct: 336 REVRCMWVYALDTEPCSAMTAIMWEL 361


>gi|326432404|gb|EGD77974.1| hypothetical protein PTSG_09607 [Salpingoeca sp. ATCC 50818]
          Length = 377

 Score = 77.8 bits (190), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 71/277 (25%), Positives = 118/277 (42%), Gaps = 10/277 (3%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSI------ARMRHDRRIGTEGIPDFISHY 86
           +ASE    +A   S   T D   ADL  + + +           D R   E  P     +
Sbjct: 52  FASELLLPQAIRNSALHTDDAEAADLILIQYCVMIWGMQGEKMMDIRPLLEKRPVLARRF 111

Query: 87  IFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKL-NAIQVVCSSSYFISGHIAH 145
             N ++     +   G  H +   H  GR  +++  +  L NA  ++   S     +   
Sbjct: 112 QQNPARFLVVLSSDHGPCHNFNE-HLTGRRLVKRWIDRSLENATLLMNDGSLVNKCYRPG 170

Query: 146 KDVSLP-QIWPRQED-PPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL 203
           KDV +P   W            + R   AFFAGA +S +RE ++    N+  ++      
Sbjct: 171 KDVVIPPSTWIGNATFACSRPITDRKHFAFFAGAASSLIREYIINELGNEDWLFIPHDLQ 230

Query: 204 KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIV 263
              Y  E+  + FCL  +G    + R+ ++L  GC+PVIIA+    PF DVL++ +F++ 
Sbjct: 231 HEEYMCEMGNAVFCLAPRGRAAWSPRLVEALEAGCIPVIIADMNHEPFHDVLDYSTFTVQ 290

Query: 264 VATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
           V    +  L + L  ISS +   L +   + R HF++
Sbjct: 291 VHEDKLETLGEQLHSISSGQVARLHANGQRARAHFRY 327


>gi|159480086|ref|XP_001698117.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158273916|gb|EDO99702.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 821

 Score = 77.8 bits (190), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 82/340 (24%), Positives = 144/340 (42%), Gaps = 68/340 (20%)

Query: 32  NYASESYFKKAFMKSHFVTKDPSKADLFFLP------------FSIARMRHDRRIGTEGI 79
            Y+ E+ F +  ++S   T DP +AD F++P            ++ A   H   + +  +
Sbjct: 390 TYSVETMFHELLLQSEHRTFDPEEADFFYVPHYVTCYFWPIMGWADAPWWHAPYVDSRPM 449

Query: 80  --PDFISHYIFNISQKYPYWNRTGGADHFYV------AC--------------------- 110
              + ++     +    PYW+R GG DH ++      AC                     
Sbjct: 450 HGANMLTELHGWLRTNLPYWDRRGGRDHIWLMAADEGACWMPTAIYNTSIVLTHWGRLEA 509

Query: 111 -HSIGRSAMQKAWEVKLNAIQVVCSSSYF--ISGHIA---HKDVSLPQIWP--RQEDPPK 162
            H+ G + +Q  ++  +   Q      Y   I GH      KD+ +P   P       P 
Sbjct: 510 NHTSGTAYLQDVYDRPVYGFQRWPGVDYHHDIEGHPCFDPKKDLVIPAFKPPFHFARSPL 569

Query: 163 LGSS--KRNKLAFF---AGAVNSP-----VREKLL-----QVWRNDSEIY-AHSGRLKTP 206
           LG+   +R+ L +F   +GA   P     +R+++      Q W N  +I  +H G +   
Sbjct: 570 LGAPPLQRDILLYFRGDSGAFRLPQYSRGIRQRITDLSNRQDWFNRYKIVISHGGMVGGD 629

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y++ L  SKFCL   G +  + R  D++ +GC+PV++ +     F  +L+W SFS+ +  
Sbjct: 630 YSEHLARSKFCLVAPG-DGWSPRAEDAILHGCIPVVVMDGVQAVFESILDWDSFSLRIRE 688

Query: 267 LDIPL--LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFP 304
            D  L  L ++L  IS E    +Q ++ +V   F +   P
Sbjct: 689 DDAALEALPQLLASISPERLAHMQRHLARVWHRFAYTTGP 728


>gi|255542540|ref|XP_002512333.1| catalytic, putative [Ricinus communis]
 gi|223548294|gb|EEF49785.1| catalytic, putative [Ricinus communis]
          Length = 434

 Score = 77.4 bits (189), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 70/286 (24%), Positives = 120/286 (41%), Gaps = 24/286 (8%)

Query: 52  DPSKADLFFLPF----SIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFY 107
           DP KAD FF+PF    S     H        I   +   + ++  K  YW ++GG DH  
Sbjct: 127 DPEKADAFFVPFFSSLSFNTHGHTMTDPETEIDRQLQVDVIDMLYKSKYWQKSGGRDHVI 186

Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCS--SSYFISGHIAHKDVSLPQIW---PRQEDPPK 162
              H      +++    +LNA  ++ +    Y  S     KDV  P +       +D   
Sbjct: 187 PMTHPNAFRFLRQ----QLNASILIVADFGRYPKSMSTLSKDVVAPYVHVVDSFTDDEVS 242

Query: 163 LGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIY----AHSGRLKTPYADELLGS 214
                R  L FF G         VR KL ++     +I+    + +        + +  S
Sbjct: 243 NPFESRTTLLFFRGNTIRKDEGKVRAKLAKILTGYDDIHFERSSATAETIKASTEGMRSS 302

Query: 215 KFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI---VVATLDIPL 271
           KFCLH  G   ++ R+ D++   CVPVI+++  +LP+ D +++  FS+   V   +    
Sbjct: 303 KFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPYEDEIDYSQFSVFFSVNEAIQPGY 362

Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
           +   L+ +  E +L +   +  +  HF++   P   DA  M+  E+
Sbjct: 363 MVDQLRQLPKERWLEMWRKLKSISHHFEFQYPPEKEDAVDMLWREV 408


>gi|242055419|ref|XP_002456855.1| hypothetical protein SORBIDRAFT_03g044100 [Sorghum bicolor]
 gi|241928830|gb|EES01975.1| hypothetical protein SORBIDRAFT_03g044100 [Sorghum bicolor]
          Length = 432

 Score = 77.4 bits (189), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 77/288 (26%), Positives = 126/288 (43%), Gaps = 32/288 (11%)

Query: 50  TKDPSKADLFFLPF--SIARMRHDRRIGTEGIPD-----FISHYIFNISQKYPYWNRTGG 102
            +DP  AD FF+PF  S++   H R +     PD      +   I +I  K  YW R+ G
Sbjct: 125 VRDPDAADAFFVPFFSSLSFNVHGRNMTD---PDTEADRLLQVEIVDILWKSKYWQRSAG 181

Query: 103 ADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIA--HKDVSLPQIWP----R 156
            DH     H      ++      +NA  ++ S     +  +A   KDV  P +       
Sbjct: 182 RDHVIPMHHPNAFRFLR----AMVNASILIVSDFGRYTKELASLRKDVVAPYVHVVDSFL 237

Query: 157 QEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDS----EIYAHSGRLKTPYA 208
            +DPP    + R+ L FF G         +R KL +V +       E    +G       
Sbjct: 238 DDDPPDPFEA-RHTLLFFRGRTVRKDEGKIRAKLGKVLKGKEGVRFEDSIATGDGIKIST 296

Query: 209 DELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI---VVA 265
           + +  SKFCLH  G   ++ R+ D++   CVPVI+++  +LPF D +++  FS+   V  
Sbjct: 297 EGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSSRIELPFEDEIDYSEFSLFFSVEE 356

Query: 266 TLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
            L    L   L+ I  ++++ + S +  V  H+++   P   DA  M+
Sbjct: 357 ALRPDYLLNQLRQIPKKKWVDMWSKLKNVSHHYEFQYPPRKGDAVNMI 404


>gi|302829555|ref|XP_002946344.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
 gi|300268090|gb|EFJ52271.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
          Length = 785

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 85/352 (24%), Positives = 137/352 (38%), Gaps = 90/352 (25%)

Query: 32  NYASESYFKKAFMKSHFVTKDPSKADLFFLPFSI-----------------ARMRHDRRI 74
            Y+ E Y  +  ++S   T DP +AD F++P  I                 A + H R  
Sbjct: 382 TYSVEVYLHEMMLQSEHRTFDPEEADFFYVPMYITCFMWPVMGWADFPWWHAPLAHTR-- 439

Query: 75  GTEGIPDFISHYIFN----ISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQ 130
                P  +S+ I      +S  +P+WNR GG DH ++     G   M     V  N+  
Sbjct: 440 -----PMHVSNMILEAYEWLSTTFPWWNRRGGRDHIWLMAPDEGACYMPT---VVYNSSI 491

Query: 131 VVCSSSYFISGH---------IAHKDVSLPQI-------WPRQEDP-------------- 160
           ++         H         I  KD+ + Q        W  +  P              
Sbjct: 492 ILTHWGRMDPDHKSGSAFDQDIYDKDLPVAQFKGWRGLDWMEKSRPHLCYNPEKDLVIPA 551

Query: 161 ----------PKLGSS--KRNKLAFFAGAVNSP--------VREKLLQV-----WRNDSE 195
                     P LG+   +R+ L +F G V           +R+KL Q      W    +
Sbjct: 552 FKSPDHFQESPLLGAPPLERDILLYFRGDVGEGRRDHYSRGIRQKLFQFAHWGKWAEKYK 611

Query: 196 IYAHSGR-LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
           IY  +G  +   Y++ L  SKFCL   G +  +AR  D++ +GCVP+++ +     F  +
Sbjct: 612 IYIGTGETIGGSYSEHLARSKFCLVAPG-DGWSARAEDAILHGCVPLVVMDGVHAVFESI 670

Query: 255 LNWKSFSIVVATLDIPL--LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFP 304
           L+W SFSI +   +  L  + ++L  IS E    +Q  + +V   F +   P
Sbjct: 671 LDWDSFSIRIREDNQALQAIPELLTAISPERLAKMQRNLARVWHRFAYATGP 722


>gi|159470401|ref|XP_001693348.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158277606|gb|EDP03374.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 356

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 86/315 (27%), Positives = 134/315 (42%), Gaps = 65/315 (20%)

Query: 33  YASESYFKKAFMKSHFV-TKDPSKADLFFLP----FSIARMRHDRRIGTEGIPDF-ISHY 86
           Y +  YF K F+  + V T++P +A LF++P    F    +R          P++ +   
Sbjct: 25  YTAYEYFLKYFITDNIVRTENPYEAHLFYVPALNFFYSGNLR---------PPEYHLEAV 75

Query: 87  IFNISQKYPYWNRTGGADHFYV------ACH----------SIGRSAMQKAWEVKLNAIQ 130
           + ++   +P++NR+GG DHF        ACH           +    MQK     LN   
Sbjct: 76  MDHVKTAWPFYNRSGGRDHFIFLTGDRGACHMPRDMQDSMIKVVHFGMQKQ---GLNWTS 132

Query: 131 VVCSSSYFISGHI-AHKDVSLPQ-------IWPR------QEDPPKLG-SSKRNKLAFFA 175
           +  +  Y   G I   +D+ +P        +WP       Q      G  + RN    FA
Sbjct: 133 MEHNKEY---GCIRMRQDLVVPPHPNDHKPLWPVGAAAYFQRIAAAGGHDAGRNITFLFA 189

Query: 176 GAVNS---------PVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVN 226
           G V            VR  LL +   D  I    GR +  Y D L  S+FCL   G    
Sbjct: 190 GGVGEGEYSGGTRQAVRALLLNI--TDPAIMFVEGR-RDDYVDLLWRSQFCLAAYGHGWG 246

Query: 227 TARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLL 286
             R+  S+ +GC+PVII +H    F D L ++ FS+ +   D+P L ++L+  S E+   
Sbjct: 247 I-RVMQSIQFGCIPVIIQDHVYQAFEDFLPYEEFSVRLPLRDVPRLLELLRSYSPEQLAA 305

Query: 287 LQSYVLKVRKHFQWH 301
           L+  + K  + F W+
Sbjct: 306 LRLGMAKYFRAFIWN 320


>gi|255580236|ref|XP_002530948.1| catalytic, putative [Ricinus communis]
 gi|223529463|gb|EEF31420.1| catalytic, putative [Ricinus communis]
          Length = 512

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 73/307 (23%), Positives = 148/307 (48%), Gaps = 34/307 (11%)

Query: 50  TKDPSKADLFFLPF--SIARMR------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTG 101
            ++ S+AD+ F+PF  S++  R      H +R   + + + +  Y+ +  +    W R+ 
Sbjct: 181 VRNSSEADVIFVPFFSSLSYNRYSKVNPHQKRSKNKLLQEKLVRYVTSQME----WKRSQ 236

Query: 102 GADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLP---QIWPR 156
           G DH  +A H    ++M  A  +KL     + +       +IA+  KD+  P    I   
Sbjct: 237 GQDHIILAHHP---NSMLDA-RMKLWPALFILADFGRYPPNIANVDKDLIAPYKHVIRSY 292

Query: 157 QEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLK----TPYA 208
            +D     S  R  L +F GA+        R++L  + +++ +++   G ++       +
Sbjct: 293 ADDSSTFDS--RPTLLYFQGAIYRKDGGFARQELFYLLKDEKDVHFQFGSVQKDGINKAS 350

Query: 209 DELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
             +  SKFCL++ G   ++ R+ D++   CVPVII++  +LP+ DVL++  F I V T D
Sbjct: 351 QGMHTSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIELPYEDVLDYSQFCIFVRTSD 410

Query: 269 I---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
                 L  +++GI  +E+  +   + +V + F++     + DA  M+   +  +  ++R
Sbjct: 411 AIKEKFLINLIRGIGKDEWTQMWQKLKEVERFFEFQYPSKEGDAVQMIWQAVARKVPAIR 470

Query: 326 VQWSTSL 332
           ++ + S+
Sbjct: 471 MKINKSM 477


>gi|307104488|gb|EFN52741.1| hypothetical protein CHLNCDRAFT_138314 [Chlorella variabilis]
          Length = 647

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 84/348 (24%), Positives = 137/348 (39%), Gaps = 88/348 (25%)

Query: 32  NYASESYFKKAFMKSHFVTKDPSKADLFFLPF----------SIARMRHDRRIG-----T 76
            Y+ E    +  ++S   T DP +AD F++P           S A    D   G      
Sbjct: 233 TYSLEFGLLEMMLQSEHRTLDPEEADFFYVPVFPSCFIWPVRSTADSLRDFYYGWAQSRV 292

Query: 77  EGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCS-- 134
           +G  + +      +   YPYW+R GG DH ++  H      +  A    + +  ++ S  
Sbjct: 293 QGAANLLLEAYHWLRAHYPYWDRRGGRDHIWLVTHDEASCYVPAA----IKSASIILSHW 348

Query: 135 --------SSYFISGHIAHKDVSLPQIWPRQE-----------DP--------------- 160
                   S     G++ H +VS P   P              DP               
Sbjct: 349 GRKDPNHTSGTGFPGNVYHLNVSHPHWEPEGSMAKLDLSQPCHDPVKDLVLPLMKTPDHY 408

Query: 161 ---PKLGSSKRNK--LAFFAG---AVNSP-----VREKL--------------LQVWRND 193
              P +G+  RN+  LAF  G     ++P     VR++L              + +  N 
Sbjct: 409 HQSPLVGAPTRNRTWLAFHRGRQHKTDAPEYSRGVRQRLWSASQEHGWLDKYGILLGENP 468

Query: 194 SEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFAD 253
           S   A   +L   Y+  L  S FCL + G +  +AR+ D+  +GC+PVI+ +  D+ F  
Sbjct: 469 SSPGAEEVKLAGDYSQLLASSIFCLVLPG-DGWSARMDDATLHGCIPVIVMDEVDVSFES 527

Query: 254 VLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
           V++ + F++ VA  D+  L +IL  IS E    +Q  + +V     WH
Sbjct: 528 VIDLQQFTVRVAQADVERLPEILLEISQERRQEMQRALGRV-----WH 570


>gi|218192182|gb|EEC74609.1| hypothetical protein OsI_10218 [Oryza sativa Indica Group]
          Length = 737

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 86/332 (25%), Positives = 143/332 (43%), Gaps = 70/332 (21%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP----FSIAR--------MRHDRRIGTEGIP 80
           Y ++    ++ + S   T +  +AD F++P      I R        M  D R+ +    
Sbjct: 337 YGAQIALYESILASPHRTLNGDEADYFYVPALDSCLITRSDDAPHLQMPRDLRLRSYHTL 396

Query: 81  DFISHYIFNISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQV 131
           ++      +I+Q+YPYWNRT G DH +       AC++   I  S M   W    N    
Sbjct: 397 EYYRMTYDHIAQRYPYWNRTSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWG-NTNTKHK 455

Query: 132 VCSSSYF--------ISGHIAH------KDVSLPQIWPRQEDPP----KLGSSKRNK--- 170
             +++Y+        I     H      KD+ LP  W +Q +P     KL +  RN    
Sbjct: 456 NSTTAYWADNWNYIPIDRRGNHPCFDPRKDLVLP-AW-KQPNPAAIWLKLWARTRNNRTT 513

Query: 171 LAFFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGRLKTP----------- 206
           L +F G +                +R+KL   + +  +     GR  T            
Sbjct: 514 LFYFNGNLGPAYKDGRHEDTYSMGIRQKLAAEFGSTPDKQGKLGRQHTANVTVTYLRTEK 573

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y +EL  S FC  + G +  + R+ DS+  GC+PVII +   LP+ ++LN+ SF++ +  
Sbjct: 574 YYEELASSIFCGVLPG-DGWSGRMEDSMLQGCIPVIIQDGILLPYENMLNYNSFAVRIQE 632

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
            DIP L +IL+GI+  +   +   V ++ + F
Sbjct: 633 DDIPNLIRILRGINETQVEFMLRNVRQIWQRF 664


>gi|125585107|gb|EAZ25771.1| hypothetical protein OsJ_09611 [Oryza sativa Japonica Group]
          Length = 779

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 86/332 (25%), Positives = 143/332 (43%), Gaps = 70/332 (21%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP----FSIAR--------MRHDRRIGTEGIP 80
           Y ++    ++ + S   T +  +AD F++P      I R        M  D R+ +    
Sbjct: 379 YGAQIALYESILASPHRTLNGDEADYFYVPALDSCLITRSDDAPHLQMPRDLRLRSYHTL 438

Query: 81  DFISHYIFNISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQV 131
           ++      +I+Q+YPYWNRT G DH +       AC++   I  S M   W    N    
Sbjct: 439 EYYRMTYDHIAQRYPYWNRTSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWG-NTNTKHK 497

Query: 132 VCSSSYF--------ISGHIAH------KDVSLPQIWPRQEDPP----KLGSSKRNK--- 170
             +++Y+        I     H      KD+ LP  W +Q +P     KL +  RN    
Sbjct: 498 NSTTAYWADNWNYIPIDRRGNHPCFDPRKDLVLPA-W-KQPNPAAIWLKLWARTRNNRTT 555

Query: 171 LAFFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGRLKTP----------- 206
           L +F G +                +R+KL   + +  +     GR  T            
Sbjct: 556 LFYFNGNLGPAYKDGRHEDTYSMGIRQKLAAEFGSTPDKQGKLGRQHTANVTVTYLRTEK 615

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y +EL  S FC  + G +  + R+ DS+  GC+PVII +   LP+ ++LN+ SF++ +  
Sbjct: 616 YYEELASSIFCGVLPG-DGWSGRMEDSMLQGCIPVIIQDGILLPYENMLNYNSFAVRIQE 674

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
            DIP L +IL+GI+  +   +   V ++ + F
Sbjct: 675 DDIPNLIRILRGINETQVEFMLRNVRQIWQRF 706


>gi|302845214|ref|XP_002954146.1| hypothetical protein VOLCADRAFT_118607 [Volvox carteri f.
           nagariensis]
 gi|300260645|gb|EFJ44863.1| hypothetical protein VOLCADRAFT_118607 [Volvox carteri f.
           nagariensis]
          Length = 401

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 83/324 (25%), Positives = 122/324 (37%), Gaps = 57/324 (17%)

Query: 41  KAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRT 100
           +  + S   T D   AD FF+P  +      R  G     + ++  +  I Q +P+W R 
Sbjct: 2   QRLLSSGVRTADGDAADFFFIPLVM------RTKGQSA--NHLTAVVSYIQQHWPWWGRY 53

Query: 101 GGAD-HFYVACHSIGRSAMQKAWEVKLNAIQVVC-----------SSSYFISGHIAHKDV 148
           GG   H  V    +GR  +    E  L  ++ V            S   ++  H   KD+
Sbjct: 54  GGGHRHLLVVPADLGRRMLP---EELLKLVENVTFLTHWGSHTNHSEGAWVESHRPGKDI 110

Query: 149 SLPQIWPRQEDPP-------KLGSSKRNKLA---FFAGAVNSPVREKLLQVWRNDSEIYA 198
            +P +     D P        L S +R +     FF+G + S   E      R +S+   
Sbjct: 111 VVPPL--HNADEPIVFSPLHTLHSKRRRQRTSGLFFSGRICSDGSEPHRGRCRTNSQGNV 168

Query: 199 HSGRLK--------------TPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIA 244
               LK                YA  L    FCL   G      R   +   GCVPV+I 
Sbjct: 169 RHKVLKHHWNRTTWTLTTRAKAYASALSSHTFCLSPGGGGYGR-RSVQAAVMGCVPVLIG 227

Query: 245 NHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW---- 300
           +    PF   L+W  FS+ V   DIP L  IL+ ++S     +Q  +    +H  +    
Sbjct: 228 DGLHQPFEPELDWSQFSMSVPEQDIPHLHTILESMNSSTIAAMQEQLRCAAQHLYYSTTF 287

Query: 301 -HVFPSD--YDAFYMVMYELWLRR 321
             V   D  YDAF  +M  L +RR
Sbjct: 288 GEVMGEDGRYDAFETLMEVLRMRR 311


>gi|302825130|ref|XP_002994198.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300137940|gb|EFJ04733.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 762

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 82/335 (24%), Positives = 143/335 (42%), Gaps = 71/335 (21%)

Query: 34  ASESYFKKAFMKSHFVTKDPSKADLFFLPF--SIARMRHD--------RRIGTEGI--PD 81
            +E  F +  + S   T +  +AD FF P   + A  R D        + +G  G    +
Sbjct: 364 GAELAFLEGLLASPHRTMNGDEADYFFAPVLGACAITRADDAPHFSMEKHMGLRGYFSGE 423

Query: 82  FISHYIFNISQKYPYWNRTGGADHFYVACHSIGR-SAMQKAWEVKL-------NAIQVVC 133
              +   +I ++YP+WNR+ G DH ++     G  SA ++ W   +       N+     
Sbjct: 424 LYKNAYMHIKEQYPFWNRSSGRDHIWLFPWDEGACSAPKEIWNGTMLVHWGNTNSKHKKS 483

Query: 134 SSSYFISG-----------HIAH---KDVSLPQIWPRQEDPPKLGS-------SKRNKLA 172
           ++ YF              H  +   KD+ LP  W +  DP  +          +R  L 
Sbjct: 484 TTGYFADSWDDIPKEWRGDHPCYDPLKDIVLP-AW-KNPDPRSVAERFWSRPREERKTLF 541

Query: 173 FFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGRLKTP-----------YA 208
           +F G +                +R+++ + + +    +   GR   P           YA
Sbjct: 542 YFNGNLGKGYDFGRPEDRYSMGIRQRVAEEFGSTPNNHGKLGRQAAPDVVVTPQRSDDYA 601

Query: 209 DELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
            EL  S+FC    G +  + R+ D++ +GC+PVII +   LP+  +L+++SF++ VA   
Sbjct: 602 KELSSSRFCGVFPG-DGWSGRMEDAVLHGCIPVIIQDGIHLPYESLLDYESFTVRVAEDK 660

Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVF 303
           IP L  IL+ IS+ E   ++S +  VR  +Q  V+
Sbjct: 661 IPELITILRNISNAE---VESKLEAVRGLWQRFVY 692


>gi|449502905|ref|XP_004161776.1| PREDICTED: probable glucuronosyltransferase Os01g0926700-like
           [Cucumis sativus]
          Length = 482

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 72/289 (24%), Positives = 125/289 (43%), Gaps = 42/289 (14%)

Query: 52  DPSKADLFFLPFSIARMRHDRRIGTE--------GIPDFISHY-IFNISQKYPYWNRTGG 102
           +  +AD+ F+PF  A M  + ++G          G  D+     + +  +    W ++GG
Sbjct: 144 EAEEADVIFVPF-FATMSAEMQLGMAKGAFRKKVGNEDYERQRNVMDFLKSTDAWKKSGG 202

Query: 103 ADHFYVACHSIGRSAMQKAWEVKLNAIQVVC--------------SSSYFISGHIAHKDV 148
            DH   + HS+        W VK      V               SS+      I H  V
Sbjct: 203 RDHVLFSLHSLTDPVAM--WHVKAEIAPAVLLVVDFGGWFRLDTKSSNGSSPDMIQHTQV 260

Query: 149 SLPQ--IWPRQEDPPKL---GSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAH 199
           S+ +  I P     P+L    + KR  L +F GA        VREKL  +  N+ ++   
Sbjct: 261 SVLKDVIVPYTHLLPRLHLSANKKRQTLLYFKGAKHRHRGGLVREKLWDLLVNEPDVIME 320

Query: 200 SGRLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
            G       ++    +  S+FCLH  G    + R+ D++   C+PV+++++ +LPF D++
Sbjct: 321 EGFPNATGKEQSIKGMRSSEFCLHPAGDTPTSCRLFDAIQSLCIPVVVSDNIELPFEDMV 380

Query: 256 NWKSFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
           ++  FS+ VA  D      L K L+ I  E+    + Y+ +V+  F++ 
Sbjct: 381 DYSEFSVFVAVNDALKPNWLVKHLRTIPEEQRNRFRLYMARVQSVFEYE 429


>gi|224136688|ref|XP_002322391.1| predicted protein [Populus trichocarpa]
 gi|222869387|gb|EEF06518.1| predicted protein [Populus trichocarpa]
          Length = 457

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 66/271 (24%), Positives = 124/271 (45%), Gaps = 27/271 (9%)

Query: 52  DPSKADLFFLPF--SIARMRHDRRIGTEGIP--------DFISHYIFNISQKYPYWNRTG 101
           DP +ADLF++P   S++ + +  R GT  +P        + +   +    ++  YW R  
Sbjct: 137 DPEEADLFYVPVFSSLSLIVNPARAGT--VPGSDPVYSDEKMQEELVEWLEEQEYWRRNN 194

Query: 102 GADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-KDVSLPQIWPRQEDP 160
           G DH   A      +A+ +  +   N + ++       S   +  KDV +P         
Sbjct: 195 GRDHVVFAGDP---NALYRVLDRVKNVVLLLSDFGRVRSDQGSLIKDVIVPYSHRINVYN 251

Query: 161 PKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIYAHSGRL----KTPYADELL 212
             +G  +R  L FF G         +R+ L Q+   + ++    G      +      + 
Sbjct: 252 GDIGVEERKTLLFFMGNRYRKDGGKIRDLLFQMLEKEEDVVIRHGTQSRENRRTATRGMH 311

Query: 213 GSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT---LDI 269
            SKFCL+  G   +  R+ DS+   CVP+I+++  +LPF DV++++  +I V T   L  
Sbjct: 312 TSKFCLNPAGDTPSACRLFDSIVSLCVPLIVSDSIELPFEDVIDYRKIAIFVDTESSLKP 371

Query: 270 PLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
             L ++L+ +S+E+ L  Q  + +V+++F +
Sbjct: 372 GYLVRMLRAVSTEKILEYQKQMREVKRYFVY 402


>gi|357125540|ref|XP_003564451.1| PREDICTED: probable glucuronosyltransferase GUT1-like [Brachypodium
           distachyon]
          Length = 495

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 61/224 (27%), Positives = 106/224 (47%), Gaps = 24/224 (10%)

Query: 95  PYWNRTGGADHFYVACH-----SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDV 148
           P W R+ G DH     H     S+ R   +  W +      +  + +++  G +   KDV
Sbjct: 219 PAWQRSEGRDHIIPVHHPWSFKSVRRFVKKAIWLLP----DMDSTGNWYKPGQVYLEKDV 274

Query: 149 SLPQIWPRQEDPPKLGS---SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSG 201
            LP +        K  S   SKR+ L FF G +       VR KL+   ++  ++    G
Sbjct: 275 ILPYVPNVDLCDYKCASETQSKRSMLLFFRGRLKRNAGGKVRSKLVTELKDAEDVVIEEG 334

Query: 202 RL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
                 K    + +  S FCL+  G   ++AR+ D++  GC+PVII++  +LPF  +L++
Sbjct: 335 TAGAEGKVAAQNGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIISDELELPFEGILDY 394

Query: 258 KSFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHF 298
           +  ++ V++ D      L K L+GI ++    +QS ++K  +HF
Sbjct: 395 RKIALFVSSSDALQPGWLVKYLRGIDAKRVREMQSNLVKYSRHF 438


>gi|357463031|ref|XP_003601797.1| hypothetical protein MTR_3g085480 [Medicago truncatula]
 gi|355490845|gb|AES72048.1| hypothetical protein MTR_3g085480 [Medicago truncatula]
          Length = 425

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 67/293 (22%), Positives = 125/293 (42%), Gaps = 22/293 (7%)

Query: 52  DPSKADLFFLPF----SIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFY 107
           DP   D++F+PF    S     H  R     I   +   +  +  +  YW R+GG DH +
Sbjct: 119 DPEVVDVYFVPFFSSLSFNTHGHHMRDPETEIDHQLQIDLMGLLGQSKYWQRSGGRDHIF 178

Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISG-HIAHKDVSLPQIW---PRQEDPPKL 163
              H    +A +   +    +IQVV     +  G    +KDV  P +       +D P  
Sbjct: 179 PMTHP---NAFRFLRDQLNESIQVVVDFGRYPKGVSNLNKDVVSPYVHFVDSYVDDEPHD 235

Query: 164 GSSKRNKLAFFAGAVNSP----VREKLLQVWRNDSEIY----AHSGRLKTPYADELLGSK 215
               R  L FF G  +      VR K  ++     +++    + +G      +  +  SK
Sbjct: 236 PFESRTTLLFFRGGTHRKDKGIVRAKFTKILAGFDDVHYERSSATGENIKLSSKGMRSSK 295

Query: 216 FCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVA---TLDIPLL 272
           FCLH  G   ++ R+ D++   CVPVI+++  +LPF + +++  FS+  +    L+   +
Sbjct: 296 FCLHPAGDTPSSCRLFDAIVSHCVPVIVSDKIELPFENEIDYSQFSLFFSFKEALEPGYM 355

Query: 273 KKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
              L+    + +  +   +  +  H+++H  P   DA  M+  ++  +   +R
Sbjct: 356 INQLRSFPKQNWTEMWRQLKNISHHYEFHYPPEREDAVNMLWRQIKHKLPGIR 408


>gi|241997492|ref|XP_002433395.1| exostosin-2, putative [Ixodes scapularis]
 gi|215490818|gb|EEC00459.1| exostosin-2, putative [Ixodes scapularis]
          Length = 714

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 70/272 (25%), Positives = 118/272 (43%), Gaps = 35/272 (12%)

Query: 41  KAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRT 100
           +A   S + TKD ++A +F         R+  R             +  + Q  P+WN  
Sbjct: 129 EAIQHSPYYTKDAARACVFVPSIDTLNQRNLNR-----------KALSRVLQSLPHWN-- 175

Query: 101 GGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWP---- 155
            G++H     + +  S  + A  +++N  + + +   F S       D+ +P   P    
Sbjct: 176 NGSNHLLF--NMLPGSLPEYATSLEVNTGRALVAGGGFDSWTFRRSHDLGIPVFNPARPS 233

Query: 156 ------RQEDPPKLGSSKRNKLAF-FAGAVNS--PVREKLLQV------WRNDSEIYAHS 200
                 R+ D P L  S + ++ F F   V S   V   +L +      W  ++      
Sbjct: 234 LDEVSQRRVDRPWLVVSSQTRIHFEFRSEVASLAAVHPGVLNLSTCGRPWDGENNGVRCR 293

Query: 201 GRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSF 260
           G     Y D L   KFCL V+   +  + ++D+L  GCVPVI+A+ Y LPF++VL+WK  
Sbjct: 294 GDTLYKYPDILAEGKFCLVVRAARLGQSVLSDALMAGCVPVIVADEYILPFSEVLDWKRA 353

Query: 261 SIVVATLDIPLLKKILKGISSEEYLLLQSYVL 292
           +I +   D+  L  +LKG+S      ++S  L
Sbjct: 354 AIQIREDDLEDLVTVLKGVSKARLFEMRSQAL 385


>gi|356525152|ref|XP_003531191.1| PREDICTED: probable glycosyltransferase At3g07620-like isoform 1
           [Glycine max]
          Length = 427

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 71/283 (25%), Positives = 122/283 (43%), Gaps = 26/283 (9%)

Query: 52  DPSKADLFFLPF----SIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFY 107
           DP  A  FF+PF    S     H  +     I   +   +  + +K  YW R+GG DH +
Sbjct: 120 DPELAQAFFVPFFSSLSFNTHGHTMKDPATQIDRQLQVDLMELLKKSKYWQRSGGRDHVF 179

Query: 108 VACHSIGRSAMQKAWEVKLN-AIQVVCSSSYFISGHI-AHKDVSLPQIW---PRQEDPPK 162
              H      ++     +LN +IQVV     +  G    +KDV  P +       +D P+
Sbjct: 180 PMTHPNAFRFLRG----QLNESIQVVVDFGRYPRGMSNLNKDVVSPYVHVVDSFTDDEPQ 235

Query: 163 LGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIY-----AHSGRLKTPYADELLG 213
                R+ L FF G         VR KL ++     +++     A    +K   +  +  
Sbjct: 236 DPYESRSTLLFFRGRTYRKDEGIVRVKLAKILAGYDDVHYERSVATEENIKAS-SKGMRS 294

Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVA---TLDIP 270
           SKFCLH  G   ++ R+ D++   CVPVI+++  +LPF D +++  FS+  +    L   
Sbjct: 295 SKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPFEDDIDYSQFSVFFSFKEALQPG 354

Query: 271 LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
            +   L+    E++  +   +  +  H+++   P   DA  M+
Sbjct: 355 YMIDQLRKFPKEKWTEMWRQLKSISHHYEFEYPPKREDAVDML 397


>gi|116787500|gb|ABK24530.1| unknown [Picea sitchensis]
          Length = 446

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 69/280 (24%), Positives = 117/280 (41%), Gaps = 20/280 (7%)

Query: 52  DPSKADLFFLPF----SIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFY 107
           DP +AD+F++PF    S      + R         +   + ++ ++   W R+GG DH  
Sbjct: 139 DPDQADVFYVPFFASLSFNTHGQNMRDPETEFDKKLQIEVVDMLKRSKSWQRSGGRDHVI 198

Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQ---EDPPKLG 164
           V  H      ++   EV  +   V     Y  S     KDV  P +        D     
Sbjct: 199 VIHHPNAFRFLRD--EVNASIFVVADFGRYPRSVSFLRKDVVAPYVHVVDTYVNDDSSDP 256

Query: 165 SSKRNKLAFFAGAVNSP----VREKLLQVWRNDSEIYAHSGRLKTP----YADELLGSKF 216
              R  L +F G         VR KL ++  N   ++       T         +  S+F
Sbjct: 257 FESRTMLLYFRGRTKRKDEGFVRLKLAKILGNHKRVHFEDSLATTEGFEVAKQGMRSSRF 316

Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI---VVATLDIPLLK 273
           CLH  G   ++ R+ D++   CVPVI+++  +LPF D ++++ FS+   V   L    L 
Sbjct: 317 CLHPAGDTPSSCRLFDAIVSHCVPVIVSDRIELPFEDEIDYQEFSLFFSVKEALRPGYLM 376

Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
           + L+    E++L + + + +V  HF++   P   DA  M+
Sbjct: 377 QKLETFPKEKWLKMWNKLKQVAHHFEYQYPPIKDDAVNML 416


>gi|326525254|dbj|BAK07897.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 440

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 76/299 (25%), Positives = 127/299 (42%), Gaps = 54/299 (18%)

Query: 50  TKDPSKADLFFLPF--SIARMRHDRRIGTEGIPD-----FISHYIFNISQKYPYWNRTGG 102
            +DP+ A+ FF+PF  S++   H R +     PD      +   + +I  K  YW R+ G
Sbjct: 134 VRDPAAAEAFFVPFFSSLSFNVHGRNMTD---PDTEADRLLQVELMDILGKSEYWQRSAG 190

Query: 103 ADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIA--HKDVSLPQIW------ 154
            DH     H      M+      +NA  ++ S     +  +A   KDV  P +       
Sbjct: 191 RDHVIPMHHPNAFRFMRDM----VNASVLIVSDFGRYTKELASLRKDVVAPYVHVVDSFL 246

Query: 155 ------PRQEDPPKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRN-------DSEIY 197
                 P + DP          L FF G         +R KL ++ ++       DS   
Sbjct: 247 DDNASDPFEADP---------TLLFFRGRPVRKAEGKIRGKLAKILKDRDGVRFEDSLAI 297

Query: 198 AHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
               ++ T   D +  SKFCLH  G   ++ R+ D++   C+PVII++  +LPF D +++
Sbjct: 298 GDGIKIST---DGMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIISSRIELPFEDEIDY 354

Query: 258 KSFS---IVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
             FS    V   L+   L   L+ +  E+++ + S +  V  H+++   P   DA  M+
Sbjct: 355 SEFSPFFSVEEALEPDYLLNQLRQMPKEKWVEMWSKLKNVSSHYEFQYPPRKDDAVNMI 413


>gi|194707922|gb|ACF88045.1| unknown [Zea mays]
          Length = 240

 Score = 75.1 bits (183), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 55/187 (29%), Positives = 95/187 (50%), Gaps = 20/187 (10%)

Query: 166 SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRL----KTPYADELLGSKFC 217
           SKR+ L FF G +       +R KL++  ++  +I    G      K    D +  S FC
Sbjct: 40  SKRSILLFFRGRLKRNAGGKIRSKLVEELKSAKDIVIEEGSTGAQGKAAAQDGMRKSFFC 99

Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPL---LKK 274
           L   G   ++AR+ D++  GC+PVII++  +LPF  +L+++  ++ V+  D      L K
Sbjct: 100 LSPAGDTPSSARLFDAIVSGCIPVIISDELELPFEGILDYREIALFVSASDAVQPGWLLK 159

Query: 275 ILKGISSEEYLLLQSYVLKVRKHFQWH-----VFPSDYD----AFYMVMYELWLRRSSVR 325
            L+GI+++    +QS ++K  +HF +      + P D      A  +V  +L +RRS   
Sbjct: 160 YLRGINAKRIREIQSNLVKYSRHFLYSSPAQPLGPEDLTWRMIAGKLVNIKLQIRRSQRL 219

Query: 326 VQWSTSL 332
           V+ S S+
Sbjct: 220 VKESRSI 226


>gi|145355370|ref|XP_001421936.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144582175|gb|ABP00230.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 490

 Score = 75.1 bits (183), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 73/277 (26%), Positives = 125/277 (45%), Gaps = 31/277 (11%)

Query: 36  ESYFKKAFMKSHFVTKDPSKADL---FFLPFS------IARMRHDRRIGTEGIPDFISHY 86
           E +  + F    F +   S  D+   + LP +      + ++RH    G   +   + H 
Sbjct: 150 EDFLNRNFQNGPFSSCASSVDDVNDAYLLPLNPYFDRVMDKLRHFSEKGRSAMARGVKHA 209

Query: 87  IFNISQKYP-YWNRTG-GADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
           I   S + P  W R+G     F V  H     ++ +  +V L+   VV SS   IS    
Sbjct: 210 IDYASAEDPNAWVRSGESCSRFSVITHGSSTMSLLRHKDVILSTTFVVASSET-ISPSRP 268

Query: 145 H---------KDVS----LPQIWP-RQEDPPKLGSSKRNKLAFFAGAVNS-PVREK---- 185
           H         KDVS    L  + P          +++R  L  F GA+NS P R +    
Sbjct: 269 HTSQAPFHPDKDVSAINSLSFVIPLHAARRRAFANAERETLLMFRGAINSFPNRREIADF 328

Query: 186 LLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIAN 245
           LL+  R        S      Y  ++  S+FCL+++G  V++ R+ +S+ +GCVPVI+A+
Sbjct: 329 LLKNTRGQMYDLGPSCSTSKEYTAKMKNSRFCLYMRGTRVHSPRLIESMLFGCVPVILAD 388

Query: 246 HYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSE 282
            Y+LP + +++W +FS+++   D   +   L+  +S+
Sbjct: 389 DYELPLSWLVDWSAFSVMIPERDFQTIPDALERANSD 425


>gi|326522384|dbj|BAK07654.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 496

 Score = 74.7 bits (182), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 62/224 (27%), Positives = 104/224 (46%), Gaps = 24/224 (10%)

Query: 95  PYWNRTGGADHFYVACH-----SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDV 148
           P W R+ G DH     H     S+ R   +  W +      +  + +++  G +   KDV
Sbjct: 220 PAWQRSEGRDHVIPVHHPWSFKSVRRFVKKAIWLLP----DMDSTGNWYKPGQVYLEKDV 275

Query: 149 SLPQIWPRQEDPPKLGS---SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSG 201
            LP +        K  S   SKR+ L FF G +       +R KL+   +N  +I    G
Sbjct: 276 ILPYVPNVDLCDYKCVSETQSKRSTLLFFRGRLKRNAGGKIRSKLVTELQNIEDIIIEEG 335

Query: 202 ----RLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
               + K      +  S FCL+  G   ++AR+ D++  GC+PVII++  +LPF  +L++
Sbjct: 336 SAGAKGKVAALTGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIISDELELPFEGILDY 395

Query: 258 KSFSIVVATLDIPL---LKKILKGISSEEYLLLQSYVLKVRKHF 298
              ++ V++ D      L K L+G+  +    +QS +LK  +HF
Sbjct: 396 SKIALFVSSTDAVQPGWLVKYLRGVDGKRVREMQSNLLKYSRHF 439


>gi|356512371|ref|XP_003524893.1| PREDICTED: probable glycosyltransferase At3g07620-like [Glycine
           max]
          Length = 427

 Score = 74.7 bits (182), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 69/282 (24%), Positives = 121/282 (42%), Gaps = 24/282 (8%)

Query: 52  DPSKADLFFLPF----SIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFY 107
           DP  A  FF+PF    S     H  +     I   +   +  + +K  YW R+GG DH +
Sbjct: 120 DPELAQAFFVPFFSSLSFNTHGHTMKDPATQIDRQLQVDLMELLKKSNYWQRSGGRDHVF 179

Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDVSLPQIW---PRQEDPPKL 163
              H    +A +   +    +IQVV     +  G    +KDV  P +       +D P+ 
Sbjct: 180 PMTHP---NAFRFLRDQLNESIQVVVDFGRYPRGMSNLNKDVVSPYVHVVDSFTDDEPQD 236

Query: 164 GSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIY-----AHSGRLKTPYADELLGS 214
               R+ L FF G         VR KL ++     +++     A    +K   +  +  S
Sbjct: 237 PYESRSTLLFFRGRTYRKDEGIVRVKLAKILAGYDDVHYERSVATEENIKAS-SKGMRSS 295

Query: 215 KFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVA---TLDIPL 271
           KFCLH  G   ++ R+ D++   C+PVI+++  +LPF D +++  FS+  +    L    
Sbjct: 296 KFCLHPAGDTPSSCRLFDAIVSHCIPVIVSDQIELPFEDEIDYSQFSVFFSFKEALQPGY 355

Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
           +   L+    E++  +   +  +  H+++   P   DA  M+
Sbjct: 356 MIDQLRKFPKEKWTEMWRQLKSISHHYEFRYPPKREDAVDML 397


>gi|449444196|ref|XP_004139861.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
           [Cucumis sativus]
          Length = 478

 Score = 74.3 bits (181), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 72/287 (25%), Positives = 122/287 (42%), Gaps = 46/287 (16%)

Query: 54  SKADLFFLPFSIARMRHDRRIGTE--------GIPDFISHY-IFNISQKYPYWNRTGGAD 104
            +AD+ F+PF  A M  + ++G          G  D+     + +  +    W ++GG D
Sbjct: 146 EEADVIFVPF-FATMSAEMQLGMAKGAFRKKVGNEDYERQRNVMDFLKSTDAWKKSGGRD 204

Query: 105 HFYVACHSIGRSAMQKAWEVKLNAIQVVC--------------SSSYFISGHIAHKDVSL 150
           H +V    +        W VK      V               SS+      I H  VS+
Sbjct: 205 HVFVLTDPVA------MWHVKTEIAPAVLLVVDFGGWFRLDTKSSNGSSPDMIQHTQVSV 258

Query: 151 PQ--IWPRQEDPPKL---GSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSG 201
            +  I P     P+L    + KR  L +F GA        VREKL  +  N+ ++    G
Sbjct: 259 LKDVIVPYTHLLPRLHLSANKKRQTLLYFKGAKRRHRGGLVREKLWDLLVNEPDVIMEEG 318

Query: 202 ----RLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
                 K      +  S+FCLH  G    + R+ D++   C+PV+++++ +LPF D++++
Sbjct: 319 FPNATGKEQSIKGMRSSEFCLHPAGDTPTSCRLFDAIQSLCIPVVVSDNIELPFEDMVDY 378

Query: 258 KSFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
             FS+ VA  D      L K L+ I  E+    + Y+ +V+  F++ 
Sbjct: 379 SEFSVFVAVNDALKPNWLVKHLRTIPEEQRNGFRLYMARVQSVFEYE 425


>gi|384252594|gb|EIE26070.1| exostosin-domain-containing protein [Coccomyxa subellipsoidea
           C-169]
          Length = 898

 Score = 74.3 bits (181), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 81/357 (22%), Positives = 136/357 (38%), Gaps = 87/357 (24%)

Query: 32  NYASESYFKKAFMK---SHFVTKDPSKADLFFLPFSIARMRHD----------------- 71
            Y  ES F +  ++   S   T DP +AD FF+P   +   H                  
Sbjct: 488 TYGIESAFHERLLQALSSEHRTLDPEEADFFFMPVYTSCFLHPVWGYVDHPWYYGPTIDC 547

Query: 72  RRIGTEGIPDFISHYIFN-----------ISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           RR G   I    ++ +             +   +P+W R GG DH ++  H  G     K
Sbjct: 548 RRDGDMQICQTGANRVMQAMFMLLEAQKWVEVNHPWWRRKGGRDHIWLITHDEGSCWAPK 607

Query: 121 AWEVKLNAI------------------------QVVCSSSYFISGHIAH----------K 146
             E++L+ I                        Q V    ++  G+  H          K
Sbjct: 608 --EIRLSIILSHWGRKDVNHTSNSAFKPWDNYTQEVIHPEWWPEGYTHHIKGHACYDPIK 665

Query: 147 DVSLPQIWPRQEDP---PKLGSSK--RNKLAFFAGAVNS--------PVREKLLQVWRND 193
           D+ +P +    E     P +G  +  R+ L  F G V           +R++L  + +  
Sbjct: 666 DLIIPNLKHPAEFANFSPLVGHPQPPRDILFLFRGDVGKHRLPHYSRGIRQRLFALAQEH 725

Query: 194 SEIYAHS------GRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHY 247
                H+        +   Y++ L  SKFCL   G +  + R  D++ +GCVPV++ +  
Sbjct: 726 DWAGRHAILIGDRDDVAGDYSELLTRSKFCLVAPG-DGFSPRAEDAILHGCVPVVVMDEV 784

Query: 248 DLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFP 304
           D  F+ +L+W +FS+ +A  DI  L +IL  +       +Q  +  V + F+W   P
Sbjct: 785 DPVFSSILDWSAFSLRIAEADIEQLPQILLAVPEARLQAMQRSLRNVWQRFKWSSLP 841


>gi|125576050|gb|EAZ17272.1| hypothetical protein OsJ_32791 [Oryza sativa Japonica Group]
          Length = 506

 Score = 74.3 bits (181), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 43/170 (25%), Positives = 92/170 (54%), Gaps = 7/170 (4%)

Query: 165 SSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKFCLHV 220
           +SK   +  F  +    +R++L  + +++ ++Y   G ++   A +    +  SKFCL++
Sbjct: 322 TSKLTDVPTFELSKGGNIRQELHYMLKDEKDVYFAFGSVQDHGASKASQGMHASKFCLNI 381

Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLKKILK 277
            G   ++ R+ D++   CVPVII++  +LP+ D L++  FSI V + D      L ++++
Sbjct: 382 AGDTPSSNRLFDAIVSHCVPVIISDDIELPYEDALDYSKFSIFVRSSDAVKKGYLMRLIR 441

Query: 278 GISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
           G+S  ++ ++   + +V KHF++       DA  M+   L  +  ++R++
Sbjct: 442 GVSKHQWTMMWRRLKEVDKHFEYQYPSQKDDAVQMIWQTLARKVPAIRLK 491



 Score = 37.4 bits (85), Expect = 9.0,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 34/68 (50%), Gaps = 12/68 (17%)

Query: 52  DPSKADLFFLPF--SIARMRHDRRIGTE------GIPDFISHYIFNISQKYPYWNRTGGA 103
           D   AD+ F+PF  S++  RH R +  E      G+ + +  Y+       P W R+GGA
Sbjct: 140 DSRDADVVFVPFFASLSYNRHSRVVPPEKVSRDKGLQERLVRYLM----AQPEWKRSGGA 195

Query: 104 DHFYVACH 111
           DH  VA H
Sbjct: 196 DHVIVAHH 203


>gi|55297487|dbj|BAD68203.1| exostosin family protein-like [Oryza sativa Japonica Group]
 gi|55297674|dbj|BAD68245.1| exostosin family protein-like [Oryza sativa Japonica Group]
          Length = 512

 Score = 74.3 bits (181), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 71/269 (26%), Positives = 123/269 (45%), Gaps = 33/269 (12%)

Query: 95  PYWNRTGGADHFYVACH-----SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDV 148
           P W R+ G DH     H     S+ R   +  W +      +  + +++  G +   KDV
Sbjct: 223 PAWQRSEGRDHVIPVHHPWSFKSVRRFVKKAIWLLP----DMDSTGNWYKPGQVYLEKDV 278

Query: 149 SLPQIWPRQEDPPKLGS---SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSG 201
            LP +        K  S   S+R+ L FF G +       +R KL+   ++   I    G
Sbjct: 279 ILPYVPNVDLCDSKCVSETQSRRSTLLFFRGRLRRNAGGKIRSKLVTELKDAEGIIIEEG 338

Query: 202 RL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
                 K    + +  S FCL+  G   ++AR+ D++  GC+PVI+++  +LPF  +L++
Sbjct: 339 TAGADGKAAAQNGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDY 398

Query: 258 KSFSIVVATLDIPL---LKKILKGISSEEYLLLQSYVLKVRKHFQWH-----VFPSDYD- 308
           +  ++ V++ D      L K L+ I ++    +QS +LK  +HF +      + P D   
Sbjct: 399 RKIALFVSSNDAVQPGWLVKYLRSIDAKRIRQMQSNLLKYSRHFLYSSPARPLGPEDLTW 458

Query: 309 ---AFYMVMYELWLRRSSVRVQWSTSLDS 334
              A  +V  +L +RRS   V+ S S+ S
Sbjct: 459 RMIAGKLVNIKLHIRRSQRVVRESRSICS 487


>gi|302835860|ref|XP_002949491.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
 gi|300265318|gb|EFJ49510.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
          Length = 554

 Score = 74.3 bits (181), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 70/270 (25%), Positives = 109/270 (40%), Gaps = 28/270 (10%)

Query: 39  FKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWN 98
           F +  + S     D   AD +++P          R+ T     F+ + I  I + YP+WN
Sbjct: 147 FYQRLLGSGARVADGDLADWYYIPI---------RLRTATDSAFLKYAIEYIREAYPWWN 197

Query: 99  RTGGADHFYVACHSIGR-SAMQKAWEVKLN-------AIQVVCSSSYFISGHIAHKDVSL 150
           RTGGA HF +    +G    M   + +  N        + V  ++S +   H   K  + 
Sbjct: 198 RTGGARHFVIHTGDLGADEVMDDVYGMAANMTWLTHWGLTVDKNTSGWWKAHRPDKARAG 257

Query: 151 PQIWPRQEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQV-----W-RNDSEIYAHSGRLK 204
            +   R     ++  ++R     +     +P R  + Q      W R    I       +
Sbjct: 258 ARWGTRGGYYTRVSVNRRRGSHMWGPPSPAPHRAGVRQKVHFHHWNRTGFRIVT----FE 313

Query: 205 TPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV 264
             Y   L+ SKFCL   G      +I  S Y GC+PV IA+    PF    +W  F++  
Sbjct: 314 RNYGKALVSSKFCLAPLGGGHGQRQIIVS-YMGCIPVCIADGVYEPFEPQTDWTEFAVRP 372

Query: 265 ATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
           A  DIP L +IL+GIS+   L      L+ 
Sbjct: 373 AEADIPRLHEILEGISAGNKLAEMQVALRC 402


>gi|125528116|gb|EAY76230.1| hypothetical protein OsI_04166 [Oryza sativa Indica Group]
          Length = 513

 Score = 73.9 bits (180), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 71/269 (26%), Positives = 123/269 (45%), Gaps = 33/269 (12%)

Query: 95  PYWNRTGGADHFYVACH-----SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDV 148
           P W R+ G DH     H     S+ R   +  W +      +  + +++  G +   KDV
Sbjct: 224 PAWQRSEGRDHVIPVHHPWSFKSVRRFVKKAIWLLP----DMDSTGNWYKPGQVYLEKDV 279

Query: 149 SLPQIWPRQEDPPKLGS---SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSG 201
            LP +        K  S   S+R+ L FF G +       +R KL+   ++   I    G
Sbjct: 280 ILPYVPNVDLCDSKCVSETQSRRSTLLFFRGRLRRNAGGKIRSKLVTELKDAEGIIIEEG 339

Query: 202 RL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
                 K    + +  S FCL+  G   ++AR+ D++  GC+PVI+++  +LPF  +L++
Sbjct: 340 TAGADGKAAAQNGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDY 399

Query: 258 KSFSIVVATLDIPL---LKKILKGISSEEYLLLQSYVLKVRKHFQWH-----VFPSDYD- 308
           +  ++ V++ D      L K L+ I ++    +QS +LK  +HF +      + P D   
Sbjct: 400 RKIALFVSSNDAVQPGWLVKYLRSIDAKRIRQMQSNLLKYSRHFLYSSPARPLGPEDLTW 459

Query: 309 ---AFYMVMYELWLRRSSVRVQWSTSLDS 334
              A  +V  +L +RRS   V+ S S+ S
Sbjct: 460 RMIAGKLVNIKLHIRRSQRVVRESRSICS 488


>gi|296085534|emb|CBI29266.3| unnamed protein product [Vitis vinifera]
          Length = 390

 Score = 73.9 bits (180), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 77/309 (24%), Positives = 127/309 (41%), Gaps = 47/309 (15%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +  +        P  +   I  IS
Sbjct: 73  FAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYTTCDLTPNGLPLPFKSPRMMRSAIQLIS 132

Query: 92  QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
             +PYWNRT GADHF+V  H  G       +KA E  +  +  +     ++    H+   
Sbjct: 133 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLN 192

Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
           +  +++P   P Q+    L    + R+   +F G      N P      R     VW N 
Sbjct: 193 EGSITIPPYAPPQKMQAHLIPQETPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENF 252

Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
            D+ ++  S    T Y +++  + FCL           +AD +              LPF
Sbjct: 253 KDNPLFDISTEHPTTYYEDMQRAIFCL---------CPLADDIV-------------LPF 290

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
           AD + W+   + VA  D+P L  IL  I   E +L +  +L      Q  +FP      D
Sbjct: 291 ADAIPWEEIGVFVAEEDVPNLDTILTSIPP-EVILRKQRLLANPSMKQAMLFPQPAQSGD 349

Query: 309 AFYMVMYEL 317
           AF+ ++  L
Sbjct: 350 AFHQILNGL 358


>gi|374922021|gb|AFA26188.1| hypothetical protein, partial [Lolium perenne]
          Length = 282

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 69/264 (26%), Positives = 110/264 (41%), Gaps = 60/264 (22%)

Query: 90  ISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQVVCSSSYFIS 140
           I+Q+YPYWNRT G DH +       AC++   I  S M   W    N      +++Y+  
Sbjct: 8   IAQRYPYWNRTSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWG-NTNTKHENSTTAYWAD 66

Query: 141 G--------------HIAHKDVSLPQ--------IWPRQEDPPKLGSSKRNKLAFFAGAV 178
                              KD+ LP         IW +    P+   S R  L +F G +
Sbjct: 67  NWDNIPLDRRGDHPCFDPTKDLVLPAWKDPDPAAIWLKLWARPR---SNRRTLFYFNGNL 123

Query: 179 NSP-------------VREKLLQVWRNDSEIYAHSGRL-----------KTPYADELLGS 214
            S              +R+KL   + +        GR               Y +EL  S
Sbjct: 124 GSAYEQGRPEDTYSMGIRQKLAAEFGSTPNKQGKLGRQHVANVTVTHLRSEKYYEELASS 183

Query: 215 KFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKK 274
            FC  + G +  + R+ DS+  GC+PVII +   LP+ +VLN+ SF++ +   DIP L  
Sbjct: 184 IFCGVLPG-DGWSGRMEDSMLQGCIPVIIQDGIFLPYENVLNYNSFAVRIQEDDIPNLIT 242

Query: 275 ILKGISSEEYLLLQSYVLKVRKHF 298
           +L+G++  +   +   V ++ + F
Sbjct: 243 VLRGMNETQIEFMLGNVRQIWQRF 266


>gi|224284379|gb|ACN39924.1| unknown [Picea sitchensis]
          Length = 787

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 87/332 (26%), Positives = 141/332 (42%), Gaps = 70/332 (21%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIA------------RMRHDRRIGTEGIP 80
           Y S+    ++ + S + T +  +AD F++P   A             M++   + +    
Sbjct: 381 YGSQMALYESLLASPYRTLNGEEADYFYVPVLDACLITRADDAPHLSMKNHMGLRSYLTL 440

Query: 81  DFISHYIFNISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQV 131
           DF      +I + Y YWNR+ G DH +       AC++   I  S M   W    N+   
Sbjct: 441 DFYKKAYDHIMEHYTYWNRSSGHDHIWFFAWDEGACYAPKEIWNSMMLVHWG-NTNSKHN 499

Query: 132 VCSSSYFISG--HIA------------HKDVSLPQIWPRQEDPPKLGS-----SKRNK-- 170
             +++Y      HI              KD+ LP  W R  DP  + +     S+R +  
Sbjct: 500 HSTTAYLADNWDHIPIERRGRHPCFDPEKDLVLP-AWKR-PDPYNVKARFWARSRRERFT 557

Query: 171 LAFFAGAV------NSP-------VREKLLQVWRNDSEIYAHSGRLKTP----------- 206
           L +F G +      N P       +R+KL   + ++       GR  T            
Sbjct: 558 LFYFNGNLGASFKNNRPEPTYSLGIRQKLAAEFASEPNKEGKFGRQSTKDVIVVSQKSPN 617

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y  EL  S FC    G +  + R+ DS+  GC+PVII +   + + +VLN+ SF++ +A 
Sbjct: 618 YYSELGSSLFCGVFPG-DGWSGRMEDSVLQGCIPVIIQDGIQVAYENVLNYDSFAVRIAE 676

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
            DIP L +IL+GI+  E     + V K+R+ F
Sbjct: 677 DDIPHLVQILRGINETELEFKLANVQKLRQRF 708


>gi|222640628|gb|EEE68760.1| hypothetical protein OsJ_27458 [Oryza sativa Japonica Group]
          Length = 518

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 46/171 (26%), Positives = 87/171 (50%), Gaps = 12/171 (7%)

Query: 155 PRQEDPP-----KLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKT---- 205
           PR+  PP     +  +++R  +          VR++L Q+ +++ +++   G ++     
Sbjct: 320 PRRAAPPEQHDGRPAAAQRRHVRPLRLRQGGRVRQRLYQLIKDEKDVHFTYGSVRQNGIR 379

Query: 206 PYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVA 265
                +  SKFCL++ G   ++ R+ D++   CVPVII++  +LPF DVL++ +F + V 
Sbjct: 380 RATKGMASSKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPFEDVLDYSAFCVFVR 439

Query: 266 TLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
             D      L  +L+GIS EE+  +   + +V  HF++       DA  M+
Sbjct: 440 ASDAVKRGFLLHLLRGISQEEWTAMWRRLKEVAHHFEYQYPSQPGDAVQMI 490


>gi|125572389|gb|EAZ13904.1| hypothetical protein OsJ_03829 [Oryza sativa Japonica Group]
          Length = 504

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 71/269 (26%), Positives = 123/269 (45%), Gaps = 33/269 (12%)

Query: 95  PYWNRTGGADHFYVACH-----SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDV 148
           P W R+ G DH     H     S+ R   +  W +      +  + +++  G +   KDV
Sbjct: 215 PAWQRSEGRDHVIPVHHPWSFKSVRRFVKKAIWLLP----DMDSTGNWYKPGQVYLEKDV 270

Query: 149 SLPQIWPRQEDPPKLGS---SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSG 201
            LP +        K  S   S+R+ L FF G +       +R KL+   ++   I    G
Sbjct: 271 ILPYVPNVDLCDSKCVSETQSRRSTLLFFRGRLRRNAGGKIRSKLVTELKDAEGIIIEEG 330

Query: 202 RL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
                 K    + +  S FCL+  G   ++AR+ D++  GC+PVI+++  +LPF  +L++
Sbjct: 331 TAGADGKAAAQNGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDY 390

Query: 258 KSFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWH-----VFPSDYD- 308
           +  ++ V++ D      L K L+ I ++    +QS +LK  +HF +      + P D   
Sbjct: 391 RKIALFVSSNDAVQPGWLVKYLRSIDAKRIRQMQSNLLKYSRHFLYSSPARPLGPEDLTW 450

Query: 309 ---AFYMVMYELWLRRSSVRVQWSTSLDS 334
              A  +V  +L +RRS   V+ S S+ S
Sbjct: 451 RMIAGKLVNIKLHIRRSQRVVRESRSICS 479


>gi|224085069|ref|XP_002307477.1| predicted protein [Populus trichocarpa]
 gi|222856926|gb|EEE94473.1| predicted protein [Populus trichocarpa]
          Length = 460

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 79/323 (24%), Positives = 138/323 (42%), Gaps = 40/323 (12%)

Query: 4   SFRVYVYPHRRNDPFANVL------LPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKAD 57
           S ++YVY     D    +L      +  D   +G + ++    +  ++S F T+   +A+
Sbjct: 86  SLKIYVYEEDEIDGLKELLRGREGKISADACVKGQWGTQVKIHRLLLQSRFRTRKKGEAN 145

Query: 58  LFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSA 117
           LFF+P   A  +  R +G     +    Y+  +SQ  PY+ R+GG DH +V     G + 
Sbjct: 146 LFFVP---AYAKCVRMMGGLNDKEINHTYVKALSQ-MPYFRRSGGRDHIFVFPSGAG-AH 200

Query: 118 MQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPR-------QEDPPKLGS----- 165
           + ++W   +N   ++             KD S    W         ++   K G+     
Sbjct: 201 LFRSWATYINRSIILSPEG----DRTDKKDTSSFNTWKDIIIPGNVEDGMTKRGAAMAQP 256

Query: 166 ---SKRNKLAFFAGAVNSPV-REKLLQVWRN-----DSEIYAHSGRLK---TPYADELLG 213
              SKR  LA + G     V R KL+++ +      +      SG  K     Y   L  
Sbjct: 257 LPLSKRKYLANYLGRAQGKVGRLKLIELAKQYPDKLECPELKFSGPEKFGRMEYFQHLRN 316

Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPL-L 272
           +KFC+  +G    T R  +S +  CVPVI+++  + PF +V+++   SI   +  I L L
Sbjct: 317 AKFCVAPRGESSWTLRFYESFFVECVPVILSDQAEFPFQNVIDYTQISIKWPSTRIGLEL 376

Query: 273 KKILKGISSEEYLLLQSYVLKVR 295
            + L+ I  E+   + +   +VR
Sbjct: 377 LEYLESIPDEDVERMIAAGRQVR 399


>gi|255070683|ref|XP_002507423.1| glycosyltransferase family 47 protein [Micromonas sp. RCC299]
 gi|226522698|gb|ACO68681.1| glycosyltransferase family 47 protein [Micromonas sp. RCC299]
          Length = 698

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 43/151 (28%), Positives = 77/151 (50%), Gaps = 20/151 (13%)

Query: 174 FAGAVNSPVREKLL---------QVW--RNDSEIYAHSGRLKTPYADELLGSKFCLHVKG 222
           F G +   VRE++L         + W  R+D ++          Y   +  SKFCLHV+G
Sbjct: 492 FRGTLRGGVRERILGHYLSVGKSRNWDLRSDGQVSPSR------YMRLMRDSKFCLHVRG 545

Query: 223 FEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSE 282
             V + R+ + + +GCVPVI+A+ Y  P + + +W  FS+ +  ++   L ++L+G+   
Sbjct: 546 TRVQSPRLIEGMLFGCVPVIVADGYVPPLSWLFDWSKFSVRLPEVEHERLPEVLQGV--- 602

Query: 283 EYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
           ++  LQ+ + +V   F +H  P   DA +  
Sbjct: 603 DWATLQANLRRVAPFFVYHRTPIPGDALWTT 633


>gi|449681334|ref|XP_002163001.2| PREDICTED: exostosin-1a-like [Hydra magnipapillata]
          Length = 469

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 44/141 (31%), Positives = 75/141 (53%), Gaps = 5/141 (3%)

Query: 163 LGSSKRNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFC 217
           +GS  RN L           + + + EK  Q + +      +S   +  Y + L  S FC
Sbjct: 135 IGSETRNSLYLIHNNEDIILLTTCIHEKNWQKFADSRCEEDNSNYDRFNYTELLANSTFC 194

Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILK 277
           L  +G  + + R  +++ YGC+PVI++N +DLPF DV++W  FSIV+    +  L  IL+
Sbjct: 195 LIPRGRRLASFRFLEAIQYGCIPVIMSNGWDLPFNDVIDWVKFSIVLDESLLLQLPSILR 254

Query: 278 GISSEEYLLLQSYVLKVRKHF 298
           GIS ++ L ++   + V K++
Sbjct: 255 GISFDQVLAMKQQTIFVWKNY 275


>gi|302837063|ref|XP_002950091.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
 gi|300264564|gb|EFJ48759.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
          Length = 593

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 82/377 (21%), Positives = 148/377 (39%), Gaps = 97/377 (25%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFF-------LPFSI-------------ARMRH-- 70
           YA+++   ++ + S   T DP +AD F+       LPF I             AR  H  
Sbjct: 214 YAADTLLHESLLISEHRTFDPEEADFFYVPHYASCLPFPIGVWADAPWFPEPGARAHHHL 273

Query: 71  ------------------DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHS 112
                             +R    + + + +   +  I+  +PYW R GG DH ++  H 
Sbjct: 274 CEVLPATRVITVTCRLPVNRGPRVKQMVNMVRDTLDWINSTFPYWRRRGGRDHIWLFTHD 333

Query: 113 IGRSAMQKA-----WEVKLNAIQVVCSSSY----------------------FISGHIA- 144
            G      A     W       ++  +S+                       +I GH   
Sbjct: 334 EGACWAPTAINASIWLTHWGRTELNHTSNTAFLADKYDSDFAGPLQPEGFVKYIKGHPCF 393

Query: 145 --HKDVSLP--------QIWPRQEDPPKLGSSKRNKLAFFAGAVNS--------PVREKL 186
              KD+ +P           P Q +P +    +R+ L FF G V           VR+++
Sbjct: 394 NPEKDLVIPAFKAPSHYHASPLQGNPAR----ERDLLFFFRGDVGKNRLPNYSRGVRQQI 449

Query: 187 LQV-----WRNDSEIYAHSGR-LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
            ++     W      Y   G  ++  Y++ L  + FCL   G +  +AR+ D++ +GC+P
Sbjct: 450 YKMAKEGGWAEKYRFYIGDGSDVEGDYSEMLSRAIFCLVAPG-DGWSARMEDAVLHGCIP 508

Query: 241 VIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
           V+IA+  +  F +VL   +F++ +    +P L  +L+ +        Q+++ +V + ++W
Sbjct: 509 VVIADGVEAVFENVLELDAFALRLPQEAVPRLLDVLRAVPQRAIRSKQAHLGRVWQRYRW 568

Query: 301 HVFPSDYDAFYMVMYEL 317
              P   DAF  +M  L
Sbjct: 569 ASLPKLDDAFATIMQWL 585


>gi|297597828|ref|NP_001044591.2| Os01g0811400 [Oryza sativa Japonica Group]
 gi|255673802|dbj|BAF06505.2| Os01g0811400 [Oryza sativa Japonica Group]
          Length = 497

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 70/266 (26%), Positives = 121/266 (45%), Gaps = 33/266 (12%)

Query: 95  PYWNRTGGADHFYVACH-----SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDV 148
           P W R+ G DH     H     S+ R   +  W +      +  + +++  G +   KDV
Sbjct: 223 PAWQRSEGRDHVIPVHHPWSFKSVRRFVKKAIWLLP----DMDSTGNWYKPGQVYLEKDV 278

Query: 149 SLPQIWPRQEDPPKLGS---SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSG 201
            LP +        K  S   S+R+ L FF G +       +R KL+   ++   I    G
Sbjct: 279 ILPYVPNVDLCDSKCVSETQSRRSTLLFFRGRLRRNAGGKIRSKLVTELKDAEGIIIEEG 338

Query: 202 RL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
                 K    + +  S FCL+  G   ++AR+ D++  GC+PVI+++  +LPF  +L++
Sbjct: 339 TAGADGKAAAQNGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDY 398

Query: 258 KSFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWH-----VFPSDYD- 308
           +  ++ V++ D      L K L+ I ++    +QS +LK  +HF +      + P D   
Sbjct: 399 RKIALFVSSNDAVQPGWLVKYLRSIDAKRIRQMQSNLLKYSRHFLYSSPARPLGPEDLTW 458

Query: 309 ---AFYMVMYELWLRRSSVRVQWSTS 331
              A  +V  +L +RRS   V+ S S
Sbjct: 459 RMIAGKLVNIKLHIRRSQRVVRESRS 484


>gi|449487654|ref|XP_004157734.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
           [Cucumis sativus]
          Length = 464

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 65/276 (23%), Positives = 118/276 (42%), Gaps = 34/276 (12%)

Query: 52  DPSKADLFFLPF-------------SIARMRHDRRI--GTEGIPDFISHYIFNISQKYPY 96
           DP +ADLF++PF             +    +  R++    E   D    ++    +K  Y
Sbjct: 141 DPEEADLFYVPFFSSLSLIVNPIRPATGSDQQQRKLVYSDEETQDAFMEWL----EKQEY 196

Query: 97  WNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-KDVSLPQIWP 155
           W R+ G DH  +A      +A+ +  +   N+I +V       +   +  KDV +P    
Sbjct: 197 WKRSNGRDHVIIAQDP---NALYRLIDRVKNSILLVSDFGRLRADQASLVKDVIVPYSHR 253

Query: 156 RQEDPPKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIYAHSG----RLKTPY 207
                  +G   R  L FF G         +R+ L  +   + ++    G      +   
Sbjct: 254 INTYTGDIGVENRKTLLFFMGNRYRKEGGKIRDMLFNILELEQDVIIKHGTQSRESRRAA 313

Query: 208 ADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIV---V 264
              +  SKFCL+  G   +  R+ DS+   CVPVI+++  +LPF DV+++   ++    V
Sbjct: 314 THGMHTSKFCLNPAGDTPSACRLFDSVVSLCVPVIVSDSIELPFEDVIDYSKIAVFFDSV 373

Query: 265 ATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
           + +    L   L+ IS E  L  Q  + K++++F++
Sbjct: 374 SAVKPEFLISKLRRISEERILDYQREMKKIKRYFEY 409


>gi|449455387|ref|XP_004145434.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
           [Cucumis sativus]
          Length = 464

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 65/276 (23%), Positives = 117/276 (42%), Gaps = 34/276 (12%)

Query: 52  DPSKADLFFLPF-------------SIARMRHDRRI--GTEGIPDFISHYIFNISQKYPY 96
           DP  ADLF++PF             +    +  R++    E   D    ++    +K  Y
Sbjct: 141 DPEVADLFYVPFFSSLSLIVNPIRPATGSDQQQRKLVYSDEETQDAFMEWL----EKQEY 196

Query: 97  WNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-KDVSLPQIWP 155
           W R+ G DH  +A      +A+ +  +   N+I +V       +   +  KDV +P    
Sbjct: 197 WKRSNGRDHVIIAQDP---NALYRLIDRVKNSILLVSDFGRLRADQASLVKDVIVPYSHR 253

Query: 156 RQEDPPKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIYAHSG----RLKTPY 207
                  +G   R  L FF G         +R+ L  +   + ++    G      +   
Sbjct: 254 INTYTGDIGVENRKTLLFFMGNRYRKEGGKIRDMLFNILEQEQDVIIKHGTQSRESRRAA 313

Query: 208 ADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIV---V 264
              +  SKFCL+  G   +  R+ DS+   CVPVI+++  +LPF DV+++   ++    V
Sbjct: 314 THGMHTSKFCLNPAGDTPSACRLFDSVVSLCVPVIVSDSIELPFEDVIDYSKIAVFFDSV 373

Query: 265 ATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
           + +    L   L+ IS E  L  Q  + K++++F++
Sbjct: 374 SAVKPEFLISKLRRISEERILDYQREMKKIKRYFEY 409


>gi|308799499|ref|XP_003074530.1| Acetylglucosaminyltransferase EXT1/exostosin 1 (ISS) [Ostreococcus
           tauri]
 gi|116000701|emb|CAL50381.1| Acetylglucosaminyltransferase EXT1/exostosin 1 (ISS), partial
           [Ostreococcus tauri]
          Length = 439

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 37/92 (40%), Positives = 56/92 (60%), Gaps = 3/92 (3%)

Query: 211 LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIP 270
           L  S++CL+V G   +TAR+ D + +GCVPVI+A+ YDLPF+ + +W  FS+ V   D+ 
Sbjct: 345 LSKSRYCLYVYGDRAHTARLYDIITFGCVPVIVADGYDLPFSWLFDWSKFSVRVPEDDVA 404

Query: 271 LLKKILKGISSEEYLLLQSYVLKVRKHFQWHV 302
            L  IL      +Y  L+  ++KV   FQ+H 
Sbjct: 405 KLPGIL---DQADYDSLRGELVKVHSFFQYHA 433


>gi|226508962|ref|NP_001152546.1| exostosin-like [Zea mays]
 gi|195657373|gb|ACG48154.1| exostosin-like [Zea mays]
 gi|414878975|tpg|DAA56106.1| TPA: exostosin-like protein [Zea mays]
          Length = 427

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 73/288 (25%), Positives = 124/288 (43%), Gaps = 32/288 (11%)

Query: 50  TKDPSKADLFFLPF--SIARMRHDRRIGTEGIPD-----FISHYIFNISQKYPYWNRTGG 102
            +DP  AD FF+PF  S++   H R +     PD      +   + +I  K  YW R+ G
Sbjct: 120 VRDPDAADAFFVPFFSSLSFNVHGRNMTD---PDTEADRLLQVELVDILWKSKYWQRSAG 176

Query: 103 ADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIA--HKDVSLPQIWP----R 156
            DH     H      ++      +NA  ++ S     +  +A   KDV  P +       
Sbjct: 177 RDHVIPMHHPNAFRFLR----AMVNASILIVSDFGRYTKELASLRKDVVAPYVHVVGSFL 232

Query: 157 QEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDS----EIYAHSGRLKTPYA 208
            +DPP    + R+ L FF G         +R KL ++ +       E    +G       
Sbjct: 233 DDDPPDPFEA-RHTLLFFRGRTVRKDEGKIRSKLEKILKGKEGVRFEDSIATGDGINIST 291

Query: 209 DELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI---VVA 265
           + +  SKFCLH  G   ++ R+ D++   CVPVI+++  +LPF D +++  FS+   V  
Sbjct: 292 EGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSSRIELPFEDEIDYSEFSLFFSVEE 351

Query: 266 TLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
            L    L   L+ +   +++ +   +  V  H+++   P   DA  M+
Sbjct: 352 ALRPDYLLNELRQVPKRKWVDMWLKLKNVSHHYEFQYPPRKGDAVNMI 399


>gi|242059273|ref|XP_002458782.1| hypothetical protein SORBIDRAFT_03g040220 [Sorghum bicolor]
 gi|241930757|gb|EES03902.1| hypothetical protein SORBIDRAFT_03g040220 [Sorghum bicolor]
          Length = 426

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 77/279 (27%), Positives = 130/279 (46%), Gaps = 36/279 (12%)

Query: 30  RGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFN 89
           +G + ++    +  +KS F T +  +A+LFF+P   + ++  R  G     +    Y+  
Sbjct: 84  KGQWGTQVKIHQFLLKSRFRTFNKDQANLFFVP---SYVKCVRMTGALSDKEINQTYVKV 140

Query: 90  ISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVC---------SSSYF-- 138
           +SQ  PY+ R+GG DH +V     G + + ++W   LN   ++           +S F  
Sbjct: 141 LSQ-MPYFRRSGGRDHIFVFPSGAG-AHLFRSWATFLNRSIILTPEGDRTDKRGTSAFNT 198

Query: 139 -----ISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAG-AVNSPVREKLLQVWRN 192
                I G++   D S+ +   R   P  L  +KR  LA F G A     R +L+++ + 
Sbjct: 199 WKDIIIPGNV---DDSMVKSDARAVQPIPL--TKRKYLANFLGRAQGKAGRLQLVELAKQ 253

Query: 193 -----DSEIYAHSGRLK---TPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIA 244
                +S +   SG  K     Y   L  +KFCL  +G    T R  +S +  CVPVI++
Sbjct: 254 YPDKLESPVLKLSGPNKLGRIEYFKHLRNAKFCLAPRGESSWTLRFYESFFVECVPVILS 313

Query: 245 NHYDLPFADVLNWKSFSIVVATLDI-PLLKKILKGISSE 282
           +  +LPF +V+++   SI   +  I P L + L+ IS E
Sbjct: 314 DEVELPFQNVIDYGEISIKWPSSRIGPELLEYLESISDE 352


>gi|224144904|ref|XP_002325457.1| predicted protein [Populus trichocarpa]
 gi|222862332|gb|EEE99838.1| predicted protein [Populus trichocarpa]
          Length = 463

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 75/302 (24%), Positives = 143/302 (47%), Gaps = 34/302 (11%)

Query: 50  TKDPSKADLFFLPF--SIARMR------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTG 101
            ++ S+AD+ F+PF  S+   R      H ++   + + + +  ++   SQK   W R+G
Sbjct: 144 VRNSSEADVIFVPFFSSLCYNRYSKVNPHQKKSKDKLLQEKLVKFL--TSQKE--WKRSG 199

Query: 102 GADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLPQ---IWPR 156
           G DH  +A H    ++M  A  VKL     + +       +IA+  KDV  P    I   
Sbjct: 200 GRDHVLLAHHP---NSMLDA-RVKLWPAIFILADFGRYPPNIANVAKDVIAPYKHVIRSY 255

Query: 157 QEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKT----PYA 208
             D     S  R  L +F GA+        R++L  + +++ E++   G ++       +
Sbjct: 256 VNDSSNFDS--RPTLLYFQGAIYRKDGGFARQELFYLLKDEKEVHFQFGSVQKDGVGKAS 313

Query: 209 DELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
             +  SKFCL++ G   ++ R+ D++   CVPVII++  +LP+ +VL++  F I V T D
Sbjct: 314 QGMHSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIELPYENVLDYSQFCIFVRTSD 373

Query: 269 I---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
                 L  +++ I  +E+  +   + +V   F++     + DA  M+   +  +  ++R
Sbjct: 374 AVREKFLVNLIRSIKKDEWTRMWKRLKEVENFFEFQYPSREGDAVQMIWQAVARKVPAIR 433

Query: 326 VQ 327
           ++
Sbjct: 434 LK 435


>gi|3668093|gb|AAC61825.1| unknown protein [Arabidopsis thaliana]
          Length = 460

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 67/254 (26%), Positives = 111/254 (43%), Gaps = 20/254 (7%)

Query: 52  DPSKADLFFLP-FS----IARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHF 106
           DP+ ADLF++P FS    I         G+    + +   +    +   +W R  G DH 
Sbjct: 130 DPADADLFYVPVFSSLSLIVNAGRPVEAGSGYSDEKMQEGLVEWLEGQEWWRRNAGRDHV 189

Query: 107 YVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-KDVSLPQIWPRQEDPPKLGS 165
             A      +A+ +  +   NA+ +V           +  KDV +P          ++G 
Sbjct: 190 IPAGDP---NALYRILDRVKNAVLLVSDFGRLRPDQGSFVKDVVIPYSHRVNLFNGEIGV 246

Query: 166 SKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIYAHSGRL----KTPYADELLGSKFC 217
             RN L FF G         VR+ L QV   + ++    G      +      +  SKFC
Sbjct: 247 EDRNTLLFFMGNRYRKDGGKVRDLLFQVLEKEDDVTIKHGTQSRENRRAATKGMHTSKFC 306

Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV---ATLDIPLLKK 274
           L+  G   +  R+ DS+   CVP+I+++  +LPF DV++++ FSI V   A L    L +
Sbjct: 307 LNPAGDTPSACRLFDSIVSLCVPLIVSDSIELPFEDVIDYRKFSIFVEANAALQPGFLVQ 366

Query: 275 ILKGISSEEYLLLQ 288
           +L+ I +++ L  Q
Sbjct: 367 MLRKIKTKKILEYQ 380


>gi|289166878|gb|ADC84489.1| glycosyltransferase family 47C [Salix sachalinensis]
          Length = 252

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 63/233 (27%), Positives = 99/233 (42%), Gaps = 29/233 (12%)

Query: 82  FISHYIFNISQKYPYWNRTGGADHFYVACHSIG--------RSAMQKAWEVKLNAIQVVC 133
            +S  +  IS  +P+WNR+ G+DH +VA H  G        R+A     E    +I +  
Sbjct: 5   LLSSAVNLISSNHPFWNRSRGSDHVFVASHDYGACFHAMEERAAEDGIPEFLKRSIILQT 64

Query: 134 SSSYFISGHIAHKDVSLPQ-IWPRQ-----EDPPKLGSSKRNKLAFFAGAV--------- 178
               F       ++V +P  I P       E  P  G  +R+   FF G +         
Sbjct: 65  FGVKFDHPCQDVENVVIPPFITPESVQTTLEKYPLTG--RRDIWVFFRGKMEVHPKNISG 122

Query: 179 ---NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLY 235
              +  VR  + + +  D   Y    R    Y  E+  S FCL   G+   + R+ +S+ 
Sbjct: 123 RYYSKKVRTVIWRKYSGDPRFYLRRHRFAG-YQSEIARSVFCLCPLGWAPWSPRLVESIA 181

Query: 236 YGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
            GCVPVIIA+   LPF   + W   S+ VA  D+  L+ +L  +++     +Q
Sbjct: 182 LGCVPVIIADGIRLPFPAAVRWSDISLTVAEKDVADLRTLLDHVAASNLSAIQ 234


>gi|242059031|ref|XP_002458661.1| hypothetical protein SORBIDRAFT_03g037670 [Sorghum bicolor]
 gi|241930636|gb|EES03781.1| hypothetical protein SORBIDRAFT_03g037670 [Sorghum bicolor]
          Length = 499

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 72/268 (26%), Positives = 127/268 (47%), Gaps = 35/268 (13%)

Query: 95  PYWNRTGGADHFYVACH-----SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDV 148
           P W R+ G DH     H     S+ R   +  W +      +  + +++  G +   KDV
Sbjct: 223 PAWQRSEGRDHVIPVHHPWSFKSVRRFVKKAIWLLP----DMDSTGNWYKPGQVYLEKDV 278

Query: 149 SLPQIWPRQE--DPPKLGSS--KRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHS 200
            LP + P  +  D   +  +  KR+ L FF G +       +R KL++  ++  +I    
Sbjct: 279 ILPYV-PNVDLCDHKCVLETQFKRSILLFFRGRLKRNAGGKIRSKLVEELKSAEDIVIEE 337

Query: 201 G----RLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLN 256
           G    + K    D +  S FCL   G   ++AR+ D++  GC+PVII++  +LPF  +L+
Sbjct: 338 GSAGAQGKAAAQDGMRKSLFCLSPAGDTPSSARLFDAIVSGCIPVIISDELELPFEGILD 397

Query: 257 WKSFSIVVATLDIPL---LKKILKGISSEEYLLLQSYVLKVRKHFQWH-----VFPSDYD 308
           ++  ++ V++ D      L K L+GI ++    +QS ++K  +HF +      + P D  
Sbjct: 398 YREIALFVSSSDAVQPGWLVKYLRGIDAKRIREIQSNLVKYSRHFLYSSPAQPLGPEDLT 457

Query: 309 ----AFYMVMYELWLRRSSVRVQWSTSL 332
               A  +V  +L +RRS   V+ S S+
Sbjct: 458 WRMIAGKVVNIKLQIRRSQRLVRESRSV 485


>gi|269785149|ref|NP_001161530.1| exostosin 1-like protein [Saccoglossus kowalevskii]
 gi|268054045|gb|ACY92509.1| exostosin 1-like protein [Saccoglossus kowalevskii]
          Length = 737

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 91/369 (24%), Positives = 142/369 (38%), Gaps = 100/369 (27%)

Query: 5   FRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFLP 62
           F+VYVYP +                 GN  SE+Y K   A  +S F T DP KA LF +P
Sbjct: 93  FKVYVYPTQE----------------GNKVSEAYDKILSAIRESRFYTSDPKKACLF-IP 135

Query: 63  FSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHS---------- 112
            SI  +  D        PD++ +    I Q  P WN   G +H     +S          
Sbjct: 136 -SIDTLDRDHLS-----PDYVKNAQSKI-QSLPLWN--NGQNHLIFVLYSGTWPEYSDLD 186

Query: 113 ----IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK- 167
               +G++ + KA    +N          F  G     D+S+P      +D  + G S+ 
Sbjct: 187 LGFELGQAMLAKASTTSIN----------FRPGF----DISIPLF---SKDHAQKGGSRG 229

Query: 168 ----------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHS---GRLKTPYAD 209
                     R  L  F G      + S  R  L  +      I   +   G+     AD
Sbjct: 230 DLQTNNFPVARKYLLVFKGKRYLSGIGSETRNALYHIHNGQDIILLTTCKHGKSWEKNAD 289

Query: 210 E-----------------LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFA 252
                             L  S FCL  +G  + + R  +SL   C+PV+++N ++LPF+
Sbjct: 290 SRCEQDNAEFDRYDFHILLHNSTFCLVPRGRRLGSFRFLESLQAACIPVLLSNGWELPFS 349

Query: 253 DVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYM 312
           +V++W   SI+     +  +  I++ +S++E L      L+ +  F W  + S  D   M
Sbjct: 350 EVIDWNRASIIGDERLLLQIPSIVRTVSNDEIL-----SLRQQTQFLWETYFSSVDKIVM 404

Query: 313 VMYELWLRR 321
              E+   R
Sbjct: 405 TTLEIIQER 413


>gi|255585230|ref|XP_002533317.1| catalytic, putative [Ricinus communis]
 gi|223526861|gb|EEF29074.1| catalytic, putative [Ricinus communis]
          Length = 478

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 73/272 (26%), Positives = 114/272 (41%), Gaps = 49/272 (18%)

Query: 32  NYASESYFKKAFMKSHFVTKDPSKADLFFLPFSI---ARMRHDRRIGT----EGIPDFIS 84
           N  S+S+ K+ F        D ++AD+ F+PF     A M   R  GT    EG  D+  
Sbjct: 132 NLRSQSFAKRVF--------DFNQADVVFVPFFATLSAEMELARGEGTFRKKEGNEDYKR 183

Query: 85  H-YIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVC---------- 133
              +    +    W R+GG DH +V    +        W V+      V           
Sbjct: 184 QKEVIEFVKSSDAWKRSGGKDHVFVLTDPVA------MWHVRAEIAPAVLLVVDFGGWYR 237

Query: 134 ----SSSYFISGHIAHKDVSLPQ--IWPRQEDPPKLGSS---KRNKLAFFAGAV----NS 180
               SS    S  I H  VSL +  I P     P+L  S   KR  L +F GA       
Sbjct: 238 LDSKSSDGNSSNIIRHTQVSLLKDVIVPYTHLLPQLPLSENKKRQTLLYFKGAKYRHRGG 297

Query: 181 PVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYY 236
            VREKL  +  N+  +    G       ++    +  S+FCLH  G    + R+ D++  
Sbjct: 298 MVREKLWDLLVNEPGVIMEEGFPNATGREQSIKGMRTSEFCLHPAGDTPTSCRLFDAIQS 357

Query: 237 GCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
            C+P+I++++ +LPF  ++++  FS+ +A  D
Sbjct: 358 LCIPIIVSDNIELPFEGIVDYLEFSVFMAVDD 389


>gi|168033234|ref|XP_001769121.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162679650|gb|EDQ66095.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 765

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 83/334 (24%), Positives = 138/334 (41%), Gaps = 67/334 (20%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMR-----------HDRRIGTEG--I 79
           Y +     ++ + S   T +  +AD F++PF  A +              + +G      
Sbjct: 367 YGAGIALYESLLASEHRTTNGDEADFFYVPFLQACIVEQGDAAPHLTFQGKYMGLRQYFA 426

Query: 80  PDFISHYIFNISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQ 130
            D+     F+I Q YPYWNR+ G DH +       AC +   I  S M   W    NA  
Sbjct: 427 GDYSKQIYFHIQQNYPYWNRSAGRDHIWFFPWDEGACSAPKEIWNSMMLSHWG-NTNAKH 485

Query: 131 VVCSSSY-----------FISGHIAH---KDVSLP-----QIWPRQEDPPKLGSSKRNKL 171
              +++Y           +   H  +   KD+ LP       +P  ++        R  L
Sbjct: 486 KASTTAYRADNWDLIPPEWRGDHPCYDPAKDLVLPAWKFPDPYPIVQNLSSRHRQDRPTL 545

Query: 172 AFFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGRL----------KTP-Y 207
            +F G + S              +R+KL   + +        GR           ++P Y
Sbjct: 546 FYFNGNLGSAYDNGRPEPGYSMGIRQKLAAEFGSQPNKKGLLGRQAVDDVVVQAQRSPQY 605

Query: 208 ADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATL 267
             EL  S+FC  + G +  + R+ DS+  GC+PVII +   LPF +VL+++SF++ VA  
Sbjct: 606 KLELSKSRFCGVLPG-DGWSGRMEDSILSGCIPVIIQDGIHLPFENVLDYESFTVRVAED 664

Query: 268 DIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
           +I  L  ILK I+  +   + + V  + + F +H
Sbjct: 665 NIHNLITILKAINEAQVDSMLAVVRGLWQRFTYH 698


>gi|413956860|gb|AFW89509.1| hypothetical protein ZEAMMB73_873038 [Zea mays]
          Length = 783

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 72/265 (27%), Positives = 115/265 (43%), Gaps = 60/265 (22%)

Query: 89  NISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQVVCSSSYFI 139
           +I+Q+YPYWNRT G DH +       AC++   I +S M   W    N      +++Y+ 
Sbjct: 451 HIAQQYPYWNRTSGRDHIWFFSWDEGACYAPKEIWKSMMLVHWG-NTNTKHKNSTTAYWA 509

Query: 140 -----------SGHIA---HKDVSLPQ--------IWPRQEDPPKLGSSKRNKLAFFAGA 177
                        H      KD+ LP         IW +    P+   + R  L +F G 
Sbjct: 510 DNWDDIPLDKRGNHPCFDPRKDLVLPAWKEPNPGAIWLKLWARPR---NNRTTLFYFNGN 566

Query: 178 VNSP-------------VREKLLQVWRNDSEIYAHSGR----------LKTP-YADELLG 213
           + S              +R+KL   + +        GR          L+T  Y +EL  
Sbjct: 567 LGSAYEGGRPEDTYSMGIRQKLAAEFGSTPNKQGRLGRQHAADVTVTYLRTEKYYEELAS 626

Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLK 273
           S FC  + G +  + R+ DS+  GC+PVII +   LP+ +VLN+ SF++ +   DIP L 
Sbjct: 627 SVFCGVLPG-DGWSGRMEDSMLQGCIPVIIQDGIFLPYENVLNYNSFAVRIQEDDIPGLI 685

Query: 274 KILKGISSEEYLLLQSYVLKVRKHF 298
             L+GI+  +   +   V ++ + F
Sbjct: 686 STLRGINDTQVEFMLGNVRQMWQRF 710


>gi|212722910|ref|NP_001131485.1| uncharacterized protein LOC100192822 [Zea mays]
 gi|194691662|gb|ACF79915.1| unknown [Zea mays]
          Length = 426

 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 72/274 (26%), Positives = 127/274 (46%), Gaps = 26/274 (9%)

Query: 30  RGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFN 89
           +G + ++    +  +KS F T +   A+LFF+P   + ++  R  G     +    Y+  
Sbjct: 84  KGQWGTQVKVHQFLLKSRFRTFNKDHANLFFVP---SYVKCVRMTGALSDKEINQTYVKV 140

Query: 90  ISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVC-----SSSYFISGHIA 144
           +SQ  PY+ R+GG DH +V     G + + ++W + LN   ++      +     S    
Sbjct: 141 LSQ-MPYFRRSGGRDHIFVFPSGAG-AHLFRSWAIFLNRSIILTPEGDRTDKRGTSAFNT 198

Query: 145 HKDVSLPQIWPR---QEDPPKLGS---SKRNKLAFFAG-AVNSPVREKLLQVWRN-DSEI 196
            KD+ +P        + D P +     +KR  LA F G A     R +L+++ +    ++
Sbjct: 199 WKDIIIPGNVDDSMVKSDAPAVQPIPLTKRKYLANFLGRAQGKAGRLQLVELAKQYPDKL 258

Query: 197 YAHSGRLKTP-------YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
            +   +L  P       Y   L  +KFCL  +G    T R  +S +  CVPVI+++  +L
Sbjct: 259 ESPELKLSGPNKLGRIEYFKHLRNAKFCLAPRGESSWTLRFYESFFVECVPVILSDEVEL 318

Query: 250 PFADVLNWKSFSIVVATLDI-PLLKKILKGISSE 282
           PF +V+++   SI   +  I P L + L+ IS E
Sbjct: 319 PFQNVIDYSEISIKWPSSRIGPELLEYLESISDE 352


>gi|449452903|ref|XP_004144198.1| PREDICTED: probable glycosyltransferase At3g07620-like [Cucumis
           sativus]
          Length = 429

 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 68/285 (23%), Positives = 124/285 (43%), Gaps = 30/285 (10%)

Query: 52  DPSKADLFFLPF--SIARMRHDRRIGTEGIPDFISHYI----FNISQKYPYWNRTGGADH 105
           DP  AD FF+PF  S++   H R +        + H +         +  YW R+ G DH
Sbjct: 122 DPENADAFFVPFFSSLSFNSHGRNMTDPATE--VDHQLQIDLMKFLSESKYWQRSKGRDH 179

Query: 106 FYVACHSIGRSAMQKAWEVKLNA-IQVVCS-SSYFISGHIAHKDVSLPQIWPRQ---EDP 160
                H      ++     ++NA IQ+V     Y  +     KDV  P +       +D 
Sbjct: 180 VIPMTHPNAFRFLRN----QVNASIQIVVDFGRYPKTMSNLGKDVVAPYVHVVSSFIDDN 235

Query: 161 PKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIY-----AHSGRLKTPYADEL 211
           P      R  L FF G      +  +R KL ++     +++     A    +KT  +  +
Sbjct: 236 PPDPFESRPTLLFFQGKTFRKDDGIIRVKLAKILDGYDDVHYERSAATEKSIKT-SSQGM 294

Query: 212 LGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVA---TLD 268
             SKFCLH  G   ++ R+ D++   CVPVI+++  +LP+ D +++  F++  +    L 
Sbjct: 295 RSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPYEDEIDYSQFTLFFSFEEALQ 354

Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
              + + L+    E ++ +   + ++ +H+++   P   DA  M+
Sbjct: 355 PGYMVEKLREFPKERWIEMWKQLKEISRHYEFQYPPKKEDAVNML 399


>gi|357126548|ref|XP_003564949.1| PREDICTED: probable glycosyltransferase At3g42180-like
           [Brachypodium distachyon]
          Length = 432

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 70/284 (24%), Positives = 120/284 (42%), Gaps = 28/284 (9%)

Query: 52  DPSKADLFFLPF--SIARMRHDRRIGTEGIPD-----FISHYIFNISQKYPYWNRTGGAD 104
           DP  A+ FF+PF  S++   H R +     PD      +   + ++  K  YW R+ G D
Sbjct: 128 DPEAAEAFFVPFFSSLSFNVHGRNMTD---PDTEADRLLQVELIDVLWKSKYWQRSAGRD 184

Query: 105 HFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPR----QEDP 160
           H     H      ++    V  + + V     Y        KDV  P +        +DP
Sbjct: 185 HVIPMHHPNAFRFLRDM--VNASVLIVADFGRYTQELASLRKDVVAPYVHVVDSFINDDP 242

Query: 161 PKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDS----EIYAHSGRLKTPYADELL 212
           P    + R  L FF G         +R KL ++ ++      E    +G       + + 
Sbjct: 243 PDPFEA-RPTLLFFRGRTVRKAEGKIRAKLAKILKDKDGVRFEDSLATGEGINTSTEGMR 301

Query: 213 GSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI---VVATLDI 269
            SKFCLH  G   ++ R+ D++   C+PVI+++  +LPF D +++  FS+   V   L  
Sbjct: 302 SSKFCLHPAGDTPSSCRLFDAIVSHCIPVIVSSRIELPFEDEIDYSEFSLFFSVEEALKP 361

Query: 270 PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
             L   L+ +  E+++ + S +  V  H+++       DA  M+
Sbjct: 362 DYLLDQLRQMPKEKWVEMWSKLKNVSSHYEFQYPTRKGDAVNMI 405


>gi|357168395|ref|XP_003581626.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10-like
           [Brachypodium distachyon]
          Length = 348

 Score = 71.6 bits (174), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 75/291 (25%), Positives = 129/291 (44%), Gaps = 26/291 (8%)

Query: 41  KAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRT 100
           +  +KS F T D  +A LFF+P   + ++  R  G     +    Y+  +SQ  PY+ R+
Sbjct: 44  QLLLKSRFRTLDKDEAHLFFVP---SYVKCVRMTGALTDKEINQTYVKVLSQ-MPYFRRS 99

Query: 101 GGADHFYVACHSIGRSAMQKAWEVKLNAIQVVC-----SSSYFISGHIAHKDVSLPQIWP 155
           GG DH +V     G + + ++W   LN   ++      +    IS     KD+ +P    
Sbjct: 100 GGRDHIFVFPSGAG-AHLFRSWATFLNRSIILTPEGDRTDKRGISAFNTWKDIIIPG--- 155

Query: 156 RQEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSK 215
             +D   +G + R KL   A      +    L++   D       GR+   Y   L  +K
Sbjct: 156 NVDD--SMGKAGRLKLVELAKQYPDKLESPELKLSGPDK-----LGRID--YFKHLRNAK 206

Query: 216 FCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI-PLLKK 274
           FCL  +G    T R  +S +  CVPVI+++  +LPF +++++   SI   +  I P L +
Sbjct: 207 FCLAPRGESSWTLRFYESFFVECVPVILSDEVELPFQNMIDYTEISIKWPSSKISPELFE 266

Query: 275 ILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
            L+ I  E    ++  + + R+     V+  D +    +   LW  +  VR
Sbjct: 267 YLESIPEER---IEEMIARGREVRCLWVYAPDLEPCSAMAAILWELQRKVR 314


>gi|255558009|ref|XP_002520033.1| catalytic, putative [Ricinus communis]
 gi|223540797|gb|EEF42357.1| catalytic, putative [Ricinus communis]
          Length = 507

 Score = 71.6 bits (174), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 58/224 (25%), Positives = 98/224 (43%), Gaps = 24/224 (10%)

Query: 95  PYWNRTGGADHFYVACH-----SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDV 148
           P W R+GG DH     H     S+ R      W +      +  + +++  G +   KD+
Sbjct: 229 PAWKRSGGRDHILPVHHPWSFKSVRRYVKNAIWLLP----DMDSTGNWYKPGQVFLEKDL 284

Query: 149 SLPQIWPRQEDPPKLGS---SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSG 201
            LP +        K  S   SKR  L FF G +       +R KL+        +    G
Sbjct: 285 ILPYVPNVDLCDAKCASENESKRTTLLFFRGRLKRNAGGKIRAKLVAELSGAEGVVVEEG 344

Query: 202 RL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
                 K      +  S FCL   G   ++AR+ D++  GC+PVI+++  +LPF  +L++
Sbjct: 345 TAGEGGKAAAQTGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDY 404

Query: 258 KSFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHF 298
           +  ++ V++ D      L K LK +S  +   +Q  ++K  +HF
Sbjct: 405 RKIAVFVSSSDAIQPGWLIKFLKDVSPAQTREMQRNLVKYSRHF 448


>gi|392575274|gb|EIW68408.1| hypothetical protein TREMEDRAFT_32575 [Tremella mesenterica DSM 1558]
          Length = 1176

 Score = 71.2 bits (173), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 33/92 (35%), Positives = 53/92 (57%)

Query: 207  YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
            Y  E+ G++FC    G    + RI D++Y GC+PV+ A     PFAD L+W  FSI +  
Sbjct: 1032 YMKEISGARFCPQPTGIAGWSPRINDAIYAGCIPVLTAEGTHYPFADFLDWSKFSIRIKP 1091

Query: 267  LDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
             ++  L++IL  I  E+   +Q+ ++ VR+ F
Sbjct: 1092 TELDQLERILSAIPLEQLEEMQANLMLVREAF 1123


>gi|357113818|ref|XP_003558698.1| PREDICTED: uncharacterized protein LOC100844507 [Brachypodium
           distachyon]
          Length = 781

 Score = 71.2 bits (173), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 71/265 (26%), Positives = 111/265 (41%), Gaps = 60/265 (22%)

Query: 89  NISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQVVCSSSYFI 139
           +ISQ+Y YWNRT G DH +       AC++   I  S M   W    N      +++Y+ 
Sbjct: 450 HISQRYAYWNRTSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWG-NTNTKHENSTTAYWA 508

Query: 140 -----------SGHIA---HKDVSLPQ--------IWPRQEDPPKLGSSKRNKLAFFAGA 177
                        H      KD+ LP         IW +    P++    R  L +F G 
Sbjct: 509 DNWDDIPLDRRGNHPCFDPRKDLVLPAWKVPEPGAIWLKLWARPRI---NRTTLFYFNGN 565

Query: 178 VNSP-------------VREKLLQVWRNDSEIYAHSGRLKTP-----------YADELLG 213
           +                +R+KL   + +        GR  T            Y +EL  
Sbjct: 566 LGPAYEQGRPEDTYSMGIRQKLAAEFGSTPSKEGKLGRQHTANVTVTYLRSEKYYEELAS 625

Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLK 273
           S FC  + G +  + R+ DS+  GC+PVII +   LP+ +VLN+ SF++ +   DIP L 
Sbjct: 626 SVFCGALPG-DGWSGRMEDSMLQGCIPVIIQDGIFLPYENVLNYNSFAVRIQEHDIPNLI 684

Query: 274 KILKGISSEEYLLLQSYVLKVRKHF 298
           +IL GI+  +   +   V ++ + F
Sbjct: 685 RILGGINETQIEFMLGNVRQIWQRF 709


>gi|289166880|gb|ADC84490.1| glycosyltransferase family 47C [Salix miyabeana]
          Length = 252

 Score = 71.2 bits (173), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 63/233 (27%), Positives = 98/233 (42%), Gaps = 29/233 (12%)

Query: 82  FISHYIFNISQKYPYWNRTGGADHFYVACHSIG--------RSAMQKAWEVKLNAIQVVC 133
            +S  +  IS  +P+WNR+ G+DH +VA H  G        R+A     E    +I +  
Sbjct: 5   LLSSAVNLISSNHPFWNRSRGSDHVFVASHDYGACFHAMEERAAEDGIPEFLKRSIILQT 64

Query: 134 SSSYFISGHIAHKDVSLPQ-IWPRQ-----EDPPKLGSSKRNKLAFFAGAV--------- 178
               F       ++V +P  I P       E  P  G  +R+   FF G +         
Sbjct: 65  FGVKFDHPCQDVENVVIPPFITPESVQTTLEKYPLTG--RRDIWVFFRGKMEVHPKNISG 122

Query: 179 ---NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLY 235
              +  VR  + + +  D   Y    R    Y  E+  S FCL   G+   + R+ +S+ 
Sbjct: 123 RYYSKKVRTVIWRKYSGDPRFYLRRHRFAG-YQSEIARSVFCLCPLGWAPWSPRLVESIA 181

Query: 236 YGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
            GCVPVIIA+   LPF   + W   S+ VA  D+  L  +L  +++     +Q
Sbjct: 182 LGCVPVIIADGIRLPFPAAVRWSDISLTVAEKDVADLGTLLDHVAASNLSAIQ 234


>gi|449489243|ref|XP_004158257.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
           At3g07620-like [Cucumis sativus]
          Length = 429

 Score = 71.2 bits (173), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 68/285 (23%), Positives = 123/285 (43%), Gaps = 30/285 (10%)

Query: 52  DPSKADLFFLPF--SIARMRHDRRIGTEGIPDFISHYI----FNISQKYPYWNRTGGADH 105
           DP  AD FF+PF  S++   H R +        + H +         +  YW R+ G DH
Sbjct: 122 DPENADAFFVPFFSSLSFNSHGRNMTDPATE--VDHQLQIDLMKFLSESKYWQRSKGRDH 179

Query: 106 FYVACHSIGRSAMQKAWEVKLNA-IQVVCS-SSYFISGHIAHKDVSLPQIWPRQ---EDP 160
                H      ++     ++NA IQ+V     Y  +     KDV  P +       +D 
Sbjct: 180 VIPMTHPNAFRFLRN----QVNASIQIVVDFGRYPKTMSNLGKDVVAPYVHVVSSFIDDN 235

Query: 161 PKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIY-----AHSGRLKTPYADEL 211
           P      R  L FF G      +  +R KL ++     +++     A    +KT  +  +
Sbjct: 236 PPDPFESRPTLLFFQGKTFRKDDGIIRVKLAKILDGYDDVHYERSAATEKSIKT-SSQGM 294

Query: 212 LGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV---ATLD 268
             SKFCLH  G   ++ R+ D++   CVPVI+++  +LP+ D +++  F++       L 
Sbjct: 295 RSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPYEDEIDYSQFTLFFXFEEALQ 354

Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
              + + L+    E ++ +   + ++ +H+++   P   DA  M+
Sbjct: 355 PGYMVEKLREFPKERWIEMWKQLKEISRHYEFQYPPKKEDAVNML 399


>gi|225461772|ref|XP_002285599.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
           IRX7-like [Vitis vinifera]
          Length = 513

 Score = 71.2 bits (173), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 59/223 (26%), Positives = 101/223 (45%), Gaps = 22/223 (9%)

Query: 95  PYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVV----CSSSYFISGHIA-HKDVS 149
           P W R+ G DH     H      ++K+ +   NAI ++     + +++  G ++  KD+ 
Sbjct: 238 PAWKRSEGRDHILPVHHPWSFKTVRKSMK---NAIWLLPDMDSTGNWYKPGQVSLEKDLI 294

Query: 150 LPQIWPRQEDPPKLGS---SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGR 202
           LP +        K  S   SKR  L FF G +       +R KL+     D  +    G 
Sbjct: 295 LPYVPNVDLCDAKCSSESESKRKTLLFFRGRLKRNAGGKIRAKLMAELSGDDGVVIQEGT 354

Query: 203 L----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWK 258
                K      +  S FCL   G   ++AR+ D++  GC+PVI+++  +LPF  +L+++
Sbjct: 355 AGEGGKEAAQRGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDYR 414

Query: 259 SFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHF 298
             ++ V++ D      L   LK IS  +   +Q  + K  +HF
Sbjct: 415 KIALFVSSSDAMQPGWLLTFLKSISPAQIKEMQRNLAKYSRHF 457


>gi|302142837|emb|CBI20132.3| unnamed protein product [Vitis vinifera]
          Length = 379

 Score = 71.2 bits (173), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 59/223 (26%), Positives = 101/223 (45%), Gaps = 22/223 (9%)

Query: 95  PYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVV----CSSSYFISGHIA-HKDVS 149
           P W R+ G DH     H      ++K+ +   NAI ++     + +++  G ++  KD+ 
Sbjct: 104 PAWKRSEGRDHILPVHHPWSFKTVRKSMK---NAIWLLPDMDSTGNWYKPGQVSLEKDLI 160

Query: 150 LPQIWPRQEDPPKLGS---SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGR 202
           LP +        K  S   SKR  L FF G +       +R KL+     D  +    G 
Sbjct: 161 LPYVPNVDLCDAKCSSESESKRKTLLFFRGRLKRNAGGKIRAKLMAELSGDDGVVIQEGT 220

Query: 203 L----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWK 258
                K      +  S FCL   G   ++AR+ D++  GC+PVI+++  +LPF  +L+++
Sbjct: 221 AGEGGKEAAQRGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDYR 280

Query: 259 SFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHF 298
             ++ V++ D      L   LK IS  +   +Q  + K  +HF
Sbjct: 281 KIALFVSSSDAMQPGWLLTFLKSISPAQIKEMQRNLAKYSRHF 323


>gi|356503460|ref|XP_003520526.1| PREDICTED: uncharacterized protein LOC100775594 [Glycine max]
          Length = 761

 Score = 70.9 bits (172), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 81/337 (24%), Positives = 136/337 (40%), Gaps = 81/337 (24%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFS----IARMRHDRRIGT---EGIPDFISH 85
           Y ++    ++ + S   T +  +AD FF+P      I R  H   + T   EG+  F++ 
Sbjct: 360 YGAQIALYESILASPHRTLNGDEADFFFVPVLDSCLIDRADHAPHLSTQNHEGLRSFLTL 419

Query: 86  YIF-----NISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAW----EVKLN 127
             +     +I ++YPYWN + G DH +       AC++   I  S M   W        +
Sbjct: 420 DFYKNAYNHIVEQYPYWNCSSGRDHIWFFSWDEGACYAPKEIWSSMMLVHWGNTNTKHYH 479

Query: 128 AIQVVCSSSY---------FISGHIAHKDVSLPQ--------------IWPRQEDPPKLG 164
           +    C  ++         F       KD+ +P                WP +       
Sbjct: 480 STTAYCPDNWDGIPSDRRGFHPCFDPEKDLVIPAWKVTHVHVLSSKLWAWPLE------- 532

Query: 165 SSKRNKLAFFAGAVNSP------------VREKLLQVWRNDSEIYAHSGRLKTP------ 206
             KR  L +F G +               +R+KL + + +        G+ +        
Sbjct: 533 --KRKTLFYFNGNLGPAYPYGRNEWYSMGIRQKLAEEFGSKPNKEGKLGKQRAKDVVVTA 590

Query: 207 -----YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFS 261
                Y  EL  S FC  + G +  + R+ DS+  GC+PVII +   LP+ +VLN+ SF+
Sbjct: 591 ERSENYEVELASSVFCGVLPG-DGWSGRMEDSVLQGCIPVIIQDGIFLPYENVLNYDSFA 649

Query: 262 IVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
           + +   +IP L KIL+GI+  E     + V K+ + F
Sbjct: 650 VRIPEDEIPNLIKILRGINDTEIKFKLANVQKIWQRF 686


>gi|297611147|ref|NP_001065633.2| Os11g0128000 [Oryza sativa Japonica Group]
 gi|255679743|dbj|BAF27478.2| Os11g0128000, partial [Oryza sativa Japonica Group]
          Length = 199

 Score = 70.9 bits (172), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 43/166 (25%), Positives = 90/166 (54%), Gaps = 10/166 (6%)

Query: 169 NKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKFCLHVKGFE 224
           N + F  G     +R++L  + +++ ++Y   G ++   A +    +  SKFCL++ G  
Sbjct: 22  NCVIFVQGG---NIRQELHYMLKDEKDVYFAFGSVQDHGASKASQGMHASKFCLNIAGDT 78

Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLKKILKGISS 281
            ++ R+ D++   CVPVII++  +LP+ D L++  FSI V + D      L ++++G+S 
Sbjct: 79  PSSNRLFDAIVSHCVPVIISDDIELPYEDALDYSKFSIFVRSSDAVKKGYLMRLIRGVSK 138

Query: 282 EEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
            ++ ++   + +V KHF++       DA  M+   L  +  ++R++
Sbjct: 139 HQWTMMWRRLKEVDKHFEYQYPSQKDDAVQMIWQTLARKVPAIRLK 184


>gi|297833038|ref|XP_002884401.1| hypothetical protein ARALYDRAFT_477609 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297330241|gb|EFH60660.1| hypothetical protein ARALYDRAFT_477609 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 495

 Score = 70.5 bits (171), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 75/290 (25%), Positives = 138/290 (47%), Gaps = 35/290 (12%)

Query: 50  TKDPSKADLFFLPF--SIARMR----HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGA 103
            K+ S+AD+ F+PF  S++  R    + ++  ++     ++   +  SQK   W  +GG 
Sbjct: 196 VKNSSEADVVFVPFFSSLSYNRFSKVNQKQKKSQDKELQVNVVKYVTSQKE--WKISGGK 253

Query: 104 DHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLP--QIWPRQED 159
           DH  +A H    ++M  A      A+ VV     + S H+A+  KD+  P   + P   +
Sbjct: 254 DHVIMAHHP---NSMSTARHKLYPAMFVVADFGRY-SPHVANIDKDIVAPYKHLVPSYAN 309

Query: 160 PPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLK----TPYADEL 211
               G   R  L +F GA+       VR++L +      +++   G ++    T   + +
Sbjct: 310 DTS-GFDGRPILLYFQGAIYRKAGGFVRQELYK------DVHFSFGSVRNHGITKAGEGM 362

Query: 212 LGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI-- 269
             SKFCL++ G   ++ R+ D++   C+PVII++  +LP+ DVLN+  F + V + D   
Sbjct: 363 RSSKFCLNIAGDTPSSNRLFDAIASHCIPVIISDDIELPYEDVLNYNEFCLFVRSSDALK 422

Query: 270 -PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELW 318
              L  ++K I  +EY  +   + +V ++F    FP+  D     +  +W
Sbjct: 423 KGFLMGLVKSIGRDEYNKMWLRLKEVERYFDLR-FPTKDDEGDYAVQMIW 471


>gi|384490457|gb|EIE81679.1| hypothetical protein RO3G_06384 [Rhizopus delemar RA 99-880]
          Length = 793

 Score = 70.5 bits (171), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 67/308 (21%), Positives = 131/308 (42%), Gaps = 47/308 (15%)

Query: 46  SHFVTKDPSKADLFFLPFSIA----------RMRHDRRIGTEGIPDFISHYIFNISQKYP 95
            H+VT++P +AD FF+PF  +          +   D R   +    ++   +  + Q+YP
Sbjct: 481 EHYVTENPEEADFFFIPFFGSCYLYNCWYENKWNWDERCEVDA--KYVDPLMDMVIQEYP 538

Query: 96  YWNRTGGADHFYVACHSIGRS----AMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLP 151
           YWN+TGG +H  +  H + ++         ++  +    V    + ++S H  H+D+ +P
Sbjct: 539 YWNKTGGRNHIMI--HPMDKTFTYYQSNPRFQSAIFLKTVGDKRNKWMSRHRYHRDIVIP 596

Query: 152 QI-------------WPRQEDPPKLGSSKRNKLAFFAGAV---------NSPVREKLLQV 189
                          +   +  PK  S KR+  A F G           ++ +R      
Sbjct: 597 SATRMIHHLRANPLDYLNAQGQPK--SGKRDIFALFQGCCPDVQPTDEYSNGIRSLFFNH 654

Query: 190 WRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
           + +              Y ++L  +K+ L   G+ ++T RI + + +G VPV+IA+    
Sbjct: 655 FAHYPGYEIGQSVADEEYLEKLSRAKYGLSPMGWTLDTTRIWEFMAFGVVPVVIADGIIE 714

Query: 250 PFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDA 309
           PF   ++W  F + +   ++  L +ILK I  + Y   Q  + +  +     +     DA
Sbjct: 715 PFEFDVDWDKFIVRIRRDEVHRLDEILKSIDDKTYEYKQKNLWEFGRRVGLEM-----DA 769

Query: 310 FYMVMYEL 317
           ++ ++ EL
Sbjct: 770 WHFIVREL 777


>gi|412986477|emb|CCO14903.1| predicted protein [Bathycoccus prasinos]
          Length = 574

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 40/108 (37%), Positives = 61/108 (56%), Gaps = 5/108 (4%)

Query: 161 PKLGSSKRNKLAFFAGAVNSPVREKLLQ--VWRNDSEIYAHSGRLKTP--YADELLGSKF 216
           PKL  ++RN    F G    P+REK+ +  +     E    +  + +P  Y   +  SK+
Sbjct: 389 PKL-EAERNIRLMFRGNNRGPLREKVFRYLIENGSPEDSIETTGVASPQAYMSLMEHSKY 447

Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV 264
           CLHV+G  V + R+ + + +GCVPVI+A+ Y+LP A  L+W  FSI V
Sbjct: 448 CLHVRGTRVMSPRLIELMLFGCVPVIVADAYELPLAWFLDWTKFSIRV 495


>gi|359359217|gb|AEV41121.1| putative exostosin family domain-containing protein [Oryza
           officinalis]
          Length = 460

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 66/250 (26%), Positives = 112/250 (44%), Gaps = 36/250 (14%)

Query: 49  VTKDPSKADLFFLPFSIARMRHDRRIG---------TEGIPDFISHY-IFNISQKYPYWN 98
           V  D   AD+ F+PF  A +  +  +G          EG  D+     + +    +P W 
Sbjct: 122 VVADWRDADVVFVPF-FATLSAEMELGWGAKGAFRRKEGNEDYRRQREVVDRVTAHPAWR 180

Query: 99  RTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVV----------CSSSYFISGHIAHKDV 148
           R+GG DH +V    +    ++K  E+  + + VV           S+S  +S  I H  V
Sbjct: 181 RSGGRDHVFVLTDPVAMWHVRK--EIAPSILLVVDFGGWYKLDSNSASSNVSHMIQHTQV 238

Query: 149 SLPQ--IWPRQEDPPKLGSSK---RNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAH 199
           SL +  I P     P +  S+   R  L +F GA        VREKL  +  N+ ++   
Sbjct: 239 SLLKDVIVPYTHLLPTMQLSENKDRLTLLYFKGAKHRHRGGLVREKLWDLMVNEPDVVME 298

Query: 200 SGRLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
            G       ++    +  S+FCLH  G    + R+ D++   C+PVI+++  +LPF  ++
Sbjct: 299 EGYPNATGREQSIKGMRTSEFCLHPAGDTPTSCRLFDAVASLCIPVIVSDEIELPFEGMI 358

Query: 256 NWKSFSIVVA 265
           ++  F+I V+
Sbjct: 359 DYTEFTIFVS 368


>gi|357474287|ref|XP_003607428.1| Exostosin family protein-like protein [Medicago truncatula]
 gi|355508483|gb|AES89625.1| Exostosin family protein-like protein [Medicago truncatula]
          Length = 502

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 59/224 (26%), Positives = 101/224 (45%), Gaps = 24/224 (10%)

Query: 95  PYWNRTGGADHFYVACH-----SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDV 148
           P W R+GG DH     H     ++ R   +  W +      +  + +++  G +   KD+
Sbjct: 222 PAWKRSGGRDHILPVHHPWSFKTVRRYVKKAIWLLP----DMDSTGNWYKPGQVYLEKDL 277

Query: 149 SLPQIWPRQ-EDPPKLG--SSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSG 201
            LP +      D   L   + KRN L FF G +       +R KL+   R    +    G
Sbjct: 278 ILPYVANVDFCDATCLSEINPKRNTLLFFRGRLKRNAGGKIRSKLVDQLRGADGVVIEEG 337

Query: 202 RL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
                 K    + +  S FCL+  G   ++AR+ D++  GC+PVI+++  +LPF  +L++
Sbjct: 338 TSGEGGKEAAQNGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDY 397

Query: 258 KSFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHF 298
           +  ++ V++ D      L K LK I S     +Q  + K  +HF
Sbjct: 398 RKIALFVSSNDALKPSWLLKYLKDIRSAHIKEMQQNLAKYSRHF 441


>gi|359473738|ref|XP_002272591.2| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
           At5g25310-like [Vitis vinifera]
          Length = 437

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 71/284 (25%), Positives = 121/284 (42%), Gaps = 25/284 (8%)

Query: 52  DPSKADLFFLPF--SIARMRHDRRIGTEGIPDFISHY---IFNISQKYPYWNRTGGADHF 106
           DP  AD FF+PF  S++   H   + T+   +F       I  I ++  YW R+GG DH 
Sbjct: 126 DPEMADAFFVPFFSSLSFNTHGHNM-TDPDTEFDRQLQIDILKILRESKYWQRSGGRDHV 184

Query: 107 YVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW---PRQEDPPKL 163
               H       ++  +V  + + V     Y        KDV  P +       +D    
Sbjct: 185 IPMHHPNAFRFFRE--QVNTSILIVADFGRYPKEISNLRKDVVAPYVHVVDSFTDDNSPD 242

Query: 164 GSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEI----YAHSGRLKTPYADELLG-- 213
               R  L FF G         VR+KL+++     +     + H   L         G  
Sbjct: 243 PYESRTTLLFFRGRTIRKDEGIVRDKLVKLLAGXDDYLQLHFHHRSYLSFLVXQSTQGMR 302

Query: 214 -SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVA---TLDI 269
            SKFCLH  G   ++ R+ D++   CVPVI+++  +LP+ D +++  FSI  +    L+ 
Sbjct: 303 SSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPYEDEIDYTQFSIFFSDKEALEP 362

Query: 270 PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
             + + L+ I  E ++ +  ++  +  H+++   P   DA  M+
Sbjct: 363 GYMIEQLRQIPKERWVEMWRHLKYISHHYEFQYPPKKGDAIDML 406


>gi|193687008|ref|XP_001944271.1| PREDICTED: exostosin-2-like [Acyrthosiphon pisum]
          Length = 703

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 79/301 (26%), Positives = 127/301 (42%), Gaps = 41/301 (13%)

Query: 2   NRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLF 59
           N  F+VYVYP  R+       +  DF P G   S+ Y     A ++S + T +P +A +F
Sbjct: 86  NGDFKVYVYPMARH-------VDEDFIPIGGKMSKEYHTILSAIVESQYYTTNPEEACVF 138

Query: 60  FLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGA--DHFYVACHSIGRSA 117
                       R   T      + H+  N  Q +  +N   G   D+  V   SIG+S 
Sbjct: 139 VSSIDTLNQNRFRVKETSQALALLPHW--NDGQNHLIFNMVPGTAPDYKTVVDLSIGKSM 196

Query: 118 MQKA----WEVKLNAIQVVCSSSYFISGHI-AHKDVSLPQIWPRQEDPPKLGSSKRNKLA 172
           +  A    W  +         SS+ IS  I ++  +SL   +  +     + + + N   
Sbjct: 197 VAGAGFDSWTYR---------SSFDISIAIYSNLAISLSNNYTFKHRTTFISTVQTNLHN 247

Query: 173 FFAGAVNSPVREKLLQVWRNDSEI-----YAHSGRLKT---------PYADELLGSKFCL 218
            F  ++ S  ++K +      S I      +HSG  +T          YAD L  S FCL
Sbjct: 248 DFITSLKSIEKQKSMTQNITMSMIRVIEPCSHSGHDRTIICHENITYNYADILADSVFCL 307

Query: 219 HVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKG 278
            + G  +      DSL  GC+P++  NH  LPF +V++WK   I+ +  ++  L  ++ G
Sbjct: 308 ILPGPRLMDMVFIDSLAAGCIPIVAINHVVLPFFEVIDWKRAVIMWSETELNTLLDVVSG 367

Query: 279 I 279
           I
Sbjct: 368 I 368


>gi|34394613|dbj|BAC83915.1| limonene cyclase like protein [Oryza sativa Japonica Group]
 gi|50508945|dbj|BAD31849.1| limonene cyclase like protein [Oryza sativa Japonica Group]
          Length = 417

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 46/122 (37%), Positives = 66/122 (54%), Gaps = 23/122 (18%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPV--DFEPRGNYASESYFKKAF-MKSHFVTKDPSKAD 57
           M + F+V+VY     +P      PV  D   R  Y++E  F  A  M++   T+DP +A 
Sbjct: 268 MEKVFKVFVY--EEGEP------PVFHDGPCRSIYSTEGRFIYAMEMENRMRTRDPDQAH 319

Query: 58  LFFLPFSIARM--------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVA 109
           +FFLPFS+ +M         HD     + +   IS YI  +S KYP+WNR+ GADHF ++
Sbjct: 320 VFFLPFSVVKMVKMIYEPNSHD----MDPLRRTISDYINVVSTKYPHWNRSLGADHFMLS 375

Query: 110 CH 111
           CH
Sbjct: 376 CH 377


>gi|449501885|ref|XP_004161484.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
           LOC101226446 [Cucumis sativus]
          Length = 859

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 84/334 (25%), Positives = 140/334 (41%), Gaps = 70/334 (20%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPF--SIARMRHDRRIGTE-----GIPDFISH 85
           Y +E  F ++ + S   T +  +AD FF+P   S    R D           G+  F++ 
Sbjct: 391 YGAEMAFYESILASPHRTLNGEEADFFFVPVLDSCIITRADDAPHLSLRDYMGLRSFLTL 450

Query: 86  YIF-----NISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQV 131
             +     +I ++YPYWNR+ G DH +       AC++   I  S M   W    N+   
Sbjct: 451 DFYKKAHDHIVEQYPYWNRSSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWG-NTNSKHN 509

Query: 132 VCSSSYFI-----------SGHIA---HKDVSLPQIWPRQEDPPKLGS-------SKRNK 170
             +++Y+              H      KD+ +P  W R  D  +L          +R  
Sbjct: 510 HSTTAYWGDNWDNIPSSKRGNHPCFDPEKDLVVPA-WKRP-DGSRLSKKLWARPREERKT 567

Query: 171 LAFFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGR------LKTP----- 206
             FF G +                +R+K+ + + +        G+      + TP     
Sbjct: 568 FFFFNGNLGPAYERGRPESTYSMGIRQKVAEEFGSSPNKEGKLGKQHAADVIVTPLRSEN 627

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y ++L  S FC  + G +  + R+ DS+  GC+PVII +   LP+ +VLN+ SF++ +  
Sbjct: 628 YHEDLASSVFCGVMPG-DGWSGRMEDSILQGCIPVIIQDGIFLPYENVLNYDSFAVRIGE 686

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
            DIP L  IL+G +  E     S V K+ + F +
Sbjct: 687 DDIPNLINILRGFNESEIEFKLSNVRKIWQRFMY 720


>gi|449437619|ref|XP_004136589.1| PREDICTED: uncharacterized protein LOC101206674 [Cucumis sativus]
          Length = 791

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 84/334 (25%), Positives = 140/334 (41%), Gaps = 70/334 (20%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPF--SIARMRHDRRIGTE-----GIPDFISH 85
           Y +E  F ++ + S   T +  +AD FF+P   S    R D           G+  F++ 
Sbjct: 391 YGAEMAFYESILASPHRTLNGEEADFFFVPVLDSCIITRADDAPHLSLRDYMGLRSFLTL 450

Query: 86  YIF-----NISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQV 131
             +     +I ++YPYWNR+ G DH +       AC++   I  S M   W    N+   
Sbjct: 451 DFYKKAHDHIVEQYPYWNRSSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWG-NTNSKHN 509

Query: 132 VCSSSYFI-----------SGHIA---HKDVSLPQIWPRQEDPPKLGS-------SKRNK 170
             +++Y+              H      KD+ +P  W R  D  +L          +R  
Sbjct: 510 HSTTAYWGDNWDNIPSSKRGNHPCFDPEKDLVVP-AWKRP-DGSRLSKKLWARPREERKT 567

Query: 171 LAFFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGR------LKTP----- 206
             FF G +                +R+K+ + + +        G+      + TP     
Sbjct: 568 FFFFNGNLGPAYERGRPESTYSMGIRQKVAEEFGSSPNKEGKLGKQHAADVIVTPLRSEN 627

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y ++L  S FC  + G +  + R+ DS+  GC+PVII +   LP+ +VLN+ SF++ +  
Sbjct: 628 YHEDLASSVFCGVMPG-DGWSGRMEDSILQGCIPVIIQDGIFLPYENVLNYDSFAVRIGE 686

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
            DIP L  IL+G +  E     S V K+ + F +
Sbjct: 687 DDIPNLINILRGFNESEIEFKLSNVRKIWQRFMY 720


>gi|348519397|ref|XP_003447217.1| PREDICTED: exostosin-1c-like [Oreochromis niloticus]
          Length = 740

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 80/356 (22%), Positives = 133/356 (37%), Gaps = 81/356 (22%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFL 61
            FRVY+YP  +ND                  SESY K   +  +S + T DP +A LF L
Sbjct: 108 GFRVYIYPSEKND----------------RVSESYRKILASIAESRYYTSDPREACLFVL 151

Query: 62  PFSIARMRHDRRIGTEGIPDFISHYIFNISQK---YPYWNRTGGADHFYVACHSIGRSAM 118
              I  +  D+  G          ++ N+ ++   YP WN   G +H     +S      
Sbjct: 152 --GIDTLDRDQLSG---------QFVPNVDERIRGYPLWN--DGRNHLIFNLYSGTWPNY 198

Query: 119 QKAWEVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSSK---------- 167
            +  ++  N  Q + + +   + H     DVS+P      +D P+ G  +          
Sbjct: 199 TE--DLGFNIGQAILAKASLNTEHFRPGFDVSIPLF---SKDHPQKGGERGWLVRNTVPP 253

Query: 168 -RNKLAFFAG-----AVNSPVREKLLQV-----------------WRNDSEIYAHSGRL- 203
            R  L  F G      + S  R  L  +                 W    +       L 
Sbjct: 254 RRKYLLMFKGKRYLTGIGSDTRNALHHIHNGKDIVSLTTCRHGKDWEKHKDARCDHDNLE 313

Query: 204 --KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFS 261
             +  Y + L  S FCL  +G  + + R  +SL   C+PV+++N ++LPF+DV+ W    
Sbjct: 314 YERFDYQELLHNSTFCLVPRGRRLGSFRFLESLQAACIPVLLSNGWELPFSDVIQWNQAV 373

Query: 262 IVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
           I      +  +   ++ + +E  L      L+ R    W  + S  D   +   E+
Sbjct: 374 IEGDERLLLQVPSTVRAVGNERVL-----ALRQRTQMLWEAYFSSVDKIVLTTLEI 424


>gi|224135879|ref|XP_002327326.1| predicted protein [Populus trichocarpa]
 gi|222835696|gb|EEE74131.1| predicted protein [Populus trichocarpa]
          Length = 449

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 83/307 (27%), Positives = 131/307 (42%), Gaps = 49/307 (15%)

Query: 34  ASESYFKKAFMKSHFVTKDPSKADLFFLPFSI---ARMRHDRRIGT----EGIPDFISH- 85
            SE    ++F K  F   D ++AD+ F+PF     A M   +  G+    EG  D+    
Sbjct: 100 TSEKLKSRSFAKRVF---DFNEADVVFVPFFATLSAEMELAKGKGSFRRKEGNEDYQRQK 156

Query: 86  YIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLN---AIQVVCS-------- 134
            + +  +    W R+GG DH +V    +        W V+     AI +V          
Sbjct: 157 EVVDFVRNSEAWKRSGGKDHVFVLTDPVA------MWHVRAEIAPAILLVVDFGGWYRLD 210

Query: 135 ---SSYFISGHIAHKDVSLPQ--IWPRQEDPPKLGSS---KRNKLAFFAGAV----NSPV 182
              S+   S  I H  VSL +  I P     P+   S   KRN L +F GA        V
Sbjct: 211 SKSSNGSSSDMIRHTQVSLLKDVIVPYTHLLPRFQFSENKKRNTLLYFKGAKHRHRGGIV 270

Query: 183 REKLLQVWRNDSEIYAHSGRLKTPYADEL-----LGSKFCLHVKGFEVNTARIADSLYYG 237
           RE L  +  N+  +    G        EL       S+FCLH  G    + R+ D++   
Sbjct: 271 RENLWDLLVNEPGVIMEEG-FPNATGRELSIRGMRTSEFCLHPAGDTPTSCRLFDAIQSL 329

Query: 238 CVPVIIANHYDLPFADVLNWKSFSIVVA---TLDIPLLKKILKGISSEEYLLLQSYVLKV 294
           C+PVI++++ +LPF  +L++  FS+ VA    L    L   L+ IS ++   L+  + K+
Sbjct: 330 CIPVIVSDNIELPFEGILDYTEFSVFVAGDDALKPTWLMDHLRSISEKQKEELRRNMAKI 389

Query: 295 RKHFQWH 301
           +  +Q+ 
Sbjct: 390 QLIYQYE 396


>gi|115451051|ref|NP_001049126.1| Os03g0174300 [Oryza sativa Japonica Group]
 gi|108706455|gb|ABF94250.1| exostosin family protein, putative, expressed [Oryza sativa
           Japonica Group]
 gi|113547597|dbj|BAF11040.1| Os03g0174300 [Oryza sativa Japonica Group]
 gi|215695509|dbj|BAG90700.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 576

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 82/311 (26%), Positives = 133/311 (42%), Gaps = 70/311 (22%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP----FSIAR--------MRHDRRIGTEGIP 80
           Y ++    ++ + S   T +  +AD F++P      I R        M  D R+ +    
Sbjct: 258 YGAQIALYESILASPHRTLNGDEADYFYVPALDSCLITRSDDAPHLQMPRDLRLRSYHTL 317

Query: 81  DFISHYIFNISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQV 131
           ++      +I+Q+YPYWNRT G DH +       AC++   I  S M   W    N    
Sbjct: 318 EYYRMTYDHIAQRYPYWNRTSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWG-NTNTKHK 376

Query: 132 VCSSSYF--------ISGHIAH------KDVSLPQIWPRQEDPP----KLGSSKRNK--- 170
             +++Y+        I     H      KD+ LP  W +Q +P     KL +  RN    
Sbjct: 377 NSTTAYWADNWNYIPIDRRGNHPCFDPRKDLVLPA-W-KQPNPAAIWLKLWARTRNNRTT 434

Query: 171 LAFFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGRLKTP----------- 206
           L +F G +                +R+KL   + +  +     GR  T            
Sbjct: 435 LFYFNGNLGPAYKDGRHEDTYSMGIRQKLAAEFGSTPDKQGKLGRQHTANVTVTYLRTEK 494

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y +EL  S FC  + G +  + R+ DS+  GC+PVII +   LP+ ++LN+ SF++ +  
Sbjct: 495 YYEELASSIFCGVLPG-DGWSGRMEDSMLQGCIPVIIQDGILLPYENMLNYNSFAVRIQE 553

Query: 267 LDIPLLKKILK 277
            DIP L +IL+
Sbjct: 554 DDIPNLIRILR 564


>gi|432883098|ref|XP_004074204.1| PREDICTED: exostosin-1c-like [Oryzias latipes]
          Length = 740

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 82/353 (23%), Positives = 133/353 (37%), Gaps = 75/353 (21%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFL 61
            FRVY+YP  +ND                  SESY K   +  +S + T DP +A LF L
Sbjct: 108 GFRVYIYPSEKND----------------RVSESYRKILASIAESRYYTSDPREACLFVL 151

Query: 62  PFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKA 121
              I  +  D+  G      F+S+    I + YP WN   G +H     +S       + 
Sbjct: 152 --GIDTLDRDQLSG-----QFVSNVDDRI-RGYPLWNE--GRNHLIFNLYSGTWPNYTE- 200

Query: 122 WEVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSSK-----------RN 169
            ++  N  Q + + +   + H     DVS+P      +D P+ G  +           R 
Sbjct: 201 -DLGFNIGQAILAKASLNTEHFRPGFDVSIPLF---SKDHPQKGGERGWLVRNSTPPRRK 256

Query: 170 KLAFFAG-----AVNSPVREKLLQV-----------------WRNDSEIYAHSGRL---K 204
            L  F G      + S  R  L  +                 W    +       L   +
Sbjct: 257 YLLMFKGKRYLTGIGSDTRNALHHIHNGKDIVSLTTCRHGKDWEKHKDARCDHDNLEYER 316

Query: 205 TPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV 264
             Y + L  S FCL  +G  + + R  +SL   C+PV+++N ++LPF+DV+ W    I  
Sbjct: 317 FDYQELLHNSTFCLVPRGRRLGSFRFLESLQAACIPVLLSNGWELPFSDVIQWNQAVIEG 376

Query: 265 ATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
               +  +   ++ + +E  L      L+ R    W  + S  D   +   E+
Sbjct: 377 DERLLLQVPSTVRAVGNERVL-----ALRQRTQMLWDAYFSSVDKIVLTTLEI 424


>gi|388512347|gb|AFK44235.1| unknown [Lotus japonicus]
          Length = 267

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 64/254 (25%), Positives = 113/254 (44%), Gaps = 27/254 (10%)

Query: 92  QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLN-AIQVVCSSSYFISGHIAHKDVSL 150
           +K  +W R+ G DH +   H      ++     +LN +IQVV     +  G   +KDV  
Sbjct: 5   KKSVHWQRSRGRDHVFPMTHPNAFRFLRN----QLNESIQVVVDFGRYPKGSNLNKDVVS 60

Query: 151 PQIW---PRQEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAH---- 199
           P +       +D P+     R  L FF G         VR KL ++     +++      
Sbjct: 61  PYVHVVDSFTDDEPQDPYESRPTLLFFRGRTFRKDEGIVRAKLAKILTGFDDVHYERSFA 120

Query: 200 SGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS 259
           +G      +  +  SKFCLH  G   ++ R+ D++   CVPVI+++  +LPF D +++  
Sbjct: 121 TGENIKLSSQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPFEDEIDYSQ 180

Query: 260 FSIVVATLD-------IPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYM 312
           FS+  +  +       I  L+K  K   SE +  L++    +  H+++   P   DA  M
Sbjct: 181 FSLFFSFKEALQPGYMIDQLRKFPKDKWSEMWRQLKN----ISHHYEFQYPPKKEDAVNM 236

Query: 313 VMYELWLRRSSVRV 326
           +  ++  +   VR+
Sbjct: 237 LWRQVKHKLPGVRL 250


>gi|297820564|ref|XP_002878165.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324003|gb|EFH54424.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 792

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 84/337 (24%), Positives = 144/337 (42%), Gaps = 73/337 (21%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIA------------RMRHDRRIGTEGIP 80
           Y S+  F +  + +   T +  +AD FF+P   +             M++   + +    
Sbjct: 390 YGSQMAFYENILATAHRTLNGEEADFFFVPVLDSCIINRADDAPHINMQNHTGLRSSFTL 449

Query: 81  DFISHYIFNISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQV 131
           +F      +I +KYPYWNR+ G DH +       AC++   I  S M   W    N+   
Sbjct: 450 EFYKRAYEHIVEKYPYWNRSAGRDHIWFFSWDEGACYAPKEIWNSMMLVHWG-NTNSKHN 508

Query: 132 VCSSSYF------ISGH--------IAHKDVSLPQIWPRQEDPPKLGSS-------KRNK 170
             +++Y+      IS             KD+ +P  W +  DP  + ++       KR  
Sbjct: 509 HSTTAYWGDNWDDISDERRGDHPCFDPRKDLVIP-AW-KVPDPYSMRANYWARPREKRKT 566

Query: 171 LAFFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGR------LKTP----- 206
           L +F G +                +R+KL + + +        G+      + TP     
Sbjct: 567 LFYFNGNLGPAYEKGRPEDSYSMGIRQKLAEEFGSSPNKEGKLGKQHAEDVIVTPLRSDN 626

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y  ++  S FC    G +  + R+ DS+  GCVPVII +   LP+ ++LN++SF++ V+ 
Sbjct: 627 YHKDIANSIFCGAFPG-DGWSGRMEDSILQGCVPVIIQDGIYLPYENMLNYESFAVRVSE 685

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVF 303
            DIP L   L+G S  E   +Q  +  V+K +Q  +F
Sbjct: 686 DDIPNLINTLRGFSETE---IQFRLANVKKLWQRFLF 719


>gi|32488405|emb|CAE02830.1| OSJNBa0043A12.35 [Oryza sativa Japonica Group]
 gi|90265244|emb|CAH67697.1| H0624F09.5 [Oryza sativa Indica Group]
          Length = 464

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 66/251 (26%), Positives = 112/251 (44%), Gaps = 36/251 (14%)

Query: 49  VTKDPSKADLFFLPFSIARMRHDRRIG---------TEGIPDFISHY-IFNISQKYPYWN 98
           V  D   AD+ F+PF  A +  +  +G          EG  D+     + +    +P W 
Sbjct: 126 VVADWRDADVVFVPF-FATLSAEMELGWGAKGAFRRKEGNEDYRRQREVVDRVTAHPAWR 184

Query: 99  RTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVV----------CSSSYFISGHIAHKDV 148
           R+GG DH +V    +    ++K  E+    + VV           S+S  +S  I H  V
Sbjct: 185 RSGGRDHVFVLTDPVAMWHVRK--EIAPAILLVVDFGGWYKLDSNSASSNVSHMIQHTQV 242

Query: 149 SLPQ--IWPRQEDPPKLGSSK---RNKLAFFAGAVN----SPVREKLLQVWRNDSEIYAH 199
           SL +  I P     P +  S+   R  L +F GA +      VREKL  +  N+ ++   
Sbjct: 243 SLLKDVIVPYTHLLPTMHLSENKDRPTLLYFKGAKHRHRGGLVREKLWDLMVNEPDVVME 302

Query: 200 SGRLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
            G       ++    +  S+FCLH  G    + R+ D++   C+PVI+++  +LPF  ++
Sbjct: 303 EGYPNATGREQSIKGMRTSEFCLHPAGDTPTSCRLFDAVASLCIPVIVSDEIELPFEGMI 362

Query: 256 NWKSFSIVVAT 266
           ++  F+I V+ 
Sbjct: 363 DYTEFAIFVSV 373


>gi|147818414|emb|CAN68950.1| hypothetical protein VITISV_039537 [Vitis vinifera]
          Length = 488

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 43/175 (24%), Positives = 89/175 (50%), Gaps = 11/175 (6%)

Query: 168 RNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLK----TPYADELLGSKFCLH 219
           R  L +F GA+       +R++L  + +++ +++   G  +       +  +  SKFCL+
Sbjct: 290 RPTLLYFQGAIYRKDGGFIRQELFYLLKDEKDVHFAFGNTQGNGINKASQGMHSSKFCLN 349

Query: 220 VKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLKKIL 276
           + G   ++ R+ D++   CVPVII++  +LP+ DVL++  F I V T D      L K++
Sbjct: 350 IAGDTPSSNRLFDAIASHCVPVIISDEIELPYEDVLDYSQFCIFVRTSDALKDKFLXKLI 409

Query: 277 KGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWSTS 331
           + I  +E+  +   + +V   F++     + DA  M+   +  +  ++R++   S
Sbjct: 410 RSIKKDEWTRMWRRLKEVENFFEFQYPSKEGDAVQMIWQAITRKVPAIRMKLHKS 464


>gi|359493064|ref|XP_002270238.2| PREDICTED: probable glucuronosyltransferase Os01g0926700-like
           [Vitis vinifera]
          Length = 483

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 67/253 (26%), Positives = 111/253 (43%), Gaps = 43/253 (16%)

Query: 52  DPSKADLFFLPFSIARMRHDRRIG--------TEGIPDFISH-YIFNISQKYPYWNRTGG 102
           D ++AD+ F+PF  A +  + ++G         EG  D+     +    +    W R+GG
Sbjct: 149 DVNEADVVFVPF-FATISAEIQLGGGKGVFRKKEGNEDYERQRQVMEFVRGTEAWKRSGG 207

Query: 103 ADHFYVACHSIGRSAMQKAWEVKLN---AIQVVC-----------SSSYFISGHIAHKDV 148
            DH +V    +        W VK     AI +V            +S+  +S  I H  V
Sbjct: 208 RDHVFVLTDPVA------MWHVKAEIAPAILLVVDFGGWYKLDSKASNNSLSEMIQHTQV 261

Query: 149 SLPQ--IWPRQEDPPKLGSSK---RNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAH 199
           SL +  I P     P+L  S+   R  L +F GA        VREKL  +   +  +   
Sbjct: 262 SLLKDVIVPYTHLLPRLHLSENQIRQTLLYFKGAKHRHRGGLVREKLWDLLVYEQGVIME 321

Query: 200 SGRLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
            G       ++    +  S+FCLH  G    + R+ D++   C+PVI++++ +LPF  ++
Sbjct: 322 EGFPNATGREQSIKGMRTSEFCLHPAGDTPTSCRLFDAIQSLCIPVIVSDNIELPFEGMV 381

Query: 256 NWKSFSIVVATLD 268
           ++  FS+ VA  D
Sbjct: 382 DYSEFSVFVAVRD 394


>gi|356538648|ref|XP_003537813.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase F8H-like
           [Glycine max]
          Length = 502

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 59/225 (26%), Positives = 100/225 (44%), Gaps = 26/225 (11%)

Query: 95  PYWNRTGGADHFYVACH-----SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDV 148
           P W R+GG DH     H     S+ R      W +      +  + +++  G +   KD+
Sbjct: 222 PAWKRSGGRDHILPVHHPWSFKSVRRYVKNAIWLLP----DMDSTGNWYKPGQVYLEKDL 277

Query: 149 SLPQIWPRQE--DPPKLGSS--KRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHS 200
            LP + P  +  D   L  +  KR+ L FF G +       +R KL         +    
Sbjct: 278 ILPYV-PNVDLCDAKCLSETNPKRSTLLFFRGRLKRNAGGKIRSKLGAELSGVDGVVIEE 336

Query: 201 GRL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLN 256
           G      K      +  S FCL   G   ++AR+ D++  GC+PVII++  +LPF  +L+
Sbjct: 337 GTAGDGGKEAAQRGMRKSLFCLSPAGDTPSSARLFDAIVSGCIPVIISDELELPFEGILD 396

Query: 257 WKSFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHF 298
           ++  ++ ++++D      L K LKGI       +Q  ++K  +HF
Sbjct: 397 YRKIAVFISSIDAVKPGWLLKYLKGIRPAHIKAMQQNLVKYSRHF 441


>gi|357162592|ref|XP_003579459.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
           [Brachypodium distachyon]
          Length = 474

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 75/289 (25%), Positives = 128/289 (44%), Gaps = 44/289 (15%)

Query: 52  DPSKADLFFLPFSIARMRHDRRIG----------TEGIPDFISHY-IFNISQKYPYWNRT 100
           D ++AD+ F+PF  A +  +  +G           EG  D+     + +    +P W R+
Sbjct: 138 DWTEADVVFVPF-FATLSAELELGWGATKGAFRRKEGNADYRRQREVVDRVTAHPAWRRS 196

Query: 101 GGADHFYVACHSIG----RSAMQKAWEV--------KLNAIQVVCSSSYFISGHIAHKDV 148
           GG DH +V    +     R+ +  A  +        KL++     +SS+ I     H  V
Sbjct: 197 GGRDHVFVLTDPMAMWHVRAEIAPAILLVVDFGGWYKLDSKSAGSNSSHMIQ----HTQV 252

Query: 149 SLPQ--IWPRQEDPPKLGSSK---RNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAH 199
           SL +  I P     P L  S+   R  L +F GA        VREKL  V  N+  +   
Sbjct: 253 SLLKDVIIPYTHLLPTLQLSENMDRPTLLYFKGAKHRHRGGLVREKLWDVMINEPGVVME 312

Query: 200 SGRLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
            G       ++    +  S+FCLH  G   ++ R+ D++   C+PVI+++  +LPF  ++
Sbjct: 313 EGFPNATGREQSIKGMRTSEFCLHPAGDTPSSCRLFDAVASLCIPVIVSDDIELPFEGMI 372

Query: 256 NWKSFSIVVA---TLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
           ++  FSI V+    +    L   LK IS ++    +  + KV+  F++ 
Sbjct: 373 DYTEFSIFVSVGNAMRPKWLASYLKTISKQQKDEFRRNLAKVQHIFEYE 421


>gi|224097754|ref|XP_002311068.1| predicted protein [Populus trichocarpa]
 gi|222850888|gb|EEE88435.1| predicted protein [Populus trichocarpa]
          Length = 380

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 72/280 (25%), Positives = 117/280 (41%), Gaps = 20/280 (7%)

Query: 52  DPSKADLFFLPF--SIARMRHDRRIGT-EGIPDF-ISHYIFNISQKYPYWNRTGGADHFY 107
           DP  A+ +F+PF  S++   H R +   E   D  +   + +  QK  YW R+GG DH  
Sbjct: 76  DPEIAEAYFVPFFSSLSFNTHGRNMTDPETEKDRQLQVDLIDFLQKSKYWQRSGGRDHVI 135

Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLP---QIWPRQEDPPKLG 164
              H      +++   V  + + V     Y  S     KDV  P    +   ++D     
Sbjct: 136 PMTHPNAFRFLRQL--VNASILIVADFGRYPKSLSTLSKDVVSPYVHNVDSFKDDDLLDP 193

Query: 165 SSKRNKLAFFAGAV----NSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELLG---SKF 216
              R  L FF G         VR KL ++    D   Y  S            G   SKF
Sbjct: 194 FESRKTLLFFRGNTVRKDKGKVRAKLEKILAGYDDVRYERSSPTAEAIQASTQGMRSSKF 253

Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI---VVATLDIPLLK 273
           CLH  G   ++ R+ D++   CVPVI+++  +LP+ D +++  FSI   +   +    L 
Sbjct: 254 CLHPAGDTPSSCRLFDAIVSHCVPVIVSDLIELPYEDEIDYSQFSIFFSINEAIQPDYLV 313

Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
             L+    + ++ +   + K+  HF++   P   DA  M+
Sbjct: 314 NQLRKFPKDRWIEMWRQLKKISHHFEFQYPPVKEDAVNML 353


>gi|18399194|ref|NP_564443.1| exostosin-like protein [Arabidopsis thaliana]
 gi|5091619|gb|AAD39607.1|AC007454_6 F23M19.7 [Arabidopsis thaliana]
 gi|15450928|gb|AAK96735.1| Unknown protein [Arabidopsis thaliana]
 gi|20148711|gb|AAM10246.1| unknown protein [Arabidopsis thaliana]
 gi|332193570|gb|AEE31691.1| exostosin-like protein [Arabidopsis thaliana]
          Length = 477

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 67/281 (23%), Positives = 118/281 (41%), Gaps = 39/281 (13%)

Query: 54  SKADLFFLPFSIARMRHDRRIGT--------EGIPDFISH-YIFNISQKYPYWNRTGGAD 104
           S AD+ F+PF  A +  +  +G          G  D+     + +  +    W R+ G D
Sbjct: 144 SDADVVFVPF-FATLSAEMELGNGKGSFRKKSGNEDYQRQRQVLDFVKNTKAWKRSNGRD 202

Query: 105 HFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ------------ 152
           H +V    +    +++  E+ L+ + VV    +F     +    SLP+            
Sbjct: 203 HVFVLTDPVAMWHVRE--EIALSILLVVDFGGWFRQDSKSSNGTSLPERIQHTQVSVIKD 260

Query: 153 -IWPRQEDPPKLGSS---KRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLK 204
            I P     P+L  S   +R+ L +F GA        +REKL  +  N+  +    G   
Sbjct: 261 VIVPYTHLLPRLDLSQNQRRHSLLYFKGAKHRHRGGLIREKLWDLLVNEPGVVMEEGFPN 320

Query: 205 TPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSF 260
               ++    +  S+FCLH  G    + R+ D++   C+PVI+++  +LPF  ++++  F
Sbjct: 321 ATGREQSIRGMRNSEFCLHPAGDTPTSCRLFDAIQSLCIPVIVSDTIELPFEGIIDYSEF 380

Query: 261 SIVVATLDIPL---LKKILKGISSEEYLLLQSYVLKVRKHF 298
           S+  +  D      L   L   S  E   L+S + KV+  F
Sbjct: 381 SVFASVSDALTPKWLANHLGRFSEREKETLRSRIAKVQSVF 421


>gi|359359122|gb|AEV41028.1| putative exostosin family domain-containing protein [Oryza minuta]
          Length = 459

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 66/250 (26%), Positives = 110/250 (44%), Gaps = 36/250 (14%)

Query: 49  VTKDPSKADLFFLPFSIARMRHDRRIG---------TEGIPDFISHY-IFNISQKYPYWN 98
           V  D   AD+ F+PF  A +  +  +G          EG  D+     + +    +P W 
Sbjct: 121 VVADWRDADVVFVPF-FATLSAEMELGWGAKGAFRRKEGNEDYCRQREVVDRVTAHPAWR 179

Query: 99  RTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVV----------CSSSYFISGHIAHKDV 148
           R+GG DH +V    +    ++K  E+    + VV           S+S   S  I H  V
Sbjct: 180 RSGGRDHVFVLTDPVAMWHVRK--EIAPAILLVVDFGGWYKLDSNSASSNFSHMIQHTQV 237

Query: 149 SLPQ--IWPRQEDPPKLGSSK---RNKLAFFAGAVN----SPVREKLLQVWRNDSEIYAH 199
           SL +  I P     P +  S+   R  L +F GA +      VREKL  +  N+ ++   
Sbjct: 238 SLLKDVIVPYTHLLPTMQLSENKERTTLLYFKGAKHRHRGGLVREKLWDLMVNEPDVVME 297

Query: 200 SGRLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
            G       ++    +  S+FCLH  G    + R+ D++   C+PVI+++  +LPF  ++
Sbjct: 298 EGYPNATGREQSIKGMRTSEFCLHPAGDTPTSCRLFDAVASLCIPVIVSDEIELPFEGMI 357

Query: 256 NWKSFSIVVA 265
           ++  F I V+
Sbjct: 358 DYTEFVIFVS 367


>gi|359359070|gb|AEV40977.1| putative exostosin family domain-containing protein [Oryza
           punctata]
          Length = 459

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 66/250 (26%), Positives = 110/250 (44%), Gaps = 36/250 (14%)

Query: 49  VTKDPSKADLFFLPFSIARMRHDRRIG---------TEGIPDFISHY-IFNISQKYPYWN 98
           V  D   AD+ F+PF  A +  +  +G          EG  D+     + +    +P W 
Sbjct: 121 VVADWRDADVVFVPF-FATLSAEMELGWGAKGAFRRKEGNEDYCRQREVVDRVTAHPAWR 179

Query: 99  RTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVV----------CSSSYFISGHIAHKDV 148
           R+GG DH +V    +    ++K  E+    + VV           S+S   S  I H  V
Sbjct: 180 RSGGRDHVFVLTDPVAMWHVRK--EIAPAILLVVDFGGWYKLDSNSASSNFSHMIQHTQV 237

Query: 149 SLPQ--IWPRQEDPPKLGSSK---RNKLAFFAGAVN----SPVREKLLQVWRNDSEIYAH 199
           SL +  I P     P +  S+   R  L +F GA +      VREKL  +  N+ ++   
Sbjct: 238 SLLKDVIVPYTHLLPTMQLSENKERTTLLYFKGAKHRHRGGLVREKLWDLMVNEPDVVME 297

Query: 200 SGRLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
            G       ++    +  S+FCLH  G    + R+ D++   C+PVI+++  +LPF  ++
Sbjct: 298 EGYPNATGREQSIKGMRTSEFCLHPAGDTPTSCRLFDAVASLCIPVIVSDEIELPFEGMI 357

Query: 256 NWKSFSIVVA 265
           ++  F I V+
Sbjct: 358 DYTEFVIFVS 367


>gi|224145217|ref|XP_002325567.1| predicted protein [Populus trichocarpa]
 gi|222862442|gb|EEE99948.1| predicted protein [Populus trichocarpa]
          Length = 476

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 68/259 (26%), Positives = 118/259 (45%), Gaps = 36/259 (13%)

Query: 41  KAFMKSHFVTKDPSKADLFFLPFSI---ARMRHDRRIGT----EGIPDFISH-YIFNISQ 92
           ++F K  F   D ++AD+ F+PF     A M   +  G+    EG  D+     + +I +
Sbjct: 134 RSFAKRVF---DCNEADVVFVPFFATLSAEMELAKGKGSFRRKEGNEDYRRQKQVVDIVR 190

Query: 93  KYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYF----------ISGH 142
               W R+GG DH +V    +    ++   E+    + VV    ++           S  
Sbjct: 191 NSDAWKRSGGKDHVFVLTDPVAMWHLRA--EIAPAILLVVDFGGWYRLDSKSSNGSSSDM 248

Query: 143 IAHKDVSLPQ--IWPRQEDPPKLGSS---KRNKLAFFAGAV----NSPVREKLLQVWRND 193
           I H  VSL +  I P     P+L  S   KR+ L +F GA        VREKL  +  N+
Sbjct: 249 IQHTQVSLLKDVIVPYTHLLPRLQLSENKKRSTLLYFKGAKHRHRGGIVREKLWDLLVNE 308

Query: 194 SEIYAHSGRLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
             +    G       ++    +  S+FCLH  G   ++ R+ D++   C+PV+++++ +L
Sbjct: 309 PGVIIEEGFPNATGREQSIRGMRSSEFCLHPAGDTPSSCRLFDAIQSLCIPVVVSDNIEL 368

Query: 250 PFADVLNWKSFSIVVATLD 268
           PF  ++++  F++ VA  D
Sbjct: 369 PFEGMVDYTEFAVFVAVDD 387


>gi|302800594|ref|XP_002982054.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
 gi|300150070|gb|EFJ16722.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
          Length = 453

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 71/297 (23%), Positives = 127/297 (42%), Gaps = 43/297 (14%)

Query: 52  DPSKADLFFLPF-----SIARMRHDR-----RIGTEGIPDFISHYIFNISQKYPYWNRTG 101
           DP +AD+ F+PF     + A++R+ +     R   E         +  I      W R+G
Sbjct: 116 DPDQADVVFVPFFAALSAEAQLRNGKGHFRHRKDNEDYER--QKAVMEIVTSSSRWQRSG 173

Query: 102 GADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ--------- 152
           G DH +V    +     +   E+  + + VV    +++    + +++S PQ         
Sbjct: 174 GRDHVFVLTDPMAMYHFRA--EIANSILLVVDFGGWYMEDAKSSRNLSSPQPIYHTQVSL 231

Query: 153 ----IWPRQEDPPKLGSSK----RNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHS 200
               I P     P L  S+    R  L +F GA        VR++L  V   +  +    
Sbjct: 232 IKDVIVPYTHLLPTLALSQDNAVRTTLLYFKGARYRHRTGLVRDQLWSVLDGEPGVLLEE 291

Query: 201 G----RLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLN 256
           G      +      +  S FCLH  G   ++ R+ D++   C+PVI+++  +LPF  +L+
Sbjct: 292 GFPNRTGQVQAVQGMRNSHFCLHPAGDTPSSCRLFDAVASLCIPVIVSDSIELPFEGMLD 351

Query: 257 WKSFSIVVATLDIPL---LKKILKGISSEEYLLLQSYVLKVRKHFQW-HVFPSDYDA 309
           +  F+I V+  D  L   L + L  +SS+    ++  +  V+ HF++ + FP    A
Sbjct: 352 YTQFAIFVSVHDALLPKWLVRHLSSLSSKVRNQMRHNLASVQHHFEYENGFPGGRGA 408


>gi|42566010|ref|NP_191322.3| exostosin family protein [Arabidopsis thaliana]
 gi|44917463|gb|AAS49056.1| At3g57630 [Arabidopsis thaliana]
 gi|46931284|gb|AAT06446.1| At3g57630 [Arabidopsis thaliana]
 gi|332646159|gb|AEE79680.1| exostosin family protein [Arabidopsis thaliana]
          Length = 793

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 82/332 (24%), Positives = 137/332 (41%), Gaps = 70/332 (21%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIA------------RMRHDRRIGTEGIP 80
           Y S+  F +  + +   T +  +AD FF+P   +             M++   + +    
Sbjct: 391 YGSQMAFYENILATAHRTMNGEEADFFFVPVLDSCIINRADDAPHINMQNHTGLRSSLTL 450

Query: 81  DFISHYIFNISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQV 131
           +F      +I +KYPYWNR+ G DH +       AC++   I  S M   W    N+   
Sbjct: 451 EFYKRAYEHIVEKYPYWNRSAGRDHIWFFSWDEGACYAPKEIWNSMMLVHWG-NTNSKHN 509

Query: 132 VCSSSYF------ISGH--------IAHKDVSLPQIWPRQEDPPKLGSS-------KRNK 170
             +++YF      IS             KD+ +P  W +  DP  +  +       KR  
Sbjct: 510 HSTTAYFGDNWDDISDERRGDHPCFDPRKDLVIP-AW-KVPDPYSMRKNYWERPREKRKT 567

Query: 171 LAFFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGR------LKTP----- 206
           L +F G +                +R+KL + + +        G+      + TP     
Sbjct: 568 LFYFNGNLGPAYEKGRPEDSYSMGIRQKLAEEFGSSPNKEGKLGKQHAEDVIVTPLRSDN 627

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y  ++  S FC    G +  + R+ DS+  GCVPVII +   LP+ ++LN++SF++ V  
Sbjct: 628 YHKDIANSIFCGAFPG-DGWSGRMEDSILQGCVPVIIQDGIYLPYENMLNYESFAVRVNE 686

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
            DIP L   L+G S  E       V ++ + F
Sbjct: 687 DDIPNLINTLRGFSEAEIQFRLGNVKELWQRF 718


>gi|42572713|ref|NP_974452.1| exostosin family protein [Arabidopsis thaliana]
 gi|110740929|dbj|BAE98560.1| hypothetical protein [Arabidopsis thaliana]
 gi|332646160|gb|AEE79681.1| exostosin family protein [Arabidopsis thaliana]
          Length = 791

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 82/332 (24%), Positives = 137/332 (41%), Gaps = 70/332 (21%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIA------------RMRHDRRIGTEGIP 80
           Y S+  F +  + +   T +  +AD FF+P   +             M++   + +    
Sbjct: 389 YGSQMAFYENILATAHRTMNGEEADFFFVPVLDSCIINRADDAPHINMQNHTGLRSSLTL 448

Query: 81  DFISHYIFNISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQV 131
           +F      +I +KYPYWNR+ G DH +       AC++   I  S M   W    N+   
Sbjct: 449 EFYKRAYEHIVEKYPYWNRSAGRDHIWFFSWDEGACYAPKEIWNSMMLVHWG-NTNSKHN 507

Query: 132 VCSSSYF------ISGH--------IAHKDVSLPQIWPRQEDPPKLGSS-------KRNK 170
             +++YF      IS             KD+ +P  W +  DP  +  +       KR  
Sbjct: 508 HSTTAYFGDNWDDISDERRGDHPCFDPRKDLVIP-AW-KVPDPYSMRKNYWERPREKRKT 565

Query: 171 LAFFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGR------LKTP----- 206
           L +F G +                +R+KL + + +        G+      + TP     
Sbjct: 566 LFYFNGNLGPAYEKGRPEDSYSMGIRQKLAEEFGSSPNKEGKLGKQHAEDVIVTPLRSDN 625

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y  ++  S FC    G +  + R+ DS+  GCVPVII +   LP+ ++LN++SF++ V  
Sbjct: 626 YHKDIANSIFCGAFPG-DGWSGRMEDSILQGCVPVIIQDGIYLPYENMLNYESFAVRVNE 684

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
            DIP L   L+G S  E       V ++ + F
Sbjct: 685 DDIPNLINTLRGFSEAEIQFRLGNVKELWQRF 716


>gi|296085881|emb|CBI31205.3| unnamed protein product [Vitis vinifera]
          Length = 455

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 43/175 (24%), Positives = 89/175 (50%), Gaps = 11/175 (6%)

Query: 168 RNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLK----TPYADELLGSKFCLH 219
           R  L +F GA+       +R++L  + +++ +++   G  +       +  +  SKFCL+
Sbjct: 257 RPTLLYFQGAIYRKDGGFIRQELFYLLKDEKDVHFAFGNTQGNGINKASQGMHSSKFCLN 316

Query: 220 VKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLKKIL 276
           + G   ++ R+ D++   CVPVII++  +LP+ DVL++  F I V T D      L K++
Sbjct: 317 IAGDTPSSNRLFDAIASHCVPVIISDEIELPYEDVLDYSQFCIFVRTSDALKDKFLIKLI 376

Query: 277 KGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWSTS 331
           + I  +E+  +   + +V   F++     + DA  M+   +  +  ++R++   S
Sbjct: 377 RSIKKDEWTRMWRRLKEVENFFEFQYPSKEGDAVQMIWQAITRKVPAIRMKLHKS 431


>gi|218195798|gb|EEC78225.1| hypothetical protein OsI_17865 [Oryza sativa Indica Group]
 gi|222629749|gb|EEE61881.1| hypothetical protein OsJ_16573 [Oryza sativa Japonica Group]
          Length = 452

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 64/244 (26%), Positives = 110/244 (45%), Gaps = 36/244 (14%)

Query: 56  ADLFFLPFSIARMRHDRRIG---------TEGIPDFISHY-IFNISQKYPYWNRTGGADH 105
           AD+ F+PF  A +  +  +G          EG  D+     + +    +P W R+GG DH
Sbjct: 121 ADVVFVPF-FATLSAEMELGWGAKGAFRRKEGNEDYRRQREVVDRVTAHPAWRRSGGRDH 179

Query: 106 FYVACHSIGRSAMQKAWEVKLNAIQVV----------CSSSYFISGHIAHKDVSLPQ--I 153
            +V    +    ++K  E+    + VV           S+S  +S  I H  VSL +  I
Sbjct: 180 VFVLTDPVAMWHVRK--EIAPAILLVVDFGGWYKLDSNSASSNVSHMIQHTQVSLLKDVI 237

Query: 154 WPRQEDPPKLGSSK---RNKLAFFAGAVN----SPVREKLLQVWRNDSEIYAHSGRLKTP 206
            P     P +  S+   R  L +F GA +      VREKL  +  N+ ++    G     
Sbjct: 238 VPYTHLLPTMHLSENKDRPTLLYFKGAKHRHRGGLVREKLWDLMVNEPDVVMEEGYPNAT 297

Query: 207 YADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
             ++    +  S+FCLH  G    + R+ D++   C+PVI+++  +LPF  ++++  F+I
Sbjct: 298 GREQSIKGMRTSEFCLHPAGDTPTSCRLFDAVASLCIPVIVSDEIELPFEGMIDYTEFAI 357

Query: 263 VVAT 266
            V+ 
Sbjct: 358 FVSV 361


>gi|357474305|ref|XP_003607437.1| Cation proton exchanger [Medicago truncatula]
 gi|355508492|gb|AES89634.1| Cation proton exchanger [Medicago truncatula]
          Length = 1198

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 59/226 (26%), Positives = 102/226 (45%), Gaps = 24/226 (10%)

Query: 95  PYWNRTGGADHFYVACH-----SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDV 148
           P W R+GG DH     H     ++ R   +  W +      +  + +++  G +   KD+
Sbjct: 222 PAWKRSGGRDHILPVHHPWSFKTVRRYVKKAIWLLP----DMDSTGNWYKPGQVYLEKDL 277

Query: 149 SLPQIWPRQ-EDPPKLG--SSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSG 201
            LP +      D   L   + KRN L FF G +       +R KL+   R    +    G
Sbjct: 278 ILPYVANVDFCDATCLSEINPKRNTLLFFRGRLKRNAGGKIRSKLVDQLRGADGVVIEEG 337

Query: 202 RL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
                 K    + +  S FCL+  G   ++AR+ D++  GC+PVI+++  +LPF  +L++
Sbjct: 338 TSGEGGKEAAQNGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDY 397

Query: 258 KSFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
           +  ++ V++ D      L K LK I S     +Q  + K  +HF +
Sbjct: 398 RKIALFVSSNDALKPSWLLKYLKDIRSAHIKEMQQNLAKYSRHFLY 443


>gi|225439168|ref|XP_002268382.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase [Vitis
           vinifera]
          Length = 488

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 72/300 (24%), Positives = 137/300 (45%), Gaps = 26/300 (8%)

Query: 52  DPSKADLFFLPF--SIARMRHDRRIGTE--GIPDFISHYIFNISQKYPYWNRTGGADHFY 107
           + S+AD+ F+PF  S++     +  G +     + +   +         W R+ G DH  
Sbjct: 171 NSSEADVIFVPFFSSLSYNHFGKFKGRQKKNENNLLQDKLVKFLTAQEEWIRSEGRDHII 230

Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLPQ---IWPRQEDPPK 162
           +A H    ++M  A  +KL     + S       +IA+  KDV  P    I     D   
Sbjct: 231 MAHHP---NSMLDA-RMKLWPAIFILSDFGRYPPNIANVGKDVIAPYKHVIKSFINDTSD 286

Query: 163 LGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLK----TPYADELLGS 214
             S  R  L +F GA+       +R++L  + +++ +++   G  +       +  +  S
Sbjct: 287 FDS--RPTLLYFQGAIYRKDGGFIRQELFYLLKDEKDVHFAFGNTQGNGINKASQGMHSS 344

Query: 215 KFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PL 271
           KFCL++ G   ++ R+ D++   CVPVII++  +LP+ DVL++  F I V T D      
Sbjct: 345 KFCLNIAGDTPSSNRLFDAIASHCVPVIISDEIELPYEDVLDYSQFCIFVRTSDALKDKF 404

Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWSTS 331
           L K+++ I  +E+  +   + +V   F++     + DA  M+   +  +  ++R++   S
Sbjct: 405 LIKLIRSIKKDEWTRMWRRLKEVENFFEFQYPSKEGDAVQMIWQAITRKVPAIRMKLHKS 464


>gi|255644422|gb|ACU22716.1| unknown [Glycine max]
          Length = 189

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 43/161 (26%), Positives = 85/161 (52%), Gaps = 10/161 (6%)

Query: 172 AFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKFCLHVKGFEVNT 227
           AF  G     +R++L  + +++ +++   G +     ++    +  SKFCL++ G   ++
Sbjct: 13  AFLKGGA---IRQELYYLLKDEKDVHFTFGSIGGNGINQASQGMAMSKFCLNIAGDTPSS 69

Query: 228 ARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLKKILKGISSEEY 284
            R+ D++   CVPVII++  +LPF DVL++  FSI V   D      L  +L+ I+ +E+
Sbjct: 70  NRLFDAIVSHCVPVIISDEIELPFEDVLDYSDFSIFVRASDSMKKGYLLNLLRSITQKEW 129

Query: 285 LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
             +   + ++  HF++       DA  M+  ++  + SS+R
Sbjct: 130 SKMWERLKQITHHFEYQYPSQPGDAVNMIWQQVERKISSIR 170


>gi|302792172|ref|XP_002977852.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
 gi|300154555|gb|EFJ21190.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
          Length = 434

 Score = 68.2 bits (165), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 66/269 (24%), Positives = 122/269 (45%), Gaps = 24/269 (8%)

Query: 52  DPSKADLFFLPF--SIARMRHDRRIGTEGIP---DFISHYIFNISQKYPYWNRTGGADHF 106
           +P +AD+F++PF  S++ +  + +   +  P   + I   +    ++   W +  G DH 
Sbjct: 128 NPQEADVFYVPFFSSLSLVVGNGKSEDDEDPYSDEDIQEELMAWLEEQESWKKNKGRDHV 187

Query: 107 YVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-KDVSLPQIWPRQEDPPKLGS 165
            V C     +A+++  +   N + ++     F     +  KDV LP          +  +
Sbjct: 188 -VICQDP--NALKRLRDRLKNTVLLLSDFERFKPDQASLVKDVVLPYTHRIDSYFNENVT 244

Query: 166 SKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIY------AHSGRLKTPYADELLGSK 215
             R+ L FF G         +R++L QV   + ++       +  GR        +  SK
Sbjct: 245 LDRDTLLFFMGNRYRKEGGKIRDQLFQVLDVEPDMVMKHGTQSREGRRLAKVG--MQTSK 302

Query: 216 FCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT---LDIPLL 272
           FCLH  G   +  R+ D++   CVPVI+++  +LPF D L++  F+I V +   L+   L
Sbjct: 303 FCLHPAGDTPSACRLFDAIVSVCVPVIVSDDIELPFEDELDYSEFAIFVPSINALEPGYL 362

Query: 273 KKILKGISSEEYLLLQSYVLKVRKHFQWH 301
              L+ IS +     Q  + +VRK+F++ 
Sbjct: 363 GSYLRSISPDLLKQKQQRLREVRKYFEYE 391


>gi|303271929|ref|XP_003055326.1| glycosyltransferase family 47 protein [Micromonas pusilla CCMP1545]
 gi|226463300|gb|EEH60578.1| glycosyltransferase family 47 protein [Micromonas pusilla CCMP1545]
          Length = 595

 Score = 68.2 bits (165), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 36/123 (29%), Positives = 63/123 (51%)

Query: 191 RNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLP 250
           R D ++ +      + Y   L  SKFCLHV+G  V   R+ +++ +GCVPVIIA+ YDLP
Sbjct: 451 RTDWDLDSDGATTPSGYMKLLARSKFCLHVRGTRVYAPRLVEAMLFGCVPVIIADGYDLP 510

Query: 251 FADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAF 310
            +  L+W +FS+ +   +     +  + + + ++      + +V   F +H  P   DA 
Sbjct: 511 LSWFLDWDAFSVRMTEREGVNATRAAEIVDAADWREKHEALRRVVGFFMYHDPPVFGDAL 570

Query: 311 YMV 313
           +  
Sbjct: 571 WAT 573


>gi|3176669|gb|AAC18793.1| End is cut off [Arabidopsis thaliana]
          Length = 440

 Score = 68.2 bits (165), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 72/314 (22%), Positives = 132/314 (42%), Gaps = 55/314 (17%)

Query: 52  DPSKADLFFLPF--SIARMRHDRRIGTEGIPDF---ISHYIFNISQKYPYWNRTGGADHF 106
           DP  AD+F++PF  S++   H + + T+   +F   +   +    +   YWNR+GG DH 
Sbjct: 122 DPDLADVFYVPFFSSLSFNTHGKNM-TDPDTEFDRLLQVELMEFLENSKYWNRSGGKDHV 180

Query: 107 YVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWP----RQEDPPK 162
               H      +++  +V  + + VV    Y        KDV  P +       +E    
Sbjct: 181 IPMTHPNAFRFLRQ--QVNASILIVVDFGRYSKDMARLSKDVVSPYVHVVESLNEEGDDG 238

Query: 163 LGS--SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIY-----------------AH 199
           +G     R  L +F G         +R +L ++   +S+++                  +
Sbjct: 239 MGDPFEARTTLLYFRGNTVRKDEGKIRLRLEKLLAGNSDVHFEKSVATTQNIKVSDLEQN 298

Query: 200 SGR-LKTPYADE----------------LLGSKFCLHVKGFEVNTARIADSLYYGCVPVI 242
            GR L   Y ++                +  SKFCLH  G   ++ R+ D++   C+PVI
Sbjct: 299 RGRYLMLTYQNDSNCSTCVLFCYQSTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVI 358

Query: 243 IANHYDLPFADVLNWKSFSI---VVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQ 299
           I++  +LPF D +++  FS+   +  +L+   +   L+    E++L +   +  V  HF+
Sbjct: 359 ISDKIELPFEDEIDYSEFSLFFSIKESLEPGYILNNLRQFPKEKWLEMWKRLKNVSHHFE 418

Query: 300 WHVFPSDYDAFYMV 313
           +   P   DA  M+
Sbjct: 419 FQYPPKREDAVNML 432


>gi|293336592|ref|NP_001167834.1| uncharacterized protein LOC100381534 [Zea mays]
 gi|223944319|gb|ACN26243.1| unknown [Zea mays]
          Length = 241

 Score = 68.2 bits (165), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 48/171 (28%), Positives = 80/171 (46%), Gaps = 18/171 (10%)

Query: 135 SSYFISGHIAHKDVS----LPQIWPR-QEDPPKLGSSKRNKLAFFAGAV----------- 178
            ++ + GH   ++V      P + P    + P+   ++R+  AFF G +           
Sbjct: 21  QTFGVQGHHVCQEVEHVVIPPHVPPEVAHELPEPEKAQRDIFAFFRGKMEVHPKNISGRF 80

Query: 179 -NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYG 237
            +  VR +LLQ +  + + Y    R    Y  E+  S FCL   G+   + R+ +S+  G
Sbjct: 81  YSKKVRTELLQHYGRNRKFYLKRKRFDN-YRSEMARSLFCLCPLGWAPWSPRLVESVLLG 139

Query: 238 CVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
           C+PVIIA+   LPF  VL W+  S+ VA  DI  L  +L  + +    ++Q
Sbjct: 140 CIPVIIADDIRLPFPPVLQWQEISLQVAEKDIASLGMVLDHVVATNLTVIQ 190


>gi|307107123|gb|EFN55367.1| hypothetical protein CHLNCDRAFT_52572 [Chlorella variabilis]
          Length = 815

 Score = 68.2 bits (165), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 78/339 (23%), Positives = 139/339 (41%), Gaps = 75/339 (22%)

Query: 25  VDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLP-----------------FSIAR 67
            ++E  G Y +E+   +A ++S   T DP +AD F++P                 F  A 
Sbjct: 409 TNWEDSGGYLAETGLHEALLQSKHRTLDPEEADYFYIPVYSSCYMYPIHGFADTPFFHAF 468

Query: 68  MRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG--------RSAMQ 119
            +  R   T  +   + H++      +PYW+R+GG DH  +  H  G        R A  
Sbjct: 469 HKIPRVHATTNMLIEVYHWL---RAHHPYWDRSGGRDHIILQSHDEGSCWLPAVLRPATM 525

Query: 120 KAWEVKLNAIQVVCSSSYFI------------------SGHIAH-------KDVSLPQIW 154
                +++      SS+ +I                   G +         KD+ +P + 
Sbjct: 526 LTHWGRMDLGHT--SSTGYIDDVYSRPARHPIYMPEGTEGKLGDFPCYDPAKDLVVPPMT 583

Query: 155 P--RQEDPPKLGSSKRNK--LAFFAGAV---NSP----VREKLLQVWRNDS-----EIYA 198
              + E  P +G+  RN+  LAFF G     N P    +R+ L  + R+       +I+ 
Sbjct: 584 SPLKYELSPLVGAFTRNRTTLAFFKGRTQQNNKPYSRGIRQTLENLCRDKDWWGKFKIWI 643

Query: 199 HSGR---LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
             G    +   Y+  L  S FC  + G +  + R  D++ +GC+PVII +   L F  ++
Sbjct: 644 GEGNPPDMDRTYSQLLASSTFCFVLPG-DGFSPRFEDAVQHGCLPVIIQDEVHLAFESII 702

Query: 256 NWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
           +++ F + +   D+  + +IL  I  E+   +Q  +  V
Sbjct: 703 DYRKFVVRIQQKDMERVPEILGAIPPEKVQTMQKALATV 741


>gi|302837496|ref|XP_002950307.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
 gi|300264312|gb|EFJ48508.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
          Length = 368

 Score = 68.2 bits (165), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 86/349 (24%), Positives = 147/349 (42%), Gaps = 55/349 (15%)

Query: 4   SFRVYVY--PHRRNDPFANVLLPVDFEPRGN-YASESYFKKAFMKSHFV-TKDPSKADLF 59
            F++Y+Y  P     P+    L  D   R N Y +   F K F+    V T++P +A+LF
Sbjct: 10  EFKIYMYELPWEIAFPYE---LGEDVHTRDNIYTAYEEFMKYFLVDDMVRTQNPYEANLF 66

Query: 60  FLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQ 119
           ++P        + R G       I +    I  K+P++NRTGG DHF            Q
Sbjct: 67  YVPALTYFYATNVRNGQWQAEAVIEY----IRTKWPFYNRTGGRDHFVFFTGDRASCHFQ 122

Query: 120 KAWEVKLNAIQVV---CSSSYFISGHIAHKDVSLPQIWPRQEDPPK---LG--------- 164
           + W ++ + I+VV             I+++D +  Q       PP+   LG         
Sbjct: 123 R-W-IQDSVIKVVHFGMQHRNLTWNEISNRDYACIQNKRDLVVPPRTVNLGPLLPSFSTP 180

Query: 165 -----------SSKRNKLAFFAGAV-----NSPVREKLLQVWRNDSEIYAH----SGRL- 203
                         R  L FFAG V     +  VR  + Q+  + + + A      GR+ 
Sbjct: 181 YYKWLVSNQGYDGNRTLLFFFAGGVADGEYSGGVRLAIKQMLSSITHLPADVKFVEGRVG 240

Query: 204 --KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFS 261
             +  Y   +  SKFC+   G      R+  +++ GCVPVII ++    F D L ++ FS
Sbjct: 241 GGEDEYFAMIRASKFCIAPYGHGWGN-RLVQAVHLGCVPVIIQDYVYQAFEDFLPYEDFS 299

Query: 262 IVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAF 310
           + +   D+P +  +L+  S  +   L+   L + ++++  ++  +YD  
Sbjct: 300 VRMRLADVPHMIDLLRSYSEADLARLR---LGLARYYRAFIWEREYDGL 345


>gi|125582866|gb|EAZ23797.1| hypothetical protein OsJ_07509 [Oryza sativa Japonica Group]
          Length = 322

 Score = 67.8 bits (164), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 45/160 (28%), Positives = 76/160 (47%), Gaps = 14/160 (8%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAF--MKSHFVTKDPSKADL 58
           M R F+++ Y      P A++    D      Y+ E  F       +S F  + P  A  
Sbjct: 102 MERRFKIWTY-REGEPPVAHIGPGTDI-----YSIEGQFMYEMDDPRSRFAARRPDDAHA 155

Query: 59  FFLPFSIARMRH-DRRIGTEG----IPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
           F LP S+  + H   R+   G    +   ++ Y+  ++++YPYWNR+ GADH  V+CH  
Sbjct: 156 FLLPISVCNLVHYVYRLNATGDLAPLRGLVADYVRVVAERYPYWNRSRGADHVIVSCHDW 215

Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI 153
                    ++  NAI+V+C+++    G    KD +LP++
Sbjct: 216 APMVTSAHRQLYGNAIRVLCNANTS-EGFRPRKDATLPEM 254



 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 34/124 (27%), Positives = 57/124 (45%), Gaps = 22/124 (17%)

Query: 226 NTARIADSLYYGC---VPVIIANHYDL------------------PFADVLNWKSFSIVV 264
           N +R AD +   C    P++ + H  L                  P  D       S+ V
Sbjct: 200 NRSRGADHVIVSCHDWAPMVTSAHRQLYGNAIRVLCNANTSEGFRPRKDA-TLPEMSVAV 258

Query: 265 ATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSV 324
               IP L+ IL+ +S   Y +L++ VL+ ++HF  H     +D  +MV++ +WLRR +V
Sbjct: 259 PAARIPELRAILRRVSERRYRVLRARVLQAQRHFVLHRPARRFDMIHMVLHSIWLRRLNV 318

Query: 325 RVQW 328
           R+ +
Sbjct: 319 RLPY 322


>gi|302847711|ref|XP_002955389.1| hypothetical protein VOLCADRAFT_96311 [Volvox carteri f.
           nagariensis]
 gi|300259231|gb|EFJ43460.1| hypothetical protein VOLCADRAFT_96311 [Volvox carteri f.
           nagariensis]
          Length = 1222

 Score = 67.8 bits (164), Expect = 7e-09,   Method: Composition-based stats.
 Identities = 71/287 (24%), Positives = 111/287 (38%), Gaps = 73/287 (25%)

Query: 52  DPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
           D   AD +F+P     +RH     T      +S  +  I + +P+WNRT G  HF +A  
Sbjct: 616 DGDTADWYFIPVV---LRHFCDATT------LSAAVSYIREMHPWWNRTQGHRHFVIATG 666

Query: 112 SIGRSAMQKAWE------VKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS 165
            +GRS  ++         V    +     SS + + H    D+ LP           LGS
Sbjct: 667 DMGRSESERGHLTANVTFVSYWGLHAPKLSSGWRASHRNATDIVLPVF---------LGS 717

Query: 166 SKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRL----------------KTP-- 206
            K +++  F   ++     K     R  +  I+  +GR+                K+P  
Sbjct: 718 PKLSRMGIFTSRLHPKFATKAPHELRERNGPIFFFAGRICGDHSKPQVDGVWPNCKSPHN 777

Query: 207 -----------------------------YADELLGSKFCLHVKGFEVNTARIADSLYYG 237
                                        YA  LL SKFC    G   +  R   ++  G
Sbjct: 778 MGYSGGTRQKIHFHHWNRTGYFIQLGDRHYAKHLLTSKFCFGPTGGG-HGQRQMQAVQAG 836

Query: 238 CVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
           CVPV+I++     F   L+W +F + +A  DIP + ++L+ IS EEY
Sbjct: 837 CVPVVISDDVLEAFEPFLDWNTFGVRLAEADIPRMHEVLEAISPEEY 883



 Score = 66.6 bits (161), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 62/233 (26%), Positives = 104/233 (44%), Gaps = 40/233 (17%)

Query: 52  DPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
           D   AD +F+P +   +RH     T      +S  +  I + +P+WNRT G  HF +A  
Sbjct: 139 DGDTADWYFIPVT---LRHTADAYT------LSAAVSYIREMHPWWNRTHGHRHFVIAIG 189

Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKL 171
            +GR   ++  +          ++  F++    H  +  P+++   +      +S RN  
Sbjct: 190 DMGRLESERGRQ---------STNVTFVT----HWGLHAPKLFSGWK------ASHRN-- 228

Query: 172 AFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIA 231
              A  +  PV       W N +  +   G     YA  LL SK+C    G   +  R  
Sbjct: 229 ---ATDIVLPVH---FHHW-NRTGYFIQLG--DRHYAKHLLTSKYCFGPTGGG-HGQRQM 278

Query: 232 DSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
            ++  GCVPV+I++     F   L+W +F + +A  DIP + ++L+ IS EEY
Sbjct: 279 QAVQAGCVPVVISDDVLEAFEPFLDWNTFGVRLAEADIPRMHEVLEAISPEEY 331


>gi|302795426|ref|XP_002979476.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
 gi|300152724|gb|EFJ19365.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
          Length = 434

 Score = 67.4 bits (163), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 68/288 (23%), Positives = 125/288 (43%), Gaps = 24/288 (8%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPF--SIARMRHDRRIGTEGIP---DFISHYI 87
           Y  E    +    S     +P +AD+F++PF  S++ +  + +   +  P   +     +
Sbjct: 109 YDLEQGEDRRLSDSSVRVMNPQEADVFYVPFFSSLSLVVGNGKSEDDEDPYSDEDTQEEL 168

Query: 88  FNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-K 146
               ++   W +  G DH  V C     +A+++  +   N + ++     F     +  K
Sbjct: 169 MAWLEEQESWKKNKGRDHV-VICQDP--NALKRLRDRLKNTVLLLSDFERFKPDQASLVK 225

Query: 147 DVSLPQIWPRQEDPPKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEI------ 196
           DV LP          +  +  R+ L FF G         +R++L QV   + ++      
Sbjct: 226 DVVLPYTHRIDSYSNENVTLDRDTLLFFMGNRYRKEGGKIRDQLFQVLDVEPDMVMKHGT 285

Query: 197 YAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLN 256
            +  GR        +  SKFCLH  G   +  R+ D++   CVPVI+++  +LPF D L+
Sbjct: 286 QSREGRRLAKVG--MQTSKFCLHPAGDTPSACRLFDAIVSVCVPVIVSDDIELPFEDELD 343

Query: 257 WKSFSIVVAT---LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
           +  F+I V +   L+   L   L+ IS +     Q  + +VRK+F++ 
Sbjct: 344 YSEFAIFVPSINALEPGYLGSYLRSISPDLLKQKQQRLREVRKYFEYE 391


>gi|256088012|ref|XP_002580154.1| exostosin-1 [Schistosoma mansoni]
 gi|353230126|emb|CCD76297.1| putative exostosin-1 [Schistosoma mansoni]
          Length = 766

 Score = 67.4 bits (163), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 89/370 (24%), Positives = 148/370 (40%), Gaps = 78/370 (21%)

Query: 5   FRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFS 64
           F+VYVYP       AN+ L           +      A   S  +T DP++A   F+P S
Sbjct: 95  FKVYVYPLSSKQ--ANLSL-----------TYQKILTALHNSKLLTSDPNEA-CIFIP-S 139

Query: 65  IARMRHDRRIGTEGIPDFISHYIFNISQKYPYWN--------RTGGADHFYVACHSIGRS 116
           +  +  DR     G   +I+H + N+    P+WN        +  G +H     H+ G  
Sbjct: 140 LDTLDRDRLSPHFG--QYIAHELVNL----PFWNSLPRRDLDKYAGRNHLIFNLHA-GTW 192

Query: 117 AMQKAWEVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSSKRNKLAF-- 173
                 E +L   Q + + + F + H   K D+SLP I    + P + GSS+ N+L    
Sbjct: 193 PYYYEDEYRLWLGQAMLAKASFSTKHFRPKFDISLPLI--HSQHPLQSGSSQLNQLVSSE 250

Query: 174 ------------------FAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTP---YADE-- 210
                             +   + S  R+ L  +      I   + R  T    YAD+  
Sbjct: 251 HLRGRLDLPYLLSFKGKRYVSGIGSASRDILFHLHNGKDIIMLTTCRHGTDWTRYADKRC 310

Query: 211 ---------------LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
                          +  S FCL  +G  + + R  + L  GC+PV+++N  +LPF++V+
Sbjct: 311 ATDMALYDAYDYWELMYNSTFCLVPRGRRLGSYRFLEVLQAGCIPVMLSNDLELPFSEVI 370

Query: 256 NWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMY 315
           +W    I        LL   L+ I+S + +  +  V+     F WH + S  ++  +   
Sbjct: 371 DWNRAVIWADERLPLLLPLSLRRITSHQIIQYRQQVM-----FLWHTYLSSIESIVLTTL 425

Query: 316 ELWLRRSSVR 325
           E+   R S+R
Sbjct: 426 EIIRDRVSLR 435


>gi|255543228|ref|XP_002512677.1| catalytic, putative [Ricinus communis]
 gi|223548638|gb|EEF50129.1| catalytic, putative [Ricinus communis]
          Length = 253

 Score = 67.4 bits (163), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 52/192 (27%), Positives = 90/192 (46%), Gaps = 18/192 (9%)

Query: 140 SGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAV----NSP-----VREKLLQVW 190
            G I     + PQ    ++ PP+   + R+   +F G      N P      R     VW
Sbjct: 34  EGSITIPPYAPPQKMQARQIPPE---TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVW 90

Query: 191 RN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
            N  ++ ++  S    T Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+   
Sbjct: 91  ENFKNNPLFDISTEHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 150

Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY- 307
           LPFAD + W+   + VA  D+P L  IL  I + + +L +  +L      +  +FP    
Sbjct: 151 LPFADAIPWEEIGVFVAEEDVPNLDTILTSIPT-QVVLRKQRLLANPSMKRAMLFPQPAQ 209

Query: 308 --DAFYMVMYEL 317
             DAF+ ++  L
Sbjct: 210 SGDAFHQILNGL 221


>gi|81298858|ref|YP_399066.1| TPR repeat-containing protein [Synechococcus elongatus PCC 7942]
 gi|81167739|gb|ABB56079.1| TPR repeat [Synechococcus elongatus PCC 7942]
          Length = 788

 Score = 67.4 bits (163), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 53/191 (27%), Positives = 94/191 (49%), Gaps = 31/191 (16%)

Query: 131 VVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLG-----SSKRNKLAFFAGAVNSPVREK 185
           VV S  + +  +    DVS P +       P L      S  R+ L  F GA + PVRE+
Sbjct: 114 VVVSHCHTLGNYRVGIDVSFPPM-------PLLDQHCYPSRDRSTLLSFRGANSHPVREQ 166

Query: 186 LLQVWRNDSEIYAHS------GRL------------KTPYADELLGSKFCLHVKGFEVNT 227
           L Q      EI A        G L            +  Y D +  S+F +  +G ++ +
Sbjct: 167 L-QRLHQPPEIAAELIQQSYWGTLNYVDEAEGLSAEQQVYTDLIARSRFSVAPRGHDIFS 225

Query: 228 ARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLL 287
            R+ + +  G +PVI+A+ + LPF+++L+W  FS+ VA      L ++L+ IS++++ ++
Sbjct: 226 YRLLEVMAGGAIPVILADDWVLPFSELLDWSEFSLSVAEDRCWELPQLLQAISTDQWQVM 285

Query: 288 QSYVLKVRKHF 298
           Q ++ +V +H+
Sbjct: 286 QQHLQQVYQHY 296


>gi|56751461|ref|YP_172162.1| hypothetical protein syc1452_c [Synechococcus elongatus PCC 6301]
 gi|56686420|dbj|BAD79642.1| hypothetical protein [Synechococcus elongatus PCC 6301]
          Length = 788

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 53/191 (27%), Positives = 94/191 (49%), Gaps = 31/191 (16%)

Query: 131 VVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLG-----SSKRNKLAFFAGAVNSPVREK 185
           VV S  + +  +    DVS P +       P L      S  R+ L  F GA + PVRE+
Sbjct: 114 VVVSHCHTLGNYRVGIDVSFPPM-------PLLDQHCYPSRDRSTLLSFRGANSHPVREQ 166

Query: 186 LLQVWRNDSEIYAHS------GRL------------KTPYADELLGSKFCLHVKGFEVNT 227
           L Q      EI A        G L            +  Y D +  S+F +  +G ++ +
Sbjct: 167 L-QRLHQPPEIAAELIQQSYWGTLNYVDEAEGLSAEQQVYTDLIARSRFSVAPRGHDIFS 225

Query: 228 ARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLL 287
            R+ + +  G +PVI+A+ + LPF+++L+W  FS+ VA      L ++L+ IS++++ ++
Sbjct: 226 YRLLEVMAGGAIPVILADDWVLPFSELLDWSEFSLSVAEDRCWELPQLLQAISTDQWQVM 285

Query: 288 QSYVLKVRKHF 298
           Q ++ +V +H+
Sbjct: 286 QQHLQQVYQHY 296


>gi|414866628|tpg|DAA45185.1| TPA: hypothetical protein ZEAMMB73_313698 [Zea mays]
          Length = 588

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 42/140 (30%), Positives = 71/140 (50%), Gaps = 6/140 (4%)

Query: 183 REKLLQVWRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
           R     VW N  ++ ++  S      Y +++  S FCL   G+   + R+ +++ +GC+P
Sbjct: 418 RGACASVWENFKNNPLFDISTDHPPTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIP 477

Query: 241 VIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
           VIIA+   LPFAD + W+   + VA  D+P L  IL  I + + +L +  +L      Q 
Sbjct: 478 VIIADDIVLPFADAIPWEEIGVFVAEEDVPKLDSILMSIPT-DVILRKQRLLANPSMKQA 536

Query: 301 HVFPSDY---DAFYMVMYEL 317
            +FP      DAF+ ++  L
Sbjct: 537 MLFPQPAQAGDAFHQILNGL 556


>gi|302766093|ref|XP_002966467.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
 gi|300165887|gb|EFJ32494.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
          Length = 453

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 64/265 (24%), Positives = 115/265 (43%), Gaps = 39/265 (14%)

Query: 52  DPSKADLFFLPF-----SIARMRHDR-----RIGTEGIPDFISHYIFNISQKYPYWNRTG 101
           DP +AD+ F+PF     + A++R+ +     R   E         +  I      W R+G
Sbjct: 116 DPDQADVVFVPFFAALSAEAQLRNGKGHFRHRKDNEDYER--QRAVMEIVTSSSRWQRSG 173

Query: 102 GADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ--------- 152
           G DH +V    +     +   E+  + + VV    +++    + +++S PQ         
Sbjct: 174 GRDHVFVLTDPMAMYHFRA--EIANSILLVVDFGGWYMEDAKSSRNLSSPQPIYHTQVSL 231

Query: 153 ----IWPRQEDPPKLGSSK----RNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHS 200
               I P     P L  S+    R+ L +F GA        VR++L  V   +  +    
Sbjct: 232 IKDVIVPYTHLLPTLALSQDNAVRSTLLYFKGARYRHRTGLVRDQLWSVLDGEPGVLLEE 291

Query: 201 G----RLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLN 256
           G      +      +  S FCLH  G   ++ R+ D++   C+PVI+++  +LPF  +L+
Sbjct: 292 GFPNRTGQVQAVQGMRNSHFCLHPAGDTPSSCRLFDAVASLCIPVIVSDSIELPFEGMLD 351

Query: 257 WKSFSIVVATLDIPLLKKILKGISS 281
           +  F+I V+  D  L K +++ +SS
Sbjct: 352 YTQFAIFVSVHDALLPKWLVRHLSS 376


>gi|414866629|tpg|DAA45186.1| TPA: hypothetical protein ZEAMMB73_313698 [Zea mays]
          Length = 206

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 42/140 (30%), Positives = 71/140 (50%), Gaps = 6/140 (4%)

Query: 183 REKLLQVWRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
           R     VW N  ++ ++  S      Y +++  S FCL   G+   + R+ +++ +GC+P
Sbjct: 36  RGACASVWENFKNNPLFDISTDHPPTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIP 95

Query: 241 VIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
           VIIA+   LPFAD + W+   + VA  D+P L  IL  I + + +L +  +L      Q 
Sbjct: 96  VIIADDIVLPFADAIPWEEIGVFVAEEDVPKLDSILMSIPT-DVILRKQRLLANPSMKQA 154

Query: 301 HVFPSDY---DAFYMVMYEL 317
            +FP      DAF+ ++  L
Sbjct: 155 MLFPQPAQAGDAFHQILNGL 174


>gi|224093354|ref|XP_002334839.1| predicted protein [Populus trichocarpa]
 gi|222875141|gb|EEF12272.1| predicted protein [Populus trichocarpa]
          Length = 175

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 42/134 (31%), Positives = 68/134 (50%), Gaps = 6/134 (4%)

Query: 189 VWRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANH 246
           VW N  D+ ++  S      Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+ 
Sbjct: 11  VWENFKDNPLFDISTEHPATYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVIIADD 70

Query: 247 YDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSD 306
             LPFAD + W+   + V   D+P L  IL  I   E +L +  +L      Q  +FP  
Sbjct: 71  IVLPFADAIPWEEIGVYVDEEDVPNLDTILTSIPP-EVILRKQRLLANPSMKQAMLFPQP 129

Query: 307 Y---DAFYMVMYEL 317
               DAF+ V+  L
Sbjct: 130 AQPGDAFHQVLNGL 143


>gi|302848153|ref|XP_002955609.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
 gi|300259018|gb|EFJ43249.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
          Length = 572

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 72/320 (22%), Positives = 118/320 (36%), Gaps = 59/320 (18%)

Query: 50  TKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFN-ISQKYPYWNRTGGADHFYV 108
           T DP KAD F++P S   ++            F+   + N I + +P WN+TGGA H   
Sbjct: 179 TADPEKADFFYIPASARDLKRA----------FLLEPLLNYIIEAWPIWNQTGGARHIMP 228

Query: 109 ACHSIGRSAM-QKAWEVKLNAIQVVCSSSYFISGH------------IAHKDVSLPQIWP 155
           A   +G   +  K   +  N   +     Y    H            +  +D+ +P +  
Sbjct: 229 AEGDVGTCELPMKIRNMTANVTWLQFWGMYDFHPHWTQIFHNRVPCMVPGRDIVVPFMAM 288

Query: 156 RQED------PPKLGSSKRNKLA--FFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTP- 206
              D      P    + KRN+    FFAG +     ++ L       +   +SG ++   
Sbjct: 289 SSHDRFVIETPLHPRNQKRNRTNTFFFAGGICGSGNKRALPPHCTYYKQVRYSGGVRQAV 348

Query: 207 ------------------YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
                             YA +   S FCL   G       I  ++Y GC+PV   +   
Sbjct: 349 YYHYHKRPGWRVVPGTDDYARDYASSIFCLAAAGGGWGKRGIVATMY-GCIPVAATDMLY 407

Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVF----- 303
             F   ++W  F + V+   IP L  +L+  + E+   +Q       +H  W        
Sbjct: 408 EAFEPEMDWNRFGVRVSQAQIPQLGDMLEAFTPEQIRQMQIRTACAAQHLHWSTNLGGIM 467

Query: 304 --PSDYDAFYMVMYELWLRR 321
               ++DAF  +M  L +RR
Sbjct: 468 GETGEFDAFNTIMAILRMRR 487


>gi|296084506|emb|CBI25527.3| unnamed protein product [Vitis vinifera]
          Length = 275

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 62/112 (55%), Gaps = 6/112 (5%)

Query: 46  SHFVTKDPSKADLFFLPFSIARMRH---DRRIGTEG--IPDFISHYIFNISQKYPYWNRT 100
           SHFV   P +A +F++P S+ R+ H      +   G  +   ++ YI+ +S KYPYWNR+
Sbjct: 96  SHFVAGHPDEAHVFYIPISVTRIAHYIYSPPVDYSGHMLQRLVTDYIYVVSDKYPYWNRS 155

Query: 101 GGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ 152
            GADHF V+CH           ++  + I+V+C+++         +D+SLP+
Sbjct: 156 NGADHFLVSCHDWAPEISIVTPDLYKHFIRVLCNANTSERFQPI-RDISLPE 206



 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 29/78 (37%), Positives = 49/78 (62%), Gaps = 1/78 (1%)

Query: 250 PFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDA 309
           P  D+ +   FSI + +  IP +KKILK + +E YL +Q  V +V++HF  +     YD 
Sbjct: 198 PIRDI-SLPEFSINITSDKIPEIKKILKAVPNERYLRMQKRVKQVQRHFVINRPAQPYDM 256

Query: 310 FYMVMYELWLRRSSVRVQ 327
            +M+++ +WLRR +VR++
Sbjct: 257 LHMILHSVWLRRLNVRLR 274


>gi|159489064|ref|XP_001702517.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158280539|gb|EDP06296.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 489

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 70/307 (22%), Positives = 116/307 (37%), Gaps = 69/307 (22%)

Query: 41  KAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRT 100
           +  + S   T D   AD +F+P  +    H          + ++  +  + + +P+W+RT
Sbjct: 149 QRMLSSGVRTTDGDSADYYFIPLLMRTRTH--------TVNHLAAVVHYVRKHWPWWDRT 200

Query: 101 GGAD-HFYVACHSIGRSAMQKA-------------WEVKLNAIQVVCSSSYFISGHIAHK 146
           GG   H  VA   IGR  +                W +  N      S   +++ H   K
Sbjct: 201 GGGHRHLLVAPGDIGRRILTPELLHMTENCTFLTHWGLHRNH-----SGGNWLASHRPGK 255

Query: 147 DVSLPQIWPRQED---PPKLGSSKRNKLA-----FFAGAVNSPVREKLLQVWRNDSEIYA 198
           D+ +P + P  E     P   + K+N+ +     FFAG +    ++          + Y+
Sbjct: 256 DIVVPPLTPPDEPIVYSPLHATLKKNRKSRLGELFFAGRICGDNQKPTDGKCSEKRQDYS 315

Query: 199 HSGRLK----------------TP-YADELLGSKFCLHVKG--------FEVNT------ 227
              R +                TP YA+ L    FCL   G         +++T      
Sbjct: 316 AGTRQQVAHHHWNRPNWTITTHTPAYAEALSTHIFCLSPTGTARLSHARLDLSTLQRPGG 375

Query: 228 ---ARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
               R   SL  GC+PV + +H   PF   ++W  FS+ V   DI  L  +L G+ +  +
Sbjct: 376 GYGRRSVQSLLMGCIPVTVTDHVHQPFEPEVDWARFSVPVREDDIAQLHHVLTGLRASPH 435

Query: 285 LLLQSYV 291
            L Q  V
Sbjct: 436 TLAQMQV 442


>gi|297851878|ref|XP_002893820.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297339662|gb|EFH70079.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 478

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 61/248 (24%), Positives = 106/248 (42%), Gaps = 36/248 (14%)

Query: 54  SKADLFFLPFSIARMRHDRRIGT--------EGIPDFISH-YIFNISQKYPYWNRTGGAD 104
           S AD+ F+PF  A +  +  +G          G  D+     + +  +    W R+ G D
Sbjct: 145 SDADVVFVPF-FATLSAEMELGNGKGSFRKKNGNEDYQRQRQVLDFVKNTEAWKRSNGRD 203

Query: 105 HFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ------------ 152
           H +V    +    +++  E+ L+ + VV    +F     +    SLP+            
Sbjct: 204 HVFVLTDPVAMWHVRE--EIALSILLVVDFGGWFRQDSKSSNGTSLPERIEHTQVSVIKD 261

Query: 153 -IWPRQEDPPKLGSS---KRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSG-RL 203
            I P     P L  S   +R+ L +F GA        +REKL  +  ++  I    G   
Sbjct: 262 VIVPYTHLLPSLDLSQNQRRHSLLYFKGAKHRHRGGLIREKLWDLLVDEQGIVMEEGFPN 321

Query: 204 KTPYADELLG---SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSF 260
            T     ++G   S+FCLH  G    + R+ D++   C+PVI+++  +LPF  ++++  F
Sbjct: 322 ATGREQSIIGMRNSEFCLHPAGDTPTSCRLFDAIQSLCIPVIVSDTIELPFEGIIDYSEF 381

Query: 261 SIVVATLD 268
           S+ V   D
Sbjct: 382 SVFVPVSD 389


>gi|302842849|ref|XP_002952967.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
 gi|300261678|gb|EFJ45889.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
          Length = 728

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 81/330 (24%), Positives = 125/330 (37%), Gaps = 63/330 (19%)

Query: 43  FMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRT-G 101
            + +     D + AD F +P  + RM +D         D +   +  I   +PYWN T G
Sbjct: 263 LLSAGLRVADAASADFFLVPIRV-RMAYDS--------DRVIQAVSYIRSVWPYWNATRG 313

Query: 102 GADHFYVACHSIGRSAM-QKAWEVKLNAIQVV---CSSSYFISG----HIAHKDVSLPQI 153
           G  H +V     GR  + + A  +  NA  +     +  +  +G    H   KDV LP +
Sbjct: 314 GGRHIFVHTGDWGRDELSEDAQLLTRNATWLTHWGLARDHEFAGWRQSHRPGKDVVLPLM 373

Query: 154 W----------PRQED-PPKLGSSKRNKLAFFAGAV-NSPVREKLLQVWRNDSEIYA--- 198
                      PR     P     +R    FFAG +  S     L   + N   +     
Sbjct: 374 LAASLLSTYQLPRASPLHPAGPRPERTTTLFFAGRICGSRATPSLNGTYPNCPNVLGSED 433

Query: 199 -------------HSGRL--------KTPYADELLGSKFCLHVKGFEVNTARIADSLYYG 237
                        H GR         + P A E+  +KFCL   G       +   L  G
Sbjct: 434 AYSAATRQRAYFYHHGRANWKLVTASRAPAA-EMATAKFCLAPSGGGQGKRSVLAPLM-G 491

Query: 238 CVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKH 297
           CVPV + +    PF   L W+ F++ V   D+P++ ++L  +  E+    Q+ +    +H
Sbjct: 492 CVPVPVTDGLMQPFEPELRWERFAVGVRERDLPVMHELLDRLMPEQVAGFQAELTCAAQH 551

Query: 298 FQW-----HVFPSD--YDAFYMVMYELWLR 320
             W      VF  D  YDAF  ++  L +R
Sbjct: 552 LFWSSLYGSVFGEDGAYDAFETLVQVLRMR 581


>gi|302783725|ref|XP_002973635.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
 gi|300158673|gb|EFJ25295.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
          Length = 452

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 60/232 (25%), Positives = 108/232 (46%), Gaps = 33/232 (14%)

Query: 97  WNRTGGADHFYVACHSIG----RSAMQKA---------WEVKLNAIQVVCSSSYFISGHI 143
           W R+ G DH +V    +     R  +  A         W ++ +A   + SS+      +
Sbjct: 177 WRRSNGVDHVFVLADPMAMWHVREQISTAMFLVVDFGGWYLE-DAKNKLNSSTIIQHSQV 235

Query: 144 AH-KDVSLPQ--IWPRQEDPPKLGSSK-RNKLAFFAGAV----NSPVREKLLQVWRNDSE 195
           +  KDV +P   + P    P K+   + R  L +F GA     +  VREKL ++  N+ E
Sbjct: 236 SPIKDVIIPHTHLLP----PLKIADDQHRTVLLYFRGARHRHRSGLVREKLWKILDNEPE 291

Query: 196 IYAHSGRLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
           +    G        E    +  S+FCL   G   ++ R+ D++   C+PVI+++   LPF
Sbjct: 292 VLLEKGLPDDAGLAEATRGMRSSEFCLTPAGDTPSSCRLYDAIASLCIPVIVSDDIQLPF 351

Query: 252 ADVLNWKSFSIVVATLDIPL---LKKILKGISSEEYLLLQSYVLKVRKHFQW 300
              +N++ F + V+T D      L + L+ I SEE   ++  + +V+++F++
Sbjct: 352 EGFVNYEEFCVFVSTRDATQPGWLVQKLRSIGSEERSTMRQTLSRVQRYFEY 403


>gi|359359170|gb|AEV41075.1| putative exostosin family domain-containing protein [Oryza minuta]
          Length = 317

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 54/195 (27%), Positives = 92/195 (47%), Gaps = 25/195 (12%)

Query: 94  YPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVV----------CSSSYFISGHI 143
           +P W R+GG DH +V    +    ++K  E+  + + VV           S+S  +S  I
Sbjct: 33  HPAWRRSGGRDHVFVLTDPVAMWHVRK--EIAPSILLVVDFGGWYKLDSNSASSNVSHMI 90

Query: 144 AHKDVSLPQ--IWPRQEDPPKLGSSK---RNKLAFFAGAV----NSPVREKLLQVWRNDS 194
            H  VSL +  I P     P +  S+   R  L +F GA        VREKL  +  N+ 
Sbjct: 91  QHTQVSLLKDVIVPYTHLLPTMQLSENKDRLTLLYFKGAKHRHRGGLVREKLWDLMVNEP 150

Query: 195 EIYAHSGRLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLP 250
           ++    G       ++    +  S+FCLH  G    + R+ D++   C+PVI+++  +LP
Sbjct: 151 DVVMEEGYPNATGREQSIKGMRTSEFCLHPAGDTPTSCRLFDAVASLCIPVIVSDEIELP 210

Query: 251 FADVLNWKSFSIVVA 265
           F  ++++  F+I V+
Sbjct: 211 FEGMIDYTEFTIFVS 225


>gi|255571564|ref|XP_002526728.1| catalytic, putative [Ricinus communis]
 gi|223533917|gb|EEF35642.1| catalytic, putative [Ricinus communis]
          Length = 728

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 81/332 (24%), Positives = 138/332 (41%), Gaps = 70/332 (21%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPF----SIAR--------MRHDRRIGTEGIP 80
           Y ++    ++ + S + T +  +AD FF+P      I R        M+    + +    
Sbjct: 328 YGAQMALYESLLASPYRTLNGEEADFFFVPILDSCIITRADDAPHLSMQDHMGLRSSLTL 387

Query: 81  DFISHYIFNISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQV 131
           ++      +I + YPYWNRT G DH +       AC++   I  S M   W    N+   
Sbjct: 388 EYYRKAYDHIVEHYPYWNRTSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWG-NTNSKHN 446

Query: 132 VCSSSYFISG--------------HIAHKDVSLPQIWPRQEDPPKLGS-------SKRNK 170
             +++Y+                     KD+ LP  W R  D   L +        +R  
Sbjct: 447 HSTTAYWADNWDKISSDRRGRHPCFDPDKDLVLP-AWKRP-DVSALSTKLWARPLERRKT 504

Query: 171 LAFFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGR------LKTP----- 206
           L FF G +                +R+KL + + +        G+      + TP     
Sbjct: 505 LFFFNGNLGPAYPNGRPELSYSMGIRQKLAEEFGSSPNKDGKLGKQHAEDVIVTPLRSEN 564

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y ++L  S FC  + G +  + R+ DS+  GC+PVII +   LP+ +VLN++SF++ +  
Sbjct: 565 YHEDLASSIFCGVLPG-DGWSGRMEDSILQGCIPVIIQDGIFLPYENVLNYESFAVRIRE 623

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
            +I  L KIL+G +  E     + V K+ + F
Sbjct: 624 DEISNLLKILRGFNETEKEFKLANVRKIWQRF 655


>gi|159470791|ref|XP_001693540.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158283043|gb|EDP08794.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 510

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 69/297 (23%), Positives = 109/297 (36%), Gaps = 58/297 (19%)

Query: 35  SESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKY 94
           +  +F +    +     DP+ AD F+LP  +       R  ++G    +   +  +    
Sbjct: 144 TTRHFLEMLTATGARVGDPAAADWFYLPVRL-------RSSSDG--HVLRRALEYVQAAQ 194

Query: 95  PYWNRTGGADHFYVACHSIGRSAMQKA-----------WEVKLNAIQVVCSSSYFISGHI 143
           P++N TGG DHF +A   +GR   ++            W +  +  + +  S ++ + H 
Sbjct: 195 PWFNATGGKDHFVLAVGDMGRLESERGPLSANVTFVSHWGLYRSKAEQL-QSPHWRASHR 253

Query: 144 AHKDVSLP-----------QIWPRQEDP------PKLGSSKRNKLAFFAGAVNSPVREKL 186
              D+ LP            I   +  P      P     +   L +FAG V        
Sbjct: 254 NATDIVLPVYLTLRKLQKFGILGSRHHPKFATVAPPDVRERNGPLFWFAGRVCQDSSPPR 313

Query: 187 LQVWRNDSEIYAHSG-------------------RLKTPYADELLGSKFCLHVKGFEVNT 227
             VW N  +   +S                    R    YA  +L +KFC    G     
Sbjct: 314 TDVWPNCPKAMGYSAMTRQAVYFHHWNRTGFAVLRGDKQYAKHMLTAKFCFGPMGGGHGQ 373

Query: 228 ARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
            +   +L  GCVPV+I +     +   L+W  F + VA  DIP L  IL  I  EEY
Sbjct: 374 RQFQAALA-GCVPVVIGDGVLEAWEPYLDWNDFGVRVAEADIPRLHTILGAIGPEEY 429


>gi|259155096|ref|NP_001158790.1| exostosin-1c [Salmo salar]
 gi|223647436|gb|ACN10476.1| Exostosin-1c [Salmo salar]
          Length = 759

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 81/353 (22%), Positives = 130/353 (36%), Gaps = 75/353 (21%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFL 61
            FRVYVYP  +                G   SESY K   +  +S + T DP +A LF L
Sbjct: 108 GFRVYVYPSEK----------------GERGSESYRKILTSIGESRYYTTDPREACLFVL 151

Query: 62  PFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKA 121
              I  +  D+  G + +P+          + YP WN   G +H     +S       + 
Sbjct: 152 --GIDTLDRDQLSG-QFVPNLDDRI-----KGYPLWNE--GQNHLIFNLYSGTWPNYTE- 200

Query: 122 WEVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSSK-----------RN 169
            ++  N  Q + + +   + H     DVS+P      +D P+ G  +           R 
Sbjct: 201 -DLGFNIGQAILAKASLNTEHFRPGFDVSIPLF---SKDHPQKGGERGWLVRNTVPPRRK 256

Query: 170 KLAFFAG-----AVNSPVREKLLQV-----------------WRNDSEIYAHSGRL---K 204
            L  F G      + S  R  L  +                 W    +       L   +
Sbjct: 257 YLLMFKGKRYLTGIGSDTRNALHHIHNGKDIVSLTTCRHGKDWEKHKDARCDHDNLEYER 316

Query: 205 TPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV 264
             Y + L  S FCL  +G  + + R  +SL   CVPV+++N ++LPF+DV+ W    I  
Sbjct: 317 FDYQELLHNSTFCLVPRGRRLGSFRFLESLQAACVPVLLSNGWELPFSDVIQWNQAVIEG 376

Query: 265 ATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
               +  +   +  + +E  L      L+ R    W  + S  D   +   E+
Sbjct: 377 DERLLLQVPSTVHAVGNERVL-----ALRQRTQMLWDAYFSSVDKIVLTTLEI 424


>gi|291190070|ref|NP_001167078.1| Exostosin-2 [Salmo salar]
 gi|223647990|gb|ACN10753.1| Exostosin-2 [Salmo salar]
          Length = 724

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 32/82 (39%), Positives = 49/82 (59%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  S FC+ ++G  +  A ++D L  GCVPVI+A+ Y LPF++VL+WK  S+V+  
Sbjct: 314 YPQILQESSFCVVLRGARLGQATLSDVLQAGCVPVILADSYILPFSEVLDWKRASVVIPE 373

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +P +  ILK I   +   +Q
Sbjct: 374 EKLPEMYTILKSIPHRQVEEMQ 395


>gi|356544337|ref|XP_003540609.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase F8H-like
           [Glycine max]
          Length = 494

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 59/225 (26%), Positives = 98/225 (43%), Gaps = 26/225 (11%)

Query: 95  PYWNRTGGADHFYVACH-----SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDV 148
           P W R+GG DH     H     S+ R      W +      +  + +++  G +   KD+
Sbjct: 214 PAWKRSGGRDHILPVHHPWSFKSVRRYVKNAIWLLP----DMDSTGNWYKPGQVYLEKDL 269

Query: 149 SLPQIWPRQE--DPPKLGSS--KRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHS 200
            LP + P  +  D   L  +  KR+ L FF G +       +R KL         +    
Sbjct: 270 ILPYV-PNVDLCDAKCLSETNPKRSTLLFFRGRLKRNAGGKIRSKLGAELSGADGVVIEE 328

Query: 201 GRL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLN 256
           G      K      +  S FCL   G   ++AR+ D++  GC+PVII++  +LPF  +L+
Sbjct: 329 GTAGEGGKEAAQRGMRKSLFCLSPAGDTPSSARLFDAIVSGCIPVIISDELELPFEGILD 388

Query: 257 WKSFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHF 298
           ++  ++ +++ D      L K LKGI       +Q  + K  +HF
Sbjct: 389 YRKIAVFISSNDAVKPGWLLKYLKGIRPAHIKEMQQNLAKYSRHF 433


>gi|56606002|ref|NP_001008400.1| exostosin-2 [Danio rerio]
 gi|55247892|gb|AAV48783.1| exostosin-2 [Danio rerio]
          Length = 719

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 32/82 (39%), Positives = 49/82 (59%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  S FC+ ++G  +  A ++D L  GCVPVI+A+ Y LPF++VL+WK  S+V+  
Sbjct: 309 YPQILQESSFCVVLRGARLGQATLSDVLQAGCVPVIMADSYILPFSEVLDWKRASVVIPE 368

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +P +  ILK I   +   +Q
Sbjct: 369 EKLPEMYTILKSIPHRQVEEMQ 390


>gi|224051022|ref|XP_002199808.1| PREDICTED: exostosin-2 [Taeniopygia guttata]
          Length = 718

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 32/82 (39%), Positives = 49/82 (59%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  S FC+ ++G  +  A ++D L  GCVPVIIA+ Y LPF++VL+WK  S+V+  
Sbjct: 308 YPQVLQESTFCVVLRGARLGQAVLSDVLQAGCVPVIIADSYILPFSEVLDWKRASVVIPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +P +  IL+ I   +   +Q
Sbjct: 368 EKMPEMYSILQSIPQRQIEEMQ 389


>gi|321463593|gb|EFX74608.1| hypothetical protein DAPPUDRAFT_307216 [Daphnia pulex]
          Length = 724

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 59/257 (22%), Positives = 108/257 (42%), Gaps = 29/257 (11%)

Query: 41  KAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRT 100
           KA  KS + T DP++A LF        + +  RI  + +   +S          PYWN  
Sbjct: 143 KAITKSMYFTSDPNEACLFI---PTIDLLNQNRIRPKDVGKALS--------SLPYWNE- 190

Query: 101 GGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDP 160
            G +H                 EV +    ++ S  +    + +  DVS+P   P   + 
Sbjct: 191 -GRNHLIFNILPGSMPEYHPFLEVDI-GYAMMTSGGFSSLTYRSGFDVSVPVYSPLAAEL 248

Query: 161 PKLGSSKRNKLAFFAGA-VNSPVREKLLQVWRNDSEIYAHSGRLKTP------------- 206
               S  RN L   +   ++   RE + ++          +    +P             
Sbjct: 249 RTQSSQDRNWLVISSQPYIHEDFREVISEMAAEHPGFLVLNSCGSSPLDTKLRCRDEETY 308

Query: 207 -YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVA 265
            + D L+ + FCL V+G  +    + +SL  GC+PV++++ Y LP+ +V++WKS  + + 
Sbjct: 309 NFPDILMNATFCLVVRGARLGQPTLMESLAAGCIPVVVSDSYVLPYEEVIDWKSAVLQLY 368

Query: 266 TLDIPLLKKILKGISSE 282
             D+  +  +L+G+SS+
Sbjct: 369 EDDLSKMMDLLRGVSSD 385


>gi|387015840|gb|AFJ50039.1| Exostosin-2-like [Crotalus adamanteus]
          Length = 718

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 50/82 (60%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D+L  GC+PV+IA+ Y LPF++VL+WK  S+V+  
Sbjct: 308 YPQVLQEATFCIVLRGARLGQAVLSDALQAGCIPVVIADSYVLPFSEVLDWKRASVVIPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +P L  IL+ I   +   +Q
Sbjct: 368 EKMPELYNILQSIPQRQIEEMQ 389


>gi|224144703|ref|XP_002325382.1| predicted protein [Populus trichocarpa]
 gi|222862257|gb|EEE99763.1| predicted protein [Populus trichocarpa]
          Length = 505

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 56/224 (25%), Positives = 100/224 (44%), Gaps = 24/224 (10%)

Query: 95  PYWNRTGGADHFYVACH-----SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDV 148
           P W R+ G +H +   H     S+ R      W +      +  + +++  G +   KD+
Sbjct: 228 PAWKRSEGRNHIFPIHHPWSFKSVRRYVKNAIWLLP----DMDSTGNWYKPGQVFLEKDL 283

Query: 149 SLPQIWPRQEDPPKLGS---SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSG 201
            LP +        K  S   SKR+ L +F G +       +R KL+        ++   G
Sbjct: 284 ILPYVPNVNLCDTKCISESESKRSTLLYFRGRLKRNAGGKIRAKLVAELSGAEGVFIEEG 343

Query: 202 RL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
                 K      +  S FCL   G   ++AR+ D++  GC+PV++++  +LPF  +L++
Sbjct: 344 TAGEGGKAAAQIGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVVVSDELELPFEGILDY 403

Query: 258 KSFSIVVATLDIPL---LKKILKGISSEEYLLLQSYVLKVRKHF 298
           +  ++ V++ D      L K LKGIS  +   +Q  + K  +HF
Sbjct: 404 RKIALFVSSSDAVQPGWLLKFLKGISLAQIRGMQRNLAKYSRHF 447


>gi|443721868|gb|ELU10993.1| hypothetical protein CAPTEDRAFT_157657 [Capitella teleta]
          Length = 712

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 30/100 (30%), Positives = 58/100 (58%)

Query: 192 NDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
           N ++     G     Y   L  S+FCL ++   +    ++D++  GCVPVI+A+ + LPF
Sbjct: 288 NMTDRCTQEGNTALEYPQVLQESEFCLVIRTSRLGQLTLSDAMKAGCVPVIVADSFILPF 347

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYV 291
           ++V++WK  +IVVA  ++  + ++++ IS +  L ++  V
Sbjct: 348 SEVIDWKRAAIVVAEDNLSTVNEVVRAISRDSLLQMRRQV 387


>gi|125528898|gb|EAY77012.1| hypothetical protein OsI_04969 [Oryza sativa Indica Group]
          Length = 416

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 62/245 (25%), Positives = 106/245 (43%), Gaps = 24/245 (9%)

Query: 87  IFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIA-- 144
           +  I  K  YW R+ G DH     H      ++      +NA  ++ +     +  +A  
Sbjct: 150 LMEILWKSKYWQRSAGRDHVIPMHHPNAFRFLRDM----VNASILIVADFGRYTKELASL 205

Query: 145 HKDVSLPQIWP----RQEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEI 196
            KDV  P +        +DPP    + R  L FF G         +R KL ++ +    +
Sbjct: 206 RKDVVAPYVHVVDSFLNDDPPDPFDA-RPTLLFFRGRTVRKDEGKIRAKLAKILKGKDGV 264

Query: 197 -----YAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
                 A    +KT   + +  SKFCLH  G   ++ R+ D++   CVPVI+++  +LPF
Sbjct: 265 RFEDSLATGEGIKT-STEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSSRIELPF 323

Query: 252 ADVLNWKSFSI---VVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYD 308
            D +++  FS+   V   L    L   L+ I   +++ + S +  V  H+++   P   D
Sbjct: 324 EDEIDYSEFSLFFSVEEALRPDYLLNQLRQIQKTKWVEIWSKLKNVSHHYEFQNPPRKGD 383

Query: 309 AFYMV 313
           A  M+
Sbjct: 384 AVNMI 388


>gi|332030617|gb|EGI70305.1| Exostosin-2 [Acromyrmex echinatior]
          Length = 594

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 50/83 (60%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y D L  S FCL ++G  +  + + D++  GC+PVIIA+   +PF DV++W   ++ +  
Sbjct: 187 YPDSLQTSTFCLIIRGARLAQSALLDAMAAGCIPVIIADSLMMPFHDVIDWTKAAVFIRE 246

Query: 267 LDIPLLKKILKGISSEEYLLLQS 289
           +DI L  ++LK IS +  + +Q 
Sbjct: 247 VDILLTIQLLKKISPQRIMDMQE 269


>gi|307215037|gb|EFN89864.1| Exostosin-2 [Harpegnathos saltator]
          Length = 594

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 30/83 (36%), Positives = 52/83 (62%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y D L  + FCL ++G  +  + + D++  GC+PVIIA+   +PF DV++W   +I+V  
Sbjct: 187 YPDILHSATFCLIIRGARLAQSVLLDAMAAGCIPVIIADSLIMPFHDVIDWTKAAILVRE 246

Query: 267 LDIPLLKKILKGISSEEYLLLQS 289
           +DI L+ ++LK IS +  + +Q 
Sbjct: 247 VDILLIIQLLKKISHQRIVEMQE 269


>gi|227206252|dbj|BAH57181.1| AT3G03650 [Arabidopsis thaliana]
          Length = 244

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 42/161 (26%), Positives = 81/161 (50%), Gaps = 11/161 (6%)

Query: 164 GSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLK----TPYADELLGSK 215
           G   R  L +F GA+       VR++L  + + + +++   G ++    +   + +  SK
Sbjct: 56  GFDGRPILLYFQGAIYRKAGGFVRQELYNLLKEEKDVHFSFGSVRNHGISKAGEGMRSSK 115

Query: 216 FCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLL 272
           FCL++ G   ++ R+ D++   C+PVII++  +LP+ DVLN+  F + V + D      L
Sbjct: 116 FCLNIAGDTPSSNRLFDAIASHCIPVIISDDIELPYEDVLNYNEFCLFVRSSDALKKGFL 175

Query: 273 KKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
             +++ I  EEY  +   + +V ++F       D +  Y V
Sbjct: 176 MGLVRSIGREEYNKMWLRLKEVERYFDLRFPVKDDEGDYAV 216


>gi|449270126|gb|EMC80844.1| Exostosin-2, partial [Columba livia]
          Length = 714

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 31/82 (37%), Positives = 49/82 (59%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  S FC+ ++G  +  A ++D L  GCVPVIIA+ Y LPF++VL+WK  S+V+  
Sbjct: 304 YPQVLQESTFCVVLRGARLGQAVLSDVLQAGCVPVIIADSYILPFSEVLDWKRASVVIPE 363

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +P +  IL+ +   +   +Q
Sbjct: 364 EKMPEMYSILQSVPQRQIEEMQ 385


>gi|342320455|gb|EGU12395.1| Proteophosphoglycan ppg4 [Rhodotorula glutinis ATCC 204091]
          Length = 1322

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 41/137 (29%), Positives = 64/137 (46%), Gaps = 6/137 (4%)

Query: 168  RNKLAFFAGAVNS--PVREKLLQVWRN----DSEIYAHSGRLKTPYADELLGSKFCLHVK 221
            R  LAFFAG V     +    +   R     +S I          Y   L  SKFCL  +
Sbjct: 1149 RKHLAFFAGGVRGFGAIARTKIGCGRTGQDPNSAILYQQFSPGQRYLGTLNASKFCLLPR 1208

Query: 222  GFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISS 281
            G      R  +++Y GC+P  I +    PF D+L++  FS+ +   D   +++IL   + 
Sbjct: 1209 GIPAWMTRTFEAIYAGCIPAFIVDRNLFPFQDILDYSRFSVTIPEADAHRIEEILSAYTP 1268

Query: 282  EEYLLLQSYVLKVRKHF 298
            E+   LQ+ ++KVR+ F
Sbjct: 1269 EQLSELQANLVKVREAF 1285


>gi|224090294|ref|XP_002308967.1| predicted protein [Populus trichocarpa]
 gi|222854943|gb|EEE92490.1| predicted protein [Populus trichocarpa]
          Length = 793

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 77/332 (23%), Positives = 139/332 (41%), Gaps = 70/332 (21%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPF----------SIARMRHDRRIGTEG--IP 80
           Y ++    ++ + S + T +  +AD FF+P               +  ++ +G       
Sbjct: 392 YGAQMALYESILASPYRTLNGEEADFFFVPVLDSCIITRADDAPHLSMEQHLGLRSSLTL 451

Query: 81  DFISHYIFNISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQV 131
           +F      +I + YP+WNR+ G DH +       AC++   I  S M   W    N+   
Sbjct: 452 EFYRKAYDHIVEHYPFWNRSSGRDHLWSFSWDEGACYAPKEIWNSMMVVHWG-NTNSKHN 510

Query: 132 VCSSSYFISG--------------HIAHKDVSLPQIWPRQEDPPKLGS-------SKRNK 170
             +++Y+                     KD+ LP  W R  D   L +        KR  
Sbjct: 511 HSTTAYWADNWDKISSDRRGKHPCFDPDKDLVLPA-WKRP-DVNALSTKLWARPLEKRKT 568

Query: 171 LAFFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGR------LKTP----- 206
           L +F G +                +R+KL + + +      + G+      + +P     
Sbjct: 569 LFYFNGNLGPAYLNGRPEALYSMGIRQKLAEEFGSTPNKDGNLGKQHAENVIVSPLRSES 628

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y ++L  S FC  + G +  + R+ DS+  GC+PV+I +   LP+ +VLN++SF++ +  
Sbjct: 629 YHEDLASSVFCGVMPG-DGWSGRMEDSILQGCIPVVIQDGIYLPYENVLNYESFAVRILE 687

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
            +IP L KIL+G +  E     + V K+ + F
Sbjct: 688 DEIPNLIKILQGFNETEIENKLTSVQKIGQRF 719


>gi|260825668|ref|XP_002607788.1| hypothetical protein BRAFLDRAFT_64151 [Branchiostoma floridae]
 gi|229293137|gb|EEN63798.1| hypothetical protein BRAFLDRAFT_64151 [Branchiostoma floridae]
          Length = 733

 Score = 65.1 bits (157), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 66/269 (24%), Positives = 110/269 (40%), Gaps = 49/269 (18%)

Query: 46  SHFVTKDPSKADLF-----FLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWN-- 98
           S + T DP+KA LF      L  +I R+R   ++                  + P WN  
Sbjct: 149 SEYNTNDPAKACLFVPSIDLLNQNILRLRETGQV----------------LGRLPRWNHG 192

Query: 99  ----RTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW 154
               +  GA+H               A EV      ++    +    +    DVS+P   
Sbjct: 193 HHDFQLQGANHLLFNMLPGMEPDYNTALEVP-RGKAILAGGGFSSWTYRPGYDVSIPVFN 251

Query: 155 PRQED---PPKLGSSKRNKLAFFAGAV-NSPVREKLLQVWRNDSEIY------------- 197
           P   D   P K     R  L   A  V +   R +L  V R++ ++              
Sbjct: 252 PFTADMELPGKPDGYSRPWLIVSAQTVIHMDFRAELDAVARDNDDVLVLDRCNDLPEGIP 311

Query: 198 AHSGRLKTP----YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFAD 253
           AH  R K      Y D L  + FC+ ++G  +  + ++D++  GC+PVI  + Y +PF++
Sbjct: 312 AHRRRCKEDRVYNYPDILQEATFCMILRGARMGQSALSDAMMAGCIPVIAIDTYVMPFSE 371

Query: 254 VLNWKSFSIVVATLDIPLLKKILKGISSE 282
           VL+WK  ++++   D+P +  +L+ IS E
Sbjct: 372 VLDWKRAAVILREEDLPDVHNVLRRISQE 400


>gi|302829362|ref|XP_002946248.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
 gi|300269063|gb|EFJ53243.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
          Length = 656

 Score = 65.1 bits (157), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 86/352 (24%), Positives = 150/352 (42%), Gaps = 73/352 (20%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGN------YASESYFKKAFMK--SHFVTKDPSK 55
           S ++Y+Y     D   N++ P  FE  GN      Y S   F+  F+   S   T++P +
Sbjct: 290 SLKIYMY-----DIPPNIVGPHQFE-DGNGGIHPQYESFLRFQGLFLNDVSGIRTENPHE 343

Query: 56  ADLFFLP----FSIARMRHDRRIGTEGIPDFISHYIFN-ISQKYPYWNRTGGADHFYVAC 110
           A+LF++P    +S + +         G P   +    N ++  +P++NRTGG DHF +  
Sbjct: 344 ANLFYIPAFTYYSSSNL---------GDPTGAAVRAVNWVAATFPFFNRTGGRDHFVLLS 394

Query: 111 HSIGRSAMQKAWEVKLNAIQV----------------VCSSSYFISGHIAHKDVSLP--- 151
              G   ++   + + N I+V                V ++ Y      A +DV +P   
Sbjct: 395 GDRGACYLKTLPQTE-NLIRVTHFGYERPNITDMGPLVTNTEY--GCFKAGRDVVMPPYV 451

Query: 152 --------QIWPRQEDPPKLGS--SKRNKLAFFAGAV--NSP-----VREKLLQVWRNDS 194
                    +  + E+P    +  + ++ L FF+G +  N P     VR+ L  +  N S
Sbjct: 452 KSNVAGIQGVRAKLEEPGGAEALLAGKDTLLFFSGDIRHNEPEYSGGVRQALALLLANTS 511

Query: 195 EI-----YAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
                    +       Y   L  SKFCL   G      R+  ++ + C+PVII +    
Sbjct: 512 YPDVVFKGGYMMMGMGEYESLLRRSKFCLAPYGHGWGI-RLIHAITHACIPVIIQDKVRQ 570

Query: 250 PFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
           PF D+L++  FS+ V+  ++P L +IL+ +   + L +     +V + F W 
Sbjct: 571 PFEDILHYPDFSVRVSKAELPRLVEILRAVPEPDLLRMIKENSRVYRAFLWQ 622


>gi|297739695|emb|CBI29877.3| unnamed protein product [Vitis vinifera]
          Length = 822

 Score = 65.1 bits (157), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 80/332 (24%), Positives = 139/332 (41%), Gaps = 70/332 (21%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPF--SIARMRHDRR----------IGTEGIP 80
           Y ++    ++ + S   T D  +AD FF+P   S   +R D            + +    
Sbjct: 421 YGAQMAIYESILASPHRTLDGEEADFFFVPVLDSCIIVRADDAPHLNMHAHGGLRSSLTL 480

Query: 81  DFISHYIFNISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQV 131
           +F      +I ++YP+WNR+ G DH +       AC++   I  S M   W    N+   
Sbjct: 481 EFYKTAYDHIVEQYPFWNRSSGRDHIWFFSWDEGACYAPKEIWDSMMLVHWG-NTNSKHN 539

Query: 132 VCSSSYFISGHIA--------------HKDVSLPQIWPRQEDPPKLGS-------SKRNK 170
             +++Y+     +              +KD+ LP  W R  D   L S        +R  
Sbjct: 540 HSTTAYWADNWDSVSSDRRGNHPCFDPYKDLVLPA-WKRP-DVVSLSSKLWSRPREQRKT 597

Query: 171 LAFFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGR------LKTP----- 206
           L +F G +                +R+K+ + + +        G+      + TP     
Sbjct: 598 LFYFNGNLGPAYEGGRPETTYSMGIRQKVAEEFGSSPNKEGKLGKQHAEDVIVTPLRSGN 657

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y + L  S FC  + G +  + R  DS+  GC+PV+I +   LPF ++LN++SF++ +  
Sbjct: 658 YHESLASSVFCGVMPG-DGWSGRFEDSILQGCIPVVIQDGIFLPFENMLNYESFAVRIRE 716

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
            +IP L KIL+G++  E       V K+ + F
Sbjct: 717 DEIPNLIKILRGMNETEIEFKLENVRKIWQRF 748


>gi|225441752|ref|XP_002277596.1| PREDICTED: uncharacterized protein LOC100267584 [Vitis vinifera]
          Length = 794

 Score = 65.1 bits (157), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 80/332 (24%), Positives = 139/332 (41%), Gaps = 70/332 (21%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPF--SIARMRHDRR----------IGTEGIP 80
           Y ++    ++ + S   T D  +AD FF+P   S   +R D            + +    
Sbjct: 393 YGAQMAIYESILASPHRTLDGEEADFFFVPVLDSCIIVRADDAPHLNMHAHGGLRSSLTL 452

Query: 81  DFISHYIFNISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQV 131
           +F      +I ++YP+WNR+ G DH +       AC++   I  S M   W    N+   
Sbjct: 453 EFYKTAYDHIVEQYPFWNRSSGRDHIWFFSWDEGACYAPKEIWDSMMLVHWG-NTNSKHN 511

Query: 132 VCSSSYFISGHIA--------------HKDVSLPQIWPRQEDPPKLGS-------SKRNK 170
             +++Y+     +              +KD+ LP  W R  D   L S        +R  
Sbjct: 512 HSTTAYWADNWDSVSSDRRGNHPCFDPYKDLVLPA-WKR-PDVVSLSSKLWSRPREQRKT 569

Query: 171 LAFFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGR------LKTP----- 206
           L +F G +                +R+K+ + + +        G+      + TP     
Sbjct: 570 LFYFNGNLGPAYEGGRPETTYSMGIRQKVAEEFGSSPNKEGKLGKQHAEDVIVTPLRSGN 629

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y + L  S FC  + G +  + R  DS+  GC+PV+I +   LPF ++LN++SF++ +  
Sbjct: 630 YHESLASSVFCGVMPG-DGWSGRFEDSILQGCIPVVIQDGIFLPFENMLNYESFAVRIRE 688

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
            +IP L KIL+G++  E       V K+ + F
Sbjct: 689 DEIPNLIKILRGMNETEIEFKLENVRKIWQRF 720


>gi|405969796|gb|EKC34747.1| Exostosin-2 [Crassostrea gigas]
          Length = 698

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 59/101 (58%), Gaps = 5/101 (4%)

Query: 201 GRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSF 260
           G ++  Y D L  SKFC+ ++   +    ++D+L  GC+PVI+A+ Y LPF++VL+WK  
Sbjct: 284 GTVEYTYPDILQDSKFCMVLRSARLGHTALSDALRTGCIPVIVADGYVLPFSEVLDWKRA 343

Query: 261 SIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
           ++V+   ++  + ++LK  S E     + Y ++ +  F W 
Sbjct: 344 AVVIREENLKDVVEVLKSYSME-----RIYQMRRQARFFWE 379


>gi|159478515|ref|XP_001697348.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158274506|gb|EDP00288.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 620

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 67/281 (23%), Positives = 106/281 (37%), Gaps = 55/281 (19%)

Query: 43  FMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGG 102
            M     T +P +AD FFLP               G   +   + + +     YWNR  G
Sbjct: 187 LMSLGLRTVNPEEADYFFLPGC-----------GRGCNKWDQKFKYIMEHYAKYWNRRNG 235

Query: 103 ADHFYVACHSIGRS---------------AMQKAWEVKLNAIQVVCSSSYFISGHIAHKD 147
            DH        GR                 M   W + ++  Q       F + HI  +D
Sbjct: 236 RDHLMTHAGDWGRCEKAWGPDSAPFIANLTMLNHWGITVDRSQET-EHDLFNACHIPDQD 294

Query: 148 VSLP----QIWPRQEDP---PKLGSSKRNK--LAFFAGAV---NSP-------------V 182
           + +P     ++P+ E     PK  +   NK  LA  AG++   NS              V
Sbjct: 295 IQVPVLCGDLYPQFEHNVWHPKRRAHPVNKTVLASVAGSICGWNSAEEPPCKNKYYSFGV 354

Query: 183 REKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVI 242
           R  L    R+      H  +        +  S+FC    G      ++  S+  GC+PVI
Sbjct: 355 RAALWTTLRDKPGF--HIAKRVPVLGQSMAESEFCFAPTGAGHGKRQVV-SVTLGCMPVI 411

Query: 243 IANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEE 283
           I++H   PF   L+W  F + +A  D+P ++ IL+G + ++
Sbjct: 412 ISDHVAQPFEPFLDWNDFGVWIAEADLPDVEAILRGFTPQQ 452


>gi|242077606|ref|XP_002448739.1| hypothetical protein SORBIDRAFT_06g032380 [Sorghum bicolor]
 gi|241939922|gb|EES13067.1| hypothetical protein SORBIDRAFT_06g032380 [Sorghum bicolor]
          Length = 456

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 69/288 (23%), Positives = 125/288 (43%), Gaps = 47/288 (16%)

Query: 55  KADLFFLPFSIARMRHDRRIG---------TEGIPDFISHY-IFNISQKYPYWNRTGGAD 104
           +AD+ F+PF  A +  +  +G          +G  D+     + +    +P W R+GG D
Sbjct: 124 EADVVFVPF-FATLSAEMELGWGTKGAFRKKDGNEDYRRQREVVDRVTSHPAWRRSGGRD 182

Query: 105 HFYVACHSIGRSAMQKAWEVKLN---AIQVVC-----------SSSYFISGHIAHKDVSL 150
           H +V    +        W V+     AI +V            S++   S  I H  VSL
Sbjct: 183 HVFVMTDPVA------MWHVRAEIAPAILLVVDFGGWYKVDSKSANRNSSRMIQHTQVSL 236

Query: 151 PQ--IWPRQEDPPKLGSSK---RNKLAFFAGAVN----SPVREKLLQVWRNDSEIYAHSG 201
            +  I P     P L  S+   R  L +F GA +      VREKL  +  N+ ++    G
Sbjct: 237 LKDVIVPYTHLLPTLLLSENKDRPTLLYFKGAKHRHRGGLVREKLWDLLGNEPDVIMEEG 296

Query: 202 RLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
                  ++    +  S+FCLH  G    + R+ D++   C+PVI+++  +LP+  ++++
Sbjct: 297 FPNATGREQSIKGMQTSEFCLHPAGDTPTSCRLFDAIASLCIPVIVSDEVELPYEGMIDY 356

Query: 258 KSFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHV 302
             FSI V+  +      L   L+ I  ++    +  + +V+  F+++ 
Sbjct: 357 TEFSIFVSVRNAMRPKWLTSYLRNIPKQQKDEFRKNLARVQPIFEYNT 404


>gi|115441893|ref|NP_001045226.1| Os01g0921300 [Oryza sativa Japonica Group]
 gi|19386797|dbj|BAB86176.1| OJ1485_B09.5 [Oryza sativa Japonica Group]
 gi|57899432|dbj|BAD88370.1| exostosin-like [Oryza sativa Japonica Group]
 gi|113534757|dbj|BAF07140.1| Os01g0921300 [Oryza sativa Japonica Group]
 gi|125573139|gb|EAZ14654.1| hypothetical protein OsJ_04578 [Oryza sativa Japonica Group]
 gi|215741014|dbj|BAG97509.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767487|dbj|BAG99715.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 437

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 73/283 (25%), Positives = 121/283 (42%), Gaps = 26/283 (9%)

Query: 52  DPSKADLFFLPF--SIARMRHDRRIGT-EGIPD-FISHYIFNISQKYPYWNRTGGADHFY 107
           DP  A+ FF+PF  S++   H R +   E   D  +   +  I  K  YW R+ G DH  
Sbjct: 132 DPDAAEAFFVPFFSSLSFNVHGRNMTDPETEADRLLQVELMEILWKSKYWQRSAGRDHVI 191

Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIA--HKDVSLPQIWPRQ---EDPPK 162
              H      ++      +NA  ++ +     +  +A   KDV  P +        D P 
Sbjct: 192 PMHHPNAFRFLRDM----VNASILIVADFGRYTKELASLRKDVVAPYVHVVDSFLNDDPP 247

Query: 163 LGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEI-----YAHSGRLKTPYADELLG 213
                R  L FF G         +R KL ++ +    +      A    +KT   + +  
Sbjct: 248 DPFDDRPTLLFFRGRTVRKDEGKIRAKLAKILKGKDGVRFEDSLATGEGIKT-STEGMRS 306

Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI---VVATLDIP 270
           SKFCLH  G   ++ R+ D++   CVPVI+++  +LPF D +++  FS+   V   L   
Sbjct: 307 SKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSSRIELPFEDEIDYSEFSLFFSVEEALRPD 366

Query: 271 LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
            L   L+ I   +++ + S +  V  H+++   P   DA  M+
Sbjct: 367 YLLNQLRQIQKTKWVEIWSKLKNVSHHYEFQNPPRKGDAVNMI 409


>gi|302787839|ref|XP_002975689.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
 gi|300156690|gb|EFJ23318.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
          Length = 452

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 59/232 (25%), Positives = 107/232 (46%), Gaps = 33/232 (14%)

Query: 97  WNRTGGADHFYVACHSIG----RSAMQKA---------WEVKLNAIQVVCSSSYFISGHI 143
           W R+ G DH +V    +     R  +  A         W ++ +A   + SS+      +
Sbjct: 177 WRRSNGVDHVFVLADPMAMWHVREQISTAVFLVVDFGGWYLE-DAKNKLNSSTIIQHSQV 235

Query: 144 AH-KDVSLPQ--IWPRQEDPPKLGSSK-RNKLAFFAGAV----NSPVREKLLQVWRNDSE 195
           +  KDV +P   + P    P K+   + R  L +F GA     +  VREKL ++  N+ E
Sbjct: 236 SPIKDVIIPHTHLLP----PLKIADDQHRTVLLYFRGARHRHRSGLVREKLWKILDNEPE 291

Query: 196 IYAHSGRLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
           +    G        E    +  S+FCL   G   ++ R+ D++   C+PVI+++   LPF
Sbjct: 292 VLLEEGLPDDAGLAEATRGMRSSEFCLTPAGDTPSSCRLYDAIASLCIPVIVSDDIQLPF 351

Query: 252 ADVLNWKSFSIVVATLDIPL---LKKILKGISSEEYLLLQSYVLKVRKHFQW 300
              +N++ F + V+  D      L + L+ I SEE   ++  + +V+++F++
Sbjct: 352 EGFVNYEEFCVFVSARDATQPGWLVQKLRSIGSEERSTMRQTLSRVQRYFEY 403


>gi|449453962|ref|XP_004144725.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
           IRX7-like [Cucumis sativus]
          Length = 518

 Score = 64.7 bits (156), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 59/224 (26%), Positives = 102/224 (45%), Gaps = 24/224 (10%)

Query: 95  PYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVV----CSSSYFISGHI-AHKDVS 149
           P W R+ G DH     H      ++K  +   NAI ++     + +++  G +   KD+ 
Sbjct: 238 PAWKRSEGRDHILPVHHPWSFKTVRKFMK---NAIWLLPDMDSTGNWYKPGQVFLEKDLI 294

Query: 150 LPQIWPRQE--DPPKLG--SSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSG 201
           LP + P  E  D   L    SKR+ L FF G +       +R KL        ++    G
Sbjct: 295 LPYV-PNVELCDSKCLSYQQSKRSILLFFRGRLKRNAGGKIRAKLGGELSGADDVLIEEG 353

Query: 202 RL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
                 K      +  S FCL   G   ++AR+ D++  GC+PVI+++  +LPF  +L++
Sbjct: 354 TAGEGGKAAAQTGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDY 413

Query: 258 KSFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHF 298
           +  ++ V++ D      L   L+  S+ +   LQ  + K+ +HF
Sbjct: 414 RKIALFVSSSDALKSGWLLTYLRSFSAADIRRLQQNLAKLSRHF 457


>gi|148229160|ref|NP_001083108.1| uncharacterized protein LOC398750 [Xenopus laevis]
 gi|37805422|gb|AAH60367.1| MGC68803 protein [Xenopus laevis]
          Length = 718

 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 32/82 (39%), Positives = 49/82 (59%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  S FC+ ++G  +    ++D L  GCVPVIIA+ Y LPF++VL+WK  S+V+  
Sbjct: 308 YPQILQESTFCIVLRGARLGQGVLSDVLQAGCVPVIIADSYVLPFSEVLDWKRASVVIPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+GIS  +   +Q
Sbjct: 368 EKMFEMYSILQGISQRQVEEMQ 389


>gi|302835858|ref|XP_002949490.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
 gi|300265317|gb|EFJ49509.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
          Length = 499

 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 76/341 (22%), Positives = 118/341 (34%), Gaps = 72/341 (21%)

Query: 38  YFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPD--FISHYIFNISQKYP 95
           YF +  + S     D   AD +F+P           +      D  F+   +  I   +P
Sbjct: 95  YFWQRLLGSGARVADGDLADWYFIP-----------VRQRSFSDSWFLREALSYIRTHHP 143

Query: 96  YWNRTGGADHFYVACHSIGRSAMQK-------------------------AWEVKLNAIQ 130
           +WNRT G  H  +     G   + K                          W       +
Sbjct: 144 WWNRTEGHRHMVLHTGDWGLGEVAKDVRQMSLNVTWLTHWGLSTDRPNIQRWTRAFRPER 203

Query: 131 VVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNK-LAFFAGAV----------- 178
            V    Y   GH  H  ++   + P      +  +  RN+ L FFAG +           
Sbjct: 204 DVVIPVYISPGHFVHFGINRSPLHPVTAASRRTAARPRNESLLFFAGRICHDAKRPNPDT 263

Query: 179 ------------NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVN 226
                          VREK      N S    H  R +  Y+  +  S FCL   G   +
Sbjct: 264 FPACGDDTAEWYGGGVREKFFVSHWNRSGF--HVVRSEPRYSHYMSRSVFCLAPPG-AGH 320

Query: 227 TARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLL 286
             R   +L+ GCVPV +A+    PF   L+W+ + + +A  DIP   ++L G++ E+   
Sbjct: 321 GQRQIQALFMGCVPVTVADGVYEPFEPALSWEEWGLRIAEQDIPRAHELLGGLTREQLAE 380

Query: 287 LQSYVLKVRKHFQWHVFP-------SDYDAFYMVMYELWLR 320
            QS +    +H  +             YDAF   +  L +R
Sbjct: 381 KQSRMHCAAQHMLYSTITGAVLGEDGRYDAFETTLEVLRVR 421


>gi|321261385|ref|XP_003195412.1| hypothetical protein CGB_G5240C [Cryptococcus gattii WM276]
 gi|317461885|gb|ADV23625.1| Conserved hypothetical protein [Cryptococcus gattii WM276]
          Length = 1130

 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 44/171 (25%), Positives = 84/171 (49%), Gaps = 17/171 (9%)

Query: 145  HKDVSLPQIWPR----QEDPPKLGSSK----RNKLAFFAGAVNSPVREKLLQVWRN---- 192
            H+DV +P    R    +E  P + + K    R+ L  ++G  +   + + +++  N    
Sbjct: 921  HQDVVIPARTCRSNTLRETFPNVEAIKPMRERSNLLMWSGTYSGTGKSERIRLTCNRGGA 980

Query: 193  -DSEIYAHSGR----LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHY 247
             D E+    G+      + Y  +L  ++FC   +G    + + +D++Y GC+PV I+   
Sbjct: 981  GDRELIKGGGKQSNFASSDYMKDLNNARFCAQPRGIAGWSPQTSDAIYAGCIPVFISEGT 1040

Query: 248  DLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
              PFAD L+W   S+ VA  ++  ++K+L  I   +   LQ+ ++ VR+ F
Sbjct: 1041 HYPFADFLDWSKLSVRVAPTELDKIEKVLAAIPLSKVEELQANLVCVREAF 1091


>gi|326920358|ref|XP_003206441.1| PREDICTED: exostosin-2-like [Meleagris gallopavo]
          Length = 718

 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 49/82 (59%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPVIIA+ Y LPF++VL+WK  S+V+  
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVIIADSYILPFSEVLDWKRASVVIPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +P +  IL+ +   +   +Q
Sbjct: 368 EKMPEMYSILQSVPQRQIEEMQ 389


>gi|159489402|ref|XP_001702686.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158280708|gb|EDP06465.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 615

 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 77/337 (22%), Positives = 128/337 (37%), Gaps = 65/337 (19%)

Query: 39  FKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWN 98
           F +  M +   T +  +AD FF+P +   +    +        +I  YI N    +PYW+
Sbjct: 207 FWQRLMSAGLRTVNGDEADYFFIPLNTRTLMAPEQAA------WILPYIRNT---WPYWD 257

Query: 99  RTGGADHFYVACHSIGRSAMQKAWEVKLN------------AIQVVCSSSYFISGHIAHK 146
           R  G  H  +    +G   +      K+N             +        +   H   K
Sbjct: 258 RDNGHRHLIIHTGDMGLHELPLGLRRKMNETLSNITWLTHWGLHTYHPIGTWFPAHRPGK 317

Query: 147 DVSLP--------QIWPRQ----EDPPKLGSS-KRNKLAFFAGAV--------------- 178
           D+ +P        Q+ P      E   K G    R +  FFAG +               
Sbjct: 318 DIVIPVMITTPGFQLSPLNPAVAEKAAKRGRPYTREQTFFFAGRICGDRKPPDPLTHECA 377

Query: 179 ------NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIAD 232
                 ++ VR+++     N +     +G  K  Y  E+   KFCL   G      ++  
Sbjct: 378 PKRTDYSASVRQRVYFHHHNRTGFKVLTGTSK--YMQEITSHKFCLAPTGGGHGKRQVLV 435

Query: 233 SLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVL 292
           +L  GC+PV I +    PF   L W  FS+ VA  DIP L ++L+ +  E+   +QS + 
Sbjct: 436 ALM-GCIPVTITDGVYQPFEPELPWADFSVPVAEDDIPRLHEVLEALPPEQVEQMQSRLH 494

Query: 293 KVRKHFQWH-----VFPSD--YDAFYMVMYELWLRRS 322
              +H  +      +   D  YDAF  ++  L +R++
Sbjct: 495 CAAQHMFYSSSLGAIIGEDGRYDAFETMIEILRVRKA 531


>gi|289166872|gb|ADC84486.1| glycosyltransferase family 47A [Salix miyabeana]
          Length = 215

 Score = 64.7 bits (156), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 48/162 (29%), Positives = 75/162 (46%), Gaps = 15/162 (9%)

Query: 142 HIAHKD--VSLPQIWPRQEDPPKLGSSK--RNKLAFFAGAV----NSP-----VREKLLQ 188
           H+  KD  +++P   P Q+    L   K  R+   +F G      N P      R     
Sbjct: 41  HVCLKDGSITIPPYAPPQKMQTHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAA 100

Query: 189 VWRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANH 246
           VW N  D+ ++  S      Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+ 
Sbjct: 101 VWENFKDNPLFDISTEHPATYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVIIADD 160

Query: 247 YDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
             LPFAD + W+   + V   D+P L  IL  I  E  L  Q
Sbjct: 161 IVLPFADAIPWEDIGVYVDEEDVPNLDTILTSIPPEVILRKQ 202


>gi|449520748|ref|XP_004167395.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
           IRX7-like [Cucumis sativus]
          Length = 517

 Score = 64.3 bits (155), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 59/224 (26%), Positives = 102/224 (45%), Gaps = 24/224 (10%)

Query: 95  PYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVV----CSSSYFISGHI-AHKDVS 149
           P W R+ G DH     H      ++K  +   NAI ++     + +++  G +   KD+ 
Sbjct: 238 PAWKRSEGRDHILPVHHPWSFKTVRKFMK---NAIWLLPDMDSTGNWYKPGQVFLEKDLI 294

Query: 150 LPQIWPRQE--DPPKLG--SSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSG 201
           LP + P  E  D   L    SKR+ L FF G +       +R KL        ++    G
Sbjct: 295 LPYV-PNVELCDRKCLSYQQSKRSILLFFRGRLKRNAGGKIRAKLGGELSGADDVLIEEG 353

Query: 202 RL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
                 K      +  S FCL   G   ++AR+ D++  GC+PVI+++  +LPF  +L++
Sbjct: 354 TAGEGGKAAAQTGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDY 413

Query: 258 KSFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHF 298
           +  ++ V++ D      L   L+  S+ +   LQ  + K+ +HF
Sbjct: 414 RKIALFVSSSDALKSGWLLTYLRSFSAADIRRLQQNLAKLSRHF 457


>gi|71897059|ref|NP_001026520.1| exostosin-2 [Gallus gallus]
 gi|60098757|emb|CAH65209.1| hypothetical protein RCJMB04_7p22 [Gallus gallus]
          Length = 567

 Score = 64.3 bits (155), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 49/82 (59%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPVIIA+ Y LPF++VL+WK  S+V+  
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVIIADSYILPFSEVLDWKRASVVIPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +P +  IL+ +   +   +Q
Sbjct: 368 DKMPEMYSILQSVPQRQIEEMQ 389


>gi|405121937|gb|AFR96705.1| hypothetical protein CNAG_03480 [Cryptococcus neoformans var. grubii
            H99]
          Length = 1125

 Score = 63.9 bits (154), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 42/151 (27%), Positives = 72/151 (47%), Gaps = 13/151 (8%)

Query: 161  PKLGS----SKRNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR----LKTPY 207
            P +GS    S+R+ L  +AG       +  +R    +    D E+    G+        Y
Sbjct: 936  PNVGSIKPMSERSNLLMWAGTHWVTGKSERIRLTCDRGGAGDRELIKGGGKQSNFANGDY 995

Query: 208  ADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATL 267
             ++L  ++FC   +G    + +  D++Y GC+PV IA     PFA  L+W   S+ VA  
Sbjct: 996  INDLNNARFCPQPRGITGWSPQTNDAIYAGCIPVFIAEGTHYPFAGFLDWSKLSVRVAPT 1055

Query: 268  DIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
            ++  ++KIL  I   +   LQ+ ++ VR+ F
Sbjct: 1056 ELDKIEKILAAIPLSKVEELQANLVSVREAF 1086


>gi|296088686|emb|CBI38136.3| unnamed protein product [Vitis vinifera]
          Length = 291

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 56/199 (28%), Positives = 89/199 (44%), Gaps = 33/199 (16%)

Query: 97  WNRTGGADHFYVACHSIGRSAMQKAWEVKLN---AIQVVC-----------SSSYFISGH 142
           W R+GG DH +V    +        W VK     AI +V            +S+  +S  
Sbjct: 10  WKRSGGRDHVFVLTDPVA------MWHVKAEIAPAILLVVDFGGWYKLDSKASNNSLSEM 63

Query: 143 IAHKDVSLPQ--IWPRQEDPPKLGSSK---RNKLAFFAGAV----NSPVREKLLQVWRND 193
           I H  VSL +  I P     P+L  S+   R  L +F GA        VREKL  +   +
Sbjct: 64  IQHTQVSLLKDVIVPYTHLLPRLHLSENQIRQTLLYFKGAKHRHRGGLVREKLWDLLVYE 123

Query: 194 SEIYAHSGRLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
             +    G       ++    +  S+FCLH  G    + R+ D++   C+PVI++++ +L
Sbjct: 124 QGVIMEEGFPNATGREQSIKGMRTSEFCLHPAGDTPTSCRLFDAIQSLCIPVIVSDNIEL 183

Query: 250 PFADVLNWKSFSIVVATLD 268
           PF  ++++  FS+ VA  D
Sbjct: 184 PFEGMVDYSEFSVFVAVRD 202


>gi|289166870|gb|ADC84485.1| glycosyltransferase family 47A [Salix sachalinensis]
          Length = 215

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 47/160 (29%), Positives = 73/160 (45%), Gaps = 14/160 (8%)

Query: 140 SGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAV----NSP-----VREKLLQVW 190
            G IA    + PQ     + PP    + R+   +F G      N P      R     VW
Sbjct: 46  EGSIAIPPFAPPQKMQAHQIPP---DTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVW 102

Query: 191 RN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
            N  ++ ++  S    T Y +++  + FCL   G+   + R+ +++ +GC+ VIIA+   
Sbjct: 103 ENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCISVIIADDIV 162

Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
           LPFAD + W+   + VA  D+P L  IL  I  E  L  Q
Sbjct: 163 LPFADAIPWEEIGVFVAEEDVPNLDTILTTIPPEVILRKQ 202


>gi|324508547|gb|ADY43609.1| Exostosin-1b, partial [Ascaris suum]
          Length = 628

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 74/323 (22%), Positives = 130/323 (40%), Gaps = 70/323 (21%)

Query: 5   FRVYVYPHRRNDP----FANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           FRVYVYP          +AN+L                  K   +S + T DPSKA LF 
Sbjct: 82  FRVYVYPDNNESTVSIVYANIL------------------KVLRESVYYTDDPSKACLFV 123

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYP-YWNRTGGADHFYV----------A 109
           L  SI  +  DR+       +++ H    I   +   WN  GG +H             +
Sbjct: 124 L--SIDTIDRDRKS-----ENYVKHVDEQIEALHSDLWN--GGRNHIIFNLYHGTYPDYS 174

Query: 110 CHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSL----PQIWPRQEDPPKLGS 165
            H +G   +  A   + +A   +   ++ +S  + HK+ SL      +WP +     L S
Sbjct: 175 DHDLGFD-VGYAMVARASANAQIFRPNFDLSFPLFHKEHSLRTVVESVWPLKLKDEYLVS 233

Query: 166 SKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKT---PYADE------------ 210
            K  +  +    + S  R+ L  +    S +   + +       Y DE            
Sbjct: 234 FKGKRYVY---GIGSETRDSLYHLHNAHSVVMVTTCKHNNDWKKYEDERCDEDNIEYERW 290

Query: 211 -----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVA 265
                +  S FCL  +G  + + R  +SL  GC+PVI+++ ++LPF+++++W   +++  
Sbjct: 291 DYETTMSNSTFCLTPRGRRLGSFRFLESLRLGCIPVILSDDWELPFSEIIDWSQAAVIAH 350

Query: 266 TLDIPLLKKILKGISSEEYLLLQ 288
              +  +  +L  I  E  L ++
Sbjct: 351 EDTVLTISDVLNAIPLERVLYMK 373


>gi|224128538|ref|XP_002320357.1| predicted protein [Populus trichocarpa]
 gi|222861130|gb|EEE98672.1| predicted protein [Populus trichocarpa]
          Length = 137

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/118 (30%), Positives = 65/118 (55%), Gaps = 3/118 (2%)

Query: 211 LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI- 269
           +  SKFCL++ G   ++ R+ D++   CVPVII++  +LPF DVL++  F + V   D  
Sbjct: 1   MASSKFCLNIAGDTPSSNRLFDAIASQCVPVIISDGIELPFEDVLDYSEFGVFVRASDAV 60

Query: 270 --PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
               L  +L+GI  +++ +L   + ++   F++       DA  MV   +  ++SSV+
Sbjct: 61  KKGYLLYLLRGIKKDQWTILWERLKEIAPQFEYRYPSQPGDAVDMVWEAVLRKKSSVQ 118


>gi|356576131|ref|XP_003556187.1| PREDICTED: probable glucuronosyltransferase Os01g0926400-like
           [Glycine max]
          Length = 481

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 62/254 (24%), Positives = 107/254 (42%), Gaps = 44/254 (17%)

Query: 52  DPSKADLFFLPFSIARMRHDRRIGTEGIPDFISH---------YIFNISQKYPYWNRTGG 102
           DP  AD+ F+PF  A +  + ++G         H          + +  +    WNR+GG
Sbjct: 146 DPLLADVVFVPF-FATLSAEMQLGANKGAFRKKHDNDDYKRQRQVMDAVKNTHAWNRSGG 204

Query: 103 ADHFYVACHSIGRSAMQKAWEVKLNAIQVVC---------------SSSYFISGHIAHKD 147
            DH +V    +        W VK      V                 S+   S  I H  
Sbjct: 205 RDHVFVLTDPVA------MWHVKDEIAPAVLLVVDFGGWYRLDSRGGSNCSESDVIPHTQ 258

Query: 148 VSLPQ--IWPRQEDPPKLGSS---KRNKLAFFAGAVN----SPVREKLLQVWRNDSEIYA 198
           VS+ +  I P     P+L  S   +R++L +F GA +      +REKL  +  ++  +  
Sbjct: 259 VSVIKDVIVPYTHLLPRLDLSDNKERHQLLYFKGAKHRHRGGIIREKLWDLLVSEPGVIM 318

Query: 199 HSGRLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
             G       ++    +  S+FCLH  G    + R+ D++   C+PVI++++ +LPF  +
Sbjct: 319 EEGFPNATGREQSIKGMQTSEFCLHPAGDTPTSCRLFDAIQSLCIPVIVSDNIELPFEGM 378

Query: 255 LNWKSFSIVVATLD 268
           +++  FS+  A  D
Sbjct: 379 VDYAEFSVFAAVSD 392


>gi|432908332|ref|XP_004077815.1| PREDICTED: exostosin-1b-like [Oryzias latipes]
          Length = 741

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 77/357 (21%), Positives = 136/357 (38%), Gaps = 83/357 (23%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++                G   SESY     +   S F T DP+KA LF L
Sbjct: 108 GFKVYVYPQQK----------------GEKLSESYQNILSSIEGSRFHTPDPAKACLFVL 151

Query: 62  PFSIARMRHDRRIGTEGIPDFISHYIFNIS---QKYPYWNRTGGADHFYVACHSIGRSAM 118
             S+  +  D+             Y+ N+    Q  P WN   G +H     +S      
Sbjct: 152 --SLDTLDRDQ---------LSPQYVHNLKAKIQSLPLWNE--GRNHIIFNLYSGTWPDY 198

Query: 119 QKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK----------- 167
            +     +    ++  +S        + DVS+P      +D P+ G  +           
Sbjct: 199 TEDLGFDI-GFAMLAKASISTENFRPNFDVSVPLF---SKDHPRTGGERGFLKHNTIPPY 254

Query: 168 RNKLAFFAG-----AVNSPVREKLLQV-----------------WRNDSEIYAHSGRL-- 203
           R  +  F G      + S  R  L  V                 W+   ++      +  
Sbjct: 255 RKYMLVFKGKRYLTGIGSDTRNALYHVHNSEDVVLLTTCKHGKDWQKHKDVRCDKDNVEY 314

Query: 204 -KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
            K  Y + L  S FCL  +G  + + R  ++L   CVPV+++N ++LPF++++NW + ++
Sbjct: 315 DKYDYREMLYNSTFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEIINWNTAAV 374

Query: 263 VVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQ--WHVFPSDYDAFYMVMYEL 317
           +         +++L  I S    + Q  +L +R+  Q  W  + +  +   +   E+
Sbjct: 375 IGD-------ERLLLQIPSTVRSIHQDKILSLRQQTQFLWEAYFNSLEKIVLTTLEI 424


>gi|159471277|ref|XP_001693783.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158283286|gb|EDP09037.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 611

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 69/296 (23%), Positives = 110/296 (37%), Gaps = 60/296 (20%)

Query: 38  YFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYW 97
           +F +    S     D +KAD +F+P  + R   D  +        +   I ++   +P++
Sbjct: 183 HFLEMLTASGARVADGAKADWYFIPVRL-RSSSDAYV--------LQRAIHHLRHAHPWF 233

Query: 98  NRTGGADHFYVACHSIGRSAMQKA-----------WEVKLNAIQVVCSSSYFISGHIAHK 146
           N TGG  HF +A   +GR   ++            W +  +      +SS + + H    
Sbjct: 234 NATGGGRHFVIAVGDMGRLESERGPLSANVTFVSHWGLASSKKAERLNSSPWRASHRNAT 293

Query: 147 DVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAVNSPVREK----------LLQ-------- 188
           D+ LP ++       K G ++      FA      +RE+          + Q        
Sbjct: 294 DIVLP-VYISLRKLEKYGITRSRHHPKFATMAPPEIRERNGPLFWFAGRICQDRSKPATD 352

Query: 189 -VWRNDSEIYAHSGRLKTP-------------------YADELLGSKFCLHVKGFEVNTA 228
            VW N      +S   +                     Y+  LL SKFC    G      
Sbjct: 353 GVWPNCPGAMGYSAMTRQAVHFHHWDRPGYFVAPGDKNYSRHLLTSKFCFGAMGGGHGQR 412

Query: 229 RIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
           ++  +L  GCVPV+I +     +   L+W  F + VA  DIP L  IL  I  EEY
Sbjct: 413 QLQAALA-GCVPVVIGDGVLEAWEPYLDWNDFGVRVAEADIPRLHTILGAIGPEEY 467


>gi|291384884|ref|XP_002708900.1| PREDICTED: exostosin 2 [Oryctolagus cuniculus]
          Length = 765

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 51/163 (31%), Positives = 77/163 (47%), Gaps = 21/163 (12%)

Query: 147 DVSLPQIWP--RQEDPPKLGSSKRNKLAFFAG-AVNSPVREKL--LQVWRNDSEIYAH-- 199
           DVS+P   P   + D P+ G   R      +  AV+   RE+L  LQ    +S +  H  
Sbjct: 274 DVSIPVYSPLSAEVDLPEKGPGPRRYFLLSSQMAVHPEFREELEALQAKHGESVLVLHQC 333

Query: 200 ----SGRLKTP----------YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIAN 245
                G L             Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+
Sbjct: 334 TNLSEGALSVRKRCHKHQVFHYPQVLQEATFCVVLRGARLGQAALSDVLQAGCVPVVIAD 393

Query: 246 HYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
            Y LPF++VL+WK  S+VV    +  + KIL+ I   +   +Q
Sbjct: 394 SYILPFSEVLDWKRASVVVPEEKMADVYKILQSIPQRQIQEMQ 436


>gi|159470093|ref|XP_001693194.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158277452|gb|EDP03220.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 673

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 66/303 (21%), Positives = 118/303 (38%), Gaps = 29/303 (9%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFF-------LPFSIARMRHDRRIGTEGIP----- 80
           YA+++   +  + S   T DP +AD F+       LPF I            G P     
Sbjct: 297 YAADTLLHELLLVSPHRTFDPEEADFFYVPHQASCLPFPIGNWADWPWFKGPGGPRIRQM 356

Query: 81  -DFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFI 139
            + I      I Q YP+W R GG DH +   H  G       W   +    +  +    +
Sbjct: 357 LNMIMETRDWIDQHYPFWKRRGGRDHIWTFTHDEG-----ACWAPNVLNTSIWLTHWGRM 411

Query: 140 SGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAH 199
                     +P  + R  D       +  ++         P ++ ++  ++      A 
Sbjct: 412 DPDHTSNTAFVPDRYDR--DFKSAYQPEGYRVHMQGHPCYRPGQDLVIPAFKRPDHYRAS 469

Query: 200 SGRLKTPYADELLG--------SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
                T    EL G        S FCL   G +  +AR+ D++ +GC+PVII ++  + F
Sbjct: 470 PLAAATSKPRELPGDYSDMLSRSLFCLVAAG-DGWSARLEDAVLHGCIPVIIIDNVHVVF 528

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFY 311
             +L+  SFS+ +A  D+  + +IL+ I   +    Q+++  V   +++   P   +   
Sbjct: 529 ESILDIDSFSVRIAEADVDRILEILQAIPERKIRFKQAHLGHVFHRYRYAALPGLENNLR 588

Query: 312 MVM 314
            +M
Sbjct: 589 QLM 591


>gi|297721259|ref|NP_001172992.1| Os02g0520750 [Oryza sativa Japonica Group]
 gi|255670949|dbj|BAH91721.1| Os02g0520750 [Oryza sativa Japonica Group]
          Length = 213

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 39/134 (29%), Positives = 68/134 (50%), Gaps = 6/134 (4%)

Query: 189 VWRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANH 246
           +W N  ++ ++  S      Y +++  S FCL   G+   + R+ +++ +GC+PVIIA+ 
Sbjct: 48  LWENFKNNPLFDISTEHPATYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 107

Query: 247 YDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSD 306
             LPFAD + W    + V   D+P L  IL  I  ++ +L +  +L      Q  +FP  
Sbjct: 108 IVLPFADAIPWDEIGVFVDEEDVPRLDSILTSIPIDD-ILRKQRLLANPSMKQAMLFPQP 166

Query: 307 Y---DAFYMVMYEL 317
               DAF+ ++  L
Sbjct: 167 AQPRDAFHQILNGL 180


>gi|255076909|ref|XP_002502118.1| glycosyltransferase family 47 protein [Micromonas sp. RCC299]
 gi|226517383|gb|ACO63376.1| glycosyltransferase family 47 protein [Micromonas sp. RCC299]
          Length = 717

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 48/168 (28%), Positives = 77/168 (45%), Gaps = 19/168 (11%)

Query: 168 RNKLAFFAGA-VNSP-------VREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLH 219
           R  L FFAGA V+ P       + E +    +    +  H+G+    Y      S FC+ 
Sbjct: 449 RRWLLFFAGAWVDKPAYADRRAIAEAMAGREQEGIHVVQHAGQF---YEKNYASSTFCIA 505

Query: 220 VKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGI 279
             G      R+  +   GC+PVI+ ++   P+ DVL +  FS+ VA  DIP +  I+K I
Sbjct: 506 PTGSGWGR-RMNLATQSGCIPVIVQDNIAAPYDDVLPYDEFSVRVAKADIPKIPDIVKAI 564

Query: 280 SSEEYLLLQSYVLKVRKHFQW-HVFPSDY------DAFYMVMYELWLR 320
           + E+   ++  +    +  QW  +  SD+      DAF ++M  L  R
Sbjct: 565 TPEKLDRMRQQLACAARALQWSSILGSDFGEGGENDAFALLMLTLQHR 612


>gi|345305619|ref|XP_001510258.2| PREDICTED: exostosin-2 [Ornithorhynchus anatinus]
          Length = 615

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 49/82 (59%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  + ++D L  GCVPVI+A+ Y LPF++VL+WK  S+V+  
Sbjct: 308 YPQVLQEATFCVVLRGARLGQSVLSDVLQAGCVPVILADSYILPFSEVLDWKRASVVIPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +P +  IL+ I   +   +Q
Sbjct: 368 EKMPEMYSILQSIPQRQIEEMQ 389


>gi|356500491|ref|XP_003519065.1| PREDICTED: uncharacterized protein LOC100783624 [Glycine max]
          Length = 795

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 80/334 (23%), Positives = 135/334 (40%), Gaps = 70/334 (20%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPF----SIAR--------MRHDRRIGTEGIP 80
           Y ++    ++ + S   T +  +AD FF+P      I R        M+    + +    
Sbjct: 395 YGAQIALYESLLASPHRTLNGEEADFFFVPVLDSCIITRADDAPHLSMQEHMGLRSSLTL 454

Query: 81  DFISHYIFNISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQV 131
           ++      +I ++YPYWNR+ G DH +       AC++   I  S M   W    N    
Sbjct: 455 EYYKKAYIHIVEQYPYWNRSSGRDHVWSFSWDEGACYAPKEIWNSMMLVHWG-NTNTKHN 513

Query: 132 VCSSSYFISG--------------HIAHKDVSLPQIWPRQEDPPKLGS-------SKRNK 170
             +++Y+                     KD+ LP  W +  D   L S        KR  
Sbjct: 514 HSTTAYWADNWDKISSDKRGTHPCFDPDKDLVLPA-W-KVPDANVLTSKLWAWSHEKRKT 571

Query: 171 LAFFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGR------LKTP----- 206
           L +F G +                +R+KL + + +        G+      + TP     
Sbjct: 572 LFYFNGNLGPAYPHGRPEDTYSMGIRQKLAEEFGSSPNKDGKLGKQHAKDVIVTPERSEN 631

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y  +L  S FC    G +  + R+ DS+  GC+PV+I +   LP+ +VLN+ SF++ +  
Sbjct: 632 YHLDLASSVFCGVFPG-DGWSGRMEDSILQGCIPVVIQDGIFLPYENVLNYDSFAVRIPE 690

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
            +IP L KIL+G +  E       V K+ + F +
Sbjct: 691 AEIPNLIKILRGFNDTEIEFKLENVQKIWQRFMY 724


>gi|355686834|gb|AER98200.1| exostoses 2 [Mustela putorius furo]
          Length = 717

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 31/82 (37%), Positives = 49/82 (59%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPVI+A+ Y LPF++VL+WK  S+VV  
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAALSDVLRAGCVPVIVADSYVLPFSEVLDWKRASVVVPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+GI   +   +Q
Sbjct: 368 EKLSDVYGILQGIPRRQIEEMQ 389


>gi|148235505|ref|NP_001080448.1| exostosin 2 [Xenopus laevis]
 gi|27881752|gb|AAH44703.1| Ext2-prov protein [Xenopus laevis]
          Length = 718

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 31/82 (37%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  S FC+ ++G  +    ++D L  GCVPVIIA+ Y LPF++VL+WK  S+V+  
Sbjct: 308 YPQILQESTFCIVLRGARLGQGLLSDVLQAGCVPVIIADSYVLPFSEVLDWKRASVVIPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+GI   +   +Q
Sbjct: 368 EKMFEMYSILQGIPQRQVEEMQ 389


>gi|222636093|gb|EEE66225.1| hypothetical protein OsJ_22377 [Oryza sativa Japonica Group]
          Length = 341

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 47/152 (30%), Positives = 69/152 (45%), Gaps = 10/152 (6%)

Query: 50  TKDPSKADLFFLPFSIARM-----RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGAD 104
            + P +A  FFLP S+A +     R D     +     ++ Y+  ++  YP+WNR+ GAD
Sbjct: 183 ARHPGEAHAFFLPISVASIAGYVYRRDMIDFWDPQLRLVAGYVDGLAAMYPFWNRSRGAD 242

Query: 105 HFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLG 164
           HF V+CH           E++ NAI+V+C +     G     DV+LP +       P  G
Sbjct: 243 HFLVSCHQWAPILSAAKAELRGNAIRVMCDAD-MSDGFDPATDVALPPVVASARATPPQG 301

Query: 165 --SSKRNKL--AFFAGAVNSPVREKLLQVWRN 192
             +S+R  L      G     VRE LL  W  
Sbjct: 302 RVASERTVLAFFAAGGGGGGAVREVLLTRWEG 333


>gi|307176975|gb|EFN66281.1| Exostosin-2 [Camponotus floridanus]
          Length = 563

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 27/79 (34%), Positives = 49/79 (62%)

Query: 211 LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIP 270
           L  + FCL ++G  +  + + D++  GC+PVIIA+   +PF DV++W   ++ V  +DI 
Sbjct: 160 LPSATFCLIIRGARLAQSSLLDAMAAGCIPVIIADSLTMPFHDVIDWTKAAVFVREVDIL 219

Query: 271 LLKKILKGISSEEYLLLQS 289
           L+ ++LK IS +  + +Q 
Sbjct: 220 LIIQLLKKISHQRIMEMQE 238


>gi|432851754|ref|XP_004067068.1| PREDICTED: exostosin-2-like [Oryzias latipes]
          Length = 719

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 31/82 (37%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  S FC+ ++G  +  A ++D L  GCVPVI+A+ Y LPF++VL+WK  S+V+  
Sbjct: 309 YPQILQESSFCVVLRGARLGQATLSDVLQAGCVPVILADSYILPFSEVLDWKRASVVIPE 368

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  ILK I   +   +Q
Sbjct: 369 EKLSEMYTILKSIPHRQVEEMQ 390


>gi|348558689|ref|XP_003465149.1| PREDICTED: exostosin-2-like isoform 2 [Cavia porcellus]
          Length = 669

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/82 (39%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPVIIA+ Y LPF++VL+WK  S+VV  
Sbjct: 307 YPQVLQEATFCVVLRGARLGQAALSDVLQAGCVPVIIADSYILPFSEVLDWKRASVVVPE 366

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             I  +  IL+ I   +   +Q
Sbjct: 367 EKIADVYSILQSIPRRQMEEMQ 388


>gi|348558687|ref|XP_003465148.1| PREDICTED: exostosin-2-like isoform 1 [Cavia porcellus]
          Length = 717

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/82 (39%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPVIIA+ Y LPF++VL+WK  S+VV  
Sbjct: 307 YPQVLQEATFCVVLRGARLGQAALSDVLQAGCVPVIIADSYILPFSEVLDWKRASVVVPE 366

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             I  +  IL+ I   +   +Q
Sbjct: 367 EKIADVYSILQSIPRRQMEEMQ 388


>gi|118481039|gb|ABK92473.1| unknown [Populus trichocarpa]
          Length = 239

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 45/167 (26%), Positives = 76/167 (45%), Gaps = 15/167 (8%)

Query: 165 SSKRNKLAFFAGAV------------NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELL 212
           + +R+  AFF G +            +  VR  + + +  D   Y    R    Y  E++
Sbjct: 58  NGRRDIWAFFRGKMEVHPKNISGRYYSKKVRTVIWRKYSGDRRFYLQRHRFAG-YQSEIV 116

Query: 213 GSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLL 272
            S FCL   G+   + R+ +S+  GCVPVIIA+   LPF   + W   S+ VA  D+  L
Sbjct: 117 RSVFCLCPLGWAPWSPRLVESVALGCVPVIIADGIRLPFPTAVRWSEISLTVAEKDVANL 176

Query: 273 KKILKGISSEEYLLLQSYVL--KVRKHFQWHVFPSDYDAFYMVMYEL 317
             +L  +++     +Q  +    VR+   ++    + DA + V+Y L
Sbjct: 177 GTLLDHVAATNLSAIQKNLWDPDVRRALLFNDRVQEGDATWQVLYAL 223


>gi|348509623|ref|XP_003442347.1| PREDICTED: exostosin-2-like [Oreochromis niloticus]
          Length = 719

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 31/82 (37%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  S FC+ ++G  +  A ++D L  GCVPVI+A+ Y LPF++VL+WK  S+V+  
Sbjct: 309 YPQILQESSFCVVLRGARLGQAALSDVLQAGCVPVILADSYILPFSEVLDWKRASVVIPE 368

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  ILK I   +   +Q
Sbjct: 369 EKLSEMYTILKSIPHRQVEEMQ 390


>gi|410911648|ref|XP_003969302.1| PREDICTED: exostosin-1b-like [Takifugu rubripes]
          Length = 740

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 74/342 (21%), Positives = 139/342 (40%), Gaps = 53/342 (15%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++                G   SESY         S F T DP +A LF L
Sbjct: 107 GFKVYVYPQQK----------------GEKISESYQNILSTIEGSRFYTSDPGQACLFVL 150

Query: 62  PFSI-------ARMRHDRRIGTEGIP---DFISHYIFNI-SQKYPYWNRTGGAD--HFYV 108
                       +  H+ +   + +P   D  +H IFN+ S  +P +    G D     +
Sbjct: 151 SLDTLDRDQLSPQYVHNMKTKVQNLPLWNDGRNHLIFNLYSGTWPDYTEDLGFDIGQAML 210

Query: 109 ACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ------IWPRQEDPPK 162
           A  SI     +  ++V +         +    G++ +   S+P       ++  +     
Sbjct: 211 AKASISTDNFRPNFDVSIPLFSKEHPRTGGERGYLKYN--SIPPFRKYMLVFKGKRYLTG 268

Query: 163 LGSSKRNKLAFFAGAVNSPVRE--KLLQVWRNDSEIYAHSGRL---KTPYADELLGSKFC 217
           +GS  RN L     A +  +    K  + W+   +           K  Y + L  S FC
Sbjct: 269 IGSDTRNALHHVHNAEDVVLLTTCKHGKDWQKHKDARCDKDNAEYDKYDYKEMLHNSTFC 328

Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILK 277
           L  +G  + + R  ++L   CVPV+++N ++LPF+++++W + +++         +++L 
Sbjct: 329 LVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEIIDWNTAAVIGD-------ERLLL 381

Query: 278 GISSEEYLLLQSYVLKVRKHFQ--WHVFPSDYDAFYMVMYEL 317
            I +  + + Q  +L +R+  Q  W  + S  +   +   E+
Sbjct: 382 QIPTTVHSIHQDKILSLRQQTQFLWEAYFSSVEKIVLTTLEI 423


>gi|58269252|ref|XP_571782.1| hypothetical protein [Cryptococcus neoformans var. neoformans JEC21]
 gi|57228018|gb|AAW44475.1| conserved hypothetical protein [Cryptococcus neoformans var.
            neoformans JEC21]
          Length = 1132

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 29/92 (31%), Positives = 52/92 (56%)

Query: 207  YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
            Y ++L  ++FC   +G    + +  D++Y GC+PV I+     PFAD L+W   S+ VA 
Sbjct: 1002 YMNDLNNARFCPQPRGITGWSPQTNDAIYAGCIPVFISEGTHYPFADFLDWSKLSVRVAP 1061

Query: 267  LDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
             ++  ++KIL  I   +   LQ+ ++ +R+ F
Sbjct: 1062 TELDKIEKILAAIPLSKVEELQANLVSMREAF 1093


>gi|134114275|ref|XP_774385.1| hypothetical protein CNBG3660 [Cryptococcus neoformans var.
            neoformans B-3501A]
 gi|50257020|gb|EAL19738.1| hypothetical protein CNBG3660 [Cryptococcus neoformans var.
            neoformans B-3501A]
          Length = 1132

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 29/92 (31%), Positives = 52/92 (56%)

Query: 207  YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
            Y ++L  ++FC   +G    + +  D++Y GC+PV I+     PFAD L+W   S+ VA 
Sbjct: 1002 YMNDLNNARFCPQPRGITGWSPQTNDAIYAGCIPVFISEGTHYPFADFLDWSKLSVRVAP 1061

Query: 267  LDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
             ++  ++KIL  I   +   LQ+ ++ +R+ F
Sbjct: 1062 TELDKIEKILAAIPLSKVEELQANLVSMREAF 1093


>gi|47224077|emb|CAG12906.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 743

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 74/338 (21%), Positives = 129/338 (38%), Gaps = 81/338 (23%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++                G   SESY     +   S F T DP +A LF L
Sbjct: 110 GFKVYVYPQQK----------------GEKMSESYQNILSSIEGSRFYTSDPEQACLFVL 153

Query: 62  PFSIARMRHDRRIGTEGIPDFISHYIFNIS---QKYPYWNRTGGADHFYVACHSIGRSAM 118
             S+  +  D+             Y+ N+    Q  P WN  GG +H     +S      
Sbjct: 154 --SLDTLDRDQ---------LSPQYVHNLKGRIQSLPLWN--GGKNHIIFNLYSGTWPDY 200

Query: 119 QKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK----------- 167
            +     +    ++  +S        + DVS+P      +D P+ G  +           
Sbjct: 201 TEDLGFDI-GFAMLAKASISTESFRPNFDVSIPLF---SKDHPRTGGERGYLRHNTIPPF 256

Query: 168 RNKLAFFAG-----AVNSPVREKLLQV-----------------WRNDSEIYAHSGRL-- 203
           R  +  F G      + S  R  L  V                 W+   +       +  
Sbjct: 257 RKYILVFKGKRYLTGIGSDTRNALYHVHNSEDVVLLTTCKHGKDWQKHKDARCDRDNMEY 316

Query: 204 -KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
            K  Y + L  S FCL  +G  + + R  ++L   CVPV+++N ++LPF+++++W + ++
Sbjct: 317 DKYDYKEMLYNSTFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEIIDWNTAAV 376

Query: 263 VVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
           +         +++L  I S    + Q  +L +R+  Q+
Sbjct: 377 IGD-------ERLLLQIPSTVRSIHQDQILSLRQQTQF 407


>gi|322790909|gb|EFZ15575.1| hypothetical protein SINV_11283 [Solenopsis invicta]
          Length = 676

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 27/82 (32%), Positives = 49/82 (59%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y D L  + FCL ++G  +  + + D++  GC+P IIA+   +PF DV++W   ++ +  
Sbjct: 269 YPDSLQTATFCLIIRGARLAQSVLLDAMAAGCIPAIIADSLMMPFHDVIDWTKAAVFIRE 328

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
           +DI L  ++LK IS +  + +Q
Sbjct: 329 VDILLTIQLLKKISHQRIMEMQ 350


>gi|242020116|ref|XP_002430502.1| Exostosin-2, putative [Pediculus humanus corporis]
 gi|212515659|gb|EEB17764.1| Exostosin-2, putative [Pediculus humanus corporis]
          Length = 696

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 73/302 (24%), Positives = 130/302 (43%), Gaps = 45/302 (14%)

Query: 5   FRVYVYPHRR--NDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFF 60
            +VYVYP +   N+   +V+  +  E         Y+K  KA + S + + +P +A    
Sbjct: 95  IQVYVYPLKTYINEKAEHVVPQMSLE---------YYKILKAIINSKYYSPNPDEA-CIL 144

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           +P SI  +  ++    E         +  +    P+W    G +H  +  + I  ++   
Sbjct: 145 VP-SIDTLNQNKLKLKE---------VSQVLASLPHWR--NGENH--LIWNFIPGTSPDY 190

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSSKRNKLAFF----- 174
           +  V+LN  + + + + F S    +  D+SLP   P  +    L  S+  K         
Sbjct: 191 STVVELNVGKSIIAGAGFSSWSFRYGFDISLPLFSPYAKSIKTLNKSQEKKWFVINSQAN 250

Query: 175 --------AGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVN 226
                    GA++  V+ +L +    D     H G +   Y   L  + FC+ ++G  + 
Sbjct: 251 LHSDYERELGALSGVVKLELCE--NGDPRFRCHQG-IAYNYPSVLQHATFCIIIRGARLA 307

Query: 227 TARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLL 286
              + +SL  GC+PVI A+   LPF DV++WK  SI +   D+  L + L  +S ++ L 
Sbjct: 308 QQALLESLSAGCIPVIAADLMVLPFQDVIDWKRASITILESDLSSLIEKLSSVSDDKKLE 367

Query: 287 LQ 288
           LQ
Sbjct: 368 LQ 369


>gi|348512501|ref|XP_003443781.1| PREDICTED: exostosin-1a-like [Oreochromis niloticus]
          Length = 743

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 75/357 (21%), Positives = 135/357 (37%), Gaps = 83/357 (23%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++                G   SESY     +   S F T DP +A LF L
Sbjct: 110 GFKVYVYPQQK----------------GEKLSESYQNILSSIEGSRFYTSDPGQACLFVL 153

Query: 62  PFSIARMRHDRRIGTEGIPDFISHYIFNIS---QKYPYWNRTGGADHFYVACHSIGRSAM 118
             S+  +  D+             Y+ N+    Q  P WN  GG +H     +S      
Sbjct: 154 --SLDTLDRDQ---------LSPQYVHNLKAKIQNLPLWN--GGKNHIIFNLYSGTWPDY 200

Query: 119 QKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK----------- 167
            +     +  + ++  +S          DVS+P      +D P+ G  K           
Sbjct: 201 TEDLGFDI-GLAMLAKASISTENFRPDFDVSIPLF---SKDHPRTGGEKGYLKYNTIPPY 256

Query: 168 RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGRL------------------- 203
           R  +  F G      + S  R  L  +  ++  +   + +                    
Sbjct: 257 RKYMLVFKGKRYLTGIGSDTRNALYHIHNSEDVVLLTTCKHGKDWQKHKDARCDKDNAEY 316

Query: 204 -KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
            K  Y + L  S FCL  +G  + + R  ++L   CVPV+++N ++LPF+++++W   ++
Sbjct: 317 DKYDYREMLYNSTFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEIIDWNRAAV 376

Query: 263 VVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQ--WHVFPSDYDAFYMVMYEL 317
           +         +++L  I S    + Q  +L +R+  Q  W  + +  +   +   E+
Sbjct: 377 IGD-------ERLLLQIPSTVRSIHQDKILSLRQQTQLLWEAYFNSVEKIVLTTLEI 426


>gi|327280286|ref|XP_003224883.1| PREDICTED: exostosin-1-like [Anolis carolinensis]
          Length = 753

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++ +  A                ESY     A   S F T DPS+A LF L
Sbjct: 120 GFKVYVYPQQKGEKIA----------------ESYQNILAAVEGSRFYTSDPSQACLFIL 163

Query: 62  PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
                       +  H+ R   +G+    +  +H +FN+ S  +P +    G D      
Sbjct: 164 SLDTLDRDQLSPQYVHNLRSKVQGLHLWNNGRNHLVFNLYSGTWPDYTEDVGFD------ 217

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
             IG + + KA              S        + DVS+P      +D P+ G  K   
Sbjct: 218 --IGHAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGEKGFL 258

Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
                   R  +  F G      + S  R  L  V   +  +   + +            
Sbjct: 259 RFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 318

Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
                    K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V
Sbjct: 319 CDRDNAEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 378

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
           ++W   +++     +  +   ++ I  ++ L      L+ +  F W  + S  +   +  
Sbjct: 379 IDWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 433

Query: 315 YEL 317
            E+
Sbjct: 434 LEI 436


>gi|363731065|ref|XP_003640902.1| PREDICTED: exostosin-1-like [Gallus gallus]
          Length = 740

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 78/363 (21%), Positives = 136/363 (37%), Gaps = 95/363 (26%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++ +  A                ESY     A   S F T DPS+A LF L
Sbjct: 107 GFKVYVYPQQKGEKIA----------------ESYQNVLAAIEGSRFYTSDPSQACLFVL 150

Query: 62  PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
                       +  H+ R   + +    +  +H IFN+ S  +P +    G D      
Sbjct: 151 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 204

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
             IG++ + KA              S        + DVS+P      +D P+ G  K   
Sbjct: 205 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGEKGFL 245

Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
                   R  +  F G      + S  R  L  V   +  +   + +            
Sbjct: 246 RFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 305

Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
                    K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V
Sbjct: 306 CDRDNAEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 365

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
           ++WK  +++     +  +   ++ I  ++ L      L+ +  F W  + S  +   +  
Sbjct: 366 IDWKQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 420

Query: 315 YEL 317
            E+
Sbjct: 421 LEI 423


>gi|347963135|ref|XP_311070.5| AGAP000081-PA [Anopheles gambiae str. PEST]
 gi|333467342|gb|EAA06302.6| AGAP000081-PA [Anopheles gambiae str. PEST]
          Length = 761

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 27/93 (29%), Positives = 55/93 (59%), Gaps = 1/93 (1%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L    FCL  +G  +    + +++  GC+PV++A++Y LPFAD+L+W+  ++ +  
Sbjct: 353 YPALLTTGTFCLVARGVRLGQPALLEAMAAGCIPVVMADNYVLPFADLLDWELLAVRLPE 412

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKV-RKHF 298
            ++  +  +L+ IS+E    +Q+ +  V R++F
Sbjct: 413 ANLHTIVPVLRAISAERVAEMQAQIRSVYRRYF 445


>gi|351696937|gb|EHA99855.1| Exostosin-2 [Heterocephalus glaber]
          Length = 717

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 31/85 (36%), Positives = 49/85 (57%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 307 YPQVLQEATFCVVLRGARLGQAALSDVLRAGCVPVVIADSYILPFSEVLDWKRASVVVPE 366

Query: 267 LDIPLLKKILKGISSEEYLLLQSYV 291
             I  +  +L+ I   +   +Q  V
Sbjct: 367 EKIADMYGVLQSIPRRQMEEMQRQV 391


>gi|348538380|ref|XP_003456670.1| PREDICTED: exostosin-1a-like [Oreochromis niloticus]
          Length = 776

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 80/357 (22%), Positives = 144/357 (40%), Gaps = 83/357 (23%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFL 61
            F+VY+YP +R                G+  SE+Y K   +  +S F T DP +A LF L
Sbjct: 142 GFKVYIYPQQR----------------GSEISETYKKILTSIEESRFHTTDPLRACLFIL 185

Query: 62  PFSIARMRHDR---------RIGTEGIP---DFISHYIFNI-SQKYPYWNRTGGADHFYV 108
             ++  +  D+         R   + +P   D  +H IFN+ S  +P +    G +    
Sbjct: 186 --AVDTLDRDQLSVQYVQNIRSRIQNLPTWNDGRNHLIFNLYSGSWPDYTEDLGFE---- 239

Query: 109 ACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ----IWPRQEDPPKLG 164
               +G++ + KA     +A  V   S+Y IS  +  KD  L           + PP   
Sbjct: 240 ----VGQAMLAKA-----SADVVNFRSNYDISIPLFSKDHPLKGGGIGYLTLNDAPP--- 287

Query: 165 SSKRNKLAF----FAGAVNSPVREKLLQVWRNDSEIYAHSGR------------------ 202
            S++ +L F    +   + S  R  L  +   +  I   + +                  
Sbjct: 288 -SRKYQLVFKGKRYLTGIGSETRNALYHIHNGEDIILLTTCKHGKDWEKHKDSRCDRDNE 346

Query: 203 --LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSF 260
              K  Y + L  S FCL  +G  + + R  ++L   C+PVI++N ++LPF++V++W+  
Sbjct: 347 DYSKFDYQELLHNSTFCLVPRGRRLGSFRFLEALQAACIPVILSNGWELPFSEVIDWRKA 406

Query: 261 SIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
           +I+     +  +  I + +  +  L      L+ +  F W  + S      +   E+
Sbjct: 407 AIIGDERLLLQVPSITRSVGRDRIL-----ALRQQTQFLWDAYFSSVAKIVLTTLEI 458


>gi|255633862|gb|ACU17292.1| unknown [Glycine max]
          Length = 141

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 37/106 (34%), Positives = 54/106 (50%), Gaps = 6/106 (5%)

Query: 216 FCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKI 275
           FCL   G+   + R+ +++ +GC+PVIIA+   LPFAD + W+   + V   D+P L  I
Sbjct: 6   FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVDEKDVPQLDTI 65

Query: 276 LKGISSEEYL----LLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
           L  I  E  L    LL +  +K    F     P   DAF+ V+  L
Sbjct: 66  LTSIPPEVILRKQRLLANPFMKQAMLFPQPAQPG--DAFHQVLNGL 109


>gi|344281096|ref|XP_003412316.1| PREDICTED: LOW QUALITY PROTEIN: exostosin-2-like [Loxodonta
           africana]
          Length = 728

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 31/85 (36%), Positives = 49/85 (57%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQSYV 291
             +  +  IL+ I   +   +Q  V
Sbjct: 368 EKMSDMYSILQSIPQRQIEEMQRQV 392


>gi|168012366|ref|XP_001758873.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162690010|gb|EDQ76379.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 426

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 54/221 (24%), Positives = 102/221 (46%), Gaps = 17/221 (7%)

Query: 97  WNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSS-SYFISGHIA-HKDVSLPQIW 154
           W R+GG DH     H     + ++  +  +  +  + SS +++  G ++  KDV +P + 
Sbjct: 150 WQRSGGRDHVLAVHHPWSMKSHRRFLKSAIWLLSDLDSSGNWYKEGEVSLEKDVIMPYVA 209

Query: 155 PRQE-DPPKLGSSK--RNKLAFFAGAV----NSPVREKLLQVWRNDSE-IYAHSG----R 202
                D   L +SK  R  L FF G +       VR +L  V R++ E I    G     
Sbjct: 210 NVDACDDNCLATSKPSRKTLLFFQGRIVRGSAGKVRSRLAAVLRDEKERIVFQEGFSGAE 269

Query: 203 LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
            K      +  S FCL   G   ++AR+ D++  GC+PV++++  +LPF  +L+++  ++
Sbjct: 270 GKATAQHGMRSSVFCLSPAGDTPSSARLFDAIVSGCIPVVVSDELELPFEGILDYRQVAL 329

Query: 263 VVATLDIPL---LKKILKGISSEEYLLLQSYVLKVRKHFQW 300
            V          L   L+  + ++   +Q  + +  +HF++
Sbjct: 330 FVPAARAAQKGWLVAHLRNKTPQDVAAMQQRLAQYGRHFRY 370


>gi|355686831|gb|AER98199.1| exostoses 1 [Mustela putorius furo]
          Length = 702

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++                G   +ESY     A   S F T DPS+A LF L
Sbjct: 69  GFKVYVYPQQK----------------GEKIAESYQNILAAIEGSRFYTSDPSQACLFVL 112

Query: 62  PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
                       +  H+ R   + +    +  +H IFN+ S  +P +    G D      
Sbjct: 113 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 166

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
             IG++ + KA              S        + DVS+P      +D P+ G  +   
Sbjct: 167 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 207

Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
                   R  +  F G      + S  R  L  V   +  +   + +            
Sbjct: 208 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 267

Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
                    K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V
Sbjct: 268 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 327

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
           +NW   +++     +  +   ++ I  ++ L      L+ +  F W  + S  +   +  
Sbjct: 328 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 382

Query: 315 YEL 317
            E+
Sbjct: 383 LEI 385


>gi|166183792|gb|ABY84155.1| exostosin 1 (predicted) [Callithrix jacchus]
          Length = 746

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++ +  A                ESY     A   S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156

Query: 62  PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
                       +  H+ R   + +    +  +H IFN+ S  +P +    G D      
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
             IG++ + KA              S        + DVS+P      +D P+ G  +   
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251

Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
                   R  +  F G      + S  R  L  V   +  +   + +            
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 311

Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
                    K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
           +NW   +++     +  +   ++ I  ++ L      L+ +  F W  + S  +   +  
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426

Query: 315 YEL 317
            E+
Sbjct: 427 LEI 429


>gi|147902262|ref|NP_001091564.1| exostosin-1 [Bos taurus]
 gi|238686639|sp|A5D7I4.1|EXT1_BOVIN RecName: Full=Exostosin-1; AltName:
           Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
           acetylglucosaminyl-proteoglycan
           4-alpha-N-acetylglucosaminyltransferase; AltName:
           Full=Multiple exostoses protein 1 homolog
 gi|146186820|gb|AAI40569.1| EXT1 protein [Bos taurus]
 gi|296480555|tpg|DAA22670.1| TPA: exostosin-1 [Bos taurus]
          Length = 746

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++ +  A                ESY     A   S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156

Query: 62  PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
                       +  H+ R   + +    +  +H IFN+ S  +P +    G D      
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
             IG++ + KA              S        + DVS+P      +D P+ G  +   
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251

Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
                   R  +  F G      + S  R  L  V   +  +   + +            
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 311

Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
                    K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
           +NW   +++     +  +   ++ I  ++ L      L+ +  F W  + S  +   +  
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426

Query: 315 YEL 317
            E+
Sbjct: 427 LEI 429


>gi|190402228|gb|ACE77644.1| exostosin 1 (predicted) [Sorex araneus]
          Length = 746

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++ +  A                ESY     A   S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156

Query: 62  PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
                       +  H+ R   + +    +  +H IFN+ S  +P +    G D      
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
             IG++ + KA              S        H DVS+P      ++ P+ G  +   
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPHFDVSIPLF---SKEHPRTGGERGFL 251

Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
                   R  +  F G      + S  R  L  V   +  +   + +            
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDILLLTTCKHGKDWQKHKDSR 311

Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
                    K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
           +NW   +++     +  +   ++ I  ++ L      L+ +  F W  + S  +   +  
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426

Query: 315 YEL 317
            E+
Sbjct: 427 LEI 429


>gi|403283504|ref|XP_003933159.1| PREDICTED: exostosin-1 [Saimiri boliviensis boliviensis]
          Length = 746

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++ +  A                ESY     A   S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156

Query: 62  PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
                       +  H+ R   + +    +  +H IFN+ S  +P +    G D      
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
             IG++ + KA              S        + DVS+P      +D P+ G  +   
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251

Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
                   R  +  F G      + S  R  L  V   +  +   + +            
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 311

Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
                    K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
           +NW   +++     +  +   ++ I  ++ L      L+ +  F W  + S  +   +  
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426

Query: 315 YEL 317
            E+
Sbjct: 427 LEI 429


>gi|73974379|ref|XP_539145.2| PREDICTED: exostosin-1 isoform 1 [Canis lupus familiaris]
          Length = 746

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++ +  A                ESY     A   S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156

Query: 62  PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
                       +  H+ R   + +    +  +H IFN+ S  +P +    G D      
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
             IG++ + KA              S        + DVS+P      +D P+ G  +   
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251

Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
                   R  +  F G      + S  R  L  V   +  +   + +            
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 311

Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
                    K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
           +NW   +++     +  +   ++ I  ++ L      L+ +  F W  + S  +   +  
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426

Query: 315 YEL 317
            E+
Sbjct: 427 LEI 429


>gi|301774588|ref|XP_002922714.1| PREDICTED: exostosin-1-like [Ailuropoda melanoleuca]
          Length = 746

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++ +  A                ESY     A   S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156

Query: 62  PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
                       +  H+ R   + +    +  +H IFN+ S  +P +    G D      
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
             IG++ + KA              S        + DVS+P      +D P+ G  +   
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251

Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
                   R  +  F G      + S  R  L  V   +  +   + +            
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 311

Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
                    K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
           +NW   +++     +  +   ++ I  ++ L      L+ +  F W  + S  +   +  
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426

Query: 315 YEL 317
            E+
Sbjct: 427 LEI 429


>gi|388503508|gb|AFK39820.1| unknown [Lotus japonicus]
          Length = 141

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 55/105 (52%), Gaps = 4/105 (3%)

Query: 216 FCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKI 275
           FCL   G+   + R+ +++ +GC+PVI+A+   LPFAD + W+   + V   D+P L  I
Sbjct: 6   FCLCPLGWAPWSPRLVEAVVFGCIPVIVADDIVLPFADAIPWEDIGVFVDEEDVPKLDTI 65

Query: 276 LKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---DAFYMVMYEL 317
           L  I   E +L +  +L      Q  +FP      DAF+ V+  L
Sbjct: 66  LTSIPP-EIILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVLNGL 109


>gi|46370066|ref|NP_000118.2| exostosin-1 [Homo sapiens]
 gi|20141422|sp|Q16394.2|EXT1_HUMAN RecName: Full=Exostosin-1; AltName:
           Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
           acetylglucosaminyl-proteoglycan
           4-alpha-N-acetylglucosaminyltransferase; AltName:
           Full=Multiple exostoses protein 1; AltName:
           Full=Putative tumor suppressor protein EXT1
 gi|12654671|gb|AAH01174.1| Exostoses (multiple) 1 [Homo sapiens]
 gi|119612378|gb|EAW91972.1| exostoses (multiple) 1 [Homo sapiens]
 gi|123981930|gb|ABM82794.1| exostoses (multiple) 1 [synthetic construct]
 gi|123996761|gb|ABM85982.1| exostoses (multiple) 1 [synthetic construct]
 gi|189053697|dbj|BAG35949.1| unnamed protein product [Homo sapiens]
 gi|261859910|dbj|BAI46477.1| exostoses (multiple) 1 [synthetic construct]
          Length = 746

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++ +  A                ESY     A   S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156

Query: 62  PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
                       +  H+ R   + +    +  +H IFN+ S  +P +    G D      
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
             IG++ + KA              S        + DVS+P      +D P+ G  +   
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251

Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
                   R  +  F G      + S  R  L  V   +  +   + +            
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 311

Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
                    K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
           +NW   +++     +  +   ++ I  ++ L      L+ +  F W  + S  +   +  
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426

Query: 315 YEL 317
            E+
Sbjct: 427 LEI 429


>gi|281183009|ref|NP_001162444.1| exostosin-1 [Papio anubis]
 gi|384475833|ref|NP_001245062.1| exostosin 1 [Macaca mulatta]
 gi|114621433|ref|XP_001141496.1| PREDICTED: exostosin-1 isoform 1 [Pan troglodytes]
 gi|397505670|ref|XP_003823375.1| PREDICTED: exostosin-1 [Pan paniscus]
 gi|238687365|sp|A9X1C8.1|EXT1_PAPAN RecName: Full=Exostosin-1; AltName:
           Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
           acetylglucosaminyl-proteoglycan
           4-alpha-N-acetylglucosaminyltransferase; AltName:
           Full=Multiple exostoses protein 1 homolog
 gi|163781066|gb|ABY40823.1| exostoses 1 (predicted) [Papio anubis]
 gi|355698182|gb|EHH28730.1| Exostosin-1 [Macaca mulatta]
 gi|383410151|gb|AFH28289.1| exostosin-1 [Macaca mulatta]
 gi|387541472|gb|AFJ71363.1| exostosin-1 [Macaca mulatta]
 gi|410224816|gb|JAA09627.1| exostosin 1 [Pan troglodytes]
 gi|410259738|gb|JAA17835.1| exostosin 1 [Pan troglodytes]
 gi|410302704|gb|JAA29952.1| exostosin 1 [Pan troglodytes]
 gi|410353629|gb|JAA43418.1| exostosin 1 [Pan troglodytes]
          Length = 746

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++ +  A                ESY     A   S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156

Query: 62  PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
                       +  H+ R   + +    +  +H IFN+ S  +P +    G D      
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
             IG++ + KA              S        + DVS+P      +D P+ G  +   
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251

Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
                   R  +  F G      + S  R  L  V   +  +   + +            
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 311

Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
                    K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
           +NW   +++     +  +   ++ I  ++ L      L+ +  F W  + S  +   +  
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426

Query: 315 YEL 317
            E+
Sbjct: 427 LEI 429


>gi|1168162|gb|AAB62283.1| putative tumour suppressor/hereditary multiple exostoses candidate
           gene [Homo sapiens]
 gi|1586817|prf||2204384A EXT1 gene
          Length = 746

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++ +  A                ESY     A   S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156

Query: 62  PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
                       +  H+ R   + +    +  +H IFN+ S  +P +    G D      
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
             IG++ + KA              S        + DVS+P      +D P+ G  +   
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251

Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
                   R  +  F G      + S  R  L  V   +  +   + +            
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 311

Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
                    K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
           +NW   +++     +  +   ++ I  ++ L      L+ +  F W  + S  +   +  
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426

Query: 315 YEL 317
            E+
Sbjct: 427 LEI 429


>gi|426235700|ref|XP_004011818.1| PREDICTED: exostosin-1 [Ovis aries]
          Length = 746

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++ +  A                ESY     A   S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156

Query: 62  PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
                       +  H+ R   + +    +  +H IFN+ S  +P +    G D      
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
             IG++ + KA              S        + DVS+P      +D P+ G  +   
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251

Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
                   R  +  F G      + S  R  L  V   +  +   + +            
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 311

Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
                    K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
           +NW   +++     +  +   ++ I  ++ L      L+ +  F W  + S  +   +  
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426

Query: 315 YEL 317
            E+
Sbjct: 427 LEI 429


>gi|348588241|ref|XP_003479875.1| PREDICTED: exostosin-1 [Cavia porcellus]
          Length = 746

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++ +  A                ESY     A   S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156

Query: 62  PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
                       +  H+ R   + +    +  +H IFN+ S  +P +    G D      
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
             IG++ + KA              S        + DVS+P      +D P+ G  +   
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251

Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
                   R  +  F G      + S  R  L  V   +  +   + +            
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVLLLTTCKHGKDWQKHKDSR 311

Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
                    K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
           +NW   +++     +  +   ++ I  ++ L      L+ +  F W  + S  +   +  
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426

Query: 315 YEL 317
            E+
Sbjct: 427 LEI 429


>gi|395818023|ref|XP_003782438.1| PREDICTED: exostosin-1 [Otolemur garnettii]
 gi|197215619|gb|ACH53015.1| exostosin 1 (predicted) [Otolemur garnettii]
          Length = 746

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 78/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++ +  A                ESY     A   S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156

Query: 62  PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
                       +  H+ R   + +    +  +H IFN+ S  +P +    G D      
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
             IG++ + KA              S        + DVS+P      +D P+ G  +   
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251

Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQV-----------------WRNDSEIY 197
                   R  +  F G      + S  R  L  V                 W+   +  
Sbjct: 252 RFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 311

Query: 198 AHSGRL---KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
                +   K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V
Sbjct: 312 CDRDNMEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
           +NW   +++     +  +   ++ I  ++ L      L+ +  F W  + S  +   +  
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426

Query: 315 YEL 317
            E+
Sbjct: 427 LEI 429


>gi|348519387|ref|XP_003447212.1| PREDICTED: exostosin-1b-like [Oreochromis niloticus]
          Length = 740

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 77/342 (22%), Positives = 139/342 (40%), Gaps = 53/342 (15%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++                G   SESY         S F T DP +A LF L
Sbjct: 107 GFKVYVYPQQK----------------GEKISESYQNILSTIEGSRFYTSDPGQACLFVL 150

Query: 62  PFSI-------ARMRHDRRIGTEGIP---DFISHYIFNI-SQKYPYWNRTGGAD--HFYV 108
                       +  H+ +   + +P   D  +H IFN+ S  +P +    G D     +
Sbjct: 151 SLDTLDRDQLSPQYVHNLKTKVQNLPLWNDGRNHLIFNLYSGTWPDYTEDLGFDIGQAML 210

Query: 109 ACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ------IWPRQEDPPK 162
           A  SI     +  ++V +         +    G++ +   S+P       ++  +     
Sbjct: 211 AKASISTENFRPNFDVSIPLFSKEHPRTGGERGYLKYN--SIPPFRKYMLVFKGKRYLTG 268

Query: 163 LGSSKRNKLAFFAGAVNSPVRE--KLLQVWRNDSEIYAHSGRL---KTPYADELLGSKFC 217
           +GS  RN L     A +  +    K  + W+   +           K  Y + L  S FC
Sbjct: 269 IGSDTRNALYHVHNAEDVVLLTTCKHGKDWQKHKDARCDKDNAEYDKYDYKEMLHNSTFC 328

Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILK 277
           L  +G  + + R  ++L   CVPV+++N ++LPF+++++W + +++    D  LL +I  
Sbjct: 329 LVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEIIDWNTAAVIG---DERLLLQIPT 385

Query: 278 GISSEEYLLLQSYVLKVRKHFQ--WHVFPSDYDAFYMVMYEL 317
            + S    + Q  +L +R+  Q  W  + S  +   +   E+
Sbjct: 386 TVRS----IHQDKILSLRQQTQFLWEAYFSSVEKIVLTTLEI 423


>gi|431915722|gb|ELK16055.1| Exostosin-2 [Pteropus alecto]
          Length = 1849

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 33/94 (35%), Positives = 53/94 (56%), Gaps = 7/94 (7%)

Query: 207  YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
            Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 1439 YPQVLQEATFCVVLRGARLGQAVLSDVLRAGCVPVVIADSYILPFSEVLDWKRASVVVPE 1498

Query: 267  LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
              I  +  IL+ I        Q  + +++K  +W
Sbjct: 1499 EKISDVYSILQSIP-------QRQIEEMQKQARW 1525


>gi|350537983|ref|NP_001233696.1| exostosin-1 [Cricetulus griseus]
 gi|20138355|sp|Q9JK82.1|EXT1_CRIGR RecName: Full=Exostosin-1; AltName: Full=Heparan sulfate
           copolymerase; AltName: Full=Multiple exostoses protein 1
           homolog
 gi|7960285|gb|AAF71276.1|AF252858_1 exostosin 1 [Cricetulus griseus]
          Length = 746

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++ +  A                ESY     A   S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156

Query: 62  PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
                       +  H+ R   + +    +  +H IFN+ S  +P +    G D      
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
             IG++ + KA              S        + DVS+P      +D P+ G  +   
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251

Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
                   R  +  F G      + S  R  L  V   +  +   + +            
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVLLLTTCKHGKDWQKHKDSR 311

Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
                    K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
           +NW   +++     +  +   ++ I  ++ L      L+ +  F W  + S  +   +  
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426

Query: 315 YEL 317
            E+
Sbjct: 427 LEI 429


>gi|19909908|dbj|BAB87180.1| XEXT1 [Xenopus laevis]
          Length = 735

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 71/359 (19%), Positives = 131/359 (36%), Gaps = 87/359 (24%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPF 63
            F+VY+YP ++ D  ++              S  +   A   S F T DPS+A +F L  
Sbjct: 102 GFKVYIYPQQKGDKLSD--------------SYQHILAAIETSRFYTSDPSQACVFVL-- 145

Query: 64  SIARMRHDRRIGTEGIPDFISHYIFNISQKYP---YWNRTGGADHFYVACHSIGRSAMQK 120
           S+  +  D+            HY+ N+  K      WN   G +H     +S        
Sbjct: 146 SLDTLDRDQ---------LSPHYVHNLKSKVQNLHLWN--NGRNHLIFNLYS-------G 187

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHK------DVSLPQIWPRQEDPPKLGSSK------- 167
            W      +      +      I+ +      DVS+P      +D P+ G  K       
Sbjct: 188 TWPDYTEDVGFDIGQAMLAKASISTENFRPNFDVSIPLF---SKDHPRTGGDKGFLRFNN 244

Query: 168 ----RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR---------------- 202
               R  +  F G      + S  R  L  V   +  +   + +                
Sbjct: 245 IPPMRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDARCDKD 304

Query: 203 ----LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWK 258
                K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V++W 
Sbjct: 305 NAEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWN 364

Query: 259 SFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
             +++     +  +   ++ I  ++ L      L+ +  F W  + S  +   +   E+
Sbjct: 365 QAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTTLEI 418


>gi|50416406|gb|AAH77234.1| XEXT1 protein [Xenopus laevis]
          Length = 735

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 71/359 (19%), Positives = 131/359 (36%), Gaps = 87/359 (24%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPF 63
            F+VY+YP ++ D  ++              S  +   A   S F T DPS+A +F L  
Sbjct: 102 GFKVYIYPQQKGDKLSD--------------SYQHILAAIETSRFYTSDPSQACVFVL-- 145

Query: 64  SIARMRHDRRIGTEGIPDFISHYIFNISQKYP---YWNRTGGADHFYVACHSIGRSAMQK 120
           S+  +  D+            HY+ N+  K      WN   G +H     +S        
Sbjct: 146 SLDTLDRDQ---------LSPHYVHNLKSKVQNLHLWN--NGRNHLIFNLYS-------G 187

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHK------DVSLPQIWPRQEDPPKLGSSK------- 167
            W      +      +      I+ +      DVS+P      +D P+ G  K       
Sbjct: 188 TWPDYTEDVGFDIGQAMLAKASISTENFRPNFDVSIPLF---SKDHPRTGGDKGFLRFNN 244

Query: 168 ----RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR---------------- 202
               R  +  F G      + S  R  L  V   +  +   + +                
Sbjct: 245 IPPMRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDARCDKD 304

Query: 203 ----LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWK 258
                K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V++W 
Sbjct: 305 NAEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWN 364

Query: 259 SFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
             +++     +  +   ++ I  ++ L      L+ +  F W  + S  +   +   E+
Sbjct: 365 QAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTTLEI 418


>gi|221041480|dbj|BAH12417.1| unnamed protein product [Homo sapiens]
          Length = 337

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 189 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 248

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 249 EKMSDVYSILQSIPQRQIEEMQ 270


>gi|156378657|ref|XP_001631258.1| predicted protein [Nematostella vectensis]
 gi|156218295|gb|EDO39195.1| predicted protein [Nematostella vectensis]
          Length = 563

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 51/88 (57%), Gaps = 5/88 (5%)

Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLK 273
           + FCL ++G  +    + DSL  GC+P+++++ Y LPF++VL+WK  ++VV+  +I  + 
Sbjct: 237 ATFCLVIRGARLGQTALLDSLMMGCIPIVVSDDYILPFSEVLDWKRAAVVVSENEIDRIP 296

Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWH 301
            ILK  S       Q   ++++  F W 
Sbjct: 297 LILKDYSQN-----QIKDMRLQGKFMWE 319


>gi|440897880|gb|ELR49485.1| Exostosin-2, partial [Bos grunniens mutus]
          Length = 731

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 31/82 (37%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPVIIA+ Y LPF++VL+WK  S+VV  
Sbjct: 321 YPQVLQEATFCMVLRGARLGQAVLSDVLRAGCVPVIIADSYVLPFSEVLDWKRASVVVPE 380

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 381 EKMSDVYSILQSIPRRQIEEMQ 402


>gi|354491358|ref|XP_003507822.1| PREDICTED: exostosin-2 [Cricetulus griseus]
          Length = 718

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 368 EKMSDMYSILQNIPQRQIEEMQ 389


>gi|117935049|ref|NP_803462.2| exostosin-2 [Bos taurus]
 gi|117306659|gb|AAI26571.1| Exostoses (multiple) 2 [Bos taurus]
 gi|296479641|tpg|DAA21756.1| TPA: exostosin-2 [Bos taurus]
          Length = 718

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 31/82 (37%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPVIIA+ Y LPF++VL+WK  S+VV  
Sbjct: 308 YPQVLQEATFCMVLRGARLGQAVLSDVLRAGCVPVIIADSYVLPFSEVLDWKRASVVVPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 368 EKMSDVYSILQSIPRRQIEEMQ 389


>gi|338711993|ref|XP_001489915.2| PREDICTED: LOW QUALITY PROTEIN: exostosin-2 [Equus caballus]
          Length = 728

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 50/87 (57%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 308 YPQVLQEATFCMVLRGARLGQAVLSDVLRAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLK 293
             +  +  IL+ I   +   +Q  + +
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQRQLFQ 394


>gi|149721624|ref|XP_001496484.1| PREDICTED: exostosin-1 [Equus caballus]
          Length = 746

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++ +  A                ESY     A   S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156

Query: 62  PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
                       +  H+ R   + +    +  +H IFN+ S  +P +    G D      
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
             IG++ + KA              S        + DVS+P      +D P+ G  +   
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251

Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
                   R  +  F G      + S  R  L  V   +  +   + +            
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVLLLTTCKHGKDWQKHKDSR 311

Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
                    K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
           +NW   +++     +  +   ++ I  ++ L      L+ +  F W  + S  +   +  
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426

Query: 315 YEL 317
            E+
Sbjct: 427 LEI 429


>gi|20138161|sp|O77783.1|EXT2_BOVIN RecName: Full=Exostosin-2; AltName:
           Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
           acetylglucosaminyl-proteoglycan
           4-alpha-N-acetylglucosaminyltransferase; AltName:
           Full=HS-polymerase; Short=HS-POL; AltName: Full=Multiple
           exostoses protein 2 homolog
 gi|3599568|gb|AAC35386.1| glucuronyl/N-acetylglucosaminyl transferase [Bos taurus]
          Length = 718

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 31/82 (37%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPVIIA+ Y LPF++VL+WK  S+VV  
Sbjct: 308 YPQVLQEATFCMVLRGARLGQAVLSDVLRAGCVPVIIADSYVLPFSEVLDWKRASVVVPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 368 EKMSDVYSILQSIPRRQIEEMQ 389


>gi|148234643|ref|NP_001083782.1| exostosin 1 [Xenopus laevis]
 gi|62871603|gb|AAH94398.1| XEXT1 protein [Xenopus laevis]
          Length = 738

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 71/359 (19%), Positives = 131/359 (36%), Gaps = 87/359 (24%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPF 63
            F+VY+YP ++ D  ++              S  +   A   S F T DPS+A +F L  
Sbjct: 105 GFKVYIYPQQKGDKLSD--------------SYQHILAAIETSRFYTSDPSQACVFVL-- 148

Query: 64  SIARMRHDRRIGTEGIPDFISHYIFNISQKYP---YWNRTGGADHFYVACHSIGRSAMQK 120
           S+  +  D+            HY+ N+  K      WN   G +H     +S        
Sbjct: 149 SLDTLDRDQ---------LSPHYVHNLKSKVQNLHLWN--NGRNHLIFNLYS-------G 190

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHK------DVSLPQIWPRQEDPPKLGSSK------- 167
            W      +      +      I+ +      DVS+P      +D P+ G  K       
Sbjct: 191 TWPDYTEDVGFDIGQAMLAKASISTENFRPNFDVSIPLF---SKDHPRTGGDKGFLRFNN 247

Query: 168 ----RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR---------------- 202
               R  +  F G      + S  R  L  V   +  +   + +                
Sbjct: 248 IPPMRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDARCDKD 307

Query: 203 ----LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWK 258
                K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V++W 
Sbjct: 308 NAEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWN 367

Query: 259 SFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
             +++     +  +   ++ I  ++ L      L+ +  F W  + S  +   +   E+
Sbjct: 368 QAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTTLEI 421


>gi|344257014|gb|EGW13118.1| Exostosin-2 [Cricetulus griseus]
          Length = 701

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 291 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 350

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 351 EKMSDMYSILQNIPQRQIEEMQ 372


>gi|291388448|ref|XP_002710791.1| PREDICTED: exostosin 1 [Oryctolagus cuniculus]
          Length = 746

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++ +  A                ESY     A   S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156

Query: 62  PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
                       +  H+ R   + +    +  +H IFN+ S  +P +    G D      
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
             IG++ + KA              S        + DVS+P      +D P+ G  +   
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251

Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
                   R  +  F G      + S  R  L  V   +  +   + +            
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVLLLTTCKHGKDWQKHKDSR 311

Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
                    K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
           +NW   +++     +  +   ++ I  ++ L      L+ +  F W  + S  +   +  
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426

Query: 315 YEL 317
            E+
Sbjct: 427 LEI 429


>gi|183637125|gb|ACC64545.1| exostosin 1 (predicted) [Rhinolophus ferrumequinum]
          Length = 746

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++ +  A                ESY     A   S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156

Query: 62  PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
                       +  H+ R   + +    +  +H IFN+ S  +P +    G D      
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
             IG++ + KA              S        + DVS+P      +D P+ G  +   
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251

Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
                   R  +  F G      + S  R  L  V   +  +   + +            
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVLLLTTCKHGKDWQKHKDSR 311

Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
                    K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
           +NW   +++     +  +   ++ I  ++ L      L+ +  F W  + S  +   +  
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426

Query: 315 YEL 317
            E+
Sbjct: 427 LEI 429


>gi|94536924|ref|NP_001035420.1| exostosin-1c [Danio rerio]
 gi|92098257|gb|AAI15217.1| Exostoses (multiple) 1c [Danio rerio]
          Length = 737

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 82/357 (22%), Positives = 135/357 (37%), Gaps = 83/357 (23%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFL 61
            FRVY+YP  +                G   SESY K   +  +S + T DP +A LF L
Sbjct: 105 GFRVYIYPPEK----------------GERVSESYRKILTSVSESRYYTSDPREACLFVL 148

Query: 62  PFSIARMRHDRRIGTEGIPDFISHYIFNISQK---YPYWNRTGGADHFYVACHSIGRSAM 118
              I  +  D+             ++ N+ ++   YP WN   G +H     +S      
Sbjct: 149 --GIDTLDRDQ---------LSQQFVPNVDERIRGYPLWN--DGRNHVIFNLYSGTWPNY 195

Query: 119 QKAWEVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSS----------K 167
            +  ++  N  Q + + +   + H     D+S+P     +E P K G            +
Sbjct: 196 TE--DLGFNVGQAILAKASLNTEHFRPGFDISIPLF--SKEHPQKGGKRGWLVRNSVPPR 251

Query: 168 RNKLAFFAG-----AVNSPVREKL-----------LQVWRNDSEIYAHSGRL-------- 203
           R  L  F G      + S  R  L           L   R+  +   H            
Sbjct: 252 RKYLLMFKGKRYLTGIGSDTRNALHHIHNGKDIVSLTTCRHGKDWEKHKDARCDHDNQEY 311

Query: 204 -KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
            +  Y + L  S FCL  +G  + + R  +SL   C+PV+++N ++LPF+DV+ W     
Sbjct: 312 ERFDYQELLHNSTFCLVPRGRRLGSFRFLESLQAACIPVLLSNGWELPFSDVIQWNQ--- 368

Query: 263 VVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQ--WHVFPSDYDAFYMVMYEL 317
            V   D    +++L  + S    +    VL +R+  Q  W  + S  D   +   E+
Sbjct: 369 AVVEGD----ERLLLQVPSTVRAVGMDRVLALRQQTQTLWDAYFSSVDKIVLTTLEI 421


>gi|1235559|emb|CAA65443.1| ext1 [Mus musculus]
          Length = 745

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 78/363 (21%), Positives = 136/363 (37%), Gaps = 95/363 (26%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++                G   +ESY     A   S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQK----------------GEKIAESYQNILAAIEGSRFYTSDPSQACLFVL 156

Query: 62  PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
                       +  H+ R   + +    +  +H IFN+ S  +P +    G D      
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
             IG++ + KA            S+  F        DVS+P      +D P+ G  +   
Sbjct: 211 --IGQAMLAKA----------TISTENFRPNF----DVSIPLF---SKDHPRTGGERGFL 251

Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
                   R  +  F G      + S  R  L  V   +  +   + +            
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVLLLTTCKHGKDWQKHKDSR 311

Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
                    K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
           +NW   +++     +  +   ++ I  ++ L      L+ +  F W  + S  +   +  
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426

Query: 315 YEL 317
            E+
Sbjct: 427 LEI 429


>gi|112807209|ref|NP_034292.2| exostosin-1 [Mus musculus]
 gi|283837898|ref|NP_001124012.1| exostosin 1 [Rattus norvegicus]
 gi|3023731|sp|P97464.1|EXT1_MOUSE RecName: Full=Exostosin-1; AltName:
           Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
           acetylglucosaminyl-proteoglycan
           4-alpha-N-acetylglucosaminyltransferase; AltName:
           Full=Multiple exostoses protein 1 homolog
 gi|1813640|gb|AAB41728.1| Ext1 [Mus musculus]
 gi|13435765|gb|AAH04741.1| Exostoses (multiple) 1 [Mus musculus]
 gi|74205633|dbj|BAE21106.1| unnamed protein product [Mus musculus]
 gi|148697307|gb|EDL29254.1| exostoses (multiple) 1, isoform CRA_a [Mus musculus]
 gi|149066396|gb|EDM16269.1| similar to Ext1 [Rattus norvegicus]
          Length = 746

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++ +  A                ESY     A   S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156

Query: 62  PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
                       +  H+ R   + +    +  +H IFN+ S  +P +    G D      
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
             IG++ + KA              S        + DVS+P      +D P+ G  +   
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251

Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
                   R  +  F G      + S  R  L  V   +  +   + +            
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVLLLTTCKHGKDWQKHKDSR 311

Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
                    K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
           +NW   +++     +  +   ++ I  ++ L      L+ +  F W  + S  +   +  
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426

Query: 315 YEL 317
            E+
Sbjct: 427 LEI 429


>gi|148697308|gb|EDL29255.1| exostoses (multiple) 1, isoform CRA_b [Mus musculus]
          Length = 566

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++ +  A                ESY     A   S F T DPS+A LF L
Sbjct: 127 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 170

Query: 62  PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
                       +  H+ R   + +    +  +H IFN+ S  +P +    G D      
Sbjct: 171 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 224

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
             IG++ + KA              S        + DVS+P      +D P+ G  +   
Sbjct: 225 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 265

Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
                   R  +  F G      + S  R  L  V   +  +   + +            
Sbjct: 266 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVLLLTTCKHGKDWQKHKDSR 325

Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
                    K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V
Sbjct: 326 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 385

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
           +NW   +++     +  +   ++ I  ++ L      L+ +  F W  + S  +   +  
Sbjct: 386 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 440

Query: 315 YEL 317
            E+
Sbjct: 441 LEI 443


>gi|302845050|ref|XP_002954064.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
 gi|300260563|gb|EFJ44781.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
          Length = 1122

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 68/277 (24%), Positives = 108/277 (38%), Gaps = 40/277 (14%)

Query: 39  FKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWN 98
           F +  M +   T D  +AD +F+P +           TE  P  +   +  I + YP+W+
Sbjct: 637 FWQRLMSAGIRTVDGDEADYYFIPVNTR---------TELAPGMVEWVLSYIRRTYPWWS 687

Query: 99  RTGGADHFYVACHSIG--------RSAMQKA-----WEVKLNAIQVVCSSSYFISGHIAH 145
           +  G  H  +    +G        RS ++ A     W       Q    + ++   H   
Sbjct: 688 KDNGNRHLIIHTGDMGIADLPADMRSRLKSAFSNITWLTHWGIYQYHPVAKWY-PAHRPG 746

Query: 146 KDVSLP-----------QIWPRQEDPPKL--GSSKRNKLAFFAGAVNSPVREKLLQVWRN 192
           KDV LP            + PR E   +    +  R+   FFAG +    +         
Sbjct: 747 KDVVLPVMVTTQGFHLSPLNPRVEARARRRNQTMARSGTFFFAGRICGDRKPPDPATGDC 806

Query: 193 DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFA 252
                 +SG ++     ++   KFCL   G      ++  SL  GCVPV+I N    PF 
Sbjct: 807 SRTRPDYSGGVRQ---LDISSHKFCLAPLGGGHGKRQVLVSLM-GCVPVLIGNGVLQPFE 862

Query: 253 DVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQS 289
             ++W  FS+ V   DIP L +IL  IS +    +Q+
Sbjct: 863 PEIDWSRFSVSVPEADIPDLPRILANISDQRVADMQA 899


>gi|126332604|ref|XP_001362716.1| PREDICTED: exostosin-2 [Monodelphis domestica]
          Length = 718

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPVIIA+ Y LPF++VL+WK  S+V+  
Sbjct: 308 YPQVLQEASFCMVLRGARLGQAVLSDVLQAGCVPVIIADSYILPFSEVLDWKRASVVIPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  +L+ I   +   +Q
Sbjct: 368 EKMRDMYSVLRSIPQRQIEEMQ 389


>gi|302845052|ref|XP_002954065.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
 gi|300260564|gb|EFJ44782.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
          Length = 600

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 78/335 (23%), Positives = 127/335 (37%), Gaps = 60/335 (17%)

Query: 39  FKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWN 98
           F +  M S   T D +KAD F++P +        R G+    + +   +  I + YP+W+
Sbjct: 191 FWQRLMSSGMRTLDGNKADYFYIPINT-------RTGSLAREE-LEWTLPYIKKTYPWWS 242

Query: 99  RTGGADHFYVACHSIGRSAMQKAWEVKLN------------AIQVVCSSSYFISGHIAHK 146
           +  G  H  +    +G +    A   +LN             +      + +   H   K
Sbjct: 243 KDNGNRHLIIHTGDMGINDFPLATRRELNESLSNITWLTHWGLHEYHPIAKWYPAHRPGK 302

Query: 147 DVSLP-----------QIWPRQEDPPKLGSSKR--NKLAFFAGAVNS------PVREKLL 187
           D+ +P            + PR E   K   + R  N   FFAG +        P   K  
Sbjct: 303 DIVIPVMIMTQGFHLSPMNPRMEAEIKAQGAPRLRNGTLFFAGRICGDRDLPDPKTGKCG 362

Query: 188 QVWRNDS-----EIYAHSGRLK--------TPYADELLGSKFCLHVKGFEVNTARIADSL 234
               + S      +Y     +K        + Y +++   KFCL   G       I  + 
Sbjct: 363 PGHEDYSFGVRQAVYLQHRNVKGFRIVAWTSTYLEDISSHKFCLAPVGGGHGKRNILVA- 421

Query: 235 YYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
           + GC+PV+I +H   PF   ++W  FSI V   DIP L +IL  + + E    Q  +   
Sbjct: 422 FMGCLPVLIGDHVLQPFEPEIDWSRFSISVPEADIPDLPRILANVPASEVASKQKRLRCA 481

Query: 295 RKHFQWH-----VFPSD--YDAFYMVMYELWLRRS 322
            +H  +      +   D  YDAF  +M  L +R+ 
Sbjct: 482 AQHMFYSSTLGAILGEDGRYDAFETLMEILRVRKE 516


>gi|410912512|ref|XP_003969733.1| PREDICTED: exostosin-2-like [Takifugu rubripes]
          Length = 719

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 47/82 (57%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  S FC+ ++G  +  A ++D L  GCVPVI+A+ Y LPF++VL+WK  S+ +  
Sbjct: 309 YPQVLQDSSFCVVLRGARLGQAALSDVLQAGCVPVILADSYILPFSEVLDWKRASVFIPE 368

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  ILK I   +   +Q
Sbjct: 369 EKLSEMYGILKSIPHRQVEEMQ 390


>gi|297791307|ref|XP_002863538.1| hypothetical protein ARALYDRAFT_494490 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297309373|gb|EFH39797.1| hypothetical protein ARALYDRAFT_494490 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 476

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 70/282 (24%), Positives = 118/282 (41%), Gaps = 51/282 (18%)

Query: 52  DPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFN---ISQKYPYW-------NRT 100
           DP++ADLFF+  FS   +     I     P+F S + ++   + +    W        R 
Sbjct: 158 DPAEADLFFVAAFSSLSL-----IVNSDRPEFGSGFGYSEEVMQESLVSWLEGQEWCRRN 212

Query: 101 GGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-KDVSLPQIWPRQED 159
            G DH  VA      +A+ +  +   NA+ +V    +F +   +  KDV +P        
Sbjct: 213 NGRDHVIVAGDP---NALNRVMDRVKNAVLLVTDLGWFRADQGSLVKDVIIPYSHRVDAY 269

Query: 160 PPKLGSSKRNKLAF------FAGAV-----------NSPVREKLLQVWRNDSEIYAHSGR 202
             +LG  +RN L +        G+V              VR+ L ++  N+ ++    G 
Sbjct: 270 EGELGVKQRNNLLYRETSHNLLGSVLVYGLALNVKYGGRVRDLLFKLLENEEDVVIKHGT 329

Query: 203 L----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWK 258
                +      +  SKFCLH  G   +  R+ D+L   CVPVI+++  +LPF D     
Sbjct: 330 QSRENRRAAKQGMHTSKFCLHSAGDTHSACRLFDALASLCVPVIVSDGIELPFED----- 384

Query: 259 SFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
                 A L    + K L+ +  E+ L  Q  + +VR++F +
Sbjct: 385 -----DAALKPGFVVKKLRKVKPEKILKYQKAMKEVRRYFDY 421


>gi|395543711|ref|XP_003773757.1| PREDICTED: exostosin-2 [Sarcophilus harrisii]
          Length = 917

 Score = 60.8 bits (146), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 47/78 (60%)

Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLK 273
           + FC+ ++G  +  A ++D L  GCVPVIIA+ Y LPF++VL+WK  S+V+    +  + 
Sbjct: 294 ASFCIVLRGARLGQAALSDVLQAGCVPVIIADSYILPFSEVLDWKRASVVIPEEKMRDMY 353

Query: 274 KILKGISSEEYLLLQSYV 291
            +L+ I   +   +Q  V
Sbjct: 354 SVLRSIPQRQIEEMQRQV 371


>gi|226526925|gb|ACO71282.1| exostoses 1 (predicted) [Dasypus novemcinctus]
          Length = 744

 Score = 60.8 bits (146), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++ +  A                ESY     A   S F T DPS+A LF L
Sbjct: 111 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 154

Query: 62  PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
                       +  H+ R   + +    +  +H IFN+ S  +P +    G D      
Sbjct: 155 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 208

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
             IG++ + KA              S        + DVS+P      +D P+ G  +   
Sbjct: 209 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 249

Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
                   R  +  F G      + S  R  L  V   +  +   + +            
Sbjct: 250 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDILLLTTCKHGKDWQKHKDSR 309

Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
                    K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V
Sbjct: 310 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 369

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
           +NW   +++     +  +   ++ I  ++ L      L+ +  F W  + S  +   +  
Sbjct: 370 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 424

Query: 315 YEL 317
            E+
Sbjct: 425 LEI 427


>gi|390332053|ref|XP_003723408.1| PREDICTED: LOW QUALITY PROTEIN: exostosin-2-like
           [Strongylocentrotus purpuratus]
          Length = 707

 Score = 60.8 bits (146), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 32/95 (33%), Positives = 52/95 (54%), Gaps = 5/95 (5%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++   +  A ++D+L  GC+PVII++ Y LPF++V++WK  S+VV  
Sbjct: 297 YPHILQDATFCIVLRRTRLGQAALSDALQAGCIPVIISDAYILPFSEVIDWKRASLVVRE 356

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
             IP L  IL  +  E       Y ++ +  F W 
Sbjct: 357 DRIPDLPDILHAVELEHI-----YEMRQQVRFLWQ 386


>gi|219362519|ref|NP_001137073.1| uncharacterized protein LOC100217246 [Zea mays]
 gi|194691480|gb|ACF79824.1| unknown [Zea mays]
 gi|413919898|gb|AFW59830.1| hypothetical protein ZEAMMB73_270023 [Zea mays]
          Length = 458

 Score = 60.8 bits (146), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 66/286 (23%), Positives = 117/286 (40%), Gaps = 47/286 (16%)

Query: 55  KADLFFLPFSIARMRHDRRIG---------TEGIPDFISHY-IFNISQKYPYWNRTGGAD 104
           +AD+ F+PF  A +  +  +G          +G  D+     + +    +P W R+ G D
Sbjct: 126 EADVVFVPF-FATLSAEMELGWGTKGAFRKKDGNEDYRRQREVVDRVTSHPAWRRSSGRD 184

Query: 105 HFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGH--------------IAHKDVSL 150
           H +V    +        W V+      +     F   +              I H  VSL
Sbjct: 185 HIFVLTDPVA------MWHVRAEIAPAILLVVDFGGWYKVDSKSSSKNSSRVIQHTQVSL 238

Query: 151 PQ--IWPRQEDPPKLGSSK---RNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSG 201
            +  I P     P L  S+   R  L +F GA        VREKL  +  N+ ++    G
Sbjct: 239 LKDVIVPYTHLLPTLLLSENKDRRTLLYFKGAKHRHRGGLVREKLWDLLGNEPDVIMEEG 298

Query: 202 RLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
                  ++    L  S+FCLH  G    + R+ D++   C+PVI+++  +LPF  ++++
Sbjct: 299 FPNATGREQSIKGLRTSEFCLHPAGDTPTSCRLFDAIASLCIPVIVSDEVELPFEGIIDY 358

Query: 258 KSFSIVVA---TLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
              SI V+    +    L   L+ IS ++    +  + +V+  F++
Sbjct: 359 TEISIFVSVSNAMRPKWLTSYLRNISKQQKDEFRRNLARVQPIFEY 404


>gi|355752199|gb|EHH56319.1| Exostosin-2, partial [Macaca fascicularis]
          Length = 733

 Score = 60.8 bits (146), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 323 YPQVLQEATFCVVLRGARLGQAMLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 382

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 383 EKMSDVYSILQSIPQRQIEEMQ 404


>gi|148230947|ref|NP_001082080.1| exostosin [Xenopus laevis]
 gi|13183631|gb|AAK15278.1|AF319538_1 exostosin [Xenopus laevis]
 gi|37921194|gb|AAO84329.1| exostosin 1 [Xenopus laevis]
          Length = 738

 Score = 60.8 bits (146), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 71/359 (19%), Positives = 131/359 (36%), Gaps = 87/359 (24%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPF 63
            F+VY+YP ++ D  ++              S  +   A   S F T DPS+A +F L  
Sbjct: 105 GFKVYIYPQQKGDKLSD--------------SYQHILAAIETSRFYTSDPSQACVFVL-- 148

Query: 64  SIARMRHDRRIGTEGIPDFISHYIFNISQKYP---YWNRTGGADHFYVACHSIGRSAMQK 120
           S+  +  D+            HY+ N+  K      WN   G +H     +S        
Sbjct: 149 SLDTLDRDQ---------LSPHYVHNLKSKVQNLHLWN--NGRNHLIFNLYS-------G 190

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHK------DVSLPQIWPRQEDPPKLGSSK------- 167
            W      +      +      I+ +      DVS+P      +D P+ G  K       
Sbjct: 191 TWPDYTEDVGFDIGQAMLAKASISTENFRPNFDVSIPLF---SKDHPRTGGDKGFLRFNN 247

Query: 168 ----RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR---------------- 202
               R  +  F G      + S  R  L  V   +  +   + +                
Sbjct: 248 IPPMRKYMLVFKGKRYLTGIGSDTRNALYHVIMGEDVVLLTTCKHGKKWQKHKDARCDKD 307

Query: 203 ----LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWK 258
                K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V++W 
Sbjct: 308 NAEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWN 367

Query: 259 SFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
             +++     +  +   ++ I  ++ L      L+ +  F W  + S  +   +   E+
Sbjct: 368 QAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTTLEI 421


>gi|148695683|gb|EDL27630.1| exostoses (multiple) 2, isoform CRA_b [Mus musculus]
          Length = 706

 Score = 60.8 bits (146), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 47/82 (57%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC  ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 372 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 431

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 432 EKMSDVYSILQNIPQRQIEEMQ 453


>gi|2251238|gb|AAB62718.1| multiple exostoses type II protein EXT2.I [Homo sapiens]
          Length = 728

 Score = 60.8 bits (146), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 33/93 (35%), Positives = 53/93 (56%), Gaps = 2/93 (2%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 267 LDIPLLKKILKGISSE--EYLLLQSYVLKVRKH 297
             +  +  IL+ I     E +  Q ++  VR+ 
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQRQLFMEPVRRE 400


>gi|297688867|ref|XP_002821894.1| PREDICTED: exostosin-2 isoform 2 [Pongo abelii]
          Length = 728

 Score = 60.8 bits (146), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQ 389


>gi|296010875|ref|NP_001171554.1| exostosin-2 isoform 3 [Homo sapiens]
 gi|119588476|gb|EAW68070.1| exostoses (multiple) 2, isoform CRA_c [Homo sapiens]
          Length = 728

 Score = 60.8 bits (146), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQ 389


>gi|74214457|dbj|BAE31083.1| unnamed protein product [Mus musculus]
          Length = 690

 Score = 60.8 bits (146), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 47/82 (57%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC  ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 412 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 471

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 472 EKMSDVYSILQNIPQRQIEEMQ 493


>gi|426368047|ref|XP_004051026.1| PREDICTED: exostosin-2 isoform 2 [Gorilla gorilla gorilla]
          Length = 728

 Score = 60.8 bits (146), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQ 389


>gi|426245345|ref|XP_004016473.1| PREDICTED: exostosin-2 [Ovis aries]
          Length = 718

 Score = 60.8 bits (146), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 31/82 (37%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPVIIA+ Y LPF++VL+WK  S+VV  
Sbjct: 308 YPQVLQEATFCMVLRGARLGQAVLSDVLRAGCVPVIIADSYVLPFSEVLDWKRASVVVPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 368 EKMLDVYSILQSIPRRQIEEMQ 389


>gi|196014406|ref|XP_002117062.1| hypothetical protein TRIADDRAFT_1026 [Trichoplax adhaerens]
 gi|190580284|gb|EDV20368.1| hypothetical protein TRIADDRAFT_1026 [Trichoplax adhaerens]
          Length = 657

 Score = 60.8 bits (146), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 37/146 (25%), Positives = 71/146 (48%), Gaps = 10/146 (6%)

Query: 163 LGSSKRNKLAFFAGAVNSPVRE--KLLQVWRNDSEIYAHSGRL---KTPYADELLGSKFC 217
           +GS  RN L      +N  +    K    WR   +       L   +  Y D L+ S FC
Sbjct: 179 IGSETRNALHHLHDDLNYIMVTTCKHGNTWREHQDSRCVVDELTYGEYDYQDLLINSTFC 238

Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILK 277
           L  +G  + + R  ++L +GC+P++++N + LPF++V++WK   + +    +  + ++++
Sbjct: 239 LVPRGRRLGSFRFLEALQFGCIPIVLSNGWVLPFSEVIDWKKACVQIDERQLFDVPELIE 298

Query: 278 GISSEEYLLLQSYVLKVRKHFQWHVF 303
            IS E+ L      +K +  F W  +
Sbjct: 299 SISDEKIL-----AMKQQSIFLWQTY 319


>gi|332836218|ref|XP_003313041.1| PREDICTED: exostosin-2 isoform 1 [Pan troglodytes]
          Length = 728

 Score = 60.8 bits (146), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQ 389


>gi|332210817|ref|XP_003254509.1| PREDICTED: exostosin-2 isoform 2 [Nomascus leucogenys]
          Length = 728

 Score = 60.8 bits (146), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQ 389


>gi|357508935|ref|XP_003624756.1| Cysteine synthase [Medicago truncatula]
 gi|355499771|gb|AES80974.1| Cysteine synthase [Medicago truncatula]
          Length = 407

 Score = 60.8 bits (146), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 38/139 (27%), Positives = 76/139 (54%), Gaps = 9/139 (6%)

Query: 183 REKLLQVWRNDSEIYAHSGRLKTP----YADELLGSKFCLHVKGFEVNTARIADSLYYGC 238
           R++L  + + + +++   G ++        + +  SKFCL++ G   ++ R+ D++   C
Sbjct: 13  RQELFYLLKEEKDVHFSFGSVQKGGVRNATNGMRSSKFCLNIAGDTPSSNRLFDAIASHC 72

Query: 239 VPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVR 295
           VPVII++  +LP+ DVL++  F + V T D      L   ++ I  +E+  + + + +V 
Sbjct: 73  VPVIISDEIELPYEDVLDYSKFCVFVRTRDAVKKKYLINFIRSIGKDEWTRMWNRLKEVE 132

Query: 296 KHFQWHVFPS-DYDAFYMV 313
           K F++  FPS + DA  M+
Sbjct: 133 KFFEFQ-FPSKEGDAVEMI 150


>gi|159470095|ref|XP_001693195.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158277453|gb|EDP03221.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 659

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 70/285 (24%), Positives = 113/285 (39%), Gaps = 47/285 (16%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARM------RHDRRIGTEGIPDFISHY 86
           Y  ES   +  + S   T DP +AD F++P  I+ +        D  +        + H 
Sbjct: 338 YGVESGLHEYLLLSEHRTFDPEEADFFYVPVYISCLIWPVLNYADFPVFYSNGGTRVMHA 397

Query: 87  IFNISQ-------KYPYWNRTGGADHFYVACHSIG-----RSAMQKAWEVKLNAIQVV-C 133
           +  +S+        YP+W R GG DH +   H  G      S +   W      +     
Sbjct: 398 VNMLSEARDWIDANYPFWKRRGGRDHIWTFPHDEGACWAPNSIVSSIWLTHWGRMDPDHT 457

Query: 134 SSSYFISGHIAHKDVSLPQIWPRQEDPPKL--------GSSKRNKLAFFAGAVNSPVREK 185
           S S F + +     VS     PRQ   PK         G     K+   + A N      
Sbjct: 458 SKSSFDADNYTRDFVS-----PRQ---PKGYTHLIQGHGCYDPKKIYNMSIANN------ 503

Query: 186 LLQVWRNDSEIYAHSGR-LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIA 244
               WR    +    G+ ++  Y+D L  S FCL   G +  +AR  D++ +GC+PV++ 
Sbjct: 504 ----WRQKYNVLVGDGQDVQGDYSDLLSRSLFCLVATG-DGWSARTEDAVLHGCIPVVVI 558

Query: 245 NHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQS 289
           +   + F  + +  SFSI +   D+  +  ILK +  E    +Q+
Sbjct: 559 DGVHMKFETLFDVDSFSIRIPEADVANILTILKALPEERVRAMQA 603


>gi|46250443|gb|AAH68545.1| Exostoses (multiple) 2 [Homo sapiens]
          Length = 718

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQ 389


>gi|390470425|ref|XP_002755247.2| PREDICTED: exostosin-2 isoform 1 [Callithrix jacchus]
          Length = 718

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQ 389


>gi|46370069|ref|NP_997005.1| exostosin-2 isoform 2 [Homo sapiens]
 gi|3023739|sp|Q93063.1|EXT2_HUMAN RecName: Full=Exostosin-2; AltName:
           Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
           acetylglucosaminyl-proteoglycan
           4-alpha-N-acetylglucosaminyltransferase; AltName:
           Full=Multiple exostoses protein 2; AltName:
           Full=Putative tumor suppressor protein EXT2
 gi|1518042|gb|AAB07008.1| EXT2 [Homo sapiens]
 gi|1519605|gb|AAC51219.1| multiple exostosis 2 [Homo sapiens]
 gi|1621113|gb|AAC50764.1| hereditary multiple exostoses gene 2 protein [Homo sapiens]
 gi|14603196|gb|AAH10058.1| Exostoses (multiple) 2 [Homo sapiens]
 gi|119588474|gb|EAW68068.1| exostoses (multiple) 2, isoform CRA_b [Homo sapiens]
 gi|189065454|dbj|BAG35293.1| unnamed protein product [Homo sapiens]
 gi|325463523|gb|ADZ15532.1| exostoses (multiple) 2 [synthetic construct]
          Length = 718

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQ 389


>gi|432090368|gb|ELK23794.1| Exostosin-2 [Myotis davidii]
          Length = 718

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLRAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 368 EKMADVYSILQSIPQRQIEEMQ 389


>gi|402893736|ref|XP_003910045.1| PREDICTED: exostosin-2-like isoform 1 [Papio anubis]
          Length = 535

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 341 YPQVLQEATFCVVLRGARLGQAILSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 400

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 401 EKMSDVYSILQSIPQRQIEEMQ 422


>gi|332210815|ref|XP_003254508.1| PREDICTED: exostosin-2 isoform 1 [Nomascus leucogenys]
          Length = 718

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQ 389


>gi|74213823|dbj|BAE29346.1| unnamed protein product [Mus musculus]
          Length = 786

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 47/82 (57%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC  ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 376 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 435

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 436 EKMSDVYSILQNIPQRQIEEMQ 457


>gi|395742863|ref|XP_003777829.1| PREDICTED: exostosin-2 [Pongo abelii]
          Length = 751

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 341 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 400

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 401 EKMSDVYSILQSIPQRQIEEMQ 422


>gi|426368045|ref|XP_004051025.1| PREDICTED: exostosin-2 isoform 1 [Gorilla gorilla gorilla]
          Length = 718

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQ 389


>gi|397473541|ref|XP_003808267.1| PREDICTED: exostosin-2 [Pan paniscus]
 gi|410305316|gb|JAA31258.1| exostosin 2 [Pan troglodytes]
          Length = 751

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 341 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 400

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 401 EKMSDVYSILQSIPQRQIEEMQ 422


>gi|168277694|dbj|BAG10825.1| exostosin-2 [synthetic construct]
          Length = 746

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 336 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 395

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 396 EKMSDVYSILQSIPQRQIEEMQ 417


>gi|147846684|emb|CAN80640.1| hypothetical protein VITISV_016911 [Vitis vinifera]
          Length = 1363

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 79/320 (24%), Positives = 136/320 (42%), Gaps = 71/320 (22%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPF--SIARMRHDRR----------IGTEGIP 80
           Y ++    ++ + S   T D  +AD FF+P   S   +R D            + +    
Sbjct: 409 YGAQMAIYESILASPHRTLDGEEADFFFVPVLDSCIIVRADDAPHLNMHAHGGLRSSLTL 468

Query: 81  DFISHYIFNISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQV 131
           +F      +I ++YP+WNR+ G DH +       AC++   I  S M   W    N+   
Sbjct: 469 EFYKTAYDHIVEQYPFWNRSSGRDHIWFFSWDEGACYAPKEIWDSMMLVHWG-NTNSKHN 527

Query: 132 VCSSSYFI-----------SGHIA---HKDVSLPQIWPRQEDPPKLGS-------SKRNK 170
             +++Y+              H     +KD+ LP  W R  D   L S        +R  
Sbjct: 528 HSTTAYWADNWDSVSSDRRGNHPCFDPYKDLVLPA-WKR-PDVVSLSSKLWSRPREQRKT 585

Query: 171 LAFFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGR------LKTP----- 206
           L +F G +                +R+K+ + + +        G+      + TP     
Sbjct: 586 LFYFNGNLGPAYEGGRPETTYSMGIRQKVAEEFGSSPNKEGKLGKQHAEDVIVTPLRSGN 645

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y + L  S FC  + G +  + R  DS+  GC+PV+I +   LPF ++LN++SF++ +  
Sbjct: 646 YHESLASSVFCGVMPG-DGWSGRFEDSILQGCIPVVIQDGIFLPFENMLNYESFAVRIRE 704

Query: 267 LDIPLLKKILKGISSEEYLL 286
            +IP L KIL+ +S + Y+L
Sbjct: 705 DEIPNLIKILR-LSGDPYVL 723


>gi|332836216|ref|XP_508383.3| PREDICTED: exostosin-2 isoform 3 [Pan troglodytes]
          Length = 718

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQ 389


>gi|332210819|ref|XP_003254510.1| PREDICTED: exostosin-2 isoform 3 [Nomascus leucogenys]
          Length = 751

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 341 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 400

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 401 EKMSDVYSILQSIPQRQIEEMQ 422


>gi|296010873|ref|NP_000392.3| exostosin-2 isoform 1 [Homo sapiens]
 gi|119588477|gb|EAW68071.1| exostoses (multiple) 2, isoform CRA_d [Homo sapiens]
          Length = 751

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 341 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 400

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 401 EKMSDVYSILQSIPQRQIEEMQ 422


>gi|73982548|ref|XP_533196.2| PREDICTED: exostosin-2 isoform 2 [Canis lupus familiaris]
          Length = 718

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLRAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQ 389


>gi|403254631|ref|XP_003920065.1| PREDICTED: exostosin-2 [Saimiri boliviensis boliviensis]
          Length = 718

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQ 389


>gi|332836220|ref|XP_003313042.1| PREDICTED: exostosin-2 isoform 2 [Pan troglodytes]
          Length = 751

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 341 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 400

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 401 EKMSDVYSILQSIPQRQIEEMQ 422


>gi|355566589|gb|EHH22968.1| Exostosin-2 [Macaca mulatta]
          Length = 751

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 31/94 (32%), Positives = 53/94 (56%), Gaps = 7/94 (7%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 341 YPQVLQEATFCVVLRGARLGQAMLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 400

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
             +  +  IL+ I        Q  + ++++  +W
Sbjct: 401 EKMSDVYSILQSIP-------QRQIEEMKRQARW 427


>gi|119588473|gb|EAW68067.1| exostoses (multiple) 2, isoform CRA_a [Homo sapiens]
 gi|119588475|gb|EAW68069.1| exostoses (multiple) 2, isoform CRA_a [Homo sapiens]
          Length = 731

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 321 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 380

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 381 EKMSDVYSILQSIPQRQIEEMQ 402


>gi|124301269|gb|ABN04853.1| Exostosin-like [Medicago truncatula]
          Length = 196

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 32/126 (25%), Positives = 68/126 (53%), Gaps = 7/126 (5%)

Query: 182 VREKLLQVWRNDSEIY----AHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYG 237
           +R+ L Q+   + ++     A S   +   +  +  SKFCLH  G   +  R+ D++   
Sbjct: 20  IRDTLFQILEKEDDVIIKHGAQSRESRRAASQGMHTSKFCLHPAGDTPSACRLFDAIVSL 79

Query: 238 CVPVIIANHYDLPFADVLNWKSFSIVVAT---LDIPLLKKILKGISSEEYLLLQSYVLKV 294
           CVPVI+++  +LPF D ++++  ++ V T   +    L  IL+G++ +  +  Q  + +V
Sbjct: 80  CVPVIVSDSIELPFEDTIDYRKIAVFVETAAAIQPGYLVSILRGMAPDRIVEYQKELKEV 139

Query: 295 RKHFQW 300
           +++F++
Sbjct: 140 KRYFKY 145


>gi|297688871|ref|XP_002821896.1| PREDICTED: exostosin-2 isoform 4 [Pongo abelii]
          Length = 718

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQ 389


>gi|335281986|ref|XP_003122905.2| PREDICTED: exostosin-2-like [Sus scrofa]
          Length = 718

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLRAGCVPVVIADSYVLPFSEVLDWKRASVVVPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 368 EKMSDVYSILQSIPRRQMEEMQ 389


>gi|402893738|ref|XP_003910046.1| PREDICTED: exostosin-2-like isoform 2 [Papio anubis]
          Length = 515

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 321 YPQVLQEATFCVVLRGARLGQAILSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 380

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 381 EKMSDVYSILQSIPQRQIEEMQ 402


>gi|311253457|ref|XP_001925015.2| PREDICTED: exostosin-1 [Sus scrofa]
          Length = 746

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 76/363 (20%), Positives = 135/363 (37%), Gaps = 95/363 (26%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++ +  A                ESY     A   S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156

Query: 62  PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
                       +  H+ +   + +    +  +H IFN+ S  +P +    G D      
Sbjct: 157 SLDTLDRDQLSPQYVHNLKSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
             IG++ + KA              S        + DVS+P      +D P+ G  +   
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251

Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
                   R  +  F G      + S  R  L  V   +  +   + +            
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 311

Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
                    K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
           +NW   +++     +  +   ++ I  ++ L      L+ +  F W  + S  +   +  
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426

Query: 315 YEL 317
            E+
Sbjct: 427 LEI 429


>gi|297268065|ref|XP_001106468.2| PREDICTED: exostosin-2 [Macaca mulatta]
          Length = 718

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAILSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQ 389


>gi|387015838|gb|AFJ50038.1| Exostosin-1-like [Crotalus adamanteus]
          Length = 751

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 75/363 (20%), Positives = 134/363 (36%), Gaps = 95/363 (26%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++ +  A                ESY     A   S F T DPS+A LF L
Sbjct: 118 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 161

Query: 62  PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
                       +  H+ R   + +    +  +H +FN+ S  +P +    G D      
Sbjct: 162 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLVFNLYSGTWPDYTEDVGFD------ 215

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
             IG++ + KA              S          DVS+P      +D P+ G  +   
Sbjct: 216 --IGQAMLAKA--------------SISTENFRPRFDVSIPLF---SKDHPRTGGERGFL 256

Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
                   R  +  F G      + S  R  L  V   +  +   + +            
Sbjct: 257 RFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 316

Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
                    K  Y + L  + FCL  +G  + + R  ++L   C+PV+++N ++LPF++V
Sbjct: 317 CDRDNAEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACIPVMLSNGWELPFSEV 376

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
           +NW   +++     +  +   ++ I  ++ L      L+ +  F W  + S  +   +  
Sbjct: 377 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 431

Query: 315 YEL 317
            E+
Sbjct: 432 LEI 434


>gi|161612233|gb|AAI55813.1| Exostoses (multiple) 1c [Danio rerio]
          Length = 737

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 82/357 (22%), Positives = 135/357 (37%), Gaps = 83/357 (23%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFL 61
            FRVY+YP  +                G   SESY K   +  +S + T DP +A LF L
Sbjct: 105 GFRVYIYPPEK----------------GERVSESYRKILTSVSESRYYTSDPREACLFVL 148

Query: 62  PFSIARMRHDRRIGTEGIPDFISHYIFNISQK---YPYWNRTGGADHFYVACHSIGRSAM 118
              I  +  D+             ++ N+ ++   YP WN   G +H     +S      
Sbjct: 149 --GIDTLDRDQ---------LSQQFVPNVDERIRGYPLWN--DGRNHVIFNLYSGTWPNY 195

Query: 119 QKAWEVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSS----------K 167
            +  ++  N  Q + + +   + H     D+S+P     +E P K G            +
Sbjct: 196 TE--DLGFNVGQAILAKASLNTEHFRPGFDISIPLF--SKEHPQKGGKRGWLVRNSVPPR 251

Query: 168 RNKLAFFAG-----AVNSPVREKL-----------LQVWRNDSEIYAHSGRL-------- 203
           R  L  F G      + S  R  L           L   R+  +   H            
Sbjct: 252 RKYLLMFKGKRYLTGIGSDTRNALHHIHNGKDIVSLTTCRHGKDWEKHKDARCDHDNQEY 311

Query: 204 -KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
            +  Y + L  S FCL  +G  + + R  +SL   C+PV+++N ++LPF+DV+ W     
Sbjct: 312 ERFDYQELLHNSTFCLVPRGRRLGSFRFLESLQAACIPVLLSNGWELPFSDVIQWNQ--- 368

Query: 263 VVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQ--WHVFPSDYDAFYMVMYEL 317
            V   D    +++L  + S    +    VL +R+  Q  W  + S  D   +   E+
Sbjct: 369 AVVEGD----ERLLLQVPSTVRAVGIDRVLALRQQTQTLWDAYFSSVDKIVLTTLEI 421


>gi|297688865|ref|XP_002821893.1| PREDICTED: exostosin-2 isoform 1 [Pongo abelii]
          Length = 731

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 321 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 380

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 381 EKMSDVYSILQSIPQRQIEEMQ 402


>gi|410973605|ref|XP_003993238.1| PREDICTED: exostosin-2 [Felis catus]
          Length = 718

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLRAGCVPVVIADSYVLPFSEVLDWKRASVVVPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ +   +   +Q
Sbjct: 368 EKMSEVYSILQSVPQRQIEEMQ 389


>gi|196004286|ref|XP_002112010.1| hypothetical protein TRIADDRAFT_24279 [Trichoplax adhaerens]
 gi|190585909|gb|EDV25977.1| hypothetical protein TRIADDRAFT_24279 [Trichoplax adhaerens]
          Length = 668

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 26/76 (34%), Positives = 47/76 (61%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  S FCL ++G+ +  +   D+L +GC+PV++++ Y LPF++VL+WK  ++V   
Sbjct: 251 YPTILQDSTFCLMLRGYRLIQSNFLDALKFGCIPVVLSDEYILPFSEVLDWKRAALVFRE 310

Query: 267 LDIPLLKKILKGISSE 282
             +  L  +L  IS++
Sbjct: 311 DQLLSLPAVLSSISTK 326


>gi|47230478|emb|CAF99671.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 722

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 47/82 (57%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  S FC+ ++G  +  A ++D L  GCVPVI+A+ Y LPF++VL+WK  S+ +  
Sbjct: 309 YPQILQESSFCVVLRGARLGQAVLSDVLQAGCVPVILADSYILPFSEVLDWKRASVFIPE 368

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  ILK I   +   +Q
Sbjct: 369 EKLSEMYSILKSIPHRQVEEMQ 390


>gi|307111637|gb|EFN59871.1| hypothetical protein CHLNCDRAFT_133704 [Chlorella variabilis]
          Length = 833

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 69/323 (21%), Positives = 123/323 (38%), Gaps = 74/323 (22%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPF--------------------SIARMRHDR 72
           Y  E+   +  ++S   T DP +AD F+LP                       A  R   
Sbjct: 413 YQPETGIHEMLLQSEHRTLDPEEADYFYLPVYASCAIWPVLYSNDFPYFHGGPAAQR--- 469

Query: 73  RIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVV 132
              T G  +        +   +PYW+R GG DH  +  H  G       W   +    +V
Sbjct: 470 ---THGATNMFMEVQSWVRSHFPYWDRNGGRDHIVLTVHDEG-----SCWLPAVLRPAIV 521

Query: 133 CSSSYFISGHIAHKDVSLPQ----------------IWPRQEDPPKLG---SSKRNKLAF 173
            S       H    DV+ P                 +W  +    KLG       +K+ +
Sbjct: 522 MS-------HWGRTDVNPPAGTGYDADTYSNEVRHPVWQPEGHLSKLGEFPCYDPSKVTY 574

Query: 174 FAGAVNSP--------VREKLLQV-----WRNDSEIYAHSGRLKTP---YADELLGSKFC 217
             G    P         R+ L  +     W +   I+  +G        Y++ +  S FC
Sbjct: 575 ILGGRIQPENARYSRGTRQFLANISEAEGWWDKYRIHVGAGSPPGGPGDYSECMARSVFC 634

Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILK 277
           L + G +  ++R  D++ +GC+PVI+ +  +L +  +L+  ++S+ V   D+  + +IL+
Sbjct: 635 LALMG-DGYSSRFDDAVLHGCIPVIVQDGIELTWHSLLDIPAYSLRVPQADMARIPQILQ 693

Query: 278 GISSEEYLLLQSYVLKVRKHFQW 300
            +  E+   +Q+ + KV +   W
Sbjct: 694 AVPQEDIARMQANLAKVWRRHIW 716


>gi|188528923|ref|NP_001120887.1| exostosin 2 [Xenopus (Silurana) tropicalis]
 gi|183986340|gb|AAI66243.1| ext2 protein [Xenopus (Silurana) tropicalis]
          Length = 718

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  S FC+ ++G  +  + ++D L  GCVPVIIA+ Y LPF++VL+WK  S+V+  
Sbjct: 308 YPQILQESTFCIVLRGARLGQSVLSDVLQAGCVPVIIADSYVLPFSEVLDWKRASVVIPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ +   +   +Q
Sbjct: 368 EKMFEMYSILQAVPQRQLEEMQ 389


>gi|74201817|dbj|BAE28510.1| unnamed protein product [Mus musculus]
          Length = 701

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 47/82 (57%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC  ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 291 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 350

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 351 EKMSDVYSILQNIPQRQIEEMQ 372


>gi|226442845|ref|NP_034293.2| exostosin-2 [Mus musculus]
 gi|341940670|sp|P70428.2|EXT2_MOUSE RecName: Full=Exostosin-2; AltName:
           Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
           acetylglucosaminyl-proteoglycan
           4-alpha-N-acetylglucosaminyltransferase; AltName:
           Full=Multiple exostoses protein 2 homolog
 gi|74216635|dbj|BAE37749.1| unnamed protein product [Mus musculus]
 gi|148695682|gb|EDL27629.1| exostoses (multiple) 2, isoform CRA_a [Mus musculus]
          Length = 718

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 47/82 (57%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC  ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 308 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 368 EKMSDVYSILQNIPQRQIEEMQ 389


>gi|13879260|gb|AAH06597.1| Exostoses (multiple) 2 [Mus musculus]
          Length = 718

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 47/82 (57%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC  ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 308 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 368 EKMSDVYSILQNIPQRQIEEMQ 389


>gi|260826381|ref|XP_002608144.1| hypothetical protein BRAFLDRAFT_91377 [Branchiostoma floridae]
 gi|229293494|gb|EEN64154.1| hypothetical protein BRAFLDRAFT_91377 [Branchiostoma floridae]
          Length = 595

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 22/69 (31%), Positives = 46/69 (66%)

Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLK 273
           + FC+ ++G  +  + ++D++  GC+PVI  + Y +PF++VL+WK  ++++   D+P + 
Sbjct: 194 ATFCMILRGARMGQSALSDAMMAGCIPVIAIDTYVMPFSEVLDWKRAAVILREEDLPDVH 253

Query: 274 KILKGISSE 282
            +L+ IS E
Sbjct: 254 NVLRRISQE 262


>gi|395512319|ref|XP_003760388.1| PREDICTED: exostosin-1 [Sarcophilus harrisii]
          Length = 746

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++                G   +ESY     A   S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQK----------------GEKIAESYQNILAAIEGSRFYTSDPSQACLFVL 156

Query: 62  PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
                       +  H+ +   + +    +  +H IFN+ S  +P +    G D      
Sbjct: 157 SLDTLDRDQLSPQYVHNLKSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
             IG++ + KA              S        + DVS+P      +D P+ G  K   
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGEKGFL 251

Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
                   R  +  F G      + S  R  L  V   +  +   + +            
Sbjct: 252 RFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 311

Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
                    K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V
Sbjct: 312 CDKDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
           ++W   +I+     +  +   ++ I  ++ L      L+ +  F W  + S  +   +  
Sbjct: 372 IDWNQAAIIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426

Query: 315 YEL 317
            E+
Sbjct: 427 LEI 429


>gi|187469677|gb|AAI66748.1| Ext2 protein [Rattus norvegicus]
          Length = 718

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 76/307 (24%), Positives = 120/307 (39%), Gaps = 44/307 (14%)

Query: 5   FRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFLP 62
            +VY+YP ++    A V       P  +  S  Y     A   S + T D S+A LF   
Sbjct: 104 IKVYIYPLKKYVDDAGV-------PVSSAISREYNELLTAISDSDYYTDDISRACLFVPS 156

Query: 63  FSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAW 122
             +              P  I      ++Q  P W+R  G +H        G      A 
Sbjct: 157 IDVLNQN----------PLRIKETAQALAQ-LPRWDR--GTNHLLFNMLPGGPPDYNTAL 203

Query: 123 EVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQED---PPKLGSSKRNKLAFFAGAVN 179
           +V  +   ++    +    +    DVS+P   P   +   P K   S+R  L     A++
Sbjct: 204 DVPRDR-ALLAGGGFSTWTYRQGYDVSIPVFSPLSAEVALPEKAPGSRRYFLLSSQMAIH 262

Query: 180 SPVREKL--LQVWRNDSEIYAH------SGRLKT----------PYADELLGSKFCLHVK 221
              RE+L  LQ    +S +          G L             Y   L  + FC  ++
Sbjct: 263 PEYREELEALQAKHRESVLVLDKCTNLSEGVLSVRKRCHQHQVFDYPQVLQEATFCTVLR 322

Query: 222 GFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISS 281
           G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+V+    +  +  IL+ I  
Sbjct: 323 GARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVIPEEKMSDVYSILQNIPQ 382

Query: 282 EEYLLLQ 288
            +   +Q
Sbjct: 383 RQIEEMQ 389


>gi|74144073|dbj|BAE22143.1| unnamed protein product [Mus musculus]
          Length = 435

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 47/82 (57%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC  ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 25  YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 84

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 85  EKMSDVYSILQNIPQRQIEEMQ 106


>gi|197101852|ref|NP_001125538.1| exostosin-1 [Pongo abelii]
 gi|75042009|sp|Q5RBC3.1|EXT1_PONAB RecName: Full=Exostosin-1; AltName:
           Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
           acetylglucosaminyl-proteoglycan
           4-alpha-N-acetylglucosaminyltransferase; AltName:
           Full=Multiple exostoses protein 1 homolog
 gi|55728386|emb|CAH90937.1| hypothetical protein [Pongo abelii]
          Length = 746

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 76/363 (20%), Positives = 134/363 (36%), Gaps = 95/363 (26%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VY YP ++ +  A                ESY     A   S F T DPS+A LF L
Sbjct: 113 GFKVYAYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156

Query: 62  PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
                       +  H+ R   + +    +  +H IFN+ S  +P +    G D      
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
             IG++ + KA              S        + DVS+P      +D P+ G  +   
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251

Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
                   R  +  F G      + S  R  L  V   +  +   + +            
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 311

Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
                    K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
           +NW   +++     +  +   ++ I  ++ L      L+ +  F W  + S  +   +  
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426

Query: 315 YEL 317
            E+
Sbjct: 427 LEI 429


>gi|126322251|ref|XP_001370057.1| PREDICTED: exostosin-1 [Monodelphis domestica]
          Length = 746

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++                G   +ESY     A   S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQK----------------GEKIAESYQNILAAIEGSRFYTSDPSQACLFVL 156

Query: 62  PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
                       +  H+ +   + +    +  +H IFN+ S  +P +    G D      
Sbjct: 157 SLDTLDRDQLSPQYVHNLKSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
             IG++ + KA              S        + DVS+P      +D P+ G  K   
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGEKGFL 251

Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
                   R  +  F G      + S  R  L  V   +  +   + +            
Sbjct: 252 RFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 311

Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
                    K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V
Sbjct: 312 CDKDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
           ++W   +I+     +  +   ++ I  ++ L      L+ +  F W  + S  +   +  
Sbjct: 372 IDWNQAAIIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426

Query: 315 YEL 317
            E+
Sbjct: 427 LEI 429


>gi|444707556|gb|ELW48821.1| Exostosin-2 [Tupaia chinensis]
          Length = 740

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 47/82 (57%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FCL ++G  +  A ++D L  GCVPVI+A+ Y LPF++VL+WK  S+ V  
Sbjct: 330 YPQVLQEATFCLVLRGARLGQAVLSDVLQAGCVPVIVADSYILPFSEVLDWKRASVAVPE 389

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ +   +   +Q
Sbjct: 390 EKLSDVYSILQSVPQRQIEEMQ 411


>gi|327259699|ref|XP_003214673.1| PREDICTED: exostosin-2-like [Anolis carolinensis]
          Length = 718

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 28/82 (34%), Positives = 47/82 (57%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GC+PV+IA+ Y LPF++VL+WK  S+V+  
Sbjct: 308 YPQVLQEATFCIVLRGARLGQAVLSDVLQAGCIPVVIADSYILPFSEVLDWKRASVVIPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL  I   +   +Q
Sbjct: 368 EKMSEMYSILHSIPQRQIEEMQ 389


>gi|302830910|ref|XP_002947021.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
 gi|300268065|gb|EFJ52247.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
          Length = 834

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 78/340 (22%), Positives = 138/340 (40%), Gaps = 80/340 (23%)

Query: 32  NYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEG----IPD-----F 82
            Y  E  F +  ++S   T DP  AD F++P   +          +G    +P       
Sbjct: 431 TYGLEVLFHEMLLQSEHRTFDPEAADYFYVPVYGSCFIFPLHCYADGPWWHVPSGPRVMH 490

Query: 83  ISHYIFN----ISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVV------ 132
           +++ +      I + +PYW+R GG DH ++  H  G  A     E+  ++I +       
Sbjct: 491 VTNMMLEVRDWIRKHFPYWDRRGGRDHIWLMTHDEG--ACYAPTEIYNSSIFLTHWGRID 548

Query: 133 ---CSSSYF---------------------ISGHIAH---KDVSLPQI--------WPRQ 157
               S++ F                     I GH  +   KD+ +P +         P  
Sbjct: 549 KHHASNTAFTPDNYTQEYVHPEQPGGWLHLIDGHPCYTPGKDLVVPALKLPHHFRQSPLL 608

Query: 158 EDPPKLGSSKRNKLAFFAGAVNS--------PVREKLLQVWRNDSEIYAHS------GRL 203
             PP+    +R+ L +  G V           +R++L ++WR+   +  ++        +
Sbjct: 609 FHPPR----QRDILLYLRGDVGKHRLPNYSRGIRQRLYRLWRDQQWLQGYNVMIGDGSDV 664

Query: 204 KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIV 263
              Y++ L  SKFCL V G +  + R+ D++ +GCVPVII +     + D L  + FSI 
Sbjct: 665 PGDYSEHLSRSKFCLVVPG-DGWSPRLEDAVLHGCVPVIIMDGVHGVWEDQLELERFSIR 723

Query: 264 VATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVF 303
           V   ++  L + L  +      +L+    K+RK   WH +
Sbjct: 724 VGEDELEGLPQQLAVVPQR---VLEDMQRKLRK--VWHRY 758


>gi|356533217|ref|XP_003535163.1| PREDICTED: uncharacterized protein LOC100807663 [Glycine max]
          Length = 795

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 75/337 (22%), Positives = 134/337 (39%), Gaps = 80/337 (23%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPF----SIAR--------MRHDRRIGTEGIP 80
           Y ++    ++ + S   T +  +AD FF+P      I R        M+    + +    
Sbjct: 395 YGAQIALYESLLASPHRTLNGEEADFFFVPVLDSCIITRADDAPHLSMQEHMGLRSSLTL 454

Query: 81  DFISHYIFNISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQV 131
           ++  +   +I ++YPYW+ + G DH +       AC++   I  S M   W    N    
Sbjct: 455 EYYKNTYTHIVEQYPYWSHSSGRDHIWSFSWDEGACYAPKEIWNSMMLVHWG-NTNTKHN 513

Query: 132 VCSSSYFISG--------------HIAHKDVSLP------------QIWPRQEDPPKLGS 165
             +++Y+                     KD+ LP            ++W R  +      
Sbjct: 514 HSTTAYWADNWDKISSDRRGIHPCFDPDKDLVLPAWKVPDAYVLTSKLWARSHE------ 567

Query: 166 SKRNKLAFFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGR------LKTP 206
            KR  L +F G +                +R+KL + + +        G+      + TP
Sbjct: 568 -KRKTLFYFNGNLGPAYPHGRPEDTYSMGIRQKLAEEFGSSPNKDGKLGKQHAKDVIVTP 626

Query: 207 -----YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFS 261
                Y  +L  S FC    G +  + R+ DS+  GC+PV+I +   LP+ +VLN+ SF+
Sbjct: 627 ERSEDYHMDLASSVFCGVFPG-DGWSGRMEDSILQGCIPVVIQDGIFLPYENVLNYDSFA 685

Query: 262 IVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
           + +   +IP L K L+G +  E     + V K+ + F
Sbjct: 686 VRIPEAEIPNLIKTLRGFNDTEIEFKLANVQKIWQRF 722


>gi|323456573|gb|EGB12440.1| hypothetical protein AURANDRAFT_70660 [Aureococcus anophagefferens]
          Length = 1731

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 35/145 (24%), Positives = 69/145 (47%), Gaps = 20/145 (13%)

Query: 175  AGAVNSPVREKLLQVWRNDSEIYAHSGRLKTP-----------------YADELLGSKFC 217
            AG +   +R  L   W     + A  G  + P                 Y + +  +K+C
Sbjct: 1465 AGGLEYDLRRALTDAWDPGGPLRAKRGVARAPPPLDPRVDVRFLMAKANYTESMRRAKYC 1524

Query: 218  LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILK 277
            +  +GF   + R+++++  GCVP  ++     P+A VL+W +FS+ +A  D+  L ++L 
Sbjct: 1525 VVTEGFSPWSPRLSEAVALGCVPCFLSPSLAPPYATVLDWSAFSVEIAEADVGRLPEVL- 1583

Query: 278  GISSEEYLLLQSYVLKVRKHFQWHV 302
              ++ ++  L + +L+VR  F + V
Sbjct: 1584 --AAYDWAYLHANLLRVRPLFAFCV 1606


>gi|417404013|gb|JAA48784.1| Putative glycosyl transferase family 64 domain protein [Desmodus
           rotundus]
          Length = 701

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 291 YPQVLQEATFCVVLRGARLGQAVLSDILRAGCVPVVIADSYILPFSEVLDWKRASVVVPE 350

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 351 EKMSDVYGILQSIPQRQIGEMQ 372


>gi|224046654|ref|XP_002200457.1| PREDICTED: exostosin-1 [Taeniopygia guttata]
          Length = 741

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 77/363 (21%), Positives = 134/363 (36%), Gaps = 95/363 (26%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++ +  A                ESY     A   S F T DP +A LF L
Sbjct: 108 GFKVYVYPQQKGEKIA----------------ESYQNVLAAIEGSRFYTSDPGQACLFVL 151

Query: 62  PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
                       +  H+ R   + +    +  +H IFN+ S  +P +    G D      
Sbjct: 152 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 205

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
             IG++ + KA              S        + DVS+P      +D P+ G  K   
Sbjct: 206 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGEKGFL 246

Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
                   R  +  F G      + S  R  L  V   +  +   + +            
Sbjct: 247 RFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 306

Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
                    K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V
Sbjct: 307 CDRDNAEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 366

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
           ++W   +I+     +  +   ++ I  ++ L      L+ +  F W  + S  +   +  
Sbjct: 367 IDWNQAAIIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 421

Query: 315 YEL 317
            E+
Sbjct: 422 LEI 424


>gi|157820411|ref|NP_001101221.1| exostosin-2 [Rattus norvegicus]
 gi|149022701|gb|EDL79595.1| exostoses (multiple) 2 (predicted) [Rattus norvegicus]
          Length = 670

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 47/82 (57%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC  ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+V+  
Sbjct: 308 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVIPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 368 EKMSDVYSILQNIPQRQIEEMQ 389


>gi|395815565|ref|XP_003781296.1| PREDICTED: exostosin-2 [Otolemur garnettii]
          Length = 718

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 48/82 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC+ ++G  +  A +++ L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 308 YPQVLQDATFCVVLRGARLGQAVLSEVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQ 389


>gi|322511376|gb|ADX06684.1| putative exostosin-like protein [Organic Lake phycodnavirus]
          Length = 290

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 46/144 (31%), Positives = 70/144 (48%), Gaps = 15/144 (10%)

Query: 156 RQEDPPKLGSSKRNKLAFFAGAVNSP-----VREKLLQVW-RNDSEIYAHSGRLKTPYAD 209
           R E  PK   S+++ L  F G + S      VR+K+  V  RN +    +SG  K    +
Sbjct: 119 RLERMPKQSFSQKSILCSFVGNITSNNVVPNVRKKMFDVLSRNANFTMINSGGWKPVVTE 178

Query: 210 EL--------LGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD-LPFADVLNWKSF 260
            L        L SKF L  +G+   + R  + L  G +PV I N  + LPF DV+N+K  
Sbjct: 179 TLQDIFIKTTLKSKFALAPRGYGRGSFRFFECLQLGTIPVYIWNDVNWLPFQDVINYKQL 238

Query: 261 SIVVATLDIPLLKKILKGISSEEY 284
            +V+   +I  L+++L  IS  +Y
Sbjct: 239 CVVIHADEIDTLEQVLLNISETQY 262


>gi|308491052|ref|XP_003107717.1| CRE-RIB-1 protein [Caenorhabditis remanei]
 gi|308249664|gb|EFO93616.1| CRE-RIB-1 protein [Caenorhabditis remanei]
          Length = 347

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 67/273 (24%), Positives = 111/273 (40%), Gaps = 42/273 (15%)

Query: 41  KAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRT 100
           K F +    T DP++A LF L         DR + +      ++ YI ++ Q    WN  
Sbjct: 39  KHFQEREHYTNDPNEACLFLLGIDTT----DRDVRSLNYVKNVNEYIESLDQ--AVWN-- 90

Query: 101 GGADHF-YVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLP---QIWPR 156
            G +H  Y   H             K     ++  +S   +      D+SLP   +  P 
Sbjct: 91  NGRNHLIYNFYHGTFPDYDDHNLNFKTGE-AIIARASSSENNFFKDFDISLPLFHENHPY 149

Query: 157 QEDPPKLGSS----KRNKLAFFAG-----AVNSPVREKLL-------------------- 187
           Q +  +L +     KR  LA F G      + S  R  +                     
Sbjct: 150 QIESQRLHNEPKEEKRRYLASFKGKRYVYGIGSGTRNLVHHLHNGDDIVMVTTCKHNNDW 209

Query: 188 QVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHY 247
           QV+++D     +    +  Y D L  S FCL  +G  + + R  ++L  GC+PV+I++ +
Sbjct: 210 QVYQDDRCQRDNEEYDRWEYDDLLSNSTFCLVPRGRRLGSFRFLETLRSGCIPVVISDSW 269

Query: 248 DLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
            LPF++ ++W S +IVVA  D   + ++L   S
Sbjct: 270 ILPFSETIDWHSAAIVVAERDALSIPELLMSTS 302


>gi|410905063|ref|XP_003966011.1| PREDICTED: exostosin-1b-like [Takifugu rubripes]
          Length = 743

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 71/322 (22%), Positives = 134/322 (41%), Gaps = 51/322 (15%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+V+VYP ++                G   SESY     +   S F T DP +A LF L
Sbjct: 110 GFKVFVYPQQK----------------GEKMSESYQNILSSIEGSRFYTSDPEQACLFVL 153

Query: 62  PFSI-------ARMRHDRRIGTEGIP---DFISHYIFNI-SQKYPYWNRTGGADHFY--V 108
                       +  H+ +   + +P   +  +H IFN+ S  +P +    G D  Y  +
Sbjct: 154 SLDTLDRDQLSPQYVHNLKGRIQSLPLWNEGKNHIIFNLYSGTWPNYTEDLGFDIGYAML 213

Query: 109 ACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ------IWPRQEDPPK 162
           A  SI     +  ++V +         +    G++ H   S+P       ++  +     
Sbjct: 214 AKASISTENFRPNFDVSIPLFSKDHPRTGGERGYLRHN--SIPPFRKYMLVFKGKRYLTG 271

Query: 163 LGSSKRNKLAFFAGAVNSPVRE--KLLQVWRNDSEIYAHSGRL---KTPYADELLGSKFC 217
           +GS  RN L     + +  +    K  + W+   +       +   K  Y + L  S FC
Sbjct: 272 IGSDTRNALYHVHNSEDVVLLTTCKHGKDWQKHKDARCDRDNMEYDKYDYREMLYNSTFC 331

Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILK 277
           L  +G  + + R  ++L   CVPV+++N ++LPF+++++W + +++         +++L 
Sbjct: 332 LVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEIIDWNTAAVIGD-------ERLLL 384

Query: 278 GISSEEYLLLQSYVLKVRKHFQ 299
            I S    + Q  +L +R+  Q
Sbjct: 385 QIPSTVRSIHQDQILSLRQQTQ 406


>gi|297807715|ref|XP_002871741.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297317578|gb|EFH48000.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 510

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 67/267 (25%), Positives = 119/267 (44%), Gaps = 33/267 (12%)

Query: 95  PYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVV----CSSSYFISGHIA-HKDVS 149
           P W R+ G DH +   H     +++K  +   NAI ++     + +++  G ++  KD+ 
Sbjct: 227 PAWKRSEGRDHIFPIHHPWSFKSVRKFVK---NAIWLLPDMDSTGNWYKPGQVSLEKDLI 283

Query: 150 LPQIWPRQE--DPPKLGSSK--RNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSG 201
           LP + P  +  D   L  S   R  L FF G +       +R KL         +    G
Sbjct: 284 LPYV-PNVDICDAKCLSESAPMRTTLLFFRGRLKRNAGGKIRAKLGAELSGVKGVIISEG 342

Query: 202 RL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
                 K      +  S FCL   G   ++AR+ D++  GC+PVI+++  + PF  +L++
Sbjct: 343 TAGEGGKLAAQGGMRRSLFCLCPAGDTPSSARLFDAIVSGCIPVIVSDELEFPFEGILDY 402

Query: 258 KSFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWH-----VFPSDYD- 308
           K  +++V++ D+     L   L+ ++  +   LQ  + +  +HF +      + P D   
Sbjct: 403 KKVAVLVSSNDVVQPGWLVNHLRSLTPFQIKELQKNLAQYSRHFLYSSPAQPLGPEDLTW 462

Query: 309 ---AFYMVMYELWLRRSSVRVQWSTSL 332
              A  +V  +L  RRS   V+ S SL
Sbjct: 463 RMMAGKLVNIKLHTRRSQRVVKGSRSL 489


>gi|401886285|gb|EJT50333.1| hypothetical protein A1Q1_00388 [Trichosporon asahii var. asahii
           CBS 2479]
          Length = 1041

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 41/148 (27%), Positives = 74/148 (50%), Gaps = 11/148 (7%)

Query: 166 SKRNKLAFFAGAVNSPVREKLLQVW-------RNDSEIYAHSGRLKTPYADELLGSKFCL 218
           ++R  LA F G+ N       L+          +   ++AH    KT Y D L  ++FC 
Sbjct: 845 AERQTLATFKGSPNGQGTSLRLKTTCPRLLPPGSLEPVWAHIPEGKT-YLDLLGDTRFCP 903

Query: 219 HVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKG 278
              G    T R++D +Y GC+PV++ +   + +  + +W  FS+ V   ++  L++IL G
Sbjct: 904 IPFGTAGWTYRLSDVVYAGCIPVLVGDQSHMTYWSMFDWSLFSVQVFEHELDHLERILSG 963

Query: 279 ISSEEYLLLQSYVLKVRKHFQWHVFPSD 306
           I+ E+    Q  ++ VR+ F   ++PS+
Sbjct: 964 ITEEDAQRKQDALMLVREAF---LYPSE 988


>gi|68052297|sp|Q5IGR6.1|EXT1C_DANRE RecName: Full=Exostosin-1c; AltName:
           Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
           acetylglucosaminyl-proteoglycan
           4-alpha-N-acetylglucosaminyltransferase 1c; AltName:
           Full=Multiple exostoses protein 1 homolog c
 gi|56785795|gb|AAW29035.1| EXT1c [Danio rerio]
          Length = 737

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 81/357 (22%), Positives = 134/357 (37%), Gaps = 83/357 (23%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFL 61
            FRVY+YP  +                G   SE Y K   +  +S + T DP +A LF L
Sbjct: 105 GFRVYIYPPEK----------------GERVSEGYRKILTSVSESRYYTSDPREACLFVL 148

Query: 62  PFSIARMRHDRRIGTEGIPDFISHYIFNISQK---YPYWNRTGGADHFYVACHSIGRSAM 118
              I  +  D+             ++ N+ ++   YP WN   G +H     +S      
Sbjct: 149 --GIDTLDRDQ---------LSQQFVPNVDERIRGYPLWN--DGRNHVIFNLYSGTWPNY 195

Query: 119 QKAWEVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSS----------K 167
            +  ++  N  Q + + +   + H     D+S+P     +E P K G            +
Sbjct: 196 TE--DLGFNVGQAILAKASLNTEHFRPGFDISIPLF--SKEHPQKGGKRGWLVRNSVPPR 251

Query: 168 RNKLAFFAG-----AVNSPVREKL-----------LQVWRNDSEIYAHSGRL-------- 203
           R  L  F G      + S  R  L           L   R+  +   H            
Sbjct: 252 RKYLLMFKGKRYLTGIGSDTRNALHHIHNGKDIVSLTTCRHGKDWEKHKDARCDHDNQEY 311

Query: 204 -KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
            +  Y + L  S FCL  +G  + + R  +SL   C+PV+++N ++LPF+DV+ W     
Sbjct: 312 ERFDYQELLHNSTFCLVPRGRRLGSFRFLESLQAACIPVLLSNGWELPFSDVIQWNQ--- 368

Query: 263 VVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQ--WHVFPSDYDAFYMVMYEL 317
            V   D    +++L  + S    +    VL +R+  Q  W  + S  D   +   E+
Sbjct: 369 AVVEGD----ERLLLQVPSTVRAVGIDRVLALRQQTQTLWDAYFSSVDKIVLTTLEI 421


>gi|55742053|ref|NP_001006730.1| exostosin 1 [Xenopus (Silurana) tropicalis]
 gi|49523045|gb|AAH75481.1| exostoses (multiple) 1 [Xenopus (Silurana) tropicalis]
          Length = 738

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 75/363 (20%), Positives = 134/363 (36%), Gaps = 95/363 (26%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VY+YP ++ D                  SESY     A   S   T DPS+A LF L
Sbjct: 105 GFKVYIYPQQKGDKL----------------SESYQHILAAIESSRLYTSDPSQACLFVL 148

Query: 62  PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
                       +  H+ +   + +    +  +H IFN+ S  +P +    G D      
Sbjct: 149 SLDTLDRDQLSPQYVHNLKSKVQNLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 202

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
             IG++ + KA              S        + DVS+P      +D P+ G  K   
Sbjct: 203 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGEKGFL 243

Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
                   R  +  F G      + S  R  L  V   +  +   + +            
Sbjct: 244 RFNNIPPMRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDAR 303

Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
                    K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V
Sbjct: 304 CDKDNAEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 363

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
           ++W   +++     +  +   ++ I  ++ L      L+ +  F W  + S  +   +  
Sbjct: 364 IDWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 418

Query: 315 YEL 317
            E+
Sbjct: 419 LEI 421


>gi|405975819|gb|EKC40363.1| Exostosin-1b [Crassostrea gigas]
          Length = 714

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 84/358 (23%), Positives = 132/358 (36%), Gaps = 89/358 (24%)

Query: 3   RSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFF 60
           R F+VYVYP         VL  V         SESY K      +S + T +P +A +F 
Sbjct: 85  RGFKVYVYP---------VLERV---------SESYSKIISTIQESKYYTTNPDEACIFI 126

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
           L  SI  +  DR I +      I   I ++      WN     D+  V     G + M +
Sbjct: 127 L--SIDTL--DRDINSVNSAKNIMPKIAHLGA---LWNNGTWPDYSEVLDFDTGEAIMAR 179

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK------------R 168
           A            S S F        D+SLP +      PPK G               R
Sbjct: 180 A----------SISESRFRPNF----DISLPLVGSTH--PPKGGERGYMYTSINNIPPLR 223

Query: 169 NKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGRL-------------------- 203
           + L  F G      V S  R  L  +   D  +   + R                     
Sbjct: 224 HYLLGFKGKRYLTGVGSETRNSLYHMHNGDDIVLLTTCRHGKFWQKKAKELNDTRCDIDN 283

Query: 204 ----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS 259
               +  Y   L  + FCL  +G  + + R  ++L  GC+PV+++N ++LPF +V++WK 
Sbjct: 284 REFDRYDYKKLLYNATFCLVPRGRRLGSFRFLETLQAGCIPVLLSNGWELPFGEVIDWKK 343

Query: 260 FSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
            ++      +  +  I+ G+S  E      + ++ +  F W  + S  D     + E+
Sbjct: 344 AAVWADERLLFQVPSIVHGLSQPEI-----FAMRQQTQFLWEAYFSSMDKIINTVLEI 396


>gi|341881773|gb|EGT37708.1| CBN-RIB-1 protein [Caenorhabditis brenneri]
          Length = 383

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 68/297 (22%), Positives = 115/297 (38%), Gaps = 61/297 (20%)

Query: 27  FEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHY 86
           FE     A  S   K F +S   T +P++A +F L         DR + ++     ++ Y
Sbjct: 59  FEVAPQSAIYSKILKYFQESEHYTNNPNEACIFLLGIDTT----DRDVRSQNYVKNVNEY 114

Query: 87  IFNISQKYPYWNRTGGADH----FYVAC------HSIGRSAMQKAWEVKLNAIQVVCSSS 136
           I ++      WN   G +H    FY         H++G    +          +   S +
Sbjct: 115 IDSLDP--AVWN--NGRNHLIFNFYHGTFPDYDDHNLGFDTGEAM------IARASSSEN 164

Query: 137 YFISGHIAHKDVSLP--------QIWPRQEDPPKLGSSKRNKLAFFAG-----AVNSPVR 183
            F  G     DVSLP         + P+ +   +   S+R  L  F G      + S  R
Sbjct: 165 NFFKGF----DVSLPLFHENHPDDVEPKVKIDDQRNESRRKYLVSFKGKRYVYGIGSGTR 220

Query: 184 EKLLQVWRNDSEIYAHSGRLKTP--------------------YADELLGSKFCLHVKGF 223
             +  +   D  +   + +                        Y D L  S FCL  +G 
Sbjct: 221 NLVHHLHNGDDMVMVTTCKHNNDWQVYQDDRCQRDNDEYDQWDYEDLLTNSTFCLVPRGR 280

Query: 224 EVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
            + + R  ++L  GC+PV+I++ + LPF + ++W S +IVVA  D   + ++L  +S
Sbjct: 281 RLGSFRFLETLRSGCIPVVISDSWILPFTETIDWSSAAIVVAERDALSIPELLMSMS 337


>gi|326932982|ref|XP_003212589.1| PREDICTED: exostosin-1c-like [Meleagris gallopavo]
          Length = 734

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 81/372 (21%), Positives = 141/372 (37%), Gaps = 84/372 (22%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFL 61
            F+V+ YP    +P                 SESY K   +   S + T  P +A LF L
Sbjct: 103 GFKVFTYPREHGEPM----------------SESYSKILASIESSRYYTPHPEEACLFVL 146

Query: 62  PFSIARMRHDRRIGTEGIPDFISHYIFNISQK---YPYWNRTGGADHFYVACHSIGRSAM 118
             SI  +  D            +HYI ++ Q+   +P WN   G +H     ++      
Sbjct: 147 --SIDTLDRD---------PLSAHYIRSVEQRIRSFPLWN--SGRNHLIFNLYAGTWPNY 193

Query: 119 QKAWEVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSS----------K 167
            +  E+  +    + + + F S +     DVS+P ++PR E P + G S          K
Sbjct: 194 TE--ELGFDIGHAMLAKASFHSKNFRPGFDVSIP-LFPR-EHPQRGGQSGWLHHNSVPPK 249

Query: 168 RNKLAFFAG-----AVNSPVREKLLQV-----------------WRNDSEIYAHSGRL-- 203
           +  L  F G      + S  R  L  +                 W    +       +  
Sbjct: 250 KKYLLVFKGKRYLTGIGSGTRNALHHIHNGRDIVSLTTCRHGKDWERHKDTRCDQDNVDY 309

Query: 204 -KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
            K  Y + L  S FC+  +G  + + R  ++L   C+PV++++ ++LPF++ ++W   ++
Sbjct: 310 EKFDYQELLHNSTFCIVPRGRRLGSFRFLEALQAACIPVLLSDGWELPFSEAIDWGKAAV 369

Query: 263 VVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYE-----L 317
           V +   +  +   ++ I  E  L  Q      +  F W  + S  D       E     L
Sbjct: 370 VGSERLLLQIPSAVRCIRPERVLAFQQ-----QTQFLWDAYFSSVDKIVHTTLEIIRDRL 424

Query: 318 WLRRSSVRVQWS 329
              RS  R+ W+
Sbjct: 425 LPHRSRSRILWN 436


>gi|15237870|ref|NP_197191.1| Exostosin family protein [Arabidopsis thaliana]
 gi|9755690|emb|CAC01702.1| putative protein [Arabidopsis thaliana]
 gi|15810401|gb|AAL07088.1| unknown protein [Arabidopsis thaliana]
 gi|23296585|gb|AAN13125.1| unknown protein [Arabidopsis thaliana]
 gi|332004972|gb|AED92355.1| Exostosin family protein [Arabidopsis thaliana]
          Length = 511

 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 67/267 (25%), Positives = 120/267 (44%), Gaps = 33/267 (12%)

Query: 95  PYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVV----CSSSYFISGHIA-HKDVS 149
           P W R+ G DH +   H     +++K  +   NAI ++     + +++  G ++  KD+ 
Sbjct: 228 PAWKRSEGRDHIFPIHHPWSFKSVRKFVK---NAIWLLPDMDSTGNWYKPGQVSLEKDLI 284

Query: 150 LPQIWPRQE--DPPKLGSSK--RNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSG 201
           LP + P  +  D   L  S   R  L FF G +       +R KL        +I    G
Sbjct: 285 LPYV-PNVDICDTKCLSESAPMRTTLLFFRGRLKRNAGGKIRAKLGAELSGIKDIIISEG 343

Query: 202 RL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
                 K      +  S FCL   G   ++AR+ D++  GC+PVI+++  + PF  +L++
Sbjct: 344 TAGEGGKLAAQRGMRRSLFCLCPAGDTPSSARLFDAIVSGCIPVIVSDELEFPFEGILDY 403

Query: 258 KSFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWH-----VFPSDYD- 308
           K  +++V++ D      L   L+ ++  +   LQ+ + +  +HF +      + P D   
Sbjct: 404 KKVAVLVSSSDAIQPGWLVNHLRSLTPFQVKGLQNNLAQYSRHFLYSSPAQPLGPEDLTW 463

Query: 309 ---AFYMVMYELWLRRSSVRVQWSTSL 332
              A  +V  +L  RRS   V+ S S+
Sbjct: 464 RMIAGKLVNIKLHTRRSQRVVKGSRSI 490


>gi|256079045|ref|XP_002575801.1| exostosin-2 [Schistosoma mansoni]
 gi|353230856|emb|CCD77273.1| putative exostosin-2 [Schistosoma mansoni]
          Length = 1001

 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 51/92 (55%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y+  L  S+FCL +    +N   + DSL  GC+PVI+ +   LPF+++L+W   +I +  
Sbjct: 607 YSVSLCSSEFCLIIDADNLNRFNLYDSLACGCIPVIVNDDIVLPFSEILDWYKIAIRIPQ 666

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
           +    +  IL   SS+E  LL+  ++ + + +
Sbjct: 667 VKFQKIPSILSTYSSKEKFLLRKQIMFIYQRY 698


>gi|393907069|gb|EFO19790.2| exostosin-1 [Loa loa]
          Length = 677

 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 68/319 (21%), Positives = 127/319 (39%), Gaps = 60/319 (18%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPF 63
           +FRVYVYP   +   + V               +   K   +S + T DP KA LF L  
Sbjct: 80  AFRVYVYPSDNSSAMSIVY--------------NNILKVIRESMYYTDDPQKACLFVL-- 123

Query: 64  SIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYV----------ACHSI 113
            I  +  DRR  +E    +++  I N+  +   WNR  G +H             + H +
Sbjct: 124 GIDTVDRDRR--SENYVKYVNELIGNLPTEI--WNR--GRNHVIFNLYHGTYPDYSDHDL 177

Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKD----VSLPQIWPRQEDPPKLGSSKRN 169
           G      A   + +A   V   ++ +S  + HK+     ++   W  +     L S K  
Sbjct: 178 GFDT-GYALIARASANTRVFRENFDLSFPLFHKEHPLRTTIKAEWSLKIKDKYLVSFKGK 236

Query: 170 KLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTP--------------------YAD 209
           +  +    + S  R+ L  +    S I   + +  T                     Y  
Sbjct: 237 RYVY---GIGSETRDSLYHLHNGQSVIMVTTCKHNTDWKKYEDGRCEEDNVEYDHWDYEM 293

Query: 210 ELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI 269
            +  S FCL  +G  + + R  ++L  GC+PV++++ ++LPF++V++W+   ++     +
Sbjct: 294 TMANSTFCLTPRGRRLGSFRFLEALRLGCIPVVLSDDWELPFSEVIDWRQAVVIGHEDTV 353

Query: 270 PLLKKILKGISSEEYLLLQ 288
             +  +L  I  +  L ++
Sbjct: 354 LTISDVLSAIPFDRILFMK 372


>gi|256079047|ref|XP_002575802.1| exostosin-2 [Schistosoma mansoni]
 gi|353230857|emb|CCD77274.1| putative exostosin-2 [Schistosoma mansoni]
          Length = 1022

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 51/92 (55%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y+  L  S+FCL +    +N   + DSL  GC+PVI+ +   LPF+++L+W   +I +  
Sbjct: 628 YSVSLCSSEFCLIIDADNLNRFNLYDSLACGCIPVIVNDDIVLPFSEILDWYKIAIRIPQ 687

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
           +    +  IL   SS+E  LL+  ++ + + +
Sbjct: 688 VKFQKIPSILSTYSSKEKFLLRKQIMFIYQRY 719


>gi|312084451|ref|XP_003144281.1| exostosin-1 [Loa loa]
          Length = 694

 Score = 58.2 bits (139), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 68/319 (21%), Positives = 127/319 (39%), Gaps = 60/319 (18%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPF 63
           +FRVYVYP   +   + V               +   K   +S + T DP KA LF L  
Sbjct: 80  AFRVYVYPSDNSSAMSIVY--------------NNILKVIRESMYYTDDPQKACLFVL-- 123

Query: 64  SIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYV----------ACHSI 113
            I  +  DRR  +E    +++  I N+  +   WNR  G +H             + H +
Sbjct: 124 GIDTVDRDRR--SENYVKYVNELIGNLPTEI--WNR--GRNHVIFNLYHGTYPDYSDHDL 177

Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKD----VSLPQIWPRQEDPPKLGSSKRN 169
           G      A   + +A   V   ++ +S  + HK+     ++   W  +     L S K  
Sbjct: 178 GFDT-GYALIARASANTRVFRENFDLSFPLFHKEHPLRTTIKAEWSLKIKDKYLVSFKGK 236

Query: 170 KLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTP--------------------YAD 209
           +  +    + S  R+ L  +    S I   + +  T                     Y  
Sbjct: 237 RYVY---GIGSETRDSLYHLHNGQSVIMVTTCKHNTDWKKYEDGRCEEDNVEYDHWDYEM 293

Query: 210 ELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI 269
            +  S FCL  +G  + + R  ++L  GC+PV++++ ++LPF++V++W+   ++     +
Sbjct: 294 TMANSTFCLTPRGRRLGSFRFLEALRLGCIPVVLSDDWELPFSEVIDWRQAVVIGHEDTV 353

Query: 270 PLLKKILKGISSEEYLLLQ 288
             +  +L  I  +  L ++
Sbjct: 354 LTISDVLSAIPFDRILFMK 372


>gi|441647993|ref|XP_004090847.1| PREDICTED: LOW QUALITY PROTEIN: exostosin-1 [Nomascus leucogenys]
          Length = 746

 Score = 58.2 bits (139), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 76/363 (20%), Positives = 134/363 (36%), Gaps = 95/363 (26%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++ +  A                ESY     A   S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFXL 156

Query: 62  PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
                       +  H+ R   + +    +  +H IFN+ S  +P +    G D      
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
             IG++ + KA              S        + DVS+P      +D P+ G  +   
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251

Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
                   R  +  F G      + S  R  L  V   +  +   + +            
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 311

Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
                    K  Y + L  + FCL  +G  + +    ++L   CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFIFLEALQAACVPVMLSNGWELPFSEV 371

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
           +NW   +++     +  +   ++ I  ++ L      L+ +  F W  + S  +   +  
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426

Query: 315 YEL 317
            E+
Sbjct: 427 LEI 429


>gi|17541994|ref|NP_502180.1| Protein RIB-1 [Caenorhabditis elegans]
 gi|6434267|emb|CAB61014.1| Protein RIB-1 [Caenorhabditis elegans]
 gi|130381603|dbj|BAF48989.1| heparan sulfate polymerase [Caenorhabditis elegans]
          Length = 382

 Score = 57.8 bits (138), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 76/325 (23%), Positives = 126/325 (38%), Gaps = 81/325 (24%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPF 63
           S +VY++P  +            FE        S   K F++S+  T DP++A +F L  
Sbjct: 44  SKKVYIHPMEKR-----------FEESPQSVIYSKILKHFLESNHYTNDPNEACIFLLGI 92

Query: 64  SIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADH----FYVAC------HSI 113
                  DR + ++     ++ YI ++      WN   G +H    FY         H++
Sbjct: 93  DTT----DRDVRSQNYVKNVNDYIESLDP--SVWN--NGRNHLIFNFYHGTFPDYDDHNL 144

Query: 114 ----GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRN 169
               G + + +A   + N I+V               DVSLP          K   S+RN
Sbjct: 145 NFDTGEAMIARASSSENNFIKVF--------------DVSLPLFHENHPYEIKESKSERN 190

Query: 170 K---------LAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGRLKTPYA------- 208
                     L  F G      + S  R  +  +   D  +   + +    +        
Sbjct: 191 DDRIENQRKYLVSFKGKRYVYGIGSGTRNLVHHLHNGDDIVMVTTCKHNNDWQVYQDDRC 250

Query: 209 ------------DELLG-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
                       DELL  S FCL  +G  + + R  ++L  GCVPV+I++ + LPF++ +
Sbjct: 251 QRDNDEYDRWEYDELLANSTFCLVPRGRRLGSFRFLETLRSGCVPVVISDSWILPFSETI 310

Query: 256 NWKSFSIVVATLDIPLLKKILKGIS 280
           +W S +IVVA  D   + ++L   S
Sbjct: 311 DWNSAAIVVAERDALSIPELLMSTS 335


>gi|159483641|ref|XP_001699869.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158281811|gb|EDP07565.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 427

 Score = 57.8 bits (138), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 68/304 (22%), Positives = 111/304 (36%), Gaps = 54/304 (17%)

Query: 39  FKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWN 98
           F +  +     T +P +AD F++P S   ++             +   +  IS  +P+WN
Sbjct: 125 FMERILSGGHRTHNPEEADFFYIPGSSRDLKKAF---------LLQPLLAYISTTWPFWN 175

Query: 99  RTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYF---------------ISGHI 143
            TGGA H   A   +G   +    +V+L    V     +                I   +
Sbjct: 176 ATGGARHIMPAEGDVGTCEL--PLKVRLFTANVTWLQFWGMYDFHPHWTQIFHNRIPCMV 233

Query: 144 AHKDVSLPQIWPRQED------PPKLGSSKRNKLA--FFAGAVNSPVREKLL-------- 187
             +D+ +P +     D      P    + KRN+    FFAG V     ++ L        
Sbjct: 234 PGRDIVVPFMAMSSHDRFVIETPLHPRNQKRNRTNTFFFAGGVCGSGNKRALPPHCTYYK 293

Query: 188 QVWRNDS---EIYAH----SGRLKTP----YADELLGSKFCLHVKGFEVNTARIADSLYY 236
           QV  +      +Y H    +G    P    YA +   S+FCL   G       I  ++Y 
Sbjct: 294 QVRYSGGVRQAVYLHFHNRTGWRVVPGTDDYARDYASSRFCLAAAGGGWGKRGIVAAMY- 352

Query: 237 GCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRK 296
           GC+PV   +     F   ++W  F + +   +IP L   L+  S  E   +Q       +
Sbjct: 353 GCIPVAATDMLYEAFEPEMDWGRFGVRITQAEIPQLADKLEAYSEAEVARMQERTACAAQ 412

Query: 297 HFQW 300
           H  W
Sbjct: 413 HLHW 416


>gi|343171904|gb|AEL98656.1| exostosin-like protein, partial [Silene latifolia]
          Length = 250

 Score = 57.8 bits (138), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 42/160 (26%), Positives = 76/160 (47%), Gaps = 17/160 (10%)

Query: 168 RNKLAFFAGAV----NSPVREKLLQVWRNDSEIY-------AHSGRLKTPYADELLGSKF 216
           R  L FF G         VR +L ++  +  +I+         S +L T     +  SKF
Sbjct: 67  RTTLLFFRGRTVRKDEGVVRARLAKILVDYEDIHYERSVASEESIKLST---QGMRSSKF 123

Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT---LDIPLLK 273
           CL   G   ++ R+ D++   CVPVI+++  +LPF D L++ +FSI  +T   L    + 
Sbjct: 124 CLDPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPFEDELDYSNFSIFFSTEEALKPGYMV 183

Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
           + L+ +  E++L +   +  +  H+++   P   D   M+
Sbjct: 184 EELRKVPEEKWLNMYKKLKNISHHYEFQYPPKREDGVNML 223


>gi|308491504|ref|XP_003107943.1| hypothetical protein CRE_12808 [Caenorhabditis remanei]
 gi|308249890|gb|EFO93842.1| hypothetical protein CRE_12808 [Caenorhabditis remanei]
          Length = 847

 Score = 57.8 bits (138), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 76/312 (24%), Positives = 124/312 (39%), Gaps = 56/312 (17%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPF 63
           S +VY++P  +            FE        S   K F +S   T DP++A LF L  
Sbjct: 13  SRKVYIHPMEKR-----------FEESPQSVIYSKILKHFQESEHYTNDPNEACLFLLGI 61

Query: 64  SIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHF-YVACHSIGRSAMQKAW 122
                  DR I +      ++ YI ++ Q    WN   G +H  Y   H  G        
Sbjct: 62  DTT----DRDIRSLNYVKNVNEYIESLDQAV--WN--NGRNHLIYNFYH--GTFPDYDDH 111

Query: 123 EVKLNAIQ-VVCSSSYFISGHIAHKDVSLP--------QIWPRQEDPPKLGSSKRNKLAF 173
            +  N  + ++  +S   +      D+SLP        QI  ++    +    KR  LA 
Sbjct: 112 NLNFNTGEAIIARASSSENNFFKDFDISLPLFHENHPYQIESQRALHNEPKEEKRRYLAS 171

Query: 174 FAG-----AVNSPVREKLL--------------------QVWRNDSEIYAHSGRLKTPYA 208
           F G      + S  R  +                     QV+++D     +    +  Y 
Sbjct: 172 FKGKRYVYGIGSGTRNLVHHLHNGDDIVMVTTCKHNNDWQVYQDDRCQRDNEEYDRWEYD 231

Query: 209 DELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
           D L  S FCL  +G  + + R  ++L  GC+PV+I++ + LPF++ ++W S +IVVA  D
Sbjct: 232 DLLSNSTFCLVPRGRRLGSFRFLETLRSGCIPVVISDSWILPFSETIDWHSAAIVVAERD 291

Query: 269 IPLLKKILKGIS 280
              + ++L   S
Sbjct: 292 ALSIPELLMSTS 303


>gi|3023724|sp|O01704.1|EXT1_CAEEL RecName: Full=Multiple exostoses homolog 1; AltName: Full=Related
           to mammalian RIB protein 1
 gi|2058697|gb|AAC47509.1| multiple exostoses homolog 1 [Caenorhabditis elegans]
          Length = 378

 Score = 57.8 bits (138), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 76/325 (23%), Positives = 126/325 (38%), Gaps = 81/325 (24%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPF 63
           S +VY++P  +            FE        S   K F++S+  T DP++A +F L  
Sbjct: 40  SKKVYIHPMEKR-----------FEESPQSVIYSKILKHFLESNHYTNDPNEACIFLLGI 88

Query: 64  SIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADH----FYVAC------HSI 113
                  DR + ++     ++ YI ++      WN   G +H    FY         H++
Sbjct: 89  DTT----DRDVRSQNYVKNVNDYIESLDP--SVWN--NGRNHLIFNFYHGTFPDYDDHNL 140

Query: 114 ----GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRN 169
               G + + +A   + N I+V               DVSLP          K   S+RN
Sbjct: 141 NFDTGEAMIARASSSENNFIKVF--------------DVSLPLFHENHPYEIKESKSERN 186

Query: 170 K---------LAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGRLKTPYA------- 208
                     L  F G      + S  R  +  +   D  +   + +    +        
Sbjct: 187 DDRIENQRKYLVSFKGKRYVYGIGSGTRNLVHHLHNGDDIVMVTTCKHNNDWQVYQDDRC 246

Query: 209 ------------DELLG-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
                       DELL  S FCL  +G  + + R  ++L  GCVPV+I++ + LPF++ +
Sbjct: 247 QRDNDEYDRWEYDELLANSTFCLVPRGRRLGSFRFLETLRSGCVPVVISDSWILPFSETI 306

Query: 256 NWKSFSIVVATLDIPLLKKILKGIS 280
           +W S +IVVA  D   + ++L   S
Sbjct: 307 DWNSAAIVVAERDALSIPELLMSTS 331


>gi|401888173|gb|EJT52138.1| hypothetical protein A1Q1_06676 [Trichosporon asahii var. asahii CBS
            2479]
          Length = 1189

 Score = 57.8 bits (138), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 26/103 (25%), Positives = 56/103 (54%), Gaps = 3/103 (2%)

Query: 207  YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
            Y +EL  ++FC   +G    + R+ D+++ GC+PV+ +     PFA +++W   S+ V  
Sbjct: 1063 YLEELNTARFCPQPRGIAGWSPRVNDAIFAGCIPVLTSEDTHYPFAGLIDWSQISVRVHP 1122

Query: 267  LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDA 309
             ++  ++++L  I       +Q+ ++ +R  F   ++P+D  A
Sbjct: 1123 TELDHVEELLASIPLARLEQIQANIVAIRDAF---MYPADGKA 1162


>gi|198437606|ref|XP_002128024.1| PREDICTED: similar to Exostosin-1
           (Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
           acetylglucosaminyl-proteoglycan
           4-alpha-N-acetylglucosaminyltransferase) (Putative tumor
           suppressor protein EXT1) (Multiple exostoses protein 1)
           [Ciona intestinalis]
          Length = 766

 Score = 57.4 bits (137), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 77/356 (21%), Positives = 129/356 (36%), Gaps = 82/356 (23%)

Query: 4   SFRVYVYP---HRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
            F+VYVYP    + +  F N+L                   +   S + T DP KA LF 
Sbjct: 126 GFKVYVYPDVGEKTSTNFQNIL------------------ASLRASQYYTSDPEKACLFV 167

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQK---YPYWNRTGGADHFYVACHSIGRSA 117
             +      H             + YI N+  K     YWN   G +H     +S   + 
Sbjct: 168 PAYDTLDRDH-----------LSADYIHNLGAKISRLKYWN--NGKNHIIFNLYS--GTW 212

Query: 118 MQKAWEVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSS---------- 166
            +   +V  N  + + + + F   +  H  D+S P I   +  P   G+           
Sbjct: 213 PEYLEDVGFNLGEAILAKASFGDNYYRHGFDISFPLI--GKTHPHMQGTQGFLKANYFPP 270

Query: 167 KRNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHS---GRLKTPYADE-------- 210
           +R  L  F G      + S  R  L  +   D  I   +   G+    ++D+        
Sbjct: 271 RRKYLLSFKGKRYTYGIGSSTRNALYHIHNGDDIIILTTCKHGKNWQSFSDQKCETDNEE 330

Query: 211 ---------LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFS 261
                    L  S FC+  +G  + + R  +SL   C+PV++AN + LPF +V++W   S
Sbjct: 331 YEKWDYQSLLHNSTFCMVPRGRRLGSFRFLESLQAACIPVVLANGWKLPFDEVIDWSKAS 390

Query: 262 IVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
           +      +  +  IL+ +     +LL+      +  F W  + S  D       E+
Sbjct: 391 LAWEERLLLQVPGILREVQDNRIMLLRQ-----QSQFLWDKYFSSMDVIIRSTLEI 441


>gi|326429784|gb|EGD75354.1| hypothetical protein PTSG_06430 [Salpingoeca sp. ATCC 50818]
          Length = 351

 Score = 57.4 bits (137), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 34/97 (35%), Positives = 57/97 (58%), Gaps = 7/97 (7%)

Query: 204 KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGC--VPVIIANHYDLPFADVLNWKSFS 261
           K  Y +  L +KF L V+GF  ++ R     Y+GC  +PVI+ +HY LP+ D+L+W++FS
Sbjct: 193 KYTYTELALETKFGLIVEGFGYHSLR-----YHGCGRLPVIVVDHYVLPYQDLLDWETFS 247

Query: 262 IVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
           + +    +  L +IL+ I  E   ++Q  V+ V + F
Sbjct: 248 MRIPEHRLLELPRILRSIPDEVVEMIQRRVVFVFEDF 284


>gi|302843170|ref|XP_002953127.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
 gi|300261514|gb|EFJ45726.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
          Length = 616

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 39/122 (31%), Positives = 58/122 (47%), Gaps = 10/122 (8%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y  ++  SKFCL   G      ++     YGC+PV I ++   PF   L+W +FS+ V  
Sbjct: 413 YVRDMSTSKFCLAPTGGGHGKRQVLVG-RYGCIPVPITDYVLQPFEPELDWPAFSVTVKE 471

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH------VFPSD--YDAFYMVMYELW 318
            D+P L  IL  I+  +   +Q  +    KH  W+      +F  D  YDAF  +M  L 
Sbjct: 472 EDVPNLHTILAAINDTKLAEMQRALACAAKHL-WYSSMWGAIFGEDSRYDAFATLMEILR 530

Query: 319 LR 320
           +R
Sbjct: 531 VR 532


>gi|402589373|gb|EJW83305.1| hypothetical protein WUBG_05784 [Wuchereria bancrofti]
          Length = 689

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 73/325 (22%), Positives = 128/325 (39%), Gaps = 72/325 (22%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPF 63
           +FRVYVYP   +   + V               S   K   +S + T DP KA LF L  
Sbjct: 81  AFRVYVYPSDNSSAMSVVY--------------SNILKVIRESIYYTDDPQKACLFVL-- 124

Query: 64  SIARMRHDRRIGT---------EGIPDFI-----SHYIFNISQ-KYP-YWNRTGGADHFY 107
            I  +  DRR            E +P  I     +H IFN+    YP Y +   G D  Y
Sbjct: 125 GIDTVDRDRRSENYVKYVNELIENLPSEIWNHGRNHIIFNLYHGTYPDYSDHDLGFDTGY 184

Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKD----VSLPQIWPRQEDPPKL 163
                I R++          A   +   ++ +S  + HK+     ++   W  +     L
Sbjct: 185 AV---IARAS----------ANTRIFRENFDLSFPLFHKEHPLRTTVKAKWSLKIKDKYL 231

Query: 164 GSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKT---PYADE---------- 210
            + K  +  +    + S  R+ L  +    S +   + +  T    Y DE          
Sbjct: 232 VTFKGKRYVY---GIGSETRDSLYHLHNGQSVVMVTTCKHNTDWKKYEDERCEEDNVEYD 288

Query: 211 -------LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIV 263
                  +  S FCL  +G  + + R  ++L  GC+PV++++ ++LPF++V++W+   I+
Sbjct: 289 HWDYEMIMSNSTFCLTPRGRRLGSFRFLEALRLGCIPVVLSDDWELPFSEVIDWRQAVII 348

Query: 264 VATLDIPLLKKILKGISSEEYLLLQ 288
                +  +  +L  I  +  L ++
Sbjct: 349 GHEDTVLTISDVLSAIPLDRILFMK 373


>gi|343171902|gb|AEL98655.1| exostosin-like protein, partial [Silene latifolia]
          Length = 250

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 41/160 (25%), Positives = 76/160 (47%), Gaps = 17/160 (10%)

Query: 168 RNKLAFFAGAV----NSPVREKLLQVWRNDSEIY-------AHSGRLKTPYADELLGSKF 216
           R  L FF G         VR +L ++  +  +++         S +L T     +  SKF
Sbjct: 67  RTTLLFFRGRTVRKDEGVVRARLAKILVDYEDVHYERSVASEESIKLST---QGMRSSKF 123

Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT---LDIPLLK 273
           CL   G   ++ R+ D++   CVPVI+++  +LPF D L++ +FSI  +T   L    + 
Sbjct: 124 CLDPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPFEDELDYSNFSIFFSTEEALKPGYMV 183

Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
           + L+ +  E++L +   +  +  H+++   P   D   M+
Sbjct: 184 EELRKVPEEKWLNMYKKLKNISHHYEFQYPPKREDGVNML 223


>gi|1619954|gb|AAB17006.1| multiple exostosis 2 protein [Mus musculus]
          Length = 718

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 46/82 (56%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC  ++   +  A ++D L  GCVPV+IA+ Y LPF++VL+WK  S+VV  
Sbjct: 308 YPQVLQEATFCTVLRRARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKKASVVVPE 367

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
             +  +  IL+ I   +   +Q
Sbjct: 368 EKMSDVYSILQNIPQRQIEEMQ 389


>gi|110637313|ref|YP_677520.1| hypothetical protein CHU_0899 [Cytophaga hutchinsonii ATCC 33406]
 gi|110279994|gb|ABG58180.1| conserved hypothetical protein [Cytophaga hutchinsonii ATCC 33406]
          Length = 330

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 47/147 (31%), Positives = 80/147 (54%), Gaps = 15/147 (10%)

Query: 167 KRNKLAF-FAGAVNSPVREKLLQVWRNDS------EI---YAHSGRLKTPYADELLGSKF 216
           +  KL F F+G+ + P R KL   +RN+S      EI   Y HS   K  Y +++L S F
Sbjct: 131 QNKKLLFTFSGSCSHPFRIKLFNAYRNESSEYKVAEIKRWYNHSDFEKETYLEDILSSYF 190

Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKIL 276
            L  +G    + RI +++  G VPVIIA+ + +PF+  +   ++ + +A  D+  +  IL
Sbjct: 191 VLCPRGIASYSHRIIETMALGSVPVIIADEW-VPFS--IEEDNYYVRIAESDVENIYAIL 247

Query: 277 KGISSEEYLLLQSYVLKV-RKHFQWHV 302
           K   + +Y  L++ V  V +K+F+ H+
Sbjct: 248 KAKQT-DYENLRNNVSDVYKKYFESHI 273


>gi|383847805|ref|XP_003699543.1| PREDICTED: exostosin-2-like [Megachile rotundata]
          Length = 707

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 70/303 (23%), Positives = 128/303 (42%), Gaps = 44/303 (14%)

Query: 7   VYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAF--MKSHFVTKDPSKADLFFLPFS 64
           VY+YP +       + +     P  +  S+ +++  F  + S F T +P +A   F+P S
Sbjct: 102 VYIYPPK-------IYVDALERPVTSQMSKEFYQILFTIISSKFYTPNPYEA-CIFIP-S 152

Query: 65  IARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGA--DHFYVACHSIGRSAMQKAW 122
           +  +  +R    E      S   +N  + +  +N   G+  D+  V   S+G++ +  A 
Sbjct: 153 LDTLNQNRLKLQEVSQALKSLPFWNNGENHVVFNMVSGSVPDYNTVIDVSVGKAMIAGAG 212

Query: 123 EVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGA-VNSP 181
                      SS  + SG     D+SLP   P   +     S  R  LA  +   +NS 
Sbjct: 213 ----------MSSLTYRSGF----DISLPVYSPLTNNFKPNISDTRLWLAISSQININSA 258

Query: 182 VREKLLQV----------------WRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEV 225
             + LL++                + + +     +G     Y D L  + FCL ++G  +
Sbjct: 259 FEQDLLEIKSLSPKDILILGPCLQYSSMNSTIRCAGIDVYKYPDVLQTATFCLVIRGARL 318

Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
             + + + +  G +PVIIA+   +PF DV++W   +I +  +DI  +  +LK +S +   
Sbjct: 319 GQSTLLECMATGSIPVIIADSLAMPFHDVIDWTRAAIFIREVDILSVISVLKKVSPKRIT 378

Query: 286 LLQ 288
            LQ
Sbjct: 379 ELQ 381


>gi|157110617|ref|XP_001651176.1| exostosin-2 [Aedes aegypti]
 gi|108878646|gb|EAT42871.1| AAEL005626-PA [Aedes aegypti]
          Length = 711

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 26/76 (34%), Positives = 44/76 (57%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L   +FCL  +G  ++   + D+L  GC+PVI+A++  LPF+D+L+W   SI +  
Sbjct: 303 YPGVLENGQFCLIARGVRLSQPTLMDALAAGCIPVIMADNLILPFSDILDWDLISIRIYE 362

Query: 267 LDIPLLKKILKGISSE 282
            ++  +   LK +S E
Sbjct: 363 NNLHSVITTLKAVSKE 378


>gi|302794690|ref|XP_002979109.1| hypothetical protein SELMODRAFT_418827 [Selaginella moellendorffii]
 gi|300153427|gb|EFJ20066.1| hypothetical protein SELMODRAFT_418827 [Selaginella moellendorffii]
          Length = 435

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 28/101 (27%), Positives = 60/101 (59%), Gaps = 3/101 (2%)

Query: 203 LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
           L+T ++     S FCL   G   ++AR+ D++  GC+PVI+++  + PF  +++++  ++
Sbjct: 279 LETSWSQRHTFSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELEPPFEGLVDYRKVAL 338

Query: 263 V---VATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
               V T +   L   L+ I++ +  +L+S++L+  +HFQ+
Sbjct: 339 FVPSVKTTEKGWLVSYLRAITARQLSMLRSHMLEFSRHFQY 379


>gi|47222207|emb|CAG11086.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1040

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 29/111 (26%), Positives = 52/111 (46%), Gaps = 5/111 (4%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y + L  S FCL  +G  + + R  +SL   C+PV+++N ++LPF+DV+ W    I    
Sbjct: 568 YQELLHNSTFCLVPRGRRLGSFRFLESLQAACIPVLLSNGWELPFSDVIQWNQAVIEGDE 627

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
             +  +   ++ + +E  L      L+ R    W  + S  D   +   E+
Sbjct: 628 RLLLQVPSTVRAVGNERVL-----ALRQRTQMLWEAYFSSVDKIVLTTLEI 673


>gi|345480513|ref|XP_001603840.2| PREDICTED: exostosin-1-like isoform 1 [Nasonia vitripennis]
 gi|345480515|ref|XP_003424163.1| PREDICTED: exostosin-1-like isoform 2 [Nasonia vitripennis]
          Length = 713

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 76/322 (23%), Positives = 122/322 (37%), Gaps = 56/322 (17%)

Query: 45  KSHFVTKDPSKADLFFLPFSIARMRHDRR-IGTEGIPDFISHYIFNISQKYPYWNRTGGA 103
           +S + T DPS+A LF L         DR  + TE +     H + +   + PYWN   G 
Sbjct: 115 ESRYYTSDPSRACLFVLALDTL----DRDPLSTEFV-----HNLPSKVARLPYWN--NGK 163

Query: 104 DHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLP---QIWPRQEDP 160
           +H     +S       +          ++  +S  I       DVS+P   +  P +   
Sbjct: 164 NHLIFNLYSGTWPDYIEDAMAFDYGYAMLAKASMSIMKLREDFDVSIPLFTKQHPERGGE 223

Query: 161 PKLG-----SSKRNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGRLKTPY--- 207
           P L       +K+  LA F G      + S  R  L  +      I+  + R    +   
Sbjct: 224 PGLAIHNHFPNKKKYLAAFKGKRYVHGIGSETRNALHHLHNGKDLIFVTTCRHGKSWREL 283

Query: 208 ADE-----------------LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLP 250
            DE                 LL S FCL  +G  + + R  ++L  GC+PVI++N + LP
Sbjct: 284 QDEHCPQDIREYDMYDYDVLLLNSTFCLVPRGRRLGSFRFLEALRAGCIPVILSNGWALP 343

Query: 251 FADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAF 310
           F D ++W    I      +  +  IL+ +  E+ +     VL+    F W  + S  +  
Sbjct: 344 FHDRIDWFQAVIYADERLLFQVPDILRSVVEEKIM-----VLRQTTQFLWERYFSSLEKI 398

Query: 311 YMVMYEL------WLRRSSVRV 326
               +E+      W  R   RV
Sbjct: 399 IFTTFEIIRERIPWEGRREYRV 420


>gi|68052299|sp|Q5IGR8.1|EXT1A_DANRE RecName: Full=Exostosin-1a; AltName:
           Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
           acetylglucosaminyl-proteoglycan
           4-alpha-N-acetylglucosaminyltransferase 1a; AltName:
           Full=Multiple exostoses protein 1 homolog a
 gi|56785791|gb|AAW29033.1| EXT1a [Danio rerio]
          Length = 730

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 75/365 (20%), Positives = 137/365 (37%), Gaps = 99/365 (27%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++                G   SESY     +   S F T DP +A LF L
Sbjct: 97  GFKVYVYPQQK----------------GEKISESYQNILSSIEGSRFYTSDPGQACLFVL 140

Query: 62  PFSI-------ARMRHDRRIGTEGIP---DFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
                       +  H+ +   + +    +  +H IFN+ S  +P +    G D      
Sbjct: 141 NLDTLDRDQLSPQYVHNLKTKIQNLNLWNNGRNHLIFNLYSGTWPDYTEDLGFD------ 194

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
             IG++ + KA              S        + D+S+P      +D P+ G  +   
Sbjct: 195 --IGQAMLAKA--------------SISTESFRPNFDISIPLF---SKDHPRTGGERGFL 235

Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQV-----------------WRNDSEIY 197
                   R  +  F G      + S  R  L  +                 W+   +  
Sbjct: 236 KYNTIPPFRKYMLVFKGKRYLTGIGSDTRNALYHIHNAEDVVLLTTCKHGKDWQKHKDAR 295

Query: 198 AHSGRL---KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
                    +  Y + L  S FCL  +G  + + R  ++L   CVPV+++N ++LPF+++
Sbjct: 296 CDKDNAEYDRYDYKEMLHNSTFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEI 355

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRK--HFQWHVFPSDYDAFYM 312
           ++W++ +++         +++L  I S    + Q  +L +R+   F W  + S  +   +
Sbjct: 356 IDWRTAAVIGD-------ERLLLQIPSTVRSIHQDRILSLRQQTQFLWEAYFSSVEKIVL 408

Query: 313 VMYEL 317
              E+
Sbjct: 409 TTLEI 413


>gi|321468156|gb|EFX79142.1| hypothetical protein DAPPUDRAFT_304945 [Daphnia pulex]
          Length = 729

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/111 (25%), Positives = 58/111 (52%), Gaps = 5/111 (4%)

Query: 211 LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIP 270
           L  S FCL  +G  + + R  + L  GC+PV+++N++ +PF+++++WK+ +I      + 
Sbjct: 307 LHNSTFCLVPRGRRLGSFRFIEVLQAGCIPVLLSNNWVIPFSEIIDWKTSAIWADERLLL 366

Query: 271 LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
            +  I++ I +E  +      L+ +    WH++ S  D       E+  +R
Sbjct: 367 QVPDIVRSIEAERVM-----ALRQQSQLLWHMYFSSIDRIIFTTLEIVRQR 412


>gi|159490316|ref|XP_001703125.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158270755|gb|EDO96590.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 528

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 37/128 (28%), Positives = 58/128 (45%), Gaps = 8/128 (6%)

Query: 199 HSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWK 258
           H  R +  Y   L  S +CL   G      +I  +L+ GCVPV IA+    PF   +NW 
Sbjct: 371 HVARSEKRYGQYLARSLYCLAPPGAGHGQRQI-QALFMGCVPVTIADGVAEPFEPAVNWT 429

Query: 259 SFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH-----VFPSD--YDAFY 311
            + + VA  D+P +  +L  I  E+  + Q+ +    +H  +      VF  D  +DAF 
Sbjct: 430 DWGVRVAEADVPQMHTLLDDIGPEQLAVKQARMRCAAQHMLYSSITGGVFGEDGAFDAFE 489

Query: 312 MVMYELWL 319
             +  + L
Sbjct: 490 TTLEVMRL 497


>gi|129270192|ref|NP_001012368.2| exostosin-1a [Danio rerio]
 gi|126632203|gb|AAI33118.1| Exostoses (multiple) 1a [Danio rerio]
          Length = 730

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 75/365 (20%), Positives = 137/365 (37%), Gaps = 99/365 (27%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++                G   SESY     +   S F T DP +A LF L
Sbjct: 97  GFKVYVYPQQK----------------GEKISESYQNILSSIEGSRFYTSDPGQACLFVL 140

Query: 62  PFSI-------ARMRHDRRIGTEGIP---DFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
                       +  H+ +   + +    +  +H IFN+ S  +P +    G D      
Sbjct: 141 NLDTLDRDQLSPQYVHNLKTKIQNLNLWNNGRNHLIFNLYSGTWPDYTEDLGFD------ 194

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
             IG++ + KA              S        + D+S+P      +D P+ G  +   
Sbjct: 195 --IGQAMLAKA--------------SISTESFRPNFDISIPLF---SKDHPRTGGERGFL 235

Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQV-----------------WRNDSEIY 197
                   R  +  F G      + S  R  L  +                 W+   +  
Sbjct: 236 KYNTIPPFRKYMLVFKGKRYLTGIGSDTRNALYHIHNAEDVVLLTTCKHGKDWQKHKDAR 295

Query: 198 AHSGRL---KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
                    +  Y + L  S FCL  +G  + + R  ++L   CVPV+++N ++LPF+++
Sbjct: 296 CDKDNAEYDRYDYKEMLHNSTFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEI 355

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRK--HFQWHVFPSDYDAFYM 312
           ++W++ +++         +++L  I S    + Q  +L +R+   F W  + S  +   +
Sbjct: 356 IDWRTAAVIGD-------ERLLLQIPSTVRSIHQDRLLSLRQQTQFLWEAYFSSVETIVL 408

Query: 313 VMYEL 317
              E+
Sbjct: 409 TTLEI 413


>gi|297738432|emb|CBI27633.3| unnamed protein product [Vitis vinifera]
          Length = 441

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 29/106 (27%), Positives = 58/106 (54%), Gaps = 3/106 (2%)

Query: 211 LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVA---TL 267
           +  SKFCLH  G   ++ R+ D++   CVPVI+++  +LP+ D +++  FSI  +    L
Sbjct: 305 MRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPYEDEIDYTQFSIFFSDKEAL 364

Query: 268 DIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
           +   + + L+ I  E ++ +  ++  +  H+++   P   DA  M+
Sbjct: 365 EPGYMIEQLRQIPKERWVEMWRHLKYISHHYEFQYPPKKGDAIDML 410


>gi|380025736|ref|XP_003696624.1| PREDICTED: LOW QUALITY PROTEIN: exostosin-2-like [Apis florea]
          Length = 700

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/82 (32%), Positives = 44/82 (53%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y D L  + FCL ++G  +    + + +  G +PVIIA+   +PF  +++W    I +  
Sbjct: 295 YPDVLQTATFCLVIRGARLGQTTLLECMAAGSIPVIIADSLTMPFHGIIDWTRAVIFIRE 354

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
           +DI  L  +LK IS E  + LQ
Sbjct: 355 VDILSLISVLKKISQERIIELQ 376


>gi|115461230|ref|NP_001054215.1| Os04g0670600 [Oryza sativa Japonica Group]
 gi|113565786|dbj|BAF16129.1| Os04g0670600, partial [Oryza sativa Japonica Group]
          Length = 275

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 42/145 (28%), Positives = 72/145 (49%), Gaps = 13/145 (8%)

Query: 134 SSSYFISGHIAHKDVSLPQ--IWPRQEDPPKLGSSK---RNKLAFFAGAVN----SPVRE 184
           S+S  +S  I H  VSL +  I P     P +  S+   R  L +F GA +      VRE
Sbjct: 39  SASSNVSHMIQHTQVSLLKDVIVPYTHLLPTMHLSENKDRPTLLYFKGAKHRHRGGLVRE 98

Query: 185 KLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
           KL  +  N+ ++    G       ++    +  S+FCLH  G    + R+ D++   C+P
Sbjct: 99  KLWDLMVNEPDVVMEEGYPNATGREQSIKGMRTSEFCLHPAGDTPTSCRLFDAVASLCIP 158

Query: 241 VIIANHYDLPFADVLNWKSFSIVVA 265
           VI+++  +LPF  ++++  F+I V+
Sbjct: 159 VIVSDEIELPFEGMIDYTEFAIFVS 183


>gi|215686890|dbj|BAG89740.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215693850|dbj|BAG89049.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 264

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 42/145 (28%), Positives = 72/145 (49%), Gaps = 13/145 (8%)

Query: 134 SSSYFISGHIAHKDVSLPQ--IWPRQEDPPKLGSSK---RNKLAFFAGAVN----SPVRE 184
           S+S  +S  I H  VSL +  I P     P +  S+   R  L +F GA +      VRE
Sbjct: 28  SASSNVSHMIQHTQVSLLKDVIVPYTHLLPTMHLSENKDRPTLLYFKGAKHRHRGGLVRE 87

Query: 185 KLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
           KL  +  N+ ++    G       ++    +  S+FCLH  G    + R+ D++   C+P
Sbjct: 88  KLWDLMVNEPDVVMEEGYPNATGREQSIKGMRTSEFCLHPAGDTPTSCRLFDAVASLCIP 147

Query: 241 VIIANHYDLPFADVLNWKSFSIVVA 265
           VI+++  +LPF  ++++  F+I V+
Sbjct: 148 VIVSDEIELPFEGMIDYTEFAIFVS 172


>gi|1524415|gb|AAC53143.1| multiple exostosis protein [Mus musculus]
          Length = 718

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/77 (35%), Positives = 44/77 (57%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  + FC  ++   +  A ++D L  GCVPV+IA+ Y LPF+++L+WK  S+VV  
Sbjct: 308 YPQVLQEATFCTVLRRARLGQAVLSDVLQAGCVPVVIADSYILPFSEILDWKKASVVVPE 367

Query: 267 LDIPLLKKILKGISSEE 283
             +  +  IL+ I   +
Sbjct: 368 EKMSDVYSILQNIPQRQ 384


>gi|159466264|ref|XP_001691329.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158279301|gb|EDP05062.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 815

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 76/348 (21%), Positives = 135/348 (38%), Gaps = 94/348 (27%)

Query: 32  NYASESYFKKAFMKSHFVTKDPSKADLFFLPF---------------------SIARMRH 70
            Y  E    +  ++S   T DP  AD F++P                      S  R+ H
Sbjct: 410 TYGLEVLMHEMLLQSEHRTFDPEAADFFYVPVYGSCFIFPLHCYADGPWWYAPSGPRVMH 469

Query: 71  DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHS----------IGRSAMQK 120
              +  E + D I  +       +PYW+R GG DH ++  H            G S    
Sbjct: 470 VTNMMLE-VRDLIRKH-------FPYWDRRGGRDHIWLMTHDEGACYAPSDIYGSSIFLT 521

Query: 121 AWEVKLNAIQVVCSSS------------------YFISGHIAH---KDVSLPQI------ 153
            W  + +  +   + +                  + I GH  +   KD+ +P +      
Sbjct: 522 HWGRRDSDHKSNTAFTPDNYTQEYVHPMQDKGWLHLIEGHPCYTPGKDLIVPALKLPHHF 581

Query: 154 --WPRQEDPPKLGSSKRNKLAFFAGAVNS--------PVREKLLQVWRND------SEIY 197
              P    PP+     R+ L +  G V           +R++L ++W++       + + 
Sbjct: 582 SGSPLLFHPPR----PRDILLYLRGDVGKHRLPNYSRGIRQRLYKLWKDHDWQNKYNAMI 637

Query: 198 AHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
                +   Y++ L  SKFC+   G +  +AR+ D++ +GCVPVI+ ++    F + L++
Sbjct: 638 GDGSDVPGGYSEHLASSKFCVVAPG-DGWSARLEDAVLHGCVPVIVMDNVSAVFEEQLDF 696

Query: 258 KSFSIVV--ATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVF 303
            SFSI V  A  ++  L + LK +      +L+    K+R    WH +
Sbjct: 697 NSFSIRVGEAEAELAYLPERLKSVPPR---ILEGMQKKLRT--IWHRY 739


>gi|356534007|ref|XP_003535549.1| PREDICTED: LOW QUALITY PROTEIN: xylogalacturonan
           beta-1,3-xylosyltransferase-like [Glycine max]
          Length = 373

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 59/241 (24%), Positives = 104/241 (43%), Gaps = 31/241 (12%)

Query: 52  DPSKADLFFLPF----SIARMRHDRRIGTEGIPDFISH-YIFNISQKYPYWNRTGGADHF 106
           DP  AD+ F+PF    S  +    ++ G +   D+     + +  +    WNR+GG DH 
Sbjct: 51  DPLLADVVFVPFFATLSANKGAFRKKHGND---DYKRQRQVVDAVKSTQVWNRSGGRDHV 107

Query: 107 YVA----CHSIGRSAMQKA--WEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ--IWPRQE 158
           +V     C +   S +                C  S  +     H  VS+ +  I P   
Sbjct: 108 FVLTGAFCKNPSFSFVPGGDFGGWSRGGGGSNCGESDVV----PHTQVSVIKDVIVPYMH 163

Query: 159 DPPKLGSSK---RNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE- 210
             P+L  S+   R++L +F GA        +REKL  +  ++  +    G       ++ 
Sbjct: 164 LLPRLDLSENKVRHQLLYFKGAKHRHRGGIIREKLWDLLVSEPGVIMEEGFPNATGREQS 223

Query: 211 ---LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATL 267
              +  S+FCLH  G    + R+ D++   C+PVI+++  +LPF  ++++  FS+  A  
Sbjct: 224 IKGMRTSEFCLHPAGDTPTSCRLFDAIQSLCIPVIVSDIIELPFEGMVDYAEFSVFPAVN 283

Query: 268 D 268
           D
Sbjct: 284 D 284


>gi|268536482|ref|XP_002633376.1| C. briggsae CBR-RIB-1 protein [Caenorhabditis briggsae]
          Length = 349

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 27/74 (36%), Positives = 43/74 (58%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y D L  S FCL  +G  + + R  ++L  GC+PV+I++ + LPF++  +W S  IVVA 
Sbjct: 230 YDDLLANSTFCLVPRGRRLGSFRFLETLRSGCIPVVISDSWVLPFSETTDWNSAVIVVAE 289

Query: 267 LDIPLLKKILKGIS 280
            D   + ++L   S
Sbjct: 290 RDALSIPELLMSTS 303


>gi|384245403|gb|EIE18897.1| hypothetical protein COCSUDRAFT_45043 [Coccomyxa subellipsoidea
           C-169]
          Length = 420

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 87/380 (22%), Positives = 138/380 (36%), Gaps = 70/380 (18%)

Query: 7   VYVY--PHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFS 64
           VYVY  P + N       LP  + P   Y  +    K    S   TKDPS A +F++P  
Sbjct: 34  VYVYDLPEKFNKDIKE--LPTIWHPE-QYDIDQVLHKHLTHSEVNTKDPSVAKVFYIPVY 90

Query: 65  IARMRHDR------------------------------RIGTEGIPDFISHYIFNISQKY 94
           + R  + +                              ++      D +   I ++ + Y
Sbjct: 91  LGRYFNAQWQRFSDPSDAWLINKECHGLDSVDCWAEKWKVAENATSDLVRSAIAHVKENY 150

Query: 95  PYWNRTGGADHFYVACHSIGRSAMQKAWEV----KLNAIQVVCSSSYFISGHIAHKDV-- 148
           PYWN + GADHF V  +  G+  M KA       ++ +IQ   S  Y  +  +   D   
Sbjct: 151 PYWNASNGADHFMVFSYDHGKCEMAKALRFEEFGEMFSIQAYGSLVYRSNAKVQAVDRGD 210

Query: 149 ----SLPQIWPRQEDP-----PKLGSSKRNK-LAFFAGAVNSPVREKLLQVWRNDSEIYA 198
               S P  W           P       N  L+ FA   N  +  +      +   + A
Sbjct: 211 SYSWSGPSTWACYRPDADVLVPVFSPYGHNTILSPFAVERNISLLMRFDYPLNDGKSLVA 270

Query: 199 HSG-RLKTPYAD-----ELLGSKFCLH-VKGFEV-------NTARIADSLYYGCVPVIIA 244
           H G RL+    D      L GS   L   K   +       ++AR+  ++ +GC+PV   
Sbjct: 271 HHGHRLRKELIDYWQEQPLNGSDLGLRSTKARPIPAARNTQDSARVWRAITFGCIPVTFF 330

Query: 245 NHYDLPFADVLN--WKSFSIVVATLDIPLLKKILKGISSEEYLL--LQSYVLKVRKHFQW 300
             +DLPF   +   +  F + +   D   L   ++ +      L  +Q  + + ++ F W
Sbjct: 331 RAFDLPFQRHMGMPYADFVLNIQPDDYRQLNARIQALLDNPSQLRRMQEALERHQRFFVW 390

Query: 301 HVFPSDYDAFYMVMYELWLR 320
                D   + MV  EL LR
Sbjct: 391 DE-AGDGGIYTMVEKELTLR 409


>gi|389744786|gb|EIM85968.1| hypothetical protein STEHIDRAFT_168993 [Stereum hirsutum FP-91666
            SS1]
          Length = 1111

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 38/117 (32%), Positives = 55/117 (47%), Gaps = 4/117 (3%)

Query: 207  YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
            Y   L  + FC    G      R+ DS+Y GC+PV+I      PF D+L+W   SI V  
Sbjct: 972  YMSLLNDTVFCPQPAGTTGWATRLVDSMYAGCIPVLIGQASHFPFYDMLDWGKISIRVEP 1031

Query: 267  LDIPLLKKIL-KGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRS 322
             D+  L+ IL    S ++   LQ+ ++ VR      V+P D     +V  E+  RR 
Sbjct: 1032 SDLAQLEDILFSRYSLDDIERLQANIMLVRDAL---VYPLDDVDDTVVREEMIERRG 1085


>gi|296227312|ref|XP_002759355.1| PREDICTED: exostosin-1 [Callithrix jacchus]
          Length = 475

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 38/147 (25%), Positives = 70/147 (47%), Gaps = 12/147 (8%)

Query: 176 GAVNSPVREKLLQVWRNDSEIYAHSGRLKT---PYADELLGSKFCLHVKGFEVNTARIAD 232
           G  N P+   L+Q    + +I     +       Y + L  + FCL  +G  + + R  +
Sbjct: 19  GMPNIPLIPGLIQAGLAEKDILDQERKWTEHWYDYREMLHNATFCLVPRGRRLGSFRFLE 78

Query: 233 SLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVL 292
           +L   CVPV+++N ++LPF++V+NW   +++    D  LL +I   I S    + Q  +L
Sbjct: 79  ALQAACVPVMLSNGWELPFSEVINWNQAAVIG---DERLLLQIPSTIRS----IHQDKIL 131

Query: 293 KVRK--HFQWHVFPSDYDAFYMVMYEL 317
            +R+   F W  + S  +   +   E+
Sbjct: 132 ALRQQTQFLWEAYFSSVEKIVLTTLEI 158


>gi|291238132|ref|XP_002738985.1| PREDICTED: tout-velu-like, partial [Saccoglossus kowalevskii]
          Length = 403

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 43/67 (64%)

Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLK 273
           + FC+ ++G  +    ++D+L  GC+PV++++ Y LPF+ VL+WK  +I +   D+  + 
Sbjct: 2   ATFCIVLRGARLGQTALSDALKAGCIPVVLSDTYVLPFSQVLDWKRAAIRLNEEDLDQVA 61

Query: 274 KILKGIS 280
            +L+ IS
Sbjct: 62  SVLRSIS 68


>gi|170054218|ref|XP_001863025.1| exostosin-2 [Culex quinquefasciatus]
 gi|167874545|gb|EDS37928.1| exostosin-2 [Culex quinquefasciatus]
          Length = 758

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 26/76 (34%), Positives = 43/76 (56%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L   +FCL  +G  ++   + D+L  GC+PVI+A++  LPF +VL+W   SI +  
Sbjct: 350 YPSVLENGQFCLVARGVRLSQPTLMDALASGCIPVIMADNLVLPFGEVLDWDLVSIRIHE 409

Query: 267 LDIPLLKKILKGISSE 282
            ++  +   LK +S E
Sbjct: 410 NNLHSVISTLKAVSKE 425


>gi|326429052|gb|EGD74622.1| hypothetical protein PTSG_05987 [Salpingoeca sp. ATCC 50818]
          Length = 452

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 43/176 (24%), Positives = 76/176 (43%), Gaps = 22/176 (12%)

Query: 161 PKLGSSKRNKLAFFAGAV-NSPVREKLLQVWRNDSEIYAHSGRLK-----TPYAD----- 209
           P+  +  R+   FFAG + N+ VR  L +          H  ++K      P +      
Sbjct: 268 PQTSARARDTFLFFAGTLSNNRVRTVLKKAVEGHPHCVIHDAQVKMMQDKNPRSSHSLLV 327

Query: 210 ---------ELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV-LNWKS 259
                    E+  S+FCL  +G    T RI +++  GC+PVI+++ Y  PF  +     +
Sbjct: 328 RQVVRDTLAEMATSEFCLCPRGLTAGTRRIFEAVLVGCIPVIVSDGYTWPFPHLAAELDA 387

Query: 260 FSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHV-FPSDYDAFYMVM 314
            S+ V   D   +  IL  +S  E +  +  +  +  +  +H+  P   DAFY ++
Sbjct: 388 ASVRVPEKDAARVLDILGHVSRRERVAKRVRLAHLAHNVTYHLPAPQPGDAFYNII 443


>gi|302809300|ref|XP_002986343.1| hypothetical protein SELMODRAFT_425371 [Selaginella moellendorffii]
 gi|300145879|gb|EFJ12552.1| hypothetical protein SELMODRAFT_425371 [Selaginella moellendorffii]
          Length = 421

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 27/101 (26%), Positives = 59/101 (58%), Gaps = 3/101 (2%)

Query: 203 LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
           L+T ++     S FCL   G   ++AR+ D++  GC+PVI+++  + PF  +++++  ++
Sbjct: 279 LETSWSQRHTFSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELEPPFEGLVDYRKVAL 338

Query: 263 V---VATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
               V T +   L   L+ I++ +  +L+ ++L+  +HFQ+
Sbjct: 339 FVPSVKTTEKGWLVSYLRAITARQLSMLRGHMLEFSRHFQY 379


>gi|358256597|dbj|GAA50186.1| glucuronyl/N-acetylglucosaminyl transferase EXT1 [Clonorchis
           sinensis]
          Length = 802

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 50/195 (25%), Positives = 86/195 (44%), Gaps = 42/195 (21%)

Query: 123 EVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSS--------------- 166
           E +L+  Q + + + F +  I H  D+SLP I P+  +   L +S               
Sbjct: 226 EYRLSLGQAILAKASFSTTKIRHTFDISLPLIHPQHPEKISLEASGHDASYHRVHQRTKV 285

Query: 167 KRNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGRLKTP---YAD--------- 209
           +R  L  F G      + S  R  L  +   D  I   + R  T    YAD         
Sbjct: 286 RRPILLSFKGKRYVSGIGSASRNTLFHLHNGDDVIMVTTCRHGTDWIRYADKRCSVDMAT 345

Query: 210 -------ELL-GSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFS 261
                  EL+  S FCL  +G  + + R  ++L   C+PV+++N ++LPF++V++W S +
Sbjct: 346 YDQYDYNELMHNSTFCLVPRGRRLGSYRFLEALEASCIPVMLSNDWELPFSEVIDW-SKA 404

Query: 262 IVVATLDIPLLKKIL 276
           ++ A   +PL   ++
Sbjct: 405 VIWADEHLPLTLSLM 419


>gi|117606177|ref|NP_001012369.2| exostosin-1b [Danio rerio]
 gi|116487511|gb|AAI25901.1| Exostoses (multiple) 1b [Danio rerio]
 gi|182891996|gb|AAI65657.1| Ext1b protein [Danio rerio]
          Length = 741

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 73/363 (20%), Positives = 133/363 (36%), Gaps = 95/363 (26%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++                G   SESY         S F T DP +A +F L
Sbjct: 108 GFKVYVYPQQK----------------GEKISESYQNILSTIEGSRFYTSDPGQACVFVL 151

Query: 62  PFSI-------ARMRHDRRIGTEGIP---DFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
                       +  H+ +   + +    +  +H IFN+ S  +P +    G D      
Sbjct: 152 SLDTLDRDQLSPQYVHNLKTKVQSLALWNNGRNHLIFNLYSGTWPDYTEDLGFD------ 205

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
             IG++ + KA              S        + D+S+P      ++ P+ G  +   
Sbjct: 206 --IGQAMLAKA--------------SISTENFRPNFDISIPLF---SKEHPRTGGDRGYL 246

Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQV-----------------WRNDSEIY 197
                   R  +  F G      + S  R  L  V                 W+   +  
Sbjct: 247 KYNTIPPFRKYMLVFKGKRYLTGIGSDTRNALYHVHNAEDVVLLTTCKHGKDWQKHKDAR 306

Query: 198 AHSGRL---KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
                    K  Y + L  S FCL  +G  + + R  ++L   CVPV+++N ++LPF++V
Sbjct: 307 CDKDNAEYDKYDYREMLHNSTFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 366

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
           ++W + +++     +  +   ++ I  ++ L      L+ +  F W  + S  +   +  
Sbjct: 367 IDWNTAAVIGDERLLLQIPSTVRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 421

Query: 315 YEL 317
            E+
Sbjct: 422 LEI 424


>gi|68052298|sp|Q5IGR7.1|EXT1B_DANRE RecName: Full=Exostosin-1b; AltName:
           Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
           acetylglucosaminyl-proteoglycan
           4-alpha-N-acetylglucosaminyltransferase 1b; AltName:
           Full=Multiple exostoses protein 1 homolog b
 gi|56785793|gb|AAW29034.1| EXT1b [Danio rerio]
          Length = 741

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 73/363 (20%), Positives = 133/363 (36%), Gaps = 95/363 (26%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
            F+VYVYP ++                G   SESY         S F T DP +A +F L
Sbjct: 108 GFKVYVYPQQK----------------GEKISESYQNILSTIEGSRFYTSDPGQACVFVL 151

Query: 62  PFSI-------ARMRHDRRIGTEGIP---DFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
                       +  H+ +   + +    +  +H IFN+ S  +P +    G D      
Sbjct: 152 SLDTLDRDQLSPQYVHNLKTKVQSLALWNNGRNHLIFNLYSGTWPDYTEDLGFD------ 205

Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
             IG++ + KA              S        + D+S+P      ++ P+ G  +   
Sbjct: 206 --IGQAMLAKA--------------SISTENFRPNFDISIPLF---SKEHPRTGGDRGYL 246

Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQV-----------------WRNDSEIY 197
                   R  +  F G      + S  R  L  V                 W+   +  
Sbjct: 247 KYNTIPPFRKYMLVFKGKRYLTGIGSDTRNALYHVHNAEDVVLLTTCKHGKDWQKHKDAR 306

Query: 198 AHSGRL---KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
                    K  Y + L  S FCL  +G  + + R  ++L   CVPV+++N ++LPF++V
Sbjct: 307 CDKDNAEYDKYDYREMLHNSTFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 366

Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
           ++W + +++     +  +   ++ I  ++ L      L+ +  F W  + S  +   +  
Sbjct: 367 IDWNTAAVIGDERLLLQIPSTVRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 421

Query: 315 YEL 317
            E+
Sbjct: 422 LEI 424


>gi|9454585|gb|AAF87908.1|AC015447_18 Hypothetical protein [Arabidopsis thaliana]
          Length = 414

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 64/276 (23%), Positives = 107/276 (38%), Gaps = 61/276 (22%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPR------GNYASESYFKKAFMKSHFVTKDPSKAD 57
           S ++YVY     D    +L   D   +      G + S+    K  ++S F T    +AD
Sbjct: 88  SLKIYVYDENEIDGLKELLYGRDGSVKTTACLKGQWGSQVKIHKLLLESKFRTIKKDEAD 147

Query: 58  LFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSA 117
           LFF+P   A ++  R +G     +    Y+  +SQ  PY+ R+GG DH +V     G + 
Sbjct: 148 LFFVP---AYVKCVRMLGGLNDKEINQTYVKVLSQ-MPYFRRSGGRDHIFVFPSGAG-AH 202

Query: 118 MQKAWEVKLNAIQVVC-----SSSYFISGHIAHKDVSLP------QIWPRQEDPPKLGSS 166
           + ++W   +N   ++      +     +   + KD+ +P           Q D   L  S
Sbjct: 203 LFRSWSTFINRSIILTPEADRTDKKDTTAFNSWKDIIIPGNVDDAMTKNGQPDVQPLPLS 262

Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVN 226
           KR  LA + G                       +GRLK     +    K           
Sbjct: 263 KRKYLANYLGRAQ------------------GKAGRLKLIDLSKQFPDK----------- 293

Query: 227 TARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
                      CVPV++++H +LPF +V+++   SI
Sbjct: 294 ----------ECVPVLLSDHAELPFQNVIDYAQVSI 319


>gi|340369783|ref|XP_003383427.1| PREDICTED: exostosin-2-like [Amphimedon queenslandica]
          Length = 703

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 38/93 (40%), Positives = 51/93 (54%), Gaps = 3/93 (3%)

Query: 204 KTPYADELLGSKFCLHVK-GFEVNTARIADSLYYGCVPVIIANH-YDLPFADVLNWKSFS 261
           K PY D L  SK+C+ V  G    T  + D L  GCVPVII N+   LPF++V++W+ F+
Sbjct: 304 KLPYPDILQDSKYCMIVSDGTGRGTPDLMDVLMMGCVPVIIRNYELVLPFSEVIDWQRFA 363

Query: 262 IVVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
           + V    +  L  IL G S    +L Q  VL V
Sbjct: 364 VFVWLEQLFQLMPIL-GSSRNGLILKQKQVLHV 395


>gi|380799435|gb|AFE71593.1| exostosin-1, partial [Macaca mulatta]
          Length = 542

 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 33/116 (28%), Positives = 60/116 (51%), Gaps = 9/116 (7%)

Query: 204 KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIV 263
           K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V+NW   +++
Sbjct: 117 KYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVI 176

Query: 264 VATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQ--WHVFPSDYDAFYMVMYEL 317
               D  LL +I   I S    + Q  +L +R+  Q  W  + S  +   +   E+
Sbjct: 177 G---DERLLLQIPSTIRS----IHQDKILALRQQTQFLWEAYFSSVEKIVLTTLEI 225


>gi|349603640|gb|AEP99427.1| Exostosin-1-like protein, partial [Equus caballus]
          Length = 435

 Score = 54.7 bits (130), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 34/120 (28%), Positives = 61/120 (50%), Gaps = 9/120 (7%)

Query: 200 SGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS 259
           SG     Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V+NW  
Sbjct: 6   SGVCLYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQ 65

Query: 260 FSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRK--HFQWHVFPSDYDAFYMVMYEL 317
            +++    D  LL +I   I S    + Q  +L +R+   F W  + S  +   +   E+
Sbjct: 66  AAVIG---DERLLLQIPSTIRS----IHQDKILALRQQTQFLWEAYFSSVEKIVLTTLEI 118


>gi|307106602|gb|EFN54847.1| hypothetical protein CHLNCDRAFT_134888 [Chlorella variabilis]
          Length = 833

 Score = 54.7 bits (130), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 45/176 (25%), Positives = 84/176 (47%), Gaps = 20/176 (11%)

Query: 147 DVSLPQIWPRQE--DPPKLGSSKRNK--LAFFAGAV---NSPVREKLLQVWRNDSEIYAH 199
           D+ LP +   Q+  + P LG+  R +  LAFF G     N      + Q   N +  +  
Sbjct: 590 DLVLPVMHSAQKYLESPMLGAPTRERRILAFFKGRTQQSNPEYSRGIRQTLENLTREHDW 649

Query: 200 SGRLKTPYADE------------LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHY 247
            G+ K    +E            L  S FC  + G +  ++R  D++ +GC+PV+I +  
Sbjct: 650 WGKHKVHVGEEMPEGESDSYSAMLAQSVFCFALMG-DGFSSRTDDAIIHGCIPVLIQDGV 708

Query: 248 DLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVF 303
           +  ++++L+  S+S+ +   D+  + +IL+ IS E+   +Q+ + KV +   W  F
Sbjct: 709 EPTWSNLLDTGSYSVRILQKDMERVPEILQAISKEDVARMQANLGKVWRRHLWSGF 764



 Score = 38.9 bits (89), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 27/96 (28%), Positives = 40/96 (41%), Gaps = 14/96 (14%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTE------GIPDFISHY 86
           Y  E    +A ++S   T DP +AD F++P  ++         T+      G   + +H 
Sbjct: 414 YNPEPGLHEALLQSEHRTLDPDEADFFYIPAFVSCFLFPVLSATDFPYFHGGPVAWRTHA 473

Query: 87  IFN--------ISQKYPYWNRTGGADHFYVACHSIG 114
             N        I   YPYW+R GG DH   + H  G
Sbjct: 474 AANMFIEVYHWIRSHYPYWDRNGGRDHIVGSFHDEG 509


>gi|431901712|gb|ELK08589.1| Exostosin-1 [Pteropus alecto]
          Length = 452

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 32/113 (28%), Positives = 59/113 (52%), Gaps = 9/113 (7%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V+NW   +++   
Sbjct: 30  YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIG-- 87

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRK--HFQWHVFPSDYDAFYMVMYEL 317
            D  LL +I   I S    + Q  +L +R+   F W  + S  +   +   E+
Sbjct: 88  -DERLLLQIPSTIRS----IHQDKILALRQQTQFLWEAYFSSVEKIVLTTLEI 135


>gi|198416199|ref|XP_002119284.1| PREDICTED: similar to exostoses (multiple) 2, partial [Ciona
           intestinalis]
          Length = 659

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 25/85 (29%), Positives = 47/85 (55%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y + L    FCL +    +  + + +S+  GC+PV   + Y LPF++VL+W   S+++  
Sbjct: 298 YPEILQEGTFCLLLPTSRLGQSALMESMQAGCIPVFACDTYILPFSEVLDWSRASVLIRE 357

Query: 267 LDIPLLKKILKGISSEEYLLLQSYV 291
             +P +  IL+ I  E+ +L++  V
Sbjct: 358 DSLPDIMNILRRIPHEQVVLMKKQV 382


>gi|440904844|gb|ELR55304.1| Exostosin-1, partial [Bos grunniens mutus]
          Length = 426

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 26/111 (23%), Positives = 56/111 (50%), Gaps = 5/111 (4%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V+NW   +++   
Sbjct: 4   YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDE 63

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
             +  +   ++ I  ++ L      L+ +  F W  + S  +   +   E+
Sbjct: 64  RLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTTLEI 109


>gi|355757951|gb|EHH61385.1| Exostosin-1, partial [Macaca fascicularis]
          Length = 431

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 26/111 (23%), Positives = 56/111 (50%), Gaps = 5/111 (4%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V+NW   +++   
Sbjct: 9   YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDE 68

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
             +  +   ++ I  ++ L      L+ +  F W  + S  +   +   E+
Sbjct: 69  RLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTTLEI 114


>gi|281354411|gb|EFB29995.1| hypothetical protein PANDA_011713 [Ailuropoda melanoleuca]
          Length = 426

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 32/113 (28%), Positives = 59/113 (52%), Gaps = 9/113 (7%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V+NW   +++   
Sbjct: 4   YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIG-- 61

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRK--HFQWHVFPSDYDAFYMVMYEL 317
            D  LL +I   I S    + Q  +L +R+   F W  + S  +   +   E+
Sbjct: 62  -DERLLLQIPSTIRS----IHQDKILALRQQTQFLWEAYFSSVEKIVLTTLEI 109


>gi|410987732|ref|XP_004000149.1| PREDICTED: exostosin-1 [Felis catus]
          Length = 488

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 32/113 (28%), Positives = 59/113 (52%), Gaps = 9/113 (7%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V+NW   +++   
Sbjct: 66  YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIG-- 123

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRK--HFQWHVFPSDYDAFYMVMYEL 317
            D  LL +I   I S    + Q  +L +R+   F W  + S  +   +   E+
Sbjct: 124 -DERLLLQIPSTIRS----IHQDKILALRQQTQFLWEAYFSSVEKIVLTTLEI 171


>gi|326918032|ref|XP_003205297.1| PREDICTED: exostosin-1-like, partial [Meleagris gallopavo]
          Length = 535

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 33/116 (28%), Positives = 61/116 (52%), Gaps = 9/116 (7%)

Query: 204 KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIV 263
           K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V++WK  +++
Sbjct: 159 KYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWKQAAVI 218

Query: 264 VATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQ--WHVFPSDYDAFYMVMYEL 317
               D  LL +I   I S    + Q  +L +R+  Q  W  + S  +   +   E+
Sbjct: 219 G---DERLLLQIPSTIRS----IHQDKILALRQQTQFLWEAYFSSVEKIVLTTLEI 267


>gi|77748248|gb|AAI05840.1| Ext1 protein [Rattus norvegicus]
          Length = 557

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 27/114 (23%), Positives = 57/114 (50%), Gaps = 5/114 (4%)

Query: 204 KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIV 263
           K  Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V+NW   +++
Sbjct: 132 KYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVI 191

Query: 264 VATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
                +  +   ++ I  ++ L      L+ +  F W  + S  +   +   E+
Sbjct: 192 GDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTTLEI 240


>gi|194698238|gb|ACF83203.1| unknown [Zea mays]
 gi|413919897|gb|AFW59829.1| hypothetical protein ZEAMMB73_270023 [Zea mays]
          Length = 264

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 41/137 (29%), Positives = 65/137 (47%), Gaps = 13/137 (9%)

Query: 143 IAHKDVSLPQ--IWPRQEDPPKLGSSK---RNKLAFFAGAV----NSPVREKLLQVWRND 193
           I H  VSL +  I P     P L  S+   R  L +F GA        VREKL  +  N+
Sbjct: 37  IQHTQVSLLKDVIVPYTHLLPTLLLSENKDRRTLLYFKGAKHRHRGGLVREKLWDLLGNE 96

Query: 194 SEIYAHSGRLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
            ++    G       ++    L  S+FCLH  G    + R+ D++   C+PVI+++  +L
Sbjct: 97  PDVIMEEGFPNATGREQSIKGLRTSEFCLHPAGDTPTSCRLFDAIASLCIPVIVSDEVEL 156

Query: 250 PFADVLNWKSFSIVVAT 266
           PF  ++++   SI V+ 
Sbjct: 157 PFEGIIDYTEISIFVSV 173


>gi|363731063|ref|XP_418396.3| PREDICTED: exostosin-1 [Gallus gallus]
          Length = 581

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 32/113 (28%), Positives = 60/113 (53%), Gaps = 9/113 (7%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V++WK  +++   
Sbjct: 159 YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWKQAAVIG-- 216

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRK--HFQWHVFPSDYDAFYMVMYEL 317
            D  LL +I   I S    + Q  +L +R+   F W  + S  +   +   E+
Sbjct: 217 -DERLLLQIPSTIRS----IHQDKILALRQQTQFLWEAYFSSVEKIVLTTLEI 264


>gi|427796253|gb|JAA63578.1| Putative tout-velu, partial [Rhipicephalus pulchellus]
          Length = 835

 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 77/347 (22%), Positives = 135/347 (38%), Gaps = 54/347 (15%)

Query: 2   NRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFL 61
            R F+VYVYP   + P A+   PV               +   +S + T D S+A +F  
Sbjct: 184 GRDFKVYVYPSESDAPPAS---PV----------YQRILRVIRQSSYATADASEACVFVP 230

Query: 62  --------PFSIARMRHDRRIGTEGIPDFISHYIFNI-SQKYPYWNRTGGAD--HFYVAC 110
                   P S    R  R   +       +H IFN+ S  +P ++   G D     +A 
Sbjct: 231 AVDTVDRDPLSPDYARTARLTDSPLWNGGQNHLIFNLFSGTWPDYSEELGLDIGLAMLAK 290

Query: 111 HSIGRSAMQKAWEVKL----------NAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDP 160
            SI  SA +  +++ L               + S+     G++         ++  +   
Sbjct: 291 SSIPESAFRPGFDIALPLFPKAHPERGGKPAIQSAGPVDKGYLL--------VFKGKRYV 342

Query: 161 PKLGSSKRNKLAFFAGAVN----SPVREKLLQVWRNDSEIYAHSGRL--KTPYADELLGS 214
             +GS  RN L       +    +  R     + R D    A + RL  +  Y   +  +
Sbjct: 343 YGIGSDTRNALHHLNNGRDVLLLTTCRHGKQWMERRDERCEADN-RLYDRYDYGSLMENA 401

Query: 215 KFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKK 274
            FCL  +G  + + R  +SL  GCVPV++AN ++LPF + L W+  ++     D  LL +
Sbjct: 402 TFCLVPRGRRLGSFRFLESLQAGCVPVLLANGWELPFGESLRWEGAALRA---DERLLLQ 458

Query: 275 ILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
           +   + S     +  + ++ R    W  + S  D   + + ++   R
Sbjct: 459 VPDTLRSMPRRRV--HAMRQRSQLLWETYFSSVDKIVLTVLQILRER 503


>gi|406700155|gb|EKD03339.1| hypothetical protein A1Q2_02368 [Trichosporon asahii var. asahii
           CBS 8904]
          Length = 1041

 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 38/148 (25%), Positives = 73/148 (49%), Gaps = 11/148 (7%)

Query: 166 SKRNKLAFFAGAVNSPVREKLLQVW-------RNDSEIYAHSGRLKTPYADELLGSKFCL 218
           ++R  LA F G+ N       L+          +   ++AH    +T Y D L  ++FC 
Sbjct: 845 AERQTLATFKGSPNGQGTSLRLKTTCPRLLPPGSLEPVWAHIPEGQT-YLDLLGDTRFCP 903

Query: 219 HVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKG 278
              G    T R++D +Y GC+PV++ +   + +  + +W  FS+ V   ++  L++IL  
Sbjct: 904 IPFGTAGWTYRLSDVVYAGCIPVLVGDQSHMTYWSMFDWSLFSVQVFEHELDHLERILGA 963

Query: 279 ISSEEYLLLQSYVLKVRKHFQWHVFPSD 306
           ++ E+    Q  ++ VR+ F   ++PS+
Sbjct: 964 VTEEDAQRKQDALMLVREAF---LYPSE 988


>gi|242018853|ref|XP_002429885.1| Exostosin-1, putative [Pediculus humanus corporis]
 gi|212514919|gb|EEB17147.1| Exostosin-1, putative [Pediculus humanus corporis]
          Length = 725

 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 68/297 (22%), Positives = 114/297 (38%), Gaps = 61/297 (20%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPF 63
            F++YVYP           +P+   P  +Y          ++S + T DPS+A LF L  
Sbjct: 91  GFKIYVYPQTDESG-----IPLPLVPSASYQK---IINVIIESRYYTSDPSQACLFVL-- 140

Query: 64  SIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWE 123
           +I  +  D         D++ +    I QK PYWN+  G +H     +S          E
Sbjct: 141 AIDTLDRDSLSN-----DYVRNLPLRI-QKLPYWNK--GRNHVIFNLYS---GTWPDYAE 189

Query: 124 VKLN---AIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSS-----------KRN 169
             L       ++  +S        + DVS+P       +  K G S           K+ 
Sbjct: 190 ENLGFDYGFSILAKASMSTVHFRPNFDVSIPLFHKNHRE--KGGESGYVSSNNFPVIKKY 247

Query: 170 KLAF----FAGAVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE-------- 210
            LAF    +   + S  R  L  +      +   + +       +K    DE        
Sbjct: 248 VLAFKGKRYVHGIGSETRNSLYHLHNEKDMVLVTTCKHGKSWKEMKDERCDEDNQEYDRY 307

Query: 211 -----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
                L  S FCL  +G  + + R  ++L  GC+PV+++N + LPF +V++W   ++
Sbjct: 308 DYEVLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNGWVLPFQEVIDWTKAAV 364


>gi|168041166|ref|XP_001773063.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675610|gb|EDQ62103.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 395

 Score = 54.3 bits (129), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 67/288 (23%), Positives = 117/288 (40%), Gaps = 52/288 (18%)

Query: 50  TKDPSKADLFFLPF-----SIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGAD 104
           T +P +ADLFF+PF     ++  +  D +   +   + +     N +Q    W R GG D
Sbjct: 80  TSNPGEADLFFIPFFSGLEALPYLYTDGKRRLQQGRELVEWLEANATQT---WRRHGGHD 136

Query: 105 HFYVACHSIGRSAMQKA--------WEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPR 156
           HF +A    GR+A            W   L +   + +++  +    + +   +   +P 
Sbjct: 137 HFLIA----GRTAWDFCRPLTAVTWWGTSLFSNPEMENTTAMLLERRSWRGDEMAVPYPV 192

Query: 157 QEDPPK----------LGSSKRNKLAFFAGAVNS----PVREKLLQVWRNDSEIYAH--S 200
              P            + SS R  L  F+GA+       +RE L Q         +    
Sbjct: 193 GFHPSTSASLQSWIKLVRSSTRKYLFSFSGALRPQLVFSIREILSQQCTQAGSACSRLDC 252

Query: 201 GRLKT-----PYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
           G++K      P    LL +KFCL  +G       + DS+  GC+PV    H D  F    
Sbjct: 253 GKIKCSHEPQPIYTSLLQAKFCLQPRGDTATRRSVIDSIVSGCIPVFF--HKDTAFTQYR 310

Query: 256 -----NWKSFSIVVATLDI----PLLKKILKGISSEEYLLLQSYVLKV 294
                ++ +FS+ +   DI      +KKIL+G S+++   ++  ++ +
Sbjct: 311 WHLPNDYDNFSVFIDEEDIKNGKADVKKILEGYSAKQVEQMRERLIGI 358


>gi|328788198|ref|XP_001121972.2| PREDICTED: exostosin-2 [Apis mellifera]
          Length = 708

 Score = 54.3 bits (129), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 26/82 (31%), Positives = 44/82 (53%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y + L  + FCL ++G  +    + + +  G +PVIIA+   +PF  +++W    I +  
Sbjct: 301 YPNVLQTATFCLVIRGARLGQTTLLECMAAGSIPVIIADSLTMPFHGIIDWTRAVIFIRE 360

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
           +DI  L  +LK IS E  + LQ
Sbjct: 361 VDILSLISVLKKISQERIIELQ 382


>gi|351697844|gb|EHB00763.1| Exostosin-like 1 [Heterocephalus glaber]
          Length = 674

 Score = 54.3 bits (129), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 31/117 (26%), Positives = 60/117 (51%), Gaps = 6/117 (5%)

Query: 201 GRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSF 260
           G  +T   ++L  + FCL + G     +R   +L  GC+PV+++ H++LPF++V++W   
Sbjct: 256 GPKQTHPGEKLPNATFCL-IPGRRAEASRFLQALQAGCIPVLLSPHWELPFSEVIDWTKA 314

Query: 261 SIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
           +I VA   +PL  ++L  +  +E    +   L+ +  F W  + S  +       E+
Sbjct: 315 AI-VADERLPL--QVLAAL--QEMAPARVLALRQQTQFLWDAYFSSVEKVIHTTLEI 366


>gi|351697330|gb|EHB00249.1| Exostosin-1, partial [Heterocephalus glaber]
          Length = 426

 Score = 54.3 bits (129), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 26/111 (23%), Positives = 56/111 (50%), Gaps = 5/111 (4%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V+NW   +++   
Sbjct: 4   YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDE 63

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
             +  +   ++ I  ++ L      L+ +  F W  + S  +   +   E+
Sbjct: 64  RLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTTLEI 109


>gi|432094719|gb|ELK26199.1| Exostosin-1 [Myotis davidii]
          Length = 544

 Score = 54.3 bits (129), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 32/113 (28%), Positives = 59/113 (52%), Gaps = 9/113 (7%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V+NW   +++   
Sbjct: 122 YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIG-- 179

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRK--HFQWHVFPSDYDAFYMVMYEL 317
            D  LL +I   I S    + Q  +L +R+   F W  + S  +   +   E+
Sbjct: 180 -DERLLLQIPSTIRS----IHQDKILALRQQTQFLWEAYFSSVEKIVLTTLEI 227


>gi|410931012|ref|XP_003978890.1| PREDICTED: exostosin-1c-like, partial [Takifugu rubripes]
          Length = 424

 Score = 54.3 bits (129), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 28/111 (25%), Positives = 52/111 (46%), Gaps = 5/111 (4%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y + L  S FCL  +G  + + R  +SL   C+PV+++N ++LPF+DV+ W    I    
Sbjct: 3   YQELLHNSTFCLVPRGRRLGSFRFLESLQAACIPVLLSNGWELPFSDVIQWNQAVIEGDE 62

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
             +  +   ++ + ++  L      L+ R    W  + S  D   +   E+
Sbjct: 63  RLLLQVPSTVRAVGNDRVL-----ALRQRTQMLWEAYFSSVDKIVLTTLEI 108


>gi|307177266|gb|EFN66444.1| Exostosin-1 [Camponotus floridanus]
          Length = 711

 Score = 54.3 bits (129), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 79/371 (21%), Positives = 140/371 (37%), Gaps = 77/371 (20%)

Query: 3   RSFRVYVYP-HRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFL 61
           + F VYVYP      P    +L V  E R                 + T DP++A +F L
Sbjct: 87  QGFTVYVYPIEEAISPLYQKILNVITESR-----------------YYTSDPARACIFVL 129

Query: 62  PFSIARMRHDRR-IGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
                    DR  + TE + +  S  +       PYWN   G +H     +S       +
Sbjct: 130 ALDTL----DRDPLSTEFVHNLPSKLL-----HLPYWN--NGRNHLIFNLYSGTWPDYSE 178

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS----------SKRNK 170
                     ++  +S  I  H +  DVS+P     ++ P + G           + +  
Sbjct: 179 ESLAFDMGYAILAKASMSIFRHRSDFDVSIPLF--GKQHPERGGEPGQALENNFPNNKKY 236

Query: 171 LAFFAG-----AVNSPVREKLLQVWRNDSEIYAHS---GRLKTPYADE------------ 210
           +A F G      + S  R  L  +      ++  +   G+    + DE            
Sbjct: 237 VAAFKGKRYVHGIGSETRNALYHLHNGKDLVFVTTCRHGKAWREFQDEHCQQNNQEYDTY 296

Query: 211 -----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVA 265
                L+ + FCL  +G  + + R  ++L  GC+PVI++N + LPF + ++W   +I   
Sbjct: 297 DYEILLMNATFCLVPRGRRLGSFRFLEALRAGCIPVILSNGWALPFHERIDWTQAAIFSD 356

Query: 266 TLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSD-----YDAFYMVMYELWLR 320
              +  +  I++ +S+     +Q   L+ +  F W  + S      + AF  +   L   
Sbjct: 357 ERLLLQIPDIVRSVSN-----VQILKLRQQTQFLWERYFSSIEKIVFTAFENIRERLPWE 411

Query: 321 RSSVRVQWSTS 331
            +  R+ W+TS
Sbjct: 412 GARERIVWNTS 422


>gi|449278615|gb|EMC86416.1| Exostosin-1 [Columba livia]
          Length = 1015

 Score = 53.9 bits (128), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 31/113 (27%), Positives = 59/113 (52%), Gaps = 9/113 (7%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V++W   +++   
Sbjct: 38  YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWNQAAVIG-- 95

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQ--WHVFPSDYDAFYMVMYEL 317
            D  LL +I   I S    + Q  +L +R+  Q  W  + S  +   +   E+
Sbjct: 96  -DERLLLQIPSTIRS----IHQDKILALRQQTQFLWEAYFSSVEKIVLTTLEI 143


>gi|402218829|gb|EJT98904.1| hypothetical protein DACRYDRAFT_110241 [Dacryopinax sp. DJM-731 SS1]
          Length = 1153

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 34/151 (22%), Positives = 65/151 (43%), Gaps = 8/151 (5%)

Query: 162  KLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVK 221
            KL   KR +             ++L+ VW +  +  ++   L          + +C   +
Sbjct: 976  KLNCEKRLRTLEDVATPRLETEQRLMTVWDSLGDYESYPAILND--------TIWCPLPE 1027

Query: 222  GFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISS 281
            G      R+ D +Y GC+PV + +    PF D+L+W   SI +   D+  ++++L   + 
Sbjct: 1028 GVTGWATRLEDVVYGGCIPVFVGHASQYPFYDMLDWSKLSIAIERKDLQRIEEVLMSYTM 1087

Query: 282  EEYLLLQSYVLKVRKHFQWHVFPSDYDAFYM 312
            EE    Q+ ++ VR  F + +  +  D   M
Sbjct: 1088 EEIERFQTNLMLVRDAFLYPLDGNHKDQLTM 1118


>gi|348571154|ref|XP_003471361.1| PREDICTED: exostosin-like 1-like [Cavia porcellus]
          Length = 674

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 31/117 (26%), Positives = 60/117 (51%), Gaps = 6/117 (5%)

Query: 201 GRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSF 260
           G  +T   ++L  + FCL + G     +R   +L  GC+PV+++ H++LPF++V++W   
Sbjct: 256 GPKQTRPEEKLPNATFCL-IPGQRAEASRFLQALQAGCIPVLLSPHWELPFSEVIDWTKA 314

Query: 261 SIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
           +I VA   +PL  ++L  +  +E    +   L+ +  F W  + S  +       E+
Sbjct: 315 AI-VADKRLPL--QVLAAL--QEMAPARVLALRQQTQFLWAAYFSSVEKVIHTTLEI 366


>gi|168023962|ref|XP_001764506.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162684370|gb|EDQ70773.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 438

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 66/291 (22%), Positives = 111/291 (38%), Gaps = 52/291 (17%)

Query: 38  YFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYW 97
           YFK A  +S + T DP KA L+F+         DRR   E IP   +          PYW
Sbjct: 123 YFKDALERSPYYTADPEKACLYFVTV-------DRR--AEKIPSLPT---------LPYW 164

Query: 98  NRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQ 157
           N   G +H  V+         +KA  V+   +    +S    + + A  D+S+     R 
Sbjct: 165 NH--GLNHVIVSISDFW--VQRKATPVETIEMASTMTSIAHHASYRAGFDISVALPQKRS 220

Query: 158 -EDPPKLGSSKRNKLAFFAG-----------------------------AVNSPVREKLL 187
             D   L + +R     F G                                 P  E LL
Sbjct: 221 YSDVQGLKAFERKYFLTFTGMQFLGSSGSRINPVLRSMHNGEDVIIAITCKQDPNSEALL 280

Query: 188 QVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHY 247
                 +E        +     EL+ S F L   G   +++R+ + L  G +PV+I++++
Sbjct: 281 SRPELRAECVQDQSVYEQYTTRELMDSTFGLVQAGRGSSSSRLLEVLSAGSIPVVISDNF 340

Query: 248 DLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
            LPF  +L+W+   +V  +  +  + + L+ +S  E    + + L + + F
Sbjct: 341 VLPFESLLDWRRCLLVFPSSQMQRIVRTLRSLSKGEIEFRREHCLYIYRDF 391


>gi|312378632|gb|EFR25154.1| hypothetical protein AND_09769 [Anopheles darlingi]
          Length = 732

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 28/99 (28%), Positives = 54/99 (54%)

Query: 200 SGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS 259
           SG  +  Y   L    FCL  +G  +    + +++  GC+PVI+A++  LPF+++L+W+ 
Sbjct: 316 SGGNEYEYPAVLESGVFCLIARGVRLAQPVLLEAMATGCIPVIVADNLVLPFSNILDWEL 375

Query: 260 FSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
            S+ V    +  +  +LK +S +    LQ++V  V + +
Sbjct: 376 LSVRVYESQLHSVLALLKRVSDQRIRELQAHVRYVYERY 414


>gi|223975397|gb|ACN31886.1| unknown [Zea mays]
          Length = 134

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 30/106 (28%), Positives = 55/106 (51%), Gaps = 3/106 (2%)

Query: 211 LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI---VVATL 267
           +  SKFCLH  G   ++ R+ D++   CVPVI+++  +LPF D +++  FS+   V   L
Sbjct: 1   MRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSSRIELPFEDEIDYSEFSLFFSVEEAL 60

Query: 268 DIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
               L   L+ +   +++ +   +  V  H+++   P   DA  M+
Sbjct: 61  RPDYLLNELRQVPKRKWVDMWLKLKNVSHHYEFQYPPRKGDAVNMI 106


>gi|260782087|ref|XP_002586123.1| hypothetical protein BRAFLDRAFT_105910 [Branchiostoma floridae]
 gi|229271215|gb|EEN42134.1| hypothetical protein BRAFLDRAFT_105910 [Branchiostoma floridae]
          Length = 551

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 38/116 (32%), Positives = 55/116 (47%), Gaps = 10/116 (8%)

Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL--PFADVLNWKSFSIVVATLDIP- 270
           S FCL   G         D++  GCVPV     + +  PF  VLN+  FS+++   D+  
Sbjct: 417 SVFCLQPPGDSPTRKSFYDAVACGCVPVTFTLEHPVRYPFDQVLNYSDFSVIIDGKDVTD 476

Query: 271 ---LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPS---DYDAFYMVMYELWLR 320
               +  IL+ I SE   +LQ  + KV    Q+  +PS     DAF MV+ E+  R
Sbjct: 477 RNITILNILRKIPSERIKMLQDNLKKVAPLLQYS-YPSTVPSQDAFTMVLEEMAQR 531


>gi|395521813|ref|XP_003765009.1| PREDICTED: exostosin-like 1 [Sarcophilus harrisii]
          Length = 694

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 26/107 (24%), Positives = 52/107 (48%), Gaps = 5/107 (4%)

Query: 211 LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIP 270
           L  S FC         + R+  +L  GCVPV+++  ++LPFA+V++W + ++++    + 
Sbjct: 286 LHNSTFCFIPSSCHAGSFRLLQALKAGCVPVLLSRGWELPFAEVIDWGTAAVIIDERHLL 345

Query: 271 LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
            +K +L+G+     L      L+ +  F W  + S  +       E+
Sbjct: 346 QIKSVLQGLPPARVL-----ALRQQTQFLWDAYFSSVEKIVHSTLEI 387


>gi|391340944|ref|XP_003744793.1| PREDICTED: exostosin-1a-like [Metaseiulus occidentalis]
          Length = 1047

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 73/302 (24%), Positives = 120/302 (39%), Gaps = 77/302 (25%)

Query: 5   FRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFLP 62
           FRVY+YP                   G   S +Y K   A  +S F T +P +A LF   
Sbjct: 359 FRVYIYPLED----------------GQAVSATYMKLLSAIRRSRFYTSNPEEACLFVPN 402

Query: 63  FSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADH--FYVACHSIGRSAMQK 120
                   DR I ++         ++N+    P+WN  GG +H  F +   S    +   
Sbjct: 403 VDTL----DRDILSDEYVRSAQARLWNL----PHWN--GGRNHLIFNLFSGSWPDYSQDL 452

Query: 121 AWEVKLNAI-QVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLG-----SSKRNK---- 170
            ++  L  + +     + F  G+    D+S+P ++PR    P++G     SS  +K    
Sbjct: 453 GFDPGLAMLAKSSAPETIFRPGY----DISIP-LFPRTH--PEIGGEPGFSSSESKIITP 505

Query: 171 -----LAFFAG-----AVNSPVREKLLQV-----------------WRNDSEIYAHSGRL 203
                L  F G      + S +R  L  +                 W+   +    S   
Sbjct: 506 LRKRFLLTFKGKRYLYGIGSEIRNSLFHLNNVNDVLLLTTCKHGKQWKLKKDERCDSDNA 565

Query: 204 ---KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSF 260
              K  Y   +  S FCL  +G  + + R  +SL  GC+P++++N + LPF +V++WKS 
Sbjct: 566 DYDKQDYTVLMQNSTFCLVPRGRRLGSFRFLESLQAGCIPIVLSNDWRLPFDEVIDWKSA 625

Query: 261 SI 262
           +I
Sbjct: 626 TI 627


>gi|91076108|ref|XP_968944.1| PREDICTED: similar to tout-velu CG10117-PA [Tribolium castaneum]
 gi|270014706|gb|EFA11154.1| hypothetical protein TcasGA2_TC004758 [Tribolium castaneum]
          Length = 719

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 73/302 (24%), Positives = 116/302 (38%), Gaps = 73/302 (24%)

Query: 2   NRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLF 59
            + FRVYVYP   N                +  S SY K     M+S + T DP  A LF
Sbjct: 79  QQGFRVYVYPPDDN----------------STPSPSYQKLLNVLMESRYYTADPRLACLF 122

Query: 60  FLPFSIARMRHDRRIGTEGIPDFISHYIFNIS---QKYPYWNRTGGADHFYVACHSIGRS 116
            L  SI  +  DR           + Y+ N+    Q  P+WN   G +H     +S G  
Sbjct: 123 VL--SIDTLDRDR---------LSADYVRNMQSRLQHLPHWN--NGLNHVIFNLYS-GTW 168

Query: 117 AMQKAWEVKLNAIQVVCSSSYFISGHIAHK-DVSLP---QIWPRQEDPPKLGSS------ 166
                  +  +    + + +     H+    D+S+P   ++ P  E   ++GS       
Sbjct: 169 PNYTENNLDFDYGMAILAKASMSDSHMRPGFDISIPLFHKVHP--EKGGEVGSVLANSLP 226

Query: 167 -KRNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE--- 210
            ++N L  F G      + S  R  L  +      I   + R       +K    D+   
Sbjct: 227 LQKNYLLAFKGKRYVHGIGSDTRNSLYHLHNRKDMIMVTTCRHGKSWKDMKDERCDQDNK 286

Query: 211 ----------LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSF 260
                     L  S FCL  +G  + + R  ++L  GC+PV+++N + LPFA  ++W   
Sbjct: 287 EYDKYDYEVLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNGWALPFAQKIDWSKA 346

Query: 261 SI 262
           +I
Sbjct: 347 AI 348


>gi|260786761|ref|XP_002588425.1| hypothetical protein BRAFLDRAFT_153069 [Branchiostoma floridae]
 gi|229273587|gb|EEN44436.1| hypothetical protein BRAFLDRAFT_153069 [Branchiostoma floridae]
          Length = 107

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 32/110 (29%), Positives = 52/110 (47%), Gaps = 5/110 (4%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  S FCL  +G  + + R  ++L   C+PVI AN ++LPF++V+ W   +I    
Sbjct: 3   YNTLLHNSTFCLVPRGRRLGSFRFLEALQAACIPVIEANGWELPFSEVIEWDRATITADE 62

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYE 316
             +  L  IL+ I  E+ L      L+ +  F W  + S  +   +   E
Sbjct: 63  RLLFQLPSILRAIPPEKIL-----ALRQQTQFLWETYFSSVERIVLTTLE 107


>gi|47207620|emb|CAG13862.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 426

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 30/113 (26%), Positives = 60/113 (53%), Gaps = 9/113 (7%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y + L  S FCL  +G  + + R  ++L   CVPV+++N ++LPF+++++W + +++   
Sbjct: 4   YKEMLHNSTFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEIIDWNTAAVIG-- 61

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRK--HFQWHVFPSDYDAFYMVMYEL 317
            D  LL +I   + S    + Q  +L +R+   F W  + S  +   +   E+
Sbjct: 62  -DERLLLQIPTTVRS----IHQDRILSLRQQTQFLWEAYFSSVEKIVLTTLEI 109


>gi|322795454|gb|EFZ18199.1| hypothetical protein SINV_05251 [Solenopsis invicta]
          Length = 711

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 88/378 (23%), Positives = 153/378 (40%), Gaps = 91/378 (24%)

Query: 3   RSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLP 62
           + F VYVYP        +V+ P+ ++   N  +ES          + T DP++A +F L 
Sbjct: 87  QGFTVYVYP------VEDVISPL-YQKILNVITES---------RYYTSDPARACIFVLA 130

Query: 63  FSIARMRHDRR-IGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHS---IGRSAM 118
                   DR  + TE + +  S  +       PYWN   G +H     +S      +  
Sbjct: 131 LDTL----DRDPLSTEFVHNLPSKLL-----HLPYWN--NGRNHLIFNLYSGTWPDYAEE 179

Query: 119 QKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKR--------NK 170
             A++V      ++  +S  I  H    DVS+P     ++ P + G S +        NK
Sbjct: 180 SLAFDV---GYAMLAKASMSIFRHRPDFDVSIPLF--GKQHPERGGESGQALENNFPNNK 234

Query: 171 --LAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE------ 210
             +A F G      + S  R  L  +      ++  + R       L+  +  +      
Sbjct: 235 KYVAAFKGKRYVHGIGSETRNALYHLHNGKDLVFVTTCRHGKAWRELQDEHCQQDNQEYD 294

Query: 211 -------LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS---F 260
                  L+ + FCL  +G  + + R  ++L  GC+PVI++N + LPF + ++W     F
Sbjct: 295 TYDYEILLMNATFCLVPRGRRLGSFRFLEALRSGCIPVILSNGWALPFHERIDWTQAVIF 354

Query: 261 SIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRK--HFQWHVFPSDYDAFYMVMYELW 318
           S     L IP    IL+ +S+ +       +LKVR+   F W  + S  +     ++E  
Sbjct: 355 SDERLLLQIP---DILRSVSNVQ-------ILKVRQQTQFLWERYFSSIEKIVFTVFENI 404

Query: 319 LRR-----SSVRVQWSTS 331
             R     +  R+ W+T+
Sbjct: 405 RERLPWEGARERIVWNTN 422


>gi|390338044|ref|XP_783281.3| PREDICTED: exostosin-1b-like [Strongylocentrotus purpuratus]
          Length = 702

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 28/111 (25%), Positives = 57/111 (51%), Gaps = 5/111 (4%)

Query: 211 LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIP 270
           L  S FCL  +G  + + R  +SL   C+P++++N ++LPF++V++W    +      + 
Sbjct: 276 LHNSTFCLVPRGRRLGSFRFLESLQAACIPMLLSNGWELPFSEVIDWSKAVVFGDERLLL 335

Query: 271 LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
            +  I++ I++E+ LLL+      +  F W+ + S  +       E+   R
Sbjct: 336 QVPSIVRSITAEQILLLRQ-----QTQFLWNSYFSSTEKIVHTTLEIIKER 381


>gi|149410585|ref|XP_001509292.1| PREDICTED: exostosin-1 [Ornithorhynchus anatinus]
          Length = 443

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 32/113 (28%), Positives = 59/113 (52%), Gaps = 9/113 (7%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V++W   +I+   
Sbjct: 21  YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWNQAAIIG-- 78

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRK--HFQWHVFPSDYDAFYMVMYEL 317
            D  LL +I   I S    + Q  +L +R+   F W  + S  +   +   E+
Sbjct: 79  -DERLLLQIPSTIRS----IHQDKILALRQQTQFLWEAYFSSVEKIVLTTLEI 126


>gi|332017043|gb|EGI57842.1| Exostosin-1 [Acromyrmex echinatior]
          Length = 911

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 86/379 (22%), Positives = 150/379 (39%), Gaps = 93/379 (24%)

Query: 3   RSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLP 62
           + F VYVYP        +V+ P+ ++   N  +ES          + T DP++A +F L 
Sbjct: 87  QGFTVYVYP------VEDVISPL-YQKILNVITES---------RYYTSDPARACIFVLA 130

Query: 63  FSIARMRHDRR-IGTEGIPDFISHYIFNISQKY---PYWNRTGGADHFYVACHS---IGR 115
                   DR  + TE        ++ N+  K    PYWN   G +H     +S      
Sbjct: 131 LDTL----DRDPLSTE--------FVHNLPLKLLHLPYWN--NGRNHLIFNLYSGTWPDY 176

Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS---------- 165
           +    A++V      ++  +S  I  H    DVS+P     ++ P + G           
Sbjct: 177 AEESLAFDV---GYAMLAKASMSIFKHRPDFDVSIPLFG--KQHPERGGEPGQALENNFP 231

Query: 166 SKRNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHS---GRLKTPYADE------- 210
           + +  +A F G      + S  R  L  +      ++  +   G+    + DE       
Sbjct: 232 NNKKYVAAFKGKRYVHGIGSETRNALYHLHNGKDLVFVTTCRHGKTWREFQDEHCQQDNQ 291

Query: 211 ----------LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS- 259
                     L+ + FCL  +G  + + R  ++L  GC+PVI++N + LPF + ++W   
Sbjct: 292 EYDTYDYEILLMNATFCLVPRGRRLGSFRFLEALRSGCIPVILSNSWALPFHERIDWNQA 351

Query: 260 --FSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
             FS     L IP    IL+ +S+     +Q   L+ +  F W  + S  +     ++E 
Sbjct: 352 VIFSDERLLLQIP---DILRSVSN-----VQILKLRQQTQFLWERYFSSIEKIVFTVFEN 403

Query: 318 WLRR-----SSVRVQWSTS 331
              R     +  R+ W+TS
Sbjct: 404 IRERLPWEGARERIVWNTS 422


>gi|449662559|ref|XP_002154865.2| PREDICTED: exostosin-2-like [Hydra magnipapillata]
          Length = 729

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 64/280 (22%), Positives = 114/280 (40%), Gaps = 42/280 (15%)

Query: 6   RVYVYPHR--RNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPF 63
           +VY+YP+   R      + +   FE        +  K+A  KS + T++ ++A LF  P 
Sbjct: 144 KVYIYPNNEYRTSNNEKIFMEESFE-------FNLIKEAIEKSKYFTENITEACLFIPPL 196

Query: 64  SIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWE 123
            +        +  EGI   ++  +     ++   N     + F +   SI  +AM  A+ 
Sbjct: 197 DL--------LTEEGIDLKLASAVLKFLSRWESGNNHVIFNLFPLHPTSIN-TAMSLAYG 247

Query: 124 VKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS-SKRNKLAFFAGAVNSPV 182
              NAI  V  +++  S +    D+S+P          K  + S   K       V+  +
Sbjct: 248 ---NAI--VAGANFISSNYRQGFDISIPLFTKAMTSTQKFSTFSGTRKWKLIVSQVSMDI 302

Query: 183 RE-----KLLQVWRNDSEIYAHSGRLKTPYADELLG-------------SKFCLHVKGFE 224
            +     KL  +  +   I A  G     Y + L                 FCL + G+ 
Sbjct: 303 EKRNVLNKLKDLHPDLLMIQACPGTFDMMYPEVLCKDSKPFYYPAVLQEGTFCLLLPGYY 362

Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV 264
             ++ + D++  GC+PVI+ N Y LPF +V++W   +I+V
Sbjct: 363 YGSSLLLDAMMMGCIPVIMMNDYVLPFNEVIDWSRAAIIV 402


>gi|444706299|gb|ELW47642.1| Exostosin-like 1, partial [Tupaia chinensis]
          Length = 595

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 31/118 (26%), Positives = 59/118 (50%), Gaps = 6/118 (5%)

Query: 200 SGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS 259
           +G  +T   D L  + FCL + G      R   +L  GC+PV+++  ++LPF++V++W  
Sbjct: 175 AGAEQTHPGDTLPSATFCL-IPGRRPGAGRFLQALQAGCIPVLLSPRWELPFSEVIDWTK 233

Query: 260 FSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
            +I VA   +PL  ++L  +  +E    +   L+ +  F W  + S  +   +   E+
Sbjct: 234 AAI-VADERLPL--QVLAAL--QEMPPARVLALRQQTQFLWGAYFSSVEKVVLTTLEV 286


>gi|390340500|ref|XP_003725255.1| PREDICTED: xyloglucan galactosyltransferase KATAMARI1 homolog
           [Strongylocentrotus purpuratus]
          Length = 495

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/139 (28%), Positives = 64/139 (46%), Gaps = 13/139 (9%)

Query: 189 VWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVII---AN 245
           VW N  E +     +  P  D +  S FCL   G+        DS+  GC+PV      +
Sbjct: 355 VWFNTPECHQD---IHLPIIDWMRHSIFCLQPPGYSNIRKSFYDSIMSGCIPVTFRSKRS 411

Query: 246 HYDLPFADVLNWKSFSIVVATLDI----PLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
           H   PF   L+++ F++ +   ++      +  ILKGI+  +   LQ+ + +V   FQ+ 
Sbjct: 412 HVIYPFERTLDYRRFTVNIPIDEVLSGKTNVTNILKGITKWKIAELQTELAEVAPKFQYS 471

Query: 302 VFP---SDYDAFYMVMYEL 317
             P    DYDAF  ++ E+
Sbjct: 472 YPPIRGPDYDAFAAIIEEM 490


>gi|116786286|gb|ABK24052.1| unknown [Picea sitchensis]
          Length = 208

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 36/106 (33%), Positives = 55/106 (51%), Gaps = 9/106 (8%)

Query: 166 SKRNKLAFFAGAVNSPV-REKLLQVWRN-DSEIYAHSGRLKTP-------YADELLGSKF 216
           SKR  LA F G V   V R +LL++ +    ++ A   +   P       Y   L  +KF
Sbjct: 8   SKRKYLANFLGRVQGKVGRLQLLKLSQQFPDKLEAPELKFSGPEKFGRIEYFQHLRNAKF 67

Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
           CL  +G    T R  ++ +  CVPVI+++  +LPF +VL++  FSI
Sbjct: 68  CLAPRGESSWTLRFYEAFFVECVPVILSDQIELPFQNVLDYSQFSI 113


>gi|384251773|gb|EIE25250.1| hypothetical protein COCSUDRAFT_40557 [Coccomyxa subellipsoidea
           C-169]
          Length = 280

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 33/98 (33%), Positives = 53/98 (54%), Gaps = 3/98 (3%)

Query: 205 TPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV 264
           T Y  +   S FCL   G      R+  +L +GC+PVIIA++  +PF DVL ++ F++ V
Sbjct: 145 TQYMADFGRSTFCLAATGAGWG-VRLKLALMHGCIPVIIADNVQMPFEDVLPYQDFAVHV 203

Query: 265 ATLDIPLLKKILKGISSEEYLL--LQSYVLKVRKHFQW 300
               +  L ++L  I S E L+  +Q  V  + ++F W
Sbjct: 204 REHALYRLPEVLDAILSTEGLVKRMQINVSCIWRYFTW 241


>gi|307197832|gb|EFN78943.1| Exostosin-1 [Harpegnathos saltator]
          Length = 711

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 83/373 (22%), Positives = 146/373 (39%), Gaps = 81/373 (21%)

Query: 3   RSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLP 62
           + F VYVYP        +V+ P+ ++   N  +ES          + T DP++A +F L 
Sbjct: 87  QGFTVYVYP------VEDVISPL-YQKILNVITES---------RYYTSDPARACIFVLA 130

Query: 63  FSIARMRHDRR-IGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKA 121
                   DR  + TE + +  S  +       PYWN   G +H     +S       + 
Sbjct: 131 LDTL----DRDPLSTEFVHNLPSKLL-----HLPYWN--NGRNHLIFNLYSGTWPDYAEE 179

Query: 122 WEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS----------SKRNKL 171
             V      ++  +S  I  +    DVS+P     ++ P + G           S +  +
Sbjct: 180 SLVFDVGYAILAKASMSIFRYRPDFDVSIPLF--GKQHPERGGEPGQALENNFPSNKKYV 237

Query: 172 AFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE--------- 210
           A F G      + S  R  L  +      ++  + R       L+  +  +         
Sbjct: 238 AAFKGKRYVHGIGSETRNALYHLHNGKDLVFVTTCRHGKAWRELQDEHCQQDNQEYDTYD 297

Query: 211 ----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS---FSIV 263
               L+ + FCL  +G  + + R  ++L  GC+PVI++N + LPF + ++W     FS  
Sbjct: 298 YEILLMNATFCLVPRGRRLGSFRFLEALRAGCIPVILSNGWALPFHERIDWTQAVIFSDE 357

Query: 264 VATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR-- 321
              L IP    I++ +S+     +Q   L+ +  F W  + S  +     ++E    R  
Sbjct: 358 RLLLQIP---DIVRSVSN-----VQILKLRQQTQFLWERYFSSIEKIVFTVFENIRERLP 409

Query: 322 ---SSVRVQWSTS 331
              +  RV W+TS
Sbjct: 410 WEGARERVVWNTS 422


>gi|159473595|ref|XP_001694919.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158276298|gb|EDP02071.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 674

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 37/125 (29%), Positives = 61/125 (48%), Gaps = 3/125 (2%)

Query: 177 AVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYY 236
           A +  VR+ LL ++ N +       +   P    +L S+FC    GF     R+  +   
Sbjct: 526 AYSGGVRQGLLALFGNTTRPDVSINKGGGP--SLMLRSRFCFTPMGFGWGV-RLTQAAMT 582

Query: 237 GCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRK 296
           GCVPV++ +H      DVL ++ FSI V+  ++  L +IL  I++EE   LQ+ +    +
Sbjct: 583 GCVPVMVQDHVWPTLWDVLPYEKFSIRVSRHNLYRLFEILDSITAEELASLQAGLAHWHR 642

Query: 297 HFQWH 301
            F W 
Sbjct: 643 AFVWQ 647


>gi|388496634|gb|AFK36383.1| unknown [Medicago truncatula]
          Length = 316

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 31/88 (35%), Positives = 44/88 (50%), Gaps = 10/88 (11%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISHY 86
           +ASE    +A + S   T DP +AD FF+P  ++           G P        IS  
Sbjct: 133 FASEVAIHRALLTSDVRTFDPYEADFFFVPVYVS----CNFSTVNGFPAIGHARSLISSA 188

Query: 87  IFNISQKYPYWNRTGGADHFYVACHSIG 114
           +  IS +YP+WNR+ G+DH +VA H  G
Sbjct: 189 VKLISTEYPFWNRSTGSDHVFVASHDFG 216


>gi|339247053|ref|XP_003375160.1| exostosin-1 [Trichinella spiralis]
 gi|316971539|gb|EFV55297.1| exostosin-1 [Trichinella spiralis]
          Length = 1019

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 74/303 (24%), Positives = 118/303 (38%), Gaps = 70/303 (23%)

Query: 1   MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADL 58
           ++R F+VY+YP   N          D E +    S SY K  +   +S + T DP +A L
Sbjct: 377 IDRPFKVYIYPDISN---------FDEESK---TSASYRKILQILRQSKYFTDDPDQACL 424

Query: 59  FFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYP-----YWN--RTGGADHFYVACH 111
           F L +            T       + Y+ N++ K        WN  +  G +H     +
Sbjct: 425 FVLSYD-----------TLSRDSLSAEYVENMNAKIKSLPTNLWNNGKVSGMNHLIFNLY 473

Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDVSLPQI----------------- 153
           S G        E+     Q + + + F + H  +H D+SL                    
Sbjct: 474 S-GTWPDYDLTELGFEPGQAILAKASFSTRHYRSHFDISLALFHDILPLRGLNATDVENV 532

Query: 154 ---WPRQEDPPKL-----------GSSKRNKLAFFAGAVNSPVRE--KLLQVW-RNDSEI 196
              WPR      L           GS  RN L     A +  +    K  + W +N  E 
Sbjct: 533 DLNWPRSNWSYTLVFKGKRYVFGIGSETRNALYHLHNAKDIIMLTTCKHGKDWMKNQDER 592

Query: 197 YAHSGRLKTP--YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
            +    L     Y + +  SKFCL  +G  + + R  ++L  GC+PVI++N + LPF++V
Sbjct: 593 CSIDNDLYDNWNYEELMANSKFCLVPRGRRLGSFRFLEALEKGCIPVILSNDWVLPFSEV 652

Query: 255 LNW 257
           ++W
Sbjct: 653 IDW 655


>gi|195431447|ref|XP_002063753.1| GK15839 [Drosophila willistoni]
 gi|194159838|gb|EDW74739.1| GK15839 [Drosophila willistoni]
          Length = 776

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 78/326 (23%), Positives = 127/326 (38%), Gaps = 59/326 (18%)

Query: 5   FRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFLP 62
           F VYVYP            P++        S +Y K   A  +S + T DPS A LF L 
Sbjct: 111 FLVYVYPPE----------PLNSLGAAPPTSANYQKILTAIQESRYYTTDPSIACLFVLA 160

Query: 63  FSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYV----------ACHS 112
                   DR   +E   D++ +    ++ + P+WN   G +H             A HS
Sbjct: 161 IDTL----DRDSLSE---DYVRNVPSRLA-RLPHWN--NGRNHIIFNLYSGTWPDYAEHS 210

Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLA 172
           +G  A + A   K +   +     + +S  + HK   L              ++K+  LA
Sbjct: 211 LGFDAGE-AILAKASMSVLQLRPGFDVSIPLFHKQFPLRAGATGSVQSNNFPANKKYLLA 269

Query: 173 F----FAGAVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE----------- 210
           F    +   + S  R  L  +      +   + R       L+    DE           
Sbjct: 270 FKGKRYVHGIGSETRNSLFHLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYDRYDYE 329

Query: 211 --LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
             L  S FCL  +G  + + R  ++L  GC+PV+++N + LPF   ++WK  +I      
Sbjct: 330 TLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWADERL 389

Query: 269 IPLLKKILKGISSEEYLLL--QSYVL 292
           +  +  I++ IS+E    L  Q+ VL
Sbjct: 390 LLQVPDIVRSISAERIFALRQQTQVL 415


>gi|156362595|ref|XP_001625861.1| predicted protein [Nematostella vectensis]
 gi|156212714|gb|EDO33761.1| predicted protein [Nematostella vectensis]
          Length = 728

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 77/357 (21%), Positives = 144/357 (40%), Gaps = 77/357 (21%)

Query: 5   FRVYVYP----HRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
           FR+YVYP     + +  + N++                  K   +S + T+ P +A LF 
Sbjct: 101 FRIYVYPVHPGAKLSTTYTNII------------------KVIKESRYYTEFPEEACLFI 142

Query: 61  LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
              +I  +  D+        D++ H I+N  ++ PYW    G +H  +  +    +    
Sbjct: 143 T--AIDTLDRDKLSA-----DYV-HNIYNKIRQLPYWK--NGENH--IIFNLFAGTWPDY 190

Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGS----------SKRN 169
           + +V  +  + +   +   S  I    DVSLP ++P+      LG+           ++ 
Sbjct: 191 SEDVGFDFGKAILVKASLSSDLIRPGFDVSLP-LFPKTHPHKDLGNLPHSCSAFPLERKY 249

Query: 170 KLAF----FAGAVNSPVREKLLQVWRNDSEIY----AHSGRLKTPYADE----------- 210
           KLAF    +   + S  R  L  +  N  +I        G+    + DE           
Sbjct: 250 KLAFKGKRYLNGIGSESRNALYHI-HNGRDIVLLTTCKHGKAWHKHKDERCDGDNALYDR 308

Query: 211 ------LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV 264
                 LL + FCL  +G  + + R  +SL  GC+P ++++ ++LPFA+V++WK   I  
Sbjct: 309 YSYDELLLNATFCLVPRGRRLGSFRFLESLKVGCIPFLLSDGWELPFAEVIDWKKAVIDG 368

Query: 265 ATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
           +   +  +  I++  S  + L ++   L     F W+ + S  +       E+   R
Sbjct: 369 SERLLMQVPGIVRSYSRSQVLAMKQQSL-----FLWNAYFSSVEKIVHTTIEVIRSR 420


>gi|380027202|ref|XP_003697318.1| PREDICTED: exostosin-1-like [Apis florea]
          Length = 711

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 79/355 (22%), Positives = 142/355 (40%), Gaps = 80/355 (22%)

Query: 3   RSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLP 62
           + F VYVYP        +V+ P+ ++   N  +ES          + T DP++A +F L 
Sbjct: 87  QGFTVYVYP------IEDVISPL-YQKILNVITES---------RYYTSDPTRACIFVLA 130

Query: 63  FSIARMRHDRR-IGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKA 121
                   DR  + TE + +  S  +     + PYWN   G +H     +S       + 
Sbjct: 131 LDTL----DRDPLSTEFVHNLPSKLM-----RLPYWN--NGRNHLIFNLYSGTWPDYAEE 179

Query: 122 WEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKR--------NK--L 171
                    ++  +S  I  H    D+S+P     ++ P + G S +        NK  +
Sbjct: 180 SLAFDLGYAMLAKASMSIFRHRPEFDISIPLF--GKQHPERGGESGQALENNFPNNKKYI 237

Query: 172 AFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE--------- 210
           A F G      + S  R  L  +      ++  + R       L+  +  +         
Sbjct: 238 AAFKGKRYVHGIGSETRNALYHLHNGKDLVFVTTCRHGKAWRELQDEHCQQDNQEYDMYD 297

Query: 211 ----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS---FSIV 263
               L+ + FCL  +G  + + R  ++L  GC+PVI++N + LPF + ++W     FS  
Sbjct: 298 YEILLMNATFCLVPRGRRLGSFRFLEALRAGCIPVILSNGWALPFHERIDWTQAVIFSDE 357

Query: 264 VATLDIPLLKKILKGISSEEYLLLQSYVLKVRK--HFQWHVFPSDYDAFYMVMYE 316
              L IP    I++ +S+        ++LK+R+   F W  + S  +     ++E
Sbjct: 358 RLLLQIP---DIVRSVSN-------VHILKLRQQTQFLWERYFSSIEKIVFTVFE 402


>gi|281347217|gb|EFB22801.1| hypothetical protein PANDA_019397 [Ailuropoda melanoleuca]
          Length = 360

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 23/52 (44%), Positives = 35/52 (67%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWK 258
           Y   L  + FC+ ++G  +  A ++D L  GCVPV+IA+ Y LPF++VL+WK
Sbjct: 308 YPQVLQEAIFCVVLRGARLGQAVLSDVLRAGCVPVVIADSYILPFSEVLDWK 359


>gi|302835519|ref|XP_002949321.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
 gi|300265623|gb|EFJ49814.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
          Length = 855

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 78/338 (23%), Positives = 127/338 (37%), Gaps = 72/338 (21%)

Query: 32  NYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIA----RMRHDRRIGTEGIPDFISHYI 87
            Y+ E+YF +    S   T DP +AD F++P         +     +   G P     Y 
Sbjct: 444 TYSVEAYFHEVLSISPHRTFDPEEADFFYVPVYYTCWMWPINGWADMPFYGAPTSWHRYS 503

Query: 88  FN----------ISQKYPYWNRTGGADHFYVACHSIG----------RSAMQKAW-EVKL 126
                       I   +P+W+R GG DH ++  H  G           S M   W  + L
Sbjct: 504 NAANLWLKAKTWIQSNFPFWDRRGGRDHIWMTNHDEGACYMPTEIYQTSIMLTHWGRMDL 563

Query: 127 NAIQVVC------SSSYFISGHIAHKDVSL-----PQIWPRQE-------------DPPK 162
           N            S      G +  KDV       P   PR++               P 
Sbjct: 564 NHTSNTAYRPDNYSDGITWKGVLDGKDVKTLYQGHPCYDPRKDLVIPAFKTPDHFSQSPL 623

Query: 163 LGS--SKRNKLAFFAGAVNS--------PVREKLLQV-----WRNDSEIY-AHSGRLKTP 206
           LGS   +R+ L +  G V           +R+KL ++     W     I+      ++  
Sbjct: 624 LGSWPRQRDILLYLRGDVGKHREPNYSRGIRQKLYKLAVDNEWAKKHRIFIGEQFEIQGS 683

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y + L  S FC  V G +  + R  D++ +GC+P+II ++  + F  +++  SFS+ ++ 
Sbjct: 684 YGEHLSRSLFCAVVPG-DGYSPRFEDAVLHGCLPLIIVDNTHVLFESIIDVDSFSLRISE 742

Query: 267 LDI-PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVF 303
             +   L  +L  IS ++   +Q      R    WH F
Sbjct: 743 AALNEYLPHLLTAISPDQIARMQR-----RLSLVWHRF 775


>gi|432882349|ref|XP_004073986.1| PREDICTED: exostosin-1b-like [Oryzias latipes]
          Length = 429

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 25/111 (22%), Positives = 56/111 (50%), Gaps = 5/111 (4%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y + L  S FCL  +G  + + R  ++L   CVPV+++N ++LPF+++++W   +++   
Sbjct: 7   YKEMLHNSTFCLVPRGRRLGSFRFLEALQAACVPVVLSNGWELPFSEIIDWNRATVIGDE 66

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
             +  +   ++ I  ++ L      L+ +  F W  + S  +   +   E+
Sbjct: 67  RLLLQIPTTVRSIHPDKIL-----SLRQQTQFLWEAYFSSVEKIVLTTLEI 112


>gi|302791425|ref|XP_002977479.1| galactosyltransferase-like protein [Selaginella moellendorffii]
 gi|300154849|gb|EFJ21483.1| galactosyltransferase-like protein [Selaginella moellendorffii]
          Length = 466

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 71/305 (23%), Positives = 109/305 (35%), Gaps = 46/305 (15%)

Query: 32  NYASESYFKKAFMKSHFVTKDPSKADLFFLPF----SIARMRHDRRIGTEGIPDFISHYI 87
            ++ E  F +  +     T DP  A  FF+PF     I+R        +  +       +
Sbjct: 133 QFSGEIIFHRRMLDHRCRTLDPDGASAFFVPFYAGLDISRNLWASGKSSSDVDSLGEQLL 192

Query: 88  FNISQKYPYWNRTGGADHFYVACHSIGR--------SAMQKAWEVKLNAIQVVCSSSYFI 139
             + +++P++NR+GGADHF VA    GR         +    W   L     + S    +
Sbjct: 193 HWLQRQHPHFNRSGGADHFLVA----GRISWDFRRMPSAAGEWGSSLFHQIEMRSVKRLV 248

Query: 140 SGHIAHKDVSLPQIWPRQEDPPK----------LGSSKRNKLAFFAGAVNSPVREKLLQV 189
                  D  L   +P    P            +  S R  L  FAG+     R    QV
Sbjct: 249 IERNPWDDSELGVPYPTSFHPSSDEDLAQWVEFVQGSPRPHLVAFAGSPRPGYRSDFRQV 308

Query: 190 W--------RNDSEIYAHSGRLKTPYADEL------LGSKFCLHVKGFEVNTARIADSLY 235
                    R  S     +       +D L      L S FCL  +G       + DSL 
Sbjct: 309 LLGQCRAAPRGISRCLDCTADTAGCTSDPLRVTKLFLSSVFCLQPRGDSFTRKSLFDSLI 368

Query: 236 YGCVPVIIAN-----HYDLPFA-DVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQS 289
            GC+PV+  N      Y+L    D   +  F    +  +   +  +L+GIS E    +Q 
Sbjct: 369 SGCIPVLFWNQSAYWQYELYLPRDPEEYSVFIPHQSVKNGTNVLDVLQGISRERIGRMQR 428

Query: 290 YVLKV 294
            VL++
Sbjct: 429 AVLRI 433


>gi|159483755|ref|XP_001699926.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158281868|gb|EDP07622.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 785

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 40/140 (28%), Positives = 69/140 (49%), Gaps = 9/140 (6%)

Query: 168 RNKLAFFAG------AVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVK 221
           R+ L +F G      A ++ VR+ LL ++ N +   A     K   +  +L S+FC    
Sbjct: 619 RDTLFYFNGFTKPDLAYSAGVRQGLLALFGNSTR--ADLSINKGGGSQRMLRSRFCFTPM 676

Query: 222 GFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISS 281
           GF     R++ ++  GCVP+++ +H      DVL ++ FSI V+  ++  L   L+ I+ 
Sbjct: 677 GFGWGI-RLSQAMLTGCVPIMVHDHVWPTLWDVLPYEQFSIRVSRHNMYRLLDYLESITP 735

Query: 282 EEYLLLQSYVLKVRKHFQWH 301
           ++   LQ  V +  K F W 
Sbjct: 736 QQLARLQDGVAQWHKAFVWQ 755


>gi|358335371|dbj|GAA53901.1| exostosin-2 [Clonorchis sinensis]
          Length = 1030

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 30/95 (31%), Positives = 47/95 (49%)

Query: 197 YAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLN 256
           Y    R    Y + L  S FCL ++   V    + DS+  GC+PVI  +++ LPF++VL+
Sbjct: 683 YDPPSRTIVDYGEALARSLFCLIIQIPPVGQFALFDSMNAGCIPVIADDNFILPFSEVLD 742

Query: 257 WKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYV 291
           W   +I V   ++  +   L   +SEE    Q  V
Sbjct: 743 WSKIAIRVRHSELHKIVTTLTSFTSEEIAQFQRQV 777


>gi|307108297|gb|EFN56537.1| hypothetical protein CHLNCDRAFT_51514 [Chlorella variabilis]
          Length = 810

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 37/134 (27%), Positives = 66/134 (49%), Gaps = 16/134 (11%)

Query: 161 PKLGSSKRNK--LAFFAGAV---NSPVREKLLQVWRNDSE----------IYAHSGRLKT 205
           P LG++ R++  LAF  G V   N P    L Q   N S           +      ++ 
Sbjct: 585 PLLGATPRSRTWLAFHRGRVQQDNPPYSRGLRQRLANASAAGGWLEKHKIVMGEHDVVEG 644

Query: 206 PYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVA 265
            Y++ L  S FCL + G +  +AR+ D++ +GC+PVII ++  + F  +L+  +F++ + 
Sbjct: 645 DYSELLASSTFCLVLPG-DGWSARMDDAMLHGCIPVIIMDNVHVSFESILDLAAFTVRIP 703

Query: 266 TLDIPLLKKILKGI 279
             D   L ++L  +
Sbjct: 704 QADAERLPEVLAAV 717



 Score = 40.0 bits (92), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 26/91 (28%), Positives = 39/91 (42%), Gaps = 15/91 (16%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-------FSIARMRHDRRIGTEGIPDFISH 85
           YA+ES F +  ++S   T DP +AD  ++P         + R     R    G+ +   H
Sbjct: 381 YAAESGFLEMLLQSEHRTLDPEEADFLYVPAYTSCLITPVQRTADSLRDMWYGVENLRVH 440

Query: 86  YIFN--------ISQKYPYWNRTGGADHFYV 108
              +        I    PYWNR GG DH ++
Sbjct: 441 AATHMLLEAYYWIKAHAPYWNRRGGWDHIWL 471


>gi|440797580|gb|ELR18663.1| exostosin, putative [Acanthamoeba castellanii str. Neff]
          Length = 423

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 56/276 (20%), Positives = 104/276 (37%), Gaps = 53/276 (19%)

Query: 40  KKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNR 99
           + A   S +  +DP KA +    F      ++  +GT       S  I    +  P W R
Sbjct: 99  RAALRASKYFVEDPEKACVLVPNFEHTLASNE--MGT-------SLVIAQTLRSLPTWER 149

Query: 100 TGGA-----------------DHFYVACHSIGRSA--MQKAWEVKLNAIQVVCSSSYFIS 140
            GG                  D  Y     +G S    + A+++ LN     C       
Sbjct: 150 YGGPGFNHLLFNKHDDVGVEYDPAYAMVAKVGWSTGHYRPAFDISLNP---PCGK----- 201

Query: 141 GHIAHKDVS---LPQIWPRQEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRND---- 193
           G    KD +   +P  W      P      +  L F     N P+R  + + + +     
Sbjct: 202 GRPGLKDAAGHVVPTWWAN----PDRKRENKYFLTFLGTMRNYPLRRAIAERFHDPDNGV 257

Query: 194 ------SEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHY 247
                  E       ++  Y D L  ++F L  +G  + + R  +++  G +PVI+ + Y
Sbjct: 258 IIQTSVEEQIGGKPSVEVEYLDTLFHTQFTLCPRGRALYSYRTTEAIAAGAIPVILGDGY 317

Query: 248 DLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEE 283
             P+ ++++W+SF++++       +  +L+  +SEE
Sbjct: 318 AFPYNELIDWRSFAVILPESSWETMMDVLRSFTSEE 353


>gi|383847108|ref|XP_003699197.1| PREDICTED: exostosin-1-like [Megachile rotundata]
          Length = 711

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 77/350 (22%), Positives = 136/350 (38%), Gaps = 72/350 (20%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPF 63
            F VYVYP        +V+ P+ ++   N  +ES          + T DP++A +F L  
Sbjct: 88  GFTVYVYP------VEDVISPL-YQKILNVITES---------RYYTSDPARACIFVLAL 131

Query: 64  SIARMRHDRR-IGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAW 122
                  DR  + TE I +  +  +     + PYWN   G +H     +S       +  
Sbjct: 132 DTL----DRDPLSTEFIHNLPAKLM-----RLPYWN--NGRNHLIFNLYSGTWPDYAEES 180

Query: 123 EVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW---------PRQEDPPKLGSSKRNKLAF 173
                   ++  +S  I  H  + DVS+P            P Q       +SK+   AF
Sbjct: 181 LAFDIGYAMLAKASMSIFRHRPNFDVSIPLFGKQHPERGGEPGQALENNFPNSKKYVAAF 240

Query: 174 ----FAGAVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE------------ 210
               +   + S  R  L  +      ++  + R       L+  +  +            
Sbjct: 241 KGKRYVHGIGSETRNALYHLHNGKDLVFVTTCRHGKAWRELQDEHCQQDNQEYDMYDYEI 300

Query: 211 -LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS---FSIVVAT 266
            L+ + FCL  +G  + + R  ++L  GC+PVI++N + LPF + ++W     FS     
Sbjct: 301 LLMNATFCLVPRGRRLGSFRFLEALRAGCIPVILSNGWALPFHERIDWTQAVIFSDERLL 360

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYE 316
           L IP    I++ +S+     +Q   L+ +  F W  + S  +     ++E
Sbjct: 361 LQIP---DIVRSVSN-----VQILKLRQQTQFLWERYFSSIEKIVFTVFE 402


>gi|170583869|ref|XP_001896766.1| exostosin-1 [Brugia malayi]
 gi|158595933|gb|EDP34387.1| exostosin-1, putative [Brugia malayi]
          Length = 411

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 20/76 (26%), Positives = 44/76 (57%)

Query: 213 GSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLL 272
            S FCL  +G  + + R  ++L  GC+PV++++ ++LPF++V++W+   I+     +  +
Sbjct: 158 NSTFCLTPRGRRLGSFRFLEALRLGCIPVVLSDDWELPFSEVIDWRQAVIIGHEDTVLTI 217

Query: 273 KKILKGISSEEYLLLQ 288
             +L  I  +  L ++
Sbjct: 218 SDVLNAIPLDRILFMK 233


>gi|340727245|ref|XP_003401958.1| PREDICTED: exostosin-2-like isoform 2 [Bombus terrestris]
          Length = 710

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 23/82 (28%), Positives = 46/82 (56%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y + L  + FCL ++G  +  + + + +  G +P+IIA+   +P+  +++W   +I V  
Sbjct: 301 YPNVLQTATFCLVIRGARLGQSTLLECMAAGSIPIIIADSLTMPYHGIIDWSRAAIFVRE 360

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
           +DI  +  +LK IS +  + LQ
Sbjct: 361 VDILSIISVLKKISPQRIIELQ 382


>gi|350423159|ref|XP_003493402.1| PREDICTED: exostosin-2-like [Bombus impatiens]
          Length = 708

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 23/82 (28%), Positives = 46/82 (56%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y + L  + FCL ++G  +  + + + +  G +P+IIA+   +P+  +++W   +I V  
Sbjct: 301 YPNVLQTATFCLVIRGARLGQSTLLECMAAGSIPIIIADSLTMPYHGIIDWSRAAIFVRE 360

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
           +DI  +  +LK IS +  + LQ
Sbjct: 361 VDILSIISVLKKISPQRIIELQ 382


>gi|340727243|ref|XP_003401957.1| PREDICTED: exostosin-2-like isoform 1 [Bombus terrestris]
          Length = 708

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 23/82 (28%), Positives = 46/82 (56%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y + L  + FCL ++G  +  + + + +  G +P+IIA+   +P+  +++W   +I V  
Sbjct: 301 YPNVLQTATFCLVIRGARLGQSTLLECMAAGSIPIIIADSLTMPYHGIIDWSRAAIFVRE 360

Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
           +DI  +  +LK IS +  + LQ
Sbjct: 361 VDILSIISVLKKISPQRIIELQ 382


>gi|195381971|ref|XP_002049706.1| GJ20607 [Drosophila virilis]
 gi|194144503|gb|EDW60899.1| GJ20607 [Drosophila virilis]
          Length = 757

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 77/326 (23%), Positives = 128/326 (39%), Gaps = 59/326 (18%)

Query: 5   FRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFLP 62
           F VYVYP            P++       +S +Y K   A  +S + T DP+ A LF L 
Sbjct: 100 FLVYVYPPE----------PLNSLGAAPPSSANYQKILTAIQESRYHTSDPAAACLFVLG 149

Query: 63  FSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYV----------ACHS 112
                   DR   +E   D++ +    ++ + P+WN   G +H             A HS
Sbjct: 150 IDTL----DRDSLSE---DYVRNVPSRLA-RLPHWN--NGRNHIIFNLYSGTWPDYAEHS 199

Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLA 172
           +G  A + A   K +   +     + +S  + HK   L              ++K+  LA
Sbjct: 200 LGFDAGE-AILAKASMSVLQMRHGFDVSIPLFHKQFPLRAGATGSVQSNNFPANKKYLLA 258

Query: 173 F----FAGAVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE----------- 210
           F    +   + S  R  L  +      +   + R       L+    DE           
Sbjct: 259 FKGKRYVHGIGSETRNSLFHLHNGRDMVMVTTCRHGKSWRELQDNRCDEDNREYDRYDYE 318

Query: 211 --LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
             L  S FCL  +G  + + R  ++L  GC+PV+++N + LPF   ++WK  +I      
Sbjct: 319 TLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWADERL 378

Query: 269 IPLLKKILKGISSEEYLLL--QSYVL 292
           +  +  I++ IS+E    L  Q+ VL
Sbjct: 379 LLQVPDIVRSISAERIFALRQQTQVL 404


>gi|198457239|ref|XP_001360599.2| GA10084 [Drosophila pseudoobscura pseudoobscura]
 gi|198135910|gb|EAL25174.2| GA10084 [Drosophila pseudoobscura pseudoobscura]
          Length = 765

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 77/328 (23%), Positives = 130/328 (39%), Gaps = 63/328 (19%)

Query: 5   FRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFLP 62
           F VYVYP            P++        S +Y K   A  +S + T DPS A LF L 
Sbjct: 108 FLVYVYPPE----------PLNSLGAAPPTSANYQKILTAIQESRYHTSDPSAACLFVLG 157

Query: 63  FSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAW 122
                   DR   +E   D++ +    ++ + P+WN   G +H     +S G        
Sbjct: 158 MDTL----DRDSLSE---DYVRNVPSRLA-RLPHWN--NGRNHIIFNLYS-GTWPDYAEN 206

Query: 123 EVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSS-----------KRNK 170
            +  +A   + + +      + H  DVS+P     ++ P + G++           K+  
Sbjct: 207 SLGFDAGHAILAKASMSVLQVRHGFDVSIPLF--HKQFPLRAGATGSVQSNNFPANKKYL 264

Query: 171 LAF----FAGAVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE--------- 210
           LAF    +   + S  R  L  +      +   + R       L+    DE         
Sbjct: 265 LAFKGKRYVHGIGSETRNSLFHLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYDRYD 324

Query: 211 ----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
               L  S FCL  +G  + + R  ++L  GC+PV+++N + LPF   ++WK  +I    
Sbjct: 325 YETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWADE 384

Query: 267 LDIPLLKKILKGISSEEYLLL--QSYVL 292
             +  +  I++ IS+E    L  Q+ VL
Sbjct: 385 RLLLQVPDIVRSISAERIFALRQQTQVL 412


>gi|357628255|gb|EHJ77645.1| hypothetical protein KGM_04618 [Danaus plexippus]
          Length = 655

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 68/328 (20%), Positives = 125/328 (38%), Gaps = 50/328 (15%)

Query: 29  PRGNYASESYFK--KAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHY 86
           P     S +Y K      +S + T DP++A LF    ++  +  D  +  E IPD  S  
Sbjct: 20  PTDGPVSATYRKVLSVVRESRYATHDPAEACLFIP--AVDTLDADP-LSPEHIPDVASRL 76

Query: 87  IFNISQKYPYWNRTGGADHFYVACHS-----IGRSAMQ----KAWEVKLNAIQVVCSSSY 137
                 + PYW    G +H     ++         A+      A   + +A + +    +
Sbjct: 77  -----SRLPYWK--NGRNHLLFNLYAGTWPDYAEGALGFDPGDAILARASASETIFRDGF 129

Query: 138 FISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAF----FAGAVNSPVREKLLQVWRND 193
            IS  + HK+       P         + +++ LAF    +   + S  R  L  +   +
Sbjct: 130 DISLPLFHKEHPERGGVPPSATGNPFPAPRKHLLAFKGKRYVHGIGSETRNSLWHLHDGN 189

Query: 194 SEIYAHSGR-------LKTPYADE-------------LLGSKFCLHVKGFEVNTARIADS 233
           + I   + R       L+    DE             L  S FCL  +G  + + R  ++
Sbjct: 190 NLILVTTCRHGKSWKDLRDERCDEDNREYDKFDYEQLLANSTFCLVARGRRLGSYRFLEA 249

Query: 234 LYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK 293
           L  GCVPV+++N + LPF + ++W+   I      +  + ++++ +  E  L      L+
Sbjct: 250 LAAGCVPVLLSNGWRLPFDERIDWRRAVIWADERLLLQVPELVRSVPPERIL-----ALR 304

Query: 294 VRKHFQWHVFPSDYDAFYMVMYELWLRR 321
            +    W  + S  +       E+ L R
Sbjct: 305 QQTQLLWEQYFSSIEKIVFTTIEILLER 332


>gi|195025057|ref|XP_001985992.1| GH20787 [Drosophila grimshawi]
 gi|193901992|gb|EDW00859.1| GH20787 [Drosophila grimshawi]
          Length = 754

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 77/329 (23%), Positives = 124/329 (37%), Gaps = 65/329 (19%)

Query: 5   FRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFLP 62
           F VYVYP            P++       +S +Y K   A  +S + T DP+ A LF L 
Sbjct: 98  FLVYVYPPE----------PLNSLGAAPPSSANYQKILTAIQESRYHTSDPAAACLFVL- 146

Query: 63  FSIARMRHDRRIGTEGIPDFISHYIFNISQKY---PYWNRTGGADHFYV----------A 109
                      I T         Y+ N+  +    P+WN   G +H             A
Sbjct: 147 ----------GIDTLDRDSLSDDYVRNVPSRLARLPHWN--NGRNHIIFNLYSGTWPDYA 194

Query: 110 CHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRN 169
            HS+G  A + A   K +   +     + +S  + HK   L              ++K+ 
Sbjct: 195 EHSLGFDAGE-AILAKASMSVLQMRHGFDVSIPLFHKQFPLRAGATGSVQSNNFPANKKY 253

Query: 170 KLAF----FAGAVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE-------- 210
            LAF    +   + S  R  L  +      +   + R       L+    DE        
Sbjct: 254 LLAFKGKRYVHGIGSETRNSLFHLHNGRDMVMVTTCRHGKSWRELQDNRCDEDNREYDRY 313

Query: 211 -----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVA 265
                L  S FCL  +G  + + R  ++L  GC+PV+++N + LPF   ++WK  +I   
Sbjct: 314 DYETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWAD 373

Query: 266 TLDIPLLKKILKGISSEEYLLL--QSYVL 292
              +  +  I++ IS+E    L  Q+ VL
Sbjct: 374 ERLLLQVPDIVRSISAERIFALRQQTQVL 402


>gi|412988024|emb|CCO19420.1| exostosin-like glycosyltransferase [Bathycoccus prasinos]
          Length = 909

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 42/156 (26%), Positives = 77/156 (49%), Gaps = 24/156 (15%)

Query: 165 SSKRNKLAFFAG--AVNSP--------VREKLLQVWRN--DSEIYAHSGRLKTPYADELL 212
           +++R    FF+G   +  P        +R+K+ + W+N    +I +H+      Y   + 
Sbjct: 695 ATQRTTFCFFSGNLGLEKPWGEDYSRGLRQKVARRWQNVYGFDILSHT----DDYLGRIR 750

Query: 213 GSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV-LNWKSFSIVVATLDIP- 270
            SKFCL + G +  +  ++  +  GC+PVI+ +  D+P+    L++  FSI V   D+  
Sbjct: 751 SSKFCLALPG-DGWSGGLSVYIRNGCIPVIVQDGVDMPWEGTFLDYSKFSIRVREGDVEN 809

Query: 271 LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSD 306
            L+ +L+ ++ EE   LQ+ +  V     WH F  D
Sbjct: 810 RLQSVLETVTPEELQNLQNGLKNV-----WHFFSYD 840


>gi|302817463|ref|XP_002990407.1| galactosyltransferase-like protein [Selaginella moellendorffii]
 gi|300141792|gb|EFJ08500.1| galactosyltransferase-like protein [Selaginella moellendorffii]
          Length = 530

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 72/311 (23%), Positives = 127/311 (40%), Gaps = 48/311 (15%)

Query: 27  FEPRGNYASESY-----FKKAFMKSHFVTKDPSKADLFFLPF----SIARMRHDRRIGTE 77
           F  RG YA+  +     F     +   +T D S A   F+P+     I+R      + T 
Sbjct: 146 FSDRGWYATNQFAVEVIFHNRMKQYDCLTNDSSIAAAIFVPYYAGLDISRYLWGVNVSTR 205

Query: 78  GIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI----GRSAMQKAWEVKLNAI-QVV 132
              D  +  + +   + P W R GG DHF VA         ++  +  W  KL  I +V 
Sbjct: 206 ---DSGALRMVDWLVRQPEWRRMGGRDHFMVAGRITWDFRRKTEKEDDWGNKLFIIPEVK 262

Query: 133 CSSSYFISGHIAH-KDVSLP---QIWPRQED-----PPKLGSSKRNKLAFFAGA----VN 179
             +S  I     H  D ++P      P Q+        ++   +R  L  FAGA    + 
Sbjct: 263 NITSLVIEASPWHFNDFAIPYPTYFHPTQDSDVVDWQVRMRGMERPALFSFAGAPRQQLR 322

Query: 180 SPVREKLLQVWRNDSE---IYAHSGRLKTPYADELLG----SKFCLHVKGFEVNTARIAD 232
             +RE+++   R   +   +    G  K      ++     S FCL  +G       I D
Sbjct: 323 KSIRERIMDQCRESPQCKLLECDFGESKCHVPSAVMKLFEESVFCLQPQGDSFTRRSIFD 382

Query: 233 SLYYGCVPVIIANHYDLPFADVL-----NWKSFSIVVATLDIPL----LKKILKGISSEE 283
           S+  GC+PV    H D  ++  +     N + +S+ ++ +DI      ++ +L+ I ++E
Sbjct: 383 SMLAGCIPVFF--HPDSAYSQFVWHLPRNHRKYSVFISEIDIRRGNVSIESVLRQIPADE 440

Query: 284 YLLLQSYVLKV 294
            L ++  V+++
Sbjct: 441 VLRMREEVIQL 451


>gi|195334376|ref|XP_002033859.1| GM21554 [Drosophila sechellia]
 gi|194125829|gb|EDW47872.1| GM21554 [Drosophila sechellia]
          Length = 760

 Score = 51.2 bits (121), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 75/328 (22%), Positives = 130/328 (39%), Gaps = 63/328 (19%)

Query: 5   FRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFLP 62
           F VY+YP            P++        S +Y K   A  +S + T DP+ A LF L 
Sbjct: 104 FLVYIYPPE----------PLNSLGAAPPTSANYQKILTAIQESRYYTSDPTAACLFVLG 153

Query: 63  FSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAW 122
                   DR   +E   D++ +    ++ + PYWN   G +H     +S G        
Sbjct: 154 IDTL----DRDSLSE---DYVRNVPSRLA-RLPYWN--NGRNHIIFNLYS-GTWPDYAEN 202

Query: 123 EVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSS-----------KRNK 170
            +  +A + + + +      + H  DVS+P     ++ P + G++           K+  
Sbjct: 203 SLGFDAGEAILAKASMGVLQLRHGFDVSIPLF--HKQFPLRAGATGTVQSNNFPANKKYL 260

Query: 171 LAF----FAGAVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE--------- 210
           LAF    +   + S  R  L  +      +   + R       L+    DE         
Sbjct: 261 LAFKGKRYVHGIGSETRNSLFHLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYDRYD 320

Query: 211 ----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
               L  S FCL  +G  + + R  ++L  GC+PV+++N + LPF   ++WK  +I    
Sbjct: 321 YETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWADE 380

Query: 267 LDIPLLKKILKGISSEEYLLL--QSYVL 292
             +  +  I++ I +E    L  Q+ VL
Sbjct: 381 RLLLQVPDIVRSIPAERIFALRQQTQVL 408


>gi|3435314|gb|AAC32397.1| putative tumor suppressor homolog [Drosophila melanogaster]
          Length = 760

 Score = 51.2 bits (121), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 75/355 (21%), Positives = 136/355 (38%), Gaps = 66/355 (18%)

Query: 5   FRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFLP 62
           F VY+YP            P++        S +Y K   A  +S + T DP+ A LF L 
Sbjct: 104 FLVYIYPPE----------PLNSLGAAPPTSANYQKILTAIQESRYYTSDPTAACLFVLG 153

Query: 63  FSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAW 122
                   DR   +E   D++ +    ++ + PYWN   G +H     +S G        
Sbjct: 154 IDTL----DRDSLSE---DYVRNVPSRLA-RLPYWN--NGRNHIIFNLYS-GTWPDYAEN 202

Query: 123 EVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSS-----------KRNK 170
            +  +A + + + +      + H  DVS+P     ++ P + G++           K+  
Sbjct: 203 SLGFDAGEAILAKASMGVLQLRHGFDVSIPLF--HKQFPLRAGATGTVQSNNFPANKKYL 260

Query: 171 LAF----FAGAVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE--------- 210
           LAF    +   + S  R  L  +      +   + R       L+    DE         
Sbjct: 261 LAFKGKRYVHGIGSETRNSLFHLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYDRYD 320

Query: 211 ----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
               L  S FCL  +G  + + R  ++L  GC+PV+++N + LPF   ++WK  +I    
Sbjct: 321 YETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWADE 380

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
             +  +  I++ I +E       + L+ +    W  +    +      +E+   R
Sbjct: 381 RLLLQVPDIVRSIPAERI-----FALRQQTQVLWERYFGSIEKIVFTTFEIIRER 430


>gi|302812249|ref|XP_002987812.1| galactosyltransferase-like protein [Selaginella moellendorffii]
 gi|300144431|gb|EFJ11115.1| galactosyltransferase-like protein [Selaginella moellendorffii]
          Length = 530

 Score = 51.2 bits (121), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 72/311 (23%), Positives = 127/311 (40%), Gaps = 48/311 (15%)

Query: 27  FEPRGNYASESY-----FKKAFMKSHFVTKDPSKADLFFLPF----SIARMRHDRRIGTE 77
           F  RG YA+  +     F     +   +T D S A   F+P+     I+R      + T 
Sbjct: 146 FSDRGWYATNQFAVEVIFHNRMKQYDCLTNDSSIAAAIFVPYYAGLDISRYLWGVNVSTR 205

Query: 78  GIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI----GRSAMQKAWEVKLNAI-QVV 132
              D  +  + +   + P W R GG DHF VA         ++  +  W  KL  I +V 
Sbjct: 206 ---DSGALRMVDWLVRQPEWRRMGGRDHFMVAGRITWDFRRKTEKEDDWGNKLFIIPEVK 262

Query: 133 CSSSYFISGHIAH-KDVSLP---QIWPRQED-----PPKLGSSKRNKLAFFAGA----VN 179
             +S  I     H  D ++P      P Q+        ++   +R  L  FAGA    + 
Sbjct: 263 NITSLVIEASPWHFNDFAIPYPTYFHPTQDSDVVDWQVRMRGMERPALFSFAGAPRQQLR 322

Query: 180 SPVREKLLQVWRNDSE---IYAHSGRLKTPYADELLG----SKFCLHVKGFEVNTARIAD 232
             +RE+++   R   +   +    G  K      ++     S FCL  +G       I D
Sbjct: 323 KSIRERIMDQCRESPQCKLLECDFGESKCHVPSAVMKLFEESVFCLQPQGDSFTRRSIFD 382

Query: 233 SLYYGCVPVIIANHYDLPFADVL-----NWKSFSIVVATLDIPL----LKKILKGISSEE 283
           S+  GC+PV    H D  ++  +     N + +S+ ++ +DI      ++ +L+ I ++E
Sbjct: 383 SMLAGCIPVFF--HPDSAYSQFVWHLPRNHRKYSVFISEIDIRRGNVSIESVLRQIPADE 440

Query: 284 YLLLQSYVLKV 294
            L ++  V+++
Sbjct: 441 VLRMREEVIQL 451


>gi|195583406|ref|XP_002081513.1| GD11060 [Drosophila simulans]
 gi|194193522|gb|EDX07098.1| GD11060 [Drosophila simulans]
          Length = 760

 Score = 51.2 bits (121), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 75/328 (22%), Positives = 130/328 (39%), Gaps = 63/328 (19%)

Query: 5   FRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFLP 62
           F VY+YP            P++        S +Y K   A  +S + T DP+ A LF L 
Sbjct: 104 FLVYIYPPE----------PLNSLGAAPPTSANYQKILTAIQESRYYTSDPTAACLFVLG 153

Query: 63  FSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAW 122
                   DR   +E   D++ +    ++ + PYWN   G +H     +S G        
Sbjct: 154 IDTL----DRDSLSE---DYVRNVPSRLA-RLPYWN--NGRNHIIFNLYS-GTWPDYAEN 202

Query: 123 EVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSS-----------KRNK 170
            +  +A + + + +      + H  DVS+P     ++ P + G++           K+  
Sbjct: 203 SLGFDAGEAILAKASMGVLQLRHGFDVSIPLF--HKQFPLRAGATGTVQSNNFPANKKYL 260

Query: 171 LAF----FAGAVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE--------- 210
           LAF    +   + S  R  L  +      +   + R       L+    DE         
Sbjct: 261 LAFKGKRYVHGIGSETRNSLFHLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYDRYD 320

Query: 211 ----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
               L  S FCL  +G  + + R  ++L  GC+PV+++N + LPF   ++WK  +I    
Sbjct: 321 YETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWADE 380

Query: 267 LDIPLLKKILKGISSEEYLLL--QSYVL 292
             +  +  I++ I +E    L  Q+ VL
Sbjct: 381 RLLLQVPDIVRSIPAERIFALRQQTQVL 408


>gi|66517433|ref|XP_391845.2| PREDICTED: exostosin-1 [Apis mellifera]
          Length = 711

 Score = 51.2 bits (121), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 76/355 (21%), Positives = 140/355 (39%), Gaps = 80/355 (22%)

Query: 3   RSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLP 62
           + F VYVYP        +V+ P+ ++   N  +ES          + T DP++A +F L 
Sbjct: 87  QGFTVYVYP------IEDVISPL-YQKILNVITES---------RYYTSDPTRACIFVLA 130

Query: 63  FSIARMRHDRR-IGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKA 121
                   DR  + TE + +  S  +     + PYWN   G +H     +S       + 
Sbjct: 131 LDTL----DRDPLSTEFVHNLPSKLM-----RLPYWN--NGRNHLIFNLYSGTWPDYAEE 179

Query: 122 WEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS----------SKRNKL 171
                    ++  +S  I  H    D+S+P     ++ P + G           + +  +
Sbjct: 180 SLAFDLGYAMLAKASMSIFRHRPEFDISIPLF--GKQHPERGGEPGQALENNFPNNKKYI 237

Query: 172 AFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE--------- 210
           A F G      + S  R  L  +      ++  + R       L+  +  +         
Sbjct: 238 AAFKGKRYVHGIGSETRNALYHLHNGKDLVFVTTCRHGKAWRELQDEHCQQDNQEYDMYD 297

Query: 211 ----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS---FSIV 263
               L+ + FCL  +G  + + R  ++L  GC+PVI++N + LPF + ++W     FS  
Sbjct: 298 YEILLMNATFCLVPRGRRLGSFRFLEALRAGCIPVILSNGWALPFHERIDWTQAVIFSDE 357

Query: 264 VATLDIPLLKKILKGISSEEYLLLQSYVLKVRK--HFQWHVFPSDYDAFYMVMYE 316
              L IP    I++ +S+        ++LK+R+   F W  + S  +     ++E
Sbjct: 358 RLLLQIP---DIVRSVSN-------VHILKLRQQTQFLWERYFSSIEKIVFTVFE 402


>gi|17137330|ref|NP_477231.1| tout-velu, isoform A [Drosophila melanogaster]
 gi|442623684|ref|NP_001260971.1| tout-velu, isoform B [Drosophila melanogaster]
 gi|61212925|sp|Q9V730.1|EXT1_DROME RecName: Full=Exostosin-1; AltName: Full=Protein tout-velu
 gi|7303172|gb|AAF58236.1| tout-velu, isoform A [Drosophila melanogaster]
 gi|60678089|gb|AAX33551.1| LD10920p [Drosophila melanogaster]
 gi|85701308|dbj|BAE78509.1| Tout-velu [Drosophila melanogaster]
 gi|220960240|gb|ACL92656.1| ttv-PA [synthetic construct]
 gi|440214386|gb|AGB93503.1| tout-velu, isoform B [Drosophila melanogaster]
          Length = 760

 Score = 51.2 bits (121), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 75/328 (22%), Positives = 130/328 (39%), Gaps = 63/328 (19%)

Query: 5   FRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFLP 62
           F VY+YP            P++        S +Y K   A  +S + T DP+ A LF L 
Sbjct: 104 FLVYIYPPE----------PLNSLGAAPPTSANYQKILTAIQESRYYTSDPTAACLFVLG 153

Query: 63  FSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAW 122
                   DR   +E   D++ +    ++ + PYWN   G +H     +S G        
Sbjct: 154 IDTL----DRDSLSE---DYVRNVPSRLA-RLPYWN--NGRNHIIFNLYS-GTWPDYAEN 202

Query: 123 EVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSS-----------KRNK 170
            +  +A + + + +      + H  DVS+P     ++ P + G++           K+  
Sbjct: 203 SLGFDAGEAILAKASMGVLQLRHGFDVSIPLF--HKQFPLRAGATGTVQSNNFPANKKYL 260

Query: 171 LAF----FAGAVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE--------- 210
           LAF    +   + S  R  L  +      +   + R       L+    DE         
Sbjct: 261 LAFKGKRYVHGIGSETRNSLFHLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYDRYD 320

Query: 211 ----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
               L  S FCL  +G  + + R  ++L  GC+PV+++N + LPF   ++WK  +I    
Sbjct: 321 YETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWADE 380

Query: 267 LDIPLLKKILKGISSEEYLLL--QSYVL 292
             +  +  I++ I +E    L  Q+ VL
Sbjct: 381 RLLLQVPDIVRSIPAERIFALRQQTQVL 408


>gi|194883052|ref|XP_001975618.1| GG20465 [Drosophila erecta]
 gi|190658805|gb|EDV56018.1| GG20465 [Drosophila erecta]
          Length = 760

 Score = 51.2 bits (121), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 75/328 (22%), Positives = 130/328 (39%), Gaps = 63/328 (19%)

Query: 5   FRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFLP 62
           F VY+YP            P++        S +Y K   A  +S + T DP+ A LF L 
Sbjct: 104 FLVYIYPPE----------PLNSLGAAPPTSANYQKILTAIQESRYYTSDPTAACLFVLG 153

Query: 63  FSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAW 122
                   DR   +E   D++ +    ++ + PYWN   G +H     +S G        
Sbjct: 154 IDTL----DRDSLSE---DYVRNVPSRLA-RLPYWN--NGRNHIIFNLYS-GTWPDYAEN 202

Query: 123 EVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSS-----------KRNK 170
            +  +A + + + +      + H  DVS+P     ++ P + G++           K+  
Sbjct: 203 SLGFDAGEAILAKASMGVLQLRHGFDVSIPLF--HKQFPLRAGATGSVQSNNFPANKKYL 260

Query: 171 LAF----FAGAVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE--------- 210
           LAF    +   + S  R  L  +      +   + R       L+    DE         
Sbjct: 261 LAFKGKRYVHGIGSETRNSLFHLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYDRYD 320

Query: 211 ----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
               L  S FCL  +G  + + R  ++L  GC+PV+++N + LPF   ++WK  +I    
Sbjct: 321 YETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWADE 380

Query: 267 LDIPLLKKILKGISSEEYLLL--QSYVL 292
             +  +  I++ I +E    L  Q+ VL
Sbjct: 381 RLLLQVPDIVRSIPAERIFALRQQTQVL 408


>gi|195486003|ref|XP_002091324.1| GE13596 [Drosophila yakuba]
 gi|194177425|gb|EDW91036.1| GE13596 [Drosophila yakuba]
          Length = 760

 Score = 51.2 bits (121), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 75/328 (22%), Positives = 130/328 (39%), Gaps = 63/328 (19%)

Query: 5   FRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFLP 62
           F VY+YP            P++        S +Y K   A  +S + T DP+ A LF L 
Sbjct: 104 FLVYIYPPE----------PLNSLGAAPPTSANYQKILTAIQESRYYTSDPTAACLFVLG 153

Query: 63  FSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAW 122
                   DR   +E   D++ +    ++ + PYWN   G +H     +S G        
Sbjct: 154 IDTL----DRDSLSE---DYVRNVPSRLA-RLPYWN--NGRNHIIFNLYS-GTWPDYAEN 202

Query: 123 EVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSS-----------KRNK 170
            +  +A + + + +      + H  DVS+P     ++ P + G++           K+  
Sbjct: 203 SLGFDAGEAILAKASMGVLQLRHGFDVSIPLF--HKQFPLRAGATGSVQSNNFPANKKYL 260

Query: 171 LAF----FAGAVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE--------- 210
           LAF    +   + S  R  L  +      +   + R       L+    DE         
Sbjct: 261 LAFKGKRYVHGIGSETRNSLFHLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYDRYD 320

Query: 211 ----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
               L  S FCL  +G  + + R  ++L  GC+PV+++N + LPF   ++WK  +I    
Sbjct: 321 YETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWADE 380

Query: 267 LDIPLLKKILKGISSEEYLLL--QSYVL 292
             +  +  I++ I +E    L  Q+ VL
Sbjct: 381 RLLLQVPDIVRSIPAERIFALRQQTQVL 408


>gi|444732432|gb|ELW72727.1| Exostosin-1 [Tupaia chinensis]
          Length = 436

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 26/92 (28%), Positives = 47/92 (51%), Gaps = 17/92 (18%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y + L  + FCL  +G  + + R  ++L   CVPV+++N ++LPF++V+NW   +++   
Sbjct: 54  YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVI--- 110

Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
                          +E LLLQ    ++ KH 
Sbjct: 111 --------------GDERLLLQIIQDRIFKHI 128


>gi|350421075|ref|XP_003492724.1| PREDICTED: exostosin-1-like isoform 1 [Bombus impatiens]
 gi|350421078|ref|XP_003492725.1| PREDICTED: exostosin-1-like isoform 2 [Bombus impatiens]
          Length = 711

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 77/355 (21%), Positives = 140/355 (39%), Gaps = 80/355 (22%)

Query: 3   RSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLP 62
           + F VYVYP        +V+ P+ ++   N  +ES          + T DP++A +F L 
Sbjct: 87  QGFTVYVYP------VEDVISPL-YQKILNVITES---------RYYTSDPTRACIFVLA 130

Query: 63  FSIARMRHDRR-IGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKA 121
                   DR  + TE + +  S  I     + PYWN   G +H     +S       + 
Sbjct: 131 LDTL----DRDPLSTEFVHNLPSKLI-----RLPYWN--NGRNHLIFNLYSGTWPDYAEE 179

Query: 122 WEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS----------SKRNKL 171
                    ++  +S  I  H    D+S+P     ++ P + G           + +  +
Sbjct: 180 SLAFDLGYAMLAKASMSIFRHRPDFDISIPLF--GKQHPERGGEPGQALENNFPNNKKYV 237

Query: 172 AFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE--------- 210
           A F G      + S  R  L  +      ++  + R       L+  +  +         
Sbjct: 238 AAFKGKRYVHGIGSETRNALYHLHNGKDLVFVTTCRHGKAWRELQDEHCQQDNQEYDTYD 297

Query: 211 ----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS---FSIV 263
               L+ + FCL  +G  + + R  ++L  GC+PVI++N + LPF + ++W     FS  
Sbjct: 298 YEILLMNATFCLVPRGRRLGSFRFLEALRAGCIPVILSNGWALPFHERIDWTQAVIFSDE 357

Query: 264 VATLDIPLLKKILKGISSEEYLLLQSYVLKVRK--HFQWHVFPSDYDAFYMVMYE 316
              L IP    I++ +S+        ++LK+R+   F W  + S  +     ++E
Sbjct: 358 RLLLQIP---DIVRSVSN-------VHILKLRQQTQFLWERYFSSIEKIVFTVFE 402


>gi|118781611|ref|XP_311558.3| AGAP010388-PA [Anopheles gambiae str. PEST]
 gi|116130033|gb|EAA07205.3| AGAP010388-PA [Anopheles gambiae str. PEST]
          Length = 744

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 81/360 (22%), Positives = 143/360 (39%), Gaps = 70/360 (19%)

Query: 2   NRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLF 59
           +++F VYVYP    +P  ++  P          S++Y K   +  +S + T DP +A LF
Sbjct: 98  DKNFYVYVYPP---EPLNSLGAPPPI-------SQNYQKIISSIQESRYYTTDPEQACLF 147

Query: 60  FLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQ 119
            L         DR   +E   DF+ +    + Q+ P+WN   G +H     +S G     
Sbjct: 148 VLGIDTL----DRDSLSE---DFVRNVPSRL-QRLPHWN--NGRNHIIFNLYS-GTWPDY 196

Query: 120 KAWEVKLNAIQ-VVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSS-----------K 167
               +  +  Q ++  +S  I       DVS+P     ++ P + G++           K
Sbjct: 197 NENGLGFDPGQAILAKASMSIQSLRPGFDVSIPLF--HKQFPLRGGNTGFVISNNFPANK 254

Query: 168 RNKLAF----FAGAVNSPVREKLLQV-----------------WRNDSEIYAHSGRL--- 203
           +  LAF    +   + S  R  L  +                 WR+  +           
Sbjct: 255 KYLLAFKGKRYVHGIGSETRNSLFHLHNARDFVLVTTCKHGKSWRDLQDARCDEDNREYD 314

Query: 204 KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIV 263
           +  Y   L  S FCL  +G  + + R  + L  GC+PV+++N + LPF   ++WK  +I 
Sbjct: 315 RYDYETLLQNSTFCLVPRGRRLGSFRFLEVLQAGCIPVLLSNSWVLPFQSKIDWKQAAIW 374

Query: 264 VATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQ--WHVFPSDYDAFYMVMYELWLRR 321
                +  +  I++ IS+       S +L +R+  Q  W  + S  +      +E+   R
Sbjct: 375 ADERLLLQVPDIVRSIST-------SRILALRQQTQVLWERYFSSIEKIIFTTFEIIRER 427


>gi|340719750|ref|XP_003398310.1| PREDICTED: exostosin-1-like isoform 1 [Bombus terrestris]
 gi|340719752|ref|XP_003398311.1| PREDICTED: exostosin-1-like isoform 2 [Bombus terrestris]
          Length = 711

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 77/355 (21%), Positives = 140/355 (39%), Gaps = 80/355 (22%)

Query: 3   RSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLP 62
           + F VYVYP        +V+ P+ ++   N  +ES          + T DP++A +F L 
Sbjct: 87  QGFTVYVYP------VEDVISPL-YQKILNVITES---------RYYTSDPTRACIFVLA 130

Query: 63  FSIARMRHDRR-IGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKA 121
                   DR  + TE + +  S  I     + PYWN   G +H     +S       + 
Sbjct: 131 LDTL----DRDPLSTEFVHNLPSKLI-----RLPYWN--NGRNHLIFNLYSGTWPDYAEE 179

Query: 122 WEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS----------SKRNKL 171
                    ++  +S  I  H    D+S+P     ++ P + G           + +  +
Sbjct: 180 SLAFDLGYAMLAKASMSIFRHRPDFDISIPLF--GKQHPERGGEPGQALENNFPNNKKYV 237

Query: 172 AFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE--------- 210
           A F G      + S  R  L  +      ++  + R       L+  +  +         
Sbjct: 238 AAFKGKRYVHGIGSETRNALYHLHNGKDLVFVTTCRHGKAWRELQDEHCQQDNQEYDTYD 297

Query: 211 ----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS---FSIV 263
               L+ + FCL  +G  + + R  ++L  GC+PVI++N + LPF + ++W     FS  
Sbjct: 298 YEILLMNATFCLVPRGRRLGSFRFLEALRAGCIPVILSNGWALPFHERIDWTQAVIFSDE 357

Query: 264 VATLDIPLLKKILKGISSEEYLLLQSYVLKVRK--HFQWHVFPSDYDAFYMVMYE 316
              L IP    I++ +S+        ++LK+R+   F W  + S  +     ++E
Sbjct: 358 RLLLQIP---DIVRSVSN-------VHILKLRQQTQFLWERYFSSIEKIVFTVFE 402


>gi|195150443|ref|XP_002016164.1| GL11446 [Drosophila persimilis]
 gi|194110011|gb|EDW32054.1| GL11446 [Drosophila persimilis]
          Length = 676

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 47/88 (53%), Gaps = 2/88 (2%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  S FCL  +G  + + R  ++L  GC+PV+++N + LPF   ++WK  +I    
Sbjct: 325 YETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWADE 384

Query: 267 LDIPLLKKILKGISSEEYLLL--QSYVL 292
             +  +  I++ IS+E    L  Q+ VL
Sbjct: 385 RLLLQVPDIVRSISAERIFALRQQTQVL 412


>gi|195123279|ref|XP_002006135.1| GI20872 [Drosophila mojavensis]
 gi|193911203|gb|EDW10070.1| GI20872 [Drosophila mojavensis]
          Length = 761

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 47/88 (53%), Gaps = 2/88 (2%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
           Y   L  S FCL  +G  + + R  ++L  GC+PV+++N + LPF   ++WK  +I    
Sbjct: 321 YGTLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWADE 380

Query: 267 LDIPLLKKILKGISSEEYLLL--QSYVL 292
             +  +  I++ IS+E    L  Q+ VL
Sbjct: 381 RLLLQVPDIVRSISAERIFALRQQTQVL 408


>gi|299116802|emb|CBN74915.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 1052

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 60/248 (24%), Positives = 106/248 (42%), Gaps = 19/248 (7%)

Query: 36  ESYFKKAFMKSHFVTKDPSKADLFFLPF--SIARMRHDRRIGTEGIPDFISHYIFNISQK 93
           E +  +A +   +   +P +ADLF++P    ++    + R G +   + I+  +  ++  
Sbjct: 200 EIWLHRAMLAHPWRVANPEEADLFYVPMYPVLSTKLGNNRCGGKTHDELINTSVEYLALS 259

Query: 94  YPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSS---SYFISGHIAHKDVSL 150
             Y+ R GGADH  V      +SA+     + L    V  +     +   G    K V++
Sbjct: 260 SVYFRRFGGADHTLVCAWWNCKSALGPKPRMLLRRTVVGINEKMLEWTRWGCGLDKMVTI 319

Query: 151 P----QIWPRQEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSE-----IYAHS- 200
           P     +    E      +  R+   FF G        + L V    +E     +  H  
Sbjct: 320 PYTASSVLTTSEMIGGRAAEDRDIPFFFVGTARGRPERQNLDVVTGMAEGSVMMLGDHQS 379

Query: 201 --GRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVII-ANHYDLPFAD-VLN 256
             G   T YA  +  S+FC   +G   ++ RI D++  GC P++  A+   LPF++ VLN
Sbjct: 380 DWGMNSTQYAAHIARSRFCFCPRGDTESSRRIFDAVAAGCTPIVTEASVAVLPFSEHVLN 439

Query: 257 WKSFSIVV 264
           +  F++VV
Sbjct: 440 YSDFAVVV 447


>gi|302780765|ref|XP_002972157.1| galactosyltransferase-like protein [Selaginella moellendorffii]
 gi|300160456|gb|EFJ27074.1| galactosyltransferase-like protein [Selaginella moellendorffii]
          Length = 470

 Score = 50.8 bits (120), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 70/305 (22%), Positives = 109/305 (35%), Gaps = 46/305 (15%)

Query: 32  NYASESYFKKAFMKSHFVTKDPSKADLFFLPF----SIARMRHDRRIGTEGIPDFISHYI 87
            ++ E  F +  +     T DP  A  FF+PF     I+R        +  +       +
Sbjct: 133 QFSGEIIFHRRMLDHRCRTLDPDGASAFFVPFYAGLDISRNLWASGKSSSDVDSLGEQLL 192

Query: 88  FNISQKYPYWNRTGGADHFYVACHSIGR--------SAMQKAWEVKLNAIQVVCSSSYFI 139
             + +++P++NR+GGADHF VA    GR         +    W   L     + S    +
Sbjct: 193 HWLQRQHPHFNRSGGADHFLVA----GRISWDFRRMPSAAGEWGSSLFHQIEMRSVKRLV 248

Query: 140 SGHIAHKDVSLPQIWPRQEDPPK----------LGSSKRNKLAFFAGAVNSPVREKLLQV 189
                  D  L   +P    P            +  S R  L  FAG+     R    QV
Sbjct: 249 IERNPWDDSELGVPYPTSFHPSSDEDLAQWVEFVQGSPRPHLVAFAGSPRPGYRSDFRQV 308

Query: 190 W--------RNDSEIYAHSGRLKTPYADEL------LGSKFCLHVKGFEVNTARIADSLY 235
                    R  S     +       +D L      L S FCL  +G       + DSL 
Sbjct: 309 LLGQCRAAQRGISGCLDCTADTAGCTSDPLRVTKLFLSSVFCLQPRGDSFTRKSLFDSLI 368

Query: 236 YGCVPVIIAN-----HYDLPFA-DVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQS 289
            GC+PV+  N      Y+L    D   +  F    +  +   +  +L+G+S E    +Q 
Sbjct: 369 SGCIPVLFWNQSAYWQYELYLPRDPEEYSVFIPHQSVKNGTNVLDVLQGVSRERIGRMQR 428

Query: 290 YVLKV 294
            VL++
Sbjct: 429 AVLRI 433


>gi|123474050|ref|XP_001320210.1| Exostosin family protein [Trichomonas vaginalis G3]
 gi|121903010|gb|EAY07987.1| Exostosin family protein [Trichomonas vaginalis G3]
          Length = 353

 Score = 50.8 bits (120), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 62/269 (23%), Positives = 109/269 (40%), Gaps = 49/269 (18%)

Query: 41  KAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYP---YW 97
           K+  +  F  KDP +ADLF++P   A           G+ D+ +     I Q      Y+
Sbjct: 37  KSLERYEFRVKDPEEADLFYVPLFAALF--------NGLKDYANIDTIIIPQLRAFGKYF 88

Query: 98  NRTGGADHFYV-ACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH---KDVSLP-- 151
           +R GG D+ ++    S     +    +  L ++  +   +Y  S +      ++V+ P  
Sbjct: 89  DRYGGVDYAFIQMLFSQDNIPITVHQQKTLASMITLGDLNYNYSKYQMRESWRNVNFPLT 148

Query: 152 -QIWPRQEDPPKLGSSKRNKLAFFAGAVN--------SPVREKLLQVWRNDSEIYAHSGR 202
             I  + E  P+   S R+   FF G +N        +P+RE +  V R    +  HS  
Sbjct: 149 SNIAQQFEIKPE---SSRHISTFFIGQINLTDFDTVAAPIREGMANVMR----VIPHSIV 201

Query: 203 LKTPYADELLG------------SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLP 250
           +     D + G            SKFC    G    T R+ D+    C+P+++++    P
Sbjct: 202 IDARRYDPITGVYSYNFSRMMSNSKFCCVPHGDGPTTKRLFDTFRTLCIPIVLSDEIKFP 261

Query: 251 FADV-LNWKSFSIVVATLD---IPLLKKI 275
           F D+ +N+    I +   +   IP+   I
Sbjct: 262 FEDLFINYPEILIQIPAFEPDRIPIAMSI 290


>gi|302838103|ref|XP_002950610.1| hypothetical protein VOLCADRAFT_104742 [Volvox carteri f.
           nagariensis]
 gi|300264159|gb|EFJ48356.1| hypothetical protein VOLCADRAFT_104742 [Volvox carteri f.
           nagariensis]
          Length = 1119

 Score = 50.8 bits (120), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 30/92 (32%), Positives = 49/92 (53%), Gaps = 1/92 (1%)

Query: 209 DELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
           D +  SKFCL   G      R+A+++  GCVPV+I +H   P  DV+ ++ FS+  +  D
Sbjct: 587 DLMARSKFCLAPMGAGWGI-RLAEAMVRGCVPVVIQDHVYQPLWDVVPFEEFSLRFSRRD 645

Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
           +  L   L  ++SE+   LQ  V +  +  +W
Sbjct: 646 VADLVDHLDDVTSEQLARLQGGVERYHRWEEW 677


>gi|195431325|ref|XP_002063693.1| GK15814 [Drosophila willistoni]
 gi|194159778|gb|EDW74679.1| GK15814 [Drosophila willistoni]
          Length = 728

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/102 (27%), Positives = 53/102 (51%), Gaps = 9/102 (8%)

Query: 199 HSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWK 258
           H  RL  P+   L  SKFCL  +   +    + + +   C+PVI  ++Y LPF DV++W 
Sbjct: 313 HQKRLDYPHL--LAKSKFCLVARSLRLGQPDLLEIMSQNCIPVIAIDNYILPFEDVVDWS 370

Query: 259 SFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
             S+ +   ++  + + L+ IS+ +       +++++K  QW
Sbjct: 371 LASVRIRESELHSVLRKLESISNVK-------IVEMQKQVQW 405


>gi|302774815|ref|XP_002970824.1| galactosyltransferase-like protein [Selaginella moellendorffii]
 gi|300161535|gb|EFJ28150.1| galactosyltransferase-like protein [Selaginella moellendorffii]
          Length = 454

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 60/248 (24%), Positives = 100/248 (40%), Gaps = 36/248 (14%)

Query: 31  GNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP--DFISHYIF 88
           G +A E    + F +   +T DP  A+LF++P+  A +   + + T+ +   D +   + 
Sbjct: 125 GQFALEIMIHERFRRYQCLTDDPHLANLFYIPY-YAGLDVSQYLFTKQVQMRDKLGQRLL 183

Query: 89  NISQKYPYWNRTGGADHFYVACHSI---GRSAMQ-KAWEVKLNAIQVVCSSSYFISGHIA 144
              Q   +WNR  G DH  V    +   GRS    ++W   L +IQ + +++  +     
Sbjct: 184 GYLQGNRHWNRKRGRDHVLVLGRIVWDFGRSEENHESWGSSLLSIQELDNATKLLIERDV 243

Query: 145 HKDVSLPQIWP-------RQEDPPKLG---SSKRNKLAFFAGAV------NSPVR----- 183
            +   +   +P       RQE    L     S R+ L  FAGA+       + +R     
Sbjct: 244 WRSSQMALPYPTGFHPDSRQEIDEWLAVVNGSSRDLLVSFAGALRDGNGSTATMRRSLRR 303

Query: 184 -----EKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGC 238
                E+L  + R +      +  + T  A   L S FCL   G         D L  GC
Sbjct: 304 QCQRHERLCTILRCERINCEENPEIVTCVA---LRSVFCLMPPGDSPTRKAFFDGLVAGC 360

Query: 239 VPVIIANH 246
           +PV+ + H
Sbjct: 361 IPVVFSEH 368


>gi|340377881|ref|XP_003387457.1| PREDICTED: exostosin-1b-like [Amphimedon queenslandica]
          Length = 729

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 48/228 (21%), Positives = 88/228 (38%), Gaps = 40/228 (17%)

Query: 131 VVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSS----------KRNKLAFFAG---- 176
           ++  +S+ +  +    D+SLP I     +  K              +R  L  F G    
Sbjct: 195 IIAKASFALENYRTGFDISLPLIHKLHSEKGKFTGGVSAHGNLFPIRRKYLLIFKGKRYL 254

Query: 177 -AVNSPVREKLLQVWRNDSEIYAHS---GRLKTPYADE-----------------LLGSK 215
             + S  R  +  +   D  I   +   G+  + Y DE                 +  S 
Sbjct: 255 WGLGSATRNAIYHLDNGDDVIMLTTCKHGKFWSRYRDEKCKKDNIFFEKYNYNHLISNST 314

Query: 216 FCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKI 275
           FCL  +G  + + R  +SL+ GC+PV ++N   LPF ++++W     V     +  +  +
Sbjct: 315 FCLVPRGRRLGSFRFLESLHAGCIPVSLSNGLVLPFHELIDWSKALFVFDERQLFQVPHM 374

Query: 276 LKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSS 323
           L+ I  ++ L      +++   F W  + S  DA      E+  +R S
Sbjct: 375 LRHIPEDKIL-----SMRLHTQFYWEQYFSSVDAIMHTTLEIVKQRIS 417


>gi|299472640|emb|CBN78292.1| Glycosyltransferase, family GT47 [Ectocarpus siliculosus]
          Length = 587

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/90 (27%), Positives = 43/90 (47%)

Query: 205 TPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV 264
           +P   +L+ + F L   G    T R+A++L  G +PV I   +  PF   + W  FS   
Sbjct: 470 SPSYGDLMNTTFALLPAGRSPATYRLAEALSAGALPVFIHQDFVKPFPGKIPWSDFSFSF 529

Query: 265 ATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
              ++P + K L+ +   E   +Q+  L+V
Sbjct: 530 PPEEVPRMLKTLRAVPDRELAQMQATALEV 559


>gi|168017355|ref|XP_001761213.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162687553|gb|EDQ73935.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 442

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 74/337 (21%), Positives = 132/337 (39%), Gaps = 66/337 (19%)

Query: 11  PHRRNDPFANVLLPV------DFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPF- 63
           PH +N  F  V+         D+     Y  E  F +        T DP++ADLFF+P+ 
Sbjct: 71  PHHKNHGFGAVVNATVEVFRQDWYGTDAYMLEVIFYERMQTYSCRTSDPAEADLFFIPYF 130

Query: 64  ----SIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQ 119
               ++  +  D +   +   + +     N  +    W R GG DHFY+A    GR+   
Sbjct: 131 AGLDALPYLYTDSKRELQQGREVVEWLEENAPKT---WRRHGGHDHFYIA----GRT--- 180

Query: 120 KAWEVKLNAIQVV-CSSSYFISGHIAHKDVSLPQIWPRQED----PPKLG---------- 164
            AW+      +V    +S F +  + +    + +  P ++D    P  +G          
Sbjct: 181 -AWDFCRPLTKVNWWGTSLFNNPEMENTTAMVLERRPWRDDEVAIPYPVGFHPSTSATLH 239

Query: 165 -------SSKRNKLAFFAGAVNSPVREKLLQVW-RNDSEIYAHSGRLKT----------P 206
                  SS R  L  F+GA+   +   + ++  R  SE      RL            P
Sbjct: 240 SWIEVVRSSPRKHLFSFSGALRPHLTISIREILSRQCSEAGNACSRLDCGKIKCSHEPEP 299

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV-----LNWKSFS 261
               LL + FCL  +G       + DS+  GC+PV    H D  +         ++++FS
Sbjct: 300 IYTSLLQATFCLQPRGDTSTRRSVIDSIVSGCIPVFF--HEDTAYTQYHWFLPKDYENFS 357

Query: 262 IVVATLDI----PLLKKILKGISSEEYLLLQSYVLKV 294
           + +   D+      + KIL   ++++   ++  ++K+
Sbjct: 358 VFIDEKDMKDGNADVSKILGAYTAKQVEQIRERLIKI 394


>gi|194755878|ref|XP_001960206.1| GF13249 [Drosophila ananassae]
 gi|190621504|gb|EDV37028.1| GF13249 [Drosophila ananassae]
          Length = 765

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 76/328 (23%), Positives = 130/328 (39%), Gaps = 63/328 (19%)

Query: 5   FRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFLP 62
           F VYVYP            P++       +S +Y K   A  +S + T DPS A LF L 
Sbjct: 105 FLVYVYPPE----------PLNSLGAAPPSSANYQKILTAIQESRYHTNDPSAACLFVLG 154

Query: 63  FSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAW 122
                   DR   +E   D++ +    ++ + P+WN   G +H     +S G        
Sbjct: 155 IDTL----DRDSLSE---DYVRNVPSRLA-RLPHWN--NGRNHIIFNLYS-GTWPDYAEN 203

Query: 123 EVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSS-----------KRNK 170
            +  +A + + + +      + H  DVS+P     ++ P + G++           K+  
Sbjct: 204 SLGFDAGEAILAKASMSVLQVRHGFDVSIPLF--HKQFPLRAGATGSVQSNNFPANKKYL 261

Query: 171 LAF----FAGAVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE--------- 210
           LAF    +   + S  R  L  +      +   + R       L+    DE         
Sbjct: 262 LAFKGKRYVHGIGSETRNSLFHLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYDRYD 321

Query: 211 ----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
               L  S FCL  +G  + + R  ++L  GC+PV+++N + LPF   ++WK  +I    
Sbjct: 322 YETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWADE 381

Query: 267 LDIPLLKKILKGISSEEYLLL--QSYVL 292
             +  +   ++ IS E    L  Q+ VL
Sbjct: 382 RLLLQVPDTVRSISVERIFALRQQTQVL 409


>gi|363742350|ref|XP_003642625.1| PREDICTED: exostosin-1b-like, partial [Gallus gallus]
          Length = 456

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/114 (22%), Positives = 54/114 (47%), Gaps = 5/114 (4%)

Query: 204 KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIV 263
           K  Y + L  S FC+  +G  + + R  ++L   C+PV++++ ++LPF++ ++W   ++V
Sbjct: 37  KFDYQELLHNSTFCIVPRGRRLGSFRFLEALQAACIPVLLSDGWELPFSEAIDWGKAAVV 96

Query: 264 VATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
            +   +  +   ++ I  E  L  Q      +  F W  + S  D       E+
Sbjct: 97  GSERLLLQIPSAVRCIRPERVLAFQQ-----QTQFLWDAYFSSVDKIVHTTLEI 145


>gi|302769322|ref|XP_002968080.1| galactosyltransferase-like protein [Selaginella moellendorffii]
 gi|300163724|gb|EFJ30334.1| galactosyltransferase-like protein [Selaginella moellendorffii]
          Length = 494

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 54/237 (22%), Positives = 93/237 (39%), Gaps = 28/237 (11%)

Query: 31  GNYASESYFKKAFMKSHFVTKDPSKADLFFLPF--SIARMRHDRRIGTEGIPDFISHYIF 88
             YA +  F +  +    +T  P +A LF++P+   +  +R+     T    + +   + 
Sbjct: 171 NQYALDVLFHQRLLHYPCLTDSPEEASLFYVPYYAGLDVLRYHYTNETLEQKNELGLEVM 230

Query: 89  NISQKYPYWNRTGGADHFYVACHSIG--RSAMQKAWEVKL-------NAIQVVCSSSYFI 139
           ++ ++  +W R  G DH  V        R   +  W   L       N  +++     F 
Sbjct: 231 DLLKRQQWWWRRNGRDHLLVMGKITWDFRRNNETMWGNTLLKMAEFENMTKLLLERDPFE 290

Query: 140 SGHIAHKDVSLPQIWPRQEDP------PKLGSSKRNKLAFFAGAVNSP--VREKLLQVWR 191
              IA   V  P  +    D        ++ SS R+ L  FAG    P  +R  L+   +
Sbjct: 291 PNEIA---VPHPTYFHPSSDSDISTWISRIASSSRDNLVSFAGMPRDPEHLRTHLINQCK 347

Query: 192 NDSE---IYAHSGRL---KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVI 242
           +  +   + A SG L     P  +  L S+FC+   G       + DSL  GC+PV+
Sbjct: 348 DRPDRCKLLACSGNLCDSPEPTMELFLSSQFCMQPPGDSATRRSVFDSLIAGCIPVL 404


>gi|326429779|gb|EGD75349.1| hypothetical protein PTSG_06425 [Salpingoeca sp. ATCC 50818]
          Length = 401

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 23/65 (35%), Positives = 40/65 (61%)

Query: 234 LYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK 293
           +  G +PVI+ +HY LP+ D+L+W++FSI +    +  L +IL+ I  E   ++Q  V+ 
Sbjct: 195 MAAGAIPVIVVDHYVLPYQDLLDWETFSIRIPEHRLLELPRILRSIPDEVVEMMQRRVVF 254

Query: 294 VRKHF 298
           V + F
Sbjct: 255 VFEEF 259


>gi|388510202|gb|AFK43167.1| unknown [Medicago truncatula]
          Length = 216

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 42/83 (50%), Gaps = 1/83 (1%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
           +A+E +  +  + S   T +P +AD F+ P ++   +  +        P  +   I  IS
Sbjct: 77  FAAEIFMHRFLLSSAVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIS 136

Query: 92  QKYPYWNRTGGADHFYVACHSIG 114
             +PYWNRT GADHF+V  H  G
Sbjct: 137 SNWPYWNRTEGADHFFVVPHDFG 159


>gi|159478873|ref|XP_001697525.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158274404|gb|EDP00187.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 597

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/92 (29%), Positives = 49/92 (53%), Gaps = 1/92 (1%)

Query: 203 LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
           +K  Y+D L  +KFCL   G +  +AR+ D++ +GC+PVIIA+     F  +L+   F +
Sbjct: 500 VKGEYSDMLSRAKFCLVAPG-DGWSARMEDAVLHGCIPVIIADGVHAVFESILDIDGFGL 558

Query: 263 VVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
            +    +P +  IL  +        Q+++ +V
Sbjct: 559 RIPQEQVPRILDILLAVPPRAIRSKQAHLGRV 590



 Score = 37.7 bits (86), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 25/95 (26%), Positives = 39/95 (41%), Gaps = 13/95 (13%)

Query: 33  YASESYFKKAFMKSHFVTKDPSKADLFF-------LPFSIARMRHDRRIGTEGIP----- 80
           YA ++   ++ + S   T DP +AD F+       LPF I            G P     
Sbjct: 252 YAMDTLLHESLLGSPHRTFDPEEADFFYVPHQASCLPFPIGAWADFPWFPDAGGPRTRQM 311

Query: 81  -DFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
            + +   +  I+  +P+W R GG DH +   H  G
Sbjct: 312 LNLVIDTVQWINATFPFWQRRGGRDHIFTFTHDEG 346


>gi|194755024|ref|XP_001959792.1| GF13049 [Drosophila ananassae]
 gi|190621090|gb|EDV36614.1| GF13049 [Drosophila ananassae]
          Length = 717

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 68/324 (20%), Positives = 127/324 (39%), Gaps = 65/324 (20%)

Query: 5   FRVYVYP-----HRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKAD 57
            +VY+YP       R+D  A+ L            S  YF+  +A +KS + T +P++A 
Sbjct: 108 LKVYIYPLQDFVDERSDNVASTL------------SSEYFQILEAVLKSRYYTSNPNEAC 155

Query: 58  LFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSA 117
           LF     +               +F  H          +W+R  GA+H  +  + +  +A
Sbjct: 156 LFLPSLDLLNQN-----------NFDKHLAGAALASLDFWDR--GANH--IIFNMLPGAA 200

Query: 118 MQKAWEVKLNAIQVVCSSSYFIS-GHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAG 176
                 + +N    +     F S  +    DVS+P +W  +  P    ++ + K      
Sbjct: 201 PSYNTVLDVNTDNAIIFGGGFDSWSYRPGFDVSIP-VWSPRLVPQHAHATAQRKFLLLVA 259

Query: 177 AVN------SPVREK---------LLQVWRN-----DSEIYAHSGRLKTPYADELLGSKF 216
            +N        +RE          LL    N        +  H   L+ P    L   KF
Sbjct: 260 QLNILPRFVRTMREVALAHNDQMLLLGACENMDLTSRCPVSQHHKSLEYPRL--LSRGKF 317

Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKIL 276
           CL  K   +    + + +   C+PV+  ++Y +PF DV++W   S+ +   ++  + + L
Sbjct: 318 CLITKSLRLGQPDLVEIMSQHCIPVVAVDNYIMPFEDVIDWSLASVRIRESELHSVMQKL 377

Query: 277 KGISSEEYLLLQSYVLKVRKHFQW 300
           + IS+ +       +++++K  QW
Sbjct: 378 QAISNIK-------IVEMQKQVQW 394


>gi|326435600|gb|EGD81170.1| hypothetical protein PTSG_11210 [Salpingoeca sp. ATCC 50818]
          Length = 107

 Score = 49.7 bits (117), Expect = 0.002,   Method: Composition-based stats.
 Identities = 23/62 (37%), Positives = 39/62 (62%)

Query: 237 GCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRK 296
           G +PVI+ +HY LP+ D+L+W++FSI +    +  L +IL+ I  E   ++Q  V+ V +
Sbjct: 4   GAIPVIVVDHYVLPYQDLLDWETFSIRIPEHRLLELPRILRSIPDEVVEMMQRRVVFVFE 63

Query: 297 HF 298
            F
Sbjct: 64  EF 65


>gi|47203855|emb|CAG13860.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 72

 Score = 49.7 bits (117), Expect = 0.002,   Method: Composition-based stats.
 Identities = 18/57 (31%), Positives = 38/57 (66%)

Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIV 263
           Y + L  S FCL  +G  + + R  ++L   CVPV+++N ++LPF+++++W + +++
Sbjct: 4   YKEMLHNSTFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEIIDWNTAAVI 60


>gi|326521044|dbj|BAJ92885.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 289

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 64/270 (23%), Positives = 103/270 (38%), Gaps = 72/270 (26%)

Query: 41  KAFMKSHFVTKDPSKADLFFLPFSIA------------RMRHDRRIGTEGIPDFISHYIF 88
           ++ + S   T +  +AD F++P   +            RM  D R+ +    ++      
Sbjct: 15  ESILASPHRTLNGDEADYFYVPVLDSCLITRSDDAPHLRMPEDLRLRSYHTLEYYRKAYD 74

Query: 89  NISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQVVCSSSYFI 139
           +I+Q+YPYWNRT G DH +       AC++   I  S M   W    N      +++Y+ 
Sbjct: 75  HIAQRYPYWNRTSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWG-NTNTKHEKSTTAYWA 133

Query: 140 SG--------------HIAHKDVSLP--------QIWPRQEDPPKLGSSKRNKLAFFAGA 177
                               KD+ LP         IW +    PK+    R  L +F G 
Sbjct: 134 DNWDDIPFDRRGNHPCFDPRKDLVLPAWKEPNPGAIWLKLWARPKI---NRTTLFYFNGN 190

Query: 178 VNSP-------------VREKLLQVWRNDSEIYAHSGRLKTP-----------YADELLG 213
           +                +R+KL   + +        GR +T            Y +EL  
Sbjct: 191 LGPAYEEGRPEDTYSMGIRQKLAAEFGSTPNKQGKLGRQQTANVTVTYLKSEMYYEELAS 250

Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVII 243
           S FC  + G +  + R+ DS+  GC+PVII
Sbjct: 251 SIFCGVLPG-DGWSGRMEDSMLQGCIPVII 279


>gi|159473731|ref|XP_001694987.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158276366|gb|EDP02139.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 721

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 29/92 (31%), Positives = 49/92 (53%), Gaps = 1/92 (1%)

Query: 209 DELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
           D +  +KFCL   G      R+A+++  GCVPVII +H      D+L ++ FSI +   +
Sbjct: 586 DSMKLAKFCLAPMGAGWGI-RLAEAMVSGCVPVIIQDHIYQAHWDILPFEEFSIRIGRNE 644

Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
           +  L  IL  +S ++   LQ+ + +  + F W
Sbjct: 645 LHQLVDILDDVSPQQLDSLQAGIERYHRAFFW 676


>gi|255642058|gb|ACU21295.1| unknown [Glycine max]
          Length = 340

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 71/280 (25%), Positives = 115/280 (41%), Gaps = 43/280 (15%)

Query: 4   SFRVYVYPHRRNDPFANVLLPVDFEP------RGNYASESYFKKAFMKSHFVTKDPSKAD 57
           S ++YVY     D    +L   D +       +G + S+    K  ++S   T    +AD
Sbjct: 66  SLKIYVYQENEIDGLKELLRGRDAKITDEACLKGQWGSQVKIHKLLLQSKQRTWKKEEAD 125

Query: 58  LFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSA 117
           LFF+P   + ++  R +G     +  S Y+  ISQ  PY+  +GG +H +V     G + 
Sbjct: 126 LFFVP---SYVKCARMMGGLNDKEINSTYVKVISQ-MPYFRLSGGRNHIFVFPSGAG-AH 180

Query: 118 MQKAWEVKLNAIQVVC---------SSSYF-------ISGHIAHKDVSLPQIWPRQEDPP 161
           + K+W   +N   ++           +S F       I G+I   D  + +       P 
Sbjct: 181 LFKSWATYINRSIILTPEGDRTDKRDTSAFNTWKDIIIPGNI---DDGMTKTGDTTVQPL 237

Query: 162 KLGSSKRNKLAFFAG-AVNSPVREKLLQVWRN--------DSEIYAHSGRLKTPYADELL 212
            L  SKR  LA + G A     R KL+++ +         D +        +  Y + L 
Sbjct: 238 PL--SKRKYLANYLGRAQGKAGRLKLIELSKQFPEKLECPDLKFSGPDKLGRKEYFEHLR 295

Query: 213 GSKFCLHVKGFEVNTARIADSLYYG-CVPVII-ANHYDLP 250
            SKFCL  +G    T R  +S + G C   II +N   +P
Sbjct: 296 NSKFCLAPRGESSWTLRFYESFFCGMCSSYIIRSNRIAIP 335


>gi|326435785|gb|EGD81355.1| hypothetical protein PTSG_11833 [Salpingoeca sp. ATCC 50818]
          Length = 944

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 31/98 (31%), Positives = 50/98 (51%), Gaps = 9/98 (9%)

Query: 213 GSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLL 272
           G    +H+   +V+ A I   +  G +PVI+ +HY LP+ D+L+W +FSI +    +  L
Sbjct: 630 GPGSLVHLLKPDVDPAYI---MATGAIPVIVVDHYVLPYQDLLDWGTFSIRIPEHRLLEL 686

Query: 273 KKILKGISSEEYLLLQSYVLKVRKHF------QWHVFP 304
            +IL+ I  E   ++Q  V+ V   F      Q H  P
Sbjct: 687 PRILRSIPDEVVEMMQRRVVFVYGEFFKSLSTQVHTAP 724


>gi|302772274|ref|XP_002969555.1| hypothetical protein SELMODRAFT_451122 [Selaginella moellendorffii]
 gi|300163031|gb|EFJ29643.1| hypothetical protein SELMODRAFT_451122 [Selaginella moellendorffii]
          Length = 453

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 58/245 (23%), Positives = 98/245 (40%), Gaps = 30/245 (12%)

Query: 31  GNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP--DFISHYIF 88
           G +A E    + F +   +T DP  A+LF++P+  A +   + + T+ +   D +   + 
Sbjct: 129 GQFAVEIMIHERFRRYQCLTDDPHLANLFYIPY-YAGLDVSQYLFTKQVQMRDKLGQRLL 187

Query: 89  NISQKYPYWNRTGGADHFYVACHSI---GRSAMQ-KAWEVKLNAIQVVCSSSYFISGHIA 144
              Q   +WNR  G DH  V    +   GRS    ++W   L +IQ + +++  +     
Sbjct: 188 GYLQGNRHWNRKRGRDHVLVLGRIVWDFGRSEENHESWGSSLLSIQELDNATKLLIERDV 247

Query: 145 HKDVSLPQIWPRQEDPPK----------LGSSKRNKLAFFAGAV------NSPVREKL-L 187
            +   +   +P    P            +  S R+ L  FAGA+       + +R  L  
Sbjct: 248 WRSSQMALPYPTGFHPDSRREIDEWLAVVNGSSRDLLVSFAGALRDGNGSTATMRRSLRR 307

Query: 188 QVWRNDSEI-YAHSGRLKTPYADEL-----LGSKFCLHVKGFEVNTARIADSLYYGCVPV 241
           Q  R++S        R+      E+     L S FCL   G         D L  GC+PV
Sbjct: 308 QCQRHESLCTILRCERINCEENPEIVTCVALRSIFCLMPPGDSPTRKGFFDGLVAGCIPV 367

Query: 242 IIANH 246
           + + H
Sbjct: 368 VFSEH 372


>gi|19699001|gb|AAL91236.1| unknown protein [Arabidopsis thaliana]
 gi|30023662|gb|AAP13364.1| At2g28110 [Arabidopsis thaliana]
          Length = 214

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/90 (33%), Positives = 45/90 (50%), Gaps = 12/90 (13%)

Query: 33  YASESYFKKAFM--KSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFIS 84
           +A+E    KAF+  +    T+DP +AD FF+P  ++           G P        I+
Sbjct: 120 FAAEVALHKAFLSLEGDVRTEDPYEADFFFVPVYVSC----NFSTINGFPAIGHARSLIN 175

Query: 85  HYIFNISQKYPYWNRTGGADHFYVACHSIG 114
             I  +S +YP+WNRT G+DH + A H  G
Sbjct: 176 DAIKLVSTQYPFWNRTSGSDHVFTATHDFG 205


>gi|260813939|ref|XP_002601673.1| hypothetical protein BRAFLDRAFT_94550 [Branchiostoma floridae]
 gi|229286975|gb|EEN57685.1| hypothetical protein BRAFLDRAFT_94550 [Branchiostoma floridae]
          Length = 786

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 86/367 (23%), Positives = 142/367 (38%), Gaps = 68/367 (18%)

Query: 25  VDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFIS 84
           V F     ++ E    +  + S F T DP KAD+F++P+  A       + T   P    
Sbjct: 415 VSFRSTHMFSLEVILHQKLLSSTFRTLDPEKADVFYIPYYPALAAACEPVSTIDSPALDR 474

Query: 85  HYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSY-FISGHI 143
                I+  YPY+ +  G  H      ++GR   + A +V    ++   S S  F++  I
Sbjct: 475 ELWQFITSNYPYFQQ--GKPHMM----ALGRIEREHA-DVTGGILKTRESRSVTFVA--I 525

Query: 144 AHKDVSLPQIWPRQEDPPKL-----------------GSSK---------RNKLAFFAGA 177
            H+       + R+   P +                 G SK         R+ L  FAG+
Sbjct: 526 EHESDPKTLKFIRRSGLPMVVAPYPSCGHLLSDNKFGGESKSERTQLDIPRDVLVLFAGS 585

Query: 178 --VNSPVREKLLQVWRNDSEIYAHSGRLKT-------------PYADELLG----SKFCL 218
             ++  +R  L Q  R  SE Y  +  L                + + L+     S FCL
Sbjct: 586 RRMSHDIRRILSQQLRPTSEKYDATSSLNKQNVWFITQECRDRSWQENLVEWMHHSVFCL 645

Query: 219 HVKGFEVNTARIADSLYYGCVPVIIANHYD--LPFADVLNWKSFSIVVATLDIPLLKK-- 274
              G         D++  GC+PVI    ++   PF DVL++  F++ V   D    K+  
Sbjct: 646 QPPGDSPTRKSFFDAVQCGCIPVIFKLDHEPVYPFDDVLDYSKFTVKVTDGDFFQEKRSI 705

Query: 275 --ILKGISSEEYLLLQSYVLKVRKHFQWHVFPSD----YDAFYMVMYELWLR---RSSVR 325
             IL+ I        ++ + +V    Q+   P       DAF M+M E+      RS+VR
Sbjct: 706 VDILQDIPEAVIAAKRAELRQVTPLLQYSYPPLPETHVQDAFDMIMQEIGRTRGGRSNVR 765

Query: 326 VQWSTSL 332
            ++ + +
Sbjct: 766 RRYGSKV 772


>gi|224131234|ref|XP_002328488.1| predicted protein [Populus trichocarpa]
 gi|222838203|gb|EEE76568.1| predicted protein [Populus trichocarpa]
          Length = 90

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 24/69 (34%), Positives = 41/69 (59%)

Query: 213 GSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLL 272
            SKFCL +     ++ R+ D++   CVPVII++  + P+ DV+++  F I V T ++   
Sbjct: 3   SSKFCLDIASDTPSSNRLIDAIASHCVPVIISDDIEFPYEDVIDYSQFCISVRTSNVVRE 62

Query: 273 KKILKGISS 281
           K ++  ISS
Sbjct: 63  KFLVNLISS 71


>gi|57044145|ref|XP_544489.1| PREDICTED: exostoses (multiple)-like 1 [Canis lupus familiaris]
          Length = 672

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 29/125 (23%), Positives = 56/125 (44%), Gaps = 6/125 (4%)

Query: 193 DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFA 252
           D     + G  +TP    L  + FC+ + G          +L  GC+PV+++  ++LPF+
Sbjct: 249 DGRCEQNRGPEQTPLGATLPNATFCV-IPGHRPEALHFLQALQAGCIPVLLSPRWELPFS 307

Query: 253 DVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYM 312
           +V++W   +IV    D  L  ++L  +  +E    +   L+ +  F W  + S  +    
Sbjct: 308 EVIDWTKAAIVA---DKRLPFQVLAAL--QEMPPTRVLALRQQTQFLWDAYFSSVEKVIH 362

Query: 313 VMYEL 317
              E+
Sbjct: 363 TTLEI 367


>gi|297814922|ref|XP_002875344.1| hypothetical protein ARALYDRAFT_904882 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321182|gb|EFH51603.1| hypothetical protein ARALYDRAFT_904882 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 64

 Score = 49.3 bits (116), Expect = 0.003,   Method: Composition-based stats.
 Identities = 24/62 (38%), Positives = 38/62 (61%)

Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLK 273
           S FCL   G+ + + R+ +S   GCVPV+IAN   LPF++++ W    +++A  D   L+
Sbjct: 3   SIFCLCPFGWGIWSPRLVESAVSGCVPVVIANGIQLPFSEIVRWPEILLMMAKKDDMNLQ 62

Query: 274 KI 275
           KI
Sbjct: 63  KI 64


>gi|291399546|ref|XP_002716175.1| PREDICTED: exostoses-like 1 [Oryctolagus cuniculus]
          Length = 671

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 29/118 (24%), Positives = 57/118 (48%), Gaps = 7/118 (5%)

Query: 200 SGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS 259
            G L  P  + L  + FCL + G   +  R   +L  GC+PV+++  ++LPF++ ++W  
Sbjct: 253 GGELSQP-TETLPNATFCL-IPGHRSDAWRFLQALQAGCIPVLLSPLWELPFSERIDWTK 310

Query: 260 FSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
            +I+    D    +++L  +  +E  L +   L+ +  F W  + S  +     M E+
Sbjct: 311 ATIIA---DERQPRQVLAAL--QEMPLARVLSLRQQTQFLWAAYFSSVEKVIHTMLEI 363


>gi|440798055|gb|ELR19126.1| hypothetical protein ACA1_335890 [Acanthamoeba castellanii str.
           Neff]
          Length = 350

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 36/138 (26%), Positives = 64/138 (46%), Gaps = 18/138 (13%)

Query: 162 KLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVK 221
           K G+++R++L          VR  L    R DS            Y D +L +KF L V+
Sbjct: 204 KGGNNRRHRL-------RPKVRAGLDDSSRGDS----------GDYHDLMLNTKFALIVQ 246

Query: 222 GFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISS 281
           G  +++ R+ +++    VPVI+A++Y LPF++ + W   +I V       +  ++  I  
Sbjct: 247 GNGLHSYRLTEAMRANAVPVILADNYVLPFSEAVRWDEIAIFVPESQWASIPDVIGRIDD 306

Query: 282 EEYLLLQSYVLKV-RKHF 298
           E    ++  +  V   HF
Sbjct: 307 EALARMREKLATVYEAHF 324


>gi|302855272|ref|XP_002959133.1| hypothetical protein VOLCADRAFT_121779 [Volvox carteri f.
           nagariensis]
 gi|300255495|gb|EFJ39797.1| hypothetical protein VOLCADRAFT_121779 [Volvox carteri f.
           nagariensis]
          Length = 771

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 55/274 (20%), Positives = 98/274 (35%), Gaps = 60/274 (21%)

Query: 52  DPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
           DP++AD FF+P               G   +   + F ++    YW +  G DH      
Sbjct: 302 DPARADYFFIPGC-----------GRGCDKWDDKFHFILAHYGQYWTQNQGRDHIMTHPG 350

Query: 112 SIGRS---------------AMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLP----- 151
             GR                 M + W + ++    V   + F + H  ++D+ +P     
Sbjct: 351 DWGRCEHSWDAFADKFISNVTMLQHWGMTVDRSSEV-EHNLFNTCHKPNQDILVPPMCGD 409

Query: 152 -------QIW--PRQEDPPKLGSSKRNKLAFFAGAV-------NSPVREKLLQV------ 189
                   IW   R+E+P       +  LA  AG++         P R +   +      
Sbjct: 410 LYPQFEYNIWHPNRKENP-----ITKTNLASVAGSICGWNSVEEPPCRNRYYSLGVRAAL 464

Query: 190 WRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
           W+       H  +        +  S+FC    G       +  +   GC+PVII++H   
Sbjct: 465 WQLRDVPGFHIAKRVAMMGQSMAESEFCFAPTGAGYGKRNVMATTL-GCMPVIISDHVAQ 523

Query: 250 PFADVLNWKSFSIVVATLDIPLLKKILKGISSEE 283
           P+   LNW  F + +       ++ IL+G + ++
Sbjct: 524 PYEPFLNWNEFGVWIPESQAKDVEIILRGFTPQQ 557


>gi|326434432|gb|EGD80002.1| hypothetical protein PTSG_10278 [Salpingoeca sp. ATCC 50818]
          Length = 583

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 21/59 (35%), Positives = 37/59 (62%)

Query: 234 LYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVL 292
           +  G +PVI+ +HY LP+ D+L+W++FSI +    +  L +IL+ I  E   ++Q  V+
Sbjct: 201 MAAGTIPVIVVDHYVLPYQDLLDWETFSIRIPEHRLLELPRILRSIPDEVVEMMQRRVV 259


>gi|302760669|ref|XP_002963757.1| galactosyltransferase-like protein [Selaginella moellendorffii]
 gi|300169025|gb|EFJ35628.1| galactosyltransferase-like protein [Selaginella moellendorffii]
          Length = 411

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 66/284 (23%), Positives = 108/284 (38%), Gaps = 43/284 (15%)

Query: 32  NYASESYFKKAFMKSHFVTKDPSKADLFFLPF--SIARMRHDRRIGTEGIPDFISHYIFN 89
            ++ E  F    ++   VT D   A++F++PF   +   R+  R       D + H +  
Sbjct: 77  QFSGEVIFHTRILRHPCVTNDSDSANVFYVPFYAGLDVSRYLWRPSKAEDRDHLGHKLVE 136

Query: 90  ISQKYPYWNRTGGADHFYVACHSIG------RSAMQKAWEVKL-------NAIQVVCSSS 136
                P W R  G DHF +    IG      R   + AW   L       N  ++   S+
Sbjct: 137 WLSTQPAWTRARGRDHFTM----IGRITWDFRRPEENAWGSGLLNMAEMKNMTRLAIESN 192

Query: 137 YFISGH--IAHKDVSLPQIWPRQEDPPKLGSSKRNKLAF-FAGAVNSPV----REKLLQV 189
            +  G   + +     PQ   + ++  +   +K   L F FAGA    +    R +LL  
Sbjct: 193 PWEGGEYGVPYPTSFHPQNEHQLQEWQEFVRNKERGLVFSFAGATRKRIPNDFRLELLAQ 252

Query: 190 WRND----SEIYAHSGRLKT--PYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVII 243
             +     S +     + +T  P     L S FCL  +G       I DS+  GC+PV  
Sbjct: 253 CSDSRGACSAMDCSDSKCETPEPVVQLFLNSTFCLQPRGDGYTRRSIFDSVLAGCIPVFF 312

Query: 244 ANHYD-------LPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
            N           P  D    +S+S+ +   D+    KI++ +S
Sbjct: 313 WNQSSYWQYKWFFPEED----ESYSVFIDREDVRKGTKIMEVLS 352


>gi|443712570|gb|ELU05824.1| hypothetical protein CAPTEDRAFT_92231 [Capitella teleta]
          Length = 668

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/107 (25%), Positives = 51/107 (47%), Gaps = 5/107 (4%)

Query: 211 LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIP 270
           L  S FCL  +G  + + R  ++L   C+PV ++N++ LPF++V++W   +I      + 
Sbjct: 253 LHNSTFCLVPRGRRLGSYRFLEALQAACIPVFLSNNWVLPFSEVIDWNQAAIWGDERLLL 312

Query: 271 LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
            +  I++ I   + L      L+ +  F W  + S  D       E+
Sbjct: 313 QIPSIVRSIRHADLL-----ALRQQTQFLWETYFSSIDKIVATTLEI 354


>gi|297282579|ref|XP_001108035.2| PREDICTED: exostoses (multiple)-like 1 isoform 1 [Macaca mulatta]
          Length = 675

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 32/126 (25%), Positives = 60/126 (47%), Gaps = 7/126 (5%)

Query: 193 DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTA-RIADSLYYGCVPVIIANHYDLPF 251
           D+      G  +T   + L  + FCL + G     A R   +L  GC+PV+++  ++LPF
Sbjct: 248 DARCEQDPGPGQTQRGETLPNATFCL-ISGHRPEAASRFLQALQAGCIPVLLSPRWELPF 306

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFY 311
           ++V++W   +I VA   +PL  ++L  +  +E    +   L+ +  F W  + S  +   
Sbjct: 307 SEVIDWTKAAI-VADERLPL--QVLAAL--QEMSPARVLALRQQTQFLWDAYFSSVEKVI 361

Query: 312 MVMYEL 317
               E+
Sbjct: 362 HTTLEV 367


>gi|355557695|gb|EHH14475.1| hypothetical protein EGK_00405 [Macaca mulatta]
          Length = 675

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 32/126 (25%), Positives = 60/126 (47%), Gaps = 7/126 (5%)

Query: 193 DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTA-RIADSLYYGCVPVIIANHYDLPF 251
           D+      G  +T   + L  + FCL + G     A R   +L  GC+PV+++  ++LPF
Sbjct: 248 DARCEQDPGPGQTQRGETLPNATFCL-ISGHRPEAASRFLQALQAGCIPVLLSPRWELPF 306

Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFY 311
           ++V++W   +I VA   +PL  ++L  +  +E    +   L+ +  F W  + S  +   
Sbjct: 307 SEVIDWTKAAI-VADERLPL--QVLAAL--QEMSPARVLALRQQTQFLWDAYFSSVEKVI 361

Query: 312 MVMYEL 317
               E+
Sbjct: 362 HTTLEV 367


>gi|195334813|ref|XP_002034071.1| GM20090 [Drosophila sechellia]
 gi|194126041|gb|EDW48084.1| GM20090 [Drosophila sechellia]
          Length = 717

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 69/325 (21%), Positives = 125/325 (38%), Gaps = 67/325 (20%)

Query: 5   FRVYVYP-----HRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKAD 57
            +VY+YP       ++D  A  L            S  YF+  +A +KS + T +P++A 
Sbjct: 108 LKVYIYPLQEFVDEQSDKTATTL------------SSEYFQILEAVLKSRYYTSNPNEAC 155

Query: 58  LFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSA 117
           LF     +                F  H   +      +W+R  GA+H        G  +
Sbjct: 156 LFLPSLDLLNQNV-----------FDKHLAGSALASLDFWDR--GANHLIFNMLPGGAPS 202

Query: 118 MQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGA 177
                +V  +   ++    +    +    DV++P   PR        +++R K       
Sbjct: 203 YNTVLDVNTDN-AIIFGGGFDSWSYRPGFDVAIPVWSPRLVRQHAHATAQR-KFLLVVAQ 260

Query: 178 VNSPVREKLLQVWRNDSEIYAHSGRL-------------KTP---------YADELLGSK 215
           +N  +  + ++  R  S   AHS +L             + P         Y   L   K
Sbjct: 261 LN--ILPRFVRTLRELS--LAHSEQLLLLGACENLDLTMRCPLSQHHKSLEYPRLLSRGK 316

Query: 216 FCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKI 275
           FCL  +   +    + + +   C+PVI  ++Y LPF DV++W   S+ V   ++  + + 
Sbjct: 317 FCLLGRSLRMGQPDLVEIMSQHCIPVIAVDNYVLPFEDVIDWSLASVRVRENELHSVMQK 376

Query: 276 LKGISSEEYLLLQSYVLKVRKHFQW 300
           LK ISS +       +++++K  QW
Sbjct: 377 LKAISSVK-------IVEMQKQVQW 394


>gi|195488335|ref|XP_002092270.1| GE14097 [Drosophila yakuba]
 gi|194178371|gb|EDW91982.1| GE14097 [Drosophila yakuba]
          Length = 717

 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 79/366 (21%), Positives = 142/366 (38%), Gaps = 73/366 (19%)

Query: 5   FRVYVYP-----HRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKAD 57
            +VY+YP       ++D  A  L            S  YF+  +A +KS + T +P++A 
Sbjct: 108 LKVYIYPLQEFVDEQSDKTATTL------------SSEYFQILEAVLKSRYYTSNPNEAC 155

Query: 58  LFFLPFSIARMR-HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRS 116
           LF     +      D+ +    +                +W+R  GA+H        G  
Sbjct: 156 LFLPSLDLLNQNVFDKHLAGAALASL------------DFWDR--GANHIIFNMLPGGAP 201

Query: 117 AMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFA- 175
           +     +V  +   ++    +    +    DV++P   PR        +++R  L   A 
Sbjct: 202 SYNTVLDVNTDN-AIIFGGGFDSWSYRPGFDVAIPVWSPRLVRQHAHATAQRKFLLVVAQ 260

Query: 176 -----------GAVNSPVREKLLQVWRNDS-------EIYAHSGRLKTPYADELLGSKFC 217
                        V+    E+LL +   +S        +  H   L+ P    L   KFC
Sbjct: 261 LNILPRFLRTLKEVSLANSEQLLLLGACESLDLTMRCPLSQHHKSLEYPRL--LSRGKFC 318

Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILK 277
           L  +   +    + + +   C+PVI  ++Y LPF DV++W   SI +   ++  + + LK
Sbjct: 319 LLGRSLRMGQPDLVEIMSQHCIPVIAVDNYVLPFEDVIDWSLASIRIRENELHSVMQKLK 378

Query: 278 GISSEEYLLLQSYVLKVRKHFQW--HVFPSDYDAFYMVMYELW------LRRSSVRVQWS 329
            ISS +       +++++K  QW    +  D     +   E+       LR  S R QW+
Sbjct: 379 AISSVK-------IVEMQKQVQWLFSKYFKDLKTVTLTALEVLESRIFPLRARSSR-QWN 430

Query: 330 TSLDSN 335
           T +D+N
Sbjct: 431 T-IDTN 435


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.324    0.138    0.439 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,569,800,435
Number of Sequences: 23463169
Number of extensions: 227970700
Number of successful extensions: 535364
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1092
Number of HSP's successfully gapped in prelim test: 152
Number of HSP's that attempted gapping in prelim test: 532326
Number of HSP's gapped (non-prelim): 1666
length of query: 336
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 193
effective length of database: 9,003,962,200
effective search space: 1737764704600
effective search space used: 1737764704600
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 77 (34.3 bits)